[2024-06-27 10:49:15,061][00794] Saving configuration to ./train_dir/sample_factory/p2.sf/config.json... [2024-06-27 10:49:15,127][00794] Rollout worker 0 uses device cpu [2024-06-27 10:49:15,128][00794] Rollout worker 1 uses device cpu [2024-06-27 10:49:15,128][00794] Rollout worker 2 uses device cpu [2024-06-27 10:49:15,128][00794] Rollout worker 3 uses device cpu [2024-06-27 10:49:15,129][00794] Rollout worker 4 uses device cpu [2024-06-27 10:49:15,129][00794] Rollout worker 5 uses device cpu [2024-06-27 10:49:15,130][00794] Rollout worker 6 uses device cpu [2024-06-27 10:49:15,130][00794] Rollout worker 7 uses device cpu [2024-06-27 10:49:15,130][00794] Rollout worker 8 uses device cpu [2024-06-27 10:49:15,131][00794] Rollout worker 9 uses device cpu [2024-06-27 10:49:15,131][00794] Rollout worker 10 uses device cpu [2024-06-27 10:49:15,132][00794] Rollout worker 11 uses device cpu [2024-06-27 10:49:15,132][00794] Rollout worker 12 uses device cpu [2024-06-27 10:49:15,133][00794] Rollout worker 13 uses device cpu [2024-06-27 10:49:15,133][00794] Rollout worker 14 uses device cpu [2024-06-27 10:49:15,133][00794] Rollout worker 15 uses device cpu [2024-06-27 10:49:15,133][00794] Rollout worker 16 uses device cpu [2024-06-27 10:49:15,134][00794] Rollout worker 17 uses device cpu [2024-06-27 10:49:15,134][00794] Rollout worker 18 uses device cpu [2024-06-27 10:49:15,134][00794] Rollout worker 19 uses device cpu [2024-06-27 10:49:15,134][00794] Rollout worker 20 uses device cpu [2024-06-27 10:49:15,134][00794] Rollout worker 21 uses device cpu [2024-06-27 10:49:15,135][00794] Rollout worker 22 uses device cpu [2024-06-27 10:49:15,135][00794] Rollout worker 23 uses device cpu [2024-06-27 10:49:15,135][00794] Rollout worker 24 uses device cpu [2024-06-27 10:49:15,135][00794] Rollout worker 25 uses device cpu [2024-06-27 10:49:15,136][00794] Rollout worker 26 uses device cpu [2024-06-27 10:49:15,136][00794] Rollout worker 27 uses device cpu [2024-06-27 10:49:15,136][00794] Rollout worker 28 uses device cpu [2024-06-27 10:49:15,136][00794] Rollout worker 29 uses device cpu [2024-06-27 10:49:15,136][00794] Rollout worker 30 uses device cpu [2024-06-27 10:49:15,137][00794] Rollout worker 31 uses device cpu [2024-06-27 10:49:15,707][00794] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2024-06-27 10:49:15,707][00794] InferenceWorker_p0-w0: min num requests: 10 [2024-06-27 10:49:15,750][00794] Starting all processes... [2024-06-27 10:49:15,750][00794] Starting process learner_proc0 [2024-06-27 10:49:16,027][00794] Starting all processes... [2024-06-27 10:49:16,030][00794] Starting process inference_proc0-0 [2024-06-27 10:49:16,031][00794] Starting process rollout_proc0 [2024-06-27 10:49:16,031][00794] Starting process rollout_proc1 [2024-06-27 10:49:16,032][00794] Starting process rollout_proc2 [2024-06-27 10:49:16,032][00794] Starting process rollout_proc3 [2024-06-27 10:49:16,034][00794] Starting process rollout_proc4 [2024-06-27 10:49:16,034][00794] Starting process rollout_proc5 [2024-06-27 10:49:16,034][00794] Starting process rollout_proc6 [2024-06-27 10:49:16,034][00794] Starting process rollout_proc7 [2024-06-27 10:49:16,034][00794] Starting process rollout_proc8 [2024-06-27 10:49:16,035][00794] Starting process rollout_proc9 [2024-06-27 10:49:16,036][00794] Starting process rollout_proc10 [2024-06-27 10:49:16,036][00794] Starting process rollout_proc11 [2024-06-27 10:49:16,037][00794] Starting process rollout_proc12 [2024-06-27 10:49:16,038][00794] Starting process rollout_proc13 [2024-06-27 10:49:16,038][00794] Starting process rollout_proc14 [2024-06-27 10:49:16,038][00794] Starting process rollout_proc15 [2024-06-27 10:49:16,040][00794] Starting process rollout_proc16 [2024-06-27 10:49:16,040][00794] Starting process rollout_proc17 [2024-06-27 10:49:16,041][00794] Starting process rollout_proc18 [2024-06-27 10:49:16,043][00794] Starting process rollout_proc19 [2024-06-27 10:49:16,043][00794] Starting process rollout_proc20 [2024-06-27 10:49:16,044][00794] Starting process rollout_proc21 [2024-06-27 10:49:16,044][00794] Starting process rollout_proc22 [2024-06-27 10:49:16,046][00794] Starting process rollout_proc23 [2024-06-27 10:49:16,048][00794] Starting process rollout_proc24 [2024-06-27 10:49:16,050][00794] Starting process rollout_proc25 [2024-06-27 10:49:16,050][00794] Starting process rollout_proc26 [2024-06-27 10:49:16,050][00794] Starting process rollout_proc27 [2024-06-27 10:49:16,052][00794] Starting process rollout_proc28 [2024-06-27 10:49:16,053][00794] Starting process rollout_proc29 [2024-06-27 10:49:16,057][00794] Starting process rollout_proc30 [2024-06-27 10:49:16,058][00794] Starting process rollout_proc31 [2024-06-27 10:49:18,172][01042] Worker 1 uses CPU cores [1] [2024-06-27 10:49:18,212][01045] Worker 10 uses CPU cores [10] [2024-06-27 10:49:18,240][01049] Worker 14 uses CPU cores [14] [2024-06-27 10:49:18,252][01043] Worker 7 uses CPU cores [7] [2024-06-27 10:49:18,296][01050] Worker 15 uses CPU cores [15] [2024-06-27 10:49:18,320][01039] Worker 4 uses CPU cores [4] [2024-06-27 10:49:18,324][01058] Worker 22 uses CPU cores [22] [2024-06-27 10:49:18,327][01035] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2024-06-27 10:49:18,328][01035] Set environment var CUDA_VISIBLE_DEVICES to '0' (GPU indices [0]) for inference process 0 [2024-06-27 10:49:18,328][01046] Worker 9 uses CPU cores [9] [2024-06-27 10:49:18,336][01035] Num visible devices: 1 [2024-06-27 10:49:18,360][01066] Worker 30 uses CPU cores [30] [2024-06-27 10:49:18,367][01052] Worker 16 uses CPU cores [16] [2024-06-27 10:49:18,368][01064] Worker 29 uses CPU cores [29] [2024-06-27 10:49:18,400][01063] Worker 27 uses CPU cores [27] [2024-06-27 10:49:18,408][01057] Worker 21 uses CPU cores [21] [2024-06-27 10:49:18,420][01059] Worker 25 uses CPU cores [25] [2024-06-27 10:49:18,426][01060] Worker 23 uses CPU cores [23] [2024-06-27 10:49:18,440][01054] Worker 17 uses CPU cores [17] [2024-06-27 10:49:18,449][01036] Worker 0 uses CPU cores [0] [2024-06-27 10:49:18,451][01056] Worker 20 uses CPU cores [20] [2024-06-27 10:49:18,466][01048] Worker 12 uses CPU cores [12] [2024-06-27 10:49:18,484][01065] Worker 31 uses CPU cores [31] [2024-06-27 10:49:18,490][01053] Worker 18 uses CPU cores [18] [2024-06-27 10:49:18,506][01040] Worker 5 uses CPU cores [5] [2024-06-27 10:49:18,516][01038] Worker 3 uses CPU cores [3] [2024-06-27 10:49:18,522][01051] Worker 13 uses CPU cores [13] [2024-06-27 10:49:18,537][01047] Worker 8 uses CPU cores [8] [2024-06-27 10:49:18,550][01061] Worker 24 uses CPU cores [24] [2024-06-27 10:49:18,576][01041] Worker 6 uses CPU cores [6] [2024-06-27 10:49:18,576][01037] Worker 2 uses CPU cores [2] [2024-06-27 10:49:18,583][01015] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2024-06-27 10:49:18,583][01015] Set environment var CUDA_VISIBLE_DEVICES to '0' (GPU indices [0]) for learning process 0 [2024-06-27 10:49:18,587][01062] Worker 26 uses CPU cores [26] [2024-06-27 10:49:18,590][01015] Num visible devices: 1 [2024-06-27 10:49:18,600][01044] Worker 11 uses CPU cores [11] [2024-06-27 10:49:18,604][01067] Worker 28 uses CPU cores [28] [2024-06-27 10:49:18,605][01015] Setting fixed seed 0 [2024-06-27 10:49:18,606][01015] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2024-06-27 10:49:18,606][01015] Initializing actor-critic model on device cuda:0 [2024-06-27 10:49:18,619][01055] Worker 19 uses CPU cores [19] [2024-06-27 10:49:19,293][01015] RunningMeanStd input shape: (11, 11) [2024-06-27 10:49:19,294][01015] RunningMeanStd input shape: (11, 11) [2024-06-27 10:49:19,294][01015] RunningMeanStd input shape: (11, 11) [2024-06-27 10:49:19,294][01015] RunningMeanStd input shape: (11, 11) [2024-06-27 10:49:19,294][01015] RunningMeanStd input shape: (11, 11) [2024-06-27 10:49:19,294][01015] RunningMeanStd input shape: (11, 11) [2024-06-27 10:49:19,294][01015] RunningMeanStd input shape: (11, 11) [2024-06-27 10:49:19,294][01015] RunningMeanStd input shape: (11, 11) [2024-06-27 10:49:19,294][01015] RunningMeanStd input shape: (11, 11) [2024-06-27 10:49:19,294][01015] RunningMeanStd input shape: (11, 11) [2024-06-27 10:49:19,294][01015] RunningMeanStd input shape: (11, 11) [2024-06-27 10:49:19,294][01015] RunningMeanStd input shape: (11, 11) [2024-06-27 10:49:19,294][01015] RunningMeanStd input shape: (11, 11) [2024-06-27 10:49:19,294][01015] RunningMeanStd input shape: (11, 11) [2024-06-27 10:49:19,294][01015] RunningMeanStd input shape: (11, 11) [2024-06-27 10:49:19,294][01015] RunningMeanStd input shape: (11, 11) [2024-06-27 10:49:19,294][01015] RunningMeanStd input shape: (11, 11) [2024-06-27 10:49:19,294][01015] RunningMeanStd input shape: (11, 11) [2024-06-27 10:49:19,294][01015] RunningMeanStd input shape: (11, 11) [2024-06-27 10:49:19,295][01015] RunningMeanStd input shape: (11, 11) [2024-06-27 10:49:19,295][01015] RunningMeanStd input shape: (11, 11) [2024-06-27 10:49:19,295][01015] RunningMeanStd input shape: (11, 11) [2024-06-27 10:49:19,295][01015] RunningMeanStd input shape: (11, 11) [2024-06-27 10:49:19,298][01015] RunningMeanStd input shape: (1,) [2024-06-27 10:49:19,298][01015] RunningMeanStd input shape: (1,) [2024-06-27 10:49:19,298][01015] RunningMeanStd input shape: (1,) [2024-06-27 10:49:19,298][01015] RunningMeanStd input shape: (1,) [2024-06-27 10:49:19,299][01015] RunningMeanStd input shape: (11, 11) [2024-06-27 10:49:19,333][01015] RunningMeanStd input shape: (1,) [2024-06-27 10:49:19,341][01015] Created Actor Critic model with architecture: [2024-06-27 10:49:19,341][01015] SampleFactoryAgentWrapper( (obs_normalizer): ObservationNormalizer() (returns_normalizer): RecursiveScriptModule(original_name=RunningMeanStdInPlace) (agent): MettaAgent( (_encoder): MultiFeatureSetEncoder( (feature_set_encoders): ModuleDict( (grid_obs): FeatureSetEncoder( (_normalizer): FeatureListNormalizer( (_norms_dict): ModuleDict( (agent): RunningMeanStdInPlace() (altar): RunningMeanStdInPlace() (clock): RunningMeanStdInPlace() (converter): RunningMeanStdInPlace() (generator): RunningMeanStdInPlace() (wall): RunningMeanStdInPlace() (agent:dir): RunningMeanStdInPlace() (agent:energy): RunningMeanStdInPlace() (agent:frozen): RunningMeanStdInPlace() (agent:hp): RunningMeanStdInPlace() (agent:id): RunningMeanStdInPlace() (agent:inv_r1): RunningMeanStdInPlace() (agent:inv_r2): RunningMeanStdInPlace() (agent:inv_r3): RunningMeanStdInPlace() (agent:shield): RunningMeanStdInPlace() (altar:hp): RunningMeanStdInPlace() (altar:state): RunningMeanStdInPlace() (converter:hp): RunningMeanStdInPlace() (converter:state): RunningMeanStdInPlace() (generator:amount): RunningMeanStdInPlace() (generator:hp): RunningMeanStdInPlace() (generator:state): RunningMeanStdInPlace() (wall:hp): RunningMeanStdInPlace() ) ) (embedding_net): Sequential( (0): Linear(in_features=125, out_features=512, bias=True) (1): ELU(alpha=1.0) (2): Linear(in_features=512, out_features=512, bias=True) (3): ELU(alpha=1.0) (4): Linear(in_features=512, out_features=512, bias=True) (5): ELU(alpha=1.0) (6): Linear(in_features=512, out_features=512, bias=True) (7): ELU(alpha=1.0) ) ) (global_vars): FeatureSetEncoder( (_normalizer): FeatureListNormalizer( (_norms_dict): ModuleDict( (_steps): RunningMeanStdInPlace() ) ) (embedding_net): Sequential( (0): Linear(in_features=5, out_features=8, bias=True) (1): ELU(alpha=1.0) (2): Linear(in_features=8, out_features=8, bias=True) (3): ELU(alpha=1.0) ) ) (last_action): FeatureSetEncoder( (_normalizer): FeatureListNormalizer( (_norms_dict): ModuleDict( (last_action_id): RunningMeanStdInPlace() (last_action_val): RunningMeanStdInPlace() ) ) (embedding_net): Sequential( (0): Linear(in_features=5, out_features=8, bias=True) (1): ELU(alpha=1.0) (2): Linear(in_features=8, out_features=8, bias=True) (3): ELU(alpha=1.0) ) ) (last_reward): FeatureSetEncoder( (_normalizer): FeatureListNormalizer( (_norms_dict): ModuleDict( (last_reward): RunningMeanStdInPlace() ) ) (embedding_net): Sequential( (0): Linear(in_features=5, out_features=8, bias=True) (1): ELU(alpha=1.0) (2): Linear(in_features=8, out_features=8, bias=True) (3): ELU(alpha=1.0) ) ) (kinship): FeatureSetEncoder( (_normalizer): FeatureListNormalizer( (_norms_dict): ModuleDict( (kinship): RunningMeanStdInPlace() ) ) (embedding_net): Sequential( (0): Linear(in_features=125, out_features=8, bias=True) (1): ELU(alpha=1.0) (2): Linear(in_features=8, out_features=8, bias=True) (3): ELU(alpha=1.0) ) ) ) (merged_encoder): Sequential( (0): Linear(in_features=544, out_features=512, bias=True) (1): ELU(alpha=1.0) (2): Linear(in_features=512, out_features=512, bias=True) (3): ELU(alpha=1.0) (4): Linear(in_features=512, out_features=512, bias=True) (5): ELU(alpha=1.0) ) ) (_decoder): Decoder( (mlp): Identity() ) (_critic_linear): Linear(in_features=512, out_features=1, bias=True) ) (_core): ModelCoreRNN( (core): GRU(512, 512) ) (_action_parameterization): ActionParameterizationDefault( (distribution_linear): Linear(in_features=512, out_features=16, bias=True) ) ) [2024-06-27 10:49:19,415][01015] Using optimizer [2024-06-27 10:49:19,598][01015] No checkpoints found [2024-06-27 10:49:19,599][01015] Did not load from checkpoint, starting from scratch! [2024-06-27 10:49:19,599][01015] Initialized policy 0 weights for model version 0 [2024-06-27 10:49:19,600][01015] LearnerWorker_p0 finished initialization! [2024-06-27 10:49:19,601][01015] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2024-06-27 10:49:20,311][01035] RunningMeanStd input shape: (11, 11) [2024-06-27 10:49:20,311][01035] RunningMeanStd input shape: (11, 11) [2024-06-27 10:49:20,311][01035] RunningMeanStd input shape: (11, 11) [2024-06-27 10:49:20,311][01035] RunningMeanStd input shape: (11, 11) [2024-06-27 10:49:20,311][01035] RunningMeanStd input shape: (11, 11) [2024-06-27 10:49:20,312][01035] RunningMeanStd input shape: (11, 11) [2024-06-27 10:49:20,320][01035] RunningMeanStd input shape: (11, 11) [2024-06-27 10:49:20,320][01035] RunningMeanStd input shape: (11, 11) [2024-06-27 10:49:20,320][01035] RunningMeanStd input shape: (11, 11) [2024-06-27 10:49:20,320][01035] RunningMeanStd input shape: (11, 11) [2024-06-27 10:49:20,320][01035] RunningMeanStd input shape: (11, 11) [2024-06-27 10:49:20,320][01035] RunningMeanStd input shape: (11, 11) [2024-06-27 10:49:20,320][01035] RunningMeanStd input shape: (11, 11) [2024-06-27 10:49:20,320][01035] RunningMeanStd input shape: (11, 11) [2024-06-27 10:49:20,320][01035] RunningMeanStd input shape: (11, 11) [2024-06-27 10:49:20,320][01035] RunningMeanStd input shape: (11, 11) [2024-06-27 10:49:20,320][01035] RunningMeanStd input shape: (11, 11) [2024-06-27 10:49:20,320][01035] RunningMeanStd input shape: (11, 11) [2024-06-27 10:49:20,320][01035] RunningMeanStd input shape: (11, 11) [2024-06-27 10:49:20,320][01035] RunningMeanStd input shape: (11, 11) [2024-06-27 10:49:20,320][01035] RunningMeanStd input shape: (11, 11) [2024-06-27 10:49:20,320][01035] RunningMeanStd input shape: (11, 11) [2024-06-27 10:49:20,320][01035] RunningMeanStd input shape: (11, 11) [2024-06-27 10:49:20,323][01035] RunningMeanStd input shape: (1,) [2024-06-27 10:49:20,324][01035] RunningMeanStd input shape: (1,) [2024-06-27 10:49:20,324][01035] RunningMeanStd input shape: (1,) [2024-06-27 10:49:20,324][01035] RunningMeanStd input shape: (1,) [2024-06-27 10:49:20,324][01035] RunningMeanStd input shape: (11, 11) [2024-06-27 10:49:20,359][01035] RunningMeanStd input shape: (1,) [2024-06-27 10:49:20,384][00794] Inference worker 0-0 is ready! [2024-06-27 10:49:20,385][00794] All inference workers are ready! Signal rollout workers to start! [2024-06-27 10:49:22,704][00794] Fps is (10 sec: nan, 60 sec: nan, 300 sec: nan). Total num frames: 0. Throughput: 0: nan. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0) [2024-06-27 10:49:23,113][01062] Decorrelating experience for 0 frames... [2024-06-27 10:49:23,118][01061] Decorrelating experience for 0 frames... [2024-06-27 10:49:23,121][01064] Decorrelating experience for 0 frames... [2024-06-27 10:49:23,134][01066] Decorrelating experience for 0 frames... [2024-06-27 10:49:23,141][01058] Decorrelating experience for 0 frames... [2024-06-27 10:49:23,154][01060] Decorrelating experience for 0 frames... [2024-06-27 10:49:23,155][01055] Decorrelating experience for 0 frames... [2024-06-27 10:49:23,156][01053] Decorrelating experience for 0 frames... [2024-06-27 10:49:23,158][01038] Decorrelating experience for 0 frames... [2024-06-27 10:49:23,160][01046] Decorrelating experience for 0 frames... [2024-06-27 10:49:23,162][01044] Decorrelating experience for 0 frames... [2024-06-27 10:49:23,164][01040] Decorrelating experience for 0 frames... [2024-06-27 10:49:23,164][01043] Decorrelating experience for 0 frames... [2024-06-27 10:49:23,165][01057] Decorrelating experience for 0 frames... [2024-06-27 10:49:23,166][01050] Decorrelating experience for 0 frames... [2024-06-27 10:49:23,167][01042] Decorrelating experience for 0 frames... [2024-06-27 10:49:23,169][01037] Decorrelating experience for 0 frames... [2024-06-27 10:49:23,169][01052] Decorrelating experience for 0 frames... [2024-06-27 10:49:23,170][01047] Decorrelating experience for 0 frames... [2024-06-27 10:49:23,170][01054] Decorrelating experience for 0 frames... [2024-06-27 10:49:23,171][01039] Decorrelating experience for 0 frames... [2024-06-27 10:49:23,172][01041] Decorrelating experience for 0 frames... [2024-06-27 10:49:23,172][01049] Decorrelating experience for 0 frames... [2024-06-27 10:49:23,173][01045] Decorrelating experience for 0 frames... [2024-06-27 10:49:23,173][01048] Decorrelating experience for 0 frames... [2024-06-27 10:49:23,174][01036] Decorrelating experience for 0 frames... [2024-06-27 10:49:23,174][01065] Decorrelating experience for 0 frames... [2024-06-27 10:49:23,174][01051] Decorrelating experience for 0 frames... [2024-06-27 10:49:23,176][01056] Decorrelating experience for 0 frames... [2024-06-27 10:49:23,181][01067] Decorrelating experience for 0 frames... [2024-06-27 10:49:23,189][01059] Decorrelating experience for 0 frames... [2024-06-27 10:49:23,203][01063] Decorrelating experience for 0 frames... [2024-06-27 10:49:24,225][01062] Decorrelating experience for 256 frames... [2024-06-27 10:49:24,233][01061] Decorrelating experience for 256 frames... [2024-06-27 10:49:24,233][01064] Decorrelating experience for 256 frames... [2024-06-27 10:49:24,261][01066] Decorrelating experience for 256 frames... [2024-06-27 10:49:24,265][01058] Decorrelating experience for 256 frames... [2024-06-27 10:49:24,277][01055] Decorrelating experience for 256 frames... [2024-06-27 10:49:24,286][01060] Decorrelating experience for 256 frames... [2024-06-27 10:49:24,292][01053] Decorrelating experience for 256 frames... [2024-06-27 10:49:24,309][01057] Decorrelating experience for 256 frames... [2024-06-27 10:49:24,310][01038] Decorrelating experience for 256 frames... [2024-06-27 10:49:24,316][01044] Decorrelating experience for 256 frames... [2024-06-27 10:49:24,316][01046] Decorrelating experience for 256 frames... [2024-06-27 10:49:24,322][01040] Decorrelating experience for 256 frames... [2024-06-27 10:49:24,322][01043] Decorrelating experience for 256 frames... [2024-06-27 10:49:24,325][01065] Decorrelating experience for 256 frames... [2024-06-27 10:49:24,326][01050] Decorrelating experience for 256 frames... [2024-06-27 10:49:24,332][01042] Decorrelating experience for 256 frames... [2024-06-27 10:49:24,334][01037] Decorrelating experience for 256 frames... [2024-06-27 10:49:24,336][01052] Decorrelating experience for 256 frames... [2024-06-27 10:49:24,336][01054] Decorrelating experience for 256 frames... [2024-06-27 10:49:24,337][01047] Decorrelating experience for 256 frames... [2024-06-27 10:49:24,339][01048] Decorrelating experience for 256 frames... [2024-06-27 10:49:24,340][01041] Decorrelating experience for 256 frames... [2024-06-27 10:49:24,342][01049] Decorrelating experience for 256 frames... [2024-06-27 10:49:24,345][01045] Decorrelating experience for 256 frames... [2024-06-27 10:49:24,345][01039] Decorrelating experience for 256 frames... [2024-06-27 10:49:24,346][01056] Decorrelating experience for 256 frames... [2024-06-27 10:49:24,347][01036] Decorrelating experience for 256 frames... [2024-06-27 10:49:24,348][01051] Decorrelating experience for 256 frames... [2024-06-27 10:49:24,354][01067] Decorrelating experience for 256 frames... [2024-06-27 10:49:24,372][01059] Decorrelating experience for 256 frames... [2024-06-27 10:49:24,389][01063] Decorrelating experience for 256 frames... [2024-06-27 10:49:27,708][00794] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 0. Throughput: 0: 7266.1. Samples: 36360. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0) [2024-06-27 10:49:31,473][01050] Worker 15, sleep for 70.312 sec to decorrelate experience collection [2024-06-27 10:49:31,503][01057] Worker 21, sleep for 98.438 sec to decorrelate experience collection [2024-06-27 10:49:31,599][01043] Worker 7, sleep for 32.812 sec to decorrelate experience collection [2024-06-27 10:49:31,959][01049] Worker 14, sleep for 65.625 sec to decorrelate experience collection [2024-06-27 10:49:32,078][01051] Worker 13, sleep for 60.938 sec to decorrelate experience collection [2024-06-27 10:49:32,078][01041] Worker 6, sleep for 28.125 sec to decorrelate experience collection [2024-06-27 10:49:32,087][01053] Worker 18, sleep for 84.375 sec to decorrelate experience collection [2024-06-27 10:49:32,087][01058] Worker 22, sleep for 103.125 sec to decorrelate experience collection [2024-06-27 10:49:32,087][01061] Worker 24, sleep for 112.500 sec to decorrelate experience collection [2024-06-27 10:49:32,087][01055] Worker 19, sleep for 89.062 sec to decorrelate experience collection [2024-06-27 10:49:32,087][01066] Worker 30, sleep for 140.625 sec to decorrelate experience collection [2024-06-27 10:49:32,091][01048] Worker 12, sleep for 56.250 sec to decorrelate experience collection [2024-06-27 10:49:32,092][01044] Worker 11, sleep for 51.562 sec to decorrelate experience collection [2024-06-27 10:49:32,097][01045] Worker 10, sleep for 46.875 sec to decorrelate experience collection [2024-06-27 10:49:32,099][01037] Worker 2, sleep for 9.375 sec to decorrelate experience collection [2024-06-27 10:49:32,102][01062] Worker 26, sleep for 121.875 sec to decorrelate experience collection [2024-06-27 10:49:32,102][01060] Worker 23, sleep for 107.812 sec to decorrelate experience collection [2024-06-27 10:49:32,111][01046] Worker 9, sleep for 42.188 sec to decorrelate experience collection [2024-06-27 10:49:32,120][01047] Worker 8, sleep for 37.500 sec to decorrelate experience collection [2024-06-27 10:49:32,139][01015] Signal inference workers to stop experience collection... [2024-06-27 10:49:32,139][01042] Worker 1, sleep for 4.688 sec to decorrelate experience collection [2024-06-27 10:49:32,140][01064] Worker 29, sleep for 135.938 sec to decorrelate experience collection [2024-06-27 10:49:32,145][01038] Worker 3, sleep for 14.062 sec to decorrelate experience collection [2024-06-27 10:49:32,147][01035] InferenceWorker_p0-w0: stopping experience collection [2024-06-27 10:49:32,150][01065] Worker 31, sleep for 145.312 sec to decorrelate experience collection [2024-06-27 10:49:32,153][01056] Worker 20, sleep for 93.750 sec to decorrelate experience collection [2024-06-27 10:49:32,159][01054] Worker 17, sleep for 79.688 sec to decorrelate experience collection [2024-06-27 10:49:32,686][01015] Signal inference workers to resume experience collection... [2024-06-27 10:49:32,686][01035] InferenceWorker_p0-w0: resuming experience collection [2024-06-27 10:49:32,700][01040] Worker 5, sleep for 23.438 sec to decorrelate experience collection [2024-06-27 10:49:32,700][01067] Worker 28, sleep for 131.250 sec to decorrelate experience collection [2024-06-27 10:49:32,700][01052] Worker 16, sleep for 75.000 sec to decorrelate experience collection [2024-06-27 10:49:32,704][00794] Fps is (10 sec: 1638.4, 60 sec: 1638.4, 300 sec: 1638.4). Total num frames: 16384. Throughput: 0: 32700.4. Samples: 327000. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2024-06-27 10:49:32,704][00794] Avg episode reward: [(0, '0.000')] [2024-06-27 10:49:32,706][01059] Worker 25, sleep for 117.188 sec to decorrelate experience collection [2024-06-27 10:49:32,785][01063] Worker 27, sleep for 126.562 sec to decorrelate experience collection [2024-06-27 10:49:33,242][01039] Worker 4, sleep for 18.750 sec to decorrelate experience collection [2024-06-27 10:49:33,844][01035] Updated weights for policy 0, policy_version 10 (0.0015) [2024-06-27 10:49:35,704][00794] Heartbeat connected on Batcher_0 [2024-06-27 10:49:35,706][00794] Heartbeat connected on LearnerWorker_p0 [2024-06-27 10:49:35,720][00794] Heartbeat connected on RolloutWorker_w0 [2024-06-27 10:49:35,779][00794] Heartbeat connected on InferenceWorker_p0-w0 [2024-06-27 10:49:36,850][01042] Worker 1 awakens! [2024-06-27 10:49:36,862][00794] Heartbeat connected on RolloutWorker_w1 [2024-06-27 10:49:37,704][00794] Fps is (10 sec: 16390.6, 60 sec: 10922.7, 300 sec: 10922.7). Total num frames: 163840. Throughput: 0: 22013.3. Samples: 330200. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2024-06-27 10:49:37,704][00794] Avg episode reward: [(0, '0.000')] [2024-06-27 10:49:37,705][01015] Saving new best policy, reward=0.000! [2024-06-27 10:49:41,520][01037] Worker 2 awakens! [2024-06-27 10:49:41,525][00794] Heartbeat connected on RolloutWorker_w2 [2024-06-27 10:49:42,704][00794] Fps is (10 sec: 16383.6, 60 sec: 9011.1, 300 sec: 9011.1). Total num frames: 180224. Throughput: 0: 17149.9. Samples: 343000. Policy #0 lag: (min: 0.0, avg: 9.4, max: 10.0) [2024-06-27 10:49:42,704][00794] Avg episode reward: [(0, '0.000')] [2024-06-27 10:49:46,279][01038] Worker 3 awakens! [2024-06-27 10:49:46,291][00794] Heartbeat connected on RolloutWorker_w3 [2024-06-27 10:49:47,704][00794] Fps is (10 sec: 3276.8, 60 sec: 7864.3, 300 sec: 7864.3). Total num frames: 196608. Throughput: 0: 14601.6. Samples: 365040. Policy #0 lag: (min: 0.0, avg: 1.1, max: 11.0) [2024-06-27 10:49:47,704][00794] Avg episode reward: [(0, '0.001')] [2024-06-27 10:49:52,005][01039] Worker 4 awakens! [2024-06-27 10:49:52,013][00794] Heartbeat connected on RolloutWorker_w4 [2024-06-27 10:49:52,704][00794] Fps is (10 sec: 4915.3, 60 sec: 7645.9, 300 sec: 7645.9). Total num frames: 229376. Throughput: 0: 12632.0. Samples: 378960. Policy #0 lag: (min: 0.0, avg: 4.4, max: 12.0) [2024-06-27 10:49:52,704][00794] Avg episode reward: [(0, '0.001')] [2024-06-27 10:49:52,722][01015] Saving new best policy, reward=0.001! [2024-06-27 10:49:56,238][01040] Worker 5 awakens! [2024-06-27 10:49:56,246][00794] Heartbeat connected on RolloutWorker_w5 [2024-06-27 10:49:57,704][00794] Fps is (10 sec: 8192.1, 60 sec: 7958.0, 300 sec: 7958.0). Total num frames: 278528. Throughput: 0: 12394.3. Samples: 433800. Policy #0 lag: (min: 0.0, avg: 2.1, max: 15.0) [2024-06-27 10:49:57,704][00794] Avg episode reward: [(0, '0.000')] [2024-06-27 10:50:00,205][01041] Worker 6 awakens! [2024-06-27 10:50:00,211][00794] Heartbeat connected on RolloutWorker_w6 [2024-06-27 10:50:00,675][01035] Updated weights for policy 0, policy_version 20 (0.0016) [2024-06-27 10:50:02,704][00794] Fps is (10 sec: 13107.3, 60 sec: 9011.2, 300 sec: 9011.2). Total num frames: 360448. Throughput: 0: 13084.0. Samples: 523360. Policy #0 lag: (min: 0.0, avg: 6.3, max: 18.0) [2024-06-27 10:50:02,704][00794] Avg episode reward: [(0, '0.001')] [2024-06-27 10:50:04,512][01043] Worker 7 awakens! [2024-06-27 10:50:04,519][00794] Heartbeat connected on RolloutWorker_w7 [2024-06-27 10:50:07,704][00794] Fps is (10 sec: 14745.5, 60 sec: 9466.3, 300 sec: 9466.3). Total num frames: 425984. Throughput: 0: 12853.3. Samples: 578400. Policy #0 lag: (min: 0.0, avg: 2.9, max: 5.0) [2024-06-27 10:50:07,704][00794] Avg episode reward: [(0, '0.001')] [2024-06-27 10:50:09,287][01035] Updated weights for policy 0, policy_version 30 (0.0012) [2024-06-27 10:50:09,720][01047] Worker 8 awakens! [2024-06-27 10:50:09,725][00794] Heartbeat connected on RolloutWorker_w8 [2024-06-27 10:50:12,704][00794] Fps is (10 sec: 18022.0, 60 sec: 10813.4, 300 sec: 10813.4). Total num frames: 540672. Throughput: 0: 14598.6. Samples: 693240. Policy #0 lag: (min: 0.0, avg: 3.2, max: 6.0) [2024-06-27 10:50:12,704][00794] Avg episode reward: [(0, '0.001')] [2024-06-27 10:50:14,399][01046] Worker 9 awakens! [2024-06-27 10:50:14,407][00794] Heartbeat connected on RolloutWorker_w9 [2024-06-27 10:50:17,442][01035] Updated weights for policy 0, policy_version 40 (0.0013) [2024-06-27 10:50:17,704][00794] Fps is (10 sec: 24576.0, 60 sec: 12213.5, 300 sec: 12213.5). Total num frames: 671744. Throughput: 0: 11115.5. Samples: 827200. Policy #0 lag: (min: 0.0, avg: 2.8, max: 7.0) [2024-06-27 10:50:17,704][00794] Avg episode reward: [(0, '0.000')] [2024-06-27 10:50:19,072][01045] Worker 10 awakens! [2024-06-27 10:50:19,078][00794] Heartbeat connected on RolloutWorker_w10 [2024-06-27 10:50:22,704][00794] Fps is (10 sec: 24576.3, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 786432. Throughput: 0: 12744.9. Samples: 903720. Policy #0 lag: (min: 0.0, avg: 3.2, max: 7.0) [2024-06-27 10:50:22,704][00794] Avg episode reward: [(0, '0.000')] [2024-06-27 10:50:23,263][01035] Updated weights for policy 0, policy_version 50 (0.0016) [2024-06-27 10:50:23,705][01044] Worker 11 awakens! [2024-06-27 10:50:23,710][00794] Heartbeat connected on RolloutWorker_w11 [2024-06-27 10:50:27,704][00794] Fps is (10 sec: 26214.5, 60 sec: 15565.9, 300 sec: 14367.5). Total num frames: 933888. Throughput: 0: 16078.3. Samples: 1066520. Policy #0 lag: (min: 0.0, avg: 4.7, max: 9.0) [2024-06-27 10:50:27,704][00794] Avg episode reward: [(0, '0.001')] [2024-06-27 10:50:28,442][01048] Worker 12 awakens! [2024-06-27 10:50:28,449][00794] Heartbeat connected on RolloutWorker_w12 [2024-06-27 10:50:29,790][01035] Updated weights for policy 0, policy_version 60 (0.0015) [2024-06-27 10:50:32,704][00794] Fps is (10 sec: 29491.1, 60 sec: 17749.3, 300 sec: 15447.8). Total num frames: 1081344. Throughput: 0: 19578.7. Samples: 1246080. Policy #0 lag: (min: 0.0, avg: 5.0, max: 9.0) [2024-06-27 10:50:32,704][00794] Avg episode reward: [(0, '0.001')] [2024-06-27 10:50:33,117][01051] Worker 13 awakens! [2024-06-27 10:50:33,125][00794] Heartbeat connected on RolloutWorker_w13 [2024-06-27 10:50:35,019][01035] Updated weights for policy 0, policy_version 70 (0.0017) [2024-06-27 10:50:37,684][01049] Worker 14 awakens! [2024-06-27 10:50:37,690][00794] Heartbeat connected on RolloutWorker_w14 [2024-06-27 10:50:37,704][00794] Fps is (10 sec: 31129.6, 60 sec: 18022.4, 300 sec: 16602.5). Total num frames: 1245184. Throughput: 0: 21405.3. Samples: 1342200. Policy #0 lag: (min: 0.0, avg: 4.6, max: 9.0) [2024-06-27 10:50:37,704][00794] Avg episode reward: [(0, '0.000')] [2024-06-27 10:50:39,869][01035] Updated weights for policy 0, policy_version 80 (0.0024) [2024-06-27 10:50:41,884][01050] Worker 15 awakens! [2024-06-27 10:50:41,893][00794] Heartbeat connected on RolloutWorker_w15 [2024-06-27 10:50:42,704][00794] Fps is (10 sec: 32767.8, 60 sec: 20480.0, 300 sec: 17612.8). Total num frames: 1409024. Throughput: 0: 24490.6. Samples: 1535880. Policy #0 lag: (min: 0.0, avg: 29.6, max: 84.0) [2024-06-27 10:50:42,704][00794] Avg episode reward: [(0, '0.001')] [2024-06-27 10:50:45,213][01035] Updated weights for policy 0, policy_version 90 (0.0030) [2024-06-27 10:50:47,704][00794] Fps is (10 sec: 31129.5, 60 sec: 22664.5, 300 sec: 18311.5). Total num frames: 1556480. Throughput: 0: 26652.8. Samples: 1722740. Policy #0 lag: (min: 0.0, avg: 5.6, max: 11.0) [2024-06-27 10:50:47,704][00794] Avg episode reward: [(0, '0.001')] [2024-06-27 10:50:47,800][01052] Worker 16 awakens! [2024-06-27 10:50:47,809][00794] Heartbeat connected on RolloutWorker_w16 [2024-06-27 10:50:50,597][01035] Updated weights for policy 0, policy_version 100 (0.0023) [2024-06-27 10:50:51,944][01054] Worker 17 awakens! [2024-06-27 10:50:51,954][00794] Heartbeat connected on RolloutWorker_w17 [2024-06-27 10:50:52,704][00794] Fps is (10 sec: 31129.8, 60 sec: 24849.0, 300 sec: 19114.7). Total num frames: 1720320. Throughput: 0: 27648.0. Samples: 1822560. Policy #0 lag: (min: 1.0, avg: 6.2, max: 12.0) [2024-06-27 10:50:52,704][00794] Avg episode reward: [(0, '0.000')] [2024-06-27 10:50:55,040][01035] Updated weights for policy 0, policy_version 110 (0.0032) [2024-06-27 10:50:56,560][01053] Worker 18 awakens! [2024-06-27 10:50:56,570][00794] Heartbeat connected on RolloutWorker_w18 [2024-06-27 10:50:57,704][00794] Fps is (10 sec: 32767.9, 60 sec: 26760.5, 300 sec: 19833.3). Total num frames: 1884160. Throughput: 0: 29468.1. Samples: 2019300. Policy #0 lag: (min: 0.0, avg: 6.5, max: 14.0) [2024-06-27 10:50:57,704][00794] Avg episode reward: [(0, '0.001')] [2024-06-27 10:51:00,691][01035] Updated weights for policy 0, policy_version 120 (0.0028) [2024-06-27 10:51:01,248][01055] Worker 19 awakens! [2024-06-27 10:51:01,259][00794] Heartbeat connected on RolloutWorker_w19 [2024-06-27 10:51:02,704][00794] Fps is (10 sec: 32767.9, 60 sec: 28125.8, 300 sec: 20480.0). Total num frames: 2048000. Throughput: 0: 30956.0. Samples: 2220220. Policy #0 lag: (min: 0.0, avg: 6.5, max: 14.0) [2024-06-27 10:51:02,704][00794] Avg episode reward: [(0, '0.001')] [2024-06-27 10:51:04,120][01035] Updated weights for policy 0, policy_version 130 (0.0035) [2024-06-27 10:51:06,000][01056] Worker 20 awakens! [2024-06-27 10:51:06,011][00794] Heartbeat connected on RolloutWorker_w20 [2024-06-27 10:51:07,704][00794] Fps is (10 sec: 32767.8, 60 sec: 29764.2, 300 sec: 21065.1). Total num frames: 2211840. Throughput: 0: 31675.1. Samples: 2329100. Policy #0 lag: (min: 0.0, avg: 23.8, max: 131.0) [2024-06-27 10:51:07,704][00794] Avg episode reward: [(0, '0.001')] [2024-06-27 10:51:09,146][01035] Updated weights for policy 0, policy_version 140 (0.0028) [2024-06-27 10:51:09,960][01057] Worker 21 awakens! [2024-06-27 10:51:09,971][00794] Heartbeat connected on RolloutWorker_w21 [2024-06-27 10:51:12,704][00794] Fps is (10 sec: 34406.7, 60 sec: 30856.6, 300 sec: 21746.0). Total num frames: 2392064. Throughput: 0: 32727.5. Samples: 2539260. Policy #0 lag: (min: 0.0, avg: 8.8, max: 16.0) [2024-06-27 10:51:12,704][00794] Avg episode reward: [(0, '0.000')] [2024-06-27 10:51:12,715][01015] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000000146_2392064.pth... [2024-06-27 10:51:14,110][01035] Updated weights for policy 0, policy_version 150 (0.0037) [2024-06-27 10:51:15,230][01058] Worker 22 awakens! [2024-06-27 10:51:15,242][00794] Heartbeat connected on RolloutWorker_w22 [2024-06-27 10:51:17,704][00794] Fps is (10 sec: 39321.6, 60 sec: 32221.8, 300 sec: 22652.6). Total num frames: 2605056. Throughput: 0: 33566.7. Samples: 2756580. Policy #0 lag: (min: 0.0, avg: 44.0, max: 153.0) [2024-06-27 10:51:17,704][00794] Avg episode reward: [(0, '0.000')] [2024-06-27 10:51:18,723][01035] Updated weights for policy 0, policy_version 160 (0.0030) [2024-06-27 10:51:20,015][01060] Worker 23 awakens! [2024-06-27 10:51:20,028][00794] Heartbeat connected on RolloutWorker_w23 [2024-06-27 10:51:22,221][01035] Updated weights for policy 0, policy_version 170 (0.0029) [2024-06-27 10:51:22,704][00794] Fps is (10 sec: 40959.5, 60 sec: 33587.1, 300 sec: 23347.2). Total num frames: 2801664. Throughput: 0: 33957.2. Samples: 2870280. Policy #0 lag: (min: 0.0, avg: 7.0, max: 15.0) [2024-06-27 10:51:22,704][00794] Avg episode reward: [(0, '0.001')] [2024-06-27 10:51:24,684][01061] Worker 24 awakens! [2024-06-27 10:51:24,697][00794] Heartbeat connected on RolloutWorker_w24 [2024-06-27 10:51:27,308][01035] Updated weights for policy 0, policy_version 180 (0.0034) [2024-06-27 10:51:27,704][00794] Fps is (10 sec: 34406.4, 60 sec: 33587.1, 300 sec: 23592.9). Total num frames: 2949120. Throughput: 0: 34594.7. Samples: 3092640. Policy #0 lag: (min: 1.0, avg: 6.8, max: 16.0) [2024-06-27 10:51:27,704][00794] Avg episode reward: [(0, '0.000')] [2024-06-27 10:51:29,996][01059] Worker 25 awakens! [2024-06-27 10:51:30,009][00794] Heartbeat connected on RolloutWorker_w25 [2024-06-27 10:51:32,188][01035] Updated weights for policy 0, policy_version 190 (0.0021) [2024-06-27 10:51:32,704][00794] Fps is (10 sec: 32768.2, 60 sec: 34133.3, 300 sec: 24071.9). Total num frames: 3129344. Throughput: 0: 35494.6. Samples: 3320000. Policy #0 lag: (min: 0.0, avg: 8.9, max: 16.0) [2024-06-27 10:51:32,704][00794] Avg episode reward: [(0, '0.000')] [2024-06-27 10:51:34,077][01062] Worker 26 awakens! [2024-06-27 10:51:34,090][00794] Heartbeat connected on RolloutWorker_w26 [2024-06-27 10:51:35,476][01035] Updated weights for policy 0, policy_version 200 (0.0037) [2024-06-27 10:51:37,704][00794] Fps is (10 sec: 40959.8, 60 sec: 35225.5, 300 sec: 24879.4). Total num frames: 3358720. Throughput: 0: 35857.7. Samples: 3436160. Policy #0 lag: (min: 0.0, avg: 10.7, max: 18.0) [2024-06-27 10:51:37,705][00794] Avg episode reward: [(0, '0.001')] [2024-06-27 10:51:39,448][01063] Worker 27 awakens! [2024-06-27 10:51:39,460][00794] Heartbeat connected on RolloutWorker_w27 [2024-06-27 10:51:40,134][01035] Updated weights for policy 0, policy_version 210 (0.0038) [2024-06-27 10:51:42,704][00794] Fps is (10 sec: 42598.7, 60 sec: 35771.8, 300 sec: 25395.2). Total num frames: 3555328. Throughput: 0: 36712.0. Samples: 3671340. Policy #0 lag: (min: 0.0, avg: 8.3, max: 18.0) [2024-06-27 10:51:42,704][00794] Avg episode reward: [(0, '0.001')] [2024-06-27 10:51:43,599][01035] Updated weights for policy 0, policy_version 220 (0.0034) [2024-06-27 10:51:44,050][01067] Worker 28 awakens! [2024-06-27 10:51:44,064][00794] Heartbeat connected on RolloutWorker_w28 [2024-06-27 10:51:47,704][00794] Fps is (10 sec: 36045.1, 60 sec: 36044.8, 300 sec: 25649.4). Total num frames: 3719168. Throughput: 0: 37558.7. Samples: 3910360. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2024-06-27 10:51:47,704][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 10:51:48,176][01064] Worker 29 awakens! [2024-06-27 10:51:48,192][00794] Heartbeat connected on RolloutWorker_w29 [2024-06-27 10:51:48,383][01035] Updated weights for policy 0, policy_version 230 (0.0027) [2024-06-27 10:51:51,661][01035] Updated weights for policy 0, policy_version 240 (0.0036) [2024-06-27 10:51:52,704][00794] Fps is (10 sec: 39321.2, 60 sec: 37137.0, 300 sec: 26323.6). Total num frames: 3948544. Throughput: 0: 37789.8. Samples: 4029640. Policy #0 lag: (min: 0.0, avg: 11.5, max: 20.0) [2024-06-27 10:51:52,705][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 10:51:52,812][01066] Worker 30 awakens! [2024-06-27 10:51:52,827][00794] Heartbeat connected on RolloutWorker_w30 [2024-06-27 10:51:56,339][01035] Updated weights for policy 0, policy_version 250 (0.0041) [2024-06-27 10:51:57,550][01065] Worker 31 awakens! [2024-06-27 10:51:57,564][00794] Heartbeat connected on RolloutWorker_w31 [2024-06-27 10:51:57,704][00794] Fps is (10 sec: 44237.1, 60 sec: 37956.3, 300 sec: 26848.6). Total num frames: 4161536. Throughput: 0: 38488.9. Samples: 4271260. Policy #0 lag: (min: 1.0, avg: 11.6, max: 23.0) [2024-06-27 10:51:57,704][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 10:51:59,757][01035] Updated weights for policy 0, policy_version 260 (0.0034) [2024-06-27 10:52:02,708][00794] Fps is (10 sec: 37668.3, 60 sec: 37953.8, 300 sec: 27032.9). Total num frames: 4325376. Throughput: 0: 39255.2. Samples: 4523220. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-27 10:52:02,709][00794] Avg episode reward: [(0, '0.001')] [2024-06-27 10:52:04,396][01035] Updated weights for policy 0, policy_version 270 (0.0039) [2024-06-27 10:52:07,704][00794] Fps is (10 sec: 40959.6, 60 sec: 39321.6, 300 sec: 27703.8). Total num frames: 4571136. Throughput: 0: 39377.4. Samples: 4642260. Policy #0 lag: (min: 0.0, avg: 11.6, max: 24.0) [2024-06-27 10:52:07,708][00794] Avg episode reward: [(0, '0.001')] [2024-06-27 10:52:07,757][01035] Updated weights for policy 0, policy_version 280 (0.0033) [2024-06-27 10:52:12,323][01035] Updated weights for policy 0, policy_version 290 (0.0028) [2024-06-27 10:52:12,704][00794] Fps is (10 sec: 44254.3, 60 sec: 39594.6, 300 sec: 28045.5). Total num frames: 4767744. Throughput: 0: 40034.2. Samples: 4894180. Policy #0 lag: (min: 0.0, avg: 8.1, max: 22.0) [2024-06-27 10:52:12,705][00794] Avg episode reward: [(0, '0.001')] [2024-06-27 10:52:15,703][01035] Updated weights for policy 0, policy_version 300 (0.0033) [2024-06-27 10:52:17,704][00794] Fps is (10 sec: 39321.8, 60 sec: 39321.6, 300 sec: 28367.7). Total num frames: 4964352. Throughput: 0: 40510.7. Samples: 5142980. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 10:52:17,704][00794] Avg episode reward: [(0, '0.001')] [2024-06-27 10:52:20,256][01035] Updated weights for policy 0, policy_version 310 (0.0041) [2024-06-27 10:52:22,704][00794] Fps is (10 sec: 42599.0, 60 sec: 39867.9, 300 sec: 28854.0). Total num frames: 5193728. Throughput: 0: 40482.4. Samples: 5257860. Policy #0 lag: (min: 0.0, avg: 11.8, max: 21.0) [2024-06-27 10:52:22,704][00794] Avg episode reward: [(0, '0.000')] [2024-06-27 10:52:23,932][01035] Updated weights for policy 0, policy_version 320 (0.0032) [2024-06-27 10:52:27,704][00794] Fps is (10 sec: 39321.8, 60 sec: 40140.9, 300 sec: 28959.8). Total num frames: 5357568. Throughput: 0: 40902.3. Samples: 5511940. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 10:52:27,704][00794] Avg episode reward: [(0, '0.001')] [2024-06-27 10:52:28,452][01035] Updated weights for policy 0, policy_version 330 (0.0045) [2024-06-27 10:52:29,364][01015] Signal inference workers to stop experience collection... (50 times) [2024-06-27 10:52:29,403][01035] InferenceWorker_p0-w0: stopping experience collection (50 times) [2024-06-27 10:52:29,421][01015] Signal inference workers to resume experience collection... (50 times) [2024-06-27 10:52:29,424][01035] InferenceWorker_p0-w0: resuming experience collection (50 times) [2024-06-27 10:52:31,646][01035] Updated weights for policy 0, policy_version 340 (0.0038) [2024-06-27 10:52:32,704][00794] Fps is (10 sec: 39321.3, 60 sec: 40960.0, 300 sec: 29405.0). Total num frames: 5586944. Throughput: 0: 40963.1. Samples: 5753700. Policy #0 lag: (min: 0.0, avg: 12.8, max: 22.0) [2024-06-27 10:52:32,705][00794] Avg episode reward: [(0, '0.001')] [2024-06-27 10:52:36,110][01035] Updated weights for policy 0, policy_version 350 (0.0032) [2024-06-27 10:52:37,704][00794] Fps is (10 sec: 47513.5, 60 sec: 41233.2, 300 sec: 29911.3). Total num frames: 5832704. Throughput: 0: 41248.1. Samples: 5885800. Policy #0 lag: (min: 0.0, avg: 8.2, max: 22.0) [2024-06-27 10:52:37,704][00794] Avg episode reward: [(0, '0.001')] [2024-06-27 10:52:39,476][01035] Updated weights for policy 0, policy_version 360 (0.0043) [2024-06-27 10:52:42,704][00794] Fps is (10 sec: 37683.3, 60 sec: 40140.8, 300 sec: 29818.9). Total num frames: 5963776. Throughput: 0: 41231.1. Samples: 6126660. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 10:52:42,704][00794] Avg episode reward: [(0, '0.001')] [2024-06-27 10:52:44,376][01035] Updated weights for policy 0, policy_version 370 (0.0042) [2024-06-27 10:52:47,518][01035] Updated weights for policy 0, policy_version 380 (0.0046) [2024-06-27 10:52:47,704][00794] Fps is (10 sec: 40959.8, 60 sec: 42052.3, 300 sec: 30450.3). Total num frames: 6242304. Throughput: 0: 41001.4. Samples: 6368120. Policy #0 lag: (min: 0.0, avg: 12.6, max: 24.0) [2024-06-27 10:52:47,704][00794] Avg episode reward: [(0, '0.001')] [2024-06-27 10:52:51,877][01035] Updated weights for policy 0, policy_version 390 (0.0026) [2024-06-27 10:52:52,704][00794] Fps is (10 sec: 45875.5, 60 sec: 41233.2, 300 sec: 30583.5). Total num frames: 6422528. Throughput: 0: 41278.3. Samples: 6499780. Policy #0 lag: (min: 0.0, avg: 7.4, max: 21.0) [2024-06-27 10:52:52,704][00794] Avg episode reward: [(0, '0.000')] [2024-06-27 10:52:55,507][01035] Updated weights for policy 0, policy_version 400 (0.0037) [2024-06-27 10:52:57,704][00794] Fps is (10 sec: 34406.8, 60 sec: 40413.9, 300 sec: 30634.3). Total num frames: 6586368. Throughput: 0: 41128.6. Samples: 6744960. Policy #0 lag: (min: 0.0, avg: 11.4, max: 22.0) [2024-06-27 10:52:57,704][00794] Avg episode reward: [(0, '0.001')] [2024-06-27 10:53:00,195][01035] Updated weights for policy 0, policy_version 410 (0.0028) [2024-06-27 10:53:02,704][00794] Fps is (10 sec: 42597.8, 60 sec: 42055.0, 300 sec: 31129.6). Total num frames: 6848512. Throughput: 0: 40991.9. Samples: 6987620. Policy #0 lag: (min: 0.0, avg: 12.7, max: 21.0) [2024-06-27 10:53:02,713][00794] Avg episode reward: [(0, '0.001')] [2024-06-27 10:53:03,312][01035] Updated weights for policy 0, policy_version 420 (0.0030) [2024-06-27 10:53:07,704][00794] Fps is (10 sec: 42598.0, 60 sec: 40687.0, 300 sec: 31166.0). Total num frames: 7012352. Throughput: 0: 41456.4. Samples: 7123400. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-27 10:53:07,704][00794] Avg episode reward: [(0, '0.001')] [2024-06-27 10:53:07,993][01035] Updated weights for policy 0, policy_version 430 (0.0032) [2024-06-27 10:53:11,248][01035] Updated weights for policy 0, policy_version 440 (0.0026) [2024-06-27 10:53:12,704][00794] Fps is (10 sec: 39321.1, 60 sec: 41233.0, 300 sec: 31485.7). Total num frames: 7241728. Throughput: 0: 41220.7. Samples: 7366880. Policy #0 lag: (min: 0.0, avg: 11.6, max: 21.0) [2024-06-27 10:53:12,712][00794] Avg episode reward: [(0, '0.001')] [2024-06-27 10:53:12,724][01015] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000000442_7241728.pth... [2024-06-27 10:53:15,946][01035] Updated weights for policy 0, policy_version 450 (0.0032) [2024-06-27 10:53:17,704][00794] Fps is (10 sec: 45874.7, 60 sec: 41779.1, 300 sec: 31791.9). Total num frames: 7471104. Throughput: 0: 41191.5. Samples: 7607320. Policy #0 lag: (min: 0.0, avg: 7.9, max: 22.0) [2024-06-27 10:53:17,704][00794] Avg episode reward: [(0, '0.001')] [2024-06-27 10:53:19,274][01035] Updated weights for policy 0, policy_version 460 (0.0031) [2024-06-27 10:53:22,704][00794] Fps is (10 sec: 39322.3, 60 sec: 40686.9, 300 sec: 31812.3). Total num frames: 7634944. Throughput: 0: 41170.6. Samples: 7738480. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 10:53:22,704][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 10:53:22,724][01015] Saving new best policy, reward=0.002! [2024-06-27 10:53:23,830][01035] Updated weights for policy 0, policy_version 470 (0.0036) [2024-06-27 10:53:27,440][01035] Updated weights for policy 0, policy_version 480 (0.0029) [2024-06-27 10:53:27,704][00794] Fps is (10 sec: 39322.1, 60 sec: 41779.2, 300 sec: 32099.3). Total num frames: 7864320. Throughput: 0: 41367.6. Samples: 7988200. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 10:53:27,704][00794] Avg episode reward: [(0, '0.001')] [2024-06-27 10:53:31,679][01035] Updated weights for policy 0, policy_version 490 (0.0039) [2024-06-27 10:53:32,704][00794] Fps is (10 sec: 45875.6, 60 sec: 41779.3, 300 sec: 32374.8). Total num frames: 8093696. Throughput: 0: 41428.5. Samples: 8232400. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2024-06-27 10:53:32,704][00794] Avg episode reward: [(0, '0.001')] [2024-06-27 10:53:35,233][01035] Updated weights for policy 0, policy_version 500 (0.0033) [2024-06-27 10:53:37,704][00794] Fps is (10 sec: 39321.2, 60 sec: 40413.8, 300 sec: 32382.5). Total num frames: 8257536. Throughput: 0: 41266.1. Samples: 8356760. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 10:53:37,704][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 10:53:39,476][01035] Updated weights for policy 0, policy_version 510 (0.0037) [2024-06-27 10:53:42,704][00794] Fps is (10 sec: 40959.7, 60 sec: 42325.3, 300 sec: 32705.0). Total num frames: 8503296. Throughput: 0: 41397.7. Samples: 8607860. Policy #0 lag: (min: 0.0, avg: 12.4, max: 24.0) [2024-06-27 10:53:42,704][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 10:53:43,226][01035] Updated weights for policy 0, policy_version 520 (0.0028) [2024-06-27 10:53:47,481][01035] Updated weights for policy 0, policy_version 530 (0.0036) [2024-06-27 10:53:47,704][00794] Fps is (10 sec: 42598.6, 60 sec: 40687.0, 300 sec: 32768.0). Total num frames: 8683520. Throughput: 0: 41633.4. Samples: 8861120. Policy #0 lag: (min: 0.0, avg: 9.2, max: 23.0) [2024-06-27 10:53:47,704][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 10:53:51,343][01035] Updated weights for policy 0, policy_version 540 (0.0021) [2024-06-27 10:53:52,704][00794] Fps is (10 sec: 39321.5, 60 sec: 41233.0, 300 sec: 32950.0). Total num frames: 8896512. Throughput: 0: 41197.7. Samples: 8977300. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 10:53:52,704][00794] Avg episode reward: [(0, '0.001')] [2024-06-27 10:53:55,047][01035] Updated weights for policy 0, policy_version 550 (0.0037) [2024-06-27 10:53:56,723][01015] Signal inference workers to stop experience collection... (100 times) [2024-06-27 10:53:56,723][01015] Signal inference workers to resume experience collection... (100 times) [2024-06-27 10:53:56,754][01035] InferenceWorker_p0-w0: stopping experience collection (100 times) [2024-06-27 10:53:56,754][01035] InferenceWorker_p0-w0: resuming experience collection (100 times) [2024-06-27 10:53:57,708][00794] Fps is (10 sec: 42581.1, 60 sec: 42049.4, 300 sec: 33125.0). Total num frames: 9109504. Throughput: 0: 41419.1. Samples: 9230900. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-27 10:53:57,708][00794] Avg episode reward: [(0, '0.001')] [2024-06-27 10:53:59,323][01035] Updated weights for policy 0, policy_version 560 (0.0033) [2024-06-27 10:54:02,616][01035] Updated weights for policy 0, policy_version 570 (0.0038) [2024-06-27 10:54:02,704][00794] Fps is (10 sec: 44236.7, 60 sec: 41506.2, 300 sec: 33353.1). Total num frames: 9338880. Throughput: 0: 41772.5. Samples: 9487080. Policy #0 lag: (min: 1.0, avg: 9.9, max: 21.0) [2024-06-27 10:54:02,704][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 10:54:06,875][01035] Updated weights for policy 0, policy_version 580 (0.0043) [2024-06-27 10:54:07,704][00794] Fps is (10 sec: 40976.3, 60 sec: 41779.1, 300 sec: 33400.4). Total num frames: 9519104. Throughput: 0: 41621.7. Samples: 9611460. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2024-06-27 10:54:07,705][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 10:54:10,530][01035] Updated weights for policy 0, policy_version 590 (0.0031) [2024-06-27 10:54:12,704][00794] Fps is (10 sec: 40960.2, 60 sec: 41779.3, 300 sec: 33615.4). Total num frames: 9748480. Throughput: 0: 41625.3. Samples: 9861340. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-27 10:54:12,704][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 10:54:14,443][01035] Updated weights for policy 0, policy_version 600 (0.0037) [2024-06-27 10:54:17,704][00794] Fps is (10 sec: 44236.5, 60 sec: 41506.1, 300 sec: 33767.7). Total num frames: 9961472. Throughput: 0: 41774.9. Samples: 10112280. Policy #0 lag: (min: 1.0, avg: 9.9, max: 23.0) [2024-06-27 10:54:17,705][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 10:54:18,597][01035] Updated weights for policy 0, policy_version 610 (0.0036) [2024-06-27 10:54:22,060][01035] Updated weights for policy 0, policy_version 620 (0.0038) [2024-06-27 10:54:22,704][00794] Fps is (10 sec: 40959.6, 60 sec: 42052.2, 300 sec: 34434.6). Total num frames: 10158080. Throughput: 0: 41783.9. Samples: 10237040. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 10:54:22,705][00794] Avg episode reward: [(0, '0.001')] [2024-06-27 10:54:26,069][01035] Updated weights for policy 0, policy_version 630 (0.0037) [2024-06-27 10:54:27,704][00794] Fps is (10 sec: 40961.1, 60 sec: 41779.3, 300 sec: 35100.6). Total num frames: 10371072. Throughput: 0: 41884.1. Samples: 10492640. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-27 10:54:27,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 10:54:29,916][01035] Updated weights for policy 0, policy_version 640 (0.0044) [2024-06-27 10:54:32,704][00794] Fps is (10 sec: 44237.1, 60 sec: 41779.1, 300 sec: 35378.3). Total num frames: 10600448. Throughput: 0: 41797.3. Samples: 10742000. Policy #0 lag: (min: 1.0, avg: 9.9, max: 21.0) [2024-06-27 10:54:32,704][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 10:54:34,189][01035] Updated weights for policy 0, policy_version 650 (0.0032) [2024-06-27 10:54:37,542][01035] Updated weights for policy 0, policy_version 660 (0.0030) [2024-06-27 10:54:37,704][00794] Fps is (10 sec: 44235.9, 60 sec: 42598.4, 300 sec: 36044.8). Total num frames: 10813440. Throughput: 0: 42168.0. Samples: 10874860. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 10:54:37,704][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 10:54:41,680][01035] Updated weights for policy 0, policy_version 670 (0.0024) [2024-06-27 10:54:42,704][00794] Fps is (10 sec: 40959.7, 60 sec: 41779.1, 300 sec: 36655.7). Total num frames: 11010048. Throughput: 0: 42200.6. Samples: 11129760. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-27 10:54:42,707][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 10:54:45,247][01035] Updated weights for policy 0, policy_version 680 (0.0034) [2024-06-27 10:54:47,704][00794] Fps is (10 sec: 40960.5, 60 sec: 42325.4, 300 sec: 37266.7). Total num frames: 11223040. Throughput: 0: 42033.4. Samples: 11378580. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-27 10:54:47,704][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 10:54:47,705][01015] Saving new best policy, reward=0.004! [2024-06-27 10:54:49,786][01035] Updated weights for policy 0, policy_version 690 (0.0032) [2024-06-27 10:54:52,708][00794] Fps is (10 sec: 42581.7, 60 sec: 42322.5, 300 sec: 37821.5). Total num frames: 11436032. Throughput: 0: 42035.8. Samples: 11503240. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-27 10:54:52,708][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 10:54:53,158][01035] Updated weights for policy 0, policy_version 700 (0.0048) [2024-06-27 10:54:57,439][01035] Updated weights for policy 0, policy_version 710 (0.0036) [2024-06-27 10:54:57,704][00794] Fps is (10 sec: 40960.0, 60 sec: 42055.1, 300 sec: 38210.8). Total num frames: 11632640. Throughput: 0: 42127.2. Samples: 11757060. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-27 10:54:57,704][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 10:55:01,443][01035] Updated weights for policy 0, policy_version 720 (0.0035) [2024-06-27 10:55:02,704][00794] Fps is (10 sec: 40976.3, 60 sec: 41779.2, 300 sec: 38710.7). Total num frames: 11845632. Throughput: 0: 41884.5. Samples: 11997080. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2024-06-27 10:55:02,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 10:55:05,670][01035] Updated weights for policy 0, policy_version 730 (0.0038) [2024-06-27 10:55:07,704][00794] Fps is (10 sec: 42598.2, 60 sec: 42325.4, 300 sec: 39043.9). Total num frames: 12058624. Throughput: 0: 41965.4. Samples: 12125480. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 10:55:07,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 10:55:09,371][01035] Updated weights for policy 0, policy_version 740 (0.0041) [2024-06-27 10:55:12,704][00794] Fps is (10 sec: 39321.7, 60 sec: 41506.1, 300 sec: 39210.5). Total num frames: 12238848. Throughput: 0: 41987.8. Samples: 12382100. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-27 10:55:12,708][00794] Avg episode reward: [(0, '0.001')] [2024-06-27 10:55:12,724][01015] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000000747_12238848.pth... [2024-06-27 10:55:12,785][01015] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000000146_2392064.pth [2024-06-27 10:55:13,563][01035] Updated weights for policy 0, policy_version 750 (0.0043) [2024-06-27 10:55:17,113][01035] Updated weights for policy 0, policy_version 760 (0.0032) [2024-06-27 10:55:17,704][00794] Fps is (10 sec: 39321.4, 60 sec: 41506.2, 300 sec: 39543.7). Total num frames: 12451840. Throughput: 0: 42010.2. Samples: 12632460. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 10:55:17,705][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 10:55:21,318][01035] Updated weights for policy 0, policy_version 770 (0.0032) [2024-06-27 10:55:22,704][00794] Fps is (10 sec: 45875.5, 60 sec: 42325.4, 300 sec: 39877.0). Total num frames: 12697600. Throughput: 0: 41930.8. Samples: 12761740. Policy #0 lag: (min: 0.0, avg: 11.4, max: 24.0) [2024-06-27 10:55:22,704][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 10:55:24,604][01035] Updated weights for policy 0, policy_version 780 (0.0025) [2024-06-27 10:55:27,708][00794] Fps is (10 sec: 42581.4, 60 sec: 41776.3, 300 sec: 39987.5). Total num frames: 12877824. Throughput: 0: 41811.9. Samples: 13011460. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 10:55:27,708][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 10:55:28,686][01015] Signal inference workers to stop experience collection... (150 times) [2024-06-27 10:55:28,739][01015] Signal inference workers to resume experience collection... (150 times) [2024-06-27 10:55:28,741][01035] InferenceWorker_p0-w0: stopping experience collection (150 times) [2024-06-27 10:55:28,758][01035] InferenceWorker_p0-w0: resuming experience collection (150 times) [2024-06-27 10:55:28,882][01035] Updated weights for policy 0, policy_version 790 (0.0043) [2024-06-27 10:55:32,470][01035] Updated weights for policy 0, policy_version 800 (0.0031) [2024-06-27 10:55:32,708][00794] Fps is (10 sec: 40943.2, 60 sec: 41776.4, 300 sec: 40209.7). Total num frames: 13107200. Throughput: 0: 41946.8. Samples: 13266360. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 10:55:32,708][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 10:55:36,480][01035] Updated weights for policy 0, policy_version 810 (0.0034) [2024-06-27 10:55:37,704][00794] Fps is (10 sec: 44254.5, 60 sec: 41779.2, 300 sec: 40376.8). Total num frames: 13320192. Throughput: 0: 42147.3. Samples: 13399700. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 10:55:37,704][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 10:55:40,096][01035] Updated weights for policy 0, policy_version 820 (0.0030) [2024-06-27 10:55:42,704][00794] Fps is (10 sec: 40976.8, 60 sec: 41779.3, 300 sec: 40543.5). Total num frames: 13516800. Throughput: 0: 42111.1. Samples: 13652060. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 10:55:42,704][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 10:55:43,997][01035] Updated weights for policy 0, policy_version 830 (0.0038) [2024-06-27 10:55:47,704][00794] Fps is (10 sec: 40960.3, 60 sec: 41779.2, 300 sec: 40710.1). Total num frames: 13729792. Throughput: 0: 42420.5. Samples: 13906000. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 10:55:47,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 10:55:47,970][01035] Updated weights for policy 0, policy_version 840 (0.0040) [2024-06-27 10:55:52,072][01035] Updated weights for policy 0, policy_version 850 (0.0033) [2024-06-27 10:55:52,704][00794] Fps is (10 sec: 42597.9, 60 sec: 41782.0, 300 sec: 40876.7). Total num frames: 13942784. Throughput: 0: 42355.5. Samples: 14031480. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 10:55:52,704][00794] Avg episode reward: [(0, '0.001')] [2024-06-27 10:55:55,763][01035] Updated weights for policy 0, policy_version 860 (0.0044) [2024-06-27 10:55:57,704][00794] Fps is (10 sec: 44236.9, 60 sec: 42325.3, 300 sec: 41098.9). Total num frames: 14172160. Throughput: 0: 42425.4. Samples: 14291240. Policy #0 lag: (min: 1.0, avg: 10.8, max: 22.0) [2024-06-27 10:55:57,704][00794] Avg episode reward: [(0, '0.001')] [2024-06-27 10:55:59,567][01035] Updated weights for policy 0, policy_version 870 (0.0042) [2024-06-27 10:56:02,704][00794] Fps is (10 sec: 44237.2, 60 sec: 42325.4, 300 sec: 41265.5). Total num frames: 14385152. Throughput: 0: 42507.2. Samples: 14545280. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-27 10:56:02,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 10:56:03,347][01035] Updated weights for policy 0, policy_version 880 (0.0028) [2024-06-27 10:56:07,088][01035] Updated weights for policy 0, policy_version 890 (0.0039) [2024-06-27 10:56:07,704][00794] Fps is (10 sec: 42598.2, 60 sec: 42325.3, 300 sec: 41376.5). Total num frames: 14598144. Throughput: 0: 42627.1. Samples: 14679960. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-27 10:56:07,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 10:56:11,377][01035] Updated weights for policy 0, policy_version 900 (0.0050) [2024-06-27 10:56:12,704][00794] Fps is (10 sec: 40959.3, 60 sec: 42598.3, 300 sec: 41321.0). Total num frames: 14794752. Throughput: 0: 42697.9. Samples: 14932700. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 10:56:12,705][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 10:56:14,954][01035] Updated weights for policy 0, policy_version 910 (0.0036) [2024-06-27 10:56:17,704][00794] Fps is (10 sec: 44236.6, 60 sec: 43144.6, 300 sec: 41487.6). Total num frames: 15040512. Throughput: 0: 42564.2. Samples: 15181580. Policy #0 lag: (min: 1.0, avg: 11.7, max: 20.0) [2024-06-27 10:56:17,704][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 10:56:18,872][01035] Updated weights for policy 0, policy_version 920 (0.0035) [2024-06-27 10:56:22,606][01035] Updated weights for policy 0, policy_version 930 (0.0036) [2024-06-27 10:56:22,704][00794] Fps is (10 sec: 44237.4, 60 sec: 42325.3, 300 sec: 41654.2). Total num frames: 15237120. Throughput: 0: 42678.2. Samples: 15320220. Policy #0 lag: (min: 1.0, avg: 10.4, max: 23.0) [2024-06-27 10:56:22,704][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 10:56:26,451][01035] Updated weights for policy 0, policy_version 940 (0.0030) [2024-06-27 10:56:27,704][00794] Fps is (10 sec: 39322.3, 60 sec: 42601.4, 300 sec: 41709.8). Total num frames: 15433728. Throughput: 0: 42614.8. Samples: 15569720. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-27 10:56:27,704][00794] Avg episode reward: [(0, '0.001')] [2024-06-27 10:56:30,350][01035] Updated weights for policy 0, policy_version 950 (0.0026) [2024-06-27 10:56:32,704][00794] Fps is (10 sec: 45875.3, 60 sec: 43147.4, 300 sec: 41820.9). Total num frames: 15695872. Throughput: 0: 42355.5. Samples: 15812000. Policy #0 lag: (min: 0.0, avg: 13.5, max: 25.0) [2024-06-27 10:56:32,704][00794] Avg episode reward: [(0, '0.001')] [2024-06-27 10:56:34,002][01035] Updated weights for policy 0, policy_version 960 (0.0037) [2024-06-27 10:56:37,704][00794] Fps is (10 sec: 42597.4, 60 sec: 42325.3, 300 sec: 41709.8). Total num frames: 15859712. Throughput: 0: 42682.6. Samples: 15952200. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2024-06-27 10:56:37,705][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 10:56:38,202][01035] Updated weights for policy 0, policy_version 970 (0.0030) [2024-06-27 10:56:42,101][01035] Updated weights for policy 0, policy_version 980 (0.0041) [2024-06-27 10:56:42,704][00794] Fps is (10 sec: 37683.6, 60 sec: 42598.4, 300 sec: 41876.4). Total num frames: 16072704. Throughput: 0: 42535.2. Samples: 16205320. Policy #0 lag: (min: 0.0, avg: 12.2, max: 21.0) [2024-06-27 10:56:42,704][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 10:56:46,151][01035] Updated weights for policy 0, policy_version 990 (0.0035) [2024-06-27 10:56:47,704][00794] Fps is (10 sec: 47514.6, 60 sec: 43417.7, 300 sec: 41987.5). Total num frames: 16334848. Throughput: 0: 42282.8. Samples: 16448000. Policy #0 lag: (min: 0.0, avg: 11.4, max: 22.0) [2024-06-27 10:56:47,704][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 10:56:49,606][01035] Updated weights for policy 0, policy_version 1000 (0.0024) [2024-06-27 10:56:51,707][01015] Signal inference workers to stop experience collection... (200 times) [2024-06-27 10:56:51,757][01035] InferenceWorker_p0-w0: stopping experience collection (200 times) [2024-06-27 10:56:51,765][01015] Signal inference workers to resume experience collection... (200 times) [2024-06-27 10:56:51,774][01035] InferenceWorker_p0-w0: resuming experience collection (200 times) [2024-06-27 10:56:52,704][00794] Fps is (10 sec: 40959.4, 60 sec: 42325.3, 300 sec: 41765.3). Total num frames: 16482304. Throughput: 0: 42433.7. Samples: 16589480. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-27 10:56:52,704][00794] Avg episode reward: [(0, '0.001')] [2024-06-27 10:56:53,884][01035] Updated weights for policy 0, policy_version 1010 (0.0051) [2024-06-27 10:56:57,687][01035] Updated weights for policy 0, policy_version 1020 (0.0042) [2024-06-27 10:56:57,704][00794] Fps is (10 sec: 37682.5, 60 sec: 42325.3, 300 sec: 41988.0). Total num frames: 16711680. Throughput: 0: 42221.4. Samples: 16832660. Policy #0 lag: (min: 0.0, avg: 13.2, max: 22.0) [2024-06-27 10:56:57,704][00794] Avg episode reward: [(0, '0.001')] [2024-06-27 10:57:01,765][01035] Updated weights for policy 0, policy_version 1030 (0.0034) [2024-06-27 10:57:02,704][00794] Fps is (10 sec: 45875.9, 60 sec: 42598.5, 300 sec: 41932.0). Total num frames: 16941056. Throughput: 0: 42219.2. Samples: 17081440. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 10:57:02,704][00794] Avg episode reward: [(0, '0.001')] [2024-06-27 10:57:05,245][01035] Updated weights for policy 0, policy_version 1040 (0.0039) [2024-06-27 10:57:07,704][00794] Fps is (10 sec: 40960.3, 60 sec: 42052.3, 300 sec: 41876.4). Total num frames: 17121280. Throughput: 0: 42048.5. Samples: 17212400. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 10:57:07,704][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 10:57:09,387][01035] Updated weights for policy 0, policy_version 1050 (0.0051) [2024-06-27 10:57:12,704][00794] Fps is (10 sec: 39321.5, 60 sec: 42325.5, 300 sec: 41931.9). Total num frames: 17334272. Throughput: 0: 42070.6. Samples: 17462900. Policy #0 lag: (min: 0.0, avg: 12.4, max: 22.0) [2024-06-27 10:57:12,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 10:57:12,812][01015] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000001059_17350656.pth... [2024-06-27 10:57:12,863][01015] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000000442_7241728.pth [2024-06-27 10:57:13,187][01035] Updated weights for policy 0, policy_version 1060 (0.0052) [2024-06-27 10:57:16,886][01035] Updated weights for policy 0, policy_version 1070 (0.0037) [2024-06-27 10:57:17,704][00794] Fps is (10 sec: 42598.7, 60 sec: 41779.3, 300 sec: 41876.4). Total num frames: 17547264. Throughput: 0: 42477.9. Samples: 17723500. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-27 10:57:17,704][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 10:57:20,658][01035] Updated weights for policy 0, policy_version 1080 (0.0029) [2024-06-27 10:57:22,704][00794] Fps is (10 sec: 42597.8, 60 sec: 42052.2, 300 sec: 42043.0). Total num frames: 17760256. Throughput: 0: 42260.0. Samples: 17853900. Policy #0 lag: (min: 0.0, avg: 11.8, max: 23.0) [2024-06-27 10:57:22,704][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 10:57:24,547][01035] Updated weights for policy 0, policy_version 1090 (0.0039) [2024-06-27 10:57:27,704][00794] Fps is (10 sec: 44236.5, 60 sec: 42598.3, 300 sec: 42043.0). Total num frames: 17989632. Throughput: 0: 42199.0. Samples: 18104280. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 10:57:27,704][00794] Avg episode reward: [(0, '0.001')] [2024-06-27 10:57:28,101][01035] Updated weights for policy 0, policy_version 1100 (0.0023) [2024-06-27 10:57:32,315][01035] Updated weights for policy 0, policy_version 1110 (0.0046) [2024-06-27 10:57:32,705][00794] Fps is (10 sec: 42595.4, 60 sec: 41505.6, 300 sec: 41876.3). Total num frames: 18186240. Throughput: 0: 42562.3. Samples: 18363340. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2024-06-27 10:57:32,705][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 10:57:36,048][01035] Updated weights for policy 0, policy_version 1120 (0.0035) [2024-06-27 10:57:37,704][00794] Fps is (10 sec: 40959.9, 60 sec: 42325.4, 300 sec: 42154.1). Total num frames: 18399232. Throughput: 0: 42154.3. Samples: 18486420. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-27 10:57:37,704][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 10:57:40,101][01035] Updated weights for policy 0, policy_version 1130 (0.0031) [2024-06-27 10:57:42,704][00794] Fps is (10 sec: 45878.4, 60 sec: 42871.4, 300 sec: 42043.0). Total num frames: 18644992. Throughput: 0: 42421.3. Samples: 18741620. Policy #0 lag: (min: 1.0, avg: 10.2, max: 21.0) [2024-06-27 10:57:42,704][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 10:57:43,629][01035] Updated weights for policy 0, policy_version 1140 (0.0025) [2024-06-27 10:57:47,704][00794] Fps is (10 sec: 42598.1, 60 sec: 41506.0, 300 sec: 42043.0). Total num frames: 18825216. Throughput: 0: 42724.7. Samples: 19004060. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-27 10:57:47,704][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 10:57:47,898][01035] Updated weights for policy 0, policy_version 1150 (0.0030) [2024-06-27 10:57:51,586][01035] Updated weights for policy 0, policy_version 1160 (0.0039) [2024-06-27 10:57:52,704][00794] Fps is (10 sec: 39322.3, 60 sec: 42598.5, 300 sec: 42209.6). Total num frames: 19038208. Throughput: 0: 42513.0. Samples: 19125480. Policy #0 lag: (min: 0.0, avg: 12.4, max: 25.0) [2024-06-27 10:57:52,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 10:57:55,537][01035] Updated weights for policy 0, policy_version 1170 (0.0045) [2024-06-27 10:57:57,704][00794] Fps is (10 sec: 45875.2, 60 sec: 42871.5, 300 sec: 42154.1). Total num frames: 19283968. Throughput: 0: 42586.1. Samples: 19379280. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 10:57:57,705][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 10:57:59,287][01035] Updated weights for policy 0, policy_version 1180 (0.0030) [2024-06-27 10:58:02,704][00794] Fps is (10 sec: 40959.2, 60 sec: 41779.1, 300 sec: 42154.1). Total num frames: 19447808. Throughput: 0: 42415.8. Samples: 19632220. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 10:58:02,705][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 10:58:03,279][01035] Updated weights for policy 0, policy_version 1190 (0.0035) [2024-06-27 10:58:07,295][01035] Updated weights for policy 0, policy_version 1200 (0.0043) [2024-06-27 10:58:07,704][00794] Fps is (10 sec: 39321.7, 60 sec: 42598.4, 300 sec: 42154.1). Total num frames: 19677184. Throughput: 0: 42341.8. Samples: 19759280. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-27 10:58:07,704][00794] Avg episode reward: [(0, '0.001')] [2024-06-27 10:58:11,030][01035] Updated weights for policy 0, policy_version 1210 (0.0033) [2024-06-27 10:58:12,704][00794] Fps is (10 sec: 42599.1, 60 sec: 42325.3, 300 sec: 42043.0). Total num frames: 19873792. Throughput: 0: 42420.5. Samples: 20013200. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-27 10:58:12,704][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 10:58:14,859][01035] Updated weights for policy 0, policy_version 1220 (0.0051) [2024-06-27 10:58:16,061][01015] Signal inference workers to stop experience collection... (250 times) [2024-06-27 10:58:16,061][01015] Signal inference workers to resume experience collection... (250 times) [2024-06-27 10:58:16,077][01035] InferenceWorker_p0-w0: stopping experience collection (250 times) [2024-06-27 10:58:16,077][01035] InferenceWorker_p0-w0: resuming experience collection (250 times) [2024-06-27 10:58:17,708][00794] Fps is (10 sec: 40943.4, 60 sec: 42322.4, 300 sec: 42209.0). Total num frames: 20086784. Throughput: 0: 42316.0. Samples: 20267700. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-27 10:58:17,709][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 10:58:18,795][01035] Updated weights for policy 0, policy_version 1230 (0.0034) [2024-06-27 10:58:22,337][01035] Updated weights for policy 0, policy_version 1240 (0.0041) [2024-06-27 10:58:22,704][00794] Fps is (10 sec: 44236.1, 60 sec: 42598.4, 300 sec: 42209.6). Total num frames: 20316160. Throughput: 0: 42408.4. Samples: 20394800. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 10:58:22,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 10:58:26,537][01035] Updated weights for policy 0, policy_version 1250 (0.0035) [2024-06-27 10:58:27,704][00794] Fps is (10 sec: 44255.2, 60 sec: 42325.4, 300 sec: 42154.1). Total num frames: 20529152. Throughput: 0: 42425.9. Samples: 20650780. Policy #0 lag: (min: 1.0, avg: 10.1, max: 22.0) [2024-06-27 10:58:27,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 10:58:30,297][01035] Updated weights for policy 0, policy_version 1260 (0.0039) [2024-06-27 10:58:32,704][00794] Fps is (10 sec: 42598.6, 60 sec: 42598.9, 300 sec: 42320.7). Total num frames: 20742144. Throughput: 0: 42124.1. Samples: 20899640. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-27 10:58:32,708][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 10:58:34,216][01035] Updated weights for policy 0, policy_version 1270 (0.0039) [2024-06-27 10:58:37,704][00794] Fps is (10 sec: 42598.0, 60 sec: 42598.4, 300 sec: 42209.6). Total num frames: 20955136. Throughput: 0: 42335.0. Samples: 21030560. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 10:58:37,704][00794] Avg episode reward: [(0, '0.001')] [2024-06-27 10:58:37,969][01035] Updated weights for policy 0, policy_version 1280 (0.0039) [2024-06-27 10:58:41,894][01035] Updated weights for policy 0, policy_version 1290 (0.0034) [2024-06-27 10:58:42,704][00794] Fps is (10 sec: 44236.8, 60 sec: 42325.4, 300 sec: 42376.2). Total num frames: 21184512. Throughput: 0: 42413.4. Samples: 21287880. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 10:58:42,704][00794] Avg episode reward: [(0, '0.001')] [2024-06-27 10:58:46,054][01035] Updated weights for policy 0, policy_version 1300 (0.0024) [2024-06-27 10:58:47,704][00794] Fps is (10 sec: 42597.9, 60 sec: 42598.4, 300 sec: 42320.7). Total num frames: 21381120. Throughput: 0: 42239.9. Samples: 21533020. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-27 10:58:47,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 10:58:49,712][01035] Updated weights for policy 0, policy_version 1310 (0.0029) [2024-06-27 10:58:52,704][00794] Fps is (10 sec: 40960.1, 60 sec: 42598.3, 300 sec: 42321.3). Total num frames: 21594112. Throughput: 0: 42241.4. Samples: 21660140. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-27 10:58:52,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 10:58:53,438][01035] Updated weights for policy 0, policy_version 1320 (0.0038) [2024-06-27 10:58:57,455][01035] Updated weights for policy 0, policy_version 1330 (0.0045) [2024-06-27 10:58:57,704][00794] Fps is (10 sec: 42599.4, 60 sec: 42052.4, 300 sec: 42265.2). Total num frames: 21807104. Throughput: 0: 42440.9. Samples: 21923040. Policy #0 lag: (min: 1.0, avg: 10.9, max: 23.0) [2024-06-27 10:58:57,704][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 10:59:01,105][01035] Updated weights for policy 0, policy_version 1340 (0.0027) [2024-06-27 10:59:02,704][00794] Fps is (10 sec: 42598.7, 60 sec: 42871.6, 300 sec: 42376.3). Total num frames: 22020096. Throughput: 0: 42343.9. Samples: 22173000. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-27 10:59:02,704][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 10:59:05,047][01035] Updated weights for policy 0, policy_version 1350 (0.0030) [2024-06-27 10:59:07,704][00794] Fps is (10 sec: 42598.3, 60 sec: 42598.5, 300 sec: 42320.7). Total num frames: 22233088. Throughput: 0: 42368.6. Samples: 22301380. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-27 10:59:07,704][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 10:59:08,748][01035] Updated weights for policy 0, policy_version 1360 (0.0035) [2024-06-27 10:59:12,589][01035] Updated weights for policy 0, policy_version 1370 (0.0044) [2024-06-27 10:59:12,704][00794] Fps is (10 sec: 42598.5, 60 sec: 42871.5, 300 sec: 42320.7). Total num frames: 22446080. Throughput: 0: 42537.8. Samples: 22564980. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 10:59:12,704][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 10:59:12,727][01015] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000001370_22446080.pth... [2024-06-27 10:59:12,775][01015] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000000747_12238848.pth [2024-06-27 10:59:16,535][01035] Updated weights for policy 0, policy_version 1380 (0.0035) [2024-06-27 10:59:17,704][00794] Fps is (10 sec: 42598.2, 60 sec: 42874.4, 300 sec: 42376.3). Total num frames: 22659072. Throughput: 0: 42513.4. Samples: 22812740. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 10:59:17,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 10:59:20,684][01035] Updated weights for policy 0, policy_version 1390 (0.0046) [2024-06-27 10:59:22,707][00794] Fps is (10 sec: 40944.7, 60 sec: 42322.8, 300 sec: 42320.2). Total num frames: 22855680. Throughput: 0: 42496.6. Samples: 22943060. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 10:59:22,708][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 10:59:24,359][01035] Updated weights for policy 0, policy_version 1400 (0.0035) [2024-06-27 10:59:27,704][00794] Fps is (10 sec: 39321.8, 60 sec: 42052.3, 300 sec: 42209.6). Total num frames: 23052288. Throughput: 0: 42311.6. Samples: 23191900. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 10:59:27,704][00794] Avg episode reward: [(0, '0.001')] [2024-06-27 10:59:28,362][01035] Updated weights for policy 0, policy_version 1410 (0.0038) [2024-06-27 10:59:32,377][01035] Updated weights for policy 0, policy_version 1420 (0.0034) [2024-06-27 10:59:32,704][00794] Fps is (10 sec: 42613.9, 60 sec: 42325.4, 300 sec: 42265.2). Total num frames: 23281664. Throughput: 0: 42516.2. Samples: 23446240. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-27 10:59:32,704][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 10:59:36,102][01035] Updated weights for policy 0, policy_version 1430 (0.0035) [2024-06-27 10:59:37,704][00794] Fps is (10 sec: 42598.2, 60 sec: 42052.3, 300 sec: 42265.2). Total num frames: 23478272. Throughput: 0: 42451.1. Samples: 23570440. Policy #0 lag: (min: 1.0, avg: 9.5, max: 19.0) [2024-06-27 10:59:37,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 10:59:39,969][01035] Updated weights for policy 0, policy_version 1440 (0.0027) [2024-06-27 10:59:42,704][00794] Fps is (10 sec: 40959.7, 60 sec: 41779.2, 300 sec: 42265.2). Total num frames: 23691264. Throughput: 0: 42362.1. Samples: 23829340. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-27 10:59:42,705][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 10:59:43,710][01035] Updated weights for policy 0, policy_version 1450 (0.0033) [2024-06-27 10:59:47,461][01035] Updated weights for policy 0, policy_version 1460 (0.0033) [2024-06-27 10:59:47,704][00794] Fps is (10 sec: 45875.1, 60 sec: 42598.5, 300 sec: 42376.8). Total num frames: 23937024. Throughput: 0: 42477.7. Samples: 24084500. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 10:59:47,704][00794] Avg episode reward: [(0, '0.001')] [2024-06-27 10:59:51,516][01035] Updated weights for policy 0, policy_version 1470 (0.0035) [2024-06-27 10:59:52,704][00794] Fps is (10 sec: 44236.7, 60 sec: 42325.3, 300 sec: 42376.2). Total num frames: 24133632. Throughput: 0: 42484.8. Samples: 24213200. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-27 10:59:52,704][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 10:59:54,927][01035] Updated weights for policy 0, policy_version 1480 (0.0042) [2024-06-27 10:59:57,704][00794] Fps is (10 sec: 39321.5, 60 sec: 42052.2, 300 sec: 42320.7). Total num frames: 24330240. Throughput: 0: 42368.8. Samples: 24471580. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-27 10:59:57,720][00794] Avg episode reward: [(0, '0.001')] [2024-06-27 10:59:59,144][01015] Signal inference workers to stop experience collection... (300 times) [2024-06-27 10:59:59,192][01035] InferenceWorker_p0-w0: stopping experience collection (300 times) [2024-06-27 10:59:59,213][01015] Signal inference workers to resume experience collection... (300 times) [2024-06-27 10:59:59,213][01035] InferenceWorker_p0-w0: resuming experience collection (300 times) [2024-06-27 10:59:59,347][01035] Updated weights for policy 0, policy_version 1490 (0.0034) [2024-06-27 11:00:02,373][01035] Updated weights for policy 0, policy_version 1500 (0.0028) [2024-06-27 11:00:02,704][00794] Fps is (10 sec: 44237.0, 60 sec: 42598.3, 300 sec: 42431.8). Total num frames: 24576000. Throughput: 0: 42416.4. Samples: 24721480. Policy #0 lag: (min: 0.0, avg: 11.7, max: 26.0) [2024-06-27 11:00:02,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:00:06,950][01035] Updated weights for policy 0, policy_version 1510 (0.0036) [2024-06-27 11:00:07,704][00794] Fps is (10 sec: 44236.5, 60 sec: 42325.2, 300 sec: 42487.3). Total num frames: 24772608. Throughput: 0: 42553.6. Samples: 24857820. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-27 11:00:07,705][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:00:10,107][01035] Updated weights for policy 0, policy_version 1520 (0.0032) [2024-06-27 11:00:12,704][00794] Fps is (10 sec: 39321.3, 60 sec: 42052.1, 300 sec: 42431.8). Total num frames: 24969216. Throughput: 0: 42497.2. Samples: 25104280. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-27 11:00:12,705][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:00:14,644][01035] Updated weights for policy 0, policy_version 1530 (0.0044) [2024-06-27 11:00:17,704][00794] Fps is (10 sec: 44237.1, 60 sec: 42598.4, 300 sec: 42431.8). Total num frames: 25214976. Throughput: 0: 42464.9. Samples: 25357160. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-27 11:00:17,704][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 11:00:17,787][01035] Updated weights for policy 0, policy_version 1540 (0.0028) [2024-06-27 11:00:22,246][01035] Updated weights for policy 0, policy_version 1550 (0.0030) [2024-06-27 11:00:22,704][00794] Fps is (10 sec: 42598.6, 60 sec: 42327.8, 300 sec: 42432.4). Total num frames: 25395200. Throughput: 0: 42665.3. Samples: 25490380. Policy #0 lag: (min: 1.0, avg: 10.0, max: 22.0) [2024-06-27 11:00:22,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:00:25,699][01035] Updated weights for policy 0, policy_version 1560 (0.0041) [2024-06-27 11:00:27,704][00794] Fps is (10 sec: 37683.1, 60 sec: 42325.3, 300 sec: 42321.3). Total num frames: 25591808. Throughput: 0: 42384.4. Samples: 25736640. Policy #0 lag: (min: 0.0, avg: 11.7, max: 22.0) [2024-06-27 11:00:27,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:00:30,054][01035] Updated weights for policy 0, policy_version 1570 (0.0032) [2024-06-27 11:00:32,704][00794] Fps is (10 sec: 44237.3, 60 sec: 42598.4, 300 sec: 42431.8). Total num frames: 25837568. Throughput: 0: 42462.7. Samples: 25995320. Policy #0 lag: (min: 2.0, avg: 10.9, max: 21.0) [2024-06-27 11:00:32,704][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 11:00:33,462][01035] Updated weights for policy 0, policy_version 1580 (0.0025) [2024-06-27 11:00:37,704][00794] Fps is (10 sec: 42598.8, 60 sec: 42325.4, 300 sec: 42376.2). Total num frames: 26017792. Throughput: 0: 42575.2. Samples: 26129080. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-27 11:00:37,704][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:00:38,056][01035] Updated weights for policy 0, policy_version 1590 (0.0037) [2024-06-27 11:00:41,127][01035] Updated weights for policy 0, policy_version 1600 (0.0033) [2024-06-27 11:00:42,704][00794] Fps is (10 sec: 40959.5, 60 sec: 42598.4, 300 sec: 42431.8). Total num frames: 26247168. Throughput: 0: 42383.5. Samples: 26378840. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-27 11:00:42,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:00:45,705][01035] Updated weights for policy 0, policy_version 1610 (0.0037) [2024-06-27 11:00:47,704][00794] Fps is (10 sec: 45875.0, 60 sec: 42325.3, 300 sec: 42487.3). Total num frames: 26476544. Throughput: 0: 42539.2. Samples: 26635740. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2024-06-27 11:00:47,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:00:48,996][01035] Updated weights for policy 0, policy_version 1620 (0.0041) [2024-06-27 11:00:52,704][00794] Fps is (10 sec: 42598.4, 60 sec: 42325.3, 300 sec: 42376.2). Total num frames: 26673152. Throughput: 0: 42445.8. Samples: 26767880. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 11:00:52,705][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:00:53,552][01035] Updated weights for policy 0, policy_version 1630 (0.0034) [2024-06-27 11:00:56,499][01035] Updated weights for policy 0, policy_version 1640 (0.0037) [2024-06-27 11:00:57,704][00794] Fps is (10 sec: 40959.8, 60 sec: 42598.4, 300 sec: 42376.2). Total num frames: 26886144. Throughput: 0: 42594.3. Samples: 27021020. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2024-06-27 11:00:57,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:01:00,976][01035] Updated weights for policy 0, policy_version 1650 (0.0033) [2024-06-27 11:01:02,704][00794] Fps is (10 sec: 44237.4, 60 sec: 42325.4, 300 sec: 42431.8). Total num frames: 27115520. Throughput: 0: 42701.0. Samples: 27278700. Policy #0 lag: (min: 1.0, avg: 8.5, max: 21.0) [2024-06-27 11:01:02,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:01:03,972][01035] Updated weights for policy 0, policy_version 1660 (0.0038) [2024-06-27 11:01:07,704][00794] Fps is (10 sec: 44236.8, 60 sec: 42598.4, 300 sec: 42487.3). Total num frames: 27328512. Throughput: 0: 42853.8. Samples: 27418800. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 11:01:07,704][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 11:01:08,491][01035] Updated weights for policy 0, policy_version 1670 (0.0041) [2024-06-27 11:01:11,760][01035] Updated weights for policy 0, policy_version 1680 (0.0052) [2024-06-27 11:01:12,704][00794] Fps is (10 sec: 42597.6, 60 sec: 42871.5, 300 sec: 42376.2). Total num frames: 27541504. Throughput: 0: 42920.4. Samples: 27668060. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-27 11:01:12,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:01:12,733][01015] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000001681_27541504.pth... [2024-06-27 11:01:12,775][01015] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000001059_17350656.pth [2024-06-27 11:01:16,151][01035] Updated weights for policy 0, policy_version 1690 (0.0029) [2024-06-27 11:01:17,704][00794] Fps is (10 sec: 44237.1, 60 sec: 42598.4, 300 sec: 42487.3). Total num frames: 27770880. Throughput: 0: 42853.8. Samples: 27923740. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 11:01:17,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:01:19,572][01035] Updated weights for policy 0, policy_version 1700 (0.0027) [2024-06-27 11:01:22,704][00794] Fps is (10 sec: 40960.9, 60 sec: 42598.5, 300 sec: 42431.8). Total num frames: 27951104. Throughput: 0: 42862.7. Samples: 28057900. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 11:01:22,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:01:23,606][01035] Updated weights for policy 0, policy_version 1710 (0.0035) [2024-06-27 11:01:27,131][01035] Updated weights for policy 0, policy_version 1720 (0.0034) [2024-06-27 11:01:27,704][00794] Fps is (10 sec: 42598.1, 60 sec: 43417.6, 300 sec: 42376.2). Total num frames: 28196864. Throughput: 0: 42990.7. Samples: 28313420. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 11:01:27,708][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:01:31,074][01035] Updated weights for policy 0, policy_version 1730 (0.0036) [2024-06-27 11:01:32,372][01015] Signal inference workers to stop experience collection... (350 times) [2024-06-27 11:01:32,373][01015] Signal inference workers to resume experience collection... (350 times) [2024-06-27 11:01:32,414][01035] InferenceWorker_p0-w0: stopping experience collection (350 times) [2024-06-27 11:01:32,414][01035] InferenceWorker_p0-w0: resuming experience collection (350 times) [2024-06-27 11:01:32,704][00794] Fps is (10 sec: 45875.1, 60 sec: 42871.5, 300 sec: 42542.9). Total num frames: 28409856. Throughput: 0: 42942.7. Samples: 28568160. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-27 11:01:32,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:01:34,918][01035] Updated weights for policy 0, policy_version 1740 (0.0033) [2024-06-27 11:01:37,704][00794] Fps is (10 sec: 40960.3, 60 sec: 43144.5, 300 sec: 42487.3). Total num frames: 28606464. Throughput: 0: 42857.9. Samples: 28696480. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 11:01:37,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:01:38,663][01035] Updated weights for policy 0, policy_version 1750 (0.0034) [2024-06-27 11:01:42,418][01035] Updated weights for policy 0, policy_version 1760 (0.0039) [2024-06-27 11:01:42,704][00794] Fps is (10 sec: 42597.8, 60 sec: 43144.5, 300 sec: 42376.2). Total num frames: 28835840. Throughput: 0: 43047.1. Samples: 28958140. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 11:01:42,705][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:01:46,361][01035] Updated weights for policy 0, policy_version 1770 (0.0036) [2024-06-27 11:01:47,705][00794] Fps is (10 sec: 44233.1, 60 sec: 42870.9, 300 sec: 42598.3). Total num frames: 29048832. Throughput: 0: 43066.3. Samples: 29216720. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-27 11:01:47,705][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 11:01:49,980][01035] Updated weights for policy 0, policy_version 1780 (0.0039) [2024-06-27 11:01:52,704][00794] Fps is (10 sec: 40960.0, 60 sec: 42871.5, 300 sec: 42487.3). Total num frames: 29245440. Throughput: 0: 42740.9. Samples: 29342140. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 11:01:52,705][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:01:53,986][01035] Updated weights for policy 0, policy_version 1790 (0.0039) [2024-06-27 11:01:57,704][00794] Fps is (10 sec: 42602.0, 60 sec: 43144.6, 300 sec: 42487.3). Total num frames: 29474816. Throughput: 0: 42793.5. Samples: 29593760. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 11:01:57,704][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 11:01:57,810][01035] Updated weights for policy 0, policy_version 1800 (0.0032) [2024-06-27 11:02:01,694][01035] Updated weights for policy 0, policy_version 1810 (0.0044) [2024-06-27 11:02:02,707][00794] Fps is (10 sec: 42584.0, 60 sec: 42595.9, 300 sec: 42542.4). Total num frames: 29671424. Throughput: 0: 42865.6. Samples: 29852840. Policy #0 lag: (min: 1.0, avg: 8.8, max: 21.0) [2024-06-27 11:02:02,708][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 11:02:05,485][01035] Updated weights for policy 0, policy_version 1820 (0.0027) [2024-06-27 11:02:07,704][00794] Fps is (10 sec: 42597.9, 60 sec: 42871.5, 300 sec: 42598.4). Total num frames: 29900800. Throughput: 0: 42590.1. Samples: 29974460. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-27 11:02:07,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:02:09,973][01035] Updated weights for policy 0, policy_version 1830 (0.0036) [2024-06-27 11:02:12,704][00794] Fps is (10 sec: 44252.2, 60 sec: 42871.6, 300 sec: 42598.4). Total num frames: 30113792. Throughput: 0: 42573.4. Samples: 30229220. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-27 11:02:12,705][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 11:02:13,007][01035] Updated weights for policy 0, policy_version 1840 (0.0035) [2024-06-27 11:02:17,704][00794] Fps is (10 sec: 40960.5, 60 sec: 42325.4, 300 sec: 42542.9). Total num frames: 30310400. Throughput: 0: 42780.4. Samples: 30493280. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 11:02:17,704][00794] Avg episode reward: [(0, '0.001')] [2024-06-27 11:02:17,707][01035] Updated weights for policy 0, policy_version 1850 (0.0032) [2024-06-27 11:02:20,891][01035] Updated weights for policy 0, policy_version 1860 (0.0032) [2024-06-27 11:02:22,704][00794] Fps is (10 sec: 42598.4, 60 sec: 43144.5, 300 sec: 42542.9). Total num frames: 30539776. Throughput: 0: 42517.8. Samples: 30609780. Policy #0 lag: (min: 0.0, avg: 10.6, max: 24.0) [2024-06-27 11:02:22,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:02:25,229][01035] Updated weights for policy 0, policy_version 1870 (0.0038) [2024-06-27 11:02:27,704][00794] Fps is (10 sec: 42598.5, 60 sec: 42325.4, 300 sec: 42543.0). Total num frames: 30736384. Throughput: 0: 42368.6. Samples: 30864720. Policy #0 lag: (min: 1.0, avg: 9.5, max: 21.0) [2024-06-27 11:02:27,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:02:28,538][01035] Updated weights for policy 0, policy_version 1880 (0.0034) [2024-06-27 11:02:32,704][00794] Fps is (10 sec: 40959.7, 60 sec: 42325.3, 300 sec: 42542.9). Total num frames: 30949376. Throughput: 0: 42518.1. Samples: 31130000. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-27 11:02:32,704][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:02:32,941][01035] Updated weights for policy 0, policy_version 1890 (0.0038) [2024-06-27 11:02:36,129][01035] Updated weights for policy 0, policy_version 1900 (0.0039) [2024-06-27 11:02:37,704][00794] Fps is (10 sec: 44236.1, 60 sec: 42871.4, 300 sec: 42487.3). Total num frames: 31178752. Throughput: 0: 42524.4. Samples: 31255740. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-27 11:02:37,705][00794] Avg episode reward: [(0, '0.001')] [2024-06-27 11:02:40,424][01035] Updated weights for policy 0, policy_version 1910 (0.0031) [2024-06-27 11:02:42,704][00794] Fps is (10 sec: 44236.8, 60 sec: 42598.4, 300 sec: 42598.4). Total num frames: 31391744. Throughput: 0: 42577.7. Samples: 31509760. Policy #0 lag: (min: 0.0, avg: 12.2, max: 23.0) [2024-06-27 11:02:42,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:02:43,891][01035] Updated weights for policy 0, policy_version 1920 (0.0030) [2024-06-27 11:02:47,704][00794] Fps is (10 sec: 40960.9, 60 sec: 42326.0, 300 sec: 42542.9). Total num frames: 31588352. Throughput: 0: 42622.5. Samples: 31770700. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 11:02:47,704][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:02:47,927][01035] Updated weights for policy 0, policy_version 1930 (0.0026) [2024-06-27 11:02:51,890][01035] Updated weights for policy 0, policy_version 1940 (0.0042) [2024-06-27 11:02:52,449][01015] Signal inference workers to stop experience collection... (400 times) [2024-06-27 11:02:52,503][01015] Signal inference workers to resume experience collection... (400 times) [2024-06-27 11:02:52,504][01035] InferenceWorker_p0-w0: stopping experience collection (400 times) [2024-06-27 11:02:52,522][01035] InferenceWorker_p0-w0: resuming experience collection (400 times) [2024-06-27 11:02:52,704][00794] Fps is (10 sec: 44237.3, 60 sec: 43144.6, 300 sec: 42542.9). Total num frames: 31834112. Throughput: 0: 42642.8. Samples: 31893380. Policy #0 lag: (min: 0.0, avg: 11.5, max: 20.0) [2024-06-27 11:02:52,704][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:02:55,536][01035] Updated weights for policy 0, policy_version 1950 (0.0037) [2024-06-27 11:02:57,704][00794] Fps is (10 sec: 45871.8, 60 sec: 42871.0, 300 sec: 42709.4). Total num frames: 32047104. Throughput: 0: 42687.4. Samples: 32150180. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-27 11:02:57,705][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:02:59,349][01035] Updated weights for policy 0, policy_version 1960 (0.0041) [2024-06-27 11:03:02,704][00794] Fps is (10 sec: 37683.2, 60 sec: 42327.8, 300 sec: 42487.3). Total num frames: 32210944. Throughput: 0: 42639.6. Samples: 32412060. Policy #0 lag: (min: 1.0, avg: 10.6, max: 22.0) [2024-06-27 11:03:02,704][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 11:03:03,309][01035] Updated weights for policy 0, policy_version 1970 (0.0041) [2024-06-27 11:03:07,007][01035] Updated weights for policy 0, policy_version 1980 (0.0035) [2024-06-27 11:03:07,708][00794] Fps is (10 sec: 40945.7, 60 sec: 42595.6, 300 sec: 42653.3). Total num frames: 32456704. Throughput: 0: 42727.6. Samples: 32532700. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-27 11:03:07,709][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 11:03:11,070][01035] Updated weights for policy 0, policy_version 1990 (0.0032) [2024-06-27 11:03:12,708][00794] Fps is (10 sec: 47493.9, 60 sec: 42868.5, 300 sec: 42709.5). Total num frames: 32686080. Throughput: 0: 42739.2. Samples: 32788160. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-27 11:03:12,708][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:03:12,721][01015] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000001995_32686080.pth... [2024-06-27 11:03:12,782][01015] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000001370_22446080.pth [2024-06-27 11:03:14,695][01035] Updated weights for policy 0, policy_version 2000 (0.0028) [2024-06-27 11:03:17,704][00794] Fps is (10 sec: 39337.5, 60 sec: 42325.3, 300 sec: 42487.3). Total num frames: 32849920. Throughput: 0: 42589.3. Samples: 33046520. Policy #0 lag: (min: 0.0, avg: 11.7, max: 22.0) [2024-06-27 11:03:17,704][00794] Avg episode reward: [(0, '0.005')] [2024-06-27 11:03:17,705][01015] Saving new best policy, reward=0.005! [2024-06-27 11:03:18,771][01035] Updated weights for policy 0, policy_version 2010 (0.0038) [2024-06-27 11:03:22,448][01035] Updated weights for policy 0, policy_version 2020 (0.0033) [2024-06-27 11:03:22,704][00794] Fps is (10 sec: 42615.9, 60 sec: 42871.5, 300 sec: 42653.9). Total num frames: 33112064. Throughput: 0: 42456.5. Samples: 33166280. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-27 11:03:22,704][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:03:26,401][01035] Updated weights for policy 0, policy_version 2030 (0.0032) [2024-06-27 11:03:27,704][00794] Fps is (10 sec: 45875.5, 60 sec: 42871.4, 300 sec: 42598.4). Total num frames: 33308672. Throughput: 0: 42677.4. Samples: 33430240. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-27 11:03:27,704][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:03:29,951][01035] Updated weights for policy 0, policy_version 2040 (0.0035) [2024-06-27 11:03:32,708][00794] Fps is (10 sec: 39305.5, 60 sec: 42595.5, 300 sec: 42542.3). Total num frames: 33505280. Throughput: 0: 42724.9. Samples: 33693500. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-27 11:03:32,708][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:03:33,982][01035] Updated weights for policy 0, policy_version 2050 (0.0030) [2024-06-27 11:03:37,704][00794] Fps is (10 sec: 42598.2, 60 sec: 42598.4, 300 sec: 42542.9). Total num frames: 33734656. Throughput: 0: 42720.4. Samples: 33815800. Policy #0 lag: (min: 1.0, avg: 10.7, max: 21.0) [2024-06-27 11:03:37,704][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 11:03:37,807][01035] Updated weights for policy 0, policy_version 2060 (0.0024) [2024-06-27 11:03:41,492][01035] Updated weights for policy 0, policy_version 2070 (0.0033) [2024-06-27 11:03:42,704][00794] Fps is (10 sec: 45893.9, 60 sec: 42871.5, 300 sec: 42654.0). Total num frames: 33964032. Throughput: 0: 42772.6. Samples: 34074920. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-27 11:03:42,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:03:45,430][01035] Updated weights for policy 0, policy_version 2080 (0.0032) [2024-06-27 11:03:47,704][00794] Fps is (10 sec: 40960.1, 60 sec: 42598.3, 300 sec: 42542.9). Total num frames: 34144256. Throughput: 0: 42737.3. Samples: 34335240. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 11:03:47,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:03:49,098][01035] Updated weights for policy 0, policy_version 2090 (0.0035) [2024-06-27 11:03:52,704][00794] Fps is (10 sec: 40960.4, 60 sec: 42325.3, 300 sec: 42598.4). Total num frames: 34373632. Throughput: 0: 42852.9. Samples: 34460900. Policy #0 lag: (min: 2.0, avg: 11.4, max: 22.0) [2024-06-27 11:03:52,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:03:53,057][01035] Updated weights for policy 0, policy_version 2100 (0.0049) [2024-06-27 11:03:56,731][01035] Updated weights for policy 0, policy_version 2110 (0.0037) [2024-06-27 11:03:57,704][00794] Fps is (10 sec: 45875.0, 60 sec: 42598.8, 300 sec: 42653.9). Total num frames: 34603008. Throughput: 0: 42790.1. Samples: 34713540. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 11:03:57,704][00794] Avg episode reward: [(0, '0.001')] [2024-06-27 11:04:00,654][01035] Updated weights for policy 0, policy_version 2120 (0.0031) [2024-06-27 11:04:02,704][00794] Fps is (10 sec: 40959.7, 60 sec: 42871.4, 300 sec: 42542.9). Total num frames: 34783232. Throughput: 0: 42738.7. Samples: 34969760. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 11:04:02,704][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 11:04:04,430][01035] Updated weights for policy 0, policy_version 2130 (0.0046) [2024-06-27 11:04:06,182][01015] Signal inference workers to stop experience collection... (450 times) [2024-06-27 11:04:06,232][01035] InferenceWorker_p0-w0: stopping experience collection (450 times) [2024-06-27 11:04:06,235][01015] Signal inference workers to resume experience collection... (450 times) [2024-06-27 11:04:06,243][01035] InferenceWorker_p0-w0: resuming experience collection (450 times) [2024-06-27 11:04:07,704][00794] Fps is (10 sec: 40960.6, 60 sec: 42601.4, 300 sec: 42598.4). Total num frames: 35012608. Throughput: 0: 42821.4. Samples: 35093240. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 11:04:07,704][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 11:04:08,338][01035] Updated weights for policy 0, policy_version 2140 (0.0042) [2024-06-27 11:04:12,413][01035] Updated weights for policy 0, policy_version 2150 (0.0032) [2024-06-27 11:04:12,708][00794] Fps is (10 sec: 45856.3, 60 sec: 42598.4, 300 sec: 42653.3). Total num frames: 35241984. Throughput: 0: 42711.6. Samples: 35352440. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 11:04:12,709][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:04:16,346][01035] Updated weights for policy 0, policy_version 2160 (0.0033) [2024-06-27 11:04:17,704][00794] Fps is (10 sec: 40959.8, 60 sec: 42871.5, 300 sec: 42598.9). Total num frames: 35422208. Throughput: 0: 42393.7. Samples: 35601040. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-27 11:04:17,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:04:20,034][01035] Updated weights for policy 0, policy_version 2170 (0.0043) [2024-06-27 11:04:22,704][00794] Fps is (10 sec: 39337.5, 60 sec: 42052.2, 300 sec: 42653.9). Total num frames: 35635200. Throughput: 0: 42430.2. Samples: 35725160. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-27 11:04:22,705][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:04:23,951][01035] Updated weights for policy 0, policy_version 2180 (0.0032) [2024-06-27 11:04:27,696][01035] Updated weights for policy 0, policy_version 2190 (0.0042) [2024-06-27 11:04:27,704][00794] Fps is (10 sec: 45875.3, 60 sec: 42871.5, 300 sec: 42709.5). Total num frames: 35880960. Throughput: 0: 42482.7. Samples: 35986640. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-27 11:04:27,705][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:04:31,865][01035] Updated weights for policy 0, policy_version 2200 (0.0045) [2024-06-27 11:04:32,704][00794] Fps is (10 sec: 44237.2, 60 sec: 42874.4, 300 sec: 42709.5). Total num frames: 36077568. Throughput: 0: 42291.1. Samples: 36238340. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-27 11:04:32,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:04:35,261][01035] Updated weights for policy 0, policy_version 2210 (0.0024) [2024-06-27 11:04:37,704][00794] Fps is (10 sec: 40959.9, 60 sec: 42598.4, 300 sec: 42709.5). Total num frames: 36290560. Throughput: 0: 42281.7. Samples: 36363580. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-27 11:04:37,704][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:04:39,459][01035] Updated weights for policy 0, policy_version 2220 (0.0024) [2024-06-27 11:04:42,704][00794] Fps is (10 sec: 44237.1, 60 sec: 42598.5, 300 sec: 42654.0). Total num frames: 36519936. Throughput: 0: 42477.5. Samples: 36625020. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 11:04:42,704][00794] Avg episode reward: [(0, '0.005')] [2024-06-27 11:04:42,799][01035] Updated weights for policy 0, policy_version 2230 (0.0041) [2024-06-27 11:04:47,369][01035] Updated weights for policy 0, policy_version 2240 (0.0029) [2024-06-27 11:04:47,704][00794] Fps is (10 sec: 40960.1, 60 sec: 42598.4, 300 sec: 42598.4). Total num frames: 36700160. Throughput: 0: 42402.3. Samples: 36877860. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-27 11:04:47,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:04:50,347][01035] Updated weights for policy 0, policy_version 2250 (0.0041) [2024-06-27 11:04:52,705][00794] Fps is (10 sec: 40953.2, 60 sec: 42597.2, 300 sec: 42709.3). Total num frames: 36929536. Throughput: 0: 42277.1. Samples: 36995780. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-27 11:04:52,706][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 11:04:55,003][01035] Updated weights for policy 0, policy_version 2260 (0.0037) [2024-06-27 11:04:57,704][00794] Fps is (10 sec: 45874.6, 60 sec: 42598.4, 300 sec: 42653.9). Total num frames: 37158912. Throughput: 0: 42423.8. Samples: 37261340. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 11:04:57,705][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:04:57,875][01035] Updated weights for policy 0, policy_version 2270 (0.0039) [2024-06-27 11:05:02,708][00794] Fps is (10 sec: 40949.7, 60 sec: 42595.5, 300 sec: 42597.8). Total num frames: 37339136. Throughput: 0: 42528.6. Samples: 37515000. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-27 11:05:02,709][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 11:05:03,036][01035] Updated weights for policy 0, policy_version 2280 (0.0033) [2024-06-27 11:05:05,907][01035] Updated weights for policy 0, policy_version 2290 (0.0032) [2024-06-27 11:05:07,704][00794] Fps is (10 sec: 39322.2, 60 sec: 42325.3, 300 sec: 42654.0). Total num frames: 37552128. Throughput: 0: 42385.0. Samples: 37632480. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-27 11:05:07,704][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:05:10,707][01035] Updated weights for policy 0, policy_version 2300 (0.0031) [2024-06-27 11:05:12,704][00794] Fps is (10 sec: 45893.3, 60 sec: 42601.2, 300 sec: 42653.9). Total num frames: 37797888. Throughput: 0: 42558.9. Samples: 37901800. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 11:05:12,705][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 11:05:12,720][01015] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000002307_37797888.pth... [2024-06-27 11:05:12,764][01015] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000001681_27541504.pth [2024-06-27 11:05:13,649][01035] Updated weights for policy 0, policy_version 2310 (0.0041) [2024-06-27 11:05:17,704][00794] Fps is (10 sec: 44236.3, 60 sec: 42871.4, 300 sec: 42709.5). Total num frames: 37994496. Throughput: 0: 42599.0. Samples: 38155300. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-27 11:05:17,704][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 11:05:18,289][01035] Updated weights for policy 0, policy_version 2320 (0.0045) [2024-06-27 11:05:21,482][01035] Updated weights for policy 0, policy_version 2330 (0.0033) [2024-06-27 11:05:22,704][00794] Fps is (10 sec: 39321.9, 60 sec: 42598.4, 300 sec: 42709.5). Total num frames: 38191104. Throughput: 0: 42606.6. Samples: 38280880. Policy #0 lag: (min: 0.0, avg: 11.1, max: 20.0) [2024-06-27 11:05:22,704][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 11:05:25,807][01035] Updated weights for policy 0, policy_version 2340 (0.0048) [2024-06-27 11:05:27,708][00794] Fps is (10 sec: 42581.2, 60 sec: 42322.4, 300 sec: 42653.3). Total num frames: 38420480. Throughput: 0: 42514.3. Samples: 38538340. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-27 11:05:27,709][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:05:29,177][01035] Updated weights for policy 0, policy_version 2350 (0.0033) [2024-06-27 11:05:32,704][00794] Fps is (10 sec: 42598.9, 60 sec: 42325.4, 300 sec: 42709.5). Total num frames: 38617088. Throughput: 0: 42623.1. Samples: 38795900. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-27 11:05:32,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:05:33,368][01035] Updated weights for policy 0, policy_version 2360 (0.0044) [2024-06-27 11:05:36,841][01035] Updated weights for policy 0, policy_version 2370 (0.0033) [2024-06-27 11:05:37,704][00794] Fps is (10 sec: 42615.7, 60 sec: 42598.3, 300 sec: 42709.5). Total num frames: 38846464. Throughput: 0: 42758.8. Samples: 38919860. Policy #0 lag: (min: 1.0, avg: 11.3, max: 21.0) [2024-06-27 11:05:37,705][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:05:41,214][01035] Updated weights for policy 0, policy_version 2380 (0.0026) [2024-06-27 11:05:42,704][00794] Fps is (10 sec: 44236.2, 60 sec: 42325.2, 300 sec: 42653.9). Total num frames: 39059456. Throughput: 0: 42619.6. Samples: 39179220. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 11:05:42,705][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 11:05:44,677][01035] Updated weights for policy 0, policy_version 2390 (0.0037) [2024-06-27 11:05:47,704][00794] Fps is (10 sec: 40960.0, 60 sec: 42598.3, 300 sec: 42653.9). Total num frames: 39256064. Throughput: 0: 42564.7. Samples: 39430240. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 11:05:47,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:05:48,978][01035] Updated weights for policy 0, policy_version 2400 (0.0023) [2024-06-27 11:05:52,257][01035] Updated weights for policy 0, policy_version 2410 (0.0028) [2024-06-27 11:05:52,704][00794] Fps is (10 sec: 44236.9, 60 sec: 42872.6, 300 sec: 42765.0). Total num frames: 39501824. Throughput: 0: 42863.0. Samples: 39561320. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-27 11:05:52,704][00794] Avg episode reward: [(0, '0.005')] [2024-06-27 11:05:56,426][01015] Signal inference workers to stop experience collection... (500 times) [2024-06-27 11:05:56,426][01015] Signal inference workers to resume experience collection... (500 times) [2024-06-27 11:05:56,444][01035] InferenceWorker_p0-w0: stopping experience collection (500 times) [2024-06-27 11:05:56,444][01035] InferenceWorker_p0-w0: resuming experience collection (500 times) [2024-06-27 11:05:56,573][01035] Updated weights for policy 0, policy_version 2420 (0.0039) [2024-06-27 11:05:57,704][00794] Fps is (10 sec: 42598.2, 60 sec: 42052.3, 300 sec: 42598.4). Total num frames: 39682048. Throughput: 0: 42515.2. Samples: 39814980. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 11:05:57,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:06:00,007][01035] Updated weights for policy 0, policy_version 2430 (0.0029) [2024-06-27 11:06:02,704][00794] Fps is (10 sec: 37683.1, 60 sec: 42328.2, 300 sec: 42542.9). Total num frames: 39878656. Throughput: 0: 42569.8. Samples: 40070940. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-27 11:06:02,704][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 11:06:04,167][01035] Updated weights for policy 0, policy_version 2440 (0.0036) [2024-06-27 11:06:07,647][01035] Updated weights for policy 0, policy_version 2450 (0.0043) [2024-06-27 11:06:07,704][00794] Fps is (10 sec: 45875.4, 60 sec: 43144.5, 300 sec: 42709.5). Total num frames: 40140800. Throughput: 0: 42637.3. Samples: 40199560. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-27 11:06:07,705][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:06:11,901][01035] Updated weights for policy 0, policy_version 2460 (0.0037) [2024-06-27 11:06:12,704][00794] Fps is (10 sec: 44236.8, 60 sec: 42052.3, 300 sec: 42542.8). Total num frames: 40321024. Throughput: 0: 42610.9. Samples: 40455660. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-27 11:06:12,705][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:06:15,569][01035] Updated weights for policy 0, policy_version 2470 (0.0037) [2024-06-27 11:06:17,708][00794] Fps is (10 sec: 37668.1, 60 sec: 42049.5, 300 sec: 42597.8). Total num frames: 40517632. Throughput: 0: 42477.5. Samples: 40707560. Policy #0 lag: (min: 1.0, avg: 10.9, max: 24.0) [2024-06-27 11:06:17,708][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 11:06:19,445][01035] Updated weights for policy 0, policy_version 2480 (0.0029) [2024-06-27 11:06:22,704][00794] Fps is (10 sec: 44237.0, 60 sec: 42871.5, 300 sec: 42598.4). Total num frames: 40763392. Throughput: 0: 42636.9. Samples: 40838520. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-27 11:06:22,704][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 11:06:23,140][01035] Updated weights for policy 0, policy_version 2490 (0.0022) [2024-06-27 11:06:27,503][01035] Updated weights for policy 0, policy_version 2500 (0.0028) [2024-06-27 11:06:27,704][00794] Fps is (10 sec: 44255.2, 60 sec: 42328.3, 300 sec: 42542.9). Total num frames: 40960000. Throughput: 0: 42552.2. Samples: 41094060. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 11:06:27,704][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 11:06:30,652][01035] Updated weights for policy 0, policy_version 2510 (0.0026) [2024-06-27 11:06:32,704][00794] Fps is (10 sec: 40960.4, 60 sec: 42598.4, 300 sec: 42598.4). Total num frames: 41172992. Throughput: 0: 42780.1. Samples: 41355340. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 11:06:32,704][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:06:34,992][01035] Updated weights for policy 0, policy_version 2520 (0.0031) [2024-06-27 11:06:37,704][00794] Fps is (10 sec: 45874.3, 60 sec: 42871.4, 300 sec: 42653.9). Total num frames: 41418752. Throughput: 0: 42730.2. Samples: 41484180. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-27 11:06:37,704][00794] Avg episode reward: [(0, '0.006')] [2024-06-27 11:06:37,705][01015] Saving new best policy, reward=0.006! [2024-06-27 11:06:38,327][01035] Updated weights for policy 0, policy_version 2530 (0.0062) [2024-06-27 11:06:42,518][01035] Updated weights for policy 0, policy_version 2540 (0.0036) [2024-06-27 11:06:42,704][00794] Fps is (10 sec: 44236.7, 60 sec: 42598.5, 300 sec: 42598.5). Total num frames: 41615360. Throughput: 0: 42767.7. Samples: 41739520. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 11:06:42,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:06:45,970][01035] Updated weights for policy 0, policy_version 2550 (0.0026) [2024-06-27 11:06:47,704][00794] Fps is (10 sec: 40958.4, 60 sec: 42871.2, 300 sec: 42653.9). Total num frames: 41828352. Throughput: 0: 42841.0. Samples: 41998800. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 11:06:47,705][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:06:50,035][01035] Updated weights for policy 0, policy_version 2560 (0.0030) [2024-06-27 11:06:52,704][00794] Fps is (10 sec: 40960.0, 60 sec: 42052.3, 300 sec: 42542.9). Total num frames: 42024960. Throughput: 0: 42854.3. Samples: 42128000. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 11:06:52,704][00794] Avg episode reward: [(0, '0.005')] [2024-06-27 11:06:53,686][01035] Updated weights for policy 0, policy_version 2570 (0.0032) [2024-06-27 11:06:57,698][01035] Updated weights for policy 0, policy_version 2580 (0.0041) [2024-06-27 11:06:57,704][00794] Fps is (10 sec: 44237.8, 60 sec: 43144.4, 300 sec: 42709.9). Total num frames: 42270720. Throughput: 0: 43063.4. Samples: 42393520. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 11:06:57,708][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:07:01,297][01035] Updated weights for policy 0, policy_version 2590 (0.0028) [2024-06-27 11:07:02,704][00794] Fps is (10 sec: 45874.9, 60 sec: 43417.6, 300 sec: 42653.9). Total num frames: 42483712. Throughput: 0: 43103.4. Samples: 42647040. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 11:07:02,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:07:05,317][01035] Updated weights for policy 0, policy_version 2600 (0.0039) [2024-06-27 11:07:07,704][00794] Fps is (10 sec: 40961.1, 60 sec: 42325.4, 300 sec: 42598.4). Total num frames: 42680320. Throughput: 0: 43033.0. Samples: 42775000. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 11:07:07,704][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 11:07:09,034][01035] Updated weights for policy 0, policy_version 2610 (0.0046) [2024-06-27 11:07:12,704][00794] Fps is (10 sec: 42598.9, 60 sec: 43144.6, 300 sec: 42709.5). Total num frames: 42909696. Throughput: 0: 43111.1. Samples: 43034060. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-27 11:07:12,704][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:07:12,800][01015] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000002620_42926080.pth... [2024-06-27 11:07:12,809][01035] Updated weights for policy 0, policy_version 2620 (0.0036) [2024-06-27 11:07:12,866][01015] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000001995_32686080.pth [2024-06-27 11:07:16,639][01035] Updated weights for policy 0, policy_version 2630 (0.0040) [2024-06-27 11:07:17,704][00794] Fps is (10 sec: 45875.0, 60 sec: 43693.6, 300 sec: 42709.5). Total num frames: 43139072. Throughput: 0: 42976.8. Samples: 43289300. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 11:07:17,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:07:20,301][01035] Updated weights for policy 0, policy_version 2640 (0.0034) [2024-06-27 11:07:22,704][00794] Fps is (10 sec: 42598.4, 60 sec: 42871.5, 300 sec: 42709.5). Total num frames: 43335680. Throughput: 0: 43024.1. Samples: 43420260. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 11:07:22,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:07:24,284][01035] Updated weights for policy 0, policy_version 2650 (0.0041) [2024-06-27 11:07:26,869][01015] Signal inference workers to stop experience collection... (550 times) [2024-06-27 11:07:26,869][01015] Signal inference workers to resume experience collection... (550 times) [2024-06-27 11:07:26,885][01035] InferenceWorker_p0-w0: stopping experience collection (550 times) [2024-06-27 11:07:26,885][01035] InferenceWorker_p0-w0: resuming experience collection (550 times) [2024-06-27 11:07:27,704][00794] Fps is (10 sec: 42598.4, 60 sec: 43417.5, 300 sec: 42765.0). Total num frames: 43565056. Throughput: 0: 43113.7. Samples: 43679640. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 11:07:27,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:07:27,747][01035] Updated weights for policy 0, policy_version 2660 (0.0044) [2024-06-27 11:07:31,946][01035] Updated weights for policy 0, policy_version 2670 (0.0027) [2024-06-27 11:07:32,704][00794] Fps is (10 sec: 44236.6, 60 sec: 43417.6, 300 sec: 42709.5). Total num frames: 43778048. Throughput: 0: 43060.9. Samples: 43936520. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-27 11:07:32,704][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 11:07:35,624][01035] Updated weights for policy 0, policy_version 2680 (0.0035) [2024-06-27 11:07:37,704][00794] Fps is (10 sec: 42598.6, 60 sec: 42871.6, 300 sec: 42709.5). Total num frames: 43991040. Throughput: 0: 43049.8. Samples: 44065240. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 11:07:37,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:07:39,811][01035] Updated weights for policy 0, policy_version 2690 (0.0028) [2024-06-27 11:07:42,704][00794] Fps is (10 sec: 42597.5, 60 sec: 43144.4, 300 sec: 42765.0). Total num frames: 44204032. Throughput: 0: 42835.2. Samples: 44321100. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-27 11:07:42,705][00794] Avg episode reward: [(0, '0.005')] [2024-06-27 11:07:43,166][01035] Updated weights for policy 0, policy_version 2700 (0.0029) [2024-06-27 11:07:47,395][01035] Updated weights for policy 0, policy_version 2710 (0.0029) [2024-06-27 11:07:47,704][00794] Fps is (10 sec: 42598.1, 60 sec: 43144.9, 300 sec: 42653.9). Total num frames: 44417024. Throughput: 0: 42885.8. Samples: 44576900. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-27 11:07:47,708][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:07:50,830][01035] Updated weights for policy 0, policy_version 2720 (0.0035) [2024-06-27 11:07:52,704][00794] Fps is (10 sec: 42598.9, 60 sec: 43417.5, 300 sec: 42654.0). Total num frames: 44630016. Throughput: 0: 42906.1. Samples: 44705780. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 11:07:52,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:07:55,055][01035] Updated weights for policy 0, policy_version 2730 (0.0029) [2024-06-27 11:07:57,708][00794] Fps is (10 sec: 44218.9, 60 sec: 43141.8, 300 sec: 42875.5). Total num frames: 44859392. Throughput: 0: 42971.6. Samples: 44967960. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 11:07:57,708][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:07:58,487][01035] Updated weights for policy 0, policy_version 2740 (0.0035) [2024-06-27 11:08:02,548][01035] Updated weights for policy 0, policy_version 2750 (0.0031) [2024-06-27 11:08:02,704][00794] Fps is (10 sec: 42599.0, 60 sec: 42871.5, 300 sec: 42710.1). Total num frames: 45056000. Throughput: 0: 42956.1. Samples: 45222320. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 11:08:02,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:08:05,975][01035] Updated weights for policy 0, policy_version 2760 (0.0040) [2024-06-27 11:08:07,704][00794] Fps is (10 sec: 40976.8, 60 sec: 43144.5, 300 sec: 42654.5). Total num frames: 45268992. Throughput: 0: 42852.9. Samples: 45348640. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-27 11:08:07,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:08:10,007][01035] Updated weights for policy 0, policy_version 2770 (0.0033) [2024-06-27 11:08:12,704][00794] Fps is (10 sec: 44236.3, 60 sec: 43144.4, 300 sec: 42876.1). Total num frames: 45498368. Throughput: 0: 42910.6. Samples: 45610620. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-27 11:08:12,704][00794] Avg episode reward: [(0, '0.006')] [2024-06-27 11:08:13,542][01035] Updated weights for policy 0, policy_version 2780 (0.0045) [2024-06-27 11:08:17,574][01035] Updated weights for policy 0, policy_version 2790 (0.0031) [2024-06-27 11:08:17,704][00794] Fps is (10 sec: 44236.5, 60 sec: 42871.4, 300 sec: 42709.5). Total num frames: 45711360. Throughput: 0: 42768.8. Samples: 45861120. Policy #0 lag: (min: 1.0, avg: 9.4, max: 21.0) [2024-06-27 11:08:17,704][00794] Avg episode reward: [(0, '0.007')] [2024-06-27 11:08:17,705][01015] Saving new best policy, reward=0.007! [2024-06-27 11:08:21,621][01035] Updated weights for policy 0, policy_version 2800 (0.0039) [2024-06-27 11:08:22,708][00794] Fps is (10 sec: 42581.3, 60 sec: 43141.6, 300 sec: 42764.4). Total num frames: 45924352. Throughput: 0: 42698.3. Samples: 45986840. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-27 11:08:22,708][00794] Avg episode reward: [(0, '0.007')] [2024-06-27 11:08:25,027][01035] Updated weights for policy 0, policy_version 2810 (0.0039) [2024-06-27 11:08:27,704][00794] Fps is (10 sec: 44237.1, 60 sec: 43144.6, 300 sec: 42876.7). Total num frames: 46153728. Throughput: 0: 43112.7. Samples: 46261160. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-27 11:08:27,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:08:29,016][01035] Updated weights for policy 0, policy_version 2820 (0.0037) [2024-06-27 11:08:32,612][01035] Updated weights for policy 0, policy_version 2830 (0.0031) [2024-06-27 11:08:32,708][00794] Fps is (10 sec: 44237.0, 60 sec: 43141.6, 300 sec: 42820.0). Total num frames: 46366720. Throughput: 0: 43003.7. Samples: 46512240. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 11:08:32,708][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 11:08:36,490][01035] Updated weights for policy 0, policy_version 2840 (0.0042) [2024-06-27 11:08:37,704][00794] Fps is (10 sec: 40960.2, 60 sec: 42871.5, 300 sec: 42709.5). Total num frames: 46563328. Throughput: 0: 43126.8. Samples: 46646480. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-27 11:08:37,704][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:08:40,061][01035] Updated weights for policy 0, policy_version 2850 (0.0029) [2024-06-27 11:08:42,704][00794] Fps is (10 sec: 40977.0, 60 sec: 42871.7, 300 sec: 42820.6). Total num frames: 46776320. Throughput: 0: 43236.9. Samples: 46913440. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-27 11:08:42,704][00794] Avg episode reward: [(0, '0.005')] [2024-06-27 11:08:43,863][01035] Updated weights for policy 0, policy_version 2860 (0.0038) [2024-06-27 11:08:47,704][00794] Fps is (10 sec: 42598.1, 60 sec: 42871.5, 300 sec: 42765.0). Total num frames: 46989312. Throughput: 0: 43180.8. Samples: 47165460. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 11:08:47,704][00794] Avg episode reward: [(0, '0.006')] [2024-06-27 11:08:47,977][01035] Updated weights for policy 0, policy_version 2870 (0.0040) [2024-06-27 11:08:51,588][01035] Updated weights for policy 0, policy_version 2880 (0.0033) [2024-06-27 11:08:52,704][00794] Fps is (10 sec: 45874.3, 60 sec: 43417.6, 300 sec: 42820.6). Total num frames: 47235072. Throughput: 0: 43321.7. Samples: 47298120. Policy #0 lag: (min: 1.0, avg: 9.1, max: 21.0) [2024-06-27 11:08:52,704][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:08:55,535][01035] Updated weights for policy 0, policy_version 2890 (0.0031) [2024-06-27 11:08:57,704][00794] Fps is (10 sec: 42598.7, 60 sec: 42601.3, 300 sec: 42820.6). Total num frames: 47415296. Throughput: 0: 43275.7. Samples: 47558020. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 11:08:57,704][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:08:59,186][01035] Updated weights for policy 0, policy_version 2900 (0.0030) [2024-06-27 11:08:59,766][01015] Signal inference workers to stop experience collection... (600 times) [2024-06-27 11:08:59,766][01015] Signal inference workers to resume experience collection... (600 times) [2024-06-27 11:08:59,805][01035] InferenceWorker_p0-w0: stopping experience collection (600 times) [2024-06-27 11:08:59,805][01035] InferenceWorker_p0-w0: resuming experience collection (600 times) [2024-06-27 11:09:02,704][00794] Fps is (10 sec: 39322.2, 60 sec: 42871.5, 300 sec: 42765.0). Total num frames: 47628288. Throughput: 0: 43243.7. Samples: 47807080. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-27 11:09:02,704][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 11:09:03,470][01035] Updated weights for policy 0, policy_version 2910 (0.0044) [2024-06-27 11:09:06,821][01035] Updated weights for policy 0, policy_version 2920 (0.0047) [2024-06-27 11:09:07,704][00794] Fps is (10 sec: 47513.7, 60 sec: 43690.7, 300 sec: 42876.7). Total num frames: 47890432. Throughput: 0: 43436.9. Samples: 47941320. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2024-06-27 11:09:07,704][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:09:10,981][01035] Updated weights for policy 0, policy_version 2930 (0.0040) [2024-06-27 11:09:12,704][00794] Fps is (10 sec: 42597.9, 60 sec: 42598.4, 300 sec: 42820.5). Total num frames: 48054272. Throughput: 0: 42927.0. Samples: 48192880. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-27 11:09:12,704][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:09:12,718][01015] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000002933_48054272.pth... [2024-06-27 11:09:12,775][01015] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000002307_37797888.pth [2024-06-27 11:09:14,630][01035] Updated weights for policy 0, policy_version 2940 (0.0030) [2024-06-27 11:09:17,704][00794] Fps is (10 sec: 40959.3, 60 sec: 43144.5, 300 sec: 42931.6). Total num frames: 48300032. Throughput: 0: 43083.4. Samples: 48450820. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 11:09:17,704][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:09:18,465][01035] Updated weights for policy 0, policy_version 2950 (0.0030) [2024-06-27 11:09:22,206][01035] Updated weights for policy 0, policy_version 2960 (0.0038) [2024-06-27 11:09:22,704][00794] Fps is (10 sec: 45875.1, 60 sec: 43147.4, 300 sec: 42820.5). Total num frames: 48513024. Throughput: 0: 43159.0. Samples: 48588640. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-27 11:09:22,704][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:09:25,851][01035] Updated weights for policy 0, policy_version 2970 (0.0028) [2024-06-27 11:09:27,708][00794] Fps is (10 sec: 40943.6, 60 sec: 42595.5, 300 sec: 42820.0). Total num frames: 48709632. Throughput: 0: 42889.8. Samples: 48843660. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 11:09:27,709][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:09:29,670][01035] Updated weights for policy 0, policy_version 2980 (0.0036) [2024-06-27 11:09:32,704][00794] Fps is (10 sec: 44236.7, 60 sec: 43147.4, 300 sec: 42931.6). Total num frames: 48955392. Throughput: 0: 43055.0. Samples: 49102940. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-27 11:09:32,704][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:09:33,685][01035] Updated weights for policy 0, policy_version 2990 (0.0035) [2024-06-27 11:09:37,208][01035] Updated weights for policy 0, policy_version 3000 (0.0032) [2024-06-27 11:09:37,704][00794] Fps is (10 sec: 45893.7, 60 sec: 43417.5, 300 sec: 42876.1). Total num frames: 49168384. Throughput: 0: 43094.7. Samples: 49237380. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 11:09:37,704][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:09:41,378][01035] Updated weights for policy 0, policy_version 3010 (0.0041) [2024-06-27 11:09:42,708][00794] Fps is (10 sec: 39306.0, 60 sec: 42868.5, 300 sec: 42875.5). Total num frames: 49348608. Throughput: 0: 42877.8. Samples: 49487700. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2024-06-27 11:09:42,710][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:09:44,870][01035] Updated weights for policy 0, policy_version 3020 (0.0039) [2024-06-27 11:09:47,704][00794] Fps is (10 sec: 40959.9, 60 sec: 43144.5, 300 sec: 42876.3). Total num frames: 49577984. Throughput: 0: 43095.0. Samples: 49746360. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 11:09:47,705][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:09:48,930][01035] Updated weights for policy 0, policy_version 3030 (0.0026) [2024-06-27 11:09:52,361][01035] Updated weights for policy 0, policy_version 3040 (0.0034) [2024-06-27 11:09:52,704][00794] Fps is (10 sec: 47532.8, 60 sec: 43144.6, 300 sec: 42931.6). Total num frames: 49823744. Throughput: 0: 43133.6. Samples: 49882340. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 11:09:52,705][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:09:56,543][01035] Updated weights for policy 0, policy_version 3050 (0.0029) [2024-06-27 11:09:57,704][00794] Fps is (10 sec: 40960.7, 60 sec: 42871.5, 300 sec: 42876.7). Total num frames: 49987584. Throughput: 0: 43181.0. Samples: 50136020. Policy #0 lag: (min: 1.0, avg: 10.0, max: 21.0) [2024-06-27 11:09:57,704][00794] Avg episode reward: [(0, '0.007')] [2024-06-27 11:09:59,833][01035] Updated weights for policy 0, policy_version 3060 (0.0042) [2024-06-27 11:10:02,706][00794] Fps is (10 sec: 39314.7, 60 sec: 43143.2, 300 sec: 42931.4). Total num frames: 50216960. Throughput: 0: 43122.4. Samples: 50391400. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-27 11:10:02,706][00794] Avg episode reward: [(0, '0.005')] [2024-06-27 11:10:04,223][01035] Updated weights for policy 0, policy_version 3070 (0.0032) [2024-06-27 11:10:07,704][00794] Fps is (10 sec: 45874.5, 60 sec: 42598.3, 300 sec: 42876.1). Total num frames: 50446336. Throughput: 0: 42994.7. Samples: 50523400. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 11:10:07,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:10:08,007][01035] Updated weights for policy 0, policy_version 3080 (0.0043) [2024-06-27 11:10:11,759][01035] Updated weights for policy 0, policy_version 3090 (0.0030) [2024-06-27 11:10:12,704][00794] Fps is (10 sec: 42606.0, 60 sec: 43144.6, 300 sec: 42876.1). Total num frames: 50642944. Throughput: 0: 43020.4. Samples: 50779400. Policy #0 lag: (min: 1.0, avg: 10.7, max: 21.0) [2024-06-27 11:10:12,704][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:10:15,508][01035] Updated weights for policy 0, policy_version 3100 (0.0031) [2024-06-27 11:10:17,704][00794] Fps is (10 sec: 42598.7, 60 sec: 42871.5, 300 sec: 42987.2). Total num frames: 50872320. Throughput: 0: 43104.1. Samples: 51042620. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 11:10:17,704][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:10:19,378][01035] Updated weights for policy 0, policy_version 3110 (0.0030) [2024-06-27 11:10:22,704][00794] Fps is (10 sec: 45875.2, 60 sec: 43144.6, 300 sec: 42987.8). Total num frames: 51101696. Throughput: 0: 43010.3. Samples: 51172840. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-27 11:10:22,704][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:10:22,931][01035] Updated weights for policy 0, policy_version 3120 (0.0043) [2024-06-27 11:10:27,035][01035] Updated weights for policy 0, policy_version 3130 (0.0043) [2024-06-27 11:10:27,704][00794] Fps is (10 sec: 40960.2, 60 sec: 42874.4, 300 sec: 42931.6). Total num frames: 51281920. Throughput: 0: 43094.6. Samples: 51426780. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 11:10:27,704][00794] Avg episode reward: [(0, '0.005')] [2024-06-27 11:10:30,446][01035] Updated weights for policy 0, policy_version 3140 (0.0032) [2024-06-27 11:10:32,707][00794] Fps is (10 sec: 40948.7, 60 sec: 42596.5, 300 sec: 42931.2). Total num frames: 51511296. Throughput: 0: 43221.5. Samples: 51691440. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 11:10:32,707][00794] Avg episode reward: [(0, '0.005')] [2024-06-27 11:10:34,722][01035] Updated weights for policy 0, policy_version 3150 (0.0039) [2024-06-27 11:10:37,704][00794] Fps is (10 sec: 47513.2, 60 sec: 43144.5, 300 sec: 43042.7). Total num frames: 51757056. Throughput: 0: 43086.2. Samples: 51821220. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 11:10:37,704][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:10:38,111][01035] Updated weights for policy 0, policy_version 3160 (0.0025) [2024-06-27 11:10:42,458][01035] Updated weights for policy 0, policy_version 3170 (0.0032) [2024-06-27 11:10:42,704][00794] Fps is (10 sec: 42610.0, 60 sec: 43147.4, 300 sec: 42987.2). Total num frames: 51937280. Throughput: 0: 43074.1. Samples: 52074360. Policy #0 lag: (min: 1.0, avg: 8.6, max: 21.0) [2024-06-27 11:10:42,704][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 11:10:45,960][01035] Updated weights for policy 0, policy_version 3180 (0.0034) [2024-06-27 11:10:47,440][01015] Signal inference workers to stop experience collection... (650 times) [2024-06-27 11:10:47,441][01015] Signal inference workers to resume experience collection... (650 times) [2024-06-27 11:10:47,487][01035] InferenceWorker_p0-w0: stopping experience collection (650 times) [2024-06-27 11:10:47,488][01035] InferenceWorker_p0-w0: resuming experience collection (650 times) [2024-06-27 11:10:47,704][00794] Fps is (10 sec: 39321.9, 60 sec: 42871.5, 300 sec: 42876.1). Total num frames: 52150272. Throughput: 0: 43282.2. Samples: 52339020. Policy #0 lag: (min: 1.0, avg: 11.1, max: 21.0) [2024-06-27 11:10:47,704][00794] Avg episode reward: [(0, '0.006')] [2024-06-27 11:10:50,162][01035] Updated weights for policy 0, policy_version 3190 (0.0028) [2024-06-27 11:10:52,704][00794] Fps is (10 sec: 45875.1, 60 sec: 42871.5, 300 sec: 43098.3). Total num frames: 52396032. Throughput: 0: 43101.8. Samples: 52462980. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-27 11:10:52,704][00794] Avg episode reward: [(0, '0.005')] [2024-06-27 11:10:53,348][01035] Updated weights for policy 0, policy_version 3200 (0.0034) [2024-06-27 11:10:57,704][00794] Fps is (10 sec: 42598.7, 60 sec: 43144.5, 300 sec: 43042.7). Total num frames: 52576256. Throughput: 0: 43094.7. Samples: 52718660. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-27 11:10:57,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:10:57,818][01035] Updated weights for policy 0, policy_version 3210 (0.0041) [2024-06-27 11:11:00,941][01035] Updated weights for policy 0, policy_version 3220 (0.0035) [2024-06-27 11:11:02,704][00794] Fps is (10 sec: 40960.2, 60 sec: 43145.8, 300 sec: 42931.6). Total num frames: 52805632. Throughput: 0: 43106.2. Samples: 52982400. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 11:11:02,704][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:11:05,490][01035] Updated weights for policy 0, policy_version 3230 (0.0029) [2024-06-27 11:11:07,704][00794] Fps is (10 sec: 47512.8, 60 sec: 43417.6, 300 sec: 43153.8). Total num frames: 53051392. Throughput: 0: 43117.7. Samples: 53113140. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-27 11:11:07,705][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:11:08,515][01035] Updated weights for policy 0, policy_version 3240 (0.0037) [2024-06-27 11:11:12,704][00794] Fps is (10 sec: 42597.7, 60 sec: 43144.4, 300 sec: 43098.8). Total num frames: 53231616. Throughput: 0: 42973.6. Samples: 53360600. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-27 11:11:12,704][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:11:12,726][01015] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000003249_53231616.pth... [2024-06-27 11:11:12,786][01015] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000002620_42926080.pth [2024-06-27 11:11:13,033][01035] Updated weights for policy 0, policy_version 3250 (0.0027) [2024-06-27 11:11:16,142][01035] Updated weights for policy 0, policy_version 3260 (0.0029) [2024-06-27 11:11:17,704][00794] Fps is (10 sec: 40960.1, 60 sec: 43144.5, 300 sec: 43042.7). Total num frames: 53460992. Throughput: 0: 42916.8. Samples: 53622580. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 11:11:17,704][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:11:20,451][01035] Updated weights for policy 0, policy_version 3270 (0.0033) [2024-06-27 11:11:22,704][00794] Fps is (10 sec: 44237.2, 60 sec: 42871.4, 300 sec: 43098.2). Total num frames: 53673984. Throughput: 0: 42938.2. Samples: 53753440. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-27 11:11:22,704][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:11:23,649][01035] Updated weights for policy 0, policy_version 3280 (0.0037) [2024-06-27 11:11:27,704][00794] Fps is (10 sec: 40960.5, 60 sec: 43144.5, 300 sec: 43042.7). Total num frames: 53870592. Throughput: 0: 43031.6. Samples: 54010780. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-27 11:11:27,704][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:11:27,994][01035] Updated weights for policy 0, policy_version 3290 (0.0045) [2024-06-27 11:11:31,167][01035] Updated weights for policy 0, policy_version 3300 (0.0031) [2024-06-27 11:11:32,704][00794] Fps is (10 sec: 42598.3, 60 sec: 43146.4, 300 sec: 42987.2). Total num frames: 54099968. Throughput: 0: 42960.8. Samples: 54272260. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-27 11:11:32,705][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:11:35,546][01035] Updated weights for policy 0, policy_version 3310 (0.0027) [2024-06-27 11:11:37,704][00794] Fps is (10 sec: 44236.1, 60 sec: 42598.4, 300 sec: 43042.7). Total num frames: 54312960. Throughput: 0: 43096.0. Samples: 54402300. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 11:11:37,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:11:38,724][01035] Updated weights for policy 0, policy_version 3320 (0.0027) [2024-06-27 11:11:42,704][00794] Fps is (10 sec: 42598.7, 60 sec: 43144.5, 300 sec: 43042.8). Total num frames: 54525952. Throughput: 0: 43136.8. Samples: 54659820. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-27 11:11:42,704][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 11:11:43,162][01035] Updated weights for policy 0, policy_version 3330 (0.0031) [2024-06-27 11:11:46,648][01035] Updated weights for policy 0, policy_version 3340 (0.0037) [2024-06-27 11:11:47,708][00794] Fps is (10 sec: 44220.1, 60 sec: 43414.8, 300 sec: 43153.2). Total num frames: 54755328. Throughput: 0: 42940.3. Samples: 54914880. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-27 11:11:47,708][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:11:50,860][01035] Updated weights for policy 0, policy_version 3350 (0.0037) [2024-06-27 11:11:52,704][00794] Fps is (10 sec: 44236.9, 60 sec: 42871.5, 300 sec: 43042.7). Total num frames: 54968320. Throughput: 0: 43053.9. Samples: 55050560. Policy #0 lag: (min: 1.0, avg: 8.8, max: 21.0) [2024-06-27 11:11:52,704][00794] Avg episode reward: [(0, '0.005')] [2024-06-27 11:11:54,213][01035] Updated weights for policy 0, policy_version 3360 (0.0032) [2024-06-27 11:11:57,704][00794] Fps is (10 sec: 39336.4, 60 sec: 42871.3, 300 sec: 42931.6). Total num frames: 55148544. Throughput: 0: 43122.7. Samples: 55301120. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 11:11:57,705][00794] Avg episode reward: [(0, '0.006')] [2024-06-27 11:11:58,456][01035] Updated weights for policy 0, policy_version 3370 (0.0027) [2024-06-27 11:12:01,881][01035] Updated weights for policy 0, policy_version 3380 (0.0039) [2024-06-27 11:12:02,704][00794] Fps is (10 sec: 42598.4, 60 sec: 43144.5, 300 sec: 43098.2). Total num frames: 55394304. Throughput: 0: 43135.6. Samples: 55563680. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 11:12:02,704][00794] Avg episode reward: [(0, '0.006')] [2024-06-27 11:12:06,010][01035] Updated weights for policy 0, policy_version 3390 (0.0028) [2024-06-27 11:12:07,708][00794] Fps is (10 sec: 47494.6, 60 sec: 42868.6, 300 sec: 43097.6). Total num frames: 55623680. Throughput: 0: 43205.5. Samples: 55697860. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-27 11:12:07,708][00794] Avg episode reward: [(0, '0.005')] [2024-06-27 11:12:09,451][01035] Updated weights for policy 0, policy_version 3400 (0.0032) [2024-06-27 11:12:12,704][00794] Fps is (10 sec: 40960.1, 60 sec: 42871.6, 300 sec: 42931.6). Total num frames: 55803904. Throughput: 0: 42995.1. Samples: 55945560. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-27 11:12:12,704][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:12:13,643][01035] Updated weights for policy 0, policy_version 3410 (0.0045) [2024-06-27 11:12:17,408][01035] Updated weights for policy 0, policy_version 3420 (0.0037) [2024-06-27 11:12:17,704][00794] Fps is (10 sec: 40976.4, 60 sec: 42871.5, 300 sec: 43042.7). Total num frames: 56033280. Throughput: 0: 42971.1. Samples: 56205960. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-27 11:12:17,704][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:12:21,173][01035] Updated weights for policy 0, policy_version 3430 (0.0055) [2024-06-27 11:12:22,704][00794] Fps is (10 sec: 44236.4, 60 sec: 42871.5, 300 sec: 42987.2). Total num frames: 56246272. Throughput: 0: 43131.6. Samples: 56343220. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 11:12:22,704][00794] Avg episode reward: [(0, '0.006')] [2024-06-27 11:12:22,935][01015] Signal inference workers to stop experience collection... (700 times) [2024-06-27 11:12:22,979][01035] InferenceWorker_p0-w0: stopping experience collection (700 times) [2024-06-27 11:12:22,986][01015] Signal inference workers to resume experience collection... (700 times) [2024-06-27 11:12:22,997][01035] InferenceWorker_p0-w0: resuming experience collection (700 times) [2024-06-27 11:12:25,123][01035] Updated weights for policy 0, policy_version 3440 (0.0031) [2024-06-27 11:12:27,704][00794] Fps is (10 sec: 42598.7, 60 sec: 43144.5, 300 sec: 42987.2). Total num frames: 56459264. Throughput: 0: 42828.9. Samples: 56587120. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 11:12:27,708][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:12:28,782][01035] Updated weights for policy 0, policy_version 3450 (0.0036) [2024-06-27 11:12:32,704][00794] Fps is (10 sec: 42598.3, 60 sec: 42871.5, 300 sec: 42987.2). Total num frames: 56672256. Throughput: 0: 42966.2. Samples: 56848200. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 11:12:32,705][00794] Avg episode reward: [(0, '0.005')] [2024-06-27 11:12:32,885][01035] Updated weights for policy 0, policy_version 3460 (0.0041) [2024-06-27 11:12:36,265][01035] Updated weights for policy 0, policy_version 3470 (0.0029) [2024-06-27 11:12:37,704][00794] Fps is (10 sec: 44236.3, 60 sec: 43144.5, 300 sec: 43042.7). Total num frames: 56901632. Throughput: 0: 42962.1. Samples: 56983860. Policy #0 lag: (min: 0.0, avg: 11.4, max: 20.0) [2024-06-27 11:12:37,705][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:12:40,527][01035] Updated weights for policy 0, policy_version 3480 (0.0034) [2024-06-27 11:12:42,704][00794] Fps is (10 sec: 44237.3, 60 sec: 43144.6, 300 sec: 43042.7). Total num frames: 57114624. Throughput: 0: 42967.7. Samples: 57234660. Policy #0 lag: (min: 0.0, avg: 11.3, max: 25.0) [2024-06-27 11:12:42,704][00794] Avg episode reward: [(0, '0.001')] [2024-06-27 11:12:44,345][01035] Updated weights for policy 0, policy_version 3490 (0.0045) [2024-06-27 11:12:47,704][00794] Fps is (10 sec: 40960.2, 60 sec: 42601.1, 300 sec: 42987.2). Total num frames: 57311232. Throughput: 0: 42817.3. Samples: 57490460. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-27 11:12:47,704][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:12:48,176][01035] Updated weights for policy 0, policy_version 3500 (0.0041) [2024-06-27 11:12:51,818][01035] Updated weights for policy 0, policy_version 3510 (0.0035) [2024-06-27 11:12:52,704][00794] Fps is (10 sec: 39321.5, 60 sec: 42325.3, 300 sec: 42876.7). Total num frames: 57507840. Throughput: 0: 42701.6. Samples: 57619260. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 11:12:52,704][00794] Avg episode reward: [(0, '0.005')] [2024-06-27 11:12:55,879][01035] Updated weights for policy 0, policy_version 3520 (0.0024) [2024-06-27 11:12:57,704][00794] Fps is (10 sec: 45875.3, 60 sec: 43690.7, 300 sec: 43098.2). Total num frames: 57769984. Throughput: 0: 42954.2. Samples: 57878500. Policy #0 lag: (min: 0.0, avg: 10.9, max: 24.0) [2024-06-27 11:12:57,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:12:59,423][01035] Updated weights for policy 0, policy_version 3530 (0.0026) [2024-06-27 11:13:02,704][00794] Fps is (10 sec: 45874.9, 60 sec: 42871.4, 300 sec: 43042.7). Total num frames: 57966592. Throughput: 0: 42702.7. Samples: 58127580. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 11:13:02,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:13:03,621][01035] Updated weights for policy 0, policy_version 3540 (0.0038) [2024-06-27 11:13:06,990][01035] Updated weights for policy 0, policy_version 3550 (0.0023) [2024-06-27 11:13:07,704][00794] Fps is (10 sec: 39321.6, 60 sec: 42328.2, 300 sec: 42931.6). Total num frames: 58163200. Throughput: 0: 42340.0. Samples: 58248520. Policy #0 lag: (min: 0.0, avg: 11.6, max: 24.0) [2024-06-27 11:13:07,704][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:13:11,127][01035] Updated weights for policy 0, policy_version 3560 (0.0038) [2024-06-27 11:13:12,704][00794] Fps is (10 sec: 44236.2, 60 sec: 43417.4, 300 sec: 43042.7). Total num frames: 58408960. Throughput: 0: 42810.5. Samples: 58513600. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-27 11:13:12,704][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:13:12,716][01015] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000003565_58408960.pth... [2024-06-27 11:13:12,766][01015] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000002933_48054272.pth [2024-06-27 11:13:14,488][01035] Updated weights for policy 0, policy_version 3570 (0.0023) [2024-06-27 11:13:17,704][00794] Fps is (10 sec: 44236.6, 60 sec: 42871.5, 300 sec: 42987.8). Total num frames: 58605568. Throughput: 0: 42655.1. Samples: 58767680. Policy #0 lag: (min: 0.0, avg: 11.7, max: 21.0) [2024-06-27 11:13:17,704][00794] Avg episode reward: [(0, '0.005')] [2024-06-27 11:13:18,668][01035] Updated weights for policy 0, policy_version 3580 (0.0043) [2024-06-27 11:13:22,055][01035] Updated weights for policy 0, policy_version 3590 (0.0030) [2024-06-27 11:13:22,704][00794] Fps is (10 sec: 40960.7, 60 sec: 42871.5, 300 sec: 42931.6). Total num frames: 58818560. Throughput: 0: 42461.4. Samples: 58894620. Policy #0 lag: (min: 1.0, avg: 10.0, max: 21.0) [2024-06-27 11:13:22,704][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:13:26,109][01035] Updated weights for policy 0, policy_version 3600 (0.0037) [2024-06-27 11:13:27,704][00794] Fps is (10 sec: 42599.0, 60 sec: 42871.5, 300 sec: 42932.2). Total num frames: 59031552. Throughput: 0: 42724.0. Samples: 59157240. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 11:13:27,704][00794] Avg episode reward: [(0, '0.006')] [2024-06-27 11:13:29,388][01015] Signal inference workers to stop experience collection... (750 times) [2024-06-27 11:13:29,388][01015] Signal inference workers to resume experience collection... (750 times) [2024-06-27 11:13:29,426][01035] InferenceWorker_p0-w0: stopping experience collection (750 times) [2024-06-27 11:13:29,426][01035] InferenceWorker_p0-w0: resuming experience collection (750 times) [2024-06-27 11:13:29,523][01035] Updated weights for policy 0, policy_version 3610 (0.0042) [2024-06-27 11:13:32,704][00794] Fps is (10 sec: 42597.9, 60 sec: 42871.4, 300 sec: 42987.1). Total num frames: 59244544. Throughput: 0: 42740.8. Samples: 59413800. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 11:13:32,705][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:13:34,049][01035] Updated weights for policy 0, policy_version 3620 (0.0038) [2024-06-27 11:13:37,580][01035] Updated weights for policy 0, policy_version 3630 (0.0037) [2024-06-27 11:13:37,704][00794] Fps is (10 sec: 44236.4, 60 sec: 42871.5, 300 sec: 43042.7). Total num frames: 59473920. Throughput: 0: 42642.6. Samples: 59538180. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-27 11:13:37,704][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:13:41,583][01035] Updated weights for policy 0, policy_version 3640 (0.0031) [2024-06-27 11:13:42,704][00794] Fps is (10 sec: 42598.5, 60 sec: 42598.3, 300 sec: 42987.2). Total num frames: 59670528. Throughput: 0: 42585.2. Samples: 59794840. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 11:13:42,704][00794] Avg episode reward: [(0, '0.005')] [2024-06-27 11:13:45,419][01035] Updated weights for policy 0, policy_version 3650 (0.0044) [2024-06-27 11:13:47,708][00794] Fps is (10 sec: 42581.2, 60 sec: 43141.7, 300 sec: 42931.1). Total num frames: 59899904. Throughput: 0: 42754.0. Samples: 60051680. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-27 11:13:47,708][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:13:49,171][01035] Updated weights for policy 0, policy_version 3660 (0.0033) [2024-06-27 11:13:52,704][00794] Fps is (10 sec: 42598.5, 60 sec: 43144.4, 300 sec: 42987.1). Total num frames: 60096512. Throughput: 0: 42954.1. Samples: 60181460. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 11:13:52,704][00794] Avg episode reward: [(0, '0.005')] [2024-06-27 11:13:53,078][01035] Updated weights for policy 0, policy_version 3670 (0.0032) [2024-06-27 11:13:56,767][01035] Updated weights for policy 0, policy_version 3680 (0.0045) [2024-06-27 11:13:57,704][00794] Fps is (10 sec: 40976.8, 60 sec: 42325.4, 300 sec: 42987.2). Total num frames: 60309504. Throughput: 0: 42833.1. Samples: 60441080. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 11:13:57,704][00794] Avg episode reward: [(0, '0.005')] [2024-06-27 11:14:00,519][01035] Updated weights for policy 0, policy_version 3690 (0.0036) [2024-06-27 11:14:02,708][00794] Fps is (10 sec: 44219.4, 60 sec: 42868.6, 300 sec: 42875.5). Total num frames: 60538880. Throughput: 0: 42869.5. Samples: 60696980. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 11:14:02,708][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:14:04,683][01035] Updated weights for policy 0, policy_version 3700 (0.0036) [2024-06-27 11:14:07,708][00794] Fps is (10 sec: 42581.1, 60 sec: 42868.6, 300 sec: 42986.6). Total num frames: 60735488. Throughput: 0: 42926.4. Samples: 60826480. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 11:14:07,708][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:14:08,155][01035] Updated weights for policy 0, policy_version 3710 (0.0043) [2024-06-27 11:14:12,237][01035] Updated weights for policy 0, policy_version 3720 (0.0034) [2024-06-27 11:14:12,704][00794] Fps is (10 sec: 40976.3, 60 sec: 42325.4, 300 sec: 42876.1). Total num frames: 60948480. Throughput: 0: 42790.5. Samples: 61082820. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-27 11:14:12,705][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:14:15,813][01035] Updated weights for policy 0, policy_version 3730 (0.0042) [2024-06-27 11:14:17,704][00794] Fps is (10 sec: 45893.1, 60 sec: 43144.5, 300 sec: 42987.2). Total num frames: 61194240. Throughput: 0: 42799.6. Samples: 61339780. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 11:14:17,705][00794] Avg episode reward: [(0, '0.005')] [2024-06-27 11:14:20,264][01035] Updated weights for policy 0, policy_version 3740 (0.0042) [2024-06-27 11:14:22,704][00794] Fps is (10 sec: 44236.6, 60 sec: 42871.4, 300 sec: 42987.7). Total num frames: 61390848. Throughput: 0: 42923.9. Samples: 61469760. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 11:14:22,705][00794] Avg episode reward: [(0, '0.005')] [2024-06-27 11:14:23,605][01035] Updated weights for policy 0, policy_version 3750 (0.0034) [2024-06-27 11:14:27,704][00794] Fps is (10 sec: 39322.5, 60 sec: 42598.4, 300 sec: 42820.6). Total num frames: 61587456. Throughput: 0: 42941.5. Samples: 61727200. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 11:14:27,704][00794] Avg episode reward: [(0, '0.005')] [2024-06-27 11:14:27,791][01035] Updated weights for policy 0, policy_version 3760 (0.0040) [2024-06-27 11:14:31,248][01035] Updated weights for policy 0, policy_version 3770 (0.0037) [2024-06-27 11:14:32,704][00794] Fps is (10 sec: 44237.1, 60 sec: 43144.6, 300 sec: 42931.6). Total num frames: 61833216. Throughput: 0: 42946.9. Samples: 61984120. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-27 11:14:32,705][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:14:35,607][01035] Updated weights for policy 0, policy_version 3780 (0.0032) [2024-06-27 11:14:37,704][00794] Fps is (10 sec: 45874.5, 60 sec: 42871.4, 300 sec: 43043.3). Total num frames: 62046208. Throughput: 0: 42960.0. Samples: 62114660. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 11:14:37,704][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 11:14:39,036][01035] Updated weights for policy 0, policy_version 3790 (0.0045) [2024-06-27 11:14:42,704][00794] Fps is (10 sec: 40959.7, 60 sec: 42871.5, 300 sec: 42931.6). Total num frames: 62242816. Throughput: 0: 42914.1. Samples: 62372220. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-27 11:14:42,705][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 11:14:43,073][01035] Updated weights for policy 0, policy_version 3800 (0.0038) [2024-06-27 11:14:46,541][01035] Updated weights for policy 0, policy_version 3810 (0.0027) [2024-06-27 11:14:47,704][00794] Fps is (10 sec: 40960.6, 60 sec: 42601.3, 300 sec: 42820.6). Total num frames: 62455808. Throughput: 0: 43032.0. Samples: 62633240. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 11:14:47,704][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 11:14:50,626][01035] Updated weights for policy 0, policy_version 3820 (0.0030) [2024-06-27 11:14:52,704][00794] Fps is (10 sec: 44237.0, 60 sec: 43144.6, 300 sec: 43042.7). Total num frames: 62685184. Throughput: 0: 42993.1. Samples: 62761000. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-27 11:14:52,705][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:14:54,058][01035] Updated weights for policy 0, policy_version 3830 (0.0026) [2024-06-27 11:14:57,704][00794] Fps is (10 sec: 42598.4, 60 sec: 42871.5, 300 sec: 42931.9). Total num frames: 62881792. Throughput: 0: 43045.9. Samples: 63019880. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-27 11:14:57,704][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:14:58,092][01035] Updated weights for policy 0, policy_version 3840 (0.0041) [2024-06-27 11:15:01,635][01035] Updated weights for policy 0, policy_version 3850 (0.0032) [2024-06-27 11:15:02,702][01015] Signal inference workers to stop experience collection... (800 times) [2024-06-27 11:15:02,708][00794] Fps is (10 sec: 40943.7, 60 sec: 42598.4, 300 sec: 42875.5). Total num frames: 63094784. Throughput: 0: 43112.2. Samples: 63280000. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-27 11:15:02,708][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:15:02,727][01035] InferenceWorker_p0-w0: stopping experience collection (800 times) [2024-06-27 11:15:02,759][01015] Signal inference workers to resume experience collection... (800 times) [2024-06-27 11:15:02,760][01035] InferenceWorker_p0-w0: resuming experience collection (800 times) [2024-06-27 11:15:05,589][01035] Updated weights for policy 0, policy_version 3860 (0.0025) [2024-06-27 11:15:07,704][00794] Fps is (10 sec: 44236.3, 60 sec: 43147.4, 300 sec: 42987.2). Total num frames: 63324160. Throughput: 0: 43147.6. Samples: 63411400. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2024-06-27 11:15:07,705][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 11:15:09,165][01035] Updated weights for policy 0, policy_version 3870 (0.0031) [2024-06-27 11:15:12,704][00794] Fps is (10 sec: 44254.8, 60 sec: 43144.6, 300 sec: 42931.6). Total num frames: 63537152. Throughput: 0: 43079.9. Samples: 63665800. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-27 11:15:12,704][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 11:15:12,721][01015] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000003878_63537152.pth... [2024-06-27 11:15:12,782][01015] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000003249_53231616.pth [2024-06-27 11:15:13,078][01035] Updated weights for policy 0, policy_version 3880 (0.0036) [2024-06-27 11:15:16,720][01035] Updated weights for policy 0, policy_version 3890 (0.0038) [2024-06-27 11:15:17,704][00794] Fps is (10 sec: 42598.7, 60 sec: 42598.5, 300 sec: 42876.1). Total num frames: 63750144. Throughput: 0: 43038.3. Samples: 63920840. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 11:15:17,704][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:15:20,670][01035] Updated weights for policy 0, policy_version 3900 (0.0039) [2024-06-27 11:15:22,704][00794] Fps is (10 sec: 42598.8, 60 sec: 42871.6, 300 sec: 42987.2). Total num frames: 63963136. Throughput: 0: 43026.4. Samples: 64050840. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 11:15:22,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:15:24,462][01035] Updated weights for policy 0, policy_version 3910 (0.0031) [2024-06-27 11:15:27,704][00794] Fps is (10 sec: 44236.8, 60 sec: 43417.6, 300 sec: 42987.6). Total num frames: 64192512. Throughput: 0: 42961.9. Samples: 64305500. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 11:15:27,704][00794] Avg episode reward: [(0, '0.006')] [2024-06-27 11:15:28,410][01035] Updated weights for policy 0, policy_version 3920 (0.0035) [2024-06-27 11:15:32,158][01035] Updated weights for policy 0, policy_version 3930 (0.0029) [2024-06-27 11:15:32,704][00794] Fps is (10 sec: 44236.4, 60 sec: 42871.5, 300 sec: 42876.1). Total num frames: 64405504. Throughput: 0: 42886.6. Samples: 64563140. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 11:15:32,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:15:36,017][01035] Updated weights for policy 0, policy_version 3940 (0.0036) [2024-06-27 11:15:37,708][00794] Fps is (10 sec: 40943.3, 60 sec: 42595.6, 300 sec: 42931.0). Total num frames: 64602112. Throughput: 0: 42920.2. Samples: 64692580. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 11:15:37,708][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:15:39,702][01035] Updated weights for policy 0, policy_version 3950 (0.0038) [2024-06-27 11:15:42,704][00794] Fps is (10 sec: 42598.1, 60 sec: 43144.6, 300 sec: 42987.2). Total num frames: 64831488. Throughput: 0: 42873.2. Samples: 64949180. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 11:15:42,705][00794] Avg episode reward: [(0, '0.005')] [2024-06-27 11:15:43,685][01035] Updated weights for policy 0, policy_version 3960 (0.0032) [2024-06-27 11:15:47,266][01035] Updated weights for policy 0, policy_version 3970 (0.0027) [2024-06-27 11:15:47,704][00794] Fps is (10 sec: 45893.8, 60 sec: 43417.6, 300 sec: 42931.6). Total num frames: 65060864. Throughput: 0: 42856.8. Samples: 65208380. Policy #0 lag: (min: 1.0, avg: 9.3, max: 20.0) [2024-06-27 11:15:47,704][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:15:51,170][01035] Updated weights for policy 0, policy_version 3980 (0.0036) [2024-06-27 11:15:52,704][00794] Fps is (10 sec: 42598.4, 60 sec: 42871.5, 300 sec: 42987.1). Total num frames: 65257472. Throughput: 0: 42737.3. Samples: 65334580. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 11:15:52,704][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:15:54,915][01035] Updated weights for policy 0, policy_version 3990 (0.0040) [2024-06-27 11:15:57,704][00794] Fps is (10 sec: 42598.3, 60 sec: 43417.6, 300 sec: 42987.2). Total num frames: 65486848. Throughput: 0: 42932.0. Samples: 65597740. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 11:15:57,704][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:15:58,776][01035] Updated weights for policy 0, policy_version 4000 (0.0021) [2024-06-27 11:16:02,506][01035] Updated weights for policy 0, policy_version 4010 (0.0038) [2024-06-27 11:16:02,704][00794] Fps is (10 sec: 44236.9, 60 sec: 43420.5, 300 sec: 42876.1). Total num frames: 65699840. Throughput: 0: 43003.0. Samples: 65855980. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 11:16:02,704][00794] Avg episode reward: [(0, '0.005')] [2024-06-27 11:16:06,531][01035] Updated weights for policy 0, policy_version 4020 (0.0032) [2024-06-27 11:16:07,704][00794] Fps is (10 sec: 40960.1, 60 sec: 42871.5, 300 sec: 42931.7). Total num frames: 65896448. Throughput: 0: 42936.4. Samples: 65982980. Policy #0 lag: (min: 0.0, avg: 10.7, max: 25.0) [2024-06-27 11:16:07,704][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:16:10,234][01035] Updated weights for policy 0, policy_version 4030 (0.0027) [2024-06-27 11:16:12,704][00794] Fps is (10 sec: 42598.3, 60 sec: 43144.5, 300 sec: 42931.6). Total num frames: 66125824. Throughput: 0: 43167.9. Samples: 66248060. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 11:16:12,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:16:14,118][01035] Updated weights for policy 0, policy_version 4040 (0.0039) [2024-06-27 11:16:17,704][00794] Fps is (10 sec: 44236.6, 60 sec: 43144.5, 300 sec: 42931.6). Total num frames: 66338816. Throughput: 0: 43167.5. Samples: 66505680. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 11:16:17,704][00794] Avg episode reward: [(0, '0.005')] [2024-06-27 11:16:17,787][01035] Updated weights for policy 0, policy_version 4050 (0.0031) [2024-06-27 11:16:21,711][01035] Updated weights for policy 0, policy_version 4060 (0.0023) [2024-06-27 11:16:22,705][00794] Fps is (10 sec: 40953.4, 60 sec: 42870.2, 300 sec: 42931.4). Total num frames: 66535424. Throughput: 0: 43160.5. Samples: 66634700. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-27 11:16:22,706][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:16:25,461][01035] Updated weights for policy 0, policy_version 4070 (0.0032) [2024-06-27 11:16:26,255][01015] Signal inference workers to stop experience collection... (850 times) [2024-06-27 11:16:26,286][01035] InferenceWorker_p0-w0: stopping experience collection (850 times) [2024-06-27 11:16:26,370][01015] Signal inference workers to resume experience collection... (850 times) [2024-06-27 11:16:26,370][01035] InferenceWorker_p0-w0: resuming experience collection (850 times) [2024-06-27 11:16:27,704][00794] Fps is (10 sec: 42598.4, 60 sec: 42871.4, 300 sec: 42931.6). Total num frames: 66764800. Throughput: 0: 43131.6. Samples: 66890100. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-27 11:16:27,716][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 11:16:29,296][01035] Updated weights for policy 0, policy_version 4080 (0.0028) [2024-06-27 11:16:32,704][00794] Fps is (10 sec: 44243.9, 60 sec: 42871.4, 300 sec: 42931.6). Total num frames: 66977792. Throughput: 0: 43031.5. Samples: 67144800. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 11:16:32,704][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:16:33,093][01035] Updated weights for policy 0, policy_version 4090 (0.0033) [2024-06-27 11:16:36,878][01035] Updated weights for policy 0, policy_version 4100 (0.0027) [2024-06-27 11:16:37,704][00794] Fps is (10 sec: 42598.2, 60 sec: 43147.4, 300 sec: 42931.6). Total num frames: 67190784. Throughput: 0: 43162.2. Samples: 67276880. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-27 11:16:37,704][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 11:16:40,506][01035] Updated weights for policy 0, policy_version 4110 (0.0031) [2024-06-27 11:16:42,704][00794] Fps is (10 sec: 44236.8, 60 sec: 43144.5, 300 sec: 42932.2). Total num frames: 67420160. Throughput: 0: 43132.8. Samples: 67538720. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 11:16:42,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:16:44,300][01035] Updated weights for policy 0, policy_version 4120 (0.0037) [2024-06-27 11:16:47,704][00794] Fps is (10 sec: 44237.2, 60 sec: 42871.5, 300 sec: 42931.6). Total num frames: 67633152. Throughput: 0: 43009.4. Samples: 67791400. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 11:16:47,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:16:48,243][01035] Updated weights for policy 0, policy_version 4130 (0.0036) [2024-06-27 11:16:51,897][01035] Updated weights for policy 0, policy_version 4140 (0.0037) [2024-06-27 11:16:52,704][00794] Fps is (10 sec: 42598.7, 60 sec: 43144.6, 300 sec: 43042.7). Total num frames: 67846144. Throughput: 0: 43168.0. Samples: 67925540. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-27 11:16:52,704][00794] Avg episode reward: [(0, '0.005')] [2024-06-27 11:16:55,698][01035] Updated weights for policy 0, policy_version 4150 (0.0027) [2024-06-27 11:16:57,704][00794] Fps is (10 sec: 42598.7, 60 sec: 42871.5, 300 sec: 42931.6). Total num frames: 68059136. Throughput: 0: 42980.6. Samples: 68182180. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-27 11:16:57,704][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:16:59,411][01035] Updated weights for policy 0, policy_version 4160 (0.0038) [2024-06-27 11:17:02,704][00794] Fps is (10 sec: 44236.8, 60 sec: 43144.6, 300 sec: 42932.2). Total num frames: 68288512. Throughput: 0: 43087.1. Samples: 68444600. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-27 11:17:02,704][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:17:03,178][01035] Updated weights for policy 0, policy_version 4170 (0.0038) [2024-06-27 11:17:06,938][01035] Updated weights for policy 0, policy_version 4180 (0.0028) [2024-06-27 11:17:07,704][00794] Fps is (10 sec: 44236.0, 60 sec: 43417.5, 300 sec: 43042.7). Total num frames: 68501504. Throughput: 0: 43221.1. Samples: 68579580. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-27 11:17:07,704][00794] Avg episode reward: [(0, '0.005')] [2024-06-27 11:17:10,706][01035] Updated weights for policy 0, policy_version 4190 (0.0035) [2024-06-27 11:17:12,708][00794] Fps is (10 sec: 40943.3, 60 sec: 42868.6, 300 sec: 42931.0). Total num frames: 68698112. Throughput: 0: 43166.3. Samples: 68832760. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 11:17:12,708][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:17:12,723][01015] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000004193_68698112.pth... [2024-06-27 11:17:12,772][01015] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000003565_58408960.pth [2024-06-27 11:17:14,413][01035] Updated weights for policy 0, policy_version 4200 (0.0028) [2024-06-27 11:17:17,704][00794] Fps is (10 sec: 44237.2, 60 sec: 43417.6, 300 sec: 43042.7). Total num frames: 68943872. Throughput: 0: 43365.0. Samples: 69096220. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-27 11:17:17,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:17:18,315][01035] Updated weights for policy 0, policy_version 4210 (0.0026) [2024-06-27 11:17:21,869][01035] Updated weights for policy 0, policy_version 4220 (0.0031) [2024-06-27 11:17:22,704][00794] Fps is (10 sec: 44254.0, 60 sec: 43418.7, 300 sec: 42987.1). Total num frames: 69140480. Throughput: 0: 43456.7. Samples: 69232440. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-27 11:17:22,704][00794] Avg episode reward: [(0, '0.003')] [2024-06-27 11:17:26,237][01035] Updated weights for policy 0, policy_version 4230 (0.0028) [2024-06-27 11:17:27,704][00794] Fps is (10 sec: 40960.0, 60 sec: 43144.6, 300 sec: 42987.2). Total num frames: 69353472. Throughput: 0: 43295.6. Samples: 69487020. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-27 11:17:27,704][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 11:17:29,586][01035] Updated weights for policy 0, policy_version 4240 (0.0054) [2024-06-27 11:17:32,704][00794] Fps is (10 sec: 45876.5, 60 sec: 43690.8, 300 sec: 43042.7). Total num frames: 69599232. Throughput: 0: 43420.9. Samples: 69745340. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-27 11:17:32,704][00794] Avg episode reward: [(0, '0.002')] [2024-06-27 11:17:33,653][01035] Updated weights for policy 0, policy_version 4250 (0.0038) [2024-06-27 11:17:37,708][00794] Fps is (10 sec: 42580.9, 60 sec: 43141.6, 300 sec: 42931.0). Total num frames: 69779456. Throughput: 0: 43380.0. Samples: 69877820. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-27 11:17:37,717][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 11:17:37,903][01035] Updated weights for policy 0, policy_version 4260 (0.0032) [2024-06-27 11:17:41,297][01035] Updated weights for policy 0, policy_version 4270 (0.0030) [2024-06-27 11:17:42,704][00794] Fps is (10 sec: 39321.2, 60 sec: 42871.5, 300 sec: 42987.2). Total num frames: 69992448. Throughput: 0: 43396.3. Samples: 70135020. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 11:17:42,705][00794] Avg episode reward: [(0, '0.004')] [2024-06-27 13:19:28,599][03472] Saving configuration to ./train_dir/sample_factory/p2.sf/config.json... [2024-06-27 13:19:28,665][03472] Rollout worker 0 uses device cpu [2024-06-27 13:19:28,666][03472] Rollout worker 1 uses device cpu [2024-06-27 13:19:28,667][03472] Rollout worker 2 uses device cpu [2024-06-27 13:19:28,667][03472] Rollout worker 3 uses device cpu [2024-06-27 13:19:28,668][03472] Rollout worker 4 uses device cpu [2024-06-27 13:19:28,668][03472] Rollout worker 5 uses device cpu [2024-06-27 13:19:28,669][03472] Rollout worker 6 uses device cpu [2024-06-27 13:19:28,669][03472] Rollout worker 7 uses device cpu [2024-06-27 13:19:28,670][03472] Rollout worker 8 uses device cpu [2024-06-27 13:19:28,670][03472] Rollout worker 9 uses device cpu [2024-06-27 13:19:28,670][03472] Rollout worker 10 uses device cpu [2024-06-27 13:19:28,671][03472] Rollout worker 11 uses device cpu [2024-06-27 13:19:28,671][03472] Rollout worker 12 uses device cpu [2024-06-27 13:19:28,672][03472] Rollout worker 13 uses device cpu [2024-06-27 13:19:28,672][03472] Rollout worker 14 uses device cpu [2024-06-27 13:19:28,673][03472] Rollout worker 15 uses device cpu [2024-06-27 13:19:28,673][03472] Rollout worker 16 uses device cpu [2024-06-27 13:19:28,673][03472] Rollout worker 17 uses device cpu [2024-06-27 13:19:28,673][03472] Rollout worker 18 uses device cpu [2024-06-27 13:19:28,673][03472] Rollout worker 19 uses device cpu [2024-06-27 13:19:28,673][03472] Rollout worker 20 uses device cpu [2024-06-27 13:19:28,673][03472] Rollout worker 21 uses device cpu [2024-06-27 13:19:28,674][03472] Rollout worker 22 uses device cpu [2024-06-27 13:19:28,674][03472] Rollout worker 23 uses device cpu [2024-06-27 13:19:28,674][03472] Rollout worker 24 uses device cpu [2024-06-27 13:19:28,674][03472] Rollout worker 25 uses device cpu [2024-06-27 13:19:28,674][03472] Rollout worker 26 uses device cpu [2024-06-27 13:19:28,674][03472] Rollout worker 27 uses device cpu [2024-06-27 13:19:28,674][03472] Rollout worker 28 uses device cpu [2024-06-27 13:19:28,674][03472] Rollout worker 29 uses device cpu [2024-06-27 13:19:28,674][03472] Rollout worker 30 uses device cpu [2024-06-27 13:19:28,674][03472] Rollout worker 31 uses device cpu [2024-06-27 13:19:29,261][03472] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2024-06-27 13:19:29,262][03472] InferenceWorker_p0-w0: min num requests: 10 [2024-06-27 13:19:29,319][03472] Starting all processes... [2024-06-27 13:19:29,320][03472] Starting process learner_proc0 [2024-06-27 13:19:29,593][03472] Starting all processes... [2024-06-27 13:19:29,595][03472] Starting process inference_proc0-0 [2024-06-27 13:19:29,596][03472] Starting process rollout_proc0 [2024-06-27 13:19:29,596][03472] Starting process rollout_proc1 [2024-06-27 13:19:29,597][03472] Starting process rollout_proc2 [2024-06-27 13:19:29,599][03472] Starting process rollout_proc3 [2024-06-27 13:19:29,599][03472] Starting process rollout_proc4 [2024-06-27 13:19:29,599][03472] Starting process rollout_proc5 [2024-06-27 13:19:29,600][03472] Starting process rollout_proc6 [2024-06-27 13:19:29,601][03472] Starting process rollout_proc7 [2024-06-27 13:19:29,603][03472] Starting process rollout_proc8 [2024-06-27 13:19:29,603][03472] Starting process rollout_proc9 [2024-06-27 13:19:29,605][03472] Starting process rollout_proc10 [2024-06-27 13:19:29,605][03472] Starting process rollout_proc11 [2024-06-27 13:19:29,605][03472] Starting process rollout_proc12 [2024-06-27 13:19:29,605][03472] Starting process rollout_proc13 [2024-06-27 13:19:29,606][03472] Starting process rollout_proc14 [2024-06-27 13:19:29,606][03472] Starting process rollout_proc15 [2024-06-27 13:19:29,607][03472] Starting process rollout_proc16 [2024-06-27 13:19:29,607][03472] Starting process rollout_proc17 [2024-06-27 13:19:29,607][03472] Starting process rollout_proc18 [2024-06-27 13:19:29,607][03472] Starting process rollout_proc19 [2024-06-27 13:19:29,608][03472] Starting process rollout_proc20 [2024-06-27 13:19:29,608][03472] Starting process rollout_proc21 [2024-06-27 13:19:29,608][03472] Starting process rollout_proc22 [2024-06-27 13:19:29,609][03472] Starting process rollout_proc23 [2024-06-27 13:19:29,612][03472] Starting process rollout_proc24 [2024-06-27 13:19:29,612][03472] Starting process rollout_proc25 [2024-06-27 13:19:29,617][03472] Starting process rollout_proc26 [2024-06-27 13:19:29,618][03472] Starting process rollout_proc27 [2024-06-27 13:19:29,618][03472] Starting process rollout_proc28 [2024-06-27 13:19:29,620][03472] Starting process rollout_proc29 [2024-06-27 13:19:29,621][03472] Starting process rollout_proc30 [2024-06-27 13:19:29,623][03472] Starting process rollout_proc31 [2024-06-27 13:19:31,631][03719] Worker 14 uses CPU cores [14] [2024-06-27 13:19:31,744][03705] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2024-06-27 13:19:31,745][03705] Set environment var CUDA_VISIBLE_DEVICES to '0' (GPU indices [0]) for inference process 0 [2024-06-27 13:19:31,754][03705] Num visible devices: 1 [2024-06-27 13:19:31,795][03716] Worker 10 uses CPU cores [10] [2024-06-27 13:19:31,800][03707] Worker 1 uses CPU cores [1] [2024-06-27 13:19:31,827][03724] Worker 18 uses CPU cores [18] [2024-06-27 13:19:31,836][03720] Worker 13 uses CPU cores [13] [2024-06-27 13:19:31,856][03709] Worker 2 uses CPU cores [2] [2024-06-27 13:19:31,860][03728] Worker 22 uses CPU cores [22] [2024-06-27 13:19:31,864][03730] Worker 24 uses CPU cores [24] [2024-06-27 13:19:31,871][03726] Worker 20 uses CPU cores [20] [2024-06-27 13:19:31,884][03732] Worker 26 uses CPU cores [26] [2024-06-27 13:19:31,892][03731] Worker 25 uses CPU cores [25] [2024-06-27 13:19:31,916][03722] Worker 15 uses CPU cores [15] [2024-06-27 13:19:31,928][03710] Worker 5 uses CPU cores [5] [2024-06-27 13:19:31,943][03713] Worker 7 uses CPU cores [7] [2024-06-27 13:19:31,956][03706] Worker 0 uses CPU cores [0] [2024-06-27 13:19:31,958][03708] Worker 3 uses CPU cores [3] [2024-06-27 13:19:31,962][03714] Worker 8 uses CPU cores [8] [2024-06-27 13:19:32,028][03685] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2024-06-27 13:19:32,028][03685] Set environment var CUDA_VISIBLE_DEVICES to '0' (GPU indices [0]) for learning process 0 [2024-06-27 13:19:32,037][03685] Num visible devices: 1 [2024-06-27 13:19:32,048][03715] Worker 9 uses CPU cores [9] [2024-06-27 13:19:32,048][03685] Setting fixed seed 0 [2024-06-27 13:19:32,049][03685] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2024-06-27 13:19:32,049][03685] Initializing actor-critic model on device cuda:0 [2024-06-27 13:19:32,064][03737] Worker 30 uses CPU cores [30] [2024-06-27 13:19:32,073][03725] Worker 19 uses CPU cores [19] [2024-06-27 13:19:32,082][03733] Worker 28 uses CPU cores [28] [2024-06-27 13:19:32,096][03718] Worker 12 uses CPU cores [12] [2024-06-27 13:19:32,102][03736] Worker 27 uses CPU cores [27] [2024-06-27 13:19:32,124][03723] Worker 17 uses CPU cores [17] [2024-06-27 13:19:32,129][03729] Worker 21 uses CPU cores [21] [2024-06-27 13:19:32,129][03717] Worker 11 uses CPU cores [11] [2024-06-27 13:19:32,131][03735] Worker 31 uses CPU cores [31] [2024-06-27 13:19:32,131][03727] Worker 23 uses CPU cores [23] [2024-06-27 13:19:32,132][03711] Worker 4 uses CPU cores [4] [2024-06-27 13:19:32,144][03712] Worker 6 uses CPU cores [6] [2024-06-27 13:19:32,187][03721] Worker 16 uses CPU cores [16] [2024-06-27 13:19:32,226][03734] Worker 29 uses CPU cores [29] [2024-06-27 13:19:32,793][03685] RunningMeanStd input shape: (11, 11) [2024-06-27 13:19:32,794][03685] RunningMeanStd input shape: (11, 11) [2024-06-27 13:19:32,794][03685] RunningMeanStd input shape: (11, 11) [2024-06-27 13:19:32,794][03685] RunningMeanStd input shape: (11, 11) [2024-06-27 13:19:32,794][03685] RunningMeanStd input shape: (11, 11) [2024-06-27 13:19:32,794][03685] RunningMeanStd input shape: (11, 11) [2024-06-27 13:19:32,794][03685] RunningMeanStd input shape: (11, 11) [2024-06-27 13:19:32,794][03685] RunningMeanStd input shape: (11, 11) [2024-06-27 13:19:32,794][03685] RunningMeanStd input shape: (11, 11) [2024-06-27 13:19:32,794][03685] RunningMeanStd input shape: (11, 11) [2024-06-27 13:19:32,794][03685] RunningMeanStd input shape: (11, 11) [2024-06-27 13:19:32,794][03685] RunningMeanStd input shape: (11, 11) [2024-06-27 13:19:32,794][03685] RunningMeanStd input shape: (11, 11) [2024-06-27 13:19:32,794][03685] RunningMeanStd input shape: (11, 11) [2024-06-27 13:19:32,794][03685] RunningMeanStd input shape: (11, 11) [2024-06-27 13:19:32,794][03685] RunningMeanStd input shape: (11, 11) [2024-06-27 13:19:32,794][03685] RunningMeanStd input shape: (11, 11) [2024-06-27 13:19:32,794][03685] RunningMeanStd input shape: (11, 11) [2024-06-27 13:19:32,794][03685] RunningMeanStd input shape: (11, 11) [2024-06-27 13:19:32,794][03685] RunningMeanStd input shape: (11, 11) [2024-06-27 13:19:32,794][03685] RunningMeanStd input shape: (11, 11) [2024-06-27 13:19:32,794][03685] RunningMeanStd input shape: (11, 11) [2024-06-27 13:19:32,795][03685] RunningMeanStd input shape: (11, 11) [2024-06-27 13:19:32,798][03685] RunningMeanStd input shape: (1,) [2024-06-27 13:19:32,798][03685] RunningMeanStd input shape: (1,) [2024-06-27 13:19:32,798][03685] RunningMeanStd input shape: (1,) [2024-06-27 13:19:32,798][03685] RunningMeanStd input shape: (1,) [2024-06-27 13:19:32,798][03685] RunningMeanStd input shape: (11, 11) [2024-06-27 13:19:32,834][03685] RunningMeanStd input shape: (1,) [2024-06-27 13:19:32,842][03685] Created Actor Critic model with architecture: [2024-06-27 13:19:32,842][03685] SampleFactoryAgentWrapper( (obs_normalizer): ObservationNormalizer() (returns_normalizer): RecursiveScriptModule(original_name=RunningMeanStdInPlace) (agent): MettaAgent( (_encoder): MultiFeatureSetEncoder( (feature_set_encoders): ModuleDict( (grid_obs): FeatureSetEncoder( (_normalizer): FeatureListNormalizer( (_norms_dict): ModuleDict( (agent): RunningMeanStdInPlace() (altar): RunningMeanStdInPlace() (clock): RunningMeanStdInPlace() (converter): RunningMeanStdInPlace() (generator): RunningMeanStdInPlace() (wall): RunningMeanStdInPlace() (agent:dir): RunningMeanStdInPlace() (agent:energy): RunningMeanStdInPlace() (agent:frozen): RunningMeanStdInPlace() (agent:hp): RunningMeanStdInPlace() (agent:id): RunningMeanStdInPlace() (agent:inv_r1): RunningMeanStdInPlace() (agent:inv_r2): RunningMeanStdInPlace() (agent:inv_r3): RunningMeanStdInPlace() (agent:shield): RunningMeanStdInPlace() (altar:hp): RunningMeanStdInPlace() (altar:state): RunningMeanStdInPlace() (converter:hp): RunningMeanStdInPlace() (converter:state): RunningMeanStdInPlace() (generator:amount): RunningMeanStdInPlace() (generator:hp): RunningMeanStdInPlace() (generator:state): RunningMeanStdInPlace() (wall:hp): RunningMeanStdInPlace() ) ) (embedding_net): Sequential( (0): Linear(in_features=125, out_features=512, bias=True) (1): ELU(alpha=1.0) (2): Linear(in_features=512, out_features=512, bias=True) (3): ELU(alpha=1.0) (4): Linear(in_features=512, out_features=512, bias=True) (5): ELU(alpha=1.0) (6): Linear(in_features=512, out_features=512, bias=True) (7): ELU(alpha=1.0) ) ) (global_vars): FeatureSetEncoder( (_normalizer): FeatureListNormalizer( (_norms_dict): ModuleDict( (_steps): RunningMeanStdInPlace() ) ) (embedding_net): Sequential( (0): Linear(in_features=5, out_features=8, bias=True) (1): ELU(alpha=1.0) (2): Linear(in_features=8, out_features=8, bias=True) (3): ELU(alpha=1.0) ) ) (last_action): FeatureSetEncoder( (_normalizer): FeatureListNormalizer( (_norms_dict): ModuleDict( (last_action_id): RunningMeanStdInPlace() (last_action_val): RunningMeanStdInPlace() ) ) (embedding_net): Sequential( (0): Linear(in_features=5, out_features=8, bias=True) (1): ELU(alpha=1.0) (2): Linear(in_features=8, out_features=8, bias=True) (3): ELU(alpha=1.0) ) ) (last_reward): FeatureSetEncoder( (_normalizer): FeatureListNormalizer( (_norms_dict): ModuleDict( (last_reward): RunningMeanStdInPlace() ) ) (embedding_net): Sequential( (0): Linear(in_features=5, out_features=8, bias=True) (1): ELU(alpha=1.0) (2): Linear(in_features=8, out_features=8, bias=True) (3): ELU(alpha=1.0) ) ) (kinship): FeatureSetEncoder( (_normalizer): FeatureListNormalizer( (_norms_dict): ModuleDict( (kinship): RunningMeanStdInPlace() ) ) (embedding_net): Sequential( (0): Linear(in_features=125, out_features=8, bias=True) (1): ELU(alpha=1.0) (2): Linear(in_features=8, out_features=8, bias=True) (3): ELU(alpha=1.0) ) ) ) (merged_encoder): Sequential( (0): Linear(in_features=544, out_features=512, bias=True) (1): ELU(alpha=1.0) (2): Linear(in_features=512, out_features=512, bias=True) (3): ELU(alpha=1.0) (4): Linear(in_features=512, out_features=512, bias=True) (5): ELU(alpha=1.0) ) ) (_decoder): Decoder( (mlp): Identity() ) (_critic_linear): Linear(in_features=512, out_features=1, bias=True) ) (_core): ModelCoreRNN( (core): GRU(512, 512) ) (_action_parameterization): ActionParameterizationDefault( (distribution_linear): Linear(in_features=512, out_features=16, bias=True) ) ) [2024-06-27 13:19:32,906][03685] Using optimizer [2024-06-27 13:19:33,090][03685] Loading state from checkpoint ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000004193_68698112.pth... [2024-06-27 13:19:33,105][03685] Loading model from checkpoint [2024-06-27 13:19:33,106][03685] Loaded experiment state at self.train_step=4193, self.env_steps=68698112 [2024-06-27 13:19:33,106][03685] Initialized policy 0 weights for model version 4193 [2024-06-27 13:19:33,108][03685] LearnerWorker_p0 finished initialization! [2024-06-27 13:19:33,108][03685] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2024-06-27 13:19:33,829][03705] RunningMeanStd input shape: (11, 11) [2024-06-27 13:19:33,829][03705] RunningMeanStd input shape: (11, 11) [2024-06-27 13:19:33,829][03705] RunningMeanStd input shape: (11, 11) [2024-06-27 13:19:33,829][03705] RunningMeanStd input shape: (11, 11) [2024-06-27 13:19:33,829][03705] RunningMeanStd input shape: (11, 11) [2024-06-27 13:19:33,829][03705] RunningMeanStd input shape: (11, 11) [2024-06-27 13:19:33,829][03705] RunningMeanStd input shape: (11, 11) [2024-06-27 13:19:33,829][03705] RunningMeanStd input shape: (11, 11) [2024-06-27 13:19:33,829][03705] RunningMeanStd input shape: (11, 11) [2024-06-27 13:19:33,829][03705] RunningMeanStd input shape: (11, 11) [2024-06-27 13:19:33,829][03705] RunningMeanStd input shape: (11, 11) [2024-06-27 13:19:33,829][03705] RunningMeanStd input shape: (11, 11) [2024-06-27 13:19:33,829][03705] RunningMeanStd input shape: (11, 11) [2024-06-27 13:19:33,829][03705] RunningMeanStd input shape: (11, 11) [2024-06-27 13:19:33,829][03705] RunningMeanStd input shape: (11, 11) [2024-06-27 13:19:33,830][03705] RunningMeanStd input shape: (11, 11) [2024-06-27 13:19:33,830][03705] RunningMeanStd input shape: (11, 11) [2024-06-27 13:19:33,830][03705] RunningMeanStd input shape: (11, 11) [2024-06-27 13:19:33,830][03705] RunningMeanStd input shape: (11, 11) [2024-06-27 13:19:33,830][03705] RunningMeanStd input shape: (11, 11) [2024-06-27 13:19:33,830][03705] RunningMeanStd input shape: (11, 11) [2024-06-27 13:19:33,830][03705] RunningMeanStd input shape: (11, 11) [2024-06-27 13:19:33,830][03705] RunningMeanStd input shape: (11, 11) [2024-06-27 13:19:33,833][03705] RunningMeanStd input shape: (1,) [2024-06-27 13:19:33,833][03705] RunningMeanStd input shape: (1,) [2024-06-27 13:19:33,833][03705] RunningMeanStd input shape: (1,) [2024-06-27 13:19:33,833][03705] RunningMeanStd input shape: (1,) [2024-06-27 13:19:33,834][03705] RunningMeanStd input shape: (11, 11) [2024-06-27 13:19:33,869][03705] RunningMeanStd input shape: (1,) [2024-06-27 13:19:33,894][03472] Inference worker 0-0 is ready! [2024-06-27 13:19:33,894][03472] All inference workers are ready! Signal rollout workers to start! [2024-06-27 13:19:36,278][03472] Fps is (10 sec: nan, 60 sec: nan, 300 sec: nan). Total num frames: 68698112. Throughput: 0: nan. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0) [2024-06-27 13:19:36,553][03721] Decorrelating experience for 0 frames... [2024-06-27 13:19:36,559][03728] Decorrelating experience for 0 frames... [2024-06-27 13:19:36,606][03726] Decorrelating experience for 0 frames... [2024-06-27 13:19:36,611][03724] Decorrelating experience for 0 frames... [2024-06-27 13:19:36,617][03723] Decorrelating experience for 0 frames... [2024-06-27 13:19:36,632][03732] Decorrelating experience for 0 frames... [2024-06-27 13:19:36,636][03733] Decorrelating experience for 0 frames... [2024-06-27 13:19:36,637][03729] Decorrelating experience for 0 frames... [2024-06-27 13:19:36,654][03727] Decorrelating experience for 0 frames... [2024-06-27 13:19:36,663][03712] Decorrelating experience for 0 frames... [2024-06-27 13:19:36,665][03734] Decorrelating experience for 0 frames... [2024-06-27 13:19:36,668][03722] Decorrelating experience for 0 frames... [2024-06-27 13:19:36,669][03715] Decorrelating experience for 0 frames... [2024-06-27 13:19:36,669][03720] Decorrelating experience for 0 frames... [2024-06-27 13:19:36,670][03706] Decorrelating experience for 0 frames... [2024-06-27 13:19:36,671][03708] Decorrelating experience for 0 frames... [2024-06-27 13:19:36,673][03707] Decorrelating experience for 0 frames... [2024-06-27 13:19:36,675][03719] Decorrelating experience for 0 frames... [2024-06-27 13:19:36,676][03710] Decorrelating experience for 0 frames... [2024-06-27 13:19:36,676][03713] Decorrelating experience for 0 frames... [2024-06-27 13:19:36,676][03711] Decorrelating experience for 0 frames... [2024-06-27 13:19:36,678][03709] Decorrelating experience for 0 frames... [2024-06-27 13:19:36,678][03714] Decorrelating experience for 0 frames... [2024-06-27 13:19:36,679][03718] Decorrelating experience for 0 frames... [2024-06-27 13:19:36,679][03717] Decorrelating experience for 0 frames... [2024-06-27 13:19:36,681][03716] Decorrelating experience for 0 frames... [2024-06-27 13:19:36,684][03737] Decorrelating experience for 0 frames... [2024-06-27 13:19:36,689][03731] Decorrelating experience for 0 frames... [2024-06-27 13:19:36,689][03735] Decorrelating experience for 0 frames... [2024-06-27 13:19:36,691][03736] Decorrelating experience for 0 frames... [2024-06-27 13:19:36,691][03730] Decorrelating experience for 0 frames... [2024-06-27 13:19:36,714][03725] Decorrelating experience for 0 frames... [2024-06-27 13:19:37,643][03728] Decorrelating experience for 256 frames... [2024-06-27 13:19:37,649][03721] Decorrelating experience for 256 frames... [2024-06-27 13:19:37,701][03726] Decorrelating experience for 256 frames... [2024-06-27 13:19:37,715][03724] Decorrelating experience for 256 frames... [2024-06-27 13:19:37,727][03723] Decorrelating experience for 256 frames... [2024-06-27 13:19:37,740][03729] Decorrelating experience for 256 frames... [2024-06-27 13:19:37,741][03732] Decorrelating experience for 256 frames... [2024-06-27 13:19:37,774][03727] Decorrelating experience for 256 frames... [2024-06-27 13:19:37,777][03733] Decorrelating experience for 256 frames... [2024-06-27 13:19:37,786][03734] Decorrelating experience for 256 frames... [2024-06-27 13:19:37,806][03712] Decorrelating experience for 256 frames... [2024-06-27 13:19:37,811][03722] Decorrelating experience for 256 frames... [2024-06-27 13:19:37,819][03720] Decorrelating experience for 256 frames... [2024-06-27 13:19:37,821][03715] Decorrelating experience for 256 frames... [2024-06-27 13:19:37,822][03708] Decorrelating experience for 256 frames... [2024-06-27 13:19:37,824][03706] Decorrelating experience for 256 frames... [2024-06-27 13:19:37,830][03719] Decorrelating experience for 256 frames... [2024-06-27 13:19:37,833][03707] Decorrelating experience for 256 frames... [2024-06-27 13:19:37,838][03710] Decorrelating experience for 256 frames... [2024-06-27 13:19:37,838][03713] Decorrelating experience for 256 frames... [2024-06-27 13:19:37,847][03718] Decorrelating experience for 256 frames... [2024-06-27 13:19:37,847][03711] Decorrelating experience for 256 frames... [2024-06-27 13:19:37,848][03709] Decorrelating experience for 256 frames... [2024-06-27 13:19:37,849][03714] Decorrelating experience for 256 frames... [2024-06-27 13:19:37,851][03737] Decorrelating experience for 256 frames... [2024-06-27 13:19:37,852][03717] Decorrelating experience for 256 frames... [2024-06-27 13:19:37,854][03716] Decorrelating experience for 256 frames... [2024-06-27 13:19:37,862][03736] Decorrelating experience for 256 frames... [2024-06-27 13:19:37,867][03731] Decorrelating experience for 256 frames... [2024-06-27 13:19:37,870][03730] Decorrelating experience for 256 frames... [2024-06-27 13:19:37,873][03735] Decorrelating experience for 256 frames... [2024-06-27 13:19:37,898][03725] Decorrelating experience for 256 frames... [2024-06-27 13:19:41,278][03472] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 68698112. Throughput: 0: 7632.1. Samples: 38160. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0) [2024-06-27 13:19:44,792][03729] Worker 21, sleep for 98.438 sec to decorrelate experience collection [2024-06-27 13:19:44,796][03724] Worker 18, sleep for 84.375 sec to decorrelate experience collection [2024-06-27 13:19:44,815][03709] Worker 2, sleep for 9.375 sec to decorrelate experience collection [2024-06-27 13:19:44,821][03714] Worker 8, sleep for 37.500 sec to decorrelate experience collection [2024-06-27 13:19:44,821][03718] Worker 12, sleep for 56.250 sec to decorrelate experience collection [2024-06-27 13:19:44,823][03727] Worker 23, sleep for 107.812 sec to decorrelate experience collection [2024-06-27 13:19:44,830][03708] Worker 3, sleep for 14.062 sec to decorrelate experience collection [2024-06-27 13:19:44,842][03707] Worker 1, sleep for 4.688 sec to decorrelate experience collection [2024-06-27 13:19:44,873][03733] Worker 28, sleep for 131.250 sec to decorrelate experience collection [2024-06-27 13:19:44,879][03736] Worker 27, sleep for 126.562 sec to decorrelate experience collection [2024-06-27 13:19:44,887][03722] Worker 15, sleep for 70.312 sec to decorrelate experience collection [2024-06-27 13:19:44,906][03735] Worker 31, sleep for 145.312 sec to decorrelate experience collection [2024-06-27 13:19:44,942][03725] Worker 19, sleep for 89.062 sec to decorrelate experience collection [2024-06-27 13:19:44,969][03685] Signal inference workers to stop experience collection... [2024-06-27 13:19:45,003][03705] InferenceWorker_p0-w0: stopping experience collection [2024-06-27 13:19:45,009][03713] Worker 7, sleep for 32.812 sec to decorrelate experience collection [2024-06-27 13:19:45,042][03710] Worker 5, sleep for 23.438 sec to decorrelate experience collection [2024-06-27 13:19:45,584][03685] Signal inference workers to resume experience collection... [2024-06-27 13:19:45,584][03705] InferenceWorker_p0-w0: resuming experience collection [2024-06-27 13:19:46,104][03719] Worker 14, sleep for 65.625 sec to decorrelate experience collection [2024-06-27 13:19:46,105][03715] Worker 9, sleep for 42.188 sec to decorrelate experience collection [2024-06-27 13:19:46,105][03720] Worker 13, sleep for 60.938 sec to decorrelate experience collection [2024-06-27 13:19:46,109][03717] Worker 11, sleep for 51.562 sec to decorrelate experience collection [2024-06-27 13:19:46,122][03711] Worker 4, sleep for 18.750 sec to decorrelate experience collection [2024-06-27 13:19:46,124][03728] Worker 22, sleep for 103.125 sec to decorrelate experience collection [2024-06-27 13:19:46,125][03721] Worker 16, sleep for 75.000 sec to decorrelate experience collection [2024-06-27 13:19:46,133][03716] Worker 10, sleep for 46.875 sec to decorrelate experience collection [2024-06-27 13:19:46,136][03723] Worker 17, sleep for 79.688 sec to decorrelate experience collection [2024-06-27 13:19:46,138][03712] Worker 6, sleep for 28.125 sec to decorrelate experience collection [2024-06-27 13:19:46,141][03734] Worker 29, sleep for 135.938 sec to decorrelate experience collection [2024-06-27 13:19:46,141][03732] Worker 26, sleep for 121.875 sec to decorrelate experience collection [2024-06-27 13:19:46,168][03726] Worker 20, sleep for 93.750 sec to decorrelate experience collection [2024-06-27 13:19:46,168][03737] Worker 30, sleep for 140.625 sec to decorrelate experience collection [2024-06-27 13:19:46,168][03730] Worker 24, sleep for 112.500 sec to decorrelate experience collection [2024-06-27 13:19:46,174][03731] Worker 25, sleep for 117.188 sec to decorrelate experience collection [2024-06-27 13:19:46,278][03472] Fps is (10 sec: 8192.1, 60 sec: 8192.1, 300 sec: 8192.1). Total num frames: 68780032. Throughput: 0: 32354.4. Samples: 323540. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2024-06-27 13:19:46,278][03472] Avg episode reward: [(0, '0.001')] [2024-06-27 13:19:46,825][03705] Updated weights for policy 0, policy_version 4203 (0.0018) [2024-06-27 13:19:49,257][03472] Heartbeat connected on Batcher_0 [2024-06-27 13:19:49,259][03472] Heartbeat connected on LearnerWorker_p0 [2024-06-27 13:19:49,264][03472] Heartbeat connected on RolloutWorker_w0 [2024-06-27 13:19:49,316][03472] Heartbeat connected on InferenceWorker_p0-w0 [2024-06-27 13:19:49,552][03707] Worker 1 awakens! [2024-06-27 13:19:49,561][03472] Heartbeat connected on RolloutWorker_w1 [2024-06-27 13:19:51,278][03472] Fps is (10 sec: 16383.8, 60 sec: 10922.6, 300 sec: 10922.6). Total num frames: 68861952. Throughput: 0: 22027.9. Samples: 330420. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2024-06-27 13:19:51,278][03472] Avg episode reward: [(0, '0.001')] [2024-06-27 13:19:54,232][03709] Worker 2 awakens! [2024-06-27 13:19:54,241][03472] Heartbeat connected on RolloutWorker_w2 [2024-06-27 13:19:56,278][03472] Fps is (10 sec: 9830.2, 60 sec: 9011.1, 300 sec: 9011.1). Total num frames: 68878336. Throughput: 0: 17256.9. Samples: 345140. Policy #0 lag: (min: 0.0, avg: 9.3, max: 10.0) [2024-06-27 13:19:56,287][03472] Avg episode reward: [(0, '0.001')] [2024-06-27 13:19:58,919][03708] Worker 3 awakens! [2024-06-27 13:19:58,924][03472] Heartbeat connected on RolloutWorker_w3 [2024-06-27 13:20:01,278][03472] Fps is (10 sec: 3276.8, 60 sec: 7864.3, 300 sec: 7864.3). Total num frames: 68894720. Throughput: 0: 14764.0. Samples: 369100. Policy #0 lag: (min: 0.0, avg: 9.3, max: 10.0) [2024-06-27 13:20:01,278][03472] Avg episode reward: [(0, '0.001')] [2024-06-27 13:20:04,966][03711] Worker 4 awakens! [2024-06-27 13:20:04,976][03472] Heartbeat connected on RolloutWorker_w4 [2024-06-27 13:20:06,278][03472] Fps is (10 sec: 6553.7, 60 sec: 8192.0, 300 sec: 8192.0). Total num frames: 68943872. Throughput: 0: 12794.0. Samples: 383820. Policy #0 lag: (min: 0.0, avg: 4.4, max: 12.0) [2024-06-27 13:20:06,278][03472] Avg episode reward: [(0, '0.002')] [2024-06-27 13:20:08,579][03710] Worker 5 awakens! [2024-06-27 13:20:08,584][03472] Heartbeat connected on RolloutWorker_w5 [2024-06-27 13:20:11,278][03472] Fps is (10 sec: 9830.5, 60 sec: 8426.1, 300 sec: 8426.1). Total num frames: 68993024. Throughput: 0: 13130.3. Samples: 459560. Policy #0 lag: (min: 0.0, avg: 1.2, max: 3.0) [2024-06-27 13:20:11,278][03472] Avg episode reward: [(0, '0.002')] [2024-06-27 13:20:12,362][03705] Updated weights for policy 0, policy_version 4213 (0.0017) [2024-06-27 13:20:14,364][03712] Worker 6 awakens! [2024-06-27 13:20:14,369][03472] Heartbeat connected on RolloutWorker_w6 [2024-06-27 13:20:16,278][03472] Fps is (10 sec: 14745.6, 60 sec: 9830.4, 300 sec: 9830.4). Total num frames: 69091328. Throughput: 0: 14052.5. Samples: 562100. Policy #0 lag: (min: 0.0, avg: 2.2, max: 5.0) [2024-06-27 13:20:16,278][03472] Avg episode reward: [(0, '0.002')] [2024-06-27 13:20:17,920][03713] Worker 7 awakens! [2024-06-27 13:20:17,926][03472] Heartbeat connected on RolloutWorker_w7 [2024-06-27 13:20:20,250][03705] Updated weights for policy 0, policy_version 4223 (0.0012) [2024-06-27 13:20:21,278][03472] Fps is (10 sec: 21299.1, 60 sec: 11286.8, 300 sec: 11286.8). Total num frames: 69206016. Throughput: 0: 13946.2. Samples: 627580. Policy #0 lag: (min: 0.0, avg: 2.2, max: 6.0) [2024-06-27 13:20:21,278][03472] Avg episode reward: [(0, '0.003')] [2024-06-27 13:20:22,420][03714] Worker 8 awakens! [2024-06-27 13:20:22,425][03472] Heartbeat connected on RolloutWorker_w8 [2024-06-27 13:20:26,278][03472] Fps is (10 sec: 22937.6, 60 sec: 12451.9, 300 sec: 12451.9). Total num frames: 69320704. Throughput: 0: 16089.8. Samples: 762200. Policy #0 lag: (min: 0.0, avg: 2.6, max: 6.0) [2024-06-27 13:20:26,278][03472] Avg episode reward: [(0, '0.004')] [2024-06-27 13:20:28,038][03705] Updated weights for policy 0, policy_version 4233 (0.0012) [2024-06-27 13:20:28,392][03715] Worker 9 awakens! [2024-06-27 13:20:28,399][03472] Heartbeat connected on RolloutWorker_w9 [2024-06-27 13:20:31,278][03472] Fps is (10 sec: 22937.3, 60 sec: 13405.1, 300 sec: 13405.1). Total num frames: 69435392. Throughput: 0: 13046.2. Samples: 910620. Policy #0 lag: (min: 0.0, avg: 14.1, max: 40.0) [2024-06-27 13:20:31,278][03472] Avg episode reward: [(0, '0.004')] [2024-06-27 13:20:32,883][03705] Updated weights for policy 0, policy_version 4243 (0.0018) [2024-06-27 13:20:33,109][03716] Worker 10 awakens! [2024-06-27 13:20:33,114][03472] Heartbeat connected on RolloutWorker_w10 [2024-06-27 13:20:36,278][03472] Fps is (10 sec: 27852.8, 60 sec: 15018.7, 300 sec: 15018.7). Total num frames: 69599232. Throughput: 0: 14914.7. Samples: 1001580. Policy #0 lag: (min: 0.0, avg: 18.2, max: 50.0) [2024-06-27 13:20:36,278][03472] Avg episode reward: [(0, '0.003')] [2024-06-27 13:20:37,768][03717] Worker 11 awakens! [2024-06-27 13:20:37,776][03472] Heartbeat connected on RolloutWorker_w11 [2024-06-27 13:20:38,803][03705] Updated weights for policy 0, policy_version 4253 (0.0015) [2024-06-27 13:20:41,171][03718] Worker 12 awakens! [2024-06-27 13:20:41,178][03472] Heartbeat connected on RolloutWorker_w12 [2024-06-27 13:20:41,278][03472] Fps is (10 sec: 34406.8, 60 sec: 18022.4, 300 sec: 16636.1). Total num frames: 69779456. Throughput: 0: 18883.6. Samples: 1194900. Policy #0 lag: (min: 0.0, avg: 20.0, max: 59.0) [2024-06-27 13:20:41,278][03472] Avg episode reward: [(0, '0.004')] [2024-06-27 13:20:43,953][03705] Updated weights for policy 0, policy_version 4263 (0.0016) [2024-06-27 13:20:46,278][03472] Fps is (10 sec: 34406.1, 60 sec: 19387.7, 300 sec: 17788.3). Total num frames: 69943296. Throughput: 0: 22941.8. Samples: 1401480. Policy #0 lag: (min: 0.0, avg: 3.2, max: 9.0) [2024-06-27 13:20:46,278][03472] Avg episode reward: [(0, '0.004')] [2024-06-27 13:20:47,140][03720] Worker 13 awakens! [2024-06-27 13:20:47,148][03472] Heartbeat connected on RolloutWorker_w13 [2024-06-27 13:20:47,892][03705] Updated weights for policy 0, policy_version 4273 (0.0017) [2024-06-27 13:20:51,278][03472] Fps is (10 sec: 31129.5, 60 sec: 20480.0, 300 sec: 18568.5). Total num frames: 70090752. Throughput: 0: 24961.7. Samples: 1507100. Policy #0 lag: (min: 0.0, avg: 4.0, max: 10.0) [2024-06-27 13:20:51,278][03472] Avg episode reward: [(0, '0.003')] [2024-06-27 13:20:51,828][03719] Worker 14 awakens! [2024-06-27 13:20:51,833][03472] Heartbeat connected on RolloutWorker_w14 [2024-06-27 13:20:52,704][03705] Updated weights for policy 0, policy_version 4283 (0.0021) [2024-06-27 13:20:55,300][03722] Worker 15 awakens! [2024-06-27 13:20:55,307][03472] Heartbeat connected on RolloutWorker_w15 [2024-06-27 13:20:56,278][03472] Fps is (10 sec: 34406.5, 60 sec: 23483.8, 300 sec: 19865.6). Total num frames: 70287360. Throughput: 0: 27869.3. Samples: 1713680. Policy #0 lag: (min: 0.0, avg: 5.7, max: 10.0) [2024-06-27 13:20:56,278][03472] Avg episode reward: [(0, '0.002')] [2024-06-27 13:20:57,264][03705] Updated weights for policy 0, policy_version 4293 (0.0023) [2024-06-27 13:21:01,225][03721] Worker 16 awakens! [2024-06-27 13:21:01,234][03472] Heartbeat connected on RolloutWorker_w16 [2024-06-27 13:21:01,278][03472] Fps is (10 sec: 37683.3, 60 sec: 26214.4, 300 sec: 20817.3). Total num frames: 70467584. Throughput: 0: 30062.2. Samples: 1914900. Policy #0 lag: (min: 0.0, avg: 5.6, max: 11.0) [2024-06-27 13:21:01,278][03472] Avg episode reward: [(0, '0.004')] [2024-06-27 13:21:02,191][03705] Updated weights for policy 0, policy_version 4303 (0.0024) [2024-06-27 13:21:05,920][03723] Worker 17 awakens! [2024-06-27 13:21:05,929][03472] Heartbeat connected on RolloutWorker_w17 [2024-06-27 13:21:06,278][03472] Fps is (10 sec: 34406.3, 60 sec: 28125.8, 300 sec: 21481.2). Total num frames: 70631424. Throughput: 0: 30855.9. Samples: 2016100. Policy #0 lag: (min: 0.0, avg: 6.1, max: 11.0) [2024-06-27 13:21:06,278][03472] Avg episode reward: [(0, '0.004')] [2024-06-27 13:21:07,020][03705] Updated weights for policy 0, policy_version 4313 (0.0037) [2024-06-27 13:21:09,269][03724] Worker 18 awakens! [2024-06-27 13:21:09,281][03472] Heartbeat connected on RolloutWorker_w18 [2024-06-27 13:21:11,278][03472] Fps is (10 sec: 34406.1, 60 sec: 30310.3, 300 sec: 22247.7). Total num frames: 70811648. Throughput: 0: 32744.4. Samples: 2235700. Policy #0 lag: (min: 0.0, avg: 6.8, max: 12.0) [2024-06-27 13:21:11,278][03472] Avg episode reward: [(0, '0.004')] [2024-06-27 13:21:11,837][03705] Updated weights for policy 0, policy_version 4323 (0.0024) [2024-06-27 13:21:14,108][03725] Worker 19 awakens! [2024-06-27 13:21:14,119][03472] Heartbeat connected on RolloutWorker_w19 [2024-06-27 13:21:16,221][03705] Updated weights for policy 0, policy_version 4333 (0.0030) [2024-06-27 13:21:16,278][03472] Fps is (10 sec: 36044.6, 60 sec: 31675.7, 300 sec: 22937.6). Total num frames: 70991872. Throughput: 0: 34364.0. Samples: 2457000. Policy #0 lag: (min: 1.0, avg: 7.5, max: 12.0) [2024-06-27 13:21:16,278][03472] Avg episode reward: [(0, '0.004')] [2024-06-27 13:21:20,016][03726] Worker 20 awakens! [2024-06-27 13:21:20,027][03472] Heartbeat connected on RolloutWorker_w20 [2024-06-27 13:21:20,236][03705] Updated weights for policy 0, policy_version 4343 (0.0028) [2024-06-27 13:21:21,278][03472] Fps is (10 sec: 34406.0, 60 sec: 32494.8, 300 sec: 23405.7). Total num frames: 71155712. Throughput: 0: 34686.5. Samples: 2562480. Policy #0 lag: (min: 0.0, avg: 6.1, max: 13.0) [2024-06-27 13:21:21,279][03472] Avg episode reward: [(0, '0.005')] [2024-06-27 13:21:23,328][03729] Worker 21 awakens! [2024-06-27 13:21:23,340][03472] Heartbeat connected on RolloutWorker_w21 [2024-06-27 13:21:24,726][03705] Updated weights for policy 0, policy_version 4353 (0.0020) [2024-06-27 13:21:26,278][03472] Fps is (10 sec: 37683.5, 60 sec: 34133.3, 300 sec: 24278.1). Total num frames: 71368704. Throughput: 0: 35418.6. Samples: 2788740. Policy #0 lag: (min: 0.0, avg: 5.8, max: 14.0) [2024-06-27 13:21:26,278][03472] Avg episode reward: [(0, '0.003')] [2024-06-27 13:21:26,291][03685] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000004356_71368704.pth... [2024-06-27 13:21:26,341][03685] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000003878_63537152.pth [2024-06-27 13:21:29,257][03728] Worker 22 awakens! [2024-06-27 13:21:29,267][03472] Heartbeat connected on RolloutWorker_w22 [2024-06-27 13:21:29,277][03705] Updated weights for policy 0, policy_version 4363 (0.0026) [2024-06-27 13:21:31,278][03472] Fps is (10 sec: 39322.5, 60 sec: 35225.7, 300 sec: 24789.7). Total num frames: 71548928. Throughput: 0: 35885.9. Samples: 3016340. Policy #0 lag: (min: 0.0, avg: 6.0, max: 15.0) [2024-06-27 13:21:31,278][03472] Avg episode reward: [(0, '0.003')] [2024-06-27 13:21:32,736][03727] Worker 23 awakens! [2024-06-27 13:21:32,748][03472] Heartbeat connected on RolloutWorker_w23 [2024-06-27 13:21:33,537][03705] Updated weights for policy 0, policy_version 4373 (0.0028) [2024-06-27 13:21:36,278][03472] Fps is (10 sec: 39321.2, 60 sec: 36044.7, 300 sec: 25531.7). Total num frames: 71761920. Throughput: 0: 36186.6. Samples: 3135500. Policy #0 lag: (min: 0.0, avg: 7.2, max: 16.0) [2024-06-27 13:21:36,278][03472] Avg episode reward: [(0, '0.004')] [2024-06-27 13:21:37,585][03705] Updated weights for policy 0, policy_version 4383 (0.0023) [2024-06-27 13:21:38,768][03730] Worker 24 awakens! [2024-06-27 13:21:38,778][03472] Heartbeat connected on RolloutWorker_w24 [2024-06-27 13:21:41,278][03472] Fps is (10 sec: 39320.9, 60 sec: 36044.7, 300 sec: 25952.2). Total num frames: 71942144. Throughput: 0: 36879.9. Samples: 3373280. Policy #0 lag: (min: 0.0, avg: 7.7, max: 16.0) [2024-06-27 13:21:41,278][03472] Avg episode reward: [(0, '0.004')] [2024-06-27 13:21:42,022][03705] Updated weights for policy 0, policy_version 4393 (0.0028) [2024-06-27 13:21:43,460][03731] Worker 25 awakens! [2024-06-27 13:21:43,473][03472] Heartbeat connected on RolloutWorker_w25 [2024-06-27 13:21:45,557][03705] Updated weights for policy 0, policy_version 4403 (0.0034) [2024-06-27 13:21:46,278][03472] Fps is (10 sec: 39321.9, 60 sec: 36864.0, 300 sec: 26592.5). Total num frames: 72155136. Throughput: 0: 37604.8. Samples: 3607120. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2024-06-27 13:21:46,278][03472] Avg episode reward: [(0, '0.003')] [2024-06-27 13:21:48,047][03732] Worker 26 awakens! [2024-06-27 13:21:48,060][03472] Heartbeat connected on RolloutWorker_w26 [2024-06-27 13:21:49,749][03705] Updated weights for policy 0, policy_version 4413 (0.0024) [2024-06-27 13:21:51,278][03472] Fps is (10 sec: 39321.7, 60 sec: 37410.1, 300 sec: 26942.6). Total num frames: 72335360. Throughput: 0: 38183.9. Samples: 3734380. Policy #0 lag: (min: 0.0, avg: 8.2, max: 17.0) [2024-06-27 13:21:51,278][03472] Avg episode reward: [(0, '0.004')] [2024-06-27 13:21:51,540][03736] Worker 27 awakens! [2024-06-27 13:21:51,552][03472] Heartbeat connected on RolloutWorker_w27 [2024-06-27 13:21:53,941][03705] Updated weights for policy 0, policy_version 4423 (0.0029) [2024-06-27 13:21:56,216][03733] Worker 28 awakens! [2024-06-27 13:21:56,231][03472] Heartbeat connected on RolloutWorker_w28 [2024-06-27 13:21:56,278][03472] Fps is (10 sec: 40960.2, 60 sec: 37956.3, 300 sec: 27618.7). Total num frames: 72564736. Throughput: 0: 38752.5. Samples: 3979560. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-27 13:21:56,278][03472] Avg episode reward: [(0, '0.004')] [2024-06-27 13:21:58,329][03705] Updated weights for policy 0, policy_version 4433 (0.0030) [2024-06-27 13:22:01,278][03472] Fps is (10 sec: 42598.4, 60 sec: 38229.3, 300 sec: 28022.3). Total num frames: 72761344. Throughput: 0: 39312.4. Samples: 4226060. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2024-06-27 13:22:01,278][03472] Avg episode reward: [(0, '0.003')] [2024-06-27 13:22:02,189][03734] Worker 29 awakens! [2024-06-27 13:22:02,196][03705] Updated weights for policy 0, policy_version 4443 (0.0026) [2024-06-27 13:22:02,203][03472] Heartbeat connected on RolloutWorker_w29 [2024-06-27 13:22:05,945][03705] Updated weights for policy 0, policy_version 4453 (0.0036) [2024-06-27 13:22:06,278][03472] Fps is (10 sec: 40959.4, 60 sec: 39048.5, 300 sec: 28508.1). Total num frames: 72974336. Throughput: 0: 39704.9. Samples: 4349200. Policy #0 lag: (min: 1.0, avg: 8.9, max: 20.0) [2024-06-27 13:22:06,278][03472] Avg episode reward: [(0, '0.004')] [2024-06-27 13:22:06,892][03737] Worker 30 awakens! [2024-06-27 13:22:06,907][03472] Heartbeat connected on RolloutWorker_w30 [2024-06-27 13:22:10,090][03685] Signal inference workers to stop experience collection... (50 times) [2024-06-27 13:22:10,092][03685] Signal inference workers to resume experience collection... (50 times) [2024-06-27 13:22:10,108][03705] InferenceWorker_p0-w0: stopping experience collection (50 times) [2024-06-27 13:22:10,112][03705] Updated weights for policy 0, policy_version 4463 (0.0033) [2024-06-27 13:22:10,122][03705] InferenceWorker_p0-w0: resuming experience collection (50 times) [2024-06-27 13:22:10,320][03735] Worker 31 awakens! [2024-06-27 13:22:10,332][03472] Heartbeat connected on RolloutWorker_w31 [2024-06-27 13:22:11,278][03472] Fps is (10 sec: 42598.8, 60 sec: 39594.7, 300 sec: 28962.7). Total num frames: 73187328. Throughput: 0: 40468.0. Samples: 4609800. Policy #0 lag: (min: 0.0, avg: 89.3, max: 265.0) [2024-06-27 13:22:11,278][03472] Avg episode reward: [(0, '0.005')] [2024-06-27 13:22:13,522][03705] Updated weights for policy 0, policy_version 4473 (0.0041) [2024-06-27 13:22:16,280][03472] Fps is (10 sec: 42589.4, 60 sec: 40139.4, 300 sec: 29388.4). Total num frames: 73400320. Throughput: 0: 41093.0. Samples: 4865620. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2024-06-27 13:22:16,281][03472] Avg episode reward: [(0, '0.004')] [2024-06-27 13:22:17,559][03705] Updated weights for policy 0, policy_version 4483 (0.0041) [2024-06-27 13:22:20,911][03705] Updated weights for policy 0, policy_version 4493 (0.0030) [2024-06-27 13:22:21,278][03472] Fps is (10 sec: 44236.5, 60 sec: 41233.1, 300 sec: 29888.4). Total num frames: 73629696. Throughput: 0: 41217.8. Samples: 4990300. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-27 13:22:21,281][03472] Avg episode reward: [(0, '0.002')] [2024-06-27 13:22:25,096][03705] Updated weights for policy 0, policy_version 4503 (0.0030) [2024-06-27 13:22:26,278][03472] Fps is (10 sec: 45885.4, 60 sec: 41506.1, 300 sec: 30358.6). Total num frames: 73859072. Throughput: 0: 41965.9. Samples: 5261740. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-27 13:22:26,278][03472] Avg episode reward: [(0, '0.002')] [2024-06-27 13:22:28,548][03705] Updated weights for policy 0, policy_version 4513 (0.0022) [2024-06-27 13:22:31,278][03472] Fps is (10 sec: 42598.6, 60 sec: 41779.1, 300 sec: 30614.7). Total num frames: 74055680. Throughput: 0: 42553.3. Samples: 5522020. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 13:22:31,278][03472] Avg episode reward: [(0, '0.004')] [2024-06-27 13:22:32,471][03705] Updated weights for policy 0, policy_version 4523 (0.0041) [2024-06-27 13:22:35,946][03705] Updated weights for policy 0, policy_version 4533 (0.0038) [2024-06-27 13:22:36,278][03472] Fps is (10 sec: 40960.3, 60 sec: 41779.3, 300 sec: 30947.6). Total num frames: 74268672. Throughput: 0: 42550.8. Samples: 5649160. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 13:22:36,278][03472] Avg episode reward: [(0, '0.004')] [2024-06-27 13:22:39,858][03705] Updated weights for policy 0, policy_version 4543 (0.0026) [2024-06-27 13:22:41,278][03472] Fps is (10 sec: 45875.2, 60 sec: 42871.5, 300 sec: 31439.6). Total num frames: 74514432. Throughput: 0: 43109.3. Samples: 5919480. Policy #0 lag: (min: 0.0, avg: 11.0, max: 25.0) [2024-06-27 13:22:41,278][03472] Avg episode reward: [(0, '0.005')] [2024-06-27 13:22:43,294][03705] Updated weights for policy 0, policy_version 4553 (0.0032) [2024-06-27 13:22:46,278][03472] Fps is (10 sec: 42597.6, 60 sec: 42325.3, 300 sec: 31560.7). Total num frames: 74694656. Throughput: 0: 43440.9. Samples: 6180900. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 13:22:46,278][03472] Avg episode reward: [(0, '0.004')] [2024-06-27 13:22:47,304][03705] Updated weights for policy 0, policy_version 4563 (0.0027) [2024-06-27 13:22:50,834][03705] Updated weights for policy 0, policy_version 4573 (0.0030) [2024-06-27 13:22:51,278][03472] Fps is (10 sec: 42598.8, 60 sec: 43417.7, 300 sec: 32011.8). Total num frames: 74940416. Throughput: 0: 43486.4. Samples: 6306080. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 13:22:51,278][03472] Avg episode reward: [(0, '0.004')] [2024-06-27 13:22:54,841][03705] Updated weights for policy 0, policy_version 4583 (0.0034) [2024-06-27 13:22:56,278][03472] Fps is (10 sec: 45876.0, 60 sec: 43144.6, 300 sec: 32276.5). Total num frames: 75153408. Throughput: 0: 43660.5. Samples: 6574520. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 13:22:56,278][03472] Avg episode reward: [(0, '0.003')] [2024-06-27 13:22:58,300][03705] Updated weights for policy 0, policy_version 4593 (0.0032) [2024-06-27 13:23:01,278][03472] Fps is (10 sec: 40959.6, 60 sec: 43144.6, 300 sec: 32448.3). Total num frames: 75350016. Throughput: 0: 43770.2. Samples: 6835180. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 13:23:01,278][03472] Avg episode reward: [(0, '0.006')] [2024-06-27 13:23:02,241][03705] Updated weights for policy 0, policy_version 4603 (0.0044) [2024-06-27 13:23:05,925][03705] Updated weights for policy 0, policy_version 4613 (0.0040) [2024-06-27 13:23:06,278][03472] Fps is (10 sec: 44236.3, 60 sec: 43690.7, 300 sec: 32846.0). Total num frames: 75595776. Throughput: 0: 43832.9. Samples: 6962780. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 13:23:06,278][03472] Avg episode reward: [(0, '0.007')] [2024-06-27 13:23:09,761][03705] Updated weights for policy 0, policy_version 4623 (0.0033) [2024-06-27 13:23:11,278][03472] Fps is (10 sec: 45875.2, 60 sec: 43690.7, 300 sec: 33072.8). Total num frames: 75808768. Throughput: 0: 43597.3. Samples: 7223620. Policy #0 lag: (min: 0.0, avg: 11.0, max: 25.0) [2024-06-27 13:23:11,278][03472] Avg episode reward: [(0, '0.006')] [2024-06-27 13:23:13,505][03705] Updated weights for policy 0, policy_version 4633 (0.0032) [2024-06-27 13:23:16,278][03472] Fps is (10 sec: 40959.7, 60 sec: 43419.1, 300 sec: 33214.8). Total num frames: 76005376. Throughput: 0: 43616.3. Samples: 7484760. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 13:23:16,278][03472] Avg episode reward: [(0, '0.007')] [2024-06-27 13:23:17,245][03705] Updated weights for policy 0, policy_version 4643 (0.0040) [2024-06-27 13:23:21,019][03705] Updated weights for policy 0, policy_version 4653 (0.0027) [2024-06-27 13:23:21,280][03472] Fps is (10 sec: 44227.2, 60 sec: 43689.1, 300 sec: 33568.7). Total num frames: 76251136. Throughput: 0: 43651.6. Samples: 7613580. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 13:23:21,280][03472] Avg episode reward: [(0, '0.007')] [2024-06-27 13:23:24,729][03705] Updated weights for policy 0, policy_version 4663 (0.0033) [2024-06-27 13:23:25,867][03685] Signal inference workers to stop experience collection... (100 times) [2024-06-27 13:23:25,913][03705] InferenceWorker_p0-w0: stopping experience collection (100 times) [2024-06-27 13:23:25,920][03685] Signal inference workers to resume experience collection... (100 times) [2024-06-27 13:23:25,943][03705] InferenceWorker_p0-w0: resuming experience collection (100 times) [2024-06-27 13:23:26,278][03472] Fps is (10 sec: 45875.6, 60 sec: 43417.6, 300 sec: 33765.3). Total num frames: 76464128. Throughput: 0: 43330.2. Samples: 7869340. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 13:23:26,278][03472] Avg episode reward: [(0, '0.005')] [2024-06-27 13:23:26,293][03685] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000004667_76464128.pth... [2024-06-27 13:23:26,336][03685] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000004193_68698112.pth [2024-06-27 13:23:28,508][03705] Updated weights for policy 0, policy_version 4673 (0.0033) [2024-06-27 13:23:31,278][03472] Fps is (10 sec: 40969.1, 60 sec: 43417.6, 300 sec: 33883.5). Total num frames: 76660736. Throughput: 0: 43389.5. Samples: 8133420. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 13:23:31,278][03472] Avg episode reward: [(0, '0.004')] [2024-06-27 13:23:32,319][03705] Updated weights for policy 0, policy_version 4683 (0.0028) [2024-06-27 13:23:35,998][03705] Updated weights for policy 0, policy_version 4693 (0.0034) [2024-06-27 13:23:36,278][03472] Fps is (10 sec: 42598.8, 60 sec: 43690.6, 300 sec: 34133.3). Total num frames: 76890112. Throughput: 0: 43407.9. Samples: 8259440. Policy #0 lag: (min: 0.0, avg: 11.4, max: 25.0) [2024-06-27 13:23:36,278][03472] Avg episode reward: [(0, '0.004')] [2024-06-27 13:23:40,069][03705] Updated weights for policy 0, policy_version 4703 (0.0029) [2024-06-27 13:23:41,280][03472] Fps is (10 sec: 44226.8, 60 sec: 43142.9, 300 sec: 34305.8). Total num frames: 77103104. Throughput: 0: 43351.1. Samples: 8525420. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-27 13:23:41,281][03472] Avg episode reward: [(0, '0.003')] [2024-06-27 13:23:43,414][03705] Updated weights for policy 0, policy_version 4713 (0.0036) [2024-06-27 13:23:46,279][03472] Fps is (10 sec: 40955.6, 60 sec: 43416.9, 300 sec: 34406.3). Total num frames: 77299712. Throughput: 0: 43342.1. Samples: 8785620. Policy #0 lag: (min: 1.0, avg: 12.0, max: 23.0) [2024-06-27 13:23:46,280][03472] Avg episode reward: [(0, '0.004')] [2024-06-27 13:23:47,470][03705] Updated weights for policy 0, policy_version 4723 (0.0041) [2024-06-27 13:23:50,879][03705] Updated weights for policy 0, policy_version 4733 (0.0029) [2024-06-27 13:23:51,279][03472] Fps is (10 sec: 44242.4, 60 sec: 43416.8, 300 sec: 34695.4). Total num frames: 77545472. Throughput: 0: 43285.8. Samples: 8910680. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-27 13:23:51,279][03472] Avg episode reward: [(0, '0.004')] [2024-06-27 13:23:54,953][03705] Updated weights for policy 0, policy_version 4743 (0.0032) [2024-06-27 13:23:56,278][03472] Fps is (10 sec: 44241.9, 60 sec: 43144.6, 300 sec: 34784.5). Total num frames: 77742080. Throughput: 0: 43258.3. Samples: 9170240. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-27 13:23:56,278][03472] Avg episode reward: [(0, '0.003')] [2024-06-27 13:23:58,722][03705] Updated weights for policy 0, policy_version 4753 (0.0024) [2024-06-27 13:24:01,278][03472] Fps is (10 sec: 42602.7, 60 sec: 43690.7, 300 sec: 34993.8). Total num frames: 77971456. Throughput: 0: 43238.4. Samples: 9430480. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-27 13:24:01,278][03472] Avg episode reward: [(0, '0.002')] [2024-06-27 13:24:02,726][03705] Updated weights for policy 0, policy_version 4763 (0.0030) [2024-06-27 13:24:06,237][03705] Updated weights for policy 0, policy_version 4773 (0.0031) [2024-06-27 13:24:06,280][03472] Fps is (10 sec: 45864.8, 60 sec: 43416.1, 300 sec: 35195.0). Total num frames: 78200832. Throughput: 0: 43282.7. Samples: 9561300. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-27 13:24:06,280][03472] Avg episode reward: [(0, '0.004')] [2024-06-27 13:24:10,384][03705] Updated weights for policy 0, policy_version 4783 (0.0027) [2024-06-27 13:24:11,278][03472] Fps is (10 sec: 42598.6, 60 sec: 43144.6, 300 sec: 35270.3). Total num frames: 78397440. Throughput: 0: 43293.0. Samples: 9817520. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 13:24:11,278][03472] Avg episode reward: [(0, '0.005')] [2024-06-27 13:24:14,163][03705] Updated weights for policy 0, policy_version 4793 (0.0038) [2024-06-27 13:24:16,278][03472] Fps is (10 sec: 40968.8, 60 sec: 43417.7, 300 sec: 35401.1). Total num frames: 78610432. Throughput: 0: 43047.9. Samples: 10070580. Policy #0 lag: (min: 0.0, avg: 10.9, max: 24.0) [2024-06-27 13:24:16,278][03472] Avg episode reward: [(0, '0.002')] [2024-06-27 13:24:17,933][03705] Updated weights for policy 0, policy_version 4803 (0.0032) [2024-06-27 13:24:21,278][03472] Fps is (10 sec: 42598.1, 60 sec: 42873.0, 300 sec: 35527.4). Total num frames: 78823424. Throughput: 0: 43170.2. Samples: 10202100. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 13:24:21,278][03472] Avg episode reward: [(0, '0.003')] [2024-06-27 13:24:21,726][03705] Updated weights for policy 0, policy_version 4813 (0.0027) [2024-06-27 13:24:25,534][03705] Updated weights for policy 0, policy_version 4823 (0.0033) [2024-06-27 13:24:26,278][03472] Fps is (10 sec: 40959.7, 60 sec: 42598.4, 300 sec: 35592.8). Total num frames: 79020032. Throughput: 0: 43085.2. Samples: 10464160. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 13:24:26,278][03472] Avg episode reward: [(0, '0.004')] [2024-06-27 13:24:29,219][03705] Updated weights for policy 0, policy_version 4833 (0.0036) [2024-06-27 13:24:31,278][03472] Fps is (10 sec: 40959.8, 60 sec: 42871.4, 300 sec: 35711.6). Total num frames: 79233024. Throughput: 0: 42897.4. Samples: 10715960. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-27 13:24:31,282][03472] Avg episode reward: [(0, '0.004')] [2024-06-27 13:24:33,334][03705] Updated weights for policy 0, policy_version 4843 (0.0037) [2024-06-27 13:24:36,279][03472] Fps is (10 sec: 45869.0, 60 sec: 43143.5, 300 sec: 36544.5). Total num frames: 79478784. Throughput: 0: 42984.4. Samples: 10845000. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-27 13:24:36,280][03472] Avg episode reward: [(0, '0.004')] [2024-06-27 13:24:36,729][03705] Updated weights for policy 0, policy_version 4853 (0.0031) [2024-06-27 13:24:38,939][03685] Signal inference workers to stop experience collection... (150 times) [2024-06-27 13:24:38,976][03705] InferenceWorker_p0-w0: stopping experience collection (150 times) [2024-06-27 13:24:38,987][03685] Signal inference workers to resume experience collection... (150 times) [2024-06-27 13:24:38,993][03705] InferenceWorker_p0-w0: resuming experience collection (150 times) [2024-06-27 13:24:40,810][03705] Updated weights for policy 0, policy_version 4863 (0.0042) [2024-06-27 13:24:41,278][03472] Fps is (10 sec: 45875.0, 60 sec: 43146.1, 300 sec: 36988.9). Total num frames: 79691776. Throughput: 0: 43158.5. Samples: 11112380. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-27 13:24:41,278][03472] Avg episode reward: [(0, '0.004')] [2024-06-27 13:24:44,399][03705] Updated weights for policy 0, policy_version 4873 (0.0023) [2024-06-27 13:24:46,278][03472] Fps is (10 sec: 40965.7, 60 sec: 43145.3, 300 sec: 37377.7). Total num frames: 79888384. Throughput: 0: 43008.3. Samples: 11365860. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-27 13:24:46,278][03472] Avg episode reward: [(0, '0.006')] [2024-06-27 13:24:48,324][03705] Updated weights for policy 0, policy_version 4883 (0.0040) [2024-06-27 13:24:51,278][03472] Fps is (10 sec: 42598.9, 60 sec: 42872.2, 300 sec: 38099.8). Total num frames: 80117760. Throughput: 0: 42990.1. Samples: 11495760. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 13:24:51,278][03472] Avg episode reward: [(0, '0.005')] [2024-06-27 13:24:52,080][03705] Updated weights for policy 0, policy_version 4893 (0.0027) [2024-06-27 13:24:55,846][03705] Updated weights for policy 0, policy_version 4903 (0.0037) [2024-06-27 13:24:56,278][03472] Fps is (10 sec: 45875.6, 60 sec: 43417.6, 300 sec: 38821.8). Total num frames: 80347136. Throughput: 0: 43203.1. Samples: 11761660. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-27 13:24:56,278][03472] Avg episode reward: [(0, '0.003')] [2024-06-27 13:24:59,743][03705] Updated weights for policy 0, policy_version 4913 (0.0040) [2024-06-27 13:25:01,278][03472] Fps is (10 sec: 42598.2, 60 sec: 42871.4, 300 sec: 39321.6). Total num frames: 80543744. Throughput: 0: 43190.7. Samples: 12014160. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-27 13:25:01,278][03472] Avg episode reward: [(0, '0.005')] [2024-06-27 13:25:03,322][03705] Updated weights for policy 0, policy_version 4923 (0.0033) [2024-06-27 13:25:06,278][03472] Fps is (10 sec: 40960.1, 60 sec: 42600.0, 300 sec: 39877.0). Total num frames: 80756736. Throughput: 0: 43133.4. Samples: 12143100. Policy #0 lag: (min: 0.0, avg: 11.5, max: 23.0) [2024-06-27 13:25:06,278][03472] Avg episode reward: [(0, '0.005')] [2024-06-27 13:25:07,159][03705] Updated weights for policy 0, policy_version 4933 (0.0038) [2024-06-27 13:25:10,783][03705] Updated weights for policy 0, policy_version 4943 (0.0031) [2024-06-27 13:25:11,278][03472] Fps is (10 sec: 45875.0, 60 sec: 43417.5, 300 sec: 40376.8). Total num frames: 81002496. Throughput: 0: 43291.2. Samples: 12412260. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-27 13:25:11,278][03472] Avg episode reward: [(0, '0.004')] [2024-06-27 13:25:14,608][03705] Updated weights for policy 0, policy_version 4953 (0.0038) [2024-06-27 13:25:16,278][03472] Fps is (10 sec: 44236.9, 60 sec: 43144.6, 300 sec: 40654.5). Total num frames: 81199104. Throughput: 0: 43489.5. Samples: 12672980. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 13:25:16,278][03472] Avg episode reward: [(0, '0.006')] [2024-06-27 13:25:18,183][03705] Updated weights for policy 0, policy_version 4963 (0.0028) [2024-06-27 13:25:21,278][03472] Fps is (10 sec: 40960.0, 60 sec: 43144.5, 300 sec: 40987.8). Total num frames: 81412096. Throughput: 0: 43442.3. Samples: 12799840. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 13:25:21,278][03472] Avg episode reward: [(0, '0.005')] [2024-06-27 13:25:22,543][03705] Updated weights for policy 0, policy_version 4973 (0.0034) [2024-06-27 13:25:26,106][03705] Updated weights for policy 0, policy_version 4983 (0.0036) [2024-06-27 13:25:26,278][03472] Fps is (10 sec: 44235.6, 60 sec: 43690.6, 300 sec: 41376.5). Total num frames: 81641472. Throughput: 0: 43321.7. Samples: 13061860. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 13:25:26,278][03472] Avg episode reward: [(0, '0.007')] [2024-06-27 13:25:26,371][03685] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000004984_81657856.pth... [2024-06-27 13:25:26,411][03685] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000004356_71368704.pth [2024-06-27 13:25:30,060][03705] Updated weights for policy 0, policy_version 4993 (0.0034) [2024-06-27 13:25:31,278][03472] Fps is (10 sec: 44237.4, 60 sec: 43690.7, 300 sec: 41543.2). Total num frames: 81854464. Throughput: 0: 43362.4. Samples: 13317160. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 13:25:31,278][03472] Avg episode reward: [(0, '0.003')] [2024-06-27 13:25:33,678][03705] Updated weights for policy 0, policy_version 5003 (0.0029) [2024-06-27 13:25:36,280][03472] Fps is (10 sec: 42589.8, 60 sec: 43144.0, 300 sec: 41653.9). Total num frames: 82067456. Throughput: 0: 43318.7. Samples: 13445200. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-27 13:25:36,281][03472] Avg episode reward: [(0, '0.005')] [2024-06-27 13:25:37,804][03705] Updated weights for policy 0, policy_version 5013 (0.0045) [2024-06-27 13:25:41,262][03705] Updated weights for policy 0, policy_version 5023 (0.0047) [2024-06-27 13:25:41,278][03472] Fps is (10 sec: 44236.6, 60 sec: 43417.7, 300 sec: 41876.4). Total num frames: 82296832. Throughput: 0: 43282.2. Samples: 13709360. Policy #0 lag: (min: 1.0, avg: 8.4, max: 21.0) [2024-06-27 13:25:41,278][03472] Avg episode reward: [(0, '0.005')] [2024-06-27 13:25:45,467][03705] Updated weights for policy 0, policy_version 5033 (0.0038) [2024-06-27 13:25:46,278][03472] Fps is (10 sec: 44246.6, 60 sec: 43690.7, 300 sec: 42098.5). Total num frames: 82509824. Throughput: 0: 43367.1. Samples: 13965680. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-27 13:25:46,278][03472] Avg episode reward: [(0, '0.005')] [2024-06-27 13:25:48,774][03705] Updated weights for policy 0, policy_version 5043 (0.0034) [2024-06-27 13:25:51,278][03472] Fps is (10 sec: 42598.0, 60 sec: 43417.5, 300 sec: 42154.1). Total num frames: 82722816. Throughput: 0: 43339.0. Samples: 14093360. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 13:25:51,278][03472] Avg episode reward: [(0, '0.004')] [2024-06-27 13:25:52,954][03705] Updated weights for policy 0, policy_version 5053 (0.0032) [2024-06-27 13:25:56,195][03705] Updated weights for policy 0, policy_version 5063 (0.0037) [2024-06-27 13:25:56,280][03472] Fps is (10 sec: 44226.9, 60 sec: 43416.0, 300 sec: 42320.4). Total num frames: 82952192. Throughput: 0: 43305.9. Samples: 14361120. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-27 13:25:56,280][03472] Avg episode reward: [(0, '0.004')] [2024-06-27 13:25:59,668][03685] Signal inference workers to stop experience collection... (200 times) [2024-06-27 13:25:59,668][03685] Signal inference workers to resume experience collection... (200 times) [2024-06-27 13:25:59,685][03705] InferenceWorker_p0-w0: stopping experience collection (200 times) [2024-06-27 13:25:59,685][03705] InferenceWorker_p0-w0: resuming experience collection (200 times) [2024-06-27 13:26:00,466][03705] Updated weights for policy 0, policy_version 5073 (0.0043) [2024-06-27 13:26:01,278][03472] Fps is (10 sec: 44237.2, 60 sec: 43690.7, 300 sec: 42487.3). Total num frames: 83165184. Throughput: 0: 43268.4. Samples: 14620060. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-27 13:26:01,278][03472] Avg episode reward: [(0, '0.004')] [2024-06-27 13:26:03,744][03705] Updated weights for policy 0, policy_version 5083 (0.0040) [2024-06-27 13:26:06,278][03472] Fps is (10 sec: 42608.0, 60 sec: 43690.6, 300 sec: 42598.4). Total num frames: 83378176. Throughput: 0: 43276.5. Samples: 14747280. Policy #0 lag: (min: 1.0, avg: 10.2, max: 22.0) [2024-06-27 13:26:06,278][03472] Avg episode reward: [(0, '0.005')] [2024-06-27 13:26:07,946][03705] Updated weights for policy 0, policy_version 5093 (0.0036) [2024-06-27 13:26:11,278][03472] Fps is (10 sec: 42598.7, 60 sec: 43144.6, 300 sec: 42709.5). Total num frames: 83591168. Throughput: 0: 43288.3. Samples: 15009820. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 13:26:11,278][03472] Avg episode reward: [(0, '0.004')] [2024-06-27 13:26:11,376][03705] Updated weights for policy 0, policy_version 5103 (0.0034) [2024-06-27 13:26:15,712][03705] Updated weights for policy 0, policy_version 5113 (0.0050) [2024-06-27 13:26:16,278][03472] Fps is (10 sec: 42598.7, 60 sec: 43417.6, 300 sec: 42876.1). Total num frames: 83804160. Throughput: 0: 43365.8. Samples: 15268620. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2024-06-27 13:26:16,278][03472] Avg episode reward: [(0, '0.004')] [2024-06-27 13:26:19,033][03705] Updated weights for policy 0, policy_version 5123 (0.0028) [2024-06-27 13:26:21,278][03472] Fps is (10 sec: 44236.5, 60 sec: 43690.7, 300 sec: 42931.6). Total num frames: 84033536. Throughput: 0: 43336.4. Samples: 15395240. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-27 13:26:21,278][03472] Avg episode reward: [(0, '0.005')] [2024-06-27 13:26:23,301][03705] Updated weights for policy 0, policy_version 5133 (0.0040) [2024-06-27 13:26:26,278][03472] Fps is (10 sec: 40959.5, 60 sec: 42871.6, 300 sec: 42931.6). Total num frames: 84213760. Throughput: 0: 43225.7. Samples: 15654520. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-27 13:26:26,278][03472] Avg episode reward: [(0, '0.007')] [2024-06-27 13:26:26,660][03705] Updated weights for policy 0, policy_version 5143 (0.0033) [2024-06-27 13:26:30,777][03705] Updated weights for policy 0, policy_version 5153 (0.0024) [2024-06-27 13:26:31,278][03472] Fps is (10 sec: 40959.7, 60 sec: 43144.4, 300 sec: 42987.2). Total num frames: 84443136. Throughput: 0: 43280.0. Samples: 15913280. Policy #0 lag: (min: 1.0, avg: 9.1, max: 21.0) [2024-06-27 13:26:31,278][03472] Avg episode reward: [(0, '0.005')] [2024-06-27 13:26:34,271][03705] Updated weights for policy 0, policy_version 5163 (0.0042) [2024-06-27 13:26:36,278][03472] Fps is (10 sec: 47513.3, 60 sec: 43692.2, 300 sec: 43209.3). Total num frames: 84688896. Throughput: 0: 43292.9. Samples: 16041540. Policy #0 lag: (min: 0.0, avg: 12.3, max: 21.0) [2024-06-27 13:26:36,278][03472] Avg episode reward: [(0, '0.007')] [2024-06-27 13:26:38,463][03705] Updated weights for policy 0, policy_version 5173 (0.0042) [2024-06-27 13:26:41,278][03472] Fps is (10 sec: 44237.4, 60 sec: 43144.6, 300 sec: 43153.8). Total num frames: 84885504. Throughput: 0: 43207.1. Samples: 16305340. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-27 13:26:41,278][03472] Avg episode reward: [(0, '0.004')] [2024-06-27 13:26:41,809][03705] Updated weights for policy 0, policy_version 5183 (0.0038) [2024-06-27 13:26:46,002][03705] Updated weights for policy 0, policy_version 5193 (0.0035) [2024-06-27 13:26:46,278][03472] Fps is (10 sec: 39322.3, 60 sec: 42871.5, 300 sec: 43209.4). Total num frames: 85082112. Throughput: 0: 43143.6. Samples: 16561520. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-27 13:26:46,278][03472] Avg episode reward: [(0, '0.006')] [2024-06-27 13:26:49,349][03705] Updated weights for policy 0, policy_version 5203 (0.0041) [2024-06-27 13:26:51,278][03472] Fps is (10 sec: 44236.6, 60 sec: 43417.7, 300 sec: 43264.9). Total num frames: 85327872. Throughput: 0: 43171.6. Samples: 16690000. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-27 13:26:51,278][03472] Avg episode reward: [(0, '0.005')] [2024-06-27 13:26:53,387][03705] Updated weights for policy 0, policy_version 5213 (0.0042) [2024-06-27 13:26:56,278][03472] Fps is (10 sec: 44236.6, 60 sec: 42873.1, 300 sec: 43264.9). Total num frames: 85524480. Throughput: 0: 43168.8. Samples: 16952420. Policy #0 lag: (min: 0.0, avg: 11.1, max: 20.0) [2024-06-27 13:26:56,278][03472] Avg episode reward: [(0, '0.003')] [2024-06-27 13:26:56,885][03705] Updated weights for policy 0, policy_version 5223 (0.0032) [2024-06-27 13:27:00,842][03705] Updated weights for policy 0, policy_version 5233 (0.0030) [2024-06-27 13:27:01,278][03472] Fps is (10 sec: 40959.9, 60 sec: 42871.5, 300 sec: 43264.9). Total num frames: 85737472. Throughput: 0: 43184.4. Samples: 17211920. Policy #0 lag: (min: 0.0, avg: 10.9, max: 20.0) [2024-06-27 13:27:01,278][03472] Avg episode reward: [(0, '0.004')] [2024-06-27 13:27:04,387][03705] Updated weights for policy 0, policy_version 5243 (0.0028) [2024-06-27 13:27:06,278][03472] Fps is (10 sec: 45874.8, 60 sec: 43417.6, 300 sec: 43375.9). Total num frames: 85983232. Throughput: 0: 43277.7. Samples: 17342740. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-27 13:27:06,278][03472] Avg episode reward: [(0, '0.004')] [2024-06-27 13:27:08,566][03705] Updated weights for policy 0, policy_version 5253 (0.0019) [2024-06-27 13:27:11,278][03472] Fps is (10 sec: 44236.7, 60 sec: 43144.5, 300 sec: 43320.7). Total num frames: 86179840. Throughput: 0: 43527.1. Samples: 17613240. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-27 13:27:11,280][03472] Avg episode reward: [(0, '0.004')] [2024-06-27 13:27:11,776][03705] Updated weights for policy 0, policy_version 5263 (0.0035) [2024-06-27 13:27:15,851][03705] Updated weights for policy 0, policy_version 5273 (0.0029) [2024-06-27 13:27:16,278][03472] Fps is (10 sec: 42598.4, 60 sec: 43417.5, 300 sec: 43320.4). Total num frames: 86409216. Throughput: 0: 43545.8. Samples: 17872840. Policy #0 lag: (min: 0.0, avg: 11.4, max: 22.0) [2024-06-27 13:27:16,278][03472] Avg episode reward: [(0, '0.005')] [2024-06-27 13:27:19,099][03685] Signal inference workers to stop experience collection... (250 times) [2024-06-27 13:27:19,145][03705] InferenceWorker_p0-w0: stopping experience collection (250 times) [2024-06-27 13:27:19,153][03685] Signal inference workers to resume experience collection... (250 times) [2024-06-27 13:27:19,155][03705] InferenceWorker_p0-w0: resuming experience collection (250 times) [2024-06-27 13:27:19,316][03705] Updated weights for policy 0, policy_version 5283 (0.0032) [2024-06-27 13:27:21,278][03472] Fps is (10 sec: 45875.0, 60 sec: 43417.6, 300 sec: 43320.4). Total num frames: 86638592. Throughput: 0: 43617.4. Samples: 18004320. Policy #0 lag: (min: 0.0, avg: 12.5, max: 22.0) [2024-06-27 13:27:21,278][03472] Avg episode reward: [(0, '0.005')] [2024-06-27 13:27:23,367][03705] Updated weights for policy 0, policy_version 5293 (0.0030) [2024-06-27 13:27:26,278][03472] Fps is (10 sec: 42598.5, 60 sec: 43690.7, 300 sec: 43320.4). Total num frames: 86835200. Throughput: 0: 43605.2. Samples: 18267580. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 13:27:26,278][03472] Avg episode reward: [(0, '0.004')] [2024-06-27 13:27:26,323][03685] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000005301_86851584.pth... [2024-06-27 13:27:26,373][03685] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000004667_76464128.pth [2024-06-27 13:27:26,834][03705] Updated weights for policy 0, policy_version 5303 (0.0029) [2024-06-27 13:27:30,895][03705] Updated weights for policy 0, policy_version 5313 (0.0021) [2024-06-27 13:27:31,280][03472] Fps is (10 sec: 40951.2, 60 sec: 43416.0, 300 sec: 43320.1). Total num frames: 87048192. Throughput: 0: 43569.3. Samples: 18522240. Policy #0 lag: (min: 1.0, avg: 10.0, max: 22.0) [2024-06-27 13:27:31,280][03472] Avg episode reward: [(0, '0.004')] [2024-06-27 13:27:34,367][03705] Updated weights for policy 0, policy_version 5323 (0.0037) [2024-06-27 13:27:36,278][03472] Fps is (10 sec: 45875.2, 60 sec: 43417.6, 300 sec: 43320.4). Total num frames: 87293952. Throughput: 0: 43607.0. Samples: 18652320. Policy #0 lag: (min: 0.0, avg: 12.4, max: 22.0) [2024-06-27 13:27:36,278][03472] Avg episode reward: [(0, '0.005')] [2024-06-27 13:27:38,419][03705] Updated weights for policy 0, policy_version 5333 (0.0034) [2024-06-27 13:27:41,278][03472] Fps is (10 sec: 45885.2, 60 sec: 43690.6, 300 sec: 43431.5). Total num frames: 87506944. Throughput: 0: 43833.3. Samples: 18924920. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 13:27:41,278][03472] Avg episode reward: [(0, '0.005')] [2024-06-27 13:27:41,784][03705] Updated weights for policy 0, policy_version 5343 (0.0025) [2024-06-27 13:27:45,804][03705] Updated weights for policy 0, policy_version 5353 (0.0036) [2024-06-27 13:27:46,278][03472] Fps is (10 sec: 40959.5, 60 sec: 43690.5, 300 sec: 43264.8). Total num frames: 87703552. Throughput: 0: 43752.7. Samples: 19180800. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 13:27:46,278][03472] Avg episode reward: [(0, '0.007')] [2024-06-27 13:27:49,292][03705] Updated weights for policy 0, policy_version 5363 (0.0034) [2024-06-27 13:27:51,278][03472] Fps is (10 sec: 42598.7, 60 sec: 43417.6, 300 sec: 43320.4). Total num frames: 87932928. Throughput: 0: 43729.9. Samples: 19310580. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-27 13:27:51,278][03472] Avg episode reward: [(0, '0.006')] [2024-06-27 13:27:53,372][03705] Updated weights for policy 0, policy_version 5373 (0.0030) [2024-06-27 13:27:56,278][03472] Fps is (10 sec: 42598.9, 60 sec: 43417.6, 300 sec: 43320.4). Total num frames: 88129536. Throughput: 0: 43497.8. Samples: 19570640. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-27 13:27:56,278][03472] Avg episode reward: [(0, '0.005')] [2024-06-27 13:27:57,075][03705] Updated weights for policy 0, policy_version 5383 (0.0027) [2024-06-27 13:28:00,905][03705] Updated weights for policy 0, policy_version 5393 (0.0040) [2024-06-27 13:28:01,278][03472] Fps is (10 sec: 42598.0, 60 sec: 43690.6, 300 sec: 43264.9). Total num frames: 88358912. Throughput: 0: 43392.4. Samples: 19825500. Policy #0 lag: (min: 1.0, avg: 10.5, max: 21.0) [2024-06-27 13:28:01,278][03472] Avg episode reward: [(0, '0.006')] [2024-06-27 13:28:04,596][03705] Updated weights for policy 0, policy_version 5403 (0.0032) [2024-06-27 13:28:06,278][03472] Fps is (10 sec: 45874.9, 60 sec: 43417.6, 300 sec: 43320.4). Total num frames: 88588288. Throughput: 0: 43415.1. Samples: 19958000. Policy #0 lag: (min: 1.0, avg: 9.3, max: 21.0) [2024-06-27 13:28:06,287][03472] Avg episode reward: [(0, '0.007')] [2024-06-27 13:28:08,459][03705] Updated weights for policy 0, policy_version 5413 (0.0043) [2024-06-27 13:28:11,278][03472] Fps is (10 sec: 42597.5, 60 sec: 43417.4, 300 sec: 43320.4). Total num frames: 88784896. Throughput: 0: 43355.8. Samples: 20218600. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 13:28:11,278][03472] Avg episode reward: [(0, '0.008')] [2024-06-27 13:28:11,422][03685] Saving new best policy, reward=0.008! [2024-06-27 13:28:12,166][03705] Updated weights for policy 0, policy_version 5423 (0.0035) [2024-06-27 13:28:15,978][03705] Updated weights for policy 0, policy_version 5433 (0.0029) [2024-06-27 13:28:16,280][03472] Fps is (10 sec: 42589.3, 60 sec: 43416.0, 300 sec: 43264.9). Total num frames: 89014272. Throughput: 0: 43423.1. Samples: 20476280. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 13:28:16,280][03472] Avg episode reward: [(0, '0.006')] [2024-06-27 13:28:19,483][03705] Updated weights for policy 0, policy_version 5443 (0.0040) [2024-06-27 13:28:21,278][03472] Fps is (10 sec: 45876.3, 60 sec: 43417.6, 300 sec: 43320.4). Total num frames: 89243648. Throughput: 0: 43468.5. Samples: 20608400. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 13:28:21,278][03472] Avg episode reward: [(0, '0.006')] [2024-06-27 13:28:23,461][03705] Updated weights for policy 0, policy_version 5453 (0.0040) [2024-06-27 13:28:26,278][03472] Fps is (10 sec: 44246.7, 60 sec: 43690.7, 300 sec: 43375.9). Total num frames: 89456640. Throughput: 0: 43219.1. Samples: 20869780. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 13:28:26,278][03472] Avg episode reward: [(0, '0.007')] [2024-06-27 13:28:26,991][03705] Updated weights for policy 0, policy_version 5463 (0.0037) [2024-06-27 13:28:31,043][03705] Updated weights for policy 0, policy_version 5473 (0.0039) [2024-06-27 13:28:31,278][03472] Fps is (10 sec: 42598.2, 60 sec: 43692.2, 300 sec: 43320.4). Total num frames: 89669632. Throughput: 0: 43232.5. Samples: 21126260. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-27 13:28:31,278][03472] Avg episode reward: [(0, '0.007')] [2024-06-27 13:28:34,519][03705] Updated weights for policy 0, policy_version 5483 (0.0033) [2024-06-27 13:28:36,278][03472] Fps is (10 sec: 44237.1, 60 sec: 43417.7, 300 sec: 43376.3). Total num frames: 89899008. Throughput: 0: 43223.1. Samples: 21255620. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-27 13:28:36,278][03472] Avg episode reward: [(0, '0.007')] [2024-06-27 13:28:38,604][03705] Updated weights for policy 0, policy_version 5493 (0.0038) [2024-06-27 13:28:41,278][03472] Fps is (10 sec: 42598.6, 60 sec: 43144.5, 300 sec: 43376.1). Total num frames: 90095616. Throughput: 0: 43236.9. Samples: 21516300. Policy #0 lag: (min: 1.0, avg: 9.3, max: 20.0) [2024-06-27 13:28:41,278][03472] Avg episode reward: [(0, '0.008')] [2024-06-27 13:28:42,402][03705] Updated weights for policy 0, policy_version 5503 (0.0040) [2024-06-27 13:28:46,278][03472] Fps is (10 sec: 40959.5, 60 sec: 43417.7, 300 sec: 43265.0). Total num frames: 90308608. Throughput: 0: 43357.3. Samples: 21776580. Policy #0 lag: (min: 1.0, avg: 9.3, max: 20.0) [2024-06-27 13:28:46,278][03472] Avg episode reward: [(0, '0.007')] [2024-06-27 13:28:46,479][03705] Updated weights for policy 0, policy_version 5513 (0.0047) [2024-06-27 13:28:47,390][03685] Signal inference workers to stop experience collection... (300 times) [2024-06-27 13:28:47,391][03685] Signal inference workers to resume experience collection... (300 times) [2024-06-27 13:28:47,414][03705] InferenceWorker_p0-w0: stopping experience collection (300 times) [2024-06-27 13:28:47,414][03705] InferenceWorker_p0-w0: resuming experience collection (300 times) [2024-06-27 13:28:50,380][03705] Updated weights for policy 0, policy_version 5523 (0.0037) [2024-06-27 13:28:51,278][03472] Fps is (10 sec: 45872.1, 60 sec: 43690.1, 300 sec: 43431.4). Total num frames: 90554368. Throughput: 0: 43306.1. Samples: 21906800. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 13:28:51,279][03472] Avg episode reward: [(0, '0.004')] [2024-06-27 13:28:53,910][03705] Updated weights for policy 0, policy_version 5533 (0.0036) [2024-06-27 13:28:56,278][03472] Fps is (10 sec: 42598.7, 60 sec: 43417.6, 300 sec: 43264.9). Total num frames: 90734592. Throughput: 0: 43267.8. Samples: 22165640. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 13:28:56,278][03472] Avg episode reward: [(0, '0.006')] [2024-06-27 13:28:57,972][03705] Updated weights for policy 0, policy_version 5543 (0.0034) [2024-06-27 13:29:01,278][03472] Fps is (10 sec: 39324.2, 60 sec: 43144.5, 300 sec: 43209.6). Total num frames: 90947584. Throughput: 0: 43219.0. Samples: 22421040. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-27 13:29:01,278][03472] Avg episode reward: [(0, '0.006')] [2024-06-27 13:29:01,715][03705] Updated weights for policy 0, policy_version 5553 (0.0033) [2024-06-27 13:29:05,439][03705] Updated weights for policy 0, policy_version 5563 (0.0031) [2024-06-27 13:29:06,278][03472] Fps is (10 sec: 45874.8, 60 sec: 43417.6, 300 sec: 43375.9). Total num frames: 91193344. Throughput: 0: 43064.0. Samples: 22546280. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 13:29:06,278][03472] Avg episode reward: [(0, '0.007')] [2024-06-27 13:29:09,677][03705] Updated weights for policy 0, policy_version 5573 (0.0033) [2024-06-27 13:29:11,278][03472] Fps is (10 sec: 44237.2, 60 sec: 43417.8, 300 sec: 43320.4). Total num frames: 91389952. Throughput: 0: 43092.1. Samples: 22808920. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 13:29:11,278][03472] Avg episode reward: [(0, '0.005')] [2024-06-27 13:29:12,919][03705] Updated weights for policy 0, policy_version 5583 (0.0027) [2024-06-27 13:29:16,284][03472] Fps is (10 sec: 40934.6, 60 sec: 43141.6, 300 sec: 43319.5). Total num frames: 91602944. Throughput: 0: 43113.6. Samples: 23066640. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 13:29:16,285][03472] Avg episode reward: [(0, '0.003')] [2024-06-27 13:29:17,231][03705] Updated weights for policy 0, policy_version 5593 (0.0027) [2024-06-27 13:29:20,521][03705] Updated weights for policy 0, policy_version 5603 (0.0040) [2024-06-27 13:29:21,278][03472] Fps is (10 sec: 44236.6, 60 sec: 43144.6, 300 sec: 43431.5). Total num frames: 91832320. Throughput: 0: 43039.9. Samples: 23192420. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 13:29:21,278][03472] Avg episode reward: [(0, '0.006')] [2024-06-27 13:29:24,732][03705] Updated weights for policy 0, policy_version 5613 (0.0031) [2024-06-27 13:29:26,278][03472] Fps is (10 sec: 42624.9, 60 sec: 42871.4, 300 sec: 43375.9). Total num frames: 92028928. Throughput: 0: 43081.3. Samples: 23454960. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 13:29:26,278][03472] Avg episode reward: [(0, '0.006')] [2024-06-27 13:29:26,320][03685] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000005618_92045312.pth... [2024-06-27 13:29:26,367][03685] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000004984_81657856.pth [2024-06-27 13:29:28,032][03705] Updated weights for policy 0, policy_version 5623 (0.0041) [2024-06-27 13:29:31,278][03472] Fps is (10 sec: 40959.9, 60 sec: 42871.5, 300 sec: 43265.1). Total num frames: 92241920. Throughput: 0: 42970.3. Samples: 23710240. Policy #0 lag: (min: 1.0, avg: 10.6, max: 24.0) [2024-06-27 13:29:31,278][03472] Avg episode reward: [(0, '0.005')] [2024-06-27 13:29:32,238][03705] Updated weights for policy 0, policy_version 5633 (0.0044) [2024-06-27 13:29:35,561][03705] Updated weights for policy 0, policy_version 5643 (0.0050) [2024-06-27 13:29:36,278][03472] Fps is (10 sec: 44237.0, 60 sec: 42871.4, 300 sec: 43320.4). Total num frames: 92471296. Throughput: 0: 42947.8. Samples: 23839420. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 13:29:36,278][03472] Avg episode reward: [(0, '0.006')] [2024-06-27 13:29:39,999][03705] Updated weights for policy 0, policy_version 5653 (0.0035) [2024-06-27 13:29:41,280][03472] Fps is (10 sec: 44226.9, 60 sec: 43142.9, 300 sec: 43375.6). Total num frames: 92684288. Throughput: 0: 43023.6. Samples: 24101800. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-27 13:29:41,280][03472] Avg episode reward: [(0, '0.005')] [2024-06-27 13:29:43,077][03705] Updated weights for policy 0, policy_version 5663 (0.0030) [2024-06-27 13:29:46,278][03472] Fps is (10 sec: 40960.0, 60 sec: 42871.5, 300 sec: 43264.9). Total num frames: 92880896. Throughput: 0: 43051.1. Samples: 24358340. Policy #0 lag: (min: 1.0, avg: 11.4, max: 22.0) [2024-06-27 13:29:46,278][03472] Avg episode reward: [(0, '0.006')] [2024-06-27 13:29:47,501][03705] Updated weights for policy 0, policy_version 5673 (0.0039) [2024-06-27 13:29:50,674][03705] Updated weights for policy 0, policy_version 5683 (0.0036) [2024-06-27 13:29:51,278][03472] Fps is (10 sec: 44246.5, 60 sec: 42871.9, 300 sec: 43320.4). Total num frames: 93126656. Throughput: 0: 43122.2. Samples: 24486780. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-27 13:29:51,278][03472] Avg episode reward: [(0, '0.009')] [2024-06-27 13:29:51,279][03685] Saving new best policy, reward=0.009! [2024-06-27 13:29:55,317][03705] Updated weights for policy 0, policy_version 5693 (0.0034) [2024-06-27 13:29:56,280][03472] Fps is (10 sec: 45865.2, 60 sec: 43416.0, 300 sec: 43375.6). Total num frames: 93339648. Throughput: 0: 43164.9. Samples: 24751440. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 13:29:56,280][03472] Avg episode reward: [(0, '0.004')] [2024-06-27 13:29:58,288][03705] Updated weights for policy 0, policy_version 5703 (0.0043) [2024-06-27 13:30:01,278][03472] Fps is (10 sec: 39321.5, 60 sec: 42871.4, 300 sec: 43264.8). Total num frames: 93519872. Throughput: 0: 43122.8. Samples: 25006900. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 13:30:01,278][03472] Avg episode reward: [(0, '0.005')] [2024-06-27 13:30:02,850][03705] Updated weights for policy 0, policy_version 5713 (0.0040) [2024-06-27 13:30:05,991][03705] Updated weights for policy 0, policy_version 5723 (0.0047) [2024-06-27 13:30:06,280][03472] Fps is (10 sec: 42598.1, 60 sec: 42869.9, 300 sec: 43264.5). Total num frames: 93765632. Throughput: 0: 42969.4. Samples: 25126140. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-27 13:30:06,281][03472] Avg episode reward: [(0, '0.007')] [2024-06-27 13:30:10,433][03705] Updated weights for policy 0, policy_version 5733 (0.0028) [2024-06-27 13:30:11,278][03472] Fps is (10 sec: 45875.8, 60 sec: 43144.5, 300 sec: 43320.4). Total num frames: 93978624. Throughput: 0: 43002.3. Samples: 25390060. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-27 13:30:11,278][03472] Avg episode reward: [(0, '0.006')] [2024-06-27 13:30:13,758][03705] Updated weights for policy 0, policy_version 5743 (0.0044) [2024-06-27 13:30:16,284][03472] Fps is (10 sec: 39306.2, 60 sec: 42598.4, 300 sec: 43208.4). Total num frames: 94158848. Throughput: 0: 43085.2. Samples: 25649340. Policy #0 lag: (min: 1.0, avg: 11.3, max: 22.0) [2024-06-27 13:30:16,293][03472] Avg episode reward: [(0, '0.005')] [2024-06-27 13:30:18,139][03705] Updated weights for policy 0, policy_version 5753 (0.0040) [2024-06-27 13:30:18,381][03685] Signal inference workers to stop experience collection... (350 times) [2024-06-27 13:30:18,411][03705] InferenceWorker_p0-w0: stopping experience collection (350 times) [2024-06-27 13:30:18,427][03685] Signal inference workers to resume experience collection... (350 times) [2024-06-27 13:30:18,428][03705] InferenceWorker_p0-w0: resuming experience collection (350 times) [2024-06-27 13:30:21,278][03472] Fps is (10 sec: 42598.4, 60 sec: 42871.5, 300 sec: 43264.9). Total num frames: 94404608. Throughput: 0: 42856.1. Samples: 25767940. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-27 13:30:21,278][03472] Avg episode reward: [(0, '0.006')] [2024-06-27 13:30:21,330][03705] Updated weights for policy 0, policy_version 5763 (0.0039) [2024-06-27 13:30:25,641][03705] Updated weights for policy 0, policy_version 5773 (0.0030) [2024-06-27 13:30:26,278][03472] Fps is (10 sec: 44264.1, 60 sec: 42871.5, 300 sec: 43209.3). Total num frames: 94601216. Throughput: 0: 43035.0. Samples: 26038280. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 13:30:26,278][03472] Avg episode reward: [(0, '0.005')] [2024-06-27 13:30:28,810][03705] Updated weights for policy 0, policy_version 5783 (0.0033) [2024-06-27 13:30:31,278][03472] Fps is (10 sec: 40959.8, 60 sec: 42871.5, 300 sec: 43209.7). Total num frames: 94814208. Throughput: 0: 42971.1. Samples: 26292040. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-27 13:30:31,278][03472] Avg episode reward: [(0, '0.006')] [2024-06-27 13:30:33,198][03705] Updated weights for policy 0, policy_version 5793 (0.0039) [2024-06-27 13:30:36,278][03472] Fps is (10 sec: 45875.6, 60 sec: 43144.6, 300 sec: 43264.9). Total num frames: 95059968. Throughput: 0: 43015.6. Samples: 26422480. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 13:30:36,278][03472] Avg episode reward: [(0, '0.005')] [2024-06-27 13:30:36,386][03705] Updated weights for policy 0, policy_version 5803 (0.0023) [2024-06-27 13:30:40,823][03705] Updated weights for policy 0, policy_version 5813 (0.0039) [2024-06-27 13:30:41,278][03472] Fps is (10 sec: 44236.4, 60 sec: 42873.0, 300 sec: 43209.3). Total num frames: 95256576. Throughput: 0: 43023.8. Samples: 26687420. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-27 13:30:41,278][03472] Avg episode reward: [(0, '0.006')] [2024-06-27 13:30:43,865][03705] Updated weights for policy 0, policy_version 5823 (0.0028) [2024-06-27 13:30:46,278][03472] Fps is (10 sec: 40960.0, 60 sec: 43144.6, 300 sec: 43209.3). Total num frames: 95469568. Throughput: 0: 43037.0. Samples: 26943560. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 13:30:46,278][03472] Avg episode reward: [(0, '0.004')] [2024-06-27 13:30:48,335][03705] Updated weights for policy 0, policy_version 5833 (0.0043) [2024-06-27 13:30:51,278][03472] Fps is (10 sec: 45875.2, 60 sec: 43144.5, 300 sec: 43265.2). Total num frames: 95715328. Throughput: 0: 43242.1. Samples: 27071940. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 13:30:51,278][03472] Avg episode reward: [(0, '0.005')] [2024-06-27 13:30:51,573][03705] Updated weights for policy 0, policy_version 5843 (0.0031) [2024-06-27 13:30:55,877][03705] Updated weights for policy 0, policy_version 5853 (0.0041) [2024-06-27 13:30:56,278][03472] Fps is (10 sec: 44237.0, 60 sec: 42873.1, 300 sec: 43209.3). Total num frames: 95911936. Throughput: 0: 43280.5. Samples: 27337680. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-27 13:30:56,278][03472] Avg episode reward: [(0, '0.004')] [2024-06-27 13:30:59,246][03705] Updated weights for policy 0, policy_version 5863 (0.0036) [2024-06-27 13:31:01,278][03472] Fps is (10 sec: 39322.1, 60 sec: 43144.6, 300 sec: 43153.8). Total num frames: 96108544. Throughput: 0: 43198.4. Samples: 27593000. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 13:31:01,278][03472] Avg episode reward: [(0, '0.005')] [2024-06-27 13:31:03,666][03705] Updated weights for policy 0, policy_version 5873 (0.0032) [2024-06-27 13:31:06,280][03472] Fps is (10 sec: 44227.9, 60 sec: 43144.8, 300 sec: 43264.6). Total num frames: 96354304. Throughput: 0: 43289.6. Samples: 27716060. Policy #0 lag: (min: 1.0, avg: 9.2, max: 21.0) [2024-06-27 13:31:06,280][03472] Avg episode reward: [(0, '0.008')] [2024-06-27 13:31:06,978][03705] Updated weights for policy 0, policy_version 5883 (0.0033) [2024-06-27 13:31:11,066][03705] Updated weights for policy 0, policy_version 5893 (0.0035) [2024-06-27 13:31:11,278][03472] Fps is (10 sec: 44236.4, 60 sec: 42871.4, 300 sec: 43209.3). Total num frames: 96550912. Throughput: 0: 43334.7. Samples: 27988340. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 13:31:11,278][03472] Avg episode reward: [(0, '0.009')] [2024-06-27 13:31:14,861][03705] Updated weights for policy 0, policy_version 5903 (0.0048) [2024-06-27 13:31:16,278][03472] Fps is (10 sec: 40967.7, 60 sec: 43422.0, 300 sec: 43153.8). Total num frames: 96763904. Throughput: 0: 43195.9. Samples: 28235860. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-27 13:31:16,282][03472] Avg episode reward: [(0, '0.006')] [2024-06-27 13:31:18,879][03705] Updated weights for policy 0, policy_version 5913 (0.0031) [2024-06-27 13:31:21,278][03472] Fps is (10 sec: 45875.6, 60 sec: 43417.6, 300 sec: 43376.0). Total num frames: 97009664. Throughput: 0: 43160.0. Samples: 28364680. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 13:31:21,278][03472] Avg episode reward: [(0, '0.006')] [2024-06-27 13:31:22,602][03705] Updated weights for policy 0, policy_version 5923 (0.0031) [2024-06-27 13:31:26,278][03472] Fps is (10 sec: 42598.3, 60 sec: 43144.5, 300 sec: 43209.3). Total num frames: 97189888. Throughput: 0: 43116.0. Samples: 28627640. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 13:31:26,278][03472] Avg episode reward: [(0, '0.007')] [2024-06-27 13:31:26,292][03685] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000005932_97189888.pth... [2024-06-27 13:31:26,360][03685] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000005301_86851584.pth [2024-06-27 13:31:26,511][03705] Updated weights for policy 0, policy_version 5933 (0.0025) [2024-06-27 13:31:30,124][03705] Updated weights for policy 0, policy_version 5943 (0.0023) [2024-06-27 13:31:31,278][03472] Fps is (10 sec: 39321.8, 60 sec: 43144.6, 300 sec: 43098.3). Total num frames: 97402880. Throughput: 0: 43019.1. Samples: 28879420. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-27 13:31:31,278][03472] Avg episode reward: [(0, '0.007')] [2024-06-27 13:31:33,955][03705] Updated weights for policy 0, policy_version 5953 (0.0034) [2024-06-27 13:31:36,278][03472] Fps is (10 sec: 47514.4, 60 sec: 43417.6, 300 sec: 43320.4). Total num frames: 97665024. Throughput: 0: 43129.9. Samples: 29012780. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-27 13:31:36,278][03472] Avg episode reward: [(0, '0.006')] [2024-06-27 13:31:37,622][03705] Updated weights for policy 0, policy_version 5963 (0.0036) [2024-06-27 13:31:39,959][03685] Signal inference workers to stop experience collection... (400 times) [2024-06-27 13:31:40,013][03705] InferenceWorker_p0-w0: stopping experience collection (400 times) [2024-06-27 13:31:40,016][03685] Signal inference workers to resume experience collection... (400 times) [2024-06-27 13:31:40,039][03705] InferenceWorker_p0-w0: resuming experience collection (400 times) [2024-06-27 13:31:41,278][03472] Fps is (10 sec: 42598.4, 60 sec: 42871.6, 300 sec: 43209.3). Total num frames: 97828864. Throughput: 0: 43123.1. Samples: 29278220. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 13:31:41,278][03472] Avg episode reward: [(0, '0.006')] [2024-06-27 13:31:41,618][03705] Updated weights for policy 0, policy_version 5973 (0.0046) [2024-06-27 13:31:45,102][03705] Updated weights for policy 0, policy_version 5983 (0.0035) [2024-06-27 13:31:46,278][03472] Fps is (10 sec: 39320.7, 60 sec: 43144.4, 300 sec: 43153.8). Total num frames: 98058240. Throughput: 0: 43148.7. Samples: 29534700. Policy #0 lag: (min: 0.0, avg: 12.0, max: 23.0) [2024-06-27 13:31:46,279][03472] Avg episode reward: [(0, '0.008')] [2024-06-27 13:31:49,061][03705] Updated weights for policy 0, policy_version 5993 (0.0048) [2024-06-27 13:31:51,278][03472] Fps is (10 sec: 49151.3, 60 sec: 43417.6, 300 sec: 43375.9). Total num frames: 98320384. Throughput: 0: 43324.0. Samples: 29665560. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 13:31:51,278][03472] Avg episode reward: [(0, '0.008')] [2024-06-27 13:31:52,628][03705] Updated weights for policy 0, policy_version 6003 (0.0041) [2024-06-27 13:31:56,278][03472] Fps is (10 sec: 44237.5, 60 sec: 43144.5, 300 sec: 43264.9). Total num frames: 98500608. Throughput: 0: 43170.7. Samples: 29931020. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-27 13:31:56,278][03472] Avg episode reward: [(0, '0.007')] [2024-06-27 13:31:56,387][03705] Updated weights for policy 0, policy_version 6013 (0.0030) [2024-06-27 13:32:00,059][03705] Updated weights for policy 0, policy_version 6023 (0.0034) [2024-06-27 13:32:01,278][03472] Fps is (10 sec: 39321.4, 60 sec: 43417.5, 300 sec: 43153.8). Total num frames: 98713600. Throughput: 0: 43419.1. Samples: 30189720. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-27 13:32:01,278][03472] Avg episode reward: [(0, '0.002')] [2024-06-27 13:32:03,875][03705] Updated weights for policy 0, policy_version 6033 (0.0039) [2024-06-27 13:32:06,278][03472] Fps is (10 sec: 44237.1, 60 sec: 43146.0, 300 sec: 43264.9). Total num frames: 98942976. Throughput: 0: 43324.0. Samples: 30314260. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 13:32:06,278][03472] Avg episode reward: [(0, '0.004')] [2024-06-27 13:32:07,478][03705] Updated weights for policy 0, policy_version 6043 (0.0021) [2024-06-27 13:32:11,278][03472] Fps is (10 sec: 44237.5, 60 sec: 43417.7, 300 sec: 43209.3). Total num frames: 99155968. Throughput: 0: 43370.8. Samples: 30579320. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 13:32:11,278][03472] Avg episode reward: [(0, '0.004')] [2024-06-27 13:32:11,381][03705] Updated weights for policy 0, policy_version 6053 (0.0055) [2024-06-27 13:32:14,984][03705] Updated weights for policy 0, policy_version 6063 (0.0029) [2024-06-27 13:32:16,278][03472] Fps is (10 sec: 40959.7, 60 sec: 43144.6, 300 sec: 43098.3). Total num frames: 99352576. Throughput: 0: 43540.8. Samples: 30838760. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2024-06-27 13:32:16,278][03472] Avg episode reward: [(0, '0.005')] [2024-06-27 13:32:18,876][03705] Updated weights for policy 0, policy_version 6073 (0.0043) [2024-06-27 13:32:21,278][03472] Fps is (10 sec: 42598.3, 60 sec: 42871.5, 300 sec: 43209.3). Total num frames: 99581952. Throughput: 0: 43415.1. Samples: 30966460. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-27 13:32:21,278][03472] Avg episode reward: [(0, '0.005')] [2024-06-27 13:32:22,478][03705] Updated weights for policy 0, policy_version 6083 (0.0035) [2024-06-27 13:32:26,278][03472] Fps is (10 sec: 45875.5, 60 sec: 43690.8, 300 sec: 43265.2). Total num frames: 99811328. Throughput: 0: 43419.1. Samples: 31232080. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2024-06-27 13:32:26,278][03472] Avg episode reward: [(0, '0.002')] [2024-06-27 13:32:26,331][03705] Updated weights for policy 0, policy_version 6093 (0.0034) [2024-06-27 13:32:30,099][03705] Updated weights for policy 0, policy_version 6103 (0.0021) [2024-06-27 13:32:31,278][03472] Fps is (10 sec: 42598.3, 60 sec: 43417.5, 300 sec: 43098.3). Total num frames: 100007936. Throughput: 0: 43361.5. Samples: 31485960. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2024-06-27 13:32:31,278][03472] Avg episode reward: [(0, '0.002')] [2024-06-27 13:32:33,857][03705] Updated weights for policy 0, policy_version 6113 (0.0028) [2024-06-27 13:32:36,278][03472] Fps is (10 sec: 42598.5, 60 sec: 42871.5, 300 sec: 43153.8). Total num frames: 100237312. Throughput: 0: 43379.3. Samples: 31617620. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 13:32:36,278][03472] Avg episode reward: [(0, '0.004')] [2024-06-27 13:32:37,599][03705] Updated weights for policy 0, policy_version 6123 (0.0028) [2024-06-27 13:32:41,278][03472] Fps is (10 sec: 45875.4, 60 sec: 43963.7, 300 sec: 43264.9). Total num frames: 100466688. Throughput: 0: 43429.4. Samples: 31885340. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 13:32:41,278][03472] Avg episode reward: [(0, '0.004')] [2024-06-27 13:32:41,493][03705] Updated weights for policy 0, policy_version 6133 (0.0043) [2024-06-27 13:41:51,148][06674] Saving configuration to ./train_dir/sample_factory/p2.sf/config.json... [2024-06-27 13:41:51,181][06674] Rollout worker 0 uses device cpu [2024-06-27 13:41:51,181][06674] Rollout worker 1 uses device cpu [2024-06-27 13:41:51,181][06674] Rollout worker 2 uses device cpu [2024-06-27 13:41:51,182][06674] Rollout worker 3 uses device cpu [2024-06-27 13:41:51,182][06674] Rollout worker 4 uses device cpu [2024-06-27 13:41:51,182][06674] Rollout worker 5 uses device cpu [2024-06-27 13:41:51,182][06674] Rollout worker 6 uses device cpu [2024-06-27 13:41:51,182][06674] Rollout worker 7 uses device cpu [2024-06-27 13:41:51,182][06674] Rollout worker 8 uses device cpu [2024-06-27 13:41:51,182][06674] Rollout worker 9 uses device cpu [2024-06-27 13:41:51,182][06674] Rollout worker 10 uses device cpu [2024-06-27 13:41:51,183][06674] Rollout worker 11 uses device cpu [2024-06-27 13:41:51,183][06674] Rollout worker 12 uses device cpu [2024-06-27 13:41:51,183][06674] Rollout worker 13 uses device cpu [2024-06-27 13:41:51,183][06674] Rollout worker 14 uses device cpu [2024-06-27 13:41:51,183][06674] Rollout worker 15 uses device cpu [2024-06-27 13:41:51,183][06674] Rollout worker 16 uses device cpu [2024-06-27 13:41:51,183][06674] Rollout worker 17 uses device cpu [2024-06-27 13:41:51,184][06674] Rollout worker 18 uses device cpu [2024-06-27 13:41:51,184][06674] Rollout worker 19 uses device cpu [2024-06-27 13:41:51,184][06674] Rollout worker 20 uses device cpu [2024-06-27 13:41:51,184][06674] Rollout worker 21 uses device cpu [2024-06-27 13:41:51,184][06674] Rollout worker 22 uses device cpu [2024-06-27 13:41:51,184][06674] Rollout worker 23 uses device cpu [2024-06-27 13:41:51,184][06674] Rollout worker 24 uses device cpu [2024-06-27 13:41:51,184][06674] Rollout worker 25 uses device cpu [2024-06-27 13:41:51,185][06674] Rollout worker 26 uses device cpu [2024-06-27 13:41:51,185][06674] Rollout worker 27 uses device cpu [2024-06-27 13:41:51,185][06674] Rollout worker 28 uses device cpu [2024-06-27 13:41:51,185][06674] Rollout worker 29 uses device cpu [2024-06-27 13:41:51,185][06674] Rollout worker 30 uses device cpu [2024-06-27 13:41:51,185][06674] Rollout worker 31 uses device cpu [2024-06-27 13:41:51,768][06674] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2024-06-27 13:41:51,769][06674] InferenceWorker_p0-w0: min num requests: 10 [2024-06-27 13:41:51,814][06674] Starting all processes... [2024-06-27 13:41:51,815][06674] Starting process learner_proc0 [2024-06-27 13:41:52,090][06674] Starting all processes... [2024-06-27 13:41:52,093][06674] Starting process inference_proc0-0 [2024-06-27 13:41:52,093][06674] Starting process rollout_proc3 [2024-06-27 13:41:52,093][06674] Starting process rollout_proc1 [2024-06-27 13:41:52,093][06674] Starting process rollout_proc2 [2024-06-27 13:41:52,093][06674] Starting process rollout_proc0 [2024-06-27 13:41:52,099][06674] Starting process rollout_proc16 [2024-06-27 13:41:52,094][06674] Starting process rollout_proc5 [2024-06-27 13:41:52,094][06674] Starting process rollout_proc6 [2024-06-27 13:41:52,094][06674] Starting process rollout_proc7 [2024-06-27 13:41:52,095][06674] Starting process rollout_proc8 [2024-06-27 13:41:52,097][06674] Starting process rollout_proc9 [2024-06-27 13:41:52,099][06674] Starting process rollout_proc10 [2024-06-27 13:41:52,099][06674] Starting process rollout_proc11 [2024-06-27 13:41:52,099][06674] Starting process rollout_proc12 [2024-06-27 13:41:52,099][06674] Starting process rollout_proc13 [2024-06-27 13:41:52,099][06674] Starting process rollout_proc14 [2024-06-27 13:41:52,099][06674] Starting process rollout_proc15 [2024-06-27 13:41:52,093][06674] Starting process rollout_proc4 [2024-06-27 13:41:52,100][06674] Starting process rollout_proc17 [2024-06-27 13:41:52,100][06674] Starting process rollout_proc18 [2024-06-27 13:41:52,101][06674] Starting process rollout_proc19 [2024-06-27 13:41:52,103][06674] Starting process rollout_proc20 [2024-06-27 13:41:52,103][06674] Starting process rollout_proc21 [2024-06-27 13:41:52,106][06674] Starting process rollout_proc22 [2024-06-27 13:41:52,107][06674] Starting process rollout_proc23 [2024-06-27 13:41:52,110][06674] Starting process rollout_proc24 [2024-06-27 13:41:52,111][06674] Starting process rollout_proc25 [2024-06-27 13:41:52,111][06674] Starting process rollout_proc26 [2024-06-27 13:41:52,113][06674] Starting process rollout_proc27 [2024-06-27 13:41:52,114][06674] Starting process rollout_proc28 [2024-06-27 13:41:52,114][06674] Starting process rollout_proc29 [2024-06-27 13:41:52,116][06674] Starting process rollout_proc30 [2024-06-27 13:41:52,117][06674] Starting process rollout_proc31 [2024-06-27 13:41:54,240][06934] Worker 25 uses CPU cores [25] [2024-06-27 13:41:54,260][06915] Worker 7 uses CPU cores [7] [2024-06-27 13:41:54,270][06887] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2024-06-27 13:41:54,271][06887] Set environment var CUDA_VISIBLE_DEVICES to '0' (GPU indices [0]) for learning process 0 [2024-06-27 13:41:54,280][06927] Worker 19 uses CPU cores [19] [2024-06-27 13:41:54,280][06887] Num visible devices: 1 [2024-06-27 13:41:54,292][06920] Worker 14 uses CPU cores [14] [2024-06-27 13:41:54,308][06887] Setting fixed seed 0 [2024-06-27 13:41:54,309][06887] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2024-06-27 13:41:54,309][06887] Initializing actor-critic model on device cuda:0 [2024-06-27 13:41:54,320][06917] Worker 10 uses CPU cores [10] [2024-06-27 13:41:54,336][06910] Worker 2 uses CPU cores [2] [2024-06-27 13:41:54,344][06911] Worker 16 uses CPU cores [16] [2024-06-27 13:41:54,360][06918] Worker 12 uses CPU cores [12] [2024-06-27 13:41:54,368][06935] Worker 27 uses CPU cores [27] [2024-06-27 13:41:54,374][06909] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2024-06-27 13:41:54,374][06909] Set environment var CUDA_VISIBLE_DEVICES to '0' (GPU indices [0]) for inference process 0 [2024-06-27 13:41:54,382][06909] Num visible devices: 1 [2024-06-27 13:41:54,384][06939] Worker 31 uses CPU cores [31] [2024-06-27 13:41:54,384][06937] Worker 29 uses CPU cores [29] [2024-06-27 13:41:54,400][06938] Worker 28 uses CPU cores [28] [2024-06-27 13:41:54,404][06929] Worker 21 uses CPU cores [21] [2024-06-27 13:41:54,432][06925] Worker 17 uses CPU cores [17] [2024-06-27 13:41:54,447][06924] Worker 15 uses CPU cores [15] [2024-06-27 13:41:54,484][06912] Worker 0 uses CPU cores [0] [2024-06-27 13:41:54,514][06913] Worker 5 uses CPU cores [5] [2024-06-27 13:41:54,519][06933] Worker 26 uses CPU cores [26] [2024-06-27 13:41:54,522][06926] Worker 18 uses CPU cores [18] [2024-06-27 13:41:54,532][06916] Worker 8 uses CPU cores [8] [2024-06-27 13:41:54,550][06919] Worker 13 uses CPU cores [13] [2024-06-27 13:41:54,556][06930] Worker 22 uses CPU cores [22] [2024-06-27 13:41:54,557][06907] Worker 3 uses CPU cores [3] [2024-06-27 13:41:54,567][06921] Worker 9 uses CPU cores [9] [2024-06-27 13:41:54,647][06928] Worker 20 uses CPU cores [20] [2024-06-27 13:41:54,647][06908] Worker 1 uses CPU cores [1] [2024-06-27 13:41:54,664][06914] Worker 6 uses CPU cores [6] [2024-06-27 13:41:54,692][06923] Worker 4 uses CPU cores [4] [2024-06-27 13:41:54,697][06936] Worker 30 uses CPU cores [30] [2024-06-27 13:41:54,700][06922] Worker 11 uses CPU cores [11] [2024-06-27 13:41:54,712][06932] Worker 24 uses CPU cores [24] [2024-06-27 13:41:54,755][06931] Worker 23 uses CPU cores [23] [2024-06-27 13:41:55,160][06887] RunningMeanStd input shape: (11, 11) [2024-06-27 13:41:55,160][06887] RunningMeanStd input shape: (11, 11) [2024-06-27 13:41:55,160][06887] RunningMeanStd input shape: (11, 11) [2024-06-27 13:41:55,160][06887] RunningMeanStd input shape: (11, 11) [2024-06-27 13:41:55,160][06887] RunningMeanStd input shape: (11, 11) [2024-06-27 13:41:55,160][06887] RunningMeanStd input shape: (11, 11) [2024-06-27 13:41:55,161][06887] RunningMeanStd input shape: (11, 11) [2024-06-27 13:41:55,161][06887] RunningMeanStd input shape: (11, 11) [2024-06-27 13:41:55,161][06887] RunningMeanStd input shape: (11, 11) [2024-06-27 13:41:55,161][06887] RunningMeanStd input shape: (11, 11) [2024-06-27 13:41:55,161][06887] RunningMeanStd input shape: (11, 11) [2024-06-27 13:41:55,161][06887] RunningMeanStd input shape: (11, 11) [2024-06-27 13:41:55,161][06887] RunningMeanStd input shape: (11, 11) [2024-06-27 13:41:55,161][06887] RunningMeanStd input shape: (11, 11) [2024-06-27 13:41:55,161][06887] RunningMeanStd input shape: (11, 11) [2024-06-27 13:41:55,161][06887] RunningMeanStd input shape: (11, 11) [2024-06-27 13:41:55,161][06887] RunningMeanStd input shape: (11, 11) [2024-06-27 13:41:55,161][06887] RunningMeanStd input shape: (11, 11) [2024-06-27 13:41:55,161][06887] RunningMeanStd input shape: (11, 11) [2024-06-27 13:41:55,161][06887] RunningMeanStd input shape: (11, 11) [2024-06-27 13:41:55,161][06887] RunningMeanStd input shape: (11, 11) [2024-06-27 13:41:55,161][06887] RunningMeanStd input shape: (11, 11) [2024-06-27 13:41:55,161][06887] RunningMeanStd input shape: (11, 11) [2024-06-27 13:41:55,164][06887] RunningMeanStd input shape: (1,) [2024-06-27 13:41:55,164][06887] RunningMeanStd input shape: (1,) [2024-06-27 13:41:55,165][06887] RunningMeanStd input shape: (1,) [2024-06-27 13:41:55,165][06887] RunningMeanStd input shape: (1,) [2024-06-27 13:41:55,165][06887] RunningMeanStd input shape: (11, 11) [2024-06-27 13:41:55,199][06887] RunningMeanStd input shape: (1,) [2024-06-27 13:41:55,207][06887] Created Actor Critic model with architecture: [2024-06-27 13:41:55,207][06887] SampleFactoryAgentWrapper( (obs_normalizer): ObservationNormalizer() (returns_normalizer): RecursiveScriptModule(original_name=RunningMeanStdInPlace) (agent): MettaAgent( (_encoder): MultiFeatureSetEncoder( (feature_set_encoders): ModuleDict( (grid_obs): FeatureSetEncoder( (_normalizer): FeatureListNormalizer( (_norms_dict): ModuleDict( (agent): RunningMeanStdInPlace() (altar): RunningMeanStdInPlace() (clock): RunningMeanStdInPlace() (converter): RunningMeanStdInPlace() (generator): RunningMeanStdInPlace() (wall): RunningMeanStdInPlace() (agent:dir): RunningMeanStdInPlace() (agent:energy): RunningMeanStdInPlace() (agent:frozen): RunningMeanStdInPlace() (agent:hp): RunningMeanStdInPlace() (agent:id): RunningMeanStdInPlace() (agent:inv_r1): RunningMeanStdInPlace() (agent:inv_r2): RunningMeanStdInPlace() (agent:inv_r3): RunningMeanStdInPlace() (agent:shield): RunningMeanStdInPlace() (altar:hp): RunningMeanStdInPlace() (altar:state): RunningMeanStdInPlace() (converter:hp): RunningMeanStdInPlace() (converter:state): RunningMeanStdInPlace() (generator:amount): RunningMeanStdInPlace() (generator:hp): RunningMeanStdInPlace() (generator:state): RunningMeanStdInPlace() (wall:hp): RunningMeanStdInPlace() ) ) (embedding_net): Sequential( (0): Linear(in_features=125, out_features=512, bias=True) (1): ELU(alpha=1.0) (2): Linear(in_features=512, out_features=512, bias=True) (3): ELU(alpha=1.0) (4): Linear(in_features=512, out_features=512, bias=True) (5): ELU(alpha=1.0) (6): Linear(in_features=512, out_features=512, bias=True) (7): ELU(alpha=1.0) ) ) (global_vars): FeatureSetEncoder( (_normalizer): FeatureListNormalizer( (_norms_dict): ModuleDict( (_steps): RunningMeanStdInPlace() ) ) (embedding_net): Sequential( (0): Linear(in_features=5, out_features=8, bias=True) (1): ELU(alpha=1.0) (2): Linear(in_features=8, out_features=8, bias=True) (3): ELU(alpha=1.0) ) ) (last_action): FeatureSetEncoder( (_normalizer): FeatureListNormalizer( (_norms_dict): ModuleDict( (last_action_id): RunningMeanStdInPlace() (last_action_val): RunningMeanStdInPlace() ) ) (embedding_net): Sequential( (0): Linear(in_features=5, out_features=8, bias=True) (1): ELU(alpha=1.0) (2): Linear(in_features=8, out_features=8, bias=True) (3): ELU(alpha=1.0) ) ) (last_reward): FeatureSetEncoder( (_normalizer): FeatureListNormalizer( (_norms_dict): ModuleDict( (last_reward): RunningMeanStdInPlace() ) ) (embedding_net): Sequential( (0): Linear(in_features=5, out_features=8, bias=True) (1): ELU(alpha=1.0) (2): Linear(in_features=8, out_features=8, bias=True) (3): ELU(alpha=1.0) ) ) (kinship): FeatureSetEncoder( (_normalizer): FeatureListNormalizer( (_norms_dict): ModuleDict( (kinship): RunningMeanStdInPlace() ) ) (embedding_net): Sequential( (0): Linear(in_features=125, out_features=8, bias=True) (1): ELU(alpha=1.0) (2): Linear(in_features=8, out_features=8, bias=True) (3): ELU(alpha=1.0) ) ) ) (merged_encoder): Sequential( (0): Linear(in_features=544, out_features=512, bias=True) (1): ELU(alpha=1.0) (2): Linear(in_features=512, out_features=512, bias=True) (3): ELU(alpha=1.0) (4): Linear(in_features=512, out_features=512, bias=True) (5): ELU(alpha=1.0) ) ) (_decoder): Decoder( (mlp): Identity() ) (_critic_linear): Linear(in_features=512, out_features=1, bias=True) ) (_core): ModelCoreRNN( (core): GRU(512, 512) ) (_action_parameterization): ActionParameterizationDefault( (distribution_linear): Linear(in_features=512, out_features=16, bias=True) ) ) [2024-06-27 13:41:55,280][06887] Using optimizer [2024-06-27 13:41:55,464][06887] Loading state from checkpoint ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000005932_97189888.pth... [2024-06-27 13:41:55,478][06887] Loading model from checkpoint [2024-06-27 13:41:55,480][06887] Loaded experiment state at self.train_step=5932, self.env_steps=97189888 [2024-06-27 13:41:55,480][06887] Initialized policy 0 weights for model version 5932 [2024-06-27 13:41:55,481][06887] LearnerWorker_p0 finished initialization! [2024-06-27 13:41:55,481][06887] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2024-06-27 13:41:56,231][06909] RunningMeanStd input shape: (11, 11) [2024-06-27 13:41:56,231][06909] RunningMeanStd input shape: (11, 11) [2024-06-27 13:41:56,231][06909] RunningMeanStd input shape: (11, 11) [2024-06-27 13:41:56,231][06909] RunningMeanStd input shape: (11, 11) [2024-06-27 13:41:56,231][06909] RunningMeanStd input shape: (11, 11) [2024-06-27 13:41:56,231][06909] RunningMeanStd input shape: (11, 11) [2024-06-27 13:41:56,231][06909] RunningMeanStd input shape: (11, 11) [2024-06-27 13:41:56,232][06909] RunningMeanStd input shape: (11, 11) [2024-06-27 13:41:56,232][06909] RunningMeanStd input shape: (11, 11) [2024-06-27 13:41:56,232][06909] RunningMeanStd input shape: (11, 11) [2024-06-27 13:41:56,232][06909] RunningMeanStd input shape: (11, 11) [2024-06-27 13:41:56,232][06909] RunningMeanStd input shape: (11, 11) [2024-06-27 13:41:56,232][06909] RunningMeanStd input shape: (11, 11) [2024-06-27 13:41:56,232][06909] RunningMeanStd input shape: (11, 11) [2024-06-27 13:41:56,232][06909] RunningMeanStd input shape: (11, 11) [2024-06-27 13:41:56,232][06909] RunningMeanStd input shape: (11, 11) [2024-06-27 13:41:56,232][06909] RunningMeanStd input shape: (11, 11) [2024-06-27 13:41:56,232][06909] RunningMeanStd input shape: (11, 11) [2024-06-27 13:41:56,232][06909] RunningMeanStd input shape: (11, 11) [2024-06-27 13:41:56,232][06909] RunningMeanStd input shape: (11, 11) [2024-06-27 13:41:56,232][06909] RunningMeanStd input shape: (11, 11) [2024-06-27 13:41:56,232][06909] RunningMeanStd input shape: (11, 11) [2024-06-27 13:41:56,232][06909] RunningMeanStd input shape: (11, 11) [2024-06-27 13:41:56,235][06909] RunningMeanStd input shape: (1,) [2024-06-27 13:41:56,236][06909] RunningMeanStd input shape: (1,) [2024-06-27 13:41:56,236][06909] RunningMeanStd input shape: (1,) [2024-06-27 13:41:56,236][06909] RunningMeanStd input shape: (1,) [2024-06-27 13:41:56,236][06909] RunningMeanStd input shape: (11, 11) [2024-06-27 13:41:56,272][06909] RunningMeanStd input shape: (1,) [2024-06-27 13:41:56,298][06674] Inference worker 0-0 is ready! [2024-06-27 13:41:56,299][06674] All inference workers are ready! Signal rollout workers to start! [2024-06-27 13:41:58,850][06674] Fps is (10 sec: nan, 60 sec: nan, 300 sec: nan). Total num frames: 97189888. Throughput: 0: nan. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0) [2024-06-27 13:41:58,973][06911] Decorrelating experience for 0 frames... [2024-06-27 13:41:58,979][06929] Decorrelating experience for 0 frames... [2024-06-27 13:41:58,979][06934] Decorrelating experience for 0 frames... [2024-06-27 13:41:59,009][06926] Decorrelating experience for 0 frames... [2024-06-27 13:41:59,010][06931] Decorrelating experience for 0 frames... [2024-06-27 13:41:59,029][06936] Decorrelating experience for 0 frames... [2024-06-27 13:41:59,044][06937] Decorrelating experience for 0 frames... [2024-06-27 13:41:59,052][06928] Decorrelating experience for 0 frames... [2024-06-27 13:41:59,055][06933] Decorrelating experience for 0 frames... [2024-06-27 13:41:59,056][06939] Decorrelating experience for 0 frames... [2024-06-27 13:41:59,060][06930] Decorrelating experience for 0 frames... [2024-06-27 13:41:59,069][06915] Decorrelating experience for 0 frames... [2024-06-27 13:41:59,073][06908] Decorrelating experience for 0 frames... [2024-06-27 13:41:59,074][06907] Decorrelating experience for 0 frames... [2024-06-27 13:41:59,077][06913] Decorrelating experience for 0 frames... [2024-06-27 13:41:59,079][06910] Decorrelating experience for 0 frames... [2024-06-27 13:41:59,081][06932] Decorrelating experience for 0 frames... [2024-06-27 13:41:59,086][06925] Decorrelating experience for 0 frames... [2024-06-27 13:41:59,086][06912] Decorrelating experience for 0 frames... [2024-06-27 13:41:59,086][06920] Decorrelating experience for 0 frames... [2024-06-27 13:41:59,087][06914] Decorrelating experience for 0 frames... [2024-06-27 13:41:59,090][06924] Decorrelating experience for 0 frames... [2024-06-27 13:41:59,090][06923] Decorrelating experience for 0 frames... [2024-06-27 13:41:59,090][06921] Decorrelating experience for 0 frames... [2024-06-27 13:41:59,090][06922] Decorrelating experience for 0 frames... [2024-06-27 13:41:59,090][06938] Decorrelating experience for 0 frames... [2024-06-27 13:41:59,090][06916] Decorrelating experience for 0 frames... [2024-06-27 13:41:59,090][06919] Decorrelating experience for 0 frames... [2024-06-27 13:41:59,091][06917] Decorrelating experience for 0 frames... [2024-06-27 13:41:59,092][06918] Decorrelating experience for 0 frames... [2024-06-27 13:41:59,112][06927] Decorrelating experience for 0 frames... [2024-06-27 13:41:59,117][06935] Decorrelating experience for 0 frames... [2024-06-27 13:42:00,052][06929] Decorrelating experience for 256 frames... [2024-06-27 13:42:00,056][06911] Decorrelating experience for 256 frames... [2024-06-27 13:42:00,066][06934] Decorrelating experience for 256 frames... [2024-06-27 13:42:00,091][06926] Decorrelating experience for 256 frames... [2024-06-27 13:42:00,099][06931] Decorrelating experience for 256 frames... [2024-06-27 13:42:00,117][06936] Decorrelating experience for 256 frames... [2024-06-27 13:42:00,141][06937] Decorrelating experience for 256 frames... [2024-06-27 13:42:00,157][06928] Decorrelating experience for 256 frames... [2024-06-27 13:42:00,162][06939] Decorrelating experience for 256 frames... [2024-06-27 13:42:00,164][06933] Decorrelating experience for 256 frames... [2024-06-27 13:42:00,164][06930] Decorrelating experience for 256 frames... [2024-06-27 13:42:00,198][06915] Decorrelating experience for 256 frames... [2024-06-27 13:42:00,209][06907] Decorrelating experience for 256 frames... [2024-06-27 13:42:00,213][06932] Decorrelating experience for 256 frames... [2024-06-27 13:42:00,218][06908] Decorrelating experience for 256 frames... [2024-06-27 13:42:00,219][06913] Decorrelating experience for 256 frames... [2024-06-27 13:42:00,220][06910] Decorrelating experience for 256 frames... [2024-06-27 13:42:00,232][06925] Decorrelating experience for 256 frames... [2024-06-27 13:42:00,239][06914] Decorrelating experience for 256 frames... [2024-06-27 13:42:00,242][06920] Decorrelating experience for 256 frames... [2024-06-27 13:42:00,249][06923] Decorrelating experience for 256 frames... [2024-06-27 13:42:00,249][06922] Decorrelating experience for 256 frames... [2024-06-27 13:42:00,250][06912] Decorrelating experience for 256 frames... [2024-06-27 13:42:00,254][06924] Decorrelating experience for 256 frames... [2024-06-27 13:42:00,254][06938] Decorrelating experience for 256 frames... [2024-06-27 13:42:00,254][06921] Decorrelating experience for 256 frames... [2024-06-27 13:42:00,257][06919] Decorrelating experience for 256 frames... [2024-06-27 13:42:00,258][06916] Decorrelating experience for 256 frames... [2024-06-27 13:42:00,258][06917] Decorrelating experience for 256 frames... [2024-06-27 13:42:00,259][06918] Decorrelating experience for 256 frames... [2024-06-27 13:42:00,290][06927] Decorrelating experience for 256 frames... [2024-06-27 13:42:00,296][06935] Decorrelating experience for 256 frames... [2024-06-27 13:42:03,850][06674] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 97189888. Throughput: 0: 8248.0. Samples: 41240. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0) [2024-06-27 13:42:07,140][06919] Worker 13, sleep for 60.938 sec to decorrelate experience collection [2024-06-27 13:42:07,159][06929] Worker 21, sleep for 98.438 sec to decorrelate experience collection [2024-06-27 13:42:07,165][06908] Worker 1, sleep for 4.688 sec to decorrelate experience collection [2024-06-27 13:42:07,166][06936] Worker 30, sleep for 140.625 sec to decorrelate experience collection [2024-06-27 13:42:07,166][06934] Worker 25, sleep for 117.188 sec to decorrelate experience collection [2024-06-27 13:42:07,185][06918] Worker 12, sleep for 56.250 sec to decorrelate experience collection [2024-06-27 13:42:07,186][06920] Worker 14, sleep for 65.625 sec to decorrelate experience collection [2024-06-27 13:42:07,195][06933] Worker 26, sleep for 121.875 sec to decorrelate experience collection [2024-06-27 13:42:07,205][06937] Worker 29, sleep for 135.938 sec to decorrelate experience collection [2024-06-27 13:42:07,217][06921] Worker 9, sleep for 42.188 sec to decorrelate experience collection [2024-06-27 13:42:07,218][06930] Worker 22, sleep for 103.125 sec to decorrelate experience collection [2024-06-27 13:42:07,218][06925] Worker 17, sleep for 79.688 sec to decorrelate experience collection [2024-06-27 13:42:07,253][06917] Worker 10, sleep for 46.875 sec to decorrelate experience collection [2024-06-27 13:42:07,267][06910] Worker 2, sleep for 9.375 sec to decorrelate experience collection [2024-06-27 13:42:07,268][06907] Worker 3, sleep for 14.062 sec to decorrelate experience collection [2024-06-27 13:42:07,279][06887] Signal inference workers to stop experience collection... [2024-06-27 13:42:07,302][06913] Worker 5, sleep for 23.438 sec to decorrelate experience collection [2024-06-27 13:42:07,310][06909] InferenceWorker_p0-w0: stopping experience collection [2024-06-27 13:42:07,335][06935] Worker 27, sleep for 126.562 sec to decorrelate experience collection [2024-06-27 13:42:07,339][06915] Worker 7, sleep for 32.812 sec to decorrelate experience collection [2024-06-27 13:42:07,801][06887] Signal inference workers to resume experience collection... [2024-06-27 13:42:07,801][06909] InferenceWorker_p0-w0: resuming experience collection [2024-06-27 13:42:07,825][06927] Worker 19, sleep for 89.062 sec to decorrelate experience collection [2024-06-27 13:42:07,827][06923] Worker 4, sleep for 18.750 sec to decorrelate experience collection [2024-06-27 13:42:08,302][06924] Worker 15, sleep for 70.312 sec to decorrelate experience collection [2024-06-27 13:42:08,323][06914] Worker 6, sleep for 28.125 sec to decorrelate experience collection [2024-06-27 13:42:08,325][06928] Worker 20, sleep for 93.750 sec to decorrelate experience collection [2024-06-27 13:42:08,349][06916] Worker 8, sleep for 37.500 sec to decorrelate experience collection [2024-06-27 13:42:08,410][06922] Worker 11, sleep for 51.562 sec to decorrelate experience collection [2024-06-27 13:42:08,412][06931] Worker 23, sleep for 107.812 sec to decorrelate experience collection [2024-06-27 13:42:08,413][06911] Worker 16, sleep for 75.000 sec to decorrelate experience collection [2024-06-27 13:42:08,413][06938] Worker 28, sleep for 131.250 sec to decorrelate experience collection [2024-06-27 13:42:08,413][06932] Worker 24, sleep for 112.500 sec to decorrelate experience collection [2024-06-27 13:42:08,418][06939] Worker 31, sleep for 145.312 sec to decorrelate experience collection [2024-06-27 13:42:08,468][06926] Worker 18, sleep for 84.375 sec to decorrelate experience collection [2024-06-27 13:42:08,850][06674] Fps is (10 sec: 13107.2, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 97320960. Throughput: 0: 32730.0. Samples: 327300. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2024-06-27 13:42:08,850][06674] Avg episode reward: [(0, '0.001')] [2024-06-27 13:42:08,985][06909] Updated weights for policy 0, policy_version 5942 (0.0014) [2024-06-27 13:42:11,765][06674] Heartbeat connected on Batcher_0 [2024-06-27 13:42:11,766][06674] Heartbeat connected on LearnerWorker_p0 [2024-06-27 13:42:11,777][06674] Heartbeat connected on RolloutWorker_w0 [2024-06-27 13:42:11,834][06674] Heartbeat connected on InferenceWorker_p0-w0 [2024-06-27 13:42:11,876][06908] Worker 1 awakens! [2024-06-27 13:42:11,883][06674] Heartbeat connected on RolloutWorker_w1 [2024-06-27 13:42:13,850][06674] Fps is (10 sec: 16384.0, 60 sec: 10922.6, 300 sec: 10922.6). Total num frames: 97353728. Throughput: 0: 22041.3. Samples: 330620. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2024-06-27 13:42:13,850][06674] Avg episode reward: [(0, '0.001')] [2024-06-27 13:42:16,688][06910] Worker 2 awakens! [2024-06-27 13:42:16,693][06674] Heartbeat connected on RolloutWorker_w2 [2024-06-27 13:42:18,850][06674] Fps is (10 sec: 4915.2, 60 sec: 9011.2, 300 sec: 9011.2). Total num frames: 97370112. Throughput: 0: 17269.0. Samples: 345380. Policy #0 lag: (min: 0.0, avg: 9.3, max: 10.0) [2024-06-27 13:42:18,850][06674] Avg episode reward: [(0, '0.001')] [2024-06-27 13:42:21,400][06907] Worker 3 awakens! [2024-06-27 13:42:21,404][06674] Heartbeat connected on RolloutWorker_w3 [2024-06-27 13:42:23,850][06674] Fps is (10 sec: 3276.8, 60 sec: 7864.3, 300 sec: 7864.3). Total num frames: 97386496. Throughput: 0: 14752.7. Samples: 368820. Policy #0 lag: (min: 0.0, avg: 9.3, max: 10.0) [2024-06-27 13:42:23,851][06674] Avg episode reward: [(0, '0.002')] [2024-06-27 13:42:26,668][06923] Worker 4 awakens! [2024-06-27 13:42:26,674][06674] Heartbeat connected on RolloutWorker_w4 [2024-06-27 13:42:28,850][06674] Fps is (10 sec: 6553.6, 60 sec: 8192.0, 300 sec: 8192.0). Total num frames: 97435648. Throughput: 0: 12779.4. Samples: 383380. Policy #0 lag: (min: 0.0, avg: 4.4, max: 12.0) [2024-06-27 13:42:28,850][06674] Avg episode reward: [(0, '0.002')] [2024-06-27 13:42:30,802][06913] Worker 5 awakens! [2024-06-27 13:42:30,808][06674] Heartbeat connected on RolloutWorker_w5 [2024-06-27 13:42:33,577][06909] Updated weights for policy 0, policy_version 5952 (0.0014) [2024-06-27 13:42:33,850][06674] Fps is (10 sec: 13107.5, 60 sec: 9362.3, 300 sec: 9362.3). Total num frames: 97517568. Throughput: 0: 13446.3. Samples: 470620. Policy #0 lag: (min: 0.0, avg: 6.5, max: 17.0) [2024-06-27 13:42:33,850][06674] Avg episode reward: [(0, '0.002')] [2024-06-27 13:42:36,458][06914] Worker 6 awakens! [2024-06-27 13:42:36,463][06674] Heartbeat connected on RolloutWorker_w6 [2024-06-27 13:42:38,850][06674] Fps is (10 sec: 16384.0, 60 sec: 10240.0, 300 sec: 10240.0). Total num frames: 97599488. Throughput: 0: 14254.0. Samples: 570160. Policy #0 lag: (min: 0.0, avg: 8.3, max: 22.0) [2024-06-27 13:42:38,850][06674] Avg episode reward: [(0, '0.004')] [2024-06-27 13:42:40,252][06915] Worker 7 awakens! [2024-06-27 13:42:40,259][06674] Heartbeat connected on RolloutWorker_w7 [2024-06-27 13:42:42,458][06909] Updated weights for policy 0, policy_version 5962 (0.0012) [2024-06-27 13:42:43,850][06674] Fps is (10 sec: 18022.4, 60 sec: 11286.8, 300 sec: 11286.8). Total num frames: 97697792. Throughput: 0: 14062.2. Samples: 632800. Policy #0 lag: (min: 0.0, avg: 1.9, max: 5.0) [2024-06-27 13:42:43,850][06674] Avg episode reward: [(0, '0.004')] [2024-06-27 13:42:45,951][06916] Worker 8 awakens! [2024-06-27 13:42:45,956][06674] Heartbeat connected on RolloutWorker_w8 [2024-06-27 13:42:48,850][06674] Fps is (10 sec: 22937.7, 60 sec: 12779.5, 300 sec: 12779.5). Total num frames: 97828864. Throughput: 0: 16123.1. Samples: 766780. Policy #0 lag: (min: 0.0, avg: 1.9, max: 5.0) [2024-06-27 13:42:48,850][06674] Avg episode reward: [(0, '0.003')] [2024-06-27 13:42:49,504][06921] Worker 9 awakens! [2024-06-27 13:42:49,511][06674] Heartbeat connected on RolloutWorker_w9 [2024-06-27 13:42:49,653][06909] Updated weights for policy 0, policy_version 5972 (0.0012) [2024-06-27 13:42:53,850][06674] Fps is (10 sec: 27852.8, 60 sec: 14298.8, 300 sec: 14298.8). Total num frames: 97976320. Throughput: 0: 13425.3. Samples: 931440. Policy #0 lag: (min: 0.0, avg: 14.1, max: 39.0) [2024-06-27 13:42:53,850][06674] Avg episode reward: [(0, '0.003')] [2024-06-27 13:42:54,229][06917] Worker 10 awakens! [2024-06-27 13:42:54,235][06674] Heartbeat connected on RolloutWorker_w10 [2024-06-27 13:42:55,145][06909] Updated weights for policy 0, policy_version 5982 (0.0015) [2024-06-27 13:42:58,850][06674] Fps is (10 sec: 29491.0, 60 sec: 15564.8, 300 sec: 15564.8). Total num frames: 98123776. Throughput: 0: 15542.2. Samples: 1030020. Policy #0 lag: (min: 0.0, avg: 3.3, max: 7.0) [2024-06-27 13:42:58,850][06674] Avg episode reward: [(0, '0.003')] [2024-06-27 13:42:59,889][06909] Updated weights for policy 0, policy_version 5992 (0.0013) [2024-06-27 13:43:00,074][06922] Worker 11 awakens! [2024-06-27 13:43:00,083][06674] Heartbeat connected on RolloutWorker_w11 [2024-06-27 13:43:03,536][06918] Worker 12 awakens! [2024-06-27 13:43:03,542][06674] Heartbeat connected on RolloutWorker_w12 [2024-06-27 13:43:03,850][06674] Fps is (10 sec: 34406.2, 60 sec: 18841.6, 300 sec: 17392.2). Total num frames: 98320384. Throughput: 0: 19749.8. Samples: 1234120. Policy #0 lag: (min: 0.0, avg: 3.6, max: 8.0) [2024-06-27 13:43:03,850][06674] Avg episode reward: [(0, '0.004')] [2024-06-27 13:43:04,048][06909] Updated weights for policy 0, policy_version 6002 (0.0014) [2024-06-27 13:43:08,176][06919] Worker 13 awakens! [2024-06-27 13:43:08,183][06674] Heartbeat connected on RolloutWorker_w13 [2024-06-27 13:43:08,850][06674] Fps is (10 sec: 36044.6, 60 sec: 19387.7, 300 sec: 18490.5). Total num frames: 98484224. Throughput: 0: 23874.3. Samples: 1443160. Policy #0 lag: (min: 0.0, avg: 4.3, max: 9.0) [2024-06-27 13:43:08,850][06674] Avg episode reward: [(0, '0.003')] [2024-06-27 13:43:09,547][06909] Updated weights for policy 0, policy_version 6012 (0.0016) [2024-06-27 13:43:12,913][06920] Worker 14 awakens! [2024-06-27 13:43:12,919][06674] Heartbeat connected on RolloutWorker_w14 [2024-06-27 13:43:13,722][06909] Updated weights for policy 0, policy_version 6022 (0.0019) [2024-06-27 13:43:13,850][06674] Fps is (10 sec: 34406.3, 60 sec: 21845.3, 300 sec: 19660.8). Total num frames: 98664448. Throughput: 0: 26008.4. Samples: 1553760. Policy #0 lag: (min: 0.0, avg: 30.1, max: 88.0) [2024-06-27 13:43:13,850][06674] Avg episode reward: [(0, '0.003')] [2024-06-27 13:43:18,233][06909] Updated weights for policy 0, policy_version 6032 (0.0020) [2024-06-27 13:43:18,712][06924] Worker 15 awakens! [2024-06-27 13:43:18,721][06674] Heartbeat connected on RolloutWorker_w15 [2024-06-27 13:43:18,850][06674] Fps is (10 sec: 34406.5, 60 sec: 24302.9, 300 sec: 20480.0). Total num frames: 98828288. Throughput: 0: 28924.8. Samples: 1772240. Policy #0 lag: (min: 0.0, avg: 30.1, max: 88.0) [2024-06-27 13:43:18,850][06674] Avg episode reward: [(0, '0.003')] [2024-06-27 13:43:23,437][06909] Updated weights for policy 0, policy_version 6042 (0.0028) [2024-06-27 13:43:23,512][06911] Worker 16 awakens! [2024-06-27 13:43:23,522][06674] Heartbeat connected on RolloutWorker_w16 [2024-06-27 13:43:23,850][06674] Fps is (10 sec: 34406.6, 60 sec: 27033.7, 300 sec: 21395.6). Total num frames: 99008512. Throughput: 0: 31373.3. Samples: 1981960. Policy #0 lag: (min: 0.0, avg: 5.7, max: 12.0) [2024-06-27 13:43:23,850][06674] Avg episode reward: [(0, '0.003')] [2024-06-27 13:43:27,008][06925] Worker 17 awakens! [2024-06-27 13:43:27,019][06674] Heartbeat connected on RolloutWorker_w17 [2024-06-27 13:43:27,732][06909] Updated weights for policy 0, policy_version 6052 (0.0021) [2024-06-27 13:43:28,850][06674] Fps is (10 sec: 37683.2, 60 sec: 29491.2, 300 sec: 22391.5). Total num frames: 99205120. Throughput: 0: 32293.7. Samples: 2086020. Policy #0 lag: (min: 0.0, avg: 5.6, max: 11.0) [2024-06-27 13:43:28,850][06674] Avg episode reward: [(0, '0.003')] [2024-06-27 13:43:32,298][06909] Updated weights for policy 0, policy_version 6062 (0.0023) [2024-06-27 13:43:32,940][06926] Worker 18 awakens! [2024-06-27 13:43:32,951][06674] Heartbeat connected on RolloutWorker_w18 [2024-06-27 13:43:33,850][06674] Fps is (10 sec: 37682.9, 60 sec: 31129.5, 300 sec: 23110.1). Total num frames: 99385344. Throughput: 0: 34094.1. Samples: 2301020. Policy #0 lag: (min: 0.0, avg: 5.8, max: 12.0) [2024-06-27 13:43:33,850][06674] Avg episode reward: [(0, '0.002')] [2024-06-27 13:43:36,657][06909] Updated weights for policy 0, policy_version 6072 (0.0021) [2024-06-27 13:43:36,992][06927] Worker 19 awakens! [2024-06-27 13:43:37,003][06674] Heartbeat connected on RolloutWorker_w19 [2024-06-27 13:43:38,850][06674] Fps is (10 sec: 36044.9, 60 sec: 32768.0, 300 sec: 23756.8). Total num frames: 99565568. Throughput: 0: 35381.7. Samples: 2523620. Policy #0 lag: (min: 0.0, avg: 47.7, max: 140.0) [2024-06-27 13:43:38,850][06674] Avg episode reward: [(0, '0.003')] [2024-06-27 13:43:40,851][06909] Updated weights for policy 0, policy_version 6082 (0.0022) [2024-06-27 13:43:42,176][06928] Worker 20 awakens! [2024-06-27 13:43:42,187][06674] Heartbeat connected on RolloutWorker_w20 [2024-06-27 13:43:43,850][06674] Fps is (10 sec: 34406.9, 60 sec: 33860.3, 300 sec: 24185.9). Total num frames: 99729408. Throughput: 0: 35733.4. Samples: 2638020. Policy #0 lag: (min: 0.0, avg: 6.2, max: 14.0) [2024-06-27 13:43:43,850][06674] Avg episode reward: [(0, '0.009')] [2024-06-27 13:43:44,939][06909] Updated weights for policy 0, policy_version 6092 (0.0032) [2024-06-27 13:43:45,696][06929] Worker 21 awakens! [2024-06-27 13:43:45,708][06674] Heartbeat connected on RolloutWorker_w21 [2024-06-27 13:43:48,850][06674] Fps is (10 sec: 37682.9, 60 sec: 35225.5, 300 sec: 25022.8). Total num frames: 99942400. Throughput: 0: 36287.5. Samples: 2867060. Policy #0 lag: (min: 0.0, avg: 6.2, max: 14.0) [2024-06-27 13:43:48,850][06674] Avg episode reward: [(0, '0.008')] [2024-06-27 13:43:48,862][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000006100_99942400.pth... [2024-06-27 13:43:48,914][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000005618_92045312.pth [2024-06-27 13:43:49,711][06909] Updated weights for policy 0, policy_version 6102 (0.0026) [2024-06-27 13:43:50,408][06930] Worker 22 awakens! [2024-06-27 13:43:50,419][06674] Heartbeat connected on RolloutWorker_w22 [2024-06-27 13:43:53,850][06674] Fps is (10 sec: 39321.2, 60 sec: 35771.7, 300 sec: 25502.1). Total num frames: 100122624. Throughput: 0: 36768.5. Samples: 3097740. Policy #0 lag: (min: 0.0, avg: 6.8, max: 14.0) [2024-06-27 13:43:53,850][06674] Avg episode reward: [(0, '0.005')] [2024-06-27 13:43:53,911][06909] Updated weights for policy 0, policy_version 6112 (0.0025) [2024-06-27 13:43:56,328][06931] Worker 23 awakens! [2024-06-27 13:43:56,338][06674] Heartbeat connected on RolloutWorker_w23 [2024-06-27 13:43:57,761][06909] Updated weights for policy 0, policy_version 6122 (0.0035) [2024-06-27 13:43:58,850][06674] Fps is (10 sec: 39321.6, 60 sec: 36864.0, 300 sec: 26214.4). Total num frames: 100335616. Throughput: 0: 37054.6. Samples: 3221220. Policy #0 lag: (min: 0.0, avg: 7.3, max: 15.0) [2024-06-27 13:43:58,850][06674] Avg episode reward: [(0, '0.004')] [2024-06-27 13:44:01,012][06932] Worker 24 awakens! [2024-06-27 13:44:01,024][06674] Heartbeat connected on RolloutWorker_w24 [2024-06-27 13:44:02,069][06909] Updated weights for policy 0, policy_version 6132 (0.0028) [2024-06-27 13:44:03,850][06674] Fps is (10 sec: 40960.0, 60 sec: 36864.0, 300 sec: 26738.7). Total num frames: 100532224. Throughput: 0: 37598.2. Samples: 3464160. Policy #0 lag: (min: 0.0, avg: 7.6, max: 16.0) [2024-06-27 13:44:03,850][06674] Avg episode reward: [(0, '0.004')] [2024-06-27 13:44:04,452][06934] Worker 25 awakens! [2024-06-27 13:44:04,465][06674] Heartbeat connected on RolloutWorker_w25 [2024-06-27 13:44:05,970][06909] Updated weights for policy 0, policy_version 6142 (0.0026) [2024-06-27 13:44:08,850][06674] Fps is (10 sec: 39321.2, 60 sec: 37410.0, 300 sec: 27222.6). Total num frames: 100728832. Throughput: 0: 38259.4. Samples: 3703640. Policy #0 lag: (min: 0.0, avg: 7.9, max: 17.0) [2024-06-27 13:44:08,851][06674] Avg episode reward: [(0, '0.005')] [2024-06-27 13:44:09,168][06933] Worker 26 awakens! [2024-06-27 13:44:09,181][06674] Heartbeat connected on RolloutWorker_w26 [2024-06-27 13:44:09,760][06909] Updated weights for policy 0, policy_version 6152 (0.0037) [2024-06-27 13:44:13,850][06674] Fps is (10 sec: 40959.7, 60 sec: 37956.2, 300 sec: 27792.1). Total num frames: 100941824. Throughput: 0: 38492.8. Samples: 3818200. Policy #0 lag: (min: 0.0, avg: 7.9, max: 17.0) [2024-06-27 13:44:13,850][06674] Avg episode reward: [(0, '0.004')] [2024-06-27 13:44:13,998][06935] Worker 27 awakens! [2024-06-27 13:44:14,012][06674] Heartbeat connected on RolloutWorker_w27 [2024-06-27 13:44:14,190][06909] Updated weights for policy 0, policy_version 6162 (0.0031) [2024-06-27 13:44:17,796][06909] Updated weights for policy 0, policy_version 6172 (0.0023) [2024-06-27 13:44:18,850][06674] Fps is (10 sec: 42599.1, 60 sec: 38775.5, 300 sec: 28320.9). Total num frames: 101154816. Throughput: 0: 39358.2. Samples: 4072140. Policy #0 lag: (min: 0.0, avg: 7.9, max: 17.0) [2024-06-27 13:44:18,850][06674] Avg episode reward: [(0, '0.004')] [2024-06-27 13:44:19,760][06938] Worker 28 awakens! [2024-06-27 13:44:19,771][06674] Heartbeat connected on RolloutWorker_w28 [2024-06-27 13:44:22,157][06909] Updated weights for policy 0, policy_version 6182 (0.0027) [2024-06-27 13:44:23,240][06937] Worker 29 awakens! [2024-06-27 13:44:23,255][06674] Heartbeat connected on RolloutWorker_w29 [2024-06-27 13:44:23,850][06674] Fps is (10 sec: 42598.2, 60 sec: 39321.5, 300 sec: 28813.2). Total num frames: 101367808. Throughput: 0: 39937.7. Samples: 4320820. Policy #0 lag: (min: 0.0, avg: 8.4, max: 18.0) [2024-06-27 13:44:23,851][06674] Avg episode reward: [(0, '0.004')] [2024-06-27 13:44:25,334][06909] Updated weights for policy 0, policy_version 6192 (0.0031) [2024-06-27 13:44:27,889][06936] Worker 30 awakens! [2024-06-27 13:44:27,902][06674] Heartbeat connected on RolloutWorker_w30 [2024-06-27 13:44:28,850][06674] Fps is (10 sec: 40959.6, 60 sec: 39321.5, 300 sec: 29163.5). Total num frames: 101564416. Throughput: 0: 40323.4. Samples: 4452580. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2024-06-27 13:44:28,851][06674] Avg episode reward: [(0, '0.005')] [2024-06-27 13:44:29,665][06909] Updated weights for policy 0, policy_version 6202 (0.0032) [2024-06-27 13:44:33,480][06909] Updated weights for policy 0, policy_version 6212 (0.0033) [2024-06-27 13:44:33,814][06939] Worker 31 awakens! [2024-06-27 13:44:33,830][06674] Heartbeat connected on RolloutWorker_w31 [2024-06-27 13:44:33,850][06674] Fps is (10 sec: 40960.6, 60 sec: 39867.8, 300 sec: 29596.9). Total num frames: 101777408. Throughput: 0: 40885.9. Samples: 4706920. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 13:44:33,850][06674] Avg episode reward: [(0, '0.004')] [2024-06-27 13:44:37,158][06909] Updated weights for policy 0, policy_version 6222 (0.0028) [2024-06-27 13:44:38,850][06674] Fps is (10 sec: 45875.7, 60 sec: 40960.0, 300 sec: 30208.0). Total num frames: 102023168. Throughput: 0: 41568.4. Samples: 4968320. Policy #0 lag: (min: 0.0, avg: 11.1, max: 20.0) [2024-06-27 13:44:38,851][06674] Avg episode reward: [(0, '0.005')] [2024-06-27 13:44:40,824][06909] Updated weights for policy 0, policy_version 6232 (0.0032) [2024-06-27 13:44:43,851][06674] Fps is (10 sec: 45871.6, 60 sec: 41778.6, 300 sec: 30583.3). Total num frames: 102236160. Throughput: 0: 41898.0. Samples: 5106660. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 13:44:43,851][06674] Avg episode reward: [(0, '0.005')] [2024-06-27 13:44:44,428][06909] Updated weights for policy 0, policy_version 6242 (0.0039) [2024-06-27 13:44:48,494][06909] Updated weights for policy 0, policy_version 6252 (0.0034) [2024-06-27 13:44:48,850][06674] Fps is (10 sec: 42598.8, 60 sec: 41779.3, 300 sec: 30936.9). Total num frames: 102449152. Throughput: 0: 42440.9. Samples: 5374000. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 13:44:48,850][06674] Avg episode reward: [(0, '0.006')] [2024-06-27 13:44:51,634][06909] Updated weights for policy 0, policy_version 6262 (0.0038) [2024-06-27 13:44:53,850][06674] Fps is (10 sec: 44240.0, 60 sec: 42598.4, 300 sec: 31363.7). Total num frames: 102678528. Throughput: 0: 42972.6. Samples: 5637400. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 13:44:53,850][06674] Avg episode reward: [(0, '0.006')] [2024-06-27 13:44:55,880][06909] Updated weights for policy 0, policy_version 6272 (0.0040) [2024-06-27 13:44:57,203][06887] Signal inference workers to stop experience collection... (50 times) [2024-06-27 13:44:57,204][06887] Signal inference workers to resume experience collection... (50 times) [2024-06-27 13:44:57,222][06909] InferenceWorker_p0-w0: stopping experience collection (50 times) [2024-06-27 13:44:57,222][06909] InferenceWorker_p0-w0: resuming experience collection (50 times) [2024-06-27 13:44:58,850][06674] Fps is (10 sec: 44236.5, 60 sec: 42598.4, 300 sec: 31675.7). Total num frames: 102891520. Throughput: 0: 43354.3. Samples: 5769140. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 13:44:58,853][06674] Avg episode reward: [(0, '0.006')] [2024-06-27 13:44:59,070][06909] Updated weights for policy 0, policy_version 6282 (0.0033) [2024-06-27 13:45:03,362][06909] Updated weights for policy 0, policy_version 6292 (0.0027) [2024-06-27 13:45:03,850][06674] Fps is (10 sec: 42598.8, 60 sec: 42871.5, 300 sec: 31971.0). Total num frames: 103104512. Throughput: 0: 43625.0. Samples: 6035260. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 13:45:03,850][06674] Avg episode reward: [(0, '0.004')] [2024-06-27 13:45:06,398][06909] Updated weights for policy 0, policy_version 6302 (0.0038) [2024-06-27 13:45:08,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43417.7, 300 sec: 32336.8). Total num frames: 103333888. Throughput: 0: 43902.8. Samples: 6296440. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-27 13:45:08,850][06674] Avg episode reward: [(0, '0.004')] [2024-06-27 13:45:10,690][06909] Updated weights for policy 0, policy_version 6312 (0.0050) [2024-06-27 13:45:13,669][06909] Updated weights for policy 0, policy_version 6322 (0.0035) [2024-06-27 13:45:13,850][06674] Fps is (10 sec: 47512.9, 60 sec: 43963.7, 300 sec: 32768.0). Total num frames: 103579648. Throughput: 0: 44064.5. Samples: 6435480. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 13:45:13,851][06674] Avg episode reward: [(0, '0.004')] [2024-06-27 13:45:18,263][06909] Updated weights for policy 0, policy_version 6332 (0.0032) [2024-06-27 13:45:18,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43417.6, 300 sec: 32849.9). Total num frames: 103759872. Throughput: 0: 44250.2. Samples: 6698180. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 13:45:18,850][06674] Avg episode reward: [(0, '0.005')] [2024-06-27 13:45:21,026][06909] Updated weights for policy 0, policy_version 6342 (0.0028) [2024-06-27 13:45:23,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.8, 300 sec: 33247.5). Total num frames: 104005632. Throughput: 0: 44309.3. Samples: 6962240. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-27 13:45:23,853][06674] Avg episode reward: [(0, '0.004')] [2024-06-27 13:45:25,779][06909] Updated weights for policy 0, policy_version 6352 (0.0030) [2024-06-27 13:45:28,446][06909] Updated weights for policy 0, policy_version 6362 (0.0030) [2024-06-27 13:45:28,850][06674] Fps is (10 sec: 47513.3, 60 sec: 44509.9, 300 sec: 33548.2). Total num frames: 104235008. Throughput: 0: 44231.3. Samples: 7097040. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-27 13:45:28,850][06674] Avg episode reward: [(0, '0.005')] [2024-06-27 13:45:33,085][06909] Updated weights for policy 0, policy_version 6372 (0.0025) [2024-06-27 13:45:33,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43963.7, 300 sec: 33606.2). Total num frames: 104415232. Throughput: 0: 44047.5. Samples: 7356140. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-27 13:45:33,850][06674] Avg episode reward: [(0, '0.005')] [2024-06-27 13:45:36,251][06909] Updated weights for policy 0, policy_version 6382 (0.0041) [2024-06-27 13:45:38,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.8, 300 sec: 33959.6). Total num frames: 104660992. Throughput: 0: 44015.6. Samples: 7618100. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 13:45:38,850][06674] Avg episode reward: [(0, '0.005')] [2024-06-27 13:45:40,485][06909] Updated weights for policy 0, policy_version 6392 (0.0039) [2024-06-27 13:45:43,593][06909] Updated weights for policy 0, policy_version 6402 (0.0038) [2024-06-27 13:45:43,850][06674] Fps is (10 sec: 47514.1, 60 sec: 44237.4, 300 sec: 34224.4). Total num frames: 104890368. Throughput: 0: 44205.4. Samples: 7758380. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 13:45:43,850][06674] Avg episode reward: [(0, '0.004')] [2024-06-27 13:45:47,804][06909] Updated weights for policy 0, policy_version 6412 (0.0040) [2024-06-27 13:45:48,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43690.6, 300 sec: 34263.9). Total num frames: 105070592. Throughput: 0: 44069.6. Samples: 8018400. Policy #0 lag: (min: 0.0, avg: 12.0, max: 21.0) [2024-06-27 13:45:48,850][06674] Avg episode reward: [(0, '0.006')] [2024-06-27 13:45:48,865][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000006413_105070592.pth... [2024-06-27 13:45:48,918][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000005932_97189888.pth [2024-06-27 13:45:51,097][06909] Updated weights for policy 0, policy_version 6422 (0.0031) [2024-06-27 13:45:53,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.8, 300 sec: 34580.7). Total num frames: 105316352. Throughput: 0: 43941.8. Samples: 8273820. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-27 13:45:53,850][06674] Avg episode reward: [(0, '0.005')] [2024-06-27 13:45:55,642][06909] Updated weights for policy 0, policy_version 6432 (0.0043) [2024-06-27 13:45:58,642][06909] Updated weights for policy 0, policy_version 6442 (0.0023) [2024-06-27 13:45:58,850][06674] Fps is (10 sec: 47513.4, 60 sec: 44236.7, 300 sec: 34816.0). Total num frames: 105545728. Throughput: 0: 43847.5. Samples: 8408620. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-27 13:45:58,851][06674] Avg episode reward: [(0, '0.006')] [2024-06-27 13:46:03,099][06909] Updated weights for policy 0, policy_version 6452 (0.0029) [2024-06-27 13:46:03,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.6, 300 sec: 34841.1). Total num frames: 105725952. Throughput: 0: 43684.5. Samples: 8663980. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 13:46:03,850][06674] Avg episode reward: [(0, '0.006')] [2024-06-27 13:46:06,167][06909] Updated weights for policy 0, policy_version 6462 (0.0035) [2024-06-27 13:46:08,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43963.8, 300 sec: 35127.3). Total num frames: 105971712. Throughput: 0: 43677.8. Samples: 8927740. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 13:46:08,850][06674] Avg episode reward: [(0, '0.008')] [2024-06-27 13:46:10,471][06909] Updated weights for policy 0, policy_version 6472 (0.0028) [2024-06-27 13:46:13,811][06909] Updated weights for policy 0, policy_version 6482 (0.0028) [2024-06-27 13:46:13,850][06674] Fps is (10 sec: 47513.8, 60 sec: 43690.8, 300 sec: 35338.1). Total num frames: 106201088. Throughput: 0: 43717.9. Samples: 9064340. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 13:46:13,850][06674] Avg episode reward: [(0, '0.008')] [2024-06-27 13:46:17,929][06909] Updated weights for policy 0, policy_version 6492 (0.0034) [2024-06-27 13:46:18,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43690.7, 300 sec: 35351.6). Total num frames: 106381312. Throughput: 0: 43641.3. Samples: 9320000. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-27 13:46:18,850][06674] Avg episode reward: [(0, '0.008')] [2024-06-27 13:46:21,372][06909] Updated weights for policy 0, policy_version 6502 (0.0029) [2024-06-27 13:46:23,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43690.7, 300 sec: 35612.0). Total num frames: 106627072. Throughput: 0: 43505.3. Samples: 9575840. Policy #0 lag: (min: 2.0, avg: 10.9, max: 23.0) [2024-06-27 13:46:23,850][06674] Avg episode reward: [(0, '0.005')] [2024-06-27 13:46:25,582][06909] Updated weights for policy 0, policy_version 6512 (0.0034) [2024-06-27 13:46:28,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43417.6, 300 sec: 35741.4). Total num frames: 106840064. Throughput: 0: 43383.5. Samples: 9710640. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-27 13:46:28,850][06674] Avg episode reward: [(0, '0.009')] [2024-06-27 13:46:28,955][06909] Updated weights for policy 0, policy_version 6522 (0.0038) [2024-06-27 13:46:32,991][06909] Updated weights for policy 0, policy_version 6532 (0.0029) [2024-06-27 13:46:33,850][06674] Fps is (10 sec: 39321.7, 60 sec: 43417.6, 300 sec: 35746.9). Total num frames: 107020288. Throughput: 0: 43327.2. Samples: 9968120. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-27 13:46:33,850][06674] Avg episode reward: [(0, '0.008')] [2024-06-27 13:46:36,700][06887] Signal inference workers to stop experience collection... (100 times) [2024-06-27 13:46:36,745][06887] Signal inference workers to resume experience collection... (100 times) [2024-06-27 13:46:36,751][06909] InferenceWorker_p0-w0: stopping experience collection (100 times) [2024-06-27 13:46:36,754][06909] Updated weights for policy 0, policy_version 6542 (0.0035) [2024-06-27 13:46:36,782][06909] InferenceWorker_p0-w0: resuming experience collection (100 times) [2024-06-27 13:46:38,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.6, 300 sec: 36044.8). Total num frames: 107282432. Throughput: 0: 43367.9. Samples: 10225380. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-27 13:46:38,850][06674] Avg episode reward: [(0, '0.008')] [2024-06-27 13:46:40,545][06909] Updated weights for policy 0, policy_version 6552 (0.0040) [2024-06-27 13:46:43,850][06674] Fps is (10 sec: 47513.4, 60 sec: 43417.5, 300 sec: 36159.8). Total num frames: 107495424. Throughput: 0: 43457.9. Samples: 10364220. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-27 13:46:43,851][06674] Avg episode reward: [(0, '0.005')] [2024-06-27 13:46:44,156][06909] Updated weights for policy 0, policy_version 6562 (0.0034) [2024-06-27 13:46:47,977][06909] Updated weights for policy 0, policy_version 6572 (0.0028) [2024-06-27 13:46:48,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43690.8, 300 sec: 36214.3). Total num frames: 107692032. Throughput: 0: 43618.7. Samples: 10626820. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 13:46:48,850][06674] Avg episode reward: [(0, '0.007')] [2024-06-27 13:46:51,565][06909] Updated weights for policy 0, policy_version 6582 (0.0044) [2024-06-27 13:46:53,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.6, 300 sec: 36433.6). Total num frames: 107937792. Throughput: 0: 43421.8. Samples: 10881720. Policy #0 lag: (min: 1.0, avg: 11.5, max: 22.0) [2024-06-27 13:46:53,850][06674] Avg episode reward: [(0, '0.007')] [2024-06-27 13:46:55,362][06909] Updated weights for policy 0, policy_version 6592 (0.0030) [2024-06-27 13:46:58,852][06674] Fps is (10 sec: 45865.4, 60 sec: 43416.2, 300 sec: 37155.3). Total num frames: 108150784. Throughput: 0: 43476.2. Samples: 11020860. Policy #0 lag: (min: 1.0, avg: 11.5, max: 22.0) [2024-06-27 13:46:58,852][06674] Avg episode reward: [(0, '0.008')] [2024-06-27 13:46:59,278][06909] Updated weights for policy 0, policy_version 6602 (0.0033) [2024-06-27 13:47:02,808][06909] Updated weights for policy 0, policy_version 6612 (0.0030) [2024-06-27 13:47:03,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.7, 300 sec: 37433.3). Total num frames: 108363776. Throughput: 0: 43632.9. Samples: 11283480. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2024-06-27 13:47:03,850][06674] Avg episode reward: [(0, '0.008')] [2024-06-27 13:47:06,617][06909] Updated weights for policy 0, policy_version 6622 (0.0034) [2024-06-27 13:47:08,850][06674] Fps is (10 sec: 44245.8, 60 sec: 43690.6, 300 sec: 38099.7). Total num frames: 108593152. Throughput: 0: 43786.3. Samples: 11546220. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 13:47:08,850][06674] Avg episode reward: [(0, '0.007')] [2024-06-27 13:47:10,259][06909] Updated weights for policy 0, policy_version 6632 (0.0038) [2024-06-27 13:47:13,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43417.6, 300 sec: 38766.2). Total num frames: 108806144. Throughput: 0: 43852.9. Samples: 11684020. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 13:47:13,850][06674] Avg episode reward: [(0, '0.006')] [2024-06-27 13:47:13,935][06909] Updated weights for policy 0, policy_version 6642 (0.0030) [2024-06-27 13:47:17,883][06909] Updated weights for policy 0, policy_version 6652 (0.0024) [2024-06-27 13:47:18,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.8, 300 sec: 39432.7). Total num frames: 109019136. Throughput: 0: 43889.3. Samples: 11943140. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 13:47:18,850][06674] Avg episode reward: [(0, '0.006')] [2024-06-27 13:47:21,346][06909] Updated weights for policy 0, policy_version 6662 (0.0027) [2024-06-27 13:47:23,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.7, 300 sec: 40043.6). Total num frames: 109248512. Throughput: 0: 43871.1. Samples: 12199580. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 13:47:23,854][06674] Avg episode reward: [(0, '0.005')] [2024-06-27 13:47:25,410][06909] Updated weights for policy 0, policy_version 6672 (0.0036) [2024-06-27 13:47:28,843][06909] Updated weights for policy 0, policy_version 6682 (0.0023) [2024-06-27 13:47:28,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.8, 300 sec: 40543.5). Total num frames: 109477888. Throughput: 0: 43727.2. Samples: 12331940. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 13:47:28,850][06674] Avg episode reward: [(0, '0.005')] [2024-06-27 13:47:32,868][06909] Updated weights for policy 0, policy_version 6692 (0.0034) [2024-06-27 13:47:33,852][06674] Fps is (10 sec: 44228.0, 60 sec: 44508.4, 300 sec: 40987.5). Total num frames: 109690880. Throughput: 0: 43761.9. Samples: 12596200. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 13:47:33,852][06674] Avg episode reward: [(0, '0.009')] [2024-06-27 13:47:36,223][06909] Updated weights for policy 0, policy_version 6702 (0.0038) [2024-06-27 13:47:38,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43417.6, 300 sec: 41321.0). Total num frames: 109887488. Throughput: 0: 43952.4. Samples: 12859580. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 13:47:38,851][06674] Avg episode reward: [(0, '0.010')] [2024-06-27 13:47:38,881][06887] Saving new best policy, reward=0.010! [2024-06-27 13:47:40,206][06909] Updated weights for policy 0, policy_version 6712 (0.0042) [2024-06-27 13:47:43,734][06909] Updated weights for policy 0, policy_version 6722 (0.0041) [2024-06-27 13:47:43,850][06674] Fps is (10 sec: 44245.9, 60 sec: 43963.8, 300 sec: 41709.8). Total num frames: 110133248. Throughput: 0: 43682.0. Samples: 12986460. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 13:47:43,850][06674] Avg episode reward: [(0, '0.010')] [2024-06-27 13:47:47,705][06909] Updated weights for policy 0, policy_version 6732 (0.0039) [2024-06-27 13:47:48,850][06674] Fps is (10 sec: 45875.7, 60 sec: 44236.8, 300 sec: 41931.9). Total num frames: 110346240. Throughput: 0: 43749.0. Samples: 13252180. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 13:47:48,850][06674] Avg episode reward: [(0, '0.006')] [2024-06-27 13:47:48,864][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000006735_110346240.pth... [2024-06-27 13:47:48,918][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000006100_99942400.pth [2024-06-27 13:47:51,282][06909] Updated weights for policy 0, policy_version 6742 (0.0029) [2024-06-27 13:47:53,852][06674] Fps is (10 sec: 40951.5, 60 sec: 43416.1, 300 sec: 42098.3). Total num frames: 110542848. Throughput: 0: 43692.7. Samples: 13512480. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 13:47:53,852][06674] Avg episode reward: [(0, '0.007')] [2024-06-27 13:47:55,439][06909] Updated weights for policy 0, policy_version 6752 (0.0033) [2024-06-27 13:47:58,720][06909] Updated weights for policy 0, policy_version 6762 (0.0033) [2024-06-27 13:47:58,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43965.3, 300 sec: 42265.2). Total num frames: 110788608. Throughput: 0: 43476.5. Samples: 13640460. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 13:47:58,850][06674] Avg episode reward: [(0, '0.007')] [2024-06-27 13:48:03,122][06909] Updated weights for policy 0, policy_version 6772 (0.0033) [2024-06-27 13:48:03,850][06674] Fps is (10 sec: 44245.2, 60 sec: 43690.6, 300 sec: 42376.2). Total num frames: 110985216. Throughput: 0: 43639.4. Samples: 13906920. Policy #0 lag: (min: 1.0, avg: 9.2, max: 19.0) [2024-06-27 13:48:03,851][06674] Avg episode reward: [(0, '0.008')] [2024-06-27 13:48:06,230][06909] Updated weights for policy 0, policy_version 6782 (0.0034) [2024-06-27 13:48:08,850][06674] Fps is (10 sec: 40959.3, 60 sec: 43417.5, 300 sec: 42487.3). Total num frames: 111198208. Throughput: 0: 43716.4. Samples: 14166820. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 13:48:08,851][06674] Avg episode reward: [(0, '0.008')] [2024-06-27 13:48:10,571][06909] Updated weights for policy 0, policy_version 6792 (0.0029) [2024-06-27 13:48:13,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.6, 300 sec: 42709.5). Total num frames: 111427584. Throughput: 0: 43736.7. Samples: 14300100. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 13:48:13,850][06674] Avg episode reward: [(0, '0.005')] [2024-06-27 13:48:14,095][06909] Updated weights for policy 0, policy_version 6802 (0.0034) [2024-06-27 13:48:14,915][06887] Signal inference workers to stop experience collection... (150 times) [2024-06-27 13:48:14,928][06909] InferenceWorker_p0-w0: stopping experience collection (150 times) [2024-06-27 13:48:14,929][06887] Signal inference workers to resume experience collection... (150 times) [2024-06-27 13:48:14,944][06909] InferenceWorker_p0-w0: resuming experience collection (150 times) [2024-06-27 13:48:18,056][06909] Updated weights for policy 0, policy_version 6812 (0.0028) [2024-06-27 13:48:18,856][06674] Fps is (10 sec: 44210.2, 60 sec: 43686.2, 300 sec: 42819.7). Total num frames: 111640576. Throughput: 0: 43709.8. Samples: 14563320. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 13:48:18,857][06674] Avg episode reward: [(0, '0.008')] [2024-06-27 13:48:21,389][06909] Updated weights for policy 0, policy_version 6822 (0.0030) [2024-06-27 13:48:23,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43417.6, 300 sec: 42876.1). Total num frames: 111853568. Throughput: 0: 43601.0. Samples: 14821620. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 13:48:23,850][06674] Avg episode reward: [(0, '0.008')] [2024-06-27 13:48:25,760][06909] Updated weights for policy 0, policy_version 6832 (0.0040) [2024-06-27 13:48:28,850][06674] Fps is (10 sec: 44263.5, 60 sec: 43417.5, 300 sec: 43042.7). Total num frames: 112082944. Throughput: 0: 43688.8. Samples: 14952460. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-27 13:48:28,851][06674] Avg episode reward: [(0, '0.007')] [2024-06-27 13:48:29,116][06909] Updated weights for policy 0, policy_version 6842 (0.0037) [2024-06-27 13:48:33,188][06909] Updated weights for policy 0, policy_version 6852 (0.0039) [2024-06-27 13:48:33,850][06674] Fps is (10 sec: 42596.2, 60 sec: 43145.6, 300 sec: 43098.2). Total num frames: 112279552. Throughput: 0: 43615.0. Samples: 15214880. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-27 13:48:33,851][06674] Avg episode reward: [(0, '0.007')] [2024-06-27 13:48:36,499][06909] Updated weights for policy 0, policy_version 6862 (0.0043) [2024-06-27 13:48:38,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.7, 300 sec: 43320.4). Total num frames: 112508928. Throughput: 0: 43711.3. Samples: 15479400. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-27 13:48:38,850][06674] Avg episode reward: [(0, '0.008')] [2024-06-27 13:48:40,589][06909] Updated weights for policy 0, policy_version 6872 (0.0031) [2024-06-27 13:48:43,850][06674] Fps is (10 sec: 45877.8, 60 sec: 43417.6, 300 sec: 43376.0). Total num frames: 112738304. Throughput: 0: 43848.4. Samples: 15613640. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 13:48:43,850][06674] Avg episode reward: [(0, '0.005')] [2024-06-27 13:48:43,883][06909] Updated weights for policy 0, policy_version 6882 (0.0042) [2024-06-27 13:48:48,050][06909] Updated weights for policy 0, policy_version 6892 (0.0028) [2024-06-27 13:48:48,850][06674] Fps is (10 sec: 40960.6, 60 sec: 42871.5, 300 sec: 43376.0). Total num frames: 112918528. Throughput: 0: 43628.7. Samples: 15870200. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-27 13:48:48,850][06674] Avg episode reward: [(0, '0.005')] [2024-06-27 13:48:51,261][06909] Updated weights for policy 0, policy_version 6902 (0.0027) [2024-06-27 13:48:53,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43419.0, 300 sec: 43431.5). Total num frames: 113147904. Throughput: 0: 43658.7. Samples: 16131460. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-27 13:48:53,850][06674] Avg episode reward: [(0, '0.004')] [2024-06-27 13:48:55,507][06909] Updated weights for policy 0, policy_version 6912 (0.0032) [2024-06-27 13:48:58,764][06909] Updated weights for policy 0, policy_version 6922 (0.0023) [2024-06-27 13:48:58,850][06674] Fps is (10 sec: 49151.2, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 113410048. Throughput: 0: 43647.6. Samples: 16264240. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-27 13:48:58,850][06674] Avg episode reward: [(0, '0.009')] [2024-06-27 13:49:03,439][06909] Updated weights for policy 0, policy_version 6932 (0.0037) [2024-06-27 13:49:03,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43144.7, 300 sec: 43542.6). Total num frames: 113573888. Throughput: 0: 43525.5. Samples: 16521700. Policy #0 lag: (min: 0.0, avg: 11.6, max: 20.0) [2024-06-27 13:49:03,850][06674] Avg episode reward: [(0, '0.006')] [2024-06-27 13:49:06,228][06909] Updated weights for policy 0, policy_version 6942 (0.0041) [2024-06-27 13:49:08,850][06674] Fps is (10 sec: 39321.4, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 113803264. Throughput: 0: 43551.9. Samples: 16781460. Policy #0 lag: (min: 0.0, avg: 11.5, max: 23.0) [2024-06-27 13:49:08,850][06674] Avg episode reward: [(0, '0.007')] [2024-06-27 13:49:10,879][06909] Updated weights for policy 0, policy_version 6952 (0.0034) [2024-06-27 13:49:13,651][06909] Updated weights for policy 0, policy_version 6962 (0.0034) [2024-06-27 13:49:13,850][06674] Fps is (10 sec: 49151.1, 60 sec: 43963.7, 300 sec: 43764.7). Total num frames: 114065408. Throughput: 0: 43635.9. Samples: 16916080. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-27 13:49:13,851][06674] Avg episode reward: [(0, '0.008')] [2024-06-27 13:49:18,325][06909] Updated weights for policy 0, policy_version 6972 (0.0035) [2024-06-27 13:49:18,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43422.1, 300 sec: 43653.7). Total num frames: 114245632. Throughput: 0: 43563.6. Samples: 17175220. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 13:49:18,850][06674] Avg episode reward: [(0, '0.009')] [2024-06-27 13:49:21,278][06909] Updated weights for policy 0, policy_version 6982 (0.0033) [2024-06-27 13:49:23,850][06674] Fps is (10 sec: 40960.6, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 114475008. Throughput: 0: 43488.0. Samples: 17436360. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 13:49:23,850][06674] Avg episode reward: [(0, '0.008')] [2024-06-27 13:49:25,751][06909] Updated weights for policy 0, policy_version 6992 (0.0033) [2024-06-27 13:49:28,808][06909] Updated weights for policy 0, policy_version 7002 (0.0038) [2024-06-27 13:49:28,850][06674] Fps is (10 sec: 47512.9, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 114720768. Throughput: 0: 43421.6. Samples: 17567620. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 13:49:28,850][06674] Avg episode reward: [(0, '0.006')] [2024-06-27 13:49:33,381][06909] Updated weights for policy 0, policy_version 7012 (0.0027) [2024-06-27 13:49:33,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43691.0, 300 sec: 43653.6). Total num frames: 114900992. Throughput: 0: 43576.8. Samples: 17831160. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 13:49:33,851][06674] Avg episode reward: [(0, '0.008')] [2024-06-27 13:49:36,416][06909] Updated weights for policy 0, policy_version 7022 (0.0040) [2024-06-27 13:49:38,850][06674] Fps is (10 sec: 39322.3, 60 sec: 43417.7, 300 sec: 43653.8). Total num frames: 115113984. Throughput: 0: 43648.6. Samples: 18095640. Policy #0 lag: (min: 0.0, avg: 12.1, max: 24.0) [2024-06-27 13:49:38,850][06674] Avg episode reward: [(0, '0.008')] [2024-06-27 13:49:40,957][06909] Updated weights for policy 0, policy_version 7032 (0.0041) [2024-06-27 13:49:43,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 115359744. Throughput: 0: 43609.4. Samples: 18226660. Policy #0 lag: (min: 0.0, avg: 12.1, max: 24.0) [2024-06-27 13:49:43,850][06674] Avg episode reward: [(0, '0.008')] [2024-06-27 13:49:43,929][06909] Updated weights for policy 0, policy_version 7042 (0.0040) [2024-06-27 13:49:48,322][06909] Updated weights for policy 0, policy_version 7052 (0.0042) [2024-06-27 13:49:48,850][06674] Fps is (10 sec: 44235.8, 60 sec: 43963.6, 300 sec: 43653.6). Total num frames: 115556352. Throughput: 0: 43783.8. Samples: 18491980. Policy #0 lag: (min: 1.0, avg: 12.8, max: 21.0) [2024-06-27 13:49:48,851][06674] Avg episode reward: [(0, '0.007')] [2024-06-27 13:49:48,863][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000007053_115556352.pth... [2024-06-27 13:49:48,910][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000006413_105070592.pth [2024-06-27 13:49:51,507][06909] Updated weights for policy 0, policy_version 7062 (0.0029) [2024-06-27 13:49:53,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.8, 300 sec: 43709.2). Total num frames: 115785728. Throughput: 0: 43816.6. Samples: 18753200. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 13:49:53,850][06674] Avg episode reward: [(0, '0.007')] [2024-06-27 13:49:55,734][06909] Updated weights for policy 0, policy_version 7072 (0.0033) [2024-06-27 13:49:58,851][06909] Updated weights for policy 0, policy_version 7082 (0.0034) [2024-06-27 13:49:58,852][06674] Fps is (10 sec: 47504.7, 60 sec: 43689.2, 300 sec: 43820.0). Total num frames: 116031488. Throughput: 0: 43727.5. Samples: 18883900. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 13:49:58,852][06674] Avg episode reward: [(0, '0.006')] [2024-06-27 13:50:03,145][06909] Updated weights for policy 0, policy_version 7092 (0.0029) [2024-06-27 13:50:03,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.8, 300 sec: 43653.7). Total num frames: 116211712. Throughput: 0: 43789.8. Samples: 19145760. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-27 13:50:03,850][06674] Avg episode reward: [(0, '0.009')] [2024-06-27 13:50:06,309][06909] Updated weights for policy 0, policy_version 7102 (0.0042) [2024-06-27 13:50:08,339][06887] Signal inference workers to stop experience collection... (200 times) [2024-06-27 13:50:08,389][06909] InferenceWorker_p0-w0: stopping experience collection (200 times) [2024-06-27 13:50:08,396][06887] Signal inference workers to resume experience collection... (200 times) [2024-06-27 13:50:08,403][06909] InferenceWorker_p0-w0: resuming experience collection (200 times) [2024-06-27 13:50:08,850][06674] Fps is (10 sec: 40968.0, 60 sec: 43963.8, 300 sec: 43598.1). Total num frames: 116441088. Throughput: 0: 43733.3. Samples: 19404360. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-27 13:50:08,851][06674] Avg episode reward: [(0, '0.011')] [2024-06-27 13:50:08,970][06887] Saving new best policy, reward=0.011! [2024-06-27 13:50:10,588][06909] Updated weights for policy 0, policy_version 7112 (0.0032) [2024-06-27 13:50:13,782][06909] Updated weights for policy 0, policy_version 7122 (0.0033) [2024-06-27 13:50:13,850][06674] Fps is (10 sec: 47512.7, 60 sec: 43690.7, 300 sec: 43820.2). Total num frames: 116686848. Throughput: 0: 43787.1. Samples: 19538040. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 13:50:13,850][06674] Avg episode reward: [(0, '0.010')] [2024-06-27 13:50:18,234][06909] Updated weights for policy 0, policy_version 7132 (0.0029) [2024-06-27 13:50:18,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 116867072. Throughput: 0: 43655.5. Samples: 19795660. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 13:50:18,851][06674] Avg episode reward: [(0, '0.011')] [2024-06-27 13:50:21,654][06909] Updated weights for policy 0, policy_version 7142 (0.0030) [2024-06-27 13:50:23,850][06674] Fps is (10 sec: 40960.8, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 117096448. Throughput: 0: 43504.0. Samples: 20053320. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 13:50:23,850][06674] Avg episode reward: [(0, '0.009')] [2024-06-27 13:50:25,789][06909] Updated weights for policy 0, policy_version 7152 (0.0052) [2024-06-27 13:50:28,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43144.5, 300 sec: 43709.2). Total num frames: 117309440. Throughput: 0: 43436.8. Samples: 20181320. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 13:50:28,854][06674] Avg episode reward: [(0, '0.010')] [2024-06-27 13:50:29,259][06909] Updated weights for policy 0, policy_version 7162 (0.0038) [2024-06-27 13:50:33,234][06909] Updated weights for policy 0, policy_version 7172 (0.0039) [2024-06-27 13:50:33,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 117522432. Throughput: 0: 43329.0. Samples: 20441780. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-27 13:50:33,850][06674] Avg episode reward: [(0, '0.009')] [2024-06-27 13:50:36,760][06909] Updated weights for policy 0, policy_version 7182 (0.0042) [2024-06-27 13:50:38,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43417.5, 300 sec: 43487.0). Total num frames: 117719040. Throughput: 0: 43344.8. Samples: 20703720. Policy #0 lag: (min: 1.0, avg: 10.4, max: 21.0) [2024-06-27 13:50:38,850][06674] Avg episode reward: [(0, '0.008')] [2024-06-27 13:50:40,697][06909] Updated weights for policy 0, policy_version 7192 (0.0026) [2024-06-27 13:50:43,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43417.7, 300 sec: 43709.2). Total num frames: 117964800. Throughput: 0: 43214.9. Samples: 20828480. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-27 13:50:43,850][06674] Avg episode reward: [(0, '0.007')] [2024-06-27 13:50:44,443][06909] Updated weights for policy 0, policy_version 7202 (0.0041) [2024-06-27 13:50:48,501][06909] Updated weights for policy 0, policy_version 7212 (0.0045) [2024-06-27 13:50:48,852][06674] Fps is (10 sec: 45865.7, 60 sec: 43689.2, 300 sec: 43597.8). Total num frames: 118177792. Throughput: 0: 43314.8. Samples: 21095020. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-27 13:50:48,853][06674] Avg episode reward: [(0, '0.010')] [2024-06-27 13:50:52,059][06909] Updated weights for policy 0, policy_version 7222 (0.0024) [2024-06-27 13:50:53,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43144.6, 300 sec: 43487.1). Total num frames: 118374400. Throughput: 0: 43195.2. Samples: 21348140. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 13:50:53,850][06674] Avg episode reward: [(0, '0.007')] [2024-06-27 13:50:55,974][06909] Updated weights for policy 0, policy_version 7232 (0.0040) [2024-06-27 13:50:58,850][06674] Fps is (10 sec: 44245.9, 60 sec: 43145.9, 300 sec: 43709.2). Total num frames: 118620160. Throughput: 0: 43050.3. Samples: 21475300. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-27 13:50:58,850][06674] Avg episode reward: [(0, '0.008')] [2024-06-27 13:50:59,468][06909] Updated weights for policy 0, policy_version 7242 (0.0032) [2024-06-27 13:51:03,463][06909] Updated weights for policy 0, policy_version 7252 (0.0031) [2024-06-27 13:51:03,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43417.6, 300 sec: 43542.6). Total num frames: 118816768. Throughput: 0: 43305.0. Samples: 21744380. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 13:51:03,850][06674] Avg episode reward: [(0, '0.009')] [2024-06-27 13:51:06,885][06909] Updated weights for policy 0, policy_version 7262 (0.0042) [2024-06-27 13:51:08,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43144.6, 300 sec: 43487.0). Total num frames: 119029760. Throughput: 0: 43275.1. Samples: 22000700. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 13:51:08,850][06674] Avg episode reward: [(0, '0.007')] [2024-06-27 13:51:11,221][06909] Updated weights for policy 0, policy_version 7272 (0.0046) [2024-06-27 13:51:13,850][06674] Fps is (10 sec: 42598.1, 60 sec: 42598.5, 300 sec: 43598.1). Total num frames: 119242752. Throughput: 0: 43203.6. Samples: 22125480. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 13:51:13,851][06674] Avg episode reward: [(0, '0.006')] [2024-06-27 13:51:14,751][06909] Updated weights for policy 0, policy_version 7282 (0.0039) [2024-06-27 13:51:18,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43144.5, 300 sec: 43487.0). Total num frames: 119455744. Throughput: 0: 43108.8. Samples: 22381680. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 13:51:18,851][06674] Avg episode reward: [(0, '0.010')] [2024-06-27 13:51:18,997][06909] Updated weights for policy 0, policy_version 7292 (0.0040) [2024-06-27 13:51:22,267][06909] Updated weights for policy 0, policy_version 7302 (0.0022) [2024-06-27 13:51:23,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43144.5, 300 sec: 43542.6). Total num frames: 119685120. Throughput: 0: 43084.5. Samples: 22642520. Policy #0 lag: (min: 1.0, avg: 11.5, max: 21.0) [2024-06-27 13:51:23,850][06674] Avg episode reward: [(0, '0.011')] [2024-06-27 13:51:26,453][06909] Updated weights for policy 0, policy_version 7312 (0.0048) [2024-06-27 13:51:28,850][06674] Fps is (10 sec: 44237.6, 60 sec: 43144.7, 300 sec: 43653.7). Total num frames: 119898112. Throughput: 0: 43264.5. Samples: 22775380. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 13:51:28,850][06674] Avg episode reward: [(0, '0.007')] [2024-06-27 13:51:29,574][06909] Updated weights for policy 0, policy_version 7322 (0.0039) [2024-06-27 13:51:33,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43144.6, 300 sec: 43487.0). Total num frames: 120111104. Throughput: 0: 43211.8. Samples: 23039460. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 13:51:33,850][06674] Avg episode reward: [(0, '0.010')] [2024-06-27 13:51:33,882][06909] Updated weights for policy 0, policy_version 7332 (0.0032) [2024-06-27 13:51:37,100][06909] Updated weights for policy 0, policy_version 7342 (0.0031) [2024-06-27 13:51:38,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.8, 300 sec: 43542.6). Total num frames: 120340480. Throughput: 0: 43356.5. Samples: 23299180. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-27 13:51:38,850][06674] Avg episode reward: [(0, '0.011')] [2024-06-27 13:51:41,148][06887] Signal inference workers to stop experience collection... (250 times) [2024-06-27 13:51:41,153][06887] Signal inference workers to resume experience collection... (250 times) [2024-06-27 13:51:41,169][06909] InferenceWorker_p0-w0: stopping experience collection (250 times) [2024-06-27 13:51:41,169][06909] InferenceWorker_p0-w0: resuming experience collection (250 times) [2024-06-27 13:51:41,310][06909] Updated weights for policy 0, policy_version 7352 (0.0037) [2024-06-27 13:51:43,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43417.6, 300 sec: 43653.6). Total num frames: 120569856. Throughput: 0: 43372.5. Samples: 23427060. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2024-06-27 13:51:43,850][06674] Avg episode reward: [(0, '0.008')] [2024-06-27 13:51:44,562][06909] Updated weights for policy 0, policy_version 7362 (0.0030) [2024-06-27 13:51:48,777][06909] Updated weights for policy 0, policy_version 7372 (0.0033) [2024-06-27 13:51:48,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43419.1, 300 sec: 43542.6). Total num frames: 120782848. Throughput: 0: 43237.8. Samples: 23690080. Policy #0 lag: (min: 1.0, avg: 10.6, max: 21.0) [2024-06-27 13:51:48,850][06674] Avg episode reward: [(0, '0.007')] [2024-06-27 13:51:48,874][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000007372_120782848.pth... [2024-06-27 13:51:48,929][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000006735_110346240.pth [2024-06-27 13:51:52,249][06909] Updated weights for policy 0, policy_version 7382 (0.0035) [2024-06-27 13:51:53,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43417.5, 300 sec: 43487.3). Total num frames: 120979456. Throughput: 0: 43291.1. Samples: 23948800. Policy #0 lag: (min: 1.0, avg: 10.6, max: 21.0) [2024-06-27 13:51:53,851][06674] Avg episode reward: [(0, '0.006')] [2024-06-27 13:51:56,409][06909] Updated weights for policy 0, policy_version 7392 (0.0029) [2024-06-27 13:51:58,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 121225216. Throughput: 0: 43386.2. Samples: 24077860. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 13:51:58,850][06674] Avg episode reward: [(0, '0.007')] [2024-06-27 13:51:59,765][06909] Updated weights for policy 0, policy_version 7402 (0.0028) [2024-06-27 13:52:03,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43417.6, 300 sec: 43487.0). Total num frames: 121421824. Throughput: 0: 43576.1. Samples: 24342600. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 13:52:03,850][06674] Avg episode reward: [(0, '0.007')] [2024-06-27 13:52:04,109][06909] Updated weights for policy 0, policy_version 7412 (0.0027) [2024-06-27 13:52:07,257][06909] Updated weights for policy 0, policy_version 7422 (0.0029) [2024-06-27 13:52:08,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43417.6, 300 sec: 43487.0). Total num frames: 121634816. Throughput: 0: 43651.6. Samples: 24606840. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 13:52:08,850][06674] Avg episode reward: [(0, '0.006')] [2024-06-27 13:52:11,493][06909] Updated weights for policy 0, policy_version 7432 (0.0038) [2024-06-27 13:52:13,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.8, 300 sec: 43542.6). Total num frames: 121864192. Throughput: 0: 43632.0. Samples: 24738820. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 13:52:13,850][06674] Avg episode reward: [(0, '0.006')] [2024-06-27 13:52:14,799][06909] Updated weights for policy 0, policy_version 7442 (0.0042) [2024-06-27 13:52:18,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43690.6, 300 sec: 43487.0). Total num frames: 122077184. Throughput: 0: 43625.6. Samples: 25002620. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 13:52:18,851][06674] Avg episode reward: [(0, '0.006')] [2024-06-27 13:52:18,986][06909] Updated weights for policy 0, policy_version 7452 (0.0030) [2024-06-27 13:52:22,431][06909] Updated weights for policy 0, policy_version 7462 (0.0039) [2024-06-27 13:52:23,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43417.7, 300 sec: 43431.5). Total num frames: 122290176. Throughput: 0: 43704.5. Samples: 25265880. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 13:52:23,850][06674] Avg episode reward: [(0, '0.006')] [2024-06-27 13:52:26,470][06909] Updated weights for policy 0, policy_version 7472 (0.0034) [2024-06-27 13:52:28,850][06674] Fps is (10 sec: 44237.6, 60 sec: 43690.6, 300 sec: 43487.3). Total num frames: 122519552. Throughput: 0: 43695.1. Samples: 25393340. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 13:52:28,850][06674] Avg episode reward: [(0, '0.007')] [2024-06-27 13:52:30,011][06909] Updated weights for policy 0, policy_version 7482 (0.0036) [2024-06-27 13:52:33,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43690.7, 300 sec: 43542.6). Total num frames: 122732544. Throughput: 0: 43608.9. Samples: 25652480. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-27 13:52:33,850][06674] Avg episode reward: [(0, '0.007')] [2024-06-27 13:52:34,031][06909] Updated weights for policy 0, policy_version 7492 (0.0036) [2024-06-27 13:52:37,726][06909] Updated weights for policy 0, policy_version 7502 (0.0028) [2024-06-27 13:52:38,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43417.5, 300 sec: 43431.5). Total num frames: 122945536. Throughput: 0: 43554.2. Samples: 25908740. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-27 13:52:38,850][06674] Avg episode reward: [(0, '0.007')] [2024-06-27 13:52:41,494][06909] Updated weights for policy 0, policy_version 7512 (0.0033) [2024-06-27 13:52:43,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43144.5, 300 sec: 43431.5). Total num frames: 123158528. Throughput: 0: 43554.7. Samples: 26037820. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-27 13:52:43,850][06674] Avg episode reward: [(0, '0.011')] [2024-06-27 13:52:45,715][06909] Updated weights for policy 0, policy_version 7522 (0.0042) [2024-06-27 13:52:48,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43417.6, 300 sec: 43542.9). Total num frames: 123387904. Throughput: 0: 43605.3. Samples: 26304840. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 13:52:48,850][06674] Avg episode reward: [(0, '0.015')] [2024-06-27 13:52:48,896][06887] Saving new best policy, reward=0.015! [2024-06-27 13:52:48,907][06909] Updated weights for policy 0, policy_version 7532 (0.0037) [2024-06-27 13:52:50,472][06887] Signal inference workers to stop experience collection... (300 times) [2024-06-27 13:52:50,473][06887] Signal inference workers to resume experience collection... (300 times) [2024-06-27 13:52:50,516][06909] InferenceWorker_p0-w0: stopping experience collection (300 times) [2024-06-27 13:52:50,517][06909] InferenceWorker_p0-w0: resuming experience collection (300 times) [2024-06-27 13:52:53,262][06909] Updated weights for policy 0, policy_version 7542 (0.0039) [2024-06-27 13:52:53,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43690.6, 300 sec: 43431.5). Total num frames: 123600896. Throughput: 0: 43456.8. Samples: 26562400. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-27 13:52:53,850][06674] Avg episode reward: [(0, '0.013')] [2024-06-27 13:52:56,373][06909] Updated weights for policy 0, policy_version 7552 (0.0038) [2024-06-27 13:52:58,850][06674] Fps is (10 sec: 42597.6, 60 sec: 43144.5, 300 sec: 43487.0). Total num frames: 123813888. Throughput: 0: 43313.1. Samples: 26687920. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 13:52:58,851][06674] Avg episode reward: [(0, '0.008')] [2024-06-27 13:53:00,828][06909] Updated weights for policy 0, policy_version 7562 (0.0026) [2024-06-27 13:53:03,761][06909] Updated weights for policy 0, policy_version 7572 (0.0051) [2024-06-27 13:53:03,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43963.7, 300 sec: 43598.1). Total num frames: 124059648. Throughput: 0: 43441.0. Samples: 26957460. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 13:53:03,850][06674] Avg episode reward: [(0, '0.007')] [2024-06-27 13:53:08,210][06909] Updated weights for policy 0, policy_version 7582 (0.0030) [2024-06-27 13:53:08,850][06674] Fps is (10 sec: 44237.7, 60 sec: 43690.7, 300 sec: 43487.1). Total num frames: 124256256. Throughput: 0: 43372.4. Samples: 27217640. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 13:53:08,850][06674] Avg episode reward: [(0, '0.008')] [2024-06-27 13:53:11,796][06909] Updated weights for policy 0, policy_version 7592 (0.0030) [2024-06-27 13:53:13,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43690.5, 300 sec: 43543.4). Total num frames: 124485632. Throughput: 0: 43309.2. Samples: 27342260. Policy #0 lag: (min: 0.0, avg: 11.3, max: 25.0) [2024-06-27 13:53:13,851][06674] Avg episode reward: [(0, '0.006')] [2024-06-27 13:53:15,602][06909] Updated weights for policy 0, policy_version 7602 (0.0037) [2024-06-27 13:53:18,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43690.7, 300 sec: 43542.5). Total num frames: 124698624. Throughput: 0: 43500.3. Samples: 27610000. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-27 13:53:18,851][06674] Avg episode reward: [(0, '0.005')] [2024-06-27 13:53:19,186][06909] Updated weights for policy 0, policy_version 7612 (0.0029) [2024-06-27 13:53:23,107][06909] Updated weights for policy 0, policy_version 7622 (0.0035) [2024-06-27 13:53:23,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43417.5, 300 sec: 43431.5). Total num frames: 124895232. Throughput: 0: 43635.5. Samples: 27872340. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-27 13:53:23,850][06674] Avg episode reward: [(0, '0.006')] [2024-06-27 13:53:26,609][06909] Updated weights for policy 0, policy_version 7632 (0.0031) [2024-06-27 13:53:28,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.5, 300 sec: 43598.2). Total num frames: 125140992. Throughput: 0: 43609.2. Samples: 28000240. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-27 13:53:28,850][06674] Avg episode reward: [(0, '0.009')] [2024-06-27 13:53:30,520][06909] Updated weights for policy 0, policy_version 7642 (0.0039) [2024-06-27 13:53:33,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43690.7, 300 sec: 43542.6). Total num frames: 125353984. Throughput: 0: 43558.7. Samples: 28264980. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 13:53:33,850][06674] Avg episode reward: [(0, '0.010')] [2024-06-27 13:53:34,117][06909] Updated weights for policy 0, policy_version 7652 (0.0036) [2024-06-27 13:53:37,961][06909] Updated weights for policy 0, policy_version 7662 (0.0037) [2024-06-27 13:53:38,852][06674] Fps is (10 sec: 40952.2, 60 sec: 43416.1, 300 sec: 43431.2). Total num frames: 125550592. Throughput: 0: 43420.3. Samples: 28516400. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 13:53:38,852][06674] Avg episode reward: [(0, '0.006')] [2024-06-27 13:53:41,655][06909] Updated weights for policy 0, policy_version 7672 (0.0031) [2024-06-27 13:53:43,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43417.6, 300 sec: 43542.5). Total num frames: 125763584. Throughput: 0: 43454.7. Samples: 28643380. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 13:53:43,850][06674] Avg episode reward: [(0, '0.010')] [2024-06-27 13:53:45,425][06909] Updated weights for policy 0, policy_version 7682 (0.0033) [2024-06-27 13:53:48,850][06674] Fps is (10 sec: 44245.4, 60 sec: 43417.5, 300 sec: 43542.6). Total num frames: 125992960. Throughput: 0: 43327.9. Samples: 28907220. Policy #0 lag: (min: 0.0, avg: 11.3, max: 20.0) [2024-06-27 13:53:48,853][06674] Avg episode reward: [(0, '0.010')] [2024-06-27 13:53:48,861][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000007690_125992960.pth... [2024-06-27 13:53:48,949][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000007053_115556352.pth [2024-06-27 13:53:49,222][06909] Updated weights for policy 0, policy_version 7692 (0.0029) [2024-06-27 13:53:53,368][06909] Updated weights for policy 0, policy_version 7702 (0.0034) [2024-06-27 13:53:53,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43144.6, 300 sec: 43320.4). Total num frames: 126189568. Throughput: 0: 43141.3. Samples: 29159000. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 13:53:53,850][06674] Avg episode reward: [(0, '0.009')] [2024-06-27 13:53:56,903][06909] Updated weights for policy 0, policy_version 7712 (0.0045) [2024-06-27 13:53:58,850][06674] Fps is (10 sec: 40960.8, 60 sec: 43144.7, 300 sec: 43487.0). Total num frames: 126402560. Throughput: 0: 43297.1. Samples: 29290620. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 13:53:58,850][06674] Avg episode reward: [(0, '0.009')] [2024-06-27 13:54:00,899][06909] Updated weights for policy 0, policy_version 7722 (0.0036) [2024-06-27 13:54:03,850][06674] Fps is (10 sec: 44236.9, 60 sec: 42871.5, 300 sec: 43487.0). Total num frames: 126631936. Throughput: 0: 43193.5. Samples: 29553700. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-27 13:54:03,850][06674] Avg episode reward: [(0, '0.011')] [2024-06-27 13:54:04,528][06909] Updated weights for policy 0, policy_version 7732 (0.0032) [2024-06-27 13:54:08,314][06909] Updated weights for policy 0, policy_version 7742 (0.0032) [2024-06-27 13:54:08,850][06674] Fps is (10 sec: 45874.0, 60 sec: 43417.4, 300 sec: 43375.9). Total num frames: 126861312. Throughput: 0: 43075.4. Samples: 29810740. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-27 13:54:08,850][06674] Avg episode reward: [(0, '0.012')] [2024-06-27 13:54:12,233][06909] Updated weights for policy 0, policy_version 7752 (0.0028) [2024-06-27 13:54:13,625][06887] Signal inference workers to stop experience collection... (350 times) [2024-06-27 13:54:13,626][06887] Signal inference workers to resume experience collection... (350 times) [2024-06-27 13:54:13,664][06909] InferenceWorker_p0-w0: stopping experience collection (350 times) [2024-06-27 13:54:13,668][06909] InferenceWorker_p0-w0: resuming experience collection (350 times) [2024-06-27 13:54:13,852][06674] Fps is (10 sec: 44227.5, 60 sec: 43143.2, 300 sec: 43486.7). Total num frames: 127074304. Throughput: 0: 43212.8. Samples: 29944900. Policy #0 lag: (min: 1.0, avg: 10.2, max: 22.0) [2024-06-27 13:54:13,852][06674] Avg episode reward: [(0, '0.012')] [2024-06-27 13:54:15,746][06909] Updated weights for policy 0, policy_version 7762 (0.0040) [2024-06-27 13:54:18,850][06674] Fps is (10 sec: 42599.3, 60 sec: 43144.6, 300 sec: 43431.5). Total num frames: 127287296. Throughput: 0: 43199.5. Samples: 30208960. Policy #0 lag: (min: 1.0, avg: 10.2, max: 22.0) [2024-06-27 13:54:18,850][06674] Avg episode reward: [(0, '0.012')] [2024-06-27 13:54:19,960][06909] Updated weights for policy 0, policy_version 7772 (0.0026) [2024-06-27 13:54:23,263][06909] Updated weights for policy 0, policy_version 7782 (0.0030) [2024-06-27 13:54:23,850][06674] Fps is (10 sec: 44245.8, 60 sec: 43690.7, 300 sec: 43376.0). Total num frames: 127516672. Throughput: 0: 43273.5. Samples: 30463620. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-27 13:54:23,850][06674] Avg episode reward: [(0, '0.013')] [2024-06-27 13:54:27,602][06909] Updated weights for policy 0, policy_version 7792 (0.0040) [2024-06-27 13:54:28,850][06674] Fps is (10 sec: 42598.5, 60 sec: 42871.6, 300 sec: 43431.5). Total num frames: 127713280. Throughput: 0: 43534.8. Samples: 30602440. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 13:54:28,850][06674] Avg episode reward: [(0, '0.010')] [2024-06-27 13:54:31,120][06909] Updated weights for policy 0, policy_version 7802 (0.0037) [2024-06-27 13:54:33,850][06674] Fps is (10 sec: 39321.8, 60 sec: 42598.4, 300 sec: 43375.9). Total num frames: 127909888. Throughput: 0: 43377.0. Samples: 30859180. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 13:54:33,850][06674] Avg episode reward: [(0, '0.010')] [2024-06-27 13:54:35,122][06909] Updated weights for policy 0, policy_version 7812 (0.0032) [2024-06-27 13:54:38,758][06909] Updated weights for policy 0, policy_version 7822 (0.0047) [2024-06-27 13:54:38,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43419.1, 300 sec: 43376.0). Total num frames: 128155648. Throughput: 0: 43428.0. Samples: 31113260. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 13:54:38,850][06674] Avg episode reward: [(0, '0.010')] [2024-06-27 13:54:42,611][06909] Updated weights for policy 0, policy_version 7832 (0.0036) [2024-06-27 13:54:43,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43417.7, 300 sec: 43431.5). Total num frames: 128368640. Throughput: 0: 43608.8. Samples: 31253020. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-27 13:54:43,850][06674] Avg episode reward: [(0, '0.010')] [2024-06-27 13:54:46,179][06909] Updated weights for policy 0, policy_version 7842 (0.0033) [2024-06-27 13:54:48,850][06674] Fps is (10 sec: 40960.1, 60 sec: 42871.6, 300 sec: 43320.4). Total num frames: 128565248. Throughput: 0: 43458.2. Samples: 31509320. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 13:54:48,850][06674] Avg episode reward: [(0, '0.009')] [2024-06-27 13:54:50,133][06909] Updated weights for policy 0, policy_version 7852 (0.0038) [2024-06-27 13:54:53,742][06909] Updated weights for policy 0, policy_version 7862 (0.0041) [2024-06-27 13:54:53,852][06674] Fps is (10 sec: 44227.6, 60 sec: 43689.1, 300 sec: 43320.4). Total num frames: 128811008. Throughput: 0: 43344.0. Samples: 31761300. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 13:54:53,853][06674] Avg episode reward: [(0, '0.008')] [2024-06-27 13:54:57,699][06909] Updated weights for policy 0, policy_version 7872 (0.0028) [2024-06-27 13:54:58,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43690.5, 300 sec: 43431.5). Total num frames: 129024000. Throughput: 0: 43428.5. Samples: 31899100. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 13:54:58,851][06674] Avg episode reward: [(0, '0.009')] [2024-06-27 13:55:01,469][06909] Updated weights for policy 0, policy_version 7882 (0.0030) [2024-06-27 13:55:03,850][06674] Fps is (10 sec: 40968.2, 60 sec: 43144.5, 300 sec: 43320.4). Total num frames: 129220608. Throughput: 0: 43291.5. Samples: 32157080. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 13:55:03,850][06674] Avg episode reward: [(0, '0.009')] [2024-06-27 13:55:05,128][06909] Updated weights for policy 0, policy_version 7892 (0.0024) [2024-06-27 13:55:08,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43144.7, 300 sec: 43264.9). Total num frames: 129449984. Throughput: 0: 43451.6. Samples: 32418940. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 13:55:08,850][06674] Avg episode reward: [(0, '0.010')] [2024-06-27 13:55:08,883][06909] Updated weights for policy 0, policy_version 7902 (0.0038) [2024-06-27 13:55:13,042][06909] Updated weights for policy 0, policy_version 7912 (0.0028) [2024-06-27 13:55:13,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43146.1, 300 sec: 43376.0). Total num frames: 129662976. Throughput: 0: 43356.9. Samples: 32553500. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 13:55:13,850][06674] Avg episode reward: [(0, '0.008')] [2024-06-27 13:55:16,264][06909] Updated weights for policy 0, policy_version 7922 (0.0028) [2024-06-27 13:55:18,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43144.5, 300 sec: 43320.4). Total num frames: 129875968. Throughput: 0: 43437.7. Samples: 32813880. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-27 13:55:18,851][06674] Avg episode reward: [(0, '0.009')] [2024-06-27 13:55:20,468][06909] Updated weights for policy 0, policy_version 7932 (0.0031) [2024-06-27 13:55:23,809][06909] Updated weights for policy 0, policy_version 7942 (0.0034) [2024-06-27 13:55:23,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 130121728. Throughput: 0: 43564.4. Samples: 33073660. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-27 13:55:23,850][06674] Avg episode reward: [(0, '0.010')] [2024-06-27 13:55:27,882][06909] Updated weights for policy 0, policy_version 7952 (0.0043) [2024-06-27 13:55:28,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43417.6, 300 sec: 43376.0). Total num frames: 130318336. Throughput: 0: 43421.4. Samples: 33206980. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-27 13:55:28,850][06674] Avg episode reward: [(0, '0.009')] [2024-06-27 13:55:31,258][06909] Updated weights for policy 0, policy_version 7962 (0.0042) [2024-06-27 13:55:33,852][06674] Fps is (10 sec: 40951.7, 60 sec: 43689.2, 300 sec: 43431.2). Total num frames: 130531328. Throughput: 0: 43434.9. Samples: 33463980. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 13:55:33,852][06674] Avg episode reward: [(0, '0.010')] [2024-06-27 13:55:35,541][06909] Updated weights for policy 0, policy_version 7972 (0.0030) [2024-06-27 13:55:38,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43417.6, 300 sec: 43376.0). Total num frames: 130760704. Throughput: 0: 43562.1. Samples: 33721500. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 13:55:38,850][06674] Avg episode reward: [(0, '0.012')] [2024-06-27 13:55:38,885][06909] Updated weights for policy 0, policy_version 7982 (0.0042) [2024-06-27 13:55:39,916][06887] Signal inference workers to stop experience collection... (400 times) [2024-06-27 13:55:39,964][06909] InferenceWorker_p0-w0: stopping experience collection (400 times) [2024-06-27 13:55:39,974][06887] Signal inference workers to resume experience collection... (400 times) [2024-06-27 13:55:39,981][06909] InferenceWorker_p0-w0: resuming experience collection (400 times) [2024-06-27 13:55:43,421][06909] Updated weights for policy 0, policy_version 7992 (0.0033) [2024-06-27 13:55:43,850][06674] Fps is (10 sec: 42607.4, 60 sec: 43144.6, 300 sec: 43320.7). Total num frames: 130957312. Throughput: 0: 43333.0. Samples: 33849080. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-27 13:55:43,850][06674] Avg episode reward: [(0, '0.013')] [2024-06-27 13:55:46,519][06909] Updated weights for policy 0, policy_version 8002 (0.0038) [2024-06-27 13:55:48,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43690.6, 300 sec: 43431.5). Total num frames: 131186688. Throughput: 0: 43311.1. Samples: 34106080. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-27 13:55:48,850][06674] Avg episode reward: [(0, '0.011')] [2024-06-27 13:55:48,863][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000008007_131186688.pth... [2024-06-27 13:55:48,917][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000007372_120782848.pth [2024-06-27 13:55:50,828][06909] Updated weights for policy 0, policy_version 8012 (0.0036) [2024-06-27 13:55:53,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43419.2, 300 sec: 43376.0). Total num frames: 131416064. Throughput: 0: 43364.5. Samples: 34370340. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-27 13:55:53,850][06674] Avg episode reward: [(0, '0.010')] [2024-06-27 13:55:54,038][06909] Updated weights for policy 0, policy_version 8022 (0.0035) [2024-06-27 13:55:58,339][06909] Updated weights for policy 0, policy_version 8032 (0.0034) [2024-06-27 13:55:58,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43144.6, 300 sec: 43375.9). Total num frames: 131612672. Throughput: 0: 43306.6. Samples: 34502300. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 13:55:58,850][06674] Avg episode reward: [(0, '0.011')] [2024-06-27 13:56:01,469][06909] Updated weights for policy 0, policy_version 8042 (0.0041) [2024-06-27 13:56:03,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43417.6, 300 sec: 43375.9). Total num frames: 131825664. Throughput: 0: 43255.6. Samples: 34760380. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 13:56:03,850][06674] Avg episode reward: [(0, '0.009')] [2024-06-27 13:56:06,075][06909] Updated weights for policy 0, policy_version 8052 (0.0044) [2024-06-27 13:56:08,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43690.7, 300 sec: 43487.0). Total num frames: 132071424. Throughput: 0: 43180.5. Samples: 35016780. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 13:56:08,850][06674] Avg episode reward: [(0, '0.019')] [2024-06-27 13:56:08,943][06887] Saving new best policy, reward=0.019! [2024-06-27 13:56:08,946][06909] Updated weights for policy 0, policy_version 8062 (0.0035) [2024-06-27 13:56:13,590][06909] Updated weights for policy 0, policy_version 8072 (0.0030) [2024-06-27 13:56:13,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43144.5, 300 sec: 43376.0). Total num frames: 132251648. Throughput: 0: 43041.3. Samples: 35143840. Policy #0 lag: (min: 1.0, avg: 10.9, max: 21.0) [2024-06-27 13:56:13,850][06674] Avg episode reward: [(0, '0.011')] [2024-06-27 13:56:16,770][06909] Updated weights for policy 0, policy_version 8082 (0.0034) [2024-06-27 13:56:18,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43417.7, 300 sec: 43376.0). Total num frames: 132481024. Throughput: 0: 42991.3. Samples: 35398500. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 13:56:18,850][06674] Avg episode reward: [(0, '0.010')] [2024-06-27 13:56:21,189][06909] Updated weights for policy 0, policy_version 8092 (0.0040) [2024-06-27 13:56:23,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43144.6, 300 sec: 43431.5). Total num frames: 132710400. Throughput: 0: 43130.7. Samples: 35662380. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 13:56:23,850][06674] Avg episode reward: [(0, '0.012')] [2024-06-27 13:56:24,423][06909] Updated weights for policy 0, policy_version 8102 (0.0029) [2024-06-27 13:56:28,779][06909] Updated weights for policy 0, policy_version 8112 (0.0038) [2024-06-27 13:56:28,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43144.5, 300 sec: 43375.9). Total num frames: 132907008. Throughput: 0: 43235.1. Samples: 35794660. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-27 13:56:28,850][06674] Avg episode reward: [(0, '0.014')] [2024-06-27 13:56:31,900][06909] Updated weights for policy 0, policy_version 8122 (0.0037) [2024-06-27 13:56:33,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43419.0, 300 sec: 43375.9). Total num frames: 133136384. Throughput: 0: 43157.8. Samples: 36048180. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-27 13:56:33,850][06674] Avg episode reward: [(0, '0.014')] [2024-06-27 13:56:36,331][06909] Updated weights for policy 0, policy_version 8132 (0.0026) [2024-06-27 13:56:38,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43144.4, 300 sec: 43320.4). Total num frames: 133349376. Throughput: 0: 43117.6. Samples: 36310640. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-27 13:56:38,850][06674] Avg episode reward: [(0, '0.010')] [2024-06-27 13:56:39,408][06909] Updated weights for policy 0, policy_version 8142 (0.0040) [2024-06-27 13:56:43,770][06909] Updated weights for policy 0, policy_version 8152 (0.0029) [2024-06-27 13:56:43,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43417.5, 300 sec: 43320.4). Total num frames: 133562368. Throughput: 0: 43132.8. Samples: 36443280. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 13:56:43,850][06674] Avg episode reward: [(0, '0.011')] [2024-06-27 13:56:46,902][06909] Updated weights for policy 0, policy_version 8162 (0.0036) [2024-06-27 13:56:48,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 133791744. Throughput: 0: 43217.7. Samples: 36705180. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 13:56:48,856][06674] Avg episode reward: [(0, '0.012')] [2024-06-27 13:56:51,184][06909] Updated weights for policy 0, policy_version 8172 (0.0039) [2024-06-27 13:56:53,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43144.5, 300 sec: 43320.4). Total num frames: 134004736. Throughput: 0: 43291.1. Samples: 36964880. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 13:56:53,850][06674] Avg episode reward: [(0, '0.014')] [2024-06-27 13:56:54,498][06909] Updated weights for policy 0, policy_version 8182 (0.0028) [2024-06-27 13:56:58,695][06909] Updated weights for policy 0, policy_version 8192 (0.0029) [2024-06-27 13:56:58,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43417.6, 300 sec: 43375.9). Total num frames: 134217728. Throughput: 0: 43387.5. Samples: 37096280. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 13:56:58,850][06674] Avg episode reward: [(0, '0.013')] [2024-06-27 13:57:02,019][06909] Updated weights for policy 0, policy_version 8202 (0.0028) [2024-06-27 13:57:03,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43417.6, 300 sec: 43375.9). Total num frames: 134430720. Throughput: 0: 43467.6. Samples: 37354540. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 13:57:03,850][06674] Avg episode reward: [(0, '0.013')] [2024-06-27 13:57:06,248][06909] Updated weights for policy 0, policy_version 8212 (0.0030) [2024-06-27 13:57:07,167][06887] Signal inference workers to stop experience collection... (450 times) [2024-06-27 13:57:07,215][06909] InferenceWorker_p0-w0: stopping experience collection (450 times) [2024-06-27 13:57:07,279][06887] Signal inference workers to resume experience collection... (450 times) [2024-06-27 13:57:07,279][06909] InferenceWorker_p0-w0: resuming experience collection (450 times) [2024-06-27 13:57:08,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43144.5, 300 sec: 43375.9). Total num frames: 134660096. Throughput: 0: 43451.5. Samples: 37617700. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 13:57:08,850][06674] Avg episode reward: [(0, '0.013')] [2024-06-27 13:57:09,482][06909] Updated weights for policy 0, policy_version 8222 (0.0031) [2024-06-27 13:57:13,647][06909] Updated weights for policy 0, policy_version 8232 (0.0032) [2024-06-27 13:57:13,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43690.6, 300 sec: 43376.0). Total num frames: 134873088. Throughput: 0: 43559.9. Samples: 37754860. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 13:57:13,850][06674] Avg episode reward: [(0, '0.014')] [2024-06-27 13:57:16,881][06909] Updated weights for policy 0, policy_version 8242 (0.0034) [2024-06-27 13:57:18,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43417.5, 300 sec: 43375.9). Total num frames: 135086080. Throughput: 0: 43708.9. Samples: 38015080. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-27 13:57:18,850][06674] Avg episode reward: [(0, '0.014')] [2024-06-27 13:57:21,102][06909] Updated weights for policy 0, policy_version 8252 (0.0044) [2024-06-27 13:57:23,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43417.5, 300 sec: 43375.9). Total num frames: 135315456. Throughput: 0: 43745.4. Samples: 38279180. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-27 13:57:23,850][06674] Avg episode reward: [(0, '0.009')] [2024-06-27 13:57:24,410][06909] Updated weights for policy 0, policy_version 8262 (0.0038) [2024-06-27 13:57:28,562][06909] Updated weights for policy 0, policy_version 8272 (0.0024) [2024-06-27 13:57:28,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43690.7, 300 sec: 43376.0). Total num frames: 135528448. Throughput: 0: 43766.4. Samples: 38412760. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-27 13:57:28,850][06674] Avg episode reward: [(0, '0.009')] [2024-06-27 13:57:31,902][06909] Updated weights for policy 0, policy_version 8282 (0.0051) [2024-06-27 13:57:33,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43417.7, 300 sec: 43376.0). Total num frames: 135741440. Throughput: 0: 43750.8. Samples: 38673960. Policy #0 lag: (min: 0.0, avg: 10.9, max: 20.0) [2024-06-27 13:57:33,850][06674] Avg episode reward: [(0, '0.010')] [2024-06-27 13:57:36,470][06909] Updated weights for policy 0, policy_version 8292 (0.0041) [2024-06-27 13:57:38,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.8, 300 sec: 43431.5). Total num frames: 135970816. Throughput: 0: 43790.3. Samples: 38935440. Policy #0 lag: (min: 0.0, avg: 10.9, max: 20.0) [2024-06-27 13:57:38,850][06674] Avg episode reward: [(0, '0.010')] [2024-06-27 13:57:39,514][06909] Updated weights for policy 0, policy_version 8302 (0.0043) [2024-06-27 13:57:43,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43417.7, 300 sec: 43320.4). Total num frames: 136167424. Throughput: 0: 43859.2. Samples: 39069940. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 13:57:43,850][06674] Avg episode reward: [(0, '0.009')] [2024-06-27 13:57:43,899][06909] Updated weights for policy 0, policy_version 8312 (0.0030) [2024-06-27 13:57:47,041][06909] Updated weights for policy 0, policy_version 8322 (0.0034) [2024-06-27 13:57:48,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43417.6, 300 sec: 43375.9). Total num frames: 136396800. Throughput: 0: 43843.9. Samples: 39327520. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-27 13:57:48,851][06674] Avg episode reward: [(0, '0.016')] [2024-06-27 13:57:48,870][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000008325_136396800.pth... [2024-06-27 13:57:48,929][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000007690_125992960.pth [2024-06-27 13:57:51,324][06909] Updated weights for policy 0, policy_version 8332 (0.0029) [2024-06-27 13:57:53,850][06674] Fps is (10 sec: 47513.2, 60 sec: 43963.7, 300 sec: 43487.0). Total num frames: 136642560. Throughput: 0: 43859.5. Samples: 39591380. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-27 13:57:53,850][06674] Avg episode reward: [(0, '0.011')] [2024-06-27 13:57:54,647][06909] Updated weights for policy 0, policy_version 8342 (0.0040) [2024-06-27 13:57:58,805][06909] Updated weights for policy 0, policy_version 8352 (0.0037) [2024-06-27 13:57:58,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.7, 300 sec: 43320.4). Total num frames: 136839168. Throughput: 0: 43753.0. Samples: 39723740. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-27 13:57:58,850][06674] Avg episode reward: [(0, '0.012')] [2024-06-27 13:58:02,167][06909] Updated weights for policy 0, policy_version 8362 (0.0042) [2024-06-27 13:58:03,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43690.6, 300 sec: 43375.9). Total num frames: 137052160. Throughput: 0: 43597.4. Samples: 39976960. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 13:58:03,850][06674] Avg episode reward: [(0, '0.012')] [2024-06-27 13:58:06,287][06909] Updated weights for policy 0, policy_version 8372 (0.0029) [2024-06-27 13:58:08,844][06887] Signal inference workers to stop experience collection... (500 times) [2024-06-27 13:58:08,845][06887] Signal inference workers to resume experience collection... (500 times) [2024-06-27 13:58:08,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.6, 300 sec: 43376.0). Total num frames: 137281536. Throughput: 0: 43623.2. Samples: 40242220. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 13:58:08,851][06674] Avg episode reward: [(0, '0.011')] [2024-06-27 13:58:08,861][06909] InferenceWorker_p0-w0: stopping experience collection (500 times) [2024-06-27 13:58:08,892][06909] InferenceWorker_p0-w0: resuming experience collection (500 times) [2024-06-27 13:58:09,657][06909] Updated weights for policy 0, policy_version 8382 (0.0031) [2024-06-27 13:58:13,850][06674] Fps is (10 sec: 42596.7, 60 sec: 43417.4, 300 sec: 43320.4). Total num frames: 137478144. Throughput: 0: 43587.1. Samples: 40374200. Policy #0 lag: (min: 0.0, avg: 11.4, max: 22.0) [2024-06-27 13:58:13,851][06674] Avg episode reward: [(0, '0.014')] [2024-06-27 13:58:13,922][06909] Updated weights for policy 0, policy_version 8392 (0.0035) [2024-06-27 13:58:17,098][06909] Updated weights for policy 0, policy_version 8402 (0.0041) [2024-06-27 13:58:18,852][06674] Fps is (10 sec: 42589.6, 60 sec: 43689.2, 300 sec: 43431.2). Total num frames: 137707520. Throughput: 0: 43440.6. Samples: 40628880. Policy #0 lag: (min: 0.0, avg: 12.2, max: 23.0) [2024-06-27 13:58:18,853][06674] Avg episode reward: [(0, '0.014')] [2024-06-27 13:58:21,439][06909] Updated weights for policy 0, policy_version 8412 (0.0033) [2024-06-27 13:58:23,850][06674] Fps is (10 sec: 47515.4, 60 sec: 43963.8, 300 sec: 43431.5). Total num frames: 137953280. Throughput: 0: 43531.0. Samples: 40894340. Policy #0 lag: (min: 2.0, avg: 11.6, max: 22.0) [2024-06-27 13:58:23,850][06674] Avg episode reward: [(0, '0.015')] [2024-06-27 13:58:24,702][06909] Updated weights for policy 0, policy_version 8422 (0.0038) [2024-06-27 13:58:28,805][06909] Updated weights for policy 0, policy_version 8432 (0.0031) [2024-06-27 13:58:28,850][06674] Fps is (10 sec: 44246.1, 60 sec: 43690.6, 300 sec: 43375.9). Total num frames: 138149888. Throughput: 0: 43458.6. Samples: 41025580. Policy #0 lag: (min: 2.0, avg: 11.6, max: 22.0) [2024-06-27 13:58:28,850][06674] Avg episode reward: [(0, '0.017')] [2024-06-27 13:58:32,182][06909] Updated weights for policy 0, policy_version 8442 (0.0028) [2024-06-27 13:58:33,852][06674] Fps is (10 sec: 40951.8, 60 sec: 43689.2, 300 sec: 43431.5). Total num frames: 138362880. Throughput: 0: 43432.3. Samples: 41282060. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-27 13:58:33,852][06674] Avg episode reward: [(0, '0.014')] [2024-06-27 13:58:36,339][06909] Updated weights for policy 0, policy_version 8452 (0.0028) [2024-06-27 13:58:38,850][06674] Fps is (10 sec: 45874.4, 60 sec: 43963.6, 300 sec: 43542.6). Total num frames: 138608640. Throughput: 0: 43362.1. Samples: 41542680. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 13:58:38,851][06674] Avg episode reward: [(0, '0.012')] [2024-06-27 13:58:39,816][06909] Updated weights for policy 0, policy_version 8462 (0.0029) [2024-06-27 13:58:43,850][06674] Fps is (10 sec: 42606.9, 60 sec: 43690.6, 300 sec: 43376.0). Total num frames: 138788864. Throughput: 0: 43291.5. Samples: 41671860. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 13:58:43,850][06674] Avg episode reward: [(0, '0.012')] [2024-06-27 13:58:44,103][06909] Updated weights for policy 0, policy_version 8472 (0.0030) [2024-06-27 13:58:47,734][06909] Updated weights for policy 0, policy_version 8482 (0.0034) [2024-06-27 13:58:48,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43690.6, 300 sec: 43487.0). Total num frames: 139018240. Throughput: 0: 43427.0. Samples: 41931180. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 13:58:48,850][06674] Avg episode reward: [(0, '0.015')] [2024-06-27 13:58:51,560][06909] Updated weights for policy 0, policy_version 8492 (0.0029) [2024-06-27 13:58:53,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43417.6, 300 sec: 43542.5). Total num frames: 139247616. Throughput: 0: 43241.8. Samples: 42188100. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 13:58:53,850][06674] Avg episode reward: [(0, '0.015')] [2024-06-27 13:58:55,175][06909] Updated weights for policy 0, policy_version 8502 (0.0037) [2024-06-27 13:58:58,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 139444224. Throughput: 0: 43287.1. Samples: 42322100. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 13:58:58,850][06674] Avg episode reward: [(0, '0.014')] [2024-06-27 13:58:58,929][06909] Updated weights for policy 0, policy_version 8512 (0.0047) [2024-06-27 13:59:02,614][06909] Updated weights for policy 0, policy_version 8522 (0.0042) [2024-06-27 13:59:03,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43417.6, 300 sec: 43376.0). Total num frames: 139657216. Throughput: 0: 43420.2. Samples: 42582700. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 13:59:03,850][06674] Avg episode reward: [(0, '0.010')] [2024-06-27 13:59:06,750][06909] Updated weights for policy 0, policy_version 8532 (0.0034) [2024-06-27 13:59:08,850][06674] Fps is (10 sec: 45874.3, 60 sec: 43690.6, 300 sec: 43487.3). Total num frames: 139902976. Throughput: 0: 43303.0. Samples: 42842980. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-27 13:59:08,851][06674] Avg episode reward: [(0, '0.010')] [2024-06-27 13:59:09,967][06909] Updated weights for policy 0, policy_version 8542 (0.0028) [2024-06-27 13:59:13,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43691.0, 300 sec: 43431.5). Total num frames: 140099584. Throughput: 0: 43484.4. Samples: 42982380. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-27 13:59:13,850][06674] Avg episode reward: [(0, '0.010')] [2024-06-27 13:59:14,162][06909] Updated weights for policy 0, policy_version 8552 (0.0031) [2024-06-27 13:59:17,339][06909] Updated weights for policy 0, policy_version 8562 (0.0033) [2024-06-27 13:59:18,850][06674] Fps is (10 sec: 39322.2, 60 sec: 43146.0, 300 sec: 43320.4). Total num frames: 140296192. Throughput: 0: 43517.9. Samples: 43240280. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-27 13:59:18,850][06674] Avg episode reward: [(0, '0.016')] [2024-06-27 13:59:21,577][06909] Updated weights for policy 0, policy_version 8572 (0.0040) [2024-06-27 13:59:22,469][06887] Signal inference workers to stop experience collection... (550 times) [2024-06-27 13:59:22,469][06887] Signal inference workers to resume experience collection... (550 times) [2024-06-27 13:59:22,515][06909] InferenceWorker_p0-w0: stopping experience collection (550 times) [2024-06-27 13:59:22,515][06909] InferenceWorker_p0-w0: resuming experience collection (550 times) [2024-06-27 13:59:23,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43417.6, 300 sec: 43542.6). Total num frames: 140558336. Throughput: 0: 43487.3. Samples: 43499600. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 13:59:23,850][06674] Avg episode reward: [(0, '0.017')] [2024-06-27 13:59:24,780][06909] Updated weights for policy 0, policy_version 8582 (0.0026) [2024-06-27 13:59:28,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43417.5, 300 sec: 43542.5). Total num frames: 140754944. Throughput: 0: 43740.4. Samples: 43640180. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 13:59:28,850][06674] Avg episode reward: [(0, '0.014')] [2024-06-27 13:59:28,966][06909] Updated weights for policy 0, policy_version 8592 (0.0029) [2024-06-27 13:59:32,138][06909] Updated weights for policy 0, policy_version 8602 (0.0029) [2024-06-27 13:59:33,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43419.1, 300 sec: 43431.5). Total num frames: 140967936. Throughput: 0: 43669.9. Samples: 43896320. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-27 13:59:33,850][06674] Avg episode reward: [(0, '0.013')] [2024-06-27 13:59:36,300][06909] Updated weights for policy 0, policy_version 8612 (0.0028) [2024-06-27 13:59:38,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43417.7, 300 sec: 43542.6). Total num frames: 141213696. Throughput: 0: 43884.1. Samples: 44162880. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 13:59:38,850][06674] Avg episode reward: [(0, '0.013')] [2024-06-27 13:59:39,482][06909] Updated weights for policy 0, policy_version 8622 (0.0026) [2024-06-27 13:59:43,622][06909] Updated weights for policy 0, policy_version 8632 (0.0041) [2024-06-27 13:59:43,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.8, 300 sec: 43598.1). Total num frames: 141426688. Throughput: 0: 44015.1. Samples: 44302780. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 13:59:43,850][06674] Avg episode reward: [(0, '0.010')] [2024-06-27 13:59:46,918][06909] Updated weights for policy 0, policy_version 8642 (0.0038) [2024-06-27 13:59:48,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43690.7, 300 sec: 43487.3). Total num frames: 141639680. Throughput: 0: 43930.1. Samples: 44559560. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 13:59:48,851][06674] Avg episode reward: [(0, '0.013')] [2024-06-27 13:59:48,866][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000008645_141639680.pth... [2024-06-27 13:59:48,928][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000008007_131186688.pth [2024-06-27 13:59:51,126][06909] Updated weights for policy 0, policy_version 8652 (0.0044) [2024-06-27 13:59:53,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.7, 300 sec: 43542.6). Total num frames: 141869056. Throughput: 0: 44058.8. Samples: 44825620. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 13:59:53,850][06674] Avg episode reward: [(0, '0.015')] [2024-06-27 13:59:54,442][06909] Updated weights for policy 0, policy_version 8662 (0.0038) [2024-06-27 13:59:58,736][06909] Updated weights for policy 0, policy_version 8672 (0.0031) [2024-06-27 13:59:58,852][06674] Fps is (10 sec: 44228.3, 60 sec: 43962.2, 300 sec: 43597.8). Total num frames: 142082048. Throughput: 0: 43887.3. Samples: 44957400. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 13:59:58,852][06674] Avg episode reward: [(0, '0.016')] [2024-06-27 14:00:01,837][06909] Updated weights for policy 0, policy_version 8682 (0.0031) [2024-06-27 14:00:03,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.8, 300 sec: 43542.6). Total num frames: 142295040. Throughput: 0: 44003.6. Samples: 45220440. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 14:00:03,850][06674] Avg episode reward: [(0, '0.016')] [2024-06-27 14:00:06,511][06909] Updated weights for policy 0, policy_version 8692 (0.0034) [2024-06-27 14:00:08,850][06674] Fps is (10 sec: 44245.9, 60 sec: 43690.8, 300 sec: 43598.1). Total num frames: 142524416. Throughput: 0: 44108.5. Samples: 45484480. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-27 14:00:08,850][06674] Avg episode reward: [(0, '0.015')] [2024-06-27 14:00:09,217][06909] Updated weights for policy 0, policy_version 8702 (0.0031) [2024-06-27 14:00:13,850][06674] Fps is (10 sec: 42597.5, 60 sec: 43690.5, 300 sec: 43542.6). Total num frames: 142721024. Throughput: 0: 44019.0. Samples: 45621040. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-27 14:00:13,851][06674] Avg episode reward: [(0, '0.018')] [2024-06-27 14:00:14,020][06909] Updated weights for policy 0, policy_version 8712 (0.0033) [2024-06-27 14:00:16,697][06909] Updated weights for policy 0, policy_version 8722 (0.0037) [2024-06-27 14:00:18,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43963.8, 300 sec: 43431.5). Total num frames: 142934016. Throughput: 0: 43904.5. Samples: 45872020. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 14:00:18,850][06674] Avg episode reward: [(0, '0.011')] [2024-06-27 14:00:18,899][06887] Signal inference workers to stop experience collection... (600 times) [2024-06-27 14:00:18,950][06909] InferenceWorker_p0-w0: stopping experience collection (600 times) [2024-06-27 14:00:19,016][06887] Signal inference workers to resume experience collection... (600 times) [2024-06-27 14:00:19,016][06909] InferenceWorker_p0-w0: resuming experience collection (600 times) [2024-06-27 14:00:21,500][06909] Updated weights for policy 0, policy_version 8732 (0.0026) [2024-06-27 14:00:23,850][06674] Fps is (10 sec: 44237.9, 60 sec: 43417.7, 300 sec: 43542.6). Total num frames: 143163392. Throughput: 0: 43740.9. Samples: 46131220. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-27 14:00:23,850][06674] Avg episode reward: [(0, '0.013')] [2024-06-27 14:00:24,783][06909] Updated weights for policy 0, policy_version 8742 (0.0035) [2024-06-27 14:00:28,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43690.6, 300 sec: 43542.8). Total num frames: 143376384. Throughput: 0: 43593.6. Samples: 46264500. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-27 14:00:28,850][06674] Avg episode reward: [(0, '0.014')] [2024-06-27 14:00:28,934][06909] Updated weights for policy 0, policy_version 8752 (0.0024) [2024-06-27 14:00:32,251][06909] Updated weights for policy 0, policy_version 8762 (0.0026) [2024-06-27 14:00:33,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 43542.6). Total num frames: 143605760. Throughput: 0: 43580.6. Samples: 46520680. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-27 14:00:33,850][06674] Avg episode reward: [(0, '0.018')] [2024-06-27 14:00:36,302][06909] Updated weights for policy 0, policy_version 8772 (0.0027) [2024-06-27 14:00:38,850][06674] Fps is (10 sec: 45875.8, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 143835136. Throughput: 0: 43640.9. Samples: 46789460. Policy #0 lag: (min: 1.0, avg: 10.2, max: 24.0) [2024-06-27 14:00:38,850][06674] Avg episode reward: [(0, '0.019')] [2024-06-27 14:00:39,659][06909] Updated weights for policy 0, policy_version 8782 (0.0034) [2024-06-27 14:00:43,827][06909] Updated weights for policy 0, policy_version 8792 (0.0034) [2024-06-27 14:00:43,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 144048128. Throughput: 0: 43513.0. Samples: 46915400. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-27 14:00:43,850][06674] Avg episode reward: [(0, '0.017')] [2024-06-27 14:00:47,260][06909] Updated weights for policy 0, policy_version 8802 (0.0036) [2024-06-27 14:00:48,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43690.7, 300 sec: 43542.5). Total num frames: 144261120. Throughput: 0: 43470.1. Samples: 47176600. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-27 14:00:48,853][06674] Avg episode reward: [(0, '0.017')] [2024-06-27 14:00:51,342][06909] Updated weights for policy 0, policy_version 8812 (0.0029) [2024-06-27 14:00:53,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 144474112. Throughput: 0: 43461.2. Samples: 47440240. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-27 14:00:53,850][06674] Avg episode reward: [(0, '0.012')] [2024-06-27 14:00:55,025][06909] Updated weights for policy 0, policy_version 8822 (0.0038) [2024-06-27 14:00:58,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43419.1, 300 sec: 43598.1). Total num frames: 144687104. Throughput: 0: 43291.8. Samples: 47569160. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-27 14:00:58,850][06674] Avg episode reward: [(0, '0.010')] [2024-06-27 14:00:58,925][06909] Updated weights for policy 0, policy_version 8832 (0.0031) [2024-06-27 14:01:02,704][06909] Updated weights for policy 0, policy_version 8842 (0.0043) [2024-06-27 14:01:03,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.6, 300 sec: 43542.5). Total num frames: 144916480. Throughput: 0: 43628.8. Samples: 47835320. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-27 14:01:03,850][06674] Avg episode reward: [(0, '0.017')] [2024-06-27 14:01:06,564][06909] Updated weights for policy 0, policy_version 8852 (0.0035) [2024-06-27 14:01:08,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43417.5, 300 sec: 43653.6). Total num frames: 145129472. Throughput: 0: 43451.4. Samples: 48086540. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-27 14:01:08,850][06674] Avg episode reward: [(0, '0.012')] [2024-06-27 14:01:10,221][06909] Updated weights for policy 0, policy_version 8862 (0.0041) [2024-06-27 14:01:13,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43417.7, 300 sec: 43542.6). Total num frames: 145326080. Throughput: 0: 43476.6. Samples: 48220940. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 14:01:13,850][06674] Avg episode reward: [(0, '0.011')] [2024-06-27 14:01:14,034][06909] Updated weights for policy 0, policy_version 8872 (0.0024) [2024-06-27 14:01:17,699][06909] Updated weights for policy 0, policy_version 8882 (0.0034) [2024-06-27 14:01:18,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.7, 300 sec: 43598.1). Total num frames: 145571840. Throughput: 0: 43641.3. Samples: 48484540. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 14:01:18,850][06674] Avg episode reward: [(0, '0.014')] [2024-06-27 14:01:21,558][06909] Updated weights for policy 0, policy_version 8892 (0.0036) [2024-06-27 14:01:23,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 145784832. Throughput: 0: 43345.3. Samples: 48740000. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 14:01:23,850][06674] Avg episode reward: [(0, '0.016')] [2024-06-27 14:01:25,048][06909] Updated weights for policy 0, policy_version 8902 (0.0037) [2024-06-27 14:01:28,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.8, 300 sec: 43598.1). Total num frames: 145997824. Throughput: 0: 43651.7. Samples: 48879720. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-27 14:01:28,850][06674] Avg episode reward: [(0, '0.013')] [2024-06-27 14:01:29,088][06909] Updated weights for policy 0, policy_version 8912 (0.0040) [2024-06-27 14:01:32,528][06909] Updated weights for policy 0, policy_version 8922 (0.0039) [2024-06-27 14:01:33,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 146210816. Throughput: 0: 43493.4. Samples: 49133800. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-27 14:01:33,850][06674] Avg episode reward: [(0, '0.014')] [2024-06-27 14:01:36,575][06909] Updated weights for policy 0, policy_version 8932 (0.0027) [2024-06-27 14:01:38,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43144.5, 300 sec: 43598.1). Total num frames: 146423808. Throughput: 0: 43393.4. Samples: 49392940. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-27 14:01:38,850][06674] Avg episode reward: [(0, '0.011')] [2024-06-27 14:01:40,327][06909] Updated weights for policy 0, policy_version 8942 (0.0034) [2024-06-27 14:01:43,446][06887] Signal inference workers to stop experience collection... (650 times) [2024-06-27 14:01:43,477][06909] InferenceWorker_p0-w0: stopping experience collection (650 times) [2024-06-27 14:01:43,502][06887] Signal inference workers to resume experience collection... (650 times) [2024-06-27 14:01:43,503][06909] InferenceWorker_p0-w0: resuming experience collection (650 times) [2024-06-27 14:01:43,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43417.7, 300 sec: 43598.1). Total num frames: 146653184. Throughput: 0: 43577.2. Samples: 49530140. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-27 14:01:43,851][06674] Avg episode reward: [(0, '0.014')] [2024-06-27 14:01:44,050][06909] Updated weights for policy 0, policy_version 8952 (0.0036) [2024-06-27 14:01:47,745][06909] Updated weights for policy 0, policy_version 8962 (0.0043) [2024-06-27 14:01:48,852][06674] Fps is (10 sec: 42589.6, 60 sec: 43143.1, 300 sec: 43542.3). Total num frames: 146849792. Throughput: 0: 43363.9. Samples: 49786780. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-27 14:01:48,852][06674] Avg episode reward: [(0, '0.013')] [2024-06-27 14:01:48,869][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000008963_146849792.pth... [2024-06-27 14:01:48,913][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000008325_136396800.pth [2024-06-27 14:01:51,671][06909] Updated weights for policy 0, policy_version 8972 (0.0027) [2024-06-27 14:01:53,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 147095552. Throughput: 0: 43475.2. Samples: 50042920. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2024-06-27 14:01:53,850][06674] Avg episode reward: [(0, '0.011')] [2024-06-27 14:01:55,204][06909] Updated weights for policy 0, policy_version 8982 (0.0035) [2024-06-27 14:01:58,850][06674] Fps is (10 sec: 44245.2, 60 sec: 43417.5, 300 sec: 43598.1). Total num frames: 147292160. Throughput: 0: 43594.0. Samples: 50182680. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-27 14:01:58,851][06674] Avg episode reward: [(0, '0.011')] [2024-06-27 14:01:59,167][06909] Updated weights for policy 0, policy_version 8992 (0.0036) [2024-06-27 14:02:02,640][06909] Updated weights for policy 0, policy_version 9002 (0.0023) [2024-06-27 14:02:03,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43144.5, 300 sec: 43542.5). Total num frames: 147505152. Throughput: 0: 43477.3. Samples: 50441020. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-27 14:02:03,850][06674] Avg episode reward: [(0, '0.010')] [2024-06-27 14:02:06,619][06909] Updated weights for policy 0, policy_version 9012 (0.0031) [2024-06-27 14:02:08,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 147750912. Throughput: 0: 43543.5. Samples: 50699460. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-27 14:02:08,850][06674] Avg episode reward: [(0, '0.013')] [2024-06-27 14:02:10,233][06909] Updated weights for policy 0, policy_version 9022 (0.0026) [2024-06-27 14:02:13,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 147947520. Throughput: 0: 43536.9. Samples: 50838880. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2024-06-27 14:02:13,850][06674] Avg episode reward: [(0, '0.017')] [2024-06-27 14:02:14,200][06909] Updated weights for policy 0, policy_version 9032 (0.0030) [2024-06-27 14:02:17,722][06909] Updated weights for policy 0, policy_version 9042 (0.0027) [2024-06-27 14:02:18,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43144.6, 300 sec: 43542.6). Total num frames: 148160512. Throughput: 0: 43626.2. Samples: 51096980. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2024-06-27 14:02:18,850][06674] Avg episode reward: [(0, '0.018')] [2024-06-27 14:02:21,587][06909] Updated weights for policy 0, policy_version 9052 (0.0045) [2024-06-27 14:02:23,850][06674] Fps is (10 sec: 45873.8, 60 sec: 43690.5, 300 sec: 43653.6). Total num frames: 148406272. Throughput: 0: 43628.6. Samples: 51356240. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-27 14:02:23,851][06674] Avg episode reward: [(0, '0.018')] [2024-06-27 14:02:25,297][06909] Updated weights for policy 0, policy_version 9062 (0.0029) [2024-06-27 14:02:28,850][06674] Fps is (10 sec: 45874.4, 60 sec: 43690.5, 300 sec: 43653.6). Total num frames: 148619264. Throughput: 0: 43678.6. Samples: 51495680. Policy #0 lag: (min: 1.0, avg: 8.1, max: 20.0) [2024-06-27 14:02:28,850][06674] Avg episode reward: [(0, '0.019')] [2024-06-27 14:02:29,015][06909] Updated weights for policy 0, policy_version 9072 (0.0039) [2024-06-27 14:02:32,878][06909] Updated weights for policy 0, policy_version 9082 (0.0035) [2024-06-27 14:02:33,854][06674] Fps is (10 sec: 42580.7, 60 sec: 43687.4, 300 sec: 43597.4). Total num frames: 148832256. Throughput: 0: 43742.6. Samples: 51755300. Policy #0 lag: (min: 1.0, avg: 8.1, max: 20.0) [2024-06-27 14:02:33,855][06674] Avg episode reward: [(0, '0.015')] [2024-06-27 14:02:36,463][06909] Updated weights for policy 0, policy_version 9092 (0.0027) [2024-06-27 14:02:38,850][06674] Fps is (10 sec: 44237.6, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 149061632. Throughput: 0: 43814.2. Samples: 52014560. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-27 14:02:38,850][06674] Avg episode reward: [(0, '0.017')] [2024-06-27 14:02:40,330][06909] Updated weights for policy 0, policy_version 9102 (0.0028) [2024-06-27 14:02:43,850][06674] Fps is (10 sec: 42617.1, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 149258240. Throughput: 0: 43683.7. Samples: 52148440. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2024-06-27 14:02:43,850][06674] Avg episode reward: [(0, '0.017')] [2024-06-27 14:02:44,139][06909] Updated weights for policy 0, policy_version 9112 (0.0029) [2024-06-27 14:02:47,890][06909] Updated weights for policy 0, policy_version 9122 (0.0040) [2024-06-27 14:02:48,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43965.2, 300 sec: 43542.6). Total num frames: 149487616. Throughput: 0: 43671.6. Samples: 52406240. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2024-06-27 14:02:48,850][06674] Avg episode reward: [(0, '0.013')] [2024-06-27 14:02:51,588][06909] Updated weights for policy 0, policy_version 9132 (0.0037) [2024-06-27 14:02:53,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 149716992. Throughput: 0: 43761.8. Samples: 52668740. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 14:02:53,850][06674] Avg episode reward: [(0, '0.012')] [2024-06-27 14:02:55,354][06909] Updated weights for policy 0, policy_version 9142 (0.0029) [2024-06-27 14:02:58,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.8, 300 sec: 43598.1). Total num frames: 149913600. Throughput: 0: 43518.2. Samples: 52797200. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-27 14:02:58,850][06674] Avg episode reward: [(0, '0.014')] [2024-06-27 14:02:59,222][06909] Updated weights for policy 0, policy_version 9152 (0.0043) [2024-06-27 14:03:02,860][06909] Updated weights for policy 0, policy_version 9162 (0.0041) [2024-06-27 14:03:03,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43690.8, 300 sec: 43542.6). Total num frames: 150126592. Throughput: 0: 43605.4. Samples: 53059220. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-27 14:03:03,850][06674] Avg episode reward: [(0, '0.015')] [2024-06-27 14:03:06,719][06909] Updated weights for policy 0, policy_version 9172 (0.0029) [2024-06-27 14:03:08,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43417.6, 300 sec: 43653.7). Total num frames: 150355968. Throughput: 0: 43541.1. Samples: 53315580. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-27 14:03:08,850][06674] Avg episode reward: [(0, '0.015')] [2024-06-27 14:03:10,431][06909] Updated weights for policy 0, policy_version 9182 (0.0050) [2024-06-27 14:03:13,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.7, 300 sec: 43598.4). Total num frames: 150568960. Throughput: 0: 43479.3. Samples: 53452240. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-27 14:03:13,850][06674] Avg episode reward: [(0, '0.014')] [2024-06-27 14:03:14,128][06909] Updated weights for policy 0, policy_version 9192 (0.0049) [2024-06-27 14:03:17,876][06909] Updated weights for policy 0, policy_version 9202 (0.0037) [2024-06-27 14:03:18,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.6, 300 sec: 43487.0). Total num frames: 150781952. Throughput: 0: 43475.7. Samples: 53711520. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-27 14:03:18,850][06674] Avg episode reward: [(0, '0.015')] [2024-06-27 14:03:21,783][06909] Updated weights for policy 0, policy_version 9212 (0.0044) [2024-06-27 14:03:23,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43690.9, 300 sec: 43653.6). Total num frames: 151027712. Throughput: 0: 43456.4. Samples: 53970100. Policy #0 lag: (min: 1.0, avg: 9.7, max: 22.0) [2024-06-27 14:03:23,850][06674] Avg episode reward: [(0, '0.017')] [2024-06-27 14:03:25,528][06909] Updated weights for policy 0, policy_version 9222 (0.0033) [2024-06-27 14:03:28,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43144.6, 300 sec: 43542.9). Total num frames: 151207936. Throughput: 0: 43413.8. Samples: 54102060. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-27 14:03:28,850][06674] Avg episode reward: [(0, '0.012')] [2024-06-27 14:03:29,426][06909] Updated weights for policy 0, policy_version 9232 (0.0036) [2024-06-27 14:03:33,084][06909] Updated weights for policy 0, policy_version 9242 (0.0032) [2024-06-27 14:03:33,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43420.8, 300 sec: 43487.0). Total num frames: 151437312. Throughput: 0: 43285.7. Samples: 54354100. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-27 14:03:33,850][06674] Avg episode reward: [(0, '0.017')] [2024-06-27 14:03:36,818][06909] Updated weights for policy 0, policy_version 9252 (0.0027) [2024-06-27 14:03:37,881][06887] Signal inference workers to stop experience collection... (700 times) [2024-06-27 14:03:37,882][06887] Signal inference workers to resume experience collection... (700 times) [2024-06-27 14:03:37,899][06909] InferenceWorker_p0-w0: stopping experience collection (700 times) [2024-06-27 14:03:37,929][06909] InferenceWorker_p0-w0: resuming experience collection (700 times) [2024-06-27 14:03:38,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43144.5, 300 sec: 43598.1). Total num frames: 151650304. Throughput: 0: 43293.4. Samples: 54616940. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-27 14:03:38,850][06674] Avg episode reward: [(0, '0.024')] [2024-06-27 14:03:38,855][06887] Saving new best policy, reward=0.024! [2024-06-27 14:03:40,986][06909] Updated weights for policy 0, policy_version 9262 (0.0033) [2024-06-27 14:03:43,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 151879680. Throughput: 0: 43482.7. Samples: 54753920. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-27 14:03:43,850][06674] Avg episode reward: [(0, '0.026')] [2024-06-27 14:03:43,850][06887] Saving new best policy, reward=0.026! [2024-06-27 14:03:44,224][06909] Updated weights for policy 0, policy_version 9272 (0.0041) [2024-06-27 14:03:48,538][06909] Updated weights for policy 0, policy_version 9282 (0.0028) [2024-06-27 14:03:48,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43144.5, 300 sec: 43487.0). Total num frames: 152076288. Throughput: 0: 43351.5. Samples: 55010040. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 14:03:48,850][06674] Avg episode reward: [(0, '0.026')] [2024-06-27 14:03:48,941][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000009283_152092672.pth... [2024-06-27 14:03:48,983][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000008645_141639680.pth [2024-06-27 14:03:51,704][06909] Updated weights for policy 0, policy_version 9292 (0.0045) [2024-06-27 14:03:53,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43417.6, 300 sec: 43653.6). Total num frames: 152322048. Throughput: 0: 43409.3. Samples: 55269000. Policy #0 lag: (min: 1.0, avg: 11.3, max: 21.0) [2024-06-27 14:03:53,850][06674] Avg episode reward: [(0, '0.011')] [2024-06-27 14:03:55,952][06909] Updated weights for policy 0, policy_version 9302 (0.0032) [2024-06-27 14:03:58,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 152518656. Throughput: 0: 43436.8. Samples: 55406900. Policy #0 lag: (min: 1.0, avg: 11.3, max: 21.0) [2024-06-27 14:03:58,850][06674] Avg episode reward: [(0, '0.011')] [2024-06-27 14:03:59,291][06909] Updated weights for policy 0, policy_version 9312 (0.0028) [2024-06-27 14:04:03,475][06909] Updated weights for policy 0, policy_version 9322 (0.0035) [2024-06-27 14:04:03,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43417.5, 300 sec: 43487.0). Total num frames: 152731648. Throughput: 0: 43255.6. Samples: 55658020. Policy #0 lag: (min: 2.0, avg: 11.8, max: 25.0) [2024-06-27 14:04:03,850][06674] Avg episode reward: [(0, '0.012')] [2024-06-27 14:04:06,811][06909] Updated weights for policy 0, policy_version 9332 (0.0031) [2024-06-27 14:04:08,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 152977408. Throughput: 0: 43344.4. Samples: 55920600. Policy #0 lag: (min: 0.0, avg: 12.4, max: 25.0) [2024-06-27 14:04:08,850][06674] Avg episode reward: [(0, '0.013')] [2024-06-27 14:04:10,929][06909] Updated weights for policy 0, policy_version 9342 (0.0040) [2024-06-27 14:04:13,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43144.5, 300 sec: 43598.1). Total num frames: 153157632. Throughput: 0: 43406.7. Samples: 56055360. Policy #0 lag: (min: 0.0, avg: 12.4, max: 25.0) [2024-06-27 14:04:13,850][06674] Avg episode reward: [(0, '0.011')] [2024-06-27 14:04:14,563][06909] Updated weights for policy 0, policy_version 9352 (0.0036) [2024-06-27 14:04:18,431][06909] Updated weights for policy 0, policy_version 9362 (0.0041) [2024-06-27 14:04:18,851][06674] Fps is (10 sec: 40957.1, 60 sec: 43417.2, 300 sec: 43486.9). Total num frames: 153387008. Throughput: 0: 43494.9. Samples: 56311400. Policy #0 lag: (min: 0.0, avg: 11.9, max: 25.0) [2024-06-27 14:04:18,851][06674] Avg episode reward: [(0, '0.010')] [2024-06-27 14:04:22,261][06909] Updated weights for policy 0, policy_version 9372 (0.0054) [2024-06-27 14:04:23,850][06674] Fps is (10 sec: 47513.2, 60 sec: 43417.5, 300 sec: 43653.6). Total num frames: 153632768. Throughput: 0: 43420.8. Samples: 56570880. Policy #0 lag: (min: 0.0, avg: 12.1, max: 23.0) [2024-06-27 14:04:23,859][06674] Avg episode reward: [(0, '0.012')] [2024-06-27 14:04:26,145][06909] Updated weights for policy 0, policy_version 9382 (0.0032) [2024-06-27 14:04:28,850][06674] Fps is (10 sec: 42601.5, 60 sec: 43417.7, 300 sec: 43542.6). Total num frames: 153812992. Throughput: 0: 43412.9. Samples: 56707500. Policy #0 lag: (min: 0.0, avg: 12.1, max: 23.0) [2024-06-27 14:04:28,850][06674] Avg episode reward: [(0, '0.012')] [2024-06-27 14:04:29,578][06909] Updated weights for policy 0, policy_version 9392 (0.0034) [2024-06-27 14:04:33,537][06909] Updated weights for policy 0, policy_version 9402 (0.0041) [2024-06-27 14:04:33,850][06674] Fps is (10 sec: 40960.7, 60 sec: 43417.7, 300 sec: 43487.0). Total num frames: 154042368. Throughput: 0: 43432.1. Samples: 56964480. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-27 14:04:33,850][06674] Avg episode reward: [(0, '0.017')] [2024-06-27 14:04:36,957][06909] Updated weights for policy 0, policy_version 9412 (0.0032) [2024-06-27 14:04:38,850][06674] Fps is (10 sec: 49151.3, 60 sec: 44236.7, 300 sec: 43653.6). Total num frames: 154304512. Throughput: 0: 43556.9. Samples: 57229060. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-27 14:04:38,851][06674] Avg episode reward: [(0, '0.018')] [2024-06-27 14:04:40,943][06909] Updated weights for policy 0, policy_version 9422 (0.0031) [2024-06-27 14:04:43,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43144.5, 300 sec: 43487.0). Total num frames: 154468352. Throughput: 0: 43568.4. Samples: 57367480. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-27 14:04:43,850][06674] Avg episode reward: [(0, '0.017')] [2024-06-27 14:04:44,458][06909] Updated weights for policy 0, policy_version 9432 (0.0028) [2024-06-27 14:04:46,872][06887] Signal inference workers to stop experience collection... (750 times) [2024-06-27 14:04:46,919][06909] InferenceWorker_p0-w0: stopping experience collection (750 times) [2024-06-27 14:04:46,926][06887] Signal inference workers to resume experience collection... (750 times) [2024-06-27 14:04:46,937][06909] InferenceWorker_p0-w0: resuming experience collection (750 times) [2024-06-27 14:04:48,318][06909] Updated weights for policy 0, policy_version 9442 (0.0034) [2024-06-27 14:04:48,850][06674] Fps is (10 sec: 39321.6, 60 sec: 43690.6, 300 sec: 43487.0). Total num frames: 154697728. Throughput: 0: 43721.7. Samples: 57625500. Policy #0 lag: (min: 1.0, avg: 11.8, max: 21.0) [2024-06-27 14:04:48,851][06674] Avg episode reward: [(0, '0.017')] [2024-06-27 14:04:51,850][06909] Updated weights for policy 0, policy_version 9452 (0.0038) [2024-06-27 14:04:53,850][06674] Fps is (10 sec: 49152.1, 60 sec: 43963.8, 300 sec: 43653.9). Total num frames: 154959872. Throughput: 0: 43744.4. Samples: 57889100. Policy #0 lag: (min: 0.0, avg: 11.8, max: 20.0) [2024-06-27 14:04:53,850][06674] Avg episode reward: [(0, '0.017')] [2024-06-27 14:04:55,806][06909] Updated weights for policy 0, policy_version 9462 (0.0036) [2024-06-27 14:04:58,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43417.6, 300 sec: 43487.0). Total num frames: 155123712. Throughput: 0: 43809.3. Samples: 58026780. Policy #0 lag: (min: 0.0, avg: 11.8, max: 20.0) [2024-06-27 14:04:58,850][06674] Avg episode reward: [(0, '0.014')] [2024-06-27 14:04:59,298][06909] Updated weights for policy 0, policy_version 9472 (0.0035) [2024-06-27 14:05:03,306][06909] Updated weights for policy 0, policy_version 9482 (0.0031) [2024-06-27 14:05:03,850][06674] Fps is (10 sec: 39321.4, 60 sec: 43690.7, 300 sec: 43487.0). Total num frames: 155353088. Throughput: 0: 43789.1. Samples: 58281880. Policy #0 lag: (min: 0.0, avg: 11.7, max: 20.0) [2024-06-27 14:05:03,850][06674] Avg episode reward: [(0, '0.014')] [2024-06-27 14:05:07,010][06909] Updated weights for policy 0, policy_version 9492 (0.0027) [2024-06-27 14:05:08,850][06674] Fps is (10 sec: 47513.7, 60 sec: 43690.6, 300 sec: 43653.7). Total num frames: 155598848. Throughput: 0: 43911.6. Samples: 58546900. Policy #0 lag: (min: 0.0, avg: 11.7, max: 20.0) [2024-06-27 14:05:08,850][06674] Avg episode reward: [(0, '0.013')] [2024-06-27 14:05:10,756][06909] Updated weights for policy 0, policy_version 9502 (0.0039) [2024-06-27 14:05:13,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43963.8, 300 sec: 43598.1). Total num frames: 155795456. Throughput: 0: 43780.5. Samples: 58677620. Policy #0 lag: (min: 0.0, avg: 12.1, max: 21.0) [2024-06-27 14:05:13,850][06674] Avg episode reward: [(0, '0.014')] [2024-06-27 14:05:14,542][06909] Updated weights for policy 0, policy_version 9512 (0.0034) [2024-06-27 14:05:18,133][06909] Updated weights for policy 0, policy_version 9522 (0.0058) [2024-06-27 14:05:18,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43964.2, 300 sec: 43598.1). Total num frames: 156024832. Throughput: 0: 43790.1. Samples: 58935040. Policy #0 lag: (min: 0.0, avg: 12.3, max: 21.0) [2024-06-27 14:05:18,851][06674] Avg episode reward: [(0, '0.017')] [2024-06-27 14:05:22,184][06909] Updated weights for policy 0, policy_version 9532 (0.0032) [2024-06-27 14:05:23,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43690.7, 300 sec: 43653.7). Total num frames: 156254208. Throughput: 0: 43808.5. Samples: 59200440. Policy #0 lag: (min: 0.0, avg: 12.3, max: 21.0) [2024-06-27 14:05:23,850][06674] Avg episode reward: [(0, '0.017')] [2024-06-27 14:05:25,587][06909] Updated weights for policy 0, policy_version 9542 (0.0037) [2024-06-27 14:05:28,851][06674] Fps is (10 sec: 42593.0, 60 sec: 43962.7, 300 sec: 43542.4). Total num frames: 156450816. Throughput: 0: 43754.3. Samples: 59336480. Policy #0 lag: (min: 0.0, avg: 11.3, max: 20.0) [2024-06-27 14:05:28,852][06674] Avg episode reward: [(0, '0.017')] [2024-06-27 14:05:29,615][06909] Updated weights for policy 0, policy_version 9552 (0.0039) [2024-06-27 14:05:33,023][06909] Updated weights for policy 0, policy_version 9562 (0.0045) [2024-06-27 14:05:33,850][06674] Fps is (10 sec: 40959.3, 60 sec: 43690.5, 300 sec: 43487.0). Total num frames: 156663808. Throughput: 0: 43661.3. Samples: 59590260. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 14:05:33,850][06674] Avg episode reward: [(0, '0.020')] [2024-06-27 14:05:37,187][06909] Updated weights for policy 0, policy_version 9572 (0.0047) [2024-06-27 14:05:38,850][06674] Fps is (10 sec: 45881.1, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 156909568. Throughput: 0: 43604.4. Samples: 59851300. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 14:05:38,850][06674] Avg episode reward: [(0, '0.022')] [2024-06-27 14:05:41,157][06909] Updated weights for policy 0, policy_version 9582 (0.0033) [2024-06-27 14:05:43,850][06674] Fps is (10 sec: 42599.3, 60 sec: 43690.7, 300 sec: 43487.0). Total num frames: 157089792. Throughput: 0: 43557.8. Samples: 59986880. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 14:05:43,850][06674] Avg episode reward: [(0, '0.022')] [2024-06-27 14:05:44,678][06909] Updated weights for policy 0, policy_version 9592 (0.0024) [2024-06-27 14:05:48,547][06909] Updated weights for policy 0, policy_version 9602 (0.0034) [2024-06-27 14:05:48,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.7, 300 sec: 43542.6). Total num frames: 157319168. Throughput: 0: 43622.2. Samples: 60244880. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 14:05:48,850][06674] Avg episode reward: [(0, '0.020')] [2024-06-27 14:05:48,905][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000009603_157335552.pth... [2024-06-27 14:05:48,950][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000008963_146849792.pth [2024-06-27 14:05:52,327][06909] Updated weights for policy 0, policy_version 9612 (0.0040) [2024-06-27 14:05:53,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43144.5, 300 sec: 43598.1). Total num frames: 157548544. Throughput: 0: 43488.4. Samples: 60503880. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-27 14:05:53,850][06674] Avg episode reward: [(0, '0.016')] [2024-06-27 14:05:55,907][06909] Updated weights for policy 0, policy_version 9622 (0.0032) [2024-06-27 14:05:58,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 43542.6). Total num frames: 157761536. Throughput: 0: 43469.6. Samples: 60633760. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 14:05:58,850][06674] Avg episode reward: [(0, '0.015')] [2024-06-27 14:05:59,782][06909] Updated weights for policy 0, policy_version 9632 (0.0035) [2024-06-27 14:06:03,459][06909] Updated weights for policy 0, policy_version 9642 (0.0033) [2024-06-27 14:06:03,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 43598.1). Total num frames: 157990912. Throughput: 0: 43568.5. Samples: 60895620. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 14:06:03,850][06674] Avg episode reward: [(0, '0.018')] [2024-06-27 14:06:05,870][06887] Signal inference workers to stop experience collection... (800 times) [2024-06-27 14:06:05,870][06887] Signal inference workers to resume experience collection... (800 times) [2024-06-27 14:06:05,917][06909] InferenceWorker_p0-w0: stopping experience collection (800 times) [2024-06-27 14:06:05,918][06909] InferenceWorker_p0-w0: resuming experience collection (800 times) [2024-06-27 14:06:07,428][06909] Updated weights for policy 0, policy_version 9652 (0.0040) [2024-06-27 14:06:08,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 158220288. Throughput: 0: 43444.4. Samples: 61155440. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-27 14:06:08,851][06674] Avg episode reward: [(0, '0.019')] [2024-06-27 14:06:10,868][06909] Updated weights for policy 0, policy_version 9662 (0.0047) [2024-06-27 14:06:13,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43690.7, 300 sec: 43542.6). Total num frames: 158416896. Throughput: 0: 43320.5. Samples: 61285840. Policy #0 lag: (min: 1.0, avg: 8.8, max: 21.0) [2024-06-27 14:06:13,850][06674] Avg episode reward: [(0, '0.014')] [2024-06-27 14:06:14,813][06909] Updated weights for policy 0, policy_version 9672 (0.0032) [2024-06-27 14:06:18,664][06909] Updated weights for policy 0, policy_version 9682 (0.0037) [2024-06-27 14:06:18,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43417.6, 300 sec: 43542.5). Total num frames: 158629888. Throughput: 0: 43507.6. Samples: 61548100. Policy #0 lag: (min: 1.0, avg: 8.8, max: 21.0) [2024-06-27 14:06:18,851][06674] Avg episode reward: [(0, '0.024')] [2024-06-27 14:06:22,266][06909] Updated weights for policy 0, policy_version 9692 (0.0035) [2024-06-27 14:06:23,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 158859264. Throughput: 0: 43571.5. Samples: 61812020. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-27 14:06:23,850][06674] Avg episode reward: [(0, '0.025')] [2024-06-27 14:06:26,058][06909] Updated weights for policy 0, policy_version 9702 (0.0034) [2024-06-27 14:06:28,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43418.5, 300 sec: 43542.5). Total num frames: 159055872. Throughput: 0: 43538.6. Samples: 61946120. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-27 14:06:28,850][06674] Avg episode reward: [(0, '0.025')] [2024-06-27 14:06:29,626][06909] Updated weights for policy 0, policy_version 9712 (0.0034) [2024-06-27 14:06:33,554][06909] Updated weights for policy 0, policy_version 9722 (0.0031) [2024-06-27 14:06:33,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.8, 300 sec: 43598.1). Total num frames: 159285248. Throughput: 0: 43601.8. Samples: 62206960. Policy #0 lag: (min: 1.0, avg: 10.8, max: 21.0) [2024-06-27 14:06:33,850][06674] Avg episode reward: [(0, '0.019')] [2024-06-27 14:06:37,543][06909] Updated weights for policy 0, policy_version 9732 (0.0041) [2024-06-27 14:06:38,856][06674] Fps is (10 sec: 45847.8, 60 sec: 43413.3, 300 sec: 43597.2). Total num frames: 159514624. Throughput: 0: 43615.9. Samples: 62466860. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-27 14:06:38,856][06674] Avg episode reward: [(0, '0.015')] [2024-06-27 14:06:41,191][06909] Updated weights for policy 0, policy_version 9742 (0.0023) [2024-06-27 14:06:43,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.6, 300 sec: 43598.4). Total num frames: 159711232. Throughput: 0: 43636.9. Samples: 62597420. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-27 14:06:43,850][06674] Avg episode reward: [(0, '0.014')] [2024-06-27 14:06:44,998][06909] Updated weights for policy 0, policy_version 9752 (0.0024) [2024-06-27 14:06:48,615][06909] Updated weights for policy 0, policy_version 9762 (0.0030) [2024-06-27 14:06:48,850][06674] Fps is (10 sec: 42623.9, 60 sec: 43690.7, 300 sec: 43542.5). Total num frames: 159940608. Throughput: 0: 43648.9. Samples: 62859820. Policy #0 lag: (min: 1.0, avg: 11.1, max: 22.0) [2024-06-27 14:06:48,850][06674] Avg episode reward: [(0, '0.013')] [2024-06-27 14:06:52,527][06909] Updated weights for policy 0, policy_version 9772 (0.0047) [2024-06-27 14:06:53,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43690.7, 300 sec: 43653.7). Total num frames: 160169984. Throughput: 0: 43579.7. Samples: 63116520. Policy #0 lag: (min: 0.0, avg: 11.4, max: 22.0) [2024-06-27 14:06:53,850][06674] Avg episode reward: [(0, '0.015')] [2024-06-27 14:06:56,152][06909] Updated weights for policy 0, policy_version 9782 (0.0035) [2024-06-27 14:06:58,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 160366592. Throughput: 0: 43699.9. Samples: 63252340. Policy #0 lag: (min: 0.0, avg: 11.4, max: 22.0) [2024-06-27 14:06:58,850][06674] Avg episode reward: [(0, '0.015')] [2024-06-27 14:06:59,903][06909] Updated weights for policy 0, policy_version 9792 (0.0034) [2024-06-27 14:07:03,722][06909] Updated weights for policy 0, policy_version 9802 (0.0025) [2024-06-27 14:07:03,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43417.7, 300 sec: 43542.6). Total num frames: 160595968. Throughput: 0: 43701.9. Samples: 63514680. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 14:07:03,850][06674] Avg episode reward: [(0, '0.017')] [2024-06-27 14:07:07,305][06909] Updated weights for policy 0, policy_version 9812 (0.0044) [2024-06-27 14:07:08,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43417.6, 300 sec: 43653.6). Total num frames: 160825344. Throughput: 0: 43581.8. Samples: 63773200. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 14:07:08,850][06674] Avg episode reward: [(0, '0.019')] [2024-06-27 14:07:11,364][06909] Updated weights for policy 0, policy_version 9822 (0.0036) [2024-06-27 14:07:13,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43417.5, 300 sec: 43598.1). Total num frames: 161021952. Throughput: 0: 43632.4. Samples: 63909580. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 14:07:13,850][06674] Avg episode reward: [(0, '0.019')] [2024-06-27 14:07:14,792][06909] Updated weights for policy 0, policy_version 9832 (0.0042) [2024-06-27 14:07:18,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.7, 300 sec: 43542.6). Total num frames: 161251328. Throughput: 0: 43651.9. Samples: 64171300. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 14:07:18,851][06674] Avg episode reward: [(0, '0.019')] [2024-06-27 14:07:18,864][06909] Updated weights for policy 0, policy_version 9842 (0.0037) [2024-06-27 14:07:20,635][06887] Signal inference workers to stop experience collection... (850 times) [2024-06-27 14:07:20,694][06909] InferenceWorker_p0-w0: stopping experience collection (850 times) [2024-06-27 14:07:20,751][06887] Signal inference workers to resume experience collection... (850 times) [2024-06-27 14:07:20,751][06909] InferenceWorker_p0-w0: resuming experience collection (850 times) [2024-06-27 14:07:22,182][06909] Updated weights for policy 0, policy_version 9852 (0.0035) [2024-06-27 14:07:23,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 161480704. Throughput: 0: 43676.9. Samples: 64432060. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 14:07:23,850][06674] Avg episode reward: [(0, '0.020')] [2024-06-27 14:07:26,319][06909] Updated weights for policy 0, policy_version 9862 (0.0040) [2024-06-27 14:07:28,852][06674] Fps is (10 sec: 45866.1, 60 sec: 44235.3, 300 sec: 43654.0). Total num frames: 161710080. Throughput: 0: 43738.9. Samples: 64565760. Policy #0 lag: (min: 1.0, avg: 10.7, max: 22.0) [2024-06-27 14:07:28,853][06674] Avg episode reward: [(0, '0.022')] [2024-06-27 14:07:29,656][06909] Updated weights for policy 0, policy_version 9872 (0.0042) [2024-06-27 14:07:33,771][06909] Updated weights for policy 0, policy_version 9882 (0.0030) [2024-06-27 14:07:33,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43690.7, 300 sec: 43542.6). Total num frames: 161906688. Throughput: 0: 43726.8. Samples: 64827520. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-27 14:07:33,850][06674] Avg episode reward: [(0, '0.030')] [2024-06-27 14:07:33,892][06887] Saving new best policy, reward=0.030! [2024-06-27 14:07:37,173][06909] Updated weights for policy 0, policy_version 9892 (0.0032) [2024-06-27 14:07:38,852][06674] Fps is (10 sec: 42598.5, 60 sec: 43693.6, 300 sec: 43653.3). Total num frames: 162136064. Throughput: 0: 43606.0. Samples: 65078880. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-27 14:07:38,853][06674] Avg episode reward: [(0, '0.025')] [2024-06-27 14:07:41,320][06909] Updated weights for policy 0, policy_version 9902 (0.0033) [2024-06-27 14:07:43,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43417.6, 300 sec: 43487.0). Total num frames: 162316288. Throughput: 0: 43582.2. Samples: 65213540. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 14:07:43,851][06674] Avg episode reward: [(0, '0.020')] [2024-06-27 14:07:44,915][06909] Updated weights for policy 0, policy_version 9912 (0.0029) [2024-06-27 14:07:48,850][06674] Fps is (10 sec: 40968.7, 60 sec: 43417.7, 300 sec: 43487.0). Total num frames: 162545664. Throughput: 0: 43549.3. Samples: 65474400. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 14:07:48,850][06674] Avg episode reward: [(0, '0.021')] [2024-06-27 14:07:48,863][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000009921_162545664.pth... [2024-06-27 14:07:48,923][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000009283_152092672.pth [2024-06-27 14:07:49,091][06909] Updated weights for policy 0, policy_version 9922 (0.0034) [2024-06-27 14:07:52,411][06909] Updated weights for policy 0, policy_version 9932 (0.0030) [2024-06-27 14:07:53,850][06674] Fps is (10 sec: 47513.4, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 162791424. Throughput: 0: 43507.5. Samples: 65731040. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 14:07:53,851][06674] Avg episode reward: [(0, '0.021')] [2024-06-27 14:07:56,746][06909] Updated weights for policy 0, policy_version 9942 (0.0030) [2024-06-27 14:07:58,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43417.7, 300 sec: 43542.6). Total num frames: 162971648. Throughput: 0: 43540.6. Samples: 65868900. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-27 14:07:58,850][06674] Avg episode reward: [(0, '0.020')] [2024-06-27 14:07:59,969][06909] Updated weights for policy 0, policy_version 9952 (0.0028) [2024-06-27 14:08:03,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43417.6, 300 sec: 43542.6). Total num frames: 163201024. Throughput: 0: 43448.6. Samples: 66126480. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-27 14:08:03,850][06674] Avg episode reward: [(0, '0.025')] [2024-06-27 14:08:04,230][06909] Updated weights for policy 0, policy_version 9962 (0.0029) [2024-06-27 14:08:07,633][06909] Updated weights for policy 0, policy_version 9972 (0.0045) [2024-06-27 14:08:08,850][06674] Fps is (10 sec: 47513.4, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 163446784. Throughput: 0: 43441.0. Samples: 66386900. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 14:08:08,850][06674] Avg episode reward: [(0, '0.027')] [2024-06-27 14:08:11,785][06909] Updated weights for policy 0, policy_version 9982 (0.0034) [2024-06-27 14:08:13,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 163643392. Throughput: 0: 43476.6. Samples: 66522120. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 14:08:13,850][06674] Avg episode reward: [(0, '0.019')] [2024-06-27 14:08:15,071][06909] Updated weights for policy 0, policy_version 9992 (0.0029) [2024-06-27 14:08:18,850][06674] Fps is (10 sec: 39321.2, 60 sec: 43144.6, 300 sec: 43431.5). Total num frames: 163840000. Throughput: 0: 43316.8. Samples: 66776780. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-27 14:08:18,850][06674] Avg episode reward: [(0, '0.017')] [2024-06-27 14:08:19,487][06909] Updated weights for policy 0, policy_version 10002 (0.0026) [2024-06-27 14:08:22,544][06909] Updated weights for policy 0, policy_version 10012 (0.0029) [2024-06-27 14:08:23,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43417.5, 300 sec: 43653.6). Total num frames: 164085760. Throughput: 0: 43496.5. Samples: 67036140. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 14:08:23,850][06674] Avg episode reward: [(0, '0.017')] [2024-06-27 14:08:27,272][06909] Updated weights for policy 0, policy_version 10022 (0.0032) [2024-06-27 14:08:28,850][06674] Fps is (10 sec: 45875.9, 60 sec: 43146.1, 300 sec: 43598.1). Total num frames: 164298752. Throughput: 0: 43571.7. Samples: 67174260. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 14:08:28,850][06674] Avg episode reward: [(0, '0.022')] [2024-06-27 14:08:30,054][06909] Updated weights for policy 0, policy_version 10032 (0.0028) [2024-06-27 14:08:33,850][06674] Fps is (10 sec: 40960.8, 60 sec: 43144.5, 300 sec: 43542.6). Total num frames: 164495360. Throughput: 0: 43401.3. Samples: 67427460. Policy #0 lag: (min: 1.0, avg: 11.1, max: 23.0) [2024-06-27 14:08:33,850][06674] Avg episode reward: [(0, '0.022')] [2024-06-27 14:08:34,713][06909] Updated weights for policy 0, policy_version 10042 (0.0039) [2024-06-27 14:08:37,462][06909] Updated weights for policy 0, policy_version 10052 (0.0040) [2024-06-27 14:08:38,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43419.1, 300 sec: 43598.1). Total num frames: 164741120. Throughput: 0: 43449.9. Samples: 67686280. Policy #0 lag: (min: 0.0, avg: 12.2, max: 23.0) [2024-06-27 14:08:38,850][06674] Avg episode reward: [(0, '0.020')] [2024-06-27 14:08:42,194][06909] Updated weights for policy 0, policy_version 10062 (0.0028) [2024-06-27 14:08:43,133][06887] Signal inference workers to stop experience collection... (900 times) [2024-06-27 14:08:43,133][06887] Signal inference workers to resume experience collection... (900 times) [2024-06-27 14:08:43,174][06909] InferenceWorker_p0-w0: stopping experience collection (900 times) [2024-06-27 14:08:43,175][06909] InferenceWorker_p0-w0: resuming experience collection (900 times) [2024-06-27 14:08:43,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.8, 300 sec: 43598.1). Total num frames: 164937728. Throughput: 0: 43409.3. Samples: 67822320. Policy #0 lag: (min: 0.0, avg: 12.2, max: 23.0) [2024-06-27 14:08:43,850][06674] Avg episode reward: [(0, '0.021')] [2024-06-27 14:08:44,916][06909] Updated weights for policy 0, policy_version 10072 (0.0030) [2024-06-27 14:08:48,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43417.5, 300 sec: 43487.0). Total num frames: 165150720. Throughput: 0: 43446.6. Samples: 68081580. Policy #0 lag: (min: 0.0, avg: 11.7, max: 22.0) [2024-06-27 14:08:48,851][06674] Avg episode reward: [(0, '0.021')] [2024-06-27 14:08:49,666][06909] Updated weights for policy 0, policy_version 10082 (0.0031) [2024-06-27 14:08:52,482][06909] Updated weights for policy 0, policy_version 10092 (0.0032) [2024-06-27 14:08:53,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43417.7, 300 sec: 43653.6). Total num frames: 165396480. Throughput: 0: 43351.5. Samples: 68337720. Policy #0 lag: (min: 0.0, avg: 11.7, max: 22.0) [2024-06-27 14:08:53,850][06674] Avg episode reward: [(0, '0.024')] [2024-06-27 14:08:57,103][06909] Updated weights for policy 0, policy_version 10102 (0.0027) [2024-06-27 14:08:58,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.5, 300 sec: 43598.1). Total num frames: 165593088. Throughput: 0: 43499.1. Samples: 68479580. Policy #0 lag: (min: 0.0, avg: 10.9, max: 20.0) [2024-06-27 14:08:58,850][06674] Avg episode reward: [(0, '0.022')] [2024-06-27 14:08:59,930][06909] Updated weights for policy 0, policy_version 10112 (0.0034) [2024-06-27 14:09:03,852][06674] Fps is (10 sec: 40951.9, 60 sec: 43416.1, 300 sec: 43486.7). Total num frames: 165806080. Throughput: 0: 43610.1. Samples: 68739320. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 14:09:03,852][06674] Avg episode reward: [(0, '0.027')] [2024-06-27 14:09:04,537][06909] Updated weights for policy 0, policy_version 10122 (0.0036) [2024-06-27 14:09:07,305][06909] Updated weights for policy 0, policy_version 10132 (0.0039) [2024-06-27 14:09:08,850][06674] Fps is (10 sec: 47514.2, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 166068224. Throughput: 0: 43601.5. Samples: 68998200. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 14:09:08,850][06674] Avg episode reward: [(0, '0.028')] [2024-06-27 14:09:11,913][06909] Updated weights for policy 0, policy_version 10142 (0.0038) [2024-06-27 14:09:13,850][06674] Fps is (10 sec: 42607.1, 60 sec: 43144.6, 300 sec: 43542.7). Total num frames: 166232064. Throughput: 0: 43606.6. Samples: 69136560. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 14:09:13,850][06674] Avg episode reward: [(0, '0.021')] [2024-06-27 14:09:14,914][06909] Updated weights for policy 0, policy_version 10152 (0.0037) [2024-06-27 14:09:18,850][06674] Fps is (10 sec: 39321.4, 60 sec: 43690.7, 300 sec: 43487.0). Total num frames: 166461440. Throughput: 0: 43745.7. Samples: 69396020. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 14:09:18,850][06674] Avg episode reward: [(0, '0.021')] [2024-06-27 14:09:19,301][06909] Updated weights for policy 0, policy_version 10162 (0.0022) [2024-06-27 14:09:22,668][06909] Updated weights for policy 0, policy_version 10172 (0.0043) [2024-06-27 14:09:23,850][06674] Fps is (10 sec: 49151.6, 60 sec: 43963.8, 300 sec: 43764.7). Total num frames: 166723584. Throughput: 0: 43677.2. Samples: 69651760. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-27 14:09:23,850][06674] Avg episode reward: [(0, '0.021')] [2024-06-27 14:09:26,720][06909] Updated weights for policy 0, policy_version 10182 (0.0037) [2024-06-27 14:09:28,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43144.5, 300 sec: 43542.6). Total num frames: 166887424. Throughput: 0: 43798.2. Samples: 69793240. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-27 14:09:28,850][06674] Avg episode reward: [(0, '0.021')] [2024-06-27 14:09:30,211][06909] Updated weights for policy 0, policy_version 10192 (0.0020) [2024-06-27 14:09:33,850][06674] Fps is (10 sec: 39321.6, 60 sec: 43690.6, 300 sec: 43431.5). Total num frames: 167116800. Throughput: 0: 43636.9. Samples: 70045240. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-27 14:09:33,850][06674] Avg episode reward: [(0, '0.021')] [2024-06-27 14:09:34,170][06909] Updated weights for policy 0, policy_version 10202 (0.0037) [2024-06-27 14:09:37,877][06909] Updated weights for policy 0, policy_version 10212 (0.0048) [2024-06-27 14:09:38,850][06674] Fps is (10 sec: 49151.3, 60 sec: 43963.6, 300 sec: 43764.7). Total num frames: 167378944. Throughput: 0: 43703.1. Samples: 70304360. Policy #0 lag: (min: 1.0, avg: 9.3, max: 20.0) [2024-06-27 14:09:38,850][06674] Avg episode reward: [(0, '0.021')] [2024-06-27 14:09:41,666][06909] Updated weights for policy 0, policy_version 10222 (0.0036) [2024-06-27 14:09:43,739][06887] Signal inference workers to stop experience collection... (950 times) [2024-06-27 14:09:43,740][06887] Signal inference workers to resume experience collection... (950 times) [2024-06-27 14:09:43,755][06909] InferenceWorker_p0-w0: stopping experience collection (950 times) [2024-06-27 14:09:43,756][06909] InferenceWorker_p0-w0: resuming experience collection (950 times) [2024-06-27 14:09:43,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43417.6, 300 sec: 43542.6). Total num frames: 167542784. Throughput: 0: 43654.4. Samples: 70444020. Policy #0 lag: (min: 1.0, avg: 9.3, max: 20.0) [2024-06-27 14:09:43,850][06674] Avg episode reward: [(0, '0.021')] [2024-06-27 14:09:45,426][06909] Updated weights for policy 0, policy_version 10232 (0.0045) [2024-06-27 14:09:48,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43963.7, 300 sec: 43487.0). Total num frames: 167788544. Throughput: 0: 43627.7. Samples: 70702480. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 14:09:48,850][06674] Avg episode reward: [(0, '0.024')] [2024-06-27 14:09:48,858][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000010241_167788544.pth... [2024-06-27 14:09:48,905][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000009603_157335552.pth [2024-06-27 14:09:49,066][06909] Updated weights for policy 0, policy_version 10242 (0.0036) [2024-06-27 14:09:53,061][06909] Updated weights for policy 0, policy_version 10252 (0.0039) [2024-06-27 14:09:53,850][06674] Fps is (10 sec: 47513.5, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 168017920. Throughput: 0: 43644.9. Samples: 70962220. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 14:09:53,850][06674] Avg episode reward: [(0, '0.022')] [2024-06-27 14:09:56,521][06909] Updated weights for policy 0, policy_version 10262 (0.0032) [2024-06-27 14:09:58,850][06674] Fps is (10 sec: 40960.6, 60 sec: 43417.7, 300 sec: 43542.6). Total num frames: 168198144. Throughput: 0: 43578.3. Samples: 71097580. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 14:09:58,850][06674] Avg episode reward: [(0, '0.021')] [2024-06-27 14:10:00,445][06909] Updated weights for policy 0, policy_version 10272 (0.0026) [2024-06-27 14:10:03,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43692.1, 300 sec: 43487.0). Total num frames: 168427520. Throughput: 0: 43539.1. Samples: 71355280. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-27 14:10:03,851][06674] Avg episode reward: [(0, '0.026')] [2024-06-27 14:10:04,482][06909] Updated weights for policy 0, policy_version 10282 (0.0032) [2024-06-27 14:10:07,870][06909] Updated weights for policy 0, policy_version 10292 (0.0039) [2024-06-27 14:10:08,850][06674] Fps is (10 sec: 47512.8, 60 sec: 43417.5, 300 sec: 43653.6). Total num frames: 168673280. Throughput: 0: 43613.3. Samples: 71614360. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-27 14:10:08,850][06674] Avg episode reward: [(0, '0.024')] [2024-06-27 14:10:11,971][06909] Updated weights for policy 0, policy_version 10302 (0.0030) [2024-06-27 14:10:13,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.6, 300 sec: 43542.6). Total num frames: 168869888. Throughput: 0: 43530.5. Samples: 71752120. Policy #0 lag: (min: 0.0, avg: 12.3, max: 23.0) [2024-06-27 14:10:13,850][06674] Avg episode reward: [(0, '0.023')] [2024-06-27 14:10:15,273][06909] Updated weights for policy 0, policy_version 10312 (0.0039) [2024-06-27 14:10:18,852][06674] Fps is (10 sec: 40951.9, 60 sec: 43689.2, 300 sec: 43486.7). Total num frames: 169082880. Throughput: 0: 43628.7. Samples: 72008620. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-27 14:10:18,852][06674] Avg episode reward: [(0, '0.026')] [2024-06-27 14:10:19,378][06909] Updated weights for policy 0, policy_version 10322 (0.0038) [2024-06-27 14:10:22,859][06909] Updated weights for policy 0, policy_version 10332 (0.0034) [2024-06-27 14:10:23,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43417.5, 300 sec: 43653.8). Total num frames: 169328640. Throughput: 0: 43751.1. Samples: 72273160. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-27 14:10:23,850][06674] Avg episode reward: [(0, '0.022')] [2024-06-27 14:10:26,671][06909] Updated weights for policy 0, policy_version 10342 (0.0027) [2024-06-27 14:10:28,856][06674] Fps is (10 sec: 44219.1, 60 sec: 43959.3, 300 sec: 43597.2). Total num frames: 169525248. Throughput: 0: 43661.6. Samples: 72409060. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-27 14:10:28,865][06674] Avg episode reward: [(0, '0.019')] [2024-06-27 14:10:30,235][06909] Updated weights for policy 0, policy_version 10352 (0.0029) [2024-06-27 14:10:33,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43690.7, 300 sec: 43487.0). Total num frames: 169738240. Throughput: 0: 43601.4. Samples: 72664540. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-27 14:10:33,850][06674] Avg episode reward: [(0, '0.021')] [2024-06-27 14:10:34,175][06909] Updated weights for policy 0, policy_version 10362 (0.0036) [2024-06-27 14:10:38,158][06909] Updated weights for policy 0, policy_version 10372 (0.0046) [2024-06-27 14:10:38,850][06674] Fps is (10 sec: 44263.7, 60 sec: 43144.6, 300 sec: 43653.6). Total num frames: 169967616. Throughput: 0: 43642.2. Samples: 72926120. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 14:10:38,850][06674] Avg episode reward: [(0, '0.032')] [2024-06-27 14:10:38,889][06887] Saving new best policy, reward=0.032! [2024-06-27 14:10:40,231][06887] Signal inference workers to stop experience collection... (1000 times) [2024-06-27 14:10:40,259][06909] InferenceWorker_p0-w0: stopping experience collection (1000 times) [2024-06-27 14:10:40,297][06887] Signal inference workers to resume experience collection... (1000 times) [2024-06-27 14:10:40,297][06909] InferenceWorker_p0-w0: resuming experience collection (1000 times) [2024-06-27 14:10:41,501][06909] Updated weights for policy 0, policy_version 10382 (0.0040) [2024-06-27 14:10:43,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 43598.1). Total num frames: 170180608. Throughput: 0: 43556.4. Samples: 73057620. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-27 14:10:43,850][06674] Avg episode reward: [(0, '0.018')] [2024-06-27 14:10:45,445][06909] Updated weights for policy 0, policy_version 10392 (0.0043) [2024-06-27 14:10:48,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43417.6, 300 sec: 43542.6). Total num frames: 170393600. Throughput: 0: 43646.2. Samples: 73319360. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-27 14:10:48,850][06674] Avg episode reward: [(0, '0.023')] [2024-06-27 14:10:49,167][06909] Updated weights for policy 0, policy_version 10402 (0.0039) [2024-06-27 14:10:52,893][06909] Updated weights for policy 0, policy_version 10412 (0.0033) [2024-06-27 14:10:53,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 170622976. Throughput: 0: 43772.1. Samples: 73584100. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-27 14:10:53,850][06674] Avg episode reward: [(0, '0.028')] [2024-06-27 14:10:56,552][06909] Updated weights for policy 0, policy_version 10422 (0.0040) [2024-06-27 14:10:58,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43963.7, 300 sec: 43542.6). Total num frames: 170835968. Throughput: 0: 43609.0. Samples: 73714520. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-27 14:10:58,850][06674] Avg episode reward: [(0, '0.021')] [2024-06-27 14:11:00,329][06909] Updated weights for policy 0, policy_version 10432 (0.0042) [2024-06-27 14:11:03,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 43542.6). Total num frames: 171065344. Throughput: 0: 43745.9. Samples: 73977100. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-27 14:11:03,850][06674] Avg episode reward: [(0, '0.020')] [2024-06-27 14:11:03,983][06909] Updated weights for policy 0, policy_version 10442 (0.0025) [2024-06-27 14:11:07,764][06909] Updated weights for policy 0, policy_version 10452 (0.0032) [2024-06-27 14:11:08,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 171278336. Throughput: 0: 43662.3. Samples: 74237960. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 14:11:08,850][06674] Avg episode reward: [(0, '0.026')] [2024-06-27 14:11:11,597][06909] Updated weights for policy 0, policy_version 10462 (0.0028) [2024-06-27 14:11:13,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43690.8, 300 sec: 43598.1). Total num frames: 171491328. Throughput: 0: 43541.9. Samples: 74368180. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 14:11:13,850][06674] Avg episode reward: [(0, '0.022')] [2024-06-27 14:11:15,101][06909] Updated weights for policy 0, policy_version 10472 (0.0024) [2024-06-27 14:11:18,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43965.2, 300 sec: 43598.1). Total num frames: 171720704. Throughput: 0: 43806.6. Samples: 74635840. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 14:11:18,850][06674] Avg episode reward: [(0, '0.021')] [2024-06-27 14:11:19,226][06909] Updated weights for policy 0, policy_version 10482 (0.0032) [2024-06-27 14:11:22,661][06909] Updated weights for policy 0, policy_version 10492 (0.0038) [2024-06-27 14:11:23,851][06674] Fps is (10 sec: 44233.3, 60 sec: 43417.2, 300 sec: 43653.5). Total num frames: 171933696. Throughput: 0: 43749.1. Samples: 74894860. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 14:11:23,851][06674] Avg episode reward: [(0, '0.021')] [2024-06-27 14:11:26,583][06909] Updated weights for policy 0, policy_version 10502 (0.0032) [2024-06-27 14:11:28,851][06674] Fps is (10 sec: 42591.8, 60 sec: 43693.9, 300 sec: 43597.9). Total num frames: 172146688. Throughput: 0: 43815.8. Samples: 75029400. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-27 14:11:28,852][06674] Avg episode reward: [(0, '0.029')] [2024-06-27 14:11:30,268][06909] Updated weights for policy 0, policy_version 10512 (0.0044) [2024-06-27 14:11:33,850][06674] Fps is (10 sec: 44240.2, 60 sec: 43963.8, 300 sec: 43599.0). Total num frames: 172376064. Throughput: 0: 43825.5. Samples: 75291500. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-27 14:11:33,850][06674] Avg episode reward: [(0, '0.030')] [2024-06-27 14:11:33,944][06909] Updated weights for policy 0, policy_version 10522 (0.0027) [2024-06-27 14:11:37,530][06909] Updated weights for policy 0, policy_version 10532 (0.0022) [2024-06-27 14:11:38,850][06674] Fps is (10 sec: 45881.4, 60 sec: 43963.6, 300 sec: 43709.2). Total num frames: 172605440. Throughput: 0: 43766.9. Samples: 75553620. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-27 14:11:38,850][06674] Avg episode reward: [(0, '0.032')] [2024-06-27 14:11:41,830][06909] Updated weights for policy 0, policy_version 10542 (0.0025) [2024-06-27 14:11:43,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.7, 300 sec: 43653.6). Total num frames: 172818432. Throughput: 0: 43972.4. Samples: 75693280. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-27 14:11:43,850][06674] Avg episode reward: [(0, '0.030')] [2024-06-27 14:11:44,878][06909] Updated weights for policy 0, policy_version 10552 (0.0046) [2024-06-27 14:11:48,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43690.6, 300 sec: 43542.5). Total num frames: 173015040. Throughput: 0: 44031.0. Samples: 75958500. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-27 14:11:48,850][06674] Avg episode reward: [(0, '0.029')] [2024-06-27 14:11:48,874][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000010560_173015040.pth... [2024-06-27 14:11:48,933][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000009921_162545664.pth [2024-06-27 14:11:49,311][06909] Updated weights for policy 0, policy_version 10562 (0.0039) [2024-06-27 14:11:52,527][06909] Updated weights for policy 0, policy_version 10572 (0.0042) [2024-06-27 14:11:53,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.8, 300 sec: 43764.7). Total num frames: 173277184. Throughput: 0: 43947.6. Samples: 76215600. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 14:11:53,850][06674] Avg episode reward: [(0, '0.031')] [2024-06-27 14:11:56,960][06909] Updated weights for policy 0, policy_version 10582 (0.0037) [2024-06-27 14:11:58,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43963.7, 300 sec: 43653.6). Total num frames: 173473792. Throughput: 0: 44205.6. Samples: 76357440. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-27 14:11:58,851][06674] Avg episode reward: [(0, '0.034')] [2024-06-27 14:11:58,860][06887] Saving new best policy, reward=0.034! [2024-06-27 14:12:00,021][06909] Updated weights for policy 0, policy_version 10592 (0.0036) [2024-06-27 14:12:03,209][06887] Signal inference workers to stop experience collection... (1050 times) [2024-06-27 14:12:03,244][06909] InferenceWorker_p0-w0: stopping experience collection (1050 times) [2024-06-27 14:12:03,266][06887] Signal inference workers to resume experience collection... (1050 times) [2024-06-27 14:12:03,266][06909] InferenceWorker_p0-w0: resuming experience collection (1050 times) [2024-06-27 14:12:03,850][06674] Fps is (10 sec: 39321.6, 60 sec: 43417.6, 300 sec: 43542.6). Total num frames: 173670400. Throughput: 0: 43986.2. Samples: 76615220. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-27 14:12:03,859][06674] Avg episode reward: [(0, '0.029')] [2024-06-27 14:12:04,346][06909] Updated weights for policy 0, policy_version 10602 (0.0040) [2024-06-27 14:12:07,452][06909] Updated weights for policy 0, policy_version 10612 (0.0032) [2024-06-27 14:12:08,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 173916160. Throughput: 0: 43951.6. Samples: 76872660. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 14:12:08,850][06674] Avg episode reward: [(0, '0.029')] [2024-06-27 14:12:11,685][06909] Updated weights for policy 0, policy_version 10622 (0.0031) [2024-06-27 14:12:13,850][06674] Fps is (10 sec: 47513.7, 60 sec: 44236.7, 300 sec: 43709.2). Total num frames: 174145536. Throughput: 0: 44230.0. Samples: 77019680. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 14:12:13,850][06674] Avg episode reward: [(0, '0.030')] [2024-06-27 14:12:14,848][06909] Updated weights for policy 0, policy_version 10632 (0.0040) [2024-06-27 14:12:18,850][06674] Fps is (10 sec: 40961.0, 60 sec: 43417.7, 300 sec: 43542.6). Total num frames: 174325760. Throughput: 0: 44113.8. Samples: 77276620. Policy #0 lag: (min: 0.0, avg: 11.7, max: 21.0) [2024-06-27 14:12:18,850][06674] Avg episode reward: [(0, '0.031')] [2024-06-27 14:12:19,010][06909] Updated weights for policy 0, policy_version 10642 (0.0025) [2024-06-27 14:12:22,271][06909] Updated weights for policy 0, policy_version 10652 (0.0029) [2024-06-27 14:12:23,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44237.3, 300 sec: 43653.9). Total num frames: 174587904. Throughput: 0: 44022.8. Samples: 77534640. Policy #0 lag: (min: 0.0, avg: 11.7, max: 21.0) [2024-06-27 14:12:23,850][06674] Avg episode reward: [(0, '0.033')] [2024-06-27 14:12:26,624][06909] Updated weights for policy 0, policy_version 10662 (0.0037) [2024-06-27 14:12:28,851][06674] Fps is (10 sec: 47509.1, 60 sec: 44237.3, 300 sec: 43709.0). Total num frames: 174800896. Throughput: 0: 44167.2. Samples: 77680840. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 14:12:28,851][06674] Avg episode reward: [(0, '0.038')] [2024-06-27 14:12:28,994][06887] Saving new best policy, reward=0.038! [2024-06-27 14:12:29,558][06909] Updated weights for policy 0, policy_version 10672 (0.0038) [2024-06-27 14:12:33,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43690.6, 300 sec: 43598.4). Total num frames: 174997504. Throughput: 0: 43946.4. Samples: 77936080. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 14:12:33,850][06674] Avg episode reward: [(0, '0.031')] [2024-06-27 14:12:34,065][06909] Updated weights for policy 0, policy_version 10682 (0.0027) [2024-06-27 14:12:36,997][06909] Updated weights for policy 0, policy_version 10692 (0.0040) [2024-06-27 14:12:38,850][06674] Fps is (10 sec: 44240.5, 60 sec: 43963.9, 300 sec: 43820.3). Total num frames: 175243264. Throughput: 0: 43992.0. Samples: 78195240. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 14:12:38,850][06674] Avg episode reward: [(0, '0.024')] [2024-06-27 14:12:41,491][06909] Updated weights for policy 0, policy_version 10702 (0.0037) [2024-06-27 14:12:43,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 175439872. Throughput: 0: 43955.1. Samples: 78335420. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 14:12:43,850][06674] Avg episode reward: [(0, '0.023')] [2024-06-27 14:12:44,526][06909] Updated weights for policy 0, policy_version 10712 (0.0038) [2024-06-27 14:12:48,838][06909] Updated weights for policy 0, policy_version 10722 (0.0027) [2024-06-27 14:12:48,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44236.9, 300 sec: 43653.7). Total num frames: 175669248. Throughput: 0: 43945.4. Samples: 78592760. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 14:12:48,850][06674] Avg episode reward: [(0, '0.030')] [2024-06-27 14:12:52,231][06909] Updated weights for policy 0, policy_version 10732 (0.0033) [2024-06-27 14:12:53,851][06674] Fps is (10 sec: 45870.6, 60 sec: 43689.9, 300 sec: 43820.1). Total num frames: 175898624. Throughput: 0: 44087.1. Samples: 78856620. Policy #0 lag: (min: 1.0, avg: 10.4, max: 21.0) [2024-06-27 14:12:53,852][06674] Avg episode reward: [(0, '0.032')] [2024-06-27 14:12:56,156][06909] Updated weights for policy 0, policy_version 10742 (0.0034) [2024-06-27 14:12:58,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 176095232. Throughput: 0: 43798.7. Samples: 78990620. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 14:12:58,850][06674] Avg episode reward: [(0, '0.030')] [2024-06-27 14:12:59,441][06909] Updated weights for policy 0, policy_version 10752 (0.0032) [2024-06-27 14:13:03,715][06909] Updated weights for policy 0, policy_version 10762 (0.0033) [2024-06-27 14:13:03,850][06674] Fps is (10 sec: 42602.8, 60 sec: 44236.8, 300 sec: 43653.6). Total num frames: 176324608. Throughput: 0: 43905.2. Samples: 79252360. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 14:13:03,850][06674] Avg episode reward: [(0, '0.032')] [2024-06-27 14:13:07,058][06909] Updated weights for policy 0, policy_version 10772 (0.0024) [2024-06-27 14:13:08,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.8, 300 sec: 43764.7). Total num frames: 176553984. Throughput: 0: 43831.5. Samples: 79507060. Policy #0 lag: (min: 0.0, avg: 11.7, max: 24.0) [2024-06-27 14:13:08,850][06674] Avg episode reward: [(0, '0.022')] [2024-06-27 14:13:11,317][06909] Updated weights for policy 0, policy_version 10782 (0.0031) [2024-06-27 14:13:13,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 176766976. Throughput: 0: 43639.5. Samples: 79644580. Policy #0 lag: (min: 0.0, avg: 11.7, max: 24.0) [2024-06-27 14:13:13,859][06674] Avg episode reward: [(0, '0.029')] [2024-06-27 14:13:14,509][06909] Updated weights for policy 0, policy_version 10792 (0.0037) [2024-06-27 14:13:18,784][06909] Updated weights for policy 0, policy_version 10802 (0.0045) [2024-06-27 14:13:18,850][06674] Fps is (10 sec: 42598.2, 60 sec: 44236.7, 300 sec: 43709.2). Total num frames: 176979968. Throughput: 0: 43815.4. Samples: 79907780. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 14:13:18,850][06674] Avg episode reward: [(0, '0.029')] [2024-06-27 14:13:21,838][06909] Updated weights for policy 0, policy_version 10812 (0.0025) [2024-06-27 14:13:23,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.8, 300 sec: 43820.2). Total num frames: 177225728. Throughput: 0: 43859.6. Samples: 80168920. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 14:13:23,850][06674] Avg episode reward: [(0, '0.029')] [2024-06-27 14:13:26,235][06909] Updated weights for policy 0, policy_version 10822 (0.0022) [2024-06-27 14:13:28,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43964.3, 300 sec: 43875.8). Total num frames: 177438720. Throughput: 0: 43764.4. Samples: 80304820. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-27 14:13:28,850][06674] Avg episode reward: [(0, '0.020')] [2024-06-27 14:13:29,180][06909] Updated weights for policy 0, policy_version 10832 (0.0042) [2024-06-27 14:13:33,537][06909] Updated weights for policy 0, policy_version 10842 (0.0038) [2024-06-27 14:13:33,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43963.8, 300 sec: 43709.2). Total num frames: 177635328. Throughput: 0: 43820.9. Samples: 80564700. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-27 14:13:33,850][06674] Avg episode reward: [(0, '0.017')] [2024-06-27 14:13:35,958][06887] Signal inference workers to stop experience collection... (1100 times) [2024-06-27 14:13:36,012][06909] InferenceWorker_p0-w0: stopping experience collection (1100 times) [2024-06-27 14:13:36,016][06887] Signal inference workers to resume experience collection... (1100 times) [2024-06-27 14:13:36,022][06909] InferenceWorker_p0-w0: resuming experience collection (1100 times) [2024-06-27 14:13:36,562][06909] Updated weights for policy 0, policy_version 10852 (0.0033) [2024-06-27 14:13:38,850][06674] Fps is (10 sec: 44237.6, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 177881088. Throughput: 0: 43881.1. Samples: 80831220. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-27 14:13:38,850][06674] Avg episode reward: [(0, '0.022')] [2024-06-27 14:13:40,879][06909] Updated weights for policy 0, policy_version 10862 (0.0031) [2024-06-27 14:13:43,853][06674] Fps is (10 sec: 45861.2, 60 sec: 44234.6, 300 sec: 43875.4). Total num frames: 178094080. Throughput: 0: 43865.1. Samples: 80964680. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-27 14:13:43,853][06674] Avg episode reward: [(0, '0.023')] [2024-06-27 14:13:44,109][06909] Updated weights for policy 0, policy_version 10872 (0.0025) [2024-06-27 14:13:48,243][06909] Updated weights for policy 0, policy_version 10882 (0.0035) [2024-06-27 14:13:48,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43963.7, 300 sec: 43764.7). Total num frames: 178307072. Throughput: 0: 43939.1. Samples: 81229620. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-27 14:13:48,850][06674] Avg episode reward: [(0, '0.023')] [2024-06-27 14:13:48,863][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000010883_178307072.pth... [2024-06-27 14:13:48,918][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000010241_167788544.pth [2024-06-27 14:13:51,532][06909] Updated weights for policy 0, policy_version 10892 (0.0029) [2024-06-27 14:13:53,850][06674] Fps is (10 sec: 44250.1, 60 sec: 43964.5, 300 sec: 43875.8). Total num frames: 178536448. Throughput: 0: 44052.5. Samples: 81489420. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 14:13:53,850][06674] Avg episode reward: [(0, '0.026')] [2024-06-27 14:13:56,000][06909] Updated weights for policy 0, policy_version 10902 (0.0038) [2024-06-27 14:13:58,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.8, 300 sec: 43876.1). Total num frames: 178749440. Throughput: 0: 44040.0. Samples: 81626380. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 14:13:58,850][06674] Avg episode reward: [(0, '0.028')] [2024-06-27 14:13:59,050][06909] Updated weights for policy 0, policy_version 10912 (0.0033) [2024-06-27 14:14:03,386][06909] Updated weights for policy 0, policy_version 10922 (0.0027) [2024-06-27 14:14:03,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 178962432. Throughput: 0: 44013.9. Samples: 81888400. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-27 14:14:03,850][06674] Avg episode reward: [(0, '0.028')] [2024-06-27 14:14:06,763][06909] Updated weights for policy 0, policy_version 10932 (0.0041) [2024-06-27 14:14:08,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 179191808. Throughput: 0: 44041.3. Samples: 82150780. Policy #0 lag: (min: 0.0, avg: 11.8, max: 25.0) [2024-06-27 14:14:08,850][06674] Avg episode reward: [(0, '0.027')] [2024-06-27 14:14:10,786][06909] Updated weights for policy 0, policy_version 10942 (0.0037) [2024-06-27 14:14:13,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.6, 300 sec: 43875.8). Total num frames: 179404800. Throughput: 0: 44062.7. Samples: 82287640. Policy #0 lag: (min: 0.0, avg: 11.8, max: 25.0) [2024-06-27 14:14:13,851][06674] Avg episode reward: [(0, '0.026')] [2024-06-27 14:14:14,151][06909] Updated weights for policy 0, policy_version 10952 (0.0030) [2024-06-27 14:14:18,055][06909] Updated weights for policy 0, policy_version 10962 (0.0034) [2024-06-27 14:14:18,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43963.8, 300 sec: 43709.2). Total num frames: 179617792. Throughput: 0: 44215.9. Samples: 82554420. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 14:14:18,850][06674] Avg episode reward: [(0, '0.028')] [2024-06-27 14:14:21,450][06909] Updated weights for policy 0, policy_version 10972 (0.0051) [2024-06-27 14:14:23,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 179847168. Throughput: 0: 44007.9. Samples: 82811580. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 14:14:23,850][06674] Avg episode reward: [(0, '0.028')] [2024-06-27 14:14:25,408][06909] Updated weights for policy 0, policy_version 10982 (0.0033) [2024-06-27 14:14:28,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 180076544. Throughput: 0: 44197.6. Samples: 82953440. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 14:14:28,850][06674] Avg episode reward: [(0, '0.028')] [2024-06-27 14:14:28,917][06909] Updated weights for policy 0, policy_version 10992 (0.0039) [2024-06-27 14:14:32,882][06909] Updated weights for policy 0, policy_version 11002 (0.0049) [2024-06-27 14:14:33,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44236.7, 300 sec: 43764.7). Total num frames: 180289536. Throughput: 0: 44182.6. Samples: 83217840. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 14:14:33,851][06674] Avg episode reward: [(0, '0.021')] [2024-06-27 14:14:36,299][06909] Updated weights for policy 0, policy_version 11012 (0.0035) [2024-06-27 14:14:38,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 180518912. Throughput: 0: 44196.9. Samples: 83478280. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 14:14:38,850][06674] Avg episode reward: [(0, '0.033')] [2024-06-27 14:14:40,474][06909] Updated weights for policy 0, policy_version 11022 (0.0042) [2024-06-27 14:14:43,734][06909] Updated weights for policy 0, policy_version 11032 (0.0023) [2024-06-27 14:14:43,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44239.0, 300 sec: 43931.3). Total num frames: 180748288. Throughput: 0: 44196.4. Samples: 83615220. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-27 14:14:43,850][06674] Avg episode reward: [(0, '0.038')] [2024-06-27 14:14:47,887][06909] Updated weights for policy 0, policy_version 11042 (0.0033) [2024-06-27 14:14:48,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.7, 300 sec: 43820.2). Total num frames: 180944896. Throughput: 0: 44395.1. Samples: 83886180. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-27 14:14:48,850][06674] Avg episode reward: [(0, '0.032')] [2024-06-27 14:14:50,924][06909] Updated weights for policy 0, policy_version 11052 (0.0038) [2024-06-27 14:14:53,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 181174272. Throughput: 0: 44277.8. Samples: 84143280. Policy #0 lag: (min: 1.0, avg: 11.9, max: 22.0) [2024-06-27 14:14:53,850][06674] Avg episode reward: [(0, '0.016')] [2024-06-27 14:14:55,167][06909] Updated weights for policy 0, policy_version 11062 (0.0024) [2024-06-27 14:14:58,575][06909] Updated weights for policy 0, policy_version 11072 (0.0024) [2024-06-27 14:14:58,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 181403648. Throughput: 0: 44295.7. Samples: 84280940. Policy #0 lag: (min: 1.0, avg: 11.9, max: 22.0) [2024-06-27 14:14:58,850][06674] Avg episode reward: [(0, '0.017')] [2024-06-27 14:15:02,554][06909] Updated weights for policy 0, policy_version 11082 (0.0041) [2024-06-27 14:15:03,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.8, 300 sec: 43820.3). Total num frames: 181600256. Throughput: 0: 44134.7. Samples: 84540480. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-27 14:15:03,850][06674] Avg episode reward: [(0, '0.020')] [2024-06-27 14:15:06,008][06909] Updated weights for policy 0, policy_version 11092 (0.0027) [2024-06-27 14:15:08,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 181829632. Throughput: 0: 44186.6. Samples: 84799980. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-27 14:15:08,850][06674] Avg episode reward: [(0, '0.032')] [2024-06-27 14:15:09,815][06909] Updated weights for policy 0, policy_version 11102 (0.0035) [2024-06-27 14:15:13,669][06909] Updated weights for policy 0, policy_version 11112 (0.0031) [2024-06-27 14:15:13,850][06674] Fps is (10 sec: 45874.3, 60 sec: 44236.7, 300 sec: 43987.2). Total num frames: 182059008. Throughput: 0: 44115.9. Samples: 84938660. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 14:15:13,851][06674] Avg episode reward: [(0, '0.030')] [2024-06-27 14:15:17,075][06909] Updated weights for policy 0, policy_version 11122 (0.0034) [2024-06-27 14:15:18,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 43875.8). Total num frames: 182272000. Throughput: 0: 44054.7. Samples: 85200300. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-27 14:15:18,851][06674] Avg episode reward: [(0, '0.030')] [2024-06-27 14:15:19,205][06887] Signal inference workers to stop experience collection... (1150 times) [2024-06-27 14:15:19,252][06909] InferenceWorker_p0-w0: stopping experience collection (1150 times) [2024-06-27 14:15:19,260][06887] Signal inference workers to resume experience collection... (1150 times) [2024-06-27 14:15:19,276][06909] InferenceWorker_p0-w0: resuming experience collection (1150 times) [2024-06-27 14:15:21,049][06909] Updated weights for policy 0, policy_version 11132 (0.0029) [2024-06-27 14:15:23,850][06674] Fps is (10 sec: 44237.9, 60 sec: 44236.8, 300 sec: 43987.8). Total num frames: 182501376. Throughput: 0: 44210.7. Samples: 85467760. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-27 14:15:23,850][06674] Avg episode reward: [(0, '0.025')] [2024-06-27 14:15:24,593][06909] Updated weights for policy 0, policy_version 11142 (0.0036) [2024-06-27 14:15:28,407][06909] Updated weights for policy 0, policy_version 11152 (0.0041) [2024-06-27 14:15:28,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 182730752. Throughput: 0: 44272.5. Samples: 85607480. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-27 14:15:28,850][06674] Avg episode reward: [(0, '0.024')] [2024-06-27 14:15:31,953][06909] Updated weights for policy 0, policy_version 11162 (0.0037) [2024-06-27 14:15:33,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 182927360. Throughput: 0: 43957.8. Samples: 85864280. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-27 14:15:33,850][06674] Avg episode reward: [(0, '0.028')] [2024-06-27 14:15:35,750][06909] Updated weights for policy 0, policy_version 11172 (0.0028) [2024-06-27 14:15:38,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 183156736. Throughput: 0: 44206.7. Samples: 86132580. Policy #0 lag: (min: 1.0, avg: 11.0, max: 21.0) [2024-06-27 14:15:38,850][06674] Avg episode reward: [(0, '0.030')] [2024-06-27 14:15:39,282][06909] Updated weights for policy 0, policy_version 11182 (0.0029) [2024-06-27 14:15:43,176][06909] Updated weights for policy 0, policy_version 11192 (0.0029) [2024-06-27 14:15:43,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 183386112. Throughput: 0: 44148.4. Samples: 86267620. Policy #0 lag: (min: 1.0, avg: 11.0, max: 21.0) [2024-06-27 14:15:43,850][06674] Avg episode reward: [(0, '0.033')] [2024-06-27 14:15:46,729][06909] Updated weights for policy 0, policy_version 11202 (0.0036) [2024-06-27 14:15:48,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 183599104. Throughput: 0: 44109.4. Samples: 86525400. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 14:15:48,850][06674] Avg episode reward: [(0, '0.034')] [2024-06-27 14:15:48,864][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000011206_183599104.pth... [2024-06-27 14:15:48,919][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000010560_173015040.pth [2024-06-27 14:15:50,468][06909] Updated weights for policy 0, policy_version 11212 (0.0035) [2024-06-27 14:15:53,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 183828480. Throughput: 0: 44176.9. Samples: 86787940. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2024-06-27 14:15:53,850][06674] Avg episode reward: [(0, '0.040')] [2024-06-27 14:15:53,975][06887] Saving new best policy, reward=0.040! [2024-06-27 14:15:54,459][06909] Updated weights for policy 0, policy_version 11222 (0.0027) [2024-06-27 14:15:58,168][06909] Updated weights for policy 0, policy_version 11232 (0.0021) [2024-06-27 14:15:58,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 184057856. Throughput: 0: 44169.9. Samples: 86926300. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2024-06-27 14:15:58,850][06674] Avg episode reward: [(0, '0.028')] [2024-06-27 14:16:01,764][06909] Updated weights for policy 0, policy_version 11242 (0.0028) [2024-06-27 14:16:03,850][06674] Fps is (10 sec: 42598.8, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 184254464. Throughput: 0: 44153.0. Samples: 87187180. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 14:16:03,850][06674] Avg episode reward: [(0, '0.032')] [2024-06-27 14:16:05,526][06909] Updated weights for policy 0, policy_version 11252 (0.0020) [2024-06-27 14:16:08,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44509.9, 300 sec: 44097.9). Total num frames: 184500224. Throughput: 0: 44027.9. Samples: 87449020. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 14:16:08,850][06674] Avg episode reward: [(0, '0.029')] [2024-06-27 14:16:09,224][06909] Updated weights for policy 0, policy_version 11262 (0.0032) [2024-06-27 14:16:12,949][06909] Updated weights for policy 0, policy_version 11272 (0.0035) [2024-06-27 14:16:13,850][06674] Fps is (10 sec: 45874.6, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 184713216. Throughput: 0: 43806.1. Samples: 87578760. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 14:16:13,850][06674] Avg episode reward: [(0, '0.029')] [2024-06-27 14:16:16,624][06909] Updated weights for policy 0, policy_version 11282 (0.0042) [2024-06-27 14:16:18,850][06674] Fps is (10 sec: 42598.8, 60 sec: 44236.9, 300 sec: 44042.5). Total num frames: 184926208. Throughput: 0: 43993.4. Samples: 87843980. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 14:16:18,850][06674] Avg episode reward: [(0, '0.031')] [2024-06-27 14:16:20,476][06909] Updated weights for policy 0, policy_version 11292 (0.0026) [2024-06-27 14:16:23,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43963.7, 300 sec: 44042.7). Total num frames: 185139200. Throughput: 0: 43861.7. Samples: 88106360. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 14:16:23,850][06674] Avg episode reward: [(0, '0.031')] [2024-06-27 14:16:24,365][06909] Updated weights for policy 0, policy_version 11302 (0.0023) [2024-06-27 14:16:27,878][06909] Updated weights for policy 0, policy_version 11312 (0.0031) [2024-06-27 14:16:28,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 185352192. Throughput: 0: 43797.8. Samples: 88238520. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 14:16:28,850][06674] Avg episode reward: [(0, '0.024')] [2024-06-27 14:16:31,742][06909] Updated weights for policy 0, policy_version 11322 (0.0041) [2024-06-27 14:16:32,782][06887] Signal inference workers to stop experience collection... (1200 times) [2024-06-27 14:16:32,785][06887] Signal inference workers to resume experience collection... (1200 times) [2024-06-27 14:16:32,798][06909] InferenceWorker_p0-w0: stopping experience collection (1200 times) [2024-06-27 14:16:32,799][06909] InferenceWorker_p0-w0: resuming experience collection (1200 times) [2024-06-27 14:16:33,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 185581568. Throughput: 0: 43970.2. Samples: 88504060. Policy #0 lag: (min: 1.0, avg: 11.1, max: 21.0) [2024-06-27 14:16:33,850][06674] Avg episode reward: [(0, '0.025')] [2024-06-27 14:16:35,293][06909] Updated weights for policy 0, policy_version 11332 (0.0024) [2024-06-27 14:16:38,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 185794560. Throughput: 0: 43869.7. Samples: 88762080. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 14:16:38,851][06674] Avg episode reward: [(0, '0.028')] [2024-06-27 14:16:39,186][06909] Updated weights for policy 0, policy_version 11342 (0.0026) [2024-06-27 14:16:42,656][06909] Updated weights for policy 0, policy_version 11352 (0.0051) [2024-06-27 14:16:43,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 186023936. Throughput: 0: 43774.3. Samples: 88896140. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 14:16:43,850][06674] Avg episode reward: [(0, '0.037')] [2024-06-27 14:16:46,610][06909] Updated weights for policy 0, policy_version 11362 (0.0033) [2024-06-27 14:16:48,850][06674] Fps is (10 sec: 44237.6, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 186236928. Throughput: 0: 43872.9. Samples: 89161460. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-27 14:16:48,850][06674] Avg episode reward: [(0, '0.035')] [2024-06-27 14:16:50,089][06909] Updated weights for policy 0, policy_version 11372 (0.0039) [2024-06-27 14:16:53,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.8, 300 sec: 43986.9). Total num frames: 186449920. Throughput: 0: 43811.2. Samples: 89420520. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-27 14:16:53,850][06674] Avg episode reward: [(0, '0.032')] [2024-06-27 14:16:54,142][06909] Updated weights for policy 0, policy_version 11382 (0.0036) [2024-06-27 14:16:57,742][06909] Updated weights for policy 0, policy_version 11392 (0.0030) [2024-06-27 14:16:58,852][06674] Fps is (10 sec: 44227.6, 60 sec: 43689.2, 300 sec: 44097.7). Total num frames: 186679296. Throughput: 0: 43827.4. Samples: 89551080. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-27 14:16:58,852][06674] Avg episode reward: [(0, '0.032')] [2024-06-27 14:17:01,670][06909] Updated weights for policy 0, policy_version 11402 (0.0035) [2024-06-27 14:17:03,853][06674] Fps is (10 sec: 44223.1, 60 sec: 43961.5, 300 sec: 43986.4). Total num frames: 186892288. Throughput: 0: 43828.1. Samples: 89816380. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-27 14:17:03,853][06674] Avg episode reward: [(0, '0.042')] [2024-06-27 14:17:03,986][06887] Saving new best policy, reward=0.042! [2024-06-27 14:17:05,122][06909] Updated weights for policy 0, policy_version 11412 (0.0050) [2024-06-27 14:17:08,850][06674] Fps is (10 sec: 44245.5, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 187121664. Throughput: 0: 43861.7. Samples: 90080140. Policy #0 lag: (min: 0.0, avg: 11.7, max: 21.0) [2024-06-27 14:17:08,851][06674] Avg episode reward: [(0, '0.042')] [2024-06-27 14:17:09,284][06909] Updated weights for policy 0, policy_version 11422 (0.0027) [2024-06-27 14:17:12,386][06909] Updated weights for policy 0, policy_version 11432 (0.0029) [2024-06-27 14:17:13,850][06674] Fps is (10 sec: 45889.2, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 187351040. Throughput: 0: 43890.2. Samples: 90213580. Policy #0 lag: (min: 0.0, avg: 11.7, max: 21.0) [2024-06-27 14:17:13,850][06674] Avg episode reward: [(0, '0.034')] [2024-06-27 14:17:16,722][06909] Updated weights for policy 0, policy_version 11442 (0.0038) [2024-06-27 14:17:18,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 187547648. Throughput: 0: 43858.2. Samples: 90477680. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 14:17:18,850][06674] Avg episode reward: [(0, '0.040')] [2024-06-27 14:17:20,009][06909] Updated weights for policy 0, policy_version 11452 (0.0033) [2024-06-27 14:17:23,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.8, 300 sec: 43987.0). Total num frames: 187777024. Throughput: 0: 43943.8. Samples: 90739540. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 14:17:23,850][06674] Avg episode reward: [(0, '0.031')] [2024-06-27 14:17:24,348][06909] Updated weights for policy 0, policy_version 11462 (0.0026) [2024-06-27 14:17:27,377][06909] Updated weights for policy 0, policy_version 11472 (0.0029) [2024-06-27 14:17:28,852][06674] Fps is (10 sec: 45865.5, 60 sec: 44235.3, 300 sec: 44097.6). Total num frames: 188006400. Throughput: 0: 43832.2. Samples: 90868680. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 14:17:28,852][06674] Avg episode reward: [(0, '0.033')] [2024-06-27 14:17:31,871][06909] Updated weights for policy 0, policy_version 11482 (0.0039) [2024-06-27 14:17:33,852][06674] Fps is (10 sec: 44227.2, 60 sec: 43962.2, 300 sec: 43986.6). Total num frames: 188219392. Throughput: 0: 43861.5. Samples: 91135320. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 14:17:33,852][06674] Avg episode reward: [(0, '0.028')] [2024-06-27 14:17:35,007][06909] Updated weights for policy 0, policy_version 11492 (0.0027) [2024-06-27 14:17:38,850][06674] Fps is (10 sec: 42607.0, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 188432384. Throughput: 0: 43886.1. Samples: 91395400. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 14:17:38,850][06674] Avg episode reward: [(0, '0.028')] [2024-06-27 14:17:39,311][06909] Updated weights for policy 0, policy_version 11502 (0.0040) [2024-06-27 14:17:42,464][06909] Updated weights for policy 0, policy_version 11512 (0.0030) [2024-06-27 14:17:43,850][06674] Fps is (10 sec: 44245.9, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 188661760. Throughput: 0: 43963.3. Samples: 91529340. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-27 14:17:43,850][06674] Avg episode reward: [(0, '0.026')] [2024-06-27 14:17:44,736][06887] Signal inference workers to stop experience collection... (1250 times) [2024-06-27 14:17:44,736][06887] Signal inference workers to resume experience collection... (1250 times) [2024-06-27 14:17:44,766][06909] InferenceWorker_p0-w0: stopping experience collection (1250 times) [2024-06-27 14:17:44,766][06909] InferenceWorker_p0-w0: resuming experience collection (1250 times) [2024-06-27 14:17:46,770][06909] Updated weights for policy 0, policy_version 11522 (0.0032) [2024-06-27 14:17:48,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.6, 300 sec: 43987.0). Total num frames: 188874752. Throughput: 0: 44052.2. Samples: 91798600. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-27 14:17:48,859][06674] Avg episode reward: [(0, '0.028')] [2024-06-27 14:17:48,989][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000011529_188891136.pth... [2024-06-27 14:17:49,043][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000010883_178307072.pth [2024-06-27 14:17:49,815][06909] Updated weights for policy 0, policy_version 11532 (0.0040) [2024-06-27 14:17:53,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 189087744. Throughput: 0: 44065.9. Samples: 92063100. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 14:17:53,850][06674] Avg episode reward: [(0, '0.030')] [2024-06-27 14:17:54,308][06909] Updated weights for policy 0, policy_version 11542 (0.0035) [2024-06-27 14:17:57,227][06909] Updated weights for policy 0, policy_version 11552 (0.0031) [2024-06-27 14:17:58,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43965.2, 300 sec: 44042.4). Total num frames: 189317120. Throughput: 0: 43916.0. Samples: 92189800. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 14:17:58,850][06674] Avg episode reward: [(0, '0.045')] [2024-06-27 14:17:58,863][06887] Saving new best policy, reward=0.045! [2024-06-27 14:18:01,618][06909] Updated weights for policy 0, policy_version 11562 (0.0037) [2024-06-27 14:18:03,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44239.0, 300 sec: 44042.4). Total num frames: 189546496. Throughput: 0: 44122.6. Samples: 92463200. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 14:18:03,850][06674] Avg episode reward: [(0, '0.053')] [2024-06-27 14:18:03,851][06887] Saving new best policy, reward=0.053! [2024-06-27 14:18:04,811][06909] Updated weights for policy 0, policy_version 11572 (0.0038) [2024-06-27 14:18:08,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 189743104. Throughput: 0: 44259.0. Samples: 92731200. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 14:18:08,850][06674] Avg episode reward: [(0, '0.052')] [2024-06-27 14:18:09,066][06909] Updated weights for policy 0, policy_version 11582 (0.0030) [2024-06-27 14:18:12,199][06909] Updated weights for policy 0, policy_version 11592 (0.0044) [2024-06-27 14:18:13,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.6, 300 sec: 44098.0). Total num frames: 189988864. Throughput: 0: 44207.7. Samples: 92857940. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 14:18:13,851][06674] Avg episode reward: [(0, '0.031')] [2024-06-27 14:18:16,576][06909] Updated weights for policy 0, policy_version 11602 (0.0040) [2024-06-27 14:18:18,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 190218240. Throughput: 0: 44230.9. Samples: 93125620. Policy #0 lag: (min: 1.0, avg: 10.8, max: 23.0) [2024-06-27 14:18:18,850][06674] Avg episode reward: [(0, '0.036')] [2024-06-27 14:18:19,529][06909] Updated weights for policy 0, policy_version 11612 (0.0031) [2024-06-27 14:18:23,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 190398464. Throughput: 0: 44456.9. Samples: 93395960. Policy #0 lag: (min: 1.0, avg: 10.8, max: 23.0) [2024-06-27 14:18:23,850][06674] Avg episode reward: [(0, '0.031')] [2024-06-27 14:18:24,039][06909] Updated weights for policy 0, policy_version 11622 (0.0036) [2024-06-27 14:18:26,961][06909] Updated weights for policy 0, policy_version 11632 (0.0038) [2024-06-27 14:18:28,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43692.2, 300 sec: 44042.4). Total num frames: 190627840. Throughput: 0: 44208.9. Samples: 93518740. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 14:18:28,850][06674] Avg episode reward: [(0, '0.037')] [2024-06-27 14:18:31,389][06909] Updated weights for policy 0, policy_version 11642 (0.0033) [2024-06-27 14:18:33,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44238.3, 300 sec: 44042.4). Total num frames: 190873600. Throughput: 0: 44240.9. Samples: 93789440. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 14:18:33,851][06674] Avg episode reward: [(0, '0.038')] [2024-06-27 14:18:34,332][06909] Updated weights for policy 0, policy_version 11652 (0.0034) [2024-06-27 14:18:38,793][06909] Updated weights for policy 0, policy_version 11662 (0.0030) [2024-06-27 14:18:38,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 43987.3). Total num frames: 191070208. Throughput: 0: 44319.9. Samples: 94057500. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-27 14:18:38,850][06674] Avg episode reward: [(0, '0.038')] [2024-06-27 14:18:41,689][06909] Updated weights for policy 0, policy_version 11672 (0.0033) [2024-06-27 14:18:43,850][06674] Fps is (10 sec: 44237.4, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 191315968. Throughput: 0: 44251.6. Samples: 94181120. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-27 14:18:43,850][06674] Avg episode reward: [(0, '0.045')] [2024-06-27 14:18:46,039][06909] Updated weights for policy 0, policy_version 11682 (0.0027) [2024-06-27 14:18:48,850][06674] Fps is (10 sec: 47513.8, 60 sec: 44509.9, 300 sec: 44097.9). Total num frames: 191545344. Throughput: 0: 44125.8. Samples: 94448860. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 14:18:48,850][06674] Avg episode reward: [(0, '0.045')] [2024-06-27 14:18:48,961][06909] Updated weights for policy 0, policy_version 11692 (0.0040) [2024-06-27 14:18:52,281][06887] Signal inference workers to stop experience collection... (1300 times) [2024-06-27 14:18:52,281][06887] Signal inference workers to resume experience collection... (1300 times) [2024-06-27 14:18:52,292][06909] InferenceWorker_p0-w0: stopping experience collection (1300 times) [2024-06-27 14:18:52,293][06909] InferenceWorker_p0-w0: resuming experience collection (1300 times) [2024-06-27 14:18:53,427][06909] Updated weights for policy 0, policy_version 11702 (0.0032) [2024-06-27 14:18:53,850][06674] Fps is (10 sec: 42598.0, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 191741952. Throughput: 0: 44192.9. Samples: 94719880. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 14:18:53,850][06674] Avg episode reward: [(0, '0.044')] [2024-06-27 14:18:56,350][06909] Updated weights for policy 0, policy_version 11712 (0.0033) [2024-06-27 14:18:58,856][06674] Fps is (10 sec: 44210.3, 60 sec: 44505.4, 300 sec: 44152.6). Total num frames: 191987712. Throughput: 0: 44104.0. Samples: 94842880. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 14:18:58,856][06674] Avg episode reward: [(0, '0.051')] [2024-06-27 14:19:00,953][06909] Updated weights for policy 0, policy_version 11722 (0.0033) [2024-06-27 14:19:03,823][06909] Updated weights for policy 0, policy_version 11732 (0.0029) [2024-06-27 14:19:03,850][06674] Fps is (10 sec: 47513.0, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 192217088. Throughput: 0: 44067.5. Samples: 95108660. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 14:19:03,851][06674] Avg episode reward: [(0, '0.041')] [2024-06-27 14:19:08,374][06909] Updated weights for policy 0, policy_version 11742 (0.0031) [2024-06-27 14:19:08,850][06674] Fps is (10 sec: 39344.8, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 192380928. Throughput: 0: 43992.8. Samples: 95375640. Policy #0 lag: (min: 0.0, avg: 11.8, max: 21.0) [2024-06-27 14:19:08,850][06674] Avg episode reward: [(0, '0.044')] [2024-06-27 14:19:11,338][06909] Updated weights for policy 0, policy_version 11752 (0.0048) [2024-06-27 14:19:13,850][06674] Fps is (10 sec: 42599.2, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 192643072. Throughput: 0: 43980.5. Samples: 95497860. Policy #0 lag: (min: 1.0, avg: 10.6, max: 20.0) [2024-06-27 14:19:13,850][06674] Avg episode reward: [(0, '0.042')] [2024-06-27 14:19:15,750][06909] Updated weights for policy 0, policy_version 11762 (0.0025) [2024-06-27 14:19:18,850][06674] Fps is (10 sec: 47513.2, 60 sec: 43963.6, 300 sec: 44097.9). Total num frames: 192856064. Throughput: 0: 44023.4. Samples: 95770500. Policy #0 lag: (min: 1.0, avg: 10.6, max: 20.0) [2024-06-27 14:19:18,851][06674] Avg episode reward: [(0, '0.038')] [2024-06-27 14:19:19,002][06909] Updated weights for policy 0, policy_version 11772 (0.0029) [2024-06-27 14:19:23,575][06909] Updated weights for policy 0, policy_version 11782 (0.0037) [2024-06-27 14:19:23,850][06674] Fps is (10 sec: 40960.1, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 193052672. Throughput: 0: 44105.0. Samples: 96042220. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2024-06-27 14:19:23,850][06674] Avg episode reward: [(0, '0.038')] [2024-06-27 14:19:26,435][06909] Updated weights for policy 0, policy_version 11792 (0.0036) [2024-06-27 14:19:28,850][06674] Fps is (10 sec: 44237.3, 60 sec: 44509.8, 300 sec: 44098.0). Total num frames: 193298432. Throughput: 0: 44102.9. Samples: 96165760. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2024-06-27 14:19:28,851][06674] Avg episode reward: [(0, '0.041')] [2024-06-27 14:19:30,961][06909] Updated weights for policy 0, policy_version 11802 (0.0032) [2024-06-27 14:19:33,835][06909] Updated weights for policy 0, policy_version 11812 (0.0034) [2024-06-27 14:19:33,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 193527808. Throughput: 0: 43994.7. Samples: 96428620. Policy #0 lag: (min: 0.0, avg: 11.5, max: 25.0) [2024-06-27 14:19:33,850][06674] Avg episode reward: [(0, '0.041')] [2024-06-27 14:19:38,273][06909] Updated weights for policy 0, policy_version 11822 (0.0031) [2024-06-27 14:19:38,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 193724416. Throughput: 0: 43991.5. Samples: 96699500. Policy #0 lag: (min: 0.0, avg: 11.5, max: 25.0) [2024-06-27 14:19:38,850][06674] Avg episode reward: [(0, '0.043')] [2024-06-27 14:19:41,295][06909] Updated weights for policy 0, policy_version 11832 (0.0035) [2024-06-27 14:19:43,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43963.6, 300 sec: 44097.9). Total num frames: 193953792. Throughput: 0: 44125.4. Samples: 96828260. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 14:19:43,850][06674] Avg episode reward: [(0, '0.042')] [2024-06-27 14:19:45,643][06909] Updated weights for policy 0, policy_version 11842 (0.0038) [2024-06-27 14:19:48,726][06909] Updated weights for policy 0, policy_version 11852 (0.0028) [2024-06-27 14:19:48,850][06674] Fps is (10 sec: 45875.9, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 194183168. Throughput: 0: 44104.2. Samples: 97093340. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 14:19:48,850][06674] Avg episode reward: [(0, '0.037')] [2024-06-27 14:19:48,910][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000011853_194199552.pth... [2024-06-27 14:19:48,966][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000011206_183599104.pth [2024-06-27 14:19:49,599][06887] Signal inference workers to stop experience collection... (1350 times) [2024-06-27 14:19:49,600][06887] Signal inference workers to resume experience collection... (1350 times) [2024-06-27 14:19:49,620][06909] InferenceWorker_p0-w0: stopping experience collection (1350 times) [2024-06-27 14:19:49,620][06909] InferenceWorker_p0-w0: resuming experience collection (1350 times) [2024-06-27 14:19:53,008][06909] Updated weights for policy 0, policy_version 11862 (0.0024) [2024-06-27 14:19:53,850][06674] Fps is (10 sec: 44237.4, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 194396160. Throughput: 0: 44113.9. Samples: 97360760. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 14:19:53,850][06674] Avg episode reward: [(0, '0.031')] [2024-06-27 14:19:56,231][06909] Updated weights for policy 0, policy_version 11872 (0.0037) [2024-06-27 14:19:58,852][06674] Fps is (10 sec: 45865.6, 60 sec: 44239.7, 300 sec: 44208.7). Total num frames: 194641920. Throughput: 0: 44132.6. Samples: 97483920. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 14:19:58,852][06674] Avg episode reward: [(0, '0.050')] [2024-06-27 14:20:00,589][06909] Updated weights for policy 0, policy_version 11882 (0.0036) [2024-06-27 14:20:03,615][06909] Updated weights for policy 0, policy_version 11892 (0.0036) [2024-06-27 14:20:03,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 194838528. Throughput: 0: 44085.1. Samples: 97754320. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-27 14:20:03,850][06674] Avg episode reward: [(0, '0.049')] [2024-06-27 14:20:07,964][06909] Updated weights for policy 0, policy_version 11902 (0.0026) [2024-06-27 14:20:08,850][06674] Fps is (10 sec: 40968.5, 60 sec: 44510.0, 300 sec: 44042.4). Total num frames: 195051520. Throughput: 0: 43935.5. Samples: 98019320. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-27 14:20:08,850][06674] Avg episode reward: [(0, '0.052')] [2024-06-27 14:20:10,968][06909] Updated weights for policy 0, policy_version 11912 (0.0025) [2024-06-27 14:20:13,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 195280896. Throughput: 0: 44042.8. Samples: 98147680. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 14:20:13,850][06674] Avg episode reward: [(0, '0.039')] [2024-06-27 14:20:15,446][06909] Updated weights for policy 0, policy_version 11922 (0.0036) [2024-06-27 14:20:18,663][06909] Updated weights for policy 0, policy_version 11932 (0.0038) [2024-06-27 14:20:18,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44236.9, 300 sec: 44097.9). Total num frames: 195510272. Throughput: 0: 44131.9. Samples: 98414560. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 14:20:18,850][06674] Avg episode reward: [(0, '0.039')] [2024-06-27 14:20:22,993][06909] Updated weights for policy 0, policy_version 11942 (0.0049) [2024-06-27 14:20:23,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 195706880. Throughput: 0: 43985.5. Samples: 98678840. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 14:20:23,850][06674] Avg episode reward: [(0, '0.053')] [2024-06-27 14:20:26,053][06909] Updated weights for policy 0, policy_version 11952 (0.0029) [2024-06-27 14:20:28,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 195952640. Throughput: 0: 43931.2. Samples: 98805160. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 14:20:28,850][06674] Avg episode reward: [(0, '0.043')] [2024-06-27 14:20:30,293][06909] Updated weights for policy 0, policy_version 11962 (0.0037) [2024-06-27 14:20:33,401][06909] Updated weights for policy 0, policy_version 11972 (0.0040) [2024-06-27 14:20:33,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43963.6, 300 sec: 44097.9). Total num frames: 196165632. Throughput: 0: 44042.6. Samples: 99075260. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 14:20:33,850][06674] Avg episode reward: [(0, '0.041')] [2024-06-27 14:20:37,620][06909] Updated weights for policy 0, policy_version 11982 (0.0029) [2024-06-27 14:20:38,850][06674] Fps is (10 sec: 42597.5, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 196378624. Throughput: 0: 43925.6. Samples: 99337420. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-27 14:20:38,850][06674] Avg episode reward: [(0, '0.033')] [2024-06-27 14:20:40,859][06909] Updated weights for policy 0, policy_version 11992 (0.0045) [2024-06-27 14:20:43,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.9, 300 sec: 44097.9). Total num frames: 196608000. Throughput: 0: 44092.2. Samples: 99467980. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-27 14:20:43,850][06674] Avg episode reward: [(0, '0.040')] [2024-06-27 14:20:45,049][06909] Updated weights for policy 0, policy_version 12002 (0.0039) [2024-06-27 14:20:48,324][06909] Updated weights for policy 0, policy_version 12012 (0.0037) [2024-06-27 14:20:48,850][06674] Fps is (10 sec: 42599.3, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 196804608. Throughput: 0: 44019.2. Samples: 99735180. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 14:20:48,850][06674] Avg episode reward: [(0, '0.034')] [2024-06-27 14:20:52,388][06909] Updated weights for policy 0, policy_version 12022 (0.0035) [2024-06-27 14:20:53,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 197033984. Throughput: 0: 43896.3. Samples: 99994660. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 14:20:53,850][06674] Avg episode reward: [(0, '0.040')] [2024-06-27 14:20:55,832][06909] Updated weights for policy 0, policy_version 12032 (0.0023) [2024-06-27 14:20:58,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43419.0, 300 sec: 44042.4). Total num frames: 197246976. Throughput: 0: 44091.9. Samples: 100131820. Policy #0 lag: (min: 1.0, avg: 10.5, max: 23.0) [2024-06-27 14:20:58,850][06674] Avg episode reward: [(0, '0.043')] [2024-06-27 14:20:59,773][06909] Updated weights for policy 0, policy_version 12042 (0.0027) [2024-06-27 14:21:02,989][06887] Signal inference workers to stop experience collection... (1400 times) [2024-06-27 14:21:03,044][06909] InferenceWorker_p0-w0: stopping experience collection (1400 times) [2024-06-27 14:21:03,106][06887] Signal inference workers to resume experience collection... (1400 times) [2024-06-27 14:21:03,106][06909] InferenceWorker_p0-w0: resuming experience collection (1400 times) [2024-06-27 14:21:03,246][06909] Updated weights for policy 0, policy_version 12052 (0.0031) [2024-06-27 14:21:03,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 197476352. Throughput: 0: 44199.0. Samples: 100403520. Policy #0 lag: (min: 1.0, avg: 10.5, max: 23.0) [2024-06-27 14:21:03,850][06674] Avg episode reward: [(0, '0.039')] [2024-06-27 14:21:07,221][06909] Updated weights for policy 0, policy_version 12062 (0.0040) [2024-06-27 14:21:08,850][06674] Fps is (10 sec: 45875.7, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 197705728. Throughput: 0: 44018.2. Samples: 100659660. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 14:21:08,850][06674] Avg episode reward: [(0, '0.040')] [2024-06-27 14:21:10,780][06909] Updated weights for policy 0, policy_version 12072 (0.0027) [2024-06-27 14:21:13,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 197902336. Throughput: 0: 44236.4. Samples: 100795800. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 14:21:13,851][06674] Avg episode reward: [(0, '0.040')] [2024-06-27 14:21:14,495][06909] Updated weights for policy 0, policy_version 12082 (0.0031) [2024-06-27 14:21:18,005][06909] Updated weights for policy 0, policy_version 12092 (0.0041) [2024-06-27 14:21:18,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 198148096. Throughput: 0: 44178.3. Samples: 101063280. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-27 14:21:18,850][06674] Avg episode reward: [(0, '0.039')] [2024-06-27 14:21:21,960][06909] Updated weights for policy 0, policy_version 12102 (0.0037) [2024-06-27 14:21:23,850][06674] Fps is (10 sec: 47513.9, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 198377472. Throughput: 0: 44083.3. Samples: 101321160. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-27 14:21:23,850][06674] Avg episode reward: [(0, '0.041')] [2024-06-27 14:21:25,480][06909] Updated weights for policy 0, policy_version 12112 (0.0039) [2024-06-27 14:21:28,852][06674] Fps is (10 sec: 42589.3, 60 sec: 43689.1, 300 sec: 44042.1). Total num frames: 198574080. Throughput: 0: 44181.9. Samples: 101456260. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-27 14:21:28,853][06674] Avg episode reward: [(0, '0.050')] [2024-06-27 14:21:29,567][06909] Updated weights for policy 0, policy_version 12122 (0.0035) [2024-06-27 14:21:33,029][06909] Updated weights for policy 0, policy_version 12132 (0.0028) [2024-06-27 14:21:33,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 198803456. Throughput: 0: 44201.7. Samples: 101724260. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 14:21:33,850][06674] Avg episode reward: [(0, '0.048')] [2024-06-27 14:21:36,811][06909] Updated weights for policy 0, policy_version 12142 (0.0022) [2024-06-27 14:21:38,850][06674] Fps is (10 sec: 45885.0, 60 sec: 44236.9, 300 sec: 44097.9). Total num frames: 199032832. Throughput: 0: 44249.8. Samples: 101985900. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 14:21:38,850][06674] Avg episode reward: [(0, '0.043')] [2024-06-27 14:21:40,358][06909] Updated weights for policy 0, policy_version 12152 (0.0025) [2024-06-27 14:21:43,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 199229440. Throughput: 0: 44213.9. Samples: 102121440. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-27 14:21:43,850][06674] Avg episode reward: [(0, '0.042')] [2024-06-27 14:21:44,254][06909] Updated weights for policy 0, policy_version 12162 (0.0029) [2024-06-27 14:21:47,766][06909] Updated weights for policy 0, policy_version 12172 (0.0030) [2024-06-27 14:21:48,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 199458816. Throughput: 0: 44110.2. Samples: 102388480. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-27 14:21:48,850][06674] Avg episode reward: [(0, '0.042')] [2024-06-27 14:21:48,863][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000012174_199458816.pth... [2024-06-27 14:21:48,923][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000011529_188891136.pth [2024-06-27 14:21:51,638][06909] Updated weights for policy 0, policy_version 12182 (0.0028) [2024-06-27 14:21:53,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.9, 300 sec: 44098.3). Total num frames: 199688192. Throughput: 0: 44061.8. Samples: 102642440. Policy #0 lag: (min: 1.0, avg: 9.5, max: 21.0) [2024-06-27 14:21:53,850][06674] Avg episode reward: [(0, '0.053')] [2024-06-27 14:21:55,074][06909] Updated weights for policy 0, policy_version 12192 (0.0025) [2024-06-27 14:21:58,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.8, 300 sec: 44098.4). Total num frames: 199901184. Throughput: 0: 44195.1. Samples: 102784580. Policy #0 lag: (min: 1.0, avg: 9.5, max: 21.0) [2024-06-27 14:21:58,850][06674] Avg episode reward: [(0, '0.053')] [2024-06-27 14:21:59,054][06909] Updated weights for policy 0, policy_version 12202 (0.0044) [2024-06-27 14:22:02,510][06909] Updated weights for policy 0, policy_version 12212 (0.0029) [2024-06-27 14:22:03,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 200114176. Throughput: 0: 44090.7. Samples: 103047360. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 14:22:03,850][06674] Avg episode reward: [(0, '0.046')] [2024-06-27 14:22:06,291][06909] Updated weights for policy 0, policy_version 12222 (0.0028) [2024-06-27 14:22:08,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 200343552. Throughput: 0: 44309.4. Samples: 103315080. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 14:22:08,850][06674] Avg episode reward: [(0, '0.046')] [2024-06-27 14:22:09,886][06909] Updated weights for policy 0, policy_version 12232 (0.0033) [2024-06-27 14:22:13,815][06909] Updated weights for policy 0, policy_version 12242 (0.0024) [2024-06-27 14:22:13,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 200572928. Throughput: 0: 44266.2. Samples: 103448140. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 14:22:13,850][06674] Avg episode reward: [(0, '0.039')] [2024-06-27 14:22:17,298][06909] Updated weights for policy 0, policy_version 12252 (0.0022) [2024-06-27 14:22:18,850][06674] Fps is (10 sec: 44236.0, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 200785920. Throughput: 0: 44069.2. Samples: 103707380. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 14:22:18,851][06674] Avg episode reward: [(0, '0.039')] [2024-06-27 14:22:21,412][06909] Updated weights for policy 0, policy_version 12262 (0.0051) [2024-06-27 14:22:23,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.7, 300 sec: 44042.7). Total num frames: 200998912. Throughput: 0: 44128.5. Samples: 103971680. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-27 14:22:23,850][06674] Avg episode reward: [(0, '0.041')] [2024-06-27 14:22:24,637][06909] Updated weights for policy 0, policy_version 12272 (0.0033) [2024-06-27 14:22:28,754][06909] Updated weights for policy 0, policy_version 12282 (0.0048) [2024-06-27 14:22:28,852][06674] Fps is (10 sec: 44228.4, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 201228288. Throughput: 0: 44055.3. Samples: 104104020. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-27 14:22:28,852][06674] Avg episode reward: [(0, '0.045')] [2024-06-27 14:22:31,994][06909] Updated weights for policy 0, policy_version 12292 (0.0022) [2024-06-27 14:22:33,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 201457664. Throughput: 0: 43913.8. Samples: 104364600. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 14:22:33,850][06674] Avg episode reward: [(0, '0.046')] [2024-06-27 14:22:35,527][06887] Signal inference workers to stop experience collection... (1450 times) [2024-06-27 14:22:35,528][06887] Signal inference workers to resume experience collection... (1450 times) [2024-06-27 14:22:35,549][06909] InferenceWorker_p0-w0: stopping experience collection (1450 times) [2024-06-27 14:22:35,550][06909] InferenceWorker_p0-w0: resuming experience collection (1450 times) [2024-06-27 14:22:36,154][06909] Updated weights for policy 0, policy_version 12302 (0.0031) [2024-06-27 14:22:38,850][06674] Fps is (10 sec: 44245.4, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 201670656. Throughput: 0: 44404.8. Samples: 104640660. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 14:22:38,850][06674] Avg episode reward: [(0, '0.043')] [2024-06-27 14:22:39,330][06909] Updated weights for policy 0, policy_version 12312 (0.0038) [2024-06-27 14:22:43,408][06909] Updated weights for policy 0, policy_version 12322 (0.0035) [2024-06-27 14:22:43,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 201900032. Throughput: 0: 44089.3. Samples: 104768600. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 14:22:43,850][06674] Avg episode reward: [(0, '0.045')] [2024-06-27 14:22:46,876][06909] Updated weights for policy 0, policy_version 12332 (0.0022) [2024-06-27 14:22:48,850][06674] Fps is (10 sec: 44237.4, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 202113024. Throughput: 0: 44148.0. Samples: 105034020. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 14:22:48,850][06674] Avg episode reward: [(0, '0.056')] [2024-06-27 14:22:48,899][06887] Saving new best policy, reward=0.056! [2024-06-27 14:22:51,015][06909] Updated weights for policy 0, policy_version 12342 (0.0028) [2024-06-27 14:22:53,852][06674] Fps is (10 sec: 45866.0, 60 sec: 44508.3, 300 sec: 44208.7). Total num frames: 202358784. Throughput: 0: 44206.0. Samples: 105304440. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 14:22:53,852][06674] Avg episode reward: [(0, '0.055')] [2024-06-27 14:22:54,207][06909] Updated weights for policy 0, policy_version 12352 (0.0030) [2024-06-27 14:22:58,265][06909] Updated weights for policy 0, policy_version 12362 (0.0040) [2024-06-27 14:22:58,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 202555392. Throughput: 0: 44166.3. Samples: 105435620. Policy #0 lag: (min: 0.0, avg: 8.5, max: 22.0) [2024-06-27 14:22:58,850][06674] Avg episode reward: [(0, '0.055')] [2024-06-27 14:23:01,609][06909] Updated weights for policy 0, policy_version 12372 (0.0037) [2024-06-27 14:23:03,850][06674] Fps is (10 sec: 42606.7, 60 sec: 44509.8, 300 sec: 44209.0). Total num frames: 202784768. Throughput: 0: 44247.6. Samples: 105698520. Policy #0 lag: (min: 0.0, avg: 8.5, max: 22.0) [2024-06-27 14:23:03,851][06674] Avg episode reward: [(0, '0.052')] [2024-06-27 14:23:05,677][06909] Updated weights for policy 0, policy_version 12382 (0.0027) [2024-06-27 14:23:08,850][06674] Fps is (10 sec: 45874.2, 60 sec: 44509.7, 300 sec: 44153.5). Total num frames: 203014144. Throughput: 0: 44423.4. Samples: 105970740. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 14:23:08,850][06674] Avg episode reward: [(0, '0.052')] [2024-06-27 14:23:08,926][06909] Updated weights for policy 0, policy_version 12392 (0.0035) [2024-06-27 14:23:13,111][06909] Updated weights for policy 0, policy_version 12402 (0.0024) [2024-06-27 14:23:13,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 203243520. Throughput: 0: 44393.5. Samples: 106101640. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 14:23:13,850][06674] Avg episode reward: [(0, '0.043')] [2024-06-27 14:23:16,548][06909] Updated weights for policy 0, policy_version 12412 (0.0036) [2024-06-27 14:23:18,850][06674] Fps is (10 sec: 42598.7, 60 sec: 44236.9, 300 sec: 44209.0). Total num frames: 203440128. Throughput: 0: 44440.9. Samples: 106364440. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 14:23:18,850][06674] Avg episode reward: [(0, '0.042')] [2024-06-27 14:23:20,500][06909] Updated weights for policy 0, policy_version 12422 (0.0040) [2024-06-27 14:23:23,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44509.8, 300 sec: 44209.0). Total num frames: 203669504. Throughput: 0: 44263.7. Samples: 106632520. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 14:23:23,864][06674] Avg episode reward: [(0, '0.041')] [2024-06-27 14:23:23,896][06909] Updated weights for policy 0, policy_version 12432 (0.0041) [2024-06-27 14:23:27,839][06909] Updated weights for policy 0, policy_version 12442 (0.0037) [2024-06-27 14:23:28,852][06674] Fps is (10 sec: 45866.1, 60 sec: 44509.9, 300 sec: 44153.2). Total num frames: 203898880. Throughput: 0: 44336.7. Samples: 106763840. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-27 14:23:28,852][06674] Avg episode reward: [(0, '0.045')] [2024-06-27 14:23:31,453][06909] Updated weights for policy 0, policy_version 12452 (0.0034) [2024-06-27 14:23:33,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 204095488. Throughput: 0: 44196.4. Samples: 107022860. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-27 14:23:33,850][06674] Avg episode reward: [(0, '0.047')] [2024-06-27 14:23:35,279][06909] Updated weights for policy 0, policy_version 12462 (0.0029) [2024-06-27 14:23:38,852][06909] Updated weights for policy 0, policy_version 12472 (0.0044) [2024-06-27 14:23:38,850][06674] Fps is (10 sec: 44245.6, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 204341248. Throughput: 0: 44151.3. Samples: 107291160. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 14:23:38,852][06674] Avg episode reward: [(0, '0.048')] [2024-06-27 14:23:42,716][06909] Updated weights for policy 0, policy_version 12482 (0.0036) [2024-06-27 14:23:43,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 204554240. Throughput: 0: 44131.0. Samples: 107421520. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 14:23:43,851][06674] Avg episode reward: [(0, '0.048')] [2024-06-27 14:23:46,229][06909] Updated weights for policy 0, policy_version 12492 (0.0037) [2024-06-27 14:23:48,850][06674] Fps is (10 sec: 42598.1, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 204767232. Throughput: 0: 44259.1. Samples: 107690180. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 14:23:48,851][06674] Avg episode reward: [(0, '0.051')] [2024-06-27 14:23:48,878][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000012498_204767232.pth... [2024-06-27 14:23:48,935][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000011853_194199552.pth [2024-06-27 14:23:50,193][06909] Updated weights for policy 0, policy_version 12502 (0.0032) [2024-06-27 14:23:53,661][06909] Updated weights for policy 0, policy_version 12512 (0.0038) [2024-06-27 14:23:53,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43965.2, 300 sec: 44098.9). Total num frames: 204996608. Throughput: 0: 44003.7. Samples: 107950900. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 14:23:53,850][06674] Avg episode reward: [(0, '0.052')] [2024-06-27 14:23:57,613][06909] Updated weights for policy 0, policy_version 12522 (0.0034) [2024-06-27 14:23:58,850][06674] Fps is (10 sec: 44237.3, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 205209600. Throughput: 0: 44150.3. Samples: 108088400. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 14:23:58,850][06674] Avg episode reward: [(0, '0.049')] [2024-06-27 14:24:01,068][06909] Updated weights for policy 0, policy_version 12532 (0.0028) [2024-06-27 14:24:03,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.8, 300 sec: 44209.0). Total num frames: 205422592. Throughput: 0: 44232.5. Samples: 108354900. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 14:24:03,850][06674] Avg episode reward: [(0, '0.055')] [2024-06-27 14:24:04,926][06909] Updated weights for policy 0, policy_version 12542 (0.0026) [2024-06-27 14:24:05,380][06887] Signal inference workers to stop experience collection... (1500 times) [2024-06-27 14:24:05,380][06887] Signal inference workers to resume experience collection... (1500 times) [2024-06-27 14:24:05,408][06909] InferenceWorker_p0-w0: stopping experience collection (1500 times) [2024-06-27 14:24:05,408][06909] InferenceWorker_p0-w0: resuming experience collection (1500 times) [2024-06-27 14:24:08,371][06909] Updated weights for policy 0, policy_version 12552 (0.0035) [2024-06-27 14:24:08,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 205668352. Throughput: 0: 43939.6. Samples: 108609800. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 14:24:08,850][06674] Avg episode reward: [(0, '0.050')] [2024-06-27 14:24:12,356][06909] Updated weights for policy 0, policy_version 12562 (0.0040) [2024-06-27 14:24:13,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 205864960. Throughput: 0: 44050.0. Samples: 108746000. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 14:24:13,850][06674] Avg episode reward: [(0, '0.056')] [2024-06-27 14:24:15,986][06909] Updated weights for policy 0, policy_version 12572 (0.0028) [2024-06-27 14:24:18,850][06674] Fps is (10 sec: 39321.2, 60 sec: 43690.7, 300 sec: 44097.9). Total num frames: 206061568. Throughput: 0: 44092.9. Samples: 109007040. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 14:24:18,850][06674] Avg episode reward: [(0, '0.068')] [2024-06-27 14:24:18,875][06887] Saving new best policy, reward=0.068! [2024-06-27 14:24:19,823][06909] Updated weights for policy 0, policy_version 12582 (0.0043) [2024-06-27 14:24:23,370][06909] Updated weights for policy 0, policy_version 12592 (0.0040) [2024-06-27 14:24:23,850][06674] Fps is (10 sec: 45874.6, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 206323712. Throughput: 0: 43924.0. Samples: 109267740. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-27 14:24:23,851][06674] Avg episode reward: [(0, '0.067')] [2024-06-27 14:24:27,247][06909] Updated weights for policy 0, policy_version 12602 (0.0035) [2024-06-27 14:24:28,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43692.1, 300 sec: 44042.4). Total num frames: 206520320. Throughput: 0: 44248.0. Samples: 109412680. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-27 14:24:28,850][06674] Avg episode reward: [(0, '0.061')] [2024-06-27 14:24:30,700][06909] Updated weights for policy 0, policy_version 12612 (0.0029) [2024-06-27 14:24:33,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 206733312. Throughput: 0: 44005.5. Samples: 109670420. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-27 14:24:33,850][06674] Avg episode reward: [(0, '0.061')] [2024-06-27 14:24:34,658][06909] Updated weights for policy 0, policy_version 12622 (0.0032) [2024-06-27 14:24:38,054][06909] Updated weights for policy 0, policy_version 12632 (0.0033) [2024-06-27 14:24:38,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 206979072. Throughput: 0: 44017.6. Samples: 109931700. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-27 14:24:38,852][06674] Avg episode reward: [(0, '0.062')] [2024-06-27 14:24:42,126][06909] Updated weights for policy 0, policy_version 12642 (0.0033) [2024-06-27 14:24:43,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 207175680. Throughput: 0: 44134.7. Samples: 110074460. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-27 14:24:43,850][06674] Avg episode reward: [(0, '0.053')] [2024-06-27 14:24:45,443][06909] Updated weights for policy 0, policy_version 12652 (0.0040) [2024-06-27 14:24:48,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 207388672. Throughput: 0: 43712.0. Samples: 110321940. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-27 14:24:48,850][06674] Avg episode reward: [(0, '0.064')] [2024-06-27 14:24:49,557][06909] Updated weights for policy 0, policy_version 12662 (0.0031) [2024-06-27 14:24:53,082][06909] Updated weights for policy 0, policy_version 12672 (0.0039) [2024-06-27 14:24:53,850][06674] Fps is (10 sec: 44235.9, 60 sec: 43690.6, 300 sec: 43987.2). Total num frames: 207618048. Throughput: 0: 43989.6. Samples: 110589340. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-27 14:24:53,851][06674] Avg episode reward: [(0, '0.064')] [2024-06-27 14:24:56,899][06909] Updated weights for policy 0, policy_version 12682 (0.0036) [2024-06-27 14:24:58,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 207831040. Throughput: 0: 44073.8. Samples: 110729320. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-27 14:24:58,850][06674] Avg episode reward: [(0, '0.067')] [2024-06-27 14:25:00,661][06909] Updated weights for policy 0, policy_version 12692 (0.0033) [2024-06-27 14:25:03,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 208060416. Throughput: 0: 43984.0. Samples: 110986320. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 14:25:03,850][06674] Avg episode reward: [(0, '0.065')] [2024-06-27 14:25:04,341][06909] Updated weights for policy 0, policy_version 12702 (0.0036) [2024-06-27 14:25:06,797][06887] Signal inference workers to stop experience collection... (1550 times) [2024-06-27 14:25:06,799][06887] Signal inference workers to resume experience collection... (1550 times) [2024-06-27 14:25:06,816][06909] InferenceWorker_p0-w0: stopping experience collection (1550 times) [2024-06-27 14:25:06,850][06909] InferenceWorker_p0-w0: resuming experience collection (1550 times) [2024-06-27 14:25:08,193][06909] Updated weights for policy 0, policy_version 12712 (0.0035) [2024-06-27 14:25:08,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43417.5, 300 sec: 44042.4). Total num frames: 208273408. Throughput: 0: 44086.2. Samples: 111251620. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 14:25:08,851][06674] Avg episode reward: [(0, '0.065')] [2024-06-27 14:25:11,631][06909] Updated weights for policy 0, policy_version 12722 (0.0032) [2024-06-27 14:25:13,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 208486400. Throughput: 0: 43886.3. Samples: 111387560. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 14:25:13,850][06674] Avg episode reward: [(0, '0.069')] [2024-06-27 14:25:13,851][06887] Saving new best policy, reward=0.069! [2024-06-27 14:25:15,563][06909] Updated weights for policy 0, policy_version 12732 (0.0026) [2024-06-27 14:25:18,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 208732160. Throughput: 0: 43985.7. Samples: 111649780. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 14:25:18,850][06674] Avg episode reward: [(0, '0.069')] [2024-06-27 14:25:19,607][06909] Updated weights for policy 0, policy_version 12742 (0.0047) [2024-06-27 14:25:23,075][06909] Updated weights for policy 0, policy_version 12752 (0.0036) [2024-06-27 14:25:23,850][06674] Fps is (10 sec: 45875.8, 60 sec: 43690.8, 300 sec: 44042.4). Total num frames: 208945152. Throughput: 0: 44043.3. Samples: 111913640. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 14:25:23,850][06674] Avg episode reward: [(0, '0.073')] [2024-06-27 14:25:24,040][06887] Saving new best policy, reward=0.073! [2024-06-27 14:25:27,046][06909] Updated weights for policy 0, policy_version 12762 (0.0035) [2024-06-27 14:25:28,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 209141760. Throughput: 0: 43797.2. Samples: 112045340. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 14:25:28,850][06674] Avg episode reward: [(0, '0.068')] [2024-06-27 14:25:30,623][06909] Updated weights for policy 0, policy_version 12772 (0.0039) [2024-06-27 14:25:33,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 209387520. Throughput: 0: 44185.8. Samples: 112310300. Policy #0 lag: (min: 1.0, avg: 10.4, max: 21.0) [2024-06-27 14:25:33,850][06674] Avg episode reward: [(0, '0.061')] [2024-06-27 14:25:34,405][06909] Updated weights for policy 0, policy_version 12782 (0.0042) [2024-06-27 14:25:38,150][06909] Updated weights for policy 0, policy_version 12792 (0.0035) [2024-06-27 14:25:38,850][06674] Fps is (10 sec: 49151.5, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 209633280. Throughput: 0: 44114.2. Samples: 112574480. Policy #0 lag: (min: 1.0, avg: 10.4, max: 21.0) [2024-06-27 14:25:38,850][06674] Avg episode reward: [(0, '0.068')] [2024-06-27 14:25:41,720][06909] Updated weights for policy 0, policy_version 12802 (0.0038) [2024-06-27 14:25:43,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 209829888. Throughput: 0: 44031.5. Samples: 112710740. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 14:25:43,850][06674] Avg episode reward: [(0, '0.070')] [2024-06-27 14:25:45,514][06909] Updated weights for policy 0, policy_version 12812 (0.0031) [2024-06-27 14:25:48,850][06674] Fps is (10 sec: 40959.8, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 210042880. Throughput: 0: 44152.3. Samples: 112973180. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 14:25:48,850][06674] Avg episode reward: [(0, '0.062')] [2024-06-27 14:25:48,866][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000012820_210042880.pth... [2024-06-27 14:25:48,939][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000012174_199458816.pth [2024-06-27 14:25:49,245][06909] Updated weights for policy 0, policy_version 12822 (0.0036) [2024-06-27 14:25:52,892][06909] Updated weights for policy 0, policy_version 12832 (0.0032) [2024-06-27 14:25:53,854][06674] Fps is (10 sec: 45855.7, 60 sec: 44506.8, 300 sec: 44208.4). Total num frames: 210288640. Throughput: 0: 44183.0. Samples: 113240040. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-27 14:25:53,854][06674] Avg episode reward: [(0, '0.066')] [2024-06-27 14:25:56,581][06909] Updated weights for policy 0, policy_version 12842 (0.0041) [2024-06-27 14:25:58,850][06674] Fps is (10 sec: 44237.5, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 210485248. Throughput: 0: 44142.2. Samples: 113373960. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-27 14:25:58,850][06674] Avg episode reward: [(0, '0.081')] [2024-06-27 14:25:58,911][06887] Saving new best policy, reward=0.081! [2024-06-27 14:26:00,649][06909] Updated weights for policy 0, policy_version 12852 (0.0033) [2024-06-27 14:26:03,850][06674] Fps is (10 sec: 42617.0, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 210714624. Throughput: 0: 44090.4. Samples: 113633840. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 14:26:03,850][06674] Avg episode reward: [(0, '0.077')] [2024-06-27 14:26:03,930][06909] Updated weights for policy 0, policy_version 12862 (0.0030) [2024-06-27 14:26:07,977][06909] Updated weights for policy 0, policy_version 12872 (0.0034) [2024-06-27 14:26:08,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 210927616. Throughput: 0: 44166.1. Samples: 113901120. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 14:26:08,850][06674] Avg episode reward: [(0, '0.065')] [2024-06-27 14:26:11,446][06909] Updated weights for policy 0, policy_version 12882 (0.0027) [2024-06-27 14:26:13,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 211156992. Throughput: 0: 44188.0. Samples: 114033800. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-27 14:26:13,853][06674] Avg episode reward: [(0, '0.065')] [2024-06-27 14:26:15,365][06909] Updated weights for policy 0, policy_version 12892 (0.0041) [2024-06-27 14:26:18,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 211369984. Throughput: 0: 44111.9. Samples: 114295340. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-27 14:26:18,850][06674] Avg episode reward: [(0, '0.074')] [2024-06-27 14:26:18,873][06909] Updated weights for policy 0, policy_version 12902 (0.0038) [2024-06-27 14:26:22,799][06909] Updated weights for policy 0, policy_version 12912 (0.0051) [2024-06-27 14:26:23,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.7, 300 sec: 44153.8). Total num frames: 211599360. Throughput: 0: 44121.5. Samples: 114559940. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 14:26:23,850][06674] Avg episode reward: [(0, '0.084')] [2024-06-27 14:26:23,850][06887] Saving new best policy, reward=0.084! [2024-06-27 14:26:26,344][06909] Updated weights for policy 0, policy_version 12922 (0.0038) [2024-06-27 14:26:28,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44783.0, 300 sec: 44153.5). Total num frames: 211828736. Throughput: 0: 43947.6. Samples: 114688380. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 14:26:28,850][06674] Avg episode reward: [(0, '0.083')] [2024-06-27 14:26:30,196][06909] Updated weights for policy 0, policy_version 12932 (0.0044) [2024-06-27 14:26:33,663][06909] Updated weights for policy 0, policy_version 12942 (0.0040) [2024-06-27 14:26:33,850][06674] Fps is (10 sec: 44236.2, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 212041728. Throughput: 0: 44009.4. Samples: 114953600. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-27 14:26:33,850][06674] Avg episode reward: [(0, '0.085')] [2024-06-27 14:26:33,851][06887] Saving new best policy, reward=0.085! [2024-06-27 14:26:37,728][06909] Updated weights for policy 0, policy_version 12952 (0.0042) [2024-06-27 14:26:38,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.8, 300 sec: 44153.5). Total num frames: 212254720. Throughput: 0: 44001.1. Samples: 115219900. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-27 14:26:38,850][06674] Avg episode reward: [(0, '0.081')] [2024-06-27 14:26:40,986][06909] Updated weights for policy 0, policy_version 12962 (0.0034) [2024-06-27 14:26:43,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 212467712. Throughput: 0: 43848.5. Samples: 115347140. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 14:26:43,850][06674] Avg episode reward: [(0, '0.078')] [2024-06-27 14:26:45,121][06909] Updated weights for policy 0, policy_version 12972 (0.0033) [2024-06-27 14:26:47,520][06887] Signal inference workers to stop experience collection... (1600 times) [2024-06-27 14:26:47,521][06887] Signal inference workers to resume experience collection... (1600 times) [2024-06-27 14:26:47,545][06909] InferenceWorker_p0-w0: stopping experience collection (1600 times) [2024-06-27 14:26:47,545][06909] InferenceWorker_p0-w0: resuming experience collection (1600 times) [2024-06-27 14:26:48,339][06909] Updated weights for policy 0, policy_version 12982 (0.0033) [2024-06-27 14:26:48,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44236.9, 300 sec: 44097.9). Total num frames: 212697088. Throughput: 0: 44024.7. Samples: 115614960. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 14:26:48,850][06674] Avg episode reward: [(0, '0.086')] [2024-06-27 14:26:52,547][06909] Updated weights for policy 0, policy_version 12992 (0.0046) [2024-06-27 14:26:53,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43693.8, 300 sec: 44097.9). Total num frames: 212910080. Throughput: 0: 43943.1. Samples: 115878560. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 14:26:53,854][06674] Avg episode reward: [(0, '0.085')] [2024-06-27 14:26:55,733][06909] Updated weights for policy 0, policy_version 13002 (0.0032) [2024-06-27 14:26:58,850][06674] Fps is (10 sec: 45875.7, 60 sec: 44509.9, 300 sec: 44209.0). Total num frames: 213155840. Throughput: 0: 43979.1. Samples: 116012860. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 14:26:58,850][06674] Avg episode reward: [(0, '0.096')] [2024-06-27 14:26:58,868][06887] Saving new best policy, reward=0.096! [2024-06-27 14:26:59,843][06909] Updated weights for policy 0, policy_version 13012 (0.0035) [2024-06-27 14:27:03,431][06909] Updated weights for policy 0, policy_version 13022 (0.0042) [2024-06-27 14:27:03,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 213352448. Throughput: 0: 44012.5. Samples: 116275900. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 14:27:03,850][06674] Avg episode reward: [(0, '0.109')] [2024-06-27 14:27:03,971][06887] Saving new best policy, reward=0.109! [2024-06-27 14:27:07,781][06909] Updated weights for policy 0, policy_version 13032 (0.0028) [2024-06-27 14:27:08,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 213565440. Throughput: 0: 43884.0. Samples: 116534720. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 14:27:08,850][06674] Avg episode reward: [(0, '0.105')] [2024-06-27 14:27:10,905][06909] Updated weights for policy 0, policy_version 13042 (0.0046) [2024-06-27 14:27:13,856][06674] Fps is (10 sec: 44210.1, 60 sec: 43959.3, 300 sec: 44097.1). Total num frames: 213794816. Throughput: 0: 43875.0. Samples: 116663020. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 14:27:13,857][06674] Avg episode reward: [(0, '0.090')] [2024-06-27 14:27:15,113][06909] Updated weights for policy 0, policy_version 13052 (0.0028) [2024-06-27 14:27:18,432][06909] Updated weights for policy 0, policy_version 13062 (0.0033) [2024-06-27 14:27:18,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 214007808. Throughput: 0: 44044.5. Samples: 116935600. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 14:27:18,850][06674] Avg episode reward: [(0, '0.101')] [2024-06-27 14:27:22,430][06909] Updated weights for policy 0, policy_version 13072 (0.0026) [2024-06-27 14:27:23,850][06674] Fps is (10 sec: 42623.8, 60 sec: 43690.6, 300 sec: 44042.7). Total num frames: 214220800. Throughput: 0: 43864.8. Samples: 117193820. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-27 14:27:23,850][06674] Avg episode reward: [(0, '0.110')] [2024-06-27 14:27:25,709][06909] Updated weights for policy 0, policy_version 13082 (0.0035) [2024-06-27 14:27:28,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 214450176. Throughput: 0: 44054.6. Samples: 117329600. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-27 14:27:28,850][06674] Avg episode reward: [(0, '0.109')] [2024-06-27 14:27:29,719][06909] Updated weights for policy 0, policy_version 13092 (0.0039) [2024-06-27 14:27:33,095][06909] Updated weights for policy 0, policy_version 13102 (0.0039) [2024-06-27 14:27:33,852][06674] Fps is (10 sec: 45866.2, 60 sec: 43962.3, 300 sec: 44097.7). Total num frames: 214679552. Throughput: 0: 44014.1. Samples: 117595680. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-27 14:27:33,852][06674] Avg episode reward: [(0, '0.100')] [2024-06-27 14:27:37,054][06909] Updated weights for policy 0, policy_version 13112 (0.0029) [2024-06-27 14:27:38,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 214876160. Throughput: 0: 44119.6. Samples: 117863940. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-27 14:27:38,850][06674] Avg episode reward: [(0, '0.108')] [2024-06-27 14:27:40,796][06909] Updated weights for policy 0, policy_version 13122 (0.0036) [2024-06-27 14:27:43,850][06674] Fps is (10 sec: 42607.2, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 215105536. Throughput: 0: 43992.5. Samples: 117992520. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 14:27:43,850][06674] Avg episode reward: [(0, '0.108')] [2024-06-27 14:27:44,351][06909] Updated weights for policy 0, policy_version 13132 (0.0034) [2024-06-27 14:27:48,167][06909] Updated weights for policy 0, policy_version 13142 (0.0037) [2024-06-27 14:27:48,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.8, 300 sec: 43987.2). Total num frames: 215334912. Throughput: 0: 44105.8. Samples: 118260660. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 14:27:48,850][06674] Avg episode reward: [(0, '0.113')] [2024-06-27 14:27:48,874][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000013143_215334912.pth... [2024-06-27 14:27:48,940][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000012498_204767232.pth [2024-06-27 14:27:48,945][06887] Saving new best policy, reward=0.113! [2024-06-27 14:27:51,826][06909] Updated weights for policy 0, policy_version 13152 (0.0026) [2024-06-27 14:27:53,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 215547904. Throughput: 0: 44230.2. Samples: 118525080. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-27 14:27:53,850][06674] Avg episode reward: [(0, '0.104')] [2024-06-27 14:27:55,558][06909] Updated weights for policy 0, policy_version 13162 (0.0027) [2024-06-27 14:27:58,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 215777280. Throughput: 0: 44298.3. Samples: 118656180. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-27 14:27:58,851][06674] Avg episode reward: [(0, '0.093')] [2024-06-27 14:27:59,191][06909] Updated weights for policy 0, policy_version 13172 (0.0033) [2024-06-27 14:28:02,985][06909] Updated weights for policy 0, policy_version 13182 (0.0035) [2024-06-27 14:28:03,852][06674] Fps is (10 sec: 45865.6, 60 sec: 44235.3, 300 sec: 44042.1). Total num frames: 216006656. Throughput: 0: 44164.7. Samples: 118923100. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 14:28:03,852][06674] Avg episode reward: [(0, '0.100')] [2024-06-27 14:28:06,517][06909] Updated weights for policy 0, policy_version 13192 (0.0021) [2024-06-27 14:28:08,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 216219648. Throughput: 0: 44348.9. Samples: 119189520. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 14:28:08,850][06674] Avg episode reward: [(0, '0.100')] [2024-06-27 14:28:10,414][06909] Updated weights for policy 0, policy_version 13202 (0.0032) [2024-06-27 14:28:13,850][06674] Fps is (10 sec: 44245.9, 60 sec: 44241.3, 300 sec: 44098.0). Total num frames: 216449024. Throughput: 0: 44279.2. Samples: 119322160. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-27 14:28:13,850][06674] Avg episode reward: [(0, '0.099')] [2024-06-27 14:28:13,891][06909] Updated weights for policy 0, policy_version 13212 (0.0032) [2024-06-27 14:28:17,796][06909] Updated weights for policy 0, policy_version 13222 (0.0029) [2024-06-27 14:28:18,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 216662016. Throughput: 0: 44273.5. Samples: 119587900. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-27 14:28:18,854][06674] Avg episode reward: [(0, '0.094')] [2024-06-27 14:28:21,054][06887] Signal inference workers to stop experience collection... (1650 times) [2024-06-27 14:28:21,058][06887] Signal inference workers to resume experience collection... (1650 times) [2024-06-27 14:28:21,104][06909] InferenceWorker_p0-w0: stopping experience collection (1650 times) [2024-06-27 14:28:21,104][06909] InferenceWorker_p0-w0: resuming experience collection (1650 times) [2024-06-27 14:28:21,336][06909] Updated weights for policy 0, policy_version 13232 (0.0038) [2024-06-27 14:28:23,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44236.8, 300 sec: 43987.2). Total num frames: 216875008. Throughput: 0: 44114.6. Samples: 119849100. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2024-06-27 14:28:23,850][06674] Avg episode reward: [(0, '0.107')] [2024-06-27 14:28:25,277][06909] Updated weights for policy 0, policy_version 13242 (0.0037) [2024-06-27 14:28:28,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 217104384. Throughput: 0: 44283.0. Samples: 119985260. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2024-06-27 14:28:28,850][06674] Avg episode reward: [(0, '0.106')] [2024-06-27 14:28:29,053][06909] Updated weights for policy 0, policy_version 13252 (0.0027) [2024-06-27 14:28:32,606][06909] Updated weights for policy 0, policy_version 13262 (0.0029) [2024-06-27 14:28:33,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44238.3, 300 sec: 44042.4). Total num frames: 217333760. Throughput: 0: 44228.5. Samples: 120250940. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 14:28:33,850][06674] Avg episode reward: [(0, '0.102')] [2024-06-27 14:28:36,447][06909] Updated weights for policy 0, policy_version 13272 (0.0045) [2024-06-27 14:28:38,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 217546752. Throughput: 0: 44233.2. Samples: 120515580. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 14:28:38,850][06674] Avg episode reward: [(0, '0.102')] [2024-06-27 14:28:40,111][06909] Updated weights for policy 0, policy_version 13282 (0.0029) [2024-06-27 14:28:43,763][06909] Updated weights for policy 0, policy_version 13292 (0.0040) [2024-06-27 14:28:43,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44509.8, 300 sec: 44098.0). Total num frames: 217776128. Throughput: 0: 44146.3. Samples: 120642760. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 14:28:43,851][06674] Avg episode reward: [(0, '0.107')] [2024-06-27 14:28:47,323][06909] Updated weights for policy 0, policy_version 13302 (0.0037) [2024-06-27 14:28:48,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44509.8, 300 sec: 44097.9). Total num frames: 218005504. Throughput: 0: 44233.1. Samples: 120913500. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 14:28:48,850][06674] Avg episode reward: [(0, '0.109')] [2024-06-27 14:28:51,055][06909] Updated weights for policy 0, policy_version 13312 (0.0034) [2024-06-27 14:28:53,856][06674] Fps is (10 sec: 42572.7, 60 sec: 44232.3, 300 sec: 44041.5). Total num frames: 218202112. Throughput: 0: 44366.9. Samples: 121186300. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 14:28:53,857][06674] Avg episode reward: [(0, '0.101')] [2024-06-27 14:28:54,803][06909] Updated weights for policy 0, policy_version 13322 (0.0033) [2024-06-27 14:28:58,477][06909] Updated weights for policy 0, policy_version 13332 (0.0046) [2024-06-27 14:28:58,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 218447872. Throughput: 0: 44210.2. Samples: 121311620. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 14:28:58,850][06674] Avg episode reward: [(0, '0.107')] [2024-06-27 14:29:02,081][06909] Updated weights for policy 0, policy_version 13342 (0.0038) [2024-06-27 14:29:03,850][06674] Fps is (10 sec: 45903.1, 60 sec: 44238.3, 300 sec: 44042.4). Total num frames: 218660864. Throughput: 0: 44333.8. Samples: 121582920. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-27 14:29:03,850][06674] Avg episode reward: [(0, '0.137')] [2024-06-27 14:29:03,889][06887] Saving new best policy, reward=0.137! [2024-06-27 14:29:05,808][06909] Updated weights for policy 0, policy_version 13352 (0.0028) [2024-06-27 14:29:08,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 218873856. Throughput: 0: 44411.1. Samples: 121847600. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-27 14:29:08,850][06674] Avg episode reward: [(0, '0.143')] [2024-06-27 14:29:08,996][06887] Saving new best policy, reward=0.143! [2024-06-27 14:29:09,453][06909] Updated weights for policy 0, policy_version 13362 (0.0036) [2024-06-27 14:29:13,275][06909] Updated weights for policy 0, policy_version 13372 (0.0022) [2024-06-27 14:29:13,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 219086848. Throughput: 0: 44121.4. Samples: 121970720. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-27 14:29:13,850][06674] Avg episode reward: [(0, '0.141')] [2024-06-27 14:29:17,202][06909] Updated weights for policy 0, policy_version 13382 (0.0038) [2024-06-27 14:29:18,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 219332608. Throughput: 0: 44160.9. Samples: 122238180. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-27 14:29:18,850][06674] Avg episode reward: [(0, '0.123')] [2024-06-27 14:29:20,589][06909] Updated weights for policy 0, policy_version 13392 (0.0030) [2024-06-27 14:29:23,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 219529216. Throughput: 0: 44310.3. Samples: 122509540. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 14:29:23,850][06674] Avg episode reward: [(0, '0.139')] [2024-06-27 14:29:24,654][06909] Updated weights for policy 0, policy_version 13402 (0.0021) [2024-06-27 14:29:28,047][06909] Updated weights for policy 0, policy_version 13412 (0.0028) [2024-06-27 14:29:28,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 219758592. Throughput: 0: 44157.4. Samples: 122629840. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 14:29:28,850][06674] Avg episode reward: [(0, '0.138')] [2024-06-27 14:29:31,969][06909] Updated weights for policy 0, policy_version 13422 (0.0034) [2024-06-27 14:29:33,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 220004352. Throughput: 0: 44168.9. Samples: 122901100. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 14:29:33,850][06674] Avg episode reward: [(0, '0.121')] [2024-06-27 14:29:35,394][06909] Updated weights for policy 0, policy_version 13432 (0.0051) [2024-06-27 14:29:38,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 220168192. Throughput: 0: 43948.9. Samples: 123163740. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 14:29:38,850][06674] Avg episode reward: [(0, '0.126')] [2024-06-27 14:29:39,035][06887] Signal inference workers to stop experience collection... (1700 times) [2024-06-27 14:29:39,040][06887] Signal inference workers to resume experience collection... (1700 times) [2024-06-27 14:29:39,083][06909] InferenceWorker_p0-w0: stopping experience collection (1700 times) [2024-06-27 14:29:39,083][06909] InferenceWorker_p0-w0: resuming experience collection (1700 times) [2024-06-27 14:29:39,585][06909] Updated weights for policy 0, policy_version 13442 (0.0026) [2024-06-27 14:29:42,740][06909] Updated weights for policy 0, policy_version 13452 (0.0030) [2024-06-27 14:29:43,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 220413952. Throughput: 0: 43941.3. Samples: 123288980. Policy #0 lag: (min: 0.0, avg: 11.8, max: 24.0) [2024-06-27 14:29:43,852][06674] Avg episode reward: [(0, '0.139')] [2024-06-27 14:29:46,952][06909] Updated weights for policy 0, policy_version 13462 (0.0029) [2024-06-27 14:29:48,850][06674] Fps is (10 sec: 49152.6, 60 sec: 44236.8, 300 sec: 44209.1). Total num frames: 220659712. Throughput: 0: 43988.9. Samples: 123562420. Policy #0 lag: (min: 0.0, avg: 11.8, max: 24.0) [2024-06-27 14:29:48,850][06674] Avg episode reward: [(0, '0.132')] [2024-06-27 14:29:48,865][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000013468_220659712.pth... [2024-06-27 14:29:48,915][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000012820_210042880.pth [2024-06-27 14:29:50,126][06909] Updated weights for policy 0, policy_version 13472 (0.0033) [2024-06-27 14:29:53,852][06674] Fps is (10 sec: 44228.0, 60 sec: 44239.8, 300 sec: 44153.2). Total num frames: 220856320. Throughput: 0: 44092.3. Samples: 123831840. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-27 14:29:53,852][06674] Avg episode reward: [(0, '0.143')] [2024-06-27 14:29:54,448][06909] Updated weights for policy 0, policy_version 13482 (0.0034) [2024-06-27 14:29:57,494][06909] Updated weights for policy 0, policy_version 13492 (0.0022) [2024-06-27 14:29:58,850][06674] Fps is (10 sec: 40959.2, 60 sec: 43690.5, 300 sec: 44097.9). Total num frames: 221069312. Throughput: 0: 44126.0. Samples: 123956400. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-27 14:29:58,850][06674] Avg episode reward: [(0, '0.143')] [2024-06-27 14:30:01,878][06909] Updated weights for policy 0, policy_version 13502 (0.0033) [2024-06-27 14:30:03,850][06674] Fps is (10 sec: 45884.7, 60 sec: 44236.8, 300 sec: 44209.1). Total num frames: 221315072. Throughput: 0: 44146.7. Samples: 124224780. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 14:30:03,850][06674] Avg episode reward: [(0, '0.139')] [2024-06-27 14:30:05,306][06909] Updated weights for policy 0, policy_version 13512 (0.0046) [2024-06-27 14:30:08,850][06674] Fps is (10 sec: 44237.7, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 221511680. Throughput: 0: 44039.6. Samples: 124491320. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 14:30:08,850][06674] Avg episode reward: [(0, '0.139')] [2024-06-27 14:30:09,212][06909] Updated weights for policy 0, policy_version 13522 (0.0034) [2024-06-27 14:30:12,657][06909] Updated weights for policy 0, policy_version 13532 (0.0040) [2024-06-27 14:30:13,850][06674] Fps is (10 sec: 42598.1, 60 sec: 44236.7, 300 sec: 44098.0). Total num frames: 221741056. Throughput: 0: 44265.7. Samples: 124621800. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 14:30:13,852][06674] Avg episode reward: [(0, '0.130')] [2024-06-27 14:30:16,577][06909] Updated weights for policy 0, policy_version 13542 (0.0023) [2024-06-27 14:30:18,850][06674] Fps is (10 sec: 47513.2, 60 sec: 44236.7, 300 sec: 44209.0). Total num frames: 221986816. Throughput: 0: 44152.8. Samples: 124887980. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 14:30:18,851][06674] Avg episode reward: [(0, '0.129')] [2024-06-27 14:30:20,103][06909] Updated weights for policy 0, policy_version 13552 (0.0035) [2024-06-27 14:30:23,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 222183424. Throughput: 0: 44290.7. Samples: 125156820. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-27 14:30:23,850][06674] Avg episode reward: [(0, '0.137')] [2024-06-27 14:30:24,002][06909] Updated weights for policy 0, policy_version 13562 (0.0035) [2024-06-27 14:30:27,444][06909] Updated weights for policy 0, policy_version 13572 (0.0044) [2024-06-27 14:30:28,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 222396416. Throughput: 0: 44244.9. Samples: 125280000. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-27 14:30:28,850][06674] Avg episode reward: [(0, '0.167')] [2024-06-27 14:30:28,864][06887] Saving new best policy, reward=0.167! [2024-06-27 14:30:31,559][06909] Updated weights for policy 0, policy_version 13582 (0.0027) [2024-06-27 14:30:33,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 222642176. Throughput: 0: 44063.1. Samples: 125545260. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 14:30:33,851][06674] Avg episode reward: [(0, '0.167')] [2024-06-27 14:30:34,829][06909] Updated weights for policy 0, policy_version 13592 (0.0036) [2024-06-27 14:30:38,850][06674] Fps is (10 sec: 44237.5, 60 sec: 44510.0, 300 sec: 44098.0). Total num frames: 222838784. Throughput: 0: 44100.3. Samples: 125816260. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 14:30:38,850][06674] Avg episode reward: [(0, '0.167')] [2024-06-27 14:30:39,075][06909] Updated weights for policy 0, policy_version 13602 (0.0038) [2024-06-27 14:30:42,297][06909] Updated weights for policy 0, policy_version 13612 (0.0036) [2024-06-27 14:30:43,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44509.9, 300 sec: 44209.0). Total num frames: 223084544. Throughput: 0: 44198.8. Samples: 125945340. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 14:30:43,850][06674] Avg episode reward: [(0, '0.167')] [2024-06-27 14:30:46,398][06909] Updated weights for policy 0, policy_version 13622 (0.0029) [2024-06-27 14:30:48,852][06674] Fps is (10 sec: 44228.7, 60 sec: 43689.4, 300 sec: 44042.8). Total num frames: 223281152. Throughput: 0: 44126.3. Samples: 126210540. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 14:30:48,852][06674] Avg episode reward: [(0, '0.166')] [2024-06-27 14:30:49,966][06909] Updated weights for policy 0, policy_version 13632 (0.0034) [2024-06-27 14:30:53,737][06909] Updated weights for policy 0, policy_version 13642 (0.0032) [2024-06-27 14:30:53,850][06674] Fps is (10 sec: 42599.0, 60 sec: 44238.4, 300 sec: 44153.5). Total num frames: 223510528. Throughput: 0: 44092.5. Samples: 126475480. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 14:30:53,850][06674] Avg episode reward: [(0, '0.155')] [2024-06-27 14:30:57,301][06909] Updated weights for policy 0, policy_version 13652 (0.0041) [2024-06-27 14:30:58,850][06674] Fps is (10 sec: 45883.4, 60 sec: 44510.0, 300 sec: 44153.5). Total num frames: 223739904. Throughput: 0: 44277.4. Samples: 126614280. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 14:30:58,850][06674] Avg episode reward: [(0, '0.165')] [2024-06-27 14:31:01,160][06909] Updated weights for policy 0, policy_version 13662 (0.0035) [2024-06-27 14:31:03,850][06674] Fps is (10 sec: 44236.0, 60 sec: 43963.6, 300 sec: 44153.5). Total num frames: 223952896. Throughput: 0: 44066.6. Samples: 126870980. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-27 14:31:03,850][06674] Avg episode reward: [(0, '0.173')] [2024-06-27 14:31:03,851][06887] Saving new best policy, reward=0.173! [2024-06-27 14:31:04,800][06909] Updated weights for policy 0, policy_version 13672 (0.0025) [2024-06-27 14:31:04,810][06887] Signal inference workers to stop experience collection... (1750 times) [2024-06-27 14:31:04,810][06887] Signal inference workers to resume experience collection... (1750 times) [2024-06-27 14:31:04,820][06909] InferenceWorker_p0-w0: stopping experience collection (1750 times) [2024-06-27 14:31:04,821][06909] InferenceWorker_p0-w0: resuming experience collection (1750 times) [2024-06-27 14:31:08,588][06909] Updated weights for policy 0, policy_version 13682 (0.0040) [2024-06-27 14:31:08,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 224165888. Throughput: 0: 44075.6. Samples: 127140220. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-27 14:31:08,850][06674] Avg episode reward: [(0, '0.175')] [2024-06-27 14:31:08,896][06887] Saving new best policy, reward=0.175! [2024-06-27 14:31:12,174][06909] Updated weights for policy 0, policy_version 13692 (0.0036) [2024-06-27 14:31:13,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 224395264. Throughput: 0: 44316.0. Samples: 127274220. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-27 14:31:13,850][06674] Avg episode reward: [(0, '0.173')] [2024-06-27 14:31:16,165][06909] Updated weights for policy 0, policy_version 13702 (0.0034) [2024-06-27 14:31:18,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43417.7, 300 sec: 44042.4). Total num frames: 224591872. Throughput: 0: 44102.2. Samples: 127529860. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 14:31:18,850][06674] Avg episode reward: [(0, '0.170')] [2024-06-27 14:31:19,502][06909] Updated weights for policy 0, policy_version 13712 (0.0031) [2024-06-27 14:31:23,429][06909] Updated weights for policy 0, policy_version 13722 (0.0027) [2024-06-27 14:31:23,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 224854016. Throughput: 0: 44206.0. Samples: 127805540. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 14:31:23,850][06674] Avg episode reward: [(0, '0.161')] [2024-06-27 14:31:27,044][06909] Updated weights for policy 0, policy_version 13732 (0.0043) [2024-06-27 14:31:28,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 225067008. Throughput: 0: 44288.1. Samples: 127938300. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-27 14:31:28,850][06674] Avg episode reward: [(0, '0.170')] [2024-06-27 14:31:30,824][06909] Updated weights for policy 0, policy_version 13742 (0.0038) [2024-06-27 14:31:33,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43690.6, 300 sec: 44097.9). Total num frames: 225263616. Throughput: 0: 44189.2. Samples: 128198980. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-27 14:31:33,850][06674] Avg episode reward: [(0, '0.176')] [2024-06-27 14:31:34,599][06909] Updated weights for policy 0, policy_version 13752 (0.0034) [2024-06-27 14:31:38,276][06909] Updated weights for policy 0, policy_version 13762 (0.0030) [2024-06-27 14:31:38,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44509.8, 300 sec: 44209.0). Total num frames: 225509376. Throughput: 0: 44192.8. Samples: 128464160. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-27 14:31:38,850][06674] Avg episode reward: [(0, '0.189')] [2024-06-27 14:31:38,919][06887] Saving new best policy, reward=0.189! [2024-06-27 14:31:41,930][06909] Updated weights for policy 0, policy_version 13772 (0.0035) [2024-06-27 14:31:43,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 225722368. Throughput: 0: 44062.1. Samples: 128597080. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-27 14:31:43,850][06674] Avg episode reward: [(0, '0.182')] [2024-06-27 14:31:45,738][06909] Updated weights for policy 0, policy_version 13782 (0.0034) [2024-06-27 14:31:48,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44238.1, 300 sec: 44153.5). Total num frames: 225935360. Throughput: 0: 44147.7. Samples: 128857620. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2024-06-27 14:31:48,850][06674] Avg episode reward: [(0, '0.173')] [2024-06-27 14:31:48,862][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000013790_225935360.pth... [2024-06-27 14:31:48,930][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000013143_215334912.pth [2024-06-27 14:31:49,241][06909] Updated weights for policy 0, policy_version 13792 (0.0040) [2024-06-27 14:31:53,044][06909] Updated weights for policy 0, policy_version 13802 (0.0032) [2024-06-27 14:31:53,850][06674] Fps is (10 sec: 45875.7, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 226181120. Throughput: 0: 43968.9. Samples: 129118820. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2024-06-27 14:31:53,850][06674] Avg episode reward: [(0, '0.167')] [2024-06-27 14:31:57,095][06909] Updated weights for policy 0, policy_version 13812 (0.0023) [2024-06-27 14:31:58,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.6, 300 sec: 44153.5). Total num frames: 226377728. Throughput: 0: 44075.5. Samples: 129257620. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 14:31:58,851][06674] Avg episode reward: [(0, '0.196')] [2024-06-27 14:31:58,981][06887] Saving new best policy, reward=0.196! [2024-06-27 14:32:00,459][06909] Updated weights for policy 0, policy_version 13822 (0.0035) [2024-06-27 14:32:03,850][06674] Fps is (10 sec: 40959.4, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 226590720. Throughput: 0: 44084.8. Samples: 129513680. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 14:32:03,850][06674] Avg episode reward: [(0, '0.184')] [2024-06-27 14:32:04,506][06909] Updated weights for policy 0, policy_version 13832 (0.0033) [2024-06-27 14:32:07,838][06909] Updated weights for policy 0, policy_version 13842 (0.0042) [2024-06-27 14:32:08,852][06674] Fps is (10 sec: 44228.2, 60 sec: 44235.3, 300 sec: 44154.1). Total num frames: 226820096. Throughput: 0: 43830.1. Samples: 129777980. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-27 14:32:08,852][06674] Avg episode reward: [(0, '0.191')] [2024-06-27 14:32:11,831][06909] Updated weights for policy 0, policy_version 13852 (0.0041) [2024-06-27 14:32:13,850][06674] Fps is (10 sec: 45874.6, 60 sec: 44236.7, 300 sec: 44209.0). Total num frames: 227049472. Throughput: 0: 43982.4. Samples: 129917520. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-27 14:32:13,851][06674] Avg episode reward: [(0, '0.191')] [2024-06-27 14:32:15,059][06909] Updated weights for policy 0, policy_version 13862 (0.0024) [2024-06-27 14:32:18,850][06674] Fps is (10 sec: 42606.6, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 227246080. Throughput: 0: 44043.5. Samples: 130180940. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 14:32:18,850][06674] Avg episode reward: [(0, '0.178')] [2024-06-27 14:32:19,352][06909] Updated weights for policy 0, policy_version 13872 (0.0035) [2024-06-27 14:32:22,477][06909] Updated weights for policy 0, policy_version 13882 (0.0027) [2024-06-27 14:32:23,850][06674] Fps is (10 sec: 45876.2, 60 sec: 44236.9, 300 sec: 44264.6). Total num frames: 227508224. Throughput: 0: 44059.1. Samples: 130446820. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 14:32:23,850][06674] Avg episode reward: [(0, '0.167')] [2024-06-27 14:32:27,016][06909] Updated weights for policy 0, policy_version 13892 (0.0025) [2024-06-27 14:32:27,615][06887] Signal inference workers to stop experience collection... (1800 times) [2024-06-27 14:32:27,664][06909] InferenceWorker_p0-w0: stopping experience collection (1800 times) [2024-06-27 14:32:27,674][06887] Signal inference workers to resume experience collection... (1800 times) [2024-06-27 14:32:27,682][06909] InferenceWorker_p0-w0: resuming experience collection (1800 times) [2024-06-27 14:32:28,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43963.7, 300 sec: 44153.8). Total num frames: 227704832. Throughput: 0: 44038.7. Samples: 130578820. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 14:32:28,850][06674] Avg episode reward: [(0, '0.192')] [2024-06-27 14:32:29,970][06909] Updated weights for policy 0, policy_version 13902 (0.0034) [2024-06-27 14:32:33,850][06674] Fps is (10 sec: 40959.9, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 227917824. Throughput: 0: 44065.3. Samples: 130840560. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 14:32:33,850][06674] Avg episode reward: [(0, '0.192')] [2024-06-27 14:32:34,507][06909] Updated weights for policy 0, policy_version 13912 (0.0049) [2024-06-27 14:32:37,187][06909] Updated weights for policy 0, policy_version 13922 (0.0030) [2024-06-27 14:32:38,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.8, 300 sec: 44264.6). Total num frames: 228163584. Throughput: 0: 44126.2. Samples: 131104500. Policy #0 lag: (min: 0.0, avg: 10.9, max: 20.0) [2024-06-27 14:32:38,850][06674] Avg episode reward: [(0, '0.187')] [2024-06-27 14:32:41,838][06909] Updated weights for policy 0, policy_version 13932 (0.0035) [2024-06-27 14:32:43,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 228360192. Throughput: 0: 44257.9. Samples: 131249220. Policy #0 lag: (min: 0.0, avg: 10.9, max: 20.0) [2024-06-27 14:32:43,850][06674] Avg episode reward: [(0, '0.202')] [2024-06-27 14:32:43,922][06887] Saving new best policy, reward=0.202! [2024-06-27 14:32:44,731][06909] Updated weights for policy 0, policy_version 13942 (0.0041) [2024-06-27 14:32:48,850][06674] Fps is (10 sec: 39321.2, 60 sec: 43690.6, 300 sec: 44097.9). Total num frames: 228556800. Throughput: 0: 44198.3. Samples: 131502600. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-27 14:32:48,850][06674] Avg episode reward: [(0, '0.194')] [2024-06-27 14:32:49,259][06909] Updated weights for policy 0, policy_version 13952 (0.0035) [2024-06-27 14:32:52,503][06909] Updated weights for policy 0, policy_version 13962 (0.0028) [2024-06-27 14:32:53,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 228818944. Throughput: 0: 44149.1. Samples: 131764600. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-27 14:32:53,850][06674] Avg episode reward: [(0, '0.196')] [2024-06-27 14:32:56,576][06909] Updated weights for policy 0, policy_version 13972 (0.0037) [2024-06-27 14:32:58,850][06674] Fps is (10 sec: 47514.2, 60 sec: 44236.9, 300 sec: 44153.8). Total num frames: 229031936. Throughput: 0: 44205.6. Samples: 131906760. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2024-06-27 14:32:58,850][06674] Avg episode reward: [(0, '0.215')] [2024-06-27 14:32:58,858][06887] Saving new best policy, reward=0.215! [2024-06-27 14:32:59,927][06909] Updated weights for policy 0, policy_version 13982 (0.0033) [2024-06-27 14:33:03,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 229228544. Throughput: 0: 44051.3. Samples: 132163240. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2024-06-27 14:33:03,850][06674] Avg episode reward: [(0, '0.223')] [2024-06-27 14:33:03,958][06887] Saving new best policy, reward=0.223! [2024-06-27 14:33:03,962][06909] Updated weights for policy 0, policy_version 13992 (0.0031) [2024-06-27 14:33:07,289][06909] Updated weights for policy 0, policy_version 14002 (0.0024) [2024-06-27 14:33:08,850][06674] Fps is (10 sec: 45874.6, 60 sec: 44511.3, 300 sec: 44209.0). Total num frames: 229490688. Throughput: 0: 43900.4. Samples: 132422340. Policy #0 lag: (min: 0.0, avg: 9.5, max: 19.0) [2024-06-27 14:33:08,850][06674] Avg episode reward: [(0, '0.225')] [2024-06-27 14:33:08,864][06887] Saving new best policy, reward=0.225! [2024-06-27 14:33:11,387][06909] Updated weights for policy 0, policy_version 14012 (0.0023) [2024-06-27 14:33:13,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43963.9, 300 sec: 44153.5). Total num frames: 229687296. Throughput: 0: 44120.8. Samples: 132564260. Policy #0 lag: (min: 0.0, avg: 11.7, max: 21.0) [2024-06-27 14:33:13,850][06674] Avg episode reward: [(0, '0.216')] [2024-06-27 14:33:14,654][06909] Updated weights for policy 0, policy_version 14022 (0.0031) [2024-06-27 14:33:18,730][06909] Updated weights for policy 0, policy_version 14032 (0.0031) [2024-06-27 14:33:18,850][06674] Fps is (10 sec: 40959.8, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 229900288. Throughput: 0: 44147.5. Samples: 132827200. Policy #0 lag: (min: 0.0, avg: 11.7, max: 21.0) [2024-06-27 14:33:18,850][06674] Avg episode reward: [(0, '0.214')] [2024-06-27 14:33:21,967][06909] Updated weights for policy 0, policy_version 14042 (0.0039) [2024-06-27 14:33:23,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 230146048. Throughput: 0: 44013.2. Samples: 133085100. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2024-06-27 14:33:23,851][06674] Avg episode reward: [(0, '0.220')] [2024-06-27 14:33:26,135][06909] Updated weights for policy 0, policy_version 14052 (0.0022) [2024-06-27 14:33:28,855][06674] Fps is (10 sec: 44213.1, 60 sec: 43959.7, 300 sec: 44097.1). Total num frames: 230342656. Throughput: 0: 43888.9. Samples: 133224460. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2024-06-27 14:33:28,856][06674] Avg episode reward: [(0, '0.228')] [2024-06-27 14:33:28,870][06887] Saving new best policy, reward=0.228! [2024-06-27 14:33:29,465][06909] Updated weights for policy 0, policy_version 14062 (0.0035) [2024-06-27 14:33:33,669][06909] Updated weights for policy 0, policy_version 14072 (0.0040) [2024-06-27 14:33:33,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 230555648. Throughput: 0: 44107.6. Samples: 133487440. Policy #0 lag: (min: 0.0, avg: 12.1, max: 21.0) [2024-06-27 14:33:33,850][06674] Avg episode reward: [(0, '0.224')] [2024-06-27 14:33:36,811][06909] Updated weights for policy 0, policy_version 14082 (0.0042) [2024-06-27 14:33:38,856][06674] Fps is (10 sec: 45872.4, 60 sec: 43959.3, 300 sec: 44152.6). Total num frames: 230801408. Throughput: 0: 44003.0. Samples: 133745000. Policy #0 lag: (min: 0.0, avg: 12.1, max: 21.0) [2024-06-27 14:33:38,856][06674] Avg episode reward: [(0, '0.214')] [2024-06-27 14:33:41,024][06909] Updated weights for policy 0, policy_version 14092 (0.0026) [2024-06-27 14:33:43,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 231014400. Throughput: 0: 43998.2. Samples: 133886680. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 14:33:43,850][06674] Avg episode reward: [(0, '0.210')] [2024-06-27 14:33:44,118][06909] Updated weights for policy 0, policy_version 14102 (0.0037) [2024-06-27 14:33:48,350][06909] Updated weights for policy 0, policy_version 14112 (0.0035) [2024-06-27 14:33:48,850][06674] Fps is (10 sec: 40984.1, 60 sec: 44236.7, 300 sec: 44098.8). Total num frames: 231211008. Throughput: 0: 44182.0. Samples: 134151440. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 14:33:48,851][06674] Avg episode reward: [(0, '0.210')] [2024-06-27 14:33:48,911][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000014113_231227392.pth... [2024-06-27 14:33:48,957][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000013468_220659712.pth [2024-06-27 14:33:51,555][06909] Updated weights for policy 0, policy_version 14122 (0.0020) [2024-06-27 14:33:53,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 231473152. Throughput: 0: 44089.8. Samples: 134406380. Policy #0 lag: (min: 1.0, avg: 10.7, max: 23.0) [2024-06-27 14:33:53,850][06674] Avg episode reward: [(0, '0.211')] [2024-06-27 14:33:55,750][06909] Updated weights for policy 0, policy_version 14132 (0.0037) [2024-06-27 14:33:58,597][06887] Signal inference workers to stop experience collection... (1850 times) [2024-06-27 14:33:58,597][06887] Signal inference workers to resume experience collection... (1850 times) [2024-06-27 14:33:58,647][06909] InferenceWorker_p0-w0: stopping experience collection (1850 times) [2024-06-27 14:33:58,647][06909] InferenceWorker_p0-w0: resuming experience collection (1850 times) [2024-06-27 14:33:58,850][06674] Fps is (10 sec: 47513.0, 60 sec: 44236.5, 300 sec: 44153.4). Total num frames: 231686144. Throughput: 0: 44195.3. Samples: 134553060. Policy #0 lag: (min: 1.0, avg: 10.7, max: 23.0) [2024-06-27 14:33:58,851][06674] Avg episode reward: [(0, '0.222')] [2024-06-27 14:33:58,997][06909] Updated weights for policy 0, policy_version 14142 (0.0039) [2024-06-27 14:34:03,081][06909] Updated weights for policy 0, policy_version 14152 (0.0024) [2024-06-27 14:34:03,850][06674] Fps is (10 sec: 40960.1, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 231882752. Throughput: 0: 44151.2. Samples: 134814000. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 14:34:03,850][06674] Avg episode reward: [(0, '0.227')] [2024-06-27 14:34:06,447][06909] Updated weights for policy 0, policy_version 14162 (0.0039) [2024-06-27 14:34:08,850][06674] Fps is (10 sec: 44238.1, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 232128512. Throughput: 0: 44321.8. Samples: 135079580. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 14:34:08,850][06674] Avg episode reward: [(0, '0.228')] [2024-06-27 14:34:10,319][06909] Updated weights for policy 0, policy_version 14172 (0.0025) [2024-06-27 14:34:13,704][06909] Updated weights for policy 0, policy_version 14182 (0.0038) [2024-06-27 14:34:13,850][06674] Fps is (10 sec: 47513.1, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 232357888. Throughput: 0: 44420.4. Samples: 135223140. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-27 14:34:13,850][06674] Avg episode reward: [(0, '0.228')] [2024-06-27 14:34:17,516][06909] Updated weights for policy 0, policy_version 14192 (0.0034) [2024-06-27 14:34:18,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 232554496. Throughput: 0: 44477.8. Samples: 135488940. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-27 14:34:18,850][06674] Avg episode reward: [(0, '0.228')] [2024-06-27 14:34:21,055][06909] Updated weights for policy 0, policy_version 14202 (0.0039) [2024-06-27 14:34:23,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 232800256. Throughput: 0: 44570.4. Samples: 135750400. Policy #0 lag: (min: 1.0, avg: 12.0, max: 22.0) [2024-06-27 14:34:23,850][06674] Avg episode reward: [(0, '0.219')] [2024-06-27 14:34:25,369][06909] Updated weights for policy 0, policy_version 14212 (0.0043) [2024-06-27 14:34:28,551][06909] Updated weights for policy 0, policy_version 14222 (0.0030) [2024-06-27 14:34:28,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44513.9, 300 sec: 44098.0). Total num frames: 233013248. Throughput: 0: 44448.5. Samples: 135886860. Policy #0 lag: (min: 1.0, avg: 12.0, max: 22.0) [2024-06-27 14:34:28,850][06674] Avg episode reward: [(0, '0.232')] [2024-06-27 14:34:28,940][06887] Saving new best policy, reward=0.232! [2024-06-27 14:34:32,691][06909] Updated weights for policy 0, policy_version 14232 (0.0027) [2024-06-27 14:34:33,850][06674] Fps is (10 sec: 40960.1, 60 sec: 44236.9, 300 sec: 44209.0). Total num frames: 233209856. Throughput: 0: 44449.5. Samples: 136151660. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 14:34:33,850][06674] Avg episode reward: [(0, '0.250')] [2024-06-27 14:34:33,851][06887] Saving new best policy, reward=0.250! [2024-06-27 14:34:36,041][06909] Updated weights for policy 0, policy_version 14242 (0.0034) [2024-06-27 14:34:38,854][06674] Fps is (10 sec: 44219.9, 60 sec: 44238.5, 300 sec: 44208.5). Total num frames: 233455616. Throughput: 0: 44768.7. Samples: 136421140. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 14:34:38,854][06674] Avg episode reward: [(0, '0.251')] [2024-06-27 14:34:39,943][06909] Updated weights for policy 0, policy_version 14252 (0.0041) [2024-06-27 14:34:43,577][06909] Updated weights for policy 0, policy_version 14262 (0.0037) [2024-06-27 14:34:43,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 233668608. Throughput: 0: 44510.6. Samples: 136556020. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 14:34:43,850][06674] Avg episode reward: [(0, '0.242')] [2024-06-27 14:34:47,268][06909] Updated weights for policy 0, policy_version 14272 (0.0032) [2024-06-27 14:34:48,850][06674] Fps is (10 sec: 42613.9, 60 sec: 44509.9, 300 sec: 44153.8). Total num frames: 233881600. Throughput: 0: 44480.7. Samples: 136815640. Policy #0 lag: (min: 1.0, avg: 11.5, max: 21.0) [2024-06-27 14:34:48,851][06674] Avg episode reward: [(0, '0.227')] [2024-06-27 14:34:50,896][06909] Updated weights for policy 0, policy_version 14282 (0.0039) [2024-06-27 14:34:53,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.8, 300 sec: 44209.1). Total num frames: 234110976. Throughput: 0: 44402.8. Samples: 137077700. Policy #0 lag: (min: 1.0, avg: 11.5, max: 21.0) [2024-06-27 14:34:53,850][06674] Avg episode reward: [(0, '0.233')] [2024-06-27 14:34:54,739][06909] Updated weights for policy 0, policy_version 14292 (0.0038) [2024-06-27 14:34:58,336][06909] Updated weights for policy 0, policy_version 14302 (0.0027) [2024-06-27 14:34:58,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44237.0, 300 sec: 44153.5). Total num frames: 234340352. Throughput: 0: 44217.7. Samples: 137212940. Policy #0 lag: (min: 1.0, avg: 10.5, max: 21.0) [2024-06-27 14:34:58,850][06674] Avg episode reward: [(0, '0.233')] [2024-06-27 14:35:02,232][06909] Updated weights for policy 0, policy_version 14312 (0.0033) [2024-06-27 14:35:03,850][06674] Fps is (10 sec: 42598.0, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 234536960. Throughput: 0: 44186.7. Samples: 137477340. Policy #0 lag: (min: 1.0, avg: 10.5, max: 21.0) [2024-06-27 14:35:03,850][06674] Avg episode reward: [(0, '0.251')] [2024-06-27 14:35:05,730][06909] Updated weights for policy 0, policy_version 14322 (0.0034) [2024-06-27 14:35:08,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 234766336. Throughput: 0: 44255.1. Samples: 137741880. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 14:35:08,850][06674] Avg episode reward: [(0, '0.244')] [2024-06-27 14:35:09,625][06909] Updated weights for policy 0, policy_version 14332 (0.0021) [2024-06-27 14:35:13,145][06909] Updated weights for policy 0, policy_version 14342 (0.0032) [2024-06-27 14:35:13,850][06674] Fps is (10 sec: 47513.8, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 235012096. Throughput: 0: 44249.3. Samples: 137878080. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 14:35:13,850][06674] Avg episode reward: [(0, '0.247')] [2024-06-27 14:35:17,167][06909] Updated weights for policy 0, policy_version 14352 (0.0036) [2024-06-27 14:35:18,850][06674] Fps is (10 sec: 44234.5, 60 sec: 44236.4, 300 sec: 44153.4). Total num frames: 235208704. Throughput: 0: 44240.3. Samples: 138142500. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-27 14:35:18,851][06674] Avg episode reward: [(0, '0.243')] [2024-06-27 14:35:20,410][06909] Updated weights for policy 0, policy_version 14362 (0.0023) [2024-06-27 14:35:23,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43690.6, 300 sec: 44153.5). Total num frames: 235421696. Throughput: 0: 44125.4. Samples: 138406620. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-27 14:35:23,851][06674] Avg episode reward: [(0, '0.239')] [2024-06-27 14:35:24,789][06909] Updated weights for policy 0, policy_version 14372 (0.0040) [2024-06-27 14:35:27,902][06909] Updated weights for policy 0, policy_version 14382 (0.0030) [2024-06-27 14:35:28,850][06674] Fps is (10 sec: 45878.1, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 235667456. Throughput: 0: 43924.1. Samples: 138532600. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-27 14:35:28,850][06674] Avg episode reward: [(0, '0.261')] [2024-06-27 14:35:28,933][06887] Saving new best policy, reward=0.261! [2024-06-27 14:35:32,223][06909] Updated weights for policy 0, policy_version 14392 (0.0040) [2024-06-27 14:35:33,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44509.8, 300 sec: 44209.0). Total num frames: 235880448. Throughput: 0: 44113.0. Samples: 138800720. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-27 14:35:33,850][06674] Avg episode reward: [(0, '0.266')] [2024-06-27 14:35:33,851][06887] Saving new best policy, reward=0.266! [2024-06-27 14:35:34,482][06887] Signal inference workers to stop experience collection... (1900 times) [2024-06-27 14:35:34,489][06887] Signal inference workers to resume experience collection... (1900 times) [2024-06-27 14:35:34,519][06909] InferenceWorker_p0-w0: stopping experience collection (1900 times) [2024-06-27 14:35:34,520][06909] InferenceWorker_p0-w0: resuming experience collection (1900 times) [2024-06-27 14:35:35,178][06909] Updated weights for policy 0, policy_version 14402 (0.0027) [2024-06-27 14:35:38,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43966.5, 300 sec: 44098.0). Total num frames: 236093440. Throughput: 0: 44164.4. Samples: 139065100. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-27 14:35:38,850][06674] Avg episode reward: [(0, '0.250')] [2024-06-27 14:35:39,605][06909] Updated weights for policy 0, policy_version 14412 (0.0032) [2024-06-27 14:35:42,681][06909] Updated weights for policy 0, policy_version 14422 (0.0037) [2024-06-27 14:35:43,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.7, 300 sec: 44209.3). Total num frames: 236322816. Throughput: 0: 43995.6. Samples: 139192740. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-27 14:35:43,850][06674] Avg episode reward: [(0, '0.258')] [2024-06-27 14:35:46,980][06909] Updated weights for policy 0, policy_version 14432 (0.0037) [2024-06-27 14:35:48,850][06674] Fps is (10 sec: 44236.2, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 236535808. Throughput: 0: 44021.7. Samples: 139458320. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-27 14:35:48,850][06674] Avg episode reward: [(0, '0.260')] [2024-06-27 14:35:48,861][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000014437_236535808.pth... [2024-06-27 14:35:48,913][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000013790_225935360.pth [2024-06-27 14:35:50,100][06909] Updated weights for policy 0, policy_version 14442 (0.0023) [2024-06-27 14:35:53,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.6, 300 sec: 44097.9). Total num frames: 236748800. Throughput: 0: 43964.9. Samples: 139720300. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-27 14:35:53,853][06674] Avg episode reward: [(0, '0.269')] [2024-06-27 14:35:53,854][06887] Saving new best policy, reward=0.269! [2024-06-27 14:35:54,473][06909] Updated weights for policy 0, policy_version 14452 (0.0041) [2024-06-27 14:35:57,530][06909] Updated weights for policy 0, policy_version 14462 (0.0033) [2024-06-27 14:35:58,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 236978176. Throughput: 0: 43854.6. Samples: 139851540. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-27 14:35:58,850][06674] Avg episode reward: [(0, '0.274')] [2024-06-27 14:35:58,868][06887] Saving new best policy, reward=0.274! [2024-06-27 14:36:01,726][06909] Updated weights for policy 0, policy_version 14472 (0.0027) [2024-06-27 14:36:03,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 237174784. Throughput: 0: 43851.1. Samples: 140115780. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 14:36:03,850][06674] Avg episode reward: [(0, '0.269')] [2024-06-27 14:36:04,884][06909] Updated weights for policy 0, policy_version 14482 (0.0039) [2024-06-27 14:36:08,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 237404160. Throughput: 0: 43808.1. Samples: 140377980. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 14:36:08,850][06674] Avg episode reward: [(0, '0.257')] [2024-06-27 14:36:09,144][06909] Updated weights for policy 0, policy_version 14492 (0.0026) [2024-06-27 14:36:12,585][06909] Updated weights for policy 0, policy_version 14502 (0.0046) [2024-06-27 14:36:13,850][06674] Fps is (10 sec: 47514.1, 60 sec: 43963.7, 300 sec: 44264.6). Total num frames: 237649920. Throughput: 0: 43893.7. Samples: 140507820. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-27 14:36:13,850][06674] Avg episode reward: [(0, '0.257')] [2024-06-27 14:36:16,545][06909] Updated weights for policy 0, policy_version 14512 (0.0042) [2024-06-27 14:36:18,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43964.1, 300 sec: 44042.4). Total num frames: 237846528. Throughput: 0: 43895.1. Samples: 140776000. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-27 14:36:18,850][06674] Avg episode reward: [(0, '0.278')] [2024-06-27 14:36:18,879][06887] Saving new best policy, reward=0.278! [2024-06-27 14:36:20,421][06909] Updated weights for policy 0, policy_version 14522 (0.0036) [2024-06-27 14:36:23,850][06674] Fps is (10 sec: 42598.1, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 238075904. Throughput: 0: 43802.6. Samples: 141036220. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 14:36:23,850][06674] Avg episode reward: [(0, '0.274')] [2024-06-27 14:36:24,642][06909] Updated weights for policy 0, policy_version 14532 (0.0031) [2024-06-27 14:36:27,883][06909] Updated weights for policy 0, policy_version 14542 (0.0034) [2024-06-27 14:36:28,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43963.6, 300 sec: 44209.0). Total num frames: 238305280. Throughput: 0: 43751.0. Samples: 141161540. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 14:36:28,851][06674] Avg episode reward: [(0, '0.272')] [2024-06-27 14:36:32,019][06909] Updated weights for policy 0, policy_version 14552 (0.0042) [2024-06-27 14:36:33,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 238518272. Throughput: 0: 43835.6. Samples: 141430920. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 14:36:33,850][06674] Avg episode reward: [(0, '0.272')] [2024-06-27 14:36:35,339][06909] Updated weights for policy 0, policy_version 14562 (0.0033) [2024-06-27 14:36:38,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 238714880. Throughput: 0: 43756.8. Samples: 141689360. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 14:36:38,851][06674] Avg episode reward: [(0, '0.268')] [2024-06-27 14:36:39,507][06909] Updated weights for policy 0, policy_version 14572 (0.0036) [2024-06-27 14:36:42,749][06909] Updated weights for policy 0, policy_version 14582 (0.0037) [2024-06-27 14:36:43,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 238960640. Throughput: 0: 43727.9. Samples: 141819300. Policy #0 lag: (min: 1.0, avg: 10.5, max: 21.0) [2024-06-27 14:36:43,850][06674] Avg episode reward: [(0, '0.268')] [2024-06-27 14:36:46,850][06909] Updated weights for policy 0, policy_version 14592 (0.0027) [2024-06-27 14:36:48,850][06674] Fps is (10 sec: 45875.8, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 239173632. Throughput: 0: 43793.4. Samples: 142086480. Policy #0 lag: (min: 1.0, avg: 10.5, max: 21.0) [2024-06-27 14:36:48,850][06674] Avg episode reward: [(0, '0.278')] [2024-06-27 14:36:50,639][06909] Updated weights for policy 0, policy_version 14602 (0.0036) [2024-06-27 14:36:53,850][06674] Fps is (10 sec: 39321.7, 60 sec: 43417.5, 300 sec: 43986.9). Total num frames: 239353856. Throughput: 0: 43771.9. Samples: 142347720. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 14:36:53,850][06674] Avg episode reward: [(0, '0.257')] [2024-06-27 14:36:54,282][06909] Updated weights for policy 0, policy_version 14612 (0.0039) [2024-06-27 14:36:58,043][06909] Updated weights for policy 0, policy_version 14622 (0.0042) [2024-06-27 14:36:58,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 239616000. Throughput: 0: 43641.6. Samples: 142471700. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 14:36:58,851][06674] Avg episode reward: [(0, '0.271')] [2024-06-27 14:37:01,669][06909] Updated weights for policy 0, policy_version 14632 (0.0034) [2024-06-27 14:37:02,876][06887] Signal inference workers to stop experience collection... (1950 times) [2024-06-27 14:37:02,876][06887] Signal inference workers to resume experience collection... (1950 times) [2024-06-27 14:37:02,887][06909] InferenceWorker_p0-w0: stopping experience collection (1950 times) [2024-06-27 14:37:02,887][06909] InferenceWorker_p0-w0: resuming experience collection (1950 times) [2024-06-27 14:37:03,850][06674] Fps is (10 sec: 47514.1, 60 sec: 44236.8, 300 sec: 44098.3). Total num frames: 239828992. Throughput: 0: 43564.0. Samples: 142736380. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 14:37:03,850][06674] Avg episode reward: [(0, '0.275')] [2024-06-27 14:37:05,518][06909] Updated weights for policy 0, policy_version 14642 (0.0034) [2024-06-27 14:37:08,850][06674] Fps is (10 sec: 39321.9, 60 sec: 43417.6, 300 sec: 43931.4). Total num frames: 240009216. Throughput: 0: 43580.9. Samples: 142997360. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 14:37:08,850][06674] Avg episode reward: [(0, '0.270')] [2024-06-27 14:37:09,246][06909] Updated weights for policy 0, policy_version 14652 (0.0038) [2024-06-27 14:37:12,981][06909] Updated weights for policy 0, policy_version 14662 (0.0026) [2024-06-27 14:37:13,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43417.5, 300 sec: 44098.0). Total num frames: 240254976. Throughput: 0: 43575.2. Samples: 143122420. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 14:37:13,850][06674] Avg episode reward: [(0, '0.260')] [2024-06-27 14:37:16,893][06909] Updated weights for policy 0, policy_version 14672 (0.0045) [2024-06-27 14:37:18,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 240467968. Throughput: 0: 43389.0. Samples: 143383420. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-27 14:37:18,850][06674] Avg episode reward: [(0, '0.269')] [2024-06-27 14:37:20,385][06909] Updated weights for policy 0, policy_version 14682 (0.0032) [2024-06-27 14:37:23,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43144.6, 300 sec: 43931.3). Total num frames: 240664576. Throughput: 0: 43577.0. Samples: 143650320. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-27 14:37:23,850][06674] Avg episode reward: [(0, '0.269')] [2024-06-27 14:37:24,356][06909] Updated weights for policy 0, policy_version 14692 (0.0029) [2024-06-27 14:37:27,870][06909] Updated weights for policy 0, policy_version 14702 (0.0038) [2024-06-27 14:37:28,850][06674] Fps is (10 sec: 44236.0, 60 sec: 43417.6, 300 sec: 44042.4). Total num frames: 240910336. Throughput: 0: 43431.1. Samples: 143773700. Policy #0 lag: (min: 0.0, avg: 11.8, max: 25.0) [2024-06-27 14:37:28,850][06674] Avg episode reward: [(0, '0.279')] [2024-06-27 14:37:28,863][06887] Saving new best policy, reward=0.279! [2024-06-27 14:37:31,902][06909] Updated weights for policy 0, policy_version 14712 (0.0032) [2024-06-27 14:37:33,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43417.7, 300 sec: 43931.3). Total num frames: 241123328. Throughput: 0: 43332.9. Samples: 144036460. Policy #0 lag: (min: 0.0, avg: 11.8, max: 25.0) [2024-06-27 14:37:33,850][06674] Avg episode reward: [(0, '0.279')] [2024-06-27 14:37:35,360][06909] Updated weights for policy 0, policy_version 14722 (0.0037) [2024-06-27 14:37:38,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 241336320. Throughput: 0: 43526.3. Samples: 144306400. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-27 14:37:38,850][06674] Avg episode reward: [(0, '0.288')] [2024-06-27 14:37:38,865][06887] Saving new best policy, reward=0.288! [2024-06-27 14:37:39,438][06909] Updated weights for policy 0, policy_version 14732 (0.0029) [2024-06-27 14:37:42,846][06909] Updated weights for policy 0, policy_version 14742 (0.0033) [2024-06-27 14:37:43,850][06674] Fps is (10 sec: 44236.0, 60 sec: 43417.6, 300 sec: 44097.9). Total num frames: 241565696. Throughput: 0: 43429.3. Samples: 144426020. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-27 14:37:43,850][06674] Avg episode reward: [(0, '0.290')] [2024-06-27 14:37:43,851][06887] Saving new best policy, reward=0.290! [2024-06-27 14:37:46,924][06909] Updated weights for policy 0, policy_version 14752 (0.0037) [2024-06-27 14:37:48,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 241795072. Throughput: 0: 43481.8. Samples: 144693060. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-27 14:37:48,850][06674] Avg episode reward: [(0, '0.292')] [2024-06-27 14:37:48,860][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000014758_241795072.pth... [2024-06-27 14:37:48,913][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000014113_231227392.pth [2024-06-27 14:37:48,923][06887] Saving new best policy, reward=0.292! [2024-06-27 14:37:50,244][06909] Updated weights for policy 0, policy_version 14762 (0.0031) [2024-06-27 14:37:53,852][06674] Fps is (10 sec: 42590.2, 60 sec: 43962.3, 300 sec: 43931.0). Total num frames: 241991680. Throughput: 0: 43642.5. Samples: 144961360. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-27 14:37:53,852][06674] Avg episode reward: [(0, '0.296')] [2024-06-27 14:37:53,853][06887] Saving new best policy, reward=0.296! [2024-06-27 14:37:54,402][06909] Updated weights for policy 0, policy_version 14772 (0.0023) [2024-06-27 14:37:57,680][06909] Updated weights for policy 0, policy_version 14782 (0.0030) [2024-06-27 14:37:58,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43144.6, 300 sec: 43986.9). Total num frames: 242204672. Throughput: 0: 43581.4. Samples: 145083580. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-27 14:37:58,850][06674] Avg episode reward: [(0, '0.296')] [2024-06-27 14:38:01,887][06909] Updated weights for policy 0, policy_version 14792 (0.0047) [2024-06-27 14:38:03,850][06674] Fps is (10 sec: 45884.7, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 242450432. Throughput: 0: 43637.3. Samples: 145347100. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 14:38:03,850][06674] Avg episode reward: [(0, '0.274')] [2024-06-27 14:38:05,142][06909] Updated weights for policy 0, policy_version 14802 (0.0049) [2024-06-27 14:38:08,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43417.5, 300 sec: 43820.2). Total num frames: 242614272. Throughput: 0: 43719.8. Samples: 145617720. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 14:38:08,851][06674] Avg episode reward: [(0, '0.281')] [2024-06-27 14:38:09,474][06909] Updated weights for policy 0, policy_version 14812 (0.0041) [2024-06-27 14:38:10,673][06887] Signal inference workers to stop experience collection... (2000 times) [2024-06-27 14:38:10,673][06887] Signal inference workers to resume experience collection... (2000 times) [2024-06-27 14:38:10,732][06909] InferenceWorker_p0-w0: stopping experience collection (2000 times) [2024-06-27 14:38:10,732][06909] InferenceWorker_p0-w0: resuming experience collection (2000 times) [2024-06-27 14:38:12,583][06909] Updated weights for policy 0, policy_version 14822 (0.0041) [2024-06-27 14:38:13,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43417.6, 300 sec: 43931.3). Total num frames: 242860032. Throughput: 0: 43554.8. Samples: 145733660. Policy #0 lag: (min: 0.0, avg: 11.9, max: 22.0) [2024-06-27 14:38:13,850][06674] Avg episode reward: [(0, '0.291')] [2024-06-27 14:38:17,170][06909] Updated weights for policy 0, policy_version 14832 (0.0036) [2024-06-27 14:38:18,850][06674] Fps is (10 sec: 49152.9, 60 sec: 43963.8, 300 sec: 43931.4). Total num frames: 243105792. Throughput: 0: 43736.0. Samples: 146004580. Policy #0 lag: (min: 0.0, avg: 11.9, max: 22.0) [2024-06-27 14:38:18,850][06674] Avg episode reward: [(0, '0.304')] [2024-06-27 14:38:18,876][06887] Saving new best policy, reward=0.304! [2024-06-27 14:38:20,037][06909] Updated weights for policy 0, policy_version 14842 (0.0032) [2024-06-27 14:38:23,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43417.6, 300 sec: 43821.1). Total num frames: 243269632. Throughput: 0: 43636.0. Samples: 146270020. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-27 14:38:23,850][06674] Avg episode reward: [(0, '0.297')] [2024-06-27 14:38:24,909][06909] Updated weights for policy 0, policy_version 14852 (0.0036) [2024-06-27 14:38:27,445][06909] Updated weights for policy 0, policy_version 14862 (0.0038) [2024-06-27 14:38:28,852][06674] Fps is (10 sec: 40951.3, 60 sec: 43416.2, 300 sec: 43931.0). Total num frames: 243515392. Throughput: 0: 43627.0. Samples: 146389320. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-27 14:38:28,852][06674] Avg episode reward: [(0, '0.291')] [2024-06-27 14:38:32,456][06909] Updated weights for policy 0, policy_version 14872 (0.0028) [2024-06-27 14:38:33,850][06674] Fps is (10 sec: 49152.0, 60 sec: 43963.8, 300 sec: 43932.2). Total num frames: 243761152. Throughput: 0: 43851.1. Samples: 146666360. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-27 14:38:33,850][06674] Avg episode reward: [(0, '0.304')] [2024-06-27 14:38:34,824][06909] Updated weights for policy 0, policy_version 14882 (0.0027) [2024-06-27 14:38:38,850][06674] Fps is (10 sec: 42606.9, 60 sec: 43417.6, 300 sec: 43820.2). Total num frames: 243941376. Throughput: 0: 43715.3. Samples: 146928460. Policy #0 lag: (min: 0.0, avg: 12.4, max: 22.0) [2024-06-27 14:38:38,854][06674] Avg episode reward: [(0, '0.302')] [2024-06-27 14:38:39,834][06909] Updated weights for policy 0, policy_version 14892 (0.0032) [2024-06-27 14:38:42,345][06909] Updated weights for policy 0, policy_version 14902 (0.0041) [2024-06-27 14:38:43,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.8, 300 sec: 43986.9). Total num frames: 244187136. Throughput: 0: 43690.7. Samples: 147049660. Policy #0 lag: (min: 0.0, avg: 12.4, max: 22.0) [2024-06-27 14:38:43,850][06674] Avg episode reward: [(0, '0.305')] [2024-06-27 14:38:47,271][06909] Updated weights for policy 0, policy_version 14912 (0.0033) [2024-06-27 14:38:48,850][06674] Fps is (10 sec: 47514.0, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 244416512. Throughput: 0: 43895.6. Samples: 147322400. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-27 14:38:48,850][06674] Avg episode reward: [(0, '0.300')] [2024-06-27 14:38:49,949][06909] Updated weights for policy 0, policy_version 14922 (0.0039) [2024-06-27 14:38:53,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43419.1, 300 sec: 43764.8). Total num frames: 244596736. Throughput: 0: 43881.5. Samples: 147592380. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-27 14:38:53,850][06674] Avg episode reward: [(0, '0.296')] [2024-06-27 14:38:54,634][06909] Updated weights for policy 0, policy_version 14932 (0.0043) [2024-06-27 14:38:57,687][06909] Updated weights for policy 0, policy_version 14942 (0.0031) [2024-06-27 14:38:58,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 244858880. Throughput: 0: 43951.1. Samples: 147711460. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-27 14:38:58,850][06674] Avg episode reward: [(0, '0.299')] [2024-06-27 14:39:02,250][06909] Updated weights for policy 0, policy_version 14952 (0.0037) [2024-06-27 14:39:03,850][06674] Fps is (10 sec: 47513.0, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 245071872. Throughput: 0: 44045.6. Samples: 147986640. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-27 14:39:03,851][06674] Avg episode reward: [(0, '0.307')] [2024-06-27 14:39:03,978][06887] Saving new best policy, reward=0.307! [2024-06-27 14:39:05,195][06909] Updated weights for policy 0, policy_version 14962 (0.0030) [2024-06-27 14:39:08,850][06674] Fps is (10 sec: 40960.3, 60 sec: 44236.9, 300 sec: 43764.7). Total num frames: 245268480. Throughput: 0: 43923.1. Samples: 148246560. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-27 14:39:08,850][06674] Avg episode reward: [(0, '0.295')] [2024-06-27 14:39:09,783][06909] Updated weights for policy 0, policy_version 14972 (0.0032) [2024-06-27 14:39:12,774][06909] Updated weights for policy 0, policy_version 14982 (0.0034) [2024-06-27 14:39:13,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 245514240. Throughput: 0: 43978.4. Samples: 148368260. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-27 14:39:13,850][06674] Avg episode reward: [(0, '0.309')] [2024-06-27 14:39:13,851][06887] Saving new best policy, reward=0.309! [2024-06-27 14:39:17,250][06887] Signal inference workers to stop experience collection... (2050 times) [2024-06-27 14:39:17,303][06887] Signal inference workers to resume experience collection... (2050 times) [2024-06-27 14:39:17,304][06909] InferenceWorker_p0-w0: stopping experience collection (2050 times) [2024-06-27 14:39:17,311][06909] Updated weights for policy 0, policy_version 14992 (0.0044) [2024-06-27 14:39:17,333][06909] InferenceWorker_p0-w0: resuming experience collection (2050 times) [2024-06-27 14:39:18,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43690.6, 300 sec: 43820.2). Total num frames: 245727232. Throughput: 0: 43774.5. Samples: 148636220. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 14:39:18,852][06674] Avg episode reward: [(0, '0.316')] [2024-06-27 14:39:18,859][06887] Saving new best policy, reward=0.316! [2024-06-27 14:39:20,230][06909] Updated weights for policy 0, policy_version 15002 (0.0033) [2024-06-27 14:39:23,850][06674] Fps is (10 sec: 39321.8, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 245907456. Throughput: 0: 43820.0. Samples: 148900360. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 14:39:23,850][06674] Avg episode reward: [(0, '0.316')] [2024-06-27 14:39:24,635][06909] Updated weights for policy 0, policy_version 15012 (0.0043) [2024-06-27 14:39:27,732][06909] Updated weights for policy 0, policy_version 15022 (0.0041) [2024-06-27 14:39:28,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43965.2, 300 sec: 43875.8). Total num frames: 246153216. Throughput: 0: 43862.2. Samples: 149023460. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 14:39:28,850][06674] Avg episode reward: [(0, '0.308')] [2024-06-27 14:39:32,053][06909] Updated weights for policy 0, policy_version 15032 (0.0043) [2024-06-27 14:39:33,850][06674] Fps is (10 sec: 47513.5, 60 sec: 43690.6, 300 sec: 43820.8). Total num frames: 246382592. Throughput: 0: 43800.4. Samples: 149293420. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 14:39:33,850][06674] Avg episode reward: [(0, '0.301')] [2024-06-27 14:39:35,191][06909] Updated weights for policy 0, policy_version 15042 (0.0028) [2024-06-27 14:39:38,852][06674] Fps is (10 sec: 42589.9, 60 sec: 43962.3, 300 sec: 43764.4). Total num frames: 246579200. Throughput: 0: 43561.5. Samples: 149552740. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 14:39:38,852][06674] Avg episode reward: [(0, '0.298')] [2024-06-27 14:39:39,466][06909] Updated weights for policy 0, policy_version 15052 (0.0031) [2024-06-27 14:39:42,576][06909] Updated weights for policy 0, policy_version 15062 (0.0036) [2024-06-27 14:39:43,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 246824960. Throughput: 0: 43812.9. Samples: 149683040. Policy #0 lag: (min: 0.0, avg: 9.7, max: 24.0) [2024-06-27 14:39:43,850][06674] Avg episode reward: [(0, '0.308')] [2024-06-27 14:39:46,865][06909] Updated weights for policy 0, policy_version 15072 (0.0033) [2024-06-27 14:39:48,850][06674] Fps is (10 sec: 45884.8, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 247037952. Throughput: 0: 43707.3. Samples: 149953460. Policy #0 lag: (min: 0.0, avg: 9.7, max: 24.0) [2024-06-27 14:39:48,850][06674] Avg episode reward: [(0, '0.316')] [2024-06-27 14:39:48,864][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000015078_247037952.pth... [2024-06-27 14:39:48,937][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000014437_236535808.pth [2024-06-27 14:39:49,941][06909] Updated weights for policy 0, policy_version 15082 (0.0023) [2024-06-27 14:39:53,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 247234560. Throughput: 0: 43782.7. Samples: 150216780. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-27 14:39:53,850][06674] Avg episode reward: [(0, '0.310')] [2024-06-27 14:39:54,707][06909] Updated weights for policy 0, policy_version 15092 (0.0032) [2024-06-27 14:39:57,508][06909] Updated weights for policy 0, policy_version 15102 (0.0023) [2024-06-27 14:39:58,850][06674] Fps is (10 sec: 42597.6, 60 sec: 43417.5, 300 sec: 43820.2). Total num frames: 247463936. Throughput: 0: 43788.8. Samples: 150338760. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-27 14:39:58,851][06674] Avg episode reward: [(0, '0.319')] [2024-06-27 14:39:58,921][06887] Saving new best policy, reward=0.319! [2024-06-27 14:40:02,141][06909] Updated weights for policy 0, policy_version 15112 (0.0031) [2024-06-27 14:40:03,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 247693312. Throughput: 0: 43836.5. Samples: 150608860. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 14:40:03,850][06674] Avg episode reward: [(0, '0.319')] [2024-06-27 14:40:05,086][06909] Updated weights for policy 0, policy_version 15122 (0.0030) [2024-06-27 14:40:08,856][06674] Fps is (10 sec: 42573.5, 60 sec: 43686.3, 300 sec: 43652.8). Total num frames: 247889920. Throughput: 0: 43723.1. Samples: 150868160. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 14:40:08,856][06674] Avg episode reward: [(0, '0.328')] [2024-06-27 14:40:08,870][06887] Saving new best policy, reward=0.328! [2024-06-27 14:40:09,651][06909] Updated weights for policy 0, policy_version 15132 (0.0024) [2024-06-27 14:40:12,764][06909] Updated weights for policy 0, policy_version 15142 (0.0035) [2024-06-27 14:40:13,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43417.6, 300 sec: 43764.8). Total num frames: 248119296. Throughput: 0: 43759.2. Samples: 150992620. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 14:40:13,850][06674] Avg episode reward: [(0, '0.319')] [2024-06-27 14:40:17,198][06909] Updated weights for policy 0, policy_version 15152 (0.0027) [2024-06-27 14:40:18,850][06674] Fps is (10 sec: 45903.0, 60 sec: 43690.8, 300 sec: 43820.3). Total num frames: 248348672. Throughput: 0: 43687.2. Samples: 151259340. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 14:40:18,850][06674] Avg episode reward: [(0, '0.313')] [2024-06-27 14:40:20,305][06909] Updated weights for policy 0, policy_version 15162 (0.0040) [2024-06-27 14:40:23,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.7, 300 sec: 43653.6). Total num frames: 248545280. Throughput: 0: 43715.3. Samples: 151519840. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 14:40:23,850][06674] Avg episode reward: [(0, '0.318')] [2024-06-27 14:40:24,594][06909] Updated weights for policy 0, policy_version 15172 (0.0041) [2024-06-27 14:40:27,731][06909] Updated weights for policy 0, policy_version 15182 (0.0030) [2024-06-27 14:40:28,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43417.7, 300 sec: 43653.7). Total num frames: 248758272. Throughput: 0: 43645.4. Samples: 151647080. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 14:40:28,850][06674] Avg episode reward: [(0, '0.320')] [2024-06-27 14:40:32,041][06909] Updated weights for policy 0, policy_version 15192 (0.0024) [2024-06-27 14:40:33,850][06674] Fps is (10 sec: 45876.0, 60 sec: 43690.8, 300 sec: 43764.7). Total num frames: 249004032. Throughput: 0: 43510.7. Samples: 151911440. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 14:40:33,850][06674] Avg episode reward: [(0, '0.313')] [2024-06-27 14:40:35,684][06909] Updated weights for policy 0, policy_version 15202 (0.0033) [2024-06-27 14:40:38,852][06674] Fps is (10 sec: 44227.6, 60 sec: 43690.7, 300 sec: 43653.3). Total num frames: 249200640. Throughput: 0: 43518.9. Samples: 152175220. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-27 14:40:38,852][06674] Avg episode reward: [(0, '0.316')] [2024-06-27 14:40:39,578][06909] Updated weights for policy 0, policy_version 15212 (0.0041) [2024-06-27 14:40:43,153][06909] Updated weights for policy 0, policy_version 15222 (0.0044) [2024-06-27 14:40:43,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43144.6, 300 sec: 43653.7). Total num frames: 249413632. Throughput: 0: 43580.2. Samples: 152299860. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-27 14:40:43,850][06674] Avg episode reward: [(0, '0.320')] [2024-06-27 14:40:45,776][06887] Signal inference workers to stop experience collection... (2100 times) [2024-06-27 14:40:45,829][06887] Signal inference workers to resume experience collection... (2100 times) [2024-06-27 14:40:45,830][06909] InferenceWorker_p0-w0: stopping experience collection (2100 times) [2024-06-27 14:40:45,842][06909] InferenceWorker_p0-w0: resuming experience collection (2100 times) [2024-06-27 14:40:47,014][06909] Updated weights for policy 0, policy_version 15232 (0.0028) [2024-06-27 14:40:48,850][06674] Fps is (10 sec: 45884.7, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 249659392. Throughput: 0: 43421.9. Samples: 152562840. Policy #0 lag: (min: 1.0, avg: 9.3, max: 21.0) [2024-06-27 14:40:48,850][06674] Avg episode reward: [(0, '0.320')] [2024-06-27 14:40:50,629][06909] Updated weights for policy 0, policy_version 15242 (0.0030) [2024-06-27 14:40:53,852][06674] Fps is (10 sec: 44227.6, 60 sec: 43689.1, 300 sec: 43653.3). Total num frames: 249856000. Throughput: 0: 43489.6. Samples: 152825020. Policy #0 lag: (min: 1.0, avg: 9.3, max: 21.0) [2024-06-27 14:40:53,852][06674] Avg episode reward: [(0, '0.328')] [2024-06-27 14:40:54,511][06909] Updated weights for policy 0, policy_version 15252 (0.0043) [2024-06-27 14:40:58,151][06909] Updated weights for policy 0, policy_version 15262 (0.0032) [2024-06-27 14:40:58,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 250068992. Throughput: 0: 43449.3. Samples: 152947840. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-27 14:40:58,850][06674] Avg episode reward: [(0, '0.316')] [2024-06-27 14:41:01,971][06909] Updated weights for policy 0, policy_version 15272 (0.0043) [2024-06-27 14:41:03,850][06674] Fps is (10 sec: 45884.3, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 250314752. Throughput: 0: 43451.4. Samples: 153214660. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-27 14:41:03,850][06674] Avg episode reward: [(0, '0.318')] [2024-06-27 14:41:05,622][06909] Updated weights for policy 0, policy_version 15282 (0.0027) [2024-06-27 14:41:08,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43422.0, 300 sec: 43542.6). Total num frames: 250494976. Throughput: 0: 43527.2. Samples: 153478560. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-27 14:41:08,850][06674] Avg episode reward: [(0, '0.324')] [2024-06-27 14:41:09,518][06909] Updated weights for policy 0, policy_version 15292 (0.0037) [2024-06-27 14:41:13,095][06909] Updated weights for policy 0, policy_version 15302 (0.0031) [2024-06-27 14:41:13,850][06674] Fps is (10 sec: 39321.7, 60 sec: 43144.5, 300 sec: 43598.1). Total num frames: 250707968. Throughput: 0: 43368.4. Samples: 153598660. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 14:41:13,850][06674] Avg episode reward: [(0, '0.322')] [2024-06-27 14:41:16,926][06909] Updated weights for policy 0, policy_version 15312 (0.0035) [2024-06-27 14:41:18,850][06674] Fps is (10 sec: 47513.5, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 250970112. Throughput: 0: 43417.7. Samples: 153865240. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 14:41:18,850][06674] Avg episode reward: [(0, '0.324')] [2024-06-27 14:41:20,616][06909] Updated weights for policy 0, policy_version 15322 (0.0031) [2024-06-27 14:41:23,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43417.6, 300 sec: 43542.6). Total num frames: 251150336. Throughput: 0: 43486.0. Samples: 154132000. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 14:41:23,850][06674] Avg episode reward: [(0, '0.327')] [2024-06-27 14:41:24,451][06909] Updated weights for policy 0, policy_version 15332 (0.0037) [2024-06-27 14:41:28,081][06909] Updated weights for policy 0, policy_version 15342 (0.0041) [2024-06-27 14:41:28,852][06674] Fps is (10 sec: 40951.5, 60 sec: 43689.1, 300 sec: 43597.8). Total num frames: 251379712. Throughput: 0: 43440.6. Samples: 154254780. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 14:41:28,852][06674] Avg episode reward: [(0, '0.326')] [2024-06-27 14:41:32,046][06909] Updated weights for policy 0, policy_version 15352 (0.0037) [2024-06-27 14:41:33,850][06674] Fps is (10 sec: 47512.9, 60 sec: 43690.5, 300 sec: 43764.7). Total num frames: 251625472. Throughput: 0: 43520.3. Samples: 154521260. Policy #0 lag: (min: 1.0, avg: 9.6, max: 22.0) [2024-06-27 14:41:33,850][06674] Avg episode reward: [(0, '0.330')] [2024-06-27 14:41:33,851][06887] Saving new best policy, reward=0.330! [2024-06-27 14:41:35,578][06909] Updated weights for policy 0, policy_version 15362 (0.0037) [2024-06-27 14:41:38,855][06674] Fps is (10 sec: 42584.9, 60 sec: 43415.3, 300 sec: 43541.8). Total num frames: 251805696. Throughput: 0: 43574.2. Samples: 154786000. Policy #0 lag: (min: 1.0, avg: 9.6, max: 22.0) [2024-06-27 14:41:38,856][06674] Avg episode reward: [(0, '0.327')] [2024-06-27 14:41:39,751][06909] Updated weights for policy 0, policy_version 15372 (0.0038) [2024-06-27 14:41:43,030][06909] Updated weights for policy 0, policy_version 15382 (0.0039) [2024-06-27 14:41:43,855][06674] Fps is (10 sec: 39301.7, 60 sec: 43413.8, 300 sec: 43541.8). Total num frames: 252018688. Throughput: 0: 43453.3. Samples: 154903460. Policy #0 lag: (min: 1.0, avg: 10.7, max: 21.0) [2024-06-27 14:41:43,856][06674] Avg episode reward: [(0, '0.323')] [2024-06-27 14:41:47,171][06909] Updated weights for policy 0, policy_version 15392 (0.0028) [2024-06-27 14:41:48,850][06674] Fps is (10 sec: 47538.7, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 252280832. Throughput: 0: 43483.2. Samples: 155171400. Policy #0 lag: (min: 1.0, avg: 10.7, max: 21.0) [2024-06-27 14:41:48,850][06674] Avg episode reward: [(0, '0.330')] [2024-06-27 14:41:48,862][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000015398_252280832.pth... [2024-06-27 14:41:48,914][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000014758_241795072.pth [2024-06-27 14:41:50,481][06909] Updated weights for policy 0, policy_version 15402 (0.0032) [2024-06-27 14:41:53,850][06674] Fps is (10 sec: 44259.3, 60 sec: 43419.0, 300 sec: 43542.6). Total num frames: 252461056. Throughput: 0: 43530.5. Samples: 155437440. Policy #0 lag: (min: 1.0, avg: 10.7, max: 21.0) [2024-06-27 14:41:53,850][06674] Avg episode reward: [(0, '0.325')] [2024-06-27 14:41:54,588][06909] Updated weights for policy 0, policy_version 15412 (0.0046) [2024-06-27 14:41:58,302][06909] Updated weights for policy 0, policy_version 15422 (0.0027) [2024-06-27 14:41:58,850][06674] Fps is (10 sec: 39321.6, 60 sec: 43417.7, 300 sec: 43542.6). Total num frames: 252674048. Throughput: 0: 43610.3. Samples: 155561120. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-27 14:41:58,850][06674] Avg episode reward: [(0, '0.322')] [2024-06-27 14:42:02,374][06909] Updated weights for policy 0, policy_version 15432 (0.0032) [2024-06-27 14:42:03,850][06674] Fps is (10 sec: 45875.9, 60 sec: 43417.7, 300 sec: 43764.7). Total num frames: 252919808. Throughput: 0: 43579.1. Samples: 155826300. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-27 14:42:03,850][06674] Avg episode reward: [(0, '0.328')] [2024-06-27 14:42:05,751][06909] Updated weights for policy 0, policy_version 15442 (0.0035) [2024-06-27 14:42:08,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43417.5, 300 sec: 43542.6). Total num frames: 253100032. Throughput: 0: 43543.5. Samples: 156091460. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-27 14:42:08,850][06674] Avg episode reward: [(0, '0.320')] [2024-06-27 14:42:09,744][06909] Updated weights for policy 0, policy_version 15452 (0.0031) [2024-06-27 14:42:13,215][06909] Updated weights for policy 0, policy_version 15462 (0.0026) [2024-06-27 14:42:13,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 253329408. Throughput: 0: 43485.2. Samples: 156211520. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-27 14:42:13,850][06674] Avg episode reward: [(0, '0.323')] [2024-06-27 14:42:17,250][06909] Updated weights for policy 0, policy_version 15472 (0.0029) [2024-06-27 14:42:18,850][06674] Fps is (10 sec: 49152.2, 60 sec: 43690.6, 300 sec: 43820.3). Total num frames: 253591552. Throughput: 0: 43477.4. Samples: 156477740. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 14:42:18,850][06674] Avg episode reward: [(0, '0.335')] [2024-06-27 14:42:18,877][06887] Saving new best policy, reward=0.335! [2024-06-27 14:42:19,708][06887] Signal inference workers to stop experience collection... (2150 times) [2024-06-27 14:42:19,708][06887] Signal inference workers to resume experience collection... (2150 times) [2024-06-27 14:42:19,754][06909] InferenceWorker_p0-w0: stopping experience collection (2150 times) [2024-06-27 14:42:19,754][06909] InferenceWorker_p0-w0: resuming experience collection (2150 times) [2024-06-27 14:42:20,735][06909] Updated weights for policy 0, policy_version 15482 (0.0032) [2024-06-27 14:42:23,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43417.6, 300 sec: 43542.6). Total num frames: 253755392. Throughput: 0: 43463.7. Samples: 156741640. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 14:42:23,850][06674] Avg episode reward: [(0, '0.331')] [2024-06-27 14:42:24,757][06909] Updated weights for policy 0, policy_version 15492 (0.0037) [2024-06-27 14:42:28,506][06909] Updated weights for policy 0, policy_version 15502 (0.0042) [2024-06-27 14:42:28,850][06674] Fps is (10 sec: 39321.5, 60 sec: 43419.1, 300 sec: 43598.1). Total num frames: 253984768. Throughput: 0: 43527.6. Samples: 156861980. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 14:42:28,850][06674] Avg episode reward: [(0, '0.333')] [2024-06-27 14:42:32,286][06909] Updated weights for policy 0, policy_version 15512 (0.0033) [2024-06-27 14:42:33,850][06674] Fps is (10 sec: 47513.9, 60 sec: 43417.7, 300 sec: 43709.2). Total num frames: 254230528. Throughput: 0: 43452.0. Samples: 157126740. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 14:42:33,850][06674] Avg episode reward: [(0, '0.332')] [2024-06-27 14:42:36,114][06909] Updated weights for policy 0, policy_version 15522 (0.0031) [2024-06-27 14:42:38,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43148.3, 300 sec: 43487.0). Total num frames: 254394368. Throughput: 0: 43496.5. Samples: 157394780. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 14:42:38,850][06674] Avg episode reward: [(0, '0.335')] [2024-06-27 14:42:39,772][06909] Updated weights for policy 0, policy_version 15532 (0.0033) [2024-06-27 14:42:43,512][06909] Updated weights for policy 0, policy_version 15542 (0.0037) [2024-06-27 14:42:43,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43694.4, 300 sec: 43542.6). Total num frames: 254640128. Throughput: 0: 43345.2. Samples: 157511660. Policy #0 lag: (min: 1.0, avg: 10.5, max: 21.0) [2024-06-27 14:42:43,850][06674] Avg episode reward: [(0, '0.335')] [2024-06-27 14:42:47,167][06909] Updated weights for policy 0, policy_version 15552 (0.0023) [2024-06-27 14:42:48,850][06674] Fps is (10 sec: 49151.8, 60 sec: 43417.5, 300 sec: 43709.5). Total num frames: 254885888. Throughput: 0: 43423.5. Samples: 157780360. Policy #0 lag: (min: 1.0, avg: 10.5, max: 21.0) [2024-06-27 14:42:48,854][06674] Avg episode reward: [(0, '0.336')] [2024-06-27 14:42:48,878][06887] Saving new best policy, reward=0.336! [2024-06-27 14:42:50,886][06909] Updated weights for policy 0, policy_version 15562 (0.0021) [2024-06-27 14:42:53,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43417.7, 300 sec: 43598.1). Total num frames: 255066112. Throughput: 0: 43582.8. Samples: 158052680. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-27 14:42:53,850][06674] Avg episode reward: [(0, '0.335')] [2024-06-27 14:42:54,878][06909] Updated weights for policy 0, policy_version 15572 (0.0022) [2024-06-27 14:42:58,361][06909] Updated weights for policy 0, policy_version 15582 (0.0044) [2024-06-27 14:42:58,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.6, 300 sec: 43542.6). Total num frames: 255295488. Throughput: 0: 43411.4. Samples: 158165040. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-27 14:42:58,851][06674] Avg episode reward: [(0, '0.334')] [2024-06-27 14:43:02,444][06909] Updated weights for policy 0, policy_version 15592 (0.0032) [2024-06-27 14:43:03,850][06674] Fps is (10 sec: 47513.4, 60 sec: 43690.6, 300 sec: 43820.3). Total num frames: 255541248. Throughput: 0: 43464.4. Samples: 158433640. Policy #0 lag: (min: 0.0, avg: 13.0, max: 23.0) [2024-06-27 14:43:03,850][06674] Avg episode reward: [(0, '0.338')] [2024-06-27 14:43:03,851][06887] Saving new best policy, reward=0.338! [2024-06-27 14:43:06,047][06909] Updated weights for policy 0, policy_version 15602 (0.0041) [2024-06-27 14:43:08,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43417.7, 300 sec: 43542.6). Total num frames: 255705088. Throughput: 0: 43551.2. Samples: 158701440. Policy #0 lag: (min: 0.0, avg: 13.0, max: 23.0) [2024-06-27 14:43:08,850][06674] Avg episode reward: [(0, '0.336')] [2024-06-27 14:43:09,984][06909] Updated weights for policy 0, policy_version 15612 (0.0040) [2024-06-27 14:43:13,763][06909] Updated weights for policy 0, policy_version 15622 (0.0037) [2024-06-27 14:43:13,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43690.6, 300 sec: 43542.5). Total num frames: 255950848. Throughput: 0: 43392.4. Samples: 158814640. Policy #0 lag: (min: 0.0, avg: 13.0, max: 23.0) [2024-06-27 14:43:13,850][06674] Avg episode reward: [(0, '0.336')] [2024-06-27 14:43:17,526][06909] Updated weights for policy 0, policy_version 15632 (0.0036) [2024-06-27 14:43:18,115][06887] Signal inference workers to stop experience collection... (2200 times) [2024-06-27 14:43:18,115][06887] Signal inference workers to resume experience collection... (2200 times) [2024-06-27 14:43:18,131][06909] InferenceWorker_p0-w0: stopping experience collection (2200 times) [2024-06-27 14:43:18,132][06909] InferenceWorker_p0-w0: resuming experience collection (2200 times) [2024-06-27 14:43:18,850][06674] Fps is (10 sec: 49151.5, 60 sec: 43417.6, 300 sec: 43820.2). Total num frames: 256196608. Throughput: 0: 43424.8. Samples: 159080860. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2024-06-27 14:43:18,850][06674] Avg episode reward: [(0, '0.337')] [2024-06-27 14:43:21,469][06909] Updated weights for policy 0, policy_version 15642 (0.0029) [2024-06-27 14:43:23,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.6, 300 sec: 43598.4). Total num frames: 256376832. Throughput: 0: 43441.2. Samples: 159349640. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2024-06-27 14:43:23,850][06674] Avg episode reward: [(0, '0.337')] [2024-06-27 14:43:25,129][06909] Updated weights for policy 0, policy_version 15652 (0.0031) [2024-06-27 14:43:28,850][06674] Fps is (10 sec: 39321.8, 60 sec: 43417.6, 300 sec: 43487.0). Total num frames: 256589824. Throughput: 0: 43403.6. Samples: 159464820. Policy #0 lag: (min: 1.0, avg: 13.2, max: 23.0) [2024-06-27 14:43:28,850][06674] Avg episode reward: [(0, '0.331')] [2024-06-27 14:43:29,243][06909] Updated weights for policy 0, policy_version 15662 (0.0043) [2024-06-27 14:43:32,656][06909] Updated weights for policy 0, policy_version 15672 (0.0032) [2024-06-27 14:43:33,850][06674] Fps is (10 sec: 47513.8, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 256851968. Throughput: 0: 43370.6. Samples: 159732040. Policy #0 lag: (min: 1.0, avg: 13.2, max: 23.0) [2024-06-27 14:43:33,851][06674] Avg episode reward: [(0, '0.332')] [2024-06-27 14:43:36,934][06909] Updated weights for policy 0, policy_version 15682 (0.0040) [2024-06-27 14:43:38,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.6, 300 sec: 43487.0). Total num frames: 257015808. Throughput: 0: 43241.7. Samples: 159998560. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-27 14:43:38,850][06674] Avg episode reward: [(0, '0.333')] [2024-06-27 14:43:40,060][06909] Updated weights for policy 0, policy_version 15692 (0.0050) [2024-06-27 14:43:43,850][06674] Fps is (10 sec: 39322.1, 60 sec: 43417.7, 300 sec: 43487.0). Total num frames: 257245184. Throughput: 0: 43277.9. Samples: 160112540. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-27 14:43:43,850][06674] Avg episode reward: [(0, '0.331')] [2024-06-27 14:43:44,434][06909] Updated weights for policy 0, policy_version 15702 (0.0044) [2024-06-27 14:43:47,450][06909] Updated weights for policy 0, policy_version 15712 (0.0031) [2024-06-27 14:43:48,852][06674] Fps is (10 sec: 49142.2, 60 sec: 43689.2, 300 sec: 43764.4). Total num frames: 257507328. Throughput: 0: 43371.4. Samples: 160385440. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-27 14:43:48,852][06674] Avg episode reward: [(0, '0.328')] [2024-06-27 14:43:48,860][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000015717_257507328.pth... [2024-06-27 14:43:48,917][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000015078_247037952.pth [2024-06-27 14:43:51,875][06909] Updated weights for policy 0, policy_version 15722 (0.0026) [2024-06-27 14:43:53,856][06674] Fps is (10 sec: 40935.2, 60 sec: 43140.2, 300 sec: 43375.1). Total num frames: 257654784. Throughput: 0: 43349.7. Samples: 160652440. Policy #0 lag: (min: 0.0, avg: 12.1, max: 20.0) [2024-06-27 14:43:53,857][06674] Avg episode reward: [(0, '0.335')] [2024-06-27 14:43:55,164][06909] Updated weights for policy 0, policy_version 15732 (0.0041) [2024-06-27 14:43:58,850][06674] Fps is (10 sec: 39329.4, 60 sec: 43417.6, 300 sec: 43487.0). Total num frames: 257900544. Throughput: 0: 43464.0. Samples: 160770520. Policy #0 lag: (min: 0.0, avg: 12.1, max: 20.0) [2024-06-27 14:43:58,850][06674] Avg episode reward: [(0, '0.336')] [2024-06-27 14:43:59,319][06909] Updated weights for policy 0, policy_version 15742 (0.0044) [2024-06-27 14:44:02,770][06909] Updated weights for policy 0, policy_version 15752 (0.0024) [2024-06-27 14:44:03,851][06674] Fps is (10 sec: 49176.5, 60 sec: 43416.9, 300 sec: 43653.5). Total num frames: 258146304. Throughput: 0: 43502.2. Samples: 161038500. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-27 14:44:03,851][06674] Avg episode reward: [(0, '0.344')] [2024-06-27 14:44:03,852][06887] Saving new best policy, reward=0.344! [2024-06-27 14:44:06,850][06909] Updated weights for policy 0, policy_version 15762 (0.0037) [2024-06-27 14:44:08,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43417.5, 300 sec: 43375.9). Total num frames: 258310144. Throughput: 0: 43413.9. Samples: 161303260. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-27 14:44:08,850][06674] Avg episode reward: [(0, '0.333')] [2024-06-27 14:44:10,281][06909] Updated weights for policy 0, policy_version 15772 (0.0040) [2024-06-27 14:44:13,850][06674] Fps is (10 sec: 40964.0, 60 sec: 43417.6, 300 sec: 43487.0). Total num frames: 258555904. Throughput: 0: 43454.2. Samples: 161420260. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 14:44:13,850][06674] Avg episode reward: [(0, '0.326')] [2024-06-27 14:44:14,765][06909] Updated weights for policy 0, policy_version 15782 (0.0045) [2024-06-27 14:44:17,707][06909] Updated weights for policy 0, policy_version 15792 (0.0038) [2024-06-27 14:44:18,850][06674] Fps is (10 sec: 49152.0, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 258801664. Throughput: 0: 43567.1. Samples: 161692560. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 14:44:18,850][06674] Avg episode reward: [(0, '0.329')] [2024-06-27 14:44:22,366][06909] Updated weights for policy 0, policy_version 15802 (0.0035) [2024-06-27 14:44:23,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43144.7, 300 sec: 43431.5). Total num frames: 258965504. Throughput: 0: 43614.3. Samples: 161961200. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 14:44:23,850][06674] Avg episode reward: [(0, '0.333')] [2024-06-27 14:44:25,195][06909] Updated weights for policy 0, policy_version 15812 (0.0023) [2024-06-27 14:44:28,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43690.6, 300 sec: 43487.0). Total num frames: 259211264. Throughput: 0: 43615.0. Samples: 162075220. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-27 14:44:28,851][06674] Avg episode reward: [(0, '0.335')] [2024-06-27 14:44:29,821][06909] Updated weights for policy 0, policy_version 15822 (0.0050) [2024-06-27 14:44:32,319][06887] Signal inference workers to stop experience collection... (2250 times) [2024-06-27 14:44:32,319][06887] Signal inference workers to resume experience collection... (2250 times) [2024-06-27 14:44:32,338][06909] InferenceWorker_p0-w0: stopping experience collection (2250 times) [2024-06-27 14:44:32,338][06909] InferenceWorker_p0-w0: resuming experience collection (2250 times) [2024-06-27 14:44:32,786][06909] Updated weights for policy 0, policy_version 15832 (0.0026) [2024-06-27 14:44:33,850][06674] Fps is (10 sec: 45875.1, 60 sec: 42871.6, 300 sec: 43542.9). Total num frames: 259424256. Throughput: 0: 43481.1. Samples: 162342000. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-27 14:44:33,850][06674] Avg episode reward: [(0, '0.337')] [2024-06-27 14:44:37,138][06909] Updated weights for policy 0, policy_version 15842 (0.0046) [2024-06-27 14:44:38,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43417.5, 300 sec: 43375.9). Total num frames: 259620864. Throughput: 0: 43551.0. Samples: 162611980. Policy #0 lag: (min: 0.0, avg: 11.9, max: 22.0) [2024-06-27 14:44:38,850][06674] Avg episode reward: [(0, '0.337')] [2024-06-27 14:44:40,251][06909] Updated weights for policy 0, policy_version 15852 (0.0030) [2024-06-27 14:44:43,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43690.6, 300 sec: 43487.0). Total num frames: 259866624. Throughput: 0: 43575.6. Samples: 162731420. Policy #0 lag: (min: 0.0, avg: 11.9, max: 22.0) [2024-06-27 14:44:43,850][06674] Avg episode reward: [(0, '0.334')] [2024-06-27 14:44:44,559][06909] Updated weights for policy 0, policy_version 15862 (0.0041) [2024-06-27 14:44:47,777][06909] Updated weights for policy 0, policy_version 15872 (0.0040) [2024-06-27 14:44:48,850][06674] Fps is (10 sec: 47513.9, 60 sec: 43146.0, 300 sec: 43598.1). Total num frames: 260096000. Throughput: 0: 43453.8. Samples: 162993880. Policy #0 lag: (min: 2.0, avg: 9.1, max: 22.0) [2024-06-27 14:44:48,850][06674] Avg episode reward: [(0, '0.338')] [2024-06-27 14:44:52,075][06909] Updated weights for policy 0, policy_version 15882 (0.0032) [2024-06-27 14:44:53,850][06674] Fps is (10 sec: 39321.4, 60 sec: 43421.9, 300 sec: 43376.0). Total num frames: 260259840. Throughput: 0: 43448.4. Samples: 163258440. Policy #0 lag: (min: 2.0, avg: 9.1, max: 22.0) [2024-06-27 14:44:53,851][06674] Avg episode reward: [(0, '0.337')] [2024-06-27 14:44:55,289][06909] Updated weights for policy 0, policy_version 15892 (0.0035) [2024-06-27 14:44:58,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.6, 300 sec: 43487.0). Total num frames: 260521984. Throughput: 0: 43519.9. Samples: 163378660. Policy #0 lag: (min: 0.0, avg: 12.2, max: 22.0) [2024-06-27 14:44:58,850][06674] Avg episode reward: [(0, '0.332')] [2024-06-27 14:44:59,375][06909] Updated weights for policy 0, policy_version 15902 (0.0030) [2024-06-27 14:45:02,648][06909] Updated weights for policy 0, policy_version 15912 (0.0037) [2024-06-27 14:45:03,850][06674] Fps is (10 sec: 49152.6, 60 sec: 43418.4, 300 sec: 43599.0). Total num frames: 260751360. Throughput: 0: 43501.9. Samples: 163650140. Policy #0 lag: (min: 0.0, avg: 12.2, max: 22.0) [2024-06-27 14:45:03,852][06674] Avg episode reward: [(0, '0.339')] [2024-06-27 14:45:06,754][06909] Updated weights for policy 0, policy_version 15922 (0.0036) [2024-06-27 14:45:08,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43690.6, 300 sec: 43431.5). Total num frames: 260931584. Throughput: 0: 43518.1. Samples: 163919520. Policy #0 lag: (min: 0.0, avg: 12.2, max: 22.0) [2024-06-27 14:45:08,850][06674] Avg episode reward: [(0, '0.347')] [2024-06-27 14:45:08,865][06887] Saving new best policy, reward=0.347! [2024-06-27 14:45:10,228][06909] Updated weights for policy 0, policy_version 15932 (0.0027) [2024-06-27 14:45:13,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43417.7, 300 sec: 43431.5). Total num frames: 261160960. Throughput: 0: 43599.2. Samples: 164037180. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 14:45:13,850][06674] Avg episode reward: [(0, '0.334')] [2024-06-27 14:45:14,805][06909] Updated weights for policy 0, policy_version 15942 (0.0048) [2024-06-27 14:45:17,874][06909] Updated weights for policy 0, policy_version 15952 (0.0033) [2024-06-27 14:45:18,850][06674] Fps is (10 sec: 47513.8, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 261406720. Throughput: 0: 43558.1. Samples: 164302120. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 14:45:18,850][06674] Avg episode reward: [(0, '0.333')] [2024-06-27 14:45:22,305][06909] Updated weights for policy 0, policy_version 15962 (0.0038) [2024-06-27 14:45:23,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43690.5, 300 sec: 43487.0). Total num frames: 261586944. Throughput: 0: 43512.0. Samples: 164570020. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 14:45:23,850][06674] Avg episode reward: [(0, '0.337')] [2024-06-27 14:45:25,331][06909] Updated weights for policy 0, policy_version 15972 (0.0038) [2024-06-27 14:45:28,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43417.5, 300 sec: 43431.4). Total num frames: 261816320. Throughput: 0: 43467.0. Samples: 164687440. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 14:45:28,850][06674] Avg episode reward: [(0, '0.347')] [2024-06-27 14:45:29,710][06909] Updated weights for policy 0, policy_version 15982 (0.0025) [2024-06-27 14:45:32,781][06909] Updated weights for policy 0, policy_version 15992 (0.0040) [2024-06-27 14:45:33,850][06674] Fps is (10 sec: 47513.8, 60 sec: 43963.7, 300 sec: 43598.4). Total num frames: 262062080. Throughput: 0: 43540.9. Samples: 164953220. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 14:45:33,850][06674] Avg episode reward: [(0, '0.342')] [2024-06-27 14:45:37,243][06909] Updated weights for policy 0, policy_version 16002 (0.0031) [2024-06-27 14:45:38,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43690.8, 300 sec: 43487.0). Total num frames: 262242304. Throughput: 0: 43602.7. Samples: 165220560. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 14:45:38,850][06674] Avg episode reward: [(0, '0.337')] [2024-06-27 14:45:40,295][06909] Updated weights for policy 0, policy_version 16012 (0.0027) [2024-06-27 14:45:43,850][06674] Fps is (10 sec: 39321.9, 60 sec: 43144.6, 300 sec: 43375.9). Total num frames: 262455296. Throughput: 0: 43547.7. Samples: 165338300. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 14:45:43,850][06674] Avg episode reward: [(0, '0.333')] [2024-06-27 14:45:44,715][06909] Updated weights for policy 0, policy_version 16022 (0.0034) [2024-06-27 14:45:47,786][06909] Updated weights for policy 0, policy_version 16032 (0.0035) [2024-06-27 14:45:48,852][06674] Fps is (10 sec: 47503.9, 60 sec: 43689.2, 300 sec: 43598.1). Total num frames: 262717440. Throughput: 0: 43530.9. Samples: 165609120. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 14:45:48,852][06674] Avg episode reward: [(0, '0.337')] [2024-06-27 14:45:48,870][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000016035_262717440.pth... [2024-06-27 14:45:48,921][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000015398_252280832.pth [2024-06-27 14:45:52,098][06887] Signal inference workers to stop experience collection... (2300 times) [2024-06-27 14:45:52,155][06909] InferenceWorker_p0-w0: stopping experience collection (2300 times) [2024-06-27 14:45:52,159][06887] Signal inference workers to resume experience collection... (2300 times) [2024-06-27 14:45:52,168][06909] InferenceWorker_p0-w0: resuming experience collection (2300 times) [2024-06-27 14:45:52,294][06909] Updated weights for policy 0, policy_version 16042 (0.0034) [2024-06-27 14:45:53,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43690.6, 300 sec: 43431.5). Total num frames: 262881280. Throughput: 0: 43410.2. Samples: 165872980. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 14:45:53,850][06674] Avg episode reward: [(0, '0.342')] [2024-06-27 14:45:55,155][06909] Updated weights for policy 0, policy_version 16052 (0.0033) [2024-06-27 14:45:58,850][06674] Fps is (10 sec: 39329.1, 60 sec: 43144.5, 300 sec: 43375.9). Total num frames: 263110656. Throughput: 0: 43505.6. Samples: 165994940. Policy #0 lag: (min: 1.0, avg: 10.3, max: 21.0) [2024-06-27 14:45:58,850][06674] Avg episode reward: [(0, '0.342')] [2024-06-27 14:45:59,995][06909] Updated weights for policy 0, policy_version 16062 (0.0038) [2024-06-27 14:46:02,768][06909] Updated weights for policy 0, policy_version 16072 (0.0025) [2024-06-27 14:46:03,850][06674] Fps is (10 sec: 49152.1, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 263372800. Throughput: 0: 43444.8. Samples: 166257140. Policy #0 lag: (min: 1.0, avg: 10.3, max: 21.0) [2024-06-27 14:46:03,850][06674] Avg episode reward: [(0, '0.337')] [2024-06-27 14:46:07,524][06909] Updated weights for policy 0, policy_version 16082 (0.0022) [2024-06-27 14:46:08,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43417.7, 300 sec: 43487.0). Total num frames: 263536640. Throughput: 0: 43451.7. Samples: 166525340. Policy #0 lag: (min: 1.0, avg: 10.3, max: 21.0) [2024-06-27 14:46:08,850][06674] Avg episode reward: [(0, '0.336')] [2024-06-27 14:46:10,210][06909] Updated weights for policy 0, policy_version 16092 (0.0037) [2024-06-27 14:46:13,850][06674] Fps is (10 sec: 37683.4, 60 sec: 43144.5, 300 sec: 43320.4). Total num frames: 263749632. Throughput: 0: 43504.5. Samples: 166645140. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-27 14:46:13,851][06674] Avg episode reward: [(0, '0.336')] [2024-06-27 14:46:15,052][06909] Updated weights for policy 0, policy_version 16102 (0.0027) [2024-06-27 14:46:17,567][06909] Updated weights for policy 0, policy_version 16112 (0.0034) [2024-06-27 14:46:18,852][06674] Fps is (10 sec: 49141.7, 60 sec: 43689.2, 300 sec: 43653.3). Total num frames: 264028160. Throughput: 0: 43548.7. Samples: 166913000. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-27 14:46:18,852][06674] Avg episode reward: [(0, '0.336')] [2024-06-27 14:46:22,704][06909] Updated weights for policy 0, policy_version 16122 (0.0032) [2024-06-27 14:46:23,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43417.6, 300 sec: 43431.8). Total num frames: 264192000. Throughput: 0: 43549.7. Samples: 167180300. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 14:46:23,851][06674] Avg episode reward: [(0, '0.333')] [2024-06-27 14:46:25,111][06909] Updated weights for policy 0, policy_version 16132 (0.0041) [2024-06-27 14:46:28,850][06674] Fps is (10 sec: 37690.7, 60 sec: 43144.6, 300 sec: 43320.4). Total num frames: 264404992. Throughput: 0: 43514.6. Samples: 167296460. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 14:46:28,850][06674] Avg episode reward: [(0, '0.332')] [2024-06-27 14:46:30,217][06909] Updated weights for policy 0, policy_version 16142 (0.0036) [2024-06-27 14:46:32,630][06909] Updated weights for policy 0, policy_version 16152 (0.0038) [2024-06-27 14:46:33,850][06674] Fps is (10 sec: 47514.1, 60 sec: 43417.7, 300 sec: 43598.9). Total num frames: 264667136. Throughput: 0: 43333.1. Samples: 167559020. Policy #0 lag: (min: 1.0, avg: 11.9, max: 24.0) [2024-06-27 14:46:33,850][06674] Avg episode reward: [(0, '0.333')] [2024-06-27 14:46:37,671][06909] Updated weights for policy 0, policy_version 16162 (0.0042) [2024-06-27 14:46:38,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43417.6, 300 sec: 43487.8). Total num frames: 264847360. Throughput: 0: 43481.5. Samples: 167829640. Policy #0 lag: (min: 1.0, avg: 11.9, max: 24.0) [2024-06-27 14:46:38,850][06674] Avg episode reward: [(0, '0.339')] [2024-06-27 14:46:40,479][06909] Updated weights for policy 0, policy_version 16172 (0.0033) [2024-06-27 14:46:43,850][06674] Fps is (10 sec: 39321.3, 60 sec: 43417.6, 300 sec: 43320.4). Total num frames: 265060352. Throughput: 0: 43421.4. Samples: 167948900. Policy #0 lag: (min: 1.0, avg: 11.9, max: 24.0) [2024-06-27 14:46:43,853][06674] Avg episode reward: [(0, '0.338')] [2024-06-27 14:46:45,229][06909] Updated weights for policy 0, policy_version 16182 (0.0034) [2024-06-27 14:46:47,999][06909] Updated weights for policy 0, policy_version 16192 (0.0030) [2024-06-27 14:46:48,850][06674] Fps is (10 sec: 47513.5, 60 sec: 43419.1, 300 sec: 43598.1). Total num frames: 265322496. Throughput: 0: 43602.7. Samples: 168219260. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 14:46:48,850][06674] Avg episode reward: [(0, '0.337')] [2024-06-27 14:46:52,550][06909] Updated weights for policy 0, policy_version 16202 (0.0033) [2024-06-27 14:46:53,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.8, 300 sec: 43542.6). Total num frames: 265519104. Throughput: 0: 43604.8. Samples: 168487560. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 14:46:53,851][06674] Avg episode reward: [(0, '0.341')] [2024-06-27 14:46:55,405][06909] Updated weights for policy 0, policy_version 16212 (0.0031) [2024-06-27 14:46:58,850][06674] Fps is (10 sec: 39321.5, 60 sec: 43417.7, 300 sec: 43375.9). Total num frames: 265715712. Throughput: 0: 43650.7. Samples: 168609420. Policy #0 lag: (min: 0.0, avg: 13.3, max: 25.0) [2024-06-27 14:46:58,851][06674] Avg episode reward: [(0, '0.333')] [2024-06-27 14:46:59,920][06909] Updated weights for policy 0, policy_version 16222 (0.0030) [2024-06-27 14:47:02,235][06887] Signal inference workers to stop experience collection... (2350 times) [2024-06-27 14:47:02,237][06887] Signal inference workers to resume experience collection... (2350 times) [2024-06-27 14:47:02,258][06909] InferenceWorker_p0-w0: stopping experience collection (2350 times) [2024-06-27 14:47:02,258][06909] InferenceWorker_p0-w0: resuming experience collection (2350 times) [2024-06-27 14:47:02,876][06909] Updated weights for policy 0, policy_version 16232 (0.0027) [2024-06-27 14:47:03,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43417.7, 300 sec: 43653.6). Total num frames: 265977856. Throughput: 0: 43600.2. Samples: 168874920. Policy #0 lag: (min: 0.0, avg: 13.3, max: 25.0) [2024-06-27 14:47:03,850][06674] Avg episode reward: [(0, '0.335')] [2024-06-27 14:47:07,439][06909] Updated weights for policy 0, policy_version 16242 (0.0028) [2024-06-27 14:47:08,852][06674] Fps is (10 sec: 44228.0, 60 sec: 43689.2, 300 sec: 43486.7). Total num frames: 266158080. Throughput: 0: 43560.3. Samples: 169140600. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-27 14:47:08,853][06674] Avg episode reward: [(0, '0.338')] [2024-06-27 14:47:10,287][06909] Updated weights for policy 0, policy_version 16252 (0.0025) [2024-06-27 14:47:13,850][06674] Fps is (10 sec: 39321.5, 60 sec: 43690.7, 300 sec: 43320.4). Total num frames: 266371072. Throughput: 0: 43654.7. Samples: 169260920. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-27 14:47:13,850][06674] Avg episode reward: [(0, '0.340')] [2024-06-27 14:47:15,176][06909] Updated weights for policy 0, policy_version 16262 (0.0031) [2024-06-27 14:47:17,869][06909] Updated weights for policy 0, policy_version 16272 (0.0043) [2024-06-27 14:47:18,850][06674] Fps is (10 sec: 47523.0, 60 sec: 43419.0, 300 sec: 43653.6). Total num frames: 266633216. Throughput: 0: 43754.6. Samples: 169527980. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-27 14:47:18,850][06674] Avg episode reward: [(0, '0.340')] [2024-06-27 14:47:22,846][06909] Updated weights for policy 0, policy_version 16282 (0.0033) [2024-06-27 14:47:23,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.7, 300 sec: 43487.0). Total num frames: 266813440. Throughput: 0: 43557.7. Samples: 169789740. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 14:47:23,850][06674] Avg episode reward: [(0, '0.337')] [2024-06-27 14:47:25,520][06909] Updated weights for policy 0, policy_version 16292 (0.0041) [2024-06-27 14:47:28,850][06674] Fps is (10 sec: 39321.3, 60 sec: 43690.6, 300 sec: 43375.9). Total num frames: 267026432. Throughput: 0: 43551.0. Samples: 169908700. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 14:47:28,850][06674] Avg episode reward: [(0, '0.337')] [2024-06-27 14:47:30,485][06909] Updated weights for policy 0, policy_version 16302 (0.0036) [2024-06-27 14:47:32,914][06909] Updated weights for policy 0, policy_version 16312 (0.0033) [2024-06-27 14:47:33,850][06674] Fps is (10 sec: 47514.1, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 267288576. Throughput: 0: 43473.8. Samples: 170175580. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 14:47:33,850][06674] Avg episode reward: [(0, '0.338')] [2024-06-27 14:47:37,976][06909] Updated weights for policy 0, policy_version 16322 (0.0030) [2024-06-27 14:47:38,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43690.6, 300 sec: 43487.0). Total num frames: 267468800. Throughput: 0: 43554.7. Samples: 170447520. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 14:47:38,850][06674] Avg episode reward: [(0, '0.341')] [2024-06-27 14:47:40,278][06909] Updated weights for policy 0, policy_version 16332 (0.0030) [2024-06-27 14:47:43,850][06674] Fps is (10 sec: 39321.3, 60 sec: 43690.7, 300 sec: 43375.9). Total num frames: 267681792. Throughput: 0: 43537.4. Samples: 170568600. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 14:47:43,850][06674] Avg episode reward: [(0, '0.335')] [2024-06-27 14:47:45,387][06909] Updated weights for policy 0, policy_version 16342 (0.0032) [2024-06-27 14:47:47,958][06909] Updated weights for policy 0, policy_version 16352 (0.0023) [2024-06-27 14:47:48,850][06674] Fps is (10 sec: 47514.0, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 267943936. Throughput: 0: 43487.6. Samples: 170831860. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 14:47:48,850][06674] Avg episode reward: [(0, '0.342')] [2024-06-27 14:47:48,864][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000016354_267943936.pth... [2024-06-27 14:47:48,926][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000015717_257507328.pth [2024-06-27 14:47:52,816][06909] Updated weights for policy 0, policy_version 16362 (0.0047) [2024-06-27 14:47:53,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43417.6, 300 sec: 43487.0). Total num frames: 268124160. Throughput: 0: 43627.3. Samples: 171103740. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 14:47:53,850][06674] Avg episode reward: [(0, '0.348')] [2024-06-27 14:47:53,851][06887] Saving new best policy, reward=0.348! [2024-06-27 14:47:55,728][06909] Updated weights for policy 0, policy_version 16372 (0.0032) [2024-06-27 14:47:58,850][06674] Fps is (10 sec: 39321.3, 60 sec: 43690.7, 300 sec: 43375.9). Total num frames: 268337152. Throughput: 0: 43531.6. Samples: 171219840. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-27 14:47:58,850][06674] Avg episode reward: [(0, '0.344')] [2024-06-27 14:48:00,317][06909] Updated weights for policy 0, policy_version 16382 (0.0028) [2024-06-27 14:48:03,214][06909] Updated weights for policy 0, policy_version 16392 (0.0041) [2024-06-27 14:48:03,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43417.6, 300 sec: 43653.6). Total num frames: 268582912. Throughput: 0: 43408.1. Samples: 171481340. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-27 14:48:03,850][06674] Avg episode reward: [(0, '0.343')] [2024-06-27 14:48:08,087][06909] Updated weights for policy 0, policy_version 16402 (0.0032) [2024-06-27 14:48:08,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43692.2, 300 sec: 43487.0). Total num frames: 268779520. Throughput: 0: 43678.8. Samples: 171755280. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-27 14:48:08,850][06674] Avg episode reward: [(0, '0.343')] [2024-06-27 14:48:10,528][06909] Updated weights for policy 0, policy_version 16412 (0.0032) [2024-06-27 14:48:13,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43690.8, 300 sec: 43376.0). Total num frames: 268992512. Throughput: 0: 43750.0. Samples: 171877440. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-27 14:48:13,850][06674] Avg episode reward: [(0, '0.346')] [2024-06-27 14:48:15,281][06887] Signal inference workers to stop experience collection... (2400 times) [2024-06-27 14:48:15,331][06909] InferenceWorker_p0-w0: stopping experience collection (2400 times) [2024-06-27 14:48:15,336][06887] Signal inference workers to resume experience collection... (2400 times) [2024-06-27 14:48:15,347][06909] InferenceWorker_p0-w0: resuming experience collection (2400 times) [2024-06-27 14:48:15,500][06909] Updated weights for policy 0, policy_version 16422 (0.0039) [2024-06-27 14:48:18,069][06909] Updated weights for policy 0, policy_version 16432 (0.0033) [2024-06-27 14:48:18,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43417.7, 300 sec: 43598.1). Total num frames: 269238272. Throughput: 0: 43713.4. Samples: 172142680. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-27 14:48:18,850][06674] Avg episode reward: [(0, '0.349')] [2024-06-27 14:48:22,920][06909] Updated weights for policy 0, policy_version 16442 (0.0044) [2024-06-27 14:48:23,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43690.7, 300 sec: 43542.6). Total num frames: 269434880. Throughput: 0: 43546.2. Samples: 172407100. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-27 14:48:23,850][06674] Avg episode reward: [(0, '0.343')] [2024-06-27 14:48:25,510][06909] Updated weights for policy 0, policy_version 16452 (0.0022) [2024-06-27 14:48:28,850][06674] Fps is (10 sec: 42597.6, 60 sec: 43963.8, 300 sec: 43431.5). Total num frames: 269664256. Throughput: 0: 43601.7. Samples: 172530680. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-27 14:48:28,859][06674] Avg episode reward: [(0, '0.345')] [2024-06-27 14:48:30,346][06909] Updated weights for policy 0, policy_version 16462 (0.0031) [2024-06-27 14:48:33,036][06909] Updated weights for policy 0, policy_version 16472 (0.0039) [2024-06-27 14:48:33,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43417.5, 300 sec: 43653.6). Total num frames: 269893632. Throughput: 0: 43724.8. Samples: 172799480. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 14:48:33,850][06674] Avg episode reward: [(0, '0.342')] [2024-06-27 14:48:37,770][06909] Updated weights for policy 0, policy_version 16482 (0.0037) [2024-06-27 14:48:38,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43690.7, 300 sec: 43542.5). Total num frames: 270090240. Throughput: 0: 43558.7. Samples: 173063880. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 14:48:38,850][06674] Avg episode reward: [(0, '0.346')] [2024-06-27 14:48:40,608][06909] Updated weights for policy 0, policy_version 16492 (0.0034) [2024-06-27 14:48:43,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43690.7, 300 sec: 43376.2). Total num frames: 270303232. Throughput: 0: 43745.4. Samples: 173188380. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-27 14:48:43,850][06674] Avg episode reward: [(0, '0.344')] [2024-06-27 14:48:45,170][06909] Updated weights for policy 0, policy_version 16502 (0.0037) [2024-06-27 14:48:48,102][06909] Updated weights for policy 0, policy_version 16512 (0.0041) [2024-06-27 14:48:48,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43417.5, 300 sec: 43710.1). Total num frames: 270548992. Throughput: 0: 43758.1. Samples: 173450460. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-27 14:48:48,850][06674] Avg episode reward: [(0, '0.349')] [2024-06-27 14:48:52,753][06909] Updated weights for policy 0, policy_version 16522 (0.0023) [2024-06-27 14:48:53,852][06674] Fps is (10 sec: 42589.9, 60 sec: 43416.2, 300 sec: 43486.7). Total num frames: 270729216. Throughput: 0: 43656.2. Samples: 173719900. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-27 14:48:53,852][06674] Avg episode reward: [(0, '0.348')] [2024-06-27 14:48:55,719][06909] Updated weights for policy 0, policy_version 16532 (0.0043) [2024-06-27 14:48:58,850][06674] Fps is (10 sec: 39321.7, 60 sec: 43417.6, 300 sec: 43376.1). Total num frames: 270942208. Throughput: 0: 43692.3. Samples: 173843600. Policy #0 lag: (min: 0.0, avg: 12.3, max: 22.0) [2024-06-27 14:48:58,850][06674] Avg episode reward: [(0, '0.339')] [2024-06-27 14:49:00,131][06909] Updated weights for policy 0, policy_version 16542 (0.0029) [2024-06-27 14:49:03,176][06909] Updated weights for policy 0, policy_version 16552 (0.0042) [2024-06-27 14:49:03,852][06674] Fps is (10 sec: 47513.5, 60 sec: 43689.1, 300 sec: 43708.9). Total num frames: 271204352. Throughput: 0: 43628.2. Samples: 174106040. Policy #0 lag: (min: 0.0, avg: 12.3, max: 22.0) [2024-06-27 14:49:03,853][06674] Avg episode reward: [(0, '0.340')] [2024-06-27 14:49:08,015][06909] Updated weights for policy 0, policy_version 16562 (0.0037) [2024-06-27 14:49:08,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43690.6, 300 sec: 43542.6). Total num frames: 271400960. Throughput: 0: 43669.4. Samples: 174372220. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-27 14:49:08,850][06674] Avg episode reward: [(0, '0.347')] [2024-06-27 14:49:09,755][06887] Signal inference workers to stop experience collection... (2450 times) [2024-06-27 14:49:09,805][06887] Signal inference workers to resume experience collection... (2450 times) [2024-06-27 14:49:09,806][06909] InferenceWorker_p0-w0: stopping experience collection (2450 times) [2024-06-27 14:49:09,830][06909] InferenceWorker_p0-w0: resuming experience collection (2450 times) [2024-06-27 14:49:10,627][06909] Updated weights for policy 0, policy_version 16572 (0.0049) [2024-06-27 14:49:13,850][06674] Fps is (10 sec: 40968.5, 60 sec: 43690.6, 300 sec: 43431.5). Total num frames: 271613952. Throughput: 0: 43671.3. Samples: 174495880. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-27 14:49:13,850][06674] Avg episode reward: [(0, '0.348')] [2024-06-27 14:49:15,519][06909] Updated weights for policy 0, policy_version 16582 (0.0027) [2024-06-27 14:49:18,207][06909] Updated weights for policy 0, policy_version 16592 (0.0031) [2024-06-27 14:49:18,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 271859712. Throughput: 0: 43457.0. Samples: 174755040. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-27 14:49:18,850][06674] Avg episode reward: [(0, '0.352')] [2024-06-27 14:49:18,859][06887] Saving new best policy, reward=0.352! [2024-06-27 14:49:22,976][06909] Updated weights for policy 0, policy_version 16602 (0.0034) [2024-06-27 14:49:23,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.8, 300 sec: 43542.6). Total num frames: 272056320. Throughput: 0: 43632.1. Samples: 175027320. Policy #0 lag: (min: 0.0, avg: 12.1, max: 21.0) [2024-06-27 14:49:23,850][06674] Avg episode reward: [(0, '0.353')] [2024-06-27 14:49:25,792][06909] Updated weights for policy 0, policy_version 16612 (0.0041) [2024-06-27 14:49:28,856][06674] Fps is (10 sec: 40934.9, 60 sec: 43413.3, 300 sec: 43541.7). Total num frames: 272269312. Throughput: 0: 43541.2. Samples: 175148000. Policy #0 lag: (min: 0.0, avg: 12.1, max: 21.0) [2024-06-27 14:49:28,856][06674] Avg episode reward: [(0, '0.351')] [2024-06-27 14:49:30,394][06909] Updated weights for policy 0, policy_version 16622 (0.0032) [2024-06-27 14:49:33,412][06909] Updated weights for policy 0, policy_version 16632 (0.0029) [2024-06-27 14:49:33,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 272515072. Throughput: 0: 43491.6. Samples: 175407580. Policy #0 lag: (min: 0.0, avg: 9.7, max: 24.0) [2024-06-27 14:49:33,850][06674] Avg episode reward: [(0, '0.352')] [2024-06-27 14:49:38,242][06909] Updated weights for policy 0, policy_version 16642 (0.0031) [2024-06-27 14:49:38,850][06674] Fps is (10 sec: 42623.9, 60 sec: 43417.6, 300 sec: 43487.0). Total num frames: 272695296. Throughput: 0: 43617.0. Samples: 175682580. Policy #0 lag: (min: 0.0, avg: 9.7, max: 24.0) [2024-06-27 14:49:38,850][06674] Avg episode reward: [(0, '0.345')] [2024-06-27 14:49:40,886][06909] Updated weights for policy 0, policy_version 16652 (0.0040) [2024-06-27 14:49:43,850][06674] Fps is (10 sec: 39321.6, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 272908288. Throughput: 0: 43620.5. Samples: 175806520. Policy #0 lag: (min: 0.0, avg: 9.7, max: 24.0) [2024-06-27 14:49:43,850][06674] Avg episode reward: [(0, '0.347')] [2024-06-27 14:49:45,487][06909] Updated weights for policy 0, policy_version 16662 (0.0031) [2024-06-27 14:49:48,138][06909] Updated weights for policy 0, policy_version 16672 (0.0024) [2024-06-27 14:49:48,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 273154048. Throughput: 0: 43605.0. Samples: 176068180. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-27 14:49:48,850][06674] Avg episode reward: [(0, '0.342')] [2024-06-27 14:49:48,867][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000016672_273154048.pth... [2024-06-27 14:49:48,912][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000016035_262717440.pth [2024-06-27 14:49:53,087][06909] Updated weights for policy 0, policy_version 16682 (0.0034) [2024-06-27 14:49:53,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43965.2, 300 sec: 43542.6). Total num frames: 273367040. Throughput: 0: 43675.1. Samples: 176337600. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-27 14:49:53,851][06674] Avg episode reward: [(0, '0.350')] [2024-06-27 14:49:55,652][06909] Updated weights for policy 0, policy_version 16692 (0.0050) [2024-06-27 14:49:58,852][06674] Fps is (10 sec: 42590.0, 60 sec: 43962.3, 300 sec: 43486.7). Total num frames: 273580032. Throughput: 0: 43657.1. Samples: 176460540. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-27 14:49:58,852][06674] Avg episode reward: [(0, '0.347')] [2024-06-27 14:50:00,504][06909] Updated weights for policy 0, policy_version 16702 (0.0034) [2024-06-27 14:50:03,513][06909] Updated weights for policy 0, policy_version 16712 (0.0039) [2024-06-27 14:50:03,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43419.0, 300 sec: 43653.6). Total num frames: 273809408. Throughput: 0: 43588.3. Samples: 176716520. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-27 14:50:03,850][06674] Avg episode reward: [(0, '0.349')] [2024-06-27 14:50:07,884][06887] Signal inference workers to stop experience collection... (2500 times) [2024-06-27 14:50:07,884][06887] Signal inference workers to resume experience collection... (2500 times) [2024-06-27 14:50:07,897][06909] InferenceWorker_p0-w0: stopping experience collection (2500 times) [2024-06-27 14:50:07,906][06909] InferenceWorker_p0-w0: resuming experience collection (2500 times) [2024-06-27 14:50:08,033][06909] Updated weights for policy 0, policy_version 16722 (0.0036) [2024-06-27 14:50:08,850][06674] Fps is (10 sec: 44245.6, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 274022400. Throughput: 0: 43624.3. Samples: 176990420. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-27 14:50:08,850][06674] Avg episode reward: [(0, '0.350')] [2024-06-27 14:50:10,895][06909] Updated weights for policy 0, policy_version 16732 (0.0035) [2024-06-27 14:50:13,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43690.7, 300 sec: 43487.0). Total num frames: 274235392. Throughput: 0: 43761.5. Samples: 177117000. Policy #0 lag: (min: 2.0, avg: 10.8, max: 22.0) [2024-06-27 14:50:13,850][06674] Avg episode reward: [(0, '0.351')] [2024-06-27 14:50:15,416][06909] Updated weights for policy 0, policy_version 16742 (0.0038) [2024-06-27 14:50:18,299][06909] Updated weights for policy 0, policy_version 16752 (0.0030) [2024-06-27 14:50:18,852][06674] Fps is (10 sec: 44227.9, 60 sec: 43416.1, 300 sec: 43653.3). Total num frames: 274464768. Throughput: 0: 43700.2. Samples: 177374180. Policy #0 lag: (min: 2.0, avg: 10.8, max: 22.0) [2024-06-27 14:50:18,852][06674] Avg episode reward: [(0, '0.344')] [2024-06-27 14:50:22,955][06909] Updated weights for policy 0, policy_version 16762 (0.0038) [2024-06-27 14:50:23,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 274677760. Throughput: 0: 43479.2. Samples: 177639140. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 14:50:23,852][06674] Avg episode reward: [(0, '0.354')] [2024-06-27 14:50:23,853][06887] Saving new best policy, reward=0.354! [2024-06-27 14:50:26,323][06909] Updated weights for policy 0, policy_version 16772 (0.0029) [2024-06-27 14:50:28,850][06674] Fps is (10 sec: 40968.0, 60 sec: 43421.9, 300 sec: 43431.5). Total num frames: 274874368. Throughput: 0: 43656.3. Samples: 177771060. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 14:50:28,851][06674] Avg episode reward: [(0, '0.354')] [2024-06-27 14:50:30,535][06909] Updated weights for policy 0, policy_version 16782 (0.0027) [2024-06-27 14:50:33,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43144.6, 300 sec: 43598.1). Total num frames: 275103744. Throughput: 0: 43370.8. Samples: 178019860. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-27 14:50:33,850][06674] Avg episode reward: [(0, '0.352')] [2024-06-27 14:50:33,933][06909] Updated weights for policy 0, policy_version 16792 (0.0043) [2024-06-27 14:50:37,877][06909] Updated weights for policy 0, policy_version 16802 (0.0026) [2024-06-27 14:50:38,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.7, 300 sec: 43653.6). Total num frames: 275333120. Throughput: 0: 43369.2. Samples: 178289220. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-27 14:50:38,851][06674] Avg episode reward: [(0, '0.353')] [2024-06-27 14:50:41,446][06909] Updated weights for policy 0, policy_version 16812 (0.0036) [2024-06-27 14:50:43,851][06674] Fps is (10 sec: 42592.5, 60 sec: 43689.7, 300 sec: 43431.6). Total num frames: 275529728. Throughput: 0: 43663.4. Samples: 178425360. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-27 14:50:43,852][06674] Avg episode reward: [(0, '0.343')] [2024-06-27 14:50:45,299][06909] Updated weights for policy 0, policy_version 16822 (0.0028) [2024-06-27 14:50:48,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43417.5, 300 sec: 43653.6). Total num frames: 275759104. Throughput: 0: 43470.6. Samples: 178672700. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-27 14:50:48,850][06674] Avg episode reward: [(0, '0.349')] [2024-06-27 14:50:49,088][06909] Updated weights for policy 0, policy_version 16832 (0.0032) [2024-06-27 14:50:52,990][06909] Updated weights for policy 0, policy_version 16842 (0.0035) [2024-06-27 14:50:53,850][06674] Fps is (10 sec: 45881.3, 60 sec: 43690.7, 300 sec: 43653.7). Total num frames: 275988480. Throughput: 0: 43361.4. Samples: 178941680. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-27 14:50:53,850][06674] Avg episode reward: [(0, '0.352')] [2024-06-27 14:50:56,665][06909] Updated weights for policy 0, policy_version 16852 (0.0044) [2024-06-27 14:50:58,850][06674] Fps is (10 sec: 42599.3, 60 sec: 43419.1, 300 sec: 43431.5). Total num frames: 276185088. Throughput: 0: 43437.3. Samples: 179071680. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-27 14:50:58,850][06674] Avg episode reward: [(0, '0.349')] [2024-06-27 14:51:00,494][06909] Updated weights for policy 0, policy_version 16862 (0.0035) [2024-06-27 14:51:03,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43417.7, 300 sec: 43653.6). Total num frames: 276414464. Throughput: 0: 43308.2. Samples: 179322960. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-27 14:51:03,850][06674] Avg episode reward: [(0, '0.354')] [2024-06-27 14:51:04,115][06909] Updated weights for policy 0, policy_version 16872 (0.0034) [2024-06-27 14:51:08,119][06909] Updated weights for policy 0, policy_version 16882 (0.0033) [2024-06-27 14:51:08,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43417.7, 300 sec: 43653.7). Total num frames: 276627456. Throughput: 0: 43590.7. Samples: 179600720. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-27 14:51:08,850][06674] Avg episode reward: [(0, '0.356')] [2024-06-27 14:51:08,959][06887] Saving new best policy, reward=0.356! [2024-06-27 14:51:11,585][06909] Updated weights for policy 0, policy_version 16892 (0.0031) [2024-06-27 14:51:13,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43417.6, 300 sec: 43431.8). Total num frames: 276840448. Throughput: 0: 43450.8. Samples: 179726340. Policy #0 lag: (min: 1.0, avg: 12.8, max: 22.0) [2024-06-27 14:51:13,850][06674] Avg episode reward: [(0, '0.354')] [2024-06-27 14:51:15,664][06909] Updated weights for policy 0, policy_version 16902 (0.0032) [2024-06-27 14:51:18,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43419.1, 300 sec: 43653.7). Total num frames: 277069824. Throughput: 0: 43442.6. Samples: 179974780. Policy #0 lag: (min: 1.0, avg: 12.8, max: 22.0) [2024-06-27 14:51:18,850][06674] Avg episode reward: [(0, '0.355')] [2024-06-27 14:51:19,127][06909] Updated weights for policy 0, policy_version 16912 (0.0033) [2024-06-27 14:51:23,277][06909] Updated weights for policy 0, policy_version 16922 (0.0034) [2024-06-27 14:51:23,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43417.6, 300 sec: 43653.7). Total num frames: 277282816. Throughput: 0: 43487.8. Samples: 180246160. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2024-06-27 14:51:23,850][06674] Avg episode reward: [(0, '0.356')] [2024-06-27 14:51:26,576][06909] Updated weights for policy 0, policy_version 16932 (0.0042) [2024-06-27 14:51:28,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43690.7, 300 sec: 43487.0). Total num frames: 277495808. Throughput: 0: 43247.8. Samples: 180371460. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2024-06-27 14:51:28,850][06674] Avg episode reward: [(0, '0.359')] [2024-06-27 14:51:28,855][06887] Saving new best policy, reward=0.359! [2024-06-27 14:51:30,850][06909] Updated weights for policy 0, policy_version 16942 (0.0033) [2024-06-27 14:51:33,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 277725184. Throughput: 0: 43449.9. Samples: 180627940. Policy #0 lag: (min: 0.0, avg: 8.0, max: 20.0) [2024-06-27 14:51:33,850][06674] Avg episode reward: [(0, '0.355')] [2024-06-27 14:51:34,063][06909] Updated weights for policy 0, policy_version 16952 (0.0036) [2024-06-27 14:51:38,199][06887] Signal inference workers to stop experience collection... (2550 times) [2024-06-27 14:51:38,200][06887] Signal inference workers to resume experience collection... (2550 times) [2024-06-27 14:51:38,241][06909] InferenceWorker_p0-w0: stopping experience collection (2550 times) [2024-06-27 14:51:38,242][06909] InferenceWorker_p0-w0: resuming experience collection (2550 times) [2024-06-27 14:51:38,342][06909] Updated weights for policy 0, policy_version 16962 (0.0032) [2024-06-27 14:51:38,850][06674] Fps is (10 sec: 45876.0, 60 sec: 43690.8, 300 sec: 43709.2). Total num frames: 277954560. Throughput: 0: 43447.6. Samples: 180896820. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-27 14:51:38,850][06674] Avg episode reward: [(0, '0.363')] [2024-06-27 14:51:38,856][06887] Saving new best policy, reward=0.363! [2024-06-27 14:51:41,523][06909] Updated weights for policy 0, policy_version 16972 (0.0031) [2024-06-27 14:51:43,850][06674] Fps is (10 sec: 39322.2, 60 sec: 43145.5, 300 sec: 43376.0). Total num frames: 278118400. Throughput: 0: 43388.0. Samples: 181024140. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-27 14:51:43,850][06674] Avg episode reward: [(0, '0.356')] [2024-06-27 14:51:45,783][06909] Updated weights for policy 0, policy_version 16982 (0.0022) [2024-06-27 14:51:48,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 278380544. Throughput: 0: 43623.0. Samples: 181286000. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-27 14:51:48,850][06674] Avg episode reward: [(0, '0.357')] [2024-06-27 14:51:48,856][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000016991_278380544.pth... [2024-06-27 14:51:48,916][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000016354_267943936.pth [2024-06-27 14:51:49,438][06909] Updated weights for policy 0, policy_version 16992 (0.0037) [2024-06-27 14:51:53,257][06909] Updated weights for policy 0, policy_version 17002 (0.0054) [2024-06-27 14:51:53,850][06674] Fps is (10 sec: 50790.6, 60 sec: 43963.8, 300 sec: 43764.7). Total num frames: 278626304. Throughput: 0: 43346.3. Samples: 181551300. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-27 14:51:53,850][06674] Avg episode reward: [(0, '0.355')] [2024-06-27 14:51:56,967][06909] Updated weights for policy 0, policy_version 17012 (0.0030) [2024-06-27 14:51:58,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 278790144. Throughput: 0: 43444.4. Samples: 181681340. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-27 14:51:58,850][06674] Avg episode reward: [(0, '0.351')] [2024-06-27 14:52:00,652][06909] Updated weights for policy 0, policy_version 17022 (0.0038) [2024-06-27 14:52:03,850][06674] Fps is (10 sec: 39321.2, 60 sec: 43417.6, 300 sec: 43598.4). Total num frames: 279019520. Throughput: 0: 43636.0. Samples: 181938400. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2024-06-27 14:52:03,850][06674] Avg episode reward: [(0, '0.356')] [2024-06-27 14:52:04,465][06909] Updated weights for policy 0, policy_version 17032 (0.0035) [2024-06-27 14:52:08,292][06909] Updated weights for policy 0, policy_version 17042 (0.0023) [2024-06-27 14:52:08,850][06674] Fps is (10 sec: 47513.5, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 279265280. Throughput: 0: 43438.1. Samples: 182200880. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2024-06-27 14:52:08,850][06674] Avg episode reward: [(0, '0.359')] [2024-06-27 14:52:11,947][06909] Updated weights for policy 0, policy_version 17052 (0.0029) [2024-06-27 14:52:13,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43144.5, 300 sec: 43375.9). Total num frames: 279429120. Throughput: 0: 43541.9. Samples: 182330840. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 14:52:13,850][06674] Avg episode reward: [(0, '0.358')] [2024-06-27 14:52:15,780][06909] Updated weights for policy 0, policy_version 17062 (0.0039) [2024-06-27 14:52:18,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 279691264. Throughput: 0: 43575.6. Samples: 182588840. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 14:52:18,851][06674] Avg episode reward: [(0, '0.357')] [2024-06-27 14:52:19,432][06909] Updated weights for policy 0, policy_version 17072 (0.0026) [2024-06-27 14:52:23,227][06909] Updated weights for policy 0, policy_version 17082 (0.0041) [2024-06-27 14:52:23,850][06674] Fps is (10 sec: 47513.8, 60 sec: 43690.6, 300 sec: 43653.7). Total num frames: 279904256. Throughput: 0: 43506.2. Samples: 182854600. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 14:52:23,850][06674] Avg episode reward: [(0, '0.357')] [2024-06-27 14:52:27,490][06909] Updated weights for policy 0, policy_version 17092 (0.0037) [2024-06-27 14:52:28,850][06674] Fps is (10 sec: 39321.7, 60 sec: 43144.6, 300 sec: 43375.9). Total num frames: 280084480. Throughput: 0: 43519.4. Samples: 182982520. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 14:52:28,851][06674] Avg episode reward: [(0, '0.361')] [2024-06-27 14:52:30,978][06909] Updated weights for policy 0, policy_version 17102 (0.0029) [2024-06-27 14:52:33,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 280330240. Throughput: 0: 43424.9. Samples: 183240120. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 14:52:33,850][06674] Avg episode reward: [(0, '0.356')] [2024-06-27 14:52:34,910][06909] Updated weights for policy 0, policy_version 17112 (0.0037) [2024-06-27 14:52:38,670][06909] Updated weights for policy 0, policy_version 17122 (0.0031) [2024-06-27 14:52:38,850][06674] Fps is (10 sec: 44237.3, 60 sec: 42871.5, 300 sec: 43542.6). Total num frames: 280526848. Throughput: 0: 43554.6. Samples: 183511260. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-27 14:52:38,850][06674] Avg episode reward: [(0, '0.355')] [2024-06-27 14:52:42,358][06887] Signal inference workers to stop experience collection... (2600 times) [2024-06-27 14:52:42,367][06887] Signal inference workers to resume experience collection... (2600 times) [2024-06-27 14:52:42,372][06909] InferenceWorker_p0-w0: stopping experience collection (2600 times) [2024-06-27 14:52:42,375][06909] Updated weights for policy 0, policy_version 17132 (0.0033) [2024-06-27 14:52:42,389][06909] InferenceWorker_p0-w0: resuming experience collection (2600 times) [2024-06-27 14:52:43,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.6, 300 sec: 43375.9). Total num frames: 280739840. Throughput: 0: 43382.2. Samples: 183633540. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-27 14:52:43,850][06674] Avg episode reward: [(0, '0.358')] [2024-06-27 14:52:46,230][06909] Updated weights for policy 0, policy_version 17142 (0.0039) [2024-06-27 14:52:48,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43417.7, 300 sec: 43598.1). Total num frames: 280985600. Throughput: 0: 43389.4. Samples: 183890920. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 14:52:48,850][06674] Avg episode reward: [(0, '0.359')] [2024-06-27 14:52:49,725][06909] Updated weights for policy 0, policy_version 17152 (0.0033) [2024-06-27 14:52:53,734][06909] Updated weights for policy 0, policy_version 17162 (0.0039) [2024-06-27 14:52:53,850][06674] Fps is (10 sec: 44237.2, 60 sec: 42598.4, 300 sec: 43542.6). Total num frames: 281182208. Throughput: 0: 43591.2. Samples: 184162480. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 14:52:53,850][06674] Avg episode reward: [(0, '0.359')] [2024-06-27 14:52:57,301][06909] Updated weights for policy 0, policy_version 17172 (0.0032) [2024-06-27 14:52:58,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 281395200. Throughput: 0: 43353.9. Samples: 184281760. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 14:52:58,850][06674] Avg episode reward: [(0, '0.359')] [2024-06-27 14:53:01,207][06909] Updated weights for policy 0, policy_version 17182 (0.0033) [2024-06-27 14:53:03,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 281640960. Throughput: 0: 43315.2. Samples: 184538020. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2024-06-27 14:53:03,850][06674] Avg episode reward: [(0, '0.359')] [2024-06-27 14:53:05,013][06909] Updated weights for policy 0, policy_version 17192 (0.0024) [2024-06-27 14:53:08,816][06909] Updated weights for policy 0, policy_version 17202 (0.0025) [2024-06-27 14:53:08,850][06674] Fps is (10 sec: 44236.5, 60 sec: 42871.5, 300 sec: 43542.5). Total num frames: 281837568. Throughput: 0: 43366.2. Samples: 184806080. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2024-06-27 14:53:08,850][06674] Avg episode reward: [(0, '0.357')] [2024-06-27 14:53:12,455][06909] Updated weights for policy 0, policy_version 17212 (0.0038) [2024-06-27 14:53:13,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.7, 300 sec: 43431.5). Total num frames: 282050560. Throughput: 0: 43237.8. Samples: 184928220. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 14:53:13,850][06674] Avg episode reward: [(0, '0.362')] [2024-06-27 14:53:16,542][06909] Updated weights for policy 0, policy_version 17222 (0.0038) [2024-06-27 14:53:18,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43417.5, 300 sec: 43598.1). Total num frames: 282296320. Throughput: 0: 43276.3. Samples: 185187560. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 14:53:18,850][06674] Avg episode reward: [(0, '0.364')] [2024-06-27 14:53:18,855][06887] Saving new best policy, reward=0.364! [2024-06-27 14:53:20,145][06909] Updated weights for policy 0, policy_version 17232 (0.0027) [2024-06-27 14:53:23,850][06674] Fps is (10 sec: 42598.2, 60 sec: 42871.4, 300 sec: 43431.5). Total num frames: 282476544. Throughput: 0: 43265.7. Samples: 185458220. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 14:53:23,850][06674] Avg episode reward: [(0, '0.360')] [2024-06-27 14:53:24,034][06909] Updated weights for policy 0, policy_version 17242 (0.0031) [2024-06-27 14:53:27,593][06909] Updated weights for policy 0, policy_version 17252 (0.0022) [2024-06-27 14:53:28,850][06674] Fps is (10 sec: 40960.7, 60 sec: 43690.7, 300 sec: 43431.5). Total num frames: 282705920. Throughput: 0: 43329.0. Samples: 185583340. Policy #0 lag: (min: 0.0, avg: 11.4, max: 20.0) [2024-06-27 14:53:28,850][06674] Avg episode reward: [(0, '0.362')] [2024-06-27 14:53:31,475][06909] Updated weights for policy 0, policy_version 17262 (0.0043) [2024-06-27 14:53:33,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43417.6, 300 sec: 43542.6). Total num frames: 282935296. Throughput: 0: 43260.8. Samples: 185837660. Policy #0 lag: (min: 0.0, avg: 11.4, max: 20.0) [2024-06-27 14:53:33,850][06674] Avg episode reward: [(0, '0.362')] [2024-06-27 14:53:35,289][06909] Updated weights for policy 0, policy_version 17272 (0.0018) [2024-06-27 14:53:38,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43144.5, 300 sec: 43431.5). Total num frames: 283115520. Throughput: 0: 43160.8. Samples: 186104720. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-27 14:53:38,850][06674] Avg episode reward: [(0, '0.361')] [2024-06-27 14:53:39,075][06909] Updated weights for policy 0, policy_version 17282 (0.0035) [2024-06-27 14:53:42,865][06909] Updated weights for policy 0, policy_version 17292 (0.0032) [2024-06-27 14:53:43,852][06674] Fps is (10 sec: 42587.3, 60 sec: 43688.8, 300 sec: 43431.1). Total num frames: 283361280. Throughput: 0: 43243.7. Samples: 186227840. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-27 14:53:43,853][06674] Avg episode reward: [(0, '0.358')] [2024-06-27 14:53:46,555][06909] Updated weights for policy 0, policy_version 17302 (0.0024) [2024-06-27 14:53:46,926][06887] Signal inference workers to stop experience collection... (2650 times) [2024-06-27 14:53:46,954][06909] InferenceWorker_p0-w0: stopping experience collection (2650 times) [2024-06-27 14:53:47,040][06887] Signal inference workers to resume experience collection... (2650 times) [2024-06-27 14:53:47,040][06909] InferenceWorker_p0-w0: resuming experience collection (2650 times) [2024-06-27 14:53:48,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43144.4, 300 sec: 43542.8). Total num frames: 283574272. Throughput: 0: 43227.0. Samples: 186483240. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-27 14:53:48,850][06674] Avg episode reward: [(0, '0.367')] [2024-06-27 14:53:48,860][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000017308_283574272.pth... [2024-06-27 14:53:48,916][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000016672_273154048.pth [2024-06-27 14:53:48,925][06887] Saving new best policy, reward=0.367! [2024-06-27 14:53:50,405][06909] Updated weights for policy 0, policy_version 17312 (0.0031) [2024-06-27 14:53:53,850][06674] Fps is (10 sec: 40970.8, 60 sec: 43144.5, 300 sec: 43487.0). Total num frames: 283770880. Throughput: 0: 43151.1. Samples: 186747880. Policy #0 lag: (min: 1.0, avg: 9.7, max: 21.0) [2024-06-27 14:53:53,850][06674] Avg episode reward: [(0, '0.365')] [2024-06-27 14:53:54,398][06909] Updated weights for policy 0, policy_version 17322 (0.0033) [2024-06-27 14:53:58,142][06909] Updated weights for policy 0, policy_version 17332 (0.0036) [2024-06-27 14:53:58,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43417.6, 300 sec: 43376.3). Total num frames: 284000256. Throughput: 0: 43176.5. Samples: 186871160. Policy #0 lag: (min: 1.0, avg: 9.7, max: 21.0) [2024-06-27 14:53:58,850][06674] Avg episode reward: [(0, '0.363')] [2024-06-27 14:54:01,818][06909] Updated weights for policy 0, policy_version 17342 (0.0040) [2024-06-27 14:54:03,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43144.5, 300 sec: 43487.0). Total num frames: 284229632. Throughput: 0: 43244.6. Samples: 187133560. Policy #0 lag: (min: 0.0, avg: 11.7, max: 22.0) [2024-06-27 14:54:03,850][06674] Avg episode reward: [(0, '0.362')] [2024-06-27 14:54:05,646][06909] Updated weights for policy 0, policy_version 17352 (0.0033) [2024-06-27 14:54:08,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43144.6, 300 sec: 43431.5). Total num frames: 284426240. Throughput: 0: 43028.1. Samples: 187394480. Policy #0 lag: (min: 0.0, avg: 11.7, max: 22.0) [2024-06-27 14:54:08,850][06674] Avg episode reward: [(0, '0.361')] [2024-06-27 14:54:09,431][06909] Updated weights for policy 0, policy_version 17362 (0.0029) [2024-06-27 14:54:13,427][06909] Updated weights for policy 0, policy_version 17372 (0.0031) [2024-06-27 14:54:13,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43144.5, 300 sec: 43320.4). Total num frames: 284639232. Throughput: 0: 43041.3. Samples: 187520200. Policy #0 lag: (min: 0.0, avg: 11.7, max: 22.0) [2024-06-27 14:54:13,850][06674] Avg episode reward: [(0, '0.360')] [2024-06-27 14:54:17,073][06909] Updated weights for policy 0, policy_version 17382 (0.0034) [2024-06-27 14:54:18,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43144.7, 300 sec: 43487.0). Total num frames: 284884992. Throughput: 0: 43319.2. Samples: 187787020. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 14:54:18,850][06674] Avg episode reward: [(0, '0.364')] [2024-06-27 14:54:20,770][06909] Updated weights for policy 0, policy_version 17392 (0.0034) [2024-06-27 14:54:23,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43417.6, 300 sec: 43432.4). Total num frames: 285081600. Throughput: 0: 43130.6. Samples: 188045600. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 14:54:23,850][06674] Avg episode reward: [(0, '0.367')] [2024-06-27 14:54:24,710][06909] Updated weights for policy 0, policy_version 17402 (0.0034) [2024-06-27 14:54:28,310][06909] Updated weights for policy 0, policy_version 17412 (0.0034) [2024-06-27 14:54:28,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43144.5, 300 sec: 43320.4). Total num frames: 285294592. Throughput: 0: 43251.9. Samples: 188174060. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 14:54:28,850][06674] Avg episode reward: [(0, '0.366')] [2024-06-27 14:54:32,131][06909] Updated weights for policy 0, policy_version 17422 (0.0042) [2024-06-27 14:54:33,850][06674] Fps is (10 sec: 45875.9, 60 sec: 43417.7, 300 sec: 43542.6). Total num frames: 285540352. Throughput: 0: 43594.4. Samples: 188444980. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 14:54:33,850][06674] Avg episode reward: [(0, '0.353')] [2024-06-27 14:54:36,055][06909] Updated weights for policy 0, policy_version 17432 (0.0023) [2024-06-27 14:54:38,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.7, 300 sec: 43487.0). Total num frames: 285736960. Throughput: 0: 43365.4. Samples: 188699320. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 14:54:38,850][06674] Avg episode reward: [(0, '0.354')] [2024-06-27 14:54:39,555][06909] Updated weights for policy 0, policy_version 17442 (0.0049) [2024-06-27 14:54:43,503][06909] Updated weights for policy 0, policy_version 17452 (0.0039) [2024-06-27 14:54:43,850][06674] Fps is (10 sec: 39321.4, 60 sec: 42873.4, 300 sec: 43320.4). Total num frames: 285933568. Throughput: 0: 43517.3. Samples: 188829440. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-27 14:54:43,850][06674] Avg episode reward: [(0, '0.354')] [2024-06-27 14:54:46,981][06909] Updated weights for policy 0, policy_version 17462 (0.0032) [2024-06-27 14:54:48,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43690.7, 300 sec: 43487.0). Total num frames: 286195712. Throughput: 0: 43708.4. Samples: 189100440. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-27 14:54:48,850][06674] Avg episode reward: [(0, '0.354')] [2024-06-27 14:54:50,942][06909] Updated weights for policy 0, policy_version 17472 (0.0030) [2024-06-27 14:54:52,332][06887] Signal inference workers to stop experience collection... (2700 times) [2024-06-27 14:54:52,333][06887] Signal inference workers to resume experience collection... (2700 times) [2024-06-27 14:54:52,349][06909] InferenceWorker_p0-w0: stopping experience collection (2700 times) [2024-06-27 14:54:52,349][06909] InferenceWorker_p0-w0: resuming experience collection (2700 times) [2024-06-27 14:54:53,856][06674] Fps is (10 sec: 45848.4, 60 sec: 43686.4, 300 sec: 43430.9). Total num frames: 286392320. Throughput: 0: 43550.7. Samples: 189354520. Policy #0 lag: (min: 1.0, avg: 8.3, max: 20.0) [2024-06-27 14:54:53,856][06674] Avg episode reward: [(0, '0.346')] [2024-06-27 14:54:54,636][06909] Updated weights for policy 0, policy_version 17482 (0.0036) [2024-06-27 14:54:58,478][06909] Updated weights for policy 0, policy_version 17492 (0.0043) [2024-06-27 14:54:58,850][06674] Fps is (10 sec: 39322.0, 60 sec: 43144.5, 300 sec: 43320.4). Total num frames: 286588928. Throughput: 0: 43612.1. Samples: 189482740. Policy #0 lag: (min: 1.0, avg: 8.3, max: 20.0) [2024-06-27 14:54:58,850][06674] Avg episode reward: [(0, '0.341')] [2024-06-27 14:55:01,982][06909] Updated weights for policy 0, policy_version 17502 (0.0038) [2024-06-27 14:55:03,850][06674] Fps is (10 sec: 45902.3, 60 sec: 43690.7, 300 sec: 43487.0). Total num frames: 286851072. Throughput: 0: 43657.3. Samples: 189751600. Policy #0 lag: (min: 1.0, avg: 8.3, max: 20.0) [2024-06-27 14:55:03,850][06674] Avg episode reward: [(0, '0.341')] [2024-06-27 14:55:05,944][06909] Updated weights for policy 0, policy_version 17512 (0.0036) [2024-06-27 14:55:08,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43690.6, 300 sec: 43431.5). Total num frames: 287047680. Throughput: 0: 43725.8. Samples: 190013260. Policy #0 lag: (min: 0.0, avg: 12.3, max: 21.0) [2024-06-27 14:55:08,850][06674] Avg episode reward: [(0, '0.358')] [2024-06-27 14:55:09,451][06909] Updated weights for policy 0, policy_version 17522 (0.0025) [2024-06-27 14:55:13,443][06909] Updated weights for policy 0, policy_version 17532 (0.0043) [2024-06-27 14:55:13,850][06674] Fps is (10 sec: 39321.0, 60 sec: 43417.6, 300 sec: 43320.7). Total num frames: 287244288. Throughput: 0: 43531.0. Samples: 190132960. Policy #0 lag: (min: 0.0, avg: 12.3, max: 21.0) [2024-06-27 14:55:13,850][06674] Avg episode reward: [(0, '0.361')] [2024-06-27 14:55:16,988][06909] Updated weights for policy 0, policy_version 17542 (0.0027) [2024-06-27 14:55:18,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 287490048. Throughput: 0: 43350.7. Samples: 190395760. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 14:55:18,850][06674] Avg episode reward: [(0, '0.358')] [2024-06-27 14:55:21,022][06909] Updated weights for policy 0, policy_version 17552 (0.0043) [2024-06-27 14:55:23,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43144.5, 300 sec: 43375.9). Total num frames: 287670272. Throughput: 0: 43568.7. Samples: 190659920. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 14:55:23,851][06674] Avg episode reward: [(0, '0.358')] [2024-06-27 14:55:24,510][06909] Updated weights for policy 0, policy_version 17562 (0.0057) [2024-06-27 14:55:28,505][06909] Updated weights for policy 0, policy_version 17572 (0.0036) [2024-06-27 14:55:28,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43417.6, 300 sec: 43375.9). Total num frames: 287899648. Throughput: 0: 43431.1. Samples: 190783840. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 14:55:28,850][06674] Avg episode reward: [(0, '0.361')] [2024-06-27 14:55:32,385][06909] Updated weights for policy 0, policy_version 17582 (0.0033) [2024-06-27 14:55:33,850][06674] Fps is (10 sec: 47514.2, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 288145408. Throughput: 0: 43254.3. Samples: 191046880. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 14:55:33,850][06674] Avg episode reward: [(0, '0.366')] [2024-06-27 14:55:36,207][06909] Updated weights for policy 0, policy_version 17592 (0.0035) [2024-06-27 14:55:38,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43417.5, 300 sec: 43431.7). Total num frames: 288342016. Throughput: 0: 43391.3. Samples: 191306880. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 14:55:38,850][06674] Avg episode reward: [(0, '0.352')] [2024-06-27 14:55:39,963][06909] Updated weights for policy 0, policy_version 17602 (0.0036) [2024-06-27 14:55:43,675][06909] Updated weights for policy 0, policy_version 17612 (0.0032) [2024-06-27 14:55:43,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43690.7, 300 sec: 43376.0). Total num frames: 288555008. Throughput: 0: 43253.8. Samples: 191429160. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-27 14:55:43,850][06674] Avg episode reward: [(0, '0.361')] [2024-06-27 14:55:47,474][06909] Updated weights for policy 0, policy_version 17622 (0.0035) [2024-06-27 14:55:48,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43417.5, 300 sec: 43431.5). Total num frames: 288800768. Throughput: 0: 43216.2. Samples: 191696340. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-27 14:55:48,851][06674] Avg episode reward: [(0, '0.365')] [2024-06-27 14:55:48,856][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000017627_288800768.pth... [2024-06-27 14:55:48,915][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000016991_278380544.pth [2024-06-27 14:55:51,096][06909] Updated weights for policy 0, policy_version 17632 (0.0038) [2024-06-27 14:55:53,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43148.7, 300 sec: 43375.9). Total num frames: 288980992. Throughput: 0: 43039.5. Samples: 191950040. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-27 14:55:53,850][06674] Avg episode reward: [(0, '0.358')] [2024-06-27 14:55:55,315][06909] Updated weights for policy 0, policy_version 17642 (0.0042) [2024-06-27 14:55:58,850][06674] Fps is (10 sec: 39322.1, 60 sec: 43417.6, 300 sec: 43320.4). Total num frames: 289193984. Throughput: 0: 42988.5. Samples: 192067440. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 14:55:58,850][06674] Avg episode reward: [(0, '0.362')] [2024-06-27 14:55:58,880][06909] Updated weights for policy 0, policy_version 17652 (0.0033) [2024-06-27 14:56:02,841][06909] Updated weights for policy 0, policy_version 17662 (0.0022) [2024-06-27 14:56:03,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43144.5, 300 sec: 43431.5). Total num frames: 289439744. Throughput: 0: 43161.8. Samples: 192338040. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 14:56:03,850][06674] Avg episode reward: [(0, '0.358')] [2024-06-27 14:56:06,414][06909] Updated weights for policy 0, policy_version 17672 (0.0035) [2024-06-27 14:56:08,850][06674] Fps is (10 sec: 42598.4, 60 sec: 42871.5, 300 sec: 43320.4). Total num frames: 289619968. Throughput: 0: 43029.4. Samples: 192596240. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 14:56:08,850][06674] Avg episode reward: [(0, '0.357')] [2024-06-27 14:56:10,374][06909] Updated weights for policy 0, policy_version 17682 (0.0030) [2024-06-27 14:56:10,791][06887] Signal inference workers to stop experience collection... (2750 times) [2024-06-27 14:56:10,791][06887] Signal inference workers to resume experience collection... (2750 times) [2024-06-27 14:56:10,830][06909] InferenceWorker_p0-w0: stopping experience collection (2750 times) [2024-06-27 14:56:10,830][06909] InferenceWorker_p0-w0: resuming experience collection (2750 times) [2024-06-27 14:56:13,852][06674] Fps is (10 sec: 40951.6, 60 sec: 43416.2, 300 sec: 43320.1). Total num frames: 289849344. Throughput: 0: 43071.5. Samples: 192722140. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 14:56:13,852][06674] Avg episode reward: [(0, '0.361')] [2024-06-27 14:56:14,393][06909] Updated weights for policy 0, policy_version 17692 (0.0027) [2024-06-27 14:56:18,191][06909] Updated weights for policy 0, policy_version 17702 (0.0036) [2024-06-27 14:56:18,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43144.5, 300 sec: 43375.9). Total num frames: 290078720. Throughput: 0: 43145.3. Samples: 192988420. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 14:56:18,850][06674] Avg episode reward: [(0, '0.366')] [2024-06-27 14:56:21,830][06909] Updated weights for policy 0, policy_version 17712 (0.0024) [2024-06-27 14:56:23,850][06674] Fps is (10 sec: 44245.5, 60 sec: 43690.7, 300 sec: 43376.0). Total num frames: 290291712. Throughput: 0: 42987.6. Samples: 193241320. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 14:56:23,851][06674] Avg episode reward: [(0, '0.366')] [2024-06-27 14:56:25,535][06909] Updated weights for policy 0, policy_version 17722 (0.0031) [2024-06-27 14:56:28,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43144.6, 300 sec: 43264.9). Total num frames: 290488320. Throughput: 0: 43174.7. Samples: 193372020. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 14:56:28,850][06674] Avg episode reward: [(0, '0.364')] [2024-06-27 14:56:29,346][06909] Updated weights for policy 0, policy_version 17732 (0.0031) [2024-06-27 14:56:33,027][06909] Updated weights for policy 0, policy_version 17742 (0.0034) [2024-06-27 14:56:33,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43144.5, 300 sec: 43320.4). Total num frames: 290734080. Throughput: 0: 43271.3. Samples: 193643540. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 14:56:33,850][06674] Avg episode reward: [(0, '0.367')] [2024-06-27 14:56:36,896][06909] Updated weights for policy 0, policy_version 17752 (0.0031) [2024-06-27 14:56:38,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43417.7, 300 sec: 43487.0). Total num frames: 290947072. Throughput: 0: 43319.2. Samples: 193899400. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 14:56:38,850][06674] Avg episode reward: [(0, '0.364')] [2024-06-27 14:56:40,512][06909] Updated weights for policy 0, policy_version 17762 (0.0031) [2024-06-27 14:56:43,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43144.6, 300 sec: 43264.9). Total num frames: 291143680. Throughput: 0: 43581.4. Samples: 194028600. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 14:56:43,850][06674] Avg episode reward: [(0, '0.365')] [2024-06-27 14:56:44,271][06909] Updated weights for policy 0, policy_version 17772 (0.0034) [2024-06-27 14:56:47,895][06909] Updated weights for policy 0, policy_version 17782 (0.0032) [2024-06-27 14:56:48,850][06674] Fps is (10 sec: 42597.9, 60 sec: 42871.5, 300 sec: 43209.3). Total num frames: 291373056. Throughput: 0: 43494.1. Samples: 194295280. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 14:56:48,850][06674] Avg episode reward: [(0, '0.363')] [2024-06-27 14:56:51,804][06909] Updated weights for policy 0, policy_version 17792 (0.0036) [2024-06-27 14:56:53,850][06674] Fps is (10 sec: 44236.0, 60 sec: 43417.5, 300 sec: 43375.9). Total num frames: 291586048. Throughput: 0: 43485.2. Samples: 194553080. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 14:56:53,850][06674] Avg episode reward: [(0, '0.362')] [2024-06-27 14:56:55,617][06909] Updated weights for policy 0, policy_version 17802 (0.0034) [2024-06-27 14:56:58,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43417.6, 300 sec: 43320.4). Total num frames: 291799040. Throughput: 0: 43477.0. Samples: 194678520. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 14:56:58,850][06674] Avg episode reward: [(0, '0.355')] [2024-06-27 14:56:59,254][06909] Updated weights for policy 0, policy_version 17812 (0.0044) [2024-06-27 14:57:03,050][06909] Updated weights for policy 0, policy_version 17822 (0.0038) [2024-06-27 14:57:03,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43144.5, 300 sec: 43264.9). Total num frames: 292028416. Throughput: 0: 43397.4. Samples: 194941300. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 14:57:03,850][06674] Avg episode reward: [(0, '0.366')] [2024-06-27 14:57:06,765][06909] Updated weights for policy 0, policy_version 17832 (0.0038) [2024-06-27 14:57:08,852][06674] Fps is (10 sec: 44227.9, 60 sec: 43689.2, 300 sec: 43431.2). Total num frames: 292241408. Throughput: 0: 43493.6. Samples: 195198620. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 14:57:08,852][06674] Avg episode reward: [(0, '0.360')] [2024-06-27 14:57:10,594][06909] Updated weights for policy 0, policy_version 17842 (0.0032) [2024-06-27 14:57:13,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43146.0, 300 sec: 43209.4). Total num frames: 292438016. Throughput: 0: 43565.4. Samples: 195332460. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-27 14:57:13,850][06674] Avg episode reward: [(0, '0.358')] [2024-06-27 14:57:14,625][06909] Updated weights for policy 0, policy_version 17852 (0.0039) [2024-06-27 14:57:18,066][06909] Updated weights for policy 0, policy_version 17862 (0.0043) [2024-06-27 14:57:18,850][06674] Fps is (10 sec: 42606.8, 60 sec: 43144.5, 300 sec: 43264.9). Total num frames: 292667392. Throughput: 0: 43295.5. Samples: 195591840. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-27 14:57:18,850][06674] Avg episode reward: [(0, '0.363')] [2024-06-27 14:57:22,205][06909] Updated weights for policy 0, policy_version 17872 (0.0031) [2024-06-27 14:57:23,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 292896768. Throughput: 0: 43252.8. Samples: 195845780. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-27 14:57:23,850][06674] Avg episode reward: [(0, '0.363')] [2024-06-27 14:57:25,639][06909] Updated weights for policy 0, policy_version 17882 (0.0040) [2024-06-27 14:57:28,850][06674] Fps is (10 sec: 40960.7, 60 sec: 43144.6, 300 sec: 43209.3). Total num frames: 293076992. Throughput: 0: 43350.2. Samples: 195979360. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 14:57:28,850][06674] Avg episode reward: [(0, '0.356')] [2024-06-27 14:57:29,686][06909] Updated weights for policy 0, policy_version 17892 (0.0028) [2024-06-27 14:57:33,087][06909] Updated weights for policy 0, policy_version 17902 (0.0032) [2024-06-27 14:57:33,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43144.5, 300 sec: 43375.9). Total num frames: 293322752. Throughput: 0: 43300.4. Samples: 196243800. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 14:57:33,850][06674] Avg episode reward: [(0, '0.356')] [2024-06-27 14:57:37,355][06909] Updated weights for policy 0, policy_version 17912 (0.0042) [2024-06-27 14:57:37,963][06887] Signal inference workers to stop experience collection... (2800 times) [2024-06-27 14:57:37,963][06887] Signal inference workers to resume experience collection... (2800 times) [2024-06-27 14:57:37,992][06909] InferenceWorker_p0-w0: stopping experience collection (2800 times) [2024-06-27 14:57:37,993][06909] InferenceWorker_p0-w0: resuming experience collection (2800 times) [2024-06-27 14:57:38,850][06674] Fps is (10 sec: 47513.2, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 293552128. Throughput: 0: 43185.5. Samples: 196496420. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 14:57:38,850][06674] Avg episode reward: [(0, '0.365')] [2024-06-27 14:57:40,717][06909] Updated weights for policy 0, policy_version 17922 (0.0033) [2024-06-27 14:57:43,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43144.5, 300 sec: 43209.3). Total num frames: 293732352. Throughput: 0: 43438.2. Samples: 196633240. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 14:57:43,850][06674] Avg episode reward: [(0, '0.362')] [2024-06-27 14:57:44,780][06909] Updated weights for policy 0, policy_version 17932 (0.0035) [2024-06-27 14:57:48,266][06909] Updated weights for policy 0, policy_version 17942 (0.0028) [2024-06-27 14:57:48,852][06674] Fps is (10 sec: 42589.7, 60 sec: 43416.2, 300 sec: 43375.6). Total num frames: 293978112. Throughput: 0: 43429.6. Samples: 196895720. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 14:57:48,852][06674] Avg episode reward: [(0, '0.364')] [2024-06-27 14:57:48,861][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000017943_293978112.pth... [2024-06-27 14:57:48,943][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000017308_283574272.pth [2024-06-27 14:57:52,382][06909] Updated weights for policy 0, policy_version 17952 (0.0037) [2024-06-27 14:57:53,850][06674] Fps is (10 sec: 49152.2, 60 sec: 43963.9, 300 sec: 43487.0). Total num frames: 294223872. Throughput: 0: 43418.5. Samples: 197152360. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-27 14:57:53,850][06674] Avg episode reward: [(0, '0.367')] [2024-06-27 14:57:55,776][06909] Updated weights for policy 0, policy_version 17962 (0.0033) [2024-06-27 14:57:58,852][06674] Fps is (10 sec: 39321.7, 60 sec: 42870.1, 300 sec: 43153.5). Total num frames: 294371328. Throughput: 0: 43254.9. Samples: 197279020. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-27 14:57:58,852][06674] Avg episode reward: [(0, '0.367')] [2024-06-27 14:57:59,896][06909] Updated weights for policy 0, policy_version 17972 (0.0032) [2024-06-27 14:58:03,436][06909] Updated weights for policy 0, policy_version 17982 (0.0041) [2024-06-27 14:58:03,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43417.5, 300 sec: 43375.9). Total num frames: 294633472. Throughput: 0: 43334.2. Samples: 197541880. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-27 14:58:03,850][06674] Avg episode reward: [(0, '0.362')] [2024-06-27 14:58:07,511][06909] Updated weights for policy 0, policy_version 17992 (0.0041) [2024-06-27 14:58:08,850][06674] Fps is (10 sec: 49161.6, 60 sec: 43692.1, 300 sec: 43431.5). Total num frames: 294862848. Throughput: 0: 43474.7. Samples: 197802140. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-27 14:58:08,850][06674] Avg episode reward: [(0, '0.365')] [2024-06-27 14:58:10,918][06909] Updated weights for policy 0, policy_version 18002 (0.0033) [2024-06-27 14:58:13,850][06674] Fps is (10 sec: 39321.7, 60 sec: 43144.4, 300 sec: 43153.8). Total num frames: 295026688. Throughput: 0: 43327.9. Samples: 197929120. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-27 14:58:13,850][06674] Avg episode reward: [(0, '0.370')] [2024-06-27 14:58:13,851][06887] Saving new best policy, reward=0.370! [2024-06-27 14:58:14,984][06909] Updated weights for policy 0, policy_version 18012 (0.0034) [2024-06-27 14:58:18,379][06909] Updated weights for policy 0, policy_version 18022 (0.0039) [2024-06-27 14:58:18,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.7, 300 sec: 43431.5). Total num frames: 295288832. Throughput: 0: 43358.7. Samples: 198194940. Policy #0 lag: (min: 0.0, avg: 11.9, max: 20.0) [2024-06-27 14:58:18,850][06674] Avg episode reward: [(0, '0.367')] [2024-06-27 14:58:22,577][06909] Updated weights for policy 0, policy_version 18032 (0.0029) [2024-06-27 14:58:23,850][06674] Fps is (10 sec: 49152.1, 60 sec: 43690.7, 300 sec: 43431.5). Total num frames: 295518208. Throughput: 0: 43457.3. Samples: 198452000. Policy #0 lag: (min: 0.0, avg: 11.9, max: 20.0) [2024-06-27 14:58:23,850][06674] Avg episode reward: [(0, '0.368')] [2024-06-27 14:58:25,953][06909] Updated weights for policy 0, policy_version 18042 (0.0026) [2024-06-27 14:58:28,850][06674] Fps is (10 sec: 37683.3, 60 sec: 43144.5, 300 sec: 43153.8). Total num frames: 295665664. Throughput: 0: 43368.9. Samples: 198584840. Policy #0 lag: (min: 0.0, avg: 11.9, max: 20.0) [2024-06-27 14:58:28,850][06674] Avg episode reward: [(0, '0.364')] [2024-06-27 14:58:30,052][06909] Updated weights for policy 0, policy_version 18052 (0.0035) [2024-06-27 14:58:33,499][06909] Updated weights for policy 0, policy_version 18062 (0.0040) [2024-06-27 14:58:33,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 295927808. Throughput: 0: 43230.3. Samples: 198841000. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2024-06-27 14:58:33,851][06674] Avg episode reward: [(0, '0.368')] [2024-06-27 14:58:37,812][06909] Updated weights for policy 0, policy_version 18072 (0.0033) [2024-06-27 14:58:38,850][06674] Fps is (10 sec: 49151.6, 60 sec: 43417.5, 300 sec: 43376.3). Total num frames: 296157184. Throughput: 0: 43340.8. Samples: 199102700. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2024-06-27 14:58:38,850][06674] Avg episode reward: [(0, '0.360')] [2024-06-27 14:58:41,179][06909] Updated weights for policy 0, policy_version 18082 (0.0028) [2024-06-27 14:58:43,850][06674] Fps is (10 sec: 39321.6, 60 sec: 43144.4, 300 sec: 43209.3). Total num frames: 296321024. Throughput: 0: 43397.4. Samples: 199231820. Policy #0 lag: (min: 0.0, avg: 12.0, max: 21.0) [2024-06-27 14:58:43,856][06674] Avg episode reward: [(0, '0.365')] [2024-06-27 14:58:45,261][06909] Updated weights for policy 0, policy_version 18092 (0.0039) [2024-06-27 14:58:48,619][06909] Updated weights for policy 0, policy_version 18102 (0.0041) [2024-06-27 14:58:48,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43419.0, 300 sec: 43431.5). Total num frames: 296583168. Throughput: 0: 43330.2. Samples: 199491740. Policy #0 lag: (min: 0.0, avg: 12.0, max: 21.0) [2024-06-27 14:58:48,850][06674] Avg episode reward: [(0, '0.366')] [2024-06-27 14:58:51,868][06887] Signal inference workers to stop experience collection... (2850 times) [2024-06-27 14:58:51,868][06887] Signal inference workers to resume experience collection... (2850 times) [2024-06-27 14:58:51,900][06909] InferenceWorker_p0-w0: stopping experience collection (2850 times) [2024-06-27 14:58:51,900][06909] InferenceWorker_p0-w0: resuming experience collection (2850 times) [2024-06-27 14:58:53,026][06909] Updated weights for policy 0, policy_version 18112 (0.0026) [2024-06-27 14:58:53,852][06674] Fps is (10 sec: 47504.5, 60 sec: 42870.0, 300 sec: 43375.6). Total num frames: 296796160. Throughput: 0: 43383.0. Samples: 199754460. Policy #0 lag: (min: 0.0, avg: 12.0, max: 21.0) [2024-06-27 14:58:53,853][06674] Avg episode reward: [(0, '0.371')] [2024-06-27 14:58:56,061][06909] Updated weights for policy 0, policy_version 18122 (0.0025) [2024-06-27 14:58:58,850][06674] Fps is (10 sec: 39322.3, 60 sec: 43419.1, 300 sec: 43209.3). Total num frames: 296976384. Throughput: 0: 43342.4. Samples: 199879520. Policy #0 lag: (min: 0.0, avg: 10.2, max: 19.0) [2024-06-27 14:58:58,850][06674] Avg episode reward: [(0, '0.369')] [2024-06-27 14:59:00,562][06909] Updated weights for policy 0, policy_version 18132 (0.0031) [2024-06-27 14:59:03,639][06909] Updated weights for policy 0, policy_version 18142 (0.0032) [2024-06-27 14:59:03,850][06674] Fps is (10 sec: 44245.6, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 297238528. Throughput: 0: 43318.6. Samples: 200144280. Policy #0 lag: (min: 0.0, avg: 10.2, max: 19.0) [2024-06-27 14:59:03,851][06674] Avg episode reward: [(0, '0.363')] [2024-06-27 14:59:07,930][06909] Updated weights for policy 0, policy_version 18152 (0.0024) [2024-06-27 14:59:08,850][06674] Fps is (10 sec: 45874.9, 60 sec: 42871.5, 300 sec: 43376.0). Total num frames: 297435136. Throughput: 0: 43374.7. Samples: 200403860. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-27 14:59:08,850][06674] Avg episode reward: [(0, '0.365')] [2024-06-27 14:59:11,174][06909] Updated weights for policy 0, policy_version 18162 (0.0030) [2024-06-27 14:59:13,850][06674] Fps is (10 sec: 39322.3, 60 sec: 43417.7, 300 sec: 43209.3). Total num frames: 297631744. Throughput: 0: 43234.3. Samples: 200530380. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-27 14:59:13,850][06674] Avg episode reward: [(0, '0.368')] [2024-06-27 14:59:15,400][06909] Updated weights for policy 0, policy_version 18172 (0.0032) [2024-06-27 14:59:18,773][06909] Updated weights for policy 0, policy_version 18182 (0.0046) [2024-06-27 14:59:18,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43417.5, 300 sec: 43431.5). Total num frames: 297893888. Throughput: 0: 43470.3. Samples: 200797160. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-27 14:59:18,850][06674] Avg episode reward: [(0, '0.363')] [2024-06-27 14:59:22,845][06909] Updated weights for policy 0, policy_version 18192 (0.0038) [2024-06-27 14:59:23,850][06674] Fps is (10 sec: 45875.0, 60 sec: 42871.5, 300 sec: 43376.0). Total num frames: 298090496. Throughput: 0: 43510.4. Samples: 201060660. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 14:59:23,850][06674] Avg episode reward: [(0, '0.362')] [2024-06-27 14:59:26,391][06909] Updated weights for policy 0, policy_version 18202 (0.0032) [2024-06-27 14:59:28,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43963.7, 300 sec: 43264.9). Total num frames: 298303488. Throughput: 0: 43446.3. Samples: 201186900. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 14:59:28,850][06674] Avg episode reward: [(0, '0.368')] [2024-06-27 14:59:30,456][06909] Updated weights for policy 0, policy_version 18212 (0.0033) [2024-06-27 14:59:33,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43417.7, 300 sec: 43375.9). Total num frames: 298532864. Throughput: 0: 43524.6. Samples: 201450340. Policy #0 lag: (min: 1.0, avg: 10.4, max: 22.0) [2024-06-27 14:59:33,850][06674] Avg episode reward: [(0, '0.369')] [2024-06-27 14:59:33,874][06909] Updated weights for policy 0, policy_version 18222 (0.0029) [2024-06-27 14:59:37,857][06909] Updated weights for policy 0, policy_version 18232 (0.0028) [2024-06-27 14:59:38,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43144.6, 300 sec: 43431.5). Total num frames: 298745856. Throughput: 0: 43547.7. Samples: 201714020. Policy #0 lag: (min: 1.0, avg: 10.4, max: 22.0) [2024-06-27 14:59:38,850][06674] Avg episode reward: [(0, '0.366')] [2024-06-27 14:59:41,270][06909] Updated weights for policy 0, policy_version 18242 (0.0036) [2024-06-27 14:59:43,852][06674] Fps is (10 sec: 42589.4, 60 sec: 43962.3, 300 sec: 43264.6). Total num frames: 298958848. Throughput: 0: 43481.0. Samples: 201836260. Policy #0 lag: (min: 1.0, avg: 10.4, max: 22.0) [2024-06-27 14:59:43,852][06674] Avg episode reward: [(0, '0.366')] [2024-06-27 14:59:45,359][06909] Updated weights for policy 0, policy_version 18252 (0.0028) [2024-06-27 14:59:48,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43417.6, 300 sec: 43376.8). Total num frames: 299188224. Throughput: 0: 43391.1. Samples: 202096880. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-27 14:59:48,850][06674] Avg episode reward: [(0, '0.360')] [2024-06-27 14:59:48,866][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000018261_299188224.pth... [2024-06-27 14:59:48,917][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000017627_288800768.pth [2024-06-27 14:59:49,175][06909] Updated weights for policy 0, policy_version 18262 (0.0044) [2024-06-27 14:59:52,936][06909] Updated weights for policy 0, policy_version 18272 (0.0043) [2024-06-27 14:59:53,850][06674] Fps is (10 sec: 42607.3, 60 sec: 43146.0, 300 sec: 43375.9). Total num frames: 299384832. Throughput: 0: 43402.2. Samples: 202356960. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-27 14:59:53,850][06674] Avg episode reward: [(0, '0.359')] [2024-06-27 14:59:56,724][06909] Updated weights for policy 0, policy_version 18282 (0.0040) [2024-06-27 14:59:58,855][06674] Fps is (10 sec: 40937.5, 60 sec: 43686.5, 300 sec: 43208.5). Total num frames: 299597824. Throughput: 0: 43553.1. Samples: 202490520. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-27 14:59:58,856][06674] Avg episode reward: [(0, '0.366')] [2024-06-27 15:00:00,459][06909] Updated weights for policy 0, policy_version 18292 (0.0033) [2024-06-27 15:00:03,850][06674] Fps is (10 sec: 42598.3, 60 sec: 42871.5, 300 sec: 43264.9). Total num frames: 299810816. Throughput: 0: 43371.6. Samples: 202748880. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-27 15:00:03,850][06674] Avg episode reward: [(0, '0.369')] [2024-06-27 15:00:04,342][06909] Updated weights for policy 0, policy_version 18302 (0.0034) [2024-06-27 15:00:08,402][06909] Updated weights for policy 0, policy_version 18312 (0.0034) [2024-06-27 15:00:08,850][06674] Fps is (10 sec: 44261.8, 60 sec: 43417.6, 300 sec: 43376.0). Total num frames: 300040192. Throughput: 0: 43209.3. Samples: 203005080. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-27 15:00:08,850][06674] Avg episode reward: [(0, '0.366')] [2024-06-27 15:00:11,836][06909] Updated weights for policy 0, policy_version 18322 (0.0035) [2024-06-27 15:00:13,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.6, 300 sec: 43264.9). Total num frames: 300253184. Throughput: 0: 43338.7. Samples: 203137140. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-27 15:00:13,851][06674] Avg episode reward: [(0, '0.370')] [2024-06-27 15:00:15,935][06909] Updated weights for policy 0, policy_version 18332 (0.0040) [2024-06-27 15:00:18,850][06674] Fps is (10 sec: 42598.6, 60 sec: 42871.6, 300 sec: 43376.0). Total num frames: 300466176. Throughput: 0: 43132.1. Samples: 203391280. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-27 15:00:18,850][06674] Avg episode reward: [(0, '0.372')] [2024-06-27 15:00:18,860][06887] Saving new best policy, reward=0.372! [2024-06-27 15:00:19,460][06909] Updated weights for policy 0, policy_version 18342 (0.0032) [2024-06-27 15:00:23,420][06909] Updated weights for policy 0, policy_version 18352 (0.0043) [2024-06-27 15:00:23,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43417.6, 300 sec: 43376.0). Total num frames: 300695552. Throughput: 0: 43052.5. Samples: 203651380. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-27 15:00:23,850][06674] Avg episode reward: [(0, '0.368')] [2024-06-27 15:00:26,580][06887] Signal inference workers to stop experience collection... (2900 times) [2024-06-27 15:00:26,580][06887] Signal inference workers to resume experience collection... (2900 times) [2024-06-27 15:00:26,606][06909] InferenceWorker_p0-w0: stopping experience collection (2900 times) [2024-06-27 15:00:26,606][06909] InferenceWorker_p0-w0: resuming experience collection (2900 times) [2024-06-27 15:00:26,944][06909] Updated weights for policy 0, policy_version 18362 (0.0022) [2024-06-27 15:00:28,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43144.5, 300 sec: 43209.3). Total num frames: 300892160. Throughput: 0: 43357.1. Samples: 203787240. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 15:00:28,850][06674] Avg episode reward: [(0, '0.365')] [2024-06-27 15:00:30,816][06909] Updated weights for policy 0, policy_version 18372 (0.0041) [2024-06-27 15:00:33,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43144.5, 300 sec: 43320.4). Total num frames: 301121536. Throughput: 0: 43262.7. Samples: 204043700. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 15:00:33,850][06674] Avg episode reward: [(0, '0.372')] [2024-06-27 15:00:34,462][06909] Updated weights for policy 0, policy_version 18382 (0.0040) [2024-06-27 15:00:38,304][06909] Updated weights for policy 0, policy_version 18392 (0.0040) [2024-06-27 15:00:38,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43417.6, 300 sec: 43375.9). Total num frames: 301350912. Throughput: 0: 43231.5. Samples: 204302380. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-27 15:00:38,850][06674] Avg episode reward: [(0, '0.373')] [2024-06-27 15:00:41,981][06909] Updated weights for policy 0, policy_version 18402 (0.0033) [2024-06-27 15:00:43,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43146.0, 300 sec: 43209.3). Total num frames: 301547520. Throughput: 0: 43228.5. Samples: 204435560. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-27 15:00:43,850][06674] Avg episode reward: [(0, '0.373')] [2024-06-27 15:00:43,851][06887] Saving new best policy, reward=0.373! [2024-06-27 15:00:45,789][06909] Updated weights for policy 0, policy_version 18412 (0.0033) [2024-06-27 15:00:48,850][06674] Fps is (10 sec: 40960.2, 60 sec: 42871.5, 300 sec: 43320.4). Total num frames: 301760512. Throughput: 0: 43173.3. Samples: 204691680. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-27 15:00:48,850][06674] Avg episode reward: [(0, '0.371')] [2024-06-27 15:00:49,528][06909] Updated weights for policy 0, policy_version 18422 (0.0026) [2024-06-27 15:00:53,454][06909] Updated weights for policy 0, policy_version 18432 (0.0026) [2024-06-27 15:00:53,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43417.6, 300 sec: 43376.0). Total num frames: 301989888. Throughput: 0: 43238.7. Samples: 204950820. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 15:00:53,850][06674] Avg episode reward: [(0, '0.367')] [2024-06-27 15:00:57,251][06909] Updated weights for policy 0, policy_version 18442 (0.0038) [2024-06-27 15:00:58,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43148.6, 300 sec: 43209.3). Total num frames: 302186496. Throughput: 0: 43349.7. Samples: 205087880. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 15:00:58,850][06674] Avg episode reward: [(0, '0.372')] [2024-06-27 15:01:00,928][06909] Updated weights for policy 0, policy_version 18452 (0.0028) [2024-06-27 15:01:03,850][06674] Fps is (10 sec: 42597.7, 60 sec: 43417.5, 300 sec: 43375.9). Total num frames: 302415872. Throughput: 0: 43326.5. Samples: 205340980. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 15:01:03,853][06674] Avg episode reward: [(0, '0.377')] [2024-06-27 15:01:03,854][06887] Saving new best policy, reward=0.377! [2024-06-27 15:01:04,718][06909] Updated weights for policy 0, policy_version 18462 (0.0046) [2024-06-27 15:01:08,385][06909] Updated weights for policy 0, policy_version 18472 (0.0032) [2024-06-27 15:01:08,850][06674] Fps is (10 sec: 47513.8, 60 sec: 43690.7, 300 sec: 43431.8). Total num frames: 302661632. Throughput: 0: 43295.6. Samples: 205599680. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 15:01:08,850][06674] Avg episode reward: [(0, '0.375')] [2024-06-27 15:01:12,290][06909] Updated weights for policy 0, policy_version 18482 (0.0038) [2024-06-27 15:01:13,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43144.5, 300 sec: 43264.9). Total num frames: 302841856. Throughput: 0: 43300.5. Samples: 205735760. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 15:01:13,850][06674] Avg episode reward: [(0, '0.372')] [2024-06-27 15:01:16,088][06909] Updated weights for policy 0, policy_version 18492 (0.0045) [2024-06-27 15:01:18,850][06674] Fps is (10 sec: 39321.3, 60 sec: 43144.4, 300 sec: 43264.9). Total num frames: 303054848. Throughput: 0: 43252.0. Samples: 205990040. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 15:01:18,851][06674] Avg episode reward: [(0, '0.370')] [2024-06-27 15:01:19,974][06909] Updated weights for policy 0, policy_version 18502 (0.0039) [2024-06-27 15:01:23,658][06909] Updated weights for policy 0, policy_version 18512 (0.0027) [2024-06-27 15:01:23,852][06674] Fps is (10 sec: 45865.9, 60 sec: 43416.1, 300 sec: 43431.2). Total num frames: 303300608. Throughput: 0: 43251.0. Samples: 206248760. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 15:01:23,852][06674] Avg episode reward: [(0, '0.370')] [2024-06-27 15:01:27,633][06909] Updated weights for policy 0, policy_version 18522 (0.0041) [2024-06-27 15:01:28,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43144.6, 300 sec: 43209.3). Total num frames: 303480832. Throughput: 0: 43159.6. Samples: 206377740. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 15:01:28,850][06674] Avg episode reward: [(0, '0.373')] [2024-06-27 15:01:31,571][06909] Updated weights for policy 0, policy_version 18532 (0.0027) [2024-06-27 15:01:33,850][06674] Fps is (10 sec: 40967.8, 60 sec: 43144.5, 300 sec: 43264.8). Total num frames: 303710208. Throughput: 0: 43095.0. Samples: 206630960. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 15:01:33,851][06674] Avg episode reward: [(0, '0.369')] [2024-06-27 15:01:35,190][06909] Updated weights for policy 0, policy_version 18542 (0.0029) [2024-06-27 15:01:38,850][06674] Fps is (10 sec: 44236.8, 60 sec: 42871.5, 300 sec: 43320.4). Total num frames: 303923200. Throughput: 0: 43201.3. Samples: 206894880. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 15:01:38,850][06674] Avg episode reward: [(0, '0.367')] [2024-06-27 15:01:39,133][06909] Updated weights for policy 0, policy_version 18552 (0.0037) [2024-06-27 15:01:42,721][06909] Updated weights for policy 0, policy_version 18562 (0.0028) [2024-06-27 15:01:43,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43144.5, 300 sec: 43264.9). Total num frames: 304136192. Throughput: 0: 43084.8. Samples: 207026700. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 15:01:43,850][06674] Avg episode reward: [(0, '0.368')] [2024-06-27 15:01:46,600][06909] Updated weights for policy 0, policy_version 18572 (0.0031) [2024-06-27 15:01:48,856][06674] Fps is (10 sec: 42572.4, 60 sec: 43140.2, 300 sec: 43264.0). Total num frames: 304349184. Throughput: 0: 43074.3. Samples: 207279580. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 15:01:48,856][06674] Avg episode reward: [(0, '0.368')] [2024-06-27 15:01:48,888][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000018577_304365568.pth... [2024-06-27 15:01:48,940][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000017943_293978112.pth [2024-06-27 15:01:50,196][06909] Updated weights for policy 0, policy_version 18582 (0.0037) [2024-06-27 15:01:53,820][06887] Signal inference workers to stop experience collection... (2950 times) [2024-06-27 15:01:53,821][06887] Signal inference workers to resume experience collection... (2950 times) [2024-06-27 15:01:53,844][06909] InferenceWorker_p0-w0: stopping experience collection (2950 times) [2024-06-27 15:01:53,844][06909] InferenceWorker_p0-w0: resuming experience collection (2950 times) [2024-06-27 15:01:53,850][06674] Fps is (10 sec: 42598.4, 60 sec: 42871.4, 300 sec: 43264.9). Total num frames: 304562176. Throughput: 0: 43063.5. Samples: 207537540. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 15:01:53,850][06674] Avg episode reward: [(0, '0.371')] [2024-06-27 15:01:54,402][06909] Updated weights for policy 0, policy_version 18592 (0.0042) [2024-06-27 15:01:57,629][06909] Updated weights for policy 0, policy_version 18602 (0.0028) [2024-06-27 15:01:58,850][06674] Fps is (10 sec: 42624.4, 60 sec: 43144.6, 300 sec: 43209.3). Total num frames: 304775168. Throughput: 0: 42926.3. Samples: 207667440. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 15:01:58,850][06674] Avg episode reward: [(0, '0.371')] [2024-06-27 15:02:01,734][06909] Updated weights for policy 0, policy_version 18612 (0.0034) [2024-06-27 15:02:03,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43417.6, 300 sec: 43320.7). Total num frames: 305020928. Throughput: 0: 43105.3. Samples: 207929780. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-27 15:02:03,850][06674] Avg episode reward: [(0, '0.369')] [2024-06-27 15:02:05,612][06909] Updated weights for policy 0, policy_version 18622 (0.0046) [2024-06-27 15:02:08,854][06674] Fps is (10 sec: 45856.7, 60 sec: 42868.6, 300 sec: 43375.3). Total num frames: 305233920. Throughput: 0: 43192.3. Samples: 208192500. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-27 15:02:08,854][06674] Avg episode reward: [(0, '0.366')] [2024-06-27 15:02:09,338][06909] Updated weights for policy 0, policy_version 18632 (0.0027) [2024-06-27 15:02:13,101][06909] Updated weights for policy 0, policy_version 18642 (0.0038) [2024-06-27 15:02:13,855][06674] Fps is (10 sec: 40939.7, 60 sec: 43140.9, 300 sec: 43264.1). Total num frames: 305430528. Throughput: 0: 43120.0. Samples: 208318360. Policy #0 lag: (min: 1.0, avg: 10.5, max: 21.0) [2024-06-27 15:02:13,855][06674] Avg episode reward: [(0, '0.365')] [2024-06-27 15:02:16,772][06909] Updated weights for policy 0, policy_version 18652 (0.0036) [2024-06-27 15:02:18,850][06674] Fps is (10 sec: 44254.6, 60 sec: 43690.7, 300 sec: 43320.4). Total num frames: 305676288. Throughput: 0: 43437.5. Samples: 208585640. Policy #0 lag: (min: 1.0, avg: 10.5, max: 21.0) [2024-06-27 15:02:18,850][06674] Avg episode reward: [(0, '0.364')] [2024-06-27 15:02:20,471][06909] Updated weights for policy 0, policy_version 18662 (0.0030) [2024-06-27 15:02:23,850][06674] Fps is (10 sec: 45898.5, 60 sec: 43146.0, 300 sec: 43431.5). Total num frames: 305889280. Throughput: 0: 43463.1. Samples: 208850720. Policy #0 lag: (min: 1.0, avg: 10.5, max: 21.0) [2024-06-27 15:02:23,851][06674] Avg episode reward: [(0, '0.371')] [2024-06-27 15:02:24,311][06909] Updated weights for policy 0, policy_version 18672 (0.0029) [2024-06-27 15:02:28,725][06909] Updated weights for policy 0, policy_version 18682 (0.0038) [2024-06-27 15:02:28,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43417.6, 300 sec: 43264.9). Total num frames: 306085888. Throughput: 0: 43310.7. Samples: 208975680. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 15:02:28,850][06674] Avg episode reward: [(0, '0.376')] [2024-06-27 15:02:31,830][06909] Updated weights for policy 0, policy_version 18692 (0.0036) [2024-06-27 15:02:33,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43417.7, 300 sec: 43264.9). Total num frames: 306315264. Throughput: 0: 43564.9. Samples: 209239740. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 15:02:33,855][06674] Avg episode reward: [(0, '0.358')] [2024-06-27 15:02:36,158][06909] Updated weights for policy 0, policy_version 18702 (0.0036) [2024-06-27 15:02:38,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43690.6, 300 sec: 43431.5). Total num frames: 306544640. Throughput: 0: 43599.7. Samples: 209499520. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 15:02:38,850][06674] Avg episode reward: [(0, '0.359')] [2024-06-27 15:02:39,304][06909] Updated weights for policy 0, policy_version 18712 (0.0028) [2024-06-27 15:02:43,602][06909] Updated weights for policy 0, policy_version 18722 (0.0025) [2024-06-27 15:02:43,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43417.6, 300 sec: 43265.2). Total num frames: 306741248. Throughput: 0: 43638.6. Samples: 209631180. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 15:02:43,850][06674] Avg episode reward: [(0, '0.371')] [2024-06-27 15:02:46,818][06909] Updated weights for policy 0, policy_version 18732 (0.0038) [2024-06-27 15:02:48,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43695.1, 300 sec: 43209.3). Total num frames: 306970624. Throughput: 0: 43611.7. Samples: 209892300. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 15:02:48,850][06674] Avg episode reward: [(0, '0.377')] [2024-06-27 15:02:50,961][06909] Updated weights for policy 0, policy_version 18742 (0.0048) [2024-06-27 15:02:53,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.8, 300 sec: 43487.3). Total num frames: 307200000. Throughput: 0: 43559.4. Samples: 210152500. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-27 15:02:53,850][06674] Avg episode reward: [(0, '0.378')] [2024-06-27 15:02:54,464][06909] Updated weights for policy 0, policy_version 18752 (0.0051) [2024-06-27 15:02:58,333][06909] Updated weights for policy 0, policy_version 18762 (0.0035) [2024-06-27 15:02:58,852][06674] Fps is (10 sec: 42589.9, 60 sec: 43689.2, 300 sec: 43264.6). Total num frames: 307396608. Throughput: 0: 43699.4. Samples: 210284700. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-27 15:02:58,852][06674] Avg episode reward: [(0, '0.373')] [2024-06-27 15:03:01,941][06909] Updated weights for policy 0, policy_version 18772 (0.0034) [2024-06-27 15:03:03,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43144.6, 300 sec: 43209.3). Total num frames: 307609600. Throughput: 0: 43504.8. Samples: 210543360. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-27 15:03:03,850][06674] Avg episode reward: [(0, '0.377')] [2024-06-27 15:03:05,818][06909] Updated weights for policy 0, policy_version 18782 (0.0037) [2024-06-27 15:03:08,850][06674] Fps is (10 sec: 44245.6, 60 sec: 43420.5, 300 sec: 43431.5). Total num frames: 307838976. Throughput: 0: 43426.2. Samples: 210804900. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 15:03:08,850][06674] Avg episode reward: [(0, '0.379')] [2024-06-27 15:03:08,857][06887] Saving new best policy, reward=0.379! [2024-06-27 15:03:09,402][06909] Updated weights for policy 0, policy_version 18792 (0.0044) [2024-06-27 15:03:13,446][06909] Updated weights for policy 0, policy_version 18802 (0.0047) [2024-06-27 15:03:13,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43694.4, 300 sec: 43264.9). Total num frames: 308051968. Throughput: 0: 43615.6. Samples: 210938380. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 15:03:13,850][06674] Avg episode reward: [(0, '0.371')] [2024-06-27 15:03:16,948][06909] Updated weights for policy 0, policy_version 18812 (0.0034) [2024-06-27 15:03:18,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43417.6, 300 sec: 43264.9). Total num frames: 308281344. Throughput: 0: 43442.7. Samples: 211194660. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 15:03:18,850][06674] Avg episode reward: [(0, '0.369')] [2024-06-27 15:03:19,859][06887] Signal inference workers to stop experience collection... (3000 times) [2024-06-27 15:03:19,864][06887] Signal inference workers to resume experience collection... (3000 times) [2024-06-27 15:03:19,896][06909] InferenceWorker_p0-w0: stopping experience collection (3000 times) [2024-06-27 15:03:19,896][06909] InferenceWorker_p0-w0: resuming experience collection (3000 times) [2024-06-27 15:03:21,164][06909] Updated weights for policy 0, policy_version 18822 (0.0040) [2024-06-27 15:03:23,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43144.5, 300 sec: 43431.5). Total num frames: 308477952. Throughput: 0: 43501.7. Samples: 211457100. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 15:03:23,850][06674] Avg episode reward: [(0, '0.372')] [2024-06-27 15:03:24,333][06909] Updated weights for policy 0, policy_version 18832 (0.0020) [2024-06-27 15:03:28,552][06909] Updated weights for policy 0, policy_version 18842 (0.0039) [2024-06-27 15:03:28,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.7, 300 sec: 43320.4). Total num frames: 308707328. Throughput: 0: 43558.3. Samples: 211591300. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 15:03:28,850][06674] Avg episode reward: [(0, '0.379')] [2024-06-27 15:03:31,928][06909] Updated weights for policy 0, policy_version 18852 (0.0033) [2024-06-27 15:03:33,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43690.7, 300 sec: 43320.4). Total num frames: 308936704. Throughput: 0: 43393.4. Samples: 211845000. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 15:03:33,850][06674] Avg episode reward: [(0, '0.377')] [2024-06-27 15:03:36,588][06909] Updated weights for policy 0, policy_version 18862 (0.0032) [2024-06-27 15:03:38,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43144.5, 300 sec: 43431.5). Total num frames: 309133312. Throughput: 0: 43580.1. Samples: 212113600. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 15:03:38,850][06674] Avg episode reward: [(0, '0.377')] [2024-06-27 15:03:39,542][06909] Updated weights for policy 0, policy_version 18872 (0.0037) [2024-06-27 15:03:43,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43417.6, 300 sec: 43264.9). Total num frames: 309346304. Throughput: 0: 43479.3. Samples: 212241180. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 15:03:43,850][06674] Avg episode reward: [(0, '0.379')] [2024-06-27 15:03:44,050][06909] Updated weights for policy 0, policy_version 18882 (0.0031) [2024-06-27 15:03:47,023][06909] Updated weights for policy 0, policy_version 18892 (0.0031) [2024-06-27 15:03:48,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43690.6, 300 sec: 43376.2). Total num frames: 309592064. Throughput: 0: 43527.5. Samples: 212502100. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-27 15:03:48,864][06674] Avg episode reward: [(0, '0.385')] [2024-06-27 15:03:48,880][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000018896_309592064.pth... [2024-06-27 15:03:48,938][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000018261_299188224.pth [2024-06-27 15:03:48,948][06887] Saving new best policy, reward=0.385! [2024-06-27 15:03:51,411][06909] Updated weights for policy 0, policy_version 18902 (0.0029) [2024-06-27 15:03:53,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43417.6, 300 sec: 43487.0). Total num frames: 309805056. Throughput: 0: 43587.1. Samples: 212766320. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-27 15:03:53,850][06674] Avg episode reward: [(0, '0.384')] [2024-06-27 15:03:54,559][06909] Updated weights for policy 0, policy_version 18912 (0.0033) [2024-06-27 15:03:58,856][06674] Fps is (10 sec: 40935.3, 60 sec: 43414.6, 300 sec: 43264.0). Total num frames: 310001664. Throughput: 0: 43469.6. Samples: 212894780. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 15:03:58,866][06674] Avg episode reward: [(0, '0.377')] [2024-06-27 15:03:59,037][06909] Updated weights for policy 0, policy_version 18922 (0.0034) [2024-06-27 15:04:02,352][06909] Updated weights for policy 0, policy_version 18932 (0.0034) [2024-06-27 15:04:03,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.8, 300 sec: 43431.5). Total num frames: 310247424. Throughput: 0: 43582.7. Samples: 213155880. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 15:04:03,850][06674] Avg episode reward: [(0, '0.376')] [2024-06-27 15:04:06,563][06909] Updated weights for policy 0, policy_version 18942 (0.0040) [2024-06-27 15:04:08,850][06674] Fps is (10 sec: 45903.0, 60 sec: 43690.7, 300 sec: 43487.0). Total num frames: 310460416. Throughput: 0: 43606.7. Samples: 213419400. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 15:04:08,850][06674] Avg episode reward: [(0, '0.378')] [2024-06-27 15:04:09,986][06909] Updated weights for policy 0, policy_version 18952 (0.0050) [2024-06-27 15:04:13,850][06674] Fps is (10 sec: 40959.4, 60 sec: 43417.5, 300 sec: 43264.9). Total num frames: 310657024. Throughput: 0: 43410.9. Samples: 213544800. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-27 15:04:13,850][06674] Avg episode reward: [(0, '0.380')] [2024-06-27 15:04:14,190][06909] Updated weights for policy 0, policy_version 18962 (0.0031) [2024-06-27 15:04:17,341][06909] Updated weights for policy 0, policy_version 18972 (0.0039) [2024-06-27 15:04:18,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43417.5, 300 sec: 43375.9). Total num frames: 310886400. Throughput: 0: 43631.4. Samples: 213808420. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-27 15:04:18,854][06674] Avg episode reward: [(0, '0.383')] [2024-06-27 15:04:21,769][06909] Updated weights for policy 0, policy_version 18982 (0.0041) [2024-06-27 15:04:23,850][06674] Fps is (10 sec: 44237.7, 60 sec: 43690.7, 300 sec: 43376.0). Total num frames: 311099392. Throughput: 0: 43471.6. Samples: 214069820. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-27 15:04:23,850][06674] Avg episode reward: [(0, '0.382')] [2024-06-27 15:04:24,773][06909] Updated weights for policy 0, policy_version 18992 (0.0041) [2024-06-27 15:04:28,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43144.4, 300 sec: 43264.8). Total num frames: 311296000. Throughput: 0: 43541.7. Samples: 214200560. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-27 15:04:28,850][06674] Avg episode reward: [(0, '0.373')] [2024-06-27 15:04:29,183][06909] Updated weights for policy 0, policy_version 19002 (0.0022) [2024-06-27 15:04:32,351][06909] Updated weights for policy 0, policy_version 19012 (0.0043) [2024-06-27 15:04:33,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43144.5, 300 sec: 43320.4). Total num frames: 311525376. Throughput: 0: 43425.4. Samples: 214456240. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-27 15:04:33,850][06674] Avg episode reward: [(0, '0.379')] [2024-06-27 15:04:36,805][06909] Updated weights for policy 0, policy_version 19022 (0.0041) [2024-06-27 15:04:38,850][06674] Fps is (10 sec: 45876.0, 60 sec: 43690.7, 300 sec: 43376.3). Total num frames: 311754752. Throughput: 0: 43447.6. Samples: 214721460. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-27 15:04:38,850][06674] Avg episode reward: [(0, '0.377')] [2024-06-27 15:04:39,952][06909] Updated weights for policy 0, policy_version 19032 (0.0051) [2024-06-27 15:04:43,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43417.7, 300 sec: 43264.9). Total num frames: 311951360. Throughput: 0: 43478.4. Samples: 214851040. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 15:04:43,850][06674] Avg episode reward: [(0, '0.377')] [2024-06-27 15:04:44,162][06909] Updated weights for policy 0, policy_version 19042 (0.0031) [2024-06-27 15:04:47,590][06909] Updated weights for policy 0, policy_version 19052 (0.0044) [2024-06-27 15:04:48,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 312197120. Throughput: 0: 43436.3. Samples: 215110520. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 15:04:48,853][06674] Avg episode reward: [(0, '0.374')] [2024-06-27 15:04:51,695][06909] Updated weights for policy 0, policy_version 19062 (0.0024) [2024-06-27 15:04:53,850][06674] Fps is (10 sec: 45874.4, 60 sec: 43417.6, 300 sec: 43432.3). Total num frames: 312410112. Throughput: 0: 43277.7. Samples: 215366900. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-27 15:04:53,850][06674] Avg episode reward: [(0, '0.375')] [2024-06-27 15:04:55,290][06909] Updated weights for policy 0, policy_version 19072 (0.0038) [2024-06-27 15:04:56,520][06887] Signal inference workers to stop experience collection... (3050 times) [2024-06-27 15:04:56,521][06887] Signal inference workers to resume experience collection... (3050 times) [2024-06-27 15:04:56,556][06909] InferenceWorker_p0-w0: stopping experience collection (3050 times) [2024-06-27 15:04:56,556][06909] InferenceWorker_p0-w0: resuming experience collection (3050 times) [2024-06-27 15:04:58,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43421.9, 300 sec: 43375.9). Total num frames: 312606720. Throughput: 0: 43371.1. Samples: 215496500. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-27 15:04:58,851][06674] Avg episode reward: [(0, '0.382')] [2024-06-27 15:04:59,316][06909] Updated weights for policy 0, policy_version 19082 (0.0039) [2024-06-27 15:05:02,963][06909] Updated weights for policy 0, policy_version 19092 (0.0043) [2024-06-27 15:05:03,850][06674] Fps is (10 sec: 40960.6, 60 sec: 42871.5, 300 sec: 43320.4). Total num frames: 312819712. Throughput: 0: 43223.2. Samples: 215753460. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-27 15:05:03,850][06674] Avg episode reward: [(0, '0.381')] [2024-06-27 15:05:07,045][06909] Updated weights for policy 0, policy_version 19102 (0.0033) [2024-06-27 15:05:08,850][06674] Fps is (10 sec: 42599.2, 60 sec: 42871.5, 300 sec: 43320.4). Total num frames: 313032704. Throughput: 0: 43188.0. Samples: 216013280. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 15:05:08,850][06674] Avg episode reward: [(0, '0.380')] [2024-06-27 15:05:10,734][06909] Updated weights for policy 0, policy_version 19112 (0.0035) [2024-06-27 15:05:13,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43417.7, 300 sec: 43375.9). Total num frames: 313262080. Throughput: 0: 43154.8. Samples: 216142520. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 15:05:13,850][06674] Avg episode reward: [(0, '0.382')] [2024-06-27 15:05:14,500][06909] Updated weights for policy 0, policy_version 19122 (0.0022) [2024-06-27 15:05:18,096][06909] Updated weights for policy 0, policy_version 19132 (0.0026) [2024-06-27 15:05:18,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43144.6, 300 sec: 43320.4). Total num frames: 313475072. Throughput: 0: 43352.0. Samples: 216407080. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 15:05:18,850][06674] Avg episode reward: [(0, '0.386')] [2024-06-27 15:05:21,886][06909] Updated weights for policy 0, policy_version 19142 (0.0038) [2024-06-27 15:05:23,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43417.5, 300 sec: 43431.5). Total num frames: 313704448. Throughput: 0: 43306.1. Samples: 216670240. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-27 15:05:23,851][06674] Avg episode reward: [(0, '0.388')] [2024-06-27 15:05:23,851][06887] Saving new best policy, reward=0.388! [2024-06-27 15:05:25,562][06909] Updated weights for policy 0, policy_version 19152 (0.0035) [2024-06-27 15:05:28,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43690.7, 300 sec: 43375.9). Total num frames: 313917440. Throughput: 0: 43319.4. Samples: 216800420. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-27 15:05:28,851][06674] Avg episode reward: [(0, '0.388')] [2024-06-27 15:05:29,586][06909] Updated weights for policy 0, policy_version 19162 (0.0031) [2024-06-27 15:05:33,055][06909] Updated weights for policy 0, policy_version 19172 (0.0034) [2024-06-27 15:05:33,852][06674] Fps is (10 sec: 42590.1, 60 sec: 43416.1, 300 sec: 43320.1). Total num frames: 314130432. Throughput: 0: 43338.6. Samples: 217060840. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 15:05:33,852][06674] Avg episode reward: [(0, '0.378')] [2024-06-27 15:05:36,957][06909] Updated weights for policy 0, policy_version 19182 (0.0037) [2024-06-27 15:05:38,852][06674] Fps is (10 sec: 44228.1, 60 sec: 43416.1, 300 sec: 43431.2). Total num frames: 314359808. Throughput: 0: 43455.4. Samples: 217322480. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 15:05:38,852][06674] Avg episode reward: [(0, '0.384')] [2024-06-27 15:05:40,573][06909] Updated weights for policy 0, policy_version 19192 (0.0032) [2024-06-27 15:05:43,850][06674] Fps is (10 sec: 44245.9, 60 sec: 43690.6, 300 sec: 43431.5). Total num frames: 314572800. Throughput: 0: 43497.1. Samples: 217453860. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 15:05:43,850][06674] Avg episode reward: [(0, '0.378')] [2024-06-27 15:05:44,520][06909] Updated weights for policy 0, policy_version 19202 (0.0032) [2024-06-27 15:05:48,171][06909] Updated weights for policy 0, policy_version 19212 (0.0022) [2024-06-27 15:05:48,856][06674] Fps is (10 sec: 40943.5, 60 sec: 42867.2, 300 sec: 43319.5). Total num frames: 314769408. Throughput: 0: 43570.0. Samples: 217714380. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 15:05:48,856][06674] Avg episode reward: [(0, '0.384')] [2024-06-27 15:05:48,870][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000019212_314769408.pth... [2024-06-27 15:05:48,928][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000018577_304365568.pth [2024-06-27 15:05:51,918][06909] Updated weights for policy 0, policy_version 19222 (0.0025) [2024-06-27 15:05:53,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43417.7, 300 sec: 43487.0). Total num frames: 315015168. Throughput: 0: 43559.1. Samples: 217973440. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 15:05:53,850][06674] Avg episode reward: [(0, '0.386')] [2024-06-27 15:05:55,730][06909] Updated weights for policy 0, policy_version 19232 (0.0031) [2024-06-27 15:05:58,850][06674] Fps is (10 sec: 45902.9, 60 sec: 43690.7, 300 sec: 43431.5). Total num frames: 315228160. Throughput: 0: 43701.8. Samples: 218109100. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 15:05:58,850][06674] Avg episode reward: [(0, '0.381')] [2024-06-27 15:05:59,286][06909] Updated weights for policy 0, policy_version 19242 (0.0027) [2024-06-27 15:06:03,309][06909] Updated weights for policy 0, policy_version 19252 (0.0031) [2024-06-27 15:06:03,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43417.6, 300 sec: 43264.9). Total num frames: 315424768. Throughput: 0: 43696.0. Samples: 218373400. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-27 15:06:03,850][06674] Avg episode reward: [(0, '0.387')] [2024-06-27 15:06:06,675][06909] Updated weights for policy 0, policy_version 19262 (0.0031) [2024-06-27 15:06:08,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.7, 300 sec: 43487.0). Total num frames: 315670528. Throughput: 0: 43442.7. Samples: 218625160. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-27 15:06:08,850][06674] Avg episode reward: [(0, '0.389')] [2024-06-27 15:06:10,851][06909] Updated weights for policy 0, policy_version 19272 (0.0028) [2024-06-27 15:06:13,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43690.7, 300 sec: 43487.0). Total num frames: 315883520. Throughput: 0: 43493.9. Samples: 218757640. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 15:06:13,850][06674] Avg episode reward: [(0, '0.386')] [2024-06-27 15:06:14,114][06909] Updated weights for policy 0, policy_version 19282 (0.0028) [2024-06-27 15:06:18,374][06909] Updated weights for policy 0, policy_version 19292 (0.0032) [2024-06-27 15:06:18,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.6, 300 sec: 43376.2). Total num frames: 316096512. Throughput: 0: 43589.0. Samples: 219022260. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 15:06:18,850][06674] Avg episode reward: [(0, '0.387')] [2024-06-27 15:06:21,846][06909] Updated weights for policy 0, policy_version 19302 (0.0033) [2024-06-27 15:06:23,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43690.7, 300 sec: 43542.5). Total num frames: 316325888. Throughput: 0: 43532.2. Samples: 219281340. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 15:06:23,850][06674] Avg episode reward: [(0, '0.386')] [2024-06-27 15:06:25,937][06909] Updated weights for policy 0, policy_version 19312 (0.0033) [2024-06-27 15:06:25,946][06887] Signal inference workers to stop experience collection... (3100 times) [2024-06-27 15:06:25,946][06887] Signal inference workers to resume experience collection... (3100 times) [2024-06-27 15:06:25,960][06909] InferenceWorker_p0-w0: stopping experience collection (3100 times) [2024-06-27 15:06:25,960][06909] InferenceWorker_p0-w0: resuming experience collection (3100 times) [2024-06-27 15:06:28,852][06674] Fps is (10 sec: 44228.0, 60 sec: 43689.3, 300 sec: 43486.7). Total num frames: 316538880. Throughput: 0: 43601.6. Samples: 219416020. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 15:06:28,852][06674] Avg episode reward: [(0, '0.386')] [2024-06-27 15:06:29,397][06909] Updated weights for policy 0, policy_version 19322 (0.0032) [2024-06-27 15:06:33,439][06909] Updated weights for policy 0, policy_version 19332 (0.0030) [2024-06-27 15:06:33,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43419.1, 300 sec: 43431.5). Total num frames: 316735488. Throughput: 0: 43670.3. Samples: 219679280. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 15:06:33,850][06674] Avg episode reward: [(0, '0.385')] [2024-06-27 15:06:36,812][06909] Updated weights for policy 0, policy_version 19342 (0.0037) [2024-06-27 15:06:38,850][06674] Fps is (10 sec: 42607.3, 60 sec: 43419.1, 300 sec: 43487.0). Total num frames: 316964864. Throughput: 0: 43578.2. Samples: 219934460. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 15:06:38,850][06674] Avg episode reward: [(0, '0.387')] [2024-06-27 15:06:40,861][06909] Updated weights for policy 0, policy_version 19352 (0.0041) [2024-06-27 15:06:43,852][06674] Fps is (10 sec: 45864.6, 60 sec: 43689.0, 300 sec: 43543.1). Total num frames: 317194240. Throughput: 0: 43535.1. Samples: 220068280. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 15:06:43,853][06674] Avg episode reward: [(0, '0.390')] [2024-06-27 15:06:43,853][06887] Saving new best policy, reward=0.390! [2024-06-27 15:06:44,314][06909] Updated weights for policy 0, policy_version 19362 (0.0023) [2024-06-27 15:06:48,327][06909] Updated weights for policy 0, policy_version 19372 (0.0024) [2024-06-27 15:06:48,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43695.1, 300 sec: 43487.0). Total num frames: 317390848. Throughput: 0: 43374.6. Samples: 220325260. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 15:06:48,850][06674] Avg episode reward: [(0, '0.393')] [2024-06-27 15:06:48,864][06887] Saving new best policy, reward=0.393! [2024-06-27 15:06:52,069][06909] Updated weights for policy 0, policy_version 19382 (0.0029) [2024-06-27 15:06:53,850][06674] Fps is (10 sec: 42608.5, 60 sec: 43417.6, 300 sec: 43542.6). Total num frames: 317620224. Throughput: 0: 43451.6. Samples: 220580480. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 15:06:53,851][06674] Avg episode reward: [(0, '0.393')] [2024-06-27 15:06:55,814][06909] Updated weights for policy 0, policy_version 19392 (0.0047) [2024-06-27 15:06:58,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43144.6, 300 sec: 43376.0). Total num frames: 317816832. Throughput: 0: 43500.9. Samples: 220715180. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-27 15:06:58,850][06674] Avg episode reward: [(0, '0.390')] [2024-06-27 15:06:59,753][06909] Updated weights for policy 0, policy_version 19402 (0.0030) [2024-06-27 15:07:03,351][06909] Updated weights for policy 0, policy_version 19412 (0.0031) [2024-06-27 15:07:03,852][06674] Fps is (10 sec: 42587.9, 60 sec: 43688.9, 300 sec: 43431.7). Total num frames: 318046208. Throughput: 0: 43433.2. Samples: 220976860. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-27 15:07:03,853][06674] Avg episode reward: [(0, '0.388')] [2024-06-27 15:07:07,129][06909] Updated weights for policy 0, policy_version 19422 (0.0024) [2024-06-27 15:07:08,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43417.6, 300 sec: 43543.3). Total num frames: 318275584. Throughput: 0: 43412.1. Samples: 221234880. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 15:07:08,850][06674] Avg episode reward: [(0, '0.390')] [2024-06-27 15:07:11,200][06909] Updated weights for policy 0, policy_version 19432 (0.0036) [2024-06-27 15:07:13,850][06674] Fps is (10 sec: 42608.6, 60 sec: 43144.5, 300 sec: 43375.9). Total num frames: 318472192. Throughput: 0: 43295.2. Samples: 221364220. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 15:07:13,850][06674] Avg episode reward: [(0, '0.389')] [2024-06-27 15:07:14,706][06909] Updated weights for policy 0, policy_version 19442 (0.0047) [2024-06-27 15:07:18,673][06909] Updated weights for policy 0, policy_version 19452 (0.0031) [2024-06-27 15:07:18,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 318701568. Throughput: 0: 43208.4. Samples: 221623660. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 15:07:18,850][06674] Avg episode reward: [(0, '0.385')] [2024-06-27 15:07:22,324][06909] Updated weights for policy 0, policy_version 19462 (0.0041) [2024-06-27 15:07:23,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43144.5, 300 sec: 43487.0). Total num frames: 318914560. Throughput: 0: 43219.9. Samples: 221879360. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 15:07:23,850][06674] Avg episode reward: [(0, '0.388')] [2024-06-27 15:07:26,199][06909] Updated weights for policy 0, policy_version 19472 (0.0031) [2024-06-27 15:07:28,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43146.0, 300 sec: 43431.5). Total num frames: 319127552. Throughput: 0: 43142.3. Samples: 222009580. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 15:07:28,850][06674] Avg episode reward: [(0, '0.387')] [2024-06-27 15:07:30,003][06909] Updated weights for policy 0, policy_version 19482 (0.0036) [2024-06-27 15:07:33,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43417.7, 300 sec: 43376.0). Total num frames: 319340544. Throughput: 0: 43180.6. Samples: 222268380. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 15:07:33,850][06674] Avg episode reward: [(0, '0.388')] [2024-06-27 15:07:34,098][06909] Updated weights for policy 0, policy_version 19492 (0.0030) [2024-06-27 15:07:37,493][06909] Updated weights for policy 0, policy_version 19502 (0.0046) [2024-06-27 15:07:38,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43417.5, 300 sec: 43487.0). Total num frames: 319569920. Throughput: 0: 43410.6. Samples: 222533960. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-27 15:07:38,850][06674] Avg episode reward: [(0, '0.388')] [2024-06-27 15:07:41,417][06909] Updated weights for policy 0, policy_version 19512 (0.0027) [2024-06-27 15:07:43,850][06674] Fps is (10 sec: 44236.0, 60 sec: 43146.2, 300 sec: 43431.5). Total num frames: 319782912. Throughput: 0: 43365.7. Samples: 222666640. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-27 15:07:43,850][06674] Avg episode reward: [(0, '0.387')] [2024-06-27 15:07:44,909][06909] Updated weights for policy 0, policy_version 19522 (0.0038) [2024-06-27 15:07:48,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43417.6, 300 sec: 43375.9). Total num frames: 319995904. Throughput: 0: 43226.3. Samples: 222921940. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 15:07:48,850][06674] Avg episode reward: [(0, '0.388')] [2024-06-27 15:07:48,868][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000019531_319995904.pth... [2024-06-27 15:07:48,936][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000018896_309592064.pth [2024-06-27 15:07:49,134][06909] Updated weights for policy 0, policy_version 19532 (0.0033) [2024-06-27 15:07:52,530][06909] Updated weights for policy 0, policy_version 19542 (0.0031) [2024-06-27 15:07:53,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43144.5, 300 sec: 43431.8). Total num frames: 320208896. Throughput: 0: 43330.1. Samples: 223184740. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 15:07:53,850][06674] Avg episode reward: [(0, '0.387')] [2024-06-27 15:07:55,611][06887] Signal inference workers to stop experience collection... (3150 times) [2024-06-27 15:07:55,660][06909] InferenceWorker_p0-w0: stopping experience collection (3150 times) [2024-06-27 15:07:55,722][06887] Signal inference workers to resume experience collection... (3150 times) [2024-06-27 15:07:55,722][06909] InferenceWorker_p0-w0: resuming experience collection (3150 times) [2024-06-27 15:07:56,547][06909] Updated weights for policy 0, policy_version 19552 (0.0029) [2024-06-27 15:07:58,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 320421888. Throughput: 0: 43330.3. Samples: 223314080. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 15:07:58,850][06674] Avg episode reward: [(0, '0.392')] [2024-06-27 15:07:59,906][06909] Updated weights for policy 0, policy_version 19562 (0.0025) [2024-06-27 15:08:03,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43419.3, 300 sec: 43431.5). Total num frames: 320651264. Throughput: 0: 43452.1. Samples: 223579000. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 15:08:03,850][06674] Avg episode reward: [(0, '0.392')] [2024-06-27 15:08:03,860][06909] Updated weights for policy 0, policy_version 19572 (0.0033) [2024-06-27 15:08:07,308][06909] Updated weights for policy 0, policy_version 19582 (0.0030) [2024-06-27 15:08:08,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43417.6, 300 sec: 43487.0). Total num frames: 320880640. Throughput: 0: 43606.8. Samples: 223841660. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 15:08:08,850][06674] Avg episode reward: [(0, '0.389')] [2024-06-27 15:08:11,283][06909] Updated weights for policy 0, policy_version 19592 (0.0032) [2024-06-27 15:08:13,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.7, 300 sec: 43431.5). Total num frames: 321093632. Throughput: 0: 43466.6. Samples: 223965580. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 15:08:13,850][06674] Avg episode reward: [(0, '0.398')] [2024-06-27 15:08:13,851][06887] Saving new best policy, reward=0.398! [2024-06-27 15:08:15,298][06909] Updated weights for policy 0, policy_version 19602 (0.0028) [2024-06-27 15:08:18,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43417.7, 300 sec: 43487.0). Total num frames: 321306624. Throughput: 0: 43607.5. Samples: 224230720. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 15:08:18,850][06674] Avg episode reward: [(0, '0.390')] [2024-06-27 15:08:18,879][06909] Updated weights for policy 0, policy_version 19612 (0.0034) [2024-06-27 15:08:22,670][06909] Updated weights for policy 0, policy_version 19622 (0.0043) [2024-06-27 15:08:23,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43690.7, 300 sec: 43487.0). Total num frames: 321536000. Throughput: 0: 43503.5. Samples: 224491620. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 15:08:23,850][06674] Avg episode reward: [(0, '0.390')] [2024-06-27 15:08:26,215][06909] Updated weights for policy 0, policy_version 19632 (0.0033) [2024-06-27 15:08:28,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.6, 300 sec: 43431.5). Total num frames: 321748992. Throughput: 0: 43454.7. Samples: 224622100. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 15:08:28,850][06674] Avg episode reward: [(0, '0.386')] [2024-06-27 15:08:30,024][06909] Updated weights for policy 0, policy_version 19642 (0.0042) [2024-06-27 15:08:33,667][06909] Updated weights for policy 0, policy_version 19652 (0.0030) [2024-06-27 15:08:33,856][06674] Fps is (10 sec: 44210.2, 60 sec: 43959.2, 300 sec: 43541.7). Total num frames: 321978368. Throughput: 0: 43667.0. Samples: 224887220. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 15:08:33,857][06674] Avg episode reward: [(0, '0.392')] [2024-06-27 15:08:38,066][06909] Updated weights for policy 0, policy_version 19662 (0.0042) [2024-06-27 15:08:38,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43144.5, 300 sec: 43431.5). Total num frames: 322158592. Throughput: 0: 43590.2. Samples: 225146300. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 15:08:38,850][06674] Avg episode reward: [(0, '0.384')] [2024-06-27 15:08:41,165][06909] Updated weights for policy 0, policy_version 19672 (0.0023) [2024-06-27 15:08:43,850][06674] Fps is (10 sec: 42624.5, 60 sec: 43690.8, 300 sec: 43431.5). Total num frames: 322404352. Throughput: 0: 43508.5. Samples: 225271960. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-27 15:08:43,850][06674] Avg episode reward: [(0, '0.384')] [2024-06-27 15:08:45,420][06909] Updated weights for policy 0, policy_version 19682 (0.0047) [2024-06-27 15:08:48,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43690.7, 300 sec: 43431.5). Total num frames: 322617344. Throughput: 0: 43534.7. Samples: 225538060. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-27 15:08:48,850][06674] Avg episode reward: [(0, '0.381')] [2024-06-27 15:08:49,082][06909] Updated weights for policy 0, policy_version 19692 (0.0037) [2024-06-27 15:08:52,876][06909] Updated weights for policy 0, policy_version 19702 (0.0038) [2024-06-27 15:08:53,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43417.7, 300 sec: 43432.4). Total num frames: 322813952. Throughput: 0: 43509.8. Samples: 225799600. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-27 15:08:53,850][06674] Avg episode reward: [(0, '0.387')] [2024-06-27 15:08:56,528][06909] Updated weights for policy 0, policy_version 19712 (0.0038) [2024-06-27 15:08:58,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44236.7, 300 sec: 43487.0). Total num frames: 323076096. Throughput: 0: 43671.1. Samples: 225930780. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2024-06-27 15:08:58,850][06674] Avg episode reward: [(0, '0.391')] [2024-06-27 15:09:00,337][06909] Updated weights for policy 0, policy_version 19722 (0.0042) [2024-06-27 15:09:03,853][06674] Fps is (10 sec: 44221.4, 60 sec: 43415.1, 300 sec: 43375.4). Total num frames: 323256320. Throughput: 0: 43635.8. Samples: 226194480. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2024-06-27 15:09:03,854][06674] Avg episode reward: [(0, '0.394')] [2024-06-27 15:09:04,173][06909] Updated weights for policy 0, policy_version 19732 (0.0041) [2024-06-27 15:09:07,615][06887] Signal inference workers to stop experience collection... (3200 times) [2024-06-27 15:09:07,616][06887] Signal inference workers to resume experience collection... (3200 times) [2024-06-27 15:09:07,658][06909] InferenceWorker_p0-w0: stopping experience collection (3200 times) [2024-06-27 15:09:07,659][06909] InferenceWorker_p0-w0: resuming experience collection (3200 times) [2024-06-27 15:09:07,755][06909] Updated weights for policy 0, policy_version 19742 (0.0029) [2024-06-27 15:09:08,854][06674] Fps is (10 sec: 39306.1, 60 sec: 43141.6, 300 sec: 43430.9). Total num frames: 323469312. Throughput: 0: 43490.4. Samples: 226448860. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2024-06-27 15:09:08,854][06674] Avg episode reward: [(0, '0.392')] [2024-06-27 15:09:11,616][06909] Updated weights for policy 0, policy_version 19752 (0.0044) [2024-06-27 15:09:13,850][06674] Fps is (10 sec: 45891.1, 60 sec: 43690.7, 300 sec: 43487.0). Total num frames: 323715072. Throughput: 0: 43553.8. Samples: 226582020. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-27 15:09:13,850][06674] Avg episode reward: [(0, '0.398')] [2024-06-27 15:09:15,211][06909] Updated weights for policy 0, policy_version 19762 (0.0035) [2024-06-27 15:09:18,850][06674] Fps is (10 sec: 44254.1, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 323911680. Throughput: 0: 43461.8. Samples: 226842740. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-27 15:09:18,850][06674] Avg episode reward: [(0, '0.388')] [2024-06-27 15:09:19,256][06909] Updated weights for policy 0, policy_version 19772 (0.0042) [2024-06-27 15:09:22,750][06909] Updated weights for policy 0, policy_version 19782 (0.0038) [2024-06-27 15:09:23,850][06674] Fps is (10 sec: 40959.4, 60 sec: 43144.5, 300 sec: 43487.0). Total num frames: 324124672. Throughput: 0: 43186.2. Samples: 227089680. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-27 15:09:23,850][06674] Avg episode reward: [(0, '0.387')] [2024-06-27 15:09:26,999][06909] Updated weights for policy 0, policy_version 19792 (0.0036) [2024-06-27 15:09:28,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43417.7, 300 sec: 43487.0). Total num frames: 324354048. Throughput: 0: 43380.0. Samples: 227224060. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 15:09:28,850][06674] Avg episode reward: [(0, '0.396')] [2024-06-27 15:09:30,333][06909] Updated weights for policy 0, policy_version 19802 (0.0045) [2024-06-27 15:09:33,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43148.9, 300 sec: 43431.5). Total num frames: 324567040. Throughput: 0: 43287.5. Samples: 227486000. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 15:09:33,850][06674] Avg episode reward: [(0, '0.388')] [2024-06-27 15:09:34,512][06909] Updated weights for policy 0, policy_version 19812 (0.0038) [2024-06-27 15:09:37,824][06909] Updated weights for policy 0, policy_version 19822 (0.0038) [2024-06-27 15:09:38,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.7, 300 sec: 43487.0). Total num frames: 324780032. Throughput: 0: 43088.0. Samples: 227738560. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 15:09:38,850][06674] Avg episode reward: [(0, '0.386')] [2024-06-27 15:09:41,944][06909] Updated weights for policy 0, policy_version 19832 (0.0041) [2024-06-27 15:09:43,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43144.4, 300 sec: 43376.0). Total num frames: 324993024. Throughput: 0: 43077.8. Samples: 227869280. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 15:09:43,850][06674] Avg episode reward: [(0, '0.396')] [2024-06-27 15:09:45,515][06909] Updated weights for policy 0, policy_version 19842 (0.0042) [2024-06-27 15:09:48,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43144.5, 300 sec: 43376.0). Total num frames: 325206016. Throughput: 0: 43129.1. Samples: 228135140. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 15:09:48,850][06674] Avg episode reward: [(0, '0.382')] [2024-06-27 15:09:48,865][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000019849_325206016.pth... [2024-06-27 15:09:48,920][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000019212_314769408.pth [2024-06-27 15:09:49,598][06909] Updated weights for policy 0, policy_version 19852 (0.0043) [2024-06-27 15:09:53,469][06909] Updated weights for policy 0, policy_version 19862 (0.0027) [2024-06-27 15:09:53,852][06674] Fps is (10 sec: 42589.9, 60 sec: 43416.1, 300 sec: 43431.2). Total num frames: 325419008. Throughput: 0: 42997.0. Samples: 228383640. Policy #0 lag: (min: 1.0, avg: 11.2, max: 21.0) [2024-06-27 15:09:53,852][06674] Avg episode reward: [(0, '0.392')] [2024-06-27 15:09:57,527][06909] Updated weights for policy 0, policy_version 19872 (0.0038) [2024-06-27 15:09:58,850][06674] Fps is (10 sec: 44236.3, 60 sec: 42871.5, 300 sec: 43487.0). Total num frames: 325648384. Throughput: 0: 43048.4. Samples: 228519200. Policy #0 lag: (min: 1.0, avg: 11.2, max: 21.0) [2024-06-27 15:09:58,851][06674] Avg episode reward: [(0, '0.390')] [2024-06-27 15:10:01,080][06909] Updated weights for policy 0, policy_version 19882 (0.0032) [2024-06-27 15:10:03,850][06674] Fps is (10 sec: 40968.1, 60 sec: 42873.9, 300 sec: 43375.9). Total num frames: 325828608. Throughput: 0: 42982.2. Samples: 228776940. Policy #0 lag: (min: 1.0, avg: 11.2, max: 21.0) [2024-06-27 15:10:03,851][06674] Avg episode reward: [(0, '0.390')] [2024-06-27 15:10:05,087][06909] Updated weights for policy 0, policy_version 19892 (0.0030) [2024-06-27 15:10:08,562][06909] Updated weights for policy 0, policy_version 19902 (0.0036) [2024-06-27 15:10:08,853][06674] Fps is (10 sec: 42583.5, 60 sec: 43417.9, 300 sec: 43431.0). Total num frames: 326074368. Throughput: 0: 43195.8. Samples: 229033640. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2024-06-27 15:10:08,854][06674] Avg episode reward: [(0, '0.392')] [2024-06-27 15:10:12,584][06909] Updated weights for policy 0, policy_version 19912 (0.0028) [2024-06-27 15:10:13,850][06674] Fps is (10 sec: 45875.1, 60 sec: 42871.4, 300 sec: 43431.5). Total num frames: 326287360. Throughput: 0: 43203.4. Samples: 229168220. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2024-06-27 15:10:13,853][06674] Avg episode reward: [(0, '0.389')] [2024-06-27 15:10:16,064][06909] Updated weights for policy 0, policy_version 19922 (0.0035) [2024-06-27 15:10:18,850][06674] Fps is (10 sec: 40974.6, 60 sec: 42871.5, 300 sec: 43320.4). Total num frames: 326483968. Throughput: 0: 42985.4. Samples: 229420340. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2024-06-27 15:10:18,850][06674] Avg episode reward: [(0, '0.392')] [2024-06-27 15:10:20,087][06909] Updated weights for policy 0, policy_version 19932 (0.0035) [2024-06-27 15:10:23,647][06909] Updated weights for policy 0, policy_version 19942 (0.0027) [2024-06-27 15:10:23,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43417.7, 300 sec: 43431.5). Total num frames: 326729728. Throughput: 0: 43128.8. Samples: 229679360. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 15:10:23,850][06674] Avg episode reward: [(0, '0.387')] [2024-06-27 15:10:27,838][06909] Updated weights for policy 0, policy_version 19952 (0.0037) [2024-06-27 15:10:28,850][06674] Fps is (10 sec: 44236.7, 60 sec: 42871.4, 300 sec: 43376.2). Total num frames: 326926336. Throughput: 0: 43228.9. Samples: 229814580. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 15:10:28,850][06674] Avg episode reward: [(0, '0.389')] [2024-06-27 15:10:31,116][06909] Updated weights for policy 0, policy_version 19962 (0.0045) [2024-06-27 15:10:33,850][06674] Fps is (10 sec: 39321.2, 60 sec: 42598.3, 300 sec: 43265.2). Total num frames: 327122944. Throughput: 0: 43013.6. Samples: 230070760. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 15:10:33,851][06674] Avg episode reward: [(0, '0.390')] [2024-06-27 15:10:35,431][06909] Updated weights for policy 0, policy_version 19972 (0.0026) [2024-06-27 15:10:38,433][06887] Signal inference workers to stop experience collection... (3250 times) [2024-06-27 15:10:38,435][06887] Signal inference workers to resume experience collection... (3250 times) [2024-06-27 15:10:38,452][06909] InferenceWorker_p0-w0: stopping experience collection (3250 times) [2024-06-27 15:10:38,467][06909] InferenceWorker_p0-w0: resuming experience collection (3250 times) [2024-06-27 15:10:38,590][06909] Updated weights for policy 0, policy_version 19982 (0.0036) [2024-06-27 15:10:38,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 327385088. Throughput: 0: 43189.9. Samples: 230327100. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-27 15:10:38,850][06674] Avg episode reward: [(0, '0.395')] [2024-06-27 15:10:42,941][06909] Updated weights for policy 0, policy_version 19992 (0.0031) [2024-06-27 15:10:43,850][06674] Fps is (10 sec: 45875.9, 60 sec: 43144.6, 300 sec: 43432.4). Total num frames: 327581696. Throughput: 0: 43301.4. Samples: 230467760. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-27 15:10:43,850][06674] Avg episode reward: [(0, '0.393')] [2024-06-27 15:10:45,967][06909] Updated weights for policy 0, policy_version 20002 (0.0044) [2024-06-27 15:10:48,850][06674] Fps is (10 sec: 39321.9, 60 sec: 42871.5, 300 sec: 43264.9). Total num frames: 327778304. Throughput: 0: 43228.1. Samples: 230722200. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 15:10:48,850][06674] Avg episode reward: [(0, '0.392')] [2024-06-27 15:10:50,534][06909] Updated weights for policy 0, policy_version 20012 (0.0036) [2024-06-27 15:10:53,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43419.1, 300 sec: 43376.0). Total num frames: 328024064. Throughput: 0: 43251.0. Samples: 230979780. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 15:10:53,850][06674] Avg episode reward: [(0, '0.392')] [2024-06-27 15:10:54,440][06909] Updated weights for policy 0, policy_version 20022 (0.0032) [2024-06-27 15:10:58,003][06909] Updated weights for policy 0, policy_version 20032 (0.0027) [2024-06-27 15:10:58,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43144.5, 300 sec: 43431.5). Total num frames: 328237056. Throughput: 0: 43212.0. Samples: 231112760. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 15:10:58,850][06674] Avg episode reward: [(0, '0.394')] [2024-06-27 15:11:01,878][06909] Updated weights for policy 0, policy_version 20042 (0.0041) [2024-06-27 15:11:03,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43417.7, 300 sec: 43264.9). Total num frames: 328433664. Throughput: 0: 43423.2. Samples: 231374380. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 15:11:03,850][06674] Avg episode reward: [(0, '0.394')] [2024-06-27 15:11:05,626][06909] Updated weights for policy 0, policy_version 20052 (0.0031) [2024-06-27 15:11:08,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43420.0, 300 sec: 43375.9). Total num frames: 328679424. Throughput: 0: 43296.7. Samples: 231627720. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 15:11:08,851][06674] Avg episode reward: [(0, '0.395')] [2024-06-27 15:11:09,269][06909] Updated weights for policy 0, policy_version 20062 (0.0046) [2024-06-27 15:11:13,242][06909] Updated weights for policy 0, policy_version 20072 (0.0038) [2024-06-27 15:11:13,850][06674] Fps is (10 sec: 45874.3, 60 sec: 43417.6, 300 sec: 43375.9). Total num frames: 328892416. Throughput: 0: 43247.5. Samples: 231760720. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 15:11:13,850][06674] Avg episode reward: [(0, '0.386')] [2024-06-27 15:11:16,679][06909] Updated weights for policy 0, policy_version 20082 (0.0026) [2024-06-27 15:11:18,850][06674] Fps is (10 sec: 40960.7, 60 sec: 43417.6, 300 sec: 43264.9). Total num frames: 329089024. Throughput: 0: 43275.6. Samples: 232018160. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 15:11:18,850][06674] Avg episode reward: [(0, '0.395')] [2024-06-27 15:11:20,842][06909] Updated weights for policy 0, policy_version 20092 (0.0026) [2024-06-27 15:11:23,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43417.6, 300 sec: 43376.2). Total num frames: 329334784. Throughput: 0: 43301.4. Samples: 232275660. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 15:11:23,850][06674] Avg episode reward: [(0, '0.395')] [2024-06-27 15:11:24,066][06909] Updated weights for policy 0, policy_version 20102 (0.0038) [2024-06-27 15:11:28,418][06909] Updated weights for policy 0, policy_version 20112 (0.0035) [2024-06-27 15:11:28,852][06674] Fps is (10 sec: 44228.0, 60 sec: 43416.1, 300 sec: 43375.6). Total num frames: 329531392. Throughput: 0: 43302.5. Samples: 232416460. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 15:11:28,852][06674] Avg episode reward: [(0, '0.393')] [2024-06-27 15:11:31,503][06909] Updated weights for policy 0, policy_version 20122 (0.0029) [2024-06-27 15:11:33,850][06674] Fps is (10 sec: 39321.7, 60 sec: 43417.7, 300 sec: 43264.9). Total num frames: 329728000. Throughput: 0: 43268.0. Samples: 232669260. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 15:11:33,850][06674] Avg episode reward: [(0, '0.385')] [2024-06-27 15:11:36,122][06909] Updated weights for policy 0, policy_version 20132 (0.0045) [2024-06-27 15:11:38,850][06674] Fps is (10 sec: 45883.8, 60 sec: 43417.5, 300 sec: 43376.3). Total num frames: 329990144. Throughput: 0: 43314.0. Samples: 232928920. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 15:11:38,851][06674] Avg episode reward: [(0, '0.379')] [2024-06-27 15:11:38,977][06909] Updated weights for policy 0, policy_version 20142 (0.0032) [2024-06-27 15:11:43,602][06909] Updated weights for policy 0, policy_version 20152 (0.0047) [2024-06-27 15:11:43,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43417.6, 300 sec: 43376.0). Total num frames: 330186752. Throughput: 0: 43408.1. Samples: 233066120. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 15:11:43,850][06674] Avg episode reward: [(0, '0.396')] [2024-06-27 15:11:46,593][06909] Updated weights for policy 0, policy_version 20162 (0.0039) [2024-06-27 15:11:48,850][06674] Fps is (10 sec: 39322.2, 60 sec: 43417.6, 300 sec: 43264.9). Total num frames: 330383360. Throughput: 0: 43262.6. Samples: 233321200. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 15:11:48,850][06674] Avg episode reward: [(0, '0.392')] [2024-06-27 15:11:48,861][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000020165_330383360.pth... [2024-06-27 15:11:48,934][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000019531_319995904.pth [2024-06-27 15:11:51,040][06909] Updated weights for policy 0, policy_version 20172 (0.0042) [2024-06-27 15:11:53,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 330629120. Throughput: 0: 43262.9. Samples: 233574540. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 15:11:53,850][06674] Avg episode reward: [(0, '0.392')] [2024-06-27 15:11:54,642][06909] Updated weights for policy 0, policy_version 20182 (0.0040) [2024-06-27 15:11:58,637][06909] Updated weights for policy 0, policy_version 20192 (0.0037) [2024-06-27 15:11:58,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43417.6, 300 sec: 43376.3). Total num frames: 330842112. Throughput: 0: 43314.7. Samples: 233709880. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 15:11:58,850][06674] Avg episode reward: [(0, '0.391')] [2024-06-27 15:12:02,250][06909] Updated weights for policy 0, policy_version 20202 (0.0036) [2024-06-27 15:12:03,850][06674] Fps is (10 sec: 39321.1, 60 sec: 43144.4, 300 sec: 43209.3). Total num frames: 331022336. Throughput: 0: 43261.8. Samples: 233964940. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 15:12:03,850][06674] Avg episode reward: [(0, '0.388')] [2024-06-27 15:12:06,160][06909] Updated weights for policy 0, policy_version 20212 (0.0034) [2024-06-27 15:12:08,856][06674] Fps is (10 sec: 42573.5, 60 sec: 43140.4, 300 sec: 43375.1). Total num frames: 331268096. Throughput: 0: 43339.6. Samples: 234226200. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 15:12:08,856][06674] Avg episode reward: [(0, '0.396')] [2024-06-27 15:12:09,672][06909] Updated weights for policy 0, policy_version 20222 (0.0028) [2024-06-27 15:12:13,373][06887] Signal inference workers to stop experience collection... (3300 times) [2024-06-27 15:12:13,380][06887] Signal inference workers to resume experience collection... (3300 times) [2024-06-27 15:12:13,420][06909] InferenceWorker_p0-w0: stopping experience collection (3300 times) [2024-06-27 15:12:13,421][06909] InferenceWorker_p0-w0: resuming experience collection (3300 times) [2024-06-27 15:12:13,513][06909] Updated weights for policy 0, policy_version 20232 (0.0034) [2024-06-27 15:12:13,850][06674] Fps is (10 sec: 47513.6, 60 sec: 43417.6, 300 sec: 43375.9). Total num frames: 331497472. Throughput: 0: 43289.5. Samples: 234364400. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 15:12:13,852][06674] Avg episode reward: [(0, '0.398')] [2024-06-27 15:12:17,071][06909] Updated weights for policy 0, policy_version 20242 (0.0046) [2024-06-27 15:12:18,850][06674] Fps is (10 sec: 40984.3, 60 sec: 43144.5, 300 sec: 43264.9). Total num frames: 331677696. Throughput: 0: 43275.9. Samples: 234616680. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 15:12:18,850][06674] Avg episode reward: [(0, '0.392')] [2024-06-27 15:12:21,172][06909] Updated weights for policy 0, policy_version 20252 (0.0045) [2024-06-27 15:12:23,850][06674] Fps is (10 sec: 40960.1, 60 sec: 42871.4, 300 sec: 43320.4). Total num frames: 331907072. Throughput: 0: 43220.6. Samples: 234873840. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 15:12:23,850][06674] Avg episode reward: [(0, '0.386')] [2024-06-27 15:12:25,050][06909] Updated weights for policy 0, policy_version 20262 (0.0032) [2024-06-27 15:12:28,683][06909] Updated weights for policy 0, policy_version 20272 (0.0042) [2024-06-27 15:12:28,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43419.1, 300 sec: 43375.9). Total num frames: 332136448. Throughput: 0: 43168.8. Samples: 235008720. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-27 15:12:28,850][06674] Avg episode reward: [(0, '0.390')] [2024-06-27 15:12:32,500][06909] Updated weights for policy 0, policy_version 20282 (0.0029) [2024-06-27 15:12:33,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43417.5, 300 sec: 43264.9). Total num frames: 332333056. Throughput: 0: 43175.5. Samples: 235264100. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-27 15:12:33,850][06674] Avg episode reward: [(0, '0.394')] [2024-06-27 15:12:36,614][06909] Updated weights for policy 0, policy_version 20292 (0.0035) [2024-06-27 15:12:38,850][06674] Fps is (10 sec: 42598.3, 60 sec: 42871.6, 300 sec: 43320.4). Total num frames: 332562432. Throughput: 0: 43299.9. Samples: 235523040. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-27 15:12:38,852][06674] Avg episode reward: [(0, '0.392')] [2024-06-27 15:12:39,911][06909] Updated weights for policy 0, policy_version 20302 (0.0038) [2024-06-27 15:12:43,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43144.5, 300 sec: 43320.4). Total num frames: 332775424. Throughput: 0: 43197.4. Samples: 235653760. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-27 15:12:43,850][06674] Avg episode reward: [(0, '0.392')] [2024-06-27 15:12:44,003][06909] Updated weights for policy 0, policy_version 20312 (0.0043) [2024-06-27 15:12:47,457][06909] Updated weights for policy 0, policy_version 20322 (0.0041) [2024-06-27 15:12:48,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.6, 300 sec: 43375.9). Total num frames: 333004800. Throughput: 0: 43323.6. Samples: 235914500. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-27 15:12:48,851][06674] Avg episode reward: [(0, '0.396')] [2024-06-27 15:12:51,643][06909] Updated weights for policy 0, policy_version 20332 (0.0044) [2024-06-27 15:12:53,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43144.5, 300 sec: 43375.9). Total num frames: 333217792. Throughput: 0: 43223.9. Samples: 236171020. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-27 15:12:53,850][06674] Avg episode reward: [(0, '0.391')] [2024-06-27 15:12:55,047][06909] Updated weights for policy 0, policy_version 20342 (0.0024) [2024-06-27 15:12:58,850][06674] Fps is (10 sec: 40960.0, 60 sec: 42871.5, 300 sec: 43264.9). Total num frames: 333414400. Throughput: 0: 43073.8. Samples: 236302720. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-27 15:12:58,850][06674] Avg episode reward: [(0, '0.394')] [2024-06-27 15:12:59,449][06909] Updated weights for policy 0, policy_version 20352 (0.0050) [2024-06-27 15:13:02,747][06909] Updated weights for policy 0, policy_version 20362 (0.0040) [2024-06-27 15:13:03,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.6, 300 sec: 43264.8). Total num frames: 333643776. Throughput: 0: 43062.6. Samples: 236554500. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-27 15:13:03,851][06674] Avg episode reward: [(0, '0.390')] [2024-06-27 15:13:06,978][06909] Updated weights for policy 0, policy_version 20372 (0.0035) [2024-06-27 15:13:08,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43148.8, 300 sec: 43264.9). Total num frames: 333856768. Throughput: 0: 43160.0. Samples: 236816040. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 15:13:08,850][06674] Avg episode reward: [(0, '0.396')] [2024-06-27 15:13:10,472][06909] Updated weights for policy 0, policy_version 20382 (0.0034) [2024-06-27 15:13:13,850][06674] Fps is (10 sec: 42599.1, 60 sec: 42871.5, 300 sec: 43264.9). Total num frames: 334069760. Throughput: 0: 43001.9. Samples: 236943800. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 15:13:13,850][06674] Avg episode reward: [(0, '0.393')] [2024-06-27 15:13:14,458][06909] Updated weights for policy 0, policy_version 20392 (0.0041) [2024-06-27 15:13:17,944][06909] Updated weights for policy 0, policy_version 20402 (0.0035) [2024-06-27 15:13:18,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43417.6, 300 sec: 43209.3). Total num frames: 334282752. Throughput: 0: 42997.4. Samples: 237198980. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 15:13:18,850][06674] Avg episode reward: [(0, '0.394')] [2024-06-27 15:13:21,852][06909] Updated weights for policy 0, policy_version 20412 (0.0030) [2024-06-27 15:13:23,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43417.6, 300 sec: 43264.9). Total num frames: 334512128. Throughput: 0: 43099.1. Samples: 237462500. Policy #0 lag: (min: 1.0, avg: 10.6, max: 20.0) [2024-06-27 15:13:23,850][06674] Avg episode reward: [(0, '0.391')] [2024-06-27 15:13:25,774][06909] Updated weights for policy 0, policy_version 20422 (0.0033) [2024-06-27 15:13:28,850][06674] Fps is (10 sec: 42598.4, 60 sec: 42871.5, 300 sec: 43154.7). Total num frames: 334708736. Throughput: 0: 43109.3. Samples: 237593680. Policy #0 lag: (min: 1.0, avg: 10.6, max: 20.0) [2024-06-27 15:13:28,850][06674] Avg episode reward: [(0, '0.395')] [2024-06-27 15:13:29,459][06909] Updated weights for policy 0, policy_version 20432 (0.0041) [2024-06-27 15:13:33,483][06909] Updated weights for policy 0, policy_version 20442 (0.0031) [2024-06-27 15:13:33,855][06674] Fps is (10 sec: 40939.5, 60 sec: 43141.0, 300 sec: 43264.1). Total num frames: 334921728. Throughput: 0: 42918.4. Samples: 237846040. Policy #0 lag: (min: 1.0, avg: 10.6, max: 20.0) [2024-06-27 15:13:33,855][06674] Avg episode reward: [(0, '0.390')] [2024-06-27 15:13:37,097][06909] Updated weights for policy 0, policy_version 20452 (0.0032) [2024-06-27 15:13:38,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43144.6, 300 sec: 43209.3). Total num frames: 335151104. Throughput: 0: 43077.4. Samples: 238109500. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-27 15:13:38,850][06674] Avg episode reward: [(0, '0.399')] [2024-06-27 15:13:41,113][06909] Updated weights for policy 0, policy_version 20462 (0.0028) [2024-06-27 15:13:43,850][06674] Fps is (10 sec: 44258.5, 60 sec: 43144.5, 300 sec: 43209.3). Total num frames: 335364096. Throughput: 0: 43100.4. Samples: 238242240. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-27 15:13:43,850][06674] Avg episode reward: [(0, '0.391')] [2024-06-27 15:13:44,742][06909] Updated weights for policy 0, policy_version 20472 (0.0036) [2024-06-27 15:13:48,565][06909] Updated weights for policy 0, policy_version 20482 (0.0027) [2024-06-27 15:13:48,850][06674] Fps is (10 sec: 42598.6, 60 sec: 42871.5, 300 sec: 43264.9). Total num frames: 335577088. Throughput: 0: 43208.6. Samples: 238498880. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-27 15:13:48,850][06674] Avg episode reward: [(0, '0.393')] [2024-06-27 15:13:48,951][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000020483_335593472.pth... [2024-06-27 15:13:48,999][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000019849_325206016.pth [2024-06-27 15:13:52,103][06887] Signal inference workers to stop experience collection... (3350 times) [2024-06-27 15:13:52,104][06887] Signal inference workers to resume experience collection... (3350 times) [2024-06-27 15:13:52,129][06909] InferenceWorker_p0-w0: stopping experience collection (3350 times) [2024-06-27 15:13:52,129][06909] InferenceWorker_p0-w0: resuming experience collection (3350 times) [2024-06-27 15:13:52,251][06909] Updated weights for policy 0, policy_version 20492 (0.0030) [2024-06-27 15:13:53,856][06674] Fps is (10 sec: 42573.1, 60 sec: 42867.2, 300 sec: 43097.4). Total num frames: 335790080. Throughput: 0: 43182.2. Samples: 238759500. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 15:13:53,856][06674] Avg episode reward: [(0, '0.392')] [2024-06-27 15:13:55,984][06909] Updated weights for policy 0, policy_version 20502 (0.0027) [2024-06-27 15:13:58,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43144.5, 300 sec: 43209.8). Total num frames: 336003072. Throughput: 0: 43351.0. Samples: 238894600. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 15:13:58,850][06674] Avg episode reward: [(0, '0.392')] [2024-06-27 15:13:59,765][06909] Updated weights for policy 0, policy_version 20512 (0.0043) [2024-06-27 15:14:03,393][06909] Updated weights for policy 0, policy_version 20522 (0.0034) [2024-06-27 15:14:03,850][06674] Fps is (10 sec: 44263.9, 60 sec: 43144.7, 300 sec: 43265.5). Total num frames: 336232448. Throughput: 0: 43431.2. Samples: 239153380. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 15:14:03,850][06674] Avg episode reward: [(0, '0.392')] [2024-06-27 15:14:07,547][06909] Updated weights for policy 0, policy_version 20532 (0.0040) [2024-06-27 15:14:08,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43144.5, 300 sec: 43153.8). Total num frames: 336445440. Throughput: 0: 43190.6. Samples: 239406080. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 15:14:08,850][06674] Avg episode reward: [(0, '0.394')] [2024-06-27 15:14:11,217][06909] Updated weights for policy 0, policy_version 20542 (0.0044) [2024-06-27 15:14:13,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43144.5, 300 sec: 43209.3). Total num frames: 336658432. Throughput: 0: 43158.7. Samples: 239535820. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 15:14:13,850][06674] Avg episode reward: [(0, '0.395')] [2024-06-27 15:14:15,076][06909] Updated weights for policy 0, policy_version 20552 (0.0031) [2024-06-27 15:14:18,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43144.5, 300 sec: 43209.3). Total num frames: 336871424. Throughput: 0: 43328.8. Samples: 239795620. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 15:14:18,850][06674] Avg episode reward: [(0, '0.399')] [2024-06-27 15:14:18,865][06887] Saving new best policy, reward=0.399! [2024-06-27 15:14:18,875][06909] Updated weights for policy 0, policy_version 20562 (0.0039) [2024-06-27 15:14:22,577][06909] Updated weights for policy 0, policy_version 20572 (0.0038) [2024-06-27 15:14:23,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43144.5, 300 sec: 43209.3). Total num frames: 337100800. Throughput: 0: 43236.9. Samples: 240055160. Policy #0 lag: (min: 1.0, avg: 10.5, max: 20.0) [2024-06-27 15:14:23,850][06674] Avg episode reward: [(0, '0.398')] [2024-06-27 15:14:26,475][06909] Updated weights for policy 0, policy_version 20582 (0.0040) [2024-06-27 15:14:28,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43144.5, 300 sec: 43153.8). Total num frames: 337297408. Throughput: 0: 43203.1. Samples: 240186380. Policy #0 lag: (min: 1.0, avg: 10.5, max: 20.0) [2024-06-27 15:14:28,850][06674] Avg episode reward: [(0, '0.399')] [2024-06-27 15:14:30,051][06909] Updated weights for policy 0, policy_version 20592 (0.0036) [2024-06-27 15:14:33,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43421.3, 300 sec: 43209.3). Total num frames: 337526784. Throughput: 0: 43255.6. Samples: 240445380. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 15:14:33,850][06674] Avg episode reward: [(0, '0.399')] [2024-06-27 15:14:33,909][06909] Updated weights for policy 0, policy_version 20602 (0.0032) [2024-06-27 15:14:37,641][06909] Updated weights for policy 0, policy_version 20612 (0.0037) [2024-06-27 15:14:38,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43417.6, 300 sec: 43264.9). Total num frames: 337756160. Throughput: 0: 43256.5. Samples: 240705780. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 15:14:38,850][06674] Avg episode reward: [(0, '0.404')] [2024-06-27 15:14:38,869][06887] Saving new best policy, reward=0.404! [2024-06-27 15:14:41,758][06909] Updated weights for policy 0, policy_version 20622 (0.0030) [2024-06-27 15:14:43,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43144.6, 300 sec: 43209.3). Total num frames: 337952768. Throughput: 0: 43204.1. Samples: 240838780. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 15:14:43,850][06674] Avg episode reward: [(0, '0.398')] [2024-06-27 15:14:45,043][06909] Updated weights for policy 0, policy_version 20632 (0.0034) [2024-06-27 15:14:48,850][06674] Fps is (10 sec: 40959.4, 60 sec: 43144.4, 300 sec: 43209.6). Total num frames: 338165760. Throughput: 0: 43195.8. Samples: 241097200. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-27 15:14:48,851][06674] Avg episode reward: [(0, '0.400')] [2024-06-27 15:14:49,194][06909] Updated weights for policy 0, policy_version 20642 (0.0055) [2024-06-27 15:14:52,728][06909] Updated weights for policy 0, policy_version 20652 (0.0036) [2024-06-27 15:14:53,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43422.0, 300 sec: 43209.3). Total num frames: 338395136. Throughput: 0: 43336.6. Samples: 241356220. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-27 15:14:53,850][06674] Avg episode reward: [(0, '0.404')] [2024-06-27 15:14:56,871][06909] Updated weights for policy 0, policy_version 20662 (0.0030) [2024-06-27 15:14:58,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43417.7, 300 sec: 43320.4). Total num frames: 338608128. Throughput: 0: 43419.1. Samples: 241489680. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-27 15:14:58,850][06674] Avg episode reward: [(0, '0.400')] [2024-06-27 15:15:00,139][06909] Updated weights for policy 0, policy_version 20672 (0.0040) [2024-06-27 15:15:03,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43144.5, 300 sec: 43209.9). Total num frames: 338821120. Throughput: 0: 43326.3. Samples: 241745300. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 15:15:03,850][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:15:04,533][06909] Updated weights for policy 0, policy_version 20682 (0.0037) [2024-06-27 15:15:07,485][06909] Updated weights for policy 0, policy_version 20692 (0.0039) [2024-06-27 15:15:08,852][06674] Fps is (10 sec: 45865.7, 60 sec: 43689.2, 300 sec: 43320.1). Total num frames: 339066880. Throughput: 0: 43426.0. Samples: 242009420. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 15:15:08,852][06674] Avg episode reward: [(0, '0.403')] [2024-06-27 15:15:11,937][06909] Updated weights for policy 0, policy_version 20702 (0.0030) [2024-06-27 15:15:13,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43417.6, 300 sec: 43320.4). Total num frames: 339263488. Throughput: 0: 43498.8. Samples: 242143820. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 15:15:13,850][06674] Avg episode reward: [(0, '0.403')] [2024-06-27 15:15:15,324][06909] Updated weights for policy 0, policy_version 20712 (0.0040) [2024-06-27 15:15:18,850][06674] Fps is (10 sec: 40968.6, 60 sec: 43417.7, 300 sec: 43209.3). Total num frames: 339476480. Throughput: 0: 43375.5. Samples: 242397280. Policy #0 lag: (min: 0.0, avg: 11.6, max: 21.0) [2024-06-27 15:15:18,850][06674] Avg episode reward: [(0, '0.406')] [2024-06-27 15:15:18,995][06887] Saving new best policy, reward=0.406! [2024-06-27 15:15:19,601][06909] Updated weights for policy 0, policy_version 20722 (0.0023) [2024-06-27 15:15:23,159][06909] Updated weights for policy 0, policy_version 20732 (0.0032) [2024-06-27 15:15:23,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43417.7, 300 sec: 43320.4). Total num frames: 339705856. Throughput: 0: 43334.2. Samples: 242655820. Policy #0 lag: (min: 0.0, avg: 11.6, max: 21.0) [2024-06-27 15:15:23,850][06674] Avg episode reward: [(0, '0.408')] [2024-06-27 15:15:23,962][06887] Saving new best policy, reward=0.408! [2024-06-27 15:15:26,991][06909] Updated weights for policy 0, policy_version 20742 (0.0028) [2024-06-27 15:15:28,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43417.6, 300 sec: 43320.4). Total num frames: 339902464. Throughput: 0: 43363.5. Samples: 242790140. Policy #0 lag: (min: 0.0, avg: 11.6, max: 21.0) [2024-06-27 15:15:28,850][06674] Avg episode reward: [(0, '0.403')] [2024-06-27 15:15:30,554][06909] Updated weights for policy 0, policy_version 20752 (0.0046) [2024-06-27 15:15:33,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43690.6, 300 sec: 43264.9). Total num frames: 340148224. Throughput: 0: 43353.4. Samples: 243048100. Policy #0 lag: (min: 0.0, avg: 11.0, max: 24.0) [2024-06-27 15:15:33,850][06674] Avg episode reward: [(0, '0.401')] [2024-06-27 15:15:34,425][06909] Updated weights for policy 0, policy_version 20762 (0.0037) [2024-06-27 15:15:37,954][06909] Updated weights for policy 0, policy_version 20772 (0.0035) [2024-06-27 15:15:38,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43417.5, 300 sec: 43320.4). Total num frames: 340361216. Throughput: 0: 43311.8. Samples: 243305260. Policy #0 lag: (min: 0.0, avg: 11.0, max: 24.0) [2024-06-27 15:15:38,851][06674] Avg episode reward: [(0, '0.399')] [2024-06-27 15:15:42,316][06909] Updated weights for policy 0, policy_version 20782 (0.0029) [2024-06-27 15:15:42,972][06887] Signal inference workers to stop experience collection... (3400 times) [2024-06-27 15:15:42,972][06887] Signal inference workers to resume experience collection... (3400 times) [2024-06-27 15:15:43,011][06909] InferenceWorker_p0-w0: stopping experience collection (3400 times) [2024-06-27 15:15:43,011][06909] InferenceWorker_p0-w0: resuming experience collection (3400 times) [2024-06-27 15:15:43,850][06674] Fps is (10 sec: 39321.3, 60 sec: 43144.4, 300 sec: 43264.8). Total num frames: 340541440. Throughput: 0: 43289.2. Samples: 243437700. Policy #0 lag: (min: 0.0, avg: 11.0, max: 24.0) [2024-06-27 15:15:43,850][06674] Avg episode reward: [(0, '0.401')] [2024-06-27 15:15:45,556][06909] Updated weights for policy 0, policy_version 20792 (0.0046) [2024-06-27 15:15:48,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43417.6, 300 sec: 43209.3). Total num frames: 340770816. Throughput: 0: 43425.7. Samples: 243699460. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-27 15:15:48,850][06674] Avg episode reward: [(0, '0.401')] [2024-06-27 15:15:48,875][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000020799_340770816.pth... [2024-06-27 15:15:48,942][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000020165_330383360.pth [2024-06-27 15:15:49,893][06909] Updated weights for policy 0, policy_version 20802 (0.0028) [2024-06-27 15:15:53,032][06909] Updated weights for policy 0, policy_version 20812 (0.0037) [2024-06-27 15:15:53,850][06674] Fps is (10 sec: 47514.2, 60 sec: 43690.6, 300 sec: 43320.4). Total num frames: 341016576. Throughput: 0: 43309.6. Samples: 243958260. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-27 15:15:53,850][06674] Avg episode reward: [(0, '0.404')] [2024-06-27 15:15:57,236][06909] Updated weights for policy 0, policy_version 20822 (0.0037) [2024-06-27 15:15:58,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43144.6, 300 sec: 43264.9). Total num frames: 341196800. Throughput: 0: 43402.7. Samples: 244096940. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-27 15:15:58,850][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:16:00,852][06909] Updated weights for policy 0, policy_version 20832 (0.0031) [2024-06-27 15:16:03,850][06674] Fps is (10 sec: 39321.5, 60 sec: 43144.5, 300 sec: 43153.8). Total num frames: 341409792. Throughput: 0: 43423.1. Samples: 244351320. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 15:16:03,850][06674] Avg episode reward: [(0, '0.399')] [2024-06-27 15:16:04,786][06909] Updated weights for policy 0, policy_version 20842 (0.0035) [2024-06-27 15:16:08,389][06909] Updated weights for policy 0, policy_version 20852 (0.0037) [2024-06-27 15:16:08,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43146.0, 300 sec: 43264.9). Total num frames: 341655552. Throughput: 0: 43416.3. Samples: 244609560. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 15:16:08,851][06674] Avg episode reward: [(0, '0.399')] [2024-06-27 15:16:12,481][06909] Updated weights for policy 0, policy_version 20862 (0.0038) [2024-06-27 15:16:13,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43144.4, 300 sec: 43264.9). Total num frames: 341852160. Throughput: 0: 43327.5. Samples: 244739880. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-27 15:16:13,851][06674] Avg episode reward: [(0, '0.400')] [2024-06-27 15:16:16,214][06909] Updated weights for policy 0, policy_version 20872 (0.0025) [2024-06-27 15:16:18,852][06674] Fps is (10 sec: 42590.0, 60 sec: 43416.1, 300 sec: 43209.0). Total num frames: 342081536. Throughput: 0: 43236.7. Samples: 244993840. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-27 15:16:18,852][06674] Avg episode reward: [(0, '0.400')] [2024-06-27 15:16:20,057][06909] Updated weights for policy 0, policy_version 20882 (0.0024) [2024-06-27 15:16:23,739][06909] Updated weights for policy 0, policy_version 20892 (0.0030) [2024-06-27 15:16:23,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43144.5, 300 sec: 43265.2). Total num frames: 342294528. Throughput: 0: 43446.4. Samples: 245260340. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-27 15:16:23,850][06674] Avg episode reward: [(0, '0.397')] [2024-06-27 15:16:27,507][06909] Updated weights for policy 0, policy_version 20902 (0.0025) [2024-06-27 15:16:28,850][06674] Fps is (10 sec: 42607.1, 60 sec: 43417.6, 300 sec: 43320.4). Total num frames: 342507520. Throughput: 0: 43298.7. Samples: 245386140. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-27 15:16:28,850][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:16:31,190][06909] Updated weights for policy 0, policy_version 20912 (0.0038) [2024-06-27 15:16:33,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43144.5, 300 sec: 43209.4). Total num frames: 342736896. Throughput: 0: 43161.4. Samples: 245641720. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-27 15:16:33,850][06674] Avg episode reward: [(0, '0.403')] [2024-06-27 15:16:35,361][06909] Updated weights for policy 0, policy_version 20922 (0.0036) [2024-06-27 15:16:38,595][06909] Updated weights for policy 0, policy_version 20932 (0.0043) [2024-06-27 15:16:38,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43144.7, 300 sec: 43264.9). Total num frames: 342949888. Throughput: 0: 43291.1. Samples: 245906360. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-27 15:16:38,850][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:16:42,858][06909] Updated weights for policy 0, policy_version 20942 (0.0038) [2024-06-27 15:16:43,850][06674] Fps is (10 sec: 39321.6, 60 sec: 43144.6, 300 sec: 43209.3). Total num frames: 343130112. Throughput: 0: 43028.4. Samples: 246033220. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 15:16:43,851][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:16:46,185][06909] Updated weights for policy 0, policy_version 20952 (0.0038) [2024-06-27 15:16:48,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43417.7, 300 sec: 43209.3). Total num frames: 343375872. Throughput: 0: 42996.9. Samples: 246286180. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 15:16:48,850][06674] Avg episode reward: [(0, '0.404')] [2024-06-27 15:16:50,417][06909] Updated weights for policy 0, policy_version 20962 (0.0029) [2024-06-27 15:16:53,852][06674] Fps is (10 sec: 45865.9, 60 sec: 42870.0, 300 sec: 43209.0). Total num frames: 343588864. Throughput: 0: 43006.1. Samples: 246544920. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 15:16:53,852][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:16:53,990][06909] Updated weights for policy 0, policy_version 20972 (0.0038) [2024-06-27 15:16:57,876][06909] Updated weights for policy 0, policy_version 20982 (0.0033) [2024-06-27 15:16:58,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43144.5, 300 sec: 43264.9). Total num frames: 343785472. Throughput: 0: 42980.5. Samples: 246674000. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 15:16:58,850][06674] Avg episode reward: [(0, '0.405')] [2024-06-27 15:17:01,488][06909] Updated weights for policy 0, policy_version 20992 (0.0029) [2024-06-27 15:17:03,850][06674] Fps is (10 sec: 44245.7, 60 sec: 43690.6, 300 sec: 43265.7). Total num frames: 344031232. Throughput: 0: 43244.1. Samples: 246939740. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 15:17:03,850][06674] Avg episode reward: [(0, '0.400')] [2024-06-27 15:17:05,550][06909] Updated weights for policy 0, policy_version 21002 (0.0029) [2024-06-27 15:17:08,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43144.6, 300 sec: 43209.3). Total num frames: 344244224. Throughput: 0: 43101.8. Samples: 247199920. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 15:17:08,850][06674] Avg episode reward: [(0, '0.401')] [2024-06-27 15:17:08,923][06909] Updated weights for policy 0, policy_version 21012 (0.0033) [2024-06-27 15:17:12,943][06909] Updated weights for policy 0, policy_version 21022 (0.0029) [2024-06-27 15:17:13,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43144.5, 300 sec: 43264.9). Total num frames: 344440832. Throughput: 0: 43196.8. Samples: 247330000. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-27 15:17:13,850][06674] Avg episode reward: [(0, '0.398')] [2024-06-27 15:17:16,348][06909] Updated weights for policy 0, policy_version 21032 (0.0041) [2024-06-27 15:17:18,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43419.0, 300 sec: 43320.4). Total num frames: 344686592. Throughput: 0: 43390.6. Samples: 247594300. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-27 15:17:18,850][06674] Avg episode reward: [(0, '0.398')] [2024-06-27 15:17:20,421][06909] Updated weights for policy 0, policy_version 21042 (0.0027) [2024-06-27 15:17:23,819][06909] Updated weights for policy 0, policy_version 21052 (0.0038) [2024-06-27 15:17:23,850][06674] Fps is (10 sec: 47513.6, 60 sec: 43690.6, 300 sec: 43320.4). Total num frames: 344915968. Throughput: 0: 43270.6. Samples: 247853540. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-27 15:17:23,850][06674] Avg episode reward: [(0, '0.400')] [2024-06-27 15:17:28,278][06909] Updated weights for policy 0, policy_version 21062 (0.0028) [2024-06-27 15:17:28,856][06674] Fps is (10 sec: 40935.5, 60 sec: 43140.2, 300 sec: 43264.0). Total num frames: 345096192. Throughput: 0: 43301.8. Samples: 247982060. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 15:17:28,856][06674] Avg episode reward: [(0, '0.407')] [2024-06-27 15:17:30,181][06887] Signal inference workers to stop experience collection... (3450 times) [2024-06-27 15:17:30,227][06909] InferenceWorker_p0-w0: stopping experience collection (3450 times) [2024-06-27 15:17:30,230][06887] Signal inference workers to resume experience collection... (3450 times) [2024-06-27 15:17:30,244][06909] InferenceWorker_p0-w0: resuming experience collection (3450 times) [2024-06-27 15:17:31,259][06909] Updated weights for policy 0, policy_version 21072 (0.0027) [2024-06-27 15:17:33,851][06674] Fps is (10 sec: 40956.3, 60 sec: 43143.9, 300 sec: 43264.7). Total num frames: 345325568. Throughput: 0: 43482.2. Samples: 248242920. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 15:17:33,851][06674] Avg episode reward: [(0, '0.403')] [2024-06-27 15:17:35,630][06909] Updated weights for policy 0, policy_version 21082 (0.0033) [2024-06-27 15:17:38,647][06909] Updated weights for policy 0, policy_version 21092 (0.0028) [2024-06-27 15:17:38,850][06674] Fps is (10 sec: 47542.1, 60 sec: 43690.6, 300 sec: 43375.9). Total num frames: 345571328. Throughput: 0: 43403.3. Samples: 248497980. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 15:17:38,850][06674] Avg episode reward: [(0, '0.401')] [2024-06-27 15:17:43,712][06909] Updated weights for policy 0, policy_version 21102 (0.0042) [2024-06-27 15:17:43,850][06674] Fps is (10 sec: 40963.8, 60 sec: 43417.6, 300 sec: 43153.8). Total num frames: 345735168. Throughput: 0: 43492.9. Samples: 248631180. Policy #0 lag: (min: 1.0, avg: 10.5, max: 21.0) [2024-06-27 15:17:43,850][06674] Avg episode reward: [(0, '0.400')] [2024-06-27 15:17:46,493][06909] Updated weights for policy 0, policy_version 21112 (0.0028) [2024-06-27 15:17:48,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43417.5, 300 sec: 43264.9). Total num frames: 345980928. Throughput: 0: 43348.8. Samples: 248890440. Policy #0 lag: (min: 1.0, avg: 10.5, max: 21.0) [2024-06-27 15:17:48,851][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:17:48,864][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000021117_345980928.pth... [2024-06-27 15:17:48,916][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000020483_335593472.pth [2024-06-27 15:17:51,233][06909] Updated weights for policy 0, policy_version 21122 (0.0029) [2024-06-27 15:17:53,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43419.1, 300 sec: 43320.4). Total num frames: 346193920. Throughput: 0: 43358.2. Samples: 249151040. Policy #0 lag: (min: 1.0, avg: 10.5, max: 21.0) [2024-06-27 15:17:53,850][06674] Avg episode reward: [(0, '0.397')] [2024-06-27 15:17:54,017][06909] Updated weights for policy 0, policy_version 21132 (0.0021) [2024-06-27 15:17:58,850][06674] Fps is (10 sec: 39321.9, 60 sec: 43144.5, 300 sec: 43153.8). Total num frames: 346374144. Throughput: 0: 43262.7. Samples: 249276820. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-27 15:17:58,850][06674] Avg episode reward: [(0, '0.403')] [2024-06-27 15:17:58,923][06909] Updated weights for policy 0, policy_version 21142 (0.0046) [2024-06-27 15:18:02,014][06909] Updated weights for policy 0, policy_version 21152 (0.0026) [2024-06-27 15:18:03,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43144.6, 300 sec: 43264.9). Total num frames: 346619904. Throughput: 0: 43125.9. Samples: 249534960. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-27 15:18:03,850][06674] Avg episode reward: [(0, '0.406')] [2024-06-27 15:18:06,323][06909] Updated weights for policy 0, policy_version 21162 (0.0030) [2024-06-27 15:18:08,850][06674] Fps is (10 sec: 47514.1, 60 sec: 43417.6, 300 sec: 43320.4). Total num frames: 346849280. Throughput: 0: 43290.3. Samples: 249801600. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-27 15:18:08,850][06674] Avg episode reward: [(0, '0.405')] [2024-06-27 15:18:09,498][06909] Updated weights for policy 0, policy_version 21172 (0.0054) [2024-06-27 15:18:13,739][06909] Updated weights for policy 0, policy_version 21182 (0.0042) [2024-06-27 15:18:13,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43417.7, 300 sec: 43264.9). Total num frames: 347045888. Throughput: 0: 43261.0. Samples: 249928540. Policy #0 lag: (min: 0.0, avg: 12.3, max: 22.0) [2024-06-27 15:18:13,850][06674] Avg episode reward: [(0, '0.403')] [2024-06-27 15:18:17,075][06909] Updated weights for policy 0, policy_version 21192 (0.0041) [2024-06-27 15:18:18,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43144.6, 300 sec: 43264.9). Total num frames: 347275264. Throughput: 0: 43272.0. Samples: 250190120. Policy #0 lag: (min: 0.0, avg: 12.3, max: 22.0) [2024-06-27 15:18:18,850][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:18:21,350][06909] Updated weights for policy 0, policy_version 21202 (0.0042) [2024-06-27 15:18:23,850][06674] Fps is (10 sec: 44236.5, 60 sec: 42871.5, 300 sec: 43320.4). Total num frames: 347488256. Throughput: 0: 43409.4. Samples: 250451400. Policy #0 lag: (min: 0.0, avg: 12.3, max: 22.0) [2024-06-27 15:18:23,850][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:18:24,566][06909] Updated weights for policy 0, policy_version 21212 (0.0037) [2024-06-27 15:18:28,852][06674] Fps is (10 sec: 40953.3, 60 sec: 43147.7, 300 sec: 43265.4). Total num frames: 347684864. Throughput: 0: 43220.6. Samples: 250576180. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 15:18:28,852][06674] Avg episode reward: [(0, '0.397')] [2024-06-27 15:18:28,911][06909] Updated weights for policy 0, policy_version 21222 (0.0036) [2024-06-27 15:18:32,119][06909] Updated weights for policy 0, policy_version 21232 (0.0045) [2024-06-27 15:18:33,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43418.3, 300 sec: 43320.4). Total num frames: 347930624. Throughput: 0: 43242.4. Samples: 250836340. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 15:18:33,850][06674] Avg episode reward: [(0, '0.398')] [2024-06-27 15:18:36,301][06909] Updated weights for policy 0, policy_version 21242 (0.0037) [2024-06-27 15:18:38,850][06674] Fps is (10 sec: 44244.4, 60 sec: 42598.5, 300 sec: 43264.9). Total num frames: 348127232. Throughput: 0: 43424.5. Samples: 251105140. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 15:18:38,850][06674] Avg episode reward: [(0, '0.405')] [2024-06-27 15:18:39,532][06909] Updated weights for policy 0, policy_version 21252 (0.0030) [2024-06-27 15:18:43,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43417.7, 300 sec: 43264.9). Total num frames: 348340224. Throughput: 0: 43390.3. Samples: 251229380. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-27 15:18:43,850][06674] Avg episode reward: [(0, '0.405')] [2024-06-27 15:18:44,006][06909] Updated weights for policy 0, policy_version 21262 (0.0026) [2024-06-27 15:18:46,980][06909] Updated weights for policy 0, policy_version 21272 (0.0035) [2024-06-27 15:18:47,932][06887] Signal inference workers to stop experience collection... (3500 times) [2024-06-27 15:18:47,977][06909] InferenceWorker_p0-w0: stopping experience collection (3500 times) [2024-06-27 15:18:47,999][06887] Signal inference workers to resume experience collection... (3500 times) [2024-06-27 15:18:48,000][06909] InferenceWorker_p0-w0: resuming experience collection (3500 times) [2024-06-27 15:18:48,850][06674] Fps is (10 sec: 47512.9, 60 sec: 43690.7, 300 sec: 43432.4). Total num frames: 348602368. Throughput: 0: 43537.6. Samples: 251494160. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-27 15:18:48,856][06674] Avg episode reward: [(0, '0.400')] [2024-06-27 15:18:51,569][06909] Updated weights for policy 0, policy_version 21282 (0.0037) [2024-06-27 15:18:53,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43144.5, 300 sec: 43320.4). Total num frames: 348782592. Throughput: 0: 43407.9. Samples: 251754960. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-27 15:18:53,851][06674] Avg episode reward: [(0, '0.401')] [2024-06-27 15:18:55,160][06909] Updated weights for policy 0, policy_version 21292 (0.0034) [2024-06-27 15:18:58,850][06674] Fps is (10 sec: 37683.7, 60 sec: 43417.7, 300 sec: 43209.3). Total num frames: 348979200. Throughput: 0: 43289.8. Samples: 251876580. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 15:18:58,850][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:18:59,226][06909] Updated weights for policy 0, policy_version 21302 (0.0038) [2024-06-27 15:19:02,843][06909] Updated weights for policy 0, policy_version 21312 (0.0025) [2024-06-27 15:19:03,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43690.6, 300 sec: 43375.9). Total num frames: 349241344. Throughput: 0: 43402.2. Samples: 252143220. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 15:19:03,850][06674] Avg episode reward: [(0, '0.395')] [2024-06-27 15:19:06,806][06909] Updated weights for policy 0, policy_version 21322 (0.0043) [2024-06-27 15:19:08,850][06674] Fps is (10 sec: 45874.5, 60 sec: 43144.4, 300 sec: 43320.4). Total num frames: 349437952. Throughput: 0: 43287.9. Samples: 252399360. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 15:19:08,851][06674] Avg episode reward: [(0, '0.396')] [2024-06-27 15:19:10,230][06909] Updated weights for policy 0, policy_version 21332 (0.0026) [2024-06-27 15:19:13,850][06674] Fps is (10 sec: 39322.0, 60 sec: 43144.5, 300 sec: 43264.9). Total num frames: 349634560. Throughput: 0: 43461.2. Samples: 252531860. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-27 15:19:13,850][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:19:14,145][06909] Updated weights for policy 0, policy_version 21342 (0.0033) [2024-06-27 15:19:17,678][06909] Updated weights for policy 0, policy_version 21352 (0.0038) [2024-06-27 15:19:18,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43417.7, 300 sec: 43320.4). Total num frames: 349880320. Throughput: 0: 43615.1. Samples: 252799020. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-27 15:19:18,850][06674] Avg episode reward: [(0, '0.401')] [2024-06-27 15:19:21,516][06909] Updated weights for policy 0, policy_version 21362 (0.0036) [2024-06-27 15:19:23,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43417.6, 300 sec: 43376.0). Total num frames: 350093312. Throughput: 0: 43325.7. Samples: 253054800. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-27 15:19:23,850][06674] Avg episode reward: [(0, '0.399')] [2024-06-27 15:19:25,134][06909] Updated weights for policy 0, policy_version 21372 (0.0031) [2024-06-27 15:19:28,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43418.8, 300 sec: 43264.9). Total num frames: 350289920. Throughput: 0: 43465.3. Samples: 253185320. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-27 15:19:28,850][06674] Avg episode reward: [(0, '0.397')] [2024-06-27 15:19:29,169][06909] Updated weights for policy 0, policy_version 21382 (0.0036) [2024-06-27 15:19:32,509][06909] Updated weights for policy 0, policy_version 21392 (0.0037) [2024-06-27 15:19:33,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43144.5, 300 sec: 43264.9). Total num frames: 350519296. Throughput: 0: 43529.9. Samples: 253453000. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-27 15:19:33,850][06674] Avg episode reward: [(0, '0.393')] [2024-06-27 15:19:36,786][06909] Updated weights for policy 0, policy_version 21402 (0.0034) [2024-06-27 15:19:38,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43690.6, 300 sec: 43375.9). Total num frames: 350748672. Throughput: 0: 43450.6. Samples: 253710240. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-27 15:19:38,850][06674] Avg episode reward: [(0, '0.399')] [2024-06-27 15:19:40,249][06909] Updated weights for policy 0, policy_version 21412 (0.0044) [2024-06-27 15:19:43,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43417.6, 300 sec: 43320.4). Total num frames: 350945280. Throughput: 0: 43593.4. Samples: 253838280. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-27 15:19:43,850][06674] Avg episode reward: [(0, '0.399')] [2024-06-27 15:19:44,164][06909] Updated weights for policy 0, policy_version 21422 (0.0031) [2024-06-27 15:19:47,875][06909] Updated weights for policy 0, policy_version 21432 (0.0027) [2024-06-27 15:19:48,850][06674] Fps is (10 sec: 42598.9, 60 sec: 42871.6, 300 sec: 43320.4). Total num frames: 351174656. Throughput: 0: 43470.8. Samples: 254099400. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-27 15:19:48,850][06674] Avg episode reward: [(0, '0.404')] [2024-06-27 15:19:48,888][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000021435_351191040.pth... [2024-06-27 15:19:48,945][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000020799_340770816.pth [2024-06-27 15:19:51,636][06909] Updated weights for policy 0, policy_version 21442 (0.0027) [2024-06-27 15:19:53,850][06674] Fps is (10 sec: 45874.5, 60 sec: 43690.7, 300 sec: 43375.9). Total num frames: 351404032. Throughput: 0: 43660.5. Samples: 254364080. Policy #0 lag: (min: 0.0, avg: 12.1, max: 24.0) [2024-06-27 15:19:53,850][06674] Avg episode reward: [(0, '0.407')] [2024-06-27 15:19:55,526][06909] Updated weights for policy 0, policy_version 21452 (0.0029) [2024-06-27 15:19:58,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 43376.0). Total num frames: 351617024. Throughput: 0: 43528.9. Samples: 254490660. Policy #0 lag: (min: 0.0, avg: 12.1, max: 24.0) [2024-06-27 15:19:58,850][06674] Avg episode reward: [(0, '0.401')] [2024-06-27 15:19:58,983][06909] Updated weights for policy 0, policy_version 21462 (0.0023) [2024-06-27 15:20:02,883][06909] Updated weights for policy 0, policy_version 21472 (0.0037) [2024-06-27 15:20:03,850][06674] Fps is (10 sec: 40960.5, 60 sec: 42871.5, 300 sec: 43209.6). Total num frames: 351813632. Throughput: 0: 43431.1. Samples: 254753420. Policy #0 lag: (min: 0.0, avg: 12.1, max: 24.0) [2024-06-27 15:20:03,850][06674] Avg episode reward: [(0, '0.390')] [2024-06-27 15:20:06,880][06909] Updated weights for policy 0, policy_version 21482 (0.0041) [2024-06-27 15:20:08,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43690.7, 300 sec: 43375.9). Total num frames: 352059392. Throughput: 0: 43394.7. Samples: 255007560. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 15:20:08,850][06674] Avg episode reward: [(0, '0.396')] [2024-06-27 15:20:10,627][06909] Updated weights for policy 0, policy_version 21492 (0.0042) [2024-06-27 15:20:13,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.6, 300 sec: 43320.4). Total num frames: 352256000. Throughput: 0: 43480.0. Samples: 255141920. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 15:20:13,850][06674] Avg episode reward: [(0, '0.401')] [2024-06-27 15:20:14,475][06909] Updated weights for policy 0, policy_version 21502 (0.0029) [2024-06-27 15:20:18,148][06909] Updated weights for policy 0, policy_version 21512 (0.0035) [2024-06-27 15:20:18,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43417.6, 300 sec: 43320.4). Total num frames: 352485376. Throughput: 0: 43388.4. Samples: 255405480. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 15:20:18,850][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:20:22,130][06909] Updated weights for policy 0, policy_version 21522 (0.0031) [2024-06-27 15:20:23,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43417.7, 300 sec: 43376.0). Total num frames: 352698368. Throughput: 0: 43305.0. Samples: 255658960. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-27 15:20:23,850][06674] Avg episode reward: [(0, '0.399')] [2024-06-27 15:20:25,597][06909] Updated weights for policy 0, policy_version 21532 (0.0032) [2024-06-27 15:20:28,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43417.6, 300 sec: 43209.3). Total num frames: 352894976. Throughput: 0: 43512.4. Samples: 255796340. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-27 15:20:28,850][06674] Avg episode reward: [(0, '0.398')] [2024-06-27 15:20:29,445][06909] Updated weights for policy 0, policy_version 21542 (0.0043) [2024-06-27 15:20:33,093][06909] Updated weights for policy 0, policy_version 21552 (0.0034) [2024-06-27 15:20:33,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43417.5, 300 sec: 43264.9). Total num frames: 353124352. Throughput: 0: 43350.1. Samples: 256050160. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-27 15:20:33,850][06674] Avg episode reward: [(0, '0.403')] [2024-06-27 15:20:34,852][06887] Signal inference workers to stop experience collection... (3550 times) [2024-06-27 15:20:34,854][06887] Signal inference workers to resume experience collection... (3550 times) [2024-06-27 15:20:34,871][06909] InferenceWorker_p0-w0: stopping experience collection (3550 times) [2024-06-27 15:20:34,903][06909] InferenceWorker_p0-w0: resuming experience collection (3550 times) [2024-06-27 15:20:36,990][06909] Updated weights for policy 0, policy_version 21562 (0.0025) [2024-06-27 15:20:38,850][06674] Fps is (10 sec: 45874.5, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 353353728. Throughput: 0: 43212.8. Samples: 256308660. Policy #0 lag: (min: 0.0, avg: 12.0, max: 24.0) [2024-06-27 15:20:38,851][06674] Avg episode reward: [(0, '0.401')] [2024-06-27 15:20:40,550][06909] Updated weights for policy 0, policy_version 21572 (0.0037) [2024-06-27 15:20:43,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43417.6, 300 sec: 43320.4). Total num frames: 353550336. Throughput: 0: 43377.8. Samples: 256442660. Policy #0 lag: (min: 0.0, avg: 12.0, max: 24.0) [2024-06-27 15:20:43,850][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:20:44,694][06909] Updated weights for policy 0, policy_version 21582 (0.0042) [2024-06-27 15:20:47,914][06909] Updated weights for policy 0, policy_version 21592 (0.0031) [2024-06-27 15:20:48,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43417.5, 300 sec: 43264.8). Total num frames: 353779712. Throughput: 0: 43206.5. Samples: 256697720. Policy #0 lag: (min: 0.0, avg: 12.0, max: 24.0) [2024-06-27 15:20:48,850][06674] Avg episode reward: [(0, '0.406')] [2024-06-27 15:20:52,189][06909] Updated weights for policy 0, policy_version 21602 (0.0026) [2024-06-27 15:20:53,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 354009088. Throughput: 0: 43432.9. Samples: 256962040. Policy #0 lag: (min: 2.0, avg: 11.2, max: 23.0) [2024-06-27 15:20:53,850][06674] Avg episode reward: [(0, '0.405')] [2024-06-27 15:20:55,366][06909] Updated weights for policy 0, policy_version 21612 (0.0035) [2024-06-27 15:20:58,850][06674] Fps is (10 sec: 40960.6, 60 sec: 42871.5, 300 sec: 43320.4). Total num frames: 354189312. Throughput: 0: 43494.2. Samples: 257099160. Policy #0 lag: (min: 2.0, avg: 11.2, max: 23.0) [2024-06-27 15:20:58,850][06674] Avg episode reward: [(0, '0.397')] [2024-06-27 15:20:59,527][06909] Updated weights for policy 0, policy_version 21622 (0.0044) [2024-06-27 15:21:02,681][06909] Updated weights for policy 0, policy_version 21632 (0.0042) [2024-06-27 15:21:03,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43417.5, 300 sec: 43264.9). Total num frames: 354418688. Throughput: 0: 43376.4. Samples: 257357420. Policy #0 lag: (min: 2.0, avg: 11.2, max: 23.0) [2024-06-27 15:21:03,850][06674] Avg episode reward: [(0, '0.400')] [2024-06-27 15:21:06,905][06909] Updated weights for policy 0, policy_version 21642 (0.0032) [2024-06-27 15:21:08,850][06674] Fps is (10 sec: 47513.0, 60 sec: 43417.5, 300 sec: 43431.5). Total num frames: 354664448. Throughput: 0: 43626.5. Samples: 257622160. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 15:21:08,850][06674] Avg episode reward: [(0, '0.403')] [2024-06-27 15:21:10,665][06909] Updated weights for policy 0, policy_version 21652 (0.0036) [2024-06-27 15:21:13,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43417.5, 300 sec: 43320.7). Total num frames: 354861056. Throughput: 0: 43564.3. Samples: 257756740. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 15:21:13,850][06674] Avg episode reward: [(0, '0.400')] [2024-06-27 15:21:14,306][06909] Updated weights for policy 0, policy_version 21662 (0.0034) [2024-06-27 15:21:18,034][06909] Updated weights for policy 0, policy_version 21672 (0.0037) [2024-06-27 15:21:18,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43417.6, 300 sec: 43375.9). Total num frames: 355090432. Throughput: 0: 43594.3. Samples: 258011900. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 15:21:18,850][06674] Avg episode reward: [(0, '0.397')] [2024-06-27 15:21:22,044][06909] Updated weights for policy 0, policy_version 21682 (0.0032) [2024-06-27 15:21:23,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43417.6, 300 sec: 43376.0). Total num frames: 355303424. Throughput: 0: 43686.8. Samples: 258274560. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-27 15:21:23,850][06674] Avg episode reward: [(0, '0.396')] [2024-06-27 15:21:25,903][06909] Updated weights for policy 0, policy_version 21692 (0.0044) [2024-06-27 15:21:28,852][06674] Fps is (10 sec: 44227.4, 60 sec: 43962.2, 300 sec: 43375.6). Total num frames: 355532800. Throughput: 0: 43597.0. Samples: 258404620. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-27 15:21:28,852][06674] Avg episode reward: [(0, '0.400')] [2024-06-27 15:21:29,761][06909] Updated weights for policy 0, policy_version 21702 (0.0047) [2024-06-27 15:21:33,322][06909] Updated weights for policy 0, policy_version 21712 (0.0025) [2024-06-27 15:21:33,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43417.6, 300 sec: 43320.4). Total num frames: 355729408. Throughput: 0: 43680.0. Samples: 258663320. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-27 15:21:33,851][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:21:37,501][06909] Updated weights for policy 0, policy_version 21722 (0.0023) [2024-06-27 15:21:38,850][06674] Fps is (10 sec: 44245.8, 60 sec: 43690.7, 300 sec: 43542.6). Total num frames: 355975168. Throughput: 0: 43459.5. Samples: 258917720. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 15:21:38,850][06674] Avg episode reward: [(0, '0.392')] [2024-06-27 15:21:40,793][06909] Updated weights for policy 0, policy_version 21732 (0.0045) [2024-06-27 15:21:43,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43417.5, 300 sec: 43320.4). Total num frames: 356155392. Throughput: 0: 43332.8. Samples: 259049140. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 15:21:43,850][06674] Avg episode reward: [(0, '0.400')] [2024-06-27 15:21:44,813][06909] Updated weights for policy 0, policy_version 21742 (0.0028) [2024-06-27 15:21:48,217][06909] Updated weights for policy 0, policy_version 21752 (0.0025) [2024-06-27 15:21:48,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43417.7, 300 sec: 43376.2). Total num frames: 356384768. Throughput: 0: 43331.2. Samples: 259307320. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 15:21:48,850][06674] Avg episode reward: [(0, '0.401')] [2024-06-27 15:21:48,866][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000021752_356384768.pth... [2024-06-27 15:21:48,925][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000021117_345980928.pth [2024-06-27 15:21:52,415][06909] Updated weights for policy 0, policy_version 21762 (0.0032) [2024-06-27 15:21:53,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43144.6, 300 sec: 43431.5). Total num frames: 356597760. Throughput: 0: 43128.1. Samples: 259562920. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 15:21:53,850][06674] Avg episode reward: [(0, '0.401')] [2024-06-27 15:21:55,906][06909] Updated weights for policy 0, policy_version 21772 (0.0030) [2024-06-27 15:21:57,350][06887] Signal inference workers to stop experience collection... (3600 times) [2024-06-27 15:21:57,391][06909] InferenceWorker_p0-w0: stopping experience collection (3600 times) [2024-06-27 15:21:57,401][06887] Signal inference workers to resume experience collection... (3600 times) [2024-06-27 15:21:57,412][06909] InferenceWorker_p0-w0: resuming experience collection (3600 times) [2024-06-27 15:21:58,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.6, 300 sec: 43320.4). Total num frames: 356810752. Throughput: 0: 43054.8. Samples: 259694200. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 15:21:58,850][06674] Avg episode reward: [(0, '0.398')] [2024-06-27 15:22:00,073][06909] Updated weights for policy 0, policy_version 21782 (0.0045) [2024-06-27 15:22:03,798][06909] Updated weights for policy 0, policy_version 21792 (0.0036) [2024-06-27 15:22:03,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.7, 300 sec: 43375.9). Total num frames: 357040128. Throughput: 0: 43196.0. Samples: 259955720. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 15:22:03,850][06674] Avg episode reward: [(0, '0.399')] [2024-06-27 15:22:07,511][06909] Updated weights for policy 0, policy_version 21802 (0.0039) [2024-06-27 15:22:08,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43144.5, 300 sec: 43431.5). Total num frames: 357253120. Throughput: 0: 43030.1. Samples: 260210920. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-27 15:22:08,850][06674] Avg episode reward: [(0, '0.400')] [2024-06-27 15:22:11,234][06909] Updated weights for policy 0, policy_version 21812 (0.0047) [2024-06-27 15:22:13,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43417.7, 300 sec: 43320.4). Total num frames: 357466112. Throughput: 0: 43185.6. Samples: 260347880. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-27 15:22:13,850][06674] Avg episode reward: [(0, '0.400')] [2024-06-27 15:22:14,898][06909] Updated weights for policy 0, policy_version 21822 (0.0033) [2024-06-27 15:22:18,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43144.5, 300 sec: 43264.9). Total num frames: 357679104. Throughput: 0: 43274.3. Samples: 260610660. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-27 15:22:18,850][06674] Avg episode reward: [(0, '0.396')] [2024-06-27 15:22:19,291][06909] Updated weights for policy 0, policy_version 21832 (0.0026) [2024-06-27 15:22:22,396][06909] Updated weights for policy 0, policy_version 21842 (0.0029) [2024-06-27 15:22:23,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43417.5, 300 sec: 43432.4). Total num frames: 357908480. Throughput: 0: 43208.0. Samples: 260862080. Policy #0 lag: (min: 1.0, avg: 10.1, max: 21.0) [2024-06-27 15:22:23,850][06674] Avg episode reward: [(0, '0.394')] [2024-06-27 15:22:26,899][06909] Updated weights for policy 0, policy_version 21852 (0.0037) [2024-06-27 15:22:28,850][06674] Fps is (10 sec: 42598.5, 60 sec: 42873.0, 300 sec: 43320.6). Total num frames: 358105088. Throughput: 0: 43289.0. Samples: 260997140. Policy #0 lag: (min: 1.0, avg: 10.1, max: 21.0) [2024-06-27 15:22:28,850][06674] Avg episode reward: [(0, '0.388')] [2024-06-27 15:22:29,897][06909] Updated weights for policy 0, policy_version 21862 (0.0032) [2024-06-27 15:22:33,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43417.7, 300 sec: 43264.9). Total num frames: 358334464. Throughput: 0: 43399.1. Samples: 261260280. Policy #0 lag: (min: 1.0, avg: 10.1, max: 21.0) [2024-06-27 15:22:33,850][06674] Avg episode reward: [(0, '0.397')] [2024-06-27 15:22:34,236][06909] Updated weights for policy 0, policy_version 21872 (0.0029) [2024-06-27 15:22:37,602][06909] Updated weights for policy 0, policy_version 21882 (0.0034) [2024-06-27 15:22:38,850][06674] Fps is (10 sec: 45874.1, 60 sec: 43144.4, 300 sec: 43487.0). Total num frames: 358563840. Throughput: 0: 43354.0. Samples: 261513860. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 15:22:38,851][06674] Avg episode reward: [(0, '0.385')] [2024-06-27 15:22:41,595][06909] Updated weights for policy 0, policy_version 21892 (0.0029) [2024-06-27 15:22:43,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43417.6, 300 sec: 43320.4). Total num frames: 358760448. Throughput: 0: 43555.1. Samples: 261654180. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 15:22:43,850][06674] Avg episode reward: [(0, '0.401')] [2024-06-27 15:22:45,201][06909] Updated weights for policy 0, policy_version 21902 (0.0032) [2024-06-27 15:22:48,850][06674] Fps is (10 sec: 42599.2, 60 sec: 43417.6, 300 sec: 43375.9). Total num frames: 358989824. Throughput: 0: 43567.5. Samples: 261916260. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 15:22:48,850][06674] Avg episode reward: [(0, '0.396')] [2024-06-27 15:22:48,923][06909] Updated weights for policy 0, policy_version 21912 (0.0030) [2024-06-27 15:22:52,621][06909] Updated weights for policy 0, policy_version 21922 (0.0023) [2024-06-27 15:22:53,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43690.7, 300 sec: 43542.6). Total num frames: 359219200. Throughput: 0: 43444.6. Samples: 262165920. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 15:22:53,850][06674] Avg episode reward: [(0, '0.400')] [2024-06-27 15:22:56,675][06909] Updated weights for policy 0, policy_version 21932 (0.0038) [2024-06-27 15:22:58,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43417.6, 300 sec: 43375.9). Total num frames: 359415808. Throughput: 0: 43533.8. Samples: 262306900. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 15:22:58,850][06674] Avg episode reward: [(0, '0.405')] [2024-06-27 15:23:00,032][06909] Updated weights for policy 0, policy_version 21942 (0.0031) [2024-06-27 15:23:03,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43417.6, 300 sec: 43375.9). Total num frames: 359645184. Throughput: 0: 43525.4. Samples: 262569300. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 15:23:03,850][06674] Avg episode reward: [(0, '0.399')] [2024-06-27 15:23:03,945][06909] Updated weights for policy 0, policy_version 21952 (0.0025) [2024-06-27 15:23:07,329][06887] Signal inference workers to stop experience collection... (3650 times) [2024-06-27 15:23:07,370][06909] InferenceWorker_p0-w0: stopping experience collection (3650 times) [2024-06-27 15:23:07,377][06887] Signal inference workers to resume experience collection... (3650 times) [2024-06-27 15:23:07,392][06909] InferenceWorker_p0-w0: resuming experience collection (3650 times) [2024-06-27 15:23:07,550][06909] Updated weights for policy 0, policy_version 21962 (0.0034) [2024-06-27 15:23:08,852][06674] Fps is (10 sec: 45865.9, 60 sec: 43689.3, 300 sec: 43486.7). Total num frames: 359874560. Throughput: 0: 43692.3. Samples: 262828320. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-27 15:23:08,852][06674] Avg episode reward: [(0, '0.398')] [2024-06-27 15:23:11,466][06909] Updated weights for policy 0, policy_version 21972 (0.0027) [2024-06-27 15:23:13,850][06674] Fps is (10 sec: 42597.7, 60 sec: 43417.5, 300 sec: 43375.9). Total num frames: 360071168. Throughput: 0: 43679.4. Samples: 262962720. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-27 15:23:13,850][06674] Avg episode reward: [(0, '0.401')] [2024-06-27 15:23:15,072][06909] Updated weights for policy 0, policy_version 21982 (0.0050) [2024-06-27 15:23:18,852][06674] Fps is (10 sec: 40960.5, 60 sec: 43416.2, 300 sec: 43375.7). Total num frames: 360284160. Throughput: 0: 43560.8. Samples: 263220600. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-27 15:23:18,852][06674] Avg episode reward: [(0, '0.397')] [2024-06-27 15:23:19,155][06909] Updated weights for policy 0, policy_version 21992 (0.0040) [2024-06-27 15:23:22,612][06909] Updated weights for policy 0, policy_version 22002 (0.0038) [2024-06-27 15:23:23,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43690.6, 300 sec: 43542.8). Total num frames: 360529920. Throughput: 0: 43597.4. Samples: 263475740. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-27 15:23:23,850][06674] Avg episode reward: [(0, '0.400')] [2024-06-27 15:23:26,901][06909] Updated weights for policy 0, policy_version 22012 (0.0027) [2024-06-27 15:23:28,850][06674] Fps is (10 sec: 44245.4, 60 sec: 43690.6, 300 sec: 43375.9). Total num frames: 360726528. Throughput: 0: 43539.2. Samples: 263613440. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-27 15:23:28,850][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:23:30,073][06909] Updated weights for policy 0, policy_version 22022 (0.0036) [2024-06-27 15:23:33,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 360939520. Throughput: 0: 43500.4. Samples: 263873780. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-27 15:23:33,850][06674] Avg episode reward: [(0, '0.403')] [2024-06-27 15:23:34,411][06909] Updated weights for policy 0, policy_version 22032 (0.0036) [2024-06-27 15:23:37,544][06909] Updated weights for policy 0, policy_version 22042 (0.0032) [2024-06-27 15:23:38,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43690.8, 300 sec: 43542.5). Total num frames: 361185280. Throughput: 0: 43574.1. Samples: 264126760. Policy #0 lag: (min: 1.0, avg: 10.3, max: 22.0) [2024-06-27 15:23:38,851][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:23:41,854][06909] Updated weights for policy 0, policy_version 22052 (0.0035) [2024-06-27 15:23:43,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.7, 300 sec: 43320.4). Total num frames: 361381888. Throughput: 0: 43600.5. Samples: 264268920. Policy #0 lag: (min: 1.0, avg: 10.3, max: 22.0) [2024-06-27 15:23:43,850][06674] Avg episode reward: [(0, '0.399')] [2024-06-27 15:23:44,867][06909] Updated weights for policy 0, policy_version 22062 (0.0026) [2024-06-27 15:23:48,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 361594880. Throughput: 0: 43445.3. Samples: 264524340. Policy #0 lag: (min: 1.0, avg: 10.3, max: 22.0) [2024-06-27 15:23:48,850][06674] Avg episode reward: [(0, '0.403')] [2024-06-27 15:23:48,871][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000022070_361594880.pth... [2024-06-27 15:23:48,924][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000021435_351191040.pth [2024-06-27 15:23:49,846][06909] Updated weights for policy 0, policy_version 22072 (0.0032) [2024-06-27 15:23:52,758][06909] Updated weights for policy 0, policy_version 22082 (0.0033) [2024-06-27 15:23:53,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 361840640. Throughput: 0: 43397.6. Samples: 264781120. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-27 15:23:53,850][06674] Avg episode reward: [(0, '0.401')] [2024-06-27 15:23:57,353][06909] Updated weights for policy 0, policy_version 22092 (0.0033) [2024-06-27 15:23:58,852][06674] Fps is (10 sec: 44227.9, 60 sec: 43689.2, 300 sec: 43375.7). Total num frames: 362037248. Throughput: 0: 43305.3. Samples: 264911540. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-27 15:23:58,852][06674] Avg episode reward: [(0, '0.403')] [2024-06-27 15:24:00,466][06909] Updated weights for policy 0, policy_version 22102 (0.0025) [2024-06-27 15:24:03,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 362250240. Throughput: 0: 43281.4. Samples: 265168180. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-27 15:24:03,850][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:24:04,827][06909] Updated weights for policy 0, policy_version 22112 (0.0037) [2024-06-27 15:24:07,856][06909] Updated weights for policy 0, policy_version 22122 (0.0036) [2024-06-27 15:24:08,850][06674] Fps is (10 sec: 42606.1, 60 sec: 43145.8, 300 sec: 43487.0). Total num frames: 362463232. Throughput: 0: 43431.0. Samples: 265430140. Policy #0 lag: (min: 1.0, avg: 9.7, max: 21.0) [2024-06-27 15:24:08,851][06674] Avg episode reward: [(0, '0.399')] [2024-06-27 15:24:12,222][06909] Updated weights for policy 0, policy_version 22132 (0.0039) [2024-06-27 15:24:13,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43417.6, 300 sec: 43375.9). Total num frames: 362676224. Throughput: 0: 43453.2. Samples: 265568840. Policy #0 lag: (min: 1.0, avg: 9.7, max: 21.0) [2024-06-27 15:24:13,851][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:24:15,346][06909] Updated weights for policy 0, policy_version 22142 (0.0032) [2024-06-27 15:24:18,852][06674] Fps is (10 sec: 42590.6, 60 sec: 43417.5, 300 sec: 43375.6). Total num frames: 362889216. Throughput: 0: 43373.2. Samples: 265825660. Policy #0 lag: (min: 1.0, avg: 9.7, max: 21.0) [2024-06-27 15:24:18,853][06674] Avg episode reward: [(0, '0.400')] [2024-06-27 15:24:19,697][06909] Updated weights for policy 0, policy_version 22152 (0.0042) [2024-06-27 15:24:22,827][06909] Updated weights for policy 0, policy_version 22162 (0.0031) [2024-06-27 15:24:23,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43144.7, 300 sec: 43487.0). Total num frames: 363118592. Throughput: 0: 43441.0. Samples: 266081600. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 15:24:23,850][06674] Avg episode reward: [(0, '0.399')] [2024-06-27 15:24:27,183][06909] Updated weights for policy 0, policy_version 22172 (0.0036) [2024-06-27 15:24:28,249][06887] Signal inference workers to stop experience collection... (3700 times) [2024-06-27 15:24:28,298][06909] InferenceWorker_p0-w0: stopping experience collection (3700 times) [2024-06-27 15:24:28,364][06887] Signal inference workers to resume experience collection... (3700 times) [2024-06-27 15:24:28,364][06909] InferenceWorker_p0-w0: resuming experience collection (3700 times) [2024-06-27 15:24:28,850][06674] Fps is (10 sec: 45884.7, 60 sec: 43690.7, 300 sec: 43487.0). Total num frames: 363347968. Throughput: 0: 43293.8. Samples: 266217140. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 15:24:28,850][06674] Avg episode reward: [(0, '0.400')] [2024-06-27 15:24:30,288][06909] Updated weights for policy 0, policy_version 22182 (0.0034) [2024-06-27 15:24:33,852][06674] Fps is (10 sec: 42589.2, 60 sec: 43416.1, 300 sec: 43375.6). Total num frames: 363544576. Throughput: 0: 43363.8. Samples: 266475800. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 15:24:33,853][06674] Avg episode reward: [(0, '0.401')] [2024-06-27 15:24:34,695][06909] Updated weights for policy 0, policy_version 22192 (0.0047) [2024-06-27 15:24:37,671][06909] Updated weights for policy 0, policy_version 22202 (0.0025) [2024-06-27 15:24:38,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43144.6, 300 sec: 43487.0). Total num frames: 363773952. Throughput: 0: 43289.3. Samples: 266729140. Policy #0 lag: (min: 2.0, avg: 11.8, max: 22.0) [2024-06-27 15:24:38,850][06674] Avg episode reward: [(0, '0.395')] [2024-06-27 15:24:42,122][06909] Updated weights for policy 0, policy_version 22212 (0.0039) [2024-06-27 15:24:43,850][06674] Fps is (10 sec: 42607.0, 60 sec: 43144.4, 300 sec: 43375.9). Total num frames: 363970560. Throughput: 0: 43437.4. Samples: 266866140. Policy #0 lag: (min: 2.0, avg: 11.8, max: 22.0) [2024-06-27 15:24:43,851][06674] Avg episode reward: [(0, '0.400')] [2024-06-27 15:24:45,442][06909] Updated weights for policy 0, policy_version 22222 (0.0044) [2024-06-27 15:24:48,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43144.6, 300 sec: 43320.4). Total num frames: 364183552. Throughput: 0: 43526.7. Samples: 267126880. Policy #0 lag: (min: 2.0, avg: 11.8, max: 22.0) [2024-06-27 15:24:48,850][06674] Avg episode reward: [(0, '0.399')] [2024-06-27 15:24:49,747][06909] Updated weights for policy 0, policy_version 22232 (0.0041) [2024-06-27 15:24:52,908][06909] Updated weights for policy 0, policy_version 22242 (0.0035) [2024-06-27 15:24:53,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43144.5, 300 sec: 43431.5). Total num frames: 364429312. Throughput: 0: 43218.0. Samples: 267374940. Policy #0 lag: (min: 0.0, avg: 12.0, max: 22.0) [2024-06-27 15:24:53,850][06674] Avg episode reward: [(0, '0.398')] [2024-06-27 15:24:57,523][06909] Updated weights for policy 0, policy_version 22252 (0.0039) [2024-06-27 15:24:58,850][06674] Fps is (10 sec: 40960.0, 60 sec: 42599.9, 300 sec: 43320.4). Total num frames: 364593152. Throughput: 0: 43168.6. Samples: 267511420. Policy #0 lag: (min: 0.0, avg: 12.0, max: 22.0) [2024-06-27 15:24:58,850][06674] Avg episode reward: [(0, '0.404')] [2024-06-27 15:25:00,702][06909] Updated weights for policy 0, policy_version 22262 (0.0029) [2024-06-27 15:25:03,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43144.5, 300 sec: 43320.4). Total num frames: 364838912. Throughput: 0: 43292.2. Samples: 267773720. Policy #0 lag: (min: 0.0, avg: 12.0, max: 22.0) [2024-06-27 15:25:03,850][06674] Avg episode reward: [(0, '0.392')] [2024-06-27 15:25:05,163][06909] Updated weights for policy 0, policy_version 22272 (0.0030) [2024-06-27 15:25:08,093][06909] Updated weights for policy 0, policy_version 22282 (0.0051) [2024-06-27 15:25:08,850][06674] Fps is (10 sec: 49151.7, 60 sec: 43690.8, 300 sec: 43487.0). Total num frames: 365084672. Throughput: 0: 43181.3. Samples: 268024760. Policy #0 lag: (min: 0.0, avg: 12.2, max: 22.0) [2024-06-27 15:25:08,850][06674] Avg episode reward: [(0, '0.397')] [2024-06-27 15:25:12,550][06909] Updated weights for policy 0, policy_version 22292 (0.0026) [2024-06-27 15:25:13,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43417.7, 300 sec: 43375.9). Total num frames: 365281280. Throughput: 0: 43336.9. Samples: 268167300. Policy #0 lag: (min: 0.0, avg: 12.2, max: 22.0) [2024-06-27 15:25:13,850][06674] Avg episode reward: [(0, '0.394')] [2024-06-27 15:25:15,623][06909] Updated weights for policy 0, policy_version 22302 (0.0036) [2024-06-27 15:25:18,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43419.0, 300 sec: 43375.9). Total num frames: 365494272. Throughput: 0: 43309.0. Samples: 268424620. Policy #0 lag: (min: 0.0, avg: 12.2, max: 22.0) [2024-06-27 15:25:18,851][06674] Avg episode reward: [(0, '0.395')] [2024-06-27 15:25:19,901][06909] Updated weights for policy 0, policy_version 22312 (0.0035) [2024-06-27 15:25:23,098][06909] Updated weights for policy 0, policy_version 22322 (0.0032) [2024-06-27 15:25:23,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43690.7, 300 sec: 43542.6). Total num frames: 365740032. Throughput: 0: 43339.6. Samples: 268679420. Policy #0 lag: (min: 1.0, avg: 12.0, max: 20.0) [2024-06-27 15:25:23,850][06674] Avg episode reward: [(0, '0.393')] [2024-06-27 15:25:27,403][06909] Updated weights for policy 0, policy_version 22332 (0.0037) [2024-06-27 15:25:28,852][06674] Fps is (10 sec: 42590.4, 60 sec: 42870.0, 300 sec: 43375.7). Total num frames: 365920256. Throughput: 0: 43490.6. Samples: 268823300. Policy #0 lag: (min: 1.0, avg: 12.0, max: 20.0) [2024-06-27 15:25:28,852][06674] Avg episode reward: [(0, '0.400')] [2024-06-27 15:25:30,600][06909] Updated weights for policy 0, policy_version 22342 (0.0043) [2024-06-27 15:25:32,738][06887] Signal inference workers to stop experience collection... (3750 times) [2024-06-27 15:25:32,777][06909] InferenceWorker_p0-w0: stopping experience collection (3750 times) [2024-06-27 15:25:32,795][06887] Signal inference workers to resume experience collection... (3750 times) [2024-06-27 15:25:32,798][06909] InferenceWorker_p0-w0: resuming experience collection (3750 times) [2024-06-27 15:25:33,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43419.1, 300 sec: 43376.0). Total num frames: 366149632. Throughput: 0: 43318.1. Samples: 269076200. Policy #0 lag: (min: 1.0, avg: 12.0, max: 20.0) [2024-06-27 15:25:33,851][06674] Avg episode reward: [(0, '0.399')] [2024-06-27 15:25:34,738][06909] Updated weights for policy 0, policy_version 22352 (0.0025) [2024-06-27 15:25:38,213][06909] Updated weights for policy 0, policy_version 22362 (0.0030) [2024-06-27 15:25:38,850][06674] Fps is (10 sec: 47522.9, 60 sec: 43690.6, 300 sec: 43542.5). Total num frames: 366395392. Throughput: 0: 43533.8. Samples: 269333960. Policy #0 lag: (min: 1.0, avg: 12.3, max: 23.0) [2024-06-27 15:25:38,850][06674] Avg episode reward: [(0, '0.397')] [2024-06-27 15:25:42,130][06909] Updated weights for policy 0, policy_version 22372 (0.0033) [2024-06-27 15:25:43,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43144.6, 300 sec: 43320.4). Total num frames: 366559232. Throughput: 0: 43700.5. Samples: 269477940. Policy #0 lag: (min: 1.0, avg: 12.3, max: 23.0) [2024-06-27 15:25:43,850][06674] Avg episode reward: [(0, '0.396')] [2024-06-27 15:25:45,632][06909] Updated weights for policy 0, policy_version 22382 (0.0033) [2024-06-27 15:25:48,850][06674] Fps is (10 sec: 39321.7, 60 sec: 43417.5, 300 sec: 43320.4). Total num frames: 366788608. Throughput: 0: 43584.9. Samples: 269735040. Policy #0 lag: (min: 1.0, avg: 12.3, max: 23.0) [2024-06-27 15:25:48,850][06674] Avg episode reward: [(0, '0.400')] [2024-06-27 15:25:48,884][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000022388_366804992.pth... [2024-06-27 15:25:48,944][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000021752_356384768.pth [2024-06-27 15:25:49,866][06909] Updated weights for policy 0, policy_version 22392 (0.0039) [2024-06-27 15:25:53,100][06909] Updated weights for policy 0, policy_version 22402 (0.0032) [2024-06-27 15:25:53,850][06674] Fps is (10 sec: 49151.2, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 367050752. Throughput: 0: 43687.0. Samples: 269990680. Policy #0 lag: (min: 0.0, avg: 12.1, max: 20.0) [2024-06-27 15:25:53,850][06674] Avg episode reward: [(0, '0.398')] [2024-06-27 15:25:57,525][06909] Updated weights for policy 0, policy_version 22412 (0.0037) [2024-06-27 15:25:58,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.7, 300 sec: 43431.5). Total num frames: 367230976. Throughput: 0: 43588.0. Samples: 270128760. Policy #0 lag: (min: 0.0, avg: 12.1, max: 20.0) [2024-06-27 15:25:58,850][06674] Avg episode reward: [(0, '0.399')] [2024-06-27 15:26:00,945][06909] Updated weights for policy 0, policy_version 22422 (0.0038) [2024-06-27 15:26:03,852][06674] Fps is (10 sec: 40951.9, 60 sec: 43689.2, 300 sec: 43375.7). Total num frames: 367460352. Throughput: 0: 43596.8. Samples: 270386560. Policy #0 lag: (min: 0.0, avg: 12.1, max: 20.0) [2024-06-27 15:26:03,852][06674] Avg episode reward: [(0, '0.398')] [2024-06-27 15:26:05,417][06909] Updated weights for policy 0, policy_version 22432 (0.0041) [2024-06-27 15:26:08,464][06909] Updated weights for policy 0, policy_version 22442 (0.0043) [2024-06-27 15:26:08,854][06674] Fps is (10 sec: 47492.1, 60 sec: 43687.4, 300 sec: 43541.9). Total num frames: 367706112. Throughput: 0: 43617.4. Samples: 270642400. Policy #0 lag: (min: 0.0, avg: 12.1, max: 20.0) [2024-06-27 15:26:08,855][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:26:12,829][06909] Updated weights for policy 0, policy_version 22452 (0.0028) [2024-06-27 15:26:13,850][06674] Fps is (10 sec: 42607.3, 60 sec: 43417.6, 300 sec: 43375.9). Total num frames: 367886336. Throughput: 0: 43450.4. Samples: 270778480. Policy #0 lag: (min: 0.0, avg: 12.2, max: 22.0) [2024-06-27 15:26:13,850][06674] Avg episode reward: [(0, '0.399')] [2024-06-27 15:26:15,882][06909] Updated weights for policy 0, policy_version 22462 (0.0027) [2024-06-27 15:26:18,852][06674] Fps is (10 sec: 39331.1, 60 sec: 43416.2, 300 sec: 43375.6). Total num frames: 368099328. Throughput: 0: 43527.4. Samples: 271035020. Policy #0 lag: (min: 0.0, avg: 12.2, max: 22.0) [2024-06-27 15:26:18,853][06674] Avg episode reward: [(0, '0.397')] [2024-06-27 15:26:20,381][06909] Updated weights for policy 0, policy_version 22472 (0.0044) [2024-06-27 15:26:23,361][06909] Updated weights for policy 0, policy_version 22482 (0.0031) [2024-06-27 15:26:23,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43417.6, 300 sec: 43431.8). Total num frames: 368345088. Throughput: 0: 43419.2. Samples: 271287820. Policy #0 lag: (min: 0.0, avg: 12.2, max: 22.0) [2024-06-27 15:26:23,850][06674] Avg episode reward: [(0, '0.401')] [2024-06-27 15:26:27,814][06909] Updated weights for policy 0, policy_version 22492 (0.0043) [2024-06-27 15:26:28,850][06674] Fps is (10 sec: 45884.7, 60 sec: 43965.2, 300 sec: 43487.0). Total num frames: 368558080. Throughput: 0: 43392.8. Samples: 271430620. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 15:26:28,850][06674] Avg episode reward: [(0, '0.403')] [2024-06-27 15:26:30,991][06909] Updated weights for policy 0, policy_version 22502 (0.0032) [2024-06-27 15:26:33,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43417.6, 300 sec: 43320.4). Total num frames: 368754688. Throughput: 0: 43549.4. Samples: 271694760. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 15:26:33,850][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:26:35,164][06909] Updated weights for policy 0, policy_version 22512 (0.0037) [2024-06-27 15:26:38,387][06909] Updated weights for policy 0, policy_version 22522 (0.0026) [2024-06-27 15:26:38,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43417.6, 300 sec: 43542.6). Total num frames: 369000448. Throughput: 0: 43530.3. Samples: 271949540. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 15:26:38,850][06674] Avg episode reward: [(0, '0.394')] [2024-06-27 15:26:42,529][06909] Updated weights for policy 0, policy_version 22532 (0.0035) [2024-06-27 15:26:43,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.7, 300 sec: 43431.5). Total num frames: 369197056. Throughput: 0: 43613.8. Samples: 272091380. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-27 15:26:43,850][06674] Avg episode reward: [(0, '0.405')] [2024-06-27 15:26:45,721][06909] Updated weights for policy 0, policy_version 22542 (0.0034) [2024-06-27 15:26:48,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43690.6, 300 sec: 43431.5). Total num frames: 369410048. Throughput: 0: 43540.6. Samples: 272345800. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-27 15:26:48,850][06674] Avg episode reward: [(0, '0.398')] [2024-06-27 15:26:49,703][06887] Signal inference workers to stop experience collection... (3800 times) [2024-06-27 15:26:49,743][06909] InferenceWorker_p0-w0: stopping experience collection (3800 times) [2024-06-27 15:26:49,750][06887] Signal inference workers to resume experience collection... (3800 times) [2024-06-27 15:26:49,756][06909] InferenceWorker_p0-w0: resuming experience collection (3800 times) [2024-06-27 15:26:50,097][06909] Updated weights for policy 0, policy_version 22552 (0.0028) [2024-06-27 15:26:53,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43144.6, 300 sec: 43487.0). Total num frames: 369639424. Throughput: 0: 43525.7. Samples: 272600860. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-27 15:26:53,850][06674] Avg episode reward: [(0, '0.399')] [2024-06-27 15:26:53,929][06909] Updated weights for policy 0, policy_version 22562 (0.0029) [2024-06-27 15:26:57,991][06909] Updated weights for policy 0, policy_version 22572 (0.0047) [2024-06-27 15:26:58,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.6, 300 sec: 43431.5). Total num frames: 369852416. Throughput: 0: 43582.1. Samples: 272739680. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-27 15:26:58,851][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:27:01,285][06909] Updated weights for policy 0, policy_version 22582 (0.0027) [2024-06-27 15:27:03,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43146.1, 300 sec: 43376.0). Total num frames: 370049024. Throughput: 0: 43632.8. Samples: 272998400. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-27 15:27:03,850][06674] Avg episode reward: [(0, '0.404')] [2024-06-27 15:27:05,418][06909] Updated weights for policy 0, policy_version 22592 (0.0023) [2024-06-27 15:27:08,670][06909] Updated weights for policy 0, policy_version 22602 (0.0041) [2024-06-27 15:27:08,852][06674] Fps is (10 sec: 45866.4, 60 sec: 43419.4, 300 sec: 43542.3). Total num frames: 370311168. Throughput: 0: 43626.5. Samples: 273251100. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-27 15:27:08,852][06674] Avg episode reward: [(0, '0.399')] [2024-06-27 15:27:13,000][06909] Updated weights for policy 0, policy_version 22612 (0.0036) [2024-06-27 15:27:13,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43690.7, 300 sec: 43487.0). Total num frames: 370507776. Throughput: 0: 43500.5. Samples: 273388140. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-27 15:27:13,850][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:27:16,099][06909] Updated weights for policy 0, policy_version 22622 (0.0029) [2024-06-27 15:27:18,850][06674] Fps is (10 sec: 40968.1, 60 sec: 43692.1, 300 sec: 43431.5). Total num frames: 370720768. Throughput: 0: 43460.4. Samples: 273650480. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-27 15:27:18,850][06674] Avg episode reward: [(0, '0.401')] [2024-06-27 15:27:20,367][06909] Updated weights for policy 0, policy_version 22632 (0.0031) [2024-06-27 15:27:23,435][06909] Updated weights for policy 0, policy_version 22642 (0.0022) [2024-06-27 15:27:23,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 370966528. Throughput: 0: 43459.1. Samples: 273905200. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-27 15:27:23,851][06674] Avg episode reward: [(0, '0.401')] [2024-06-27 15:27:27,783][06909] Updated weights for policy 0, policy_version 22652 (0.0029) [2024-06-27 15:27:28,851][06674] Fps is (10 sec: 45869.4, 60 sec: 43689.7, 300 sec: 43542.4). Total num frames: 371179520. Throughput: 0: 43423.1. Samples: 274045480. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-27 15:27:28,852][06674] Avg episode reward: [(0, '0.401')] [2024-06-27 15:27:30,830][06909] Updated weights for policy 0, policy_version 22662 (0.0026) [2024-06-27 15:27:33,850][06674] Fps is (10 sec: 39322.0, 60 sec: 43417.6, 300 sec: 43376.0). Total num frames: 371359744. Throughput: 0: 43591.7. Samples: 274307420. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-27 15:27:33,850][06674] Avg episode reward: [(0, '0.397')] [2024-06-27 15:27:35,389][06909] Updated weights for policy 0, policy_version 22672 (0.0043) [2024-06-27 15:27:38,552][06909] Updated weights for policy 0, policy_version 22682 (0.0031) [2024-06-27 15:27:38,850][06674] Fps is (10 sec: 44242.3, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 371621888. Throughput: 0: 43385.7. Samples: 274553220. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-27 15:27:38,850][06674] Avg episode reward: [(0, '0.401')] [2024-06-27 15:27:42,879][06909] Updated weights for policy 0, policy_version 22692 (0.0019) [2024-06-27 15:27:43,850][06674] Fps is (10 sec: 47513.3, 60 sec: 43963.7, 300 sec: 43542.6). Total num frames: 371834880. Throughput: 0: 43521.4. Samples: 274698140. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-27 15:27:43,850][06674] Avg episode reward: [(0, '0.400')] [2024-06-27 15:27:45,881][06909] Updated weights for policy 0, policy_version 22702 (0.0029) [2024-06-27 15:27:48,850][06674] Fps is (10 sec: 37683.7, 60 sec: 43144.6, 300 sec: 43320.4). Total num frames: 371998720. Throughput: 0: 43632.0. Samples: 274961840. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-27 15:27:48,850][06674] Avg episode reward: [(0, '0.400')] [2024-06-27 15:27:48,928][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000022706_372015104.pth... [2024-06-27 15:27:48,975][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000022070_361594880.pth [2024-06-27 15:27:50,421][06909] Updated weights for policy 0, policy_version 22712 (0.0029) [2024-06-27 15:27:53,472][06909] Updated weights for policy 0, policy_version 22722 (0.0028) [2024-06-27 15:27:53,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.7, 300 sec: 43598.1). Total num frames: 372277248. Throughput: 0: 43597.6. Samples: 275212900. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-27 15:27:53,850][06674] Avg episode reward: [(0, '0.392')] [2024-06-27 15:27:57,064][06887] Signal inference workers to stop experience collection... (3850 times) [2024-06-27 15:27:57,065][06887] Signal inference workers to resume experience collection... (3850 times) [2024-06-27 15:27:57,103][06909] InferenceWorker_p0-w0: stopping experience collection (3850 times) [2024-06-27 15:27:57,103][06909] InferenceWorker_p0-w0: resuming experience collection (3850 times) [2024-06-27 15:27:58,141][06909] Updated weights for policy 0, policy_version 22732 (0.0027) [2024-06-27 15:27:58,850][06674] Fps is (10 sec: 47512.7, 60 sec: 43690.6, 300 sec: 43487.0). Total num frames: 372473856. Throughput: 0: 43785.1. Samples: 275358480. Policy #0 lag: (min: 0.0, avg: 10.1, max: 25.0) [2024-06-27 15:27:58,851][06674] Avg episode reward: [(0, '0.395')] [2024-06-27 15:28:00,992][06909] Updated weights for policy 0, policy_version 22742 (0.0038) [2024-06-27 15:28:03,850][06674] Fps is (10 sec: 37683.0, 60 sec: 43417.5, 300 sec: 43320.7). Total num frames: 372654080. Throughput: 0: 43653.4. Samples: 275614880. Policy #0 lag: (min: 0.0, avg: 10.1, max: 25.0) [2024-06-27 15:28:03,850][06674] Avg episode reward: [(0, '0.399')] [2024-06-27 15:28:05,767][06909] Updated weights for policy 0, policy_version 22752 (0.0030) [2024-06-27 15:28:08,330][06909] Updated weights for policy 0, policy_version 22762 (0.0028) [2024-06-27 15:28:08,850][06674] Fps is (10 sec: 45876.0, 60 sec: 43692.2, 300 sec: 43598.1). Total num frames: 372932608. Throughput: 0: 43415.6. Samples: 275858900. Policy #0 lag: (min: 0.0, avg: 10.1, max: 25.0) [2024-06-27 15:28:08,850][06674] Avg episode reward: [(0, '0.398')] [2024-06-27 15:28:13,390][06909] Updated weights for policy 0, policy_version 22772 (0.0030) [2024-06-27 15:28:13,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43417.6, 300 sec: 43487.3). Total num frames: 373112832. Throughput: 0: 43579.5. Samples: 276006500. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-27 15:28:13,850][06674] Avg episode reward: [(0, '0.397')] [2024-06-27 15:28:16,352][06909] Updated weights for policy 0, policy_version 22782 (0.0051) [2024-06-27 15:28:18,850][06674] Fps is (10 sec: 37682.9, 60 sec: 43144.5, 300 sec: 43320.4). Total num frames: 373309440. Throughput: 0: 43376.3. Samples: 276259360. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-27 15:28:18,850][06674] Avg episode reward: [(0, '0.397')] [2024-06-27 15:28:20,874][06909] Updated weights for policy 0, policy_version 22792 (0.0029) [2024-06-27 15:28:23,806][06909] Updated weights for policy 0, policy_version 22802 (0.0022) [2024-06-27 15:28:23,850][06674] Fps is (10 sec: 47512.7, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 373587968. Throughput: 0: 43440.4. Samples: 276508040. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-27 15:28:23,851][06674] Avg episode reward: [(0, '0.397')] [2024-06-27 15:28:28,248][06909] Updated weights for policy 0, policy_version 22812 (0.0042) [2024-06-27 15:28:28,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43145.5, 300 sec: 43487.0). Total num frames: 373768192. Throughput: 0: 43492.4. Samples: 276655300. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-27 15:28:28,850][06674] Avg episode reward: [(0, '0.390')] [2024-06-27 15:28:31,177][06909] Updated weights for policy 0, policy_version 22822 (0.0038) [2024-06-27 15:28:33,850][06674] Fps is (10 sec: 37683.8, 60 sec: 43417.6, 300 sec: 43320.4). Total num frames: 373964800. Throughput: 0: 43355.1. Samples: 276912820. Policy #0 lag: (min: 1.0, avg: 10.1, max: 21.0) [2024-06-27 15:28:33,850][06674] Avg episode reward: [(0, '0.399')] [2024-06-27 15:28:35,793][06909] Updated weights for policy 0, policy_version 22832 (0.0044) [2024-06-27 15:28:38,801][06909] Updated weights for policy 0, policy_version 22842 (0.0035) [2024-06-27 15:28:38,850][06674] Fps is (10 sec: 47513.8, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 374243328. Throughput: 0: 43305.8. Samples: 277161660. Policy #0 lag: (min: 1.0, avg: 10.1, max: 21.0) [2024-06-27 15:28:38,850][06674] Avg episode reward: [(0, '0.397')] [2024-06-27 15:28:43,442][06909] Updated weights for policy 0, policy_version 22852 (0.0035) [2024-06-27 15:28:43,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43144.6, 300 sec: 43487.0). Total num frames: 374423552. Throughput: 0: 43262.4. Samples: 277305280. Policy #0 lag: (min: 1.0, avg: 10.1, max: 21.0) [2024-06-27 15:28:43,850][06674] Avg episode reward: [(0, '0.407')] [2024-06-27 15:28:46,621][06909] Updated weights for policy 0, policy_version 22862 (0.0030) [2024-06-27 15:28:48,850][06674] Fps is (10 sec: 37682.8, 60 sec: 43690.6, 300 sec: 43320.4). Total num frames: 374620160. Throughput: 0: 43234.2. Samples: 277560420. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-27 15:28:48,850][06674] Avg episode reward: [(0, '0.404')] [2024-06-27 15:28:51,010][06909] Updated weights for policy 0, policy_version 22872 (0.0036) [2024-06-27 15:28:53,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43417.6, 300 sec: 43542.9). Total num frames: 374882304. Throughput: 0: 43367.0. Samples: 277810420. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-27 15:28:53,850][06674] Avg episode reward: [(0, '0.399')] [2024-06-27 15:28:54,241][06909] Updated weights for policy 0, policy_version 22882 (0.0038) [2024-06-27 15:28:58,722][06909] Updated weights for policy 0, policy_version 22892 (0.0027) [2024-06-27 15:28:58,850][06674] Fps is (10 sec: 44237.6, 60 sec: 43144.7, 300 sec: 43431.5). Total num frames: 375062528. Throughput: 0: 43268.1. Samples: 277953560. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-27 15:28:58,850][06674] Avg episode reward: [(0, '0.404')] [2024-06-27 15:29:02,282][06909] Updated weights for policy 0, policy_version 22902 (0.0037) [2024-06-27 15:29:03,850][06674] Fps is (10 sec: 39321.5, 60 sec: 43690.7, 300 sec: 43431.5). Total num frames: 375275520. Throughput: 0: 43348.5. Samples: 278210040. Policy #0 lag: (min: 1.0, avg: 12.3, max: 20.0) [2024-06-27 15:29:03,850][06674] Avg episode reward: [(0, '0.404')] [2024-06-27 15:29:06,303][06909] Updated weights for policy 0, policy_version 22912 (0.0028) [2024-06-27 15:29:06,960][06887] Signal inference workers to stop experience collection... (3900 times) [2024-06-27 15:29:06,962][06887] Signal inference workers to resume experience collection... (3900 times) [2024-06-27 15:29:06,981][06909] InferenceWorker_p0-w0: stopping experience collection (3900 times) [2024-06-27 15:29:06,981][06909] InferenceWorker_p0-w0: resuming experience collection (3900 times) [2024-06-27 15:29:08,852][06674] Fps is (10 sec: 47503.2, 60 sec: 43416.1, 300 sec: 43597.8). Total num frames: 375537664. Throughput: 0: 43388.8. Samples: 278460620. Policy #0 lag: (min: 1.0, avg: 12.3, max: 20.0) [2024-06-27 15:29:08,853][06674] Avg episode reward: [(0, '0.400')] [2024-06-27 15:29:09,986][06909] Updated weights for policy 0, policy_version 22922 (0.0034) [2024-06-27 15:29:13,802][06909] Updated weights for policy 0, policy_version 22932 (0.0036) [2024-06-27 15:29:13,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43417.5, 300 sec: 43487.3). Total num frames: 375717888. Throughput: 0: 43231.4. Samples: 278600720. Policy #0 lag: (min: 1.0, avg: 12.3, max: 20.0) [2024-06-27 15:29:13,851][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:29:17,731][06909] Updated weights for policy 0, policy_version 22942 (0.0030) [2024-06-27 15:29:18,856][06674] Fps is (10 sec: 39306.0, 60 sec: 43686.3, 300 sec: 43430.6). Total num frames: 375930880. Throughput: 0: 43286.2. Samples: 278860960. Policy #0 lag: (min: 0.0, avg: 13.0, max: 26.0) [2024-06-27 15:29:18,856][06674] Avg episode reward: [(0, '0.396')] [2024-06-27 15:29:21,411][06909] Updated weights for policy 0, policy_version 22952 (0.0041) [2024-06-27 15:29:23,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43144.6, 300 sec: 43487.0). Total num frames: 376176640. Throughput: 0: 43139.1. Samples: 279102920. Policy #0 lag: (min: 0.0, avg: 13.0, max: 26.0) [2024-06-27 15:29:23,850][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:29:25,350][06909] Updated weights for policy 0, policy_version 22962 (0.0043) [2024-06-27 15:29:28,850][06674] Fps is (10 sec: 42624.1, 60 sec: 43144.5, 300 sec: 43431.8). Total num frames: 376356864. Throughput: 0: 43153.7. Samples: 279247200. Policy #0 lag: (min: 0.0, avg: 13.0, max: 26.0) [2024-06-27 15:29:28,850][06674] Avg episode reward: [(0, '0.398')] [2024-06-27 15:29:28,987][06909] Updated weights for policy 0, policy_version 22972 (0.0033) [2024-06-27 15:29:32,808][06909] Updated weights for policy 0, policy_version 22982 (0.0031) [2024-06-27 15:29:33,850][06674] Fps is (10 sec: 39322.0, 60 sec: 43417.6, 300 sec: 43376.0). Total num frames: 376569856. Throughput: 0: 43219.3. Samples: 279505280. Policy #0 lag: (min: 0.0, avg: 12.9, max: 21.0) [2024-06-27 15:29:33,859][06674] Avg episode reward: [(0, '0.396')] [2024-06-27 15:29:36,510][06909] Updated weights for policy 0, policy_version 22992 (0.0038) [2024-06-27 15:29:38,850][06674] Fps is (10 sec: 47513.2, 60 sec: 43144.4, 300 sec: 43598.1). Total num frames: 376832000. Throughput: 0: 43261.7. Samples: 279757200. Policy #0 lag: (min: 0.0, avg: 12.9, max: 21.0) [2024-06-27 15:29:38,859][06674] Avg episode reward: [(0, '0.403')] [2024-06-27 15:29:40,582][06909] Updated weights for policy 0, policy_version 23002 (0.0032) [2024-06-27 15:29:43,850][06674] Fps is (10 sec: 42598.2, 60 sec: 42871.5, 300 sec: 43431.5). Total num frames: 376995840. Throughput: 0: 43263.9. Samples: 279900440. Policy #0 lag: (min: 0.0, avg: 12.9, max: 21.0) [2024-06-27 15:29:43,850][06674] Avg episode reward: [(0, '0.401')] [2024-06-27 15:29:44,092][06909] Updated weights for policy 0, policy_version 23012 (0.0031) [2024-06-27 15:29:47,993][06909] Updated weights for policy 0, policy_version 23022 (0.0034) [2024-06-27 15:29:48,850][06674] Fps is (10 sec: 37683.7, 60 sec: 43144.6, 300 sec: 43320.4). Total num frames: 377208832. Throughput: 0: 43247.2. Samples: 280156160. Policy #0 lag: (min: 0.0, avg: 11.7, max: 21.0) [2024-06-27 15:29:48,850][06674] Avg episode reward: [(0, '0.397')] [2024-06-27 15:29:48,958][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000023024_377225216.pth... [2024-06-27 15:29:49,037][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000022388_366804992.pth [2024-06-27 15:29:51,612][06909] Updated weights for policy 0, policy_version 23032 (0.0048) [2024-06-27 15:29:53,850][06674] Fps is (10 sec: 49151.5, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 377487360. Throughput: 0: 43245.9. Samples: 280406600. Policy #0 lag: (min: 0.0, avg: 11.7, max: 21.0) [2024-06-27 15:29:53,850][06674] Avg episode reward: [(0, '0.396')] [2024-06-27 15:29:55,475][06909] Updated weights for policy 0, policy_version 23042 (0.0037) [2024-06-27 15:29:58,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43417.5, 300 sec: 43487.0). Total num frames: 377667584. Throughput: 0: 43360.6. Samples: 280551940. Policy #0 lag: (min: 0.0, avg: 11.7, max: 21.0) [2024-06-27 15:29:58,856][06674] Avg episode reward: [(0, '0.393')] [2024-06-27 15:29:58,993][06909] Updated weights for policy 0, policy_version 23052 (0.0027) [2024-06-27 15:30:02,940][06909] Updated weights for policy 0, policy_version 23062 (0.0028) [2024-06-27 15:30:03,850][06674] Fps is (10 sec: 37683.4, 60 sec: 43144.5, 300 sec: 43320.4). Total num frames: 377864192. Throughput: 0: 43391.6. Samples: 280813320. Policy #0 lag: (min: 1.0, avg: 9.7, max: 21.0) [2024-06-27 15:30:03,851][06674] Avg episode reward: [(0, '0.395')] [2024-06-27 15:30:06,611][06909] Updated weights for policy 0, policy_version 23072 (0.0031) [2024-06-27 15:30:08,852][06674] Fps is (10 sec: 47503.8, 60 sec: 43417.6, 300 sec: 43597.8). Total num frames: 378142720. Throughput: 0: 43515.8. Samples: 281061220. Policy #0 lag: (min: 1.0, avg: 9.7, max: 21.0) [2024-06-27 15:30:08,852][06674] Avg episode reward: [(0, '0.400')] [2024-06-27 15:30:10,595][06909] Updated weights for policy 0, policy_version 23082 (0.0028) [2024-06-27 15:30:13,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43144.7, 300 sec: 43431.5). Total num frames: 378306560. Throughput: 0: 43485.4. Samples: 281204040. Policy #0 lag: (min: 1.0, avg: 9.7, max: 21.0) [2024-06-27 15:30:13,850][06674] Avg episode reward: [(0, '0.397')] [2024-06-27 15:30:14,334][06909] Updated weights for policy 0, policy_version 23092 (0.0033) [2024-06-27 15:30:15,860][06887] Signal inference workers to stop experience collection... (3950 times) [2024-06-27 15:30:15,914][06887] Signal inference workers to resume experience collection... (3950 times) [2024-06-27 15:30:15,915][06909] InferenceWorker_p0-w0: stopping experience collection (3950 times) [2024-06-27 15:30:15,941][06909] InferenceWorker_p0-w0: resuming experience collection (3950 times) [2024-06-27 15:30:18,345][06909] Updated weights for policy 0, policy_version 23102 (0.0033) [2024-06-27 15:30:18,850][06674] Fps is (10 sec: 39329.9, 60 sec: 43422.0, 300 sec: 43375.9). Total num frames: 378535936. Throughput: 0: 43518.2. Samples: 281463600. Policy #0 lag: (min: 1.0, avg: 8.9, max: 21.0) [2024-06-27 15:30:18,850][06674] Avg episode reward: [(0, '0.439')] [2024-06-27 15:30:18,947][06887] Saving new best policy, reward=0.439! [2024-06-27 15:30:21,679][06909] Updated weights for policy 0, policy_version 23112 (0.0032) [2024-06-27 15:30:23,850][06674] Fps is (10 sec: 49151.2, 60 sec: 43690.6, 300 sec: 43653.9). Total num frames: 378798080. Throughput: 0: 43459.2. Samples: 281712860. Policy #0 lag: (min: 1.0, avg: 8.9, max: 21.0) [2024-06-27 15:30:23,851][06674] Avg episode reward: [(0, '0.397')] [2024-06-27 15:30:25,869][06909] Updated weights for policy 0, policy_version 23122 (0.0040) [2024-06-27 15:30:28,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 378961920. Throughput: 0: 43436.4. Samples: 281855080. Policy #0 lag: (min: 1.0, avg: 8.9, max: 21.0) [2024-06-27 15:30:28,850][06674] Avg episode reward: [(0, '0.396')] [2024-06-27 15:30:29,186][06909] Updated weights for policy 0, policy_version 23132 (0.0030) [2024-06-27 15:30:33,266][06909] Updated weights for policy 0, policy_version 23142 (0.0029) [2024-06-27 15:30:33,850][06674] Fps is (10 sec: 36044.7, 60 sec: 43144.4, 300 sec: 43264.9). Total num frames: 379158528. Throughput: 0: 43380.3. Samples: 282108280. Policy #0 lag: (min: 1.0, avg: 8.9, max: 21.0) [2024-06-27 15:30:33,850][06674] Avg episode reward: [(0, '0.395')] [2024-06-27 15:30:36,548][06909] Updated weights for policy 0, policy_version 23152 (0.0026) [2024-06-27 15:30:38,850][06674] Fps is (10 sec: 47513.6, 60 sec: 43417.7, 300 sec: 43653.6). Total num frames: 379437056. Throughput: 0: 43563.7. Samples: 282366960. Policy #0 lag: (min: 0.0, avg: 7.2, max: 20.0) [2024-06-27 15:30:38,850][06674] Avg episode reward: [(0, '0.392')] [2024-06-27 15:30:40,659][06909] Updated weights for policy 0, policy_version 23162 (0.0025) [2024-06-27 15:30:43,850][06674] Fps is (10 sec: 47514.0, 60 sec: 43963.7, 300 sec: 43542.6). Total num frames: 379633664. Throughput: 0: 43620.9. Samples: 282514880. Policy #0 lag: (min: 0.0, avg: 7.2, max: 20.0) [2024-06-27 15:30:43,850][06674] Avg episode reward: [(0, '0.395')] [2024-06-27 15:30:44,082][06909] Updated weights for policy 0, policy_version 23172 (0.0032) [2024-06-27 15:30:48,535][06909] Updated weights for policy 0, policy_version 23182 (0.0033) [2024-06-27 15:30:48,850][06674] Fps is (10 sec: 37682.9, 60 sec: 43417.5, 300 sec: 43264.9). Total num frames: 379813888. Throughput: 0: 43415.1. Samples: 282767000. Policy #0 lag: (min: 0.0, avg: 7.2, max: 20.0) [2024-06-27 15:30:48,851][06674] Avg episode reward: [(0, '0.394')] [2024-06-27 15:30:51,575][06909] Updated weights for policy 0, policy_version 23192 (0.0040) [2024-06-27 15:30:53,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43144.6, 300 sec: 43542.6). Total num frames: 380076032. Throughput: 0: 43550.5. Samples: 283020900. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-27 15:30:53,850][06674] Avg episode reward: [(0, '0.394')] [2024-06-27 15:30:56,045][06909] Updated weights for policy 0, policy_version 23202 (0.0034) [2024-06-27 15:30:58,850][06674] Fps is (10 sec: 45875.9, 60 sec: 43417.6, 300 sec: 43431.8). Total num frames: 380272640. Throughput: 0: 43486.2. Samples: 283160920. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-27 15:30:58,850][06674] Avg episode reward: [(0, '0.393')] [2024-06-27 15:30:59,185][06909] Updated weights for policy 0, policy_version 23212 (0.0034) [2024-06-27 15:31:03,466][06909] Updated weights for policy 0, policy_version 23222 (0.0034) [2024-06-27 15:31:03,850][06674] Fps is (10 sec: 39321.4, 60 sec: 43417.6, 300 sec: 43265.5). Total num frames: 380469248. Throughput: 0: 43368.8. Samples: 283415200. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-27 15:31:03,850][06674] Avg episode reward: [(0, '0.392')] [2024-06-27 15:31:06,635][06909] Updated weights for policy 0, policy_version 23232 (0.0033) [2024-06-27 15:31:08,850][06674] Fps is (10 sec: 44236.5, 60 sec: 42872.9, 300 sec: 43487.0). Total num frames: 380715008. Throughput: 0: 43514.3. Samples: 283671000. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-27 15:31:08,850][06674] Avg episode reward: [(0, '0.397')] [2024-06-27 15:31:11,194][06909] Updated weights for policy 0, policy_version 23242 (0.0036) [2024-06-27 15:31:13,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43417.4, 300 sec: 43431.8). Total num frames: 380911616. Throughput: 0: 43511.4. Samples: 283813100. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-27 15:31:13,850][06674] Avg episode reward: [(0, '0.398')] [2024-06-27 15:31:14,456][06909] Updated weights for policy 0, policy_version 23252 (0.0027) [2024-06-27 15:31:18,547][06909] Updated weights for policy 0, policy_version 23262 (0.0032) [2024-06-27 15:31:18,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43144.5, 300 sec: 43320.4). Total num frames: 381124608. Throughput: 0: 43468.6. Samples: 284064360. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-27 15:31:18,850][06674] Avg episode reward: [(0, '0.394')] [2024-06-27 15:31:22,024][06909] Updated weights for policy 0, policy_version 23272 (0.0038) [2024-06-27 15:31:23,850][06674] Fps is (10 sec: 47513.9, 60 sec: 43144.5, 300 sec: 43487.0). Total num frames: 381386752. Throughput: 0: 43414.1. Samples: 284320600. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-27 15:31:23,850][06674] Avg episode reward: [(0, '0.394')] [2024-06-27 15:31:24,267][06887] Signal inference workers to stop experience collection... (4000 times) [2024-06-27 15:31:24,269][06887] Signal inference workers to resume experience collection... (4000 times) [2024-06-27 15:31:24,288][06909] InferenceWorker_p0-w0: stopping experience collection (4000 times) [2024-06-27 15:31:24,320][06909] InferenceWorker_p0-w0: resuming experience collection (4000 times) [2024-06-27 15:31:26,395][06909] Updated weights for policy 0, policy_version 23282 (0.0035) [2024-06-27 15:31:28,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43417.5, 300 sec: 43431.5). Total num frames: 381566976. Throughput: 0: 43259.5. Samples: 284461560. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-27 15:31:28,850][06674] Avg episode reward: [(0, '0.398')] [2024-06-27 15:31:29,444][06909] Updated weights for policy 0, policy_version 23292 (0.0036) [2024-06-27 15:31:33,818][06909] Updated weights for policy 0, policy_version 23302 (0.0041) [2024-06-27 15:31:33,850][06674] Fps is (10 sec: 39321.6, 60 sec: 43690.7, 300 sec: 43320.4). Total num frames: 381779968. Throughput: 0: 43164.4. Samples: 284709400. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-27 15:31:33,850][06674] Avg episode reward: [(0, '0.395')] [2024-06-27 15:31:36,981][06909] Updated weights for policy 0, policy_version 23312 (0.0023) [2024-06-27 15:31:38,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43144.4, 300 sec: 43487.0). Total num frames: 382025728. Throughput: 0: 43172.8. Samples: 284963680. Policy #0 lag: (min: 2.0, avg: 9.9, max: 21.0) [2024-06-27 15:31:38,851][06674] Avg episode reward: [(0, '0.397')] [2024-06-27 15:31:41,333][06909] Updated weights for policy 0, policy_version 23322 (0.0038) [2024-06-27 15:31:43,850][06674] Fps is (10 sec: 42598.8, 60 sec: 42871.5, 300 sec: 43376.0). Total num frames: 382205952. Throughput: 0: 43180.8. Samples: 285104060. Policy #0 lag: (min: 2.0, avg: 9.9, max: 21.0) [2024-06-27 15:31:43,850][06674] Avg episode reward: [(0, '0.398')] [2024-06-27 15:31:44,605][06909] Updated weights for policy 0, policy_version 23332 (0.0047) [2024-06-27 15:31:48,793][06909] Updated weights for policy 0, policy_version 23342 (0.0029) [2024-06-27 15:31:48,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.6, 300 sec: 43375.9). Total num frames: 382435328. Throughput: 0: 43249.7. Samples: 285361440. Policy #0 lag: (min: 2.0, avg: 9.9, max: 21.0) [2024-06-27 15:31:48,850][06674] Avg episode reward: [(0, '0.398')] [2024-06-27 15:31:48,859][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000023342_382435328.pth... [2024-06-27 15:31:48,917][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000022706_372015104.pth [2024-06-27 15:31:52,037][06909] Updated weights for policy 0, policy_version 23352 (0.0040) [2024-06-27 15:31:53,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43144.5, 300 sec: 43431.5). Total num frames: 382664704. Throughput: 0: 43252.0. Samples: 285617340. Policy #0 lag: (min: 1.0, avg: 11.9, max: 21.0) [2024-06-27 15:31:53,850][06674] Avg episode reward: [(0, '0.401')] [2024-06-27 15:31:56,235][06909] Updated weights for policy 0, policy_version 23362 (0.0034) [2024-06-27 15:31:58,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43144.5, 300 sec: 43431.5). Total num frames: 382861312. Throughput: 0: 43085.5. Samples: 285751940. Policy #0 lag: (min: 1.0, avg: 11.9, max: 21.0) [2024-06-27 15:31:58,850][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:31:59,641][06909] Updated weights for policy 0, policy_version 23372 (0.0043) [2024-06-27 15:32:03,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43417.5, 300 sec: 43265.1). Total num frames: 383074304. Throughput: 0: 43229.6. Samples: 286009700. Policy #0 lag: (min: 1.0, avg: 11.9, max: 21.0) [2024-06-27 15:32:03,850][06674] Avg episode reward: [(0, '0.397')] [2024-06-27 15:32:04,196][06909] Updated weights for policy 0, policy_version 23382 (0.0033) [2024-06-27 15:32:07,262][06909] Updated weights for policy 0, policy_version 23392 (0.0044) [2024-06-27 15:32:08,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 383320064. Throughput: 0: 43155.2. Samples: 286262580. Policy #0 lag: (min: 0.0, avg: 11.8, max: 24.0) [2024-06-27 15:32:08,850][06674] Avg episode reward: [(0, '0.405')] [2024-06-27 15:32:11,914][06909] Updated weights for policy 0, policy_version 23402 (0.0026) [2024-06-27 15:32:13,850][06674] Fps is (10 sec: 42599.2, 60 sec: 43144.7, 300 sec: 43320.4). Total num frames: 383500288. Throughput: 0: 43158.3. Samples: 286403680. Policy #0 lag: (min: 0.0, avg: 11.8, max: 24.0) [2024-06-27 15:32:13,850][06674] Avg episode reward: [(0, '0.406')] [2024-06-27 15:32:14,784][06909] Updated weights for policy 0, policy_version 23412 (0.0037) [2024-06-27 15:32:18,850][06674] Fps is (10 sec: 39321.0, 60 sec: 43144.4, 300 sec: 43209.3). Total num frames: 383713280. Throughput: 0: 43196.8. Samples: 286653260. Policy #0 lag: (min: 0.0, avg: 11.8, max: 24.0) [2024-06-27 15:32:18,851][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:32:19,320][06909] Updated weights for policy 0, policy_version 23422 (0.0033) [2024-06-27 15:32:22,445][06909] Updated weights for policy 0, policy_version 23432 (0.0029) [2024-06-27 15:32:23,856][06674] Fps is (10 sec: 49122.0, 60 sec: 43413.3, 300 sec: 43430.8). Total num frames: 383991808. Throughput: 0: 43270.7. Samples: 286911120. Policy #0 lag: (min: 0.0, avg: 11.8, max: 24.0) [2024-06-27 15:32:23,856][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:32:27,045][06909] Updated weights for policy 0, policy_version 23442 (0.0039) [2024-06-27 15:32:28,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 384172032. Throughput: 0: 43310.1. Samples: 287053020. Policy #0 lag: (min: 0.0, avg: 12.0, max: 20.0) [2024-06-27 15:32:28,850][06674] Avg episode reward: [(0, '0.403')] [2024-06-27 15:32:29,812][06909] Updated weights for policy 0, policy_version 23452 (0.0041) [2024-06-27 15:32:33,850][06674] Fps is (10 sec: 37706.0, 60 sec: 43144.6, 300 sec: 43209.3). Total num frames: 384368640. Throughput: 0: 43307.6. Samples: 287310280. Policy #0 lag: (min: 0.0, avg: 12.0, max: 20.0) [2024-06-27 15:32:33,850][06674] Avg episode reward: [(0, '0.401')] [2024-06-27 15:32:34,402][06909] Updated weights for policy 0, policy_version 23462 (0.0030) [2024-06-27 15:32:35,068][06887] Signal inference workers to stop experience collection... (4050 times) [2024-06-27 15:32:35,121][06887] Signal inference workers to resume experience collection... (4050 times) [2024-06-27 15:32:35,122][06909] InferenceWorker_p0-w0: stopping experience collection (4050 times) [2024-06-27 15:32:35,137][06909] InferenceWorker_p0-w0: resuming experience collection (4050 times) [2024-06-27 15:32:37,645][06909] Updated weights for policy 0, policy_version 23472 (0.0033) [2024-06-27 15:32:38,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43417.7, 300 sec: 43376.0). Total num frames: 384630784. Throughput: 0: 43320.5. Samples: 287566760. Policy #0 lag: (min: 0.0, avg: 12.0, max: 20.0) [2024-06-27 15:32:38,850][06674] Avg episode reward: [(0, '0.401')] [2024-06-27 15:32:41,829][06909] Updated weights for policy 0, policy_version 23482 (0.0029) [2024-06-27 15:32:43,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43690.6, 300 sec: 43487.0). Total num frames: 384827392. Throughput: 0: 43495.1. Samples: 287709220. Policy #0 lag: (min: 0.0, avg: 11.8, max: 21.0) [2024-06-27 15:32:43,850][06674] Avg episode reward: [(0, '0.401')] [2024-06-27 15:32:45,094][06909] Updated weights for policy 0, policy_version 23492 (0.0036) [2024-06-27 15:32:48,850][06674] Fps is (10 sec: 39321.5, 60 sec: 43144.6, 300 sec: 43209.3). Total num frames: 385024000. Throughput: 0: 43317.9. Samples: 287959000. Policy #0 lag: (min: 0.0, avg: 11.8, max: 21.0) [2024-06-27 15:32:48,850][06674] Avg episode reward: [(0, '0.391')] [2024-06-27 15:32:49,315][06909] Updated weights for policy 0, policy_version 23502 (0.0036) [2024-06-27 15:32:52,453][06909] Updated weights for policy 0, policy_version 23512 (0.0034) [2024-06-27 15:32:53,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43417.7, 300 sec: 43376.0). Total num frames: 385269760. Throughput: 0: 43379.2. Samples: 288214640. Policy #0 lag: (min: 0.0, avg: 11.8, max: 21.0) [2024-06-27 15:32:53,850][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:32:56,975][06909] Updated weights for policy 0, policy_version 23522 (0.0045) [2024-06-27 15:32:58,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 385466368. Throughput: 0: 43473.8. Samples: 288360000. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2024-06-27 15:32:58,850][06674] Avg episode reward: [(0, '0.399')] [2024-06-27 15:32:59,884][06909] Updated weights for policy 0, policy_version 23532 (0.0028) [2024-06-27 15:33:03,850][06674] Fps is (10 sec: 37682.7, 60 sec: 42871.5, 300 sec: 43098.2). Total num frames: 385646592. Throughput: 0: 43615.2. Samples: 288615940. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2024-06-27 15:33:03,850][06674] Avg episode reward: [(0, '0.401')] [2024-06-27 15:33:04,656][06909] Updated weights for policy 0, policy_version 23542 (0.0042) [2024-06-27 15:33:07,423][06909] Updated weights for policy 0, policy_version 23552 (0.0028) [2024-06-27 15:33:08,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 385925120. Throughput: 0: 43541.4. Samples: 288870220. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2024-06-27 15:33:08,850][06674] Avg episode reward: [(0, '0.401')] [2024-06-27 15:33:12,125][06909] Updated weights for policy 0, policy_version 23562 (0.0032) [2024-06-27 15:33:13,850][06674] Fps is (10 sec: 47513.4, 60 sec: 43690.6, 300 sec: 43431.5). Total num frames: 386121728. Throughput: 0: 43552.9. Samples: 289012900. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2024-06-27 15:33:13,850][06674] Avg episode reward: [(0, '0.400')] [2024-06-27 15:33:14,830][06909] Updated weights for policy 0, policy_version 23572 (0.0034) [2024-06-27 15:33:18,850][06674] Fps is (10 sec: 37683.1, 60 sec: 43144.6, 300 sec: 43098.3). Total num frames: 386301952. Throughput: 0: 43440.4. Samples: 289265100. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2024-06-27 15:33:18,850][06674] Avg episode reward: [(0, '0.401')] [2024-06-27 15:33:19,690][06909] Updated weights for policy 0, policy_version 23582 (0.0031) [2024-06-27 15:33:22,479][06909] Updated weights for policy 0, policy_version 23592 (0.0032) [2024-06-27 15:33:23,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43148.9, 300 sec: 43431.5). Total num frames: 386580480. Throughput: 0: 43378.7. Samples: 289518800. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2024-06-27 15:33:23,850][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:33:27,253][06909] Updated weights for policy 0, policy_version 23602 (0.0040) [2024-06-27 15:33:28,850][06674] Fps is (10 sec: 47513.8, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 386777088. Throughput: 0: 43460.0. Samples: 289664920. Policy #0 lag: (min: 1.0, avg: 10.3, max: 22.0) [2024-06-27 15:33:28,850][06674] Avg episode reward: [(0, '0.398')] [2024-06-27 15:33:30,107][06909] Updated weights for policy 0, policy_version 23612 (0.0030) [2024-06-27 15:33:33,850][06674] Fps is (10 sec: 37682.8, 60 sec: 43144.5, 300 sec: 43098.2). Total num frames: 386957312. Throughput: 0: 43324.8. Samples: 289908620. Policy #0 lag: (min: 1.0, avg: 10.3, max: 22.0) [2024-06-27 15:33:33,850][06674] Avg episode reward: [(0, '0.399')] [2024-06-27 15:33:35,275][06909] Updated weights for policy 0, policy_version 23622 (0.0025) [2024-06-27 15:33:37,921][06909] Updated weights for policy 0, policy_version 23632 (0.0028) [2024-06-27 15:33:38,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43417.5, 300 sec: 43431.5). Total num frames: 387235840. Throughput: 0: 43310.1. Samples: 290163600. Policy #0 lag: (min: 1.0, avg: 10.3, max: 22.0) [2024-06-27 15:33:38,854][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:33:42,776][06909] Updated weights for policy 0, policy_version 23642 (0.0036) [2024-06-27 15:33:42,861][06887] Signal inference workers to stop experience collection... (4100 times) [2024-06-27 15:33:42,905][06909] InferenceWorker_p0-w0: stopping experience collection (4100 times) [2024-06-27 15:33:42,923][06887] Signal inference workers to resume experience collection... (4100 times) [2024-06-27 15:33:42,925][06909] InferenceWorker_p0-w0: resuming experience collection (4100 times) [2024-06-27 15:33:43,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43144.5, 300 sec: 43376.0). Total num frames: 387416064. Throughput: 0: 43231.9. Samples: 290305440. Policy #0 lag: (min: 1.0, avg: 10.3, max: 22.0) [2024-06-27 15:33:43,850][06674] Avg episode reward: [(0, '0.400')] [2024-06-27 15:33:45,353][06909] Updated weights for policy 0, policy_version 23652 (0.0036) [2024-06-27 15:33:48,850][06674] Fps is (10 sec: 39321.6, 60 sec: 43417.5, 300 sec: 43209.3). Total num frames: 387629056. Throughput: 0: 43208.4. Samples: 290560320. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-27 15:33:48,850][06674] Avg episode reward: [(0, '0.399')] [2024-06-27 15:33:48,864][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000023659_387629056.pth... [2024-06-27 15:33:48,907][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000023024_377225216.pth [2024-06-27 15:33:50,230][06909] Updated weights for policy 0, policy_version 23662 (0.0029) [2024-06-27 15:33:52,727][06909] Updated weights for policy 0, policy_version 23672 (0.0030) [2024-06-27 15:33:53,856][06674] Fps is (10 sec: 45847.5, 60 sec: 43413.1, 300 sec: 43430.6). Total num frames: 387874816. Throughput: 0: 43255.5. Samples: 290816980. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-27 15:33:53,857][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:33:57,665][06909] Updated weights for policy 0, policy_version 23682 (0.0044) [2024-06-27 15:33:58,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43417.5, 300 sec: 43376.0). Total num frames: 388071424. Throughput: 0: 43255.2. Samples: 290959380. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-27 15:33:58,850][06674] Avg episode reward: [(0, '0.403')] [2024-06-27 15:34:00,268][06909] Updated weights for policy 0, policy_version 23692 (0.0034) [2024-06-27 15:34:03,852][06674] Fps is (10 sec: 39338.7, 60 sec: 43689.4, 300 sec: 43153.8). Total num frames: 388268032. Throughput: 0: 43101.5. Samples: 291204740. Policy #0 lag: (min: 0.0, avg: 12.4, max: 22.0) [2024-06-27 15:34:03,852][06674] Avg episode reward: [(0, '0.403')] [2024-06-27 15:34:05,157][06909] Updated weights for policy 0, policy_version 23702 (0.0039) [2024-06-27 15:34:07,762][06909] Updated weights for policy 0, policy_version 23712 (0.0031) [2024-06-27 15:34:08,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43144.5, 300 sec: 43376.0). Total num frames: 388513792. Throughput: 0: 43263.9. Samples: 291465680. Policy #0 lag: (min: 0.0, avg: 12.4, max: 22.0) [2024-06-27 15:34:08,850][06674] Avg episode reward: [(0, '0.406')] [2024-06-27 15:34:12,524][06909] Updated weights for policy 0, policy_version 23722 (0.0036) [2024-06-27 15:34:13,850][06674] Fps is (10 sec: 44244.6, 60 sec: 43144.6, 300 sec: 43321.3). Total num frames: 388710400. Throughput: 0: 43285.3. Samples: 291612760. Policy #0 lag: (min: 0.0, avg: 12.4, max: 22.0) [2024-06-27 15:34:13,850][06674] Avg episode reward: [(0, '0.398')] [2024-06-27 15:34:15,246][06909] Updated weights for policy 0, policy_version 23732 (0.0030) [2024-06-27 15:34:18,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.7, 300 sec: 43209.3). Total num frames: 388923392. Throughput: 0: 43399.6. Samples: 291861600. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-27 15:34:18,850][06674] Avg episode reward: [(0, '0.400')] [2024-06-27 15:34:20,325][06909] Updated weights for policy 0, policy_version 23742 (0.0029) [2024-06-27 15:34:22,943][06909] Updated weights for policy 0, policy_version 23752 (0.0038) [2024-06-27 15:34:23,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43144.5, 300 sec: 43431.5). Total num frames: 389169152. Throughput: 0: 43434.2. Samples: 292118140. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-27 15:34:23,854][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:34:27,758][06909] Updated weights for policy 0, policy_version 23762 (0.0032) [2024-06-27 15:34:28,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43144.5, 300 sec: 43375.9). Total num frames: 389365760. Throughput: 0: 43421.4. Samples: 292259400. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-27 15:34:28,850][06674] Avg episode reward: [(0, '0.399')] [2024-06-27 15:34:30,453][06909] Updated weights for policy 0, policy_version 23772 (0.0032) [2024-06-27 15:34:33,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43690.6, 300 sec: 43209.3). Total num frames: 389578752. Throughput: 0: 43394.2. Samples: 292513060. Policy #0 lag: (min: 0.0, avg: 11.1, max: 20.0) [2024-06-27 15:34:33,850][06674] Avg episode reward: [(0, '0.405')] [2024-06-27 15:34:35,089][06909] Updated weights for policy 0, policy_version 23782 (0.0034) [2024-06-27 15:34:38,247][06909] Updated weights for policy 0, policy_version 23792 (0.0033) [2024-06-27 15:34:38,856][06674] Fps is (10 sec: 49122.2, 60 sec: 43686.3, 300 sec: 43597.2). Total num frames: 389857280. Throughput: 0: 43570.7. Samples: 292777660. Policy #0 lag: (min: 0.0, avg: 11.1, max: 20.0) [2024-06-27 15:34:38,856][06674] Avg episode reward: [(0, '0.404')] [2024-06-27 15:34:42,918][06909] Updated weights for policy 0, policy_version 23802 (0.0030) [2024-06-27 15:34:43,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 390021120. Throughput: 0: 43471.0. Samples: 292915580. Policy #0 lag: (min: 0.0, avg: 11.1, max: 20.0) [2024-06-27 15:34:43,851][06674] Avg episode reward: [(0, '0.406')] [2024-06-27 15:34:45,525][06887] Signal inference workers to stop experience collection... (4150 times) [2024-06-27 15:34:45,526][06887] Signal inference workers to resume experience collection... (4150 times) [2024-06-27 15:34:45,570][06909] InferenceWorker_p0-w0: stopping experience collection (4150 times) [2024-06-27 15:34:45,571][06909] InferenceWorker_p0-w0: resuming experience collection (4150 times) [2024-06-27 15:34:45,655][06909] Updated weights for policy 0, policy_version 23812 (0.0032) [2024-06-27 15:34:48,852][06674] Fps is (10 sec: 37697.7, 60 sec: 43416.1, 300 sec: 43209.0). Total num frames: 390234112. Throughput: 0: 43536.5. Samples: 293163900. Policy #0 lag: (min: 0.0, avg: 11.1, max: 20.0) [2024-06-27 15:34:48,852][06674] Avg episode reward: [(0, '0.404')] [2024-06-27 15:34:50,527][06909] Updated weights for policy 0, policy_version 23822 (0.0028) [2024-06-27 15:34:53,146][06909] Updated weights for policy 0, policy_version 23832 (0.0032) [2024-06-27 15:34:53,850][06674] Fps is (10 sec: 45876.0, 60 sec: 43422.1, 300 sec: 43431.5). Total num frames: 390479872. Throughput: 0: 43486.8. Samples: 293422580. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 15:34:53,850][06674] Avg episode reward: [(0, '0.407')] [2024-06-27 15:34:57,950][06909] Updated weights for policy 0, policy_version 23842 (0.0047) [2024-06-27 15:34:58,850][06674] Fps is (10 sec: 42607.4, 60 sec: 43144.5, 300 sec: 43375.9). Total num frames: 390660096. Throughput: 0: 43331.5. Samples: 293562680. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 15:34:58,850][06674] Avg episode reward: [(0, '0.408')] [2024-06-27 15:35:00,555][06909] Updated weights for policy 0, policy_version 23852 (0.0022) [2024-06-27 15:35:03,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43691.9, 300 sec: 43209.6). Total num frames: 390889472. Throughput: 0: 43401.8. Samples: 293814680. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 15:35:03,850][06674] Avg episode reward: [(0, '0.401')] [2024-06-27 15:35:05,274][06909] Updated weights for policy 0, policy_version 23862 (0.0039) [2024-06-27 15:35:07,959][06909] Updated weights for policy 0, policy_version 23872 (0.0038) [2024-06-27 15:35:08,850][06674] Fps is (10 sec: 49152.1, 60 sec: 43963.7, 300 sec: 43542.5). Total num frames: 391151616. Throughput: 0: 43628.9. Samples: 294081440. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-27 15:35:08,851][06674] Avg episode reward: [(0, '0.401')] [2024-06-27 15:35:12,712][06909] Updated weights for policy 0, policy_version 23882 (0.0046) [2024-06-27 15:35:13,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43144.6, 300 sec: 43264.9). Total num frames: 391299072. Throughput: 0: 43578.7. Samples: 294220440. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-27 15:35:13,850][06674] Avg episode reward: [(0, '0.399')] [2024-06-27 15:35:15,945][06909] Updated weights for policy 0, policy_version 23892 (0.0026) [2024-06-27 15:35:18,850][06674] Fps is (10 sec: 39321.1, 60 sec: 43690.6, 300 sec: 43209.3). Total num frames: 391544832. Throughput: 0: 43531.9. Samples: 294472000. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-27 15:35:18,851][06674] Avg episode reward: [(0, '0.406')] [2024-06-27 15:35:20,679][06909] Updated weights for policy 0, policy_version 23902 (0.0034) [2024-06-27 15:35:23,372][06909] Updated weights for policy 0, policy_version 23912 (0.0038) [2024-06-27 15:35:23,850][06674] Fps is (10 sec: 49151.5, 60 sec: 43690.7, 300 sec: 43487.0). Total num frames: 391790592. Throughput: 0: 43413.8. Samples: 294731020. Policy #0 lag: (min: 1.0, avg: 10.8, max: 24.0) [2024-06-27 15:35:23,852][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:35:28,102][06909] Updated weights for policy 0, policy_version 23922 (0.0026) [2024-06-27 15:35:28,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43144.5, 300 sec: 43375.9). Total num frames: 391954432. Throughput: 0: 43366.7. Samples: 294867080. Policy #0 lag: (min: 1.0, avg: 10.8, max: 24.0) [2024-06-27 15:35:28,850][06674] Avg episode reward: [(0, '0.398')] [2024-06-27 15:35:30,859][06909] Updated weights for policy 0, policy_version 23932 (0.0039) [2024-06-27 15:35:33,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43690.7, 300 sec: 43264.9). Total num frames: 392200192. Throughput: 0: 43541.2. Samples: 295123160. Policy #0 lag: (min: 1.0, avg: 10.8, max: 24.0) [2024-06-27 15:35:33,851][06674] Avg episode reward: [(0, '0.394')] [2024-06-27 15:35:35,438][06909] Updated weights for policy 0, policy_version 23942 (0.0037) [2024-06-27 15:35:38,213][06909] Updated weights for policy 0, policy_version 23952 (0.0033) [2024-06-27 15:35:38,850][06674] Fps is (10 sec: 49152.3, 60 sec: 43148.9, 300 sec: 43431.5). Total num frames: 392445952. Throughput: 0: 43609.3. Samples: 295385000. Policy #0 lag: (min: 1.0, avg: 11.8, max: 23.0) [2024-06-27 15:35:38,852][06674] Avg episode reward: [(0, '0.400')] [2024-06-27 15:35:42,853][06909] Updated weights for policy 0, policy_version 23962 (0.0037) [2024-06-27 15:35:43,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43417.7, 300 sec: 43431.5). Total num frames: 392626176. Throughput: 0: 43569.0. Samples: 295523280. Policy #0 lag: (min: 1.0, avg: 11.8, max: 23.0) [2024-06-27 15:35:43,850][06674] Avg episode reward: [(0, '0.401')] [2024-06-27 15:35:45,635][06909] Updated weights for policy 0, policy_version 23972 (0.0028) [2024-06-27 15:35:48,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43692.3, 300 sec: 43320.4). Total num frames: 392855552. Throughput: 0: 43641.8. Samples: 295778560. Policy #0 lag: (min: 1.0, avg: 11.8, max: 23.0) [2024-06-27 15:35:48,850][06674] Avg episode reward: [(0, '0.404')] [2024-06-27 15:35:48,864][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000023978_392855552.pth... [2024-06-27 15:35:48,911][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000023342_382435328.pth [2024-06-27 15:35:50,360][06909] Updated weights for policy 0, policy_version 23982 (0.0041) [2024-06-27 15:35:51,414][06887] Signal inference workers to stop experience collection... (4200 times) [2024-06-27 15:35:51,462][06909] InferenceWorker_p0-w0: stopping experience collection (4200 times) [2024-06-27 15:35:51,474][06887] Signal inference workers to resume experience collection... (4200 times) [2024-06-27 15:35:51,478][06909] InferenceWorker_p0-w0: resuming experience collection (4200 times) [2024-06-27 15:35:53,473][06909] Updated weights for policy 0, policy_version 23992 (0.0033) [2024-06-27 15:35:53,850][06674] Fps is (10 sec: 47513.3, 60 sec: 43690.6, 300 sec: 43487.0). Total num frames: 393101312. Throughput: 0: 43406.7. Samples: 296034740. Policy #0 lag: (min: 1.0, avg: 11.8, max: 23.0) [2024-06-27 15:35:53,850][06674] Avg episode reward: [(0, '0.399')] [2024-06-27 15:35:58,092][06909] Updated weights for policy 0, policy_version 24002 (0.0038) [2024-06-27 15:35:58,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43417.7, 300 sec: 43376.0). Total num frames: 393265152. Throughput: 0: 43326.7. Samples: 296170140. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-27 15:35:58,850][06674] Avg episode reward: [(0, '0.401')] [2024-06-27 15:36:00,863][06909] Updated weights for policy 0, policy_version 24012 (0.0028) [2024-06-27 15:36:03,850][06674] Fps is (10 sec: 39321.5, 60 sec: 43417.6, 300 sec: 43320.4). Total num frames: 393494528. Throughput: 0: 43449.9. Samples: 296427240. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-27 15:36:03,851][06674] Avg episode reward: [(0, '0.397')] [2024-06-27 15:36:05,603][06909] Updated weights for policy 0, policy_version 24022 (0.0040) [2024-06-27 15:36:08,381][06909] Updated weights for policy 0, policy_version 24032 (0.0037) [2024-06-27 15:36:08,850][06674] Fps is (10 sec: 47512.7, 60 sec: 43144.5, 300 sec: 43487.0). Total num frames: 393740288. Throughput: 0: 43379.5. Samples: 296683100. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-27 15:36:08,851][06674] Avg episode reward: [(0, '0.403')] [2024-06-27 15:36:13,018][06909] Updated weights for policy 0, policy_version 24042 (0.0030) [2024-06-27 15:36:13,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43690.7, 300 sec: 43375.9). Total num frames: 393920512. Throughput: 0: 43450.3. Samples: 296822340. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 15:36:13,850][06674] Avg episode reward: [(0, '0.403')] [2024-06-27 15:36:16,184][06909] Updated weights for policy 0, policy_version 24052 (0.0041) [2024-06-27 15:36:18,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43417.6, 300 sec: 43264.9). Total num frames: 394149888. Throughput: 0: 43477.2. Samples: 297079640. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 15:36:18,851][06674] Avg episode reward: [(0, '0.400')] [2024-06-27 15:36:20,425][06909] Updated weights for policy 0, policy_version 24062 (0.0026) [2024-06-27 15:36:23,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43144.6, 300 sec: 43431.5). Total num frames: 394379264. Throughput: 0: 43301.9. Samples: 297333580. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 15:36:23,850][06674] Avg episode reward: [(0, '0.401')] [2024-06-27 15:36:23,967][06909] Updated weights for policy 0, policy_version 24072 (0.0027) [2024-06-27 15:36:28,144][06909] Updated weights for policy 0, policy_version 24082 (0.0023) [2024-06-27 15:36:28,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43690.7, 300 sec: 43376.0). Total num frames: 394575872. Throughput: 0: 43259.5. Samples: 297469960. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-27 15:36:28,850][06674] Avg episode reward: [(0, '0.404')] [2024-06-27 15:36:31,768][06909] Updated weights for policy 0, policy_version 24092 (0.0027) [2024-06-27 15:36:33,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43417.7, 300 sec: 43320.4). Total num frames: 394805248. Throughput: 0: 43260.5. Samples: 297725280. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-27 15:36:33,850][06674] Avg episode reward: [(0, '0.392')] [2024-06-27 15:36:35,783][06909] Updated weights for policy 0, policy_version 24102 (0.0036) [2024-06-27 15:36:38,850][06674] Fps is (10 sec: 44237.3, 60 sec: 42871.5, 300 sec: 43431.5). Total num frames: 395018240. Throughput: 0: 43309.9. Samples: 297983680. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-27 15:36:38,850][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:36:39,167][06909] Updated weights for policy 0, policy_version 24112 (0.0027) [2024-06-27 15:36:43,177][06909] Updated weights for policy 0, policy_version 24122 (0.0027) [2024-06-27 15:36:43,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43417.5, 300 sec: 43376.0). Total num frames: 395231232. Throughput: 0: 43270.1. Samples: 298117300. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 15:36:43,850][06674] Avg episode reward: [(0, '0.408')] [2024-06-27 15:36:46,705][06909] Updated weights for policy 0, policy_version 24132 (0.0033) [2024-06-27 15:36:48,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43144.5, 300 sec: 43320.4). Total num frames: 395444224. Throughput: 0: 43365.4. Samples: 298378680. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 15:36:48,850][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:36:51,197][06909] Updated weights for policy 0, policy_version 24142 (0.0027) [2024-06-27 15:36:53,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43144.5, 300 sec: 43487.0). Total num frames: 395689984. Throughput: 0: 43367.1. Samples: 298634620. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 15:36:53,850][06674] Avg episode reward: [(0, '0.401')] [2024-06-27 15:36:54,147][06909] Updated weights for policy 0, policy_version 24152 (0.0034) [2024-06-27 15:36:58,650][06909] Updated weights for policy 0, policy_version 24162 (0.0024) [2024-06-27 15:36:58,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43417.6, 300 sec: 43376.0). Total num frames: 395870208. Throughput: 0: 43280.4. Samples: 298769960. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 15:36:58,850][06674] Avg episode reward: [(0, '0.401')] [2024-06-27 15:37:01,814][06909] Updated weights for policy 0, policy_version 24172 (0.0030) [2024-06-27 15:37:03,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43417.7, 300 sec: 43320.4). Total num frames: 396099584. Throughput: 0: 43293.1. Samples: 299027820. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 15:37:03,850][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:37:06,166][06909] Updated weights for policy 0, policy_version 24182 (0.0046) [2024-06-27 15:37:08,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43144.6, 300 sec: 43487.0). Total num frames: 396328960. Throughput: 0: 43422.1. Samples: 299287580. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 15:37:08,850][06674] Avg episode reward: [(0, '0.401')] [2024-06-27 15:37:09,298][06909] Updated weights for policy 0, policy_version 24192 (0.0034) [2024-06-27 15:37:13,573][06909] Updated weights for policy 0, policy_version 24202 (0.0040) [2024-06-27 15:37:13,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 396525568. Throughput: 0: 43373.0. Samples: 299421740. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 15:37:13,850][06674] Avg episode reward: [(0, '0.405')] [2024-06-27 15:37:16,403][06887] Signal inference workers to stop experience collection... (4250 times) [2024-06-27 15:37:16,434][06909] InferenceWorker_p0-w0: stopping experience collection (4250 times) [2024-06-27 15:37:16,449][06887] Signal inference workers to resume experience collection... (4250 times) [2024-06-27 15:37:16,456][06909] InferenceWorker_p0-w0: resuming experience collection (4250 times) [2024-06-27 15:37:16,589][06909] Updated weights for policy 0, policy_version 24212 (0.0043) [2024-06-27 15:37:18,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43690.8, 300 sec: 43321.3). Total num frames: 396771328. Throughput: 0: 43464.0. Samples: 299681160. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2024-06-27 15:37:18,850][06674] Avg episode reward: [(0, '0.399')] [2024-06-27 15:37:20,992][06909] Updated weights for policy 0, policy_version 24222 (0.0033) [2024-06-27 15:37:23,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43417.5, 300 sec: 43431.5). Total num frames: 396984320. Throughput: 0: 43419.0. Samples: 299937540. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2024-06-27 15:37:23,850][06674] Avg episode reward: [(0, '0.403')] [2024-06-27 15:37:24,533][06909] Updated weights for policy 0, policy_version 24232 (0.0037) [2024-06-27 15:37:28,646][06909] Updated weights for policy 0, policy_version 24242 (0.0040) [2024-06-27 15:37:28,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 397180928. Throughput: 0: 43410.3. Samples: 300070760. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2024-06-27 15:37:28,850][06674] Avg episode reward: [(0, '0.407')] [2024-06-27 15:37:31,862][06909] Updated weights for policy 0, policy_version 24252 (0.0038) [2024-06-27 15:37:33,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.6, 300 sec: 43375.9). Total num frames: 397426688. Throughput: 0: 43484.9. Samples: 300335500. Policy #0 lag: (min: 1.0, avg: 10.2, max: 20.0) [2024-06-27 15:37:33,850][06674] Avg episode reward: [(0, '0.406')] [2024-06-27 15:37:36,251][06909] Updated weights for policy 0, policy_version 24262 (0.0034) [2024-06-27 15:37:38,850][06674] Fps is (10 sec: 45874.3, 60 sec: 43690.5, 300 sec: 43431.5). Total num frames: 397639680. Throughput: 0: 43607.5. Samples: 300596960. Policy #0 lag: (min: 1.0, avg: 10.2, max: 20.0) [2024-06-27 15:37:38,850][06674] Avg episode reward: [(0, '0.405')] [2024-06-27 15:37:39,268][06909] Updated weights for policy 0, policy_version 24272 (0.0034) [2024-06-27 15:37:43,635][06909] Updated weights for policy 0, policy_version 24282 (0.0028) [2024-06-27 15:37:43,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.7, 300 sec: 43487.0). Total num frames: 397852672. Throughput: 0: 43591.0. Samples: 300731560. Policy #0 lag: (min: 1.0, avg: 10.2, max: 20.0) [2024-06-27 15:37:43,850][06674] Avg episode reward: [(0, '0.398')] [2024-06-27 15:37:46,760][06909] Updated weights for policy 0, policy_version 24292 (0.0028) [2024-06-27 15:37:48,856][06674] Fps is (10 sec: 44210.7, 60 sec: 43959.3, 300 sec: 43430.6). Total num frames: 398082048. Throughput: 0: 43766.9. Samples: 300997600. Policy #0 lag: (min: 1.0, avg: 10.7, max: 22.0) [2024-06-27 15:37:48,857][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:37:48,872][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000024297_398082048.pth... [2024-06-27 15:37:48,926][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000023659_387629056.pth [2024-06-27 15:37:50,926][06909] Updated weights for policy 0, policy_version 24302 (0.0034) [2024-06-27 15:37:53,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43417.5, 300 sec: 43487.0). Total num frames: 398295040. Throughput: 0: 43778.5. Samples: 301257620. Policy #0 lag: (min: 1.0, avg: 10.7, max: 22.0) [2024-06-27 15:37:53,851][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:37:54,160][06909] Updated weights for policy 0, policy_version 24312 (0.0028) [2024-06-27 15:37:58,466][06909] Updated weights for policy 0, policy_version 24322 (0.0026) [2024-06-27 15:37:58,850][06674] Fps is (10 sec: 42623.7, 60 sec: 43963.6, 300 sec: 43598.1). Total num frames: 398508032. Throughput: 0: 43691.8. Samples: 301387880. Policy #0 lag: (min: 1.0, avg: 10.7, max: 22.0) [2024-06-27 15:37:58,856][06674] Avg episode reward: [(0, '0.392')] [2024-06-27 15:38:01,563][06909] Updated weights for policy 0, policy_version 24332 (0.0040) [2024-06-27 15:38:03,852][06674] Fps is (10 sec: 44228.4, 60 sec: 43962.2, 300 sec: 43431.2). Total num frames: 398737408. Throughput: 0: 43764.6. Samples: 301650660. Policy #0 lag: (min: 1.0, avg: 10.7, max: 22.0) [2024-06-27 15:38:03,852][06674] Avg episode reward: [(0, '0.398')] [2024-06-27 15:38:05,899][06909] Updated weights for policy 0, policy_version 24342 (0.0035) [2024-06-27 15:38:08,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43690.7, 300 sec: 43487.0). Total num frames: 398950400. Throughput: 0: 43769.8. Samples: 301907180. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-27 15:38:08,850][06674] Avg episode reward: [(0, '0.396')] [2024-06-27 15:38:09,368][06909] Updated weights for policy 0, policy_version 24352 (0.0040) [2024-06-27 15:38:13,377][06909] Updated weights for policy 0, policy_version 24362 (0.0031) [2024-06-27 15:38:13,850][06674] Fps is (10 sec: 42606.8, 60 sec: 43963.6, 300 sec: 43598.1). Total num frames: 399163392. Throughput: 0: 43779.5. Samples: 302040840. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-27 15:38:13,852][06674] Avg episode reward: [(0, '0.398')] [2024-06-27 15:38:17,114][06909] Updated weights for policy 0, policy_version 24372 (0.0032) [2024-06-27 15:38:18,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43417.4, 300 sec: 43375.9). Total num frames: 399376384. Throughput: 0: 43736.7. Samples: 302303660. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-27 15:38:18,850][06674] Avg episode reward: [(0, '0.398')] [2024-06-27 15:38:20,869][06909] Updated weights for policy 0, policy_version 24382 (0.0038) [2024-06-27 15:38:23,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43417.7, 300 sec: 43431.5). Total num frames: 399589376. Throughput: 0: 43597.6. Samples: 302558840. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 15:38:23,850][06674] Avg episode reward: [(0, '0.401')] [2024-06-27 15:38:24,708][06909] Updated weights for policy 0, policy_version 24392 (0.0035) [2024-06-27 15:38:28,525][06909] Updated weights for policy 0, policy_version 24402 (0.0039) [2024-06-27 15:38:28,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43963.7, 300 sec: 43598.1). Total num frames: 399818752. Throughput: 0: 43480.5. Samples: 302688180. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 15:38:28,850][06674] Avg episode reward: [(0, '0.399')] [2024-06-27 15:38:32,330][06909] Updated weights for policy 0, policy_version 24412 (0.0038) [2024-06-27 15:38:33,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43417.6, 300 sec: 43375.9). Total num frames: 400031744. Throughput: 0: 43430.7. Samples: 302951720. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 15:38:33,851][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:38:36,143][06909] Updated weights for policy 0, policy_version 24422 (0.0034) [2024-06-27 15:38:38,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43417.7, 300 sec: 43487.0). Total num frames: 400244736. Throughput: 0: 43344.9. Samples: 303208140. Policy #0 lag: (min: 1.0, avg: 11.2, max: 24.0) [2024-06-27 15:38:38,850][06674] Avg episode reward: [(0, '0.401')] [2024-06-27 15:38:39,756][06909] Updated weights for policy 0, policy_version 24432 (0.0039) [2024-06-27 15:38:43,589][06909] Updated weights for policy 0, policy_version 24442 (0.0024) [2024-06-27 15:38:43,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43417.7, 300 sec: 43487.0). Total num frames: 400457728. Throughput: 0: 43443.8. Samples: 303342840. Policy #0 lag: (min: 1.0, avg: 11.2, max: 24.0) [2024-06-27 15:38:43,850][06674] Avg episode reward: [(0, '0.406')] [2024-06-27 15:38:47,606][06909] Updated weights for policy 0, policy_version 24452 (0.0029) [2024-06-27 15:38:48,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43148.8, 300 sec: 43376.8). Total num frames: 400670720. Throughput: 0: 43417.0. Samples: 303604340. Policy #0 lag: (min: 1.0, avg: 11.2, max: 24.0) [2024-06-27 15:38:48,850][06674] Avg episode reward: [(0, '0.400')] [2024-06-27 15:38:49,164][06887] Signal inference workers to stop experience collection... (4300 times) [2024-06-27 15:38:49,164][06887] Signal inference workers to resume experience collection... (4300 times) [2024-06-27 15:38:49,220][06909] InferenceWorker_p0-w0: stopping experience collection (4300 times) [2024-06-27 15:38:49,220][06909] InferenceWorker_p0-w0: resuming experience collection (4300 times) [2024-06-27 15:38:51,074][06909] Updated weights for policy 0, policy_version 24462 (0.0033) [2024-06-27 15:38:53,850][06674] Fps is (10 sec: 44235.8, 60 sec: 43417.6, 300 sec: 43487.0). Total num frames: 400900096. Throughput: 0: 43364.8. Samples: 303858600. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-27 15:38:53,850][06674] Avg episode reward: [(0, '0.399')] [2024-06-27 15:38:55,032][06909] Updated weights for policy 0, policy_version 24472 (0.0035) [2024-06-27 15:38:58,727][06909] Updated weights for policy 0, policy_version 24482 (0.0034) [2024-06-27 15:38:58,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43417.7, 300 sec: 43542.8). Total num frames: 401113088. Throughput: 0: 43469.9. Samples: 303996980. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-27 15:38:58,850][06674] Avg episode reward: [(0, '0.399')] [2024-06-27 15:39:02,406][06909] Updated weights for policy 0, policy_version 24492 (0.0041) [2024-06-27 15:39:03,850][06674] Fps is (10 sec: 40960.8, 60 sec: 42873.0, 300 sec: 43376.0). Total num frames: 401309696. Throughput: 0: 43297.5. Samples: 304252040. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-27 15:39:03,850][06674] Avg episode reward: [(0, '0.405')] [2024-06-27 15:39:06,241][06909] Updated weights for policy 0, policy_version 24502 (0.0030) [2024-06-27 15:39:08,850][06674] Fps is (10 sec: 44235.8, 60 sec: 43417.5, 300 sec: 43542.5). Total num frames: 401555456. Throughput: 0: 43366.4. Samples: 304510340. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-27 15:39:08,851][06674] Avg episode reward: [(0, '0.403')] [2024-06-27 15:39:09,809][06909] Updated weights for policy 0, policy_version 24512 (0.0034) [2024-06-27 15:39:13,725][06909] Updated weights for policy 0, policy_version 24522 (0.0043) [2024-06-27 15:39:13,853][06674] Fps is (10 sec: 45858.7, 60 sec: 43415.1, 300 sec: 43542.0). Total num frames: 401768448. Throughput: 0: 43529.0. Samples: 304647140. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-27 15:39:13,854][06674] Avg episode reward: [(0, '0.405')] [2024-06-27 15:39:17,540][06909] Updated weights for policy 0, policy_version 24532 (0.0038) [2024-06-27 15:39:18,850][06674] Fps is (10 sec: 40960.6, 60 sec: 43144.6, 300 sec: 43376.0). Total num frames: 401965056. Throughput: 0: 43452.9. Samples: 304907100. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-27 15:39:18,850][06674] Avg episode reward: [(0, '0.393')] [2024-06-27 15:39:21,327][06909] Updated weights for policy 0, policy_version 24542 (0.0046) [2024-06-27 15:39:23,850][06674] Fps is (10 sec: 44252.6, 60 sec: 43690.6, 300 sec: 43542.6). Total num frames: 402210816. Throughput: 0: 43398.8. Samples: 305161080. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-27 15:39:23,850][06674] Avg episode reward: [(0, '0.404')] [2024-06-27 15:39:25,042][06909] Updated weights for policy 0, policy_version 24552 (0.0036) [2024-06-27 15:39:28,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43144.6, 300 sec: 43487.0). Total num frames: 402407424. Throughput: 0: 43372.4. Samples: 305294600. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-27 15:39:28,850][06674] Avg episode reward: [(0, '0.401')] [2024-06-27 15:39:28,908][06909] Updated weights for policy 0, policy_version 24562 (0.0040) [2024-06-27 15:39:32,356][06909] Updated weights for policy 0, policy_version 24572 (0.0033) [2024-06-27 15:39:33,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43417.7, 300 sec: 43321.3). Total num frames: 402636800. Throughput: 0: 43354.8. Samples: 305555300. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-27 15:39:33,850][06674] Avg episode reward: [(0, '0.404')] [2024-06-27 15:39:36,517][06909] Updated weights for policy 0, policy_version 24582 (0.0041) [2024-06-27 15:39:38,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43690.7, 300 sec: 43542.6). Total num frames: 402866176. Throughput: 0: 43476.5. Samples: 305815040. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-27 15:39:38,850][06674] Avg episode reward: [(0, '0.405')] [2024-06-27 15:39:39,916][06909] Updated weights for policy 0, policy_version 24592 (0.0045) [2024-06-27 15:39:43,830][06909] Updated weights for policy 0, policy_version 24602 (0.0038) [2024-06-27 15:39:43,852][06674] Fps is (10 sec: 44227.7, 60 sec: 43689.1, 300 sec: 43542.6). Total num frames: 403079168. Throughput: 0: 43384.7. Samples: 305949380. Policy #0 lag: (min: 1.0, avg: 9.9, max: 22.0) [2024-06-27 15:39:43,852][06674] Avg episode reward: [(0, '0.401')] [2024-06-27 15:39:47,287][06909] Updated weights for policy 0, policy_version 24612 (0.0032) [2024-06-27 15:39:48,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43417.7, 300 sec: 43375.9). Total num frames: 403275776. Throughput: 0: 43484.4. Samples: 306208840. Policy #0 lag: (min: 1.0, avg: 9.9, max: 22.0) [2024-06-27 15:39:48,850][06674] Avg episode reward: [(0, '0.405')] [2024-06-27 15:39:48,975][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000024615_403292160.pth... [2024-06-27 15:39:49,029][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000023978_392855552.pth [2024-06-27 15:39:51,374][06909] Updated weights for policy 0, policy_version 24622 (0.0032) [2024-06-27 15:39:53,850][06674] Fps is (10 sec: 44245.5, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 403521536. Throughput: 0: 43500.6. Samples: 306467860. Policy #0 lag: (min: 1.0, avg: 9.9, max: 22.0) [2024-06-27 15:39:53,850][06674] Avg episode reward: [(0, '0.406')] [2024-06-27 15:39:55,149][06909] Updated weights for policy 0, policy_version 24632 (0.0028) [2024-06-27 15:39:58,762][06909] Updated weights for policy 0, policy_version 24642 (0.0042) [2024-06-27 15:39:58,850][06674] Fps is (10 sec: 45874.4, 60 sec: 43690.5, 300 sec: 43542.5). Total num frames: 403734528. Throughput: 0: 43599.7. Samples: 306608980. Policy #0 lag: (min: 1.0, avg: 9.9, max: 22.0) [2024-06-27 15:39:58,851][06674] Avg episode reward: [(0, '0.403')] [2024-06-27 15:40:02,549][06909] Updated weights for policy 0, policy_version 24652 (0.0023) [2024-06-27 15:40:03,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43690.6, 300 sec: 43320.4). Total num frames: 403931136. Throughput: 0: 43581.8. Samples: 306868280. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 15:40:03,850][06674] Avg episode reward: [(0, '0.405')] [2024-06-27 15:40:06,293][06909] Updated weights for policy 0, policy_version 24662 (0.0027) [2024-06-27 15:40:08,852][06674] Fps is (10 sec: 44228.4, 60 sec: 43689.3, 300 sec: 43653.3). Total num frames: 404176896. Throughput: 0: 43634.9. Samples: 307124740. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 15:40:08,852][06674] Avg episode reward: [(0, '0.406')] [2024-06-27 15:40:10,007][06909] Updated weights for policy 0, policy_version 24672 (0.0037) [2024-06-27 15:40:13,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43420.1, 300 sec: 43487.0). Total num frames: 404373504. Throughput: 0: 43732.8. Samples: 307262580. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 15:40:13,850][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:40:14,091][06909] Updated weights for policy 0, policy_version 24682 (0.0029) [2024-06-27 15:40:17,515][06909] Updated weights for policy 0, policy_version 24692 (0.0042) [2024-06-27 15:40:18,850][06674] Fps is (10 sec: 40968.7, 60 sec: 43690.7, 300 sec: 43376.0). Total num frames: 404586496. Throughput: 0: 43652.0. Samples: 307519640. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 15:40:18,850][06674] Avg episode reward: [(0, '0.404')] [2024-06-27 15:40:21,597][06909] Updated weights for policy 0, policy_version 24702 (0.0022) [2024-06-27 15:40:23,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 404832256. Throughput: 0: 43615.2. Samples: 307777720. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 15:40:23,850][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:40:24,824][06887] Signal inference workers to stop experience collection... (4350 times) [2024-06-27 15:40:24,824][06887] Signal inference workers to resume experience collection... (4350 times) [2024-06-27 15:40:24,863][06909] InferenceWorker_p0-w0: stopping experience collection (4350 times) [2024-06-27 15:40:24,863][06909] InferenceWorker_p0-w0: resuming experience collection (4350 times) [2024-06-27 15:40:24,950][06909] Updated weights for policy 0, policy_version 24712 (0.0029) [2024-06-27 15:40:28,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 405012480. Throughput: 0: 43593.1. Samples: 307910980. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 15:40:28,850][06674] Avg episode reward: [(0, '0.400')] [2024-06-27 15:40:29,094][06909] Updated weights for policy 0, policy_version 24722 (0.0028) [2024-06-27 15:40:32,362][06909] Updated weights for policy 0, policy_version 24732 (0.0037) [2024-06-27 15:40:33,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43690.5, 300 sec: 43431.5). Total num frames: 405258240. Throughput: 0: 43702.0. Samples: 308175440. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-27 15:40:33,851][06674] Avg episode reward: [(0, '0.397')] [2024-06-27 15:40:36,571][06909] Updated weights for policy 0, policy_version 24742 (0.0028) [2024-06-27 15:40:38,852][06674] Fps is (10 sec: 45865.7, 60 sec: 43416.2, 300 sec: 43542.3). Total num frames: 405471232. Throughput: 0: 43684.7. Samples: 308433760. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-27 15:40:38,852][06674] Avg episode reward: [(0, '0.396')] [2024-06-27 15:40:40,010][06909] Updated weights for policy 0, policy_version 24752 (0.0036) [2024-06-27 15:40:43,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43419.0, 300 sec: 43487.0). Total num frames: 405684224. Throughput: 0: 43467.7. Samples: 308565020. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-27 15:40:43,851][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:40:44,083][06909] Updated weights for policy 0, policy_version 24762 (0.0027) [2024-06-27 15:40:47,593][06909] Updated weights for policy 0, policy_version 24772 (0.0023) [2024-06-27 15:40:48,850][06674] Fps is (10 sec: 44245.5, 60 sec: 43963.7, 300 sec: 43431.5). Total num frames: 405913600. Throughput: 0: 43533.3. Samples: 308827280. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-27 15:40:48,850][06674] Avg episode reward: [(0, '0.399')] [2024-06-27 15:40:51,726][06909] Updated weights for policy 0, policy_version 24782 (0.0031) [2024-06-27 15:40:53,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43417.7, 300 sec: 43598.1). Total num frames: 406126592. Throughput: 0: 43431.4. Samples: 309079060. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2024-06-27 15:40:53,850][06674] Avg episode reward: [(0, '0.396')] [2024-06-27 15:40:55,365][06909] Updated weights for policy 0, policy_version 24792 (0.0035) [2024-06-27 15:40:58,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43144.6, 300 sec: 43487.0). Total num frames: 406323200. Throughput: 0: 43307.5. Samples: 309211420. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2024-06-27 15:40:58,850][06674] Avg episode reward: [(0, '0.387')] [2024-06-27 15:40:59,258][06909] Updated weights for policy 0, policy_version 24802 (0.0042) [2024-06-27 15:41:03,001][06909] Updated weights for policy 0, policy_version 24812 (0.0028) [2024-06-27 15:41:03,850][06674] Fps is (10 sec: 42597.7, 60 sec: 43690.6, 300 sec: 43431.5). Total num frames: 406552576. Throughput: 0: 43321.6. Samples: 309469120. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2024-06-27 15:41:03,851][06674] Avg episode reward: [(0, '0.397')] [2024-06-27 15:41:06,973][06909] Updated weights for policy 0, policy_version 24822 (0.0038) [2024-06-27 15:41:08,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43145.9, 300 sec: 43542.5). Total num frames: 406765568. Throughput: 0: 43334.6. Samples: 309727780. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-27 15:41:08,850][06674] Avg episode reward: [(0, '0.398')] [2024-06-27 15:41:10,403][06909] Updated weights for policy 0, policy_version 24832 (0.0046) [2024-06-27 15:41:13,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43417.6, 300 sec: 43487.0). Total num frames: 406978560. Throughput: 0: 43343.1. Samples: 309861420. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-27 15:41:13,850][06674] Avg episode reward: [(0, '0.399')] [2024-06-27 15:41:14,466][06909] Updated weights for policy 0, policy_version 24842 (0.0024) [2024-06-27 15:41:17,773][06909] Updated weights for policy 0, policy_version 24852 (0.0031) [2024-06-27 15:41:18,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43690.6, 300 sec: 43487.0). Total num frames: 407207936. Throughput: 0: 43247.7. Samples: 310121580. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-27 15:41:18,850][06674] Avg episode reward: [(0, '0.404')] [2024-06-27 15:41:21,915][06909] Updated weights for policy 0, policy_version 24862 (0.0031) [2024-06-27 15:41:23,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43144.6, 300 sec: 43542.6). Total num frames: 407420928. Throughput: 0: 43300.7. Samples: 310382200. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 15:41:23,850][06674] Avg episode reward: [(0, '0.399')] [2024-06-27 15:41:25,286][06909] Updated weights for policy 0, policy_version 24872 (0.0033) [2024-06-27 15:41:28,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43690.6, 300 sec: 43487.0). Total num frames: 407633920. Throughput: 0: 43347.9. Samples: 310515680. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 15:41:28,850][06674] Avg episode reward: [(0, '0.401')] [2024-06-27 15:41:29,477][06909] Updated weights for policy 0, policy_version 24882 (0.0034) [2024-06-27 15:41:32,919][06909] Updated weights for policy 0, policy_version 24892 (0.0034) [2024-06-27 15:41:33,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43144.7, 300 sec: 43487.0). Total num frames: 407846912. Throughput: 0: 43240.5. Samples: 310773100. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 15:41:33,850][06674] Avg episode reward: [(0, '0.385')] [2024-06-27 15:41:37,098][06909] Updated weights for policy 0, policy_version 24902 (0.0044) [2024-06-27 15:41:38,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43419.0, 300 sec: 43542.6). Total num frames: 408076288. Throughput: 0: 43381.7. Samples: 311031240. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 15:41:38,850][06674] Avg episode reward: [(0, '0.403')] [2024-06-27 15:41:40,754][06909] Updated weights for policy 0, policy_version 24912 (0.0029) [2024-06-27 15:41:43,850][06674] Fps is (10 sec: 40960.2, 60 sec: 42871.5, 300 sec: 43431.5). Total num frames: 408256512. Throughput: 0: 43293.0. Samples: 311159600. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-27 15:41:43,850][06674] Avg episode reward: [(0, '0.409')] [2024-06-27 15:41:44,565][06909] Updated weights for policy 0, policy_version 24922 (0.0025) [2024-06-27 15:41:48,193][06909] Updated weights for policy 0, policy_version 24932 (0.0028) [2024-06-27 15:41:48,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43144.5, 300 sec: 43431.5). Total num frames: 408502272. Throughput: 0: 43438.3. Samples: 311423840. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-27 15:41:48,850][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:41:48,870][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000024933_408502272.pth... [2024-06-27 15:41:48,926][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000024297_398082048.pth [2024-06-27 15:41:49,119][06887] Signal inference workers to stop experience collection... (4400 times) [2024-06-27 15:41:49,120][06887] Signal inference workers to resume experience collection... (4400 times) [2024-06-27 15:41:49,137][06909] InferenceWorker_p0-w0: stopping experience collection (4400 times) [2024-06-27 15:41:49,138][06909] InferenceWorker_p0-w0: resuming experience collection (4400 times) [2024-06-27 15:41:51,993][06909] Updated weights for policy 0, policy_version 24942 (0.0029) [2024-06-27 15:41:53,850][06674] Fps is (10 sec: 47513.4, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 408731648. Throughput: 0: 43320.1. Samples: 311677180. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-27 15:41:53,850][06674] Avg episode reward: [(0, '0.405')] [2024-06-27 15:41:55,953][06909] Updated weights for policy 0, policy_version 24952 (0.0039) [2024-06-27 15:41:58,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43417.6, 300 sec: 43487.0). Total num frames: 408928256. Throughput: 0: 43364.8. Samples: 311812840. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 15:41:58,850][06674] Avg episode reward: [(0, '0.394')] [2024-06-27 15:41:59,437][06909] Updated weights for policy 0, policy_version 24962 (0.0035) [2024-06-27 15:42:03,399][06909] Updated weights for policy 0, policy_version 24972 (0.0038) [2024-06-27 15:42:03,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43417.6, 300 sec: 43487.0). Total num frames: 409157632. Throughput: 0: 43449.7. Samples: 312076820. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 15:42:03,851][06674] Avg episode reward: [(0, '0.401')] [2024-06-27 15:42:07,316][06909] Updated weights for policy 0, policy_version 24982 (0.0033) [2024-06-27 15:42:08,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43417.6, 300 sec: 43542.5). Total num frames: 409370624. Throughput: 0: 43211.4. Samples: 312326720. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 15:42:08,850][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:42:11,063][06909] Updated weights for policy 0, policy_version 24992 (0.0024) [2024-06-27 15:42:13,850][06674] Fps is (10 sec: 42599.2, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 409583616. Throughput: 0: 43312.6. Samples: 312464740. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 15:42:13,850][06674] Avg episode reward: [(0, '0.406')] [2024-06-27 15:42:14,845][06909] Updated weights for policy 0, policy_version 25002 (0.0038) [2024-06-27 15:42:18,657][06909] Updated weights for policy 0, policy_version 25012 (0.0033) [2024-06-27 15:42:18,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43144.5, 300 sec: 43431.5). Total num frames: 409796608. Throughput: 0: 43379.5. Samples: 312725180. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 15:42:18,850][06674] Avg episode reward: [(0, '0.403')] [2024-06-27 15:42:22,320][06909] Updated weights for policy 0, policy_version 25022 (0.0030) [2024-06-27 15:42:23,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43417.5, 300 sec: 43542.5). Total num frames: 410025984. Throughput: 0: 43416.9. Samples: 312985000. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 15:42:23,851][06674] Avg episode reward: [(0, '0.399')] [2024-06-27 15:42:26,005][06909] Updated weights for policy 0, policy_version 25032 (0.0041) [2024-06-27 15:42:28,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 410238976. Throughput: 0: 43589.2. Samples: 313121120. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 15:42:28,850][06674] Avg episode reward: [(0, '0.400')] [2024-06-27 15:42:29,612][06909] Updated weights for policy 0, policy_version 25042 (0.0045) [2024-06-27 15:42:33,564][06909] Updated weights for policy 0, policy_version 25052 (0.0028) [2024-06-27 15:42:33,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 410451968. Throughput: 0: 43533.8. Samples: 313382860. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 15:42:33,850][06674] Avg episode reward: [(0, '0.400')] [2024-06-27 15:42:37,247][06909] Updated weights for policy 0, policy_version 25062 (0.0037) [2024-06-27 15:42:38,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43144.6, 300 sec: 43431.5). Total num frames: 410664960. Throughput: 0: 43638.7. Samples: 313640920. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 15:42:38,850][06674] Avg episode reward: [(0, '0.401')] [2024-06-27 15:42:41,047][06909] Updated weights for policy 0, policy_version 25072 (0.0031) [2024-06-27 15:42:43,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.7, 300 sec: 43432.4). Total num frames: 410894336. Throughput: 0: 43475.6. Samples: 313769240. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 15:42:43,850][06674] Avg episode reward: [(0, '0.405')] [2024-06-27 15:42:44,790][06909] Updated weights for policy 0, policy_version 25082 (0.0031) [2024-06-27 15:42:48,520][06909] Updated weights for policy 0, policy_version 25092 (0.0036) [2024-06-27 15:42:48,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 411107328. Throughput: 0: 43393.4. Samples: 314029520. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 15:42:48,851][06674] Avg episode reward: [(0, '0.403')] [2024-06-27 15:42:52,467][06909] Updated weights for policy 0, policy_version 25102 (0.0030) [2024-06-27 15:42:53,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43417.6, 300 sec: 43487.0). Total num frames: 411336704. Throughput: 0: 43544.1. Samples: 314286200. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 15:42:53,850][06674] Avg episode reward: [(0, '0.409')] [2024-06-27 15:42:56,333][06909] Updated weights for policy 0, policy_version 25112 (0.0032) [2024-06-27 15:42:58,852][06674] Fps is (10 sec: 42590.1, 60 sec: 43416.2, 300 sec: 43375.9). Total num frames: 411533312. Throughput: 0: 43538.9. Samples: 314424080. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 15:42:58,852][06674] Avg episode reward: [(0, '0.407')] [2024-06-27 15:42:59,831][06909] Updated weights for policy 0, policy_version 25122 (0.0023) [2024-06-27 15:43:03,707][06909] Updated weights for policy 0, policy_version 25132 (0.0026) [2024-06-27 15:43:03,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43417.7, 300 sec: 43431.5). Total num frames: 411762688. Throughput: 0: 43491.6. Samples: 314682300. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 15:43:03,850][06674] Avg episode reward: [(0, '0.406')] [2024-06-27 15:43:07,290][06909] Updated weights for policy 0, policy_version 25142 (0.0041) [2024-06-27 15:43:08,850][06674] Fps is (10 sec: 44245.3, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 411975680. Throughput: 0: 43406.2. Samples: 314938280. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 15:43:08,850][06674] Avg episode reward: [(0, '0.409')] [2024-06-27 15:43:11,173][06909] Updated weights for policy 0, policy_version 25152 (0.0032) [2024-06-27 15:43:13,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 412188672. Throughput: 0: 43361.9. Samples: 315072400. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 15:43:13,850][06674] Avg episode reward: [(0, '0.398')] [2024-06-27 15:43:15,096][06909] Updated weights for policy 0, policy_version 25162 (0.0036) [2024-06-27 15:43:18,848][06909] Updated weights for policy 0, policy_version 25172 (0.0037) [2024-06-27 15:43:18,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.7, 300 sec: 43487.0). Total num frames: 412418048. Throughput: 0: 43372.0. Samples: 315334600. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 15:43:18,850][06674] Avg episode reward: [(0, '0.405')] [2024-06-27 15:43:22,653][06909] Updated weights for policy 0, policy_version 25182 (0.0036) [2024-06-27 15:43:23,076][06887] Signal inference workers to stop experience collection... (4450 times) [2024-06-27 15:43:23,076][06887] Signal inference workers to resume experience collection... (4450 times) [2024-06-27 15:43:23,128][06909] InferenceWorker_p0-w0: stopping experience collection (4450 times) [2024-06-27 15:43:23,128][06909] InferenceWorker_p0-w0: resuming experience collection (4450 times) [2024-06-27 15:43:23,850][06674] Fps is (10 sec: 44236.0, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 412631040. Throughput: 0: 43218.9. Samples: 315585780. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-27 15:43:23,850][06674] Avg episode reward: [(0, '0.399')] [2024-06-27 15:43:26,294][06909] Updated weights for policy 0, policy_version 25192 (0.0039) [2024-06-27 15:43:28,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 412844032. Throughput: 0: 43422.6. Samples: 315723260. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-27 15:43:28,851][06674] Avg episode reward: [(0, '0.403')] [2024-06-27 15:43:30,158][06909] Updated weights for policy 0, policy_version 25202 (0.0032) [2024-06-27 15:43:33,850][06674] Fps is (10 sec: 42599.4, 60 sec: 43417.7, 300 sec: 43431.5). Total num frames: 413057024. Throughput: 0: 43406.9. Samples: 315982820. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-27 15:43:33,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 15:43:33,872][06909] Updated weights for policy 0, policy_version 25212 (0.0040) [2024-06-27 15:43:37,992][06909] Updated weights for policy 0, policy_version 25222 (0.0036) [2024-06-27 15:43:38,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.6, 300 sec: 43487.0). Total num frames: 413286400. Throughput: 0: 43502.2. Samples: 316243800. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-27 15:43:38,850][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:43:41,310][06909] Updated weights for policy 0, policy_version 25232 (0.0028) [2024-06-27 15:43:43,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43417.6, 300 sec: 43487.0). Total num frames: 413499392. Throughput: 0: 43204.2. Samples: 316368180. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 15:43:43,850][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:43:45,546][06909] Updated weights for policy 0, policy_version 25242 (0.0030) [2024-06-27 15:43:48,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43417.7, 300 sec: 43431.5). Total num frames: 413712384. Throughput: 0: 43276.1. Samples: 316629720. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 15:43:48,850][06674] Avg episode reward: [(0, '0.406')] [2024-06-27 15:43:48,950][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000025252_413728768.pth... [2024-06-27 15:43:48,956][06909] Updated weights for policy 0, policy_version 25252 (0.0028) [2024-06-27 15:43:48,999][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000024615_403292160.pth [2024-06-27 15:43:53,023][06909] Updated weights for policy 0, policy_version 25262 (0.0037) [2024-06-27 15:43:53,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43144.5, 300 sec: 43431.5). Total num frames: 413925376. Throughput: 0: 43266.3. Samples: 316885260. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 15:43:53,850][06674] Avg episode reward: [(0, '0.408')] [2024-06-27 15:43:56,614][06909] Updated weights for policy 0, policy_version 25272 (0.0023) [2024-06-27 15:43:58,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43419.1, 300 sec: 43487.0). Total num frames: 414138368. Throughput: 0: 43239.1. Samples: 317018160. Policy #0 lag: (min: 1.0, avg: 11.0, max: 23.0) [2024-06-27 15:43:58,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 15:44:00,457][06909] Updated weights for policy 0, policy_version 25282 (0.0032) [2024-06-27 15:44:03,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43144.5, 300 sec: 43376.0). Total num frames: 414351360. Throughput: 0: 43246.6. Samples: 317280700. Policy #0 lag: (min: 1.0, avg: 11.0, max: 23.0) [2024-06-27 15:44:03,852][06674] Avg episode reward: [(0, '0.395')] [2024-06-27 15:44:04,121][06909] Updated weights for policy 0, policy_version 25292 (0.0028) [2024-06-27 15:44:08,124][06909] Updated weights for policy 0, policy_version 25302 (0.0030) [2024-06-27 15:44:08,852][06674] Fps is (10 sec: 42589.7, 60 sec: 43143.1, 300 sec: 43376.2). Total num frames: 414564352. Throughput: 0: 43243.9. Samples: 317531840. Policy #0 lag: (min: 1.0, avg: 11.0, max: 23.0) [2024-06-27 15:44:08,852][06674] Avg episode reward: [(0, '0.406')] [2024-06-27 15:44:11,531][06909] Updated weights for policy 0, policy_version 25312 (0.0023) [2024-06-27 15:44:13,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43417.6, 300 sec: 43487.0). Total num frames: 414793728. Throughput: 0: 43139.7. Samples: 317664540. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 15:44:13,850][06674] Avg episode reward: [(0, '0.403')] [2024-06-27 15:44:15,712][06909] Updated weights for policy 0, policy_version 25322 (0.0033) [2024-06-27 15:44:18,850][06674] Fps is (10 sec: 44245.8, 60 sec: 43144.6, 300 sec: 43375.9). Total num frames: 415006720. Throughput: 0: 43132.3. Samples: 317923780. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 15:44:18,850][06674] Avg episode reward: [(0, '0.408')] [2024-06-27 15:44:19,351][06909] Updated weights for policy 0, policy_version 25332 (0.0032) [2024-06-27 15:44:23,526][06909] Updated weights for policy 0, policy_version 25342 (0.0046) [2024-06-27 15:44:23,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43144.6, 300 sec: 43431.5). Total num frames: 415219712. Throughput: 0: 43187.5. Samples: 318187240. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 15:44:23,850][06674] Avg episode reward: [(0, '0.409')] [2024-06-27 15:44:26,835][06909] Updated weights for policy 0, policy_version 25352 (0.0036) [2024-06-27 15:44:28,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43417.5, 300 sec: 43431.5). Total num frames: 415449088. Throughput: 0: 43308.7. Samples: 318317080. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 15:44:28,851][06674] Avg episode reward: [(0, '0.409')] [2024-06-27 15:44:30,894][06909] Updated weights for policy 0, policy_version 25362 (0.0026) [2024-06-27 15:44:33,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43417.5, 300 sec: 43375.9). Total num frames: 415662080. Throughput: 0: 43448.3. Samples: 318584900. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-27 15:44:33,850][06674] Avg episode reward: [(0, '0.407')] [2024-06-27 15:44:34,181][06909] Updated weights for policy 0, policy_version 25372 (0.0034) [2024-06-27 15:44:38,597][06909] Updated weights for policy 0, policy_version 25382 (0.0032) [2024-06-27 15:44:38,850][06674] Fps is (10 sec: 40960.5, 60 sec: 42871.5, 300 sec: 43320.7). Total num frames: 415858688. Throughput: 0: 43542.2. Samples: 318844660. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-27 15:44:38,850][06674] Avg episode reward: [(0, '0.405')] [2024-06-27 15:44:41,645][06909] Updated weights for policy 0, policy_version 25392 (0.0035) [2024-06-27 15:44:43,852][06674] Fps is (10 sec: 44228.2, 60 sec: 43416.1, 300 sec: 43486.7). Total num frames: 416104448. Throughput: 0: 43436.3. Samples: 318972880. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-27 15:44:43,861][06674] Avg episode reward: [(0, '0.405')] [2024-06-27 15:44:46,130][06909] Updated weights for policy 0, policy_version 25402 (0.0028) [2024-06-27 15:44:48,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43417.6, 300 sec: 43375.9). Total num frames: 416317440. Throughput: 0: 43463.6. Samples: 319236560. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2024-06-27 15:44:48,850][06674] Avg episode reward: [(0, '0.403')] [2024-06-27 15:44:49,355][06909] Updated weights for policy 0, policy_version 25412 (0.0040) [2024-06-27 15:44:49,647][06887] Signal inference workers to stop experience collection... (4500 times) [2024-06-27 15:44:49,703][06887] Signal inference workers to resume experience collection... (4500 times) [2024-06-27 15:44:49,703][06909] InferenceWorker_p0-w0: stopping experience collection (4500 times) [2024-06-27 15:44:49,716][06909] InferenceWorker_p0-w0: resuming experience collection (4500 times) [2024-06-27 15:44:53,545][06909] Updated weights for policy 0, policy_version 25422 (0.0035) [2024-06-27 15:44:53,850][06674] Fps is (10 sec: 40968.0, 60 sec: 43144.5, 300 sec: 43320.4). Total num frames: 416514048. Throughput: 0: 43610.8. Samples: 319494240. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2024-06-27 15:44:53,850][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:44:56,999][06909] Updated weights for policy 0, policy_version 25432 (0.0037) [2024-06-27 15:44:58,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.6, 300 sec: 43487.0). Total num frames: 416759808. Throughput: 0: 43537.2. Samples: 319623720. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2024-06-27 15:44:58,850][06674] Avg episode reward: [(0, '0.403')] [2024-06-27 15:45:01,314][06909] Updated weights for policy 0, policy_version 25442 (0.0030) [2024-06-27 15:45:03,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43144.6, 300 sec: 43265.2). Total num frames: 416940032. Throughput: 0: 43476.4. Samples: 319880220. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2024-06-27 15:45:03,850][06674] Avg episode reward: [(0, '0.403')] [2024-06-27 15:45:04,519][06909] Updated weights for policy 0, policy_version 25452 (0.0028) [2024-06-27 15:45:08,850][06674] Fps is (10 sec: 39321.8, 60 sec: 43146.0, 300 sec: 43320.4). Total num frames: 417153024. Throughput: 0: 43471.2. Samples: 320143440. Policy #0 lag: (min: 2.0, avg: 11.9, max: 22.0) [2024-06-27 15:45:08,850][06674] Avg episode reward: [(0, '0.404')] [2024-06-27 15:45:08,916][06909] Updated weights for policy 0, policy_version 25462 (0.0041) [2024-06-27 15:45:12,030][06909] Updated weights for policy 0, policy_version 25472 (0.0026) [2024-06-27 15:45:13,850][06674] Fps is (10 sec: 47513.6, 60 sec: 43690.6, 300 sec: 43487.0). Total num frames: 417415168. Throughput: 0: 43490.3. Samples: 320274140. Policy #0 lag: (min: 2.0, avg: 11.9, max: 22.0) [2024-06-27 15:45:13,850][06674] Avg episode reward: [(0, '0.403')] [2024-06-27 15:45:16,324][06909] Updated weights for policy 0, policy_version 25482 (0.0039) [2024-06-27 15:45:18,850][06674] Fps is (10 sec: 42598.1, 60 sec: 42871.4, 300 sec: 43209.3). Total num frames: 417579008. Throughput: 0: 43289.8. Samples: 320532940. Policy #0 lag: (min: 2.0, avg: 11.9, max: 22.0) [2024-06-27 15:45:18,850][06674] Avg episode reward: [(0, '0.395')] [2024-06-27 15:45:19,473][06909] Updated weights for policy 0, policy_version 25492 (0.0031) [2024-06-27 15:45:23,852][06674] Fps is (10 sec: 39313.9, 60 sec: 43143.1, 300 sec: 43375.6). Total num frames: 417808384. Throughput: 0: 43167.0. Samples: 320787260. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-27 15:45:23,852][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:45:23,986][06909] Updated weights for policy 0, policy_version 25502 (0.0040) [2024-06-27 15:45:27,266][06909] Updated weights for policy 0, policy_version 25512 (0.0030) [2024-06-27 15:45:28,850][06674] Fps is (10 sec: 49152.1, 60 sec: 43690.7, 300 sec: 43431.5). Total num frames: 418070528. Throughput: 0: 43211.2. Samples: 320917300. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-27 15:45:28,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 15:45:31,455][06909] Updated weights for policy 0, policy_version 25522 (0.0037) [2024-06-27 15:45:33,850][06674] Fps is (10 sec: 44245.7, 60 sec: 43144.6, 300 sec: 43320.7). Total num frames: 418250752. Throughput: 0: 43171.6. Samples: 321179280. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-27 15:45:33,856][06674] Avg episode reward: [(0, '0.403')] [2024-06-27 15:45:34,773][06909] Updated weights for policy 0, policy_version 25532 (0.0048) [2024-06-27 15:45:38,850][06674] Fps is (10 sec: 37683.8, 60 sec: 43144.6, 300 sec: 43264.9). Total num frames: 418447360. Throughput: 0: 43228.6. Samples: 321439520. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-27 15:45:38,850][06674] Avg episode reward: [(0, '0.403')] [2024-06-27 15:45:39,146][06909] Updated weights for policy 0, policy_version 25542 (0.0030) [2024-06-27 15:45:42,115][06909] Updated weights for policy 0, policy_version 25552 (0.0035) [2024-06-27 15:45:43,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43419.1, 300 sec: 43376.0). Total num frames: 418709504. Throughput: 0: 43196.5. Samples: 321567560. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-27 15:45:43,850][06674] Avg episode reward: [(0, '0.403')] [2024-06-27 15:45:46,756][06909] Updated weights for policy 0, policy_version 25562 (0.0037) [2024-06-27 15:45:48,850][06674] Fps is (10 sec: 44236.7, 60 sec: 42871.5, 300 sec: 43264.9). Total num frames: 418889728. Throughput: 0: 43373.9. Samples: 321832040. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-27 15:45:48,850][06674] Avg episode reward: [(0, '0.401')] [2024-06-27 15:45:48,858][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000025568_418906112.pth... [2024-06-27 15:45:48,910][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000024933_408502272.pth [2024-06-27 15:45:49,883][06909] Updated weights for policy 0, policy_version 25572 (0.0032) [2024-06-27 15:45:53,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43417.7, 300 sec: 43376.0). Total num frames: 419119104. Throughput: 0: 43224.0. Samples: 322088520. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-27 15:45:53,850][06674] Avg episode reward: [(0, '0.409')] [2024-06-27 15:45:54,095][06909] Updated weights for policy 0, policy_version 25582 (0.0028) [2024-06-27 15:45:57,518][06909] Updated weights for policy 0, policy_version 25592 (0.0035) [2024-06-27 15:45:58,850][06674] Fps is (10 sec: 47513.7, 60 sec: 43417.7, 300 sec: 43431.5). Total num frames: 419364864. Throughput: 0: 43209.5. Samples: 322218560. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-27 15:45:58,850][06674] Avg episode reward: [(0, '0.403')] [2024-06-27 15:46:01,697][06909] Updated weights for policy 0, policy_version 25602 (0.0035) [2024-06-27 15:46:03,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43417.6, 300 sec: 43320.4). Total num frames: 419545088. Throughput: 0: 43361.4. Samples: 322484200. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-27 15:46:03,850][06674] Avg episode reward: [(0, '0.398')] [2024-06-27 15:46:04,760][06887] Signal inference workers to stop experience collection... (4550 times) [2024-06-27 15:46:04,760][06887] Signal inference workers to resume experience collection... (4550 times) [2024-06-27 15:46:04,770][06909] InferenceWorker_p0-w0: stopping experience collection (4550 times) [2024-06-27 15:46:04,782][06909] InferenceWorker_p0-w0: resuming experience collection (4550 times) [2024-06-27 15:46:04,919][06909] Updated weights for policy 0, policy_version 25612 (0.0035) [2024-06-27 15:46:08,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43690.7, 300 sec: 43375.9). Total num frames: 419774464. Throughput: 0: 43484.6. Samples: 322743980. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-27 15:46:08,850][06674] Avg episode reward: [(0, '0.400')] [2024-06-27 15:46:09,053][06909] Updated weights for policy 0, policy_version 25622 (0.0038) [2024-06-27 15:46:12,600][06909] Updated weights for policy 0, policy_version 25632 (0.0031) [2024-06-27 15:46:13,850][06674] Fps is (10 sec: 47513.8, 60 sec: 43417.7, 300 sec: 43431.5). Total num frames: 420020224. Throughput: 0: 43511.2. Samples: 322875300. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-27 15:46:13,850][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:46:16,490][06909] Updated weights for policy 0, policy_version 25642 (0.0038) [2024-06-27 15:46:18,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.7, 300 sec: 43320.4). Total num frames: 420200448. Throughput: 0: 43465.8. Samples: 323135240. Policy #0 lag: (min: 0.0, avg: 12.3, max: 21.0) [2024-06-27 15:46:18,850][06674] Avg episode reward: [(0, '0.404')] [2024-06-27 15:46:20,068][06909] Updated weights for policy 0, policy_version 25652 (0.0034) [2024-06-27 15:46:23,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43692.2, 300 sec: 43376.0). Total num frames: 420429824. Throughput: 0: 43400.4. Samples: 323392540. Policy #0 lag: (min: 0.0, avg: 12.3, max: 21.0) [2024-06-27 15:46:23,850][06674] Avg episode reward: [(0, '0.399')] [2024-06-27 15:46:23,907][06909] Updated weights for policy 0, policy_version 25662 (0.0025) [2024-06-27 15:46:27,464][06909] Updated weights for policy 0, policy_version 25672 (0.0036) [2024-06-27 15:46:28,852][06674] Fps is (10 sec: 47503.7, 60 sec: 43416.2, 300 sec: 43486.7). Total num frames: 420675584. Throughput: 0: 43558.9. Samples: 323527800. Policy #0 lag: (min: 0.0, avg: 12.3, max: 21.0) [2024-06-27 15:46:28,852][06674] Avg episode reward: [(0, '0.403')] [2024-06-27 15:46:31,255][06909] Updated weights for policy 0, policy_version 25682 (0.0034) [2024-06-27 15:46:33,851][06674] Fps is (10 sec: 40956.9, 60 sec: 43144.0, 300 sec: 43264.8). Total num frames: 420839424. Throughput: 0: 43548.6. Samples: 323791760. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-27 15:46:33,851][06674] Avg episode reward: [(0, '0.403')] [2024-06-27 15:46:34,874][06909] Updated weights for policy 0, policy_version 25692 (0.0037) [2024-06-27 15:46:38,850][06674] Fps is (10 sec: 40968.2, 60 sec: 43963.6, 300 sec: 43487.0). Total num frames: 421085184. Throughput: 0: 43596.4. Samples: 324050360. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-27 15:46:38,850][06674] Avg episode reward: [(0, '0.406')] [2024-06-27 15:46:39,279][06909] Updated weights for policy 0, policy_version 25702 (0.0035) [2024-06-27 15:46:42,302][06909] Updated weights for policy 0, policy_version 25712 (0.0031) [2024-06-27 15:46:43,850][06674] Fps is (10 sec: 47516.7, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 421314560. Throughput: 0: 43699.4. Samples: 324185040. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-27 15:46:43,850][06674] Avg episode reward: [(0, '0.406')] [2024-06-27 15:46:46,587][06909] Updated weights for policy 0, policy_version 25722 (0.0040) [2024-06-27 15:46:48,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43417.6, 300 sec: 43264.9). Total num frames: 421494784. Throughput: 0: 43592.1. Samples: 324445840. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2024-06-27 15:46:48,850][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:46:49,792][06909] Updated weights for policy 0, policy_version 25732 (0.0041) [2024-06-27 15:46:53,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43417.6, 300 sec: 43376.0). Total num frames: 421724160. Throughput: 0: 43490.2. Samples: 324701040. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2024-06-27 15:46:53,850][06674] Avg episode reward: [(0, '0.404')] [2024-06-27 15:46:54,142][06909] Updated weights for policy 0, policy_version 25742 (0.0027) [2024-06-27 15:46:57,717][06909] Updated weights for policy 0, policy_version 25752 (0.0040) [2024-06-27 15:46:58,850][06674] Fps is (10 sec: 47513.1, 60 sec: 43417.5, 300 sec: 43431.5). Total num frames: 421969920. Throughput: 0: 43581.7. Samples: 324836480. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2024-06-27 15:46:58,850][06674] Avg episode reward: [(0, '0.404')] [2024-06-27 15:47:01,514][06909] Updated weights for policy 0, policy_version 25762 (0.0033) [2024-06-27 15:47:03,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43417.5, 300 sec: 43320.4). Total num frames: 422150144. Throughput: 0: 43613.6. Samples: 325097860. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2024-06-27 15:47:03,850][06674] Avg episode reward: [(0, '0.405')] [2024-06-27 15:47:05,171][06909] Updated weights for policy 0, policy_version 25772 (0.0037) [2024-06-27 15:47:08,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.6, 300 sec: 43431.5). Total num frames: 422395904. Throughput: 0: 43517.2. Samples: 325350820. Policy #0 lag: (min: 1.0, avg: 12.0, max: 26.0) [2024-06-27 15:47:08,850][06674] Avg episode reward: [(0, '0.401')] [2024-06-27 15:47:08,926][06909] Updated weights for policy 0, policy_version 25782 (0.0026) [2024-06-27 15:47:12,710][06909] Updated weights for policy 0, policy_version 25792 (0.0036) [2024-06-27 15:47:13,850][06674] Fps is (10 sec: 47514.6, 60 sec: 43417.6, 300 sec: 43487.0). Total num frames: 422625280. Throughput: 0: 43504.7. Samples: 325485420. Policy #0 lag: (min: 1.0, avg: 12.0, max: 26.0) [2024-06-27 15:47:13,850][06674] Avg episode reward: [(0, '0.404')] [2024-06-27 15:47:16,858][06909] Updated weights for policy 0, policy_version 25802 (0.0029) [2024-06-27 15:47:18,170][06887] Signal inference workers to stop experience collection... (4600 times) [2024-06-27 15:47:18,170][06887] Signal inference workers to resume experience collection... (4600 times) [2024-06-27 15:47:18,185][06909] InferenceWorker_p0-w0: stopping experience collection (4600 times) [2024-06-27 15:47:18,185][06909] InferenceWorker_p0-w0: resuming experience collection (4600 times) [2024-06-27 15:47:18,850][06674] Fps is (10 sec: 40960.7, 60 sec: 43417.6, 300 sec: 43320.4). Total num frames: 422805504. Throughput: 0: 43338.6. Samples: 325741960. Policy #0 lag: (min: 1.0, avg: 12.0, max: 26.0) [2024-06-27 15:47:18,850][06674] Avg episode reward: [(0, '0.406')] [2024-06-27 15:47:20,492][06909] Updated weights for policy 0, policy_version 25812 (0.0024) [2024-06-27 15:47:23,850][06674] Fps is (10 sec: 39321.8, 60 sec: 43144.6, 300 sec: 43320.4). Total num frames: 423018496. Throughput: 0: 43353.9. Samples: 326001280. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 15:47:23,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 15:47:24,429][06909] Updated weights for policy 0, policy_version 25822 (0.0028) [2024-06-27 15:47:27,955][06909] Updated weights for policy 0, policy_version 25832 (0.0049) [2024-06-27 15:47:28,850][06674] Fps is (10 sec: 47512.7, 60 sec: 43419.0, 300 sec: 43487.0). Total num frames: 423280640. Throughput: 0: 43309.3. Samples: 326133960. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 15:47:28,851][06674] Avg episode reward: [(0, '0.409')] [2024-06-27 15:47:32,194][06909] Updated weights for policy 0, policy_version 25842 (0.0032) [2024-06-27 15:47:33,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43418.1, 300 sec: 43320.4). Total num frames: 423444480. Throughput: 0: 43200.4. Samples: 326389860. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 15:47:33,850][06674] Avg episode reward: [(0, '0.407')] [2024-06-27 15:47:35,468][06909] Updated weights for policy 0, policy_version 25852 (0.0038) [2024-06-27 15:47:38,850][06674] Fps is (10 sec: 40960.6, 60 sec: 43417.6, 300 sec: 43375.9). Total num frames: 423690240. Throughput: 0: 43400.0. Samples: 326654040. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 15:47:38,850][06674] Avg episode reward: [(0, '0.406')] [2024-06-27 15:47:39,804][06909] Updated weights for policy 0, policy_version 25862 (0.0033) [2024-06-27 15:47:42,862][06909] Updated weights for policy 0, policy_version 25872 (0.0023) [2024-06-27 15:47:43,850][06674] Fps is (10 sec: 49151.7, 60 sec: 43690.7, 300 sec: 43487.0). Total num frames: 423936000. Throughput: 0: 43375.1. Samples: 326788360. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-27 15:47:43,850][06674] Avg episode reward: [(0, '0.408')] [2024-06-27 15:47:47,393][06909] Updated weights for policy 0, policy_version 25882 (0.0031) [2024-06-27 15:47:48,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43417.5, 300 sec: 43264.9). Total num frames: 424099840. Throughput: 0: 43455.6. Samples: 327053360. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-27 15:47:48,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 15:47:48,866][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000025885_424099840.pth... [2024-06-27 15:47:48,917][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000025252_413728768.pth [2024-06-27 15:47:50,288][06909] Updated weights for policy 0, policy_version 25892 (0.0040) [2024-06-27 15:47:53,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43690.7, 300 sec: 43431.8). Total num frames: 424345600. Throughput: 0: 43424.1. Samples: 327304900. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-27 15:47:53,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 15:47:55,035][06909] Updated weights for policy 0, policy_version 25902 (0.0031) [2024-06-27 15:47:57,851][06909] Updated weights for policy 0, policy_version 25912 (0.0041) [2024-06-27 15:47:58,850][06674] Fps is (10 sec: 45876.0, 60 sec: 43144.6, 300 sec: 43376.0). Total num frames: 424558592. Throughput: 0: 43380.0. Samples: 327437520. Policy #0 lag: (min: 2.0, avg: 11.1, max: 24.0) [2024-06-27 15:47:58,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 15:48:02,420][06909] Updated weights for policy 0, policy_version 25922 (0.0031) [2024-06-27 15:48:03,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43417.7, 300 sec: 43320.4). Total num frames: 424755200. Throughput: 0: 43370.6. Samples: 327693640. Policy #0 lag: (min: 2.0, avg: 11.1, max: 24.0) [2024-06-27 15:48:03,850][06674] Avg episode reward: [(0, '0.405')] [2024-06-27 15:48:05,645][06909] Updated weights for policy 0, policy_version 25932 (0.0037) [2024-06-27 15:48:08,850][06674] Fps is (10 sec: 40960.0, 60 sec: 42871.6, 300 sec: 43320.4). Total num frames: 424968192. Throughput: 0: 43399.5. Samples: 327954260. Policy #0 lag: (min: 2.0, avg: 11.1, max: 24.0) [2024-06-27 15:48:08,850][06674] Avg episode reward: [(0, '0.405')] [2024-06-27 15:48:10,051][06909] Updated weights for policy 0, policy_version 25942 (0.0035) [2024-06-27 15:48:13,133][06909] Updated weights for policy 0, policy_version 25952 (0.0036) [2024-06-27 15:48:13,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43144.5, 300 sec: 43375.9). Total num frames: 425213952. Throughput: 0: 43315.7. Samples: 328083160. Policy #0 lag: (min: 2.0, avg: 11.1, max: 24.0) [2024-06-27 15:48:13,850][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:48:17,796][06909] Updated weights for policy 0, policy_version 25962 (0.0032) [2024-06-27 15:48:18,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43144.5, 300 sec: 43264.9). Total num frames: 425394176. Throughput: 0: 43412.1. Samples: 328343400. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 15:48:18,850][06674] Avg episode reward: [(0, '0.406')] [2024-06-27 15:48:20,685][06909] Updated weights for policy 0, policy_version 25972 (0.0033) [2024-06-27 15:48:23,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.5, 300 sec: 43375.9). Total num frames: 425639936. Throughput: 0: 43334.6. Samples: 328604100. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 15:48:23,850][06674] Avg episode reward: [(0, '0.407')] [2024-06-27 15:48:25,390][06909] Updated weights for policy 0, policy_version 25982 (0.0028) [2024-06-27 15:48:28,186][06909] Updated weights for policy 0, policy_version 25992 (0.0044) [2024-06-27 15:48:28,850][06674] Fps is (10 sec: 49151.6, 60 sec: 43417.7, 300 sec: 43487.0). Total num frames: 425885696. Throughput: 0: 43314.2. Samples: 328737500. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 15:48:28,850][06674] Avg episode reward: [(0, '0.406')] [2024-06-27 15:48:32,815][06909] Updated weights for policy 0, policy_version 26002 (0.0035) [2024-06-27 15:48:33,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43417.6, 300 sec: 43264.9). Total num frames: 426049536. Throughput: 0: 43129.8. Samples: 328994200. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-27 15:48:33,850][06674] Avg episode reward: [(0, '0.407')] [2024-06-27 15:48:35,924][06909] Updated weights for policy 0, policy_version 26012 (0.0038) [2024-06-27 15:48:38,850][06674] Fps is (10 sec: 39321.6, 60 sec: 43144.5, 300 sec: 43320.4). Total num frames: 426278912. Throughput: 0: 43220.4. Samples: 329249820. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-27 15:48:38,850][06674] Avg episode reward: [(0, '0.406')] [2024-06-27 15:48:40,261][06909] Updated weights for policy 0, policy_version 26022 (0.0025) [2024-06-27 15:48:43,157][06887] Signal inference workers to stop experience collection... (4650 times) [2024-06-27 15:48:43,157][06887] Signal inference workers to resume experience collection... (4650 times) [2024-06-27 15:48:43,204][06909] InferenceWorker_p0-w0: stopping experience collection (4650 times) [2024-06-27 15:48:43,204][06909] InferenceWorker_p0-w0: resuming experience collection (4650 times) [2024-06-27 15:48:43,289][06909] Updated weights for policy 0, policy_version 26032 (0.0031) [2024-06-27 15:48:43,850][06674] Fps is (10 sec: 47514.0, 60 sec: 43144.6, 300 sec: 43431.5). Total num frames: 426524672. Throughput: 0: 43333.7. Samples: 329387540. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-27 15:48:43,850][06674] Avg episode reward: [(0, '0.406')] [2024-06-27 15:48:47,710][06909] Updated weights for policy 0, policy_version 26042 (0.0039) [2024-06-27 15:48:48,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43417.7, 300 sec: 43320.4). Total num frames: 426704896. Throughput: 0: 43223.1. Samples: 329638680. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-27 15:48:48,850][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:48:50,843][06909] Updated weights for policy 0, policy_version 26052 (0.0032) [2024-06-27 15:48:53,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43144.5, 300 sec: 43375.9). Total num frames: 426934272. Throughput: 0: 43275.9. Samples: 329901680. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 15:48:53,850][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:48:55,113][06909] Updated weights for policy 0, policy_version 26062 (0.0031) [2024-06-27 15:48:58,315][06909] Updated weights for policy 0, policy_version 26072 (0.0041) [2024-06-27 15:48:58,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 427163648. Throughput: 0: 43364.9. Samples: 330034580. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 15:48:58,850][06674] Avg episode reward: [(0, '0.404')] [2024-06-27 15:49:02,732][06909] Updated weights for policy 0, policy_version 26082 (0.0031) [2024-06-27 15:49:03,852][06674] Fps is (10 sec: 42589.7, 60 sec: 43416.1, 300 sec: 43375.9). Total num frames: 427360256. Throughput: 0: 43284.2. Samples: 330291280. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 15:49:03,852][06674] Avg episode reward: [(0, '0.407')] [2024-06-27 15:49:05,989][06909] Updated weights for policy 0, policy_version 26092 (0.0025) [2024-06-27 15:49:08,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.6, 300 sec: 43375.9). Total num frames: 427589632. Throughput: 0: 43391.7. Samples: 330556720. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-27 15:49:08,850][06674] Avg episode reward: [(0, '0.407')] [2024-06-27 15:49:10,343][06909] Updated weights for policy 0, policy_version 26102 (0.0038) [2024-06-27 15:49:13,634][06909] Updated weights for policy 0, policy_version 26112 (0.0033) [2024-06-27 15:49:13,850][06674] Fps is (10 sec: 45884.4, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 427819008. Throughput: 0: 43284.0. Samples: 330685280. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-27 15:49:13,851][06674] Avg episode reward: [(0, '0.405')] [2024-06-27 15:49:18,045][06909] Updated weights for policy 0, policy_version 26122 (0.0039) [2024-06-27 15:49:18,850][06674] Fps is (10 sec: 40959.4, 60 sec: 43417.5, 300 sec: 43320.4). Total num frames: 427999232. Throughput: 0: 43304.4. Samples: 330942900. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-27 15:49:18,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 15:49:21,138][06909] Updated weights for policy 0, policy_version 26132 (0.0028) [2024-06-27 15:49:23,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43417.7, 300 sec: 43376.0). Total num frames: 428244992. Throughput: 0: 43425.4. Samples: 331203960. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-27 15:49:23,850][06674] Avg episode reward: [(0, '0.400')] [2024-06-27 15:49:25,520][06909] Updated weights for policy 0, policy_version 26142 (0.0029) [2024-06-27 15:49:28,775][06909] Updated weights for policy 0, policy_version 26152 (0.0038) [2024-06-27 15:49:28,850][06674] Fps is (10 sec: 47514.2, 60 sec: 43144.6, 300 sec: 43431.5). Total num frames: 428474368. Throughput: 0: 43371.5. Samples: 331339260. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 15:49:28,850][06674] Avg episode reward: [(0, '0.407')] [2024-06-27 15:49:33,041][06909] Updated weights for policy 0, policy_version 26162 (0.0039) [2024-06-27 15:49:33,851][06674] Fps is (10 sec: 39318.3, 60 sec: 43144.0, 300 sec: 43320.3). Total num frames: 428638208. Throughput: 0: 43473.4. Samples: 331595020. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 15:49:33,851][06674] Avg episode reward: [(0, '0.409')] [2024-06-27 15:49:36,247][06909] Updated weights for policy 0, policy_version 26172 (0.0038) [2024-06-27 15:49:38,850][06674] Fps is (10 sec: 39321.3, 60 sec: 43144.5, 300 sec: 43265.2). Total num frames: 428867584. Throughput: 0: 43435.5. Samples: 331856280. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 15:49:38,850][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:49:40,497][06909] Updated weights for policy 0, policy_version 26182 (0.0023) [2024-06-27 15:49:43,658][06909] Updated weights for policy 0, policy_version 26192 (0.0035) [2024-06-27 15:49:43,850][06674] Fps is (10 sec: 50794.5, 60 sec: 43690.6, 300 sec: 43487.0). Total num frames: 429146112. Throughput: 0: 43498.1. Samples: 331992000. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-27 15:49:43,852][06674] Avg episode reward: [(0, '0.400')] [2024-06-27 15:49:47,970][06909] Updated weights for policy 0, policy_version 26202 (0.0034) [2024-06-27 15:49:48,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43144.6, 300 sec: 43320.4). Total num frames: 429293568. Throughput: 0: 43362.0. Samples: 332242480. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-27 15:49:48,850][06674] Avg episode reward: [(0, '0.404')] [2024-06-27 15:49:48,945][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000026203_429309952.pth... [2024-06-27 15:49:48,988][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000025568_418906112.pth [2024-06-27 15:49:51,220][06909] Updated weights for policy 0, policy_version 26212 (0.0045) [2024-06-27 15:49:53,850][06674] Fps is (10 sec: 36045.2, 60 sec: 42871.5, 300 sec: 43209.3). Total num frames: 429506560. Throughput: 0: 43303.2. Samples: 332505360. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-27 15:49:53,850][06674] Avg episode reward: [(0, '0.405')] [2024-06-27 15:49:55,547][06909] Updated weights for policy 0, policy_version 26222 (0.0045) [2024-06-27 15:49:58,850][06674] Fps is (10 sec: 47513.0, 60 sec: 43417.5, 300 sec: 43487.0). Total num frames: 429768704. Throughput: 0: 43343.5. Samples: 332635740. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-27 15:49:58,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 15:49:59,191][06909] Updated weights for policy 0, policy_version 26232 (0.0033) [2024-06-27 15:50:01,392][06887] Signal inference workers to stop experience collection... (4700 times) [2024-06-27 15:50:01,396][06887] Signal inference workers to resume experience collection... (4700 times) [2024-06-27 15:50:01,435][06909] InferenceWorker_p0-w0: stopping experience collection (4700 times) [2024-06-27 15:50:01,435][06909] InferenceWorker_p0-w0: resuming experience collection (4700 times) [2024-06-27 15:50:03,386][06909] Updated weights for policy 0, policy_version 26242 (0.0034) [2024-06-27 15:50:03,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43419.1, 300 sec: 43431.5). Total num frames: 429965312. Throughput: 0: 43407.6. Samples: 332896240. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-27 15:50:03,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 15:50:06,760][06909] Updated weights for policy 0, policy_version 26252 (0.0027) [2024-06-27 15:50:08,850][06674] Fps is (10 sec: 40960.8, 60 sec: 43144.6, 300 sec: 43264.9). Total num frames: 430178304. Throughput: 0: 43468.1. Samples: 333160020. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-27 15:50:08,850][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:50:10,667][06909] Updated weights for policy 0, policy_version 26262 (0.0037) [2024-06-27 15:50:13,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43144.6, 300 sec: 43487.0). Total num frames: 430407680. Throughput: 0: 43434.7. Samples: 333293820. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-27 15:50:13,850][06674] Avg episode reward: [(0, '0.407')] [2024-06-27 15:50:14,181][06909] Updated weights for policy 0, policy_version 26272 (0.0038) [2024-06-27 15:50:18,103][06909] Updated weights for policy 0, policy_version 26282 (0.0037) [2024-06-27 15:50:18,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.8, 300 sec: 43431.8). Total num frames: 430620672. Throughput: 0: 43497.7. Samples: 333552380. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-27 15:50:18,850][06674] Avg episode reward: [(0, '0.408')] [2024-06-27 15:50:21,603][06909] Updated weights for policy 0, policy_version 26292 (0.0037) [2024-06-27 15:50:23,852][06674] Fps is (10 sec: 44227.7, 60 sec: 43416.1, 300 sec: 43320.1). Total num frames: 430850048. Throughput: 0: 43495.9. Samples: 333813680. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-27 15:50:23,852][06674] Avg episode reward: [(0, '0.408')] [2024-06-27 15:50:25,848][06909] Updated weights for policy 0, policy_version 26302 (0.0044) [2024-06-27 15:50:28,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43144.6, 300 sec: 43431.5). Total num frames: 431063040. Throughput: 0: 43372.5. Samples: 333943760. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-27 15:50:28,850][06674] Avg episode reward: [(0, '0.406')] [2024-06-27 15:50:29,134][06909] Updated weights for policy 0, policy_version 26312 (0.0028) [2024-06-27 15:50:33,176][06909] Updated weights for policy 0, policy_version 26322 (0.0024) [2024-06-27 15:50:33,856][06674] Fps is (10 sec: 42581.3, 60 sec: 43959.9, 300 sec: 43486.1). Total num frames: 431276032. Throughput: 0: 43716.3. Samples: 334209980. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-27 15:50:33,856][06674] Avg episode reward: [(0, '0.405')] [2024-06-27 15:50:36,560][06909] Updated weights for policy 0, policy_version 26332 (0.0033) [2024-06-27 15:50:38,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.7, 300 sec: 43375.9). Total num frames: 431505408. Throughput: 0: 43560.3. Samples: 334465580. Policy #0 lag: (min: 0.0, avg: 12.0, max: 22.0) [2024-06-27 15:50:38,850][06674] Avg episode reward: [(0, '0.409')] [2024-06-27 15:50:40,970][06909] Updated weights for policy 0, policy_version 26342 (0.0040) [2024-06-27 15:50:43,850][06674] Fps is (10 sec: 45903.0, 60 sec: 43144.6, 300 sec: 43542.6). Total num frames: 431734784. Throughput: 0: 43538.3. Samples: 334594960. Policy #0 lag: (min: 0.0, avg: 12.0, max: 22.0) [2024-06-27 15:50:43,850][06674] Avg episode reward: [(0, '0.408')] [2024-06-27 15:50:44,111][06909] Updated weights for policy 0, policy_version 26352 (0.0042) [2024-06-27 15:50:48,596][06909] Updated weights for policy 0, policy_version 26362 (0.0035) [2024-06-27 15:50:48,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.7, 300 sec: 43431.5). Total num frames: 431931392. Throughput: 0: 43608.0. Samples: 334858600. Policy #0 lag: (min: 0.0, avg: 12.0, max: 22.0) [2024-06-27 15:50:48,850][06674] Avg episode reward: [(0, '0.408')] [2024-06-27 15:50:51,579][06909] Updated weights for policy 0, policy_version 26372 (0.0033) [2024-06-27 15:50:53,852][06674] Fps is (10 sec: 40951.3, 60 sec: 43962.1, 300 sec: 43320.1). Total num frames: 432144384. Throughput: 0: 43431.2. Samples: 335114520. Policy #0 lag: (min: 0.0, avg: 12.0, max: 22.0) [2024-06-27 15:50:53,853][06674] Avg episode reward: [(0, '0.409')] [2024-06-27 15:50:56,099][06909] Updated weights for policy 0, policy_version 26382 (0.0030) [2024-06-27 15:50:58,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43690.7, 300 sec: 43542.5). Total num frames: 432390144. Throughput: 0: 43436.3. Samples: 335248460. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-27 15:50:58,851][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 15:50:58,997][06909] Updated weights for policy 0, policy_version 26392 (0.0038) [2024-06-27 15:51:03,486][06909] Updated weights for policy 0, policy_version 26402 (0.0033) [2024-06-27 15:51:03,850][06674] Fps is (10 sec: 42607.2, 60 sec: 43417.6, 300 sec: 43375.9). Total num frames: 432570368. Throughput: 0: 43593.7. Samples: 335514100. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-27 15:51:03,852][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 15:51:06,642][06909] Updated weights for policy 0, policy_version 26412 (0.0039) [2024-06-27 15:51:08,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43690.6, 300 sec: 43320.4). Total num frames: 432799744. Throughput: 0: 43530.8. Samples: 335772480. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-27 15:51:08,850][06674] Avg episode reward: [(0, '0.387')] [2024-06-27 15:51:10,798][06909] Updated weights for policy 0, policy_version 26422 (0.0028) [2024-06-27 15:51:13,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43417.5, 300 sec: 43431.5). Total num frames: 433012736. Throughput: 0: 43663.4. Samples: 335908620. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2024-06-27 15:51:13,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 15:51:14,238][06909] Updated weights for policy 0, policy_version 26432 (0.0038) [2024-06-27 15:51:18,286][06909] Updated weights for policy 0, policy_version 26442 (0.0034) [2024-06-27 15:51:18,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43417.5, 300 sec: 43375.9). Total num frames: 433225728. Throughput: 0: 43442.7. Samples: 336164640. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2024-06-27 15:51:18,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 15:51:21,553][06909] Updated weights for policy 0, policy_version 26452 (0.0041) [2024-06-27 15:51:23,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43419.1, 300 sec: 43320.7). Total num frames: 433455104. Throughput: 0: 43646.7. Samples: 336429680. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2024-06-27 15:51:23,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 15:51:25,742][06909] Updated weights for policy 0, policy_version 26462 (0.0036) [2024-06-27 15:51:28,850][06674] Fps is (10 sec: 47513.2, 60 sec: 43963.6, 300 sec: 43598.2). Total num frames: 433700864. Throughput: 0: 43723.0. Samples: 336562500. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2024-06-27 15:51:28,856][06674] Avg episode reward: [(0, '0.409')] [2024-06-27 15:51:29,142][06909] Updated weights for policy 0, policy_version 26472 (0.0022) [2024-06-27 15:51:33,514][06909] Updated weights for policy 0, policy_version 26482 (0.0026) [2024-06-27 15:51:33,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43422.0, 300 sec: 43376.0). Total num frames: 433881088. Throughput: 0: 43661.4. Samples: 336823360. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-27 15:51:33,850][06674] Avg episode reward: [(0, '0.408')] [2024-06-27 15:51:36,547][06909] Updated weights for policy 0, policy_version 26492 (0.0024) [2024-06-27 15:51:38,856][06674] Fps is (10 sec: 40935.7, 60 sec: 43413.3, 300 sec: 43375.1). Total num frames: 434110464. Throughput: 0: 43742.4. Samples: 337083100. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-27 15:51:38,856][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 15:51:40,883][06909] Updated weights for policy 0, policy_version 26502 (0.0025) [2024-06-27 15:51:43,668][06887] Signal inference workers to stop experience collection... (4750 times) [2024-06-27 15:51:43,668][06887] Signal inference workers to resume experience collection... (4750 times) [2024-06-27 15:51:43,712][06909] InferenceWorker_p0-w0: stopping experience collection (4750 times) [2024-06-27 15:51:43,712][06909] InferenceWorker_p0-w0: resuming experience collection (4750 times) [2024-06-27 15:51:43,850][06674] Fps is (10 sec: 47513.3, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 434356224. Throughput: 0: 43708.5. Samples: 337215340. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-27 15:51:43,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 15:51:43,977][06909] Updated weights for policy 0, policy_version 26512 (0.0042) [2024-06-27 15:51:48,238][06909] Updated weights for policy 0, policy_version 26522 (0.0038) [2024-06-27 15:51:48,850][06674] Fps is (10 sec: 42623.8, 60 sec: 43417.5, 300 sec: 43431.5). Total num frames: 434536448. Throughput: 0: 43658.6. Samples: 337478740. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 15:51:48,851][06674] Avg episode reward: [(0, '0.409')] [2024-06-27 15:51:48,883][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000026523_434552832.pth... [2024-06-27 15:51:48,950][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000025885_424099840.pth [2024-06-27 15:51:51,560][06909] Updated weights for policy 0, policy_version 26532 (0.0040) [2024-06-27 15:51:53,850][06674] Fps is (10 sec: 39321.7, 60 sec: 43419.1, 300 sec: 43320.4). Total num frames: 434749440. Throughput: 0: 43717.4. Samples: 337739760. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 15:51:53,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 15:51:55,693][06909] Updated weights for policy 0, policy_version 26542 (0.0041) [2024-06-27 15:51:58,852][06674] Fps is (10 sec: 45865.8, 60 sec: 43416.1, 300 sec: 43542.3). Total num frames: 434995200. Throughput: 0: 43523.8. Samples: 337867280. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 15:51:58,853][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 15:51:59,262][06909] Updated weights for policy 0, policy_version 26552 (0.0024) [2024-06-27 15:52:03,182][06909] Updated weights for policy 0, policy_version 26562 (0.0036) [2024-06-27 15:52:03,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.7, 300 sec: 43376.0). Total num frames: 435191808. Throughput: 0: 43526.3. Samples: 338123320. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 15:52:03,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 15:52:06,968][06909] Updated weights for policy 0, policy_version 26572 (0.0022) [2024-06-27 15:52:08,850][06674] Fps is (10 sec: 40968.7, 60 sec: 43417.6, 300 sec: 43320.4). Total num frames: 435404800. Throughput: 0: 43539.6. Samples: 338388960. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 15:52:08,851][06674] Avg episode reward: [(0, '0.408')] [2024-06-27 15:52:10,947][06909] Updated weights for policy 0, policy_version 26582 (0.0033) [2024-06-27 15:52:13,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.8, 300 sec: 43542.6). Total num frames: 435650560. Throughput: 0: 43488.6. Samples: 338519480. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 15:52:13,850][06674] Avg episode reward: [(0, '0.408')] [2024-06-27 15:52:14,427][06909] Updated weights for policy 0, policy_version 26592 (0.0039) [2024-06-27 15:52:18,662][06909] Updated weights for policy 0, policy_version 26602 (0.0030) [2024-06-27 15:52:18,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.7, 300 sec: 43487.0). Total num frames: 435847168. Throughput: 0: 43380.4. Samples: 338775480. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 15:52:18,850][06674] Avg episode reward: [(0, '0.404')] [2024-06-27 15:52:22,202][06909] Updated weights for policy 0, policy_version 26612 (0.0031) [2024-06-27 15:52:23,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43417.6, 300 sec: 43320.4). Total num frames: 436060160. Throughput: 0: 43300.1. Samples: 339031340. Policy #0 lag: (min: 0.0, avg: 11.8, max: 24.0) [2024-06-27 15:52:23,850][06674] Avg episode reward: [(0, '0.406')] [2024-06-27 15:52:26,156][06909] Updated weights for policy 0, policy_version 26622 (0.0039) [2024-06-27 15:52:28,852][06674] Fps is (10 sec: 45865.7, 60 sec: 43416.2, 300 sec: 43597.8). Total num frames: 436305920. Throughput: 0: 43241.1. Samples: 339161280. Policy #0 lag: (min: 0.0, avg: 11.8, max: 24.0) [2024-06-27 15:52:28,852][06674] Avg episode reward: [(0, '0.408')] [2024-06-27 15:52:29,538][06909] Updated weights for policy 0, policy_version 26632 (0.0027) [2024-06-27 15:52:33,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43417.6, 300 sec: 43376.0). Total num frames: 436486144. Throughput: 0: 43316.2. Samples: 339427960. Policy #0 lag: (min: 0.0, avg: 11.8, max: 24.0) [2024-06-27 15:52:33,850][06674] Avg episode reward: [(0, '0.405')] [2024-06-27 15:52:33,902][06909] Updated weights for policy 0, policy_version 26642 (0.0025) [2024-06-27 15:52:37,283][06909] Updated weights for policy 0, policy_version 26652 (0.0040) [2024-06-27 15:52:38,850][06674] Fps is (10 sec: 40968.6, 60 sec: 43422.0, 300 sec: 43320.4). Total num frames: 436715520. Throughput: 0: 43125.8. Samples: 339680420. Policy #0 lag: (min: 0.0, avg: 11.8, max: 24.0) [2024-06-27 15:52:38,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 15:52:41,408][06909] Updated weights for policy 0, policy_version 26662 (0.0030) [2024-06-27 15:52:43,850][06674] Fps is (10 sec: 45874.5, 60 sec: 43144.5, 300 sec: 43542.6). Total num frames: 436944896. Throughput: 0: 43273.1. Samples: 339814480. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 15:52:43,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 15:52:44,750][06909] Updated weights for policy 0, policy_version 26672 (0.0025) [2024-06-27 15:52:48,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43417.7, 300 sec: 43375.9). Total num frames: 437141504. Throughput: 0: 43350.6. Samples: 340074100. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 15:52:48,850][06674] Avg episode reward: [(0, '0.407')] [2024-06-27 15:52:49,096][06909] Updated weights for policy 0, policy_version 26682 (0.0034) [2024-06-27 15:52:52,279][06909] Updated weights for policy 0, policy_version 26692 (0.0028) [2024-06-27 15:52:53,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43417.6, 300 sec: 43375.9). Total num frames: 437354496. Throughput: 0: 43078.2. Samples: 340327480. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 15:52:53,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 15:52:56,661][06909] Updated weights for policy 0, policy_version 26702 (0.0035) [2024-06-27 15:52:58,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43146.1, 300 sec: 43487.0). Total num frames: 437583872. Throughput: 0: 43321.7. Samples: 340468960. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 15:52:58,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 15:52:59,841][06909] Updated weights for policy 0, policy_version 26712 (0.0034) [2024-06-27 15:53:03,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43144.6, 300 sec: 43431.5). Total num frames: 437780480. Throughput: 0: 43280.9. Samples: 340723120. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 15:53:03,850][06674] Avg episode reward: [(0, '0.407')] [2024-06-27 15:53:04,076][06909] Updated weights for policy 0, policy_version 26722 (0.0031) [2024-06-27 15:53:07,245][06909] Updated weights for policy 0, policy_version 26732 (0.0031) [2024-06-27 15:53:08,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43417.6, 300 sec: 43375.9). Total num frames: 438009856. Throughput: 0: 43416.9. Samples: 340985100. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 15:53:08,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 15:53:11,451][06909] Updated weights for policy 0, policy_version 26742 (0.0036) [2024-06-27 15:53:13,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43144.5, 300 sec: 43542.6). Total num frames: 438239232. Throughput: 0: 43397.6. Samples: 341114080. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 15:53:13,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 15:53:14,828][06909] Updated weights for policy 0, policy_version 26752 (0.0033) [2024-06-27 15:53:18,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 438452224. Throughput: 0: 43327.5. Samples: 341377700. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 15:53:18,850][06674] Avg episode reward: [(0, '0.409')] [2024-06-27 15:53:19,157][06909] Updated weights for policy 0, policy_version 26762 (0.0032) [2024-06-27 15:53:22,379][06909] Updated weights for policy 0, policy_version 26772 (0.0044) [2024-06-27 15:53:23,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43417.6, 300 sec: 43320.4). Total num frames: 438665216. Throughput: 0: 43417.8. Samples: 341634220. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 15:53:23,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 15:53:26,861][06909] Updated weights for policy 0, policy_version 26782 (0.0037) [2024-06-27 15:53:28,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43146.0, 300 sec: 43542.6). Total num frames: 438894592. Throughput: 0: 43511.6. Samples: 341772500. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 15:53:28,852][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 15:53:30,140][06909] Updated weights for policy 0, policy_version 26792 (0.0032) [2024-06-27 15:53:33,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43417.5, 300 sec: 43431.5). Total num frames: 439091200. Throughput: 0: 43592.0. Samples: 342035740. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 15:53:33,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 15:53:34,155][06909] Updated weights for policy 0, policy_version 26802 (0.0026) [2024-06-27 15:53:37,603][06887] Signal inference workers to stop experience collection... (4800 times) [2024-06-27 15:53:37,651][06909] InferenceWorker_p0-w0: stopping experience collection (4800 times) [2024-06-27 15:53:37,659][06887] Signal inference workers to resume experience collection... (4800 times) [2024-06-27 15:53:37,670][06909] InferenceWorker_p0-w0: resuming experience collection (4800 times) [2024-06-27 15:53:37,676][06909] Updated weights for policy 0, policy_version 26812 (0.0032) [2024-06-27 15:53:38,856][06674] Fps is (10 sec: 44210.4, 60 sec: 43686.3, 300 sec: 43430.6). Total num frames: 439336960. Throughput: 0: 43687.5. Samples: 342293680. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 15:53:38,856][06674] Avg episode reward: [(0, '0.408')] [2024-06-27 15:53:41,753][06909] Updated weights for policy 0, policy_version 26822 (0.0042) [2024-06-27 15:53:43,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43417.6, 300 sec: 43542.6). Total num frames: 439549952. Throughput: 0: 43557.3. Samples: 342429040. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 15:53:43,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 15:53:45,107][06909] Updated weights for policy 0, policy_version 26832 (0.0033) [2024-06-27 15:53:48,850][06674] Fps is (10 sec: 40984.5, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 439746560. Throughput: 0: 43640.8. Samples: 342686960. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 15:53:48,851][06674] Avg episode reward: [(0, '0.404')] [2024-06-27 15:53:48,978][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000026841_439762944.pth... [2024-06-27 15:53:49,041][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000026203_429309952.pth [2024-06-27 15:53:49,212][06909] Updated weights for policy 0, policy_version 26842 (0.0034) [2024-06-27 15:53:52,578][06909] Updated weights for policy 0, policy_version 26852 (0.0045) [2024-06-27 15:53:53,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.7, 300 sec: 43431.5). Total num frames: 439975936. Throughput: 0: 43524.9. Samples: 342943720. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-27 15:53:53,850][06674] Avg episode reward: [(0, '0.406')] [2024-06-27 15:53:56,574][06909] Updated weights for policy 0, policy_version 26862 (0.0043) [2024-06-27 15:53:58,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43144.4, 300 sec: 43431.8). Total num frames: 440172544. Throughput: 0: 43683.8. Samples: 343079860. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-27 15:53:58,851][06674] Avg episode reward: [(0, '0.407')] [2024-06-27 15:54:00,142][06909] Updated weights for policy 0, policy_version 26872 (0.0037) [2024-06-27 15:54:03,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 43487.0). Total num frames: 440418304. Throughput: 0: 43450.2. Samples: 343332960. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-27 15:54:03,850][06674] Avg episode reward: [(0, '0.409')] [2024-06-27 15:54:03,962][06909] Updated weights for policy 0, policy_version 26882 (0.0036) [2024-06-27 15:54:08,107][06909] Updated weights for policy 0, policy_version 26892 (0.0028) [2024-06-27 15:54:08,850][06674] Fps is (10 sec: 45876.1, 60 sec: 43690.7, 300 sec: 43431.5). Total num frames: 440631296. Throughput: 0: 43611.5. Samples: 343596740. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-27 15:54:08,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 15:54:11,649][06909] Updated weights for policy 0, policy_version 26902 (0.0034) [2024-06-27 15:54:13,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43417.5, 300 sec: 43542.6). Total num frames: 440844288. Throughput: 0: 43369.8. Samples: 343724140. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-27 15:54:13,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 15:54:15,650][06909] Updated weights for policy 0, policy_version 26912 (0.0030) [2024-06-27 15:54:18,850][06674] Fps is (10 sec: 44235.8, 60 sec: 43690.5, 300 sec: 43487.0). Total num frames: 441073664. Throughput: 0: 43312.7. Samples: 343984820. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-27 15:54:18,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 15:54:19,009][06909] Updated weights for policy 0, policy_version 26922 (0.0035) [2024-06-27 15:54:23,282][06909] Updated weights for policy 0, policy_version 26932 (0.0027) [2024-06-27 15:54:23,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.7, 300 sec: 43487.0). Total num frames: 441303040. Throughput: 0: 43521.8. Samples: 344251900. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-27 15:54:23,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 15:54:26,599][06909] Updated weights for policy 0, policy_version 26942 (0.0038) [2024-06-27 15:54:28,850][06674] Fps is (10 sec: 42599.2, 60 sec: 43417.6, 300 sec: 43598.2). Total num frames: 441499648. Throughput: 0: 43389.8. Samples: 344381580. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 15:54:28,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 15:54:30,683][06909] Updated weights for policy 0, policy_version 26952 (0.0042) [2024-06-27 15:54:33,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43690.7, 300 sec: 43542.6). Total num frames: 441712640. Throughput: 0: 43368.5. Samples: 344638540. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 15:54:33,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 15:54:34,074][06909] Updated weights for policy 0, policy_version 26962 (0.0036) [2024-06-27 15:54:38,063][06909] Updated weights for policy 0, policy_version 26972 (0.0031) [2024-06-27 15:54:38,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43421.9, 300 sec: 43375.9). Total num frames: 441942016. Throughput: 0: 43438.2. Samples: 344898440. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 15:54:38,851][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 15:54:40,132][06887] Signal inference workers to stop experience collection... (4850 times) [2024-06-27 15:54:40,132][06887] Signal inference workers to resume experience collection... (4850 times) [2024-06-27 15:54:40,146][06909] InferenceWorker_p0-w0: stopping experience collection (4850 times) [2024-06-27 15:54:40,147][06909] InferenceWorker_p0-w0: resuming experience collection (4850 times) [2024-06-27 15:54:41,829][06909] Updated weights for policy 0, policy_version 26982 (0.0045) [2024-06-27 15:54:43,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43144.5, 300 sec: 43542.6). Total num frames: 442138624. Throughput: 0: 43313.4. Samples: 345028960. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 15:54:43,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 15:54:45,526][06909] Updated weights for policy 0, policy_version 26992 (0.0039) [2024-06-27 15:54:48,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 442368000. Throughput: 0: 43443.2. Samples: 345287900. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 15:54:48,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 15:54:49,223][06909] Updated weights for policy 0, policy_version 27002 (0.0036) [2024-06-27 15:54:52,906][06909] Updated weights for policy 0, policy_version 27012 (0.0025) [2024-06-27 15:54:53,852][06674] Fps is (10 sec: 44227.3, 60 sec: 43416.0, 300 sec: 43431.2). Total num frames: 442580992. Throughput: 0: 43541.8. Samples: 345556220. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 15:54:53,853][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 15:54:56,659][06909] Updated weights for policy 0, policy_version 27022 (0.0030) [2024-06-27 15:54:58,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.8, 300 sec: 43487.0). Total num frames: 442793984. Throughput: 0: 43666.3. Samples: 345689120. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 15:54:58,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 15:55:00,773][06909] Updated weights for policy 0, policy_version 27032 (0.0038) [2024-06-27 15:55:03,850][06674] Fps is (10 sec: 45884.9, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 443039744. Throughput: 0: 43497.0. Samples: 345942180. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 15:55:03,851][06674] Avg episode reward: [(0, '0.405')] [2024-06-27 15:55:03,985][06909] Updated weights for policy 0, policy_version 27042 (0.0036) [2024-06-27 15:55:08,615][06909] Updated weights for policy 0, policy_version 27052 (0.0041) [2024-06-27 15:55:08,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43417.5, 300 sec: 43487.0). Total num frames: 443236352. Throughput: 0: 43454.6. Samples: 346207360. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 15:55:08,851][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 15:55:11,634][06909] Updated weights for policy 0, policy_version 27062 (0.0026) [2024-06-27 15:55:13,850][06674] Fps is (10 sec: 40960.6, 60 sec: 43417.7, 300 sec: 43487.0). Total num frames: 443449344. Throughput: 0: 43294.7. Samples: 346329840. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 15:55:13,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 15:55:16,129][06909] Updated weights for policy 0, policy_version 27072 (0.0029) [2024-06-27 15:55:18,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43417.7, 300 sec: 43487.3). Total num frames: 443678720. Throughput: 0: 43448.8. Samples: 346593740. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 15:55:18,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 15:55:19,104][06909] Updated weights for policy 0, policy_version 27082 (0.0052) [2024-06-27 15:55:23,646][06909] Updated weights for policy 0, policy_version 27092 (0.0042) [2024-06-27 15:55:23,850][06674] Fps is (10 sec: 42598.0, 60 sec: 42871.5, 300 sec: 43431.5). Total num frames: 443875328. Throughput: 0: 43596.5. Samples: 346860280. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-27 15:55:23,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 15:55:26,642][06909] Updated weights for policy 0, policy_version 27102 (0.0040) [2024-06-27 15:55:28,852][06674] Fps is (10 sec: 44227.9, 60 sec: 43689.1, 300 sec: 43543.1). Total num frames: 444121088. Throughput: 0: 43467.8. Samples: 346985100. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-27 15:55:28,853][06674] Avg episode reward: [(0, '0.401')] [2024-06-27 15:55:31,105][06909] Updated weights for policy 0, policy_version 27112 (0.0033) [2024-06-27 15:55:33,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43690.7, 300 sec: 43487.0). Total num frames: 444334080. Throughput: 0: 43536.9. Samples: 347247060. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-27 15:55:33,850][06674] Avg episode reward: [(0, '0.400')] [2024-06-27 15:55:34,026][06909] Updated weights for policy 0, policy_version 27122 (0.0022) [2024-06-27 15:55:38,605][06909] Updated weights for policy 0, policy_version 27132 (0.0029) [2024-06-27 15:55:38,850][06674] Fps is (10 sec: 40968.4, 60 sec: 43144.5, 300 sec: 43375.9). Total num frames: 444530688. Throughput: 0: 43435.0. Samples: 347510700. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-27 15:55:38,851][06674] Avg episode reward: [(0, '0.407')] [2024-06-27 15:55:39,567][06887] Signal inference workers to stop experience collection... (4900 times) [2024-06-27 15:55:39,599][06909] InferenceWorker_p0-w0: stopping experience collection (4900 times) [2024-06-27 15:55:39,620][06887] Signal inference workers to resume experience collection... (4900 times) [2024-06-27 15:55:39,620][06909] InferenceWorker_p0-w0: resuming experience collection (4900 times) [2024-06-27 15:55:41,877][06909] Updated weights for policy 0, policy_version 27142 (0.0036) [2024-06-27 15:55:43,850][06674] Fps is (10 sec: 42597.6, 60 sec: 43690.6, 300 sec: 43487.0). Total num frames: 444760064. Throughput: 0: 43120.3. Samples: 347629540. Policy #0 lag: (min: 0.0, avg: 12.0, max: 27.0) [2024-06-27 15:55:43,851][06674] Avg episode reward: [(0, '0.409')] [2024-06-27 15:55:46,071][06909] Updated weights for policy 0, policy_version 27152 (0.0025) [2024-06-27 15:55:48,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43690.6, 300 sec: 43542.9). Total num frames: 444989440. Throughput: 0: 43313.8. Samples: 347891300. Policy #0 lag: (min: 0.0, avg: 12.0, max: 27.0) [2024-06-27 15:55:48,851][06674] Avg episode reward: [(0, '0.408')] [2024-06-27 15:55:48,869][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000027160_444989440.pth... [2024-06-27 15:55:48,927][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000026523_434552832.pth [2024-06-27 15:55:49,310][06909] Updated weights for policy 0, policy_version 27162 (0.0026) [2024-06-27 15:55:53,436][06909] Updated weights for policy 0, policy_version 27172 (0.0044) [2024-06-27 15:55:53,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43419.1, 300 sec: 43375.9). Total num frames: 445186048. Throughput: 0: 43253.8. Samples: 348153780. Policy #0 lag: (min: 0.0, avg: 12.0, max: 27.0) [2024-06-27 15:55:53,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 15:55:56,934][06909] Updated weights for policy 0, policy_version 27182 (0.0028) [2024-06-27 15:55:58,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43417.5, 300 sec: 43487.0). Total num frames: 445399040. Throughput: 0: 43301.6. Samples: 348278420. Policy #0 lag: (min: 0.0, avg: 12.0, max: 27.0) [2024-06-27 15:55:58,851][06674] Avg episode reward: [(0, '0.409')] [2024-06-27 15:56:01,419][06909] Updated weights for policy 0, policy_version 27192 (0.0027) [2024-06-27 15:56:03,850][06674] Fps is (10 sec: 44237.6, 60 sec: 43144.7, 300 sec: 43487.0). Total num frames: 445628416. Throughput: 0: 43225.9. Samples: 348538900. Policy #0 lag: (min: 0.0, avg: 7.4, max: 20.0) [2024-06-27 15:56:03,850][06674] Avg episode reward: [(0, '0.407')] [2024-06-27 15:56:04,643][06909] Updated weights for policy 0, policy_version 27202 (0.0037) [2024-06-27 15:56:08,850][06674] Fps is (10 sec: 40959.9, 60 sec: 42871.4, 300 sec: 43375.9). Total num frames: 445808640. Throughput: 0: 43268.8. Samples: 348807380. Policy #0 lag: (min: 0.0, avg: 7.4, max: 20.0) [2024-06-27 15:56:08,850][06674] Avg episode reward: [(0, '0.408')] [2024-06-27 15:56:09,049][06909] Updated weights for policy 0, policy_version 27212 (0.0041) [2024-06-27 15:56:12,002][06909] Updated weights for policy 0, policy_version 27222 (0.0028) [2024-06-27 15:56:13,850][06674] Fps is (10 sec: 42597.5, 60 sec: 43417.5, 300 sec: 43487.0). Total num frames: 446054400. Throughput: 0: 43066.8. Samples: 348923020. Policy #0 lag: (min: 0.0, avg: 7.4, max: 20.0) [2024-06-27 15:56:13,851][06674] Avg episode reward: [(0, '0.408')] [2024-06-27 15:56:16,610][06909] Updated weights for policy 0, policy_version 27232 (0.0027) [2024-06-27 15:56:18,850][06674] Fps is (10 sec: 49152.5, 60 sec: 43690.7, 300 sec: 43542.6). Total num frames: 446300160. Throughput: 0: 43295.0. Samples: 349195340. Policy #0 lag: (min: 1.0, avg: 10.1, max: 23.0) [2024-06-27 15:56:18,850][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 15:56:19,484][06909] Updated weights for policy 0, policy_version 27242 (0.0040) [2024-06-27 15:56:23,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43144.5, 300 sec: 43264.9). Total num frames: 446464000. Throughput: 0: 43312.9. Samples: 349459780. Policy #0 lag: (min: 1.0, avg: 10.1, max: 23.0) [2024-06-27 15:56:23,851][06674] Avg episode reward: [(0, '0.407')] [2024-06-27 15:56:24,219][06909] Updated weights for policy 0, policy_version 27252 (0.0036) [2024-06-27 15:56:27,043][06909] Updated weights for policy 0, policy_version 27262 (0.0036) [2024-06-27 15:56:28,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43146.1, 300 sec: 43487.0). Total num frames: 446709760. Throughput: 0: 43409.0. Samples: 349582940. Policy #0 lag: (min: 1.0, avg: 10.1, max: 23.0) [2024-06-27 15:56:28,850][06674] Avg episode reward: [(0, '0.407')] [2024-06-27 15:56:31,652][06909] Updated weights for policy 0, policy_version 27272 (0.0043) [2024-06-27 15:56:33,850][06674] Fps is (10 sec: 47513.6, 60 sec: 43417.5, 300 sec: 43487.9). Total num frames: 446939136. Throughput: 0: 43545.8. Samples: 349850860. Policy #0 lag: (min: 1.0, avg: 10.1, max: 23.0) [2024-06-27 15:56:33,850][06674] Avg episode reward: [(0, '0.407')] [2024-06-27 15:56:34,723][06909] Updated weights for policy 0, policy_version 27282 (0.0031) [2024-06-27 15:56:38,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43144.6, 300 sec: 43264.9). Total num frames: 447119360. Throughput: 0: 43694.9. Samples: 350120040. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 15:56:38,850][06674] Avg episode reward: [(0, '0.404')] [2024-06-27 15:56:38,897][06887] Signal inference workers to stop experience collection... (4950 times) [2024-06-27 15:56:38,900][06887] Signal inference workers to resume experience collection... (4950 times) [2024-06-27 15:56:38,928][06909] InferenceWorker_p0-w0: stopping experience collection (4950 times) [2024-06-27 15:56:38,928][06909] InferenceWorker_p0-w0: resuming experience collection (4950 times) [2024-06-27 15:56:39,044][06909] Updated weights for policy 0, policy_version 27292 (0.0032) [2024-06-27 15:56:42,197][06909] Updated weights for policy 0, policy_version 27302 (0.0035) [2024-06-27 15:56:43,852][06674] Fps is (10 sec: 42590.1, 60 sec: 43416.2, 300 sec: 43486.7). Total num frames: 447365120. Throughput: 0: 43537.7. Samples: 350237700. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 15:56:43,852][06674] Avg episode reward: [(0, '0.404')] [2024-06-27 15:56:46,546][06909] Updated weights for policy 0, policy_version 27312 (0.0028) [2024-06-27 15:56:48,852][06674] Fps is (10 sec: 45865.5, 60 sec: 43143.1, 300 sec: 43486.7). Total num frames: 447578112. Throughput: 0: 43563.3. Samples: 350499340. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 15:56:48,852][06674] Avg episode reward: [(0, '0.407')] [2024-06-27 15:56:50,013][06909] Updated weights for policy 0, policy_version 27322 (0.0037) [2024-06-27 15:56:53,850][06674] Fps is (10 sec: 40968.4, 60 sec: 43144.6, 300 sec: 43320.7). Total num frames: 447774720. Throughput: 0: 43513.9. Samples: 350765500. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 15:56:53,850][06674] Avg episode reward: [(0, '0.407')] [2024-06-27 15:56:54,121][06909] Updated weights for policy 0, policy_version 27332 (0.0033) [2024-06-27 15:56:57,658][06909] Updated weights for policy 0, policy_version 27342 (0.0033) [2024-06-27 15:56:58,850][06674] Fps is (10 sec: 44245.5, 60 sec: 43690.7, 300 sec: 43487.0). Total num frames: 448020480. Throughput: 0: 43620.1. Samples: 350885920. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2024-06-27 15:56:58,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 15:57:01,666][06909] Updated weights for policy 0, policy_version 27352 (0.0049) [2024-06-27 15:57:03,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43144.5, 300 sec: 43431.5). Total num frames: 448217088. Throughput: 0: 43399.2. Samples: 351148300. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2024-06-27 15:57:03,850][06674] Avg episode reward: [(0, '0.407')] [2024-06-27 15:57:05,210][06909] Updated weights for policy 0, policy_version 27362 (0.0040) [2024-06-27 15:57:08,850][06674] Fps is (10 sec: 39322.0, 60 sec: 43417.7, 300 sec: 43264.9). Total num frames: 448413696. Throughput: 0: 43489.4. Samples: 351416800. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2024-06-27 15:57:08,850][06674] Avg episode reward: [(0, '0.409')] [2024-06-27 15:57:09,165][06909] Updated weights for policy 0, policy_version 27372 (0.0031) [2024-06-27 15:57:12,830][06909] Updated weights for policy 0, policy_version 27382 (0.0035) [2024-06-27 15:57:13,856][06674] Fps is (10 sec: 45847.4, 60 sec: 43686.4, 300 sec: 43486.1). Total num frames: 448675840. Throughput: 0: 43323.9. Samples: 351532780. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 15:57:13,856][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 15:57:16,571][06909] Updated weights for policy 0, policy_version 27392 (0.0028) [2024-06-27 15:57:18,850][06674] Fps is (10 sec: 47513.1, 60 sec: 43144.5, 300 sec: 43487.0). Total num frames: 448888832. Throughput: 0: 43276.9. Samples: 351798320. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 15:57:18,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 15:57:20,358][06909] Updated weights for policy 0, policy_version 27402 (0.0041) [2024-06-27 15:57:23,850][06674] Fps is (10 sec: 40984.8, 60 sec: 43690.7, 300 sec: 43320.7). Total num frames: 449085440. Throughput: 0: 43243.9. Samples: 352066020. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 15:57:23,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 15:57:24,310][06909] Updated weights for policy 0, policy_version 27412 (0.0037) [2024-06-27 15:57:27,906][06909] Updated weights for policy 0, policy_version 27422 (0.0031) [2024-06-27 15:57:28,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43690.5, 300 sec: 43542.5). Total num frames: 449331200. Throughput: 0: 43346.7. Samples: 352188220. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 15:57:28,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 15:57:31,891][06909] Updated weights for policy 0, policy_version 27432 (0.0037) [2024-06-27 15:57:33,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43417.6, 300 sec: 43487.0). Total num frames: 449544192. Throughput: 0: 43437.4. Samples: 352453940. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 15:57:33,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 15:57:35,773][06909] Updated weights for policy 0, policy_version 27442 (0.0039) [2024-06-27 15:57:38,850][06674] Fps is (10 sec: 40960.9, 60 sec: 43690.6, 300 sec: 43376.0). Total num frames: 449740800. Throughput: 0: 43374.7. Samples: 352717360. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 15:57:38,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 15:57:39,199][06909] Updated weights for policy 0, policy_version 27452 (0.0043) [2024-06-27 15:57:43,041][06909] Updated weights for policy 0, policy_version 27462 (0.0035) [2024-06-27 15:57:43,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43419.1, 300 sec: 43487.0). Total num frames: 449970176. Throughput: 0: 43502.8. Samples: 352843540. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 15:57:43,850][06674] Avg episode reward: [(0, '0.409')] [2024-06-27 15:57:46,700][06909] Updated weights for policy 0, policy_version 27472 (0.0028) [2024-06-27 15:57:48,850][06674] Fps is (10 sec: 45874.5, 60 sec: 43692.1, 300 sec: 43542.5). Total num frames: 450199552. Throughput: 0: 43532.7. Samples: 353107280. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 15:57:48,851][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 15:57:48,878][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000027478_450199552.pth... [2024-06-27 15:57:48,931][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000026841_439762944.pth [2024-06-27 15:57:50,608][06909] Updated weights for policy 0, policy_version 27482 (0.0028) [2024-06-27 15:57:53,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43417.5, 300 sec: 43375.9). Total num frames: 450379776. Throughput: 0: 43426.6. Samples: 353371000. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 15:57:53,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 15:57:54,453][06909] Updated weights for policy 0, policy_version 27492 (0.0030) [2024-06-27 15:57:58,054][06909] Updated weights for policy 0, policy_version 27502 (0.0024) [2024-06-27 15:57:58,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43144.6, 300 sec: 43487.0). Total num frames: 450609152. Throughput: 0: 43473.4. Samples: 353488820. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 15:57:58,850][06674] Avg episode reward: [(0, '0.406')] [2024-06-27 15:58:01,884][06909] Updated weights for policy 0, policy_version 27512 (0.0033) [2024-06-27 15:58:02,560][06887] Signal inference workers to stop experience collection... (5000 times) [2024-06-27 15:58:02,616][06909] InferenceWorker_p0-w0: stopping experience collection (5000 times) [2024-06-27 15:58:02,617][06887] Signal inference workers to resume experience collection... (5000 times) [2024-06-27 15:58:02,625][06909] InferenceWorker_p0-w0: resuming experience collection (5000 times) [2024-06-27 15:58:03,850][06674] Fps is (10 sec: 47513.7, 60 sec: 43963.7, 300 sec: 43542.6). Total num frames: 450854912. Throughput: 0: 43440.5. Samples: 353753140. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 15:58:03,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 15:58:05,437][06909] Updated weights for policy 0, policy_version 27522 (0.0025) [2024-06-27 15:58:08,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.7, 300 sec: 43375.9). Total num frames: 451035136. Throughput: 0: 43317.4. Samples: 354015300. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-27 15:58:08,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 15:58:09,758][06909] Updated weights for policy 0, policy_version 27532 (0.0023) [2024-06-27 15:58:13,109][06909] Updated weights for policy 0, policy_version 27542 (0.0026) [2024-06-27 15:58:13,850][06674] Fps is (10 sec: 39321.9, 60 sec: 42875.8, 300 sec: 43375.9). Total num frames: 451248128. Throughput: 0: 43397.5. Samples: 354141100. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-27 15:58:13,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 15:58:17,163][06909] Updated weights for policy 0, policy_version 27552 (0.0038) [2024-06-27 15:58:18,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43144.6, 300 sec: 43431.5). Total num frames: 451477504. Throughput: 0: 43353.8. Samples: 354404860. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-27 15:58:18,850][06674] Avg episode reward: [(0, '0.409')] [2024-06-27 15:58:20,517][06909] Updated weights for policy 0, policy_version 27562 (0.0039) [2024-06-27 15:58:23,850][06674] Fps is (10 sec: 42597.7, 60 sec: 43144.4, 300 sec: 43320.4). Total num frames: 451674112. Throughput: 0: 43303.0. Samples: 354666000. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-27 15:58:23,851][06674] Avg episode reward: [(0, '0.408')] [2024-06-27 15:58:24,764][06909] Updated weights for policy 0, policy_version 27572 (0.0040) [2024-06-27 15:58:27,983][06909] Updated weights for policy 0, policy_version 27582 (0.0028) [2024-06-27 15:58:28,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43144.7, 300 sec: 43487.0). Total num frames: 451919872. Throughput: 0: 43261.7. Samples: 354790320. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 15:58:28,850][06674] Avg episode reward: [(0, '0.407')] [2024-06-27 15:58:32,285][06909] Updated weights for policy 0, policy_version 27592 (0.0026) [2024-06-27 15:58:33,850][06674] Fps is (10 sec: 47513.7, 60 sec: 43417.6, 300 sec: 43432.4). Total num frames: 452149248. Throughput: 0: 43216.5. Samples: 355052020. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 15:58:33,850][06674] Avg episode reward: [(0, '0.409')] [2024-06-27 15:58:35,523][06909] Updated weights for policy 0, policy_version 27602 (0.0041) [2024-06-27 15:58:38,850][06674] Fps is (10 sec: 40959.2, 60 sec: 43144.4, 300 sec: 43320.4). Total num frames: 452329472. Throughput: 0: 43209.2. Samples: 355315420. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 15:58:38,851][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 15:58:39,820][06909] Updated weights for policy 0, policy_version 27612 (0.0034) [2024-06-27 15:58:43,586][06909] Updated weights for policy 0, policy_version 27622 (0.0044) [2024-06-27 15:58:43,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43417.6, 300 sec: 43487.0). Total num frames: 452575232. Throughput: 0: 43286.7. Samples: 355436720. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 15:58:43,850][06674] Avg episode reward: [(0, '0.407')] [2024-06-27 15:58:47,385][06909] Updated weights for policy 0, policy_version 27632 (0.0030) [2024-06-27 15:58:48,856][06674] Fps is (10 sec: 47485.7, 60 sec: 43413.3, 300 sec: 43486.1). Total num frames: 452804608. Throughput: 0: 43294.7. Samples: 355701660. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 15:58:48,857][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 15:58:51,004][06909] Updated weights for policy 0, policy_version 27642 (0.0036) [2024-06-27 15:58:53,851][06674] Fps is (10 sec: 39316.5, 60 sec: 43143.6, 300 sec: 43375.8). Total num frames: 452968448. Throughput: 0: 43380.5. Samples: 355967480. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 15:58:53,852][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 15:58:55,021][06909] Updated weights for policy 0, policy_version 27652 (0.0026) [2024-06-27 15:58:58,353][06909] Updated weights for policy 0, policy_version 27662 (0.0047) [2024-06-27 15:58:58,850][06674] Fps is (10 sec: 42624.1, 60 sec: 43690.6, 300 sec: 43431.5). Total num frames: 453230592. Throughput: 0: 43135.5. Samples: 356082200. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 15:58:58,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 15:59:02,729][06909] Updated weights for policy 0, policy_version 27672 (0.0042) [2024-06-27 15:59:03,850][06674] Fps is (10 sec: 47519.9, 60 sec: 43144.6, 300 sec: 43431.5). Total num frames: 453443584. Throughput: 0: 43321.9. Samples: 356354340. Policy #0 lag: (min: 0.0, avg: 12.4, max: 26.0) [2024-06-27 15:59:03,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 15:59:06,098][06909] Updated weights for policy 0, policy_version 27682 (0.0036) [2024-06-27 15:59:08,850][06674] Fps is (10 sec: 39321.7, 60 sec: 43144.5, 300 sec: 43320.4). Total num frames: 453623808. Throughput: 0: 43304.6. Samples: 356614700. Policy #0 lag: (min: 0.0, avg: 12.4, max: 26.0) [2024-06-27 15:59:08,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 15:59:10,167][06909] Updated weights for policy 0, policy_version 27692 (0.0030) [2024-06-27 15:59:13,529][06909] Updated weights for policy 0, policy_version 27702 (0.0023) [2024-06-27 15:59:13,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.7, 300 sec: 43376.0). Total num frames: 453869568. Throughput: 0: 43264.5. Samples: 356737220. Policy #0 lag: (min: 0.0, avg: 12.4, max: 26.0) [2024-06-27 15:59:13,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 15:59:17,608][06909] Updated weights for policy 0, policy_version 27712 (0.0030) [2024-06-27 15:59:18,850][06674] Fps is (10 sec: 47513.0, 60 sec: 43690.6, 300 sec: 43375.9). Total num frames: 454098944. Throughput: 0: 43421.8. Samples: 357006000. Policy #0 lag: (min: 0.0, avg: 12.4, max: 26.0) [2024-06-27 15:59:18,851][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 15:59:20,908][06909] Updated weights for policy 0, policy_version 27722 (0.0031) [2024-06-27 15:59:23,850][06674] Fps is (10 sec: 39321.2, 60 sec: 43144.6, 300 sec: 43264.9). Total num frames: 454262784. Throughput: 0: 43430.8. Samples: 357269800. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 15:59:23,851][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 15:59:24,959][06887] Signal inference workers to stop experience collection... (5050 times) [2024-06-27 15:59:25,007][06909] InferenceWorker_p0-w0: stopping experience collection (5050 times) [2024-06-27 15:59:25,008][06887] Signal inference workers to resume experience collection... (5050 times) [2024-06-27 15:59:25,017][06909] InferenceWorker_p0-w0: resuming experience collection (5050 times) [2024-06-27 15:59:25,159][06909] Updated weights for policy 0, policy_version 27732 (0.0035) [2024-06-27 15:59:28,285][06909] Updated weights for policy 0, policy_version 27742 (0.0033) [2024-06-27 15:59:28,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 454524928. Throughput: 0: 43413.3. Samples: 357390320. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 15:59:28,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 15:59:32,535][06909] Updated weights for policy 0, policy_version 27752 (0.0040) [2024-06-27 15:59:33,852][06674] Fps is (10 sec: 49142.1, 60 sec: 43416.2, 300 sec: 43431.2). Total num frames: 454754304. Throughput: 0: 43566.1. Samples: 357661960. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 15:59:33,852][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 15:59:36,189][06909] Updated weights for policy 0, policy_version 27762 (0.0025) [2024-06-27 15:59:38,850][06674] Fps is (10 sec: 39321.7, 60 sec: 43144.7, 300 sec: 43320.4). Total num frames: 454918144. Throughput: 0: 43480.0. Samples: 357924020. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 15:59:38,850][06674] Avg episode reward: [(0, '0.409')] [2024-06-27 15:59:40,219][06909] Updated weights for policy 0, policy_version 27772 (0.0034) [2024-06-27 15:59:43,706][06909] Updated weights for policy 0, policy_version 27782 (0.0025) [2024-06-27 15:59:43,850][06674] Fps is (10 sec: 42606.5, 60 sec: 43417.5, 300 sec: 43431.5). Total num frames: 455180288. Throughput: 0: 43531.0. Samples: 358041100. Policy #0 lag: (min: 1.0, avg: 11.0, max: 27.0) [2024-06-27 15:59:43,851][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 15:59:47,779][06909] Updated weights for policy 0, policy_version 27792 (0.0032) [2024-06-27 15:59:48,850][06674] Fps is (10 sec: 50790.2, 60 sec: 43695.1, 300 sec: 43542.9). Total num frames: 455426048. Throughput: 0: 43671.5. Samples: 358319560. Policy #0 lag: (min: 1.0, avg: 11.0, max: 27.0) [2024-06-27 15:59:48,850][06674] Avg episode reward: [(0, '0.407')] [2024-06-27 15:59:48,937][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000027798_455442432.pth... [2024-06-27 15:59:48,991][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000027160_444989440.pth [2024-06-27 15:59:51,009][06909] Updated weights for policy 0, policy_version 27802 (0.0041) [2024-06-27 15:59:53,850][06674] Fps is (10 sec: 37683.5, 60 sec: 43145.4, 300 sec: 43264.9). Total num frames: 455557120. Throughput: 0: 43602.2. Samples: 358576800. Policy #0 lag: (min: 1.0, avg: 11.0, max: 27.0) [2024-06-27 15:59:53,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 15:59:55,294][06909] Updated weights for policy 0, policy_version 27812 (0.0031) [2024-06-27 15:59:58,476][06909] Updated weights for policy 0, policy_version 27822 (0.0038) [2024-06-27 15:59:58,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43417.7, 300 sec: 43376.0). Total num frames: 455835648. Throughput: 0: 43498.2. Samples: 358694640. Policy #0 lag: (min: 1.0, avg: 11.0, max: 27.0) [2024-06-27 15:59:58,850][06674] Avg episode reward: [(0, '0.409')] [2024-06-27 16:00:02,866][06909] Updated weights for policy 0, policy_version 27832 (0.0030) [2024-06-27 16:00:03,850][06674] Fps is (10 sec: 49152.2, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 456048640. Throughput: 0: 43560.5. Samples: 358966220. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-27 16:00:03,850][06674] Avg episode reward: [(0, '0.409')] [2024-06-27 16:00:06,140][06909] Updated weights for policy 0, policy_version 27842 (0.0040) [2024-06-27 16:00:08,850][06674] Fps is (10 sec: 39321.0, 60 sec: 43417.5, 300 sec: 43320.4). Total num frames: 456228864. Throughput: 0: 43574.2. Samples: 359230640. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-27 16:00:08,850][06674] Avg episode reward: [(0, '0.408')] [2024-06-27 16:00:10,396][06909] Updated weights for policy 0, policy_version 27852 (0.0043) [2024-06-27 16:00:13,523][06909] Updated weights for policy 0, policy_version 27862 (0.0041) [2024-06-27 16:00:13,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43690.5, 300 sec: 43431.5). Total num frames: 456491008. Throughput: 0: 43611.4. Samples: 359352840. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-27 16:00:13,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 16:00:17,964][06909] Updated weights for policy 0, policy_version 27872 (0.0039) [2024-06-27 16:00:18,850][06674] Fps is (10 sec: 47513.4, 60 sec: 43417.6, 300 sec: 43487.0). Total num frames: 456704000. Throughput: 0: 43533.0. Samples: 359620860. Policy #0 lag: (min: 0.0, avg: 9.2, max: 23.0) [2024-06-27 16:00:18,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 16:00:20,942][06909] Updated weights for policy 0, policy_version 27882 (0.0034) [2024-06-27 16:00:23,850][06674] Fps is (10 sec: 37683.0, 60 sec: 43417.5, 300 sec: 43209.6). Total num frames: 456867840. Throughput: 0: 43515.8. Samples: 359882240. Policy #0 lag: (min: 0.0, avg: 9.2, max: 23.0) [2024-06-27 16:00:23,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 16:00:25,425][06909] Updated weights for policy 0, policy_version 27892 (0.0031) [2024-06-27 16:00:28,635][06887] Signal inference workers to stop experience collection... (5100 times) [2024-06-27 16:00:28,641][06887] Signal inference workers to resume experience collection... (5100 times) [2024-06-27 16:00:28,686][06909] InferenceWorker_p0-w0: stopping experience collection (5100 times) [2024-06-27 16:00:28,686][06909] InferenceWorker_p0-w0: resuming experience collection (5100 times) [2024-06-27 16:00:28,791][06909] Updated weights for policy 0, policy_version 27902 (0.0030) [2024-06-27 16:00:28,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43690.6, 300 sec: 43431.5). Total num frames: 457146368. Throughput: 0: 43658.3. Samples: 360005720. Policy #0 lag: (min: 0.0, avg: 9.2, max: 23.0) [2024-06-27 16:00:28,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:00:32,720][06909] Updated weights for policy 0, policy_version 27912 (0.0030) [2024-06-27 16:00:33,850][06674] Fps is (10 sec: 50790.3, 60 sec: 43692.0, 300 sec: 43542.5). Total num frames: 457375744. Throughput: 0: 43508.7. Samples: 360277460. Policy #0 lag: (min: 0.0, avg: 9.2, max: 23.0) [2024-06-27 16:00:33,851][06674] Avg episode reward: [(0, '0.407')] [2024-06-27 16:00:36,040][06909] Updated weights for policy 0, policy_version 27922 (0.0030) [2024-06-27 16:00:38,850][06674] Fps is (10 sec: 37683.5, 60 sec: 43417.6, 300 sec: 43264.9). Total num frames: 457523200. Throughput: 0: 43569.9. Samples: 360537440. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-27 16:00:38,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 16:00:40,507][06909] Updated weights for policy 0, policy_version 27932 (0.0037) [2024-06-27 16:00:43,682][06909] Updated weights for policy 0, policy_version 27942 (0.0031) [2024-06-27 16:00:43,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43690.7, 300 sec: 43431.5). Total num frames: 457801728. Throughput: 0: 43565.7. Samples: 360655100. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-27 16:00:43,850][06674] Avg episode reward: [(0, '0.409')] [2024-06-27 16:00:48,103][06909] Updated weights for policy 0, policy_version 27952 (0.0032) [2024-06-27 16:00:48,850][06674] Fps is (10 sec: 49151.3, 60 sec: 43144.4, 300 sec: 43487.0). Total num frames: 458014720. Throughput: 0: 43581.2. Samples: 360927380. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-27 16:00:48,851][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:00:51,127][06909] Updated weights for policy 0, policy_version 27962 (0.0041) [2024-06-27 16:00:53,850][06674] Fps is (10 sec: 39321.8, 60 sec: 43963.8, 300 sec: 43376.0). Total num frames: 458194944. Throughput: 0: 43465.4. Samples: 361186580. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-27 16:00:53,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:00:55,553][06909] Updated weights for policy 0, policy_version 27972 (0.0031) [2024-06-27 16:00:58,752][06909] Updated weights for policy 0, policy_version 27982 (0.0039) [2024-06-27 16:00:58,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.6, 300 sec: 43487.0). Total num frames: 458457088. Throughput: 0: 43623.6. Samples: 361315900. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2024-06-27 16:00:58,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:01:02,911][06909] Updated weights for policy 0, policy_version 27992 (0.0031) [2024-06-27 16:01:03,850][06674] Fps is (10 sec: 47513.5, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 458670080. Throughput: 0: 43705.4. Samples: 361587600. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2024-06-27 16:01:03,851][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:01:06,069][06909] Updated weights for policy 0, policy_version 28002 (0.0033) [2024-06-27 16:01:08,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43963.7, 300 sec: 43431.5). Total num frames: 458866688. Throughput: 0: 43564.5. Samples: 361842640. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2024-06-27 16:01:08,854][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:01:10,632][06909] Updated weights for policy 0, policy_version 28012 (0.0040) [2024-06-27 16:01:13,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43417.7, 300 sec: 43375.9). Total num frames: 459096064. Throughput: 0: 43515.1. Samples: 361963900. Policy #0 lag: (min: 0.0, avg: 12.1, max: 21.0) [2024-06-27 16:01:13,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:01:13,972][06909] Updated weights for policy 0, policy_version 28022 (0.0044) [2024-06-27 16:01:18,107][06909] Updated weights for policy 0, policy_version 28032 (0.0034) [2024-06-27 16:01:18,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43417.6, 300 sec: 43542.5). Total num frames: 459309056. Throughput: 0: 43569.8. Samples: 362238100. Policy #0 lag: (min: 0.0, avg: 12.1, max: 21.0) [2024-06-27 16:01:18,851][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:01:21,455][06909] Updated weights for policy 0, policy_version 28042 (0.0031) [2024-06-27 16:01:23,850][06674] Fps is (10 sec: 39321.3, 60 sec: 43690.7, 300 sec: 43320.4). Total num frames: 459489280. Throughput: 0: 43426.6. Samples: 362491640. Policy #0 lag: (min: 0.0, avg: 12.1, max: 21.0) [2024-06-27 16:01:23,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 16:01:25,609][06909] Updated weights for policy 0, policy_version 28052 (0.0029) [2024-06-27 16:01:28,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 459751424. Throughput: 0: 43588.5. Samples: 362616580. Policy #0 lag: (min: 0.0, avg: 12.1, max: 21.0) [2024-06-27 16:01:28,850][06674] Avg episode reward: [(0, '0.409')] [2024-06-27 16:01:28,996][06909] Updated weights for policy 0, policy_version 28062 (0.0036) [2024-06-27 16:01:32,935][06909] Updated weights for policy 0, policy_version 28072 (0.0023) [2024-06-27 16:01:33,850][06674] Fps is (10 sec: 49152.1, 60 sec: 43417.7, 300 sec: 43598.1). Total num frames: 459980800. Throughput: 0: 43622.7. Samples: 362890400. Policy #0 lag: (min: 1.0, avg: 8.5, max: 21.0) [2024-06-27 16:01:33,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 16:01:36,211][06909] Updated weights for policy 0, policy_version 28082 (0.0034) [2024-06-27 16:01:38,853][06674] Fps is (10 sec: 42583.6, 60 sec: 44234.2, 300 sec: 43431.3). Total num frames: 460177408. Throughput: 0: 43631.3. Samples: 363150140. Policy #0 lag: (min: 1.0, avg: 8.5, max: 21.0) [2024-06-27 16:01:38,854][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:01:40,893][06909] Updated weights for policy 0, policy_version 28092 (0.0027) [2024-06-27 16:01:43,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43417.6, 300 sec: 43487.3). Total num frames: 460406784. Throughput: 0: 43499.6. Samples: 363273380. Policy #0 lag: (min: 1.0, avg: 8.5, max: 21.0) [2024-06-27 16:01:43,852][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:01:44,353][06909] Updated weights for policy 0, policy_version 28102 (0.0035) [2024-06-27 16:01:48,028][06887] Signal inference workers to stop experience collection... (5150 times) [2024-06-27 16:01:48,077][06909] InferenceWorker_p0-w0: stopping experience collection (5150 times) [2024-06-27 16:01:48,080][06887] Signal inference workers to resume experience collection... (5150 times) [2024-06-27 16:01:48,092][06909] InferenceWorker_p0-w0: resuming experience collection (5150 times) [2024-06-27 16:01:48,243][06909] Updated weights for policy 0, policy_version 28112 (0.0024) [2024-06-27 16:01:48,850][06674] Fps is (10 sec: 44252.5, 60 sec: 43417.7, 300 sec: 43542.6). Total num frames: 460619776. Throughput: 0: 43473.4. Samples: 363543900. Policy #0 lag: (min: 1.0, avg: 8.5, max: 21.0) [2024-06-27 16:01:48,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:01:48,870][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000028114_460619776.pth... [2024-06-27 16:01:48,918][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000027478_450199552.pth [2024-06-27 16:01:51,681][06909] Updated weights for policy 0, policy_version 28122 (0.0024) [2024-06-27 16:01:53,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.8, 300 sec: 43431.5). Total num frames: 460832768. Throughput: 0: 43532.1. Samples: 363801580. Policy #0 lag: (min: 0.0, avg: 11.9, max: 25.0) [2024-06-27 16:01:53,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:01:55,940][06909] Updated weights for policy 0, policy_version 28132 (0.0038) [2024-06-27 16:01:58,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43417.6, 300 sec: 43542.6). Total num frames: 461062144. Throughput: 0: 43645.8. Samples: 363927960. Policy #0 lag: (min: 0.0, avg: 11.9, max: 25.0) [2024-06-27 16:01:58,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:01:59,008][06909] Updated weights for policy 0, policy_version 28142 (0.0034) [2024-06-27 16:02:03,382][06909] Updated weights for policy 0, policy_version 28152 (0.0023) [2024-06-27 16:02:03,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43144.5, 300 sec: 43542.6). Total num frames: 461258752. Throughput: 0: 43522.4. Samples: 364196600. Policy #0 lag: (min: 0.0, avg: 11.9, max: 25.0) [2024-06-27 16:02:03,851][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:02:06,521][06909] Updated weights for policy 0, policy_version 28162 (0.0035) [2024-06-27 16:02:08,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43417.7, 300 sec: 43376.8). Total num frames: 461471744. Throughput: 0: 43645.9. Samples: 364455700. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-27 16:02:08,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 16:02:10,771][06909] Updated weights for policy 0, policy_version 28172 (0.0024) [2024-06-27 16:02:13,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43690.7, 300 sec: 43487.0). Total num frames: 461717504. Throughput: 0: 43687.2. Samples: 364582500. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-27 16:02:13,850][06674] Avg episode reward: [(0, '0.392')] [2024-06-27 16:02:13,902][06909] Updated weights for policy 0, policy_version 28182 (0.0029) [2024-06-27 16:02:18,354][06909] Updated weights for policy 0, policy_version 28192 (0.0029) [2024-06-27 16:02:18,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43417.7, 300 sec: 43487.0). Total num frames: 461914112. Throughput: 0: 43541.3. Samples: 364849760. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-27 16:02:18,850][06674] Avg episode reward: [(0, '0.405')] [2024-06-27 16:02:21,391][06909] Updated weights for policy 0, policy_version 28202 (0.0034) [2024-06-27 16:02:23,850][06674] Fps is (10 sec: 39321.6, 60 sec: 43690.8, 300 sec: 43320.4). Total num frames: 462110720. Throughput: 0: 43544.3. Samples: 365109480. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-27 16:02:23,850][06674] Avg episode reward: [(0, '0.408')] [2024-06-27 16:02:25,914][06909] Updated weights for policy 0, policy_version 28212 (0.0031) [2024-06-27 16:02:28,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 462356480. Throughput: 0: 43509.3. Samples: 365231300. Policy #0 lag: (min: 0.0, avg: 12.5, max: 21.0) [2024-06-27 16:02:28,850][06674] Avg episode reward: [(0, '0.408')] [2024-06-27 16:02:29,307][06909] Updated weights for policy 0, policy_version 28222 (0.0031) [2024-06-27 16:02:33,350][06909] Updated weights for policy 0, policy_version 28232 (0.0037) [2024-06-27 16:02:33,850][06674] Fps is (10 sec: 47513.0, 60 sec: 43417.6, 300 sec: 43542.5). Total num frames: 462585856. Throughput: 0: 43525.7. Samples: 365502560. Policy #0 lag: (min: 0.0, avg: 12.5, max: 21.0) [2024-06-27 16:02:33,852][06674] Avg episode reward: [(0, '0.409')] [2024-06-27 16:02:36,646][06909] Updated weights for policy 0, policy_version 28242 (0.0034) [2024-06-27 16:02:38,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43420.1, 300 sec: 43431.5). Total num frames: 462782464. Throughput: 0: 43506.1. Samples: 365759360. Policy #0 lag: (min: 0.0, avg: 12.5, max: 21.0) [2024-06-27 16:02:38,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:02:40,922][06909] Updated weights for policy 0, policy_version 28252 (0.0027) [2024-06-27 16:02:43,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43690.7, 300 sec: 43487.0). Total num frames: 463028224. Throughput: 0: 43662.3. Samples: 365892760. Policy #0 lag: (min: 0.0, avg: 12.5, max: 21.0) [2024-06-27 16:02:43,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:02:43,943][06909] Updated weights for policy 0, policy_version 28262 (0.0039) [2024-06-27 16:02:48,321][06909] Updated weights for policy 0, policy_version 28272 (0.0036) [2024-06-27 16:02:48,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43417.5, 300 sec: 43542.6). Total num frames: 463224832. Throughput: 0: 43615.9. Samples: 366159320. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 16:02:48,851][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 16:02:51,436][06909] Updated weights for policy 0, policy_version 28282 (0.0037) [2024-06-27 16:02:53,850][06674] Fps is (10 sec: 39321.6, 60 sec: 43144.5, 300 sec: 43431.5). Total num frames: 463421440. Throughput: 0: 43566.2. Samples: 366416180. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 16:02:53,850][06674] Avg episode reward: [(0, '0.407')] [2024-06-27 16:02:55,771][06909] Updated weights for policy 0, policy_version 28292 (0.0029) [2024-06-27 16:02:58,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43690.6, 300 sec: 43487.0). Total num frames: 463683584. Throughput: 0: 43623.9. Samples: 366545580. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 16:02:58,850][06674] Avg episode reward: [(0, '0.392')] [2024-06-27 16:02:58,922][06909] Updated weights for policy 0, policy_version 28302 (0.0035) [2024-06-27 16:03:03,422][06909] Updated weights for policy 0, policy_version 28312 (0.0036) [2024-06-27 16:03:03,820][06887] Signal inference workers to stop experience collection... (5200 times) [2024-06-27 16:03:03,821][06887] Signal inference workers to resume experience collection... (5200 times) [2024-06-27 16:03:03,839][06909] InferenceWorker_p0-w0: stopping experience collection (5200 times) [2024-06-27 16:03:03,850][06674] Fps is (10 sec: 47513.9, 60 sec: 43963.8, 300 sec: 43598.1). Total num frames: 463896576. Throughput: 0: 43684.6. Samples: 366815560. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 16:03:03,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:03:03,867][06909] InferenceWorker_p0-w0: resuming experience collection (5200 times) [2024-06-27 16:03:06,803][06909] Updated weights for policy 0, policy_version 28322 (0.0024) [2024-06-27 16:03:08,850][06674] Fps is (10 sec: 39321.4, 60 sec: 43417.5, 300 sec: 43487.0). Total num frames: 464076800. Throughput: 0: 43644.7. Samples: 367073500. Policy #0 lag: (min: 1.0, avg: 13.4, max: 21.0) [2024-06-27 16:03:08,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 16:03:10,859][06909] Updated weights for policy 0, policy_version 28332 (0.0036) [2024-06-27 16:03:13,850][06674] Fps is (10 sec: 44236.0, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 464338944. Throughput: 0: 43695.6. Samples: 367197600. Policy #0 lag: (min: 1.0, avg: 13.4, max: 21.0) [2024-06-27 16:03:13,850][06674] Avg episode reward: [(0, '0.408')] [2024-06-27 16:03:14,157][06909] Updated weights for policy 0, policy_version 28342 (0.0039) [2024-06-27 16:03:18,295][06909] Updated weights for policy 0, policy_version 28352 (0.0028) [2024-06-27 16:03:18,850][06674] Fps is (10 sec: 47513.7, 60 sec: 43963.7, 300 sec: 43653.6). Total num frames: 464551936. Throughput: 0: 43737.8. Samples: 367470760. Policy #0 lag: (min: 1.0, avg: 13.4, max: 21.0) [2024-06-27 16:03:18,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 16:03:21,474][06909] Updated weights for policy 0, policy_version 28362 (0.0046) [2024-06-27 16:03:23,850][06674] Fps is (10 sec: 39322.1, 60 sec: 43690.7, 300 sec: 43431.5). Total num frames: 464732160. Throughput: 0: 43771.7. Samples: 367729080. Policy #0 lag: (min: 1.0, avg: 13.4, max: 21.0) [2024-06-27 16:03:23,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:03:25,747][06909] Updated weights for policy 0, policy_version 28372 (0.0039) [2024-06-27 16:03:28,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43963.8, 300 sec: 43542.6). Total num frames: 464994304. Throughput: 0: 43656.0. Samples: 367857280. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-27 16:03:28,850][06674] Avg episode reward: [(0, '0.407')] [2024-06-27 16:03:29,439][06909] Updated weights for policy 0, policy_version 28382 (0.0038) [2024-06-27 16:03:33,145][06909] Updated weights for policy 0, policy_version 28392 (0.0042) [2024-06-27 16:03:33,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 465190912. Throughput: 0: 43650.7. Samples: 368123600. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-27 16:03:33,851][06674] Avg episode reward: [(0, '0.409')] [2024-06-27 16:03:36,726][06909] Updated weights for policy 0, policy_version 28402 (0.0026) [2024-06-27 16:03:38,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43690.7, 300 sec: 43487.0). Total num frames: 465403904. Throughput: 0: 43777.2. Samples: 368386160. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-27 16:03:38,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 16:03:40,698][06909] Updated weights for policy 0, policy_version 28412 (0.0039) [2024-06-27 16:03:43,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43417.6, 300 sec: 43487.9). Total num frames: 465633280. Throughput: 0: 43722.8. Samples: 368513100. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-27 16:03:43,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:03:44,257][06909] Updated weights for policy 0, policy_version 28422 (0.0039) [2024-06-27 16:03:48,369][06909] Updated weights for policy 0, policy_version 28432 (0.0040) [2024-06-27 16:03:48,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43690.8, 300 sec: 43653.8). Total num frames: 465846272. Throughput: 0: 43502.2. Samples: 368773160. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-27 16:03:48,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:03:48,949][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000028434_465862656.pth... [2024-06-27 16:03:48,999][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000027798_455442432.pth [2024-06-27 16:03:51,602][06909] Updated weights for policy 0, policy_version 28442 (0.0045) [2024-06-27 16:03:53,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43690.6, 300 sec: 43431.5). Total num frames: 466042880. Throughput: 0: 43630.7. Samples: 369036880. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-27 16:03:53,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:03:55,851][06909] Updated weights for policy 0, policy_version 28452 (0.0030) [2024-06-27 16:03:58,843][06909] Updated weights for policy 0, policy_version 28462 (0.0029) [2024-06-27 16:03:58,850][06674] Fps is (10 sec: 47513.6, 60 sec: 43963.8, 300 sec: 43653.6). Total num frames: 466321408. Throughput: 0: 43769.0. Samples: 369167200. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-27 16:03:58,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 16:04:03,131][06909] Updated weights for policy 0, policy_version 28472 (0.0030) [2024-06-27 16:04:03,850][06674] Fps is (10 sec: 47513.7, 60 sec: 43690.5, 300 sec: 43709.2). Total num frames: 466518016. Throughput: 0: 43645.8. Samples: 369434820. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-27 16:04:03,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 16:04:04,093][06887] Signal inference workers to stop experience collection... (5250 times) [2024-06-27 16:04:04,112][06909] InferenceWorker_p0-w0: stopping experience collection (5250 times) [2024-06-27 16:04:04,203][06887] Signal inference workers to resume experience collection... (5250 times) [2024-06-27 16:04:04,203][06909] InferenceWorker_p0-w0: resuming experience collection (5250 times) [2024-06-27 16:04:06,145][06909] Updated weights for policy 0, policy_version 28482 (0.0033) [2024-06-27 16:04:08,850][06674] Fps is (10 sec: 37682.9, 60 sec: 43690.7, 300 sec: 43487.0). Total num frames: 466698240. Throughput: 0: 43797.7. Samples: 369699980. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-27 16:04:08,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:04:10,733][06909] Updated weights for policy 0, policy_version 28492 (0.0034) [2024-06-27 16:04:13,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 466960384. Throughput: 0: 43671.5. Samples: 369822500. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-27 16:04:13,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:04:13,954][06909] Updated weights for policy 0, policy_version 28502 (0.0038) [2024-06-27 16:04:18,168][06909] Updated weights for policy 0, policy_version 28512 (0.0036) [2024-06-27 16:04:18,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43417.7, 300 sec: 43709.2). Total num frames: 467156992. Throughput: 0: 43709.9. Samples: 370090540. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-27 16:04:18,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:04:21,362][06909] Updated weights for policy 0, policy_version 28522 (0.0041) [2024-06-27 16:04:23,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43963.7, 300 sec: 43542.6). Total num frames: 467369984. Throughput: 0: 43632.5. Samples: 370349620. Policy #0 lag: (min: 0.0, avg: 11.7, max: 21.0) [2024-06-27 16:04:23,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 16:04:25,755][06909] Updated weights for policy 0, policy_version 28532 (0.0040) [2024-06-27 16:04:28,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43417.6, 300 sec: 43542.9). Total num frames: 467599360. Throughput: 0: 43576.0. Samples: 370474020. Policy #0 lag: (min: 0.0, avg: 11.7, max: 21.0) [2024-06-27 16:04:28,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:04:29,306][06909] Updated weights for policy 0, policy_version 28542 (0.0032) [2024-06-27 16:04:33,123][06909] Updated weights for policy 0, policy_version 28552 (0.0033) [2024-06-27 16:04:33,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 467812352. Throughput: 0: 43681.7. Samples: 370738840. Policy #0 lag: (min: 0.0, avg: 11.7, max: 21.0) [2024-06-27 16:04:33,850][06674] Avg episode reward: [(0, '0.406')] [2024-06-27 16:04:36,821][06909] Updated weights for policy 0, policy_version 28562 (0.0022) [2024-06-27 16:04:38,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43417.7, 300 sec: 43487.1). Total num frames: 468008960. Throughput: 0: 43850.3. Samples: 371010140. Policy #0 lag: (min: 0.0, avg: 11.7, max: 21.0) [2024-06-27 16:04:38,850][06674] Avg episode reward: [(0, '0.408')] [2024-06-27 16:04:40,472][06909] Updated weights for policy 0, policy_version 28572 (0.0031) [2024-06-27 16:04:43,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.7, 300 sec: 43487.0). Total num frames: 468254720. Throughput: 0: 43648.4. Samples: 371131380. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 16:04:43,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:04:44,117][06909] Updated weights for policy 0, policy_version 28582 (0.0032) [2024-06-27 16:04:48,335][06909] Updated weights for policy 0, policy_version 28592 (0.0030) [2024-06-27 16:04:48,850][06674] Fps is (10 sec: 49152.0, 60 sec: 44236.8, 300 sec: 43875.8). Total num frames: 468500480. Throughput: 0: 43783.2. Samples: 371405060. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 16:04:48,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:04:51,355][06909] Updated weights for policy 0, policy_version 28602 (0.0031) [2024-06-27 16:04:53,850][06674] Fps is (10 sec: 42597.7, 60 sec: 43963.7, 300 sec: 43542.5). Total num frames: 468680704. Throughput: 0: 43815.9. Samples: 371671700. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 16:04:53,850][06674] Avg episode reward: [(0, '0.409')] [2024-06-27 16:04:55,658][06909] Updated weights for policy 0, policy_version 28612 (0.0036) [2024-06-27 16:04:58,761][06909] Updated weights for policy 0, policy_version 28622 (0.0027) [2024-06-27 16:04:58,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 468942848. Throughput: 0: 43728.5. Samples: 371790280. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-27 16:04:58,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:05:03,414][06909] Updated weights for policy 0, policy_version 28632 (0.0027) [2024-06-27 16:05:03,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 469139456. Throughput: 0: 43712.2. Samples: 372057600. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-27 16:05:03,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:05:05,999][06887] Signal inference workers to stop experience collection... (5300 times) [2024-06-27 16:05:06,040][06909] InferenceWorker_p0-w0: stopping experience collection (5300 times) [2024-06-27 16:05:06,049][06887] Signal inference workers to resume experience collection... (5300 times) [2024-06-27 16:05:06,057][06909] InferenceWorker_p0-w0: resuming experience collection (5300 times) [2024-06-27 16:05:06,184][06909] Updated weights for policy 0, policy_version 28642 (0.0021) [2024-06-27 16:05:08,850][06674] Fps is (10 sec: 37682.6, 60 sec: 43690.6, 300 sec: 43487.0). Total num frames: 469319680. Throughput: 0: 43772.4. Samples: 372319380. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-27 16:05:08,851][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:05:10,713][06909] Updated weights for policy 0, policy_version 28652 (0.0032) [2024-06-27 16:05:13,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43690.7, 300 sec: 43653.7). Total num frames: 469581824. Throughput: 0: 43730.6. Samples: 372441900. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-27 16:05:13,859][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:05:14,080][06909] Updated weights for policy 0, policy_version 28662 (0.0027) [2024-06-27 16:05:18,311][06909] Updated weights for policy 0, policy_version 28672 (0.0041) [2024-06-27 16:05:18,850][06674] Fps is (10 sec: 47513.7, 60 sec: 43963.6, 300 sec: 43820.3). Total num frames: 469794816. Throughput: 0: 43828.8. Samples: 372711140. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-27 16:05:18,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 16:05:21,663][06909] Updated weights for policy 0, policy_version 28682 (0.0024) [2024-06-27 16:05:23,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43690.7, 300 sec: 43542.6). Total num frames: 469991424. Throughput: 0: 43672.9. Samples: 372975420. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-27 16:05:23,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:05:25,710][06909] Updated weights for policy 0, policy_version 28692 (0.0031) [2024-06-27 16:05:28,852][06674] Fps is (10 sec: 42590.1, 60 sec: 43689.1, 300 sec: 43542.3). Total num frames: 470220800. Throughput: 0: 43726.0. Samples: 373099140. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-27 16:05:28,852][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 16:05:29,215][06909] Updated weights for policy 0, policy_version 28702 (0.0029) [2024-06-27 16:05:33,173][06909] Updated weights for policy 0, policy_version 28712 (0.0032) [2024-06-27 16:05:33,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.7, 300 sec: 43820.3). Total num frames: 470450176. Throughput: 0: 43667.6. Samples: 373370100. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-27 16:05:33,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:05:36,491][06909] Updated weights for policy 0, policy_version 28722 (0.0045) [2024-06-27 16:05:38,850][06674] Fps is (10 sec: 44245.9, 60 sec: 44236.8, 300 sec: 43598.1). Total num frames: 470663168. Throughput: 0: 43668.5. Samples: 373636780. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 16:05:38,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:05:40,706][06909] Updated weights for policy 0, policy_version 28732 (0.0029) [2024-06-27 16:05:43,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 43653.7). Total num frames: 470892544. Throughput: 0: 43760.9. Samples: 373759520. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 16:05:43,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:05:43,882][06909] Updated weights for policy 0, policy_version 28742 (0.0026) [2024-06-27 16:05:48,242][06909] Updated weights for policy 0, policy_version 28752 (0.0031) [2024-06-27 16:05:48,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43417.5, 300 sec: 43764.7). Total num frames: 471105536. Throughput: 0: 43731.7. Samples: 374025520. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 16:05:48,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 16:05:48,999][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000028755_471121920.pth... [2024-06-27 16:05:49,052][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000028114_460619776.pth [2024-06-27 16:05:51,282][06909] Updated weights for policy 0, policy_version 28762 (0.0036) [2024-06-27 16:05:53,852][06674] Fps is (10 sec: 42589.4, 60 sec: 43962.3, 300 sec: 43597.8). Total num frames: 471318528. Throughput: 0: 43666.6. Samples: 374284460. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 16:05:53,852][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 16:05:55,875][06909] Updated weights for policy 0, policy_version 28772 (0.0040) [2024-06-27 16:05:58,602][06909] Updated weights for policy 0, policy_version 28782 (0.0029) [2024-06-27 16:05:58,852][06674] Fps is (10 sec: 45865.8, 60 sec: 43689.1, 300 sec: 43708.9). Total num frames: 471564288. Throughput: 0: 43663.3. Samples: 374406840. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-27 16:05:58,852][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 16:06:03,254][06909] Updated weights for policy 0, policy_version 28792 (0.0033) [2024-06-27 16:06:03,850][06674] Fps is (10 sec: 44245.4, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 471760896. Throughput: 0: 43686.7. Samples: 374677040. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-27 16:06:03,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 16:06:06,316][06909] Updated weights for policy 0, policy_version 28802 (0.0027) [2024-06-27 16:06:08,850][06674] Fps is (10 sec: 39329.9, 60 sec: 43963.8, 300 sec: 43598.1). Total num frames: 471957504. Throughput: 0: 43608.8. Samples: 374937820. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-27 16:06:08,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:06:10,726][06887] Signal inference workers to stop experience collection... (5350 times) [2024-06-27 16:06:10,728][06887] Signal inference workers to resume experience collection... (5350 times) [2024-06-27 16:06:10,741][06909] Updated weights for policy 0, policy_version 28812 (0.0030) [2024-06-27 16:06:10,776][06909] InferenceWorker_p0-w0: stopping experience collection (5350 times) [2024-06-27 16:06:10,776][06909] InferenceWorker_p0-w0: resuming experience collection (5350 times) [2024-06-27 16:06:13,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43417.6, 300 sec: 43653.7). Total num frames: 472186880. Throughput: 0: 43729.0. Samples: 375066860. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-27 16:06:13,851][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 16:06:14,315][06909] Updated weights for policy 0, policy_version 28822 (0.0032) [2024-06-27 16:06:18,239][06909] Updated weights for policy 0, policy_version 28832 (0.0038) [2024-06-27 16:06:18,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43690.8, 300 sec: 43820.3). Total num frames: 472416256. Throughput: 0: 43536.0. Samples: 375329220. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 16:06:18,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:06:21,734][06909] Updated weights for policy 0, policy_version 28842 (0.0021) [2024-06-27 16:06:23,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.7, 300 sec: 43653.6). Total num frames: 472629248. Throughput: 0: 43468.4. Samples: 375592860. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 16:06:23,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 16:06:25,917][06909] Updated weights for policy 0, policy_version 28852 (0.0033) [2024-06-27 16:06:28,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43692.1, 300 sec: 43598.1). Total num frames: 472842240. Throughput: 0: 43451.0. Samples: 375714820. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 16:06:28,854][06674] Avg episode reward: [(0, '0.407')] [2024-06-27 16:06:29,171][06909] Updated weights for policy 0, policy_version 28862 (0.0022) [2024-06-27 16:06:33,307][06909] Updated weights for policy 0, policy_version 28872 (0.0028) [2024-06-27 16:06:33,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.6, 300 sec: 43765.2). Total num frames: 473088000. Throughput: 0: 43596.8. Samples: 375987380. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 16:06:33,850][06674] Avg episode reward: [(0, '0.409')] [2024-06-27 16:06:36,574][06909] Updated weights for policy 0, policy_version 28882 (0.0042) [2024-06-27 16:06:38,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 473268224. Throughput: 0: 43714.9. Samples: 376251540. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-27 16:06:38,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 16:06:40,682][06909] Updated weights for policy 0, policy_version 28892 (0.0026) [2024-06-27 16:06:43,850][06674] Fps is (10 sec: 40960.6, 60 sec: 43417.6, 300 sec: 43653.6). Total num frames: 473497600. Throughput: 0: 43711.0. Samples: 376373740. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-27 16:06:43,850][06674] Avg episode reward: [(0, '0.409')] [2024-06-27 16:06:44,270][06909] Updated weights for policy 0, policy_version 28902 (0.0030) [2024-06-27 16:06:48,020][06909] Updated weights for policy 0, policy_version 28912 (0.0034) [2024-06-27 16:06:48,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 473726976. Throughput: 0: 43718.7. Samples: 376644380. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-27 16:06:48,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 16:06:51,847][06909] Updated weights for policy 0, policy_version 28922 (0.0040) [2024-06-27 16:06:53,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43146.0, 300 sec: 43542.6). Total num frames: 473907200. Throughput: 0: 43612.5. Samples: 376900380. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 16:06:53,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 16:06:55,771][06909] Updated weights for policy 0, policy_version 28932 (0.0028) [2024-06-27 16:06:58,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43146.0, 300 sec: 43709.2). Total num frames: 474152960. Throughput: 0: 43623.1. Samples: 377029900. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 16:06:58,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:06:59,326][06909] Updated weights for policy 0, policy_version 28942 (0.0027) [2024-06-27 16:07:03,580][06909] Updated weights for policy 0, policy_version 28952 (0.0039) [2024-06-27 16:07:03,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 474365952. Throughput: 0: 43627.0. Samples: 377292440. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 16:07:03,850][06674] Avg episode reward: [(0, '0.408')] [2024-06-27 16:07:07,190][06909] Updated weights for policy 0, policy_version 28962 (0.0040) [2024-06-27 16:07:08,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 474578944. Throughput: 0: 43540.4. Samples: 377552180. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 16:07:08,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:07:11,010][06909] Updated weights for policy 0, policy_version 28972 (0.0030) [2024-06-27 16:07:13,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43417.6, 300 sec: 43653.6). Total num frames: 474791936. Throughput: 0: 43643.1. Samples: 377678760. Policy #0 lag: (min: 0.0, avg: 11.8, max: 21.0) [2024-06-27 16:07:13,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 16:07:14,595][06909] Updated weights for policy 0, policy_version 28982 (0.0035) [2024-06-27 16:07:18,341][06909] Updated weights for policy 0, policy_version 28992 (0.0029) [2024-06-27 16:07:18,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43417.5, 300 sec: 43764.7). Total num frames: 475021312. Throughput: 0: 43562.2. Samples: 377947680. Policy #0 lag: (min: 0.0, avg: 11.8, max: 21.0) [2024-06-27 16:07:18,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 16:07:22,244][06909] Updated weights for policy 0, policy_version 29002 (0.0030) [2024-06-27 16:07:22,247][06887] Signal inference workers to stop experience collection... (5400 times) [2024-06-27 16:07:22,247][06887] Signal inference workers to resume experience collection... (5400 times) [2024-06-27 16:07:22,295][06909] InferenceWorker_p0-w0: stopping experience collection (5400 times) [2024-06-27 16:07:22,295][06909] InferenceWorker_p0-w0: resuming experience collection (5400 times) [2024-06-27 16:07:23,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43417.6, 300 sec: 43653.6). Total num frames: 475234304. Throughput: 0: 43485.2. Samples: 378208380. Policy #0 lag: (min: 0.0, avg: 11.8, max: 21.0) [2024-06-27 16:07:23,851][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:07:25,650][06909] Updated weights for policy 0, policy_version 29012 (0.0027) [2024-06-27 16:07:28,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 475463680. Throughput: 0: 43649.2. Samples: 378337960. Policy #0 lag: (min: 0.0, avg: 11.8, max: 21.0) [2024-06-27 16:07:28,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:07:29,568][06909] Updated weights for policy 0, policy_version 29022 (0.0040) [2024-06-27 16:07:33,424][06909] Updated weights for policy 0, policy_version 29032 (0.0032) [2024-06-27 16:07:33,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43144.5, 300 sec: 43709.2). Total num frames: 475676672. Throughput: 0: 43576.0. Samples: 378605300. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 16:07:33,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 16:07:36,914][06909] Updated weights for policy 0, policy_version 29042 (0.0022) [2024-06-27 16:07:38,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43417.6, 300 sec: 43542.6). Total num frames: 475873280. Throughput: 0: 43758.2. Samples: 378869500. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 16:07:38,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:07:40,662][06909] Updated weights for policy 0, policy_version 29052 (0.0030) [2024-06-27 16:07:43,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 476119040. Throughput: 0: 43645.0. Samples: 378993920. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 16:07:43,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:07:44,587][06909] Updated weights for policy 0, policy_version 29062 (0.0032) [2024-06-27 16:07:48,049][06909] Updated weights for policy 0, policy_version 29072 (0.0032) [2024-06-27 16:07:48,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43417.6, 300 sec: 43764.7). Total num frames: 476332032. Throughput: 0: 43816.4. Samples: 379264180. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 16:07:48,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:07:48,941][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000029074_476348416.pth... [2024-06-27 16:07:48,997][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000028434_465862656.pth [2024-06-27 16:07:52,015][06909] Updated weights for policy 0, policy_version 29082 (0.0042) [2024-06-27 16:07:53,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43690.6, 300 sec: 43542.6). Total num frames: 476528640. Throughput: 0: 43842.7. Samples: 379525100. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-27 16:07:53,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 16:07:55,835][06909] Updated weights for policy 0, policy_version 29092 (0.0034) [2024-06-27 16:07:58,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.8, 300 sec: 43653.6). Total num frames: 476774400. Throughput: 0: 43787.2. Samples: 379649180. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-27 16:07:58,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:08:00,017][06909] Updated weights for policy 0, policy_version 29102 (0.0021) [2024-06-27 16:08:03,293][06909] Updated weights for policy 0, policy_version 29112 (0.0024) [2024-06-27 16:08:03,850][06674] Fps is (10 sec: 47513.4, 60 sec: 43963.7, 300 sec: 43820.3). Total num frames: 477003776. Throughput: 0: 43639.6. Samples: 379911460. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-27 16:08:03,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:08:07,402][06909] Updated weights for policy 0, policy_version 29122 (0.0031) [2024-06-27 16:08:08,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.8, 300 sec: 43653.6). Total num frames: 477216768. Throughput: 0: 43729.8. Samples: 380176220. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-27 16:08:08,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:08:10,629][06909] Updated weights for policy 0, policy_version 29132 (0.0031) [2024-06-27 16:08:13,851][06674] Fps is (10 sec: 42594.3, 60 sec: 43963.0, 300 sec: 43653.5). Total num frames: 477429760. Throughput: 0: 43689.3. Samples: 380304020. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 16:08:13,851][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:08:14,818][06909] Updated weights for policy 0, policy_version 29142 (0.0032) [2024-06-27 16:08:17,994][06909] Updated weights for policy 0, policy_version 29152 (0.0038) [2024-06-27 16:08:18,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.8, 300 sec: 43820.3). Total num frames: 477659136. Throughput: 0: 43737.0. Samples: 380573460. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 16:08:18,850][06674] Avg episode reward: [(0, '0.408')] [2024-06-27 16:08:22,258][06909] Updated weights for policy 0, policy_version 29162 (0.0037) [2024-06-27 16:08:23,850][06674] Fps is (10 sec: 42602.8, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 477855744. Throughput: 0: 43712.8. Samples: 380836580. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 16:08:23,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 16:08:25,380][06909] Updated weights for policy 0, policy_version 29172 (0.0022) [2024-06-27 16:08:28,852][06674] Fps is (10 sec: 42589.4, 60 sec: 43689.2, 300 sec: 43708.9). Total num frames: 478085120. Throughput: 0: 43751.3. Samples: 380962820. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 16:08:28,852][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:08:29,559][06909] Updated weights for policy 0, policy_version 29182 (0.0025) [2024-06-27 16:08:33,196][06909] Updated weights for policy 0, policy_version 29192 (0.0022) [2024-06-27 16:08:33,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44236.8, 300 sec: 43820.3). Total num frames: 478330880. Throughput: 0: 43725.3. Samples: 381231820. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2024-06-27 16:08:33,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:08:34,290][06887] Signal inference workers to stop experience collection... (5450 times) [2024-06-27 16:08:34,291][06887] Signal inference workers to resume experience collection... (5450 times) [2024-06-27 16:08:34,336][06909] InferenceWorker_p0-w0: stopping experience collection (5450 times) [2024-06-27 16:08:34,336][06909] InferenceWorker_p0-w0: resuming experience collection (5450 times) [2024-06-27 16:08:36,833][06909] Updated weights for policy 0, policy_version 29202 (0.0025) [2024-06-27 16:08:38,850][06674] Fps is (10 sec: 44246.1, 60 sec: 44236.8, 300 sec: 43709.2). Total num frames: 478527488. Throughput: 0: 43807.6. Samples: 381496440. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2024-06-27 16:08:38,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:08:40,529][06909] Updated weights for policy 0, policy_version 29212 (0.0025) [2024-06-27 16:08:43,850][06674] Fps is (10 sec: 39321.4, 60 sec: 43417.5, 300 sec: 43653.6). Total num frames: 478724096. Throughput: 0: 43859.4. Samples: 381622860. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2024-06-27 16:08:43,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:08:44,327][06909] Updated weights for policy 0, policy_version 29222 (0.0033) [2024-06-27 16:08:48,022][06909] Updated weights for policy 0, policy_version 29232 (0.0027) [2024-06-27 16:08:48,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.8, 300 sec: 43820.3). Total num frames: 478969856. Throughput: 0: 43816.1. Samples: 381883180. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2024-06-27 16:08:48,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:08:52,099][06909] Updated weights for policy 0, policy_version 29242 (0.0032) [2024-06-27 16:08:53,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.8, 300 sec: 43542.6). Total num frames: 479166464. Throughput: 0: 43797.8. Samples: 382147120. Policy #0 lag: (min: 0.0, avg: 11.7, max: 20.0) [2024-06-27 16:08:53,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:08:55,460][06909] Updated weights for policy 0, policy_version 29252 (0.0024) [2024-06-27 16:08:58,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 479379456. Throughput: 0: 43656.6. Samples: 382268520. Policy #0 lag: (min: 0.0, avg: 11.7, max: 20.0) [2024-06-27 16:08:58,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:08:59,826][06909] Updated weights for policy 0, policy_version 29262 (0.0031) [2024-06-27 16:09:02,904][06909] Updated weights for policy 0, policy_version 29272 (0.0041) [2024-06-27 16:09:03,850][06674] Fps is (10 sec: 47512.6, 60 sec: 43963.6, 300 sec: 43875.8). Total num frames: 479641600. Throughput: 0: 43709.5. Samples: 382540400. Policy #0 lag: (min: 0.0, avg: 11.7, max: 20.0) [2024-06-27 16:09:03,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 16:09:07,183][06909] Updated weights for policy 0, policy_version 29282 (0.0034) [2024-06-27 16:09:08,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 479838208. Throughput: 0: 43881.8. Samples: 382811260. Policy #0 lag: (min: 0.0, avg: 11.7, max: 20.0) [2024-06-27 16:09:08,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 16:09:10,182][06909] Updated weights for policy 0, policy_version 29292 (0.0032) [2024-06-27 16:09:13,852][06674] Fps is (10 sec: 39314.5, 60 sec: 43416.9, 300 sec: 43653.3). Total num frames: 480034816. Throughput: 0: 43792.9. Samples: 382933500. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 16:09:13,852][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:09:14,616][06909] Updated weights for policy 0, policy_version 29302 (0.0035) [2024-06-27 16:09:17,631][06909] Updated weights for policy 0, policy_version 29312 (0.0038) [2024-06-27 16:09:18,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.7, 300 sec: 43820.3). Total num frames: 480296960. Throughput: 0: 43655.6. Samples: 383196320. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 16:09:18,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:09:21,942][06909] Updated weights for policy 0, policy_version 29322 (0.0022) [2024-06-27 16:09:23,850][06674] Fps is (10 sec: 44246.2, 60 sec: 43690.8, 300 sec: 43653.7). Total num frames: 480477184. Throughput: 0: 43648.5. Samples: 383460620. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 16:09:23,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:09:25,435][06909] Updated weights for policy 0, policy_version 29332 (0.0029) [2024-06-27 16:09:28,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43692.2, 300 sec: 43709.2). Total num frames: 480706560. Throughput: 0: 43526.7. Samples: 383581560. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 16:09:28,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:09:29,286][06909] Updated weights for policy 0, policy_version 29342 (0.0037) [2024-06-27 16:09:33,084][06909] Updated weights for policy 0, policy_version 29352 (0.0033) [2024-06-27 16:09:33,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43417.6, 300 sec: 43820.2). Total num frames: 480935936. Throughput: 0: 43807.5. Samples: 383854520. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 16:09:33,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:09:37,378][06909] Updated weights for policy 0, policy_version 29362 (0.0040) [2024-06-27 16:09:38,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43690.5, 300 sec: 43709.1). Total num frames: 481148928. Throughput: 0: 43614.1. Samples: 384109760. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 16:09:38,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:09:40,664][06909] Updated weights for policy 0, policy_version 29372 (0.0050) [2024-06-27 16:09:43,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.7, 300 sec: 43542.5). Total num frames: 481345536. Throughput: 0: 43707.5. Samples: 384235360. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 16:09:43,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:09:44,713][06909] Updated weights for policy 0, policy_version 29382 (0.0030) [2024-06-27 16:09:48,033][06909] Updated weights for policy 0, policy_version 29392 (0.0030) [2024-06-27 16:09:48,850][06674] Fps is (10 sec: 45875.8, 60 sec: 43963.7, 300 sec: 43820.3). Total num frames: 481607680. Throughput: 0: 43795.7. Samples: 384511200. Policy #0 lag: (min: 3.0, avg: 10.7, max: 22.0) [2024-06-27 16:09:48,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:09:49,001][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000029397_481640448.pth... [2024-06-27 16:09:49,052][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000028755_471121920.pth [2024-06-27 16:09:52,643][06909] Updated weights for policy 0, policy_version 29402 (0.0041) [2024-06-27 16:09:53,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43690.7, 300 sec: 43542.6). Total num frames: 481787904. Throughput: 0: 43456.5. Samples: 384766800. Policy #0 lag: (min: 3.0, avg: 10.7, max: 22.0) [2024-06-27 16:09:53,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 16:09:55,341][06887] Signal inference workers to stop experience collection... (5500 times) [2024-06-27 16:09:55,342][06887] Signal inference workers to resume experience collection... (5500 times) [2024-06-27 16:09:55,356][06909] Updated weights for policy 0, policy_version 29412 (0.0040) [2024-06-27 16:09:55,385][06909] InferenceWorker_p0-w0: stopping experience collection (5500 times) [2024-06-27 16:09:55,385][06909] InferenceWorker_p0-w0: resuming experience collection (5500 times) [2024-06-27 16:09:58,850][06674] Fps is (10 sec: 39321.6, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 482000896. Throughput: 0: 43453.5. Samples: 384888820. Policy #0 lag: (min: 3.0, avg: 10.7, max: 22.0) [2024-06-27 16:09:58,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:10:00,000][06909] Updated weights for policy 0, policy_version 29422 (0.0039) [2024-06-27 16:10:02,723][06909] Updated weights for policy 0, policy_version 29432 (0.0035) [2024-06-27 16:10:03,852][06674] Fps is (10 sec: 47503.8, 60 sec: 43689.4, 300 sec: 43875.5). Total num frames: 482263040. Throughput: 0: 43730.0. Samples: 385164260. Policy #0 lag: (min: 3.0, avg: 10.7, max: 22.0) [2024-06-27 16:10:03,852][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:10:07,315][06909] Updated weights for policy 0, policy_version 29442 (0.0032) [2024-06-27 16:10:08,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 482459648. Throughput: 0: 43726.0. Samples: 385428300. Policy #0 lag: (min: 1.0, avg: 8.5, max: 22.0) [2024-06-27 16:10:08,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:10:10,082][06909] Updated weights for policy 0, policy_version 29452 (0.0035) [2024-06-27 16:10:13,850][06674] Fps is (10 sec: 40968.5, 60 sec: 43965.2, 300 sec: 43653.7). Total num frames: 482672640. Throughput: 0: 43812.5. Samples: 385553120. Policy #0 lag: (min: 1.0, avg: 8.5, max: 22.0) [2024-06-27 16:10:13,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:10:14,779][06909] Updated weights for policy 0, policy_version 29462 (0.0025) [2024-06-27 16:10:17,704][06909] Updated weights for policy 0, policy_version 29472 (0.0030) [2024-06-27 16:10:18,852][06674] Fps is (10 sec: 47504.4, 60 sec: 43962.2, 300 sec: 43875.5). Total num frames: 482934784. Throughput: 0: 43695.4. Samples: 385820900. Policy #0 lag: (min: 1.0, avg: 8.5, max: 22.0) [2024-06-27 16:10:18,852][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:10:22,209][06909] Updated weights for policy 0, policy_version 29482 (0.0035) [2024-06-27 16:10:23,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 43709.5). Total num frames: 483115008. Throughput: 0: 43966.4. Samples: 386088240. Policy #0 lag: (min: 1.0, avg: 8.5, max: 22.0) [2024-06-27 16:10:23,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:10:25,243][06909] Updated weights for policy 0, policy_version 29492 (0.0037) [2024-06-27 16:10:28,850][06674] Fps is (10 sec: 39329.3, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 483328000. Throughput: 0: 43910.6. Samples: 386211340. Policy #0 lag: (min: 0.0, avg: 12.2, max: 20.0) [2024-06-27 16:10:28,851][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 16:10:29,956][06909] Updated weights for policy 0, policy_version 29502 (0.0043) [2024-06-27 16:10:32,720][06909] Updated weights for policy 0, policy_version 29512 (0.0032) [2024-06-27 16:10:33,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44236.8, 300 sec: 43820.3). Total num frames: 483590144. Throughput: 0: 43699.6. Samples: 386477680. Policy #0 lag: (min: 0.0, avg: 12.2, max: 20.0) [2024-06-27 16:10:33,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 16:10:37,268][06909] Updated weights for policy 0, policy_version 29522 (0.0037) [2024-06-27 16:10:38,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.8, 300 sec: 43653.6). Total num frames: 483770368. Throughput: 0: 43967.9. Samples: 386745360. Policy #0 lag: (min: 0.0, avg: 12.2, max: 20.0) [2024-06-27 16:10:38,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:10:40,082][06909] Updated weights for policy 0, policy_version 29532 (0.0032) [2024-06-27 16:10:43,850][06674] Fps is (10 sec: 39321.5, 60 sec: 43963.8, 300 sec: 43653.6). Total num frames: 483983360. Throughput: 0: 43882.3. Samples: 386863520. Policy #0 lag: (min: 0.0, avg: 12.2, max: 20.0) [2024-06-27 16:10:43,851][06674] Avg episode reward: [(0, '0.409')] [2024-06-27 16:10:45,142][06909] Updated weights for policy 0, policy_version 29542 (0.0033) [2024-06-27 16:10:47,651][06909] Updated weights for policy 0, policy_version 29552 (0.0043) [2024-06-27 16:10:48,852][06674] Fps is (10 sec: 47503.9, 60 sec: 43962.2, 300 sec: 43820.3). Total num frames: 484245504. Throughput: 0: 43602.6. Samples: 387126380. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-27 16:10:48,852][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:10:52,499][06909] Updated weights for policy 0, policy_version 29562 (0.0027) [2024-06-27 16:10:53,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43963.6, 300 sec: 43598.4). Total num frames: 484425728. Throughput: 0: 43794.2. Samples: 387399040. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-27 16:10:53,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:10:55,111][06909] Updated weights for policy 0, policy_version 29572 (0.0022) [2024-06-27 16:10:56,547][06887] Signal inference workers to stop experience collection... (5550 times) [2024-06-27 16:10:56,597][06909] InferenceWorker_p0-w0: stopping experience collection (5550 times) [2024-06-27 16:10:56,667][06887] Signal inference workers to resume experience collection... (5550 times) [2024-06-27 16:10:56,667][06909] InferenceWorker_p0-w0: resuming experience collection (5550 times) [2024-06-27 16:10:58,850][06674] Fps is (10 sec: 39329.4, 60 sec: 43963.7, 300 sec: 43653.6). Total num frames: 484638720. Throughput: 0: 43752.3. Samples: 387521980. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-27 16:10:58,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:10:59,814][06909] Updated weights for policy 0, policy_version 29582 (0.0033) [2024-06-27 16:11:02,495][06909] Updated weights for policy 0, policy_version 29592 (0.0038) [2024-06-27 16:11:03,850][06674] Fps is (10 sec: 45876.0, 60 sec: 43692.2, 300 sec: 43820.3). Total num frames: 484884480. Throughput: 0: 43639.4. Samples: 387784580. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-27 16:11:03,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:11:07,244][06909] Updated weights for policy 0, policy_version 29602 (0.0032) [2024-06-27 16:11:08,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 485081088. Throughput: 0: 43690.6. Samples: 388054320. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-27 16:11:08,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:11:10,079][06909] Updated weights for policy 0, policy_version 29612 (0.0024) [2024-06-27 16:11:13,850][06674] Fps is (10 sec: 40957.2, 60 sec: 43690.2, 300 sec: 43653.5). Total num frames: 485294080. Throughput: 0: 43714.1. Samples: 388178500. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-27 16:11:13,851][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:11:14,509][06909] Updated weights for policy 0, policy_version 29622 (0.0038) [2024-06-27 16:11:17,424][06909] Updated weights for policy 0, policy_version 29632 (0.0037) [2024-06-27 16:11:18,852][06674] Fps is (10 sec: 45866.1, 60 sec: 43417.6, 300 sec: 43764.4). Total num frames: 485539840. Throughput: 0: 43636.2. Samples: 388441400. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-27 16:11:18,852][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:11:22,277][06909] Updated weights for policy 0, policy_version 29642 (0.0031) [2024-06-27 16:11:23,850][06674] Fps is (10 sec: 42601.1, 60 sec: 43417.6, 300 sec: 43653.6). Total num frames: 485720064. Throughput: 0: 43639.6. Samples: 388709140. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-27 16:11:23,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:11:25,062][06909] Updated weights for policy 0, policy_version 29652 (0.0045) [2024-06-27 16:11:28,850][06674] Fps is (10 sec: 40968.2, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 485949440. Throughput: 0: 43676.8. Samples: 388828980. Policy #0 lag: (min: 0.0, avg: 12.0, max: 23.0) [2024-06-27 16:11:28,852][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 16:11:29,695][06909] Updated weights for policy 0, policy_version 29662 (0.0039) [2024-06-27 16:11:32,788][06909] Updated weights for policy 0, policy_version 29672 (0.0038) [2024-06-27 16:11:33,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43144.4, 300 sec: 43764.7). Total num frames: 486178816. Throughput: 0: 43731.7. Samples: 389094220. Policy #0 lag: (min: 0.0, avg: 12.0, max: 23.0) [2024-06-27 16:11:33,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:11:37,079][06909] Updated weights for policy 0, policy_version 29682 (0.0023) [2024-06-27 16:11:38,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 486391808. Throughput: 0: 43618.8. Samples: 389361880. Policy #0 lag: (min: 0.0, avg: 12.0, max: 23.0) [2024-06-27 16:11:38,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:11:40,164][06909] Updated weights for policy 0, policy_version 29692 (0.0020) [2024-06-27 16:11:43,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 486604800. Throughput: 0: 43648.0. Samples: 389486140. Policy #0 lag: (min: 0.0, avg: 12.0, max: 23.0) [2024-06-27 16:11:43,850][06674] Avg episode reward: [(0, '0.389')] [2024-06-27 16:11:44,770][06909] Updated weights for policy 0, policy_version 29702 (0.0032) [2024-06-27 16:11:47,507][06909] Updated weights for policy 0, policy_version 29712 (0.0040) [2024-06-27 16:11:48,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43419.1, 300 sec: 43875.8). Total num frames: 486850560. Throughput: 0: 43759.9. Samples: 389753780. Policy #0 lag: (min: 0.0, avg: 8.1, max: 22.0) [2024-06-27 16:11:48,850][06674] Avg episode reward: [(0, '0.389')] [2024-06-27 16:11:48,861][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000029715_486850560.pth... [2024-06-27 16:11:48,921][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000029074_476348416.pth [2024-06-27 16:11:52,143][06909] Updated weights for policy 0, policy_version 29722 (0.0029) [2024-06-27 16:11:53,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43417.7, 300 sec: 43653.7). Total num frames: 487030784. Throughput: 0: 43722.7. Samples: 390021840. Policy #0 lag: (min: 0.0, avg: 8.1, max: 22.0) [2024-06-27 16:11:53,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:11:55,116][06909] Updated weights for policy 0, policy_version 29732 (0.0037) [2024-06-27 16:11:58,850][06674] Fps is (10 sec: 40959.4, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 487260160. Throughput: 0: 43763.1. Samples: 390147820. Policy #0 lag: (min: 0.0, avg: 8.1, max: 22.0) [2024-06-27 16:11:58,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:11:59,463][06909] Updated weights for policy 0, policy_version 29742 (0.0031) [2024-06-27 16:12:02,716][06909] Updated weights for policy 0, policy_version 29752 (0.0042) [2024-06-27 16:12:03,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43417.6, 300 sec: 43764.7). Total num frames: 487489536. Throughput: 0: 43713.1. Samples: 390408400. Policy #0 lag: (min: 0.0, avg: 8.1, max: 22.0) [2024-06-27 16:12:03,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 16:12:06,818][06909] Updated weights for policy 0, policy_version 29762 (0.0028) [2024-06-27 16:12:08,850][06674] Fps is (10 sec: 44237.7, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 487702528. Throughput: 0: 43682.3. Samples: 390674840. Policy #0 lag: (min: 0.0, avg: 11.2, max: 20.0) [2024-06-27 16:12:08,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:12:10,149][06909] Updated weights for policy 0, policy_version 29772 (0.0039) [2024-06-27 16:12:13,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43691.1, 300 sec: 43709.2). Total num frames: 487915520. Throughput: 0: 43780.0. Samples: 390799080. Policy #0 lag: (min: 0.0, avg: 11.2, max: 20.0) [2024-06-27 16:12:13,851][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:12:14,831][06909] Updated weights for policy 0, policy_version 29782 (0.0034) [2024-06-27 16:12:17,604][06909] Updated weights for policy 0, policy_version 29792 (0.0032) [2024-06-27 16:12:18,850][06674] Fps is (10 sec: 44236.0, 60 sec: 43419.0, 300 sec: 43764.7). Total num frames: 488144896. Throughput: 0: 43658.6. Samples: 391058860. Policy #0 lag: (min: 0.0, avg: 11.2, max: 20.0) [2024-06-27 16:12:18,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 16:12:22,253][06909] Updated weights for policy 0, policy_version 29802 (0.0040) [2024-06-27 16:12:23,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 488357888. Throughput: 0: 43621.3. Samples: 391324840. Policy #0 lag: (min: 0.0, avg: 11.2, max: 20.0) [2024-06-27 16:12:23,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 16:12:25,125][06909] Updated weights for policy 0, policy_version 29812 (0.0042) [2024-06-27 16:12:26,373][06887] Signal inference workers to stop experience collection... (5600 times) [2024-06-27 16:12:26,373][06887] Signal inference workers to resume experience collection... (5600 times) [2024-06-27 16:12:26,419][06909] InferenceWorker_p0-w0: stopping experience collection (5600 times) [2024-06-27 16:12:26,419][06909] InferenceWorker_p0-w0: resuming experience collection (5600 times) [2024-06-27 16:12:28,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 488570880. Throughput: 0: 43666.7. Samples: 391451140. Policy #0 lag: (min: 0.0, avg: 11.7, max: 22.0) [2024-06-27 16:12:28,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 16:12:29,907][06909] Updated weights for policy 0, policy_version 29822 (0.0036) [2024-06-27 16:12:33,095][06909] Updated weights for policy 0, policy_version 29832 (0.0027) [2024-06-27 16:12:33,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43417.7, 300 sec: 43764.7). Total num frames: 488783872. Throughput: 0: 43438.7. Samples: 391708520. Policy #0 lag: (min: 0.0, avg: 11.7, max: 22.0) [2024-06-27 16:12:33,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 16:12:37,189][06909] Updated weights for policy 0, policy_version 29842 (0.0023) [2024-06-27 16:12:38,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 489013248. Throughput: 0: 43483.5. Samples: 391978600. Policy #0 lag: (min: 0.0, avg: 11.7, max: 22.0) [2024-06-27 16:12:38,850][06674] Avg episode reward: [(0, '0.407')] [2024-06-27 16:12:40,478][06909] Updated weights for policy 0, policy_version 29852 (0.0037) [2024-06-27 16:12:43,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.8, 300 sec: 43709.2). Total num frames: 489226240. Throughput: 0: 43542.9. Samples: 392107240. Policy #0 lag: (min: 0.0, avg: 11.7, max: 22.0) [2024-06-27 16:12:43,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 16:12:44,798][06909] Updated weights for policy 0, policy_version 29862 (0.0029) [2024-06-27 16:12:48,186][06909] Updated weights for policy 0, policy_version 29872 (0.0026) [2024-06-27 16:12:48,850][06674] Fps is (10 sec: 45875.8, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 489472000. Throughput: 0: 43641.4. Samples: 392372260. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2024-06-27 16:12:48,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:12:52,156][06909] Updated weights for policy 0, policy_version 29882 (0.0041) [2024-06-27 16:12:53,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 43709.2). Total num frames: 489668608. Throughput: 0: 43627.6. Samples: 392638080. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2024-06-27 16:12:53,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 16:12:55,469][06909] Updated weights for policy 0, policy_version 29892 (0.0032) [2024-06-27 16:12:58,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43690.8, 300 sec: 43653.6). Total num frames: 489881600. Throughput: 0: 43574.3. Samples: 392759920. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2024-06-27 16:12:58,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:12:59,769][06909] Updated weights for policy 0, policy_version 29902 (0.0027) [2024-06-27 16:13:03,010][06909] Updated weights for policy 0, policy_version 29912 (0.0025) [2024-06-27 16:13:03,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.8, 300 sec: 43764.7). Total num frames: 490127360. Throughput: 0: 43742.4. Samples: 393027260. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2024-06-27 16:13:03,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 16:13:07,112][06909] Updated weights for policy 0, policy_version 29922 (0.0036) [2024-06-27 16:13:08,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.6, 300 sec: 43709.3). Total num frames: 490323968. Throughput: 0: 43706.7. Samples: 393291640. Policy #0 lag: (min: 0.0, avg: 11.3, max: 24.0) [2024-06-27 16:13:08,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 16:13:10,305][06909] Updated weights for policy 0, policy_version 29932 (0.0029) [2024-06-27 16:13:13,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 490536960. Throughput: 0: 43792.9. Samples: 393421820. Policy #0 lag: (min: 0.0, avg: 11.3, max: 24.0) [2024-06-27 16:13:13,859][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 16:13:14,742][06909] Updated weights for policy 0, policy_version 29942 (0.0021) [2024-06-27 16:13:18,003][06909] Updated weights for policy 0, policy_version 29952 (0.0028) [2024-06-27 16:13:18,855][06674] Fps is (10 sec: 44213.9, 60 sec: 43687.0, 300 sec: 43764.0). Total num frames: 490766336. Throughput: 0: 43787.8. Samples: 393679200. Policy #0 lag: (min: 0.0, avg: 11.3, max: 24.0) [2024-06-27 16:13:18,855][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:13:22,280][06909] Updated weights for policy 0, policy_version 29962 (0.0026) [2024-06-27 16:13:23,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.7, 300 sec: 43709.5). Total num frames: 490979328. Throughput: 0: 43683.2. Samples: 393944340. Policy #0 lag: (min: 0.0, avg: 11.3, max: 24.0) [2024-06-27 16:13:23,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:13:25,585][06909] Updated weights for policy 0, policy_version 29972 (0.0036) [2024-06-27 16:13:28,850][06674] Fps is (10 sec: 40980.8, 60 sec: 43417.5, 300 sec: 43542.5). Total num frames: 491175936. Throughput: 0: 43637.2. Samples: 394070920. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-27 16:13:28,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:13:29,418][06887] Signal inference workers to stop experience collection... (5650 times) [2024-06-27 16:13:29,430][06909] InferenceWorker_p0-w0: stopping experience collection (5650 times) [2024-06-27 16:13:29,477][06887] Signal inference workers to resume experience collection... (5650 times) [2024-06-27 16:13:29,477][06909] InferenceWorker_p0-w0: resuming experience collection (5650 times) [2024-06-27 16:13:29,612][06909] Updated weights for policy 0, policy_version 29982 (0.0030) [2024-06-27 16:13:33,222][06909] Updated weights for policy 0, policy_version 29992 (0.0039) [2024-06-27 16:13:33,856][06674] Fps is (10 sec: 45846.9, 60 sec: 44232.2, 300 sec: 43763.8). Total num frames: 491438080. Throughput: 0: 43665.1. Samples: 394337460. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-27 16:13:33,857][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:13:37,123][06909] Updated weights for policy 0, policy_version 30002 (0.0036) [2024-06-27 16:13:38,851][06674] Fps is (10 sec: 47507.4, 60 sec: 43962.7, 300 sec: 43820.1). Total num frames: 491651072. Throughput: 0: 43625.7. Samples: 394601300. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-27 16:13:38,852][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 16:13:40,744][06909] Updated weights for policy 0, policy_version 30012 (0.0036) [2024-06-27 16:13:43,850][06674] Fps is (10 sec: 40985.1, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 491847680. Throughput: 0: 43671.2. Samples: 394725120. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 16:13:43,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:13:44,488][06909] Updated weights for policy 0, policy_version 30022 (0.0042) [2024-06-27 16:13:48,021][06909] Updated weights for policy 0, policy_version 30032 (0.0036) [2024-06-27 16:13:48,852][06674] Fps is (10 sec: 45872.2, 60 sec: 43962.1, 300 sec: 43875.5). Total num frames: 492109824. Throughput: 0: 43715.2. Samples: 394994540. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 16:13:48,853][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:13:48,864][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000030036_492109824.pth... [2024-06-27 16:13:48,918][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000029397_481640448.pth [2024-06-27 16:13:51,842][06909] Updated weights for policy 0, policy_version 30042 (0.0036) [2024-06-27 16:13:53,850][06674] Fps is (10 sec: 45873.1, 60 sec: 43963.4, 300 sec: 43820.2). Total num frames: 492306432. Throughput: 0: 43695.1. Samples: 395257940. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 16:13:53,851][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:13:55,511][06909] Updated weights for policy 0, policy_version 30052 (0.0031) [2024-06-27 16:13:58,850][06674] Fps is (10 sec: 39329.7, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 492503040. Throughput: 0: 43665.3. Samples: 395386760. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 16:13:58,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:13:59,184][06909] Updated weights for policy 0, policy_version 30062 (0.0029) [2024-06-27 16:14:03,099][06909] Updated weights for policy 0, policy_version 30072 (0.0030) [2024-06-27 16:14:03,850][06674] Fps is (10 sec: 44238.8, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 492748800. Throughput: 0: 43945.1. Samples: 395656500. Policy #0 lag: (min: 1.0, avg: 10.2, max: 23.0) [2024-06-27 16:14:03,854][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:14:06,786][06909] Updated weights for policy 0, policy_version 30082 (0.0026) [2024-06-27 16:14:08,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.7, 300 sec: 43820.6). Total num frames: 492961792. Throughput: 0: 43863.1. Samples: 395918180. Policy #0 lag: (min: 1.0, avg: 10.2, max: 23.0) [2024-06-27 16:14:08,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:14:10,634][06909] Updated weights for policy 0, policy_version 30092 (0.0030) [2024-06-27 16:14:13,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 493158400. Throughput: 0: 43898.8. Samples: 396046360. Policy #0 lag: (min: 1.0, avg: 10.2, max: 23.0) [2024-06-27 16:14:13,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:14:14,222][06909] Updated weights for policy 0, policy_version 30102 (0.0031) [2024-06-27 16:14:18,085][06909] Updated weights for policy 0, policy_version 30112 (0.0032) [2024-06-27 16:14:18,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43967.4, 300 sec: 43820.2). Total num frames: 493404160. Throughput: 0: 44034.3. Samples: 396318740. Policy #0 lag: (min: 1.0, avg: 10.2, max: 23.0) [2024-06-27 16:14:18,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:14:21,712][06909] Updated weights for policy 0, policy_version 30122 (0.0044) [2024-06-27 16:14:23,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43963.6, 300 sec: 43764.7). Total num frames: 493617152. Throughput: 0: 43819.5. Samples: 396573120. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 16:14:23,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:14:25,540][06909] Updated weights for policy 0, policy_version 30132 (0.0044) [2024-06-27 16:14:28,850][06674] Fps is (10 sec: 40961.0, 60 sec: 43963.9, 300 sec: 43653.7). Total num frames: 493813760. Throughput: 0: 43973.9. Samples: 396703940. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 16:14:28,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:14:29,104][06909] Updated weights for policy 0, policy_version 30142 (0.0042) [2024-06-27 16:14:33,059][06909] Updated weights for policy 0, policy_version 30152 (0.0035) [2024-06-27 16:14:33,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43695.1, 300 sec: 43764.7). Total num frames: 494059520. Throughput: 0: 43983.8. Samples: 396973720. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 16:14:33,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:14:34,091][06887] Signal inference workers to stop experience collection... (5700 times) [2024-06-27 16:14:34,130][06909] InferenceWorker_p0-w0: stopping experience collection (5700 times) [2024-06-27 16:14:34,138][06887] Signal inference workers to resume experience collection... (5700 times) [2024-06-27 16:14:34,152][06909] InferenceWorker_p0-w0: resuming experience collection (5700 times) [2024-06-27 16:14:36,470][06909] Updated weights for policy 0, policy_version 30162 (0.0036) [2024-06-27 16:14:38,852][06674] Fps is (10 sec: 45865.3, 60 sec: 43690.2, 300 sec: 43820.0). Total num frames: 494272512. Throughput: 0: 43810.0. Samples: 397229460. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 16:14:38,852][06674] Avg episode reward: [(0, '0.408')] [2024-06-27 16:14:40,427][06909] Updated weights for policy 0, policy_version 30172 (0.0032) [2024-06-27 16:14:43,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.8, 300 sec: 43653.7). Total num frames: 494485504. Throughput: 0: 43848.1. Samples: 397359920. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 16:14:43,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:14:43,891][06909] Updated weights for policy 0, policy_version 30182 (0.0024) [2024-06-27 16:14:48,284][06909] Updated weights for policy 0, policy_version 30192 (0.0032) [2024-06-27 16:14:48,850][06674] Fps is (10 sec: 44245.7, 60 sec: 43419.1, 300 sec: 43820.2). Total num frames: 494714880. Throughput: 0: 43842.6. Samples: 397629420. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 16:14:48,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:14:51,377][06909] Updated weights for policy 0, policy_version 30202 (0.0029) [2024-06-27 16:14:53,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43691.1, 300 sec: 43820.3). Total num frames: 494927872. Throughput: 0: 43650.8. Samples: 397882460. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 16:14:53,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:14:55,887][06909] Updated weights for policy 0, policy_version 30212 (0.0030) [2024-06-27 16:14:58,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.8, 300 sec: 43653.9). Total num frames: 495140864. Throughput: 0: 43561.0. Samples: 398006600. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 16:14:58,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 16:14:58,959][06909] Updated weights for policy 0, policy_version 30222 (0.0028) [2024-06-27 16:15:03,527][06909] Updated weights for policy 0, policy_version 30232 (0.0026) [2024-06-27 16:15:03,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 495353856. Throughput: 0: 43445.1. Samples: 398273760. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 16:15:03,850][06674] Avg episode reward: [(0, '0.409')] [2024-06-27 16:15:06,521][06909] Updated weights for policy 0, policy_version 30242 (0.0031) [2024-06-27 16:15:08,852][06674] Fps is (10 sec: 44227.6, 60 sec: 43689.2, 300 sec: 43764.4). Total num frames: 495583232. Throughput: 0: 43595.9. Samples: 398535020. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 16:15:08,852][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:15:10,784][06909] Updated weights for policy 0, policy_version 30252 (0.0032) [2024-06-27 16:15:13,846][06909] Updated weights for policy 0, policy_version 30262 (0.0028) [2024-06-27 16:15:13,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.9, 300 sec: 43654.0). Total num frames: 495812608. Throughput: 0: 43546.2. Samples: 398663520. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 16:15:13,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:15:18,210][06909] Updated weights for policy 0, policy_version 30272 (0.0033) [2024-06-27 16:15:18,850][06674] Fps is (10 sec: 44245.7, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 496025600. Throughput: 0: 43685.3. Samples: 398939560. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 16:15:18,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:15:21,392][06909] Updated weights for policy 0, policy_version 30282 (0.0029) [2024-06-27 16:15:23,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.8, 300 sec: 43764.7). Total num frames: 496238592. Throughput: 0: 43557.6. Samples: 399189460. Policy #0 lag: (min: 1.0, avg: 10.2, max: 21.0) [2024-06-27 16:15:23,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 16:15:25,675][06909] Updated weights for policy 0, policy_version 30292 (0.0045) [2024-06-27 16:15:28,814][06909] Updated weights for policy 0, policy_version 30302 (0.0026) [2024-06-27 16:15:28,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.7, 300 sec: 43653.6). Total num frames: 496467968. Throughput: 0: 43532.3. Samples: 399318880. Policy #0 lag: (min: 1.0, avg: 10.2, max: 21.0) [2024-06-27 16:15:28,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:15:33,234][06887] Signal inference workers to stop experience collection... (5750 times) [2024-06-27 16:15:33,236][06887] Signal inference workers to resume experience collection... (5750 times) [2024-06-27 16:15:33,249][06909] Updated weights for policy 0, policy_version 30312 (0.0041) [2024-06-27 16:15:33,280][06909] InferenceWorker_p0-w0: stopping experience collection (5750 times) [2024-06-27 16:15:33,280][06909] InferenceWorker_p0-w0: resuming experience collection (5750 times) [2024-06-27 16:15:33,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 496680960. Throughput: 0: 43683.1. Samples: 399595160. Policy #0 lag: (min: 1.0, avg: 10.2, max: 21.0) [2024-06-27 16:15:33,851][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:15:36,379][06909] Updated weights for policy 0, policy_version 30322 (0.0036) [2024-06-27 16:15:38,852][06674] Fps is (10 sec: 42590.1, 60 sec: 43690.7, 300 sec: 43764.4). Total num frames: 496893952. Throughput: 0: 43580.1. Samples: 399843660. Policy #0 lag: (min: 1.0, avg: 10.2, max: 21.0) [2024-06-27 16:15:38,852][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:15:40,869][06909] Updated weights for policy 0, policy_version 30332 (0.0032) [2024-06-27 16:15:43,827][06909] Updated weights for policy 0, policy_version 30342 (0.0025) [2024-06-27 16:15:43,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.7, 300 sec: 43654.0). Total num frames: 497123328. Throughput: 0: 43746.2. Samples: 399975180. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 16:15:43,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:15:48,242][06909] Updated weights for policy 0, policy_version 30352 (0.0030) [2024-06-27 16:15:48,850][06674] Fps is (10 sec: 40967.7, 60 sec: 43144.5, 300 sec: 43653.6). Total num frames: 497303552. Throughput: 0: 43782.0. Samples: 400243960. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 16:15:48,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 16:15:48,918][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000030354_497319936.pth... [2024-06-27 16:15:48,971][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000029715_486850560.pth [2024-06-27 16:15:51,348][06909] Updated weights for policy 0, policy_version 30362 (0.0037) [2024-06-27 16:15:53,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 497532928. Throughput: 0: 43597.6. Samples: 400496820. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 16:15:53,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:15:55,929][06909] Updated weights for policy 0, policy_version 30372 (0.0031) [2024-06-27 16:15:58,764][06909] Updated weights for policy 0, policy_version 30382 (0.0025) [2024-06-27 16:15:58,850][06674] Fps is (10 sec: 47514.3, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 497778688. Throughput: 0: 43675.5. Samples: 400628920. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 16:15:58,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:16:03,305][06909] Updated weights for policy 0, policy_version 30392 (0.0030) [2024-06-27 16:16:03,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 497975296. Throughput: 0: 43602.2. Samples: 400901660. Policy #0 lag: (min: 0.0, avg: 8.1, max: 22.0) [2024-06-27 16:16:03,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:16:06,167][06909] Updated weights for policy 0, policy_version 30402 (0.0027) [2024-06-27 16:16:08,850][06674] Fps is (10 sec: 40959.4, 60 sec: 43419.0, 300 sec: 43709.3). Total num frames: 498188288. Throughput: 0: 43766.0. Samples: 401158940. Policy #0 lag: (min: 0.0, avg: 8.1, max: 22.0) [2024-06-27 16:16:08,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:16:10,644][06909] Updated weights for policy 0, policy_version 30412 (0.0034) [2024-06-27 16:16:13,613][06909] Updated weights for policy 0, policy_version 30422 (0.0036) [2024-06-27 16:16:13,850][06674] Fps is (10 sec: 45875.9, 60 sec: 43690.7, 300 sec: 43709.5). Total num frames: 498434048. Throughput: 0: 43934.8. Samples: 401295940. Policy #0 lag: (min: 0.0, avg: 8.1, max: 22.0) [2024-06-27 16:16:13,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:16:17,860][06909] Updated weights for policy 0, policy_version 30432 (0.0023) [2024-06-27 16:16:18,850][06674] Fps is (10 sec: 45876.0, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 498647040. Throughput: 0: 43684.9. Samples: 401560980. Policy #0 lag: (min: 0.0, avg: 8.1, max: 22.0) [2024-06-27 16:16:18,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 16:16:21,090][06909] Updated weights for policy 0, policy_version 30442 (0.0031) [2024-06-27 16:16:23,850][06674] Fps is (10 sec: 42597.6, 60 sec: 43690.5, 300 sec: 43764.7). Total num frames: 498860032. Throughput: 0: 43735.6. Samples: 401811680. Policy #0 lag: (min: 1.0, avg: 11.2, max: 21.0) [2024-06-27 16:16:23,856][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 16:16:25,826][06909] Updated weights for policy 0, policy_version 30452 (0.0026) [2024-06-27 16:16:28,757][06909] Updated weights for policy 0, policy_version 30462 (0.0034) [2024-06-27 16:16:28,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 499089408. Throughput: 0: 43767.0. Samples: 401944700. Policy #0 lag: (min: 1.0, avg: 11.2, max: 21.0) [2024-06-27 16:16:28,851][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:16:33,234][06909] Updated weights for policy 0, policy_version 30472 (0.0021) [2024-06-27 16:16:33,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 499286016. Throughput: 0: 43648.6. Samples: 402208140. Policy #0 lag: (min: 1.0, avg: 11.2, max: 21.0) [2024-06-27 16:16:33,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 16:16:36,277][06909] Updated weights for policy 0, policy_version 30482 (0.0028) [2024-06-27 16:16:38,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43692.2, 300 sec: 43764.7). Total num frames: 499515392. Throughput: 0: 43752.0. Samples: 402465660. Policy #0 lag: (min: 1.0, avg: 11.2, max: 21.0) [2024-06-27 16:16:38,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:16:40,767][06909] Updated weights for policy 0, policy_version 30492 (0.0050) [2024-06-27 16:16:43,653][06909] Updated weights for policy 0, policy_version 30502 (0.0032) [2024-06-27 16:16:43,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 499744768. Throughput: 0: 43816.9. Samples: 402600680. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 16:16:43,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:16:48,097][06887] Signal inference workers to stop experience collection... (5800 times) [2024-06-27 16:16:48,097][06887] Signal inference workers to resume experience collection... (5800 times) [2024-06-27 16:16:48,143][06909] InferenceWorker_p0-w0: stopping experience collection (5800 times) [2024-06-27 16:16:48,143][06909] InferenceWorker_p0-w0: resuming experience collection (5800 times) [2024-06-27 16:16:48,229][06909] Updated weights for policy 0, policy_version 30512 (0.0040) [2024-06-27 16:16:48,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.8, 300 sec: 43764.7). Total num frames: 499941376. Throughput: 0: 43645.4. Samples: 402865700. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 16:16:48,859][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:16:51,671][06909] Updated weights for policy 0, policy_version 30523 (0.0032) [2024-06-27 16:16:53,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 500154368. Throughput: 0: 43739.2. Samples: 403127200. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 16:16:53,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:16:55,894][06909] Updated weights for policy 0, policy_version 30533 (0.0022) [2024-06-27 16:16:58,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 500400128. Throughput: 0: 43700.0. Samples: 403262440. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 16:16:58,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:16:58,969][06909] Updated weights for policy 0, policy_version 30543 (0.0044) [2024-06-27 16:17:03,324][06909] Updated weights for policy 0, policy_version 30553 (0.0036) [2024-06-27 16:17:03,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 500596736. Throughput: 0: 43644.9. Samples: 403525000. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-27 16:17:03,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:17:06,600][06909] Updated weights for policy 0, policy_version 30563 (0.0034) [2024-06-27 16:17:08,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43690.9, 300 sec: 43709.2). Total num frames: 500809728. Throughput: 0: 43859.8. Samples: 403785360. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-27 16:17:08,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:17:10,710][06909] Updated weights for policy 0, policy_version 30573 (0.0033) [2024-06-27 16:17:13,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 501055488. Throughput: 0: 43865.0. Samples: 403918620. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-27 16:17:13,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:17:14,164][06909] Updated weights for policy 0, policy_version 30583 (0.0037) [2024-06-27 16:17:18,393][06909] Updated weights for policy 0, policy_version 30593 (0.0028) [2024-06-27 16:17:18,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 501252096. Throughput: 0: 43838.7. Samples: 404180880. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-27 16:17:18,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:17:21,615][06909] Updated weights for policy 0, policy_version 30603 (0.0032) [2024-06-27 16:17:23,852][06674] Fps is (10 sec: 42589.8, 60 sec: 43689.3, 300 sec: 43764.4). Total num frames: 501481472. Throughput: 0: 43789.5. Samples: 404436280. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-27 16:17:23,852][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 16:17:25,981][06909] Updated weights for policy 0, policy_version 30613 (0.0032) [2024-06-27 16:17:28,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43417.7, 300 sec: 43764.7). Total num frames: 501694464. Throughput: 0: 43700.4. Samples: 404567200. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-27 16:17:28,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:17:29,514][06909] Updated weights for policy 0, policy_version 30623 (0.0033) [2024-06-27 16:17:33,442][06909] Updated weights for policy 0, policy_version 30633 (0.0021) [2024-06-27 16:17:33,850][06674] Fps is (10 sec: 42606.7, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 501907456. Throughput: 0: 43728.0. Samples: 404833460. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-27 16:17:33,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:17:36,847][06909] Updated weights for policy 0, policy_version 30643 (0.0030) [2024-06-27 16:17:38,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 502136832. Throughput: 0: 43729.3. Samples: 405095020. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-27 16:17:38,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:17:40,749][06909] Updated weights for policy 0, policy_version 30653 (0.0037) [2024-06-27 16:17:43,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 502366208. Throughput: 0: 43847.5. Samples: 405235580. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 16:17:43,856][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:17:44,188][06909] Updated weights for policy 0, policy_version 30663 (0.0025) [2024-06-27 16:17:48,515][06909] Updated weights for policy 0, policy_version 30673 (0.0035) [2024-06-27 16:17:48,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 502562816. Throughput: 0: 43855.2. Samples: 405498480. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 16:17:48,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:17:48,944][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000030675_502579200.pth... [2024-06-27 16:17:48,992][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000030036_492109824.pth [2024-06-27 16:17:49,458][06887] Signal inference workers to stop experience collection... (5850 times) [2024-06-27 16:17:49,502][06909] InferenceWorker_p0-w0: stopping experience collection (5850 times) [2024-06-27 16:17:49,573][06887] Signal inference workers to resume experience collection... (5850 times) [2024-06-27 16:17:49,573][06909] InferenceWorker_p0-w0: resuming experience collection (5850 times) [2024-06-27 16:17:52,201][06909] Updated weights for policy 0, policy_version 30683 (0.0026) [2024-06-27 16:17:53,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.8, 300 sec: 43764.7). Total num frames: 502792192. Throughput: 0: 43757.3. Samples: 405754440. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 16:17:53,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:17:55,975][06909] Updated weights for policy 0, policy_version 30693 (0.0031) [2024-06-27 16:17:58,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43417.6, 300 sec: 43653.6). Total num frames: 503005184. Throughput: 0: 43773.0. Samples: 405888400. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 16:17:58,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:17:59,674][06909] Updated weights for policy 0, policy_version 30703 (0.0027) [2024-06-27 16:18:03,334][06909] Updated weights for policy 0, policy_version 30713 (0.0052) [2024-06-27 16:18:03,852][06674] Fps is (10 sec: 44227.2, 60 sec: 43962.1, 300 sec: 43764.4). Total num frames: 503234560. Throughput: 0: 43797.9. Samples: 406151880. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 16:18:03,853][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:18:07,373][06909] Updated weights for policy 0, policy_version 30723 (0.0028) [2024-06-27 16:18:08,852][06674] Fps is (10 sec: 44227.4, 60 sec: 43962.1, 300 sec: 43764.4). Total num frames: 503447552. Throughput: 0: 43844.0. Samples: 406409260. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 16:18:08,852][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:18:10,758][06909] Updated weights for policy 0, policy_version 30733 (0.0030) [2024-06-27 16:18:13,850][06674] Fps is (10 sec: 42607.8, 60 sec: 43417.7, 300 sec: 43710.0). Total num frames: 503660544. Throughput: 0: 43886.3. Samples: 406542080. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 16:18:13,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:18:14,662][06909] Updated weights for policy 0, policy_version 30743 (0.0032) [2024-06-27 16:18:18,140][06909] Updated weights for policy 0, policy_version 30753 (0.0042) [2024-06-27 16:18:18,850][06674] Fps is (10 sec: 44246.0, 60 sec: 43963.7, 300 sec: 43764.7). Total num frames: 503889920. Throughput: 0: 43825.0. Samples: 406805580. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 16:18:18,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:18:22,087][06909] Updated weights for policy 0, policy_version 30763 (0.0030) [2024-06-27 16:18:23,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43692.1, 300 sec: 43820.3). Total num frames: 504102912. Throughput: 0: 43810.2. Samples: 407066480. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 16:18:23,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:18:25,732][06909] Updated weights for policy 0, policy_version 30773 (0.0041) [2024-06-27 16:18:28,850][06674] Fps is (10 sec: 42597.5, 60 sec: 43690.5, 300 sec: 43654.5). Total num frames: 504315904. Throughput: 0: 43529.1. Samples: 407194400. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 16:18:28,851][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:18:29,535][06909] Updated weights for policy 0, policy_version 30783 (0.0033) [2024-06-27 16:18:33,135][06909] Updated weights for policy 0, policy_version 30793 (0.0029) [2024-06-27 16:18:33,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 43709.4). Total num frames: 504545280. Throughput: 0: 43595.4. Samples: 407460280. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 16:18:33,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:18:36,859][06909] Updated weights for policy 0, policy_version 30803 (0.0037) [2024-06-27 16:18:38,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 504758272. Throughput: 0: 43734.1. Samples: 407722480. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 16:18:38,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:18:40,487][06909] Updated weights for policy 0, policy_version 30813 (0.0033) [2024-06-27 16:18:43,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43417.6, 300 sec: 43598.4). Total num frames: 504971264. Throughput: 0: 43682.1. Samples: 407854100. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 16:18:43,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 16:18:44,374][06909] Updated weights for policy 0, policy_version 30823 (0.0028) [2024-06-27 16:18:48,268][06909] Updated weights for policy 0, policy_version 30833 (0.0032) [2024-06-27 16:18:48,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43963.7, 300 sec: 43709.3). Total num frames: 505200640. Throughput: 0: 43688.8. Samples: 408117780. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 16:18:48,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:18:51,795][06909] Updated weights for policy 0, policy_version 30843 (0.0029) [2024-06-27 16:18:53,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43417.4, 300 sec: 43709.2). Total num frames: 505397248. Throughput: 0: 43848.5. Samples: 408382360. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 16:18:53,851][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:18:55,694][06909] Updated weights for policy 0, policy_version 30853 (0.0035) [2024-06-27 16:18:58,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 505626624. Throughput: 0: 43743.9. Samples: 408510560. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 16:18:58,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:18:59,579][06909] Updated weights for policy 0, policy_version 30863 (0.0043) [2024-06-27 16:19:03,189][06909] Updated weights for policy 0, policy_version 30873 (0.0031) [2024-06-27 16:19:03,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43419.1, 300 sec: 43653.6). Total num frames: 505839616. Throughput: 0: 43769.7. Samples: 408775220. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 16:19:03,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:19:06,977][06909] Updated weights for policy 0, policy_version 30883 (0.0025) [2024-06-27 16:19:08,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43419.1, 300 sec: 43709.2). Total num frames: 506052608. Throughput: 0: 43706.7. Samples: 409033280. Policy #0 lag: (min: 1.0, avg: 11.8, max: 23.0) [2024-06-27 16:19:08,852][06674] Avg episode reward: [(0, '0.392')] [2024-06-27 16:19:10,535][06909] Updated weights for policy 0, policy_version 30893 (0.0031) [2024-06-27 16:19:13,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.6, 300 sec: 43709.2). Total num frames: 506298368. Throughput: 0: 43642.4. Samples: 409158300. Policy #0 lag: (min: 1.0, avg: 11.8, max: 23.0) [2024-06-27 16:19:13,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:19:14,775][06887] Signal inference workers to stop experience collection... (5900 times) [2024-06-27 16:19:14,778][06909] Updated weights for policy 0, policy_version 30903 (0.0036) [2024-06-27 16:19:14,780][06887] Signal inference workers to resume experience collection... (5900 times) [2024-06-27 16:19:14,791][06909] InferenceWorker_p0-w0: stopping experience collection (5900 times) [2024-06-27 16:19:14,822][06909] InferenceWorker_p0-w0: resuming experience collection (5900 times) [2024-06-27 16:19:18,207][06909] Updated weights for policy 0, policy_version 30913 (0.0029) [2024-06-27 16:19:18,850][06674] Fps is (10 sec: 47514.0, 60 sec: 43963.8, 300 sec: 43764.7). Total num frames: 506527744. Throughput: 0: 43857.9. Samples: 409433880. Policy #0 lag: (min: 1.0, avg: 11.8, max: 23.0) [2024-06-27 16:19:18,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:19:22,167][06909] Updated weights for policy 0, policy_version 30923 (0.0039) [2024-06-27 16:19:23,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43417.5, 300 sec: 43709.1). Total num frames: 506707968. Throughput: 0: 43795.0. Samples: 409693260. Policy #0 lag: (min: 1.0, avg: 11.8, max: 23.0) [2024-06-27 16:19:23,851][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:19:25,674][06909] Updated weights for policy 0, policy_version 30933 (0.0032) [2024-06-27 16:19:28,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.9, 300 sec: 43709.2). Total num frames: 506953728. Throughput: 0: 43699.6. Samples: 409820580. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-27 16:19:28,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:19:29,448][06909] Updated weights for policy 0, policy_version 30943 (0.0045) [2024-06-27 16:19:33,174][06909] Updated weights for policy 0, policy_version 30953 (0.0031) [2024-06-27 16:19:33,850][06674] Fps is (10 sec: 45876.2, 60 sec: 43690.8, 300 sec: 43709.5). Total num frames: 507166720. Throughput: 0: 43986.2. Samples: 410097160. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-27 16:19:33,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:19:36,952][06909] Updated weights for policy 0, policy_version 30963 (0.0030) [2024-06-27 16:19:38,852][06674] Fps is (10 sec: 40951.6, 60 sec: 43416.2, 300 sec: 43653.3). Total num frames: 507363328. Throughput: 0: 43627.1. Samples: 410345660. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-27 16:19:38,852][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 16:19:40,738][06909] Updated weights for policy 0, policy_version 30973 (0.0036) [2024-06-27 16:19:43,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 43709.2). Total num frames: 507609088. Throughput: 0: 43641.9. Samples: 410474440. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-27 16:19:43,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:19:44,624][06909] Updated weights for policy 0, policy_version 30983 (0.0038) [2024-06-27 16:19:48,186][06909] Updated weights for policy 0, policy_version 30993 (0.0035) [2024-06-27 16:19:48,850][06674] Fps is (10 sec: 49161.6, 60 sec: 44236.7, 300 sec: 43820.2). Total num frames: 507854848. Throughput: 0: 43982.2. Samples: 410754420. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-27 16:19:48,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:19:48,867][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000030997_507854848.pth... [2024-06-27 16:19:48,920][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000030354_497319936.pth [2024-06-27 16:19:52,082][06909] Updated weights for policy 0, policy_version 31003 (0.0036) [2024-06-27 16:19:53,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43690.8, 300 sec: 43653.6). Total num frames: 508018688. Throughput: 0: 43744.5. Samples: 411001780. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-27 16:19:53,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:19:55,701][06909] Updated weights for policy 0, policy_version 31013 (0.0043) [2024-06-27 16:19:58,851][06674] Fps is (10 sec: 40954.9, 60 sec: 43962.8, 300 sec: 43764.5). Total num frames: 508264448. Throughput: 0: 43792.5. Samples: 411129020. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-27 16:19:58,852][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:19:59,325][06909] Updated weights for policy 0, policy_version 31023 (0.0028) [2024-06-27 16:20:02,756][06887] Signal inference workers to stop experience collection... (5950 times) [2024-06-27 16:20:02,757][06887] Signal inference workers to resume experience collection... (5950 times) [2024-06-27 16:20:02,793][06909] InferenceWorker_p0-w0: stopping experience collection (5950 times) [2024-06-27 16:20:02,794][06909] InferenceWorker_p0-w0: resuming experience collection (5950 times) [2024-06-27 16:20:03,247][06909] Updated weights for policy 0, policy_version 31033 (0.0026) [2024-06-27 16:20:03,850][06674] Fps is (10 sec: 47513.9, 60 sec: 44236.9, 300 sec: 43765.0). Total num frames: 508493824. Throughput: 0: 43947.1. Samples: 411411500. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-27 16:20:03,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:20:06,742][06909] Updated weights for policy 0, policy_version 31043 (0.0023) [2024-06-27 16:20:08,850][06674] Fps is (10 sec: 40965.6, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 508674048. Throughput: 0: 43852.6. Samples: 411666620. Policy #0 lag: (min: 0.0, avg: 12.6, max: 30.0) [2024-06-27 16:20:08,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:20:10,782][06909] Updated weights for policy 0, policy_version 31053 (0.0025) [2024-06-27 16:20:13,852][06674] Fps is (10 sec: 42589.6, 60 sec: 43689.2, 300 sec: 43708.9). Total num frames: 508919808. Throughput: 0: 43823.4. Samples: 411792720. Policy #0 lag: (min: 0.0, avg: 12.6, max: 30.0) [2024-06-27 16:20:13,852][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:20:14,069][06909] Updated weights for policy 0, policy_version 31063 (0.0034) [2024-06-27 16:20:18,158][06909] Updated weights for policy 0, policy_version 31073 (0.0031) [2024-06-27 16:20:18,850][06674] Fps is (10 sec: 47513.5, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 509149184. Throughput: 0: 43939.1. Samples: 412074420. Policy #0 lag: (min: 0.0, avg: 12.6, max: 30.0) [2024-06-27 16:20:18,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:20:21,395][06909] Updated weights for policy 0, policy_version 31083 (0.0034) [2024-06-27 16:20:23,850][06674] Fps is (10 sec: 42606.6, 60 sec: 43963.8, 300 sec: 43653.6). Total num frames: 509345792. Throughput: 0: 44049.0. Samples: 412327780. Policy #0 lag: (min: 0.0, avg: 12.6, max: 30.0) [2024-06-27 16:20:23,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:20:25,818][06909] Updated weights for policy 0, policy_version 31093 (0.0032) [2024-06-27 16:20:28,656][06909] Updated weights for policy 0, policy_version 31103 (0.0037) [2024-06-27 16:20:28,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.7, 300 sec: 43764.7). Total num frames: 509591552. Throughput: 0: 44036.7. Samples: 412456100. Policy #0 lag: (min: 0.0, avg: 7.3, max: 20.0) [2024-06-27 16:20:28,853][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:20:33,120][06909] Updated weights for policy 0, policy_version 31113 (0.0035) [2024-06-27 16:20:33,850][06674] Fps is (10 sec: 47514.0, 60 sec: 44236.8, 300 sec: 43820.6). Total num frames: 509820928. Throughput: 0: 44037.8. Samples: 412736120. Policy #0 lag: (min: 0.0, avg: 7.3, max: 20.0) [2024-06-27 16:20:33,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:20:35,850][06909] Updated weights for policy 0, policy_version 31123 (0.0033) [2024-06-27 16:20:38,850][06674] Fps is (10 sec: 39321.8, 60 sec: 43692.1, 300 sec: 43598.1). Total num frames: 509984768. Throughput: 0: 44164.4. Samples: 412989180. Policy #0 lag: (min: 0.0, avg: 7.3, max: 20.0) [2024-06-27 16:20:38,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:20:40,526][06909] Updated weights for policy 0, policy_version 31133 (0.0047) [2024-06-27 16:20:43,790][06909] Updated weights for policy 0, policy_version 31143 (0.0036) [2024-06-27 16:20:43,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 510246912. Throughput: 0: 44008.1. Samples: 413109320. Policy #0 lag: (min: 0.0, avg: 7.3, max: 20.0) [2024-06-27 16:20:43,850][06674] Avg episode reward: [(0, '0.405')] [2024-06-27 16:20:47,810][06909] Updated weights for policy 0, policy_version 31153 (0.0033) [2024-06-27 16:20:48,850][06674] Fps is (10 sec: 50790.3, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 510492672. Throughput: 0: 43899.9. Samples: 413387000. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-27 16:20:48,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:20:51,422][06909] Updated weights for policy 0, policy_version 31163 (0.0029) [2024-06-27 16:20:53,850][06674] Fps is (10 sec: 39321.5, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 510640128. Throughput: 0: 43957.3. Samples: 413644700. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-27 16:20:53,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 16:20:54,585][06887] Signal inference workers to stop experience collection... (6000 times) [2024-06-27 16:20:54,627][06909] InferenceWorker_p0-w0: stopping experience collection (6000 times) [2024-06-27 16:20:54,642][06887] Signal inference workers to resume experience collection... (6000 times) [2024-06-27 16:20:54,644][06909] InferenceWorker_p0-w0: resuming experience collection (6000 times) [2024-06-27 16:20:55,285][06909] Updated weights for policy 0, policy_version 31173 (0.0033) [2024-06-27 16:20:58,838][06909] Updated weights for policy 0, policy_version 31183 (0.0029) [2024-06-27 16:20:58,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43964.7, 300 sec: 43820.3). Total num frames: 510902272. Throughput: 0: 43853.9. Samples: 413766060. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-27 16:20:58,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:21:02,800][06909] Updated weights for policy 0, policy_version 31193 (0.0036) [2024-06-27 16:21:03,850][06674] Fps is (10 sec: 49151.8, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 511131648. Throughput: 0: 43730.6. Samples: 414042300. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-27 16:21:03,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:21:06,147][06909] Updated weights for policy 0, policy_version 31203 (0.0037) [2024-06-27 16:21:08,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43963.7, 300 sec: 43653.6). Total num frames: 511311872. Throughput: 0: 43876.6. Samples: 414302220. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 16:21:08,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:21:10,519][06909] Updated weights for policy 0, policy_version 31213 (0.0027) [2024-06-27 16:21:13,621][06909] Updated weights for policy 0, policy_version 31223 (0.0042) [2024-06-27 16:21:13,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43965.3, 300 sec: 43764.7). Total num frames: 511557632. Throughput: 0: 43711.2. Samples: 414423100. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 16:21:13,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 16:21:17,871][06909] Updated weights for policy 0, policy_version 31233 (0.0035) [2024-06-27 16:21:18,854][06674] Fps is (10 sec: 49133.3, 60 sec: 44234.0, 300 sec: 43875.3). Total num frames: 511803392. Throughput: 0: 43652.8. Samples: 414700660. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 16:21:18,854][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:21:20,877][06909] Updated weights for policy 0, policy_version 31243 (0.0036) [2024-06-27 16:21:23,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.8, 300 sec: 43653.7). Total num frames: 511967232. Throughput: 0: 43831.2. Samples: 414961580. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 16:21:23,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:21:25,318][06909] Updated weights for policy 0, policy_version 31253 (0.0036) [2024-06-27 16:21:28,582][06909] Updated weights for policy 0, policy_version 31263 (0.0033) [2024-06-27 16:21:28,850][06674] Fps is (10 sec: 40975.5, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 512212992. Throughput: 0: 43895.5. Samples: 415084620. Policy #0 lag: (min: 0.0, avg: 12.0, max: 23.0) [2024-06-27 16:21:28,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:21:32,653][06909] Updated weights for policy 0, policy_version 31273 (0.0045) [2024-06-27 16:21:33,850][06674] Fps is (10 sec: 47513.1, 60 sec: 43690.6, 300 sec: 43820.2). Total num frames: 512442368. Throughput: 0: 43848.0. Samples: 415360160. Policy #0 lag: (min: 0.0, avg: 12.0, max: 23.0) [2024-06-27 16:21:33,855][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:21:35,867][06909] Updated weights for policy 0, policy_version 31283 (0.0020) [2024-06-27 16:21:38,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43963.8, 300 sec: 43653.6). Total num frames: 512622592. Throughput: 0: 43918.6. Samples: 415621040. Policy #0 lag: (min: 0.0, avg: 12.0, max: 23.0) [2024-06-27 16:21:38,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:21:40,013][06909] Updated weights for policy 0, policy_version 31293 (0.0029) [2024-06-27 16:21:43,637][06909] Updated weights for policy 0, policy_version 31303 (0.0035) [2024-06-27 16:21:43,852][06674] Fps is (10 sec: 42590.2, 60 sec: 43689.2, 300 sec: 43820.0). Total num frames: 512868352. Throughput: 0: 43957.6. Samples: 415744240. Policy #0 lag: (min: 0.0, avg: 12.0, max: 23.0) [2024-06-27 16:21:43,852][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:21:47,525][06909] Updated weights for policy 0, policy_version 31313 (0.0027) [2024-06-27 16:21:48,853][06674] Fps is (10 sec: 49137.5, 60 sec: 43688.6, 300 sec: 43930.9). Total num frames: 513114112. Throughput: 0: 43780.7. Samples: 416012560. Policy #0 lag: (min: 1.0, avg: 9.1, max: 23.0) [2024-06-27 16:21:48,854][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:21:48,885][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000031318_513114112.pth... [2024-06-27 16:21:48,948][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000030675_502579200.pth [2024-06-27 16:21:51,114][06909] Updated weights for policy 0, policy_version 31323 (0.0032) [2024-06-27 16:21:53,850][06674] Fps is (10 sec: 40968.0, 60 sec: 43963.7, 300 sec: 43653.6). Total num frames: 513277952. Throughput: 0: 43838.6. Samples: 416274960. Policy #0 lag: (min: 1.0, avg: 9.1, max: 23.0) [2024-06-27 16:21:53,851][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:21:54,685][06887] Signal inference workers to stop experience collection... (6050 times) [2024-06-27 16:21:54,685][06887] Signal inference workers to resume experience collection... (6050 times) [2024-06-27 16:21:54,708][06909] InferenceWorker_p0-w0: stopping experience collection (6050 times) [2024-06-27 16:21:54,736][06909] InferenceWorker_p0-w0: resuming experience collection (6050 times) [2024-06-27 16:21:55,075][06909] Updated weights for policy 0, policy_version 31333 (0.0028) [2024-06-27 16:21:58,549][06909] Updated weights for policy 0, policy_version 31343 (0.0038) [2024-06-27 16:21:58,850][06674] Fps is (10 sec: 40972.2, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 513523712. Throughput: 0: 43844.4. Samples: 416396100. Policy #0 lag: (min: 1.0, avg: 9.1, max: 23.0) [2024-06-27 16:21:58,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:22:02,551][06909] Updated weights for policy 0, policy_version 31353 (0.0031) [2024-06-27 16:22:03,850][06674] Fps is (10 sec: 47513.3, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 513753088. Throughput: 0: 43818.7. Samples: 416672340. Policy #0 lag: (min: 1.0, avg: 9.1, max: 23.0) [2024-06-27 16:22:03,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:22:06,212][06909] Updated weights for policy 0, policy_version 31363 (0.0033) [2024-06-27 16:22:08,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 513933312. Throughput: 0: 43742.6. Samples: 416930000. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-27 16:22:08,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:22:09,845][06909] Updated weights for policy 0, policy_version 31373 (0.0050) [2024-06-27 16:22:13,745][06909] Updated weights for policy 0, policy_version 31383 (0.0026) [2024-06-27 16:22:13,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 514179072. Throughput: 0: 43814.7. Samples: 417056280. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-27 16:22:13,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 16:22:17,506][06909] Updated weights for policy 0, policy_version 31393 (0.0040) [2024-06-27 16:22:18,850][06674] Fps is (10 sec: 49151.9, 60 sec: 43693.4, 300 sec: 43876.1). Total num frames: 514424832. Throughput: 0: 43622.7. Samples: 417323180. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-27 16:22:18,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 16:22:21,274][06909] Updated weights for policy 0, policy_version 31403 (0.0031) [2024-06-27 16:22:23,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.8, 300 sec: 43764.7). Total num frames: 514605056. Throughput: 0: 43673.4. Samples: 417586340. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-27 16:22:23,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:22:24,888][06909] Updated weights for policy 0, policy_version 31413 (0.0028) [2024-06-27 16:22:28,681][06909] Updated weights for policy 0, policy_version 31423 (0.0038) [2024-06-27 16:22:28,852][06674] Fps is (10 sec: 40951.7, 60 sec: 43689.2, 300 sec: 43820.0). Total num frames: 514834432. Throughput: 0: 43649.8. Samples: 417708480. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 16:22:28,852][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:22:32,280][06909] Updated weights for policy 0, policy_version 31433 (0.0038) [2024-06-27 16:22:33,850][06674] Fps is (10 sec: 47513.0, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 515080192. Throughput: 0: 43651.3. Samples: 417976740. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 16:22:33,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:22:36,163][06909] Updated weights for policy 0, policy_version 31443 (0.0026) [2024-06-27 16:22:38,850][06674] Fps is (10 sec: 42606.9, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 515260416. Throughput: 0: 43753.4. Samples: 418243860. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 16:22:38,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:22:39,846][06909] Updated weights for policy 0, policy_version 31453 (0.0030) [2024-06-27 16:22:43,635][06909] Updated weights for policy 0, policy_version 31463 (0.0031) [2024-06-27 16:22:43,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43692.2, 300 sec: 43820.3). Total num frames: 515489792. Throughput: 0: 43832.4. Samples: 418368560. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 16:22:43,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:22:47,328][06909] Updated weights for policy 0, policy_version 31473 (0.0032) [2024-06-27 16:22:48,850][06674] Fps is (10 sec: 47513.6, 60 sec: 43692.8, 300 sec: 43875.8). Total num frames: 515735552. Throughput: 0: 43621.0. Samples: 418635280. Policy #0 lag: (min: 0.0, avg: 12.6, max: 22.0) [2024-06-27 16:22:48,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 16:22:51,138][06909] Updated weights for policy 0, policy_version 31483 (0.0038) [2024-06-27 16:22:53,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.8, 300 sec: 43820.2). Total num frames: 515932160. Throughput: 0: 43787.5. Samples: 418900440. Policy #0 lag: (min: 0.0, avg: 12.6, max: 22.0) [2024-06-27 16:22:53,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:22:54,827][06909] Updated weights for policy 0, policy_version 31493 (0.0027) [2024-06-27 16:22:58,531][06909] Updated weights for policy 0, policy_version 31503 (0.0041) [2024-06-27 16:22:58,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.6, 300 sec: 43765.0). Total num frames: 516145152. Throughput: 0: 43711.0. Samples: 419023280. Policy #0 lag: (min: 0.0, avg: 12.6, max: 22.0) [2024-06-27 16:22:58,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:23:02,457][06909] Updated weights for policy 0, policy_version 31513 (0.0035) [2024-06-27 16:23:03,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.8, 300 sec: 43876.1). Total num frames: 516390912. Throughput: 0: 43761.7. Samples: 419292460. Policy #0 lag: (min: 0.0, avg: 12.6, max: 22.0) [2024-06-27 16:23:03,851][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:23:06,228][06909] Updated weights for policy 0, policy_version 31523 (0.0039) [2024-06-27 16:23:08,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.8, 300 sec: 43820.2). Total num frames: 516587520. Throughput: 0: 43785.2. Samples: 419556680. Policy #0 lag: (min: 0.0, avg: 12.6, max: 22.0) [2024-06-27 16:23:08,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:23:09,767][06909] Updated weights for policy 0, policy_version 31533 (0.0032) [2024-06-27 16:23:10,346][06887] Signal inference workers to stop experience collection... (6100 times) [2024-06-27 16:23:10,347][06887] Signal inference workers to resume experience collection... (6100 times) [2024-06-27 16:23:10,395][06909] InferenceWorker_p0-w0: stopping experience collection (6100 times) [2024-06-27 16:23:10,395][06909] InferenceWorker_p0-w0: resuming experience collection (6100 times) [2024-06-27 16:23:13,477][06909] Updated weights for policy 0, policy_version 31543 (0.0034) [2024-06-27 16:23:13,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 516800512. Throughput: 0: 43849.5. Samples: 419681620. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 16:23:13,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:23:17,055][06909] Updated weights for policy 0, policy_version 31553 (0.0023) [2024-06-27 16:23:18,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 517046272. Throughput: 0: 43738.2. Samples: 419944960. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 16:23:18,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:23:20,861][06909] Updated weights for policy 0, policy_version 31563 (0.0033) [2024-06-27 16:23:23,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.7, 300 sec: 43764.8). Total num frames: 517226496. Throughput: 0: 43855.2. Samples: 420217340. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 16:23:23,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:23:24,586][06909] Updated weights for policy 0, policy_version 31573 (0.0040) [2024-06-27 16:23:28,235][06909] Updated weights for policy 0, policy_version 31583 (0.0042) [2024-06-27 16:23:28,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43692.1, 300 sec: 43764.7). Total num frames: 517455872. Throughput: 0: 43823.5. Samples: 420340620. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 16:23:28,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:23:32,079][06909] Updated weights for policy 0, policy_version 31593 (0.0034) [2024-06-27 16:23:33,850][06674] Fps is (10 sec: 47513.2, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 517701632. Throughput: 0: 43853.3. Samples: 420608680. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-27 16:23:33,854][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:23:35,858][06909] Updated weights for policy 0, policy_version 31603 (0.0041) [2024-06-27 16:23:38,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.8, 300 sec: 43820.3). Total num frames: 517898240. Throughput: 0: 43865.4. Samples: 420874380. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-27 16:23:38,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 16:23:39,686][06909] Updated weights for policy 0, policy_version 31613 (0.0032) [2024-06-27 16:23:43,161][06909] Updated weights for policy 0, policy_version 31623 (0.0025) [2024-06-27 16:23:43,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 518111232. Throughput: 0: 43828.4. Samples: 420995560. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-27 16:23:43,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:23:47,056][06909] Updated weights for policy 0, policy_version 31633 (0.0035) [2024-06-27 16:23:48,856][06674] Fps is (10 sec: 45846.9, 60 sec: 43686.2, 300 sec: 43930.4). Total num frames: 518356992. Throughput: 0: 43733.2. Samples: 421260720. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-27 16:23:48,857][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 16:23:48,871][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000031638_518356992.pth... [2024-06-27 16:23:48,920][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000030997_507854848.pth [2024-06-27 16:23:50,497][06909] Updated weights for policy 0, policy_version 31643 (0.0025) [2024-06-27 16:23:53,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43417.7, 300 sec: 43764.7). Total num frames: 518537216. Throughput: 0: 43803.2. Samples: 421527820. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 16:23:53,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:23:54,719][06909] Updated weights for policy 0, policy_version 31653 (0.0023) [2024-06-27 16:23:57,838][06909] Updated weights for policy 0, policy_version 31663 (0.0032) [2024-06-27 16:23:58,850][06674] Fps is (10 sec: 40985.2, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 518766592. Throughput: 0: 43733.8. Samples: 421649640. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 16:23:58,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:24:02,228][06909] Updated weights for policy 0, policy_version 31673 (0.0029) [2024-06-27 16:24:03,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43417.7, 300 sec: 43875.8). Total num frames: 518995968. Throughput: 0: 43695.2. Samples: 421911240. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 16:24:03,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:24:05,799][06909] Updated weights for policy 0, policy_version 31683 (0.0030) [2024-06-27 16:24:08,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 519192576. Throughput: 0: 43756.4. Samples: 422186380. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 16:24:08,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:24:09,551][06909] Updated weights for policy 0, policy_version 31693 (0.0025) [2024-06-27 16:24:13,099][06909] Updated weights for policy 0, policy_version 31703 (0.0033) [2024-06-27 16:24:13,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43963.7, 300 sec: 43764.7). Total num frames: 519438336. Throughput: 0: 43757.3. Samples: 422309700. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 16:24:13,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:24:17,142][06909] Updated weights for policy 0, policy_version 31713 (0.0045) [2024-06-27 16:24:18,850][06674] Fps is (10 sec: 47513.3, 60 sec: 43690.7, 300 sec: 43931.4). Total num frames: 519667712. Throughput: 0: 43664.9. Samples: 422573600. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 16:24:18,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 16:24:20,450][06909] Updated weights for policy 0, policy_version 31723 (0.0027) [2024-06-27 16:24:23,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.7, 300 sec: 43764.7). Total num frames: 519864320. Throughput: 0: 43608.4. Samples: 422836760. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 16:24:23,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 16:24:24,846][06909] Updated weights for policy 0, policy_version 31733 (0.0026) [2024-06-27 16:24:28,207][06909] Updated weights for policy 0, policy_version 31743 (0.0033) [2024-06-27 16:24:28,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 520077312. Throughput: 0: 43633.9. Samples: 422959080. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 16:24:28,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 16:24:32,177][06909] Updated weights for policy 0, policy_version 31753 (0.0034) [2024-06-27 16:24:33,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43690.7, 300 sec: 43931.6). Total num frames: 520323072. Throughput: 0: 43715.3. Samples: 423227640. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 16:24:33,850][06674] Avg episode reward: [(0, '0.396')] [2024-06-27 16:24:35,562][06909] Updated weights for policy 0, policy_version 31763 (0.0038) [2024-06-27 16:24:37,635][06887] Signal inference workers to stop experience collection... (6150 times) [2024-06-27 16:24:37,636][06887] Signal inference workers to resume experience collection... (6150 times) [2024-06-27 16:24:37,649][06909] InferenceWorker_p0-w0: stopping experience collection (6150 times) [2024-06-27 16:24:37,650][06909] InferenceWorker_p0-w0: resuming experience collection (6150 times) [2024-06-27 16:24:38,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 520503296. Throughput: 0: 43599.1. Samples: 423489780. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 16:24:38,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:24:40,144][06909] Updated weights for policy 0, policy_version 31773 (0.0033) [2024-06-27 16:24:42,904][06909] Updated weights for policy 0, policy_version 31783 (0.0027) [2024-06-27 16:24:43,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43690.8, 300 sec: 43653.7). Total num frames: 520732672. Throughput: 0: 43687.6. Samples: 423615580. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 16:24:43,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 16:24:47,414][06909] Updated weights for policy 0, policy_version 31793 (0.0031) [2024-06-27 16:24:48,850][06674] Fps is (10 sec: 47513.4, 60 sec: 43695.1, 300 sec: 43931.3). Total num frames: 520978432. Throughput: 0: 43911.4. Samples: 423887260. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 16:24:48,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:24:50,107][06909] Updated weights for policy 0, policy_version 31803 (0.0023) [2024-06-27 16:24:53,850][06674] Fps is (10 sec: 44236.0, 60 sec: 43963.6, 300 sec: 43764.9). Total num frames: 521175040. Throughput: 0: 43766.0. Samples: 424155860. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2024-06-27 16:24:53,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:24:54,653][06909] Updated weights for policy 0, policy_version 31813 (0.0028) [2024-06-27 16:24:57,836][06909] Updated weights for policy 0, policy_version 31823 (0.0029) [2024-06-27 16:24:58,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.7, 300 sec: 43764.7). Total num frames: 521404416. Throughput: 0: 43819.5. Samples: 424281580. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2024-06-27 16:24:58,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:25:01,906][06909] Updated weights for policy 0, policy_version 31833 (0.0031) [2024-06-27 16:25:03,850][06674] Fps is (10 sec: 45875.9, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 521633792. Throughput: 0: 43721.4. Samples: 424541060. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2024-06-27 16:25:03,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:25:05,110][06909] Updated weights for policy 0, policy_version 31843 (0.0041) [2024-06-27 16:25:08,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.6, 300 sec: 43765.0). Total num frames: 521830400. Throughput: 0: 43983.0. Samples: 424816000. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2024-06-27 16:25:08,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:25:09,466][06909] Updated weights for policy 0, policy_version 31853 (0.0051) [2024-06-27 16:25:12,508][06909] Updated weights for policy 0, policy_version 31863 (0.0031) [2024-06-27 16:25:13,850][06674] Fps is (10 sec: 42597.7, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 522059776. Throughput: 0: 43976.3. Samples: 424938020. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2024-06-27 16:25:13,851][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:25:17,083][06909] Updated weights for policy 0, policy_version 31873 (0.0024) [2024-06-27 16:25:18,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43417.6, 300 sec: 43820.3). Total num frames: 522272768. Throughput: 0: 43844.9. Samples: 425200660. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 16:25:18,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:25:20,214][06909] Updated weights for policy 0, policy_version 31883 (0.0031) [2024-06-27 16:25:23,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 522485760. Throughput: 0: 43920.4. Samples: 425466200. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 16:25:23,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:25:24,512][06909] Updated weights for policy 0, policy_version 31893 (0.0027) [2024-06-27 16:25:27,698][06909] Updated weights for policy 0, policy_version 31903 (0.0027) [2024-06-27 16:25:28,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43690.7, 300 sec: 43653.7). Total num frames: 522698752. Throughput: 0: 44003.6. Samples: 425595740. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 16:25:28,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:25:32,218][06909] Updated weights for policy 0, policy_version 31913 (0.0031) [2024-06-27 16:25:33,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 522944512. Throughput: 0: 43764.5. Samples: 425856660. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 16:25:33,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:25:35,184][06909] Updated weights for policy 0, policy_version 31923 (0.0026) [2024-06-27 16:25:38,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 523141120. Throughput: 0: 43703.7. Samples: 426122520. Policy #0 lag: (min: 0.0, avg: 12.2, max: 23.0) [2024-06-27 16:25:38,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:25:39,484][06909] Updated weights for policy 0, policy_version 31933 (0.0022) [2024-06-27 16:25:42,808][06909] Updated weights for policy 0, policy_version 31943 (0.0033) [2024-06-27 16:25:43,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 43709.2). Total num frames: 523386880. Throughput: 0: 43763.6. Samples: 426250940. Policy #0 lag: (min: 0.0, avg: 12.2, max: 23.0) [2024-06-27 16:25:43,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:25:46,861][06909] Updated weights for policy 0, policy_version 31953 (0.0021) [2024-06-27 16:25:48,850][06674] Fps is (10 sec: 45874.5, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 523599872. Throughput: 0: 43854.6. Samples: 426514520. Policy #0 lag: (min: 0.0, avg: 12.2, max: 23.0) [2024-06-27 16:25:48,853][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:25:48,990][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000031959_523616256.pth... [2024-06-27 16:25:49,051][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000031318_513114112.pth [2024-06-27 16:25:50,164][06909] Updated weights for policy 0, policy_version 31963 (0.0032) [2024-06-27 16:25:53,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.8, 300 sec: 43709.2). Total num frames: 523796480. Throughput: 0: 43760.1. Samples: 426785200. Policy #0 lag: (min: 0.0, avg: 12.2, max: 23.0) [2024-06-27 16:25:53,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:25:54,301][06909] Updated weights for policy 0, policy_version 31973 (0.0032) [2024-06-27 16:25:57,517][06909] Updated weights for policy 0, policy_version 31983 (0.0039) [2024-06-27 16:25:58,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 524025856. Throughput: 0: 43837.5. Samples: 426910700. Policy #0 lag: (min: 0.0, avg: 11.9, max: 22.0) [2024-06-27 16:25:58,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:26:01,751][06909] Updated weights for policy 0, policy_version 31993 (0.0031) [2024-06-27 16:26:03,172][06887] Signal inference workers to stop experience collection... (6200 times) [2024-06-27 16:26:03,172][06887] Signal inference workers to resume experience collection... (6200 times) [2024-06-27 16:26:03,223][06909] InferenceWorker_p0-w0: stopping experience collection (6200 times) [2024-06-27 16:26:03,223][06909] InferenceWorker_p0-w0: resuming experience collection (6200 times) [2024-06-27 16:26:03,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 524255232. Throughput: 0: 43844.1. Samples: 427173640. Policy #0 lag: (min: 0.0, avg: 11.9, max: 22.0) [2024-06-27 16:26:03,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:26:05,259][06909] Updated weights for policy 0, policy_version 32003 (0.0050) [2024-06-27 16:26:08,852][06674] Fps is (10 sec: 44227.4, 60 sec: 43962.2, 300 sec: 43764.4). Total num frames: 524468224. Throughput: 0: 43822.4. Samples: 427438300. Policy #0 lag: (min: 0.0, avg: 11.9, max: 22.0) [2024-06-27 16:26:08,853][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:26:09,730][06909] Updated weights for policy 0, policy_version 32013 (0.0031) [2024-06-27 16:26:12,833][06909] Updated weights for policy 0, policy_version 32023 (0.0030) [2024-06-27 16:26:13,850][06674] Fps is (10 sec: 44236.0, 60 sec: 43963.7, 300 sec: 43709.7). Total num frames: 524697600. Throughput: 0: 43714.4. Samples: 427562900. Policy #0 lag: (min: 0.0, avg: 11.9, max: 22.0) [2024-06-27 16:26:13,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:26:17,602][06909] Updated weights for policy 0, policy_version 32033 (0.0037) [2024-06-27 16:26:18,850][06674] Fps is (10 sec: 47523.4, 60 sec: 44509.9, 300 sec: 43986.9). Total num frames: 524943360. Throughput: 0: 43918.2. Samples: 427832980. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-27 16:26:18,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:26:20,268][06909] Updated weights for policy 0, policy_version 32043 (0.0047) [2024-06-27 16:26:23,852][06674] Fps is (10 sec: 42590.3, 60 sec: 43962.3, 300 sec: 43764.4). Total num frames: 525123584. Throughput: 0: 43659.3. Samples: 428087280. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-27 16:26:23,852][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:26:24,893][06909] Updated weights for policy 0, policy_version 32053 (0.0026) [2024-06-27 16:26:28,041][06909] Updated weights for policy 0, policy_version 32063 (0.0034) [2024-06-27 16:26:28,850][06674] Fps is (10 sec: 39321.6, 60 sec: 43963.6, 300 sec: 43709.2). Total num frames: 525336576. Throughput: 0: 43755.5. Samples: 428219940. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-27 16:26:28,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:26:32,303][06909] Updated weights for policy 0, policy_version 32073 (0.0038) [2024-06-27 16:26:33,850][06674] Fps is (10 sec: 44245.9, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 525565952. Throughput: 0: 43851.6. Samples: 428487840. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-27 16:26:33,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 16:26:35,450][06909] Updated weights for policy 0, policy_version 32083 (0.0032) [2024-06-27 16:26:38,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 43765.0). Total num frames: 525778944. Throughput: 0: 43596.9. Samples: 428747060. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 16:26:38,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:26:39,531][06909] Updated weights for policy 0, policy_version 32093 (0.0030) [2024-06-27 16:26:43,245][06909] Updated weights for policy 0, policy_version 32103 (0.0046) [2024-06-27 16:26:43,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43690.7, 300 sec: 43709.6). Total num frames: 526008320. Throughput: 0: 43658.3. Samples: 428875320. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 16:26:43,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:26:46,853][06909] Updated weights for policy 0, policy_version 32113 (0.0049) [2024-06-27 16:26:48,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 526237696. Throughput: 0: 43695.0. Samples: 429139920. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 16:26:48,856][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:26:50,682][06909] Updated weights for policy 0, policy_version 32123 (0.0041) [2024-06-27 16:26:53,850][06674] Fps is (10 sec: 42597.6, 60 sec: 43963.7, 300 sec: 43764.7). Total num frames: 526434304. Throughput: 0: 43566.4. Samples: 429398700. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 16:26:53,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 16:26:54,813][06909] Updated weights for policy 0, policy_version 32133 (0.0035) [2024-06-27 16:26:58,116][06909] Updated weights for policy 0, policy_version 32143 (0.0035) [2024-06-27 16:26:58,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 526647296. Throughput: 0: 43616.6. Samples: 429525640. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 16:26:58,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:27:02,521][06909] Updated weights for policy 0, policy_version 32153 (0.0032) [2024-06-27 16:27:03,850][06674] Fps is (10 sec: 45875.8, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 526893056. Throughput: 0: 43632.1. Samples: 429796420. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 16:27:03,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:27:05,645][06909] Updated weights for policy 0, policy_version 32163 (0.0042) [2024-06-27 16:27:08,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43692.2, 300 sec: 43764.7). Total num frames: 527089664. Throughput: 0: 43753.1. Samples: 430056080. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 16:27:08,851][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:27:09,959][06909] Updated weights for policy 0, policy_version 32173 (0.0027) [2024-06-27 16:27:13,305][06909] Updated weights for policy 0, policy_version 32183 (0.0037) [2024-06-27 16:27:13,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43417.7, 300 sec: 43653.6). Total num frames: 527302656. Throughput: 0: 43664.0. Samples: 430184820. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 16:27:13,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 16:27:17,040][06887] Signal inference workers to stop experience collection... (6250 times) [2024-06-27 16:27:17,041][06887] Signal inference workers to resume experience collection... (6250 times) [2024-06-27 16:27:17,058][06909] InferenceWorker_p0-w0: stopping experience collection (6250 times) [2024-06-27 16:27:17,058][06909] InferenceWorker_p0-w0: resuming experience collection (6250 times) [2024-06-27 16:27:17,213][06909] Updated weights for policy 0, policy_version 32193 (0.0027) [2024-06-27 16:27:18,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43144.5, 300 sec: 43820.2). Total num frames: 527532032. Throughput: 0: 43584.0. Samples: 430449120. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 16:27:18,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:27:20,873][06909] Updated weights for policy 0, policy_version 32203 (0.0034) [2024-06-27 16:27:23,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43692.1, 300 sec: 43765.0). Total num frames: 527745024. Throughput: 0: 43634.7. Samples: 430710620. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 16:27:23,853][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:27:24,489][06909] Updated weights for policy 0, policy_version 32213 (0.0023) [2024-06-27 16:27:28,268][06909] Updated weights for policy 0, policy_version 32223 (0.0039) [2024-06-27 16:27:28,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43417.7, 300 sec: 43598.1). Total num frames: 527941632. Throughput: 0: 43599.5. Samples: 430837300. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 16:27:28,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:27:31,789][06909] Updated weights for policy 0, policy_version 32233 (0.0026) [2024-06-27 16:27:33,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.6, 300 sec: 43820.3). Total num frames: 528187392. Throughput: 0: 43696.5. Samples: 431106260. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 16:27:33,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:27:35,473][06909] Updated weights for policy 0, policy_version 32243 (0.0041) [2024-06-27 16:27:38,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 528400384. Throughput: 0: 43875.6. Samples: 431373100. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 16:27:38,854][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:27:39,581][06909] Updated weights for policy 0, policy_version 32253 (0.0032) [2024-06-27 16:27:42,785][06909] Updated weights for policy 0, policy_version 32263 (0.0034) [2024-06-27 16:27:43,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43417.6, 300 sec: 43653.7). Total num frames: 528613376. Throughput: 0: 43799.6. Samples: 431496620. Policy #0 lag: (min: 0.0, avg: 12.9, max: 24.0) [2024-06-27 16:27:43,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:27:47,446][06909] Updated weights for policy 0, policy_version 32273 (0.0037) [2024-06-27 16:27:48,850][06674] Fps is (10 sec: 47514.1, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 528875520. Throughput: 0: 43970.7. Samples: 431775100. Policy #0 lag: (min: 0.0, avg: 12.9, max: 24.0) [2024-06-27 16:27:48,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 16:27:48,862][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000032280_528875520.pth... [2024-06-27 16:27:48,922][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000031638_518356992.pth [2024-06-27 16:27:50,165][06909] Updated weights for policy 0, policy_version 32283 (0.0034) [2024-06-27 16:27:53,852][06674] Fps is (10 sec: 44227.8, 60 sec: 43689.3, 300 sec: 43764.4). Total num frames: 529055744. Throughput: 0: 43787.9. Samples: 432026620. Policy #0 lag: (min: 0.0, avg: 12.9, max: 24.0) [2024-06-27 16:27:53,852][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:27:54,700][06909] Updated weights for policy 0, policy_version 32293 (0.0041) [2024-06-27 16:27:57,814][06909] Updated weights for policy 0, policy_version 32303 (0.0039) [2024-06-27 16:27:58,856][06674] Fps is (10 sec: 40934.9, 60 sec: 43959.2, 300 sec: 43708.3). Total num frames: 529285120. Throughput: 0: 43799.0. Samples: 432156040. Policy #0 lag: (min: 0.0, avg: 12.9, max: 24.0) [2024-06-27 16:27:58,856][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:28:02,090][06909] Updated weights for policy 0, policy_version 32313 (0.0029) [2024-06-27 16:28:03,850][06674] Fps is (10 sec: 45884.6, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 529514496. Throughput: 0: 43909.4. Samples: 432425040. Policy #0 lag: (min: 1.0, avg: 10.7, max: 24.0) [2024-06-27 16:28:03,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:28:05,798][06909] Updated weights for policy 0, policy_version 32323 (0.0040) [2024-06-27 16:28:08,850][06674] Fps is (10 sec: 40985.2, 60 sec: 43417.7, 300 sec: 43709.2). Total num frames: 529694720. Throughput: 0: 43877.9. Samples: 432685120. Policy #0 lag: (min: 1.0, avg: 10.7, max: 24.0) [2024-06-27 16:28:08,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:28:09,625][06909] Updated weights for policy 0, policy_version 32333 (0.0036) [2024-06-27 16:28:13,201][06909] Updated weights for policy 0, policy_version 32343 (0.0035) [2024-06-27 16:28:13,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 529924096. Throughput: 0: 43870.6. Samples: 432811480. Policy #0 lag: (min: 1.0, avg: 10.7, max: 24.0) [2024-06-27 16:28:13,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:28:17,023][06909] Updated weights for policy 0, policy_version 32353 (0.0046) [2024-06-27 16:28:18,850][06674] Fps is (10 sec: 45874.4, 60 sec: 43690.6, 300 sec: 43820.2). Total num frames: 530153472. Throughput: 0: 43835.5. Samples: 433078860. Policy #0 lag: (min: 1.0, avg: 10.7, max: 24.0) [2024-06-27 16:28:18,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:28:20,539][06909] Updated weights for policy 0, policy_version 32363 (0.0037) [2024-06-27 16:28:23,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43417.7, 300 sec: 43709.2). Total num frames: 530350080. Throughput: 0: 43604.1. Samples: 433335280. Policy #0 lag: (min: 1.0, avg: 10.7, max: 24.0) [2024-06-27 16:28:23,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:28:24,455][06909] Updated weights for policy 0, policy_version 32373 (0.0037) [2024-06-27 16:28:28,049][06909] Updated weights for policy 0, policy_version 32383 (0.0040) [2024-06-27 16:28:28,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.7, 300 sec: 43709.2). Total num frames: 530595840. Throughput: 0: 43706.5. Samples: 433463420. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 16:28:28,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:28:31,923][06909] Updated weights for policy 0, policy_version 32393 (0.0025) [2024-06-27 16:28:33,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43690.8, 300 sec: 43764.7). Total num frames: 530808832. Throughput: 0: 43546.3. Samples: 433734680. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 16:28:33,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:28:33,868][06887] Signal inference workers to stop experience collection... (6300 times) [2024-06-27 16:28:33,874][06887] Signal inference workers to resume experience collection... (6300 times) [2024-06-27 16:28:33,897][06909] InferenceWorker_p0-w0: stopping experience collection (6300 times) [2024-06-27 16:28:33,898][06909] InferenceWorker_p0-w0: resuming experience collection (6300 times) [2024-06-27 16:28:35,265][06909] Updated weights for policy 0, policy_version 32403 (0.0031) [2024-06-27 16:28:38,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 531021824. Throughput: 0: 43859.7. Samples: 434000220. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 16:28:38,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:28:39,589][06909] Updated weights for policy 0, policy_version 32413 (0.0038) [2024-06-27 16:28:42,601][06909] Updated weights for policy 0, policy_version 32423 (0.0035) [2024-06-27 16:28:43,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43963.7, 300 sec: 43710.1). Total num frames: 531251200. Throughput: 0: 43741.0. Samples: 434124120. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 16:28:43,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:28:47,083][06909] Updated weights for policy 0, policy_version 32433 (0.0036) [2024-06-27 16:28:48,850][06674] Fps is (10 sec: 42598.0, 60 sec: 42871.4, 300 sec: 43764.7). Total num frames: 531447808. Throughput: 0: 43519.0. Samples: 434383400. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-27 16:28:48,851][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:28:50,297][06909] Updated weights for policy 0, policy_version 32443 (0.0033) [2024-06-27 16:28:53,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43692.1, 300 sec: 43764.7). Total num frames: 531677184. Throughput: 0: 43523.5. Samples: 434643680. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-27 16:28:53,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:28:54,586][06909] Updated weights for policy 0, policy_version 32453 (0.0027) [2024-06-27 16:28:58,235][06909] Updated weights for policy 0, policy_version 32463 (0.0033) [2024-06-27 16:28:58,850][06674] Fps is (10 sec: 44237.7, 60 sec: 43422.1, 300 sec: 43709.2). Total num frames: 531890176. Throughput: 0: 43591.6. Samples: 434773100. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-27 16:28:58,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:29:01,960][06909] Updated weights for policy 0, policy_version 32473 (0.0023) [2024-06-27 16:29:03,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43144.5, 300 sec: 43764.7). Total num frames: 532103168. Throughput: 0: 43463.3. Samples: 435034700. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-27 16:29:03,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:29:05,578][06909] Updated weights for policy 0, policy_version 32483 (0.0036) [2024-06-27 16:29:08,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.7, 300 sec: 43653.7). Total num frames: 532316160. Throughput: 0: 43721.3. Samples: 435302740. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 16:29:08,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 16:29:09,235][06909] Updated weights for policy 0, policy_version 32493 (0.0034) [2024-06-27 16:29:12,918][06909] Updated weights for policy 0, policy_version 32503 (0.0023) [2024-06-27 16:29:13,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 532561920. Throughput: 0: 43773.8. Samples: 435433240. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 16:29:13,850][06674] Avg episode reward: [(0, '0.392')] [2024-06-27 16:29:16,870][06909] Updated weights for policy 0, policy_version 32513 (0.0036) [2024-06-27 16:29:18,855][06674] Fps is (10 sec: 44213.6, 60 sec: 43413.9, 300 sec: 43708.4). Total num frames: 532758528. Throughput: 0: 43493.1. Samples: 435692100. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 16:29:18,856][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:29:20,383][06909] Updated weights for policy 0, policy_version 32523 (0.0021) [2024-06-27 16:29:23,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.7, 300 sec: 43764.7). Total num frames: 532987904. Throughput: 0: 43593.0. Samples: 435961900. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 16:29:23,850][06674] Avg episode reward: [(0, '0.409')] [2024-06-27 16:29:24,307][06909] Updated weights for policy 0, policy_version 32533 (0.0023) [2024-06-27 16:29:27,970][06909] Updated weights for policy 0, policy_version 32543 (0.0030) [2024-06-27 16:29:28,850][06674] Fps is (10 sec: 47538.1, 60 sec: 43963.8, 300 sec: 43764.7). Total num frames: 533233664. Throughput: 0: 43655.1. Samples: 436088600. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 16:29:28,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 16:29:32,192][06909] Updated weights for policy 0, policy_version 32553 (0.0041) [2024-06-27 16:29:33,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43690.6, 300 sec: 43820.3). Total num frames: 533430272. Throughput: 0: 43779.2. Samples: 436353460. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 16:29:33,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 16:29:35,458][06909] Updated weights for policy 0, policy_version 32563 (0.0027) [2024-06-27 16:29:38,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 533643264. Throughput: 0: 43709.2. Samples: 436610600. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 16:29:38,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:29:39,610][06909] Updated weights for policy 0, policy_version 32573 (0.0022) [2024-06-27 16:29:43,091][06909] Updated weights for policy 0, policy_version 32583 (0.0030) [2024-06-27 16:29:43,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 533872640. Throughput: 0: 43771.5. Samples: 436742820. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 16:29:43,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 16:29:46,954][06909] Updated weights for policy 0, policy_version 32593 (0.0025) [2024-06-27 16:29:48,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43963.8, 300 sec: 43764.7). Total num frames: 534085632. Throughput: 0: 43693.8. Samples: 437000920. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 16:29:48,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:29:48,966][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000032599_534102016.pth... [2024-06-27 16:29:49,023][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000031959_523616256.pth [2024-06-27 16:29:50,719][06909] Updated weights for policy 0, policy_version 32603 (0.0037) [2024-06-27 16:29:53,850][06674] Fps is (10 sec: 42597.6, 60 sec: 43690.5, 300 sec: 43709.2). Total num frames: 534298624. Throughput: 0: 43607.8. Samples: 437265100. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 16:29:53,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 16:29:54,460][06909] Updated weights for policy 0, policy_version 32613 (0.0041) [2024-06-27 16:29:57,982][06909] Updated weights for policy 0, policy_version 32623 (0.0029) [2024-06-27 16:29:58,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 534511616. Throughput: 0: 43664.4. Samples: 437398140. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 16:29:58,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 16:30:02,032][06909] Updated weights for policy 0, policy_version 32633 (0.0025) [2024-06-27 16:30:03,850][06674] Fps is (10 sec: 44237.6, 60 sec: 43963.7, 300 sec: 43764.7). Total num frames: 534740992. Throughput: 0: 43799.7. Samples: 437662860. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 16:30:03,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:30:05,438][06909] Updated weights for policy 0, policy_version 32643 (0.0029) [2024-06-27 16:30:08,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43963.6, 300 sec: 43709.2). Total num frames: 534953984. Throughput: 0: 43657.1. Samples: 437926480. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 16:30:08,851][06674] Avg episode reward: [(0, '0.409')] [2024-06-27 16:30:09,350][06909] Updated weights for policy 0, policy_version 32653 (0.0042) [2024-06-27 16:30:13,102][06909] Updated weights for policy 0, policy_version 32663 (0.0043) [2024-06-27 16:30:13,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 535183360. Throughput: 0: 43706.7. Samples: 438055400. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 16:30:13,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:30:17,210][06909] Updated weights for policy 0, policy_version 32673 (0.0034) [2024-06-27 16:30:18,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43967.5, 300 sec: 43764.7). Total num frames: 535396352. Throughput: 0: 43646.1. Samples: 438317540. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 16:30:18,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 16:30:20,675][06909] Updated weights for policy 0, policy_version 32683 (0.0037) [2024-06-27 16:30:23,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 535592960. Throughput: 0: 43604.6. Samples: 438572800. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 16:30:23,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 16:30:24,657][06909] Updated weights for policy 0, policy_version 32693 (0.0032) [2024-06-27 16:30:28,146][06909] Updated weights for policy 0, policy_version 32703 (0.0029) [2024-06-27 16:30:28,856][06674] Fps is (10 sec: 42572.7, 60 sec: 43140.2, 300 sec: 43652.7). Total num frames: 535822336. Throughput: 0: 43691.8. Samples: 438709220. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 16:30:28,857][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 16:30:31,841][06909] Updated weights for policy 0, policy_version 32713 (0.0042) [2024-06-27 16:30:33,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 536051712. Throughput: 0: 43755.9. Samples: 438969940. Policy #0 lag: (min: 1.0, avg: 9.9, max: 21.0) [2024-06-27 16:30:33,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 16:30:34,253][06887] Signal inference workers to stop experience collection... (6350 times) [2024-06-27 16:30:34,293][06909] InferenceWorker_p0-w0: stopping experience collection (6350 times) [2024-06-27 16:30:34,302][06887] Signal inference workers to resume experience collection... (6350 times) [2024-06-27 16:30:34,310][06909] InferenceWorker_p0-w0: resuming experience collection (6350 times) [2024-06-27 16:30:35,838][06909] Updated weights for policy 0, policy_version 32723 (0.0040) [2024-06-27 16:30:38,852][06674] Fps is (10 sec: 44254.9, 60 sec: 43689.3, 300 sec: 43653.3). Total num frames: 536264704. Throughput: 0: 43720.8. Samples: 439232620. Policy #0 lag: (min: 1.0, avg: 9.9, max: 21.0) [2024-06-27 16:30:38,852][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:30:39,194][06909] Updated weights for policy 0, policy_version 32733 (0.0034) [2024-06-27 16:30:43,055][06909] Updated weights for policy 0, policy_version 32743 (0.0041) [2024-06-27 16:30:43,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 536494080. Throughput: 0: 43720.4. Samples: 439365560. Policy #0 lag: (min: 1.0, avg: 9.9, max: 21.0) [2024-06-27 16:30:43,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:30:46,618][06909] Updated weights for policy 0, policy_version 32753 (0.0040) [2024-06-27 16:30:48,850][06674] Fps is (10 sec: 44245.6, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 536707072. Throughput: 0: 43743.5. Samples: 439631320. Policy #0 lag: (min: 1.0, avg: 9.9, max: 21.0) [2024-06-27 16:30:48,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:30:50,580][06909] Updated weights for policy 0, policy_version 32763 (0.0046) [2024-06-27 16:30:53,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43963.9, 300 sec: 43764.7). Total num frames: 536936448. Throughput: 0: 43722.4. Samples: 439893980. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 16:30:53,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:30:53,908][06909] Updated weights for policy 0, policy_version 32773 (0.0028) [2024-06-27 16:30:58,083][06909] Updated weights for policy 0, policy_version 32783 (0.0028) [2024-06-27 16:30:58,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 537149440. Throughput: 0: 43655.9. Samples: 440019920. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 16:30:58,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:31:01,641][06909] Updated weights for policy 0, policy_version 32793 (0.0033) [2024-06-27 16:31:03,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.7, 300 sec: 43765.0). Total num frames: 537378816. Throughput: 0: 43778.7. Samples: 440287580. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 16:31:03,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:31:05,598][06909] Updated weights for policy 0, policy_version 32803 (0.0034) [2024-06-27 16:31:08,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43690.8, 300 sec: 43653.7). Total num frames: 537575424. Throughput: 0: 43894.6. Samples: 440548060. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 16:31:08,850][06674] Avg episode reward: [(0, '0.409')] [2024-06-27 16:31:09,312][06909] Updated weights for policy 0, policy_version 32813 (0.0025) [2024-06-27 16:31:13,030][06909] Updated weights for policy 0, policy_version 32823 (0.0035) [2024-06-27 16:31:13,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43417.5, 300 sec: 43542.6). Total num frames: 537788416. Throughput: 0: 43640.1. Samples: 440672760. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 16:31:13,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:31:16,744][06909] Updated weights for policy 0, policy_version 32833 (0.0020) [2024-06-27 16:31:18,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.8, 300 sec: 43709.5). Total num frames: 538017792. Throughput: 0: 43756.0. Samples: 440938960. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 16:31:18,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:31:20,322][06909] Updated weights for policy 0, policy_version 32843 (0.0028) [2024-06-27 16:31:23,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 538230784. Throughput: 0: 43752.7. Samples: 441201400. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 16:31:23,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 16:31:24,122][06909] Updated weights for policy 0, policy_version 32853 (0.0037) [2024-06-27 16:31:27,554][06909] Updated weights for policy 0, policy_version 32863 (0.0031) [2024-06-27 16:31:28,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43968.2, 300 sec: 43709.2). Total num frames: 538460160. Throughput: 0: 43766.7. Samples: 441335060. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 16:31:28,852][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:31:31,415][06909] Updated weights for policy 0, policy_version 32873 (0.0025) [2024-06-27 16:31:33,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 538673152. Throughput: 0: 43792.4. Samples: 441601980. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 16:31:33,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 16:31:35,806][06909] Updated weights for policy 0, policy_version 32883 (0.0041) [2024-06-27 16:31:38,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43965.1, 300 sec: 43709.1). Total num frames: 538902528. Throughput: 0: 43744.6. Samples: 441862500. Policy #0 lag: (min: 0.0, avg: 11.4, max: 22.0) [2024-06-27 16:31:38,851][06674] Avg episode reward: [(0, '0.393')] [2024-06-27 16:31:39,020][06909] Updated weights for policy 0, policy_version 32893 (0.0032) [2024-06-27 16:31:43,061][06909] Updated weights for policy 0, policy_version 32903 (0.0034) [2024-06-27 16:31:43,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 539115520. Throughput: 0: 43898.2. Samples: 441995340. Policy #0 lag: (min: 0.0, avg: 11.4, max: 22.0) [2024-06-27 16:31:43,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:31:46,592][06909] Updated weights for policy 0, policy_version 32913 (0.0042) [2024-06-27 16:31:48,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 539328512. Throughput: 0: 43741.3. Samples: 442255940. Policy #0 lag: (min: 0.0, avg: 11.4, max: 22.0) [2024-06-27 16:31:48,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:31:48,889][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000032919_539344896.pth... [2024-06-27 16:31:48,949][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000032280_528875520.pth [2024-06-27 16:31:50,298][06909] Updated weights for policy 0, policy_version 32923 (0.0024) [2024-06-27 16:31:53,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 539557888. Throughput: 0: 43862.2. Samples: 442521860. Policy #0 lag: (min: 0.0, avg: 11.4, max: 22.0) [2024-06-27 16:31:53,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:31:54,132][06909] Updated weights for policy 0, policy_version 32933 (0.0043) [2024-06-27 16:31:57,526][06909] Updated weights for policy 0, policy_version 32943 (0.0031) [2024-06-27 16:31:58,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43690.8, 300 sec: 43653.6). Total num frames: 539770880. Throughput: 0: 43947.2. Samples: 442650380. Policy #0 lag: (min: 1.0, avg: 10.6, max: 20.0) [2024-06-27 16:31:58,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:32:01,973][06909] Updated weights for policy 0, policy_version 32953 (0.0038) [2024-06-27 16:32:03,852][06674] Fps is (10 sec: 42590.0, 60 sec: 43416.2, 300 sec: 43708.9). Total num frames: 539983872. Throughput: 0: 43991.3. Samples: 442918660. Policy #0 lag: (min: 1.0, avg: 10.6, max: 20.0) [2024-06-27 16:32:03,852][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:32:04,844][06909] Updated weights for policy 0, policy_version 32963 (0.0031) [2024-06-27 16:32:08,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43963.7, 300 sec: 43764.7). Total num frames: 540213248. Throughput: 0: 43871.8. Samples: 443175640. Policy #0 lag: (min: 1.0, avg: 10.6, max: 20.0) [2024-06-27 16:32:08,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 16:32:09,189][06909] Updated weights for policy 0, policy_version 32973 (0.0031) [2024-06-27 16:32:11,255][06887] Signal inference workers to stop experience collection... (6400 times) [2024-06-27 16:32:11,256][06887] Signal inference workers to resume experience collection... (6400 times) [2024-06-27 16:32:11,275][06909] InferenceWorker_p0-w0: stopping experience collection (6400 times) [2024-06-27 16:32:11,275][06909] InferenceWorker_p0-w0: resuming experience collection (6400 times) [2024-06-27 16:32:12,739][06909] Updated weights for policy 0, policy_version 32983 (0.0037) [2024-06-27 16:32:13,850][06674] Fps is (10 sec: 44245.9, 60 sec: 43963.8, 300 sec: 43709.2). Total num frames: 540426240. Throughput: 0: 43936.1. Samples: 443312180. Policy #0 lag: (min: 1.0, avg: 10.6, max: 20.0) [2024-06-27 16:32:13,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 16:32:16,592][06909] Updated weights for policy 0, policy_version 32993 (0.0035) [2024-06-27 16:32:18,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43963.7, 300 sec: 43764.7). Total num frames: 540655616. Throughput: 0: 43946.3. Samples: 443579560. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2024-06-27 16:32:18,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:32:20,342][06909] Updated weights for policy 0, policy_version 33003 (0.0028) [2024-06-27 16:32:23,836][06909] Updated weights for policy 0, policy_version 33013 (0.0050) [2024-06-27 16:32:23,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44236.8, 300 sec: 43875.8). Total num frames: 540884992. Throughput: 0: 43813.1. Samples: 443834080. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2024-06-27 16:32:23,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:32:27,852][06909] Updated weights for policy 0, policy_version 33023 (0.0028) [2024-06-27 16:32:28,850][06674] Fps is (10 sec: 45874.5, 60 sec: 44236.7, 300 sec: 43820.2). Total num frames: 541114368. Throughput: 0: 43952.1. Samples: 443973180. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2024-06-27 16:32:28,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:32:31,484][06909] Updated weights for policy 0, policy_version 33033 (0.0036) [2024-06-27 16:32:33,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 541294592. Throughput: 0: 44061.8. Samples: 444238720. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2024-06-27 16:32:33,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 16:32:35,263][06909] Updated weights for policy 0, policy_version 33043 (0.0045) [2024-06-27 16:32:38,850][06674] Fps is (10 sec: 40960.9, 60 sec: 43690.9, 300 sec: 43764.7). Total num frames: 541523968. Throughput: 0: 43783.2. Samples: 444492100. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2024-06-27 16:32:38,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 16:32:38,903][06909] Updated weights for policy 0, policy_version 33053 (0.0032) [2024-06-27 16:32:42,479][06909] Updated weights for policy 0, policy_version 33063 (0.0026) [2024-06-27 16:32:43,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.8, 300 sec: 43653.6). Total num frames: 541753344. Throughput: 0: 43878.6. Samples: 444624920. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-27 16:32:43,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:32:46,466][06909] Updated weights for policy 0, policy_version 33073 (0.0036) [2024-06-27 16:32:48,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43690.7, 300 sec: 43709.5). Total num frames: 541949952. Throughput: 0: 43831.7. Samples: 444891000. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-27 16:32:48,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:32:50,109][06909] Updated weights for policy 0, policy_version 33083 (0.0042) [2024-06-27 16:32:53,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.7, 300 sec: 43710.1). Total num frames: 542179328. Throughput: 0: 43798.8. Samples: 445146580. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-27 16:32:53,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:32:53,979][06909] Updated weights for policy 0, policy_version 33093 (0.0027) [2024-06-27 16:32:57,827][06909] Updated weights for policy 0, policy_version 33103 (0.0040) [2024-06-27 16:32:58,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 542408704. Throughput: 0: 43699.0. Samples: 445278640. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-27 16:32:58,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:33:01,309][06909] Updated weights for policy 0, policy_version 33113 (0.0031) [2024-06-27 16:33:03,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43965.2, 300 sec: 43820.3). Total num frames: 542621696. Throughput: 0: 43861.3. Samples: 445553320. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-27 16:33:03,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:33:05,031][06909] Updated weights for policy 0, policy_version 33123 (0.0037) [2024-06-27 16:33:08,566][06909] Updated weights for policy 0, policy_version 33133 (0.0037) [2024-06-27 16:33:08,852][06674] Fps is (10 sec: 44228.0, 60 sec: 43962.3, 300 sec: 43820.0). Total num frames: 542851072. Throughput: 0: 43850.9. Samples: 445807460. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-27 16:33:08,852][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:33:12,343][06909] Updated weights for policy 0, policy_version 33143 (0.0040) [2024-06-27 16:33:13,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.7, 300 sec: 43764.7). Total num frames: 543064064. Throughput: 0: 43788.6. Samples: 445943660. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-27 16:33:13,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:33:16,307][06909] Updated weights for policy 0, policy_version 33153 (0.0029) [2024-06-27 16:33:18,850][06674] Fps is (10 sec: 44246.1, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 543293440. Throughput: 0: 43722.7. Samples: 446206240. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-27 16:33:18,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:33:19,726][06909] Updated weights for policy 0, policy_version 33163 (0.0035) [2024-06-27 16:33:22,455][06887] Signal inference workers to stop experience collection... (6450 times) [2024-06-27 16:33:22,458][06887] Signal inference workers to resume experience collection... (6450 times) [2024-06-27 16:33:22,488][06909] InferenceWorker_p0-w0: stopping experience collection (6450 times) [2024-06-27 16:33:22,488][06909] InferenceWorker_p0-w0: resuming experience collection (6450 times) [2024-06-27 16:33:23,852][06674] Fps is (10 sec: 42589.6, 60 sec: 43416.1, 300 sec: 43708.9). Total num frames: 543490048. Throughput: 0: 43736.6. Samples: 446460340. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-27 16:33:23,852][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:33:24,012][06909] Updated weights for policy 0, policy_version 33173 (0.0038) [2024-06-27 16:33:27,102][06909] Updated weights for policy 0, policy_version 33183 (0.0037) [2024-06-27 16:33:28,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43690.7, 300 sec: 43820.2). Total num frames: 543735808. Throughput: 0: 43861.3. Samples: 446598680. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 16:33:28,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:33:31,220][06909] Updated weights for policy 0, policy_version 33193 (0.0036) [2024-06-27 16:33:33,850][06674] Fps is (10 sec: 45884.2, 60 sec: 44236.7, 300 sec: 43820.3). Total num frames: 543948800. Throughput: 0: 43986.2. Samples: 446870380. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 16:33:33,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:33:34,891][06909] Updated weights for policy 0, policy_version 33203 (0.0024) [2024-06-27 16:33:38,477][06909] Updated weights for policy 0, policy_version 33213 (0.0026) [2024-06-27 16:33:38,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.6, 300 sec: 43764.7). Total num frames: 544161792. Throughput: 0: 43996.8. Samples: 447126440. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 16:33:38,851][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:33:42,315][06909] Updated weights for policy 0, policy_version 33223 (0.0028) [2024-06-27 16:33:43,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 544391168. Throughput: 0: 43885.4. Samples: 447253480. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 16:33:43,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:33:46,241][06909] Updated weights for policy 0, policy_version 33233 (0.0041) [2024-06-27 16:33:48,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43963.8, 300 sec: 43764.7). Total num frames: 544587776. Throughput: 0: 43625.8. Samples: 447516480. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-27 16:33:48,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 16:33:48,928][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000033240_544604160.pth... [2024-06-27 16:33:48,981][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000032599_534102016.pth [2024-06-27 16:33:49,815][06909] Updated weights for policy 0, policy_version 33243 (0.0041) [2024-06-27 16:33:53,850][06674] Fps is (10 sec: 40959.2, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 544800768. Throughput: 0: 43645.4. Samples: 447771420. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-27 16:33:53,851][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:33:53,861][06909] Updated weights for policy 0, policy_version 33253 (0.0032) [2024-06-27 16:33:57,365][06909] Updated weights for policy 0, policy_version 33263 (0.0028) [2024-06-27 16:33:58,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43690.8, 300 sec: 43820.3). Total num frames: 545030144. Throughput: 0: 43642.3. Samples: 447907560. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-27 16:33:58,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:34:01,295][06909] Updated weights for policy 0, policy_version 33273 (0.0032) [2024-06-27 16:34:03,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 545259520. Throughput: 0: 43685.7. Samples: 448172100. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-27 16:34:03,854][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:34:04,778][06909] Updated weights for policy 0, policy_version 33283 (0.0026) [2024-06-27 16:34:08,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43419.1, 300 sec: 43709.2). Total num frames: 545456128. Throughput: 0: 43746.4. Samples: 448428840. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-27 16:34:08,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:34:08,929][06909] Updated weights for policy 0, policy_version 33293 (0.0021) [2024-06-27 16:34:12,037][06909] Updated weights for policy 0, policy_version 33303 (0.0025) [2024-06-27 16:34:13,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.6, 300 sec: 43876.6). Total num frames: 545701888. Throughput: 0: 43687.9. Samples: 448564640. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-27 16:34:13,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:34:16,142][06909] Updated weights for policy 0, policy_version 33313 (0.0038) [2024-06-27 16:34:18,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43690.6, 300 sec: 43820.2). Total num frames: 545914880. Throughput: 0: 43606.3. Samples: 448832660. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-27 16:34:18,851][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:34:19,581][06909] Updated weights for policy 0, policy_version 33323 (0.0031) [2024-06-27 16:34:23,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43692.1, 300 sec: 43653.6). Total num frames: 546111488. Throughput: 0: 43683.2. Samples: 449092180. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-27 16:34:23,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 16:34:23,915][06909] Updated weights for policy 0, policy_version 33333 (0.0027) [2024-06-27 16:34:26,969][06909] Updated weights for policy 0, policy_version 33343 (0.0032) [2024-06-27 16:34:28,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43417.7, 300 sec: 43764.7). Total num frames: 546340864. Throughput: 0: 43759.1. Samples: 449222640. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-27 16:34:28,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 16:34:31,337][06909] Updated weights for policy 0, policy_version 33353 (0.0051) [2024-06-27 16:34:33,853][06674] Fps is (10 sec: 45862.0, 60 sec: 43688.6, 300 sec: 43819.8). Total num frames: 546570240. Throughput: 0: 43777.2. Samples: 449486580. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 16:34:33,853][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 16:34:34,763][06909] Updated weights for policy 0, policy_version 33363 (0.0032) [2024-06-27 16:34:38,627][06909] Updated weights for policy 0, policy_version 33373 (0.0033) [2024-06-27 16:34:38,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.8, 300 sec: 43764.7). Total num frames: 546783232. Throughput: 0: 43881.1. Samples: 449746060. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 16:34:38,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:34:42,132][06909] Updated weights for policy 0, policy_version 33383 (0.0036) [2024-06-27 16:34:43,850][06674] Fps is (10 sec: 42610.3, 60 sec: 43417.5, 300 sec: 43764.7). Total num frames: 546996224. Throughput: 0: 43752.3. Samples: 449876420. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 16:34:43,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:34:46,199][06909] Updated weights for policy 0, policy_version 33393 (0.0029) [2024-06-27 16:34:48,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 43820.3). Total num frames: 547225600. Throughput: 0: 43712.9. Samples: 450139180. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 16:34:48,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:34:49,559][06909] Updated weights for policy 0, policy_version 33403 (0.0042) [2024-06-27 16:34:53,248][06887] Signal inference workers to stop experience collection... (6500 times) [2024-06-27 16:34:53,248][06887] Signal inference workers to resume experience collection... (6500 times) [2024-06-27 16:34:53,258][06909] InferenceWorker_p0-w0: stopping experience collection (6500 times) [2024-06-27 16:34:53,258][06909] InferenceWorker_p0-w0: resuming experience collection (6500 times) [2024-06-27 16:34:53,767][06909] Updated weights for policy 0, policy_version 33413 (0.0032) [2024-06-27 16:34:53,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.8, 300 sec: 43820.3). Total num frames: 547438592. Throughput: 0: 43848.0. Samples: 450402000. Policy #0 lag: (min: 0.0, avg: 11.5, max: 25.0) [2024-06-27 16:34:53,854][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:34:56,871][06909] Updated weights for policy 0, policy_version 33423 (0.0032) [2024-06-27 16:34:58,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 547651584. Throughput: 0: 43743.2. Samples: 450533080. Policy #0 lag: (min: 0.0, avg: 11.5, max: 25.0) [2024-06-27 16:34:58,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 16:35:01,246][06909] Updated weights for policy 0, policy_version 33433 (0.0033) [2024-06-27 16:35:03,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 547897344. Throughput: 0: 43805.0. Samples: 450803880. Policy #0 lag: (min: 0.0, avg: 11.5, max: 25.0) [2024-06-27 16:35:03,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:35:04,139][06909] Updated weights for policy 0, policy_version 33443 (0.0033) [2024-06-27 16:35:08,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 548077568. Throughput: 0: 43874.3. Samples: 451066520. Policy #0 lag: (min: 0.0, avg: 11.5, max: 25.0) [2024-06-27 16:35:08,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 16:35:08,879][06909] Updated weights for policy 0, policy_version 33453 (0.0036) [2024-06-27 16:35:11,859][06909] Updated weights for policy 0, policy_version 33463 (0.0027) [2024-06-27 16:35:13,850][06674] Fps is (10 sec: 40959.3, 60 sec: 43417.6, 300 sec: 43764.7). Total num frames: 548306944. Throughput: 0: 43856.7. Samples: 451196200. Policy #0 lag: (min: 0.0, avg: 13.2, max: 24.0) [2024-06-27 16:35:13,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:35:16,150][06909] Updated weights for policy 0, policy_version 33473 (0.0033) [2024-06-27 16:35:18,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 548536320. Throughput: 0: 43845.5. Samples: 451459500. Policy #0 lag: (min: 0.0, avg: 13.2, max: 24.0) [2024-06-27 16:35:18,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:35:19,421][06909] Updated weights for policy 0, policy_version 33483 (0.0031) [2024-06-27 16:35:23,731][06909] Updated weights for policy 0, policy_version 33493 (0.0026) [2024-06-27 16:35:23,850][06674] Fps is (10 sec: 45876.1, 60 sec: 44236.9, 300 sec: 43876.7). Total num frames: 548765696. Throughput: 0: 43934.7. Samples: 451723120. Policy #0 lag: (min: 0.0, avg: 13.2, max: 24.0) [2024-06-27 16:35:23,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:35:26,776][06909] Updated weights for policy 0, policy_version 33503 (0.0035) [2024-06-27 16:35:28,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 548962304. Throughput: 0: 43886.8. Samples: 451851320. Policy #0 lag: (min: 0.0, avg: 13.2, max: 24.0) [2024-06-27 16:35:28,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:35:31,101][06909] Updated weights for policy 0, policy_version 33513 (0.0032) [2024-06-27 16:35:33,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43965.9, 300 sec: 43876.1). Total num frames: 549208064. Throughput: 0: 43873.8. Samples: 452113500. Policy #0 lag: (min: 0.0, avg: 13.2, max: 24.0) [2024-06-27 16:35:33,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:35:34,058][06909] Updated weights for policy 0, policy_version 33523 (0.0035) [2024-06-27 16:35:38,804][06909] Updated weights for policy 0, policy_version 33533 (0.0035) [2024-06-27 16:35:38,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 549404672. Throughput: 0: 44100.9. Samples: 452386540. Policy #0 lag: (min: 0.0, avg: 11.9, max: 22.0) [2024-06-27 16:35:38,851][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:35:41,731][06909] Updated weights for policy 0, policy_version 33543 (0.0026) [2024-06-27 16:35:43,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.8, 300 sec: 43820.3). Total num frames: 549634048. Throughput: 0: 43762.7. Samples: 452502400. Policy #0 lag: (min: 0.0, avg: 11.9, max: 22.0) [2024-06-27 16:35:43,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 16:35:46,425][06909] Updated weights for policy 0, policy_version 33553 (0.0027) [2024-06-27 16:35:48,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.7, 300 sec: 43820.2). Total num frames: 549863424. Throughput: 0: 43686.1. Samples: 452769760. Policy #0 lag: (min: 0.0, avg: 11.9, max: 22.0) [2024-06-27 16:35:48,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:35:48,863][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000033561_549863424.pth... [2024-06-27 16:35:48,935][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000032919_539344896.pth [2024-06-27 16:35:49,241][06909] Updated weights for policy 0, policy_version 33563 (0.0030) [2024-06-27 16:35:53,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 550043648. Throughput: 0: 43805.8. Samples: 453037780. Policy #0 lag: (min: 0.0, avg: 11.9, max: 22.0) [2024-06-27 16:35:53,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:35:53,911][06909] Updated weights for policy 0, policy_version 33573 (0.0034) [2024-06-27 16:35:56,813][06909] Updated weights for policy 0, policy_version 33583 (0.0047) [2024-06-27 16:35:58,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.7, 300 sec: 43764.7). Total num frames: 550289408. Throughput: 0: 43559.1. Samples: 453156360. Policy #0 lag: (min: 0.0, avg: 11.6, max: 21.0) [2024-06-27 16:35:58,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:36:01,529][06909] Updated weights for policy 0, policy_version 33593 (0.0030) [2024-06-27 16:36:03,850][06674] Fps is (10 sec: 47513.1, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 550518784. Throughput: 0: 43652.8. Samples: 453423880. Policy #0 lag: (min: 0.0, avg: 11.6, max: 21.0) [2024-06-27 16:36:03,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:36:04,309][06909] Updated weights for policy 0, policy_version 33603 (0.0026) [2024-06-27 16:36:08,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 550699008. Throughput: 0: 43675.0. Samples: 453688500. Policy #0 lag: (min: 0.0, avg: 11.6, max: 21.0) [2024-06-27 16:36:08,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:36:09,026][06909] Updated weights for policy 0, policy_version 33613 (0.0038) [2024-06-27 16:36:11,488][06887] Signal inference workers to stop experience collection... (6550 times) [2024-06-27 16:36:11,488][06887] Signal inference workers to resume experience collection... (6550 times) [2024-06-27 16:36:11,525][06909] InferenceWorker_p0-w0: stopping experience collection (6550 times) [2024-06-27 16:36:11,525][06909] InferenceWorker_p0-w0: resuming experience collection (6550 times) [2024-06-27 16:36:11,824][06909] Updated weights for policy 0, policy_version 33623 (0.0038) [2024-06-27 16:36:13,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.8, 300 sec: 43820.3). Total num frames: 550944768. Throughput: 0: 43572.0. Samples: 453812060. Policy #0 lag: (min: 0.0, avg: 11.6, max: 21.0) [2024-06-27 16:36:13,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 16:36:16,532][06909] Updated weights for policy 0, policy_version 33633 (0.0043) [2024-06-27 16:36:18,852][06674] Fps is (10 sec: 47504.2, 60 sec: 43962.2, 300 sec: 43875.5). Total num frames: 551174144. Throughput: 0: 43639.4. Samples: 454077360. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 16:36:18,852][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:36:19,223][06909] Updated weights for policy 0, policy_version 33643 (0.0027) [2024-06-27 16:36:23,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43144.5, 300 sec: 43709.2). Total num frames: 551354368. Throughput: 0: 43480.0. Samples: 454343140. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 16:36:23,852][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:36:24,472][06909] Updated weights for policy 0, policy_version 33653 (0.0031) [2024-06-27 16:36:26,481][06909] Updated weights for policy 0, policy_version 33663 (0.0039) [2024-06-27 16:36:28,850][06674] Fps is (10 sec: 42606.7, 60 sec: 43963.7, 300 sec: 43820.3). Total num frames: 551600128. Throughput: 0: 43574.2. Samples: 454463240. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 16:36:28,855][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:36:31,687][06909] Updated weights for policy 0, policy_version 33673 (0.0042) [2024-06-27 16:36:33,850][06674] Fps is (10 sec: 49152.1, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 551845888. Throughput: 0: 43680.9. Samples: 454735400. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 16:36:33,851][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:36:34,157][06909] Updated weights for policy 0, policy_version 33683 (0.0040) [2024-06-27 16:36:38,852][06674] Fps is (10 sec: 40951.8, 60 sec: 43416.1, 300 sec: 43708.9). Total num frames: 552009728. Throughput: 0: 43849.1. Samples: 455011080. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 16:36:38,853][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:36:39,208][06909] Updated weights for policy 0, policy_version 33693 (0.0036) [2024-06-27 16:36:41,431][06909] Updated weights for policy 0, policy_version 33703 (0.0022) [2024-06-27 16:36:43,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43690.6, 300 sec: 43820.2). Total num frames: 552255488. Throughput: 0: 43680.0. Samples: 455121960. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-27 16:36:43,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:36:46,614][06909] Updated weights for policy 0, policy_version 33713 (0.0032) [2024-06-27 16:36:48,853][06674] Fps is (10 sec: 49145.5, 60 sec: 43961.3, 300 sec: 43875.3). Total num frames: 552501248. Throughput: 0: 43878.5. Samples: 455398560. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-27 16:36:48,854][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:36:49,029][06909] Updated weights for policy 0, policy_version 33723 (0.0039) [2024-06-27 16:36:53,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 552665088. Throughput: 0: 43706.6. Samples: 455655300. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-27 16:36:53,854][06674] Avg episode reward: [(0, '0.409')] [2024-06-27 16:36:54,335][06909] Updated weights for policy 0, policy_version 33733 (0.0035) [2024-06-27 16:36:56,601][06909] Updated weights for policy 0, policy_version 33743 (0.0036) [2024-06-27 16:36:58,850][06674] Fps is (10 sec: 40973.6, 60 sec: 43690.7, 300 sec: 43820.5). Total num frames: 552910848. Throughput: 0: 43706.2. Samples: 455778840. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-27 16:36:58,851][06674] Avg episode reward: [(0, '0.408')] [2024-06-27 16:37:01,637][06909] Updated weights for policy 0, policy_version 33753 (0.0028) [2024-06-27 16:37:03,850][06674] Fps is (10 sec: 47513.8, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 553140224. Throughput: 0: 43850.8. Samples: 456050560. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2024-06-27 16:37:03,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:37:04,050][06909] Updated weights for policy 0, policy_version 33763 (0.0025) [2024-06-27 16:37:08,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 553320448. Throughput: 0: 43813.0. Samples: 456314720. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2024-06-27 16:37:08,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:37:09,084][06909] Updated weights for policy 0, policy_version 33773 (0.0035) [2024-06-27 16:37:09,944][06887] Signal inference workers to stop experience collection... (6600 times) [2024-06-27 16:37:09,946][06887] Signal inference workers to resume experience collection... (6600 times) [2024-06-27 16:37:09,994][06909] InferenceWorker_p0-w0: stopping experience collection (6600 times) [2024-06-27 16:37:09,994][06909] InferenceWorker_p0-w0: resuming experience collection (6600 times) [2024-06-27 16:37:11,576][06909] Updated weights for policy 0, policy_version 33783 (0.0038) [2024-06-27 16:37:13,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 553566208. Throughput: 0: 43776.4. Samples: 456433180. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2024-06-27 16:37:13,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:37:16,410][06909] Updated weights for policy 0, policy_version 33793 (0.0039) [2024-06-27 16:37:18,850][06674] Fps is (10 sec: 49151.2, 60 sec: 43965.1, 300 sec: 43820.2). Total num frames: 553811968. Throughput: 0: 43827.9. Samples: 456707660. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2024-06-27 16:37:18,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:37:19,154][06909] Updated weights for policy 0, policy_version 33803 (0.0027) [2024-06-27 16:37:23,850][06674] Fps is (10 sec: 40960.6, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 553975808. Throughput: 0: 43733.6. Samples: 456979000. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2024-06-27 16:37:23,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:37:23,859][06909] Updated weights for policy 0, policy_version 33813 (0.0035) [2024-06-27 16:37:26,740][06909] Updated weights for policy 0, policy_version 33823 (0.0041) [2024-06-27 16:37:28,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 554237952. Throughput: 0: 43925.9. Samples: 457098620. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2024-06-27 16:37:28,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:37:31,511][06909] Updated weights for policy 0, policy_version 33833 (0.0039) [2024-06-27 16:37:33,850][06674] Fps is (10 sec: 49152.1, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 554467328. Throughput: 0: 43809.5. Samples: 457369840. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2024-06-27 16:37:33,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:37:33,979][06909] Updated weights for policy 0, policy_version 33843 (0.0026) [2024-06-27 16:37:38,850][06674] Fps is (10 sec: 39321.4, 60 sec: 43692.1, 300 sec: 43653.6). Total num frames: 554631168. Throughput: 0: 43952.9. Samples: 457633180. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2024-06-27 16:37:38,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:37:39,038][06909] Updated weights for policy 0, policy_version 33853 (0.0026) [2024-06-27 16:37:41,543][06909] Updated weights for policy 0, policy_version 33863 (0.0038) [2024-06-27 16:37:43,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 554876928. Throughput: 0: 43870.3. Samples: 457753000. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2024-06-27 16:37:43,862][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 16:37:46,395][06909] Updated weights for policy 0, policy_version 33873 (0.0033) [2024-06-27 16:37:48,850][06674] Fps is (10 sec: 49151.6, 60 sec: 43693.0, 300 sec: 43875.8). Total num frames: 555122688. Throughput: 0: 43898.1. Samples: 458025980. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2024-06-27 16:37:48,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:37:48,869][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000033883_555139072.pth... [2024-06-27 16:37:48,872][06909] Updated weights for policy 0, policy_version 33883 (0.0030) [2024-06-27 16:37:48,915][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000033240_544604160.pth [2024-06-27 16:37:53,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 555286528. Throughput: 0: 43880.3. Samples: 458289340. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2024-06-27 16:37:53,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:37:53,990][06909] Updated weights for policy 0, policy_version 33893 (0.0043) [2024-06-27 16:37:56,206][06909] Updated weights for policy 0, policy_version 33903 (0.0030) [2024-06-27 16:37:58,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.7, 300 sec: 43820.2). Total num frames: 555548672. Throughput: 0: 43883.2. Samples: 458407920. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2024-06-27 16:37:58,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 16:38:01,462][06909] Updated weights for policy 0, policy_version 33913 (0.0037) [2024-06-27 16:38:03,850][06674] Fps is (10 sec: 49152.1, 60 sec: 43963.7, 300 sec: 43820.6). Total num frames: 555778048. Throughput: 0: 43917.4. Samples: 458683940. Policy #0 lag: (min: 0.0, avg: 7.7, max: 20.0) [2024-06-27 16:38:03,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:38:04,070][06909] Updated weights for policy 0, policy_version 33923 (0.0034) [2024-06-27 16:38:08,850][06674] Fps is (10 sec: 39321.7, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 555941888. Throughput: 0: 43779.5. Samples: 458949080. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 16:38:08,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:38:08,886][06909] Updated weights for policy 0, policy_version 33933 (0.0040) [2024-06-27 16:38:11,786][06909] Updated weights for policy 0, policy_version 33943 (0.0034) [2024-06-27 16:38:13,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.8, 300 sec: 43764.7). Total num frames: 556204032. Throughput: 0: 43768.4. Samples: 459068200. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 16:38:13,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:38:16,606][06909] Updated weights for policy 0, policy_version 33953 (0.0036) [2024-06-27 16:38:18,850][06674] Fps is (10 sec: 49152.2, 60 sec: 43690.7, 300 sec: 43876.1). Total num frames: 556433408. Throughput: 0: 43709.3. Samples: 459336760. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 16:38:18,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:38:19,075][06909] Updated weights for policy 0, policy_version 33963 (0.0032) [2024-06-27 16:38:23,850][06674] Fps is (10 sec: 39321.5, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 556597248. Throughput: 0: 43828.5. Samples: 459605460. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 16:38:23,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:38:23,953][06909] Updated weights for policy 0, policy_version 33973 (0.0038) [2024-06-27 16:38:26,734][06909] Updated weights for policy 0, policy_version 33983 (0.0037) [2024-06-27 16:38:28,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 556859392. Throughput: 0: 43871.1. Samples: 459727200. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 16:38:28,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:38:29,424][06887] Signal inference workers to stop experience collection... (6650 times) [2024-06-27 16:38:29,424][06887] Signal inference workers to resume experience collection... (6650 times) [2024-06-27 16:38:29,441][06909] InferenceWorker_p0-w0: stopping experience collection (6650 times) [2024-06-27 16:38:29,441][06909] InferenceWorker_p0-w0: resuming experience collection (6650 times) [2024-06-27 16:38:31,502][06909] Updated weights for policy 0, policy_version 33993 (0.0025) [2024-06-27 16:38:33,852][06674] Fps is (10 sec: 49141.9, 60 sec: 43689.1, 300 sec: 43820.0). Total num frames: 557088768. Throughput: 0: 43798.5. Samples: 459997000. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 16:38:33,852][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:38:34,002][06909] Updated weights for policy 0, policy_version 34003 (0.0034) [2024-06-27 16:38:38,850][06674] Fps is (10 sec: 39321.3, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 557252608. Throughput: 0: 43872.5. Samples: 460263600. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 16:38:38,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:38:38,898][06909] Updated weights for policy 0, policy_version 34013 (0.0033) [2024-06-27 16:38:41,546][06909] Updated weights for policy 0, policy_version 34023 (0.0025) [2024-06-27 16:38:43,850][06674] Fps is (10 sec: 42607.4, 60 sec: 43963.8, 300 sec: 43820.3). Total num frames: 557514752. Throughput: 0: 44017.9. Samples: 460388720. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 16:38:43,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:38:46,359][06909] Updated weights for policy 0, policy_version 34033 (0.0032) [2024-06-27 16:38:48,850][06674] Fps is (10 sec: 49152.2, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 557744128. Throughput: 0: 43794.7. Samples: 460654700. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 16:38:48,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:38:48,935][06909] Updated weights for policy 0, policy_version 34043 (0.0029) [2024-06-27 16:38:53,654][06909] Updated weights for policy 0, policy_version 34053 (0.0034) [2024-06-27 16:38:53,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43963.8, 300 sec: 43709.2). Total num frames: 557924352. Throughput: 0: 43952.1. Samples: 460926920. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 16:38:53,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:38:56,491][06909] Updated weights for policy 0, policy_version 34063 (0.0041) [2024-06-27 16:38:58,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 558170112. Throughput: 0: 44006.2. Samples: 461048480. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 16:38:58,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:39:01,225][06909] Updated weights for policy 0, policy_version 34073 (0.0049) [2024-06-27 16:39:03,850][06674] Fps is (10 sec: 47513.1, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 558399488. Throughput: 0: 43863.5. Samples: 461310620. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 16:39:03,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:39:03,934][06909] Updated weights for policy 0, policy_version 34083 (0.0036) [2024-06-27 16:39:08,539][06909] Updated weights for policy 0, policy_version 34093 (0.0025) [2024-06-27 16:39:08,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43963.7, 300 sec: 43653.6). Total num frames: 558579712. Throughput: 0: 43890.6. Samples: 461580540. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 16:39:08,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:39:11,284][06909] Updated weights for policy 0, policy_version 34103 (0.0036) [2024-06-27 16:39:13,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 558825472. Throughput: 0: 43826.6. Samples: 461699400. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 16:39:13,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:39:16,166][06909] Updated weights for policy 0, policy_version 34113 (0.0025) [2024-06-27 16:39:18,850][06674] Fps is (10 sec: 47513.7, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 559054848. Throughput: 0: 43729.5. Samples: 461964740. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 16:39:18,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:39:18,998][06909] Updated weights for policy 0, policy_version 34123 (0.0042) [2024-06-27 16:39:23,462][06909] Updated weights for policy 0, policy_version 34133 (0.0034) [2024-06-27 16:39:23,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43963.8, 300 sec: 43709.2). Total num frames: 559235072. Throughput: 0: 43674.4. Samples: 462228940. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 16:39:23,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:39:26,608][06909] Updated weights for policy 0, policy_version 34143 (0.0036) [2024-06-27 16:39:28,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43417.6, 300 sec: 43709.6). Total num frames: 559464448. Throughput: 0: 43571.1. Samples: 462349420. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 16:39:28,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:39:31,259][06909] Updated weights for policy 0, policy_version 34153 (0.0041) [2024-06-27 16:39:33,856][06674] Fps is (10 sec: 47484.4, 60 sec: 43687.8, 300 sec: 43819.4). Total num frames: 559710208. Throughput: 0: 43498.6. Samples: 462612400. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 16:39:33,856][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:39:34,583][06909] Updated weights for policy 0, policy_version 34163 (0.0044) [2024-06-27 16:39:38,756][06909] Updated weights for policy 0, policy_version 34173 (0.0027) [2024-06-27 16:39:38,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 559890432. Throughput: 0: 43395.5. Samples: 462879720. Policy #0 lag: (min: 1.0, avg: 8.2, max: 20.0) [2024-06-27 16:39:38,851][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:39:39,388][06887] Signal inference workers to stop experience collection... (6700 times) [2024-06-27 16:39:39,388][06887] Signal inference workers to resume experience collection... (6700 times) [2024-06-27 16:39:39,425][06909] InferenceWorker_p0-w0: stopping experience collection (6700 times) [2024-06-27 16:39:39,425][06909] InferenceWorker_p0-w0: resuming experience collection (6700 times) [2024-06-27 16:39:41,875][06909] Updated weights for policy 0, policy_version 34183 (0.0027) [2024-06-27 16:39:43,850][06674] Fps is (10 sec: 40985.1, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 560119808. Throughput: 0: 43374.3. Samples: 463000320. Policy #0 lag: (min: 1.0, avg: 8.2, max: 20.0) [2024-06-27 16:39:43,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:39:46,142][06909] Updated weights for policy 0, policy_version 34193 (0.0039) [2024-06-27 16:39:48,850][06674] Fps is (10 sec: 47513.4, 60 sec: 43690.6, 300 sec: 43820.2). Total num frames: 560365568. Throughput: 0: 43369.3. Samples: 463262240. Policy #0 lag: (min: 1.0, avg: 8.2, max: 20.0) [2024-06-27 16:39:48,859][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:39:48,873][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000034202_560365568.pth... [2024-06-27 16:39:48,924][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000033561_549863424.pth [2024-06-27 16:39:49,222][06909] Updated weights for policy 0, policy_version 34203 (0.0021) [2024-06-27 16:39:53,762][06909] Updated weights for policy 0, policy_version 34213 (0.0031) [2024-06-27 16:39:53,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 560545792. Throughput: 0: 43321.9. Samples: 463530020. Policy #0 lag: (min: 1.0, avg: 8.2, max: 20.0) [2024-06-27 16:39:53,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:39:56,512][06909] Updated weights for policy 0, policy_version 34223 (0.0026) [2024-06-27 16:39:58,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43417.5, 300 sec: 43653.6). Total num frames: 560775168. Throughput: 0: 43465.3. Samples: 463655340. Policy #0 lag: (min: 1.0, avg: 8.2, max: 20.0) [2024-06-27 16:39:58,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:40:01,028][06909] Updated weights for policy 0, policy_version 34233 (0.0032) [2024-06-27 16:40:03,850][06674] Fps is (10 sec: 47512.9, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 561020928. Throughput: 0: 43476.9. Samples: 463921200. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-27 16:40:03,851][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:40:03,912][06909] Updated weights for policy 0, policy_version 34243 (0.0028) [2024-06-27 16:40:08,688][06909] Updated weights for policy 0, policy_version 34253 (0.0023) [2024-06-27 16:40:08,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43690.8, 300 sec: 43709.2). Total num frames: 561201152. Throughput: 0: 43528.8. Samples: 464187740. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-27 16:40:08,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:40:11,530][06909] Updated weights for policy 0, policy_version 34263 (0.0035) [2024-06-27 16:40:13,850][06674] Fps is (10 sec: 40960.9, 60 sec: 43417.7, 300 sec: 43709.2). Total num frames: 561430528. Throughput: 0: 43549.8. Samples: 464309160. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-27 16:40:13,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:40:15,999][06909] Updated weights for policy 0, policy_version 34273 (0.0031) [2024-06-27 16:40:18,814][06909] Updated weights for policy 0, policy_version 34283 (0.0029) [2024-06-27 16:40:18,850][06674] Fps is (10 sec: 49151.8, 60 sec: 43963.8, 300 sec: 43820.2). Total num frames: 561692672. Throughput: 0: 43713.4. Samples: 464579240. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-27 16:40:18,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:40:23,551][06909] Updated weights for policy 0, policy_version 34293 (0.0029) [2024-06-27 16:40:23,853][06674] Fps is (10 sec: 44223.8, 60 sec: 43961.6, 300 sec: 43764.3). Total num frames: 561872896. Throughput: 0: 43822.6. Samples: 464851860. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-27 16:40:23,853][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:40:26,559][06909] Updated weights for policy 0, policy_version 34303 (0.0036) [2024-06-27 16:40:28,850][06674] Fps is (10 sec: 39321.2, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 562085888. Throughput: 0: 43817.6. Samples: 464972120. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-27 16:40:28,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 16:40:31,031][06909] Updated weights for policy 0, policy_version 34313 (0.0044) [2024-06-27 16:40:33,850][06674] Fps is (10 sec: 45888.2, 60 sec: 43695.1, 300 sec: 43820.3). Total num frames: 562331648. Throughput: 0: 43933.4. Samples: 465239240. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-27 16:40:33,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:40:33,970][06909] Updated weights for policy 0, policy_version 34323 (0.0030) [2024-06-27 16:40:38,405][06909] Updated weights for policy 0, policy_version 34333 (0.0041) [2024-06-27 16:40:38,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 562528256. Throughput: 0: 44011.9. Samples: 465510560. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-27 16:40:38,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:40:41,213][06909] Updated weights for policy 0, policy_version 34343 (0.0030) [2024-06-27 16:40:43,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 562757632. Throughput: 0: 43884.9. Samples: 465630160. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-27 16:40:43,851][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:40:46,103][06909] Updated weights for policy 0, policy_version 34353 (0.0026) [2024-06-27 16:40:47,829][06887] Signal inference workers to stop experience collection... (6750 times) [2024-06-27 16:40:47,836][06887] Signal inference workers to resume experience collection... (6750 times) [2024-06-27 16:40:47,848][06909] InferenceWorker_p0-w0: stopping experience collection (6750 times) [2024-06-27 16:40:47,848][06909] InferenceWorker_p0-w0: resuming experience collection (6750 times) [2024-06-27 16:40:48,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43690.8, 300 sec: 43875.8). Total num frames: 562987008. Throughput: 0: 43899.3. Samples: 465896660. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 16:40:48,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:40:48,930][06909] Updated weights for policy 0, policy_version 34363 (0.0027) [2024-06-27 16:40:53,583][06909] Updated weights for policy 0, policy_version 34373 (0.0028) [2024-06-27 16:40:53,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 563183616. Throughput: 0: 43886.7. Samples: 466162640. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 16:40:53,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:40:56,399][06909] Updated weights for policy 0, policy_version 34383 (0.0036) [2024-06-27 16:40:58,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 563396608. Throughput: 0: 43871.9. Samples: 466283400. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 16:40:58,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:41:01,337][06909] Updated weights for policy 0, policy_version 34393 (0.0029) [2024-06-27 16:41:03,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43690.8, 300 sec: 43875.8). Total num frames: 563642368. Throughput: 0: 43779.2. Samples: 466549300. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 16:41:03,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:41:03,972][06909] Updated weights for policy 0, policy_version 34403 (0.0029) [2024-06-27 16:41:08,735][06909] Updated weights for policy 0, policy_version 34413 (0.0035) [2024-06-27 16:41:08,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 563822592. Throughput: 0: 43666.7. Samples: 466816740. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-27 16:41:08,854][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:41:11,515][06909] Updated weights for policy 0, policy_version 34423 (0.0030) [2024-06-27 16:41:13,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43963.6, 300 sec: 43709.5). Total num frames: 564068352. Throughput: 0: 43655.1. Samples: 466936600. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-27 16:41:13,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:41:16,083][06909] Updated weights for policy 0, policy_version 34433 (0.0032) [2024-06-27 16:41:18,850][06674] Fps is (10 sec: 47513.5, 60 sec: 43417.6, 300 sec: 43875.8). Total num frames: 564297728. Throughput: 0: 43610.2. Samples: 467201700. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-27 16:41:18,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:41:18,874][06909] Updated weights for policy 0, policy_version 34443 (0.0030) [2024-06-27 16:41:23,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43419.6, 300 sec: 43653.6). Total num frames: 564477952. Throughput: 0: 43533.4. Samples: 467469560. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-27 16:41:23,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:41:23,864][06909] Updated weights for policy 0, policy_version 34453 (0.0035) [2024-06-27 16:41:26,332][06909] Updated weights for policy 0, policy_version 34463 (0.0040) [2024-06-27 16:41:28,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 564707328. Throughput: 0: 43547.5. Samples: 467589800. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-27 16:41:28,851][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:41:31,142][06909] Updated weights for policy 0, policy_version 34473 (0.0033) [2024-06-27 16:41:33,622][06909] Updated weights for policy 0, policy_version 34483 (0.0035) [2024-06-27 16:41:33,850][06674] Fps is (10 sec: 49151.9, 60 sec: 43963.7, 300 sec: 43931.6). Total num frames: 564969472. Throughput: 0: 43763.0. Samples: 467866000. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 16:41:33,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:41:38,419][06909] Updated weights for policy 0, policy_version 34493 (0.0034) [2024-06-27 16:41:38,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43417.6, 300 sec: 43653.6). Total num frames: 565133312. Throughput: 0: 43666.6. Samples: 468127640. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 16:41:38,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:41:41,248][06909] Updated weights for policy 0, policy_version 34503 (0.0045) [2024-06-27 16:41:43,850][06674] Fps is (10 sec: 39321.1, 60 sec: 43417.5, 300 sec: 43598.6). Total num frames: 565362688. Throughput: 0: 43623.4. Samples: 468246460. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 16:41:43,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:41:46,152][06909] Updated weights for policy 0, policy_version 34513 (0.0034) [2024-06-27 16:41:46,168][06887] Signal inference workers to stop experience collection... (6800 times) [2024-06-27 16:41:46,169][06887] Signal inference workers to resume experience collection... (6800 times) [2024-06-27 16:41:46,191][06909] InferenceWorker_p0-w0: stopping experience collection (6800 times) [2024-06-27 16:41:46,192][06909] InferenceWorker_p0-w0: resuming experience collection (6800 times) [2024-06-27 16:41:48,850][06674] Fps is (10 sec: 47513.1, 60 sec: 43690.5, 300 sec: 43875.8). Total num frames: 565608448. Throughput: 0: 43644.2. Samples: 468513300. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 16:41:48,851][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:41:48,982][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000034523_565624832.pth... [2024-06-27 16:41:48,989][06909] Updated weights for policy 0, policy_version 34523 (0.0024) [2024-06-27 16:41:49,031][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000033883_555139072.pth [2024-06-27 16:41:53,440][06909] Updated weights for policy 0, policy_version 34533 (0.0041) [2024-06-27 16:41:53,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 565805056. Throughput: 0: 43628.0. Samples: 468780000. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 16:41:53,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:41:56,275][06909] Updated weights for policy 0, policy_version 34543 (0.0028) [2024-06-27 16:41:58,850][06674] Fps is (10 sec: 40960.8, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 566018048. Throughput: 0: 43650.3. Samples: 468900860. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 16:41:58,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:42:00,790][06909] Updated weights for policy 0, policy_version 34553 (0.0040) [2024-06-27 16:42:03,697][06909] Updated weights for policy 0, policy_version 34563 (0.0028) [2024-06-27 16:42:03,850][06674] Fps is (10 sec: 47513.2, 60 sec: 43963.6, 300 sec: 43931.3). Total num frames: 566280192. Throughput: 0: 43831.9. Samples: 469174140. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 16:42:03,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:42:08,329][06909] Updated weights for policy 0, policy_version 34573 (0.0036) [2024-06-27 16:42:08,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.8, 300 sec: 43709.2). Total num frames: 566460416. Throughput: 0: 43840.9. Samples: 469442400. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 16:42:08,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 16:42:11,153][06909] Updated weights for policy 0, policy_version 34583 (0.0044) [2024-06-27 16:42:13,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 566689792. Throughput: 0: 43778.3. Samples: 469559820. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 16:42:13,851][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:42:15,745][06909] Updated weights for policy 0, policy_version 34593 (0.0033) [2024-06-27 16:42:18,662][06909] Updated weights for policy 0, policy_version 34603 (0.0033) [2024-06-27 16:42:18,850][06674] Fps is (10 sec: 47512.9, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 566935552. Throughput: 0: 43654.6. Samples: 469830460. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-27 16:42:18,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 16:42:23,261][06909] Updated weights for policy 0, policy_version 34613 (0.0039) [2024-06-27 16:42:23,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.7, 300 sec: 43653.6). Total num frames: 567115776. Throughput: 0: 43639.2. Samples: 470091400. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-27 16:42:23,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 16:42:26,516][06909] Updated weights for policy 0, policy_version 34623 (0.0026) [2024-06-27 16:42:28,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43963.8, 300 sec: 43653.6). Total num frames: 567345152. Throughput: 0: 43741.5. Samples: 470214820. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-27 16:42:28,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:42:30,919][06909] Updated weights for policy 0, policy_version 34633 (0.0033) [2024-06-27 16:42:33,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43417.6, 300 sec: 43875.8). Total num frames: 567574528. Throughput: 0: 43603.2. Samples: 470475440. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-27 16:42:33,852][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:42:34,309][06909] Updated weights for policy 0, policy_version 34643 (0.0032) [2024-06-27 16:42:38,385][06909] Updated weights for policy 0, policy_version 34653 (0.0035) [2024-06-27 16:42:38,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 567771136. Throughput: 0: 43512.8. Samples: 470738080. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2024-06-27 16:42:38,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:42:41,786][06909] Updated weights for policy 0, policy_version 34663 (0.0037) [2024-06-27 16:42:43,850][06674] Fps is (10 sec: 40960.7, 60 sec: 43690.8, 300 sec: 43598.1). Total num frames: 567984128. Throughput: 0: 43677.8. Samples: 470866360. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2024-06-27 16:42:43,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 16:42:45,969][06909] Updated weights for policy 0, policy_version 34673 (0.0032) [2024-06-27 16:42:48,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 568229888. Throughput: 0: 43494.7. Samples: 471131400. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2024-06-27 16:42:48,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 16:42:49,040][06909] Updated weights for policy 0, policy_version 34683 (0.0032) [2024-06-27 16:42:53,547][06909] Updated weights for policy 0, policy_version 34693 (0.0038) [2024-06-27 16:42:53,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.7, 300 sec: 43653.7). Total num frames: 568426496. Throughput: 0: 43411.1. Samples: 471395900. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2024-06-27 16:42:53,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 16:42:56,789][06909] Updated weights for policy 0, policy_version 34703 (0.0029) [2024-06-27 16:42:58,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 43653.6). Total num frames: 568655872. Throughput: 0: 43564.9. Samples: 471520240. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2024-06-27 16:42:58,855][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:43:00,848][06909] Updated weights for policy 0, policy_version 34713 (0.0035) [2024-06-27 16:43:02,449][06887] Signal inference workers to stop experience collection... (6850 times) [2024-06-27 16:43:02,506][06887] Signal inference workers to resume experience collection... (6850 times) [2024-06-27 16:43:02,507][06909] InferenceWorker_p0-w0: stopping experience collection (6850 times) [2024-06-27 16:43:02,533][06909] InferenceWorker_p0-w0: resuming experience collection (6850 times) [2024-06-27 16:43:03,850][06674] Fps is (10 sec: 44235.9, 60 sec: 43144.5, 300 sec: 43820.2). Total num frames: 568868864. Throughput: 0: 43475.5. Samples: 471786860. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 16:43:03,851][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 16:43:04,241][06909] Updated weights for policy 0, policy_version 34723 (0.0040) [2024-06-27 16:43:08,378][06909] Updated weights for policy 0, policy_version 34733 (0.0029) [2024-06-27 16:43:08,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 569081856. Throughput: 0: 43486.7. Samples: 472048300. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 16:43:08,856][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 16:43:11,710][06909] Updated weights for policy 0, policy_version 34743 (0.0026) [2024-06-27 16:43:13,850][06674] Fps is (10 sec: 44237.6, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 569311232. Throughput: 0: 43521.8. Samples: 472173300. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 16:43:13,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 16:43:15,981][06909] Updated weights for policy 0, policy_version 34753 (0.0034) [2024-06-27 16:43:18,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43144.7, 300 sec: 43820.3). Total num frames: 569524224. Throughput: 0: 43626.4. Samples: 472438620. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 16:43:18,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 16:43:19,447][06909] Updated weights for policy 0, policy_version 34763 (0.0035) [2024-06-27 16:43:23,309][06909] Updated weights for policy 0, policy_version 34773 (0.0029) [2024-06-27 16:43:23,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 569737216. Throughput: 0: 43614.3. Samples: 472700720. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-27 16:43:23,854][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 16:43:26,754][06909] Updated weights for policy 0, policy_version 34783 (0.0038) [2024-06-27 16:43:28,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43417.6, 300 sec: 43598.4). Total num frames: 569950208. Throughput: 0: 43756.8. Samples: 472835420. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-27 16:43:28,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:43:30,987][06909] Updated weights for policy 0, policy_version 34793 (0.0037) [2024-06-27 16:43:33,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43144.5, 300 sec: 43764.7). Total num frames: 570163200. Throughput: 0: 43618.7. Samples: 473094240. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-27 16:43:33,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 16:43:34,454][06909] Updated weights for policy 0, policy_version 34803 (0.0024) [2024-06-27 16:43:38,320][06909] Updated weights for policy 0, policy_version 34813 (0.0051) [2024-06-27 16:43:38,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43417.7, 300 sec: 43598.1). Total num frames: 570376192. Throughput: 0: 43524.4. Samples: 473354500. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-27 16:43:38,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 16:43:42,005][06909] Updated weights for policy 0, policy_version 34823 (0.0033) [2024-06-27 16:43:43,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43963.7, 300 sec: 43653.6). Total num frames: 570621952. Throughput: 0: 43682.8. Samples: 473485960. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-27 16:43:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 16:43:45,805][06909] Updated weights for policy 0, policy_version 34833 (0.0036) [2024-06-27 16:43:48,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43417.6, 300 sec: 43764.7). Total num frames: 570834944. Throughput: 0: 43611.2. Samples: 473749360. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 16:43:48,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:43:48,939][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000034842_570851328.pth... [2024-06-27 16:43:48,983][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000034202_560365568.pth [2024-06-27 16:43:49,341][06909] Updated weights for policy 0, policy_version 34843 (0.0027) [2024-06-27 16:43:53,313][06909] Updated weights for policy 0, policy_version 34853 (0.0049) [2024-06-27 16:43:53,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43690.5, 300 sec: 43653.6). Total num frames: 571047936. Throughput: 0: 43479.0. Samples: 474004860. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 16:43:53,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 16:43:56,942][06909] Updated weights for policy 0, policy_version 34863 (0.0032) [2024-06-27 16:43:58,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43417.7, 300 sec: 43598.1). Total num frames: 571260928. Throughput: 0: 43718.2. Samples: 474140620. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 16:43:58,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:44:00,703][06909] Updated weights for policy 0, policy_version 34873 (0.0035) [2024-06-27 16:44:03,850][06674] Fps is (10 sec: 45876.1, 60 sec: 43963.9, 300 sec: 43820.3). Total num frames: 571506688. Throughput: 0: 43634.2. Samples: 474402160. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 16:44:03,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:44:04,214][06909] Updated weights for policy 0, policy_version 34883 (0.0037) [2024-06-27 16:44:08,360][06909] Updated weights for policy 0, policy_version 34893 (0.0032) [2024-06-27 16:44:08,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 571686912. Throughput: 0: 43664.1. Samples: 474665600. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 16:44:08,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:44:11,762][06909] Updated weights for policy 0, policy_version 34903 (0.0036) [2024-06-27 16:44:13,850][06674] Fps is (10 sec: 40959.4, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 571916288. Throughput: 0: 43560.9. Samples: 474795660. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 16:44:13,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:44:15,909][06909] Updated weights for policy 0, policy_version 34913 (0.0034) [2024-06-27 16:44:18,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43417.5, 300 sec: 43709.2). Total num frames: 572129280. Throughput: 0: 43708.9. Samples: 475061140. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 16:44:18,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:44:19,473][06909] Updated weights for policy 0, policy_version 34923 (0.0033) [2024-06-27 16:44:23,456][06909] Updated weights for policy 0, policy_version 34933 (0.0034) [2024-06-27 16:44:23,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43417.7, 300 sec: 43653.6). Total num frames: 572342272. Throughput: 0: 43680.9. Samples: 475320140. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 16:44:23,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 16:44:26,983][06909] Updated weights for policy 0, policy_version 34943 (0.0029) [2024-06-27 16:44:28,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43690.7, 300 sec: 43599.0). Total num frames: 572571648. Throughput: 0: 43652.0. Samples: 475450300. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 16:44:28,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:44:30,834][06909] Updated weights for policy 0, policy_version 34953 (0.0037) [2024-06-27 16:44:33,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.8, 300 sec: 43764.7). Total num frames: 572801024. Throughput: 0: 43757.4. Samples: 475718440. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 16:44:33,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:44:34,316][06909] Updated weights for policy 0, policy_version 34963 (0.0033) [2024-06-27 16:44:38,309][06909] Updated weights for policy 0, policy_version 34973 (0.0049) [2024-06-27 16:44:38,852][06674] Fps is (10 sec: 44227.5, 60 sec: 43962.2, 300 sec: 43708.9). Total num frames: 573014016. Throughput: 0: 43724.3. Samples: 475972540. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 16:44:38,852][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:44:41,738][06909] Updated weights for policy 0, policy_version 34983 (0.0042) [2024-06-27 16:44:43,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43144.4, 300 sec: 43542.6). Total num frames: 573210624. Throughput: 0: 43697.2. Samples: 476107000. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 16:44:43,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:44:45,938][06909] Updated weights for policy 0, policy_version 34993 (0.0035) [2024-06-27 16:44:48,850][06674] Fps is (10 sec: 44246.2, 60 sec: 43690.8, 300 sec: 43764.7). Total num frames: 573456384. Throughput: 0: 43697.8. Samples: 476368560. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 16:44:48,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 16:44:49,615][06909] Updated weights for policy 0, policy_version 35003 (0.0030) [2024-06-27 16:44:53,454][06909] Updated weights for policy 0, policy_version 35013 (0.0037) [2024-06-27 16:44:53,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 573669376. Throughput: 0: 43672.4. Samples: 476630860. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-27 16:44:53,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 16:44:54,763][06887] Signal inference workers to stop experience collection... (6900 times) [2024-06-27 16:44:54,772][06887] Signal inference workers to resume experience collection... (6900 times) [2024-06-27 16:44:54,815][06909] InferenceWorker_p0-w0: stopping experience collection (6900 times) [2024-06-27 16:44:54,816][06909] InferenceWorker_p0-w0: resuming experience collection (6900 times) [2024-06-27 16:44:56,963][06909] Updated weights for policy 0, policy_version 35023 (0.0032) [2024-06-27 16:44:58,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43417.6, 300 sec: 43542.6). Total num frames: 573865984. Throughput: 0: 43589.8. Samples: 476757200. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-27 16:44:58,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:45:00,786][06909] Updated weights for policy 0, policy_version 35033 (0.0048) [2024-06-27 16:45:03,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43144.5, 300 sec: 43709.2). Total num frames: 574095360. Throughput: 0: 43527.7. Samples: 477019880. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-27 16:45:03,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 16:45:04,518][06909] Updated weights for policy 0, policy_version 35043 (0.0027) [2024-06-27 16:45:08,414][06909] Updated weights for policy 0, policy_version 35053 (0.0026) [2024-06-27 16:45:08,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 574308352. Throughput: 0: 43707.5. Samples: 477286980. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-27 16:45:08,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:45:12,073][06909] Updated weights for policy 0, policy_version 35063 (0.0035) [2024-06-27 16:45:13,850][06674] Fps is (10 sec: 44235.8, 60 sec: 43690.6, 300 sec: 43542.5). Total num frames: 574537728. Throughput: 0: 43738.0. Samples: 477418520. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-27 16:45:13,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 16:45:15,815][06909] Updated weights for policy 0, policy_version 35073 (0.0036) [2024-06-27 16:45:18,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.7, 300 sec: 43709.6). Total num frames: 574767104. Throughput: 0: 43608.9. Samples: 477680840. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 16:45:18,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:45:19,332][06909] Updated weights for policy 0, policy_version 35083 (0.0036) [2024-06-27 16:45:23,286][06909] Updated weights for policy 0, policy_version 35093 (0.0032) [2024-06-27 16:45:23,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 574980096. Throughput: 0: 43677.5. Samples: 477937940. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 16:45:23,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 16:45:26,842][06909] Updated weights for policy 0, policy_version 35103 (0.0030) [2024-06-27 16:45:28,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 575193088. Throughput: 0: 43658.3. Samples: 478071620. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 16:45:28,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:45:30,827][06909] Updated weights for policy 0, policy_version 35113 (0.0021) [2024-06-27 16:45:33,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43417.5, 300 sec: 43653.6). Total num frames: 575406080. Throughput: 0: 43696.3. Samples: 478334900. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 16:45:33,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 16:45:34,552][06909] Updated weights for policy 0, policy_version 35123 (0.0030) [2024-06-27 16:45:38,213][06909] Updated weights for policy 0, policy_version 35133 (0.0021) [2024-06-27 16:45:38,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43419.1, 300 sec: 43598.1). Total num frames: 575619072. Throughput: 0: 43542.2. Samples: 478590260. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 16:45:38,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:45:42,200][06909] Updated weights for policy 0, policy_version 35143 (0.0033) [2024-06-27 16:45:43,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.8, 300 sec: 43598.1). Total num frames: 575848448. Throughput: 0: 43694.6. Samples: 478723460. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 16:45:43,851][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:45:45,503][06909] Updated weights for policy 0, policy_version 35153 (0.0037) [2024-06-27 16:45:48,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43417.5, 300 sec: 43653.6). Total num frames: 576061440. Throughput: 0: 43781.7. Samples: 478990060. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 16:45:48,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:45:48,904][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000035161_576077824.pth... [2024-06-27 16:45:48,953][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000034523_565624832.pth [2024-06-27 16:45:49,795][06909] Updated weights for policy 0, policy_version 35163 (0.0042) [2024-06-27 16:45:53,387][06909] Updated weights for policy 0, policy_version 35173 (0.0038) [2024-06-27 16:45:53,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43690.8, 300 sec: 43709.2). Total num frames: 576290816. Throughput: 0: 43479.6. Samples: 479243560. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 16:45:53,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:45:57,197][06909] Updated weights for policy 0, policy_version 35183 (0.0038) [2024-06-27 16:45:58,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.7, 300 sec: 43598.1). Total num frames: 576503808. Throughput: 0: 43508.1. Samples: 479376380. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 16:45:58,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:46:00,790][06909] Updated weights for policy 0, policy_version 35193 (0.0028) [2024-06-27 16:46:03,852][06674] Fps is (10 sec: 42586.8, 60 sec: 43688.7, 300 sec: 43708.8). Total num frames: 576716800. Throughput: 0: 43673.0. Samples: 479646240. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-27 16:46:03,853][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:46:04,560][06909] Updated weights for policy 0, policy_version 35203 (0.0033) [2024-06-27 16:46:08,416][06909] Updated weights for policy 0, policy_version 35213 (0.0039) [2024-06-27 16:46:08,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 576929792. Throughput: 0: 43676.4. Samples: 479903380. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-27 16:46:08,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:46:12,110][06909] Updated weights for policy 0, policy_version 35223 (0.0027) [2024-06-27 16:46:13,850][06674] Fps is (10 sec: 44247.8, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 577159168. Throughput: 0: 43733.7. Samples: 480039640. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-27 16:46:13,851][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:46:15,724][06909] Updated weights for policy 0, policy_version 35233 (0.0031) [2024-06-27 16:46:16,708][06887] Signal inference workers to stop experience collection... (6950 times) [2024-06-27 16:46:16,709][06887] Signal inference workers to resume experience collection... (6950 times) [2024-06-27 16:46:16,732][06909] InferenceWorker_p0-w0: stopping experience collection (6950 times) [2024-06-27 16:46:16,736][06909] InferenceWorker_p0-w0: resuming experience collection (6950 times) [2024-06-27 16:46:18,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 577372160. Throughput: 0: 43649.8. Samples: 480299140. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-27 16:46:18,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:46:19,690][06909] Updated weights for policy 0, policy_version 35243 (0.0032) [2024-06-27 16:46:23,810][06909] Updated weights for policy 0, policy_version 35253 (0.0046) [2024-06-27 16:46:23,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43417.6, 300 sec: 43653.6). Total num frames: 577585152. Throughput: 0: 43747.1. Samples: 480558880. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 16:46:23,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:46:27,470][06909] Updated weights for policy 0, policy_version 35263 (0.0041) [2024-06-27 16:46:28,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43963.8, 300 sec: 43598.1). Total num frames: 577830912. Throughput: 0: 43794.7. Samples: 480694220. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 16:46:28,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:46:31,071][06909] Updated weights for policy 0, policy_version 35273 (0.0031) [2024-06-27 16:46:33,850][06674] Fps is (10 sec: 44235.3, 60 sec: 43690.5, 300 sec: 43709.1). Total num frames: 578027520. Throughput: 0: 43648.0. Samples: 480954240. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 16:46:33,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:46:34,977][06909] Updated weights for policy 0, policy_version 35283 (0.0029) [2024-06-27 16:46:38,419][06909] Updated weights for policy 0, policy_version 35293 (0.0033) [2024-06-27 16:46:38,850][06674] Fps is (10 sec: 42597.7, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 578256896. Throughput: 0: 43901.1. Samples: 481219120. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 16:46:38,851][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:46:42,338][06909] Updated weights for policy 0, policy_version 35303 (0.0032) [2024-06-27 16:46:43,850][06674] Fps is (10 sec: 44237.9, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 578469888. Throughput: 0: 43853.2. Samples: 481349780. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 16:46:43,851][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:46:45,677][06909] Updated weights for policy 0, policy_version 35313 (0.0032) [2024-06-27 16:46:48,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 578682880. Throughput: 0: 43730.1. Samples: 481613980. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 16:46:48,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:46:49,726][06909] Updated weights for policy 0, policy_version 35323 (0.0027) [2024-06-27 16:46:53,036][06909] Updated weights for policy 0, policy_version 35333 (0.0031) [2024-06-27 16:46:53,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43417.5, 300 sec: 43653.6). Total num frames: 578895872. Throughput: 0: 43770.7. Samples: 481873060. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 16:46:53,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:46:57,006][06909] Updated weights for policy 0, policy_version 35343 (0.0034) [2024-06-27 16:46:58,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43963.8, 300 sec: 43598.1). Total num frames: 579141632. Throughput: 0: 43765.5. Samples: 482009080. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 16:46:58,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:47:00,797][06909] Updated weights for policy 0, policy_version 35353 (0.0034) [2024-06-27 16:47:03,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43419.5, 300 sec: 43598.1). Total num frames: 579321856. Throughput: 0: 43716.6. Samples: 482266380. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 16:47:03,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:47:04,917][06909] Updated weights for policy 0, policy_version 35363 (0.0044) [2024-06-27 16:47:08,266][06909] Updated weights for policy 0, policy_version 35373 (0.0035) [2024-06-27 16:47:08,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.8, 300 sec: 43653.6). Total num frames: 579567616. Throughput: 0: 43696.9. Samples: 482525240. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 16:47:08,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:47:12,625][06909] Updated weights for policy 0, policy_version 35383 (0.0028) [2024-06-27 16:47:13,850][06674] Fps is (10 sec: 47513.5, 60 sec: 43963.9, 300 sec: 43598.1). Total num frames: 579796992. Throughput: 0: 43727.6. Samples: 482661960. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 16:47:13,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:47:15,742][06909] Updated weights for policy 0, policy_version 35393 (0.0027) [2024-06-27 16:47:18,852][06674] Fps is (10 sec: 42589.6, 60 sec: 43689.2, 300 sec: 43653.3). Total num frames: 579993600. Throughput: 0: 43683.3. Samples: 482920060. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 16:47:18,852][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 16:47:19,908][06909] Updated weights for policy 0, policy_version 35403 (0.0042) [2024-06-27 16:47:23,317][06909] Updated weights for policy 0, policy_version 35413 (0.0037) [2024-06-27 16:47:23,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.8, 300 sec: 43653.7). Total num frames: 580222976. Throughput: 0: 43697.1. Samples: 483185480. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 16:47:23,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:47:27,223][06909] Updated weights for policy 0, policy_version 35423 (0.0024) [2024-06-27 16:47:28,850][06674] Fps is (10 sec: 45884.5, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 580452352. Throughput: 0: 43766.3. Samples: 483319260. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 16:47:28,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:47:30,755][06909] Updated weights for policy 0, policy_version 35433 (0.0037) [2024-06-27 16:47:33,853][06674] Fps is (10 sec: 42583.6, 60 sec: 43688.5, 300 sec: 43653.2). Total num frames: 580648960. Throughput: 0: 43717.2. Samples: 483581400. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 16:47:33,854][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:47:34,648][06909] Updated weights for policy 0, policy_version 35443 (0.0026) [2024-06-27 16:47:36,853][06887] Signal inference workers to stop experience collection... (7000 times) [2024-06-27 16:47:36,900][06909] InferenceWorker_p0-w0: stopping experience collection (7000 times) [2024-06-27 16:47:36,907][06887] Signal inference workers to resume experience collection... (7000 times) [2024-06-27 16:47:36,915][06909] InferenceWorker_p0-w0: resuming experience collection (7000 times) [2024-06-27 16:47:38,567][06909] Updated weights for policy 0, policy_version 35453 (0.0032) [2024-06-27 16:47:38,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43690.8, 300 sec: 43709.2). Total num frames: 580878336. Throughput: 0: 43769.0. Samples: 483842660. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 16:47:38,850][06674] Avg episode reward: [(0, '0.393')] [2024-06-27 16:47:42,047][06909] Updated weights for policy 0, policy_version 35463 (0.0037) [2024-06-27 16:47:43,850][06674] Fps is (10 sec: 44251.7, 60 sec: 43690.8, 300 sec: 43598.1). Total num frames: 581091328. Throughput: 0: 43700.4. Samples: 483975600. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 16:47:43,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:47:45,997][06909] Updated weights for policy 0, policy_version 35473 (0.0037) [2024-06-27 16:47:48,856][06674] Fps is (10 sec: 42572.2, 60 sec: 43686.2, 300 sec: 43652.7). Total num frames: 581304320. Throughput: 0: 43750.9. Samples: 484235440. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 16:47:48,856][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:47:48,868][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000035480_581304320.pth... [2024-06-27 16:47:48,927][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000034842_570851328.pth [2024-06-27 16:47:49,588][06909] Updated weights for policy 0, policy_version 35483 (0.0038) [2024-06-27 16:47:53,449][06909] Updated weights for policy 0, policy_version 35493 (0.0034) [2024-06-27 16:47:53,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 581517312. Throughput: 0: 43704.8. Samples: 484491960. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 16:47:53,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:47:57,262][06909] Updated weights for policy 0, policy_version 35503 (0.0031) [2024-06-27 16:47:58,850][06674] Fps is (10 sec: 44263.5, 60 sec: 43417.5, 300 sec: 43653.7). Total num frames: 581746688. Throughput: 0: 43591.9. Samples: 484623600. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 16:47:58,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:48:00,735][06909] Updated weights for policy 0, policy_version 35513 (0.0031) [2024-06-27 16:48:03,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.7, 300 sec: 43653.6). Total num frames: 581959680. Throughput: 0: 43638.4. Samples: 484883700. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 16:48:03,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:48:04,679][06909] Updated weights for policy 0, policy_version 35523 (0.0027) [2024-06-27 16:48:08,558][06909] Updated weights for policy 0, policy_version 35533 (0.0034) [2024-06-27 16:48:08,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 582172672. Throughput: 0: 43582.6. Samples: 485146700. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 16:48:08,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:48:12,359][06909] Updated weights for policy 0, policy_version 35543 (0.0023) [2024-06-27 16:48:13,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43417.6, 300 sec: 43653.6). Total num frames: 582402048. Throughput: 0: 43485.0. Samples: 485276080. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 16:48:13,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:48:15,929][06909] Updated weights for policy 0, policy_version 35553 (0.0027) [2024-06-27 16:48:18,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43419.1, 300 sec: 43598.1). Total num frames: 582598656. Throughput: 0: 43417.1. Samples: 485535020. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 16:48:18,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:48:19,853][06909] Updated weights for policy 0, policy_version 35563 (0.0036) [2024-06-27 16:48:23,589][06909] Updated weights for policy 0, policy_version 35573 (0.0030) [2024-06-27 16:48:23,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43417.6, 300 sec: 43653.6). Total num frames: 582828032. Throughput: 0: 43545.3. Samples: 485802200. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 16:48:23,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:48:27,301][06909] Updated weights for policy 0, policy_version 35583 (0.0038) [2024-06-27 16:48:28,852][06674] Fps is (10 sec: 45865.6, 60 sec: 43416.2, 300 sec: 43708.9). Total num frames: 583057408. Throughput: 0: 43601.1. Samples: 485937740. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 16:48:28,853][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:48:30,831][06909] Updated weights for policy 0, policy_version 35593 (0.0024) [2024-06-27 16:48:33,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43147.0, 300 sec: 43598.1). Total num frames: 583237632. Throughput: 0: 43509.5. Samples: 486193100. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 16:48:33,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:48:34,745][06909] Updated weights for policy 0, policy_version 35603 (0.0039) [2024-06-27 16:48:38,343][06909] Updated weights for policy 0, policy_version 35613 (0.0028) [2024-06-27 16:48:38,850][06674] Fps is (10 sec: 44246.2, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 583499776. Throughput: 0: 43489.9. Samples: 486449000. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 16:48:38,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:48:42,182][06909] Updated weights for policy 0, policy_version 35623 (0.0046) [2024-06-27 16:48:43,850][06674] Fps is (10 sec: 47512.6, 60 sec: 43690.5, 300 sec: 43653.6). Total num frames: 583712768. Throughput: 0: 43610.6. Samples: 486586080. Policy #0 lag: (min: 1.0, avg: 8.9, max: 21.0) [2024-06-27 16:48:43,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:48:45,835][06909] Updated weights for policy 0, policy_version 35633 (0.0020) [2024-06-27 16:48:48,853][06674] Fps is (10 sec: 40945.6, 60 sec: 43419.5, 300 sec: 43597.6). Total num frames: 583909376. Throughput: 0: 43753.1. Samples: 486852740. Policy #0 lag: (min: 1.0, avg: 8.9, max: 21.0) [2024-06-27 16:48:48,854][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:48:49,613][06909] Updated weights for policy 0, policy_version 35643 (0.0041) [2024-06-27 16:48:53,351][06909] Updated weights for policy 0, policy_version 35653 (0.0045) [2024-06-27 16:48:53,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43963.8, 300 sec: 43709.2). Total num frames: 584155136. Throughput: 0: 43700.0. Samples: 487113200. Policy #0 lag: (min: 1.0, avg: 8.9, max: 21.0) [2024-06-27 16:48:53,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:48:57,221][06909] Updated weights for policy 0, policy_version 35663 (0.0030) [2024-06-27 16:48:58,850][06674] Fps is (10 sec: 47530.0, 60 sec: 43963.8, 300 sec: 43653.6). Total num frames: 584384512. Throughput: 0: 43838.2. Samples: 487248800. Policy #0 lag: (min: 1.0, avg: 8.9, max: 21.0) [2024-06-27 16:48:58,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:49:00,691][06909] Updated weights for policy 0, policy_version 35673 (0.0040) [2024-06-27 16:49:03,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43417.5, 300 sec: 43653.6). Total num frames: 584564736. Throughput: 0: 43894.5. Samples: 487510280. Policy #0 lag: (min: 1.0, avg: 8.9, max: 21.0) [2024-06-27 16:49:03,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:49:04,864][06909] Updated weights for policy 0, policy_version 35683 (0.0032) [2024-06-27 16:49:08,625][06909] Updated weights for policy 0, policy_version 35693 (0.0030) [2024-06-27 16:49:08,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 584794112. Throughput: 0: 43889.7. Samples: 487777240. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-27 16:49:08,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 16:49:12,172][06909] Updated weights for policy 0, policy_version 35703 (0.0042) [2024-06-27 16:49:13,850][06674] Fps is (10 sec: 47514.2, 60 sec: 43963.7, 300 sec: 43764.7). Total num frames: 585039872. Throughput: 0: 43700.2. Samples: 487904160. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-27 16:49:13,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:49:16,039][06909] Updated weights for policy 0, policy_version 35713 (0.0032) [2024-06-27 16:49:18,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43963.8, 300 sec: 43709.2). Total num frames: 585236480. Throughput: 0: 43787.6. Samples: 488163540. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-27 16:49:18,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:49:19,702][06909] Updated weights for policy 0, policy_version 35723 (0.0032) [2024-06-27 16:49:23,312][06909] Updated weights for policy 0, policy_version 35733 (0.0033) [2024-06-27 16:49:23,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 585449472. Throughput: 0: 43937.8. Samples: 488426200. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-27 16:49:23,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:49:27,016][06909] Updated weights for policy 0, policy_version 35743 (0.0035) [2024-06-27 16:49:27,741][06887] Signal inference workers to stop experience collection... (7050 times) [2024-06-27 16:49:27,791][06909] InferenceWorker_p0-w0: stopping experience collection (7050 times) [2024-06-27 16:49:27,810][06887] Signal inference workers to resume experience collection... (7050 times) [2024-06-27 16:49:27,811][06909] InferenceWorker_p0-w0: resuming experience collection (7050 times) [2024-06-27 16:49:28,850][06674] Fps is (10 sec: 45874.4, 60 sec: 43965.2, 300 sec: 43709.2). Total num frames: 585695232. Throughput: 0: 43894.3. Samples: 488561320. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-27 16:49:28,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:49:30,705][06909] Updated weights for policy 0, policy_version 35753 (0.0034) [2024-06-27 16:49:33,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.8, 300 sec: 43653.9). Total num frames: 585891840. Throughput: 0: 43795.4. Samples: 488823380. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-27 16:49:33,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:49:34,300][06909] Updated weights for policy 0, policy_version 35763 (0.0032) [2024-06-27 16:49:38,149][06909] Updated weights for policy 0, policy_version 35773 (0.0037) [2024-06-27 16:49:38,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43417.5, 300 sec: 43709.2). Total num frames: 586104832. Throughput: 0: 43781.3. Samples: 489083360. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-27 16:49:38,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 16:49:41,979][06909] Updated weights for policy 0, policy_version 35783 (0.0032) [2024-06-27 16:49:43,850][06674] Fps is (10 sec: 47512.7, 60 sec: 44236.8, 300 sec: 43764.7). Total num frames: 586366976. Throughput: 0: 43822.0. Samples: 489220800. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-27 16:49:43,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:49:45,564][06909] Updated weights for policy 0, policy_version 35793 (0.0030) [2024-06-27 16:49:48,852][06674] Fps is (10 sec: 45865.9, 60 sec: 44237.8, 300 sec: 43708.9). Total num frames: 586563584. Throughput: 0: 43879.9. Samples: 489484960. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-27 16:49:48,852][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:49:48,862][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000035801_586563584.pth... [2024-06-27 16:49:48,920][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000035161_576077824.pth [2024-06-27 16:49:49,787][06909] Updated weights for policy 0, policy_version 35803 (0.0036) [2024-06-27 16:49:52,925][06909] Updated weights for policy 0, policy_version 35813 (0.0034) [2024-06-27 16:49:53,850][06674] Fps is (10 sec: 39322.5, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 586760192. Throughput: 0: 43647.7. Samples: 489741380. Policy #0 lag: (min: 0.0, avg: 11.7, max: 24.0) [2024-06-27 16:49:53,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:49:57,065][06909] Updated weights for policy 0, policy_version 35823 (0.0029) [2024-06-27 16:49:58,850][06674] Fps is (10 sec: 44245.4, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 587005952. Throughput: 0: 43893.7. Samples: 489879380. Policy #0 lag: (min: 0.0, avg: 11.7, max: 24.0) [2024-06-27 16:49:58,856][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:50:00,809][06909] Updated weights for policy 0, policy_version 35833 (0.0037) [2024-06-27 16:50:03,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.9, 300 sec: 43709.2). Total num frames: 587202560. Throughput: 0: 43875.5. Samples: 490137940. Policy #0 lag: (min: 0.0, avg: 11.7, max: 24.0) [2024-06-27 16:50:03,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:50:04,491][06909] Updated weights for policy 0, policy_version 35843 (0.0037) [2024-06-27 16:50:08,237][06909] Updated weights for policy 0, policy_version 35853 (0.0027) [2024-06-27 16:50:08,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 587415552. Throughput: 0: 43793.2. Samples: 490396900. Policy #0 lag: (min: 0.0, avg: 11.7, max: 24.0) [2024-06-27 16:50:08,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:50:11,792][06909] Updated weights for policy 0, policy_version 35863 (0.0029) [2024-06-27 16:50:13,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 587661312. Throughput: 0: 43867.7. Samples: 490535360. Policy #0 lag: (min: 0.0, avg: 11.7, max: 24.0) [2024-06-27 16:50:13,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:50:15,613][06909] Updated weights for policy 0, policy_version 35873 (0.0041) [2024-06-27 16:50:18,850][06674] Fps is (10 sec: 47514.5, 60 sec: 44236.8, 300 sec: 43764.7). Total num frames: 587890688. Throughput: 0: 43923.2. Samples: 490799920. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 16:50:18,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 16:50:19,152][06909] Updated weights for policy 0, policy_version 35883 (0.0033) [2024-06-27 16:50:23,160][06909] Updated weights for policy 0, policy_version 35893 (0.0029) [2024-06-27 16:50:23,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 588070912. Throughput: 0: 43966.6. Samples: 491061860. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 16:50:23,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 16:50:26,872][06909] Updated weights for policy 0, policy_version 35903 (0.0041) [2024-06-27 16:50:28,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.8, 300 sec: 43820.3). Total num frames: 588333056. Throughput: 0: 43846.3. Samples: 491193880. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 16:50:28,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 16:50:30,683][06909] Updated weights for policy 0, policy_version 35913 (0.0031) [2024-06-27 16:50:33,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43963.8, 300 sec: 43764.7). Total num frames: 588529664. Throughput: 0: 43919.8. Samples: 491461260. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 16:50:33,850][06674] Avg episode reward: [(0, '0.408')] [2024-06-27 16:50:34,254][06909] Updated weights for policy 0, policy_version 35923 (0.0032) [2024-06-27 16:50:38,227][06909] Updated weights for policy 0, policy_version 35933 (0.0040) [2024-06-27 16:50:38,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 588742656. Throughput: 0: 44003.4. Samples: 491721540. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 16:50:38,851][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:50:41,730][06909] Updated weights for policy 0, policy_version 35943 (0.0041) [2024-06-27 16:50:43,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43690.8, 300 sec: 43820.3). Total num frames: 588988416. Throughput: 0: 43759.2. Samples: 491848540. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 16:50:43,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:50:45,665][06909] Updated weights for policy 0, policy_version 35953 (0.0032) [2024-06-27 16:50:48,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43692.1, 300 sec: 43709.2). Total num frames: 589185024. Throughput: 0: 43890.1. Samples: 492113000. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 16:50:48,851][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 16:50:49,436][06909] Updated weights for policy 0, policy_version 35963 (0.0023) [2024-06-27 16:50:53,291][06909] Updated weights for policy 0, policy_version 35973 (0.0022) [2024-06-27 16:50:53,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 589398016. Throughput: 0: 44094.8. Samples: 492381160. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 16:50:53,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:50:56,689][06909] Updated weights for policy 0, policy_version 35983 (0.0034) [2024-06-27 16:50:58,856][06674] Fps is (10 sec: 47485.0, 60 sec: 44232.4, 300 sec: 43875.3). Total num frames: 589660160. Throughput: 0: 43913.6. Samples: 492511740. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 16:50:58,856][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:51:00,769][06909] Updated weights for policy 0, policy_version 35993 (0.0031) [2024-06-27 16:51:03,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 589824000. Throughput: 0: 43753.3. Samples: 492768820. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-27 16:51:03,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:51:04,269][06909] Updated weights for policy 0, policy_version 36003 (0.0024) [2024-06-27 16:51:05,130][06887] Signal inference workers to stop experience collection... (7100 times) [2024-06-27 16:51:05,171][06909] InferenceWorker_p0-w0: stopping experience collection (7100 times) [2024-06-27 16:51:05,245][06887] Signal inference workers to resume experience collection... (7100 times) [2024-06-27 16:51:05,246][06909] InferenceWorker_p0-w0: resuming experience collection (7100 times) [2024-06-27 16:51:08,168][06909] Updated weights for policy 0, policy_version 36013 (0.0033) [2024-06-27 16:51:08,850][06674] Fps is (10 sec: 39345.3, 60 sec: 43963.8, 300 sec: 43709.2). Total num frames: 590053376. Throughput: 0: 43872.4. Samples: 493036120. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-27 16:51:08,850][06674] Avg episode reward: [(0, '0.409')] [2024-06-27 16:51:11,673][06909] Updated weights for policy 0, policy_version 36023 (0.0023) [2024-06-27 16:51:13,850][06674] Fps is (10 sec: 47513.2, 60 sec: 43963.7, 300 sec: 43820.3). Total num frames: 590299136. Throughput: 0: 43883.1. Samples: 493168620. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-27 16:51:13,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 16:51:15,706][06909] Updated weights for policy 0, policy_version 36033 (0.0028) [2024-06-27 16:51:18,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43690.6, 300 sec: 43820.3). Total num frames: 590512128. Throughput: 0: 43867.5. Samples: 493435300. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-27 16:51:18,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:51:19,262][06909] Updated weights for policy 0, policy_version 36043 (0.0039) [2024-06-27 16:51:23,642][06909] Updated weights for policy 0, policy_version 36053 (0.0037) [2024-06-27 16:51:23,851][06674] Fps is (10 sec: 40956.5, 60 sec: 43963.1, 300 sec: 43653.5). Total num frames: 590708736. Throughput: 0: 43778.7. Samples: 493691620. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-27 16:51:23,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:51:26,725][06909] Updated weights for policy 0, policy_version 36063 (0.0041) [2024-06-27 16:51:28,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.8, 300 sec: 43875.9). Total num frames: 590970880. Throughput: 0: 43772.0. Samples: 493818280. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-27 16:51:28,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:51:31,027][06909] Updated weights for policy 0, policy_version 36073 (0.0027) [2024-06-27 16:51:33,850][06674] Fps is (10 sec: 44241.0, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 591151104. Throughput: 0: 43724.1. Samples: 494080580. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-27 16:51:33,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:51:34,417][06909] Updated weights for policy 0, policy_version 36083 (0.0034) [2024-06-27 16:51:38,545][06909] Updated weights for policy 0, policy_version 36093 (0.0029) [2024-06-27 16:51:38,850][06674] Fps is (10 sec: 39321.3, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 591364096. Throughput: 0: 43825.2. Samples: 494353300. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-27 16:51:38,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:51:41,742][06909] Updated weights for policy 0, policy_version 36103 (0.0030) [2024-06-27 16:51:43,850][06674] Fps is (10 sec: 47513.3, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 591626240. Throughput: 0: 43709.4. Samples: 494478400. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-27 16:51:43,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:51:45,854][06909] Updated weights for policy 0, policy_version 36113 (0.0028) [2024-06-27 16:51:48,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 591806464. Throughput: 0: 43826.7. Samples: 494741020. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 16:51:48,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 16:51:48,873][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000036122_591822848.pth... [2024-06-27 16:51:48,920][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000035480_581304320.pth [2024-06-27 16:51:49,505][06909] Updated weights for policy 0, policy_version 36123 (0.0044) [2024-06-27 16:51:53,430][06909] Updated weights for policy 0, policy_version 36133 (0.0039) [2024-06-27 16:51:53,850][06674] Fps is (10 sec: 39321.7, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 592019456. Throughput: 0: 43818.7. Samples: 495007960. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 16:51:53,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:51:56,782][06909] Updated weights for policy 0, policy_version 36143 (0.0034) [2024-06-27 16:51:58,850][06674] Fps is (10 sec: 47513.4, 60 sec: 43695.1, 300 sec: 43931.3). Total num frames: 592281600. Throughput: 0: 43761.8. Samples: 495137900. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 16:51:58,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:52:00,875][06909] Updated weights for policy 0, policy_version 36153 (0.0036) [2024-06-27 16:52:03,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 592461824. Throughput: 0: 43605.3. Samples: 495397540. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 16:52:03,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:52:04,267][06909] Updated weights for policy 0, policy_version 36163 (0.0039) [2024-06-27 16:52:08,264][06909] Updated weights for policy 0, policy_version 36173 (0.0025) [2024-06-27 16:52:08,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 592691200. Throughput: 0: 43832.8. Samples: 495664060. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 16:52:08,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:52:11,907][06909] Updated weights for policy 0, policy_version 36183 (0.0022) [2024-06-27 16:52:13,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43690.7, 300 sec: 43820.6). Total num frames: 592920576. Throughput: 0: 43820.4. Samples: 495790200. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 16:52:13,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:52:15,802][06909] Updated weights for policy 0, policy_version 36193 (0.0026) [2024-06-27 16:52:18,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 593117184. Throughput: 0: 43811.1. Samples: 496052080. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 16:52:18,850][06674] Avg episode reward: [(0, '0.409')] [2024-06-27 16:52:19,214][06909] Updated weights for policy 0, policy_version 36203 (0.0036) [2024-06-27 16:52:23,081][06909] Updated weights for policy 0, policy_version 36213 (0.0031) [2024-06-27 16:52:23,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43964.4, 300 sec: 43709.2). Total num frames: 593346560. Throughput: 0: 43628.1. Samples: 496316560. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 16:52:23,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:52:26,577][06909] Updated weights for policy 0, policy_version 36223 (0.0046) [2024-06-27 16:52:28,852][06674] Fps is (10 sec: 47503.8, 60 sec: 43689.2, 300 sec: 43876.0). Total num frames: 593592320. Throughput: 0: 43783.8. Samples: 496448760. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 16:52:28,852][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:52:30,450][06909] Updated weights for policy 0, policy_version 36233 (0.0025) [2024-06-27 16:52:33,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 43764.7). Total num frames: 593788928. Throughput: 0: 43862.2. Samples: 496714820. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 16:52:33,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:52:33,963][06909] Updated weights for policy 0, policy_version 36243 (0.0029) [2024-06-27 16:52:35,926][06887] Signal inference workers to stop experience collection... (7150 times) [2024-06-27 16:52:35,926][06887] Signal inference workers to resume experience collection... (7150 times) [2024-06-27 16:52:35,976][06909] InferenceWorker_p0-w0: stopping experience collection (7150 times) [2024-06-27 16:52:35,976][06909] InferenceWorker_p0-w0: resuming experience collection (7150 times) [2024-06-27 16:52:37,984][06909] Updated weights for policy 0, policy_version 36253 (0.0034) [2024-06-27 16:52:38,850][06674] Fps is (10 sec: 40968.2, 60 sec: 43963.7, 300 sec: 43764.7). Total num frames: 594001920. Throughput: 0: 43976.4. Samples: 496986900. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-27 16:52:38,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:52:41,241][06909] Updated weights for policy 0, policy_version 36263 (0.0031) [2024-06-27 16:52:43,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43690.7, 300 sec: 43876.7). Total num frames: 594247680. Throughput: 0: 43849.3. Samples: 497111120. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-27 16:52:43,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:52:45,761][06909] Updated weights for policy 0, policy_version 36273 (0.0041) [2024-06-27 16:52:48,741][06909] Updated weights for policy 0, policy_version 36283 (0.0040) [2024-06-27 16:52:48,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44236.8, 300 sec: 43875.8). Total num frames: 594460672. Throughput: 0: 43997.4. Samples: 497377420. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-27 16:52:48,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:52:53,128][06909] Updated weights for policy 0, policy_version 36293 (0.0041) [2024-06-27 16:52:53,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44236.8, 300 sec: 43820.3). Total num frames: 594673664. Throughput: 0: 43853.8. Samples: 497637480. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-27 16:52:53,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:52:56,403][06909] Updated weights for policy 0, policy_version 36303 (0.0038) [2024-06-27 16:52:58,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 594903040. Throughput: 0: 43874.1. Samples: 497764540. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-27 16:52:58,851][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:53:00,379][06909] Updated weights for policy 0, policy_version 36313 (0.0036) [2024-06-27 16:53:03,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.8, 300 sec: 43820.3). Total num frames: 595099648. Throughput: 0: 43868.9. Samples: 498026180. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-27 16:53:03,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:53:03,973][06909] Updated weights for policy 0, policy_version 36323 (0.0029) [2024-06-27 16:53:07,821][06909] Updated weights for policy 0, policy_version 36333 (0.0038) [2024-06-27 16:53:08,850][06674] Fps is (10 sec: 40960.7, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 595312640. Throughput: 0: 43869.8. Samples: 498290700. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-27 16:53:08,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:53:11,425][06909] Updated weights for policy 0, policy_version 36343 (0.0037) [2024-06-27 16:53:13,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 595542016. Throughput: 0: 43751.3. Samples: 498417480. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-27 16:53:13,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 16:53:15,628][06909] Updated weights for policy 0, policy_version 36353 (0.0035) [2024-06-27 16:53:18,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.7, 300 sec: 43820.2). Total num frames: 595755008. Throughput: 0: 43696.8. Samples: 498681180. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-27 16:53:18,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:53:18,863][06909] Updated weights for policy 0, policy_version 36363 (0.0030) [2024-06-27 16:53:23,294][06909] Updated weights for policy 0, policy_version 36373 (0.0034) [2024-06-27 16:53:23,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.6, 300 sec: 43820.6). Total num frames: 595984384. Throughput: 0: 43583.5. Samples: 498948160. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 16:53:23,851][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:53:26,295][06909] Updated weights for policy 0, policy_version 36383 (0.0022) [2024-06-27 16:53:28,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43692.1, 300 sec: 43986.9). Total num frames: 596213760. Throughput: 0: 43679.1. Samples: 499076680. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 16:53:28,850][06674] Avg episode reward: [(0, '0.394')] [2024-06-27 16:53:30,558][06909] Updated weights for policy 0, policy_version 36393 (0.0031) [2024-06-27 16:53:33,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 596410368. Throughput: 0: 43619.0. Samples: 499340280. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 16:53:33,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:53:34,050][06909] Updated weights for policy 0, policy_version 36403 (0.0033) [2024-06-27 16:53:37,840][06909] Updated weights for policy 0, policy_version 36413 (0.0024) [2024-06-27 16:53:38,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.8, 300 sec: 43820.3). Total num frames: 596639744. Throughput: 0: 43765.8. Samples: 499606940. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 16:53:38,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:53:41,396][06887] Signal inference workers to stop experience collection... (7200 times) [2024-06-27 16:53:41,444][06909] InferenceWorker_p0-w0: stopping experience collection (7200 times) [2024-06-27 16:53:41,448][06887] Signal inference workers to resume experience collection... (7200 times) [2024-06-27 16:53:41,459][06909] InferenceWorker_p0-w0: resuming experience collection (7200 times) [2024-06-27 16:53:41,462][06909] Updated weights for policy 0, policy_version 36423 (0.0038) [2024-06-27 16:53:43,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43690.7, 300 sec: 43931.9). Total num frames: 596869120. Throughput: 0: 43920.6. Samples: 499740960. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 16:53:43,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 16:53:45,031][06909] Updated weights for policy 0, policy_version 36433 (0.0033) [2024-06-27 16:53:48,569][06909] Updated weights for policy 0, policy_version 36443 (0.0024) [2024-06-27 16:53:48,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 597082112. Throughput: 0: 44142.7. Samples: 500012600. Policy #0 lag: (min: 1.0, avg: 10.4, max: 21.0) [2024-06-27 16:53:48,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:53:48,861][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000036443_597082112.pth... [2024-06-27 16:53:48,906][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000035801_586563584.pth [2024-06-27 16:53:52,593][06909] Updated weights for policy 0, policy_version 36453 (0.0031) [2024-06-27 16:53:53,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 43820.3). Total num frames: 597311488. Throughput: 0: 44045.8. Samples: 500272760. Policy #0 lag: (min: 1.0, avg: 10.4, max: 21.0) [2024-06-27 16:53:53,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 16:53:55,909][06909] Updated weights for policy 0, policy_version 36463 (0.0041) [2024-06-27 16:53:58,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.8, 300 sec: 43931.4). Total num frames: 597524480. Throughput: 0: 44097.8. Samples: 500401880. Policy #0 lag: (min: 1.0, avg: 10.4, max: 21.0) [2024-06-27 16:53:58,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:54:00,022][06909] Updated weights for policy 0, policy_version 36473 (0.0038) [2024-06-27 16:54:03,505][06909] Updated weights for policy 0, policy_version 36483 (0.0039) [2024-06-27 16:54:03,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44509.8, 300 sec: 43986.9). Total num frames: 597770240. Throughput: 0: 44212.0. Samples: 500670720. Policy #0 lag: (min: 1.0, avg: 10.4, max: 21.0) [2024-06-27 16:54:03,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 16:54:07,675][06909] Updated weights for policy 0, policy_version 36493 (0.0038) [2024-06-27 16:54:08,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.7, 300 sec: 43820.2). Total num frames: 597966848. Throughput: 0: 44116.0. Samples: 500933380. Policy #0 lag: (min: 1.0, avg: 10.4, max: 21.0) [2024-06-27 16:54:08,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:54:10,874][06909] Updated weights for policy 0, policy_version 36503 (0.0031) [2024-06-27 16:54:13,850][06674] Fps is (10 sec: 42598.8, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 598196224. Throughput: 0: 44013.4. Samples: 501057280. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 16:54:13,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:54:15,319][06909] Updated weights for policy 0, policy_version 36513 (0.0032) [2024-06-27 16:54:18,530][06909] Updated weights for policy 0, policy_version 36523 (0.0027) [2024-06-27 16:54:18,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 598409216. Throughput: 0: 44119.2. Samples: 501325640. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 16:54:18,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:54:22,629][06909] Updated weights for policy 0, policy_version 36533 (0.0030) [2024-06-27 16:54:23,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.8, 300 sec: 43875.8). Total num frames: 598638592. Throughput: 0: 43951.0. Samples: 501584740. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 16:54:23,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 16:54:26,228][06909] Updated weights for policy 0, policy_version 36543 (0.0036) [2024-06-27 16:54:28,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 598835200. Throughput: 0: 43902.5. Samples: 501716580. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 16:54:28,851][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:54:29,887][06909] Updated weights for policy 0, policy_version 36553 (0.0023) [2024-06-27 16:54:33,618][06909] Updated weights for policy 0, policy_version 36563 (0.0026) [2024-06-27 16:54:33,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 599048192. Throughput: 0: 43763.2. Samples: 501981940. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 16:54:33,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:54:37,467][06909] Updated weights for policy 0, policy_version 36573 (0.0030) [2024-06-27 16:54:38,852][06674] Fps is (10 sec: 45866.2, 60 sec: 44235.2, 300 sec: 43820.0). Total num frames: 599293952. Throughput: 0: 43891.3. Samples: 502247960. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 16:54:38,852][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:54:40,945][06909] Updated weights for policy 0, policy_version 36583 (0.0031) [2024-06-27 16:54:43,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43690.6, 300 sec: 43820.6). Total num frames: 599490560. Throughput: 0: 43865.3. Samples: 502375820. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 16:54:43,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 16:54:44,942][06909] Updated weights for policy 0, policy_version 36593 (0.0034) [2024-06-27 16:54:48,328][06909] Updated weights for policy 0, policy_version 36603 (0.0029) [2024-06-27 16:54:48,850][06674] Fps is (10 sec: 44245.7, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 599736320. Throughput: 0: 43795.6. Samples: 502641520. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 16:54:48,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:54:52,358][06909] Updated weights for policy 0, policy_version 36613 (0.0033) [2024-06-27 16:54:53,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43690.6, 300 sec: 43820.3). Total num frames: 599932928. Throughput: 0: 43765.7. Samples: 502902840. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 16:54:53,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:54:55,674][06909] Updated weights for policy 0, policy_version 36623 (0.0039) [2024-06-27 16:54:58,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 600145920. Throughput: 0: 43923.5. Samples: 503033840. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 16:54:58,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:54:59,841][06909] Updated weights for policy 0, policy_version 36633 (0.0033) [2024-06-27 16:55:03,330][06909] Updated weights for policy 0, policy_version 36643 (0.0036) [2024-06-27 16:55:03,850][06674] Fps is (10 sec: 45875.9, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 600391680. Throughput: 0: 44019.2. Samples: 503306500. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 16:55:03,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:55:07,176][06909] Updated weights for policy 0, policy_version 36653 (0.0031) [2024-06-27 16:55:08,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 600604672. Throughput: 0: 43913.8. Samples: 503560860. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 16:55:08,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:55:10,727][06909] Updated weights for policy 0, policy_version 36663 (0.0022) [2024-06-27 16:55:11,870][06887] Signal inference workers to stop experience collection... (7250 times) [2024-06-27 16:55:11,871][06887] Signal inference workers to resume experience collection... (7250 times) [2024-06-27 16:55:11,916][06909] InferenceWorker_p0-w0: stopping experience collection (7250 times) [2024-06-27 16:55:11,916][06909] InferenceWorker_p0-w0: resuming experience collection (7250 times) [2024-06-27 16:55:13,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.6, 300 sec: 43820.3). Total num frames: 600817664. Throughput: 0: 44023.7. Samples: 503697640. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 16:55:13,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:55:14,585][06909] Updated weights for policy 0, policy_version 36673 (0.0033) [2024-06-27 16:55:18,282][06909] Updated weights for policy 0, policy_version 36683 (0.0029) [2024-06-27 16:55:18,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 601030656. Throughput: 0: 44035.0. Samples: 503963520. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 16:55:18,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:55:21,884][06909] Updated weights for policy 0, policy_version 36693 (0.0031) [2024-06-27 16:55:23,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 601276416. Throughput: 0: 43897.2. Samples: 504223240. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 16:55:23,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:55:25,599][06909] Updated weights for policy 0, policy_version 36703 (0.0035) [2024-06-27 16:55:28,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43690.8, 300 sec: 43820.3). Total num frames: 601456640. Throughput: 0: 44116.1. Samples: 504361040. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 16:55:28,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:55:29,549][06909] Updated weights for policy 0, policy_version 36713 (0.0039) [2024-06-27 16:55:33,228][06909] Updated weights for policy 0, policy_version 36723 (0.0030) [2024-06-27 16:55:33,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 601686016. Throughput: 0: 43921.8. Samples: 504618000. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 16:55:33,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:55:36,808][06909] Updated weights for policy 0, policy_version 36733 (0.0027) [2024-06-27 16:55:38,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43692.2, 300 sec: 43820.3). Total num frames: 601915392. Throughput: 0: 43873.1. Samples: 504877120. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 16:55:38,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:55:40,665][06909] Updated weights for policy 0, policy_version 36743 (0.0026) [2024-06-27 16:55:43,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 602128384. Throughput: 0: 43905.8. Samples: 505009600. Policy #0 lag: (min: 0.0, avg: 11.3, max: 24.0) [2024-06-27 16:55:43,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:55:44,375][06909] Updated weights for policy 0, policy_version 36753 (0.0025) [2024-06-27 16:55:48,008][06909] Updated weights for policy 0, policy_version 36763 (0.0036) [2024-06-27 16:55:48,850][06674] Fps is (10 sec: 44235.9, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 602357760. Throughput: 0: 43865.2. Samples: 505280440. Policy #0 lag: (min: 0.0, avg: 11.3, max: 24.0) [2024-06-27 16:55:48,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:55:48,863][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000036765_602357760.pth... [2024-06-27 16:55:48,930][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000036122_591822848.pth [2024-06-27 16:55:52,178][06909] Updated weights for policy 0, policy_version 36773 (0.0032) [2024-06-27 16:55:53,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44236.9, 300 sec: 43821.2). Total num frames: 602587136. Throughput: 0: 43858.7. Samples: 505534500. Policy #0 lag: (min: 0.0, avg: 11.3, max: 24.0) [2024-06-27 16:55:53,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 16:55:55,755][06909] Updated weights for policy 0, policy_version 36783 (0.0031) [2024-06-27 16:55:58,851][06674] Fps is (10 sec: 42595.0, 60 sec: 43963.0, 300 sec: 43931.2). Total num frames: 602783744. Throughput: 0: 43753.3. Samples: 505666580. Policy #0 lag: (min: 0.0, avg: 11.3, max: 24.0) [2024-06-27 16:55:58,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:55:59,466][06909] Updated weights for policy 0, policy_version 36793 (0.0045) [2024-06-27 16:56:03,120][06909] Updated weights for policy 0, policy_version 36803 (0.0030) [2024-06-27 16:56:03,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43417.5, 300 sec: 43875.8). Total num frames: 602996736. Throughput: 0: 43759.6. Samples: 505932700. Policy #0 lag: (min: 0.0, avg: 11.3, max: 24.0) [2024-06-27 16:56:03,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:56:06,822][06909] Updated weights for policy 0, policy_version 36813 (0.0028) [2024-06-27 16:56:08,850][06674] Fps is (10 sec: 44240.3, 60 sec: 43690.6, 300 sec: 43820.2). Total num frames: 603226112. Throughput: 0: 43661.1. Samples: 506188000. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 16:56:08,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:56:10,533][06909] Updated weights for policy 0, policy_version 36823 (0.0031) [2024-06-27 16:56:13,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 603455488. Throughput: 0: 43817.2. Samples: 506332820. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 16:56:13,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 16:56:14,086][06909] Updated weights for policy 0, policy_version 36833 (0.0040) [2024-06-27 16:56:18,123][06909] Updated weights for policy 0, policy_version 36843 (0.0034) [2024-06-27 16:56:18,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43417.6, 300 sec: 43820.4). Total num frames: 603635712. Throughput: 0: 43816.4. Samples: 506589740. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 16:56:18,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:56:21,801][06909] Updated weights for policy 0, policy_version 36853 (0.0039) [2024-06-27 16:56:23,282][06887] Signal inference workers to stop experience collection... (7300 times) [2024-06-27 16:56:23,316][06909] InferenceWorker_p0-w0: stopping experience collection (7300 times) [2024-06-27 16:56:23,341][06887] Signal inference workers to resume experience collection... (7300 times) [2024-06-27 16:56:23,342][06909] InferenceWorker_p0-w0: resuming experience collection (7300 times) [2024-06-27 16:56:23,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.6, 300 sec: 43820.3). Total num frames: 603897856. Throughput: 0: 43887.5. Samples: 506852060. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 16:56:23,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:56:25,480][06909] Updated weights for policy 0, policy_version 36863 (0.0028) [2024-06-27 16:56:28,850][06674] Fps is (10 sec: 47513.3, 60 sec: 44236.7, 300 sec: 43931.3). Total num frames: 604110848. Throughput: 0: 43852.7. Samples: 506982980. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 16:56:28,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:56:29,280][06909] Updated weights for policy 0, policy_version 36873 (0.0045) [2024-06-27 16:56:32,950][06909] Updated weights for policy 0, policy_version 36883 (0.0030) [2024-06-27 16:56:33,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 604307456. Throughput: 0: 43654.4. Samples: 507244880. Policy #0 lag: (min: 1.0, avg: 11.0, max: 23.0) [2024-06-27 16:56:33,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:56:37,010][06909] Updated weights for policy 0, policy_version 36893 (0.0032) [2024-06-27 16:56:38,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.6, 300 sec: 43820.3). Total num frames: 604553216. Throughput: 0: 43693.7. Samples: 507500720. Policy #0 lag: (min: 1.0, avg: 11.0, max: 23.0) [2024-06-27 16:56:38,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:56:40,551][06909] Updated weights for policy 0, policy_version 36903 (0.0037) [2024-06-27 16:56:43,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.6, 300 sec: 43931.3). Total num frames: 604766208. Throughput: 0: 43899.1. Samples: 507642000. Policy #0 lag: (min: 1.0, avg: 11.0, max: 23.0) [2024-06-27 16:56:43,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 16:56:44,343][06909] Updated weights for policy 0, policy_version 36913 (0.0032) [2024-06-27 16:56:47,842][06909] Updated weights for policy 0, policy_version 36923 (0.0025) [2024-06-27 16:56:48,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43417.7, 300 sec: 43875.8). Total num frames: 604962816. Throughput: 0: 43921.4. Samples: 507909160. Policy #0 lag: (min: 1.0, avg: 11.0, max: 23.0) [2024-06-27 16:56:48,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:56:51,561][06909] Updated weights for policy 0, policy_version 36933 (0.0032) [2024-06-27 16:56:53,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43690.5, 300 sec: 43820.2). Total num frames: 605208576. Throughput: 0: 43872.9. Samples: 508162280. Policy #0 lag: (min: 1.0, avg: 11.0, max: 23.0) [2024-06-27 16:56:53,851][06674] Avg episode reward: [(0, '0.393')] [2024-06-27 16:56:55,800][06909] Updated weights for policy 0, policy_version 36943 (0.0030) [2024-06-27 16:56:58,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43964.4, 300 sec: 43931.3). Total num frames: 605421568. Throughput: 0: 43768.9. Samples: 508302420. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 16:56:58,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 16:56:59,496][06909] Updated weights for policy 0, policy_version 36953 (0.0028) [2024-06-27 16:57:03,058][06909] Updated weights for policy 0, policy_version 36963 (0.0026) [2024-06-27 16:57:03,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 605634560. Throughput: 0: 43719.4. Samples: 508557120. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 16:57:03,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:57:06,857][06909] Updated weights for policy 0, policy_version 36973 (0.0040) [2024-06-27 16:57:08,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 605847552. Throughput: 0: 43722.7. Samples: 508819580. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 16:57:08,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 16:57:10,403][06909] Updated weights for policy 0, policy_version 36983 (0.0027) [2024-06-27 16:57:13,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 606076928. Throughput: 0: 43800.0. Samples: 508953980. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 16:57:13,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 16:57:14,187][06909] Updated weights for policy 0, policy_version 36993 (0.0030) [2024-06-27 16:57:17,829][06909] Updated weights for policy 0, policy_version 37003 (0.0028) [2024-06-27 16:57:18,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.7, 300 sec: 43820.2). Total num frames: 606273536. Throughput: 0: 43881.7. Samples: 509219560. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 16:57:18,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:57:21,443][06909] Updated weights for policy 0, policy_version 37013 (0.0031) [2024-06-27 16:57:23,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43417.6, 300 sec: 43765.0). Total num frames: 606502912. Throughput: 0: 43813.8. Samples: 509472340. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 16:57:23,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:57:25,480][06909] Updated weights for policy 0, policy_version 37023 (0.0043) [2024-06-27 16:57:27,891][06887] Signal inference workers to stop experience collection... (7350 times) [2024-06-27 16:57:27,891][06887] Signal inference workers to resume experience collection... (7350 times) [2024-06-27 16:57:27,917][06909] InferenceWorker_p0-w0: stopping experience collection (7350 times) [2024-06-27 16:57:27,917][06909] InferenceWorker_p0-w0: resuming experience collection (7350 times) [2024-06-27 16:57:28,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 606732288. Throughput: 0: 43779.6. Samples: 509612080. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 16:57:28,850][06674] Avg episode reward: [(0, '0.392')] [2024-06-27 16:57:28,940][06909] Updated weights for policy 0, policy_version 37033 (0.0036) [2024-06-27 16:57:33,376][06909] Updated weights for policy 0, policy_version 37043 (0.0039) [2024-06-27 16:57:33,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 606928896. Throughput: 0: 43651.6. Samples: 509873480. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 16:57:33,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 16:57:36,363][06909] Updated weights for policy 0, policy_version 37053 (0.0039) [2024-06-27 16:57:38,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 607174656. Throughput: 0: 43886.3. Samples: 510137160. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 16:57:38,851][06674] Avg episode reward: [(0, '0.409')] [2024-06-27 16:57:40,702][06909] Updated weights for policy 0, policy_version 37063 (0.0029) [2024-06-27 16:57:43,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 607387648. Throughput: 0: 43670.2. Samples: 510267580. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-27 16:57:43,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:57:44,006][06909] Updated weights for policy 0, policy_version 37073 (0.0037) [2024-06-27 16:57:48,217][06909] Updated weights for policy 0, policy_version 37083 (0.0029) [2024-06-27 16:57:48,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.6, 300 sec: 43820.2). Total num frames: 607600640. Throughput: 0: 43839.1. Samples: 510529880. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-27 16:57:48,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:57:48,855][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000037085_607600640.pth... [2024-06-27 16:57:48,917][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000036443_597082112.pth [2024-06-27 16:57:51,387][06909] Updated weights for policy 0, policy_version 37093 (0.0041) [2024-06-27 16:57:53,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43417.7, 300 sec: 43764.8). Total num frames: 607813632. Throughput: 0: 43927.7. Samples: 510796320. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-27 16:57:53,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:57:55,537][06909] Updated weights for policy 0, policy_version 37103 (0.0030) [2024-06-27 16:57:58,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 608043008. Throughput: 0: 43812.1. Samples: 510925520. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-27 16:57:58,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:57:58,962][06909] Updated weights for policy 0, policy_version 37113 (0.0030) [2024-06-27 16:58:02,974][06909] Updated weights for policy 0, policy_version 37123 (0.0031) [2024-06-27 16:58:03,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 608256000. Throughput: 0: 43775.6. Samples: 511189460. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-27 16:58:03,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:58:06,371][06909] Updated weights for policy 0, policy_version 37133 (0.0031) [2024-06-27 16:58:08,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 608485376. Throughput: 0: 43931.7. Samples: 511449260. Policy #0 lag: (min: 0.0, avg: 11.6, max: 24.0) [2024-06-27 16:58:08,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:58:10,539][06909] Updated weights for policy 0, policy_version 37143 (0.0032) [2024-06-27 16:58:13,776][06909] Updated weights for policy 0, policy_version 37153 (0.0030) [2024-06-27 16:58:13,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 608714752. Throughput: 0: 43728.0. Samples: 511579840. Policy #0 lag: (min: 0.0, avg: 11.6, max: 24.0) [2024-06-27 16:58:13,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 16:58:18,013][06909] Updated weights for policy 0, policy_version 37163 (0.0041) [2024-06-27 16:58:18,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43963.7, 300 sec: 43820.3). Total num frames: 608911360. Throughput: 0: 43773.3. Samples: 511843280. Policy #0 lag: (min: 0.0, avg: 11.6, max: 24.0) [2024-06-27 16:58:18,853][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:58:21,090][06909] Updated weights for policy 0, policy_version 37173 (0.0028) [2024-06-27 16:58:23,850][06674] Fps is (10 sec: 42597.7, 60 sec: 43963.7, 300 sec: 43820.2). Total num frames: 609140736. Throughput: 0: 43813.7. Samples: 512108780. Policy #0 lag: (min: 0.0, avg: 11.6, max: 24.0) [2024-06-27 16:58:23,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:58:25,247][06909] Updated weights for policy 0, policy_version 37183 (0.0021) [2024-06-27 16:58:28,628][06909] Updated weights for policy 0, policy_version 37193 (0.0036) [2024-06-27 16:58:28,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 609370112. Throughput: 0: 43934.7. Samples: 512244640. Policy #0 lag: (min: 0.0, avg: 11.6, max: 24.0) [2024-06-27 16:58:28,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:58:33,050][06909] Updated weights for policy 0, policy_version 37203 (0.0036) [2024-06-27 16:58:33,850][06674] Fps is (10 sec: 44237.7, 60 sec: 44236.8, 300 sec: 43875.8). Total num frames: 609583104. Throughput: 0: 43847.3. Samples: 512503000. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-27 16:58:33,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 16:58:36,254][06909] Updated weights for policy 0, policy_version 37213 (0.0035) [2024-06-27 16:58:38,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 609812480. Throughput: 0: 43728.8. Samples: 512764120. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-27 16:58:38,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:58:40,221][06909] Updated weights for policy 0, policy_version 37223 (0.0026) [2024-06-27 16:58:43,603][06909] Updated weights for policy 0, policy_version 37233 (0.0030) [2024-06-27 16:58:43,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 610025472. Throughput: 0: 43863.5. Samples: 512899380. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-27 16:58:43,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:58:47,611][06909] Updated weights for policy 0, policy_version 37243 (0.0034) [2024-06-27 16:58:48,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 610222080. Throughput: 0: 43707.5. Samples: 513156300. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-27 16:58:48,850][06674] Avg episode reward: [(0, '0.409')] [2024-06-27 16:58:50,240][06887] Signal inference workers to stop experience collection... (7400 times) [2024-06-27 16:58:50,240][06887] Signal inference workers to resume experience collection... (7400 times) [2024-06-27 16:58:50,280][06909] InferenceWorker_p0-w0: stopping experience collection (7400 times) [2024-06-27 16:58:50,281][06909] InferenceWorker_p0-w0: resuming experience collection (7400 times) [2024-06-27 16:58:51,082][06909] Updated weights for policy 0, policy_version 37253 (0.0031) [2024-06-27 16:58:53,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.7, 300 sec: 43875.8). Total num frames: 610467840. Throughput: 0: 43743.9. Samples: 513417740. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 16:58:53,851][06674] Avg episode reward: [(0, '0.407')] [2024-06-27 16:58:54,930][06909] Updated weights for policy 0, policy_version 37263 (0.0027) [2024-06-27 16:58:58,387][06909] Updated weights for policy 0, policy_version 37273 (0.0029) [2024-06-27 16:58:58,856][06674] Fps is (10 sec: 45847.6, 60 sec: 43959.3, 300 sec: 43763.8). Total num frames: 610680832. Throughput: 0: 43925.6. Samples: 513556760. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 16:58:58,856][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:59:02,148][06909] Updated weights for policy 0, policy_version 37283 (0.0039) [2024-06-27 16:59:03,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.8, 300 sec: 43820.3). Total num frames: 610893824. Throughput: 0: 43818.3. Samples: 513815100. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 16:59:03,850][06674] Avg episode reward: [(0, '0.394')] [2024-06-27 16:59:06,017][06909] Updated weights for policy 0, policy_version 37293 (0.0026) [2024-06-27 16:59:08,850][06674] Fps is (10 sec: 45903.2, 60 sec: 44236.8, 300 sec: 43875.8). Total num frames: 611139584. Throughput: 0: 43929.5. Samples: 514085600. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 16:59:08,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:59:09,978][06909] Updated weights for policy 0, policy_version 37303 (0.0033) [2024-06-27 16:59:13,374][06909] Updated weights for policy 0, policy_version 37313 (0.0037) [2024-06-27 16:59:13,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43690.6, 300 sec: 43820.2). Total num frames: 611336192. Throughput: 0: 43865.3. Samples: 514218580. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 16:59:13,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 16:59:17,347][06909] Updated weights for policy 0, policy_version 37323 (0.0033) [2024-06-27 16:59:18,850][06674] Fps is (10 sec: 42597.2, 60 sec: 44236.7, 300 sec: 43820.2). Total num frames: 611565568. Throughput: 0: 44080.6. Samples: 514486640. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-27 16:59:18,851][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:59:20,861][06909] Updated weights for policy 0, policy_version 37333 (0.0025) [2024-06-27 16:59:23,850][06674] Fps is (10 sec: 47513.3, 60 sec: 44509.9, 300 sec: 43986.9). Total num frames: 611811328. Throughput: 0: 44040.3. Samples: 514745940. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-27 16:59:23,851][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 16:59:24,630][06909] Updated weights for policy 0, policy_version 37343 (0.0032) [2024-06-27 16:59:28,502][06909] Updated weights for policy 0, policy_version 37353 (0.0039) [2024-06-27 16:59:28,850][06674] Fps is (10 sec: 42599.5, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 611991552. Throughput: 0: 43998.2. Samples: 514879300. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-27 16:59:28,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 16:59:32,234][06909] Updated weights for policy 0, policy_version 37363 (0.0033) [2024-06-27 16:59:33,850][06674] Fps is (10 sec: 40960.7, 60 sec: 43963.7, 300 sec: 43820.6). Total num frames: 612220928. Throughput: 0: 44114.8. Samples: 515141460. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-27 16:59:33,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:59:36,055][06909] Updated weights for policy 0, policy_version 37373 (0.0023) [2024-06-27 16:59:38,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 612450304. Throughput: 0: 44043.2. Samples: 515399680. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-27 16:59:38,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 16:59:39,532][06909] Updated weights for policy 0, policy_version 37383 (0.0044) [2024-06-27 16:59:43,551][06909] Updated weights for policy 0, policy_version 37393 (0.0032) [2024-06-27 16:59:43,852][06674] Fps is (10 sec: 44227.5, 60 sec: 43962.2, 300 sec: 43820.0). Total num frames: 612663296. Throughput: 0: 44005.3. Samples: 515536820. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2024-06-27 16:59:43,852][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 16:59:47,118][06909] Updated weights for policy 0, policy_version 37403 (0.0034) [2024-06-27 16:59:48,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44236.8, 300 sec: 43875.8). Total num frames: 612876288. Throughput: 0: 44089.8. Samples: 515799140. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2024-06-27 16:59:48,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 16:59:48,863][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000037407_612876288.pth... [2024-06-27 16:59:48,906][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000036765_602357760.pth [2024-06-27 16:59:51,147][06909] Updated weights for policy 0, policy_version 37413 (0.0030) [2024-06-27 16:59:53,850][06674] Fps is (10 sec: 45884.5, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 613122048. Throughput: 0: 43822.2. Samples: 516057600. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2024-06-27 16:59:53,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 16:59:54,701][06909] Updated weights for policy 0, policy_version 37423 (0.0040) [2024-06-27 16:59:58,647][06909] Updated weights for policy 0, policy_version 37433 (0.0039) [2024-06-27 16:59:58,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43695.1, 300 sec: 43764.7). Total num frames: 613302272. Throughput: 0: 43919.2. Samples: 516194940. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2024-06-27 16:59:58,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:00:02,081][06909] Updated weights for policy 0, policy_version 37443 (0.0035) [2024-06-27 17:00:03,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44236.8, 300 sec: 43875.8). Total num frames: 613548032. Throughput: 0: 43783.8. Samples: 516456900. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2024-06-27 17:00:03,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:00:05,974][06909] Updated weights for policy 0, policy_version 37453 (0.0036) [2024-06-27 17:00:08,850][06674] Fps is (10 sec: 45874.5, 60 sec: 43690.5, 300 sec: 43875.8). Total num frames: 613761024. Throughput: 0: 43790.7. Samples: 516716520. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 17:00:08,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 17:00:09,390][06909] Updated weights for policy 0, policy_version 37463 (0.0033) [2024-06-27 17:00:13,643][06909] Updated weights for policy 0, policy_version 37473 (0.0031) [2024-06-27 17:00:13,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 613974016. Throughput: 0: 43683.4. Samples: 516845060. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 17:00:13,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 17:00:16,826][06909] Updated weights for policy 0, policy_version 37483 (0.0036) [2024-06-27 17:00:18,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43690.8, 300 sec: 43764.7). Total num frames: 614187008. Throughput: 0: 43992.8. Samples: 517121140. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 17:00:18,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:00:20,830][06909] Updated weights for policy 0, policy_version 37493 (0.0032) [2024-06-27 17:00:23,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43417.7, 300 sec: 43931.3). Total num frames: 614416384. Throughput: 0: 43963.6. Samples: 517378040. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 17:00:23,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:00:24,231][06909] Updated weights for policy 0, policy_version 37503 (0.0037) [2024-06-27 17:00:28,213][06909] Updated weights for policy 0, policy_version 37513 (0.0050) [2024-06-27 17:00:28,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.6, 300 sec: 43820.3). Total num frames: 614612992. Throughput: 0: 43911.8. Samples: 517512760. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 17:00:28,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:00:29,322][06887] Signal inference workers to stop experience collection... (7450 times) [2024-06-27 17:00:29,323][06887] Signal inference workers to resume experience collection... (7450 times) [2024-06-27 17:00:29,359][06909] InferenceWorker_p0-w0: stopping experience collection (7450 times) [2024-06-27 17:00:29,359][06909] InferenceWorker_p0-w0: resuming experience collection (7450 times) [2024-06-27 17:00:31,919][06909] Updated weights for policy 0, policy_version 37523 (0.0030) [2024-06-27 17:00:33,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.6, 300 sec: 43875.8). Total num frames: 614858752. Throughput: 0: 43997.3. Samples: 517779020. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2024-06-27 17:00:33,851][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:00:35,879][06909] Updated weights for policy 0, policy_version 37533 (0.0028) [2024-06-27 17:00:38,850][06674] Fps is (10 sec: 47513.4, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 615088128. Throughput: 0: 44005.8. Samples: 518037860. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2024-06-27 17:00:38,853][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:00:39,161][06909] Updated weights for policy 0, policy_version 37543 (0.0035) [2024-06-27 17:00:43,137][06909] Updated weights for policy 0, policy_version 37553 (0.0025) [2024-06-27 17:00:43,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43419.0, 300 sec: 43764.7). Total num frames: 615268352. Throughput: 0: 44075.9. Samples: 518178360. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2024-06-27 17:00:43,850][06674] Avg episode reward: [(0, '0.398')] [2024-06-27 17:00:46,443][06909] Updated weights for policy 0, policy_version 37563 (0.0033) [2024-06-27 17:00:48,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.7, 300 sec: 43820.3). Total num frames: 615514112. Throughput: 0: 43958.2. Samples: 518435020. Policy #0 lag: (min: 0.0, avg: 8.6, max: 19.0) [2024-06-27 17:00:48,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:00:50,498][06909] Updated weights for policy 0, policy_version 37573 (0.0038) [2024-06-27 17:00:53,850][06674] Fps is (10 sec: 47513.9, 60 sec: 43690.7, 300 sec: 43931.5). Total num frames: 615743488. Throughput: 0: 43946.3. Samples: 518694100. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-27 17:00:53,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:00:54,146][06909] Updated weights for policy 0, policy_version 37583 (0.0021) [2024-06-27 17:00:58,276][06909] Updated weights for policy 0, policy_version 37593 (0.0031) [2024-06-27 17:00:58,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 615923712. Throughput: 0: 44164.1. Samples: 518832440. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-27 17:00:58,850][06674] Avg episode reward: [(0, '0.409')] [2024-06-27 17:01:01,626][06909] Updated weights for policy 0, policy_version 37603 (0.0036) [2024-06-27 17:01:03,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 616169472. Throughput: 0: 43732.0. Samples: 519089080. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-27 17:01:03,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:01:05,762][06909] Updated weights for policy 0, policy_version 37613 (0.0029) [2024-06-27 17:01:08,850][06674] Fps is (10 sec: 47513.8, 60 sec: 43963.9, 300 sec: 43875.8). Total num frames: 616398848. Throughput: 0: 43851.6. Samples: 519351360. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-27 17:01:08,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 17:01:08,858][06909] Updated weights for policy 0, policy_version 37623 (0.0043) [2024-06-27 17:01:13,084][06909] Updated weights for policy 0, policy_version 37633 (0.0027) [2024-06-27 17:01:13,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.8, 300 sec: 43931.3). Total num frames: 616595456. Throughput: 0: 43906.7. Samples: 519488560. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-27 17:01:13,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 17:01:16,650][06909] Updated weights for policy 0, policy_version 37643 (0.0031) [2024-06-27 17:01:18,850][06674] Fps is (10 sec: 42597.6, 60 sec: 43963.7, 300 sec: 43820.2). Total num frames: 616824832. Throughput: 0: 43797.3. Samples: 519749900. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-27 17:01:18,851][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 17:01:20,933][06909] Updated weights for policy 0, policy_version 37653 (0.0032) [2024-06-27 17:01:23,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 617054208. Throughput: 0: 43824.9. Samples: 520009980. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-27 17:01:23,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 17:01:24,025][06909] Updated weights for policy 0, policy_version 37663 (0.0029) [2024-06-27 17:01:28,509][06909] Updated weights for policy 0, policy_version 37673 (0.0041) [2024-06-27 17:01:28,852][06674] Fps is (10 sec: 40951.9, 60 sec: 43689.2, 300 sec: 43819.9). Total num frames: 617234432. Throughput: 0: 43676.3. Samples: 520143880. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-27 17:01:28,852][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:01:31,291][06909] Updated weights for policy 0, policy_version 37683 (0.0019) [2024-06-27 17:01:33,852][06674] Fps is (10 sec: 42589.9, 60 sec: 43689.2, 300 sec: 43820.0). Total num frames: 617480192. Throughput: 0: 43734.4. Samples: 520403160. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-27 17:01:33,853][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 17:01:35,819][06909] Updated weights for policy 0, policy_version 37693 (0.0033) [2024-06-27 17:01:38,850][06674] Fps is (10 sec: 47523.7, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 617709568. Throughput: 0: 43741.4. Samples: 520662460. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-27 17:01:38,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:01:38,925][06909] Updated weights for policy 0, policy_version 37703 (0.0031) [2024-06-27 17:01:43,075][06909] Updated weights for policy 0, policy_version 37713 (0.0026) [2024-06-27 17:01:43,850][06674] Fps is (10 sec: 42607.3, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 617906176. Throughput: 0: 43696.5. Samples: 520798780. Policy #0 lag: (min: 0.0, avg: 10.9, max: 24.0) [2024-06-27 17:01:43,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:01:46,722][06909] Updated weights for policy 0, policy_version 37723 (0.0028) [2024-06-27 17:01:48,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.6, 300 sec: 43820.3). Total num frames: 618135552. Throughput: 0: 43749.8. Samples: 521057820. Policy #0 lag: (min: 0.0, avg: 10.9, max: 24.0) [2024-06-27 17:01:48,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 17:01:48,940][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000037729_618151936.pth... [2024-06-27 17:01:48,990][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000037085_607600640.pth [2024-06-27 17:01:50,725][06909] Updated weights for policy 0, policy_version 37733 (0.0028) [2024-06-27 17:01:53,014][06887] Signal inference workers to stop experience collection... (7500 times) [2024-06-27 17:01:53,014][06887] Signal inference workers to resume experience collection... (7500 times) [2024-06-27 17:01:53,028][06909] InferenceWorker_p0-w0: stopping experience collection (7500 times) [2024-06-27 17:01:53,028][06909] InferenceWorker_p0-w0: resuming experience collection (7500 times) [2024-06-27 17:01:53,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 618364928. Throughput: 0: 43662.6. Samples: 521316180. Policy #0 lag: (min: 0.0, avg: 10.9, max: 24.0) [2024-06-27 17:01:53,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 17:01:54,081][06909] Updated weights for policy 0, policy_version 37743 (0.0044) [2024-06-27 17:01:58,151][06909] Updated weights for policy 0, policy_version 37753 (0.0032) [2024-06-27 17:01:58,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.7, 300 sec: 43875.8). Total num frames: 618577920. Throughput: 0: 43764.3. Samples: 521457960. Policy #0 lag: (min: 0.0, avg: 10.9, max: 24.0) [2024-06-27 17:01:58,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:02:01,319][06909] Updated weights for policy 0, policy_version 37763 (0.0034) [2024-06-27 17:02:03,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43963.9, 300 sec: 43931.4). Total num frames: 618807296. Throughput: 0: 43781.6. Samples: 521720060. Policy #0 lag: (min: 0.0, avg: 10.9, max: 24.0) [2024-06-27 17:02:03,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:02:05,704][06909] Updated weights for policy 0, policy_version 37773 (0.0038) [2024-06-27 17:02:08,575][06909] Updated weights for policy 0, policy_version 37783 (0.0033) [2024-06-27 17:02:08,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.6, 300 sec: 43931.3). Total num frames: 619036672. Throughput: 0: 43830.6. Samples: 521982360. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2024-06-27 17:02:08,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:02:13,318][06909] Updated weights for policy 0, policy_version 37793 (0.0023) [2024-06-27 17:02:13,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 619216896. Throughput: 0: 43964.7. Samples: 522122200. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2024-06-27 17:02:13,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:02:16,013][06909] Updated weights for policy 0, policy_version 37803 (0.0032) [2024-06-27 17:02:18,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 619462656. Throughput: 0: 44024.7. Samples: 522384180. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2024-06-27 17:02:18,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:02:20,708][06909] Updated weights for policy 0, policy_version 37813 (0.0036) [2024-06-27 17:02:23,184][06909] Updated weights for policy 0, policy_version 37823 (0.0030) [2024-06-27 17:02:23,850][06674] Fps is (10 sec: 49151.6, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 619708416. Throughput: 0: 43906.1. Samples: 522638240. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2024-06-27 17:02:23,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:02:28,083][06909] Updated weights for policy 0, policy_version 37833 (0.0037) [2024-06-27 17:02:28,853][06674] Fps is (10 sec: 44221.0, 60 sec: 44508.8, 300 sec: 43986.3). Total num frames: 619905024. Throughput: 0: 43962.3. Samples: 522777240. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2024-06-27 17:02:28,854][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:02:30,870][06909] Updated weights for policy 0, policy_version 37843 (0.0040) [2024-06-27 17:02:33,850][06674] Fps is (10 sec: 39321.9, 60 sec: 43692.2, 300 sec: 43820.3). Total num frames: 620101632. Throughput: 0: 44033.8. Samples: 523039340. Policy #0 lag: (min: 0.0, avg: 11.6, max: 24.0) [2024-06-27 17:02:33,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:02:35,499][06909] Updated weights for policy 0, policy_version 37853 (0.0031) [2024-06-27 17:02:38,635][06909] Updated weights for policy 0, policy_version 37863 (0.0031) [2024-06-27 17:02:38,850][06674] Fps is (10 sec: 44252.6, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 620347392. Throughput: 0: 44028.4. Samples: 523297460. Policy #0 lag: (min: 0.0, avg: 11.6, max: 24.0) [2024-06-27 17:02:38,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:02:43,196][06909] Updated weights for policy 0, policy_version 37873 (0.0030) [2024-06-27 17:02:43,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 620544000. Throughput: 0: 43944.5. Samples: 523435460. Policy #0 lag: (min: 0.0, avg: 11.6, max: 24.0) [2024-06-27 17:02:43,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 17:02:45,932][06909] Updated weights for policy 0, policy_version 37883 (0.0046) [2024-06-27 17:02:48,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 620756992. Throughput: 0: 43904.3. Samples: 523695760. Policy #0 lag: (min: 0.0, avg: 11.6, max: 24.0) [2024-06-27 17:02:48,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:02:50,481][06909] Updated weights for policy 0, policy_version 37893 (0.0032) [2024-06-27 17:02:53,188][06909] Updated weights for policy 0, policy_version 37903 (0.0028) [2024-06-27 17:02:53,850][06674] Fps is (10 sec: 47513.3, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 621019136. Throughput: 0: 43923.6. Samples: 523958920. Policy #0 lag: (min: 0.0, avg: 11.6, max: 24.0) [2024-06-27 17:02:53,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:02:57,841][06909] Updated weights for policy 0, policy_version 37913 (0.0023) [2024-06-27 17:02:58,850][06674] Fps is (10 sec: 47513.1, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 621232128. Throughput: 0: 43966.1. Samples: 524100680. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-27 17:02:58,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:03:00,490][06909] Updated weights for policy 0, policy_version 37923 (0.0031) [2024-06-27 17:03:03,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 621428736. Throughput: 0: 43903.6. Samples: 524359840. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-27 17:03:03,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:03:05,731][06909] Updated weights for policy 0, policy_version 37933 (0.0030) [2024-06-27 17:03:08,123][06909] Updated weights for policy 0, policy_version 37943 (0.0027) [2024-06-27 17:03:08,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 621674496. Throughput: 0: 43833.8. Samples: 524610760. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-27 17:03:08,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 17:03:13,149][06909] Updated weights for policy 0, policy_version 37953 (0.0033) [2024-06-27 17:03:13,535][06887] Signal inference workers to stop experience collection... (7550 times) [2024-06-27 17:03:13,537][06887] Signal inference workers to resume experience collection... (7550 times) [2024-06-27 17:03:13,559][06909] InferenceWorker_p0-w0: stopping experience collection (7550 times) [2024-06-27 17:03:13,559][06909] InferenceWorker_p0-w0: resuming experience collection (7550 times) [2024-06-27 17:03:13,850][06674] Fps is (10 sec: 45874.5, 60 sec: 44509.8, 300 sec: 43986.9). Total num frames: 621887488. Throughput: 0: 43991.8. Samples: 524756720. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-27 17:03:13,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 17:03:15,469][06909] Updated weights for policy 0, policy_version 37963 (0.0033) [2024-06-27 17:03:18,850][06674] Fps is (10 sec: 39321.3, 60 sec: 43417.5, 300 sec: 43820.3). Total num frames: 622067712. Throughput: 0: 43870.1. Samples: 525013500. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-27 17:03:18,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:03:20,555][06909] Updated weights for policy 0, policy_version 37973 (0.0030) [2024-06-27 17:03:23,059][06909] Updated weights for policy 0, policy_version 37983 (0.0028) [2024-06-27 17:03:23,850][06674] Fps is (10 sec: 45875.8, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 622346240. Throughput: 0: 43835.1. Samples: 525270040. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-27 17:03:23,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:03:27,882][06909] Updated weights for policy 0, policy_version 37993 (0.0038) [2024-06-27 17:03:28,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43693.2, 300 sec: 43875.8). Total num frames: 622526464. Throughput: 0: 43972.8. Samples: 525414240. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-27 17:03:28,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:03:30,322][06909] Updated weights for policy 0, policy_version 38003 (0.0032) [2024-06-27 17:03:33,850][06674] Fps is (10 sec: 37683.3, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 622723072. Throughput: 0: 43981.3. Samples: 525674920. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-27 17:03:33,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:03:35,424][06909] Updated weights for policy 0, policy_version 38013 (0.0027) [2024-06-27 17:03:37,733][06909] Updated weights for policy 0, policy_version 38023 (0.0024) [2024-06-27 17:03:38,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 623001600. Throughput: 0: 43820.9. Samples: 525930860. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-27 17:03:38,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:03:42,879][06909] Updated weights for policy 0, policy_version 38033 (0.0027) [2024-06-27 17:03:43,850][06674] Fps is (10 sec: 49150.2, 60 sec: 44509.6, 300 sec: 44042.4). Total num frames: 623214592. Throughput: 0: 43953.1. Samples: 526078580. Policy #0 lag: (min: 1.0, avg: 10.7, max: 24.0) [2024-06-27 17:03:43,851][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 17:03:45,167][06909] Updated weights for policy 0, policy_version 38043 (0.0033) [2024-06-27 17:03:48,852][06674] Fps is (10 sec: 37675.7, 60 sec: 43689.1, 300 sec: 43764.4). Total num frames: 623378432. Throughput: 0: 43817.1. Samples: 526331700. Policy #0 lag: (min: 1.0, avg: 10.7, max: 24.0) [2024-06-27 17:03:48,852][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:03:48,869][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000038048_623378432.pth... [2024-06-27 17:03:48,945][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000037407_612876288.pth [2024-06-27 17:03:50,195][06909] Updated weights for policy 0, policy_version 38053 (0.0029) [2024-06-27 17:03:52,806][06909] Updated weights for policy 0, policy_version 38063 (0.0039) [2024-06-27 17:03:53,850][06674] Fps is (10 sec: 44238.3, 60 sec: 43963.8, 300 sec: 43987.8). Total num frames: 623656960. Throughput: 0: 43817.4. Samples: 526582540. Policy #0 lag: (min: 1.0, avg: 10.7, max: 24.0) [2024-06-27 17:03:53,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:03:57,863][06909] Updated weights for policy 0, policy_version 38073 (0.0036) [2024-06-27 17:03:58,850][06674] Fps is (10 sec: 47522.9, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 623853568. Throughput: 0: 43789.8. Samples: 526727260. Policy #0 lag: (min: 1.0, avg: 10.7, max: 24.0) [2024-06-27 17:03:58,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:04:00,466][06909] Updated weights for policy 0, policy_version 38083 (0.0049) [2024-06-27 17:04:03,850][06674] Fps is (10 sec: 36044.9, 60 sec: 43144.5, 300 sec: 43653.6). Total num frames: 624017408. Throughput: 0: 43781.9. Samples: 526983680. Policy #0 lag: (min: 1.0, avg: 10.7, max: 24.0) [2024-06-27 17:04:03,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:04:05,187][06909] Updated weights for policy 0, policy_version 38093 (0.0027) [2024-06-27 17:04:07,734][06909] Updated weights for policy 0, policy_version 38103 (0.0025) [2024-06-27 17:04:08,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 624312320. Throughput: 0: 43636.4. Samples: 527233680. Policy #0 lag: (min: 0.0, avg: 13.6, max: 21.0) [2024-06-27 17:04:08,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:04:12,786][06909] Updated weights for policy 0, policy_version 38113 (0.0031) [2024-06-27 17:04:13,850][06674] Fps is (10 sec: 47513.2, 60 sec: 43417.7, 300 sec: 43820.3). Total num frames: 624492544. Throughput: 0: 43832.5. Samples: 527386700. Policy #0 lag: (min: 0.0, avg: 13.6, max: 21.0) [2024-06-27 17:04:13,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:04:15,231][06909] Updated weights for policy 0, policy_version 38123 (0.0027) [2024-06-27 17:04:18,850][06674] Fps is (10 sec: 36044.7, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 624672768. Throughput: 0: 43715.0. Samples: 527642100. Policy #0 lag: (min: 0.0, avg: 13.6, max: 21.0) [2024-06-27 17:04:18,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:04:20,042][06909] Updated weights for policy 0, policy_version 38133 (0.0021) [2024-06-27 17:04:22,675][06909] Updated weights for policy 0, policy_version 38143 (0.0027) [2024-06-27 17:04:23,850][06674] Fps is (10 sec: 47513.2, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 624967680. Throughput: 0: 43775.1. Samples: 527900740. Policy #0 lag: (min: 0.0, avg: 13.6, max: 21.0) [2024-06-27 17:04:23,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:04:27,460][06909] Updated weights for policy 0, policy_version 38153 (0.0033) [2024-06-27 17:04:27,703][06887] Signal inference workers to stop experience collection... (7600 times) [2024-06-27 17:04:27,704][06887] Signal inference workers to resume experience collection... (7600 times) [2024-06-27 17:04:27,748][06909] InferenceWorker_p0-w0: stopping experience collection (7600 times) [2024-06-27 17:04:27,748][06909] InferenceWorker_p0-w0: resuming experience collection (7600 times) [2024-06-27 17:04:28,850][06674] Fps is (10 sec: 50790.1, 60 sec: 44236.7, 300 sec: 43931.3). Total num frames: 625180672. Throughput: 0: 43831.3. Samples: 528050980. Policy #0 lag: (min: 0.0, avg: 13.6, max: 21.0) [2024-06-27 17:04:28,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:04:30,350][06909] Updated weights for policy 0, policy_version 38163 (0.0037) [2024-06-27 17:04:33,850][06674] Fps is (10 sec: 39320.8, 60 sec: 43963.5, 300 sec: 43764.7). Total num frames: 625360896. Throughput: 0: 43878.6. Samples: 528306160. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 17:04:33,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:04:35,089][06909] Updated weights for policy 0, policy_version 38173 (0.0026) [2024-06-27 17:04:37,822][06909] Updated weights for policy 0, policy_version 38183 (0.0032) [2024-06-27 17:04:38,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.6, 300 sec: 43931.6). Total num frames: 625623040. Throughput: 0: 43920.8. Samples: 528558980. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 17:04:38,852][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:04:42,510][06909] Updated weights for policy 0, policy_version 38193 (0.0031) [2024-06-27 17:04:43,850][06674] Fps is (10 sec: 45876.9, 60 sec: 43417.9, 300 sec: 43875.8). Total num frames: 625819648. Throughput: 0: 44053.9. Samples: 528709680. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 17:04:43,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 17:04:45,426][06909] Updated weights for policy 0, policy_version 38203 (0.0023) [2024-06-27 17:04:48,850][06674] Fps is (10 sec: 37683.5, 60 sec: 43692.1, 300 sec: 43653.6). Total num frames: 625999872. Throughput: 0: 43952.4. Samples: 528961540. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 17:04:48,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 17:04:49,930][06909] Updated weights for policy 0, policy_version 38213 (0.0034) [2024-06-27 17:04:52,629][06909] Updated weights for policy 0, policy_version 38223 (0.0026) [2024-06-27 17:04:53,850][06674] Fps is (10 sec: 47512.7, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 626294784. Throughput: 0: 43972.8. Samples: 529212460. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 17:04:53,851][06674] Avg episode reward: [(0, '0.392')] [2024-06-27 17:04:57,388][06909] Updated weights for policy 0, policy_version 38233 (0.0045) [2024-06-27 17:04:58,850][06674] Fps is (10 sec: 49151.3, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 626491392. Throughput: 0: 43944.3. Samples: 529364200. Policy #0 lag: (min: 0.0, avg: 8.5, max: 23.0) [2024-06-27 17:04:58,856][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:05:00,070][06909] Updated weights for policy 0, policy_version 38243 (0.0026) [2024-06-27 17:05:03,850][06674] Fps is (10 sec: 34406.6, 60 sec: 43690.6, 300 sec: 43653.7). Total num frames: 626638848. Throughput: 0: 43919.1. Samples: 529618460. Policy #0 lag: (min: 0.0, avg: 8.5, max: 23.0) [2024-06-27 17:05:03,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:05:04,795][06909] Updated weights for policy 0, policy_version 38253 (0.0023) [2024-06-27 17:05:07,414][06909] Updated weights for policy 0, policy_version 38263 (0.0021) [2024-06-27 17:05:08,850][06674] Fps is (10 sec: 44237.8, 60 sec: 43690.7, 300 sec: 43931.4). Total num frames: 626933760. Throughput: 0: 43876.6. Samples: 529875180. Policy #0 lag: (min: 0.0, avg: 8.5, max: 23.0) [2024-06-27 17:05:08,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:05:12,394][06909] Updated weights for policy 0, policy_version 38273 (0.0030) [2024-06-27 17:05:13,850][06674] Fps is (10 sec: 52429.0, 60 sec: 44509.9, 300 sec: 43986.9). Total num frames: 627163136. Throughput: 0: 43865.4. Samples: 530024920. Policy #0 lag: (min: 0.0, avg: 8.5, max: 23.0) [2024-06-27 17:05:13,850][06674] Avg episode reward: [(0, '0.409')] [2024-06-27 17:05:15,279][06909] Updated weights for policy 0, policy_version 38283 (0.0023) [2024-06-27 17:05:18,850][06674] Fps is (10 sec: 39321.8, 60 sec: 44236.9, 300 sec: 43764.7). Total num frames: 627326976. Throughput: 0: 43913.3. Samples: 530282240. Policy #0 lag: (min: 0.0, avg: 8.5, max: 23.0) [2024-06-27 17:05:18,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:05:19,654][06909] Updated weights for policy 0, policy_version 38293 (0.0034) [2024-06-27 17:05:22,659][06909] Updated weights for policy 0, policy_version 38303 (0.0039) [2024-06-27 17:05:23,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43690.8, 300 sec: 43986.9). Total num frames: 627589120. Throughput: 0: 43922.0. Samples: 530535460. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-27 17:05:23,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 17:05:27,134][06909] Updated weights for policy 0, policy_version 38313 (0.0040) [2024-06-27 17:05:28,850][06674] Fps is (10 sec: 47512.7, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 627802112. Throughput: 0: 43789.2. Samples: 530680200. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-27 17:05:28,850][06674] Avg episode reward: [(0, '0.408')] [2024-06-27 17:05:29,978][06909] Updated weights for policy 0, policy_version 38323 (0.0027) [2024-06-27 17:05:33,850][06674] Fps is (10 sec: 37683.2, 60 sec: 43417.9, 300 sec: 43653.7). Total num frames: 627965952. Throughput: 0: 43879.7. Samples: 530936120. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-27 17:05:33,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:05:34,421][06887] Signal inference workers to stop experience collection... (7650 times) [2024-06-27 17:05:34,422][06887] Signal inference workers to resume experience collection... (7650 times) [2024-06-27 17:05:34,458][06909] InferenceWorker_p0-w0: stopping experience collection (7650 times) [2024-06-27 17:05:34,459][06909] InferenceWorker_p0-w0: resuming experience collection (7650 times) [2024-06-27 17:05:34,560][06909] Updated weights for policy 0, policy_version 38333 (0.0023) [2024-06-27 17:05:37,297][06909] Updated weights for policy 0, policy_version 38343 (0.0037) [2024-06-27 17:05:38,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43690.8, 300 sec: 43986.9). Total num frames: 628244480. Throughput: 0: 44042.4. Samples: 531194360. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-27 17:05:38,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:05:42,165][06909] Updated weights for policy 0, policy_version 38353 (0.0032) [2024-06-27 17:05:43,850][06674] Fps is (10 sec: 49151.7, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 628457472. Throughput: 0: 43819.3. Samples: 531336060. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-27 17:05:43,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:05:44,946][06909] Updated weights for policy 0, policy_version 38363 (0.0032) [2024-06-27 17:05:48,850][06674] Fps is (10 sec: 39321.3, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 628637696. Throughput: 0: 43861.8. Samples: 531592240. Policy #0 lag: (min: 0.0, avg: 12.1, max: 21.0) [2024-06-27 17:05:48,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:05:48,922][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000038370_628654080.pth... [2024-06-27 17:05:48,981][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000037729_618151936.pth [2024-06-27 17:05:49,605][06909] Updated weights for policy 0, policy_version 38373 (0.0038) [2024-06-27 17:05:52,375][06909] Updated weights for policy 0, policy_version 38383 (0.0036) [2024-06-27 17:05:53,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43417.6, 300 sec: 43986.9). Total num frames: 628899840. Throughput: 0: 43999.4. Samples: 531855160. Policy #0 lag: (min: 0.0, avg: 12.1, max: 21.0) [2024-06-27 17:05:53,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:05:56,989][06909] Updated weights for policy 0, policy_version 38393 (0.0031) [2024-06-27 17:05:58,850][06674] Fps is (10 sec: 47513.9, 60 sec: 43690.8, 300 sec: 43875.8). Total num frames: 629112832. Throughput: 0: 43741.8. Samples: 531993300. Policy #0 lag: (min: 0.0, avg: 12.1, max: 21.0) [2024-06-27 17:05:58,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:06:00,007][06909] Updated weights for policy 0, policy_version 38403 (0.0034) [2024-06-27 17:06:03,850][06674] Fps is (10 sec: 40959.8, 60 sec: 44509.8, 300 sec: 43764.7). Total num frames: 629309440. Throughput: 0: 43797.6. Samples: 532253140. Policy #0 lag: (min: 0.0, avg: 12.1, max: 21.0) [2024-06-27 17:06:03,851][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 17:06:04,426][06909] Updated weights for policy 0, policy_version 38413 (0.0038) [2024-06-27 17:06:07,915][06909] Updated weights for policy 0, policy_version 38423 (0.0030) [2024-06-27 17:06:08,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 629555200. Throughput: 0: 43920.3. Samples: 532511880. Policy #0 lag: (min: 0.0, avg: 12.1, max: 21.0) [2024-06-27 17:06:08,851][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:06:12,093][06909] Updated weights for policy 0, policy_version 38433 (0.0027) [2024-06-27 17:06:13,850][06674] Fps is (10 sec: 47514.2, 60 sec: 43690.7, 300 sec: 43931.4). Total num frames: 629784576. Throughput: 0: 43707.2. Samples: 532647020. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-27 17:06:13,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:06:15,261][06909] Updated weights for policy 0, policy_version 38443 (0.0034) [2024-06-27 17:06:18,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43963.6, 300 sec: 43764.7). Total num frames: 629964800. Throughput: 0: 43854.1. Samples: 532909560. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-27 17:06:18,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:06:19,531][06909] Updated weights for policy 0, policy_version 38453 (0.0022) [2024-06-27 17:06:22,528][06909] Updated weights for policy 0, policy_version 38463 (0.0030) [2024-06-27 17:06:23,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.6, 300 sec: 43987.2). Total num frames: 630210560. Throughput: 0: 43922.2. Samples: 533170860. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-27 17:06:23,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 17:06:26,845][06909] Updated weights for policy 0, policy_version 38473 (0.0037) [2024-06-27 17:06:28,850][06674] Fps is (10 sec: 47513.8, 60 sec: 43963.8, 300 sec: 43931.6). Total num frames: 630439936. Throughput: 0: 43872.8. Samples: 533310340. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-27 17:06:28,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:06:30,066][06909] Updated weights for policy 0, policy_version 38483 (0.0032) [2024-06-27 17:06:33,850][06674] Fps is (10 sec: 40960.0, 60 sec: 44236.8, 300 sec: 43764.7). Total num frames: 630620160. Throughput: 0: 43991.2. Samples: 533571840. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-27 17:06:33,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:06:34,397][06909] Updated weights for policy 0, policy_version 38493 (0.0026) [2024-06-27 17:06:37,886][06909] Updated weights for policy 0, policy_version 38503 (0.0033) [2024-06-27 17:06:38,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43690.5, 300 sec: 43931.3). Total num frames: 630865920. Throughput: 0: 43796.4. Samples: 533826000. Policy #0 lag: (min: 0.0, avg: 11.2, max: 20.0) [2024-06-27 17:06:38,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:06:41,660][06909] Updated weights for policy 0, policy_version 38513 (0.0033) [2024-06-27 17:06:43,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 631078912. Throughput: 0: 43872.0. Samples: 533967540. Policy #0 lag: (min: 0.0, avg: 11.2, max: 20.0) [2024-06-27 17:06:43,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:06:45,065][06909] Updated weights for policy 0, policy_version 38523 (0.0030) [2024-06-27 17:06:46,999][06887] Signal inference workers to stop experience collection... (7700 times) [2024-06-27 17:06:46,999][06887] Signal inference workers to resume experience collection... (7700 times) [2024-06-27 17:06:47,013][06909] InferenceWorker_p0-w0: stopping experience collection (7700 times) [2024-06-27 17:06:47,013][06909] InferenceWorker_p0-w0: resuming experience collection (7700 times) [2024-06-27 17:06:48,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.7, 300 sec: 43820.2). Total num frames: 631291904. Throughput: 0: 43930.6. Samples: 534230020. Policy #0 lag: (min: 0.0, avg: 11.2, max: 20.0) [2024-06-27 17:06:48,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:06:49,155][06909] Updated weights for policy 0, policy_version 38533 (0.0028) [2024-06-27 17:06:52,830][06909] Updated weights for policy 0, policy_version 38543 (0.0029) [2024-06-27 17:06:53,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 631521280. Throughput: 0: 43928.9. Samples: 534488680. Policy #0 lag: (min: 0.0, avg: 11.2, max: 20.0) [2024-06-27 17:06:53,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 17:06:56,564][06909] Updated weights for policy 0, policy_version 38553 (0.0032) [2024-06-27 17:06:58,850][06674] Fps is (10 sec: 45875.9, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 631750656. Throughput: 0: 44020.0. Samples: 534627920. Policy #0 lag: (min: 0.0, avg: 11.2, max: 20.0) [2024-06-27 17:06:58,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:07:00,052][06909] Updated weights for policy 0, policy_version 38563 (0.0039) [2024-06-27 17:07:03,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44236.9, 300 sec: 43820.3). Total num frames: 631963648. Throughput: 0: 44127.2. Samples: 534895280. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-27 17:07:03,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:07:04,130][06909] Updated weights for policy 0, policy_version 38573 (0.0027) [2024-06-27 17:07:07,293][06909] Updated weights for policy 0, policy_version 38583 (0.0025) [2024-06-27 17:07:08,850][06674] Fps is (10 sec: 42597.7, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 632176640. Throughput: 0: 44157.1. Samples: 535157940. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-27 17:07:08,851][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:07:11,495][06909] Updated weights for policy 0, policy_version 38593 (0.0021) [2024-06-27 17:07:13,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 632422400. Throughput: 0: 43964.0. Samples: 535288720. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-27 17:07:13,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:07:15,058][06909] Updated weights for policy 0, policy_version 38603 (0.0037) [2024-06-27 17:07:18,705][06909] Updated weights for policy 0, policy_version 38613 (0.0031) [2024-06-27 17:07:18,850][06674] Fps is (10 sec: 45876.2, 60 sec: 44510.0, 300 sec: 43820.3). Total num frames: 632635392. Throughput: 0: 44157.4. Samples: 535558920. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-27 17:07:18,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:07:22,395][06909] Updated weights for policy 0, policy_version 38623 (0.0028) [2024-06-27 17:07:23,852][06674] Fps is (10 sec: 40951.7, 60 sec: 43689.2, 300 sec: 43820.5). Total num frames: 632832000. Throughput: 0: 44322.6. Samples: 535820600. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-27 17:07:23,852][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:07:25,987][06909] Updated weights for policy 0, policy_version 38633 (0.0022) [2024-06-27 17:07:28,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 633077760. Throughput: 0: 44036.3. Samples: 535949180. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2024-06-27 17:07:28,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:07:29,603][06909] Updated weights for policy 0, policy_version 38643 (0.0045) [2024-06-27 17:07:33,642][06909] Updated weights for policy 0, policy_version 38653 (0.0036) [2024-06-27 17:07:33,850][06674] Fps is (10 sec: 45884.5, 60 sec: 44509.8, 300 sec: 43875.8). Total num frames: 633290752. Throughput: 0: 44131.7. Samples: 536215940. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2024-06-27 17:07:33,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:07:37,198][06909] Updated weights for policy 0, policy_version 38663 (0.0034) [2024-06-27 17:07:38,850][06674] Fps is (10 sec: 40960.7, 60 sec: 43690.8, 300 sec: 43875.8). Total num frames: 633487360. Throughput: 0: 44237.5. Samples: 536479360. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2024-06-27 17:07:38,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:07:41,214][06909] Updated weights for policy 0, policy_version 38673 (0.0042) [2024-06-27 17:07:43,852][06674] Fps is (10 sec: 44228.1, 60 sec: 44235.3, 300 sec: 43986.6). Total num frames: 633733120. Throughput: 0: 43944.3. Samples: 536605500. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2024-06-27 17:07:43,852][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:07:44,669][06909] Updated weights for policy 0, policy_version 38683 (0.0037) [2024-06-27 17:07:48,649][06909] Updated weights for policy 0, policy_version 38693 (0.0030) [2024-06-27 17:07:48,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44510.0, 300 sec: 43875.8). Total num frames: 633962496. Throughput: 0: 44060.5. Samples: 536878000. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2024-06-27 17:07:48,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:07:48,950][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000038695_633978880.pth... [2024-06-27 17:07:49,004][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000038048_623378432.pth [2024-06-27 17:07:52,142][06909] Updated weights for policy 0, policy_version 38703 (0.0031) [2024-06-27 17:07:53,850][06674] Fps is (10 sec: 40968.0, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 634142720. Throughput: 0: 44105.5. Samples: 537142680. Policy #0 lag: (min: 0.0, avg: 12.2, max: 20.0) [2024-06-27 17:07:53,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:07:55,908][06909] Updated weights for policy 0, policy_version 38713 (0.0031) [2024-06-27 17:07:56,108][06887] Signal inference workers to stop experience collection... (7750 times) [2024-06-27 17:07:56,110][06887] Signal inference workers to resume experience collection... (7750 times) [2024-06-27 17:07:56,131][06909] InferenceWorker_p0-w0: stopping experience collection (7750 times) [2024-06-27 17:07:56,131][06909] InferenceWorker_p0-w0: resuming experience collection (7750 times) [2024-06-27 17:07:58,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 634388480. Throughput: 0: 43884.5. Samples: 537263520. Policy #0 lag: (min: 0.0, avg: 12.2, max: 20.0) [2024-06-27 17:07:58,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 17:07:59,816][06909] Updated weights for policy 0, policy_version 38723 (0.0038) [2024-06-27 17:08:03,273][06909] Updated weights for policy 0, policy_version 38733 (0.0037) [2024-06-27 17:08:03,856][06674] Fps is (10 sec: 50760.0, 60 sec: 44778.4, 300 sec: 43986.0). Total num frames: 634650624. Throughput: 0: 43915.8. Samples: 537535400. Policy #0 lag: (min: 0.0, avg: 12.2, max: 20.0) [2024-06-27 17:08:03,856][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:08:07,191][06909] Updated weights for policy 0, policy_version 38743 (0.0038) [2024-06-27 17:08:08,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43963.8, 300 sec: 43820.3). Total num frames: 634814464. Throughput: 0: 43963.3. Samples: 537798860. Policy #0 lag: (min: 0.0, avg: 12.2, max: 20.0) [2024-06-27 17:08:08,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:08:10,913][06909] Updated weights for policy 0, policy_version 38753 (0.0037) [2024-06-27 17:08:13,850][06674] Fps is (10 sec: 39345.2, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 635043840. Throughput: 0: 43812.1. Samples: 537920720. Policy #0 lag: (min: 0.0, avg: 12.2, max: 20.0) [2024-06-27 17:08:13,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:08:14,574][06909] Updated weights for policy 0, policy_version 38763 (0.0035) [2024-06-27 17:08:18,324][06909] Updated weights for policy 0, policy_version 38773 (0.0034) [2024-06-27 17:08:18,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.7, 300 sec: 43820.2). Total num frames: 635273216. Throughput: 0: 43846.2. Samples: 538189020. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 17:08:18,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 17:08:22,189][06909] Updated weights for policy 0, policy_version 38783 (0.0038) [2024-06-27 17:08:23,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43965.2, 300 sec: 43875.8). Total num frames: 635469824. Throughput: 0: 43823.9. Samples: 538451440. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 17:08:23,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:08:25,675][06909] Updated weights for policy 0, policy_version 38793 (0.0038) [2024-06-27 17:08:28,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 635715584. Throughput: 0: 43925.1. Samples: 538582040. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 17:08:28,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:08:29,470][06909] Updated weights for policy 0, policy_version 38803 (0.0027) [2024-06-27 17:08:33,628][06909] Updated weights for policy 0, policy_version 38813 (0.0038) [2024-06-27 17:08:33,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.7, 300 sec: 43820.3). Total num frames: 635928576. Throughput: 0: 43740.3. Samples: 538846320. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 17:08:33,850][06674] Avg episode reward: [(0, '0.409')] [2024-06-27 17:08:37,518][06909] Updated weights for policy 0, policy_version 38823 (0.0040) [2024-06-27 17:08:38,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43963.7, 300 sec: 43764.8). Total num frames: 636125184. Throughput: 0: 43644.1. Samples: 539106660. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 17:08:38,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 17:08:40,938][06909] Updated weights for policy 0, policy_version 38833 (0.0044) [2024-06-27 17:08:43,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43692.1, 300 sec: 43987.2). Total num frames: 636354560. Throughput: 0: 43734.6. Samples: 539231580. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-27 17:08:43,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 17:08:45,006][06909] Updated weights for policy 0, policy_version 38843 (0.0026) [2024-06-27 17:08:48,385][06909] Updated weights for policy 0, policy_version 38853 (0.0042) [2024-06-27 17:08:48,850][06674] Fps is (10 sec: 47512.9, 60 sec: 43963.6, 300 sec: 43875.8). Total num frames: 636600320. Throughput: 0: 43679.5. Samples: 539500720. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-27 17:08:48,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:08:52,357][06909] Updated weights for policy 0, policy_version 38863 (0.0034) [2024-06-27 17:08:53,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.7, 300 sec: 43820.3). Total num frames: 636780544. Throughput: 0: 43622.7. Samples: 539761880. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-27 17:08:53,854][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:08:55,864][06909] Updated weights for policy 0, policy_version 38873 (0.0039) [2024-06-27 17:08:58,850][06674] Fps is (10 sec: 39321.4, 60 sec: 43417.4, 300 sec: 43986.8). Total num frames: 636993536. Throughput: 0: 43711.9. Samples: 539887760. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-27 17:08:58,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:08:59,838][06909] Updated weights for policy 0, policy_version 38883 (0.0037) [2024-06-27 17:09:03,501][06909] Updated weights for policy 0, policy_version 38893 (0.0035) [2024-06-27 17:09:03,852][06674] Fps is (10 sec: 44227.9, 60 sec: 42874.3, 300 sec: 43764.4). Total num frames: 637222912. Throughput: 0: 43715.4. Samples: 540156300. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-27 17:09:03,852][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:09:07,310][06909] Updated weights for policy 0, policy_version 38903 (0.0030) [2024-06-27 17:09:08,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 637452288. Throughput: 0: 43619.1. Samples: 540414300. Policy #0 lag: (min: 2.0, avg: 12.2, max: 25.0) [2024-06-27 17:09:08,851][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 17:09:10,498][06887] Signal inference workers to stop experience collection... (7800 times) [2024-06-27 17:09:10,526][06909] InferenceWorker_p0-w0: stopping experience collection (7800 times) [2024-06-27 17:09:10,551][06887] Signal inference workers to resume experience collection... (7800 times) [2024-06-27 17:09:10,552][06909] InferenceWorker_p0-w0: resuming experience collection (7800 times) [2024-06-27 17:09:11,216][06909] Updated weights for policy 0, policy_version 38913 (0.0034) [2024-06-27 17:09:13,850][06674] Fps is (10 sec: 44246.1, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 637665280. Throughput: 0: 43733.3. Samples: 540550040. Policy #0 lag: (min: 2.0, avg: 12.2, max: 25.0) [2024-06-27 17:09:13,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:09:14,551][06909] Updated weights for policy 0, policy_version 38923 (0.0025) [2024-06-27 17:09:18,530][06909] Updated weights for policy 0, policy_version 38933 (0.0028) [2024-06-27 17:09:18,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 637911040. Throughput: 0: 43760.0. Samples: 540815520. Policy #0 lag: (min: 2.0, avg: 12.2, max: 25.0) [2024-06-27 17:09:18,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:09:22,078][06909] Updated weights for policy 0, policy_version 38943 (0.0019) [2024-06-27 17:09:23,852][06674] Fps is (10 sec: 44227.4, 60 sec: 43962.2, 300 sec: 43820.0). Total num frames: 638107648. Throughput: 0: 43809.1. Samples: 541078160. Policy #0 lag: (min: 2.0, avg: 12.2, max: 25.0) [2024-06-27 17:09:23,852][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 17:09:25,771][06909] Updated weights for policy 0, policy_version 38953 (0.0021) [2024-06-27 17:09:28,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 638337024. Throughput: 0: 43884.9. Samples: 541206400. Policy #0 lag: (min: 2.0, avg: 12.2, max: 25.0) [2024-06-27 17:09:28,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:09:29,626][06909] Updated weights for policy 0, policy_version 38963 (0.0025) [2024-06-27 17:09:33,045][06909] Updated weights for policy 0, policy_version 38973 (0.0037) [2024-06-27 17:09:33,850][06674] Fps is (10 sec: 45884.2, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 638566400. Throughput: 0: 43917.4. Samples: 541477000. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-27 17:09:33,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:09:36,972][06909] Updated weights for policy 0, policy_version 38983 (0.0030) [2024-06-27 17:09:38,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 638779392. Throughput: 0: 43770.3. Samples: 541731540. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-27 17:09:38,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:09:40,924][06909] Updated weights for policy 0, policy_version 38993 (0.0036) [2024-06-27 17:09:43,855][06674] Fps is (10 sec: 40938.0, 60 sec: 43686.7, 300 sec: 43986.1). Total num frames: 638976000. Throughput: 0: 43855.7. Samples: 541861500. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-27 17:09:43,856][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:09:44,853][06909] Updated weights for policy 0, policy_version 39003 (0.0038) [2024-06-27 17:09:48,422][06909] Updated weights for policy 0, policy_version 39013 (0.0027) [2024-06-27 17:09:48,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43417.7, 300 sec: 43764.7). Total num frames: 639205376. Throughput: 0: 43690.1. Samples: 542122260. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-27 17:09:48,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:09:48,858][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000039014_639205376.pth... [2024-06-27 17:09:48,923][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000038370_628654080.pth [2024-06-27 17:09:52,229][06909] Updated weights for policy 0, policy_version 39023 (0.0050) [2024-06-27 17:09:53,850][06674] Fps is (10 sec: 45900.3, 60 sec: 44236.8, 300 sec: 43875.8). Total num frames: 639434752. Throughput: 0: 43720.1. Samples: 542381700. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-27 17:09:53,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:09:56,203][06909] Updated weights for policy 0, policy_version 39033 (0.0028) [2024-06-27 17:09:58,850][06674] Fps is (10 sec: 42597.6, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 639631360. Throughput: 0: 43794.5. Samples: 542520800. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 17:09:58,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:09:59,453][06909] Updated weights for policy 0, policy_version 39043 (0.0047) [2024-06-27 17:10:03,482][06909] Updated weights for policy 0, policy_version 39053 (0.0041) [2024-06-27 17:10:03,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44238.3, 300 sec: 43875.8). Total num frames: 639877120. Throughput: 0: 43693.4. Samples: 542781720. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 17:10:03,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:10:06,879][06909] Updated weights for policy 0, policy_version 39063 (0.0033) [2024-06-27 17:10:08,852][06674] Fps is (10 sec: 45866.3, 60 sec: 43962.3, 300 sec: 43820.0). Total num frames: 640090112. Throughput: 0: 43623.1. Samples: 543041200. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 17:10:08,852][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 17:10:10,989][06909] Updated weights for policy 0, policy_version 39073 (0.0029) [2024-06-27 17:10:13,851][06674] Fps is (10 sec: 40954.6, 60 sec: 43689.7, 300 sec: 43931.1). Total num frames: 640286720. Throughput: 0: 43577.4. Samples: 543167440. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 17:10:13,852][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:10:14,615][06909] Updated weights for policy 0, policy_version 39083 (0.0027) [2024-06-27 17:10:18,355][06909] Updated weights for policy 0, policy_version 39093 (0.0029) [2024-06-27 17:10:18,850][06674] Fps is (10 sec: 44245.7, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 640532480. Throughput: 0: 43541.4. Samples: 543436360. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 17:10:18,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:10:22,109][06887] Signal inference workers to stop experience collection... (7850 times) [2024-06-27 17:10:22,146][06909] InferenceWorker_p0-w0: stopping experience collection (7850 times) [2024-06-27 17:10:22,155][06887] Signal inference workers to resume experience collection... (7850 times) [2024-06-27 17:10:22,165][06909] InferenceWorker_p0-w0: resuming experience collection (7850 times) [2024-06-27 17:10:22,168][06909] Updated weights for policy 0, policy_version 39103 (0.0032) [2024-06-27 17:10:23,856][06674] Fps is (10 sec: 45853.6, 60 sec: 43960.9, 300 sec: 43874.9). Total num frames: 640745472. Throughput: 0: 43607.9. Samples: 543694160. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2024-06-27 17:10:23,856][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:10:25,926][06909] Updated weights for policy 0, policy_version 39113 (0.0032) [2024-06-27 17:10:28,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43417.6, 300 sec: 43986.9). Total num frames: 640942080. Throughput: 0: 43756.8. Samples: 543830320. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2024-06-27 17:10:28,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:10:29,402][06909] Updated weights for policy 0, policy_version 39123 (0.0037) [2024-06-27 17:10:33,264][06909] Updated weights for policy 0, policy_version 39133 (0.0035) [2024-06-27 17:10:33,850][06674] Fps is (10 sec: 42624.4, 60 sec: 43417.7, 300 sec: 43820.3). Total num frames: 641171456. Throughput: 0: 43777.8. Samples: 544092260. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2024-06-27 17:10:33,850][06674] Avg episode reward: [(0, '0.409')] [2024-06-27 17:10:36,806][06909] Updated weights for policy 0, policy_version 39143 (0.0032) [2024-06-27 17:10:38,850][06674] Fps is (10 sec: 45874.5, 60 sec: 43690.5, 300 sec: 43875.8). Total num frames: 641400832. Throughput: 0: 43910.0. Samples: 544357660. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2024-06-27 17:10:38,851][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 17:10:40,926][06909] Updated weights for policy 0, policy_version 39153 (0.0031) [2024-06-27 17:10:43,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43967.7, 300 sec: 43986.9). Total num frames: 641613824. Throughput: 0: 43856.1. Samples: 544494320. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2024-06-27 17:10:43,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:10:44,249][06909] Updated weights for policy 0, policy_version 39163 (0.0026) [2024-06-27 17:10:48,362][06909] Updated weights for policy 0, policy_version 39173 (0.0026) [2024-06-27 17:10:48,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43963.6, 300 sec: 43875.8). Total num frames: 641843200. Throughput: 0: 43903.0. Samples: 544757360. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 17:10:48,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 17:10:51,673][06909] Updated weights for policy 0, policy_version 39183 (0.0026) [2024-06-27 17:10:53,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 642056192. Throughput: 0: 43922.0. Samples: 545017600. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 17:10:53,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:10:55,672][06909] Updated weights for policy 0, policy_version 39193 (0.0038) [2024-06-27 17:10:58,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.8, 300 sec: 43931.4). Total num frames: 642269184. Throughput: 0: 44020.8. Samples: 545148320. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 17:10:58,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 17:10:59,538][06909] Updated weights for policy 0, policy_version 39203 (0.0040) [2024-06-27 17:11:03,107][06909] Updated weights for policy 0, policy_version 39213 (0.0037) [2024-06-27 17:11:03,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 642498560. Throughput: 0: 43961.5. Samples: 545414620. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 17:11:03,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 17:11:06,818][06909] Updated weights for policy 0, policy_version 39223 (0.0025) [2024-06-27 17:11:08,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43965.3, 300 sec: 43875.8). Total num frames: 642727936. Throughput: 0: 43930.8. Samples: 545670780. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 17:11:08,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:11:10,832][06909] Updated weights for policy 0, policy_version 39233 (0.0038) [2024-06-27 17:11:13,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43691.7, 300 sec: 43875.8). Total num frames: 642908160. Throughput: 0: 43855.7. Samples: 545803820. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 17:11:13,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:11:14,155][06909] Updated weights for policy 0, policy_version 39243 (0.0027) [2024-06-27 17:11:18,382][06909] Updated weights for policy 0, policy_version 39253 (0.0028) [2024-06-27 17:11:18,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43417.6, 300 sec: 43820.2). Total num frames: 643137536. Throughput: 0: 43798.9. Samples: 546063220. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 17:11:18,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:11:21,494][06909] Updated weights for policy 0, policy_version 39263 (0.0031) [2024-06-27 17:11:23,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43695.1, 300 sec: 43820.3). Total num frames: 643366912. Throughput: 0: 43781.1. Samples: 546327800. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 17:11:23,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:11:25,956][06909] Updated weights for policy 0, policy_version 39273 (0.0037) [2024-06-27 17:11:28,852][06674] Fps is (10 sec: 45866.1, 60 sec: 44235.3, 300 sec: 43986.6). Total num frames: 643596288. Throughput: 0: 43806.4. Samples: 546465700. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 17:11:28,852][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:11:29,307][06909] Updated weights for policy 0, policy_version 39283 (0.0028) [2024-06-27 17:11:33,509][06909] Updated weights for policy 0, policy_version 39293 (0.0023) [2024-06-27 17:11:33,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 643792896. Throughput: 0: 43598.4. Samples: 546719280. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 17:11:33,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:11:36,798][06909] Updated weights for policy 0, policy_version 39303 (0.0031) [2024-06-27 17:11:38,850][06674] Fps is (10 sec: 42607.2, 60 sec: 43690.8, 300 sec: 43875.8). Total num frames: 644022272. Throughput: 0: 43716.9. Samples: 546984860. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-27 17:11:38,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 17:11:39,041][06887] Signal inference workers to stop experience collection... (7900 times) [2024-06-27 17:11:39,041][06887] Signal inference workers to resume experience collection... (7900 times) [2024-06-27 17:11:39,061][06909] InferenceWorker_p0-w0: stopping experience collection (7900 times) [2024-06-27 17:11:39,062][06909] InferenceWorker_p0-w0: resuming experience collection (7900 times) [2024-06-27 17:11:40,809][06909] Updated weights for policy 0, policy_version 39313 (0.0029) [2024-06-27 17:11:43,856][06674] Fps is (10 sec: 45847.0, 60 sec: 43959.3, 300 sec: 43930.5). Total num frames: 644251648. Throughput: 0: 43719.0. Samples: 547115940. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-27 17:11:43,856][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:11:44,182][06909] Updated weights for policy 0, policy_version 39323 (0.0029) [2024-06-27 17:11:48,233][06909] Updated weights for policy 0, policy_version 39333 (0.0033) [2024-06-27 17:11:48,852][06674] Fps is (10 sec: 44227.7, 60 sec: 43689.2, 300 sec: 43875.5). Total num frames: 644464640. Throughput: 0: 43695.7. Samples: 547381020. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-27 17:11:48,852][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 17:11:48,861][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000039335_644464640.pth... [2024-06-27 17:11:48,911][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000038695_633978880.pth [2024-06-27 17:11:51,643][06909] Updated weights for policy 0, policy_version 39343 (0.0038) [2024-06-27 17:11:53,850][06674] Fps is (10 sec: 44263.5, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 644694016. Throughput: 0: 43832.0. Samples: 547643220. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-27 17:11:53,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:11:55,723][06909] Updated weights for policy 0, policy_version 39353 (0.0027) [2024-06-27 17:11:58,850][06674] Fps is (10 sec: 42607.2, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 644890624. Throughput: 0: 43783.0. Samples: 547774060. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-27 17:11:58,851][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:11:59,003][06909] Updated weights for policy 0, policy_version 39363 (0.0042) [2024-06-27 17:12:02,975][06909] Updated weights for policy 0, policy_version 39373 (0.0026) [2024-06-27 17:12:03,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43417.5, 300 sec: 43820.3). Total num frames: 645103616. Throughput: 0: 43841.8. Samples: 548036100. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-27 17:12:03,856][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:12:06,453][06909] Updated weights for policy 0, policy_version 39383 (0.0034) [2024-06-27 17:12:08,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43417.6, 300 sec: 43764.7). Total num frames: 645332992. Throughput: 0: 43859.9. Samples: 548301500. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-27 17:12:08,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:12:10,861][06909] Updated weights for policy 0, policy_version 39393 (0.0044) [2024-06-27 17:12:13,790][06909] Updated weights for policy 0, policy_version 39403 (0.0043) [2024-06-27 17:12:13,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44509.8, 300 sec: 43875.8). Total num frames: 645578752. Throughput: 0: 43677.1. Samples: 548431080. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-27 17:12:13,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:12:18,267][06909] Updated weights for policy 0, policy_version 39413 (0.0038) [2024-06-27 17:12:18,856][06674] Fps is (10 sec: 44210.2, 60 sec: 43959.3, 300 sec: 43875.2). Total num frames: 645775360. Throughput: 0: 43780.2. Samples: 548689660. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-27 17:12:18,857][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:12:21,405][06909] Updated weights for policy 0, policy_version 39423 (0.0031) [2024-06-27 17:12:23,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 645988352. Throughput: 0: 43904.0. Samples: 548960540. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-27 17:12:23,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:12:25,683][06909] Updated weights for policy 0, policy_version 39433 (0.0031) [2024-06-27 17:12:28,850][06674] Fps is (10 sec: 44263.4, 60 sec: 43692.1, 300 sec: 43820.3). Total num frames: 646217728. Throughput: 0: 43845.8. Samples: 549088740. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 17:12:28,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:12:29,066][06909] Updated weights for policy 0, policy_version 39443 (0.0035) [2024-06-27 17:12:33,446][06909] Updated weights for policy 0, policy_version 39453 (0.0032) [2024-06-27 17:12:33,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 646414336. Throughput: 0: 43748.8. Samples: 549349620. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 17:12:33,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:12:36,592][06909] Updated weights for policy 0, policy_version 39463 (0.0032) [2024-06-27 17:12:38,850][06674] Fps is (10 sec: 42597.7, 60 sec: 43690.5, 300 sec: 43765.0). Total num frames: 646643712. Throughput: 0: 43819.4. Samples: 549615100. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 17:12:38,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:12:40,721][06909] Updated weights for policy 0, policy_version 39473 (0.0030) [2024-06-27 17:12:43,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43695.1, 300 sec: 43764.7). Total num frames: 646873088. Throughput: 0: 43808.5. Samples: 549745440. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 17:12:43,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:12:43,963][06909] Updated weights for policy 0, policy_version 39483 (0.0029) [2024-06-27 17:12:48,164][06909] Updated weights for policy 0, policy_version 39493 (0.0045) [2024-06-27 17:12:48,850][06674] Fps is (10 sec: 42599.3, 60 sec: 43419.1, 300 sec: 43820.3). Total num frames: 647069696. Throughput: 0: 43857.8. Samples: 550009700. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 17:12:48,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 17:12:51,440][06909] Updated weights for policy 0, policy_version 39503 (0.0036) [2024-06-27 17:12:53,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43417.6, 300 sec: 43764.7). Total num frames: 647299072. Throughput: 0: 43675.1. Samples: 550266880. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 17:12:53,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:12:55,891][06909] Updated weights for policy 0, policy_version 39513 (0.0036) [2024-06-27 17:12:58,772][06909] Updated weights for policy 0, policy_version 39523 (0.0022) [2024-06-27 17:12:58,850][06674] Fps is (10 sec: 47513.8, 60 sec: 44236.8, 300 sec: 43710.1). Total num frames: 647544832. Throughput: 0: 43830.7. Samples: 550403460. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 17:12:58,850][06674] Avg episode reward: [(0, '0.409')] [2024-06-27 17:13:01,150][06887] Signal inference workers to stop experience collection... (7950 times) [2024-06-27 17:13:01,153][06887] Signal inference workers to resume experience collection... (7950 times) [2024-06-27 17:13:01,176][06909] InferenceWorker_p0-w0: stopping experience collection (7950 times) [2024-06-27 17:13:01,176][06909] InferenceWorker_p0-w0: resuming experience collection (7950 times) [2024-06-27 17:13:03,386][06909] Updated weights for policy 0, policy_version 39533 (0.0025) [2024-06-27 17:13:03,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 647725056. Throughput: 0: 43909.0. Samples: 550665300. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 17:13:03,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:13:06,490][06909] Updated weights for policy 0, policy_version 39543 (0.0034) [2024-06-27 17:13:08,850][06674] Fps is (10 sec: 39321.0, 60 sec: 43417.5, 300 sec: 43709.2). Total num frames: 647938048. Throughput: 0: 43570.0. Samples: 550921200. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 17:13:08,851][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:13:10,843][06909] Updated weights for policy 0, policy_version 39553 (0.0026) [2024-06-27 17:13:13,829][06909] Updated weights for policy 0, policy_version 39563 (0.0037) [2024-06-27 17:13:13,852][06674] Fps is (10 sec: 47503.8, 60 sec: 43689.2, 300 sec: 43820.0). Total num frames: 648200192. Throughput: 0: 43551.4. Samples: 551048640. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 17:13:13,853][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:13:18,233][06909] Updated weights for policy 0, policy_version 39573 (0.0029) [2024-06-27 17:13:18,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43695.0, 300 sec: 43820.3). Total num frames: 648396800. Throughput: 0: 43850.5. Samples: 551322900. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-27 17:13:18,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:13:21,626][06909] Updated weights for policy 0, policy_version 39583 (0.0030) [2024-06-27 17:13:23,850][06674] Fps is (10 sec: 40968.2, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 648609792. Throughput: 0: 43717.5. Samples: 551582380. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-27 17:13:23,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 17:13:25,657][06909] Updated weights for policy 0, policy_version 39593 (0.0031) [2024-06-27 17:13:28,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 648839168. Throughput: 0: 43664.0. Samples: 551710320. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-27 17:13:28,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:13:29,161][06909] Updated weights for policy 0, policy_version 39603 (0.0047) [2024-06-27 17:13:33,263][06909] Updated weights for policy 0, policy_version 39613 (0.0033) [2024-06-27 17:13:33,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.7, 300 sec: 43820.3). Total num frames: 649052160. Throughput: 0: 43663.1. Samples: 551974540. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-27 17:13:33,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:13:36,482][06909] Updated weights for policy 0, policy_version 39623 (0.0024) [2024-06-27 17:13:38,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.8, 300 sec: 43764.7). Total num frames: 649265152. Throughput: 0: 43638.6. Samples: 552230620. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-27 17:13:38,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:13:41,032][06909] Updated weights for policy 0, policy_version 39633 (0.0031) [2024-06-27 17:13:43,775][06909] Updated weights for policy 0, policy_version 39643 (0.0032) [2024-06-27 17:13:43,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43963.6, 300 sec: 43764.7). Total num frames: 649510912. Throughput: 0: 43685.2. Samples: 552369300. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-27 17:13:43,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:13:48,406][06909] Updated weights for policy 0, policy_version 39653 (0.0041) [2024-06-27 17:13:48,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 43820.2). Total num frames: 649707520. Throughput: 0: 43681.2. Samples: 552630960. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-27 17:13:48,851][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:13:48,870][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000039655_649707520.pth... [2024-06-27 17:13:48,922][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000039014_639205376.pth [2024-06-27 17:13:51,471][06909] Updated weights for policy 0, policy_version 39663 (0.0038) [2024-06-27 17:13:53,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 649920512. Throughput: 0: 43681.5. Samples: 552886860. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-27 17:13:53,850][06674] Avg episode reward: [(0, '0.394')] [2024-06-27 17:13:55,792][06909] Updated weights for policy 0, policy_version 39673 (0.0034) [2024-06-27 17:13:58,818][06909] Updated weights for policy 0, policy_version 39683 (0.0033) [2024-06-27 17:13:58,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43690.6, 300 sec: 43876.1). Total num frames: 650166272. Throughput: 0: 43794.4. Samples: 553019300. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-27 17:13:58,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:14:03,410][06909] Updated weights for policy 0, policy_version 39693 (0.0034) [2024-06-27 17:14:03,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 650346496. Throughput: 0: 43640.0. Samples: 553286700. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-27 17:14:03,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:14:06,437][06909] Updated weights for policy 0, policy_version 39703 (0.0038) [2024-06-27 17:14:08,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43963.8, 300 sec: 43764.7). Total num frames: 650575872. Throughput: 0: 43599.6. Samples: 553544360. Policy #0 lag: (min: 0.0, avg: 11.6, max: 21.0) [2024-06-27 17:14:08,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:14:10,892][06909] Updated weights for policy 0, policy_version 39713 (0.0032) [2024-06-27 17:14:13,850][06674] Fps is (10 sec: 45875.8, 60 sec: 43419.1, 300 sec: 43709.2). Total num frames: 650805248. Throughput: 0: 43765.4. Samples: 553679760. Policy #0 lag: (min: 0.0, avg: 11.6, max: 21.0) [2024-06-27 17:14:13,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:14:13,981][06909] Updated weights for policy 0, policy_version 39723 (0.0022) [2024-06-27 17:14:18,477][06909] Updated weights for policy 0, policy_version 39733 (0.0035) [2024-06-27 17:14:18,852][06674] Fps is (10 sec: 42589.4, 60 sec: 43416.1, 300 sec: 43709.2). Total num frames: 651001856. Throughput: 0: 43697.0. Samples: 553941000. Policy #0 lag: (min: 0.0, avg: 11.6, max: 21.0) [2024-06-27 17:14:18,853][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:14:18,942][06887] Signal inference workers to stop experience collection... (8000 times) [2024-06-27 17:14:18,991][06909] InferenceWorker_p0-w0: stopping experience collection (8000 times) [2024-06-27 17:14:19,000][06887] Signal inference workers to resume experience collection... (8000 times) [2024-06-27 17:14:19,005][06909] InferenceWorker_p0-w0: resuming experience collection (8000 times) [2024-06-27 17:14:21,232][06909] Updated weights for policy 0, policy_version 39743 (0.0036) [2024-06-27 17:14:23,850][06674] Fps is (10 sec: 42597.6, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 651231232. Throughput: 0: 43795.5. Samples: 554201420. Policy #0 lag: (min: 0.0, avg: 11.6, max: 21.0) [2024-06-27 17:14:23,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:14:26,041][06909] Updated weights for policy 0, policy_version 39753 (0.0031) [2024-06-27 17:14:28,534][06909] Updated weights for policy 0, policy_version 39763 (0.0033) [2024-06-27 17:14:28,850][06674] Fps is (10 sec: 47524.1, 60 sec: 43963.7, 300 sec: 43764.7). Total num frames: 651476992. Throughput: 0: 43745.5. Samples: 554337840. Policy #0 lag: (min: 0.0, avg: 11.6, max: 21.0) [2024-06-27 17:14:28,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:14:33,333][06909] Updated weights for policy 0, policy_version 39773 (0.0032) [2024-06-27 17:14:33,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 651673600. Throughput: 0: 43742.8. Samples: 554599380. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-27 17:14:33,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 17:14:36,047][06909] Updated weights for policy 0, policy_version 39783 (0.0032) [2024-06-27 17:14:38,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.7, 300 sec: 43765.5). Total num frames: 651886592. Throughput: 0: 43831.6. Samples: 554859280. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-27 17:14:38,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:14:40,874][06909] Updated weights for policy 0, policy_version 39793 (0.0031) [2024-06-27 17:14:43,348][06909] Updated weights for policy 0, policy_version 39803 (0.0024) [2024-06-27 17:14:43,852][06674] Fps is (10 sec: 45865.7, 60 sec: 43689.2, 300 sec: 43819.9). Total num frames: 652132352. Throughput: 0: 43795.8. Samples: 554990200. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-27 17:14:43,853][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:14:48,099][06909] Updated weights for policy 0, policy_version 39813 (0.0037) [2024-06-27 17:14:48,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 652328960. Throughput: 0: 43959.2. Samples: 555264860. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-27 17:14:48,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:14:51,014][06909] Updated weights for policy 0, policy_version 39823 (0.0037) [2024-06-27 17:14:53,850][06674] Fps is (10 sec: 40968.2, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 652541952. Throughput: 0: 43825.8. Samples: 555516520. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-27 17:14:53,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 17:14:55,558][06909] Updated weights for policy 0, policy_version 39833 (0.0041) [2024-06-27 17:14:58,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 652771328. Throughput: 0: 43805.7. Samples: 555651020. Policy #0 lag: (min: 0.0, avg: 12.2, max: 22.0) [2024-06-27 17:14:58,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:14:59,024][06909] Updated weights for policy 0, policy_version 39843 (0.0025) [2024-06-27 17:15:03,126][06909] Updated weights for policy 0, policy_version 39853 (0.0038) [2024-06-27 17:15:03,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.8, 300 sec: 43709.5). Total num frames: 652984320. Throughput: 0: 43991.5. Samples: 555920520. Policy #0 lag: (min: 0.0, avg: 12.2, max: 22.0) [2024-06-27 17:15:03,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 17:15:06,314][06909] Updated weights for policy 0, policy_version 39863 (0.0037) [2024-06-27 17:15:08,856][06674] Fps is (10 sec: 42572.9, 60 sec: 43686.3, 300 sec: 43764.0). Total num frames: 653197312. Throughput: 0: 43791.6. Samples: 556172300. Policy #0 lag: (min: 0.0, avg: 12.2, max: 22.0) [2024-06-27 17:15:08,865][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:15:10,701][06909] Updated weights for policy 0, policy_version 39873 (0.0028) [2024-06-27 17:15:13,682][06909] Updated weights for policy 0, policy_version 39883 (0.0029) [2024-06-27 17:15:13,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43963.6, 300 sec: 43764.7). Total num frames: 653443072. Throughput: 0: 43644.3. Samples: 556301840. Policy #0 lag: (min: 0.0, avg: 12.2, max: 22.0) [2024-06-27 17:15:13,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:15:18,154][06909] Updated weights for policy 0, policy_version 39893 (0.0021) [2024-06-27 17:15:18,850][06674] Fps is (10 sec: 44263.5, 60 sec: 43965.3, 300 sec: 43710.1). Total num frames: 653639680. Throughput: 0: 43791.1. Samples: 556569980. Policy #0 lag: (min: 0.0, avg: 12.2, max: 22.0) [2024-06-27 17:15:18,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 17:15:20,975][06909] Updated weights for policy 0, policy_version 39903 (0.0027) [2024-06-27 17:15:23,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 653852672. Throughput: 0: 43800.8. Samples: 556830320. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 17:15:23,850][06674] Avg episode reward: [(0, '0.408')] [2024-06-27 17:15:25,641][06909] Updated weights for policy 0, policy_version 39913 (0.0024) [2024-06-27 17:15:28,326][06909] Updated weights for policy 0, policy_version 39923 (0.0042) [2024-06-27 17:15:28,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43690.7, 300 sec: 43820.2). Total num frames: 654098432. Throughput: 0: 43841.6. Samples: 556962980. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 17:15:28,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:15:33,094][06909] Updated weights for policy 0, policy_version 39933 (0.0037) [2024-06-27 17:15:33,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 654295040. Throughput: 0: 43568.0. Samples: 557225420. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 17:15:33,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:15:36,337][06909] Updated weights for policy 0, policy_version 39943 (0.0035) [2024-06-27 17:15:38,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 654508032. Throughput: 0: 43659.7. Samples: 557481200. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 17:15:38,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 17:15:40,757][06909] Updated weights for policy 0, policy_version 39953 (0.0049) [2024-06-27 17:15:41,572][06887] Signal inference workers to stop experience collection... (8050 times) [2024-06-27 17:15:41,617][06909] InferenceWorker_p0-w0: stopping experience collection (8050 times) [2024-06-27 17:15:41,626][06887] Signal inference workers to resume experience collection... (8050 times) [2024-06-27 17:15:41,634][06909] InferenceWorker_p0-w0: resuming experience collection (8050 times) [2024-06-27 17:15:43,638][06909] Updated weights for policy 0, policy_version 39963 (0.0032) [2024-06-27 17:15:43,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43692.2, 300 sec: 43764.7). Total num frames: 654753792. Throughput: 0: 43689.8. Samples: 557617060. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 17:15:43,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 17:15:48,223][06909] Updated weights for policy 0, policy_version 39973 (0.0029) [2024-06-27 17:15:48,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43417.6, 300 sec: 43653.6). Total num frames: 654934016. Throughput: 0: 43597.3. Samples: 557882400. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 17:15:48,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 17:15:48,863][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000039975_654950400.pth... [2024-06-27 17:15:48,911][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000039335_644464640.pth [2024-06-27 17:15:51,358][06909] Updated weights for policy 0, policy_version 39983 (0.0027) [2024-06-27 17:15:53,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43690.8, 300 sec: 43709.2). Total num frames: 655163392. Throughput: 0: 43700.1. Samples: 558138540. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 17:15:53,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:15:55,610][06909] Updated weights for policy 0, policy_version 39993 (0.0039) [2024-06-27 17:15:58,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 655392768. Throughput: 0: 43718.4. Samples: 558269160. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 17:15:58,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:15:58,917][06909] Updated weights for policy 0, policy_version 40003 (0.0046) [2024-06-27 17:16:02,802][06909] Updated weights for policy 0, policy_version 40013 (0.0031) [2024-06-27 17:16:03,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 655622144. Throughput: 0: 43652.9. Samples: 558534360. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 17:16:03,851][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:16:06,146][06909] Updated weights for policy 0, policy_version 40023 (0.0034) [2024-06-27 17:16:08,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43968.1, 300 sec: 43820.2). Total num frames: 655835136. Throughput: 0: 43724.0. Samples: 558797900. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 17:16:08,851][06674] Avg episode reward: [(0, '0.393')] [2024-06-27 17:16:10,691][06909] Updated weights for policy 0, policy_version 40033 (0.0031) [2024-06-27 17:16:13,402][06909] Updated weights for policy 0, policy_version 40043 (0.0028) [2024-06-27 17:16:13,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.8, 300 sec: 43820.3). Total num frames: 656064512. Throughput: 0: 43706.2. Samples: 558929760. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 17:16:13,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:16:18,118][06909] Updated weights for policy 0, policy_version 40053 (0.0036) [2024-06-27 17:16:18,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 656261120. Throughput: 0: 43747.1. Samples: 559194040. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 17:16:18,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 17:16:21,044][06909] Updated weights for policy 0, policy_version 40063 (0.0034) [2024-06-27 17:16:23,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43963.7, 300 sec: 43709.5). Total num frames: 656490496. Throughput: 0: 43839.0. Samples: 559453960. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 17:16:23,856][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:16:25,620][06909] Updated weights for policy 0, policy_version 40073 (0.0033) [2024-06-27 17:16:28,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43417.6, 300 sec: 43764.7). Total num frames: 656703488. Throughput: 0: 43848.5. Samples: 559590240. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 17:16:28,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:16:28,907][06909] Updated weights for policy 0, policy_version 40083 (0.0031) [2024-06-27 17:16:32,851][06909] Updated weights for policy 0, policy_version 40093 (0.0031) [2024-06-27 17:16:33,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 656916480. Throughput: 0: 43701.0. Samples: 559848940. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 17:16:33,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:16:36,331][06909] Updated weights for policy 0, policy_version 40103 (0.0028) [2024-06-27 17:16:38,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.7, 300 sec: 43710.1). Total num frames: 657145856. Throughput: 0: 43902.2. Samples: 560114140. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 17:16:38,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:16:40,415][06909] Updated weights for policy 0, policy_version 40113 (0.0028) [2024-06-27 17:16:43,695][06909] Updated weights for policy 0, policy_version 40123 (0.0031) [2024-06-27 17:16:43,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43690.7, 300 sec: 43765.0). Total num frames: 657375232. Throughput: 0: 43926.7. Samples: 560245860. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-27 17:16:43,850][06674] Avg episode reward: [(0, '0.409')] [2024-06-27 17:16:47,563][06909] Updated weights for policy 0, policy_version 40133 (0.0026) [2024-06-27 17:16:48,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44509.9, 300 sec: 43764.7). Total num frames: 657604608. Throughput: 0: 44037.7. Samples: 560516060. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-27 17:16:48,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 17:16:50,928][06909] Updated weights for policy 0, policy_version 40143 (0.0032) [2024-06-27 17:16:53,852][06674] Fps is (10 sec: 42589.6, 60 sec: 43962.2, 300 sec: 43764.4). Total num frames: 657801216. Throughput: 0: 44017.6. Samples: 560778780. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-27 17:16:53,852][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:16:55,322][06909] Updated weights for policy 0, policy_version 40153 (0.0034) [2024-06-27 17:16:58,270][06909] Updated weights for policy 0, policy_version 40163 (0.0029) [2024-06-27 17:16:58,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43963.7, 300 sec: 43820.3). Total num frames: 658030592. Throughput: 0: 43854.2. Samples: 560903200. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-27 17:16:58,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 17:17:03,096][06909] Updated weights for policy 0, policy_version 40173 (0.0031) [2024-06-27 17:17:03,850][06674] Fps is (10 sec: 45884.3, 60 sec: 43963.7, 300 sec: 43820.3). Total num frames: 658259968. Throughput: 0: 43895.0. Samples: 561169320. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-27 17:17:03,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:17:06,068][06909] Updated weights for policy 0, policy_version 40183 (0.0026) [2024-06-27 17:17:08,855][06674] Fps is (10 sec: 40940.0, 60 sec: 43414.1, 300 sec: 43597.4). Total num frames: 658440192. Throughput: 0: 44021.2. Samples: 561435120. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 17:17:08,855][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 17:17:10,363][06909] Updated weights for policy 0, policy_version 40193 (0.0022) [2024-06-27 17:17:13,265][06909] Updated weights for policy 0, policy_version 40203 (0.0045) [2024-06-27 17:17:13,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.7, 300 sec: 43765.6). Total num frames: 658685952. Throughput: 0: 43899.1. Samples: 561565700. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 17:17:13,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 17:17:17,792][06909] Updated weights for policy 0, policy_version 40213 (0.0032) [2024-06-27 17:17:18,850][06674] Fps is (10 sec: 49175.6, 60 sec: 44509.8, 300 sec: 43875.8). Total num frames: 658931712. Throughput: 0: 44081.2. Samples: 561832600. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 17:17:18,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 17:17:21,099][06909] Updated weights for policy 0, policy_version 40223 (0.0031) [2024-06-27 17:17:23,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 659111936. Throughput: 0: 44104.4. Samples: 562098840. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 17:17:23,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 17:17:25,184][06909] Updated weights for policy 0, policy_version 40233 (0.0031) [2024-06-27 17:17:25,536][06887] Signal inference workers to stop experience collection... (8100 times) [2024-06-27 17:17:25,588][06909] InferenceWorker_p0-w0: stopping experience collection (8100 times) [2024-06-27 17:17:25,591][06887] Signal inference workers to resume experience collection... (8100 times) [2024-06-27 17:17:25,601][06909] InferenceWorker_p0-w0: resuming experience collection (8100 times) [2024-06-27 17:17:28,435][06909] Updated weights for policy 0, policy_version 40243 (0.0045) [2024-06-27 17:17:28,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44236.7, 300 sec: 43875.8). Total num frames: 659357696. Throughput: 0: 43807.5. Samples: 562217200. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 17:17:28,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:17:32,556][06909] Updated weights for policy 0, policy_version 40253 (0.0033) [2024-06-27 17:17:33,850][06674] Fps is (10 sec: 47513.7, 60 sec: 44509.8, 300 sec: 43875.8). Total num frames: 659587072. Throughput: 0: 43960.1. Samples: 562494260. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-27 17:17:33,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:17:35,717][06909] Updated weights for policy 0, policy_version 40263 (0.0045) [2024-06-27 17:17:38,856][06674] Fps is (10 sec: 42572.8, 60 sec: 43959.3, 300 sec: 43763.8). Total num frames: 659783680. Throughput: 0: 43986.3. Samples: 562758340. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-27 17:17:38,856][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:17:40,050][06909] Updated weights for policy 0, policy_version 40273 (0.0026) [2024-06-27 17:17:42,990][06909] Updated weights for policy 0, policy_version 40283 (0.0039) [2024-06-27 17:17:43,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 660013056. Throughput: 0: 43986.2. Samples: 562882580. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-27 17:17:43,850][06674] Avg episode reward: [(0, '0.408')] [2024-06-27 17:17:47,412][06909] Updated weights for policy 0, policy_version 40293 (0.0031) [2024-06-27 17:17:48,850][06674] Fps is (10 sec: 45902.9, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 660242432. Throughput: 0: 44099.1. Samples: 563153780. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-27 17:17:48,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 17:17:48,863][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000040298_660242432.pth... [2024-06-27 17:17:48,918][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000039655_649707520.pth [2024-06-27 17:17:50,641][06909] Updated weights for policy 0, policy_version 40303 (0.0035) [2024-06-27 17:17:53,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43965.2, 300 sec: 43709.2). Total num frames: 660439040. Throughput: 0: 44102.1. Samples: 563419500. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-27 17:17:53,852][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:17:55,350][06909] Updated weights for policy 0, policy_version 40313 (0.0031) [2024-06-27 17:17:57,966][06909] Updated weights for policy 0, policy_version 40323 (0.0033) [2024-06-27 17:17:58,852][06674] Fps is (10 sec: 44227.6, 60 sec: 44235.2, 300 sec: 43931.0). Total num frames: 660684800. Throughput: 0: 43945.9. Samples: 563543360. Policy #0 lag: (min: 0.0, avg: 12.1, max: 24.0) [2024-06-27 17:17:58,852][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:18:02,661][06909] Updated weights for policy 0, policy_version 40333 (0.0046) [2024-06-27 17:18:03,852][06674] Fps is (10 sec: 47504.0, 60 sec: 44235.3, 300 sec: 43986.6). Total num frames: 660914176. Throughput: 0: 44106.1. Samples: 563817460. Policy #0 lag: (min: 0.0, avg: 12.1, max: 24.0) [2024-06-27 17:18:03,852][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:18:05,292][06909] Updated weights for policy 0, policy_version 40343 (0.0035) [2024-06-27 17:18:08,850][06674] Fps is (10 sec: 40968.7, 60 sec: 44240.4, 300 sec: 43709.5). Total num frames: 661094400. Throughput: 0: 43875.6. Samples: 564073240. Policy #0 lag: (min: 0.0, avg: 12.1, max: 24.0) [2024-06-27 17:18:08,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:18:10,179][06909] Updated weights for policy 0, policy_version 40353 (0.0032) [2024-06-27 17:18:12,895][06909] Updated weights for policy 0, policy_version 40363 (0.0027) [2024-06-27 17:18:13,850][06674] Fps is (10 sec: 42606.8, 60 sec: 44236.7, 300 sec: 43875.8). Total num frames: 661340160. Throughput: 0: 44118.2. Samples: 564202520. Policy #0 lag: (min: 0.0, avg: 12.1, max: 24.0) [2024-06-27 17:18:13,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:18:17,752][06909] Updated weights for policy 0, policy_version 40373 (0.0043) [2024-06-27 17:18:18,850][06674] Fps is (10 sec: 45874.4, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 661553152. Throughput: 0: 43948.8. Samples: 564471960. Policy #0 lag: (min: 0.0, avg: 12.1, max: 24.0) [2024-06-27 17:18:18,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:18:20,187][06909] Updated weights for policy 0, policy_version 40383 (0.0044) [2024-06-27 17:18:23,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43963.7, 300 sec: 43764.7). Total num frames: 661749760. Throughput: 0: 43780.9. Samples: 564728220. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 17:18:23,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:18:25,237][06909] Updated weights for policy 0, policy_version 40393 (0.0030) [2024-06-27 17:18:27,915][06909] Updated weights for policy 0, policy_version 40403 (0.0034) [2024-06-27 17:18:28,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.7, 300 sec: 43931.3). Total num frames: 662011904. Throughput: 0: 43834.1. Samples: 564855120. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 17:18:28,851][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:18:32,610][06909] Updated weights for policy 0, policy_version 40413 (0.0034) [2024-06-27 17:18:33,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43417.6, 300 sec: 43820.3). Total num frames: 662192128. Throughput: 0: 43821.8. Samples: 565125760. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 17:18:33,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:18:35,573][06909] Updated weights for policy 0, policy_version 40423 (0.0035) [2024-06-27 17:18:38,856][06674] Fps is (10 sec: 39298.1, 60 sec: 43690.6, 300 sec: 43708.3). Total num frames: 662405120. Throughput: 0: 43509.3. Samples: 565377680. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 17:18:38,856][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 17:18:39,364][06887] Signal inference workers to stop experience collection... (8150 times) [2024-06-27 17:18:39,364][06887] Signal inference workers to resume experience collection... (8150 times) [2024-06-27 17:18:39,407][06909] InferenceWorker_p0-w0: stopping experience collection (8150 times) [2024-06-27 17:18:39,408][06909] InferenceWorker_p0-w0: resuming experience collection (8150 times) [2024-06-27 17:18:40,250][06909] Updated weights for policy 0, policy_version 40433 (0.0033) [2024-06-27 17:18:42,802][06909] Updated weights for policy 0, policy_version 40443 (0.0034) [2024-06-27 17:18:43,850][06674] Fps is (10 sec: 47513.3, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 662667264. Throughput: 0: 43790.9. Samples: 565513860. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 17:18:43,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:18:47,499][06909] Updated weights for policy 0, policy_version 40453 (0.0032) [2024-06-27 17:18:48,850][06674] Fps is (10 sec: 44264.1, 60 sec: 43417.7, 300 sec: 43820.3). Total num frames: 662847488. Throughput: 0: 43670.1. Samples: 565782520. Policy #0 lag: (min: 1.0, avg: 9.8, max: 23.0) [2024-06-27 17:18:48,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 17:18:50,365][06909] Updated weights for policy 0, policy_version 40463 (0.0037) [2024-06-27 17:18:53,850][06674] Fps is (10 sec: 39321.6, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 663060480. Throughput: 0: 43726.6. Samples: 566040940. Policy #0 lag: (min: 1.0, avg: 9.8, max: 23.0) [2024-06-27 17:18:53,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:18:55,157][06909] Updated weights for policy 0, policy_version 40473 (0.0046) [2024-06-27 17:18:57,748][06909] Updated weights for policy 0, policy_version 40483 (0.0025) [2024-06-27 17:18:58,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43692.2, 300 sec: 43931.4). Total num frames: 663306240. Throughput: 0: 43630.3. Samples: 566165880. Policy #0 lag: (min: 1.0, avg: 9.8, max: 23.0) [2024-06-27 17:18:58,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:19:02,458][06909] Updated weights for policy 0, policy_version 40493 (0.0037) [2024-06-27 17:19:03,850][06674] Fps is (10 sec: 45874.5, 60 sec: 43419.0, 300 sec: 43875.8). Total num frames: 663519232. Throughput: 0: 43808.4. Samples: 566443340. Policy #0 lag: (min: 1.0, avg: 9.8, max: 23.0) [2024-06-27 17:19:03,851][06674] Avg episode reward: [(0, '0.396')] [2024-06-27 17:19:04,967][06909] Updated weights for policy 0, policy_version 40503 (0.0033) [2024-06-27 17:19:08,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 663715840. Throughput: 0: 43747.1. Samples: 566696840. Policy #0 lag: (min: 1.0, avg: 9.8, max: 23.0) [2024-06-27 17:19:08,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:19:09,927][06909] Updated weights for policy 0, policy_version 40513 (0.0031) [2024-06-27 17:19:12,732][06909] Updated weights for policy 0, policy_version 40523 (0.0022) [2024-06-27 17:19:13,850][06674] Fps is (10 sec: 45875.9, 60 sec: 43963.8, 300 sec: 43987.2). Total num frames: 663977984. Throughput: 0: 43817.9. Samples: 566826920. Policy #0 lag: (min: 1.0, avg: 9.8, max: 23.0) [2024-06-27 17:19:13,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:19:17,842][06909] Updated weights for policy 0, policy_version 40533 (0.0041) [2024-06-27 17:19:18,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43417.6, 300 sec: 43820.3). Total num frames: 664158208. Throughput: 0: 43798.9. Samples: 567096720. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-27 17:19:18,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:19:20,063][06909] Updated weights for policy 0, policy_version 40543 (0.0037) [2024-06-27 17:19:23,850][06674] Fps is (10 sec: 39321.7, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 664371200. Throughput: 0: 43940.6. Samples: 567354740. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-27 17:19:23,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:19:25,137][06909] Updated weights for policy 0, policy_version 40553 (0.0029) [2024-06-27 17:19:27,793][06909] Updated weights for policy 0, policy_version 40563 (0.0026) [2024-06-27 17:19:28,850][06674] Fps is (10 sec: 47514.2, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 664633344. Throughput: 0: 43790.2. Samples: 567484420. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-27 17:19:28,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:19:32,729][06909] Updated weights for policy 0, policy_version 40573 (0.0040) [2024-06-27 17:19:33,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 664829952. Throughput: 0: 43753.6. Samples: 567751440. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-27 17:19:33,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:19:35,226][06909] Updated weights for policy 0, policy_version 40583 (0.0020) [2024-06-27 17:19:38,850][06674] Fps is (10 sec: 39321.4, 60 sec: 43695.0, 300 sec: 43709.5). Total num frames: 665026560. Throughput: 0: 43794.6. Samples: 568011700. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-27 17:19:38,851][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 17:19:39,971][06909] Updated weights for policy 0, policy_version 40593 (0.0039) [2024-06-27 17:19:42,815][06887] Signal inference workers to stop experience collection... (8200 times) [2024-06-27 17:19:42,817][06887] Signal inference workers to resume experience collection... (8200 times) [2024-06-27 17:19:42,835][06909] Updated weights for policy 0, policy_version 40603 (0.0026) [2024-06-27 17:19:42,843][06909] InferenceWorker_p0-w0: stopping experience collection (8200 times) [2024-06-27 17:19:42,844][06909] InferenceWorker_p0-w0: resuming experience collection (8200 times) [2024-06-27 17:19:43,850][06674] Fps is (10 sec: 45875.8, 60 sec: 43690.7, 300 sec: 43931.4). Total num frames: 665288704. Throughput: 0: 43962.7. Samples: 568144200. Policy #0 lag: (min: 0.0, avg: 8.5, max: 22.0) [2024-06-27 17:19:43,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:19:47,279][06909] Updated weights for policy 0, policy_version 40613 (0.0030) [2024-06-27 17:19:48,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.6, 300 sec: 43820.3). Total num frames: 665468928. Throughput: 0: 43640.1. Samples: 568407140. Policy #0 lag: (min: 0.0, avg: 8.5, max: 22.0) [2024-06-27 17:19:48,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:19:48,870][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000040618_665485312.pth... [2024-06-27 17:19:48,921][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000039975_654950400.pth [2024-06-27 17:19:50,093][06909] Updated weights for policy 0, policy_version 40623 (0.0034) [2024-06-27 17:19:53,850][06674] Fps is (10 sec: 39321.4, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 665681920. Throughput: 0: 43856.9. Samples: 568670400. Policy #0 lag: (min: 0.0, avg: 8.5, max: 22.0) [2024-06-27 17:19:53,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:19:55,093][06909] Updated weights for policy 0, policy_version 40633 (0.0028) [2024-06-27 17:19:57,851][06909] Updated weights for policy 0, policy_version 40643 (0.0043) [2024-06-27 17:19:58,850][06674] Fps is (10 sec: 47513.9, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 665944064. Throughput: 0: 43841.0. Samples: 568799760. Policy #0 lag: (min: 0.0, avg: 8.5, max: 22.0) [2024-06-27 17:19:58,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:20:02,370][06909] Updated weights for policy 0, policy_version 40653 (0.0037) [2024-06-27 17:20:03,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43144.6, 300 sec: 43765.6). Total num frames: 666107904. Throughput: 0: 43748.1. Samples: 569065380. Policy #0 lag: (min: 0.0, avg: 8.5, max: 22.0) [2024-06-27 17:20:03,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:20:05,242][06909] Updated weights for policy 0, policy_version 40663 (0.0039) [2024-06-27 17:20:08,850][06674] Fps is (10 sec: 39321.7, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 666337280. Throughput: 0: 43745.4. Samples: 569323280. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-27 17:20:08,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:20:09,585][06909] Updated weights for policy 0, policy_version 40673 (0.0038) [2024-06-27 17:20:12,555][06909] Updated weights for policy 0, policy_version 40683 (0.0031) [2024-06-27 17:20:13,851][06674] Fps is (10 sec: 49147.9, 60 sec: 43690.0, 300 sec: 43931.2). Total num frames: 666599424. Throughput: 0: 43828.1. Samples: 569456720. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-27 17:20:13,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:20:17,012][06909] Updated weights for policy 0, policy_version 40693 (0.0027) [2024-06-27 17:20:18,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.8, 300 sec: 43820.3). Total num frames: 666779648. Throughput: 0: 43831.7. Samples: 569723860. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-27 17:20:18,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 17:20:20,376][06909] Updated weights for policy 0, policy_version 40703 (0.0034) [2024-06-27 17:20:23,850][06674] Fps is (10 sec: 40963.1, 60 sec: 43963.6, 300 sec: 43764.7). Total num frames: 667009024. Throughput: 0: 43754.6. Samples: 569980660. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-27 17:20:23,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 17:20:24,744][06909] Updated weights for policy 0, policy_version 40713 (0.0038) [2024-06-27 17:20:27,748][06909] Updated weights for policy 0, policy_version 40723 (0.0032) [2024-06-27 17:20:28,850][06674] Fps is (10 sec: 49151.6, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 667271168. Throughput: 0: 43765.2. Samples: 570113640. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-27 17:20:28,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 17:20:31,958][06909] Updated weights for policy 0, policy_version 40733 (0.0032) [2024-06-27 17:20:33,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 667451392. Throughput: 0: 43999.9. Samples: 570387140. Policy #0 lag: (min: 1.0, avg: 8.1, max: 21.0) [2024-06-27 17:20:33,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:20:35,106][06909] Updated weights for policy 0, policy_version 40743 (0.0032) [2024-06-27 17:20:38,850][06674] Fps is (10 sec: 40960.1, 60 sec: 44236.9, 300 sec: 43820.3). Total num frames: 667680768. Throughput: 0: 43815.1. Samples: 570642080. Policy #0 lag: (min: 1.0, avg: 8.1, max: 21.0) [2024-06-27 17:20:38,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:20:39,805][06909] Updated weights for policy 0, policy_version 40753 (0.0028) [2024-06-27 17:20:42,374][06909] Updated weights for policy 0, policy_version 40763 (0.0042) [2024-06-27 17:20:43,850][06674] Fps is (10 sec: 45875.8, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 667910144. Throughput: 0: 44058.6. Samples: 570782400. Policy #0 lag: (min: 1.0, avg: 8.1, max: 21.0) [2024-06-27 17:20:43,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:20:46,963][06909] Updated weights for policy 0, policy_version 40773 (0.0032) [2024-06-27 17:20:48,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 668106752. Throughput: 0: 44068.5. Samples: 571048460. Policy #0 lag: (min: 1.0, avg: 8.1, max: 21.0) [2024-06-27 17:20:48,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:20:49,908][06909] Updated weights for policy 0, policy_version 40783 (0.0030) [2024-06-27 17:20:53,850][06674] Fps is (10 sec: 42598.2, 60 sec: 44236.8, 300 sec: 43875.8). Total num frames: 668336128. Throughput: 0: 44039.5. Samples: 571305060. Policy #0 lag: (min: 1.0, avg: 8.1, max: 21.0) [2024-06-27 17:20:53,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 17:20:54,239][06909] Updated weights for policy 0, policy_version 40793 (0.0035) [2024-06-27 17:20:57,562][06909] Updated weights for policy 0, policy_version 40803 (0.0043) [2024-06-27 17:20:58,852][06674] Fps is (10 sec: 47503.7, 60 sec: 43962.2, 300 sec: 43931.0). Total num frames: 668581888. Throughput: 0: 43958.0. Samples: 571434880. Policy #0 lag: (min: 1.0, avg: 8.1, max: 21.0) [2024-06-27 17:20:58,852][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:21:01,667][06909] Updated weights for policy 0, policy_version 40813 (0.0021) [2024-06-27 17:21:03,686][06887] Signal inference workers to stop experience collection... (8250 times) [2024-06-27 17:21:03,687][06887] Signal inference workers to resume experience collection... (8250 times) [2024-06-27 17:21:03,727][06909] InferenceWorker_p0-w0: stopping experience collection (8250 times) [2024-06-27 17:21:03,727][06909] InferenceWorker_p0-w0: resuming experience collection (8250 times) [2024-06-27 17:21:03,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44510.0, 300 sec: 43875.8). Total num frames: 668778496. Throughput: 0: 44032.0. Samples: 571705300. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-27 17:21:03,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:21:04,961][06909] Updated weights for policy 0, policy_version 40823 (0.0025) [2024-06-27 17:21:08,850][06674] Fps is (10 sec: 40968.3, 60 sec: 44236.7, 300 sec: 43820.2). Total num frames: 668991488. Throughput: 0: 44004.6. Samples: 571960860. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-27 17:21:08,850][06674] Avg episode reward: [(0, '0.436')] [2024-06-27 17:21:08,977][06909] Updated weights for policy 0, policy_version 40833 (0.0043) [2024-06-27 17:21:12,755][06909] Updated weights for policy 0, policy_version 40843 (0.0050) [2024-06-27 17:21:13,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43964.4, 300 sec: 43986.9). Total num frames: 669237248. Throughput: 0: 43913.8. Samples: 572089760. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-27 17:21:13,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:21:16,442][06909] Updated weights for policy 0, policy_version 40853 (0.0046) [2024-06-27 17:21:18,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.7, 300 sec: 43820.3). Total num frames: 669417472. Throughput: 0: 43840.2. Samples: 572359940. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-27 17:21:18,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:21:19,986][06909] Updated weights for policy 0, policy_version 40863 (0.0022) [2024-06-27 17:21:23,606][06909] Updated weights for policy 0, policy_version 40873 (0.0039) [2024-06-27 17:21:23,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44236.9, 300 sec: 43931.3). Total num frames: 669663232. Throughput: 0: 44036.9. Samples: 572623740. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-27 17:21:23,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 17:21:27,397][06909] Updated weights for policy 0, policy_version 40883 (0.0025) [2024-06-27 17:21:28,850][06674] Fps is (10 sec: 47513.4, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 669892608. Throughput: 0: 43811.1. Samples: 572753900. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 17:21:28,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:21:31,092][06909] Updated weights for policy 0, policy_version 40893 (0.0024) [2024-06-27 17:21:33,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 670072832. Throughput: 0: 43748.3. Samples: 573017140. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 17:21:33,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:21:34,791][06909] Updated weights for policy 0, policy_version 40903 (0.0041) [2024-06-27 17:21:38,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 670302208. Throughput: 0: 43548.1. Samples: 573264720. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 17:21:38,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:21:39,134][06909] Updated weights for policy 0, policy_version 40913 (0.0039) [2024-06-27 17:21:42,539][06909] Updated weights for policy 0, policy_version 40923 (0.0022) [2024-06-27 17:21:43,850][06674] Fps is (10 sec: 47513.2, 60 sec: 43963.6, 300 sec: 43875.8). Total num frames: 670547968. Throughput: 0: 43634.7. Samples: 573398360. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 17:21:43,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:21:46,380][06909] Updated weights for policy 0, policy_version 40933 (0.0028) [2024-06-27 17:21:48,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43690.6, 300 sec: 43820.5). Total num frames: 670728192. Throughput: 0: 43665.2. Samples: 573670240. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 17:21:48,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:21:48,868][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000040938_670728192.pth... [2024-06-27 17:21:48,953][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000040298_660242432.pth [2024-06-27 17:21:50,130][06909] Updated weights for policy 0, policy_version 40943 (0.0030) [2024-06-27 17:21:53,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43690.6, 300 sec: 43820.2). Total num frames: 670957568. Throughput: 0: 43723.5. Samples: 573928420. Policy #0 lag: (min: 0.0, avg: 11.6, max: 20.0) [2024-06-27 17:21:53,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 17:21:53,891][06909] Updated weights for policy 0, policy_version 40953 (0.0036) [2024-06-27 17:21:57,550][06909] Updated weights for policy 0, policy_version 40963 (0.0036) [2024-06-27 17:21:58,850][06674] Fps is (10 sec: 47514.0, 60 sec: 43692.2, 300 sec: 43875.8). Total num frames: 671203328. Throughput: 0: 43876.0. Samples: 574064180. Policy #0 lag: (min: 0.0, avg: 11.6, max: 20.0) [2024-06-27 17:21:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 17:22:01,310][06909] Updated weights for policy 0, policy_version 40973 (0.0042) [2024-06-27 17:22:03,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43417.6, 300 sec: 43876.5). Total num frames: 671383552. Throughput: 0: 43616.0. Samples: 574322660. Policy #0 lag: (min: 0.0, avg: 11.6, max: 20.0) [2024-06-27 17:22:03,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:22:05,052][06909] Updated weights for policy 0, policy_version 40983 (0.0035) [2024-06-27 17:22:08,675][06909] Updated weights for policy 0, policy_version 40993 (0.0022) [2024-06-27 17:22:08,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 671629312. Throughput: 0: 43541.4. Samples: 574583100. Policy #0 lag: (min: 0.0, avg: 11.6, max: 20.0) [2024-06-27 17:22:08,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:22:12,412][06909] Updated weights for policy 0, policy_version 41003 (0.0023) [2024-06-27 17:22:13,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43417.6, 300 sec: 43764.7). Total num frames: 671842304. Throughput: 0: 43693.8. Samples: 574720120. Policy #0 lag: (min: 0.0, avg: 11.6, max: 20.0) [2024-06-27 17:22:13,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 17:22:16,402][06909] Updated weights for policy 0, policy_version 41013 (0.0038) [2024-06-27 17:22:18,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43690.6, 300 sec: 43820.3). Total num frames: 672038912. Throughput: 0: 43683.6. Samples: 574982900. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-27 17:22:18,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:22:20,126][06909] Updated weights for policy 0, policy_version 41023 (0.0036) [2024-06-27 17:22:23,381][06887] Signal inference workers to stop experience collection... (8300 times) [2024-06-27 17:22:23,381][06887] Signal inference workers to resume experience collection... (8300 times) [2024-06-27 17:22:23,427][06909] InferenceWorker_p0-w0: stopping experience collection (8300 times) [2024-06-27 17:22:23,427][06909] InferenceWorker_p0-w0: resuming experience collection (8300 times) [2024-06-27 17:22:23,834][06909] Updated weights for policy 0, policy_version 41033 (0.0034) [2024-06-27 17:22:23,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43690.6, 300 sec: 43820.3). Total num frames: 672284672. Throughput: 0: 43835.0. Samples: 575237300. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-27 17:22:23,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 17:22:27,510][06909] Updated weights for policy 0, policy_version 41043 (0.0038) [2024-06-27 17:22:28,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43417.6, 300 sec: 43764.7). Total num frames: 672497664. Throughput: 0: 43862.9. Samples: 575372180. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-27 17:22:28,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 17:22:31,424][06909] Updated weights for policy 0, policy_version 41053 (0.0033) [2024-06-27 17:22:33,850][06674] Fps is (10 sec: 40960.6, 60 sec: 43690.8, 300 sec: 43765.6). Total num frames: 672694272. Throughput: 0: 43627.7. Samples: 575633480. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-27 17:22:33,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:22:35,102][06909] Updated weights for policy 0, policy_version 41063 (0.0028) [2024-06-27 17:22:38,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 672923648. Throughput: 0: 43602.3. Samples: 575890520. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-27 17:22:38,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:22:39,069][06909] Updated weights for policy 0, policy_version 41073 (0.0025) [2024-06-27 17:22:42,445][06909] Updated weights for policy 0, policy_version 41083 (0.0035) [2024-06-27 17:22:43,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43417.7, 300 sec: 43764.7). Total num frames: 673153024. Throughput: 0: 43558.7. Samples: 576024320. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-27 17:22:43,852][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:22:46,279][06909] Updated weights for policy 0, policy_version 41093 (0.0027) [2024-06-27 17:22:48,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 673349632. Throughput: 0: 43730.7. Samples: 576290540. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 17:22:48,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:22:50,025][06909] Updated weights for policy 0, policy_version 41103 (0.0025) [2024-06-27 17:22:53,603][06909] Updated weights for policy 0, policy_version 41113 (0.0045) [2024-06-27 17:22:53,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.8, 300 sec: 43765.0). Total num frames: 673595392. Throughput: 0: 43708.9. Samples: 576550000. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 17:22:53,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:22:57,387][06909] Updated weights for policy 0, policy_version 41123 (0.0029) [2024-06-27 17:22:58,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43417.6, 300 sec: 43709.5). Total num frames: 673808384. Throughput: 0: 43782.2. Samples: 576690320. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 17:22:58,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 17:23:01,161][06909] Updated weights for policy 0, policy_version 41133 (0.0033) [2024-06-27 17:23:03,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.7, 300 sec: 43820.3). Total num frames: 674021376. Throughput: 0: 43577.4. Samples: 576943880. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 17:23:03,850][06674] Avg episode reward: [(0, '0.393')] [2024-06-27 17:23:04,757][06909] Updated weights for policy 0, policy_version 41143 (0.0025) [2024-06-27 17:23:08,852][06674] Fps is (10 sec: 42589.8, 60 sec: 43416.1, 300 sec: 43708.9). Total num frames: 674234368. Throughput: 0: 43696.8. Samples: 577203740. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 17:23:08,852][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:23:09,071][06909] Updated weights for policy 0, policy_version 41153 (0.0032) [2024-06-27 17:23:12,401][06909] Updated weights for policy 0, policy_version 41163 (0.0039) [2024-06-27 17:23:13,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.7, 300 sec: 43820.3). Total num frames: 674480128. Throughput: 0: 43743.5. Samples: 577340640. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 17:23:13,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:23:16,726][06909] Updated weights for policy 0, policy_version 41173 (0.0021) [2024-06-27 17:23:18,850][06674] Fps is (10 sec: 44245.2, 60 sec: 43963.7, 300 sec: 43820.2). Total num frames: 674676736. Throughput: 0: 43750.0. Samples: 577602240. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 17:23:18,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:23:19,925][06909] Updated weights for policy 0, policy_version 41183 (0.0037) [2024-06-27 17:23:23,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43417.6, 300 sec: 43653.6). Total num frames: 674889728. Throughput: 0: 43861.3. Samples: 577864280. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 17:23:23,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:23:23,892][06909] Updated weights for policy 0, policy_version 41193 (0.0035) [2024-06-27 17:23:27,610][06909] Updated weights for policy 0, policy_version 41203 (0.0034) [2024-06-27 17:23:28,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43963.6, 300 sec: 43875.8). Total num frames: 675135488. Throughput: 0: 43839.4. Samples: 577997100. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 17:23:28,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:23:31,275][06909] Updated weights for policy 0, policy_version 41213 (0.0024) [2024-06-27 17:23:33,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43963.7, 300 sec: 43821.2). Total num frames: 675332096. Throughput: 0: 43831.6. Samples: 578262960. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 17:23:33,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:23:34,931][06909] Updated weights for policy 0, policy_version 41223 (0.0046) [2024-06-27 17:23:38,833][06909] Updated weights for policy 0, policy_version 41233 (0.0023) [2024-06-27 17:23:38,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 675561472. Throughput: 0: 44001.3. Samples: 578530060. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 17:23:38,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:23:42,175][06909] Updated weights for policy 0, policy_version 41243 (0.0022) [2024-06-27 17:23:43,852][06674] Fps is (10 sec: 45866.8, 60 sec: 43962.4, 300 sec: 43875.5). Total num frames: 675790848. Throughput: 0: 43642.3. Samples: 578654300. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 17:23:43,852][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:23:44,933][06887] Signal inference workers to stop experience collection... (8350 times) [2024-06-27 17:23:44,933][06887] Signal inference workers to resume experience collection... (8350 times) [2024-06-27 17:23:44,952][06909] InferenceWorker_p0-w0: stopping experience collection (8350 times) [2024-06-27 17:23:44,952][06909] InferenceWorker_p0-w0: resuming experience collection (8350 times) [2024-06-27 17:23:46,435][06909] Updated weights for policy 0, policy_version 41253 (0.0040) [2024-06-27 17:23:48,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.6, 300 sec: 43820.2). Total num frames: 675987456. Throughput: 0: 43941.2. Samples: 578921240. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 17:23:48,853][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:23:48,942][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000041260_676003840.pth... [2024-06-27 17:23:48,995][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000040618_665485312.pth [2024-06-27 17:23:49,707][06909] Updated weights for policy 0, policy_version 41263 (0.0040) [2024-06-27 17:23:53,850][06674] Fps is (10 sec: 40967.2, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 676200448. Throughput: 0: 43948.7. Samples: 579181340. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 17:23:53,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:23:54,009][06909] Updated weights for policy 0, policy_version 41273 (0.0036) [2024-06-27 17:23:57,386][06909] Updated weights for policy 0, policy_version 41283 (0.0037) [2024-06-27 17:23:58,852][06674] Fps is (10 sec: 45866.2, 60 sec: 43962.2, 300 sec: 43820.0). Total num frames: 676446208. Throughput: 0: 43823.8. Samples: 579312800. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 17:23:58,852][06674] Avg episode reward: [(0, '0.392')] [2024-06-27 17:24:01,527][06909] Updated weights for policy 0, policy_version 41293 (0.0039) [2024-06-27 17:24:03,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 676659200. Throughput: 0: 43851.2. Samples: 579575540. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 17:24:03,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:24:04,850][06909] Updated weights for policy 0, policy_version 41303 (0.0034) [2024-06-27 17:24:08,850][06674] Fps is (10 sec: 40968.7, 60 sec: 43692.2, 300 sec: 43653.7). Total num frames: 676855808. Throughput: 0: 43862.8. Samples: 579838100. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 17:24:08,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:24:08,903][06909] Updated weights for policy 0, policy_version 41313 (0.0027) [2024-06-27 17:24:12,304][06909] Updated weights for policy 0, policy_version 41323 (0.0026) [2024-06-27 17:24:13,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 677101568. Throughput: 0: 43946.7. Samples: 579974700. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 17:24:13,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:24:16,114][06909] Updated weights for policy 0, policy_version 41333 (0.0027) [2024-06-27 17:24:18,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43690.7, 300 sec: 43820.2). Total num frames: 677298176. Throughput: 0: 43917.6. Samples: 580239260. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 17:24:18,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:24:19,616][06909] Updated weights for policy 0, policy_version 41343 (0.0028) [2024-06-27 17:24:23,394][06909] Updated weights for policy 0, policy_version 41353 (0.0023) [2024-06-27 17:24:23,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.8, 300 sec: 43709.2). Total num frames: 677527552. Throughput: 0: 43686.3. Samples: 580495940. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 17:24:23,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 17:24:27,182][06909] Updated weights for policy 0, policy_version 41363 (0.0037) [2024-06-27 17:24:28,850][06674] Fps is (10 sec: 47513.6, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 677773312. Throughput: 0: 43939.9. Samples: 580631520. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 17:24:28,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:24:31,136][06909] Updated weights for policy 0, policy_version 41373 (0.0027) [2024-06-27 17:24:33,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 677969920. Throughput: 0: 43823.6. Samples: 580893300. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 17:24:33,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:24:34,698][06909] Updated weights for policy 0, policy_version 41383 (0.0023) [2024-06-27 17:24:38,817][06909] Updated weights for policy 0, policy_version 41393 (0.0033) [2024-06-27 17:24:38,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 678182912. Throughput: 0: 43840.8. Samples: 581154180. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 17:24:38,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:24:42,546][06909] Updated weights for policy 0, policy_version 41403 (0.0033) [2024-06-27 17:24:43,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43691.9, 300 sec: 43875.8). Total num frames: 678412288. Throughput: 0: 43857.5. Samples: 581286300. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 17:24:43,851][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:24:46,107][06909] Updated weights for policy 0, policy_version 41413 (0.0033) [2024-06-27 17:24:48,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 678625280. Throughput: 0: 43776.4. Samples: 581545480. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 17:24:48,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:24:49,882][06909] Updated weights for policy 0, policy_version 41423 (0.0030) [2024-06-27 17:24:53,465][06909] Updated weights for policy 0, policy_version 41433 (0.0045) [2024-06-27 17:24:53,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.8, 300 sec: 43764.7). Total num frames: 678854656. Throughput: 0: 43802.1. Samples: 581809200. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 17:24:53,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:24:57,364][06909] Updated weights for policy 0, policy_version 41443 (0.0033) [2024-06-27 17:24:58,230][06887] Signal inference workers to stop experience collection... (8400 times) [2024-06-27 17:24:58,231][06887] Signal inference workers to resume experience collection... (8400 times) [2024-06-27 17:24:58,247][06909] InferenceWorker_p0-w0: stopping experience collection (8400 times) [2024-06-27 17:24:58,277][06909] InferenceWorker_p0-w0: resuming experience collection (8400 times) [2024-06-27 17:24:58,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43965.2, 300 sec: 43986.9). Total num frames: 679084032. Throughput: 0: 43846.6. Samples: 581947800. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-27 17:24:58,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:25:00,788][06909] Updated weights for policy 0, policy_version 41453 (0.0040) [2024-06-27 17:25:03,850][06674] Fps is (10 sec: 39321.8, 60 sec: 43144.6, 300 sec: 43764.7). Total num frames: 679247872. Throughput: 0: 43865.9. Samples: 582213220. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-27 17:25:03,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:25:04,690][06909] Updated weights for policy 0, policy_version 41463 (0.0032) [2024-06-27 17:25:08,286][06909] Updated weights for policy 0, policy_version 41473 (0.0031) [2024-06-27 17:25:08,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43963.7, 300 sec: 43709.3). Total num frames: 679493632. Throughput: 0: 43878.2. Samples: 582470460. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-27 17:25:08,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:25:12,162][06909] Updated weights for policy 0, policy_version 41483 (0.0033) [2024-06-27 17:25:13,850][06674] Fps is (10 sec: 47513.9, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 679723008. Throughput: 0: 43736.6. Samples: 582599660. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-27 17:25:13,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:25:16,517][06909] Updated weights for policy 0, policy_version 41493 (0.0038) [2024-06-27 17:25:18,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.8, 300 sec: 43764.7). Total num frames: 679919616. Throughput: 0: 43609.0. Samples: 582855700. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-27 17:25:18,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:25:19,966][06909] Updated weights for policy 0, policy_version 41503 (0.0024) [2024-06-27 17:25:23,831][06909] Updated weights for policy 0, policy_version 41513 (0.0027) [2024-06-27 17:25:23,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 680148992. Throughput: 0: 43700.9. Samples: 583120720. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-27 17:25:23,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:25:27,240][06909] Updated weights for policy 0, policy_version 41523 (0.0042) [2024-06-27 17:25:28,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43417.7, 300 sec: 43820.3). Total num frames: 680378368. Throughput: 0: 43613.0. Samples: 583248880. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-27 17:25:28,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:25:31,108][06909] Updated weights for policy 0, policy_version 41533 (0.0032) [2024-06-27 17:25:33,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 680591360. Throughput: 0: 43784.5. Samples: 583515780. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-27 17:25:33,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:25:34,599][06909] Updated weights for policy 0, policy_version 41543 (0.0039) [2024-06-27 17:25:38,469][06909] Updated weights for policy 0, policy_version 41553 (0.0047) [2024-06-27 17:25:38,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 680804352. Throughput: 0: 43828.5. Samples: 583781480. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-27 17:25:38,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 17:25:42,398][06909] Updated weights for policy 0, policy_version 41563 (0.0038) [2024-06-27 17:25:43,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 681050112. Throughput: 0: 43781.7. Samples: 583917980. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-27 17:25:43,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:25:45,651][06909] Updated weights for policy 0, policy_version 41573 (0.0033) [2024-06-27 17:25:48,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43690.8, 300 sec: 43764.7). Total num frames: 681246720. Throughput: 0: 43675.6. Samples: 584178620. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-27 17:25:48,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:25:48,861][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000041580_681246720.pth... [2024-06-27 17:25:48,919][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000040938_670728192.pth [2024-06-27 17:25:49,711][06909] Updated weights for policy 0, policy_version 41583 (0.0033) [2024-06-27 17:25:53,354][06909] Updated weights for policy 0, policy_version 41593 (0.0026) [2024-06-27 17:25:53,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43690.7, 300 sec: 43709.5). Total num frames: 681476096. Throughput: 0: 43781.8. Samples: 584440640. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2024-06-27 17:25:53,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 17:25:57,071][06909] Updated weights for policy 0, policy_version 41603 (0.0030) [2024-06-27 17:25:58,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43417.7, 300 sec: 43764.7). Total num frames: 681689088. Throughput: 0: 43970.6. Samples: 584578340. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2024-06-27 17:25:58,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:26:00,918][06909] Updated weights for policy 0, policy_version 41613 (0.0032) [2024-06-27 17:26:03,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43963.8, 300 sec: 43709.2). Total num frames: 681885696. Throughput: 0: 43908.9. Samples: 584831600. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2024-06-27 17:26:03,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:26:04,835][06909] Updated weights for policy 0, policy_version 41623 (0.0034) [2024-06-27 17:26:08,157][06909] Updated weights for policy 0, policy_version 41633 (0.0036) [2024-06-27 17:26:08,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44236.8, 300 sec: 43764.7). Total num frames: 682147840. Throughput: 0: 43841.3. Samples: 585093580. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2024-06-27 17:26:08,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:26:12,137][06909] Updated weights for policy 0, policy_version 41643 (0.0032) [2024-06-27 17:26:12,159][06887] Signal inference workers to stop experience collection... (8450 times) [2024-06-27 17:26:12,160][06887] Signal inference workers to resume experience collection... (8450 times) [2024-06-27 17:26:12,210][06909] InferenceWorker_p0-w0: stopping experience collection (8450 times) [2024-06-27 17:26:12,211][06909] InferenceWorker_p0-w0: resuming experience collection (8450 times) [2024-06-27 17:26:13,850][06674] Fps is (10 sec: 49151.9, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 682377216. Throughput: 0: 44099.1. Samples: 585233340. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2024-06-27 17:26:13,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:26:15,390][06909] Updated weights for policy 0, policy_version 41653 (0.0032) [2024-06-27 17:26:18,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 682557440. Throughput: 0: 44110.7. Samples: 585500760. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 17:26:18,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:26:19,391][06909] Updated weights for policy 0, policy_version 41663 (0.0036) [2024-06-27 17:26:22,757][06909] Updated weights for policy 0, policy_version 41673 (0.0034) [2024-06-27 17:26:23,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44236.8, 300 sec: 43764.7). Total num frames: 682803200. Throughput: 0: 44118.7. Samples: 585766820. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 17:26:23,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:26:27,031][06909] Updated weights for policy 0, policy_version 41683 (0.0034) [2024-06-27 17:26:28,850][06674] Fps is (10 sec: 47513.8, 60 sec: 44236.8, 300 sec: 43931.4). Total num frames: 683032576. Throughput: 0: 44089.1. Samples: 585901980. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 17:26:28,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:26:30,044][06909] Updated weights for policy 0, policy_version 41693 (0.0026) [2024-06-27 17:26:33,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 683212800. Throughput: 0: 43992.3. Samples: 586158280. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 17:26:33,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:26:34,538][06909] Updated weights for policy 0, policy_version 41703 (0.0032) [2024-06-27 17:26:37,950][06909] Updated weights for policy 0, policy_version 41713 (0.0031) [2024-06-27 17:26:38,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44236.8, 300 sec: 43764.7). Total num frames: 683458560. Throughput: 0: 43922.3. Samples: 586417140. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 17:26:38,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:26:41,905][06909] Updated weights for policy 0, policy_version 41723 (0.0039) [2024-06-27 17:26:43,850][06674] Fps is (10 sec: 47514.0, 60 sec: 43963.9, 300 sec: 43931.4). Total num frames: 683687936. Throughput: 0: 43884.0. Samples: 586553120. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-27 17:26:43,850][06674] Avg episode reward: [(0, '0.409')] [2024-06-27 17:26:45,337][06909] Updated weights for policy 0, policy_version 41733 (0.0030) [2024-06-27 17:26:48,850][06674] Fps is (10 sec: 40959.2, 60 sec: 43690.5, 300 sec: 43764.7). Total num frames: 683868160. Throughput: 0: 44025.6. Samples: 586812760. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-27 17:26:48,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:26:49,718][06909] Updated weights for policy 0, policy_version 41743 (0.0030) [2024-06-27 17:26:53,037][06909] Updated weights for policy 0, policy_version 41753 (0.0037) [2024-06-27 17:26:53,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43963.7, 300 sec: 43764.7). Total num frames: 684113920. Throughput: 0: 43830.7. Samples: 587065960. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-27 17:26:53,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:26:57,230][06909] Updated weights for policy 0, policy_version 41763 (0.0051) [2024-06-27 17:26:58,850][06674] Fps is (10 sec: 47514.0, 60 sec: 44236.7, 300 sec: 43931.3). Total num frames: 684343296. Throughput: 0: 43833.7. Samples: 587205860. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-27 17:26:58,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:27:00,465][06909] Updated weights for policy 0, policy_version 41773 (0.0030) [2024-06-27 17:27:03,852][06674] Fps is (10 sec: 40952.1, 60 sec: 43962.2, 300 sec: 43708.9). Total num frames: 684523520. Throughput: 0: 43756.3. Samples: 587469880. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-27 17:27:03,852][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:27:04,637][06909] Updated weights for policy 0, policy_version 41783 (0.0035) [2024-06-27 17:27:07,827][06909] Updated weights for policy 0, policy_version 41793 (0.0031) [2024-06-27 17:27:08,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.7, 300 sec: 43820.2). Total num frames: 684769280. Throughput: 0: 43567.6. Samples: 587727360. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-27 17:27:08,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:27:12,121][06909] Updated weights for policy 0, policy_version 41803 (0.0033) [2024-06-27 17:27:13,850][06674] Fps is (10 sec: 47522.8, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 684998656. Throughput: 0: 43624.8. Samples: 587865100. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 17:27:13,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:27:15,353][06909] Updated weights for policy 0, policy_version 41813 (0.0027) [2024-06-27 17:27:18,856][06674] Fps is (10 sec: 42572.4, 60 sec: 43959.2, 300 sec: 43763.8). Total num frames: 685195264. Throughput: 0: 43733.2. Samples: 588126540. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 17:27:18,857][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:27:19,662][06909] Updated weights for policy 0, policy_version 41823 (0.0030) [2024-06-27 17:27:22,807][06909] Updated weights for policy 0, policy_version 41833 (0.0032) [2024-06-27 17:27:23,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.6, 300 sec: 43820.2). Total num frames: 685424640. Throughput: 0: 43771.0. Samples: 588386840. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 17:27:23,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:27:27,313][06909] Updated weights for policy 0, policy_version 41843 (0.0039) [2024-06-27 17:27:28,850][06674] Fps is (10 sec: 45903.1, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 685654016. Throughput: 0: 43782.6. Samples: 588523340. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 17:27:28,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:27:30,243][06909] Updated weights for policy 0, policy_version 41853 (0.0037) [2024-06-27 17:27:33,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 685834240. Throughput: 0: 43742.9. Samples: 588781180. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 17:27:33,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:27:34,649][06909] Updated weights for policy 0, policy_version 41863 (0.0028) [2024-06-27 17:27:37,622][06909] Updated weights for policy 0, policy_version 41873 (0.0041) [2024-06-27 17:27:38,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.6, 300 sec: 43820.3). Total num frames: 686080000. Throughput: 0: 43893.9. Samples: 589041180. Policy #0 lag: (min: 0.0, avg: 11.7, max: 26.0) [2024-06-27 17:27:38,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:27:39,374][06887] Signal inference workers to stop experience collection... (8500 times) [2024-06-27 17:27:39,374][06887] Signal inference workers to resume experience collection... (8500 times) [2024-06-27 17:27:39,387][06909] InferenceWorker_p0-w0: stopping experience collection (8500 times) [2024-06-27 17:27:39,387][06909] InferenceWorker_p0-w0: resuming experience collection (8500 times) [2024-06-27 17:27:42,062][06909] Updated weights for policy 0, policy_version 41883 (0.0033) [2024-06-27 17:27:43,850][06674] Fps is (10 sec: 47513.1, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 686309376. Throughput: 0: 43850.7. Samples: 589179140. Policy #0 lag: (min: 0.0, avg: 11.7, max: 26.0) [2024-06-27 17:27:43,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:27:45,360][06909] Updated weights for policy 0, policy_version 41893 (0.0036) [2024-06-27 17:27:48,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 686489600. Throughput: 0: 43848.6. Samples: 589442980. Policy #0 lag: (min: 0.0, avg: 11.7, max: 26.0) [2024-06-27 17:27:48,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:27:48,993][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000041901_686505984.pth... [2024-06-27 17:27:49,034][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000041260_676003840.pth [2024-06-27 17:27:49,476][06909] Updated weights for policy 0, policy_version 41903 (0.0029) [2024-06-27 17:27:52,720][06909] Updated weights for policy 0, policy_version 41913 (0.0032) [2024-06-27 17:27:53,850][06674] Fps is (10 sec: 40960.6, 60 sec: 43417.7, 300 sec: 43764.7). Total num frames: 686718976. Throughput: 0: 43829.0. Samples: 589699660. Policy #0 lag: (min: 0.0, avg: 11.7, max: 26.0) [2024-06-27 17:27:53,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:27:57,137][06909] Updated weights for policy 0, policy_version 41923 (0.0022) [2024-06-27 17:27:58,850][06674] Fps is (10 sec: 45875.8, 60 sec: 43417.7, 300 sec: 43820.3). Total num frames: 686948352. Throughput: 0: 43723.3. Samples: 589832640. Policy #0 lag: (min: 0.0, avg: 11.7, max: 26.0) [2024-06-27 17:27:58,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:28:00,171][06909] Updated weights for policy 0, policy_version 41933 (0.0029) [2024-06-27 17:28:03,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43692.2, 300 sec: 43765.0). Total num frames: 687144960. Throughput: 0: 43805.1. Samples: 590097500. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-27 17:28:03,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:28:04,551][06909] Updated weights for policy 0, policy_version 41943 (0.0043) [2024-06-27 17:28:07,579][06909] Updated weights for policy 0, policy_version 41953 (0.0034) [2024-06-27 17:28:08,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 687374336. Throughput: 0: 43653.0. Samples: 590351220. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-27 17:28:08,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:28:12,061][06909] Updated weights for policy 0, policy_version 41963 (0.0041) [2024-06-27 17:28:13,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43417.6, 300 sec: 43820.3). Total num frames: 687603712. Throughput: 0: 43709.8. Samples: 590490280. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-27 17:28:13,851][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 17:28:15,171][06909] Updated weights for policy 0, policy_version 41973 (0.0030) [2024-06-27 17:28:18,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43421.9, 300 sec: 43764.7). Total num frames: 687800320. Throughput: 0: 43681.1. Samples: 590746840. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-27 17:28:18,851][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:28:19,415][06909] Updated weights for policy 0, policy_version 41983 (0.0024) [2024-06-27 17:28:22,570][06909] Updated weights for policy 0, policy_version 41993 (0.0030) [2024-06-27 17:28:23,852][06674] Fps is (10 sec: 44228.1, 60 sec: 43689.2, 300 sec: 43764.4). Total num frames: 688046080. Throughput: 0: 43708.7. Samples: 591008160. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-27 17:28:23,852][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:28:26,866][06909] Updated weights for policy 0, policy_version 42003 (0.0034) [2024-06-27 17:28:28,850][06674] Fps is (10 sec: 49152.5, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 688291840. Throughput: 0: 43684.9. Samples: 591144960. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-27 17:28:28,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:28:29,872][06909] Updated weights for policy 0, policy_version 42013 (0.0027) [2024-06-27 17:28:33,850][06674] Fps is (10 sec: 44245.6, 60 sec: 44236.7, 300 sec: 43820.3). Total num frames: 688488448. Throughput: 0: 43817.3. Samples: 591414760. Policy #0 lag: (min: 0.0, avg: 12.1, max: 23.0) [2024-06-27 17:28:33,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:28:34,089][06909] Updated weights for policy 0, policy_version 42023 (0.0027) [2024-06-27 17:28:37,615][06909] Updated weights for policy 0, policy_version 42033 (0.0033) [2024-06-27 17:28:38,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.6, 300 sec: 43765.0). Total num frames: 688701440. Throughput: 0: 43833.6. Samples: 591672180. Policy #0 lag: (min: 0.0, avg: 12.1, max: 23.0) [2024-06-27 17:28:38,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:28:41,568][06909] Updated weights for policy 0, policy_version 42043 (0.0034) [2024-06-27 17:28:43,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 688930816. Throughput: 0: 43907.1. Samples: 591808460. Policy #0 lag: (min: 0.0, avg: 12.1, max: 23.0) [2024-06-27 17:28:43,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:28:45,009][06909] Updated weights for policy 0, policy_version 42053 (0.0035) [2024-06-27 17:28:48,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 689111040. Throughput: 0: 43781.7. Samples: 592067680. Policy #0 lag: (min: 0.0, avg: 12.1, max: 23.0) [2024-06-27 17:28:48,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:28:49,419][06909] Updated weights for policy 0, policy_version 42063 (0.0037) [2024-06-27 17:28:52,565][06909] Updated weights for policy 0, policy_version 42073 (0.0040) [2024-06-27 17:28:53,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.7, 300 sec: 43765.0). Total num frames: 689356800. Throughput: 0: 43904.0. Samples: 592326900. Policy #0 lag: (min: 0.0, avg: 12.1, max: 23.0) [2024-06-27 17:28:53,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:28:56,590][06887] Signal inference workers to stop experience collection... (8550 times) [2024-06-27 17:28:56,648][06887] Signal inference workers to resume experience collection... (8550 times) [2024-06-27 17:28:56,650][06909] InferenceWorker_p0-w0: stopping experience collection (8550 times) [2024-06-27 17:28:56,668][06909] InferenceWorker_p0-w0: resuming experience collection (8550 times) [2024-06-27 17:28:56,828][06909] Updated weights for policy 0, policy_version 42083 (0.0032) [2024-06-27 17:28:58,850][06674] Fps is (10 sec: 47513.6, 60 sec: 43963.6, 300 sec: 43820.3). Total num frames: 689586176. Throughput: 0: 43930.2. Samples: 592467140. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2024-06-27 17:28:58,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 17:29:00,090][06909] Updated weights for policy 0, policy_version 42093 (0.0021) [2024-06-27 17:29:03,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 689766400. Throughput: 0: 43939.4. Samples: 592724100. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2024-06-27 17:29:03,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:29:04,183][06909] Updated weights for policy 0, policy_version 42103 (0.0037) [2024-06-27 17:29:07,554][06909] Updated weights for policy 0, policy_version 42113 (0.0026) [2024-06-27 17:29:08,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44509.9, 300 sec: 43875.8). Total num frames: 690044928. Throughput: 0: 43984.2. Samples: 592987360. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2024-06-27 17:29:08,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:29:11,392][06909] Updated weights for policy 0, policy_version 42123 (0.0052) [2024-06-27 17:29:13,850][06674] Fps is (10 sec: 49151.5, 60 sec: 44236.8, 300 sec: 43931.4). Total num frames: 690257920. Throughput: 0: 43928.5. Samples: 593121740. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2024-06-27 17:29:13,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 17:29:14,898][06909] Updated weights for policy 0, policy_version 42133 (0.0028) [2024-06-27 17:29:18,850][06674] Fps is (10 sec: 40960.0, 60 sec: 44236.9, 300 sec: 43820.3). Total num frames: 690454528. Throughput: 0: 43912.5. Samples: 593390820. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2024-06-27 17:29:18,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:29:18,976][06909] Updated weights for policy 0, policy_version 42143 (0.0044) [2024-06-27 17:29:22,743][06909] Updated weights for policy 0, policy_version 42153 (0.0032) [2024-06-27 17:29:23,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43692.1, 300 sec: 43709.2). Total num frames: 690667520. Throughput: 0: 43690.7. Samples: 593638260. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 17:29:23,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:29:26,505][06909] Updated weights for policy 0, policy_version 42163 (0.0025) [2024-06-27 17:29:28,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 690913280. Throughput: 0: 43700.3. Samples: 593774980. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 17:29:28,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:29:29,975][06909] Updated weights for policy 0, policy_version 42173 (0.0037) [2024-06-27 17:29:33,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.6, 300 sec: 43820.3). Total num frames: 691109888. Throughput: 0: 43863.0. Samples: 594041520. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 17:29:33,851][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:29:34,072][06909] Updated weights for policy 0, policy_version 42183 (0.0045) [2024-06-27 17:29:37,198][06909] Updated weights for policy 0, policy_version 42193 (0.0037) [2024-06-27 17:29:38,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.8, 300 sec: 43820.3). Total num frames: 691339264. Throughput: 0: 43932.0. Samples: 594303840. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 17:29:38,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:29:41,352][06909] Updated weights for policy 0, policy_version 42203 (0.0035) [2024-06-27 17:29:43,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.6, 300 sec: 43820.3). Total num frames: 691552256. Throughput: 0: 43823.5. Samples: 594439200. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 17:29:43,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:29:44,951][06909] Updated weights for policy 0, policy_version 42213 (0.0029) [2024-06-27 17:29:48,810][06909] Updated weights for policy 0, policy_version 42223 (0.0025) [2024-06-27 17:29:48,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44509.9, 300 sec: 43820.3). Total num frames: 691781632. Throughput: 0: 43921.2. Samples: 594700560. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 17:29:48,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:29:48,937][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000042224_691798016.pth... [2024-06-27 17:29:48,987][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000041580_681246720.pth [2024-06-27 17:29:49,876][06887] Signal inference workers to stop experience collection... (8600 times) [2024-06-27 17:29:49,881][06887] Signal inference workers to resume experience collection... (8600 times) [2024-06-27 17:29:49,897][06909] InferenceWorker_p0-w0: stopping experience collection (8600 times) [2024-06-27 17:29:49,931][06909] InferenceWorker_p0-w0: resuming experience collection (8600 times) [2024-06-27 17:29:52,312][06909] Updated weights for policy 0, policy_version 42233 (0.0033) [2024-06-27 17:29:53,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43963.8, 300 sec: 43764.7). Total num frames: 691994624. Throughput: 0: 43869.8. Samples: 594961500. Policy #0 lag: (min: 1.0, avg: 11.4, max: 21.0) [2024-06-27 17:29:53,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:29:56,209][06909] Updated weights for policy 0, policy_version 42243 (0.0040) [2024-06-27 17:29:58,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 692224000. Throughput: 0: 43896.8. Samples: 595097100. Policy #0 lag: (min: 1.0, avg: 11.4, max: 21.0) [2024-06-27 17:29:58,850][06674] Avg episode reward: [(0, '0.396')] [2024-06-27 17:29:59,824][06909] Updated weights for policy 0, policy_version 42253 (0.0038) [2024-06-27 17:30:03,653][06909] Updated weights for policy 0, policy_version 42263 (0.0032) [2024-06-27 17:30:03,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44509.7, 300 sec: 43875.8). Total num frames: 692436992. Throughput: 0: 43931.0. Samples: 595367720. Policy #0 lag: (min: 1.0, avg: 11.4, max: 21.0) [2024-06-27 17:30:03,851][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:30:07,190][06909] Updated weights for policy 0, policy_version 42273 (0.0038) [2024-06-27 17:30:08,850][06674] Fps is (10 sec: 42597.4, 60 sec: 43417.4, 300 sec: 43820.2). Total num frames: 692649984. Throughput: 0: 44092.7. Samples: 595622440. Policy #0 lag: (min: 1.0, avg: 11.4, max: 21.0) [2024-06-27 17:30:08,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:30:11,270][06909] Updated weights for policy 0, policy_version 42283 (0.0037) [2024-06-27 17:30:13,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 692879360. Throughput: 0: 43957.0. Samples: 595753040. Policy #0 lag: (min: 1.0, avg: 11.4, max: 21.0) [2024-06-27 17:30:13,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:30:14,955][06909] Updated weights for policy 0, policy_version 42293 (0.0045) [2024-06-27 17:30:18,762][06909] Updated weights for policy 0, policy_version 42303 (0.0028) [2024-06-27 17:30:18,850][06674] Fps is (10 sec: 44238.1, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 693092352. Throughput: 0: 43992.1. Samples: 596021160. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-27 17:30:18,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 17:30:22,384][06909] Updated weights for policy 0, policy_version 42313 (0.0033) [2024-06-27 17:30:23,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.8, 300 sec: 43820.3). Total num frames: 693305344. Throughput: 0: 43874.2. Samples: 596278180. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-27 17:30:23,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:30:26,174][06909] Updated weights for policy 0, policy_version 42323 (0.0030) [2024-06-27 17:30:28,856][06674] Fps is (10 sec: 44209.8, 60 sec: 43686.3, 300 sec: 43874.9). Total num frames: 693534720. Throughput: 0: 43660.4. Samples: 596404180. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-27 17:30:28,856][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:30:29,925][06909] Updated weights for policy 0, policy_version 42333 (0.0030) [2024-06-27 17:30:33,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 693731328. Throughput: 0: 43784.5. Samples: 596670860. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-27 17:30:33,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:30:33,898][06909] Updated weights for policy 0, policy_version 42343 (0.0027) [2024-06-27 17:30:37,389][06909] Updated weights for policy 0, policy_version 42353 (0.0029) [2024-06-27 17:30:38,850][06674] Fps is (10 sec: 40985.2, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 693944320. Throughput: 0: 43812.5. Samples: 596933060. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-27 17:30:38,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:30:41,411][06909] Updated weights for policy 0, policy_version 42363 (0.0032) [2024-06-27 17:30:43,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 694190080. Throughput: 0: 43763.1. Samples: 597066440. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-27 17:30:43,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:30:44,932][06909] Updated weights for policy 0, policy_version 42373 (0.0044) [2024-06-27 17:30:48,675][06909] Updated weights for policy 0, policy_version 42383 (0.0031) [2024-06-27 17:30:48,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 694403072. Throughput: 0: 43678.3. Samples: 597333240. Policy #0 lag: (min: 0.0, avg: 11.0, max: 24.0) [2024-06-27 17:30:48,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:30:52,232][06909] Updated weights for policy 0, policy_version 42393 (0.0043) [2024-06-27 17:30:53,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43417.6, 300 sec: 43764.7). Total num frames: 694599680. Throughput: 0: 43741.2. Samples: 597590780. Policy #0 lag: (min: 0.0, avg: 11.0, max: 24.0) [2024-06-27 17:30:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 17:30:56,079][06909] Updated weights for policy 0, policy_version 42403 (0.0025) [2024-06-27 17:30:58,850][06674] Fps is (10 sec: 45874.4, 60 sec: 43963.7, 300 sec: 43986.8). Total num frames: 694861824. Throughput: 0: 43650.1. Samples: 597717300. Policy #0 lag: (min: 0.0, avg: 11.0, max: 24.0) [2024-06-27 17:30:58,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:30:59,656][06909] Updated weights for policy 0, policy_version 42413 (0.0022) [2024-06-27 17:31:03,192][06887] Signal inference workers to stop experience collection... (8650 times) [2024-06-27 17:31:03,235][06909] InferenceWorker_p0-w0: stopping experience collection (8650 times) [2024-06-27 17:31:03,249][06887] Signal inference workers to resume experience collection... (8650 times) [2024-06-27 17:31:03,252][06909] InferenceWorker_p0-w0: resuming experience collection (8650 times) [2024-06-27 17:31:03,398][06909] Updated weights for policy 0, policy_version 42423 (0.0026) [2024-06-27 17:31:03,850][06674] Fps is (10 sec: 47513.8, 60 sec: 43963.9, 300 sec: 43820.3). Total num frames: 695074816. Throughput: 0: 43782.3. Samples: 597991360. Policy #0 lag: (min: 0.0, avg: 11.0, max: 24.0) [2024-06-27 17:31:03,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:31:06,957][06909] Updated weights for policy 0, policy_version 42433 (0.0031) [2024-06-27 17:31:08,850][06674] Fps is (10 sec: 39321.8, 60 sec: 43417.7, 300 sec: 43653.6). Total num frames: 695255040. Throughput: 0: 43843.9. Samples: 598251160. Policy #0 lag: (min: 0.0, avg: 11.0, max: 24.0) [2024-06-27 17:31:08,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:31:10,904][06909] Updated weights for policy 0, policy_version 42443 (0.0026) [2024-06-27 17:31:13,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 695500800. Throughput: 0: 43895.7. Samples: 598379220. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 17:31:13,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:31:14,612][06909] Updated weights for policy 0, policy_version 42453 (0.0054) [2024-06-27 17:31:18,252][06909] Updated weights for policy 0, policy_version 42463 (0.0032) [2024-06-27 17:31:18,852][06674] Fps is (10 sec: 47504.2, 60 sec: 43962.2, 300 sec: 43820.0). Total num frames: 695730176. Throughput: 0: 43847.8. Samples: 598644100. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 17:31:18,853][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 17:31:22,153][06909] Updated weights for policy 0, policy_version 42473 (0.0032) [2024-06-27 17:31:23,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 695926784. Throughput: 0: 44007.9. Samples: 598913420. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 17:31:23,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 17:31:26,009][06909] Updated weights for policy 0, policy_version 42483 (0.0045) [2024-06-27 17:31:28,850][06674] Fps is (10 sec: 44246.3, 60 sec: 43968.3, 300 sec: 43931.4). Total num frames: 696172544. Throughput: 0: 43880.5. Samples: 599041060. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 17:31:28,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 17:31:29,578][06909] Updated weights for policy 0, policy_version 42493 (0.0029) [2024-06-27 17:31:33,266][06909] Updated weights for policy 0, policy_version 42503 (0.0038) [2024-06-27 17:31:33,854][06674] Fps is (10 sec: 47496.3, 60 sec: 44507.2, 300 sec: 43875.2). Total num frames: 696401920. Throughput: 0: 43851.9. Samples: 599306740. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 17:31:33,854][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:31:36,959][06909] Updated weights for policy 0, policy_version 42513 (0.0036) [2024-06-27 17:31:38,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 696582144. Throughput: 0: 43956.8. Samples: 599568840. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2024-06-27 17:31:38,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:31:40,643][06909] Updated weights for policy 0, policy_version 42523 (0.0032) [2024-06-27 17:31:43,850][06674] Fps is (10 sec: 42613.9, 60 sec: 43963.7, 300 sec: 43931.4). Total num frames: 696827904. Throughput: 0: 43921.9. Samples: 599693780. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2024-06-27 17:31:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 17:31:44,366][06909] Updated weights for policy 0, policy_version 42533 (0.0034) [2024-06-27 17:31:47,911][06909] Updated weights for policy 0, policy_version 42543 (0.0033) [2024-06-27 17:31:48,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.6, 300 sec: 43820.3). Total num frames: 697040896. Throughput: 0: 43837.6. Samples: 599964060. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2024-06-27 17:31:48,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:31:48,863][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000042544_697040896.pth... [2024-06-27 17:31:48,936][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000041901_686505984.pth [2024-06-27 17:31:51,845][06909] Updated weights for policy 0, policy_version 42553 (0.0037) [2024-06-27 17:31:53,851][06674] Fps is (10 sec: 42593.9, 60 sec: 44236.0, 300 sec: 43764.6). Total num frames: 697253888. Throughput: 0: 43980.8. Samples: 600230340. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2024-06-27 17:31:53,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:31:55,591][06909] Updated weights for policy 0, policy_version 42563 (0.0027) [2024-06-27 17:31:58,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43690.8, 300 sec: 43931.6). Total num frames: 697483264. Throughput: 0: 43811.2. Samples: 600350720. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2024-06-27 17:31:58,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:31:59,347][06909] Updated weights for policy 0, policy_version 42573 (0.0029) [2024-06-27 17:32:03,029][06909] Updated weights for policy 0, policy_version 42583 (0.0033) [2024-06-27 17:32:03,850][06674] Fps is (10 sec: 44241.6, 60 sec: 43690.6, 300 sec: 43820.3). Total num frames: 697696256. Throughput: 0: 43926.5. Samples: 600620700. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2024-06-27 17:32:03,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:32:06,913][06909] Updated weights for policy 0, policy_version 42593 (0.0026) [2024-06-27 17:32:08,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43963.8, 300 sec: 43709.2). Total num frames: 697892864. Throughput: 0: 43902.6. Samples: 600889040. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 17:32:08,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 17:32:10,364][06909] Updated weights for policy 0, policy_version 42603 (0.0028) [2024-06-27 17:32:13,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.8, 300 sec: 43932.3). Total num frames: 698155008. Throughput: 0: 43852.8. Samples: 601014440. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 17:32:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 17:32:14,605][06909] Updated weights for policy 0, policy_version 42613 (0.0029) [2024-06-27 17:32:18,149][06909] Updated weights for policy 0, policy_version 42623 (0.0031) [2024-06-27 17:32:18,850][06674] Fps is (10 sec: 47513.9, 60 sec: 43965.3, 300 sec: 43875.8). Total num frames: 698368000. Throughput: 0: 43746.2. Samples: 601275160. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 17:32:18,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:32:21,837][06909] Updated weights for policy 0, policy_version 42633 (0.0022) [2024-06-27 17:32:23,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43963.8, 300 sec: 43764.7). Total num frames: 698564608. Throughput: 0: 44003.2. Samples: 601548980. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 17:32:23,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:32:25,378][06909] Updated weights for policy 0, policy_version 42643 (0.0035) [2024-06-27 17:32:27,751][06887] Signal inference workers to stop experience collection... (8700 times) [2024-06-27 17:32:27,752][06887] Signal inference workers to resume experience collection... (8700 times) [2024-06-27 17:32:27,775][06909] InferenceWorker_p0-w0: stopping experience collection (8700 times) [2024-06-27 17:32:27,775][06909] InferenceWorker_p0-w0: resuming experience collection (8700 times) [2024-06-27 17:32:28,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 698810368. Throughput: 0: 44068.4. Samples: 601676860. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 17:32:28,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:32:29,159][06909] Updated weights for policy 0, policy_version 42653 (0.0033) [2024-06-27 17:32:32,984][06909] Updated weights for policy 0, policy_version 42663 (0.0042) [2024-06-27 17:32:33,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43693.4, 300 sec: 43875.8). Total num frames: 699023360. Throughput: 0: 43926.4. Samples: 601940740. Policy #0 lag: (min: 0.0, avg: 11.4, max: 24.0) [2024-06-27 17:32:33,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:32:37,239][06909] Updated weights for policy 0, policy_version 42673 (0.0038) [2024-06-27 17:32:38,850][06674] Fps is (10 sec: 42598.9, 60 sec: 44236.8, 300 sec: 43820.3). Total num frames: 699236352. Throughput: 0: 43949.5. Samples: 602208020. Policy #0 lag: (min: 0.0, avg: 11.4, max: 24.0) [2024-06-27 17:32:38,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 17:32:40,254][06909] Updated weights for policy 0, policy_version 42683 (0.0027) [2024-06-27 17:32:43,853][06674] Fps is (10 sec: 44221.2, 60 sec: 43961.2, 300 sec: 43986.4). Total num frames: 699465728. Throughput: 0: 44051.2. Samples: 602333180. Policy #0 lag: (min: 0.0, avg: 11.4, max: 24.0) [2024-06-27 17:32:43,854][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 17:32:44,528][06909] Updated weights for policy 0, policy_version 42693 (0.0049) [2024-06-27 17:32:47,575][06909] Updated weights for policy 0, policy_version 42703 (0.0031) [2024-06-27 17:32:48,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.8, 300 sec: 43875.8). Total num frames: 699662336. Throughput: 0: 43868.0. Samples: 602594760. Policy #0 lag: (min: 0.0, avg: 11.4, max: 24.0) [2024-06-27 17:32:48,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:32:51,840][06909] Updated weights for policy 0, policy_version 42713 (0.0043) [2024-06-27 17:32:53,850][06674] Fps is (10 sec: 40974.5, 60 sec: 43691.5, 300 sec: 43820.3). Total num frames: 699875328. Throughput: 0: 43885.0. Samples: 602863860. Policy #0 lag: (min: 0.0, avg: 11.4, max: 24.0) [2024-06-27 17:32:53,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:32:55,472][06909] Updated weights for policy 0, policy_version 42723 (0.0032) [2024-06-27 17:32:58,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 700121088. Throughput: 0: 43871.0. Samples: 602988640. Policy #0 lag: (min: 0.0, avg: 11.4, max: 24.0) [2024-06-27 17:32:58,856][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 17:32:59,109][06909] Updated weights for policy 0, policy_version 42733 (0.0030) [2024-06-27 17:33:02,763][06909] Updated weights for policy 0, policy_version 42743 (0.0035) [2024-06-27 17:33:03,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 700334080. Throughput: 0: 43891.0. Samples: 603250260. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 17:33:03,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:33:06,600][06909] Updated weights for policy 0, policy_version 42753 (0.0037) [2024-06-27 17:33:08,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44236.8, 300 sec: 43875.8). Total num frames: 700547072. Throughput: 0: 43804.8. Samples: 603520200. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 17:33:08,851][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:33:10,034][06909] Updated weights for policy 0, policy_version 42763 (0.0042) [2024-06-27 17:33:13,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 700776448. Throughput: 0: 43824.4. Samples: 603648960. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 17:33:13,851][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 17:33:14,318][06909] Updated weights for policy 0, policy_version 42773 (0.0041) [2024-06-27 17:33:17,721][06909] Updated weights for policy 0, policy_version 42783 (0.0026) [2024-06-27 17:33:18,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43417.6, 300 sec: 43820.6). Total num frames: 700973056. Throughput: 0: 43784.4. Samples: 603911040. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 17:33:18,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:33:22,024][06909] Updated weights for policy 0, policy_version 42793 (0.0032) [2024-06-27 17:33:23,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.7, 300 sec: 43764.7). Total num frames: 701202432. Throughput: 0: 43755.5. Samples: 604177020. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 17:33:23,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:33:25,042][06909] Updated weights for policy 0, policy_version 42803 (0.0031) [2024-06-27 17:33:28,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 701431808. Throughput: 0: 43807.4. Samples: 604304360. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-27 17:33:28,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:33:29,490][06909] Updated weights for policy 0, policy_version 42813 (0.0036) [2024-06-27 17:33:32,574][06909] Updated weights for policy 0, policy_version 42823 (0.0044) [2024-06-27 17:33:33,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 701644800. Throughput: 0: 43708.9. Samples: 604561660. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-27 17:33:33,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:33:36,862][06909] Updated weights for policy 0, policy_version 42833 (0.0035) [2024-06-27 17:33:38,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43417.6, 300 sec: 43764.7). Total num frames: 701841408. Throughput: 0: 43596.9. Samples: 604825720. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-27 17:33:38,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:33:40,194][06909] Updated weights for policy 0, policy_version 42843 (0.0030) [2024-06-27 17:33:43,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43693.2, 300 sec: 43986.9). Total num frames: 702087168. Throughput: 0: 43765.0. Samples: 604958060. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-27 17:33:43,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:33:44,447][06909] Updated weights for policy 0, policy_version 42853 (0.0030) [2024-06-27 17:33:45,559][06887] Signal inference workers to stop experience collection... (8750 times) [2024-06-27 17:33:45,560][06887] Signal inference workers to resume experience collection... (8750 times) [2024-06-27 17:33:45,587][06909] InferenceWorker_p0-w0: stopping experience collection (8750 times) [2024-06-27 17:33:45,587][06909] InferenceWorker_p0-w0: resuming experience collection (8750 times) [2024-06-27 17:33:47,848][06909] Updated weights for policy 0, policy_version 42863 (0.0033) [2024-06-27 17:33:48,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 702300160. Throughput: 0: 43655.6. Samples: 605214760. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-27 17:33:48,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:33:48,868][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000042865_702300160.pth... [2024-06-27 17:33:48,918][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000042224_691798016.pth [2024-06-27 17:33:51,865][06909] Updated weights for policy 0, policy_version 42873 (0.0040) [2024-06-27 17:33:53,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.8, 300 sec: 43875.8). Total num frames: 702529536. Throughput: 0: 43547.2. Samples: 605479820. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-27 17:33:53,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:33:55,398][06909] Updated weights for policy 0, policy_version 42883 (0.0041) [2024-06-27 17:33:58,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 702742528. Throughput: 0: 43697.4. Samples: 605615340. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 17:33:58,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:33:59,068][06909] Updated weights for policy 0, policy_version 42893 (0.0038) [2024-06-27 17:34:02,721][06909] Updated weights for policy 0, policy_version 42903 (0.0029) [2024-06-27 17:34:03,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43417.7, 300 sec: 43709.2). Total num frames: 702939136. Throughput: 0: 43591.2. Samples: 605872640. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 17:34:03,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:34:06,357][06909] Updated weights for policy 0, policy_version 42913 (0.0032) [2024-06-27 17:34:08,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 703168512. Throughput: 0: 43610.6. Samples: 606139500. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 17:34:08,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:34:10,195][06909] Updated weights for policy 0, policy_version 42923 (0.0052) [2024-06-27 17:34:13,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43690.8, 300 sec: 43875.8). Total num frames: 703397888. Throughput: 0: 43657.3. Samples: 606268940. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 17:34:13,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:34:14,364][06909] Updated weights for policy 0, policy_version 42933 (0.0033) [2024-06-27 17:34:17,954][06909] Updated weights for policy 0, policy_version 42943 (0.0033) [2024-06-27 17:34:18,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 703610880. Throughput: 0: 43715.9. Samples: 606528880. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 17:34:18,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:34:21,763][06909] Updated weights for policy 0, policy_version 42953 (0.0039) [2024-06-27 17:34:23,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 703823872. Throughput: 0: 43767.5. Samples: 606795260. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 17:34:23,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:34:25,300][06909] Updated weights for policy 0, policy_version 42963 (0.0035) [2024-06-27 17:34:28,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 704053248. Throughput: 0: 43820.7. Samples: 606930000. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 17:34:28,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:34:29,438][06909] Updated weights for policy 0, policy_version 42973 (0.0037) [2024-06-27 17:34:32,994][06909] Updated weights for policy 0, policy_version 42983 (0.0027) [2024-06-27 17:34:33,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.6, 300 sec: 43820.3). Total num frames: 704266240. Throughput: 0: 43996.5. Samples: 607194600. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 17:34:33,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:34:36,743][06909] Updated weights for policy 0, policy_version 42993 (0.0030) [2024-06-27 17:34:38,852][06674] Fps is (10 sec: 44228.3, 60 sec: 44235.2, 300 sec: 43875.5). Total num frames: 704495616. Throughput: 0: 43818.8. Samples: 607451760. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 17:34:38,852][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:34:40,386][06909] Updated weights for policy 0, policy_version 43003 (0.0022) [2024-06-27 17:34:43,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 704708608. Throughput: 0: 43855.6. Samples: 607588840. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 17:34:43,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:34:43,932][06909] Updated weights for policy 0, policy_version 43013 (0.0031) [2024-06-27 17:34:47,671][06909] Updated weights for policy 0, policy_version 43023 (0.0026) [2024-06-27 17:34:48,850][06674] Fps is (10 sec: 44245.7, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 704937984. Throughput: 0: 44000.3. Samples: 607852660. Policy #0 lag: (min: 1.0, avg: 9.7, max: 20.0) [2024-06-27 17:34:48,852][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 17:34:51,258][06909] Updated weights for policy 0, policy_version 43033 (0.0037) [2024-06-27 17:34:53,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.6, 300 sec: 43820.3). Total num frames: 705150976. Throughput: 0: 43995.6. Samples: 608119300. Policy #0 lag: (min: 1.0, avg: 9.7, max: 20.0) [2024-06-27 17:34:53,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:34:55,267][06909] Updated weights for policy 0, policy_version 43043 (0.0046) [2024-06-27 17:34:58,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.6, 300 sec: 43820.3). Total num frames: 705363968. Throughput: 0: 43973.2. Samples: 608247740. Policy #0 lag: (min: 1.0, avg: 9.7, max: 20.0) [2024-06-27 17:34:58,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:34:59,225][06909] Updated weights for policy 0, policy_version 43053 (0.0043) [2024-06-27 17:35:02,839][06909] Updated weights for policy 0, policy_version 43063 (0.0025) [2024-06-27 17:35:03,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.7, 300 sec: 43875.8). Total num frames: 705593344. Throughput: 0: 44045.4. Samples: 608510920. Policy #0 lag: (min: 1.0, avg: 9.7, max: 20.0) [2024-06-27 17:35:03,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:35:06,691][06909] Updated weights for policy 0, policy_version 43073 (0.0028) [2024-06-27 17:35:08,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 705789952. Throughput: 0: 43941.7. Samples: 608772640. Policy #0 lag: (min: 1.0, avg: 9.7, max: 20.0) [2024-06-27 17:35:08,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:35:10,064][06909] Updated weights for policy 0, policy_version 43083 (0.0033) [2024-06-27 17:35:13,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.6, 300 sec: 43820.2). Total num frames: 706019328. Throughput: 0: 43950.3. Samples: 608907760. Policy #0 lag: (min: 1.0, avg: 9.7, max: 20.0) [2024-06-27 17:35:13,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:35:14,146][06909] Updated weights for policy 0, policy_version 43093 (0.0034) [2024-06-27 17:35:17,647][06909] Updated weights for policy 0, policy_version 43103 (0.0033) [2024-06-27 17:35:18,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 706248704. Throughput: 0: 43831.0. Samples: 609167000. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-27 17:35:18,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:35:19,706][06887] Signal inference workers to stop experience collection... (8800 times) [2024-06-27 17:35:19,706][06887] Signal inference workers to resume experience collection... (8800 times) [2024-06-27 17:35:19,723][06909] InferenceWorker_p0-w0: stopping experience collection (8800 times) [2024-06-27 17:35:19,724][06909] InferenceWorker_p0-w0: resuming experience collection (8800 times) [2024-06-27 17:35:21,238][06909] Updated weights for policy 0, policy_version 43113 (0.0030) [2024-06-27 17:35:23,850][06674] Fps is (10 sec: 44237.6, 60 sec: 43963.8, 300 sec: 43821.2). Total num frames: 706461696. Throughput: 0: 44004.3. Samples: 609431860. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-27 17:35:23,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 17:35:25,282][06909] Updated weights for policy 0, policy_version 43123 (0.0028) [2024-06-27 17:35:28,784][06909] Updated weights for policy 0, policy_version 43133 (0.0022) [2024-06-27 17:35:28,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 706691072. Throughput: 0: 43790.6. Samples: 609559420. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-27 17:35:28,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 17:35:32,513][06909] Updated weights for policy 0, policy_version 43143 (0.0027) [2024-06-27 17:35:33,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 706904064. Throughput: 0: 43945.4. Samples: 609830200. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-27 17:35:33,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:35:36,397][06909] Updated weights for policy 0, policy_version 43153 (0.0027) [2024-06-27 17:35:38,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43965.2, 300 sec: 43875.8). Total num frames: 707133440. Throughput: 0: 43855.0. Samples: 610092780. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-27 17:35:38,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:35:40,059][06909] Updated weights for policy 0, policy_version 43163 (0.0039) [2024-06-27 17:35:43,778][06909] Updated weights for policy 0, policy_version 43173 (0.0026) [2024-06-27 17:35:43,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 707346432. Throughput: 0: 43941.9. Samples: 610225120. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-27 17:35:43,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 17:35:47,450][06909] Updated weights for policy 0, policy_version 43183 (0.0030) [2024-06-27 17:35:48,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 707575808. Throughput: 0: 44042.6. Samples: 610492840. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-27 17:35:48,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:35:48,868][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000043187_707575808.pth... [2024-06-27 17:35:48,959][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000042544_697040896.pth [2024-06-27 17:35:51,158][06909] Updated weights for policy 0, policy_version 43193 (0.0029) [2024-06-27 17:35:53,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 43820.3). Total num frames: 707788800. Throughput: 0: 43954.3. Samples: 610750580. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-27 17:35:53,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:35:54,841][06909] Updated weights for policy 0, policy_version 43203 (0.0033) [2024-06-27 17:35:58,441][06909] Updated weights for policy 0, policy_version 43213 (0.0032) [2024-06-27 17:35:58,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.8, 300 sec: 43820.3). Total num frames: 708001792. Throughput: 0: 43817.5. Samples: 610879540. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-27 17:35:58,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:36:02,209][06909] Updated weights for policy 0, policy_version 43223 (0.0023) [2024-06-27 17:36:03,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 708231168. Throughput: 0: 43982.8. Samples: 611146220. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-27 17:36:03,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:36:05,793][06909] Updated weights for policy 0, policy_version 43233 (0.0026) [2024-06-27 17:36:08,850][06674] Fps is (10 sec: 44235.9, 60 sec: 44236.7, 300 sec: 43875.8). Total num frames: 708444160. Throughput: 0: 43978.4. Samples: 611410900. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-27 17:36:08,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:36:09,518][06909] Updated weights for policy 0, policy_version 43243 (0.0032) [2024-06-27 17:36:13,556][06909] Updated weights for policy 0, policy_version 43253 (0.0034) [2024-06-27 17:36:13,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.7, 300 sec: 43820.6). Total num frames: 708657152. Throughput: 0: 44031.1. Samples: 611540820. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-27 17:36:13,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:36:17,150][06909] Updated weights for policy 0, policy_version 43263 (0.0042) [2024-06-27 17:36:18,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 708886528. Throughput: 0: 43848.9. Samples: 611803400. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-27 17:36:18,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:36:20,916][06909] Updated weights for policy 0, policy_version 43273 (0.0028) [2024-06-27 17:36:23,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43963.7, 300 sec: 43820.3). Total num frames: 709099520. Throughput: 0: 43865.0. Samples: 612066700. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-27 17:36:23,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 17:36:24,478][06909] Updated weights for policy 0, policy_version 43283 (0.0025) [2024-06-27 17:36:28,455][06909] Updated weights for policy 0, policy_version 43293 (0.0036) [2024-06-27 17:36:28,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.6, 300 sec: 43765.2). Total num frames: 709312512. Throughput: 0: 43922.5. Samples: 612201640. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-27 17:36:28,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:36:32,094][06909] Updated weights for policy 0, policy_version 43303 (0.0035) [2024-06-27 17:36:33,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 709525504. Throughput: 0: 43626.3. Samples: 612456020. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-27 17:36:33,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:36:36,235][06909] Updated weights for policy 0, policy_version 43313 (0.0032) [2024-06-27 17:36:38,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 709771264. Throughput: 0: 43782.1. Samples: 612720780. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 17:36:38,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:36:39,838][06909] Updated weights for policy 0, policy_version 43323 (0.0035) [2024-06-27 17:36:43,507][06909] Updated weights for policy 0, policy_version 43333 (0.0028) [2024-06-27 17:36:43,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43690.5, 300 sec: 43820.3). Total num frames: 709967872. Throughput: 0: 43832.3. Samples: 612852000. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 17:36:43,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:36:47,053][06909] Updated weights for policy 0, policy_version 43343 (0.0034) [2024-06-27 17:36:48,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.6, 300 sec: 43875.9). Total num frames: 710197248. Throughput: 0: 43689.7. Samples: 613112260. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 17:36:48,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:36:51,177][06909] Updated weights for policy 0, policy_version 43353 (0.0038) [2024-06-27 17:36:53,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.6, 300 sec: 43875.8). Total num frames: 710426624. Throughput: 0: 43670.7. Samples: 613376080. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 17:36:53,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 17:36:54,451][06909] Updated weights for policy 0, policy_version 43363 (0.0042) [2024-06-27 17:36:58,438][06909] Updated weights for policy 0, policy_version 43373 (0.0041) [2024-06-27 17:36:58,852][06674] Fps is (10 sec: 42590.1, 60 sec: 43689.1, 300 sec: 43820.0). Total num frames: 710623232. Throughput: 0: 43650.5. Samples: 613505180. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 17:36:58,852][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 17:37:01,592][06887] Signal inference workers to stop experience collection... (8850 times) [2024-06-27 17:37:01,615][06909] InferenceWorker_p0-w0: stopping experience collection (8850 times) [2024-06-27 17:37:01,653][06887] Signal inference workers to resume experience collection... (8850 times) [2024-06-27 17:37:01,654][06909] InferenceWorker_p0-w0: resuming experience collection (8850 times) [2024-06-27 17:37:01,786][06909] Updated weights for policy 0, policy_version 43383 (0.0037) [2024-06-27 17:37:03,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43690.7, 300 sec: 43931.4). Total num frames: 710852608. Throughput: 0: 43691.7. Samples: 613769520. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 17:37:03,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:37:05,727][06909] Updated weights for policy 0, policy_version 43393 (0.0034) [2024-06-27 17:37:08,850][06674] Fps is (10 sec: 44246.1, 60 sec: 43690.8, 300 sec: 43764.7). Total num frames: 711065600. Throughput: 0: 43810.7. Samples: 614038180. Policy #0 lag: (min: 1.0, avg: 10.8, max: 22.0) [2024-06-27 17:37:08,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:37:09,507][06909] Updated weights for policy 0, policy_version 43403 (0.0040) [2024-06-27 17:37:12,947][06909] Updated weights for policy 0, policy_version 43413 (0.0023) [2024-06-27 17:37:13,850][06674] Fps is (10 sec: 42597.6, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 711278592. Throughput: 0: 43600.0. Samples: 614163640. Policy #0 lag: (min: 1.0, avg: 10.8, max: 22.0) [2024-06-27 17:37:13,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:37:17,045][06909] Updated weights for policy 0, policy_version 43423 (0.0027) [2024-06-27 17:37:18,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 711507968. Throughput: 0: 43929.3. Samples: 614432840. Policy #0 lag: (min: 1.0, avg: 10.8, max: 22.0) [2024-06-27 17:37:18,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 17:37:20,721][06909] Updated weights for policy 0, policy_version 43433 (0.0031) [2024-06-27 17:37:23,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 711720960. Throughput: 0: 43883.6. Samples: 614695540. Policy #0 lag: (min: 1.0, avg: 10.8, max: 22.0) [2024-06-27 17:37:23,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:37:24,401][06909] Updated weights for policy 0, policy_version 43443 (0.0030) [2024-06-27 17:37:27,940][06909] Updated weights for policy 0, policy_version 43453 (0.0046) [2024-06-27 17:37:28,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.8, 300 sec: 43764.7). Total num frames: 711933952. Throughput: 0: 43908.6. Samples: 614827880. Policy #0 lag: (min: 1.0, avg: 10.8, max: 22.0) [2024-06-27 17:37:28,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:37:31,611][06909] Updated weights for policy 0, policy_version 43463 (0.0032) [2024-06-27 17:37:33,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.8, 300 sec: 43875.8). Total num frames: 712179712. Throughput: 0: 44072.6. Samples: 615095520. Policy #0 lag: (min: 1.0, avg: 10.3, max: 21.0) [2024-06-27 17:37:33,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:37:35,582][06909] Updated weights for policy 0, policy_version 43473 (0.0026) [2024-06-27 17:37:38,850][06674] Fps is (10 sec: 47512.9, 60 sec: 43963.7, 300 sec: 43876.3). Total num frames: 712409088. Throughput: 0: 44122.7. Samples: 615361600. Policy #0 lag: (min: 1.0, avg: 10.3, max: 21.0) [2024-06-27 17:37:38,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:37:39,019][06909] Updated weights for policy 0, policy_version 43483 (0.0032) [2024-06-27 17:37:42,861][06909] Updated weights for policy 0, policy_version 43493 (0.0040) [2024-06-27 17:37:43,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 712605696. Throughput: 0: 44191.7. Samples: 615493720. Policy #0 lag: (min: 1.0, avg: 10.3, max: 21.0) [2024-06-27 17:37:43,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 17:37:46,428][06909] Updated weights for policy 0, policy_version 43503 (0.0039) [2024-06-27 17:37:48,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 712835072. Throughput: 0: 44100.3. Samples: 615754040. Policy #0 lag: (min: 1.0, avg: 10.3, max: 21.0) [2024-06-27 17:37:48,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:37:48,866][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000043508_712835072.pth... [2024-06-27 17:37:48,928][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000042865_702300160.pth [2024-06-27 17:37:50,158][06909] Updated weights for policy 0, policy_version 43513 (0.0034) [2024-06-27 17:37:53,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43690.8, 300 sec: 43820.3). Total num frames: 713048064. Throughput: 0: 44004.9. Samples: 616018400. Policy #0 lag: (min: 1.0, avg: 10.3, max: 21.0) [2024-06-27 17:37:53,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 17:37:54,231][06909] Updated weights for policy 0, policy_version 43523 (0.0029) [2024-06-27 17:37:58,155][06909] Updated weights for policy 0, policy_version 43533 (0.0032) [2024-06-27 17:37:58,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43965.1, 300 sec: 43820.2). Total num frames: 713261056. Throughput: 0: 44102.2. Samples: 616148240. Policy #0 lag: (min: 1.0, avg: 10.3, max: 21.0) [2024-06-27 17:37:58,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:38:01,897][06909] Updated weights for policy 0, policy_version 43543 (0.0029) [2024-06-27 17:38:03,850][06674] Fps is (10 sec: 44235.2, 60 sec: 43963.5, 300 sec: 43875.8). Total num frames: 713490432. Throughput: 0: 44005.5. Samples: 616413100. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 17:38:03,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:38:05,459][06909] Updated weights for policy 0, policy_version 43553 (0.0029) [2024-06-27 17:38:08,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43963.7, 300 sec: 43820.3). Total num frames: 713703424. Throughput: 0: 43927.6. Samples: 616672280. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 17:38:08,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 17:38:09,280][06909] Updated weights for policy 0, policy_version 43563 (0.0027) [2024-06-27 17:38:13,074][06909] Updated weights for policy 0, policy_version 43573 (0.0030) [2024-06-27 17:38:13,850][06674] Fps is (10 sec: 42599.9, 60 sec: 43963.9, 300 sec: 43875.8). Total num frames: 713916416. Throughput: 0: 43838.2. Samples: 616800600. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 17:38:13,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:38:16,584][06909] Updated weights for policy 0, policy_version 43583 (0.0041) [2024-06-27 17:38:18,850][06674] Fps is (10 sec: 44235.4, 60 sec: 43963.6, 300 sec: 43875.8). Total num frames: 714145792. Throughput: 0: 43710.8. Samples: 617062520. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 17:38:18,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:38:20,577][06909] Updated weights for policy 0, policy_version 43593 (0.0039) [2024-06-27 17:38:23,854][06674] Fps is (10 sec: 45855.9, 60 sec: 44233.7, 300 sec: 43875.2). Total num frames: 714375168. Throughput: 0: 43759.6. Samples: 617330960. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 17:38:23,854][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:38:23,925][06909] Updated weights for policy 0, policy_version 43603 (0.0039) [2024-06-27 17:38:27,930][06909] Updated weights for policy 0, policy_version 43613 (0.0035) [2024-06-27 17:38:28,850][06674] Fps is (10 sec: 42599.3, 60 sec: 43963.6, 300 sec: 43820.2). Total num frames: 714571776. Throughput: 0: 43826.7. Samples: 617465920. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 17:38:28,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:38:31,251][06909] Updated weights for policy 0, policy_version 43623 (0.0025) [2024-06-27 17:38:33,850][06674] Fps is (10 sec: 42616.5, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 714801152. Throughput: 0: 43769.5. Samples: 617723660. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 17:38:33,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:38:35,186][06909] Updated weights for policy 0, policy_version 43633 (0.0036) [2024-06-27 17:38:38,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 715030528. Throughput: 0: 43948.4. Samples: 617996080. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 17:38:38,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:38:38,947][06909] Updated weights for policy 0, policy_version 43643 (0.0029) [2024-06-27 17:38:39,139][06887] Signal inference workers to stop experience collection... (8900 times) [2024-06-27 17:38:39,139][06887] Signal inference workers to resume experience collection... (8900 times) [2024-06-27 17:38:39,161][06909] InferenceWorker_p0-w0: stopping experience collection (8900 times) [2024-06-27 17:38:39,162][06909] InferenceWorker_p0-w0: resuming experience collection (8900 times) [2024-06-27 17:38:42,468][06909] Updated weights for policy 0, policy_version 43653 (0.0031) [2024-06-27 17:38:43,850][06674] Fps is (10 sec: 44236.0, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 715243520. Throughput: 0: 43912.9. Samples: 618124320. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 17:38:43,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:38:46,703][06909] Updated weights for policy 0, policy_version 43663 (0.0027) [2024-06-27 17:38:48,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 715472896. Throughput: 0: 43852.2. Samples: 618386440. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 17:38:48,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:38:50,252][06909] Updated weights for policy 0, policy_version 43673 (0.0035) [2024-06-27 17:38:53,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 715685888. Throughput: 0: 44057.7. Samples: 618654880. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 17:38:53,853][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:38:53,891][06909] Updated weights for policy 0, policy_version 43683 (0.0027) [2024-06-27 17:38:57,595][06909] Updated weights for policy 0, policy_version 43693 (0.0022) [2024-06-27 17:38:58,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 715898880. Throughput: 0: 44171.5. Samples: 618788320. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2024-06-27 17:38:58,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:39:01,187][06909] Updated weights for policy 0, policy_version 43703 (0.0023) [2024-06-27 17:39:03,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43964.0, 300 sec: 43931.4). Total num frames: 716128256. Throughput: 0: 44215.0. Samples: 619052180. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2024-06-27 17:39:03,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 17:39:04,973][06909] Updated weights for policy 0, policy_version 43713 (0.0041) [2024-06-27 17:39:08,670][06909] Updated weights for policy 0, policy_version 43723 (0.0028) [2024-06-27 17:39:08,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 716357632. Throughput: 0: 44089.4. Samples: 619314800. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2024-06-27 17:39:08,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:39:12,481][06909] Updated weights for policy 0, policy_version 43733 (0.0028) [2024-06-27 17:39:13,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 716554240. Throughput: 0: 43922.3. Samples: 619442420. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2024-06-27 17:39:13,853][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:39:15,897][06909] Updated weights for policy 0, policy_version 43743 (0.0022) [2024-06-27 17:39:18,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44237.1, 300 sec: 43986.9). Total num frames: 716800000. Throughput: 0: 44176.4. Samples: 619711600. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2024-06-27 17:39:18,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:39:19,711][06909] Updated weights for policy 0, policy_version 43753 (0.0029) [2024-06-27 17:39:23,576][06909] Updated weights for policy 0, policy_version 43763 (0.0041) [2024-06-27 17:39:23,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43966.8, 300 sec: 43931.4). Total num frames: 717012992. Throughput: 0: 44010.3. Samples: 619976540. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2024-06-27 17:39:23,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:39:27,058][06909] Updated weights for policy 0, policy_version 43773 (0.0044) [2024-06-27 17:39:28,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 717209600. Throughput: 0: 44058.3. Samples: 620106940. Policy #0 lag: (min: 0.0, avg: 11.1, max: 26.0) [2024-06-27 17:39:28,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:39:30,990][06909] Updated weights for policy 0, policy_version 43783 (0.0045) [2024-06-27 17:39:33,850][06674] Fps is (10 sec: 44236.2, 60 sec: 44236.7, 300 sec: 43931.6). Total num frames: 717455360. Throughput: 0: 44116.4. Samples: 620371680. Policy #0 lag: (min: 0.0, avg: 11.1, max: 26.0) [2024-06-27 17:39:33,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:39:34,880][06909] Updated weights for policy 0, policy_version 43793 (0.0034) [2024-06-27 17:39:38,249][06909] Updated weights for policy 0, policy_version 43803 (0.0029) [2024-06-27 17:39:38,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 717684736. Throughput: 0: 43894.6. Samples: 620630140. Policy #0 lag: (min: 0.0, avg: 11.1, max: 26.0) [2024-06-27 17:39:38,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:39:42,182][06909] Updated weights for policy 0, policy_version 43813 (0.0022) [2024-06-27 17:39:43,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 717881344. Throughput: 0: 43956.4. Samples: 620766360. Policy #0 lag: (min: 0.0, avg: 11.1, max: 26.0) [2024-06-27 17:39:43,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:39:45,813][06909] Updated weights for policy 0, policy_version 43823 (0.0029) [2024-06-27 17:39:48,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 718110720. Throughput: 0: 44008.4. Samples: 621032560. Policy #0 lag: (min: 0.0, avg: 11.1, max: 26.0) [2024-06-27 17:39:48,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:39:48,873][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000043830_718110720.pth... [2024-06-27 17:39:48,931][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000043187_707575808.pth [2024-06-27 17:39:49,554][06909] Updated weights for policy 0, policy_version 43833 (0.0031) [2024-06-27 17:39:53,057][06909] Updated weights for policy 0, policy_version 43843 (0.0041) [2024-06-27 17:39:53,850][06674] Fps is (10 sec: 45875.7, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 718340096. Throughput: 0: 43975.6. Samples: 621293700. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-27 17:39:53,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 17:39:56,371][06887] Signal inference workers to stop experience collection... (8950 times) [2024-06-27 17:39:56,372][06887] Signal inference workers to resume experience collection... (8950 times) [2024-06-27 17:39:56,386][06909] InferenceWorker_p0-w0: stopping experience collection (8950 times) [2024-06-27 17:39:56,416][06909] InferenceWorker_p0-w0: resuming experience collection (8950 times) [2024-06-27 17:39:56,839][06909] Updated weights for policy 0, policy_version 43853 (0.0022) [2024-06-27 17:39:58,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 718536704. Throughput: 0: 44214.2. Samples: 621432060. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-27 17:39:58,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 17:40:00,468][06909] Updated weights for policy 0, policy_version 43863 (0.0024) [2024-06-27 17:40:03,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 718766080. Throughput: 0: 44000.9. Samples: 621691640. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-27 17:40:03,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:40:04,189][06909] Updated weights for policy 0, policy_version 43873 (0.0027) [2024-06-27 17:40:08,252][06909] Updated weights for policy 0, policy_version 43883 (0.0030) [2024-06-27 17:40:08,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 718995456. Throughput: 0: 43845.2. Samples: 621949580. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-27 17:40:08,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:40:11,875][06909] Updated weights for policy 0, policy_version 43893 (0.0032) [2024-06-27 17:40:13,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 719192064. Throughput: 0: 43844.9. Samples: 622079960. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-27 17:40:13,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 17:40:15,615][06909] Updated weights for policy 0, policy_version 43903 (0.0027) [2024-06-27 17:40:18,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.5, 300 sec: 43931.3). Total num frames: 719421440. Throughput: 0: 43906.2. Samples: 622347460. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-27 17:40:18,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 17:40:19,368][06909] Updated weights for policy 0, policy_version 43913 (0.0034) [2024-06-27 17:40:23,362][06909] Updated weights for policy 0, policy_version 43923 (0.0032) [2024-06-27 17:40:23,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.6, 300 sec: 43931.3). Total num frames: 719650816. Throughput: 0: 43891.6. Samples: 622605260. Policy #0 lag: (min: 1.0, avg: 11.9, max: 22.0) [2024-06-27 17:40:23,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 17:40:26,829][06909] Updated weights for policy 0, policy_version 43933 (0.0040) [2024-06-27 17:40:28,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 719831040. Throughput: 0: 43880.1. Samples: 622740960. Policy #0 lag: (min: 1.0, avg: 11.9, max: 22.0) [2024-06-27 17:40:28,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:40:30,766][06909] Updated weights for policy 0, policy_version 43943 (0.0042) [2024-06-27 17:40:33,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 720093184. Throughput: 0: 43772.4. Samples: 623002320. Policy #0 lag: (min: 1.0, avg: 11.9, max: 22.0) [2024-06-27 17:40:33,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:40:34,370][06909] Updated weights for policy 0, policy_version 43953 (0.0041) [2024-06-27 17:40:38,029][06909] Updated weights for policy 0, policy_version 43963 (0.0036) [2024-06-27 17:40:38,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43417.7, 300 sec: 43875.8). Total num frames: 720289792. Throughput: 0: 43885.8. Samples: 623268560. Policy #0 lag: (min: 1.0, avg: 11.9, max: 22.0) [2024-06-27 17:40:38,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:40:41,662][06909] Updated weights for policy 0, policy_version 43973 (0.0036) [2024-06-27 17:40:43,852][06674] Fps is (10 sec: 42589.8, 60 sec: 43962.2, 300 sec: 43875.5). Total num frames: 720519168. Throughput: 0: 43739.8. Samples: 623400440. Policy #0 lag: (min: 1.0, avg: 11.9, max: 22.0) [2024-06-27 17:40:43,852][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:40:45,714][06909] Updated weights for policy 0, policy_version 43983 (0.0044) [2024-06-27 17:40:48,850][06674] Fps is (10 sec: 47513.4, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 720764928. Throughput: 0: 43852.0. Samples: 623664980. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 17:40:48,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:40:48,953][06909] Updated weights for policy 0, policy_version 43993 (0.0022) [2024-06-27 17:40:53,046][06909] Updated weights for policy 0, policy_version 44003 (0.0025) [2024-06-27 17:40:53,850][06674] Fps is (10 sec: 45884.5, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 720977920. Throughput: 0: 44000.9. Samples: 623929620. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 17:40:53,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:40:56,701][06909] Updated weights for policy 0, policy_version 44013 (0.0031) [2024-06-27 17:40:58,850][06674] Fps is (10 sec: 39321.3, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 721158144. Throughput: 0: 44039.6. Samples: 624061740. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 17:40:58,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:41:00,335][06909] Updated weights for policy 0, policy_version 44023 (0.0033) [2024-06-27 17:41:03,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.6, 300 sec: 43931.3). Total num frames: 721403904. Throughput: 0: 43920.9. Samples: 624323900. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 17:41:03,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:41:04,154][06909] Updated weights for policy 0, policy_version 44033 (0.0035) [2024-06-27 17:41:06,482][06887] Signal inference workers to stop experience collection... (9000 times) [2024-06-27 17:41:06,482][06887] Signal inference workers to resume experience collection... (9000 times) [2024-06-27 17:41:06,529][06909] InferenceWorker_p0-w0: stopping experience collection (9000 times) [2024-06-27 17:41:06,529][06909] InferenceWorker_p0-w0: resuming experience collection (9000 times) [2024-06-27 17:41:07,945][06909] Updated weights for policy 0, policy_version 44043 (0.0028) [2024-06-27 17:41:08,850][06674] Fps is (10 sec: 47513.7, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 721633280. Throughput: 0: 43980.1. Samples: 624584360. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 17:41:08,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:41:11,806][06909] Updated weights for policy 0, policy_version 44053 (0.0034) [2024-06-27 17:41:13,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 721829888. Throughput: 0: 43934.6. Samples: 624718020. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 17:41:13,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:41:15,194][06909] Updated weights for policy 0, policy_version 44063 (0.0028) [2024-06-27 17:41:18,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 722059264. Throughput: 0: 44169.9. Samples: 624989960. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-27 17:41:18,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:41:19,094][06909] Updated weights for policy 0, policy_version 44073 (0.0035) [2024-06-27 17:41:22,630][06909] Updated weights for policy 0, policy_version 44083 (0.0021) [2024-06-27 17:41:23,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.7, 300 sec: 43931.4). Total num frames: 722272256. Throughput: 0: 44038.6. Samples: 625250300. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-27 17:41:23,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:41:26,596][06909] Updated weights for policy 0, policy_version 44093 (0.0034) [2024-06-27 17:41:28,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 722485248. Throughput: 0: 43874.9. Samples: 625374720. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-27 17:41:28,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:41:30,388][06909] Updated weights for policy 0, policy_version 44103 (0.0031) [2024-06-27 17:41:33,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.8, 300 sec: 43931.4). Total num frames: 722731008. Throughput: 0: 43851.1. Samples: 625638280. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-27 17:41:33,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:41:34,502][06909] Updated weights for policy 0, policy_version 44113 (0.0032) [2024-06-27 17:41:38,098][06909] Updated weights for policy 0, policy_version 44123 (0.0031) [2024-06-27 17:41:38,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.7, 300 sec: 43931.4). Total num frames: 722927616. Throughput: 0: 43882.8. Samples: 625904340. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-27 17:41:38,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:41:41,869][06909] Updated weights for policy 0, policy_version 44133 (0.0046) [2024-06-27 17:41:43,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43965.3, 300 sec: 43931.4). Total num frames: 723156992. Throughput: 0: 43771.2. Samples: 626031440. Policy #0 lag: (min: 1.0, avg: 9.6, max: 21.0) [2024-06-27 17:41:43,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:41:45,565][06909] Updated weights for policy 0, policy_version 44143 (0.0030) [2024-06-27 17:41:48,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43690.6, 300 sec: 43931.4). Total num frames: 723386368. Throughput: 0: 43922.7. Samples: 626300420. Policy #0 lag: (min: 1.0, avg: 9.6, max: 21.0) [2024-06-27 17:41:48,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:41:48,881][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000044152_723386368.pth... [2024-06-27 17:41:48,933][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000043508_712835072.pth [2024-06-27 17:41:49,306][06909] Updated weights for policy 0, policy_version 44153 (0.0032) [2024-06-27 17:41:52,824][06909] Updated weights for policy 0, policy_version 44163 (0.0036) [2024-06-27 17:41:53,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.7, 300 sec: 43987.2). Total num frames: 723599360. Throughput: 0: 43778.2. Samples: 626554380. Policy #0 lag: (min: 1.0, avg: 9.6, max: 21.0) [2024-06-27 17:41:53,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:41:56,923][06909] Updated weights for policy 0, policy_version 44173 (0.0022) [2024-06-27 17:41:58,852][06674] Fps is (10 sec: 42589.6, 60 sec: 44235.3, 300 sec: 43931.0). Total num frames: 723812352. Throughput: 0: 43808.7. Samples: 626689500. Policy #0 lag: (min: 1.0, avg: 9.6, max: 21.0) [2024-06-27 17:41:58,852][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:42:00,362][06909] Updated weights for policy 0, policy_version 44183 (0.0037) [2024-06-27 17:42:03,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 724041728. Throughput: 0: 43644.4. Samples: 626953960. Policy #0 lag: (min: 1.0, avg: 9.6, max: 21.0) [2024-06-27 17:42:03,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:42:04,242][06909] Updated weights for policy 0, policy_version 44193 (0.0037) [2024-06-27 17:42:07,960][06909] Updated weights for policy 0, policy_version 44203 (0.0036) [2024-06-27 17:42:08,850][06674] Fps is (10 sec: 44245.8, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 724254720. Throughput: 0: 43644.4. Samples: 627214300. Policy #0 lag: (min: 1.0, avg: 9.6, max: 21.0) [2024-06-27 17:42:08,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:42:11,680][06909] Updated weights for policy 0, policy_version 44213 (0.0033) [2024-06-27 17:42:12,762][06887] Signal inference workers to stop experience collection... (9050 times) [2024-06-27 17:42:12,813][06909] InferenceWorker_p0-w0: stopping experience collection (9050 times) [2024-06-27 17:42:12,822][06887] Signal inference workers to resume experience collection... (9050 times) [2024-06-27 17:42:12,836][06909] InferenceWorker_p0-w0: resuming experience collection (9050 times) [2024-06-27 17:42:13,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 724467712. Throughput: 0: 43692.4. Samples: 627340880. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 17:42:13,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:42:15,224][06909] Updated weights for policy 0, policy_version 44223 (0.0045) [2024-06-27 17:42:18,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 724697088. Throughput: 0: 43826.9. Samples: 627610500. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 17:42:18,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:42:19,285][06909] Updated weights for policy 0, policy_version 44233 (0.0037) [2024-06-27 17:42:22,820][06909] Updated weights for policy 0, policy_version 44243 (0.0036) [2024-06-27 17:42:23,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 724910080. Throughput: 0: 43702.6. Samples: 627870960. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 17:42:23,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:42:26,634][06909] Updated weights for policy 0, policy_version 44253 (0.0036) [2024-06-27 17:42:28,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 725123072. Throughput: 0: 43795.5. Samples: 628002240. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 17:42:28,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:42:30,164][06909] Updated weights for policy 0, policy_version 44263 (0.0031) [2024-06-27 17:42:33,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 725352448. Throughput: 0: 43632.8. Samples: 628263900. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 17:42:33,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 17:42:34,318][06909] Updated weights for policy 0, policy_version 44273 (0.0043) [2024-06-27 17:42:37,774][06909] Updated weights for policy 0, policy_version 44283 (0.0034) [2024-06-27 17:42:38,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.7, 300 sec: 43931.4). Total num frames: 725565440. Throughput: 0: 43671.2. Samples: 628519580. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 17:42:38,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:42:41,702][06909] Updated weights for policy 0, policy_version 44293 (0.0035) [2024-06-27 17:42:43,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43417.5, 300 sec: 43820.3). Total num frames: 725762048. Throughput: 0: 43691.7. Samples: 628655540. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-27 17:42:43,850][06674] Avg episode reward: [(0, '0.394')] [2024-06-27 17:42:45,463][06909] Updated weights for policy 0, policy_version 44303 (0.0027) [2024-06-27 17:42:48,850][06674] Fps is (10 sec: 44236.0, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 726007808. Throughput: 0: 43694.5. Samples: 628920220. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-27 17:42:48,851][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:42:49,164][06909] Updated weights for policy 0, policy_version 44313 (0.0033) [2024-06-27 17:42:52,881][06909] Updated weights for policy 0, policy_version 44323 (0.0037) [2024-06-27 17:42:53,856][06674] Fps is (10 sec: 45848.2, 60 sec: 43686.3, 300 sec: 43930.5). Total num frames: 726220800. Throughput: 0: 43572.5. Samples: 629175320. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-27 17:42:53,856][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:42:56,533][06909] Updated weights for policy 0, policy_version 44333 (0.0042) [2024-06-27 17:42:58,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43419.0, 300 sec: 43820.3). Total num frames: 726417408. Throughput: 0: 43776.3. Samples: 629310820. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-27 17:42:58,851][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:43:00,270][06909] Updated weights for policy 0, policy_version 44343 (0.0037) [2024-06-27 17:43:03,827][06909] Updated weights for policy 0, policy_version 44353 (0.0035) [2024-06-27 17:43:03,850][06674] Fps is (10 sec: 45902.6, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 726679552. Throughput: 0: 43853.9. Samples: 629583920. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-27 17:43:03,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:43:07,521][06909] Updated weights for policy 0, policy_version 44363 (0.0023) [2024-06-27 17:43:08,850][06674] Fps is (10 sec: 47513.9, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 726892544. Throughput: 0: 43823.9. Samples: 629843040. Policy #0 lag: (min: 1.0, avg: 9.4, max: 21.0) [2024-06-27 17:43:08,860][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:43:11,356][06909] Updated weights for policy 0, policy_version 44373 (0.0038) [2024-06-27 17:43:13,850][06674] Fps is (10 sec: 39321.8, 60 sec: 43417.6, 300 sec: 43820.3). Total num frames: 727072768. Throughput: 0: 43877.8. Samples: 629976740. Policy #0 lag: (min: 1.0, avg: 9.4, max: 21.0) [2024-06-27 17:43:13,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:43:15,319][06909] Updated weights for policy 0, policy_version 44383 (0.0037) [2024-06-27 17:43:18,642][06909] Updated weights for policy 0, policy_version 44393 (0.0028) [2024-06-27 17:43:18,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 43931.9). Total num frames: 727334912. Throughput: 0: 43948.0. Samples: 630241560. Policy #0 lag: (min: 1.0, avg: 9.4, max: 21.0) [2024-06-27 17:43:18,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:43:22,659][06909] Updated weights for policy 0, policy_version 44403 (0.0026) [2024-06-27 17:43:23,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43690.7, 300 sec: 43931.4). Total num frames: 727531520. Throughput: 0: 44025.4. Samples: 630500720. Policy #0 lag: (min: 1.0, avg: 9.4, max: 21.0) [2024-06-27 17:43:23,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:43:26,604][06909] Updated weights for policy 0, policy_version 44413 (0.0036) [2024-06-27 17:43:28,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 727744512. Throughput: 0: 43961.9. Samples: 630633820. Policy #0 lag: (min: 1.0, avg: 9.4, max: 21.0) [2024-06-27 17:43:28,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:43:30,100][06909] Updated weights for policy 0, policy_version 44423 (0.0041) [2024-06-27 17:43:30,325][06887] Signal inference workers to stop experience collection... (9100 times) [2024-06-27 17:43:30,332][06887] Signal inference workers to resume experience collection... (9100 times) [2024-06-27 17:43:30,372][06909] InferenceWorker_p0-w0: stopping experience collection (9100 times) [2024-06-27 17:43:30,372][06909] InferenceWorker_p0-w0: resuming experience collection (9100 times) [2024-06-27 17:43:33,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 727973888. Throughput: 0: 43975.3. Samples: 630899100. Policy #0 lag: (min: 1.0, avg: 9.4, max: 21.0) [2024-06-27 17:43:33,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:43:33,864][06909] Updated weights for policy 0, policy_version 44433 (0.0041) [2024-06-27 17:43:37,542][06909] Updated weights for policy 0, policy_version 44443 (0.0025) [2024-06-27 17:43:38,850][06674] Fps is (10 sec: 45874.5, 60 sec: 43963.6, 300 sec: 43931.3). Total num frames: 728203264. Throughput: 0: 44097.3. Samples: 631159440. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 17:43:38,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:43:41,283][06909] Updated weights for policy 0, policy_version 44453 (0.0029) [2024-06-27 17:43:43,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44236.8, 300 sec: 43875.8). Total num frames: 728416256. Throughput: 0: 44031.2. Samples: 631292220. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 17:43:43,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:43:45,256][06909] Updated weights for policy 0, policy_version 44463 (0.0037) [2024-06-27 17:43:48,578][06909] Updated weights for policy 0, policy_version 44473 (0.0024) [2024-06-27 17:43:48,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 728645632. Throughput: 0: 43972.4. Samples: 631562680. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 17:43:48,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 17:43:48,866][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000044474_728662016.pth... [2024-06-27 17:43:48,916][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000043830_718110720.pth [2024-06-27 17:43:52,598][06909] Updated weights for policy 0, policy_version 44483 (0.0037) [2024-06-27 17:43:53,850][06674] Fps is (10 sec: 45875.7, 60 sec: 44241.2, 300 sec: 43986.9). Total num frames: 728875008. Throughput: 0: 44065.0. Samples: 631825960. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 17:43:53,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:43:56,023][06909] Updated weights for policy 0, policy_version 44493 (0.0038) [2024-06-27 17:43:58,850][06674] Fps is (10 sec: 42598.0, 60 sec: 44236.8, 300 sec: 43875.8). Total num frames: 729071616. Throughput: 0: 44032.7. Samples: 631958220. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 17:43:58,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:43:59,992][06909] Updated weights for policy 0, policy_version 44503 (0.0038) [2024-06-27 17:44:03,626][06909] Updated weights for policy 0, policy_version 44513 (0.0027) [2024-06-27 17:44:03,852][06674] Fps is (10 sec: 44227.0, 60 sec: 43962.2, 300 sec: 43931.0). Total num frames: 729317376. Throughput: 0: 43971.3. Samples: 632220360. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-27 17:44:03,852][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:44:07,764][06909] Updated weights for policy 0, policy_version 44523 (0.0030) [2024-06-27 17:44:08,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 729513984. Throughput: 0: 43939.5. Samples: 632478000. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-27 17:44:08,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:44:10,848][06909] Updated weights for policy 0, policy_version 44533 (0.0036) [2024-06-27 17:44:13,850][06674] Fps is (10 sec: 40968.3, 60 sec: 44236.7, 300 sec: 43820.2). Total num frames: 729726976. Throughput: 0: 43966.5. Samples: 632612320. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-27 17:44:13,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:44:14,932][06909] Updated weights for policy 0, policy_version 44543 (0.0024) [2024-06-27 17:44:18,612][06909] Updated weights for policy 0, policy_version 44553 (0.0027) [2024-06-27 17:44:18,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 729972736. Throughput: 0: 44066.6. Samples: 632882100. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-27 17:44:18,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:44:22,200][06909] Updated weights for policy 0, policy_version 44563 (0.0040) [2024-06-27 17:44:23,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 730169344. Throughput: 0: 44113.0. Samples: 633144520. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-27 17:44:23,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:44:25,827][06909] Updated weights for policy 0, policy_version 44573 (0.0027) [2024-06-27 17:44:28,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.7, 300 sec: 43875.8). Total num frames: 730398720. Throughput: 0: 43919.6. Samples: 633268600. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-27 17:44:28,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 17:44:29,626][06909] Updated weights for policy 0, policy_version 44583 (0.0037) [2024-06-27 17:44:33,357][06909] Updated weights for policy 0, policy_version 44593 (0.0031) [2024-06-27 17:44:33,852][06674] Fps is (10 sec: 47503.9, 60 sec: 44508.3, 300 sec: 43931.0). Total num frames: 730644480. Throughput: 0: 44088.7. Samples: 633546760. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-27 17:44:33,852][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:44:36,831][06909] Updated weights for policy 0, policy_version 44603 (0.0026) [2024-06-27 17:44:38,856][06674] Fps is (10 sec: 42572.4, 60 sec: 43686.3, 300 sec: 43874.9). Total num frames: 730824704. Throughput: 0: 44048.2. Samples: 633808400. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-27 17:44:38,857][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 17:44:40,821][06909] Updated weights for policy 0, policy_version 44613 (0.0037) [2024-06-27 17:44:43,602][06887] Signal inference workers to stop experience collection... (9150 times) [2024-06-27 17:44:43,602][06887] Signal inference workers to resume experience collection... (9150 times) [2024-06-27 17:44:43,641][06909] InferenceWorker_p0-w0: stopping experience collection (9150 times) [2024-06-27 17:44:43,642][06909] InferenceWorker_p0-w0: resuming experience collection (9150 times) [2024-06-27 17:44:43,850][06674] Fps is (10 sec: 42607.1, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 731070464. Throughput: 0: 43890.8. Samples: 633933300. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-27 17:44:43,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:44:44,493][06909] Updated weights for policy 0, policy_version 44623 (0.0046) [2024-06-27 17:44:47,985][06909] Updated weights for policy 0, policy_version 44633 (0.0027) [2024-06-27 17:44:48,850][06674] Fps is (10 sec: 47542.2, 60 sec: 44236.7, 300 sec: 43931.3). Total num frames: 731299840. Throughput: 0: 44092.7. Samples: 634204440. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-27 17:44:48,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:44:52,291][06909] Updated weights for policy 0, policy_version 44643 (0.0036) [2024-06-27 17:44:53,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43417.6, 300 sec: 43875.8). Total num frames: 731480064. Throughput: 0: 44208.9. Samples: 634467400. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-27 17:44:53,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:44:55,533][06909] Updated weights for policy 0, policy_version 44653 (0.0029) [2024-06-27 17:44:58,850][06674] Fps is (10 sec: 40960.6, 60 sec: 43963.9, 300 sec: 43875.8). Total num frames: 731709440. Throughput: 0: 44048.2. Samples: 634594480. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-27 17:44:58,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 17:44:59,495][06909] Updated weights for policy 0, policy_version 44663 (0.0037) [2024-06-27 17:45:03,052][06909] Updated weights for policy 0, policy_version 44673 (0.0029) [2024-06-27 17:45:03,850][06674] Fps is (10 sec: 47512.7, 60 sec: 43965.2, 300 sec: 43931.3). Total num frames: 731955200. Throughput: 0: 43987.4. Samples: 634861540. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-27 17:45:03,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:45:06,690][06909] Updated weights for policy 0, policy_version 44683 (0.0035) [2024-06-27 17:45:08,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 732151808. Throughput: 0: 44107.6. Samples: 635129360. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-27 17:45:08,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:45:10,306][06909] Updated weights for policy 0, policy_version 44693 (0.0031) [2024-06-27 17:45:13,850][06674] Fps is (10 sec: 44237.7, 60 sec: 44510.0, 300 sec: 43986.9). Total num frames: 732397568. Throughput: 0: 44173.9. Samples: 635256420. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-27 17:45:13,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:45:13,882][06909] Updated weights for policy 0, policy_version 44703 (0.0032) [2024-06-27 17:45:18,095][06909] Updated weights for policy 0, policy_version 44713 (0.0032) [2024-06-27 17:45:18,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 732610560. Throughput: 0: 43918.4. Samples: 635523000. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-27 17:45:18,851][06674] Avg episode reward: [(0, '0.395')] [2024-06-27 17:45:21,712][06909] Updated weights for policy 0, policy_version 44723 (0.0031) [2024-06-27 17:45:23,850][06674] Fps is (10 sec: 40959.3, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 732807168. Throughput: 0: 44131.7. Samples: 635794060. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-27 17:45:23,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:45:25,370][06909] Updated weights for policy 0, policy_version 44733 (0.0036) [2024-06-27 17:45:28,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 733052928. Throughput: 0: 44147.5. Samples: 635919940. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 17:45:28,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:45:28,986][06909] Updated weights for policy 0, policy_version 44743 (0.0037) [2024-06-27 17:45:32,887][06909] Updated weights for policy 0, policy_version 44753 (0.0029) [2024-06-27 17:45:33,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43692.1, 300 sec: 43986.9). Total num frames: 733265920. Throughput: 0: 43903.6. Samples: 636180100. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 17:45:33,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:45:36,718][06909] Updated weights for policy 0, policy_version 44763 (0.0036) [2024-06-27 17:45:38,850][06674] Fps is (10 sec: 40960.6, 60 sec: 43968.3, 300 sec: 43876.1). Total num frames: 733462528. Throughput: 0: 44062.2. Samples: 636450200. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 17:45:38,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:45:40,617][06909] Updated weights for policy 0, policy_version 44773 (0.0034) [2024-06-27 17:45:43,852][06674] Fps is (10 sec: 44228.0, 60 sec: 43962.2, 300 sec: 43875.5). Total num frames: 733708288. Throughput: 0: 44137.5. Samples: 636580760. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 17:45:43,853][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 17:45:44,237][06909] Updated weights for policy 0, policy_version 44783 (0.0029) [2024-06-27 17:45:48,085][06909] Updated weights for policy 0, policy_version 44793 (0.0033) [2024-06-27 17:45:48,852][06674] Fps is (10 sec: 45865.5, 60 sec: 43689.3, 300 sec: 43875.5). Total num frames: 733921280. Throughput: 0: 43796.4. Samples: 636832460. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 17:45:48,852][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 17:45:48,859][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000044795_733921280.pth... [2024-06-27 17:45:48,920][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000044152_723386368.pth [2024-06-27 17:45:51,626][06909] Updated weights for policy 0, policy_version 44803 (0.0038) [2024-06-27 17:45:53,850][06674] Fps is (10 sec: 39329.3, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 734101504. Throughput: 0: 43771.5. Samples: 637099080. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 17:45:53,851][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 17:45:56,067][06909] Updated weights for policy 0, policy_version 44813 (0.0028) [2024-06-27 17:45:58,854][06674] Fps is (10 sec: 44228.6, 60 sec: 44233.9, 300 sec: 43930.8). Total num frames: 734363648. Throughput: 0: 43815.3. Samples: 637228280. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 17:45:58,854][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:45:59,235][06909] Updated weights for policy 0, policy_version 44823 (0.0022) [2024-06-27 17:46:03,459][06909] Updated weights for policy 0, policy_version 44833 (0.0033) [2024-06-27 17:46:03,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43144.6, 300 sec: 43764.7). Total num frames: 734543872. Throughput: 0: 43663.6. Samples: 637487860. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 17:46:03,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:46:06,491][06909] Updated weights for policy 0, policy_version 44843 (0.0030) [2024-06-27 17:46:08,850][06674] Fps is (10 sec: 40975.8, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 734773248. Throughput: 0: 43569.4. Samples: 637754680. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 17:46:08,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:46:10,803][06909] Updated weights for policy 0, policy_version 44853 (0.0038) [2024-06-27 17:46:13,850][06674] Fps is (10 sec: 47513.7, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 735019008. Throughput: 0: 43711.2. Samples: 637886940. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 17:46:13,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:46:14,534][06909] Updated weights for policy 0, policy_version 44863 (0.0034) [2024-06-27 17:46:18,462][06909] Updated weights for policy 0, policy_version 44873 (0.0036) [2024-06-27 17:46:18,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43144.6, 300 sec: 43820.3). Total num frames: 735199232. Throughput: 0: 43523.6. Samples: 638138660. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 17:46:18,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:46:21,925][06909] Updated weights for policy 0, policy_version 44883 (0.0031) [2024-06-27 17:46:23,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 735428608. Throughput: 0: 43483.9. Samples: 638406980. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 17:46:23,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:46:25,796][06909] Updated weights for policy 0, policy_version 44893 (0.0022) [2024-06-27 17:46:27,033][06887] Signal inference workers to stop experience collection... (9200 times) [2024-06-27 17:46:27,033][06887] Signal inference workers to resume experience collection... (9200 times) [2024-06-27 17:46:27,047][06909] InferenceWorker_p0-w0: stopping experience collection (9200 times) [2024-06-27 17:46:27,048][06909] InferenceWorker_p0-w0: resuming experience collection (9200 times) [2024-06-27 17:46:28,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43417.7, 300 sec: 43820.3). Total num frames: 735657984. Throughput: 0: 43451.8. Samples: 638536000. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 17:46:28,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:46:29,278][06909] Updated weights for policy 0, policy_version 44903 (0.0035) [2024-06-27 17:46:33,502][06909] Updated weights for policy 0, policy_version 44913 (0.0033) [2024-06-27 17:46:33,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43417.6, 300 sec: 43875.8). Total num frames: 735870976. Throughput: 0: 43521.5. Samples: 638790840. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 17:46:33,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:46:37,192][06909] Updated weights for policy 0, policy_version 44923 (0.0029) [2024-06-27 17:46:38,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.6, 300 sec: 43820.2). Total num frames: 736083968. Throughput: 0: 43546.7. Samples: 639058680. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 17:46:38,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:46:41,284][06909] Updated weights for policy 0, policy_version 44933 (0.0037) [2024-06-27 17:46:43,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43692.2, 300 sec: 43875.8). Total num frames: 736329728. Throughput: 0: 43659.4. Samples: 639192780. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 17:46:43,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:46:44,387][06909] Updated weights for policy 0, policy_version 44943 (0.0034) [2024-06-27 17:46:48,666][06909] Updated weights for policy 0, policy_version 44953 (0.0028) [2024-06-27 17:46:48,852][06674] Fps is (10 sec: 42589.8, 60 sec: 43144.5, 300 sec: 43764.4). Total num frames: 736509952. Throughput: 0: 43652.7. Samples: 639452320. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 17:46:48,852][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:46:51,714][06909] Updated weights for policy 0, policy_version 44963 (0.0037) [2024-06-27 17:46:53,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44236.9, 300 sec: 43876.1). Total num frames: 736755712. Throughput: 0: 43701.4. Samples: 639721240. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 17:46:53,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:46:55,834][06909] Updated weights for policy 0, policy_version 44973 (0.0042) [2024-06-27 17:46:58,850][06674] Fps is (10 sec: 47523.2, 60 sec: 43693.5, 300 sec: 43875.8). Total num frames: 736985088. Throughput: 0: 43640.0. Samples: 639850740. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 17:46:58,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:46:59,075][06909] Updated weights for policy 0, policy_version 44983 (0.0035) [2024-06-27 17:47:03,358][06909] Updated weights for policy 0, policy_version 44993 (0.0026) [2024-06-27 17:47:03,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 737165312. Throughput: 0: 43830.2. Samples: 640111020. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 17:47:03,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:47:06,859][06909] Updated weights for policy 0, policy_version 45003 (0.0035) [2024-06-27 17:47:08,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44509.9, 300 sec: 43986.9). Total num frames: 737443840. Throughput: 0: 43844.0. Samples: 640379960. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 17:47:08,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:47:10,555][06909] Updated weights for policy 0, policy_version 45013 (0.0033) [2024-06-27 17:47:13,850][06674] Fps is (10 sec: 47513.5, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 737640448. Throughput: 0: 43976.9. Samples: 640514960. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 17:47:13,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 17:47:14,053][06909] Updated weights for policy 0, policy_version 45023 (0.0028) [2024-06-27 17:47:18,354][06909] Updated weights for policy 0, policy_version 45033 (0.0030) [2024-06-27 17:47:18,850][06674] Fps is (10 sec: 39321.8, 60 sec: 43963.7, 300 sec: 43820.3). Total num frames: 737837056. Throughput: 0: 44147.7. Samples: 640777480. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 17:47:18,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:47:21,457][06909] Updated weights for policy 0, policy_version 45043 (0.0033) [2024-06-27 17:47:23,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44509.9, 300 sec: 43986.9). Total num frames: 738099200. Throughput: 0: 44009.8. Samples: 641039120. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 17:47:23,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:47:25,607][06909] Updated weights for policy 0, policy_version 45053 (0.0025) [2024-06-27 17:47:28,835][06909] Updated weights for policy 0, policy_version 45063 (0.0041) [2024-06-27 17:47:28,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 738312192. Throughput: 0: 44100.8. Samples: 641177320. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 17:47:28,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:47:33,124][06909] Updated weights for policy 0, policy_version 45073 (0.0037) [2024-06-27 17:47:33,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 738508800. Throughput: 0: 44164.7. Samples: 641439640. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 17:47:33,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:47:36,396][06909] Updated weights for policy 0, policy_version 45083 (0.0032) [2024-06-27 17:47:38,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 738754560. Throughput: 0: 43988.8. Samples: 641700740. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 17:47:38,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:47:40,472][06909] Updated weights for policy 0, policy_version 45093 (0.0025) [2024-06-27 17:47:43,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 738951168. Throughput: 0: 44018.6. Samples: 641831580. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 17:47:43,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:47:43,912][06909] Updated weights for policy 0, policy_version 45103 (0.0039) [2024-06-27 17:47:48,057][06909] Updated weights for policy 0, policy_version 45113 (0.0043) [2024-06-27 17:47:48,850][06674] Fps is (10 sec: 40960.3, 60 sec: 44238.3, 300 sec: 43876.7). Total num frames: 739164160. Throughput: 0: 44099.1. Samples: 642095480. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 17:47:48,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:47:48,932][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000045116_739180544.pth... [2024-06-27 17:47:48,978][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000044474_728662016.pth [2024-06-27 17:47:51,497][06909] Updated weights for policy 0, policy_version 45123 (0.0034) [2024-06-27 17:47:53,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 739409920. Throughput: 0: 43828.8. Samples: 642352260. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 17:47:53,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:47:55,324][06909] Updated weights for policy 0, policy_version 45133 (0.0025) [2024-06-27 17:47:58,683][06887] Signal inference workers to stop experience collection... (9250 times) [2024-06-27 17:47:58,730][06909] InferenceWorker_p0-w0: stopping experience collection (9250 times) [2024-06-27 17:47:58,730][06887] Signal inference workers to resume experience collection... (9250 times) [2024-06-27 17:47:58,746][06909] InferenceWorker_p0-w0: resuming experience collection (9250 times) [2024-06-27 17:47:58,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43690.6, 300 sec: 43820.2). Total num frames: 739606528. Throughput: 0: 43987.0. Samples: 642494380. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 17:47:58,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:47:58,871][06909] Updated weights for policy 0, policy_version 45143 (0.0027) [2024-06-27 17:48:02,866][06909] Updated weights for policy 0, policy_version 45153 (0.0032) [2024-06-27 17:48:03,850][06674] Fps is (10 sec: 40960.7, 60 sec: 44236.8, 300 sec: 43820.3). Total num frames: 739819520. Throughput: 0: 43898.2. Samples: 642752900. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 17:48:03,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:48:06,365][06909] Updated weights for policy 0, policy_version 45163 (0.0036) [2024-06-27 17:48:08,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43417.5, 300 sec: 43986.8). Total num frames: 740048896. Throughput: 0: 43872.7. Samples: 643013400. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 17:48:08,851][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:48:10,302][06909] Updated weights for policy 0, policy_version 45173 (0.0022) [2024-06-27 17:48:13,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43690.6, 300 sec: 43820.3). Total num frames: 740261888. Throughput: 0: 43906.7. Samples: 643153120. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 17:48:13,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:48:13,898][06909] Updated weights for policy 0, policy_version 45183 (0.0028) [2024-06-27 17:48:17,594][06909] Updated weights for policy 0, policy_version 45193 (0.0042) [2024-06-27 17:48:18,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44509.8, 300 sec: 43986.8). Total num frames: 740507648. Throughput: 0: 43941.7. Samples: 643417020. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 17:48:18,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:48:21,133][06909] Updated weights for policy 0, policy_version 45203 (0.0036) [2024-06-27 17:48:23,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 740720640. Throughput: 0: 43921.3. Samples: 643677200. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 17:48:23,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:48:24,892][06909] Updated weights for policy 0, policy_version 45213 (0.0042) [2024-06-27 17:48:28,799][06909] Updated weights for policy 0, policy_version 45223 (0.0032) [2024-06-27 17:48:28,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 740933632. Throughput: 0: 43863.1. Samples: 643805420. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 17:48:28,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 17:48:32,167][06909] Updated weights for policy 0, policy_version 45233 (0.0038) [2024-06-27 17:48:33,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 741146624. Throughput: 0: 43781.8. Samples: 644065660. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 17:48:33,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:48:36,358][06909] Updated weights for policy 0, policy_version 45243 (0.0036) [2024-06-27 17:48:38,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 741392384. Throughput: 0: 43888.5. Samples: 644327240. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 17:48:38,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:48:39,964][06909] Updated weights for policy 0, policy_version 45253 (0.0030) [2024-06-27 17:48:43,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 741572608. Throughput: 0: 43858.3. Samples: 644468000. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 17:48:43,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:48:43,906][06909] Updated weights for policy 0, policy_version 45263 (0.0035) [2024-06-27 17:48:47,195][06909] Updated weights for policy 0, policy_version 45273 (0.0032) [2024-06-27 17:48:48,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43963.7, 300 sec: 43820.2). Total num frames: 741801984. Throughput: 0: 43931.0. Samples: 644729800. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 17:48:48,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 17:48:51,130][06909] Updated weights for policy 0, policy_version 45283 (0.0027) [2024-06-27 17:48:53,850][06674] Fps is (10 sec: 45874.4, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 742031360. Throughput: 0: 44072.0. Samples: 644996640. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 17:48:53,850][06674] Avg episode reward: [(0, '0.453')] [2024-06-27 17:48:54,012][06887] Saving new best policy, reward=0.453! [2024-06-27 17:48:54,816][06909] Updated weights for policy 0, policy_version 45293 (0.0023) [2024-06-27 17:48:58,647][06909] Updated weights for policy 0, policy_version 45303 (0.0037) [2024-06-27 17:48:58,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.9, 300 sec: 43876.1). Total num frames: 742260736. Throughput: 0: 43961.8. Samples: 645131400. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 17:48:58,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 17:49:02,141][06909] Updated weights for policy 0, policy_version 45313 (0.0035) [2024-06-27 17:49:03,850][06674] Fps is (10 sec: 44237.5, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 742473728. Throughput: 0: 43877.4. Samples: 645391500. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 17:49:03,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:49:06,069][06909] Updated weights for policy 0, policy_version 45323 (0.0033) [2024-06-27 17:49:08,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 742703104. Throughput: 0: 43870.2. Samples: 645651360. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 17:49:08,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:49:09,477][06909] Updated weights for policy 0, policy_version 45333 (0.0026) [2024-06-27 17:49:13,786][06909] Updated weights for policy 0, policy_version 45343 (0.0029) [2024-06-27 17:49:13,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.8, 300 sec: 43820.3). Total num frames: 742899712. Throughput: 0: 44043.3. Samples: 645787360. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 17:49:13,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:49:14,459][06887] Signal inference workers to stop experience collection... (9300 times) [2024-06-27 17:49:14,463][06887] Signal inference workers to resume experience collection... (9300 times) [2024-06-27 17:49:14,500][06909] InferenceWorker_p0-w0: stopping experience collection (9300 times) [2024-06-27 17:49:14,500][06909] InferenceWorker_p0-w0: resuming experience collection (9300 times) [2024-06-27 17:49:17,066][06909] Updated weights for policy 0, policy_version 45353 (0.0027) [2024-06-27 17:49:18,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 743129088. Throughput: 0: 44047.0. Samples: 646047780. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 17:49:18,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:49:20,950][06909] Updated weights for policy 0, policy_version 45363 (0.0029) [2024-06-27 17:49:23,852][06674] Fps is (10 sec: 44227.3, 60 sec: 43689.2, 300 sec: 43875.5). Total num frames: 743342080. Throughput: 0: 43969.6. Samples: 646305960. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 17:49:23,852][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:49:24,387][06909] Updated weights for policy 0, policy_version 45373 (0.0037) [2024-06-27 17:49:28,418][06909] Updated weights for policy 0, policy_version 45383 (0.0037) [2024-06-27 17:49:28,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43963.8, 300 sec: 43820.6). Total num frames: 743571456. Throughput: 0: 43933.8. Samples: 646445020. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 17:49:28,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:49:31,707][06909] Updated weights for policy 0, policy_version 45393 (0.0028) [2024-06-27 17:49:33,850][06674] Fps is (10 sec: 42606.6, 60 sec: 43690.5, 300 sec: 43876.7). Total num frames: 743768064. Throughput: 0: 44017.7. Samples: 646710600. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 17:49:33,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 17:49:35,975][06909] Updated weights for policy 0, policy_version 45403 (0.0035) [2024-06-27 17:49:38,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 744013824. Throughput: 0: 43852.6. Samples: 646970000. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 17:49:38,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:49:39,341][06909] Updated weights for policy 0, policy_version 45413 (0.0037) [2024-06-27 17:49:43,352][06909] Updated weights for policy 0, policy_version 45423 (0.0035) [2024-06-27 17:49:43,850][06674] Fps is (10 sec: 47514.7, 60 sec: 44509.9, 300 sec: 43875.8). Total num frames: 744243200. Throughput: 0: 43825.0. Samples: 647103520. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 17:49:43,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:49:46,985][06909] Updated weights for policy 0, policy_version 45433 (0.0033) [2024-06-27 17:49:48,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43690.8, 300 sec: 43875.8). Total num frames: 744423424. Throughput: 0: 43860.5. Samples: 647365220. Policy #0 lag: (min: 1.0, avg: 11.6, max: 23.0) [2024-06-27 17:49:48,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:49:48,869][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000045436_744423424.pth... [2024-06-27 17:49:48,921][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000044795_733921280.pth [2024-06-27 17:49:50,600][06909] Updated weights for policy 0, policy_version 45443 (0.0030) [2024-06-27 17:49:53,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 744669184. Throughput: 0: 43915.6. Samples: 647627560. Policy #0 lag: (min: 1.0, avg: 11.6, max: 23.0) [2024-06-27 17:49:53,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:49:54,255][06909] Updated weights for policy 0, policy_version 45453 (0.0035) [2024-06-27 17:49:58,189][06909] Updated weights for policy 0, policy_version 45463 (0.0026) [2024-06-27 17:49:58,850][06674] Fps is (10 sec: 47513.1, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 744898560. Throughput: 0: 43960.3. Samples: 647765580. Policy #0 lag: (min: 1.0, avg: 11.6, max: 23.0) [2024-06-27 17:49:58,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:50:01,621][06909] Updated weights for policy 0, policy_version 45473 (0.0027) [2024-06-27 17:50:03,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 745095168. Throughput: 0: 43843.2. Samples: 648020720. Policy #0 lag: (min: 1.0, avg: 11.6, max: 23.0) [2024-06-27 17:50:03,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:50:05,763][06909] Updated weights for policy 0, policy_version 45483 (0.0021) [2024-06-27 17:50:08,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 745340928. Throughput: 0: 44123.7. Samples: 648291440. Policy #0 lag: (min: 1.0, avg: 11.6, max: 23.0) [2024-06-27 17:50:08,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 17:50:08,954][06909] Updated weights for policy 0, policy_version 45493 (0.0039) [2024-06-27 17:50:13,058][06909] Updated weights for policy 0, policy_version 45503 (0.0031) [2024-06-27 17:50:13,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44509.8, 300 sec: 43931.3). Total num frames: 745570304. Throughput: 0: 44144.4. Samples: 648431520. Policy #0 lag: (min: 1.0, avg: 10.2, max: 22.0) [2024-06-27 17:50:13,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 17:50:16,731][06909] Updated weights for policy 0, policy_version 45513 (0.0032) [2024-06-27 17:50:18,850][06674] Fps is (10 sec: 39321.9, 60 sec: 43417.7, 300 sec: 43820.3). Total num frames: 745734144. Throughput: 0: 43954.8. Samples: 648688560. Policy #0 lag: (min: 1.0, avg: 10.2, max: 22.0) [2024-06-27 17:50:18,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 17:50:20,525][06909] Updated weights for policy 0, policy_version 45523 (0.0045) [2024-06-27 17:50:23,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44238.3, 300 sec: 43875.8). Total num frames: 745996288. Throughput: 0: 43971.1. Samples: 648948700. Policy #0 lag: (min: 1.0, avg: 10.2, max: 22.0) [2024-06-27 17:50:23,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 17:50:24,318][06909] Updated weights for policy 0, policy_version 45533 (0.0030) [2024-06-27 17:50:27,893][06909] Updated weights for policy 0, policy_version 45543 (0.0022) [2024-06-27 17:50:27,916][06887] Signal inference workers to stop experience collection... (9350 times) [2024-06-27 17:50:27,916][06887] Signal inference workers to resume experience collection... (9350 times) [2024-06-27 17:50:27,933][06909] InferenceWorker_p0-w0: stopping experience collection (9350 times) [2024-06-27 17:50:27,933][06909] InferenceWorker_p0-w0: resuming experience collection (9350 times) [2024-06-27 17:50:28,856][06674] Fps is (10 sec: 50759.7, 60 sec: 44505.3, 300 sec: 43986.0). Total num frames: 746242048. Throughput: 0: 44060.6. Samples: 649086520. Policy #0 lag: (min: 1.0, avg: 10.2, max: 22.0) [2024-06-27 17:50:28,856][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:50:31,870][06909] Updated weights for policy 0, policy_version 45553 (0.0022) [2024-06-27 17:50:33,850][06674] Fps is (10 sec: 40959.3, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 746405888. Throughput: 0: 43908.2. Samples: 649341100. Policy #0 lag: (min: 1.0, avg: 10.2, max: 22.0) [2024-06-27 17:50:33,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:50:35,364][06909] Updated weights for policy 0, policy_version 45563 (0.0033) [2024-06-27 17:50:38,850][06674] Fps is (10 sec: 37706.1, 60 sec: 43417.6, 300 sec: 43765.0). Total num frames: 746618880. Throughput: 0: 43952.0. Samples: 649605400. Policy #0 lag: (min: 1.0, avg: 10.2, max: 22.0) [2024-06-27 17:50:38,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:50:39,206][06909] Updated weights for policy 0, policy_version 45573 (0.0023) [2024-06-27 17:50:42,946][06909] Updated weights for policy 0, policy_version 45583 (0.0034) [2024-06-27 17:50:43,850][06674] Fps is (10 sec: 47514.4, 60 sec: 43963.7, 300 sec: 43931.6). Total num frames: 746881024. Throughput: 0: 43914.7. Samples: 649741740. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 17:50:43,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:50:46,603][06909] Updated weights for policy 0, policy_version 45593 (0.0019) [2024-06-27 17:50:48,852][06674] Fps is (10 sec: 42589.4, 60 sec: 43689.1, 300 sec: 43875.5). Total num frames: 747044864. Throughput: 0: 43952.1. Samples: 649998660. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 17:50:48,853][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:50:50,300][06909] Updated weights for policy 0, policy_version 45603 (0.0033) [2024-06-27 17:50:53,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43963.7, 300 sec: 43876.4). Total num frames: 747307008. Throughput: 0: 43871.6. Samples: 650265660. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 17:50:53,851][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:50:54,106][06909] Updated weights for policy 0, policy_version 45613 (0.0027) [2024-06-27 17:50:57,653][06909] Updated weights for policy 0, policy_version 45623 (0.0025) [2024-06-27 17:50:58,850][06674] Fps is (10 sec: 50800.9, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 747552768. Throughput: 0: 43861.7. Samples: 650405300. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 17:50:58,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 17:51:01,497][06909] Updated weights for policy 0, policy_version 45633 (0.0026) [2024-06-27 17:51:03,856][06674] Fps is (10 sec: 40935.6, 60 sec: 43686.3, 300 sec: 43874.9). Total num frames: 747716608. Throughput: 0: 43778.2. Samples: 650658840. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 17:51:03,856][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 17:51:05,135][06909] Updated weights for policy 0, policy_version 45643 (0.0040) [2024-06-27 17:51:08,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43690.8, 300 sec: 43875.8). Total num frames: 747962368. Throughput: 0: 43899.5. Samples: 650924180. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 17:51:08,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:51:09,169][06909] Updated weights for policy 0, policy_version 45653 (0.0031) [2024-06-27 17:51:12,567][06909] Updated weights for policy 0, policy_version 45663 (0.0033) [2024-06-27 17:51:13,850][06674] Fps is (10 sec: 49181.9, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 748208128. Throughput: 0: 43950.9. Samples: 651064040. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-27 17:51:13,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:51:16,504][06909] Updated weights for policy 0, policy_version 45673 (0.0047) [2024-06-27 17:51:18,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 748371968. Throughput: 0: 43980.6. Samples: 651320220. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-27 17:51:18,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:51:20,238][06909] Updated weights for policy 0, policy_version 45683 (0.0027) [2024-06-27 17:51:23,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 748617728. Throughput: 0: 43795.6. Samples: 651576200. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-27 17:51:23,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 17:51:23,932][06909] Updated weights for policy 0, policy_version 45693 (0.0025) [2024-06-27 17:51:27,810][06909] Updated weights for policy 0, policy_version 45703 (0.0025) [2024-06-27 17:51:28,850][06674] Fps is (10 sec: 49151.8, 60 sec: 43695.1, 300 sec: 44042.4). Total num frames: 748863488. Throughput: 0: 43835.5. Samples: 651714340. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-27 17:51:28,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:51:31,346][06909] Updated weights for policy 0, policy_version 45713 (0.0027) [2024-06-27 17:51:33,850][06674] Fps is (10 sec: 40959.3, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 749027328. Throughput: 0: 43832.2. Samples: 651971020. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-27 17:51:33,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:51:35,253][06909] Updated weights for policy 0, policy_version 45723 (0.0043) [2024-06-27 17:51:38,850][06674] Fps is (10 sec: 40958.9, 60 sec: 44236.6, 300 sec: 43875.8). Total num frames: 749273088. Throughput: 0: 43670.5. Samples: 652230840. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-27 17:51:38,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:51:39,402][06909] Updated weights for policy 0, policy_version 45733 (0.0023) [2024-06-27 17:51:42,681][06909] Updated weights for policy 0, policy_version 45743 (0.0031) [2024-06-27 17:51:43,850][06674] Fps is (10 sec: 49152.4, 60 sec: 43963.7, 300 sec: 44098.3). Total num frames: 749518848. Throughput: 0: 43647.1. Samples: 652369420. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-27 17:51:43,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:51:46,686][06909] Updated weights for policy 0, policy_version 45753 (0.0034) [2024-06-27 17:51:47,044][06887] Signal inference workers to stop experience collection... (9400 times) [2024-06-27 17:51:47,045][06887] Signal inference workers to resume experience collection... (9400 times) [2024-06-27 17:51:47,072][06909] InferenceWorker_p0-w0: stopping experience collection (9400 times) [2024-06-27 17:51:47,072][06909] InferenceWorker_p0-w0: resuming experience collection (9400 times) [2024-06-27 17:51:48,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43965.2, 300 sec: 43820.2). Total num frames: 749682688. Throughput: 0: 43759.9. Samples: 652627780. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-27 17:51:48,851][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:51:48,886][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000045758_749699072.pth... [2024-06-27 17:51:48,941][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000045116_739180544.pth [2024-06-27 17:51:50,136][06909] Updated weights for policy 0, policy_version 45763 (0.0033) [2024-06-27 17:51:53,850][06674] Fps is (10 sec: 39321.9, 60 sec: 43417.7, 300 sec: 43820.3). Total num frames: 749912064. Throughput: 0: 43724.9. Samples: 652891800. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-27 17:51:53,858][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:51:54,242][06909] Updated weights for policy 0, policy_version 45773 (0.0044) [2024-06-27 17:51:57,726][06909] Updated weights for policy 0, policy_version 45783 (0.0031) [2024-06-27 17:51:58,850][06674] Fps is (10 sec: 49152.4, 60 sec: 43690.7, 300 sec: 44097.9). Total num frames: 750174208. Throughput: 0: 43594.1. Samples: 653025780. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-27 17:51:58,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:52:01,770][06909] Updated weights for policy 0, policy_version 45793 (0.0032) [2024-06-27 17:52:03,856][06674] Fps is (10 sec: 42572.2, 60 sec: 43690.6, 300 sec: 43708.3). Total num frames: 750338048. Throughput: 0: 43534.9. Samples: 653279560. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-27 17:52:03,857][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:52:05,527][06909] Updated weights for policy 0, policy_version 45803 (0.0021) [2024-06-27 17:52:08,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 750583808. Throughput: 0: 43548.9. Samples: 653535900. Policy #0 lag: (min: 0.0, avg: 11.9, max: 22.0) [2024-06-27 17:52:08,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:52:09,530][06909] Updated weights for policy 0, policy_version 45813 (0.0040) [2024-06-27 17:52:13,015][06909] Updated weights for policy 0, policy_version 45823 (0.0034) [2024-06-27 17:52:13,850][06674] Fps is (10 sec: 49181.7, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 750829568. Throughput: 0: 43565.2. Samples: 653674780. Policy #0 lag: (min: 0.0, avg: 11.9, max: 22.0) [2024-06-27 17:52:13,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:52:16,914][06909] Updated weights for policy 0, policy_version 45833 (0.0034) [2024-06-27 17:52:18,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 750993408. Throughput: 0: 43706.3. Samples: 653937800. Policy #0 lag: (min: 0.0, avg: 11.9, max: 22.0) [2024-06-27 17:52:18,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:52:20,423][06909] Updated weights for policy 0, policy_version 45843 (0.0039) [2024-06-27 17:52:23,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 751239168. Throughput: 0: 43643.4. Samples: 654194780. Policy #0 lag: (min: 0.0, avg: 11.9, max: 22.0) [2024-06-27 17:52:23,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:52:24,427][06909] Updated weights for policy 0, policy_version 45853 (0.0033) [2024-06-27 17:52:27,738][06909] Updated weights for policy 0, policy_version 45863 (0.0035) [2024-06-27 17:52:28,850][06674] Fps is (10 sec: 49152.2, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 751484928. Throughput: 0: 43620.1. Samples: 654332320. Policy #0 lag: (min: 0.0, avg: 11.9, max: 22.0) [2024-06-27 17:52:28,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:52:31,697][06909] Updated weights for policy 0, policy_version 45873 (0.0023) [2024-06-27 17:52:33,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.8, 300 sec: 43764.7). Total num frames: 751665152. Throughput: 0: 43789.9. Samples: 654598320. Policy #0 lag: (min: 0.0, avg: 11.9, max: 22.0) [2024-06-27 17:52:33,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:52:35,326][06909] Updated weights for policy 0, policy_version 45883 (0.0045) [2024-06-27 17:52:38,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43690.8, 300 sec: 43875.8). Total num frames: 751894528. Throughput: 0: 43570.6. Samples: 654852480. Policy #0 lag: (min: 0.0, avg: 11.2, max: 23.0) [2024-06-27 17:52:38,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:52:39,099][06909] Updated weights for policy 0, policy_version 45893 (0.0028) [2024-06-27 17:52:43,153][06909] Updated weights for policy 0, policy_version 45903 (0.0038) [2024-06-27 17:52:43,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43417.7, 300 sec: 43931.3). Total num frames: 752123904. Throughput: 0: 43553.5. Samples: 654985680. Policy #0 lag: (min: 0.0, avg: 11.2, max: 23.0) [2024-06-27 17:52:43,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:52:46,824][06909] Updated weights for policy 0, policy_version 45913 (0.0035) [2024-06-27 17:52:48,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43690.8, 300 sec: 43709.2). Total num frames: 752304128. Throughput: 0: 43808.6. Samples: 655250680. Policy #0 lag: (min: 0.0, avg: 11.2, max: 23.0) [2024-06-27 17:52:48,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:52:50,573][06909] Updated weights for policy 0, policy_version 45923 (0.0034) [2024-06-27 17:52:53,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 752549888. Throughput: 0: 43661.8. Samples: 655500680. Policy #0 lag: (min: 0.0, avg: 11.2, max: 23.0) [2024-06-27 17:52:53,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:52:54,773][06909] Updated weights for policy 0, policy_version 45933 (0.0057) [2024-06-27 17:52:58,050][06909] Updated weights for policy 0, policy_version 45943 (0.0036) [2024-06-27 17:52:58,856][06674] Fps is (10 sec: 49122.3, 60 sec: 43686.3, 300 sec: 43986.0). Total num frames: 752795648. Throughput: 0: 43623.5. Samples: 655638100. Policy #0 lag: (min: 0.0, avg: 11.2, max: 23.0) [2024-06-27 17:52:58,857][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:53:02,071][06909] Updated weights for policy 0, policy_version 45953 (0.0025) [2024-06-27 17:53:03,852][06674] Fps is (10 sec: 42589.5, 60 sec: 43966.7, 300 sec: 43820.0). Total num frames: 752975872. Throughput: 0: 43661.1. Samples: 655902640. Policy #0 lag: (min: 0.0, avg: 11.2, max: 23.0) [2024-06-27 17:53:03,852][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 17:53:05,446][06909] Updated weights for policy 0, policy_version 45963 (0.0022) [2024-06-27 17:53:08,852][06674] Fps is (10 sec: 40976.2, 60 sec: 43689.1, 300 sec: 43875.5). Total num frames: 753205248. Throughput: 0: 43767.2. Samples: 656164400. Policy #0 lag: (min: 0.0, avg: 12.7, max: 24.0) [2024-06-27 17:53:08,852][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:53:09,379][06909] Updated weights for policy 0, policy_version 45973 (0.0021) [2024-06-27 17:53:11,384][06887] Signal inference workers to stop experience collection... (9450 times) [2024-06-27 17:53:11,384][06887] Signal inference workers to resume experience collection... (9450 times) [2024-06-27 17:53:11,403][06909] InferenceWorker_p0-w0: stopping experience collection (9450 times) [2024-06-27 17:53:11,403][06909] InferenceWorker_p0-w0: resuming experience collection (9450 times) [2024-06-27 17:53:12,756][06909] Updated weights for policy 0, policy_version 45983 (0.0035) [2024-06-27 17:53:13,850][06674] Fps is (10 sec: 47523.1, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 753451008. Throughput: 0: 43737.7. Samples: 656300520. Policy #0 lag: (min: 0.0, avg: 12.7, max: 24.0) [2024-06-27 17:53:13,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:53:16,638][06909] Updated weights for policy 0, policy_version 45993 (0.0038) [2024-06-27 17:53:18,850][06674] Fps is (10 sec: 42607.3, 60 sec: 43963.7, 300 sec: 43764.7). Total num frames: 753631232. Throughput: 0: 43697.8. Samples: 656564720. Policy #0 lag: (min: 0.0, avg: 12.7, max: 24.0) [2024-06-27 17:53:18,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 17:53:20,242][06909] Updated weights for policy 0, policy_version 46003 (0.0031) [2024-06-27 17:53:23,850][06674] Fps is (10 sec: 39322.0, 60 sec: 43417.6, 300 sec: 43764.7). Total num frames: 753844224. Throughput: 0: 43636.1. Samples: 656816100. Policy #0 lag: (min: 0.0, avg: 12.7, max: 24.0) [2024-06-27 17:53:23,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:53:24,356][06909] Updated weights for policy 0, policy_version 46013 (0.0036) [2024-06-27 17:53:27,942][06909] Updated weights for policy 0, policy_version 46023 (0.0028) [2024-06-27 17:53:28,852][06674] Fps is (10 sec: 45865.9, 60 sec: 43416.1, 300 sec: 43875.5). Total num frames: 754089984. Throughput: 0: 43623.3. Samples: 656948820. Policy #0 lag: (min: 0.0, avg: 12.7, max: 24.0) [2024-06-27 17:53:28,852][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 17:53:31,981][06909] Updated weights for policy 0, policy_version 46033 (0.0036) [2024-06-27 17:53:33,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 754286592. Throughput: 0: 43465.8. Samples: 657206640. Policy #0 lag: (min: 0.0, avg: 12.7, max: 24.0) [2024-06-27 17:53:33,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:53:35,766][06909] Updated weights for policy 0, policy_version 46043 (0.0036) [2024-06-27 17:53:38,850][06674] Fps is (10 sec: 42607.1, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 754515968. Throughput: 0: 43852.9. Samples: 657474060. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 17:53:38,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:53:39,191][06909] Updated weights for policy 0, policy_version 46053 (0.0039) [2024-06-27 17:53:43,034][06909] Updated weights for policy 0, policy_version 46063 (0.0029) [2024-06-27 17:53:43,850][06674] Fps is (10 sec: 45874.5, 60 sec: 43690.5, 300 sec: 43875.8). Total num frames: 754745344. Throughput: 0: 43768.4. Samples: 657607420. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 17:53:43,851][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 17:53:46,766][06909] Updated weights for policy 0, policy_version 46073 (0.0029) [2024-06-27 17:53:48,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43963.7, 300 sec: 43764.7). Total num frames: 754941952. Throughput: 0: 43741.9. Samples: 657870940. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 17:53:48,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:53:48,981][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000046079_754958336.pth... [2024-06-27 17:53:49,035][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000045436_744423424.pth [2024-06-27 17:53:50,442][06909] Updated weights for policy 0, policy_version 46083 (0.0034) [2024-06-27 17:53:53,850][06674] Fps is (10 sec: 40960.9, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 755154944. Throughput: 0: 43729.2. Samples: 658132120. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 17:53:53,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:53:54,269][06909] Updated weights for policy 0, policy_version 46093 (0.0031) [2024-06-27 17:53:57,763][06909] Updated weights for policy 0, policy_version 46103 (0.0033) [2024-06-27 17:53:58,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43422.0, 300 sec: 43820.3). Total num frames: 755400704. Throughput: 0: 43695.6. Samples: 658266820. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 17:53:58,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:54:01,799][06909] Updated weights for policy 0, policy_version 46113 (0.0034) [2024-06-27 17:54:03,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43692.2, 300 sec: 43709.2). Total num frames: 755597312. Throughput: 0: 43616.1. Samples: 658527440. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 17:54:03,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:54:05,439][06909] Updated weights for policy 0, policy_version 46123 (0.0031) [2024-06-27 17:54:08,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43692.2, 300 sec: 43820.2). Total num frames: 755826688. Throughput: 0: 43827.5. Samples: 658788340. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 17:54:08,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:54:09,220][06909] Updated weights for policy 0, policy_version 46133 (0.0032) [2024-06-27 17:54:12,919][06909] Updated weights for policy 0, policy_version 46143 (0.0035) [2024-06-27 17:54:13,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43417.7, 300 sec: 43820.3). Total num frames: 756056064. Throughput: 0: 43857.1. Samples: 658922300. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 17:54:13,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:54:16,661][06909] Updated weights for policy 0, policy_version 46153 (0.0035) [2024-06-27 17:54:18,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43690.6, 300 sec: 43765.0). Total num frames: 756252672. Throughput: 0: 43814.1. Samples: 659178280. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 17:54:18,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:54:20,480][06909] Updated weights for policy 0, policy_version 46163 (0.0036) [2024-06-27 17:54:23,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 43764.7). Total num frames: 756482048. Throughput: 0: 43802.7. Samples: 659445180. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 17:54:23,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:54:24,003][06909] Updated weights for policy 0, policy_version 46173 (0.0028) [2024-06-27 17:54:28,020][06909] Updated weights for policy 0, policy_version 46183 (0.0039) [2024-06-27 17:54:28,850][06674] Fps is (10 sec: 45876.1, 60 sec: 43692.2, 300 sec: 43875.8). Total num frames: 756711424. Throughput: 0: 43743.3. Samples: 659575860. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 17:54:28,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:54:31,891][06909] Updated weights for policy 0, policy_version 46193 (0.0030) [2024-06-27 17:54:33,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 756908032. Throughput: 0: 43654.7. Samples: 659835400. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 17:54:33,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:54:35,542][06909] Updated weights for policy 0, policy_version 46203 (0.0040) [2024-06-27 17:54:38,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 757137408. Throughput: 0: 43839.1. Samples: 660104880. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 17:54:38,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:54:39,152][06909] Updated weights for policy 0, policy_version 46213 (0.0031) [2024-06-27 17:54:43,102][06909] Updated weights for policy 0, policy_version 46223 (0.0042) [2024-06-27 17:54:43,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43690.8, 300 sec: 43875.8). Total num frames: 757366784. Throughput: 0: 43670.7. Samples: 660232000. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 17:54:43,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:54:46,878][06909] Updated weights for policy 0, policy_version 46233 (0.0041) [2024-06-27 17:54:48,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.8, 300 sec: 43709.2). Total num frames: 757563392. Throughput: 0: 43657.8. Samples: 660492040. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 17:54:48,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:54:50,683][06909] Updated weights for policy 0, policy_version 46243 (0.0037) [2024-06-27 17:54:51,094][06887] Signal inference workers to stop experience collection... (9500 times) [2024-06-27 17:54:51,141][06909] InferenceWorker_p0-w0: stopping experience collection (9500 times) [2024-06-27 17:54:51,209][06887] Signal inference workers to resume experience collection... (9500 times) [2024-06-27 17:54:51,210][06909] InferenceWorker_p0-w0: resuming experience collection (9500 times) [2024-06-27 17:54:53,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 757792768. Throughput: 0: 43605.3. Samples: 660750580. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 17:54:53,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:54:54,406][06909] Updated weights for policy 0, policy_version 46253 (0.0051) [2024-06-27 17:54:58,428][06909] Updated weights for policy 0, policy_version 46263 (0.0027) [2024-06-27 17:54:58,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43144.6, 300 sec: 43709.2). Total num frames: 757989376. Throughput: 0: 43465.4. Samples: 660878240. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 17:54:58,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:55:02,133][06909] Updated weights for policy 0, policy_version 46273 (0.0037) [2024-06-27 17:55:03,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 758235136. Throughput: 0: 43683.3. Samples: 661144020. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 17:55:03,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:55:05,771][06909] Updated weights for policy 0, policy_version 46283 (0.0034) [2024-06-27 17:55:08,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 758448128. Throughput: 0: 43504.9. Samples: 661402900. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 17:55:08,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:55:09,611][06909] Updated weights for policy 0, policy_version 46293 (0.0028) [2024-06-27 17:55:13,294][06909] Updated weights for policy 0, policy_version 46303 (0.0030) [2024-06-27 17:55:13,852][06674] Fps is (10 sec: 42589.6, 60 sec: 43416.1, 300 sec: 43820.0). Total num frames: 758661120. Throughput: 0: 43589.5. Samples: 661537480. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 17:55:13,861][06674] Avg episode reward: [(0, '0.408')] [2024-06-27 17:55:17,051][06909] Updated weights for policy 0, policy_version 46313 (0.0026) [2024-06-27 17:55:18,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.9, 300 sec: 43709.2). Total num frames: 758890496. Throughput: 0: 43678.8. Samples: 661800940. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 17:55:18,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:55:20,668][06909] Updated weights for policy 0, policy_version 46323 (0.0039) [2024-06-27 17:55:23,856][06674] Fps is (10 sec: 44219.4, 60 sec: 43686.3, 300 sec: 43598.1). Total num frames: 759103488. Throughput: 0: 43412.5. Samples: 662058700. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 17:55:23,856][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 17:55:24,313][06909] Updated weights for policy 0, policy_version 46333 (0.0033) [2024-06-27 17:55:28,342][06909] Updated weights for policy 0, policy_version 46343 (0.0038) [2024-06-27 17:55:28,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43417.6, 300 sec: 43764.8). Total num frames: 759316480. Throughput: 0: 43548.5. Samples: 662191680. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 17:55:28,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:55:32,005][06909] Updated weights for policy 0, policy_version 46353 (0.0026) [2024-06-27 17:55:33,856][06674] Fps is (10 sec: 44236.4, 60 sec: 43959.4, 300 sec: 43819.4). Total num frames: 759545856. Throughput: 0: 43618.5. Samples: 662455140. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-27 17:55:33,856][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:55:35,944][06909] Updated weights for policy 0, policy_version 46363 (0.0039) [2024-06-27 17:55:38,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 759758848. Throughput: 0: 43639.6. Samples: 662714360. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-27 17:55:38,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:55:39,294][06909] Updated weights for policy 0, policy_version 46373 (0.0034) [2024-06-27 17:55:43,463][06909] Updated weights for policy 0, policy_version 46383 (0.0035) [2024-06-27 17:55:43,850][06674] Fps is (10 sec: 40984.1, 60 sec: 43144.4, 300 sec: 43765.0). Total num frames: 759955456. Throughput: 0: 43640.7. Samples: 662842080. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-27 17:55:43,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 17:55:46,895][06909] Updated weights for policy 0, policy_version 46393 (0.0029) [2024-06-27 17:55:48,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.6, 300 sec: 43653.7). Total num frames: 760184832. Throughput: 0: 43595.1. Samples: 663105800. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-27 17:55:48,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:55:48,880][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000046398_760184832.pth... [2024-06-27 17:55:48,957][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000045758_749699072.pth [2024-06-27 17:55:50,968][06909] Updated weights for policy 0, policy_version 46403 (0.0033) [2024-06-27 17:55:53,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 760414208. Throughput: 0: 43613.2. Samples: 663365500. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-27 17:55:53,853][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:55:54,451][06909] Updated weights for policy 0, policy_version 46413 (0.0037) [2024-06-27 17:55:58,320][06909] Updated weights for policy 0, policy_version 46423 (0.0027) [2024-06-27 17:55:58,852][06674] Fps is (10 sec: 44227.8, 60 sec: 43962.2, 300 sec: 43765.3). Total num frames: 760627200. Throughput: 0: 43562.7. Samples: 663497800. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-27 17:55:58,852][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:56:01,609][06909] Updated weights for policy 0, policy_version 46433 (0.0037) [2024-06-27 17:56:03,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 760856576. Throughput: 0: 43681.7. Samples: 663766620. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 17:56:03,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:56:05,854][06909] Updated weights for policy 0, policy_version 46443 (0.0052) [2024-06-27 17:56:08,850][06674] Fps is (10 sec: 44246.1, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 761069568. Throughput: 0: 43720.5. Samples: 664025860. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 17:56:08,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:56:08,883][06909] Updated weights for policy 0, policy_version 46453 (0.0020) [2024-06-27 17:56:13,191][06909] Updated weights for policy 0, policy_version 46463 (0.0024) [2024-06-27 17:56:13,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43692.1, 300 sec: 43764.7). Total num frames: 761282560. Throughput: 0: 43793.6. Samples: 664162400. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 17:56:13,851][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:56:16,567][06909] Updated weights for policy 0, policy_version 46473 (0.0031) [2024-06-27 17:56:18,850][06674] Fps is (10 sec: 42597.5, 60 sec: 43417.5, 300 sec: 43653.6). Total num frames: 761495552. Throughput: 0: 43692.4. Samples: 664421040. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 17:56:18,851][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:56:20,831][06909] Updated weights for policy 0, policy_version 46483 (0.0025) [2024-06-27 17:56:23,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43421.9, 300 sec: 43542.6). Total num frames: 761708544. Throughput: 0: 43648.5. Samples: 664678540. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 17:56:23,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:56:24,064][06887] Signal inference workers to stop experience collection... (9550 times) [2024-06-27 17:56:24,110][06909] InferenceWorker_p0-w0: stopping experience collection (9550 times) [2024-06-27 17:56:24,179][06887] Signal inference workers to resume experience collection... (9550 times) [2024-06-27 17:56:24,179][06909] InferenceWorker_p0-w0: resuming experience collection (9550 times) [2024-06-27 17:56:24,319][06909] Updated weights for policy 0, policy_version 46493 (0.0037) [2024-06-27 17:56:28,598][06909] Updated weights for policy 0, policy_version 46503 (0.0032) [2024-06-27 17:56:28,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 761921536. Throughput: 0: 43826.9. Samples: 664814280. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 17:56:28,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:56:31,686][06909] Updated weights for policy 0, policy_version 46513 (0.0028) [2024-06-27 17:56:33,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43148.9, 300 sec: 43598.1). Total num frames: 762134528. Throughput: 0: 43742.2. Samples: 665074200. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-27 17:56:33,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:56:35,983][06909] Updated weights for policy 0, policy_version 46523 (0.0039) [2024-06-27 17:56:38,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 762380288. Throughput: 0: 43768.5. Samples: 665335080. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-27 17:56:38,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:56:38,987][06909] Updated weights for policy 0, policy_version 46533 (0.0027) [2024-06-27 17:56:43,284][06909] Updated weights for policy 0, policy_version 46543 (0.0031) [2024-06-27 17:56:43,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.8, 300 sec: 43764.7). Total num frames: 762593280. Throughput: 0: 43810.0. Samples: 665469160. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-27 17:56:43,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 17:56:46,315][06909] Updated weights for policy 0, policy_version 46553 (0.0032) [2024-06-27 17:56:48,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.7, 300 sec: 43764.7). Total num frames: 762822656. Throughput: 0: 43673.4. Samples: 665731920. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-27 17:56:48,856][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:56:50,787][06909] Updated weights for policy 0, policy_version 46563 (0.0044) [2024-06-27 17:56:53,666][06909] Updated weights for policy 0, policy_version 46573 (0.0031) [2024-06-27 17:56:53,850][06674] Fps is (10 sec: 45874.4, 60 sec: 43963.7, 300 sec: 43653.6). Total num frames: 763052032. Throughput: 0: 43796.2. Samples: 665996700. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-27 17:56:53,851][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:56:58,179][06909] Updated weights for policy 0, policy_version 46583 (0.0029) [2024-06-27 17:56:58,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43692.1, 300 sec: 43765.6). Total num frames: 763248640. Throughput: 0: 43787.5. Samples: 666132840. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-27 17:56:58,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:57:01,379][06909] Updated weights for policy 0, policy_version 46593 (0.0029) [2024-06-27 17:57:03,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 763478016. Throughput: 0: 43687.2. Samples: 666386960. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 17:57:03,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:57:05,861][06909] Updated weights for policy 0, policy_version 46603 (0.0030) [2024-06-27 17:57:08,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.5, 300 sec: 43598.1). Total num frames: 763691008. Throughput: 0: 43820.4. Samples: 666650460. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 17:57:08,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:57:08,963][06909] Updated weights for policy 0, policy_version 46613 (0.0030) [2024-06-27 17:57:13,186][06909] Updated weights for policy 0, policy_version 46623 (0.0029) [2024-06-27 17:57:13,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.8, 300 sec: 43820.3). Total num frames: 763920384. Throughput: 0: 43868.9. Samples: 666788380. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 17:57:13,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:57:16,357][06909] Updated weights for policy 0, policy_version 46633 (0.0030) [2024-06-27 17:57:18,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43963.8, 300 sec: 43709.2). Total num frames: 764133376. Throughput: 0: 43816.5. Samples: 667045940. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 17:57:18,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:57:20,550][06909] Updated weights for policy 0, policy_version 46643 (0.0030) [2024-06-27 17:57:23,761][06909] Updated weights for policy 0, policy_version 46653 (0.0028) [2024-06-27 17:57:23,852][06674] Fps is (10 sec: 44227.4, 60 sec: 44235.3, 300 sec: 43653.3). Total num frames: 764362752. Throughput: 0: 43822.0. Samples: 667307160. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 17:57:23,852][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 17:57:28,148][06909] Updated weights for policy 0, policy_version 46663 (0.0030) [2024-06-27 17:57:28,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 764559360. Throughput: 0: 43844.9. Samples: 667442180. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 17:57:28,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:57:31,390][06909] Updated weights for policy 0, policy_version 46673 (0.0028) [2024-06-27 17:57:33,850][06674] Fps is (10 sec: 40968.5, 60 sec: 43963.8, 300 sec: 43653.6). Total num frames: 764772352. Throughput: 0: 43774.7. Samples: 667701780. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-27 17:57:33,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:57:35,739][06909] Updated weights for policy 0, policy_version 46683 (0.0033) [2024-06-27 17:57:38,739][06909] Updated weights for policy 0, policy_version 46693 (0.0034) [2024-06-27 17:57:38,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43963.8, 300 sec: 43709.2). Total num frames: 765018112. Throughput: 0: 43615.8. Samples: 667959400. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-27 17:57:38,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:57:43,453][06909] Updated weights for policy 0, policy_version 46703 (0.0037) [2024-06-27 17:57:43,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 765214720. Throughput: 0: 43665.8. Samples: 668097800. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-27 17:57:43,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:57:44,909][06887] Signal inference workers to stop experience collection... (9600 times) [2024-06-27 17:57:44,948][06909] InferenceWorker_p0-w0: stopping experience collection (9600 times) [2024-06-27 17:57:44,960][06887] Signal inference workers to resume experience collection... (9600 times) [2024-06-27 17:57:44,969][06909] InferenceWorker_p0-w0: resuming experience collection (9600 times) [2024-06-27 17:57:46,143][06909] Updated weights for policy 0, policy_version 46713 (0.0029) [2024-06-27 17:57:48,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43417.6, 300 sec: 43653.6). Total num frames: 765427712. Throughput: 0: 43652.9. Samples: 668351340. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-27 17:57:48,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:57:48,978][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000046719_765444096.pth... [2024-06-27 17:57:49,021][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000046079_754958336.pth [2024-06-27 17:57:50,882][06909] Updated weights for policy 0, policy_version 46723 (0.0022) [2024-06-27 17:57:53,711][06909] Updated weights for policy 0, policy_version 46733 (0.0031) [2024-06-27 17:57:53,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43690.8, 300 sec: 43654.5). Total num frames: 765673472. Throughput: 0: 43566.3. Samples: 668610940. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-27 17:57:53,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:57:58,221][06909] Updated weights for policy 0, policy_version 46743 (0.0036) [2024-06-27 17:57:58,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43417.7, 300 sec: 43653.9). Total num frames: 765853696. Throughput: 0: 43522.6. Samples: 668746900. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-27 17:57:58,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:58:01,376][06909] Updated weights for policy 0, policy_version 46753 (0.0027) [2024-06-27 17:58:03,850][06674] Fps is (10 sec: 40959.2, 60 sec: 43417.4, 300 sec: 43653.9). Total num frames: 766083072. Throughput: 0: 43585.6. Samples: 669007300. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-27 17:58:03,851][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 17:58:05,733][06909] Updated weights for policy 0, policy_version 46763 (0.0042) [2024-06-27 17:58:08,774][06909] Updated weights for policy 0, policy_version 46773 (0.0033) [2024-06-27 17:58:08,850][06674] Fps is (10 sec: 47513.9, 60 sec: 43963.8, 300 sec: 43653.7). Total num frames: 766328832. Throughput: 0: 43657.6. Samples: 669271660. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-27 17:58:08,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 17:58:13,342][06909] Updated weights for policy 0, policy_version 46783 (0.0036) [2024-06-27 17:58:13,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43417.5, 300 sec: 43709.2). Total num frames: 766525440. Throughput: 0: 43734.6. Samples: 669410240. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-27 17:58:13,851][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:58:16,072][06909] Updated weights for policy 0, policy_version 46793 (0.0039) [2024-06-27 17:58:18,850][06674] Fps is (10 sec: 40959.4, 60 sec: 43417.5, 300 sec: 43709.2). Total num frames: 766738432. Throughput: 0: 43797.6. Samples: 669672680. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-27 17:58:18,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:58:20,558][06909] Updated weights for policy 0, policy_version 46803 (0.0046) [2024-06-27 17:58:23,367][06909] Updated weights for policy 0, policy_version 46813 (0.0034) [2024-06-27 17:58:23,850][06674] Fps is (10 sec: 47513.8, 60 sec: 43965.2, 300 sec: 43765.0). Total num frames: 767000576. Throughput: 0: 43679.9. Samples: 669925000. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-27 17:58:23,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 17:58:27,995][06909] Updated weights for policy 0, policy_version 46823 (0.0028) [2024-06-27 17:58:28,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43417.6, 300 sec: 43653.6). Total num frames: 767164416. Throughput: 0: 43753.9. Samples: 670066720. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 17:58:28,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:58:31,052][06909] Updated weights for policy 0, policy_version 46833 (0.0034) [2024-06-27 17:58:33,850][06674] Fps is (10 sec: 39321.5, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 767393792. Throughput: 0: 43817.2. Samples: 670323120. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 17:58:33,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 17:58:35,335][06909] Updated weights for policy 0, policy_version 46843 (0.0036) [2024-06-27 17:58:38,646][06909] Updated weights for policy 0, policy_version 46853 (0.0038) [2024-06-27 17:58:38,850][06674] Fps is (10 sec: 47513.8, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 767639552. Throughput: 0: 43822.8. Samples: 670582960. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 17:58:38,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:58:42,914][06909] Updated weights for policy 0, policy_version 46863 (0.0031) [2024-06-27 17:58:43,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43690.8, 300 sec: 43709.2). Total num frames: 767836160. Throughput: 0: 43930.3. Samples: 670723760. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 17:58:43,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:58:45,995][06909] Updated weights for policy 0, policy_version 46873 (0.0047) [2024-06-27 17:58:48,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 768049152. Throughput: 0: 43958.9. Samples: 670985440. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 17:58:48,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:58:50,349][06909] Updated weights for policy 0, policy_version 46883 (0.0032) [2024-06-27 17:58:53,649][06909] Updated weights for policy 0, policy_version 46893 (0.0038) [2024-06-27 17:58:53,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 768294912. Throughput: 0: 43825.3. Samples: 671243800. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 17:58:53,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:58:58,134][06909] Updated weights for policy 0, policy_version 46903 (0.0024) [2024-06-27 17:58:58,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 768491520. Throughput: 0: 43840.0. Samples: 671383040. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-27 17:58:58,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:59:00,911][06909] Updated weights for policy 0, policy_version 46913 (0.0038) [2024-06-27 17:59:03,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43690.9, 300 sec: 43653.7). Total num frames: 768704512. Throughput: 0: 43721.5. Samples: 671640140. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-27 17:59:03,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:59:04,927][06887] Signal inference workers to stop experience collection... (9650 times) [2024-06-27 17:59:04,968][06909] InferenceWorker_p0-w0: stopping experience collection (9650 times) [2024-06-27 17:59:05,046][06887] Signal inference workers to resume experience collection... (9650 times) [2024-06-27 17:59:05,046][06909] InferenceWorker_p0-w0: resuming experience collection (9650 times) [2024-06-27 17:59:05,456][06909] Updated weights for policy 0, policy_version 46923 (0.0025) [2024-06-27 17:59:08,487][06909] Updated weights for policy 0, policy_version 46933 (0.0024) [2024-06-27 17:59:08,850][06674] Fps is (10 sec: 47513.8, 60 sec: 43963.7, 300 sec: 43764.7). Total num frames: 768966656. Throughput: 0: 43751.1. Samples: 671893800. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-27 17:59:08,853][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:59:13,182][06909] Updated weights for policy 0, policy_version 46943 (0.0032) [2024-06-27 17:59:13,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 769146880. Throughput: 0: 43672.8. Samples: 672032000. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-27 17:59:13,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:59:15,842][06909] Updated weights for policy 0, policy_version 46953 (0.0032) [2024-06-27 17:59:18,850][06674] Fps is (10 sec: 39321.6, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 769359872. Throughput: 0: 43735.6. Samples: 672291220. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-27 17:59:18,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 17:59:20,502][06909] Updated weights for policy 0, policy_version 46963 (0.0038) [2024-06-27 17:59:23,583][06909] Updated weights for policy 0, policy_version 46973 (0.0027) [2024-06-27 17:59:23,850][06674] Fps is (10 sec: 47513.8, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 769622016. Throughput: 0: 43681.2. Samples: 672548620. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-27 17:59:23,851][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 17:59:28,263][06909] Updated weights for policy 0, policy_version 46983 (0.0040) [2024-06-27 17:59:28,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.6, 300 sec: 43709.2). Total num frames: 769802240. Throughput: 0: 43563.4. Samples: 672684120. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 17:59:28,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:59:31,176][06909] Updated weights for policy 0, policy_version 46993 (0.0043) [2024-06-27 17:59:33,850][06674] Fps is (10 sec: 39321.5, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 770015232. Throughput: 0: 43460.8. Samples: 672941180. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 17:59:33,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:59:35,878][06909] Updated weights for policy 0, policy_version 47003 (0.0032) [2024-06-27 17:59:38,569][06909] Updated weights for policy 0, policy_version 47013 (0.0027) [2024-06-27 17:59:38,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 770260992. Throughput: 0: 43418.6. Samples: 673197640. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 17:59:38,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 17:59:43,389][06909] Updated weights for policy 0, policy_version 47023 (0.0030) [2024-06-27 17:59:43,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43417.6, 300 sec: 43653.6). Total num frames: 770441216. Throughput: 0: 43506.8. Samples: 673340840. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 17:59:43,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 17:59:46,129][06909] Updated weights for policy 0, policy_version 47033 (0.0040) [2024-06-27 17:59:48,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 770670592. Throughput: 0: 43572.4. Samples: 673600900. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 17:59:48,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 17:59:48,870][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000047038_770670592.pth... [2024-06-27 17:59:48,930][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000046398_760184832.pth [2024-06-27 17:59:50,893][06909] Updated weights for policy 0, policy_version 47043 (0.0038) [2024-06-27 17:59:53,569][06909] Updated weights for policy 0, policy_version 47053 (0.0029) [2024-06-27 17:59:53,850][06674] Fps is (10 sec: 47512.8, 60 sec: 43690.6, 300 sec: 43820.2). Total num frames: 770916352. Throughput: 0: 43656.4. Samples: 673858340. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 17:59:53,851][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 17:59:58,426][06909] Updated weights for policy 0, policy_version 47063 (0.0038) [2024-06-27 17:59:58,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43144.6, 300 sec: 43542.6). Total num frames: 771080192. Throughput: 0: 43712.1. Samples: 673999040. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 17:59:58,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:00:01,089][06909] Updated weights for policy 0, policy_version 47073 (0.0036) [2024-06-27 18:00:03,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 771325952. Throughput: 0: 43614.6. Samples: 674253880. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 18:00:03,851][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 18:00:05,865][06909] Updated weights for policy 0, policy_version 47083 (0.0047) [2024-06-27 18:00:08,710][06909] Updated weights for policy 0, policy_version 47093 (0.0026) [2024-06-27 18:00:08,850][06674] Fps is (10 sec: 49152.2, 60 sec: 43417.6, 300 sec: 43765.0). Total num frames: 771571712. Throughput: 0: 43613.4. Samples: 674511220. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 18:00:08,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:00:13,432][06909] Updated weights for policy 0, policy_version 47103 (0.0038) [2024-06-27 18:00:13,766][06887] Signal inference workers to stop experience collection... (9700 times) [2024-06-27 18:00:13,767][06887] Signal inference workers to resume experience collection... (9700 times) [2024-06-27 18:00:13,790][06909] InferenceWorker_p0-w0: stopping experience collection (9700 times) [2024-06-27 18:00:13,790][06909] InferenceWorker_p0-w0: resuming experience collection (9700 times) [2024-06-27 18:00:13,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43690.8, 300 sec: 43653.6). Total num frames: 771768320. Throughput: 0: 43684.6. Samples: 674649920. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 18:00:13,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:00:16,208][06909] Updated weights for policy 0, policy_version 47113 (0.0024) [2024-06-27 18:00:18,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43690.7, 300 sec: 43654.5). Total num frames: 771981312. Throughput: 0: 43682.7. Samples: 674906900. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 18:00:18,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 18:00:20,712][06909] Updated weights for policy 0, policy_version 47123 (0.0028) [2024-06-27 18:00:23,439][06909] Updated weights for policy 0, policy_version 47133 (0.0028) [2024-06-27 18:00:23,850][06674] Fps is (10 sec: 47513.0, 60 sec: 43690.7, 300 sec: 43820.2). Total num frames: 772243456. Throughput: 0: 43806.6. Samples: 675168940. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 18:00:23,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:00:28,383][06909] Updated weights for policy 0, policy_version 47143 (0.0039) [2024-06-27 18:00:28,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43417.6, 300 sec: 43599.0). Total num frames: 772407296. Throughput: 0: 43719.9. Samples: 675308240. Policy #0 lag: (min: 1.0, avg: 10.0, max: 20.0) [2024-06-27 18:00:28,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:00:31,435][06909] Updated weights for policy 0, policy_version 47153 (0.0040) [2024-06-27 18:00:33,850][06674] Fps is (10 sec: 39321.8, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 772636672. Throughput: 0: 43598.6. Samples: 675562840. Policy #0 lag: (min: 1.0, avg: 10.0, max: 20.0) [2024-06-27 18:00:33,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:00:35,994][06909] Updated weights for policy 0, policy_version 47163 (0.0030) [2024-06-27 18:00:38,673][06909] Updated weights for policy 0, policy_version 47173 (0.0039) [2024-06-27 18:00:38,851][06674] Fps is (10 sec: 47508.9, 60 sec: 43689.9, 300 sec: 43820.1). Total num frames: 772882432. Throughput: 0: 43672.9. Samples: 675823660. Policy #0 lag: (min: 1.0, avg: 10.0, max: 20.0) [2024-06-27 18:00:38,852][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:00:43,290][06909] Updated weights for policy 0, policy_version 47183 (0.0028) [2024-06-27 18:00:43,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.6, 300 sec: 43709.2). Total num frames: 773079040. Throughput: 0: 43674.6. Samples: 675964400. Policy #0 lag: (min: 1.0, avg: 10.0, max: 20.0) [2024-06-27 18:00:43,851][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:00:46,101][06909] Updated weights for policy 0, policy_version 47193 (0.0025) [2024-06-27 18:00:48,850][06674] Fps is (10 sec: 40964.3, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 773292032. Throughput: 0: 43757.4. Samples: 676222960. Policy #0 lag: (min: 1.0, avg: 10.0, max: 20.0) [2024-06-27 18:00:48,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:00:50,827][06909] Updated weights for policy 0, policy_version 47203 (0.0034) [2024-06-27 18:00:53,765][06909] Updated weights for policy 0, policy_version 47213 (0.0036) [2024-06-27 18:00:53,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43690.7, 300 sec: 43765.0). Total num frames: 773537792. Throughput: 0: 43763.5. Samples: 676480580. Policy #0 lag: (min: 1.0, avg: 10.0, max: 20.0) [2024-06-27 18:00:53,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:00:58,315][06909] Updated weights for policy 0, policy_version 47223 (0.0033) [2024-06-27 18:00:58,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.7, 300 sec: 43653.6). Total num frames: 773734400. Throughput: 0: 43674.5. Samples: 676615280. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 18:00:58,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:01:01,107][06909] Updated weights for policy 0, policy_version 47233 (0.0041) [2024-06-27 18:01:03,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 773947392. Throughput: 0: 43777.8. Samples: 676876900. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 18:01:03,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:01:05,891][06909] Updated weights for policy 0, policy_version 47243 (0.0035) [2024-06-27 18:01:08,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 774176768. Throughput: 0: 43582.3. Samples: 677130140. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 18:01:08,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:01:08,859][06909] Updated weights for policy 0, policy_version 47253 (0.0043) [2024-06-27 18:01:13,451][06909] Updated weights for policy 0, policy_version 47263 (0.0040) [2024-06-27 18:01:13,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43417.5, 300 sec: 43653.7). Total num frames: 774373376. Throughput: 0: 43630.3. Samples: 677271600. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 18:01:13,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:01:16,370][06909] Updated weights for policy 0, policy_version 47273 (0.0031) [2024-06-27 18:01:18,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 774602752. Throughput: 0: 43825.3. Samples: 677534980. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 18:01:18,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:01:20,680][06909] Updated weights for policy 0, policy_version 47283 (0.0032) [2024-06-27 18:01:23,699][06909] Updated weights for policy 0, policy_version 47293 (0.0037) [2024-06-27 18:01:23,850][06674] Fps is (10 sec: 47514.0, 60 sec: 43417.7, 300 sec: 43820.3). Total num frames: 774848512. Throughput: 0: 43768.6. Samples: 677793200. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 18:01:23,856][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:01:27,945][06909] Updated weights for policy 0, policy_version 47303 (0.0035) [2024-06-27 18:01:28,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 43764.7). Total num frames: 775045120. Throughput: 0: 43659.2. Samples: 677929060. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-27 18:01:28,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:01:29,265][06887] Signal inference workers to stop experience collection... (9750 times) [2024-06-27 18:01:29,266][06887] Signal inference workers to resume experience collection... (9750 times) [2024-06-27 18:01:29,292][06909] InferenceWorker_p0-w0: stopping experience collection (9750 times) [2024-06-27 18:01:29,292][06909] InferenceWorker_p0-w0: resuming experience collection (9750 times) [2024-06-27 18:01:31,402][06909] Updated weights for policy 0, policy_version 47313 (0.0034) [2024-06-27 18:01:33,852][06674] Fps is (10 sec: 40951.3, 60 sec: 43689.2, 300 sec: 43653.3). Total num frames: 775258112. Throughput: 0: 43700.2. Samples: 678189560. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-27 18:01:33,853][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 18:01:35,545][06909] Updated weights for policy 0, policy_version 47323 (0.0037) [2024-06-27 18:01:38,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43418.4, 300 sec: 43709.2). Total num frames: 775487488. Throughput: 0: 43828.6. Samples: 678452860. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-27 18:01:38,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:01:38,943][06909] Updated weights for policy 0, policy_version 47333 (0.0043) [2024-06-27 18:01:43,077][06909] Updated weights for policy 0, policy_version 47343 (0.0034) [2024-06-27 18:01:43,852][06674] Fps is (10 sec: 44236.9, 60 sec: 43689.2, 300 sec: 43653.3). Total num frames: 775700480. Throughput: 0: 43796.8. Samples: 678586220. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-27 18:01:43,853][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:01:46,227][06909] Updated weights for policy 0, policy_version 47353 (0.0032) [2024-06-27 18:01:48,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 775913472. Throughput: 0: 43811.2. Samples: 678848400. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-27 18:01:48,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:01:48,866][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000047358_775913472.pth... [2024-06-27 18:01:48,930][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000046719_765444096.pth [2024-06-27 18:01:50,609][06909] Updated weights for policy 0, policy_version 47363 (0.0037) [2024-06-27 18:01:53,850][06674] Fps is (10 sec: 44246.0, 60 sec: 43417.7, 300 sec: 43709.2). Total num frames: 776142848. Throughput: 0: 43877.8. Samples: 679104640. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-27 18:01:53,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:01:54,033][06909] Updated weights for policy 0, policy_version 47373 (0.0040) [2024-06-27 18:01:58,199][06909] Updated weights for policy 0, policy_version 47383 (0.0036) [2024-06-27 18:01:58,850][06674] Fps is (10 sec: 44236.0, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 776355840. Throughput: 0: 43808.8. Samples: 679243000. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-27 18:01:58,851][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:02:01,451][06909] Updated weights for policy 0, policy_version 47393 (0.0029) [2024-06-27 18:02:03,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.7, 300 sec: 43653.7). Total num frames: 776568832. Throughput: 0: 43840.6. Samples: 679507800. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-27 18:02:03,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:02:05,508][06909] Updated weights for policy 0, policy_version 47403 (0.0031) [2024-06-27 18:02:08,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 776798208. Throughput: 0: 43831.8. Samples: 679765640. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-27 18:02:08,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:02:09,035][06909] Updated weights for policy 0, policy_version 47413 (0.0032) [2024-06-27 18:02:13,282][06909] Updated weights for policy 0, policy_version 47423 (0.0035) [2024-06-27 18:02:13,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.7, 300 sec: 43653.6). Total num frames: 777011200. Throughput: 0: 43703.5. Samples: 679895720. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-27 18:02:13,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:02:16,428][06909] Updated weights for policy 0, policy_version 47433 (0.0032) [2024-06-27 18:02:18,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.7, 300 sec: 43653.9). Total num frames: 777240576. Throughput: 0: 43706.8. Samples: 680156280. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-27 18:02:18,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:02:20,647][06909] Updated weights for policy 0, policy_version 47443 (0.0035) [2024-06-27 18:02:23,856][06674] Fps is (10 sec: 44210.3, 60 sec: 43413.2, 300 sec: 43708.3). Total num frames: 777453568. Throughput: 0: 43606.1. Samples: 680415400. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-27 18:02:23,856][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:02:24,125][06909] Updated weights for policy 0, policy_version 47453 (0.0032) [2024-06-27 18:02:28,591][06909] Updated weights for policy 0, policy_version 47463 (0.0028) [2024-06-27 18:02:28,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43417.7, 300 sec: 43653.6). Total num frames: 777650176. Throughput: 0: 43586.5. Samples: 680547520. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 18:02:28,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:02:31,651][06909] Updated weights for policy 0, policy_version 47473 (0.0037) [2024-06-27 18:02:33,850][06674] Fps is (10 sec: 44263.2, 60 sec: 43965.2, 300 sec: 43653.6). Total num frames: 777895936. Throughput: 0: 43599.9. Samples: 680810400. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 18:02:33,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:02:35,831][06909] Updated weights for policy 0, policy_version 47483 (0.0023) [2024-06-27 18:02:38,850][06674] Fps is (10 sec: 45874.4, 60 sec: 43690.5, 300 sec: 43709.2). Total num frames: 778108928. Throughput: 0: 43614.1. Samples: 681067280. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 18:02:38,851][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:02:39,392][06909] Updated weights for policy 0, policy_version 47493 (0.0035) [2024-06-27 18:02:43,189][06909] Updated weights for policy 0, policy_version 47503 (0.0034) [2024-06-27 18:02:43,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43692.2, 300 sec: 43709.2). Total num frames: 778321920. Throughput: 0: 43525.9. Samples: 681201660. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 18:02:43,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:02:46,613][06909] Updated weights for policy 0, policy_version 47513 (0.0048) [2024-06-27 18:02:48,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.6, 300 sec: 43653.6). Total num frames: 778551296. Throughput: 0: 43483.8. Samples: 681464580. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 18:02:48,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:02:50,822][06909] Updated weights for policy 0, policy_version 47523 (0.0030) [2024-06-27 18:02:53,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 778764288. Throughput: 0: 43676.9. Samples: 681731100. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 18:02:53,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:02:54,175][06909] Updated weights for policy 0, policy_version 47533 (0.0040) [2024-06-27 18:02:58,181][06909] Updated weights for policy 0, policy_version 47543 (0.0035) [2024-06-27 18:02:58,439][06887] Signal inference workers to stop experience collection... (9800 times) [2024-06-27 18:02:58,439][06887] Signal inference workers to resume experience collection... (9800 times) [2024-06-27 18:02:58,459][06909] InferenceWorker_p0-w0: stopping experience collection (9800 times) [2024-06-27 18:02:58,459][06909] InferenceWorker_p0-w0: resuming experience collection (9800 times) [2024-06-27 18:02:58,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 778977280. Throughput: 0: 43650.9. Samples: 681860020. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-27 18:02:58,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:03:01,584][06909] Updated weights for policy 0, policy_version 47553 (0.0041) [2024-06-27 18:03:03,850][06674] Fps is (10 sec: 44237.7, 60 sec: 43963.8, 300 sec: 43653.7). Total num frames: 779206656. Throughput: 0: 43611.7. Samples: 682118800. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-27 18:03:03,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:03:05,551][06909] Updated weights for policy 0, policy_version 47563 (0.0027) [2024-06-27 18:03:08,850][06674] Fps is (10 sec: 44238.3, 60 sec: 43690.8, 300 sec: 43709.2). Total num frames: 779419648. Throughput: 0: 43686.0. Samples: 682381000. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-27 18:03:08,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:03:08,974][06909] Updated weights for policy 0, policy_version 47573 (0.0040) [2024-06-27 18:03:13,219][06909] Updated weights for policy 0, policy_version 47583 (0.0036) [2024-06-27 18:03:13,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43417.6, 300 sec: 43653.7). Total num frames: 779616256. Throughput: 0: 43618.6. Samples: 682510360. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-27 18:03:13,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 18:03:16,994][06909] Updated weights for policy 0, policy_version 47593 (0.0037) [2024-06-27 18:03:18,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 779862016. Throughput: 0: 43587.6. Samples: 682771840. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-27 18:03:18,851][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:03:20,519][06909] Updated weights for policy 0, policy_version 47603 (0.0030) [2024-06-27 18:03:23,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43695.1, 300 sec: 43764.7). Total num frames: 780075008. Throughput: 0: 43729.1. Samples: 683035080. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-27 18:03:23,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:03:24,428][06909] Updated weights for policy 0, policy_version 47613 (0.0025) [2024-06-27 18:03:28,281][06909] Updated weights for policy 0, policy_version 47623 (0.0031) [2024-06-27 18:03:28,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43690.7, 300 sec: 43653.7). Total num frames: 780271616. Throughput: 0: 43622.7. Samples: 683164680. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 18:03:28,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:03:31,947][06909] Updated weights for policy 0, policy_version 47633 (0.0035) [2024-06-27 18:03:33,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43417.8, 300 sec: 43598.1). Total num frames: 780500992. Throughput: 0: 43566.5. Samples: 683425060. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 18:03:33,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:03:35,799][06909] Updated weights for policy 0, policy_version 47643 (0.0036) [2024-06-27 18:03:38,853][06674] Fps is (10 sec: 45860.3, 60 sec: 43688.4, 300 sec: 43708.7). Total num frames: 780730368. Throughput: 0: 43523.2. Samples: 683689780. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 18:03:38,854][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:03:39,231][06909] Updated weights for policy 0, policy_version 47653 (0.0036) [2024-06-27 18:03:43,237][06909] Updated weights for policy 0, policy_version 47663 (0.0026) [2024-06-27 18:03:43,850][06674] Fps is (10 sec: 44235.8, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 780943360. Throughput: 0: 43531.3. Samples: 683818920. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 18:03:43,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 18:03:46,644][06909] Updated weights for policy 0, policy_version 47673 (0.0033) [2024-06-27 18:03:48,850][06674] Fps is (10 sec: 42612.0, 60 sec: 43417.7, 300 sec: 43598.1). Total num frames: 781156352. Throughput: 0: 43721.6. Samples: 684086280. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 18:03:48,851][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 18:03:48,992][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000047679_781172736.pth... [2024-06-27 18:03:49,041][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000047038_770670592.pth [2024-06-27 18:03:50,559][06909] Updated weights for policy 0, policy_version 47683 (0.0030) [2024-06-27 18:03:53,850][06674] Fps is (10 sec: 42599.2, 60 sec: 43417.7, 300 sec: 43653.7). Total num frames: 781369344. Throughput: 0: 43761.4. Samples: 684350260. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 18:03:53,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:03:54,102][06909] Updated weights for policy 0, policy_version 47693 (0.0036) [2024-06-27 18:03:57,846][06909] Updated weights for policy 0, policy_version 47703 (0.0040) [2024-06-27 18:03:58,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.8, 300 sec: 43709.2). Total num frames: 781598720. Throughput: 0: 43659.5. Samples: 684475040. Policy #0 lag: (min: 0.0, avg: 10.0, max: 19.0) [2024-06-27 18:03:58,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:04:01,799][06909] Updated weights for policy 0, policy_version 47713 (0.0029) [2024-06-27 18:04:03,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43417.6, 300 sec: 43542.6). Total num frames: 781811712. Throughput: 0: 43760.5. Samples: 684741060. Policy #0 lag: (min: 0.0, avg: 10.0, max: 19.0) [2024-06-27 18:04:03,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:04:05,230][06909] Updated weights for policy 0, policy_version 47723 (0.0036) [2024-06-27 18:04:08,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 782041088. Throughput: 0: 43728.4. Samples: 685002860. Policy #0 lag: (min: 0.0, avg: 10.0, max: 19.0) [2024-06-27 18:04:08,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 18:04:09,113][06909] Updated weights for policy 0, policy_version 47733 (0.0032) [2024-06-27 18:04:12,853][06909] Updated weights for policy 0, policy_version 47743 (0.0039) [2024-06-27 18:04:13,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 43709.2). Total num frames: 782254080. Throughput: 0: 43730.7. Samples: 685132560. Policy #0 lag: (min: 0.0, avg: 10.0, max: 19.0) [2024-06-27 18:04:13,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:04:16,823][06909] Updated weights for policy 0, policy_version 47753 (0.0039) [2024-06-27 18:04:18,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43417.6, 300 sec: 43542.6). Total num frames: 782467072. Throughput: 0: 43701.7. Samples: 685391640. Policy #0 lag: (min: 0.0, avg: 10.0, max: 19.0) [2024-06-27 18:04:18,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:04:20,353][06909] Updated weights for policy 0, policy_version 47763 (0.0037) [2024-06-27 18:04:23,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43417.6, 300 sec: 43653.7). Total num frames: 782680064. Throughput: 0: 43541.9. Samples: 685649020. Policy #0 lag: (min: 0.0, avg: 10.0, max: 19.0) [2024-06-27 18:04:23,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:04:24,327][06909] Updated weights for policy 0, policy_version 47773 (0.0021) [2024-06-27 18:04:28,034][06909] Updated weights for policy 0, policy_version 47783 (0.0038) [2024-06-27 18:04:28,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 782909440. Throughput: 0: 43589.4. Samples: 685780440. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-27 18:04:28,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:04:32,069][06909] Updated weights for policy 0, policy_version 47793 (0.0026) [2024-06-27 18:04:33,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 783122432. Throughput: 0: 43385.5. Samples: 686038620. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-27 18:04:33,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:04:34,698][06887] Signal inference workers to stop experience collection... (9850 times) [2024-06-27 18:04:34,699][06887] Signal inference workers to resume experience collection... (9850 times) [2024-06-27 18:04:34,743][06909] InferenceWorker_p0-w0: stopping experience collection (9850 times) [2024-06-27 18:04:34,743][06909] InferenceWorker_p0-w0: resuming experience collection (9850 times) [2024-06-27 18:04:35,395][06909] Updated weights for policy 0, policy_version 47803 (0.0033) [2024-06-27 18:04:38,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43146.9, 300 sec: 43653.6). Total num frames: 783319040. Throughput: 0: 43432.4. Samples: 686304720. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-27 18:04:38,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:04:39,694][06909] Updated weights for policy 0, policy_version 47813 (0.0022) [2024-06-27 18:04:42,700][06909] Updated weights for policy 0, policy_version 47823 (0.0032) [2024-06-27 18:04:43,850][06674] Fps is (10 sec: 44236.0, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 783564800. Throughput: 0: 43458.2. Samples: 686430660. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-27 18:04:43,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:04:47,107][06909] Updated weights for policy 0, policy_version 47833 (0.0034) [2024-06-27 18:04:48,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43690.8, 300 sec: 43598.1). Total num frames: 783777792. Throughput: 0: 43594.7. Samples: 686702820. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-27 18:04:48,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 18:04:50,017][06909] Updated weights for policy 0, policy_version 47843 (0.0032) [2024-06-27 18:04:53,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 783974400. Throughput: 0: 43555.7. Samples: 686962860. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-27 18:04:53,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:04:54,620][06909] Updated weights for policy 0, policy_version 47853 (0.0041) [2024-06-27 18:04:57,899][06909] Updated weights for policy 0, policy_version 47863 (0.0033) [2024-06-27 18:04:58,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.8, 300 sec: 43709.2). Total num frames: 784220160. Throughput: 0: 43551.1. Samples: 687092360. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-27 18:04:58,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 18:05:02,241][06909] Updated weights for policy 0, policy_version 47873 (0.0035) [2024-06-27 18:05:03,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 784433152. Throughput: 0: 43705.8. Samples: 687358400. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-27 18:05:03,856][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:05:05,132][06909] Updated weights for policy 0, policy_version 47883 (0.0040) [2024-06-27 18:05:08,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43144.6, 300 sec: 43598.1). Total num frames: 784629760. Throughput: 0: 43826.2. Samples: 687621200. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-27 18:05:08,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:05:09,544][06909] Updated weights for policy 0, policy_version 47893 (0.0040) [2024-06-27 18:05:12,577][06909] Updated weights for policy 0, policy_version 47903 (0.0040) [2024-06-27 18:05:13,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 784875520. Throughput: 0: 43780.1. Samples: 687750540. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-27 18:05:13,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:05:17,126][06909] Updated weights for policy 0, policy_version 47913 (0.0031) [2024-06-27 18:05:18,850][06674] Fps is (10 sec: 49151.1, 60 sec: 44236.7, 300 sec: 43653.6). Total num frames: 785121280. Throughput: 0: 44017.5. Samples: 688019420. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-27 18:05:18,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 18:05:20,145][06909] Updated weights for policy 0, policy_version 47923 (0.0028) [2024-06-27 18:05:23,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43417.6, 300 sec: 43653.7). Total num frames: 785285120. Throughput: 0: 43716.1. Samples: 688271940. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-27 18:05:23,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 18:05:24,964][06909] Updated weights for policy 0, policy_version 47933 (0.0032) [2024-06-27 18:05:27,611][06909] Updated weights for policy 0, policy_version 47943 (0.0034) [2024-06-27 18:05:28,850][06674] Fps is (10 sec: 40961.1, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 785530880. Throughput: 0: 43837.0. Samples: 688403320. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-27 18:05:28,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:05:32,277][06909] Updated weights for policy 0, policy_version 47953 (0.0031) [2024-06-27 18:05:33,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43690.6, 300 sec: 43598.3). Total num frames: 785743872. Throughput: 0: 43733.8. Samples: 688670840. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-27 18:05:33,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:05:35,020][06909] Updated weights for policy 0, policy_version 47963 (0.0026) [2024-06-27 18:05:38,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 785940480. Throughput: 0: 43680.8. Samples: 688928500. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-27 18:05:38,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:05:39,650][06909] Updated weights for policy 0, policy_version 47973 (0.0028) [2024-06-27 18:05:42,832][06909] Updated weights for policy 0, policy_version 47983 (0.0041) [2024-06-27 18:05:43,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.8, 300 sec: 43709.2). Total num frames: 786186240. Throughput: 0: 43604.5. Samples: 689054560. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-27 18:05:43,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 18:05:47,262][06909] Updated weights for policy 0, policy_version 47993 (0.0034) [2024-06-27 18:05:48,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 786399232. Throughput: 0: 43700.4. Samples: 689324920. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-27 18:05:48,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:05:48,869][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000047998_786399232.pth... [2024-06-27 18:05:48,939][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000047358_775913472.pth [2024-06-27 18:05:50,064][06909] Updated weights for policy 0, policy_version 48003 (0.0025) [2024-06-27 18:05:53,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43963.7, 300 sec: 43653.7). Total num frames: 786612224. Throughput: 0: 43714.2. Samples: 689588340. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-27 18:05:53,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 18:05:54,555][06909] Updated weights for policy 0, policy_version 48013 (0.0036) [2024-06-27 18:05:57,324][06909] Updated weights for policy 0, policy_version 48023 (0.0032) [2024-06-27 18:05:58,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 786841600. Throughput: 0: 43694.1. Samples: 689716780. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 18:05:58,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:06:02,084][06909] Updated weights for policy 0, policy_version 48033 (0.0028) [2024-06-27 18:06:03,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43963.8, 300 sec: 43709.2). Total num frames: 787070976. Throughput: 0: 43664.6. Samples: 689984320. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 18:06:03,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 18:06:04,715][06909] Updated weights for policy 0, policy_version 48043 (0.0033) [2024-06-27 18:06:08,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 787251200. Throughput: 0: 43832.3. Samples: 690244400. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 18:06:08,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:06:09,765][06909] Updated weights for policy 0, policy_version 48053 (0.0030) [2024-06-27 18:06:12,740][06909] Updated weights for policy 0, policy_version 48063 (0.0040) [2024-06-27 18:06:13,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43417.6, 300 sec: 43653.7). Total num frames: 787480576. Throughput: 0: 43714.2. Samples: 690370460. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 18:06:13,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:06:17,045][06909] Updated weights for policy 0, policy_version 48073 (0.0027) [2024-06-27 18:06:18,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43144.7, 300 sec: 43598.1). Total num frames: 787709952. Throughput: 0: 43682.1. Samples: 690636540. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 18:06:18,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:06:19,366][06887] Signal inference workers to stop experience collection... (9900 times) [2024-06-27 18:06:19,405][06909] InferenceWorker_p0-w0: stopping experience collection (9900 times) [2024-06-27 18:06:19,485][06887] Signal inference workers to resume experience collection... (9900 times) [2024-06-27 18:06:19,486][06909] InferenceWorker_p0-w0: resuming experience collection (9900 times) [2024-06-27 18:06:20,209][06909] Updated weights for policy 0, policy_version 48083 (0.0032) [2024-06-27 18:06:23,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 787906560. Throughput: 0: 43694.2. Samples: 690894740. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 18:06:23,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:06:24,673][06909] Updated weights for policy 0, policy_version 48093 (0.0026) [2024-06-27 18:06:27,571][06909] Updated weights for policy 0, policy_version 48103 (0.0036) [2024-06-27 18:06:28,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43417.5, 300 sec: 43654.0). Total num frames: 788135936. Throughput: 0: 43849.7. Samples: 691027800. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 18:06:28,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:06:32,052][06909] Updated weights for policy 0, policy_version 48113 (0.0036) [2024-06-27 18:06:33,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 788365312. Throughput: 0: 43608.5. Samples: 691287300. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 18:06:33,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:06:35,097][06909] Updated weights for policy 0, policy_version 48123 (0.0030) [2024-06-27 18:06:38,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.6, 300 sec: 43598.4). Total num frames: 788561920. Throughput: 0: 43588.0. Samples: 691549800. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 18:06:38,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:06:39,471][06909] Updated weights for policy 0, policy_version 48133 (0.0036) [2024-06-27 18:06:42,517][06909] Updated weights for policy 0, policy_version 48143 (0.0024) [2024-06-27 18:06:43,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 788807680. Throughput: 0: 43642.3. Samples: 691680680. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 18:06:43,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:06:46,970][06909] Updated weights for policy 0, policy_version 48153 (0.0042) [2024-06-27 18:06:48,851][06674] Fps is (10 sec: 45871.9, 60 sec: 43690.1, 300 sec: 43653.5). Total num frames: 789020672. Throughput: 0: 43597.0. Samples: 691946220. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 18:06:48,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:06:50,279][06909] Updated weights for policy 0, policy_version 48163 (0.0043) [2024-06-27 18:06:53,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 43653.7). Total num frames: 789233664. Throughput: 0: 43737.4. Samples: 692212580. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 18:06:53,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:06:54,405][06909] Updated weights for policy 0, policy_version 48173 (0.0021) [2024-06-27 18:06:57,956][06909] Updated weights for policy 0, policy_version 48183 (0.0023) [2024-06-27 18:06:58,853][06674] Fps is (10 sec: 44224.0, 60 sec: 43688.0, 300 sec: 43708.6). Total num frames: 789463040. Throughput: 0: 43755.9. Samples: 692339640. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-27 18:06:58,854][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 18:07:02,073][06909] Updated weights for policy 0, policy_version 48193 (0.0033) [2024-06-27 18:07:03,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43417.6, 300 sec: 43653.7). Total num frames: 789676032. Throughput: 0: 43581.4. Samples: 692597700. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-27 18:07:03,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:07:05,337][06909] Updated weights for policy 0, policy_version 48203 (0.0039) [2024-06-27 18:07:08,850][06674] Fps is (10 sec: 40975.2, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 789872640. Throughput: 0: 43839.6. Samples: 692867520. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-27 18:07:08,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:07:09,429][06909] Updated weights for policy 0, policy_version 48213 (0.0045) [2024-06-27 18:07:12,849][06909] Updated weights for policy 0, policy_version 48223 (0.0041) [2024-06-27 18:07:13,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43963.6, 300 sec: 43653.6). Total num frames: 790118400. Throughput: 0: 43674.1. Samples: 692993140. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-27 18:07:13,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:07:16,645][06909] Updated weights for policy 0, policy_version 48233 (0.0039) [2024-06-27 18:07:18,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43690.7, 300 sec: 43654.6). Total num frames: 790331392. Throughput: 0: 43756.9. Samples: 693256360. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-27 18:07:18,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:07:20,210][06909] Updated weights for policy 0, policy_version 48243 (0.0044) [2024-06-27 18:07:23,850][06674] Fps is (10 sec: 42599.3, 60 sec: 43963.8, 300 sec: 43709.2). Total num frames: 790544384. Throughput: 0: 43935.2. Samples: 693526880. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-27 18:07:23,850][06674] Avg episode reward: [(0, '0.395')] [2024-06-27 18:07:24,325][06909] Updated weights for policy 0, policy_version 48253 (0.0031) [2024-06-27 18:07:27,766][06909] Updated weights for policy 0, policy_version 48263 (0.0032) [2024-06-27 18:07:28,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.8, 300 sec: 43653.7). Total num frames: 790773760. Throughput: 0: 43819.1. Samples: 693652540. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 18:07:28,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:07:31,805][06909] Updated weights for policy 0, policy_version 48273 (0.0033) [2024-06-27 18:07:33,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 791003136. Throughput: 0: 43816.4. Samples: 693917920. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 18:07:33,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 18:07:35,303][06909] Updated weights for policy 0, policy_version 48283 (0.0045) [2024-06-27 18:07:38,852][06674] Fps is (10 sec: 40951.0, 60 sec: 43689.2, 300 sec: 43597.8). Total num frames: 791183360. Throughput: 0: 43829.4. Samples: 694185000. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 18:07:38,852][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:07:39,271][06909] Updated weights for policy 0, policy_version 48293 (0.0034) [2024-06-27 18:07:39,686][06887] Signal inference workers to stop experience collection... (9950 times) [2024-06-27 18:07:39,688][06887] Signal inference workers to resume experience collection... (9950 times) [2024-06-27 18:07:39,715][06909] InferenceWorker_p0-w0: stopping experience collection (9950 times) [2024-06-27 18:07:39,715][06909] InferenceWorker_p0-w0: resuming experience collection (9950 times) [2024-06-27 18:07:42,613][06909] Updated weights for policy 0, policy_version 48303 (0.0037) [2024-06-27 18:07:43,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43690.6, 300 sec: 43653.7). Total num frames: 791429120. Throughput: 0: 43769.8. Samples: 694309120. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 18:07:43,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:07:46,927][06909] Updated weights for policy 0, policy_version 48313 (0.0037) [2024-06-27 18:07:48,850][06674] Fps is (10 sec: 47524.0, 60 sec: 43964.3, 300 sec: 43709.2). Total num frames: 791658496. Throughput: 0: 43745.8. Samples: 694566260. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 18:07:48,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:07:48,862][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000048319_791658496.pth... [2024-06-27 18:07:48,923][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000047679_781172736.pth [2024-06-27 18:07:50,546][06909] Updated weights for policy 0, policy_version 48323 (0.0035) [2024-06-27 18:07:53,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43417.6, 300 sec: 43598.2). Total num frames: 791838720. Throughput: 0: 43558.2. Samples: 694827640. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 18:07:53,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 18:07:54,532][06909] Updated weights for policy 0, policy_version 48333 (0.0034) [2024-06-27 18:07:57,916][06909] Updated weights for policy 0, policy_version 48343 (0.0032) [2024-06-27 18:07:58,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43693.4, 300 sec: 43653.6). Total num frames: 792084480. Throughput: 0: 43551.3. Samples: 694952940. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-27 18:07:58,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:08:01,918][06909] Updated weights for policy 0, policy_version 48353 (0.0033) [2024-06-27 18:08:03,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 792297472. Throughput: 0: 43638.6. Samples: 695220100. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-27 18:08:03,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:08:05,352][06909] Updated weights for policy 0, policy_version 48363 (0.0027) [2024-06-27 18:08:08,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 792494080. Throughput: 0: 43479.9. Samples: 695483480. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-27 18:08:08,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:08:09,599][06909] Updated weights for policy 0, policy_version 48373 (0.0030) [2024-06-27 18:08:12,786][06909] Updated weights for policy 0, policy_version 48383 (0.0030) [2024-06-27 18:08:13,856][06674] Fps is (10 sec: 44209.5, 60 sec: 43686.3, 300 sec: 43652.7). Total num frames: 792739840. Throughput: 0: 43474.0. Samples: 695609140. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-27 18:08:13,856][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:08:17,010][06909] Updated weights for policy 0, policy_version 48393 (0.0031) [2024-06-27 18:08:18,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 792952832. Throughput: 0: 43292.4. Samples: 695866080. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-27 18:08:18,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:08:20,257][06909] Updated weights for policy 0, policy_version 48403 (0.0039) [2024-06-27 18:08:23,850][06674] Fps is (10 sec: 40985.5, 60 sec: 43417.6, 300 sec: 43653.7). Total num frames: 793149440. Throughput: 0: 43163.4. Samples: 696127260. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-27 18:08:23,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:08:24,801][06909] Updated weights for policy 0, policy_version 48413 (0.0036) [2024-06-27 18:08:28,256][06909] Updated weights for policy 0, policy_version 48423 (0.0026) [2024-06-27 18:08:28,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 793395200. Throughput: 0: 43176.4. Samples: 696252060. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 18:08:28,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 18:08:32,376][06909] Updated weights for policy 0, policy_version 48433 (0.0027) [2024-06-27 18:08:33,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43417.5, 300 sec: 43654.1). Total num frames: 793608192. Throughput: 0: 43383.5. Samples: 696518520. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 18:08:33,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:08:35,659][06909] Updated weights for policy 0, policy_version 48443 (0.0037) [2024-06-27 18:08:38,852][06674] Fps is (10 sec: 39313.6, 60 sec: 43417.6, 300 sec: 43542.3). Total num frames: 793788416. Throughput: 0: 43358.8. Samples: 696778880. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 18:08:38,852][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:08:39,779][06909] Updated weights for policy 0, policy_version 48453 (0.0032) [2024-06-27 18:08:43,203][06909] Updated weights for policy 0, policy_version 48463 (0.0032) [2024-06-27 18:08:43,852][06674] Fps is (10 sec: 44227.4, 60 sec: 43689.1, 300 sec: 43708.9). Total num frames: 794050560. Throughput: 0: 43332.5. Samples: 696903000. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 18:08:43,852][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:08:47,153][06909] Updated weights for policy 0, policy_version 48473 (0.0025) [2024-06-27 18:08:48,850][06674] Fps is (10 sec: 45884.9, 60 sec: 43144.5, 300 sec: 43653.6). Total num frames: 794247168. Throughput: 0: 43293.3. Samples: 697168300. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 18:08:48,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:08:50,778][06909] Updated weights for policy 0, policy_version 48483 (0.0039) [2024-06-27 18:08:50,800][06887] Signal inference workers to stop experience collection... (10000 times) [2024-06-27 18:08:50,800][06887] Signal inference workers to resume experience collection... (10000 times) [2024-06-27 18:08:50,839][06909] InferenceWorker_p0-w0: stopping experience collection (10000 times) [2024-06-27 18:08:50,840][06909] InferenceWorker_p0-w0: resuming experience collection (10000 times) [2024-06-27 18:08:53,850][06674] Fps is (10 sec: 40967.5, 60 sec: 43690.4, 300 sec: 43598.1). Total num frames: 794460160. Throughput: 0: 43340.2. Samples: 697433800. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 18:08:53,851][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:08:54,766][06909] Updated weights for policy 0, policy_version 48493 (0.0036) [2024-06-27 18:08:58,033][06909] Updated weights for policy 0, policy_version 48503 (0.0033) [2024-06-27 18:08:58,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43417.6, 300 sec: 43653.6). Total num frames: 794689536. Throughput: 0: 43346.9. Samples: 697559480. Policy #0 lag: (min: 0.0, avg: 11.6, max: 21.0) [2024-06-27 18:08:58,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 18:09:02,283][06909] Updated weights for policy 0, policy_version 48513 (0.0025) [2024-06-27 18:09:03,850][06674] Fps is (10 sec: 44238.4, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 794902528. Throughput: 0: 43631.6. Samples: 697829500. Policy #0 lag: (min: 0.0, avg: 11.6, max: 21.0) [2024-06-27 18:09:03,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:09:05,589][06909] Updated weights for policy 0, policy_version 48523 (0.0042) [2024-06-27 18:09:08,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43417.6, 300 sec: 43542.6). Total num frames: 795099136. Throughput: 0: 43548.0. Samples: 698086920. Policy #0 lag: (min: 0.0, avg: 11.6, max: 21.0) [2024-06-27 18:09:08,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:09:09,716][06909] Updated weights for policy 0, policy_version 48533 (0.0046) [2024-06-27 18:09:13,259][06909] Updated weights for policy 0, policy_version 48543 (0.0029) [2024-06-27 18:09:13,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43422.1, 300 sec: 43653.7). Total num frames: 795344896. Throughput: 0: 43593.0. Samples: 698213740. Policy #0 lag: (min: 0.0, avg: 11.6, max: 21.0) [2024-06-27 18:09:13,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:09:17,300][06909] Updated weights for policy 0, policy_version 48553 (0.0032) [2024-06-27 18:09:18,852][06674] Fps is (10 sec: 47503.4, 60 sec: 43689.1, 300 sec: 43708.9). Total num frames: 795574272. Throughput: 0: 43533.1. Samples: 698477600. Policy #0 lag: (min: 0.0, avg: 11.6, max: 21.0) [2024-06-27 18:09:18,852][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 18:09:20,725][06909] Updated weights for policy 0, policy_version 48563 (0.0034) [2024-06-27 18:09:23,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 795770880. Throughput: 0: 43633.6. Samples: 698742300. Policy #0 lag: (min: 0.0, avg: 11.6, max: 21.0) [2024-06-27 18:09:23,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:09:24,828][06909] Updated weights for policy 0, policy_version 48573 (0.0047) [2024-06-27 18:09:28,313][06909] Updated weights for policy 0, policy_version 48583 (0.0040) [2024-06-27 18:09:28,850][06674] Fps is (10 sec: 40968.0, 60 sec: 43144.5, 300 sec: 43598.1). Total num frames: 795983872. Throughput: 0: 43635.7. Samples: 698866520. Policy #0 lag: (min: 0.0, avg: 11.6, max: 21.0) [2024-06-27 18:09:28,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:09:32,282][06909] Updated weights for policy 0, policy_version 48593 (0.0030) [2024-06-27 18:09:33,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 796213248. Throughput: 0: 43604.0. Samples: 699130480. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 18:09:33,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 18:09:35,620][06909] Updated weights for policy 0, policy_version 48603 (0.0034) [2024-06-27 18:09:38,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43692.2, 300 sec: 43542.6). Total num frames: 796409856. Throughput: 0: 43606.1. Samples: 699396060. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 18:09:38,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:09:39,710][06909] Updated weights for policy 0, policy_version 48613 (0.0022) [2024-06-27 18:09:43,281][06909] Updated weights for policy 0, policy_version 48623 (0.0030) [2024-06-27 18:09:43,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43146.1, 300 sec: 43598.1). Total num frames: 796639232. Throughput: 0: 43656.9. Samples: 699524040. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 18:09:43,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 18:09:47,484][06909] Updated weights for policy 0, policy_version 48633 (0.0034) [2024-06-27 18:09:48,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 796868608. Throughput: 0: 43624.4. Samples: 699792600. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 18:09:48,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 18:09:48,857][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000048637_796868608.pth... [2024-06-27 18:09:48,922][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000047998_786399232.pth [2024-06-27 18:09:50,645][06909] Updated weights for policy 0, policy_version 48643 (0.0041) [2024-06-27 18:09:53,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.9, 300 sec: 43598.1). Total num frames: 797081600. Throughput: 0: 43744.5. Samples: 700055420. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 18:09:53,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:09:54,690][06909] Updated weights for policy 0, policy_version 48653 (0.0032) [2024-06-27 18:09:58,430][06909] Updated weights for policy 0, policy_version 48663 (0.0043) [2024-06-27 18:09:58,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 797310976. Throughput: 0: 43674.1. Samples: 700179080. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 18:09:58,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:10:02,073][06909] Updated weights for policy 0, policy_version 48673 (0.0036) [2024-06-27 18:10:03,852][06674] Fps is (10 sec: 45865.3, 60 sec: 43962.2, 300 sec: 43764.4). Total num frames: 797540352. Throughput: 0: 43716.9. Samples: 700444860. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 18:10:03,852][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:10:05,805][06909] Updated weights for policy 0, policy_version 48683 (0.0030) [2024-06-27 18:10:08,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.8, 300 sec: 43598.1). Total num frames: 797736960. Throughput: 0: 43694.3. Samples: 700708540. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 18:10:08,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:10:09,545][06909] Updated weights for policy 0, policy_version 48693 (0.0034) [2024-06-27 18:10:13,525][06909] Updated weights for policy 0, policy_version 48703 (0.0045) [2024-06-27 18:10:13,850][06674] Fps is (10 sec: 44245.6, 60 sec: 43963.6, 300 sec: 43598.1). Total num frames: 797982720. Throughput: 0: 43815.2. Samples: 700838200. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 18:10:13,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:10:16,910][06909] Updated weights for policy 0, policy_version 48713 (0.0035) [2024-06-27 18:10:18,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43146.1, 300 sec: 43653.6). Total num frames: 798162944. Throughput: 0: 43675.6. Samples: 701095880. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 18:10:18,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:10:20,978][06909] Updated weights for policy 0, policy_version 48723 (0.0034) [2024-06-27 18:10:20,991][06887] Signal inference workers to stop experience collection... (10050 times) [2024-06-27 18:10:20,991][06887] Signal inference workers to resume experience collection... (10050 times) [2024-06-27 18:10:21,037][06909] InferenceWorker_p0-w0: stopping experience collection (10050 times) [2024-06-27 18:10:21,037][06909] InferenceWorker_p0-w0: resuming experience collection (10050 times) [2024-06-27 18:10:23,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 798392320. Throughput: 0: 43590.7. Samples: 701357640. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 18:10:23,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:10:24,694][06909] Updated weights for policy 0, policy_version 48733 (0.0034) [2024-06-27 18:10:28,444][06909] Updated weights for policy 0, policy_version 48743 (0.0047) [2024-06-27 18:10:28,850][06674] Fps is (10 sec: 45874.4, 60 sec: 43963.8, 300 sec: 43653.6). Total num frames: 798621696. Throughput: 0: 43705.2. Samples: 701490780. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 18:10:28,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:10:32,036][06909] Updated weights for policy 0, policy_version 48753 (0.0037) [2024-06-27 18:10:33,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43144.6, 300 sec: 43598.1). Total num frames: 798801920. Throughput: 0: 43421.4. Samples: 701746560. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-27 18:10:33,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:10:36,024][06909] Updated weights for policy 0, policy_version 48763 (0.0039) [2024-06-27 18:10:38,854][06674] Fps is (10 sec: 42579.7, 60 sec: 43960.4, 300 sec: 43597.4). Total num frames: 799047680. Throughput: 0: 43478.3. Samples: 702012140. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-27 18:10:38,855][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 18:10:39,771][06909] Updated weights for policy 0, policy_version 48773 (0.0032) [2024-06-27 18:10:43,850][06674] Fps is (10 sec: 44235.7, 60 sec: 43417.5, 300 sec: 43542.5). Total num frames: 799244288. Throughput: 0: 43522.1. Samples: 702137580. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-27 18:10:43,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:10:43,870][06909] Updated weights for policy 0, policy_version 48783 (0.0028) [2024-06-27 18:10:47,201][06909] Updated weights for policy 0, policy_version 48793 (0.0026) [2024-06-27 18:10:48,852][06674] Fps is (10 sec: 44247.0, 60 sec: 43689.1, 300 sec: 43653.3). Total num frames: 799490048. Throughput: 0: 43362.6. Samples: 702396180. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-27 18:10:48,853][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:10:51,146][06909] Updated weights for policy 0, policy_version 48803 (0.0021) [2024-06-27 18:10:53,850][06674] Fps is (10 sec: 42599.5, 60 sec: 43144.5, 300 sec: 43487.0). Total num frames: 799670272. Throughput: 0: 43434.2. Samples: 702663080. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-27 18:10:53,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:10:54,522][06909] Updated weights for policy 0, policy_version 48813 (0.0032) [2024-06-27 18:10:58,466][06909] Updated weights for policy 0, policy_version 48823 (0.0027) [2024-06-27 18:10:58,850][06674] Fps is (10 sec: 42607.7, 60 sec: 43417.6, 300 sec: 43542.6). Total num frames: 799916032. Throughput: 0: 43449.4. Samples: 702793420. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-27 18:10:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 18:11:02,087][06909] Updated weights for policy 0, policy_version 48833 (0.0031) [2024-06-27 18:11:03,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43146.1, 300 sec: 43653.7). Total num frames: 800129024. Throughput: 0: 43610.7. Samples: 703058360. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-27 18:11:03,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:11:05,767][06909] Updated weights for policy 0, policy_version 48843 (0.0038) [2024-06-27 18:11:08,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43690.5, 300 sec: 43653.6). Total num frames: 800358400. Throughput: 0: 43679.4. Samples: 703323220. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-27 18:11:08,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:11:09,511][06909] Updated weights for policy 0, policy_version 48853 (0.0036) [2024-06-27 18:11:13,442][06909] Updated weights for policy 0, policy_version 48863 (0.0029) [2024-06-27 18:11:13,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43417.6, 300 sec: 43653.6). Total num frames: 800587776. Throughput: 0: 43546.3. Samples: 703450360. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-27 18:11:13,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:11:16,978][06909] Updated weights for policy 0, policy_version 48873 (0.0026) [2024-06-27 18:11:18,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 800784384. Throughput: 0: 43645.3. Samples: 703710600. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-27 18:11:18,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:11:20,985][06909] Updated weights for policy 0, policy_version 48883 (0.0032) [2024-06-27 18:11:23,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 801013760. Throughput: 0: 43696.8. Samples: 703978300. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-27 18:11:23,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:11:24,587][06909] Updated weights for policy 0, policy_version 48893 (0.0032) [2024-06-27 18:11:28,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43144.7, 300 sec: 43542.6). Total num frames: 801210368. Throughput: 0: 43746.9. Samples: 704106180. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-27 18:11:28,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:11:28,989][06909] Updated weights for policy 0, policy_version 48903 (0.0038) [2024-06-27 18:11:30,962][06887] Signal inference workers to stop experience collection... (10100 times) [2024-06-27 18:11:31,015][06887] Signal inference workers to resume experience collection... (10100 times) [2024-06-27 18:11:31,016][06909] InferenceWorker_p0-w0: stopping experience collection (10100 times) [2024-06-27 18:11:31,031][06909] InferenceWorker_p0-w0: resuming experience collection (10100 times) [2024-06-27 18:11:31,905][06909] Updated weights for policy 0, policy_version 48913 (0.0032) [2024-06-27 18:11:33,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.8, 300 sec: 43709.2). Total num frames: 801456128. Throughput: 0: 43752.9. Samples: 704364960. Policy #0 lag: (min: 0.0, avg: 11.5, max: 27.0) [2024-06-27 18:11:33,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 18:11:36,327][06909] Updated weights for policy 0, policy_version 48923 (0.0048) [2024-06-27 18:11:38,850][06674] Fps is (10 sec: 47513.3, 60 sec: 43967.0, 300 sec: 43653.6). Total num frames: 801685504. Throughput: 0: 43934.1. Samples: 704640120. Policy #0 lag: (min: 0.0, avg: 11.5, max: 27.0) [2024-06-27 18:11:38,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 18:11:39,142][06909] Updated weights for policy 0, policy_version 48933 (0.0037) [2024-06-27 18:11:43,628][06909] Updated weights for policy 0, policy_version 48943 (0.0035) [2024-06-27 18:11:43,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.9, 300 sec: 43653.8). Total num frames: 801898496. Throughput: 0: 43868.9. Samples: 704767520. Policy #0 lag: (min: 0.0, avg: 11.5, max: 27.0) [2024-06-27 18:11:43,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:11:46,556][06909] Updated weights for policy 0, policy_version 48953 (0.0035) [2024-06-27 18:11:48,852][06674] Fps is (10 sec: 42589.3, 60 sec: 43690.7, 300 sec: 43653.3). Total num frames: 802111488. Throughput: 0: 43875.2. Samples: 705032840. Policy #0 lag: (min: 0.0, avg: 11.5, max: 27.0) [2024-06-27 18:11:48,852][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:11:48,860][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000048957_802111488.pth... [2024-06-27 18:11:48,910][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000048319_791658496.pth [2024-06-27 18:11:50,981][06909] Updated weights for policy 0, policy_version 48963 (0.0038) [2024-06-27 18:11:53,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44509.8, 300 sec: 43654.2). Total num frames: 802340864. Throughput: 0: 43855.2. Samples: 705296700. Policy #0 lag: (min: 0.0, avg: 11.5, max: 27.0) [2024-06-27 18:11:53,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:11:54,062][06909] Updated weights for policy 0, policy_version 48973 (0.0023) [2024-06-27 18:11:58,850][06674] Fps is (10 sec: 40968.1, 60 sec: 43417.5, 300 sec: 43542.5). Total num frames: 802521088. Throughput: 0: 43853.7. Samples: 705423780. Policy #0 lag: (min: 0.0, avg: 11.5, max: 27.0) [2024-06-27 18:11:58,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:11:58,863][06909] Updated weights for policy 0, policy_version 48983 (0.0037) [2024-06-27 18:12:01,635][06909] Updated weights for policy 0, policy_version 48993 (0.0031) [2024-06-27 18:12:03,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44236.8, 300 sec: 43764.7). Total num frames: 802783232. Throughput: 0: 43768.9. Samples: 705680200. Policy #0 lag: (min: 0.0, avg: 11.6, max: 21.0) [2024-06-27 18:12:03,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:12:06,815][06909] Updated weights for policy 0, policy_version 49003 (0.0036) [2024-06-27 18:12:08,850][06674] Fps is (10 sec: 47514.4, 60 sec: 43963.8, 300 sec: 43653.7). Total num frames: 802996224. Throughput: 0: 43674.7. Samples: 705943660. Policy #0 lag: (min: 0.0, avg: 11.6, max: 21.0) [2024-06-27 18:12:08,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:12:09,295][06909] Updated weights for policy 0, policy_version 49013 (0.0027) [2024-06-27 18:12:13,850][06674] Fps is (10 sec: 39321.2, 60 sec: 43144.6, 300 sec: 43542.5). Total num frames: 803176448. Throughput: 0: 43653.3. Samples: 706070580. Policy #0 lag: (min: 0.0, avg: 11.6, max: 21.0) [2024-06-27 18:12:13,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:12:14,217][06909] Updated weights for policy 0, policy_version 49023 (0.0031) [2024-06-27 18:12:16,700][06909] Updated weights for policy 0, policy_version 49033 (0.0028) [2024-06-27 18:12:18,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43963.7, 300 sec: 43653.6). Total num frames: 803422208. Throughput: 0: 43848.7. Samples: 706338160. Policy #0 lag: (min: 0.0, avg: 11.6, max: 21.0) [2024-06-27 18:12:18,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:12:21,460][06909] Updated weights for policy 0, policy_version 49043 (0.0036) [2024-06-27 18:12:23,850][06674] Fps is (10 sec: 47513.9, 60 sec: 43963.8, 300 sec: 43653.6). Total num frames: 803651584. Throughput: 0: 43641.4. Samples: 706603980. Policy #0 lag: (min: 0.0, avg: 11.6, max: 21.0) [2024-06-27 18:12:23,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:12:24,350][06909] Updated weights for policy 0, policy_version 49053 (0.0027) [2024-06-27 18:12:28,783][06909] Updated weights for policy 0, policy_version 49063 (0.0030) [2024-06-27 18:12:28,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.7, 300 sec: 43542.5). Total num frames: 803848192. Throughput: 0: 43784.9. Samples: 706737840. Policy #0 lag: (min: 0.0, avg: 11.6, max: 21.0) [2024-06-27 18:12:28,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:12:31,777][06909] Updated weights for policy 0, policy_version 49073 (0.0033) [2024-06-27 18:12:33,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43690.6, 300 sec: 43709.5). Total num frames: 804077568. Throughput: 0: 43530.4. Samples: 706991620. Policy #0 lag: (min: 0.0, avg: 11.7, max: 21.0) [2024-06-27 18:12:33,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:12:36,522][06909] Updated weights for policy 0, policy_version 49083 (0.0034) [2024-06-27 18:12:38,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 804290560. Throughput: 0: 43562.7. Samples: 707257020. Policy #0 lag: (min: 0.0, avg: 11.7, max: 21.0) [2024-06-27 18:12:38,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:12:39,397][06909] Updated weights for policy 0, policy_version 49093 (0.0026) [2024-06-27 18:12:43,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43417.6, 300 sec: 43542.6). Total num frames: 804503552. Throughput: 0: 43596.2. Samples: 707385600. Policy #0 lag: (min: 0.0, avg: 11.7, max: 21.0) [2024-06-27 18:12:43,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:12:43,854][06909] Updated weights for policy 0, policy_version 49103 (0.0022) [2024-06-27 18:12:45,731][06887] Signal inference workers to stop experience collection... (10150 times) [2024-06-27 18:12:45,766][06909] InferenceWorker_p0-w0: stopping experience collection (10150 times) [2024-06-27 18:12:45,791][06887] Signal inference workers to resume experience collection... (10150 times) [2024-06-27 18:12:45,796][06909] InferenceWorker_p0-w0: resuming experience collection (10150 times) [2024-06-27 18:12:46,855][06909] Updated weights for policy 0, policy_version 49113 (0.0026) [2024-06-27 18:12:48,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43692.2, 300 sec: 43709.2). Total num frames: 804732928. Throughput: 0: 43567.0. Samples: 707640720. Policy #0 lag: (min: 0.0, avg: 11.7, max: 21.0) [2024-06-27 18:12:48,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:12:51,201][06909] Updated weights for policy 0, policy_version 49123 (0.0033) [2024-06-27 18:12:53,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 804945920. Throughput: 0: 43651.2. Samples: 707907960. Policy #0 lag: (min: 0.0, avg: 11.7, max: 21.0) [2024-06-27 18:12:53,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:12:54,727][06909] Updated weights for policy 0, policy_version 49133 (0.0036) [2024-06-27 18:12:58,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43690.8, 300 sec: 43542.6). Total num frames: 805142528. Throughput: 0: 43556.5. Samples: 708030620. Policy #0 lag: (min: 0.0, avg: 11.7, max: 21.0) [2024-06-27 18:12:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 18:12:59,135][06909] Updated weights for policy 0, policy_version 49143 (0.0040) [2024-06-27 18:13:02,151][06909] Updated weights for policy 0, policy_version 49153 (0.0030) [2024-06-27 18:13:03,852][06674] Fps is (10 sec: 44227.2, 60 sec: 43416.0, 300 sec: 43708.9). Total num frames: 805388288. Throughput: 0: 43546.9. Samples: 708297860. Policy #0 lag: (min: 0.0, avg: 11.7, max: 21.0) [2024-06-27 18:13:03,853][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 18:13:06,439][06909] Updated weights for policy 0, policy_version 49163 (0.0036) [2024-06-27 18:13:08,850][06674] Fps is (10 sec: 47513.0, 60 sec: 43690.6, 300 sec: 43654.5). Total num frames: 805617664. Throughput: 0: 43493.6. Samples: 708561200. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 18:13:08,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:13:09,494][06909] Updated weights for policy 0, policy_version 49173 (0.0030) [2024-06-27 18:13:13,803][06909] Updated weights for policy 0, policy_version 49183 (0.0036) [2024-06-27 18:13:13,850][06674] Fps is (10 sec: 42607.4, 60 sec: 43963.7, 300 sec: 43598.1). Total num frames: 805814272. Throughput: 0: 43429.4. Samples: 708692160. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 18:13:13,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:13:17,105][06909] Updated weights for policy 0, policy_version 49193 (0.0033) [2024-06-27 18:13:18,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 806043648. Throughput: 0: 43635.6. Samples: 708955220. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 18:13:18,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:13:21,473][06909] Updated weights for policy 0, policy_version 49203 (0.0033) [2024-06-27 18:13:23,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 806256640. Throughput: 0: 43548.0. Samples: 709216680. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 18:13:23,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:13:24,622][06909] Updated weights for policy 0, policy_version 49213 (0.0038) [2024-06-27 18:13:28,759][06909] Updated weights for policy 0, policy_version 49223 (0.0033) [2024-06-27 18:13:28,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 806469632. Throughput: 0: 43501.7. Samples: 709343180. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 18:13:28,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:13:32,190][06909] Updated weights for policy 0, policy_version 49233 (0.0032) [2024-06-27 18:13:33,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43690.6, 300 sec: 43765.0). Total num frames: 806699008. Throughput: 0: 43738.5. Samples: 709608960. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 18:13:33,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:13:36,340][06909] Updated weights for policy 0, policy_version 49243 (0.0031) [2024-06-27 18:13:38,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.7, 300 sec: 43654.0). Total num frames: 806928384. Throughput: 0: 43783.1. Samples: 709878200. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 18:13:38,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 18:13:39,806][06909] Updated weights for policy 0, policy_version 49253 (0.0036) [2024-06-27 18:13:43,677][06909] Updated weights for policy 0, policy_version 49263 (0.0031) [2024-06-27 18:13:43,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 807124992. Throughput: 0: 43903.4. Samples: 710006280. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 18:13:43,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:13:47,095][06909] Updated weights for policy 0, policy_version 49273 (0.0026) [2024-06-27 18:13:48,856][06674] Fps is (10 sec: 42572.1, 60 sec: 43686.2, 300 sec: 43708.3). Total num frames: 807354368. Throughput: 0: 43761.9. Samples: 710267320. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 18:13:48,857][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:13:48,869][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000049277_807354368.pth... [2024-06-27 18:13:48,935][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000048637_796868608.pth [2024-06-27 18:13:51,007][06909] Updated weights for policy 0, policy_version 49283 (0.0027) [2024-06-27 18:13:53,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 807583744. Throughput: 0: 43792.0. Samples: 710531840. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 18:13:53,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:13:54,423][06909] Updated weights for policy 0, policy_version 49293 (0.0025) [2024-06-27 18:13:58,812][06909] Updated weights for policy 0, policy_version 49303 (0.0041) [2024-06-27 18:13:58,850][06674] Fps is (10 sec: 42624.4, 60 sec: 43963.7, 300 sec: 43653.6). Total num frames: 807780352. Throughput: 0: 43889.3. Samples: 710667180. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 18:13:58,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:13:59,321][06887] Signal inference workers to stop experience collection... (10200 times) [2024-06-27 18:13:59,328][06887] Signal inference workers to resume experience collection... (10200 times) [2024-06-27 18:13:59,334][06909] InferenceWorker_p0-w0: stopping experience collection (10200 times) [2024-06-27 18:13:59,354][06909] InferenceWorker_p0-w0: resuming experience collection (10200 times) [2024-06-27 18:14:01,845][06909] Updated weights for policy 0, policy_version 49313 (0.0032) [2024-06-27 18:14:03,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43692.2, 300 sec: 43764.7). Total num frames: 808009728. Throughput: 0: 43729.7. Samples: 710923060. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 18:14:03,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:14:06,144][06909] Updated weights for policy 0, policy_version 49323 (0.0028) [2024-06-27 18:14:08,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43417.6, 300 sec: 43653.6). Total num frames: 808222720. Throughput: 0: 43826.6. Samples: 711188880. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-27 18:14:08,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:14:09,485][06909] Updated weights for policy 0, policy_version 49333 (0.0024) [2024-06-27 18:14:13,489][06909] Updated weights for policy 0, policy_version 49343 (0.0034) [2024-06-27 18:14:13,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.6, 300 sec: 43598.4). Total num frames: 808435712. Throughput: 0: 43840.9. Samples: 711316020. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-27 18:14:13,852][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:14:17,324][06909] Updated weights for policy 0, policy_version 49353 (0.0033) [2024-06-27 18:14:18,856][06674] Fps is (10 sec: 44210.1, 60 sec: 43686.2, 300 sec: 43708.3). Total num frames: 808665088. Throughput: 0: 43666.2. Samples: 711574200. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-27 18:14:18,857][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:14:21,315][06909] Updated weights for policy 0, policy_version 49363 (0.0035) [2024-06-27 18:14:23,852][06674] Fps is (10 sec: 44227.2, 60 sec: 43689.1, 300 sec: 43708.9). Total num frames: 808878080. Throughput: 0: 43584.5. Samples: 711839600. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-27 18:14:23,852][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:14:24,801][06909] Updated weights for policy 0, policy_version 49373 (0.0023) [2024-06-27 18:14:28,850][06674] Fps is (10 sec: 40985.1, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 809074688. Throughput: 0: 43568.1. Samples: 711966840. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-27 18:14:28,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:14:29,111][06909] Updated weights for policy 0, policy_version 49383 (0.0043) [2024-06-27 18:14:32,096][06909] Updated weights for policy 0, policy_version 49393 (0.0033) [2024-06-27 18:14:33,850][06674] Fps is (10 sec: 44246.2, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 809320448. Throughput: 0: 43597.9. Samples: 712228960. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-27 18:14:33,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:14:36,424][06909] Updated weights for policy 0, policy_version 49403 (0.0036) [2024-06-27 18:14:38,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43417.5, 300 sec: 43709.2). Total num frames: 809533440. Throughput: 0: 43637.7. Samples: 712495540. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-27 18:14:38,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:14:39,448][06909] Updated weights for policy 0, policy_version 49413 (0.0036) [2024-06-27 18:14:43,851][06674] Fps is (10 sec: 40955.0, 60 sec: 43416.7, 300 sec: 43597.9). Total num frames: 809730048. Throughput: 0: 43303.2. Samples: 712615880. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-27 18:14:43,852][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:14:44,077][06909] Updated weights for policy 0, policy_version 49423 (0.0034) [2024-06-27 18:14:47,214][06909] Updated weights for policy 0, policy_version 49433 (0.0034) [2024-06-27 18:14:48,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43695.1, 300 sec: 43709.2). Total num frames: 809975808. Throughput: 0: 43409.7. Samples: 712876500. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-27 18:14:48,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:14:51,382][06909] Updated weights for policy 0, policy_version 49443 (0.0038) [2024-06-27 18:14:53,850][06674] Fps is (10 sec: 45881.5, 60 sec: 43417.7, 300 sec: 43653.6). Total num frames: 810188800. Throughput: 0: 43417.0. Samples: 713142640. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-27 18:14:53,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:14:54,742][06909] Updated weights for policy 0, policy_version 49453 (0.0030) [2024-06-27 18:14:58,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43417.6, 300 sec: 43542.9). Total num frames: 810385408. Throughput: 0: 43446.2. Samples: 713271100. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-27 18:14:58,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:14:59,039][06909] Updated weights for policy 0, policy_version 49463 (0.0039) [2024-06-27 18:15:02,457][06909] Updated weights for policy 0, policy_version 49473 (0.0043) [2024-06-27 18:15:03,856][06674] Fps is (10 sec: 44209.6, 60 sec: 43686.3, 300 sec: 43708.3). Total num frames: 810631168. Throughput: 0: 43514.3. Samples: 713532340. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-27 18:15:03,856][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:15:06,355][06909] Updated weights for policy 0, policy_version 49483 (0.0030) [2024-06-27 18:15:08,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43417.7, 300 sec: 43542.6). Total num frames: 810827776. Throughput: 0: 43471.5. Samples: 713795720. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-27 18:15:08,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 18:15:10,194][06909] Updated weights for policy 0, policy_version 49493 (0.0038) [2024-06-27 18:15:13,850][06674] Fps is (10 sec: 40984.7, 60 sec: 43417.6, 300 sec: 43653.6). Total num frames: 811040768. Throughput: 0: 43559.0. Samples: 713927000. Policy #0 lag: (min: 1.0, avg: 11.2, max: 22.0) [2024-06-27 18:15:13,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:15:13,903][06909] Updated weights for policy 0, policy_version 49503 (0.0028) [2024-06-27 18:15:17,470][06909] Updated weights for policy 0, policy_version 49513 (0.0030) [2024-06-27 18:15:18,853][06674] Fps is (10 sec: 44223.6, 60 sec: 43419.9, 300 sec: 43653.2). Total num frames: 811270144. Throughput: 0: 43627.5. Samples: 714192320. Policy #0 lag: (min: 1.0, avg: 11.2, max: 22.0) [2024-06-27 18:15:18,853][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 18:15:21,715][06909] Updated weights for policy 0, policy_version 49523 (0.0026) [2024-06-27 18:15:23,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43692.3, 300 sec: 43653.7). Total num frames: 811499520. Throughput: 0: 43478.4. Samples: 714452060. Policy #0 lag: (min: 1.0, avg: 11.2, max: 22.0) [2024-06-27 18:15:23,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 18:15:24,814][06909] Updated weights for policy 0, policy_version 49533 (0.0040) [2024-06-27 18:15:28,850][06674] Fps is (10 sec: 42611.0, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 811696128. Throughput: 0: 43624.4. Samples: 714578920. Policy #0 lag: (min: 1.0, avg: 11.2, max: 22.0) [2024-06-27 18:15:28,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 18:15:29,056][06909] Updated weights for policy 0, policy_version 49543 (0.0037) [2024-06-27 18:15:31,794][06887] Signal inference workers to stop experience collection... (10250 times) [2024-06-27 18:15:31,794][06887] Signal inference workers to resume experience collection... (10250 times) [2024-06-27 18:15:31,838][06909] InferenceWorker_p0-w0: stopping experience collection (10250 times) [2024-06-27 18:15:31,838][06909] InferenceWorker_p0-w0: resuming experience collection (10250 times) [2024-06-27 18:15:32,638][06909] Updated weights for policy 0, policy_version 49553 (0.0036) [2024-06-27 18:15:33,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43417.6, 300 sec: 43654.3). Total num frames: 811925504. Throughput: 0: 43730.7. Samples: 714844380. Policy #0 lag: (min: 1.0, avg: 11.2, max: 22.0) [2024-06-27 18:15:33,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:15:36,686][06909] Updated weights for policy 0, policy_version 49563 (0.0033) [2024-06-27 18:15:38,852][06674] Fps is (10 sec: 44227.6, 60 sec: 43416.2, 300 sec: 43708.9). Total num frames: 812138496. Throughput: 0: 43469.5. Samples: 715098860. Policy #0 lag: (min: 1.0, avg: 11.2, max: 22.0) [2024-06-27 18:15:38,852][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:15:40,070][06909] Updated weights for policy 0, policy_version 49573 (0.0027) [2024-06-27 18:15:43,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43691.6, 300 sec: 43598.4). Total num frames: 812351488. Throughput: 0: 43539.1. Samples: 715230360. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-27 18:15:43,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:15:44,138][06909] Updated weights for policy 0, policy_version 49583 (0.0024) [2024-06-27 18:15:47,458][06909] Updated weights for policy 0, policy_version 49593 (0.0036) [2024-06-27 18:15:48,850][06674] Fps is (10 sec: 44245.3, 60 sec: 43417.6, 300 sec: 43764.7). Total num frames: 812580864. Throughput: 0: 43634.2. Samples: 715495620. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-27 18:15:48,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:15:48,989][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000049597_812597248.pth... [2024-06-27 18:15:49,034][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000048957_802111488.pth [2024-06-27 18:15:51,676][06909] Updated weights for policy 0, policy_version 49603 (0.0034) [2024-06-27 18:15:53,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 812810240. Throughput: 0: 43575.9. Samples: 715756640. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-27 18:15:53,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 18:15:55,154][06909] Updated weights for policy 0, policy_version 49613 (0.0030) [2024-06-27 18:15:58,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 813006848. Throughput: 0: 43661.7. Samples: 715891780. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-27 18:15:58,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:15:59,077][06909] Updated weights for policy 0, policy_version 49623 (0.0036) [2024-06-27 18:16:02,599][06909] Updated weights for policy 0, policy_version 49633 (0.0046) [2024-06-27 18:16:03,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43149.0, 300 sec: 43598.1). Total num frames: 813219840. Throughput: 0: 43378.9. Samples: 716144240. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-27 18:16:03,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:16:06,570][06909] Updated weights for policy 0, policy_version 49643 (0.0025) [2024-06-27 18:16:08,854][06674] Fps is (10 sec: 45855.1, 60 sec: 43960.4, 300 sec: 43653.0). Total num frames: 813465600. Throughput: 0: 43444.5. Samples: 716407260. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-27 18:16:08,855][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:16:10,054][06909] Updated weights for policy 0, policy_version 49653 (0.0042) [2024-06-27 18:16:13,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 813662208. Throughput: 0: 43582.1. Samples: 716540120. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 18:16:13,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:16:14,010][06909] Updated weights for policy 0, policy_version 49663 (0.0025) [2024-06-27 18:16:17,901][06909] Updated weights for policy 0, policy_version 49673 (0.0028) [2024-06-27 18:16:18,850][06674] Fps is (10 sec: 40978.3, 60 sec: 43419.7, 300 sec: 43598.1). Total num frames: 813875200. Throughput: 0: 43506.7. Samples: 716802180. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 18:16:18,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:16:21,858][06909] Updated weights for policy 0, policy_version 49683 (0.0038) [2024-06-27 18:16:23,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 814104576. Throughput: 0: 43510.9. Samples: 717056760. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 18:16:23,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:16:25,324][06909] Updated weights for policy 0, policy_version 49693 (0.0033) [2024-06-27 18:16:28,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 814317568. Throughput: 0: 43612.9. Samples: 717192940. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 18:16:28,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:16:29,151][06909] Updated weights for policy 0, policy_version 49703 (0.0032) [2024-06-27 18:16:32,714][06909] Updated weights for policy 0, policy_version 49713 (0.0025) [2024-06-27 18:16:33,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43417.7, 300 sec: 43542.6). Total num frames: 814530560. Throughput: 0: 43589.1. Samples: 717457120. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 18:16:33,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:16:36,668][06909] Updated weights for policy 0, policy_version 49723 (0.0046) [2024-06-27 18:16:38,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43692.2, 300 sec: 43598.1). Total num frames: 814759936. Throughput: 0: 43580.5. Samples: 717717760. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 18:16:38,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:16:40,068][06909] Updated weights for policy 0, policy_version 49733 (0.0023) [2024-06-27 18:16:43,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.7, 300 sec: 43598.4). Total num frames: 814972928. Throughput: 0: 43429.9. Samples: 717846120. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-27 18:16:43,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:16:44,360][06909] Updated weights for policy 0, policy_version 49743 (0.0030) [2024-06-27 18:16:47,677][06909] Updated weights for policy 0, policy_version 49753 (0.0037) [2024-06-27 18:16:48,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43417.7, 300 sec: 43542.6). Total num frames: 815185920. Throughput: 0: 43619.5. Samples: 718107120. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-27 18:16:48,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:16:51,850][06909] Updated weights for policy 0, policy_version 49763 (0.0043) [2024-06-27 18:16:53,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 815415296. Throughput: 0: 43647.5. Samples: 718371200. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-27 18:16:53,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:16:55,223][06909] Updated weights for policy 0, policy_version 49773 (0.0025) [2024-06-27 18:16:58,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43417.7, 300 sec: 43487.0). Total num frames: 815611904. Throughput: 0: 43662.8. Samples: 718504940. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-27 18:16:58,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 18:16:59,237][06909] Updated weights for policy 0, policy_version 49783 (0.0040) [2024-06-27 18:17:02,754][06909] Updated weights for policy 0, policy_version 49793 (0.0033) [2024-06-27 18:17:03,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.6, 300 sec: 43542.6). Total num frames: 815841280. Throughput: 0: 43559.6. Samples: 718762360. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-27 18:17:03,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:17:07,061][06909] Updated weights for policy 0, policy_version 49803 (0.0025) [2024-06-27 18:17:08,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43147.7, 300 sec: 43653.6). Total num frames: 816054272. Throughput: 0: 43747.1. Samples: 719025380. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-27 18:17:08,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 18:17:08,862][06887] Signal inference workers to stop experience collection... (10300 times) [2024-06-27 18:17:08,912][06909] InferenceWorker_p0-w0: stopping experience collection (10300 times) [2024-06-27 18:17:08,917][06887] Signal inference workers to resume experience collection... (10300 times) [2024-06-27 18:17:08,929][06909] InferenceWorker_p0-w0: resuming experience collection (10300 times) [2024-06-27 18:17:10,136][06909] Updated weights for policy 0, policy_version 49813 (0.0027) [2024-06-27 18:17:13,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 816283648. Throughput: 0: 43773.2. Samples: 719162740. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-27 18:17:13,850][06674] Avg episode reward: [(0, '0.395')] [2024-06-27 18:17:14,357][06909] Updated weights for policy 0, policy_version 49823 (0.0033) [2024-06-27 18:17:17,867][06909] Updated weights for policy 0, policy_version 49833 (0.0039) [2024-06-27 18:17:18,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.8, 300 sec: 43598.1). Total num frames: 816513024. Throughput: 0: 43693.3. Samples: 719423320. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-27 18:17:18,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:17:21,797][06909] Updated weights for policy 0, policy_version 49843 (0.0032) [2024-06-27 18:17:23,852][06674] Fps is (10 sec: 45866.0, 60 sec: 43962.2, 300 sec: 43708.9). Total num frames: 816742400. Throughput: 0: 43774.4. Samples: 719687700. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-27 18:17:23,853][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:17:25,190][06909] Updated weights for policy 0, policy_version 49853 (0.0026) [2024-06-27 18:17:28,851][06674] Fps is (10 sec: 42594.5, 60 sec: 43690.0, 300 sec: 43598.0). Total num frames: 816939008. Throughput: 0: 43768.8. Samples: 719815760. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-27 18:17:28,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:17:29,068][06909] Updated weights for policy 0, policy_version 49863 (0.0025) [2024-06-27 18:17:32,667][06909] Updated weights for policy 0, policy_version 49873 (0.0032) [2024-06-27 18:17:33,850][06674] Fps is (10 sec: 40968.7, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 817152000. Throughput: 0: 43835.1. Samples: 720079700. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-27 18:17:33,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:17:36,540][06909] Updated weights for policy 0, policy_version 49883 (0.0035) [2024-06-27 18:17:38,850][06674] Fps is (10 sec: 42602.2, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 817364992. Throughput: 0: 43748.0. Samples: 720339860. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-27 18:17:38,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:17:40,152][06909] Updated weights for policy 0, policy_version 49893 (0.0034) [2024-06-27 18:17:43,851][06674] Fps is (10 sec: 44230.0, 60 sec: 43689.5, 300 sec: 43597.9). Total num frames: 817594368. Throughput: 0: 43780.8. Samples: 720475140. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-27 18:17:43,852][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 18:17:44,113][06909] Updated weights for policy 0, policy_version 49903 (0.0041) [2024-06-27 18:17:47,724][06909] Updated weights for policy 0, policy_version 49913 (0.0029) [2024-06-27 18:17:48,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43690.5, 300 sec: 43598.1). Total num frames: 817807360. Throughput: 0: 43733.2. Samples: 720730360. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 18:17:48,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:17:48,856][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000049915_817807360.pth... [2024-06-27 18:17:48,912][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000049277_807354368.pth [2024-06-27 18:17:51,805][06909] Updated weights for policy 0, policy_version 49923 (0.0042) [2024-06-27 18:17:53,852][06674] Fps is (10 sec: 42596.2, 60 sec: 43416.2, 300 sec: 43653.3). Total num frames: 818020352. Throughput: 0: 43742.1. Samples: 720993860. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 18:17:53,852][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:17:55,311][06909] Updated weights for policy 0, policy_version 49933 (0.0025) [2024-06-27 18:17:58,852][06674] Fps is (10 sec: 44228.0, 60 sec: 43962.2, 300 sec: 43598.1). Total num frames: 818249728. Throughput: 0: 43637.6. Samples: 721126520. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 18:17:58,852][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:17:59,336][06909] Updated weights for policy 0, policy_version 49943 (0.0031) [2024-06-27 18:18:02,726][06909] Updated weights for policy 0, policy_version 49953 (0.0032) [2024-06-27 18:18:03,850][06674] Fps is (10 sec: 44245.8, 60 sec: 43690.7, 300 sec: 43542.6). Total num frames: 818462720. Throughput: 0: 43522.3. Samples: 721381820. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 18:18:03,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:18:06,967][06909] Updated weights for policy 0, policy_version 49963 (0.0035) [2024-06-27 18:18:08,850][06674] Fps is (10 sec: 42607.6, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 818675712. Throughput: 0: 43531.8. Samples: 721646540. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 18:18:08,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:18:10,476][06909] Updated weights for policy 0, policy_version 49973 (0.0039) [2024-06-27 18:18:13,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.8, 300 sec: 43598.1). Total num frames: 818905088. Throughput: 0: 43599.2. Samples: 721777680. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 18:18:13,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:18:14,432][06909] Updated weights for policy 0, policy_version 49983 (0.0028) [2024-06-27 18:18:18,016][06909] Updated weights for policy 0, policy_version 49993 (0.0031) [2024-06-27 18:18:18,856][06674] Fps is (10 sec: 44209.7, 60 sec: 43413.2, 300 sec: 43597.2). Total num frames: 819118080. Throughput: 0: 43425.2. Samples: 722034100. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-27 18:18:18,856][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:18:21,795][06909] Updated weights for policy 0, policy_version 50003 (0.0038) [2024-06-27 18:18:23,850][06674] Fps is (10 sec: 44236.0, 60 sec: 43419.0, 300 sec: 43653.6). Total num frames: 819347456. Throughput: 0: 43479.5. Samples: 722296440. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-27 18:18:23,851][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:18:25,532][06909] Updated weights for policy 0, policy_version 50013 (0.0027) [2024-06-27 18:18:28,222][06887] Signal inference workers to stop experience collection... (10350 times) [2024-06-27 18:18:28,248][06909] InferenceWorker_p0-w0: stopping experience collection (10350 times) [2024-06-27 18:18:28,282][06887] Signal inference workers to resume experience collection... (10350 times) [2024-06-27 18:18:28,283][06909] InferenceWorker_p0-w0: resuming experience collection (10350 times) [2024-06-27 18:18:28,856][06674] Fps is (10 sec: 44236.8, 60 sec: 43686.9, 300 sec: 43597.2). Total num frames: 819560448. Throughput: 0: 43505.3. Samples: 722433080. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-27 18:18:28,857][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:18:29,130][06909] Updated weights for policy 0, policy_version 50023 (0.0041) [2024-06-27 18:18:32,883][06909] Updated weights for policy 0, policy_version 50033 (0.0027) [2024-06-27 18:18:33,856][06674] Fps is (10 sec: 42573.1, 60 sec: 43686.2, 300 sec: 43541.7). Total num frames: 819773440. Throughput: 0: 43608.0. Samples: 722692980. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-27 18:18:33,856][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:18:36,410][06909] Updated weights for policy 0, policy_version 50043 (0.0035) [2024-06-27 18:18:38,850][06674] Fps is (10 sec: 42624.5, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 819986432. Throughput: 0: 43620.7. Samples: 722956700. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-27 18:18:38,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:18:40,693][06909] Updated weights for policy 0, policy_version 50053 (0.0032) [2024-06-27 18:18:43,850][06674] Fps is (10 sec: 44263.6, 60 sec: 43691.7, 300 sec: 43599.0). Total num frames: 820215808. Throughput: 0: 43598.0. Samples: 723088340. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-27 18:18:43,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:18:44,087][06909] Updated weights for policy 0, policy_version 50063 (0.0042) [2024-06-27 18:18:47,999][06909] Updated weights for policy 0, policy_version 50073 (0.0032) [2024-06-27 18:18:48,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43417.8, 300 sec: 43487.0). Total num frames: 820412416. Throughput: 0: 43744.5. Samples: 723350320. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-27 18:18:48,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:18:51,900][06909] Updated weights for policy 0, policy_version 50083 (0.0027) [2024-06-27 18:18:53,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43692.1, 300 sec: 43598.1). Total num frames: 820641792. Throughput: 0: 43691.9. Samples: 723612680. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-27 18:18:53,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:18:55,751][06909] Updated weights for policy 0, policy_version 50093 (0.0031) [2024-06-27 18:18:58,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43419.1, 300 sec: 43542.6). Total num frames: 820854784. Throughput: 0: 43694.1. Samples: 723743920. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-27 18:18:58,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:18:59,201][06909] Updated weights for policy 0, policy_version 50103 (0.0038) [2024-06-27 18:19:03,379][06909] Updated weights for policy 0, policy_version 50113 (0.0039) [2024-06-27 18:19:03,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43144.5, 300 sec: 43487.0). Total num frames: 821051392. Throughput: 0: 43713.1. Samples: 724000920. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-27 18:19:03,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:19:06,446][06909] Updated weights for policy 0, policy_version 50123 (0.0042) [2024-06-27 18:19:08,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43963.7, 300 sec: 43653.6). Total num frames: 821313536. Throughput: 0: 43684.2. Samples: 724262220. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-27 18:19:08,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:19:10,976][06909] Updated weights for policy 0, policy_version 50133 (0.0034) [2024-06-27 18:19:13,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43417.6, 300 sec: 43543.5). Total num frames: 821510144. Throughput: 0: 43749.5. Samples: 724401540. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-27 18:19:13,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:19:14,224][06909] Updated weights for policy 0, policy_version 50143 (0.0034) [2024-06-27 18:19:18,580][06909] Updated weights for policy 0, policy_version 50153 (0.0035) [2024-06-27 18:19:18,850][06674] Fps is (10 sec: 39320.6, 60 sec: 43148.8, 300 sec: 43487.3). Total num frames: 821706752. Throughput: 0: 43662.6. Samples: 724657540. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-27 18:19:18,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:19:21,457][06909] Updated weights for policy 0, policy_version 50163 (0.0024) [2024-06-27 18:19:23,856][06674] Fps is (10 sec: 44209.7, 60 sec: 43413.3, 300 sec: 43652.7). Total num frames: 821952512. Throughput: 0: 43646.1. Samples: 724921040. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 18:19:23,857][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:19:25,893][06909] Updated weights for policy 0, policy_version 50173 (0.0033) [2024-06-27 18:19:28,856][06674] Fps is (10 sec: 47485.5, 60 sec: 43690.6, 300 sec: 43597.2). Total num frames: 822181888. Throughput: 0: 43703.0. Samples: 725055240. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 18:19:28,857][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:19:29,229][06909] Updated weights for policy 0, policy_version 50183 (0.0027) [2024-06-27 18:19:33,237][06909] Updated weights for policy 0, policy_version 50193 (0.0032) [2024-06-27 18:19:33,850][06674] Fps is (10 sec: 44263.6, 60 sec: 43695.1, 300 sec: 43598.1). Total num frames: 822394880. Throughput: 0: 43677.2. Samples: 725315800. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 18:19:33,856][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:19:36,493][06909] Updated weights for policy 0, policy_version 50203 (0.0020) [2024-06-27 18:19:38,850][06674] Fps is (10 sec: 44264.1, 60 sec: 43963.7, 300 sec: 43709.4). Total num frames: 822624256. Throughput: 0: 43587.2. Samples: 725574100. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 18:19:38,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:19:40,877][06909] Updated weights for policy 0, policy_version 50213 (0.0033) [2024-06-27 18:19:43,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 822837248. Throughput: 0: 43704.8. Samples: 725710640. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 18:19:43,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:19:44,046][06909] Updated weights for policy 0, policy_version 50223 (0.0028) [2024-06-27 18:19:48,624][06909] Updated weights for policy 0, policy_version 50233 (0.0031) [2024-06-27 18:19:48,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.6, 300 sec: 43542.6). Total num frames: 823033856. Throughput: 0: 43693.8. Samples: 725967140. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 18:19:48,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:19:48,863][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000050234_823033856.pth... [2024-06-27 18:19:48,921][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000049597_812597248.pth [2024-06-27 18:19:51,422][06909] Updated weights for policy 0, policy_version 50243 (0.0029) [2024-06-27 18:19:53,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44236.9, 300 sec: 43764.7). Total num frames: 823296000. Throughput: 0: 43541.3. Samples: 726221580. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 18:19:53,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:19:56,140][06909] Updated weights for policy 0, policy_version 50253 (0.0027) [2024-06-27 18:19:58,159][06887] Signal inference workers to stop experience collection... (10400 times) [2024-06-27 18:19:58,195][06909] InferenceWorker_p0-w0: stopping experience collection (10400 times) [2024-06-27 18:19:58,220][06887] Signal inference workers to resume experience collection... (10400 times) [2024-06-27 18:19:58,221][06909] InferenceWorker_p0-w0: resuming experience collection (10400 times) [2024-06-27 18:19:58,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.7, 300 sec: 43599.0). Total num frames: 823492608. Throughput: 0: 43395.5. Samples: 726354340. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 18:19:58,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:19:59,251][06909] Updated weights for policy 0, policy_version 50263 (0.0024) [2024-06-27 18:20:03,455][06909] Updated weights for policy 0, policy_version 50273 (0.0028) [2024-06-27 18:20:03,850][06674] Fps is (10 sec: 37683.4, 60 sec: 43690.7, 300 sec: 43542.6). Total num frames: 823672832. Throughput: 0: 43567.4. Samples: 726618060. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 18:20:03,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:20:06,724][06909] Updated weights for policy 0, policy_version 50283 (0.0041) [2024-06-27 18:20:08,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43417.6, 300 sec: 43653.7). Total num frames: 823918592. Throughput: 0: 43376.1. Samples: 726872700. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 18:20:08,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:20:10,774][06909] Updated weights for policy 0, policy_version 50293 (0.0037) [2024-06-27 18:20:13,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43690.7, 300 sec: 43598.5). Total num frames: 824131584. Throughput: 0: 43477.1. Samples: 727011440. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 18:20:13,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:20:14,478][06909] Updated weights for policy 0, policy_version 50303 (0.0044) [2024-06-27 18:20:18,067][06909] Updated weights for policy 0, policy_version 50313 (0.0025) [2024-06-27 18:20:18,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43690.8, 300 sec: 43487.0). Total num frames: 824328192. Throughput: 0: 43355.1. Samples: 727266780. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 18:20:18,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:20:22,001][06909] Updated weights for policy 0, policy_version 50323 (0.0033) [2024-06-27 18:20:23,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43695.1, 300 sec: 43653.6). Total num frames: 824573952. Throughput: 0: 43283.1. Samples: 727521840. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 18:20:23,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:20:25,919][06909] Updated weights for policy 0, policy_version 50333 (0.0033) [2024-06-27 18:20:28,852][06674] Fps is (10 sec: 44227.4, 60 sec: 43147.4, 300 sec: 43542.3). Total num frames: 824770560. Throughput: 0: 43286.0. Samples: 727658600. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 18:20:28,853][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:20:29,515][06909] Updated weights for policy 0, policy_version 50343 (0.0039) [2024-06-27 18:20:33,702][06909] Updated weights for policy 0, policy_version 50353 (0.0038) [2024-06-27 18:20:33,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43144.5, 300 sec: 43542.9). Total num frames: 824983552. Throughput: 0: 43359.0. Samples: 727918300. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 18:20:33,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:20:36,800][06909] Updated weights for policy 0, policy_version 50363 (0.0037) [2024-06-27 18:20:38,850][06674] Fps is (10 sec: 45885.1, 60 sec: 43417.6, 300 sec: 43653.6). Total num frames: 825229312. Throughput: 0: 43553.8. Samples: 728181500. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 18:20:38,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:20:41,058][06909] Updated weights for policy 0, policy_version 50373 (0.0042) [2024-06-27 18:20:43,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43144.6, 300 sec: 43542.6). Total num frames: 825425920. Throughput: 0: 43557.8. Samples: 728314440. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 18:20:43,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:20:44,399][06909] Updated weights for policy 0, policy_version 50383 (0.0028) [2024-06-27 18:20:48,420][06909] Updated weights for policy 0, policy_version 50393 (0.0034) [2024-06-27 18:20:48,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43417.5, 300 sec: 43487.0). Total num frames: 825638912. Throughput: 0: 43392.3. Samples: 728570720. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 18:20:48,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:20:52,190][06909] Updated weights for policy 0, policy_version 50403 (0.0030) [2024-06-27 18:20:53,850][06674] Fps is (10 sec: 44236.7, 60 sec: 42871.4, 300 sec: 43598.1). Total num frames: 825868288. Throughput: 0: 43438.1. Samples: 728827420. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 18:20:53,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:20:55,827][06909] Updated weights for policy 0, policy_version 50413 (0.0033) [2024-06-27 18:20:58,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43144.5, 300 sec: 43598.1). Total num frames: 826081280. Throughput: 0: 43339.0. Samples: 728961700. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 18:20:58,851][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 18:20:59,953][06909] Updated weights for policy 0, policy_version 50423 (0.0031) [2024-06-27 18:21:03,542][06909] Updated weights for policy 0, policy_version 50433 (0.0034) [2024-06-27 18:21:03,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.6, 300 sec: 43487.7). Total num frames: 826294272. Throughput: 0: 43388.4. Samples: 729219260. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 18:21:03,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:21:07,247][06909] Updated weights for policy 0, policy_version 50443 (0.0026) [2024-06-27 18:21:08,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 826523648. Throughput: 0: 43466.7. Samples: 729477840. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 18:21:08,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:21:11,383][06909] Updated weights for policy 0, policy_version 50453 (0.0029) [2024-06-27 18:21:13,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43144.5, 300 sec: 43542.6). Total num frames: 826720256. Throughput: 0: 43395.0. Samples: 729611280. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 18:21:13,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 18:21:14,557][06909] Updated weights for policy 0, policy_version 50463 (0.0038) [2024-06-27 18:21:18,582][06909] Updated weights for policy 0, policy_version 50473 (0.0033) [2024-06-27 18:21:18,852][06674] Fps is (10 sec: 42589.2, 60 sec: 43689.1, 300 sec: 43542.3). Total num frames: 826949632. Throughput: 0: 43335.3. Samples: 729868480. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 18:21:18,853][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 18:21:22,246][06909] Updated weights for policy 0, policy_version 50483 (0.0031) [2024-06-27 18:21:23,573][06887] Signal inference workers to stop experience collection... (10450 times) [2024-06-27 18:21:23,574][06887] Signal inference workers to resume experience collection... (10450 times) [2024-06-27 18:21:23,613][06909] InferenceWorker_p0-w0: stopping experience collection (10450 times) [2024-06-27 18:21:23,613][06909] InferenceWorker_p0-w0: resuming experience collection (10450 times) [2024-06-27 18:21:23,852][06674] Fps is (10 sec: 45866.1, 60 sec: 43416.2, 300 sec: 43597.8). Total num frames: 827179008. Throughput: 0: 43522.1. Samples: 730140080. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 18:21:23,852][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:21:25,895][06909] Updated weights for policy 0, policy_version 50493 (0.0037) [2024-06-27 18:21:28,850][06674] Fps is (10 sec: 44246.3, 60 sec: 43692.3, 300 sec: 43598.1). Total num frames: 827392000. Throughput: 0: 43504.1. Samples: 730272120. Policy #0 lag: (min: 1.0, avg: 8.9, max: 20.0) [2024-06-27 18:21:28,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:21:30,296][06909] Updated weights for policy 0, policy_version 50503 (0.0036) [2024-06-27 18:21:33,188][06909] Updated weights for policy 0, policy_version 50513 (0.0034) [2024-06-27 18:21:33,850][06674] Fps is (10 sec: 42606.9, 60 sec: 43690.7, 300 sec: 43542.6). Total num frames: 827604992. Throughput: 0: 43562.3. Samples: 730531020. Policy #0 lag: (min: 1.0, avg: 8.9, max: 20.0) [2024-06-27 18:21:33,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:21:37,554][06909] Updated weights for policy 0, policy_version 50523 (0.0029) [2024-06-27 18:21:38,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43417.5, 300 sec: 43598.1). Total num frames: 827834368. Throughput: 0: 43770.2. Samples: 730797080. Policy #0 lag: (min: 1.0, avg: 8.9, max: 20.0) [2024-06-27 18:21:38,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:21:40,406][06909] Updated weights for policy 0, policy_version 50533 (0.0030) [2024-06-27 18:21:43,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 828047360. Throughput: 0: 43842.7. Samples: 730934620. Policy #0 lag: (min: 1.0, avg: 8.9, max: 20.0) [2024-06-27 18:21:43,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:21:44,817][06909] Updated weights for policy 0, policy_version 50543 (0.0026) [2024-06-27 18:21:48,357][06909] Updated weights for policy 0, policy_version 50553 (0.0026) [2024-06-27 18:21:48,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43690.6, 300 sec: 43542.5). Total num frames: 828260352. Throughput: 0: 43808.3. Samples: 731190640. Policy #0 lag: (min: 1.0, avg: 8.9, max: 20.0) [2024-06-27 18:21:48,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:21:48,857][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000050553_828260352.pth... [2024-06-27 18:21:48,911][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000049915_817807360.pth [2024-06-27 18:21:52,127][06909] Updated weights for policy 0, policy_version 50563 (0.0034) [2024-06-27 18:21:53,852][06674] Fps is (10 sec: 44227.8, 60 sec: 43689.2, 300 sec: 43653.3). Total num frames: 828489728. Throughput: 0: 43890.8. Samples: 731453020. Policy #0 lag: (min: 1.0, avg: 8.9, max: 20.0) [2024-06-27 18:21:53,852][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:21:56,330][06909] Updated weights for policy 0, policy_version 50573 (0.0041) [2024-06-27 18:21:58,850][06674] Fps is (10 sec: 44237.8, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 828702720. Throughput: 0: 43877.4. Samples: 731585760. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 18:21:58,856][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 18:22:00,032][06909] Updated weights for policy 0, policy_version 50583 (0.0039) [2024-06-27 18:22:03,638][06909] Updated weights for policy 0, policy_version 50593 (0.0027) [2024-06-27 18:22:03,850][06674] Fps is (10 sec: 42606.3, 60 sec: 43690.5, 300 sec: 43598.1). Total num frames: 828915712. Throughput: 0: 43953.0. Samples: 731846280. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 18:22:03,851][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:22:07,439][06909] Updated weights for policy 0, policy_version 50603 (0.0031) [2024-06-27 18:22:08,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 829145088. Throughput: 0: 43767.3. Samples: 732109520. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 18:22:08,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 18:22:11,001][06909] Updated weights for policy 0, policy_version 50613 (0.0042) [2024-06-27 18:22:13,850][06674] Fps is (10 sec: 45875.7, 60 sec: 44236.7, 300 sec: 43598.1). Total num frames: 829374464. Throughput: 0: 43728.3. Samples: 732239900. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 18:22:13,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:22:15,047][06909] Updated weights for policy 0, policy_version 50623 (0.0044) [2024-06-27 18:22:18,500][06909] Updated weights for policy 0, policy_version 50633 (0.0035) [2024-06-27 18:22:18,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43692.2, 300 sec: 43487.3). Total num frames: 829571072. Throughput: 0: 43780.0. Samples: 732501120. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 18:22:18,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 18:22:22,385][06909] Updated weights for policy 0, policy_version 50643 (0.0045) [2024-06-27 18:22:23,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43692.2, 300 sec: 43598.2). Total num frames: 829800448. Throughput: 0: 43721.9. Samples: 732764560. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 18:22:23,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:22:26,029][06909] Updated weights for policy 0, policy_version 50653 (0.0044) [2024-06-27 18:22:28,852][06674] Fps is (10 sec: 44227.7, 60 sec: 43689.1, 300 sec: 43597.8). Total num frames: 830013440. Throughput: 0: 43577.2. Samples: 732895680. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 18:22:28,861][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:22:29,690][06909] Updated weights for policy 0, policy_version 50663 (0.0032) [2024-06-27 18:22:33,551][06909] Updated weights for policy 0, policy_version 50673 (0.0042) [2024-06-27 18:22:33,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 830226432. Throughput: 0: 43581.9. Samples: 733151820. Policy #0 lag: (min: 1.0, avg: 10.8, max: 21.0) [2024-06-27 18:22:33,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 18:22:37,418][06909] Updated weights for policy 0, policy_version 50683 (0.0035) [2024-06-27 18:22:38,850][06674] Fps is (10 sec: 42606.7, 60 sec: 43417.6, 300 sec: 43542.8). Total num frames: 830439424. Throughput: 0: 43526.3. Samples: 733411620. Policy #0 lag: (min: 1.0, avg: 10.8, max: 21.0) [2024-06-27 18:22:38,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 18:22:41,095][06909] Updated weights for policy 0, policy_version 50693 (0.0038) [2024-06-27 18:22:43,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43417.7, 300 sec: 43542.6). Total num frames: 830652416. Throughput: 0: 43472.9. Samples: 733542040. Policy #0 lag: (min: 1.0, avg: 10.8, max: 21.0) [2024-06-27 18:22:43,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:22:44,743][06909] Updated weights for policy 0, policy_version 50703 (0.0035) [2024-06-27 18:22:48,379][06909] Updated weights for policy 0, policy_version 50713 (0.0036) [2024-06-27 18:22:48,856][06674] Fps is (10 sec: 44210.1, 60 sec: 43686.3, 300 sec: 43597.5). Total num frames: 830881792. Throughput: 0: 43478.7. Samples: 733803080. Policy #0 lag: (min: 1.0, avg: 10.8, max: 21.0) [2024-06-27 18:22:48,857][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 18:22:51,067][06887] Signal inference workers to stop experience collection... (10500 times) [2024-06-27 18:22:51,067][06887] Signal inference workers to resume experience collection... (10500 times) [2024-06-27 18:22:51,078][06909] InferenceWorker_p0-w0: stopping experience collection (10500 times) [2024-06-27 18:22:51,078][06909] InferenceWorker_p0-w0: resuming experience collection (10500 times) [2024-06-27 18:22:52,557][06909] Updated weights for policy 0, policy_version 50723 (0.0033) [2024-06-27 18:22:53,852][06674] Fps is (10 sec: 44227.3, 60 sec: 43417.6, 300 sec: 43542.6). Total num frames: 831094784. Throughput: 0: 43524.1. Samples: 734068200. Policy #0 lag: (min: 1.0, avg: 10.8, max: 21.0) [2024-06-27 18:22:53,853][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:22:56,233][06909] Updated weights for policy 0, policy_version 50733 (0.0044) [2024-06-27 18:22:58,850][06674] Fps is (10 sec: 44263.6, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 831324160. Throughput: 0: 43495.1. Samples: 734197180. Policy #0 lag: (min: 1.0, avg: 10.8, max: 21.0) [2024-06-27 18:22:58,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:22:59,934][06909] Updated weights for policy 0, policy_version 50743 (0.0032) [2024-06-27 18:23:03,597][06909] Updated weights for policy 0, policy_version 50753 (0.0029) [2024-06-27 18:23:03,852][06674] Fps is (10 sec: 44236.9, 60 sec: 43689.3, 300 sec: 43597.8). Total num frames: 831537152. Throughput: 0: 43450.0. Samples: 734456460. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 18:23:03,852][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:23:07,434][06909] Updated weights for policy 0, policy_version 50763 (0.0032) [2024-06-27 18:23:08,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43144.5, 300 sec: 43487.0). Total num frames: 831733760. Throughput: 0: 43316.3. Samples: 734713800. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 18:23:08,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:23:11,461][06909] Updated weights for policy 0, policy_version 50773 (0.0047) [2024-06-27 18:23:13,850][06674] Fps is (10 sec: 42606.9, 60 sec: 43144.5, 300 sec: 43543.4). Total num frames: 831963136. Throughput: 0: 43435.7. Samples: 734850200. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 18:23:13,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:23:15,076][06909] Updated weights for policy 0, policy_version 50783 (0.0041) [2024-06-27 18:23:18,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43417.5, 300 sec: 43487.0). Total num frames: 832176128. Throughput: 0: 43505.7. Samples: 735109580. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 18:23:18,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:23:18,924][06909] Updated weights for policy 0, policy_version 50793 (0.0027) [2024-06-27 18:23:22,334][06909] Updated weights for policy 0, policy_version 50803 (0.0030) [2024-06-27 18:23:23,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43144.5, 300 sec: 43487.9). Total num frames: 832389120. Throughput: 0: 43606.3. Samples: 735373900. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 18:23:23,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:23:26,260][06909] Updated weights for policy 0, policy_version 50813 (0.0036) [2024-06-27 18:23:28,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43146.0, 300 sec: 43487.9). Total num frames: 832602112. Throughput: 0: 43500.0. Samples: 735499540. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 18:23:28,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:23:29,959][06909] Updated weights for policy 0, policy_version 50823 (0.0035) [2024-06-27 18:23:33,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43417.7, 300 sec: 43542.6). Total num frames: 832831488. Throughput: 0: 43523.8. Samples: 735761380. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 18:23:33,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:23:34,039][06909] Updated weights for policy 0, policy_version 50833 (0.0039) [2024-06-27 18:23:37,476][06909] Updated weights for policy 0, policy_version 50843 (0.0037) [2024-06-27 18:23:38,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43690.7, 300 sec: 43542.6). Total num frames: 833060864. Throughput: 0: 43542.0. Samples: 736027500. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 18:23:38,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:23:41,417][06909] Updated weights for policy 0, policy_version 50853 (0.0029) [2024-06-27 18:23:43,856][06674] Fps is (10 sec: 44209.2, 60 sec: 43686.1, 300 sec: 43597.2). Total num frames: 833273856. Throughput: 0: 43611.0. Samples: 736159940. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 18:23:43,857][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:23:44,961][06909] Updated weights for policy 0, policy_version 50863 (0.0030) [2024-06-27 18:23:48,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43422.0, 300 sec: 43542.6). Total num frames: 833486848. Throughput: 0: 43590.9. Samples: 736417960. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 18:23:48,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:23:48,882][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000050873_833503232.pth... [2024-06-27 18:23:48,889][06909] Updated weights for policy 0, policy_version 50873 (0.0035) [2024-06-27 18:23:48,928][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000050234_823033856.pth [2024-06-27 18:23:52,381][06909] Updated weights for policy 0, policy_version 50883 (0.0039) [2024-06-27 18:23:53,850][06674] Fps is (10 sec: 45903.8, 60 sec: 43965.3, 300 sec: 43653.7). Total num frames: 833732608. Throughput: 0: 43861.9. Samples: 736687580. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 18:23:53,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:23:56,169][06909] Updated weights for policy 0, policy_version 50893 (0.0051) [2024-06-27 18:23:58,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43690.8, 300 sec: 43709.2). Total num frames: 833945600. Throughput: 0: 43736.6. Samples: 736818340. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 18:23:58,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:23:59,958][06909] Updated weights for policy 0, policy_version 50903 (0.0032) [2024-06-27 18:24:03,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43419.1, 300 sec: 43487.0). Total num frames: 834142208. Throughput: 0: 43632.6. Samples: 737073040. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 18:24:03,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:24:04,003][06909] Updated weights for policy 0, policy_version 50913 (0.0038) [2024-06-27 18:24:07,184][06887] Signal inference workers to stop experience collection... (10550 times) [2024-06-27 18:24:07,185][06887] Signal inference workers to resume experience collection... (10550 times) [2024-06-27 18:24:07,223][06909] InferenceWorker_p0-w0: stopping experience collection (10550 times) [2024-06-27 18:24:07,223][06909] InferenceWorker_p0-w0: resuming experience collection (10550 times) [2024-06-27 18:24:07,323][06909] Updated weights for policy 0, policy_version 50923 (0.0035) [2024-06-27 18:24:08,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.9, 300 sec: 43653.6). Total num frames: 834387968. Throughput: 0: 43714.3. Samples: 737341040. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 18:24:08,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 18:24:11,269][06909] Updated weights for policy 0, policy_version 50933 (0.0025) [2024-06-27 18:24:13,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.8, 300 sec: 43653.7). Total num frames: 834584576. Throughput: 0: 43742.3. Samples: 737467940. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 18:24:13,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:24:14,887][06909] Updated weights for policy 0, policy_version 50943 (0.0042) [2024-06-27 18:24:18,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.8, 300 sec: 43543.5). Total num frames: 834797568. Throughput: 0: 43728.0. Samples: 737729140. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 18:24:18,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:24:18,923][06909] Updated weights for policy 0, policy_version 50953 (0.0036) [2024-06-27 18:24:22,231][06909] Updated weights for policy 0, policy_version 50963 (0.0026) [2024-06-27 18:24:23,850][06674] Fps is (10 sec: 45873.8, 60 sec: 44236.6, 300 sec: 43599.0). Total num frames: 835043328. Throughput: 0: 43751.8. Samples: 737996340. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 18:24:23,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:24:26,252][06909] Updated weights for policy 0, policy_version 50973 (0.0041) [2024-06-27 18:24:28,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.8, 300 sec: 43542.6). Total num frames: 835239936. Throughput: 0: 43656.3. Samples: 738124200. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 18:24:28,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:24:29,942][06909] Updated weights for policy 0, policy_version 50983 (0.0027) [2024-06-27 18:24:33,716][06909] Updated weights for policy 0, policy_version 50993 (0.0038) [2024-06-27 18:24:33,850][06674] Fps is (10 sec: 42599.4, 60 sec: 43963.7, 300 sec: 43542.6). Total num frames: 835469312. Throughput: 0: 43707.2. Samples: 738384780. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 18:24:33,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:24:37,264][06909] Updated weights for policy 0, policy_version 51003 (0.0033) [2024-06-27 18:24:38,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43963.7, 300 sec: 43598.1). Total num frames: 835698688. Throughput: 0: 43582.6. Samples: 738648800. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 18:24:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 18:24:40,957][06909] Updated weights for policy 0, policy_version 51013 (0.0047) [2024-06-27 18:24:43,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43968.3, 300 sec: 43653.6). Total num frames: 835911680. Throughput: 0: 43589.3. Samples: 738779860. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 18:24:43,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:24:45,228][06909] Updated weights for policy 0, policy_version 51023 (0.0033) [2024-06-27 18:24:48,368][06909] Updated weights for policy 0, policy_version 51033 (0.0032) [2024-06-27 18:24:48,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.8, 300 sec: 43487.0). Total num frames: 836124672. Throughput: 0: 43744.5. Samples: 739041540. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 18:24:48,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:24:52,469][06909] Updated weights for policy 0, policy_version 51043 (0.0025) [2024-06-27 18:24:53,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43417.6, 300 sec: 43542.6). Total num frames: 836337664. Throughput: 0: 43726.2. Samples: 739308720. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 18:24:53,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:24:55,955][06909] Updated weights for policy 0, policy_version 51053 (0.0030) [2024-06-27 18:24:58,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43417.5, 300 sec: 43653.6). Total num frames: 836550656. Throughput: 0: 43779.8. Samples: 739438040. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 18:24:58,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:24:59,692][06909] Updated weights for policy 0, policy_version 51063 (0.0031) [2024-06-27 18:25:03,276][06909] Updated weights for policy 0, policy_version 51073 (0.0036) [2024-06-27 18:25:03,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.7, 300 sec: 43598.1). Total num frames: 836780032. Throughput: 0: 43835.5. Samples: 739701740. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 18:25:03,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:25:07,350][06909] Updated weights for policy 0, policy_version 51083 (0.0026) [2024-06-27 18:25:08,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 837009408. Throughput: 0: 43701.1. Samples: 739962880. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 18:25:08,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:25:10,707][06909] Updated weights for policy 0, policy_version 51093 (0.0028) [2024-06-27 18:25:13,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 837206016. Throughput: 0: 43835.0. Samples: 740096780. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 18:25:13,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:25:14,637][06909] Updated weights for policy 0, policy_version 51103 (0.0056) [2024-06-27 18:25:18,436][06909] Updated weights for policy 0, policy_version 51113 (0.0038) [2024-06-27 18:25:18,852][06674] Fps is (10 sec: 42589.6, 60 sec: 43962.1, 300 sec: 43597.8). Total num frames: 837435392. Throughput: 0: 43865.1. Samples: 740358800. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 18:25:18,852][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:25:22,328][06909] Updated weights for policy 0, policy_version 51123 (0.0039) [2024-06-27 18:25:23,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43690.8, 300 sec: 43709.5). Total num frames: 837664768. Throughput: 0: 43732.3. Samples: 740616760. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 18:25:23,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:25:25,732][06909] Updated weights for policy 0, policy_version 51133 (0.0037) [2024-06-27 18:25:28,850][06674] Fps is (10 sec: 40966.2, 60 sec: 43417.1, 300 sec: 43598.0). Total num frames: 837844992. Throughput: 0: 43670.5. Samples: 740745060. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 18:25:28,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:25:29,665][06909] Updated weights for policy 0, policy_version 51143 (0.0030) [2024-06-27 18:25:33,283][06909] Updated weights for policy 0, policy_version 51153 (0.0032) [2024-06-27 18:25:33,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 838090752. Throughput: 0: 43690.6. Samples: 741007620. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 18:25:33,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:25:37,468][06909] Updated weights for policy 0, policy_version 51163 (0.0037) [2024-06-27 18:25:38,856][06674] Fps is (10 sec: 47487.5, 60 sec: 43686.2, 300 sec: 43708.3). Total num frames: 838320128. Throughput: 0: 43705.6. Samples: 741275740. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 18:25:38,856][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:25:40,831][06909] Updated weights for policy 0, policy_version 51173 (0.0030) [2024-06-27 18:25:43,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 838533120. Throughput: 0: 43730.8. Samples: 741405920. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 18:25:43,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:25:44,668][06887] Signal inference workers to stop experience collection... (10600 times) [2024-06-27 18:25:44,671][06887] Signal inference workers to resume experience collection... (10600 times) [2024-06-27 18:25:44,692][06909] InferenceWorker_p0-w0: stopping experience collection (10600 times) [2024-06-27 18:25:44,692][06909] InferenceWorker_p0-w0: resuming experience collection (10600 times) [2024-06-27 18:25:44,807][06909] Updated weights for policy 0, policy_version 51183 (0.0034) [2024-06-27 18:25:48,192][06909] Updated weights for policy 0, policy_version 51193 (0.0035) [2024-06-27 18:25:48,850][06674] Fps is (10 sec: 42624.5, 60 sec: 43690.7, 300 sec: 43653.7). Total num frames: 838746112. Throughput: 0: 43614.2. Samples: 741664380. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 18:25:48,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:25:48,877][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000051193_838746112.pth... [2024-06-27 18:25:48,938][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000050553_828260352.pth [2024-06-27 18:25:52,820][06909] Updated weights for policy 0, policy_version 51203 (0.0032) [2024-06-27 18:25:53,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.7, 300 sec: 43653.7). Total num frames: 838959104. Throughput: 0: 43827.2. Samples: 741935100. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 18:25:53,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:25:55,542][06909] Updated weights for policy 0, policy_version 51213 (0.0023) [2024-06-27 18:25:58,852][06674] Fps is (10 sec: 42589.4, 60 sec: 43689.2, 300 sec: 43653.3). Total num frames: 839172096. Throughput: 0: 43721.6. Samples: 742064340. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 18:25:58,852][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:26:00,042][06909] Updated weights for policy 0, policy_version 51223 (0.0025) [2024-06-27 18:26:03,185][06909] Updated weights for policy 0, policy_version 51233 (0.0033) [2024-06-27 18:26:03,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 839401472. Throughput: 0: 43576.2. Samples: 742319640. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 18:26:03,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:26:07,748][06909] Updated weights for policy 0, policy_version 51243 (0.0031) [2024-06-27 18:26:08,850][06674] Fps is (10 sec: 45884.7, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 839630848. Throughput: 0: 43605.9. Samples: 742579020. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 18:26:08,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:26:10,536][06909] Updated weights for policy 0, policy_version 51253 (0.0036) [2024-06-27 18:26:13,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.6, 300 sec: 43653.9). Total num frames: 839827456. Throughput: 0: 43856.1. Samples: 742718560. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 18:26:13,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:26:15,001][06909] Updated weights for policy 0, policy_version 51263 (0.0053) [2024-06-27 18:26:17,921][06909] Updated weights for policy 0, policy_version 51273 (0.0043) [2024-06-27 18:26:18,851][06674] Fps is (10 sec: 42593.3, 60 sec: 43691.3, 300 sec: 43653.8). Total num frames: 840056832. Throughput: 0: 43746.9. Samples: 742976280. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-27 18:26:18,852][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 18:26:22,621][06909] Updated weights for policy 0, policy_version 51283 (0.0027) [2024-06-27 18:26:23,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 840286208. Throughput: 0: 43796.6. Samples: 743246320. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-27 18:26:23,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:26:25,634][06909] Updated weights for policy 0, policy_version 51293 (0.0019) [2024-06-27 18:26:28,850][06674] Fps is (10 sec: 44242.0, 60 sec: 44237.2, 300 sec: 43709.2). Total num frames: 840499200. Throughput: 0: 43842.2. Samples: 743378820. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-27 18:26:28,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:26:29,885][06909] Updated weights for policy 0, policy_version 51303 (0.0030) [2024-06-27 18:26:33,030][06909] Updated weights for policy 0, policy_version 51313 (0.0033) [2024-06-27 18:26:33,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 840712192. Throughput: 0: 43880.8. Samples: 743639020. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-27 18:26:33,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:26:37,539][06909] Updated weights for policy 0, policy_version 51323 (0.0036) [2024-06-27 18:26:38,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43422.0, 300 sec: 43653.7). Total num frames: 840925184. Throughput: 0: 43716.4. Samples: 743902340. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-27 18:26:38,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 18:26:40,857][06909] Updated weights for policy 0, policy_version 51333 (0.0031) [2024-06-27 18:26:43,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 841154560. Throughput: 0: 43734.4. Samples: 744032300. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-27 18:26:43,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:26:44,884][06909] Updated weights for policy 0, policy_version 51343 (0.0031) [2024-06-27 18:26:48,321][06909] Updated weights for policy 0, policy_version 51353 (0.0033) [2024-06-27 18:26:48,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43690.6, 300 sec: 43653.9). Total num frames: 841367552. Throughput: 0: 43775.1. Samples: 744289520. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-27 18:26:48,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 18:26:52,265][06909] Updated weights for policy 0, policy_version 51363 (0.0035) [2024-06-27 18:26:53,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 841580544. Throughput: 0: 43942.3. Samples: 744556420. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-27 18:26:53,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:26:55,817][06909] Updated weights for policy 0, policy_version 51373 (0.0031) [2024-06-27 18:26:58,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43692.1, 300 sec: 43653.7). Total num frames: 841793536. Throughput: 0: 43682.2. Samples: 744684260. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-27 18:26:58,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:27:00,061][06909] Updated weights for policy 0, policy_version 51383 (0.0030) [2024-06-27 18:27:03,184][06909] Updated weights for policy 0, policy_version 51393 (0.0032) [2024-06-27 18:27:03,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 842022912. Throughput: 0: 43692.7. Samples: 744942400. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-27 18:27:03,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:27:07,466][06909] Updated weights for policy 0, policy_version 51403 (0.0035) [2024-06-27 18:27:08,850][06674] Fps is (10 sec: 44237.8, 60 sec: 43417.7, 300 sec: 43598.1). Total num frames: 842235904. Throughput: 0: 43574.4. Samples: 745207160. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-27 18:27:08,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:27:10,642][06909] Updated weights for policy 0, policy_version 51413 (0.0041) [2024-06-27 18:27:13,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43963.8, 300 sec: 43709.2). Total num frames: 842465280. Throughput: 0: 43538.3. Samples: 745338040. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-27 18:27:13,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:27:14,923][06909] Updated weights for policy 0, policy_version 51423 (0.0041) [2024-06-27 18:27:17,295][06887] Signal inference workers to stop experience collection... (10650 times) [2024-06-27 18:27:17,302][06887] Signal inference workers to resume experience collection... (10650 times) [2024-06-27 18:27:17,346][06909] InferenceWorker_p0-w0: stopping experience collection (10650 times) [2024-06-27 18:27:17,347][06909] InferenceWorker_p0-w0: resuming experience collection (10650 times) [2024-06-27 18:27:18,550][06909] Updated weights for policy 0, policy_version 51433 (0.0042) [2024-06-27 18:27:18,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43691.6, 300 sec: 43653.6). Total num frames: 842678272. Throughput: 0: 43463.2. Samples: 745594860. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-27 18:27:18,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:27:22,368][06909] Updated weights for policy 0, policy_version 51443 (0.0044) [2024-06-27 18:27:23,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43144.6, 300 sec: 43598.4). Total num frames: 842874880. Throughput: 0: 43400.5. Samples: 745855360. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-27 18:27:23,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 18:27:26,326][06909] Updated weights for policy 0, policy_version 51453 (0.0035) [2024-06-27 18:27:28,850][06674] Fps is (10 sec: 42597.7, 60 sec: 43417.5, 300 sec: 43653.6). Total num frames: 843104256. Throughput: 0: 43360.8. Samples: 745983540. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-27 18:27:28,856][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:27:30,159][06909] Updated weights for policy 0, policy_version 51463 (0.0024) [2024-06-27 18:27:33,626][06909] Updated weights for policy 0, policy_version 51473 (0.0030) [2024-06-27 18:27:33,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 843333632. Throughput: 0: 43560.4. Samples: 746249740. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-27 18:27:33,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:27:37,382][06909] Updated weights for policy 0, policy_version 51483 (0.0037) [2024-06-27 18:27:38,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 843546624. Throughput: 0: 43564.8. Samples: 746516840. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-27 18:27:38,850][06674] Avg episode reward: [(0, '0.437')] [2024-06-27 18:27:41,003][06909] Updated weights for policy 0, policy_version 51493 (0.0035) [2024-06-27 18:27:43,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.6, 300 sec: 43710.1). Total num frames: 843776000. Throughput: 0: 43526.2. Samples: 746642940. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-27 18:27:43,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 18:27:44,779][06909] Updated weights for policy 0, policy_version 51503 (0.0029) [2024-06-27 18:27:48,302][06909] Updated weights for policy 0, policy_version 51513 (0.0036) [2024-06-27 18:27:48,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43963.7, 300 sec: 43765.0). Total num frames: 844005376. Throughput: 0: 43742.7. Samples: 746910820. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-27 18:27:48,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:27:48,879][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000051514_844005376.pth... [2024-06-27 18:27:48,928][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000050873_833503232.pth [2024-06-27 18:27:52,508][06909] Updated weights for policy 0, policy_version 51523 (0.0041) [2024-06-27 18:27:53,851][06674] Fps is (10 sec: 42592.9, 60 sec: 43689.6, 300 sec: 43653.4). Total num frames: 844201984. Throughput: 0: 43614.1. Samples: 747169860. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 18:27:53,852][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:27:56,502][06909] Updated weights for policy 0, policy_version 51533 (0.0025) [2024-06-27 18:27:58,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.7, 300 sec: 43653.9). Total num frames: 844414976. Throughput: 0: 43774.6. Samples: 747307900. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 18:27:58,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:27:59,743][06909] Updated weights for policy 0, policy_version 51543 (0.0033) [2024-06-27 18:28:03,743][06909] Updated weights for policy 0, policy_version 51553 (0.0033) [2024-06-27 18:28:03,850][06674] Fps is (10 sec: 44243.1, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 844644352. Throughput: 0: 43901.3. Samples: 747570420. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 18:28:03,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 18:28:07,415][06909] Updated weights for policy 0, policy_version 51563 (0.0032) [2024-06-27 18:28:08,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.6, 300 sec: 43764.7). Total num frames: 844873728. Throughput: 0: 43811.4. Samples: 747826880. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 18:28:08,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 18:28:11,214][06909] Updated weights for policy 0, policy_version 51573 (0.0027) [2024-06-27 18:28:13,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 845086720. Throughput: 0: 43906.4. Samples: 747959320. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 18:28:13,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:28:14,694][06909] Updated weights for policy 0, policy_version 51583 (0.0036) [2024-06-27 18:28:18,722][06909] Updated weights for policy 0, policy_version 51593 (0.0030) [2024-06-27 18:28:18,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 845299712. Throughput: 0: 43988.8. Samples: 748229240. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 18:28:18,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:28:22,090][06909] Updated weights for policy 0, policy_version 51603 (0.0027) [2024-06-27 18:28:23,852][06674] Fps is (10 sec: 44227.5, 60 sec: 44235.2, 300 sec: 43819.9). Total num frames: 845529088. Throughput: 0: 43781.2. Samples: 748487080. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 18:28:23,852][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:28:26,420][06909] Updated weights for policy 0, policy_version 51613 (0.0025) [2024-06-27 18:28:28,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43963.9, 300 sec: 43764.7). Total num frames: 845742080. Throughput: 0: 44020.1. Samples: 748623840. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 18:28:28,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:28:29,544][06909] Updated weights for policy 0, policy_version 51623 (0.0030) [2024-06-27 18:28:33,850][06674] Fps is (10 sec: 42607.7, 60 sec: 43690.8, 300 sec: 43709.2). Total num frames: 845955072. Throughput: 0: 43876.6. Samples: 748885260. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 18:28:33,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:28:33,857][06909] Updated weights for policy 0, policy_version 51633 (0.0032) [2024-06-27 18:28:37,072][06909] Updated weights for policy 0, policy_version 51643 (0.0035) [2024-06-27 18:28:38,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43963.7, 300 sec: 43765.6). Total num frames: 846184448. Throughput: 0: 43853.7. Samples: 749143220. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 18:28:38,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:28:41,329][06909] Updated weights for policy 0, policy_version 51653 (0.0040) [2024-06-27 18:28:43,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 846397440. Throughput: 0: 43718.2. Samples: 749275220. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 18:28:43,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:28:44,590][06909] Updated weights for policy 0, policy_version 51663 (0.0021) [2024-06-27 18:28:48,655][06909] Updated weights for policy 0, policy_version 51673 (0.0033) [2024-06-27 18:28:48,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43417.6, 300 sec: 43653.6). Total num frames: 846610432. Throughput: 0: 43780.9. Samples: 749540560. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 18:28:48,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 18:28:51,881][06909] Updated weights for policy 0, policy_version 51683 (0.0029) [2024-06-27 18:28:53,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43964.8, 300 sec: 43709.2). Total num frames: 846839808. Throughput: 0: 43833.5. Samples: 749799380. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 18:28:53,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:28:56,201][06909] Updated weights for policy 0, policy_version 51693 (0.0027) [2024-06-27 18:28:58,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.8, 300 sec: 43764.7). Total num frames: 847052800. Throughput: 0: 43911.5. Samples: 749935340. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 18:28:58,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:28:59,612][06909] Updated weights for policy 0, policy_version 51703 (0.0035) [2024-06-27 18:29:00,020][06887] Signal inference workers to stop experience collection... (10700 times) [2024-06-27 18:29:00,020][06887] Signal inference workers to resume experience collection... (10700 times) [2024-06-27 18:29:00,037][06909] InferenceWorker_p0-w0: stopping experience collection (10700 times) [2024-06-27 18:29:00,038][06909] InferenceWorker_p0-w0: resuming experience collection (10700 times) [2024-06-27 18:29:03,511][06909] Updated weights for policy 0, policy_version 51713 (0.0031) [2024-06-27 18:29:03,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 847265792. Throughput: 0: 43680.1. Samples: 750194840. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 18:29:03,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:29:07,285][06909] Updated weights for policy 0, policy_version 51723 (0.0025) [2024-06-27 18:29:08,850][06674] Fps is (10 sec: 45874.4, 60 sec: 43963.7, 300 sec: 43820.2). Total num frames: 847511552. Throughput: 0: 43688.5. Samples: 750452980. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 18:29:08,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:29:11,244][06909] Updated weights for policy 0, policy_version 51733 (0.0031) [2024-06-27 18:29:13,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43963.6, 300 sec: 43820.2). Total num frames: 847724544. Throughput: 0: 43717.2. Samples: 750591120. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 18:29:13,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:29:14,412][06909] Updated weights for policy 0, policy_version 51743 (0.0031) [2024-06-27 18:29:18,591][06909] Updated weights for policy 0, policy_version 51753 (0.0031) [2024-06-27 18:29:18,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43690.7, 300 sec: 43653.7). Total num frames: 847921152. Throughput: 0: 43768.3. Samples: 750854840. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 18:29:18,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:29:21,799][06909] Updated weights for policy 0, policy_version 51763 (0.0034) [2024-06-27 18:29:23,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43692.2, 300 sec: 43764.7). Total num frames: 848150528. Throughput: 0: 43620.6. Samples: 751106140. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 18:29:23,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:29:25,998][06909] Updated weights for policy 0, policy_version 51773 (0.0036) [2024-06-27 18:29:28,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43417.6, 300 sec: 43653.7). Total num frames: 848347136. Throughput: 0: 43714.8. Samples: 751242380. Policy #0 lag: (min: 1.0, avg: 8.7, max: 21.0) [2024-06-27 18:29:28,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:29:29,464][06909] Updated weights for policy 0, policy_version 51783 (0.0043) [2024-06-27 18:29:33,419][06909] Updated weights for policy 0, policy_version 51793 (0.0027) [2024-06-27 18:29:33,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 848576512. Throughput: 0: 43717.3. Samples: 751507840. Policy #0 lag: (min: 1.0, avg: 8.7, max: 21.0) [2024-06-27 18:29:33,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:29:36,874][06909] Updated weights for policy 0, policy_version 51803 (0.0033) [2024-06-27 18:29:38,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 848805888. Throughput: 0: 43560.8. Samples: 751759620. Policy #0 lag: (min: 1.0, avg: 8.7, max: 21.0) [2024-06-27 18:29:38,851][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 18:29:41,181][06909] Updated weights for policy 0, policy_version 51813 (0.0039) [2024-06-27 18:29:43,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43417.6, 300 sec: 43653.6). Total num frames: 849002496. Throughput: 0: 43605.7. Samples: 751897600. Policy #0 lag: (min: 1.0, avg: 8.7, max: 21.0) [2024-06-27 18:29:43,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:29:44,812][06909] Updated weights for policy 0, policy_version 51823 (0.0033) [2024-06-27 18:29:48,693][06909] Updated weights for policy 0, policy_version 51833 (0.0040) [2024-06-27 18:29:48,856][06674] Fps is (10 sec: 42572.8, 60 sec: 43686.2, 300 sec: 43708.3). Total num frames: 849231872. Throughput: 0: 43688.3. Samples: 752161080. Policy #0 lag: (min: 1.0, avg: 8.7, max: 21.0) [2024-06-27 18:29:48,856][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:29:48,866][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000051834_849248256.pth... [2024-06-27 18:29:48,912][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000051193_838746112.pth [2024-06-27 18:29:52,226][06909] Updated weights for policy 0, policy_version 51843 (0.0031) [2024-06-27 18:29:53,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 849461248. Throughput: 0: 43653.9. Samples: 752417400. Policy #0 lag: (min: 1.0, avg: 8.7, max: 21.0) [2024-06-27 18:29:53,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:29:56,314][06909] Updated weights for policy 0, policy_version 51853 (0.0032) [2024-06-27 18:29:58,852][06674] Fps is (10 sec: 44254.5, 60 sec: 43689.1, 300 sec: 43708.9). Total num frames: 849674240. Throughput: 0: 43637.7. Samples: 752554900. Policy #0 lag: (min: 1.0, avg: 8.7, max: 21.0) [2024-06-27 18:29:58,852][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:29:59,687][06909] Updated weights for policy 0, policy_version 51863 (0.0031) [2024-06-27 18:30:03,799][06909] Updated weights for policy 0, policy_version 51873 (0.0035) [2024-06-27 18:30:03,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 849887232. Throughput: 0: 43635.2. Samples: 752818420. Policy #0 lag: (min: 1.0, avg: 9.9, max: 20.0) [2024-06-27 18:30:03,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:30:07,134][06909] Updated weights for policy 0, policy_version 51883 (0.0044) [2024-06-27 18:30:08,852][06674] Fps is (10 sec: 44236.8, 60 sec: 43416.2, 300 sec: 43764.4). Total num frames: 850116608. Throughput: 0: 43602.9. Samples: 753068360. Policy #0 lag: (min: 1.0, avg: 9.9, max: 20.0) [2024-06-27 18:30:08,852][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:30:11,296][06909] Updated weights for policy 0, policy_version 51893 (0.0039) [2024-06-27 18:30:13,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43144.6, 300 sec: 43653.9). Total num frames: 850313216. Throughput: 0: 43735.0. Samples: 753210460. Policy #0 lag: (min: 1.0, avg: 9.9, max: 20.0) [2024-06-27 18:30:13,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:30:14,770][06909] Updated weights for policy 0, policy_version 51903 (0.0042) [2024-06-27 18:30:18,594][06909] Updated weights for policy 0, policy_version 51913 (0.0027) [2024-06-27 18:30:18,850][06674] Fps is (10 sec: 44245.6, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 850558976. Throughput: 0: 43627.9. Samples: 753471100. Policy #0 lag: (min: 1.0, avg: 9.9, max: 20.0) [2024-06-27 18:30:18,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:30:22,440][06909] Updated weights for policy 0, policy_version 51923 (0.0029) [2024-06-27 18:30:23,856][06674] Fps is (10 sec: 45847.5, 60 sec: 43686.2, 300 sec: 43819.4). Total num frames: 850771968. Throughput: 0: 43620.8. Samples: 753722820. Policy #0 lag: (min: 1.0, avg: 9.9, max: 20.0) [2024-06-27 18:30:23,857][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:30:26,034][06909] Updated weights for policy 0, policy_version 51933 (0.0037) [2024-06-27 18:30:28,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 850968576. Throughput: 0: 43566.6. Samples: 753858100. Policy #0 lag: (min: 1.0, avg: 9.9, max: 20.0) [2024-06-27 18:30:28,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:30:29,880][06909] Updated weights for policy 0, policy_version 51943 (0.0034) [2024-06-27 18:30:30,420][06887] Signal inference workers to stop experience collection... (10750 times) [2024-06-27 18:30:30,468][06887] Signal inference workers to resume experience collection... (10750 times) [2024-06-27 18:30:30,469][06909] InferenceWorker_p0-w0: stopping experience collection (10750 times) [2024-06-27 18:30:30,486][06909] InferenceWorker_p0-w0: resuming experience collection (10750 times) [2024-06-27 18:30:33,745][06909] Updated weights for policy 0, policy_version 51953 (0.0035) [2024-06-27 18:30:33,852][06674] Fps is (10 sec: 42615.5, 60 sec: 43689.2, 300 sec: 43654.2). Total num frames: 851197952. Throughput: 0: 43502.5. Samples: 754118520. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-27 18:30:33,853][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:30:37,259][06909] Updated weights for policy 0, policy_version 51963 (0.0041) [2024-06-27 18:30:38,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 851427328. Throughput: 0: 43500.5. Samples: 754374920. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-27 18:30:38,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:30:41,537][06909] Updated weights for policy 0, policy_version 51973 (0.0034) [2024-06-27 18:30:43,850][06674] Fps is (10 sec: 42607.0, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 851623936. Throughput: 0: 43610.9. Samples: 754517300. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-27 18:30:43,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:30:44,676][06909] Updated weights for policy 0, policy_version 51983 (0.0023) [2024-06-27 18:30:48,786][06909] Updated weights for policy 0, policy_version 51993 (0.0040) [2024-06-27 18:30:48,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43695.1, 300 sec: 43709.2). Total num frames: 851853312. Throughput: 0: 43440.0. Samples: 754773220. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-27 18:30:48,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:30:52,507][06909] Updated weights for policy 0, policy_version 52003 (0.0031) [2024-06-27 18:30:53,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43690.7, 300 sec: 43765.0). Total num frames: 852082688. Throughput: 0: 43555.3. Samples: 755028260. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-27 18:30:53,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:30:56,250][06909] Updated weights for policy 0, policy_version 52013 (0.0022) [2024-06-27 18:30:58,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43419.0, 300 sec: 43653.6). Total num frames: 852279296. Throughput: 0: 43468.4. Samples: 755166540. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-27 18:30:58,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:30:59,912][06909] Updated weights for policy 0, policy_version 52023 (0.0028) [2024-06-27 18:31:03,783][06909] Updated weights for policy 0, policy_version 52033 (0.0039) [2024-06-27 18:31:03,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 852508672. Throughput: 0: 43495.6. Samples: 755428400. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-27 18:31:03,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:31:07,619][06909] Updated weights for policy 0, policy_version 52043 (0.0029) [2024-06-27 18:31:08,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43419.1, 300 sec: 43709.2). Total num frames: 852721664. Throughput: 0: 43505.4. Samples: 755680300. Policy #0 lag: (min: 0.0, avg: 10.0, max: 25.0) [2024-06-27 18:31:08,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:31:11,519][06909] Updated weights for policy 0, policy_version 52053 (0.0044) [2024-06-27 18:31:13,852][06674] Fps is (10 sec: 42589.7, 60 sec: 43689.2, 300 sec: 43653.5). Total num frames: 852934656. Throughput: 0: 43424.8. Samples: 755812300. Policy #0 lag: (min: 0.0, avg: 10.0, max: 25.0) [2024-06-27 18:31:13,852][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:31:15,116][06909] Updated weights for policy 0, policy_version 52063 (0.0038) [2024-06-27 18:31:18,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43144.5, 300 sec: 43598.1). Total num frames: 853147648. Throughput: 0: 43425.5. Samples: 756072580. Policy #0 lag: (min: 0.0, avg: 10.0, max: 25.0) [2024-06-27 18:31:18,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 18:31:19,015][06909] Updated weights for policy 0, policy_version 52073 (0.0036) [2024-06-27 18:31:22,626][06909] Updated weights for policy 0, policy_version 52083 (0.0036) [2024-06-27 18:31:23,850][06674] Fps is (10 sec: 45884.7, 60 sec: 43695.1, 300 sec: 43709.2). Total num frames: 853393408. Throughput: 0: 43512.9. Samples: 756333000. Policy #0 lag: (min: 0.0, avg: 10.0, max: 25.0) [2024-06-27 18:31:23,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:31:26,548][06909] Updated weights for policy 0, policy_version 52093 (0.0043) [2024-06-27 18:31:28,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 853573632. Throughput: 0: 43340.0. Samples: 756467600. Policy #0 lag: (min: 0.0, avg: 10.0, max: 25.0) [2024-06-27 18:31:28,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:31:30,153][06909] Updated weights for policy 0, policy_version 52103 (0.0033) [2024-06-27 18:31:33,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43419.1, 300 sec: 43653.6). Total num frames: 853803008. Throughput: 0: 43566.3. Samples: 756733700. Policy #0 lag: (min: 0.0, avg: 10.0, max: 25.0) [2024-06-27 18:31:33,850][06674] Avg episode reward: [(0, '0.409')] [2024-06-27 18:31:33,977][06909] Updated weights for policy 0, policy_version 52113 (0.0048) [2024-06-27 18:31:37,658][06909] Updated weights for policy 0, policy_version 52123 (0.0038) [2024-06-27 18:31:38,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43144.5, 300 sec: 43598.1). Total num frames: 854016000. Throughput: 0: 43542.3. Samples: 756987660. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 18:31:38,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:31:41,325][06909] Updated weights for policy 0, policy_version 52133 (0.0031) [2024-06-27 18:31:43,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 854245376. Throughput: 0: 43422.2. Samples: 757120540. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 18:31:43,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:31:45,124][06909] Updated weights for policy 0, policy_version 52143 (0.0036) [2024-06-27 18:31:48,850][06674] Fps is (10 sec: 44235.9, 60 sec: 43417.5, 300 sec: 43653.6). Total num frames: 854458368. Throughput: 0: 43499.0. Samples: 757385860. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 18:31:48,851][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:31:48,991][06909] Updated weights for policy 0, policy_version 52153 (0.0028) [2024-06-27 18:31:48,992][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000052153_854474752.pth... [2024-06-27 18:31:49,032][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000051514_844005376.pth [2024-06-27 18:31:53,090][06909] Updated weights for policy 0, policy_version 52163 (0.0031) [2024-06-27 18:31:53,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43144.6, 300 sec: 43653.7). Total num frames: 854671360. Throughput: 0: 43456.0. Samples: 757635820. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 18:31:53,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:31:56,366][06909] Updated weights for policy 0, policy_version 52173 (0.0026) [2024-06-27 18:31:57,843][06887] Signal inference workers to stop experience collection... (10800 times) [2024-06-27 18:31:57,845][06887] Signal inference workers to resume experience collection... (10800 times) [2024-06-27 18:31:57,888][06909] InferenceWorker_p0-w0: stopping experience collection (10800 times) [2024-06-27 18:31:57,888][06909] InferenceWorker_p0-w0: resuming experience collection (10800 times) [2024-06-27 18:31:58,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 854884352. Throughput: 0: 43405.9. Samples: 757765480. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 18:31:58,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:32:00,406][06909] Updated weights for policy 0, policy_version 52183 (0.0021) [2024-06-27 18:32:03,718][06909] Updated weights for policy 0, policy_version 52193 (0.0040) [2024-06-27 18:32:03,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43690.6, 300 sec: 43709.1). Total num frames: 855130112. Throughput: 0: 43620.5. Samples: 758035500. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 18:32:03,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:32:07,796][06909] Updated weights for policy 0, policy_version 52203 (0.0036) [2024-06-27 18:32:08,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 855326720. Throughput: 0: 43538.6. Samples: 758292240. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 18:32:08,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:32:11,478][06909] Updated weights for policy 0, policy_version 52213 (0.0042) [2024-06-27 18:32:13,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43692.1, 300 sec: 43653.6). Total num frames: 855556096. Throughput: 0: 43522.3. Samples: 758426100. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-27 18:32:13,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:32:15,286][06909] Updated weights for policy 0, policy_version 52223 (0.0038) [2024-06-27 18:32:18,814][06909] Updated weights for policy 0, policy_version 52233 (0.0038) [2024-06-27 18:32:18,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.8, 300 sec: 43764.7). Total num frames: 855785472. Throughput: 0: 43511.0. Samples: 758691700. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-27 18:32:18,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:32:22,727][06909] Updated weights for policy 0, policy_version 52243 (0.0033) [2024-06-27 18:32:23,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 855998464. Throughput: 0: 43619.5. Samples: 758950540. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-27 18:32:23,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:32:26,557][06909] Updated weights for policy 0, policy_version 52253 (0.0028) [2024-06-27 18:32:28,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43963.7, 300 sec: 43653.6). Total num frames: 856211456. Throughput: 0: 43558.6. Samples: 759080680. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-27 18:32:28,851][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:32:30,257][06909] Updated weights for policy 0, policy_version 52263 (0.0035) [2024-06-27 18:32:33,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 856408064. Throughput: 0: 43452.7. Samples: 759341220. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-27 18:32:33,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:32:34,279][06909] Updated weights for policy 0, policy_version 52273 (0.0032) [2024-06-27 18:32:37,722][06909] Updated weights for policy 0, policy_version 52283 (0.0028) [2024-06-27 18:32:38,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 856637440. Throughput: 0: 43591.1. Samples: 759597420. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-27 18:32:38,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:32:41,564][06909] Updated weights for policy 0, policy_version 52293 (0.0034) [2024-06-27 18:32:43,856][06674] Fps is (10 sec: 45847.3, 60 sec: 43686.3, 300 sec: 43597.2). Total num frames: 856866816. Throughput: 0: 43649.3. Samples: 759729960. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 18:32:43,856][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:32:45,528][06909] Updated weights for policy 0, policy_version 52303 (0.0042) [2024-06-27 18:32:48,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.8, 300 sec: 43653.9). Total num frames: 857079808. Throughput: 0: 43492.1. Samples: 759992640. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 18:32:48,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:32:49,418][06909] Updated weights for policy 0, policy_version 52313 (0.0027) [2024-06-27 18:32:52,837][06909] Updated weights for policy 0, policy_version 52323 (0.0031) [2024-06-27 18:32:53,850][06674] Fps is (10 sec: 42624.3, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 857292800. Throughput: 0: 43604.5. Samples: 760254440. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 18:32:53,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:32:56,784][06909] Updated weights for policy 0, policy_version 52333 (0.0048) [2024-06-27 18:32:58,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.8, 300 sec: 43598.1). Total num frames: 857505792. Throughput: 0: 43489.0. Samples: 760383100. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 18:32:58,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:33:00,373][06909] Updated weights for policy 0, policy_version 52343 (0.0049) [2024-06-27 18:33:03,850][06674] Fps is (10 sec: 44234.9, 60 sec: 43417.4, 300 sec: 43598.1). Total num frames: 857735168. Throughput: 0: 43609.0. Samples: 760654120. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 18:33:03,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 18:33:04,360][06909] Updated weights for policy 0, policy_version 52353 (0.0045) [2024-06-27 18:33:07,755][06909] Updated weights for policy 0, policy_version 52363 (0.0030) [2024-06-27 18:33:08,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 857948160. Throughput: 0: 43432.4. Samples: 760905000. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 18:33:08,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:33:11,959][06909] Updated weights for policy 0, policy_version 52373 (0.0032) [2024-06-27 18:33:13,850][06674] Fps is (10 sec: 42599.9, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 858161152. Throughput: 0: 43555.2. Samples: 761040660. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 18:33:13,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:33:15,189][06909] Updated weights for policy 0, policy_version 52383 (0.0033) [2024-06-27 18:33:18,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43417.6, 300 sec: 43598.4). Total num frames: 858390528. Throughput: 0: 43623.0. Samples: 761304260. Policy #0 lag: (min: 0.0, avg: 11.7, max: 22.0) [2024-06-27 18:33:18,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 18:33:19,397][06909] Updated weights for policy 0, policy_version 52393 (0.0030) [2024-06-27 18:33:22,965][06909] Updated weights for policy 0, policy_version 52403 (0.0022) [2024-06-27 18:33:23,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43417.5, 300 sec: 43598.1). Total num frames: 858603520. Throughput: 0: 43675.4. Samples: 761562820. Policy #0 lag: (min: 0.0, avg: 11.7, max: 22.0) [2024-06-27 18:33:23,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:33:27,059][06909] Updated weights for policy 0, policy_version 52413 (0.0031) [2024-06-27 18:33:27,153][06887] Signal inference workers to stop experience collection... (10850 times) [2024-06-27 18:33:27,206][06909] InferenceWorker_p0-w0: stopping experience collection (10850 times) [2024-06-27 18:33:27,207][06887] Signal inference workers to resume experience collection... (10850 times) [2024-06-27 18:33:27,217][06909] InferenceWorker_p0-w0: resuming experience collection (10850 times) [2024-06-27 18:33:28,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43144.7, 300 sec: 43542.5). Total num frames: 858800128. Throughput: 0: 43666.8. Samples: 761694700. Policy #0 lag: (min: 0.0, avg: 11.7, max: 22.0) [2024-06-27 18:33:28,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:33:30,421][06909] Updated weights for policy 0, policy_version 52423 (0.0032) [2024-06-27 18:33:33,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.6, 300 sec: 43598.1). Total num frames: 859045888. Throughput: 0: 43800.3. Samples: 761963660. Policy #0 lag: (min: 0.0, avg: 11.7, max: 22.0) [2024-06-27 18:33:33,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:33:34,391][06909] Updated weights for policy 0, policy_version 52433 (0.0028) [2024-06-27 18:33:37,623][06909] Updated weights for policy 0, policy_version 52443 (0.0032) [2024-06-27 18:33:38,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 859258880. Throughput: 0: 43695.6. Samples: 762220740. Policy #0 lag: (min: 0.0, avg: 11.7, max: 22.0) [2024-06-27 18:33:38,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:33:41,696][06909] Updated weights for policy 0, policy_version 52453 (0.0034) [2024-06-27 18:33:43,851][06674] Fps is (10 sec: 42591.7, 60 sec: 43420.8, 300 sec: 43597.9). Total num frames: 859471872. Throughput: 0: 43867.6. Samples: 762357220. Policy #0 lag: (min: 0.0, avg: 11.7, max: 22.0) [2024-06-27 18:33:43,852][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 18:33:45,282][06909] Updated weights for policy 0, policy_version 52463 (0.0043) [2024-06-27 18:33:48,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43417.5, 300 sec: 43542.5). Total num frames: 859684864. Throughput: 0: 43578.6. Samples: 762615140. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 18:33:48,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:33:48,967][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000052472_859701248.pth... [2024-06-27 18:33:49,029][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000051834_849248256.pth [2024-06-27 18:33:49,254][06909] Updated weights for policy 0, policy_version 52473 (0.0024) [2024-06-27 18:33:52,930][06909] Updated weights for policy 0, policy_version 52483 (0.0027) [2024-06-27 18:33:53,850][06674] Fps is (10 sec: 44244.4, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 859914240. Throughput: 0: 43688.1. Samples: 762870960. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 18:33:53,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:33:56,883][06909] Updated weights for policy 0, policy_version 52493 (0.0032) [2024-06-27 18:33:58,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43963.7, 300 sec: 43653.6). Total num frames: 860143616. Throughput: 0: 43606.3. Samples: 763002940. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 18:33:58,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:34:00,714][06909] Updated weights for policy 0, policy_version 52503 (0.0024) [2024-06-27 18:34:03,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43691.0, 300 sec: 43542.6). Total num frames: 860356608. Throughput: 0: 43704.1. Samples: 763270940. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 18:34:03,852][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:34:04,202][06909] Updated weights for policy 0, policy_version 52513 (0.0027) [2024-06-27 18:34:08,133][06909] Updated weights for policy 0, policy_version 52523 (0.0045) [2024-06-27 18:34:08,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 43542.6). Total num frames: 860569600. Throughput: 0: 43744.2. Samples: 763531300. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 18:34:08,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:34:11,643][06909] Updated weights for policy 0, policy_version 52533 (0.0037) [2024-06-27 18:34:13,856][06674] Fps is (10 sec: 42572.2, 60 sec: 43686.3, 300 sec: 43597.2). Total num frames: 860782592. Throughput: 0: 43795.3. Samples: 763665760. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 18:34:13,857][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:34:15,524][06909] Updated weights for policy 0, policy_version 52543 (0.0024) [2024-06-27 18:34:18,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43417.7, 300 sec: 43542.6). Total num frames: 860995584. Throughput: 0: 43587.2. Samples: 763925080. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 18:34:18,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:34:19,124][06909] Updated weights for policy 0, policy_version 52553 (0.0035) [2024-06-27 18:34:22,902][06909] Updated weights for policy 0, policy_version 52563 (0.0028) [2024-06-27 18:34:23,850][06674] Fps is (10 sec: 44264.0, 60 sec: 43690.8, 300 sec: 43653.6). Total num frames: 861224960. Throughput: 0: 43672.0. Samples: 764185980. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 18:34:23,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:34:26,896][06909] Updated weights for policy 0, policy_version 52573 (0.0042) [2024-06-27 18:34:28,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 43598.1). Total num frames: 861437952. Throughput: 0: 43660.3. Samples: 764321860. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 18:34:28,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 18:34:30,450][06909] Updated weights for policy 0, policy_version 52583 (0.0031) [2024-06-27 18:34:33,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 861667328. Throughput: 0: 43634.2. Samples: 764578680. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 18:34:33,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:34:34,480][06909] Updated weights for policy 0, policy_version 52593 (0.0024) [2024-06-27 18:34:37,738][06909] Updated weights for policy 0, policy_version 52603 (0.0033) [2024-06-27 18:34:38,850][06674] Fps is (10 sec: 44236.0, 60 sec: 43690.5, 300 sec: 43653.6). Total num frames: 861880320. Throughput: 0: 43847.8. Samples: 764844120. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 18:34:38,851][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:34:41,836][06909] Updated weights for policy 0, policy_version 52613 (0.0029) [2024-06-27 18:34:43,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43691.9, 300 sec: 43599.0). Total num frames: 862093312. Throughput: 0: 43692.4. Samples: 764969100. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 18:34:43,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 18:34:45,853][06909] Updated weights for policy 0, policy_version 52623 (0.0037) [2024-06-27 18:34:48,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43963.8, 300 sec: 43598.1). Total num frames: 862322688. Throughput: 0: 43570.6. Samples: 765231620. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 18:34:48,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:34:49,598][06909] Updated weights for policy 0, policy_version 52633 (0.0038) [2024-06-27 18:34:53,154][06909] Updated weights for policy 0, policy_version 52643 (0.0031) [2024-06-27 18:34:53,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43690.6, 300 sec: 43598.4). Total num frames: 862535680. Throughput: 0: 43608.3. Samples: 765493680. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 18:34:53,851][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 18:34:55,216][06887] Signal inference workers to stop experience collection... (10900 times) [2024-06-27 18:34:55,216][06887] Signal inference workers to resume experience collection... (10900 times) [2024-06-27 18:34:55,255][06909] InferenceWorker_p0-w0: stopping experience collection (10900 times) [2024-06-27 18:34:55,255][06909] InferenceWorker_p0-w0: resuming experience collection (10900 times) [2024-06-27 18:34:56,924][06909] Updated weights for policy 0, policy_version 52653 (0.0045) [2024-06-27 18:34:58,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43417.5, 300 sec: 43598.1). Total num frames: 862748672. Throughput: 0: 43489.8. Samples: 765622540. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 18:34:58,851][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 18:35:00,637][06909] Updated weights for policy 0, policy_version 52663 (0.0031) [2024-06-27 18:35:03,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43417.6, 300 sec: 43542.9). Total num frames: 862961664. Throughput: 0: 43647.1. Samples: 765889200. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 18:35:03,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:35:04,234][06909] Updated weights for policy 0, policy_version 52673 (0.0028) [2024-06-27 18:35:07,918][06909] Updated weights for policy 0, policy_version 52683 (0.0036) [2024-06-27 18:35:08,850][06674] Fps is (10 sec: 44237.6, 60 sec: 43690.7, 300 sec: 43653.7). Total num frames: 863191040. Throughput: 0: 43794.7. Samples: 766156740. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 18:35:08,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:35:11,872][06909] Updated weights for policy 0, policy_version 52693 (0.0033) [2024-06-27 18:35:13,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43695.1, 300 sec: 43542.6). Total num frames: 863404032. Throughput: 0: 43566.5. Samples: 766282360. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 18:35:13,851][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:35:15,549][06909] Updated weights for policy 0, policy_version 52703 (0.0037) [2024-06-27 18:35:18,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43963.6, 300 sec: 43599.0). Total num frames: 863633408. Throughput: 0: 43712.9. Samples: 766545760. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 18:35:18,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:35:19,353][06909] Updated weights for policy 0, policy_version 52713 (0.0033) [2024-06-27 18:35:23,197][06909] Updated weights for policy 0, policy_version 52723 (0.0048) [2024-06-27 18:35:23,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43417.5, 300 sec: 43598.1). Total num frames: 863830016. Throughput: 0: 43477.8. Samples: 766800620. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 18:35:23,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:35:26,967][06909] Updated weights for policy 0, policy_version 52733 (0.0037) [2024-06-27 18:35:28,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43417.5, 300 sec: 43542.9). Total num frames: 864043008. Throughput: 0: 43552.4. Samples: 766928960. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-27 18:35:28,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 18:35:30,518][06909] Updated weights for policy 0, policy_version 52743 (0.0032) [2024-06-27 18:35:33,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43417.7, 300 sec: 43542.6). Total num frames: 864272384. Throughput: 0: 43544.9. Samples: 767191140. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-27 18:35:33,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:35:34,596][06909] Updated weights for policy 0, policy_version 52753 (0.0042) [2024-06-27 18:35:38,346][06909] Updated weights for policy 0, policy_version 52763 (0.0034) [2024-06-27 18:35:38,852][06674] Fps is (10 sec: 45866.1, 60 sec: 43689.3, 300 sec: 43653.3). Total num frames: 864501760. Throughput: 0: 43550.1. Samples: 767453520. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-27 18:35:38,852][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:35:41,894][06909] Updated weights for policy 0, policy_version 52773 (0.0021) [2024-06-27 18:35:43,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43417.6, 300 sec: 43542.6). Total num frames: 864698368. Throughput: 0: 43670.8. Samples: 767587720. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-27 18:35:43,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:35:45,635][06909] Updated weights for policy 0, policy_version 52783 (0.0031) [2024-06-27 18:35:48,850][06674] Fps is (10 sec: 42606.9, 60 sec: 43417.5, 300 sec: 43542.6). Total num frames: 864927744. Throughput: 0: 43469.6. Samples: 767845340. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-27 18:35:48,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:35:48,951][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000052792_864944128.pth... [2024-06-27 18:35:48,998][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000052153_854474752.pth [2024-06-27 18:35:49,664][06909] Updated weights for policy 0, policy_version 52793 (0.0032) [2024-06-27 18:35:53,065][06909] Updated weights for policy 0, policy_version 52803 (0.0041) [2024-06-27 18:35:53,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43690.7, 300 sec: 43653.7). Total num frames: 865157120. Throughput: 0: 43375.1. Samples: 768108620. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-27 18:35:53,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 18:35:56,848][06909] Updated weights for policy 0, policy_version 52813 (0.0026) [2024-06-27 18:35:58,851][06674] Fps is (10 sec: 44232.1, 60 sec: 43689.9, 300 sec: 43597.9). Total num frames: 865370112. Throughput: 0: 43571.4. Samples: 768243120. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-27 18:35:58,851][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:36:00,584][06909] Updated weights for policy 0, policy_version 52823 (0.0031) [2024-06-27 18:36:03,852][06674] Fps is (10 sec: 44227.4, 60 sec: 43962.1, 300 sec: 43653.3). Total num frames: 865599488. Throughput: 0: 43496.7. Samples: 768503200. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 18:36:03,853][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:36:04,274][06909] Updated weights for policy 0, policy_version 52833 (0.0031) [2024-06-27 18:36:08,341][06909] Updated weights for policy 0, policy_version 52843 (0.0031) [2024-06-27 18:36:08,856][06674] Fps is (10 sec: 42577.5, 60 sec: 43413.2, 300 sec: 43597.5). Total num frames: 865796096. Throughput: 0: 43595.1. Samples: 768762660. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 18:36:08,856][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:36:11,687][06909] Updated weights for policy 0, policy_version 52853 (0.0030) [2024-06-27 18:36:13,850][06674] Fps is (10 sec: 42607.4, 60 sec: 43690.7, 300 sec: 43653.7). Total num frames: 866025472. Throughput: 0: 43717.5. Samples: 768896240. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 18:36:13,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:36:15,736][06909] Updated weights for policy 0, policy_version 52863 (0.0031) [2024-06-27 18:36:18,850][06674] Fps is (10 sec: 45903.2, 60 sec: 43690.8, 300 sec: 43598.1). Total num frames: 866254848. Throughput: 0: 43752.0. Samples: 769159980. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 18:36:18,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:36:19,098][06909] Updated weights for policy 0, policy_version 52873 (0.0032) [2024-06-27 18:36:23,235][06909] Updated weights for policy 0, policy_version 52883 (0.0031) [2024-06-27 18:36:23,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 866451456. Throughput: 0: 43812.2. Samples: 769424980. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 18:36:23,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:36:26,760][06909] Updated weights for policy 0, policy_version 52893 (0.0036) [2024-06-27 18:36:28,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43963.8, 300 sec: 43653.6). Total num frames: 866680832. Throughput: 0: 43740.4. Samples: 769556040. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 18:36:28,850][06674] Avg episode reward: [(0, '0.403')] [2024-06-27 18:36:30,499][06909] Updated weights for policy 0, policy_version 52903 (0.0038) [2024-06-27 18:36:33,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 866893824. Throughput: 0: 43819.2. Samples: 769817200. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-27 18:36:33,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:36:34,385][06909] Updated weights for policy 0, policy_version 52913 (0.0031) [2024-06-27 18:36:37,894][06909] Updated weights for policy 0, policy_version 52923 (0.0023) [2024-06-27 18:36:38,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43419.1, 300 sec: 43598.1). Total num frames: 867106816. Throughput: 0: 44004.4. Samples: 770088820. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-27 18:36:38,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:36:39,022][06887] Signal inference workers to stop experience collection... (10950 times) [2024-06-27 18:36:39,072][06909] InferenceWorker_p0-w0: stopping experience collection (10950 times) [2024-06-27 18:36:39,081][06887] Signal inference workers to resume experience collection... (10950 times) [2024-06-27 18:36:39,096][06909] InferenceWorker_p0-w0: resuming experience collection (10950 times) [2024-06-27 18:36:41,574][06909] Updated weights for policy 0, policy_version 52933 (0.0049) [2024-06-27 18:36:43,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 43653.7). Total num frames: 867336192. Throughput: 0: 43815.3. Samples: 770214760. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-27 18:36:43,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:36:45,598][06909] Updated weights for policy 0, policy_version 52943 (0.0042) [2024-06-27 18:36:48,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 867549184. Throughput: 0: 43614.1. Samples: 770465740. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-27 18:36:48,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:36:49,370][06909] Updated weights for policy 0, policy_version 52953 (0.0045) [2024-06-27 18:36:53,078][06909] Updated weights for policy 0, policy_version 52963 (0.0035) [2024-06-27 18:36:53,852][06674] Fps is (10 sec: 42589.9, 60 sec: 43416.1, 300 sec: 43653.3). Total num frames: 867762176. Throughput: 0: 43750.6. Samples: 770731260. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-27 18:36:53,852][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:36:56,727][06909] Updated weights for policy 0, policy_version 52973 (0.0037) [2024-06-27 18:36:58,852][06674] Fps is (10 sec: 44227.3, 60 sec: 43690.0, 300 sec: 43597.8). Total num frames: 867991552. Throughput: 0: 43736.6. Samples: 770864480. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-27 18:36:58,853][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:37:00,665][06909] Updated weights for policy 0, policy_version 52983 (0.0021) [2024-06-27 18:37:03,850][06674] Fps is (10 sec: 44246.1, 60 sec: 43419.2, 300 sec: 43653.7). Total num frames: 868204544. Throughput: 0: 43531.6. Samples: 771118900. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-27 18:37:03,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:37:04,715][06909] Updated weights for policy 0, policy_version 52993 (0.0039) [2024-06-27 18:37:08,155][06909] Updated weights for policy 0, policy_version 53003 (0.0030) [2024-06-27 18:37:08,850][06674] Fps is (10 sec: 44245.8, 60 sec: 43968.1, 300 sec: 43653.6). Total num frames: 868433920. Throughput: 0: 43613.3. Samples: 771387580. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 18:37:08,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:37:12,064][06909] Updated weights for policy 0, policy_version 53013 (0.0029) [2024-06-27 18:37:13,850][06674] Fps is (10 sec: 42597.7, 60 sec: 43417.5, 300 sec: 43542.6). Total num frames: 868630528. Throughput: 0: 43555.5. Samples: 771516040. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 18:37:13,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 18:37:15,568][06909] Updated weights for policy 0, policy_version 53023 (0.0027) [2024-06-27 18:37:18,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43144.5, 300 sec: 43542.6). Total num frames: 868843520. Throughput: 0: 43466.6. Samples: 771773200. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 18:37:18,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:37:19,354][06909] Updated weights for policy 0, policy_version 53033 (0.0034) [2024-06-27 18:37:23,387][06909] Updated weights for policy 0, policy_version 53043 (0.0033) [2024-06-27 18:37:23,852][06674] Fps is (10 sec: 44228.1, 60 sec: 43689.2, 300 sec: 43597.8). Total num frames: 869072896. Throughput: 0: 43439.3. Samples: 772043680. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 18:37:23,852][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:37:26,991][06909] Updated weights for policy 0, policy_version 53053 (0.0023) [2024-06-27 18:37:28,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43417.7, 300 sec: 43653.6). Total num frames: 869285888. Throughput: 0: 43462.4. Samples: 772170560. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 18:37:28,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:37:30,859][06909] Updated weights for policy 0, policy_version 53063 (0.0044) [2024-06-27 18:37:33,850][06674] Fps is (10 sec: 42607.2, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 869498880. Throughput: 0: 43423.5. Samples: 772419800. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 18:37:33,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:37:34,519][06909] Updated weights for policy 0, policy_version 53073 (0.0023) [2024-06-27 18:37:38,507][06909] Updated weights for policy 0, policy_version 53083 (0.0041) [2024-06-27 18:37:38,852][06674] Fps is (10 sec: 42589.2, 60 sec: 43416.1, 300 sec: 43543.2). Total num frames: 869711872. Throughput: 0: 43462.2. Samples: 772687060. Policy #0 lag: (min: 1.0, avg: 9.4, max: 20.0) [2024-06-27 18:37:38,852][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:37:42,001][06909] Updated weights for policy 0, policy_version 53093 (0.0021) [2024-06-27 18:37:43,856][06674] Fps is (10 sec: 42573.3, 60 sec: 43140.3, 300 sec: 43541.7). Total num frames: 869924864. Throughput: 0: 43298.1. Samples: 772813060. Policy #0 lag: (min: 1.0, avg: 9.4, max: 20.0) [2024-06-27 18:37:43,856][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:37:46,166][06909] Updated weights for policy 0, policy_version 53103 (0.0024) [2024-06-27 18:37:48,850][06674] Fps is (10 sec: 44245.9, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 870154240. Throughput: 0: 43387.5. Samples: 773071340. Policy #0 lag: (min: 1.0, avg: 9.4, max: 20.0) [2024-06-27 18:37:48,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:37:48,992][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000053111_870170624.pth... [2024-06-27 18:37:49,046][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000052472_859701248.pth [2024-06-27 18:37:49,590][06909] Updated weights for policy 0, policy_version 53113 (0.0035) [2024-06-27 18:37:53,610][06909] Updated weights for policy 0, policy_version 53123 (0.0038) [2024-06-27 18:37:53,850][06674] Fps is (10 sec: 45901.9, 60 sec: 43692.1, 300 sec: 43653.6). Total num frames: 870383616. Throughput: 0: 43407.6. Samples: 773340920. Policy #0 lag: (min: 1.0, avg: 9.4, max: 20.0) [2024-06-27 18:37:53,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:37:57,030][06909] Updated weights for policy 0, policy_version 53133 (0.0035) [2024-06-27 18:37:58,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43146.0, 300 sec: 43542.6). Total num frames: 870580224. Throughput: 0: 43472.5. Samples: 773472300. Policy #0 lag: (min: 1.0, avg: 9.4, max: 20.0) [2024-06-27 18:37:58,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:38:00,953][06909] Updated weights for policy 0, policy_version 53143 (0.0033) [2024-06-27 18:38:03,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43417.5, 300 sec: 43598.1). Total num frames: 870809600. Throughput: 0: 43537.8. Samples: 773732400. Policy #0 lag: (min: 1.0, avg: 9.4, max: 20.0) [2024-06-27 18:38:03,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:38:04,438][06909] Updated weights for policy 0, policy_version 53153 (0.0030) [2024-06-27 18:38:08,398][06887] Signal inference workers to stop experience collection... (11000 times) [2024-06-27 18:38:08,400][06887] Signal inference workers to resume experience collection... (11000 times) [2024-06-27 18:38:08,439][06909] InferenceWorker_p0-w0: stopping experience collection (11000 times) [2024-06-27 18:38:08,439][06909] InferenceWorker_p0-w0: resuming experience collection (11000 times) [2024-06-27 18:38:08,537][06909] Updated weights for policy 0, policy_version 53163 (0.0046) [2024-06-27 18:38:08,850][06674] Fps is (10 sec: 45876.0, 60 sec: 43417.7, 300 sec: 43653.7). Total num frames: 871038976. Throughput: 0: 43515.0. Samples: 774001760. Policy #0 lag: (min: 1.0, avg: 9.4, max: 20.0) [2024-06-27 18:38:08,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:38:12,148][06909] Updated weights for policy 0, policy_version 53173 (0.0025) [2024-06-27 18:38:13,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43417.7, 300 sec: 43542.6). Total num frames: 871235584. Throughput: 0: 43448.0. Samples: 774125720. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-27 18:38:13,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 18:38:16,096][06909] Updated weights for policy 0, policy_version 53183 (0.0033) [2024-06-27 18:38:18,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 871464960. Throughput: 0: 43665.0. Samples: 774384720. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-27 18:38:18,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:38:19,431][06909] Updated weights for policy 0, policy_version 53193 (0.0038) [2024-06-27 18:38:23,373][06909] Updated weights for policy 0, policy_version 53203 (0.0028) [2024-06-27 18:38:23,850][06674] Fps is (10 sec: 47512.8, 60 sec: 43965.2, 300 sec: 43764.7). Total num frames: 871710720. Throughput: 0: 43749.4. Samples: 774655700. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-27 18:38:23,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:38:26,816][06909] Updated weights for policy 0, policy_version 53213 (0.0024) [2024-06-27 18:38:28,852][06674] Fps is (10 sec: 44227.6, 60 sec: 43689.1, 300 sec: 43597.8). Total num frames: 871907328. Throughput: 0: 43908.7. Samples: 774788780. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-27 18:38:28,852][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:38:30,822][06909] Updated weights for policy 0, policy_version 53223 (0.0026) [2024-06-27 18:38:33,850][06674] Fps is (10 sec: 40960.7, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 872120320. Throughput: 0: 43960.9. Samples: 775049580. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-27 18:38:33,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 18:38:34,550][06909] Updated weights for policy 0, policy_version 53233 (0.0034) [2024-06-27 18:38:38,366][06909] Updated weights for policy 0, policy_version 53243 (0.0030) [2024-06-27 18:38:38,850][06674] Fps is (10 sec: 45884.5, 60 sec: 44238.3, 300 sec: 43709.4). Total num frames: 872366080. Throughput: 0: 43821.4. Samples: 775312880. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-27 18:38:38,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:38:42,022][06909] Updated weights for policy 0, policy_version 53253 (0.0033) [2024-06-27 18:38:43,850][06674] Fps is (10 sec: 42596.9, 60 sec: 43694.8, 300 sec: 43598.1). Total num frames: 872546304. Throughput: 0: 43847.8. Samples: 775445460. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-27 18:38:43,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:38:45,907][06909] Updated weights for policy 0, policy_version 53263 (0.0033) [2024-06-27 18:38:48,856][06674] Fps is (10 sec: 40935.0, 60 sec: 43686.2, 300 sec: 43597.2). Total num frames: 872775680. Throughput: 0: 43718.5. Samples: 775700000. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-27 18:38:48,857][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:38:49,800][06909] Updated weights for policy 0, policy_version 53273 (0.0040) [2024-06-27 18:38:53,565][06909] Updated weights for policy 0, policy_version 53283 (0.0037) [2024-06-27 18:38:53,850][06674] Fps is (10 sec: 45876.2, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 873005056. Throughput: 0: 43466.1. Samples: 775957740. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-27 18:38:53,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 18:38:57,424][06909] Updated weights for policy 0, policy_version 53293 (0.0045) [2024-06-27 18:38:58,850][06674] Fps is (10 sec: 42624.7, 60 sec: 43690.8, 300 sec: 43542.6). Total num frames: 873201664. Throughput: 0: 43478.2. Samples: 776082240. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-27 18:38:58,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:39:01,330][06909] Updated weights for policy 0, policy_version 53303 (0.0028) [2024-06-27 18:39:03,855][06674] Fps is (10 sec: 42576.3, 60 sec: 43686.8, 300 sec: 43597.3). Total num frames: 873431040. Throughput: 0: 43493.5. Samples: 776342160. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-27 18:39:03,856][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:39:04,754][06909] Updated weights for policy 0, policy_version 53313 (0.0030) [2024-06-27 18:39:08,674][06909] Updated weights for policy 0, policy_version 53323 (0.0039) [2024-06-27 18:39:08,850][06674] Fps is (10 sec: 44235.8, 60 sec: 43417.4, 300 sec: 43599.0). Total num frames: 873644032. Throughput: 0: 43474.2. Samples: 776612040. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-27 18:39:08,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:39:12,455][06909] Updated weights for policy 0, policy_version 53333 (0.0034) [2024-06-27 18:39:13,850][06674] Fps is (10 sec: 40981.7, 60 sec: 43417.6, 300 sec: 43542.6). Total num frames: 873840640. Throughput: 0: 43333.5. Samples: 776738700. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-27 18:39:13,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:39:16,097][06909] Updated weights for policy 0, policy_version 53343 (0.0037) [2024-06-27 18:39:18,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.5, 300 sec: 43598.1). Total num frames: 874086400. Throughput: 0: 43222.9. Samples: 776994620. Policy #0 lag: (min: 1.0, avg: 12.2, max: 21.0) [2024-06-27 18:39:18,859][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:39:19,866][06909] Updated weights for policy 0, policy_version 53353 (0.0030) [2024-06-27 18:39:23,786][06909] Updated weights for policy 0, policy_version 53363 (0.0028) [2024-06-27 18:39:23,853][06674] Fps is (10 sec: 45860.2, 60 sec: 43142.3, 300 sec: 43597.6). Total num frames: 874299392. Throughput: 0: 43446.2. Samples: 777268100. Policy #0 lag: (min: 1.0, avg: 12.2, max: 21.0) [2024-06-27 18:39:23,854][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:39:27,471][06909] Updated weights for policy 0, policy_version 53373 (0.0031) [2024-06-27 18:39:28,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43145.9, 300 sec: 43487.0). Total num frames: 874496000. Throughput: 0: 43351.3. Samples: 777396260. Policy #0 lag: (min: 1.0, avg: 12.2, max: 21.0) [2024-06-27 18:39:28,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 18:39:31,050][06909] Updated weights for policy 0, policy_version 53383 (0.0027) [2024-06-27 18:39:31,308][06887] Signal inference workers to stop experience collection... (11050 times) [2024-06-27 18:39:31,308][06887] Signal inference workers to resume experience collection... (11050 times) [2024-06-27 18:39:31,346][06909] InferenceWorker_p0-w0: stopping experience collection (11050 times) [2024-06-27 18:39:31,346][06909] InferenceWorker_p0-w0: resuming experience collection (11050 times) [2024-06-27 18:39:33,852][06674] Fps is (10 sec: 44242.1, 60 sec: 43689.1, 300 sec: 43597.8). Total num frames: 874741760. Throughput: 0: 43500.4. Samples: 777657340. Policy #0 lag: (min: 1.0, avg: 12.2, max: 21.0) [2024-06-27 18:39:33,852][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:39:35,034][06909] Updated weights for policy 0, policy_version 53393 (0.0042) [2024-06-27 18:39:38,850][06674] Fps is (10 sec: 44237.4, 60 sec: 42871.5, 300 sec: 43542.6). Total num frames: 874938368. Throughput: 0: 43594.8. Samples: 777919500. Policy #0 lag: (min: 1.0, avg: 12.2, max: 21.0) [2024-06-27 18:39:38,851][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:39:38,966][06909] Updated weights for policy 0, policy_version 53403 (0.0034) [2024-06-27 18:39:42,402][06909] Updated weights for policy 0, policy_version 53413 (0.0026) [2024-06-27 18:39:43,850][06674] Fps is (10 sec: 40968.6, 60 sec: 43417.8, 300 sec: 43487.0). Total num frames: 875151360. Throughput: 0: 43582.7. Samples: 778043460. Policy #0 lag: (min: 1.0, avg: 12.2, max: 21.0) [2024-06-27 18:39:43,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:39:46,398][06909] Updated weights for policy 0, policy_version 53423 (0.0035) [2024-06-27 18:39:48,850][06674] Fps is (10 sec: 45874.5, 60 sec: 43695.1, 300 sec: 43598.1). Total num frames: 875397120. Throughput: 0: 43583.7. Samples: 778303200. Policy #0 lag: (min: 1.0, avg: 12.2, max: 21.0) [2024-06-27 18:39:48,850][06674] Avg episode reward: [(0, '0.396')] [2024-06-27 18:39:48,855][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000053430_875397120.pth... [2024-06-27 18:39:48,917][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000052792_864944128.pth [2024-06-27 18:39:50,234][06909] Updated weights for policy 0, policy_version 53433 (0.0036) [2024-06-27 18:39:53,827][06909] Updated weights for policy 0, policy_version 53443 (0.0027) [2024-06-27 18:39:53,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43417.7, 300 sec: 43598.1). Total num frames: 875610112. Throughput: 0: 43598.9. Samples: 778573980. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-27 18:39:53,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:39:57,668][06909] Updated weights for policy 0, policy_version 53453 (0.0027) [2024-06-27 18:39:58,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43417.5, 300 sec: 43542.5). Total num frames: 875806720. Throughput: 0: 43581.3. Samples: 778699860. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-27 18:39:58,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:40:01,256][06909] Updated weights for policy 0, policy_version 53463 (0.0038) [2024-06-27 18:40:03,852][06674] Fps is (10 sec: 44227.3, 60 sec: 43693.0, 300 sec: 43597.8). Total num frames: 876052480. Throughput: 0: 43579.9. Samples: 778955800. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-27 18:40:03,853][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:40:05,085][06909] Updated weights for policy 0, policy_version 53473 (0.0035) [2024-06-27 18:40:08,738][06909] Updated weights for policy 0, policy_version 53483 (0.0029) [2024-06-27 18:40:08,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43690.8, 300 sec: 43598.1). Total num frames: 876265472. Throughput: 0: 43590.3. Samples: 779229520. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-27 18:40:08,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:40:12,378][06909] Updated weights for policy 0, policy_version 53493 (0.0027) [2024-06-27 18:40:13,850][06674] Fps is (10 sec: 40968.4, 60 sec: 43690.6, 300 sec: 43487.0). Total num frames: 876462080. Throughput: 0: 43639.1. Samples: 779360020. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-27 18:40:13,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:40:16,251][06909] Updated weights for policy 0, policy_version 53503 (0.0028) [2024-06-27 18:40:18,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.8, 300 sec: 43653.7). Total num frames: 876707840. Throughput: 0: 43510.9. Samples: 779615240. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-27 18:40:18,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:40:20,192][06909] Updated weights for policy 0, policy_version 53513 (0.0037) [2024-06-27 18:40:23,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43419.9, 300 sec: 43598.1). Total num frames: 876904448. Throughput: 0: 43552.8. Samples: 779879380. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-27 18:40:23,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:40:23,866][06909] Updated weights for policy 0, policy_version 53523 (0.0031) [2024-06-27 18:40:27,478][06909] Updated weights for policy 0, policy_version 53533 (0.0035) [2024-06-27 18:40:28,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43963.7, 300 sec: 43598.1). Total num frames: 877133824. Throughput: 0: 43570.5. Samples: 780004140. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-27 18:40:28,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:40:31,406][06909] Updated weights for policy 0, policy_version 53543 (0.0041) [2024-06-27 18:40:33,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43692.2, 300 sec: 43598.4). Total num frames: 877363200. Throughput: 0: 43727.2. Samples: 780270920. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-27 18:40:33,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:40:35,303][06909] Updated weights for policy 0, policy_version 53553 (0.0041) [2024-06-27 18:40:38,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 877559808. Throughput: 0: 43447.5. Samples: 780529120. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-27 18:40:38,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:40:39,140][06909] Updated weights for policy 0, policy_version 53563 (0.0030) [2024-06-27 18:40:42,995][06909] Updated weights for policy 0, policy_version 53573 (0.0033) [2024-06-27 18:40:43,852][06674] Fps is (10 sec: 40951.3, 60 sec: 43689.1, 300 sec: 43542.3). Total num frames: 877772800. Throughput: 0: 43376.3. Samples: 780651880. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-27 18:40:43,852][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:40:46,530][06909] Updated weights for policy 0, policy_version 53583 (0.0040) [2024-06-27 18:40:48,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 878018560. Throughput: 0: 43522.3. Samples: 780914220. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-27 18:40:48,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:40:50,331][06909] Updated weights for policy 0, policy_version 53593 (0.0032) [2024-06-27 18:40:53,850][06674] Fps is (10 sec: 42607.3, 60 sec: 43144.5, 300 sec: 43487.2). Total num frames: 878198784. Throughput: 0: 43280.9. Samples: 781177160. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-27 18:40:53,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:40:53,996][06909] Updated weights for policy 0, policy_version 53603 (0.0026) [2024-06-27 18:40:58,331][06909] Updated weights for policy 0, policy_version 53613 (0.0033) [2024-06-27 18:40:58,850][06674] Fps is (10 sec: 37683.7, 60 sec: 43144.5, 300 sec: 43376.2). Total num frames: 878395392. Throughput: 0: 43124.9. Samples: 781300640. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-27 18:40:58,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:41:01,862][06909] Updated weights for policy 0, policy_version 53623 (0.0032) [2024-06-27 18:41:03,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43419.0, 300 sec: 43599.0). Total num frames: 878657536. Throughput: 0: 43244.3. Samples: 781561240. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 18:41:03,851][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:41:05,616][06909] Updated weights for policy 0, policy_version 53633 (0.0034) [2024-06-27 18:41:08,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43144.4, 300 sec: 43487.0). Total num frames: 878854144. Throughput: 0: 43327.0. Samples: 781829100. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 18:41:08,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:41:09,226][06909] Updated weights for policy 0, policy_version 53643 (0.0035) [2024-06-27 18:41:13,019][06909] Updated weights for policy 0, policy_version 53653 (0.0039) [2024-06-27 18:41:13,850][06674] Fps is (10 sec: 40960.7, 60 sec: 43417.7, 300 sec: 43431.5). Total num frames: 879067136. Throughput: 0: 43345.9. Samples: 781954700. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 18:41:13,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:41:15,271][06887] Signal inference workers to stop experience collection... (11100 times) [2024-06-27 18:41:15,306][06909] InferenceWorker_p0-w0: stopping experience collection (11100 times) [2024-06-27 18:41:15,331][06887] Signal inference workers to resume experience collection... (11100 times) [2024-06-27 18:41:15,332][06909] InferenceWorker_p0-w0: resuming experience collection (11100 times) [2024-06-27 18:41:16,776][06909] Updated weights for policy 0, policy_version 53663 (0.0037) [2024-06-27 18:41:18,850][06674] Fps is (10 sec: 47514.4, 60 sec: 43690.7, 300 sec: 43653.7). Total num frames: 879329280. Throughput: 0: 43334.6. Samples: 782220980. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 18:41:18,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:41:20,648][06909] Updated weights for policy 0, policy_version 53673 (0.0031) [2024-06-27 18:41:23,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43417.6, 300 sec: 43487.0). Total num frames: 879509504. Throughput: 0: 43467.1. Samples: 782485140. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 18:41:23,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:41:24,442][06909] Updated weights for policy 0, policy_version 53683 (0.0026) [2024-06-27 18:41:27,923][06909] Updated weights for policy 0, policy_version 53693 (0.0031) [2024-06-27 18:41:28,852][06674] Fps is (10 sec: 39313.3, 60 sec: 43143.1, 300 sec: 43486.7). Total num frames: 879722496. Throughput: 0: 43519.1. Samples: 782610240. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 18:41:28,853][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 18:41:31,771][06909] Updated weights for policy 0, policy_version 53703 (0.0034) [2024-06-27 18:41:33,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 879968256. Throughput: 0: 43668.2. Samples: 782879280. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 18:41:33,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:41:35,691][06909] Updated weights for policy 0, policy_version 53713 (0.0022) [2024-06-27 18:41:38,850][06674] Fps is (10 sec: 44246.1, 60 sec: 43417.7, 300 sec: 43487.0). Total num frames: 880164864. Throughput: 0: 43697.8. Samples: 783143560. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 18:41:38,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:41:39,357][06909] Updated weights for policy 0, policy_version 53723 (0.0034) [2024-06-27 18:41:43,202][06909] Updated weights for policy 0, policy_version 53733 (0.0027) [2024-06-27 18:41:43,850][06674] Fps is (10 sec: 39321.6, 60 sec: 43146.0, 300 sec: 43431.5). Total num frames: 880361472. Throughput: 0: 43672.5. Samples: 783265900. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 18:41:43,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:41:46,722][06909] Updated weights for policy 0, policy_version 53743 (0.0028) [2024-06-27 18:41:48,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43144.6, 300 sec: 43542.9). Total num frames: 880607232. Throughput: 0: 43700.5. Samples: 783527760. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 18:41:48,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:41:48,894][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000053749_880623616.pth... [2024-06-27 18:41:48,941][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000053111_870170624.pth [2024-06-27 18:41:50,454][06909] Updated weights for policy 0, policy_version 53753 (0.0030) [2024-06-27 18:41:53,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43690.7, 300 sec: 43487.3). Total num frames: 880820224. Throughput: 0: 43620.6. Samples: 783792020. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 18:41:53,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:41:54,143][06909] Updated weights for policy 0, policy_version 53763 (0.0020) [2024-06-27 18:41:58,339][06909] Updated weights for policy 0, policy_version 53773 (0.0032) [2024-06-27 18:41:58,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.8, 300 sec: 43487.0). Total num frames: 881033216. Throughput: 0: 43673.7. Samples: 783920020. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 18:41:58,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:42:01,911][06909] Updated weights for policy 0, policy_version 53783 (0.0041) [2024-06-27 18:42:03,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43417.7, 300 sec: 43487.0). Total num frames: 881262592. Throughput: 0: 43498.6. Samples: 784178420. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 18:42:03,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:42:05,724][06909] Updated weights for policy 0, policy_version 53793 (0.0030) [2024-06-27 18:42:08,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.8, 300 sec: 43542.6). Total num frames: 881475584. Throughput: 0: 43623.7. Samples: 784448200. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 18:42:08,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:42:09,240][06909] Updated weights for policy 0, policy_version 53803 (0.0038) [2024-06-27 18:42:13,066][06909] Updated weights for policy 0, policy_version 53813 (0.0034) [2024-06-27 18:42:13,850][06674] Fps is (10 sec: 40959.4, 60 sec: 43417.4, 300 sec: 43487.0). Total num frames: 881672192. Throughput: 0: 43786.3. Samples: 784580540. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 18:42:13,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:42:16,559][06909] Updated weights for policy 0, policy_version 53823 (0.0020) [2024-06-27 18:42:18,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43417.6, 300 sec: 43598.4). Total num frames: 881934336. Throughput: 0: 43601.8. Samples: 784841360. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 18:42:18,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:42:20,859][06909] Updated weights for policy 0, policy_version 53833 (0.0034) [2024-06-27 18:42:23,850][06674] Fps is (10 sec: 47514.1, 60 sec: 43963.7, 300 sec: 43598.1). Total num frames: 882147328. Throughput: 0: 43386.6. Samples: 785095960. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 18:42:23,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:42:24,229][06909] Updated weights for policy 0, policy_version 53843 (0.0029) [2024-06-27 18:42:28,495][06909] Updated weights for policy 0, policy_version 53853 (0.0033) [2024-06-27 18:42:28,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43692.1, 300 sec: 43542.6). Total num frames: 882343936. Throughput: 0: 43651.1. Samples: 785230200. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 18:42:28,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 18:42:31,711][06909] Updated weights for policy 0, policy_version 53863 (0.0031) [2024-06-27 18:42:33,856][06674] Fps is (10 sec: 42572.6, 60 sec: 43413.2, 300 sec: 43597.5). Total num frames: 882573312. Throughput: 0: 43587.9. Samples: 785489480. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 18:42:33,857][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:42:35,830][06909] Updated weights for policy 0, policy_version 53873 (0.0038) [2024-06-27 18:42:38,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.6, 300 sec: 43599.0). Total num frames: 882786304. Throughput: 0: 43562.9. Samples: 785752360. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 18:42:38,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:42:39,603][06909] Updated weights for policy 0, policy_version 53883 (0.0032) [2024-06-27 18:42:43,519][06909] Updated weights for policy 0, policy_version 53893 (0.0041) [2024-06-27 18:42:43,850][06674] Fps is (10 sec: 40985.5, 60 sec: 43690.7, 300 sec: 43487.0). Total num frames: 882982912. Throughput: 0: 43572.1. Samples: 785880760. Policy #0 lag: (min: 1.0, avg: 11.6, max: 21.0) [2024-06-27 18:42:43,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:42:47,077][06909] Updated weights for policy 0, policy_version 53903 (0.0040) [2024-06-27 18:42:47,582][06887] Signal inference workers to stop experience collection... (11150 times) [2024-06-27 18:42:47,643][06909] InferenceWorker_p0-w0: stopping experience collection (11150 times) [2024-06-27 18:42:47,649][06887] Signal inference workers to resume experience collection... (11150 times) [2024-06-27 18:42:47,660][06909] InferenceWorker_p0-w0: resuming experience collection (11150 times) [2024-06-27 18:42:48,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.7, 300 sec: 43542.6). Total num frames: 883228672. Throughput: 0: 43768.8. Samples: 786148020. Policy #0 lag: (min: 1.0, avg: 11.6, max: 21.0) [2024-06-27 18:42:48,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:42:50,822][06909] Updated weights for policy 0, policy_version 53913 (0.0030) [2024-06-27 18:42:53,850][06674] Fps is (10 sec: 47512.8, 60 sec: 43963.6, 300 sec: 43653.6). Total num frames: 883458048. Throughput: 0: 43609.7. Samples: 786410640. Policy #0 lag: (min: 1.0, avg: 11.6, max: 21.0) [2024-06-27 18:42:53,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:42:54,361][06909] Updated weights for policy 0, policy_version 53923 (0.0033) [2024-06-27 18:42:58,395][06909] Updated weights for policy 0, policy_version 53933 (0.0047) [2024-06-27 18:42:58,852][06674] Fps is (10 sec: 42589.6, 60 sec: 43689.1, 300 sec: 43542.3). Total num frames: 883654656. Throughput: 0: 43542.6. Samples: 786540040. Policy #0 lag: (min: 1.0, avg: 11.6, max: 21.0) [2024-06-27 18:42:58,852][06674] Avg episode reward: [(0, '0.409')] [2024-06-27 18:43:01,716][06909] Updated weights for policy 0, policy_version 53943 (0.0054) [2024-06-27 18:43:03,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43690.7, 300 sec: 43542.6). Total num frames: 883884032. Throughput: 0: 43569.8. Samples: 786802000. Policy #0 lag: (min: 1.0, avg: 11.6, max: 21.0) [2024-06-27 18:43:03,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:43:06,144][06909] Updated weights for policy 0, policy_version 53953 (0.0033) [2024-06-27 18:43:08,850][06674] Fps is (10 sec: 45884.2, 60 sec: 43963.6, 300 sec: 43653.6). Total num frames: 884113408. Throughput: 0: 43607.5. Samples: 787058300. Policy #0 lag: (min: 1.0, avg: 11.6, max: 21.0) [2024-06-27 18:43:08,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:43:09,503][06909] Updated weights for policy 0, policy_version 53963 (0.0032) [2024-06-27 18:43:13,462][06909] Updated weights for policy 0, policy_version 53973 (0.0040) [2024-06-27 18:43:13,850][06674] Fps is (10 sec: 44236.1, 60 sec: 44236.9, 300 sec: 43598.1). Total num frames: 884326400. Throughput: 0: 43565.3. Samples: 787190640. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-27 18:43:13,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:43:16,866][06909] Updated weights for policy 0, policy_version 53983 (0.0041) [2024-06-27 18:43:18,852][06674] Fps is (10 sec: 42590.2, 60 sec: 43416.1, 300 sec: 43486.7). Total num frames: 884539392. Throughput: 0: 43640.4. Samples: 787453120. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-27 18:43:18,852][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:43:20,884][06909] Updated weights for policy 0, policy_version 53993 (0.0035) [2024-06-27 18:43:23,852][06674] Fps is (10 sec: 44228.1, 60 sec: 43689.2, 300 sec: 43598.1). Total num frames: 884768768. Throughput: 0: 43488.8. Samples: 787709440. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-27 18:43:23,853][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:43:24,645][06909] Updated weights for policy 0, policy_version 54003 (0.0029) [2024-06-27 18:43:28,402][06909] Updated weights for policy 0, policy_version 54013 (0.0034) [2024-06-27 18:43:28,850][06674] Fps is (10 sec: 40968.7, 60 sec: 43417.7, 300 sec: 43487.0). Total num frames: 884948992. Throughput: 0: 43511.5. Samples: 787838780. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-27 18:43:28,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:43:32,188][06909] Updated weights for policy 0, policy_version 54023 (0.0035) [2024-06-27 18:43:33,850][06674] Fps is (10 sec: 40968.1, 60 sec: 43422.0, 300 sec: 43431.5). Total num frames: 885178368. Throughput: 0: 43548.4. Samples: 788107700. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-27 18:43:33,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:43:35,891][06909] Updated weights for policy 0, policy_version 54033 (0.0038) [2024-06-27 18:43:38,850][06674] Fps is (10 sec: 47512.8, 60 sec: 43963.7, 300 sec: 43653.7). Total num frames: 885424128. Throughput: 0: 43464.4. Samples: 788366540. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-27 18:43:38,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:43:39,758][06909] Updated weights for policy 0, policy_version 54043 (0.0026) [2024-06-27 18:43:43,659][06909] Updated weights for policy 0, policy_version 54053 (0.0030) [2024-06-27 18:43:43,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.6, 300 sec: 43487.9). Total num frames: 885604352. Throughput: 0: 43426.5. Samples: 788494140. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-27 18:43:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 18:43:47,603][06909] Updated weights for policy 0, policy_version 54063 (0.0023) [2024-06-27 18:43:48,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43690.7, 300 sec: 43542.6). Total num frames: 885850112. Throughput: 0: 43437.7. Samples: 788756700. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 18:43:48,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 18:43:48,862][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000054068_885850112.pth... [2024-06-27 18:43:48,912][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000053430_875397120.pth [2024-06-27 18:43:50,989][06909] Updated weights for policy 0, policy_version 54073 (0.0029) [2024-06-27 18:43:53,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43144.5, 300 sec: 43542.5). Total num frames: 886046720. Throughput: 0: 43468.0. Samples: 789014360. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 18:43:53,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:43:55,088][06909] Updated weights for policy 0, policy_version 54083 (0.0027) [2024-06-27 18:43:58,570][06909] Updated weights for policy 0, policy_version 54093 (0.0031) [2024-06-27 18:43:58,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43419.1, 300 sec: 43487.8). Total num frames: 886259712. Throughput: 0: 43376.5. Samples: 789142580. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 18:43:58,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:44:02,963][06909] Updated weights for policy 0, policy_version 54103 (0.0039) [2024-06-27 18:44:03,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43417.6, 300 sec: 43542.6). Total num frames: 886489088. Throughput: 0: 43486.5. Samples: 789409920. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 18:44:03,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:44:05,933][06909] Updated weights for policy 0, policy_version 54113 (0.0037) [2024-06-27 18:44:07,233][06887] Signal inference workers to stop experience collection... (11200 times) [2024-06-27 18:44:07,262][06909] InferenceWorker_p0-w0: stopping experience collection (11200 times) [2024-06-27 18:44:07,289][06887] Signal inference workers to resume experience collection... (11200 times) [2024-06-27 18:44:07,290][06909] InferenceWorker_p0-w0: resuming experience collection (11200 times) [2024-06-27 18:44:08,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43417.7, 300 sec: 43653.6). Total num frames: 886718464. Throughput: 0: 43390.9. Samples: 789661940. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 18:44:08,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:44:10,243][06909] Updated weights for policy 0, policy_version 54123 (0.0032) [2024-06-27 18:44:13,521][06909] Updated weights for policy 0, policy_version 54133 (0.0028) [2024-06-27 18:44:13,852][06674] Fps is (10 sec: 42589.2, 60 sec: 43143.1, 300 sec: 43486.7). Total num frames: 886915072. Throughput: 0: 43449.9. Samples: 789794120. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 18:44:13,853][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:44:17,606][06909] Updated weights for policy 0, policy_version 54143 (0.0036) [2024-06-27 18:44:18,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43419.1, 300 sec: 43543.0). Total num frames: 887144448. Throughput: 0: 43500.1. Samples: 790065200. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 18:44:18,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 18:44:21,299][06909] Updated weights for policy 0, policy_version 54153 (0.0034) [2024-06-27 18:44:23,850][06674] Fps is (10 sec: 44246.5, 60 sec: 43146.0, 300 sec: 43598.1). Total num frames: 887357440. Throughput: 0: 43397.9. Samples: 790319440. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2024-06-27 18:44:23,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:44:24,921][06909] Updated weights for policy 0, policy_version 54163 (0.0028) [2024-06-27 18:44:28,799][06909] Updated weights for policy 0, policy_version 54173 (0.0027) [2024-06-27 18:44:28,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43690.6, 300 sec: 43487.3). Total num frames: 887570432. Throughput: 0: 43464.8. Samples: 790450060. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2024-06-27 18:44:28,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:44:32,686][06909] Updated weights for policy 0, policy_version 54183 (0.0039) [2024-06-27 18:44:33,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 887799808. Throughput: 0: 43526.7. Samples: 790715400. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2024-06-27 18:44:33,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:44:36,320][06909] Updated weights for policy 0, policy_version 54193 (0.0035) [2024-06-27 18:44:38,852][06674] Fps is (10 sec: 44228.1, 60 sec: 43143.1, 300 sec: 43597.8). Total num frames: 888012800. Throughput: 0: 43518.5. Samples: 790972780. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2024-06-27 18:44:38,852][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:44:40,044][06909] Updated weights for policy 0, policy_version 54203 (0.0030) [2024-06-27 18:44:43,676][06909] Updated weights for policy 0, policy_version 54213 (0.0037) [2024-06-27 18:44:43,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43690.6, 300 sec: 43487.0). Total num frames: 888225792. Throughput: 0: 43631.5. Samples: 791106000. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2024-06-27 18:44:43,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:44:47,432][06909] Updated weights for policy 0, policy_version 54223 (0.0033) [2024-06-27 18:44:48,850][06674] Fps is (10 sec: 42607.5, 60 sec: 43144.6, 300 sec: 43487.0). Total num frames: 888438784. Throughput: 0: 43573.8. Samples: 791370740. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2024-06-27 18:44:48,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:44:51,327][06909] Updated weights for policy 0, policy_version 54233 (0.0033) [2024-06-27 18:44:53,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 888668160. Throughput: 0: 43465.8. Samples: 791617900. Policy #0 lag: (min: 0.0, avg: 8.8, max: 19.0) [2024-06-27 18:44:53,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:44:55,518][06909] Updated weights for policy 0, policy_version 54243 (0.0038) [2024-06-27 18:44:58,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43417.6, 300 sec: 43431.8). Total num frames: 888864768. Throughput: 0: 43614.9. Samples: 791756700. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 18:44:58,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:44:59,249][06909] Updated weights for policy 0, policy_version 54253 (0.0026) [2024-06-27 18:45:02,846][06909] Updated weights for policy 0, policy_version 54263 (0.0036) [2024-06-27 18:45:03,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43144.5, 300 sec: 43431.5). Total num frames: 889077760. Throughput: 0: 43348.8. Samples: 792015900. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 18:45:03,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:45:06,830][06909] Updated weights for policy 0, policy_version 54273 (0.0029) [2024-06-27 18:45:08,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43417.5, 300 sec: 43598.1). Total num frames: 889323520. Throughput: 0: 43130.9. Samples: 792260340. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 18:45:08,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:45:10,824][06909] Updated weights for policy 0, policy_version 54283 (0.0030) [2024-06-27 18:45:13,852][06674] Fps is (10 sec: 42589.7, 60 sec: 43144.6, 300 sec: 43375.6). Total num frames: 889503744. Throughput: 0: 43314.1. Samples: 792399280. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 18:45:13,852][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:45:14,294][06909] Updated weights for policy 0, policy_version 54293 (0.0026) [2024-06-27 18:45:15,513][06887] Signal inference workers to stop experience collection... (11250 times) [2024-06-27 18:45:15,567][06887] Signal inference workers to resume experience collection... (11250 times) [2024-06-27 18:45:15,568][06909] InferenceWorker_p0-w0: stopping experience collection (11250 times) [2024-06-27 18:45:15,585][06909] InferenceWorker_p0-w0: resuming experience collection (11250 times) [2024-06-27 18:45:18,199][06909] Updated weights for policy 0, policy_version 54303 (0.0026) [2024-06-27 18:45:18,850][06674] Fps is (10 sec: 40960.7, 60 sec: 43144.5, 300 sec: 43487.0). Total num frames: 889733120. Throughput: 0: 43197.8. Samples: 792659300. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 18:45:18,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:45:21,832][06909] Updated weights for policy 0, policy_version 54313 (0.0037) [2024-06-27 18:45:23,850][06674] Fps is (10 sec: 47523.2, 60 sec: 43690.6, 300 sec: 43542.6). Total num frames: 889978880. Throughput: 0: 43144.2. Samples: 792914180. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 18:45:23,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:45:25,690][06909] Updated weights for policy 0, policy_version 54323 (0.0039) [2024-06-27 18:45:28,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43144.7, 300 sec: 43375.9). Total num frames: 890159104. Throughput: 0: 43244.1. Samples: 793051980. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 18:45:28,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:45:29,421][06909] Updated weights for policy 0, policy_version 54333 (0.0049) [2024-06-27 18:45:33,260][06909] Updated weights for policy 0, policy_version 54343 (0.0039) [2024-06-27 18:45:33,850][06674] Fps is (10 sec: 37682.7, 60 sec: 42598.3, 300 sec: 43375.9). Total num frames: 890355712. Throughput: 0: 43010.9. Samples: 793306240. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-27 18:45:33,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:45:36,987][06909] Updated weights for policy 0, policy_version 54353 (0.0028) [2024-06-27 18:45:38,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43419.1, 300 sec: 43542.9). Total num frames: 890617856. Throughput: 0: 43203.6. Samples: 793562060. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-27 18:45:38,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:45:40,707][06909] Updated weights for policy 0, policy_version 54363 (0.0028) [2024-06-27 18:45:43,852][06674] Fps is (10 sec: 44228.4, 60 sec: 42870.1, 300 sec: 43320.1). Total num frames: 890798080. Throughput: 0: 43222.5. Samples: 793701800. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-27 18:45:43,852][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:45:44,604][06909] Updated weights for policy 0, policy_version 54373 (0.0033) [2024-06-27 18:45:48,223][06909] Updated weights for policy 0, policy_version 54383 (0.0042) [2024-06-27 18:45:48,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43144.5, 300 sec: 43487.0). Total num frames: 891027456. Throughput: 0: 43040.9. Samples: 793952740. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-27 18:45:48,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:45:48,884][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000054385_891043840.pth... [2024-06-27 18:45:48,934][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000053749_880623616.pth [2024-06-27 18:45:52,184][06909] Updated weights for policy 0, policy_version 54393 (0.0043) [2024-06-27 18:45:53,850][06674] Fps is (10 sec: 45884.9, 60 sec: 43144.6, 300 sec: 43598.1). Total num frames: 891256832. Throughput: 0: 43323.3. Samples: 794209880. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-27 18:45:53,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:45:55,946][06909] Updated weights for policy 0, policy_version 54403 (0.0030) [2024-06-27 18:45:58,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43144.5, 300 sec: 43376.0). Total num frames: 891453440. Throughput: 0: 43280.1. Samples: 794346800. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-27 18:45:58,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:45:59,498][06909] Updated weights for policy 0, policy_version 54413 (0.0037) [2024-06-27 18:46:03,444][06909] Updated weights for policy 0, policy_version 54423 (0.0042) [2024-06-27 18:46:03,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43417.6, 300 sec: 43487.0). Total num frames: 891682816. Throughput: 0: 43394.1. Samples: 794612040. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-27 18:46:03,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 18:46:07,158][06909] Updated weights for policy 0, policy_version 54433 (0.0032) [2024-06-27 18:46:08,850][06674] Fps is (10 sec: 47513.3, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 891928576. Throughput: 0: 43378.6. Samples: 794866220. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-27 18:46:08,851][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:46:10,975][06909] Updated weights for policy 0, policy_version 54443 (0.0033) [2024-06-27 18:46:13,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43692.2, 300 sec: 43375.9). Total num frames: 892125184. Throughput: 0: 43346.2. Samples: 795002560. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-27 18:46:13,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:46:14,652][06909] Updated weights for policy 0, policy_version 54453 (0.0030) [2024-06-27 18:46:18,552][06909] Updated weights for policy 0, policy_version 54463 (0.0030) [2024-06-27 18:46:18,850][06674] Fps is (10 sec: 39322.3, 60 sec: 43144.5, 300 sec: 43431.5). Total num frames: 892321792. Throughput: 0: 43484.2. Samples: 795263020. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-27 18:46:18,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:46:22,065][06909] Updated weights for policy 0, policy_version 54473 (0.0030) [2024-06-27 18:46:23,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43417.6, 300 sec: 43598.4). Total num frames: 892583936. Throughput: 0: 43415.5. Samples: 795515760. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-27 18:46:23,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:46:26,083][06909] Updated weights for policy 0, policy_version 54483 (0.0046) [2024-06-27 18:46:28,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43417.5, 300 sec: 43375.9). Total num frames: 892764160. Throughput: 0: 43285.5. Samples: 795649560. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-27 18:46:28,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:46:29,705][06909] Updated weights for policy 0, policy_version 54493 (0.0023) [2024-06-27 18:46:33,513][06909] Updated weights for policy 0, policy_version 54503 (0.0029) [2024-06-27 18:46:33,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43963.8, 300 sec: 43487.0). Total num frames: 892993536. Throughput: 0: 43437.7. Samples: 795907440. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-27 18:46:33,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:46:35,670][06887] Signal inference workers to stop experience collection... (11300 times) [2024-06-27 18:46:35,702][06909] InferenceWorker_p0-w0: stopping experience collection (11300 times) [2024-06-27 18:46:35,731][06887] Signal inference workers to resume experience collection... (11300 times) [2024-06-27 18:46:35,731][06909] InferenceWorker_p0-w0: resuming experience collection (11300 times) [2024-06-27 18:46:37,066][06909] Updated weights for policy 0, policy_version 54513 (0.0039) [2024-06-27 18:46:38,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43417.5, 300 sec: 43598.1). Total num frames: 893222912. Throughput: 0: 43492.7. Samples: 796167060. Policy #0 lag: (min: 0.0, avg: 11.8, max: 25.0) [2024-06-27 18:46:38,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:46:40,984][06909] Updated weights for policy 0, policy_version 54523 (0.0030) [2024-06-27 18:46:43,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43692.2, 300 sec: 43431.5). Total num frames: 893419520. Throughput: 0: 43506.8. Samples: 796304600. Policy #0 lag: (min: 0.0, avg: 11.8, max: 25.0) [2024-06-27 18:46:43,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:46:44,550][06909] Updated weights for policy 0, policy_version 54533 (0.0030) [2024-06-27 18:46:48,550][06909] Updated weights for policy 0, policy_version 54543 (0.0038) [2024-06-27 18:46:48,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 893632512. Throughput: 0: 43410.7. Samples: 796565520. Policy #0 lag: (min: 0.0, avg: 11.8, max: 25.0) [2024-06-27 18:46:48,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:46:52,183][06909] Updated weights for policy 0, policy_version 54553 (0.0035) [2024-06-27 18:46:53,852][06674] Fps is (10 sec: 45865.4, 60 sec: 43689.1, 300 sec: 43542.3). Total num frames: 893878272. Throughput: 0: 43445.6. Samples: 796821360. Policy #0 lag: (min: 0.0, avg: 11.8, max: 25.0) [2024-06-27 18:46:53,852][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 18:46:56,282][06909] Updated weights for policy 0, policy_version 54563 (0.0039) [2024-06-27 18:46:58,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.7, 300 sec: 43431.5). Total num frames: 894074880. Throughput: 0: 43507.1. Samples: 796960380. Policy #0 lag: (min: 0.0, avg: 11.8, max: 25.0) [2024-06-27 18:46:58,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 18:46:59,451][06909] Updated weights for policy 0, policy_version 54573 (0.0032) [2024-06-27 18:47:03,666][06909] Updated weights for policy 0, policy_version 54583 (0.0035) [2024-06-27 18:47:03,850][06674] Fps is (10 sec: 42607.4, 60 sec: 43690.7, 300 sec: 43487.0). Total num frames: 894304256. Throughput: 0: 43582.2. Samples: 797224220. Policy #0 lag: (min: 0.0, avg: 11.8, max: 25.0) [2024-06-27 18:47:03,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:47:06,746][06909] Updated weights for policy 0, policy_version 54593 (0.0027) [2024-06-27 18:47:08,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43417.7, 300 sec: 43598.1). Total num frames: 894533632. Throughput: 0: 43572.9. Samples: 797476540. Policy #0 lag: (min: 0.0, avg: 11.8, max: 25.0) [2024-06-27 18:47:08,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 18:47:11,140][06909] Updated weights for policy 0, policy_version 54603 (0.0030) [2024-06-27 18:47:13,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43417.5, 300 sec: 43375.9). Total num frames: 894730240. Throughput: 0: 43697.8. Samples: 797615960. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 18:47:13,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:47:14,325][06909] Updated weights for policy 0, policy_version 54613 (0.0035) [2024-06-27 18:47:18,499][06909] Updated weights for policy 0, policy_version 54623 (0.0034) [2024-06-27 18:47:18,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43690.5, 300 sec: 43375.9). Total num frames: 894943232. Throughput: 0: 43756.0. Samples: 797876460. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 18:47:18,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:47:21,851][06909] Updated weights for policy 0, policy_version 54633 (0.0025) [2024-06-27 18:47:23,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43144.5, 300 sec: 43487.0). Total num frames: 895172608. Throughput: 0: 43618.7. Samples: 798129900. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 18:47:23,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:47:26,241][06909] Updated weights for policy 0, policy_version 54643 (0.0024) [2024-06-27 18:47:28,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.6, 300 sec: 43432.4). Total num frames: 895385600. Throughput: 0: 43657.6. Samples: 798269200. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 18:47:28,851][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:47:29,321][06909] Updated weights for policy 0, policy_version 54653 (0.0035) [2024-06-27 18:47:33,798][06909] Updated weights for policy 0, policy_version 54663 (0.0030) [2024-06-27 18:47:33,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 895598592. Throughput: 0: 43582.1. Samples: 798526720. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 18:47:33,856][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:47:37,008][06909] Updated weights for policy 0, policy_version 54673 (0.0025) [2024-06-27 18:47:38,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43417.7, 300 sec: 43542.5). Total num frames: 895827968. Throughput: 0: 43562.1. Samples: 798781560. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 18:47:38,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 18:47:41,595][06909] Updated weights for policy 0, policy_version 54683 (0.0033) [2024-06-27 18:47:43,628][06887] Signal inference workers to stop experience collection... (11350 times) [2024-06-27 18:47:43,629][06887] Signal inference workers to resume experience collection... (11350 times) [2024-06-27 18:47:43,676][06909] InferenceWorker_p0-w0: stopping experience collection (11350 times) [2024-06-27 18:47:43,676][06909] InferenceWorker_p0-w0: resuming experience collection (11350 times) [2024-06-27 18:47:43,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43690.6, 300 sec: 43431.5). Total num frames: 896040960. Throughput: 0: 43556.0. Samples: 798920400. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 18:47:43,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:47:44,377][06909] Updated weights for policy 0, policy_version 54693 (0.0042) [2024-06-27 18:47:48,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43417.6, 300 sec: 43320.4). Total num frames: 896237568. Throughput: 0: 43451.5. Samples: 799179540. Policy #0 lag: (min: 0.0, avg: 12.6, max: 28.0) [2024-06-27 18:47:48,850][06674] Avg episode reward: [(0, '0.473')] [2024-06-27 18:47:48,859][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000054702_896237568.pth... [2024-06-27 18:47:48,923][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000054068_885850112.pth [2024-06-27 18:47:48,927][06887] Saving new best policy, reward=0.473! [2024-06-27 18:47:49,171][06909] Updated weights for policy 0, policy_version 54703 (0.0037) [2024-06-27 18:47:52,067][06909] Updated weights for policy 0, policy_version 54713 (0.0025) [2024-06-27 18:47:53,856][06674] Fps is (10 sec: 44210.0, 60 sec: 43414.7, 300 sec: 43486.4). Total num frames: 896483328. Throughput: 0: 43347.1. Samples: 799427420. Policy #0 lag: (min: 0.0, avg: 12.6, max: 28.0) [2024-06-27 18:47:53,856][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:47:56,505][06909] Updated weights for policy 0, policy_version 54723 (0.0027) [2024-06-27 18:47:58,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43144.6, 300 sec: 43320.4). Total num frames: 896663552. Throughput: 0: 43366.8. Samples: 799567460. Policy #0 lag: (min: 0.0, avg: 12.6, max: 28.0) [2024-06-27 18:47:58,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:47:59,533][06909] Updated weights for policy 0, policy_version 54733 (0.0042) [2024-06-27 18:48:03,850][06674] Fps is (10 sec: 40984.8, 60 sec: 43144.5, 300 sec: 43320.4). Total num frames: 896892928. Throughput: 0: 43346.7. Samples: 799827060. Policy #0 lag: (min: 0.0, avg: 12.6, max: 28.0) [2024-06-27 18:48:03,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:48:03,939][06909] Updated weights for policy 0, policy_version 54743 (0.0033) [2024-06-27 18:48:07,146][06909] Updated weights for policy 0, policy_version 54753 (0.0040) [2024-06-27 18:48:08,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43144.6, 300 sec: 43376.0). Total num frames: 897122304. Throughput: 0: 43418.8. Samples: 800083740. Policy #0 lag: (min: 0.0, avg: 12.6, max: 28.0) [2024-06-27 18:48:08,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:48:11,667][06909] Updated weights for policy 0, policy_version 54763 (0.0030) [2024-06-27 18:48:13,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43417.7, 300 sec: 43376.3). Total num frames: 897335296. Throughput: 0: 43412.2. Samples: 800222740. Policy #0 lag: (min: 0.0, avg: 12.6, max: 28.0) [2024-06-27 18:48:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 18:48:14,643][06909] Updated weights for policy 0, policy_version 54773 (0.0027) [2024-06-27 18:48:18,850][06674] Fps is (10 sec: 42597.4, 60 sec: 43417.6, 300 sec: 43320.7). Total num frames: 897548288. Throughput: 0: 43446.2. Samples: 800481800. Policy #0 lag: (min: 0.0, avg: 12.6, max: 28.0) [2024-06-27 18:48:18,859][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:48:19,317][06909] Updated weights for policy 0, policy_version 54783 (0.0043) [2024-06-27 18:48:22,284][06909] Updated weights for policy 0, policy_version 54793 (0.0043) [2024-06-27 18:48:23,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43690.7, 300 sec: 43542.6). Total num frames: 897794048. Throughput: 0: 43538.3. Samples: 800740780. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 18:48:23,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:48:26,976][06909] Updated weights for policy 0, policy_version 54803 (0.0032) [2024-06-27 18:48:28,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 897990656. Throughput: 0: 43484.4. Samples: 800877200. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 18:48:28,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:48:29,445][06909] Updated weights for policy 0, policy_version 54813 (0.0042) [2024-06-27 18:48:33,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43417.7, 300 sec: 43320.4). Total num frames: 898203648. Throughput: 0: 43517.4. Samples: 801137820. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 18:48:33,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:48:34,302][06909] Updated weights for policy 0, policy_version 54823 (0.0027) [2024-06-27 18:48:37,126][06909] Updated weights for policy 0, policy_version 54833 (0.0030) [2024-06-27 18:48:38,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43417.6, 300 sec: 43487.0). Total num frames: 898433024. Throughput: 0: 43552.1. Samples: 801387000. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 18:48:38,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:48:41,778][06909] Updated weights for policy 0, policy_version 54843 (0.0037) [2024-06-27 18:48:43,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43417.6, 300 sec: 43375.9). Total num frames: 898646016. Throughput: 0: 43580.8. Samples: 801528600. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 18:48:43,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:48:44,175][06887] Signal inference workers to stop experience collection... (11400 times) [2024-06-27 18:48:44,175][06887] Signal inference workers to resume experience collection... (11400 times) [2024-06-27 18:48:44,220][06909] InferenceWorker_p0-w0: stopping experience collection (11400 times) [2024-06-27 18:48:44,220][06909] InferenceWorker_p0-w0: resuming experience collection (11400 times) [2024-06-27 18:48:44,602][06909] Updated weights for policy 0, policy_version 54853 (0.0041) [2024-06-27 18:48:48,856][06674] Fps is (10 sec: 42572.5, 60 sec: 43686.3, 300 sec: 43430.6). Total num frames: 898859008. Throughput: 0: 43642.6. Samples: 801791240. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 18:48:48,856][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:48:49,209][06909] Updated weights for policy 0, policy_version 54863 (0.0033) [2024-06-27 18:48:52,518][06909] Updated weights for policy 0, policy_version 54873 (0.0037) [2024-06-27 18:48:53,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43422.0, 300 sec: 43487.0). Total num frames: 899088384. Throughput: 0: 43684.0. Samples: 802049520. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 18:48:53,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:48:56,617][06909] Updated weights for policy 0, policy_version 54883 (0.0038) [2024-06-27 18:48:58,850][06674] Fps is (10 sec: 44263.7, 60 sec: 43963.7, 300 sec: 43431.5). Total num frames: 899301376. Throughput: 0: 43585.3. Samples: 802184080. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 18:48:58,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:48:59,815][06909] Updated weights for policy 0, policy_version 54893 (0.0028) [2024-06-27 18:49:03,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 43375.9). Total num frames: 899514368. Throughput: 0: 43625.5. Samples: 802444940. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 18:49:03,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:49:04,307][06909] Updated weights for policy 0, policy_version 54903 (0.0036) [2024-06-27 18:49:07,115][06909] Updated weights for policy 0, policy_version 54913 (0.0029) [2024-06-27 18:49:08,852][06674] Fps is (10 sec: 45865.5, 60 sec: 43962.2, 300 sec: 43542.6). Total num frames: 899760128. Throughput: 0: 43597.9. Samples: 802702780. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 18:49:08,852][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:49:11,716][06909] Updated weights for policy 0, policy_version 54923 (0.0037) [2024-06-27 18:49:13,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.6, 300 sec: 43431.5). Total num frames: 899956736. Throughput: 0: 43588.5. Samples: 802838680. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 18:49:13,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:49:14,645][06909] Updated weights for policy 0, policy_version 54933 (0.0043) [2024-06-27 18:49:18,850][06674] Fps is (10 sec: 40968.3, 60 sec: 43690.7, 300 sec: 43431.5). Total num frames: 900169728. Throughput: 0: 43638.6. Samples: 803101560. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 18:49:18,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:49:19,122][06909] Updated weights for policy 0, policy_version 54943 (0.0035) [2024-06-27 18:49:22,252][06909] Updated weights for policy 0, policy_version 54953 (0.0031) [2024-06-27 18:49:23,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43690.7, 300 sec: 43542.6). Total num frames: 900415488. Throughput: 0: 43690.2. Samples: 803353060. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 18:49:23,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:49:26,629][06909] Updated weights for policy 0, policy_version 54963 (0.0039) [2024-06-27 18:49:28,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43417.6, 300 sec: 43375.9). Total num frames: 900595712. Throughput: 0: 43512.4. Samples: 803486660. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2024-06-27 18:49:28,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 18:49:29,925][06909] Updated weights for policy 0, policy_version 54973 (0.0036) [2024-06-27 18:49:33,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.7, 300 sec: 43431.8). Total num frames: 900825088. Throughput: 0: 43500.5. Samples: 803748500. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2024-06-27 18:49:33,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:49:34,164][06909] Updated weights for policy 0, policy_version 54983 (0.0022) [2024-06-27 18:49:37,492][06909] Updated weights for policy 0, policy_version 54993 (0.0032) [2024-06-27 18:49:38,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43690.7, 300 sec: 43487.0). Total num frames: 901054464. Throughput: 0: 43441.3. Samples: 804004380. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2024-06-27 18:49:38,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 18:49:42,060][06909] Updated weights for policy 0, policy_version 55003 (0.0035) [2024-06-27 18:49:43,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43417.7, 300 sec: 43431.5). Total num frames: 901251072. Throughput: 0: 43382.3. Samples: 804136280. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2024-06-27 18:49:43,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:49:44,955][06909] Updated weights for policy 0, policy_version 55013 (0.0023) [2024-06-27 18:49:48,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43695.1, 300 sec: 43431.5). Total num frames: 901480448. Throughput: 0: 43434.6. Samples: 804399500. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2024-06-27 18:49:48,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:49:48,871][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000055022_901480448.pth... [2024-06-27 18:49:48,918][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000054385_891043840.pth [2024-06-27 18:49:49,525][06909] Updated weights for policy 0, policy_version 55023 (0.0028) [2024-06-27 18:49:52,305][06887] Signal inference workers to stop experience collection... (11450 times) [2024-06-27 18:49:52,305][06887] Signal inference workers to resume experience collection... (11450 times) [2024-06-27 18:49:52,334][06909] InferenceWorker_p0-w0: stopping experience collection (11450 times) [2024-06-27 18:49:52,334][06909] InferenceWorker_p0-w0: resuming experience collection (11450 times) [2024-06-27 18:49:52,453][06909] Updated weights for policy 0, policy_version 55033 (0.0033) [2024-06-27 18:49:53,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43690.7, 300 sec: 43542.6). Total num frames: 901709824. Throughput: 0: 43310.5. Samples: 804651660. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2024-06-27 18:49:53,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:49:56,858][06909] Updated weights for policy 0, policy_version 55043 (0.0027) [2024-06-27 18:49:58,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43417.5, 300 sec: 43487.0). Total num frames: 901906432. Throughput: 0: 43523.0. Samples: 804797220. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2024-06-27 18:49:58,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:49:59,815][06909] Updated weights for policy 0, policy_version 55053 (0.0040) [2024-06-27 18:50:03,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43417.7, 300 sec: 43376.0). Total num frames: 902119424. Throughput: 0: 43389.1. Samples: 805054060. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 18:50:03,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:50:04,193][06909] Updated weights for policy 0, policy_version 55063 (0.0034) [2024-06-27 18:50:07,515][06909] Updated weights for policy 0, policy_version 55073 (0.0030) [2024-06-27 18:50:08,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43419.1, 300 sec: 43598.4). Total num frames: 902365184. Throughput: 0: 43414.2. Samples: 805306700. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 18:50:08,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:50:11,462][06909] Updated weights for policy 0, policy_version 55083 (0.0036) [2024-06-27 18:50:13,856][06674] Fps is (10 sec: 44209.4, 60 sec: 43413.2, 300 sec: 43486.1). Total num frames: 902561792. Throughput: 0: 43461.7. Samples: 805442700. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 18:50:13,856][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:50:15,142][06909] Updated weights for policy 0, policy_version 55093 (0.0032) [2024-06-27 18:50:18,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43690.7, 300 sec: 43431.5). Total num frames: 902791168. Throughput: 0: 43524.4. Samples: 805707100. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 18:50:18,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:50:19,211][06909] Updated weights for policy 0, policy_version 55103 (0.0041) [2024-06-27 18:50:22,594][06909] Updated weights for policy 0, policy_version 55113 (0.0050) [2024-06-27 18:50:23,850][06674] Fps is (10 sec: 44264.1, 60 sec: 43144.6, 300 sec: 43542.6). Total num frames: 903004160. Throughput: 0: 43510.7. Samples: 805962360. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 18:50:23,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:50:26,794][06909] Updated weights for policy 0, policy_version 55123 (0.0024) [2024-06-27 18:50:28,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 43653.7). Total num frames: 903233536. Throughput: 0: 43683.9. Samples: 806102060. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 18:50:28,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:50:30,161][06909] Updated weights for policy 0, policy_version 55133 (0.0042) [2024-06-27 18:50:33,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43144.5, 300 sec: 43375.9). Total num frames: 903413760. Throughput: 0: 43519.1. Samples: 806357860. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 18:50:33,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 18:50:34,386][06909] Updated weights for policy 0, policy_version 55143 (0.0027) [2024-06-27 18:50:37,641][06909] Updated weights for policy 0, policy_version 55153 (0.0027) [2024-06-27 18:50:38,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.6, 300 sec: 43653.9). Total num frames: 903675904. Throughput: 0: 43717.3. Samples: 806618940. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-27 18:50:38,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:50:41,626][06909] Updated weights for policy 0, policy_version 55163 (0.0036) [2024-06-27 18:50:43,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43690.6, 300 sec: 43542.6). Total num frames: 903872512. Throughput: 0: 43587.2. Samples: 806758640. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-27 18:50:43,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:50:44,936][06909] Updated weights for policy 0, policy_version 55173 (0.0035) [2024-06-27 18:50:48,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43417.6, 300 sec: 43487.0). Total num frames: 904085504. Throughput: 0: 43733.3. Samples: 807022060. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-27 18:50:48,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:50:49,226][06909] Updated weights for policy 0, policy_version 55183 (0.0036) [2024-06-27 18:50:52,705][06909] Updated weights for policy 0, policy_version 55193 (0.0032) [2024-06-27 18:50:53,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43690.7, 300 sec: 43653.7). Total num frames: 904331264. Throughput: 0: 43635.6. Samples: 807270300. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-27 18:50:53,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:50:56,560][06909] Updated weights for policy 0, policy_version 55203 (0.0035) [2024-06-27 18:50:58,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43690.7, 300 sec: 43542.6). Total num frames: 904527872. Throughput: 0: 43740.1. Samples: 807410740. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-27 18:50:58,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:51:00,034][06909] Updated weights for policy 0, policy_version 55213 (0.0027) [2024-06-27 18:51:03,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.7, 300 sec: 43487.0). Total num frames: 904757248. Throughput: 0: 43685.4. Samples: 807672940. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-27 18:51:03,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:51:03,997][06909] Updated weights for policy 0, policy_version 55223 (0.0028) [2024-06-27 18:51:07,048][06887] Signal inference workers to stop experience collection... (11500 times) [2024-06-27 18:51:07,048][06887] Signal inference workers to resume experience collection... (11500 times) [2024-06-27 18:51:07,090][06909] InferenceWorker_p0-w0: stopping experience collection (11500 times) [2024-06-27 18:51:07,090][06909] InferenceWorker_p0-w0: resuming experience collection (11500 times) [2024-06-27 18:51:07,606][06909] Updated weights for policy 0, policy_version 55233 (0.0041) [2024-06-27 18:51:08,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 904986624. Throughput: 0: 43764.9. Samples: 807931780. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-27 18:51:08,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 18:51:11,888][06909] Updated weights for policy 0, policy_version 55243 (0.0035) [2024-06-27 18:51:13,852][06674] Fps is (10 sec: 42589.7, 60 sec: 43693.6, 300 sec: 43597.8). Total num frames: 905183232. Throughput: 0: 43667.4. Samples: 808067180. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-27 18:51:13,853][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:51:15,064][06909] Updated weights for policy 0, policy_version 55253 (0.0027) [2024-06-27 18:51:18,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.7, 300 sec: 43487.0). Total num frames: 905412608. Throughput: 0: 43894.2. Samples: 808333100. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-27 18:51:18,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:51:19,156][06909] Updated weights for policy 0, policy_version 55263 (0.0030) [2024-06-27 18:51:22,552][06909] Updated weights for policy 0, policy_version 55273 (0.0026) [2024-06-27 18:51:23,852][06674] Fps is (10 sec: 45875.0, 60 sec: 43962.2, 300 sec: 43653.3). Total num frames: 905641984. Throughput: 0: 43838.0. Samples: 808591740. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-27 18:51:23,852][06674] Avg episode reward: [(0, '0.396')] [2024-06-27 18:51:26,537][06909] Updated weights for policy 0, policy_version 55283 (0.0046) [2024-06-27 18:51:28,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43417.7, 300 sec: 43542.6). Total num frames: 905838592. Throughput: 0: 43765.4. Samples: 808728080. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-27 18:51:28,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 18:51:30,070][06909] Updated weights for policy 0, policy_version 55293 (0.0035) [2024-06-27 18:51:33,850][06674] Fps is (10 sec: 42607.0, 60 sec: 44236.7, 300 sec: 43542.6). Total num frames: 906067968. Throughput: 0: 43707.5. Samples: 808988900. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-27 18:51:33,850][06674] Avg episode reward: [(0, '0.407')] [2024-06-27 18:51:34,015][06909] Updated weights for policy 0, policy_version 55303 (0.0035) [2024-06-27 18:51:37,873][06909] Updated weights for policy 0, policy_version 55313 (0.0031) [2024-06-27 18:51:38,856][06674] Fps is (10 sec: 44209.6, 60 sec: 43413.2, 300 sec: 43597.2). Total num frames: 906280960. Throughput: 0: 43941.5. Samples: 809247940. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-27 18:51:38,857][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 18:51:41,536][06909] Updated weights for policy 0, policy_version 55323 (0.0030) [2024-06-27 18:51:43,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 906493952. Throughput: 0: 43759.6. Samples: 809379920. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-27 18:51:43,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:51:45,254][06909] Updated weights for policy 0, policy_version 55333 (0.0027) [2024-06-27 18:51:48,850][06674] Fps is (10 sec: 44263.8, 60 sec: 43963.7, 300 sec: 43542.9). Total num frames: 906723328. Throughput: 0: 43878.2. Samples: 809647460. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 18:51:48,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:51:48,856][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000055342_906723328.pth... [2024-06-27 18:51:48,915][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000054702_896237568.pth [2024-06-27 18:51:49,074][06909] Updated weights for policy 0, policy_version 55343 (0.0030) [2024-06-27 18:51:52,712][06909] Updated weights for policy 0, policy_version 55353 (0.0038) [2024-06-27 18:51:53,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 906936320. Throughput: 0: 43721.4. Samples: 809899240. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 18:51:53,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:51:56,654][06909] Updated weights for policy 0, policy_version 55363 (0.0036) [2024-06-27 18:51:58,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.8, 300 sec: 43598.1). Total num frames: 907165696. Throughput: 0: 43833.6. Samples: 810039600. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 18:51:58,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:51:59,998][06909] Updated weights for policy 0, policy_version 55373 (0.0036) [2024-06-27 18:52:03,850][06674] Fps is (10 sec: 44235.8, 60 sec: 43690.5, 300 sec: 43542.5). Total num frames: 907378688. Throughput: 0: 43903.4. Samples: 810308760. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 18:52:03,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:52:04,055][06909] Updated weights for policy 0, policy_version 55383 (0.0035) [2024-06-27 18:52:07,238][06909] Updated weights for policy 0, policy_version 55393 (0.0042) [2024-06-27 18:52:08,850][06674] Fps is (10 sec: 42597.0, 60 sec: 43417.4, 300 sec: 43598.1). Total num frames: 907591680. Throughput: 0: 43965.4. Samples: 810570100. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 18:52:08,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:52:11,546][06909] Updated weights for policy 0, policy_version 55403 (0.0035) [2024-06-27 18:52:13,850][06674] Fps is (10 sec: 45876.1, 60 sec: 44238.3, 300 sec: 43709.2). Total num frames: 907837440. Throughput: 0: 43858.2. Samples: 810701700. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 18:52:13,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:52:14,786][06909] Updated weights for policy 0, policy_version 55413 (0.0038) [2024-06-27 18:52:18,827][06909] Updated weights for policy 0, policy_version 55423 (0.0035) [2024-06-27 18:52:18,850][06674] Fps is (10 sec: 45876.2, 60 sec: 43963.7, 300 sec: 43653.6). Total num frames: 908050432. Throughput: 0: 43898.7. Samples: 810964340. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 18:52:18,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:52:22,613][06909] Updated weights for policy 0, policy_version 55433 (0.0034) [2024-06-27 18:52:23,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43419.1, 300 sec: 43598.1). Total num frames: 908247040. Throughput: 0: 43796.1. Samples: 811218500. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 18:52:23,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:52:24,207][06887] Signal inference workers to stop experience collection... (11550 times) [2024-06-27 18:52:24,247][06909] InferenceWorker_p0-w0: stopping experience collection (11550 times) [2024-06-27 18:52:24,267][06887] Signal inference workers to resume experience collection... (11550 times) [2024-06-27 18:52:24,268][06909] InferenceWorker_p0-w0: resuming experience collection (11550 times) [2024-06-27 18:52:26,559][06909] Updated weights for policy 0, policy_version 55443 (0.0027) [2024-06-27 18:52:28,852][06674] Fps is (10 sec: 44227.7, 60 sec: 44235.2, 300 sec: 43708.9). Total num frames: 908492800. Throughput: 0: 43891.7. Samples: 811355140. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 18:52:28,852][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:52:30,049][06909] Updated weights for policy 0, policy_version 55453 (0.0031) [2024-06-27 18:52:33,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 908689408. Throughput: 0: 43808.0. Samples: 811618820. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 18:52:33,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:52:33,945][06909] Updated weights for policy 0, policy_version 55463 (0.0028) [2024-06-27 18:52:37,321][06909] Updated weights for policy 0, policy_version 55473 (0.0029) [2024-06-27 18:52:38,850][06674] Fps is (10 sec: 42607.4, 60 sec: 43968.2, 300 sec: 43653.6). Total num frames: 908918784. Throughput: 0: 44076.4. Samples: 811882680. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 18:52:38,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:52:41,432][06909] Updated weights for policy 0, policy_version 55483 (0.0027) [2024-06-27 18:52:43,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 909131776. Throughput: 0: 43917.7. Samples: 812015900. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 18:52:43,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:52:44,694][06909] Updated weights for policy 0, policy_version 55493 (0.0038) [2024-06-27 18:52:48,814][06909] Updated weights for policy 0, policy_version 55503 (0.0037) [2024-06-27 18:52:48,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.7, 300 sec: 43654.5). Total num frames: 909361152. Throughput: 0: 43699.2. Samples: 812275220. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 18:52:48,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:52:52,625][06909] Updated weights for policy 0, policy_version 55513 (0.0029) [2024-06-27 18:52:53,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 909557760. Throughput: 0: 43472.2. Samples: 812526340. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 18:52:53,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:52:57,037][06909] Updated weights for policy 0, policy_version 55523 (0.0043) [2024-06-27 18:52:58,850][06674] Fps is (10 sec: 44234.8, 60 sec: 43963.3, 300 sec: 43764.6). Total num frames: 909803520. Throughput: 0: 43576.8. Samples: 812662680. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2024-06-27 18:52:58,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:53:00,100][06909] Updated weights for policy 0, policy_version 55533 (0.0047) [2024-06-27 18:53:03,852][06674] Fps is (10 sec: 44227.7, 60 sec: 43689.3, 300 sec: 43653.3). Total num frames: 910000128. Throughput: 0: 43541.1. Samples: 812923780. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2024-06-27 18:53:03,852][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:53:04,392][06909] Updated weights for policy 0, policy_version 55543 (0.0031) [2024-06-27 18:53:07,369][06909] Updated weights for policy 0, policy_version 55553 (0.0039) [2024-06-27 18:53:08,850][06674] Fps is (10 sec: 42600.4, 60 sec: 43963.9, 300 sec: 43709.2). Total num frames: 910229504. Throughput: 0: 43683.2. Samples: 813184240. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2024-06-27 18:53:08,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:53:11,799][06909] Updated weights for policy 0, policy_version 55563 (0.0037) [2024-06-27 18:53:13,850][06674] Fps is (10 sec: 45884.8, 60 sec: 43690.7, 300 sec: 43764.8). Total num frames: 910458880. Throughput: 0: 43578.5. Samples: 813316080. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2024-06-27 18:53:13,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:53:14,990][06909] Updated weights for policy 0, policy_version 55573 (0.0036) [2024-06-27 18:53:18,852][06674] Fps is (10 sec: 42589.8, 60 sec: 43416.1, 300 sec: 43597.8). Total num frames: 910655488. Throughput: 0: 43631.8. Samples: 813582340. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2024-06-27 18:53:18,852][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:53:19,130][06909] Updated weights for policy 0, policy_version 55583 (0.0026) [2024-06-27 18:53:22,414][06909] Updated weights for policy 0, policy_version 55593 (0.0023) [2024-06-27 18:53:23,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43963.8, 300 sec: 43709.2). Total num frames: 910884864. Throughput: 0: 43551.0. Samples: 813842480. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2024-06-27 18:53:23,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:53:27,021][06909] Updated weights for policy 0, policy_version 55603 (0.0044) [2024-06-27 18:53:28,850][06674] Fps is (10 sec: 44246.0, 60 sec: 43419.1, 300 sec: 43709.2). Total num frames: 911097856. Throughput: 0: 43544.9. Samples: 813975420. Policy #0 lag: (min: 0.0, avg: 10.8, max: 24.0) [2024-06-27 18:53:28,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 18:53:29,762][06909] Updated weights for policy 0, policy_version 55613 (0.0036) [2024-06-27 18:53:33,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 911310848. Throughput: 0: 43548.0. Samples: 814234880. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-27 18:53:33,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 18:53:34,375][06909] Updated weights for policy 0, policy_version 55623 (0.0044) [2024-06-27 18:53:35,984][06887] Signal inference workers to stop experience collection... (11600 times) [2024-06-27 18:53:36,038][06909] InferenceWorker_p0-w0: stopping experience collection (11600 times) [2024-06-27 18:53:36,045][06887] Signal inference workers to resume experience collection... (11600 times) [2024-06-27 18:53:36,047][06909] InferenceWorker_p0-w0: resuming experience collection (11600 times) [2024-06-27 18:53:37,415][06909] Updated weights for policy 0, policy_version 55633 (0.0032) [2024-06-27 18:53:38,852][06674] Fps is (10 sec: 42589.6, 60 sec: 43416.1, 300 sec: 43653.3). Total num frames: 911523840. Throughput: 0: 43634.5. Samples: 814489980. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-27 18:53:38,852][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:53:41,900][06909] Updated weights for policy 0, policy_version 55643 (0.0034) [2024-06-27 18:53:43,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.6, 300 sec: 43765.6). Total num frames: 911769600. Throughput: 0: 43746.2. Samples: 814631240. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-27 18:53:43,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:53:45,072][06909] Updated weights for policy 0, policy_version 55653 (0.0034) [2024-06-27 18:53:48,850][06674] Fps is (10 sec: 44245.5, 60 sec: 43417.6, 300 sec: 43653.6). Total num frames: 911966208. Throughput: 0: 43728.6. Samples: 814891480. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-27 18:53:48,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:53:48,863][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000055662_911966208.pth... [2024-06-27 18:53:48,918][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000055022_901480448.pth [2024-06-27 18:53:49,158][06909] Updated weights for policy 0, policy_version 55663 (0.0030) [2024-06-27 18:53:52,880][06909] Updated weights for policy 0, policy_version 55673 (0.0027) [2024-06-27 18:53:53,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 912195584. Throughput: 0: 43682.2. Samples: 815149940. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-27 18:53:53,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:53:56,860][06909] Updated weights for policy 0, policy_version 55683 (0.0031) [2024-06-27 18:53:58,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43418.0, 300 sec: 43709.2). Total num frames: 912408576. Throughput: 0: 43806.6. Samples: 815287380. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-27 18:53:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 18:54:00,215][06909] Updated weights for policy 0, policy_version 55693 (0.0032) [2024-06-27 18:54:03,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43692.2, 300 sec: 43598.4). Total num frames: 912621568. Throughput: 0: 43610.0. Samples: 815544700. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-27 18:54:03,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:54:04,087][06909] Updated weights for policy 0, policy_version 55703 (0.0032) [2024-06-27 18:54:07,498][06909] Updated weights for policy 0, policy_version 55713 (0.0032) [2024-06-27 18:54:08,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 912850944. Throughput: 0: 43686.7. Samples: 815808380. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-27 18:54:08,850][06674] Avg episode reward: [(0, '0.428')] [2024-06-27 18:54:11,815][06909] Updated weights for policy 0, policy_version 55723 (0.0035) [2024-06-27 18:54:13,852][06674] Fps is (10 sec: 44227.9, 60 sec: 43416.1, 300 sec: 43708.9). Total num frames: 913063936. Throughput: 0: 43631.8. Samples: 815938940. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-27 18:54:13,852][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:54:14,830][06909] Updated weights for policy 0, policy_version 55733 (0.0031) [2024-06-27 18:54:18,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43692.2, 300 sec: 43598.1). Total num frames: 913276928. Throughput: 0: 43646.3. Samples: 816198960. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-27 18:54:18,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:54:18,979][06909] Updated weights for policy 0, policy_version 55743 (0.0034) [2024-06-27 18:54:22,808][06909] Updated weights for policy 0, policy_version 55753 (0.0031) [2024-06-27 18:54:23,850][06674] Fps is (10 sec: 44245.4, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 913506304. Throughput: 0: 43805.0. Samples: 816461120. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-27 18:54:23,851][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 18:54:26,630][06909] Updated weights for policy 0, policy_version 55763 (0.0028) [2024-06-27 18:54:28,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43417.6, 300 sec: 43653.6). Total num frames: 913702912. Throughput: 0: 43600.2. Samples: 816593240. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-27 18:54:28,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:54:30,248][06909] Updated weights for policy 0, policy_version 55773 (0.0029) [2024-06-27 18:54:33,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 913932288. Throughput: 0: 43628.9. Samples: 816854780. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-27 18:54:33,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:54:33,968][06909] Updated weights for policy 0, policy_version 55783 (0.0032) [2024-06-27 18:54:37,748][06909] Updated weights for policy 0, policy_version 55793 (0.0022) [2024-06-27 18:54:38,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44238.3, 300 sec: 43820.3). Total num frames: 914178048. Throughput: 0: 43755.2. Samples: 817118920. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-27 18:54:38,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:54:41,332][06909] Updated weights for policy 0, policy_version 55803 (0.0041) [2024-06-27 18:54:43,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43144.5, 300 sec: 43653.6). Total num frames: 914358272. Throughput: 0: 43504.8. Samples: 817245100. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 18:54:43,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:54:45,152][06909] Updated weights for policy 0, policy_version 55813 (0.0022) [2024-06-27 18:54:48,644][06909] Updated weights for policy 0, policy_version 55823 (0.0046) [2024-06-27 18:54:48,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.8, 300 sec: 43709.2). Total num frames: 914604032. Throughput: 0: 43738.2. Samples: 817512920. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 18:54:48,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:54:52,915][06909] Updated weights for policy 0, policy_version 55833 (0.0029) [2024-06-27 18:54:53,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 914817024. Throughput: 0: 43587.1. Samples: 817769800. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 18:54:53,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:54:56,491][06909] Updated weights for policy 0, policy_version 55843 (0.0024) [2024-06-27 18:54:58,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43417.5, 300 sec: 43709.1). Total num frames: 915013632. Throughput: 0: 43558.7. Samples: 817899000. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 18:54:58,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:55:00,446][06909] Updated weights for policy 0, policy_version 55853 (0.0040) [2024-06-27 18:55:03,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 915243008. Throughput: 0: 43505.4. Samples: 818156700. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 18:55:03,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:55:04,070][06909] Updated weights for policy 0, policy_version 55863 (0.0027) [2024-06-27 18:55:06,810][06887] Signal inference workers to stop experience collection... (11650 times) [2024-06-27 18:55:06,868][06909] InferenceWorker_p0-w0: stopping experience collection (11650 times) [2024-06-27 18:55:06,927][06887] Signal inference workers to resume experience collection... (11650 times) [2024-06-27 18:55:06,927][06909] InferenceWorker_p0-w0: resuming experience collection (11650 times) [2024-06-27 18:55:07,813][06909] Updated weights for policy 0, policy_version 55873 (0.0027) [2024-06-27 18:55:08,850][06674] Fps is (10 sec: 45876.1, 60 sec: 43690.7, 300 sec: 43765.6). Total num frames: 915472384. Throughput: 0: 43453.9. Samples: 818416540. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 18:55:08,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:55:11,883][06909] Updated weights for policy 0, policy_version 55883 (0.0037) [2024-06-27 18:55:13,856][06674] Fps is (10 sec: 42572.3, 60 sec: 43414.7, 300 sec: 43652.7). Total num frames: 915668992. Throughput: 0: 43419.4. Samples: 818547380. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 18:55:13,856][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:55:15,229][06909] Updated weights for policy 0, policy_version 55893 (0.0040) [2024-06-27 18:55:18,850][06674] Fps is (10 sec: 42597.4, 60 sec: 43690.5, 300 sec: 43709.1). Total num frames: 915898368. Throughput: 0: 43488.3. Samples: 818811760. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-27 18:55:18,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 18:55:19,214][06909] Updated weights for policy 0, policy_version 55903 (0.0023) [2024-06-27 18:55:22,919][06909] Updated weights for policy 0, policy_version 55913 (0.0026) [2024-06-27 18:55:23,850][06674] Fps is (10 sec: 45903.3, 60 sec: 43690.8, 300 sec: 43709.2). Total num frames: 916127744. Throughput: 0: 43549.3. Samples: 819078640. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-27 18:55:23,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:55:26,546][06909] Updated weights for policy 0, policy_version 55923 (0.0035) [2024-06-27 18:55:28,850][06674] Fps is (10 sec: 40960.8, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 916307968. Throughput: 0: 43673.0. Samples: 819210380. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-27 18:55:28,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:55:30,270][06909] Updated weights for policy 0, policy_version 55933 (0.0046) [2024-06-27 18:55:33,836][06909] Updated weights for policy 0, policy_version 55943 (0.0040) [2024-06-27 18:55:33,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 916570112. Throughput: 0: 43497.7. Samples: 819470320. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-27 18:55:33,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 18:55:37,997][06909] Updated weights for policy 0, policy_version 55953 (0.0032) [2024-06-27 18:55:38,850][06674] Fps is (10 sec: 47514.0, 60 sec: 43417.6, 300 sec: 43764.7). Total num frames: 916783104. Throughput: 0: 43540.1. Samples: 819729100. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-27 18:55:38,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:55:41,362][06909] Updated weights for policy 0, policy_version 55963 (0.0034) [2024-06-27 18:55:43,850][06674] Fps is (10 sec: 39321.7, 60 sec: 43417.6, 300 sec: 43653.6). Total num frames: 916963328. Throughput: 0: 43506.8. Samples: 819856800. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-27 18:55:43,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:55:45,546][06909] Updated weights for policy 0, policy_version 55973 (0.0040) [2024-06-27 18:55:48,850][06674] Fps is (10 sec: 42597.7, 60 sec: 43417.6, 300 sec: 43653.6). Total num frames: 917209088. Throughput: 0: 43629.2. Samples: 820120020. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-27 18:55:48,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:55:48,871][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000055982_917209088.pth... [2024-06-27 18:55:48,926][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000055342_906723328.pth [2024-06-27 18:55:49,092][06909] Updated weights for policy 0, policy_version 55983 (0.0037) [2024-06-27 18:55:53,215][06909] Updated weights for policy 0, policy_version 55993 (0.0029) [2024-06-27 18:55:53,850][06674] Fps is (10 sec: 47513.6, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 917438464. Throughput: 0: 43708.8. Samples: 820383440. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-27 18:55:53,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:55:56,365][06909] Updated weights for policy 0, policy_version 56003 (0.0045) [2024-06-27 18:55:58,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 917635072. Throughput: 0: 43631.2. Samples: 820510520. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-27 18:55:58,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:56:00,737][06909] Updated weights for policy 0, policy_version 56013 (0.0031) [2024-06-27 18:56:03,705][06909] Updated weights for policy 0, policy_version 56023 (0.0028) [2024-06-27 18:56:03,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.6, 300 sec: 43709.2). Total num frames: 917880832. Throughput: 0: 43628.0. Samples: 820775020. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-27 18:56:03,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 18:56:08,214][06909] Updated weights for policy 0, policy_version 56033 (0.0029) [2024-06-27 18:56:08,850][06674] Fps is (10 sec: 45875.8, 60 sec: 43690.7, 300 sec: 43765.0). Total num frames: 918093824. Throughput: 0: 43590.2. Samples: 821040200. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-27 18:56:08,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:56:11,449][06909] Updated weights for policy 0, policy_version 56043 (0.0034) [2024-06-27 18:56:13,850][06674] Fps is (10 sec: 39321.8, 60 sec: 43421.9, 300 sec: 43598.1). Total num frames: 918274048. Throughput: 0: 43511.0. Samples: 821168380. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-27 18:56:13,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:56:14,903][06887] Signal inference workers to stop experience collection... (11700 times) [2024-06-27 18:56:14,903][06887] Signal inference workers to resume experience collection... (11700 times) [2024-06-27 18:56:14,944][06909] InferenceWorker_p0-w0: stopping experience collection (11700 times) [2024-06-27 18:56:14,944][06909] InferenceWorker_p0-w0: resuming experience collection (11700 times) [2024-06-27 18:56:15,538][06909] Updated weights for policy 0, policy_version 56053 (0.0028) [2024-06-27 18:56:18,710][06909] Updated weights for policy 0, policy_version 56063 (0.0032) [2024-06-27 18:56:18,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.9, 300 sec: 43709.5). Total num frames: 918536192. Throughput: 0: 43548.1. Samples: 821429980. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-27 18:56:18,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:56:23,295][06909] Updated weights for policy 0, policy_version 56073 (0.0032) [2024-06-27 18:56:23,850][06674] Fps is (10 sec: 45875.8, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 918732800. Throughput: 0: 43641.7. Samples: 821692980. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-27 18:56:23,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:56:26,226][06909] Updated weights for policy 0, policy_version 56083 (0.0029) [2024-06-27 18:56:28,850][06674] Fps is (10 sec: 37683.1, 60 sec: 43417.6, 300 sec: 43542.6). Total num frames: 918913024. Throughput: 0: 43588.5. Samples: 821818280. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-27 18:56:28,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:56:30,679][06909] Updated weights for policy 0, policy_version 56093 (0.0043) [2024-06-27 18:56:33,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43417.6, 300 sec: 43710.1). Total num frames: 919175168. Throughput: 0: 43578.7. Samples: 822081060. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-27 18:56:33,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:56:34,006][06909] Updated weights for policy 0, policy_version 56103 (0.0038) [2024-06-27 18:56:38,271][06909] Updated weights for policy 0, policy_version 56113 (0.0032) [2024-06-27 18:56:38,850][06674] Fps is (10 sec: 47514.0, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 919388160. Throughput: 0: 43602.3. Samples: 822345540. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-27 18:56:38,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:56:41,327][06909] Updated weights for policy 0, policy_version 56123 (0.0048) [2024-06-27 18:56:43,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 919584768. Throughput: 0: 43583.2. Samples: 822471760. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-27 18:56:43,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:56:45,699][06909] Updated weights for policy 0, policy_version 56133 (0.0023) [2024-06-27 18:56:48,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 919830528. Throughput: 0: 43637.9. Samples: 822738720. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-27 18:56:48,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:56:48,979][06909] Updated weights for policy 0, policy_version 56143 (0.0041) [2024-06-27 18:56:53,190][06909] Updated weights for policy 0, policy_version 56153 (0.0030) [2024-06-27 18:56:53,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43417.6, 300 sec: 43653.6). Total num frames: 920043520. Throughput: 0: 43663.9. Samples: 823005080. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-27 18:56:53,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:56:56,296][06909] Updated weights for policy 0, policy_version 56163 (0.0027) [2024-06-27 18:56:58,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43417.7, 300 sec: 43598.1). Total num frames: 920240128. Throughput: 0: 43565.9. Samples: 823128840. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-27 18:56:58,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:57:00,856][06909] Updated weights for policy 0, policy_version 56173 (0.0030) [2024-06-27 18:57:03,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 920485888. Throughput: 0: 43524.8. Samples: 823388600. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 18:57:03,851][06674] Avg episode reward: [(0, '0.394')] [2024-06-27 18:57:04,212][06909] Updated weights for policy 0, policy_version 56183 (0.0034) [2024-06-27 18:57:08,259][06909] Updated weights for policy 0, policy_version 56193 (0.0033) [2024-06-27 18:57:08,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43417.5, 300 sec: 43598.1). Total num frames: 920698880. Throughput: 0: 43601.2. Samples: 823655040. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 18:57:08,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:57:11,568][06909] Updated weights for policy 0, policy_version 56203 (0.0030) [2024-06-27 18:57:13,850][06674] Fps is (10 sec: 39322.3, 60 sec: 43417.7, 300 sec: 43487.0). Total num frames: 920879104. Throughput: 0: 43544.1. Samples: 823777760. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 18:57:13,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:57:15,914][06909] Updated weights for policy 0, policy_version 56213 (0.0029) [2024-06-27 18:57:18,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43417.5, 300 sec: 43709.2). Total num frames: 921141248. Throughput: 0: 43678.6. Samples: 824046600. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 18:57:18,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:57:18,878][06909] Updated weights for policy 0, policy_version 56223 (0.0029) [2024-06-27 18:57:23,352][06909] Updated weights for policy 0, policy_version 56233 (0.0046) [2024-06-27 18:57:23,852][06674] Fps is (10 sec: 47503.6, 60 sec: 43689.2, 300 sec: 43598.1). Total num frames: 921354240. Throughput: 0: 43573.5. Samples: 824306440. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 18:57:23,852][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:57:26,723][06909] Updated weights for policy 0, policy_version 56243 (0.0042) [2024-06-27 18:57:28,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44236.8, 300 sec: 43653.6). Total num frames: 921567232. Throughput: 0: 43638.5. Samples: 824435500. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 18:57:28,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 18:57:30,794][06909] Updated weights for policy 0, policy_version 56253 (0.0025) [2024-06-27 18:57:31,202][06887] Signal inference workers to stop experience collection... (11750 times) [2024-06-27 18:57:31,203][06887] Signal inference workers to resume experience collection... (11750 times) [2024-06-27 18:57:31,240][06909] InferenceWorker_p0-w0: stopping experience collection (11750 times) [2024-06-27 18:57:31,240][06909] InferenceWorker_p0-w0: resuming experience collection (11750 times) [2024-06-27 18:57:33,850][06674] Fps is (10 sec: 42607.0, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 921780224. Throughput: 0: 43674.7. Samples: 824704080. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 18:57:33,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:57:34,034][06909] Updated weights for policy 0, policy_version 56263 (0.0023) [2024-06-27 18:57:38,478][06909] Updated weights for policy 0, policy_version 56273 (0.0024) [2024-06-27 18:57:38,851][06674] Fps is (10 sec: 42594.7, 60 sec: 43416.9, 300 sec: 43598.0). Total num frames: 921993216. Throughput: 0: 43747.1. Samples: 824973740. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 18:57:38,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:57:41,667][06909] Updated weights for policy 0, policy_version 56283 (0.0028) [2024-06-27 18:57:43,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.7, 300 sec: 43542.6). Total num frames: 922206208. Throughput: 0: 43684.5. Samples: 825094640. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 18:57:43,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:57:45,842][06909] Updated weights for policy 0, policy_version 56293 (0.0038) [2024-06-27 18:57:48,852][06674] Fps is (10 sec: 45870.1, 60 sec: 43689.2, 300 sec: 43708.9). Total num frames: 922451968. Throughput: 0: 43741.2. Samples: 825357040. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 18:57:48,864][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 18:57:48,885][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000056302_922451968.pth... [2024-06-27 18:57:48,941][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000055662_911966208.pth [2024-06-27 18:57:49,318][06909] Updated weights for policy 0, policy_version 56303 (0.0030) [2024-06-27 18:57:53,836][06909] Updated weights for policy 0, policy_version 56313 (0.0038) [2024-06-27 18:57:53,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43144.5, 300 sec: 43487.1). Total num frames: 922632192. Throughput: 0: 43620.0. Samples: 825617940. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 18:57:53,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:57:56,909][06909] Updated weights for policy 0, policy_version 56323 (0.0035) [2024-06-27 18:57:58,850][06674] Fps is (10 sec: 40968.2, 60 sec: 43690.6, 300 sec: 43598.4). Total num frames: 922861568. Throughput: 0: 43580.3. Samples: 825738880. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 18:57:58,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:58:01,232][06909] Updated weights for policy 0, policy_version 56333 (0.0025) [2024-06-27 18:58:03,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43144.6, 300 sec: 43542.6). Total num frames: 923074560. Throughput: 0: 43460.9. Samples: 826002340. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 18:58:03,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:58:04,639][06909] Updated weights for policy 0, policy_version 56343 (0.0034) [2024-06-27 18:58:08,541][06909] Updated weights for policy 0, policy_version 56353 (0.0035) [2024-06-27 18:58:08,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43417.7, 300 sec: 43542.6). Total num frames: 923303936. Throughput: 0: 43581.5. Samples: 826267520. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 18:58:08,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:58:11,882][06909] Updated weights for policy 0, policy_version 56363 (0.0028) [2024-06-27 18:58:13,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.6, 300 sec: 43598.4). Total num frames: 923516928. Throughput: 0: 43649.4. Samples: 826399720. Policy #0 lag: (min: 1.0, avg: 10.6, max: 23.0) [2024-06-27 18:58:13,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:58:16,010][06909] Updated weights for policy 0, policy_version 56373 (0.0034) [2024-06-27 18:58:18,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43144.6, 300 sec: 43542.6). Total num frames: 923729920. Throughput: 0: 43387.6. Samples: 826656520. Policy #0 lag: (min: 1.0, avg: 10.6, max: 23.0) [2024-06-27 18:58:18,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:58:19,407][06909] Updated weights for policy 0, policy_version 56383 (0.0038) [2024-06-27 18:58:23,715][06909] Updated weights for policy 0, policy_version 56393 (0.0023) [2024-06-27 18:58:23,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43145.9, 300 sec: 43542.5). Total num frames: 923942912. Throughput: 0: 43216.4. Samples: 826918440. Policy #0 lag: (min: 1.0, avg: 10.6, max: 23.0) [2024-06-27 18:58:23,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 18:58:27,149][06909] Updated weights for policy 0, policy_version 56403 (0.0033) [2024-06-27 18:58:28,850][06674] Fps is (10 sec: 44235.2, 60 sec: 43417.4, 300 sec: 43598.1). Total num frames: 924172288. Throughput: 0: 43352.5. Samples: 827045520. Policy #0 lag: (min: 1.0, avg: 10.6, max: 23.0) [2024-06-27 18:58:28,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:58:31,314][06909] Updated weights for policy 0, policy_version 56413 (0.0025) [2024-06-27 18:58:33,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43417.6, 300 sec: 43598.4). Total num frames: 924385280. Throughput: 0: 43261.1. Samples: 827303700. Policy #0 lag: (min: 1.0, avg: 10.6, max: 23.0) [2024-06-27 18:58:33,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:58:34,915][06909] Updated weights for policy 0, policy_version 56423 (0.0027) [2024-06-27 18:58:38,850][06674] Fps is (10 sec: 40961.2, 60 sec: 43145.2, 300 sec: 43431.5). Total num frames: 924581888. Throughput: 0: 43389.3. Samples: 827570460. Policy #0 lag: (min: 1.0, avg: 10.6, max: 23.0) [2024-06-27 18:58:38,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:58:38,968][06909] Updated weights for policy 0, policy_version 56433 (0.0037) [2024-06-27 18:58:42,297][06909] Updated weights for policy 0, policy_version 56443 (0.0031) [2024-06-27 18:58:43,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.7, 300 sec: 43653.7). Total num frames: 924844032. Throughput: 0: 43726.3. Samples: 827706560. Policy #0 lag: (min: 1.0, avg: 10.6, max: 23.0) [2024-06-27 18:58:43,850][06674] Avg episode reward: [(0, '0.407')] [2024-06-27 18:58:46,256][06909] Updated weights for policy 0, policy_version 56453 (0.0028) [2024-06-27 18:58:48,850][06674] Fps is (10 sec: 44237.2, 60 sec: 42873.0, 300 sec: 43487.0). Total num frames: 925024256. Throughput: 0: 43625.5. Samples: 827965480. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 18:58:48,850][06674] Avg episode reward: [(0, '0.408')] [2024-06-27 18:58:49,635][06909] Updated weights for policy 0, policy_version 56463 (0.0036) [2024-06-27 18:58:53,497][06909] Updated weights for policy 0, policy_version 56473 (0.0029) [2024-06-27 18:58:53,856][06674] Fps is (10 sec: 40934.8, 60 sec: 43686.3, 300 sec: 43541.7). Total num frames: 925253632. Throughput: 0: 43578.5. Samples: 828228820. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 18:58:53,857][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:58:56,978][06909] Updated weights for policy 0, policy_version 56483 (0.0040) [2024-06-27 18:58:58,854][06674] Fps is (10 sec: 45854.1, 60 sec: 43687.4, 300 sec: 43597.4). Total num frames: 925483008. Throughput: 0: 43668.5. Samples: 828365000. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 18:58:58,855][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 18:59:00,968][06909] Updated weights for policy 0, policy_version 56493 (0.0030) [2024-06-27 18:59:03,850][06674] Fps is (10 sec: 44264.0, 60 sec: 43690.8, 300 sec: 43542.6). Total num frames: 925696000. Throughput: 0: 43765.3. Samples: 828625960. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 18:59:03,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:59:04,503][06909] Updated weights for policy 0, policy_version 56503 (0.0035) [2024-06-27 18:59:05,916][06887] Signal inference workers to stop experience collection... (11800 times) [2024-06-27 18:59:05,917][06887] Signal inference workers to resume experience collection... (11800 times) [2024-06-27 18:59:05,968][06909] InferenceWorker_p0-w0: stopping experience collection (11800 times) [2024-06-27 18:59:05,968][06909] InferenceWorker_p0-w0: resuming experience collection (11800 times) [2024-06-27 18:59:08,690][06909] Updated weights for policy 0, policy_version 56513 (0.0040) [2024-06-27 18:59:08,850][06674] Fps is (10 sec: 42617.6, 60 sec: 43417.6, 300 sec: 43542.9). Total num frames: 925908992. Throughput: 0: 43634.3. Samples: 828881980. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 18:59:08,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 18:59:12,326][06909] Updated weights for policy 0, policy_version 56523 (0.0030) [2024-06-27 18:59:13,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.8, 300 sec: 43653.6). Total num frames: 926154752. Throughput: 0: 43755.0. Samples: 829014480. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 18:59:13,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 18:59:16,233][06909] Updated weights for policy 0, policy_version 56533 (0.0032) [2024-06-27 18:59:18,850][06674] Fps is (10 sec: 42596.9, 60 sec: 43417.3, 300 sec: 43487.0). Total num frames: 926334976. Throughput: 0: 43804.5. Samples: 829274920. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 18:59:18,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:59:19,795][06909] Updated weights for policy 0, policy_version 56543 (0.0044) [2024-06-27 18:59:23,850][06674] Fps is (10 sec: 39321.8, 60 sec: 43417.7, 300 sec: 43542.6). Total num frames: 926547968. Throughput: 0: 43673.0. Samples: 829535740. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-27 18:59:23,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:59:23,891][06909] Updated weights for policy 0, policy_version 56553 (0.0040) [2024-06-27 18:59:27,213][06909] Updated weights for policy 0, policy_version 56563 (0.0027) [2024-06-27 18:59:28,850][06674] Fps is (10 sec: 45876.9, 60 sec: 43690.9, 300 sec: 43598.1). Total num frames: 926793728. Throughput: 0: 43501.7. Samples: 829664140. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-27 18:59:28,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 18:59:31,288][06909] Updated weights for policy 0, policy_version 56573 (0.0042) [2024-06-27 18:59:33,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 926990336. Throughput: 0: 43575.1. Samples: 829926360. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-27 18:59:33,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 18:59:34,634][06909] Updated weights for policy 0, policy_version 56583 (0.0036) [2024-06-27 18:59:38,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43690.6, 300 sec: 43542.6). Total num frames: 927203328. Throughput: 0: 43456.9. Samples: 830184120. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-27 18:59:38,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 18:59:39,222][06909] Updated weights for policy 0, policy_version 56593 (0.0031) [2024-06-27 18:59:42,462][06909] Updated weights for policy 0, policy_version 56603 (0.0032) [2024-06-27 18:59:43,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43417.6, 300 sec: 43542.6). Total num frames: 927449088. Throughput: 0: 43328.8. Samples: 830314600. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-27 18:59:43,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:59:46,623][06909] Updated weights for policy 0, policy_version 56613 (0.0027) [2024-06-27 18:59:48,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43690.6, 300 sec: 43487.0). Total num frames: 927645696. Throughput: 0: 43259.5. Samples: 830572640. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-27 18:59:48,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:59:48,865][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000056619_927645696.pth... [2024-06-27 18:59:48,918][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000055982_917209088.pth [2024-06-27 18:59:50,211][06909] Updated weights for policy 0, policy_version 56623 (0.0037) [2024-06-27 18:59:53,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43422.0, 300 sec: 43542.6). Total num frames: 927858688. Throughput: 0: 43412.9. Samples: 830835560. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-27 18:59:53,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 18:59:53,971][06909] Updated weights for policy 0, policy_version 56633 (0.0028) [2024-06-27 18:59:57,415][06909] Updated weights for policy 0, policy_version 56643 (0.0020) [2024-06-27 18:59:58,852][06674] Fps is (10 sec: 45865.6, 60 sec: 43692.5, 300 sec: 43597.8). Total num frames: 928104448. Throughput: 0: 43436.6. Samples: 830969220. Policy #0 lag: (min: 1.0, avg: 10.4, max: 22.0) [2024-06-27 18:59:58,852][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:00:01,744][06909] Updated weights for policy 0, policy_version 56653 (0.0026) [2024-06-27 19:00:03,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43690.7, 300 sec: 43542.6). Total num frames: 928317440. Throughput: 0: 43502.2. Samples: 831232500. Policy #0 lag: (min: 1.0, avg: 10.4, max: 22.0) [2024-06-27 19:00:03,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 19:00:04,885][06909] Updated weights for policy 0, policy_version 56663 (0.0026) [2024-06-27 19:00:08,850][06674] Fps is (10 sec: 40968.3, 60 sec: 43417.6, 300 sec: 43543.5). Total num frames: 928514048. Throughput: 0: 43586.6. Samples: 831497140. Policy #0 lag: (min: 1.0, avg: 10.4, max: 22.0) [2024-06-27 19:00:08,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 19:00:08,960][06909] Updated weights for policy 0, policy_version 56673 (0.0029) [2024-06-27 19:00:12,260][06909] Updated weights for policy 0, policy_version 56683 (0.0036) [2024-06-27 19:00:13,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43144.5, 300 sec: 43542.6). Total num frames: 928743424. Throughput: 0: 43611.5. Samples: 831626660. Policy #0 lag: (min: 1.0, avg: 10.4, max: 22.0) [2024-06-27 19:00:13,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:00:16,346][06909] Updated weights for policy 0, policy_version 56693 (0.0025) [2024-06-27 19:00:18,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43964.1, 300 sec: 43542.6). Total num frames: 928972800. Throughput: 0: 43632.0. Samples: 831889800. Policy #0 lag: (min: 1.0, avg: 10.4, max: 22.0) [2024-06-27 19:00:18,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 19:00:19,586][06909] Updated weights for policy 0, policy_version 56703 (0.0039) [2024-06-27 19:00:23,850][06674] Fps is (10 sec: 42597.5, 60 sec: 43690.4, 300 sec: 43598.1). Total num frames: 929169408. Throughput: 0: 43678.5. Samples: 832149660. Policy #0 lag: (min: 1.0, avg: 10.4, max: 22.0) [2024-06-27 19:00:23,851][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 19:00:24,022][06909] Updated weights for policy 0, policy_version 56713 (0.0031) [2024-06-27 19:00:27,494][06909] Updated weights for policy 0, policy_version 56723 (0.0037) [2024-06-27 19:00:28,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43417.7, 300 sec: 43487.0). Total num frames: 929398784. Throughput: 0: 43649.8. Samples: 832278840. Policy #0 lag: (min: 1.0, avg: 10.4, max: 22.0) [2024-06-27 19:00:28,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:00:31,785][06909] Updated weights for policy 0, policy_version 56733 (0.0035) [2024-06-27 19:00:33,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43690.6, 300 sec: 43487.0). Total num frames: 929611776. Throughput: 0: 43613.7. Samples: 832535260. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 19:00:33,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 19:00:34,706][06887] Signal inference workers to stop experience collection... (11850 times) [2024-06-27 19:00:34,707][06887] Signal inference workers to resume experience collection... (11850 times) [2024-06-27 19:00:34,732][06909] InferenceWorker_p0-w0: stopping experience collection (11850 times) [2024-06-27 19:00:34,732][06909] InferenceWorker_p0-w0: resuming experience collection (11850 times) [2024-06-27 19:00:35,010][06909] Updated weights for policy 0, policy_version 56743 (0.0040) [2024-06-27 19:00:38,852][06674] Fps is (10 sec: 42589.4, 60 sec: 43689.2, 300 sec: 43597.8). Total num frames: 929824768. Throughput: 0: 43506.4. Samples: 832793440. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 19:00:38,853][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 19:00:39,526][06909] Updated weights for policy 0, policy_version 56753 (0.0035) [2024-06-27 19:00:42,669][06909] Updated weights for policy 0, policy_version 56763 (0.0023) [2024-06-27 19:00:43,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43417.6, 300 sec: 43542.6). Total num frames: 930054144. Throughput: 0: 43510.9. Samples: 832927120. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 19:00:43,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:00:46,844][06909] Updated weights for policy 0, policy_version 56773 (0.0036) [2024-06-27 19:00:48,850][06674] Fps is (10 sec: 44246.2, 60 sec: 43690.7, 300 sec: 43487.0). Total num frames: 930267136. Throughput: 0: 43432.4. Samples: 833186960. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 19:00:48,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 19:00:50,058][06909] Updated weights for policy 0, policy_version 56783 (0.0029) [2024-06-27 19:00:53,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.7, 300 sec: 43542.6). Total num frames: 930480128. Throughput: 0: 43361.8. Samples: 833448420. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 19:00:53,850][06674] Avg episode reward: [(0, '0.395')] [2024-06-27 19:00:54,174][06909] Updated weights for policy 0, policy_version 56793 (0.0024) [2024-06-27 19:00:57,353][06909] Updated weights for policy 0, policy_version 56803 (0.0033) [2024-06-27 19:00:58,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43419.1, 300 sec: 43487.0). Total num frames: 930709504. Throughput: 0: 43426.2. Samples: 833580840. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 19:00:58,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 19:01:01,599][06909] Updated weights for policy 0, policy_version 56813 (0.0030) [2024-06-27 19:01:03,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43417.6, 300 sec: 43487.0). Total num frames: 930922496. Throughput: 0: 43496.4. Samples: 833847140. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 19:01:03,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 19:01:05,037][06909] Updated weights for policy 0, policy_version 56823 (0.0027) [2024-06-27 19:01:08,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 931135488. Throughput: 0: 43588.7. Samples: 834111140. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-27 19:01:08,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 19:01:08,956][06909] Updated weights for policy 0, policy_version 56833 (0.0028) [2024-06-27 19:01:12,610][06909] Updated weights for policy 0, policy_version 56843 (0.0035) [2024-06-27 19:01:13,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.7, 300 sec: 43487.0). Total num frames: 931364864. Throughput: 0: 43581.3. Samples: 834240000. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-27 19:01:13,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 19:01:16,566][06909] Updated weights for policy 0, policy_version 56853 (0.0036) [2024-06-27 19:01:18,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43144.6, 300 sec: 43487.0). Total num frames: 931561472. Throughput: 0: 43678.8. Samples: 834500800. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-27 19:01:18,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 19:01:20,100][06909] Updated weights for policy 0, policy_version 56863 (0.0030) [2024-06-27 19:01:23,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.8, 300 sec: 43653.6). Total num frames: 931790848. Throughput: 0: 43737.1. Samples: 834761520. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-27 19:01:23,854][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:01:24,157][06909] Updated weights for policy 0, policy_version 56873 (0.0041) [2024-06-27 19:01:27,471][06909] Updated weights for policy 0, policy_version 56883 (0.0032) [2024-06-27 19:01:28,850][06674] Fps is (10 sec: 47513.2, 60 sec: 43963.7, 300 sec: 43598.1). Total num frames: 932036608. Throughput: 0: 43829.3. Samples: 834899440. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-27 19:01:28,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:01:31,851][06909] Updated weights for policy 0, policy_version 56893 (0.0033) [2024-06-27 19:01:33,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43417.7, 300 sec: 43487.0). Total num frames: 932216832. Throughput: 0: 43750.2. Samples: 835155720. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-27 19:01:33,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:01:34,914][06909] Updated weights for policy 0, policy_version 56903 (0.0038) [2024-06-27 19:01:38,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43692.2, 300 sec: 43598.1). Total num frames: 932446208. Throughput: 0: 43750.3. Samples: 835417180. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-27 19:01:38,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 19:01:39,218][06909] Updated weights for policy 0, policy_version 56913 (0.0029) [2024-06-27 19:01:42,397][06909] Updated weights for policy 0, policy_version 56923 (0.0037) [2024-06-27 19:01:43,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43690.7, 300 sec: 43542.6). Total num frames: 932675584. Throughput: 0: 43823.2. Samples: 835552880. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-27 19:01:43,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:01:46,466][06909] Updated weights for policy 0, policy_version 56933 (0.0041) [2024-06-27 19:01:48,852][06674] Fps is (10 sec: 44227.7, 60 sec: 43689.1, 300 sec: 43542.3). Total num frames: 932888576. Throughput: 0: 43626.9. Samples: 835810440. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-27 19:01:48,852][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 19:01:48,859][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000056939_932888576.pth... [2024-06-27 19:01:48,903][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000056302_922451968.pth [2024-06-27 19:01:50,240][06909] Updated weights for policy 0, policy_version 56943 (0.0027) [2024-06-27 19:01:53,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43417.6, 300 sec: 43542.6). Total num frames: 933085184. Throughput: 0: 43466.3. Samples: 836067120. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-27 19:01:53,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 19:01:54,318][06909] Updated weights for policy 0, policy_version 56953 (0.0034) [2024-06-27 19:01:57,759][06909] Updated weights for policy 0, policy_version 56963 (0.0026) [2024-06-27 19:01:58,852][06674] Fps is (10 sec: 45875.2, 60 sec: 43962.3, 300 sec: 43597.8). Total num frames: 933347328. Throughput: 0: 43476.2. Samples: 836196520. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-27 19:01:58,852][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:02:01,884][06909] Updated weights for policy 0, policy_version 56973 (0.0034) [2024-06-27 19:02:03,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43417.5, 300 sec: 43487.0). Total num frames: 933527552. Throughput: 0: 43511.4. Samples: 836458820. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-27 19:02:03,851][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 19:02:04,520][06887] Signal inference workers to stop experience collection... (11900 times) [2024-06-27 19:02:04,521][06887] Signal inference workers to resume experience collection... (11900 times) [2024-06-27 19:02:04,548][06909] InferenceWorker_p0-w0: stopping experience collection (11900 times) [2024-06-27 19:02:04,548][06909] InferenceWorker_p0-w0: resuming experience collection (11900 times) [2024-06-27 19:02:05,125][06909] Updated weights for policy 0, policy_version 56983 (0.0030) [2024-06-27 19:02:08,850][06674] Fps is (10 sec: 40968.0, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 933756928. Throughput: 0: 43388.8. Samples: 836714020. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-27 19:02:08,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 19:02:09,417][06909] Updated weights for policy 0, policy_version 56993 (0.0039) [2024-06-27 19:02:12,791][06909] Updated weights for policy 0, policy_version 57003 (0.0047) [2024-06-27 19:02:13,850][06674] Fps is (10 sec: 45875.8, 60 sec: 43690.7, 300 sec: 43542.6). Total num frames: 933986304. Throughput: 0: 43268.1. Samples: 836846500. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-27 19:02:13,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:02:17,009][06909] Updated weights for policy 0, policy_version 57013 (0.0034) [2024-06-27 19:02:18,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.6, 300 sec: 43487.3). Total num frames: 934182912. Throughput: 0: 43492.8. Samples: 837112900. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 19:02:18,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:02:20,134][06909] Updated weights for policy 0, policy_version 57023 (0.0032) [2024-06-27 19:02:23,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.7, 300 sec: 43542.6). Total num frames: 934412288. Throughput: 0: 43601.7. Samples: 837379260. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 19:02:23,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:02:24,459][06909] Updated weights for policy 0, policy_version 57033 (0.0033) [2024-06-27 19:02:27,709][06909] Updated weights for policy 0, policy_version 57043 (0.0034) [2024-06-27 19:02:28,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43417.7, 300 sec: 43598.1). Total num frames: 934641664. Throughput: 0: 43514.2. Samples: 837511020. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 19:02:28,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 19:02:31,760][06909] Updated weights for policy 0, policy_version 57053 (0.0028) [2024-06-27 19:02:33,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43417.6, 300 sec: 43487.2). Total num frames: 934821888. Throughput: 0: 43499.2. Samples: 837767820. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 19:02:33,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:02:35,206][06909] Updated weights for policy 0, policy_version 57063 (0.0037) [2024-06-27 19:02:38,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 935067648. Throughput: 0: 43676.0. Samples: 838032540. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 19:02:38,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:02:39,066][06909] Updated weights for policy 0, policy_version 57073 (0.0030) [2024-06-27 19:02:42,717][06909] Updated weights for policy 0, policy_version 57083 (0.0034) [2024-06-27 19:02:43,850][06674] Fps is (10 sec: 47513.9, 60 sec: 43690.6, 300 sec: 43542.9). Total num frames: 935297024. Throughput: 0: 43679.3. Samples: 838162000. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 19:02:43,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 19:02:46,687][06909] Updated weights for policy 0, policy_version 57093 (0.0027) [2024-06-27 19:02:48,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43146.0, 300 sec: 43542.6). Total num frames: 935477248. Throughput: 0: 43636.6. Samples: 838422460. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 19:02:48,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 19:02:50,313][06909] Updated weights for policy 0, policy_version 57103 (0.0039) [2024-06-27 19:02:53,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43963.6, 300 sec: 43598.1). Total num frames: 935723008. Throughput: 0: 43609.8. Samples: 838676460. Policy #0 lag: (min: 0.0, avg: 12.2, max: 23.0) [2024-06-27 19:02:53,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:02:54,608][06909] Updated weights for policy 0, policy_version 57113 (0.0031) [2024-06-27 19:02:57,670][06909] Updated weights for policy 0, policy_version 57123 (0.0021) [2024-06-27 19:02:58,850][06674] Fps is (10 sec: 49152.0, 60 sec: 43692.2, 300 sec: 43709.2). Total num frames: 935968768. Throughput: 0: 43672.4. Samples: 838811760. Policy #0 lag: (min: 0.0, avg: 12.2, max: 23.0) [2024-06-27 19:02:58,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 19:03:02,079][06909] Updated weights for policy 0, policy_version 57133 (0.0030) [2024-06-27 19:03:03,850][06674] Fps is (10 sec: 40960.6, 60 sec: 43417.7, 300 sec: 43487.0). Total num frames: 936132608. Throughput: 0: 43573.0. Samples: 839073680. Policy #0 lag: (min: 0.0, avg: 12.2, max: 23.0) [2024-06-27 19:03:03,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:03:05,133][06909] Updated weights for policy 0, policy_version 57143 (0.0039) [2024-06-27 19:03:08,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 936378368. Throughput: 0: 43461.3. Samples: 839335020. Policy #0 lag: (min: 0.0, avg: 12.2, max: 23.0) [2024-06-27 19:03:08,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:03:09,761][06909] Updated weights for policy 0, policy_version 57153 (0.0031) [2024-06-27 19:03:12,689][06909] Updated weights for policy 0, policy_version 57163 (0.0037) [2024-06-27 19:03:13,850][06674] Fps is (10 sec: 47512.9, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 936607744. Throughput: 0: 43445.2. Samples: 839466060. Policy #0 lag: (min: 0.0, avg: 12.2, max: 23.0) [2024-06-27 19:03:13,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:03:16,510][06887] Signal inference workers to stop experience collection... (11950 times) [2024-06-27 19:03:16,511][06887] Signal inference workers to resume experience collection... (11950 times) [2024-06-27 19:03:16,561][06909] InferenceWorker_p0-w0: stopping experience collection (11950 times) [2024-06-27 19:03:16,561][06909] InferenceWorker_p0-w0: resuming experience collection (11950 times) [2024-06-27 19:03:17,117][06909] Updated weights for policy 0, policy_version 57173 (0.0044) [2024-06-27 19:03:18,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43417.6, 300 sec: 43542.6). Total num frames: 936787968. Throughput: 0: 43344.9. Samples: 839718340. Policy #0 lag: (min: 0.0, avg: 12.2, max: 23.0) [2024-06-27 19:03:18,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:03:20,360][06909] Updated weights for policy 0, policy_version 57183 (0.0031) [2024-06-27 19:03:23,852][06674] Fps is (10 sec: 40952.0, 60 sec: 43416.1, 300 sec: 43542.3). Total num frames: 937017344. Throughput: 0: 43180.2. Samples: 839975740. Policy #0 lag: (min: 0.0, avg: 12.2, max: 23.0) [2024-06-27 19:03:23,852][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:03:24,592][06909] Updated weights for policy 0, policy_version 57193 (0.0036) [2024-06-27 19:03:28,059][06909] Updated weights for policy 0, policy_version 57203 (0.0040) [2024-06-27 19:03:28,850][06674] Fps is (10 sec: 47514.2, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 937263104. Throughput: 0: 43378.3. Samples: 840114020. Policy #0 lag: (min: 0.0, avg: 10.6, max: 24.0) [2024-06-27 19:03:28,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:03:31,918][06909] Updated weights for policy 0, policy_version 57213 (0.0032) [2024-06-27 19:03:33,850][06674] Fps is (10 sec: 40968.1, 60 sec: 43417.6, 300 sec: 43542.6). Total num frames: 937426944. Throughput: 0: 43299.4. Samples: 840370940. Policy #0 lag: (min: 0.0, avg: 10.6, max: 24.0) [2024-06-27 19:03:33,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:03:35,771][06909] Updated weights for policy 0, policy_version 57223 (0.0024) [2024-06-27 19:03:38,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43417.6, 300 sec: 43487.0). Total num frames: 937672704. Throughput: 0: 43480.1. Samples: 840633060. Policy #0 lag: (min: 0.0, avg: 10.6, max: 24.0) [2024-06-27 19:03:38,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:03:39,807][06909] Updated weights for policy 0, policy_version 57233 (0.0035) [2024-06-27 19:03:43,137][06909] Updated weights for policy 0, policy_version 57243 (0.0032) [2024-06-27 19:03:43,850][06674] Fps is (10 sec: 47514.3, 60 sec: 43417.6, 300 sec: 43653.6). Total num frames: 937902080. Throughput: 0: 43477.4. Samples: 840768240. Policy #0 lag: (min: 0.0, avg: 10.6, max: 24.0) [2024-06-27 19:03:43,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:03:47,182][06909] Updated weights for policy 0, policy_version 57253 (0.0037) [2024-06-27 19:03:48,852][06674] Fps is (10 sec: 40951.3, 60 sec: 43416.0, 300 sec: 43487.6). Total num frames: 938082304. Throughput: 0: 43452.1. Samples: 841029120. Policy #0 lag: (min: 0.0, avg: 10.6, max: 24.0) [2024-06-27 19:03:48,853][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 19:03:48,916][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000057257_938098688.pth... [2024-06-27 19:03:48,970][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000056619_927645696.pth [2024-06-27 19:03:50,651][06909] Updated weights for policy 0, policy_version 57263 (0.0040) [2024-06-27 19:03:53,850][06674] Fps is (10 sec: 40959.3, 60 sec: 43144.5, 300 sec: 43487.7). Total num frames: 938311680. Throughput: 0: 43379.9. Samples: 841287120. Policy #0 lag: (min: 0.0, avg: 10.6, max: 24.0) [2024-06-27 19:03:53,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:03:54,582][06909] Updated weights for policy 0, policy_version 57273 (0.0033) [2024-06-27 19:03:58,072][06909] Updated weights for policy 0, policy_version 57283 (0.0033) [2024-06-27 19:03:58,850][06674] Fps is (10 sec: 47522.7, 60 sec: 43144.3, 300 sec: 43598.1). Total num frames: 938557440. Throughput: 0: 43522.1. Samples: 841424560. Policy #0 lag: (min: 0.0, avg: 10.6, max: 24.0) [2024-06-27 19:03:58,851][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 19:04:02,288][06909] Updated weights for policy 0, policy_version 57293 (0.0020) [2024-06-27 19:04:03,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43144.4, 300 sec: 43431.5). Total num frames: 938721280. Throughput: 0: 43539.1. Samples: 841677600. Policy #0 lag: (min: 0.0, avg: 10.6, max: 24.0) [2024-06-27 19:04:03,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:04:05,461][06909] Updated weights for policy 0, policy_version 57303 (0.0026) [2024-06-27 19:04:08,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43417.6, 300 sec: 43487.0). Total num frames: 938983424. Throughput: 0: 43573.0. Samples: 841936440. Policy #0 lag: (min: 0.0, avg: 7.9, max: 22.0) [2024-06-27 19:04:08,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:04:09,654][06909] Updated weights for policy 0, policy_version 57313 (0.0024) [2024-06-27 19:04:13,389][06909] Updated weights for policy 0, policy_version 57323 (0.0029) [2024-06-27 19:04:13,850][06674] Fps is (10 sec: 47513.5, 60 sec: 43144.5, 300 sec: 43598.1). Total num frames: 939196416. Throughput: 0: 43575.0. Samples: 842074900. Policy #0 lag: (min: 0.0, avg: 7.9, max: 22.0) [2024-06-27 19:04:13,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:04:17,029][06909] Updated weights for policy 0, policy_version 57333 (0.0032) [2024-06-27 19:04:18,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43417.7, 300 sec: 43542.6). Total num frames: 939393024. Throughput: 0: 43613.0. Samples: 842333520. Policy #0 lag: (min: 0.0, avg: 7.9, max: 22.0) [2024-06-27 19:04:18,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:04:20,848][06909] Updated weights for policy 0, policy_version 57343 (0.0032) [2024-06-27 19:04:21,035][06887] Signal inference workers to stop experience collection... (12000 times) [2024-06-27 19:04:21,035][06887] Signal inference workers to resume experience collection... (12000 times) [2024-06-27 19:04:21,053][06909] InferenceWorker_p0-w0: stopping experience collection (12000 times) [2024-06-27 19:04:21,053][06909] InferenceWorker_p0-w0: resuming experience collection (12000 times) [2024-06-27 19:04:23,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43419.1, 300 sec: 43487.0). Total num frames: 939622400. Throughput: 0: 43607.1. Samples: 842595380. Policy #0 lag: (min: 0.0, avg: 7.9, max: 22.0) [2024-06-27 19:04:23,851][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 19:04:24,746][06909] Updated weights for policy 0, policy_version 57353 (0.0038) [2024-06-27 19:04:28,195][06909] Updated weights for policy 0, policy_version 57363 (0.0041) [2024-06-27 19:04:28,850][06674] Fps is (10 sec: 47513.4, 60 sec: 43417.6, 300 sec: 43653.6). Total num frames: 939868160. Throughput: 0: 43538.1. Samples: 842727460. Policy #0 lag: (min: 0.0, avg: 7.9, max: 22.0) [2024-06-27 19:04:28,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:04:32,091][06909] Updated weights for policy 0, policy_version 57373 (0.0030) [2024-06-27 19:04:33,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.7, 300 sec: 43542.6). Total num frames: 940048384. Throughput: 0: 43533.2. Samples: 842988020. Policy #0 lag: (min: 0.0, avg: 7.9, max: 22.0) [2024-06-27 19:04:33,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:04:35,493][06909] Updated weights for policy 0, policy_version 57383 (0.0031) [2024-06-27 19:04:38,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43417.6, 300 sec: 43487.0). Total num frames: 940277760. Throughput: 0: 43643.6. Samples: 843251080. Policy #0 lag: (min: 0.0, avg: 7.9, max: 22.0) [2024-06-27 19:04:38,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:04:39,716][06909] Updated weights for policy 0, policy_version 57393 (0.0042) [2024-06-27 19:04:43,048][06909] Updated weights for policy 0, policy_version 57403 (0.0036) [2024-06-27 19:04:43,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 940507136. Throughput: 0: 43570.9. Samples: 843385240. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 19:04:43,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:04:46,996][06909] Updated weights for policy 0, policy_version 57413 (0.0045) [2024-06-27 19:04:48,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43965.2, 300 sec: 43598.1). Total num frames: 940720128. Throughput: 0: 43715.5. Samples: 843644800. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 19:04:48,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:04:50,666][06909] Updated weights for policy 0, policy_version 57423 (0.0033) [2024-06-27 19:04:53,852][06674] Fps is (10 sec: 44227.5, 60 sec: 43962.3, 300 sec: 43542.6). Total num frames: 940949504. Throughput: 0: 43778.9. Samples: 843906580. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 19:04:53,852][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:04:54,802][06909] Updated weights for policy 0, policy_version 57433 (0.0023) [2024-06-27 19:04:58,447][06909] Updated weights for policy 0, policy_version 57443 (0.0037) [2024-06-27 19:04:58,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43417.8, 300 sec: 43542.6). Total num frames: 941162496. Throughput: 0: 43618.4. Samples: 844037720. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 19:04:58,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:05:02,103][06909] Updated weights for policy 0, policy_version 57453 (0.0029) [2024-06-27 19:05:03,850][06674] Fps is (10 sec: 42606.7, 60 sec: 44236.8, 300 sec: 43598.1). Total num frames: 941375488. Throughput: 0: 43733.2. Samples: 844301520. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 19:05:03,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:05:05,821][06909] Updated weights for policy 0, policy_version 57463 (0.0030) [2024-06-27 19:05:08,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43417.6, 300 sec: 43542.6). Total num frames: 941588480. Throughput: 0: 43824.5. Samples: 844567480. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 19:05:08,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 19:05:09,419][06909] Updated weights for policy 0, policy_version 57473 (0.0029) [2024-06-27 19:05:13,057][06909] Updated weights for policy 0, policy_version 57483 (0.0035) [2024-06-27 19:05:13,851][06674] Fps is (10 sec: 45872.1, 60 sec: 43963.2, 300 sec: 43598.0). Total num frames: 941834240. Throughput: 0: 43841.5. Samples: 844700360. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 19:05:13,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 19:05:16,767][06909] Updated weights for policy 0, policy_version 57493 (0.0031) [2024-06-27 19:05:18,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44236.7, 300 sec: 43653.7). Total num frames: 942047232. Throughput: 0: 43902.2. Samples: 844963620. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 19:05:18,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:05:20,343][06909] Updated weights for policy 0, policy_version 57503 (0.0028) [2024-06-27 19:05:23,850][06674] Fps is (10 sec: 42601.6, 60 sec: 43963.7, 300 sec: 43598.1). Total num frames: 942260224. Throughput: 0: 43895.1. Samples: 845226360. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 19:05:23,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:05:24,612][06909] Updated weights for policy 0, policy_version 57513 (0.0026) [2024-06-27 19:05:27,868][06909] Updated weights for policy 0, policy_version 57523 (0.0039) [2024-06-27 19:05:28,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43417.7, 300 sec: 43598.1). Total num frames: 942473216. Throughput: 0: 43747.2. Samples: 845353860. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 19:05:28,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 19:05:31,998][06909] Updated weights for policy 0, policy_version 57533 (0.0031) [2024-06-27 19:05:33,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 43598.4). Total num frames: 942686208. Throughput: 0: 43843.6. Samples: 845617760. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 19:05:33,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:05:35,737][06909] Updated weights for policy 0, policy_version 57543 (0.0030) [2024-06-27 19:05:38,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43690.7, 300 sec: 43542.6). Total num frames: 942899200. Throughput: 0: 43583.4. Samples: 845867740. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 19:05:38,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:05:39,822][06909] Updated weights for policy 0, policy_version 57553 (0.0024) [2024-06-27 19:05:43,505][06909] Updated weights for policy 0, policy_version 57563 (0.0031) [2024-06-27 19:05:43,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43417.6, 300 sec: 43542.6). Total num frames: 943112192. Throughput: 0: 43608.8. Samples: 846000120. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 19:05:43,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:05:47,345][06909] Updated weights for policy 0, policy_version 57573 (0.0025) [2024-06-27 19:05:48,764][06887] Signal inference workers to stop experience collection... (12050 times) [2024-06-27 19:05:48,765][06887] Signal inference workers to resume experience collection... (12050 times) [2024-06-27 19:05:48,812][06909] InferenceWorker_p0-w0: stopping experience collection (12050 times) [2024-06-27 19:05:48,812][06909] InferenceWorker_p0-w0: resuming experience collection (12050 times) [2024-06-27 19:05:48,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 943341568. Throughput: 0: 43551.5. Samples: 846261340. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 19:05:48,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:05:48,901][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000057578_943357952.pth... [2024-06-27 19:05:48,951][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000056939_932888576.pth [2024-06-27 19:05:50,799][06909] Updated weights for policy 0, policy_version 57583 (0.0031) [2024-06-27 19:05:53,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43146.1, 300 sec: 43487.0). Total num frames: 943538176. Throughput: 0: 43620.5. Samples: 846530400. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 19:05:53,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:05:55,012][06909] Updated weights for policy 0, policy_version 57593 (0.0039) [2024-06-27 19:05:58,274][06909] Updated weights for policy 0, policy_version 57603 (0.0031) [2024-06-27 19:05:58,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 943783936. Throughput: 0: 43373.6. Samples: 846652140. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 19:05:58,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:06:02,353][06909] Updated weights for policy 0, policy_version 57613 (0.0027) [2024-06-27 19:06:03,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43690.8, 300 sec: 43598.1). Total num frames: 943996928. Throughput: 0: 43287.2. Samples: 846911540. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 19:06:03,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 19:06:05,834][06909] Updated weights for policy 0, policy_version 57623 (0.0022) [2024-06-27 19:06:08,850][06674] Fps is (10 sec: 39322.0, 60 sec: 43144.6, 300 sec: 43431.5). Total num frames: 944177152. Throughput: 0: 43350.8. Samples: 847177140. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 19:06:08,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:06:09,880][06909] Updated weights for policy 0, policy_version 57633 (0.0032) [2024-06-27 19:06:13,115][06909] Updated weights for policy 0, policy_version 57643 (0.0036) [2024-06-27 19:06:13,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43145.0, 300 sec: 43598.1). Total num frames: 944422912. Throughput: 0: 43274.9. Samples: 847301240. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 19:06:13,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:06:17,640][06909] Updated weights for policy 0, policy_version 57653 (0.0040) [2024-06-27 19:06:18,850][06674] Fps is (10 sec: 47512.6, 60 sec: 43417.5, 300 sec: 43598.1). Total num frames: 944652288. Throughput: 0: 43331.9. Samples: 847567700. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 19:06:18,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 19:06:21,325][06909] Updated weights for policy 0, policy_version 57663 (0.0043) [2024-06-27 19:06:23,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43144.5, 300 sec: 43431.5). Total num frames: 944848896. Throughput: 0: 43521.7. Samples: 847826220. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-27 19:06:23,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 19:06:25,049][06909] Updated weights for policy 0, policy_version 57673 (0.0033) [2024-06-27 19:06:28,652][06909] Updated weights for policy 0, policy_version 57683 (0.0043) [2024-06-27 19:06:28,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43417.5, 300 sec: 43598.1). Total num frames: 945078272. Throughput: 0: 43392.4. Samples: 847952780. Policy #0 lag: (min: 1.0, avg: 11.6, max: 21.0) [2024-06-27 19:06:28,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:06:32,650][06909] Updated weights for policy 0, policy_version 57693 (0.0032) [2024-06-27 19:06:33,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 945307648. Throughput: 0: 43612.2. Samples: 848223880. Policy #0 lag: (min: 1.0, avg: 11.6, max: 21.0) [2024-06-27 19:06:33,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:06:36,002][06909] Updated weights for policy 0, policy_version 57703 (0.0032) [2024-06-27 19:06:38,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43417.5, 300 sec: 43487.0). Total num frames: 945504256. Throughput: 0: 43486.1. Samples: 848487280. Policy #0 lag: (min: 1.0, avg: 11.6, max: 21.0) [2024-06-27 19:06:38,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 19:06:40,016][06909] Updated weights for policy 0, policy_version 57713 (0.0025) [2024-06-27 19:06:43,270][06909] Updated weights for policy 0, policy_version 57723 (0.0031) [2024-06-27 19:06:43,856][06674] Fps is (10 sec: 44210.0, 60 sec: 43959.3, 300 sec: 43597.5). Total num frames: 945750016. Throughput: 0: 43632.9. Samples: 848615880. Policy #0 lag: (min: 1.0, avg: 11.6, max: 21.0) [2024-06-27 19:06:43,856][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 19:06:47,636][06909] Updated weights for policy 0, policy_version 57733 (0.0034) [2024-06-27 19:06:48,852][06674] Fps is (10 sec: 47504.1, 60 sec: 43962.3, 300 sec: 43708.9). Total num frames: 945979392. Throughput: 0: 43819.7. Samples: 848883520. Policy #0 lag: (min: 1.0, avg: 11.6, max: 21.0) [2024-06-27 19:06:48,852][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 19:06:50,593][06909] Updated weights for policy 0, policy_version 57743 (0.0028) [2024-06-27 19:06:53,852][06674] Fps is (10 sec: 42615.5, 60 sec: 43962.2, 300 sec: 43487.0). Total num frames: 946176000. Throughput: 0: 43818.4. Samples: 849149060. Policy #0 lag: (min: 1.0, avg: 11.6, max: 21.0) [2024-06-27 19:06:53,853][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 19:06:55,152][06909] Updated weights for policy 0, policy_version 57753 (0.0039) [2024-06-27 19:06:58,538][06909] Updated weights for policy 0, policy_version 57763 (0.0050) [2024-06-27 19:06:58,850][06674] Fps is (10 sec: 40968.6, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 946388992. Throughput: 0: 43706.3. Samples: 849268020. Policy #0 lag: (min: 1.0, avg: 11.6, max: 21.0) [2024-06-27 19:06:58,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:07:02,909][06909] Updated weights for policy 0, policy_version 57773 (0.0032) [2024-06-27 19:07:03,850][06674] Fps is (10 sec: 44245.2, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 946618368. Throughput: 0: 43712.9. Samples: 849534780. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-27 19:07:03,851][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 19:07:06,225][06909] Updated weights for policy 0, policy_version 57783 (0.0029) [2024-06-27 19:07:08,852][06674] Fps is (10 sec: 42589.7, 60 sec: 43962.2, 300 sec: 43486.7). Total num frames: 946814976. Throughput: 0: 43598.9. Samples: 849788260. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-27 19:07:08,852][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 19:07:10,347][06909] Updated weights for policy 0, policy_version 57793 (0.0030) [2024-06-27 19:07:13,718][06909] Updated weights for policy 0, policy_version 57803 (0.0025) [2024-06-27 19:07:13,853][06674] Fps is (10 sec: 42583.6, 60 sec: 43688.1, 300 sec: 43597.6). Total num frames: 947044352. Throughput: 0: 43719.2. Samples: 849920300. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-27 19:07:13,854][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:07:16,612][06887] Signal inference workers to stop experience collection... (12100 times) [2024-06-27 19:07:16,612][06887] Signal inference workers to resume experience collection... (12100 times) [2024-06-27 19:07:16,624][06909] InferenceWorker_p0-w0: stopping experience collection (12100 times) [2024-06-27 19:07:16,625][06909] InferenceWorker_p0-w0: resuming experience collection (12100 times) [2024-06-27 19:07:17,616][06909] Updated weights for policy 0, policy_version 57813 (0.0037) [2024-06-27 19:07:18,850][06674] Fps is (10 sec: 44245.9, 60 sec: 43417.7, 300 sec: 43542.6). Total num frames: 947257344. Throughput: 0: 43568.4. Samples: 850184460. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-27 19:07:18,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:07:21,106][06909] Updated weights for policy 0, policy_version 57823 (0.0043) [2024-06-27 19:07:23,850][06674] Fps is (10 sec: 40974.6, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 947453952. Throughput: 0: 43499.6. Samples: 850444760. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-27 19:07:23,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 19:07:25,390][06909] Updated weights for policy 0, policy_version 57833 (0.0043) [2024-06-27 19:07:28,598][06909] Updated weights for policy 0, policy_version 57843 (0.0023) [2024-06-27 19:07:28,850][06674] Fps is (10 sec: 45874.3, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 947716096. Throughput: 0: 43430.1. Samples: 850569980. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-27 19:07:28,856][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 19:07:32,776][06909] Updated weights for policy 0, policy_version 57853 (0.0051) [2024-06-27 19:07:33,850][06674] Fps is (10 sec: 47513.7, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 947929088. Throughput: 0: 43514.4. Samples: 850841580. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-27 19:07:33,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 19:07:35,999][06909] Updated weights for policy 0, policy_version 57863 (0.0033) [2024-06-27 19:07:38,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43690.6, 300 sec: 43487.0). Total num frames: 948125696. Throughput: 0: 43226.7. Samples: 851094180. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-27 19:07:38,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:07:40,266][06909] Updated weights for policy 0, policy_version 57873 (0.0033) [2024-06-27 19:07:43,585][06909] Updated weights for policy 0, policy_version 57883 (0.0033) [2024-06-27 19:07:43,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43422.0, 300 sec: 43653.6). Total num frames: 948355072. Throughput: 0: 43582.7. Samples: 851229240. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-27 19:07:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:07:47,922][06909] Updated weights for policy 0, policy_version 57893 (0.0031) [2024-06-27 19:07:48,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43419.0, 300 sec: 43598.1). Total num frames: 948584448. Throughput: 0: 43500.4. Samples: 851492300. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-27 19:07:48,851][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 19:07:48,868][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000057897_948584448.pth... [2024-06-27 19:07:48,931][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000057257_938098688.pth [2024-06-27 19:07:51,256][06909] Updated weights for policy 0, policy_version 57903 (0.0037) [2024-06-27 19:07:53,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43146.0, 300 sec: 43375.9). Total num frames: 948764672. Throughput: 0: 43628.6. Samples: 851751460. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-27 19:07:53,851][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 19:07:55,401][06909] Updated weights for policy 0, policy_version 57913 (0.0026) [2024-06-27 19:07:58,588][06909] Updated weights for policy 0, policy_version 57923 (0.0024) [2024-06-27 19:07:58,850][06674] Fps is (10 sec: 44237.8, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 949026816. Throughput: 0: 43699.1. Samples: 851886600. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-27 19:07:58,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:08:02,815][06909] Updated weights for policy 0, policy_version 57933 (0.0028) [2024-06-27 19:08:03,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43417.7, 300 sec: 43542.6). Total num frames: 949223424. Throughput: 0: 43662.2. Samples: 852149260. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-27 19:08:03,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:08:05,992][06909] Updated weights for policy 0, policy_version 57943 (0.0036) [2024-06-27 19:08:08,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43692.1, 300 sec: 43487.0). Total num frames: 949436416. Throughput: 0: 43604.0. Samples: 852406940. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-27 19:08:08,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:08:10,426][06909] Updated weights for policy 0, policy_version 57953 (0.0036) [2024-06-27 19:08:13,496][06909] Updated weights for policy 0, policy_version 57963 (0.0033) [2024-06-27 19:08:13,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43693.2, 300 sec: 43653.6). Total num frames: 949665792. Throughput: 0: 43667.6. Samples: 852535020. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-27 19:08:13,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:08:17,841][06909] Updated weights for policy 0, policy_version 57973 (0.0048) [2024-06-27 19:08:18,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43690.5, 300 sec: 43598.4). Total num frames: 949878784. Throughput: 0: 43639.4. Samples: 852805360. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 19:08:18,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:08:21,150][06909] Updated weights for policy 0, policy_version 57983 (0.0042) [2024-06-27 19:08:23,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43690.7, 300 sec: 43431.5). Total num frames: 950075392. Throughput: 0: 43770.8. Samples: 853063860. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 19:08:23,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:08:25,583][06909] Updated weights for policy 0, policy_version 57993 (0.0037) [2024-06-27 19:08:28,417][06909] Updated weights for policy 0, policy_version 58003 (0.0032) [2024-06-27 19:08:28,850][06674] Fps is (10 sec: 44237.6, 60 sec: 43417.7, 300 sec: 43709.2). Total num frames: 950321152. Throughput: 0: 43625.4. Samples: 853192380. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 19:08:28,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:08:32,857][06909] Updated weights for policy 0, policy_version 58013 (0.0022) [2024-06-27 19:08:33,228][06887] Signal inference workers to stop experience collection... (12150 times) [2024-06-27 19:08:33,232][06887] Signal inference workers to resume experience collection... (12150 times) [2024-06-27 19:08:33,268][06909] InferenceWorker_p0-w0: stopping experience collection (12150 times) [2024-06-27 19:08:33,268][06909] InferenceWorker_p0-w0: resuming experience collection (12150 times) [2024-06-27 19:08:33,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43417.7, 300 sec: 43598.1). Total num frames: 950534144. Throughput: 0: 43739.8. Samples: 853460580. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 19:08:33,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:08:36,061][06909] Updated weights for policy 0, policy_version 58023 (0.0030) [2024-06-27 19:08:38,852][06674] Fps is (10 sec: 42589.5, 60 sec: 43689.3, 300 sec: 43542.2). Total num frames: 950747136. Throughput: 0: 43846.5. Samples: 853724640. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 19:08:38,852][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:08:40,390][06909] Updated weights for policy 0, policy_version 58033 (0.0032) [2024-06-27 19:08:43,307][06909] Updated weights for policy 0, policy_version 58043 (0.0030) [2024-06-27 19:08:43,850][06674] Fps is (10 sec: 47512.9, 60 sec: 44236.8, 300 sec: 43820.6). Total num frames: 951009280. Throughput: 0: 43787.9. Samples: 853857060. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 19:08:43,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 19:08:47,974][06909] Updated weights for policy 0, policy_version 58053 (0.0037) [2024-06-27 19:08:48,850][06674] Fps is (10 sec: 45884.4, 60 sec: 43690.8, 300 sec: 43709.2). Total num frames: 951205888. Throughput: 0: 43727.0. Samples: 854116980. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 19:08:48,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:08:50,754][06909] Updated weights for policy 0, policy_version 58063 (0.0032) [2024-06-27 19:08:53,850][06674] Fps is (10 sec: 40960.0, 60 sec: 44236.8, 300 sec: 43598.1). Total num frames: 951418880. Throughput: 0: 43889.8. Samples: 854381980. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-27 19:08:53,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:08:55,234][06909] Updated weights for policy 0, policy_version 58073 (0.0038) [2024-06-27 19:08:58,250][06909] Updated weights for policy 0, policy_version 58083 (0.0029) [2024-06-27 19:08:58,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 951664640. Throughput: 0: 43966.7. Samples: 854513520. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-27 19:08:58,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:09:02,756][06909] Updated weights for policy 0, policy_version 58093 (0.0036) [2024-06-27 19:09:03,852][06674] Fps is (10 sec: 42590.2, 60 sec: 43689.2, 300 sec: 43597.8). Total num frames: 951844864. Throughput: 0: 43769.8. Samples: 854775080. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-27 19:09:03,852][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:09:05,685][06909] Updated weights for policy 0, policy_version 58103 (0.0033) [2024-06-27 19:09:08,850][06674] Fps is (10 sec: 39321.9, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 952057856. Throughput: 0: 43835.1. Samples: 855036440. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-27 19:09:08,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:09:10,098][06909] Updated weights for policy 0, policy_version 58113 (0.0035) [2024-06-27 19:09:13,349][06909] Updated weights for policy 0, policy_version 58123 (0.0031) [2024-06-27 19:09:13,850][06674] Fps is (10 sec: 45884.2, 60 sec: 43963.8, 300 sec: 43764.7). Total num frames: 952303616. Throughput: 0: 43899.9. Samples: 855167880. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-27 19:09:13,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:09:17,811][06909] Updated weights for policy 0, policy_version 58133 (0.0032) [2024-06-27 19:09:18,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 952500224. Throughput: 0: 43819.3. Samples: 855432460. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-27 19:09:18,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:09:20,943][06909] Updated weights for policy 0, policy_version 58143 (0.0025) [2024-06-27 19:09:23,851][06674] Fps is (10 sec: 42591.4, 60 sec: 44235.6, 300 sec: 43597.9). Total num frames: 952729600. Throughput: 0: 43817.7. Samples: 855696420. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-27 19:09:23,852][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:09:25,238][06909] Updated weights for policy 0, policy_version 58153 (0.0020) [2024-06-27 19:09:28,406][06909] Updated weights for policy 0, policy_version 58163 (0.0044) [2024-06-27 19:09:28,850][06674] Fps is (10 sec: 47514.0, 60 sec: 44236.8, 300 sec: 43820.3). Total num frames: 952975360. Throughput: 0: 43777.4. Samples: 855827040. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-27 19:09:28,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:09:32,587][06909] Updated weights for policy 0, policy_version 58173 (0.0026) [2024-06-27 19:09:33,851][06674] Fps is (10 sec: 44240.1, 60 sec: 43963.0, 300 sec: 43709.0). Total num frames: 953171968. Throughput: 0: 43945.8. Samples: 856094580. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-27 19:09:33,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:09:35,737][06909] Updated weights for policy 0, policy_version 58183 (0.0038) [2024-06-27 19:09:38,850][06674] Fps is (10 sec: 40959.4, 60 sec: 43965.1, 300 sec: 43653.6). Total num frames: 953384960. Throughput: 0: 43787.0. Samples: 856352400. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-27 19:09:38,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:09:40,022][06909] Updated weights for policy 0, policy_version 58193 (0.0025) [2024-06-27 19:09:43,274][06909] Updated weights for policy 0, policy_version 58203 (0.0031) [2024-06-27 19:09:43,850][06674] Fps is (10 sec: 44241.1, 60 sec: 43417.7, 300 sec: 43709.2). Total num frames: 953614336. Throughput: 0: 43755.7. Samples: 856482520. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-27 19:09:43,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:09:47,313][06909] Updated weights for policy 0, policy_version 58213 (0.0040) [2024-06-27 19:09:48,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.5, 300 sec: 43653.9). Total num frames: 953827328. Throughput: 0: 43884.8. Samples: 856749820. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-27 19:09:48,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:09:48,864][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000058217_953827328.pth... [2024-06-27 19:09:48,907][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000057578_943357952.pth [2024-06-27 19:09:50,710][06909] Updated weights for policy 0, policy_version 58223 (0.0031) [2024-06-27 19:09:53,852][06674] Fps is (10 sec: 42589.3, 60 sec: 43689.2, 300 sec: 43653.3). Total num frames: 954040320. Throughput: 0: 43858.8. Samples: 857010180. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-27 19:09:53,852][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 19:09:54,952][06909] Updated weights for policy 0, policy_version 58233 (0.0038) [2024-06-27 19:09:57,452][06887] Signal inference workers to stop experience collection... (12200 times) [2024-06-27 19:09:57,452][06887] Signal inference workers to resume experience collection... (12200 times) [2024-06-27 19:09:57,476][06909] InferenceWorker_p0-w0: stopping experience collection (12200 times) [2024-06-27 19:09:57,476][06909] InferenceWorker_p0-w0: resuming experience collection (12200 times) [2024-06-27 19:09:58,166][06909] Updated weights for policy 0, policy_version 58243 (0.0044) [2024-06-27 19:09:58,850][06674] Fps is (10 sec: 45876.3, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 954286080. Throughput: 0: 43788.5. Samples: 857138360. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-27 19:09:58,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:10:02,554][06909] Updated weights for policy 0, policy_version 58253 (0.0038) [2024-06-27 19:10:03,850][06674] Fps is (10 sec: 44246.4, 60 sec: 43965.2, 300 sec: 43709.2). Total num frames: 954482688. Throughput: 0: 43864.2. Samples: 857406340. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-27 19:10:03,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 19:10:05,663][06909] Updated weights for policy 0, policy_version 58263 (0.0029) [2024-06-27 19:10:08,850][06674] Fps is (10 sec: 42598.2, 60 sec: 44236.8, 300 sec: 43653.8). Total num frames: 954712064. Throughput: 0: 43762.9. Samples: 857665680. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2024-06-27 19:10:08,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 19:10:09,813][06909] Updated weights for policy 0, policy_version 58273 (0.0040) [2024-06-27 19:10:13,434][06909] Updated weights for policy 0, policy_version 58283 (0.0031) [2024-06-27 19:10:13,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.8, 300 sec: 43709.2). Total num frames: 954941440. Throughput: 0: 43838.3. Samples: 857799760. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2024-06-27 19:10:13,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:10:17,127][06909] Updated weights for policy 0, policy_version 58293 (0.0028) [2024-06-27 19:10:18,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.8, 300 sec: 43653.6). Total num frames: 955138048. Throughput: 0: 43786.7. Samples: 858064940. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2024-06-27 19:10:18,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:10:20,748][06909] Updated weights for policy 0, policy_version 58303 (0.0026) [2024-06-27 19:10:23,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43691.9, 300 sec: 43653.6). Total num frames: 955351040. Throughput: 0: 43842.4. Samples: 858325300. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2024-06-27 19:10:23,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:10:24,548][06909] Updated weights for policy 0, policy_version 58313 (0.0033) [2024-06-27 19:10:28,391][06909] Updated weights for policy 0, policy_version 58323 (0.0043) [2024-06-27 19:10:28,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43144.5, 300 sec: 43653.6). Total num frames: 955564032. Throughput: 0: 43819.0. Samples: 858454380. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2024-06-27 19:10:28,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:10:32,025][06909] Updated weights for policy 0, policy_version 58333 (0.0040) [2024-06-27 19:10:33,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43691.4, 300 sec: 43709.2). Total num frames: 955793408. Throughput: 0: 43570.9. Samples: 858710500. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2024-06-27 19:10:33,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:10:35,974][06909] Updated weights for policy 0, policy_version 58343 (0.0031) [2024-06-27 19:10:38,852][06674] Fps is (10 sec: 44227.7, 60 sec: 43689.3, 300 sec: 43708.9). Total num frames: 956006400. Throughput: 0: 43630.2. Samples: 858973540. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2024-06-27 19:10:38,852][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:10:39,693][06909] Updated weights for policy 0, policy_version 58353 (0.0055) [2024-06-27 19:10:43,342][06909] Updated weights for policy 0, policy_version 58363 (0.0032) [2024-06-27 19:10:43,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 956235776. Throughput: 0: 43719.9. Samples: 859105760. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 19:10:43,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:10:47,128][06909] Updated weights for policy 0, policy_version 58373 (0.0029) [2024-06-27 19:10:48,850][06674] Fps is (10 sec: 44245.6, 60 sec: 43690.8, 300 sec: 43764.7). Total num frames: 956448768. Throughput: 0: 43488.3. Samples: 859363320. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 19:10:48,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:10:51,068][06909] Updated weights for policy 0, policy_version 58383 (0.0053) [2024-06-27 19:10:53,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43692.2, 300 sec: 43653.6). Total num frames: 956661760. Throughput: 0: 43660.0. Samples: 859630380. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 19:10:53,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:10:54,552][06909] Updated weights for policy 0, policy_version 58393 (0.0032) [2024-06-27 19:10:58,454][06909] Updated weights for policy 0, policy_version 58403 (0.0037) [2024-06-27 19:10:58,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43417.5, 300 sec: 43709.2). Total num frames: 956891136. Throughput: 0: 43604.8. Samples: 859761980. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 19:10:58,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:11:01,875][06909] Updated weights for policy 0, policy_version 58413 (0.0030) [2024-06-27 19:11:03,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.6, 300 sec: 43820.2). Total num frames: 957104128. Throughput: 0: 43552.4. Samples: 860024800. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 19:11:03,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:11:05,960][06909] Updated weights for policy 0, policy_version 58423 (0.0034) [2024-06-27 19:11:08,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43417.5, 300 sec: 43709.2). Total num frames: 957317120. Throughput: 0: 43664.8. Samples: 860290220. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 19:11:08,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:11:09,492][06909] Updated weights for policy 0, policy_version 58433 (0.0033) [2024-06-27 19:11:13,434][06909] Updated weights for policy 0, policy_version 58443 (0.0029) [2024-06-27 19:11:13,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43417.5, 300 sec: 43709.2). Total num frames: 957546496. Throughput: 0: 43609.8. Samples: 860416820. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 19:11:13,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:11:17,012][06909] Updated weights for policy 0, policy_version 58453 (0.0029) [2024-06-27 19:11:18,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 957759488. Throughput: 0: 43782.7. Samples: 860680720. Policy #0 lag: (min: 1.0, avg: 10.4, max: 24.0) [2024-06-27 19:11:18,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:11:20,827][06909] Updated weights for policy 0, policy_version 58463 (0.0029) [2024-06-27 19:11:23,850][06674] Fps is (10 sec: 39321.9, 60 sec: 43144.6, 300 sec: 43598.1). Total num frames: 957939712. Throughput: 0: 43891.0. Samples: 860948540. Policy #0 lag: (min: 1.0, avg: 10.4, max: 24.0) [2024-06-27 19:11:23,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:11:24,532][06909] Updated weights for policy 0, policy_version 58473 (0.0028) [2024-06-27 19:11:28,405][06909] Updated weights for policy 0, policy_version 58483 (0.0026) [2024-06-27 19:11:28,564][06887] Signal inference workers to stop experience collection... (12250 times) [2024-06-27 19:11:28,589][06909] InferenceWorker_p0-w0: stopping experience collection (12250 times) [2024-06-27 19:11:28,675][06887] Signal inference workers to resume experience collection... (12250 times) [2024-06-27 19:11:28,676][06909] InferenceWorker_p0-w0: resuming experience collection (12250 times) [2024-06-27 19:11:28,850][06674] Fps is (10 sec: 45874.6, 60 sec: 44236.7, 300 sec: 43764.7). Total num frames: 958218240. Throughput: 0: 43783.6. Samples: 861076020. Policy #0 lag: (min: 1.0, avg: 10.4, max: 24.0) [2024-06-27 19:11:28,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:11:31,875][06909] Updated weights for policy 0, policy_version 58493 (0.0028) [2024-06-27 19:11:33,850][06674] Fps is (10 sec: 47513.5, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 958414848. Throughput: 0: 43878.8. Samples: 861337860. Policy #0 lag: (min: 1.0, avg: 10.4, max: 24.0) [2024-06-27 19:11:33,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 19:11:36,048][06909] Updated weights for policy 0, policy_version 58503 (0.0023) [2024-06-27 19:11:38,852][06674] Fps is (10 sec: 42590.2, 60 sec: 43963.8, 300 sec: 43709.8). Total num frames: 958644224. Throughput: 0: 43884.7. Samples: 861605280. Policy #0 lag: (min: 1.0, avg: 10.4, max: 24.0) [2024-06-27 19:11:38,852][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:11:39,331][06909] Updated weights for policy 0, policy_version 58513 (0.0033) [2024-06-27 19:11:43,477][06909] Updated weights for policy 0, policy_version 58523 (0.0033) [2024-06-27 19:11:43,852][06674] Fps is (10 sec: 42589.5, 60 sec: 43416.2, 300 sec: 43598.1). Total num frames: 958840832. Throughput: 0: 43799.0. Samples: 861733020. Policy #0 lag: (min: 1.0, avg: 10.4, max: 24.0) [2024-06-27 19:11:43,852][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:11:46,774][06909] Updated weights for policy 0, policy_version 58533 (0.0026) [2024-06-27 19:11:48,850][06674] Fps is (10 sec: 42607.4, 60 sec: 43690.8, 300 sec: 43709.5). Total num frames: 959070208. Throughput: 0: 43725.5. Samples: 861992440. Policy #0 lag: (min: 1.0, avg: 10.4, max: 24.0) [2024-06-27 19:11:48,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:11:48,894][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000058538_959086592.pth... [2024-06-27 19:11:48,945][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000057897_948584448.pth [2024-06-27 19:11:51,051][06909] Updated weights for policy 0, policy_version 58543 (0.0049) [2024-06-27 19:11:53,850][06674] Fps is (10 sec: 45884.8, 60 sec: 43963.8, 300 sec: 43764.7). Total num frames: 959299584. Throughput: 0: 43705.9. Samples: 862256980. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-27 19:11:53,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:11:54,139][06909] Updated weights for policy 0, policy_version 58553 (0.0036) [2024-06-27 19:11:58,534][06909] Updated weights for policy 0, policy_version 58563 (0.0033) [2024-06-27 19:11:58,850][06674] Fps is (10 sec: 42597.7, 60 sec: 43417.6, 300 sec: 43653.6). Total num frames: 959496192. Throughput: 0: 43580.4. Samples: 862377940. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-27 19:11:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:12:02,135][06909] Updated weights for policy 0, policy_version 58573 (0.0035) [2024-06-27 19:12:03,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.7, 300 sec: 43765.0). Total num frames: 959725568. Throughput: 0: 43567.5. Samples: 862641260. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-27 19:12:03,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:12:06,055][06909] Updated weights for policy 0, policy_version 58583 (0.0023) [2024-06-27 19:12:08,853][06674] Fps is (10 sec: 44221.5, 60 sec: 43688.1, 300 sec: 43709.2). Total num frames: 959938560. Throughput: 0: 43513.0. Samples: 862906780. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-27 19:12:08,854][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:12:09,460][06909] Updated weights for policy 0, policy_version 58593 (0.0029) [2024-06-27 19:12:13,814][06909] Updated weights for policy 0, policy_version 58603 (0.0029) [2024-06-27 19:12:13,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 960151552. Throughput: 0: 43506.7. Samples: 863033820. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-27 19:12:13,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:12:16,763][06909] Updated weights for policy 0, policy_version 58613 (0.0027) [2024-06-27 19:12:18,852][06674] Fps is (10 sec: 44243.4, 60 sec: 43689.2, 300 sec: 43820.0). Total num frames: 960380928. Throughput: 0: 43579.7. Samples: 863299040. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-27 19:12:18,852][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:12:21,110][06909] Updated weights for policy 0, policy_version 58623 (0.0030) [2024-06-27 19:12:23,850][06674] Fps is (10 sec: 44237.5, 60 sec: 44236.8, 300 sec: 43653.7). Total num frames: 960593920. Throughput: 0: 43423.4. Samples: 863559240. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-27 19:12:23,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:12:24,416][06909] Updated weights for policy 0, policy_version 58633 (0.0037) [2024-06-27 19:12:28,566][06909] Updated weights for policy 0, policy_version 58643 (0.0042) [2024-06-27 19:12:28,856][06674] Fps is (10 sec: 42581.3, 60 sec: 43140.3, 300 sec: 43652.7). Total num frames: 960806912. Throughput: 0: 43457.0. Samples: 863688760. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-27 19:12:28,857][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:12:32,158][06909] Updated weights for policy 0, policy_version 58653 (0.0025) [2024-06-27 19:12:33,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 961036288. Throughput: 0: 43557.3. Samples: 863952520. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 19:12:33,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:12:36,106][06909] Updated weights for policy 0, policy_version 58663 (0.0032) [2024-06-27 19:12:38,850][06674] Fps is (10 sec: 44263.5, 60 sec: 43419.0, 300 sec: 43709.2). Total num frames: 961249280. Throughput: 0: 43541.2. Samples: 864216340. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 19:12:38,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:12:39,925][06909] Updated weights for policy 0, policy_version 58673 (0.0032) [2024-06-27 19:12:41,228][06887] Signal inference workers to stop experience collection... (12300 times) [2024-06-27 19:12:41,274][06909] InferenceWorker_p0-w0: stopping experience collection (12300 times) [2024-06-27 19:12:41,280][06887] Signal inference workers to resume experience collection... (12300 times) [2024-06-27 19:12:41,296][06909] InferenceWorker_p0-w0: resuming experience collection (12300 times) [2024-06-27 19:12:43,531][06909] Updated weights for policy 0, policy_version 58683 (0.0026) [2024-06-27 19:12:43,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43692.2, 300 sec: 43653.7). Total num frames: 961462272. Throughput: 0: 43697.9. Samples: 864344340. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 19:12:43,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:12:47,146][06909] Updated weights for policy 0, policy_version 58693 (0.0038) [2024-06-27 19:12:48,856][06674] Fps is (10 sec: 45849.3, 60 sec: 43959.5, 300 sec: 43875.0). Total num frames: 961708032. Throughput: 0: 43698.9. Samples: 864607960. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 19:12:48,856][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:12:51,121][06909] Updated weights for policy 0, policy_version 58703 (0.0032) [2024-06-27 19:12:53,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43417.6, 300 sec: 43653.6). Total num frames: 961904640. Throughput: 0: 43712.8. Samples: 864873700. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 19:12:53,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 19:12:54,598][06909] Updated weights for policy 0, policy_version 58713 (0.0038) [2024-06-27 19:12:58,677][06909] Updated weights for policy 0, policy_version 58723 (0.0044) [2024-06-27 19:12:58,852][06674] Fps is (10 sec: 40974.7, 60 sec: 43689.2, 300 sec: 43708.9). Total num frames: 962117632. Throughput: 0: 43782.5. Samples: 865004120. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 19:12:58,852][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:13:02,138][06909] Updated weights for policy 0, policy_version 58733 (0.0037) [2024-06-27 19:13:03,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 962347008. Throughput: 0: 43735.4. Samples: 865267040. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 19:13:03,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:13:06,081][06909] Updated weights for policy 0, policy_version 58743 (0.0029) [2024-06-27 19:13:08,850][06674] Fps is (10 sec: 44245.8, 60 sec: 43693.2, 300 sec: 43709.2). Total num frames: 962560000. Throughput: 0: 43760.3. Samples: 865528460. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-27 19:13:08,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:13:09,601][06909] Updated weights for policy 0, policy_version 58753 (0.0031) [2024-06-27 19:13:13,419][06909] Updated weights for policy 0, policy_version 58763 (0.0020) [2024-06-27 19:13:13,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 962772992. Throughput: 0: 43805.9. Samples: 865659760. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-27 19:13:13,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:13:16,862][06909] Updated weights for policy 0, policy_version 58773 (0.0032) [2024-06-27 19:13:18,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43692.2, 300 sec: 43820.3). Total num frames: 963002368. Throughput: 0: 43752.9. Samples: 865921400. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-27 19:13:18,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:13:20,766][06909] Updated weights for policy 0, policy_version 58783 (0.0028) [2024-06-27 19:13:23,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 963215360. Throughput: 0: 43904.9. Samples: 866192060. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-27 19:13:23,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:13:24,779][06909] Updated weights for policy 0, policy_version 58793 (0.0031) [2024-06-27 19:13:28,532][06909] Updated weights for policy 0, policy_version 58803 (0.0031) [2024-06-27 19:13:28,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43695.1, 300 sec: 43709.2). Total num frames: 963428352. Throughput: 0: 43769.3. Samples: 866313960. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-27 19:13:28,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:13:32,095][06909] Updated weights for policy 0, policy_version 58813 (0.0022) [2024-06-27 19:13:33,852][06674] Fps is (10 sec: 44227.8, 60 sec: 43689.2, 300 sec: 43764.7). Total num frames: 963657728. Throughput: 0: 43848.0. Samples: 866580960. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-27 19:13:33,852][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:13:36,248][06909] Updated weights for policy 0, policy_version 58823 (0.0041) [2024-06-27 19:13:38,852][06674] Fps is (10 sec: 45865.5, 60 sec: 43962.2, 300 sec: 43653.3). Total num frames: 963887104. Throughput: 0: 43754.8. Samples: 866842760. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-27 19:13:38,852][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:13:39,391][06909] Updated weights for policy 0, policy_version 58833 (0.0030) [2024-06-27 19:13:43,610][06909] Updated weights for policy 0, policy_version 58843 (0.0037) [2024-06-27 19:13:43,850][06674] Fps is (10 sec: 42606.6, 60 sec: 43690.5, 300 sec: 43653.6). Total num frames: 964083712. Throughput: 0: 43793.5. Samples: 866974740. Policy #0 lag: (min: 1.0, avg: 10.2, max: 22.0) [2024-06-27 19:13:43,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:13:47,071][06909] Updated weights for policy 0, policy_version 58853 (0.0032) [2024-06-27 19:13:48,850][06674] Fps is (10 sec: 42606.5, 60 sec: 43421.6, 300 sec: 43709.2). Total num frames: 964313088. Throughput: 0: 43728.3. Samples: 867234820. Policy #0 lag: (min: 1.0, avg: 10.2, max: 22.0) [2024-06-27 19:13:48,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:13:48,864][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000058857_964313088.pth... [2024-06-27 19:13:48,923][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000058217_953827328.pth [2024-06-27 19:13:51,137][06909] Updated weights for policy 0, policy_version 58863 (0.0036) [2024-06-27 19:13:53,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.7, 300 sec: 43653.6). Total num frames: 964542464. Throughput: 0: 43700.0. Samples: 867494960. Policy #0 lag: (min: 1.0, avg: 10.2, max: 22.0) [2024-06-27 19:13:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:13:54,360][06909] Updated weights for policy 0, policy_version 58873 (0.0033) [2024-06-27 19:13:58,462][06909] Updated weights for policy 0, policy_version 58883 (0.0041) [2024-06-27 19:13:58,850][06674] Fps is (10 sec: 42599.2, 60 sec: 43692.2, 300 sec: 43709.5). Total num frames: 964739072. Throughput: 0: 43815.6. Samples: 867631460. Policy #0 lag: (min: 1.0, avg: 10.2, max: 22.0) [2024-06-27 19:13:58,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:14:01,998][06909] Updated weights for policy 0, policy_version 58893 (0.0027) [2024-06-27 19:14:03,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 964968448. Throughput: 0: 43719.5. Samples: 867888780. Policy #0 lag: (min: 1.0, avg: 10.2, max: 22.0) [2024-06-27 19:14:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:14:06,401][06909] Updated weights for policy 0, policy_version 58903 (0.0036) [2024-06-27 19:14:08,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.8, 300 sec: 43709.2). Total num frames: 965197824. Throughput: 0: 43541.4. Samples: 868151420. Policy #0 lag: (min: 1.0, avg: 10.2, max: 22.0) [2024-06-27 19:14:08,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:14:09,325][06909] Updated weights for policy 0, policy_version 58913 (0.0033) [2024-06-27 19:14:13,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43417.5, 300 sec: 43653.6). Total num frames: 965378048. Throughput: 0: 43693.6. Samples: 868280180. Policy #0 lag: (min: 1.0, avg: 10.2, max: 22.0) [2024-06-27 19:14:13,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:14:14,080][06909] Updated weights for policy 0, policy_version 58923 (0.0038) [2024-06-27 19:14:15,948][06887] Signal inference workers to stop experience collection... (12350 times) [2024-06-27 19:14:15,948][06887] Signal inference workers to resume experience collection... (12350 times) [2024-06-27 19:14:15,990][06909] InferenceWorker_p0-w0: stopping experience collection (12350 times) [2024-06-27 19:14:15,990][06909] InferenceWorker_p0-w0: resuming experience collection (12350 times) [2024-06-27 19:14:17,164][06909] Updated weights for policy 0, policy_version 58933 (0.0040) [2024-06-27 19:14:18,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.6, 300 sec: 43765.0). Total num frames: 965640192. Throughput: 0: 43635.2. Samples: 868544460. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-27 19:14:18,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:14:21,347][06909] Updated weights for policy 0, policy_version 58943 (0.0029) [2024-06-27 19:14:23,850][06674] Fps is (10 sec: 47513.8, 60 sec: 43963.7, 300 sec: 43653.6). Total num frames: 965853184. Throughput: 0: 43509.9. Samples: 868800620. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-27 19:14:23,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:14:24,658][06909] Updated weights for policy 0, policy_version 58953 (0.0035) [2024-06-27 19:14:28,850][06674] Fps is (10 sec: 39321.8, 60 sec: 43417.6, 300 sec: 43598.2). Total num frames: 966033408. Throughput: 0: 43557.4. Samples: 868934820. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-27 19:14:28,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:14:28,926][06909] Updated weights for policy 0, policy_version 58963 (0.0032) [2024-06-27 19:14:32,074][06909] Updated weights for policy 0, policy_version 58973 (0.0038) [2024-06-27 19:14:33,852][06674] Fps is (10 sec: 42590.0, 60 sec: 43690.7, 300 sec: 43708.9). Total num frames: 966279168. Throughput: 0: 43628.4. Samples: 869198180. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-27 19:14:33,852][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:14:36,504][06909] Updated weights for policy 0, policy_version 58983 (0.0028) [2024-06-27 19:14:38,856][06674] Fps is (10 sec: 49122.1, 60 sec: 43960.8, 300 sec: 43763.8). Total num frames: 966524928. Throughput: 0: 43668.4. Samples: 869460300. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-27 19:14:38,857][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:14:39,585][06909] Updated weights for policy 0, policy_version 58993 (0.0035) [2024-06-27 19:14:43,850][06674] Fps is (10 sec: 40968.5, 60 sec: 43417.7, 300 sec: 43598.1). Total num frames: 966688768. Throughput: 0: 43517.3. Samples: 869589740. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-27 19:14:43,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:14:43,894][06909] Updated weights for policy 0, policy_version 59003 (0.0041) [2024-06-27 19:14:46,975][06909] Updated weights for policy 0, policy_version 59013 (0.0026) [2024-06-27 19:14:48,850][06674] Fps is (10 sec: 40984.7, 60 sec: 43690.7, 300 sec: 43709.5). Total num frames: 966934528. Throughput: 0: 43590.6. Samples: 869850360. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-27 19:14:48,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:14:51,575][06909] Updated weights for policy 0, policy_version 59023 (0.0029) [2024-06-27 19:14:53,852][06674] Fps is (10 sec: 45865.7, 60 sec: 43416.2, 300 sec: 43597.8). Total num frames: 967147520. Throughput: 0: 43564.2. Samples: 870111900. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-27 19:14:53,852][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:14:54,617][06909] Updated weights for policy 0, policy_version 59033 (0.0039) [2024-06-27 19:14:58,850][06674] Fps is (10 sec: 39321.9, 60 sec: 43144.5, 300 sec: 43542.6). Total num frames: 967327744. Throughput: 0: 43573.5. Samples: 870240980. Policy #0 lag: (min: 0.0, avg: 11.4, max: 22.0) [2024-06-27 19:14:58,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:14:59,069][06909] Updated weights for policy 0, policy_version 59043 (0.0027) [2024-06-27 19:15:02,296][06909] Updated weights for policy 0, policy_version 59053 (0.0039) [2024-06-27 19:15:03,850][06674] Fps is (10 sec: 45884.1, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 967606272. Throughput: 0: 43591.1. Samples: 870506060. Policy #0 lag: (min: 0.0, avg: 11.4, max: 22.0) [2024-06-27 19:15:03,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:15:06,417][06909] Updated weights for policy 0, policy_version 59063 (0.0020) [2024-06-27 19:15:08,850][06674] Fps is (10 sec: 49152.2, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 967819264. Throughput: 0: 43593.5. Samples: 870762320. Policy #0 lag: (min: 0.0, avg: 11.4, max: 22.0) [2024-06-27 19:15:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:15:10,300][06909] Updated weights for policy 0, policy_version 59073 (0.0039) [2024-06-27 19:15:13,850][06674] Fps is (10 sec: 39322.0, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 967999488. Throughput: 0: 43550.7. Samples: 870894600. Policy #0 lag: (min: 0.0, avg: 11.4, max: 22.0) [2024-06-27 19:15:13,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:15:13,901][06909] Updated weights for policy 0, policy_version 59083 (0.0028) [2024-06-27 19:15:17,648][06909] Updated weights for policy 0, policy_version 59093 (0.0041) [2024-06-27 19:15:18,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 968245248. Throughput: 0: 43726.4. Samples: 871165780. Policy #0 lag: (min: 0.0, avg: 11.4, max: 22.0) [2024-06-27 19:15:18,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:15:21,498][06909] Updated weights for policy 0, policy_version 59103 (0.0031) [2024-06-27 19:15:23,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43417.7, 300 sec: 43709.2). Total num frames: 968458240. Throughput: 0: 43518.4. Samples: 871418360. Policy #0 lag: (min: 0.0, avg: 11.4, max: 22.0) [2024-06-27 19:15:23,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:15:24,941][06909] Updated weights for policy 0, policy_version 59113 (0.0035) [2024-06-27 19:15:28,850][06674] Fps is (10 sec: 39321.8, 60 sec: 43417.6, 300 sec: 43542.6). Total num frames: 968638464. Throughput: 0: 43641.3. Samples: 871553600. Policy #0 lag: (min: 0.0, avg: 11.4, max: 22.0) [2024-06-27 19:15:28,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:15:29,229][06909] Updated weights for policy 0, policy_version 59123 (0.0028) [2024-06-27 19:15:32,277][06909] Updated weights for policy 0, policy_version 59133 (0.0028) [2024-06-27 19:15:33,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43692.1, 300 sec: 43709.5). Total num frames: 968900608. Throughput: 0: 43599.1. Samples: 871812320. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 19:15:33,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:15:36,998][06909] Updated weights for policy 0, policy_version 59143 (0.0034) [2024-06-27 19:15:37,538][06887] Signal inference workers to stop experience collection... (12400 times) [2024-06-27 19:15:37,591][06887] Signal inference workers to resume experience collection... (12400 times) [2024-06-27 19:15:37,592][06909] InferenceWorker_p0-w0: stopping experience collection (12400 times) [2024-06-27 19:15:37,609][06909] InferenceWorker_p0-w0: resuming experience collection (12400 times) [2024-06-27 19:15:38,850][06674] Fps is (10 sec: 49151.5, 60 sec: 43422.0, 300 sec: 43709.2). Total num frames: 969129984. Throughput: 0: 43461.9. Samples: 872067600. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 19:15:38,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:15:39,952][06909] Updated weights for policy 0, policy_version 59153 (0.0022) [2024-06-27 19:15:43,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 969310208. Throughput: 0: 43712.8. Samples: 872208060. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 19:15:43,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:15:44,222][06909] Updated weights for policy 0, policy_version 59163 (0.0042) [2024-06-27 19:15:47,653][06909] Updated weights for policy 0, policy_version 59173 (0.0042) [2024-06-27 19:15:48,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 969555968. Throughput: 0: 43667.6. Samples: 872471100. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 19:15:48,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:15:48,869][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000059177_969555968.pth... [2024-06-27 19:15:48,934][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000058538_959086592.pth [2024-06-27 19:15:51,688][06909] Updated weights for policy 0, policy_version 59183 (0.0032) [2024-06-27 19:15:53,856][06674] Fps is (10 sec: 47485.3, 60 sec: 43960.8, 300 sec: 43708.3). Total num frames: 969785344. Throughput: 0: 43640.3. Samples: 872726400. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 19:15:53,856][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 19:15:55,304][06909] Updated weights for policy 0, policy_version 59193 (0.0031) [2024-06-27 19:15:58,851][06674] Fps is (10 sec: 40956.7, 60 sec: 43963.1, 300 sec: 43598.0). Total num frames: 969965568. Throughput: 0: 43581.0. Samples: 872855780. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 19:15:58,851][06674] Avg episode reward: [(0, '0.396')] [2024-06-27 19:15:59,191][06909] Updated weights for policy 0, policy_version 59203 (0.0041) [2024-06-27 19:16:02,594][06909] Updated weights for policy 0, policy_version 59213 (0.0031) [2024-06-27 19:16:03,850][06674] Fps is (10 sec: 42624.3, 60 sec: 43417.7, 300 sec: 43709.2). Total num frames: 970211328. Throughput: 0: 43485.9. Samples: 873122640. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 19:16:03,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:16:06,550][06909] Updated weights for policy 0, policy_version 59223 (0.0035) [2024-06-27 19:16:08,850][06674] Fps is (10 sec: 47516.9, 60 sec: 43690.5, 300 sec: 43709.2). Total num frames: 970440704. Throughput: 0: 43582.1. Samples: 873379560. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 19:16:08,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 19:16:10,061][06909] Updated weights for policy 0, policy_version 59233 (0.0035) [2024-06-27 19:16:13,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 970620928. Throughput: 0: 43483.1. Samples: 873510340. Policy #0 lag: (min: 0.0, avg: 11.4, max: 24.0) [2024-06-27 19:16:13,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:16:14,123][06909] Updated weights for policy 0, policy_version 59243 (0.0042) [2024-06-27 19:16:17,360][06909] Updated weights for policy 0, policy_version 59253 (0.0031) [2024-06-27 19:16:18,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.6, 300 sec: 43820.2). Total num frames: 970866688. Throughput: 0: 43631.1. Samples: 873775720. Policy #0 lag: (min: 0.0, avg: 11.4, max: 24.0) [2024-06-27 19:16:18,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:16:22,109][06909] Updated weights for policy 0, policy_version 59263 (0.0032) [2024-06-27 19:16:23,850][06674] Fps is (10 sec: 47513.6, 60 sec: 43963.7, 300 sec: 43653.7). Total num frames: 971096064. Throughput: 0: 43729.4. Samples: 874035420. Policy #0 lag: (min: 0.0, avg: 11.4, max: 24.0) [2024-06-27 19:16:23,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:16:25,213][06909] Updated weights for policy 0, policy_version 59273 (0.0025) [2024-06-27 19:16:28,852][06674] Fps is (10 sec: 40952.1, 60 sec: 43962.2, 300 sec: 43597.8). Total num frames: 971276288. Throughput: 0: 43614.1. Samples: 874170780. Policy #0 lag: (min: 0.0, avg: 11.4, max: 24.0) [2024-06-27 19:16:28,852][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:16:29,338][06909] Updated weights for policy 0, policy_version 59283 (0.0032) [2024-06-27 19:16:32,505][06909] Updated weights for policy 0, policy_version 59293 (0.0031) [2024-06-27 19:16:33,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 43653.9). Total num frames: 971522048. Throughput: 0: 43677.3. Samples: 874436580. Policy #0 lag: (min: 0.0, avg: 11.4, max: 24.0) [2024-06-27 19:16:33,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:16:36,619][06909] Updated weights for policy 0, policy_version 59303 (0.0028) [2024-06-27 19:16:38,850][06674] Fps is (10 sec: 45884.9, 60 sec: 43417.7, 300 sec: 43709.5). Total num frames: 971735040. Throughput: 0: 43633.9. Samples: 874689660. Policy #0 lag: (min: 0.0, avg: 11.4, max: 24.0) [2024-06-27 19:16:38,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:16:39,805][06909] Updated weights for policy 0, policy_version 59313 (0.0039) [2024-06-27 19:16:43,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 971931648. Throughput: 0: 43806.9. Samples: 874827060. Policy #0 lag: (min: 0.0, avg: 11.4, max: 24.0) [2024-06-27 19:16:43,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:16:43,894][06909] Updated weights for policy 0, policy_version 59323 (0.0030) [2024-06-27 19:16:45,233][06887] Signal inference workers to stop experience collection... (12450 times) [2024-06-27 19:16:45,289][06909] InferenceWorker_p0-w0: stopping experience collection (12450 times) [2024-06-27 19:16:45,289][06887] Signal inference workers to resume experience collection... (12450 times) [2024-06-27 19:16:45,311][06909] InferenceWorker_p0-w0: resuming experience collection (12450 times) [2024-06-27 19:16:47,618][06909] Updated weights for policy 0, policy_version 59333 (0.0028) [2024-06-27 19:16:48,850][06674] Fps is (10 sec: 44235.9, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 972177408. Throughput: 0: 43727.3. Samples: 875090380. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-27 19:16:48,851][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 19:16:51,181][06909] Updated weights for policy 0, policy_version 59343 (0.0041) [2024-06-27 19:16:53,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43421.9, 300 sec: 43709.2). Total num frames: 972390400. Throughput: 0: 43714.3. Samples: 875346700. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-27 19:16:53,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:16:54,889][06909] Updated weights for policy 0, policy_version 59353 (0.0040) [2024-06-27 19:16:58,850][06674] Fps is (10 sec: 40960.6, 60 sec: 43691.3, 300 sec: 43598.1). Total num frames: 972587008. Throughput: 0: 43769.4. Samples: 875479960. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-27 19:16:58,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:16:59,134][06909] Updated weights for policy 0, policy_version 59363 (0.0031) [2024-06-27 19:17:02,363][06909] Updated weights for policy 0, policy_version 59373 (0.0028) [2024-06-27 19:17:03,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43417.6, 300 sec: 43654.2). Total num frames: 972816384. Throughput: 0: 43713.9. Samples: 875742840. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-27 19:17:03,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 19:17:06,862][06909] Updated weights for policy 0, policy_version 59383 (0.0030) [2024-06-27 19:17:08,852][06674] Fps is (10 sec: 45865.7, 60 sec: 43416.2, 300 sec: 43708.9). Total num frames: 973045760. Throughput: 0: 43693.6. Samples: 876001720. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-27 19:17:08,852][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 19:17:09,838][06909] Updated weights for policy 0, policy_version 59393 (0.0042) [2024-06-27 19:17:13,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.7, 300 sec: 43598.4). Total num frames: 973242368. Throughput: 0: 43674.4. Samples: 876136040. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-27 19:17:13,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:17:14,259][06909] Updated weights for policy 0, policy_version 59403 (0.0031) [2024-06-27 19:17:17,364][06909] Updated weights for policy 0, policy_version 59413 (0.0036) [2024-06-27 19:17:18,850][06674] Fps is (10 sec: 44243.6, 60 sec: 43690.4, 300 sec: 43709.1). Total num frames: 973488128. Throughput: 0: 43620.8. Samples: 876399540. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-27 19:17:18,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:17:21,427][06909] Updated weights for policy 0, policy_version 59423 (0.0030) [2024-06-27 19:17:23,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43417.6, 300 sec: 43710.1). Total num frames: 973701120. Throughput: 0: 43837.7. Samples: 876662360. Policy #0 lag: (min: 1.0, avg: 10.0, max: 21.0) [2024-06-27 19:17:23,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 19:17:24,867][06909] Updated weights for policy 0, policy_version 59433 (0.0039) [2024-06-27 19:17:28,763][06909] Updated weights for policy 0, policy_version 59443 (0.0031) [2024-06-27 19:17:28,850][06674] Fps is (10 sec: 42600.8, 60 sec: 43965.3, 300 sec: 43653.6). Total num frames: 973914112. Throughput: 0: 43620.1. Samples: 876789960. Policy #0 lag: (min: 1.0, avg: 10.0, max: 21.0) [2024-06-27 19:17:28,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:17:32,233][06909] Updated weights for policy 0, policy_version 59453 (0.0030) [2024-06-27 19:17:33,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 974143488. Throughput: 0: 43638.2. Samples: 877054100. Policy #0 lag: (min: 1.0, avg: 10.0, max: 21.0) [2024-06-27 19:17:33,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:17:36,332][06909] Updated weights for policy 0, policy_version 59463 (0.0035) [2024-06-27 19:17:38,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.7, 300 sec: 43764.7). Total num frames: 974372864. Throughput: 0: 43725.4. Samples: 877314340. Policy #0 lag: (min: 1.0, avg: 10.0, max: 21.0) [2024-06-27 19:17:38,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:17:39,750][06909] Updated weights for policy 0, policy_version 59473 (0.0025) [2024-06-27 19:17:43,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43690.7, 300 sec: 43543.4). Total num frames: 974553088. Throughput: 0: 43711.5. Samples: 877446980. Policy #0 lag: (min: 1.0, avg: 10.0, max: 21.0) [2024-06-27 19:17:43,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 19:17:43,995][06909] Updated weights for policy 0, policy_version 59483 (0.0021) [2024-06-27 19:17:47,205][06909] Updated weights for policy 0, policy_version 59493 (0.0032) [2024-06-27 19:17:48,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43417.8, 300 sec: 43653.6). Total num frames: 974782464. Throughput: 0: 43680.0. Samples: 877708440. Policy #0 lag: (min: 1.0, avg: 10.0, max: 21.0) [2024-06-27 19:17:48,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:17:48,925][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000059497_974798848.pth... [2024-06-27 19:17:48,984][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000058857_964313088.pth [2024-06-27 19:17:51,558][06909] Updated weights for policy 0, policy_version 59503 (0.0030) [2024-06-27 19:17:53,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43690.7, 300 sec: 43709.5). Total num frames: 975011840. Throughput: 0: 43733.1. Samples: 877969620. Policy #0 lag: (min: 1.0, avg: 10.0, max: 21.0) [2024-06-27 19:17:53,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:17:54,688][06909] Updated weights for policy 0, policy_version 59513 (0.0034) [2024-06-27 19:17:58,544][06887] Signal inference workers to stop experience collection... (12500 times) [2024-06-27 19:17:58,545][06887] Signal inference workers to resume experience collection... (12500 times) [2024-06-27 19:17:58,583][06909] InferenceWorker_p0-w0: stopping experience collection (12500 times) [2024-06-27 19:17:58,584][06909] InferenceWorker_p0-w0: resuming experience collection (12500 times) [2024-06-27 19:17:58,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 975208448. Throughput: 0: 43780.4. Samples: 878106160. Policy #0 lag: (min: 1.0, avg: 10.0, max: 21.0) [2024-06-27 19:17:58,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:17:58,863][06909] Updated weights for policy 0, policy_version 59523 (0.0033) [2024-06-27 19:18:02,058][06909] Updated weights for policy 0, policy_version 59533 (0.0039) [2024-06-27 19:18:03,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43417.5, 300 sec: 43598.1). Total num frames: 975421440. Throughput: 0: 43688.5. Samples: 878365500. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 19:18:03,864][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:18:06,299][06909] Updated weights for policy 0, policy_version 59543 (0.0031) [2024-06-27 19:18:08,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43419.1, 300 sec: 43653.6). Total num frames: 975650816. Throughput: 0: 43795.6. Samples: 878633160. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 19:18:08,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:18:09,511][06909] Updated weights for policy 0, policy_version 59553 (0.0028) [2024-06-27 19:18:13,553][06909] Updated weights for policy 0, policy_version 59563 (0.0029) [2024-06-27 19:18:13,856][06674] Fps is (10 sec: 45847.2, 60 sec: 43959.2, 300 sec: 43652.7). Total num frames: 975880192. Throughput: 0: 43785.6. Samples: 878760580. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 19:18:13,865][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:18:16,904][06909] Updated weights for policy 0, policy_version 59573 (0.0033) [2024-06-27 19:18:18,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43418.0, 300 sec: 43653.6). Total num frames: 976093184. Throughput: 0: 43629.0. Samples: 879017400. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 19:18:18,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:18:21,183][06909] Updated weights for policy 0, policy_version 59583 (0.0036) [2024-06-27 19:18:23,850][06674] Fps is (10 sec: 44263.5, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 976322560. Throughput: 0: 43699.9. Samples: 879280840. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 19:18:23,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:18:24,717][06909] Updated weights for policy 0, policy_version 59593 (0.0035) [2024-06-27 19:18:28,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43417.6, 300 sec: 43598.4). Total num frames: 976519168. Throughput: 0: 43770.7. Samples: 879416660. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 19:18:28,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:18:28,912][06909] Updated weights for policy 0, policy_version 59603 (0.0030) [2024-06-27 19:18:32,058][06909] Updated weights for policy 0, policy_version 59613 (0.0029) [2024-06-27 19:18:33,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43417.7, 300 sec: 43598.4). Total num frames: 976748544. Throughput: 0: 43609.8. Samples: 879670880. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 19:18:33,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 19:18:36,554][06909] Updated weights for policy 0, policy_version 59623 (0.0022) [2024-06-27 19:18:38,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43417.7, 300 sec: 43709.2). Total num frames: 976977920. Throughput: 0: 43730.8. Samples: 879937500. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-27 19:18:38,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:18:39,539][06909] Updated weights for policy 0, policy_version 59633 (0.0027) [2024-06-27 19:18:43,850][06674] Fps is (10 sec: 42595.4, 60 sec: 43690.2, 300 sec: 43598.0). Total num frames: 977174528. Throughput: 0: 43684.7. Samples: 880072000. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-27 19:18:43,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:18:43,891][06909] Updated weights for policy 0, policy_version 59643 (0.0028) [2024-06-27 19:18:46,945][06909] Updated weights for policy 0, policy_version 59653 (0.0039) [2024-06-27 19:18:48,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43417.6, 300 sec: 43542.6). Total num frames: 977387520. Throughput: 0: 43557.9. Samples: 880325600. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-27 19:18:48,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:18:51,170][06909] Updated weights for policy 0, policy_version 59663 (0.0035) [2024-06-27 19:18:53,850][06674] Fps is (10 sec: 47515.9, 60 sec: 43963.7, 300 sec: 43764.7). Total num frames: 977649664. Throughput: 0: 43613.1. Samples: 880595760. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-27 19:18:53,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:18:54,330][06909] Updated weights for policy 0, policy_version 59673 (0.0031) [2024-06-27 19:18:58,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 977829888. Throughput: 0: 43632.2. Samples: 880723760. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-27 19:18:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:18:58,993][06909] Updated weights for policy 0, policy_version 59683 (0.0033) [2024-06-27 19:19:01,715][06909] Updated weights for policy 0, policy_version 59693 (0.0037) [2024-06-27 19:19:03,850][06674] Fps is (10 sec: 39322.0, 60 sec: 43690.7, 300 sec: 43542.6). Total num frames: 978042880. Throughput: 0: 43588.9. Samples: 880978900. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-27 19:19:03,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:19:06,538][06909] Updated weights for policy 0, policy_version 59703 (0.0030) [2024-06-27 19:19:08,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43963.6, 300 sec: 43764.7). Total num frames: 978288640. Throughput: 0: 43634.2. Samples: 881244380. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-27 19:19:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 19:19:09,471][06909] Updated weights for policy 0, policy_version 59713 (0.0031) [2024-06-27 19:19:10,431][06887] Signal inference workers to stop experience collection... (12550 times) [2024-06-27 19:19:10,431][06887] Signal inference workers to resume experience collection... (12550 times) [2024-06-27 19:19:10,473][06909] InferenceWorker_p0-w0: stopping experience collection (12550 times) [2024-06-27 19:19:10,473][06909] InferenceWorker_p0-w0: resuming experience collection (12550 times) [2024-06-27 19:19:13,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43422.0, 300 sec: 43542.6). Total num frames: 978485248. Throughput: 0: 43754.2. Samples: 881385600. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2024-06-27 19:19:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:19:13,940][06909] Updated weights for policy 0, policy_version 59723 (0.0025) [2024-06-27 19:19:16,846][06909] Updated weights for policy 0, policy_version 59733 (0.0026) [2024-06-27 19:19:18,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43417.5, 300 sec: 43542.6). Total num frames: 978698240. Throughput: 0: 43722.9. Samples: 881638420. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2024-06-27 19:19:18,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:19:21,458][06909] Updated weights for policy 0, policy_version 59743 (0.0024) [2024-06-27 19:19:23,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 978944000. Throughput: 0: 43601.2. Samples: 881899560. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2024-06-27 19:19:23,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:19:24,418][06909] Updated weights for policy 0, policy_version 59753 (0.0030) [2024-06-27 19:19:28,780][06909] Updated weights for policy 0, policy_version 59763 (0.0032) [2024-06-27 19:19:28,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43963.7, 300 sec: 43653.9). Total num frames: 979156992. Throughput: 0: 43668.6. Samples: 882037060. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2024-06-27 19:19:28,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:19:31,792][06909] Updated weights for policy 0, policy_version 59773 (0.0028) [2024-06-27 19:19:33,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43417.5, 300 sec: 43487.9). Total num frames: 979353600. Throughput: 0: 43707.0. Samples: 882292420. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2024-06-27 19:19:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 19:19:36,631][06909] Updated weights for policy 0, policy_version 59783 (0.0024) [2024-06-27 19:19:38,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.6, 300 sec: 43820.2). Total num frames: 979615744. Throughput: 0: 43481.9. Samples: 882552440. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2024-06-27 19:19:38,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:19:39,421][06909] Updated weights for policy 0, policy_version 59793 (0.0034) [2024-06-27 19:19:43,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43418.1, 300 sec: 43542.6). Total num frames: 979779584. Throughput: 0: 43613.3. Samples: 882686360. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2024-06-27 19:19:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:19:44,085][06909] Updated weights for policy 0, policy_version 59803 (0.0037) [2024-06-27 19:19:46,743][06909] Updated weights for policy 0, policy_version 59813 (0.0040) [2024-06-27 19:19:48,850][06674] Fps is (10 sec: 39321.4, 60 sec: 43690.5, 300 sec: 43598.4). Total num frames: 980008960. Throughput: 0: 43631.5. Samples: 882942320. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2024-06-27 19:19:48,851][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:19:48,870][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000059815_980008960.pth... [2024-06-27 19:19:48,927][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000059177_969555968.pth [2024-06-27 19:19:51,386][06909] Updated weights for policy 0, policy_version 59823 (0.0037) [2024-06-27 19:19:53,850][06674] Fps is (10 sec: 47512.7, 60 sec: 43417.6, 300 sec: 43820.2). Total num frames: 980254720. Throughput: 0: 43643.5. Samples: 883208340. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 19:19:53,851][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:19:54,194][06909] Updated weights for policy 0, policy_version 59833 (0.0029) [2024-06-27 19:19:58,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.6, 300 sec: 43542.6). Total num frames: 980451328. Throughput: 0: 43441.3. Samples: 883340460. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 19:19:58,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:19:59,037][06909] Updated weights for policy 0, policy_version 59843 (0.0027) [2024-06-27 19:20:01,950][06909] Updated weights for policy 0, policy_version 59853 (0.0032) [2024-06-27 19:20:03,850][06674] Fps is (10 sec: 40960.6, 60 sec: 43690.7, 300 sec: 43542.6). Total num frames: 980664320. Throughput: 0: 43500.1. Samples: 883595920. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 19:20:03,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:20:06,440][06909] Updated weights for policy 0, policy_version 59863 (0.0033) [2024-06-27 19:20:08,850][06674] Fps is (10 sec: 47514.2, 60 sec: 43963.8, 300 sec: 43820.3). Total num frames: 980926464. Throughput: 0: 43602.7. Samples: 883861680. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 19:20:08,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:20:09,328][06909] Updated weights for policy 0, policy_version 59873 (0.0039) [2024-06-27 19:20:13,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 981106688. Throughput: 0: 43528.9. Samples: 883995860. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 19:20:13,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:20:14,310][06909] Updated weights for policy 0, policy_version 59883 (0.0039) [2024-06-27 19:20:16,981][06909] Updated weights for policy 0, policy_version 59893 (0.0033) [2024-06-27 19:20:18,850][06674] Fps is (10 sec: 39321.1, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 981319680. Throughput: 0: 43383.9. Samples: 884244700. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 19:20:18,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:20:21,609][06909] Updated weights for policy 0, policy_version 59903 (0.0027) [2024-06-27 19:20:23,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43690.6, 300 sec: 43820.2). Total num frames: 981565440. Throughput: 0: 43508.9. Samples: 884510340. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 19:20:23,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 19:20:24,605][06909] Updated weights for policy 0, policy_version 59913 (0.0030) [2024-06-27 19:20:28,371][06887] Signal inference workers to stop experience collection... (12600 times) [2024-06-27 19:20:28,419][06909] InferenceWorker_p0-w0: stopping experience collection (12600 times) [2024-06-27 19:20:28,486][06887] Signal inference workers to resume experience collection... (12600 times) [2024-06-27 19:20:28,487][06909] InferenceWorker_p0-w0: resuming experience collection (12600 times) [2024-06-27 19:20:28,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 981762048. Throughput: 0: 43560.8. Samples: 884646600. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-27 19:20:28,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:20:28,882][06909] Updated weights for policy 0, policy_version 59923 (0.0026) [2024-06-27 19:20:32,055][06909] Updated weights for policy 0, policy_version 59933 (0.0028) [2024-06-27 19:20:33,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43690.7, 300 sec: 43542.6). Total num frames: 981975040. Throughput: 0: 43525.4. Samples: 884900960. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-27 19:20:33,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:20:36,702][06909] Updated weights for policy 0, policy_version 59943 (0.0039) [2024-06-27 19:20:38,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43417.7, 300 sec: 43764.7). Total num frames: 982220800. Throughput: 0: 43642.4. Samples: 885172240. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-27 19:20:38,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:20:39,496][06909] Updated weights for policy 0, policy_version 59953 (0.0038) [2024-06-27 19:20:43,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43963.8, 300 sec: 43598.1). Total num frames: 982417408. Throughput: 0: 43590.4. Samples: 885302020. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-27 19:20:43,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:20:43,952][06909] Updated weights for policy 0, policy_version 59963 (0.0030) [2024-06-27 19:20:47,347][06909] Updated weights for policy 0, policy_version 59973 (0.0046) [2024-06-27 19:20:48,856][06674] Fps is (10 sec: 40935.1, 60 sec: 43686.4, 300 sec: 43542.6). Total num frames: 982630400. Throughput: 0: 43511.1. Samples: 885554180. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-27 19:20:48,856][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:20:51,326][06909] Updated weights for policy 0, policy_version 59983 (0.0035) [2024-06-27 19:20:53,850][06674] Fps is (10 sec: 45874.4, 60 sec: 43690.7, 300 sec: 43764.8). Total num frames: 982876160. Throughput: 0: 43451.5. Samples: 885817000. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-27 19:20:53,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:20:54,913][06909] Updated weights for policy 0, policy_version 59993 (0.0054) [2024-06-27 19:20:58,850][06674] Fps is (10 sec: 42624.0, 60 sec: 43417.7, 300 sec: 43542.6). Total num frames: 983056384. Throughput: 0: 43487.5. Samples: 885952800. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-27 19:20:58,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:20:59,441][06909] Updated weights for policy 0, policy_version 60003 (0.0030) [2024-06-27 19:21:02,362][06909] Updated weights for policy 0, policy_version 60013 (0.0034) [2024-06-27 19:21:03,850][06674] Fps is (10 sec: 40958.8, 60 sec: 43690.4, 300 sec: 43542.5). Total num frames: 983285760. Throughput: 0: 43581.1. Samples: 886205860. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-27 19:21:03,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:21:06,663][06909] Updated weights for policy 0, policy_version 60023 (0.0040) [2024-06-27 19:21:08,850][06674] Fps is (10 sec: 47513.7, 60 sec: 43417.6, 300 sec: 43764.7). Total num frames: 983531520. Throughput: 0: 43606.7. Samples: 886472640. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-27 19:21:08,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:21:09,689][06909] Updated weights for policy 0, policy_version 60033 (0.0030) [2024-06-27 19:21:13,850][06674] Fps is (10 sec: 44238.5, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 983728128. Throughput: 0: 43695.6. Samples: 886612900. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-27 19:21:13,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 19:21:13,952][06909] Updated weights for policy 0, policy_version 60043 (0.0034) [2024-06-27 19:21:17,138][06909] Updated weights for policy 0, policy_version 60053 (0.0024) [2024-06-27 19:21:18,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43690.7, 300 sec: 43542.6). Total num frames: 983941120. Throughput: 0: 43785.7. Samples: 886871320. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-27 19:21:18,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:21:21,533][06909] Updated weights for policy 0, policy_version 60063 (0.0025) [2024-06-27 19:21:23,850][06674] Fps is (10 sec: 45874.5, 60 sec: 43690.6, 300 sec: 43765.0). Total num frames: 984186880. Throughput: 0: 43601.6. Samples: 887134320. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-27 19:21:23,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:21:24,890][06909] Updated weights for policy 0, policy_version 60073 (0.0044) [2024-06-27 19:21:28,811][06909] Updated weights for policy 0, policy_version 60083 (0.0023) [2024-06-27 19:21:28,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.7, 300 sec: 43653.6). Total num frames: 984399872. Throughput: 0: 43795.8. Samples: 887272840. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-27 19:21:28,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 19:21:32,317][06909] Updated weights for policy 0, policy_version 60093 (0.0033) [2024-06-27 19:21:33,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43963.7, 300 sec: 43653.6). Total num frames: 984612864. Throughput: 0: 43810.8. Samples: 887525400. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-27 19:21:33,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:21:36,567][06909] Updated weights for policy 0, policy_version 60103 (0.0020) [2024-06-27 19:21:38,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43690.5, 300 sec: 43764.7). Total num frames: 984842240. Throughput: 0: 43795.0. Samples: 887787780. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-27 19:21:38,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:21:39,817][06909] Updated weights for policy 0, policy_version 60113 (0.0026) [2024-06-27 19:21:43,723][06887] Signal inference workers to stop experience collection... (12650 times) [2024-06-27 19:21:43,726][06887] Signal inference workers to resume experience collection... (12650 times) [2024-06-27 19:21:43,757][06909] InferenceWorker_p0-w0: stopping experience collection (12650 times) [2024-06-27 19:21:43,757][06909] InferenceWorker_p0-w0: resuming experience collection (12650 times) [2024-06-27 19:21:43,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 985038848. Throughput: 0: 43903.2. Samples: 887928440. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2024-06-27 19:21:43,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:21:43,863][06909] Updated weights for policy 0, policy_version 60123 (0.0028) [2024-06-27 19:21:47,270][06909] Updated weights for policy 0, policy_version 60133 (0.0031) [2024-06-27 19:21:48,850][06674] Fps is (10 sec: 42599.2, 60 sec: 43968.2, 300 sec: 43653.7). Total num frames: 985268224. Throughput: 0: 44040.4. Samples: 888187660. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2024-06-27 19:21:48,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:21:48,859][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000060136_985268224.pth... [2024-06-27 19:21:48,896][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000059497_974798848.pth [2024-06-27 19:21:51,106][06909] Updated weights for policy 0, policy_version 60143 (0.0041) [2024-06-27 19:21:53,850][06674] Fps is (10 sec: 47513.6, 60 sec: 43963.8, 300 sec: 43820.3). Total num frames: 985513984. Throughput: 0: 43945.8. Samples: 888450200. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2024-06-27 19:21:53,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:21:54,606][06909] Updated weights for policy 0, policy_version 60153 (0.0036) [2024-06-27 19:21:58,726][06909] Updated weights for policy 0, policy_version 60163 (0.0035) [2024-06-27 19:21:58,856][06674] Fps is (10 sec: 44209.7, 60 sec: 44232.3, 300 sec: 43708.3). Total num frames: 985710592. Throughput: 0: 43938.0. Samples: 888590380. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2024-06-27 19:21:58,856][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:22:02,136][06909] Updated weights for policy 0, policy_version 60173 (0.0042) [2024-06-27 19:22:03,850][06674] Fps is (10 sec: 39321.6, 60 sec: 43691.0, 300 sec: 43598.4). Total num frames: 985907200. Throughput: 0: 43937.0. Samples: 888848480. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2024-06-27 19:22:03,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 19:22:06,301][06909] Updated weights for policy 0, policy_version 60183 (0.0035) [2024-06-27 19:22:08,856][06674] Fps is (10 sec: 44237.1, 60 sec: 43686.2, 300 sec: 43763.8). Total num frames: 986152960. Throughput: 0: 43833.3. Samples: 889107080. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2024-06-27 19:22:08,865][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:22:09,452][06909] Updated weights for policy 0, policy_version 60193 (0.0048) [2024-06-27 19:22:13,850][06674] Fps is (10 sec: 44235.9, 60 sec: 43690.5, 300 sec: 43598.2). Total num frames: 986349568. Throughput: 0: 43851.5. Samples: 889246160. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2024-06-27 19:22:13,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:22:14,043][06909] Updated weights for policy 0, policy_version 60203 (0.0023) [2024-06-27 19:22:17,392][06909] Updated weights for policy 0, policy_version 60213 (0.0032) [2024-06-27 19:22:18,852][06674] Fps is (10 sec: 44254.5, 60 sec: 44235.3, 300 sec: 43708.9). Total num frames: 986595328. Throughput: 0: 43899.3. Samples: 889500960. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2024-06-27 19:22:18,852][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:22:21,512][06909] Updated weights for policy 0, policy_version 60223 (0.0027) [2024-06-27 19:22:23,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43690.6, 300 sec: 43709.1). Total num frames: 986808320. Throughput: 0: 43841.8. Samples: 889760660. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2024-06-27 19:22:23,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:22:24,797][06909] Updated weights for policy 0, policy_version 60233 (0.0030) [2024-06-27 19:22:28,837][06909] Updated weights for policy 0, policy_version 60243 (0.0046) [2024-06-27 19:22:28,850][06674] Fps is (10 sec: 42607.1, 60 sec: 43690.7, 300 sec: 43653.7). Total num frames: 987021312. Throughput: 0: 43731.9. Samples: 889896380. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2024-06-27 19:22:28,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:22:32,373][06909] Updated weights for policy 0, policy_version 60253 (0.0025) [2024-06-27 19:22:33,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 987234304. Throughput: 0: 43767.9. Samples: 890157220. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2024-06-27 19:22:33,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:22:36,030][06909] Updated weights for policy 0, policy_version 60263 (0.0027) [2024-06-27 19:22:38,852][06674] Fps is (10 sec: 44227.8, 60 sec: 43689.3, 300 sec: 43764.4). Total num frames: 987463680. Throughput: 0: 43789.5. Samples: 890420820. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2024-06-27 19:22:38,852][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 19:22:39,834][06909] Updated weights for policy 0, policy_version 60273 (0.0022) [2024-06-27 19:22:43,689][06909] Updated weights for policy 0, policy_version 60283 (0.0035) [2024-06-27 19:22:43,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.6, 300 sec: 43709.2). Total num frames: 987676672. Throughput: 0: 43831.2. Samples: 890562520. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2024-06-27 19:22:43,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:22:47,151][06909] Updated weights for policy 0, policy_version 60293 (0.0025) [2024-06-27 19:22:48,850][06674] Fps is (10 sec: 42606.9, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 987889664. Throughput: 0: 43759.0. Samples: 890817640. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2024-06-27 19:22:48,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:22:51,305][06909] Updated weights for policy 0, policy_version 60303 (0.0039) [2024-06-27 19:22:51,571][06887] Signal inference workers to stop experience collection... (12700 times) [2024-06-27 19:22:51,584][06909] InferenceWorker_p0-w0: stopping experience collection (12700 times) [2024-06-27 19:22:51,685][06887] Signal inference workers to resume experience collection... (12700 times) [2024-06-27 19:22:51,686][06909] InferenceWorker_p0-w0: resuming experience collection (12700 times) [2024-06-27 19:22:53,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43417.6, 300 sec: 43764.7). Total num frames: 988119040. Throughput: 0: 43796.1. Samples: 891077640. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2024-06-27 19:22:53,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:22:54,503][06909] Updated weights for policy 0, policy_version 60313 (0.0020) [2024-06-27 19:22:58,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43422.1, 300 sec: 43709.2). Total num frames: 988315648. Throughput: 0: 43685.1. Samples: 891211980. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-27 19:22:58,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:22:58,871][06909] Updated weights for policy 0, policy_version 60323 (0.0033) [2024-06-27 19:23:02,360][06909] Updated weights for policy 0, policy_version 60333 (0.0023) [2024-06-27 19:23:03,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.6, 300 sec: 43709.2). Total num frames: 988545024. Throughput: 0: 43840.2. Samples: 891473680. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-27 19:23:03,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 19:23:06,168][06909] Updated weights for policy 0, policy_version 60343 (0.0035) [2024-06-27 19:23:08,850][06674] Fps is (10 sec: 45874.2, 60 sec: 43694.9, 300 sec: 43710.1). Total num frames: 988774400. Throughput: 0: 43849.8. Samples: 891733900. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-27 19:23:08,851][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:23:09,726][06909] Updated weights for policy 0, policy_version 60353 (0.0037) [2024-06-27 19:23:13,532][06909] Updated weights for policy 0, policy_version 60363 (0.0037) [2024-06-27 19:23:13,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.8, 300 sec: 43764.7). Total num frames: 989003776. Throughput: 0: 43802.2. Samples: 891867480. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-27 19:23:13,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:23:16,994][06909] Updated weights for policy 0, policy_version 60373 (0.0037) [2024-06-27 19:23:18,852][06674] Fps is (10 sec: 40952.3, 60 sec: 43144.5, 300 sec: 43597.8). Total num frames: 989184000. Throughput: 0: 43775.5. Samples: 892127200. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-27 19:23:18,852][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:23:21,156][06909] Updated weights for policy 0, policy_version 60383 (0.0035) [2024-06-27 19:23:23,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.8, 300 sec: 43764.7). Total num frames: 989429760. Throughput: 0: 43730.4. Samples: 892388600. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-27 19:23:23,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:23:24,983][06909] Updated weights for policy 0, policy_version 60393 (0.0031) [2024-06-27 19:23:28,717][06909] Updated weights for policy 0, policy_version 60403 (0.0043) [2024-06-27 19:23:28,850][06674] Fps is (10 sec: 45884.8, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 989642752. Throughput: 0: 43617.9. Samples: 892525320. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-27 19:23:28,850][06674] Avg episode reward: [(0, '0.400')] [2024-06-27 19:23:32,535][06909] Updated weights for policy 0, policy_version 60413 (0.0030) [2024-06-27 19:23:33,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 989855744. Throughput: 0: 43791.6. Samples: 892788260. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-27 19:23:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:23:36,226][06909] Updated weights for policy 0, policy_version 60423 (0.0046) [2024-06-27 19:23:38,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43692.1, 300 sec: 43764.8). Total num frames: 990085120. Throughput: 0: 43706.2. Samples: 893044420. Policy #0 lag: (min: 0.0, avg: 7.6, max: 20.0) [2024-06-27 19:23:38,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:23:40,181][06909] Updated weights for policy 0, policy_version 60433 (0.0030) [2024-06-27 19:23:43,472][06909] Updated weights for policy 0, policy_version 60443 (0.0033) [2024-06-27 19:23:43,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43690.8, 300 sec: 43764.7). Total num frames: 990298112. Throughput: 0: 43669.8. Samples: 893177120. Policy #0 lag: (min: 0.0, avg: 7.6, max: 20.0) [2024-06-27 19:23:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:23:47,637][06909] Updated weights for policy 0, policy_version 60453 (0.0037) [2024-06-27 19:23:48,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43417.6, 300 sec: 43542.6). Total num frames: 990494720. Throughput: 0: 43696.1. Samples: 893440000. Policy #0 lag: (min: 0.0, avg: 7.6, max: 20.0) [2024-06-27 19:23:48,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:23:48,856][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000060455_990494720.pth... [2024-06-27 19:23:48,926][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000059815_980008960.pth [2024-06-27 19:23:50,982][06909] Updated weights for policy 0, policy_version 60463 (0.0034) [2024-06-27 19:23:53,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 990740480. Throughput: 0: 43604.6. Samples: 893696100. Policy #0 lag: (min: 0.0, avg: 7.6, max: 20.0) [2024-06-27 19:23:53,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:23:55,311][06909] Updated weights for policy 0, policy_version 60473 (0.0040) [2024-06-27 19:23:58,610][06909] Updated weights for policy 0, policy_version 60483 (0.0043) [2024-06-27 19:23:58,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43963.7, 300 sec: 43764.7). Total num frames: 990953472. Throughput: 0: 43703.3. Samples: 893834120. Policy #0 lag: (min: 0.0, avg: 7.6, max: 20.0) [2024-06-27 19:23:58,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:24:02,812][06909] Updated weights for policy 0, policy_version 60493 (0.0034) [2024-06-27 19:24:02,967][06887] Signal inference workers to stop experience collection... (12750 times) [2024-06-27 19:24:02,970][06887] Signal inference workers to resume experience collection... (12750 times) [2024-06-27 19:24:02,985][06909] InferenceWorker_p0-w0: stopping experience collection (12750 times) [2024-06-27 19:24:03,016][06909] InferenceWorker_p0-w0: resuming experience collection (12750 times) [2024-06-27 19:24:03,851][06674] Fps is (10 sec: 42594.3, 60 sec: 43690.0, 300 sec: 43653.5). Total num frames: 991166464. Throughput: 0: 43849.0. Samples: 894100360. Policy #0 lag: (min: 0.0, avg: 7.6, max: 20.0) [2024-06-27 19:24:03,851][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:24:06,108][06909] Updated weights for policy 0, policy_version 60503 (0.0043) [2024-06-27 19:24:08,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.8, 300 sec: 43764.7). Total num frames: 991395840. Throughput: 0: 43627.2. Samples: 894351820. Policy #0 lag: (min: 0.0, avg: 7.6, max: 20.0) [2024-06-27 19:24:08,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 19:24:10,177][06909] Updated weights for policy 0, policy_version 60513 (0.0033) [2024-06-27 19:24:13,795][06909] Updated weights for policy 0, policy_version 60523 (0.0031) [2024-06-27 19:24:13,850][06674] Fps is (10 sec: 44241.1, 60 sec: 43417.6, 300 sec: 43764.7). Total num frames: 991608832. Throughput: 0: 43582.1. Samples: 894486520. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 19:24:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:24:17,759][06909] Updated weights for policy 0, policy_version 60533 (0.0037) [2024-06-27 19:24:18,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43965.2, 300 sec: 43653.6). Total num frames: 991821824. Throughput: 0: 43583.6. Samples: 894749520. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 19:24:18,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:24:21,009][06909] Updated weights for policy 0, policy_version 60543 (0.0036) [2024-06-27 19:24:23,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 992051200. Throughput: 0: 43656.9. Samples: 895008980. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 19:24:23,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:24:25,362][06909] Updated weights for policy 0, policy_version 60553 (0.0024) [2024-06-27 19:24:28,623][06909] Updated weights for policy 0, policy_version 60563 (0.0029) [2024-06-27 19:24:28,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.6, 300 sec: 43820.2). Total num frames: 992280576. Throughput: 0: 43641.6. Samples: 895141000. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 19:24:28,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:24:32,855][06909] Updated weights for policy 0, policy_version 60573 (0.0020) [2024-06-27 19:24:33,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 992477184. Throughput: 0: 43588.5. Samples: 895401480. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 19:24:33,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:24:36,191][06909] Updated weights for policy 0, policy_version 60583 (0.0026) [2024-06-27 19:24:38,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.7, 300 sec: 43820.2). Total num frames: 992706560. Throughput: 0: 43652.9. Samples: 895660480. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 19:24:38,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:24:40,299][06909] Updated weights for policy 0, policy_version 60593 (0.0036) [2024-06-27 19:24:43,492][06909] Updated weights for policy 0, policy_version 60603 (0.0027) [2024-06-27 19:24:43,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 992919552. Throughput: 0: 43606.2. Samples: 895796400. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 19:24:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:24:47,773][06909] Updated weights for policy 0, policy_version 60613 (0.0040) [2024-06-27 19:24:48,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 43653.7). Total num frames: 993132544. Throughput: 0: 43586.3. Samples: 896061700. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 19:24:48,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:24:50,871][06909] Updated weights for policy 0, policy_version 60623 (0.0026) [2024-06-27 19:24:53,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 993361920. Throughput: 0: 43727.5. Samples: 896319560. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-27 19:24:53,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:24:55,271][06909] Updated weights for policy 0, policy_version 60633 (0.0031) [2024-06-27 19:24:58,782][06909] Updated weights for policy 0, policy_version 60643 (0.0038) [2024-06-27 19:24:58,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 993574912. Throughput: 0: 43738.2. Samples: 896454740. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-27 19:24:58,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:25:02,937][06909] Updated weights for policy 0, policy_version 60653 (0.0031) [2024-06-27 19:25:03,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43418.3, 300 sec: 43542.6). Total num frames: 993771520. Throughput: 0: 43615.6. Samples: 896712220. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-27 19:25:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 19:25:06,070][06909] Updated weights for policy 0, policy_version 60663 (0.0029) [2024-06-27 19:25:08,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 994000896. Throughput: 0: 43604.9. Samples: 896971200. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-27 19:25:08,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:25:10,465][06909] Updated weights for policy 0, policy_version 60673 (0.0044) [2024-06-27 19:25:13,407][06909] Updated weights for policy 0, policy_version 60683 (0.0036) [2024-06-27 19:25:13,850][06674] Fps is (10 sec: 47513.6, 60 sec: 43963.8, 300 sec: 43820.3). Total num frames: 994246656. Throughput: 0: 43589.4. Samples: 897102520. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-27 19:25:13,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:25:17,870][06909] Updated weights for policy 0, policy_version 60693 (0.0027) [2024-06-27 19:25:18,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.7, 300 sec: 43653.7). Total num frames: 994443264. Throughput: 0: 43790.2. Samples: 897372040. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-27 19:25:18,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:25:20,982][06909] Updated weights for policy 0, policy_version 60703 (0.0034) [2024-06-27 19:25:23,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 994672640. Throughput: 0: 43815.6. Samples: 897632180. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-27 19:25:23,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:25:25,238][06909] Updated weights for policy 0, policy_version 60713 (0.0047) [2024-06-27 19:25:28,493][06909] Updated weights for policy 0, policy_version 60723 (0.0046) [2024-06-27 19:25:28,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43417.7, 300 sec: 43764.7). Total num frames: 994885632. Throughput: 0: 43691.6. Samples: 897762520. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 19:25:28,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:25:31,496][06887] Signal inference workers to stop experience collection... (12800 times) [2024-06-27 19:25:31,555][06909] InferenceWorker_p0-w0: stopping experience collection (12800 times) [2024-06-27 19:25:31,561][06887] Signal inference workers to resume experience collection... (12800 times) [2024-06-27 19:25:31,572][06909] InferenceWorker_p0-w0: resuming experience collection (12800 times) [2024-06-27 19:25:32,570][06909] Updated weights for policy 0, policy_version 60733 (0.0037) [2024-06-27 19:25:33,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 995115008. Throughput: 0: 43744.5. Samples: 898030200. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 19:25:33,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:25:35,782][06909] Updated weights for policy 0, policy_version 60743 (0.0032) [2024-06-27 19:25:38,850][06674] Fps is (10 sec: 44236.0, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 995328000. Throughput: 0: 43767.9. Samples: 898289120. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 19:25:38,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:25:40,279][06909] Updated weights for policy 0, policy_version 60753 (0.0024) [2024-06-27 19:25:43,542][06909] Updated weights for policy 0, policy_version 60763 (0.0036) [2024-06-27 19:25:43,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43963.7, 300 sec: 43821.1). Total num frames: 995557376. Throughput: 0: 43627.5. Samples: 898417980. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 19:25:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:25:47,498][06909] Updated weights for policy 0, policy_version 60773 (0.0044) [2024-06-27 19:25:48,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43690.7, 300 sec: 43653.7). Total num frames: 995753984. Throughput: 0: 43672.5. Samples: 898677480. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 19:25:48,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:25:48,862][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000060776_995753984.pth... [2024-06-27 19:25:48,926][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000060136_985268224.pth [2024-06-27 19:25:50,894][06909] Updated weights for policy 0, policy_version 60783 (0.0032) [2024-06-27 19:25:53,850][06674] Fps is (10 sec: 40960.7, 60 sec: 43417.6, 300 sec: 43764.7). Total num frames: 995966976. Throughput: 0: 43770.7. Samples: 898940880. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 19:25:53,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 19:25:55,293][06909] Updated weights for policy 0, policy_version 60793 (0.0039) [2024-06-27 19:25:58,379][06909] Updated weights for policy 0, policy_version 60803 (0.0026) [2024-06-27 19:25:58,850][06674] Fps is (10 sec: 47513.2, 60 sec: 44236.8, 300 sec: 43875.8). Total num frames: 996229120. Throughput: 0: 43730.2. Samples: 899070380. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 19:25:58,854][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:26:02,838][06909] Updated weights for policy 0, policy_version 60813 (0.0029) [2024-06-27 19:26:03,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44236.8, 300 sec: 43709.2). Total num frames: 996425728. Throughput: 0: 43607.0. Samples: 899334360. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 19:26:03,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:26:06,061][06909] Updated weights for policy 0, policy_version 60823 (0.0037) [2024-06-27 19:26:08,850][06674] Fps is (10 sec: 39322.0, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 996622336. Throughput: 0: 43641.9. Samples: 899596060. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 19:26:08,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:26:10,309][06909] Updated weights for policy 0, policy_version 60833 (0.0035) [2024-06-27 19:26:13,327][06909] Updated weights for policy 0, policy_version 60843 (0.0038) [2024-06-27 19:26:13,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43690.6, 300 sec: 43820.2). Total num frames: 996868096. Throughput: 0: 43678.5. Samples: 899728060. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 19:26:13,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:26:17,700][06909] Updated weights for policy 0, policy_version 60853 (0.0031) [2024-06-27 19:26:18,852][06674] Fps is (10 sec: 44227.5, 60 sec: 43689.1, 300 sec: 43653.4). Total num frames: 997064704. Throughput: 0: 43498.9. Samples: 899987740. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 19:26:18,852][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:26:21,120][06909] Updated weights for policy 0, policy_version 60863 (0.0038) [2024-06-27 19:26:23,850][06674] Fps is (10 sec: 39322.1, 60 sec: 43144.6, 300 sec: 43598.1). Total num frames: 997261312. Throughput: 0: 43353.4. Samples: 900240020. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 19:26:23,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:26:25,421][06909] Updated weights for policy 0, policy_version 60873 (0.0040) [2024-06-27 19:26:28,610][06909] Updated weights for policy 0, policy_version 60883 (0.0033) [2024-06-27 19:26:28,850][06674] Fps is (10 sec: 44244.9, 60 sec: 43690.5, 300 sec: 43709.1). Total num frames: 997507072. Throughput: 0: 43415.4. Samples: 900371680. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 19:26:28,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:26:32,811][06909] Updated weights for policy 0, policy_version 60893 (0.0028) [2024-06-27 19:26:33,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43144.6, 300 sec: 43598.1). Total num frames: 997703680. Throughput: 0: 43477.4. Samples: 900633960. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 19:26:33,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:26:36,104][06909] Updated weights for policy 0, policy_version 60903 (0.0036) [2024-06-27 19:26:38,850][06674] Fps is (10 sec: 40960.7, 60 sec: 43144.6, 300 sec: 43653.6). Total num frames: 997916672. Throughput: 0: 43398.1. Samples: 900893800. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 19:26:38,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:26:40,472][06909] Updated weights for policy 0, policy_version 60913 (0.0044) [2024-06-27 19:26:43,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43144.6, 300 sec: 43653.6). Total num frames: 998146048. Throughput: 0: 43412.0. Samples: 901023920. Policy #0 lag: (min: 1.0, avg: 10.3, max: 21.0) [2024-06-27 19:26:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:26:44,068][06909] Updated weights for policy 0, policy_version 60923 (0.0043) [2024-06-27 19:26:45,322][06887] Signal inference workers to stop experience collection... (12850 times) [2024-06-27 19:26:45,323][06887] Signal inference workers to resume experience collection... (12850 times) [2024-06-27 19:26:45,350][06909] InferenceWorker_p0-w0: stopping experience collection (12850 times) [2024-06-27 19:26:45,350][06909] InferenceWorker_p0-w0: resuming experience collection (12850 times) [2024-06-27 19:26:47,983][06909] Updated weights for policy 0, policy_version 60933 (0.0030) [2024-06-27 19:26:48,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43417.5, 300 sec: 43542.5). Total num frames: 998359040. Throughput: 0: 43395.1. Samples: 901287140. Policy #0 lag: (min: 1.0, avg: 10.3, max: 21.0) [2024-06-27 19:26:48,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 19:26:51,525][06909] Updated weights for policy 0, policy_version 60943 (0.0039) [2024-06-27 19:26:53,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43417.5, 300 sec: 43599.0). Total num frames: 998572032. Throughput: 0: 43311.0. Samples: 901545060. Policy #0 lag: (min: 1.0, avg: 10.3, max: 21.0) [2024-06-27 19:26:53,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:26:55,357][06909] Updated weights for policy 0, policy_version 60953 (0.0031) [2024-06-27 19:26:58,850][06674] Fps is (10 sec: 44236.8, 60 sec: 42871.5, 300 sec: 43709.2). Total num frames: 998801408. Throughput: 0: 43363.2. Samples: 901679400. Policy #0 lag: (min: 1.0, avg: 10.3, max: 21.0) [2024-06-27 19:26:58,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:26:59,277][06909] Updated weights for policy 0, policy_version 60963 (0.0029) [2024-06-27 19:27:03,247][06909] Updated weights for policy 0, policy_version 60973 (0.0041) [2024-06-27 19:27:03,850][06674] Fps is (10 sec: 42598.3, 60 sec: 42871.4, 300 sec: 43543.4). Total num frames: 998998016. Throughput: 0: 43234.4. Samples: 901933200. Policy #0 lag: (min: 1.0, avg: 10.3, max: 21.0) [2024-06-27 19:27:03,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 19:27:06,662][06909] Updated weights for policy 0, policy_version 60983 (0.0026) [2024-06-27 19:27:08,856][06674] Fps is (10 sec: 40935.3, 60 sec: 43140.1, 300 sec: 43597.2). Total num frames: 999211008. Throughput: 0: 43476.8. Samples: 902196740. Policy #0 lag: (min: 1.0, avg: 10.3, max: 21.0) [2024-06-27 19:27:08,857][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:27:10,441][06909] Updated weights for policy 0, policy_version 60993 (0.0039) [2024-06-27 19:27:13,850][06674] Fps is (10 sec: 45875.8, 60 sec: 43144.7, 300 sec: 43598.4). Total num frames: 999456768. Throughput: 0: 43491.8. Samples: 902328800. Policy #0 lag: (min: 1.0, avg: 10.3, max: 21.0) [2024-06-27 19:27:13,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 19:27:13,903][06909] Updated weights for policy 0, policy_version 61003 (0.0026) [2024-06-27 19:27:18,034][06909] Updated weights for policy 0, policy_version 61013 (0.0029) [2024-06-27 19:27:18,850][06674] Fps is (10 sec: 47542.6, 60 sec: 43692.2, 300 sec: 43653.7). Total num frames: 999686144. Throughput: 0: 43638.2. Samples: 902597680. Policy #0 lag: (min: 1.0, avg: 10.3, max: 21.0) [2024-06-27 19:27:18,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:27:21,110][06909] Updated weights for policy 0, policy_version 61023 (0.0042) [2024-06-27 19:27:23,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 999882752. Throughput: 0: 43749.8. Samples: 902862540. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-27 19:27:23,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:27:25,366][06909] Updated weights for policy 0, policy_version 61033 (0.0025) [2024-06-27 19:27:28,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43417.7, 300 sec: 43653.6). Total num frames: 1000112128. Throughput: 0: 43735.0. Samples: 902992000. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-27 19:27:28,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 19:27:29,030][06909] Updated weights for policy 0, policy_version 61043 (0.0036) [2024-06-27 19:27:32,758][06909] Updated weights for policy 0, policy_version 61053 (0.0036) [2024-06-27 19:27:33,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43690.7, 300 sec: 43598.4). Total num frames: 1000325120. Throughput: 0: 43752.1. Samples: 903255980. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-27 19:27:33,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:27:36,320][06909] Updated weights for policy 0, policy_version 61063 (0.0021) [2024-06-27 19:27:38,850][06674] Fps is (10 sec: 40960.7, 60 sec: 43417.7, 300 sec: 43542.6). Total num frames: 1000521728. Throughput: 0: 43912.1. Samples: 903521100. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-27 19:27:38,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:27:40,083][06909] Updated weights for policy 0, policy_version 61073 (0.0020) [2024-06-27 19:27:43,634][06909] Updated weights for policy 0, policy_version 61083 (0.0036) [2024-06-27 19:27:43,850][06674] Fps is (10 sec: 45874.3, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 1000783872. Throughput: 0: 43760.4. Samples: 903648620. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-27 19:27:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:27:47,916][06909] Updated weights for policy 0, policy_version 61093 (0.0034) [2024-06-27 19:27:48,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 1000980480. Throughput: 0: 43933.0. Samples: 903910180. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-27 19:27:48,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:27:48,937][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000061096_1000996864.pth... [2024-06-27 19:27:48,985][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000060455_990494720.pth [2024-06-27 19:27:51,315][06909] Updated weights for policy 0, policy_version 61103 (0.0042) [2024-06-27 19:27:53,850][06674] Fps is (10 sec: 39322.3, 60 sec: 43417.7, 300 sec: 43598.1). Total num frames: 1001177088. Throughput: 0: 44015.3. Samples: 904177160. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-27 19:27:53,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:27:55,194][06909] Updated weights for policy 0, policy_version 61113 (0.0024) [2024-06-27 19:27:58,802][06909] Updated weights for policy 0, policy_version 61123 (0.0038) [2024-06-27 19:27:58,852][06674] Fps is (10 sec: 45865.7, 60 sec: 43962.3, 300 sec: 43708.9). Total num frames: 1001439232. Throughput: 0: 43817.9. Samples: 904300700. Policy #0 lag: (min: 0.0, avg: 11.6, max: 20.0) [2024-06-27 19:27:58,852][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:28:02,465][06909] Updated weights for policy 0, policy_version 61133 (0.0032) [2024-06-27 19:28:03,850][06674] Fps is (10 sec: 47512.1, 60 sec: 44236.6, 300 sec: 43653.6). Total num frames: 1001652224. Throughput: 0: 43670.8. Samples: 904562880. Policy #0 lag: (min: 0.0, avg: 11.6, max: 20.0) [2024-06-27 19:28:03,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:28:06,235][06909] Updated weights for policy 0, policy_version 61143 (0.0028) [2024-06-27 19:28:08,850][06674] Fps is (10 sec: 40968.5, 60 sec: 43968.2, 300 sec: 43542.6). Total num frames: 1001848832. Throughput: 0: 43969.4. Samples: 904841160. Policy #0 lag: (min: 0.0, avg: 11.6, max: 20.0) [2024-06-27 19:28:08,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:28:09,023][06887] Signal inference workers to stop experience collection... (12900 times) [2024-06-27 19:28:09,024][06887] Signal inference workers to resume experience collection... (12900 times) [2024-06-27 19:28:09,071][06909] InferenceWorker_p0-w0: stopping experience collection (12900 times) [2024-06-27 19:28:09,071][06909] InferenceWorker_p0-w0: resuming experience collection (12900 times) [2024-06-27 19:28:09,723][06909] Updated weights for policy 0, policy_version 61153 (0.0039) [2024-06-27 19:28:13,784][06909] Updated weights for policy 0, policy_version 61163 (0.0040) [2024-06-27 19:28:13,850][06674] Fps is (10 sec: 44237.7, 60 sec: 43963.6, 300 sec: 43765.0). Total num frames: 1002094592. Throughput: 0: 43797.0. Samples: 904962860. Policy #0 lag: (min: 0.0, avg: 11.6, max: 20.0) [2024-06-27 19:28:13,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:28:17,331][06909] Updated weights for policy 0, policy_version 61173 (0.0034) [2024-06-27 19:28:18,852][06674] Fps is (10 sec: 45865.6, 60 sec: 43689.2, 300 sec: 43653.3). Total num frames: 1002307584. Throughput: 0: 43659.2. Samples: 905220740. Policy #0 lag: (min: 0.0, avg: 11.6, max: 20.0) [2024-06-27 19:28:18,852][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:28:21,060][06909] Updated weights for policy 0, policy_version 61183 (0.0028) [2024-06-27 19:28:23,856][06674] Fps is (10 sec: 40935.3, 60 sec: 43686.2, 300 sec: 43597.2). Total num frames: 1002504192. Throughput: 0: 43812.7. Samples: 905492940. Policy #0 lag: (min: 0.0, avg: 11.6, max: 20.0) [2024-06-27 19:28:23,856][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:28:25,041][06909] Updated weights for policy 0, policy_version 61193 (0.0032) [2024-06-27 19:28:28,594][06909] Updated weights for policy 0, policy_version 61203 (0.0030) [2024-06-27 19:28:28,850][06674] Fps is (10 sec: 44245.2, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 1002749952. Throughput: 0: 43609.7. Samples: 905611060. Policy #0 lag: (min: 0.0, avg: 11.6, max: 20.0) [2024-06-27 19:28:28,851][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:28:32,798][06909] Updated weights for policy 0, policy_version 61213 (0.0030) [2024-06-27 19:28:33,850][06674] Fps is (10 sec: 47541.9, 60 sec: 44236.6, 300 sec: 43709.2). Total num frames: 1002979328. Throughput: 0: 43778.1. Samples: 905880200. Policy #0 lag: (min: 0.0, avg: 11.6, max: 20.0) [2024-06-27 19:28:33,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 19:28:35,937][06909] Updated weights for policy 0, policy_version 61223 (0.0035) [2024-06-27 19:28:38,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43963.6, 300 sec: 43598.1). Total num frames: 1003159552. Throughput: 0: 43791.9. Samples: 906147800. Policy #0 lag: (min: 0.0, avg: 11.5, max: 20.0) [2024-06-27 19:28:38,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:28:40,198][06909] Updated weights for policy 0, policy_version 61233 (0.0032) [2024-06-27 19:28:43,690][06909] Updated weights for policy 0, policy_version 61243 (0.0031) [2024-06-27 19:28:43,850][06674] Fps is (10 sec: 42599.2, 60 sec: 43690.8, 300 sec: 43764.7). Total num frames: 1003405312. Throughput: 0: 43919.4. Samples: 906276980. Policy #0 lag: (min: 0.0, avg: 11.5, max: 20.0) [2024-06-27 19:28:43,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:28:47,533][06909] Updated weights for policy 0, policy_version 61253 (0.0028) [2024-06-27 19:28:48,850][06674] Fps is (10 sec: 47513.8, 60 sec: 44236.8, 300 sec: 43709.2). Total num frames: 1003634688. Throughput: 0: 43811.8. Samples: 906534400. Policy #0 lag: (min: 0.0, avg: 11.5, max: 20.0) [2024-06-27 19:28:48,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:28:51,169][06909] Updated weights for policy 0, policy_version 61263 (0.0030) [2024-06-27 19:28:53,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43963.7, 300 sec: 43598.1). Total num frames: 1003814912. Throughput: 0: 43676.4. Samples: 906806600. Policy #0 lag: (min: 0.0, avg: 11.5, max: 20.0) [2024-06-27 19:28:53,850][06674] Avg episode reward: [(0, '0.396')] [2024-06-27 19:28:55,224][06909] Updated weights for policy 0, policy_version 61273 (0.0033) [2024-06-27 19:28:58,516][06909] Updated weights for policy 0, policy_version 61283 (0.0033) [2024-06-27 19:28:58,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43692.1, 300 sec: 43709.3). Total num frames: 1004060672. Throughput: 0: 43686.3. Samples: 906928740. Policy #0 lag: (min: 0.0, avg: 11.5, max: 20.0) [2024-06-27 19:28:58,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 19:29:02,763][06909] Updated weights for policy 0, policy_version 61293 (0.0024) [2024-06-27 19:29:03,850][06674] Fps is (10 sec: 47513.2, 60 sec: 43963.9, 300 sec: 43709.2). Total num frames: 1004290048. Throughput: 0: 43893.0. Samples: 907195840. Policy #0 lag: (min: 0.0, avg: 11.5, max: 20.0) [2024-06-27 19:29:03,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:29:06,160][06909] Updated weights for policy 0, policy_version 61303 (0.0029) [2024-06-27 19:29:08,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 1004470272. Throughput: 0: 43586.8. Samples: 907454080. Policy #0 lag: (min: 0.0, avg: 11.5, max: 20.0) [2024-06-27 19:29:08,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 19:29:10,258][06909] Updated weights for policy 0, policy_version 61313 (0.0041) [2024-06-27 19:29:13,511][06909] Updated weights for policy 0, policy_version 61323 (0.0031) [2024-06-27 19:29:13,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 1004716032. Throughput: 0: 43638.0. Samples: 907574760. Policy #0 lag: (min: 0.0, avg: 11.5, max: 20.0) [2024-06-27 19:29:13,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:29:17,576][06909] Updated weights for policy 0, policy_version 61333 (0.0029) [2024-06-27 19:29:18,850][06674] Fps is (10 sec: 47513.5, 60 sec: 43965.2, 300 sec: 43709.2). Total num frames: 1004945408. Throughput: 0: 43771.6. Samples: 907849920. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-27 19:29:18,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:29:21,275][06909] Updated weights for policy 0, policy_version 61343 (0.0032) [2024-06-27 19:29:23,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43695.1, 300 sec: 43542.6). Total num frames: 1005125632. Throughput: 0: 43636.1. Samples: 908111420. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-27 19:29:23,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:29:25,215][06909] Updated weights for policy 0, policy_version 61353 (0.0029) [2024-06-27 19:29:28,730][06909] Updated weights for policy 0, policy_version 61363 (0.0031) [2024-06-27 19:29:28,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 1005371392. Throughput: 0: 43542.5. Samples: 908236400. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-27 19:29:28,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:29:31,606][06887] Signal inference workers to stop experience collection... (12950 times) [2024-06-27 19:29:31,635][06909] InferenceWorker_p0-w0: stopping experience collection (12950 times) [2024-06-27 19:29:31,660][06887] Signal inference workers to resume experience collection... (12950 times) [2024-06-27 19:29:31,664][06909] InferenceWorker_p0-w0: resuming experience collection (12950 times) [2024-06-27 19:29:32,618][06909] Updated weights for policy 0, policy_version 61373 (0.0035) [2024-06-27 19:29:33,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43417.7, 300 sec: 43653.7). Total num frames: 1005584384. Throughput: 0: 43758.3. Samples: 908503520. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-27 19:29:33,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:29:36,130][06909] Updated weights for policy 0, policy_version 61383 (0.0034) [2024-06-27 19:29:38,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.8, 300 sec: 43709.2). Total num frames: 1005813760. Throughput: 0: 43659.5. Samples: 908771280. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-27 19:29:38,851][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:29:40,111][06909] Updated weights for policy 0, policy_version 61393 (0.0028) [2024-06-27 19:29:43,399][06909] Updated weights for policy 0, policy_version 61403 (0.0040) [2024-06-27 19:29:43,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 1006026752. Throughput: 0: 43776.5. Samples: 908898680. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-27 19:29:43,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 19:29:47,718][06909] Updated weights for policy 0, policy_version 61413 (0.0028) [2024-06-27 19:29:48,852][06674] Fps is (10 sec: 44227.5, 60 sec: 43689.1, 300 sec: 43708.9). Total num frames: 1006256128. Throughput: 0: 43789.1. Samples: 909166440. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-27 19:29:48,853][06674] Avg episode reward: [(0, '0.409')] [2024-06-27 19:29:48,867][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000061417_1006256128.pth... [2024-06-27 19:29:48,915][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000060776_995753984.pth [2024-06-27 19:29:51,203][06909] Updated weights for policy 0, policy_version 61423 (0.0042) [2024-06-27 19:29:53,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.7, 300 sec: 43653.7). Total num frames: 1006452736. Throughput: 0: 43861.8. Samples: 909427860. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 19:29:53,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 19:29:55,026][06909] Updated weights for policy 0, policy_version 61433 (0.0027) [2024-06-27 19:29:58,464][06909] Updated weights for policy 0, policy_version 61443 (0.0030) [2024-06-27 19:29:58,850][06674] Fps is (10 sec: 42607.2, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 1006682112. Throughput: 0: 44050.1. Samples: 909557020. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 19:29:58,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:30:02,397][06909] Updated weights for policy 0, policy_version 61453 (0.0033) [2024-06-27 19:30:03,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 1006911488. Throughput: 0: 43826.3. Samples: 909822100. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 19:30:03,851][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 19:30:05,828][06909] Updated weights for policy 0, policy_version 61463 (0.0031) [2024-06-27 19:30:08,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43690.7, 300 sec: 43542.6). Total num frames: 1007091712. Throughput: 0: 43885.8. Samples: 910086280. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 19:30:08,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 19:30:09,841][06909] Updated weights for policy 0, policy_version 61473 (0.0035) [2024-06-27 19:30:13,439][06909] Updated weights for policy 0, policy_version 61483 (0.0038) [2024-06-27 19:30:13,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 1007337472. Throughput: 0: 43899.7. Samples: 910211880. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 19:30:13,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:30:17,349][06909] Updated weights for policy 0, policy_version 61493 (0.0036) [2024-06-27 19:30:18,850][06674] Fps is (10 sec: 47513.1, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 1007566848. Throughput: 0: 43689.7. Samples: 910469560. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 19:30:18,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 19:30:21,084][06909] Updated weights for policy 0, policy_version 61503 (0.0026) [2024-06-27 19:30:23,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.8, 300 sec: 43653.6). Total num frames: 1007763456. Throughput: 0: 43807.2. Samples: 910742600. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 19:30:23,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:30:24,737][06909] Updated weights for policy 0, policy_version 61513 (0.0034) [2024-06-27 19:30:28,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43417.7, 300 sec: 43598.1). Total num frames: 1007976448. Throughput: 0: 43805.3. Samples: 910869920. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 19:30:28,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:30:29,072][06909] Updated weights for policy 0, policy_version 61523 (0.0031) [2024-06-27 19:30:32,399][06909] Updated weights for policy 0, policy_version 61533 (0.0038) [2024-06-27 19:30:33,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 1008222208. Throughput: 0: 43682.1. Samples: 911132040. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-27 19:30:33,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:30:36,465][06909] Updated weights for policy 0, policy_version 61543 (0.0032) [2024-06-27 19:30:38,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43417.7, 300 sec: 43598.1). Total num frames: 1008418816. Throughput: 0: 43820.5. Samples: 911399780. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-27 19:30:38,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:30:39,837][06909] Updated weights for policy 0, policy_version 61553 (0.0035) [2024-06-27 19:30:43,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43417.6, 300 sec: 43653.6). Total num frames: 1008631808. Throughput: 0: 43721.9. Samples: 911524500. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-27 19:30:43,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 19:30:43,910][06909] Updated weights for policy 0, policy_version 61563 (0.0034) [2024-06-27 19:30:47,402][06909] Updated weights for policy 0, policy_version 61573 (0.0038) [2024-06-27 19:30:48,850][06674] Fps is (10 sec: 45874.4, 60 sec: 43692.1, 300 sec: 43764.7). Total num frames: 1008877568. Throughput: 0: 43621.7. Samples: 911785080. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-27 19:30:48,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 19:30:51,139][06909] Updated weights for policy 0, policy_version 61583 (0.0037) [2024-06-27 19:30:53,850][06674] Fps is (10 sec: 44235.9, 60 sec: 43690.6, 300 sec: 43542.6). Total num frames: 1009074176. Throughput: 0: 43727.8. Samples: 912054040. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-27 19:30:53,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:30:54,841][06909] Updated weights for policy 0, policy_version 61593 (0.0033) [2024-06-27 19:30:57,488][06887] Signal inference workers to stop experience collection... (13000 times) [2024-06-27 19:30:57,538][06909] InferenceWorker_p0-w0: stopping experience collection (13000 times) [2024-06-27 19:30:57,543][06887] Signal inference workers to resume experience collection... (13000 times) [2024-06-27 19:30:57,552][06909] InferenceWorker_p0-w0: resuming experience collection (13000 times) [2024-06-27 19:30:58,850][06674] Fps is (10 sec: 40960.8, 60 sec: 43417.7, 300 sec: 43598.1). Total num frames: 1009287168. Throughput: 0: 43710.3. Samples: 912178840. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-27 19:30:58,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:30:58,929][06909] Updated weights for policy 0, policy_version 61603 (0.0028) [2024-06-27 19:31:02,211][06909] Updated weights for policy 0, policy_version 61613 (0.0043) [2024-06-27 19:31:03,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 1009516544. Throughput: 0: 43774.3. Samples: 912439400. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-27 19:31:03,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 19:31:06,779][06909] Updated weights for policy 0, policy_version 61623 (0.0026) [2024-06-27 19:31:08,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 43598.1). Total num frames: 1009729536. Throughput: 0: 43696.0. Samples: 912708920. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2024-06-27 19:31:08,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:31:09,682][06909] Updated weights for policy 0, policy_version 61633 (0.0029) [2024-06-27 19:31:13,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43417.6, 300 sec: 43653.9). Total num frames: 1009942528. Throughput: 0: 43698.6. Samples: 912836360. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2024-06-27 19:31:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:31:14,271][06909] Updated weights for policy 0, policy_version 61643 (0.0028) [2024-06-27 19:31:17,503][06909] Updated weights for policy 0, policy_version 61653 (0.0033) [2024-06-27 19:31:18,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43690.7, 300 sec: 43820.2). Total num frames: 1010188288. Throughput: 0: 43828.4. Samples: 913104320. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2024-06-27 19:31:18,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:31:21,597][06909] Updated weights for policy 0, policy_version 61663 (0.0030) [2024-06-27 19:31:23,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.7, 300 sec: 43653.7). Total num frames: 1010384896. Throughput: 0: 43564.9. Samples: 913360200. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2024-06-27 19:31:23,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:31:25,034][06909] Updated weights for policy 0, policy_version 61673 (0.0040) [2024-06-27 19:31:28,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 1010597888. Throughput: 0: 43602.1. Samples: 913486600. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2024-06-27 19:31:28,850][06674] Avg episode reward: [(0, '0.409')] [2024-06-27 19:31:29,031][06909] Updated weights for policy 0, policy_version 61683 (0.0025) [2024-06-27 19:31:32,317][06909] Updated weights for policy 0, policy_version 61693 (0.0026) [2024-06-27 19:31:33,852][06674] Fps is (10 sec: 45865.5, 60 sec: 43689.2, 300 sec: 43820.0). Total num frames: 1010843648. Throughput: 0: 43641.6. Samples: 913749040. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2024-06-27 19:31:33,852][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 19:31:36,357][06909] Updated weights for policy 0, policy_version 61703 (0.0057) [2024-06-27 19:31:38,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 1011040256. Throughput: 0: 43659.6. Samples: 914018720. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2024-06-27 19:31:38,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:31:39,729][06909] Updated weights for policy 0, policy_version 61713 (0.0027) [2024-06-27 19:31:43,856][06674] Fps is (10 sec: 40943.7, 60 sec: 43686.2, 300 sec: 43708.3). Total num frames: 1011253248. Throughput: 0: 43678.1. Samples: 914144620. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2024-06-27 19:31:43,856][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:31:44,141][06909] Updated weights for policy 0, policy_version 61723 (0.0026) [2024-06-27 19:31:46,986][06909] Updated weights for policy 0, policy_version 61733 (0.0028) [2024-06-27 19:31:48,852][06674] Fps is (10 sec: 45866.2, 60 sec: 43689.3, 300 sec: 43820.0). Total num frames: 1011499008. Throughput: 0: 43677.5. Samples: 914404980. Policy #0 lag: (min: 1.0, avg: 10.2, max: 21.0) [2024-06-27 19:31:48,852][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:31:48,861][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000061737_1011499008.pth... [2024-06-27 19:31:48,914][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000061096_1000996864.pth [2024-06-27 19:31:51,562][06909] Updated weights for policy 0, policy_version 61743 (0.0026) [2024-06-27 19:31:53,850][06674] Fps is (10 sec: 45902.6, 60 sec: 43963.8, 300 sec: 43764.7). Total num frames: 1011712000. Throughput: 0: 43653.2. Samples: 914673320. Policy #0 lag: (min: 1.0, avg: 10.2, max: 21.0) [2024-06-27 19:31:53,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 19:31:54,955][06909] Updated weights for policy 0, policy_version 61753 (0.0030) [2024-06-27 19:31:58,834][06909] Updated weights for policy 0, policy_version 61763 (0.0025) [2024-06-27 19:31:58,850][06674] Fps is (10 sec: 42607.3, 60 sec: 43963.7, 300 sec: 43820.3). Total num frames: 1011924992. Throughput: 0: 43681.8. Samples: 914802040. Policy #0 lag: (min: 1.0, avg: 10.2, max: 21.0) [2024-06-27 19:31:58,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:32:01,040][06887] Signal inference workers to stop experience collection... (13050 times) [2024-06-27 19:32:01,098][06887] Signal inference workers to resume experience collection... (13050 times) [2024-06-27 19:32:01,098][06909] InferenceWorker_p0-w0: stopping experience collection (13050 times) [2024-06-27 19:32:01,116][06909] InferenceWorker_p0-w0: resuming experience collection (13050 times) [2024-06-27 19:32:02,484][06909] Updated weights for policy 0, policy_version 61773 (0.0027) [2024-06-27 19:32:03,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.7, 300 sec: 43876.7). Total num frames: 1012154368. Throughput: 0: 43679.6. Samples: 915069900. Policy #0 lag: (min: 1.0, avg: 10.2, max: 21.0) [2024-06-27 19:32:03,851][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:32:06,153][06909] Updated weights for policy 0, policy_version 61783 (0.0040) [2024-06-27 19:32:08,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 43764.7). Total num frames: 1012367360. Throughput: 0: 43724.5. Samples: 915327800. Policy #0 lag: (min: 1.0, avg: 10.2, max: 21.0) [2024-06-27 19:32:08,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:32:09,892][06909] Updated weights for policy 0, policy_version 61793 (0.0041) [2024-06-27 19:32:13,734][06909] Updated weights for policy 0, policy_version 61803 (0.0029) [2024-06-27 19:32:13,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 1012580352. Throughput: 0: 43848.0. Samples: 915459760. Policy #0 lag: (min: 1.0, avg: 10.2, max: 21.0) [2024-06-27 19:32:13,855][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:32:17,223][06909] Updated weights for policy 0, policy_version 61813 (0.0031) [2024-06-27 19:32:18,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 1012809728. Throughput: 0: 43767.8. Samples: 915718500. Policy #0 lag: (min: 1.0, avg: 10.2, max: 21.0) [2024-06-27 19:32:18,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:32:21,552][06909] Updated weights for policy 0, policy_version 61823 (0.0024) [2024-06-27 19:32:23,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.8, 300 sec: 43764.7). Total num frames: 1013022720. Throughput: 0: 43681.0. Samples: 915984360. Policy #0 lag: (min: 1.0, avg: 10.2, max: 21.0) [2024-06-27 19:32:23,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:32:24,608][06909] Updated weights for policy 0, policy_version 61833 (0.0041) [2024-06-27 19:32:28,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 1013219328. Throughput: 0: 43650.4. Samples: 916108620. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 19:32:28,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:32:28,930][06909] Updated weights for policy 0, policy_version 61843 (0.0027) [2024-06-27 19:32:32,408][06909] Updated weights for policy 0, policy_version 61853 (0.0030) [2024-06-27 19:32:33,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43692.2, 300 sec: 43875.8). Total num frames: 1013465088. Throughput: 0: 43734.5. Samples: 916372940. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 19:32:33,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:32:36,346][06909] Updated weights for policy 0, policy_version 61863 (0.0033) [2024-06-27 19:32:38,850][06674] Fps is (10 sec: 45874.4, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 1013678080. Throughput: 0: 43647.1. Samples: 916637440. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 19:32:38,851][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:32:40,053][06909] Updated weights for policy 0, policy_version 61873 (0.0031) [2024-06-27 19:32:43,823][06909] Updated weights for policy 0, policy_version 61883 (0.0035) [2024-06-27 19:32:43,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43968.1, 300 sec: 43764.7). Total num frames: 1013891072. Throughput: 0: 43535.9. Samples: 916761160. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 19:32:43,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:32:47,659][06909] Updated weights for policy 0, policy_version 61893 (0.0038) [2024-06-27 19:32:48,850][06674] Fps is (10 sec: 42599.2, 60 sec: 43419.1, 300 sec: 43820.3). Total num frames: 1014104064. Throughput: 0: 43525.9. Samples: 917028560. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 19:32:48,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:32:51,250][06909] Updated weights for policy 0, policy_version 61903 (0.0029) [2024-06-27 19:32:53,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.7, 300 sec: 43709.5). Total num frames: 1014333440. Throughput: 0: 43529.7. Samples: 917286640. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 19:32:53,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:32:55,033][06909] Updated weights for policy 0, policy_version 61913 (0.0036) [2024-06-27 19:32:58,726][06909] Updated weights for policy 0, policy_version 61923 (0.0031) [2024-06-27 19:32:58,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 1014546432. Throughput: 0: 43467.0. Samples: 917415780. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 19:32:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:33:02,478][06909] Updated weights for policy 0, policy_version 61933 (0.0034) [2024-06-27 19:33:03,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43417.7, 300 sec: 43764.7). Total num frames: 1014759424. Throughput: 0: 43644.1. Samples: 917682480. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-27 19:33:03,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:33:06,212][06909] Updated weights for policy 0, policy_version 61943 (0.0040) [2024-06-27 19:33:08,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 1014988800. Throughput: 0: 43522.6. Samples: 917942880. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-27 19:33:08,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 19:33:09,934][06909] Updated weights for policy 0, policy_version 61953 (0.0031) [2024-06-27 19:33:13,725][06909] Updated weights for policy 0, policy_version 61963 (0.0033) [2024-06-27 19:33:13,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43690.7, 300 sec: 43709.5). Total num frames: 1015201792. Throughput: 0: 43707.5. Samples: 918075460. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-27 19:33:13,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:33:17,347][06909] Updated weights for policy 0, policy_version 61973 (0.0039) [2024-06-27 19:33:18,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43417.6, 300 sec: 43765.6). Total num frames: 1015414784. Throughput: 0: 43621.8. Samples: 918335920. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-27 19:33:18,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:33:20,978][06909] Updated weights for policy 0, policy_version 61983 (0.0045) [2024-06-27 19:33:23,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43417.5, 300 sec: 43653.7). Total num frames: 1015627776. Throughput: 0: 43644.5. Samples: 918601440. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-27 19:33:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 19:33:25,008][06909] Updated weights for policy 0, policy_version 61993 (0.0035) [2024-06-27 19:33:28,582][06909] Updated weights for policy 0, policy_version 62003 (0.0048) [2024-06-27 19:33:28,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.7, 300 sec: 43653.7). Total num frames: 1015857152. Throughput: 0: 43711.6. Samples: 918728180. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-27 19:33:28,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:33:30,008][06887] Signal inference workers to stop experience collection... (13100 times) [2024-06-27 19:33:30,009][06887] Signal inference workers to resume experience collection... (13100 times) [2024-06-27 19:33:30,051][06909] InferenceWorker_p0-w0: stopping experience collection (13100 times) [2024-06-27 19:33:30,051][06909] InferenceWorker_p0-w0: resuming experience collection (13100 times) [2024-06-27 19:33:32,458][06909] Updated weights for policy 0, policy_version 62013 (0.0038) [2024-06-27 19:33:33,856][06674] Fps is (10 sec: 45847.3, 60 sec: 43686.2, 300 sec: 43819.4). Total num frames: 1016086528. Throughput: 0: 43743.8. Samples: 918997300. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-27 19:33:33,857][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:33:36,037][06909] Updated weights for policy 0, policy_version 62023 (0.0042) [2024-06-27 19:33:38,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43690.8, 300 sec: 43709.2). Total num frames: 1016299520. Throughput: 0: 43888.5. Samples: 919261620. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-27 19:33:38,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:33:39,749][06909] Updated weights for policy 0, policy_version 62033 (0.0040) [2024-06-27 19:33:43,559][06909] Updated weights for policy 0, policy_version 62043 (0.0028) [2024-06-27 19:33:43,850][06674] Fps is (10 sec: 42624.6, 60 sec: 43690.8, 300 sec: 43653.6). Total num frames: 1016512512. Throughput: 0: 43885.5. Samples: 919390620. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-27 19:33:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:33:47,005][06909] Updated weights for policy 0, policy_version 62053 (0.0031) [2024-06-27 19:33:48,850][06674] Fps is (10 sec: 44235.4, 60 sec: 43963.5, 300 sec: 43820.2). Total num frames: 1016741888. Throughput: 0: 43756.6. Samples: 919651540. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-27 19:33:48,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 19:33:48,874][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000062057_1016741888.pth... [2024-06-27 19:33:48,936][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000061417_1006256128.pth [2024-06-27 19:33:51,111][06909] Updated weights for policy 0, policy_version 62063 (0.0035) [2024-06-27 19:33:53,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 1016954880. Throughput: 0: 43856.5. Samples: 919916420. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-27 19:33:53,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:33:54,357][06909] Updated weights for policy 0, policy_version 62073 (0.0029) [2024-06-27 19:33:58,411][06909] Updated weights for policy 0, policy_version 62083 (0.0031) [2024-06-27 19:33:58,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 1017167872. Throughput: 0: 43879.5. Samples: 920050040. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-27 19:33:58,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:34:02,344][06909] Updated weights for policy 0, policy_version 62093 (0.0042) [2024-06-27 19:34:03,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.6, 300 sec: 43820.3). Total num frames: 1017397248. Throughput: 0: 43879.0. Samples: 920310480. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-27 19:34:03,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:34:06,030][06909] Updated weights for policy 0, policy_version 62103 (0.0028) [2024-06-27 19:34:08,850][06674] Fps is (10 sec: 44237.7, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 1017610240. Throughput: 0: 43820.1. Samples: 920573340. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-27 19:34:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:34:09,623][06909] Updated weights for policy 0, policy_version 62113 (0.0047) [2024-06-27 19:34:13,417][06909] Updated weights for policy 0, policy_version 62123 (0.0031) [2024-06-27 19:34:13,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 1017823232. Throughput: 0: 43889.8. Samples: 920703220. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-27 19:34:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:34:17,263][06909] Updated weights for policy 0, policy_version 62133 (0.0031) [2024-06-27 19:34:18,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43963.6, 300 sec: 43820.2). Total num frames: 1018052608. Throughput: 0: 43794.3. Samples: 920967780. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-27 19:34:18,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:34:21,162][06909] Updated weights for policy 0, policy_version 62143 (0.0037) [2024-06-27 19:34:23,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.8, 300 sec: 43709.2). Total num frames: 1018265600. Throughput: 0: 43707.1. Samples: 921228440. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 19:34:23,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:34:24,720][06909] Updated weights for policy 0, policy_version 62153 (0.0025) [2024-06-27 19:34:28,717][06909] Updated weights for policy 0, policy_version 62163 (0.0031) [2024-06-27 19:34:28,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 1018478592. Throughput: 0: 43736.0. Samples: 921358740. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 19:34:28,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:34:31,918][06909] Updated weights for policy 0, policy_version 62173 (0.0039) [2024-06-27 19:34:33,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43695.1, 300 sec: 43709.2). Total num frames: 1018707968. Throughput: 0: 43784.7. Samples: 921621840. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 19:34:33,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:34:35,989][06909] Updated weights for policy 0, policy_version 62183 (0.0031) [2024-06-27 19:34:38,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 1018920960. Throughput: 0: 43844.0. Samples: 921889400. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 19:34:38,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:34:39,229][06909] Updated weights for policy 0, policy_version 62193 (0.0036) [2024-06-27 19:34:43,618][06909] Updated weights for policy 0, policy_version 62203 (0.0025) [2024-06-27 19:34:43,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.6, 300 sec: 43654.0). Total num frames: 1019133952. Throughput: 0: 43744.5. Samples: 922018540. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 19:34:43,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:34:47,040][06909] Updated weights for policy 0, policy_version 62213 (0.0026) [2024-06-27 19:34:48,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43690.8, 300 sec: 43764.7). Total num frames: 1019363328. Throughput: 0: 43675.1. Samples: 922275860. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 19:34:48,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:34:51,344][06909] Updated weights for policy 0, policy_version 62223 (0.0050) [2024-06-27 19:34:53,852][06674] Fps is (10 sec: 44228.0, 60 sec: 43689.2, 300 sec: 43708.9). Total num frames: 1019576320. Throughput: 0: 43666.4. Samples: 922538420. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 19:34:53,852][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:34:54,710][06909] Updated weights for policy 0, policy_version 62233 (0.0027) [2024-06-27 19:34:58,748][06909] Updated weights for policy 0, policy_version 62243 (0.0036) [2024-06-27 19:34:58,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.8, 300 sec: 43653.6). Total num frames: 1019789312. Throughput: 0: 43589.3. Samples: 922664740. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 19:34:58,850][06674] Avg episode reward: [(0, '0.432')] [2024-06-27 19:35:02,262][06909] Updated weights for policy 0, policy_version 62253 (0.0032) [2024-06-27 19:35:03,850][06674] Fps is (10 sec: 44245.9, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 1020018688. Throughput: 0: 43523.7. Samples: 922926340. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 19:35:03,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:35:05,572][06887] Signal inference workers to stop experience collection... (13150 times) [2024-06-27 19:35:05,607][06909] InferenceWorker_p0-w0: stopping experience collection (13150 times) [2024-06-27 19:35:05,627][06887] Signal inference workers to resume experience collection... (13150 times) [2024-06-27 19:35:05,627][06909] InferenceWorker_p0-w0: resuming experience collection (13150 times) [2024-06-27 19:35:06,379][06909] Updated weights for policy 0, policy_version 62263 (0.0039) [2024-06-27 19:35:08,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 1020231680. Throughput: 0: 43730.6. Samples: 923196320. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 19:35:08,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:35:09,602][06909] Updated weights for policy 0, policy_version 62273 (0.0033) [2024-06-27 19:35:13,633][06909] Updated weights for policy 0, policy_version 62283 (0.0033) [2024-06-27 19:35:13,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 1020444672. Throughput: 0: 43741.3. Samples: 923327100. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 19:35:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:35:16,937][06909] Updated weights for policy 0, policy_version 62293 (0.0030) [2024-06-27 19:35:18,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 1020674048. Throughput: 0: 43514.7. Samples: 923580000. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 19:35:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:35:21,221][06909] Updated weights for policy 0, policy_version 62303 (0.0035) [2024-06-27 19:35:23,856][06674] Fps is (10 sec: 44210.2, 60 sec: 43686.2, 300 sec: 43763.8). Total num frames: 1020887040. Throughput: 0: 43376.8. Samples: 923841620. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 19:35:23,856][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:35:24,531][06909] Updated weights for policy 0, policy_version 62313 (0.0037) [2024-06-27 19:35:28,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 1021083648. Throughput: 0: 43271.1. Samples: 923965740. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 19:35:28,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:35:28,930][06909] Updated weights for policy 0, policy_version 62323 (0.0030) [2024-06-27 19:35:32,359][06909] Updated weights for policy 0, policy_version 62333 (0.0028) [2024-06-27 19:35:33,850][06674] Fps is (10 sec: 42624.0, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 1021313024. Throughput: 0: 43328.0. Samples: 924225620. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 19:35:33,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:35:36,350][06909] Updated weights for policy 0, policy_version 62343 (0.0031) [2024-06-27 19:35:38,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 1021526016. Throughput: 0: 43349.1. Samples: 924489040. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 19:35:38,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:35:40,170][06909] Updated weights for policy 0, policy_version 62353 (0.0028) [2024-06-27 19:35:43,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43417.5, 300 sec: 43598.1). Total num frames: 1021739008. Throughput: 0: 43504.3. Samples: 924622440. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 19:35:43,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:35:43,863][06909] Updated weights for policy 0, policy_version 62363 (0.0041) [2024-06-27 19:35:47,500][06909] Updated weights for policy 0, policy_version 62373 (0.0034) [2024-06-27 19:35:48,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43144.6, 300 sec: 43653.7). Total num frames: 1021952000. Throughput: 0: 43587.2. Samples: 924887760. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 19:35:48,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:35:48,863][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000062376_1021968384.pth... [2024-06-27 19:35:48,906][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000061737_1011499008.pth [2024-06-27 19:35:51,173][06909] Updated weights for policy 0, policy_version 62383 (0.0040) [2024-06-27 19:35:53,850][06674] Fps is (10 sec: 45876.2, 60 sec: 43692.2, 300 sec: 43764.7). Total num frames: 1022197760. Throughput: 0: 43395.7. Samples: 925149120. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 19:35:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:35:54,923][06909] Updated weights for policy 0, policy_version 62393 (0.0031) [2024-06-27 19:35:58,592][06909] Updated weights for policy 0, policy_version 62403 (0.0025) [2024-06-27 19:35:58,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 1022410752. Throughput: 0: 43441.4. Samples: 925281960. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 19:35:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:36:02,264][06909] Updated weights for policy 0, policy_version 62413 (0.0041) [2024-06-27 19:36:03,851][06674] Fps is (10 sec: 44232.4, 60 sec: 43690.0, 300 sec: 43764.6). Total num frames: 1022640128. Throughput: 0: 43509.3. Samples: 925537960. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 19:36:03,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:36:05,989][06909] Updated weights for policy 0, policy_version 62423 (0.0035) [2024-06-27 19:36:08,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 1022836736. Throughput: 0: 43693.0. Samples: 925807540. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 19:36:08,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:36:09,744][06909] Updated weights for policy 0, policy_version 62433 (0.0040) [2024-06-27 19:36:13,740][06909] Updated weights for policy 0, policy_version 62443 (0.0026) [2024-06-27 19:36:13,850][06674] Fps is (10 sec: 42601.8, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 1023066112. Throughput: 0: 43710.6. Samples: 925932720. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 19:36:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 19:36:17,650][06909] Updated weights for policy 0, policy_version 62453 (0.0042) [2024-06-27 19:36:18,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43144.6, 300 sec: 43653.6). Total num frames: 1023262720. Throughput: 0: 43755.2. Samples: 926194600. Policy #0 lag: (min: 1.0, avg: 9.9, max: 21.0) [2024-06-27 19:36:18,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:36:21,340][06909] Updated weights for policy 0, policy_version 62463 (0.0038) [2024-06-27 19:36:23,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43421.9, 300 sec: 43709.2). Total num frames: 1023492096. Throughput: 0: 43720.3. Samples: 926456460. Policy #0 lag: (min: 1.0, avg: 9.9, max: 21.0) [2024-06-27 19:36:23,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:36:24,999][06909] Updated weights for policy 0, policy_version 62473 (0.0037) [2024-06-27 19:36:28,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43690.6, 300 sec: 43598.4). Total num frames: 1023705088. Throughput: 0: 43746.3. Samples: 926591020. Policy #0 lag: (min: 1.0, avg: 9.9, max: 21.0) [2024-06-27 19:36:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 19:36:29,135][06909] Updated weights for policy 0, policy_version 62483 (0.0033) [2024-06-27 19:36:32,676][06909] Updated weights for policy 0, policy_version 62493 (0.0030) [2024-06-27 19:36:33,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43417.6, 300 sec: 43653.6). Total num frames: 1023918080. Throughput: 0: 43637.2. Samples: 926851440. Policy #0 lag: (min: 1.0, avg: 9.9, max: 21.0) [2024-06-27 19:36:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:36:36,484][06909] Updated weights for policy 0, policy_version 62503 (0.0033) [2024-06-27 19:36:38,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.7, 300 sec: 43765.6). Total num frames: 1024163840. Throughput: 0: 43592.8. Samples: 927110800. Policy #0 lag: (min: 1.0, avg: 9.9, max: 21.0) [2024-06-27 19:36:38,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:36:40,186][06909] Updated weights for policy 0, policy_version 62513 (0.0034) [2024-06-27 19:36:43,856][06909] Updated weights for policy 0, policy_version 62523 (0.0034) [2024-06-27 19:36:43,856][06674] Fps is (10 sec: 45847.6, 60 sec: 43959.4, 300 sec: 43653.0). Total num frames: 1024376832. Throughput: 0: 43532.8. Samples: 927241200. Policy #0 lag: (min: 1.0, avg: 9.9, max: 21.0) [2024-06-27 19:36:43,856][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:36:47,491][06887] Signal inference workers to stop experience collection... (13200 times) [2024-06-27 19:36:47,539][06909] InferenceWorker_p0-w0: stopping experience collection (13200 times) [2024-06-27 19:36:47,609][06887] Signal inference workers to resume experience collection... (13200 times) [2024-06-27 19:36:47,609][06909] InferenceWorker_p0-w0: resuming experience collection (13200 times) [2024-06-27 19:36:47,611][06909] Updated weights for policy 0, policy_version 62533 (0.0029) [2024-06-27 19:36:48,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 1024573440. Throughput: 0: 43654.2. Samples: 927502360. Policy #0 lag: (min: 1.0, avg: 9.9, max: 21.0) [2024-06-27 19:36:48,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:36:51,726][06909] Updated weights for policy 0, policy_version 62543 (0.0032) [2024-06-27 19:36:53,850][06674] Fps is (10 sec: 40984.3, 60 sec: 43144.4, 300 sec: 43598.1). Total num frames: 1024786432. Throughput: 0: 43446.1. Samples: 927762620. Policy #0 lag: (min: 1.0, avg: 9.9, max: 21.0) [2024-06-27 19:36:53,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:36:55,430][06909] Updated weights for policy 0, policy_version 62553 (0.0028) [2024-06-27 19:36:58,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 1025015808. Throughput: 0: 43679.7. Samples: 927898300. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-27 19:36:58,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:36:58,910][06909] Updated weights for policy 0, policy_version 62563 (0.0028) [2024-06-27 19:37:02,700][06909] Updated weights for policy 0, policy_version 62573 (0.0044) [2024-06-27 19:37:03,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43145.2, 300 sec: 43598.1). Total num frames: 1025228800. Throughput: 0: 43745.3. Samples: 928163140. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-27 19:37:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 19:37:06,363][06909] Updated weights for policy 0, policy_version 62583 (0.0032) [2024-06-27 19:37:08,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 1025458176. Throughput: 0: 43736.1. Samples: 928424580. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-27 19:37:08,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:37:10,062][06909] Updated weights for policy 0, policy_version 62593 (0.0024) [2024-06-27 19:37:13,717][06909] Updated weights for policy 0, policy_version 62603 (0.0035) [2024-06-27 19:37:13,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 1025687552. Throughput: 0: 43749.8. Samples: 928559760. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-27 19:37:13,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:37:17,783][06909] Updated weights for policy 0, policy_version 62613 (0.0026) [2024-06-27 19:37:18,851][06674] Fps is (10 sec: 42594.2, 60 sec: 43689.9, 300 sec: 43597.9). Total num frames: 1025884160. Throughput: 0: 43744.8. Samples: 928820000. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-27 19:37:18,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:37:21,334][06909] Updated weights for policy 0, policy_version 62623 (0.0034) [2024-06-27 19:37:23,853][06674] Fps is (10 sec: 42584.1, 60 sec: 43688.3, 300 sec: 43708.7). Total num frames: 1026113536. Throughput: 0: 43721.2. Samples: 929078400. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-27 19:37:23,854][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:37:25,148][06909] Updated weights for policy 0, policy_version 62633 (0.0035) [2024-06-27 19:37:28,689][06909] Updated weights for policy 0, policy_version 62643 (0.0028) [2024-06-27 19:37:28,850][06674] Fps is (10 sec: 45880.0, 60 sec: 43963.8, 300 sec: 43653.6). Total num frames: 1026342912. Throughput: 0: 43857.5. Samples: 929214520. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-27 19:37:28,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:37:32,679][06909] Updated weights for policy 0, policy_version 62653 (0.0034) [2024-06-27 19:37:33,850][06674] Fps is (10 sec: 44251.7, 60 sec: 43963.7, 300 sec: 43653.7). Total num frames: 1026555904. Throughput: 0: 43891.6. Samples: 929477480. Policy #0 lag: (min: 1.0, avg: 11.5, max: 23.0) [2024-06-27 19:37:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 19:37:36,148][06909] Updated weights for policy 0, policy_version 62663 (0.0035) [2024-06-27 19:37:38,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43417.7, 300 sec: 43653.7). Total num frames: 1026768896. Throughput: 0: 43998.0. Samples: 929742520. Policy #0 lag: (min: 1.0, avg: 11.5, max: 23.0) [2024-06-27 19:37:38,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:37:40,062][06909] Updated weights for policy 0, policy_version 62673 (0.0027) [2024-06-27 19:37:43,558][06909] Updated weights for policy 0, policy_version 62683 (0.0034) [2024-06-27 19:37:43,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43695.1, 300 sec: 43709.2). Total num frames: 1026998272. Throughput: 0: 43907.1. Samples: 929874120. Policy #0 lag: (min: 1.0, avg: 11.5, max: 23.0) [2024-06-27 19:37:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:37:47,511][06909] Updated weights for policy 0, policy_version 62693 (0.0028) [2024-06-27 19:37:48,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.8, 300 sec: 43653.6). Total num frames: 1027211264. Throughput: 0: 43910.2. Samples: 930139100. Policy #0 lag: (min: 1.0, avg: 11.5, max: 23.0) [2024-06-27 19:37:48,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:37:48,906][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000062697_1027227648.pth... [2024-06-27 19:37:48,959][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000062057_1016741888.pth [2024-06-27 19:37:51,108][06909] Updated weights for policy 0, policy_version 62703 (0.0032) [2024-06-27 19:37:53,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 1027407872. Throughput: 0: 43842.2. Samples: 930397480. Policy #0 lag: (min: 1.0, avg: 11.5, max: 23.0) [2024-06-27 19:37:53,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:37:55,273][06909] Updated weights for policy 0, policy_version 62713 (0.0034) [2024-06-27 19:37:58,689][06909] Updated weights for policy 0, policy_version 62723 (0.0037) [2024-06-27 19:37:58,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.7, 300 sec: 43764.7). Total num frames: 1027670016. Throughput: 0: 43730.6. Samples: 930527640. Policy #0 lag: (min: 1.0, avg: 11.5, max: 23.0) [2024-06-27 19:37:58,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:38:02,755][06909] Updated weights for policy 0, policy_version 62733 (0.0021) [2024-06-27 19:38:03,850][06674] Fps is (10 sec: 47513.2, 60 sec: 44236.7, 300 sec: 43709.2). Total num frames: 1027883008. Throughput: 0: 43913.8. Samples: 930796080. Policy #0 lag: (min: 1.0, avg: 11.5, max: 23.0) [2024-06-27 19:38:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:38:04,255][06887] Signal inference workers to stop experience collection... (13250 times) [2024-06-27 19:38:04,255][06887] Signal inference workers to resume experience collection... (13250 times) [2024-06-27 19:38:04,278][06909] InferenceWorker_p0-w0: stopping experience collection (13250 times) [2024-06-27 19:38:04,278][06909] InferenceWorker_p0-w0: resuming experience collection (13250 times) [2024-06-27 19:38:06,134][06909] Updated weights for policy 0, policy_version 62743 (0.0029) [2024-06-27 19:38:08,850][06674] Fps is (10 sec: 39321.2, 60 sec: 43417.5, 300 sec: 43598.1). Total num frames: 1028063232. Throughput: 0: 44086.7. Samples: 931062160. Policy #0 lag: (min: 1.0, avg: 11.5, max: 23.0) [2024-06-27 19:38:08,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:38:10,350][06909] Updated weights for policy 0, policy_version 62753 (0.0040) [2024-06-27 19:38:13,512][06909] Updated weights for policy 0, policy_version 62763 (0.0032) [2024-06-27 19:38:13,856][06674] Fps is (10 sec: 42573.2, 60 sec: 43686.3, 300 sec: 43708.3). Total num frames: 1028308992. Throughput: 0: 43787.0. Samples: 931185200. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-27 19:38:13,856][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 19:38:17,869][06909] Updated weights for policy 0, policy_version 62773 (0.0027) [2024-06-27 19:38:18,850][06674] Fps is (10 sec: 47514.5, 60 sec: 44237.6, 300 sec: 43764.7). Total num frames: 1028538368. Throughput: 0: 43838.7. Samples: 931450220. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-27 19:38:18,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:38:20,836][06909] Updated weights for policy 0, policy_version 62783 (0.0031) [2024-06-27 19:38:23,850][06674] Fps is (10 sec: 42624.0, 60 sec: 43693.1, 300 sec: 43653.6). Total num frames: 1028734976. Throughput: 0: 43715.9. Samples: 931709740. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-27 19:38:23,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:38:25,595][06909] Updated weights for policy 0, policy_version 62793 (0.0039) [2024-06-27 19:38:28,404][06909] Updated weights for policy 0, policy_version 62803 (0.0029) [2024-06-27 19:38:28,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 43654.6). Total num frames: 1028964352. Throughput: 0: 43650.2. Samples: 931838380. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-27 19:38:28,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:38:32,918][06909] Updated weights for policy 0, policy_version 62813 (0.0031) [2024-06-27 19:38:33,850][06674] Fps is (10 sec: 45875.9, 60 sec: 43963.8, 300 sec: 43709.2). Total num frames: 1029193728. Throughput: 0: 43625.0. Samples: 932102220. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-27 19:38:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:38:36,145][06909] Updated weights for policy 0, policy_version 62823 (0.0022) [2024-06-27 19:38:38,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 1029390336. Throughput: 0: 43734.3. Samples: 932365520. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-27 19:38:38,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:38:40,199][06909] Updated weights for policy 0, policy_version 62833 (0.0029) [2024-06-27 19:38:43,498][06909] Updated weights for policy 0, policy_version 62843 (0.0029) [2024-06-27 19:38:43,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43690.6, 300 sec: 43653.7). Total num frames: 1029619712. Throughput: 0: 43658.8. Samples: 932492280. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-27 19:38:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:38:47,936][06909] Updated weights for policy 0, policy_version 62853 (0.0032) [2024-06-27 19:38:48,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 1029832704. Throughput: 0: 43535.1. Samples: 932755160. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-27 19:38:48,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:38:51,032][06909] Updated weights for policy 0, policy_version 62863 (0.0043) [2024-06-27 19:38:53,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 1030029312. Throughput: 0: 43358.9. Samples: 933013300. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-27 19:38:53,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:38:55,461][06909] Updated weights for policy 0, policy_version 62873 (0.0026) [2024-06-27 19:38:58,488][06909] Updated weights for policy 0, policy_version 62883 (0.0024) [2024-06-27 19:38:58,850][06674] Fps is (10 sec: 44237.6, 60 sec: 43417.7, 300 sec: 43653.7). Total num frames: 1030275072. Throughput: 0: 43520.1. Samples: 933143340. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-27 19:38:58,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:39:03,302][06909] Updated weights for policy 0, policy_version 62893 (0.0031) [2024-06-27 19:39:03,852][06674] Fps is (10 sec: 44227.3, 60 sec: 43143.1, 300 sec: 43597.8). Total num frames: 1030471680. Throughput: 0: 43524.2. Samples: 933408900. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-27 19:39:03,852][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:39:06,402][06909] Updated weights for policy 0, policy_version 62903 (0.0030) [2024-06-27 19:39:08,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.9, 300 sec: 43653.6). Total num frames: 1030701056. Throughput: 0: 43397.8. Samples: 933662640. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-27 19:39:08,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:39:10,554][06909] Updated weights for policy 0, policy_version 62913 (0.0035) [2024-06-27 19:39:13,843][06909] Updated weights for policy 0, policy_version 62923 (0.0032) [2024-06-27 19:39:13,850][06674] Fps is (10 sec: 45884.6, 60 sec: 43695.0, 300 sec: 43653.6). Total num frames: 1030930432. Throughput: 0: 43519.9. Samples: 933796780. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-27 19:39:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:39:17,673][06887] Signal inference workers to stop experience collection... (13300 times) [2024-06-27 19:39:17,673][06887] Signal inference workers to resume experience collection... (13300 times) [2024-06-27 19:39:17,706][06909] InferenceWorker_p0-w0: stopping experience collection (13300 times) [2024-06-27 19:39:17,706][06909] InferenceWorker_p0-w0: resuming experience collection (13300 times) [2024-06-27 19:39:18,180][06909] Updated weights for policy 0, policy_version 62933 (0.0026) [2024-06-27 19:39:18,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43144.5, 300 sec: 43598.1). Total num frames: 1031127040. Throughput: 0: 43485.6. Samples: 934059080. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-27 19:39:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 19:39:21,463][06909] Updated weights for policy 0, policy_version 62943 (0.0035) [2024-06-27 19:39:23,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 1031340032. Throughput: 0: 43335.9. Samples: 934315640. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-27 19:39:23,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:39:25,572][06909] Updated weights for policy 0, policy_version 62953 (0.0034) [2024-06-27 19:39:28,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43417.5, 300 sec: 43598.1). Total num frames: 1031569408. Throughput: 0: 43446.1. Samples: 934447360. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-27 19:39:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:39:29,057][06909] Updated weights for policy 0, policy_version 62963 (0.0032) [2024-06-27 19:39:32,893][06909] Updated weights for policy 0, policy_version 62973 (0.0037) [2024-06-27 19:39:33,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43144.4, 300 sec: 43598.1). Total num frames: 1031782400. Throughput: 0: 43445.0. Samples: 934710180. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-27 19:39:33,859][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 19:39:36,393][06909] Updated weights for policy 0, policy_version 62983 (0.0035) [2024-06-27 19:39:38,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43417.5, 300 sec: 43598.1). Total num frames: 1031995392. Throughput: 0: 43439.0. Samples: 934968060. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-27 19:39:38,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:39:40,723][06909] Updated weights for policy 0, policy_version 62993 (0.0043) [2024-06-27 19:39:43,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 1032224768. Throughput: 0: 43614.2. Samples: 935105980. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-27 19:39:43,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:39:43,883][06909] Updated weights for policy 0, policy_version 63003 (0.0048) [2024-06-27 19:39:47,977][06909] Updated weights for policy 0, policy_version 63013 (0.0039) [2024-06-27 19:39:48,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43417.7, 300 sec: 43598.4). Total num frames: 1032437760. Throughput: 0: 43466.0. Samples: 935364780. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-27 19:39:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 19:39:48,863][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000063015_1032437760.pth... [2024-06-27 19:39:48,925][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000062376_1021968384.pth [2024-06-27 19:39:51,601][06909] Updated weights for policy 0, policy_version 63023 (0.0036) [2024-06-27 19:39:53,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 1032650752. Throughput: 0: 43571.0. Samples: 935623340. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-27 19:39:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 19:39:55,640][06909] Updated weights for policy 0, policy_version 63033 (0.0040) [2024-06-27 19:39:58,856][06674] Fps is (10 sec: 40935.0, 60 sec: 42867.1, 300 sec: 43486.1). Total num frames: 1032847360. Throughput: 0: 43555.5. Samples: 935757040. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-27 19:39:58,856][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:39:59,146][06909] Updated weights for policy 0, policy_version 63043 (0.0025) [2024-06-27 19:40:03,094][06909] Updated weights for policy 0, policy_version 63053 (0.0028) [2024-06-27 19:40:03,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43692.2, 300 sec: 43598.1). Total num frames: 1033093120. Throughput: 0: 43542.7. Samples: 936018500. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-27 19:40:03,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:40:06,469][06909] Updated weights for policy 0, policy_version 63063 (0.0021) [2024-06-27 19:40:08,850][06674] Fps is (10 sec: 47542.3, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 1033322496. Throughput: 0: 43505.8. Samples: 936273400. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-27 19:40:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:40:10,626][06909] Updated weights for policy 0, policy_version 63073 (0.0023) [2024-06-27 19:40:13,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 1033535488. Throughput: 0: 43503.2. Samples: 936405000. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-27 19:40:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:40:14,241][06909] Updated weights for policy 0, policy_version 63083 (0.0026) [2024-06-27 19:40:17,896][06909] Updated weights for policy 0, policy_version 63093 (0.0030) [2024-06-27 19:40:18,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.6, 300 sec: 43599.0). Total num frames: 1033748480. Throughput: 0: 43655.4. Samples: 936674680. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-27 19:40:18,850][06674] Avg episode reward: [(0, '0.397')] [2024-06-27 19:40:21,562][06909] Updated weights for policy 0, policy_version 63103 (0.0032) [2024-06-27 19:40:23,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43690.8, 300 sec: 43653.7). Total num frames: 1033961472. Throughput: 0: 43611.7. Samples: 936930580. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-27 19:40:23,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:40:25,055][06887] Signal inference workers to stop experience collection... (13350 times) [2024-06-27 19:40:25,058][06887] Signal inference workers to resume experience collection... (13350 times) [2024-06-27 19:40:25,098][06909] InferenceWorker_p0-w0: stopping experience collection (13350 times) [2024-06-27 19:40:25,098][06909] InferenceWorker_p0-w0: resuming experience collection (13350 times) [2024-06-27 19:40:25,396][06909] Updated weights for policy 0, policy_version 63113 (0.0028) [2024-06-27 19:40:28,850][06674] Fps is (10 sec: 44237.6, 60 sec: 43690.7, 300 sec: 43653.7). Total num frames: 1034190848. Throughput: 0: 43587.1. Samples: 937067400. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-27 19:40:28,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:40:29,011][06909] Updated weights for policy 0, policy_version 63123 (0.0028) [2024-06-27 19:40:33,082][06909] Updated weights for policy 0, policy_version 63133 (0.0032) [2024-06-27 19:40:33,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 1034403840. Throughput: 0: 43641.3. Samples: 937328640. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-27 19:40:33,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:40:36,522][06909] Updated weights for policy 0, policy_version 63143 (0.0026) [2024-06-27 19:40:38,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.7, 300 sec: 43653.7). Total num frames: 1034616832. Throughput: 0: 43457.4. Samples: 937578920. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-27 19:40:38,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:40:40,618][06909] Updated weights for policy 0, policy_version 63153 (0.0044) [2024-06-27 19:40:43,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43417.6, 300 sec: 43653.6). Total num frames: 1034829824. Throughput: 0: 43441.0. Samples: 937711620. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-27 19:40:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:40:44,157][06909] Updated weights for policy 0, policy_version 63163 (0.0027) [2024-06-27 19:40:48,372][06909] Updated weights for policy 0, policy_version 63173 (0.0037) [2024-06-27 19:40:48,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43417.5, 300 sec: 43542.5). Total num frames: 1035042816. Throughput: 0: 43535.5. Samples: 937977600. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-27 19:40:48,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:40:51,656][06909] Updated weights for policy 0, policy_version 63183 (0.0040) [2024-06-27 19:40:53,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 1035272192. Throughput: 0: 43485.8. Samples: 938230260. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-27 19:40:53,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:40:55,744][06909] Updated weights for policy 0, policy_version 63193 (0.0024) [2024-06-27 19:40:58,852][06674] Fps is (10 sec: 45866.4, 60 sec: 44239.8, 300 sec: 43597.9). Total num frames: 1035501568. Throughput: 0: 43672.2. Samples: 938370340. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-27 19:40:58,861][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:40:59,403][06909] Updated weights for policy 0, policy_version 63203 (0.0031) [2024-06-27 19:41:02,954][06909] Updated weights for policy 0, policy_version 63213 (0.0035) [2024-06-27 19:41:03,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43690.8, 300 sec: 43653.7). Total num frames: 1035714560. Throughput: 0: 43580.7. Samples: 938635800. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-27 19:41:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:41:06,701][06909] Updated weights for policy 0, policy_version 63223 (0.0035) [2024-06-27 19:41:08,850][06674] Fps is (10 sec: 42607.0, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 1035927552. Throughput: 0: 43523.9. Samples: 938889160. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-27 19:41:08,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:41:10,460][06909] Updated weights for policy 0, policy_version 63233 (0.0037) [2024-06-27 19:41:13,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 1036156928. Throughput: 0: 43394.3. Samples: 939020140. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-27 19:41:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:41:14,109][06909] Updated weights for policy 0, policy_version 63243 (0.0034) [2024-06-27 19:41:18,152][06909] Updated weights for policy 0, policy_version 63253 (0.0028) [2024-06-27 19:41:18,852][06674] Fps is (10 sec: 42589.7, 60 sec: 43416.2, 300 sec: 43597.8). Total num frames: 1036353536. Throughput: 0: 43476.6. Samples: 939285180. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-27 19:41:18,852][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 19:41:21,771][06909] Updated weights for policy 0, policy_version 63263 (0.0031) [2024-06-27 19:41:23,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 1036582912. Throughput: 0: 43560.9. Samples: 939539160. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-27 19:41:23,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:41:25,810][06909] Updated weights for policy 0, policy_version 63273 (0.0035) [2024-06-27 19:41:28,850][06674] Fps is (10 sec: 42607.5, 60 sec: 43144.6, 300 sec: 43598.1). Total num frames: 1036779520. Throughput: 0: 43630.2. Samples: 939674980. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-27 19:41:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:41:29,346][06909] Updated weights for policy 0, policy_version 63283 (0.0034) [2024-06-27 19:41:33,402][06909] Updated weights for policy 0, policy_version 63293 (0.0034) [2024-06-27 19:41:33,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43417.6, 300 sec: 43542.6). Total num frames: 1037008896. Throughput: 0: 43474.8. Samples: 939933960. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-27 19:41:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 19:41:37,044][06909] Updated weights for policy 0, policy_version 63303 (0.0031) [2024-06-27 19:41:38,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43690.6, 300 sec: 43599.0). Total num frames: 1037238272. Throughput: 0: 43597.8. Samples: 940192160. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-27 19:41:38,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:41:40,906][06909] Updated weights for policy 0, policy_version 63313 (0.0032) [2024-06-27 19:41:43,852][06674] Fps is (10 sec: 44227.9, 60 sec: 43689.2, 300 sec: 43653.3). Total num frames: 1037451264. Throughput: 0: 43483.6. Samples: 940327100. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-27 19:41:43,852][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:41:44,338][06909] Updated weights for policy 0, policy_version 63323 (0.0035) [2024-06-27 19:41:48,206][06909] Updated weights for policy 0, policy_version 63333 (0.0033) [2024-06-27 19:41:48,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.8, 300 sec: 43709.2). Total num frames: 1037680640. Throughput: 0: 43497.7. Samples: 940593200. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-27 19:41:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 19:41:48,878][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000063335_1037680640.pth... [2024-06-27 19:41:48,935][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000062697_1027227648.pth [2024-06-27 19:41:49,501][06887] Signal inference workers to stop experience collection... (13400 times) [2024-06-27 19:41:49,547][06909] InferenceWorker_p0-w0: stopping experience collection (13400 times) [2024-06-27 19:41:49,611][06887] Signal inference workers to resume experience collection... (13400 times) [2024-06-27 19:41:49,612][06909] InferenceWorker_p0-w0: resuming experience collection (13400 times) [2024-06-27 19:41:51,889][06909] Updated weights for policy 0, policy_version 63343 (0.0031) [2024-06-27 19:41:53,850][06674] Fps is (10 sec: 44246.1, 60 sec: 43690.8, 300 sec: 43653.6). Total num frames: 1037893632. Throughput: 0: 43558.8. Samples: 940849300. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-27 19:41:53,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:41:55,489][06909] Updated weights for policy 0, policy_version 63353 (0.0038) [2024-06-27 19:41:58,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43419.1, 300 sec: 43653.7). Total num frames: 1038106624. Throughput: 0: 43752.0. Samples: 940988980. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-27 19:41:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:41:59,113][06909] Updated weights for policy 0, policy_version 63363 (0.0030) [2024-06-27 19:42:03,237][06909] Updated weights for policy 0, policy_version 63373 (0.0029) [2024-06-27 19:42:03,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43417.5, 300 sec: 43598.1). Total num frames: 1038319616. Throughput: 0: 43629.6. Samples: 941248420. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-27 19:42:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:42:06,576][06909] Updated weights for policy 0, policy_version 63383 (0.0049) [2024-06-27 19:42:08,850][06674] Fps is (10 sec: 44235.9, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 1038548992. Throughput: 0: 43784.8. Samples: 941509480. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-27 19:42:08,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:42:10,525][06909] Updated weights for policy 0, policy_version 63393 (0.0046) [2024-06-27 19:42:13,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43417.5, 300 sec: 43653.8). Total num frames: 1038761984. Throughput: 0: 43705.7. Samples: 941641740. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-27 19:42:13,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:42:14,441][06909] Updated weights for policy 0, policy_version 63403 (0.0026) [2024-06-27 19:42:17,853][06909] Updated weights for policy 0, policy_version 63413 (0.0026) [2024-06-27 19:42:18,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43692.1, 300 sec: 43598.6). Total num frames: 1038974976. Throughput: 0: 43781.8. Samples: 941904140. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-27 19:42:18,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:42:21,844][06909] Updated weights for policy 0, policy_version 63423 (0.0027) [2024-06-27 19:42:23,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43417.6, 300 sec: 43542.6). Total num frames: 1039187968. Throughput: 0: 43936.1. Samples: 942169280. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-27 19:42:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 19:42:25,461][06909] Updated weights for policy 0, policy_version 63433 (0.0031) [2024-06-27 19:42:28,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.7, 300 sec: 43653.6). Total num frames: 1039433728. Throughput: 0: 43906.8. Samples: 942302820. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-27 19:42:28,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:42:29,086][06909] Updated weights for policy 0, policy_version 63443 (0.0037) [2024-06-27 19:42:32,766][06909] Updated weights for policy 0, policy_version 63453 (0.0033) [2024-06-27 19:42:33,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.7, 300 sec: 43653.6). Total num frames: 1039646720. Throughput: 0: 43772.9. Samples: 942562980. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-27 19:42:33,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:42:36,596][06909] Updated weights for policy 0, policy_version 63463 (0.0021) [2024-06-27 19:42:38,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 1039859712. Throughput: 0: 43927.5. Samples: 942826040. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-27 19:42:38,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:42:40,346][06909] Updated weights for policy 0, policy_version 63473 (0.0029) [2024-06-27 19:42:43,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43692.1, 300 sec: 43598.1). Total num frames: 1040072704. Throughput: 0: 43883.9. Samples: 942963760. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-27 19:42:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:42:44,213][06909] Updated weights for policy 0, policy_version 63483 (0.0037) [2024-06-27 19:42:47,826][06909] Updated weights for policy 0, policy_version 63493 (0.0030) [2024-06-27 19:42:48,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43144.6, 300 sec: 43598.1). Total num frames: 1040269312. Throughput: 0: 43727.1. Samples: 943216140. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-27 19:42:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 19:42:52,116][06909] Updated weights for policy 0, policy_version 63503 (0.0033) [2024-06-27 19:42:53,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.7, 300 sec: 43598.1). Total num frames: 1040531456. Throughput: 0: 43669.9. Samples: 943474620. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-27 19:42:53,853][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:42:55,587][06909] Updated weights for policy 0, policy_version 63513 (0.0032) [2024-06-27 19:42:58,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43690.6, 300 sec: 43542.6). Total num frames: 1040728064. Throughput: 0: 43815.1. Samples: 943613420. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-27 19:42:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 19:42:59,385][06909] Updated weights for policy 0, policy_version 63523 (0.0039) [2024-06-27 19:43:02,905][06909] Updated weights for policy 0, policy_version 63533 (0.0045) [2024-06-27 19:43:03,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 1040957440. Throughput: 0: 43760.4. Samples: 943873360. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-27 19:43:03,853][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 19:43:06,807][06909] Updated weights for policy 0, policy_version 63543 (0.0031) [2024-06-27 19:43:08,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.8, 300 sec: 43599.0). Total num frames: 1041170432. Throughput: 0: 43738.2. Samples: 944137500. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-27 19:43:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 19:43:10,190][06909] Updated weights for policy 0, policy_version 63553 (0.0048) [2024-06-27 19:43:13,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43690.7, 300 sec: 43542.6). Total num frames: 1041383424. Throughput: 0: 43662.3. Samples: 944267620. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-27 19:43:13,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:43:14,100][06909] Updated weights for policy 0, policy_version 63563 (0.0047) [2024-06-27 19:43:17,724][06909] Updated weights for policy 0, policy_version 63573 (0.0031) [2024-06-27 19:43:18,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 1041596416. Throughput: 0: 43713.3. Samples: 944530080. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-27 19:43:18,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:43:21,636][06909] Updated weights for policy 0, policy_version 63583 (0.0034) [2024-06-27 19:43:23,850][06674] Fps is (10 sec: 45874.5, 60 sec: 44236.7, 300 sec: 43653.6). Total num frames: 1041842176. Throughput: 0: 43611.4. Samples: 944788560. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 19:43:23,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:43:25,056][06909] Updated weights for policy 0, policy_version 63593 (0.0044) [2024-06-27 19:43:28,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43417.7, 300 sec: 43542.6). Total num frames: 1042038784. Throughput: 0: 43701.4. Samples: 944930320. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 19:43:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:43:29,176][06909] Updated weights for policy 0, policy_version 63603 (0.0039) [2024-06-27 19:43:31,143][06887] Signal inference workers to stop experience collection... (13450 times) [2024-06-27 19:43:31,148][06887] Signal inference workers to resume experience collection... (13450 times) [2024-06-27 19:43:31,189][06909] InferenceWorker_p0-w0: stopping experience collection (13450 times) [2024-06-27 19:43:31,190][06909] InferenceWorker_p0-w0: resuming experience collection (13450 times) [2024-06-27 19:43:33,336][06909] Updated weights for policy 0, policy_version 63613 (0.0037) [2024-06-27 19:43:33,850][06674] Fps is (10 sec: 39321.6, 60 sec: 43144.5, 300 sec: 43542.5). Total num frames: 1042235392. Throughput: 0: 43633.7. Samples: 945179660. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 19:43:33,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:43:36,706][06909] Updated weights for policy 0, policy_version 63623 (0.0021) [2024-06-27 19:43:38,850][06674] Fps is (10 sec: 47512.9, 60 sec: 44236.7, 300 sec: 43709.2). Total num frames: 1042513920. Throughput: 0: 43778.6. Samples: 945444660. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 19:43:38,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:43:40,642][06909] Updated weights for policy 0, policy_version 63633 (0.0028) [2024-06-27 19:43:43,850][06674] Fps is (10 sec: 45876.0, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 1042694144. Throughput: 0: 43797.0. Samples: 945584280. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 19:43:43,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:43:44,058][06909] Updated weights for policy 0, policy_version 63643 (0.0024) [2024-06-27 19:43:48,082][06909] Updated weights for policy 0, policy_version 63653 (0.0041) [2024-06-27 19:43:48,854][06674] Fps is (10 sec: 40944.8, 60 sec: 44234.0, 300 sec: 43708.6). Total num frames: 1042923520. Throughput: 0: 43712.4. Samples: 945840580. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 19:43:48,854][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:43:48,863][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000063655_1042923520.pth... [2024-06-27 19:43:48,904][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000063015_1032437760.pth [2024-06-27 19:43:52,095][06909] Updated weights for policy 0, policy_version 63663 (0.0042) [2024-06-27 19:43:53,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 1043152896. Throughput: 0: 43682.2. Samples: 946103200. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 19:43:53,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:43:55,542][06909] Updated weights for policy 0, policy_version 63673 (0.0029) [2024-06-27 19:43:58,850][06674] Fps is (10 sec: 40975.1, 60 sec: 43417.5, 300 sec: 43598.4). Total num frames: 1043333120. Throughput: 0: 43812.7. Samples: 946239200. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 19:43:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 19:43:59,462][06909] Updated weights for policy 0, policy_version 63683 (0.0044) [2024-06-27 19:44:03,008][06909] Updated weights for policy 0, policy_version 63693 (0.0034) [2024-06-27 19:44:03,850][06674] Fps is (10 sec: 39321.6, 60 sec: 43144.6, 300 sec: 43542.6). Total num frames: 1043546112. Throughput: 0: 43596.5. Samples: 946491920. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-27 19:44:03,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:44:06,801][06909] Updated weights for policy 0, policy_version 63703 (0.0040) [2024-06-27 19:44:08,850][06674] Fps is (10 sec: 49152.4, 60 sec: 44236.8, 300 sec: 43709.2). Total num frames: 1043824640. Throughput: 0: 43541.9. Samples: 946747940. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-27 19:44:08,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:44:10,587][06909] Updated weights for policy 0, policy_version 63713 (0.0029) [2024-06-27 19:44:13,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43690.7, 300 sec: 43653.7). Total num frames: 1044004864. Throughput: 0: 43536.4. Samples: 946889460. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-27 19:44:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 19:44:14,534][06909] Updated weights for policy 0, policy_version 63723 (0.0039) [2024-06-27 19:44:18,498][06909] Updated weights for policy 0, policy_version 63733 (0.0038) [2024-06-27 19:44:18,850][06674] Fps is (10 sec: 37683.4, 60 sec: 43417.7, 300 sec: 43598.1). Total num frames: 1044201472. Throughput: 0: 43689.5. Samples: 947145680. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-27 19:44:18,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:44:21,817][06909] Updated weights for policy 0, policy_version 63743 (0.0032) [2024-06-27 19:44:23,850][06674] Fps is (10 sec: 47512.8, 60 sec: 43963.7, 300 sec: 43764.7). Total num frames: 1044480000. Throughput: 0: 43672.0. Samples: 947409900. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-27 19:44:23,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:44:25,791][06909] Updated weights for policy 0, policy_version 63753 (0.0033) [2024-06-27 19:44:28,850][06674] Fps is (10 sec: 47513.5, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 1044676608. Throughput: 0: 43540.4. Samples: 947543600. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-27 19:44:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 19:44:29,761][06909] Updated weights for policy 0, policy_version 63763 (0.0042) [2024-06-27 19:44:30,048][06887] Signal inference workers to stop experience collection... (13500 times) [2024-06-27 19:44:30,101][06909] InferenceWorker_p0-w0: stopping experience collection (13500 times) [2024-06-27 19:44:30,108][06887] Signal inference workers to resume experience collection... (13500 times) [2024-06-27 19:44:30,111][06909] InferenceWorker_p0-w0: resuming experience collection (13500 times) [2024-06-27 19:44:33,094][06909] Updated weights for policy 0, policy_version 63773 (0.0037) [2024-06-27 19:44:33,851][06674] Fps is (10 sec: 39317.6, 60 sec: 43963.0, 300 sec: 43653.5). Total num frames: 1044873216. Throughput: 0: 43615.9. Samples: 947803180. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-27 19:44:33,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:44:37,144][06909] Updated weights for policy 0, policy_version 63783 (0.0036) [2024-06-27 19:44:38,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 1045118976. Throughput: 0: 43497.3. Samples: 948060580. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-27 19:44:38,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:44:40,939][06909] Updated weights for policy 0, policy_version 63793 (0.0034) [2024-06-27 19:44:43,852][06674] Fps is (10 sec: 45870.8, 60 sec: 43962.2, 300 sec: 43708.9). Total num frames: 1045331968. Throughput: 0: 43702.1. Samples: 948205880. Policy #0 lag: (min: 1.0, avg: 9.0, max: 22.0) [2024-06-27 19:44:43,852][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 19:44:44,405][06909] Updated weights for policy 0, policy_version 63803 (0.0037) [2024-06-27 19:44:48,230][06909] Updated weights for policy 0, policy_version 63813 (0.0036) [2024-06-27 19:44:48,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43420.4, 300 sec: 43653.7). Total num frames: 1045528576. Throughput: 0: 43665.4. Samples: 948456860. Policy #0 lag: (min: 1.0, avg: 9.0, max: 22.0) [2024-06-27 19:44:48,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:44:52,223][06909] Updated weights for policy 0, policy_version 63823 (0.0037) [2024-06-27 19:44:53,850][06674] Fps is (10 sec: 45884.6, 60 sec: 43963.7, 300 sec: 43876.7). Total num frames: 1045790720. Throughput: 0: 43733.8. Samples: 948715960. Policy #0 lag: (min: 1.0, avg: 9.0, max: 22.0) [2024-06-27 19:44:53,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:44:55,508][06909] Updated weights for policy 0, policy_version 63833 (0.0021) [2024-06-27 19:44:58,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43963.7, 300 sec: 43653.6). Total num frames: 1045970944. Throughput: 0: 43822.5. Samples: 948861480. Policy #0 lag: (min: 1.0, avg: 9.0, max: 22.0) [2024-06-27 19:44:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:44:59,498][06909] Updated weights for policy 0, policy_version 63843 (0.0030) [2024-06-27 19:45:02,843][06909] Updated weights for policy 0, policy_version 63853 (0.0039) [2024-06-27 19:45:03,850][06674] Fps is (10 sec: 39321.9, 60 sec: 43963.8, 300 sec: 43598.1). Total num frames: 1046183936. Throughput: 0: 43618.3. Samples: 949108500. Policy #0 lag: (min: 1.0, avg: 9.0, max: 22.0) [2024-06-27 19:45:03,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:45:07,039][06909] Updated weights for policy 0, policy_version 63863 (0.0040) [2024-06-27 19:45:08,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 1046429696. Throughput: 0: 43570.8. Samples: 949370580. Policy #0 lag: (min: 1.0, avg: 9.0, max: 22.0) [2024-06-27 19:45:08,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:45:10,103][06909] Updated weights for policy 0, policy_version 63873 (0.0045) [2024-06-27 19:45:13,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.7, 300 sec: 43653.7). Total num frames: 1046626304. Throughput: 0: 43768.1. Samples: 949513160. Policy #0 lag: (min: 1.0, avg: 9.0, max: 22.0) [2024-06-27 19:45:13,853][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:45:14,790][06909] Updated weights for policy 0, policy_version 63883 (0.0041) [2024-06-27 19:45:17,742][06909] Updated weights for policy 0, policy_version 63893 (0.0031) [2024-06-27 19:45:18,850][06674] Fps is (10 sec: 39321.7, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 1046822912. Throughput: 0: 43615.3. Samples: 949765820. Policy #0 lag: (min: 1.0, avg: 9.0, max: 22.0) [2024-06-27 19:45:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:45:22,366][06909] Updated weights for policy 0, policy_version 63903 (0.0029) [2024-06-27 19:45:23,850][06674] Fps is (10 sec: 45874.3, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 1047085056. Throughput: 0: 43609.3. Samples: 950023000. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 19:45:23,851][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:45:25,590][06909] Updated weights for policy 0, policy_version 63913 (0.0024) [2024-06-27 19:45:28,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43144.4, 300 sec: 43598.1). Total num frames: 1047265280. Throughput: 0: 43425.0. Samples: 950159920. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 19:45:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:45:29,871][06909] Updated weights for policy 0, policy_version 63923 (0.0034) [2024-06-27 19:45:32,995][06909] Updated weights for policy 0, policy_version 63933 (0.0044) [2024-06-27 19:45:33,850][06674] Fps is (10 sec: 39321.4, 60 sec: 43418.3, 300 sec: 43598.1). Total num frames: 1047478272. Throughput: 0: 43354.0. Samples: 950407800. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 19:45:33,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:45:37,440][06909] Updated weights for policy 0, policy_version 63943 (0.0030) [2024-06-27 19:45:38,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 1047724032. Throughput: 0: 43584.4. Samples: 950677260. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 19:45:38,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:45:40,244][06909] Updated weights for policy 0, policy_version 63953 (0.0033) [2024-06-27 19:45:43,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43146.0, 300 sec: 43653.7). Total num frames: 1047920640. Throughput: 0: 43388.0. Samples: 950813940. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 19:45:43,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:45:45,011][06909] Updated weights for policy 0, policy_version 63963 (0.0045) [2024-06-27 19:45:45,667][06887] Signal inference workers to stop experience collection... (13550 times) [2024-06-27 19:45:45,667][06887] Signal inference workers to resume experience collection... (13550 times) [2024-06-27 19:45:45,711][06909] InferenceWorker_p0-w0: stopping experience collection (13550 times) [2024-06-27 19:45:45,711][06909] InferenceWorker_p0-w0: resuming experience collection (13550 times) [2024-06-27 19:45:47,792][06909] Updated weights for policy 0, policy_version 63973 (0.0034) [2024-06-27 19:45:48,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 1048133632. Throughput: 0: 43384.0. Samples: 951060780. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 19:45:48,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:45:48,932][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000063974_1048150016.pth... [2024-06-27 19:45:48,982][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000063335_1037680640.pth [2024-06-27 19:45:52,519][06909] Updated weights for policy 0, policy_version 63983 (0.0030) [2024-06-27 19:45:53,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43144.5, 300 sec: 43653.9). Total num frames: 1048379392. Throughput: 0: 43592.8. Samples: 951332260. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 19:45:53,851][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:45:55,171][06909] Updated weights for policy 0, policy_version 63993 (0.0052) [2024-06-27 19:45:58,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 1048576000. Throughput: 0: 43280.3. Samples: 951460780. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 19:45:58,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 19:45:59,936][06909] Updated weights for policy 0, policy_version 64003 (0.0030) [2024-06-27 19:46:02,521][06909] Updated weights for policy 0, policy_version 64013 (0.0034) [2024-06-27 19:46:03,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 1048805376. Throughput: 0: 43414.2. Samples: 951719460. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2024-06-27 19:46:03,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:46:07,290][06909] Updated weights for policy 0, policy_version 64023 (0.0038) [2024-06-27 19:46:08,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43417.6, 300 sec: 43653.6). Total num frames: 1049034752. Throughput: 0: 43599.7. Samples: 951984980. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2024-06-27 19:46:08,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:46:10,034][06909] Updated weights for policy 0, policy_version 64033 (0.0033) [2024-06-27 19:46:13,850][06674] Fps is (10 sec: 39321.8, 60 sec: 42871.4, 300 sec: 43542.9). Total num frames: 1049198592. Throughput: 0: 43523.3. Samples: 952118460. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2024-06-27 19:46:13,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:46:14,899][06909] Updated weights for policy 0, policy_version 64043 (0.0030) [2024-06-27 19:46:17,567][06909] Updated weights for policy 0, policy_version 64053 (0.0034) [2024-06-27 19:46:18,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 1049444352. Throughput: 0: 43609.5. Samples: 952370220. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2024-06-27 19:46:18,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:46:22,478][06909] Updated weights for policy 0, policy_version 64063 (0.0030) [2024-06-27 19:46:23,855][06674] Fps is (10 sec: 49124.3, 60 sec: 43413.6, 300 sec: 43763.9). Total num frames: 1049690112. Throughput: 0: 43589.3. Samples: 952639020. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2024-06-27 19:46:23,856][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:46:24,916][06909] Updated weights for policy 0, policy_version 64073 (0.0038) [2024-06-27 19:46:28,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43144.7, 300 sec: 43542.6). Total num frames: 1049853952. Throughput: 0: 43503.7. Samples: 952771600. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2024-06-27 19:46:28,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 19:46:30,085][06909] Updated weights for policy 0, policy_version 64083 (0.0033) [2024-06-27 19:46:32,609][06909] Updated weights for policy 0, policy_version 64093 (0.0048) [2024-06-27 19:46:33,850][06674] Fps is (10 sec: 40983.0, 60 sec: 43690.8, 300 sec: 43598.1). Total num frames: 1050099712. Throughput: 0: 43424.0. Samples: 953014860. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2024-06-27 19:46:33,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:46:37,930][06909] Updated weights for policy 0, policy_version 64103 (0.0025) [2024-06-27 19:46:38,850][06674] Fps is (10 sec: 47513.6, 60 sec: 43417.7, 300 sec: 43654.0). Total num frames: 1050329088. Throughput: 0: 43578.8. Samples: 953293300. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2024-06-27 19:46:38,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:46:39,915][06909] Updated weights for policy 0, policy_version 64113 (0.0023) [2024-06-27 19:46:43,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43417.7, 300 sec: 43542.6). Total num frames: 1050525696. Throughput: 0: 43614.3. Samples: 953423420. Policy #0 lag: (min: 0.0, avg: 12.8, max: 26.0) [2024-06-27 19:46:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 19:46:45,190][06909] Updated weights for policy 0, policy_version 64123 (0.0030) [2024-06-27 19:46:47,569][06909] Updated weights for policy 0, policy_version 64133 (0.0041) [2024-06-27 19:46:48,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43963.6, 300 sec: 43653.6). Total num frames: 1050771456. Throughput: 0: 43450.2. Samples: 953674720. Policy #0 lag: (min: 0.0, avg: 12.8, max: 26.0) [2024-06-27 19:46:48,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:46:52,096][06887] Signal inference workers to stop experience collection... (13600 times) [2024-06-27 19:46:52,102][06887] Signal inference workers to resume experience collection... (13600 times) [2024-06-27 19:46:52,142][06909] InferenceWorker_p0-w0: stopping experience collection (13600 times) [2024-06-27 19:46:52,143][06909] InferenceWorker_p0-w0: resuming experience collection (13600 times) [2024-06-27 19:46:52,454][06909] Updated weights for policy 0, policy_version 64143 (0.0026) [2024-06-27 19:46:53,856][06674] Fps is (10 sec: 45847.3, 60 sec: 43413.3, 300 sec: 43652.7). Total num frames: 1050984448. Throughput: 0: 43592.8. Samples: 953946920. Policy #0 lag: (min: 0.0, avg: 12.8, max: 26.0) [2024-06-27 19:46:53,856][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 19:46:55,077][06909] Updated weights for policy 0, policy_version 64153 (0.0031) [2024-06-27 19:46:58,850][06674] Fps is (10 sec: 39321.7, 60 sec: 43144.6, 300 sec: 43542.6). Total num frames: 1051164672. Throughput: 0: 43520.8. Samples: 954076900. Policy #0 lag: (min: 0.0, avg: 12.8, max: 26.0) [2024-06-27 19:46:58,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:47:00,172][06909] Updated weights for policy 0, policy_version 64163 (0.0036) [2024-06-27 19:47:02,510][06909] Updated weights for policy 0, policy_version 64173 (0.0041) [2024-06-27 19:47:03,850][06674] Fps is (10 sec: 44263.9, 60 sec: 43690.7, 300 sec: 43653.7). Total num frames: 1051426816. Throughput: 0: 43607.6. Samples: 954332560. Policy #0 lag: (min: 0.0, avg: 12.8, max: 26.0) [2024-06-27 19:47:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:47:07,806][06909] Updated weights for policy 0, policy_version 64183 (0.0032) [2024-06-27 19:47:08,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43144.5, 300 sec: 43598.1). Total num frames: 1051623424. Throughput: 0: 43734.3. Samples: 954606820. Policy #0 lag: (min: 0.0, avg: 12.8, max: 26.0) [2024-06-27 19:47:08,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:47:09,878][06909] Updated weights for policy 0, policy_version 64193 (0.0039) [2024-06-27 19:47:13,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43963.7, 300 sec: 43598.1). Total num frames: 1051836416. Throughput: 0: 43498.1. Samples: 954729020. Policy #0 lag: (min: 0.0, avg: 12.8, max: 26.0) [2024-06-27 19:47:13,850][06674] Avg episode reward: [(0, '0.399')] [2024-06-27 19:47:15,137][06909] Updated weights for policy 0, policy_version 64203 (0.0035) [2024-06-27 19:47:17,346][06909] Updated weights for policy 0, policy_version 64213 (0.0026) [2024-06-27 19:47:18,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43963.6, 300 sec: 43709.2). Total num frames: 1052082176. Throughput: 0: 43817.2. Samples: 954986640. Policy #0 lag: (min: 0.0, avg: 12.8, max: 26.0) [2024-06-27 19:47:18,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:47:22,688][06909] Updated weights for policy 0, policy_version 64223 (0.0041) [2024-06-27 19:47:23,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43421.6, 300 sec: 43598.1). Total num frames: 1052295168. Throughput: 0: 43693.2. Samples: 955259500. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-27 19:47:23,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:47:25,191][06909] Updated weights for policy 0, policy_version 64233 (0.0035) [2024-06-27 19:47:28,856][06674] Fps is (10 sec: 39298.4, 60 sec: 43686.2, 300 sec: 43486.1). Total num frames: 1052475392. Throughput: 0: 43634.6. Samples: 955387240. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-27 19:47:28,856][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:47:30,157][06909] Updated weights for policy 0, policy_version 64243 (0.0029) [2024-06-27 19:47:32,702][06909] Updated weights for policy 0, policy_version 64253 (0.0033) [2024-06-27 19:47:33,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.6, 300 sec: 43653.6). Total num frames: 1052737536. Throughput: 0: 43778.6. Samples: 955644760. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-27 19:47:33,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:47:37,724][06909] Updated weights for policy 0, policy_version 64263 (0.0036) [2024-06-27 19:47:38,850][06674] Fps is (10 sec: 47542.1, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 1052950528. Throughput: 0: 43588.9. Samples: 955908160. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-27 19:47:38,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:47:40,180][06909] Updated weights for policy 0, policy_version 64273 (0.0029) [2024-06-27 19:47:43,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 1053147136. Throughput: 0: 43522.7. Samples: 956035420. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-27 19:47:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:47:45,157][06909] Updated weights for policy 0, policy_version 64283 (0.0041) [2024-06-27 19:47:47,938][06909] Updated weights for policy 0, policy_version 64293 (0.0024) [2024-06-27 19:47:48,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.7, 300 sec: 43653.6). Total num frames: 1053409280. Throughput: 0: 43700.3. Samples: 956299080. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-27 19:47:48,856][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:47:48,872][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000064295_1053409280.pth... [2024-06-27 19:47:48,930][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000063655_1042923520.pth [2024-06-27 19:47:52,663][06909] Updated weights for policy 0, policy_version 64303 (0.0032) [2024-06-27 19:47:52,760][06887] Signal inference workers to stop experience collection... (13650 times) [2024-06-27 19:47:52,813][06887] Signal inference workers to resume experience collection... (13650 times) [2024-06-27 19:47:52,813][06909] InferenceWorker_p0-w0: stopping experience collection (13650 times) [2024-06-27 19:47:52,841][06909] InferenceWorker_p0-w0: resuming experience collection (13650 times) [2024-06-27 19:47:53,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43695.0, 300 sec: 43653.6). Total num frames: 1053605888. Throughput: 0: 43689.3. Samples: 956572840. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-27 19:47:53,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:47:55,236][06909] Updated weights for policy 0, policy_version 64313 (0.0047) [2024-06-27 19:47:58,850][06674] Fps is (10 sec: 39322.1, 60 sec: 43963.8, 300 sec: 43542.6). Total num frames: 1053802496. Throughput: 0: 43820.5. Samples: 956700940. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-27 19:47:58,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:47:59,999][06909] Updated weights for policy 0, policy_version 64323 (0.0029) [2024-06-27 19:48:02,715][06909] Updated weights for policy 0, policy_version 64333 (0.0036) [2024-06-27 19:48:03,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 1054048256. Throughput: 0: 43825.5. Samples: 956958780. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2024-06-27 19:48:03,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:48:07,430][06909] Updated weights for policy 0, policy_version 64343 (0.0033) [2024-06-27 19:48:08,856][06674] Fps is (10 sec: 47484.7, 60 sec: 44232.3, 300 sec: 43708.3). Total num frames: 1054277632. Throughput: 0: 43567.5. Samples: 957220300. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2024-06-27 19:48:08,856][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:48:10,599][06909] Updated weights for policy 0, policy_version 64353 (0.0036) [2024-06-27 19:48:13,850][06674] Fps is (10 sec: 39321.6, 60 sec: 43417.7, 300 sec: 43542.6). Total num frames: 1054441472. Throughput: 0: 43569.4. Samples: 957347600. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2024-06-27 19:48:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:48:15,192][06909] Updated weights for policy 0, policy_version 64363 (0.0043) [2024-06-27 19:48:18,142][06909] Updated weights for policy 0, policy_version 64373 (0.0051) [2024-06-27 19:48:18,850][06674] Fps is (10 sec: 42624.1, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 1054703616. Throughput: 0: 43709.4. Samples: 957611680. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2024-06-27 19:48:18,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:48:22,611][06909] Updated weights for policy 0, policy_version 64383 (0.0036) [2024-06-27 19:48:23,850][06674] Fps is (10 sec: 47513.4, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 1054916608. Throughput: 0: 43611.1. Samples: 957870660. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2024-06-27 19:48:23,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:48:25,644][06909] Updated weights for policy 0, policy_version 64393 (0.0033) [2024-06-27 19:48:28,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43968.2, 300 sec: 43653.7). Total num frames: 1055113216. Throughput: 0: 43539.2. Samples: 957994680. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2024-06-27 19:48:28,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:48:30,316][06909] Updated weights for policy 0, policy_version 64403 (0.0047) [2024-06-27 19:48:33,258][06909] Updated weights for policy 0, policy_version 64413 (0.0039) [2024-06-27 19:48:33,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43690.7, 300 sec: 43542.6). Total num frames: 1055358976. Throughput: 0: 43624.5. Samples: 958262180. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2024-06-27 19:48:33,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:48:37,881][06909] Updated weights for policy 0, policy_version 64423 (0.0028) [2024-06-27 19:48:38,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 1055571968. Throughput: 0: 43402.6. Samples: 958525960. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2024-06-27 19:48:38,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:48:40,651][06909] Updated weights for policy 0, policy_version 64433 (0.0033) [2024-06-27 19:48:43,850][06674] Fps is (10 sec: 39321.7, 60 sec: 43417.6, 300 sec: 43487.6). Total num frames: 1055752192. Throughput: 0: 43331.9. Samples: 958650880. Policy #0 lag: (min: 1.0, avg: 11.2, max: 22.0) [2024-06-27 19:48:43,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:48:45,122][06909] Updated weights for policy 0, policy_version 64443 (0.0035) [2024-06-27 19:48:48,506][06909] Updated weights for policy 0, policy_version 64453 (0.0043) [2024-06-27 19:48:48,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43144.6, 300 sec: 43542.6). Total num frames: 1055997952. Throughput: 0: 43533.3. Samples: 958917780. Policy #0 lag: (min: 1.0, avg: 11.2, max: 22.0) [2024-06-27 19:48:48,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:48:52,794][06909] Updated weights for policy 0, policy_version 64463 (0.0024) [2024-06-27 19:48:53,850][06674] Fps is (10 sec: 47514.1, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 1056227328. Throughput: 0: 43449.0. Samples: 959175240. Policy #0 lag: (min: 1.0, avg: 11.2, max: 22.0) [2024-06-27 19:48:53,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:48:56,030][06909] Updated weights for policy 0, policy_version 64473 (0.0022) [2024-06-27 19:48:58,850][06674] Fps is (10 sec: 39320.9, 60 sec: 43144.4, 300 sec: 43542.5). Total num frames: 1056391168. Throughput: 0: 43415.4. Samples: 959301300. Policy #0 lag: (min: 1.0, avg: 11.2, max: 22.0) [2024-06-27 19:48:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:48:59,701][06887] Signal inference workers to stop experience collection... (13700 times) [2024-06-27 19:48:59,702][06887] Signal inference workers to resume experience collection... (13700 times) [2024-06-27 19:48:59,722][06909] InferenceWorker_p0-w0: stopping experience collection (13700 times) [2024-06-27 19:48:59,722][06909] InferenceWorker_p0-w0: resuming experience collection (13700 times) [2024-06-27 19:49:00,368][06909] Updated weights for policy 0, policy_version 64483 (0.0038) [2024-06-27 19:49:03,684][06909] Updated weights for policy 0, policy_version 64493 (0.0033) [2024-06-27 19:49:03,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43417.5, 300 sec: 43487.0). Total num frames: 1056653312. Throughput: 0: 43457.8. Samples: 959567280. Policy #0 lag: (min: 1.0, avg: 11.2, max: 22.0) [2024-06-27 19:49:03,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:49:07,762][06909] Updated weights for policy 0, policy_version 64503 (0.0034) [2024-06-27 19:49:08,850][06674] Fps is (10 sec: 47513.7, 60 sec: 43148.8, 300 sec: 43598.1). Total num frames: 1056866304. Throughput: 0: 43509.2. Samples: 959828580. Policy #0 lag: (min: 1.0, avg: 11.2, max: 22.0) [2024-06-27 19:49:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 19:49:10,941][06909] Updated weights for policy 0, policy_version 64513 (0.0039) [2024-06-27 19:49:13,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 1057062912. Throughput: 0: 43575.4. Samples: 959955580. Policy #0 lag: (min: 1.0, avg: 11.2, max: 22.0) [2024-06-27 19:49:13,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:49:15,343][06909] Updated weights for policy 0, policy_version 64523 (0.0032) [2024-06-27 19:49:18,361][06909] Updated weights for policy 0, policy_version 64533 (0.0035) [2024-06-27 19:49:18,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43690.7, 300 sec: 43542.6). Total num frames: 1057325056. Throughput: 0: 43668.0. Samples: 960227240. Policy #0 lag: (min: 1.0, avg: 11.2, max: 22.0) [2024-06-27 19:49:18,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:49:22,632][06909] Updated weights for policy 0, policy_version 64543 (0.0032) [2024-06-27 19:49:23,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43417.6, 300 sec: 43542.6). Total num frames: 1057521664. Throughput: 0: 43607.2. Samples: 960488280. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2024-06-27 19:49:23,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:49:26,271][06909] Updated weights for policy 0, policy_version 64553 (0.0039) [2024-06-27 19:49:28,850][06674] Fps is (10 sec: 39321.8, 60 sec: 43417.6, 300 sec: 43542.7). Total num frames: 1057718272. Throughput: 0: 43535.2. Samples: 960609960. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2024-06-27 19:49:28,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:49:30,602][06909] Updated weights for policy 0, policy_version 64563 (0.0046) [2024-06-27 19:49:33,683][06909] Updated weights for policy 0, policy_version 64573 (0.0031) [2024-06-27 19:49:33,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43417.6, 300 sec: 43542.6). Total num frames: 1057964032. Throughput: 0: 43513.6. Samples: 960875900. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2024-06-27 19:49:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:49:37,899][06909] Updated weights for policy 0, policy_version 64583 (0.0032) [2024-06-27 19:49:38,850][06674] Fps is (10 sec: 47513.3, 60 sec: 43690.7, 300 sec: 43598.4). Total num frames: 1058193408. Throughput: 0: 43501.7. Samples: 961132820. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2024-06-27 19:49:38,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:49:41,285][06909] Updated weights for policy 0, policy_version 64593 (0.0038) [2024-06-27 19:49:43,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43690.7, 300 sec: 43542.6). Total num frames: 1058373632. Throughput: 0: 43674.3. Samples: 961266640. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2024-06-27 19:49:43,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:49:45,257][06909] Updated weights for policy 0, policy_version 64603 (0.0049) [2024-06-27 19:49:48,775][06909] Updated weights for policy 0, policy_version 64613 (0.0036) [2024-06-27 19:49:48,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.5, 300 sec: 43487.0). Total num frames: 1058619392. Throughput: 0: 43601.7. Samples: 961529360. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2024-06-27 19:49:48,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:49:48,869][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000064613_1058619392.pth... [2024-06-27 19:49:48,932][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000063974_1048150016.pth [2024-06-27 19:49:52,810][06909] Updated weights for policy 0, policy_version 64623 (0.0032) [2024-06-27 19:49:53,850][06674] Fps is (10 sec: 47513.6, 60 sec: 43690.6, 300 sec: 43653.7). Total num frames: 1058848768. Throughput: 0: 43577.0. Samples: 961789540. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2024-06-27 19:49:53,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:49:56,281][06909] Updated weights for policy 0, policy_version 64633 (0.0040) [2024-06-27 19:49:58,850][06674] Fps is (10 sec: 39322.1, 60 sec: 43690.8, 300 sec: 43487.0). Total num frames: 1059012608. Throughput: 0: 43698.3. Samples: 961922000. Policy #0 lag: (min: 0.0, avg: 10.0, max: 24.0) [2024-06-27 19:49:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 19:50:00,445][06909] Updated weights for policy 0, policy_version 64643 (0.0035) [2024-06-27 19:50:03,705][06909] Updated weights for policy 0, policy_version 64653 (0.0035) [2024-06-27 19:50:03,852][06674] Fps is (10 sec: 42589.6, 60 sec: 43689.2, 300 sec: 43542.3). Total num frames: 1059274752. Throughput: 0: 43528.3. Samples: 962186100. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2024-06-27 19:50:03,852][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:50:08,122][06909] Updated weights for policy 0, policy_version 64663 (0.0026) [2024-06-27 19:50:08,850][06674] Fps is (10 sec: 49151.5, 60 sec: 43963.7, 300 sec: 43653.6). Total num frames: 1059504128. Throughput: 0: 43473.7. Samples: 962444600. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2024-06-27 19:50:08,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:50:10,836][06887] Signal inference workers to stop experience collection... (13750 times) [2024-06-27 19:50:10,884][06909] InferenceWorker_p0-w0: stopping experience collection (13750 times) [2024-06-27 19:50:10,945][06887] Signal inference workers to resume experience collection... (13750 times) [2024-06-27 19:50:10,946][06909] InferenceWorker_p0-w0: resuming experience collection (13750 times) [2024-06-27 19:50:11,390][06909] Updated weights for policy 0, policy_version 64673 (0.0029) [2024-06-27 19:50:13,850][06674] Fps is (10 sec: 39329.4, 60 sec: 43417.6, 300 sec: 43542.6). Total num frames: 1059667968. Throughput: 0: 43623.5. Samples: 962573020. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2024-06-27 19:50:13,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:50:15,505][06909] Updated weights for policy 0, policy_version 64683 (0.0030) [2024-06-27 19:50:18,774][06909] Updated weights for policy 0, policy_version 64693 (0.0033) [2024-06-27 19:50:18,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43417.6, 300 sec: 43542.6). Total num frames: 1059930112. Throughput: 0: 43523.7. Samples: 962834460. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2024-06-27 19:50:18,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:50:22,733][06909] Updated weights for policy 0, policy_version 64703 (0.0030) [2024-06-27 19:50:23,850][06674] Fps is (10 sec: 47513.8, 60 sec: 43690.7, 300 sec: 43653.7). Total num frames: 1060143104. Throughput: 0: 43620.9. Samples: 963095760. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2024-06-27 19:50:23,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:50:26,108][06909] Updated weights for policy 0, policy_version 64713 (0.0038) [2024-06-27 19:50:28,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 1060339712. Throughput: 0: 43592.8. Samples: 963228320. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2024-06-27 19:50:28,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:50:30,285][06909] Updated weights for policy 0, policy_version 64723 (0.0035) [2024-06-27 19:50:33,822][06909] Updated weights for policy 0, policy_version 64733 (0.0035) [2024-06-27 19:50:33,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 1060585472. Throughput: 0: 43690.2. Samples: 963495420. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2024-06-27 19:50:33,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:50:37,649][06909] Updated weights for policy 0, policy_version 64743 (0.0032) [2024-06-27 19:50:38,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43417.6, 300 sec: 43653.6). Total num frames: 1060798464. Throughput: 0: 43639.0. Samples: 963753300. Policy #0 lag: (min: 0.0, avg: 9.0, max: 23.0) [2024-06-27 19:50:38,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:50:41,377][06909] Updated weights for policy 0, policy_version 64753 (0.0040) [2024-06-27 19:50:43,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 1060995072. Throughput: 0: 43677.2. Samples: 963887480. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-27 19:50:43,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:50:45,091][06909] Updated weights for policy 0, policy_version 64763 (0.0035) [2024-06-27 19:50:48,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43417.6, 300 sec: 43542.6). Total num frames: 1061224448. Throughput: 0: 43697.0. Samples: 964152380. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-27 19:50:48,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:50:48,912][06909] Updated weights for policy 0, policy_version 64773 (0.0029) [2024-06-27 19:50:52,648][06909] Updated weights for policy 0, policy_version 64783 (0.0039) [2024-06-27 19:50:53,850][06674] Fps is (10 sec: 47513.6, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 1061470208. Throughput: 0: 43524.9. Samples: 964403220. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-27 19:50:53,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:50:56,219][06909] Updated weights for policy 0, policy_version 64793 (0.0030) [2024-06-27 19:50:58,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43963.8, 300 sec: 43542.6). Total num frames: 1061650432. Throughput: 0: 43787.7. Samples: 964543460. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-27 19:50:58,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:51:00,057][06909] Updated weights for policy 0, policy_version 64803 (0.0035) [2024-06-27 19:51:03,666][06909] Updated weights for policy 0, policy_version 64813 (0.0029) [2024-06-27 19:51:03,852][06674] Fps is (10 sec: 42590.0, 60 sec: 43690.7, 300 sec: 43597.8). Total num frames: 1061896192. Throughput: 0: 43894.9. Samples: 964809820. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-27 19:51:03,852][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:51:07,647][06909] Updated weights for policy 0, policy_version 64823 (0.0033) [2024-06-27 19:51:08,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43417.6, 300 sec: 43764.7). Total num frames: 1062109184. Throughput: 0: 43794.6. Samples: 965066520. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-27 19:51:08,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:51:11,401][06909] Updated weights for policy 0, policy_version 64833 (0.0043) [2024-06-27 19:51:13,850][06674] Fps is (10 sec: 44245.5, 60 sec: 44509.8, 300 sec: 43709.2). Total num frames: 1062338560. Throughput: 0: 43847.1. Samples: 965201440. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-27 19:51:13,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:51:15,189][06909] Updated weights for policy 0, policy_version 64843 (0.0032) [2024-06-27 19:51:18,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43417.5, 300 sec: 43543.4). Total num frames: 1062535168. Throughput: 0: 43761.4. Samples: 965464680. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-27 19:51:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:51:19,013][06909] Updated weights for policy 0, policy_version 64853 (0.0036) [2024-06-27 19:51:22,675][06909] Updated weights for policy 0, policy_version 64863 (0.0035) [2024-06-27 19:51:23,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 1062764544. Throughput: 0: 43552.9. Samples: 965713180. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 19:51:23,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:51:26,694][06909] Updated weights for policy 0, policy_version 64873 (0.0037) [2024-06-27 19:51:28,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 1062961152. Throughput: 0: 43535.1. Samples: 965846560. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 19:51:28,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:51:30,238][06909] Updated weights for policy 0, policy_version 64883 (0.0030) [2024-06-27 19:51:33,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43417.7, 300 sec: 43598.1). Total num frames: 1063190528. Throughput: 0: 43466.3. Samples: 966108360. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 19:51:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:51:34,062][06909] Updated weights for policy 0, policy_version 64893 (0.0031) [2024-06-27 19:51:37,220][06887] Signal inference workers to stop experience collection... (13800 times) [2024-06-27 19:51:37,220][06887] Signal inference workers to resume experience collection... (13800 times) [2024-06-27 19:51:37,258][06909] InferenceWorker_p0-w0: stopping experience collection (13800 times) [2024-06-27 19:51:37,258][06909] InferenceWorker_p0-w0: resuming experience collection (13800 times) [2024-06-27 19:51:37,537][06909] Updated weights for policy 0, policy_version 64903 (0.0037) [2024-06-27 19:51:38,850][06674] Fps is (10 sec: 47513.7, 60 sec: 43963.7, 300 sec: 43764.7). Total num frames: 1063436288. Throughput: 0: 43728.9. Samples: 966371020. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 19:51:38,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:51:41,291][06909] Updated weights for policy 0, policy_version 64913 (0.0033) [2024-06-27 19:51:43,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43690.7, 300 sec: 43542.6). Total num frames: 1063616512. Throughput: 0: 43627.4. Samples: 966506700. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 19:51:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:51:44,878][06909] Updated weights for policy 0, policy_version 64923 (0.0036) [2024-06-27 19:51:48,775][06909] Updated weights for policy 0, policy_version 64933 (0.0044) [2024-06-27 19:51:48,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.8, 300 sec: 43654.5). Total num frames: 1063862272. Throughput: 0: 43585.1. Samples: 966771060. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 19:51:48,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:51:48,862][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000064933_1063862272.pth... [2024-06-27 19:51:48,911][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000064295_1053409280.pth [2024-06-27 19:51:52,507][06909] Updated weights for policy 0, policy_version 64943 (0.0026) [2024-06-27 19:51:53,852][06674] Fps is (10 sec: 45866.1, 60 sec: 43416.1, 300 sec: 43764.4). Total num frames: 1064075264. Throughput: 0: 43570.1. Samples: 967027260. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 19:51:53,852][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:51:56,570][06909] Updated weights for policy 0, policy_version 64953 (0.0046) [2024-06-27 19:51:58,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43963.6, 300 sec: 43598.1). Total num frames: 1064288256. Throughput: 0: 43716.9. Samples: 967168700. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 19:51:58,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:51:59,928][06909] Updated weights for policy 0, policy_version 64963 (0.0042) [2024-06-27 19:52:03,850][06674] Fps is (10 sec: 42607.3, 60 sec: 43419.1, 300 sec: 43653.6). Total num frames: 1064501248. Throughput: 0: 43591.6. Samples: 967426300. Policy #0 lag: (min: 1.0, avg: 11.5, max: 21.0) [2024-06-27 19:52:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:52:04,245][06909] Updated weights for policy 0, policy_version 64973 (0.0033) [2024-06-27 19:52:07,578][06909] Updated weights for policy 0, policy_version 64983 (0.0046) [2024-06-27 19:52:08,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 1064730624. Throughput: 0: 43901.3. Samples: 967688740. Policy #0 lag: (min: 1.0, avg: 11.5, max: 21.0) [2024-06-27 19:52:08,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:52:11,517][06909] Updated weights for policy 0, policy_version 64993 (0.0032) [2024-06-27 19:52:13,852][06674] Fps is (10 sec: 42589.5, 60 sec: 43143.1, 300 sec: 43542.3). Total num frames: 1064927232. Throughput: 0: 43852.7. Samples: 967820020. Policy #0 lag: (min: 1.0, avg: 11.5, max: 21.0) [2024-06-27 19:52:13,852][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:52:14,980][06909] Updated weights for policy 0, policy_version 65003 (0.0026) [2024-06-27 19:52:18,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 1065156608. Throughput: 0: 43883.9. Samples: 968083140. Policy #0 lag: (min: 1.0, avg: 11.5, max: 21.0) [2024-06-27 19:52:18,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:52:19,045][06909] Updated weights for policy 0, policy_version 65013 (0.0024) [2024-06-27 19:52:22,384][06909] Updated weights for policy 0, policy_version 65023 (0.0032) [2024-06-27 19:52:23,850][06674] Fps is (10 sec: 47523.2, 60 sec: 43963.7, 300 sec: 43821.1). Total num frames: 1065402368. Throughput: 0: 43794.7. Samples: 968341780. Policy #0 lag: (min: 1.0, avg: 11.5, max: 21.0) [2024-06-27 19:52:23,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:52:26,600][06909] Updated weights for policy 0, policy_version 65033 (0.0034) [2024-06-27 19:52:28,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.8, 300 sec: 43598.1). Total num frames: 1065598976. Throughput: 0: 43810.3. Samples: 968478160. Policy #0 lag: (min: 1.0, avg: 11.5, max: 21.0) [2024-06-27 19:52:28,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:52:29,878][06909] Updated weights for policy 0, policy_version 65043 (0.0031) [2024-06-27 19:52:33,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 1065811968. Throughput: 0: 43706.2. Samples: 968737840. Policy #0 lag: (min: 1.0, avg: 11.5, max: 21.0) [2024-06-27 19:52:33,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 19:52:34,019][06909] Updated weights for policy 0, policy_version 65053 (0.0029) [2024-06-27 19:52:37,353][06909] Updated weights for policy 0, policy_version 65063 (0.0023) [2024-06-27 19:52:38,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 1066041344. Throughput: 0: 43606.0. Samples: 968989440. Policy #0 lag: (min: 1.0, avg: 11.5, max: 21.0) [2024-06-27 19:52:38,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:52:42,021][06909] Updated weights for policy 0, policy_version 65073 (0.0031) [2024-06-27 19:52:43,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 1066221568. Throughput: 0: 43523.6. Samples: 969127260. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 19:52:43,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:52:45,065][06909] Updated weights for policy 0, policy_version 65083 (0.0038) [2024-06-27 19:52:48,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 1066467328. Throughput: 0: 43436.4. Samples: 969380940. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 19:52:48,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:52:49,356][06909] Updated weights for policy 0, policy_version 65093 (0.0024) [2024-06-27 19:52:52,746][06909] Updated weights for policy 0, policy_version 65103 (0.0039) [2024-06-27 19:52:53,852][06674] Fps is (10 sec: 47504.5, 60 sec: 43690.7, 300 sec: 43708.9). Total num frames: 1066696704. Throughput: 0: 43363.9. Samples: 969640200. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 19:52:53,852][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:52:56,909][06909] Updated weights for policy 0, policy_version 65113 (0.0035) [2024-06-27 19:52:58,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43417.6, 300 sec: 43542.5). Total num frames: 1066893312. Throughput: 0: 43569.5. Samples: 969780560. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 19:52:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:52:59,524][06887] Signal inference workers to stop experience collection... (13850 times) [2024-06-27 19:52:59,583][06887] Signal inference workers to resume experience collection... (13850 times) [2024-06-27 19:52:59,584][06909] InferenceWorker_p0-w0: stopping experience collection (13850 times) [2024-06-27 19:52:59,602][06909] InferenceWorker_p0-w0: resuming experience collection (13850 times) [2024-06-27 19:52:59,872][06909] Updated weights for policy 0, policy_version 65123 (0.0025) [2024-06-27 19:53:03,850][06674] Fps is (10 sec: 42607.1, 60 sec: 43690.7, 300 sec: 43543.5). Total num frames: 1067122688. Throughput: 0: 43741.4. Samples: 970051500. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 19:53:03,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:53:04,102][06909] Updated weights for policy 0, policy_version 65133 (0.0037) [2024-06-27 19:53:07,098][06909] Updated weights for policy 0, policy_version 65143 (0.0031) [2024-06-27 19:53:08,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 1067352064. Throughput: 0: 43699.5. Samples: 970308260. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 19:53:08,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:53:11,368][06909] Updated weights for policy 0, policy_version 65153 (0.0033) [2024-06-27 19:53:13,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43692.2, 300 sec: 43542.6). Total num frames: 1067548672. Throughput: 0: 43620.5. Samples: 970441080. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 19:53:13,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:53:14,792][06909] Updated weights for policy 0, policy_version 65163 (0.0024) [2024-06-27 19:53:18,850][06674] Fps is (10 sec: 42599.3, 60 sec: 43690.8, 300 sec: 43598.1). Total num frames: 1067778048. Throughput: 0: 43734.8. Samples: 970705900. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 19:53:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:53:18,951][06909] Updated weights for policy 0, policy_version 65173 (0.0036) [2024-06-27 19:53:22,256][06909] Updated weights for policy 0, policy_version 65183 (0.0037) [2024-06-27 19:53:23,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43417.7, 300 sec: 43709.2). Total num frames: 1068007424. Throughput: 0: 43817.9. Samples: 970961240. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-27 19:53:23,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 19:53:26,826][06909] Updated weights for policy 0, policy_version 65193 (0.0030) [2024-06-27 19:53:28,850][06674] Fps is (10 sec: 40959.0, 60 sec: 43144.5, 300 sec: 43487.0). Total num frames: 1068187648. Throughput: 0: 43706.2. Samples: 971094040. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-27 19:53:28,851][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:53:29,974][06909] Updated weights for policy 0, policy_version 65203 (0.0032) [2024-06-27 19:53:33,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 1068433408. Throughput: 0: 43881.8. Samples: 971355620. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-27 19:53:33,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 19:53:34,165][06909] Updated weights for policy 0, policy_version 65213 (0.0037) [2024-06-27 19:53:37,393][06909] Updated weights for policy 0, policy_version 65223 (0.0037) [2024-06-27 19:53:38,850][06674] Fps is (10 sec: 45875.9, 60 sec: 43417.7, 300 sec: 43709.2). Total num frames: 1068646400. Throughput: 0: 43974.0. Samples: 971618940. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-27 19:53:38,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:53:41,504][06909] Updated weights for policy 0, policy_version 65233 (0.0044) [2024-06-27 19:53:43,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.8, 300 sec: 43598.1). Total num frames: 1068859392. Throughput: 0: 43808.5. Samples: 971751940. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-27 19:53:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:53:44,704][06909] Updated weights for policy 0, policy_version 65243 (0.0046) [2024-06-27 19:53:48,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 1069088768. Throughput: 0: 43701.3. Samples: 972018060. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-27 19:53:48,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:53:48,912][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000065253_1069105152.pth... [2024-06-27 19:53:48,915][06909] Updated weights for policy 0, policy_version 65253 (0.0040) [2024-06-27 19:53:48,960][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000064613_1058619392.pth [2024-06-27 19:53:52,444][06909] Updated weights for policy 0, policy_version 65263 (0.0035) [2024-06-27 19:53:53,856][06674] Fps is (10 sec: 44209.8, 60 sec: 43414.7, 300 sec: 43763.8). Total num frames: 1069301760. Throughput: 0: 43595.1. Samples: 972270300. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-27 19:53:53,857][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:53:56,463][06909] Updated weights for policy 0, policy_version 65273 (0.0028) [2024-06-27 19:53:58,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 1069514752. Throughput: 0: 43545.2. Samples: 972400620. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-27 19:53:58,851][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:53:59,951][06909] Updated weights for policy 0, policy_version 65283 (0.0038) [2024-06-27 19:54:03,850][06674] Fps is (10 sec: 44263.7, 60 sec: 43690.7, 300 sec: 43653.7). Total num frames: 1069744128. Throughput: 0: 43628.3. Samples: 972669180. Policy #0 lag: (min: 0.0, avg: 12.3, max: 27.0) [2024-06-27 19:54:03,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:54:03,938][06909] Updated weights for policy 0, policy_version 65293 (0.0034) [2024-06-27 19:54:07,566][06909] Updated weights for policy 0, policy_version 65303 (0.0038) [2024-06-27 19:54:08,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 1069957120. Throughput: 0: 43556.7. Samples: 972921300. Policy #0 lag: (min: 0.0, avg: 12.3, max: 27.0) [2024-06-27 19:54:08,851][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:54:11,671][06909] Updated weights for policy 0, policy_version 65313 (0.0035) [2024-06-27 19:54:13,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43690.6, 300 sec: 43542.5). Total num frames: 1070170112. Throughput: 0: 43552.4. Samples: 973053900. Policy #0 lag: (min: 0.0, avg: 12.3, max: 27.0) [2024-06-27 19:54:13,851][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:54:14,598][06887] Signal inference workers to stop experience collection... (13900 times) [2024-06-27 19:54:14,655][06909] InferenceWorker_p0-w0: stopping experience collection (13900 times) [2024-06-27 19:54:14,659][06887] Signal inference workers to resume experience collection... (13900 times) [2024-06-27 19:54:14,671][06909] InferenceWorker_p0-w0: resuming experience collection (13900 times) [2024-06-27 19:54:15,363][06909] Updated weights for policy 0, policy_version 65323 (0.0035) [2024-06-27 19:54:18,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 1070383104. Throughput: 0: 43577.8. Samples: 973316620. Policy #0 lag: (min: 0.0, avg: 12.3, max: 27.0) [2024-06-27 19:54:18,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:54:19,035][06909] Updated weights for policy 0, policy_version 65333 (0.0023) [2024-06-27 19:54:22,767][06909] Updated weights for policy 0, policy_version 65343 (0.0037) [2024-06-27 19:54:23,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43417.5, 300 sec: 43709.2). Total num frames: 1070612480. Throughput: 0: 43499.8. Samples: 973576440. Policy #0 lag: (min: 0.0, avg: 12.3, max: 27.0) [2024-06-27 19:54:23,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:54:26,709][06909] Updated weights for policy 0, policy_version 65353 (0.0028) [2024-06-27 19:54:28,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.8, 300 sec: 43598.1). Total num frames: 1070825472. Throughput: 0: 43422.6. Samples: 973705960. Policy #0 lag: (min: 0.0, avg: 12.3, max: 27.0) [2024-06-27 19:54:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:54:30,144][06909] Updated weights for policy 0, policy_version 65363 (0.0034) [2024-06-27 19:54:33,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43417.5, 300 sec: 43542.6). Total num frames: 1071038464. Throughput: 0: 43356.0. Samples: 973969080. Policy #0 lag: (min: 0.0, avg: 12.3, max: 27.0) [2024-06-27 19:54:33,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:54:34,436][06909] Updated weights for policy 0, policy_version 65373 (0.0028) [2024-06-27 19:54:37,899][06909] Updated weights for policy 0, policy_version 65383 (0.0027) [2024-06-27 19:54:38,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 1071267840. Throughput: 0: 43296.0. Samples: 974218360. Policy #0 lag: (min: 0.0, avg: 12.3, max: 27.0) [2024-06-27 19:54:38,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:54:41,726][06909] Updated weights for policy 0, policy_version 65393 (0.0032) [2024-06-27 19:54:43,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 1071480832. Throughput: 0: 43507.2. Samples: 974358440. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 19:54:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:54:45,318][06909] Updated weights for policy 0, policy_version 65403 (0.0035) [2024-06-27 19:54:48,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43417.6, 300 sec: 43542.6). Total num frames: 1071693824. Throughput: 0: 43458.6. Samples: 974624820. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 19:54:48,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:54:49,269][06909] Updated weights for policy 0, policy_version 65413 (0.0030) [2024-06-27 19:54:52,999][06909] Updated weights for policy 0, policy_version 65423 (0.0033) [2024-06-27 19:54:53,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43695.0, 300 sec: 43764.7). Total num frames: 1071923200. Throughput: 0: 43518.2. Samples: 974879620. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 19:54:53,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:54:56,777][06909] Updated weights for policy 0, policy_version 65433 (0.0045) [2024-06-27 19:54:58,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.8, 300 sec: 43598.4). Total num frames: 1072136192. Throughput: 0: 43535.2. Samples: 975012980. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 19:54:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:55:00,497][06909] Updated weights for policy 0, policy_version 65443 (0.0045) [2024-06-27 19:55:03,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 1072365568. Throughput: 0: 43622.3. Samples: 975279620. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 19:55:03,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:55:04,089][06909] Updated weights for policy 0, policy_version 65453 (0.0031) [2024-06-27 19:55:07,996][06909] Updated weights for policy 0, policy_version 65463 (0.0033) [2024-06-27 19:55:08,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 1072562176. Throughput: 0: 43493.4. Samples: 975533640. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 19:55:08,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:55:11,931][06909] Updated weights for policy 0, policy_version 65473 (0.0029) [2024-06-27 19:55:13,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.8, 300 sec: 43598.1). Total num frames: 1072791552. Throughput: 0: 43602.3. Samples: 975668060. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 19:55:13,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:55:15,292][06909] Updated weights for policy 0, policy_version 65483 (0.0033) [2024-06-27 19:55:18,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 1073004544. Throughput: 0: 43552.5. Samples: 975928940. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 19:55:18,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:55:19,423][06909] Updated weights for policy 0, policy_version 65493 (0.0036) [2024-06-27 19:55:22,798][06909] Updated weights for policy 0, policy_version 65503 (0.0028) [2024-06-27 19:55:23,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43417.7, 300 sec: 43653.7). Total num frames: 1073217536. Throughput: 0: 43692.9. Samples: 976184540. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-27 19:55:23,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:55:27,071][06909] Updated weights for policy 0, policy_version 65513 (0.0031) [2024-06-27 19:55:28,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 1073446912. Throughput: 0: 43593.8. Samples: 976320160. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-27 19:55:28,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:55:30,519][06909] Updated weights for policy 0, policy_version 65523 (0.0039) [2024-06-27 19:55:33,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43417.7, 300 sec: 43542.6). Total num frames: 1073643520. Throughput: 0: 43458.8. Samples: 976580460. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-27 19:55:33,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:55:34,444][06909] Updated weights for policy 0, policy_version 65533 (0.0038) [2024-06-27 19:55:37,818][06909] Updated weights for policy 0, policy_version 65543 (0.0032) [2024-06-27 19:55:38,852][06674] Fps is (10 sec: 42589.6, 60 sec: 43416.1, 300 sec: 43653.3). Total num frames: 1073872896. Throughput: 0: 43749.2. Samples: 976848420. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-27 19:55:38,852][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:55:41,845][06909] Updated weights for policy 0, policy_version 65553 (0.0025) [2024-06-27 19:55:43,850][06674] Fps is (10 sec: 47513.3, 60 sec: 43963.8, 300 sec: 43709.2). Total num frames: 1074118656. Throughput: 0: 43799.2. Samples: 976983940. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-27 19:55:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:55:45,299][06909] Updated weights for policy 0, policy_version 65563 (0.0031) [2024-06-27 19:55:48,850][06674] Fps is (10 sec: 44246.2, 60 sec: 43690.7, 300 sec: 43542.6). Total num frames: 1074315264. Throughput: 0: 43768.4. Samples: 977249200. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-27 19:55:48,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:55:48,923][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000065572_1074331648.pth... [2024-06-27 19:55:48,974][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000064933_1063862272.pth [2024-06-27 19:55:49,579][06909] Updated weights for policy 0, policy_version 65573 (0.0024) [2024-06-27 19:55:49,596][06887] Signal inference workers to stop experience collection... (13950 times) [2024-06-27 19:55:49,596][06887] Signal inference workers to resume experience collection... (13950 times) [2024-06-27 19:55:49,616][06909] InferenceWorker_p0-w0: stopping experience collection (13950 times) [2024-06-27 19:55:49,617][06909] InferenceWorker_p0-w0: resuming experience collection (13950 times) [2024-06-27 19:55:52,890][06909] Updated weights for policy 0, policy_version 65583 (0.0035) [2024-06-27 19:55:53,852][06674] Fps is (10 sec: 40951.3, 60 sec: 43416.2, 300 sec: 43653.3). Total num frames: 1074528256. Throughput: 0: 43761.2. Samples: 977502980. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-27 19:55:53,852][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:55:57,252][06909] Updated weights for policy 0, policy_version 65593 (0.0025) [2024-06-27 19:55:58,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.8, 300 sec: 43653.9). Total num frames: 1074774016. Throughput: 0: 43703.5. Samples: 977634720. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-27 19:55:58,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:56:00,226][06909] Updated weights for policy 0, policy_version 65603 (0.0035) [2024-06-27 19:56:03,850][06674] Fps is (10 sec: 42607.6, 60 sec: 43144.5, 300 sec: 43542.6). Total num frames: 1074954240. Throughput: 0: 43725.9. Samples: 977896600. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-27 19:56:03,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:56:04,617][06909] Updated weights for policy 0, policy_version 65613 (0.0038) [2024-06-27 19:56:08,070][06909] Updated weights for policy 0, policy_version 65623 (0.0037) [2024-06-27 19:56:08,850][06674] Fps is (10 sec: 39321.6, 60 sec: 43417.7, 300 sec: 43487.0). Total num frames: 1075167232. Throughput: 0: 43693.3. Samples: 978150740. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-27 19:56:08,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:56:12,061][06909] Updated weights for policy 0, policy_version 65633 (0.0033) [2024-06-27 19:56:13,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43690.7, 300 sec: 43653.7). Total num frames: 1075412992. Throughput: 0: 43696.9. Samples: 978286520. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-27 19:56:13,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:56:15,414][06909] Updated weights for policy 0, policy_version 65643 (0.0032) [2024-06-27 19:56:18,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43417.6, 300 sec: 43542.6). Total num frames: 1075609600. Throughput: 0: 43679.8. Samples: 978546060. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-27 19:56:18,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:56:19,680][06909] Updated weights for policy 0, policy_version 65653 (0.0033) [2024-06-27 19:56:23,647][06909] Updated weights for policy 0, policy_version 65663 (0.0023) [2024-06-27 19:56:23,850][06674] Fps is (10 sec: 40959.3, 60 sec: 43417.5, 300 sec: 43598.1). Total num frames: 1075822592. Throughput: 0: 43501.0. Samples: 978805880. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-27 19:56:23,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:56:27,431][06909] Updated weights for policy 0, policy_version 65673 (0.0035) [2024-06-27 19:56:28,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 1076068352. Throughput: 0: 43380.4. Samples: 978936060. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-27 19:56:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:56:31,038][06909] Updated weights for policy 0, policy_version 65683 (0.0031) [2024-06-27 19:56:33,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.5, 300 sec: 43487.0). Total num frames: 1076264960. Throughput: 0: 43163.0. Samples: 979191540. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-27 19:56:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:56:34,909][06909] Updated weights for policy 0, policy_version 65693 (0.0040) [2024-06-27 19:56:38,378][06909] Updated weights for policy 0, policy_version 65703 (0.0031) [2024-06-27 19:56:38,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43419.1, 300 sec: 43598.1). Total num frames: 1076477952. Throughput: 0: 43322.4. Samples: 979452400. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-27 19:56:38,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:56:42,345][06909] Updated weights for policy 0, policy_version 65713 (0.0033) [2024-06-27 19:56:43,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 1076723712. Throughput: 0: 43374.3. Samples: 979586560. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-27 19:56:43,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:56:45,684][06909] Updated weights for policy 0, policy_version 65723 (0.0025) [2024-06-27 19:56:48,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43417.6, 300 sec: 43542.9). Total num frames: 1076920320. Throughput: 0: 43349.7. Samples: 979847340. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-27 19:56:48,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:56:49,755][06909] Updated weights for policy 0, policy_version 65733 (0.0027) [2024-06-27 19:56:53,549][06909] Updated weights for policy 0, policy_version 65743 (0.0031) [2024-06-27 19:56:53,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43419.1, 300 sec: 43542.6). Total num frames: 1077133312. Throughput: 0: 43514.6. Samples: 980108900. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-27 19:56:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:56:57,409][06909] Updated weights for policy 0, policy_version 65753 (0.0039) [2024-06-27 19:56:58,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43417.5, 300 sec: 43653.6). Total num frames: 1077379072. Throughput: 0: 43426.1. Samples: 980240700. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-27 19:56:58,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 19:57:01,120][06909] Updated weights for policy 0, policy_version 65763 (0.0030) [2024-06-27 19:57:01,754][06887] Signal inference workers to stop experience collection... (14000 times) [2024-06-27 19:57:01,803][06887] Signal inference workers to resume experience collection... (14000 times) [2024-06-27 19:57:01,804][06909] InferenceWorker_p0-w0: stopping experience collection (14000 times) [2024-06-27 19:57:01,818][06909] InferenceWorker_p0-w0: resuming experience collection (14000 times) [2024-06-27 19:57:03,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.6, 300 sec: 43542.6). Total num frames: 1077575680. Throughput: 0: 43484.9. Samples: 980502880. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-27 19:57:03,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 19:57:05,174][06909] Updated weights for policy 0, policy_version 65773 (0.0036) [2024-06-27 19:57:08,692][06909] Updated weights for policy 0, policy_version 65783 (0.0036) [2024-06-27 19:57:08,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43690.5, 300 sec: 43598.4). Total num frames: 1077788672. Throughput: 0: 43588.4. Samples: 980767360. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-27 19:57:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:57:12,438][06909] Updated weights for policy 0, policy_version 65793 (0.0027) [2024-06-27 19:57:13,850][06674] Fps is (10 sec: 47513.8, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 1078050816. Throughput: 0: 43726.2. Samples: 980903740. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-27 19:57:13,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 19:57:15,965][06909] Updated weights for policy 0, policy_version 65803 (0.0029) [2024-06-27 19:57:18,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 1078214656. Throughput: 0: 43800.5. Samples: 981162560. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-27 19:57:18,854][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:57:20,113][06909] Updated weights for policy 0, policy_version 65813 (0.0027) [2024-06-27 19:57:23,288][06909] Updated weights for policy 0, policy_version 65823 (0.0036) [2024-06-27 19:57:23,850][06674] Fps is (10 sec: 39321.3, 60 sec: 43690.7, 300 sec: 43542.6). Total num frames: 1078444032. Throughput: 0: 43584.9. Samples: 981413720. Policy #0 lag: (min: 0.0, avg: 13.0, max: 22.0) [2024-06-27 19:57:23,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 19:57:27,412][06909] Updated weights for policy 0, policy_version 65833 (0.0038) [2024-06-27 19:57:28,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43417.5, 300 sec: 43598.1). Total num frames: 1078673408. Throughput: 0: 43639.0. Samples: 981550320. Policy #0 lag: (min: 0.0, avg: 13.0, max: 22.0) [2024-06-27 19:57:28,854][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:57:31,159][06909] Updated weights for policy 0, policy_version 65843 (0.0032) [2024-06-27 19:57:33,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43417.5, 300 sec: 43487.0). Total num frames: 1078870016. Throughput: 0: 43539.0. Samples: 981806600. Policy #0 lag: (min: 0.0, avg: 13.0, max: 22.0) [2024-06-27 19:57:33,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 19:57:34,905][06909] Updated weights for policy 0, policy_version 65853 (0.0044) [2024-06-27 19:57:38,397][06909] Updated weights for policy 0, policy_version 65863 (0.0028) [2024-06-27 19:57:38,852][06674] Fps is (10 sec: 42589.9, 60 sec: 43689.2, 300 sec: 43653.4). Total num frames: 1079099392. Throughput: 0: 43671.4. Samples: 982074200. Policy #0 lag: (min: 0.0, avg: 13.0, max: 22.0) [2024-06-27 19:57:38,852][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:57:42,471][06909] Updated weights for policy 0, policy_version 65873 (0.0027) [2024-06-27 19:57:43,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43417.5, 300 sec: 43598.1). Total num frames: 1079328768. Throughput: 0: 43779.1. Samples: 982210760. Policy #0 lag: (min: 0.0, avg: 13.0, max: 22.0) [2024-06-27 19:57:43,856][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:57:46,020][06909] Updated weights for policy 0, policy_version 65883 (0.0030) [2024-06-27 19:57:48,850][06674] Fps is (10 sec: 42606.4, 60 sec: 43417.5, 300 sec: 43487.3). Total num frames: 1079525376. Throughput: 0: 43647.8. Samples: 982467040. Policy #0 lag: (min: 0.0, avg: 13.0, max: 22.0) [2024-06-27 19:57:48,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:57:48,865][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000065889_1079525376.pth... [2024-06-27 19:57:48,910][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000065253_1069105152.pth [2024-06-27 19:57:50,220][06909] Updated weights for policy 0, policy_version 65893 (0.0030) [2024-06-27 19:57:53,564][06909] Updated weights for policy 0, policy_version 65903 (0.0021) [2024-06-27 19:57:53,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 43653.6). Total num frames: 1079771136. Throughput: 0: 43633.0. Samples: 982730840. Policy #0 lag: (min: 0.0, avg: 13.0, max: 22.0) [2024-06-27 19:57:53,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:57:57,745][06909] Updated weights for policy 0, policy_version 65913 (0.0043) [2024-06-27 19:57:58,856][06674] Fps is (10 sec: 45848.1, 60 sec: 43413.2, 300 sec: 43597.2). Total num frames: 1079984128. Throughput: 0: 43557.6. Samples: 982864100. Policy #0 lag: (min: 0.0, avg: 13.0, max: 22.0) [2024-06-27 19:57:58,856][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:58:01,018][06909] Updated weights for policy 0, policy_version 65923 (0.0030) [2024-06-27 19:58:03,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.6, 300 sec: 43542.6). Total num frames: 1080197120. Throughput: 0: 43499.9. Samples: 983120060. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 19:58:03,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 19:58:05,245][06909] Updated weights for policy 0, policy_version 65933 (0.0039) [2024-06-27 19:58:08,337][06909] Updated weights for policy 0, policy_version 65943 (0.0046) [2024-06-27 19:58:08,850][06674] Fps is (10 sec: 44263.3, 60 sec: 43963.8, 300 sec: 43653.6). Total num frames: 1080426496. Throughput: 0: 43731.1. Samples: 983381620. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 19:58:08,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 19:58:12,722][06909] Updated weights for policy 0, policy_version 65953 (0.0028) [2024-06-27 19:58:13,850][06674] Fps is (10 sec: 44237.6, 60 sec: 43144.6, 300 sec: 43598.1). Total num frames: 1080639488. Throughput: 0: 43742.8. Samples: 983518740. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 19:58:13,850][06674] Avg episode reward: [(0, '0.395')] [2024-06-27 19:58:15,617][06909] Updated weights for policy 0, policy_version 65963 (0.0035) [2024-06-27 19:58:18,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43690.7, 300 sec: 43487.0). Total num frames: 1080836096. Throughput: 0: 43769.0. Samples: 983776200. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 19:58:18,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 19:58:20,054][06909] Updated weights for policy 0, policy_version 65973 (0.0034) [2024-06-27 19:58:20,867][06887] Signal inference workers to stop experience collection... (14050 times) [2024-06-27 19:58:20,893][06909] InferenceWorker_p0-w0: stopping experience collection (14050 times) [2024-06-27 19:58:20,928][06887] Signal inference workers to resume experience collection... (14050 times) [2024-06-27 19:58:20,928][06909] InferenceWorker_p0-w0: resuming experience collection (14050 times) [2024-06-27 19:58:23,343][06909] Updated weights for policy 0, policy_version 65983 (0.0032) [2024-06-27 19:58:23,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.8, 300 sec: 43709.2). Total num frames: 1081081856. Throughput: 0: 43620.3. Samples: 984037020. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 19:58:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 19:58:27,505][06909] Updated weights for policy 0, policy_version 65993 (0.0030) [2024-06-27 19:58:28,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 1081294848. Throughput: 0: 43615.6. Samples: 984173460. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 19:58:28,850][06674] Avg episode reward: [(0, '0.400')] [2024-06-27 19:58:30,954][06909] Updated weights for policy 0, policy_version 66003 (0.0027) [2024-06-27 19:58:33,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43963.8, 300 sec: 43598.1). Total num frames: 1081507840. Throughput: 0: 43553.4. Samples: 984426940. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 19:58:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:58:35,301][06909] Updated weights for policy 0, policy_version 66013 (0.0030) [2024-06-27 19:58:38,337][06909] Updated weights for policy 0, policy_version 66023 (0.0048) [2024-06-27 19:58:38,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43965.2, 300 sec: 43653.6). Total num frames: 1081737216. Throughput: 0: 43510.7. Samples: 984688820. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 19:58:38,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:58:42,628][06909] Updated weights for policy 0, policy_version 66033 (0.0038) [2024-06-27 19:58:43,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43690.8, 300 sec: 43598.1). Total num frames: 1081950208. Throughput: 0: 43483.2. Samples: 984820580. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 19:58:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:58:46,255][06909] Updated weights for policy 0, policy_version 66043 (0.0028) [2024-06-27 19:58:48,850][06674] Fps is (10 sec: 39321.6, 60 sec: 43417.7, 300 sec: 43487.9). Total num frames: 1082130432. Throughput: 0: 43541.3. Samples: 985079420. Policy #0 lag: (min: 0.0, avg: 11.2, max: 25.0) [2024-06-27 19:58:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 19:58:50,108][06909] Updated weights for policy 0, policy_version 66053 (0.0033) [2024-06-27 19:58:53,668][06909] Updated weights for policy 0, policy_version 66063 (0.0026) [2024-06-27 19:58:53,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.8, 300 sec: 43653.7). Total num frames: 1082392576. Throughput: 0: 43519.7. Samples: 985340000. Policy #0 lag: (min: 0.0, avg: 11.2, max: 25.0) [2024-06-27 19:58:53,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:58:57,696][06909] Updated weights for policy 0, policy_version 66073 (0.0038) [2024-06-27 19:58:58,850][06674] Fps is (10 sec: 47514.2, 60 sec: 43695.1, 300 sec: 43598.1). Total num frames: 1082605568. Throughput: 0: 43602.2. Samples: 985480840. Policy #0 lag: (min: 0.0, avg: 11.2, max: 25.0) [2024-06-27 19:58:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:59:00,817][06909] Updated weights for policy 0, policy_version 66083 (0.0033) [2024-06-27 19:59:03,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 1082818560. Throughput: 0: 43606.7. Samples: 985738500. Policy #0 lag: (min: 0.0, avg: 11.2, max: 25.0) [2024-06-27 19:59:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:59:04,963][06909] Updated weights for policy 0, policy_version 66093 (0.0031) [2024-06-27 19:59:08,067][06909] Updated weights for policy 0, policy_version 66103 (0.0031) [2024-06-27 19:59:08,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43690.7, 300 sec: 43653.7). Total num frames: 1083047936. Throughput: 0: 43650.6. Samples: 986001300. Policy #0 lag: (min: 0.0, avg: 11.2, max: 25.0) [2024-06-27 19:59:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 19:59:12,414][06909] Updated weights for policy 0, policy_version 66113 (0.0031) [2024-06-27 19:59:13,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 1083244544. Throughput: 0: 43648.1. Samples: 986137620. Policy #0 lag: (min: 0.0, avg: 11.2, max: 25.0) [2024-06-27 19:59:13,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 19:59:16,019][06909] Updated weights for policy 0, policy_version 66123 (0.0022) [2024-06-27 19:59:18,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.7, 300 sec: 43598.1). Total num frames: 1083473920. Throughput: 0: 43687.5. Samples: 986392880. Policy #0 lag: (min: 0.0, avg: 11.2, max: 25.0) [2024-06-27 19:59:18,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:59:20,040][06909] Updated weights for policy 0, policy_version 66133 (0.0035) [2024-06-27 19:59:23,511][06909] Updated weights for policy 0, policy_version 66143 (0.0027) [2024-06-27 19:59:23,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43690.7, 300 sec: 43653.7). Total num frames: 1083703296. Throughput: 0: 43750.8. Samples: 986657600. Policy #0 lag: (min: 0.0, avg: 11.2, max: 25.0) [2024-06-27 19:59:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 19:59:27,585][06909] Updated weights for policy 0, policy_version 66153 (0.0030) [2024-06-27 19:59:28,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43417.7, 300 sec: 43598.1). Total num frames: 1083899904. Throughput: 0: 43693.4. Samples: 986786780. Policy #0 lag: (min: 0.0, avg: 11.1, max: 20.0) [2024-06-27 19:59:28,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 19:59:31,058][06909] Updated weights for policy 0, policy_version 66163 (0.0028) [2024-06-27 19:59:33,850][06674] Fps is (10 sec: 42597.6, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 1084129280. Throughput: 0: 43676.8. Samples: 987044880. Policy #0 lag: (min: 0.0, avg: 11.1, max: 20.0) [2024-06-27 19:59:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 19:59:34,955][06909] Updated weights for policy 0, policy_version 66173 (0.0032) [2024-06-27 19:59:38,393][06909] Updated weights for policy 0, policy_version 66183 (0.0041) [2024-06-27 19:59:38,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43417.7, 300 sec: 43598.1). Total num frames: 1084342272. Throughput: 0: 43784.9. Samples: 987310320. Policy #0 lag: (min: 0.0, avg: 11.1, max: 20.0) [2024-06-27 19:59:38,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:59:42,187][06909] Updated weights for policy 0, policy_version 66193 (0.0040) [2024-06-27 19:59:43,850][06674] Fps is (10 sec: 40960.8, 60 sec: 43144.6, 300 sec: 43542.6). Total num frames: 1084538880. Throughput: 0: 43624.4. Samples: 987443940. Policy #0 lag: (min: 0.0, avg: 11.1, max: 20.0) [2024-06-27 19:59:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 19:59:45,698][06909] Updated weights for policy 0, policy_version 66203 (0.0023) [2024-06-27 19:59:48,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.7, 300 sec: 43542.6). Total num frames: 1084768256. Throughput: 0: 43645.7. Samples: 987702560. Policy #0 lag: (min: 0.0, avg: 11.1, max: 20.0) [2024-06-27 19:59:48,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:59:48,898][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000066210_1084784640.pth... [2024-06-27 19:59:48,951][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000065572_1074331648.pth [2024-06-27 19:59:49,841][06909] Updated weights for policy 0, policy_version 66213 (0.0030) [2024-06-27 19:59:51,520][06887] Signal inference workers to stop experience collection... (14100 times) [2024-06-27 19:59:51,554][06909] InferenceWorker_p0-w0: stopping experience collection (14100 times) [2024-06-27 19:59:51,573][06887] Signal inference workers to resume experience collection... (14100 times) [2024-06-27 19:59:51,573][06909] InferenceWorker_p0-w0: resuming experience collection (14100 times) [2024-06-27 19:59:53,766][06909] Updated weights for policy 0, policy_version 66223 (0.0035) [2024-06-27 19:59:53,852][06674] Fps is (10 sec: 45865.5, 60 sec: 43416.1, 300 sec: 43597.8). Total num frames: 1084997632. Throughput: 0: 43654.1. Samples: 987965820. Policy #0 lag: (min: 0.0, avg: 11.1, max: 20.0) [2024-06-27 19:59:53,852][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 19:59:57,715][06909] Updated weights for policy 0, policy_version 66233 (0.0042) [2024-06-27 19:59:58,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43144.5, 300 sec: 43487.0). Total num frames: 1085194240. Throughput: 0: 43505.3. Samples: 988095360. Policy #0 lag: (min: 0.0, avg: 11.1, max: 20.0) [2024-06-27 19:59:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:00:01,030][06909] Updated weights for policy 0, policy_version 66243 (0.0035) [2024-06-27 20:00:03,850][06674] Fps is (10 sec: 42607.1, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 1085423616. Throughput: 0: 43502.3. Samples: 988350480. Policy #0 lag: (min: 0.0, avg: 11.1, max: 20.0) [2024-06-27 20:00:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:00:05,079][06909] Updated weights for policy 0, policy_version 66253 (0.0041) [2024-06-27 20:00:08,574][06909] Updated weights for policy 0, policy_version 66263 (0.0025) [2024-06-27 20:00:08,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 1085652992. Throughput: 0: 43496.3. Samples: 988614940. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 20:00:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:00:12,359][06909] Updated weights for policy 0, policy_version 66273 (0.0041) [2024-06-27 20:00:13,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.6, 300 sec: 43653.6). Total num frames: 1085882368. Throughput: 0: 43668.8. Samples: 988751880. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 20:00:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:00:15,822][06909] Updated weights for policy 0, policy_version 66283 (0.0031) [2024-06-27 20:00:18,852][06674] Fps is (10 sec: 42589.9, 60 sec: 43416.2, 300 sec: 43597.8). Total num frames: 1086078976. Throughput: 0: 43730.6. Samples: 989012840. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 20:00:18,852][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:00:19,661][06909] Updated weights for policy 0, policy_version 66293 (0.0034) [2024-06-27 20:00:23,229][06909] Updated weights for policy 0, policy_version 66303 (0.0043) [2024-06-27 20:00:23,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 1086324736. Throughput: 0: 43458.2. Samples: 989265940. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 20:00:23,853][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:00:27,173][06909] Updated weights for policy 0, policy_version 66313 (0.0030) [2024-06-27 20:00:28,850][06674] Fps is (10 sec: 42606.9, 60 sec: 43417.5, 300 sec: 43598.1). Total num frames: 1086504960. Throughput: 0: 43635.4. Samples: 989407540. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 20:00:28,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:00:30,925][06909] Updated weights for policy 0, policy_version 66323 (0.0039) [2024-06-27 20:00:33,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43417.7, 300 sec: 43598.4). Total num frames: 1086734336. Throughput: 0: 43621.0. Samples: 989665500. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 20:00:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:00:35,334][06909] Updated weights for policy 0, policy_version 66333 (0.0026) [2024-06-27 20:00:38,679][06909] Updated weights for policy 0, policy_version 66343 (0.0039) [2024-06-27 20:00:38,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43690.6, 300 sec: 43542.5). Total num frames: 1086963712. Throughput: 0: 43507.3. Samples: 989923560. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 20:00:38,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:00:42,578][06909] Updated weights for policy 0, policy_version 66353 (0.0027) [2024-06-27 20:00:43,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 43598.1). Total num frames: 1087176704. Throughput: 0: 43558.2. Samples: 990055480. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 20:00:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:00:46,218][06909] Updated weights for policy 0, policy_version 66363 (0.0038) [2024-06-27 20:00:48,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.7, 300 sec: 43598.4). Total num frames: 1087389696. Throughput: 0: 43711.5. Samples: 990317500. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-27 20:00:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:00:49,990][06909] Updated weights for policy 0, policy_version 66373 (0.0034) [2024-06-27 20:00:53,470][06909] Updated weights for policy 0, policy_version 66383 (0.0032) [2024-06-27 20:00:53,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43692.2, 300 sec: 43542.6). Total num frames: 1087619072. Throughput: 0: 43610.8. Samples: 990577420. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-27 20:00:53,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:00:57,354][06909] Updated weights for policy 0, policy_version 66393 (0.0030) [2024-06-27 20:00:58,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44236.8, 300 sec: 43709.2). Total num frames: 1087848448. Throughput: 0: 43700.5. Samples: 990718400. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-27 20:00:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:01:00,796][06909] Updated weights for policy 0, policy_version 66403 (0.0022) [2024-06-27 20:01:03,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 1088045056. Throughput: 0: 43616.6. Samples: 990975500. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-27 20:01:03,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:01:05,003][06909] Updated weights for policy 0, policy_version 66413 (0.0034) [2024-06-27 20:01:08,526][06909] Updated weights for policy 0, policy_version 66423 (0.0033) [2024-06-27 20:01:08,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 1088274432. Throughput: 0: 43680.0. Samples: 991231540. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-27 20:01:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:01:12,747][06909] Updated weights for policy 0, policy_version 66433 (0.0034) [2024-06-27 20:01:13,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43144.7, 300 sec: 43598.1). Total num frames: 1088471040. Throughput: 0: 43435.3. Samples: 991362120. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-27 20:01:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:01:13,969][06887] Signal inference workers to stop experience collection... (14150 times) [2024-06-27 20:01:13,969][06887] Signal inference workers to resume experience collection... (14150 times) [2024-06-27 20:01:13,990][06909] InferenceWorker_p0-w0: stopping experience collection (14150 times) [2024-06-27 20:01:14,024][06909] InferenceWorker_p0-w0: resuming experience collection (14150 times) [2024-06-27 20:01:16,002][06909] Updated weights for policy 0, policy_version 66443 (0.0041) [2024-06-27 20:01:18,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43965.2, 300 sec: 43709.2). Total num frames: 1088716800. Throughput: 0: 43684.9. Samples: 991631320. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-27 20:01:18,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:01:20,086][06909] Updated weights for policy 0, policy_version 66453 (0.0036) [2024-06-27 20:01:23,561][06909] Updated weights for policy 0, policy_version 66463 (0.0039) [2024-06-27 20:01:23,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 1088929792. Throughput: 0: 43593.8. Samples: 991885280. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-27 20:01:23,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:01:27,440][06909] Updated weights for policy 0, policy_version 66473 (0.0036) [2024-06-27 20:01:28,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.8, 300 sec: 43653.6). Total num frames: 1089142784. Throughput: 0: 43821.3. Samples: 992027440. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-27 20:01:28,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:01:30,745][06909] Updated weights for policy 0, policy_version 66483 (0.0038) [2024-06-27 20:01:33,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 1089355776. Throughput: 0: 43896.5. Samples: 992292840. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 20:01:33,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:01:34,769][06909] Updated weights for policy 0, policy_version 66493 (0.0037) [2024-06-27 20:01:38,432][06909] Updated weights for policy 0, policy_version 66503 (0.0039) [2024-06-27 20:01:38,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 1089585152. Throughput: 0: 43779.8. Samples: 992547520. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 20:01:38,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:01:42,437][06909] Updated weights for policy 0, policy_version 66513 (0.0035) [2024-06-27 20:01:43,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.8, 300 sec: 43709.2). Total num frames: 1089814528. Throughput: 0: 43617.8. Samples: 992681200. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 20:01:43,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:01:45,889][06909] Updated weights for policy 0, policy_version 66523 (0.0032) [2024-06-27 20:01:48,850][06674] Fps is (10 sec: 44237.7, 60 sec: 43963.8, 300 sec: 43709.2). Total num frames: 1090027520. Throughput: 0: 43714.8. Samples: 992942660. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 20:01:48,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:01:48,954][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000066531_1090043904.pth... [2024-06-27 20:01:49,009][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000065889_1079525376.pth [2024-06-27 20:01:49,873][06909] Updated weights for policy 0, policy_version 66533 (0.0039) [2024-06-27 20:01:53,764][06909] Updated weights for policy 0, policy_version 66543 (0.0019) [2024-06-27 20:01:53,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 1090240512. Throughput: 0: 43947.6. Samples: 993209180. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 20:01:53,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:01:57,390][06909] Updated weights for policy 0, policy_version 66553 (0.0024) [2024-06-27 20:01:58,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43144.5, 300 sec: 43598.1). Total num frames: 1090437120. Throughput: 0: 43903.9. Samples: 993337800. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 20:01:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:02:01,085][06909] Updated weights for policy 0, policy_version 66563 (0.0028) [2024-06-27 20:02:03,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 1090682880. Throughput: 0: 43757.2. Samples: 993600400. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 20:02:03,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:02:04,688][06909] Updated weights for policy 0, policy_version 66573 (0.0036) [2024-06-27 20:02:08,528][06909] Updated weights for policy 0, policy_version 66583 (0.0028) [2024-06-27 20:02:08,850][06674] Fps is (10 sec: 45874.5, 60 sec: 43690.6, 300 sec: 43542.5). Total num frames: 1090895872. Throughput: 0: 43932.3. Samples: 993862240. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 20:02:08,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:02:12,182][06909] Updated weights for policy 0, policy_version 66593 (0.0036) [2024-06-27 20:02:13,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.6, 300 sec: 43709.2). Total num frames: 1091108864. Throughput: 0: 43657.8. Samples: 993992040. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 20:02:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:02:16,011][06909] Updated weights for policy 0, policy_version 66603 (0.0042) [2024-06-27 20:02:18,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 1091338240. Throughput: 0: 43552.7. Samples: 994252720. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 20:02:18,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:02:20,227][06909] Updated weights for policy 0, policy_version 66613 (0.0027) [2024-06-27 20:02:23,771][06909] Updated weights for policy 0, policy_version 66623 (0.0030) [2024-06-27 20:02:23,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43690.7, 300 sec: 43653.7). Total num frames: 1091551232. Throughput: 0: 43752.6. Samples: 994516380. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 20:02:23,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:02:27,690][06909] Updated weights for policy 0, policy_version 66633 (0.0036) [2024-06-27 20:02:28,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 1091764224. Throughput: 0: 43573.8. Samples: 994642020. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 20:02:28,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:02:31,217][06909] Updated weights for policy 0, policy_version 66643 (0.0035) [2024-06-27 20:02:33,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 43709.5). Total num frames: 1091993600. Throughput: 0: 43613.8. Samples: 994905280. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 20:02:33,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:02:35,151][06909] Updated weights for policy 0, policy_version 66653 (0.0039) [2024-06-27 20:02:38,685][06909] Updated weights for policy 0, policy_version 66663 (0.0042) [2024-06-27 20:02:38,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.8, 300 sec: 43653.7). Total num frames: 1092206592. Throughput: 0: 43460.8. Samples: 995164920. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 20:02:38,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:02:42,388][06909] Updated weights for policy 0, policy_version 66673 (0.0036) [2024-06-27 20:02:43,852][06674] Fps is (10 sec: 40951.5, 60 sec: 43143.1, 300 sec: 43653.4). Total num frames: 1092403200. Throughput: 0: 43606.5. Samples: 995300180. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 20:02:43,852][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:02:46,303][06909] Updated weights for policy 0, policy_version 66683 (0.0027) [2024-06-27 20:02:48,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43417.5, 300 sec: 43598.1). Total num frames: 1092632576. Throughput: 0: 43527.5. Samples: 995559140. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 20:02:48,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:02:50,239][06909] Updated weights for policy 0, policy_version 66693 (0.0033) [2024-06-27 20:02:52,192][06887] Signal inference workers to stop experience collection... (14200 times) [2024-06-27 20:02:52,198][06887] Signal inference workers to resume experience collection... (14200 times) [2024-06-27 20:02:52,212][06909] InferenceWorker_p0-w0: stopping experience collection (14200 times) [2024-06-27 20:02:52,212][06909] InferenceWorker_p0-w0: resuming experience collection (14200 times) [2024-06-27 20:02:53,723][06909] Updated weights for policy 0, policy_version 66703 (0.0031) [2024-06-27 20:02:53,850][06674] Fps is (10 sec: 45884.8, 60 sec: 43690.7, 300 sec: 43654.5). Total num frames: 1092861952. Throughput: 0: 43536.7. Samples: 995821380. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-27 20:02:53,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:02:57,487][06909] Updated weights for policy 0, policy_version 66713 (0.0040) [2024-06-27 20:02:58,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.7, 300 sec: 43653.6). Total num frames: 1093074944. Throughput: 0: 43650.2. Samples: 995956300. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-27 20:02:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:03:01,542][06909] Updated weights for policy 0, policy_version 66723 (0.0034) [2024-06-27 20:03:03,852][06674] Fps is (10 sec: 44227.5, 60 sec: 43689.3, 300 sec: 43653.4). Total num frames: 1093304320. Throughput: 0: 43573.3. Samples: 996213600. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-27 20:03:03,852][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:03:04,860][06909] Updated weights for policy 0, policy_version 66733 (0.0026) [2024-06-27 20:03:08,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43417.7, 300 sec: 43598.1). Total num frames: 1093500928. Throughput: 0: 43649.3. Samples: 996480600. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-27 20:03:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:03:08,886][06909] Updated weights for policy 0, policy_version 66743 (0.0031) [2024-06-27 20:03:12,834][06909] Updated weights for policy 0, policy_version 66753 (0.0028) [2024-06-27 20:03:13,850][06674] Fps is (10 sec: 40968.6, 60 sec: 43417.7, 300 sec: 43653.6). Total num frames: 1093713920. Throughput: 0: 43620.9. Samples: 996604960. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-27 20:03:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:03:16,249][06909] Updated weights for policy 0, policy_version 66763 (0.0030) [2024-06-27 20:03:18,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43690.8, 300 sec: 43653.6). Total num frames: 1093959680. Throughput: 0: 43697.7. Samples: 996871680. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-27 20:03:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:03:20,258][06909] Updated weights for policy 0, policy_version 66773 (0.0042) [2024-06-27 20:03:23,568][06909] Updated weights for policy 0, policy_version 66783 (0.0033) [2024-06-27 20:03:23,850][06674] Fps is (10 sec: 45874.2, 60 sec: 43690.5, 300 sec: 43653.6). Total num frames: 1094172672. Throughput: 0: 43631.0. Samples: 997128320. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-27 20:03:23,851][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:03:27,594][06909] Updated weights for policy 0, policy_version 66793 (0.0035) [2024-06-27 20:03:28,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 1094369280. Throughput: 0: 43538.8. Samples: 997259340. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-27 20:03:28,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 20:03:31,136][06909] Updated weights for policy 0, policy_version 66803 (0.0030) [2024-06-27 20:03:33,850][06674] Fps is (10 sec: 44237.7, 60 sec: 43690.7, 300 sec: 43653.7). Total num frames: 1094615040. Throughput: 0: 43531.3. Samples: 997518040. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 20:03:33,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:03:35,306][06909] Updated weights for policy 0, policy_version 66813 (0.0031) [2024-06-27 20:03:38,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 1094811648. Throughput: 0: 43674.6. Samples: 997786740. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 20:03:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:03:39,047][06909] Updated weights for policy 0, policy_version 66823 (0.0035) [2024-06-27 20:03:42,796][06909] Updated weights for policy 0, policy_version 66833 (0.0036) [2024-06-27 20:03:43,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43692.1, 300 sec: 43709.2). Total num frames: 1095024640. Throughput: 0: 43676.9. Samples: 997921760. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 20:03:43,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:03:46,536][06909] Updated weights for policy 0, policy_version 66843 (0.0029) [2024-06-27 20:03:48,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43963.9, 300 sec: 43653.6). Total num frames: 1095270400. Throughput: 0: 43582.4. Samples: 998174720. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 20:03:48,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:03:48,859][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000066850_1095270400.pth... [2024-06-27 20:03:48,918][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000066210_1084784640.pth [2024-06-27 20:03:50,476][06909] Updated weights for policy 0, policy_version 66853 (0.0023) [2024-06-27 20:03:53,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43417.5, 300 sec: 43598.1). Total num frames: 1095467008. Throughput: 0: 43526.7. Samples: 998439300. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 20:03:53,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:03:53,999][06909] Updated weights for policy 0, policy_version 66863 (0.0022) [2024-06-27 20:03:57,793][06909] Updated weights for policy 0, policy_version 66873 (0.0034) [2024-06-27 20:03:58,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43417.7, 300 sec: 43598.1). Total num frames: 1095680000. Throughput: 0: 43540.8. Samples: 998564300. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 20:03:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 20:04:01,375][06909] Updated weights for policy 0, policy_version 66883 (0.0037) [2024-06-27 20:04:03,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43692.1, 300 sec: 43653.6). Total num frames: 1095925760. Throughput: 0: 43652.4. Samples: 998836040. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 20:04:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:04:05,059][06909] Updated weights for policy 0, policy_version 66893 (0.0034) [2024-06-27 20:04:08,781][06909] Updated weights for policy 0, policy_version 66903 (0.0026) [2024-06-27 20:04:08,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.8, 300 sec: 43709.2). Total num frames: 1096138752. Throughput: 0: 43835.3. Samples: 999100900. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 20:04:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:04:10,836][06887] Signal inference workers to stop experience collection... (14250 times) [2024-06-27 20:04:10,836][06887] Signal inference workers to resume experience collection... (14250 times) [2024-06-27 20:04:10,873][06909] InferenceWorker_p0-w0: stopping experience collection (14250 times) [2024-06-27 20:04:10,873][06909] InferenceWorker_p0-w0: resuming experience collection (14250 times) [2024-06-27 20:04:12,751][06909] Updated weights for policy 0, policy_version 66913 (0.0039) [2024-06-27 20:04:13,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.6, 300 sec: 43653.6). Total num frames: 1096351744. Throughput: 0: 43806.2. Samples: 999230620. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 20:04:13,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:04:16,330][06909] Updated weights for policy 0, policy_version 66923 (0.0031) [2024-06-27 20:04:18,852][06674] Fps is (10 sec: 44227.6, 60 sec: 43689.2, 300 sec: 43653.3). Total num frames: 1096581120. Throughput: 0: 44019.7. Samples: 999499020. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 20:04:18,861][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:04:20,016][06909] Updated weights for policy 0, policy_version 66933 (0.0028) [2024-06-27 20:04:23,852][06674] Fps is (10 sec: 42590.0, 60 sec: 43416.2, 300 sec: 43653.3). Total num frames: 1096777728. Throughput: 0: 43730.5. Samples: 999754700. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 20:04:23,852][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:04:24,183][06909] Updated weights for policy 0, policy_version 66943 (0.0038) [2024-06-27 20:04:27,764][06909] Updated weights for policy 0, policy_version 66953 (0.0032) [2024-06-27 20:04:28,850][06674] Fps is (10 sec: 40968.2, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 1096990720. Throughput: 0: 43590.2. Samples: 999883320. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 20:04:28,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:04:31,495][06909] Updated weights for policy 0, policy_version 66963 (0.0031) [2024-06-27 20:04:33,850][06674] Fps is (10 sec: 44245.3, 60 sec: 43417.5, 300 sec: 43653.6). Total num frames: 1097220096. Throughput: 0: 43695.4. Samples: 1000141020. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 20:04:33,854][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 20:04:35,226][06909] Updated weights for policy 0, policy_version 66973 (0.0026) [2024-06-27 20:04:38,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43690.8, 300 sec: 43709.2). Total num frames: 1097433088. Throughput: 0: 43506.3. Samples: 1000397080. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 20:04:38,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:04:38,864][06909] Updated weights for policy 0, policy_version 66983 (0.0027) [2024-06-27 20:04:42,629][06909] Updated weights for policy 0, policy_version 66993 (0.0048) [2024-06-27 20:04:43,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 1097646080. Throughput: 0: 43637.3. Samples: 1000527980. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 20:04:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:04:46,081][06909] Updated weights for policy 0, policy_version 67003 (0.0041) [2024-06-27 20:04:48,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43690.6, 300 sec: 43709.5). Total num frames: 1097891840. Throughput: 0: 43647.5. Samples: 1000800180. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 20:04:48,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:04:49,904][06909] Updated weights for policy 0, policy_version 67013 (0.0036) [2024-06-27 20:04:53,598][06909] Updated weights for policy 0, policy_version 67023 (0.0028) [2024-06-27 20:04:53,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43963.8, 300 sec: 43764.7). Total num frames: 1098104832. Throughput: 0: 43456.0. Samples: 1001056420. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 20:04:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:04:57,250][06909] Updated weights for policy 0, policy_version 67033 (0.0025) [2024-06-27 20:04:58,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 1098301440. Throughput: 0: 43527.6. Samples: 1001189360. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-27 20:04:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:05:01,445][06909] Updated weights for policy 0, policy_version 67043 (0.0045) [2024-06-27 20:05:03,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 1098547200. Throughput: 0: 43461.9. Samples: 1001454720. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-27 20:05:03,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:05:04,963][06909] Updated weights for policy 0, policy_version 67053 (0.0035) [2024-06-27 20:05:08,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43417.5, 300 sec: 43598.1). Total num frames: 1098743808. Throughput: 0: 43529.0. Samples: 1001713420. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-27 20:05:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:05:09,077][06909] Updated weights for policy 0, policy_version 67063 (0.0026) [2024-06-27 20:05:12,257][06909] Updated weights for policy 0, policy_version 67073 (0.0039) [2024-06-27 20:05:13,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.7, 300 sec: 43709.5). Total num frames: 1098973184. Throughput: 0: 43626.3. Samples: 1001846500. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-27 20:05:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:05:16,391][06909] Updated weights for policy 0, policy_version 67083 (0.0027) [2024-06-27 20:05:18,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43692.2, 300 sec: 43653.6). Total num frames: 1099202560. Throughput: 0: 43839.7. Samples: 1002113800. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-27 20:05:18,850][06674] Avg episode reward: [(0, '0.400')] [2024-06-27 20:05:19,529][06909] Updated weights for policy 0, policy_version 67093 (0.0030) [2024-06-27 20:05:23,686][06909] Updated weights for policy 0, policy_version 67103 (0.0038) [2024-06-27 20:05:23,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43965.2, 300 sec: 43764.7). Total num frames: 1099415552. Throughput: 0: 44007.0. Samples: 1002377400. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-27 20:05:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 20:05:25,883][06887] Signal inference workers to stop experience collection... (14300 times) [2024-06-27 20:05:25,883][06887] Signal inference workers to resume experience collection... (14300 times) [2024-06-27 20:05:25,927][06909] InferenceWorker_p0-w0: stopping experience collection (14300 times) [2024-06-27 20:05:25,927][06909] InferenceWorker_p0-w0: resuming experience collection (14300 times) [2024-06-27 20:05:27,290][06909] Updated weights for policy 0, policy_version 67113 (0.0030) [2024-06-27 20:05:28,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.8, 300 sec: 43709.2). Total num frames: 1099628544. Throughput: 0: 43940.5. Samples: 1002505300. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-27 20:05:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:05:31,059][06909] Updated weights for policy 0, policy_version 67123 (0.0041) [2024-06-27 20:05:33,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44236.9, 300 sec: 43764.7). Total num frames: 1099874304. Throughput: 0: 43913.9. Samples: 1002776300. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-27 20:05:33,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:05:34,493][06909] Updated weights for policy 0, policy_version 67133 (0.0033) [2024-06-27 20:05:38,537][06909] Updated weights for policy 0, policy_version 67143 (0.0026) [2024-06-27 20:05:38,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 1100070912. Throughput: 0: 43985.8. Samples: 1003035780. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 20:05:38,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:05:41,966][06909] Updated weights for policy 0, policy_version 67153 (0.0034) [2024-06-27 20:05:43,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43963.8, 300 sec: 43709.2). Total num frames: 1100283904. Throughput: 0: 43810.2. Samples: 1003160820. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 20:05:43,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:05:46,398][06909] Updated weights for policy 0, policy_version 67163 (0.0041) [2024-06-27 20:05:48,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 1100513280. Throughput: 0: 43770.7. Samples: 1003424400. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 20:05:48,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:05:48,862][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000067170_1100513280.pth... [2024-06-27 20:05:48,950][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000066531_1090043904.pth [2024-06-27 20:05:49,981][06909] Updated weights for policy 0, policy_version 67173 (0.0047) [2024-06-27 20:05:53,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 1100709888. Throughput: 0: 43811.7. Samples: 1003684940. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 20:05:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:05:53,962][06909] Updated weights for policy 0, policy_version 67183 (0.0026) [2024-06-27 20:05:57,366][06909] Updated weights for policy 0, policy_version 67193 (0.0039) [2024-06-27 20:05:58,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 1100939264. Throughput: 0: 43645.7. Samples: 1003810560. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 20:05:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 20:06:01,493][06909] Updated weights for policy 0, policy_version 67203 (0.0037) [2024-06-27 20:06:03,850][06674] Fps is (10 sec: 45874.3, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 1101168640. Throughput: 0: 43667.4. Samples: 1004078840. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 20:06:03,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:06:04,762][06909] Updated weights for policy 0, policy_version 67213 (0.0030) [2024-06-27 20:06:08,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 1101365248. Throughput: 0: 43684.5. Samples: 1004343200. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 20:06:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:06:08,950][06909] Updated weights for policy 0, policy_version 67223 (0.0042) [2024-06-27 20:06:12,509][06909] Updated weights for policy 0, policy_version 67233 (0.0032) [2024-06-27 20:06:13,852][06674] Fps is (10 sec: 42590.1, 60 sec: 43689.1, 300 sec: 43653.3). Total num frames: 1101594624. Throughput: 0: 43713.1. Samples: 1004472480. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 20:06:13,852][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:06:16,303][06909] Updated weights for policy 0, policy_version 67243 (0.0035) [2024-06-27 20:06:18,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 1101824000. Throughput: 0: 43569.3. Samples: 1004736920. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-27 20:06:18,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:06:19,960][06909] Updated weights for policy 0, policy_version 67253 (0.0038) [2024-06-27 20:06:23,850][06674] Fps is (10 sec: 42607.0, 60 sec: 43417.6, 300 sec: 43653.6). Total num frames: 1102020608. Throughput: 0: 43516.4. Samples: 1004994020. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-27 20:06:23,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:06:24,280][06909] Updated weights for policy 0, policy_version 67263 (0.0033) [2024-06-27 20:06:27,851][06909] Updated weights for policy 0, policy_version 67273 (0.0033) [2024-06-27 20:06:28,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 1102249984. Throughput: 0: 43585.3. Samples: 1005122160. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-27 20:06:28,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:06:31,986][06909] Updated weights for policy 0, policy_version 67283 (0.0033) [2024-06-27 20:06:33,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43417.5, 300 sec: 43709.2). Total num frames: 1102479360. Throughput: 0: 43553.7. Samples: 1005384320. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-27 20:06:33,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:06:35,235][06909] Updated weights for policy 0, policy_version 67293 (0.0039) [2024-06-27 20:06:38,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 1102675968. Throughput: 0: 43646.6. Samples: 1005649040. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-27 20:06:38,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:06:39,469][06909] Updated weights for policy 0, policy_version 67303 (0.0048) [2024-06-27 20:06:42,542][06909] Updated weights for policy 0, policy_version 67313 (0.0026) [2024-06-27 20:06:43,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 1102905344. Throughput: 0: 43837.4. Samples: 1005783240. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-27 20:06:43,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:06:46,882][06909] Updated weights for policy 0, policy_version 67323 (0.0026) [2024-06-27 20:06:48,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43144.6, 300 sec: 43598.1). Total num frames: 1103101952. Throughput: 0: 43733.9. Samples: 1006046860. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-27 20:06:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 20:06:49,899][06909] Updated weights for policy 0, policy_version 67333 (0.0037) [2024-06-27 20:06:53,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 1103331328. Throughput: 0: 43622.2. Samples: 1006306200. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-27 20:06:53,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:06:54,212][06909] Updated weights for policy 0, policy_version 67343 (0.0028) [2024-06-27 20:06:57,747][06909] Updated weights for policy 0, policy_version 67353 (0.0029) [2024-06-27 20:06:58,850][06674] Fps is (10 sec: 47513.3, 60 sec: 43963.8, 300 sec: 43709.2). Total num frames: 1103577088. Throughput: 0: 43576.2. Samples: 1006433320. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 20:06:58,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:07:01,494][06909] Updated weights for policy 0, policy_version 67363 (0.0033) [2024-06-27 20:07:02,990][06887] Signal inference workers to stop experience collection... (14350 times) [2024-06-27 20:07:02,992][06887] Signal inference workers to resume experience collection... (14350 times) [2024-06-27 20:07:03,030][06909] InferenceWorker_p0-w0: stopping experience collection (14350 times) [2024-06-27 20:07:03,030][06909] InferenceWorker_p0-w0: resuming experience collection (14350 times) [2024-06-27 20:07:03,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43417.7, 300 sec: 43653.7). Total num frames: 1103773696. Throughput: 0: 43531.6. Samples: 1006695840. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 20:07:03,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 20:07:05,103][06909] Updated weights for policy 0, policy_version 67373 (0.0027) [2024-06-27 20:07:08,850][06674] Fps is (10 sec: 39322.0, 60 sec: 43417.7, 300 sec: 43598.1). Total num frames: 1103970304. Throughput: 0: 43671.2. Samples: 1006959220. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 20:07:08,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 20:07:09,188][06909] Updated weights for policy 0, policy_version 67383 (0.0031) [2024-06-27 20:07:12,870][06909] Updated weights for policy 0, policy_version 67393 (0.0041) [2024-06-27 20:07:13,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43965.3, 300 sec: 43709.2). Total num frames: 1104232448. Throughput: 0: 43595.2. Samples: 1007083940. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 20:07:13,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 20:07:16,957][06909] Updated weights for policy 0, policy_version 67403 (0.0035) [2024-06-27 20:07:18,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43144.6, 300 sec: 43598.1). Total num frames: 1104412672. Throughput: 0: 43505.9. Samples: 1007342080. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 20:07:18,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 20:07:20,344][06909] Updated weights for policy 0, policy_version 67413 (0.0045) [2024-06-27 20:07:23,856][06674] Fps is (10 sec: 39297.5, 60 sec: 43413.2, 300 sec: 43597.2). Total num frames: 1104625664. Throughput: 0: 43540.3. Samples: 1007608620. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 20:07:23,857][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:07:24,332][06909] Updated weights for policy 0, policy_version 67423 (0.0042) [2024-06-27 20:07:27,858][06909] Updated weights for policy 0, policy_version 67433 (0.0033) [2024-06-27 20:07:28,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 1104871424. Throughput: 0: 43384.5. Samples: 1007735540. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 20:07:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:07:31,818][06909] Updated weights for policy 0, policy_version 67443 (0.0028) [2024-06-27 20:07:33,850][06674] Fps is (10 sec: 45903.3, 60 sec: 43417.7, 300 sec: 43653.6). Total num frames: 1105084416. Throughput: 0: 43397.8. Samples: 1007999760. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 20:07:33,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:07:35,486][06909] Updated weights for policy 0, policy_version 67453 (0.0035) [2024-06-27 20:07:38,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43417.6, 300 sec: 43653.9). Total num frames: 1105281024. Throughput: 0: 43480.5. Samples: 1008262820. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 20:07:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 20:07:39,240][06909] Updated weights for policy 0, policy_version 67463 (0.0031) [2024-06-27 20:07:43,007][06909] Updated weights for policy 0, policy_version 67473 (0.0033) [2024-06-27 20:07:43,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.7, 300 sec: 43764.7). Total num frames: 1105543168. Throughput: 0: 43493.7. Samples: 1008390540. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 20:07:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:07:46,640][06909] Updated weights for policy 0, policy_version 67483 (0.0038) [2024-06-27 20:07:48,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 1105723392. Throughput: 0: 43557.7. Samples: 1008655940. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 20:07:48,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:07:48,856][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000067488_1105723392.pth... [2024-06-27 20:07:48,927][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000066850_1095270400.pth [2024-06-27 20:07:50,172][06909] Updated weights for policy 0, policy_version 67493 (0.0036) [2024-06-27 20:07:53,850][06674] Fps is (10 sec: 39321.6, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 1105936384. Throughput: 0: 43526.5. Samples: 1008917920. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 20:07:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:07:54,376][06909] Updated weights for policy 0, policy_version 67503 (0.0031) [2024-06-27 20:07:57,909][06909] Updated weights for policy 0, policy_version 67513 (0.0042) [2024-06-27 20:07:58,850][06674] Fps is (10 sec: 47513.5, 60 sec: 43690.6, 300 sec: 43709.5). Total num frames: 1106198528. Throughput: 0: 43797.2. Samples: 1009054820. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 20:07:58,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:08:02,049][06909] Updated weights for policy 0, policy_version 67523 (0.0026) [2024-06-27 20:08:03,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43417.6, 300 sec: 43653.6). Total num frames: 1106378752. Throughput: 0: 43792.8. Samples: 1009312760. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 20:08:03,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:08:05,365][06909] Updated weights for policy 0, policy_version 67533 (0.0024) [2024-06-27 20:08:08,850][06674] Fps is (10 sec: 39321.8, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 1106591744. Throughput: 0: 43889.0. Samples: 1009583360. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 20:08:08,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:08:09,257][06909] Updated weights for policy 0, policy_version 67543 (0.0037) [2024-06-27 20:08:12,855][06909] Updated weights for policy 0, policy_version 67553 (0.0041) [2024-06-27 20:08:13,850][06674] Fps is (10 sec: 47514.0, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 1106853888. Throughput: 0: 43898.3. Samples: 1009710960. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 20:08:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:08:16,522][06909] Updated weights for policy 0, policy_version 67563 (0.0023) [2024-06-27 20:08:18,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 1107034112. Throughput: 0: 43811.1. Samples: 1009971260. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 20:08:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:08:18,956][06887] Signal inference workers to stop experience collection... (14400 times) [2024-06-27 20:08:18,956][06887] Signal inference workers to resume experience collection... (14400 times) [2024-06-27 20:08:19,004][06909] InferenceWorker_p0-w0: stopping experience collection (14400 times) [2024-06-27 20:08:19,004][06909] InferenceWorker_p0-w0: resuming experience collection (14400 times) [2024-06-27 20:08:20,242][06909] Updated weights for policy 0, policy_version 67573 (0.0030) [2024-06-27 20:08:23,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43968.2, 300 sec: 43709.2). Total num frames: 1107263488. Throughput: 0: 43888.4. Samples: 1010237800. Policy #0 lag: (min: 0.0, avg: 12.0, max: 25.0) [2024-06-27 20:08:23,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:08:24,041][06909] Updated weights for policy 0, policy_version 67583 (0.0034) [2024-06-27 20:08:27,616][06909] Updated weights for policy 0, policy_version 67593 (0.0033) [2024-06-27 20:08:28,850][06674] Fps is (10 sec: 47514.1, 60 sec: 43963.8, 300 sec: 43709.2). Total num frames: 1107509248. Throughput: 0: 43901.5. Samples: 1010366100. Policy #0 lag: (min: 0.0, avg: 12.0, max: 25.0) [2024-06-27 20:08:28,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:08:31,775][06909] Updated weights for policy 0, policy_version 67603 (0.0041) [2024-06-27 20:08:33,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43144.6, 300 sec: 43598.1). Total num frames: 1107673088. Throughput: 0: 43726.8. Samples: 1010623640. Policy #0 lag: (min: 0.0, avg: 12.0, max: 25.0) [2024-06-27 20:08:33,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 20:08:35,300][06909] Updated weights for policy 0, policy_version 67613 (0.0032) [2024-06-27 20:08:38,850][06674] Fps is (10 sec: 39321.3, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 1107902464. Throughput: 0: 43771.2. Samples: 1010887620. Policy #0 lag: (min: 0.0, avg: 12.0, max: 25.0) [2024-06-27 20:08:38,856][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:08:39,166][06909] Updated weights for policy 0, policy_version 67623 (0.0037) [2024-06-27 20:08:42,554][06909] Updated weights for policy 0, policy_version 67633 (0.0039) [2024-06-27 20:08:43,850][06674] Fps is (10 sec: 49151.9, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 1108164608. Throughput: 0: 43641.9. Samples: 1011018700. Policy #0 lag: (min: 0.0, avg: 12.0, max: 25.0) [2024-06-27 20:08:43,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 20:08:46,927][06909] Updated weights for policy 0, policy_version 67643 (0.0038) [2024-06-27 20:08:48,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 1108344832. Throughput: 0: 43643.9. Samples: 1011276740. Policy #0 lag: (min: 0.0, avg: 12.0, max: 25.0) [2024-06-27 20:08:48,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:08:50,479][06909] Updated weights for policy 0, policy_version 67653 (0.0034) [2024-06-27 20:08:53,850][06674] Fps is (10 sec: 39321.7, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 1108557824. Throughput: 0: 43497.0. Samples: 1011540720. Policy #0 lag: (min: 0.0, avg: 12.0, max: 25.0) [2024-06-27 20:08:53,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:08:54,203][06909] Updated weights for policy 0, policy_version 67663 (0.0036) [2024-06-27 20:08:57,791][06909] Updated weights for policy 0, policy_version 67673 (0.0031) [2024-06-27 20:08:58,856][06674] Fps is (10 sec: 47485.4, 60 sec: 43686.3, 300 sec: 43708.3). Total num frames: 1108819968. Throughput: 0: 43657.2. Samples: 1011675800. Policy #0 lag: (min: 0.0, avg: 12.0, max: 25.0) [2024-06-27 20:08:58,856][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:09:01,418][06909] Updated weights for policy 0, policy_version 67683 (0.0031) [2024-06-27 20:09:03,856][06674] Fps is (10 sec: 44209.8, 60 sec: 43686.3, 300 sec: 43597.2). Total num frames: 1109000192. Throughput: 0: 43703.9. Samples: 1011938200. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 20:09:03,856][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:09:05,144][06909] Updated weights for policy 0, policy_version 67693 (0.0031) [2024-06-27 20:09:08,850][06674] Fps is (10 sec: 40984.7, 60 sec: 43963.7, 300 sec: 43653.6). Total num frames: 1109229568. Throughput: 0: 43551.6. Samples: 1012197620. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 20:09:08,852][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:09:09,226][06909] Updated weights for policy 0, policy_version 67703 (0.0036) [2024-06-27 20:09:12,703][06909] Updated weights for policy 0, policy_version 67713 (0.0027) [2024-06-27 20:09:13,850][06674] Fps is (10 sec: 47542.5, 60 sec: 43690.6, 300 sec: 43709.5). Total num frames: 1109475328. Throughput: 0: 43686.6. Samples: 1012332000. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 20:09:13,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:09:16,614][06909] Updated weights for policy 0, policy_version 67723 (0.0031) [2024-06-27 20:09:18,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43417.7, 300 sec: 43598.4). Total num frames: 1109639168. Throughput: 0: 43694.7. Samples: 1012589900. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 20:09:18,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:09:20,069][06909] Updated weights for policy 0, policy_version 67733 (0.0040) [2024-06-27 20:09:23,852][06674] Fps is (10 sec: 39314.8, 60 sec: 43416.4, 300 sec: 43653.4). Total num frames: 1109868544. Throughput: 0: 43722.3. Samples: 1012855200. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 20:09:23,852][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:09:24,317][06909] Updated weights for policy 0, policy_version 67743 (0.0040) [2024-06-27 20:09:27,679][06909] Updated weights for policy 0, policy_version 67753 (0.0038) [2024-06-27 20:09:28,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43144.5, 300 sec: 43653.7). Total num frames: 1110097920. Throughput: 0: 43741.4. Samples: 1012987060. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 20:09:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:09:31,643][06909] Updated weights for policy 0, policy_version 67763 (0.0036) [2024-06-27 20:09:33,850][06674] Fps is (10 sec: 42605.1, 60 sec: 43690.5, 300 sec: 43598.1). Total num frames: 1110294528. Throughput: 0: 43760.0. Samples: 1013245940. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 20:09:33,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:09:35,032][06909] Updated weights for policy 0, policy_version 67773 (0.0035) [2024-06-27 20:09:38,597][06887] Signal inference workers to stop experience collection... (14450 times) [2024-06-27 20:09:38,597][06887] Signal inference workers to resume experience collection... (14450 times) [2024-06-27 20:09:38,621][06909] InferenceWorker_p0-w0: stopping experience collection (14450 times) [2024-06-27 20:09:38,621][06909] InferenceWorker_p0-w0: resuming experience collection (14450 times) [2024-06-27 20:09:38,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 1110540288. Throughput: 0: 43840.8. Samples: 1013513560. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 20:09:38,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:09:38,919][06909] Updated weights for policy 0, policy_version 67783 (0.0033) [2024-06-27 20:09:42,564][06909] Updated weights for policy 0, policy_version 67793 (0.0043) [2024-06-27 20:09:43,850][06674] Fps is (10 sec: 47513.6, 60 sec: 43417.5, 300 sec: 43653.6). Total num frames: 1110769664. Throughput: 0: 43741.8. Samples: 1013643920. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 20:09:43,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:09:46,192][06909] Updated weights for policy 0, policy_version 67803 (0.0036) [2024-06-27 20:09:48,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.8, 300 sec: 43598.1). Total num frames: 1110966272. Throughput: 0: 43657.9. Samples: 1013902540. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-27 20:09:48,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:09:48,862][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000067808_1110966272.pth... [2024-06-27 20:09:48,919][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000067170_1100513280.pth [2024-06-27 20:09:50,406][06909] Updated weights for policy 0, policy_version 67813 (0.0034) [2024-06-27 20:09:53,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 1111195648. Throughput: 0: 43549.4. Samples: 1014157340. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-27 20:09:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:09:53,916][06909] Updated weights for policy 0, policy_version 67823 (0.0032) [2024-06-27 20:09:57,834][06909] Updated weights for policy 0, policy_version 67833 (0.0035) [2024-06-27 20:09:58,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43148.9, 300 sec: 43598.1). Total num frames: 1111408640. Throughput: 0: 43608.9. Samples: 1014294400. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-27 20:09:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:10:01,715][06909] Updated weights for policy 0, policy_version 67843 (0.0032) [2024-06-27 20:10:03,850][06674] Fps is (10 sec: 42597.3, 60 sec: 43694.9, 300 sec: 43653.6). Total num frames: 1111621632. Throughput: 0: 43568.2. Samples: 1014550480. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-27 20:10:03,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:10:05,103][06909] Updated weights for policy 0, policy_version 67853 (0.0023) [2024-06-27 20:10:08,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.8, 300 sec: 43653.7). Total num frames: 1111851008. Throughput: 0: 43628.9. Samples: 1014818420. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-27 20:10:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:10:08,955][06909] Updated weights for policy 0, policy_version 67863 (0.0032) [2024-06-27 20:10:12,756][06909] Updated weights for policy 0, policy_version 67873 (0.0038) [2024-06-27 20:10:13,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43144.5, 300 sec: 43598.1). Total num frames: 1112064000. Throughput: 0: 43483.9. Samples: 1014943840. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-27 20:10:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:10:16,184][06909] Updated weights for policy 0, policy_version 67883 (0.0025) [2024-06-27 20:10:18,850][06674] Fps is (10 sec: 42597.7, 60 sec: 43963.7, 300 sec: 43598.1). Total num frames: 1112276992. Throughput: 0: 43677.4. Samples: 1015211420. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-27 20:10:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:10:20,091][06909] Updated weights for policy 0, policy_version 67893 (0.0027) [2024-06-27 20:10:23,805][06909] Updated weights for policy 0, policy_version 67903 (0.0032) [2024-06-27 20:10:23,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44238.1, 300 sec: 43709.2). Total num frames: 1112522752. Throughput: 0: 43525.8. Samples: 1015472220. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-27 20:10:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:10:27,631][06909] Updated weights for policy 0, policy_version 67913 (0.0025) [2024-06-27 20:10:28,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43963.6, 300 sec: 43598.1). Total num frames: 1112735744. Throughput: 0: 43622.2. Samples: 1015606920. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-27 20:10:28,851][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:10:31,250][06909] Updated weights for policy 0, policy_version 67923 (0.0026) [2024-06-27 20:10:33,850][06674] Fps is (10 sec: 39320.9, 60 sec: 43690.7, 300 sec: 43542.5). Total num frames: 1112915968. Throughput: 0: 43604.3. Samples: 1015864740. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-27 20:10:33,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:10:35,136][06909] Updated weights for policy 0, policy_version 67933 (0.0037) [2024-06-27 20:10:38,702][06909] Updated weights for policy 0, policy_version 67943 (0.0034) [2024-06-27 20:10:38,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 1113178112. Throughput: 0: 43735.5. Samples: 1016125440. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-27 20:10:38,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:10:42,539][06909] Updated weights for policy 0, policy_version 67953 (0.0036) [2024-06-27 20:10:43,856][06674] Fps is (10 sec: 47485.4, 60 sec: 43686.3, 300 sec: 43652.8). Total num frames: 1113391104. Throughput: 0: 43662.0. Samples: 1016259460. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-27 20:10:43,857][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 20:10:46,376][06909] Updated weights for policy 0, policy_version 67963 (0.0026) [2024-06-27 20:10:48,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 1113604096. Throughput: 0: 43679.8. Samples: 1016516060. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-27 20:10:48,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:10:50,147][06909] Updated weights for policy 0, policy_version 67973 (0.0036) [2024-06-27 20:10:53,756][06909] Updated weights for policy 0, policy_version 67983 (0.0040) [2024-06-27 20:10:53,850][06674] Fps is (10 sec: 44263.9, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 1113833472. Throughput: 0: 43613.2. Samples: 1016781020. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-27 20:10:53,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:10:57,536][06909] Updated weights for policy 0, policy_version 67993 (0.0034) [2024-06-27 20:10:58,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43417.6, 300 sec: 43542.6). Total num frames: 1114013696. Throughput: 0: 43732.1. Samples: 1016911780. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-27 20:10:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:10:59,462][06887] Signal inference workers to stop experience collection... (14500 times) [2024-06-27 20:10:59,516][06909] InferenceWorker_p0-w0: stopping experience collection (14500 times) [2024-06-27 20:10:59,519][06887] Signal inference workers to resume experience collection... (14500 times) [2024-06-27 20:10:59,526][06909] InferenceWorker_p0-w0: resuming experience collection (14500 times) [2024-06-27 20:11:01,163][06909] Updated weights for policy 0, policy_version 68003 (0.0040) [2024-06-27 20:11:03,850][06674] Fps is (10 sec: 40959.3, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 1114243072. Throughput: 0: 43601.7. Samples: 1017173500. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-27 20:11:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:11:04,946][06909] Updated weights for policy 0, policy_version 68013 (0.0028) [2024-06-27 20:11:08,666][06909] Updated weights for policy 0, policy_version 68023 (0.0039) [2024-06-27 20:11:08,850][06674] Fps is (10 sec: 47512.8, 60 sec: 43963.6, 300 sec: 43709.5). Total num frames: 1114488832. Throughput: 0: 43514.5. Samples: 1017430380. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-27 20:11:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 20:11:12,557][06909] Updated weights for policy 0, policy_version 68033 (0.0033) [2024-06-27 20:11:13,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 1114685440. Throughput: 0: 43542.0. Samples: 1017566300. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-27 20:11:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:11:16,097][06909] Updated weights for policy 0, policy_version 68043 (0.0038) [2024-06-27 20:11:18,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 1114898432. Throughput: 0: 43618.8. Samples: 1017827580. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-27 20:11:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:11:20,250][06909] Updated weights for policy 0, policy_version 68053 (0.0040) [2024-06-27 20:11:23,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43417.5, 300 sec: 43653.6). Total num frames: 1115127808. Throughput: 0: 43620.8. Samples: 1018088380. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-27 20:11:23,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:11:24,115][06909] Updated weights for policy 0, policy_version 68063 (0.0035) [2024-06-27 20:11:27,685][06909] Updated weights for policy 0, policy_version 68073 (0.0029) [2024-06-27 20:11:28,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43144.7, 300 sec: 43542.6). Total num frames: 1115324416. Throughput: 0: 43535.3. Samples: 1018218280. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-27 20:11:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:11:31,917][06909] Updated weights for policy 0, policy_version 68083 (0.0032) [2024-06-27 20:11:33,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43963.9, 300 sec: 43653.6). Total num frames: 1115553792. Throughput: 0: 43612.0. Samples: 1018478600. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-27 20:11:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:11:35,146][06909] Updated weights for policy 0, policy_version 68093 (0.0036) [2024-06-27 20:11:38,850][06674] Fps is (10 sec: 45874.5, 60 sec: 43417.6, 300 sec: 43653.6). Total num frames: 1115783168. Throughput: 0: 43551.0. Samples: 1018740820. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-27 20:11:38,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 20:11:39,193][06909] Updated weights for policy 0, policy_version 68103 (0.0039) [2024-06-27 20:11:42,766][06909] Updated weights for policy 0, policy_version 68113 (0.0028) [2024-06-27 20:11:43,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43148.9, 300 sec: 43653.6). Total num frames: 1115979776. Throughput: 0: 43629.7. Samples: 1018875120. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-27 20:11:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:11:46,714][06909] Updated weights for policy 0, policy_version 68123 (0.0035) [2024-06-27 20:11:48,851][06674] Fps is (10 sec: 42594.2, 60 sec: 43416.8, 300 sec: 43653.5). Total num frames: 1116209152. Throughput: 0: 43607.1. Samples: 1019135860. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-27 20:11:48,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:11:48,864][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000068128_1116209152.pth... [2024-06-27 20:11:48,911][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000067488_1105723392.pth [2024-06-27 20:11:50,525][06909] Updated weights for policy 0, policy_version 68133 (0.0039) [2024-06-27 20:11:53,856][06674] Fps is (10 sec: 45847.8, 60 sec: 43413.2, 300 sec: 43597.2). Total num frames: 1116438528. Throughput: 0: 43606.3. Samples: 1019392920. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-27 20:11:53,856][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:11:54,310][06909] Updated weights for policy 0, policy_version 68143 (0.0037) [2024-06-27 20:11:57,831][06909] Updated weights for policy 0, policy_version 68153 (0.0038) [2024-06-27 20:11:58,850][06674] Fps is (10 sec: 44241.2, 60 sec: 43963.6, 300 sec: 43653.6). Total num frames: 1116651520. Throughput: 0: 43539.0. Samples: 1019525560. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-27 20:11:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:12:01,948][06909] Updated weights for policy 0, policy_version 68163 (0.0033) [2024-06-27 20:12:03,850][06674] Fps is (10 sec: 42624.3, 60 sec: 43690.8, 300 sec: 43709.2). Total num frames: 1116864512. Throughput: 0: 43538.8. Samples: 1019786820. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-27 20:12:03,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:12:05,174][06909] Updated weights for policy 0, policy_version 68173 (0.0041) [2024-06-27 20:12:08,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43417.7, 300 sec: 43598.1). Total num frames: 1117093888. Throughput: 0: 43618.4. Samples: 1020051200. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-27 20:12:08,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:12:09,351][06909] Updated weights for policy 0, policy_version 68183 (0.0038) [2024-06-27 20:12:12,478][06909] Updated weights for policy 0, policy_version 68193 (0.0026) [2024-06-27 20:12:13,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 1117306880. Throughput: 0: 43681.2. Samples: 1020183940. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-27 20:12:13,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 20:12:15,687][06887] Signal inference workers to stop experience collection... (14550 times) [2024-06-27 20:12:15,688][06887] Signal inference workers to resume experience collection... (14550 times) [2024-06-27 20:12:15,703][06909] InferenceWorker_p0-w0: stopping experience collection (14550 times) [2024-06-27 20:12:15,731][06909] InferenceWorker_p0-w0: resuming experience collection (14550 times) [2024-06-27 20:12:16,565][06909] Updated weights for policy 0, policy_version 68203 (0.0042) [2024-06-27 20:12:18,850][06674] Fps is (10 sec: 42597.6, 60 sec: 43690.6, 300 sec: 43710.1). Total num frames: 1117519872. Throughput: 0: 43733.6. Samples: 1020446620. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-27 20:12:18,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 20:12:20,476][06909] Updated weights for policy 0, policy_version 68213 (0.0032) [2024-06-27 20:12:23,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 1117749248. Throughput: 0: 43559.1. Samples: 1020700980. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-27 20:12:23,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:12:24,365][06909] Updated weights for policy 0, policy_version 68223 (0.0028) [2024-06-27 20:12:27,742][06909] Updated weights for policy 0, policy_version 68233 (0.0042) [2024-06-27 20:12:28,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43963.7, 300 sec: 43653.6). Total num frames: 1117962240. Throughput: 0: 43618.3. Samples: 1020837940. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-27 20:12:28,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:12:32,116][06909] Updated weights for policy 0, policy_version 68243 (0.0025) [2024-06-27 20:12:33,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 1118175232. Throughput: 0: 43681.1. Samples: 1021101460. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 20:12:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:12:35,190][06909] Updated weights for policy 0, policy_version 68253 (0.0025) [2024-06-27 20:12:38,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 1118404608. Throughput: 0: 43697.4. Samples: 1021359040. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 20:12:38,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:12:39,327][06909] Updated weights for policy 0, policy_version 68263 (0.0040) [2024-06-27 20:12:42,822][06909] Updated weights for policy 0, policy_version 68273 (0.0023) [2024-06-27 20:12:43,852][06674] Fps is (10 sec: 44227.5, 60 sec: 43962.3, 300 sec: 43708.9). Total num frames: 1118617600. Throughput: 0: 43796.7. Samples: 1021496500. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 20:12:43,852][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:12:46,826][06909] Updated weights for policy 0, policy_version 68283 (0.0030) [2024-06-27 20:12:48,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43691.4, 300 sec: 43709.2). Total num frames: 1118830592. Throughput: 0: 43874.5. Samples: 1021761180. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 20:12:48,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:12:50,200][06909] Updated weights for policy 0, policy_version 68293 (0.0034) [2024-06-27 20:12:53,850][06674] Fps is (10 sec: 44245.9, 60 sec: 43695.1, 300 sec: 43598.1). Total num frames: 1119059968. Throughput: 0: 43724.9. Samples: 1022018820. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 20:12:53,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:12:54,326][06909] Updated weights for policy 0, policy_version 68303 (0.0034) [2024-06-27 20:12:57,982][06909] Updated weights for policy 0, policy_version 68313 (0.0025) [2024-06-27 20:12:58,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 1119272960. Throughput: 0: 43728.0. Samples: 1022151700. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 20:12:58,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:13:01,623][06909] Updated weights for policy 0, policy_version 68323 (0.0030) [2024-06-27 20:13:03,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 43764.7). Total num frames: 1119502336. Throughput: 0: 43965.5. Samples: 1022425060. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 20:13:03,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 20:13:05,201][06909] Updated weights for policy 0, policy_version 68333 (0.0032) [2024-06-27 20:13:08,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43690.5, 300 sec: 43598.1). Total num frames: 1119715328. Throughput: 0: 43865.3. Samples: 1022674920. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 20:13:08,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 20:13:09,403][06909] Updated weights for policy 0, policy_version 68343 (0.0024) [2024-06-27 20:13:12,727][06909] Updated weights for policy 0, policy_version 68353 (0.0030) [2024-06-27 20:13:13,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 1119928320. Throughput: 0: 43848.0. Samples: 1022811100. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 20:13:13,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:13:16,834][06909] Updated weights for policy 0, policy_version 68363 (0.0039) [2024-06-27 20:13:18,850][06674] Fps is (10 sec: 44237.7, 60 sec: 43963.9, 300 sec: 43709.2). Total num frames: 1120157696. Throughput: 0: 43904.9. Samples: 1023077180. Policy #0 lag: (min: 0.0, avg: 11.4, max: 22.0) [2024-06-27 20:13:18,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:13:20,333][06909] Updated weights for policy 0, policy_version 68373 (0.0040) [2024-06-27 20:13:23,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43417.7, 300 sec: 43542.6). Total num frames: 1120354304. Throughput: 0: 43783.6. Samples: 1023329300. Policy #0 lag: (min: 0.0, avg: 11.4, max: 22.0) [2024-06-27 20:13:23,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 20:13:24,482][06909] Updated weights for policy 0, policy_version 68383 (0.0041) [2024-06-27 20:13:27,700][06909] Updated weights for policy 0, policy_version 68393 (0.0040) [2024-06-27 20:13:28,850][06674] Fps is (10 sec: 42597.6, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 1120583680. Throughput: 0: 43679.2. Samples: 1023461980. Policy #0 lag: (min: 0.0, avg: 11.4, max: 22.0) [2024-06-27 20:13:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:13:31,808][06909] Updated weights for policy 0, policy_version 68403 (0.0039) [2024-06-27 20:13:33,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 1120796672. Throughput: 0: 43811.7. Samples: 1023732700. Policy #0 lag: (min: 0.0, avg: 11.4, max: 22.0) [2024-06-27 20:13:33,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 20:13:35,461][06909] Updated weights for policy 0, policy_version 68413 (0.0032) [2024-06-27 20:13:38,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 1121026048. Throughput: 0: 43733.3. Samples: 1023986820. Policy #0 lag: (min: 0.0, avg: 11.4, max: 22.0) [2024-06-27 20:13:38,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:13:39,137][06909] Updated weights for policy 0, policy_version 68423 (0.0027) [2024-06-27 20:13:42,924][06909] Updated weights for policy 0, policy_version 68433 (0.0043) [2024-06-27 20:13:43,857][06674] Fps is (10 sec: 44206.7, 60 sec: 43687.2, 300 sec: 43708.2). Total num frames: 1121239040. Throughput: 0: 43649.5. Samples: 1024116220. Policy #0 lag: (min: 0.0, avg: 11.4, max: 22.0) [2024-06-27 20:13:43,857][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 20:13:46,686][06909] Updated weights for policy 0, policy_version 68443 (0.0028) [2024-06-27 20:13:48,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.7, 300 sec: 43764.7). Total num frames: 1121468416. Throughput: 0: 43558.5. Samples: 1024385200. Policy #0 lag: (min: 0.0, avg: 11.4, max: 22.0) [2024-06-27 20:13:48,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:13:48,859][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000068449_1121468416.pth... [2024-06-27 20:13:48,911][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000067808_1110966272.pth [2024-06-27 20:13:50,258][06909] Updated weights for policy 0, policy_version 68453 (0.0036) [2024-06-27 20:13:52,140][06887] Signal inference workers to stop experience collection... (14600 times) [2024-06-27 20:13:52,186][06909] InferenceWorker_p0-w0: stopping experience collection (14600 times) [2024-06-27 20:13:52,194][06887] Signal inference workers to resume experience collection... (14600 times) [2024-06-27 20:13:52,208][06909] InferenceWorker_p0-w0: resuming experience collection (14600 times) [2024-06-27 20:13:53,850][06674] Fps is (10 sec: 42627.1, 60 sec: 43417.5, 300 sec: 43543.5). Total num frames: 1121665024. Throughput: 0: 43652.1. Samples: 1024639260. Policy #0 lag: (min: 0.0, avg: 11.4, max: 22.0) [2024-06-27 20:13:53,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:13:54,308][06909] Updated weights for policy 0, policy_version 68463 (0.0030) [2024-06-27 20:13:57,761][06909] Updated weights for policy 0, policy_version 68473 (0.0036) [2024-06-27 20:13:58,852][06674] Fps is (10 sec: 42590.2, 60 sec: 43689.2, 300 sec: 43709.8). Total num frames: 1121894400. Throughput: 0: 43476.6. Samples: 1024767640. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 20:13:58,852][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 20:14:01,986][06909] Updated weights for policy 0, policy_version 68483 (0.0027) [2024-06-27 20:14:03,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 1122123776. Throughput: 0: 43588.8. Samples: 1025038680. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 20:14:03,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:14:05,689][06909] Updated weights for policy 0, policy_version 68493 (0.0038) [2024-06-27 20:14:08,852][06674] Fps is (10 sec: 44236.7, 60 sec: 43689.3, 300 sec: 43597.8). Total num frames: 1122336768. Throughput: 0: 43582.4. Samples: 1025290600. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 20:14:08,852][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:14:09,330][06909] Updated weights for policy 0, policy_version 68503 (0.0044) [2024-06-27 20:14:13,247][06909] Updated weights for policy 0, policy_version 68513 (0.0032) [2024-06-27 20:14:13,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 1122549760. Throughput: 0: 43599.7. Samples: 1025423960. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 20:14:13,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:14:16,543][06909] Updated weights for policy 0, policy_version 68523 (0.0027) [2024-06-27 20:14:18,850][06674] Fps is (10 sec: 42606.9, 60 sec: 43417.5, 300 sec: 43709.4). Total num frames: 1122762752. Throughput: 0: 43534.6. Samples: 1025691760. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 20:14:18,853][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:14:20,617][06909] Updated weights for policy 0, policy_version 68533 (0.0027) [2024-06-27 20:14:23,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 1122992128. Throughput: 0: 43579.1. Samples: 1025947880. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 20:14:23,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:14:24,273][06909] Updated weights for policy 0, policy_version 68543 (0.0049) [2024-06-27 20:14:28,225][06909] Updated weights for policy 0, policy_version 68553 (0.0024) [2024-06-27 20:14:28,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 1123205120. Throughput: 0: 43554.5. Samples: 1026075880. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 20:14:28,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:14:32,005][06909] Updated weights for policy 0, policy_version 68563 (0.0030) [2024-06-27 20:14:33,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43690.7, 300 sec: 43653.7). Total num frames: 1123418112. Throughput: 0: 43734.9. Samples: 1026353260. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 20:14:33,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 20:14:35,816][06909] Updated weights for policy 0, policy_version 68573 (0.0029) [2024-06-27 20:14:38,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 1123647488. Throughput: 0: 43640.8. Samples: 1026603100. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 20:14:38,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:14:39,337][06909] Updated weights for policy 0, policy_version 68583 (0.0028) [2024-06-27 20:14:43,066][06909] Updated weights for policy 0, policy_version 68593 (0.0028) [2024-06-27 20:14:43,850][06674] Fps is (10 sec: 44235.9, 60 sec: 43695.5, 300 sec: 43709.2). Total num frames: 1123860480. Throughput: 0: 43786.8. Samples: 1026737960. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-27 20:14:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:14:46,576][06909] Updated weights for policy 0, policy_version 68603 (0.0039) [2024-06-27 20:14:48,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 1124089856. Throughput: 0: 43741.3. Samples: 1027007040. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-27 20:14:48,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:14:50,335][06909] Updated weights for policy 0, policy_version 68613 (0.0027) [2024-06-27 20:14:53,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43963.8, 300 sec: 43709.2). Total num frames: 1124302848. Throughput: 0: 43958.0. Samples: 1027268620. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-27 20:14:53,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:14:53,880][06909] Updated weights for policy 0, policy_version 68623 (0.0032) [2024-06-27 20:14:57,596][06909] Updated weights for policy 0, policy_version 68633 (0.0037) [2024-06-27 20:14:58,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43965.3, 300 sec: 43764.8). Total num frames: 1124532224. Throughput: 0: 43828.9. Samples: 1027396260. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-27 20:14:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:15:01,491][06909] Updated weights for policy 0, policy_version 68643 (0.0041) [2024-06-27 20:15:03,852][06674] Fps is (10 sec: 44227.4, 60 sec: 43689.2, 300 sec: 43708.9). Total num frames: 1124745216. Throughput: 0: 43879.8. Samples: 1027666440. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-27 20:15:03,853][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:15:05,186][06909] Updated weights for policy 0, policy_version 68653 (0.0038) [2024-06-27 20:15:08,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43692.2, 300 sec: 43709.2). Total num frames: 1124958208. Throughput: 0: 43889.5. Samples: 1027922900. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-27 20:15:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:15:08,918][06909] Updated weights for policy 0, policy_version 68663 (0.0025) [2024-06-27 20:15:12,874][06909] Updated weights for policy 0, policy_version 68673 (0.0035) [2024-06-27 20:15:13,850][06674] Fps is (10 sec: 40968.2, 60 sec: 43417.5, 300 sec: 43653.6). Total num frames: 1125154816. Throughput: 0: 44052.9. Samples: 1028058260. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-27 20:15:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:15:14,087][06887] Signal inference workers to stop experience collection... (14650 times) [2024-06-27 20:15:14,088][06887] Signal inference workers to resume experience collection... (14650 times) [2024-06-27 20:15:14,129][06909] InferenceWorker_p0-w0: stopping experience collection (14650 times) [2024-06-27 20:15:14,129][06909] InferenceWorker_p0-w0: resuming experience collection (14650 times) [2024-06-27 20:15:16,686][06909] Updated weights for policy 0, policy_version 68683 (0.0029) [2024-06-27 20:15:18,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 1125384192. Throughput: 0: 43724.8. Samples: 1028320880. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-27 20:15:18,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:15:20,159][06909] Updated weights for policy 0, policy_version 68693 (0.0047) [2024-06-27 20:15:23,852][06674] Fps is (10 sec: 45866.1, 60 sec: 43689.2, 300 sec: 43653.4). Total num frames: 1125613568. Throughput: 0: 43880.3. Samples: 1028577800. Policy #0 lag: (min: 1.0, avg: 10.4, max: 22.0) [2024-06-27 20:15:23,852][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:15:23,995][06909] Updated weights for policy 0, policy_version 68703 (0.0021) [2024-06-27 20:15:27,634][06909] Updated weights for policy 0, policy_version 68713 (0.0046) [2024-06-27 20:15:28,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43963.7, 300 sec: 43820.3). Total num frames: 1125842944. Throughput: 0: 43927.1. Samples: 1028714680. Policy #0 lag: (min: 1.0, avg: 10.4, max: 22.0) [2024-06-27 20:15:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:15:31,244][06909] Updated weights for policy 0, policy_version 68723 (0.0031) [2024-06-27 20:15:33,850][06674] Fps is (10 sec: 42607.0, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 1126039552. Throughput: 0: 43814.7. Samples: 1028978700. Policy #0 lag: (min: 1.0, avg: 10.4, max: 22.0) [2024-06-27 20:15:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:15:34,831][06909] Updated weights for policy 0, policy_version 68733 (0.0029) [2024-06-27 20:15:38,553][06909] Updated weights for policy 0, policy_version 68743 (0.0025) [2024-06-27 20:15:38,856][06674] Fps is (10 sec: 44210.3, 60 sec: 43959.4, 300 sec: 43709.2). Total num frames: 1126285312. Throughput: 0: 43782.5. Samples: 1029239100. Policy #0 lag: (min: 1.0, avg: 10.4, max: 22.0) [2024-06-27 20:15:38,857][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:15:42,746][06909] Updated weights for policy 0, policy_version 68753 (0.0035) [2024-06-27 20:15:43,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.8, 300 sec: 43709.2). Total num frames: 1126498304. Throughput: 0: 43876.3. Samples: 1029370700. Policy #0 lag: (min: 1.0, avg: 10.4, max: 22.0) [2024-06-27 20:15:43,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 20:15:45,927][06909] Updated weights for policy 0, policy_version 68763 (0.0024) [2024-06-27 20:15:48,850][06674] Fps is (10 sec: 42624.6, 60 sec: 43690.8, 300 sec: 43653.6). Total num frames: 1126711296. Throughput: 0: 43763.0. Samples: 1029635680. Policy #0 lag: (min: 1.0, avg: 10.4, max: 22.0) [2024-06-27 20:15:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:15:48,887][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000068770_1126727680.pth... [2024-06-27 20:15:48,936][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000068128_1116209152.pth [2024-06-27 20:15:50,162][06909] Updated weights for policy 0, policy_version 68773 (0.0033) [2024-06-27 20:15:53,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 1126924288. Throughput: 0: 43791.1. Samples: 1029893500. Policy #0 lag: (min: 1.0, avg: 10.4, max: 22.0) [2024-06-27 20:15:53,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 20:15:53,959][06909] Updated weights for policy 0, policy_version 68783 (0.0039) [2024-06-27 20:15:57,436][06909] Updated weights for policy 0, policy_version 68793 (0.0038) [2024-06-27 20:15:58,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 1127153664. Throughput: 0: 43819.6. Samples: 1030030140. Policy #0 lag: (min: 1.0, avg: 10.4, max: 22.0) [2024-06-27 20:15:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:16:01,214][06909] Updated weights for policy 0, policy_version 68803 (0.0037) [2024-06-27 20:16:03,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43419.1, 300 sec: 43598.1). Total num frames: 1127350272. Throughput: 0: 43698.6. Samples: 1030287320. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-27 20:16:03,858][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 20:16:05,345][06909] Updated weights for policy 0, policy_version 68813 (0.0025) [2024-06-27 20:16:08,589][06909] Updated weights for policy 0, policy_version 68823 (0.0025) [2024-06-27 20:16:08,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.7, 300 sec: 43764.7). Total num frames: 1127596032. Throughput: 0: 43900.2. Samples: 1030553220. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-27 20:16:08,860][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:16:12,584][06909] Updated weights for policy 0, policy_version 68833 (0.0034) [2024-06-27 20:16:13,850][06674] Fps is (10 sec: 47513.8, 60 sec: 44510.0, 300 sec: 43820.3). Total num frames: 1127825408. Throughput: 0: 43830.8. Samples: 1030687060. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-27 20:16:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:16:16,492][06909] Updated weights for policy 0, policy_version 68843 (0.0032) [2024-06-27 20:16:18,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43963.6, 300 sec: 43709.2). Total num frames: 1128022016. Throughput: 0: 43742.5. Samples: 1030947120. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-27 20:16:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:16:19,870][06909] Updated weights for policy 0, policy_version 68853 (0.0027) [2024-06-27 20:16:23,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43692.2, 300 sec: 43764.7). Total num frames: 1128235008. Throughput: 0: 43735.7. Samples: 1031206940. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-27 20:16:23,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:16:23,882][06909] Updated weights for policy 0, policy_version 68863 (0.0033) [2024-06-27 20:16:27,671][06909] Updated weights for policy 0, policy_version 68873 (0.0030) [2024-06-27 20:16:28,850][06674] Fps is (10 sec: 44237.9, 60 sec: 43690.8, 300 sec: 43764.7). Total num frames: 1128464384. Throughput: 0: 43827.2. Samples: 1031342920. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-27 20:16:28,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:16:31,272][06909] Updated weights for policy 0, policy_version 68883 (0.0037) [2024-06-27 20:16:33,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 43653.7). Total num frames: 1128660992. Throughput: 0: 43614.2. Samples: 1031598320. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-27 20:16:33,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:16:35,229][06887] Signal inference workers to stop experience collection... (14700 times) [2024-06-27 20:16:35,229][06887] Signal inference workers to resume experience collection... (14700 times) [2024-06-27 20:16:35,233][06909] Updated weights for policy 0, policy_version 68893 (0.0037) [2024-06-27 20:16:35,271][06909] InferenceWorker_p0-w0: stopping experience collection (14700 times) [2024-06-27 20:16:35,271][06909] InferenceWorker_p0-w0: resuming experience collection (14700 times) [2024-06-27 20:16:38,786][06909] Updated weights for policy 0, policy_version 68903 (0.0049) [2024-06-27 20:16:38,850][06674] Fps is (10 sec: 44235.6, 60 sec: 43694.9, 300 sec: 43820.2). Total num frames: 1128906752. Throughput: 0: 43827.3. Samples: 1031865740. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-27 20:16:38,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:16:42,594][06909] Updated weights for policy 0, policy_version 68913 (0.0037) [2024-06-27 20:16:43,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43690.8, 300 sec: 43764.9). Total num frames: 1129119744. Throughput: 0: 43735.7. Samples: 1031998240. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-27 20:16:43,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 20:16:46,109][06909] Updated weights for policy 0, policy_version 68923 (0.0033) [2024-06-27 20:16:48,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43690.6, 300 sec: 43710.1). Total num frames: 1129332736. Throughput: 0: 43724.9. Samples: 1032254940. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2024-06-27 20:16:48,859][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 20:16:50,111][06909] Updated weights for policy 0, policy_version 68933 (0.0033) [2024-06-27 20:16:53,634][06909] Updated weights for policy 0, policy_version 68943 (0.0042) [2024-06-27 20:16:53,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 43764.7). Total num frames: 1129562112. Throughput: 0: 43898.8. Samples: 1032528660. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2024-06-27 20:16:53,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:16:57,343][06909] Updated weights for policy 0, policy_version 68953 (0.0039) [2024-06-27 20:16:58,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.7, 300 sec: 43820.2). Total num frames: 1129791488. Throughput: 0: 43697.2. Samples: 1032653440. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2024-06-27 20:16:58,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:17:01,325][06909] Updated weights for policy 0, policy_version 68963 (0.0035) [2024-06-27 20:17:03,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44236.8, 300 sec: 43764.7). Total num frames: 1130004480. Throughput: 0: 43842.3. Samples: 1032920020. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2024-06-27 20:17:03,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 20:17:04,856][06909] Updated weights for policy 0, policy_version 68973 (0.0033) [2024-06-27 20:17:08,623][06909] Updated weights for policy 0, policy_version 68983 (0.0037) [2024-06-27 20:17:08,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 43820.2). Total num frames: 1130233856. Throughput: 0: 44033.6. Samples: 1033188460. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2024-06-27 20:17:08,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:17:12,655][06909] Updated weights for policy 0, policy_version 68993 (0.0045) [2024-06-27 20:17:13,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.6, 300 sec: 43820.3). Total num frames: 1130446848. Throughput: 0: 43836.3. Samples: 1033315560. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2024-06-27 20:17:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:17:16,082][06909] Updated weights for policy 0, policy_version 69003 (0.0038) [2024-06-27 20:17:18,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43963.9, 300 sec: 43764.7). Total num frames: 1130659840. Throughput: 0: 43832.9. Samples: 1033570800. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2024-06-27 20:17:18,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 20:17:19,990][06909] Updated weights for policy 0, policy_version 69013 (0.0039) [2024-06-27 20:17:23,601][06909] Updated weights for policy 0, policy_version 69023 (0.0026) [2024-06-27 20:17:23,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.7, 300 sec: 43764.7). Total num frames: 1130872832. Throughput: 0: 43848.1. Samples: 1033838900. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2024-06-27 20:17:23,854][06674] Avg episode reward: [(0, '0.399')] [2024-06-27 20:17:27,294][06909] Updated weights for policy 0, policy_version 69033 (0.0037) [2024-06-27 20:17:28,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 1131085824. Throughput: 0: 43903.9. Samples: 1033973920. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 20:17:28,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:17:31,139][06909] Updated weights for policy 0, policy_version 69043 (0.0026) [2024-06-27 20:17:33,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.8, 300 sec: 43764.7). Total num frames: 1131315200. Throughput: 0: 43788.9. Samples: 1034225440. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 20:17:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:17:34,662][06909] Updated weights for policy 0, policy_version 69053 (0.0030) [2024-06-27 20:17:38,736][06909] Updated weights for policy 0, policy_version 69063 (0.0039) [2024-06-27 20:17:38,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.8, 300 sec: 43765.0). Total num frames: 1131528192. Throughput: 0: 43716.9. Samples: 1034495920. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 20:17:38,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:17:42,213][06909] Updated weights for policy 0, policy_version 69073 (0.0050) [2024-06-27 20:17:43,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 1131741184. Throughput: 0: 43779.6. Samples: 1034623520. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 20:17:43,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:17:45,987][06909] Updated weights for policy 0, policy_version 69083 (0.0025) [2024-06-27 20:17:48,850][06674] Fps is (10 sec: 45874.4, 60 sec: 44236.7, 300 sec: 43820.2). Total num frames: 1131986944. Throughput: 0: 43815.0. Samples: 1034891700. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 20:17:48,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:17:48,857][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000069091_1131986944.pth... [2024-06-27 20:17:48,907][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000068449_1121468416.pth [2024-06-27 20:17:49,971][06909] Updated weights for policy 0, policy_version 69093 (0.0042) [2024-06-27 20:17:53,438][06909] Updated weights for policy 0, policy_version 69103 (0.0047) [2024-06-27 20:17:53,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43963.7, 300 sec: 43820.3). Total num frames: 1132199936. Throughput: 0: 43489.4. Samples: 1035145480. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 20:17:53,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:17:57,650][06909] Updated weights for policy 0, policy_version 69113 (0.0038) [2024-06-27 20:17:58,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 1132396544. Throughput: 0: 43529.7. Samples: 1035274400. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 20:17:58,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:18:00,780][06887] Signal inference workers to stop experience collection... (14750 times) [2024-06-27 20:18:00,781][06887] Signal inference workers to resume experience collection... (14750 times) [2024-06-27 20:18:00,824][06909] InferenceWorker_p0-w0: stopping experience collection (14750 times) [2024-06-27 20:18:00,824][06909] InferenceWorker_p0-w0: resuming experience collection (14750 times) [2024-06-27 20:18:00,928][06909] Updated weights for policy 0, policy_version 69123 (0.0024) [2024-06-27 20:18:03,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 1132625920. Throughput: 0: 43619.5. Samples: 1035533680. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 20:18:03,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:18:05,059][06909] Updated weights for policy 0, policy_version 69133 (0.0041) [2024-06-27 20:18:08,678][06909] Updated weights for policy 0, policy_version 69143 (0.0028) [2024-06-27 20:18:08,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43417.6, 300 sec: 43764.7). Total num frames: 1132838912. Throughput: 0: 43505.7. Samples: 1035796660. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 20:18:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:18:12,455][06909] Updated weights for policy 0, policy_version 69153 (0.0031) [2024-06-27 20:18:13,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43417.5, 300 sec: 43709.2). Total num frames: 1133051904. Throughput: 0: 43444.4. Samples: 1035928920. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-27 20:18:13,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:18:16,273][06909] Updated weights for policy 0, policy_version 69163 (0.0026) [2024-06-27 20:18:18,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43690.6, 300 sec: 43820.3). Total num frames: 1133281280. Throughput: 0: 43689.4. Samples: 1036191460. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-27 20:18:18,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:18:20,033][06909] Updated weights for policy 0, policy_version 69173 (0.0040) [2024-06-27 20:18:23,665][06909] Updated weights for policy 0, policy_version 69183 (0.0036) [2024-06-27 20:18:23,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 1133494272. Throughput: 0: 43464.8. Samples: 1036451840. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-27 20:18:23,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:18:27,406][06909] Updated weights for policy 0, policy_version 69193 (0.0033) [2024-06-27 20:18:28,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 1133690880. Throughput: 0: 43437.8. Samples: 1036578220. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-27 20:18:28,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:18:31,199][06909] Updated weights for policy 0, policy_version 69203 (0.0026) [2024-06-27 20:18:33,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43144.5, 300 sec: 43653.6). Total num frames: 1133903872. Throughput: 0: 43214.3. Samples: 1036836340. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-27 20:18:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:18:35,394][06909] Updated weights for policy 0, policy_version 69213 (0.0044) [2024-06-27 20:18:38,521][06909] Updated weights for policy 0, policy_version 69223 (0.0021) [2024-06-27 20:18:38,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43690.6, 300 sec: 43765.7). Total num frames: 1134149632. Throughput: 0: 43441.8. Samples: 1037100360. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-27 20:18:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:18:42,626][06909] Updated weights for policy 0, policy_version 69233 (0.0033) [2024-06-27 20:18:43,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43417.7, 300 sec: 43653.7). Total num frames: 1134346240. Throughput: 0: 43508.6. Samples: 1037232280. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-27 20:18:43,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 20:18:46,242][06909] Updated weights for policy 0, policy_version 69243 (0.0028) [2024-06-27 20:18:48,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43144.7, 300 sec: 43764.7). Total num frames: 1134575616. Throughput: 0: 43619.2. Samples: 1037496540. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-27 20:18:48,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:18:49,905][06909] Updated weights for policy 0, policy_version 69253 (0.0037) [2024-06-27 20:18:53,812][06909] Updated weights for policy 0, policy_version 69263 (0.0035) [2024-06-27 20:18:53,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43417.6, 300 sec: 43765.0). Total num frames: 1134804992. Throughput: 0: 43626.7. Samples: 1037759860. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-27 20:18:53,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 20:18:57,601][06909] Updated weights for policy 0, policy_version 69273 (0.0023) [2024-06-27 20:18:58,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 1135017984. Throughput: 0: 43519.6. Samples: 1037887300. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-27 20:18:58,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:19:01,585][06909] Updated weights for policy 0, policy_version 69283 (0.0034) [2024-06-27 20:19:03,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43417.6, 300 sec: 43709.5). Total num frames: 1135230976. Throughput: 0: 43576.0. Samples: 1038152380. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-27 20:19:03,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 20:19:04,875][06909] Updated weights for policy 0, policy_version 69293 (0.0034) [2024-06-27 20:19:08,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43417.7, 300 sec: 43709.2). Total num frames: 1135443968. Throughput: 0: 43516.9. Samples: 1038410100. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-27 20:19:08,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:19:08,877][06909] Updated weights for policy 0, policy_version 69303 (0.0023) [2024-06-27 20:19:12,643][06909] Updated weights for policy 0, policy_version 69313 (0.0036) [2024-06-27 20:19:13,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43144.6, 300 sec: 43653.6). Total num frames: 1135640576. Throughput: 0: 43597.7. Samples: 1038540120. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-27 20:19:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:19:16,320][06909] Updated weights for policy 0, policy_version 69323 (0.0027) [2024-06-27 20:19:18,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43144.5, 300 sec: 43653.6). Total num frames: 1135869952. Throughput: 0: 43644.8. Samples: 1038800360. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-27 20:19:18,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:19:19,688][06887] Signal inference workers to stop experience collection... (14800 times) [2024-06-27 20:19:19,689][06887] Signal inference workers to resume experience collection... (14800 times) [2024-06-27 20:19:19,727][06909] InferenceWorker_p0-w0: stopping experience collection (14800 times) [2024-06-27 20:19:19,727][06909] InferenceWorker_p0-w0: resuming experience collection (14800 times) [2024-06-27 20:19:19,992][06909] Updated weights for policy 0, policy_version 69333 (0.0025) [2024-06-27 20:19:23,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43417.5, 300 sec: 43709.2). Total num frames: 1136099328. Throughput: 0: 43697.2. Samples: 1039066740. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-27 20:19:23,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:19:24,132][06909] Updated weights for policy 0, policy_version 69343 (0.0038) [2024-06-27 20:19:27,679][06909] Updated weights for policy 0, policy_version 69353 (0.0032) [2024-06-27 20:19:28,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43417.6, 300 sec: 43653.6). Total num frames: 1136295936. Throughput: 0: 43602.2. Samples: 1039194380. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-27 20:19:28,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 20:19:31,454][06909] Updated weights for policy 0, policy_version 69363 (0.0040) [2024-06-27 20:19:33,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 1136541696. Throughput: 0: 43545.7. Samples: 1039456100. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-27 20:19:33,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:19:35,124][06909] Updated weights for policy 0, policy_version 69373 (0.0038) [2024-06-27 20:19:38,745][06909] Updated weights for policy 0, policy_version 69383 (0.0031) [2024-06-27 20:19:38,850][06674] Fps is (10 sec: 47514.0, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 1136771072. Throughput: 0: 43508.6. Samples: 1039717740. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 20:19:38,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:19:42,314][06909] Updated weights for policy 0, policy_version 69393 (0.0037) [2024-06-27 20:19:43,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 1136967680. Throughput: 0: 43729.3. Samples: 1039855120. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 20:19:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:19:46,680][06909] Updated weights for policy 0, policy_version 69403 (0.0032) [2024-06-27 20:19:48,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43417.5, 300 sec: 43653.6). Total num frames: 1137180672. Throughput: 0: 43445.7. Samples: 1040107440. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 20:19:48,852][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:19:48,939][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000069409_1137197056.pth... [2024-06-27 20:19:48,983][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000068770_1126727680.pth [2024-06-27 20:19:50,267][06909] Updated weights for policy 0, policy_version 69413 (0.0031) [2024-06-27 20:19:53,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43417.6, 300 sec: 43653.6). Total num frames: 1137410048. Throughput: 0: 43676.4. Samples: 1040375540. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 20:19:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:19:54,125][06909] Updated weights for policy 0, policy_version 69423 (0.0024) [2024-06-27 20:19:57,874][06909] Updated weights for policy 0, policy_version 69433 (0.0031) [2024-06-27 20:19:58,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43144.6, 300 sec: 43598.4). Total num frames: 1137606656. Throughput: 0: 43710.8. Samples: 1040507100. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 20:19:58,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:20:01,468][06909] Updated weights for policy 0, policy_version 69443 (0.0034) [2024-06-27 20:20:03,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 1137852416. Throughput: 0: 43760.9. Samples: 1040769600. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 20:20:03,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:20:05,484][06909] Updated weights for policy 0, policy_version 69453 (0.0032) [2024-06-27 20:20:08,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 1138065408. Throughput: 0: 43640.0. Samples: 1041030540. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 20:20:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:20:09,031][06909] Updated weights for policy 0, policy_version 69463 (0.0032) [2024-06-27 20:20:12,751][06909] Updated weights for policy 0, policy_version 69473 (0.0022) [2024-06-27 20:20:13,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.8, 300 sec: 43709.2). Total num frames: 1138278400. Throughput: 0: 43714.2. Samples: 1041161520. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 20:20:13,856][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 20:20:16,389][06909] Updated weights for policy 0, policy_version 69483 (0.0047) [2024-06-27 20:20:18,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44236.8, 300 sec: 43765.0). Total num frames: 1138524160. Throughput: 0: 43847.5. Samples: 1041429240. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 20:20:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:20:20,017][06909] Updated weights for policy 0, policy_version 69493 (0.0040) [2024-06-27 20:20:23,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 1138720768. Throughput: 0: 43795.9. Samples: 1041688560. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-27 20:20:23,856][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:20:24,071][06909] Updated weights for policy 0, policy_version 69503 (0.0035) [2024-06-27 20:20:27,893][06909] Updated weights for policy 0, policy_version 69513 (0.0040) [2024-06-27 20:20:28,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 1138933760. Throughput: 0: 43619.6. Samples: 1041818000. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-27 20:20:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 20:20:31,442][06909] Updated weights for policy 0, policy_version 69523 (0.0042) [2024-06-27 20:20:33,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.6, 300 sec: 43654.5). Total num frames: 1139163136. Throughput: 0: 43901.3. Samples: 1042083000. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-27 20:20:33,854][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:20:35,244][06909] Updated weights for policy 0, policy_version 69533 (0.0029) [2024-06-27 20:20:38,852][06674] Fps is (10 sec: 44228.2, 60 sec: 43416.1, 300 sec: 43653.3). Total num frames: 1139376128. Throughput: 0: 43660.7. Samples: 1042340360. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-27 20:20:38,852][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:20:38,996][06909] Updated weights for policy 0, policy_version 69543 (0.0039) [2024-06-27 20:20:43,017][06909] Updated weights for policy 0, policy_version 69553 (0.0029) [2024-06-27 20:20:43,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 1139589120. Throughput: 0: 43648.4. Samples: 1042471280. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-27 20:20:43,856][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:20:46,592][06909] Updated weights for policy 0, policy_version 69563 (0.0034) [2024-06-27 20:20:48,800][06887] Signal inference workers to stop experience collection... (14850 times) [2024-06-27 20:20:48,814][06909] InferenceWorker_p0-w0: stopping experience collection (14850 times) [2024-06-27 20:20:48,850][06674] Fps is (10 sec: 42606.7, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 1139802112. Throughput: 0: 43671.5. Samples: 1042734820. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-27 20:20:48,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:20:48,861][06887] Signal inference workers to resume experience collection... (14850 times) [2024-06-27 20:20:48,862][06909] InferenceWorker_p0-w0: resuming experience collection (14850 times) [2024-06-27 20:20:50,291][06909] Updated weights for policy 0, policy_version 69573 (0.0036) [2024-06-27 20:20:53,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.7, 300 sec: 43653.7). Total num frames: 1140031488. Throughput: 0: 43860.6. Samples: 1043004260. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-27 20:20:53,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 20:20:53,993][06909] Updated weights for policy 0, policy_version 69583 (0.0029) [2024-06-27 20:20:57,586][06909] Updated weights for policy 0, policy_version 69593 (0.0040) [2024-06-27 20:20:58,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 1140244480. Throughput: 0: 43825.0. Samples: 1043133640. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-27 20:20:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:21:01,525][06909] Updated weights for policy 0, policy_version 69603 (0.0051) [2024-06-27 20:21:03,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.8, 300 sec: 43653.7). Total num frames: 1140473856. Throughput: 0: 43656.5. Samples: 1043393780. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-27 20:21:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:21:04,981][06909] Updated weights for policy 0, policy_version 69613 (0.0030) [2024-06-27 20:21:08,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43417.7, 300 sec: 43542.6). Total num frames: 1140670464. Throughput: 0: 43734.3. Samples: 1043656600. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-27 20:21:08,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:21:09,094][06909] Updated weights for policy 0, policy_version 69623 (0.0039) [2024-06-27 20:21:12,814][06909] Updated weights for policy 0, policy_version 69633 (0.0028) [2024-06-27 20:21:13,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43417.7, 300 sec: 43598.1). Total num frames: 1140883456. Throughput: 0: 43694.4. Samples: 1043784240. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-27 20:21:13,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:21:16,636][06909] Updated weights for policy 0, policy_version 69643 (0.0034) [2024-06-27 20:21:18,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 1141129216. Throughput: 0: 43563.2. Samples: 1044043340. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-27 20:21:18,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:21:20,515][06909] Updated weights for policy 0, policy_version 69653 (0.0033) [2024-06-27 20:21:23,850][06674] Fps is (10 sec: 45874.4, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 1141342208. Throughput: 0: 43665.4. Samples: 1044305220. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-27 20:21:23,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:21:24,218][06909] Updated weights for policy 0, policy_version 69663 (0.0027) [2024-06-27 20:21:27,840][06909] Updated weights for policy 0, policy_version 69673 (0.0038) [2024-06-27 20:21:28,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.7, 300 sec: 43764.7). Total num frames: 1141571584. Throughput: 0: 43673.2. Samples: 1044436580. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-27 20:21:28,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:21:31,662][06909] Updated weights for policy 0, policy_version 69683 (0.0031) [2024-06-27 20:21:33,850][06674] Fps is (10 sec: 45875.9, 60 sec: 43963.8, 300 sec: 43709.2). Total num frames: 1141800960. Throughput: 0: 43628.1. Samples: 1044698080. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-27 20:21:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:21:35,118][06909] Updated weights for policy 0, policy_version 69693 (0.0036) [2024-06-27 20:21:38,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43692.1, 300 sec: 43653.6). Total num frames: 1141997568. Throughput: 0: 43562.1. Samples: 1044964560. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-27 20:21:38,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:21:39,075][06909] Updated weights for policy 0, policy_version 69703 (0.0035) [2024-06-27 20:21:42,758][06909] Updated weights for policy 0, policy_version 69713 (0.0044) [2024-06-27 20:21:43,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 1142210560. Throughput: 0: 43544.4. Samples: 1045093140. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-27 20:21:43,853][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:21:46,423][06909] Updated weights for policy 0, policy_version 69723 (0.0030) [2024-06-27 20:21:48,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43963.8, 300 sec: 43653.6). Total num frames: 1142439936. Throughput: 0: 43692.9. Samples: 1045359960. Policy #0 lag: (min: 1.0, avg: 9.1, max: 20.0) [2024-06-27 20:21:48,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:21:48,918][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000069730_1142456320.pth... [2024-06-27 20:21:48,962][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000069091_1131986944.pth [2024-06-27 20:21:50,447][06909] Updated weights for policy 0, policy_version 69733 (0.0034) [2024-06-27 20:21:53,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 1142652928. Throughput: 0: 43680.8. Samples: 1045622240. Policy #0 lag: (min: 1.0, avg: 9.1, max: 20.0) [2024-06-27 20:21:53,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:21:54,065][06909] Updated weights for policy 0, policy_version 69743 (0.0027) [2024-06-27 20:21:58,056][06909] Updated weights for policy 0, policy_version 69753 (0.0033) [2024-06-27 20:21:58,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 1142865920. Throughput: 0: 43653.6. Samples: 1045748660. Policy #0 lag: (min: 1.0, avg: 9.1, max: 20.0) [2024-06-27 20:21:58,864][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:22:01,625][06909] Updated weights for policy 0, policy_version 69763 (0.0044) [2024-06-27 20:22:03,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 1143095296. Throughput: 0: 43658.3. Samples: 1046007960. Policy #0 lag: (min: 1.0, avg: 9.1, max: 20.0) [2024-06-27 20:22:03,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:22:05,340][06909] Updated weights for policy 0, policy_version 69773 (0.0028) [2024-06-27 20:22:08,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.7, 300 sec: 43598.1). Total num frames: 1143308288. Throughput: 0: 43900.9. Samples: 1046280760. Policy #0 lag: (min: 1.0, avg: 9.1, max: 20.0) [2024-06-27 20:22:08,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:22:08,970][06909] Updated weights for policy 0, policy_version 69783 (0.0036) [2024-06-27 20:22:12,527][06909] Updated weights for policy 0, policy_version 69793 (0.0034) [2024-06-27 20:22:13,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43963.6, 300 sec: 43598.1). Total num frames: 1143521280. Throughput: 0: 43750.7. Samples: 1046405360. Policy #0 lag: (min: 1.0, avg: 9.1, max: 20.0) [2024-06-27 20:22:13,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:22:16,516][06909] Updated weights for policy 0, policy_version 69803 (0.0033) [2024-06-27 20:22:18,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43690.7, 300 sec: 43653.7). Total num frames: 1143750656. Throughput: 0: 43828.4. Samples: 1046670360. Policy #0 lag: (min: 1.0, avg: 9.1, max: 20.0) [2024-06-27 20:22:18,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:22:19,900][06909] Updated weights for policy 0, policy_version 69813 (0.0029) [2024-06-27 20:22:23,648][06887] Signal inference workers to stop experience collection... (14900 times) [2024-06-27 20:22:23,652][06887] Signal inference workers to resume experience collection... (14900 times) [2024-06-27 20:22:23,689][06909] InferenceWorker_p0-w0: stopping experience collection (14900 times) [2024-06-27 20:22:23,690][06909] InferenceWorker_p0-w0: resuming experience collection (14900 times) [2024-06-27 20:22:23,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 1143963648. Throughput: 0: 43691.1. Samples: 1046930660. Policy #0 lag: (min: 1.0, avg: 9.1, max: 20.0) [2024-06-27 20:22:23,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:22:24,117][06909] Updated weights for policy 0, policy_version 69823 (0.0037) [2024-06-27 20:22:27,420][06909] Updated weights for policy 0, policy_version 69833 (0.0036) [2024-06-27 20:22:28,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 1144176640. Throughput: 0: 43641.3. Samples: 1047057000. Policy #0 lag: (min: 1.0, avg: 10.7, max: 21.0) [2024-06-27 20:22:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:22:31,779][06909] Updated weights for policy 0, policy_version 69843 (0.0033) [2024-06-27 20:22:33,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43417.6, 300 sec: 43653.6). Total num frames: 1144406016. Throughput: 0: 43523.5. Samples: 1047318520. Policy #0 lag: (min: 1.0, avg: 10.7, max: 21.0) [2024-06-27 20:22:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:22:35,323][06909] Updated weights for policy 0, policy_version 69853 (0.0026) [2024-06-27 20:22:38,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43417.7, 300 sec: 43598.1). Total num frames: 1144602624. Throughput: 0: 43681.0. Samples: 1047587880. Policy #0 lag: (min: 1.0, avg: 10.7, max: 21.0) [2024-06-27 20:22:38,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:22:39,366][06909] Updated weights for policy 0, policy_version 69863 (0.0046) [2024-06-27 20:22:42,660][06909] Updated weights for policy 0, policy_version 69873 (0.0039) [2024-06-27 20:22:43,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.7, 300 sec: 43598.1). Total num frames: 1144848384. Throughput: 0: 43633.4. Samples: 1047712160. Policy #0 lag: (min: 1.0, avg: 10.7, max: 21.0) [2024-06-27 20:22:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:22:46,725][06909] Updated weights for policy 0, policy_version 69883 (0.0030) [2024-06-27 20:22:48,852][06674] Fps is (10 sec: 47503.6, 60 sec: 43962.2, 300 sec: 43653.3). Total num frames: 1145077760. Throughput: 0: 43873.5. Samples: 1047982360. Policy #0 lag: (min: 1.0, avg: 10.7, max: 21.0) [2024-06-27 20:22:48,852][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:22:49,937][06909] Updated weights for policy 0, policy_version 69893 (0.0029) [2024-06-27 20:22:53,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.7, 300 sec: 43653.7). Total num frames: 1145274368. Throughput: 0: 43854.8. Samples: 1048254220. Policy #0 lag: (min: 1.0, avg: 10.7, max: 21.0) [2024-06-27 20:22:53,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:22:54,047][06909] Updated weights for policy 0, policy_version 69903 (0.0036) [2024-06-27 20:22:57,249][06909] Updated weights for policy 0, policy_version 69913 (0.0030) [2024-06-27 20:22:58,850][06674] Fps is (10 sec: 40968.6, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 1145487360. Throughput: 0: 43870.3. Samples: 1048379520. Policy #0 lag: (min: 1.0, avg: 10.7, max: 21.0) [2024-06-27 20:22:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:23:01,657][06909] Updated weights for policy 0, policy_version 69923 (0.0037) [2024-06-27 20:23:03,850][06674] Fps is (10 sec: 45874.5, 60 sec: 43963.6, 300 sec: 43709.2). Total num frames: 1145733120. Throughput: 0: 43858.1. Samples: 1048643980. Policy #0 lag: (min: 1.0, avg: 10.7, max: 21.0) [2024-06-27 20:23:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:23:04,562][06909] Updated weights for policy 0, policy_version 69933 (0.0030) [2024-06-27 20:23:08,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.7, 300 sec: 43653.7). Total num frames: 1145929728. Throughput: 0: 43806.3. Samples: 1048901940. Policy #0 lag: (min: 1.0, avg: 10.7, max: 21.0) [2024-06-27 20:23:08,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:23:09,181][06909] Updated weights for policy 0, policy_version 69943 (0.0025) [2024-06-27 20:23:12,518][06909] Updated weights for policy 0, policy_version 69953 (0.0046) [2024-06-27 20:23:13,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.8, 300 sec: 43709.2). Total num frames: 1146175488. Throughput: 0: 43891.1. Samples: 1049032100. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2024-06-27 20:23:13,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:23:16,438][06909] Updated weights for policy 0, policy_version 69963 (0.0026) [2024-06-27 20:23:18,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43963.6, 300 sec: 43709.2). Total num frames: 1146388480. Throughput: 0: 44069.6. Samples: 1049301660. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2024-06-27 20:23:18,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:23:19,875][06909] Updated weights for policy 0, policy_version 69973 (0.0031) [2024-06-27 20:23:23,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 1146585088. Throughput: 0: 43812.4. Samples: 1049559440. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2024-06-27 20:23:23,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:23:23,998][06909] Updated weights for policy 0, policy_version 69983 (0.0036) [2024-06-27 20:23:27,447][06909] Updated weights for policy 0, policy_version 69993 (0.0032) [2024-06-27 20:23:28,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43963.8, 300 sec: 43764.7). Total num frames: 1146814464. Throughput: 0: 43800.9. Samples: 1049683200. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2024-06-27 20:23:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:23:31,680][06909] Updated weights for policy 0, policy_version 70003 (0.0034) [2024-06-27 20:23:33,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 1147043840. Throughput: 0: 43921.1. Samples: 1049958720. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2024-06-27 20:23:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:23:34,658][06909] Updated weights for policy 0, policy_version 70013 (0.0031) [2024-06-27 20:23:38,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 1147240448. Throughput: 0: 43561.8. Samples: 1050214500. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2024-06-27 20:23:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 20:23:39,307][06909] Updated weights for policy 0, policy_version 70023 (0.0048) [2024-06-27 20:23:42,245][06909] Updated weights for policy 0, policy_version 70033 (0.0027) [2024-06-27 20:23:43,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 1147469824. Throughput: 0: 43693.3. Samples: 1050345720. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2024-06-27 20:23:43,850][06674] Avg episode reward: [(0, '0.443')] [2024-06-27 20:23:46,552][06909] Updated weights for policy 0, policy_version 70043 (0.0035) [2024-06-27 20:23:48,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43692.1, 300 sec: 43709.2). Total num frames: 1147699200. Throughput: 0: 43756.0. Samples: 1050613000. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2024-06-27 20:23:48,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:23:48,866][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000070050_1147699200.pth... [2024-06-27 20:23:48,915][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000069409_1137197056.pth [2024-06-27 20:23:49,895][06909] Updated weights for policy 0, policy_version 70053 (0.0032) [2024-06-27 20:23:50,549][06887] Signal inference workers to stop experience collection... (14950 times) [2024-06-27 20:23:50,594][06909] InferenceWorker_p0-w0: stopping experience collection (14950 times) [2024-06-27 20:23:50,603][06887] Signal inference workers to resume experience collection... (14950 times) [2024-06-27 20:23:50,604][06909] InferenceWorker_p0-w0: resuming experience collection (14950 times) [2024-06-27 20:23:53,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.7, 300 sec: 43653.7). Total num frames: 1147895808. Throughput: 0: 43638.3. Samples: 1050865660. Policy #0 lag: (min: 0.0, avg: 12.1, max: 25.0) [2024-06-27 20:23:53,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:23:53,904][06909] Updated weights for policy 0, policy_version 70063 (0.0031) [2024-06-27 20:23:57,678][06909] Updated weights for policy 0, policy_version 70073 (0.0030) [2024-06-27 20:23:58,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.6, 300 sec: 43709.2). Total num frames: 1148125184. Throughput: 0: 43717.2. Samples: 1050999380. Policy #0 lag: (min: 0.0, avg: 12.1, max: 25.0) [2024-06-27 20:23:58,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 20:24:01,506][06909] Updated weights for policy 0, policy_version 70083 (0.0028) [2024-06-27 20:24:03,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43144.6, 300 sec: 43653.6). Total num frames: 1148321792. Throughput: 0: 43473.9. Samples: 1051257980. Policy #0 lag: (min: 0.0, avg: 12.1, max: 25.0) [2024-06-27 20:24:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 20:24:04,995][06909] Updated weights for policy 0, policy_version 70093 (0.0036) [2024-06-27 20:24:08,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 1148551168. Throughput: 0: 43751.6. Samples: 1051528260. Policy #0 lag: (min: 0.0, avg: 12.1, max: 25.0) [2024-06-27 20:24:08,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:24:08,947][06909] Updated weights for policy 0, policy_version 70103 (0.0041) [2024-06-27 20:24:12,186][06909] Updated weights for policy 0, policy_version 70113 (0.0041) [2024-06-27 20:24:13,850][06674] Fps is (10 sec: 47513.3, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 1148796928. Throughput: 0: 43787.1. Samples: 1051653620. Policy #0 lag: (min: 0.0, avg: 12.1, max: 25.0) [2024-06-27 20:24:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:24:16,402][06909] Updated weights for policy 0, policy_version 70123 (0.0037) [2024-06-27 20:24:18,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 1149009920. Throughput: 0: 43665.7. Samples: 1051923680. Policy #0 lag: (min: 0.0, avg: 12.1, max: 25.0) [2024-06-27 20:24:18,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 20:24:19,599][06909] Updated weights for policy 0, policy_version 70133 (0.0029) [2024-06-27 20:24:23,842][06909] Updated weights for policy 0, policy_version 70143 (0.0044) [2024-06-27 20:24:23,852][06674] Fps is (10 sec: 42589.7, 60 sec: 43962.2, 300 sec: 43820.0). Total num frames: 1149222912. Throughput: 0: 43709.1. Samples: 1052181500. Policy #0 lag: (min: 0.0, avg: 12.1, max: 25.0) [2024-06-27 20:24:23,853][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:24:27,272][06909] Updated weights for policy 0, policy_version 70153 (0.0031) [2024-06-27 20:24:28,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.7, 300 sec: 43764.7). Total num frames: 1149452288. Throughput: 0: 43623.1. Samples: 1052308760. Policy #0 lag: (min: 0.0, avg: 12.1, max: 25.0) [2024-06-27 20:24:28,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:24:31,619][06909] Updated weights for policy 0, policy_version 70163 (0.0032) [2024-06-27 20:24:33,850][06674] Fps is (10 sec: 40968.7, 60 sec: 43144.6, 300 sec: 43598.1). Total num frames: 1149632512. Throughput: 0: 43713.0. Samples: 1052580080. Policy #0 lag: (min: 0.0, avg: 12.1, max: 25.0) [2024-06-27 20:24:33,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:24:34,612][06909] Updated weights for policy 0, policy_version 70173 (0.0028) [2024-06-27 20:24:38,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 1149861888. Throughput: 0: 43919.1. Samples: 1052842020. Policy #0 lag: (min: 0.0, avg: 12.0, max: 20.0) [2024-06-27 20:24:38,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:24:38,874][06909] Updated weights for policy 0, policy_version 70183 (0.0027) [2024-06-27 20:24:42,408][06909] Updated weights for policy 0, policy_version 70193 (0.0032) [2024-06-27 20:24:43,850][06674] Fps is (10 sec: 47513.2, 60 sec: 43963.7, 300 sec: 43820.3). Total num frames: 1150107648. Throughput: 0: 43836.5. Samples: 1052972020. Policy #0 lag: (min: 0.0, avg: 12.0, max: 20.0) [2024-06-27 20:24:43,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:24:46,506][06909] Updated weights for policy 0, policy_version 70203 (0.0033) [2024-06-27 20:24:48,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43417.7, 300 sec: 43709.2). Total num frames: 1150304256. Throughput: 0: 43959.1. Samples: 1053236140. Policy #0 lag: (min: 0.0, avg: 12.0, max: 20.0) [2024-06-27 20:24:48,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:24:49,961][06909] Updated weights for policy 0, policy_version 70213 (0.0023) [2024-06-27 20:24:53,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 1150517248. Throughput: 0: 43762.3. Samples: 1053497560. Policy #0 lag: (min: 0.0, avg: 12.0, max: 20.0) [2024-06-27 20:24:53,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:24:53,864][06909] Updated weights for policy 0, policy_version 70223 (0.0034) [2024-06-27 20:24:57,430][06909] Updated weights for policy 0, policy_version 70233 (0.0031) [2024-06-27 20:24:58,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.8, 300 sec: 43764.7). Total num frames: 1150763008. Throughput: 0: 43775.1. Samples: 1053623500. Policy #0 lag: (min: 0.0, avg: 12.0, max: 20.0) [2024-06-27 20:24:58,856][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:25:01,217][06909] Updated weights for policy 0, policy_version 70243 (0.0040) [2024-06-27 20:25:03,360][06887] Signal inference workers to stop experience collection... (15000 times) [2024-06-27 20:25:03,360][06887] Signal inference workers to resume experience collection... (15000 times) [2024-06-27 20:25:03,407][06909] InferenceWorker_p0-w0: stopping experience collection (15000 times) [2024-06-27 20:25:03,407][06909] InferenceWorker_p0-w0: resuming experience collection (15000 times) [2024-06-27 20:25:03,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 1150959616. Throughput: 0: 43593.0. Samples: 1053885360. Policy #0 lag: (min: 0.0, avg: 12.0, max: 20.0) [2024-06-27 20:25:03,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:25:05,226][06909] Updated weights for policy 0, policy_version 70253 (0.0036) [2024-06-27 20:25:08,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 1151172608. Throughput: 0: 43662.9. Samples: 1054146240. Policy #0 lag: (min: 0.0, avg: 12.0, max: 20.0) [2024-06-27 20:25:08,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:25:09,160][06909] Updated weights for policy 0, policy_version 70263 (0.0035) [2024-06-27 20:25:12,455][06909] Updated weights for policy 0, policy_version 70273 (0.0026) [2024-06-27 20:25:13,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 1151418368. Throughput: 0: 43853.8. Samples: 1054282180. Policy #0 lag: (min: 0.0, avg: 12.0, max: 20.0) [2024-06-27 20:25:13,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:25:16,444][06909] Updated weights for policy 0, policy_version 70283 (0.0028) [2024-06-27 20:25:18,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 1151631360. Throughput: 0: 43783.5. Samples: 1054550340. Policy #0 lag: (min: 0.0, avg: 12.0, max: 20.0) [2024-06-27 20:25:18,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 20:25:19,928][06909] Updated weights for policy 0, policy_version 70293 (0.0033) [2024-06-27 20:25:23,695][06909] Updated weights for policy 0, policy_version 70303 (0.0038) [2024-06-27 20:25:23,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43692.1, 300 sec: 43764.7). Total num frames: 1151844352. Throughput: 0: 43730.6. Samples: 1054809900. Policy #0 lag: (min: 1.0, avg: 9.7, max: 24.0) [2024-06-27 20:25:23,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:25:27,235][06909] Updated weights for policy 0, policy_version 70313 (0.0028) [2024-06-27 20:25:28,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 1152073728. Throughput: 0: 43687.1. Samples: 1054937940. Policy #0 lag: (min: 1.0, avg: 9.7, max: 24.0) [2024-06-27 20:25:28,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:25:31,499][06909] Updated weights for policy 0, policy_version 70323 (0.0028) [2024-06-27 20:25:33,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.7, 300 sec: 43765.0). Total num frames: 1152286720. Throughput: 0: 43787.0. Samples: 1055206560. Policy #0 lag: (min: 1.0, avg: 9.7, max: 24.0) [2024-06-27 20:25:33,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:25:34,795][06909] Updated weights for policy 0, policy_version 70333 (0.0024) [2024-06-27 20:25:38,777][06909] Updated weights for policy 0, policy_version 70343 (0.0026) [2024-06-27 20:25:38,856][06674] Fps is (10 sec: 42572.6, 60 sec: 43959.3, 300 sec: 43763.8). Total num frames: 1152499712. Throughput: 0: 43798.0. Samples: 1055468740. Policy #0 lag: (min: 1.0, avg: 9.7, max: 24.0) [2024-06-27 20:25:38,856][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:25:42,503][06909] Updated weights for policy 0, policy_version 70353 (0.0037) [2024-06-27 20:25:43,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 1152729088. Throughput: 0: 43881.9. Samples: 1055598180. Policy #0 lag: (min: 1.0, avg: 9.7, max: 24.0) [2024-06-27 20:25:43,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 20:25:46,237][06909] Updated weights for policy 0, policy_version 70363 (0.0031) [2024-06-27 20:25:48,852][06674] Fps is (10 sec: 42615.4, 60 sec: 43689.1, 300 sec: 43708.9). Total num frames: 1152925696. Throughput: 0: 43826.8. Samples: 1055857660. Policy #0 lag: (min: 1.0, avg: 9.7, max: 24.0) [2024-06-27 20:25:48,852][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:25:48,868][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000070369_1152925696.pth... [2024-06-27 20:25:48,941][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000069730_1142456320.pth [2024-06-27 20:25:49,993][06909] Updated weights for policy 0, policy_version 70373 (0.0040) [2024-06-27 20:25:53,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 1153138688. Throughput: 0: 43719.2. Samples: 1056113600. Policy #0 lag: (min: 1.0, avg: 9.7, max: 24.0) [2024-06-27 20:25:53,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:25:53,913][06909] Updated weights for policy 0, policy_version 70383 (0.0043) [2024-06-27 20:25:57,706][06909] Updated weights for policy 0, policy_version 70393 (0.0037) [2024-06-27 20:25:58,850][06674] Fps is (10 sec: 44246.3, 60 sec: 43417.7, 300 sec: 43709.2). Total num frames: 1153368064. Throughput: 0: 43582.7. Samples: 1056243400. Policy #0 lag: (min: 1.0, avg: 9.7, max: 24.0) [2024-06-27 20:25:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:26:01,213][06909] Updated weights for policy 0, policy_version 70403 (0.0033) [2024-06-27 20:26:03,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 1153581056. Throughput: 0: 43501.3. Samples: 1056507900. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-27 20:26:03,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:26:05,032][06909] Updated weights for policy 0, policy_version 70413 (0.0030) [2024-06-27 20:26:08,816][06909] Updated weights for policy 0, policy_version 70423 (0.0038) [2024-06-27 20:26:08,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.8, 300 sec: 43820.3). Total num frames: 1153810432. Throughput: 0: 43437.4. Samples: 1056764580. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-27 20:26:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:26:12,646][06909] Updated weights for policy 0, policy_version 70433 (0.0034) [2024-06-27 20:26:13,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 1154039808. Throughput: 0: 43577.8. Samples: 1056898940. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-27 20:26:13,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:26:16,577][06909] Updated weights for policy 0, policy_version 70443 (0.0029) [2024-06-27 20:26:18,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43144.5, 300 sec: 43653.7). Total num frames: 1154220032. Throughput: 0: 43406.3. Samples: 1057159840. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-27 20:26:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:26:20,184][06909] Updated weights for policy 0, policy_version 70453 (0.0042) [2024-06-27 20:26:23,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43417.7, 300 sec: 43653.7). Total num frames: 1154449408. Throughput: 0: 43312.5. Samples: 1057417540. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-27 20:26:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:26:24,046][06909] Updated weights for policy 0, policy_version 70463 (0.0043) [2024-06-27 20:26:27,480][06909] Updated weights for policy 0, policy_version 70473 (0.0032) [2024-06-27 20:26:28,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43144.6, 300 sec: 43598.1). Total num frames: 1154662400. Throughput: 0: 43419.6. Samples: 1057552060. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-27 20:26:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:26:31,462][06909] Updated weights for policy 0, policy_version 70483 (0.0032) [2024-06-27 20:26:33,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43144.5, 300 sec: 43653.6). Total num frames: 1154875392. Throughput: 0: 43422.4. Samples: 1057811580. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-27 20:26:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:26:35,103][06909] Updated weights for policy 0, policy_version 70493 (0.0028) [2024-06-27 20:26:38,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43422.0, 300 sec: 43709.2). Total num frames: 1155104768. Throughput: 0: 43453.4. Samples: 1058069000. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-27 20:26:38,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:26:39,363][06909] Updated weights for policy 0, policy_version 70503 (0.0042) [2024-06-27 20:26:43,096][06909] Updated weights for policy 0, policy_version 70513 (0.0044) [2024-06-27 20:26:43,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43144.5, 300 sec: 43653.6). Total num frames: 1155317760. Throughput: 0: 43653.7. Samples: 1058207820. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-27 20:26:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:26:46,727][06909] Updated weights for policy 0, policy_version 70523 (0.0028) [2024-06-27 20:26:48,614][06887] Signal inference workers to stop experience collection... (15050 times) [2024-06-27 20:26:48,614][06887] Signal inference workers to resume experience collection... (15050 times) [2024-06-27 20:26:48,648][06909] InferenceWorker_p0-w0: stopping experience collection (15050 times) [2024-06-27 20:26:48,648][06909] InferenceWorker_p0-w0: resuming experience collection (15050 times) [2024-06-27 20:26:48,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43692.2, 300 sec: 43709.2). Total num frames: 1155547136. Throughput: 0: 43464.0. Samples: 1058463780. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-27 20:26:48,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:26:50,452][06909] Updated weights for policy 0, policy_version 70533 (0.0028) [2024-06-27 20:26:53,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 1155760128. Throughput: 0: 43464.0. Samples: 1058720460. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-27 20:26:53,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:26:54,069][06909] Updated weights for policy 0, policy_version 70543 (0.0028) [2024-06-27 20:26:57,775][06909] Updated weights for policy 0, policy_version 70553 (0.0036) [2024-06-27 20:26:58,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.6, 300 sec: 43764.7). Total num frames: 1156005888. Throughput: 0: 43546.6. Samples: 1058858540. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-27 20:26:58,854][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:27:01,638][06909] Updated weights for policy 0, policy_version 70563 (0.0034) [2024-06-27 20:27:03,850][06674] Fps is (10 sec: 42597.7, 60 sec: 43417.5, 300 sec: 43653.6). Total num frames: 1156186112. Throughput: 0: 43656.3. Samples: 1059124380. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-27 20:27:03,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:27:05,090][06909] Updated weights for policy 0, policy_version 70573 (0.0032) [2024-06-27 20:27:08,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 1156415488. Throughput: 0: 43517.3. Samples: 1059375820. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-27 20:27:08,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:27:09,156][06909] Updated weights for policy 0, policy_version 70583 (0.0038) [2024-06-27 20:27:12,790][06909] Updated weights for policy 0, policy_version 70593 (0.0034) [2024-06-27 20:27:13,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43417.5, 300 sec: 43709.2). Total num frames: 1156644864. Throughput: 0: 43539.0. Samples: 1059511320. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-27 20:27:13,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:27:16,501][06909] Updated weights for policy 0, policy_version 70603 (0.0037) [2024-06-27 20:27:18,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 1156841472. Throughput: 0: 43537.3. Samples: 1059770760. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-27 20:27:18,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 20:27:20,395][06909] Updated weights for policy 0, policy_version 70613 (0.0038) [2024-06-27 20:27:23,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 1157070848. Throughput: 0: 43300.8. Samples: 1060017540. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-27 20:27:23,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:27:24,318][06909] Updated weights for policy 0, policy_version 70623 (0.0032) [2024-06-27 20:27:28,240][06909] Updated weights for policy 0, policy_version 70633 (0.0033) [2024-06-27 20:27:28,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 1157283840. Throughput: 0: 43335.6. Samples: 1060157920. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-27 20:27:28,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 20:27:31,851][06909] Updated weights for policy 0, policy_version 70643 (0.0045) [2024-06-27 20:27:33,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 1157496832. Throughput: 0: 43489.4. Samples: 1060420800. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 20:27:33,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:27:35,699][06909] Updated weights for policy 0, policy_version 70653 (0.0036) [2024-06-27 20:27:38,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 1157726208. Throughput: 0: 43571.5. Samples: 1060681180. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 20:27:38,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:27:39,492][06909] Updated weights for policy 0, policy_version 70663 (0.0041) [2024-06-27 20:27:42,879][06909] Updated weights for policy 0, policy_version 70673 (0.0036) [2024-06-27 20:27:43,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.7, 300 sec: 43598.4). Total num frames: 1157939200. Throughput: 0: 43490.3. Samples: 1060815600. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 20:27:43,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:27:46,746][06909] Updated weights for policy 0, policy_version 70683 (0.0031) [2024-06-27 20:27:48,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43417.6, 300 sec: 43653.6). Total num frames: 1158152192. Throughput: 0: 43407.2. Samples: 1061077700. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 20:27:48,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:27:48,865][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000070688_1158152192.pth... [2024-06-27 20:27:48,922][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000070050_1147699200.pth [2024-06-27 20:27:50,505][06909] Updated weights for policy 0, policy_version 70693 (0.0041) [2024-06-27 20:27:53,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 1158381568. Throughput: 0: 43482.7. Samples: 1061332540. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 20:27:53,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:27:54,317][06909] Updated weights for policy 0, policy_version 70703 (0.0037) [2024-06-27 20:27:57,985][06909] Updated weights for policy 0, policy_version 70713 (0.0030) [2024-06-27 20:27:58,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43144.6, 300 sec: 43598.1). Total num frames: 1158594560. Throughput: 0: 43552.5. Samples: 1061471180. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 20:27:58,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:28:01,637][06909] Updated weights for policy 0, policy_version 70723 (0.0032) [2024-06-27 20:28:03,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43417.7, 300 sec: 43598.1). Total num frames: 1158791168. Throughput: 0: 43635.6. Samples: 1061734360. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 20:28:03,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 20:28:05,531][06909] Updated weights for policy 0, policy_version 70733 (0.0043) [2024-06-27 20:28:08,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 1159036928. Throughput: 0: 43797.7. Samples: 1061988440. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 20:28:08,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:28:09,297][06909] Updated weights for policy 0, policy_version 70743 (0.0026) [2024-06-27 20:28:12,407][06887] Signal inference workers to stop experience collection... (15100 times) [2024-06-27 20:28:12,463][06909] InferenceWorker_p0-w0: stopping experience collection (15100 times) [2024-06-27 20:28:12,469][06887] Signal inference workers to resume experience collection... (15100 times) [2024-06-27 20:28:12,478][06909] InferenceWorker_p0-w0: resuming experience collection (15100 times) [2024-06-27 20:28:12,759][06909] Updated weights for policy 0, policy_version 70753 (0.0044) [2024-06-27 20:28:13,852][06674] Fps is (10 sec: 45867.1, 60 sec: 43416.3, 300 sec: 43597.8). Total num frames: 1159249920. Throughput: 0: 43814.6. Samples: 1062129660. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-27 20:28:13,852][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:28:16,826][06909] Updated weights for policy 0, policy_version 70763 (0.0026) [2024-06-27 20:28:18,851][06674] Fps is (10 sec: 42593.8, 60 sec: 43689.9, 300 sec: 43653.5). Total num frames: 1159462912. Throughput: 0: 43788.2. Samples: 1062391320. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-27 20:28:18,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:28:20,155][06909] Updated weights for policy 0, policy_version 70773 (0.0025) [2024-06-27 20:28:23,856][06674] Fps is (10 sec: 44218.0, 60 sec: 43686.3, 300 sec: 43652.7). Total num frames: 1159692288. Throughput: 0: 43644.4. Samples: 1062645440. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-27 20:28:23,857][06674] Avg episode reward: [(0, '0.395')] [2024-06-27 20:28:24,762][06909] Updated weights for policy 0, policy_version 70783 (0.0029) [2024-06-27 20:28:27,922][06909] Updated weights for policy 0, policy_version 70793 (0.0029) [2024-06-27 20:28:28,852][06674] Fps is (10 sec: 44232.9, 60 sec: 43689.1, 300 sec: 43597.8). Total num frames: 1159905280. Throughput: 0: 43684.6. Samples: 1062781500. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-27 20:28:28,852][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:28:31,998][06909] Updated weights for policy 0, policy_version 70803 (0.0034) [2024-06-27 20:28:33,852][06674] Fps is (10 sec: 42615.6, 60 sec: 43689.2, 300 sec: 43653.3). Total num frames: 1160118272. Throughput: 0: 43633.2. Samples: 1063041280. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-27 20:28:33,852][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:28:35,534][06909] Updated weights for policy 0, policy_version 70813 (0.0041) [2024-06-27 20:28:38,850][06674] Fps is (10 sec: 44245.5, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 1160347648. Throughput: 0: 43730.1. Samples: 1063300400. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-27 20:28:38,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:28:39,427][06909] Updated weights for policy 0, policy_version 70823 (0.0038) [2024-06-27 20:28:43,164][06909] Updated weights for policy 0, policy_version 70833 (0.0034) [2024-06-27 20:28:43,850][06674] Fps is (10 sec: 42607.3, 60 sec: 43417.6, 300 sec: 43542.6). Total num frames: 1160544256. Throughput: 0: 43630.2. Samples: 1063434540. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-27 20:28:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:28:46,881][06909] Updated weights for policy 0, policy_version 70843 (0.0025) [2024-06-27 20:28:48,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 1160773632. Throughput: 0: 43492.5. Samples: 1063691520. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-27 20:28:48,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:28:50,479][06909] Updated weights for policy 0, policy_version 70853 (0.0021) [2024-06-27 20:28:53,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43690.6, 300 sec: 43653.7). Total num frames: 1161003008. Throughput: 0: 43853.4. Samples: 1063961840. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-27 20:28:53,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:28:54,161][06909] Updated weights for policy 0, policy_version 70863 (0.0022) [2024-06-27 20:28:57,895][06909] Updated weights for policy 0, policy_version 70873 (0.0035) [2024-06-27 20:28:58,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43417.5, 300 sec: 43653.6). Total num frames: 1161199616. Throughput: 0: 43645.3. Samples: 1064093620. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 20:28:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:29:02,031][06909] Updated weights for policy 0, policy_version 70883 (0.0036) [2024-06-27 20:29:03,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 1161412608. Throughput: 0: 43567.4. Samples: 1064351800. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 20:29:03,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:29:05,661][06909] Updated weights for policy 0, policy_version 70893 (0.0029) [2024-06-27 20:29:08,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43417.6, 300 sec: 43542.6). Total num frames: 1161641984. Throughput: 0: 43584.9. Samples: 1064606500. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 20:29:08,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:29:09,328][06909] Updated weights for policy 0, policy_version 70903 (0.0048) [2024-06-27 20:29:13,128][06909] Updated weights for policy 0, policy_version 70913 (0.0028) [2024-06-27 20:29:13,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43419.0, 300 sec: 43542.6). Total num frames: 1161854976. Throughput: 0: 43526.9. Samples: 1064740120. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 20:29:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:29:16,899][06909] Updated weights for policy 0, policy_version 70923 (0.0032) [2024-06-27 20:29:18,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43418.4, 300 sec: 43542.9). Total num frames: 1162067968. Throughput: 0: 43672.6. Samples: 1065006460. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 20:29:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:29:20,539][06909] Updated weights for policy 0, policy_version 70933 (0.0034) [2024-06-27 20:29:23,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43421.9, 300 sec: 43542.6). Total num frames: 1162297344. Throughput: 0: 43647.1. Samples: 1065264520. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 20:29:23,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:29:24,218][06909] Updated weights for policy 0, policy_version 70943 (0.0025) [2024-06-27 20:29:28,275][06909] Updated weights for policy 0, policy_version 70953 (0.0028) [2024-06-27 20:29:28,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43692.2, 300 sec: 43709.2). Total num frames: 1162526720. Throughput: 0: 43604.4. Samples: 1065396740. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 20:29:28,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 20:29:31,426][06887] Signal inference workers to stop experience collection... (15150 times) [2024-06-27 20:29:31,426][06887] Signal inference workers to resume experience collection... (15150 times) [2024-06-27 20:29:31,440][06909] InferenceWorker_p0-w0: stopping experience collection (15150 times) [2024-06-27 20:29:31,440][06909] InferenceWorker_p0-w0: resuming experience collection (15150 times) [2024-06-27 20:29:31,585][06909] Updated weights for policy 0, policy_version 70963 (0.0045) [2024-06-27 20:29:33,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43419.0, 300 sec: 43598.1). Total num frames: 1162723328. Throughput: 0: 43788.8. Samples: 1065662020. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 20:29:33,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 20:29:35,580][06909] Updated weights for policy 0, policy_version 70973 (0.0024) [2024-06-27 20:29:38,850][06674] Fps is (10 sec: 42597.7, 60 sec: 43417.6, 300 sec: 43542.5). Total num frames: 1162952704. Throughput: 0: 43534.6. Samples: 1065920900. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 20:29:38,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:29:39,580][06909] Updated weights for policy 0, policy_version 70983 (0.0035) [2024-06-27 20:29:43,306][06909] Updated weights for policy 0, policy_version 70993 (0.0034) [2024-06-27 20:29:43,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 1163165696. Throughput: 0: 43547.6. Samples: 1066053260. Policy #0 lag: (min: 0.0, avg: 11.0, max: 24.0) [2024-06-27 20:29:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:29:47,325][06909] Updated weights for policy 0, policy_version 71003 (0.0045) [2024-06-27 20:29:48,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 1163378688. Throughput: 0: 43547.0. Samples: 1066311420. Policy #0 lag: (min: 0.0, avg: 11.0, max: 24.0) [2024-06-27 20:29:48,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:29:48,872][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000071007_1163378688.pth... [2024-06-27 20:29:48,921][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000070369_1152925696.pth [2024-06-27 20:29:50,544][06909] Updated weights for policy 0, policy_version 71013 (0.0032) [2024-06-27 20:29:53,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43144.7, 300 sec: 43487.0). Total num frames: 1163591680. Throughput: 0: 43656.6. Samples: 1066571040. Policy #0 lag: (min: 0.0, avg: 11.0, max: 24.0) [2024-06-27 20:29:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:29:54,679][06909] Updated weights for policy 0, policy_version 71023 (0.0028) [2024-06-27 20:29:57,979][06909] Updated weights for policy 0, policy_version 71033 (0.0038) [2024-06-27 20:29:58,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 1163821056. Throughput: 0: 43690.0. Samples: 1066706180. Policy #0 lag: (min: 0.0, avg: 11.0, max: 24.0) [2024-06-27 20:29:58,851][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:30:02,148][06909] Updated weights for policy 0, policy_version 71043 (0.0035) [2024-06-27 20:30:03,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 1164034048. Throughput: 0: 43600.0. Samples: 1066968460. Policy #0 lag: (min: 0.0, avg: 11.0, max: 24.0) [2024-06-27 20:30:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:30:05,696][06909] Updated weights for policy 0, policy_version 71053 (0.0039) [2024-06-27 20:30:08,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43690.7, 300 sec: 43542.6). Total num frames: 1164263424. Throughput: 0: 43540.0. Samples: 1067223820. Policy #0 lag: (min: 0.0, avg: 11.0, max: 24.0) [2024-06-27 20:30:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:30:09,582][06909] Updated weights for policy 0, policy_version 71063 (0.0038) [2024-06-27 20:30:13,095][06909] Updated weights for policy 0, policy_version 71073 (0.0046) [2024-06-27 20:30:13,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43963.6, 300 sec: 43598.1). Total num frames: 1164492800. Throughput: 0: 43656.7. Samples: 1067361300. Policy #0 lag: (min: 0.0, avg: 11.0, max: 24.0) [2024-06-27 20:30:13,851][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:30:16,962][06909] Updated weights for policy 0, policy_version 71083 (0.0027) [2024-06-27 20:30:18,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.6, 300 sec: 43542.6). Total num frames: 1164689408. Throughput: 0: 43596.5. Samples: 1067623860. Policy #0 lag: (min: 0.0, avg: 11.0, max: 24.0) [2024-06-27 20:30:18,852][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:30:20,483][06909] Updated weights for policy 0, policy_version 71093 (0.0032) [2024-06-27 20:30:23,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43417.6, 300 sec: 43487.0). Total num frames: 1164902400. Throughput: 0: 43647.2. Samples: 1067885020. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 20:30:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:30:24,898][06909] Updated weights for policy 0, policy_version 71103 (0.0030) [2024-06-27 20:30:28,093][06909] Updated weights for policy 0, policy_version 71113 (0.0022) [2024-06-27 20:30:28,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43417.6, 300 sec: 43542.6). Total num frames: 1165131776. Throughput: 0: 43542.6. Samples: 1068012680. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 20:30:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:30:32,345][06909] Updated weights for policy 0, policy_version 71123 (0.0036) [2024-06-27 20:30:33,856][06674] Fps is (10 sec: 44211.8, 60 sec: 43686.6, 300 sec: 43542.6). Total num frames: 1165344768. Throughput: 0: 43621.2. Samples: 1068274620. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 20:30:33,856][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:30:35,645][06909] Updated weights for policy 0, policy_version 71133 (0.0025) [2024-06-27 20:30:38,852][06674] Fps is (10 sec: 42589.7, 60 sec: 43416.2, 300 sec: 43486.7). Total num frames: 1165557760. Throughput: 0: 43552.6. Samples: 1068531000. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 20:30:38,852][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 20:30:39,961][06909] Updated weights for policy 0, policy_version 71143 (0.0033) [2024-06-27 20:30:43,154][06909] Updated weights for policy 0, policy_version 71153 (0.0031) [2024-06-27 20:30:43,850][06674] Fps is (10 sec: 45901.6, 60 sec: 43963.7, 300 sec: 43654.0). Total num frames: 1165803520. Throughput: 0: 43562.4. Samples: 1068666480. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 20:30:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:30:47,064][06909] Updated weights for policy 0, policy_version 71163 (0.0026) [2024-06-27 20:30:48,850][06674] Fps is (10 sec: 44245.7, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 1166000128. Throughput: 0: 43789.4. Samples: 1068938980. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 20:30:48,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:30:50,470][06909] Updated weights for policy 0, policy_version 71173 (0.0037) [2024-06-27 20:30:53,854][06674] Fps is (10 sec: 42581.2, 60 sec: 43960.7, 300 sec: 43597.5). Total num frames: 1166229504. Throughput: 0: 43822.4. Samples: 1069196000. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 20:30:53,854][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:30:54,678][06909] Updated weights for policy 0, policy_version 71183 (0.0026) [2024-06-27 20:30:57,900][06909] Updated weights for policy 0, policy_version 71193 (0.0032) [2024-06-27 20:30:58,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43963.9, 300 sec: 43653.7). Total num frames: 1166458880. Throughput: 0: 43835.3. Samples: 1069333880. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 20:30:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:31:02,315][06909] Updated weights for policy 0, policy_version 71203 (0.0054) [2024-06-27 20:31:03,852][06674] Fps is (10 sec: 42606.7, 60 sec: 43689.2, 300 sec: 43542.3). Total num frames: 1166655488. Throughput: 0: 43769.2. Samples: 1069593560. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 20:31:03,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 20:31:05,278][06887] Signal inference workers to stop experience collection... (15200 times) [2024-06-27 20:31:05,329][06909] InferenceWorker_p0-w0: stopping experience collection (15200 times) [2024-06-27 20:31:05,335][06887] Signal inference workers to resume experience collection... (15200 times) [2024-06-27 20:31:05,352][06909] InferenceWorker_p0-w0: resuming experience collection (15200 times) [2024-06-27 20:31:05,469][06909] Updated weights for policy 0, policy_version 71213 (0.0023) [2024-06-27 20:31:08,850][06674] Fps is (10 sec: 42597.4, 60 sec: 43690.6, 300 sec: 43542.5). Total num frames: 1166884864. Throughput: 0: 43617.7. Samples: 1069847820. Policy #0 lag: (min: 0.0, avg: 10.5, max: 19.0) [2024-06-27 20:31:08,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:31:09,600][06909] Updated weights for policy 0, policy_version 71223 (0.0031) [2024-06-27 20:31:12,887][06909] Updated weights for policy 0, policy_version 71233 (0.0022) [2024-06-27 20:31:13,850][06674] Fps is (10 sec: 44246.1, 60 sec: 43417.7, 300 sec: 43653.7). Total num frames: 1167097856. Throughput: 0: 43672.5. Samples: 1069977940. Policy #0 lag: (min: 0.0, avg: 10.5, max: 19.0) [2024-06-27 20:31:13,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:31:17,076][06909] Updated weights for policy 0, policy_version 71243 (0.0038) [2024-06-27 20:31:18,850][06674] Fps is (10 sec: 44237.7, 60 sec: 43963.8, 300 sec: 43653.6). Total num frames: 1167327232. Throughput: 0: 43751.4. Samples: 1070243180. Policy #0 lag: (min: 0.0, avg: 10.5, max: 19.0) [2024-06-27 20:31:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 20:31:20,633][06909] Updated weights for policy 0, policy_version 71253 (0.0028) [2024-06-27 20:31:23,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 1167523840. Throughput: 0: 43876.7. Samples: 1070505360. Policy #0 lag: (min: 0.0, avg: 10.5, max: 19.0) [2024-06-27 20:31:23,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:31:24,519][06909] Updated weights for policy 0, policy_version 71263 (0.0041) [2024-06-27 20:31:28,093][06909] Updated weights for policy 0, policy_version 71273 (0.0039) [2024-06-27 20:31:28,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 1167769600. Throughput: 0: 43824.8. Samples: 1070638600. Policy #0 lag: (min: 0.0, avg: 10.5, max: 19.0) [2024-06-27 20:31:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:31:32,267][06909] Updated weights for policy 0, policy_version 71283 (0.0022) [2024-06-27 20:31:33,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43694.8, 300 sec: 43598.1). Total num frames: 1167966208. Throughput: 0: 43700.0. Samples: 1070905480. Policy #0 lag: (min: 0.0, avg: 10.5, max: 19.0) [2024-06-27 20:31:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 20:31:35,426][06909] Updated weights for policy 0, policy_version 71293 (0.0028) [2024-06-27 20:31:38,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44238.3, 300 sec: 43709.2). Total num frames: 1168211968. Throughput: 0: 43804.3. Samples: 1071167020. Policy #0 lag: (min: 0.0, avg: 10.5, max: 19.0) [2024-06-27 20:31:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:31:39,607][06909] Updated weights for policy 0, policy_version 71303 (0.0039) [2024-06-27 20:31:43,070][06909] Updated weights for policy 0, policy_version 71313 (0.0045) [2024-06-27 20:31:43,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 1168424960. Throughput: 0: 43703.9. Samples: 1071300560. Policy #0 lag: (min: 0.0, avg: 10.5, max: 19.0) [2024-06-27 20:31:43,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 20:31:47,111][06909] Updated weights for policy 0, policy_version 71323 (0.0029) [2024-06-27 20:31:48,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43963.6, 300 sec: 43653.6). Total num frames: 1168637952. Throughput: 0: 43775.6. Samples: 1071563380. Policy #0 lag: (min: 0.0, avg: 10.5, max: 19.0) [2024-06-27 20:31:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:31:48,866][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000071328_1168637952.pth... [2024-06-27 20:31:48,931][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000070688_1158152192.pth [2024-06-27 20:31:50,354][06909] Updated weights for policy 0, policy_version 71333 (0.0031) [2024-06-27 20:31:53,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43420.5, 300 sec: 43487.0). Total num frames: 1168834560. Throughput: 0: 43851.7. Samples: 1071821140. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 20:31:53,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:31:54,555][06909] Updated weights for policy 0, policy_version 71343 (0.0037) [2024-06-27 20:31:58,017][06909] Updated weights for policy 0, policy_version 71353 (0.0030) [2024-06-27 20:31:58,852][06674] Fps is (10 sec: 44228.5, 60 sec: 43689.1, 300 sec: 43708.9). Total num frames: 1169080320. Throughput: 0: 43790.8. Samples: 1071948620. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 20:31:58,853][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 20:32:01,983][06909] Updated weights for policy 0, policy_version 71363 (0.0027) [2024-06-27 20:32:03,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43692.2, 300 sec: 43598.1). Total num frames: 1169276928. Throughput: 0: 43815.6. Samples: 1072214880. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 20:32:03,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:32:05,621][06909] Updated weights for policy 0, policy_version 71373 (0.0030) [2024-06-27 20:32:08,850][06674] Fps is (10 sec: 42607.4, 60 sec: 43690.8, 300 sec: 43598.1). Total num frames: 1169506304. Throughput: 0: 43701.7. Samples: 1072471940. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 20:32:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:32:09,602][06909] Updated weights for policy 0, policy_version 71383 (0.0031) [2024-06-27 20:32:12,949][06909] Updated weights for policy 0, policy_version 71393 (0.0032) [2024-06-27 20:32:13,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 1169735680. Throughput: 0: 43756.9. Samples: 1072607660. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 20:32:13,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:32:17,258][06909] Updated weights for policy 0, policy_version 71403 (0.0039) [2024-06-27 20:32:18,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43417.5, 300 sec: 43598.1). Total num frames: 1169932288. Throughput: 0: 43646.2. Samples: 1072869560. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 20:32:18,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:32:20,650][06909] Updated weights for policy 0, policy_version 71413 (0.0034) [2024-06-27 20:32:23,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 43653.6). Total num frames: 1170161664. Throughput: 0: 43444.5. Samples: 1073122020. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 20:32:23,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:32:24,804][06909] Updated weights for policy 0, policy_version 71423 (0.0037) [2024-06-27 20:32:28,231][06909] Updated weights for policy 0, policy_version 71433 (0.0029) [2024-06-27 20:32:28,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 1170391040. Throughput: 0: 43584.5. Samples: 1073261860. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 20:32:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:32:32,027][06909] Updated weights for policy 0, policy_version 71443 (0.0031) [2024-06-27 20:32:33,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 43653.6). Total num frames: 1170604032. Throughput: 0: 43549.0. Samples: 1073523080. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 20:32:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:32:35,444][06909] Updated weights for policy 0, policy_version 71453 (0.0033) [2024-06-27 20:32:38,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43417.6, 300 sec: 43653.6). Total num frames: 1170817024. Throughput: 0: 43610.2. Samples: 1073783600. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 20:32:38,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:32:39,571][06909] Updated weights for policy 0, policy_version 71463 (0.0031) [2024-06-27 20:32:42,775][06909] Updated weights for policy 0, policy_version 71473 (0.0025) [2024-06-27 20:32:43,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43963.7, 300 sec: 43764.7). Total num frames: 1171062784. Throughput: 0: 43749.2. Samples: 1073917240. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 20:32:43,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 20:32:47,284][06909] Updated weights for policy 0, policy_version 71483 (0.0044) [2024-06-27 20:32:48,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43417.7, 300 sec: 43598.1). Total num frames: 1171243008. Throughput: 0: 43543.8. Samples: 1074174360. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 20:32:48,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:32:50,596][06909] Updated weights for policy 0, policy_version 71493 (0.0039) [2024-06-27 20:32:52,350][06887] Signal inference workers to stop experience collection... (15250 times) [2024-06-27 20:32:52,350][06887] Signal inference workers to resume experience collection... (15250 times) [2024-06-27 20:32:52,374][06909] InferenceWorker_p0-w0: stopping experience collection (15250 times) [2024-06-27 20:32:52,374][06909] InferenceWorker_p0-w0: resuming experience collection (15250 times) [2024-06-27 20:32:53,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43963.7, 300 sec: 43653.6). Total num frames: 1171472384. Throughput: 0: 43595.9. Samples: 1074433760. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 20:32:53,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:32:54,857][06909] Updated weights for policy 0, policy_version 71503 (0.0041) [2024-06-27 20:32:57,965][06909] Updated weights for policy 0, policy_version 71513 (0.0021) [2024-06-27 20:32:58,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43692.2, 300 sec: 43764.7). Total num frames: 1171701760. Throughput: 0: 43649.3. Samples: 1074571880. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 20:32:58,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:33:02,020][06909] Updated weights for policy 0, policy_version 71523 (0.0035) [2024-06-27 20:33:03,853][06674] Fps is (10 sec: 44223.8, 60 sec: 43961.5, 300 sec: 43653.2). Total num frames: 1171914752. Throughput: 0: 43672.7. Samples: 1074834960. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 20:33:03,854][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:33:05,566][06909] Updated weights for policy 0, policy_version 71533 (0.0024) [2024-06-27 20:33:08,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.7, 300 sec: 43653.9). Total num frames: 1172127744. Throughput: 0: 43957.8. Samples: 1075100120. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 20:33:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 20:33:09,222][06909] Updated weights for policy 0, policy_version 71543 (0.0023) [2024-06-27 20:33:12,854][06909] Updated weights for policy 0, policy_version 71553 (0.0036) [2024-06-27 20:33:13,856][06674] Fps is (10 sec: 44223.4, 60 sec: 43686.3, 300 sec: 43708.5). Total num frames: 1172357120. Throughput: 0: 43758.5. Samples: 1075231260. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 20:33:13,856][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 20:33:17,026][06909] Updated weights for policy 0, policy_version 71563 (0.0035) [2024-06-27 20:33:18,850][06674] Fps is (10 sec: 44236.0, 60 sec: 43963.7, 300 sec: 43654.5). Total num frames: 1172570112. Throughput: 0: 43668.3. Samples: 1075488160. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-27 20:33:18,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:33:20,276][06909] Updated weights for policy 0, policy_version 71573 (0.0036) [2024-06-27 20:33:23,850][06674] Fps is (10 sec: 40984.0, 60 sec: 43417.5, 300 sec: 43598.4). Total num frames: 1172766720. Throughput: 0: 43714.0. Samples: 1075750740. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-27 20:33:23,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:33:24,415][06909] Updated weights for policy 0, policy_version 71583 (0.0032) [2024-06-27 20:33:27,991][06909] Updated weights for policy 0, policy_version 71593 (0.0028) [2024-06-27 20:33:28,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.5, 300 sec: 43709.5). Total num frames: 1173012480. Throughput: 0: 43618.1. Samples: 1075880060. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-27 20:33:28,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 20:33:32,192][06909] Updated weights for policy 0, policy_version 71603 (0.0032) [2024-06-27 20:33:33,850][06674] Fps is (10 sec: 44238.1, 60 sec: 43417.7, 300 sec: 43598.1). Total num frames: 1173209088. Throughput: 0: 43777.1. Samples: 1076144320. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-27 20:33:33,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:33:35,603][06909] Updated weights for policy 0, policy_version 71613 (0.0027) [2024-06-27 20:33:38,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.7, 300 sec: 43764.7). Total num frames: 1173454848. Throughput: 0: 43859.6. Samples: 1076407440. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-27 20:33:38,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:33:39,466][06909] Updated weights for policy 0, policy_version 71623 (0.0033) [2024-06-27 20:33:42,867][06909] Updated weights for policy 0, policy_version 71633 (0.0028) [2024-06-27 20:33:43,850][06674] Fps is (10 sec: 42597.9, 60 sec: 42871.4, 300 sec: 43598.1). Total num frames: 1173635072. Throughput: 0: 43681.3. Samples: 1076537540. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-27 20:33:43,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:33:46,817][06909] Updated weights for policy 0, policy_version 71643 (0.0034) [2024-06-27 20:33:48,856][06674] Fps is (10 sec: 42572.6, 60 sec: 43959.3, 300 sec: 43652.7). Total num frames: 1173880832. Throughput: 0: 43673.9. Samples: 1076800420. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-27 20:33:48,857][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:33:48,870][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000071648_1173880832.pth... [2024-06-27 20:33:48,963][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000071007_1163378688.pth [2024-06-27 20:33:50,225][06909] Updated weights for policy 0, policy_version 71653 (0.0030) [2024-06-27 20:33:53,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43417.7, 300 sec: 43653.7). Total num frames: 1174077440. Throughput: 0: 43468.0. Samples: 1077056180. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-27 20:33:53,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:33:54,512][06909] Updated weights for policy 0, policy_version 71663 (0.0026) [2024-06-27 20:33:58,053][06909] Updated weights for policy 0, policy_version 71673 (0.0022) [2024-06-27 20:33:58,850][06674] Fps is (10 sec: 44263.7, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 1174323200. Throughput: 0: 43430.7. Samples: 1077185380. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-27 20:33:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:34:02,053][06909] Updated weights for policy 0, policy_version 71683 (0.0031) [2024-06-27 20:34:03,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43419.8, 300 sec: 43653.7). Total num frames: 1174519808. Throughput: 0: 43595.3. Samples: 1077449940. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 20:34:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:34:05,652][06909] Updated weights for policy 0, policy_version 71693 (0.0032) [2024-06-27 20:34:08,850][06674] Fps is (10 sec: 40959.2, 60 sec: 43417.4, 300 sec: 43653.6). Total num frames: 1174732800. Throughput: 0: 43547.1. Samples: 1077710360. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 20:34:08,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:34:09,713][06909] Updated weights for policy 0, policy_version 71703 (0.0040) [2024-06-27 20:34:13,169][06909] Updated weights for policy 0, policy_version 71713 (0.0025) [2024-06-27 20:34:13,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43422.0, 300 sec: 43709.2). Total num frames: 1174962176. Throughput: 0: 43595.7. Samples: 1077841860. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 20:34:13,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:34:16,163][06887] Signal inference workers to stop experience collection... (15300 times) [2024-06-27 20:34:16,215][06909] InferenceWorker_p0-w0: stopping experience collection (15300 times) [2024-06-27 20:34:16,222][06887] Signal inference workers to resume experience collection... (15300 times) [2024-06-27 20:34:16,229][06909] InferenceWorker_p0-w0: resuming experience collection (15300 times) [2024-06-27 20:34:17,071][06909] Updated weights for policy 0, policy_version 71723 (0.0036) [2024-06-27 20:34:18,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43417.5, 300 sec: 43653.6). Total num frames: 1175175168. Throughput: 0: 43460.1. Samples: 1078100040. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 20:34:18,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:34:20,585][06909] Updated weights for policy 0, policy_version 71733 (0.0034) [2024-06-27 20:34:23,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.9, 300 sec: 43653.6). Total num frames: 1175404544. Throughput: 0: 43405.4. Samples: 1078360680. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 20:34:23,853][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:34:24,898][06909] Updated weights for policy 0, policy_version 71743 (0.0030) [2024-06-27 20:34:28,037][06909] Updated weights for policy 0, policy_version 71753 (0.0032) [2024-06-27 20:34:28,850][06674] Fps is (10 sec: 42599.5, 60 sec: 43144.6, 300 sec: 43653.7). Total num frames: 1175601152. Throughput: 0: 43388.0. Samples: 1078490000. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 20:34:28,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:34:32,242][06909] Updated weights for policy 0, policy_version 71763 (0.0029) [2024-06-27 20:34:33,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 1175830528. Throughput: 0: 43354.3. Samples: 1078751100. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 20:34:33,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:34:35,279][06909] Updated weights for policy 0, policy_version 71773 (0.0033) [2024-06-27 20:34:38,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43144.5, 300 sec: 43653.6). Total num frames: 1176043520. Throughput: 0: 43592.8. Samples: 1079017860. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 20:34:38,860][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:34:39,659][06909] Updated weights for policy 0, policy_version 71783 (0.0029) [2024-06-27 20:34:43,293][06909] Updated weights for policy 0, policy_version 71793 (0.0030) [2024-06-27 20:34:43,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 1176272896. Throughput: 0: 43549.8. Samples: 1079145120. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 20:34:43,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:34:47,422][06909] Updated weights for policy 0, policy_version 71803 (0.0036) [2024-06-27 20:34:48,852][06674] Fps is (10 sec: 45866.2, 60 sec: 43693.7, 300 sec: 43764.4). Total num frames: 1176502272. Throughput: 0: 43592.7. Samples: 1079411700. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 20:34:48,861][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:34:50,709][06909] Updated weights for policy 0, policy_version 71813 (0.0036) [2024-06-27 20:34:53,855][06674] Fps is (10 sec: 44214.7, 60 sec: 43960.0, 300 sec: 43708.5). Total num frames: 1176715264. Throughput: 0: 43601.6. Samples: 1079672640. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 20:34:53,855][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:34:54,818][06909] Updated weights for policy 0, policy_version 71823 (0.0037) [2024-06-27 20:34:58,271][06909] Updated weights for policy 0, policy_version 71833 (0.0037) [2024-06-27 20:34:58,850][06674] Fps is (10 sec: 42607.3, 60 sec: 43417.7, 300 sec: 43709.2). Total num frames: 1176928256. Throughput: 0: 43455.6. Samples: 1079797360. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 20:34:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:35:02,485][06909] Updated weights for policy 0, policy_version 71843 (0.0027) [2024-06-27 20:35:03,850][06674] Fps is (10 sec: 42619.9, 60 sec: 43690.7, 300 sec: 43653.7). Total num frames: 1177141248. Throughput: 0: 43634.1. Samples: 1080063560. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 20:35:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:35:05,502][06909] Updated weights for policy 0, policy_version 71853 (0.0029) [2024-06-27 20:35:08,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43963.9, 300 sec: 43653.6). Total num frames: 1177370624. Throughput: 0: 43653.8. Samples: 1080325100. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 20:35:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:35:09,863][06909] Updated weights for policy 0, policy_version 71863 (0.0032) [2024-06-27 20:35:12,937][06909] Updated weights for policy 0, policy_version 71873 (0.0028) [2024-06-27 20:35:13,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 1177583616. Throughput: 0: 43560.0. Samples: 1080450200. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 20:35:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 20:35:17,183][06909] Updated weights for policy 0, policy_version 71883 (0.0039) [2024-06-27 20:35:18,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.8, 300 sec: 43709.2). Total num frames: 1177796608. Throughput: 0: 43601.8. Samples: 1080713180. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 20:35:18,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:35:20,957][06909] Updated weights for policy 0, policy_version 71893 (0.0032) [2024-06-27 20:35:23,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43144.6, 300 sec: 43598.1). Total num frames: 1177993216. Throughput: 0: 43622.3. Samples: 1080980860. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 20:35:23,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:35:24,736][06909] Updated weights for policy 0, policy_version 71903 (0.0031) [2024-06-27 20:35:25,913][06887] Signal inference workers to stop experience collection... (15350 times) [2024-06-27 20:35:25,937][06909] InferenceWorker_p0-w0: stopping experience collection (15350 times) [2024-06-27 20:35:26,036][06887] Signal inference workers to resume experience collection... (15350 times) [2024-06-27 20:35:26,036][06909] InferenceWorker_p0-w0: resuming experience collection (15350 times) [2024-06-27 20:35:28,242][06909] Updated weights for policy 0, policy_version 71913 (0.0032) [2024-06-27 20:35:28,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 43710.0). Total num frames: 1178238976. Throughput: 0: 43530.2. Samples: 1081103980. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 20:35:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:35:32,010][06909] Updated weights for policy 0, policy_version 71923 (0.0022) [2024-06-27 20:35:33,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43690.8, 300 sec: 43709.5). Total num frames: 1178451968. Throughput: 0: 43590.5. Samples: 1081373180. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-27 20:35:33,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:35:36,065][06909] Updated weights for policy 0, policy_version 71933 (0.0025) [2024-06-27 20:35:38,852][06674] Fps is (10 sec: 44227.8, 60 sec: 43962.3, 300 sec: 43653.3). Total num frames: 1178681344. Throughput: 0: 43646.4. Samples: 1081636600. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-27 20:35:38,853][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:35:39,619][06909] Updated weights for policy 0, policy_version 71943 (0.0025) [2024-06-27 20:35:43,379][06909] Updated weights for policy 0, policy_version 71953 (0.0025) [2024-06-27 20:35:43,852][06674] Fps is (10 sec: 44227.4, 60 sec: 43689.2, 300 sec: 43708.9). Total num frames: 1178894336. Throughput: 0: 43581.1. Samples: 1081758600. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-27 20:35:43,852][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:35:47,045][06909] Updated weights for policy 0, policy_version 71963 (0.0026) [2024-06-27 20:35:48,850][06674] Fps is (10 sec: 44245.8, 60 sec: 43692.1, 300 sec: 43709.8). Total num frames: 1179123712. Throughput: 0: 43644.4. Samples: 1082027560. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-27 20:35:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:35:48,856][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000071968_1179123712.pth... [2024-06-27 20:35:48,913][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000071328_1168637952.pth [2024-06-27 20:35:50,756][06909] Updated weights for policy 0, policy_version 71973 (0.0028) [2024-06-27 20:35:53,850][06674] Fps is (10 sec: 39329.6, 60 sec: 42875.0, 300 sec: 43487.0). Total num frames: 1179287552. Throughput: 0: 43632.1. Samples: 1082288540. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-27 20:35:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:35:54,717][06909] Updated weights for policy 0, policy_version 71983 (0.0038) [2024-06-27 20:35:58,435][06909] Updated weights for policy 0, policy_version 71993 (0.0034) [2024-06-27 20:35:58,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.6, 300 sec: 43709.5). Total num frames: 1179549696. Throughput: 0: 43666.6. Samples: 1082415200. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-27 20:35:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:36:02,189][06909] Updated weights for policy 0, policy_version 72003 (0.0043) [2024-06-27 20:36:03,850][06674] Fps is (10 sec: 47513.7, 60 sec: 43690.6, 300 sec: 43653.7). Total num frames: 1179762688. Throughput: 0: 43559.2. Samples: 1082673340. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-27 20:36:03,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:36:05,868][06909] Updated weights for policy 0, policy_version 72013 (0.0041) [2024-06-27 20:36:08,850][06674] Fps is (10 sec: 39321.8, 60 sec: 42871.5, 300 sec: 43542.6). Total num frames: 1179942912. Throughput: 0: 43413.3. Samples: 1082934460. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-27 20:36:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:36:09,669][06909] Updated weights for policy 0, policy_version 72023 (0.0026) [2024-06-27 20:36:13,469][06909] Updated weights for policy 0, policy_version 72033 (0.0038) [2024-06-27 20:36:13,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43417.6, 300 sec: 43598.1). Total num frames: 1180188672. Throughput: 0: 43593.9. Samples: 1083065700. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-27 20:36:13,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:36:17,241][06909] Updated weights for policy 0, policy_version 72043 (0.0037) [2024-06-27 20:36:18,850][06674] Fps is (10 sec: 47513.5, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 1180418048. Throughput: 0: 43486.6. Samples: 1083330080. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 20:36:18,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:36:21,332][06909] Updated weights for policy 0, policy_version 72053 (0.0031) [2024-06-27 20:36:23,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43690.7, 300 sec: 43542.6). Total num frames: 1180614656. Throughput: 0: 43539.8. Samples: 1083595800. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 20:36:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 20:36:24,990][06909] Updated weights for policy 0, policy_version 72063 (0.0030) [2024-06-27 20:36:28,615][06909] Updated weights for policy 0, policy_version 72073 (0.0031) [2024-06-27 20:36:28,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 1180860416. Throughput: 0: 43626.9. Samples: 1083721720. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 20:36:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:36:32,435][06909] Updated weights for policy 0, policy_version 72083 (0.0045) [2024-06-27 20:36:33,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43690.7, 300 sec: 43598.1). Total num frames: 1181073408. Throughput: 0: 43671.2. Samples: 1083992760. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 20:36:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:36:35,835][06909] Updated weights for policy 0, policy_version 72093 (0.0030) [2024-06-27 20:36:38,852][06674] Fps is (10 sec: 39313.4, 60 sec: 42871.5, 300 sec: 43486.7). Total num frames: 1181253632. Throughput: 0: 43724.2. Samples: 1084256220. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 20:36:38,852][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:36:39,880][06909] Updated weights for policy 0, policy_version 72103 (0.0030) [2024-06-27 20:36:43,567][06909] Updated weights for policy 0, policy_version 72113 (0.0036) [2024-06-27 20:36:43,850][06674] Fps is (10 sec: 42597.3, 60 sec: 43419.0, 300 sec: 43598.1). Total num frames: 1181499392. Throughput: 0: 43704.3. Samples: 1084381900. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 20:36:43,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:36:44,979][06887] Signal inference workers to stop experience collection... (15400 times) [2024-06-27 20:36:44,979][06887] Signal inference workers to resume experience collection... (15400 times) [2024-06-27 20:36:45,008][06909] InferenceWorker_p0-w0: stopping experience collection (15400 times) [2024-06-27 20:36:45,012][06909] InferenceWorker_p0-w0: resuming experience collection (15400 times) [2024-06-27 20:36:47,398][06909] Updated weights for policy 0, policy_version 72123 (0.0026) [2024-06-27 20:36:48,852][06674] Fps is (10 sec: 47512.5, 60 sec: 43416.0, 300 sec: 43708.8). Total num frames: 1181728768. Throughput: 0: 43840.4. Samples: 1084646260. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 20:36:48,853][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 20:36:51,190][06909] Updated weights for policy 0, policy_version 72133 (0.0030) [2024-06-27 20:36:53,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43963.7, 300 sec: 43542.9). Total num frames: 1181925376. Throughput: 0: 43825.8. Samples: 1084906620. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 20:36:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 20:36:54,769][06909] Updated weights for policy 0, policy_version 72143 (0.0040) [2024-06-27 20:36:58,753][06909] Updated weights for policy 0, policy_version 72153 (0.0026) [2024-06-27 20:36:58,850][06674] Fps is (10 sec: 42607.2, 60 sec: 43417.5, 300 sec: 43653.6). Total num frames: 1182154752. Throughput: 0: 43647.3. Samples: 1085029840. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 20:36:58,851][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 20:37:02,271][06909] Updated weights for policy 0, policy_version 72163 (0.0035) [2024-06-27 20:37:03,852][06674] Fps is (10 sec: 47503.8, 60 sec: 43962.2, 300 sec: 43708.9). Total num frames: 1182400512. Throughput: 0: 43607.4. Samples: 1085292500. Policy #0 lag: (min: 1.0, avg: 9.7, max: 21.0) [2024-06-27 20:37:03,852][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:37:06,799][06909] Updated weights for policy 0, policy_version 72173 (0.0028) [2024-06-27 20:37:08,850][06674] Fps is (10 sec: 40960.7, 60 sec: 43690.6, 300 sec: 43487.0). Total num frames: 1182564352. Throughput: 0: 43671.5. Samples: 1085561020. Policy #0 lag: (min: 1.0, avg: 9.7, max: 21.0) [2024-06-27 20:37:08,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:37:09,905][06909] Updated weights for policy 0, policy_version 72183 (0.0028) [2024-06-27 20:37:13,850][06674] Fps is (10 sec: 39329.5, 60 sec: 43417.5, 300 sec: 43598.1). Total num frames: 1182793728. Throughput: 0: 43541.7. Samples: 1085681100. Policy #0 lag: (min: 1.0, avg: 9.7, max: 21.0) [2024-06-27 20:37:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:37:14,100][06909] Updated weights for policy 0, policy_version 72193 (0.0026) [2024-06-27 20:37:17,642][06909] Updated weights for policy 0, policy_version 72203 (0.0037) [2024-06-27 20:37:18,850][06674] Fps is (10 sec: 49151.9, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 1183055872. Throughput: 0: 43490.5. Samples: 1085949840. Policy #0 lag: (min: 1.0, avg: 9.7, max: 21.0) [2024-06-27 20:37:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:37:21,427][06909] Updated weights for policy 0, policy_version 72213 (0.0030) [2024-06-27 20:37:23,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.7, 300 sec: 43542.6). Total num frames: 1183236096. Throughput: 0: 43493.1. Samples: 1086213320. Policy #0 lag: (min: 1.0, avg: 9.7, max: 21.0) [2024-06-27 20:37:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:37:25,027][06909] Updated weights for policy 0, policy_version 72223 (0.0024) [2024-06-27 20:37:28,856][06674] Fps is (10 sec: 39298.4, 60 sec: 43140.2, 300 sec: 43541.7). Total num frames: 1183449088. Throughput: 0: 43499.7. Samples: 1086339640. Policy #0 lag: (min: 1.0, avg: 9.7, max: 21.0) [2024-06-27 20:37:28,856][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:37:29,408][06909] Updated weights for policy 0, policy_version 72233 (0.0042) [2024-06-27 20:37:32,583][06909] Updated weights for policy 0, policy_version 72243 (0.0030) [2024-06-27 20:37:33,850][06674] Fps is (10 sec: 45874.5, 60 sec: 43690.5, 300 sec: 43653.6). Total num frames: 1183694848. Throughput: 0: 43489.2. Samples: 1086603180. Policy #0 lag: (min: 1.0, avg: 9.7, max: 21.0) [2024-06-27 20:37:33,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:37:36,773][06909] Updated weights for policy 0, policy_version 72253 (0.0048) [2024-06-27 20:37:38,850][06674] Fps is (10 sec: 44263.8, 60 sec: 43965.3, 300 sec: 43487.0). Total num frames: 1183891456. Throughput: 0: 43510.8. Samples: 1086864600. Policy #0 lag: (min: 1.0, avg: 9.7, max: 21.0) [2024-06-27 20:37:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:37:39,926][06909] Updated weights for policy 0, policy_version 72263 (0.0031) [2024-06-27 20:37:43,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43417.7, 300 sec: 43598.1). Total num frames: 1184104448. Throughput: 0: 43586.8. Samples: 1086991240. Policy #0 lag: (min: 1.0, avg: 9.7, max: 21.0) [2024-06-27 20:37:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:37:44,267][06909] Updated weights for policy 0, policy_version 72273 (0.0030) [2024-06-27 20:37:47,493][06909] Updated weights for policy 0, policy_version 72283 (0.0030) [2024-06-27 20:37:48,850][06674] Fps is (10 sec: 45873.9, 60 sec: 43692.2, 300 sec: 43653.6). Total num frames: 1184350208. Throughput: 0: 43648.1. Samples: 1087256580. Policy #0 lag: (min: 0.0, avg: 11.8, max: 24.0) [2024-06-27 20:37:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 20:37:48,869][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000072287_1184350208.pth... [2024-06-27 20:37:48,927][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000071648_1173880832.pth [2024-06-27 20:37:51,796][06909] Updated weights for policy 0, policy_version 72293 (0.0031) [2024-06-27 20:37:53,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43417.6, 300 sec: 43487.0). Total num frames: 1184530432. Throughput: 0: 43378.7. Samples: 1087513060. Policy #0 lag: (min: 0.0, avg: 11.8, max: 24.0) [2024-06-27 20:37:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 20:37:54,859][06909] Updated weights for policy 0, policy_version 72303 (0.0025) [2024-06-27 20:37:58,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43417.7, 300 sec: 43543.0). Total num frames: 1184759808. Throughput: 0: 43487.5. Samples: 1087638040. Policy #0 lag: (min: 0.0, avg: 11.8, max: 24.0) [2024-06-27 20:37:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:37:59,103][06909] Updated weights for policy 0, policy_version 72313 (0.0026) [2024-06-27 20:38:02,646][06909] Updated weights for policy 0, policy_version 72323 (0.0040) [2024-06-27 20:38:03,850][06674] Fps is (10 sec: 47513.5, 60 sec: 43419.1, 300 sec: 43653.6). Total num frames: 1185005568. Throughput: 0: 43567.6. Samples: 1087910380. Policy #0 lag: (min: 0.0, avg: 11.8, max: 24.0) [2024-06-27 20:38:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 20:38:06,786][06909] Updated weights for policy 0, policy_version 72333 (0.0042) [2024-06-27 20:38:07,967][06887] Signal inference workers to stop experience collection... (15450 times) [2024-06-27 20:38:07,993][06909] InferenceWorker_p0-w0: stopping experience collection (15450 times) [2024-06-27 20:38:08,029][06887] Signal inference workers to resume experience collection... (15450 times) [2024-06-27 20:38:08,030][06909] InferenceWorker_p0-w0: resuming experience collection (15450 times) [2024-06-27 20:38:08,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43963.8, 300 sec: 43543.5). Total num frames: 1185202176. Throughput: 0: 43524.5. Samples: 1088171920. Policy #0 lag: (min: 0.0, avg: 11.8, max: 24.0) [2024-06-27 20:38:08,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:38:10,035][06909] Updated weights for policy 0, policy_version 72343 (0.0038) [2024-06-27 20:38:13,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43690.7, 300 sec: 43542.6). Total num frames: 1185415168. Throughput: 0: 43542.3. Samples: 1088298780. Policy #0 lag: (min: 0.0, avg: 11.8, max: 24.0) [2024-06-27 20:38:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:38:14,020][06909] Updated weights for policy 0, policy_version 72353 (0.0031) [2024-06-27 20:38:17,446][06909] Updated weights for policy 0, policy_version 72363 (0.0033) [2024-06-27 20:38:18,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 1185660928. Throughput: 0: 43659.7. Samples: 1088567860. Policy #0 lag: (min: 0.0, avg: 11.8, max: 24.0) [2024-06-27 20:38:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:38:21,567][06909] Updated weights for policy 0, policy_version 72373 (0.0031) [2024-06-27 20:38:23,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43690.6, 300 sec: 43542.6). Total num frames: 1185857536. Throughput: 0: 43683.8. Samples: 1088830380. Policy #0 lag: (min: 0.0, avg: 11.8, max: 24.0) [2024-06-27 20:38:23,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:38:24,984][06909] Updated weights for policy 0, policy_version 72383 (0.0047) [2024-06-27 20:38:28,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43695.0, 300 sec: 43598.1). Total num frames: 1186070528. Throughput: 0: 43669.8. Samples: 1088956380. Policy #0 lag: (min: 0.0, avg: 11.8, max: 24.0) [2024-06-27 20:38:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:38:28,918][06909] Updated weights for policy 0, policy_version 72393 (0.0023) [2024-06-27 20:38:32,282][06909] Updated weights for policy 0, policy_version 72403 (0.0034) [2024-06-27 20:38:33,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43690.8, 300 sec: 43598.1). Total num frames: 1186316288. Throughput: 0: 43558.3. Samples: 1089216700. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2024-06-27 20:38:33,856][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:38:36,173][06909] Updated weights for policy 0, policy_version 72413 (0.0033) [2024-06-27 20:38:38,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43690.5, 300 sec: 43653.6). Total num frames: 1186512896. Throughput: 0: 43978.5. Samples: 1089492100. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2024-06-27 20:38:38,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 20:38:39,537][06909] Updated weights for policy 0, policy_version 72423 (0.0045) [2024-06-27 20:38:43,497][06909] Updated weights for policy 0, policy_version 72433 (0.0049) [2024-06-27 20:38:43,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.7, 300 sec: 43599.0). Total num frames: 1186742272. Throughput: 0: 43832.5. Samples: 1089610500. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2024-06-27 20:38:43,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:38:47,190][06909] Updated weights for policy 0, policy_version 72443 (0.0034) [2024-06-27 20:38:48,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 1186971648. Throughput: 0: 43746.2. Samples: 1089878960. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2024-06-27 20:38:48,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:38:51,339][06909] Updated weights for policy 0, policy_version 72453 (0.0033) [2024-06-27 20:38:53,852][06674] Fps is (10 sec: 40953.2, 60 sec: 43689.4, 300 sec: 43486.8). Total num frames: 1187151872. Throughput: 0: 43987.2. Samples: 1090151420. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2024-06-27 20:38:53,852][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:38:54,691][06909] Updated weights for policy 0, policy_version 72463 (0.0032) [2024-06-27 20:38:58,833][06909] Updated weights for policy 0, policy_version 72473 (0.0046) [2024-06-27 20:38:58,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.8, 300 sec: 43653.6). Total num frames: 1187397632. Throughput: 0: 43860.8. Samples: 1090272520. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2024-06-27 20:38:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:39:02,298][06909] Updated weights for policy 0, policy_version 72483 (0.0033) [2024-06-27 20:39:03,850][06674] Fps is (10 sec: 47522.0, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 1187627008. Throughput: 0: 43772.1. Samples: 1090537600. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2024-06-27 20:39:03,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:39:06,134][06909] Updated weights for policy 0, policy_version 72493 (0.0044) [2024-06-27 20:39:08,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 1187823616. Throughput: 0: 43891.6. Samples: 1090805500. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2024-06-27 20:39:08,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:39:09,691][06909] Updated weights for policy 0, policy_version 72503 (0.0036) [2024-06-27 20:39:12,529][06887] Signal inference workers to stop experience collection... (15500 times) [2024-06-27 20:39:12,588][06909] InferenceWorker_p0-w0: stopping experience collection (15500 times) [2024-06-27 20:39:12,590][06887] Signal inference workers to resume experience collection... (15500 times) [2024-06-27 20:39:12,599][06909] InferenceWorker_p0-w0: resuming experience collection (15500 times) [2024-06-27 20:39:13,501][06909] Updated weights for policy 0, policy_version 72513 (0.0032) [2024-06-27 20:39:13,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.7, 300 sec: 43653.7). Total num frames: 1188052992. Throughput: 0: 43760.9. Samples: 1090925620. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 20:39:13,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 20:39:17,144][06909] Updated weights for policy 0, policy_version 72523 (0.0042) [2024-06-27 20:39:18,850][06674] Fps is (10 sec: 47513.5, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 1188298752. Throughput: 0: 43848.9. Samples: 1091189900. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 20:39:18,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:39:21,048][06909] Updated weights for policy 0, policy_version 72533 (0.0030) [2024-06-27 20:39:23,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.8, 300 sec: 43653.7). Total num frames: 1188478976. Throughput: 0: 43809.5. Samples: 1091463520. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 20:39:23,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 20:39:24,517][06909] Updated weights for policy 0, policy_version 72543 (0.0033) [2024-06-27 20:39:28,494][06909] Updated weights for policy 0, policy_version 72553 (0.0036) [2024-06-27 20:39:28,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43963.8, 300 sec: 43653.7). Total num frames: 1188708352. Throughput: 0: 43992.6. Samples: 1091590160. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 20:39:28,850][06674] Avg episode reward: [(0, '0.396')] [2024-06-27 20:39:32,083][06909] Updated weights for policy 0, policy_version 72563 (0.0025) [2024-06-27 20:39:33,850][06674] Fps is (10 sec: 47513.0, 60 sec: 43963.7, 300 sec: 43764.7). Total num frames: 1188954112. Throughput: 0: 43841.3. Samples: 1091851820. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 20:39:33,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 20:39:36,156][06909] Updated weights for policy 0, policy_version 72573 (0.0028) [2024-06-27 20:39:38,850][06674] Fps is (10 sec: 42597.4, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 1189134336. Throughput: 0: 43851.3. Samples: 1092124660. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 20:39:38,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:39:39,581][06909] Updated weights for policy 0, policy_version 72583 (0.0039) [2024-06-27 20:39:43,758][06909] Updated weights for policy 0, policy_version 72593 (0.0040) [2024-06-27 20:39:43,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43690.7, 300 sec: 43598.4). Total num frames: 1189363712. Throughput: 0: 43873.0. Samples: 1092246800. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 20:39:43,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 20:39:46,794][06909] Updated weights for policy 0, policy_version 72603 (0.0023) [2024-06-27 20:39:48,850][06674] Fps is (10 sec: 47514.6, 60 sec: 43963.8, 300 sec: 43709.9). Total num frames: 1189609472. Throughput: 0: 43819.5. Samples: 1092509480. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 20:39:48,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 20:39:48,863][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000072608_1189609472.pth... [2024-06-27 20:39:48,917][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000071968_1179123712.pth [2024-06-27 20:39:50,969][06909] Updated weights for policy 0, policy_version 72613 (0.0041) [2024-06-27 20:39:53,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44238.1, 300 sec: 43653.6). Total num frames: 1189806080. Throughput: 0: 43994.7. Samples: 1092785260. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 20:39:53,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:39:54,402][06909] Updated weights for policy 0, policy_version 72623 (0.0045) [2024-06-27 20:39:58,192][06909] Updated weights for policy 0, policy_version 72633 (0.0040) [2024-06-27 20:39:58,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 1190019072. Throughput: 0: 44095.6. Samples: 1092909920. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-27 20:39:58,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 20:40:01,660][06909] Updated weights for policy 0, policy_version 72643 (0.0035) [2024-06-27 20:40:03,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 1190264832. Throughput: 0: 44081.8. Samples: 1093173580. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-27 20:40:03,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:40:06,059][06909] Updated weights for policy 0, policy_version 72653 (0.0033) [2024-06-27 20:40:08,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.8, 300 sec: 43709.2). Total num frames: 1190477824. Throughput: 0: 43944.8. Samples: 1093441040. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-27 20:40:08,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 20:40:09,397][06909] Updated weights for policy 0, policy_version 72663 (0.0044) [2024-06-27 20:40:13,394][06909] Updated weights for policy 0, policy_version 72673 (0.0054) [2024-06-27 20:40:13,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 1190690816. Throughput: 0: 43888.3. Samples: 1093565140. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-27 20:40:13,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:40:16,703][06887] Signal inference workers to stop experience collection... (15550 times) [2024-06-27 20:40:16,703][06887] Signal inference workers to resume experience collection... (15550 times) [2024-06-27 20:40:16,726][06909] InferenceWorker_p0-w0: stopping experience collection (15550 times) [2024-06-27 20:40:16,726][06909] InferenceWorker_p0-w0: resuming experience collection (15550 times) [2024-06-27 20:40:16,838][06909] Updated weights for policy 0, policy_version 72683 (0.0023) [2024-06-27 20:40:18,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 1190920192. Throughput: 0: 43877.8. Samples: 1093826320. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-27 20:40:18,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:40:20,727][06909] Updated weights for policy 0, policy_version 72693 (0.0043) [2024-06-27 20:40:23,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43963.7, 300 sec: 43653.7). Total num frames: 1191116800. Throughput: 0: 43958.5. Samples: 1094102780. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-27 20:40:23,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 20:40:24,297][06909] Updated weights for policy 0, policy_version 72703 (0.0033) [2024-06-27 20:40:28,276][06909] Updated weights for policy 0, policy_version 72713 (0.0050) [2024-06-27 20:40:28,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43690.5, 300 sec: 43653.6). Total num frames: 1191329792. Throughput: 0: 43943.0. Samples: 1094224240. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-27 20:40:28,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:40:31,794][06909] Updated weights for policy 0, policy_version 72723 (0.0035) [2024-06-27 20:40:33,850][06674] Fps is (10 sec: 45874.2, 60 sec: 43690.6, 300 sec: 43709.5). Total num frames: 1191575552. Throughput: 0: 43818.5. Samples: 1094481320. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-27 20:40:33,851][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:40:35,570][06909] Updated weights for policy 0, policy_version 72733 (0.0030) [2024-06-27 20:40:38,850][06674] Fps is (10 sec: 45876.2, 60 sec: 44237.0, 300 sec: 43709.5). Total num frames: 1191788544. Throughput: 0: 43913.8. Samples: 1094761380. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-27 20:40:38,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:40:39,185][06909] Updated weights for policy 0, policy_version 72743 (0.0019) [2024-06-27 20:40:43,468][06909] Updated weights for policy 0, policy_version 72753 (0.0048) [2024-06-27 20:40:43,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.6, 300 sec: 43653.6). Total num frames: 1192001536. Throughput: 0: 43812.7. Samples: 1094881500. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-27 20:40:43,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 20:40:46,718][06909] Updated weights for policy 0, policy_version 72763 (0.0035) [2024-06-27 20:40:48,852][06674] Fps is (10 sec: 45865.0, 60 sec: 43962.2, 300 sec: 43931.0). Total num frames: 1192247296. Throughput: 0: 43678.0. Samples: 1095139180. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-27 20:40:48,852][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:40:50,913][06909] Updated weights for policy 0, policy_version 72773 (0.0038) [2024-06-27 20:40:53,850][06674] Fps is (10 sec: 42599.2, 60 sec: 43690.6, 300 sec: 43653.7). Total num frames: 1192427520. Throughput: 0: 43701.4. Samples: 1095407600. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-27 20:40:53,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 20:40:54,410][06909] Updated weights for policy 0, policy_version 72783 (0.0030) [2024-06-27 20:40:58,209][06909] Updated weights for policy 0, policy_version 72793 (0.0026) [2024-06-27 20:40:58,850][06674] Fps is (10 sec: 39329.6, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 1192640512. Throughput: 0: 43711.1. Samples: 1095532140. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-27 20:40:58,851][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:41:02,201][06909] Updated weights for policy 0, policy_version 72803 (0.0035) [2024-06-27 20:41:03,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 1192886272. Throughput: 0: 43857.4. Samples: 1095799900. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-27 20:41:03,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:41:05,600][06909] Updated weights for policy 0, policy_version 72813 (0.0041) [2024-06-27 20:41:08,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 1193082880. Throughput: 0: 43537.7. Samples: 1096061980. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-27 20:41:08,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:41:09,539][06909] Updated weights for policy 0, policy_version 72823 (0.0031) [2024-06-27 20:41:13,493][06909] Updated weights for policy 0, policy_version 72833 (0.0023) [2024-06-27 20:41:13,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 1193312256. Throughput: 0: 43684.1. Samples: 1096190020. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-27 20:41:13,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:41:17,076][06909] Updated weights for policy 0, policy_version 72843 (0.0038) [2024-06-27 20:41:18,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43690.6, 300 sec: 43820.3). Total num frames: 1193541632. Throughput: 0: 43785.0. Samples: 1096451640. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-27 20:41:18,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:41:20,785][06909] Updated weights for policy 0, policy_version 72853 (0.0032) [2024-06-27 20:41:23,856][06674] Fps is (10 sec: 44209.9, 60 sec: 43959.3, 300 sec: 43708.3). Total num frames: 1193754624. Throughput: 0: 43503.4. Samples: 1096719300. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-27 20:41:23,856][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:41:24,500][06909] Updated weights for policy 0, policy_version 72863 (0.0027) [2024-06-27 20:41:28,153][06909] Updated weights for policy 0, policy_version 72873 (0.0033) [2024-06-27 20:41:28,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.8, 300 sec: 43709.2). Total num frames: 1193967616. Throughput: 0: 43646.0. Samples: 1096845560. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-27 20:41:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:41:31,844][06909] Updated weights for policy 0, policy_version 72883 (0.0029) [2024-06-27 20:41:32,404][06887] Signal inference workers to stop experience collection... (15600 times) [2024-06-27 20:41:32,405][06887] Signal inference workers to resume experience collection... (15600 times) [2024-06-27 20:41:32,453][06909] InferenceWorker_p0-w0: stopping experience collection (15600 times) [2024-06-27 20:41:32,453][06909] InferenceWorker_p0-w0: resuming experience collection (15600 times) [2024-06-27 20:41:33,850][06674] Fps is (10 sec: 44263.5, 60 sec: 43690.8, 300 sec: 43876.1). Total num frames: 1194196992. Throughput: 0: 43889.2. Samples: 1097114100. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-27 20:41:33,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:41:35,414][06909] Updated weights for policy 0, policy_version 72893 (0.0030) [2024-06-27 20:41:38,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43417.5, 300 sec: 43709.2). Total num frames: 1194393600. Throughput: 0: 43743.9. Samples: 1097376080. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-27 20:41:38,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:41:39,801][06909] Updated weights for policy 0, policy_version 72903 (0.0023) [2024-06-27 20:41:43,222][06909] Updated weights for policy 0, policy_version 72913 (0.0035) [2024-06-27 20:41:43,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.8, 300 sec: 43709.5). Total num frames: 1194622976. Throughput: 0: 43772.1. Samples: 1097501880. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-27 20:41:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:41:47,123][06909] Updated weights for policy 0, policy_version 72923 (0.0029) [2024-06-27 20:41:48,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43419.1, 300 sec: 43820.2). Total num frames: 1194852352. Throughput: 0: 43618.5. Samples: 1097762740. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-27 20:41:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 20:41:48,855][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000072928_1194852352.pth... [2024-06-27 20:41:48,917][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000072287_1184350208.pth [2024-06-27 20:41:50,654][06909] Updated weights for policy 0, policy_version 72933 (0.0035) [2024-06-27 20:41:53,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 1195048960. Throughput: 0: 43703.1. Samples: 1098028620. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-27 20:41:53,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 20:41:54,912][06909] Updated weights for policy 0, policy_version 72943 (0.0040) [2024-06-27 20:41:58,111][06909] Updated weights for policy 0, policy_version 72953 (0.0022) [2024-06-27 20:41:58,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44236.9, 300 sec: 43709.5). Total num frames: 1195294720. Throughput: 0: 43668.9. Samples: 1098155120. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-27 20:41:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:42:02,307][06909] Updated weights for policy 0, policy_version 72963 (0.0043) [2024-06-27 20:42:03,852][06674] Fps is (10 sec: 45865.9, 60 sec: 43689.1, 300 sec: 43875.5). Total num frames: 1195507712. Throughput: 0: 43738.0. Samples: 1098419940. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-27 20:42:03,853][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:42:05,634][06909] Updated weights for policy 0, policy_version 72973 (0.0033) [2024-06-27 20:42:08,851][06674] Fps is (10 sec: 40956.6, 60 sec: 43690.1, 300 sec: 43764.6). Total num frames: 1195704320. Throughput: 0: 43567.8. Samples: 1098679620. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-27 20:42:08,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:42:09,742][06909] Updated weights for policy 0, policy_version 72983 (0.0027) [2024-06-27 20:42:13,132][06909] Updated weights for policy 0, policy_version 72993 (0.0031) [2024-06-27 20:42:13,850][06674] Fps is (10 sec: 42607.4, 60 sec: 43690.7, 300 sec: 43653.7). Total num frames: 1195933696. Throughput: 0: 43612.0. Samples: 1098808100. Policy #0 lag: (min: 0.0, avg: 12.2, max: 23.0) [2024-06-27 20:42:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:42:17,429][06909] Updated weights for policy 0, policy_version 73003 (0.0043) [2024-06-27 20:42:18,850][06674] Fps is (10 sec: 44240.4, 60 sec: 43417.6, 300 sec: 43764.7). Total num frames: 1196146688. Throughput: 0: 43692.5. Samples: 1099080260. Policy #0 lag: (min: 0.0, avg: 12.2, max: 23.0) [2024-06-27 20:42:18,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:42:20,655][06909] Updated weights for policy 0, policy_version 73013 (0.0040) [2024-06-27 20:42:23,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43421.9, 300 sec: 43765.6). Total num frames: 1196359680. Throughput: 0: 43522.6. Samples: 1099334600. Policy #0 lag: (min: 0.0, avg: 12.2, max: 23.0) [2024-06-27 20:42:23,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:42:25,058][06909] Updated weights for policy 0, policy_version 73023 (0.0037) [2024-06-27 20:42:28,110][06909] Updated weights for policy 0, policy_version 73033 (0.0034) [2024-06-27 20:42:28,852][06674] Fps is (10 sec: 44227.7, 60 sec: 43689.2, 300 sec: 43708.9). Total num frames: 1196589056. Throughput: 0: 43659.8. Samples: 1099466660. Policy #0 lag: (min: 0.0, avg: 12.2, max: 23.0) [2024-06-27 20:42:28,852][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:42:32,589][06909] Updated weights for policy 0, policy_version 73043 (0.0028) [2024-06-27 20:42:33,850][06674] Fps is (10 sec: 45875.9, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 1196818432. Throughput: 0: 43776.6. Samples: 1099732680. Policy #0 lag: (min: 0.0, avg: 12.2, max: 23.0) [2024-06-27 20:42:33,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:42:35,689][06909] Updated weights for policy 0, policy_version 73053 (0.0037) [2024-06-27 20:42:38,850][06674] Fps is (10 sec: 44245.6, 60 sec: 43963.7, 300 sec: 43820.3). Total num frames: 1197031424. Throughput: 0: 43522.2. Samples: 1099987120. Policy #0 lag: (min: 0.0, avg: 12.2, max: 23.0) [2024-06-27 20:42:38,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:42:39,971][06909] Updated weights for policy 0, policy_version 73063 (0.0030) [2024-06-27 20:42:43,330][06909] Updated weights for policy 0, policy_version 73073 (0.0031) [2024-06-27 20:42:43,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 1197244416. Throughput: 0: 43626.7. Samples: 1100118320. Policy #0 lag: (min: 0.0, avg: 12.2, max: 23.0) [2024-06-27 20:42:43,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:42:45,425][06887] Signal inference workers to stop experience collection... (15650 times) [2024-06-27 20:42:45,461][06909] InferenceWorker_p0-w0: stopping experience collection (15650 times) [2024-06-27 20:42:45,478][06887] Signal inference workers to resume experience collection... (15650 times) [2024-06-27 20:42:45,480][06909] InferenceWorker_p0-w0: resuming experience collection (15650 times) [2024-06-27 20:42:47,530][06909] Updated weights for policy 0, policy_version 73083 (0.0029) [2024-06-27 20:42:48,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43417.6, 300 sec: 43820.2). Total num frames: 1197457408. Throughput: 0: 43723.7. Samples: 1100387420. Policy #0 lag: (min: 0.0, avg: 12.2, max: 23.0) [2024-06-27 20:42:48,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:42:50,759][06909] Updated weights for policy 0, policy_version 73093 (0.0025) [2024-06-27 20:42:53,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.7, 300 sec: 43820.3). Total num frames: 1197686784. Throughput: 0: 43675.8. Samples: 1100645000. Policy #0 lag: (min: 0.0, avg: 12.2, max: 23.0) [2024-06-27 20:42:53,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:42:54,841][06909] Updated weights for policy 0, policy_version 73103 (0.0037) [2024-06-27 20:42:58,060][06909] Updated weights for policy 0, policy_version 73113 (0.0031) [2024-06-27 20:42:58,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 1197899776. Throughput: 0: 43795.6. Samples: 1100778900. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 20:42:58,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:43:02,747][06909] Updated weights for policy 0, policy_version 73123 (0.0037) [2024-06-27 20:43:03,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43419.2, 300 sec: 43764.7). Total num frames: 1198112768. Throughput: 0: 43576.5. Samples: 1101041200. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 20:43:03,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:43:05,570][06909] Updated weights for policy 0, policy_version 73133 (0.0029) [2024-06-27 20:43:08,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43691.2, 300 sec: 43764.7). Total num frames: 1198325760. Throughput: 0: 43601.7. Samples: 1101296680. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 20:43:08,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:43:10,059][06909] Updated weights for policy 0, policy_version 73143 (0.0034) [2024-06-27 20:43:13,092][06909] Updated weights for policy 0, policy_version 73153 (0.0019) [2024-06-27 20:43:13,850][06674] Fps is (10 sec: 42596.5, 60 sec: 43417.3, 300 sec: 43653.6). Total num frames: 1198538752. Throughput: 0: 43533.2. Samples: 1101425580. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 20:43:13,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:43:17,516][06909] Updated weights for policy 0, policy_version 73163 (0.0037) [2024-06-27 20:43:18,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 1198768128. Throughput: 0: 43654.0. Samples: 1101697120. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 20:43:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:43:20,739][06909] Updated weights for policy 0, policy_version 73173 (0.0032) [2024-06-27 20:43:23,850][06674] Fps is (10 sec: 45876.5, 60 sec: 43963.7, 300 sec: 43820.2). Total num frames: 1198997504. Throughput: 0: 43549.3. Samples: 1101946840. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 20:43:23,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 20:43:25,234][06909] Updated weights for policy 0, policy_version 73183 (0.0043) [2024-06-27 20:43:28,575][06909] Updated weights for policy 0, policy_version 73193 (0.0044) [2024-06-27 20:43:28,852][06674] Fps is (10 sec: 42590.0, 60 sec: 43417.6, 300 sec: 43653.3). Total num frames: 1199194112. Throughput: 0: 43566.4. Samples: 1102078900. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 20:43:28,852][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:43:32,623][06909] Updated weights for policy 0, policy_version 73203 (0.0027) [2024-06-27 20:43:33,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43417.6, 300 sec: 43764.7). Total num frames: 1199423488. Throughput: 0: 43581.9. Samples: 1102348600. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 20:43:33,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:43:35,910][06909] Updated weights for policy 0, policy_version 73213 (0.0044) [2024-06-27 20:43:38,850][06674] Fps is (10 sec: 45884.1, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 1199652864. Throughput: 0: 43522.6. Samples: 1102603520. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 20:43:38,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:43:40,053][06909] Updated weights for policy 0, policy_version 73223 (0.0029) [2024-06-27 20:43:43,250][06909] Updated weights for policy 0, policy_version 73233 (0.0028) [2024-06-27 20:43:43,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43417.5, 300 sec: 43653.6). Total num frames: 1199849472. Throughput: 0: 43489.7. Samples: 1102735940. Policy #0 lag: (min: 0.0, avg: 10.6, max: 24.0) [2024-06-27 20:43:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 20:43:47,745][06909] Updated weights for policy 0, policy_version 73243 (0.0035) [2024-06-27 20:43:48,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 43820.5). Total num frames: 1200078848. Throughput: 0: 43570.0. Samples: 1103001860. Policy #0 lag: (min: 0.0, avg: 10.6, max: 24.0) [2024-06-27 20:43:48,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:43:48,857][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000073247_1200078848.pth... [2024-06-27 20:43:48,910][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000072608_1189609472.pth [2024-06-27 20:43:50,951][06909] Updated weights for policy 0, policy_version 73253 (0.0042) [2024-06-27 20:43:53,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 1200291840. Throughput: 0: 43454.3. Samples: 1103252120. Policy #0 lag: (min: 0.0, avg: 10.6, max: 24.0) [2024-06-27 20:43:53,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:43:55,446][06909] Updated weights for policy 0, policy_version 73263 (0.0034) [2024-06-27 20:43:58,330][06909] Updated weights for policy 0, policy_version 73273 (0.0033) [2024-06-27 20:43:58,852][06674] Fps is (10 sec: 42590.2, 60 sec: 43416.1, 300 sec: 43653.3). Total num frames: 1200504832. Throughput: 0: 43656.6. Samples: 1103390200. Policy #0 lag: (min: 0.0, avg: 10.6, max: 24.0) [2024-06-27 20:43:58,852][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:44:02,730][06909] Updated weights for policy 0, policy_version 73283 (0.0030) [2024-06-27 20:44:03,727][06887] Signal inference workers to stop experience collection... (15700 times) [2024-06-27 20:44:03,732][06887] Signal inference workers to resume experience collection... (15700 times) [2024-06-27 20:44:03,737][06909] InferenceWorker_p0-w0: stopping experience collection (15700 times) [2024-06-27 20:44:03,752][06909] InferenceWorker_p0-w0: resuming experience collection (15700 times) [2024-06-27 20:44:03,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 1200717824. Throughput: 0: 43600.2. Samples: 1103659120. Policy #0 lag: (min: 0.0, avg: 10.6, max: 24.0) [2024-06-27 20:44:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:44:05,862][06909] Updated weights for policy 0, policy_version 73293 (0.0034) [2024-06-27 20:44:08,850][06674] Fps is (10 sec: 45883.8, 60 sec: 43963.7, 300 sec: 43764.7). Total num frames: 1200963584. Throughput: 0: 43683.0. Samples: 1103912580. Policy #0 lag: (min: 0.0, avg: 10.6, max: 24.0) [2024-06-27 20:44:08,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:44:10,178][06909] Updated weights for policy 0, policy_version 73303 (0.0028) [2024-06-27 20:44:13,404][06909] Updated weights for policy 0, policy_version 73313 (0.0034) [2024-06-27 20:44:13,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43964.0, 300 sec: 43653.6). Total num frames: 1201176576. Throughput: 0: 43756.2. Samples: 1104047840. Policy #0 lag: (min: 0.0, avg: 10.6, max: 24.0) [2024-06-27 20:44:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:44:17,485][06909] Updated weights for policy 0, policy_version 73323 (0.0039) [2024-06-27 20:44:18,850][06674] Fps is (10 sec: 39322.5, 60 sec: 43144.6, 300 sec: 43653.6). Total num frames: 1201356800. Throughput: 0: 43747.2. Samples: 1104317220. Policy #0 lag: (min: 0.0, avg: 10.6, max: 24.0) [2024-06-27 20:44:18,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:44:20,859][06909] Updated weights for policy 0, policy_version 73333 (0.0029) [2024-06-27 20:44:23,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 1201618944. Throughput: 0: 43610.6. Samples: 1104566000. Policy #0 lag: (min: 0.0, avg: 10.6, max: 24.0) [2024-06-27 20:44:23,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:44:24,757][06909] Updated weights for policy 0, policy_version 73343 (0.0041) [2024-06-27 20:44:28,658][06909] Updated weights for policy 0, policy_version 73353 (0.0037) [2024-06-27 20:44:28,850][06674] Fps is (10 sec: 47513.0, 60 sec: 43965.2, 300 sec: 43653.6). Total num frames: 1201831936. Throughput: 0: 43842.6. Samples: 1104708860. Policy #0 lag: (min: 0.0, avg: 8.6, max: 24.0) [2024-06-27 20:44:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:44:32,377][06909] Updated weights for policy 0, policy_version 73363 (0.0037) [2024-06-27 20:44:33,850][06674] Fps is (10 sec: 40960.7, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 1202028544. Throughput: 0: 43714.4. Samples: 1104969000. Policy #0 lag: (min: 0.0, avg: 8.6, max: 24.0) [2024-06-27 20:44:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:44:35,855][06909] Updated weights for policy 0, policy_version 73373 (0.0027) [2024-06-27 20:44:38,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 1202274304. Throughput: 0: 43878.2. Samples: 1105226640. Policy #0 lag: (min: 0.0, avg: 8.6, max: 24.0) [2024-06-27 20:44:38,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 20:44:40,064][06909] Updated weights for policy 0, policy_version 73383 (0.0026) [2024-06-27 20:44:43,191][06909] Updated weights for policy 0, policy_version 73393 (0.0044) [2024-06-27 20:44:43,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.8, 300 sec: 43653.6). Total num frames: 1202487296. Throughput: 0: 43883.4. Samples: 1105364860. Policy #0 lag: (min: 0.0, avg: 8.6, max: 24.0) [2024-06-27 20:44:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:44:47,209][06909] Updated weights for policy 0, policy_version 73403 (0.0026) [2024-06-27 20:44:48,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.8, 300 sec: 43764.7). Total num frames: 1202716672. Throughput: 0: 43948.4. Samples: 1105636800. Policy #0 lag: (min: 0.0, avg: 8.6, max: 24.0) [2024-06-27 20:44:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:44:50,376][06909] Updated weights for policy 0, policy_version 73413 (0.0051) [2024-06-27 20:44:53,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.8, 300 sec: 43764.7). Total num frames: 1202929664. Throughput: 0: 44090.4. Samples: 1105896640. Policy #0 lag: (min: 0.0, avg: 8.6, max: 24.0) [2024-06-27 20:44:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 20:44:54,615][06909] Updated weights for policy 0, policy_version 73423 (0.0028) [2024-06-27 20:44:58,087][06909] Updated weights for policy 0, policy_version 73433 (0.0023) [2024-06-27 20:44:58,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44238.2, 300 sec: 43709.2). Total num frames: 1203159040. Throughput: 0: 44060.3. Samples: 1106030560. Policy #0 lag: (min: 0.0, avg: 8.6, max: 24.0) [2024-06-27 20:44:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:45:01,838][06909] Updated weights for policy 0, policy_version 73443 (0.0039) [2024-06-27 20:45:03,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 1203339264. Throughput: 0: 43800.3. Samples: 1106288240. Policy #0 lag: (min: 0.0, avg: 8.6, max: 24.0) [2024-06-27 20:45:03,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:45:05,883][06909] Updated weights for policy 0, policy_version 73453 (0.0023) [2024-06-27 20:45:08,850][06674] Fps is (10 sec: 40960.7, 60 sec: 43417.8, 300 sec: 43653.7). Total num frames: 1203568640. Throughput: 0: 44039.7. Samples: 1106547780. Policy #0 lag: (min: 0.0, avg: 8.6, max: 24.0) [2024-06-27 20:45:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:45:09,530][06909] Updated weights for policy 0, policy_version 73463 (0.0023) [2024-06-27 20:45:13,096][06909] Updated weights for policy 0, policy_version 73473 (0.0041) [2024-06-27 20:45:13,850][06674] Fps is (10 sec: 47514.2, 60 sec: 43963.8, 300 sec: 43709.2). Total num frames: 1203814400. Throughput: 0: 43790.8. Samples: 1106679440. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 20:45:13,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:45:17,283][06909] Updated weights for policy 0, policy_version 73483 (0.0021) [2024-06-27 20:45:18,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44236.7, 300 sec: 43709.2). Total num frames: 1204011008. Throughput: 0: 43900.8. Samples: 1106944540. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 20:45:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:45:20,804][06909] Updated weights for policy 0, policy_version 73493 (0.0038) [2024-06-27 20:45:23,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43417.7, 300 sec: 43709.2). Total num frames: 1204224000. Throughput: 0: 43950.7. Samples: 1107204420. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 20:45:23,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:45:24,575][06909] Updated weights for policy 0, policy_version 73503 (0.0033) [2024-06-27 20:45:26,853][06887] Signal inference workers to stop experience collection... (15750 times) [2024-06-27 20:45:26,853][06887] Signal inference workers to resume experience collection... (15750 times) [2024-06-27 20:45:26,904][06909] InferenceWorker_p0-w0: stopping experience collection (15750 times) [2024-06-27 20:45:26,904][06909] InferenceWorker_p0-w0: resuming experience collection (15750 times) [2024-06-27 20:45:28,118][06909] Updated weights for policy 0, policy_version 73513 (0.0038) [2024-06-27 20:45:28,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 1204469760. Throughput: 0: 43932.3. Samples: 1107341820. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 20:45:28,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:45:31,975][06909] Updated weights for policy 0, policy_version 73523 (0.0028) [2024-06-27 20:45:33,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.7, 300 sec: 43653.6). Total num frames: 1204666368. Throughput: 0: 43787.6. Samples: 1107607240. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 20:45:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:45:35,496][06909] Updated weights for policy 0, policy_version 73533 (0.0041) [2024-06-27 20:45:38,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 1204895744. Throughput: 0: 43829.8. Samples: 1107868980. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 20:45:38,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:45:39,178][06909] Updated weights for policy 0, policy_version 73543 (0.0031) [2024-06-27 20:45:43,041][06909] Updated weights for policy 0, policy_version 73553 (0.0032) [2024-06-27 20:45:43,853][06674] Fps is (10 sec: 47499.0, 60 sec: 44234.5, 300 sec: 43709.0). Total num frames: 1205141504. Throughput: 0: 43876.3. Samples: 1108005120. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 20:45:43,853][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:45:46,869][06909] Updated weights for policy 0, policy_version 73563 (0.0052) [2024-06-27 20:45:48,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 1205338112. Throughput: 0: 43862.7. Samples: 1108262060. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 20:45:48,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:45:48,864][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000073568_1205338112.pth... [2024-06-27 20:45:48,919][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000072928_1194852352.pth [2024-06-27 20:45:50,556][06909] Updated weights for policy 0, policy_version 73573 (0.0039) [2024-06-27 20:45:53,852][06674] Fps is (10 sec: 40964.0, 60 sec: 43689.2, 300 sec: 43764.4). Total num frames: 1205551104. Throughput: 0: 43759.3. Samples: 1108517040. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 20:45:53,852][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 20:45:54,670][06909] Updated weights for policy 0, policy_version 73583 (0.0035) [2024-06-27 20:45:58,139][06909] Updated weights for policy 0, policy_version 73593 (0.0030) [2024-06-27 20:45:58,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43690.8, 300 sec: 43709.2). Total num frames: 1205780480. Throughput: 0: 43906.7. Samples: 1108655240. Policy #0 lag: (min: 1.0, avg: 8.9, max: 20.0) [2024-06-27 20:45:58,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 20:46:01,957][06909] Updated weights for policy 0, policy_version 73603 (0.0028) [2024-06-27 20:46:03,850][06674] Fps is (10 sec: 42607.0, 60 sec: 43963.8, 300 sec: 43709.2). Total num frames: 1205977088. Throughput: 0: 43896.5. Samples: 1108919880. Policy #0 lag: (min: 1.0, avg: 8.9, max: 20.0) [2024-06-27 20:46:03,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:46:05,460][06909] Updated weights for policy 0, policy_version 73613 (0.0025) [2024-06-27 20:46:08,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43690.7, 300 sec: 43653.6). Total num frames: 1206190080. Throughput: 0: 43864.1. Samples: 1109178300. Policy #0 lag: (min: 1.0, avg: 8.9, max: 20.0) [2024-06-27 20:46:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:46:09,491][06909] Updated weights for policy 0, policy_version 73623 (0.0042) [2024-06-27 20:46:12,941][06909] Updated weights for policy 0, policy_version 73633 (0.0045) [2024-06-27 20:46:13,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 1206435840. Throughput: 0: 43793.1. Samples: 1109312500. Policy #0 lag: (min: 1.0, avg: 8.9, max: 20.0) [2024-06-27 20:46:13,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:46:16,926][06909] Updated weights for policy 0, policy_version 73643 (0.0027) [2024-06-27 20:46:18,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.8, 300 sec: 43654.5). Total num frames: 1206632448. Throughput: 0: 43553.8. Samples: 1109567160. Policy #0 lag: (min: 1.0, avg: 8.9, max: 20.0) [2024-06-27 20:46:18,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:46:20,555][06909] Updated weights for policy 0, policy_version 73653 (0.0027) [2024-06-27 20:46:23,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 1206861824. Throughput: 0: 43572.8. Samples: 1109829760. Policy #0 lag: (min: 1.0, avg: 8.9, max: 20.0) [2024-06-27 20:46:23,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:46:24,242][06909] Updated weights for policy 0, policy_version 73663 (0.0033) [2024-06-27 20:46:28,186][06909] Updated weights for policy 0, policy_version 73673 (0.0034) [2024-06-27 20:46:28,850][06674] Fps is (10 sec: 47513.4, 60 sec: 43963.8, 300 sec: 43764.7). Total num frames: 1207107584. Throughput: 0: 43581.2. Samples: 1109966140. Policy #0 lag: (min: 1.0, avg: 8.9, max: 20.0) [2024-06-27 20:46:28,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:46:31,872][06909] Updated weights for policy 0, policy_version 73683 (0.0031) [2024-06-27 20:46:33,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43417.6, 300 sec: 43653.7). Total num frames: 1207271424. Throughput: 0: 43658.8. Samples: 1110226700. Policy #0 lag: (min: 1.0, avg: 8.9, max: 20.0) [2024-06-27 20:46:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:46:35,626][06909] Updated weights for policy 0, policy_version 73693 (0.0034) [2024-06-27 20:46:38,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 1207517184. Throughput: 0: 43806.8. Samples: 1110488260. Policy #0 lag: (min: 1.0, avg: 8.9, max: 20.0) [2024-06-27 20:46:38,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:46:39,525][06909] Updated weights for policy 0, policy_version 73703 (0.0042) [2024-06-27 20:46:43,006][06909] Updated weights for policy 0, policy_version 73713 (0.0040) [2024-06-27 20:46:43,850][06674] Fps is (10 sec: 49151.7, 60 sec: 43692.9, 300 sec: 43764.7). Total num frames: 1207762944. Throughput: 0: 43760.9. Samples: 1110624480. Policy #0 lag: (min: 1.0, avg: 8.6, max: 21.0) [2024-06-27 20:46:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:46:46,830][06909] Updated weights for policy 0, policy_version 73723 (0.0032) [2024-06-27 20:46:48,175][06887] Signal inference workers to stop experience collection... (15800 times) [2024-06-27 20:46:48,198][06909] InferenceWorker_p0-w0: stopping experience collection (15800 times) [2024-06-27 20:46:48,285][06887] Signal inference workers to resume experience collection... (15800 times) [2024-06-27 20:46:48,285][06909] InferenceWorker_p0-w0: resuming experience collection (15800 times) [2024-06-27 20:46:48,850][06674] Fps is (10 sec: 44235.8, 60 sec: 43690.5, 300 sec: 43764.7). Total num frames: 1207959552. Throughput: 0: 43565.1. Samples: 1110880320. Policy #0 lag: (min: 1.0, avg: 8.6, max: 21.0) [2024-06-27 20:46:48,851][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:46:50,589][06909] Updated weights for policy 0, policy_version 73733 (0.0039) [2024-06-27 20:46:53,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43692.1, 300 sec: 43653.6). Total num frames: 1208172544. Throughput: 0: 43765.3. Samples: 1111147740. Policy #0 lag: (min: 1.0, avg: 8.6, max: 21.0) [2024-06-27 20:46:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:46:54,154][06909] Updated weights for policy 0, policy_version 73743 (0.0027) [2024-06-27 20:46:58,141][06909] Updated weights for policy 0, policy_version 73753 (0.0034) [2024-06-27 20:46:58,850][06674] Fps is (10 sec: 45876.6, 60 sec: 43963.7, 300 sec: 43765.0). Total num frames: 1208418304. Throughput: 0: 43781.7. Samples: 1111282680. Policy #0 lag: (min: 1.0, avg: 8.6, max: 21.0) [2024-06-27 20:46:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:47:01,574][06909] Updated weights for policy 0, policy_version 73763 (0.0033) [2024-06-27 20:47:03,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.7, 300 sec: 43709.3). Total num frames: 1208598528. Throughput: 0: 43714.1. Samples: 1111534300. Policy #0 lag: (min: 1.0, avg: 8.6, max: 21.0) [2024-06-27 20:47:03,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:47:05,621][06909] Updated weights for policy 0, policy_version 73773 (0.0038) [2024-06-27 20:47:08,850][06674] Fps is (10 sec: 40959.4, 60 sec: 43963.6, 300 sec: 43709.2). Total num frames: 1208827904. Throughput: 0: 43612.4. Samples: 1111792320. Policy #0 lag: (min: 1.0, avg: 8.6, max: 21.0) [2024-06-27 20:47:08,854][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:47:09,122][06909] Updated weights for policy 0, policy_version 73783 (0.0038) [2024-06-27 20:47:13,073][06909] Updated weights for policy 0, policy_version 73793 (0.0038) [2024-06-27 20:47:13,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 1209057280. Throughput: 0: 43657.8. Samples: 1111930740. Policy #0 lag: (min: 1.0, avg: 8.6, max: 21.0) [2024-06-27 20:47:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:47:16,559][06909] Updated weights for policy 0, policy_version 73803 (0.0024) [2024-06-27 20:47:18,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 1209253888. Throughput: 0: 43715.0. Samples: 1112193880. Policy #0 lag: (min: 1.0, avg: 8.6, max: 21.0) [2024-06-27 20:47:18,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:47:20,561][06909] Updated weights for policy 0, policy_version 73813 (0.0026) [2024-06-27 20:47:23,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43963.7, 300 sec: 43765.0). Total num frames: 1209499648. Throughput: 0: 43704.8. Samples: 1112454980. Policy #0 lag: (min: 1.0, avg: 8.6, max: 21.0) [2024-06-27 20:47:23,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:47:24,346][06909] Updated weights for policy 0, policy_version 73823 (0.0031) [2024-06-27 20:47:27,906][06909] Updated weights for policy 0, policy_version 73833 (0.0031) [2024-06-27 20:47:28,850][06674] Fps is (10 sec: 47513.3, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 1209729024. Throughput: 0: 43754.2. Samples: 1112593420. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 20:47:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:47:31,599][06909] Updated weights for policy 0, policy_version 73843 (0.0028) [2024-06-27 20:47:33,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43963.6, 300 sec: 43653.6). Total num frames: 1209909248. Throughput: 0: 43822.0. Samples: 1112852300. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 20:47:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:47:35,503][06909] Updated weights for policy 0, policy_version 73853 (0.0030) [2024-06-27 20:47:38,835][06909] Updated weights for policy 0, policy_version 73863 (0.0036) [2024-06-27 20:47:38,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.8, 300 sec: 43820.2). Total num frames: 1210171392. Throughput: 0: 43704.0. Samples: 1113114420. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 20:47:38,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:47:42,908][06909] Updated weights for policy 0, policy_version 73873 (0.0040) [2024-06-27 20:47:43,850][06674] Fps is (10 sec: 49151.4, 60 sec: 43963.6, 300 sec: 43875.8). Total num frames: 1210400768. Throughput: 0: 43742.0. Samples: 1113251080. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 20:47:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:47:46,407][06909] Updated weights for policy 0, policy_version 73883 (0.0039) [2024-06-27 20:47:48,850][06674] Fps is (10 sec: 39322.0, 60 sec: 43417.8, 300 sec: 43653.7). Total num frames: 1210564608. Throughput: 0: 43949.4. Samples: 1113512020. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 20:47:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:47:48,861][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000073888_1210580992.pth... [2024-06-27 20:47:48,917][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000073247_1200078848.pth [2024-06-27 20:47:50,564][06909] Updated weights for policy 0, policy_version 73893 (0.0027) [2024-06-27 20:47:53,850][06674] Fps is (10 sec: 40960.9, 60 sec: 43963.8, 300 sec: 43764.7). Total num frames: 1210810368. Throughput: 0: 43926.8. Samples: 1113769020. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 20:47:53,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:47:53,927][06909] Updated weights for policy 0, policy_version 73903 (0.0032) [2024-06-27 20:47:57,792][06909] Updated weights for policy 0, policy_version 73913 (0.0028) [2024-06-27 20:47:58,850][06674] Fps is (10 sec: 47513.5, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 1211039744. Throughput: 0: 43958.7. Samples: 1113908880. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 20:47:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:48:01,706][06909] Updated weights for policy 0, policy_version 73923 (0.0031) [2024-06-27 20:48:01,724][06887] Signal inference workers to stop experience collection... (15850 times) [2024-06-27 20:48:01,724][06887] Signal inference workers to resume experience collection... (15850 times) [2024-06-27 20:48:01,768][06909] InferenceWorker_p0-w0: stopping experience collection (15850 times) [2024-06-27 20:48:01,768][06909] InferenceWorker_p0-w0: resuming experience collection (15850 times) [2024-06-27 20:48:03,850][06674] Fps is (10 sec: 42597.4, 60 sec: 43963.6, 300 sec: 43764.7). Total num frames: 1211236352. Throughput: 0: 43918.9. Samples: 1114170240. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 20:48:03,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:48:05,230][06909] Updated weights for policy 0, policy_version 73933 (0.0036) [2024-06-27 20:48:08,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43963.8, 300 sec: 43820.3). Total num frames: 1211465728. Throughput: 0: 43935.6. Samples: 1114432080. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 20:48:08,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 20:48:08,994][06909] Updated weights for policy 0, policy_version 73943 (0.0023) [2024-06-27 20:48:12,717][06909] Updated weights for policy 0, policy_version 73953 (0.0036) [2024-06-27 20:48:13,850][06674] Fps is (10 sec: 47514.4, 60 sec: 44236.8, 300 sec: 43875.8). Total num frames: 1211711488. Throughput: 0: 44008.5. Samples: 1114573800. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2024-06-27 20:48:13,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:48:16,544][06909] Updated weights for policy 0, policy_version 73963 (0.0039) [2024-06-27 20:48:18,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43690.7, 300 sec: 43653.7). Total num frames: 1211875328. Throughput: 0: 43795.2. Samples: 1114823080. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2024-06-27 20:48:18,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:48:20,349][06909] Updated weights for policy 0, policy_version 73973 (0.0033) [2024-06-27 20:48:23,846][06909] Updated weights for policy 0, policy_version 73983 (0.0034) [2024-06-27 20:48:23,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.8, 300 sec: 43876.1). Total num frames: 1212137472. Throughput: 0: 43721.4. Samples: 1115081880. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2024-06-27 20:48:23,851][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:48:27,815][06909] Updated weights for policy 0, policy_version 73993 (0.0036) [2024-06-27 20:48:28,850][06674] Fps is (10 sec: 47513.4, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 1212350464. Throughput: 0: 43817.5. Samples: 1115222860. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2024-06-27 20:48:28,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:48:31,187][06909] Updated weights for policy 0, policy_version 74003 (0.0028) [2024-06-27 20:48:33,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 1212547072. Throughput: 0: 43731.4. Samples: 1115479940. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2024-06-27 20:48:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:48:35,236][06909] Updated weights for policy 0, policy_version 74013 (0.0034) [2024-06-27 20:48:38,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43417.6, 300 sec: 43820.3). Total num frames: 1212776448. Throughput: 0: 44041.7. Samples: 1115750900. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2024-06-27 20:48:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:48:38,928][06909] Updated weights for policy 0, policy_version 74023 (0.0043) [2024-06-27 20:48:42,527][06909] Updated weights for policy 0, policy_version 74033 (0.0020) [2024-06-27 20:48:43,850][06674] Fps is (10 sec: 49152.5, 60 sec: 43963.9, 300 sec: 43931.4). Total num frames: 1213038592. Throughput: 0: 44004.4. Samples: 1115889080. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2024-06-27 20:48:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:48:46,214][06909] Updated weights for policy 0, policy_version 74043 (0.0033) [2024-06-27 20:48:48,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 1213186048. Throughput: 0: 43874.0. Samples: 1116144560. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2024-06-27 20:48:48,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:48:50,082][06909] Updated weights for policy 0, policy_version 74053 (0.0036) [2024-06-27 20:48:53,485][06909] Updated weights for policy 0, policy_version 74063 (0.0029) [2024-06-27 20:48:53,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43963.7, 300 sec: 43876.1). Total num frames: 1213448192. Throughput: 0: 43859.1. Samples: 1116405740. Policy #0 lag: (min: 0.0, avg: 7.5, max: 20.0) [2024-06-27 20:48:53,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:48:57,524][06909] Updated weights for policy 0, policy_version 74073 (0.0037) [2024-06-27 20:48:58,850][06674] Fps is (10 sec: 50790.1, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 1213693952. Throughput: 0: 43962.2. Samples: 1116552100. Policy #0 lag: (min: 0.0, avg: 7.3, max: 20.0) [2024-06-27 20:48:58,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:49:01,409][06909] Updated weights for policy 0, policy_version 74083 (0.0033) [2024-06-27 20:49:03,850][06674] Fps is (10 sec: 39321.6, 60 sec: 43417.7, 300 sec: 43653.7). Total num frames: 1213841408. Throughput: 0: 43954.6. Samples: 1116801040. Policy #0 lag: (min: 0.0, avg: 7.3, max: 20.0) [2024-06-27 20:49:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:49:05,386][06909] Updated weights for policy 0, policy_version 74093 (0.0027) [2024-06-27 20:49:08,655][06909] Updated weights for policy 0, policy_version 74103 (0.0028) [2024-06-27 20:49:08,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43963.7, 300 sec: 43820.2). Total num frames: 1214103552. Throughput: 0: 43923.9. Samples: 1117058460. Policy #0 lag: (min: 0.0, avg: 7.3, max: 20.0) [2024-06-27 20:49:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:49:12,771][06909] Updated weights for policy 0, policy_version 74113 (0.0035) [2024-06-27 20:49:13,249][06887] Signal inference workers to stop experience collection... (15900 times) [2024-06-27 20:49:13,249][06887] Signal inference workers to resume experience collection... (15900 times) [2024-06-27 20:49:13,290][06909] InferenceWorker_p0-w0: stopping experience collection (15900 times) [2024-06-27 20:49:13,290][06909] InferenceWorker_p0-w0: resuming experience collection (15900 times) [2024-06-27 20:49:13,850][06674] Fps is (10 sec: 49151.8, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 1214332928. Throughput: 0: 43935.5. Samples: 1117199960. Policy #0 lag: (min: 0.0, avg: 7.3, max: 20.0) [2024-06-27 20:49:13,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:49:16,178][06909] Updated weights for policy 0, policy_version 74123 (0.0027) [2024-06-27 20:49:18,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 1214513152. Throughput: 0: 44029.4. Samples: 1117461260. Policy #0 lag: (min: 0.0, avg: 7.3, max: 20.0) [2024-06-27 20:49:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 20:49:19,938][06909] Updated weights for policy 0, policy_version 74133 (0.0029) [2024-06-27 20:49:23,481][06909] Updated weights for policy 0, policy_version 74143 (0.0043) [2024-06-27 20:49:23,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 1214758912. Throughput: 0: 43807.2. Samples: 1117722220. Policy #0 lag: (min: 0.0, avg: 7.3, max: 20.0) [2024-06-27 20:49:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 20:49:27,553][06909] Updated weights for policy 0, policy_version 74153 (0.0031) [2024-06-27 20:49:28,850][06674] Fps is (10 sec: 47513.1, 60 sec: 43963.6, 300 sec: 43931.3). Total num frames: 1214988288. Throughput: 0: 43861.6. Samples: 1117862860. Policy #0 lag: (min: 0.0, avg: 7.3, max: 20.0) [2024-06-27 20:49:28,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 20:49:30,762][06909] Updated weights for policy 0, policy_version 74163 (0.0036) [2024-06-27 20:49:33,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 1215168512. Throughput: 0: 43781.8. Samples: 1118114740. Policy #0 lag: (min: 0.0, avg: 7.3, max: 20.0) [2024-06-27 20:49:33,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:49:35,128][06909] Updated weights for policy 0, policy_version 74173 (0.0032) [2024-06-27 20:49:38,296][06909] Updated weights for policy 0, policy_version 74183 (0.0031) [2024-06-27 20:49:38,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43963.8, 300 sec: 43820.3). Total num frames: 1215414272. Throughput: 0: 43777.0. Samples: 1118375700. Policy #0 lag: (min: 0.0, avg: 7.3, max: 20.0) [2024-06-27 20:49:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 20:49:42,498][06909] Updated weights for policy 0, policy_version 74193 (0.0043) [2024-06-27 20:49:43,850][06674] Fps is (10 sec: 49152.1, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 1215660032. Throughput: 0: 43712.5. Samples: 1118519160. Policy #0 lag: (min: 2.0, avg: 8.6, max: 22.0) [2024-06-27 20:49:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:49:45,990][06909] Updated weights for policy 0, policy_version 74203 (0.0026) [2024-06-27 20:49:48,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43963.7, 300 sec: 43709.2). Total num frames: 1215823872. Throughput: 0: 43904.0. Samples: 1118776720. Policy #0 lag: (min: 2.0, avg: 8.6, max: 22.0) [2024-06-27 20:49:48,853][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:49:48,962][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000074209_1215840256.pth... [2024-06-27 20:49:49,010][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000073568_1205338112.pth [2024-06-27 20:49:50,006][06909] Updated weights for policy 0, policy_version 74213 (0.0025) [2024-06-27 20:49:53,261][06909] Updated weights for policy 0, policy_version 74223 (0.0032) [2024-06-27 20:49:53,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.7, 300 sec: 43820.3). Total num frames: 1216086016. Throughput: 0: 43964.9. Samples: 1119036880. Policy #0 lag: (min: 2.0, avg: 8.6, max: 22.0) [2024-06-27 20:49:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:49:57,357][06909] Updated weights for policy 0, policy_version 74233 (0.0031) [2024-06-27 20:49:58,850][06674] Fps is (10 sec: 49152.3, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1216315392. Throughput: 0: 44043.6. Samples: 1119181920. Policy #0 lag: (min: 2.0, avg: 8.6, max: 22.0) [2024-06-27 20:49:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 20:50:00,514][06909] Updated weights for policy 0, policy_version 74243 (0.0037) [2024-06-27 20:50:03,852][06674] Fps is (10 sec: 39313.7, 60 sec: 43962.3, 300 sec: 43764.4). Total num frames: 1216479232. Throughput: 0: 43847.8. Samples: 1119434500. Policy #0 lag: (min: 2.0, avg: 8.6, max: 22.0) [2024-06-27 20:50:03,852][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:50:04,985][06909] Updated weights for policy 0, policy_version 74253 (0.0031) [2024-06-27 20:50:07,783][06909] Updated weights for policy 0, policy_version 74263 (0.0042) [2024-06-27 20:50:08,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.8, 300 sec: 43820.3). Total num frames: 1216741376. Throughput: 0: 43870.7. Samples: 1119696400. Policy #0 lag: (min: 2.0, avg: 8.6, max: 22.0) [2024-06-27 20:50:08,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:50:12,358][06909] Updated weights for policy 0, policy_version 74273 (0.0026) [2024-06-27 20:50:13,850][06674] Fps is (10 sec: 49161.8, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 1216970752. Throughput: 0: 43930.8. Samples: 1119839740. Policy #0 lag: (min: 2.0, avg: 8.6, max: 22.0) [2024-06-27 20:50:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:50:15,590][06909] Updated weights for policy 0, policy_version 74283 (0.0036) [2024-06-27 20:50:18,850][06674] Fps is (10 sec: 39321.3, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 1217134592. Throughput: 0: 43882.6. Samples: 1120089460. Policy #0 lag: (min: 2.0, avg: 8.6, max: 22.0) [2024-06-27 20:50:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:50:19,887][06909] Updated weights for policy 0, policy_version 74293 (0.0034) [2024-06-27 20:50:23,458][06909] Updated weights for policy 0, policy_version 74303 (0.0036) [2024-06-27 20:50:23,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 1217380352. Throughput: 0: 43771.4. Samples: 1120345420. Policy #0 lag: (min: 2.0, avg: 8.6, max: 22.0) [2024-06-27 20:50:23,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:50:27,732][06909] Updated weights for policy 0, policy_version 74313 (0.0035) [2024-06-27 20:50:28,850][06674] Fps is (10 sec: 49152.0, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 1217626112. Throughput: 0: 43709.7. Samples: 1120486100. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-27 20:50:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:50:30,645][06909] Updated weights for policy 0, policy_version 74323 (0.0038) [2024-06-27 20:50:33,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 1217789952. Throughput: 0: 43677.0. Samples: 1120742180. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-27 20:50:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:50:35,035][06909] Updated weights for policy 0, policy_version 74333 (0.0035) [2024-06-27 20:50:35,667][06887] Signal inference workers to stop experience collection... (15950 times) [2024-06-27 20:50:35,668][06887] Signal inference workers to resume experience collection... (15950 times) [2024-06-27 20:50:35,705][06909] InferenceWorker_p0-w0: stopping experience collection (15950 times) [2024-06-27 20:50:35,705][06909] InferenceWorker_p0-w0: resuming experience collection (15950 times) [2024-06-27 20:50:37,817][06909] Updated weights for policy 0, policy_version 74343 (0.0032) [2024-06-27 20:50:38,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.7, 300 sec: 43765.2). Total num frames: 1218052096. Throughput: 0: 43827.5. Samples: 1121009120. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-27 20:50:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 20:50:42,449][06909] Updated weights for policy 0, policy_version 74353 (0.0038) [2024-06-27 20:50:43,850][06674] Fps is (10 sec: 49151.3, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 1218281472. Throughput: 0: 43695.5. Samples: 1121148220. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-27 20:50:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:50:45,118][06909] Updated weights for policy 0, policy_version 74363 (0.0020) [2024-06-27 20:50:48,850][06674] Fps is (10 sec: 39321.9, 60 sec: 43690.7, 300 sec: 43709.5). Total num frames: 1218445312. Throughput: 0: 43933.1. Samples: 1121411400. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-27 20:50:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 20:50:49,693][06909] Updated weights for policy 0, policy_version 74373 (0.0028) [2024-06-27 20:50:52,590][06909] Updated weights for policy 0, policy_version 74383 (0.0037) [2024-06-27 20:50:53,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.6, 300 sec: 43820.2). Total num frames: 1218707456. Throughput: 0: 43591.9. Samples: 1121658040. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-27 20:50:53,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:50:57,147][06909] Updated weights for policy 0, policy_version 74393 (0.0031) [2024-06-27 20:50:58,850][06674] Fps is (10 sec: 47513.8, 60 sec: 43417.6, 300 sec: 43875.8). Total num frames: 1218920448. Throughput: 0: 43586.3. Samples: 1121801120. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-27 20:50:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:51:00,475][06909] Updated weights for policy 0, policy_version 74403 (0.0043) [2024-06-27 20:51:03,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43965.1, 300 sec: 43820.2). Total num frames: 1219117056. Throughput: 0: 43775.9. Samples: 1122059380. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-27 20:51:03,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:51:04,989][06909] Updated weights for policy 0, policy_version 74413 (0.0033) [2024-06-27 20:51:08,016][06909] Updated weights for policy 0, policy_version 74423 (0.0026) [2024-06-27 20:51:08,850][06674] Fps is (10 sec: 42596.6, 60 sec: 43417.3, 300 sec: 43764.7). Total num frames: 1219346432. Throughput: 0: 43809.5. Samples: 1122316860. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-27 20:51:08,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:51:12,384][06909] Updated weights for policy 0, policy_version 74433 (0.0034) [2024-06-27 20:51:13,850][06674] Fps is (10 sec: 47513.9, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 1219592192. Throughput: 0: 43793.3. Samples: 1122456800. Policy #0 lag: (min: 0.0, avg: 11.2, max: 23.0) [2024-06-27 20:51:13,859][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:51:15,517][06909] Updated weights for policy 0, policy_version 74443 (0.0025) [2024-06-27 20:51:18,850][06674] Fps is (10 sec: 42600.1, 60 sec: 43963.8, 300 sec: 43764.7). Total num frames: 1219772416. Throughput: 0: 43933.3. Samples: 1122719180. Policy #0 lag: (min: 0.0, avg: 11.2, max: 23.0) [2024-06-27 20:51:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:51:19,587][06909] Updated weights for policy 0, policy_version 74453 (0.0031) [2024-06-27 20:51:22,823][06909] Updated weights for policy 0, policy_version 74463 (0.0043) [2024-06-27 20:51:23,852][06674] Fps is (10 sec: 42589.7, 60 sec: 43962.3, 300 sec: 43764.4). Total num frames: 1220018176. Throughput: 0: 43826.5. Samples: 1122981400. Policy #0 lag: (min: 0.0, avg: 11.2, max: 23.0) [2024-06-27 20:51:23,852][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:51:27,246][06909] Updated weights for policy 0, policy_version 74473 (0.0045) [2024-06-27 20:51:28,850][06674] Fps is (10 sec: 47513.0, 60 sec: 43690.6, 300 sec: 43986.8). Total num frames: 1220247552. Throughput: 0: 43788.4. Samples: 1123118700. Policy #0 lag: (min: 0.0, avg: 11.2, max: 23.0) [2024-06-27 20:51:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 20:51:30,256][06909] Updated weights for policy 0, policy_version 74483 (0.0045) [2024-06-27 20:51:33,850][06674] Fps is (10 sec: 42606.7, 60 sec: 44236.7, 300 sec: 43820.2). Total num frames: 1220444160. Throughput: 0: 43625.2. Samples: 1123374540. Policy #0 lag: (min: 0.0, avg: 11.2, max: 23.0) [2024-06-27 20:51:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:51:34,693][06909] Updated weights for policy 0, policy_version 74493 (0.0030) [2024-06-27 20:51:37,989][06909] Updated weights for policy 0, policy_version 74503 (0.0040) [2024-06-27 20:51:38,850][06674] Fps is (10 sec: 42599.4, 60 sec: 43690.8, 300 sec: 43764.7). Total num frames: 1220673536. Throughput: 0: 43793.1. Samples: 1123628720. Policy #0 lag: (min: 0.0, avg: 11.2, max: 23.0) [2024-06-27 20:51:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:51:42,257][06909] Updated weights for policy 0, policy_version 74513 (0.0034) [2024-06-27 20:51:43,585][06887] Signal inference workers to stop experience collection... (16000 times) [2024-06-27 20:51:43,585][06887] Signal inference workers to resume experience collection... (16000 times) [2024-06-27 20:51:43,606][06909] InferenceWorker_p0-w0: stopping experience collection (16000 times) [2024-06-27 20:51:43,607][06909] InferenceWorker_p0-w0: resuming experience collection (16000 times) [2024-06-27 20:51:43,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 1220902912. Throughput: 0: 43680.2. Samples: 1123766740. Policy #0 lag: (min: 0.0, avg: 11.2, max: 23.0) [2024-06-27 20:51:43,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:51:45,410][06909] Updated weights for policy 0, policy_version 74523 (0.0036) [2024-06-27 20:51:48,850][06674] Fps is (10 sec: 40959.4, 60 sec: 43963.7, 300 sec: 43764.7). Total num frames: 1221083136. Throughput: 0: 43842.3. Samples: 1124032280. Policy #0 lag: (min: 0.0, avg: 11.2, max: 23.0) [2024-06-27 20:51:48,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:51:48,875][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000074530_1221099520.pth... [2024-06-27 20:51:48,919][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000073888_1210580992.pth [2024-06-27 20:51:49,655][06909] Updated weights for policy 0, policy_version 74533 (0.0026) [2024-06-27 20:51:52,971][06909] Updated weights for policy 0, policy_version 74543 (0.0034) [2024-06-27 20:51:53,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43963.7, 300 sec: 43820.2). Total num frames: 1221345280. Throughput: 0: 43745.6. Samples: 1124285400. Policy #0 lag: (min: 0.0, avg: 11.2, max: 23.0) [2024-06-27 20:51:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:51:57,050][06909] Updated weights for policy 0, policy_version 74553 (0.0027) [2024-06-27 20:51:58,852][06674] Fps is (10 sec: 47503.7, 60 sec: 43962.2, 300 sec: 43931.0). Total num frames: 1221558272. Throughput: 0: 43751.8. Samples: 1124425720. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 20:51:58,853][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:52:00,506][06909] Updated weights for policy 0, policy_version 74563 (0.0039) [2024-06-27 20:52:03,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43963.8, 300 sec: 43820.3). Total num frames: 1221754880. Throughput: 0: 43761.8. Samples: 1124688460. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 20:52:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 20:52:04,554][06909] Updated weights for policy 0, policy_version 74573 (0.0035) [2024-06-27 20:52:08,030][06909] Updated weights for policy 0, policy_version 74583 (0.0027) [2024-06-27 20:52:08,850][06674] Fps is (10 sec: 44245.2, 60 sec: 44236.9, 300 sec: 43875.8). Total num frames: 1222000640. Throughput: 0: 43583.2. Samples: 1124942560. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 20:52:08,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:52:12,108][06909] Updated weights for policy 0, policy_version 74593 (0.0040) [2024-06-27 20:52:13,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43417.7, 300 sec: 43875.8). Total num frames: 1222197248. Throughput: 0: 43536.6. Samples: 1125077840. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 20:52:13,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:52:15,831][06909] Updated weights for policy 0, policy_version 74603 (0.0029) [2024-06-27 20:52:18,850][06674] Fps is (10 sec: 40960.7, 60 sec: 43963.7, 300 sec: 43764.7). Total num frames: 1222410240. Throughput: 0: 43636.6. Samples: 1125338180. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 20:52:18,856][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:52:19,827][06909] Updated weights for policy 0, policy_version 74613 (0.0020) [2024-06-27 20:52:23,342][06909] Updated weights for policy 0, policy_version 74623 (0.0035) [2024-06-27 20:52:23,850][06674] Fps is (10 sec: 44235.9, 60 sec: 43692.1, 300 sec: 43764.7). Total num frames: 1222639616. Throughput: 0: 43658.9. Samples: 1125593380. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 20:52:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:52:27,175][06909] Updated weights for policy 0, policy_version 74633 (0.0026) [2024-06-27 20:52:28,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43144.7, 300 sec: 43820.3). Total num frames: 1222836224. Throughput: 0: 43659.9. Samples: 1125731420. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 20:52:28,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:52:30,773][06909] Updated weights for policy 0, policy_version 74643 (0.0030) [2024-06-27 20:52:33,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43690.8, 300 sec: 43709.2). Total num frames: 1223065600. Throughput: 0: 43689.3. Samples: 1125998300. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 20:52:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:52:34,537][06909] Updated weights for policy 0, policy_version 74653 (0.0036) [2024-06-27 20:52:38,104][06909] Updated weights for policy 0, policy_version 74663 (0.0042) [2024-06-27 20:52:38,850][06674] Fps is (10 sec: 47512.6, 60 sec: 43963.5, 300 sec: 43764.7). Total num frames: 1223311360. Throughput: 0: 43880.4. Samples: 1126260020. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 20:52:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 20:52:41,847][06909] Updated weights for policy 0, policy_version 74673 (0.0038) [2024-06-27 20:52:43,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43690.8, 300 sec: 43931.3). Total num frames: 1223524352. Throughput: 0: 43744.2. Samples: 1126394120. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 20:52:43,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:52:45,277][06909] Updated weights for policy 0, policy_version 74683 (0.0028) [2024-06-27 20:52:48,850][06674] Fps is (10 sec: 42598.7, 60 sec: 44236.8, 300 sec: 43820.2). Total num frames: 1223737344. Throughput: 0: 43798.1. Samples: 1126659380. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-27 20:52:48,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:52:49,310][06909] Updated weights for policy 0, policy_version 74693 (0.0020) [2024-06-27 20:52:52,749][06909] Updated weights for policy 0, policy_version 74703 (0.0022) [2024-06-27 20:52:53,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43417.7, 300 sec: 43764.7). Total num frames: 1223950336. Throughput: 0: 43957.2. Samples: 1126920620. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-27 20:52:53,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:52:57,057][06909] Updated weights for policy 0, policy_version 74713 (0.0052) [2024-06-27 20:52:58,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43692.1, 300 sec: 43875.8). Total num frames: 1224179712. Throughput: 0: 43831.0. Samples: 1127050240. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-27 20:52:58,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:53:00,490][06909] Updated weights for policy 0, policy_version 74723 (0.0026) [2024-06-27 20:53:03,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.8, 300 sec: 43820.3). Total num frames: 1224392704. Throughput: 0: 44000.1. Samples: 1127318180. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-27 20:53:03,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:53:04,305][06909] Updated weights for policy 0, policy_version 74733 (0.0038) [2024-06-27 20:53:07,865][06909] Updated weights for policy 0, policy_version 74743 (0.0043) [2024-06-27 20:53:08,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 1224622080. Throughput: 0: 44102.2. Samples: 1127577980. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-27 20:53:08,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:53:11,842][06909] Updated weights for policy 0, policy_version 74753 (0.0027) [2024-06-27 20:53:13,850][06674] Fps is (10 sec: 44236.0, 60 sec: 43963.6, 300 sec: 43931.3). Total num frames: 1224835072. Throughput: 0: 44024.7. Samples: 1127712540. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-27 20:53:13,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:53:15,248][06909] Updated weights for policy 0, policy_version 74763 (0.0033) [2024-06-27 20:53:17,103][06887] Signal inference workers to stop experience collection... (16050 times) [2024-06-27 20:53:17,103][06887] Signal inference workers to resume experience collection... (16050 times) [2024-06-27 20:53:17,132][06909] InferenceWorker_p0-w0: stopping experience collection (16050 times) [2024-06-27 20:53:17,132][06909] InferenceWorker_p0-w0: resuming experience collection (16050 times) [2024-06-27 20:53:18,850][06674] Fps is (10 sec: 44237.5, 60 sec: 44236.8, 300 sec: 43820.3). Total num frames: 1225064448. Throughput: 0: 43970.7. Samples: 1127976980. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-27 20:53:18,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:53:19,110][06909] Updated weights for policy 0, policy_version 74773 (0.0029) [2024-06-27 20:53:22,737][06909] Updated weights for policy 0, policy_version 74783 (0.0040) [2024-06-27 20:53:23,852][06674] Fps is (10 sec: 44228.2, 60 sec: 43962.3, 300 sec: 43820.0). Total num frames: 1225277440. Throughput: 0: 43919.9. Samples: 1128236500. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-27 20:53:23,852][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:53:26,736][06909] Updated weights for policy 0, policy_version 74793 (0.0028) [2024-06-27 20:53:28,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.8, 300 sec: 43875.8). Total num frames: 1225490432. Throughput: 0: 43859.2. Samples: 1128367780. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-27 20:53:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:53:30,444][06909] Updated weights for policy 0, policy_version 74803 (0.0034) [2024-06-27 20:53:33,850][06674] Fps is (10 sec: 42607.2, 60 sec: 43963.8, 300 sec: 43820.3). Total num frames: 1225703424. Throughput: 0: 43741.9. Samples: 1128627760. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 20:53:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:53:34,720][06909] Updated weights for policy 0, policy_version 74813 (0.0028) [2024-06-27 20:53:38,358][06909] Updated weights for policy 0, policy_version 74823 (0.0037) [2024-06-27 20:53:38,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.8, 300 sec: 43709.2). Total num frames: 1225932800. Throughput: 0: 43601.3. Samples: 1128882680. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 20:53:38,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:53:42,110][06909] Updated weights for policy 0, policy_version 74833 (0.0019) [2024-06-27 20:53:43,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43417.6, 300 sec: 43875.8). Total num frames: 1226129408. Throughput: 0: 43721.0. Samples: 1129017680. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 20:53:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:53:45,560][06909] Updated weights for policy 0, policy_version 74843 (0.0032) [2024-06-27 20:53:48,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43417.7, 300 sec: 43709.2). Total num frames: 1226342400. Throughput: 0: 43562.6. Samples: 1129278500. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 20:53:48,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:53:48,862][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000074850_1226342400.pth... [2024-06-27 20:53:48,937][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000074209_1215840256.pth [2024-06-27 20:53:49,476][06909] Updated weights for policy 0, policy_version 74853 (0.0036) [2024-06-27 20:53:52,853][06909] Updated weights for policy 0, policy_version 74863 (0.0042) [2024-06-27 20:53:53,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.6, 300 sec: 43709.2). Total num frames: 1226588160. Throughput: 0: 43738.8. Samples: 1129546220. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 20:53:53,852][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:53:56,684][06909] Updated weights for policy 0, policy_version 74873 (0.0030) [2024-06-27 20:53:58,850][06674] Fps is (10 sec: 45874.0, 60 sec: 43690.5, 300 sec: 43931.3). Total num frames: 1226801152. Throughput: 0: 43690.1. Samples: 1129678600. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 20:53:58,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:54:00,348][06909] Updated weights for policy 0, policy_version 74883 (0.0032) [2024-06-27 20:54:03,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.6, 300 sec: 43820.3). Total num frames: 1227030528. Throughput: 0: 43794.6. Samples: 1129947740. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 20:54:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:54:04,217][06909] Updated weights for policy 0, policy_version 74893 (0.0036) [2024-06-27 20:54:07,634][06909] Updated weights for policy 0, policy_version 74903 (0.0046) [2024-06-27 20:54:08,856][06674] Fps is (10 sec: 44210.9, 60 sec: 43686.3, 300 sec: 43763.8). Total num frames: 1227243520. Throughput: 0: 43677.0. Samples: 1130202140. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 20:54:08,857][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:54:11,985][06909] Updated weights for policy 0, policy_version 74913 (0.0029) [2024-06-27 20:54:13,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43417.7, 300 sec: 43820.3). Total num frames: 1227440128. Throughput: 0: 43728.9. Samples: 1130335580. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 20:54:13,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:54:15,282][06909] Updated weights for policy 0, policy_version 74923 (0.0037) [2024-06-27 20:54:18,850][06674] Fps is (10 sec: 42624.0, 60 sec: 43417.5, 300 sec: 43764.7). Total num frames: 1227669504. Throughput: 0: 43656.4. Samples: 1130592300. Policy #0 lag: (min: 0.0, avg: 12.0, max: 21.0) [2024-06-27 20:54:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 20:54:19,544][06909] Updated weights for policy 0, policy_version 74933 (0.0047) [2024-06-27 20:54:23,004][06909] Updated weights for policy 0, policy_version 74943 (0.0049) [2024-06-27 20:54:23,852][06674] Fps is (10 sec: 44227.7, 60 sec: 43417.6, 300 sec: 43708.9). Total num frames: 1227882496. Throughput: 0: 43783.8. Samples: 1130853040. Policy #0 lag: (min: 0.0, avg: 12.0, max: 21.0) [2024-06-27 20:54:23,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 20:54:26,749][06909] Updated weights for policy 0, policy_version 74953 (0.0031) [2024-06-27 20:54:28,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 1228111872. Throughput: 0: 43826.7. Samples: 1130989880. Policy #0 lag: (min: 0.0, avg: 12.0, max: 21.0) [2024-06-27 20:54:28,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:54:30,210][06909] Updated weights for policy 0, policy_version 74963 (0.0028) [2024-06-27 20:54:31,638][06887] Signal inference workers to stop experience collection... (16100 times) [2024-06-27 20:54:31,639][06887] Signal inference workers to resume experience collection... (16100 times) [2024-06-27 20:54:31,651][06909] InferenceWorker_p0-w0: stopping experience collection (16100 times) [2024-06-27 20:54:31,651][06909] InferenceWorker_p0-w0: resuming experience collection (16100 times) [2024-06-27 20:54:33,850][06674] Fps is (10 sec: 45884.7, 60 sec: 43963.8, 300 sec: 43820.3). Total num frames: 1228341248. Throughput: 0: 43998.3. Samples: 1131258420. Policy #0 lag: (min: 0.0, avg: 12.0, max: 21.0) [2024-06-27 20:54:33,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:54:33,959][06909] Updated weights for policy 0, policy_version 74973 (0.0036) [2024-06-27 20:54:37,381][06909] Updated weights for policy 0, policy_version 74983 (0.0034) [2024-06-27 20:54:38,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 1228554240. Throughput: 0: 43966.2. Samples: 1131524700. Policy #0 lag: (min: 0.0, avg: 12.0, max: 21.0) [2024-06-27 20:54:38,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:54:41,218][06909] Updated weights for policy 0, policy_version 74993 (0.0028) [2024-06-27 20:54:43,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 1228767232. Throughput: 0: 43748.7. Samples: 1131647280. Policy #0 lag: (min: 0.0, avg: 12.0, max: 21.0) [2024-06-27 20:54:43,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:54:45,145][06909] Updated weights for policy 0, policy_version 75003 (0.0029) [2024-06-27 20:54:48,856][06674] Fps is (10 sec: 44210.2, 60 sec: 44232.3, 300 sec: 43763.8). Total num frames: 1228996608. Throughput: 0: 43702.6. Samples: 1131914620. Policy #0 lag: (min: 0.0, avg: 12.0, max: 21.0) [2024-06-27 20:54:48,856][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:54:48,964][06909] Updated weights for policy 0, policy_version 75013 (0.0040) [2024-06-27 20:54:52,494][06909] Updated weights for policy 0, policy_version 75023 (0.0036) [2024-06-27 20:54:53,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43417.6, 300 sec: 43653.6). Total num frames: 1229193216. Throughput: 0: 43758.8. Samples: 1132171020. Policy #0 lag: (min: 0.0, avg: 12.0, max: 21.0) [2024-06-27 20:54:53,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:54:56,731][06909] Updated weights for policy 0, policy_version 75033 (0.0032) [2024-06-27 20:54:58,850][06674] Fps is (10 sec: 42623.6, 60 sec: 43690.7, 300 sec: 43876.1). Total num frames: 1229422592. Throughput: 0: 43575.8. Samples: 1132296500. Policy #0 lag: (min: 0.0, avg: 12.0, max: 21.0) [2024-06-27 20:54:58,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:55:00,453][06909] Updated weights for policy 0, policy_version 75043 (0.0035) [2024-06-27 20:55:03,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 1229635584. Throughput: 0: 43760.0. Samples: 1132561500. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 20:55:03,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:55:04,256][06909] Updated weights for policy 0, policy_version 75053 (0.0042) [2024-06-27 20:55:07,883][06909] Updated weights for policy 0, policy_version 75063 (0.0028) [2024-06-27 20:55:08,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43148.8, 300 sec: 43598.1). Total num frames: 1229832192. Throughput: 0: 43785.4. Samples: 1132823300. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 20:55:08,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:55:11,620][06909] Updated weights for policy 0, policy_version 75073 (0.0031) [2024-06-27 20:55:13,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 1230077952. Throughput: 0: 43607.2. Samples: 1132952200. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 20:55:13,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:55:15,234][06909] Updated weights for policy 0, policy_version 75083 (0.0030) [2024-06-27 20:55:18,850][06674] Fps is (10 sec: 47514.0, 60 sec: 43963.8, 300 sec: 43820.3). Total num frames: 1230307328. Throughput: 0: 43565.3. Samples: 1133218860. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 20:55:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:55:18,918][06909] Updated weights for policy 0, policy_version 75093 (0.0033) [2024-06-27 20:55:22,748][06909] Updated weights for policy 0, policy_version 75103 (0.0028) [2024-06-27 20:55:23,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43692.2, 300 sec: 43653.7). Total num frames: 1230503936. Throughput: 0: 43514.3. Samples: 1133482840. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 20:55:23,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 20:55:26,167][06909] Updated weights for policy 0, policy_version 75113 (0.0027) [2024-06-27 20:55:28,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 1230733312. Throughput: 0: 43653.3. Samples: 1133611680. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 20:55:28,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 20:55:30,124][06909] Updated weights for policy 0, policy_version 75123 (0.0035) [2024-06-27 20:55:33,850][06674] Fps is (10 sec: 45874.5, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 1230962688. Throughput: 0: 43686.3. Samples: 1133880240. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 20:55:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:55:34,018][06909] Updated weights for policy 0, policy_version 75133 (0.0025) [2024-06-27 20:55:37,986][06909] Updated weights for policy 0, policy_version 75143 (0.0032) [2024-06-27 20:55:38,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43417.7, 300 sec: 43653.7). Total num frames: 1231159296. Throughput: 0: 43703.2. Samples: 1134137660. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 20:55:38,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:55:41,457][06909] Updated weights for policy 0, policy_version 75153 (0.0030) [2024-06-27 20:55:43,856][06674] Fps is (10 sec: 42572.9, 60 sec: 43686.2, 300 sec: 43874.9). Total num frames: 1231388672. Throughput: 0: 43693.8. Samples: 1134262980. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 20:55:43,856][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:55:45,327][06909] Updated weights for policy 0, policy_version 75163 (0.0044) [2024-06-27 20:55:48,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43695.1, 300 sec: 43764.7). Total num frames: 1231618048. Throughput: 0: 43917.4. Samples: 1134537780. Policy #0 lag: (min: 1.0, avg: 9.9, max: 23.0) [2024-06-27 20:55:48,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:55:48,957][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000075173_1231634432.pth... [2024-06-27 20:55:48,958][06909] Updated weights for policy 0, policy_version 75173 (0.0025) [2024-06-27 20:55:49,006][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000074530_1221099520.pth [2024-06-27 20:55:52,794][06909] Updated weights for policy 0, policy_version 75183 (0.0032) [2024-06-27 20:55:53,850][06674] Fps is (10 sec: 42624.3, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 1231814656. Throughput: 0: 44036.5. Samples: 1134804940. Policy #0 lag: (min: 1.0, avg: 9.9, max: 23.0) [2024-06-27 20:55:53,856][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:55:56,627][06909] Updated weights for policy 0, policy_version 75193 (0.0024) [2024-06-27 20:55:58,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.8, 300 sec: 43820.3). Total num frames: 1232044032. Throughput: 0: 43973.8. Samples: 1134931020. Policy #0 lag: (min: 1.0, avg: 9.9, max: 23.0) [2024-06-27 20:55:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:55:59,474][06887] Signal inference workers to stop experience collection... (16150 times) [2024-06-27 20:55:59,519][06909] InferenceWorker_p0-w0: stopping experience collection (16150 times) [2024-06-27 20:55:59,528][06887] Signal inference workers to resume experience collection... (16150 times) [2024-06-27 20:55:59,533][06909] InferenceWorker_p0-w0: resuming experience collection (16150 times) [2024-06-27 20:56:00,497][06909] Updated weights for policy 0, policy_version 75203 (0.0033) [2024-06-27 20:56:03,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43963.8, 300 sec: 43820.3). Total num frames: 1232273408. Throughput: 0: 44023.6. Samples: 1135199920. Policy #0 lag: (min: 1.0, avg: 9.9, max: 23.0) [2024-06-27 20:56:03,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:56:03,917][06909] Updated weights for policy 0, policy_version 75213 (0.0026) [2024-06-27 20:56:08,014][06909] Updated weights for policy 0, policy_version 75223 (0.0039) [2024-06-27 20:56:08,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.9, 300 sec: 43709.2). Total num frames: 1232486400. Throughput: 0: 43929.7. Samples: 1135459680. Policy #0 lag: (min: 1.0, avg: 9.9, max: 23.0) [2024-06-27 20:56:08,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:56:11,333][06909] Updated weights for policy 0, policy_version 75233 (0.0034) [2024-06-27 20:56:13,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 1232715776. Throughput: 0: 43748.4. Samples: 1135580360. Policy #0 lag: (min: 1.0, avg: 9.9, max: 23.0) [2024-06-27 20:56:13,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:56:15,667][06909] Updated weights for policy 0, policy_version 75243 (0.0027) [2024-06-27 20:56:18,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.7, 300 sec: 43765.0). Total num frames: 1232928768. Throughput: 0: 43813.0. Samples: 1135851820. Policy #0 lag: (min: 1.0, avg: 9.9, max: 23.0) [2024-06-27 20:56:18,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:56:18,979][06909] Updated weights for policy 0, policy_version 75253 (0.0041) [2024-06-27 20:56:23,061][06909] Updated weights for policy 0, policy_version 75263 (0.0031) [2024-06-27 20:56:23,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43690.7, 300 sec: 43653.7). Total num frames: 1233125376. Throughput: 0: 43930.3. Samples: 1136114520. Policy #0 lag: (min: 1.0, avg: 9.9, max: 23.0) [2024-06-27 20:56:23,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:56:26,370][06909] Updated weights for policy 0, policy_version 75273 (0.0032) [2024-06-27 20:56:28,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 43820.3). Total num frames: 1233371136. Throughput: 0: 43915.3. Samples: 1136238900. Policy #0 lag: (min: 1.0, avg: 9.9, max: 23.0) [2024-06-27 20:56:28,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:56:30,535][06909] Updated weights for policy 0, policy_version 75283 (0.0028) [2024-06-27 20:56:33,854][06674] Fps is (10 sec: 47494.5, 60 sec: 43960.9, 300 sec: 43819.6). Total num frames: 1233600512. Throughput: 0: 43842.8. Samples: 1136510880. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 20:56:33,854][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:56:33,859][06909] Updated weights for policy 0, policy_version 75293 (0.0029) [2024-06-27 20:56:37,958][06909] Updated weights for policy 0, policy_version 75303 (0.0039) [2024-06-27 20:56:38,856][06674] Fps is (10 sec: 42573.3, 60 sec: 43959.4, 300 sec: 43708.3). Total num frames: 1233797120. Throughput: 0: 43815.6. Samples: 1136776900. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 20:56:38,856][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:56:41,273][06909] Updated weights for policy 0, policy_version 75313 (0.0041) [2024-06-27 20:56:43,852][06674] Fps is (10 sec: 44245.4, 60 sec: 44239.8, 300 sec: 43931.0). Total num frames: 1234042880. Throughput: 0: 43756.2. Samples: 1136900140. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 20:56:43,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 20:56:45,275][06909] Updated weights for policy 0, policy_version 75323 (0.0027) [2024-06-27 20:56:48,850][06674] Fps is (10 sec: 44262.7, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 1234239488. Throughput: 0: 43639.1. Samples: 1137163680. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 20:56:48,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 20:56:49,029][06909] Updated weights for policy 0, policy_version 75333 (0.0036) [2024-06-27 20:56:53,031][06909] Updated weights for policy 0, policy_version 75343 (0.0030) [2024-06-27 20:56:53,850][06674] Fps is (10 sec: 39329.6, 60 sec: 43690.7, 300 sec: 43654.0). Total num frames: 1234436096. Throughput: 0: 43767.5. Samples: 1137429220. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 20:56:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 20:56:56,270][06909] Updated weights for policy 0, policy_version 75353 (0.0027) [2024-06-27 20:56:58,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.8, 300 sec: 43875.8). Total num frames: 1234698240. Throughput: 0: 43901.7. Samples: 1137555940. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 20:56:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:57:00,524][06909] Updated weights for policy 0, policy_version 75363 (0.0031) [2024-06-27 20:57:03,670][06909] Updated weights for policy 0, policy_version 75373 (0.0024) [2024-06-27 20:57:03,850][06674] Fps is (10 sec: 47513.9, 60 sec: 43963.7, 300 sec: 43764.8). Total num frames: 1234911232. Throughput: 0: 43736.0. Samples: 1137819940. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 20:57:03,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 20:57:08,270][06909] Updated weights for policy 0, policy_version 75383 (0.0028) [2024-06-27 20:57:08,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.7, 300 sec: 43820.2). Total num frames: 1235124224. Throughput: 0: 43916.8. Samples: 1138090780. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 20:57:08,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:57:11,133][06909] Updated weights for policy 0, policy_version 75393 (0.0039) [2024-06-27 20:57:13,850][06674] Fps is (10 sec: 44235.8, 60 sec: 43963.6, 300 sec: 43875.8). Total num frames: 1235353600. Throughput: 0: 43808.7. Samples: 1138210300. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 20:57:13,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:57:15,695][06909] Updated weights for policy 0, policy_version 75403 (0.0033) [2024-06-27 20:57:17,195][06887] Signal inference workers to stop experience collection... (16200 times) [2024-06-27 20:57:17,197][06887] Signal inference workers to resume experience collection... (16200 times) [2024-06-27 20:57:17,231][06909] InferenceWorker_p0-w0: stopping experience collection (16200 times) [2024-06-27 20:57:17,231][06909] InferenceWorker_p0-w0: resuming experience collection (16200 times) [2024-06-27 20:57:18,432][06909] Updated weights for policy 0, policy_version 75413 (0.0024) [2024-06-27 20:57:18,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 43820.3). Total num frames: 1235566592. Throughput: 0: 43714.5. Samples: 1138477860. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 20:57:18,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:57:23,014][06909] Updated weights for policy 0, policy_version 75423 (0.0034) [2024-06-27 20:57:23,850][06674] Fps is (10 sec: 42599.2, 60 sec: 44236.8, 300 sec: 43875.8). Total num frames: 1235779584. Throughput: 0: 43769.7. Samples: 1138746280. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 20:57:23,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:57:26,231][06909] Updated weights for policy 0, policy_version 75433 (0.0024) [2024-06-27 20:57:28,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 1236008960. Throughput: 0: 43854.8. Samples: 1138873520. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 20:57:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:57:30,562][06909] Updated weights for policy 0, policy_version 75443 (0.0046) [2024-06-27 20:57:33,607][06909] Updated weights for policy 0, policy_version 75453 (0.0031) [2024-06-27 20:57:33,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43966.6, 300 sec: 43820.3). Total num frames: 1236238336. Throughput: 0: 43944.9. Samples: 1139141200. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 20:57:33,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:57:38,135][06909] Updated weights for policy 0, policy_version 75463 (0.0042) [2024-06-27 20:57:38,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43695.0, 300 sec: 43709.2). Total num frames: 1236418560. Throughput: 0: 43864.5. Samples: 1139403120. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 20:57:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:57:41,089][06909] Updated weights for policy 0, policy_version 75473 (0.0028) [2024-06-27 20:57:43,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43692.2, 300 sec: 43820.3). Total num frames: 1236664320. Throughput: 0: 43858.2. Samples: 1139529560. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 20:57:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:57:45,584][06909] Updated weights for policy 0, policy_version 75483 (0.0025) [2024-06-27 20:57:48,404][06909] Updated weights for policy 0, policy_version 75493 (0.0026) [2024-06-27 20:57:48,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.7, 300 sec: 43820.2). Total num frames: 1236877312. Throughput: 0: 43899.5. Samples: 1139795420. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 20:57:48,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 20:57:48,863][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000075493_1236877312.pth... [2024-06-27 20:57:48,918][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000074850_1226342400.pth [2024-06-27 20:57:53,143][06909] Updated weights for policy 0, policy_version 75503 (0.0027) [2024-06-27 20:57:53,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44236.8, 300 sec: 43764.7). Total num frames: 1237090304. Throughput: 0: 43717.0. Samples: 1140058040. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 20:57:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:57:56,057][06909] Updated weights for policy 0, policy_version 75513 (0.0047) [2024-06-27 20:57:58,850][06674] Fps is (10 sec: 44236.0, 60 sec: 43690.5, 300 sec: 43820.2). Total num frames: 1237319680. Throughput: 0: 43938.7. Samples: 1140187540. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 20:57:58,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:58:00,478][06909] Updated weights for policy 0, policy_version 75523 (0.0032) [2024-06-27 20:58:03,549][06909] Updated weights for policy 0, policy_version 75533 (0.0034) [2024-06-27 20:58:03,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 1237532672. Throughput: 0: 43781.0. Samples: 1140448000. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 20:58:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:58:07,760][06909] Updated weights for policy 0, policy_version 75543 (0.0026) [2024-06-27 20:58:08,850][06674] Fps is (10 sec: 40960.9, 60 sec: 43417.7, 300 sec: 43709.2). Total num frames: 1237729280. Throughput: 0: 43869.4. Samples: 1140720400. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-27 20:58:08,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:58:10,897][06909] Updated weights for policy 0, policy_version 75553 (0.0031) [2024-06-27 20:58:13,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.8, 300 sec: 43764.7). Total num frames: 1237975040. Throughput: 0: 43746.7. Samples: 1140842120. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-27 20:58:13,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:58:15,646][06909] Updated weights for policy 0, policy_version 75563 (0.0044) [2024-06-27 20:58:18,273][06909] Updated weights for policy 0, policy_version 75573 (0.0030) [2024-06-27 20:58:18,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43690.7, 300 sec: 43765.0). Total num frames: 1238188032. Throughput: 0: 43597.7. Samples: 1141103100. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-27 20:58:18,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:58:23,028][06909] Updated weights for policy 0, policy_version 75583 (0.0037) [2024-06-27 20:58:23,850][06674] Fps is (10 sec: 39321.0, 60 sec: 43144.4, 300 sec: 43653.6). Total num frames: 1238368256. Throughput: 0: 43598.0. Samples: 1141365040. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-27 20:58:23,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:58:26,060][06909] Updated weights for policy 0, policy_version 75593 (0.0023) [2024-06-27 20:58:28,856][06674] Fps is (10 sec: 44210.4, 60 sec: 43686.3, 300 sec: 43819.4). Total num frames: 1238630400. Throughput: 0: 43638.6. Samples: 1141493560. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-27 20:58:28,856][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:58:30,286][06909] Updated weights for policy 0, policy_version 75603 (0.0021) [2024-06-27 20:58:31,321][06887] Signal inference workers to stop experience collection... (16250 times) [2024-06-27 20:58:31,371][06909] InferenceWorker_p0-w0: stopping experience collection (16250 times) [2024-06-27 20:58:31,438][06887] Signal inference workers to resume experience collection... (16250 times) [2024-06-27 20:58:31,439][06909] InferenceWorker_p0-w0: resuming experience collection (16250 times) [2024-06-27 20:58:33,495][06909] Updated weights for policy 0, policy_version 75613 (0.0040) [2024-06-27 20:58:33,850][06674] Fps is (10 sec: 47513.9, 60 sec: 43417.6, 300 sec: 43764.7). Total num frames: 1238843392. Throughput: 0: 43595.9. Samples: 1141757240. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-27 20:58:33,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:58:38,143][06909] Updated weights for policy 0, policy_version 75623 (0.0045) [2024-06-27 20:58:38,850][06674] Fps is (10 sec: 40984.7, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 1239040000. Throughput: 0: 43729.3. Samples: 1142025860. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-27 20:58:38,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:58:40,768][06909] Updated weights for policy 0, policy_version 75633 (0.0034) [2024-06-27 20:58:43,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43417.5, 300 sec: 43820.2). Total num frames: 1239269376. Throughput: 0: 43668.9. Samples: 1142152640. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-27 20:58:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 20:58:45,514][06909] Updated weights for policy 0, policy_version 75643 (0.0040) [2024-06-27 20:58:48,427][06909] Updated weights for policy 0, policy_version 75653 (0.0023) [2024-06-27 20:58:48,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 1239498752. Throughput: 0: 43681.3. Samples: 1142413660. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-27 20:58:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 20:58:53,403][06909] Updated weights for policy 0, policy_version 75663 (0.0031) [2024-06-27 20:58:53,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43144.5, 300 sec: 43653.7). Total num frames: 1239678976. Throughput: 0: 43551.8. Samples: 1142680240. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 20:58:53,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:58:55,714][06909] Updated weights for policy 0, policy_version 75673 (0.0026) [2024-06-27 20:58:58,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 1239941120. Throughput: 0: 43510.6. Samples: 1142800100. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 20:58:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:59:00,581][06909] Updated weights for policy 0, policy_version 75683 (0.0039) [2024-06-27 20:59:03,132][06909] Updated weights for policy 0, policy_version 75693 (0.0029) [2024-06-27 20:59:03,850][06674] Fps is (10 sec: 47513.8, 60 sec: 43690.6, 300 sec: 43765.6). Total num frames: 1240154112. Throughput: 0: 43616.9. Samples: 1143065860. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 20:59:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:59:07,786][06909] Updated weights for policy 0, policy_version 75703 (0.0036) [2024-06-27 20:59:08,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 1240350720. Throughput: 0: 43787.7. Samples: 1143335480. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 20:59:08,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:59:10,663][06909] Updated weights for policy 0, policy_version 75713 (0.0052) [2024-06-27 20:59:13,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43417.5, 300 sec: 43764.7). Total num frames: 1240580096. Throughput: 0: 43723.6. Samples: 1143460860. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 20:59:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 20:59:15,363][06909] Updated weights for policy 0, policy_version 75723 (0.0044) [2024-06-27 20:59:17,987][06909] Updated weights for policy 0, policy_version 75733 (0.0039) [2024-06-27 20:59:18,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43690.7, 300 sec: 43820.6). Total num frames: 1240809472. Throughput: 0: 43750.7. Samples: 1143726020. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 20:59:18,854][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 20:59:22,957][06909] Updated weights for policy 0, policy_version 75743 (0.0036) [2024-06-27 20:59:23,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.8, 300 sec: 43709.2). Total num frames: 1241006080. Throughput: 0: 43779.2. Samples: 1143995920. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 20:59:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:59:25,470][06909] Updated weights for policy 0, policy_version 75753 (0.0026) [2024-06-27 20:59:28,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43421.9, 300 sec: 43709.1). Total num frames: 1241235456. Throughput: 0: 43765.3. Samples: 1144122080. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 20:59:28,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 20:59:30,573][06909] Updated weights for policy 0, policy_version 75763 (0.0031) [2024-06-27 20:59:32,977][06909] Updated weights for policy 0, policy_version 75773 (0.0033) [2024-06-27 20:59:33,850][06674] Fps is (10 sec: 45874.3, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 1241464832. Throughput: 0: 43657.2. Samples: 1144378240. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-27 20:59:33,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 20:59:37,844][06909] Updated weights for policy 0, policy_version 75783 (0.0038) [2024-06-27 20:59:38,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43963.7, 300 sec: 43764.7). Total num frames: 1241677824. Throughput: 0: 43719.6. Samples: 1144647620. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 20:59:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:59:40,529][06909] Updated weights for policy 0, policy_version 75793 (0.0026) [2024-06-27 20:59:43,850][06674] Fps is (10 sec: 44237.7, 60 sec: 43963.8, 300 sec: 43765.6). Total num frames: 1241907200. Throughput: 0: 43771.2. Samples: 1144769800. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 20:59:43,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 20:59:45,171][06909] Updated weights for policy 0, policy_version 75803 (0.0038) [2024-06-27 20:59:47,249][06887] Signal inference workers to stop experience collection... (16300 times) [2024-06-27 20:59:47,249][06887] Signal inference workers to resume experience collection... (16300 times) [2024-06-27 20:59:47,271][06909] InferenceWorker_p0-w0: stopping experience collection (16300 times) [2024-06-27 20:59:47,271][06909] InferenceWorker_p0-w0: resuming experience collection (16300 times) [2024-06-27 20:59:48,205][06909] Updated weights for policy 0, policy_version 75813 (0.0035) [2024-06-27 20:59:48,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 1242136576. Throughput: 0: 43829.8. Samples: 1145038200. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 20:59:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:59:48,870][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000075814_1242136576.pth... [2024-06-27 20:59:48,927][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000075173_1231634432.pth [2024-06-27 20:59:52,967][06909] Updated weights for policy 0, policy_version 75823 (0.0042) [2024-06-27 20:59:53,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44510.0, 300 sec: 43820.3). Total num frames: 1242349568. Throughput: 0: 43808.5. Samples: 1145306860. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 20:59:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 20:59:55,523][06909] Updated weights for policy 0, policy_version 75833 (0.0031) [2024-06-27 20:59:58,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 1242562560. Throughput: 0: 43964.9. Samples: 1145439280. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 20:59:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:00:00,267][06909] Updated weights for policy 0, policy_version 75843 (0.0030) [2024-06-27 21:00:02,774][06909] Updated weights for policy 0, policy_version 75853 (0.0025) [2024-06-27 21:00:03,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 1242775552. Throughput: 0: 43868.5. Samples: 1145700100. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 21:00:03,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:00:07,517][06909] Updated weights for policy 0, policy_version 75863 (0.0034) [2024-06-27 21:00:08,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44236.7, 300 sec: 43820.2). Total num frames: 1243004928. Throughput: 0: 43910.9. Samples: 1145971920. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 21:00:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:00:10,091][06909] Updated weights for policy 0, policy_version 75873 (0.0034) [2024-06-27 21:00:13,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.8, 300 sec: 43709.2). Total num frames: 1243201536. Throughput: 0: 43963.3. Samples: 1146100420. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 21:00:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:00:14,979][06909] Updated weights for policy 0, policy_version 75883 (0.0031) [2024-06-27 21:00:17,679][06909] Updated weights for policy 0, policy_version 75893 (0.0041) [2024-06-27 21:00:18,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43690.7, 300 sec: 43820.2). Total num frames: 1243430912. Throughput: 0: 43943.7. Samples: 1146355700. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 21:00:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:00:22,379][06909] Updated weights for policy 0, policy_version 75903 (0.0030) [2024-06-27 21:00:23,852][06674] Fps is (10 sec: 45865.3, 60 sec: 44235.2, 300 sec: 43819.9). Total num frames: 1243660288. Throughput: 0: 43982.9. Samples: 1146626940. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 21:00:23,853][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 21:00:25,289][06909] Updated weights for policy 0, policy_version 75913 (0.0031) [2024-06-27 21:00:28,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 43764.7). Total num frames: 1243873280. Throughput: 0: 44164.4. Samples: 1146757200. Policy #0 lag: (min: 0.0, avg: 12.6, max: 23.0) [2024-06-27 21:00:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:00:29,623][06909] Updated weights for policy 0, policy_version 75923 (0.0027) [2024-06-27 21:00:32,992][06909] Updated weights for policy 0, policy_version 75933 (0.0029) [2024-06-27 21:00:33,850][06674] Fps is (10 sec: 44245.8, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 1244102656. Throughput: 0: 43960.8. Samples: 1147016440. Policy #0 lag: (min: 0.0, avg: 12.6, max: 23.0) [2024-06-27 21:00:33,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:00:37,335][06909] Updated weights for policy 0, policy_version 75943 (0.0030) [2024-06-27 21:00:38,850][06674] Fps is (10 sec: 45873.6, 60 sec: 44236.5, 300 sec: 43876.6). Total num frames: 1244332032. Throughput: 0: 43996.0. Samples: 1147286700. Policy #0 lag: (min: 0.0, avg: 12.6, max: 23.0) [2024-06-27 21:00:38,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:00:40,363][06909] Updated weights for policy 0, policy_version 75953 (0.0039) [2024-06-27 21:00:43,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 1244528640. Throughput: 0: 43985.8. Samples: 1147418640. Policy #0 lag: (min: 0.0, avg: 12.6, max: 23.0) [2024-06-27 21:00:43,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:00:44,965][06909] Updated weights for policy 0, policy_version 75963 (0.0036) [2024-06-27 21:00:47,640][06909] Updated weights for policy 0, policy_version 75973 (0.0034) [2024-06-27 21:00:48,850][06674] Fps is (10 sec: 44238.6, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 1244774400. Throughput: 0: 43888.9. Samples: 1147675100. Policy #0 lag: (min: 0.0, avg: 12.6, max: 23.0) [2024-06-27 21:00:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 21:00:52,419][06909] Updated weights for policy 0, policy_version 75983 (0.0037) [2024-06-27 21:00:53,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 1244971008. Throughput: 0: 43750.5. Samples: 1147940680. Policy #0 lag: (min: 0.0, avg: 12.6, max: 23.0) [2024-06-27 21:00:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:00:55,387][06909] Updated weights for policy 0, policy_version 75993 (0.0040) [2024-06-27 21:00:56,012][06887] Signal inference workers to stop experience collection... (16350 times) [2024-06-27 21:00:56,012][06887] Signal inference workers to resume experience collection... (16350 times) [2024-06-27 21:00:56,042][06909] InferenceWorker_p0-w0: stopping experience collection (16350 times) [2024-06-27 21:00:56,042][06909] InferenceWorker_p0-w0: resuming experience collection (16350 times) [2024-06-27 21:00:58,850][06674] Fps is (10 sec: 40959.4, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 1245184000. Throughput: 0: 43685.1. Samples: 1148066260. Policy #0 lag: (min: 0.0, avg: 12.6, max: 23.0) [2024-06-27 21:00:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:00:59,760][06909] Updated weights for policy 0, policy_version 76003 (0.0032) [2024-06-27 21:01:02,880][06909] Updated weights for policy 0, policy_version 76013 (0.0037) [2024-06-27 21:01:03,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44236.8, 300 sec: 43875.8). Total num frames: 1245429760. Throughput: 0: 43806.3. Samples: 1148326980. Policy #0 lag: (min: 0.0, avg: 12.6, max: 23.0) [2024-06-27 21:01:03,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:01:07,046][06909] Updated weights for policy 0, policy_version 76023 (0.0037) [2024-06-27 21:01:08,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.8, 300 sec: 43820.2). Total num frames: 1245642752. Throughput: 0: 43829.1. Samples: 1148599160. Policy #0 lag: (min: 0.0, avg: 12.6, max: 23.0) [2024-06-27 21:01:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:01:10,157][06909] Updated weights for policy 0, policy_version 76033 (0.0045) [2024-06-27 21:01:13,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43963.7, 300 sec: 43764.7). Total num frames: 1245839360. Throughput: 0: 43822.2. Samples: 1148729200. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-27 21:01:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:01:14,674][06909] Updated weights for policy 0, policy_version 76043 (0.0032) [2024-06-27 21:01:17,876][06909] Updated weights for policy 0, policy_version 76053 (0.0029) [2024-06-27 21:01:18,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.7, 300 sec: 43931.3). Total num frames: 1246085120. Throughput: 0: 43682.1. Samples: 1148982140. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-27 21:01:18,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:01:22,252][06909] Updated weights for policy 0, policy_version 76063 (0.0031) [2024-06-27 21:01:23,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43965.3, 300 sec: 43820.3). Total num frames: 1246298112. Throughput: 0: 43636.0. Samples: 1149250300. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-27 21:01:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:01:25,633][06909] Updated weights for policy 0, policy_version 76073 (0.0038) [2024-06-27 21:01:28,850][06674] Fps is (10 sec: 39322.3, 60 sec: 43417.6, 300 sec: 43654.2). Total num frames: 1246478336. Throughput: 0: 43554.3. Samples: 1149378580. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-27 21:01:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:01:29,613][06909] Updated weights for policy 0, policy_version 76083 (0.0040) [2024-06-27 21:01:32,878][06909] Updated weights for policy 0, policy_version 76093 (0.0027) [2024-06-27 21:01:33,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.8, 300 sec: 43876.7). Total num frames: 1246740480. Throughput: 0: 43688.0. Samples: 1149641060. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-27 21:01:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:01:36,995][06909] Updated weights for policy 0, policy_version 76103 (0.0031) [2024-06-27 21:01:38,850][06674] Fps is (10 sec: 47513.7, 60 sec: 43691.0, 300 sec: 43765.0). Total num frames: 1246953472. Throughput: 0: 43744.0. Samples: 1149909160. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-27 21:01:38,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:01:40,246][06909] Updated weights for policy 0, policy_version 76113 (0.0024) [2024-06-27 21:01:43,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 1247150080. Throughput: 0: 43813.4. Samples: 1150037860. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-27 21:01:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:01:44,391][06909] Updated weights for policy 0, policy_version 76123 (0.0032) [2024-06-27 21:01:47,538][06909] Updated weights for policy 0, policy_version 76133 (0.0034) [2024-06-27 21:01:48,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 1247395840. Throughput: 0: 43769.7. Samples: 1150296620. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-27 21:01:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:01:48,856][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000076135_1247395840.pth... [2024-06-27 21:01:48,908][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000075493_1236877312.pth [2024-06-27 21:01:52,232][06909] Updated weights for policy 0, policy_version 76143 (0.0046) [2024-06-27 21:01:53,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43963.7, 300 sec: 43764.7). Total num frames: 1247608832. Throughput: 0: 43625.0. Samples: 1150562280. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-27 21:01:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:01:55,372][06909] Updated weights for policy 0, policy_version 76153 (0.0038) [2024-06-27 21:01:58,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 1247805440. Throughput: 0: 43618.2. Samples: 1150692020. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-27 21:01:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:01:59,520][06909] Updated weights for policy 0, policy_version 76163 (0.0037) [2024-06-27 21:02:03,198][06887] Signal inference workers to stop experience collection... (16400 times) [2024-06-27 21:02:03,236][06909] InferenceWorker_p0-w0: stopping experience collection (16400 times) [2024-06-27 21:02:03,264][06887] Signal inference workers to resume experience collection... (16400 times) [2024-06-27 21:02:03,265][06909] InferenceWorker_p0-w0: resuming experience collection (16400 times) [2024-06-27 21:02:03,267][06909] Updated weights for policy 0, policy_version 76173 (0.0035) [2024-06-27 21:02:03,850][06674] Fps is (10 sec: 44235.9, 60 sec: 43690.5, 300 sec: 43820.2). Total num frames: 1248051200. Throughput: 0: 43869.8. Samples: 1150956280. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-27 21:02:03,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:02:07,063][06909] Updated weights for policy 0, policy_version 76183 (0.0024) [2024-06-27 21:02:08,850][06674] Fps is (10 sec: 47514.1, 60 sec: 43963.8, 300 sec: 43820.3). Total num frames: 1248280576. Throughput: 0: 43724.9. Samples: 1151217920. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-27 21:02:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:02:10,430][06909] Updated weights for policy 0, policy_version 76193 (0.0033) [2024-06-27 21:02:13,850][06674] Fps is (10 sec: 40960.6, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 1248460800. Throughput: 0: 43799.1. Samples: 1151349540. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-27 21:02:13,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:02:14,447][06909] Updated weights for policy 0, policy_version 76203 (0.0031) [2024-06-27 21:02:17,664][06909] Updated weights for policy 0, policy_version 76213 (0.0050) [2024-06-27 21:02:18,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.8, 300 sec: 43820.3). Total num frames: 1248706560. Throughput: 0: 43811.5. Samples: 1151612580. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-27 21:02:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:02:22,045][06909] Updated weights for policy 0, policy_version 76223 (0.0039) [2024-06-27 21:02:23,850][06674] Fps is (10 sec: 47513.6, 60 sec: 43963.7, 300 sec: 43820.3). Total num frames: 1248935936. Throughput: 0: 43650.6. Samples: 1151873440. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-27 21:02:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:02:25,069][06909] Updated weights for policy 0, policy_version 76233 (0.0041) [2024-06-27 21:02:28,850][06674] Fps is (10 sec: 39321.2, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 1249099776. Throughput: 0: 43760.4. Samples: 1152007080. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-27 21:02:28,851][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:02:29,710][06909] Updated weights for policy 0, policy_version 76243 (0.0032) [2024-06-27 21:02:32,822][06909] Updated weights for policy 0, policy_version 76253 (0.0022) [2024-06-27 21:02:33,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43417.6, 300 sec: 43820.3). Total num frames: 1249345536. Throughput: 0: 43718.8. Samples: 1152263960. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-27 21:02:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:02:36,974][06909] Updated weights for policy 0, policy_version 76263 (0.0029) [2024-06-27 21:02:38,850][06674] Fps is (10 sec: 50790.5, 60 sec: 44236.7, 300 sec: 43875.8). Total num frames: 1249607680. Throughput: 0: 43863.0. Samples: 1152536120. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-27 21:02:38,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:02:40,478][06909] Updated weights for policy 0, policy_version 76273 (0.0027) [2024-06-27 21:02:43,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 1249771520. Throughput: 0: 43968.9. Samples: 1152670620. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-27 21:02:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:02:44,371][06909] Updated weights for policy 0, policy_version 76283 (0.0030) [2024-06-27 21:02:47,688][06909] Updated weights for policy 0, policy_version 76293 (0.0040) [2024-06-27 21:02:48,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43690.8, 300 sec: 43820.3). Total num frames: 1250017280. Throughput: 0: 43852.2. Samples: 1152929620. Policy #0 lag: (min: 0.0, avg: 10.3, max: 24.0) [2024-06-27 21:02:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:02:52,032][06909] Updated weights for policy 0, policy_version 76303 (0.0046) [2024-06-27 21:02:53,850][06674] Fps is (10 sec: 49152.2, 60 sec: 44236.7, 300 sec: 43875.8). Total num frames: 1250263040. Throughput: 0: 43926.6. Samples: 1153194620. Policy #0 lag: (min: 0.0, avg: 10.3, max: 24.0) [2024-06-27 21:02:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:02:55,133][06909] Updated weights for policy 0, policy_version 76313 (0.0022) [2024-06-27 21:02:58,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43963.7, 300 sec: 43764.7). Total num frames: 1250443264. Throughput: 0: 44102.2. Samples: 1153334140. Policy #0 lag: (min: 0.0, avg: 10.3, max: 24.0) [2024-06-27 21:02:58,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:02:59,689][06909] Updated weights for policy 0, policy_version 76323 (0.0034) [2024-06-27 21:03:02,375][06909] Updated weights for policy 0, policy_version 76333 (0.0036) [2024-06-27 21:03:03,852][06674] Fps is (10 sec: 40951.7, 60 sec: 43689.3, 300 sec: 43875.5). Total num frames: 1250672640. Throughput: 0: 43928.2. Samples: 1153589440. Policy #0 lag: (min: 0.0, avg: 10.3, max: 24.0) [2024-06-27 21:03:03,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:03:06,819][06909] Updated weights for policy 0, policy_version 76343 (0.0039) [2024-06-27 21:03:08,850][06674] Fps is (10 sec: 47513.8, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 1250918400. Throughput: 0: 43972.0. Samples: 1153852180. Policy #0 lag: (min: 0.0, avg: 10.3, max: 24.0) [2024-06-27 21:03:08,856][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:03:09,875][06909] Updated weights for policy 0, policy_version 76353 (0.0040) [2024-06-27 21:03:13,850][06674] Fps is (10 sec: 44246.2, 60 sec: 44236.8, 300 sec: 43820.3). Total num frames: 1251115008. Throughput: 0: 44194.4. Samples: 1153995820. Policy #0 lag: (min: 0.0, avg: 10.3, max: 24.0) [2024-06-27 21:03:13,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:03:14,414][06909] Updated weights for policy 0, policy_version 76363 (0.0026) [2024-06-27 21:03:17,641][06909] Updated weights for policy 0, policy_version 76373 (0.0035) [2024-06-27 21:03:18,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 1251328000. Throughput: 0: 44165.2. Samples: 1154251400. Policy #0 lag: (min: 0.0, avg: 10.3, max: 24.0) [2024-06-27 21:03:18,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:03:21,802][06909] Updated weights for policy 0, policy_version 76383 (0.0036) [2024-06-27 21:03:23,221][06887] Signal inference workers to stop experience collection... (16450 times) [2024-06-27 21:03:23,222][06887] Signal inference workers to resume experience collection... (16450 times) [2024-06-27 21:03:23,262][06909] InferenceWorker_p0-w0: stopping experience collection (16450 times) [2024-06-27 21:03:23,262][06909] InferenceWorker_p0-w0: resuming experience collection (16450 times) [2024-06-27 21:03:23,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.8, 300 sec: 43876.7). Total num frames: 1251573760. Throughput: 0: 43798.4. Samples: 1154507040. Policy #0 lag: (min: 0.0, avg: 10.3, max: 24.0) [2024-06-27 21:03:23,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:03:24,981][06909] Updated weights for policy 0, policy_version 76393 (0.0024) [2024-06-27 21:03:28,850][06674] Fps is (10 sec: 42599.1, 60 sec: 44236.9, 300 sec: 43764.7). Total num frames: 1251753984. Throughput: 0: 43940.1. Samples: 1154647920. Policy #0 lag: (min: 0.0, avg: 10.3, max: 24.0) [2024-06-27 21:03:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 21:03:29,411][06909] Updated weights for policy 0, policy_version 76403 (0.0036) [2024-06-27 21:03:32,169][06909] Updated weights for policy 0, policy_version 76413 (0.0041) [2024-06-27 21:03:33,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 1251983360. Throughput: 0: 43898.2. Samples: 1154905040. Policy #0 lag: (min: 0.0, avg: 12.3, max: 21.0) [2024-06-27 21:03:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:03:36,872][06909] Updated weights for policy 0, policy_version 76423 (0.0038) [2024-06-27 21:03:38,850][06674] Fps is (10 sec: 47512.9, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 1252229120. Throughput: 0: 43878.2. Samples: 1155169140. Policy #0 lag: (min: 0.0, avg: 12.3, max: 21.0) [2024-06-27 21:03:38,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:03:39,652][06909] Updated weights for policy 0, policy_version 76433 (0.0037) [2024-06-27 21:03:43,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.9, 300 sec: 43820.3). Total num frames: 1252425728. Throughput: 0: 43903.2. Samples: 1155309780. Policy #0 lag: (min: 0.0, avg: 12.3, max: 21.0) [2024-06-27 21:03:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:03:44,333][06909] Updated weights for policy 0, policy_version 76443 (0.0020) [2024-06-27 21:03:47,231][06909] Updated weights for policy 0, policy_version 76453 (0.0030) [2024-06-27 21:03:48,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43690.5, 300 sec: 43931.3). Total num frames: 1252638720. Throughput: 0: 43923.7. Samples: 1155565920. Policy #0 lag: (min: 0.0, avg: 12.3, max: 21.0) [2024-06-27 21:03:48,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:03:48,867][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000076455_1252638720.pth... [2024-06-27 21:03:48,918][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000075814_1242136576.pth [2024-06-27 21:03:51,716][06909] Updated weights for policy 0, policy_version 76463 (0.0038) [2024-06-27 21:03:53,850][06674] Fps is (10 sec: 47513.2, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 1252900864. Throughput: 0: 43898.2. Samples: 1155827600. Policy #0 lag: (min: 0.0, avg: 12.3, max: 21.0) [2024-06-27 21:03:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:03:54,967][06909] Updated weights for policy 0, policy_version 76473 (0.0031) [2024-06-27 21:03:58,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.7, 300 sec: 43820.3). Total num frames: 1253081088. Throughput: 0: 43761.7. Samples: 1155965100. Policy #0 lag: (min: 0.0, avg: 12.3, max: 21.0) [2024-06-27 21:03:58,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:03:59,111][06909] Updated weights for policy 0, policy_version 76483 (0.0044) [2024-06-27 21:04:02,063][06909] Updated weights for policy 0, policy_version 76493 (0.0031) [2024-06-27 21:04:03,850][06674] Fps is (10 sec: 39321.8, 60 sec: 43692.2, 300 sec: 43875.8). Total num frames: 1253294080. Throughput: 0: 43880.1. Samples: 1156226000. Policy #0 lag: (min: 0.0, avg: 12.3, max: 21.0) [2024-06-27 21:04:03,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:04:06,469][06909] Updated weights for policy 0, policy_version 76503 (0.0032) [2024-06-27 21:04:08,850][06674] Fps is (10 sec: 45875.9, 60 sec: 43690.8, 300 sec: 43931.4). Total num frames: 1253539840. Throughput: 0: 44029.3. Samples: 1156488360. Policy #0 lag: (min: 0.0, avg: 12.3, max: 21.0) [2024-06-27 21:04:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 21:04:09,723][06909] Updated weights for policy 0, policy_version 76513 (0.0035) [2024-06-27 21:04:13,741][06909] Updated weights for policy 0, policy_version 76523 (0.0037) [2024-06-27 21:04:13,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 1253752832. Throughput: 0: 44044.8. Samples: 1156629940. Policy #0 lag: (min: 0.0, avg: 12.3, max: 21.0) [2024-06-27 21:04:13,852][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:04:16,940][06909] Updated weights for policy 0, policy_version 76533 (0.0035) [2024-06-27 21:04:18,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43690.8, 300 sec: 43875.8). Total num frames: 1253949440. Throughput: 0: 44080.4. Samples: 1156888660. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 21:04:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 21:04:21,339][06909] Updated weights for policy 0, policy_version 76543 (0.0036) [2024-06-27 21:04:23,850][06674] Fps is (10 sec: 47513.4, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 1254227968. Throughput: 0: 44108.5. Samples: 1157154020. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 21:04:23,852][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:04:24,262][06909] Updated weights for policy 0, policy_version 76553 (0.0026) [2024-06-27 21:04:28,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.8, 300 sec: 43820.3). Total num frames: 1254391808. Throughput: 0: 44014.3. Samples: 1157290420. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 21:04:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:04:28,879][06909] Updated weights for policy 0, policy_version 76563 (0.0042) [2024-06-27 21:04:31,917][06909] Updated weights for policy 0, policy_version 76573 (0.0037) [2024-06-27 21:04:33,850][06674] Fps is (10 sec: 40960.1, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 1254637568. Throughput: 0: 44071.7. Samples: 1157549140. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 21:04:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:04:36,283][06909] Updated weights for policy 0, policy_version 76583 (0.0028) [2024-06-27 21:04:38,850][06674] Fps is (10 sec: 47512.6, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 1254866944. Throughput: 0: 44079.0. Samples: 1157811160. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 21:04:38,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:04:39,153][06909] Updated weights for policy 0, policy_version 76593 (0.0037) [2024-06-27 21:04:43,804][06909] Updated weights for policy 0, policy_version 76603 (0.0037) [2024-06-27 21:04:43,852][06674] Fps is (10 sec: 42589.8, 60 sec: 43962.2, 300 sec: 43820.0). Total num frames: 1255063552. Throughput: 0: 44063.4. Samples: 1157948040. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 21:04:43,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:04:46,965][06909] Updated weights for policy 0, policy_version 76613 (0.0040) [2024-06-27 21:04:48,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43963.9, 300 sec: 43820.3). Total num frames: 1255276544. Throughput: 0: 44118.7. Samples: 1158211340. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 21:04:48,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:04:50,421][06887] Signal inference workers to stop experience collection... (16500 times) [2024-06-27 21:04:50,428][06887] Signal inference workers to resume experience collection... (16500 times) [2024-06-27 21:04:50,475][06909] InferenceWorker_p0-w0: stopping experience collection (16500 times) [2024-06-27 21:04:50,475][06909] InferenceWorker_p0-w0: resuming experience collection (16500 times) [2024-06-27 21:04:51,199][06909] Updated weights for policy 0, policy_version 76623 (0.0023) [2024-06-27 21:04:53,850][06674] Fps is (10 sec: 45884.4, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 1255522304. Throughput: 0: 44061.2. Samples: 1158471120. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 21:04:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:04:54,464][06909] Updated weights for policy 0, policy_version 76633 (0.0035) [2024-06-27 21:04:58,668][06909] Updated weights for policy 0, policy_version 76643 (0.0036) [2024-06-27 21:04:58,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 1255718912. Throughput: 0: 43978.6. Samples: 1158608980. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 21:04:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:05:01,792][06909] Updated weights for policy 0, policy_version 76653 (0.0042) [2024-06-27 21:05:03,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44236.8, 300 sec: 43875.8). Total num frames: 1255948288. Throughput: 0: 44111.1. Samples: 1158873660. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 21:05:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:05:05,966][06909] Updated weights for policy 0, policy_version 76663 (0.0027) [2024-06-27 21:05:08,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 1256194048. Throughput: 0: 44015.5. Samples: 1159134720. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 21:05:08,853][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:05:09,143][06909] Updated weights for policy 0, policy_version 76673 (0.0033) [2024-06-27 21:05:13,311][06909] Updated weights for policy 0, policy_version 76683 (0.0031) [2024-06-27 21:05:13,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 1256407040. Throughput: 0: 44005.6. Samples: 1159270680. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 21:05:13,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:05:17,142][06909] Updated weights for policy 0, policy_version 76693 (0.0026) [2024-06-27 21:05:18,850][06674] Fps is (10 sec: 40960.7, 60 sec: 44236.8, 300 sec: 43876.1). Total num frames: 1256603648. Throughput: 0: 44120.1. Samples: 1159534540. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 21:05:18,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:05:20,769][06909] Updated weights for policy 0, policy_version 76703 (0.0039) [2024-06-27 21:05:23,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1256849408. Throughput: 0: 44055.6. Samples: 1159793660. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 21:05:23,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:05:24,323][06909] Updated weights for policy 0, policy_version 76713 (0.0038) [2024-06-27 21:05:28,323][06909] Updated weights for policy 0, policy_version 76723 (0.0022) [2024-06-27 21:05:28,851][06674] Fps is (10 sec: 45867.3, 60 sec: 44508.6, 300 sec: 43931.1). Total num frames: 1257062400. Throughput: 0: 44033.7. Samples: 1159929540. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 21:05:28,852][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:05:32,081][06909] Updated weights for policy 0, policy_version 76733 (0.0042) [2024-06-27 21:05:33,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 1257259008. Throughput: 0: 44102.2. Samples: 1160195940. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 21:05:33,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:05:35,537][06909] Updated weights for policy 0, policy_version 76743 (0.0029) [2024-06-27 21:05:38,850][06674] Fps is (10 sec: 44243.9, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1257504768. Throughput: 0: 44235.1. Samples: 1160461700. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 21:05:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:05:39,229][06909] Updated weights for policy 0, policy_version 76753 (0.0038) [2024-06-27 21:05:43,206][06909] Updated weights for policy 0, policy_version 76763 (0.0022) [2024-06-27 21:05:43,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44238.3, 300 sec: 43875.8). Total num frames: 1257717760. Throughput: 0: 44008.1. Samples: 1160589340. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 21:05:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:05:47,190][06909] Updated weights for policy 0, policy_version 76773 (0.0037) [2024-06-27 21:05:48,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.7, 300 sec: 43931.3). Total num frames: 1257930752. Throughput: 0: 44070.6. Samples: 1160856840. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-27 21:05:48,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:05:48,870][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000076778_1257930752.pth... [2024-06-27 21:05:48,923][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000076135_1247395840.pth [2024-06-27 21:05:50,638][06909] Updated weights for policy 0, policy_version 76783 (0.0035) [2024-06-27 21:05:53,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1258160128. Throughput: 0: 43985.5. Samples: 1161114060. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 21:05:53,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 21:05:54,451][06909] Updated weights for policy 0, policy_version 76793 (0.0023) [2024-06-27 21:05:57,925][06909] Updated weights for policy 0, policy_version 76803 (0.0035) [2024-06-27 21:05:58,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44509.9, 300 sec: 43931.3). Total num frames: 1258389504. Throughput: 0: 43895.6. Samples: 1161245980. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 21:05:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:06:01,694][06909] Updated weights for policy 0, policy_version 76813 (0.0040) [2024-06-27 21:06:03,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 1258569728. Throughput: 0: 43924.8. Samples: 1161511160. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 21:06:03,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:06:05,762][06909] Updated weights for policy 0, policy_version 76823 (0.0031) [2024-06-27 21:06:08,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43417.7, 300 sec: 43931.3). Total num frames: 1258799104. Throughput: 0: 43811.6. Samples: 1161765180. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 21:06:08,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 21:06:09,475][06909] Updated weights for policy 0, policy_version 76833 (0.0030) [2024-06-27 21:06:13,145][06909] Updated weights for policy 0, policy_version 76843 (0.0026) [2024-06-27 21:06:13,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43690.8, 300 sec: 43875.8). Total num frames: 1259028480. Throughput: 0: 43825.2. Samples: 1161901600. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 21:06:13,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 21:06:16,955][06909] Updated weights for policy 0, policy_version 76853 (0.0028) [2024-06-27 21:06:18,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.6, 300 sec: 43875.8). Total num frames: 1259241472. Throughput: 0: 43706.5. Samples: 1162162740. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 21:06:18,851][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:06:20,460][06909] Updated weights for policy 0, policy_version 76863 (0.0031) [2024-06-27 21:06:23,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43417.7, 300 sec: 43986.9). Total num frames: 1259454464. Throughput: 0: 43700.6. Samples: 1162428220. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 21:06:23,856][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:06:24,558][06909] Updated weights for policy 0, policy_version 76873 (0.0030) [2024-06-27 21:06:25,044][06887] Signal inference workers to stop experience collection... (16550 times) [2024-06-27 21:06:25,044][06887] Signal inference workers to resume experience collection... (16550 times) [2024-06-27 21:06:25,058][06909] InferenceWorker_p0-w0: stopping experience collection (16550 times) [2024-06-27 21:06:25,058][06909] InferenceWorker_p0-w0: resuming experience collection (16550 times) [2024-06-27 21:06:27,881][06909] Updated weights for policy 0, policy_version 76883 (0.0032) [2024-06-27 21:06:28,850][06674] Fps is (10 sec: 45875.9, 60 sec: 43965.0, 300 sec: 43931.3). Total num frames: 1259700224. Throughput: 0: 43741.8. Samples: 1162557720. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 21:06:28,856][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:06:31,664][06909] Updated weights for policy 0, policy_version 76893 (0.0031) [2024-06-27 21:06:33,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 1259896832. Throughput: 0: 43727.2. Samples: 1162824560. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 21:06:33,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 21:06:35,083][06909] Updated weights for policy 0, policy_version 76903 (0.0033) [2024-06-27 21:06:38,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1260126208. Throughput: 0: 43882.2. Samples: 1163088760. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 21:06:38,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:06:38,952][06909] Updated weights for policy 0, policy_version 76913 (0.0030) [2024-06-27 21:06:42,777][06909] Updated weights for policy 0, policy_version 76923 (0.0029) [2024-06-27 21:06:43,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 1260355584. Throughput: 0: 43859.6. Samples: 1163219660. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 21:06:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:06:46,592][06909] Updated weights for policy 0, policy_version 76933 (0.0038) [2024-06-27 21:06:48,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 1260552192. Throughput: 0: 43843.6. Samples: 1163484120. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 21:06:48,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:06:50,258][06909] Updated weights for policy 0, policy_version 76943 (0.0028) [2024-06-27 21:06:53,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43417.6, 300 sec: 43931.4). Total num frames: 1260765184. Throughput: 0: 44205.4. Samples: 1163754420. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 21:06:53,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 21:06:54,345][06909] Updated weights for policy 0, policy_version 76953 (0.0028) [2024-06-27 21:06:57,604][06909] Updated weights for policy 0, policy_version 76963 (0.0042) [2024-06-27 21:06:58,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 1261010944. Throughput: 0: 43967.4. Samples: 1163880140. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 21:06:58,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:07:01,733][06909] Updated weights for policy 0, policy_version 76973 (0.0028) [2024-06-27 21:07:03,850][06674] Fps is (10 sec: 47512.8, 60 sec: 44509.8, 300 sec: 43931.3). Total num frames: 1261240320. Throughput: 0: 43993.3. Samples: 1164142440. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 21:07:03,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 21:07:05,069][06909] Updated weights for policy 0, policy_version 76983 (0.0034) [2024-06-27 21:07:08,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 1261420544. Throughput: 0: 44178.5. Samples: 1164416260. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 21:07:08,856][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:07:09,185][06909] Updated weights for policy 0, policy_version 76993 (0.0037) [2024-06-27 21:07:12,603][06909] Updated weights for policy 0, policy_version 77003 (0.0031) [2024-06-27 21:07:13,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 1261666304. Throughput: 0: 44006.2. Samples: 1164538000. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 21:07:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:07:16,513][06909] Updated weights for policy 0, policy_version 77013 (0.0030) [2024-06-27 21:07:18,850][06674] Fps is (10 sec: 45876.1, 60 sec: 43963.9, 300 sec: 43875.8). Total num frames: 1261879296. Throughput: 0: 44047.6. Samples: 1164806700. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 21:07:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:07:20,041][06909] Updated weights for policy 0, policy_version 77023 (0.0035) [2024-06-27 21:07:23,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 1262075904. Throughput: 0: 44005.8. Samples: 1165069020. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 21:07:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:07:24,094][06909] Updated weights for policy 0, policy_version 77033 (0.0021) [2024-06-27 21:07:27,354][06909] Updated weights for policy 0, policy_version 77043 (0.0030) [2024-06-27 21:07:28,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1262321664. Throughput: 0: 43930.7. Samples: 1165196540. Policy #0 lag: (min: 1.0, avg: 11.6, max: 23.0) [2024-06-27 21:07:28,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:07:31,521][06909] Updated weights for policy 0, policy_version 77053 (0.0057) [2024-06-27 21:07:33,850][06674] Fps is (10 sec: 47512.9, 60 sec: 44236.7, 300 sec: 43875.8). Total num frames: 1262551040. Throughput: 0: 44087.8. Samples: 1165468080. Policy #0 lag: (min: 1.0, avg: 11.6, max: 23.0) [2024-06-27 21:07:33,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 21:07:34,579][06909] Updated weights for policy 0, policy_version 77063 (0.0028) [2024-06-27 21:07:38,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1262747648. Throughput: 0: 44021.3. Samples: 1165735380. Policy #0 lag: (min: 1.0, avg: 11.6, max: 23.0) [2024-06-27 21:07:38,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:07:39,120][06909] Updated weights for policy 0, policy_version 77073 (0.0044) [2024-06-27 21:07:42,151][06909] Updated weights for policy 0, policy_version 77083 (0.0035) [2024-06-27 21:07:43,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1262993408. Throughput: 0: 44008.6. Samples: 1165860520. Policy #0 lag: (min: 1.0, avg: 11.6, max: 23.0) [2024-06-27 21:07:43,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 21:07:46,463][06909] Updated weights for policy 0, policy_version 77093 (0.0037) [2024-06-27 21:07:48,850][06674] Fps is (10 sec: 47512.9, 60 sec: 44509.8, 300 sec: 43931.3). Total num frames: 1263222784. Throughput: 0: 44021.8. Samples: 1166123420. Policy #0 lag: (min: 1.0, avg: 11.6, max: 23.0) [2024-06-27 21:07:48,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 21:07:48,861][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000077101_1263222784.pth... [2024-06-27 21:07:48,911][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000076455_1252638720.pth [2024-06-27 21:07:49,455][06909] Updated weights for policy 0, policy_version 77103 (0.0024) [2024-06-27 21:07:49,992][06887] Signal inference workers to stop experience collection... (16600 times) [2024-06-27 21:07:50,039][06909] InferenceWorker_p0-w0: stopping experience collection (16600 times) [2024-06-27 21:07:50,042][06887] Signal inference workers to resume experience collection... (16600 times) [2024-06-27 21:07:50,051][06909] InferenceWorker_p0-w0: resuming experience collection (16600 times) [2024-06-27 21:07:53,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 1263403008. Throughput: 0: 43897.4. Samples: 1166391640. Policy #0 lag: (min: 1.0, avg: 11.6, max: 23.0) [2024-06-27 21:07:53,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:07:54,241][06909] Updated weights for policy 0, policy_version 77113 (0.0035) [2024-06-27 21:07:57,051][06909] Updated weights for policy 0, policy_version 77123 (0.0035) [2024-06-27 21:07:58,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.7, 300 sec: 43987.2). Total num frames: 1263648768. Throughput: 0: 43869.2. Samples: 1166512120. Policy #0 lag: (min: 1.0, avg: 11.6, max: 23.0) [2024-06-27 21:07:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:08:01,446][06909] Updated weights for policy 0, policy_version 77133 (0.0035) [2024-06-27 21:08:03,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43690.8, 300 sec: 43875.8). Total num frames: 1263861760. Throughput: 0: 43886.6. Samples: 1166781600. Policy #0 lag: (min: 1.0, avg: 11.6, max: 23.0) [2024-06-27 21:08:03,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 21:08:04,618][06909] Updated weights for policy 0, policy_version 77143 (0.0029) [2024-06-27 21:08:08,852][06674] Fps is (10 sec: 40952.2, 60 sec: 43962.3, 300 sec: 43875.5). Total num frames: 1264058368. Throughput: 0: 43910.0. Samples: 1167045060. Policy #0 lag: (min: 1.0, avg: 11.6, max: 23.0) [2024-06-27 21:08:08,852][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:08:09,006][06909] Updated weights for policy 0, policy_version 77153 (0.0030) [2024-06-27 21:08:11,878][06909] Updated weights for policy 0, policy_version 77163 (0.0036) [2024-06-27 21:08:13,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1264304128. Throughput: 0: 43864.8. Samples: 1167170460. Policy #0 lag: (min: 1.0, avg: 11.6, max: 23.0) [2024-06-27 21:08:13,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:08:16,458][06909] Updated weights for policy 0, policy_version 77173 (0.0040) [2024-06-27 21:08:18,850][06674] Fps is (10 sec: 47523.4, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 1264533504. Throughput: 0: 43885.9. Samples: 1167442940. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-27 21:08:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:08:19,459][06909] Updated weights for policy 0, policy_version 77183 (0.0036) [2024-06-27 21:08:23,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 1264713728. Throughput: 0: 43799.0. Samples: 1167706340. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-27 21:08:23,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 21:08:24,009][06909] Updated weights for policy 0, policy_version 77193 (0.0027) [2024-06-27 21:08:26,843][06909] Updated weights for policy 0, policy_version 77203 (0.0026) [2024-06-27 21:08:28,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1264959488. Throughput: 0: 43731.6. Samples: 1167828440. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-27 21:08:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:08:31,483][06909] Updated weights for policy 0, policy_version 77213 (0.0029) [2024-06-27 21:08:33,850][06674] Fps is (10 sec: 47513.5, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 1265188864. Throughput: 0: 43849.8. Samples: 1168096660. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-27 21:08:33,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:08:34,421][06909] Updated weights for policy 0, policy_version 77223 (0.0045) [2024-06-27 21:08:38,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 1265385472. Throughput: 0: 43922.7. Samples: 1168368160. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-27 21:08:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:08:38,852][06909] Updated weights for policy 0, policy_version 77233 (0.0034) [2024-06-27 21:08:42,203][06909] Updated weights for policy 0, policy_version 77243 (0.0030) [2024-06-27 21:08:43,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 1265614848. Throughput: 0: 43879.6. Samples: 1168486700. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-27 21:08:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:08:46,323][06909] Updated weights for policy 0, policy_version 77253 (0.0031) [2024-06-27 21:08:48,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 1265844224. Throughput: 0: 43885.2. Samples: 1168756440. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-27 21:08:48,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:08:49,320][06909] Updated weights for policy 0, policy_version 77263 (0.0031) [2024-06-27 21:08:53,589][06909] Updated weights for policy 0, policy_version 77273 (0.0040) [2024-06-27 21:08:53,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 1266040832. Throughput: 0: 43937.0. Samples: 1169022140. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-27 21:08:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:08:56,691][06909] Updated weights for policy 0, policy_version 77283 (0.0025) [2024-06-27 21:08:58,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43690.8, 300 sec: 43986.9). Total num frames: 1266270208. Throughput: 0: 43993.4. Samples: 1169150160. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-27 21:08:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:09:01,231][06909] Updated weights for policy 0, policy_version 77293 (0.0031) [2024-06-27 21:09:03,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 1266499584. Throughput: 0: 43795.9. Samples: 1169413760. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-27 21:09:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:09:04,484][06909] Updated weights for policy 0, policy_version 77303 (0.0035) [2024-06-27 21:09:08,832][06909] Updated weights for policy 0, policy_version 77313 (0.0029) [2024-06-27 21:09:08,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43965.3, 300 sec: 43875.8). Total num frames: 1266696192. Throughput: 0: 43824.1. Samples: 1169678420. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-27 21:09:08,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:09:11,969][06909] Updated weights for policy 0, policy_version 77323 (0.0029) [2024-06-27 21:09:13,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43690.8, 300 sec: 43986.9). Total num frames: 1266925568. Throughput: 0: 43837.9. Samples: 1169801140. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-27 21:09:13,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:09:15,999][06909] Updated weights for policy 0, policy_version 77333 (0.0032) [2024-06-27 21:09:18,852][06674] Fps is (10 sec: 47503.5, 60 sec: 43962.2, 300 sec: 43875.5). Total num frames: 1267171328. Throughput: 0: 43848.3. Samples: 1170069920. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-27 21:09:18,852][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:09:19,167][06909] Updated weights for policy 0, policy_version 77343 (0.0036) [2024-06-27 21:09:23,573][06909] Updated weights for policy 0, policy_version 77353 (0.0042) [2024-06-27 21:09:23,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 1267351552. Throughput: 0: 43704.9. Samples: 1170334880. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-27 21:09:23,850][06674] Avg episode reward: [(0, '0.398')] [2024-06-27 21:09:24,740][06887] Signal inference workers to stop experience collection... (16650 times) [2024-06-27 21:09:24,767][06909] InferenceWorker_p0-w0: stopping experience collection (16650 times) [2024-06-27 21:09:24,803][06887] Signal inference workers to resume experience collection... (16650 times) [2024-06-27 21:09:24,804][06909] InferenceWorker_p0-w0: resuming experience collection (16650 times) [2024-06-27 21:09:26,604][06909] Updated weights for policy 0, policy_version 77363 (0.0030) [2024-06-27 21:09:28,850][06674] Fps is (10 sec: 42606.6, 60 sec: 43963.6, 300 sec: 43931.3). Total num frames: 1267597312. Throughput: 0: 43932.9. Samples: 1170463680. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-27 21:09:28,853][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:09:31,013][06909] Updated weights for policy 0, policy_version 77373 (0.0028) [2024-06-27 21:09:33,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43690.8, 300 sec: 43875.8). Total num frames: 1267810304. Throughput: 0: 43850.9. Samples: 1170729720. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-27 21:09:33,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:09:34,109][06909] Updated weights for policy 0, policy_version 77383 (0.0035) [2024-06-27 21:09:38,322][06909] Updated weights for policy 0, policy_version 77393 (0.0034) [2024-06-27 21:09:38,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43690.6, 300 sec: 43876.1). Total num frames: 1268006912. Throughput: 0: 43835.1. Samples: 1170994720. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-27 21:09:38,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 21:09:41,443][06909] Updated weights for policy 0, policy_version 77403 (0.0020) [2024-06-27 21:09:43,852][06674] Fps is (10 sec: 44227.3, 60 sec: 43962.3, 300 sec: 43986.6). Total num frames: 1268252672. Throughput: 0: 43768.2. Samples: 1171119820. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-27 21:09:43,852][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 21:09:45,928][06909] Updated weights for policy 0, policy_version 77413 (0.0021) [2024-06-27 21:09:48,850][06674] Fps is (10 sec: 47513.6, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 1268482048. Throughput: 0: 43794.2. Samples: 1171384500. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-27 21:09:48,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 21:09:48,856][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000077422_1268482048.pth... [2024-06-27 21:09:48,904][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000076778_1257930752.pth [2024-06-27 21:09:49,097][06909] Updated weights for policy 0, policy_version 77423 (0.0030) [2024-06-27 21:09:53,096][06909] Updated weights for policy 0, policy_version 77433 (0.0029) [2024-06-27 21:09:53,850][06674] Fps is (10 sec: 44245.7, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1268695040. Throughput: 0: 43992.3. Samples: 1171658080. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-27 21:09:53,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:09:56,353][06909] Updated weights for policy 0, policy_version 77443 (0.0031) [2024-06-27 21:09:58,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 1268908032. Throughput: 0: 44068.7. Samples: 1171784240. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-27 21:09:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:10:00,609][06909] Updated weights for policy 0, policy_version 77453 (0.0044) [2024-06-27 21:10:03,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 1269153792. Throughput: 0: 43984.2. Samples: 1172049120. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-27 21:10:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:10:03,855][06909] Updated weights for policy 0, policy_version 77463 (0.0038) [2024-06-27 21:10:07,959][06909] Updated weights for policy 0, policy_version 77473 (0.0032) [2024-06-27 21:10:08,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.7, 300 sec: 43820.3). Total num frames: 1269334016. Throughput: 0: 44020.4. Samples: 1172315800. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-27 21:10:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:10:11,424][06909] Updated weights for policy 0, policy_version 77483 (0.0033) [2024-06-27 21:10:13,850][06674] Fps is (10 sec: 39321.1, 60 sec: 43690.5, 300 sec: 43875.8). Total num frames: 1269547008. Throughput: 0: 43938.6. Samples: 1172440920. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-27 21:10:13,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:10:15,437][06909] Updated weights for policy 0, policy_version 77493 (0.0026) [2024-06-27 21:10:18,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43692.2, 300 sec: 43875.8). Total num frames: 1269792768. Throughput: 0: 43857.7. Samples: 1172703320. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-27 21:10:18,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 21:10:19,111][06909] Updated weights for policy 0, policy_version 77503 (0.0028) [2024-06-27 21:10:23,487][06909] Updated weights for policy 0, policy_version 77513 (0.0049) [2024-06-27 21:10:23,850][06674] Fps is (10 sec: 44237.7, 60 sec: 43963.7, 300 sec: 43820.5). Total num frames: 1269989376. Throughput: 0: 43887.7. Samples: 1172969660. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-27 21:10:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:10:26,598][06909] Updated weights for policy 0, policy_version 77523 (0.0029) [2024-06-27 21:10:28,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43417.7, 300 sec: 43875.8). Total num frames: 1270202368. Throughput: 0: 43846.1. Samples: 1173092800. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-27 21:10:28,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:10:30,739][06909] Updated weights for policy 0, policy_version 77533 (0.0035) [2024-06-27 21:10:33,815][06909] Updated weights for policy 0, policy_version 77543 (0.0027) [2024-06-27 21:10:33,850][06674] Fps is (10 sec: 47512.8, 60 sec: 44236.7, 300 sec: 43931.3). Total num frames: 1270464512. Throughput: 0: 43836.8. Samples: 1173357160. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-27 21:10:33,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:10:37,916][06909] Updated weights for policy 0, policy_version 77553 (0.0033) [2024-06-27 21:10:38,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.8, 300 sec: 43820.3). Total num frames: 1270644736. Throughput: 0: 43807.7. Samples: 1173629420. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-27 21:10:38,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:10:40,737][06887] Signal inference workers to stop experience collection... (16700 times) [2024-06-27 21:10:40,740][06887] Signal inference workers to resume experience collection... (16700 times) [2024-06-27 21:10:40,787][06909] InferenceWorker_p0-w0: stopping experience collection (16700 times) [2024-06-27 21:10:40,787][06909] InferenceWorker_p0-w0: resuming experience collection (16700 times) [2024-06-27 21:10:41,038][06909] Updated weights for policy 0, policy_version 77563 (0.0032) [2024-06-27 21:10:43,850][06674] Fps is (10 sec: 39322.1, 60 sec: 43419.1, 300 sec: 43820.3). Total num frames: 1270857728. Throughput: 0: 43715.2. Samples: 1173751420. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-27 21:10:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:10:45,453][06909] Updated weights for policy 0, policy_version 77573 (0.0043) [2024-06-27 21:10:48,668][06909] Updated weights for policy 0, policy_version 77583 (0.0050) [2024-06-27 21:10:48,850][06674] Fps is (10 sec: 47513.4, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 1271119872. Throughput: 0: 43668.0. Samples: 1174014180. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-27 21:10:48,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:10:52,851][06909] Updated weights for policy 0, policy_version 77593 (0.0041) [2024-06-27 21:10:53,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 1271316480. Throughput: 0: 43747.5. Samples: 1174284440. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-27 21:10:53,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 21:10:56,454][06909] Updated weights for policy 0, policy_version 77603 (0.0037) [2024-06-27 21:10:58,850][06674] Fps is (10 sec: 39321.4, 60 sec: 43417.6, 300 sec: 43875.8). Total num frames: 1271513088. Throughput: 0: 43818.8. Samples: 1174412760. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-27 21:10:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:11:00,461][06909] Updated weights for policy 0, policy_version 77613 (0.0044) [2024-06-27 21:11:03,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43417.5, 300 sec: 43931.3). Total num frames: 1271758848. Throughput: 0: 43875.8. Samples: 1174677740. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-27 21:11:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:11:04,207][06909] Updated weights for policy 0, policy_version 77623 (0.0026) [2024-06-27 21:11:08,023][06909] Updated weights for policy 0, policy_version 77633 (0.0030) [2024-06-27 21:11:08,850][06674] Fps is (10 sec: 47513.8, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 1271988224. Throughput: 0: 43883.5. Samples: 1174944420. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-27 21:11:08,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:11:11,311][06909] Updated weights for policy 0, policy_version 77643 (0.0036) [2024-06-27 21:11:13,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 1272184832. Throughput: 0: 43940.4. Samples: 1175070120. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-27 21:11:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:11:15,256][06909] Updated weights for policy 0, policy_version 77653 (0.0042) [2024-06-27 21:11:18,638][06909] Updated weights for policy 0, policy_version 77663 (0.0044) [2024-06-27 21:11:18,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1272430592. Throughput: 0: 43960.1. Samples: 1175335360. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-27 21:11:18,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:11:22,820][06909] Updated weights for policy 0, policy_version 77673 (0.0039) [2024-06-27 21:11:23,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43963.8, 300 sec: 43820.3). Total num frames: 1272627200. Throughput: 0: 43807.6. Samples: 1175600760. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-27 21:11:23,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:11:26,150][06909] Updated weights for policy 0, policy_version 77683 (0.0030) [2024-06-27 21:11:28,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 1272840192. Throughput: 0: 43909.8. Samples: 1175727360. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 21:11:28,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:11:30,703][06909] Updated weights for policy 0, policy_version 77693 (0.0030) [2024-06-27 21:11:33,730][06909] Updated weights for policy 0, policy_version 77703 (0.0026) [2024-06-27 21:11:33,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43690.8, 300 sec: 43931.3). Total num frames: 1273085952. Throughput: 0: 43836.0. Samples: 1175986800. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 21:11:33,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:11:38,195][06909] Updated weights for policy 0, policy_version 77713 (0.0027) [2024-06-27 21:11:38,850][06674] Fps is (10 sec: 45874.5, 60 sec: 44236.7, 300 sec: 43875.8). Total num frames: 1273298944. Throughput: 0: 43862.6. Samples: 1176258260. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 21:11:38,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:11:41,678][06909] Updated weights for policy 0, policy_version 77723 (0.0038) [2024-06-27 21:11:43,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 1273495552. Throughput: 0: 43897.4. Samples: 1176388140. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 21:11:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:11:45,492][06909] Updated weights for policy 0, policy_version 77733 (0.0038) [2024-06-27 21:11:48,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43417.6, 300 sec: 43931.3). Total num frames: 1273724928. Throughput: 0: 43723.2. Samples: 1176645280. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 21:11:48,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:11:48,868][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000077742_1273724928.pth... [2024-06-27 21:11:48,935][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000077101_1263222784.pth [2024-06-27 21:11:49,088][06909] Updated weights for policy 0, policy_version 77743 (0.0041) [2024-06-27 21:11:52,020][06887] Signal inference workers to stop experience collection... (16750 times) [2024-06-27 21:11:52,075][06909] InferenceWorker_p0-w0: stopping experience collection (16750 times) [2024-06-27 21:11:52,138][06887] Signal inference workers to resume experience collection... (16750 times) [2024-06-27 21:11:52,138][06909] InferenceWorker_p0-w0: resuming experience collection (16750 times) [2024-06-27 21:11:52,870][06909] Updated weights for policy 0, policy_version 77753 (0.0039) [2024-06-27 21:11:53,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 1273954304. Throughput: 0: 43781.0. Samples: 1176914560. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 21:11:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:11:56,282][06909] Updated weights for policy 0, policy_version 77763 (0.0028) [2024-06-27 21:11:58,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 1274134528. Throughput: 0: 43780.9. Samples: 1177040260. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 21:11:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 21:12:00,645][06909] Updated weights for policy 0, policy_version 77773 (0.0025) [2024-06-27 21:12:03,715][06909] Updated weights for policy 0, policy_version 77783 (0.0028) [2024-06-27 21:12:03,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1274396672. Throughput: 0: 43629.3. Samples: 1177298680. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 21:12:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:12:07,996][06909] Updated weights for policy 0, policy_version 77793 (0.0045) [2024-06-27 21:12:08,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43417.7, 300 sec: 43820.3). Total num frames: 1274593280. Throughput: 0: 43709.3. Samples: 1177567680. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 21:12:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:12:11,402][06909] Updated weights for policy 0, policy_version 77803 (0.0029) [2024-06-27 21:12:13,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 1274806272. Throughput: 0: 43766.2. Samples: 1177696840. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-27 21:12:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:12:15,588][06909] Updated weights for policy 0, policy_version 77813 (0.0041) [2024-06-27 21:12:18,850][06674] Fps is (10 sec: 44235.7, 60 sec: 43417.5, 300 sec: 43931.3). Total num frames: 1275035648. Throughput: 0: 43858.0. Samples: 1177960420. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-27 21:12:18,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:12:18,950][06909] Updated weights for policy 0, policy_version 77823 (0.0034) [2024-06-27 21:12:22,812][06909] Updated weights for policy 0, policy_version 77833 (0.0046) [2024-06-27 21:12:23,850][06674] Fps is (10 sec: 47513.1, 60 sec: 44236.7, 300 sec: 43931.3). Total num frames: 1275281408. Throughput: 0: 43704.9. Samples: 1178224980. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-27 21:12:23,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:12:26,227][06909] Updated weights for policy 0, policy_version 77843 (0.0036) [2024-06-27 21:12:28,850][06674] Fps is (10 sec: 42599.6, 60 sec: 43690.7, 300 sec: 43764.8). Total num frames: 1275461632. Throughput: 0: 43670.3. Samples: 1178353300. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-27 21:12:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:12:29,970][06909] Updated weights for policy 0, policy_version 77853 (0.0040) [2024-06-27 21:12:33,686][06909] Updated weights for policy 0, policy_version 77863 (0.0028) [2024-06-27 21:12:33,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 1275707392. Throughput: 0: 43844.9. Samples: 1178618300. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-27 21:12:33,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 21:12:37,770][06909] Updated weights for policy 0, policy_version 77873 (0.0036) [2024-06-27 21:12:38,850][06674] Fps is (10 sec: 47513.3, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 1275936768. Throughput: 0: 43821.3. Samples: 1178886520. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-27 21:12:38,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:12:40,936][06909] Updated weights for policy 0, policy_version 77883 (0.0031) [2024-06-27 21:12:43,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43963.8, 300 sec: 43764.7). Total num frames: 1276133376. Throughput: 0: 44017.0. Samples: 1179021020. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-27 21:12:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:12:45,062][06909] Updated weights for policy 0, policy_version 77893 (0.0025) [2024-06-27 21:12:48,595][06909] Updated weights for policy 0, policy_version 77903 (0.0034) [2024-06-27 21:12:48,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 1276362752. Throughput: 0: 44010.2. Samples: 1179279140. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-27 21:12:48,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:12:52,645][06909] Updated weights for policy 0, policy_version 77913 (0.0039) [2024-06-27 21:12:53,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 1276592128. Throughput: 0: 43811.6. Samples: 1179539200. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-27 21:12:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:12:56,118][06909] Updated weights for policy 0, policy_version 77923 (0.0030) [2024-06-27 21:12:58,850][06674] Fps is (10 sec: 44236.2, 60 sec: 44509.8, 300 sec: 43875.8). Total num frames: 1276805120. Throughput: 0: 43895.8. Samples: 1179672160. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-27 21:12:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:13:00,218][06909] Updated weights for policy 0, policy_version 77933 (0.0031) [2024-06-27 21:13:03,658][06909] Updated weights for policy 0, policy_version 77943 (0.0046) [2024-06-27 21:13:03,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.7, 300 sec: 43931.6). Total num frames: 1277018112. Throughput: 0: 43777.1. Samples: 1179930380. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 21:13:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:13:07,539][06909] Updated weights for policy 0, policy_version 77953 (0.0028) [2024-06-27 21:13:08,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44509.8, 300 sec: 43931.3). Total num frames: 1277263872. Throughput: 0: 43848.9. Samples: 1180198180. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 21:13:08,852][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:13:10,990][06909] Updated weights for policy 0, policy_version 77963 (0.0032) [2024-06-27 21:13:13,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44236.7, 300 sec: 43820.2). Total num frames: 1277460480. Throughput: 0: 43997.2. Samples: 1180333180. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 21:13:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:13:15,066][06909] Updated weights for policy 0, policy_version 77973 (0.0030) [2024-06-27 21:13:18,363][06909] Updated weights for policy 0, policy_version 77983 (0.0031) [2024-06-27 21:13:18,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 1277689856. Throughput: 0: 44017.8. Samples: 1180599100. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 21:13:18,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:13:20,986][06887] Signal inference workers to stop experience collection... (16800 times) [2024-06-27 21:13:20,987][06887] Signal inference workers to resume experience collection... (16800 times) [2024-06-27 21:13:21,027][06909] InferenceWorker_p0-w0: stopping experience collection (16800 times) [2024-06-27 21:13:21,027][06909] InferenceWorker_p0-w0: resuming experience collection (16800 times) [2024-06-27 21:13:22,370][06909] Updated weights for policy 0, policy_version 77993 (0.0031) [2024-06-27 21:13:23,852][06674] Fps is (10 sec: 45866.1, 60 sec: 43962.3, 300 sec: 43931.0). Total num frames: 1277919232. Throughput: 0: 43846.9. Samples: 1180859720. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 21:13:23,852][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:13:25,895][06909] Updated weights for policy 0, policy_version 78003 (0.0035) [2024-06-27 21:13:28,856][06674] Fps is (10 sec: 42572.9, 60 sec: 44232.3, 300 sec: 43819.4). Total num frames: 1278115840. Throughput: 0: 43738.5. Samples: 1180989520. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 21:13:28,856][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:13:29,914][06909] Updated weights for policy 0, policy_version 78013 (0.0040) [2024-06-27 21:13:33,524][06909] Updated weights for policy 0, policy_version 78023 (0.0039) [2024-06-27 21:13:33,856][06674] Fps is (10 sec: 40943.6, 60 sec: 43686.3, 300 sec: 43874.9). Total num frames: 1278328832. Throughput: 0: 43929.2. Samples: 1181256220. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 21:13:33,856][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:13:37,379][06909] Updated weights for policy 0, policy_version 78033 (0.0033) [2024-06-27 21:13:38,850][06674] Fps is (10 sec: 45902.6, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 1278574592. Throughput: 0: 43927.8. Samples: 1181515960. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 21:13:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:13:41,195][06909] Updated weights for policy 0, policy_version 78043 (0.0044) [2024-06-27 21:13:43,850][06674] Fps is (10 sec: 44264.0, 60 sec: 43963.8, 300 sec: 43820.3). Total num frames: 1278771200. Throughput: 0: 44067.8. Samples: 1181655200. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-27 21:13:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:13:44,865][06909] Updated weights for policy 0, policy_version 78053 (0.0039) [2024-06-27 21:13:48,465][06909] Updated weights for policy 0, policy_version 78063 (0.0032) [2024-06-27 21:13:48,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 1278984192. Throughput: 0: 44207.4. Samples: 1181919720. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 21:13:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:13:48,862][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000078063_1278984192.pth... [2024-06-27 21:13:48,907][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000077422_1268482048.pth [2024-06-27 21:13:52,220][06909] Updated weights for policy 0, policy_version 78073 (0.0036) [2024-06-27 21:13:53,850][06674] Fps is (10 sec: 45874.3, 60 sec: 43963.6, 300 sec: 43931.3). Total num frames: 1279229952. Throughput: 0: 43939.6. Samples: 1182175460. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 21:13:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:13:56,003][06909] Updated weights for policy 0, policy_version 78083 (0.0034) [2024-06-27 21:13:58,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 1279442944. Throughput: 0: 44115.1. Samples: 1182318360. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 21:13:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 21:13:59,448][06909] Updated weights for policy 0, policy_version 78093 (0.0028) [2024-06-27 21:14:03,483][06909] Updated weights for policy 0, policy_version 78103 (0.0030) [2024-06-27 21:14:03,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 1279639552. Throughput: 0: 44083.1. Samples: 1182582840. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 21:14:03,854][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:14:06,952][06909] Updated weights for policy 0, policy_version 78113 (0.0037) [2024-06-27 21:14:08,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 1279885312. Throughput: 0: 44025.5. Samples: 1182840780. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 21:14:08,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 21:14:11,212][06909] Updated weights for policy 0, policy_version 78123 (0.0030) [2024-06-27 21:14:13,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43963.8, 300 sec: 43820.6). Total num frames: 1280098304. Throughput: 0: 44238.0. Samples: 1182979960. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 21:14:13,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:14:14,525][06909] Updated weights for policy 0, policy_version 78133 (0.0030) [2024-06-27 21:14:18,541][06909] Updated weights for policy 0, policy_version 78143 (0.0037) [2024-06-27 21:14:18,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43417.7, 300 sec: 43875.8). Total num frames: 1280294912. Throughput: 0: 43936.2. Samples: 1183233080. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 21:14:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:14:21,846][06909] Updated weights for policy 0, policy_version 78153 (0.0032) [2024-06-27 21:14:23,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43692.1, 300 sec: 43875.8). Total num frames: 1280540672. Throughput: 0: 43990.3. Samples: 1183495520. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 21:14:23,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:14:26,280][06909] Updated weights for policy 0, policy_version 78163 (0.0039) [2024-06-27 21:14:28,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43968.2, 300 sec: 43875.8). Total num frames: 1280753664. Throughput: 0: 43981.2. Samples: 1183634360. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 21:14:28,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:14:29,370][06909] Updated weights for policy 0, policy_version 78173 (0.0022) [2024-06-27 21:14:33,551][06909] Updated weights for policy 0, policy_version 78183 (0.0026) [2024-06-27 21:14:33,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43695.1, 300 sec: 43875.8). Total num frames: 1280950272. Throughput: 0: 43912.5. Samples: 1183895780. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 21:14:33,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:14:36,588][06909] Updated weights for policy 0, policy_version 78193 (0.0034) [2024-06-27 21:14:38,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.8, 300 sec: 43931.6). Total num frames: 1281212416. Throughput: 0: 44246.3. Samples: 1184166540. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 21:14:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:14:40,799][06909] Updated weights for policy 0, policy_version 78203 (0.0032) [2024-06-27 21:14:43,852][06674] Fps is (10 sec: 47503.8, 60 sec: 44235.2, 300 sec: 43875.5). Total num frames: 1281425408. Throughput: 0: 44089.6. Samples: 1184302480. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 21:14:43,852][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:14:44,530][06909] Updated weights for policy 0, policy_version 78213 (0.0041) [2024-06-27 21:14:46,431][06887] Signal inference workers to stop experience collection... (16850 times) [2024-06-27 21:14:46,468][06909] InferenceWorker_p0-w0: stopping experience collection (16850 times) [2024-06-27 21:14:46,539][06887] Signal inference workers to resume experience collection... (16850 times) [2024-06-27 21:14:46,540][06909] InferenceWorker_p0-w0: resuming experience collection (16850 times) [2024-06-27 21:14:48,355][06909] Updated weights for policy 0, policy_version 78223 (0.0021) [2024-06-27 21:14:48,850][06674] Fps is (10 sec: 39321.6, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 1281605632. Throughput: 0: 43704.0. Samples: 1184549520. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 21:14:48,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 21:14:51,779][06909] Updated weights for policy 0, policy_version 78233 (0.0035) [2024-06-27 21:14:53,850][06674] Fps is (10 sec: 42607.4, 60 sec: 43690.8, 300 sec: 43875.8). Total num frames: 1281851392. Throughput: 0: 43922.8. Samples: 1184817300. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 21:14:53,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:14:55,582][06909] Updated weights for policy 0, policy_version 78243 (0.0023) [2024-06-27 21:14:58,850][06674] Fps is (10 sec: 47513.1, 60 sec: 43963.7, 300 sec: 43820.2). Total num frames: 1282080768. Throughput: 0: 43986.0. Samples: 1184959340. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 21:14:58,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:14:59,028][06909] Updated weights for policy 0, policy_version 78253 (0.0022) [2024-06-27 21:15:03,651][06909] Updated weights for policy 0, policy_version 78263 (0.0036) [2024-06-27 21:15:03,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 1282277376. Throughput: 0: 43987.0. Samples: 1185212500. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 21:15:03,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:15:06,412][06909] Updated weights for policy 0, policy_version 78273 (0.0033) [2024-06-27 21:15:08,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1282523136. Throughput: 0: 43990.2. Samples: 1185475080. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 21:15:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:15:10,898][06909] Updated weights for policy 0, policy_version 78283 (0.0032) [2024-06-27 21:15:13,750][06909] Updated weights for policy 0, policy_version 78293 (0.0035) [2024-06-27 21:15:13,852][06674] Fps is (10 sec: 47504.1, 60 sec: 44235.3, 300 sec: 43931.0). Total num frames: 1282752512. Throughput: 0: 44062.0. Samples: 1185617240. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 21:15:13,852][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:15:18,183][06909] Updated weights for policy 0, policy_version 78303 (0.0038) [2024-06-27 21:15:18,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43963.6, 300 sec: 43875.8). Total num frames: 1282932736. Throughput: 0: 43975.5. Samples: 1185874680. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 21:15:18,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 21:15:21,261][06909] Updated weights for policy 0, policy_version 78313 (0.0034) [2024-06-27 21:15:23,850][06674] Fps is (10 sec: 40968.1, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 1283162112. Throughput: 0: 43778.6. Samples: 1186136580. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 21:15:23,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:15:25,487][06909] Updated weights for policy 0, policy_version 78323 (0.0033) [2024-06-27 21:15:28,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.7, 300 sec: 43820.3). Total num frames: 1283391488. Throughput: 0: 43840.1. Samples: 1186275200. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 21:15:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:15:29,076][06909] Updated weights for policy 0, policy_version 78333 (0.0036) [2024-06-27 21:15:32,807][06909] Updated weights for policy 0, policy_version 78343 (0.0033) [2024-06-27 21:15:33,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43690.6, 300 sec: 43820.2). Total num frames: 1283571712. Throughput: 0: 43967.6. Samples: 1186528060. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 21:15:33,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:15:36,381][06909] Updated weights for policy 0, policy_version 78353 (0.0037) [2024-06-27 21:15:38,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 1283833856. Throughput: 0: 43820.7. Samples: 1186789240. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 21:15:38,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:15:40,471][06909] Updated weights for policy 0, policy_version 78363 (0.0037) [2024-06-27 21:15:43,850][06674] Fps is (10 sec: 47513.7, 60 sec: 43692.1, 300 sec: 43820.3). Total num frames: 1284046848. Throughput: 0: 43785.5. Samples: 1186929680. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 21:15:43,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 21:15:43,990][06909] Updated weights for policy 0, policy_version 78373 (0.0036) [2024-06-27 21:15:44,263][06887] Signal inference workers to stop experience collection... (16900 times) [2024-06-27 21:15:44,264][06887] Signal inference workers to resume experience collection... (16900 times) [2024-06-27 21:15:44,313][06909] InferenceWorker_p0-w0: stopping experience collection (16900 times) [2024-06-27 21:15:44,313][06909] InferenceWorker_p0-w0: resuming experience collection (16900 times) [2024-06-27 21:15:47,762][06909] Updated weights for policy 0, policy_version 78383 (0.0029) [2024-06-27 21:15:48,852][06674] Fps is (10 sec: 42589.8, 60 sec: 44235.3, 300 sec: 43875.5). Total num frames: 1284259840. Throughput: 0: 43738.9. Samples: 1187180840. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 21:15:48,852][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:15:48,860][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000078385_1284259840.pth... [2024-06-27 21:15:48,920][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000077742_1273724928.pth [2024-06-27 21:15:51,372][06909] Updated weights for policy 0, policy_version 78393 (0.0037) [2024-06-27 21:15:53,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.6, 300 sec: 43931.4). Total num frames: 1284472832. Throughput: 0: 43732.1. Samples: 1187443020. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 21:15:53,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:15:55,443][06909] Updated weights for policy 0, policy_version 78403 (0.0035) [2024-06-27 21:15:58,850][06674] Fps is (10 sec: 44246.6, 60 sec: 43690.9, 300 sec: 43875.8). Total num frames: 1284702208. Throughput: 0: 43822.6. Samples: 1187589160. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 21:15:58,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:15:58,998][06909] Updated weights for policy 0, policy_version 78413 (0.0036) [2024-06-27 21:16:03,091][06909] Updated weights for policy 0, policy_version 78423 (0.0029) [2024-06-27 21:16:03,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43963.7, 300 sec: 43820.2). Total num frames: 1284915200. Throughput: 0: 43768.8. Samples: 1187844280. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 21:16:03,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:16:06,368][06909] Updated weights for policy 0, policy_version 78433 (0.0047) [2024-06-27 21:16:08,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 1285144576. Throughput: 0: 43517.4. Samples: 1188094860. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 21:16:08,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 21:16:10,644][06909] Updated weights for policy 0, policy_version 78443 (0.0035) [2024-06-27 21:16:13,735][06909] Updated weights for policy 0, policy_version 78453 (0.0027) [2024-06-27 21:16:13,856][06674] Fps is (10 sec: 45848.1, 60 sec: 43687.8, 300 sec: 43874.9). Total num frames: 1285373952. Throughput: 0: 43708.9. Samples: 1188242360. Policy #0 lag: (min: 1.0, avg: 8.9, max: 21.0) [2024-06-27 21:16:13,856][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:16:17,948][06909] Updated weights for policy 0, policy_version 78463 (0.0029) [2024-06-27 21:16:18,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 1285570560. Throughput: 0: 43824.4. Samples: 1188500160. Policy #0 lag: (min: 1.0, avg: 8.9, max: 21.0) [2024-06-27 21:16:18,851][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:16:21,274][06909] Updated weights for policy 0, policy_version 78473 (0.0022) [2024-06-27 21:16:23,850][06674] Fps is (10 sec: 44263.7, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 1285816320. Throughput: 0: 43764.6. Samples: 1188758640. Policy #0 lag: (min: 1.0, avg: 8.9, max: 21.0) [2024-06-27 21:16:23,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 21:16:25,566][06909] Updated weights for policy 0, policy_version 78483 (0.0023) [2024-06-27 21:16:28,797][06909] Updated weights for policy 0, policy_version 78493 (0.0042) [2024-06-27 21:16:28,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 1286029312. Throughput: 0: 43789.3. Samples: 1188900200. Policy #0 lag: (min: 1.0, avg: 8.9, max: 21.0) [2024-06-27 21:16:28,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 21:16:32,841][06909] Updated weights for policy 0, policy_version 78503 (0.0033) [2024-06-27 21:16:33,856][06674] Fps is (10 sec: 40935.7, 60 sec: 44232.5, 300 sec: 43819.4). Total num frames: 1286225920. Throughput: 0: 44002.1. Samples: 1189161100. Policy #0 lag: (min: 1.0, avg: 8.9, max: 21.0) [2024-06-27 21:16:33,856][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:16:36,177][06909] Updated weights for policy 0, policy_version 78513 (0.0028) [2024-06-27 21:16:38,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1286471680. Throughput: 0: 43928.5. Samples: 1189419800. Policy #0 lag: (min: 1.0, avg: 8.9, max: 21.0) [2024-06-27 21:16:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 21:16:40,192][06909] Updated weights for policy 0, policy_version 78523 (0.0026) [2024-06-27 21:16:43,604][06909] Updated weights for policy 0, policy_version 78533 (0.0031) [2024-06-27 21:16:43,850][06674] Fps is (10 sec: 47540.8, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 1286701056. Throughput: 0: 43854.4. Samples: 1189562620. Policy #0 lag: (min: 1.0, avg: 8.9, max: 21.0) [2024-06-27 21:16:43,851][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:16:47,786][06909] Updated weights for policy 0, policy_version 78543 (0.0025) [2024-06-27 21:16:48,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43692.2, 300 sec: 43820.2). Total num frames: 1286881280. Throughput: 0: 43878.3. Samples: 1189818800. Policy #0 lag: (min: 1.0, avg: 8.9, max: 21.0) [2024-06-27 21:16:48,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:16:51,043][06909] Updated weights for policy 0, policy_version 78553 (0.0034) [2024-06-27 21:16:53,850][06674] Fps is (10 sec: 42599.1, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1287127040. Throughput: 0: 44101.8. Samples: 1190079440. Policy #0 lag: (min: 1.0, avg: 8.9, max: 21.0) [2024-06-27 21:16:53,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:16:55,250][06909] Updated weights for policy 0, policy_version 78563 (0.0042) [2024-06-27 21:16:58,620][06909] Updated weights for policy 0, policy_version 78573 (0.0036) [2024-06-27 21:16:58,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.6, 300 sec: 43875.8). Total num frames: 1287340032. Throughput: 0: 43949.4. Samples: 1190219820. Policy #0 lag: (min: 1.0, avg: 8.9, max: 21.0) [2024-06-27 21:16:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:16:58,854][06887] Signal inference workers to stop experience collection... (16950 times) [2024-06-27 21:16:58,856][06887] Signal inference workers to resume experience collection... (16950 times) [2024-06-27 21:16:58,887][06909] InferenceWorker_p0-w0: stopping experience collection (16950 times) [2024-06-27 21:16:58,887][06909] InferenceWorker_p0-w0: resuming experience collection (16950 times) [2024-06-27 21:17:02,489][06909] Updated weights for policy 0, policy_version 78583 (0.0040) [2024-06-27 21:17:03,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 1287536640. Throughput: 0: 43954.2. Samples: 1190478100. Policy #0 lag: (min: 0.0, avg: 12.7, max: 24.0) [2024-06-27 21:17:03,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:17:05,944][06909] Updated weights for policy 0, policy_version 78593 (0.0030) [2024-06-27 21:17:08,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1287782400. Throughput: 0: 44070.1. Samples: 1190741800. Policy #0 lag: (min: 0.0, avg: 12.7, max: 24.0) [2024-06-27 21:17:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:17:10,125][06909] Updated weights for policy 0, policy_version 78603 (0.0031) [2024-06-27 21:17:13,389][06909] Updated weights for policy 0, policy_version 78613 (0.0033) [2024-06-27 21:17:13,850][06674] Fps is (10 sec: 47513.4, 60 sec: 43968.1, 300 sec: 43986.9). Total num frames: 1288011776. Throughput: 0: 44040.8. Samples: 1190882040. Policy #0 lag: (min: 0.0, avg: 12.7, max: 24.0) [2024-06-27 21:17:13,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 21:17:17,336][06909] Updated weights for policy 0, policy_version 78623 (0.0036) [2024-06-27 21:17:18,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 1288192000. Throughput: 0: 43973.3. Samples: 1191139640. Policy #0 lag: (min: 0.0, avg: 12.7, max: 24.0) [2024-06-27 21:17:18,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 21:17:21,045][06909] Updated weights for policy 0, policy_version 78633 (0.0040) [2024-06-27 21:17:23,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1288437760. Throughput: 0: 43929.3. Samples: 1191396620. Policy #0 lag: (min: 0.0, avg: 12.7, max: 24.0) [2024-06-27 21:17:23,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:17:24,983][06909] Updated weights for policy 0, policy_version 78643 (0.0033) [2024-06-27 21:17:28,523][06909] Updated weights for policy 0, policy_version 78653 (0.0045) [2024-06-27 21:17:28,850][06674] Fps is (10 sec: 47514.0, 60 sec: 43963.8, 300 sec: 43931.4). Total num frames: 1288667136. Throughput: 0: 43933.1. Samples: 1191539600. Policy #0 lag: (min: 0.0, avg: 12.7, max: 24.0) [2024-06-27 21:17:28,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:17:32,301][06909] Updated weights for policy 0, policy_version 78663 (0.0029) [2024-06-27 21:17:33,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43695.0, 300 sec: 43764.7). Total num frames: 1288847360. Throughput: 0: 43869.9. Samples: 1191792940. Policy #0 lag: (min: 0.0, avg: 12.7, max: 24.0) [2024-06-27 21:17:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:17:35,826][06909] Updated weights for policy 0, policy_version 78673 (0.0044) [2024-06-27 21:17:38,856][06674] Fps is (10 sec: 42572.4, 60 sec: 43686.2, 300 sec: 43930.4). Total num frames: 1289093120. Throughput: 0: 43828.3. Samples: 1192051980. Policy #0 lag: (min: 0.0, avg: 12.7, max: 24.0) [2024-06-27 21:17:38,857][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:17:40,055][06909] Updated weights for policy 0, policy_version 78683 (0.0033) [2024-06-27 21:17:43,484][06909] Updated weights for policy 0, policy_version 78693 (0.0034) [2024-06-27 21:17:43,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43417.8, 300 sec: 43875.8). Total num frames: 1289306112. Throughput: 0: 43845.5. Samples: 1192192860. Policy #0 lag: (min: 0.0, avg: 12.7, max: 24.0) [2024-06-27 21:17:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:17:47,324][06909] Updated weights for policy 0, policy_version 78703 (0.0038) [2024-06-27 21:17:48,850][06674] Fps is (10 sec: 42623.4, 60 sec: 43963.6, 300 sec: 43820.2). Total num frames: 1289519104. Throughput: 0: 43779.4. Samples: 1192448180. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 21:17:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:17:48,868][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000078706_1289519104.pth... [2024-06-27 21:17:48,941][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000078063_1278984192.pth [2024-06-27 21:17:51,259][06909] Updated weights for policy 0, policy_version 78713 (0.0032) [2024-06-27 21:17:53,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 1289748480. Throughput: 0: 43727.6. Samples: 1192709540. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 21:17:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:17:54,726][06909] Updated weights for policy 0, policy_version 78723 (0.0033) [2024-06-27 21:17:58,654][06909] Updated weights for policy 0, policy_version 78733 (0.0034) [2024-06-27 21:17:58,850][06674] Fps is (10 sec: 44237.8, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 1289961472. Throughput: 0: 43676.6. Samples: 1192847480. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 21:17:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:18:02,337][06909] Updated weights for policy 0, policy_version 78743 (0.0024) [2024-06-27 21:18:03,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.8, 300 sec: 43764.7). Total num frames: 1290174464. Throughput: 0: 43773.0. Samples: 1193109420. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 21:18:03,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 21:18:05,906][06909] Updated weights for policy 0, policy_version 78753 (0.0028) [2024-06-27 21:18:08,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.8, 300 sec: 43931.4). Total num frames: 1290420224. Throughput: 0: 43801.3. Samples: 1193367680. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 21:18:08,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:18:10,310][06909] Updated weights for policy 0, policy_version 78763 (0.0033) [2024-06-27 21:18:13,173][06909] Updated weights for policy 0, policy_version 78773 (0.0029) [2024-06-27 21:18:13,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43417.6, 300 sec: 43820.3). Total num frames: 1290616832. Throughput: 0: 43807.0. Samples: 1193510920. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 21:18:13,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 21:18:17,574][06909] Updated weights for policy 0, policy_version 78783 (0.0034) [2024-06-27 21:18:18,850][06674] Fps is (10 sec: 40958.9, 60 sec: 43963.6, 300 sec: 43765.0). Total num frames: 1290829824. Throughput: 0: 43932.6. Samples: 1193769920. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 21:18:18,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:18:21,004][06909] Updated weights for policy 0, policy_version 78793 (0.0035) [2024-06-27 21:18:23,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43963.7, 300 sec: 43932.2). Total num frames: 1291075584. Throughput: 0: 43850.0. Samples: 1194024960. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 21:18:23,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:18:24,914][06909] Updated weights for policy 0, policy_version 78803 (0.0030) [2024-06-27 21:18:28,630][06909] Updated weights for policy 0, policy_version 78813 (0.0035) [2024-06-27 21:18:28,850][06674] Fps is (10 sec: 44238.1, 60 sec: 43417.6, 300 sec: 43876.7). Total num frames: 1291272192. Throughput: 0: 43733.3. Samples: 1194160860. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 21:18:28,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 21:18:32,311][06909] Updated weights for policy 0, policy_version 78823 (0.0028) [2024-06-27 21:18:33,850][06674] Fps is (10 sec: 42598.2, 60 sec: 44236.7, 300 sec: 43820.3). Total num frames: 1291501568. Throughput: 0: 44032.2. Samples: 1194429620. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 21:18:33,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:18:34,952][06887] Signal inference workers to stop experience collection... (17000 times) [2024-06-27 21:18:34,956][06887] Signal inference workers to resume experience collection... (17000 times) [2024-06-27 21:18:35,004][06909] InferenceWorker_p0-w0: stopping experience collection (17000 times) [2024-06-27 21:18:35,004][06909] InferenceWorker_p0-w0: resuming experience collection (17000 times) [2024-06-27 21:18:35,813][06909] Updated weights for policy 0, policy_version 78833 (0.0032) [2024-06-27 21:18:38,850][06674] Fps is (10 sec: 47513.1, 60 sec: 44241.2, 300 sec: 43986.9). Total num frames: 1291747328. Throughput: 0: 44022.6. Samples: 1194690560. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 21:18:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:18:39,636][06909] Updated weights for policy 0, policy_version 78843 (0.0034) [2024-06-27 21:18:43,381][06909] Updated weights for policy 0, policy_version 78853 (0.0030) [2024-06-27 21:18:43,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 1291960320. Throughput: 0: 44027.0. Samples: 1194828700. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 21:18:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:18:47,275][06909] Updated weights for policy 0, policy_version 78863 (0.0041) [2024-06-27 21:18:48,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43963.8, 300 sec: 43820.3). Total num frames: 1292156928. Throughput: 0: 43807.0. Samples: 1195080740. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 21:18:48,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 21:18:50,967][06909] Updated weights for policy 0, policy_version 78873 (0.0031) [2024-06-27 21:18:53,850][06674] Fps is (10 sec: 42599.2, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 1292386304. Throughput: 0: 43803.2. Samples: 1195338820. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 21:18:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:18:54,813][06909] Updated weights for policy 0, policy_version 78883 (0.0029) [2024-06-27 21:18:58,591][06909] Updated weights for policy 0, policy_version 78893 (0.0033) [2024-06-27 21:18:58,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 1292582912. Throughput: 0: 43643.2. Samples: 1195474860. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 21:18:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:19:02,008][06909] Updated weights for policy 0, policy_version 78903 (0.0033) [2024-06-27 21:19:03,850][06674] Fps is (10 sec: 44236.1, 60 sec: 44236.7, 300 sec: 43875.8). Total num frames: 1292828672. Throughput: 0: 43903.3. Samples: 1195745560. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 21:19:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:19:06,159][06909] Updated weights for policy 0, policy_version 78913 (0.0029) [2024-06-27 21:19:08,850][06674] Fps is (10 sec: 45874.1, 60 sec: 43690.5, 300 sec: 43875.8). Total num frames: 1293041664. Throughput: 0: 43963.8. Samples: 1196003340. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 21:19:08,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:19:09,817][06909] Updated weights for policy 0, policy_version 78923 (0.0026) [2024-06-27 21:19:13,502][06909] Updated weights for policy 0, policy_version 78933 (0.0026) [2024-06-27 21:19:13,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 1293254656. Throughput: 0: 43852.0. Samples: 1196134200. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 21:19:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:19:17,201][06909] Updated weights for policy 0, policy_version 78943 (0.0027) [2024-06-27 21:19:18,850][06674] Fps is (10 sec: 44237.6, 60 sec: 44237.0, 300 sec: 43875.8). Total num frames: 1293484032. Throughput: 0: 43747.6. Samples: 1196398260. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 21:19:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:19:21,143][06909] Updated weights for policy 0, policy_version 78953 (0.0044) [2024-06-27 21:19:23,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 1293697024. Throughput: 0: 43754.3. Samples: 1196659500. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 21:19:23,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:19:24,530][06909] Updated weights for policy 0, policy_version 78963 (0.0036) [2024-06-27 21:19:28,436][06909] Updated weights for policy 0, policy_version 78973 (0.0040) [2024-06-27 21:19:28,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.6, 300 sec: 43931.3). Total num frames: 1293910016. Throughput: 0: 43563.1. Samples: 1196789040. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 21:19:28,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:19:32,445][06909] Updated weights for policy 0, policy_version 78983 (0.0032) [2024-06-27 21:19:33,850][06674] Fps is (10 sec: 44236.0, 60 sec: 43963.6, 300 sec: 43820.2). Total num frames: 1294139392. Throughput: 0: 43887.9. Samples: 1197055700. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 21:19:33,851][06674] Avg episode reward: [(0, '0.451')] [2024-06-27 21:19:35,706][06909] Updated weights for policy 0, policy_version 78993 (0.0024) [2024-06-27 21:19:38,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43417.7, 300 sec: 43820.6). Total num frames: 1294352384. Throughput: 0: 43918.2. Samples: 1197315140. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 21:19:38,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:19:39,534][06909] Updated weights for policy 0, policy_version 79003 (0.0035) [2024-06-27 21:19:43,171][06909] Updated weights for policy 0, policy_version 79013 (0.0038) [2024-06-27 21:19:43,850][06674] Fps is (10 sec: 40960.6, 60 sec: 43144.6, 300 sec: 43875.8). Total num frames: 1294548992. Throughput: 0: 43750.6. Samples: 1197443640. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 21:19:43,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:19:47,444][06909] Updated weights for policy 0, policy_version 79023 (0.0027) [2024-06-27 21:19:48,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 1294811136. Throughput: 0: 43821.8. Samples: 1197717540. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 21:19:48,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 21:19:48,875][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000079029_1294811136.pth... [2024-06-27 21:19:48,939][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000078385_1284259840.pth [2024-06-27 21:19:50,781][06909] Updated weights for policy 0, policy_version 79033 (0.0025) [2024-06-27 21:19:52,792][06887] Signal inference workers to stop experience collection... (17050 times) [2024-06-27 21:19:52,819][06909] InferenceWorker_p0-w0: stopping experience collection (17050 times) [2024-06-27 21:19:52,858][06887] Signal inference workers to resume experience collection... (17050 times) [2024-06-27 21:19:52,858][06909] InferenceWorker_p0-w0: resuming experience collection (17050 times) [2024-06-27 21:19:53,850][06674] Fps is (10 sec: 47513.2, 60 sec: 43963.6, 300 sec: 43875.8). Total num frames: 1295024128. Throughput: 0: 43753.4. Samples: 1197972240. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 21:19:53,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 21:19:54,742][06909] Updated weights for policy 0, policy_version 79043 (0.0036) [2024-06-27 21:19:58,483][06909] Updated weights for policy 0, policy_version 79053 (0.0037) [2024-06-27 21:19:58,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 1295220736. Throughput: 0: 43726.7. Samples: 1198101900. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 21:19:58,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 21:20:02,047][06909] Updated weights for policy 0, policy_version 79063 (0.0031) [2024-06-27 21:20:03,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 1295450112. Throughput: 0: 43932.4. Samples: 1198375220. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 21:20:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:20:05,656][06909] Updated weights for policy 0, policy_version 79073 (0.0039) [2024-06-27 21:20:08,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.8, 300 sec: 43765.0). Total num frames: 1295663104. Throughput: 0: 43827.1. Samples: 1198631720. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 21:20:08,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:20:09,820][06909] Updated weights for policy 0, policy_version 79083 (0.0025) [2024-06-27 21:20:13,033][06909] Updated weights for policy 0, policy_version 79093 (0.0037) [2024-06-27 21:20:13,852][06674] Fps is (10 sec: 40951.7, 60 sec: 43416.1, 300 sec: 43820.0). Total num frames: 1295859712. Throughput: 0: 43755.9. Samples: 1198758140. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 21:20:13,852][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:20:17,179][06909] Updated weights for policy 0, policy_version 79103 (0.0028) [2024-06-27 21:20:18,850][06674] Fps is (10 sec: 45874.2, 60 sec: 43963.6, 300 sec: 43931.3). Total num frames: 1296121856. Throughput: 0: 43860.8. Samples: 1199029440. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 21:20:18,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:20:20,743][06909] Updated weights for policy 0, policy_version 79113 (0.0020) [2024-06-27 21:20:23,850][06674] Fps is (10 sec: 47522.9, 60 sec: 43963.6, 300 sec: 43875.8). Total num frames: 1296334848. Throughput: 0: 43859.9. Samples: 1199288840. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 21:20:23,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 21:20:24,831][06909] Updated weights for policy 0, policy_version 79123 (0.0034) [2024-06-27 21:20:28,022][06909] Updated weights for policy 0, policy_version 79133 (0.0028) [2024-06-27 21:20:28,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 1296531456. Throughput: 0: 43908.8. Samples: 1199419540. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 21:20:28,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:20:32,107][06909] Updated weights for policy 0, policy_version 79143 (0.0024) [2024-06-27 21:20:33,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 1296793600. Throughput: 0: 43847.9. Samples: 1199690700. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 21:20:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:20:35,886][06909] Updated weights for policy 0, policy_version 79153 (0.0029) [2024-06-27 21:20:38,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.6, 300 sec: 43875.8). Total num frames: 1296990208. Throughput: 0: 43913.7. Samples: 1199948360. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 21:20:38,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:20:39,326][06909] Updated weights for policy 0, policy_version 79163 (0.0031) [2024-06-27 21:20:43,132][06909] Updated weights for policy 0, policy_version 79173 (0.0031) [2024-06-27 21:20:43,850][06674] Fps is (10 sec: 40960.5, 60 sec: 44236.8, 300 sec: 43876.1). Total num frames: 1297203200. Throughput: 0: 44050.6. Samples: 1200084180. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 21:20:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:20:47,000][06909] Updated weights for policy 0, policy_version 79183 (0.0034) [2024-06-27 21:20:48,850][06674] Fps is (10 sec: 45876.0, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1297448960. Throughput: 0: 43898.3. Samples: 1200350640. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 21:20:48,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:20:50,308][06909] Updated weights for policy 0, policy_version 79193 (0.0038) [2024-06-27 21:20:53,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 1297645568. Throughput: 0: 43982.2. Samples: 1200610920. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 21:20:53,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:20:54,406][06909] Updated weights for policy 0, policy_version 79203 (0.0042) [2024-06-27 21:20:57,646][06909] Updated weights for policy 0, policy_version 79213 (0.0025) [2024-06-27 21:20:58,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 1297858560. Throughput: 0: 44034.5. Samples: 1200739600. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 21:20:58,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:21:01,968][06909] Updated weights for policy 0, policy_version 79223 (0.0034) [2024-06-27 21:21:03,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 1298104320. Throughput: 0: 44014.8. Samples: 1201010100. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2024-06-27 21:21:03,853][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:21:05,388][06909] Updated weights for policy 0, policy_version 79233 (0.0036) [2024-06-27 21:21:06,792][06887] Signal inference workers to stop experience collection... (17100 times) [2024-06-27 21:21:06,792][06887] Signal inference workers to resume experience collection... (17100 times) [2024-06-27 21:21:06,820][06909] InferenceWorker_p0-w0: stopping experience collection (17100 times) [2024-06-27 21:21:06,820][06909] InferenceWorker_p0-w0: resuming experience collection (17100 times) [2024-06-27 21:21:08,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.7, 300 sec: 43821.2). Total num frames: 1298300928. Throughput: 0: 43970.7. Samples: 1201267520. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2024-06-27 21:21:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:21:09,168][06909] Updated weights for policy 0, policy_version 79243 (0.0031) [2024-06-27 21:21:12,961][06909] Updated weights for policy 0, policy_version 79253 (0.0035) [2024-06-27 21:21:13,850][06674] Fps is (10 sec: 40960.0, 60 sec: 44238.3, 300 sec: 43875.8). Total num frames: 1298513920. Throughput: 0: 44017.3. Samples: 1201400320. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2024-06-27 21:21:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:21:16,474][06909] Updated weights for policy 0, policy_version 79263 (0.0034) [2024-06-27 21:21:18,850][06674] Fps is (10 sec: 45874.4, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 1298759680. Throughput: 0: 43973.2. Samples: 1201669500. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2024-06-27 21:21:18,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:21:20,194][06909] Updated weights for policy 0, policy_version 79273 (0.0037) [2024-06-27 21:21:23,823][06909] Updated weights for policy 0, policy_version 79283 (0.0033) [2024-06-27 21:21:23,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 1298972672. Throughput: 0: 44072.2. Samples: 1201931600. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2024-06-27 21:21:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:21:27,500][06909] Updated weights for policy 0, policy_version 79293 (0.0037) [2024-06-27 21:21:28,850][06674] Fps is (10 sec: 40961.0, 60 sec: 43963.8, 300 sec: 43876.7). Total num frames: 1299169280. Throughput: 0: 43965.4. Samples: 1202062620. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2024-06-27 21:21:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:21:31,690][06909] Updated weights for policy 0, policy_version 79303 (0.0027) [2024-06-27 21:21:33,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 1299415040. Throughput: 0: 43919.0. Samples: 1202327000. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2024-06-27 21:21:33,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:21:34,755][06909] Updated weights for policy 0, policy_version 79313 (0.0026) [2024-06-27 21:21:38,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 1299611648. Throughput: 0: 43869.7. Samples: 1202585060. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2024-06-27 21:21:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:21:39,045][06909] Updated weights for policy 0, policy_version 79323 (0.0037) [2024-06-27 21:21:42,587][06909] Updated weights for policy 0, policy_version 79333 (0.0027) [2024-06-27 21:21:43,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 1299824640. Throughput: 0: 43942.6. Samples: 1202717020. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2024-06-27 21:21:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:21:46,294][06909] Updated weights for policy 0, policy_version 79343 (0.0025) [2024-06-27 21:21:48,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 1300070400. Throughput: 0: 43802.2. Samples: 1202981200. Policy #0 lag: (min: 0.0, avg: 7.8, max: 19.0) [2024-06-27 21:21:48,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:21:48,862][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000079350_1300070400.pth... [2024-06-27 21:21:48,924][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000078706_1289519104.pth [2024-06-27 21:21:50,262][06909] Updated weights for policy 0, policy_version 79353 (0.0038) [2024-06-27 21:21:53,582][06909] Updated weights for policy 0, policy_version 79363 (0.0026) [2024-06-27 21:21:53,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 1300283392. Throughput: 0: 44044.1. Samples: 1203249500. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 21:21:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:21:57,609][06909] Updated weights for policy 0, policy_version 79373 (0.0028) [2024-06-27 21:21:58,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 1300496384. Throughput: 0: 43964.1. Samples: 1203378700. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 21:21:58,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 21:22:01,088][06909] Updated weights for policy 0, policy_version 79383 (0.0056) [2024-06-27 21:22:03,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 1300725760. Throughput: 0: 43733.5. Samples: 1203637500. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 21:22:03,851][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:22:05,122][06909] Updated weights for policy 0, policy_version 79393 (0.0041) [2024-06-27 21:22:08,714][06909] Updated weights for policy 0, policy_version 79403 (0.0039) [2024-06-27 21:22:08,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.7, 300 sec: 43820.3). Total num frames: 1300938752. Throughput: 0: 43955.5. Samples: 1203909600. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 21:22:08,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:22:12,672][06909] Updated weights for policy 0, policy_version 79413 (0.0024) [2024-06-27 21:22:13,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 1301151744. Throughput: 0: 43858.1. Samples: 1204036240. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 21:22:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:22:16,328][06909] Updated weights for policy 0, policy_version 79423 (0.0034) [2024-06-27 21:22:18,856][06674] Fps is (10 sec: 45847.9, 60 sec: 43959.5, 300 sec: 43930.4). Total num frames: 1301397504. Throughput: 0: 43671.6. Samples: 1204292480. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 21:22:18,856][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:22:19,988][06909] Updated weights for policy 0, policy_version 79433 (0.0047) [2024-06-27 21:22:23,735][06909] Updated weights for policy 0, policy_version 79443 (0.0037) [2024-06-27 21:22:23,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.6, 300 sec: 43820.2). Total num frames: 1301594112. Throughput: 0: 43936.8. Samples: 1204562220. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 21:22:23,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 21:22:27,794][06909] Updated weights for policy 0, policy_version 79453 (0.0030) [2024-06-27 21:22:28,850][06674] Fps is (10 sec: 42624.3, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1301823488. Throughput: 0: 43916.5. Samples: 1204693260. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 21:22:28,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 21:22:31,070][06909] Updated weights for policy 0, policy_version 79463 (0.0034) [2024-06-27 21:22:33,850][06674] Fps is (10 sec: 45875.9, 60 sec: 43963.8, 300 sec: 43932.2). Total num frames: 1302052864. Throughput: 0: 43885.9. Samples: 1204956060. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 21:22:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:22:35,062][06909] Updated weights for policy 0, policy_version 79473 (0.0031) [2024-06-27 21:22:38,134][06887] Signal inference workers to stop experience collection... (17150 times) [2024-06-27 21:22:38,183][06909] InferenceWorker_p0-w0: stopping experience collection (17150 times) [2024-06-27 21:22:38,247][06887] Signal inference workers to resume experience collection... (17150 times) [2024-06-27 21:22:38,247][06909] InferenceWorker_p0-w0: resuming experience collection (17150 times) [2024-06-27 21:22:38,576][06909] Updated weights for policy 0, policy_version 79483 (0.0034) [2024-06-27 21:22:38,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 1302249472. Throughput: 0: 43872.0. Samples: 1205223740. Policy #0 lag: (min: 1.0, avg: 10.7, max: 22.0) [2024-06-27 21:22:38,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:22:42,312][06909] Updated weights for policy 0, policy_version 79493 (0.0033) [2024-06-27 21:22:43,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44236.8, 300 sec: 43931.4). Total num frames: 1302478848. Throughput: 0: 43744.9. Samples: 1205347220. Policy #0 lag: (min: 1.0, avg: 10.7, max: 22.0) [2024-06-27 21:22:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 21:22:46,367][06909] Updated weights for policy 0, policy_version 79503 (0.0033) [2024-06-27 21:22:48,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 1302708224. Throughput: 0: 43784.5. Samples: 1205607800. Policy #0 lag: (min: 1.0, avg: 10.7, max: 22.0) [2024-06-27 21:22:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:22:49,602][06909] Updated weights for policy 0, policy_version 79513 (0.0042) [2024-06-27 21:22:53,826][06909] Updated weights for policy 0, policy_version 79523 (0.0026) [2024-06-27 21:22:53,851][06674] Fps is (10 sec: 42592.8, 60 sec: 43689.7, 300 sec: 43875.6). Total num frames: 1302904832. Throughput: 0: 43846.4. Samples: 1205882740. Policy #0 lag: (min: 1.0, avg: 10.7, max: 22.0) [2024-06-27 21:22:53,852][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 21:22:56,849][06909] Updated weights for policy 0, policy_version 79533 (0.0028) [2024-06-27 21:22:58,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 1303134208. Throughput: 0: 43669.4. Samples: 1206001360. Policy #0 lag: (min: 1.0, avg: 10.7, max: 22.0) [2024-06-27 21:22:58,854][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:23:01,352][06909] Updated weights for policy 0, policy_version 79543 (0.0031) [2024-06-27 21:23:03,850][06674] Fps is (10 sec: 45880.9, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 1303363584. Throughput: 0: 43868.0. Samples: 1206266280. Policy #0 lag: (min: 1.0, avg: 10.7, max: 22.0) [2024-06-27 21:23:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:23:04,762][06909] Updated weights for policy 0, policy_version 79553 (0.0030) [2024-06-27 21:23:08,653][06909] Updated weights for policy 0, policy_version 79563 (0.0035) [2024-06-27 21:23:08,856][06674] Fps is (10 sec: 42572.6, 60 sec: 43686.3, 300 sec: 43874.9). Total num frames: 1303560192. Throughput: 0: 43877.7. Samples: 1206536980. Policy #0 lag: (min: 1.0, avg: 10.7, max: 22.0) [2024-06-27 21:23:08,857][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 21:23:12,607][06909] Updated weights for policy 0, policy_version 79573 (0.0036) [2024-06-27 21:23:13,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 43931.4). Total num frames: 1303789568. Throughput: 0: 43799.0. Samples: 1206664220. Policy #0 lag: (min: 1.0, avg: 10.7, max: 22.0) [2024-06-27 21:23:13,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 21:23:16,097][06909] Updated weights for policy 0, policy_version 79583 (0.0032) [2024-06-27 21:23:18,850][06674] Fps is (10 sec: 47542.1, 60 sec: 43968.1, 300 sec: 43931.3). Total num frames: 1304035328. Throughput: 0: 43884.3. Samples: 1206930860. Policy #0 lag: (min: 1.0, avg: 10.7, max: 22.0) [2024-06-27 21:23:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:23:19,637][06909] Updated weights for policy 0, policy_version 79593 (0.0026) [2024-06-27 21:23:23,473][06909] Updated weights for policy 0, policy_version 79603 (0.0021) [2024-06-27 21:23:23,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 1304231936. Throughput: 0: 43958.9. Samples: 1207201900. Policy #0 lag: (min: 1.0, avg: 10.7, max: 22.0) [2024-06-27 21:23:23,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:23:26,849][06909] Updated weights for policy 0, policy_version 79613 (0.0033) [2024-06-27 21:23:28,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.5, 300 sec: 43875.8). Total num frames: 1304444928. Throughput: 0: 43973.7. Samples: 1207326040. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 21:23:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:23:31,150][06909] Updated weights for policy 0, policy_version 79623 (0.0036) [2024-06-27 21:23:33,850][06674] Fps is (10 sec: 45875.8, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 1304690688. Throughput: 0: 43943.1. Samples: 1207585240. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 21:23:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 21:23:34,142][06909] Updated weights for policy 0, policy_version 79633 (0.0024) [2024-06-27 21:23:38,446][06909] Updated weights for policy 0, policy_version 79643 (0.0030) [2024-06-27 21:23:38,850][06674] Fps is (10 sec: 42599.3, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 1304870912. Throughput: 0: 43867.6. Samples: 1207856720. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 21:23:38,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:23:41,412][06909] Updated weights for policy 0, policy_version 79653 (0.0031) [2024-06-27 21:23:43,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 1305100288. Throughput: 0: 43959.2. Samples: 1207979520. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 21:23:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:23:46,039][06909] Updated weights for policy 0, policy_version 79663 (0.0026) [2024-06-27 21:23:48,850][06674] Fps is (10 sec: 47513.0, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 1305346048. Throughput: 0: 44073.8. Samples: 1208249600. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 21:23:48,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:23:48,862][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000079672_1305346048.pth... [2024-06-27 21:23:48,933][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000079029_1294811136.pth [2024-06-27 21:23:49,082][06909] Updated weights for policy 0, policy_version 79673 (0.0020) [2024-06-27 21:23:52,617][06887] Signal inference workers to stop experience collection... (17200 times) [2024-06-27 21:23:52,623][06887] Signal inference workers to resume experience collection... (17200 times) [2024-06-27 21:23:52,663][06909] InferenceWorker_p0-w0: stopping experience collection (17200 times) [2024-06-27 21:23:52,663][06909] InferenceWorker_p0-w0: resuming experience collection (17200 times) [2024-06-27 21:23:53,318][06909] Updated weights for policy 0, policy_version 79683 (0.0037) [2024-06-27 21:23:53,850][06674] Fps is (10 sec: 45874.6, 60 sec: 44237.7, 300 sec: 43986.9). Total num frames: 1305559040. Throughput: 0: 44123.7. Samples: 1208522280. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 21:23:53,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:23:56,265][06909] Updated weights for policy 0, policy_version 79693 (0.0027) [2024-06-27 21:23:58,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43690.6, 300 sec: 43820.3). Total num frames: 1305755648. Throughput: 0: 44082.2. Samples: 1208647920. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 21:23:58,852][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:24:00,738][06909] Updated weights for policy 0, policy_version 79703 (0.0024) [2024-06-27 21:24:03,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.8, 300 sec: 43931.4). Total num frames: 1306001408. Throughput: 0: 44027.2. Samples: 1208912080. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 21:24:03,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:24:03,955][06909] Updated weights for policy 0, policy_version 79713 (0.0028) [2024-06-27 21:24:08,282][06909] Updated weights for policy 0, policy_version 79723 (0.0042) [2024-06-27 21:24:08,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44241.3, 300 sec: 43931.3). Total num frames: 1306214400. Throughput: 0: 43989.9. Samples: 1209181440. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 21:24:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:24:11,325][06909] Updated weights for policy 0, policy_version 79733 (0.0029) [2024-06-27 21:24:13,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 1306427392. Throughput: 0: 44002.3. Samples: 1209306140. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 21:24:13,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:24:15,802][06909] Updated weights for policy 0, policy_version 79743 (0.0032) [2024-06-27 21:24:18,781][06909] Updated weights for policy 0, policy_version 79753 (0.0037) [2024-06-27 21:24:18,850][06674] Fps is (10 sec: 45874.3, 60 sec: 43963.7, 300 sec: 43986.8). Total num frames: 1306673152. Throughput: 0: 44200.3. Samples: 1209574260. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2024-06-27 21:24:18,856][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:24:23,247][06909] Updated weights for policy 0, policy_version 79763 (0.0036) [2024-06-27 21:24:23,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.9, 300 sec: 43931.3). Total num frames: 1306869760. Throughput: 0: 44009.7. Samples: 1209837160. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2024-06-27 21:24:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:24:26,338][06909] Updated weights for policy 0, policy_version 79773 (0.0028) [2024-06-27 21:24:28,850][06674] Fps is (10 sec: 39322.3, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 1307066368. Throughput: 0: 44134.1. Samples: 1209965560. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2024-06-27 21:24:28,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:24:30,518][06909] Updated weights for policy 0, policy_version 79783 (0.0035) [2024-06-27 21:24:33,517][06909] Updated weights for policy 0, policy_version 79793 (0.0037) [2024-06-27 21:24:33,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1307328512. Throughput: 0: 44041.3. Samples: 1210231460. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2024-06-27 21:24:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:24:38,280][06909] Updated weights for policy 0, policy_version 79803 (0.0027) [2024-06-27 21:24:38,850][06674] Fps is (10 sec: 47513.3, 60 sec: 44509.7, 300 sec: 44042.4). Total num frames: 1307541504. Throughput: 0: 43932.0. Samples: 1210499220. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2024-06-27 21:24:38,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:24:41,249][06909] Updated weights for policy 0, policy_version 79813 (0.0027) [2024-06-27 21:24:43,850][06674] Fps is (10 sec: 40960.6, 60 sec: 43963.8, 300 sec: 43820.3). Total num frames: 1307738112. Throughput: 0: 43756.6. Samples: 1210616960. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2024-06-27 21:24:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:24:45,704][06909] Updated weights for policy 0, policy_version 79823 (0.0021) [2024-06-27 21:24:48,802][06909] Updated weights for policy 0, policy_version 79833 (0.0034) [2024-06-27 21:24:48,856][06674] Fps is (10 sec: 44210.1, 60 sec: 43959.3, 300 sec: 43930.4). Total num frames: 1307983872. Throughput: 0: 43755.4. Samples: 1210881340. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2024-06-27 21:24:48,857][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:24:53,113][06909] Updated weights for policy 0, policy_version 79843 (0.0029) [2024-06-27 21:24:53,852][06674] Fps is (10 sec: 44227.3, 60 sec: 43689.2, 300 sec: 43931.0). Total num frames: 1308180480. Throughput: 0: 43748.3. Samples: 1211150200. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2024-06-27 21:24:53,852][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:24:56,327][06909] Updated weights for policy 0, policy_version 79853 (0.0037) [2024-06-27 21:24:58,850][06674] Fps is (10 sec: 39345.4, 60 sec: 43690.7, 300 sec: 43820.2). Total num frames: 1308377088. Throughput: 0: 43809.2. Samples: 1211277560. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2024-06-27 21:24:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 21:25:00,497][06909] Updated weights for policy 0, policy_version 79863 (0.0028) [2024-06-27 21:25:03,714][06909] Updated weights for policy 0, policy_version 79873 (0.0033) [2024-06-27 21:25:03,850][06674] Fps is (10 sec: 45884.7, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1308639232. Throughput: 0: 43731.8. Samples: 1211542180. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2024-06-27 21:25:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:25:07,997][06909] Updated weights for policy 0, policy_version 79883 (0.0031) [2024-06-27 21:25:08,345][06887] Signal inference workers to stop experience collection... (17250 times) [2024-06-27 21:25:08,345][06887] Signal inference workers to resume experience collection... (17250 times) [2024-06-27 21:25:08,387][06909] InferenceWorker_p0-w0: stopping experience collection (17250 times) [2024-06-27 21:25:08,392][06909] InferenceWorker_p0-w0: resuming experience collection (17250 times) [2024-06-27 21:25:08,850][06674] Fps is (10 sec: 47513.9, 60 sec: 43963.7, 300 sec: 44042.7). Total num frames: 1308852224. Throughput: 0: 43820.4. Samples: 1211809080. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-27 21:25:08,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 21:25:11,140][06909] Updated weights for policy 0, policy_version 79893 (0.0043) [2024-06-27 21:25:13,854][06674] Fps is (10 sec: 40944.7, 60 sec: 43688.0, 300 sec: 43819.7). Total num frames: 1309048832. Throughput: 0: 43824.0. Samples: 1211937800. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-27 21:25:13,854][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:25:15,454][06909] Updated weights for policy 0, policy_version 79903 (0.0037) [2024-06-27 21:25:18,739][06909] Updated weights for policy 0, policy_version 79913 (0.0028) [2024-06-27 21:25:18,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.8, 300 sec: 43931.3). Total num frames: 1309294592. Throughput: 0: 43848.0. Samples: 1212204620. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-27 21:25:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:25:23,040][06909] Updated weights for policy 0, policy_version 79923 (0.0041) [2024-06-27 21:25:23,850][06674] Fps is (10 sec: 44253.4, 60 sec: 43690.7, 300 sec: 43931.4). Total num frames: 1309491200. Throughput: 0: 43827.3. Samples: 1212471440. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-27 21:25:23,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 21:25:26,180][06909] Updated weights for policy 0, policy_version 79933 (0.0043) [2024-06-27 21:25:28,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43963.8, 300 sec: 43764.7). Total num frames: 1309704192. Throughput: 0: 44035.9. Samples: 1212598580. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-27 21:25:28,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:25:30,552][06909] Updated weights for policy 0, policy_version 79943 (0.0036) [2024-06-27 21:25:33,584][06909] Updated weights for policy 0, policy_version 79953 (0.0036) [2024-06-27 21:25:33,853][06674] Fps is (10 sec: 45861.0, 60 sec: 43688.5, 300 sec: 43930.9). Total num frames: 1309949952. Throughput: 0: 43966.6. Samples: 1212859700. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-27 21:25:33,854][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:25:37,914][06909] Updated weights for policy 0, policy_version 79963 (0.0029) [2024-06-27 21:25:38,850][06674] Fps is (10 sec: 47512.8, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1310179328. Throughput: 0: 44034.8. Samples: 1213131680. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-27 21:25:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:25:41,353][06909] Updated weights for policy 0, policy_version 79973 (0.0046) [2024-06-27 21:25:43,850][06674] Fps is (10 sec: 40971.6, 60 sec: 43690.4, 300 sec: 43764.7). Total num frames: 1310359552. Throughput: 0: 44033.2. Samples: 1213259060. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-27 21:25:43,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:25:45,279][06909] Updated weights for policy 0, policy_version 79983 (0.0043) [2024-06-27 21:25:48,758][06909] Updated weights for policy 0, policy_version 79993 (0.0030) [2024-06-27 21:25:48,852][06674] Fps is (10 sec: 42590.0, 60 sec: 43693.6, 300 sec: 43931.0). Total num frames: 1310605312. Throughput: 0: 44014.8. Samples: 1213522940. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-27 21:25:48,852][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:25:48,981][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000079994_1310621696.pth... [2024-06-27 21:25:49,049][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000079350_1300070400.pth [2024-06-27 21:25:52,676][06909] Updated weights for policy 0, policy_version 80003 (0.0039) [2024-06-27 21:25:53,850][06674] Fps is (10 sec: 45876.5, 60 sec: 43965.3, 300 sec: 43931.3). Total num frames: 1310818304. Throughput: 0: 43981.9. Samples: 1213788260. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 21:25:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:25:56,065][06909] Updated weights for policy 0, policy_version 80013 (0.0052) [2024-06-27 21:25:58,850][06674] Fps is (10 sec: 42607.7, 60 sec: 44236.9, 300 sec: 43820.3). Total num frames: 1311031296. Throughput: 0: 43966.4. Samples: 1213916120. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 21:25:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:26:00,255][06909] Updated weights for policy 0, policy_version 80023 (0.0026) [2024-06-27 21:26:03,730][06909] Updated weights for policy 0, policy_version 80033 (0.0043) [2024-06-27 21:26:03,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 1311260672. Throughput: 0: 43816.0. Samples: 1214176340. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 21:26:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:26:07,881][06909] Updated weights for policy 0, policy_version 80043 (0.0037) [2024-06-27 21:26:08,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1311490048. Throughput: 0: 43918.6. Samples: 1214447780. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 21:26:08,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:26:10,909][06909] Updated weights for policy 0, policy_version 80053 (0.0028) [2024-06-27 21:26:13,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43966.4, 300 sec: 43820.3). Total num frames: 1311686656. Throughput: 0: 43859.4. Samples: 1214572260. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 21:26:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:26:15,196][06909] Updated weights for policy 0, policy_version 80063 (0.0038) [2024-06-27 21:26:18,419][06909] Updated weights for policy 0, policy_version 80073 (0.0031) [2024-06-27 21:26:18,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 1311916032. Throughput: 0: 43974.6. Samples: 1214838420. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 21:26:18,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:26:22,479][06909] Updated weights for policy 0, policy_version 80083 (0.0043) [2024-06-27 21:26:23,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1312145408. Throughput: 0: 43843.7. Samples: 1215104640. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 21:26:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:26:26,149][06909] Updated weights for policy 0, policy_version 80093 (0.0039) [2024-06-27 21:26:28,850][06674] Fps is (10 sec: 44235.9, 60 sec: 44236.6, 300 sec: 43875.8). Total num frames: 1312358400. Throughput: 0: 43925.4. Samples: 1215235700. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 21:26:28,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:26:29,984][06909] Updated weights for policy 0, policy_version 80103 (0.0044) [2024-06-27 21:26:33,434][06909] Updated weights for policy 0, policy_version 80113 (0.0031) [2024-06-27 21:26:33,850][06674] Fps is (10 sec: 45874.6, 60 sec: 44239.0, 300 sec: 44042.4). Total num frames: 1312604160. Throughput: 0: 43911.7. Samples: 1215498880. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 21:26:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:26:37,549][06909] Updated weights for policy 0, policy_version 80123 (0.0040) [2024-06-27 21:26:38,850][06674] Fps is (10 sec: 42599.2, 60 sec: 43417.7, 300 sec: 43931.3). Total num frames: 1312784384. Throughput: 0: 43831.9. Samples: 1215760700. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 21:26:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:26:38,976][06887] Signal inference workers to stop experience collection... (17300 times) [2024-06-27 21:26:38,977][06887] Signal inference workers to resume experience collection... (17300 times) [2024-06-27 21:26:39,003][06909] InferenceWorker_p0-w0: stopping experience collection (17300 times) [2024-06-27 21:26:39,003][06909] InferenceWorker_p0-w0: resuming experience collection (17300 times) [2024-06-27 21:26:40,709][06909] Updated weights for policy 0, policy_version 80133 (0.0029) [2024-06-27 21:26:43,850][06674] Fps is (10 sec: 40960.4, 60 sec: 44237.0, 300 sec: 43875.8). Total num frames: 1313013760. Throughput: 0: 43801.3. Samples: 1215887180. Policy #0 lag: (min: 0.0, avg: 11.8, max: 23.0) [2024-06-27 21:26:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:26:45,068][06909] Updated weights for policy 0, policy_version 80143 (0.0044) [2024-06-27 21:26:48,202][06909] Updated weights for policy 0, policy_version 80153 (0.0027) [2024-06-27 21:26:48,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43965.2, 300 sec: 43931.3). Total num frames: 1313243136. Throughput: 0: 44004.4. Samples: 1216156540. Policy #0 lag: (min: 0.0, avg: 11.8, max: 23.0) [2024-06-27 21:26:48,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:26:52,566][06909] Updated weights for policy 0, policy_version 80163 (0.0025) [2024-06-27 21:26:53,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 1313439744. Throughput: 0: 43824.1. Samples: 1216419860. Policy #0 lag: (min: 0.0, avg: 11.8, max: 23.0) [2024-06-27 21:26:53,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:26:55,889][06909] Updated weights for policy 0, policy_version 80173 (0.0038) [2024-06-27 21:26:58,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.6, 300 sec: 43875.8). Total num frames: 1313669120. Throughput: 0: 43889.8. Samples: 1216547300. Policy #0 lag: (min: 0.0, avg: 11.8, max: 23.0) [2024-06-27 21:26:58,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:26:59,887][06909] Updated weights for policy 0, policy_version 80183 (0.0034) [2024-06-27 21:27:03,540][06909] Updated weights for policy 0, policy_version 80193 (0.0034) [2024-06-27 21:27:03,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.8, 300 sec: 43875.8). Total num frames: 1313882112. Throughput: 0: 43765.8. Samples: 1216807880. Policy #0 lag: (min: 0.0, avg: 11.8, max: 23.0) [2024-06-27 21:27:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:27:07,326][06909] Updated weights for policy 0, policy_version 80203 (0.0027) [2024-06-27 21:27:08,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43417.6, 300 sec: 43875.8). Total num frames: 1314095104. Throughput: 0: 43797.8. Samples: 1217075540. Policy #0 lag: (min: 0.0, avg: 11.8, max: 23.0) [2024-06-27 21:27:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:27:10,797][06909] Updated weights for policy 0, policy_version 80213 (0.0044) [2024-06-27 21:27:13,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.8, 300 sec: 43821.1). Total num frames: 1314324480. Throughput: 0: 43649.5. Samples: 1217199920. Policy #0 lag: (min: 0.0, avg: 11.8, max: 23.0) [2024-06-27 21:27:13,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:27:14,941][06909] Updated weights for policy 0, policy_version 80223 (0.0032) [2024-06-27 21:27:18,187][06909] Updated weights for policy 0, policy_version 80233 (0.0037) [2024-06-27 21:27:18,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1314570240. Throughput: 0: 43834.8. Samples: 1217471440. Policy #0 lag: (min: 0.0, avg: 11.8, max: 23.0) [2024-06-27 21:27:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:27:22,311][06909] Updated weights for policy 0, policy_version 80243 (0.0035) [2024-06-27 21:27:23,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43417.6, 300 sec: 43820.3). Total num frames: 1314750464. Throughput: 0: 43761.8. Samples: 1217729980. Policy #0 lag: (min: 0.0, avg: 11.8, max: 23.0) [2024-06-27 21:27:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:27:25,854][06909] Updated weights for policy 0, policy_version 80253 (0.0031) [2024-06-27 21:27:28,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43690.8, 300 sec: 43820.3). Total num frames: 1314979840. Throughput: 0: 43772.5. Samples: 1217856940. Policy #0 lag: (min: 0.0, avg: 11.8, max: 23.0) [2024-06-27 21:27:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 21:27:30,063][06909] Updated weights for policy 0, policy_version 80263 (0.0024) [2024-06-27 21:27:33,582][06909] Updated weights for policy 0, policy_version 80273 (0.0022) [2024-06-27 21:27:33,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43417.7, 300 sec: 43931.3). Total num frames: 1315209216. Throughput: 0: 43630.8. Samples: 1218119920. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 21:27:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:27:37,341][06909] Updated weights for policy 0, policy_version 80283 (0.0028) [2024-06-27 21:27:38,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 1315405824. Throughput: 0: 43595.9. Samples: 1218381680. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 21:27:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:27:40,978][06909] Updated weights for policy 0, policy_version 80293 (0.0028) [2024-06-27 21:27:43,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 1315635200. Throughput: 0: 43664.1. Samples: 1218512180. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 21:27:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:27:44,686][06909] Updated weights for policy 0, policy_version 80303 (0.0031) [2024-06-27 21:27:48,392][06909] Updated weights for policy 0, policy_version 80313 (0.0033) [2024-06-27 21:27:48,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43690.7, 300 sec: 43931.5). Total num frames: 1315864576. Throughput: 0: 43937.3. Samples: 1218785060. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 21:27:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:27:48,883][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000080315_1315880960.pth... [2024-06-27 21:27:48,937][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000079672_1305346048.pth [2024-06-27 21:27:52,117][06909] Updated weights for policy 0, policy_version 80323 (0.0030) [2024-06-27 21:27:53,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 1316077568. Throughput: 0: 43802.2. Samples: 1219046640. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 21:27:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:27:55,865][06909] Updated weights for policy 0, policy_version 80333 (0.0030) [2024-06-27 21:27:58,850][06674] Fps is (10 sec: 42597.6, 60 sec: 43690.6, 300 sec: 43820.2). Total num frames: 1316290560. Throughput: 0: 43874.5. Samples: 1219174280. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 21:27:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:27:59,600][06909] Updated weights for policy 0, policy_version 80343 (0.0029) [2024-06-27 21:28:03,302][06909] Updated weights for policy 0, policy_version 80353 (0.0029) [2024-06-27 21:28:03,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 43932.2). Total num frames: 1316519936. Throughput: 0: 43760.9. Samples: 1219440680. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 21:28:03,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 21:28:05,788][06887] Signal inference workers to stop experience collection... (17350 times) [2024-06-27 21:28:05,788][06887] Signal inference workers to resume experience collection... (17350 times) [2024-06-27 21:28:05,815][06909] InferenceWorker_p0-w0: stopping experience collection (17350 times) [2024-06-27 21:28:05,815][06909] InferenceWorker_p0-w0: resuming experience collection (17350 times) [2024-06-27 21:28:07,025][06909] Updated weights for policy 0, policy_version 80363 (0.0035) [2024-06-27 21:28:08,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 1316732928. Throughput: 0: 43775.4. Samples: 1219699880. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 21:28:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:28:11,003][06909] Updated weights for policy 0, policy_version 80373 (0.0029) [2024-06-27 21:28:13,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 1316945920. Throughput: 0: 43866.6. Samples: 1219830940. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 21:28:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:28:14,479][06909] Updated weights for policy 0, policy_version 80383 (0.0024) [2024-06-27 21:28:18,269][06909] Updated weights for policy 0, policy_version 80393 (0.0028) [2024-06-27 21:28:18,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43690.7, 300 sec: 43931.4). Total num frames: 1317191680. Throughput: 0: 44064.4. Samples: 1220102820. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 21:28:18,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:28:21,832][06909] Updated weights for policy 0, policy_version 80403 (0.0035) [2024-06-27 21:28:23,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.7, 300 sec: 43931.4). Total num frames: 1317404672. Throughput: 0: 43946.2. Samples: 1220359260. Policy #0 lag: (min: 0.0, avg: 11.7, max: 22.0) [2024-06-27 21:28:23,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:28:25,834][06909] Updated weights for policy 0, policy_version 80413 (0.0022) [2024-06-27 21:28:28,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 1317601280. Throughput: 0: 44079.1. Samples: 1220495740. Policy #0 lag: (min: 0.0, avg: 11.7, max: 22.0) [2024-06-27 21:28:28,850][06674] Avg episode reward: [(0, '0.396')] [2024-06-27 21:28:29,403][06909] Updated weights for policy 0, policy_version 80423 (0.0037) [2024-06-27 21:28:33,104][06909] Updated weights for policy 0, policy_version 80433 (0.0036) [2024-06-27 21:28:33,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1317847040. Throughput: 0: 43932.0. Samples: 1220762000. Policy #0 lag: (min: 0.0, avg: 11.7, max: 22.0) [2024-06-27 21:28:33,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 21:28:36,861][06909] Updated weights for policy 0, policy_version 80443 (0.0038) [2024-06-27 21:28:38,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 1318043648. Throughput: 0: 43820.7. Samples: 1221018580. Policy #0 lag: (min: 0.0, avg: 11.7, max: 22.0) [2024-06-27 21:28:38,856][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:28:40,622][06909] Updated weights for policy 0, policy_version 80453 (0.0026) [2024-06-27 21:28:43,856][06674] Fps is (10 sec: 42572.5, 60 sec: 43959.3, 300 sec: 43819.4). Total num frames: 1318273024. Throughput: 0: 43827.1. Samples: 1221146760. Policy #0 lag: (min: 0.0, avg: 11.7, max: 22.0) [2024-06-27 21:28:43,856][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 21:28:44,527][06909] Updated weights for policy 0, policy_version 80463 (0.0036) [2024-06-27 21:28:48,336][06909] Updated weights for policy 0, policy_version 80473 (0.0032) [2024-06-27 21:28:48,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 1318486016. Throughput: 0: 43862.7. Samples: 1221414500. Policy #0 lag: (min: 0.0, avg: 11.7, max: 22.0) [2024-06-27 21:28:48,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:28:51,765][06909] Updated weights for policy 0, policy_version 80483 (0.0027) [2024-06-27 21:28:53,850][06674] Fps is (10 sec: 44263.5, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 1318715392. Throughput: 0: 43996.9. Samples: 1221679740. Policy #0 lag: (min: 0.0, avg: 11.7, max: 22.0) [2024-06-27 21:28:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:28:55,588][06909] Updated weights for policy 0, policy_version 80493 (0.0030) [2024-06-27 21:28:58,852][06674] Fps is (10 sec: 45865.7, 60 sec: 44235.4, 300 sec: 43875.5). Total num frames: 1318944768. Throughput: 0: 44024.2. Samples: 1221812120. Policy #0 lag: (min: 0.0, avg: 11.7, max: 22.0) [2024-06-27 21:28:58,852][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:28:59,260][06909] Updated weights for policy 0, policy_version 80503 (0.0036) [2024-06-27 21:29:03,103][06909] Updated weights for policy 0, policy_version 80513 (0.0031) [2024-06-27 21:29:03,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44236.7, 300 sec: 43931.3). Total num frames: 1319174144. Throughput: 0: 43981.6. Samples: 1222082000. Policy #0 lag: (min: 0.0, avg: 11.7, max: 22.0) [2024-06-27 21:29:03,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:29:06,538][06909] Updated weights for policy 0, policy_version 80523 (0.0026) [2024-06-27 21:29:08,850][06674] Fps is (10 sec: 42606.8, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 1319370752. Throughput: 0: 44254.6. Samples: 1222350720. Policy #0 lag: (min: 0.0, avg: 11.7, max: 22.0) [2024-06-27 21:29:08,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 21:29:10,202][06909] Updated weights for policy 0, policy_version 80533 (0.0036) [2024-06-27 21:29:13,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44236.7, 300 sec: 43820.3). Total num frames: 1319600128. Throughput: 0: 44099.1. Samples: 1222480200. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-27 21:29:13,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 21:29:14,002][06909] Updated weights for policy 0, policy_version 80543 (0.0033) [2024-06-27 21:29:18,235][06909] Updated weights for policy 0, policy_version 80553 (0.0040) [2024-06-27 21:29:18,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 1319829504. Throughput: 0: 43874.2. Samples: 1222736340. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-27 21:29:18,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 21:29:21,330][06909] Updated weights for policy 0, policy_version 80563 (0.0028) [2024-06-27 21:29:23,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 1320026112. Throughput: 0: 43967.1. Samples: 1222997100. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-27 21:29:23,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:29:25,638][06909] Updated weights for policy 0, policy_version 80573 (0.0032) [2024-06-27 21:29:28,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44236.9, 300 sec: 43820.3). Total num frames: 1320255488. Throughput: 0: 44013.1. Samples: 1223127080. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-27 21:29:28,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 21:29:29,150][06909] Updated weights for policy 0, policy_version 80583 (0.0030) [2024-06-27 21:29:33,107][06909] Updated weights for policy 0, policy_version 80593 (0.0033) [2024-06-27 21:29:33,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 1320468480. Throughput: 0: 44050.2. Samples: 1223396760. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-27 21:29:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:29:36,446][06909] Updated weights for policy 0, policy_version 80603 (0.0026) [2024-06-27 21:29:38,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.9, 300 sec: 43875.8). Total num frames: 1320681472. Throughput: 0: 43996.5. Samples: 1223659580. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-27 21:29:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:29:39,926][06887] Signal inference workers to stop experience collection... (17400 times) [2024-06-27 21:29:39,926][06887] Signal inference workers to resume experience collection... (17400 times) [2024-06-27 21:29:39,938][06909] InferenceWorker_p0-w0: stopping experience collection (17400 times) [2024-06-27 21:29:39,948][06909] InferenceWorker_p0-w0: resuming experience collection (17400 times) [2024-06-27 21:29:40,393][06909] Updated weights for policy 0, policy_version 80613 (0.0025) [2024-06-27 21:29:43,601][06909] Updated weights for policy 0, policy_version 80623 (0.0035) [2024-06-27 21:29:43,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44241.2, 300 sec: 43876.7). Total num frames: 1320927232. Throughput: 0: 44094.8. Samples: 1223796300. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-27 21:29:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:29:47,692][06909] Updated weights for policy 0, policy_version 80633 (0.0022) [2024-06-27 21:29:48,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 43876.1). Total num frames: 1321123840. Throughput: 0: 44012.2. Samples: 1224062540. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-27 21:29:48,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:29:48,890][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000080636_1321140224.pth... [2024-06-27 21:29:48,960][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000079994_1310621696.pth [2024-06-27 21:29:51,093][06909] Updated weights for policy 0, policy_version 80643 (0.0036) [2024-06-27 21:29:53,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43690.7, 300 sec: 43931.4). Total num frames: 1321336832. Throughput: 0: 43748.6. Samples: 1224319400. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-27 21:29:53,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 21:29:55,122][06909] Updated weights for policy 0, policy_version 80653 (0.0022) [2024-06-27 21:29:58,657][06909] Updated weights for policy 0, policy_version 80663 (0.0044) [2024-06-27 21:29:58,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43965.2, 300 sec: 43875.8). Total num frames: 1321582592. Throughput: 0: 43642.3. Samples: 1224444100. Policy #0 lag: (min: 0.0, avg: 11.3, max: 25.0) [2024-06-27 21:29:58,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 21:30:02,887][06909] Updated weights for policy 0, policy_version 80673 (0.0031) [2024-06-27 21:30:03,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43417.7, 300 sec: 43820.3). Total num frames: 1321779200. Throughput: 0: 43795.5. Samples: 1224707140. Policy #0 lag: (min: 0.0, avg: 11.3, max: 25.0) [2024-06-27 21:30:03,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:30:06,163][06909] Updated weights for policy 0, policy_version 80683 (0.0039) [2024-06-27 21:30:08,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43963.8, 300 sec: 43931.9). Total num frames: 1322008576. Throughput: 0: 44035.8. Samples: 1224978700. Policy #0 lag: (min: 0.0, avg: 11.3, max: 25.0) [2024-06-27 21:30:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:30:10,175][06909] Updated weights for policy 0, policy_version 80693 (0.0026) [2024-06-27 21:30:13,359][06909] Updated weights for policy 0, policy_version 80703 (0.0023) [2024-06-27 21:30:13,851][06674] Fps is (10 sec: 45869.0, 60 sec: 43962.8, 300 sec: 43875.6). Total num frames: 1322237952. Throughput: 0: 44164.4. Samples: 1225114540. Policy #0 lag: (min: 0.0, avg: 11.3, max: 25.0) [2024-06-27 21:30:13,852][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 21:30:17,343][06909] Updated weights for policy 0, policy_version 80713 (0.0045) [2024-06-27 21:30:18,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43417.7, 300 sec: 43875.8). Total num frames: 1322434560. Throughput: 0: 44008.9. Samples: 1225377160. Policy #0 lag: (min: 0.0, avg: 11.3, max: 25.0) [2024-06-27 21:30:18,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:30:20,748][06909] Updated weights for policy 0, policy_version 80723 (0.0039) [2024-06-27 21:30:23,856][06674] Fps is (10 sec: 42578.5, 60 sec: 43959.4, 300 sec: 43930.4). Total num frames: 1322663936. Throughput: 0: 44076.7. Samples: 1225643300. Policy #0 lag: (min: 0.0, avg: 11.3, max: 25.0) [2024-06-27 21:30:23,865][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:30:24,993][06909] Updated weights for policy 0, policy_version 80733 (0.0027) [2024-06-27 21:30:28,380][06909] Updated weights for policy 0, policy_version 80743 (0.0031) [2024-06-27 21:30:28,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.8, 300 sec: 43876.3). Total num frames: 1322893312. Throughput: 0: 43965.9. Samples: 1225774760. Policy #0 lag: (min: 0.0, avg: 11.3, max: 25.0) [2024-06-27 21:30:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:30:32,411][06909] Updated weights for policy 0, policy_version 80753 (0.0035) [2024-06-27 21:30:33,850][06674] Fps is (10 sec: 44263.9, 60 sec: 43963.8, 300 sec: 43820.3). Total num frames: 1323106304. Throughput: 0: 43808.5. Samples: 1226033920. Policy #0 lag: (min: 0.0, avg: 11.3, max: 25.0) [2024-06-27 21:30:33,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:30:35,800][06909] Updated weights for policy 0, policy_version 80763 (0.0048) [2024-06-27 21:30:38,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 1323302912. Throughput: 0: 43995.1. Samples: 1226299180. Policy #0 lag: (min: 0.0, avg: 11.3, max: 25.0) [2024-06-27 21:30:38,859][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:30:40,007][06909] Updated weights for policy 0, policy_version 80773 (0.0029) [2024-06-27 21:30:43,437][06909] Updated weights for policy 0, policy_version 80783 (0.0023) [2024-06-27 21:30:43,850][06674] Fps is (10 sec: 45874.5, 60 sec: 43963.7, 300 sec: 43931.6). Total num frames: 1323565056. Throughput: 0: 44143.0. Samples: 1226430540. Policy #0 lag: (min: 0.0, avg: 11.3, max: 25.0) [2024-06-27 21:30:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:30:47,277][06909] Updated weights for policy 0, policy_version 80793 (0.0019) [2024-06-27 21:30:48,850][06674] Fps is (10 sec: 47513.2, 60 sec: 44236.7, 300 sec: 43931.3). Total num frames: 1323778048. Throughput: 0: 44154.2. Samples: 1226694080. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-27 21:30:48,864][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:30:50,551][06909] Updated weights for policy 0, policy_version 80803 (0.0027) [2024-06-27 21:30:53,850][06674] Fps is (10 sec: 42598.7, 60 sec: 44236.7, 300 sec: 43931.3). Total num frames: 1323991040. Throughput: 0: 44228.8. Samples: 1226969000. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-27 21:30:53,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:30:54,434][06909] Updated weights for policy 0, policy_version 80813 (0.0027) [2024-06-27 21:30:58,223][06909] Updated weights for policy 0, policy_version 80823 (0.0031) [2024-06-27 21:30:58,852][06674] Fps is (10 sec: 45866.0, 60 sec: 44235.3, 300 sec: 43986.6). Total num frames: 1324236800. Throughput: 0: 44135.3. Samples: 1227100660. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-27 21:30:58,852][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:31:01,897][06909] Updated weights for policy 0, policy_version 80833 (0.0031) [2024-06-27 21:31:03,850][06674] Fps is (10 sec: 44236.2, 60 sec: 44236.7, 300 sec: 43875.8). Total num frames: 1324433408. Throughput: 0: 44165.1. Samples: 1227364600. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-27 21:31:03,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:31:05,569][06909] Updated weights for policy 0, policy_version 80843 (0.0033) [2024-06-27 21:31:08,718][06887] Signal inference workers to stop experience collection... (17450 times) [2024-06-27 21:31:08,718][06887] Signal inference workers to resume experience collection... (17450 times) [2024-06-27 21:31:08,732][06909] InferenceWorker_p0-w0: stopping experience collection (17450 times) [2024-06-27 21:31:08,732][06909] InferenceWorker_p0-w0: resuming experience collection (17450 times) [2024-06-27 21:31:08,850][06674] Fps is (10 sec: 40968.7, 60 sec: 43963.7, 300 sec: 43931.4). Total num frames: 1324646400. Throughput: 0: 44143.3. Samples: 1227629480. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-27 21:31:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:31:09,377][06909] Updated weights for policy 0, policy_version 80853 (0.0045) [2024-06-27 21:31:12,924][06909] Updated weights for policy 0, policy_version 80863 (0.0031) [2024-06-27 21:31:13,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44237.8, 300 sec: 43986.9). Total num frames: 1324892160. Throughput: 0: 43974.5. Samples: 1227753620. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-27 21:31:13,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:31:16,893][06909] Updated weights for policy 0, policy_version 80873 (0.0033) [2024-06-27 21:31:18,850][06674] Fps is (10 sec: 44236.0, 60 sec: 44236.7, 300 sec: 43875.8). Total num frames: 1325088768. Throughput: 0: 44064.3. Samples: 1228016820. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-27 21:31:18,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:31:20,632][06909] Updated weights for policy 0, policy_version 80883 (0.0039) [2024-06-27 21:31:23,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43968.2, 300 sec: 43875.8). Total num frames: 1325301760. Throughput: 0: 44181.3. Samples: 1228287340. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-27 21:31:23,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:31:24,366][06909] Updated weights for policy 0, policy_version 80893 (0.0031) [2024-06-27 21:31:28,109][06909] Updated weights for policy 0, policy_version 80903 (0.0039) [2024-06-27 21:31:28,850][06674] Fps is (10 sec: 47514.3, 60 sec: 44509.8, 300 sec: 43931.4). Total num frames: 1325563904. Throughput: 0: 44130.8. Samples: 1228416420. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-27 21:31:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:31:31,625][06909] Updated weights for policy 0, policy_version 80913 (0.0026) [2024-06-27 21:31:33,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 1325760512. Throughput: 0: 44160.0. Samples: 1228681280. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-27 21:31:33,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:31:35,384][06909] Updated weights for policy 0, policy_version 80923 (0.0037) [2024-06-27 21:31:38,850][06674] Fps is (10 sec: 42597.9, 60 sec: 44782.8, 300 sec: 43986.9). Total num frames: 1325989888. Throughput: 0: 44028.8. Samples: 1228950300. Policy #0 lag: (min: 0.0, avg: 12.5, max: 23.0) [2024-06-27 21:31:38,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 21:31:39,230][06909] Updated weights for policy 0, policy_version 80933 (0.0032) [2024-06-27 21:31:42,855][06909] Updated weights for policy 0, policy_version 80943 (0.0042) [2024-06-27 21:31:43,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1326219264. Throughput: 0: 43901.5. Samples: 1229076140. Policy #0 lag: (min: 0.0, avg: 12.5, max: 23.0) [2024-06-27 21:31:43,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 21:31:47,058][06909] Updated weights for policy 0, policy_version 80953 (0.0035) [2024-06-27 21:31:48,850][06674] Fps is (10 sec: 44237.5, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 1326432256. Throughput: 0: 44025.6. Samples: 1229345740. Policy #0 lag: (min: 0.0, avg: 12.5, max: 23.0) [2024-06-27 21:31:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:31:48,862][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000080959_1326432256.pth... [2024-06-27 21:31:48,927][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000080315_1315880960.pth [2024-06-27 21:31:50,227][06909] Updated weights for policy 0, policy_version 80963 (0.0043) [2024-06-27 21:31:53,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 1326628864. Throughput: 0: 43906.6. Samples: 1229605280. Policy #0 lag: (min: 0.0, avg: 12.5, max: 23.0) [2024-06-27 21:31:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:31:54,408][06909] Updated weights for policy 0, policy_version 80973 (0.0045) [2024-06-27 21:31:57,835][06909] Updated weights for policy 0, policy_version 80983 (0.0031) [2024-06-27 21:31:58,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43965.2, 300 sec: 44042.4). Total num frames: 1326874624. Throughput: 0: 44025.8. Samples: 1229734780. Policy #0 lag: (min: 0.0, avg: 12.5, max: 23.0) [2024-06-27 21:31:58,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:32:01,717][06909] Updated weights for policy 0, policy_version 80993 (0.0029) [2024-06-27 21:32:03,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1327071232. Throughput: 0: 43913.0. Samples: 1229992900. Policy #0 lag: (min: 0.0, avg: 12.5, max: 23.0) [2024-06-27 21:32:03,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:32:05,087][06909] Updated weights for policy 0, policy_version 81003 (0.0029) [2024-06-27 21:32:08,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 1327300608. Throughput: 0: 43828.9. Samples: 1230259640. Policy #0 lag: (min: 0.0, avg: 12.5, max: 23.0) [2024-06-27 21:32:08,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:32:09,002][06909] Updated weights for policy 0, policy_version 81013 (0.0031) [2024-06-27 21:32:12,480][06909] Updated weights for policy 0, policy_version 81023 (0.0032) [2024-06-27 21:32:13,850][06674] Fps is (10 sec: 44234.1, 60 sec: 43690.3, 300 sec: 43875.7). Total num frames: 1327513600. Throughput: 0: 43951.4. Samples: 1230394260. Policy #0 lag: (min: 0.0, avg: 12.5, max: 23.0) [2024-06-27 21:32:13,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:32:14,100][06887] Signal inference workers to stop experience collection... (17500 times) [2024-06-27 21:32:14,149][06887] Signal inference workers to resume experience collection... (17500 times) [2024-06-27 21:32:14,150][06909] InferenceWorker_p0-w0: stopping experience collection (17500 times) [2024-06-27 21:32:14,180][06909] InferenceWorker_p0-w0: resuming experience collection (17500 times) [2024-06-27 21:32:16,840][06909] Updated weights for policy 0, policy_version 81033 (0.0052) [2024-06-27 21:32:18,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 1327742976. Throughput: 0: 44024.9. Samples: 1230662400. Policy #0 lag: (min: 0.0, avg: 12.5, max: 23.0) [2024-06-27 21:32:18,850][06674] Avg episode reward: [(0, '0.400')] [2024-06-27 21:32:19,762][06909] Updated weights for policy 0, policy_version 81043 (0.0033) [2024-06-27 21:32:23,850][06674] Fps is (10 sec: 42601.0, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 1327939584. Throughput: 0: 43760.5. Samples: 1230919520. Policy #0 lag: (min: 0.0, avg: 12.5, max: 23.0) [2024-06-27 21:32:23,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:32:24,647][06909] Updated weights for policy 0, policy_version 81053 (0.0026) [2024-06-27 21:32:27,431][06909] Updated weights for policy 0, policy_version 81063 (0.0034) [2024-06-27 21:32:28,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43417.6, 300 sec: 43931.3). Total num frames: 1328168960. Throughput: 0: 43782.8. Samples: 1231046360. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2024-06-27 21:32:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:32:31,781][06909] Updated weights for policy 0, policy_version 81073 (0.0033) [2024-06-27 21:32:33,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1328398336. Throughput: 0: 44028.4. Samples: 1231327020. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2024-06-27 21:32:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:32:34,752][06909] Updated weights for policy 0, policy_version 81083 (0.0032) [2024-06-27 21:32:38,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1328611328. Throughput: 0: 43810.2. Samples: 1231576740. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2024-06-27 21:32:38,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:32:38,985][06909] Updated weights for policy 0, policy_version 81093 (0.0045) [2024-06-27 21:32:42,302][06909] Updated weights for policy 0, policy_version 81103 (0.0035) [2024-06-27 21:32:43,852][06674] Fps is (10 sec: 44227.4, 60 sec: 43689.2, 300 sec: 43986.6). Total num frames: 1328840704. Throughput: 0: 43914.9. Samples: 1231711040. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2024-06-27 21:32:43,852][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:32:46,335][06909] Updated weights for policy 0, policy_version 81113 (0.0021) [2024-06-27 21:32:48,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43690.5, 300 sec: 43986.9). Total num frames: 1329053696. Throughput: 0: 44195.0. Samples: 1231981680. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2024-06-27 21:32:48,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:32:49,627][06909] Updated weights for policy 0, policy_version 81123 (0.0026) [2024-06-27 21:32:53,850][06674] Fps is (10 sec: 42607.0, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1329266688. Throughput: 0: 43984.8. Samples: 1232238960. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2024-06-27 21:32:53,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:32:54,064][06909] Updated weights for policy 0, policy_version 81133 (0.0026) [2024-06-27 21:32:56,970][06909] Updated weights for policy 0, policy_version 81143 (0.0024) [2024-06-27 21:32:58,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1329512448. Throughput: 0: 43936.1. Samples: 1232371360. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2024-06-27 21:32:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:33:01,801][06909] Updated weights for policy 0, policy_version 81153 (0.0033) [2024-06-27 21:33:03,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1329725440. Throughput: 0: 44051.6. Samples: 1232644720. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2024-06-27 21:33:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:33:04,706][06909] Updated weights for policy 0, policy_version 81163 (0.0041) [2024-06-27 21:33:08,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1329922048. Throughput: 0: 43994.2. Samples: 1232899260. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2024-06-27 21:33:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:33:09,039][06909] Updated weights for policy 0, policy_version 81173 (0.0029) [2024-06-27 21:33:12,142][06909] Updated weights for policy 0, policy_version 81183 (0.0023) [2024-06-27 21:33:13,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44237.2, 300 sec: 43986.9). Total num frames: 1330167808. Throughput: 0: 44058.1. Samples: 1233028980. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2024-06-27 21:33:13,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 21:33:16,430][06909] Updated weights for policy 0, policy_version 81193 (0.0036) [2024-06-27 21:33:18,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1330380800. Throughput: 0: 43774.6. Samples: 1233296880. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-27 21:33:18,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:33:19,438][06909] Updated weights for policy 0, policy_version 81203 (0.0031) [2024-06-27 21:33:23,693][06909] Updated weights for policy 0, policy_version 81213 (0.0043) [2024-06-27 21:33:23,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1330593792. Throughput: 0: 43999.5. Samples: 1233556720. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-27 21:33:23,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 21:33:25,450][06887] Signal inference workers to stop experience collection... (17550 times) [2024-06-27 21:33:25,451][06887] Signal inference workers to resume experience collection... (17550 times) [2024-06-27 21:33:25,499][06909] InferenceWorker_p0-w0: stopping experience collection (17550 times) [2024-06-27 21:33:25,499][06909] InferenceWorker_p0-w0: resuming experience collection (17550 times) [2024-06-27 21:33:27,290][06909] Updated weights for policy 0, policy_version 81223 (0.0040) [2024-06-27 21:33:28,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 1330823168. Throughput: 0: 43850.4. Samples: 1233684220. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-27 21:33:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:33:31,395][06909] Updated weights for policy 0, policy_version 81233 (0.0053) [2024-06-27 21:33:33,852][06674] Fps is (10 sec: 44228.0, 60 sec: 43962.2, 300 sec: 44042.1). Total num frames: 1331036160. Throughput: 0: 43805.7. Samples: 1233953020. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-27 21:33:33,852][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:33:34,611][06909] Updated weights for policy 0, policy_version 81243 (0.0028) [2024-06-27 21:33:38,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43690.7, 300 sec: 43932.2). Total num frames: 1331232768. Throughput: 0: 43931.2. Samples: 1234215860. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-27 21:33:38,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:33:39,154][06909] Updated weights for policy 0, policy_version 81253 (0.0036) [2024-06-27 21:33:42,126][06909] Updated weights for policy 0, policy_version 81263 (0.0032) [2024-06-27 21:33:43,850][06674] Fps is (10 sec: 44246.0, 60 sec: 43965.3, 300 sec: 44042.4). Total num frames: 1331478528. Throughput: 0: 43880.1. Samples: 1234345960. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-27 21:33:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:33:46,419][06909] Updated weights for policy 0, policy_version 81273 (0.0031) [2024-06-27 21:33:48,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1331691520. Throughput: 0: 43681.3. Samples: 1234610380. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-27 21:33:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:33:48,977][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000081281_1331707904.pth... [2024-06-27 21:33:49,026][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000080636_1321140224.pth [2024-06-27 21:33:49,676][06909] Updated weights for policy 0, policy_version 81283 (0.0027) [2024-06-27 21:33:53,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.7, 300 sec: 43876.1). Total num frames: 1331888128. Throughput: 0: 43753.8. Samples: 1234868180. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-27 21:33:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:33:53,871][06909] Updated weights for policy 0, policy_version 81293 (0.0034) [2024-06-27 21:33:57,126][06909] Updated weights for policy 0, policy_version 81303 (0.0043) [2024-06-27 21:33:58,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 1332133888. Throughput: 0: 43675.1. Samples: 1234994360. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-27 21:33:58,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:34:01,251][06909] Updated weights for policy 0, policy_version 81313 (0.0033) [2024-06-27 21:34:03,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1332346880. Throughput: 0: 43657.9. Samples: 1235261480. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-27 21:34:03,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:34:05,089][06909] Updated weights for policy 0, policy_version 81323 (0.0029) [2024-06-27 21:34:08,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 1332543488. Throughput: 0: 43862.7. Samples: 1235530540. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 21:34:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:34:09,064][06909] Updated weights for policy 0, policy_version 81333 (0.0023) [2024-06-27 21:34:12,294][06909] Updated weights for policy 0, policy_version 81343 (0.0029) [2024-06-27 21:34:13,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 1332789248. Throughput: 0: 43792.9. Samples: 1235654900. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 21:34:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:34:16,608][06909] Updated weights for policy 0, policy_version 81353 (0.0037) [2024-06-27 21:34:18,852][06674] Fps is (10 sec: 45865.9, 60 sec: 43689.2, 300 sec: 43986.6). Total num frames: 1333002240. Throughput: 0: 43775.1. Samples: 1235922900. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 21:34:18,852][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:34:19,835][06909] Updated weights for policy 0, policy_version 81363 (0.0030) [2024-06-27 21:34:23,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43417.6, 300 sec: 43875.8). Total num frames: 1333198848. Throughput: 0: 43811.9. Samples: 1236187400. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 21:34:23,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:34:24,137][06909] Updated weights for policy 0, policy_version 81373 (0.0026) [2024-06-27 21:34:27,241][06909] Updated weights for policy 0, policy_version 81383 (0.0030) [2024-06-27 21:34:28,850][06674] Fps is (10 sec: 44245.5, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 1333444608. Throughput: 0: 43721.6. Samples: 1236313440. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 21:34:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:34:31,548][06909] Updated weights for policy 0, policy_version 81393 (0.0041) [2024-06-27 21:34:32,411][06887] Signal inference workers to stop experience collection... (17600 times) [2024-06-27 21:34:32,412][06887] Signal inference workers to resume experience collection... (17600 times) [2024-06-27 21:34:32,438][06909] InferenceWorker_p0-w0: stopping experience collection (17600 times) [2024-06-27 21:34:32,438][06909] InferenceWorker_p0-w0: resuming experience collection (17600 times) [2024-06-27 21:34:33,852][06674] Fps is (10 sec: 45865.9, 60 sec: 43690.7, 300 sec: 43986.6). Total num frames: 1333657600. Throughput: 0: 43803.8. Samples: 1236581640. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 21:34:33,861][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:34:34,758][06909] Updated weights for policy 0, policy_version 81403 (0.0028) [2024-06-27 21:34:38,828][06909] Updated weights for policy 0, policy_version 81413 (0.0026) [2024-06-27 21:34:38,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 1333870592. Throughput: 0: 44033.7. Samples: 1236849700. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 21:34:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:34:41,957][06909] Updated weights for policy 0, policy_version 81423 (0.0035) [2024-06-27 21:34:43,850][06674] Fps is (10 sec: 44245.7, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 1334099968. Throughput: 0: 44014.7. Samples: 1236975020. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 21:34:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:34:46,468][06909] Updated weights for policy 0, policy_version 81433 (0.0042) [2024-06-27 21:34:48,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1334329344. Throughput: 0: 44112.3. Samples: 1237246540. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 21:34:48,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:34:49,534][06909] Updated weights for policy 0, policy_version 81443 (0.0021) [2024-06-27 21:34:53,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 1334509568. Throughput: 0: 44036.9. Samples: 1237512200. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 21:34:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:34:53,896][06909] Updated weights for policy 0, policy_version 81453 (0.0038) [2024-06-27 21:34:56,887][06909] Updated weights for policy 0, policy_version 81463 (0.0026) [2024-06-27 21:34:58,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1334755328. Throughput: 0: 44136.0. Samples: 1237641020. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2024-06-27 21:34:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:35:01,098][06909] Updated weights for policy 0, policy_version 81473 (0.0040) [2024-06-27 21:35:03,850][06674] Fps is (10 sec: 47512.8, 60 sec: 43963.6, 300 sec: 43986.8). Total num frames: 1334984704. Throughput: 0: 44097.9. Samples: 1237907220. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2024-06-27 21:35:03,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:35:04,418][06909] Updated weights for policy 0, policy_version 81483 (0.0028) [2024-06-27 21:35:08,581][06909] Updated weights for policy 0, policy_version 81493 (0.0047) [2024-06-27 21:35:08,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43963.6, 300 sec: 43876.0). Total num frames: 1335181312. Throughput: 0: 44081.2. Samples: 1238171060. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2024-06-27 21:35:08,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:35:12,252][06909] Updated weights for policy 0, policy_version 81503 (0.0039) [2024-06-27 21:35:13,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1335427072. Throughput: 0: 43939.2. Samples: 1238290700. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2024-06-27 21:35:13,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:35:16,160][06909] Updated weights for policy 0, policy_version 81513 (0.0026) [2024-06-27 21:35:18,852][06674] Fps is (10 sec: 45866.5, 60 sec: 43963.7, 300 sec: 43987.5). Total num frames: 1335640064. Throughput: 0: 44017.3. Samples: 1238562420. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2024-06-27 21:35:18,852][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:35:19,408][06909] Updated weights for policy 0, policy_version 81523 (0.0035) [2024-06-27 21:35:23,850][06674] Fps is (10 sec: 39322.1, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 1335820288. Throughput: 0: 43984.6. Samples: 1238829000. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2024-06-27 21:35:23,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 21:35:23,920][06909] Updated weights for policy 0, policy_version 81533 (0.0038) [2024-06-27 21:35:26,763][06909] Updated weights for policy 0, policy_version 81543 (0.0027) [2024-06-27 21:35:28,850][06674] Fps is (10 sec: 44246.2, 60 sec: 43963.9, 300 sec: 43986.9). Total num frames: 1336082432. Throughput: 0: 43929.0. Samples: 1238951820. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2024-06-27 21:35:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:35:31,098][06909] Updated weights for policy 0, policy_version 81553 (0.0024) [2024-06-27 21:35:33,856][06674] Fps is (10 sec: 49121.4, 60 sec: 44233.8, 300 sec: 44097.0). Total num frames: 1336311808. Throughput: 0: 43763.9. Samples: 1239216180. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2024-06-27 21:35:33,857][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:35:34,148][06909] Updated weights for policy 0, policy_version 81563 (0.0042) [2024-06-27 21:35:38,495][06909] Updated weights for policy 0, policy_version 81573 (0.0033) [2024-06-27 21:35:38,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 1336492032. Throughput: 0: 43940.9. Samples: 1239489540. Policy #0 lag: (min: 0.0, avg: 7.5, max: 19.0) [2024-06-27 21:35:38,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:35:41,767][06909] Updated weights for policy 0, policy_version 81583 (0.0020) [2024-06-27 21:35:43,850][06674] Fps is (10 sec: 42624.7, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 1336737792. Throughput: 0: 43796.5. Samples: 1239611860. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 21:35:43,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:35:45,811][06909] Updated weights for policy 0, policy_version 81593 (0.0033) [2024-06-27 21:35:47,397][06887] Signal inference workers to stop experience collection... (17650 times) [2024-06-27 21:35:47,397][06887] Signal inference workers to resume experience collection... (17650 times) [2024-06-27 21:35:47,418][06909] InferenceWorker_p0-w0: stopping experience collection (17650 times) [2024-06-27 21:35:47,418][06909] InferenceWorker_p0-w0: resuming experience collection (17650 times) [2024-06-27 21:35:48,850][06674] Fps is (10 sec: 47512.7, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1336967168. Throughput: 0: 43732.0. Samples: 1239875160. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 21:35:48,859][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:35:48,878][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000081602_1336967168.pth... [2024-06-27 21:35:48,955][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000080959_1326432256.pth [2024-06-27 21:35:49,095][06909] Updated weights for policy 0, policy_version 81603 (0.0022) [2024-06-27 21:35:53,453][06909] Updated weights for policy 0, policy_version 81613 (0.0032) [2024-06-27 21:35:53,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43963.7, 300 sec: 43765.0). Total num frames: 1337147392. Throughput: 0: 43628.1. Samples: 1240134320. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 21:35:53,859][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:35:57,076][06909] Updated weights for policy 0, policy_version 81623 (0.0041) [2024-06-27 21:35:58,851][06674] Fps is (10 sec: 42595.2, 60 sec: 43963.1, 300 sec: 43931.2). Total num frames: 1337393152. Throughput: 0: 43758.7. Samples: 1240259880. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 21:35:58,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:36:01,282][06909] Updated weights for policy 0, policy_version 81633 (0.0024) [2024-06-27 21:36:03,850][06674] Fps is (10 sec: 47513.6, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1337622528. Throughput: 0: 43624.6. Samples: 1240525440. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 21:36:03,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:36:04,443][06909] Updated weights for policy 0, policy_version 81643 (0.0033) [2024-06-27 21:36:08,604][06909] Updated weights for policy 0, policy_version 81653 (0.0025) [2024-06-27 21:36:08,850][06674] Fps is (10 sec: 40963.7, 60 sec: 43690.8, 300 sec: 43764.7). Total num frames: 1337802752. Throughput: 0: 43643.0. Samples: 1240792940. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 21:36:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:36:11,692][06909] Updated weights for policy 0, policy_version 81663 (0.0036) [2024-06-27 21:36:13,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.7, 300 sec: 43931.4). Total num frames: 1338048512. Throughput: 0: 43684.8. Samples: 1240917640. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 21:36:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:36:15,750][06909] Updated weights for policy 0, policy_version 81673 (0.0028) [2024-06-27 21:36:18,850][06674] Fps is (10 sec: 47513.6, 60 sec: 43965.2, 300 sec: 43986.9). Total num frames: 1338277888. Throughput: 0: 43771.3. Samples: 1241185620. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 21:36:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:36:19,229][06909] Updated weights for policy 0, policy_version 81683 (0.0036) [2024-06-27 21:36:22,984][06909] Updated weights for policy 0, policy_version 81693 (0.0019) [2024-06-27 21:36:23,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43963.6, 300 sec: 43709.2). Total num frames: 1338458112. Throughput: 0: 43722.6. Samples: 1241457060. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 21:36:23,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:36:26,507][06909] Updated weights for policy 0, policy_version 81703 (0.0049) [2024-06-27 21:36:28,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.5, 300 sec: 43875.8). Total num frames: 1338703872. Throughput: 0: 43865.7. Samples: 1241585820. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 21:36:28,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:36:30,462][06909] Updated weights for policy 0, policy_version 81713 (0.0023) [2024-06-27 21:36:33,850][06674] Fps is (10 sec: 47513.8, 60 sec: 43695.1, 300 sec: 43875.8). Total num frames: 1338933248. Throughput: 0: 43813.5. Samples: 1241846760. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-27 21:36:33,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:36:34,218][06909] Updated weights for policy 0, policy_version 81723 (0.0032) [2024-06-27 21:36:37,942][06909] Updated weights for policy 0, policy_version 81733 (0.0037) [2024-06-27 21:36:38,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.7, 300 sec: 43820.3). Total num frames: 1339146240. Throughput: 0: 44084.9. Samples: 1242118140. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-27 21:36:38,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:36:41,871][06909] Updated weights for policy 0, policy_version 81743 (0.0023) [2024-06-27 21:36:43,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 1339359232. Throughput: 0: 44184.5. Samples: 1242248140. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-27 21:36:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:36:45,544][06909] Updated weights for policy 0, policy_version 81753 (0.0028) [2024-06-27 21:36:48,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 1339588608. Throughput: 0: 43943.9. Samples: 1242502920. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-27 21:36:48,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:36:49,138][06909] Updated weights for policy 0, policy_version 81763 (0.0037) [2024-06-27 21:36:52,807][06909] Updated weights for policy 0, policy_version 81773 (0.0025) [2024-06-27 21:36:53,850][06674] Fps is (10 sec: 44236.2, 60 sec: 44236.8, 300 sec: 43820.3). Total num frames: 1339801600. Throughput: 0: 44034.2. Samples: 1242774480. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-27 21:36:53,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:36:56,525][06909] Updated weights for policy 0, policy_version 81783 (0.0036) [2024-06-27 21:36:58,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43691.3, 300 sec: 43875.8). Total num frames: 1340014592. Throughput: 0: 44135.9. Samples: 1242903760. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-27 21:36:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:37:00,120][06909] Updated weights for policy 0, policy_version 81793 (0.0033) [2024-06-27 21:37:03,751][06909] Updated weights for policy 0, policy_version 81803 (0.0031) [2024-06-27 21:37:03,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43963.6, 300 sec: 43931.3). Total num frames: 1340260352. Throughput: 0: 43847.4. Samples: 1243158760. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-27 21:37:03,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:37:07,954][06909] Updated weights for policy 0, policy_version 81813 (0.0035) [2024-06-27 21:37:08,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.8, 300 sec: 43875.9). Total num frames: 1340456960. Throughput: 0: 43807.1. Samples: 1243428380. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-27 21:37:08,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:37:11,646][06909] Updated weights for policy 0, policy_version 81823 (0.0027) [2024-06-27 21:37:13,850][06674] Fps is (10 sec: 40960.9, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 1340669952. Throughput: 0: 43781.9. Samples: 1243556000. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-27 21:37:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:37:15,396][06909] Updated weights for policy 0, policy_version 81833 (0.0027) [2024-06-27 21:37:16,238][06887] Signal inference workers to stop experience collection... (17700 times) [2024-06-27 21:37:16,239][06887] Signal inference workers to resume experience collection... (17700 times) [2024-06-27 21:37:16,263][06909] InferenceWorker_p0-w0: stopping experience collection (17700 times) [2024-06-27 21:37:16,264][06909] InferenceWorker_p0-w0: resuming experience collection (17700 times) [2024-06-27 21:37:18,852][06674] Fps is (10 sec: 44228.0, 60 sec: 43689.2, 300 sec: 43931.0). Total num frames: 1340899328. Throughput: 0: 43581.6. Samples: 1243808020. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-27 21:37:18,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:37:19,256][06909] Updated weights for policy 0, policy_version 81843 (0.0029) [2024-06-27 21:37:22,930][06909] Updated weights for policy 0, policy_version 81853 (0.0032) [2024-06-27 21:37:23,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.9, 300 sec: 43875.8). Total num frames: 1341112320. Throughput: 0: 43709.9. Samples: 1244085080. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 21:37:23,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 21:37:26,469][06909] Updated weights for policy 0, policy_version 81863 (0.0037) [2024-06-27 21:37:28,850][06674] Fps is (10 sec: 42606.7, 60 sec: 43690.7, 300 sec: 43820.2). Total num frames: 1341325312. Throughput: 0: 43668.7. Samples: 1244213240. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 21:37:28,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:37:30,194][06909] Updated weights for policy 0, policy_version 81873 (0.0027) [2024-06-27 21:37:33,706][06909] Updated weights for policy 0, policy_version 81883 (0.0032) [2024-06-27 21:37:33,850][06674] Fps is (10 sec: 45874.4, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 1341571072. Throughput: 0: 43774.2. Samples: 1244472760. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 21:37:33,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 21:37:38,040][06909] Updated weights for policy 0, policy_version 81893 (0.0039) [2024-06-27 21:37:38,850][06674] Fps is (10 sec: 45875.9, 60 sec: 43963.8, 300 sec: 43876.1). Total num frames: 1341784064. Throughput: 0: 43796.5. Samples: 1244745320. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 21:37:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 21:37:41,366][06909] Updated weights for policy 0, policy_version 81903 (0.0046) [2024-06-27 21:37:43,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43690.6, 300 sec: 43820.3). Total num frames: 1341980672. Throughput: 0: 43786.7. Samples: 1244874160. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 21:37:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:37:45,519][06909] Updated weights for policy 0, policy_version 81913 (0.0038) [2024-06-27 21:37:48,851][06674] Fps is (10 sec: 42594.7, 60 sec: 43690.1, 300 sec: 43875.7). Total num frames: 1342210048. Throughput: 0: 43780.3. Samples: 1245128900. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 21:37:48,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:37:48,995][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000081923_1342226432.pth... [2024-06-27 21:37:49,001][06909] Updated weights for policy 0, policy_version 81923 (0.0038) [2024-06-27 21:37:49,057][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000081281_1331707904.pth [2024-06-27 21:37:53,174][06909] Updated weights for policy 0, policy_version 81933 (0.0040) [2024-06-27 21:37:53,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.7, 300 sec: 43820.3). Total num frames: 1342439424. Throughput: 0: 43848.5. Samples: 1245401560. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 21:37:53,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:37:56,221][06909] Updated weights for policy 0, policy_version 81943 (0.0035) [2024-06-27 21:37:58,850][06674] Fps is (10 sec: 42601.8, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 1342636032. Throughput: 0: 43742.6. Samples: 1245524420. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 21:37:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:38:00,440][06909] Updated weights for policy 0, policy_version 81953 (0.0043) [2024-06-27 21:38:03,358][06909] Updated weights for policy 0, policy_version 81963 (0.0037) [2024-06-27 21:38:03,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.8, 300 sec: 43931.3). Total num frames: 1342881792. Throughput: 0: 44044.2. Samples: 1245789920. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 21:38:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:38:07,604][06909] Updated weights for policy 0, policy_version 81973 (0.0032) [2024-06-27 21:38:08,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.7, 300 sec: 43820.3). Total num frames: 1343094784. Throughput: 0: 43987.0. Samples: 1246064500. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 21:38:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:38:11,192][06909] Updated weights for policy 0, policy_version 81983 (0.0032) [2024-06-27 21:38:13,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 1343291392. Throughput: 0: 43981.0. Samples: 1246192380. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-27 21:38:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:38:15,151][06909] Updated weights for policy 0, policy_version 81993 (0.0034) [2024-06-27 21:38:18,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43692.2, 300 sec: 43820.3). Total num frames: 1343520768. Throughput: 0: 44041.9. Samples: 1246454640. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-27 21:38:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:38:18,938][06909] Updated weights for policy 0, policy_version 82003 (0.0031) [2024-06-27 21:38:22,407][06909] Updated weights for policy 0, policy_version 82013 (0.0028) [2024-06-27 21:38:23,444][06887] Signal inference workers to stop experience collection... (17750 times) [2024-06-27 21:38:23,469][06909] InferenceWorker_p0-w0: stopping experience collection (17750 times) [2024-06-27 21:38:23,505][06887] Signal inference workers to resume experience collection... (17750 times) [2024-06-27 21:38:23,506][06909] InferenceWorker_p0-w0: resuming experience collection (17750 times) [2024-06-27 21:38:23,850][06674] Fps is (10 sec: 47513.2, 60 sec: 44236.7, 300 sec: 43875.8). Total num frames: 1343766528. Throughput: 0: 43851.5. Samples: 1246718640. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-27 21:38:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 21:38:26,085][06909] Updated weights for policy 0, policy_version 82023 (0.0028) [2024-06-27 21:38:28,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43963.7, 300 sec: 43820.5). Total num frames: 1343963136. Throughput: 0: 43991.9. Samples: 1246853800. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-27 21:38:28,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:38:30,101][06909] Updated weights for policy 0, policy_version 82033 (0.0033) [2024-06-27 21:38:33,566][06909] Updated weights for policy 0, policy_version 82043 (0.0023) [2024-06-27 21:38:33,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.8, 300 sec: 43931.3). Total num frames: 1344192512. Throughput: 0: 44070.6. Samples: 1247112040. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-27 21:38:33,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:38:37,280][06909] Updated weights for policy 0, policy_version 82053 (0.0039) [2024-06-27 21:38:38,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 1344421888. Throughput: 0: 43949.8. Samples: 1247379300. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-27 21:38:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:38:40,819][06909] Updated weights for policy 0, policy_version 82063 (0.0033) [2024-06-27 21:38:43,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.8, 300 sec: 43820.3). Total num frames: 1344618496. Throughput: 0: 44214.3. Samples: 1247514060. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-27 21:38:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 21:38:44,816][06909] Updated weights for policy 0, policy_version 82073 (0.0037) [2024-06-27 21:38:48,441][06909] Updated weights for policy 0, policy_version 82083 (0.0040) [2024-06-27 21:38:48,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43964.3, 300 sec: 43931.3). Total num frames: 1344847872. Throughput: 0: 44121.8. Samples: 1247775400. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-27 21:38:48,854][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:38:52,417][06909] Updated weights for policy 0, policy_version 82093 (0.0026) [2024-06-27 21:38:53,850][06674] Fps is (10 sec: 47513.2, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 1345093632. Throughput: 0: 43815.2. Samples: 1248036180. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-27 21:38:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:38:55,990][06909] Updated weights for policy 0, policy_version 82103 (0.0035) [2024-06-27 21:38:58,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.8, 300 sec: 43875.8). Total num frames: 1345290240. Throughput: 0: 44086.5. Samples: 1248176280. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-27 21:38:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:38:59,709][06909] Updated weights for policy 0, policy_version 82113 (0.0023) [2024-06-27 21:39:03,235][06909] Updated weights for policy 0, policy_version 82123 (0.0032) [2024-06-27 21:39:03,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1345519616. Throughput: 0: 44078.2. Samples: 1248438160. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 21:39:03,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 21:39:07,355][06909] Updated weights for policy 0, policy_version 82133 (0.0028) [2024-06-27 21:39:08,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 1345748992. Throughput: 0: 43954.6. Samples: 1248696600. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 21:39:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 21:39:10,849][06909] Updated weights for policy 0, policy_version 82143 (0.0034) [2024-06-27 21:39:13,852][06674] Fps is (10 sec: 42589.9, 60 sec: 44235.3, 300 sec: 43875.8). Total num frames: 1345945600. Throughput: 0: 43881.7. Samples: 1248828560. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 21:39:13,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 21:39:14,619][06909] Updated weights for policy 0, policy_version 82153 (0.0026) [2024-06-27 21:39:18,204][06909] Updated weights for policy 0, policy_version 82163 (0.0035) [2024-06-27 21:39:18,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 1346191360. Throughput: 0: 44105.2. Samples: 1249096780. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 21:39:18,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:39:22,330][06909] Updated weights for policy 0, policy_version 82173 (0.0030) [2024-06-27 21:39:23,850][06674] Fps is (10 sec: 45884.5, 60 sec: 43963.8, 300 sec: 43931.4). Total num frames: 1346404352. Throughput: 0: 43818.7. Samples: 1249351140. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 21:39:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:39:25,738][06909] Updated weights for policy 0, policy_version 82183 (0.0031) [2024-06-27 21:39:28,850][06674] Fps is (10 sec: 42599.0, 60 sec: 44236.9, 300 sec: 43931.6). Total num frames: 1346617344. Throughput: 0: 43946.2. Samples: 1249491640. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 21:39:28,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:39:29,587][06909] Updated weights for policy 0, policy_version 82193 (0.0032) [2024-06-27 21:39:33,245][06909] Updated weights for policy 0, policy_version 82203 (0.0026) [2024-06-27 21:39:33,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 1346813952. Throughput: 0: 44072.1. Samples: 1249758640. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 21:39:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:39:36,799][06909] Updated weights for policy 0, policy_version 82213 (0.0036) [2024-06-27 21:39:38,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 1347076096. Throughput: 0: 44040.1. Samples: 1250017980. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 21:39:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 21:39:40,630][06909] Updated weights for policy 0, policy_version 82223 (0.0031) [2024-06-27 21:39:43,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.8, 300 sec: 43875.8). Total num frames: 1347272704. Throughput: 0: 43871.6. Samples: 1250150500. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 21:39:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:39:44,768][06887] Signal inference workers to stop experience collection... (17800 times) [2024-06-27 21:39:44,772][06909] Updated weights for policy 0, policy_version 82233 (0.0029) [2024-06-27 21:39:44,776][06887] Signal inference workers to resume experience collection... (17800 times) [2024-06-27 21:39:44,782][06909] InferenceWorker_p0-w0: stopping experience collection (17800 times) [2024-06-27 21:39:44,797][06909] InferenceWorker_p0-w0: resuming experience collection (17800 times) [2024-06-27 21:39:48,372][06909] Updated weights for policy 0, policy_version 82243 (0.0036) [2024-06-27 21:39:48,856][06674] Fps is (10 sec: 42572.5, 60 sec: 44232.4, 300 sec: 44041.5). Total num frames: 1347502080. Throughput: 0: 43901.2. Samples: 1250413980. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-27 21:39:48,857][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:39:48,870][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000082245_1347502080.pth... [2024-06-27 21:39:48,918][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000081602_1336967168.pth [2024-06-27 21:39:52,135][06909] Updated weights for policy 0, policy_version 82253 (0.0028) [2024-06-27 21:39:53,856][06674] Fps is (10 sec: 45847.5, 60 sec: 43959.3, 300 sec: 43986.0). Total num frames: 1347731456. Throughput: 0: 43848.9. Samples: 1250670060. Policy #0 lag: (min: 0.0, avg: 13.0, max: 26.0) [2024-06-27 21:39:53,856][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:39:55,745][06909] Updated weights for policy 0, policy_version 82263 (0.0034) [2024-06-27 21:39:58,850][06674] Fps is (10 sec: 40984.8, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 1347911680. Throughput: 0: 43927.7. Samples: 1250805220. Policy #0 lag: (min: 0.0, avg: 13.0, max: 26.0) [2024-06-27 21:39:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:39:59,474][06909] Updated weights for policy 0, policy_version 82273 (0.0045) [2024-06-27 21:40:03,399][06909] Updated weights for policy 0, policy_version 82283 (0.0024) [2024-06-27 21:40:03,850][06674] Fps is (10 sec: 40985.2, 60 sec: 43690.8, 300 sec: 43931.4). Total num frames: 1348141056. Throughput: 0: 43881.1. Samples: 1251071420. Policy #0 lag: (min: 0.0, avg: 13.0, max: 26.0) [2024-06-27 21:40:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 21:40:06,773][06909] Updated weights for policy 0, policy_version 82293 (0.0024) [2024-06-27 21:40:08,850][06674] Fps is (10 sec: 47513.4, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 1348386816. Throughput: 0: 43899.0. Samples: 1251326600. Policy #0 lag: (min: 0.0, avg: 13.0, max: 26.0) [2024-06-27 21:40:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:40:11,079][06909] Updated weights for policy 0, policy_version 82303 (0.0024) [2024-06-27 21:40:13,856][06674] Fps is (10 sec: 42572.4, 60 sec: 43687.7, 300 sec: 43819.7). Total num frames: 1348567040. Throughput: 0: 43793.7. Samples: 1251462620. Policy #0 lag: (min: 0.0, avg: 13.0, max: 26.0) [2024-06-27 21:40:13,856][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:40:14,170][06909] Updated weights for policy 0, policy_version 82313 (0.0029) [2024-06-27 21:40:18,312][06909] Updated weights for policy 0, policy_version 82323 (0.0033) [2024-06-27 21:40:18,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43690.8, 300 sec: 44042.4). Total num frames: 1348812800. Throughput: 0: 43761.8. Samples: 1251727920. Policy #0 lag: (min: 0.0, avg: 13.0, max: 26.0) [2024-06-27 21:40:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:40:21,594][06909] Updated weights for policy 0, policy_version 82333 (0.0022) [2024-06-27 21:40:23,850][06674] Fps is (10 sec: 45902.9, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 1349025792. Throughput: 0: 43602.2. Samples: 1251980080. Policy #0 lag: (min: 0.0, avg: 13.0, max: 26.0) [2024-06-27 21:40:23,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:40:25,743][06909] Updated weights for policy 0, policy_version 82343 (0.0024) [2024-06-27 21:40:28,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43690.6, 300 sec: 43821.2). Total num frames: 1349238784. Throughput: 0: 43563.0. Samples: 1252110840. Policy #0 lag: (min: 0.0, avg: 13.0, max: 26.0) [2024-06-27 21:40:28,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:40:29,296][06909] Updated weights for policy 0, policy_version 82353 (0.0029) [2024-06-27 21:40:33,242][06909] Updated weights for policy 0, policy_version 82363 (0.0037) [2024-06-27 21:40:33,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 1349451776. Throughput: 0: 43669.0. Samples: 1252378820. Policy #0 lag: (min: 0.0, avg: 13.0, max: 26.0) [2024-06-27 21:40:33,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:40:36,751][06909] Updated weights for policy 0, policy_version 82373 (0.0039) [2024-06-27 21:40:38,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43417.5, 300 sec: 43875.8). Total num frames: 1349681152. Throughput: 0: 43624.4. Samples: 1252632900. Policy #0 lag: (min: 0.0, avg: 13.0, max: 26.0) [2024-06-27 21:40:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:40:40,983][06909] Updated weights for policy 0, policy_version 82383 (0.0036) [2024-06-27 21:40:43,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 1349894144. Throughput: 0: 43709.9. Samples: 1252772160. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 21:40:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:40:44,243][06909] Updated weights for policy 0, policy_version 82393 (0.0033) [2024-06-27 21:40:48,395][06909] Updated weights for policy 0, policy_version 82403 (0.0037) [2024-06-27 21:40:48,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43148.9, 300 sec: 43875.8). Total num frames: 1350090752. Throughput: 0: 43677.3. Samples: 1253036900. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 21:40:48,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:40:51,632][06909] Updated weights for policy 0, policy_version 82413 (0.0037) [2024-06-27 21:40:53,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43422.0, 300 sec: 43875.9). Total num frames: 1350336512. Throughput: 0: 43642.3. Samples: 1253290500. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 21:40:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:40:56,032][06909] Updated weights for policy 0, policy_version 82423 (0.0023) [2024-06-27 21:40:58,463][06887] Signal inference workers to stop experience collection... (17850 times) [2024-06-27 21:40:58,514][06887] Signal inference workers to resume experience collection... (17850 times) [2024-06-27 21:40:58,514][06909] InferenceWorker_p0-w0: stopping experience collection (17850 times) [2024-06-27 21:40:58,531][06909] InferenceWorker_p0-w0: resuming experience collection (17850 times) [2024-06-27 21:40:58,850][06674] Fps is (10 sec: 47513.1, 60 sec: 44236.8, 300 sec: 43875.8). Total num frames: 1350565888. Throughput: 0: 43628.0. Samples: 1253425620. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 21:40:58,853][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:40:59,045][06909] Updated weights for policy 0, policy_version 82433 (0.0033) [2024-06-27 21:41:03,329][06909] Updated weights for policy 0, policy_version 82443 (0.0028) [2024-06-27 21:41:03,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1350778880. Throughput: 0: 43684.9. Samples: 1253693740. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 21:41:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:41:06,772][06909] Updated weights for policy 0, policy_version 82453 (0.0036) [2024-06-27 21:41:08,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43417.5, 300 sec: 43875.8). Total num frames: 1350991872. Throughput: 0: 43665.6. Samples: 1253945040. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 21:41:08,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 21:41:10,522][06909] Updated weights for policy 0, policy_version 82463 (0.0030) [2024-06-27 21:41:13,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43968.1, 300 sec: 43820.2). Total num frames: 1351204864. Throughput: 0: 43814.2. Samples: 1254082480. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 21:41:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 21:41:14,363][06909] Updated weights for policy 0, policy_version 82473 (0.0036) [2024-06-27 21:41:18,158][06909] Updated weights for policy 0, policy_version 82483 (0.0029) [2024-06-27 21:41:18,850][06674] Fps is (10 sec: 42599.2, 60 sec: 43417.6, 300 sec: 43931.3). Total num frames: 1351417856. Throughput: 0: 43817.3. Samples: 1254350600. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 21:41:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:41:21,548][06909] Updated weights for policy 0, policy_version 82493 (0.0036) [2024-06-27 21:41:23,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 1351647232. Throughput: 0: 43976.1. Samples: 1254611820. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 21:41:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:41:25,477][06909] Updated weights for policy 0, policy_version 82503 (0.0027) [2024-06-27 21:41:28,829][06909] Updated weights for policy 0, policy_version 82513 (0.0039) [2024-06-27 21:41:28,852][06674] Fps is (10 sec: 47503.7, 60 sec: 44235.3, 300 sec: 43931.0). Total num frames: 1351892992. Throughput: 0: 44040.6. Samples: 1254754080. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 21:41:28,853][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:41:33,002][06909] Updated weights for policy 0, policy_version 82523 (0.0037) [2024-06-27 21:41:33,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43690.6, 300 sec: 43820.3). Total num frames: 1352073216. Throughput: 0: 43910.2. Samples: 1255012860. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 21:41:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 21:41:36,389][06909] Updated weights for policy 0, policy_version 82533 (0.0032) [2024-06-27 21:41:38,850][06674] Fps is (10 sec: 40968.8, 60 sec: 43690.8, 300 sec: 43875.8). Total num frames: 1352302592. Throughput: 0: 44043.6. Samples: 1255272460. Policy #0 lag: (min: 0.0, avg: 9.5, max: 24.0) [2024-06-27 21:41:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:41:40,534][06909] Updated weights for policy 0, policy_version 82543 (0.0026) [2024-06-27 21:41:43,670][06909] Updated weights for policy 0, policy_version 82553 (0.0034) [2024-06-27 21:41:43,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44236.7, 300 sec: 43931.3). Total num frames: 1352548352. Throughput: 0: 44160.0. Samples: 1255412820. Policy #0 lag: (min: 0.0, avg: 9.5, max: 24.0) [2024-06-27 21:41:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:41:47,975][06909] Updated weights for policy 0, policy_version 82563 (0.0032) [2024-06-27 21:41:48,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.8, 300 sec: 43875.8). Total num frames: 1352744960. Throughput: 0: 43887.1. Samples: 1255668660. Policy #0 lag: (min: 0.0, avg: 9.5, max: 24.0) [2024-06-27 21:41:48,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 21:41:48,866][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000082565_1352744960.pth... [2024-06-27 21:41:48,927][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000081923_1342226432.pth [2024-06-27 21:41:51,514][06909] Updated weights for policy 0, policy_version 82573 (0.0037) [2024-06-27 21:41:53,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 1352974336. Throughput: 0: 44002.0. Samples: 1255925120. Policy #0 lag: (min: 0.0, avg: 9.5, max: 24.0) [2024-06-27 21:41:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:41:55,641][06909] Updated weights for policy 0, policy_version 82583 (0.0028) [2024-06-27 21:41:58,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 1353187328. Throughput: 0: 43976.9. Samples: 1256061440. Policy #0 lag: (min: 0.0, avg: 9.5, max: 24.0) [2024-06-27 21:41:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:41:58,895][06909] Updated weights for policy 0, policy_version 82593 (0.0042) [2024-06-27 21:42:02,970][06909] Updated weights for policy 0, policy_version 82603 (0.0034) [2024-06-27 21:42:03,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43417.6, 300 sec: 43820.3). Total num frames: 1353383936. Throughput: 0: 43884.9. Samples: 1256325420. Policy #0 lag: (min: 0.0, avg: 9.5, max: 24.0) [2024-06-27 21:42:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 21:42:06,377][06909] Updated weights for policy 0, policy_version 82613 (0.0030) [2024-06-27 21:42:08,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.8, 300 sec: 43875.8). Total num frames: 1353613312. Throughput: 0: 43852.3. Samples: 1256585180. Policy #0 lag: (min: 0.0, avg: 9.5, max: 24.0) [2024-06-27 21:42:08,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:42:10,232][06909] Updated weights for policy 0, policy_version 82623 (0.0036) [2024-06-27 21:42:12,968][06887] Signal inference workers to stop experience collection... (17900 times) [2024-06-27 21:42:12,972][06887] Signal inference workers to resume experience collection... (17900 times) [2024-06-27 21:42:13,022][06909] InferenceWorker_p0-w0: stopping experience collection (17900 times) [2024-06-27 21:42:13,022][06909] InferenceWorker_p0-w0: resuming experience collection (17900 times) [2024-06-27 21:42:13,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.8, 300 sec: 43876.1). Total num frames: 1353842688. Throughput: 0: 43670.9. Samples: 1256719180. Policy #0 lag: (min: 0.0, avg: 9.5, max: 24.0) [2024-06-27 21:42:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:42:14,073][06909] Updated weights for policy 0, policy_version 82633 (0.0043) [2024-06-27 21:42:17,767][06909] Updated weights for policy 0, policy_version 82643 (0.0030) [2024-06-27 21:42:18,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 1354039296. Throughput: 0: 43763.6. Samples: 1256982220. Policy #0 lag: (min: 0.0, avg: 9.5, max: 24.0) [2024-06-27 21:42:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:42:21,362][06909] Updated weights for policy 0, policy_version 82653 (0.0040) [2024-06-27 21:42:23,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 1354268672. Throughput: 0: 43779.4. Samples: 1257242540. Policy #0 lag: (min: 0.0, avg: 9.5, max: 24.0) [2024-06-27 21:42:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:42:25,261][06909] Updated weights for policy 0, policy_version 82663 (0.0035) [2024-06-27 21:42:28,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43419.1, 300 sec: 43820.3). Total num frames: 1354498048. Throughput: 0: 43664.5. Samples: 1257377720. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 21:42:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 21:42:28,866][06909] Updated weights for policy 0, policy_version 82673 (0.0022) [2024-06-27 21:42:33,048][06909] Updated weights for policy 0, policy_version 82683 (0.0035) [2024-06-27 21:42:33,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 43820.2). Total num frames: 1354711040. Throughput: 0: 43677.3. Samples: 1257634140. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 21:42:33,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:42:36,289][06909] Updated weights for policy 0, policy_version 82693 (0.0036) [2024-06-27 21:42:38,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 1354924032. Throughput: 0: 43862.2. Samples: 1257898920. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 21:42:38,856][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:42:40,237][06909] Updated weights for policy 0, policy_version 82703 (0.0025) [2024-06-27 21:42:43,846][06909] Updated weights for policy 0, policy_version 82713 (0.0033) [2024-06-27 21:42:43,850][06674] Fps is (10 sec: 45876.0, 60 sec: 43690.8, 300 sec: 43931.5). Total num frames: 1355169792. Throughput: 0: 43829.1. Samples: 1258033740. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 21:42:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 21:42:47,592][06909] Updated weights for policy 0, policy_version 82723 (0.0023) [2024-06-27 21:42:48,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.6, 300 sec: 43820.3). Total num frames: 1355366400. Throughput: 0: 43847.9. Samples: 1258298580. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 21:42:48,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:42:51,534][06909] Updated weights for policy 0, policy_version 82733 (0.0031) [2024-06-27 21:42:53,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43417.6, 300 sec: 43875.8). Total num frames: 1355579392. Throughput: 0: 43766.7. Samples: 1258554680. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 21:42:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:42:55,321][06909] Updated weights for policy 0, policy_version 82743 (0.0030) [2024-06-27 21:42:58,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.7, 300 sec: 43820.2). Total num frames: 1355808768. Throughput: 0: 43938.5. Samples: 1258696420. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 21:42:58,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 21:42:58,880][06909] Updated weights for policy 0, policy_version 82753 (0.0033) [2024-06-27 21:43:03,142][06909] Updated weights for policy 0, policy_version 82763 (0.0036) [2024-06-27 21:43:03,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 1356005376. Throughput: 0: 43740.9. Samples: 1258950560. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 21:43:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:43:06,106][06909] Updated weights for policy 0, policy_version 82773 (0.0036) [2024-06-27 21:43:08,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 1356234752. Throughput: 0: 43751.1. Samples: 1259211340. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 21:43:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:43:10,558][06909] Updated weights for policy 0, policy_version 82783 (0.0036) [2024-06-27 21:43:13,759][06909] Updated weights for policy 0, policy_version 82793 (0.0025) [2024-06-27 21:43:13,850][06674] Fps is (10 sec: 47513.7, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 1356480512. Throughput: 0: 43746.6. Samples: 1259346320. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 21:43:13,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:43:17,954][06909] Updated weights for policy 0, policy_version 82803 (0.0027) [2024-06-27 21:43:18,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 43764.7). Total num frames: 1356677120. Throughput: 0: 43939.1. Samples: 1259611400. Policy #0 lag: (min: 0.0, avg: 11.9, max: 21.0) [2024-06-27 21:43:18,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:43:21,211][06909] Updated weights for policy 0, policy_version 82813 (0.0032) [2024-06-27 21:43:23,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 1356890112. Throughput: 0: 43926.8. Samples: 1259875620. Policy #0 lag: (min: 0.0, avg: 11.9, max: 21.0) [2024-06-27 21:43:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:43:25,185][06909] Updated weights for policy 0, policy_version 82823 (0.0030) [2024-06-27 21:43:28,607][06909] Updated weights for policy 0, policy_version 82833 (0.0037) [2024-06-27 21:43:28,850][06674] Fps is (10 sec: 47514.3, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 1357152256. Throughput: 0: 43959.5. Samples: 1260011920. Policy #0 lag: (min: 0.0, avg: 11.9, max: 21.0) [2024-06-27 21:43:28,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:43:32,916][06909] Updated weights for policy 0, policy_version 82843 (0.0048) [2024-06-27 21:43:33,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43417.6, 300 sec: 43709.2). Total num frames: 1357316096. Throughput: 0: 43782.7. Samples: 1260268800. Policy #0 lag: (min: 0.0, avg: 11.9, max: 21.0) [2024-06-27 21:43:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 21:43:35,990][06909] Updated weights for policy 0, policy_version 82853 (0.0036) [2024-06-27 21:43:38,850][06674] Fps is (10 sec: 39321.4, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 1357545472. Throughput: 0: 43896.9. Samples: 1260530040. Policy #0 lag: (min: 0.0, avg: 11.9, max: 21.0) [2024-06-27 21:43:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 21:43:39,104][06887] Signal inference workers to stop experience collection... (17950 times) [2024-06-27 21:43:39,105][06887] Signal inference workers to resume experience collection... (17950 times) [2024-06-27 21:43:39,120][06909] InferenceWorker_p0-w0: stopping experience collection (17950 times) [2024-06-27 21:43:39,120][06909] InferenceWorker_p0-w0: resuming experience collection (17950 times) [2024-06-27 21:43:40,695][06909] Updated weights for policy 0, policy_version 82863 (0.0037) [2024-06-27 21:43:43,509][06909] Updated weights for policy 0, policy_version 82873 (0.0036) [2024-06-27 21:43:43,850][06674] Fps is (10 sec: 49152.2, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 1357807616. Throughput: 0: 43708.6. Samples: 1260663300. Policy #0 lag: (min: 0.0, avg: 11.9, max: 21.0) [2024-06-27 21:43:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:43:47,980][06909] Updated weights for policy 0, policy_version 82883 (0.0029) [2024-06-27 21:43:48,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 1357987840. Throughput: 0: 43759.9. Samples: 1260919760. Policy #0 lag: (min: 0.0, avg: 11.9, max: 21.0) [2024-06-27 21:43:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 21:43:48,865][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000082885_1357987840.pth... [2024-06-27 21:43:48,914][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000082245_1347502080.pth [2024-06-27 21:43:50,844][06909] Updated weights for policy 0, policy_version 82893 (0.0031) [2024-06-27 21:43:53,850][06674] Fps is (10 sec: 39321.3, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 1358200832. Throughput: 0: 43886.2. Samples: 1261186220. Policy #0 lag: (min: 0.0, avg: 11.9, max: 21.0) [2024-06-27 21:43:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:43:55,543][06909] Updated weights for policy 0, policy_version 82903 (0.0026) [2024-06-27 21:43:58,507][06909] Updated weights for policy 0, policy_version 82913 (0.0031) [2024-06-27 21:43:58,850][06674] Fps is (10 sec: 47513.8, 60 sec: 44236.8, 300 sec: 43875.8). Total num frames: 1358462976. Throughput: 0: 43762.6. Samples: 1261315640. Policy #0 lag: (min: 0.0, avg: 11.9, max: 21.0) [2024-06-27 21:43:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:44:02,558][06909] Updated weights for policy 0, policy_version 82923 (0.0028) [2024-06-27 21:44:03,850][06674] Fps is (10 sec: 45875.7, 60 sec: 44236.8, 300 sec: 43764.7). Total num frames: 1358659584. Throughput: 0: 43923.7. Samples: 1261587960. Policy #0 lag: (min: 0.0, avg: 11.9, max: 21.0) [2024-06-27 21:44:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:44:05,763][06909] Updated weights for policy 0, policy_version 82933 (0.0037) [2024-06-27 21:44:08,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43963.8, 300 sec: 43820.6). Total num frames: 1358872576. Throughput: 0: 44140.4. Samples: 1261861940. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 21:44:08,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:44:09,758][06909] Updated weights for policy 0, policy_version 82943 (0.0033) [2024-06-27 21:44:13,173][06909] Updated weights for policy 0, policy_version 82953 (0.0031) [2024-06-27 21:44:13,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44236.8, 300 sec: 43875.8). Total num frames: 1359134720. Throughput: 0: 43947.1. Samples: 1261989540. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 21:44:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:44:17,787][06909] Updated weights for policy 0, policy_version 82963 (0.0051) [2024-06-27 21:44:18,850][06674] Fps is (10 sec: 45874.5, 60 sec: 44236.7, 300 sec: 43820.2). Total num frames: 1359331328. Throughput: 0: 44045.2. Samples: 1262250840. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 21:44:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:44:20,493][06909] Updated weights for policy 0, policy_version 82973 (0.0037) [2024-06-27 21:44:23,850][06674] Fps is (10 sec: 37683.1, 60 sec: 43690.7, 300 sec: 43709.2). Total num frames: 1359511552. Throughput: 0: 44204.5. Samples: 1262519240. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 21:44:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:44:25,174][06909] Updated weights for policy 0, policy_version 82983 (0.0025) [2024-06-27 21:44:27,761][06909] Updated weights for policy 0, policy_version 82993 (0.0027) [2024-06-27 21:44:28,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 1359790080. Throughput: 0: 44008.8. Samples: 1262643700. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 21:44:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:44:32,551][06909] Updated weights for policy 0, policy_version 83003 (0.0038) [2024-06-27 21:44:33,850][06674] Fps is (10 sec: 47513.8, 60 sec: 44509.9, 300 sec: 43764.7). Total num frames: 1359986688. Throughput: 0: 44225.1. Samples: 1262909880. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 21:44:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 21:44:35,469][06909] Updated weights for policy 0, policy_version 83013 (0.0031) [2024-06-27 21:44:38,850][06674] Fps is (10 sec: 39321.9, 60 sec: 43963.7, 300 sec: 43764.7). Total num frames: 1360183296. Throughput: 0: 44303.2. Samples: 1263179860. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 21:44:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:44:39,778][06909] Updated weights for policy 0, policy_version 83023 (0.0030) [2024-06-27 21:44:42,819][06909] Updated weights for policy 0, policy_version 83033 (0.0040) [2024-06-27 21:44:43,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.7, 300 sec: 43876.7). Total num frames: 1360445440. Throughput: 0: 44166.7. Samples: 1263303140. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 21:44:43,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:44:47,187][06909] Updated weights for policy 0, policy_version 83043 (0.0023) [2024-06-27 21:44:48,850][06674] Fps is (10 sec: 45875.8, 60 sec: 44237.0, 300 sec: 43765.6). Total num frames: 1360642048. Throughput: 0: 44109.8. Samples: 1263572900. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 21:44:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:44:50,286][06909] Updated weights for policy 0, policy_version 83053 (0.0040) [2024-06-27 21:44:53,850][06674] Fps is (10 sec: 40960.1, 60 sec: 44236.9, 300 sec: 43875.8). Total num frames: 1360855040. Throughput: 0: 43864.9. Samples: 1263835860. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 21:44:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:44:55,110][06909] Updated weights for policy 0, policy_version 83063 (0.0026) [2024-06-27 21:44:57,693][06909] Updated weights for policy 0, policy_version 83073 (0.0036) [2024-06-27 21:44:58,104][06887] Signal inference workers to stop experience collection... (18000 times) [2024-06-27 21:44:58,104][06887] Signal inference workers to resume experience collection... (18000 times) [2024-06-27 21:44:58,135][06909] InferenceWorker_p0-w0: stopping experience collection (18000 times) [2024-06-27 21:44:58,135][06909] InferenceWorker_p0-w0: resuming experience collection (18000 times) [2024-06-27 21:44:58,850][06674] Fps is (10 sec: 45874.1, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 1361100800. Throughput: 0: 43808.3. Samples: 1263960920. Policy #0 lag: (min: 0.0, avg: 8.5, max: 22.0) [2024-06-27 21:44:58,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:45:02,602][06909] Updated weights for policy 0, policy_version 83083 (0.0030) [2024-06-27 21:45:03,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44509.9, 300 sec: 43875.8). Total num frames: 1361330176. Throughput: 0: 44259.7. Samples: 1264242520. Policy #0 lag: (min: 0.0, avg: 8.5, max: 22.0) [2024-06-27 21:45:03,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 21:45:05,113][06909] Updated weights for policy 0, policy_version 83093 (0.0036) [2024-06-27 21:45:08,852][06674] Fps is (10 sec: 40952.1, 60 sec: 43962.2, 300 sec: 43876.4). Total num frames: 1361510400. Throughput: 0: 44157.1. Samples: 1264506400. Policy #0 lag: (min: 0.0, avg: 8.5, max: 22.0) [2024-06-27 21:45:08,852][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:45:09,800][06909] Updated weights for policy 0, policy_version 83103 (0.0031) [2024-06-27 21:45:12,397][06909] Updated weights for policy 0, policy_version 83113 (0.0027) [2024-06-27 21:45:13,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 1361756160. Throughput: 0: 43967.7. Samples: 1264622240. Policy #0 lag: (min: 0.0, avg: 8.5, max: 22.0) [2024-06-27 21:45:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:45:17,086][06909] Updated weights for policy 0, policy_version 83123 (0.0035) [2024-06-27 21:45:18,850][06674] Fps is (10 sec: 49161.3, 60 sec: 44509.9, 300 sec: 43986.9). Total num frames: 1362001920. Throughput: 0: 44292.2. Samples: 1264903040. Policy #0 lag: (min: 0.0, avg: 8.5, max: 22.0) [2024-06-27 21:45:18,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 21:45:19,838][06909] Updated weights for policy 0, policy_version 83133 (0.0037) [2024-06-27 21:45:23,850][06674] Fps is (10 sec: 42598.2, 60 sec: 44509.9, 300 sec: 43875.8). Total num frames: 1362182144. Throughput: 0: 44102.7. Samples: 1265164480. Policy #0 lag: (min: 0.0, avg: 8.5, max: 22.0) [2024-06-27 21:45:23,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 21:45:24,824][06909] Updated weights for policy 0, policy_version 83143 (0.0020) [2024-06-27 21:45:27,454][06909] Updated weights for policy 0, policy_version 83153 (0.0037) [2024-06-27 21:45:28,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 1362411520. Throughput: 0: 44149.3. Samples: 1265289860. Policy #0 lag: (min: 0.0, avg: 8.5, max: 22.0) [2024-06-27 21:45:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:45:32,330][06909] Updated weights for policy 0, policy_version 83163 (0.0031) [2024-06-27 21:45:33,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44236.7, 300 sec: 43931.3). Total num frames: 1362640896. Throughput: 0: 44050.1. Samples: 1265555160. Policy #0 lag: (min: 0.0, avg: 8.5, max: 22.0) [2024-06-27 21:45:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:45:34,759][06909] Updated weights for policy 0, policy_version 83173 (0.0026) [2024-06-27 21:45:38,850][06674] Fps is (10 sec: 42598.8, 60 sec: 44236.9, 300 sec: 43875.8). Total num frames: 1362837504. Throughput: 0: 44104.9. Samples: 1265820580. Policy #0 lag: (min: 0.0, avg: 8.5, max: 22.0) [2024-06-27 21:45:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 21:45:39,785][06909] Updated weights for policy 0, policy_version 83183 (0.0029) [2024-06-27 21:45:42,049][06909] Updated weights for policy 0, policy_version 83193 (0.0031) [2024-06-27 21:45:43,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1363066880. Throughput: 0: 44047.8. Samples: 1265943060. Policy #0 lag: (min: 0.0, avg: 8.5, max: 22.0) [2024-06-27 21:45:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:45:47,043][06909] Updated weights for policy 0, policy_version 83203 (0.0025) [2024-06-27 21:45:48,850][06674] Fps is (10 sec: 47512.6, 60 sec: 44509.7, 300 sec: 43986.8). Total num frames: 1363312640. Throughput: 0: 43832.7. Samples: 1266215000. Policy #0 lag: (min: 1.0, avg: 7.9, max: 21.0) [2024-06-27 21:45:48,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-27 21:45:48,860][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000083210_1363312640.pth... [2024-06-27 21:45:48,919][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000082565_1352744960.pth [2024-06-27 21:45:49,588][06909] Updated weights for policy 0, policy_version 83213 (0.0033) [2024-06-27 21:45:53,850][06674] Fps is (10 sec: 40959.2, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 1363476480. Throughput: 0: 43742.8. Samples: 1266474740. Policy #0 lag: (min: 1.0, avg: 7.9, max: 21.0) [2024-06-27 21:45:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:45:54,568][06909] Updated weights for policy 0, policy_version 83223 (0.0023) [2024-06-27 21:45:57,144][06909] Updated weights for policy 0, policy_version 83233 (0.0026) [2024-06-27 21:45:58,850][06674] Fps is (10 sec: 40960.6, 60 sec: 43690.8, 300 sec: 43875.8). Total num frames: 1363722240. Throughput: 0: 43911.5. Samples: 1266598260. Policy #0 lag: (min: 1.0, avg: 7.9, max: 21.0) [2024-06-27 21:45:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:46:02,260][06909] Updated weights for policy 0, policy_version 83243 (0.0033) [2024-06-27 21:46:03,850][06674] Fps is (10 sec: 47514.4, 60 sec: 43690.7, 300 sec: 43931.4). Total num frames: 1363951616. Throughput: 0: 43736.2. Samples: 1266871160. Policy #0 lag: (min: 1.0, avg: 7.9, max: 21.0) [2024-06-27 21:46:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:46:04,617][06909] Updated weights for policy 0, policy_version 83253 (0.0026) [2024-06-27 21:46:08,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43965.1, 300 sec: 43875.8). Total num frames: 1364148224. Throughput: 0: 43737.6. Samples: 1267132680. Policy #0 lag: (min: 1.0, avg: 7.9, max: 21.0) [2024-06-27 21:46:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:46:09,543][06909] Updated weights for policy 0, policy_version 83263 (0.0031) [2024-06-27 21:46:11,915][06909] Updated weights for policy 0, policy_version 83273 (0.0024) [2024-06-27 21:46:13,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1364393984. Throughput: 0: 43795.6. Samples: 1267260660. Policy #0 lag: (min: 1.0, avg: 7.9, max: 21.0) [2024-06-27 21:46:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 21:46:16,757][06909] Updated weights for policy 0, policy_version 83283 (0.0032) [2024-06-27 21:46:18,040][06887] Signal inference workers to stop experience collection... (18050 times) [2024-06-27 21:46:18,098][06909] InferenceWorker_p0-w0: stopping experience collection (18050 times) [2024-06-27 21:46:18,099][06887] Signal inference workers to resume experience collection... (18050 times) [2024-06-27 21:46:18,112][06909] InferenceWorker_p0-w0: resuming experience collection (18050 times) [2024-06-27 21:46:18,850][06674] Fps is (10 sec: 47514.5, 60 sec: 43690.8, 300 sec: 43986.9). Total num frames: 1364623360. Throughput: 0: 44046.7. Samples: 1267537260. Policy #0 lag: (min: 1.0, avg: 7.9, max: 21.0) [2024-06-27 21:46:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:46:19,359][06909] Updated weights for policy 0, policy_version 83293 (0.0037) [2024-06-27 21:46:23,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43690.7, 300 sec: 43765.0). Total num frames: 1364803584. Throughput: 0: 43944.0. Samples: 1267798060. Policy #0 lag: (min: 1.0, avg: 7.9, max: 21.0) [2024-06-27 21:46:23,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:46:24,362][06909] Updated weights for policy 0, policy_version 83303 (0.0036) [2024-06-27 21:46:26,830][06909] Updated weights for policy 0, policy_version 83313 (0.0020) [2024-06-27 21:46:28,856][06674] Fps is (10 sec: 42572.1, 60 sec: 43959.3, 300 sec: 43986.0). Total num frames: 1365049344. Throughput: 0: 43881.9. Samples: 1267918020. Policy #0 lag: (min: 1.0, avg: 7.9, max: 21.0) [2024-06-27 21:46:28,857][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:46:31,762][06909] Updated weights for policy 0, policy_version 83323 (0.0031) [2024-06-27 21:46:33,850][06674] Fps is (10 sec: 47512.7, 60 sec: 43963.6, 300 sec: 43986.8). Total num frames: 1365278720. Throughput: 0: 43969.3. Samples: 1268193620. Policy #0 lag: (min: 1.0, avg: 7.9, max: 21.0) [2024-06-27 21:46:33,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:46:34,534][06909] Updated weights for policy 0, policy_version 83333 (0.0034) [2024-06-27 21:46:38,850][06674] Fps is (10 sec: 40984.6, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 1365458944. Throughput: 0: 44012.9. Samples: 1268455320. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 21:46:38,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 21:46:39,605][06909] Updated weights for policy 0, policy_version 83343 (0.0022) [2024-06-27 21:46:41,809][06909] Updated weights for policy 0, policy_version 83353 (0.0034) [2024-06-27 21:46:43,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.6, 300 sec: 43931.3). Total num frames: 1365704704. Throughput: 0: 43960.4. Samples: 1268576480. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 21:46:43,852][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:46:46,876][06909] Updated weights for policy 0, policy_version 83363 (0.0027) [2024-06-27 21:46:48,850][06674] Fps is (10 sec: 47514.3, 60 sec: 43690.8, 300 sec: 43931.3). Total num frames: 1365934080. Throughput: 0: 43917.7. Samples: 1268847460. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 21:46:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:46:49,316][06909] Updated weights for policy 0, policy_version 83373 (0.0029) [2024-06-27 21:46:53,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43963.8, 300 sec: 43820.3). Total num frames: 1366114304. Throughput: 0: 44027.7. Samples: 1269113920. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 21:46:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:46:54,303][06909] Updated weights for policy 0, policy_version 83383 (0.0048) [2024-06-27 21:46:56,871][06909] Updated weights for policy 0, policy_version 83393 (0.0034) [2024-06-27 21:46:58,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1366360064. Throughput: 0: 43918.7. Samples: 1269237000. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 21:46:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:47:01,729][06909] Updated weights for policy 0, policy_version 83403 (0.0032) [2024-06-27 21:47:03,851][06674] Fps is (10 sec: 49146.0, 60 sec: 44235.8, 300 sec: 44042.2). Total num frames: 1366605824. Throughput: 0: 43769.4. Samples: 1269506940. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 21:47:03,852][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:47:04,172][06909] Updated weights for policy 0, policy_version 83413 (0.0046) [2024-06-27 21:47:08,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 1366786048. Throughput: 0: 44009.2. Samples: 1269778480. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 21:47:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:47:09,227][06909] Updated weights for policy 0, policy_version 83423 (0.0035) [2024-06-27 21:47:11,858][06909] Updated weights for policy 0, policy_version 83433 (0.0028) [2024-06-27 21:47:13,850][06674] Fps is (10 sec: 42603.4, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1367031808. Throughput: 0: 44000.6. Samples: 1269897780. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 21:47:13,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:47:16,518][06909] Updated weights for policy 0, policy_version 83443 (0.0024) [2024-06-27 21:47:18,850][06674] Fps is (10 sec: 49152.8, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1367277568. Throughput: 0: 43831.8. Samples: 1270166040. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 21:47:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:47:19,292][06909] Updated weights for policy 0, policy_version 83453 (0.0036) [2024-06-27 21:47:23,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 1367441408. Throughput: 0: 43956.1. Samples: 1270433340. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 21:47:23,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:47:24,294][06909] Updated weights for policy 0, policy_version 83463 (0.0032) [2024-06-27 21:47:26,849][06909] Updated weights for policy 0, policy_version 83473 (0.0030) [2024-06-27 21:47:28,856][06674] Fps is (10 sec: 40935.8, 60 sec: 43963.9, 300 sec: 43986.0). Total num frames: 1367687168. Throughput: 0: 43988.6. Samples: 1270556220. Policy #0 lag: (min: 1.0, avg: 12.2, max: 21.0) [2024-06-27 21:47:28,856][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:47:31,571][06909] Updated weights for policy 0, policy_version 83483 (0.0045) [2024-06-27 21:47:33,850][06674] Fps is (10 sec: 47513.7, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1367916544. Throughput: 0: 44009.7. Samples: 1270827900. Policy #0 lag: (min: 1.0, avg: 12.2, max: 21.0) [2024-06-27 21:47:33,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:47:34,491][06909] Updated weights for policy 0, policy_version 83493 (0.0045) [2024-06-27 21:47:38,850][06674] Fps is (10 sec: 40984.1, 60 sec: 43963.8, 300 sec: 43820.2). Total num frames: 1368096768. Throughput: 0: 43967.2. Samples: 1271092440. Policy #0 lag: (min: 1.0, avg: 12.2, max: 21.0) [2024-06-27 21:47:38,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:47:38,895][06909] Updated weights for policy 0, policy_version 83503 (0.0036) [2024-06-27 21:47:39,733][06887] Signal inference workers to stop experience collection... (18100 times) [2024-06-27 21:47:39,734][06887] Signal inference workers to resume experience collection... (18100 times) [2024-06-27 21:47:39,764][06909] InferenceWorker_p0-w0: stopping experience collection (18100 times) [2024-06-27 21:47:39,764][06909] InferenceWorker_p0-w0: resuming experience collection (18100 times) [2024-06-27 21:47:41,687][06909] Updated weights for policy 0, policy_version 83513 (0.0026) [2024-06-27 21:47:43,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43963.9, 300 sec: 43986.9). Total num frames: 1368342528. Throughput: 0: 43962.4. Samples: 1271215300. Policy #0 lag: (min: 1.0, avg: 12.2, max: 21.0) [2024-06-27 21:47:43,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:47:46,249][06909] Updated weights for policy 0, policy_version 83523 (0.0031) [2024-06-27 21:47:48,850][06674] Fps is (10 sec: 49151.5, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 1368588288. Throughput: 0: 43947.8. Samples: 1271484540. Policy #0 lag: (min: 1.0, avg: 12.2, max: 21.0) [2024-06-27 21:47:48,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:47:48,864][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000083532_1368588288.pth... [2024-06-27 21:47:48,929][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000082885_1357987840.pth [2024-06-27 21:47:49,309][06909] Updated weights for policy 0, policy_version 83533 (0.0031) [2024-06-27 21:47:53,813][06909] Updated weights for policy 0, policy_version 83543 (0.0030) [2024-06-27 21:47:53,850][06674] Fps is (10 sec: 42598.2, 60 sec: 44236.9, 300 sec: 43931.4). Total num frames: 1368768512. Throughput: 0: 43864.2. Samples: 1271752360. Policy #0 lag: (min: 1.0, avg: 12.2, max: 21.0) [2024-06-27 21:47:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:47:57,085][06909] Updated weights for policy 0, policy_version 83553 (0.0039) [2024-06-27 21:47:58,852][06674] Fps is (10 sec: 40951.9, 60 sec: 43962.2, 300 sec: 44042.1). Total num frames: 1368997888. Throughput: 0: 43970.0. Samples: 1271876520. Policy #0 lag: (min: 1.0, avg: 12.2, max: 21.0) [2024-06-27 21:47:58,852][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:48:01,362][06909] Updated weights for policy 0, policy_version 83563 (0.0036) [2024-06-27 21:48:03,850][06674] Fps is (10 sec: 47512.8, 60 sec: 43964.6, 300 sec: 44097.9). Total num frames: 1369243648. Throughput: 0: 43765.2. Samples: 1272135480. Policy #0 lag: (min: 1.0, avg: 12.2, max: 21.0) [2024-06-27 21:48:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:48:04,508][06909] Updated weights for policy 0, policy_version 83573 (0.0031) [2024-06-27 21:48:08,819][06909] Updated weights for policy 0, policy_version 83583 (0.0019) [2024-06-27 21:48:08,850][06674] Fps is (10 sec: 42607.1, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 1369423872. Throughput: 0: 44016.0. Samples: 1272414060. Policy #0 lag: (min: 1.0, avg: 12.2, max: 21.0) [2024-06-27 21:48:08,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:48:11,828][06909] Updated weights for policy 0, policy_version 83593 (0.0029) [2024-06-27 21:48:13,850][06674] Fps is (10 sec: 42599.2, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1369669632. Throughput: 0: 43946.7. Samples: 1272533560. Policy #0 lag: (min: 1.0, avg: 12.2, max: 21.0) [2024-06-27 21:48:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 21:48:16,018][06909] Updated weights for policy 0, policy_version 83603 (0.0032) [2024-06-27 21:48:18,850][06674] Fps is (10 sec: 47513.1, 60 sec: 43690.5, 300 sec: 44097.9). Total num frames: 1369899008. Throughput: 0: 43792.4. Samples: 1272798560. Policy #0 lag: (min: 1.0, avg: 11.6, max: 24.0) [2024-06-27 21:48:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:48:19,339][06909] Updated weights for policy 0, policy_version 83613 (0.0036) [2024-06-27 21:48:23,561][06909] Updated weights for policy 0, policy_version 83623 (0.0034) [2024-06-27 21:48:23,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43963.8, 300 sec: 43820.3). Total num frames: 1370079232. Throughput: 0: 43976.9. Samples: 1273071400. Policy #0 lag: (min: 1.0, avg: 11.6, max: 24.0) [2024-06-27 21:48:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:48:26,965][06909] Updated weights for policy 0, policy_version 83633 (0.0039) [2024-06-27 21:48:28,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43694.9, 300 sec: 44042.4). Total num frames: 1370308608. Throughput: 0: 43950.9. Samples: 1273193100. Policy #0 lag: (min: 1.0, avg: 11.6, max: 24.0) [2024-06-27 21:48:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:48:30,697][06909] Updated weights for policy 0, policy_version 83643 (0.0036) [2024-06-27 21:48:33,850][06674] Fps is (10 sec: 47513.6, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 1370554368. Throughput: 0: 43795.3. Samples: 1273455320. Policy #0 lag: (min: 1.0, avg: 11.6, max: 24.0) [2024-06-27 21:48:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:48:34,373][06909] Updated weights for policy 0, policy_version 83653 (0.0041) [2024-06-27 21:48:38,489][06909] Updated weights for policy 0, policy_version 83663 (0.0037) [2024-06-27 21:48:38,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44236.8, 300 sec: 43875.8). Total num frames: 1370750976. Throughput: 0: 43917.8. Samples: 1273728660. Policy #0 lag: (min: 1.0, avg: 11.6, max: 24.0) [2024-06-27 21:48:38,858][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:48:41,911][06909] Updated weights for policy 0, policy_version 83673 (0.0033) [2024-06-27 21:48:43,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 1370980352. Throughput: 0: 43914.0. Samples: 1273852560. Policy #0 lag: (min: 1.0, avg: 11.6, max: 24.0) [2024-06-27 21:48:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:48:46,123][06909] Updated weights for policy 0, policy_version 83683 (0.0033) [2024-06-27 21:48:48,853][06674] Fps is (10 sec: 45861.0, 60 sec: 43688.5, 300 sec: 44097.5). Total num frames: 1371209728. Throughput: 0: 43986.4. Samples: 1274115000. Policy #0 lag: (min: 1.0, avg: 11.6, max: 24.0) [2024-06-27 21:48:48,853][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:48:49,146][06909] Updated weights for policy 0, policy_version 83693 (0.0025) [2024-06-27 21:48:49,571][06887] Signal inference workers to stop experience collection... (18150 times) [2024-06-27 21:48:49,617][06909] InferenceWorker_p0-w0: stopping experience collection (18150 times) [2024-06-27 21:48:49,625][06887] Signal inference workers to resume experience collection... (18150 times) [2024-06-27 21:48:49,635][06909] InferenceWorker_p0-w0: resuming experience collection (18150 times) [2024-06-27 21:48:53,358][06909] Updated weights for policy 0, policy_version 83703 (0.0022) [2024-06-27 21:48:53,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44236.6, 300 sec: 43931.3). Total num frames: 1371422720. Throughput: 0: 43879.9. Samples: 1274388660. Policy #0 lag: (min: 1.0, avg: 11.6, max: 24.0) [2024-06-27 21:48:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:48:56,427][06909] Updated weights for policy 0, policy_version 83713 (0.0031) [2024-06-27 21:48:58,850][06674] Fps is (10 sec: 42611.4, 60 sec: 43965.2, 300 sec: 43986.9). Total num frames: 1371635712. Throughput: 0: 44066.1. Samples: 1274516540. Policy #0 lag: (min: 1.0, avg: 11.6, max: 24.0) [2024-06-27 21:48:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:49:00,591][06909] Updated weights for policy 0, policy_version 83723 (0.0032) [2024-06-27 21:49:03,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1371865088. Throughput: 0: 44061.4. Samples: 1274781320. Policy #0 lag: (min: 1.0, avg: 11.6, max: 24.0) [2024-06-27 21:49:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:49:04,169][06909] Updated weights for policy 0, policy_version 83733 (0.0036) [2024-06-27 21:49:07,955][06909] Updated weights for policy 0, policy_version 83743 (0.0036) [2024-06-27 21:49:08,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 43875.8). Total num frames: 1372078080. Throughput: 0: 43937.7. Samples: 1275048600. Policy #0 lag: (min: 1.0, avg: 11.6, max: 24.0) [2024-06-27 21:49:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:49:11,595][06909] Updated weights for policy 0, policy_version 83753 (0.0029) [2024-06-27 21:49:13,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.6, 300 sec: 43931.4). Total num frames: 1372291072. Throughput: 0: 44193.4. Samples: 1275181800. Policy #0 lag: (min: 0.0, avg: 10.7, max: 25.0) [2024-06-27 21:49:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:49:15,449][06909] Updated weights for policy 0, policy_version 83763 (0.0028) [2024-06-27 21:49:18,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.8, 300 sec: 44098.0). Total num frames: 1372520448. Throughput: 0: 43939.9. Samples: 1275432620. Policy #0 lag: (min: 0.0, avg: 10.7, max: 25.0) [2024-06-27 21:49:18,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:49:19,248][06909] Updated weights for policy 0, policy_version 83773 (0.0036) [2024-06-27 21:49:23,143][06909] Updated weights for policy 0, policy_version 83783 (0.0036) [2024-06-27 21:49:23,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44509.8, 300 sec: 43931.3). Total num frames: 1372749824. Throughput: 0: 43899.5. Samples: 1275704140. Policy #0 lag: (min: 0.0, avg: 10.7, max: 25.0) [2024-06-27 21:49:23,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:49:26,633][06909] Updated weights for policy 0, policy_version 83793 (0.0031) [2024-06-27 21:49:28,852][06674] Fps is (10 sec: 42589.7, 60 sec: 43962.3, 300 sec: 43931.0). Total num frames: 1372946432. Throughput: 0: 43997.6. Samples: 1275832540. Policy #0 lag: (min: 0.0, avg: 10.7, max: 25.0) [2024-06-27 21:49:28,852][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:49:30,465][06909] Updated weights for policy 0, policy_version 83803 (0.0027) [2024-06-27 21:49:33,852][06674] Fps is (10 sec: 42589.9, 60 sec: 43689.1, 300 sec: 44042.1). Total num frames: 1373175808. Throughput: 0: 43929.9. Samples: 1276091800. Policy #0 lag: (min: 0.0, avg: 10.7, max: 25.0) [2024-06-27 21:49:33,853][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:49:33,988][06909] Updated weights for policy 0, policy_version 83813 (0.0027) [2024-06-27 21:49:37,927][06909] Updated weights for policy 0, policy_version 83823 (0.0043) [2024-06-27 21:49:38,850][06674] Fps is (10 sec: 47522.7, 60 sec: 44509.7, 300 sec: 43986.9). Total num frames: 1373421568. Throughput: 0: 43762.2. Samples: 1276357960. Policy #0 lag: (min: 0.0, avg: 10.7, max: 25.0) [2024-06-27 21:49:38,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:49:41,371][06909] Updated weights for policy 0, policy_version 83833 (0.0032) [2024-06-27 21:49:43,850][06674] Fps is (10 sec: 42606.7, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 1373601792. Throughput: 0: 43941.7. Samples: 1276493920. Policy #0 lag: (min: 0.0, avg: 10.7, max: 25.0) [2024-06-27 21:49:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:49:45,178][06909] Updated weights for policy 0, policy_version 83843 (0.0034) [2024-06-27 21:49:48,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43692.8, 300 sec: 43986.9). Total num frames: 1373831168. Throughput: 0: 43792.4. Samples: 1276751980. Policy #0 lag: (min: 0.0, avg: 10.7, max: 25.0) [2024-06-27 21:49:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:49:48,872][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000083852_1373831168.pth... [2024-06-27 21:49:48,928][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000083210_1363312640.pth [2024-06-27 21:49:49,267][06909] Updated weights for policy 0, policy_version 83853 (0.0033) [2024-06-27 21:49:52,941][06909] Updated weights for policy 0, policy_version 83863 (0.0033) [2024-06-27 21:49:53,424][06887] Signal inference workers to stop experience collection... (18200 times) [2024-06-27 21:49:53,428][06887] Signal inference workers to resume experience collection... (18200 times) [2024-06-27 21:49:53,466][06909] InferenceWorker_p0-w0: stopping experience collection (18200 times) [2024-06-27 21:49:53,467][06909] InferenceWorker_p0-w0: resuming experience collection (18200 times) [2024-06-27 21:49:53,850][06674] Fps is (10 sec: 47514.3, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 1374076928. Throughput: 0: 43839.2. Samples: 1277021360. Policy #0 lag: (min: 0.0, avg: 10.7, max: 25.0) [2024-06-27 21:49:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:49:56,774][06909] Updated weights for policy 0, policy_version 83873 (0.0034) [2024-06-27 21:49:58,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.6, 300 sec: 43820.2). Total num frames: 1374257152. Throughput: 0: 43875.9. Samples: 1277156220. Policy #0 lag: (min: 0.0, avg: 10.7, max: 25.0) [2024-06-27 21:49:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:50:00,212][06909] Updated weights for policy 0, policy_version 83883 (0.0029) [2024-06-27 21:50:03,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.7, 300 sec: 43987.2). Total num frames: 1374486528. Throughput: 0: 44080.0. Samples: 1277416220. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-27 21:50:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:50:03,958][06909] Updated weights for policy 0, policy_version 83893 (0.0032) [2024-06-27 21:50:07,518][06909] Updated weights for policy 0, policy_version 83903 (0.0026) [2024-06-27 21:50:08,850][06674] Fps is (10 sec: 49152.0, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 1374748672. Throughput: 0: 43963.9. Samples: 1277682520. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-27 21:50:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:50:11,238][06909] Updated weights for policy 0, policy_version 83913 (0.0037) [2024-06-27 21:50:13,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 1374912512. Throughput: 0: 44197.4. Samples: 1277821340. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-27 21:50:13,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:50:15,010][06909] Updated weights for policy 0, policy_version 83923 (0.0036) [2024-06-27 21:50:18,599][06909] Updated weights for policy 0, policy_version 83933 (0.0037) [2024-06-27 21:50:18,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 1375158272. Throughput: 0: 44101.5. Samples: 1278076280. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-27 21:50:18,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:50:22,473][06909] Updated weights for policy 0, policy_version 83943 (0.0034) [2024-06-27 21:50:23,850][06674] Fps is (10 sec: 49151.9, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1375404032. Throughput: 0: 44125.8. Samples: 1278343620. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-27 21:50:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 21:50:26,025][06909] Updated weights for policy 0, policy_version 83953 (0.0033) [2024-06-27 21:50:28,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43965.2, 300 sec: 43875.8). Total num frames: 1375584256. Throughput: 0: 44129.8. Samples: 1278479760. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-27 21:50:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:50:29,904][06909] Updated weights for policy 0, policy_version 83963 (0.0037) [2024-06-27 21:50:33,320][06909] Updated weights for policy 0, policy_version 83973 (0.0033) [2024-06-27 21:50:33,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43965.2, 300 sec: 43986.9). Total num frames: 1375813632. Throughput: 0: 44069.4. Samples: 1278735100. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-27 21:50:33,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:50:37,518][06909] Updated weights for policy 0, policy_version 83983 (0.0024) [2024-06-27 21:50:38,850][06674] Fps is (10 sec: 49151.7, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 1376075776. Throughput: 0: 43916.3. Samples: 1278997600. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-27 21:50:38,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:50:40,793][06909] Updated weights for policy 0, policy_version 83993 (0.0033) [2024-06-27 21:50:43,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43690.8, 300 sec: 43764.7). Total num frames: 1376223232. Throughput: 0: 43972.6. Samples: 1279134980. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-27 21:50:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:50:45,069][06909] Updated weights for policy 0, policy_version 84003 (0.0039) [2024-06-27 21:50:48,130][06909] Updated weights for policy 0, policy_version 84013 (0.0031) [2024-06-27 21:50:48,850][06674] Fps is (10 sec: 39321.7, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1376468992. Throughput: 0: 43979.4. Samples: 1279395300. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-27 21:50:48,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:50:52,384][06909] Updated weights for policy 0, policy_version 84023 (0.0032) [2024-06-27 21:50:53,852][06674] Fps is (10 sec: 49143.6, 60 sec: 43962.5, 300 sec: 44042.2). Total num frames: 1376714752. Throughput: 0: 43916.3. Samples: 1279658820. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 21:50:53,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:50:55,822][06909] Updated weights for policy 0, policy_version 84033 (0.0024) [2024-06-27 21:50:58,707][06887] Signal inference workers to stop experience collection... (18250 times) [2024-06-27 21:50:58,708][06887] Signal inference workers to resume experience collection... (18250 times) [2024-06-27 21:50:58,748][06909] InferenceWorker_p0-w0: stopping experience collection (18250 times) [2024-06-27 21:50:58,749][06909] InferenceWorker_p0-w0: resuming experience collection (18250 times) [2024-06-27 21:50:58,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 1376911360. Throughput: 0: 44021.3. Samples: 1279802300. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 21:50:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:50:59,614][06909] Updated weights for policy 0, policy_version 84043 (0.0036) [2024-06-27 21:51:03,199][06909] Updated weights for policy 0, policy_version 84053 (0.0028) [2024-06-27 21:51:03,850][06674] Fps is (10 sec: 40967.0, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1377124352. Throughput: 0: 43925.0. Samples: 1280052900. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 21:51:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 21:51:07,158][06909] Updated weights for policy 0, policy_version 84063 (0.0033) [2024-06-27 21:51:08,852][06674] Fps is (10 sec: 45866.2, 60 sec: 43689.3, 300 sec: 43986.6). Total num frames: 1377370112. Throughput: 0: 43909.7. Samples: 1280319640. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 21:51:08,853][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:51:10,994][06909] Updated weights for policy 0, policy_version 84073 (0.0026) [2024-06-27 21:51:13,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.8, 300 sec: 43820.2). Total num frames: 1377550336. Throughput: 0: 43982.7. Samples: 1280458980. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 21:51:13,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:51:14,800][06909] Updated weights for policy 0, policy_version 84083 (0.0037) [2024-06-27 21:51:18,191][06909] Updated weights for policy 0, policy_version 84093 (0.0041) [2024-06-27 21:51:18,850][06674] Fps is (10 sec: 40968.0, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1377779712. Throughput: 0: 43873.3. Samples: 1280709400. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 21:51:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:51:22,293][06909] Updated weights for policy 0, policy_version 84103 (0.0039) [2024-06-27 21:51:23,850][06674] Fps is (10 sec: 49151.8, 60 sec: 43963.8, 300 sec: 44043.3). Total num frames: 1378041856. Throughput: 0: 44036.9. Samples: 1280979260. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 21:51:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:51:25,870][06909] Updated weights for policy 0, policy_version 84113 (0.0032) [2024-06-27 21:51:28,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 1378205696. Throughput: 0: 44138.1. Samples: 1281121200. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 21:51:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 21:51:29,494][06909] Updated weights for policy 0, policy_version 84123 (0.0036) [2024-06-27 21:51:33,009][06909] Updated weights for policy 0, policy_version 84133 (0.0025) [2024-06-27 21:51:33,850][06674] Fps is (10 sec: 42598.7, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1378467840. Throughput: 0: 44040.1. Samples: 1281377100. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 21:51:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:51:36,729][06909] Updated weights for policy 0, policy_version 84143 (0.0034) [2024-06-27 21:51:38,850][06674] Fps is (10 sec: 50790.5, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 1378713600. Throughput: 0: 44120.7. Samples: 1281644180. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 21:51:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:51:40,119][06909] Updated weights for policy 0, policy_version 84153 (0.0033) [2024-06-27 21:51:43,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44509.9, 300 sec: 43931.3). Total num frames: 1378893824. Throughput: 0: 44056.1. Samples: 1281784820. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-27 21:51:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:51:44,344][06909] Updated weights for policy 0, policy_version 84163 (0.0039) [2024-06-27 21:51:47,387][06909] Updated weights for policy 0, policy_version 84173 (0.0036) [2024-06-27 21:51:48,850][06674] Fps is (10 sec: 37683.6, 60 sec: 43690.8, 300 sec: 43986.9). Total num frames: 1379090432. Throughput: 0: 44115.1. Samples: 1282038080. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-27 21:51:48,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:51:48,955][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000084174_1379106816.pth... [2024-06-27 21:51:49,008][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000083532_1368588288.pth [2024-06-27 21:51:51,731][06909] Updated weights for policy 0, policy_version 84183 (0.0037) [2024-06-27 21:51:53,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43691.8, 300 sec: 43986.9). Total num frames: 1379336192. Throughput: 0: 43970.4. Samples: 1282298220. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-27 21:51:53,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 21:51:55,123][06909] Updated weights for policy 0, policy_version 84193 (0.0027) [2024-06-27 21:51:58,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.8, 300 sec: 43820.4). Total num frames: 1379532800. Throughput: 0: 43954.3. Samples: 1282436920. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-27 21:51:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:51:59,383][06909] Updated weights for policy 0, policy_version 84203 (0.0035) [2024-06-27 21:52:03,309][06909] Updated weights for policy 0, policy_version 84213 (0.0027) [2024-06-27 21:52:03,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 1379778560. Throughput: 0: 44005.3. Samples: 1282689640. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-27 21:52:03,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:52:06,639][06909] Updated weights for policy 0, policy_version 84223 (0.0033) [2024-06-27 21:52:08,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43692.2, 300 sec: 43931.3). Total num frames: 1379991552. Throughput: 0: 43823.2. Samples: 1282951300. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-27 21:52:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:52:10,523][06909] Updated weights for policy 0, policy_version 84233 (0.0034) [2024-06-27 21:52:13,850][06674] Fps is (10 sec: 42598.8, 60 sec: 44236.8, 300 sec: 43820.2). Total num frames: 1380204544. Throughput: 0: 43770.3. Samples: 1283090860. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-27 21:52:13,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:52:14,188][06909] Updated weights for policy 0, policy_version 84243 (0.0025) [2024-06-27 21:52:14,486][06887] Signal inference workers to stop experience collection... (18300 times) [2024-06-27 21:52:14,486][06887] Signal inference workers to resume experience collection... (18300 times) [2024-06-27 21:52:14,508][06909] InferenceWorker_p0-w0: stopping experience collection (18300 times) [2024-06-27 21:52:14,509][06909] InferenceWorker_p0-w0: resuming experience collection (18300 times) [2024-06-27 21:52:17,768][06909] Updated weights for policy 0, policy_version 84253 (0.0025) [2024-06-27 21:52:18,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 1380433920. Throughput: 0: 44002.7. Samples: 1283357220. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-27 21:52:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:52:21,411][06909] Updated weights for policy 0, policy_version 84263 (0.0028) [2024-06-27 21:52:23,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43690.8, 300 sec: 43987.8). Total num frames: 1380663296. Throughput: 0: 43893.0. Samples: 1283619360. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-27 21:52:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:52:25,252][06909] Updated weights for policy 0, policy_version 84273 (0.0031) [2024-06-27 21:52:28,850][06674] Fps is (10 sec: 42598.8, 60 sec: 44236.9, 300 sec: 43875.8). Total num frames: 1380859904. Throughput: 0: 43811.6. Samples: 1283756340. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-27 21:52:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 21:52:29,119][06909] Updated weights for policy 0, policy_version 84283 (0.0031) [2024-06-27 21:52:32,662][06909] Updated weights for policy 0, policy_version 84293 (0.0023) [2024-06-27 21:52:33,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1381089280. Throughput: 0: 44068.0. Samples: 1284021140. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 21:52:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:52:36,400][06909] Updated weights for policy 0, policy_version 84303 (0.0022) [2024-06-27 21:52:38,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43417.6, 300 sec: 43986.9). Total num frames: 1381318656. Throughput: 0: 44016.9. Samples: 1284278980. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 21:52:38,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:52:40,049][06909] Updated weights for policy 0, policy_version 84313 (0.0026) [2024-06-27 21:52:43,772][06909] Updated weights for policy 0, policy_version 84323 (0.0042) [2024-06-27 21:52:43,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 1381548032. Throughput: 0: 43924.4. Samples: 1284413520. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 21:52:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:52:47,658][06909] Updated weights for policy 0, policy_version 84333 (0.0031) [2024-06-27 21:52:48,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1381744640. Throughput: 0: 44163.7. Samples: 1284677000. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 21:52:48,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:52:51,379][06909] Updated weights for policy 0, policy_version 84343 (0.0039) [2024-06-27 21:52:53,852][06674] Fps is (10 sec: 40951.7, 60 sec: 43689.2, 300 sec: 43931.3). Total num frames: 1381957632. Throughput: 0: 44108.2. Samples: 1284936260. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 21:52:53,852][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:52:55,537][06909] Updated weights for policy 0, policy_version 84353 (0.0026) [2024-06-27 21:52:58,522][06909] Updated weights for policy 0, policy_version 84363 (0.0030) [2024-06-27 21:52:58,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44509.8, 300 sec: 43931.3). Total num frames: 1382203392. Throughput: 0: 43967.1. Samples: 1285069380. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 21:52:58,856][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:53:03,017][06909] Updated weights for policy 0, policy_version 84373 (0.0033) [2024-06-27 21:53:03,850][06674] Fps is (10 sec: 45884.1, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1382416384. Throughput: 0: 44075.9. Samples: 1285340640. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 21:53:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:53:05,873][06909] Updated weights for policy 0, policy_version 84383 (0.0024) [2024-06-27 21:53:08,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 1382629376. Throughput: 0: 43991.9. Samples: 1285599000. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 21:53:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:53:10,500][06909] Updated weights for policy 0, policy_version 84393 (0.0043) [2024-06-27 21:53:13,659][06909] Updated weights for policy 0, policy_version 84403 (0.0027) [2024-06-27 21:53:13,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.8, 300 sec: 43931.4). Total num frames: 1382858752. Throughput: 0: 43772.8. Samples: 1285726120. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 21:53:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:53:17,740][06909] Updated weights for policy 0, policy_version 84413 (0.0032) [2024-06-27 21:53:18,850][06674] Fps is (10 sec: 44236.0, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 1383071744. Throughput: 0: 44045.6. Samples: 1286003200. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 21:53:18,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:53:19,940][06887] Signal inference workers to stop experience collection... (18350 times) [2024-06-27 21:53:19,941][06887] Signal inference workers to resume experience collection... (18350 times) [2024-06-27 21:53:19,965][06909] InferenceWorker_p0-w0: stopping experience collection (18350 times) [2024-06-27 21:53:19,965][06909] InferenceWorker_p0-w0: resuming experience collection (18350 times) [2024-06-27 21:53:20,929][06909] Updated weights for policy 0, policy_version 84423 (0.0029) [2024-06-27 21:53:23,852][06674] Fps is (10 sec: 44227.8, 60 sec: 43962.2, 300 sec: 44042.1). Total num frames: 1383301120. Throughput: 0: 44038.9. Samples: 1286260820. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-27 21:53:23,853][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:53:25,216][06909] Updated weights for policy 0, policy_version 84433 (0.0047) [2024-06-27 21:53:28,243][06909] Updated weights for policy 0, policy_version 84443 (0.0030) [2024-06-27 21:53:28,850][06674] Fps is (10 sec: 45876.1, 60 sec: 44509.8, 300 sec: 43986.9). Total num frames: 1383530496. Throughput: 0: 43947.1. Samples: 1286391140. Policy #0 lag: (min: 1.0, avg: 8.7, max: 21.0) [2024-06-27 21:53:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:53:32,763][06909] Updated weights for policy 0, policy_version 84453 (0.0034) [2024-06-27 21:53:33,852][06674] Fps is (10 sec: 44236.8, 60 sec: 44235.3, 300 sec: 44042.1). Total num frames: 1383743488. Throughput: 0: 44093.6. Samples: 1286661300. Policy #0 lag: (min: 1.0, avg: 8.7, max: 21.0) [2024-06-27 21:53:33,852][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:53:35,653][06909] Updated weights for policy 0, policy_version 84463 (0.0041) [2024-06-27 21:53:38,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1383956480. Throughput: 0: 44199.3. Samples: 1286925140. Policy #0 lag: (min: 1.0, avg: 8.7, max: 21.0) [2024-06-27 21:53:38,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:53:40,244][06909] Updated weights for policy 0, policy_version 84473 (0.0024) [2024-06-27 21:53:43,190][06909] Updated weights for policy 0, policy_version 84483 (0.0030) [2024-06-27 21:53:43,850][06674] Fps is (10 sec: 45884.0, 60 sec: 44236.7, 300 sec: 44042.9). Total num frames: 1384202240. Throughput: 0: 44241.7. Samples: 1287060260. Policy #0 lag: (min: 1.0, avg: 8.7, max: 21.0) [2024-06-27 21:53:43,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:53:47,736][06909] Updated weights for policy 0, policy_version 84493 (0.0029) [2024-06-27 21:53:48,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1384398848. Throughput: 0: 44161.8. Samples: 1287327920. Policy #0 lag: (min: 1.0, avg: 8.7, max: 21.0) [2024-06-27 21:53:48,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:53:48,864][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000084497_1384398848.pth... [2024-06-27 21:53:48,915][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000083852_1373831168.pth [2024-06-27 21:53:50,654][06909] Updated weights for policy 0, policy_version 84503 (0.0030) [2024-06-27 21:53:53,850][06674] Fps is (10 sec: 40960.6, 60 sec: 44238.3, 300 sec: 43986.9). Total num frames: 1384611840. Throughput: 0: 44064.9. Samples: 1287581920. Policy #0 lag: (min: 1.0, avg: 8.7, max: 21.0) [2024-06-27 21:53:53,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 21:53:55,041][06909] Updated weights for policy 0, policy_version 84513 (0.0031) [2024-06-27 21:53:58,153][06909] Updated weights for policy 0, policy_version 84523 (0.0025) [2024-06-27 21:53:58,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1384857600. Throughput: 0: 44158.7. Samples: 1287713260. Policy #0 lag: (min: 1.0, avg: 8.7, max: 21.0) [2024-06-27 21:53:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:54:02,313][06909] Updated weights for policy 0, policy_version 84533 (0.0040) [2024-06-27 21:54:03,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1385054208. Throughput: 0: 43915.8. Samples: 1287979400. Policy #0 lag: (min: 1.0, avg: 8.7, max: 21.0) [2024-06-27 21:54:03,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 21:54:05,881][06909] Updated weights for policy 0, policy_version 84543 (0.0038) [2024-06-27 21:54:08,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1385267200. Throughput: 0: 43842.0. Samples: 1288233620. Policy #0 lag: (min: 1.0, avg: 8.7, max: 21.0) [2024-06-27 21:54:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:54:10,128][06909] Updated weights for policy 0, policy_version 84553 (0.0031) [2024-06-27 21:54:13,140][06909] Updated weights for policy 0, policy_version 84563 (0.0035) [2024-06-27 21:54:13,852][06674] Fps is (10 sec: 45865.9, 60 sec: 44235.3, 300 sec: 44042.1). Total num frames: 1385512960. Throughput: 0: 44018.0. Samples: 1288372040. Policy #0 lag: (min: 1.0, avg: 8.7, max: 21.0) [2024-06-27 21:54:13,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:54:14,129][06887] Signal inference workers to stop experience collection... (18400 times) [2024-06-27 21:54:14,182][06887] Signal inference workers to resume experience collection... (18400 times) [2024-06-27 21:54:14,182][06909] InferenceWorker_p0-w0: stopping experience collection (18400 times) [2024-06-27 21:54:14,200][06909] InferenceWorker_p0-w0: resuming experience collection (18400 times) [2024-06-27 21:54:17,475][06909] Updated weights for policy 0, policy_version 84573 (0.0032) [2024-06-27 21:54:18,852][06674] Fps is (10 sec: 44227.7, 60 sec: 43962.4, 300 sec: 43931.0). Total num frames: 1385709568. Throughput: 0: 44036.4. Samples: 1288642940. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 21:54:18,852][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:54:20,445][06909] Updated weights for policy 0, policy_version 84583 (0.0028) [2024-06-27 21:54:23,850][06674] Fps is (10 sec: 40968.2, 60 sec: 43692.1, 300 sec: 43987.2). Total num frames: 1385922560. Throughput: 0: 43809.8. Samples: 1288896580. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 21:54:23,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:54:25,059][06909] Updated weights for policy 0, policy_version 84593 (0.0038) [2024-06-27 21:54:28,295][06909] Updated weights for policy 0, policy_version 84603 (0.0030) [2024-06-27 21:54:28,850][06674] Fps is (10 sec: 47523.2, 60 sec: 44236.8, 300 sec: 44098.3). Total num frames: 1386184704. Throughput: 0: 43765.4. Samples: 1289029700. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 21:54:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:54:32,724][06909] Updated weights for policy 0, policy_version 84613 (0.0023) [2024-06-27 21:54:33,852][06674] Fps is (10 sec: 44227.9, 60 sec: 43690.7, 300 sec: 43875.5). Total num frames: 1386364928. Throughput: 0: 43735.4. Samples: 1289296100. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 21:54:33,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 21:54:35,566][06909] Updated weights for policy 0, policy_version 84623 (0.0024) [2024-06-27 21:54:38,850][06674] Fps is (10 sec: 39321.5, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1386577920. Throughput: 0: 43868.8. Samples: 1289556020. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 21:54:38,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 21:54:39,830][06909] Updated weights for policy 0, policy_version 84633 (0.0028) [2024-06-27 21:54:42,966][06909] Updated weights for policy 0, policy_version 84643 (0.0026) [2024-06-27 21:54:43,850][06674] Fps is (10 sec: 47523.2, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 1386840064. Throughput: 0: 43960.0. Samples: 1289691460. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 21:54:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:54:47,287][06909] Updated weights for policy 0, policy_version 84653 (0.0032) [2024-06-27 21:54:48,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 1387020288. Throughput: 0: 43932.9. Samples: 1289956380. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 21:54:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:54:50,467][06909] Updated weights for policy 0, policy_version 84663 (0.0030) [2024-06-27 21:54:53,850][06674] Fps is (10 sec: 39321.5, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 1387233280. Throughput: 0: 44135.0. Samples: 1290219700. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 21:54:53,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:54:54,737][06909] Updated weights for policy 0, policy_version 84673 (0.0022) [2024-06-27 21:54:57,639][06909] Updated weights for policy 0, policy_version 84683 (0.0025) [2024-06-27 21:54:58,850][06674] Fps is (10 sec: 47514.0, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 1387495424. Throughput: 0: 44158.9. Samples: 1290359100. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 21:54:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:55:02,309][06909] Updated weights for policy 0, policy_version 84693 (0.0038) [2024-06-27 21:55:03,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 1387675648. Throughput: 0: 43794.9. Samples: 1290613620. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 21:55:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:55:05,367][06909] Updated weights for policy 0, policy_version 84703 (0.0040) [2024-06-27 21:55:08,850][06674] Fps is (10 sec: 39321.9, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1387888640. Throughput: 0: 43895.7. Samples: 1290871880. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 21:55:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:55:09,702][06909] Updated weights for policy 0, policy_version 84713 (0.0035) [2024-06-27 21:55:12,752][06909] Updated weights for policy 0, policy_version 84723 (0.0026) [2024-06-27 21:55:13,850][06674] Fps is (10 sec: 49151.7, 60 sec: 44238.3, 300 sec: 44098.0). Total num frames: 1388167168. Throughput: 0: 44055.1. Samples: 1291012180. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-27 21:55:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 21:55:17,080][06909] Updated weights for policy 0, policy_version 84733 (0.0032) [2024-06-27 21:55:17,554][06887] Signal inference workers to stop experience collection... (18450 times) [2024-06-27 21:55:17,554][06887] Signal inference workers to resume experience collection... (18450 times) [2024-06-27 21:55:17,599][06909] InferenceWorker_p0-w0: stopping experience collection (18450 times) [2024-06-27 21:55:17,599][06909] InferenceWorker_p0-w0: resuming experience collection (18450 times) [2024-06-27 21:55:18,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43692.1, 300 sec: 43820.3). Total num frames: 1388331008. Throughput: 0: 44002.4. Samples: 1291276120. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-27 21:55:18,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:55:20,162][06909] Updated weights for policy 0, policy_version 84743 (0.0027) [2024-06-27 21:55:23,850][06674] Fps is (10 sec: 37683.0, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 1388544000. Throughput: 0: 43857.3. Samples: 1291529600. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-27 21:55:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 21:55:24,694][06909] Updated weights for policy 0, policy_version 84753 (0.0034) [2024-06-27 21:55:27,872][06909] Updated weights for policy 0, policy_version 84763 (0.0028) [2024-06-27 21:55:28,850][06674] Fps is (10 sec: 47513.6, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1388806144. Throughput: 0: 43855.6. Samples: 1291664960. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-27 21:55:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:55:32,377][06909] Updated weights for policy 0, policy_version 84773 (0.0031) [2024-06-27 21:55:33,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43692.2, 300 sec: 43764.7). Total num frames: 1388986368. Throughput: 0: 43893.0. Samples: 1291931560. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-27 21:55:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 21:55:35,386][06909] Updated weights for policy 0, policy_version 84783 (0.0030) [2024-06-27 21:55:38,850][06674] Fps is (10 sec: 39321.3, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 1389199360. Throughput: 0: 43676.0. Samples: 1292185120. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-27 21:55:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 21:55:39,742][06909] Updated weights for policy 0, policy_version 84793 (0.0031) [2024-06-27 21:55:42,774][06909] Updated weights for policy 0, policy_version 84803 (0.0026) [2024-06-27 21:55:43,850][06674] Fps is (10 sec: 49151.9, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 1389477888. Throughput: 0: 43642.6. Samples: 1292323020. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-27 21:55:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:55:46,960][06909] Updated weights for policy 0, policy_version 84813 (0.0028) [2024-06-27 21:55:48,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.6, 300 sec: 43820.5). Total num frames: 1389641728. Throughput: 0: 43935.5. Samples: 1292590720. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-27 21:55:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:55:48,885][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000084818_1389658112.pth... [2024-06-27 21:55:48,938][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000084174_1379106816.pth [2024-06-27 21:55:50,311][06909] Updated weights for policy 0, policy_version 84823 (0.0029) [2024-06-27 21:55:53,850][06674] Fps is (10 sec: 37682.6, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 1389854720. Throughput: 0: 43966.9. Samples: 1292850400. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-27 21:55:53,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 21:55:54,626][06909] Updated weights for policy 0, policy_version 84833 (0.0025) [2024-06-27 21:55:57,661][06909] Updated weights for policy 0, policy_version 84843 (0.0030) [2024-06-27 21:55:58,850][06674] Fps is (10 sec: 49152.2, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 1390133248. Throughput: 0: 43750.2. Samples: 1292980940. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-27 21:55:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 21:56:01,841][06909] Updated weights for policy 0, policy_version 84853 (0.0041) [2024-06-27 21:56:03,850][06674] Fps is (10 sec: 45876.0, 60 sec: 43963.7, 300 sec: 43876.1). Total num frames: 1390313472. Throughput: 0: 43769.8. Samples: 1293245760. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 21:56:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 21:56:05,199][06909] Updated weights for policy 0, policy_version 84863 (0.0021) [2024-06-27 21:56:08,850][06674] Fps is (10 sec: 39321.3, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 1390526464. Throughput: 0: 44077.8. Samples: 1293513100. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 21:56:08,857][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 21:56:09,288][06909] Updated weights for policy 0, policy_version 84873 (0.0036) [2024-06-27 21:56:12,476][06909] Updated weights for policy 0, policy_version 84883 (0.0035) [2024-06-27 21:56:13,850][06674] Fps is (10 sec: 47513.2, 60 sec: 43690.6, 300 sec: 44098.0). Total num frames: 1390788608. Throughput: 0: 44037.3. Samples: 1293646640. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 21:56:13,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:56:16,673][06909] Updated weights for policy 0, policy_version 84893 (0.0024) [2024-06-27 21:56:18,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.6, 300 sec: 43820.3). Total num frames: 1390968832. Throughput: 0: 43804.3. Samples: 1293902760. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 21:56:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:56:19,539][06887] Signal inference workers to stop experience collection... (18500 times) [2024-06-27 21:56:19,596][06909] InferenceWorker_p0-w0: stopping experience collection (18500 times) [2024-06-27 21:56:19,660][06887] Signal inference workers to resume experience collection... (18500 times) [2024-06-27 21:56:19,660][06909] InferenceWorker_p0-w0: resuming experience collection (18500 times) [2024-06-27 21:56:20,262][06909] Updated weights for policy 0, policy_version 84903 (0.0022) [2024-06-27 21:56:23,850][06674] Fps is (10 sec: 37683.4, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 1391165440. Throughput: 0: 44067.2. Samples: 1294168140. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 21:56:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:56:24,143][06909] Updated weights for policy 0, policy_version 84913 (0.0021) [2024-06-27 21:56:27,626][06909] Updated weights for policy 0, policy_version 84923 (0.0035) [2024-06-27 21:56:28,850][06674] Fps is (10 sec: 47514.4, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1391443968. Throughput: 0: 43996.5. Samples: 1294302860. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 21:56:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:56:31,535][06909] Updated weights for policy 0, policy_version 84933 (0.0031) [2024-06-27 21:56:33,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.7, 300 sec: 43764.7). Total num frames: 1391624192. Throughput: 0: 43897.8. Samples: 1294566120. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 21:56:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:56:34,887][06909] Updated weights for policy 0, policy_version 84943 (0.0033) [2024-06-27 21:56:38,850][06674] Fps is (10 sec: 40960.1, 60 sec: 44236.9, 300 sec: 43931.3). Total num frames: 1391853568. Throughput: 0: 43946.0. Samples: 1294827960. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 21:56:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:56:38,933][06909] Updated weights for policy 0, policy_version 84953 (0.0035) [2024-06-27 21:56:42,281][06909] Updated weights for policy 0, policy_version 84963 (0.0040) [2024-06-27 21:56:43,850][06674] Fps is (10 sec: 45875.8, 60 sec: 43417.7, 300 sec: 44042.4). Total num frames: 1392082944. Throughput: 0: 44061.0. Samples: 1294963680. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 21:56:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:56:46,342][06909] Updated weights for policy 0, policy_version 84973 (0.0039) [2024-06-27 21:56:48,850][06674] Fps is (10 sec: 44236.1, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 1392295936. Throughput: 0: 43978.6. Samples: 1295224800. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 21:56:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:56:49,747][06909] Updated weights for policy 0, policy_version 84983 (0.0035) [2024-06-27 21:56:53,850][06674] Fps is (10 sec: 42597.7, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 1392508928. Throughput: 0: 43861.4. Samples: 1295486860. Policy #0 lag: (min: 0.0, avg: 9.5, max: 24.0) [2024-06-27 21:56:53,850][06674] Avg episode reward: [(0, '0.402')] [2024-06-27 21:56:53,873][06909] Updated weights for policy 0, policy_version 84993 (0.0022) [2024-06-27 21:56:57,569][06909] Updated weights for policy 0, policy_version 85003 (0.0049) [2024-06-27 21:56:58,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 1392754688. Throughput: 0: 43764.9. Samples: 1295616060. Policy #0 lag: (min: 0.0, avg: 9.5, max: 24.0) [2024-06-27 21:56:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 21:57:01,386][06909] Updated weights for policy 0, policy_version 85013 (0.0037) [2024-06-27 21:57:03,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43417.6, 300 sec: 43820.3). Total num frames: 1392918528. Throughput: 0: 43947.3. Samples: 1295880380. Policy #0 lag: (min: 0.0, avg: 9.5, max: 24.0) [2024-06-27 21:57:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:57:04,962][06909] Updated weights for policy 0, policy_version 85023 (0.0040) [2024-06-27 21:57:08,701][06909] Updated weights for policy 0, policy_version 85033 (0.0030) [2024-06-27 21:57:08,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1393180672. Throughput: 0: 43871.9. Samples: 1296142380. Policy #0 lag: (min: 0.0, avg: 9.5, max: 24.0) [2024-06-27 21:57:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:57:12,358][06909] Updated weights for policy 0, policy_version 85043 (0.0032) [2024-06-27 21:57:13,850][06674] Fps is (10 sec: 49152.0, 60 sec: 43690.8, 300 sec: 43986.9). Total num frames: 1393410048. Throughput: 0: 43905.8. Samples: 1296278620. Policy #0 lag: (min: 0.0, avg: 9.5, max: 24.0) [2024-06-27 21:57:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:57:15,940][06909] Updated weights for policy 0, policy_version 85053 (0.0032) [2024-06-27 21:57:18,851][06674] Fps is (10 sec: 42592.8, 60 sec: 43962.8, 300 sec: 43875.6). Total num frames: 1393606656. Throughput: 0: 43848.9. Samples: 1296539380. Policy #0 lag: (min: 0.0, avg: 9.5, max: 24.0) [2024-06-27 21:57:18,852][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:57:19,673][06909] Updated weights for policy 0, policy_version 85063 (0.0035) [2024-06-27 21:57:23,441][06909] Updated weights for policy 0, policy_version 85073 (0.0026) [2024-06-27 21:57:23,850][06674] Fps is (10 sec: 42598.2, 60 sec: 44509.9, 300 sec: 43986.9). Total num frames: 1393836032. Throughput: 0: 43863.1. Samples: 1296801800. Policy #0 lag: (min: 0.0, avg: 9.5, max: 24.0) [2024-06-27 21:57:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:57:27,001][06909] Updated weights for policy 0, policy_version 85083 (0.0044) [2024-06-27 21:57:28,850][06674] Fps is (10 sec: 45881.5, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 1394065408. Throughput: 0: 43974.1. Samples: 1296942520. Policy #0 lag: (min: 0.0, avg: 9.5, max: 24.0) [2024-06-27 21:57:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:57:30,769][06909] Updated weights for policy 0, policy_version 85093 (0.0035) [2024-06-27 21:57:32,687][06887] Signal inference workers to stop experience collection... (18550 times) [2024-06-27 21:57:32,687][06887] Signal inference workers to resume experience collection... (18550 times) [2024-06-27 21:57:32,701][06909] InferenceWorker_p0-w0: stopping experience collection (18550 times) [2024-06-27 21:57:32,713][06909] InferenceWorker_p0-w0: resuming experience collection (18550 times) [2024-06-27 21:57:33,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 1394262016. Throughput: 0: 43935.2. Samples: 1297201880. Policy #0 lag: (min: 0.0, avg: 9.5, max: 24.0) [2024-06-27 21:57:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:57:35,089][06909] Updated weights for policy 0, policy_version 85103 (0.0025) [2024-06-27 21:57:38,585][06909] Updated weights for policy 0, policy_version 85113 (0.0043) [2024-06-27 21:57:38,850][06674] Fps is (10 sec: 44236.2, 60 sec: 44236.6, 300 sec: 43931.3). Total num frames: 1394507776. Throughput: 0: 43764.8. Samples: 1297456280. Policy #0 lag: (min: 0.0, avg: 9.5, max: 24.0) [2024-06-27 21:57:38,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:57:42,403][06909] Updated weights for policy 0, policy_version 85123 (0.0042) [2024-06-27 21:57:43,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 1394720768. Throughput: 0: 43934.7. Samples: 1297593120. Policy #0 lag: (min: 0.0, avg: 9.5, max: 24.0) [2024-06-27 21:57:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:57:45,913][06909] Updated weights for policy 0, policy_version 85133 (0.0031) [2024-06-27 21:57:48,850][06674] Fps is (10 sec: 39321.9, 60 sec: 43417.6, 300 sec: 43876.1). Total num frames: 1394900992. Throughput: 0: 43886.1. Samples: 1297855260. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 21:57:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:57:48,867][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000085139_1394917376.pth... [2024-06-27 21:57:48,912][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000084497_1384398848.pth [2024-06-27 21:57:49,661][06909] Updated weights for policy 0, policy_version 85143 (0.0044) [2024-06-27 21:57:53,254][06909] Updated weights for policy 0, policy_version 85153 (0.0032) [2024-06-27 21:57:53,850][06674] Fps is (10 sec: 44237.4, 60 sec: 44236.9, 300 sec: 43931.4). Total num frames: 1395163136. Throughput: 0: 43702.8. Samples: 1298109000. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 21:57:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 21:57:57,027][06909] Updated weights for policy 0, policy_version 85163 (0.0039) [2024-06-27 21:57:58,850][06674] Fps is (10 sec: 47513.5, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 1395376128. Throughput: 0: 43887.9. Samples: 1298253580. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 21:57:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 21:58:00,535][06909] Updated weights for policy 0, policy_version 85173 (0.0038) [2024-06-27 21:58:03,850][06674] Fps is (10 sec: 40959.7, 60 sec: 44236.8, 300 sec: 43875.8). Total num frames: 1395572736. Throughput: 0: 43856.5. Samples: 1298512860. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 21:58:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 21:58:04,701][06909] Updated weights for policy 0, policy_version 85183 (0.0042) [2024-06-27 21:58:08,129][06909] Updated weights for policy 0, policy_version 85193 (0.0030) [2024-06-27 21:58:08,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1395834880. Throughput: 0: 43755.0. Samples: 1298770780. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 21:58:08,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 21:58:12,343][06909] Updated weights for policy 0, policy_version 85203 (0.0041) [2024-06-27 21:58:13,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43690.6, 300 sec: 43931.4). Total num frames: 1396031488. Throughput: 0: 43687.1. Samples: 1298908440. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 21:58:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 21:58:15,658][06909] Updated weights for policy 0, policy_version 85213 (0.0030) [2024-06-27 21:58:18,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43964.8, 300 sec: 43876.1). Total num frames: 1396244480. Throughput: 0: 43756.9. Samples: 1299170940. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 21:58:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:58:19,640][06909] Updated weights for policy 0, policy_version 85223 (0.0043) [2024-06-27 21:58:23,133][06909] Updated weights for policy 0, policy_version 85233 (0.0032) [2024-06-27 21:58:23,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44236.7, 300 sec: 43931.3). Total num frames: 1396490240. Throughput: 0: 43998.3. Samples: 1299436200. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 21:58:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:58:26,938][06909] Updated weights for policy 0, policy_version 85243 (0.0039) [2024-06-27 21:58:28,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.7, 300 sec: 43876.1). Total num frames: 1396686848. Throughput: 0: 44101.8. Samples: 1299577700. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 21:58:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:58:30,347][06909] Updated weights for policy 0, policy_version 85253 (0.0036) [2024-06-27 21:58:33,775][06887] Signal inference workers to stop experience collection... (18600 times) [2024-06-27 21:58:33,776][06887] Signal inference workers to resume experience collection... (18600 times) [2024-06-27 21:58:33,789][06909] InferenceWorker_p0-w0: stopping experience collection (18600 times) [2024-06-27 21:58:33,824][06909] InferenceWorker_p0-w0: resuming experience collection (18600 times) [2024-06-27 21:58:33,852][06674] Fps is (10 sec: 42590.2, 60 sec: 44235.3, 300 sec: 43931.0). Total num frames: 1396916224. Throughput: 0: 44100.3. Samples: 1299839860. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 21:58:33,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 21:58:34,082][06909] Updated weights for policy 0, policy_version 85263 (0.0031) [2024-06-27 21:58:37,866][06909] Updated weights for policy 0, policy_version 85273 (0.0029) [2024-06-27 21:58:38,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 1397145600. Throughput: 0: 44339.9. Samples: 1300104300. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 21:58:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:58:41,736][06909] Updated weights for policy 0, policy_version 85283 (0.0044) [2024-06-27 21:58:43,850][06674] Fps is (10 sec: 44245.9, 60 sec: 43963.8, 300 sec: 43931.4). Total num frames: 1397358592. Throughput: 0: 44182.8. Samples: 1300241800. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 21:58:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-27 21:58:45,462][06909] Updated weights for policy 0, policy_version 85293 (0.0034) [2024-06-27 21:58:48,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44782.9, 300 sec: 43986.9). Total num frames: 1397587968. Throughput: 0: 44286.1. Samples: 1300505740. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 21:58:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:58:49,135][06909] Updated weights for policy 0, policy_version 85303 (0.0032) [2024-06-27 21:58:52,853][06909] Updated weights for policy 0, policy_version 85313 (0.0041) [2024-06-27 21:58:53,852][06674] Fps is (10 sec: 45865.6, 60 sec: 44235.2, 300 sec: 43931.0). Total num frames: 1397817344. Throughput: 0: 44326.5. Samples: 1300765560. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 21:58:53,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:58:56,432][06909] Updated weights for policy 0, policy_version 85323 (0.0023) [2024-06-27 21:58:58,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 1398013952. Throughput: 0: 44297.7. Samples: 1300901840. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 21:58:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:59:00,130][06909] Updated weights for policy 0, policy_version 85333 (0.0026) [2024-06-27 21:59:03,850][06674] Fps is (10 sec: 42607.3, 60 sec: 44509.9, 300 sec: 43986.9). Total num frames: 1398243328. Throughput: 0: 44542.7. Samples: 1301175360. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 21:59:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 21:59:03,916][06909] Updated weights for policy 0, policy_version 85343 (0.0038) [2024-06-27 21:59:07,570][06909] Updated weights for policy 0, policy_version 85353 (0.0035) [2024-06-27 21:59:08,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.7, 300 sec: 43876.1). Total num frames: 1398456320. Throughput: 0: 44348.9. Samples: 1301431900. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 21:59:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 21:59:11,281][06909] Updated weights for policy 0, policy_version 85363 (0.0035) [2024-06-27 21:59:13,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.8, 300 sec: 43931.6). Total num frames: 1398669312. Throughput: 0: 44081.8. Samples: 1301561380. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 21:59:13,850][06674] Avg episode reward: [(0, '0.453')] [2024-06-27 21:59:14,988][06909] Updated weights for policy 0, policy_version 85373 (0.0034) [2024-06-27 21:59:18,744][06909] Updated weights for policy 0, policy_version 85383 (0.0029) [2024-06-27 21:59:18,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 1398915072. Throughput: 0: 44252.2. Samples: 1301831120. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 21:59:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 21:59:22,509][06909] Updated weights for policy 0, policy_version 85393 (0.0030) [2024-06-27 21:59:23,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 1399128064. Throughput: 0: 44104.5. Samples: 1302089000. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 21:59:23,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:59:26,215][06909] Updated weights for policy 0, policy_version 85403 (0.0033) [2024-06-27 21:59:28,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43963.8, 300 sec: 43931.7). Total num frames: 1399324672. Throughput: 0: 43903.1. Samples: 1302217440. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 21:59:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:59:30,259][06909] Updated weights for policy 0, policy_version 85413 (0.0032) [2024-06-27 21:59:30,826][06887] Signal inference workers to stop experience collection... (18650 times) [2024-06-27 21:59:30,826][06887] Signal inference workers to resume experience collection... (18650 times) [2024-06-27 21:59:30,876][06909] InferenceWorker_p0-w0: stopping experience collection (18650 times) [2024-06-27 21:59:30,876][06909] InferenceWorker_p0-w0: resuming experience collection (18650 times) [2024-06-27 21:59:33,654][06909] Updated weights for policy 0, policy_version 85423 (0.0027) [2024-06-27 21:59:33,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44238.3, 300 sec: 44042.4). Total num frames: 1399570432. Throughput: 0: 44028.2. Samples: 1302487000. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 21:59:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:59:37,711][06909] Updated weights for policy 0, policy_version 85433 (0.0047) [2024-06-27 21:59:38,850][06674] Fps is (10 sec: 45874.5, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 1399783424. Throughput: 0: 43983.2. Samples: 1302744720. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 21:59:38,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:59:41,104][06909] Updated weights for policy 0, policy_version 85443 (0.0036) [2024-06-27 21:59:43,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1399996416. Throughput: 0: 43824.1. Samples: 1302873920. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 21:59:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 21:59:44,937][06909] Updated weights for policy 0, policy_version 85453 (0.0031) [2024-06-27 21:59:48,326][06909] Updated weights for policy 0, policy_version 85463 (0.0032) [2024-06-27 21:59:48,850][06674] Fps is (10 sec: 47514.2, 60 sec: 44510.0, 300 sec: 44153.5). Total num frames: 1400258560. Throughput: 0: 43875.5. Samples: 1303149760. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 21:59:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:59:48,877][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000085465_1400258560.pth... [2024-06-27 21:59:48,921][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000084818_1389658112.pth [2024-06-27 21:59:52,281][06909] Updated weights for policy 0, policy_version 85473 (0.0040) [2024-06-27 21:59:53,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43692.1, 300 sec: 43875.8). Total num frames: 1400438784. Throughput: 0: 43974.2. Samples: 1303410740. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 21:59:53,852][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 21:59:55,842][06909] Updated weights for policy 0, policy_version 85483 (0.0036) [2024-06-27 21:59:58,850][06674] Fps is (10 sec: 40959.5, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1400668160. Throughput: 0: 43952.8. Samples: 1303539260. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 21:59:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 21:59:59,909][06909] Updated weights for policy 0, policy_version 85493 (0.0044) [2024-06-27 22:00:03,565][06909] Updated weights for policy 0, policy_version 85503 (0.0029) [2024-06-27 22:00:03,852][06674] Fps is (10 sec: 45866.0, 60 sec: 44235.2, 300 sec: 44097.6). Total num frames: 1400897536. Throughput: 0: 43761.1. Samples: 1303800460. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 22:00:03,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:00:07,661][06909] Updated weights for policy 0, policy_version 85513 (0.0036) [2024-06-27 22:00:08,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.7, 300 sec: 43820.3). Total num frames: 1401094144. Throughput: 0: 43912.0. Samples: 1304065040. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 22:00:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:00:11,042][06909] Updated weights for policy 0, policy_version 85523 (0.0036) [2024-06-27 22:00:13,850][06674] Fps is (10 sec: 42606.8, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 1401323520. Throughput: 0: 43925.6. Samples: 1304194100. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 22:00:13,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:00:14,850][06909] Updated weights for policy 0, policy_version 85533 (0.0029) [2024-06-27 22:00:18,397][06909] Updated weights for policy 0, policy_version 85543 (0.0032) [2024-06-27 22:00:18,850][06674] Fps is (10 sec: 47513.4, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 1401569280. Throughput: 0: 43964.8. Samples: 1304465420. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 22:00:18,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 22:00:22,193][06909] Updated weights for policy 0, policy_version 85553 (0.0034) [2024-06-27 22:00:23,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 1401749504. Throughput: 0: 43912.5. Samples: 1304720780. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 22:00:23,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:00:25,734][06909] Updated weights for policy 0, policy_version 85563 (0.0029) [2024-06-27 22:00:28,850][06674] Fps is (10 sec: 40960.6, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1401978880. Throughput: 0: 43853.8. Samples: 1304847340. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 22:00:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:00:29,761][06909] Updated weights for policy 0, policy_version 85573 (0.0035) [2024-06-27 22:00:33,212][06909] Updated weights for policy 0, policy_version 85583 (0.0028) [2024-06-27 22:00:33,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 1402208256. Throughput: 0: 43846.6. Samples: 1305122860. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 22:00:33,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 22:00:37,128][06909] Updated weights for policy 0, policy_version 85593 (0.0037) [2024-06-27 22:00:38,856][06674] Fps is (10 sec: 42572.3, 60 sec: 43686.3, 300 sec: 43819.4). Total num frames: 1402404864. Throughput: 0: 43852.3. Samples: 1305384360. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 22:00:38,857][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:00:40,982][06909] Updated weights for policy 0, policy_version 85603 (0.0045) [2024-06-27 22:00:43,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1402634240. Throughput: 0: 43845.8. Samples: 1305512320. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 22:00:43,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:00:44,738][06909] Updated weights for policy 0, policy_version 85613 (0.0041) [2024-06-27 22:00:48,406][06909] Updated weights for policy 0, policy_version 85623 (0.0027) [2024-06-27 22:00:48,850][06674] Fps is (10 sec: 45902.8, 60 sec: 43417.5, 300 sec: 44098.0). Total num frames: 1402863616. Throughput: 0: 43848.2. Samples: 1305773540. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 22:00:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:00:52,163][06909] Updated weights for policy 0, policy_version 85633 (0.0033) [2024-06-27 22:00:53,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43417.7, 300 sec: 43764.7). Total num frames: 1403043840. Throughput: 0: 43888.1. Samples: 1306040000. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 22:00:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:00:55,728][06909] Updated weights for policy 0, policy_version 85643 (0.0022) [2024-06-27 22:00:58,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1403289600. Throughput: 0: 43820.4. Samples: 1306166020. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 22:00:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:00:59,437][06909] Updated weights for policy 0, policy_version 85653 (0.0035) [2024-06-27 22:01:02,911][06887] Signal inference workers to stop experience collection... (18700 times) [2024-06-27 22:01:02,912][06887] Signal inference workers to resume experience collection... (18700 times) [2024-06-27 22:01:02,932][06909] InferenceWorker_p0-w0: stopping experience collection (18700 times) [2024-06-27 22:01:02,932][06909] InferenceWorker_p0-w0: resuming experience collection (18700 times) [2024-06-27 22:01:03,321][06909] Updated weights for policy 0, policy_version 85663 (0.0038) [2024-06-27 22:01:03,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43419.1, 300 sec: 43986.9). Total num frames: 1403502592. Throughput: 0: 43600.6. Samples: 1306427440. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 22:01:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:01:07,623][06909] Updated weights for policy 0, policy_version 85673 (0.0038) [2024-06-27 22:01:08,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43417.7, 300 sec: 43764.7). Total num frames: 1403699200. Throughput: 0: 43632.4. Samples: 1306684240. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 22:01:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:01:10,957][06909] Updated weights for policy 0, policy_version 85683 (0.0025) [2024-06-27 22:01:13,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1403944960. Throughput: 0: 43732.3. Samples: 1306815300. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 22:01:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:01:14,816][06909] Updated weights for policy 0, policy_version 85693 (0.0039) [2024-06-27 22:01:18,734][06909] Updated weights for policy 0, policy_version 85703 (0.0034) [2024-06-27 22:01:18,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43144.6, 300 sec: 44042.4). Total num frames: 1404157952. Throughput: 0: 43548.0. Samples: 1307082520. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 22:01:18,850][06674] Avg episode reward: [(0, '0.399')] [2024-06-27 22:01:22,426][06909] Updated weights for policy 0, policy_version 85713 (0.0020) [2024-06-27 22:01:23,850][06674] Fps is (10 sec: 40960.6, 60 sec: 43417.7, 300 sec: 43764.7). Total num frames: 1404354560. Throughput: 0: 43704.2. Samples: 1307350780. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 22:01:23,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:01:26,151][06909] Updated weights for policy 0, policy_version 85723 (0.0039) [2024-06-27 22:01:28,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 1404600320. Throughput: 0: 43554.2. Samples: 1307472260. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 22:01:28,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 22:01:29,878][06909] Updated weights for policy 0, policy_version 85733 (0.0035) [2024-06-27 22:01:33,447][06909] Updated weights for policy 0, policy_version 85743 (0.0029) [2024-06-27 22:01:33,850][06674] Fps is (10 sec: 47513.2, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1404829696. Throughput: 0: 43687.6. Samples: 1307739480. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 22:01:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:01:37,037][06909] Updated weights for policy 0, policy_version 85753 (0.0024) [2024-06-27 22:01:38,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43695.0, 300 sec: 43875.8). Total num frames: 1405026304. Throughput: 0: 43789.1. Samples: 1308010520. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 22:01:38,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:01:40,871][06909] Updated weights for policy 0, policy_version 85763 (0.0031) [2024-06-27 22:01:43,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1405272064. Throughput: 0: 43820.0. Samples: 1308137920. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 22:01:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:01:44,267][06909] Updated weights for policy 0, policy_version 85773 (0.0043) [2024-06-27 22:01:48,526][06909] Updated weights for policy 0, policy_version 85783 (0.0035) [2024-06-27 22:01:48,850][06674] Fps is (10 sec: 45876.0, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1405485056. Throughput: 0: 43947.5. Samples: 1308405080. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 22:01:48,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:01:48,895][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000085785_1405501440.pth... [2024-06-27 22:01:48,938][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000085139_1394917376.pth [2024-06-27 22:01:52,059][06909] Updated weights for policy 0, policy_version 85793 (0.0026) [2024-06-27 22:01:53,850][06674] Fps is (10 sec: 39321.8, 60 sec: 43690.6, 300 sec: 43764.7). Total num frames: 1405665280. Throughput: 0: 44103.0. Samples: 1308668880. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 22:01:53,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:01:56,183][06909] Updated weights for policy 0, policy_version 85803 (0.0033) [2024-06-27 22:01:58,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 1405927424. Throughput: 0: 44044.4. Samples: 1308797300. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 22:01:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:01:59,424][06909] Updated weights for policy 0, policy_version 85813 (0.0026) [2024-06-27 22:02:03,452][06909] Updated weights for policy 0, policy_version 85823 (0.0028) [2024-06-27 22:02:03,850][06674] Fps is (10 sec: 47513.8, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 1406140416. Throughput: 0: 43978.7. Samples: 1309061560. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-27 22:02:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:02:06,878][06909] Updated weights for policy 0, policy_version 85833 (0.0031) [2024-06-27 22:02:08,850][06674] Fps is (10 sec: 40960.6, 60 sec: 43963.7, 300 sec: 43820.3). Total num frames: 1406337024. Throughput: 0: 43872.8. Samples: 1309325060. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 22:02:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:02:10,832][06909] Updated weights for policy 0, policy_version 85843 (0.0025) [2024-06-27 22:02:13,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44236.8, 300 sec: 44042.6). Total num frames: 1406599168. Throughput: 0: 44040.5. Samples: 1309454080. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 22:02:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:02:14,119][06909] Updated weights for policy 0, policy_version 85853 (0.0029) [2024-06-27 22:02:17,985][06909] Updated weights for policy 0, policy_version 85863 (0.0039) [2024-06-27 22:02:18,850][06674] Fps is (10 sec: 47513.4, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1406812160. Throughput: 0: 44148.0. Samples: 1309726140. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 22:02:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:02:21,374][06909] Updated weights for policy 0, policy_version 85873 (0.0033) [2024-06-27 22:02:23,850][06674] Fps is (10 sec: 40960.0, 60 sec: 44236.7, 300 sec: 43875.8). Total num frames: 1407008768. Throughput: 0: 44115.2. Samples: 1309995700. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 22:02:23,851][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:02:25,726][06909] Updated weights for policy 0, policy_version 85883 (0.0038) [2024-06-27 22:02:26,613][06887] Signal inference workers to stop experience collection... (18750 times) [2024-06-27 22:02:26,651][06909] InferenceWorker_p0-w0: stopping experience collection (18750 times) [2024-06-27 22:02:26,665][06887] Signal inference workers to resume experience collection... (18750 times) [2024-06-27 22:02:26,673][06909] InferenceWorker_p0-w0: resuming experience collection (18750 times) [2024-06-27 22:02:28,731][06909] Updated weights for policy 0, policy_version 85893 (0.0031) [2024-06-27 22:02:28,852][06674] Fps is (10 sec: 45865.8, 60 sec: 44508.4, 300 sec: 44097.6). Total num frames: 1407270912. Throughput: 0: 43973.2. Samples: 1310116800. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 22:02:28,853][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:02:33,349][06909] Updated weights for policy 0, policy_version 85903 (0.0026) [2024-06-27 22:02:33,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43963.7, 300 sec: 43931.4). Total num frames: 1407467520. Throughput: 0: 43941.8. Samples: 1310382460. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 22:02:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:02:36,576][06909] Updated weights for policy 0, policy_version 85913 (0.0038) [2024-06-27 22:02:38,850][06674] Fps is (10 sec: 40968.8, 60 sec: 44237.0, 300 sec: 43931.4). Total num frames: 1407680512. Throughput: 0: 44149.9. Samples: 1310655620. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 22:02:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:02:40,645][06909] Updated weights for policy 0, policy_version 85923 (0.0028) [2024-06-27 22:02:43,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 1407909888. Throughput: 0: 44117.5. Samples: 1310782580. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 22:02:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:02:43,950][06909] Updated weights for policy 0, policy_version 85933 (0.0036) [2024-06-27 22:02:47,813][06909] Updated weights for policy 0, policy_version 85943 (0.0040) [2024-06-27 22:02:48,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1408139264. Throughput: 0: 44299.6. Samples: 1311055040. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 22:02:48,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:02:51,114][06909] Updated weights for policy 0, policy_version 85953 (0.0035) [2024-06-27 22:02:53,850][06674] Fps is (10 sec: 44237.3, 60 sec: 44783.1, 300 sec: 43986.9). Total num frames: 1408352256. Throughput: 0: 44329.9. Samples: 1311319900. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 22:02:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:02:55,233][06909] Updated weights for policy 0, policy_version 85963 (0.0027) [2024-06-27 22:02:58,720][06909] Updated weights for policy 0, policy_version 85973 (0.0041) [2024-06-27 22:02:58,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.9, 300 sec: 44097.9). Total num frames: 1408581632. Throughput: 0: 44372.0. Samples: 1311450820. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-27 22:02:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:03:02,776][06909] Updated weights for policy 0, policy_version 85983 (0.0030) [2024-06-27 22:03:03,850][06674] Fps is (10 sec: 44236.1, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 1408794624. Throughput: 0: 44206.7. Samples: 1311715440. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-27 22:03:03,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:03:06,143][06909] Updated weights for policy 0, policy_version 85993 (0.0032) [2024-06-27 22:03:08,850][06674] Fps is (10 sec: 42597.8, 60 sec: 44509.7, 300 sec: 43986.9). Total num frames: 1409007616. Throughput: 0: 44062.6. Samples: 1311978520. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-27 22:03:08,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:03:10,225][06909] Updated weights for policy 0, policy_version 86003 (0.0041) [2024-06-27 22:03:13,620][06909] Updated weights for policy 0, policy_version 86013 (0.0035) [2024-06-27 22:03:13,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1409236992. Throughput: 0: 44287.8. Samples: 1312109660. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-27 22:03:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:03:17,809][06909] Updated weights for policy 0, policy_version 86023 (0.0035) [2024-06-27 22:03:18,850][06674] Fps is (10 sec: 44237.8, 60 sec: 43963.8, 300 sec: 43931.4). Total num frames: 1409449984. Throughput: 0: 44204.4. Samples: 1312371660. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-27 22:03:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:03:21,122][06909] Updated weights for policy 0, policy_version 86033 (0.0036) [2024-06-27 22:03:23,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 1409662976. Throughput: 0: 44019.9. Samples: 1312636520. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-27 22:03:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:03:25,124][06909] Updated weights for policy 0, policy_version 86043 (0.0030) [2024-06-27 22:03:28,309][06909] Updated weights for policy 0, policy_version 86053 (0.0026) [2024-06-27 22:03:28,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43692.1, 300 sec: 43987.2). Total num frames: 1409892352. Throughput: 0: 44107.0. Samples: 1312767400. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-27 22:03:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:03:32,304][06909] Updated weights for policy 0, policy_version 86063 (0.0028) [2024-06-27 22:03:33,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 1410121728. Throughput: 0: 44030.3. Samples: 1313036400. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-27 22:03:33,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 22:03:35,791][06909] Updated weights for policy 0, policy_version 86073 (0.0033) [2024-06-27 22:03:38,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 1410318336. Throughput: 0: 44120.3. Samples: 1313305320. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-27 22:03:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:03:40,042][06909] Updated weights for policy 0, policy_version 86083 (0.0039) [2024-06-27 22:03:43,189][06909] Updated weights for policy 0, policy_version 86093 (0.0031) [2024-06-27 22:03:43,850][06674] Fps is (10 sec: 44235.8, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 1410564096. Throughput: 0: 44054.2. Samples: 1313433260. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-27 22:03:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:03:47,476][06909] Updated weights for policy 0, policy_version 86103 (0.0025) [2024-06-27 22:03:48,852][06674] Fps is (10 sec: 45865.7, 60 sec: 43962.2, 300 sec: 43931.3). Total num frames: 1410777088. Throughput: 0: 44111.3. Samples: 1313700540. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-27 22:03:48,853][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:03:48,859][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000086107_1410777088.pth... [2024-06-27 22:03:48,909][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000085465_1400258560.pth [2024-06-27 22:03:50,342][06909] Updated weights for policy 0, policy_version 86113 (0.0030) [2024-06-27 22:03:53,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1410990080. Throughput: 0: 44085.1. Samples: 1313962340. Policy #0 lag: (min: 1.0, avg: 11.7, max: 22.0) [2024-06-27 22:03:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:03:54,806][06909] Updated weights for policy 0, policy_version 86123 (0.0043) [2024-06-27 22:03:56,527][06887] Signal inference workers to stop experience collection... (18800 times) [2024-06-27 22:03:56,552][06909] InferenceWorker_p0-w0: stopping experience collection (18800 times) [2024-06-27 22:03:56,587][06887] Signal inference workers to resume experience collection... (18800 times) [2024-06-27 22:03:56,587][06909] InferenceWorker_p0-w0: resuming experience collection (18800 times) [2024-06-27 22:03:58,042][06909] Updated weights for policy 0, policy_version 86133 (0.0031) [2024-06-27 22:03:58,850][06674] Fps is (10 sec: 42607.4, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 1411203072. Throughput: 0: 44168.0. Samples: 1314097220. Policy #0 lag: (min: 1.0, avg: 11.7, max: 22.0) [2024-06-27 22:03:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 22:04:02,092][06909] Updated weights for policy 0, policy_version 86143 (0.0043) [2024-06-27 22:04:03,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1411432448. Throughput: 0: 44206.6. Samples: 1314360960. Policy #0 lag: (min: 1.0, avg: 11.7, max: 22.0) [2024-06-27 22:04:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:04:05,406][06909] Updated weights for policy 0, policy_version 86153 (0.0044) [2024-06-27 22:04:08,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1411645440. Throughput: 0: 44244.4. Samples: 1314627520. Policy #0 lag: (min: 1.0, avg: 11.7, max: 22.0) [2024-06-27 22:04:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:04:09,539][06909] Updated weights for policy 0, policy_version 86163 (0.0032) [2024-06-27 22:04:12,792][06909] Updated weights for policy 0, policy_version 86173 (0.0036) [2024-06-27 22:04:13,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 1411874816. Throughput: 0: 44184.6. Samples: 1314755700. Policy #0 lag: (min: 1.0, avg: 11.7, max: 22.0) [2024-06-27 22:04:13,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:04:16,709][06909] Updated weights for policy 0, policy_version 86183 (0.0034) [2024-06-27 22:04:18,853][06674] Fps is (10 sec: 47497.3, 60 sec: 44507.2, 300 sec: 44041.9). Total num frames: 1412120576. Throughput: 0: 44244.9. Samples: 1315027580. Policy #0 lag: (min: 1.0, avg: 11.7, max: 22.0) [2024-06-27 22:04:18,854][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:04:20,012][06909] Updated weights for policy 0, policy_version 86193 (0.0031) [2024-06-27 22:04:23,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1412300800. Throughput: 0: 44075.2. Samples: 1315288700. Policy #0 lag: (min: 1.0, avg: 11.7, max: 22.0) [2024-06-27 22:04:23,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:04:24,710][06909] Updated weights for policy 0, policy_version 86203 (0.0030) [2024-06-27 22:04:27,688][06909] Updated weights for policy 0, policy_version 86213 (0.0032) [2024-06-27 22:04:28,850][06674] Fps is (10 sec: 42613.1, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1412546560. Throughput: 0: 44056.9. Samples: 1315415820. Policy #0 lag: (min: 1.0, avg: 11.7, max: 22.0) [2024-06-27 22:04:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:04:32,259][06909] Updated weights for policy 0, policy_version 86223 (0.0032) [2024-06-27 22:04:33,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 1412759552. Throughput: 0: 44009.1. Samples: 1315680860. Policy #0 lag: (min: 1.0, avg: 11.7, max: 22.0) [2024-06-27 22:04:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:04:35,379][06909] Updated weights for policy 0, policy_version 86233 (0.0026) [2024-06-27 22:04:38,850][06674] Fps is (10 sec: 40960.9, 60 sec: 43963.9, 300 sec: 43931.3). Total num frames: 1412956160. Throughput: 0: 44001.4. Samples: 1315942400. Policy #0 lag: (min: 1.0, avg: 11.7, max: 22.0) [2024-06-27 22:04:38,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:04:39,686][06909] Updated weights for policy 0, policy_version 86243 (0.0045) [2024-06-27 22:04:42,645][06909] Updated weights for policy 0, policy_version 86253 (0.0034) [2024-06-27 22:04:43,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.8, 300 sec: 43820.3). Total num frames: 1413185536. Throughput: 0: 43810.6. Samples: 1316068700. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 22:04:43,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:04:46,897][06909] Updated weights for policy 0, policy_version 86263 (0.0043) [2024-06-27 22:04:48,852][06674] Fps is (10 sec: 47503.2, 60 sec: 44236.8, 300 sec: 44042.1). Total num frames: 1413431296. Throughput: 0: 44039.3. Samples: 1316342820. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 22:04:48,852][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:04:49,930][06909] Updated weights for policy 0, policy_version 86273 (0.0036) [2024-06-27 22:04:53,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.7, 300 sec: 43931.4). Total num frames: 1413627904. Throughput: 0: 44050.4. Samples: 1316609780. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 22:04:53,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 22:04:54,208][06909] Updated weights for policy 0, policy_version 86283 (0.0031) [2024-06-27 22:04:57,463][06909] Updated weights for policy 0, policy_version 86293 (0.0032) [2024-06-27 22:04:58,850][06674] Fps is (10 sec: 42607.1, 60 sec: 44236.8, 300 sec: 43931.6). Total num frames: 1413857280. Throughput: 0: 43917.3. Samples: 1316731980. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 22:04:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:05:01,718][06909] Updated weights for policy 0, policy_version 86303 (0.0035) [2024-06-27 22:05:03,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1414070272. Throughput: 0: 43948.7. Samples: 1317005120. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 22:05:03,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:05:05,230][06909] Updated weights for policy 0, policy_version 86313 (0.0029) [2024-06-27 22:05:08,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 1414283264. Throughput: 0: 43830.2. Samples: 1317261060. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 22:05:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:05:09,525][06909] Updated weights for policy 0, policy_version 86323 (0.0029) [2024-06-27 22:05:12,589][06909] Updated weights for policy 0, policy_version 86333 (0.0030) [2024-06-27 22:05:13,850][06674] Fps is (10 sec: 44237.7, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 1414512640. Throughput: 0: 43936.6. Samples: 1317392960. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 22:05:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:05:16,696][06909] Updated weights for policy 0, policy_version 86343 (0.0044) [2024-06-27 22:05:18,850][06674] Fps is (10 sec: 47513.2, 60 sec: 43966.2, 300 sec: 44097.9). Total num frames: 1414758400. Throughput: 0: 44031.5. Samples: 1317662280. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 22:05:18,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 22:05:20,124][06909] Updated weights for policy 0, policy_version 86353 (0.0029) [2024-06-27 22:05:22,966][06887] Signal inference workers to stop experience collection... (18850 times) [2024-06-27 22:05:22,967][06887] Signal inference workers to resume experience collection... (18850 times) [2024-06-27 22:05:23,011][06909] InferenceWorker_p0-w0: stopping experience collection (18850 times) [2024-06-27 22:05:23,011][06909] InferenceWorker_p0-w0: resuming experience collection (18850 times) [2024-06-27 22:05:23,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1414955008. Throughput: 0: 44104.3. Samples: 1317927100. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 22:05:23,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:05:23,884][06909] Updated weights for policy 0, policy_version 86363 (0.0038) [2024-06-27 22:05:27,687][06909] Updated weights for policy 0, policy_version 86373 (0.0034) [2024-06-27 22:05:28,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1415184384. Throughput: 0: 44113.8. Samples: 1318053820. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 22:05:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:05:31,261][06909] Updated weights for policy 0, policy_version 86383 (0.0029) [2024-06-27 22:05:33,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.8, 300 sec: 44098.9). Total num frames: 1415413760. Throughput: 0: 44123.4. Samples: 1318328280. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-27 22:05:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:05:34,918][06909] Updated weights for policy 0, policy_version 86393 (0.0034) [2024-06-27 22:05:38,703][06909] Updated weights for policy 0, policy_version 86403 (0.0026) [2024-06-27 22:05:38,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44509.7, 300 sec: 44042.4). Total num frames: 1415626752. Throughput: 0: 43947.4. Samples: 1318587420. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-27 22:05:38,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:05:42,364][06909] Updated weights for policy 0, policy_version 86413 (0.0043) [2024-06-27 22:05:43,852][06674] Fps is (10 sec: 42589.5, 60 sec: 44235.3, 300 sec: 43986.6). Total num frames: 1415839744. Throughput: 0: 44076.7. Samples: 1318715520. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-27 22:05:43,852][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:05:46,773][06909] Updated weights for policy 0, policy_version 86423 (0.0041) [2024-06-27 22:05:48,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43692.2, 300 sec: 44098.0). Total num frames: 1416052736. Throughput: 0: 43867.3. Samples: 1318979140. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-27 22:05:48,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:05:48,944][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000086430_1416069120.pth... [2024-06-27 22:05:48,994][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000085785_1405501440.pth [2024-06-27 22:05:50,161][06909] Updated weights for policy 0, policy_version 86433 (0.0027) [2024-06-27 22:05:53,855][06674] Fps is (10 sec: 42584.1, 60 sec: 43959.7, 300 sec: 43986.1). Total num frames: 1416265728. Throughput: 0: 43916.1. Samples: 1319237520. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-27 22:05:53,856][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 22:05:54,094][06909] Updated weights for policy 0, policy_version 86443 (0.0027) [2024-06-27 22:05:57,647][06909] Updated weights for policy 0, policy_version 86453 (0.0034) [2024-06-27 22:05:58,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1416495104. Throughput: 0: 43892.0. Samples: 1319368100. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-27 22:05:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:06:01,376][06909] Updated weights for policy 0, policy_version 86463 (0.0040) [2024-06-27 22:06:03,850][06674] Fps is (10 sec: 45899.7, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1416724480. Throughput: 0: 43854.7. Samples: 1319635740. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-27 22:06:03,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 22:06:05,067][06909] Updated weights for policy 0, policy_version 86473 (0.0037) [2024-06-27 22:06:08,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1416921088. Throughput: 0: 43773.9. Samples: 1319896920. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-27 22:06:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:06:08,874][06909] Updated weights for policy 0, policy_version 86483 (0.0030) [2024-06-27 22:06:12,594][06909] Updated weights for policy 0, policy_version 86493 (0.0024) [2024-06-27 22:06:13,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.6, 300 sec: 44097.9). Total num frames: 1417166848. Throughput: 0: 43973.2. Samples: 1320032620. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-27 22:06:13,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:06:16,156][06909] Updated weights for policy 0, policy_version 86503 (0.0024) [2024-06-27 22:06:18,850][06674] Fps is (10 sec: 45874.1, 60 sec: 43690.6, 300 sec: 44153.5). Total num frames: 1417379840. Throughput: 0: 43767.4. Samples: 1320297820. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-27 22:06:18,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 22:06:19,737][06909] Updated weights for policy 0, policy_version 86513 (0.0040) [2024-06-27 22:06:23,843][06909] Updated weights for policy 0, policy_version 86523 (0.0035) [2024-06-27 22:06:23,850][06674] Fps is (10 sec: 42599.3, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1417592832. Throughput: 0: 43810.4. Samples: 1320558880. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-27 22:06:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:06:27,304][06909] Updated weights for policy 0, policy_version 86533 (0.0050) [2024-06-27 22:06:28,852][06674] Fps is (10 sec: 44228.4, 60 sec: 43962.2, 300 sec: 44042.1). Total num frames: 1417822208. Throughput: 0: 43885.8. Samples: 1320690380. Policy #0 lag: (min: 1.0, avg: 10.3, max: 20.0) [2024-06-27 22:06:28,852][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:06:31,511][06909] Updated weights for policy 0, policy_version 86543 (0.0047) [2024-06-27 22:06:33,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43417.6, 300 sec: 44042.4). Total num frames: 1418018816. Throughput: 0: 43888.4. Samples: 1320954120. Policy #0 lag: (min: 1.0, avg: 10.3, max: 20.0) [2024-06-27 22:06:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:06:35,079][06909] Updated weights for policy 0, policy_version 86553 (0.0027) [2024-06-27 22:06:36,343][06887] Signal inference workers to stop experience collection... (18900 times) [2024-06-27 22:06:36,343][06887] Signal inference workers to resume experience collection... (18900 times) [2024-06-27 22:06:36,388][06909] InferenceWorker_p0-w0: stopping experience collection (18900 times) [2024-06-27 22:06:36,388][06909] InferenceWorker_p0-w0: resuming experience collection (18900 times) [2024-06-27 22:06:38,728][06909] Updated weights for policy 0, policy_version 86563 (0.0033) [2024-06-27 22:06:38,850][06674] Fps is (10 sec: 42607.4, 60 sec: 43690.8, 300 sec: 43986.9). Total num frames: 1418248192. Throughput: 0: 44050.3. Samples: 1321219540. Policy #0 lag: (min: 1.0, avg: 10.3, max: 20.0) [2024-06-27 22:06:38,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:06:42,349][06909] Updated weights for policy 0, policy_version 86573 (0.0035) [2024-06-27 22:06:43,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43965.2, 300 sec: 44042.4). Total num frames: 1418477568. Throughput: 0: 44111.0. Samples: 1321353100. Policy #0 lag: (min: 1.0, avg: 10.3, max: 20.0) [2024-06-27 22:06:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:06:46,001][06909] Updated weights for policy 0, policy_version 86583 (0.0043) [2024-06-27 22:06:48,850][06674] Fps is (10 sec: 44235.9, 60 sec: 43963.6, 300 sec: 44153.5). Total num frames: 1418690560. Throughput: 0: 43962.2. Samples: 1321614040. Policy #0 lag: (min: 1.0, avg: 10.3, max: 20.0) [2024-06-27 22:06:48,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:06:49,631][06909] Updated weights for policy 0, policy_version 86593 (0.0030) [2024-06-27 22:06:53,242][06909] Updated weights for policy 0, policy_version 86603 (0.0045) [2024-06-27 22:06:53,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43967.7, 300 sec: 43986.9). Total num frames: 1418903552. Throughput: 0: 44172.4. Samples: 1321884680. Policy #0 lag: (min: 1.0, avg: 10.3, max: 20.0) [2024-06-27 22:06:53,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:06:57,220][06909] Updated weights for policy 0, policy_version 86613 (0.0031) [2024-06-27 22:06:58,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1419132928. Throughput: 0: 44126.4. Samples: 1322018300. Policy #0 lag: (min: 1.0, avg: 10.3, max: 20.0) [2024-06-27 22:06:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:07:01,094][06909] Updated weights for policy 0, policy_version 86623 (0.0033) [2024-06-27 22:07:03,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43690.7, 300 sec: 44097.9). Total num frames: 1419345920. Throughput: 0: 44003.7. Samples: 1322277980. Policy #0 lag: (min: 1.0, avg: 10.3, max: 20.0) [2024-06-27 22:07:03,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 22:07:04,781][06909] Updated weights for policy 0, policy_version 86633 (0.0029) [2024-06-27 22:07:08,377][06909] Updated weights for policy 0, policy_version 86643 (0.0042) [2024-06-27 22:07:08,850][06674] Fps is (10 sec: 45874.5, 60 sec: 44509.7, 300 sec: 44042.4). Total num frames: 1419591680. Throughput: 0: 44153.1. Samples: 1322545780. Policy #0 lag: (min: 1.0, avg: 10.3, max: 20.0) [2024-06-27 22:07:08,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:07:12,259][06909] Updated weights for policy 0, policy_version 86653 (0.0037) [2024-06-27 22:07:13,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43690.8, 300 sec: 43986.9). Total num frames: 1419788288. Throughput: 0: 44134.9. Samples: 1322676360. Policy #0 lag: (min: 1.0, avg: 10.3, max: 20.0) [2024-06-27 22:07:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:07:15,831][06909] Updated weights for policy 0, policy_version 86663 (0.0040) [2024-06-27 22:07:18,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1420001280. Throughput: 0: 44049.6. Samples: 1322936360. Policy #0 lag: (min: 1.0, avg: 10.3, max: 20.0) [2024-06-27 22:07:18,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 22:07:19,564][06909] Updated weights for policy 0, policy_version 86673 (0.0032) [2024-06-27 22:07:23,347][06909] Updated weights for policy 0, policy_version 86683 (0.0028) [2024-06-27 22:07:23,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44236.7, 300 sec: 43987.2). Total num frames: 1420247040. Throughput: 0: 44105.2. Samples: 1323204280. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 22:07:23,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:07:27,340][06909] Updated weights for policy 0, policy_version 86693 (0.0028) [2024-06-27 22:07:28,850][06674] Fps is (10 sec: 45875.8, 60 sec: 43965.2, 300 sec: 44042.4). Total num frames: 1420460032. Throughput: 0: 44189.4. Samples: 1323341620. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 22:07:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:07:30,545][06909] Updated weights for policy 0, policy_version 86703 (0.0031) [2024-06-27 22:07:33,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44509.8, 300 sec: 44097.9). Total num frames: 1420689408. Throughput: 0: 44072.2. Samples: 1323597280. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 22:07:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:07:34,592][06909] Updated weights for policy 0, policy_version 86713 (0.0027) [2024-06-27 22:07:38,202][06909] Updated weights for policy 0, policy_version 86723 (0.0030) [2024-06-27 22:07:38,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1420902400. Throughput: 0: 44031.6. Samples: 1323866100. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 22:07:38,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 22:07:41,909][06909] Updated weights for policy 0, policy_version 86733 (0.0020) [2024-06-27 22:07:43,856][06674] Fps is (10 sec: 40935.3, 60 sec: 43686.3, 300 sec: 43930.4). Total num frames: 1421099008. Throughput: 0: 43886.6. Samples: 1323993460. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 22:07:43,856][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 22:07:45,492][06909] Updated weights for policy 0, policy_version 86743 (0.0034) [2024-06-27 22:07:48,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 1421344768. Throughput: 0: 43992.5. Samples: 1324257640. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 22:07:48,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:07:48,864][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000086752_1421344768.pth... [2024-06-27 22:07:48,913][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000086107_1410777088.pth [2024-06-27 22:07:49,553][06909] Updated weights for policy 0, policy_version 86753 (0.0035) [2024-06-27 22:07:52,904][06909] Updated weights for policy 0, policy_version 86763 (0.0041) [2024-06-27 22:07:53,850][06674] Fps is (10 sec: 47542.5, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 1421574144. Throughput: 0: 43954.0. Samples: 1324523700. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 22:07:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:07:56,823][06909] Updated weights for policy 0, policy_version 86773 (0.0032) [2024-06-27 22:07:58,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 1421754368. Throughput: 0: 44020.0. Samples: 1324657260. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 22:07:58,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 22:08:00,398][06909] Updated weights for policy 0, policy_version 86783 (0.0039) [2024-06-27 22:08:00,629][06887] Signal inference workers to stop experience collection... (18950 times) [2024-06-27 22:08:00,662][06909] InferenceWorker_p0-w0: stopping experience collection (18950 times) [2024-06-27 22:08:00,686][06887] Signal inference workers to resume experience collection... (18950 times) [2024-06-27 22:08:00,692][06909] InferenceWorker_p0-w0: resuming experience collection (18950 times) [2024-06-27 22:08:03,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 1422000128. Throughput: 0: 43995.3. Samples: 1324916140. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 22:08:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:08:04,423][06909] Updated weights for policy 0, policy_version 86793 (0.0021) [2024-06-27 22:08:07,901][06909] Updated weights for policy 0, policy_version 86803 (0.0041) [2024-06-27 22:08:08,850][06674] Fps is (10 sec: 47513.7, 60 sec: 43963.9, 300 sec: 44042.4). Total num frames: 1422229504. Throughput: 0: 44043.7. Samples: 1325186240. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 22:08:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:08:11,763][06909] Updated weights for policy 0, policy_version 86813 (0.0030) [2024-06-27 22:08:13,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1422426112. Throughput: 0: 43854.2. Samples: 1325315060. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-27 22:08:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:08:15,737][06909] Updated weights for policy 0, policy_version 86823 (0.0041) [2024-06-27 22:08:18,850][06674] Fps is (10 sec: 42598.2, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 1422655488. Throughput: 0: 43772.0. Samples: 1325567020. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-27 22:08:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:08:19,164][06909] Updated weights for policy 0, policy_version 86833 (0.0046) [2024-06-27 22:08:22,990][06909] Updated weights for policy 0, policy_version 86843 (0.0038) [2024-06-27 22:08:23,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1422868480. Throughput: 0: 43778.1. Samples: 1325836120. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-27 22:08:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:08:26,635][06909] Updated weights for policy 0, policy_version 86853 (0.0028) [2024-06-27 22:08:28,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 1423081472. Throughput: 0: 43777.8. Samples: 1325963200. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-27 22:08:28,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:08:30,237][06909] Updated weights for policy 0, policy_version 86863 (0.0029) [2024-06-27 22:08:33,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1423310848. Throughput: 0: 43811.6. Samples: 1326229160. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-27 22:08:33,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 22:08:34,158][06909] Updated weights for policy 0, policy_version 86873 (0.0028) [2024-06-27 22:08:37,877][06909] Updated weights for policy 0, policy_version 86883 (0.0025) [2024-06-27 22:08:38,855][06674] Fps is (10 sec: 47487.5, 60 sec: 44232.6, 300 sec: 44041.6). Total num frames: 1423556608. Throughput: 0: 43802.9. Samples: 1326495080. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-27 22:08:38,856][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:08:41,923][06909] Updated weights for policy 0, policy_version 86893 (0.0047) [2024-06-27 22:08:43,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43968.2, 300 sec: 43931.7). Total num frames: 1423736832. Throughput: 0: 43760.9. Samples: 1326626500. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-27 22:08:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:08:45,298][06909] Updated weights for policy 0, policy_version 86903 (0.0037) [2024-06-27 22:08:48,850][06674] Fps is (10 sec: 40983.0, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1423966208. Throughput: 0: 43727.5. Samples: 1326883880. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-27 22:08:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:08:49,234][06909] Updated weights for policy 0, policy_version 86913 (0.0045) [2024-06-27 22:08:52,748][06909] Updated weights for policy 0, policy_version 86923 (0.0030) [2024-06-27 22:08:53,850][06674] Fps is (10 sec: 47512.8, 60 sec: 43963.6, 300 sec: 44097.9). Total num frames: 1424211968. Throughput: 0: 43704.3. Samples: 1327152940. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-27 22:08:53,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:08:56,543][06909] Updated weights for policy 0, policy_version 86933 (0.0031) [2024-06-27 22:08:58,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1424408576. Throughput: 0: 43829.8. Samples: 1327287400. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-27 22:08:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:09:00,010][06909] Updated weights for policy 0, policy_version 86943 (0.0031) [2024-06-27 22:09:03,626][06909] Updated weights for policy 0, policy_version 86953 (0.0029) [2024-06-27 22:09:03,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1424637952. Throughput: 0: 44159.5. Samples: 1327554200. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-27 22:09:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:09:07,237][06909] Updated weights for policy 0, policy_version 86963 (0.0027) [2024-06-27 22:09:08,852][06674] Fps is (10 sec: 45865.6, 60 sec: 43962.2, 300 sec: 44042.1). Total num frames: 1424867328. Throughput: 0: 44148.2. Samples: 1327822880. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-27 22:09:08,853][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:09:11,274][06909] Updated weights for policy 0, policy_version 86973 (0.0026) [2024-06-27 22:09:13,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.8, 300 sec: 43931.9). Total num frames: 1425080320. Throughput: 0: 44300.1. Samples: 1327956700. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-27 22:09:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:09:14,454][06909] Updated weights for policy 0, policy_version 86983 (0.0034) [2024-06-27 22:09:18,523][06909] Updated weights for policy 0, policy_version 86993 (0.0026) [2024-06-27 22:09:18,856][06674] Fps is (10 sec: 42583.1, 60 sec: 43959.6, 300 sec: 44041.6). Total num frames: 1425293312. Throughput: 0: 44314.4. Samples: 1328223560. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-27 22:09:18,856][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:09:22,185][06909] Updated weights for policy 0, policy_version 87003 (0.0030) [2024-06-27 22:09:23,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1425522688. Throughput: 0: 44170.8. Samples: 1328482520. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-27 22:09:23,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 22:09:26,028][06909] Updated weights for policy 0, policy_version 87013 (0.0023) [2024-06-27 22:09:28,850][06674] Fps is (10 sec: 45901.4, 60 sec: 44510.0, 300 sec: 44042.4). Total num frames: 1425752064. Throughput: 0: 44305.8. Samples: 1328620260. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-27 22:09:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:09:29,593][06909] Updated weights for policy 0, policy_version 87023 (0.0026) [2024-06-27 22:09:33,603][06909] Updated weights for policy 0, policy_version 87033 (0.0039) [2024-06-27 22:09:33,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 1425965056. Throughput: 0: 44388.4. Samples: 1328881360. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-27 22:09:33,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:09:37,002][06909] Updated weights for policy 0, policy_version 87043 (0.0026) [2024-06-27 22:09:38,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43967.9, 300 sec: 44098.0). Total num frames: 1426194432. Throughput: 0: 44234.8. Samples: 1329143500. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-27 22:09:38,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:09:40,846][06909] Updated weights for policy 0, policy_version 87053 (0.0031) [2024-06-27 22:09:42,354][06887] Signal inference workers to stop experience collection... (19000 times) [2024-06-27 22:09:42,396][06909] InferenceWorker_p0-w0: stopping experience collection (19000 times) [2024-06-27 22:09:42,402][06887] Signal inference workers to resume experience collection... (19000 times) [2024-06-27 22:09:42,409][06909] InferenceWorker_p0-w0: resuming experience collection (19000 times) [2024-06-27 22:09:43,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44509.8, 300 sec: 43987.2). Total num frames: 1426407424. Throughput: 0: 44231.1. Samples: 1329277800. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-27 22:09:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:09:44,391][06909] Updated weights for policy 0, policy_version 87063 (0.0040) [2024-06-27 22:09:48,583][06909] Updated weights for policy 0, policy_version 87073 (0.0029) [2024-06-27 22:09:48,852][06674] Fps is (10 sec: 40951.3, 60 sec: 43962.2, 300 sec: 43986.6). Total num frames: 1426604032. Throughput: 0: 44162.0. Samples: 1329541580. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-27 22:09:48,852][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:09:48,872][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000087073_1426604032.pth... [2024-06-27 22:09:48,933][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000086430_1416069120.pth [2024-06-27 22:09:52,140][06909] Updated weights for policy 0, policy_version 87083 (0.0025) [2024-06-27 22:09:53,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1426833408. Throughput: 0: 43847.8. Samples: 1329795940. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-27 22:09:53,853][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:09:55,976][06909] Updated weights for policy 0, policy_version 87093 (0.0034) [2024-06-27 22:09:58,850][06674] Fps is (10 sec: 47523.2, 60 sec: 44509.8, 300 sec: 44098.0). Total num frames: 1427079168. Throughput: 0: 44060.4. Samples: 1329939420. Policy #0 lag: (min: 1.0, avg: 11.0, max: 22.0) [2024-06-27 22:09:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:09:59,357][06909] Updated weights for policy 0, policy_version 87103 (0.0038) [2024-06-27 22:10:03,283][06909] Updated weights for policy 0, policy_version 87113 (0.0039) [2024-06-27 22:10:03,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1427275776. Throughput: 0: 43992.3. Samples: 1330202960. Policy #0 lag: (min: 1.0, avg: 11.0, max: 22.0) [2024-06-27 22:10:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:10:07,107][06909] Updated weights for policy 0, policy_version 87123 (0.0039) [2024-06-27 22:10:08,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43692.1, 300 sec: 43986.8). Total num frames: 1427488768. Throughput: 0: 43918.5. Samples: 1330458860. Policy #0 lag: (min: 1.0, avg: 11.0, max: 22.0) [2024-06-27 22:10:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:10:10,866][06909] Updated weights for policy 0, policy_version 87133 (0.0027) [2024-06-27 22:10:13,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.8, 300 sec: 43931.4). Total num frames: 1427718144. Throughput: 0: 43800.9. Samples: 1330591300. Policy #0 lag: (min: 1.0, avg: 11.0, max: 22.0) [2024-06-27 22:10:13,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:10:14,637][06909] Updated weights for policy 0, policy_version 87143 (0.0034) [2024-06-27 22:10:18,081][06909] Updated weights for policy 0, policy_version 87153 (0.0027) [2024-06-27 22:10:18,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43967.8, 300 sec: 43986.9). Total num frames: 1427931136. Throughput: 0: 43905.2. Samples: 1330857100. Policy #0 lag: (min: 1.0, avg: 11.0, max: 22.0) [2024-06-27 22:10:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:10:21,849][06909] Updated weights for policy 0, policy_version 87163 (0.0032) [2024-06-27 22:10:23,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1428160512. Throughput: 0: 43758.1. Samples: 1331112620. Policy #0 lag: (min: 1.0, avg: 11.0, max: 22.0) [2024-06-27 22:10:23,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:10:25,805][06909] Updated weights for policy 0, policy_version 87173 (0.0023) [2024-06-27 22:10:28,852][06674] Fps is (10 sec: 45866.4, 60 sec: 43962.2, 300 sec: 43986.6). Total num frames: 1428389888. Throughput: 0: 43915.9. Samples: 1331254100. Policy #0 lag: (min: 1.0, avg: 11.0, max: 22.0) [2024-06-27 22:10:28,852][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:10:29,077][06909] Updated weights for policy 0, policy_version 87183 (0.0040) [2024-06-27 22:10:33,470][06909] Updated weights for policy 0, policy_version 87193 (0.0031) [2024-06-27 22:10:33,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1428602880. Throughput: 0: 43971.9. Samples: 1331520220. Policy #0 lag: (min: 1.0, avg: 11.0, max: 22.0) [2024-06-27 22:10:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:10:36,516][06909] Updated weights for policy 0, policy_version 87203 (0.0033) [2024-06-27 22:10:38,850][06674] Fps is (10 sec: 42607.4, 60 sec: 43690.7, 300 sec: 43987.2). Total num frames: 1428815872. Throughput: 0: 44137.9. Samples: 1331782140. Policy #0 lag: (min: 1.0, avg: 11.0, max: 22.0) [2024-06-27 22:10:38,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:10:40,802][06909] Updated weights for policy 0, policy_version 87213 (0.0037) [2024-06-27 22:10:43,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1429045248. Throughput: 0: 43863.6. Samples: 1331913280. Policy #0 lag: (min: 1.0, avg: 11.0, max: 22.0) [2024-06-27 22:10:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:10:44,071][06909] Updated weights for policy 0, policy_version 87223 (0.0041) [2024-06-27 22:10:48,023][06909] Updated weights for policy 0, policy_version 87233 (0.0034) [2024-06-27 22:10:48,852][06674] Fps is (10 sec: 42589.0, 60 sec: 43963.7, 300 sec: 43987.4). Total num frames: 1429241856. Throughput: 0: 43964.1. Samples: 1332181440. Policy #0 lag: (min: 1.0, avg: 11.0, max: 22.0) [2024-06-27 22:10:48,853][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:10:51,839][06909] Updated weights for policy 0, policy_version 87243 (0.0034) [2024-06-27 22:10:53,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1429471232. Throughput: 0: 44137.0. Samples: 1332445020. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 22:10:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:10:55,310][06909] Updated weights for policy 0, policy_version 87253 (0.0022) [2024-06-27 22:10:58,850][06674] Fps is (10 sec: 45885.0, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1429700608. Throughput: 0: 44176.4. Samples: 1332579240. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 22:10:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:10:59,225][06909] Updated weights for policy 0, policy_version 87263 (0.0029) [2024-06-27 22:11:03,144][06909] Updated weights for policy 0, policy_version 87273 (0.0029) [2024-06-27 22:11:03,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1429913600. Throughput: 0: 44069.9. Samples: 1332840240. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 22:11:03,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:11:06,382][06909] Updated weights for policy 0, policy_version 87283 (0.0034) [2024-06-27 22:11:08,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.9, 300 sec: 43931.4). Total num frames: 1430126592. Throughput: 0: 44246.3. Samples: 1333103700. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 22:11:08,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:11:10,582][06909] Updated weights for policy 0, policy_version 87293 (0.0029) [2024-06-27 22:11:13,690][06909] Updated weights for policy 0, policy_version 87303 (0.0042) [2024-06-27 22:11:13,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 1430372352. Throughput: 0: 44174.8. Samples: 1333241880. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 22:11:13,851][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:11:17,823][06909] Updated weights for policy 0, policy_version 87313 (0.0031) [2024-06-27 22:11:18,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1430568960. Throughput: 0: 44198.1. Samples: 1333509140. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 22:11:18,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 22:11:21,219][06909] Updated weights for policy 0, policy_version 87323 (0.0031) [2024-06-27 22:11:23,856][06674] Fps is (10 sec: 40935.5, 60 sec: 43686.3, 300 sec: 43930.7). Total num frames: 1430781952. Throughput: 0: 44191.3. Samples: 1333771020. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 22:11:23,857][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:11:25,158][06909] Updated weights for policy 0, policy_version 87333 (0.0033) [2024-06-27 22:11:28,723][06909] Updated weights for policy 0, policy_version 87343 (0.0031) [2024-06-27 22:11:28,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43965.2, 300 sec: 44097.9). Total num frames: 1431027712. Throughput: 0: 44228.9. Samples: 1333903580. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 22:11:28,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:11:32,434][06909] Updated weights for policy 0, policy_version 87353 (0.0029) [2024-06-27 22:11:33,850][06674] Fps is (10 sec: 45903.0, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1431240704. Throughput: 0: 44223.9. Samples: 1334171420. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 22:11:33,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:11:36,271][06909] Updated weights for policy 0, policy_version 87363 (0.0035) [2024-06-27 22:11:38,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 1431453696. Throughput: 0: 44276.8. Samples: 1334437480. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 22:11:38,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:11:39,976][06909] Updated weights for policy 0, policy_version 87373 (0.0037) [2024-06-27 22:11:41,309][06887] Signal inference workers to stop experience collection... (19050 times) [2024-06-27 22:11:41,310][06887] Signal inference workers to resume experience collection... (19050 times) [2024-06-27 22:11:41,354][06909] InferenceWorker_p0-w0: stopping experience collection (19050 times) [2024-06-27 22:11:41,354][06909] InferenceWorker_p0-w0: resuming experience collection (19050 times) [2024-06-27 22:11:43,522][06909] Updated weights for policy 0, policy_version 87383 (0.0029) [2024-06-27 22:11:43,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1431699456. Throughput: 0: 44099.6. Samples: 1334563720. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 22:11:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:11:47,523][06909] Updated weights for policy 0, policy_version 87393 (0.0045) [2024-06-27 22:11:48,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44511.4, 300 sec: 44097.9). Total num frames: 1431912448. Throughput: 0: 44382.2. Samples: 1334837440. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-27 22:11:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:11:48,917][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000087398_1431928832.pth... [2024-06-27 22:11:48,956][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000086752_1421344768.pth [2024-06-27 22:11:50,675][06909] Updated weights for policy 0, policy_version 87403 (0.0038) [2024-06-27 22:11:53,850][06674] Fps is (10 sec: 39320.9, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 1432092672. Throughput: 0: 44427.9. Samples: 1335102960. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-27 22:11:53,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:11:54,934][06909] Updated weights for policy 0, policy_version 87413 (0.0044) [2024-06-27 22:11:58,170][06909] Updated weights for policy 0, policy_version 87423 (0.0041) [2024-06-27 22:11:58,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 1432371200. Throughput: 0: 44110.6. Samples: 1335226860. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-27 22:11:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:12:02,149][06909] Updated weights for policy 0, policy_version 87433 (0.0033) [2024-06-27 22:12:03,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 1432567808. Throughput: 0: 44114.6. Samples: 1335494300. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-27 22:12:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:12:05,325][06909] Updated weights for policy 0, policy_version 87443 (0.0022) [2024-06-27 22:12:08,850][06674] Fps is (10 sec: 40960.3, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1432780800. Throughput: 0: 44388.2. Samples: 1335768220. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-27 22:12:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:12:09,879][06909] Updated weights for policy 0, policy_version 87453 (0.0030) [2024-06-27 22:12:13,021][06909] Updated weights for policy 0, policy_version 87463 (0.0041) [2024-06-27 22:12:13,850][06674] Fps is (10 sec: 47513.7, 60 sec: 44509.8, 300 sec: 44209.0). Total num frames: 1433042944. Throughput: 0: 44250.6. Samples: 1335894860. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-27 22:12:13,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 22:12:17,267][06909] Updated weights for policy 0, policy_version 87473 (0.0041) [2024-06-27 22:12:18,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 1433239552. Throughput: 0: 44151.0. Samples: 1336158220. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-27 22:12:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:12:20,506][06909] Updated weights for policy 0, policy_version 87483 (0.0028) [2024-06-27 22:12:23,850][06674] Fps is (10 sec: 37683.6, 60 sec: 43968.2, 300 sec: 43931.3). Total num frames: 1433419776. Throughput: 0: 44141.0. Samples: 1336423820. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-27 22:12:23,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:12:24,716][06909] Updated weights for policy 0, policy_version 87493 (0.0038) [2024-06-27 22:12:27,845][06909] Updated weights for policy 0, policy_version 87503 (0.0031) [2024-06-27 22:12:28,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1433681920. Throughput: 0: 44119.4. Samples: 1336549100. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-27 22:12:28,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 22:12:32,282][06909] Updated weights for policy 0, policy_version 87513 (0.0027) [2024-06-27 22:12:33,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1433894912. Throughput: 0: 43840.5. Samples: 1336810260. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-27 22:12:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:12:35,428][06909] Updated weights for policy 0, policy_version 87523 (0.0029) [2024-06-27 22:12:38,852][06674] Fps is (10 sec: 40951.7, 60 sec: 43962.3, 300 sec: 44043.0). Total num frames: 1434091520. Throughput: 0: 43941.6. Samples: 1337080420. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 22:12:38,852][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:12:39,738][06909] Updated weights for policy 0, policy_version 87533 (0.0042) [2024-06-27 22:12:42,879][06909] Updated weights for policy 0, policy_version 87543 (0.0037) [2024-06-27 22:12:43,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1434337280. Throughput: 0: 43829.0. Samples: 1337199160. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 22:12:43,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:12:47,234][06909] Updated weights for policy 0, policy_version 87553 (0.0031) [2024-06-27 22:12:48,850][06674] Fps is (10 sec: 45884.6, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1434550272. Throughput: 0: 44030.3. Samples: 1337475660. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 22:12:48,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 22:12:50,375][06909] Updated weights for policy 0, policy_version 87563 (0.0031) [2024-06-27 22:12:53,851][06674] Fps is (10 sec: 40954.9, 60 sec: 44236.0, 300 sec: 44042.2). Total num frames: 1434746880. Throughput: 0: 43698.0. Samples: 1337734680. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 22:12:53,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:12:54,889][06909] Updated weights for policy 0, policy_version 87573 (0.0047) [2024-06-27 22:12:55,588][06887] Signal inference workers to stop experience collection... (19100 times) [2024-06-27 22:12:55,589][06887] Signal inference workers to resume experience collection... (19100 times) [2024-06-27 22:12:55,627][06909] InferenceWorker_p0-w0: stopping experience collection (19100 times) [2024-06-27 22:12:55,627][06909] InferenceWorker_p0-w0: resuming experience collection (19100 times) [2024-06-27 22:12:57,986][06909] Updated weights for policy 0, policy_version 87583 (0.0036) [2024-06-27 22:12:58,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.8, 300 sec: 44042.4). Total num frames: 1434992640. Throughput: 0: 43688.6. Samples: 1337860840. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 22:12:58,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 22:13:02,161][06909] Updated weights for policy 0, policy_version 87593 (0.0047) [2024-06-27 22:13:03,850][06674] Fps is (10 sec: 45879.1, 60 sec: 43963.6, 300 sec: 43986.8). Total num frames: 1435205632. Throughput: 0: 43959.3. Samples: 1338136400. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 22:13:03,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:13:05,251][06909] Updated weights for policy 0, policy_version 87603 (0.0037) [2024-06-27 22:13:08,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 1435402240. Throughput: 0: 43864.8. Samples: 1338397740. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 22:13:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:13:09,642][06909] Updated weights for policy 0, policy_version 87613 (0.0033) [2024-06-27 22:13:12,479][06909] Updated weights for policy 0, policy_version 87623 (0.0030) [2024-06-27 22:13:13,850][06674] Fps is (10 sec: 45876.9, 60 sec: 43690.8, 300 sec: 44098.0). Total num frames: 1435664384. Throughput: 0: 44000.6. Samples: 1338529120. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 22:13:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:13:16,865][06909] Updated weights for policy 0, policy_version 87633 (0.0028) [2024-06-27 22:13:18,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1435860992. Throughput: 0: 44069.8. Samples: 1338793400. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 22:13:18,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:13:19,886][06909] Updated weights for policy 0, policy_version 87643 (0.0035) [2024-06-27 22:13:23,850][06674] Fps is (10 sec: 42597.7, 60 sec: 44509.8, 300 sec: 44098.0). Total num frames: 1436090368. Throughput: 0: 43972.6. Samples: 1339059100. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 22:13:23,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:13:24,146][06909] Updated weights for policy 0, policy_version 87653 (0.0026) [2024-06-27 22:13:27,226][06909] Updated weights for policy 0, policy_version 87663 (0.0034) [2024-06-27 22:13:28,850][06674] Fps is (10 sec: 47513.2, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1436336128. Throughput: 0: 44309.2. Samples: 1339193080. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 22:13:28,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:13:31,841][06909] Updated weights for policy 0, policy_version 87673 (0.0025) [2024-06-27 22:13:33,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44236.8, 300 sec: 44043.2). Total num frames: 1436549120. Throughput: 0: 44019.1. Samples: 1339456520. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-27 22:13:33,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 22:13:35,014][06909] Updated weights for policy 0, policy_version 87683 (0.0042) [2024-06-27 22:13:38,850][06674] Fps is (10 sec: 40960.5, 60 sec: 44238.4, 300 sec: 44098.0). Total num frames: 1436745728. Throughput: 0: 44161.7. Samples: 1339721900. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-27 22:13:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:13:39,281][06909] Updated weights for policy 0, policy_version 87693 (0.0042) [2024-06-27 22:13:42,188][06909] Updated weights for policy 0, policy_version 87703 (0.0038) [2024-06-27 22:13:43,852][06674] Fps is (10 sec: 44227.9, 60 sec: 44235.2, 300 sec: 44153.2). Total num frames: 1436991488. Throughput: 0: 44205.9. Samples: 1339850200. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-27 22:13:43,852][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:13:46,583][06909] Updated weights for policy 0, policy_version 87713 (0.0028) [2024-06-27 22:13:48,850][06674] Fps is (10 sec: 44235.7, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 1437188096. Throughput: 0: 44013.9. Samples: 1340117020. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-27 22:13:48,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:13:48,997][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000087720_1437204480.pth... [2024-06-27 22:13:49,052][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000087073_1426604032.pth [2024-06-27 22:13:49,426][06909] Updated weights for policy 0, policy_version 87723 (0.0034) [2024-06-27 22:13:53,851][06674] Fps is (10 sec: 40964.6, 60 sec: 44237.0, 300 sec: 44042.3). Total num frames: 1437401088. Throughput: 0: 43989.3. Samples: 1340377300. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-27 22:13:53,851][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:13:54,222][06909] Updated weights for policy 0, policy_version 87733 (0.0037) [2024-06-27 22:13:57,166][06909] Updated weights for policy 0, policy_version 87743 (0.0032) [2024-06-27 22:13:58,850][06674] Fps is (10 sec: 45875.8, 60 sec: 44236.7, 300 sec: 44098.0). Total num frames: 1437646848. Throughput: 0: 43951.4. Samples: 1340506940. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-27 22:13:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:14:01,536][06909] Updated weights for policy 0, policy_version 87753 (0.0020) [2024-06-27 22:14:03,850][06674] Fps is (10 sec: 45879.6, 60 sec: 44237.1, 300 sec: 44042.7). Total num frames: 1437859840. Throughput: 0: 43980.0. Samples: 1340772500. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-27 22:14:03,850][06674] Avg episode reward: [(0, '0.404')] [2024-06-27 22:14:04,506][06909] Updated weights for policy 0, policy_version 87763 (0.0042) [2024-06-27 22:14:07,541][06887] Signal inference workers to stop experience collection... (19150 times) [2024-06-27 22:14:07,541][06887] Signal inference workers to resume experience collection... (19150 times) [2024-06-27 22:14:07,561][06909] InferenceWorker_p0-w0: stopping experience collection (19150 times) [2024-06-27 22:14:07,561][06909] InferenceWorker_p0-w0: resuming experience collection (19150 times) [2024-06-27 22:14:08,850][06674] Fps is (10 sec: 40960.2, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1438056448. Throughput: 0: 43953.0. Samples: 1341036980. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-27 22:14:08,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:14:09,067][06909] Updated weights for policy 0, policy_version 87773 (0.0039) [2024-06-27 22:14:11,971][06909] Updated weights for policy 0, policy_version 87783 (0.0030) [2024-06-27 22:14:13,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.6, 300 sec: 44098.8). Total num frames: 1438302208. Throughput: 0: 43916.4. Samples: 1341169320. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-27 22:14:13,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:14:16,406][06909] Updated weights for policy 0, policy_version 87793 (0.0038) [2024-06-27 22:14:18,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 1438515200. Throughput: 0: 43901.8. Samples: 1341432100. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-27 22:14:18,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:14:19,602][06909] Updated weights for policy 0, policy_version 87803 (0.0026) [2024-06-27 22:14:23,746][06909] Updated weights for policy 0, policy_version 87813 (0.0039) [2024-06-27 22:14:23,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1438728192. Throughput: 0: 43837.3. Samples: 1341694580. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-27 22:14:23,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:14:27,322][06909] Updated weights for policy 0, policy_version 87823 (0.0037) [2024-06-27 22:14:28,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1438957568. Throughput: 0: 43844.6. Samples: 1341823120. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 22:14:28,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:14:31,336][06909] Updated weights for policy 0, policy_version 87833 (0.0040) [2024-06-27 22:14:33,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43417.7, 300 sec: 43931.3). Total num frames: 1439154176. Throughput: 0: 43807.8. Samples: 1342088360. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 22:14:33,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:14:34,750][06909] Updated weights for policy 0, policy_version 87843 (0.0033) [2024-06-27 22:14:38,559][06909] Updated weights for policy 0, policy_version 87853 (0.0030) [2024-06-27 22:14:38,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 1439383552. Throughput: 0: 43971.0. Samples: 1342355960. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 22:14:38,851][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:14:41,976][06909] Updated weights for policy 0, policy_version 87863 (0.0032) [2024-06-27 22:14:43,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43692.2, 300 sec: 44098.3). Total num frames: 1439612928. Throughput: 0: 43922.7. Samples: 1342483460. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 22:14:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:14:46,433][06909] Updated weights for policy 0, policy_version 87873 (0.0044) [2024-06-27 22:14:48,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 1439842304. Throughput: 0: 43920.0. Samples: 1342748900. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 22:14:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:14:49,554][06909] Updated weights for policy 0, policy_version 87883 (0.0033) [2024-06-27 22:14:53,654][06909] Updated weights for policy 0, policy_version 87893 (0.0027) [2024-06-27 22:14:53,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44237.4, 300 sec: 43986.9). Total num frames: 1440055296. Throughput: 0: 44017.7. Samples: 1343017780. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 22:14:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:14:57,030][06909] Updated weights for policy 0, policy_version 87903 (0.0041) [2024-06-27 22:14:58,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1440268288. Throughput: 0: 43833.0. Samples: 1343141800. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 22:14:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:15:01,254][06909] Updated weights for policy 0, policy_version 87913 (0.0028) [2024-06-27 22:15:03,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.5, 300 sec: 44042.4). Total num frames: 1440481280. Throughput: 0: 43901.7. Samples: 1343407680. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 22:15:03,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:15:04,755][06909] Updated weights for policy 0, policy_version 87923 (0.0031) [2024-06-27 22:15:08,626][06909] Updated weights for policy 0, policy_version 87933 (0.0042) [2024-06-27 22:15:08,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1440710656. Throughput: 0: 43897.7. Samples: 1343669980. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 22:15:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:15:11,928][06909] Updated weights for policy 0, policy_version 87943 (0.0040) [2024-06-27 22:15:13,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1440923648. Throughput: 0: 44010.7. Samples: 1343803600. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-27 22:15:13,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:15:15,813][06909] Updated weights for policy 0, policy_version 87953 (0.0030) [2024-06-27 22:15:18,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1441153024. Throughput: 0: 44052.9. Samples: 1344070740. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 22:15:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:15:19,189][06909] Updated weights for policy 0, policy_version 87963 (0.0031) [2024-06-27 22:15:23,492][06909] Updated weights for policy 0, policy_version 87973 (0.0031) [2024-06-27 22:15:23,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44236.8, 300 sec: 44042.7). Total num frames: 1441382400. Throughput: 0: 44081.4. Samples: 1344339620. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 22:15:23,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:15:26,477][06909] Updated weights for policy 0, policy_version 87983 (0.0038) [2024-06-27 22:15:28,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1441579008. Throughput: 0: 43992.5. Samples: 1344463120. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 22:15:28,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:15:30,513][06887] Signal inference workers to stop experience collection... (19200 times) [2024-06-27 22:15:30,513][06887] Signal inference workers to resume experience collection... (19200 times) [2024-06-27 22:15:30,549][06909] InferenceWorker_p0-w0: stopping experience collection (19200 times) [2024-06-27 22:15:30,549][06909] InferenceWorker_p0-w0: resuming experience collection (19200 times) [2024-06-27 22:15:30,808][06909] Updated weights for policy 0, policy_version 87993 (0.0037) [2024-06-27 22:15:33,850][06674] Fps is (10 sec: 42597.8, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 1441808384. Throughput: 0: 43984.3. Samples: 1344728200. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 22:15:33,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:15:34,459][06909] Updated weights for policy 0, policy_version 88003 (0.0051) [2024-06-27 22:15:38,494][06909] Updated weights for policy 0, policy_version 88013 (0.0039) [2024-06-27 22:15:38,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1442021376. Throughput: 0: 43972.1. Samples: 1344996520. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 22:15:38,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:15:41,768][06909] Updated weights for policy 0, policy_version 88023 (0.0032) [2024-06-27 22:15:43,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43690.7, 300 sec: 44042.7). Total num frames: 1442234368. Throughput: 0: 44028.4. Samples: 1345123080. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 22:15:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:15:45,754][06909] Updated weights for policy 0, policy_version 88033 (0.0037) [2024-06-27 22:15:48,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 1442463744. Throughput: 0: 43969.4. Samples: 1345386300. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 22:15:48,852][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:15:48,916][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000088042_1442480128.pth... [2024-06-27 22:15:48,986][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000087398_1431928832.pth [2024-06-27 22:15:49,134][06909] Updated weights for policy 0, policy_version 88043 (0.0026) [2024-06-27 22:15:53,175][06909] Updated weights for policy 0, policy_version 88053 (0.0029) [2024-06-27 22:15:53,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1442693120. Throughput: 0: 43956.9. Samples: 1345648040. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 22:15:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:15:56,464][06909] Updated weights for policy 0, policy_version 88063 (0.0028) [2024-06-27 22:15:58,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 1442906112. Throughput: 0: 43896.4. Samples: 1345778940. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 22:15:58,859][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:16:00,926][06909] Updated weights for policy 0, policy_version 88073 (0.0043) [2024-06-27 22:16:03,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 1443135488. Throughput: 0: 43888.3. Samples: 1346045720. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 22:16:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:16:03,913][06909] Updated weights for policy 0, policy_version 88083 (0.0023) [2024-06-27 22:16:08,282][06909] Updated weights for policy 0, policy_version 88093 (0.0038) [2024-06-27 22:16:08,850][06674] Fps is (10 sec: 44237.6, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1443348480. Throughput: 0: 43778.3. Samples: 1346309640. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-27 22:16:08,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:16:11,530][06909] Updated weights for policy 0, policy_version 88103 (0.0030) [2024-06-27 22:16:13,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1443577856. Throughput: 0: 43865.7. Samples: 1346437080. Policy #0 lag: (min: 0.0, avg: 11.0, max: 25.0) [2024-06-27 22:16:13,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 22:16:15,769][06909] Updated weights for policy 0, policy_version 88113 (0.0037) [2024-06-27 22:16:18,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43690.6, 300 sec: 44043.3). Total num frames: 1443774464. Throughput: 0: 43871.6. Samples: 1346702420. Policy #0 lag: (min: 0.0, avg: 11.0, max: 25.0) [2024-06-27 22:16:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:16:19,132][06909] Updated weights for policy 0, policy_version 88123 (0.0035) [2024-06-27 22:16:23,054][06909] Updated weights for policy 0, policy_version 88133 (0.0037) [2024-06-27 22:16:23,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1444003840. Throughput: 0: 43677.3. Samples: 1346962000. Policy #0 lag: (min: 0.0, avg: 11.0, max: 25.0) [2024-06-27 22:16:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:16:26,757][06909] Updated weights for policy 0, policy_version 88143 (0.0027) [2024-06-27 22:16:28,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1444216832. Throughput: 0: 43875.1. Samples: 1347097460. Policy #0 lag: (min: 0.0, avg: 11.0, max: 25.0) [2024-06-27 22:16:28,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:16:30,474][06909] Updated weights for policy 0, policy_version 88153 (0.0027) [2024-06-27 22:16:33,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.9, 300 sec: 44042.4). Total num frames: 1444446208. Throughput: 0: 44017.4. Samples: 1347367080. Policy #0 lag: (min: 0.0, avg: 11.0, max: 25.0) [2024-06-27 22:16:33,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:16:34,020][06909] Updated weights for policy 0, policy_version 88163 (0.0037) [2024-06-27 22:16:38,030][06909] Updated weights for policy 0, policy_version 88173 (0.0030) [2024-06-27 22:16:38,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1444675584. Throughput: 0: 44017.8. Samples: 1347628840. Policy #0 lag: (min: 0.0, avg: 11.0, max: 25.0) [2024-06-27 22:16:38,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:16:41,518][06909] Updated weights for policy 0, policy_version 88183 (0.0034) [2024-06-27 22:16:43,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 1444872192. Throughput: 0: 44098.8. Samples: 1347763380. Policy #0 lag: (min: 0.0, avg: 11.0, max: 25.0) [2024-06-27 22:16:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:16:45,494][06909] Updated weights for policy 0, policy_version 88193 (0.0031) [2024-06-27 22:16:48,806][06909] Updated weights for policy 0, policy_version 88203 (0.0027) [2024-06-27 22:16:48,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1445117952. Throughput: 0: 43939.5. Samples: 1348023000. Policy #0 lag: (min: 0.0, avg: 11.0, max: 25.0) [2024-06-27 22:16:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:16:52,971][06909] Updated weights for policy 0, policy_version 88213 (0.0029) [2024-06-27 22:16:53,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 1445314560. Throughput: 0: 44037.3. Samples: 1348291320. Policy #0 lag: (min: 0.0, avg: 11.0, max: 25.0) [2024-06-27 22:16:53,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:16:56,318][06909] Updated weights for policy 0, policy_version 88223 (0.0025) [2024-06-27 22:16:58,850][06674] Fps is (10 sec: 40960.6, 60 sec: 43690.8, 300 sec: 43931.4). Total num frames: 1445527552. Throughput: 0: 43894.7. Samples: 1348412340. Policy #0 lag: (min: 0.0, avg: 11.0, max: 25.0) [2024-06-27 22:16:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:17:00,438][06909] Updated weights for policy 0, policy_version 88233 (0.0034) [2024-06-27 22:17:02,961][06887] Signal inference workers to stop experience collection... (19250 times) [2024-06-27 22:17:02,969][06887] Signal inference workers to resume experience collection... (19250 times) [2024-06-27 22:17:02,970][06909] InferenceWorker_p0-w0: stopping experience collection (19250 times) [2024-06-27 22:17:02,982][06909] InferenceWorker_p0-w0: resuming experience collection (19250 times) [2024-06-27 22:17:03,770][06909] Updated weights for policy 0, policy_version 88243 (0.0023) [2024-06-27 22:17:03,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1445773312. Throughput: 0: 43948.9. Samples: 1348680120. Policy #0 lag: (min: 0.0, avg: 11.0, max: 25.0) [2024-06-27 22:17:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:17:07,869][06909] Updated weights for policy 0, policy_version 88253 (0.0028) [2024-06-27 22:17:08,850][06674] Fps is (10 sec: 45874.5, 60 sec: 43963.6, 300 sec: 43875.8). Total num frames: 1445986304. Throughput: 0: 43900.4. Samples: 1348937520. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-27 22:17:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:17:11,132][06909] Updated weights for policy 0, policy_version 88263 (0.0031) [2024-06-27 22:17:13,852][06674] Fps is (10 sec: 42589.9, 60 sec: 43689.2, 300 sec: 43931.0). Total num frames: 1446199296. Throughput: 0: 43750.5. Samples: 1349066320. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-27 22:17:13,853][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:17:15,389][06909] Updated weights for policy 0, policy_version 88273 (0.0042) [2024-06-27 22:17:18,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1446412288. Throughput: 0: 43776.9. Samples: 1349337040. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-27 22:17:18,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:17:19,006][06909] Updated weights for policy 0, policy_version 88283 (0.0035) [2024-06-27 22:17:22,864][06909] Updated weights for policy 0, policy_version 88293 (0.0038) [2024-06-27 22:17:23,852][06674] Fps is (10 sec: 44236.7, 60 sec: 43962.2, 300 sec: 43931.0). Total num frames: 1446641664. Throughput: 0: 43708.7. Samples: 1349595820. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-27 22:17:23,852][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:17:26,357][06909] Updated weights for policy 0, policy_version 88303 (0.0035) [2024-06-27 22:17:28,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 1446838272. Throughput: 0: 43580.3. Samples: 1349724500. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-27 22:17:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:17:30,525][06909] Updated weights for policy 0, policy_version 88313 (0.0041) [2024-06-27 22:17:33,850][06674] Fps is (10 sec: 42607.3, 60 sec: 43690.7, 300 sec: 43987.2). Total num frames: 1447067648. Throughput: 0: 43817.5. Samples: 1349994780. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-27 22:17:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:17:33,905][06909] Updated weights for policy 0, policy_version 88323 (0.0026) [2024-06-27 22:17:37,854][06909] Updated weights for policy 0, policy_version 88333 (0.0021) [2024-06-27 22:17:38,850][06674] Fps is (10 sec: 45875.8, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 1447297024. Throughput: 0: 43586.2. Samples: 1350252700. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-27 22:17:38,850][06674] Avg episode reward: [(0, '0.401')] [2024-06-27 22:17:41,131][06909] Updated weights for policy 0, policy_version 88343 (0.0031) [2024-06-27 22:17:43,856][06674] Fps is (10 sec: 42572.6, 60 sec: 43686.3, 300 sec: 43874.9). Total num frames: 1447493632. Throughput: 0: 43834.5. Samples: 1350385160. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-27 22:17:43,856][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:17:45,114][06909] Updated weights for policy 0, policy_version 88353 (0.0028) [2024-06-27 22:17:48,393][06909] Updated weights for policy 0, policy_version 88363 (0.0044) [2024-06-27 22:17:48,852][06674] Fps is (10 sec: 44227.4, 60 sec: 43689.2, 300 sec: 44042.3). Total num frames: 1447739392. Throughput: 0: 43904.7. Samples: 1350655920. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-27 22:17:48,852][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:17:48,869][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000088363_1447739392.pth... [2024-06-27 22:17:48,923][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000087720_1437204480.pth [2024-06-27 22:17:52,570][06909] Updated weights for policy 0, policy_version 88373 (0.0028) [2024-06-27 22:17:53,850][06674] Fps is (10 sec: 47542.4, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1447968768. Throughput: 0: 44031.7. Samples: 1350918940. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-27 22:17:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:17:56,326][06909] Updated weights for policy 0, policy_version 88383 (0.0028) [2024-06-27 22:17:58,850][06674] Fps is (10 sec: 42607.1, 60 sec: 43963.7, 300 sec: 43931.4). Total num frames: 1448165376. Throughput: 0: 44118.4. Samples: 1351051560. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-27 22:17:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:17:59,871][06909] Updated weights for policy 0, policy_version 88393 (0.0037) [2024-06-27 22:18:03,652][06909] Updated weights for policy 0, policy_version 88403 (0.0036) [2024-06-27 22:18:03,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 1448394752. Throughput: 0: 43881.6. Samples: 1351311720. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 22:18:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:18:07,430][06909] Updated weights for policy 0, policy_version 88413 (0.0029) [2024-06-27 22:18:08,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43690.8, 300 sec: 43875.8). Total num frames: 1448607744. Throughput: 0: 44058.5. Samples: 1351578360. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 22:18:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:18:10,993][06909] Updated weights for policy 0, policy_version 88423 (0.0034) [2024-06-27 22:18:13,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43965.2, 300 sec: 43986.9). Total num frames: 1448837120. Throughput: 0: 44272.1. Samples: 1351716740. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 22:18:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:18:14,951][06909] Updated weights for policy 0, policy_version 88433 (0.0033) [2024-06-27 22:18:16,954][06887] Signal inference workers to stop experience collection... (19300 times) [2024-06-27 22:18:17,004][06909] InferenceWorker_p0-w0: stopping experience collection (19300 times) [2024-06-27 22:18:17,009][06887] Signal inference workers to resume experience collection... (19300 times) [2024-06-27 22:18:17,026][06909] InferenceWorker_p0-w0: resuming experience collection (19300 times) [2024-06-27 22:18:18,636][06909] Updated weights for policy 0, policy_version 88443 (0.0023) [2024-06-27 22:18:18,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 1449050112. Throughput: 0: 43928.4. Samples: 1351971560. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 22:18:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:18:22,323][06909] Updated weights for policy 0, policy_version 88453 (0.0033) [2024-06-27 22:18:23,850][06674] Fps is (10 sec: 42597.7, 60 sec: 43692.1, 300 sec: 43820.2). Total num frames: 1449263104. Throughput: 0: 44244.7. Samples: 1352243720. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 22:18:23,851][06674] Avg episode reward: [(0, '0.398')] [2024-06-27 22:18:25,969][06909] Updated weights for policy 0, policy_version 88463 (0.0030) [2024-06-27 22:18:28,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.9, 300 sec: 43875.8). Total num frames: 1449492480. Throughput: 0: 44262.0. Samples: 1352376680. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 22:18:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:18:29,546][06909] Updated weights for policy 0, policy_version 88473 (0.0049) [2024-06-27 22:18:33,208][06909] Updated weights for policy 0, policy_version 88483 (0.0033) [2024-06-27 22:18:33,850][06674] Fps is (10 sec: 45875.7, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 1449721856. Throughput: 0: 44050.9. Samples: 1352638120. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 22:18:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:18:36,975][06909] Updated weights for policy 0, policy_version 88493 (0.0035) [2024-06-27 22:18:38,856][06674] Fps is (10 sec: 44209.7, 60 sec: 43959.3, 300 sec: 43875.2). Total num frames: 1449934848. Throughput: 0: 44161.1. Samples: 1352906460. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 22:18:38,857][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:18:40,809][06909] Updated weights for policy 0, policy_version 88503 (0.0032) [2024-06-27 22:18:43,852][06674] Fps is (10 sec: 42589.8, 60 sec: 44239.7, 300 sec: 43931.1). Total num frames: 1450147840. Throughput: 0: 44146.5. Samples: 1353038240. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 22:18:43,853][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:18:44,348][06909] Updated weights for policy 0, policy_version 88513 (0.0027) [2024-06-27 22:18:47,976][06909] Updated weights for policy 0, policy_version 88523 (0.0030) [2024-06-27 22:18:48,850][06674] Fps is (10 sec: 45902.9, 60 sec: 44238.3, 300 sec: 44042.5). Total num frames: 1450393600. Throughput: 0: 44271.6. Samples: 1353303940. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 22:18:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:18:51,718][06909] Updated weights for policy 0, policy_version 88533 (0.0040) [2024-06-27 22:18:53,850][06674] Fps is (10 sec: 44246.2, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 1450590208. Throughput: 0: 44248.5. Samples: 1353569540. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 22:18:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:18:55,742][06909] Updated weights for policy 0, policy_version 88543 (0.0032) [2024-06-27 22:18:58,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43963.6, 300 sec: 43875.8). Total num frames: 1450803200. Throughput: 0: 44108.2. Samples: 1353701620. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 22:18:58,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:18:59,218][06909] Updated weights for policy 0, policy_version 88553 (0.0033) [2024-06-27 22:19:03,094][06909] Updated weights for policy 0, policy_version 88563 (0.0026) [2024-06-27 22:19:03,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1451032576. Throughput: 0: 44089.8. Samples: 1353955600. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 22:19:03,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:19:06,384][06909] Updated weights for policy 0, policy_version 88573 (0.0031) [2024-06-27 22:19:08,850][06674] Fps is (10 sec: 45876.2, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 1451261952. Throughput: 0: 44100.2. Samples: 1354228220. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 22:19:08,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:19:10,331][06909] Updated weights for policy 0, policy_version 88583 (0.0029) [2024-06-27 22:19:13,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 1451491328. Throughput: 0: 44155.4. Samples: 1354363680. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 22:19:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:19:14,007][06909] Updated weights for policy 0, policy_version 88593 (0.0027) [2024-06-27 22:19:17,607][06909] Updated weights for policy 0, policy_version 88603 (0.0039) [2024-06-27 22:19:18,850][06674] Fps is (10 sec: 44235.8, 60 sec: 44236.7, 300 sec: 43986.8). Total num frames: 1451704320. Throughput: 0: 44099.9. Samples: 1354622620. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 22:19:18,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:19:21,531][06909] Updated weights for policy 0, policy_version 88613 (0.0030) [2024-06-27 22:19:23,850][06674] Fps is (10 sec: 42598.9, 60 sec: 44236.9, 300 sec: 43931.3). Total num frames: 1451917312. Throughput: 0: 44095.3. Samples: 1354890480. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 22:19:23,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:19:25,090][06909] Updated weights for policy 0, policy_version 88623 (0.0034) [2024-06-27 22:19:28,850][06674] Fps is (10 sec: 44237.8, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1452146688. Throughput: 0: 43982.5. Samples: 1355017360. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 22:19:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:19:28,879][06909] Updated weights for policy 0, policy_version 88633 (0.0037) [2024-06-27 22:19:32,978][06909] Updated weights for policy 0, policy_version 88643 (0.0040) [2024-06-27 22:19:33,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1452359680. Throughput: 0: 43788.1. Samples: 1355274400. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 22:19:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:19:36,435][06909] Updated weights for policy 0, policy_version 88653 (0.0031) [2024-06-27 22:19:38,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43968.3, 300 sec: 43931.3). Total num frames: 1452572672. Throughput: 0: 43852.9. Samples: 1355542920. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 22:19:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:19:38,854][06887] Signal inference workers to stop experience collection... (19350 times) [2024-06-27 22:19:38,884][06909] InferenceWorker_p0-w0: stopping experience collection (19350 times) [2024-06-27 22:19:38,908][06887] Signal inference workers to resume experience collection... (19350 times) [2024-06-27 22:19:38,908][06909] InferenceWorker_p0-w0: resuming experience collection (19350 times) [2024-06-27 22:19:40,331][06909] Updated weights for policy 0, policy_version 88663 (0.0029) [2024-06-27 22:19:43,629][06909] Updated weights for policy 0, policy_version 88673 (0.0029) [2024-06-27 22:19:43,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44511.4, 300 sec: 43986.9). Total num frames: 1452818432. Throughput: 0: 43940.2. Samples: 1355678920. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-27 22:19:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:19:47,577][06909] Updated weights for policy 0, policy_version 88683 (0.0041) [2024-06-27 22:19:48,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43417.7, 300 sec: 43875.8). Total num frames: 1452998656. Throughput: 0: 44106.3. Samples: 1355940380. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 22:19:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:19:48,944][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000088685_1453015040.pth... [2024-06-27 22:19:48,993][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000088042_1442480128.pth [2024-06-27 22:19:51,332][06909] Updated weights for policy 0, policy_version 88693 (0.0038) [2024-06-27 22:19:53,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 1453260800. Throughput: 0: 43923.5. Samples: 1356204780. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 22:19:53,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:19:54,842][06909] Updated weights for policy 0, policy_version 88703 (0.0026) [2024-06-27 22:19:58,711][06909] Updated weights for policy 0, policy_version 88713 (0.0030) [2024-06-27 22:19:58,850][06674] Fps is (10 sec: 47512.9, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 1453473792. Throughput: 0: 43919.1. Samples: 1356340040. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 22:19:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:20:02,387][06909] Updated weights for policy 0, policy_version 88723 (0.0032) [2024-06-27 22:20:03,856][06674] Fps is (10 sec: 40935.2, 60 sec: 43959.3, 300 sec: 43930.4). Total num frames: 1453670400. Throughput: 0: 43913.4. Samples: 1356598980. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 22:20:03,856][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:20:06,073][06909] Updated weights for policy 0, policy_version 88733 (0.0034) [2024-06-27 22:20:08,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1453899776. Throughput: 0: 43960.9. Samples: 1356868720. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 22:20:08,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:20:09,552][06909] Updated weights for policy 0, policy_version 88743 (0.0039) [2024-06-27 22:20:13,629][06909] Updated weights for policy 0, policy_version 88753 (0.0023) [2024-06-27 22:20:13,850][06674] Fps is (10 sec: 45903.2, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1454129152. Throughput: 0: 44084.0. Samples: 1357001140. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 22:20:13,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:20:17,598][06909] Updated weights for policy 0, policy_version 88763 (0.0034) [2024-06-27 22:20:18,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 1454342144. Throughput: 0: 44139.9. Samples: 1357260700. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 22:20:18,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:20:20,814][06909] Updated weights for policy 0, policy_version 88773 (0.0039) [2024-06-27 22:20:23,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 1454571520. Throughput: 0: 44197.2. Samples: 1357531800. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 22:20:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:20:24,701][06909] Updated weights for policy 0, policy_version 88783 (0.0029) [2024-06-27 22:20:28,417][06909] Updated weights for policy 0, policy_version 88793 (0.0035) [2024-06-27 22:20:28,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 1454800896. Throughput: 0: 44251.0. Samples: 1357670220. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 22:20:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:20:31,942][06909] Updated weights for policy 0, policy_version 88803 (0.0031) [2024-06-27 22:20:33,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 1455013888. Throughput: 0: 44246.5. Samples: 1357931480. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 22:20:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:20:35,866][06909] Updated weights for policy 0, policy_version 88813 (0.0036) [2024-06-27 22:20:38,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44509.8, 300 sec: 44098.0). Total num frames: 1455243264. Throughput: 0: 44250.2. Samples: 1358196040. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 22:20:38,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:20:39,466][06909] Updated weights for policy 0, policy_version 88823 (0.0026) [2024-06-27 22:20:43,228][06909] Updated weights for policy 0, policy_version 88833 (0.0034) [2024-06-27 22:20:43,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1455472640. Throughput: 0: 44241.9. Samples: 1358330920. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-27 22:20:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:20:46,964][06909] Updated weights for policy 0, policy_version 88843 (0.0049) [2024-06-27 22:20:48,850][06674] Fps is (10 sec: 40960.0, 60 sec: 44236.8, 300 sec: 43931.4). Total num frames: 1455652864. Throughput: 0: 44182.0. Samples: 1358586900. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-27 22:20:48,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 22:20:49,997][06887] Signal inference workers to stop experience collection... (19400 times) [2024-06-27 22:20:49,997][06887] Signal inference workers to resume experience collection... (19400 times) [2024-06-27 22:20:50,040][06909] InferenceWorker_p0-w0: stopping experience collection (19400 times) [2024-06-27 22:20:50,041][06909] InferenceWorker_p0-w0: resuming experience collection (19400 times) [2024-06-27 22:20:50,734][06909] Updated weights for policy 0, policy_version 88853 (0.0027) [2024-06-27 22:20:53,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1455898624. Throughput: 0: 44020.0. Samples: 1358849620. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-27 22:20:53,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 22:20:54,580][06909] Updated weights for policy 0, policy_version 88863 (0.0038) [2024-06-27 22:20:58,323][06909] Updated weights for policy 0, policy_version 88873 (0.0035) [2024-06-27 22:20:58,850][06674] Fps is (10 sec: 47513.2, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1456128000. Throughput: 0: 44123.0. Samples: 1358986680. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-27 22:20:58,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 22:21:02,227][06909] Updated weights for policy 0, policy_version 88883 (0.0024) [2024-06-27 22:21:03,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44241.3, 300 sec: 43986.9). Total num frames: 1456324608. Throughput: 0: 44140.1. Samples: 1359247000. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-27 22:21:03,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 22:21:05,757][06909] Updated weights for policy 0, policy_version 88893 (0.0029) [2024-06-27 22:21:08,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1456553984. Throughput: 0: 43991.6. Samples: 1359511420. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-27 22:21:08,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:21:09,461][06909] Updated weights for policy 0, policy_version 88903 (0.0026) [2024-06-27 22:21:13,337][06909] Updated weights for policy 0, policy_version 88913 (0.0040) [2024-06-27 22:21:13,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1456766976. Throughput: 0: 43912.5. Samples: 1359646280. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-27 22:21:13,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 22:21:16,901][06909] Updated weights for policy 0, policy_version 88923 (0.0040) [2024-06-27 22:21:18,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1456979968. Throughput: 0: 43985.3. Samples: 1359910820. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-27 22:21:18,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 22:21:20,629][06909] Updated weights for policy 0, policy_version 88933 (0.0031) [2024-06-27 22:21:23,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1457225728. Throughput: 0: 43815.5. Samples: 1360167740. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-27 22:21:23,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 22:21:24,391][06909] Updated weights for policy 0, policy_version 88943 (0.0030) [2024-06-27 22:21:28,181][06909] Updated weights for policy 0, policy_version 88953 (0.0031) [2024-06-27 22:21:28,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1457438720. Throughput: 0: 43835.9. Samples: 1360303540. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-27 22:21:28,852][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:21:31,748][06909] Updated weights for policy 0, policy_version 88963 (0.0038) [2024-06-27 22:21:33,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 1457635328. Throughput: 0: 43863.9. Samples: 1360560780. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-27 22:21:33,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:21:35,862][06909] Updated weights for policy 0, policy_version 88973 (0.0025) [2024-06-27 22:21:38,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 1457864704. Throughput: 0: 43931.5. Samples: 1360826540. Policy #0 lag: (min: 1.0, avg: 10.6, max: 23.0) [2024-06-27 22:21:38,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:21:39,535][06909] Updated weights for policy 0, policy_version 88983 (0.0031) [2024-06-27 22:21:43,077][06909] Updated weights for policy 0, policy_version 88993 (0.0034) [2024-06-27 22:21:43,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43417.6, 300 sec: 43931.3). Total num frames: 1458077696. Throughput: 0: 43989.8. Samples: 1360966220. Policy #0 lag: (min: 1.0, avg: 10.6, max: 23.0) [2024-06-27 22:21:43,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:21:46,745][06909] Updated weights for policy 0, policy_version 89003 (0.0044) [2024-06-27 22:21:48,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1458290688. Throughput: 0: 43906.6. Samples: 1361222800. Policy #0 lag: (min: 1.0, avg: 10.6, max: 23.0) [2024-06-27 22:21:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:21:48,882][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000089007_1458290688.pth... [2024-06-27 22:21:48,944][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000088363_1447739392.pth [2024-06-27 22:21:50,691][06909] Updated weights for policy 0, policy_version 89013 (0.0035) [2024-06-27 22:21:53,851][06674] Fps is (10 sec: 45868.4, 60 sec: 43962.6, 300 sec: 44097.7). Total num frames: 1458536448. Throughput: 0: 43862.5. Samples: 1361485300. Policy #0 lag: (min: 1.0, avg: 10.6, max: 23.0) [2024-06-27 22:21:53,852][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:21:54,469][06909] Updated weights for policy 0, policy_version 89023 (0.0046) [2024-06-27 22:21:58,076][06909] Updated weights for policy 0, policy_version 89033 (0.0030) [2024-06-27 22:21:58,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43417.7, 300 sec: 43931.3). Total num frames: 1458733056. Throughput: 0: 43849.3. Samples: 1361619500. Policy #0 lag: (min: 1.0, avg: 10.6, max: 23.0) [2024-06-27 22:21:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:22:01,910][06909] Updated weights for policy 0, policy_version 89043 (0.0035) [2024-06-27 22:22:03,854][06674] Fps is (10 sec: 40949.4, 60 sec: 43687.6, 300 sec: 43930.7). Total num frames: 1458946048. Throughput: 0: 43636.2. Samples: 1361874620. Policy #0 lag: (min: 1.0, avg: 10.6, max: 23.0) [2024-06-27 22:22:03,854][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:22:05,803][06909] Updated weights for policy 0, policy_version 89053 (0.0033) [2024-06-27 22:22:08,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.7, 300 sec: 44042.7). Total num frames: 1459191808. Throughput: 0: 43767.5. Samples: 1362137280. Policy #0 lag: (min: 1.0, avg: 10.6, max: 23.0) [2024-06-27 22:22:08,850][06674] Avg episode reward: [(0, '0.396')] [2024-06-27 22:22:09,215][06909] Updated weights for policy 0, policy_version 89063 (0.0034) [2024-06-27 22:22:12,950][06909] Updated weights for policy 0, policy_version 89073 (0.0031) [2024-06-27 22:22:13,852][06674] Fps is (10 sec: 44245.8, 60 sec: 43689.1, 300 sec: 43986.6). Total num frames: 1459388416. Throughput: 0: 43869.1. Samples: 1362277740. Policy #0 lag: (min: 1.0, avg: 10.6, max: 23.0) [2024-06-27 22:22:13,852][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:22:16,801][06909] Updated weights for policy 0, policy_version 89083 (0.0027) [2024-06-27 22:22:18,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.8, 300 sec: 43987.2). Total num frames: 1459617792. Throughput: 0: 44043.6. Samples: 1362542740. Policy #0 lag: (min: 1.0, avg: 10.6, max: 23.0) [2024-06-27 22:22:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:22:19,296][06887] Signal inference workers to stop experience collection... (19450 times) [2024-06-27 22:22:19,296][06887] Signal inference workers to resume experience collection... (19450 times) [2024-06-27 22:22:19,335][06909] InferenceWorker_p0-w0: stopping experience collection (19450 times) [2024-06-27 22:22:19,335][06909] InferenceWorker_p0-w0: resuming experience collection (19450 times) [2024-06-27 22:22:20,107][06909] Updated weights for policy 0, policy_version 89093 (0.0022) [2024-06-27 22:22:23,850][06674] Fps is (10 sec: 45884.5, 60 sec: 43690.6, 300 sec: 44098.0). Total num frames: 1459847168. Throughput: 0: 43947.6. Samples: 1362804180. Policy #0 lag: (min: 1.0, avg: 10.6, max: 23.0) [2024-06-27 22:22:23,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:22:24,087][06909] Updated weights for policy 0, policy_version 89103 (0.0037) [2024-06-27 22:22:28,067][06909] Updated weights for policy 0, policy_version 89113 (0.0028) [2024-06-27 22:22:28,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1460060160. Throughput: 0: 43883.6. Samples: 1362940980. Policy #0 lag: (min: 1.0, avg: 10.6, max: 23.0) [2024-06-27 22:22:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:22:31,834][06909] Updated weights for policy 0, policy_version 89123 (0.0023) [2024-06-27 22:22:33,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1460273152. Throughput: 0: 43993.8. Samples: 1363202520. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 22:22:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:22:35,358][06909] Updated weights for policy 0, policy_version 89133 (0.0033) [2024-06-27 22:22:38,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.7, 300 sec: 44098.8). Total num frames: 1460502528. Throughput: 0: 43941.9. Samples: 1363462620. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 22:22:38,856][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:22:39,089][06909] Updated weights for policy 0, policy_version 89143 (0.0039) [2024-06-27 22:22:42,638][06909] Updated weights for policy 0, policy_version 89153 (0.0029) [2024-06-27 22:22:43,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43963.7, 300 sec: 43987.2). Total num frames: 1460715520. Throughput: 0: 44085.7. Samples: 1363603360. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 22:22:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:22:46,346][06909] Updated weights for policy 0, policy_version 89163 (0.0038) [2024-06-27 22:22:48,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 1460944896. Throughput: 0: 44248.8. Samples: 1363865640. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 22:22:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:22:49,863][06909] Updated weights for policy 0, policy_version 89173 (0.0033) [2024-06-27 22:22:53,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43691.7, 300 sec: 44042.4). Total num frames: 1461157888. Throughput: 0: 44200.4. Samples: 1364126300. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 22:22:53,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:22:54,173][06909] Updated weights for policy 0, policy_version 89183 (0.0028) [2024-06-27 22:22:57,349][06909] Updated weights for policy 0, policy_version 89193 (0.0025) [2024-06-27 22:22:58,856][06674] Fps is (10 sec: 42573.0, 60 sec: 43959.3, 300 sec: 43986.0). Total num frames: 1461370880. Throughput: 0: 44042.7. Samples: 1364259840. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 22:22:58,856][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:23:01,479][06909] Updated weights for policy 0, policy_version 89203 (0.0042) [2024-06-27 22:23:03,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44512.8, 300 sec: 44097.9). Total num frames: 1461616640. Throughput: 0: 44012.8. Samples: 1364523320. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 22:23:03,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:23:05,111][06909] Updated weights for policy 0, policy_version 89213 (0.0029) [2024-06-27 22:23:08,850][06674] Fps is (10 sec: 44263.3, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 1461813248. Throughput: 0: 43968.4. Samples: 1364782760. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 22:23:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:23:09,066][06909] Updated weights for policy 0, policy_version 89223 (0.0033) [2024-06-27 22:23:12,230][06909] Updated weights for policy 0, policy_version 89233 (0.0032) [2024-06-27 22:23:13,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43965.2, 300 sec: 43986.9). Total num frames: 1462026240. Throughput: 0: 43870.5. Samples: 1364915160. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 22:23:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:23:16,280][06909] Updated weights for policy 0, policy_version 89243 (0.0032) [2024-06-27 22:23:18,850][06674] Fps is (10 sec: 45875.8, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 1462272000. Throughput: 0: 44120.9. Samples: 1365187960. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 22:23:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:23:19,541][06909] Updated weights for policy 0, policy_version 89253 (0.0035) [2024-06-27 22:23:23,604][06909] Updated weights for policy 0, policy_version 89263 (0.0039) [2024-06-27 22:23:23,850][06674] Fps is (10 sec: 47513.7, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 1462501376. Throughput: 0: 44100.9. Samples: 1365447160. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 22:23:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:23:26,973][06909] Updated weights for policy 0, policy_version 89273 (0.0023) [2024-06-27 22:23:28,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 1462681600. Throughput: 0: 43904.6. Samples: 1365579060. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 22:23:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:23:31,291][06909] Updated weights for policy 0, policy_version 89283 (0.0041) [2024-06-27 22:23:33,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44782.8, 300 sec: 44154.4). Total num frames: 1462960128. Throughput: 0: 44073.8. Samples: 1365848960. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 22:23:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:23:34,029][06909] Updated weights for policy 0, policy_version 89293 (0.0027) [2024-06-27 22:23:37,180][06887] Signal inference workers to stop experience collection... (19500 times) [2024-06-27 22:23:37,216][06909] InferenceWorker_p0-w0: stopping experience collection (19500 times) [2024-06-27 22:23:37,237][06887] Signal inference workers to resume experience collection... (19500 times) [2024-06-27 22:23:37,238][06909] InferenceWorker_p0-w0: resuming experience collection (19500 times) [2024-06-27 22:23:38,474][06909] Updated weights for policy 0, policy_version 89303 (0.0040) [2024-06-27 22:23:38,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44236.9, 300 sec: 44098.3). Total num frames: 1463156736. Throughput: 0: 44241.0. Samples: 1366117140. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 22:23:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:23:42,225][06909] Updated weights for policy 0, policy_version 89313 (0.0037) [2024-06-27 22:23:43,850][06674] Fps is (10 sec: 39322.1, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 1463353344. Throughput: 0: 44003.3. Samples: 1366239720. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 22:23:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:23:46,103][06909] Updated weights for policy 0, policy_version 89323 (0.0027) [2024-06-27 22:23:48,850][06674] Fps is (10 sec: 45874.6, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 1463615488. Throughput: 0: 44232.0. Samples: 1366513760. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 22:23:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:23:48,864][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000089332_1463615488.pth... [2024-06-27 22:23:48,916][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000088685_1453015040.pth [2024-06-27 22:23:49,553][06909] Updated weights for policy 0, policy_version 89333 (0.0044) [2024-06-27 22:23:53,415][06909] Updated weights for policy 0, policy_version 89343 (0.0031) [2024-06-27 22:23:53,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 1463812096. Throughput: 0: 44295.3. Samples: 1366776040. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 22:23:53,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 22:23:56,774][06909] Updated weights for policy 0, policy_version 89353 (0.0041) [2024-06-27 22:23:58,850][06674] Fps is (10 sec: 39322.1, 60 sec: 43968.2, 300 sec: 43986.9). Total num frames: 1464008704. Throughput: 0: 44169.4. Samples: 1366902780. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 22:23:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:24:00,686][06909] Updated weights for policy 0, policy_version 89363 (0.0032) [2024-06-27 22:24:03,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 1464270848. Throughput: 0: 44104.5. Samples: 1367172660. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 22:24:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:24:03,966][06909] Updated weights for policy 0, policy_version 89373 (0.0024) [2024-06-27 22:24:08,353][06909] Updated weights for policy 0, policy_version 89383 (0.0039) [2024-06-27 22:24:08,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 1464467456. Throughput: 0: 44240.9. Samples: 1367438000. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 22:24:08,850][06674] Avg episode reward: [(0, '0.399')] [2024-06-27 22:24:11,803][06909] Updated weights for policy 0, policy_version 89393 (0.0041) [2024-06-27 22:24:13,850][06674] Fps is (10 sec: 40958.9, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 1464680448. Throughput: 0: 44030.0. Samples: 1367560420. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-27 22:24:13,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:24:15,794][06909] Updated weights for policy 0, policy_version 89403 (0.0037) [2024-06-27 22:24:18,856][06674] Fps is (10 sec: 45847.3, 60 sec: 44232.3, 300 sec: 44097.0). Total num frames: 1464926208. Throughput: 0: 44073.2. Samples: 1367832520. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 22:24:18,857][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:24:19,496][06909] Updated weights for policy 0, policy_version 89413 (0.0030) [2024-06-27 22:24:23,180][06909] Updated weights for policy 0, policy_version 89423 (0.0035) [2024-06-27 22:24:23,850][06674] Fps is (10 sec: 44238.0, 60 sec: 43690.8, 300 sec: 43986.9). Total num frames: 1465122816. Throughput: 0: 43874.7. Samples: 1368091500. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 22:24:23,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:24:26,663][06909] Updated weights for policy 0, policy_version 89433 (0.0025) [2024-06-27 22:24:28,850][06674] Fps is (10 sec: 40985.2, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1465335808. Throughput: 0: 44026.7. Samples: 1368220920. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 22:24:28,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:24:30,714][06909] Updated weights for policy 0, policy_version 89443 (0.0024) [2024-06-27 22:24:33,823][06909] Updated weights for policy 0, policy_version 89453 (0.0024) [2024-06-27 22:24:33,850][06674] Fps is (10 sec: 47512.7, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 1465597952. Throughput: 0: 43884.0. Samples: 1368488540. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 22:24:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:24:38,150][06909] Updated weights for policy 0, policy_version 89463 (0.0035) [2024-06-27 22:24:38,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 1465778176. Throughput: 0: 43953.7. Samples: 1368753960. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 22:24:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:24:41,390][06909] Updated weights for policy 0, policy_version 89473 (0.0028) [2024-06-27 22:24:43,850][06674] Fps is (10 sec: 40959.8, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 1466007552. Throughput: 0: 44075.0. Samples: 1368886160. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 22:24:43,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:24:44,905][06887] Signal inference workers to stop experience collection... (19550 times) [2024-06-27 22:24:44,907][06887] Signal inference workers to resume experience collection... (19550 times) [2024-06-27 22:24:44,948][06909] InferenceWorker_p0-w0: stopping experience collection (19550 times) [2024-06-27 22:24:44,948][06909] InferenceWorker_p0-w0: resuming experience collection (19550 times) [2024-06-27 22:24:45,700][06909] Updated weights for policy 0, policy_version 89483 (0.0037) [2024-06-27 22:24:48,852][06674] Fps is (10 sec: 45865.4, 60 sec: 43689.2, 300 sec: 43986.6). Total num frames: 1466236928. Throughput: 0: 43907.7. Samples: 1369148600. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 22:24:48,853][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:24:48,966][06909] Updated weights for policy 0, policy_version 89493 (0.0042) [2024-06-27 22:24:52,926][06909] Updated weights for policy 0, policy_version 89503 (0.0039) [2024-06-27 22:24:53,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1466449920. Throughput: 0: 44106.7. Samples: 1369422800. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 22:24:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:24:56,633][06909] Updated weights for policy 0, policy_version 89513 (0.0039) [2024-06-27 22:24:58,850][06674] Fps is (10 sec: 42607.0, 60 sec: 44236.7, 300 sec: 44043.3). Total num frames: 1466662912. Throughput: 0: 44297.4. Samples: 1369553800. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 22:24:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:25:00,413][06909] Updated weights for policy 0, policy_version 89523 (0.0030) [2024-06-27 22:25:03,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1466892288. Throughput: 0: 43914.5. Samples: 1369808400. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 22:25:03,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:25:03,963][06909] Updated weights for policy 0, policy_version 89533 (0.0032) [2024-06-27 22:25:07,820][06909] Updated weights for policy 0, policy_version 89543 (0.0040) [2024-06-27 22:25:08,850][06674] Fps is (10 sec: 40960.6, 60 sec: 43417.7, 300 sec: 43875.8). Total num frames: 1467072512. Throughput: 0: 44077.3. Samples: 1370074980. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 22:25:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:25:12,098][06909] Updated weights for policy 0, policy_version 89553 (0.0031) [2024-06-27 22:25:13,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44237.0, 300 sec: 44042.4). Total num frames: 1467334656. Throughput: 0: 44048.5. Samples: 1370203100. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 22:25:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:25:15,151][06909] Updated weights for policy 0, policy_version 89563 (0.0023) [2024-06-27 22:25:18,852][06674] Fps is (10 sec: 47503.4, 60 sec: 43693.6, 300 sec: 43986.6). Total num frames: 1467547648. Throughput: 0: 43979.8. Samples: 1370467720. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 22:25:18,853][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:25:19,290][06909] Updated weights for policy 0, policy_version 89573 (0.0032) [2024-06-27 22:25:22,830][06909] Updated weights for policy 0, policy_version 89583 (0.0034) [2024-06-27 22:25:23,850][06674] Fps is (10 sec: 44236.0, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 1467777024. Throughput: 0: 44048.3. Samples: 1370736140. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 22:25:23,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:25:26,553][06909] Updated weights for policy 0, policy_version 89593 (0.0036) [2024-06-27 22:25:28,850][06674] Fps is (10 sec: 42606.9, 60 sec: 43963.6, 300 sec: 43931.3). Total num frames: 1467973632. Throughput: 0: 44087.6. Samples: 1370870100. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 22:25:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:25:30,158][06909] Updated weights for policy 0, policy_version 89603 (0.0031) [2024-06-27 22:25:33,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43417.7, 300 sec: 43931.3). Total num frames: 1468203008. Throughput: 0: 43967.9. Samples: 1371127060. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 22:25:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:25:33,985][06909] Updated weights for policy 0, policy_version 89613 (0.0032) [2024-06-27 22:25:37,713][06909] Updated weights for policy 0, policy_version 89623 (0.0027) [2024-06-27 22:25:38,850][06674] Fps is (10 sec: 45875.7, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 1468432384. Throughput: 0: 43815.1. Samples: 1371394480. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 22:25:38,855][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:25:41,567][06909] Updated weights for policy 0, policy_version 89633 (0.0037) [2024-06-27 22:25:43,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.8, 300 sec: 43986.9). Total num frames: 1468628992. Throughput: 0: 43790.8. Samples: 1371524380. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 22:25:43,851][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:25:44,991][06909] Updated weights for policy 0, policy_version 89643 (0.0038) [2024-06-27 22:25:48,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43692.1, 300 sec: 43931.3). Total num frames: 1468858368. Throughput: 0: 43801.2. Samples: 1371779460. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 22:25:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:25:48,860][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000089652_1468858368.pth... [2024-06-27 22:25:48,921][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000089007_1458290688.pth [2024-06-27 22:25:49,069][06909] Updated weights for policy 0, policy_version 89653 (0.0035) [2024-06-27 22:25:52,375][06909] Updated weights for policy 0, policy_version 89663 (0.0043) [2024-06-27 22:25:53,850][06674] Fps is (10 sec: 45874.3, 60 sec: 43963.6, 300 sec: 43931.3). Total num frames: 1469087744. Throughput: 0: 43882.9. Samples: 1372049720. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 22:25:53,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:25:56,270][06909] Updated weights for policy 0, policy_version 89673 (0.0036) [2024-06-27 22:25:58,850][06674] Fps is (10 sec: 45875.8, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 1469317120. Throughput: 0: 43986.6. Samples: 1372182500. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 22:25:58,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 22:26:00,199][06909] Updated weights for policy 0, policy_version 89683 (0.0039) [2024-06-27 22:26:03,694][06909] Updated weights for policy 0, policy_version 89693 (0.0025) [2024-06-27 22:26:03,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1469530112. Throughput: 0: 43871.4. Samples: 1372441840. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 22:26:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:26:07,368][06909] Updated weights for policy 0, policy_version 89703 (0.0043) [2024-06-27 22:26:08,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44782.9, 300 sec: 44042.4). Total num frames: 1469759488. Throughput: 0: 44057.0. Samples: 1372718700. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-27 22:26:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:26:11,330][06909] Updated weights for policy 0, policy_version 89713 (0.0032) [2024-06-27 22:26:13,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 1469972480. Throughput: 0: 44100.0. Samples: 1372854600. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-27 22:26:13,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:26:14,874][06909] Updated weights for policy 0, policy_version 89723 (0.0026) [2024-06-27 22:26:15,170][06887] Signal inference workers to stop experience collection... (19600 times) [2024-06-27 22:26:15,176][06887] Signal inference workers to resume experience collection... (19600 times) [2024-06-27 22:26:15,214][06909] InferenceWorker_p0-w0: stopping experience collection (19600 times) [2024-06-27 22:26:15,215][06909] InferenceWorker_p0-w0: resuming experience collection (19600 times) [2024-06-27 22:26:18,563][06909] Updated weights for policy 0, policy_version 89733 (0.0032) [2024-06-27 22:26:18,854][06674] Fps is (10 sec: 42582.2, 60 sec: 43962.5, 300 sec: 43930.8). Total num frames: 1470185472. Throughput: 0: 44109.6. Samples: 1373112160. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-27 22:26:18,854][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:26:22,253][06909] Updated weights for policy 0, policy_version 89743 (0.0024) [2024-06-27 22:26:23,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 1470431232. Throughput: 0: 44060.0. Samples: 1373377180. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-27 22:26:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:26:26,321][06909] Updated weights for policy 0, policy_version 89753 (0.0029) [2024-06-27 22:26:28,850][06674] Fps is (10 sec: 45892.4, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 1470644224. Throughput: 0: 44269.7. Samples: 1373516520. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-27 22:26:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:26:29,423][06909] Updated weights for policy 0, policy_version 89763 (0.0025) [2024-06-27 22:26:33,701][06909] Updated weights for policy 0, policy_version 89773 (0.0034) [2024-06-27 22:26:33,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 1470840832. Throughput: 0: 44349.8. Samples: 1373775200. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-27 22:26:33,850][06674] Avg episode reward: [(0, '0.405')] [2024-06-27 22:26:37,153][06909] Updated weights for policy 0, policy_version 89783 (0.0029) [2024-06-27 22:26:38,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 1471086592. Throughput: 0: 44072.1. Samples: 1374032960. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-27 22:26:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:26:41,198][06909] Updated weights for policy 0, policy_version 89793 (0.0032) [2024-06-27 22:26:43,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44509.8, 300 sec: 44097.9). Total num frames: 1471299584. Throughput: 0: 44426.1. Samples: 1374181680. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-27 22:26:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:26:44,413][06909] Updated weights for policy 0, policy_version 89803 (0.0030) [2024-06-27 22:26:48,452][06909] Updated weights for policy 0, policy_version 89813 (0.0041) [2024-06-27 22:26:48,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43963.8, 300 sec: 43931.6). Total num frames: 1471496192. Throughput: 0: 44361.7. Samples: 1374438120. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-27 22:26:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:26:52,173][06909] Updated weights for policy 0, policy_version 89823 (0.0026) [2024-06-27 22:26:53,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 1471758336. Throughput: 0: 43870.6. Samples: 1374692880. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-27 22:26:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:26:55,927][06909] Updated weights for policy 0, policy_version 89833 (0.0027) [2024-06-27 22:26:58,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.6, 300 sec: 44043.0). Total num frames: 1471938560. Throughput: 0: 43911.1. Samples: 1374830600. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-27 22:26:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:26:59,356][06909] Updated weights for policy 0, policy_version 89843 (0.0028) [2024-06-27 22:27:03,581][06909] Updated weights for policy 0, policy_version 89853 (0.0042) [2024-06-27 22:27:03,850][06674] Fps is (10 sec: 39321.2, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 1472151552. Throughput: 0: 44048.9. Samples: 1375094200. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-27 22:27:03,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:27:06,688][06909] Updated weights for policy 0, policy_version 89863 (0.0022) [2024-06-27 22:27:08,850][06674] Fps is (10 sec: 47513.7, 60 sec: 44236.8, 300 sec: 44153.8). Total num frames: 1472413696. Throughput: 0: 43874.2. Samples: 1375351520. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-27 22:27:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:27:11,297][06909] Updated weights for policy 0, policy_version 89873 (0.0033) [2024-06-27 22:27:13,850][06674] Fps is (10 sec: 45874.4, 60 sec: 43963.5, 300 sec: 44042.4). Total num frames: 1472610304. Throughput: 0: 43921.0. Samples: 1375492980. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-27 22:27:13,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:27:14,166][06909] Updated weights for policy 0, policy_version 89883 (0.0054) [2024-06-27 22:27:18,619][06909] Updated weights for policy 0, policy_version 89893 (0.0027) [2024-06-27 22:27:18,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43966.5, 300 sec: 43986.9). Total num frames: 1472823296. Throughput: 0: 43980.1. Samples: 1375754300. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-27 22:27:18,856][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:27:21,613][06909] Updated weights for policy 0, policy_version 89903 (0.0035) [2024-06-27 22:27:23,850][06674] Fps is (10 sec: 45876.7, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 1473069056. Throughput: 0: 43919.7. Samples: 1376009340. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-27 22:27:23,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:27:26,081][06909] Updated weights for policy 0, policy_version 89913 (0.0036) [2024-06-27 22:27:28,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 1473282048. Throughput: 0: 43753.0. Samples: 1376150560. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-27 22:27:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:27:28,999][06909] Updated weights for policy 0, policy_version 89923 (0.0026) [2024-06-27 22:27:33,549][06909] Updated weights for policy 0, policy_version 89933 (0.0028) [2024-06-27 22:27:33,850][06674] Fps is (10 sec: 39321.1, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 1473462272. Throughput: 0: 43868.0. Samples: 1376412180. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-27 22:27:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:27:36,753][06909] Updated weights for policy 0, policy_version 89943 (0.0031) [2024-06-27 22:27:38,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.9, 300 sec: 44098.0). Total num frames: 1473724416. Throughput: 0: 43867.3. Samples: 1376666900. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-27 22:27:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:27:41,083][06909] Updated weights for policy 0, policy_version 89953 (0.0028) [2024-06-27 22:27:41,114][06887] Signal inference workers to stop experience collection... (19650 times) [2024-06-27 22:27:41,115][06887] Signal inference workers to resume experience collection... (19650 times) [2024-06-27 22:27:41,145][06909] InferenceWorker_p0-w0: stopping experience collection (19650 times) [2024-06-27 22:27:41,145][06909] InferenceWorker_p0-w0: resuming experience collection (19650 times) [2024-06-27 22:27:43,850][06674] Fps is (10 sec: 47514.2, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1473937408. Throughput: 0: 43983.2. Samples: 1376809840. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-27 22:27:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:27:44,037][06909] Updated weights for policy 0, policy_version 89963 (0.0041) [2024-06-27 22:27:48,399][06909] Updated weights for policy 0, policy_version 89973 (0.0027) [2024-06-27 22:27:48,852][06674] Fps is (10 sec: 39312.8, 60 sec: 43689.1, 300 sec: 43931.0). Total num frames: 1474117632. Throughput: 0: 43861.6. Samples: 1377068060. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-27 22:27:48,853][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:27:48,909][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000089974_1474134016.pth... [2024-06-27 22:27:48,961][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000089332_1463615488.pth [2024-06-27 22:27:51,461][06909] Updated weights for policy 0, policy_version 89983 (0.0021) [2024-06-27 22:27:53,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.7, 300 sec: 44098.9). Total num frames: 1474379776. Throughput: 0: 43973.8. Samples: 1377330340. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-27 22:27:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:27:55,724][06909] Updated weights for policy 0, policy_version 89993 (0.0038) [2024-06-27 22:27:58,850][06674] Fps is (10 sec: 47524.2, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 1474592768. Throughput: 0: 43958.6. Samples: 1377471100. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-27 22:27:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:27:58,885][06909] Updated weights for policy 0, policy_version 90003 (0.0032) [2024-06-27 22:28:03,276][06909] Updated weights for policy 0, policy_version 90013 (0.0040) [2024-06-27 22:28:03,850][06674] Fps is (10 sec: 42597.8, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1474805760. Throughput: 0: 43908.8. Samples: 1377730200. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-27 22:28:03,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:28:06,605][06909] Updated weights for policy 0, policy_version 90023 (0.0029) [2024-06-27 22:28:08,850][06674] Fps is (10 sec: 45874.3, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 1475051520. Throughput: 0: 44021.7. Samples: 1377990320. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-27 22:28:08,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:28:10,705][06909] Updated weights for policy 0, policy_version 90033 (0.0035) [2024-06-27 22:28:13,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.9, 300 sec: 43986.9). Total num frames: 1475248128. Throughput: 0: 43970.2. Samples: 1378129220. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-27 22:28:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:28:13,960][06909] Updated weights for policy 0, policy_version 90043 (0.0037) [2024-06-27 22:28:18,294][06909] Updated weights for policy 0, policy_version 90053 (0.0029) [2024-06-27 22:28:18,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 1475461120. Throughput: 0: 43838.3. Samples: 1378384900. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-27 22:28:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:28:21,381][06909] Updated weights for policy 0, policy_version 90063 (0.0030) [2024-06-27 22:28:23,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 1475706880. Throughput: 0: 44036.8. Samples: 1378648560. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-27 22:28:23,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:28:25,600][06909] Updated weights for policy 0, policy_version 90073 (0.0021) [2024-06-27 22:28:28,749][06909] Updated weights for policy 0, policy_version 90083 (0.0031) [2024-06-27 22:28:28,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.7, 300 sec: 43931.4). Total num frames: 1475919872. Throughput: 0: 44042.2. Samples: 1378791740. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-27 22:28:28,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:28:33,100][06909] Updated weights for policy 0, policy_version 90093 (0.0032) [2024-06-27 22:28:33,850][06674] Fps is (10 sec: 40960.2, 60 sec: 44236.9, 300 sec: 43931.3). Total num frames: 1476116480. Throughput: 0: 44023.5. Samples: 1379049020. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-27 22:28:33,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:28:36,133][06909] Updated weights for policy 0, policy_version 90103 (0.0038) [2024-06-27 22:28:38,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 1476345856. Throughput: 0: 43890.7. Samples: 1379305420. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-27 22:28:38,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:28:40,515][06909] Updated weights for policy 0, policy_version 90113 (0.0031) [2024-06-27 22:28:43,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 1476558848. Throughput: 0: 43879.0. Samples: 1379445660. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-27 22:28:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:28:43,878][06909] Updated weights for policy 0, policy_version 90123 (0.0038) [2024-06-27 22:28:47,772][06909] Updated weights for policy 0, policy_version 90133 (0.0030) [2024-06-27 22:28:48,850][06674] Fps is (10 sec: 42597.8, 60 sec: 44238.3, 300 sec: 43931.3). Total num frames: 1476771840. Throughput: 0: 44048.0. Samples: 1379712360. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-27 22:28:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:28:51,091][06909] Updated weights for policy 0, policy_version 90143 (0.0042) [2024-06-27 22:28:53,851][06674] Fps is (10 sec: 44231.7, 60 sec: 43689.8, 300 sec: 44042.2). Total num frames: 1477001216. Throughput: 0: 43813.7. Samples: 1379961980. Policy #0 lag: (min: 1.0, avg: 9.5, max: 20.0) [2024-06-27 22:28:53,852][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:28:55,503][06909] Updated weights for policy 0, policy_version 90153 (0.0025) [2024-06-27 22:28:58,676][06909] Updated weights for policy 0, policy_version 90163 (0.0030) [2024-06-27 22:28:58,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.6, 300 sec: 43931.3). Total num frames: 1477230592. Throughput: 0: 43893.7. Samples: 1380104440. Policy #0 lag: (min: 1.0, avg: 9.5, max: 20.0) [2024-06-27 22:28:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:29:02,778][06909] Updated weights for policy 0, policy_version 90173 (0.0041) [2024-06-27 22:29:03,305][06887] Signal inference workers to stop experience collection... (19700 times) [2024-06-27 22:29:03,343][06909] InferenceWorker_p0-w0: stopping experience collection (19700 times) [2024-06-27 22:29:03,354][06887] Signal inference workers to resume experience collection... (19700 times) [2024-06-27 22:29:03,360][06909] InferenceWorker_p0-w0: resuming experience collection (19700 times) [2024-06-27 22:29:03,850][06674] Fps is (10 sec: 42603.3, 60 sec: 43690.8, 300 sec: 43931.3). Total num frames: 1477427200. Throughput: 0: 44158.3. Samples: 1380372020. Policy #0 lag: (min: 1.0, avg: 9.5, max: 20.0) [2024-06-27 22:29:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:29:05,827][06909] Updated weights for policy 0, policy_version 90183 (0.0040) [2024-06-27 22:29:08,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1477672960. Throughput: 0: 44122.1. Samples: 1380634060. Policy #0 lag: (min: 1.0, avg: 9.5, max: 20.0) [2024-06-27 22:29:08,853][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:29:10,262][06909] Updated weights for policy 0, policy_version 90193 (0.0035) [2024-06-27 22:29:13,379][06909] Updated weights for policy 0, policy_version 90203 (0.0041) [2024-06-27 22:29:13,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.7, 300 sec: 43932.2). Total num frames: 1477885952. Throughput: 0: 43895.5. Samples: 1380767040. Policy #0 lag: (min: 1.0, avg: 9.5, max: 20.0) [2024-06-27 22:29:13,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:29:17,654][06909] Updated weights for policy 0, policy_version 90213 (0.0031) [2024-06-27 22:29:18,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1478098944. Throughput: 0: 44077.3. Samples: 1381032500. Policy #0 lag: (min: 1.0, avg: 9.5, max: 20.0) [2024-06-27 22:29:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:29:21,012][06909] Updated weights for policy 0, policy_version 90223 (0.0028) [2024-06-27 22:29:23,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 1478328320. Throughput: 0: 44057.2. Samples: 1381288000. Policy #0 lag: (min: 1.0, avg: 9.5, max: 20.0) [2024-06-27 22:29:23,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:29:25,402][06909] Updated weights for policy 0, policy_version 90233 (0.0034) [2024-06-27 22:29:28,297][06909] Updated weights for policy 0, policy_version 90243 (0.0034) [2024-06-27 22:29:28,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 1478541312. Throughput: 0: 43916.9. Samples: 1381421920. Policy #0 lag: (min: 1.0, avg: 9.5, max: 20.0) [2024-06-27 22:29:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:29:32,877][06909] Updated weights for policy 0, policy_version 90253 (0.0040) [2024-06-27 22:29:33,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1478754304. Throughput: 0: 44001.0. Samples: 1381692400. Policy #0 lag: (min: 1.0, avg: 9.5, max: 20.0) [2024-06-27 22:29:33,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:29:35,950][06909] Updated weights for policy 0, policy_version 90263 (0.0030) [2024-06-27 22:29:38,852][06674] Fps is (10 sec: 44227.3, 60 sec: 43962.1, 300 sec: 43986.6). Total num frames: 1478983680. Throughput: 0: 44153.7. Samples: 1381948940. Policy #0 lag: (min: 1.0, avg: 9.5, max: 20.0) [2024-06-27 22:29:38,852][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:29:40,131][06909] Updated weights for policy 0, policy_version 90273 (0.0043) [2024-06-27 22:29:43,119][06909] Updated weights for policy 0, policy_version 90283 (0.0021) [2024-06-27 22:29:43,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.8, 300 sec: 43987.2). Total num frames: 1479213056. Throughput: 0: 44054.3. Samples: 1382086880. Policy #0 lag: (min: 1.0, avg: 9.5, max: 20.0) [2024-06-27 22:29:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:29:47,270][06909] Updated weights for policy 0, policy_version 90293 (0.0024) [2024-06-27 22:29:48,850][06674] Fps is (10 sec: 44245.8, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1479426048. Throughput: 0: 44081.7. Samples: 1382355700. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 22:29:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:29:48,902][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000090298_1479442432.pth... [2024-06-27 22:29:48,950][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000089652_1468858368.pth [2024-06-27 22:29:50,605][06909] Updated weights for policy 0, policy_version 90303 (0.0035) [2024-06-27 22:29:53,852][06674] Fps is (10 sec: 42589.5, 60 sec: 43963.1, 300 sec: 43986.6). Total num frames: 1479639040. Throughput: 0: 44123.8. Samples: 1382619720. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 22:29:53,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:29:54,486][06909] Updated weights for policy 0, policy_version 90313 (0.0031) [2024-06-27 22:29:58,011][06909] Updated weights for policy 0, policy_version 90323 (0.0031) [2024-06-27 22:29:58,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1479884800. Throughput: 0: 44029.8. Samples: 1382748380. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 22:29:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:30:02,290][06909] Updated weights for policy 0, policy_version 90333 (0.0028) [2024-06-27 22:30:03,850][06674] Fps is (10 sec: 45884.3, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 1480097792. Throughput: 0: 44103.4. Samples: 1383017160. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 22:30:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:30:05,356][06909] Updated weights for policy 0, policy_version 90343 (0.0035) [2024-06-27 22:30:08,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43963.7, 300 sec: 43986.8). Total num frames: 1480310784. Throughput: 0: 44143.5. Samples: 1383274460. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 22:30:08,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:30:09,783][06909] Updated weights for policy 0, policy_version 90353 (0.0027) [2024-06-27 22:30:12,626][06909] Updated weights for policy 0, policy_version 90363 (0.0026) [2024-06-27 22:30:13,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44236.8, 300 sec: 44042.7). Total num frames: 1480540160. Throughput: 0: 44232.4. Samples: 1383412380. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 22:30:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:30:14,349][06887] Signal inference workers to stop experience collection... (19750 times) [2024-06-27 22:30:14,349][06887] Signal inference workers to resume experience collection... (19750 times) [2024-06-27 22:30:14,397][06909] InferenceWorker_p0-w0: stopping experience collection (19750 times) [2024-06-27 22:30:14,397][06909] InferenceWorker_p0-w0: resuming experience collection (19750 times) [2024-06-27 22:30:17,283][06909] Updated weights for policy 0, policy_version 90373 (0.0050) [2024-06-27 22:30:18,850][06674] Fps is (10 sec: 44237.4, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1480753152. Throughput: 0: 44147.6. Samples: 1383679040. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 22:30:18,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:30:20,381][06909] Updated weights for policy 0, policy_version 90383 (0.0030) [2024-06-27 22:30:23,850][06674] Fps is (10 sec: 40959.4, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 1480949760. Throughput: 0: 44140.6. Samples: 1383935180. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 22:30:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:30:24,641][06909] Updated weights for policy 0, policy_version 90393 (0.0031) [2024-06-27 22:30:27,700][06909] Updated weights for policy 0, policy_version 90403 (0.0035) [2024-06-27 22:30:28,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44509.8, 300 sec: 44097.9). Total num frames: 1481211904. Throughput: 0: 44047.0. Samples: 1384069000. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 22:30:28,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:30:31,997][06909] Updated weights for policy 0, policy_version 90413 (0.0043) [2024-06-27 22:30:33,850][06674] Fps is (10 sec: 45875.9, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1481408512. Throughput: 0: 44057.0. Samples: 1384338260. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 22:30:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:30:34,983][06909] Updated weights for policy 0, policy_version 90423 (0.0038) [2024-06-27 22:30:38,850][06674] Fps is (10 sec: 39322.1, 60 sec: 43692.2, 300 sec: 43986.9). Total num frames: 1481605120. Throughput: 0: 44093.2. Samples: 1384603820. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 22:30:38,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:30:39,396][06909] Updated weights for policy 0, policy_version 90433 (0.0031) [2024-06-27 22:30:42,591][06909] Updated weights for policy 0, policy_version 90443 (0.0032) [2024-06-27 22:30:43,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1481867264. Throughput: 0: 43996.5. Samples: 1384728220. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 22:30:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:30:47,206][06909] Updated weights for policy 0, policy_version 90453 (0.0047) [2024-06-27 22:30:48,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1482063872. Throughput: 0: 43853.9. Samples: 1384990580. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 22:30:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:30:49,939][06909] Updated weights for policy 0, policy_version 90463 (0.0031) [2024-06-27 22:30:53,850][06674] Fps is (10 sec: 39321.7, 60 sec: 43692.2, 300 sec: 43875.8). Total num frames: 1482260480. Throughput: 0: 44064.6. Samples: 1385257360. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 22:30:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:30:54,615][06909] Updated weights for policy 0, policy_version 90473 (0.0043) [2024-06-27 22:30:57,542][06909] Updated weights for policy 0, policy_version 90483 (0.0030) [2024-06-27 22:30:58,850][06674] Fps is (10 sec: 47513.4, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 1482539008. Throughput: 0: 43845.3. Samples: 1385385420. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 22:30:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:31:02,135][06909] Updated weights for policy 0, policy_version 90493 (0.0032) [2024-06-27 22:31:03,852][06674] Fps is (10 sec: 47503.8, 60 sec: 43962.3, 300 sec: 43986.6). Total num frames: 1482735616. Throughput: 0: 43877.5. Samples: 1385653620. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 22:31:03,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:31:04,736][06909] Updated weights for policy 0, policy_version 90503 (0.0030) [2024-06-27 22:31:08,850][06674] Fps is (10 sec: 39321.9, 60 sec: 43690.8, 300 sec: 43931.4). Total num frames: 1482932224. Throughput: 0: 44169.9. Samples: 1385922820. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 22:31:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 22:31:09,517][06909] Updated weights for policy 0, policy_version 90513 (0.0036) [2024-06-27 22:31:12,085][06909] Updated weights for policy 0, policy_version 90523 (0.0034) [2024-06-27 22:31:13,856][06674] Fps is (10 sec: 45857.4, 60 sec: 44232.4, 300 sec: 44097.6). Total num frames: 1483194368. Throughput: 0: 43975.6. Samples: 1386048160. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 22:31:13,856][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:31:16,777][06909] Updated weights for policy 0, policy_version 90533 (0.0034) [2024-06-27 22:31:18,850][06674] Fps is (10 sec: 47513.4, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1483407360. Throughput: 0: 44048.4. Samples: 1386320440. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 22:31:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:31:19,579][06909] Updated weights for policy 0, policy_version 90543 (0.0036) [2024-06-27 22:31:23,850][06674] Fps is (10 sec: 39344.6, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 1483587584. Throughput: 0: 43942.6. Samples: 1386581240. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 22:31:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:31:24,354][06909] Updated weights for policy 0, policy_version 90553 (0.0047) [2024-06-27 22:31:27,009][06909] Updated weights for policy 0, policy_version 90563 (0.0029) [2024-06-27 22:31:28,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 1483849728. Throughput: 0: 43978.6. Samples: 1386707260. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 22:31:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:31:31,716][06909] Updated weights for policy 0, policy_version 90573 (0.0030) [2024-06-27 22:31:33,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 1484062720. Throughput: 0: 44132.3. Samples: 1386976540. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 22:31:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:31:34,425][06909] Updated weights for policy 0, policy_version 90583 (0.0022) [2024-06-27 22:31:38,620][06887] Signal inference workers to stop experience collection... (19800 times) [2024-06-27 22:31:38,653][06909] InferenceWorker_p0-w0: stopping experience collection (19800 times) [2024-06-27 22:31:38,668][06887] Signal inference workers to resume experience collection... (19800 times) [2024-06-27 22:31:38,678][06909] InferenceWorker_p0-w0: resuming experience collection (19800 times) [2024-06-27 22:31:38,850][06674] Fps is (10 sec: 40960.0, 60 sec: 44236.7, 300 sec: 43931.3). Total num frames: 1484259328. Throughput: 0: 44135.0. Samples: 1387243440. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-27 22:31:38,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:31:39,243][06909] Updated weights for policy 0, policy_version 90593 (0.0045) [2024-06-27 22:31:41,920][06909] Updated weights for policy 0, policy_version 90603 (0.0028) [2024-06-27 22:31:43,851][06674] Fps is (10 sec: 44233.9, 60 sec: 43963.2, 300 sec: 44097.9). Total num frames: 1484505088. Throughput: 0: 44000.2. Samples: 1387365460. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-27 22:31:43,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 22:31:46,808][06909] Updated weights for policy 0, policy_version 90613 (0.0035) [2024-06-27 22:31:48,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 1484718080. Throughput: 0: 43980.2. Samples: 1387632640. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-27 22:31:48,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:31:48,864][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000090621_1484734464.pth... [2024-06-27 22:31:48,910][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000089974_1474134016.pth [2024-06-27 22:31:49,429][06909] Updated weights for policy 0, policy_version 90623 (0.0045) [2024-06-27 22:31:53,850][06674] Fps is (10 sec: 39324.7, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 1484898304. Throughput: 0: 43941.3. Samples: 1387900180. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-27 22:31:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:31:54,110][06909] Updated weights for policy 0, policy_version 90633 (0.0043) [2024-06-27 22:31:57,219][06909] Updated weights for policy 0, policy_version 90643 (0.0029) [2024-06-27 22:31:58,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.6, 300 sec: 44098.0). Total num frames: 1485160448. Throughput: 0: 43925.7. Samples: 1388024560. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-27 22:31:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:32:01,746][06909] Updated weights for policy 0, policy_version 90653 (0.0029) [2024-06-27 22:32:03,850][06674] Fps is (10 sec: 47513.3, 60 sec: 43965.2, 300 sec: 43931.3). Total num frames: 1485373440. Throughput: 0: 43746.2. Samples: 1388289020. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-27 22:32:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:32:04,371][06909] Updated weights for policy 0, policy_version 90663 (0.0035) [2024-06-27 22:32:08,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43963.7, 300 sec: 43931.4). Total num frames: 1485570048. Throughput: 0: 44148.0. Samples: 1388567900. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-27 22:32:08,859][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:32:09,114][06909] Updated weights for policy 0, policy_version 90673 (0.0037) [2024-06-27 22:32:11,728][06909] Updated weights for policy 0, policy_version 90683 (0.0044) [2024-06-27 22:32:13,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43695.0, 300 sec: 44042.4). Total num frames: 1485815808. Throughput: 0: 43956.6. Samples: 1388685300. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-27 22:32:13,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:32:16,565][06909] Updated weights for policy 0, policy_version 90693 (0.0034) [2024-06-27 22:32:18,850][06674] Fps is (10 sec: 47513.9, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1486045184. Throughput: 0: 43772.6. Samples: 1388946300. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-27 22:32:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:32:19,454][06909] Updated weights for policy 0, policy_version 90703 (0.0045) [2024-06-27 22:32:23,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 1486225408. Throughput: 0: 43913.1. Samples: 1389219520. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-27 22:32:23,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:32:23,957][06909] Updated weights for policy 0, policy_version 90713 (0.0032) [2024-06-27 22:32:26,655][06909] Updated weights for policy 0, policy_version 90723 (0.0025) [2024-06-27 22:32:28,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 1486471168. Throughput: 0: 43915.8. Samples: 1389341640. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-27 22:32:28,851][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:32:31,231][06909] Updated weights for policy 0, policy_version 90733 (0.0043) [2024-06-27 22:32:33,852][06674] Fps is (10 sec: 49141.5, 60 sec: 44235.3, 300 sec: 44042.1). Total num frames: 1486716928. Throughput: 0: 43951.8. Samples: 1389610560. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 22:32:33,852][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:32:34,307][06909] Updated weights for policy 0, policy_version 90743 (0.0039) [2024-06-27 22:32:38,739][06909] Updated weights for policy 0, policy_version 90753 (0.0038) [2024-06-27 22:32:38,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 1486897152. Throughput: 0: 44010.1. Samples: 1389880640. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 22:32:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:32:41,543][06909] Updated weights for policy 0, policy_version 90763 (0.0037) [2024-06-27 22:32:43,850][06674] Fps is (10 sec: 40968.0, 60 sec: 43691.1, 300 sec: 44098.3). Total num frames: 1487126528. Throughput: 0: 44009.3. Samples: 1390004980. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 22:32:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:32:46,063][06909] Updated weights for policy 0, policy_version 90773 (0.0029) [2024-06-27 22:32:48,850][06674] Fps is (10 sec: 47513.1, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 1487372288. Throughput: 0: 44010.6. Samples: 1390269500. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 22:32:48,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:32:49,027][06909] Updated weights for policy 0, policy_version 90783 (0.0036) [2024-06-27 22:32:53,010][06887] Signal inference workers to stop experience collection... (19850 times) [2024-06-27 22:32:53,011][06887] Signal inference workers to resume experience collection... (19850 times) [2024-06-27 22:32:53,048][06909] InferenceWorker_p0-w0: stopping experience collection (19850 times) [2024-06-27 22:32:53,048][06909] InferenceWorker_p0-w0: resuming experience collection (19850 times) [2024-06-27 22:32:53,502][06909] Updated weights for policy 0, policy_version 90793 (0.0040) [2024-06-27 22:32:53,850][06674] Fps is (10 sec: 42598.9, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 1487552512. Throughput: 0: 43889.4. Samples: 1390542920. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 22:32:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:32:56,624][06909] Updated weights for policy 0, policy_version 90803 (0.0041) [2024-06-27 22:32:58,850][06674] Fps is (10 sec: 40960.6, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1487781888. Throughput: 0: 44027.1. Samples: 1390666520. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 22:32:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:33:01,017][06909] Updated weights for policy 0, policy_version 90813 (0.0031) [2024-06-27 22:33:03,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1488027648. Throughput: 0: 44177.3. Samples: 1390934280. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 22:33:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:33:03,894][06909] Updated weights for policy 0, policy_version 90823 (0.0040) [2024-06-27 22:33:08,498][06909] Updated weights for policy 0, policy_version 90833 (0.0040) [2024-06-27 22:33:08,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 1488240640. Throughput: 0: 44061.2. Samples: 1391202280. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 22:33:08,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:33:11,281][06909] Updated weights for policy 0, policy_version 90843 (0.0036) [2024-06-27 22:33:13,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1488453632. Throughput: 0: 44054.8. Samples: 1391324100. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 22:33:13,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:33:15,818][06909] Updated weights for policy 0, policy_version 90853 (0.0040) [2024-06-27 22:33:18,601][06909] Updated weights for policy 0, policy_version 90863 (0.0037) [2024-06-27 22:33:18,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 1488699392. Throughput: 0: 44074.4. Samples: 1391593820. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 22:33:18,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:33:23,229][06909] Updated weights for policy 0, policy_version 90873 (0.0033) [2024-06-27 22:33:23,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44509.8, 300 sec: 43986.9). Total num frames: 1488896000. Throughput: 0: 44031.1. Samples: 1391862040. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-27 22:33:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:33:26,449][06909] Updated weights for policy 0, policy_version 90883 (0.0031) [2024-06-27 22:33:28,850][06674] Fps is (10 sec: 39321.9, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1489092608. Throughput: 0: 44109.4. Samples: 1391989900. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-27 22:33:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:33:30,605][06909] Updated weights for policy 0, policy_version 90893 (0.0039) [2024-06-27 22:33:33,745][06909] Updated weights for policy 0, policy_version 90903 (0.0038) [2024-06-27 22:33:33,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43965.2, 300 sec: 44097.9). Total num frames: 1489354752. Throughput: 0: 44055.7. Samples: 1392252000. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-27 22:33:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:33:38,205][06909] Updated weights for policy 0, policy_version 90913 (0.0026) [2024-06-27 22:33:38,850][06674] Fps is (10 sec: 47513.7, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 1489567744. Throughput: 0: 43932.9. Samples: 1392519900. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-27 22:33:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:33:41,073][06909] Updated weights for policy 0, policy_version 90923 (0.0027) [2024-06-27 22:33:43,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1489764352. Throughput: 0: 43946.6. Samples: 1392644120. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-27 22:33:43,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:33:45,629][06909] Updated weights for policy 0, policy_version 90933 (0.0039) [2024-06-27 22:33:48,566][06909] Updated weights for policy 0, policy_version 90943 (0.0029) [2024-06-27 22:33:48,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.8, 300 sec: 44098.1). Total num frames: 1490010112. Throughput: 0: 43947.0. Samples: 1392911900. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-27 22:33:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:33:48,874][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000090943_1490010112.pth... [2024-06-27 22:33:48,964][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000090298_1479442432.pth [2024-06-27 22:33:53,025][06909] Updated weights for policy 0, policy_version 90953 (0.0034) [2024-06-27 22:33:53,850][06674] Fps is (10 sec: 45875.8, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 1490223104. Throughput: 0: 43990.8. Samples: 1393181860. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-27 22:33:53,852][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 22:33:56,210][06909] Updated weights for policy 0, policy_version 90963 (0.0037) [2024-06-27 22:33:58,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1490419712. Throughput: 0: 44150.6. Samples: 1393310880. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-27 22:33:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:34:00,565][06909] Updated weights for policy 0, policy_version 90973 (0.0026) [2024-06-27 22:34:03,709][06909] Updated weights for policy 0, policy_version 90983 (0.0020) [2024-06-27 22:34:03,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1490665472. Throughput: 0: 43999.1. Samples: 1393573780. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-27 22:34:03,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:34:06,921][06887] Signal inference workers to stop experience collection... (19900 times) [2024-06-27 22:34:06,972][06909] InferenceWorker_p0-w0: stopping experience collection (19900 times) [2024-06-27 22:34:07,041][06887] Signal inference workers to resume experience collection... (19900 times) [2024-06-27 22:34:07,041][06909] InferenceWorker_p0-w0: resuming experience collection (19900 times) [2024-06-27 22:34:07,813][06909] Updated weights for policy 0, policy_version 90993 (0.0034) [2024-06-27 22:34:08,850][06674] Fps is (10 sec: 49151.9, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 1490911232. Throughput: 0: 43962.7. Samples: 1393840360. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-27 22:34:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:34:11,104][06909] Updated weights for policy 0, policy_version 91003 (0.0030) [2024-06-27 22:34:13,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1491075072. Throughput: 0: 44040.9. Samples: 1393971740. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-27 22:34:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:34:15,402][06909] Updated weights for policy 0, policy_version 91013 (0.0027) [2024-06-27 22:34:18,511][06909] Updated weights for policy 0, policy_version 91023 (0.0033) [2024-06-27 22:34:18,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1491320832. Throughput: 0: 43902.2. Samples: 1394227600. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-27 22:34:18,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:34:22,989][06909] Updated weights for policy 0, policy_version 91033 (0.0023) [2024-06-27 22:34:23,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1491533824. Throughput: 0: 43742.7. Samples: 1394488320. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 22:34:23,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:34:26,005][06909] Updated weights for policy 0, policy_version 91043 (0.0034) [2024-06-27 22:34:28,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1491730432. Throughput: 0: 43859.2. Samples: 1394617780. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 22:34:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:34:30,418][06909] Updated weights for policy 0, policy_version 91053 (0.0037) [2024-06-27 22:34:33,463][06909] Updated weights for policy 0, policy_version 91063 (0.0034) [2024-06-27 22:34:33,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.7, 300 sec: 44042.7). Total num frames: 1491976192. Throughput: 0: 43676.5. Samples: 1394877340. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 22:34:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:34:37,977][06909] Updated weights for policy 0, policy_version 91073 (0.0033) [2024-06-27 22:34:38,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 1492189184. Throughput: 0: 43685.3. Samples: 1395147700. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 22:34:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:34:41,174][06909] Updated weights for policy 0, policy_version 91083 (0.0035) [2024-06-27 22:34:43,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 1492385792. Throughput: 0: 43656.0. Samples: 1395275400. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 22:34:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:34:45,250][06909] Updated weights for policy 0, policy_version 91093 (0.0041) [2024-06-27 22:34:48,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43417.7, 300 sec: 43987.2). Total num frames: 1492615168. Throughput: 0: 43523.7. Samples: 1395532340. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 22:34:48,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 22:34:48,872][06909] Updated weights for policy 0, policy_version 91103 (0.0046) [2024-06-27 22:34:52,825][06909] Updated weights for policy 0, policy_version 91113 (0.0026) [2024-06-27 22:34:53,850][06674] Fps is (10 sec: 47513.3, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1492860928. Throughput: 0: 43591.0. Samples: 1395801960. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 22:34:53,853][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:34:56,054][06909] Updated weights for policy 0, policy_version 91123 (0.0034) [2024-06-27 22:34:58,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43417.5, 300 sec: 43820.3). Total num frames: 1493024768. Throughput: 0: 43639.0. Samples: 1395935500. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 22:34:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:35:00,106][06909] Updated weights for policy 0, policy_version 91133 (0.0041) [2024-06-27 22:35:03,313][06909] Updated weights for policy 0, policy_version 91143 (0.0026) [2024-06-27 22:35:03,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 1493286912. Throughput: 0: 43660.0. Samples: 1396192300. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 22:35:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 22:35:07,759][06909] Updated weights for policy 0, policy_version 91153 (0.0025) [2024-06-27 22:35:08,850][06674] Fps is (10 sec: 49152.4, 60 sec: 43417.6, 300 sec: 43986.9). Total num frames: 1493516288. Throughput: 0: 43977.3. Samples: 1396467300. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 22:35:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:35:10,625][06909] Updated weights for policy 0, policy_version 91163 (0.0046) [2024-06-27 22:35:13,850][06674] Fps is (10 sec: 40960.6, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 1493696512. Throughput: 0: 44017.8. Samples: 1396598580. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 22:35:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:35:15,274][06909] Updated weights for policy 0, policy_version 91173 (0.0043) [2024-06-27 22:35:18,661][06909] Updated weights for policy 0, policy_version 91183 (0.0030) [2024-06-27 22:35:18,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1493942272. Throughput: 0: 43941.7. Samples: 1396854720. Policy #0 lag: (min: 2.0, avg: 12.1, max: 23.0) [2024-06-27 22:35:18,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:35:22,468][06909] Updated weights for policy 0, policy_version 91193 (0.0028) [2024-06-27 22:35:23,850][06674] Fps is (10 sec: 50789.3, 60 sec: 44509.7, 300 sec: 44042.4). Total num frames: 1494204416. Throughput: 0: 43946.5. Samples: 1397125300. Policy #0 lag: (min: 2.0, avg: 12.1, max: 23.0) [2024-06-27 22:35:23,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:35:25,849][06909] Updated weights for policy 0, policy_version 91203 (0.0033) [2024-06-27 22:35:28,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 1494351872. Throughput: 0: 44237.7. Samples: 1397266100. Policy #0 lag: (min: 2.0, avg: 12.1, max: 23.0) [2024-06-27 22:35:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:35:30,083][06909] Updated weights for policy 0, policy_version 91213 (0.0037) [2024-06-27 22:35:31,063][06887] Signal inference workers to stop experience collection... (19950 times) [2024-06-27 22:35:31,114][06909] InferenceWorker_p0-w0: stopping experience collection (19950 times) [2024-06-27 22:35:31,114][06887] Signal inference workers to resume experience collection... (19950 times) [2024-06-27 22:35:31,134][06909] InferenceWorker_p0-w0: resuming experience collection (19950 times) [2024-06-27 22:35:33,120][06909] Updated weights for policy 0, policy_version 91223 (0.0031) [2024-06-27 22:35:33,850][06674] Fps is (10 sec: 39322.1, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 1494597632. Throughput: 0: 44159.0. Samples: 1397519500. Policy #0 lag: (min: 2.0, avg: 12.1, max: 23.0) [2024-06-27 22:35:33,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:35:37,482][06909] Updated weights for policy 0, policy_version 91233 (0.0027) [2024-06-27 22:35:38,850][06674] Fps is (10 sec: 49152.9, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 1494843392. Throughput: 0: 44009.0. Samples: 1397782360. Policy #0 lag: (min: 2.0, avg: 12.1, max: 23.0) [2024-06-27 22:35:38,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:35:40,395][06909] Updated weights for policy 0, policy_version 91243 (0.0031) [2024-06-27 22:35:43,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 1495023616. Throughput: 0: 44236.1. Samples: 1397926120. Policy #0 lag: (min: 2.0, avg: 12.1, max: 23.0) [2024-06-27 22:35:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:35:44,969][06909] Updated weights for policy 0, policy_version 91253 (0.0025) [2024-06-27 22:35:47,619][06909] Updated weights for policy 0, policy_version 91263 (0.0027) [2024-06-27 22:35:48,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1495252992. Throughput: 0: 44236.5. Samples: 1398182940. Policy #0 lag: (min: 2.0, avg: 12.1, max: 23.0) [2024-06-27 22:35:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:35:48,864][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000091263_1495252992.pth... [2024-06-27 22:35:48,928][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000090621_1484734464.pth [2024-06-27 22:35:52,492][06909] Updated weights for policy 0, policy_version 91273 (0.0036) [2024-06-27 22:35:53,850][06674] Fps is (10 sec: 47513.0, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 1495498752. Throughput: 0: 43937.3. Samples: 1398444480. Policy #0 lag: (min: 2.0, avg: 12.1, max: 23.0) [2024-06-27 22:35:53,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:35:55,616][06909] Updated weights for policy 0, policy_version 91283 (0.0037) [2024-06-27 22:35:58,850][06674] Fps is (10 sec: 42598.1, 60 sec: 44236.8, 300 sec: 43876.1). Total num frames: 1495678976. Throughput: 0: 44200.7. Samples: 1398587620. Policy #0 lag: (min: 2.0, avg: 12.1, max: 23.0) [2024-06-27 22:35:58,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 22:35:59,704][06909] Updated weights for policy 0, policy_version 91293 (0.0022) [2024-06-27 22:36:02,967][06909] Updated weights for policy 0, policy_version 91303 (0.0022) [2024-06-27 22:36:03,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1495908352. Throughput: 0: 44268.5. Samples: 1398846800. Policy #0 lag: (min: 2.0, avg: 12.1, max: 23.0) [2024-06-27 22:36:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:36:07,120][06909] Updated weights for policy 0, policy_version 91313 (0.0035) [2024-06-27 22:36:08,856][06674] Fps is (10 sec: 50759.9, 60 sec: 44505.3, 300 sec: 44042.4). Total num frames: 1496186880. Throughput: 0: 44008.4. Samples: 1399105940. Policy #0 lag: (min: 2.0, avg: 12.1, max: 23.0) [2024-06-27 22:36:08,856][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:36:10,307][06909] Updated weights for policy 0, policy_version 91323 (0.0044) [2024-06-27 22:36:13,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.8, 300 sec: 43875.8). Total num frames: 1496350720. Throughput: 0: 44056.6. Samples: 1399248640. Policy #0 lag: (min: 0.0, avg: 11.5, max: 23.0) [2024-06-27 22:36:13,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 22:36:14,745][06909] Updated weights for policy 0, policy_version 91333 (0.0024) [2024-06-27 22:36:17,598][06909] Updated weights for policy 0, policy_version 91343 (0.0028) [2024-06-27 22:36:18,850][06674] Fps is (10 sec: 39345.7, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1496580096. Throughput: 0: 44056.9. Samples: 1399502060. Policy #0 lag: (min: 0.0, avg: 11.5, max: 23.0) [2024-06-27 22:36:18,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:36:22,297][06909] Updated weights for policy 0, policy_version 91353 (0.0026) [2024-06-27 22:36:23,850][06674] Fps is (10 sec: 49151.6, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1496842240. Throughput: 0: 44047.4. Samples: 1399764500. Policy #0 lag: (min: 0.0, avg: 11.5, max: 23.0) [2024-06-27 22:36:23,851][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 22:36:24,955][06909] Updated weights for policy 0, policy_version 91363 (0.0032) [2024-06-27 22:36:28,850][06674] Fps is (10 sec: 42597.8, 60 sec: 44236.8, 300 sec: 43875.8). Total num frames: 1497006080. Throughput: 0: 43982.9. Samples: 1399905360. Policy #0 lag: (min: 0.0, avg: 11.5, max: 23.0) [2024-06-27 22:36:28,851][06674] Avg episode reward: [(0, '0.411')] [2024-06-27 22:36:29,542][06909] Updated weights for policy 0, policy_version 91373 (0.0031) [2024-06-27 22:36:32,515][06909] Updated weights for policy 0, policy_version 91383 (0.0041) [2024-06-27 22:36:33,850][06674] Fps is (10 sec: 39321.3, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1497235456. Throughput: 0: 44016.4. Samples: 1400163680. Policy #0 lag: (min: 0.0, avg: 11.5, max: 23.0) [2024-06-27 22:36:33,854][06674] Avg episode reward: [(0, '0.410')] [2024-06-27 22:36:36,443][06887] Signal inference workers to stop experience collection... (20000 times) [2024-06-27 22:36:36,468][06909] InferenceWorker_p0-w0: stopping experience collection (20000 times) [2024-06-27 22:36:36,501][06887] Signal inference workers to resume experience collection... (20000 times) [2024-06-27 22:36:36,504][06909] InferenceWorker_p0-w0: resuming experience collection (20000 times) [2024-06-27 22:36:36,802][06909] Updated weights for policy 0, policy_version 91393 (0.0037) [2024-06-27 22:36:38,850][06674] Fps is (10 sec: 49151.8, 60 sec: 44236.6, 300 sec: 44042.5). Total num frames: 1497497600. Throughput: 0: 44075.5. Samples: 1400427880. Policy #0 lag: (min: 0.0, avg: 11.5, max: 23.0) [2024-06-27 22:36:38,850][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 22:36:40,045][06909] Updated weights for policy 0, policy_version 91403 (0.0040) [2024-06-27 22:36:43,852][06674] Fps is (10 sec: 44228.1, 60 sec: 44235.2, 300 sec: 43931.0). Total num frames: 1497677824. Throughput: 0: 44097.6. Samples: 1400572100. Policy #0 lag: (min: 0.0, avg: 11.5, max: 23.0) [2024-06-27 22:36:43,853][06674] Avg episode reward: [(0, '0.409')] [2024-06-27 22:36:44,337][06909] Updated weights for policy 0, policy_version 91413 (0.0041) [2024-06-27 22:36:47,332][06909] Updated weights for policy 0, policy_version 91423 (0.0045) [2024-06-27 22:36:48,855][06674] Fps is (10 sec: 40940.2, 60 sec: 44233.1, 300 sec: 44097.2). Total num frames: 1497907200. Throughput: 0: 43969.3. Samples: 1400825640. Policy #0 lag: (min: 0.0, avg: 11.5, max: 23.0) [2024-06-27 22:36:48,855][06674] Avg episode reward: [(0, '0.412')] [2024-06-27 22:36:51,855][06909] Updated weights for policy 0, policy_version 91433 (0.0025) [2024-06-27 22:36:53,850][06674] Fps is (10 sec: 47523.6, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1498152960. Throughput: 0: 44106.0. Samples: 1401090440. Policy #0 lag: (min: 0.0, avg: 11.5, max: 23.0) [2024-06-27 22:36:53,850][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 22:36:54,983][06909] Updated weights for policy 0, policy_version 91443 (0.0030) [2024-06-27 22:36:58,850][06674] Fps is (10 sec: 40980.0, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 1498316800. Throughput: 0: 43966.5. Samples: 1401227140. Policy #0 lag: (min: 0.0, avg: 11.5, max: 23.0) [2024-06-27 22:36:58,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 22:36:59,394][06909] Updated weights for policy 0, policy_version 91453 (0.0033) [2024-06-27 22:37:02,290][06909] Updated weights for policy 0, policy_version 91463 (0.0025) [2024-06-27 22:37:03,850][06674] Fps is (10 sec: 39321.3, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1498546176. Throughput: 0: 43975.0. Samples: 1401480940. Policy #0 lag: (min: 0.0, avg: 11.5, max: 23.0) [2024-06-27 22:37:03,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 22:37:06,696][06909] Updated weights for policy 0, policy_version 91473 (0.0026) [2024-06-27 22:37:08,850][06674] Fps is (10 sec: 49152.0, 60 sec: 43695.0, 300 sec: 44042.4). Total num frames: 1498808320. Throughput: 0: 44107.5. Samples: 1401749340. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-27 22:37:08,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:37:09,820][06909] Updated weights for policy 0, policy_version 91483 (0.0035) [2024-06-27 22:37:13,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.7, 300 sec: 43931.3). Total num frames: 1499004928. Throughput: 0: 44105.4. Samples: 1401890100. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-27 22:37:13,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 22:37:14,189][06909] Updated weights for policy 0, policy_version 91493 (0.0024) [2024-06-27 22:37:17,755][06909] Updated weights for policy 0, policy_version 91503 (0.0021) [2024-06-27 22:37:18,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1499217920. Throughput: 0: 44206.4. Samples: 1402152960. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-27 22:37:18,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 22:37:21,442][06909] Updated weights for policy 0, policy_version 91513 (0.0025) [2024-06-27 22:37:23,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1499463680. Throughput: 0: 44104.2. Samples: 1402412560. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-27 22:37:23,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 22:37:24,920][06909] Updated weights for policy 0, policy_version 91523 (0.0033) [2024-06-27 22:37:28,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.9, 300 sec: 43876.1). Total num frames: 1499660288. Throughput: 0: 44085.2. Samples: 1402555840. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-27 22:37:28,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:37:28,874][06909] Updated weights for policy 0, policy_version 91533 (0.0027) [2024-06-27 22:37:32,100][06909] Updated weights for policy 0, policy_version 91543 (0.0032) [2024-06-27 22:37:33,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43963.9, 300 sec: 43986.9). Total num frames: 1499873280. Throughput: 0: 44235.6. Samples: 1402816020. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-27 22:37:33,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 22:37:36,299][06909] Updated weights for policy 0, policy_version 91553 (0.0034) [2024-06-27 22:37:38,850][06674] Fps is (10 sec: 47513.1, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 1500135424. Throughput: 0: 44113.7. Samples: 1403075560. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-27 22:37:38,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:37:39,312][06909] Updated weights for policy 0, policy_version 91563 (0.0034) [2024-06-27 22:37:43,852][06674] Fps is (10 sec: 44227.5, 60 sec: 43963.8, 300 sec: 43875.5). Total num frames: 1500315648. Throughput: 0: 44200.8. Samples: 1403216260. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-27 22:37:43,852][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:37:44,049][06909] Updated weights for policy 0, policy_version 91573 (0.0035) [2024-06-27 22:37:44,202][06887] Signal inference workers to stop experience collection... (20050 times) [2024-06-27 22:37:44,243][06909] InferenceWorker_p0-w0: stopping experience collection (20050 times) [2024-06-27 22:37:44,259][06887] Signal inference workers to resume experience collection... (20050 times) [2024-06-27 22:37:44,260][06909] InferenceWorker_p0-w0: resuming experience collection (20050 times) [2024-06-27 22:37:47,039][06909] Updated weights for policy 0, policy_version 91583 (0.0026) [2024-06-27 22:37:48,850][06674] Fps is (10 sec: 39321.8, 60 sec: 43694.3, 300 sec: 43986.9). Total num frames: 1500528640. Throughput: 0: 44285.8. Samples: 1403473800. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-27 22:37:48,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:37:48,942][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000091586_1500545024.pth... [2024-06-27 22:37:49,010][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000090943_1490010112.pth [2024-06-27 22:37:51,394][06909] Updated weights for policy 0, policy_version 91593 (0.0036) [2024-06-27 22:37:53,850][06674] Fps is (10 sec: 45884.7, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1500774400. Throughput: 0: 44001.5. Samples: 1403729400. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-27 22:37:53,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:37:54,453][06909] Updated weights for policy 0, policy_version 91603 (0.0031) [2024-06-27 22:37:58,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.9, 300 sec: 43875.8). Total num frames: 1500971008. Throughput: 0: 43994.3. Samples: 1403869840. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-27 22:37:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:37:59,261][06909] Updated weights for policy 0, policy_version 91613 (0.0039) [2024-06-27 22:38:02,193][06909] Updated weights for policy 0, policy_version 91623 (0.0045) [2024-06-27 22:38:03,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 1501184000. Throughput: 0: 43656.5. Samples: 1404117500. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 22:38:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:38:06,598][06909] Updated weights for policy 0, policy_version 91633 (0.0023) [2024-06-27 22:38:08,850][06674] Fps is (10 sec: 49152.4, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 1501462528. Throughput: 0: 43826.3. Samples: 1404384740. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 22:38:08,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:38:09,362][06909] Updated weights for policy 0, policy_version 91643 (0.0027) [2024-06-27 22:38:13,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 1501626368. Throughput: 0: 43737.3. Samples: 1404524020. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 22:38:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:38:13,978][06909] Updated weights for policy 0, policy_version 91653 (0.0037) [2024-06-27 22:38:16,905][06909] Updated weights for policy 0, policy_version 91663 (0.0044) [2024-06-27 22:38:18,850][06674] Fps is (10 sec: 37682.7, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 1501839360. Throughput: 0: 43707.0. Samples: 1404782840. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 22:38:18,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:38:21,351][06909] Updated weights for policy 0, policy_version 91673 (0.0036) [2024-06-27 22:38:23,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1502085120. Throughput: 0: 43742.8. Samples: 1405043980. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 22:38:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:38:24,903][06909] Updated weights for policy 0, policy_version 91683 (0.0031) [2024-06-27 22:38:28,626][06909] Updated weights for policy 0, policy_version 91693 (0.0022) [2024-06-27 22:38:28,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.6, 300 sec: 43875.8). Total num frames: 1502298112. Throughput: 0: 43681.0. Samples: 1405181820. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 22:38:28,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:38:32,222][06909] Updated weights for policy 0, policy_version 91703 (0.0027) [2024-06-27 22:38:33,850][06674] Fps is (10 sec: 44235.8, 60 sec: 44236.7, 300 sec: 43931.3). Total num frames: 1502527488. Throughput: 0: 43801.2. Samples: 1405444860. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 22:38:33,851][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:38:36,139][06909] Updated weights for policy 0, policy_version 91713 (0.0039) [2024-06-27 22:38:38,850][06674] Fps is (10 sec: 45875.8, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1502756864. Throughput: 0: 43890.2. Samples: 1405704460. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 22:38:38,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 22:38:39,765][06909] Updated weights for policy 0, policy_version 91723 (0.0039) [2024-06-27 22:38:43,770][06909] Updated weights for policy 0, policy_version 91733 (0.0030) [2024-06-27 22:38:43,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43965.2, 300 sec: 43875.8). Total num frames: 1502953472. Throughput: 0: 43840.9. Samples: 1405842680. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 22:38:43,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 22:38:47,138][06909] Updated weights for policy 0, policy_version 91743 (0.0029) [2024-06-27 22:38:48,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 1503166464. Throughput: 0: 44247.5. Samples: 1406108640. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 22:38:48,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:38:51,237][06909] Updated weights for policy 0, policy_version 91753 (0.0040) [2024-06-27 22:38:53,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 1503428608. Throughput: 0: 43972.4. Samples: 1406363500. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-27 22:38:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:38:54,428][06909] Updated weights for policy 0, policy_version 91763 (0.0031) [2024-06-27 22:38:58,613][06909] Updated weights for policy 0, policy_version 91773 (0.0035) [2024-06-27 22:38:58,695][06887] Signal inference workers to stop experience collection... (20100 times) [2024-06-27 22:38:58,731][06909] InferenceWorker_p0-w0: stopping experience collection (20100 times) [2024-06-27 22:38:58,753][06887] Signal inference workers to resume experience collection... (20100 times) [2024-06-27 22:38:58,756][06909] InferenceWorker_p0-w0: resuming experience collection (20100 times) [2024-06-27 22:38:58,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.9, 300 sec: 43931.4). Total num frames: 1503625216. Throughput: 0: 44020.5. Samples: 1406504940. Policy #0 lag: (min: 1.0, avg: 9.6, max: 22.0) [2024-06-27 22:38:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:39:02,209][06909] Updated weights for policy 0, policy_version 91783 (0.0025) [2024-06-27 22:39:03,852][06674] Fps is (10 sec: 40951.5, 60 sec: 44235.2, 300 sec: 43819.9). Total num frames: 1503838208. Throughput: 0: 44031.4. Samples: 1406764340. Policy #0 lag: (min: 1.0, avg: 9.6, max: 22.0) [2024-06-27 22:39:03,852][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:39:05,875][06909] Updated weights for policy 0, policy_version 91793 (0.0037) [2024-06-27 22:39:08,850][06674] Fps is (10 sec: 44236.0, 60 sec: 43417.5, 300 sec: 44042.4). Total num frames: 1504067584. Throughput: 0: 44065.6. Samples: 1407026940. Policy #0 lag: (min: 1.0, avg: 9.6, max: 22.0) [2024-06-27 22:39:08,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:39:09,454][06909] Updated weights for policy 0, policy_version 91803 (0.0029) [2024-06-27 22:39:13,301][06909] Updated weights for policy 0, policy_version 91813 (0.0028) [2024-06-27 22:39:13,850][06674] Fps is (10 sec: 45884.6, 60 sec: 44509.8, 300 sec: 43986.9). Total num frames: 1504296960. Throughput: 0: 44017.4. Samples: 1407162600. Policy #0 lag: (min: 1.0, avg: 9.6, max: 22.0) [2024-06-27 22:39:13,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:39:17,078][06909] Updated weights for policy 0, policy_version 91823 (0.0024) [2024-06-27 22:39:18,850][06674] Fps is (10 sec: 42599.2, 60 sec: 44236.9, 300 sec: 43931.3). Total num frames: 1504493568. Throughput: 0: 43983.3. Samples: 1407424100. Policy #0 lag: (min: 1.0, avg: 9.6, max: 22.0) [2024-06-27 22:39:18,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:39:20,790][06909] Updated weights for policy 0, policy_version 91833 (0.0021) [2024-06-27 22:39:23,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44236.6, 300 sec: 44097.9). Total num frames: 1504739328. Throughput: 0: 43999.8. Samples: 1407684460. Policy #0 lag: (min: 1.0, avg: 9.6, max: 22.0) [2024-06-27 22:39:23,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:39:24,472][06909] Updated weights for policy 0, policy_version 91843 (0.0035) [2024-06-27 22:39:28,318][06909] Updated weights for policy 0, policy_version 91853 (0.0022) [2024-06-27 22:39:28,850][06674] Fps is (10 sec: 45874.5, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1504952320. Throughput: 0: 43968.4. Samples: 1407821260. Policy #0 lag: (min: 1.0, avg: 9.6, max: 22.0) [2024-06-27 22:39:28,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:39:31,777][06909] Updated weights for policy 0, policy_version 91863 (0.0026) [2024-06-27 22:39:33,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 1505148928. Throughput: 0: 43979.4. Samples: 1408087720. Policy #0 lag: (min: 1.0, avg: 9.6, max: 22.0) [2024-06-27 22:39:33,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:39:35,602][06909] Updated weights for policy 0, policy_version 91873 (0.0041) [2024-06-27 22:39:38,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 1505394688. Throughput: 0: 44101.7. Samples: 1408348080. Policy #0 lag: (min: 1.0, avg: 9.6, max: 22.0) [2024-06-27 22:39:38,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:39:39,326][06909] Updated weights for policy 0, policy_version 91883 (0.0032) [2024-06-27 22:39:42,986][06909] Updated weights for policy 0, policy_version 91893 (0.0034) [2024-06-27 22:39:43,852][06674] Fps is (10 sec: 45866.0, 60 sec: 44235.3, 300 sec: 44042.1). Total num frames: 1505607680. Throughput: 0: 43833.9. Samples: 1408477560. Policy #0 lag: (min: 1.0, avg: 9.6, max: 22.0) [2024-06-27 22:39:43,852][06674] Avg episode reward: [(0, '0.413')] [2024-06-27 22:39:46,859][06909] Updated weights for policy 0, policy_version 91903 (0.0024) [2024-06-27 22:39:48,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44236.7, 300 sec: 43931.3). Total num frames: 1505820672. Throughput: 0: 44183.8. Samples: 1408752520. Policy #0 lag: (min: 1.0, avg: 9.6, max: 22.0) [2024-06-27 22:39:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:39:48,862][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000091909_1505837056.pth... [2024-06-27 22:39:48,915][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000091263_1495252992.pth [2024-06-27 22:39:50,306][06909] Updated weights for policy 0, policy_version 91913 (0.0033) [2024-06-27 22:39:53,850][06674] Fps is (10 sec: 44245.8, 60 sec: 43690.6, 300 sec: 44153.5). Total num frames: 1506050048. Throughput: 0: 43989.4. Samples: 1409006460. Policy #0 lag: (min: 1.0, avg: 9.6, max: 22.0) [2024-06-27 22:39:53,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:39:54,058][06909] Updated weights for policy 0, policy_version 91923 (0.0030) [2024-06-27 22:39:57,795][06909] Updated weights for policy 0, policy_version 91933 (0.0027) [2024-06-27 22:39:58,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44236.6, 300 sec: 44042.4). Total num frames: 1506279424. Throughput: 0: 44011.9. Samples: 1409143140. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 22:39:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:40:01,791][06909] Updated weights for policy 0, policy_version 91943 (0.0027) [2024-06-27 22:40:03,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43965.3, 300 sec: 43931.3). Total num frames: 1506476032. Throughput: 0: 44231.1. Samples: 1409414500. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 22:40:03,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 22:40:05,020][06909] Updated weights for policy 0, policy_version 91953 (0.0038) [2024-06-27 22:40:08,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1506689024. Throughput: 0: 44216.0. Samples: 1409674180. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 22:40:08,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:40:09,052][06909] Updated weights for policy 0, policy_version 91963 (0.0026) [2024-06-27 22:40:12,633][06909] Updated weights for policy 0, policy_version 91973 (0.0031) [2024-06-27 22:40:13,850][06674] Fps is (10 sec: 45874.3, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 1506934784. Throughput: 0: 44072.8. Samples: 1409804540. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 22:40:13,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:40:16,696][06909] Updated weights for policy 0, policy_version 91983 (0.0037) [2024-06-27 22:40:18,381][06887] Signal inference workers to stop experience collection... (20150 times) [2024-06-27 22:40:18,381][06887] Signal inference workers to resume experience collection... (20150 times) [2024-06-27 22:40:18,396][06909] InferenceWorker_p0-w0: stopping experience collection (20150 times) [2024-06-27 22:40:18,396][06909] InferenceWorker_p0-w0: resuming experience collection (20150 times) [2024-06-27 22:40:18,850][06674] Fps is (10 sec: 45875.7, 60 sec: 44236.8, 300 sec: 43875.8). Total num frames: 1507147776. Throughput: 0: 44123.7. Samples: 1410073280. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 22:40:18,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:40:20,165][06909] Updated weights for policy 0, policy_version 91993 (0.0040) [2024-06-27 22:40:23,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 1507360768. Throughput: 0: 44180.0. Samples: 1410336180. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 22:40:23,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:40:24,148][06909] Updated weights for policy 0, policy_version 92003 (0.0037) [2024-06-27 22:40:27,534][06909] Updated weights for policy 0, policy_version 92013 (0.0034) [2024-06-27 22:40:28,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.9, 300 sec: 44042.4). Total num frames: 1507590144. Throughput: 0: 44105.7. Samples: 1410462220. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 22:40:28,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:40:31,812][06909] Updated weights for policy 0, policy_version 92023 (0.0034) [2024-06-27 22:40:33,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44509.9, 300 sec: 43986.9). Total num frames: 1507819520. Throughput: 0: 44003.2. Samples: 1410732660. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 22:40:33,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:40:34,818][06909] Updated weights for policy 0, policy_version 92033 (0.0033) [2024-06-27 22:40:38,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1508016128. Throughput: 0: 44276.0. Samples: 1410998880. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 22:40:38,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:40:38,929][06909] Updated weights for policy 0, policy_version 92043 (0.0024) [2024-06-27 22:40:42,187][06909] Updated weights for policy 0, policy_version 92053 (0.0029) [2024-06-27 22:40:43,850][06674] Fps is (10 sec: 44236.2, 60 sec: 44238.3, 300 sec: 44097.9). Total num frames: 1508261888. Throughput: 0: 44117.8. Samples: 1411128440. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 22:40:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:40:46,480][06909] Updated weights for policy 0, policy_version 92063 (0.0029) [2024-06-27 22:40:48,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1508474880. Throughput: 0: 44058.6. Samples: 1411397140. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-27 22:40:48,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:40:49,697][06909] Updated weights for policy 0, policy_version 92073 (0.0032) [2024-06-27 22:40:53,823][06909] Updated weights for policy 0, policy_version 92083 (0.0039) [2024-06-27 22:40:53,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 1508687872. Throughput: 0: 44201.4. Samples: 1411663240. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 22:40:53,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:40:56,999][06909] Updated weights for policy 0, policy_version 92093 (0.0029) [2024-06-27 22:40:58,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 1508933632. Throughput: 0: 44193.5. Samples: 1411793240. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 22:40:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:41:01,322][06909] Updated weights for policy 0, policy_version 92103 (0.0040) [2024-06-27 22:41:03,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.8, 300 sec: 43876.7). Total num frames: 1509130240. Throughput: 0: 44096.9. Samples: 1412057640. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 22:41:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:41:04,442][06909] Updated weights for policy 0, policy_version 92113 (0.0025) [2024-06-27 22:41:08,575][06909] Updated weights for policy 0, policy_version 92123 (0.0032) [2024-06-27 22:41:08,850][06674] Fps is (10 sec: 40959.5, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1509343232. Throughput: 0: 44260.4. Samples: 1412327900. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 22:41:08,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:41:11,668][06909] Updated weights for policy 0, policy_version 92133 (0.0035) [2024-06-27 22:41:13,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 1509588992. Throughput: 0: 44247.5. Samples: 1412453360. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 22:41:13,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 22:41:15,971][06909] Updated weights for policy 0, policy_version 92143 (0.0039) [2024-06-27 22:41:18,850][06674] Fps is (10 sec: 47514.1, 60 sec: 44509.9, 300 sec: 43986.9). Total num frames: 1509818368. Throughput: 0: 44228.9. Samples: 1412722960. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 22:41:18,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 22:41:19,402][06909] Updated weights for policy 0, policy_version 92153 (0.0026) [2024-06-27 22:41:23,698][06909] Updated weights for policy 0, policy_version 92163 (0.0030) [2024-06-27 22:41:23,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1509998592. Throughput: 0: 44213.4. Samples: 1412988480. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 22:41:23,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:41:26,729][06909] Updated weights for policy 0, policy_version 92173 (0.0027) [2024-06-27 22:41:28,850][06674] Fps is (10 sec: 42597.9, 60 sec: 44236.7, 300 sec: 44098.0). Total num frames: 1510244352. Throughput: 0: 44246.7. Samples: 1413119540. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 22:41:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:41:30,929][06909] Updated weights for policy 0, policy_version 92183 (0.0027) [2024-06-27 22:41:33,852][06674] Fps is (10 sec: 45865.5, 60 sec: 43962.2, 300 sec: 43931.1). Total num frames: 1510457344. Throughput: 0: 44183.3. Samples: 1413385480. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 22:41:33,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:41:34,322][06909] Updated weights for policy 0, policy_version 92193 (0.0033) [2024-06-27 22:41:38,628][06909] Updated weights for policy 0, policy_version 92203 (0.0033) [2024-06-27 22:41:38,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43963.6, 300 sec: 43987.2). Total num frames: 1510653952. Throughput: 0: 44084.8. Samples: 1413647060. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 22:41:38,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:41:41,590][06909] Updated weights for policy 0, policy_version 92213 (0.0034) [2024-06-27 22:41:43,850][06674] Fps is (10 sec: 44246.0, 60 sec: 43963.8, 300 sec: 44043.2). Total num frames: 1510899712. Throughput: 0: 43994.6. Samples: 1413773000. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 22:41:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:41:45,843][06909] Updated weights for policy 0, policy_version 92223 (0.0035) [2024-06-27 22:41:48,850][06674] Fps is (10 sec: 47514.0, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1511129088. Throughput: 0: 44086.1. Samples: 1414041520. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2024-06-27 22:41:48,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:41:48,906][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000092233_1511145472.pth... [2024-06-27 22:41:48,911][06909] Updated weights for policy 0, policy_version 92233 (0.0032) [2024-06-27 22:41:48,962][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000091586_1500545024.pth [2024-06-27 22:41:50,119][06887] Signal inference workers to stop experience collection... (20200 times) [2024-06-27 22:41:50,121][06887] Signal inference workers to resume experience collection... (20200 times) [2024-06-27 22:41:50,154][06909] InferenceWorker_p0-w0: stopping experience collection (20200 times) [2024-06-27 22:41:50,154][06909] InferenceWorker_p0-w0: resuming experience collection (20200 times) [2024-06-27 22:41:53,243][06909] Updated weights for policy 0, policy_version 92243 (0.0031) [2024-06-27 22:41:53,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1511309312. Throughput: 0: 43879.2. Samples: 1414302460. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2024-06-27 22:41:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:41:56,535][06909] Updated weights for policy 0, policy_version 92253 (0.0028) [2024-06-27 22:41:58,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 1511587840. Throughput: 0: 44007.1. Samples: 1414433680. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2024-06-27 22:41:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:42:00,723][06909] Updated weights for policy 0, policy_version 92263 (0.0030) [2024-06-27 22:42:03,850][06674] Fps is (10 sec: 47513.4, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1511784448. Throughput: 0: 44010.2. Samples: 1414703420. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2024-06-27 22:42:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:42:03,979][06909] Updated weights for policy 0, policy_version 92273 (0.0039) [2024-06-27 22:42:07,866][06909] Updated weights for policy 0, policy_version 92283 (0.0032) [2024-06-27 22:42:08,850][06674] Fps is (10 sec: 39321.6, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1511981056. Throughput: 0: 44134.6. Samples: 1414974540. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2024-06-27 22:42:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:42:11,462][06909] Updated weights for policy 0, policy_version 92293 (0.0036) [2024-06-27 22:42:13,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 1512243200. Throughput: 0: 44085.8. Samples: 1415103400. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2024-06-27 22:42:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:42:15,558][06909] Updated weights for policy 0, policy_version 92303 (0.0040) [2024-06-27 22:42:18,646][06909] Updated weights for policy 0, policy_version 92313 (0.0031) [2024-06-27 22:42:18,850][06674] Fps is (10 sec: 47512.3, 60 sec: 43963.5, 300 sec: 44042.4). Total num frames: 1512456192. Throughput: 0: 44016.0. Samples: 1415366120. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2024-06-27 22:42:18,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:42:23,063][06909] Updated weights for policy 0, policy_version 92323 (0.0020) [2024-06-27 22:42:23,850][06674] Fps is (10 sec: 40960.1, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 1512652800. Throughput: 0: 44300.1. Samples: 1415640560. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2024-06-27 22:42:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:42:26,534][06909] Updated weights for policy 0, policy_version 92333 (0.0035) [2024-06-27 22:42:28,850][06674] Fps is (10 sec: 44237.6, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1512898560. Throughput: 0: 44303.4. Samples: 1415766660. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2024-06-27 22:42:28,851][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 22:42:30,499][06909] Updated weights for policy 0, policy_version 92343 (0.0026) [2024-06-27 22:42:33,852][06674] Fps is (10 sec: 44227.6, 60 sec: 43963.7, 300 sec: 43931.0). Total num frames: 1513095168. Throughput: 0: 44213.1. Samples: 1416031200. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2024-06-27 22:42:33,853][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 22:42:34,018][06909] Updated weights for policy 0, policy_version 92353 (0.0039) [2024-06-27 22:42:37,633][06909] Updated weights for policy 0, policy_version 92363 (0.0048) [2024-06-27 22:42:38,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44783.0, 300 sec: 44153.8). Total num frames: 1513340928. Throughput: 0: 44434.6. Samples: 1416302020. Policy #0 lag: (min: 0.0, avg: 8.6, max: 22.0) [2024-06-27 22:42:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:42:41,165][06909] Updated weights for policy 0, policy_version 92373 (0.0031) [2024-06-27 22:42:43,852][06674] Fps is (10 sec: 45875.3, 60 sec: 44235.3, 300 sec: 44153.2). Total num frames: 1513553920. Throughput: 0: 44384.2. Samples: 1416431060. Policy #0 lag: (min: 1.0, avg: 10.9, max: 20.0) [2024-06-27 22:42:43,852][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:42:45,381][06909] Updated weights for policy 0, policy_version 92383 (0.0034) [2024-06-27 22:42:48,603][06909] Updated weights for policy 0, policy_version 92393 (0.0029) [2024-06-27 22:42:48,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1513766912. Throughput: 0: 44243.9. Samples: 1416694400. Policy #0 lag: (min: 1.0, avg: 10.9, max: 20.0) [2024-06-27 22:42:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:42:52,791][06909] Updated weights for policy 0, policy_version 92403 (0.0033) [2024-06-27 22:42:53,850][06674] Fps is (10 sec: 40968.6, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1513963520. Throughput: 0: 44100.5. Samples: 1416959060. Policy #0 lag: (min: 1.0, avg: 10.9, max: 20.0) [2024-06-27 22:42:53,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:42:55,863][06909] Updated weights for policy 0, policy_version 92413 (0.0040) [2024-06-27 22:42:58,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.6, 300 sec: 44153.5). Total num frames: 1514209280. Throughput: 0: 44004.9. Samples: 1417083620. Policy #0 lag: (min: 1.0, avg: 10.9, max: 20.0) [2024-06-27 22:42:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:43:00,212][06909] Updated weights for policy 0, policy_version 92423 (0.0026) [2024-06-27 22:43:00,872][06887] Signal inference workers to stop experience collection... (20250 times) [2024-06-27 22:43:00,900][06909] InferenceWorker_p0-w0: stopping experience collection (20250 times) [2024-06-27 22:43:00,929][06887] Signal inference workers to resume experience collection... (20250 times) [2024-06-27 22:43:00,935][06909] InferenceWorker_p0-w0: resuming experience collection (20250 times) [2024-06-27 22:43:03,837][06909] Updated weights for policy 0, policy_version 92433 (0.0026) [2024-06-27 22:43:03,853][06674] Fps is (10 sec: 45861.5, 60 sec: 43961.6, 300 sec: 43930.9). Total num frames: 1514422272. Throughput: 0: 44099.6. Samples: 1417350720. Policy #0 lag: (min: 1.0, avg: 10.9, max: 20.0) [2024-06-27 22:43:03,853][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:43:07,501][06909] Updated weights for policy 0, policy_version 92443 (0.0039) [2024-06-27 22:43:08,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 1514651648. Throughput: 0: 43884.8. Samples: 1417615380. Policy #0 lag: (min: 1.0, avg: 10.9, max: 20.0) [2024-06-27 22:43:08,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:43:11,193][06909] Updated weights for policy 0, policy_version 92453 (0.0029) [2024-06-27 22:43:13,850][06674] Fps is (10 sec: 44249.4, 60 sec: 43690.6, 300 sec: 44153.5). Total num frames: 1514864640. Throughput: 0: 44044.4. Samples: 1417748660. Policy #0 lag: (min: 1.0, avg: 10.9, max: 20.0) [2024-06-27 22:43:13,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:43:14,821][06909] Updated weights for policy 0, policy_version 92463 (0.0039) [2024-06-27 22:43:18,675][06909] Updated weights for policy 0, policy_version 92473 (0.0037) [2024-06-27 22:43:18,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.8, 300 sec: 44042.4). Total num frames: 1515077632. Throughput: 0: 43873.5. Samples: 1418005420. Policy #0 lag: (min: 1.0, avg: 10.9, max: 20.0) [2024-06-27 22:43:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:43:22,863][06909] Updated weights for policy 0, policy_version 92483 (0.0022) [2024-06-27 22:43:23,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1515290624. Throughput: 0: 43673.4. Samples: 1418267320. Policy #0 lag: (min: 1.0, avg: 10.9, max: 20.0) [2024-06-27 22:43:23,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:43:25,975][06909] Updated weights for policy 0, policy_version 92493 (0.0027) [2024-06-27 22:43:28,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1515520000. Throughput: 0: 43717.1. Samples: 1418398240. Policy #0 lag: (min: 1.0, avg: 10.9, max: 20.0) [2024-06-27 22:43:28,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:43:29,950][06909] Updated weights for policy 0, policy_version 92503 (0.0034) [2024-06-27 22:43:33,184][06909] Updated weights for policy 0, policy_version 92513 (0.0034) [2024-06-27 22:43:33,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43965.3, 300 sec: 43986.9). Total num frames: 1515732992. Throughput: 0: 43763.7. Samples: 1418663760. Policy #0 lag: (min: 1.0, avg: 10.9, max: 20.0) [2024-06-27 22:43:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:43:37,474][06909] Updated weights for policy 0, policy_version 92523 (0.0031) [2024-06-27 22:43:38,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43690.6, 300 sec: 44097.9). Total num frames: 1515962368. Throughput: 0: 43920.3. Samples: 1418935480. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2024-06-27 22:43:38,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:43:40,492][06909] Updated weights for policy 0, policy_version 92533 (0.0025) [2024-06-27 22:43:43,850][06674] Fps is (10 sec: 45874.5, 60 sec: 43965.2, 300 sec: 44153.5). Total num frames: 1516191744. Throughput: 0: 44029.3. Samples: 1419064940. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2024-06-27 22:43:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:43:44,786][06909] Updated weights for policy 0, policy_version 92543 (0.0034) [2024-06-27 22:43:48,342][06909] Updated weights for policy 0, policy_version 92553 (0.0028) [2024-06-27 22:43:48,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1516404736. Throughput: 0: 43827.8. Samples: 1419322840. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2024-06-27 22:43:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:43:48,957][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000092555_1516421120.pth... [2024-06-27 22:43:49,007][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000091909_1505837056.pth [2024-06-27 22:43:52,187][06909] Updated weights for policy 0, policy_version 92563 (0.0024) [2024-06-27 22:43:53,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 1516617728. Throughput: 0: 43888.0. Samples: 1419590340. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2024-06-27 22:43:53,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:43:55,590][06909] Updated weights for policy 0, policy_version 92573 (0.0033) [2024-06-27 22:43:58,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 44098.3). Total num frames: 1516847104. Throughput: 0: 43823.7. Samples: 1419720720. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2024-06-27 22:43:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:43:59,589][06909] Updated weights for policy 0, policy_version 92583 (0.0030) [2024-06-27 22:44:02,920][06909] Updated weights for policy 0, policy_version 92593 (0.0040) [2024-06-27 22:44:03,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44239.0, 300 sec: 44098.0). Total num frames: 1517076480. Throughput: 0: 44066.8. Samples: 1419988420. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2024-06-27 22:44:03,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:44:07,152][06909] Updated weights for policy 0, policy_version 92603 (0.0030) [2024-06-27 22:44:08,420][06887] Signal inference workers to stop experience collection... (20300 times) [2024-06-27 22:44:08,420][06887] Signal inference workers to resume experience collection... (20300 times) [2024-06-27 22:44:08,433][06909] InferenceWorker_p0-w0: stopping experience collection (20300 times) [2024-06-27 22:44:08,433][06909] InferenceWorker_p0-w0: resuming experience collection (20300 times) [2024-06-27 22:44:08,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1517273088. Throughput: 0: 44121.3. Samples: 1420252780. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2024-06-27 22:44:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 22:44:10,261][06909] Updated weights for policy 0, policy_version 92613 (0.0022) [2024-06-27 22:44:13,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 1517502464. Throughput: 0: 44084.9. Samples: 1420382060. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2024-06-27 22:44:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:44:14,613][06909] Updated weights for policy 0, policy_version 92623 (0.0023) [2024-06-27 22:44:17,464][06909] Updated weights for policy 0, policy_version 92633 (0.0039) [2024-06-27 22:44:18,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1517715456. Throughput: 0: 43987.5. Samples: 1420643200. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2024-06-27 22:44:18,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:44:21,792][06909] Updated weights for policy 0, policy_version 92643 (0.0047) [2024-06-27 22:44:23,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1517944832. Throughput: 0: 44008.2. Samples: 1420915840. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2024-06-27 22:44:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:44:25,140][06909] Updated weights for policy 0, policy_version 92653 (0.0047) [2024-06-27 22:44:28,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1518174208. Throughput: 0: 44018.7. Samples: 1421045780. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2024-06-27 22:44:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 22:44:29,163][06909] Updated weights for policy 0, policy_version 92663 (0.0036) [2024-06-27 22:44:32,875][06909] Updated weights for policy 0, policy_version 92673 (0.0034) [2024-06-27 22:44:33,850][06674] Fps is (10 sec: 45874.3, 60 sec: 44509.7, 300 sec: 44097.9). Total num frames: 1518403584. Throughput: 0: 44079.4. Samples: 1421306420. Policy #0 lag: (min: 0.0, avg: 8.2, max: 19.0) [2024-06-27 22:44:33,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:44:36,654][06909] Updated weights for policy 0, policy_version 92683 (0.0032) [2024-06-27 22:44:38,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.8, 300 sec: 44042.7). Total num frames: 1518600192. Throughput: 0: 43983.1. Samples: 1421569580. Policy #0 lag: (min: 0.0, avg: 12.1, max: 23.0) [2024-06-27 22:44:38,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:44:40,281][06909] Updated weights for policy 0, policy_version 92693 (0.0040) [2024-06-27 22:44:43,850][06674] Fps is (10 sec: 40961.1, 60 sec: 43690.8, 300 sec: 44042.4). Total num frames: 1518813184. Throughput: 0: 43980.5. Samples: 1421699840. Policy #0 lag: (min: 0.0, avg: 12.1, max: 23.0) [2024-06-27 22:44:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-27 22:44:44,300][06909] Updated weights for policy 0, policy_version 92703 (0.0035) [2024-06-27 22:44:47,751][06909] Updated weights for policy 0, policy_version 92713 (0.0031) [2024-06-27 22:44:48,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1519058944. Throughput: 0: 43791.6. Samples: 1421959040. Policy #0 lag: (min: 0.0, avg: 12.1, max: 23.0) [2024-06-27 22:44:48,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:44:51,886][06909] Updated weights for policy 0, policy_version 92723 (0.0033) [2024-06-27 22:44:53,850][06674] Fps is (10 sec: 45874.2, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 1519271936. Throughput: 0: 43963.0. Samples: 1422231120. Policy #0 lag: (min: 0.0, avg: 12.1, max: 23.0) [2024-06-27 22:44:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:44:55,072][06909] Updated weights for policy 0, policy_version 92733 (0.0030) [2024-06-27 22:44:58,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1519468544. Throughput: 0: 44028.5. Samples: 1422363340. Policy #0 lag: (min: 0.0, avg: 12.1, max: 23.0) [2024-06-27 22:44:58,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:44:59,171][06909] Updated weights for policy 0, policy_version 92743 (0.0025) [2024-06-27 22:45:02,530][06909] Updated weights for policy 0, policy_version 92753 (0.0036) [2024-06-27 22:45:03,850][06674] Fps is (10 sec: 45875.7, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 1519730688. Throughput: 0: 44213.3. Samples: 1422632800. Policy #0 lag: (min: 0.0, avg: 12.1, max: 23.0) [2024-06-27 22:45:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:45:06,463][06909] Updated weights for policy 0, policy_version 92763 (0.0032) [2024-06-27 22:45:08,850][06674] Fps is (10 sec: 49151.4, 60 sec: 44782.9, 300 sec: 44153.5). Total num frames: 1519960064. Throughput: 0: 44078.2. Samples: 1422899360. Policy #0 lag: (min: 0.0, avg: 12.1, max: 23.0) [2024-06-27 22:45:08,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 22:45:09,882][06909] Updated weights for policy 0, policy_version 92773 (0.0033) [2024-06-27 22:45:13,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1520140288. Throughput: 0: 44018.1. Samples: 1423026600. Policy #0 lag: (min: 0.0, avg: 12.1, max: 23.0) [2024-06-27 22:45:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:45:14,084][06909] Updated weights for policy 0, policy_version 92783 (0.0042) [2024-06-27 22:45:17,357][06909] Updated weights for policy 0, policy_version 92793 (0.0026) [2024-06-27 22:45:18,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44782.8, 300 sec: 44209.0). Total num frames: 1520402432. Throughput: 0: 44220.0. Samples: 1423296320. Policy #0 lag: (min: 0.0, avg: 12.1, max: 23.0) [2024-06-27 22:45:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:45:21,386][06909] Updated weights for policy 0, policy_version 92803 (0.0030) [2024-06-27 22:45:23,850][06674] Fps is (10 sec: 47513.9, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 1520615424. Throughput: 0: 44306.7. Samples: 1423563380. Policy #0 lag: (min: 0.0, avg: 12.1, max: 23.0) [2024-06-27 22:45:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:45:24,987][06909] Updated weights for policy 0, policy_version 92813 (0.0027) [2024-06-27 22:45:28,850][06674] Fps is (10 sec: 39322.1, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1520795648. Throughput: 0: 44294.1. Samples: 1423693080. Policy #0 lag: (min: 0.0, avg: 12.1, max: 23.0) [2024-06-27 22:45:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:45:29,150][06909] Updated weights for policy 0, policy_version 92823 (0.0042) [2024-06-27 22:45:32,405][06909] Updated weights for policy 0, policy_version 92833 (0.0032) [2024-06-27 22:45:33,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 1521057792. Throughput: 0: 44570.5. Samples: 1423964720. Policy #0 lag: (min: 0.0, avg: 11.2, max: 23.0) [2024-06-27 22:45:33,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:45:36,375][06909] Updated weights for policy 0, policy_version 92843 (0.0029) [2024-06-27 22:45:38,850][06674] Fps is (10 sec: 49151.6, 60 sec: 44782.9, 300 sec: 44153.5). Total num frames: 1521287168. Throughput: 0: 44320.5. Samples: 1424225540. Policy #0 lag: (min: 0.0, avg: 11.2, max: 23.0) [2024-06-27 22:45:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:45:39,631][06909] Updated weights for policy 0, policy_version 92853 (0.0035) [2024-06-27 22:45:43,600][06909] Updated weights for policy 0, policy_version 92863 (0.0028) [2024-06-27 22:45:43,850][06674] Fps is (10 sec: 40960.2, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 1521467392. Throughput: 0: 44319.8. Samples: 1424357740. Policy #0 lag: (min: 0.0, avg: 11.2, max: 23.0) [2024-06-27 22:45:43,854][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:45:46,989][06909] Updated weights for policy 0, policy_version 92873 (0.0039) [2024-06-27 22:45:48,170][06887] Signal inference workers to stop experience collection... (20350 times) [2024-06-27 22:45:48,170][06887] Signal inference workers to resume experience collection... (20350 times) [2024-06-27 22:45:48,187][06909] InferenceWorker_p0-w0: stopping experience collection (20350 times) [2024-06-27 22:45:48,187][06909] InferenceWorker_p0-w0: resuming experience collection (20350 times) [2024-06-27 22:45:48,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 1521713152. Throughput: 0: 44210.6. Samples: 1424622280. Policy #0 lag: (min: 0.0, avg: 11.2, max: 23.0) [2024-06-27 22:45:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:45:48,868][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000092878_1521713152.pth... [2024-06-27 22:45:48,918][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000092233_1511145472.pth [2024-06-27 22:45:51,123][06909] Updated weights for policy 0, policy_version 92883 (0.0026) [2024-06-27 22:45:53,850][06674] Fps is (10 sec: 47514.2, 60 sec: 44510.0, 300 sec: 44097.9). Total num frames: 1521942528. Throughput: 0: 44047.2. Samples: 1424881480. Policy #0 lag: (min: 0.0, avg: 11.2, max: 23.0) [2024-06-27 22:45:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:45:54,600][06909] Updated weights for policy 0, policy_version 92893 (0.0044) [2024-06-27 22:45:58,546][06909] Updated weights for policy 0, policy_version 92903 (0.0026) [2024-06-27 22:45:58,850][06674] Fps is (10 sec: 42598.8, 60 sec: 44509.8, 300 sec: 44098.0). Total num frames: 1522139136. Throughput: 0: 44114.3. Samples: 1425011740. Policy #0 lag: (min: 0.0, avg: 11.2, max: 23.0) [2024-06-27 22:45:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:46:02,518][06909] Updated weights for policy 0, policy_version 92913 (0.0028) [2024-06-27 22:46:03,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 1522368512. Throughput: 0: 44059.3. Samples: 1425278980. Policy #0 lag: (min: 0.0, avg: 11.2, max: 23.0) [2024-06-27 22:46:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:46:06,025][06909] Updated weights for policy 0, policy_version 92923 (0.0028) [2024-06-27 22:46:08,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 1522597888. Throughput: 0: 43719.9. Samples: 1425530780. Policy #0 lag: (min: 0.0, avg: 11.2, max: 23.0) [2024-06-27 22:46:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:46:09,843][06909] Updated weights for policy 0, policy_version 92933 (0.0042) [2024-06-27 22:46:13,633][06909] Updated weights for policy 0, policy_version 92943 (0.0048) [2024-06-27 22:46:13,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43963.9, 300 sec: 43931.3). Total num frames: 1522778112. Throughput: 0: 43919.2. Samples: 1425669440. Policy #0 lag: (min: 0.0, avg: 11.2, max: 23.0) [2024-06-27 22:46:13,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:46:17,122][06909] Updated weights for policy 0, policy_version 92953 (0.0042) [2024-06-27 22:46:18,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 1523023872. Throughput: 0: 43890.8. Samples: 1425939800. Policy #0 lag: (min: 0.0, avg: 11.2, max: 23.0) [2024-06-27 22:46:18,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:46:20,937][06909] Updated weights for policy 0, policy_version 92963 (0.0034) [2024-06-27 22:46:23,850][06674] Fps is (10 sec: 47513.3, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 1523253248. Throughput: 0: 43800.5. Samples: 1426196560. Policy #0 lag: (min: 0.0, avg: 11.2, max: 23.0) [2024-06-27 22:46:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:46:24,417][06909] Updated weights for policy 0, policy_version 92973 (0.0038) [2024-06-27 22:46:28,368][06909] Updated weights for policy 0, policy_version 92983 (0.0028) [2024-06-27 22:46:28,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44509.9, 300 sec: 44098.3). Total num frames: 1523466240. Throughput: 0: 43877.9. Samples: 1426332240. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-27 22:46:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:46:32,212][06909] Updated weights for policy 0, policy_version 92993 (0.0026) [2024-06-27 22:46:33,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 1523679232. Throughput: 0: 43922.7. Samples: 1426598800. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-27 22:46:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 22:46:35,860][06909] Updated weights for policy 0, policy_version 93003 (0.0027) [2024-06-27 22:46:38,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 1523908608. Throughput: 0: 43796.4. Samples: 1426852320. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-27 22:46:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:46:39,598][06909] Updated weights for policy 0, policy_version 93013 (0.0033) [2024-06-27 22:46:43,160][06909] Updated weights for policy 0, policy_version 93023 (0.0033) [2024-06-27 22:46:43,852][06674] Fps is (10 sec: 44227.9, 60 sec: 44235.4, 300 sec: 44042.1). Total num frames: 1524121600. Throughput: 0: 44029.5. Samples: 1426993160. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-27 22:46:43,852][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:46:47,011][06909] Updated weights for policy 0, policy_version 93033 (0.0029) [2024-06-27 22:46:48,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 1524334592. Throughput: 0: 44005.3. Samples: 1427259220. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-27 22:46:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:46:50,627][06909] Updated weights for policy 0, policy_version 93043 (0.0042) [2024-06-27 22:46:53,850][06674] Fps is (10 sec: 44245.6, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 1524563968. Throughput: 0: 44165.4. Samples: 1427518220. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-27 22:46:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:46:54,219][06909] Updated weights for policy 0, policy_version 93053 (0.0038) [2024-06-27 22:46:58,150][06909] Updated weights for policy 0, policy_version 93063 (0.0038) [2024-06-27 22:46:58,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1524776960. Throughput: 0: 44193.7. Samples: 1427658160. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-27 22:46:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:47:00,731][06887] Signal inference workers to stop experience collection... (20400 times) [2024-06-27 22:47:00,734][06887] Signal inference workers to resume experience collection... (20400 times) [2024-06-27 22:47:00,772][06909] InferenceWorker_p0-w0: stopping experience collection (20400 times) [2024-06-27 22:47:00,772][06909] InferenceWorker_p0-w0: resuming experience collection (20400 times) [2024-06-27 22:47:01,478][06909] Updated weights for policy 0, policy_version 93073 (0.0035) [2024-06-27 22:47:03,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 1524989952. Throughput: 0: 43925.4. Samples: 1427916440. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-27 22:47:03,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 22:47:05,302][06909] Updated weights for policy 0, policy_version 93083 (0.0031) [2024-06-27 22:47:08,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43690.8, 300 sec: 43986.9). Total num frames: 1525219328. Throughput: 0: 44140.1. Samples: 1428182860. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-27 22:47:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:47:09,116][06909] Updated weights for policy 0, policy_version 93093 (0.0032) [2024-06-27 22:47:12,763][06909] Updated weights for policy 0, policy_version 93103 (0.0029) [2024-06-27 22:47:13,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 1525432320. Throughput: 0: 44091.5. Samples: 1428316360. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-27 22:47:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:47:16,654][06909] Updated weights for policy 0, policy_version 93113 (0.0034) [2024-06-27 22:47:18,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 1525661696. Throughput: 0: 44065.0. Samples: 1428581720. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-27 22:47:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:47:20,058][06909] Updated weights for policy 0, policy_version 93123 (0.0034) [2024-06-27 22:47:23,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1525874688. Throughput: 0: 44336.9. Samples: 1428847480. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 22:47:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:47:24,078][06909] Updated weights for policy 0, policy_version 93133 (0.0039) [2024-06-27 22:47:27,829][06909] Updated weights for policy 0, policy_version 93143 (0.0028) [2024-06-27 22:47:28,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.7, 300 sec: 44098.3). Total num frames: 1526104064. Throughput: 0: 44140.2. Samples: 1428979380. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 22:47:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:47:31,478][06909] Updated weights for policy 0, policy_version 93153 (0.0033) [2024-06-27 22:47:33,850][06674] Fps is (10 sec: 44235.9, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 1526317056. Throughput: 0: 44026.9. Samples: 1429240440. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 22:47:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:47:35,285][06909] Updated weights for policy 0, policy_version 93163 (0.0025) [2024-06-27 22:47:38,798][06909] Updated weights for policy 0, policy_version 93173 (0.0033) [2024-06-27 22:47:38,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.8, 300 sec: 44042.7). Total num frames: 1526546432. Throughput: 0: 44318.4. Samples: 1429512540. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 22:47:38,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 22:47:42,564][06909] Updated weights for policy 0, policy_version 93183 (0.0027) [2024-06-27 22:47:43,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43692.1, 300 sec: 43986.9). Total num frames: 1526743040. Throughput: 0: 43985.3. Samples: 1429637500. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 22:47:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:47:46,654][06909] Updated weights for policy 0, policy_version 93193 (0.0031) [2024-06-27 22:47:48,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1526988800. Throughput: 0: 44092.0. Samples: 1429900580. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 22:47:48,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:47:48,862][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000093200_1526988800.pth... [2024-06-27 22:47:48,915][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000092555_1516421120.pth [2024-06-27 22:47:49,830][06909] Updated weights for policy 0, policy_version 93203 (0.0025) [2024-06-27 22:47:53,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1527185408. Throughput: 0: 44239.4. Samples: 1430173640. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 22:47:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:47:54,108][06909] Updated weights for policy 0, policy_version 93213 (0.0044) [2024-06-27 22:47:57,267][06909] Updated weights for policy 0, policy_version 93223 (0.0031) [2024-06-27 22:47:58,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44509.9, 300 sec: 44153.9). Total num frames: 1527447552. Throughput: 0: 44188.0. Samples: 1430304820. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 22:47:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:48:01,695][06909] Updated weights for policy 0, policy_version 93233 (0.0035) [2024-06-27 22:48:03,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1527644160. Throughput: 0: 43999.1. Samples: 1430561680. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 22:48:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:48:04,729][06909] Updated weights for policy 0, policy_version 93243 (0.0029) [2024-06-27 22:48:05,521][06887] Signal inference workers to stop experience collection... (20450 times) [2024-06-27 22:48:05,522][06887] Signal inference workers to resume experience collection... (20450 times) [2024-06-27 22:48:05,553][06909] InferenceWorker_p0-w0: stopping experience collection (20450 times) [2024-06-27 22:48:05,554][06909] InferenceWorker_p0-w0: resuming experience collection (20450 times) [2024-06-27 22:48:08,723][06909] Updated weights for policy 0, policy_version 93253 (0.0021) [2024-06-27 22:48:08,850][06674] Fps is (10 sec: 40959.0, 60 sec: 43963.5, 300 sec: 44042.4). Total num frames: 1527857152. Throughput: 0: 44262.8. Samples: 1430839320. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 22:48:08,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:48:12,232][06909] Updated weights for policy 0, policy_version 93263 (0.0041) [2024-06-27 22:48:13,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1528086528. Throughput: 0: 44187.0. Samples: 1430967800. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 22:48:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:48:15,969][06909] Updated weights for policy 0, policy_version 93273 (0.0034) [2024-06-27 22:48:18,850][06674] Fps is (10 sec: 44237.8, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 1528299520. Throughput: 0: 44202.4. Samples: 1431229540. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-27 22:48:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:48:19,475][06909] Updated weights for policy 0, policy_version 93283 (0.0026) [2024-06-27 22:48:23,180][06909] Updated weights for policy 0, policy_version 93293 (0.0032) [2024-06-27 22:48:23,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1528512512. Throughput: 0: 44054.2. Samples: 1431494980. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 22:48:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 22:48:26,935][06909] Updated weights for policy 0, policy_version 93303 (0.0033) [2024-06-27 22:48:28,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1528758272. Throughput: 0: 44257.8. Samples: 1431629100. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 22:48:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:48:30,876][06909] Updated weights for policy 0, policy_version 93313 (0.0036) [2024-06-27 22:48:33,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44237.0, 300 sec: 44098.0). Total num frames: 1528971264. Throughput: 0: 44214.7. Samples: 1431890240. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 22:48:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:48:34,471][06909] Updated weights for policy 0, policy_version 93323 (0.0039) [2024-06-27 22:48:38,406][06909] Updated weights for policy 0, policy_version 93333 (0.0035) [2024-06-27 22:48:38,852][06674] Fps is (10 sec: 44227.5, 60 sec: 44235.2, 300 sec: 44097.7). Total num frames: 1529200640. Throughput: 0: 44179.3. Samples: 1432161800. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 22:48:38,852][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:48:41,826][06909] Updated weights for policy 0, policy_version 93343 (0.0033) [2024-06-27 22:48:43,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44782.9, 300 sec: 44153.5). Total num frames: 1529430016. Throughput: 0: 44208.4. Samples: 1432294200. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 22:48:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:48:45,795][06909] Updated weights for policy 0, policy_version 93353 (0.0028) [2024-06-27 22:48:48,850][06674] Fps is (10 sec: 40968.7, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1529610240. Throughput: 0: 44259.1. Samples: 1432553340. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 22:48:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:48:49,386][06909] Updated weights for policy 0, policy_version 93363 (0.0024) [2024-06-27 22:48:53,110][06909] Updated weights for policy 0, policy_version 93373 (0.0031) [2024-06-27 22:48:53,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 1529856000. Throughput: 0: 43984.2. Samples: 1432818600. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 22:48:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:48:56,670][06909] Updated weights for policy 0, policy_version 93383 (0.0038) [2024-06-27 22:48:58,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1530068992. Throughput: 0: 44111.2. Samples: 1432952800. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 22:48:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:49:00,395][06909] Updated weights for policy 0, policy_version 93393 (0.0040) [2024-06-27 22:49:03,856][06674] Fps is (10 sec: 44210.9, 60 sec: 44232.5, 300 sec: 44152.6). Total num frames: 1530298368. Throughput: 0: 44095.2. Samples: 1433214080. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 22:49:03,856][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:49:04,084][06909] Updated weights for policy 0, policy_version 93403 (0.0031) [2024-06-27 22:49:08,042][06909] Updated weights for policy 0, policy_version 93413 (0.0037) [2024-06-27 22:49:08,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44237.0, 300 sec: 44098.0). Total num frames: 1530511360. Throughput: 0: 44137.3. Samples: 1433481160. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 22:49:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:49:11,821][06909] Updated weights for policy 0, policy_version 93423 (0.0040) [2024-06-27 22:49:13,856][06674] Fps is (10 sec: 45874.6, 60 sec: 44505.5, 300 sec: 44208.1). Total num frames: 1530757120. Throughput: 0: 44067.9. Samples: 1433612420. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 22:49:13,856][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:49:15,338][06909] Updated weights for policy 0, policy_version 93433 (0.0026) [2024-06-27 22:49:18,852][06674] Fps is (10 sec: 44227.5, 60 sec: 44235.3, 300 sec: 44097.6). Total num frames: 1530953728. Throughput: 0: 44202.4. Samples: 1433879440. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 22:49:18,852][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:49:19,144][06909] Updated weights for policy 0, policy_version 93443 (0.0042) [2024-06-27 22:49:22,920][06909] Updated weights for policy 0, policy_version 93453 (0.0032) [2024-06-27 22:49:23,850][06674] Fps is (10 sec: 42624.1, 60 sec: 44509.8, 300 sec: 44098.0). Total num frames: 1531183104. Throughput: 0: 43812.3. Samples: 1434133260. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 22:49:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:49:26,604][06909] Updated weights for policy 0, policy_version 93463 (0.0029) [2024-06-27 22:49:28,850][06674] Fps is (10 sec: 44245.9, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1531396096. Throughput: 0: 43855.1. Samples: 1434267680. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 22:49:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:49:30,253][06909] Updated weights for policy 0, policy_version 93473 (0.0029) [2024-06-27 22:49:33,604][06887] Signal inference workers to stop experience collection... (20500 times) [2024-06-27 22:49:33,606][06887] Signal inference workers to resume experience collection... (20500 times) [2024-06-27 22:49:33,636][06909] InferenceWorker_p0-w0: stopping experience collection (20500 times) [2024-06-27 22:49:33,636][06909] InferenceWorker_p0-w0: resuming experience collection (20500 times) [2024-06-27 22:49:33,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43963.6, 300 sec: 44098.0). Total num frames: 1531609088. Throughput: 0: 43963.9. Samples: 1434531720. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 22:49:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:49:33,887][06909] Updated weights for policy 0, policy_version 93483 (0.0028) [2024-06-27 22:49:37,662][06909] Updated weights for policy 0, policy_version 93493 (0.0028) [2024-06-27 22:49:38,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43965.2, 300 sec: 44153.5). Total num frames: 1531838464. Throughput: 0: 43899.4. Samples: 1434794080. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 22:49:38,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:49:41,588][06909] Updated weights for policy 0, policy_version 93503 (0.0031) [2024-06-27 22:49:43,850][06674] Fps is (10 sec: 45875.8, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 1532067840. Throughput: 0: 43920.4. Samples: 1434929220. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 22:49:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:49:45,148][06909] Updated weights for policy 0, policy_version 93513 (0.0028) [2024-06-27 22:49:48,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 1532264448. Throughput: 0: 44017.2. Samples: 1435194600. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 22:49:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:49:48,864][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000093522_1532264448.pth... [2024-06-27 22:49:48,910][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000092878_1521713152.pth [2024-06-27 22:49:49,187][06909] Updated weights for policy 0, policy_version 93523 (0.0033) [2024-06-27 22:49:52,383][06909] Updated weights for policy 0, policy_version 93533 (0.0028) [2024-06-27 22:49:53,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 1532493824. Throughput: 0: 43846.2. Samples: 1435454240. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 22:49:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:49:56,482][06909] Updated weights for policy 0, policy_version 93543 (0.0033) [2024-06-27 22:49:58,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 1532723200. Throughput: 0: 44013.8. Samples: 1435592780. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 22:49:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:50:00,016][06909] Updated weights for policy 0, policy_version 93553 (0.0027) [2024-06-27 22:50:03,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43694.9, 300 sec: 43931.3). Total num frames: 1532919808. Throughput: 0: 43833.1. Samples: 1435851840. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 22:50:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:50:04,359][06909] Updated weights for policy 0, policy_version 93563 (0.0024) [2024-06-27 22:50:07,272][06909] Updated weights for policy 0, policy_version 93573 (0.0021) [2024-06-27 22:50:08,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 1533149184. Throughput: 0: 44023.9. Samples: 1436114340. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 22:50:08,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:50:11,478][06909] Updated weights for policy 0, policy_version 93583 (0.0020) [2024-06-27 22:50:13,850][06674] Fps is (10 sec: 45875.9, 60 sec: 43695.1, 300 sec: 43986.9). Total num frames: 1533378560. Throughput: 0: 44076.5. Samples: 1436251120. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 22:50:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:50:14,658][06909] Updated weights for policy 0, policy_version 93593 (0.0027) [2024-06-27 22:50:18,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43692.1, 300 sec: 43931.3). Total num frames: 1533575168. Throughput: 0: 44336.0. Samples: 1436526840. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-27 22:50:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:50:19,038][06909] Updated weights for policy 0, policy_version 93603 (0.0032) [2024-06-27 22:50:22,393][06909] Updated weights for policy 0, policy_version 93613 (0.0046) [2024-06-27 22:50:23,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 1533820928. Throughput: 0: 44196.5. Samples: 1436782920. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-27 22:50:23,850][06674] Avg episode reward: [(0, '0.448')] [2024-06-27 22:50:26,312][06909] Updated weights for policy 0, policy_version 93623 (0.0023) [2024-06-27 22:50:28,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 1534050304. Throughput: 0: 44179.8. Samples: 1436917320. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-27 22:50:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 22:50:29,758][06909] Updated weights for policy 0, policy_version 93633 (0.0026) [2024-06-27 22:50:33,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 1534230528. Throughput: 0: 44086.2. Samples: 1437178480. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-27 22:50:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:50:34,162][06909] Updated weights for policy 0, policy_version 93643 (0.0028) [2024-06-27 22:50:37,048][06909] Updated weights for policy 0, policy_version 93653 (0.0033) [2024-06-27 22:50:38,852][06674] Fps is (10 sec: 42589.9, 60 sec: 43962.3, 300 sec: 44097.7). Total num frames: 1534476288. Throughput: 0: 44203.3. Samples: 1437443480. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-27 22:50:38,853][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:50:41,274][06909] Updated weights for policy 0, policy_version 93663 (0.0033) [2024-06-27 22:50:42,129][06887] Signal inference workers to stop experience collection... (20550 times) [2024-06-27 22:50:42,174][06909] InferenceWorker_p0-w0: stopping experience collection (20550 times) [2024-06-27 22:50:42,241][06887] Signal inference workers to resume experience collection... (20550 times) [2024-06-27 22:50:42,241][06909] InferenceWorker_p0-w0: resuming experience collection (20550 times) [2024-06-27 22:50:43,850][06674] Fps is (10 sec: 49152.7, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1534722048. Throughput: 0: 44049.4. Samples: 1437575000. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-27 22:50:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:50:44,394][06909] Updated weights for policy 0, policy_version 93673 (0.0030) [2024-06-27 22:50:48,393][06909] Updated weights for policy 0, policy_version 93683 (0.0034) [2024-06-27 22:50:48,850][06674] Fps is (10 sec: 42607.1, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 1534902272. Throughput: 0: 44130.7. Samples: 1437837720. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-27 22:50:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:50:51,815][06909] Updated weights for policy 0, policy_version 93693 (0.0036) [2024-06-27 22:50:53,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1535131648. Throughput: 0: 44289.5. Samples: 1438107360. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-27 22:50:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:50:56,334][06909] Updated weights for policy 0, policy_version 93703 (0.0032) [2024-06-27 22:50:58,850][06674] Fps is (10 sec: 47514.1, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 1535377408. Throughput: 0: 44293.7. Samples: 1438244340. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-27 22:50:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:50:59,527][06909] Updated weights for policy 0, policy_version 93713 (0.0035) [2024-06-27 22:51:03,555][06909] Updated weights for policy 0, policy_version 93723 (0.0028) [2024-06-27 22:51:03,850][06674] Fps is (10 sec: 45874.5, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 1535590400. Throughput: 0: 43977.8. Samples: 1438505840. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-27 22:51:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:51:06,742][06909] Updated weights for policy 0, policy_version 93733 (0.0022) [2024-06-27 22:51:08,850][06674] Fps is (10 sec: 42597.9, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1535803392. Throughput: 0: 44181.3. Samples: 1438771080. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-27 22:51:08,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:51:11,019][06909] Updated weights for policy 0, policy_version 93743 (0.0026) [2024-06-27 22:51:13,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.7, 300 sec: 44098.0). Total num frames: 1536032768. Throughput: 0: 44301.4. Samples: 1438910880. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2024-06-27 22:51:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:51:13,890][06909] Updated weights for policy 0, policy_version 93753 (0.0036) [2024-06-27 22:51:18,459][06909] Updated weights for policy 0, policy_version 93763 (0.0035) [2024-06-27 22:51:18,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1536229376. Throughput: 0: 44301.0. Samples: 1439172020. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2024-06-27 22:51:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:51:21,522][06909] Updated weights for policy 0, policy_version 93773 (0.0026) [2024-06-27 22:51:23,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1536458752. Throughput: 0: 44197.6. Samples: 1439432280. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2024-06-27 22:51:23,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:51:25,815][06909] Updated weights for policy 0, policy_version 93783 (0.0042) [2024-06-27 22:51:28,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 1536688128. Throughput: 0: 44284.4. Samples: 1439567800. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2024-06-27 22:51:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:51:28,906][06909] Updated weights for policy 0, policy_version 93793 (0.0031) [2024-06-27 22:51:33,187][06909] Updated weights for policy 0, policy_version 93803 (0.0043) [2024-06-27 22:51:33,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44510.0, 300 sec: 44042.4). Total num frames: 1536901120. Throughput: 0: 44313.8. Samples: 1439831840. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2024-06-27 22:51:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:51:36,437][06909] Updated weights for policy 0, policy_version 93813 (0.0028) [2024-06-27 22:51:38,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44238.4, 300 sec: 44098.3). Total num frames: 1537130496. Throughput: 0: 44287.0. Samples: 1440100280. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2024-06-27 22:51:38,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:51:40,355][06909] Updated weights for policy 0, policy_version 93823 (0.0024) [2024-06-27 22:51:43,754][06909] Updated weights for policy 0, policy_version 93833 (0.0028) [2024-06-27 22:51:43,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.6, 300 sec: 44153.5). Total num frames: 1537359872. Throughput: 0: 44137.2. Samples: 1440230520. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2024-06-27 22:51:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:51:48,122][06909] Updated weights for policy 0, policy_version 93843 (0.0031) [2024-06-27 22:51:48,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1537540096. Throughput: 0: 44115.6. Samples: 1440491040. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2024-06-27 22:51:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:51:48,896][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000093845_1537556480.pth... [2024-06-27 22:51:48,968][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000093200_1526988800.pth [2024-06-27 22:51:51,039][06909] Updated weights for policy 0, policy_version 93853 (0.0030) [2024-06-27 22:51:53,850][06674] Fps is (10 sec: 42599.0, 60 sec: 44236.7, 300 sec: 44098.0). Total num frames: 1537785856. Throughput: 0: 44114.3. Samples: 1440756220. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2024-06-27 22:51:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:51:55,341][06909] Updated weights for policy 0, policy_version 93863 (0.0039) [2024-06-27 22:51:58,745][06909] Updated weights for policy 0, policy_version 93873 (0.0033) [2024-06-27 22:51:58,850][06674] Fps is (10 sec: 47513.0, 60 sec: 43963.6, 300 sec: 44153.5). Total num frames: 1538015232. Throughput: 0: 43931.0. Samples: 1440887780. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2024-06-27 22:51:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:52:02,838][06909] Updated weights for policy 0, policy_version 93883 (0.0032) [2024-06-27 22:52:03,266][06887] Signal inference workers to stop experience collection... (20600 times) [2024-06-27 22:52:03,266][06887] Signal inference workers to resume experience collection... (20600 times) [2024-06-27 22:52:03,284][06909] InferenceWorker_p0-w0: stopping experience collection (20600 times) [2024-06-27 22:52:03,284][06909] InferenceWorker_p0-w0: resuming experience collection (20600 times) [2024-06-27 22:52:03,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1538211840. Throughput: 0: 44017.8. Samples: 1441152820. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2024-06-27 22:52:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:52:06,062][06909] Updated weights for policy 0, policy_version 93893 (0.0028) [2024-06-27 22:52:08,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 1538457600. Throughput: 0: 44076.4. Samples: 1441415720. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 22:52:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:52:10,158][06909] Updated weights for policy 0, policy_version 93903 (0.0026) [2024-06-27 22:52:13,507][06909] Updated weights for policy 0, policy_version 93913 (0.0034) [2024-06-27 22:52:13,850][06674] Fps is (10 sec: 47513.8, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1538686976. Throughput: 0: 44091.2. Samples: 1441551900. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 22:52:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:52:17,600][06909] Updated weights for policy 0, policy_version 93923 (0.0031) [2024-06-27 22:52:18,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 1538899968. Throughput: 0: 44090.3. Samples: 1441815900. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 22:52:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:52:21,057][06909] Updated weights for policy 0, policy_version 93933 (0.0031) [2024-06-27 22:52:23,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 1539129344. Throughput: 0: 43887.1. Samples: 1442075200. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 22:52:23,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:52:25,025][06909] Updated weights for policy 0, policy_version 93943 (0.0032) [2024-06-27 22:52:28,330][06909] Updated weights for policy 0, policy_version 93953 (0.0041) [2024-06-27 22:52:28,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44509.8, 300 sec: 44209.0). Total num frames: 1539358720. Throughput: 0: 44045.8. Samples: 1442212580. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 22:52:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:52:32,311][06909] Updated weights for policy 0, policy_version 93963 (0.0030) [2024-06-27 22:52:33,850][06674] Fps is (10 sec: 42598.1, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 1539555328. Throughput: 0: 44059.5. Samples: 1442473720. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 22:52:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:52:35,863][06909] Updated weights for policy 0, policy_version 93973 (0.0031) [2024-06-27 22:52:38,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44509.8, 300 sec: 44264.6). Total num frames: 1539801088. Throughput: 0: 43973.3. Samples: 1442735020. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 22:52:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-27 22:52:39,801][06909] Updated weights for policy 0, policy_version 93983 (0.0031) [2024-06-27 22:52:43,140][06909] Updated weights for policy 0, policy_version 93993 (0.0024) [2024-06-27 22:52:43,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 1539997696. Throughput: 0: 44116.0. Samples: 1442873000. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 22:52:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:52:46,976][06909] Updated weights for policy 0, policy_version 94003 (0.0026) [2024-06-27 22:52:48,854][06674] Fps is (10 sec: 42580.6, 60 sec: 44779.8, 300 sec: 44208.4). Total num frames: 1540227072. Throughput: 0: 44235.4. Samples: 1443143600. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 22:52:48,854][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:52:50,449][06909] Updated weights for policy 0, policy_version 94013 (0.0025) [2024-06-27 22:52:53,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44509.8, 300 sec: 44097.9). Total num frames: 1540456448. Throughput: 0: 44297.7. Samples: 1443409120. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 22:52:53,854][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:52:54,496][06909] Updated weights for policy 0, policy_version 94023 (0.0040) [2024-06-27 22:52:58,410][06909] Updated weights for policy 0, policy_version 94033 (0.0025) [2024-06-27 22:52:58,850][06674] Fps is (10 sec: 42616.6, 60 sec: 43963.9, 300 sec: 44098.0). Total num frames: 1540653056. Throughput: 0: 44245.8. Samples: 1443542960. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 22:52:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:53:01,831][06909] Updated weights for policy 0, policy_version 94043 (0.0033) [2024-06-27 22:53:03,850][06674] Fps is (10 sec: 42598.9, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 1540882432. Throughput: 0: 44244.9. Samples: 1443806920. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-27 22:53:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:53:05,602][06909] Updated weights for policy 0, policy_version 94053 (0.0023) [2024-06-27 22:53:08,851][06674] Fps is (10 sec: 45869.9, 60 sec: 44236.0, 300 sec: 44153.3). Total num frames: 1541111808. Throughput: 0: 44272.3. Samples: 1444067500. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-27 22:53:08,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:53:09,018][06909] Updated weights for policy 0, policy_version 94063 (0.0036) [2024-06-27 22:53:13,331][06909] Updated weights for policy 0, policy_version 94073 (0.0033) [2024-06-27 22:53:13,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 1541308416. Throughput: 0: 44181.4. Samples: 1444200740. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-27 22:53:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:53:14,971][06887] Signal inference workers to stop experience collection... (20650 times) [2024-06-27 22:53:14,972][06887] Signal inference workers to resume experience collection... (20650 times) [2024-06-27 22:53:14,986][06909] InferenceWorker_p0-w0: stopping experience collection (20650 times) [2024-06-27 22:53:14,986][06909] InferenceWorker_p0-w0: resuming experience collection (20650 times) [2024-06-27 22:53:16,558][06909] Updated weights for policy 0, policy_version 94083 (0.0039) [2024-06-27 22:53:18,850][06674] Fps is (10 sec: 44241.0, 60 sec: 44236.7, 300 sec: 44209.0). Total num frames: 1541554176. Throughput: 0: 44319.5. Samples: 1444468100. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-27 22:53:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:53:20,536][06909] Updated weights for policy 0, policy_version 94093 (0.0031) [2024-06-27 22:53:23,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 1541767168. Throughput: 0: 44405.7. Samples: 1444733280. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-27 22:53:23,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 22:53:24,025][06909] Updated weights for policy 0, policy_version 94103 (0.0034) [2024-06-27 22:53:28,072][06909] Updated weights for policy 0, policy_version 94113 (0.0032) [2024-06-27 22:53:28,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 1541980160. Throughput: 0: 44405.4. Samples: 1444871240. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-27 22:53:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:53:31,222][06909] Updated weights for policy 0, policy_version 94123 (0.0031) [2024-06-27 22:53:33,852][06674] Fps is (10 sec: 45866.1, 60 sec: 44508.4, 300 sec: 44153.5). Total num frames: 1542225920. Throughput: 0: 44248.8. Samples: 1445134700. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-27 22:53:33,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:53:35,327][06909] Updated weights for policy 0, policy_version 94133 (0.0025) [2024-06-27 22:53:38,557][06909] Updated weights for policy 0, policy_version 94143 (0.0024) [2024-06-27 22:53:38,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 1542438912. Throughput: 0: 44343.6. Samples: 1445404580. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-27 22:53:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 22:53:42,485][06909] Updated weights for policy 0, policy_version 94153 (0.0030) [2024-06-27 22:53:43,850][06674] Fps is (10 sec: 42606.7, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 1542651904. Throughput: 0: 44243.4. Samples: 1445533920. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-27 22:53:43,853][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:53:45,877][06909] Updated weights for policy 0, policy_version 94163 (0.0033) [2024-06-27 22:53:48,852][06674] Fps is (10 sec: 44227.6, 60 sec: 44238.4, 300 sec: 44153.2). Total num frames: 1542881280. Throughput: 0: 44234.4. Samples: 1445797560. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-27 22:53:48,852][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 22:53:48,868][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000094170_1542881280.pth... [2024-06-27 22:53:48,921][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000093522_1532264448.pth [2024-06-27 22:53:50,224][06909] Updated weights for policy 0, policy_version 94173 (0.0037) [2024-06-27 22:53:53,369][06909] Updated weights for policy 0, policy_version 94183 (0.0041) [2024-06-27 22:53:53,850][06674] Fps is (10 sec: 45876.0, 60 sec: 44236.9, 300 sec: 44209.0). Total num frames: 1543110656. Throughput: 0: 44341.6. Samples: 1446062820. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-27 22:53:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:53:57,635][06909] Updated weights for policy 0, policy_version 94193 (0.0040) [2024-06-27 22:53:58,850][06674] Fps is (10 sec: 44246.4, 60 sec: 44509.9, 300 sec: 44154.4). Total num frames: 1543323648. Throughput: 0: 44453.4. Samples: 1446201140. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-27 22:53:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:54:00,910][06909] Updated weights for policy 0, policy_version 94203 (0.0039) [2024-06-27 22:54:03,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44509.8, 300 sec: 44209.0). Total num frames: 1543553024. Throughput: 0: 44406.3. Samples: 1446466380. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-27 22:54:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:54:05,459][06909] Updated weights for policy 0, policy_version 94213 (0.0035) [2024-06-27 22:54:08,294][06909] Updated weights for policy 0, policy_version 94223 (0.0031) [2024-06-27 22:54:08,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44510.7, 300 sec: 44154.4). Total num frames: 1543782400. Throughput: 0: 44269.8. Samples: 1446725420. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-27 22:54:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:54:12,758][06909] Updated weights for policy 0, policy_version 94233 (0.0023) [2024-06-27 22:54:13,850][06674] Fps is (10 sec: 42598.8, 60 sec: 44509.9, 300 sec: 44153.8). Total num frames: 1543979008. Throughput: 0: 44139.2. Samples: 1446857500. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-27 22:54:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:54:15,690][06909] Updated weights for policy 0, policy_version 94243 (0.0043) [2024-06-27 22:54:18,850][06674] Fps is (10 sec: 42597.8, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1544208384. Throughput: 0: 44133.4. Samples: 1447120620. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-27 22:54:18,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:54:20,005][06909] Updated weights for policy 0, policy_version 94253 (0.0041) [2024-06-27 22:54:22,990][06909] Updated weights for policy 0, policy_version 94263 (0.0040) [2024-06-27 22:54:23,850][06674] Fps is (10 sec: 45874.5, 60 sec: 44509.8, 300 sec: 44209.0). Total num frames: 1544437760. Throughput: 0: 44058.1. Samples: 1447387200. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-27 22:54:23,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:54:27,906][06909] Updated weights for policy 0, policy_version 94273 (0.0037) [2024-06-27 22:54:28,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44782.8, 300 sec: 44264.6). Total num frames: 1544667136. Throughput: 0: 44167.1. Samples: 1447521440. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-27 22:54:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:54:30,627][06909] Updated weights for policy 0, policy_version 94283 (0.0028) [2024-06-27 22:54:31,490][06887] Signal inference workers to stop experience collection... (20700 times) [2024-06-27 22:54:31,492][06887] Signal inference workers to resume experience collection... (20700 times) [2024-06-27 22:54:31,522][06909] InferenceWorker_p0-w0: stopping experience collection (20700 times) [2024-06-27 22:54:31,522][06909] InferenceWorker_p0-w0: resuming experience collection (20700 times) [2024-06-27 22:54:33,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43965.3, 300 sec: 44153.5). Total num frames: 1544863744. Throughput: 0: 44113.2. Samples: 1447782560. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-27 22:54:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:54:35,056][06909] Updated weights for policy 0, policy_version 94293 (0.0030) [2024-06-27 22:54:38,032][06909] Updated weights for policy 0, policy_version 94303 (0.0035) [2024-06-27 22:54:38,850][06674] Fps is (10 sec: 44237.4, 60 sec: 44509.9, 300 sec: 44209.0). Total num frames: 1545109504. Throughput: 0: 44196.8. Samples: 1448051680. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-27 22:54:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:54:42,210][06909] Updated weights for policy 0, policy_version 94313 (0.0032) [2024-06-27 22:54:43,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.9, 300 sec: 44209.1). Total num frames: 1545306112. Throughput: 0: 44036.8. Samples: 1448182800. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-27 22:54:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:54:45,400][06909] Updated weights for policy 0, policy_version 94323 (0.0035) [2024-06-27 22:54:48,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44238.4, 300 sec: 44209.0). Total num frames: 1545535488. Throughput: 0: 44116.0. Samples: 1448451600. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-27 22:54:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:54:49,773][06909] Updated weights for policy 0, policy_version 94333 (0.0032) [2024-06-27 22:54:52,920][06909] Updated weights for policy 0, policy_version 94343 (0.0039) [2024-06-27 22:54:53,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 1545748480. Throughput: 0: 44093.4. Samples: 1448709620. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-27 22:54:53,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-27 22:54:57,039][06909] Updated weights for policy 0, policy_version 94353 (0.0023) [2024-06-27 22:54:58,852][06674] Fps is (10 sec: 44226.5, 60 sec: 44235.0, 300 sec: 44264.2). Total num frames: 1545977856. Throughput: 0: 44181.2. Samples: 1448845760. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-27 22:54:58,853][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:55:00,184][06909] Updated weights for policy 0, policy_version 94363 (0.0028) [2024-06-27 22:55:03,850][06674] Fps is (10 sec: 42597.7, 60 sec: 43690.6, 300 sec: 44153.5). Total num frames: 1546174464. Throughput: 0: 44057.3. Samples: 1449103200. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-27 22:55:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:55:04,996][06909] Updated weights for policy 0, policy_version 94373 (0.0027) [2024-06-27 22:55:08,059][06909] Updated weights for policy 0, policy_version 94383 (0.0024) [2024-06-27 22:55:08,850][06674] Fps is (10 sec: 44246.5, 60 sec: 43963.6, 300 sec: 44209.0). Total num frames: 1546420224. Throughput: 0: 43935.1. Samples: 1449364280. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-27 22:55:08,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:55:12,289][06909] Updated weights for policy 0, policy_version 94393 (0.0041) [2024-06-27 22:55:13,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 1546616832. Throughput: 0: 43887.7. Samples: 1449496380. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-27 22:55:13,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 22:55:15,312][06909] Updated weights for policy 0, policy_version 94403 (0.0029) [2024-06-27 22:55:18,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 1546846208. Throughput: 0: 43878.6. Samples: 1449757100. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-27 22:55:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:55:19,867][06909] Updated weights for policy 0, policy_version 94413 (0.0042) [2024-06-27 22:55:22,882][06909] Updated weights for policy 0, policy_version 94423 (0.0028) [2024-06-27 22:55:23,850][06674] Fps is (10 sec: 47513.7, 60 sec: 44236.9, 300 sec: 44209.1). Total num frames: 1547091968. Throughput: 0: 43800.9. Samples: 1450022720. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-27 22:55:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:55:27,105][06909] Updated weights for policy 0, policy_version 94433 (0.0032) [2024-06-27 22:55:28,852][06674] Fps is (10 sec: 44227.6, 60 sec: 43689.2, 300 sec: 44264.3). Total num frames: 1547288576. Throughput: 0: 43838.4. Samples: 1450155620. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-27 22:55:28,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:55:30,314][06909] Updated weights for policy 0, policy_version 94443 (0.0030) [2024-06-27 22:55:33,852][06674] Fps is (10 sec: 40951.4, 60 sec: 43962.2, 300 sec: 44153.5). Total num frames: 1547501568. Throughput: 0: 43816.2. Samples: 1450423420. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-27 22:55:33,852][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:55:34,412][06909] Updated weights for policy 0, policy_version 94453 (0.0033) [2024-06-27 22:55:37,599][06909] Updated weights for policy 0, policy_version 94463 (0.0030) [2024-06-27 22:55:38,850][06674] Fps is (10 sec: 44245.8, 60 sec: 43690.6, 300 sec: 44097.9). Total num frames: 1547730944. Throughput: 0: 43847.5. Samples: 1450682760. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-27 22:55:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:55:42,025][06909] Updated weights for policy 0, policy_version 94473 (0.0025) [2024-06-27 22:55:42,656][06887] Signal inference workers to stop experience collection... (20750 times) [2024-06-27 22:55:42,657][06887] Signal inference workers to resume experience collection... (20750 times) [2024-06-27 22:55:42,672][06909] InferenceWorker_p0-w0: stopping experience collection (20750 times) [2024-06-27 22:55:42,672][06909] InferenceWorker_p0-w0: resuming experience collection (20750 times) [2024-06-27 22:55:43,850][06674] Fps is (10 sec: 42606.9, 60 sec: 43690.6, 300 sec: 44153.5). Total num frames: 1547927552. Throughput: 0: 43829.7. Samples: 1450818000. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-27 22:55:43,853][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:55:45,046][06909] Updated weights for policy 0, policy_version 94483 (0.0032) [2024-06-27 22:55:48,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43690.6, 300 sec: 44153.5). Total num frames: 1548156928. Throughput: 0: 43981.8. Samples: 1451082380. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-27 22:55:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:55:48,864][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000094492_1548156928.pth... [2024-06-27 22:55:48,919][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000093845_1537556480.pth [2024-06-27 22:55:49,422][06909] Updated weights for policy 0, policy_version 94493 (0.0031) [2024-06-27 22:55:52,660][06909] Updated weights for policy 0, policy_version 94503 (0.0029) [2024-06-27 22:55:53,850][06674] Fps is (10 sec: 47514.0, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1548402688. Throughput: 0: 43833.5. Samples: 1451336780. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-27 22:55:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:55:56,892][06909] Updated weights for policy 0, policy_version 94513 (0.0023) [2024-06-27 22:55:58,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43692.3, 300 sec: 44098.0). Total num frames: 1548599296. Throughput: 0: 44004.8. Samples: 1451476600. Policy #0 lag: (min: 1.0, avg: 11.1, max: 21.0) [2024-06-27 22:55:58,853][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:56:00,055][06909] Updated weights for policy 0, policy_version 94523 (0.0036) [2024-06-27 22:56:03,850][06674] Fps is (10 sec: 40959.3, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 1548812288. Throughput: 0: 44135.0. Samples: 1451743180. Policy #0 lag: (min: 1.0, avg: 11.1, max: 21.0) [2024-06-27 22:56:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 22:56:04,208][06909] Updated weights for policy 0, policy_version 94533 (0.0038) [2024-06-27 22:56:07,462][06909] Updated weights for policy 0, policy_version 94543 (0.0023) [2024-06-27 22:56:08,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 1549058048. Throughput: 0: 44080.8. Samples: 1452006360. Policy #0 lag: (min: 1.0, avg: 11.1, max: 21.0) [2024-06-27 22:56:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 22:56:11,916][06909] Updated weights for policy 0, policy_version 94553 (0.0021) [2024-06-27 22:56:13,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 1549254656. Throughput: 0: 44094.1. Samples: 1452139760. Policy #0 lag: (min: 1.0, avg: 11.1, max: 21.0) [2024-06-27 22:56:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:56:14,792][06909] Updated weights for policy 0, policy_version 94563 (0.0036) [2024-06-27 22:56:18,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.6, 300 sec: 44097.9). Total num frames: 1549467648. Throughput: 0: 44053.5. Samples: 1452405740. Policy #0 lag: (min: 1.0, avg: 11.1, max: 21.0) [2024-06-27 22:56:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:56:19,046][06909] Updated weights for policy 0, policy_version 94573 (0.0038) [2024-06-27 22:56:22,182][06909] Updated weights for policy 0, policy_version 94583 (0.0033) [2024-06-27 22:56:23,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43690.6, 300 sec: 44153.5). Total num frames: 1549713408. Throughput: 0: 44004.4. Samples: 1452662960. Policy #0 lag: (min: 1.0, avg: 11.1, max: 21.0) [2024-06-27 22:56:23,854][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:56:26,482][06909] Updated weights for policy 0, policy_version 94593 (0.0028) [2024-06-27 22:56:28,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43692.2, 300 sec: 44098.0). Total num frames: 1549910016. Throughput: 0: 44067.6. Samples: 1452801040. Policy #0 lag: (min: 1.0, avg: 11.1, max: 21.0) [2024-06-27 22:56:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:56:29,794][06909] Updated weights for policy 0, policy_version 94603 (0.0027) [2024-06-27 22:56:33,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43692.1, 300 sec: 44042.4). Total num frames: 1550123008. Throughput: 0: 44133.4. Samples: 1453068380. Policy #0 lag: (min: 1.0, avg: 11.1, max: 21.0) [2024-06-27 22:56:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:56:34,186][06909] Updated weights for policy 0, policy_version 94613 (0.0035) [2024-06-27 22:56:37,146][06909] Updated weights for policy 0, policy_version 94623 (0.0034) [2024-06-27 22:56:38,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 1550368768. Throughput: 0: 44156.8. Samples: 1453323840. Policy #0 lag: (min: 1.0, avg: 11.1, max: 21.0) [2024-06-27 22:56:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 22:56:41,607][06909] Updated weights for policy 0, policy_version 94633 (0.0027) [2024-06-27 22:56:43,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 1550581760. Throughput: 0: 44200.9. Samples: 1453465640. Policy #0 lag: (min: 1.0, avg: 11.1, max: 21.0) [2024-06-27 22:56:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:56:44,631][06909] Updated weights for policy 0, policy_version 94643 (0.0031) [2024-06-27 22:56:48,653][06909] Updated weights for policy 0, policy_version 94653 (0.0034) [2024-06-27 22:56:48,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 1550794752. Throughput: 0: 44115.7. Samples: 1453728380. Policy #0 lag: (min: 1.0, avg: 11.1, max: 21.0) [2024-06-27 22:56:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:56:48,879][06887] Signal inference workers to stop experience collection... (20800 times) [2024-06-27 22:56:48,880][06887] Signal inference workers to resume experience collection... (20800 times) [2024-06-27 22:56:48,901][06909] InferenceWorker_p0-w0: stopping experience collection (20800 times) [2024-06-27 22:56:48,901][06909] InferenceWorker_p0-w0: resuming experience collection (20800 times) [2024-06-27 22:56:51,737][06909] Updated weights for policy 0, policy_version 94663 (0.0040) [2024-06-27 22:56:53,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 1551040512. Throughput: 0: 44063.5. Samples: 1453989220. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 22:56:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 22:56:56,022][06909] Updated weights for policy 0, policy_version 94673 (0.0036) [2024-06-27 22:56:58,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 1551253504. Throughput: 0: 44132.8. Samples: 1454125740. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 22:56:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:56:59,414][06909] Updated weights for policy 0, policy_version 94683 (0.0022) [2024-06-27 22:57:03,553][06909] Updated weights for policy 0, policy_version 94693 (0.0024) [2024-06-27 22:57:03,850][06674] Fps is (10 sec: 42598.0, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 1551466496. Throughput: 0: 44091.5. Samples: 1454389860. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 22:57:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:57:06,889][06909] Updated weights for policy 0, policy_version 94703 (0.0043) [2024-06-27 22:57:08,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 1551679488. Throughput: 0: 43915.6. Samples: 1454639160. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 22:57:08,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:57:11,433][06909] Updated weights for policy 0, policy_version 94713 (0.0033) [2024-06-27 22:57:13,850][06674] Fps is (10 sec: 44237.3, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1551908864. Throughput: 0: 44033.8. Samples: 1454782560. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 22:57:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:57:14,507][06909] Updated weights for policy 0, policy_version 94723 (0.0028) [2024-06-27 22:57:18,687][06909] Updated weights for policy 0, policy_version 94733 (0.0028) [2024-06-27 22:57:18,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43963.6, 300 sec: 43986.8). Total num frames: 1552105472. Throughput: 0: 43936.3. Samples: 1455045520. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 22:57:18,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:57:22,090][06909] Updated weights for policy 0, policy_version 94743 (0.0025) [2024-06-27 22:57:23,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.8, 300 sec: 43986.9). Total num frames: 1552334848. Throughput: 0: 43927.7. Samples: 1455300580. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 22:57:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:57:26,183][06909] Updated weights for policy 0, policy_version 94753 (0.0028) [2024-06-27 22:57:28,850][06674] Fps is (10 sec: 47513.8, 60 sec: 44509.7, 300 sec: 44153.5). Total num frames: 1552580608. Throughput: 0: 43914.2. Samples: 1455441780. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 22:57:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:57:29,240][06909] Updated weights for policy 0, policy_version 94763 (0.0034) [2024-06-27 22:57:33,548][06909] Updated weights for policy 0, policy_version 94773 (0.0036) [2024-06-27 22:57:33,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1552777216. Throughput: 0: 43971.5. Samples: 1455707100. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 22:57:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:57:36,510][06909] Updated weights for policy 0, policy_version 94783 (0.0039) [2024-06-27 22:57:38,850][06674] Fps is (10 sec: 40960.6, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1552990208. Throughput: 0: 43877.8. Samples: 1455963720. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 22:57:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:57:41,208][06909] Updated weights for policy 0, policy_version 94793 (0.0023) [2024-06-27 22:57:43,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.7, 300 sec: 44043.0). Total num frames: 1553219584. Throughput: 0: 43864.4. Samples: 1456099640. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 22:57:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:57:44,175][06909] Updated weights for policy 0, policy_version 94803 (0.0041) [2024-06-27 22:57:48,654][06909] Updated weights for policy 0, policy_version 94813 (0.0025) [2024-06-27 22:57:48,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 1553416192. Throughput: 0: 43818.7. Samples: 1456361700. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 22:57:48,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:57:48,987][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000094814_1553432576.pth... [2024-06-27 22:57:49,046][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000094170_1542881280.pth [2024-06-27 22:57:52,087][06909] Updated weights for policy 0, policy_version 94823 (0.0028) [2024-06-27 22:57:53,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43417.6, 300 sec: 44042.4). Total num frames: 1553645568. Throughput: 0: 43990.7. Samples: 1456618740. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-27 22:57:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:57:56,125][06909] Updated weights for policy 0, policy_version 94833 (0.0020) [2024-06-27 22:57:58,447][06887] Signal inference workers to stop experience collection... (20850 times) [2024-06-27 22:57:58,447][06887] Signal inference workers to resume experience collection... (20850 times) [2024-06-27 22:57:58,471][06909] InferenceWorker_p0-w0: stopping experience collection (20850 times) [2024-06-27 22:57:58,471][06909] InferenceWorker_p0-w0: resuming experience collection (20850 times) [2024-06-27 22:57:58,850][06674] Fps is (10 sec: 47513.6, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 1553891328. Throughput: 0: 43875.0. Samples: 1456756940. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-27 22:57:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 22:57:59,213][06909] Updated weights for policy 0, policy_version 94843 (0.0036) [2024-06-27 22:58:03,283][06909] Updated weights for policy 0, policy_version 94853 (0.0035) [2024-06-27 22:58:03,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.7, 300 sec: 43987.0). Total num frames: 1554087936. Throughput: 0: 43936.5. Samples: 1457022660. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-27 22:58:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:58:06,473][06909] Updated weights for policy 0, policy_version 94863 (0.0039) [2024-06-27 22:58:08,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 1554317312. Throughput: 0: 44067.1. Samples: 1457283600. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-27 22:58:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:58:10,682][06909] Updated weights for policy 0, policy_version 94873 (0.0024) [2024-06-27 22:58:13,627][06909] Updated weights for policy 0, policy_version 94883 (0.0030) [2024-06-27 22:58:13,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44236.7, 300 sec: 44098.0). Total num frames: 1554563072. Throughput: 0: 43923.1. Samples: 1457418320. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-27 22:58:13,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 22:58:18,071][06909] Updated weights for policy 0, policy_version 94893 (0.0025) [2024-06-27 22:58:18,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.9, 300 sec: 43986.9). Total num frames: 1554743296. Throughput: 0: 44076.1. Samples: 1457690520. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-27 22:58:18,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 22:58:21,198][06909] Updated weights for policy 0, policy_version 94903 (0.0031) [2024-06-27 22:58:23,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1554972672. Throughput: 0: 44116.8. Samples: 1457948980. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-27 22:58:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-27 22:58:26,029][06909] Updated weights for policy 0, policy_version 94913 (0.0044) [2024-06-27 22:58:28,850][06674] Fps is (10 sec: 45874.3, 60 sec: 43690.7, 300 sec: 43987.2). Total num frames: 1555202048. Throughput: 0: 43926.7. Samples: 1458076340. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-27 22:58:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:58:29,234][06909] Updated weights for policy 0, policy_version 94923 (0.0032) [2024-06-27 22:58:33,223][06909] Updated weights for policy 0, policy_version 94933 (0.0039) [2024-06-27 22:58:33,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1555431424. Throughput: 0: 44094.3. Samples: 1458345940. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-27 22:58:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:58:36,427][06909] Updated weights for policy 0, policy_version 94943 (0.0028) [2024-06-27 22:58:38,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 1555644416. Throughput: 0: 44195.5. Samples: 1458607540. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-27 22:58:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 22:58:40,437][06909] Updated weights for policy 0, policy_version 94953 (0.0034) [2024-06-27 22:58:43,566][06909] Updated weights for policy 0, policy_version 94963 (0.0020) [2024-06-27 22:58:43,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.9, 300 sec: 44042.7). Total num frames: 1555873792. Throughput: 0: 44140.9. Samples: 1458743280. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-27 22:58:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:58:47,710][06909] Updated weights for policy 0, policy_version 94973 (0.0033) [2024-06-27 22:58:48,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44509.9, 300 sec: 43986.9). Total num frames: 1556086784. Throughput: 0: 44332.1. Samples: 1459017600. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-27 22:58:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:58:50,913][06909] Updated weights for policy 0, policy_version 94983 (0.0038) [2024-06-27 22:58:53,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 1556283392. Throughput: 0: 44090.6. Samples: 1459267680. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-27 22:58:53,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 22:58:55,524][06909] Updated weights for policy 0, policy_version 94993 (0.0033) [2024-06-27 22:58:58,522][06909] Updated weights for policy 0, policy_version 95003 (0.0035) [2024-06-27 22:58:58,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1556545536. Throughput: 0: 43929.3. Samples: 1459395140. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-27 22:58:58,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 22:59:03,019][06909] Updated weights for policy 0, policy_version 95013 (0.0027) [2024-06-27 22:59:03,850][06674] Fps is (10 sec: 47514.2, 60 sec: 44510.0, 300 sec: 43986.9). Total num frames: 1556758528. Throughput: 0: 43915.5. Samples: 1459666720. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-27 22:59:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:59:06,352][06909] Updated weights for policy 0, policy_version 95023 (0.0040) [2024-06-27 22:59:08,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1556955136. Throughput: 0: 43866.7. Samples: 1459922980. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-27 22:59:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 22:59:10,052][06887] Signal inference workers to stop experience collection... (20900 times) [2024-06-27 22:59:10,102][06887] Signal inference workers to resume experience collection... (20900 times) [2024-06-27 22:59:10,108][06909] InferenceWorker_p0-w0: stopping experience collection (20900 times) [2024-06-27 22:59:10,119][06909] InferenceWorker_p0-w0: resuming experience collection (20900 times) [2024-06-27 22:59:10,452][06909] Updated weights for policy 0, policy_version 95033 (0.0024) [2024-06-27 22:59:13,692][06909] Updated weights for policy 0, policy_version 95043 (0.0027) [2024-06-27 22:59:13,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1557184512. Throughput: 0: 43926.3. Samples: 1460053020. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-27 22:59:13,854][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:59:17,706][06909] Updated weights for policy 0, policy_version 95053 (0.0043) [2024-06-27 22:59:18,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.7, 300 sec: 43931.3). Total num frames: 1557397504. Throughput: 0: 44044.0. Samples: 1460327920. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-27 22:59:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 22:59:20,886][06909] Updated weights for policy 0, policy_version 95063 (0.0035) [2024-06-27 22:59:23,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 1557594112. Throughput: 0: 43969.9. Samples: 1460586180. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-27 22:59:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 22:59:25,076][06909] Updated weights for policy 0, policy_version 95073 (0.0035) [2024-06-27 22:59:28,191][06909] Updated weights for policy 0, policy_version 95083 (0.0035) [2024-06-27 22:59:28,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.7, 300 sec: 43986.8). Total num frames: 1557839872. Throughput: 0: 43727.0. Samples: 1460711000. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-27 22:59:28,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:59:32,788][06909] Updated weights for policy 0, policy_version 95093 (0.0042) [2024-06-27 22:59:33,850][06674] Fps is (10 sec: 47513.0, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 1558069248. Throughput: 0: 43513.7. Samples: 1460975720. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-27 22:59:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:59:36,134][06909] Updated weights for policy 0, policy_version 95103 (0.0037) [2024-06-27 22:59:38,850][06674] Fps is (10 sec: 40960.6, 60 sec: 43417.7, 300 sec: 43875.8). Total num frames: 1558249472. Throughput: 0: 43801.4. Samples: 1461238740. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-27 22:59:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 22:59:40,399][06909] Updated weights for policy 0, policy_version 95113 (0.0044) [2024-06-27 22:59:43,805][06909] Updated weights for policy 0, policy_version 95123 (0.0038) [2024-06-27 22:59:43,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 1558495232. Throughput: 0: 43829.4. Samples: 1461367460. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-27 22:59:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:59:47,686][06909] Updated weights for policy 0, policy_version 95133 (0.0026) [2024-06-27 22:59:48,850][06674] Fps is (10 sec: 47513.3, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1558724608. Throughput: 0: 43930.5. Samples: 1461643600. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-27 22:59:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 22:59:48,862][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000095137_1558724608.pth... [2024-06-27 22:59:48,911][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000094492_1548156928.pth [2024-06-27 22:59:50,916][06909] Updated weights for policy 0, policy_version 95143 (0.0037) [2024-06-27 22:59:53,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.8, 300 sec: 43876.1). Total num frames: 1558921216. Throughput: 0: 44127.5. Samples: 1461908720. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-27 22:59:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 22:59:54,905][06909] Updated weights for policy 0, policy_version 95153 (0.0044) [2024-06-27 22:59:58,064][06909] Updated weights for policy 0, policy_version 95163 (0.0029) [2024-06-27 22:59:58,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43417.7, 300 sec: 43986.9). Total num frames: 1559150592. Throughput: 0: 44140.1. Samples: 1462039320. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-27 22:59:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:00:02,143][06909] Updated weights for policy 0, policy_version 95173 (0.0046) [2024-06-27 23:00:03,850][06674] Fps is (10 sec: 47513.4, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1559396352. Throughput: 0: 43890.7. Samples: 1462303000. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-27 23:00:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:00:05,966][06909] Updated weights for policy 0, policy_version 95183 (0.0044) [2024-06-27 23:00:08,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1559592960. Throughput: 0: 44055.5. Samples: 1462568680. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-27 23:00:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:00:10,108][06909] Updated weights for policy 0, policy_version 95193 (0.0036) [2024-06-27 23:00:10,287][06887] Signal inference workers to stop experience collection... (20950 times) [2024-06-27 23:00:10,328][06909] InferenceWorker_p0-w0: stopping experience collection (20950 times) [2024-06-27 23:00:10,346][06887] Signal inference workers to resume experience collection... (20950 times) [2024-06-27 23:00:10,347][06909] InferenceWorker_p0-w0: resuming experience collection (20950 times) [2024-06-27 23:00:13,407][06909] Updated weights for policy 0, policy_version 95203 (0.0025) [2024-06-27 23:00:13,850][06674] Fps is (10 sec: 44236.2, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 1559838720. Throughput: 0: 44029.7. Samples: 1462692340. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-27 23:00:13,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:00:17,459][06909] Updated weights for policy 0, policy_version 95213 (0.0023) [2024-06-27 23:00:18,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 1560035328. Throughput: 0: 44174.7. Samples: 1462963580. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-27 23:00:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:00:20,819][06909] Updated weights for policy 0, policy_version 95223 (0.0031) [2024-06-27 23:00:23,850][06674] Fps is (10 sec: 40960.8, 60 sec: 44236.8, 300 sec: 43931.6). Total num frames: 1560248320. Throughput: 0: 44100.5. Samples: 1463223260. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-27 23:00:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 23:00:24,841][06909] Updated weights for policy 0, policy_version 95233 (0.0024) [2024-06-27 23:00:28,248][06909] Updated weights for policy 0, policy_version 95243 (0.0027) [2024-06-27 23:00:28,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.7, 300 sec: 43931.6). Total num frames: 1560461312. Throughput: 0: 44100.9. Samples: 1463352000. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-27 23:00:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 23:00:32,060][06909] Updated weights for policy 0, policy_version 95253 (0.0031) [2024-06-27 23:00:33,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1560707072. Throughput: 0: 44037.9. Samples: 1463625300. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-27 23:00:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:00:35,701][06909] Updated weights for policy 0, policy_version 95263 (0.0040) [2024-06-27 23:00:38,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 1560903680. Throughput: 0: 44155.9. Samples: 1463895740. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-27 23:00:38,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:00:39,368][06909] Updated weights for policy 0, policy_version 95273 (0.0039) [2024-06-27 23:00:43,190][06909] Updated weights for policy 0, policy_version 95283 (0.0029) [2024-06-27 23:00:43,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1561133056. Throughput: 0: 44018.2. Samples: 1464020140. Policy #0 lag: (min: 0.0, avg: 10.9, max: 20.0) [2024-06-27 23:00:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:00:46,741][06909] Updated weights for policy 0, policy_version 95293 (0.0046) [2024-06-27 23:00:48,850][06674] Fps is (10 sec: 47513.4, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1561378816. Throughput: 0: 44100.8. Samples: 1464287540. Policy #0 lag: (min: 0.0, avg: 10.9, max: 20.0) [2024-06-27 23:00:48,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-27 23:00:50,417][06909] Updated weights for policy 0, policy_version 95303 (0.0035) [2024-06-27 23:00:53,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 1561559040. Throughput: 0: 43929.4. Samples: 1464545500. Policy #0 lag: (min: 0.0, avg: 10.9, max: 20.0) [2024-06-27 23:00:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:00:54,593][06909] Updated weights for policy 0, policy_version 95313 (0.0028) [2024-06-27 23:00:58,064][06909] Updated weights for policy 0, policy_version 95323 (0.0044) [2024-06-27 23:00:58,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 1561804800. Throughput: 0: 44037.0. Samples: 1464674000. Policy #0 lag: (min: 0.0, avg: 10.9, max: 20.0) [2024-06-27 23:00:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:01:01,926][06909] Updated weights for policy 0, policy_version 95333 (0.0032) [2024-06-27 23:01:03,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 1562017792. Throughput: 0: 43868.6. Samples: 1464937660. Policy #0 lag: (min: 0.0, avg: 10.9, max: 20.0) [2024-06-27 23:01:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:01:05,476][06909] Updated weights for policy 0, policy_version 95343 (0.0027) [2024-06-27 23:01:08,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.7, 300 sec: 43986.8). Total num frames: 1562230784. Throughput: 0: 44087.4. Samples: 1465207200. Policy #0 lag: (min: 0.0, avg: 10.9, max: 20.0) [2024-06-27 23:01:08,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-27 23:01:09,283][06909] Updated weights for policy 0, policy_version 95353 (0.0027) [2024-06-27 23:01:12,894][06909] Updated weights for policy 0, policy_version 95363 (0.0025) [2024-06-27 23:01:13,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43417.8, 300 sec: 43986.9). Total num frames: 1562443776. Throughput: 0: 44072.5. Samples: 1465335260. Policy #0 lag: (min: 0.0, avg: 10.9, max: 20.0) [2024-06-27 23:01:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:01:16,412][06909] Updated weights for policy 0, policy_version 95373 (0.0031) [2024-06-27 23:01:18,850][06674] Fps is (10 sec: 47514.4, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 1562705920. Throughput: 0: 43997.3. Samples: 1465605180. Policy #0 lag: (min: 0.0, avg: 10.9, max: 20.0) [2024-06-27 23:01:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:01:20,034][06909] Updated weights for policy 0, policy_version 95383 (0.0025) [2024-06-27 23:01:23,850][06674] Fps is (10 sec: 45874.6, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 1562902528. Throughput: 0: 43881.3. Samples: 1465870400. Policy #0 lag: (min: 0.0, avg: 10.9, max: 20.0) [2024-06-27 23:01:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:01:24,057][06909] Updated weights for policy 0, policy_version 95393 (0.0033) [2024-06-27 23:01:27,470][06909] Updated weights for policy 0, policy_version 95403 (0.0032) [2024-06-27 23:01:28,850][06674] Fps is (10 sec: 40959.7, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1563115520. Throughput: 0: 43801.3. Samples: 1465991200. Policy #0 lag: (min: 0.0, avg: 10.9, max: 20.0) [2024-06-27 23:01:28,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 23:01:30,696][06887] Signal inference workers to stop experience collection... (21000 times) [2024-06-27 23:01:30,703][06887] Signal inference workers to resume experience collection... (21000 times) [2024-06-27 23:01:30,733][06909] InferenceWorker_p0-w0: stopping experience collection (21000 times) [2024-06-27 23:01:30,740][06909] InferenceWorker_p0-w0: resuming experience collection (21000 times) [2024-06-27 23:01:31,831][06909] Updated weights for policy 0, policy_version 95413 (0.0034) [2024-06-27 23:01:33,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1563344896. Throughput: 0: 43615.7. Samples: 1466250240. Policy #0 lag: (min: 0.0, avg: 10.9, max: 20.0) [2024-06-27 23:01:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 23:01:35,238][06909] Updated weights for policy 0, policy_version 95423 (0.0026) [2024-06-27 23:01:38,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43963.8, 300 sec: 43931.4). Total num frames: 1563541504. Throughput: 0: 44063.2. Samples: 1466528340. Policy #0 lag: (min: 0.0, avg: 10.9, max: 20.0) [2024-06-27 23:01:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:01:39,096][06909] Updated weights for policy 0, policy_version 95433 (0.0032) [2024-06-27 23:01:42,558][06909] Updated weights for policy 0, policy_version 95443 (0.0031) [2024-06-27 23:01:43,852][06674] Fps is (10 sec: 42589.2, 60 sec: 43962.2, 300 sec: 43986.6). Total num frames: 1563770880. Throughput: 0: 43974.5. Samples: 1466652940. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-27 23:01:43,852][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:01:46,458][06909] Updated weights for policy 0, policy_version 95453 (0.0023) [2024-06-27 23:01:48,853][06674] Fps is (10 sec: 47499.1, 60 sec: 43961.6, 300 sec: 43986.4). Total num frames: 1564016640. Throughput: 0: 44057.5. Samples: 1466920380. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-27 23:01:48,853][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:01:48,865][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000095460_1564016640.pth... [2024-06-27 23:01:48,916][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000094814_1553432576.pth [2024-06-27 23:01:50,203][06909] Updated weights for policy 0, policy_version 95463 (0.0027) [2024-06-27 23:01:53,689][06909] Updated weights for policy 0, policy_version 95473 (0.0021) [2024-06-27 23:01:53,850][06674] Fps is (10 sec: 45884.6, 60 sec: 44509.8, 300 sec: 43986.9). Total num frames: 1564229632. Throughput: 0: 44142.4. Samples: 1467193600. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-27 23:01:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:01:57,378][06909] Updated weights for policy 0, policy_version 95483 (0.0032) [2024-06-27 23:01:58,850][06674] Fps is (10 sec: 40972.4, 60 sec: 43690.8, 300 sec: 43931.4). Total num frames: 1564426240. Throughput: 0: 44040.9. Samples: 1467317100. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-27 23:01:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:02:01,347][06909] Updated weights for policy 0, policy_version 95493 (0.0033) [2024-06-27 23:02:03,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1564672000. Throughput: 0: 43795.1. Samples: 1467575960. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-27 23:02:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:02:04,843][06909] Updated weights for policy 0, policy_version 95503 (0.0028) [2024-06-27 23:02:08,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 1564868608. Throughput: 0: 43951.6. Samples: 1467848220. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-27 23:02:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:02:09,082][06909] Updated weights for policy 0, policy_version 95513 (0.0024) [2024-06-27 23:02:12,329][06909] Updated weights for policy 0, policy_version 95523 (0.0032) [2024-06-27 23:02:13,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1565081600. Throughput: 0: 44055.7. Samples: 1467973700. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-27 23:02:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:02:16,459][06909] Updated weights for policy 0, policy_version 95533 (0.0041) [2024-06-27 23:02:18,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 1565327360. Throughput: 0: 44090.5. Samples: 1468234320. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-27 23:02:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:02:19,615][06909] Updated weights for policy 0, policy_version 95543 (0.0031) [2024-06-27 23:02:23,791][06909] Updated weights for policy 0, policy_version 95553 (0.0039) [2024-06-27 23:02:23,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.8, 300 sec: 43931.4). Total num frames: 1565540352. Throughput: 0: 44009.3. Samples: 1468508760. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-27 23:02:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:02:27,354][06909] Updated weights for policy 0, policy_version 95563 (0.0021) [2024-06-27 23:02:28,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1565753344. Throughput: 0: 43943.8. Samples: 1468630320. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-27 23:02:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:02:31,422][06909] Updated weights for policy 0, policy_version 95573 (0.0028) [2024-06-27 23:02:33,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1565999104. Throughput: 0: 43839.9. Samples: 1468893040. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-27 23:02:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:02:34,790][06909] Updated weights for policy 0, policy_version 95583 (0.0032) [2024-06-27 23:02:38,818][06909] Updated weights for policy 0, policy_version 95593 (0.0035) [2024-06-27 23:02:38,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 1566195712. Throughput: 0: 43919.5. Samples: 1469169980. Policy #0 lag: (min: 0.0, avg: 9.2, max: 23.0) [2024-06-27 23:02:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:02:41,920][06909] Updated weights for policy 0, policy_version 95603 (0.0034) [2024-06-27 23:02:43,856][06674] Fps is (10 sec: 40934.2, 60 sec: 43960.7, 300 sec: 44041.5). Total num frames: 1566408704. Throughput: 0: 43819.7. Samples: 1469289260. Policy #0 lag: (min: 0.0, avg: 9.2, max: 23.0) [2024-06-27 23:02:43,857][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:02:46,383][06909] Updated weights for policy 0, policy_version 95613 (0.0035) [2024-06-27 23:02:48,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43965.9, 300 sec: 44098.0). Total num frames: 1566654464. Throughput: 0: 43908.0. Samples: 1469551820. Policy #0 lag: (min: 0.0, avg: 9.2, max: 23.0) [2024-06-27 23:02:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-27 23:02:49,402][06909] Updated weights for policy 0, policy_version 95623 (0.0030) [2024-06-27 23:02:49,954][06887] Signal inference workers to stop experience collection... (21050 times) [2024-06-27 23:02:49,985][06909] InferenceWorker_p0-w0: stopping experience collection (21050 times) [2024-06-27 23:02:50,016][06887] Signal inference workers to resume experience collection... (21050 times) [2024-06-27 23:02:50,019][06909] InferenceWorker_p0-w0: resuming experience collection (21050 times) [2024-06-27 23:02:53,702][06909] Updated weights for policy 0, policy_version 95633 (0.0022) [2024-06-27 23:02:53,850][06674] Fps is (10 sec: 44264.1, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 1566851072. Throughput: 0: 44053.3. Samples: 1469830620. Policy #0 lag: (min: 0.0, avg: 9.2, max: 23.0) [2024-06-27 23:02:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:02:57,389][06909] Updated weights for policy 0, policy_version 95643 (0.0037) [2024-06-27 23:02:58,852][06674] Fps is (10 sec: 42589.3, 60 sec: 44235.2, 300 sec: 44042.1). Total num frames: 1567080448. Throughput: 0: 44042.3. Samples: 1469955700. Policy #0 lag: (min: 0.0, avg: 9.2, max: 23.0) [2024-06-27 23:02:58,852][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:03:00,966][06909] Updated weights for policy 0, policy_version 95653 (0.0029) [2024-06-27 23:03:03,850][06674] Fps is (10 sec: 47513.9, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1567326208. Throughput: 0: 44094.3. Samples: 1470218560. Policy #0 lag: (min: 0.0, avg: 9.2, max: 23.0) [2024-06-27 23:03:03,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 23:03:04,785][06909] Updated weights for policy 0, policy_version 95663 (0.0033) [2024-06-27 23:03:08,578][06909] Updated weights for policy 0, policy_version 95673 (0.0032) [2024-06-27 23:03:08,850][06674] Fps is (10 sec: 42607.5, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 1567506432. Throughput: 0: 44140.5. Samples: 1470495080. Policy #0 lag: (min: 0.0, avg: 9.2, max: 23.0) [2024-06-27 23:03:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:03:11,907][06909] Updated weights for policy 0, policy_version 95683 (0.0034) [2024-06-27 23:03:13,850][06674] Fps is (10 sec: 40959.9, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 1567735808. Throughput: 0: 44261.4. Samples: 1470622080. Policy #0 lag: (min: 0.0, avg: 9.2, max: 23.0) [2024-06-27 23:03:13,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:03:15,973][06909] Updated weights for policy 0, policy_version 95693 (0.0035) [2024-06-27 23:03:18,850][06674] Fps is (10 sec: 47512.9, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 1567981568. Throughput: 0: 44185.1. Samples: 1470881380. Policy #0 lag: (min: 0.0, avg: 9.2, max: 23.0) [2024-06-27 23:03:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:03:19,074][06909] Updated weights for policy 0, policy_version 95703 (0.0030) [2024-06-27 23:03:23,510][06909] Updated weights for policy 0, policy_version 95713 (0.0045) [2024-06-27 23:03:23,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 1568178176. Throughput: 0: 44073.3. Samples: 1471153280. Policy #0 lag: (min: 0.0, avg: 9.2, max: 23.0) [2024-06-27 23:03:23,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:03:26,624][06909] Updated weights for policy 0, policy_version 95723 (0.0022) [2024-06-27 23:03:28,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 1568391168. Throughput: 0: 44222.4. Samples: 1471279000. Policy #0 lag: (min: 0.0, avg: 9.2, max: 23.0) [2024-06-27 23:03:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:03:30,732][06909] Updated weights for policy 0, policy_version 95733 (0.0035) [2024-06-27 23:03:33,850][06674] Fps is (10 sec: 45875.9, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1568636928. Throughput: 0: 44322.7. Samples: 1471546340. Policy #0 lag: (min: 0.0, avg: 9.2, max: 23.0) [2024-06-27 23:03:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:03:33,992][06909] Updated weights for policy 0, policy_version 95743 (0.0039) [2024-06-27 23:03:38,218][06909] Updated weights for policy 0, policy_version 95753 (0.0037) [2024-06-27 23:03:38,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1568849920. Throughput: 0: 44061.8. Samples: 1471813400. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-27 23:03:38,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:03:41,885][06909] Updated weights for policy 0, policy_version 95763 (0.0032) [2024-06-27 23:03:43,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44241.4, 300 sec: 43986.9). Total num frames: 1569062912. Throughput: 0: 44197.2. Samples: 1471944480. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-27 23:03:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:03:45,714][06909] Updated weights for policy 0, policy_version 95773 (0.0037) [2024-06-27 23:03:48,104][06887] Signal inference workers to stop experience collection... (21100 times) [2024-06-27 23:03:48,154][06909] InferenceWorker_p0-w0: stopping experience collection (21100 times) [2024-06-27 23:03:48,160][06887] Signal inference workers to resume experience collection... (21100 times) [2024-06-27 23:03:48,172][06909] InferenceWorker_p0-w0: resuming experience collection (21100 times) [2024-06-27 23:03:48,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 1569292288. Throughput: 0: 44242.7. Samples: 1472209480. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-27 23:03:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 23:03:48,869][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000095782_1569292288.pth... [2024-06-27 23:03:48,937][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000095137_1558724608.pth [2024-06-27 23:03:49,102][06909] Updated weights for policy 0, policy_version 95783 (0.0026) [2024-06-27 23:03:53,051][06909] Updated weights for policy 0, policy_version 95793 (0.0037) [2024-06-27 23:03:53,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44509.9, 300 sec: 43986.9). Total num frames: 1569521664. Throughput: 0: 43918.6. Samples: 1472471420. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-27 23:03:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:03:56,340][06909] Updated weights for policy 0, policy_version 95803 (0.0031) [2024-06-27 23:03:58,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43965.2, 300 sec: 43931.3). Total num frames: 1569718272. Throughput: 0: 43981.3. Samples: 1472601240. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-27 23:03:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:04:00,591][06909] Updated weights for policy 0, policy_version 95813 (0.0028) [2024-06-27 23:04:03,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 1569947648. Throughput: 0: 44127.2. Samples: 1472867100. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-27 23:04:03,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 23:04:03,910][06909] Updated weights for policy 0, policy_version 95823 (0.0040) [2024-06-27 23:04:07,939][06909] Updated weights for policy 0, policy_version 95833 (0.0027) [2024-06-27 23:04:08,856][06674] Fps is (10 sec: 45847.3, 60 sec: 44505.3, 300 sec: 44041.5). Total num frames: 1570177024. Throughput: 0: 43899.0. Samples: 1473129000. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-27 23:04:08,857][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:04:11,282][06909] Updated weights for policy 0, policy_version 95843 (0.0032) [2024-06-27 23:04:13,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1570373632. Throughput: 0: 44054.7. Samples: 1473261460. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-27 23:04:13,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:04:15,472][06909] Updated weights for policy 0, policy_version 95853 (0.0035) [2024-06-27 23:04:18,850][06674] Fps is (10 sec: 42624.4, 60 sec: 43690.7, 300 sec: 44097.9). Total num frames: 1570603008. Throughput: 0: 43955.4. Samples: 1473524340. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-27 23:04:18,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:04:19,183][06909] Updated weights for policy 0, policy_version 95863 (0.0027) [2024-06-27 23:04:22,851][06909] Updated weights for policy 0, policy_version 95873 (0.0043) [2024-06-27 23:04:23,852][06674] Fps is (10 sec: 47504.2, 60 sec: 44508.4, 300 sec: 44097.7). Total num frames: 1570848768. Throughput: 0: 43766.5. Samples: 1473782980. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-27 23:04:23,852][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:04:26,522][06909] Updated weights for policy 0, policy_version 95883 (0.0035) [2024-06-27 23:04:28,852][06674] Fps is (10 sec: 42589.8, 60 sec: 43962.3, 300 sec: 43931.0). Total num frames: 1571028992. Throughput: 0: 43802.4. Samples: 1473915680. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-27 23:04:28,853][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:04:30,355][06909] Updated weights for policy 0, policy_version 95893 (0.0028) [2024-06-27 23:04:33,850][06674] Fps is (10 sec: 40968.5, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 1571258368. Throughput: 0: 43676.0. Samples: 1474174900. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-27 23:04:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:04:33,959][06909] Updated weights for policy 0, policy_version 95903 (0.0023) [2024-06-27 23:04:37,909][06909] Updated weights for policy 0, policy_version 95913 (0.0029) [2024-06-27 23:04:38,850][06674] Fps is (10 sec: 45884.7, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1571487744. Throughput: 0: 43757.8. Samples: 1474440520. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 23:04:38,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:04:41,302][06909] Updated weights for policy 0, policy_version 95923 (0.0024) [2024-06-27 23:04:43,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 1571684352. Throughput: 0: 43799.2. Samples: 1474572200. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 23:04:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 23:04:45,417][06909] Updated weights for policy 0, policy_version 95933 (0.0042) [2024-06-27 23:04:48,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 1571930112. Throughput: 0: 43814.8. Samples: 1474838760. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 23:04:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:04:48,851][06909] Updated weights for policy 0, policy_version 95943 (0.0032) [2024-06-27 23:04:52,647][06909] Updated weights for policy 0, policy_version 95953 (0.0029) [2024-06-27 23:04:53,850][06674] Fps is (10 sec: 47512.7, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 1572159488. Throughput: 0: 43819.6. Samples: 1475100620. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 23:04:53,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:04:56,422][06909] Updated weights for policy 0, policy_version 95963 (0.0040) [2024-06-27 23:04:58,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 1572339712. Throughput: 0: 43868.1. Samples: 1475235520. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 23:04:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:05:00,113][06909] Updated weights for policy 0, policy_version 95973 (0.0026) [2024-06-27 23:05:03,850][06674] Fps is (10 sec: 40960.6, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1572569088. Throughput: 0: 43863.6. Samples: 1475498200. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 23:05:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:05:04,100][06909] Updated weights for policy 0, policy_version 95983 (0.0039) [2024-06-27 23:05:07,410][06909] Updated weights for policy 0, policy_version 95993 (0.0029) [2024-06-27 23:05:08,850][06674] Fps is (10 sec: 47513.2, 60 sec: 43968.2, 300 sec: 43986.9). Total num frames: 1572814848. Throughput: 0: 43996.6. Samples: 1475762740. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 23:05:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:05:11,370][06909] Updated weights for policy 0, policy_version 96003 (0.0038) [2024-06-27 23:05:13,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 1572995072. Throughput: 0: 44122.0. Samples: 1475901080. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 23:05:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:05:15,007][06909] Updated weights for policy 0, policy_version 96013 (0.0042) [2024-06-27 23:05:18,712][06909] Updated weights for policy 0, policy_version 96023 (0.0038) [2024-06-27 23:05:18,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1573240832. Throughput: 0: 44175.5. Samples: 1476162800. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 23:05:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:05:21,215][06887] Signal inference workers to stop experience collection... (21150 times) [2024-06-27 23:05:21,216][06887] Signal inference workers to resume experience collection... (21150 times) [2024-06-27 23:05:21,228][06909] InferenceWorker_p0-w0: stopping experience collection (21150 times) [2024-06-27 23:05:21,228][06909] InferenceWorker_p0-w0: resuming experience collection (21150 times) [2024-06-27 23:05:22,434][06909] Updated weights for policy 0, policy_version 96033 (0.0025) [2024-06-27 23:05:23,850][06674] Fps is (10 sec: 49152.4, 60 sec: 43965.3, 300 sec: 44153.5). Total num frames: 1573486592. Throughput: 0: 44011.6. Samples: 1476421040. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 23:05:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:05:26,136][06909] Updated weights for policy 0, policy_version 96043 (0.0027) [2024-06-27 23:05:28,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43692.1, 300 sec: 43875.8). Total num frames: 1573650432. Throughput: 0: 44065.3. Samples: 1476555140. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-27 23:05:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 23:05:29,897][06909] Updated weights for policy 0, policy_version 96053 (0.0031) [2024-06-27 23:05:33,599][06909] Updated weights for policy 0, policy_version 96063 (0.0030) [2024-06-27 23:05:33,850][06674] Fps is (10 sec: 40959.1, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 1573896192. Throughput: 0: 44129.6. Samples: 1476824600. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 23:05:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:05:37,328][06909] Updated weights for policy 0, policy_version 96073 (0.0041) [2024-06-27 23:05:38,850][06674] Fps is (10 sec: 49152.4, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1574141952. Throughput: 0: 44187.8. Samples: 1477089060. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 23:05:38,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:05:40,908][06909] Updated weights for policy 0, policy_version 96083 (0.0025) [2024-06-27 23:05:43,850][06674] Fps is (10 sec: 44237.3, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 1574338560. Throughput: 0: 44279.5. Samples: 1477228100. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 23:05:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:05:44,521][06909] Updated weights for policy 0, policy_version 96093 (0.0042) [2024-06-27 23:05:48,388][06909] Updated weights for policy 0, policy_version 96103 (0.0036) [2024-06-27 23:05:48,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 1574567936. Throughput: 0: 44259.5. Samples: 1477489880. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 23:05:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:05:48,864][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000096104_1574567936.pth... [2024-06-27 23:05:48,916][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000095460_1564016640.pth [2024-06-27 23:05:52,165][06909] Updated weights for policy 0, policy_version 96113 (0.0028) [2024-06-27 23:05:53,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.9, 300 sec: 44042.4). Total num frames: 1574797312. Throughput: 0: 44053.8. Samples: 1477745160. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 23:05:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:05:55,983][06909] Updated weights for policy 0, policy_version 96123 (0.0028) [2024-06-27 23:05:58,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 1574993920. Throughput: 0: 44012.4. Samples: 1477881640. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 23:05:58,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:05:59,603][06909] Updated weights for policy 0, policy_version 96133 (0.0032) [2024-06-27 23:06:03,508][06909] Updated weights for policy 0, policy_version 96143 (0.0027) [2024-06-27 23:06:03,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1575223296. Throughput: 0: 44029.8. Samples: 1478144140. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 23:06:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:06:06,813][06909] Updated weights for policy 0, policy_version 96153 (0.0030) [2024-06-27 23:06:08,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 1575452672. Throughput: 0: 44094.5. Samples: 1478405300. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 23:06:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:06:10,686][06909] Updated weights for policy 0, policy_version 96163 (0.0028) [2024-06-27 23:06:13,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44509.9, 300 sec: 43931.3). Total num frames: 1575665664. Throughput: 0: 44210.3. Samples: 1478544600. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 23:06:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:06:14,250][06909] Updated weights for policy 0, policy_version 96173 (0.0032) [2024-06-27 23:06:17,968][06909] Updated weights for policy 0, policy_version 96183 (0.0037) [2024-06-27 23:06:18,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1575878656. Throughput: 0: 44278.7. Samples: 1478817140. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 23:06:18,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:06:21,647][06909] Updated weights for policy 0, policy_version 96193 (0.0027) [2024-06-27 23:06:23,850][06674] Fps is (10 sec: 47513.3, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 1576140800. Throughput: 0: 44042.6. Samples: 1479070980. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 23:06:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:06:25,134][06909] Updated weights for policy 0, policy_version 96203 (0.0025) [2024-06-27 23:06:28,435][06887] Signal inference workers to stop experience collection... (21200 times) [2024-06-27 23:06:28,436][06887] Signal inference workers to resume experience collection... (21200 times) [2024-06-27 23:06:28,449][06909] InferenceWorker_p0-w0: stopping experience collection (21200 times) [2024-06-27 23:06:28,459][06909] InferenceWorker_p0-w0: resuming experience collection (21200 times) [2024-06-27 23:06:28,850][06674] Fps is (10 sec: 45876.0, 60 sec: 44783.0, 300 sec: 44042.4). Total num frames: 1576337408. Throughput: 0: 44187.2. Samples: 1479216520. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-27 23:06:28,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 23:06:29,115][06909] Updated weights for policy 0, policy_version 96213 (0.0027) [2024-06-27 23:06:32,629][06909] Updated weights for policy 0, policy_version 96223 (0.0027) [2024-06-27 23:06:33,850][06674] Fps is (10 sec: 40959.8, 60 sec: 44236.9, 300 sec: 44097.9). Total num frames: 1576550400. Throughput: 0: 44128.0. Samples: 1479475640. Policy #0 lag: (min: 0.0, avg: 12.7, max: 22.0) [2024-06-27 23:06:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:06:36,593][06909] Updated weights for policy 0, policy_version 96233 (0.0037) [2024-06-27 23:06:38,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.8, 300 sec: 44153.8). Total num frames: 1576796160. Throughput: 0: 44132.1. Samples: 1479731100. Policy #0 lag: (min: 0.0, avg: 12.7, max: 22.0) [2024-06-27 23:06:38,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 23:06:40,416][06909] Updated weights for policy 0, policy_version 96243 (0.0037) [2024-06-27 23:06:43,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 43987.3). Total num frames: 1576992768. Throughput: 0: 44317.4. Samples: 1479875920. Policy #0 lag: (min: 0.0, avg: 12.7, max: 22.0) [2024-06-27 23:06:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:06:44,148][06909] Updated weights for policy 0, policy_version 96253 (0.0027) [2024-06-27 23:06:47,751][06909] Updated weights for policy 0, policy_version 96263 (0.0028) [2024-06-27 23:06:48,850][06674] Fps is (10 sec: 39321.6, 60 sec: 43690.8, 300 sec: 43931.3). Total num frames: 1577189376. Throughput: 0: 44107.6. Samples: 1480128980. Policy #0 lag: (min: 0.0, avg: 12.7, max: 22.0) [2024-06-27 23:06:48,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:06:51,633][06909] Updated weights for policy 0, policy_version 96273 (0.0035) [2024-06-27 23:06:53,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1577451520. Throughput: 0: 44146.8. Samples: 1480391900. Policy #0 lag: (min: 0.0, avg: 12.7, max: 22.0) [2024-06-27 23:06:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:06:55,127][06909] Updated weights for policy 0, policy_version 96283 (0.0028) [2024-06-27 23:06:58,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 1577648128. Throughput: 0: 44106.2. Samples: 1480529380. Policy #0 lag: (min: 0.0, avg: 12.7, max: 22.0) [2024-06-27 23:06:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:06:58,951][06909] Updated weights for policy 0, policy_version 96293 (0.0034) [2024-06-27 23:07:02,565][06909] Updated weights for policy 0, policy_version 96303 (0.0030) [2024-06-27 23:07:03,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 1577861120. Throughput: 0: 43869.8. Samples: 1480791280. Policy #0 lag: (min: 0.0, avg: 12.7, max: 22.0) [2024-06-27 23:07:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:07:06,254][06909] Updated weights for policy 0, policy_version 96313 (0.0022) [2024-06-27 23:07:08,850][06674] Fps is (10 sec: 45874.5, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1578106880. Throughput: 0: 43984.3. Samples: 1481050280. Policy #0 lag: (min: 0.0, avg: 12.7, max: 22.0) [2024-06-27 23:07:08,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:07:10,018][06909] Updated weights for policy 0, policy_version 96323 (0.0032) [2024-06-27 23:07:13,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1578303488. Throughput: 0: 43903.5. Samples: 1481192180. Policy #0 lag: (min: 0.0, avg: 12.7, max: 22.0) [2024-06-27 23:07:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:07:13,909][06909] Updated weights for policy 0, policy_version 96333 (0.0045) [2024-06-27 23:07:17,691][06909] Updated weights for policy 0, policy_version 96343 (0.0034) [2024-06-27 23:07:18,850][06674] Fps is (10 sec: 39321.4, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 1578500096. Throughput: 0: 43835.4. Samples: 1481448240. Policy #0 lag: (min: 0.0, avg: 12.7, max: 22.0) [2024-06-27 23:07:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:07:21,185][06909] Updated weights for policy 0, policy_version 96353 (0.0025) [2024-06-27 23:07:23,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43690.6, 300 sec: 44097.9). Total num frames: 1578762240. Throughput: 0: 43879.8. Samples: 1481705700. Policy #0 lag: (min: 0.0, avg: 12.7, max: 22.0) [2024-06-27 23:07:23,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 23:07:24,922][06909] Updated weights for policy 0, policy_version 96363 (0.0035) [2024-06-27 23:07:28,829][06909] Updated weights for policy 0, policy_version 96373 (0.0049) [2024-06-27 23:07:28,850][06674] Fps is (10 sec: 47514.1, 60 sec: 43963.6, 300 sec: 43986.8). Total num frames: 1578975232. Throughput: 0: 43978.2. Samples: 1481854940. Policy #0 lag: (min: 0.0, avg: 12.7, max: 22.0) [2024-06-27 23:07:28,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 23:07:32,468][06909] Updated weights for policy 0, policy_version 96383 (0.0034) [2024-06-27 23:07:33,850][06674] Fps is (10 sec: 40960.6, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1579171840. Throughput: 0: 44091.1. Samples: 1482113080. Policy #0 lag: (min: 1.0, avg: 10.6, max: 23.0) [2024-06-27 23:07:33,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 23:07:36,140][06909] Updated weights for policy 0, policy_version 96393 (0.0027) [2024-06-27 23:07:38,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43690.7, 300 sec: 44098.9). Total num frames: 1579417600. Throughput: 0: 44043.6. Samples: 1482373860. Policy #0 lag: (min: 1.0, avg: 10.6, max: 23.0) [2024-06-27 23:07:38,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 23:07:39,643][06909] Updated weights for policy 0, policy_version 96403 (0.0040) [2024-06-27 23:07:43,604][06909] Updated weights for policy 0, policy_version 96413 (0.0036) [2024-06-27 23:07:43,856][06674] Fps is (10 sec: 47484.2, 60 sec: 44232.3, 300 sec: 44041.5). Total num frames: 1579646976. Throughput: 0: 44244.1. Samples: 1482520640. Policy #0 lag: (min: 1.0, avg: 10.6, max: 23.0) [2024-06-27 23:07:43,857][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:07:46,950][06909] Updated weights for policy 0, policy_version 96423 (0.0028) [2024-06-27 23:07:48,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1579843584. Throughput: 0: 44101.5. Samples: 1482775840. Policy #0 lag: (min: 1.0, avg: 10.6, max: 23.0) [2024-06-27 23:07:48,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 23:07:48,858][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000096426_1579843584.pth... [2024-06-27 23:07:48,922][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000095782_1569292288.pth [2024-06-27 23:07:50,999][06909] Updated weights for policy 0, policy_version 96433 (0.0022) [2024-06-27 23:07:51,390][06887] Signal inference workers to stop experience collection... (21250 times) [2024-06-27 23:07:51,391][06887] Signal inference workers to resume experience collection... (21250 times) [2024-06-27 23:07:51,405][06909] InferenceWorker_p0-w0: stopping experience collection (21250 times) [2024-06-27 23:07:51,437][06909] InferenceWorker_p0-w0: resuming experience collection (21250 times) [2024-06-27 23:07:53,850][06674] Fps is (10 sec: 42624.6, 60 sec: 43690.6, 300 sec: 44042.7). Total num frames: 1580072960. Throughput: 0: 44101.4. Samples: 1483034840. Policy #0 lag: (min: 1.0, avg: 10.6, max: 23.0) [2024-06-27 23:07:53,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:07:54,645][06909] Updated weights for policy 0, policy_version 96443 (0.0026) [2024-06-27 23:07:58,204][06909] Updated weights for policy 0, policy_version 96453 (0.0026) [2024-06-27 23:07:58,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1580302336. Throughput: 0: 44208.9. Samples: 1483181580. Policy #0 lag: (min: 1.0, avg: 10.6, max: 23.0) [2024-06-27 23:07:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:08:01,860][06909] Updated weights for policy 0, policy_version 96463 (0.0027) [2024-06-27 23:08:03,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1580482560. Throughput: 0: 44200.5. Samples: 1483437260. Policy #0 lag: (min: 1.0, avg: 10.6, max: 23.0) [2024-06-27 23:08:03,854][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:08:05,847][06909] Updated weights for policy 0, policy_version 96473 (0.0040) [2024-06-27 23:08:08,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1580728320. Throughput: 0: 44301.8. Samples: 1483699280. Policy #0 lag: (min: 1.0, avg: 10.6, max: 23.0) [2024-06-27 23:08:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:08:09,581][06909] Updated weights for policy 0, policy_version 96483 (0.0029) [2024-06-27 23:08:13,202][06909] Updated weights for policy 0, policy_version 96493 (0.0025) [2024-06-27 23:08:13,852][06674] Fps is (10 sec: 49142.6, 60 sec: 44508.4, 300 sec: 44042.1). Total num frames: 1580974080. Throughput: 0: 44220.3. Samples: 1483844940. Policy #0 lag: (min: 1.0, avg: 10.6, max: 23.0) [2024-06-27 23:08:13,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:08:16,753][06909] Updated weights for policy 0, policy_version 96503 (0.0040) [2024-06-27 23:08:18,850][06674] Fps is (10 sec: 42598.2, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 1581154304. Throughput: 0: 44289.6. Samples: 1484106120. Policy #0 lag: (min: 1.0, avg: 10.6, max: 23.0) [2024-06-27 23:08:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:08:20,551][06909] Updated weights for policy 0, policy_version 96513 (0.0032) [2024-06-27 23:08:23,850][06674] Fps is (10 sec: 44246.0, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 1581416448. Throughput: 0: 44303.6. Samples: 1484367520. Policy #0 lag: (min: 1.0, avg: 10.6, max: 23.0) [2024-06-27 23:08:23,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 23:08:23,933][06909] Updated weights for policy 0, policy_version 96523 (0.0028) [2024-06-27 23:08:28,047][06909] Updated weights for policy 0, policy_version 96533 (0.0028) [2024-06-27 23:08:28,850][06674] Fps is (10 sec: 47514.1, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 1581629440. Throughput: 0: 44122.5. Samples: 1484505880. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-27 23:08:28,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 23:08:31,685][06909] Updated weights for policy 0, policy_version 96543 (0.0034) [2024-06-27 23:08:33,850][06674] Fps is (10 sec: 39321.5, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 1581809664. Throughput: 0: 44303.6. Samples: 1484769500. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-27 23:08:33,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 23:08:35,246][06909] Updated weights for policy 0, policy_version 96553 (0.0039) [2024-06-27 23:08:38,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1582071808. Throughput: 0: 44312.1. Samples: 1485028880. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-27 23:08:38,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:08:38,880][06909] Updated weights for policy 0, policy_version 96563 (0.0026) [2024-06-27 23:08:42,830][06909] Updated weights for policy 0, policy_version 96573 (0.0035) [2024-06-27 23:08:43,850][06674] Fps is (10 sec: 49151.7, 60 sec: 44241.3, 300 sec: 44097.9). Total num frames: 1582301184. Throughput: 0: 44123.5. Samples: 1485167140. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-27 23:08:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:08:46,766][06909] Updated weights for policy 0, policy_version 96583 (0.0026) [2024-06-27 23:08:48,850][06674] Fps is (10 sec: 42598.0, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 1582497792. Throughput: 0: 44227.6. Samples: 1485427500. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-27 23:08:48,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:08:50,191][06909] Updated weights for policy 0, policy_version 96593 (0.0031) [2024-06-27 23:08:53,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1582727168. Throughput: 0: 44217.8. Samples: 1485689080. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-27 23:08:53,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:08:54,129][06909] Updated weights for policy 0, policy_version 96603 (0.0044) [2024-06-27 23:08:57,590][06909] Updated weights for policy 0, policy_version 96613 (0.0030) [2024-06-27 23:08:58,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 1582956544. Throughput: 0: 43969.0. Samples: 1485823460. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-27 23:08:58,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:09:01,454][06909] Updated weights for policy 0, policy_version 96623 (0.0021) [2024-06-27 23:09:03,850][06674] Fps is (10 sec: 40959.7, 60 sec: 44236.8, 300 sec: 43932.2). Total num frames: 1583136768. Throughput: 0: 43755.1. Samples: 1486075100. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-27 23:09:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:09:05,401][06909] Updated weights for policy 0, policy_version 96633 (0.0036) [2024-06-27 23:09:05,941][06887] Signal inference workers to stop experience collection... (21300 times) [2024-06-27 23:09:05,942][06887] Signal inference workers to resume experience collection... (21300 times) [2024-06-27 23:09:05,955][06909] InferenceWorker_p0-w0: stopping experience collection (21300 times) [2024-06-27 23:09:05,955][06909] InferenceWorker_p0-w0: resuming experience collection (21300 times) [2024-06-27 23:09:08,850][06674] Fps is (10 sec: 42599.1, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 1583382528. Throughput: 0: 43829.8. Samples: 1486339860. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-27 23:09:08,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:09:08,981][06909] Updated weights for policy 0, policy_version 96643 (0.0036) [2024-06-27 23:09:12,848][06909] Updated weights for policy 0, policy_version 96653 (0.0028) [2024-06-27 23:09:13,850][06674] Fps is (10 sec: 47514.2, 60 sec: 43965.2, 300 sec: 44098.0). Total num frames: 1583611904. Throughput: 0: 43863.5. Samples: 1486479740. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-27 23:09:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:09:16,506][06909] Updated weights for policy 0, policy_version 96663 (0.0033) [2024-06-27 23:09:18,850][06674] Fps is (10 sec: 40959.3, 60 sec: 43963.7, 300 sec: 43876.1). Total num frames: 1583792128. Throughput: 0: 43761.2. Samples: 1486738760. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-27 23:09:18,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:09:20,040][06909] Updated weights for policy 0, policy_version 96673 (0.0048) [2024-06-27 23:09:23,856][06674] Fps is (10 sec: 42572.7, 60 sec: 43686.2, 300 sec: 44097.4). Total num frames: 1584037888. Throughput: 0: 43881.2. Samples: 1487003800. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-27 23:09:23,856][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 23:09:24,362][06909] Updated weights for policy 0, policy_version 96683 (0.0027) [2024-06-27 23:09:27,515][06909] Updated weights for policy 0, policy_version 96693 (0.0036) [2024-06-27 23:09:28,850][06674] Fps is (10 sec: 47513.5, 60 sec: 43963.6, 300 sec: 44097.9). Total num frames: 1584267264. Throughput: 0: 43724.4. Samples: 1487134740. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 23:09:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:09:31,560][06909] Updated weights for policy 0, policy_version 96703 (0.0038) [2024-06-27 23:09:33,852][06674] Fps is (10 sec: 42615.5, 60 sec: 44235.3, 300 sec: 43986.6). Total num frames: 1584463872. Throughput: 0: 43842.5. Samples: 1487400500. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 23:09:33,852][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:09:35,121][06909] Updated weights for policy 0, policy_version 96713 (0.0035) [2024-06-27 23:09:38,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43690.6, 300 sec: 44097.9). Total num frames: 1584693248. Throughput: 0: 43882.7. Samples: 1487663800. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 23:09:38,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 23:09:39,098][06909] Updated weights for policy 0, policy_version 96723 (0.0027) [2024-06-27 23:09:42,400][06909] Updated weights for policy 0, policy_version 96733 (0.0035) [2024-06-27 23:09:43,850][06674] Fps is (10 sec: 47523.2, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 1584939008. Throughput: 0: 43932.1. Samples: 1487800400. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 23:09:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:09:46,341][06909] Updated weights for policy 0, policy_version 96743 (0.0040) [2024-06-27 23:09:48,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1585135616. Throughput: 0: 44188.1. Samples: 1488063560. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 23:09:48,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:09:48,868][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000096749_1585135616.pth... [2024-06-27 23:09:48,926][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000096104_1574567936.pth [2024-06-27 23:09:49,900][06909] Updated weights for policy 0, policy_version 96753 (0.0030) [2024-06-27 23:09:53,649][06909] Updated weights for policy 0, policy_version 96763 (0.0022) [2024-06-27 23:09:53,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 1585364992. Throughput: 0: 43982.5. Samples: 1488319080. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 23:09:53,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:09:57,283][06909] Updated weights for policy 0, policy_version 96773 (0.0037) [2024-06-27 23:09:58,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 1585594368. Throughput: 0: 44018.3. Samples: 1488460560. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 23:09:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:10:01,404][06909] Updated weights for policy 0, policy_version 96783 (0.0033) [2024-06-27 23:10:03,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44510.0, 300 sec: 44042.4). Total num frames: 1585807360. Throughput: 0: 44150.8. Samples: 1488725540. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 23:10:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:10:04,787][06909] Updated weights for policy 0, policy_version 96793 (0.0032) [2024-06-27 23:10:08,536][06909] Updated weights for policy 0, policy_version 96803 (0.0031) [2024-06-27 23:10:08,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 1586020352. Throughput: 0: 44209.1. Samples: 1488992940. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 23:10:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:10:12,025][06909] Updated weights for policy 0, policy_version 96813 (0.0031) [2024-06-27 23:10:13,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1586266112. Throughput: 0: 44322.7. Samples: 1489129260. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 23:10:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:10:16,089][06909] Updated weights for policy 0, policy_version 96823 (0.0045) [2024-06-27 23:10:18,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44509.9, 300 sec: 43986.9). Total num frames: 1586462720. Throughput: 0: 44285.1. Samples: 1489393240. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 23:10:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:10:19,368][06909] Updated weights for policy 0, policy_version 96833 (0.0041) [2024-06-27 23:10:23,369][06909] Updated weights for policy 0, policy_version 96843 (0.0047) [2024-06-27 23:10:23,850][06674] Fps is (10 sec: 42598.8, 60 sec: 44241.3, 300 sec: 44209.0). Total num frames: 1586692096. Throughput: 0: 44130.7. Samples: 1489649680. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 23:10:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:10:26,941][06909] Updated weights for policy 0, policy_version 96853 (0.0038) [2024-06-27 23:10:28,850][06674] Fps is (10 sec: 45875.7, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 1586921472. Throughput: 0: 44057.9. Samples: 1489783000. Policy #0 lag: (min: 0.0, avg: 7.9, max: 22.0) [2024-06-27 23:10:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:10:30,799][06909] Updated weights for policy 0, policy_version 96863 (0.0039) [2024-06-27 23:10:33,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44238.4, 300 sec: 43986.9). Total num frames: 1587118080. Throughput: 0: 44124.6. Samples: 1490049160. Policy #0 lag: (min: 0.0, avg: 7.9, max: 22.0) [2024-06-27 23:10:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:10:34,009][06887] Signal inference workers to stop experience collection... (21350 times) [2024-06-27 23:10:34,009][06887] Signal inference workers to resume experience collection... (21350 times) [2024-06-27 23:10:34,058][06909] InferenceWorker_p0-w0: stopping experience collection (21350 times) [2024-06-27 23:10:34,059][06909] InferenceWorker_p0-w0: resuming experience collection (21350 times) [2024-06-27 23:10:34,397][06909] Updated weights for policy 0, policy_version 96873 (0.0039) [2024-06-27 23:10:38,534][06909] Updated weights for policy 0, policy_version 96883 (0.0041) [2024-06-27 23:10:38,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1587331072. Throughput: 0: 44163.1. Samples: 1490306420. Policy #0 lag: (min: 0.0, avg: 7.9, max: 22.0) [2024-06-27 23:10:38,859][06674] Avg episode reward: [(0, '0.401')] [2024-06-27 23:10:41,880][06909] Updated weights for policy 0, policy_version 96893 (0.0032) [2024-06-27 23:10:43,856][06674] Fps is (10 sec: 45847.0, 60 sec: 43959.3, 300 sec: 44097.1). Total num frames: 1587576832. Throughput: 0: 44010.9. Samples: 1490441320. Policy #0 lag: (min: 0.0, avg: 7.9, max: 22.0) [2024-06-27 23:10:43,856][06674] Avg episode reward: [(0, '0.401')] [2024-06-27 23:10:45,729][06909] Updated weights for policy 0, policy_version 96903 (0.0032) [2024-06-27 23:10:48,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1587773440. Throughput: 0: 44030.2. Samples: 1490706900. Policy #0 lag: (min: 0.0, avg: 7.9, max: 22.0) [2024-06-27 23:10:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:10:49,400][06909] Updated weights for policy 0, policy_version 96913 (0.0035) [2024-06-27 23:10:53,219][06909] Updated weights for policy 0, policy_version 96923 (0.0046) [2024-06-27 23:10:53,850][06674] Fps is (10 sec: 40985.1, 60 sec: 43690.8, 300 sec: 44042.4). Total num frames: 1587986432. Throughput: 0: 43813.8. Samples: 1490964560. Policy #0 lag: (min: 0.0, avg: 7.9, max: 22.0) [2024-06-27 23:10:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:10:56,978][06909] Updated weights for policy 0, policy_version 96933 (0.0038) [2024-06-27 23:10:58,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 1588232192. Throughput: 0: 43819.6. Samples: 1491101140. Policy #0 lag: (min: 0.0, avg: 7.9, max: 22.0) [2024-06-27 23:10:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:11:01,074][06909] Updated weights for policy 0, policy_version 96943 (0.0036) [2024-06-27 23:11:03,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 1588428800. Throughput: 0: 43772.0. Samples: 1491362980. Policy #0 lag: (min: 0.0, avg: 7.9, max: 22.0) [2024-06-27 23:11:03,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:11:04,295][06909] Updated weights for policy 0, policy_version 96953 (0.0030) [2024-06-27 23:11:08,313][06909] Updated weights for policy 0, policy_version 96963 (0.0025) [2024-06-27 23:11:08,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1588641792. Throughput: 0: 43774.2. Samples: 1491619520. Policy #0 lag: (min: 0.0, avg: 7.9, max: 22.0) [2024-06-27 23:11:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:11:11,790][06909] Updated weights for policy 0, policy_version 96973 (0.0025) [2024-06-27 23:11:13,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 1588887552. Throughput: 0: 43801.3. Samples: 1491754060. Policy #0 lag: (min: 0.0, avg: 7.9, max: 22.0) [2024-06-27 23:11:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:11:15,871][06909] Updated weights for policy 0, policy_version 96983 (0.0024) [2024-06-27 23:11:18,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 1589100544. Throughput: 0: 43793.7. Samples: 1492019880. Policy #0 lag: (min: 0.0, avg: 7.9, max: 22.0) [2024-06-27 23:11:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:11:19,091][06909] Updated weights for policy 0, policy_version 96993 (0.0033) [2024-06-27 23:11:23,086][06909] Updated weights for policy 0, policy_version 97003 (0.0035) [2024-06-27 23:11:23,852][06674] Fps is (10 sec: 40951.6, 60 sec: 43416.1, 300 sec: 43931.0). Total num frames: 1589297152. Throughput: 0: 44096.7. Samples: 1492290860. Policy #0 lag: (min: 0.0, avg: 7.9, max: 22.0) [2024-06-27 23:11:23,852][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:11:26,659][06909] Updated weights for policy 0, policy_version 97013 (0.0029) [2024-06-27 23:11:28,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 1589542912. Throughput: 0: 43973.1. Samples: 1492419840. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 23:11:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:11:30,269][06909] Updated weights for policy 0, policy_version 97023 (0.0035) [2024-06-27 23:11:33,850][06674] Fps is (10 sec: 45884.7, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 1589755904. Throughput: 0: 43972.9. Samples: 1492685680. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 23:11:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:11:34,129][06909] Updated weights for policy 0, policy_version 97033 (0.0029) [2024-06-27 23:11:37,636][06909] Updated weights for policy 0, policy_version 97043 (0.0041) [2024-06-27 23:11:38,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 1589952512. Throughput: 0: 44030.1. Samples: 1492945920. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 23:11:38,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:11:41,494][06909] Updated weights for policy 0, policy_version 97053 (0.0033) [2024-06-27 23:11:43,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43695.0, 300 sec: 44097.9). Total num frames: 1590198272. Throughput: 0: 43765.2. Samples: 1493070580. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 23:11:43,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:11:45,640][06909] Updated weights for policy 0, policy_version 97063 (0.0037) [2024-06-27 23:11:48,850][06674] Fps is (10 sec: 47513.9, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1590427648. Throughput: 0: 43897.8. Samples: 1493338380. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 23:11:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:11:48,864][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000097072_1590427648.pth... [2024-06-27 23:11:48,928][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000096426_1579843584.pth [2024-06-27 23:11:49,078][06909] Updated weights for policy 0, policy_version 97073 (0.0032) [2024-06-27 23:11:53,271][06909] Updated weights for policy 0, policy_version 97083 (0.0027) [2024-06-27 23:11:53,852][06674] Fps is (10 sec: 40951.9, 60 sec: 43689.1, 300 sec: 43931.0). Total num frames: 1590607872. Throughput: 0: 44051.7. Samples: 1493601940. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 23:11:53,853][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:11:56,374][06909] Updated weights for policy 0, policy_version 97093 (0.0034) [2024-06-27 23:11:58,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1590853632. Throughput: 0: 43914.3. Samples: 1493730200. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 23:11:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:11:59,557][06887] Signal inference workers to stop experience collection... (21400 times) [2024-06-27 23:11:59,559][06887] Signal inference workers to resume experience collection... (21400 times) [2024-06-27 23:11:59,572][06909] InferenceWorker_p0-w0: stopping experience collection (21400 times) [2024-06-27 23:11:59,572][06909] InferenceWorker_p0-w0: resuming experience collection (21400 times) [2024-06-27 23:12:00,362][06909] Updated weights for policy 0, policy_version 97103 (0.0025) [2024-06-27 23:12:03,850][06674] Fps is (10 sec: 47523.3, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1591083008. Throughput: 0: 44180.4. Samples: 1494008000. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 23:12:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:12:03,886][06909] Updated weights for policy 0, policy_version 97113 (0.0027) [2024-06-27 23:12:07,566][06909] Updated weights for policy 0, policy_version 97123 (0.0026) [2024-06-27 23:12:08,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1591296000. Throughput: 0: 44151.8. Samples: 1494277600. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 23:12:08,858][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:12:11,212][06909] Updated weights for policy 0, policy_version 97133 (0.0031) [2024-06-27 23:12:13,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 1591525376. Throughput: 0: 43907.6. Samples: 1494395680. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 23:12:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:12:14,894][06909] Updated weights for policy 0, policy_version 97143 (0.0031) [2024-06-27 23:12:18,582][06909] Updated weights for policy 0, policy_version 97153 (0.0026) [2024-06-27 23:12:18,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1591754752. Throughput: 0: 44077.7. Samples: 1494669180. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-27 23:12:18,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:12:22,684][06909] Updated weights for policy 0, policy_version 97163 (0.0030) [2024-06-27 23:12:23,850][06674] Fps is (10 sec: 42598.0, 60 sec: 44238.3, 300 sec: 43986.9). Total num frames: 1591951360. Throughput: 0: 44112.5. Samples: 1494930980. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 23:12:23,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:12:26,342][06909] Updated weights for policy 0, policy_version 97173 (0.0033) [2024-06-27 23:12:28,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 1592180736. Throughput: 0: 44100.1. Samples: 1495055080. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 23:12:28,859][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:12:30,361][06909] Updated weights for policy 0, policy_version 97183 (0.0034) [2024-06-27 23:12:33,672][06909] Updated weights for policy 0, policy_version 97193 (0.0037) [2024-06-27 23:12:33,852][06674] Fps is (10 sec: 45866.0, 60 sec: 44235.3, 300 sec: 44042.1). Total num frames: 1592410112. Throughput: 0: 44113.1. Samples: 1495323560. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 23:12:33,861][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:12:37,509][06909] Updated weights for policy 0, policy_version 97203 (0.0031) [2024-06-27 23:12:38,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44509.9, 300 sec: 43987.8). Total num frames: 1592623104. Throughput: 0: 44342.9. Samples: 1495597280. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 23:12:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:12:40,936][06909] Updated weights for policy 0, policy_version 97213 (0.0032) [2024-06-27 23:12:43,850][06674] Fps is (10 sec: 44246.3, 60 sec: 44237.0, 300 sec: 44098.0). Total num frames: 1592852480. Throughput: 0: 44402.2. Samples: 1495728300. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 23:12:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:12:44,632][06909] Updated weights for policy 0, policy_version 97223 (0.0026) [2024-06-27 23:12:48,159][06909] Updated weights for policy 0, policy_version 97233 (0.0032) [2024-06-27 23:12:48,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 1593081856. Throughput: 0: 44109.4. Samples: 1495992920. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 23:12:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:12:52,358][06909] Updated weights for policy 0, policy_version 97243 (0.0029) [2024-06-27 23:12:53,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44511.5, 300 sec: 43986.9). Total num frames: 1593278464. Throughput: 0: 44035.2. Samples: 1496259180. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 23:12:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:12:55,543][06909] Updated weights for policy 0, policy_version 97253 (0.0039) [2024-06-27 23:12:58,850][06674] Fps is (10 sec: 42598.9, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1593507840. Throughput: 0: 44220.0. Samples: 1496385580. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 23:12:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 23:12:59,539][06909] Updated weights for policy 0, policy_version 97263 (0.0037) [2024-06-27 23:13:03,489][06909] Updated weights for policy 0, policy_version 97273 (0.0033) [2024-06-27 23:13:03,850][06674] Fps is (10 sec: 44235.9, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1593720832. Throughput: 0: 44031.5. Samples: 1496650600. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 23:13:03,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:13:07,111][06909] Updated weights for policy 0, policy_version 97283 (0.0022) [2024-06-27 23:13:08,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.8, 300 sec: 43987.2). Total num frames: 1593950208. Throughput: 0: 44177.4. Samples: 1496918960. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 23:13:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:13:09,458][06887] Signal inference workers to stop experience collection... (21450 times) [2024-06-27 23:13:09,495][06909] InferenceWorker_p0-w0: stopping experience collection (21450 times) [2024-06-27 23:13:09,515][06887] Signal inference workers to resume experience collection... (21450 times) [2024-06-27 23:13:09,516][06909] InferenceWorker_p0-w0: resuming experience collection (21450 times) [2024-06-27 23:13:10,622][06909] Updated weights for policy 0, policy_version 97293 (0.0038) [2024-06-27 23:13:13,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44509.7, 300 sec: 44209.0). Total num frames: 1594195968. Throughput: 0: 44395.5. Samples: 1497052880. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 23:13:13,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:13:14,311][06909] Updated weights for policy 0, policy_version 97303 (0.0023) [2024-06-27 23:13:17,768][06909] Updated weights for policy 0, policy_version 97313 (0.0033) [2024-06-27 23:13:18,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1594408960. Throughput: 0: 44357.5. Samples: 1497319560. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-27 23:13:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:13:21,716][06909] Updated weights for policy 0, policy_version 97323 (0.0026) [2024-06-27 23:13:23,850][06674] Fps is (10 sec: 40960.9, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 1594605568. Throughput: 0: 44301.9. Samples: 1497590860. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-27 23:13:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:13:25,251][06909] Updated weights for policy 0, policy_version 97333 (0.0021) [2024-06-27 23:13:28,850][06674] Fps is (10 sec: 42598.7, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 1594834944. Throughput: 0: 44224.8. Samples: 1497718420. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-27 23:13:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:13:29,426][06909] Updated weights for policy 0, policy_version 97343 (0.0032) [2024-06-27 23:13:32,893][06909] Updated weights for policy 0, policy_version 97353 (0.0040) [2024-06-27 23:13:33,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44238.4, 300 sec: 44042.4). Total num frames: 1595064320. Throughput: 0: 44047.2. Samples: 1497975040. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-27 23:13:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:13:36,607][06909] Updated weights for policy 0, policy_version 97363 (0.0036) [2024-06-27 23:13:38,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1595277312. Throughput: 0: 44135.9. Samples: 1498245300. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-27 23:13:38,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:13:40,286][06909] Updated weights for policy 0, policy_version 97373 (0.0031) [2024-06-27 23:13:43,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1595490304. Throughput: 0: 44161.3. Samples: 1498372840. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-27 23:13:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:13:44,350][06909] Updated weights for policy 0, policy_version 97383 (0.0025) [2024-06-27 23:13:47,580][06909] Updated weights for policy 0, policy_version 97393 (0.0029) [2024-06-27 23:13:48,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1595736064. Throughput: 0: 44185.9. Samples: 1498638960. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-27 23:13:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:13:48,974][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000097397_1595752448.pth... [2024-06-27 23:13:49,024][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000096749_1585135616.pth [2024-06-27 23:13:52,103][06909] Updated weights for policy 0, policy_version 97403 (0.0035) [2024-06-27 23:13:53,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 1595932672. Throughput: 0: 44242.6. Samples: 1498909880. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-27 23:13:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:13:54,861][06909] Updated weights for policy 0, policy_version 97413 (0.0039) [2024-06-27 23:13:58,850][06674] Fps is (10 sec: 42598.0, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 1596162048. Throughput: 0: 44111.6. Samples: 1499037900. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-27 23:13:58,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:13:59,405][06909] Updated weights for policy 0, policy_version 97423 (0.0036) [2024-06-27 23:14:02,527][06909] Updated weights for policy 0, policy_version 97433 (0.0027) [2024-06-27 23:14:03,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44509.9, 300 sec: 44097.9). Total num frames: 1596391424. Throughput: 0: 44104.0. Samples: 1499304240. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-27 23:14:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:14:06,619][06909] Updated weights for policy 0, policy_version 97443 (0.0029) [2024-06-27 23:14:08,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 1596604416. Throughput: 0: 43963.9. Samples: 1499569240. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-27 23:14:08,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 23:14:10,212][06909] Updated weights for policy 0, policy_version 97453 (0.0041) [2024-06-27 23:14:13,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.8, 300 sec: 44153.5). Total num frames: 1596817408. Throughput: 0: 43903.1. Samples: 1499694060. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-27 23:14:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:14:14,173][06909] Updated weights for policy 0, policy_version 97463 (0.0047) [2024-06-27 23:14:17,524][06909] Updated weights for policy 0, policy_version 97473 (0.0033) [2024-06-27 23:14:18,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 44098.9). Total num frames: 1597046784. Throughput: 0: 44083.5. Samples: 1499958800. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-27 23:14:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:14:21,742][06909] Updated weights for policy 0, policy_version 97483 (0.0029) [2024-06-27 23:14:23,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1597243392. Throughput: 0: 44010.2. Samples: 1500225760. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-27 23:14:23,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:14:24,840][06909] Updated weights for policy 0, policy_version 97493 (0.0051) [2024-06-27 23:14:28,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43963.8, 300 sec: 44098.3). Total num frames: 1597472768. Throughput: 0: 43944.5. Samples: 1500350340. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-27 23:14:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:14:29,190][06909] Updated weights for policy 0, policy_version 97503 (0.0033) [2024-06-27 23:14:32,153][06909] Updated weights for policy 0, policy_version 97513 (0.0033) [2024-06-27 23:14:33,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 1597702144. Throughput: 0: 43964.5. Samples: 1500617360. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-27 23:14:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:14:36,610][06909] Updated weights for policy 0, policy_version 97523 (0.0028) [2024-06-27 23:14:38,596][06887] Signal inference workers to stop experience collection... (21500 times) [2024-06-27 23:14:38,643][06909] InferenceWorker_p0-w0: stopping experience collection (21500 times) [2024-06-27 23:14:38,713][06887] Signal inference workers to resume experience collection... (21500 times) [2024-06-27 23:14:38,713][06909] InferenceWorker_p0-w0: resuming experience collection (21500 times) [2024-06-27 23:14:38,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1597915136. Throughput: 0: 44072.1. Samples: 1500893120. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-27 23:14:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:14:39,867][06909] Updated weights for policy 0, policy_version 97533 (0.0030) [2024-06-27 23:14:43,749][06909] Updated weights for policy 0, policy_version 97543 (0.0042) [2024-06-27 23:14:43,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1598144512. Throughput: 0: 44005.0. Samples: 1501018120. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-27 23:14:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:14:47,519][06909] Updated weights for policy 0, policy_version 97553 (0.0043) [2024-06-27 23:14:48,850][06674] Fps is (10 sec: 45874.5, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 1598373888. Throughput: 0: 43932.0. Samples: 1501281180. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-27 23:14:48,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:14:51,566][06909] Updated weights for policy 0, policy_version 97563 (0.0034) [2024-06-27 23:14:53,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1598570496. Throughput: 0: 43815.5. Samples: 1501540940. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-27 23:14:53,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:14:54,829][06909] Updated weights for policy 0, policy_version 97573 (0.0037) [2024-06-27 23:14:58,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43690.8, 300 sec: 43986.9). Total num frames: 1598783488. Throughput: 0: 43966.7. Samples: 1501672560. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-27 23:14:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:14:58,963][06909] Updated weights for policy 0, policy_version 97583 (0.0025) [2024-06-27 23:15:02,211][06909] Updated weights for policy 0, policy_version 97593 (0.0030) [2024-06-27 23:15:03,850][06674] Fps is (10 sec: 45875.8, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 1599029248. Throughput: 0: 43889.9. Samples: 1501933840. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-27 23:15:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:15:06,526][06909] Updated weights for policy 0, policy_version 97603 (0.0033) [2024-06-27 23:15:08,852][06674] Fps is (10 sec: 44227.2, 60 sec: 43689.1, 300 sec: 43931.0). Total num frames: 1599225856. Throughput: 0: 44089.9. Samples: 1502209900. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-27 23:15:08,853][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:15:09,755][06909] Updated weights for policy 0, policy_version 97613 (0.0033) [2024-06-27 23:15:13,848][06909] Updated weights for policy 0, policy_version 97623 (0.0036) [2024-06-27 23:15:13,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1599455232. Throughput: 0: 44073.6. Samples: 1502333660. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-27 23:15:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:15:17,538][06909] Updated weights for policy 0, policy_version 97633 (0.0031) [2024-06-27 23:15:18,850][06674] Fps is (10 sec: 45884.8, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1599684608. Throughput: 0: 43972.8. Samples: 1502596140. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-27 23:15:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:15:21,002][06909] Updated weights for policy 0, policy_version 97643 (0.0028) [2024-06-27 23:15:23,850][06674] Fps is (10 sec: 44237.3, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1599897600. Throughput: 0: 43764.8. Samples: 1502862540. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 23:15:23,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:15:24,806][06909] Updated weights for policy 0, policy_version 97653 (0.0031) [2024-06-27 23:15:28,658][06909] Updated weights for policy 0, policy_version 97663 (0.0038) [2024-06-27 23:15:28,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1600110592. Throughput: 0: 43922.6. Samples: 1502994640. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 23:15:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:15:31,968][06909] Updated weights for policy 0, policy_version 97673 (0.0026) [2024-06-27 23:15:33,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 1600339968. Throughput: 0: 43838.3. Samples: 1503253900. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 23:15:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:15:36,031][06909] Updated weights for policy 0, policy_version 97683 (0.0033) [2024-06-27 23:15:38,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.7, 300 sec: 43987.8). Total num frames: 1600552960. Throughput: 0: 44277.4. Samples: 1503533420. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 23:15:38,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:15:39,279][06909] Updated weights for policy 0, policy_version 97693 (0.0043) [2024-06-27 23:15:43,408][06909] Updated weights for policy 0, policy_version 97703 (0.0031) [2024-06-27 23:15:43,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 1600782336. Throughput: 0: 44198.2. Samples: 1503661480. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 23:15:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:15:46,705][06909] Updated weights for policy 0, policy_version 97713 (0.0030) [2024-06-27 23:15:48,850][06674] Fps is (10 sec: 47513.4, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 1601028096. Throughput: 0: 44325.7. Samples: 1503928500. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 23:15:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:15:48,963][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000097720_1601044480.pth... [2024-06-27 23:15:49,019][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000097072_1590427648.pth [2024-06-27 23:15:50,812][06887] Signal inference workers to stop experience collection... (21550 times) [2024-06-27 23:15:50,812][06887] Signal inference workers to resume experience collection... (21550 times) [2024-06-27 23:15:50,853][06909] InferenceWorker_p0-w0: stopping experience collection (21550 times) [2024-06-27 23:15:50,853][06909] InferenceWorker_p0-w0: resuming experience collection (21550 times) [2024-06-27 23:15:50,953][06909] Updated weights for policy 0, policy_version 97723 (0.0028) [2024-06-27 23:15:53,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1601224704. Throughput: 0: 43991.8. Samples: 1504189440. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 23:15:53,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:15:54,435][06909] Updated weights for policy 0, policy_version 97733 (0.0031) [2024-06-27 23:15:58,212][06909] Updated weights for policy 0, policy_version 97743 (0.0032) [2024-06-27 23:15:58,850][06674] Fps is (10 sec: 40959.8, 60 sec: 44236.7, 300 sec: 44098.0). Total num frames: 1601437696. Throughput: 0: 44047.5. Samples: 1504315800. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 23:15:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:16:01,839][06909] Updated weights for policy 0, policy_version 97753 (0.0036) [2024-06-27 23:16:03,850][06674] Fps is (10 sec: 45875.8, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 1601683456. Throughput: 0: 43961.0. Samples: 1504574380. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 23:16:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:16:06,006][06909] Updated weights for policy 0, policy_version 97763 (0.0032) [2024-06-27 23:16:08,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44238.3, 300 sec: 44042.4). Total num frames: 1601880064. Throughput: 0: 43984.4. Samples: 1504841840. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 23:16:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:16:09,090][06909] Updated weights for policy 0, policy_version 97773 (0.0028) [2024-06-27 23:16:13,338][06909] Updated weights for policy 0, policy_version 97783 (0.0026) [2024-06-27 23:16:13,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1602093056. Throughput: 0: 44039.6. Samples: 1504976420. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-27 23:16:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:16:16,500][06909] Updated weights for policy 0, policy_version 97793 (0.0034) [2024-06-27 23:16:18,850][06674] Fps is (10 sec: 45875.8, 60 sec: 44236.9, 300 sec: 44209.3). Total num frames: 1602338816. Throughput: 0: 44215.1. Samples: 1505243580. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 23:16:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:16:20,699][06909] Updated weights for policy 0, policy_version 97803 (0.0025) [2024-06-27 23:16:23,805][06909] Updated weights for policy 0, policy_version 97813 (0.0039) [2024-06-27 23:16:23,850][06674] Fps is (10 sec: 47512.9, 60 sec: 44509.7, 300 sec: 44153.5). Total num frames: 1602568192. Throughput: 0: 43883.9. Samples: 1505508200. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 23:16:23,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 23:16:28,032][06909] Updated weights for policy 0, policy_version 97823 (0.0040) [2024-06-27 23:16:28,850][06674] Fps is (10 sec: 40959.1, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1602748416. Throughput: 0: 43893.2. Samples: 1505636680. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 23:16:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:16:31,241][06909] Updated weights for policy 0, policy_version 97833 (0.0047) [2024-06-27 23:16:33,850][06674] Fps is (10 sec: 44237.7, 60 sec: 44509.9, 300 sec: 44264.6). Total num frames: 1603010560. Throughput: 0: 44021.4. Samples: 1505909460. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 23:16:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:16:35,325][06909] Updated weights for policy 0, policy_version 97843 (0.0030) [2024-06-27 23:16:38,749][06909] Updated weights for policy 0, policy_version 97853 (0.0026) [2024-06-27 23:16:38,850][06674] Fps is (10 sec: 47513.8, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 1603223552. Throughput: 0: 43968.0. Samples: 1506168000. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 23:16:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:16:42,809][06909] Updated weights for policy 0, policy_version 97863 (0.0036) [2024-06-27 23:16:43,850][06674] Fps is (10 sec: 39321.1, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1603403776. Throughput: 0: 44158.7. Samples: 1506302940. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 23:16:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:16:45,982][06909] Updated weights for policy 0, policy_version 97873 (0.0031) [2024-06-27 23:16:48,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.7, 300 sec: 44264.9). Total num frames: 1603665920. Throughput: 0: 44332.4. Samples: 1506569340. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 23:16:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:16:50,522][06909] Updated weights for policy 0, policy_version 97883 (0.0024) [2024-06-27 23:16:53,371][06909] Updated weights for policy 0, policy_version 97893 (0.0035) [2024-06-27 23:16:53,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 1603878912. Throughput: 0: 44077.8. Samples: 1506825340. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 23:16:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:16:57,805][06909] Updated weights for policy 0, policy_version 97903 (0.0033) [2024-06-27 23:16:58,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1604075520. Throughput: 0: 44140.4. Samples: 1506962740. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 23:16:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:17:00,866][06909] Updated weights for policy 0, policy_version 97913 (0.0035) [2024-06-27 23:17:03,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.6, 300 sec: 44097.9). Total num frames: 1604304896. Throughput: 0: 44133.2. Samples: 1507229580. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 23:17:03,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:17:03,854][06887] Signal inference workers to stop experience collection... (21600 times) [2024-06-27 23:17:03,855][06887] Signal inference workers to resume experience collection... (21600 times) [2024-06-27 23:17:03,866][06909] InferenceWorker_p0-w0: stopping experience collection (21600 times) [2024-06-27 23:17:03,866][06909] InferenceWorker_p0-w0: resuming experience collection (21600 times) [2024-06-27 23:17:05,228][06909] Updated weights for policy 0, policy_version 97923 (0.0049) [2024-06-27 23:17:08,405][06909] Updated weights for policy 0, policy_version 97933 (0.0032) [2024-06-27 23:17:08,852][06674] Fps is (10 sec: 47503.8, 60 sec: 44508.4, 300 sec: 44153.2). Total num frames: 1604550656. Throughput: 0: 43993.2. Samples: 1507487980. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 23:17:08,852][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 23:17:12,472][06909] Updated weights for policy 0, policy_version 97943 (0.0020) [2024-06-27 23:17:13,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1604730880. Throughput: 0: 44259.7. Samples: 1507628360. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-27 23:17:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:17:15,807][06909] Updated weights for policy 0, policy_version 97953 (0.0029) [2024-06-27 23:17:18,850][06674] Fps is (10 sec: 42607.1, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 1604976640. Throughput: 0: 43987.0. Samples: 1507888880. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-27 23:17:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:17:19,871][06909] Updated weights for policy 0, policy_version 97963 (0.0035) [2024-06-27 23:17:23,179][06909] Updated weights for policy 0, policy_version 97973 (0.0032) [2024-06-27 23:17:23,850][06674] Fps is (10 sec: 47513.1, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 1605206016. Throughput: 0: 44029.3. Samples: 1508149320. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-27 23:17:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:17:27,679][06909] Updated weights for policy 0, policy_version 97983 (0.0029) [2024-06-27 23:17:28,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43963.9, 300 sec: 43987.2). Total num frames: 1605386240. Throughput: 0: 43993.4. Samples: 1508282640. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-27 23:17:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:17:30,701][06909] Updated weights for policy 0, policy_version 97993 (0.0034) [2024-06-27 23:17:33,850][06674] Fps is (10 sec: 40960.9, 60 sec: 43417.6, 300 sec: 44042.4). Total num frames: 1605615616. Throughput: 0: 44042.8. Samples: 1508551260. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-27 23:17:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:17:35,054][06909] Updated weights for policy 0, policy_version 98003 (0.0031) [2024-06-27 23:17:38,045][06909] Updated weights for policy 0, policy_version 98013 (0.0025) [2024-06-27 23:17:38,850][06674] Fps is (10 sec: 47513.2, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 1605861376. Throughput: 0: 44072.4. Samples: 1508808600. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-27 23:17:38,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:17:42,242][06909] Updated weights for policy 0, policy_version 98023 (0.0034) [2024-06-27 23:17:43,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 1606041600. Throughput: 0: 44193.8. Samples: 1508951460. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-27 23:17:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:17:45,523][06909] Updated weights for policy 0, policy_version 98033 (0.0035) [2024-06-27 23:17:48,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43417.6, 300 sec: 44042.4). Total num frames: 1606270976. Throughput: 0: 43959.6. Samples: 1509207760. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-27 23:17:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:17:48,994][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000098040_1606287360.pth... [2024-06-27 23:17:49,031][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000097397_1595752448.pth [2024-06-27 23:17:49,609][06909] Updated weights for policy 0, policy_version 98043 (0.0028) [2024-06-27 23:17:53,322][06909] Updated weights for policy 0, policy_version 98053 (0.0033) [2024-06-27 23:17:53,850][06674] Fps is (10 sec: 47513.5, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 1606516736. Throughput: 0: 44086.4. Samples: 1509471780. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-27 23:17:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:17:57,176][06909] Updated weights for policy 0, policy_version 98063 (0.0030) [2024-06-27 23:17:58,850][06674] Fps is (10 sec: 45874.5, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 1606729728. Throughput: 0: 44119.4. Samples: 1509613740. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-27 23:17:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:18:00,627][06909] Updated weights for policy 0, policy_version 98073 (0.0024) [2024-06-27 23:18:03,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43690.6, 300 sec: 43986.8). Total num frames: 1606926336. Throughput: 0: 44113.2. Samples: 1509873980. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-27 23:18:03,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 23:18:04,550][06909] Updated weights for policy 0, policy_version 98083 (0.0021) [2024-06-27 23:18:08,017][06909] Updated weights for policy 0, policy_version 98093 (0.0027) [2024-06-27 23:18:08,850][06674] Fps is (10 sec: 45876.1, 60 sec: 43965.3, 300 sec: 44042.4). Total num frames: 1607188480. Throughput: 0: 44255.7. Samples: 1510140820. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-27 23:18:08,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:18:11,825][06909] Updated weights for policy 0, policy_version 98103 (0.0026) [2024-06-27 23:18:13,850][06674] Fps is (10 sec: 47514.6, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 1607401472. Throughput: 0: 44486.2. Samples: 1510284520. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-27 23:18:13,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:18:15,209][06909] Updated weights for policy 0, policy_version 98113 (0.0030) [2024-06-27 23:18:18,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 1607614464. Throughput: 0: 44344.7. Samples: 1510546780. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-27 23:18:18,851][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 23:18:19,297][06909] Updated weights for policy 0, policy_version 98123 (0.0026) [2024-06-27 23:18:22,352][06887] Signal inference workers to stop experience collection... (21650 times) [2024-06-27 23:18:22,352][06887] Signal inference workers to resume experience collection... (21650 times) [2024-06-27 23:18:22,369][06909] InferenceWorker_p0-w0: stopping experience collection (21650 times) [2024-06-27 23:18:22,369][06909] InferenceWorker_p0-w0: resuming experience collection (21650 times) [2024-06-27 23:18:22,494][06909] Updated weights for policy 0, policy_version 98133 (0.0023) [2024-06-27 23:18:23,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1607860224. Throughput: 0: 44523.5. Samples: 1510812160. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-27 23:18:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:18:26,434][06909] Updated weights for policy 0, policy_version 98143 (0.0035) [2024-06-27 23:18:28,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 1608056832. Throughput: 0: 44211.5. Samples: 1510940980. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-27 23:18:28,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:18:30,510][06909] Updated weights for policy 0, policy_version 98153 (0.0039) [2024-06-27 23:18:33,850][06674] Fps is (10 sec: 42599.0, 60 sec: 44509.8, 300 sec: 44098.0). Total num frames: 1608286208. Throughput: 0: 44384.5. Samples: 1511205060. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-27 23:18:33,850][06674] Avg episode reward: [(0, '0.395')] [2024-06-27 23:18:33,950][06909] Updated weights for policy 0, policy_version 98163 (0.0031) [2024-06-27 23:18:37,757][06909] Updated weights for policy 0, policy_version 98173 (0.0031) [2024-06-27 23:18:38,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1608515584. Throughput: 0: 44348.9. Samples: 1511467480. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-27 23:18:38,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:18:41,535][06909] Updated weights for policy 0, policy_version 98183 (0.0026) [2024-06-27 23:18:43,850][06674] Fps is (10 sec: 44236.1, 60 sec: 44782.9, 300 sec: 44042.4). Total num frames: 1608728576. Throughput: 0: 44310.7. Samples: 1511607720. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-27 23:18:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:18:44,882][06909] Updated weights for policy 0, policy_version 98193 (0.0024) [2024-06-27 23:18:48,850][06674] Fps is (10 sec: 42598.7, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 1608941568. Throughput: 0: 44442.4. Samples: 1511873880. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-27 23:18:48,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:18:49,164][06909] Updated weights for policy 0, policy_version 98203 (0.0039) [2024-06-27 23:18:52,149][06909] Updated weights for policy 0, policy_version 98213 (0.0030) [2024-06-27 23:18:53,852][06674] Fps is (10 sec: 45866.3, 60 sec: 44508.4, 300 sec: 44153.2). Total num frames: 1609187328. Throughput: 0: 44374.0. Samples: 1512137740. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-27 23:18:53,852][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:18:56,387][06909] Updated weights for policy 0, policy_version 98223 (0.0043) [2024-06-27 23:18:58,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44237.0, 300 sec: 44042.4). Total num frames: 1609383936. Throughput: 0: 44206.7. Samples: 1512273820. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-27 23:18:58,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 23:18:59,833][06909] Updated weights for policy 0, policy_version 98233 (0.0029) [2024-06-27 23:19:03,585][06909] Updated weights for policy 0, policy_version 98243 (0.0026) [2024-06-27 23:19:03,850][06674] Fps is (10 sec: 42606.7, 60 sec: 44783.0, 300 sec: 44097.9). Total num frames: 1609613312. Throughput: 0: 44171.1. Samples: 1512534480. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-27 23:19:03,851][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 23:19:07,558][06909] Updated weights for policy 0, policy_version 98253 (0.0037) [2024-06-27 23:19:08,852][06674] Fps is (10 sec: 45865.5, 60 sec: 44235.3, 300 sec: 44153.2). Total num frames: 1609842688. Throughput: 0: 44062.0. Samples: 1512795040. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-27 23:19:08,852][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:19:11,406][06909] Updated weights for policy 0, policy_version 98263 (0.0035) [2024-06-27 23:19:13,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1610039296. Throughput: 0: 44135.6. Samples: 1512927080. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-27 23:19:13,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:19:14,755][06909] Updated weights for policy 0, policy_version 98273 (0.0027) [2024-06-27 23:19:18,591][06909] Updated weights for policy 0, policy_version 98283 (0.0035) [2024-06-27 23:19:18,850][06674] Fps is (10 sec: 42607.2, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1610268672. Throughput: 0: 44241.3. Samples: 1513195920. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 23:19:18,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:19:22,013][06909] Updated weights for policy 0, policy_version 98293 (0.0036) [2024-06-27 23:19:23,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 1610498048. Throughput: 0: 44182.8. Samples: 1513455700. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 23:19:23,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 23:19:26,232][06909] Updated weights for policy 0, policy_version 98303 (0.0032) [2024-06-27 23:19:28,850][06674] Fps is (10 sec: 47513.4, 60 sec: 44783.0, 300 sec: 44209.0). Total num frames: 1610743808. Throughput: 0: 44152.5. Samples: 1513594580. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 23:19:28,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:19:29,166][06909] Updated weights for policy 0, policy_version 98313 (0.0045) [2024-06-27 23:19:33,560][06909] Updated weights for policy 0, policy_version 98323 (0.0029) [2024-06-27 23:19:33,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1610940416. Throughput: 0: 44047.1. Samples: 1513856000. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 23:19:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:19:36,878][06909] Updated weights for policy 0, policy_version 98333 (0.0026) [2024-06-27 23:19:38,852][06674] Fps is (10 sec: 40951.7, 60 sec: 43962.2, 300 sec: 44097.6). Total num frames: 1611153408. Throughput: 0: 44036.0. Samples: 1514119360. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 23:19:38,853][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:19:40,752][06909] Updated weights for policy 0, policy_version 98343 (0.0036) [2024-06-27 23:19:43,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 1611382784. Throughput: 0: 44212.0. Samples: 1514263360. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 23:19:43,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 23:19:44,458][06909] Updated weights for policy 0, policy_version 98353 (0.0030) [2024-06-27 23:19:47,954][06909] Updated weights for policy 0, policy_version 98363 (0.0041) [2024-06-27 23:19:48,850][06674] Fps is (10 sec: 44246.2, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1611595776. Throughput: 0: 44074.4. Samples: 1514517820. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 23:19:48,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 23:19:48,867][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000098365_1611612160.pth... [2024-06-27 23:19:48,922][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000097720_1601044480.pth [2024-06-27 23:19:49,914][06887] Signal inference workers to stop experience collection... (21700 times) [2024-06-27 23:19:49,914][06887] Signal inference workers to resume experience collection... (21700 times) [2024-06-27 23:19:49,928][06909] InferenceWorker_p0-w0: stopping experience collection (21700 times) [2024-06-27 23:19:49,928][06909] InferenceWorker_p0-w0: resuming experience collection (21700 times) [2024-06-27 23:19:51,837][06909] Updated weights for policy 0, policy_version 98373 (0.0029) [2024-06-27 23:19:53,852][06674] Fps is (10 sec: 42589.4, 60 sec: 43690.6, 300 sec: 44153.2). Total num frames: 1611808768. Throughput: 0: 44181.8. Samples: 1514783220. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 23:19:53,852][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:19:55,730][06909] Updated weights for policy 0, policy_version 98383 (0.0032) [2024-06-27 23:19:58,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 1612054528. Throughput: 0: 44210.6. Samples: 1514916560. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 23:19:58,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 23:19:59,065][06909] Updated weights for policy 0, policy_version 98393 (0.0037) [2024-06-27 23:20:03,028][06909] Updated weights for policy 0, policy_version 98403 (0.0029) [2024-06-27 23:20:03,850][06674] Fps is (10 sec: 45884.5, 60 sec: 44236.8, 300 sec: 44209.3). Total num frames: 1612267520. Throughput: 0: 44228.8. Samples: 1515186220. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 23:20:03,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:20:06,338][06909] Updated weights for policy 0, policy_version 98413 (0.0037) [2024-06-27 23:20:08,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43965.2, 300 sec: 44153.5). Total num frames: 1612480512. Throughput: 0: 44379.5. Samples: 1515452780. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 23:20:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:20:10,622][06909] Updated weights for policy 0, policy_version 98423 (0.0044) [2024-06-27 23:20:13,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 1612709888. Throughput: 0: 44117.0. Samples: 1515579840. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-27 23:20:13,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:20:14,246][06909] Updated weights for policy 0, policy_version 98433 (0.0025) [2024-06-27 23:20:17,794][06909] Updated weights for policy 0, policy_version 98443 (0.0034) [2024-06-27 23:20:18,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1612922880. Throughput: 0: 44220.0. Samples: 1515845900. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-27 23:20:18,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:20:21,864][06909] Updated weights for policy 0, policy_version 98453 (0.0034) [2024-06-27 23:20:23,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 1613152256. Throughput: 0: 44248.7. Samples: 1516110460. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-27 23:20:23,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:20:25,217][06909] Updated weights for policy 0, policy_version 98463 (0.0031) [2024-06-27 23:20:28,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43690.6, 300 sec: 44153.5). Total num frames: 1613365248. Throughput: 0: 43845.6. Samples: 1516236420. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-27 23:20:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:20:29,233][06909] Updated weights for policy 0, policy_version 98473 (0.0037) [2024-06-27 23:20:32,700][06909] Updated weights for policy 0, policy_version 98483 (0.0036) [2024-06-27 23:20:33,850][06674] Fps is (10 sec: 42597.7, 60 sec: 43963.6, 300 sec: 44153.5). Total num frames: 1613578240. Throughput: 0: 44035.8. Samples: 1516499440. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-27 23:20:33,851][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 23:20:36,500][06909] Updated weights for policy 0, policy_version 98493 (0.0050) [2024-06-27 23:20:38,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44238.2, 300 sec: 44153.5). Total num frames: 1613807616. Throughput: 0: 44153.4. Samples: 1516770040. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-27 23:20:38,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 23:20:40,436][06909] Updated weights for policy 0, policy_version 98503 (0.0030) [2024-06-27 23:20:43,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 1614020608. Throughput: 0: 44067.5. Samples: 1516899600. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-27 23:20:43,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:20:44,037][06909] Updated weights for policy 0, policy_version 98513 (0.0030) [2024-06-27 23:20:47,614][06909] Updated weights for policy 0, policy_version 98523 (0.0028) [2024-06-27 23:20:48,852][06674] Fps is (10 sec: 44227.8, 60 sec: 44235.2, 300 sec: 44153.2). Total num frames: 1614249984. Throughput: 0: 44019.2. Samples: 1517167180. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-27 23:20:48,853][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:20:51,408][06909] Updated weights for policy 0, policy_version 98533 (0.0027) [2024-06-27 23:20:53,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44238.3, 300 sec: 44153.5). Total num frames: 1614462976. Throughput: 0: 44048.0. Samples: 1517434940. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-27 23:20:53,852][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:20:54,876][06909] Updated weights for policy 0, policy_version 98543 (0.0029) [2024-06-27 23:20:58,848][06909] Updated weights for policy 0, policy_version 98553 (0.0030) [2024-06-27 23:20:58,850][06674] Fps is (10 sec: 44246.3, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 1614692352. Throughput: 0: 44080.8. Samples: 1517563480. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-27 23:20:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:21:02,095][06909] Updated weights for policy 0, policy_version 98563 (0.0050) [2024-06-27 23:21:03,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 1614888960. Throughput: 0: 44003.5. Samples: 1517826060. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-27 23:21:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:21:05,947][06909] Updated weights for policy 0, policy_version 98573 (0.0037) [2024-06-27 23:21:08,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 1615134720. Throughput: 0: 44092.8. Samples: 1518094640. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-27 23:21:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:21:09,706][06909] Updated weights for policy 0, policy_version 98583 (0.0028) [2024-06-27 23:21:13,405][06909] Updated weights for policy 0, policy_version 98593 (0.0026) [2024-06-27 23:21:13,850][06674] Fps is (10 sec: 47513.3, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 1615364096. Throughput: 0: 44257.4. Samples: 1518228000. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-27 23:21:13,850][06674] Avg episode reward: [(0, '0.399')] [2024-06-27 23:21:17,507][06909] Updated weights for policy 0, policy_version 98603 (0.0030) [2024-06-27 23:21:18,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.7, 300 sec: 44098.0). Total num frames: 1615577088. Throughput: 0: 44389.0. Samples: 1518496940. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 23:21:18,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 23:21:20,689][06909] Updated weights for policy 0, policy_version 98613 (0.0029) [2024-06-27 23:21:23,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.7, 300 sec: 44209.1). Total num frames: 1615790080. Throughput: 0: 44240.6. Samples: 1518760860. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 23:21:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:21:24,637][06909] Updated weights for policy 0, policy_version 98623 (0.0025) [2024-06-27 23:21:28,283][06909] Updated weights for policy 0, policy_version 98633 (0.0024) [2024-06-27 23:21:28,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 1616035840. Throughput: 0: 44439.1. Samples: 1518899360. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 23:21:28,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:21:31,861][06909] Updated weights for policy 0, policy_version 98643 (0.0031) [2024-06-27 23:21:33,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 1616232448. Throughput: 0: 44359.1. Samples: 1519163240. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 23:21:33,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 23:21:35,449][06909] Updated weights for policy 0, policy_version 98653 (0.0041) [2024-06-27 23:21:38,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44510.0, 300 sec: 44320.1). Total num frames: 1616478208. Throughput: 0: 44211.2. Samples: 1519424440. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 23:21:38,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 23:21:39,100][06909] Updated weights for policy 0, policy_version 98663 (0.0035) [2024-06-27 23:21:43,159][06909] Updated weights for policy 0, policy_version 98673 (0.0035) [2024-06-27 23:21:43,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 1616691200. Throughput: 0: 44333.4. Samples: 1519558480. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 23:21:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:21:46,221][06887] Signal inference workers to stop experience collection... (21750 times) [2024-06-27 23:21:46,221][06887] Signal inference workers to resume experience collection... (21750 times) [2024-06-27 23:21:46,235][06909] InferenceWorker_p0-w0: stopping experience collection (21750 times) [2024-06-27 23:21:46,235][06909] InferenceWorker_p0-w0: resuming experience collection (21750 times) [2024-06-27 23:21:46,778][06909] Updated weights for policy 0, policy_version 98683 (0.0025) [2024-06-27 23:21:48,850][06674] Fps is (10 sec: 42598.0, 60 sec: 44238.3, 300 sec: 44153.5). Total num frames: 1616904192. Throughput: 0: 44341.2. Samples: 1519821420. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 23:21:48,852][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:21:48,865][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000098688_1616904192.pth... [2024-06-27 23:21:48,920][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000098040_1606287360.pth [2024-06-27 23:21:50,425][06909] Updated weights for policy 0, policy_version 98693 (0.0032) [2024-06-27 23:21:53,853][06674] Fps is (10 sec: 42584.9, 60 sec: 44234.5, 300 sec: 44208.6). Total num frames: 1617117184. Throughput: 0: 44187.6. Samples: 1520083220. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 23:21:53,854][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 23:21:54,563][06909] Updated weights for policy 0, policy_version 98703 (0.0024) [2024-06-27 23:21:57,814][06909] Updated weights for policy 0, policy_version 98713 (0.0030) [2024-06-27 23:21:58,852][06674] Fps is (10 sec: 44228.1, 60 sec: 44235.3, 300 sec: 44208.7). Total num frames: 1617346560. Throughput: 0: 44290.9. Samples: 1520221180. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 23:21:58,852][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:22:01,707][06909] Updated weights for policy 0, policy_version 98723 (0.0035) [2024-06-27 23:22:03,850][06674] Fps is (10 sec: 44250.3, 60 sec: 44509.8, 300 sec: 44098.2). Total num frames: 1617559552. Throughput: 0: 44101.7. Samples: 1520481520. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 23:22:03,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:22:05,284][06909] Updated weights for policy 0, policy_version 98733 (0.0031) [2024-06-27 23:22:08,850][06674] Fps is (10 sec: 44245.3, 60 sec: 44236.7, 300 sec: 44264.6). Total num frames: 1617788928. Throughput: 0: 44222.5. Samples: 1520750880. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 23:22:08,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 23:22:08,917][06909] Updated weights for policy 0, policy_version 98743 (0.0030) [2024-06-27 23:22:12,508][06909] Updated weights for policy 0, policy_version 98753 (0.0025) [2024-06-27 23:22:13,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 1618001920. Throughput: 0: 44159.6. Samples: 1520886540. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 23:22:13,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 23:22:16,172][06909] Updated weights for policy 0, policy_version 98763 (0.0026) [2024-06-27 23:22:18,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 1618214912. Throughput: 0: 44093.6. Samples: 1521147460. Policy #0 lag: (min: 1.0, avg: 10.2, max: 21.0) [2024-06-27 23:22:18,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 23:22:20,039][06909] Updated weights for policy 0, policy_version 98773 (0.0026) [2024-06-27 23:22:23,752][06909] Updated weights for policy 0, policy_version 98783 (0.0030) [2024-06-27 23:22:23,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44509.8, 300 sec: 44320.1). Total num frames: 1618460672. Throughput: 0: 44223.9. Samples: 1521414520. Policy #0 lag: (min: 1.0, avg: 10.2, max: 21.0) [2024-06-27 23:22:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:22:27,596][06909] Updated weights for policy 0, policy_version 98793 (0.0033) [2024-06-27 23:22:28,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43690.7, 300 sec: 44209.0). Total num frames: 1618657280. Throughput: 0: 44220.0. Samples: 1521548380. Policy #0 lag: (min: 1.0, avg: 10.2, max: 21.0) [2024-06-27 23:22:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:22:31,155][06909] Updated weights for policy 0, policy_version 98803 (0.0040) [2024-06-27 23:22:33,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 1618870272. Throughput: 0: 44146.8. Samples: 1521808020. Policy #0 lag: (min: 1.0, avg: 10.2, max: 21.0) [2024-06-27 23:22:33,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 23:22:34,865][06909] Updated weights for policy 0, policy_version 98813 (0.0035) [2024-06-27 23:22:38,750][06909] Updated weights for policy 0, policy_version 98823 (0.0031) [2024-06-27 23:22:38,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.7, 300 sec: 44320.1). Total num frames: 1619116032. Throughput: 0: 44326.7. Samples: 1522077780. Policy #0 lag: (min: 1.0, avg: 10.2, max: 21.0) [2024-06-27 23:22:38,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:22:42,414][06909] Updated weights for policy 0, policy_version 98833 (0.0039) [2024-06-27 23:22:43,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.7, 300 sec: 44209.0). Total num frames: 1619312640. Throughput: 0: 44190.9. Samples: 1522209680. Policy #0 lag: (min: 1.0, avg: 10.2, max: 21.0) [2024-06-27 23:22:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:22:46,117][06909] Updated weights for policy 0, policy_version 98843 (0.0023) [2024-06-27 23:22:48,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43690.8, 300 sec: 44098.0). Total num frames: 1619525632. Throughput: 0: 44283.7. Samples: 1522474280. Policy #0 lag: (min: 1.0, avg: 10.2, max: 21.0) [2024-06-27 23:22:48,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:22:49,621][06909] Updated weights for policy 0, policy_version 98853 (0.0041) [2024-06-27 23:22:53,604][06909] Updated weights for policy 0, policy_version 98863 (0.0033) [2024-06-27 23:22:53,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44239.2, 300 sec: 44209.1). Total num frames: 1619771392. Throughput: 0: 44108.2. Samples: 1522735740. Policy #0 lag: (min: 1.0, avg: 10.2, max: 21.0) [2024-06-27 23:22:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:22:57,147][06909] Updated weights for policy 0, policy_version 98873 (0.0036) [2024-06-27 23:22:58,850][06674] Fps is (10 sec: 47513.1, 60 sec: 44238.3, 300 sec: 44320.1). Total num frames: 1620000768. Throughput: 0: 44165.3. Samples: 1522873980. Policy #0 lag: (min: 1.0, avg: 10.2, max: 21.0) [2024-06-27 23:22:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:23:00,953][06909] Updated weights for policy 0, policy_version 98883 (0.0035) [2024-06-27 23:23:03,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 1620197376. Throughput: 0: 44112.6. Samples: 1523132520. Policy #0 lag: (min: 1.0, avg: 10.2, max: 21.0) [2024-06-27 23:23:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:23:04,724][06909] Updated weights for policy 0, policy_version 98893 (0.0036) [2024-06-27 23:23:08,460][06909] Updated weights for policy 0, policy_version 98903 (0.0039) [2024-06-27 23:23:08,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 1620426752. Throughput: 0: 44175.1. Samples: 1523402400. Policy #0 lag: (min: 1.0, avg: 10.2, max: 21.0) [2024-06-27 23:23:08,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:23:11,972][06909] Updated weights for policy 0, policy_version 98913 (0.0036) [2024-06-27 23:23:13,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 1620656128. Throughput: 0: 44099.6. Samples: 1523532860. Policy #0 lag: (min: 1.0, avg: 10.2, max: 21.0) [2024-06-27 23:23:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:23:16,020][06909] Updated weights for policy 0, policy_version 98923 (0.0040) [2024-06-27 23:23:18,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 1620869120. Throughput: 0: 44211.6. Samples: 1523797540. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2024-06-27 23:23:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:23:19,255][06909] Updated weights for policy 0, policy_version 98933 (0.0022) [2024-06-27 23:23:21,030][06887] Signal inference workers to stop experience collection... (21800 times) [2024-06-27 23:23:21,034][06887] Signal inference workers to resume experience collection... (21800 times) [2024-06-27 23:23:21,052][06909] InferenceWorker_p0-w0: stopping experience collection (21800 times) [2024-06-27 23:23:21,052][06909] InferenceWorker_p0-w0: resuming experience collection (21800 times) [2024-06-27 23:23:23,322][06909] Updated weights for policy 0, policy_version 98943 (0.0032) [2024-06-27 23:23:23,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.9, 300 sec: 44264.6). Total num frames: 1621114880. Throughput: 0: 44104.9. Samples: 1524062500. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2024-06-27 23:23:23,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:23:26,533][06909] Updated weights for policy 0, policy_version 98953 (0.0037) [2024-06-27 23:23:28,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1621311488. Throughput: 0: 44082.2. Samples: 1524193380. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2024-06-27 23:23:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:23:30,593][06909] Updated weights for policy 0, policy_version 98963 (0.0026) [2024-06-27 23:23:33,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44782.9, 300 sec: 44209.0). Total num frames: 1621557248. Throughput: 0: 44260.8. Samples: 1524466020. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2024-06-27 23:23:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:23:33,937][06909] Updated weights for policy 0, policy_version 98973 (0.0037) [2024-06-27 23:23:37,678][06909] Updated weights for policy 0, policy_version 98983 (0.0034) [2024-06-27 23:23:38,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 1621770240. Throughput: 0: 44360.8. Samples: 1524731980. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2024-06-27 23:23:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:23:41,658][06909] Updated weights for policy 0, policy_version 98993 (0.0029) [2024-06-27 23:23:43,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44509.8, 300 sec: 44209.0). Total num frames: 1621983232. Throughput: 0: 44100.9. Samples: 1524858520. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2024-06-27 23:23:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:23:45,621][06909] Updated weights for policy 0, policy_version 99003 (0.0025) [2024-06-27 23:23:48,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44782.8, 300 sec: 44153.8). Total num frames: 1622212608. Throughput: 0: 44477.2. Samples: 1525134000. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2024-06-27 23:23:48,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 23:23:48,902][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000099013_1622228992.pth... [2024-06-27 23:23:48,909][06909] Updated weights for policy 0, policy_version 99013 (0.0042) [2024-06-27 23:23:48,964][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000098365_1611612160.pth [2024-06-27 23:23:52,937][06909] Updated weights for policy 0, policy_version 99023 (0.0035) [2024-06-27 23:23:53,850][06674] Fps is (10 sec: 44236.1, 60 sec: 44236.7, 300 sec: 44209.0). Total num frames: 1622425600. Throughput: 0: 44227.5. Samples: 1525392640. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2024-06-27 23:23:53,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:23:56,207][06909] Updated weights for policy 0, policy_version 99033 (0.0024) [2024-06-27 23:23:58,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 1622638592. Throughput: 0: 44161.4. Samples: 1525520120. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2024-06-27 23:23:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:24:00,365][06909] Updated weights for policy 0, policy_version 99043 (0.0030) [2024-06-27 23:24:03,425][06909] Updated weights for policy 0, policy_version 99053 (0.0033) [2024-06-27 23:24:03,850][06674] Fps is (10 sec: 45876.0, 60 sec: 44782.9, 300 sec: 44209.3). Total num frames: 1622884352. Throughput: 0: 44449.3. Samples: 1525797760. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2024-06-27 23:24:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:24:07,586][06909] Updated weights for policy 0, policy_version 99063 (0.0028) [2024-06-27 23:24:08,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44783.0, 300 sec: 44320.1). Total num frames: 1623113728. Throughput: 0: 44405.8. Samples: 1526060760. Policy #0 lag: (min: 0.0, avg: 9.4, max: 19.0) [2024-06-27 23:24:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:24:10,775][06909] Updated weights for policy 0, policy_version 99073 (0.0030) [2024-06-27 23:24:13,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 1623310336. Throughput: 0: 44393.4. Samples: 1526191080. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-27 23:24:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:24:14,879][06909] Updated weights for policy 0, policy_version 99083 (0.0035) [2024-06-27 23:24:18,612][06909] Updated weights for policy 0, policy_version 99093 (0.0027) [2024-06-27 23:24:18,850][06674] Fps is (10 sec: 42598.0, 60 sec: 44509.8, 300 sec: 44209.0). Total num frames: 1623539712. Throughput: 0: 44262.6. Samples: 1526457840. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-27 23:24:18,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 23:24:22,572][06909] Updated weights for policy 0, policy_version 99103 (0.0027) [2024-06-27 23:24:23,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 1623752704. Throughput: 0: 44168.1. Samples: 1526719540. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-27 23:24:23,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:24:26,039][06909] Updated weights for policy 0, policy_version 99113 (0.0033) [2024-06-27 23:24:28,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44782.9, 300 sec: 44264.6). Total num frames: 1623998464. Throughput: 0: 44401.8. Samples: 1526856600. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-27 23:24:28,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:24:29,672][06909] Updated weights for policy 0, policy_version 99123 (0.0024) [2024-06-27 23:24:31,304][06887] Signal inference workers to stop experience collection... (21850 times) [2024-06-27 23:24:31,331][06909] InferenceWorker_p0-w0: stopping experience collection (21850 times) [2024-06-27 23:24:31,365][06887] Signal inference workers to resume experience collection... (21850 times) [2024-06-27 23:24:31,365][06909] InferenceWorker_p0-w0: resuming experience collection (21850 times) [2024-06-27 23:24:33,220][06909] Updated weights for policy 0, policy_version 99133 (0.0033) [2024-06-27 23:24:33,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.8, 300 sec: 44264.9). Total num frames: 1624211456. Throughput: 0: 44114.3. Samples: 1527119140. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-27 23:24:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:24:37,433][06909] Updated weights for policy 0, policy_version 99143 (0.0025) [2024-06-27 23:24:38,850][06674] Fps is (10 sec: 42598.2, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 1624424448. Throughput: 0: 44356.5. Samples: 1527388680. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-27 23:24:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:24:40,721][06909] Updated weights for policy 0, policy_version 99153 (0.0036) [2024-06-27 23:24:43,850][06674] Fps is (10 sec: 42598.0, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 1624637440. Throughput: 0: 44437.7. Samples: 1527519820. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-27 23:24:43,853][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:24:44,626][06909] Updated weights for policy 0, policy_version 99163 (0.0043) [2024-06-27 23:24:48,038][06909] Updated weights for policy 0, policy_version 99173 (0.0030) [2024-06-27 23:24:48,850][06674] Fps is (10 sec: 45875.7, 60 sec: 44509.9, 300 sec: 44320.4). Total num frames: 1624883200. Throughput: 0: 44205.8. Samples: 1527787020. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-27 23:24:48,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 23:24:52,115][06909] Updated weights for policy 0, policy_version 99183 (0.0035) [2024-06-27 23:24:53,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 1625079808. Throughput: 0: 44256.9. Samples: 1528052320. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-27 23:24:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:24:55,934][06909] Updated weights for policy 0, policy_version 99193 (0.0036) [2024-06-27 23:24:58,852][06674] Fps is (10 sec: 42589.4, 60 sec: 44508.3, 300 sec: 44208.7). Total num frames: 1625309184. Throughput: 0: 44149.9. Samples: 1528177920. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-27 23:24:58,853][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:24:59,723][06909] Updated weights for policy 0, policy_version 99203 (0.0033) [2024-06-27 23:25:03,326][06909] Updated weights for policy 0, policy_version 99213 (0.0022) [2024-06-27 23:25:03,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44236.8, 300 sec: 44264.6). Total num frames: 1625538560. Throughput: 0: 44069.4. Samples: 1528440960. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-27 23:25:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:25:07,270][06909] Updated weights for policy 0, policy_version 99223 (0.0032) [2024-06-27 23:25:08,850][06674] Fps is (10 sec: 42607.0, 60 sec: 43690.6, 300 sec: 44153.5). Total num frames: 1625735168. Throughput: 0: 44126.6. Samples: 1528705240. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-27 23:25:08,853][06674] Avg episode reward: [(0, '0.425')] [2024-06-27 23:25:10,583][06909] Updated weights for policy 0, policy_version 99233 (0.0030) [2024-06-27 23:25:13,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43963.6, 300 sec: 44153.5). Total num frames: 1625948160. Throughput: 0: 43883.0. Samples: 1528831340. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 23:25:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:25:14,806][06909] Updated weights for policy 0, policy_version 99243 (0.0041) [2024-06-27 23:25:17,926][06909] Updated weights for policy 0, policy_version 99253 (0.0026) [2024-06-27 23:25:18,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 1626193920. Throughput: 0: 43890.6. Samples: 1529094220. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 23:25:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:25:22,169][06909] Updated weights for policy 0, policy_version 99263 (0.0041) [2024-06-27 23:25:23,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.6, 300 sec: 44098.0). Total num frames: 1626374144. Throughput: 0: 43818.6. Samples: 1529360520. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 23:25:23,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:25:25,214][06909] Updated weights for policy 0, policy_version 99273 (0.0030) [2024-06-27 23:25:28,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43417.6, 300 sec: 44153.5). Total num frames: 1626603520. Throughput: 0: 43653.8. Samples: 1529484240. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 23:25:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:25:29,753][06909] Updated weights for policy 0, policy_version 99283 (0.0025) [2024-06-27 23:25:33,145][06909] Updated weights for policy 0, policy_version 99293 (0.0036) [2024-06-27 23:25:33,850][06674] Fps is (10 sec: 47514.2, 60 sec: 43963.7, 300 sec: 44209.1). Total num frames: 1626849280. Throughput: 0: 43687.1. Samples: 1529752940. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 23:25:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:25:37,254][06909] Updated weights for policy 0, policy_version 99303 (0.0032) [2024-06-27 23:25:38,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43417.6, 300 sec: 44098.0). Total num frames: 1627029504. Throughput: 0: 43798.6. Samples: 1530023260. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 23:25:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:25:40,391][06909] Updated weights for policy 0, policy_version 99313 (0.0038) [2024-06-27 23:25:43,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.8, 300 sec: 44153.8). Total num frames: 1627275264. Throughput: 0: 43827.8. Samples: 1530150080. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 23:25:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:25:44,644][06909] Updated weights for policy 0, policy_version 99323 (0.0030) [2024-06-27 23:25:47,565][06909] Updated weights for policy 0, policy_version 99333 (0.0025) [2024-06-27 23:25:48,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43417.6, 300 sec: 44153.5). Total num frames: 1627488256. Throughput: 0: 43893.8. Samples: 1530416180. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 23:25:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:25:48,884][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000099335_1627504640.pth... [2024-06-27 23:25:48,939][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000098688_1616904192.pth [2024-06-27 23:25:51,837][06909] Updated weights for policy 0, policy_version 99343 (0.0042) [2024-06-27 23:25:53,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 1627701248. Throughput: 0: 44022.3. Samples: 1530686240. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 23:25:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:25:55,199][06909] Updated weights for policy 0, policy_version 99353 (0.0026) [2024-06-27 23:25:58,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43965.3, 300 sec: 44264.6). Total num frames: 1627947008. Throughput: 0: 44060.6. Samples: 1530814060. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 23:25:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:25:59,076][06909] Updated weights for policy 0, policy_version 99363 (0.0025) [2024-06-27 23:26:02,659][06909] Updated weights for policy 0, policy_version 99373 (0.0028) [2024-06-27 23:26:03,852][06674] Fps is (10 sec: 45865.8, 60 sec: 43689.2, 300 sec: 44153.2). Total num frames: 1628160000. Throughput: 0: 44107.4. Samples: 1531079140. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 23:26:03,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:26:06,676][06909] Updated weights for policy 0, policy_version 99383 (0.0032) [2024-06-27 23:26:08,798][06887] Signal inference workers to stop experience collection... (21900 times) [2024-06-27 23:26:08,836][06909] InferenceWorker_p0-w0: stopping experience collection (21900 times) [2024-06-27 23:26:08,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 1628372992. Throughput: 0: 44193.8. Samples: 1531349240. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 23:26:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:26:08,853][06887] Signal inference workers to resume experience collection... (21900 times) [2024-06-27 23:26:08,854][06909] InferenceWorker_p0-w0: resuming experience collection (21900 times) [2024-06-27 23:26:10,354][06909] Updated weights for policy 0, policy_version 99393 (0.0031) [2024-06-27 23:26:13,850][06674] Fps is (10 sec: 44245.7, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1628602368. Throughput: 0: 44274.2. Samples: 1531476580. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-27 23:26:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:26:14,005][06909] Updated weights for policy 0, policy_version 99403 (0.0032) [2024-06-27 23:26:17,501][06909] Updated weights for policy 0, policy_version 99413 (0.0031) [2024-06-27 23:26:18,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43690.8, 300 sec: 44153.5). Total num frames: 1628815360. Throughput: 0: 44235.2. Samples: 1531743520. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-27 23:26:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:26:21,323][06909] Updated weights for policy 0, policy_version 99423 (0.0030) [2024-06-27 23:26:23,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44783.0, 300 sec: 44153.5). Total num frames: 1629061120. Throughput: 0: 44247.7. Samples: 1532014400. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-27 23:26:23,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:26:24,709][06909] Updated weights for policy 0, policy_version 99433 (0.0026) [2024-06-27 23:26:28,579][06909] Updated weights for policy 0, policy_version 99443 (0.0021) [2024-06-27 23:26:28,850][06674] Fps is (10 sec: 45874.5, 60 sec: 44509.8, 300 sec: 44209.0). Total num frames: 1629274112. Throughput: 0: 44404.9. Samples: 1532148300. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-27 23:26:28,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 23:26:32,120][06909] Updated weights for policy 0, policy_version 99453 (0.0025) [2024-06-27 23:26:33,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1629470720. Throughput: 0: 44234.7. Samples: 1532406740. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-27 23:26:33,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 23:26:36,191][06909] Updated weights for policy 0, policy_version 99463 (0.0027) [2024-06-27 23:26:38,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44783.0, 300 sec: 44153.5). Total num frames: 1629716480. Throughput: 0: 44202.7. Samples: 1532675360. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-27 23:26:38,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:26:39,526][06909] Updated weights for policy 0, policy_version 99473 (0.0034) [2024-06-27 23:26:43,345][06909] Updated weights for policy 0, policy_version 99483 (0.0038) [2024-06-27 23:26:43,850][06674] Fps is (10 sec: 45874.4, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 1629929472. Throughput: 0: 44320.3. Samples: 1532808480. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-27 23:26:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:26:47,339][06909] Updated weights for policy 0, policy_version 99493 (0.0044) [2024-06-27 23:26:48,850][06674] Fps is (10 sec: 42598.1, 60 sec: 44236.8, 300 sec: 44154.0). Total num frames: 1630142464. Throughput: 0: 44202.0. Samples: 1533068140. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-27 23:26:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:26:51,039][06909] Updated weights for policy 0, policy_version 99503 (0.0032) [2024-06-27 23:26:53,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44782.9, 300 sec: 44209.3). Total num frames: 1630388224. Throughput: 0: 44188.9. Samples: 1533337740. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-27 23:26:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:26:54,552][06909] Updated weights for policy 0, policy_version 99513 (0.0030) [2024-06-27 23:26:58,262][06909] Updated weights for policy 0, policy_version 99523 (0.0029) [2024-06-27 23:26:58,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.6, 300 sec: 44153.5). Total num frames: 1630584832. Throughput: 0: 44313.7. Samples: 1533470700. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-27 23:26:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:27:01,795][06909] Updated weights for policy 0, policy_version 99533 (0.0036) [2024-06-27 23:27:03,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43965.2, 300 sec: 44098.0). Total num frames: 1630797824. Throughput: 0: 44191.0. Samples: 1533732120. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-27 23:27:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:27:05,875][06909] Updated weights for policy 0, policy_version 99543 (0.0039) [2024-06-27 23:27:08,850][06674] Fps is (10 sec: 47513.7, 60 sec: 44782.9, 300 sec: 44264.6). Total num frames: 1631059968. Throughput: 0: 44087.9. Samples: 1533998360. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-27 23:27:08,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:27:09,115][06909] Updated weights for policy 0, policy_version 99553 (0.0028) [2024-06-27 23:27:13,326][06909] Updated weights for policy 0, policy_version 99563 (0.0039) [2024-06-27 23:27:13,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 1631256576. Throughput: 0: 44049.4. Samples: 1534130520. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-27 23:27:13,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:27:16,670][06909] Updated weights for policy 0, policy_version 99573 (0.0030) [2024-06-27 23:27:18,850][06674] Fps is (10 sec: 40960.5, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1631469568. Throughput: 0: 44123.1. Samples: 1534392280. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-27 23:27:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:27:20,646][06909] Updated weights for policy 0, policy_version 99583 (0.0020) [2024-06-27 23:27:23,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.6, 300 sec: 44209.0). Total num frames: 1631698944. Throughput: 0: 43998.1. Samples: 1534655280. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-27 23:27:23,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:27:24,411][06909] Updated weights for policy 0, policy_version 99593 (0.0043) [2024-06-27 23:27:28,161][06909] Updated weights for policy 0, policy_version 99603 (0.0030) [2024-06-27 23:27:28,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 1631911936. Throughput: 0: 44016.5. Samples: 1534789220. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-27 23:27:28,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:27:30,651][06887] Signal inference workers to stop experience collection... (21950 times) [2024-06-27 23:27:30,695][06909] InferenceWorker_p0-w0: stopping experience collection (21950 times) [2024-06-27 23:27:30,710][06887] Signal inference workers to resume experience collection... (21950 times) [2024-06-27 23:27:30,712][06909] InferenceWorker_p0-w0: resuming experience collection (21950 times) [2024-06-27 23:27:31,695][06909] Updated weights for policy 0, policy_version 99613 (0.0034) [2024-06-27 23:27:33,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 1632141312. Throughput: 0: 44100.0. Samples: 1535052640. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-27 23:27:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:27:35,925][06909] Updated weights for policy 0, policy_version 99623 (0.0025) [2024-06-27 23:27:38,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44236.8, 300 sec: 44264.6). Total num frames: 1632370688. Throughput: 0: 44073.8. Samples: 1535321060. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-27 23:27:38,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:27:38,901][06909] Updated weights for policy 0, policy_version 99633 (0.0022) [2024-06-27 23:27:43,246][06909] Updated weights for policy 0, policy_version 99643 (0.0030) [2024-06-27 23:27:43,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.8, 300 sec: 44209.0). Total num frames: 1632567296. Throughput: 0: 44081.8. Samples: 1535454380. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-27 23:27:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:27:46,294][06909] Updated weights for policy 0, policy_version 99653 (0.0042) [2024-06-27 23:27:48,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1632796672. Throughput: 0: 44157.4. Samples: 1535719200. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-27 23:27:48,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 23:27:48,862][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000099658_1632796672.pth... [2024-06-27 23:27:48,918][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000099013_1622228992.pth [2024-06-27 23:27:50,581][06909] Updated weights for policy 0, policy_version 99663 (0.0042) [2024-06-27 23:27:53,779][06909] Updated weights for policy 0, policy_version 99673 (0.0026) [2024-06-27 23:27:53,852][06674] Fps is (10 sec: 47504.4, 60 sec: 44235.3, 300 sec: 44208.7). Total num frames: 1633042432. Throughput: 0: 43930.1. Samples: 1535975300. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-27 23:27:53,852][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:27:58,133][06909] Updated weights for policy 0, policy_version 99683 (0.0035) [2024-06-27 23:27:58,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 1633239040. Throughput: 0: 43962.6. Samples: 1536108840. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-27 23:27:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:28:01,508][06909] Updated weights for policy 0, policy_version 99693 (0.0032) [2024-06-27 23:28:03,850][06674] Fps is (10 sec: 42607.1, 60 sec: 44509.9, 300 sec: 44209.0). Total num frames: 1633468416. Throughput: 0: 44168.0. Samples: 1536379840. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-27 23:28:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:28:05,375][06909] Updated weights for policy 0, policy_version 99703 (0.0023) [2024-06-27 23:28:08,779][06909] Updated weights for policy 0, policy_version 99713 (0.0032) [2024-06-27 23:28:08,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 1633697792. Throughput: 0: 44018.2. Samples: 1536636100. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-27 23:28:08,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:28:12,741][06909] Updated weights for policy 0, policy_version 99723 (0.0025) [2024-06-27 23:28:13,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 1633894400. Throughput: 0: 44080.1. Samples: 1536772820. Policy #0 lag: (min: 1.0, avg: 9.7, max: 21.0) [2024-06-27 23:28:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:28:16,097][06909] Updated weights for policy 0, policy_version 99733 (0.0027) [2024-06-27 23:28:18,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 1634107392. Throughput: 0: 44321.3. Samples: 1537047100. Policy #0 lag: (min: 1.0, avg: 9.7, max: 21.0) [2024-06-27 23:28:18,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 23:28:20,320][06909] Updated weights for policy 0, policy_version 99743 (0.0039) [2024-06-27 23:28:23,596][06909] Updated weights for policy 0, policy_version 99753 (0.0031) [2024-06-27 23:28:23,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.9, 300 sec: 44209.0). Total num frames: 1634353152. Throughput: 0: 44142.2. Samples: 1537307460. Policy #0 lag: (min: 1.0, avg: 9.7, max: 21.0) [2024-06-27 23:28:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:28:27,526][06909] Updated weights for policy 0, policy_version 99763 (0.0026) [2024-06-27 23:28:28,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1634549760. Throughput: 0: 44132.5. Samples: 1537440340. Policy #0 lag: (min: 1.0, avg: 9.7, max: 21.0) [2024-06-27 23:28:28,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:28:30,867][06909] Updated weights for policy 0, policy_version 99773 (0.0029) [2024-06-27 23:28:32,908][06887] Signal inference workers to stop experience collection... (22000 times) [2024-06-27 23:28:32,908][06887] Signal inference workers to resume experience collection... (22000 times) [2024-06-27 23:28:32,933][06909] InferenceWorker_p0-w0: stopping experience collection (22000 times) [2024-06-27 23:28:32,933][06909] InferenceWorker_p0-w0: resuming experience collection (22000 times) [2024-06-27 23:28:33,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 1634795520. Throughput: 0: 44192.5. Samples: 1537707860. Policy #0 lag: (min: 1.0, avg: 9.7, max: 21.0) [2024-06-27 23:28:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:28:35,186][06909] Updated weights for policy 0, policy_version 99783 (0.0031) [2024-06-27 23:28:38,355][06909] Updated weights for policy 0, policy_version 99793 (0.0029) [2024-06-27 23:28:38,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 1635008512. Throughput: 0: 44253.9. Samples: 1537966640. Policy #0 lag: (min: 1.0, avg: 9.7, max: 21.0) [2024-06-27 23:28:38,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:28:42,332][06909] Updated weights for policy 0, policy_version 99803 (0.0035) [2024-06-27 23:28:43,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43963.9, 300 sec: 44042.4). Total num frames: 1635205120. Throughput: 0: 44374.4. Samples: 1538105680. Policy #0 lag: (min: 1.0, avg: 9.7, max: 21.0) [2024-06-27 23:28:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:28:45,574][06909] Updated weights for policy 0, policy_version 99813 (0.0036) [2024-06-27 23:28:48,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1635450880. Throughput: 0: 44213.3. Samples: 1538369440. Policy #0 lag: (min: 1.0, avg: 9.7, max: 21.0) [2024-06-27 23:28:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:28:49,558][06909] Updated weights for policy 0, policy_version 99823 (0.0033) [2024-06-27 23:28:53,287][06909] Updated weights for policy 0, policy_version 99833 (0.0036) [2024-06-27 23:28:53,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43692.2, 300 sec: 44153.5). Total num frames: 1635663872. Throughput: 0: 44330.4. Samples: 1538630960. Policy #0 lag: (min: 1.0, avg: 9.7, max: 21.0) [2024-06-27 23:28:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:28:57,468][06909] Updated weights for policy 0, policy_version 99843 (0.0029) [2024-06-27 23:28:58,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1635876864. Throughput: 0: 44236.4. Samples: 1538763460. Policy #0 lag: (min: 1.0, avg: 9.7, max: 21.0) [2024-06-27 23:28:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:29:00,808][06909] Updated weights for policy 0, policy_version 99853 (0.0024) [2024-06-27 23:29:03,850][06674] Fps is (10 sec: 47513.1, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 1636139008. Throughput: 0: 44105.3. Samples: 1539031840. Policy #0 lag: (min: 1.0, avg: 9.7, max: 21.0) [2024-06-27 23:29:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:29:04,645][06909] Updated weights for policy 0, policy_version 99863 (0.0035) [2024-06-27 23:29:08,101][06909] Updated weights for policy 0, policy_version 99873 (0.0027) [2024-06-27 23:29:08,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 1636335616. Throughput: 0: 44302.2. Samples: 1539301060. Policy #0 lag: (min: 1.0, avg: 9.7, max: 21.0) [2024-06-27 23:29:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:29:11,839][06909] Updated weights for policy 0, policy_version 99883 (0.0026) [2024-06-27 23:29:13,850][06674] Fps is (10 sec: 40959.6, 60 sec: 44236.6, 300 sec: 44097.9). Total num frames: 1636548608. Throughput: 0: 44212.8. Samples: 1539429920. Policy #0 lag: (min: 1.0, avg: 9.7, max: 21.0) [2024-06-27 23:29:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:29:15,848][06909] Updated weights for policy 0, policy_version 99893 (0.0032) [2024-06-27 23:29:18,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44782.9, 300 sec: 44209.0). Total num frames: 1636794368. Throughput: 0: 44145.2. Samples: 1539694400. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-27 23:29:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:29:19,504][06909] Updated weights for policy 0, policy_version 99903 (0.0028) [2024-06-27 23:29:23,278][06909] Updated weights for policy 0, policy_version 99913 (0.0031) [2024-06-27 23:29:23,852][06674] Fps is (10 sec: 42589.2, 60 sec: 43688.9, 300 sec: 43986.5). Total num frames: 1636974592. Throughput: 0: 44177.8. Samples: 1539954740. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-27 23:29:23,853][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:29:26,727][06909] Updated weights for policy 0, policy_version 99923 (0.0026) [2024-06-27 23:29:28,850][06674] Fps is (10 sec: 40960.9, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 1637203968. Throughput: 0: 43983.1. Samples: 1540084920. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-27 23:29:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:29:30,650][06909] Updated weights for policy 0, policy_version 99933 (0.0042) [2024-06-27 23:29:33,850][06674] Fps is (10 sec: 47525.1, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1637449728. Throughput: 0: 44185.4. Samples: 1540357780. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-27 23:29:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:29:33,951][06909] Updated weights for policy 0, policy_version 99943 (0.0024) [2024-06-27 23:29:38,114][06909] Updated weights for policy 0, policy_version 99953 (0.0050) [2024-06-27 23:29:38,852][06674] Fps is (10 sec: 45865.5, 60 sec: 44235.3, 300 sec: 44153.2). Total num frames: 1637662720. Throughput: 0: 44314.4. Samples: 1540625200. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-27 23:29:38,852][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:29:41,573][06909] Updated weights for policy 0, policy_version 99963 (0.0034) [2024-06-27 23:29:43,850][06674] Fps is (10 sec: 42598.0, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 1637875712. Throughput: 0: 44229.7. Samples: 1540753800. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-27 23:29:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:29:45,298][06909] Updated weights for policy 0, policy_version 99973 (0.0030) [2024-06-27 23:29:48,776][06909] Updated weights for policy 0, policy_version 99983 (0.0033) [2024-06-27 23:29:48,850][06674] Fps is (10 sec: 45884.1, 60 sec: 44509.8, 300 sec: 44209.0). Total num frames: 1638121472. Throughput: 0: 44111.1. Samples: 1541016840. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-27 23:29:48,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:29:48,858][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000099983_1638121472.pth... [2024-06-27 23:29:48,920][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000099335_1627504640.pth [2024-06-27 23:29:53,331][06909] Updated weights for policy 0, policy_version 99993 (0.0040) [2024-06-27 23:29:53,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.7, 300 sec: 44042.7). Total num frames: 1638301696. Throughput: 0: 44054.7. Samples: 1541283520. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-27 23:29:53,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:29:56,426][06909] Updated weights for policy 0, policy_version 100003 (0.0034) [2024-06-27 23:29:58,850][06674] Fps is (10 sec: 40960.0, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 1638531072. Throughput: 0: 44103.6. Samples: 1541414580. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-27 23:29:58,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:30:00,520][06909] Updated weights for policy 0, policy_version 100013 (0.0024) [2024-06-27 23:30:03,381][06887] Signal inference workers to stop experience collection... (22050 times) [2024-06-27 23:30:03,381][06887] Signal inference workers to resume experience collection... (22050 times) [2024-06-27 23:30:03,423][06909] InferenceWorker_p0-w0: stopping experience collection (22050 times) [2024-06-27 23:30:03,423][06909] InferenceWorker_p0-w0: resuming experience collection (22050 times) [2024-06-27 23:30:03,665][06909] Updated weights for policy 0, policy_version 100023 (0.0033) [2024-06-27 23:30:03,850][06674] Fps is (10 sec: 47513.2, 60 sec: 43963.8, 300 sec: 44209.0). Total num frames: 1638776832. Throughput: 0: 44053.4. Samples: 1541676800. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-27 23:30:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:30:07,933][06909] Updated weights for policy 0, policy_version 100033 (0.0029) [2024-06-27 23:30:08,850][06674] Fps is (10 sec: 44237.6, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 1638973440. Throughput: 0: 44267.7. Samples: 1541946680. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-27 23:30:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:30:10,954][06909] Updated weights for policy 0, policy_version 100043 (0.0030) [2024-06-27 23:30:13,850][06674] Fps is (10 sec: 42599.1, 60 sec: 44237.0, 300 sec: 44098.0). Total num frames: 1639202816. Throughput: 0: 44368.0. Samples: 1542081480. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-27 23:30:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:30:15,131][06909] Updated weights for policy 0, policy_version 100053 (0.0041) [2024-06-27 23:30:18,492][06909] Updated weights for policy 0, policy_version 100063 (0.0038) [2024-06-27 23:30:18,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.9, 300 sec: 44264.6). Total num frames: 1639432192. Throughput: 0: 44378.6. Samples: 1542354820. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 23:30:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:30:22,524][06909] Updated weights for policy 0, policy_version 100073 (0.0033) [2024-06-27 23:30:23,850][06674] Fps is (10 sec: 44236.2, 60 sec: 44511.6, 300 sec: 44209.0). Total num frames: 1639645184. Throughput: 0: 44286.0. Samples: 1542617980. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 23:30:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:30:26,046][06909] Updated weights for policy 0, policy_version 100083 (0.0040) [2024-06-27 23:30:28,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 1639874560. Throughput: 0: 44265.8. Samples: 1542745760. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 23:30:28,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 23:30:30,433][06909] Updated weights for policy 0, policy_version 100093 (0.0023) [2024-06-27 23:30:33,299][06909] Updated weights for policy 0, policy_version 100103 (0.0037) [2024-06-27 23:30:33,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43963.6, 300 sec: 44264.6). Total num frames: 1640087552. Throughput: 0: 44294.6. Samples: 1543010100. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 23:30:33,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:30:37,973][06909] Updated weights for policy 0, policy_version 100113 (0.0034) [2024-06-27 23:30:38,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43965.2, 300 sec: 44153.5). Total num frames: 1640300544. Throughput: 0: 44103.5. Samples: 1543268180. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 23:30:38,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 23:30:41,076][06909] Updated weights for policy 0, policy_version 100123 (0.0031) [2024-06-27 23:30:43,850][06674] Fps is (10 sec: 44237.3, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 1640529920. Throughput: 0: 44053.8. Samples: 1543397000. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 23:30:43,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:30:45,389][06909] Updated weights for policy 0, policy_version 100133 (0.0022) [2024-06-27 23:30:48,416][06909] Updated weights for policy 0, policy_version 100143 (0.0029) [2024-06-27 23:30:48,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.8, 300 sec: 44209.0). Total num frames: 1640742912. Throughput: 0: 44249.9. Samples: 1543668040. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 23:30:48,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:30:52,826][06909] Updated weights for policy 0, policy_version 100153 (0.0022) [2024-06-27 23:30:53,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 1640972288. Throughput: 0: 44158.1. Samples: 1543933800. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 23:30:53,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:30:56,040][06909] Updated weights for policy 0, policy_version 100163 (0.0032) [2024-06-27 23:30:58,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44236.8, 300 sec: 44153.8). Total num frames: 1641185280. Throughput: 0: 43985.1. Samples: 1544060820. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 23:30:58,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:30:59,974][06909] Updated weights for policy 0, policy_version 100173 (0.0039) [2024-06-27 23:31:03,659][06909] Updated weights for policy 0, policy_version 100183 (0.0041) [2024-06-27 23:31:03,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 1641398272. Throughput: 0: 43730.2. Samples: 1544322680. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 23:31:03,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 23:31:07,609][06909] Updated weights for policy 0, policy_version 100193 (0.0039) [2024-06-27 23:31:08,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1641627648. Throughput: 0: 43760.5. Samples: 1544587200. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 23:31:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:31:11,187][06909] Updated weights for policy 0, policy_version 100203 (0.0032) [2024-06-27 23:31:13,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 1641857024. Throughput: 0: 43916.1. Samples: 1544721980. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-27 23:31:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:31:15,167][06909] Updated weights for policy 0, policy_version 100213 (0.0032) [2024-06-27 23:31:18,495][06909] Updated weights for policy 0, policy_version 100223 (0.0031) [2024-06-27 23:31:18,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43963.6, 300 sec: 44097.9). Total num frames: 1642070016. Throughput: 0: 43845.4. Samples: 1544983140. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2024-06-27 23:31:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:31:22,560][06909] Updated weights for policy 0, policy_version 100233 (0.0032) [2024-06-27 23:31:23,644][06887] Signal inference workers to stop experience collection... (22100 times) [2024-06-27 23:31:23,683][06909] InferenceWorker_p0-w0: stopping experience collection (22100 times) [2024-06-27 23:31:23,698][06887] Signal inference workers to resume experience collection... (22100 times) [2024-06-27 23:31:23,701][06909] InferenceWorker_p0-w0: resuming experience collection (22100 times) [2024-06-27 23:31:23,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1642299392. Throughput: 0: 44057.3. Samples: 1545250760. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2024-06-27 23:31:23,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:31:25,710][06909] Updated weights for policy 0, policy_version 100243 (0.0026) [2024-06-27 23:31:28,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43963.8, 300 sec: 44209.0). Total num frames: 1642512384. Throughput: 0: 44200.1. Samples: 1545386000. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2024-06-27 23:31:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:31:29,829][06909] Updated weights for policy 0, policy_version 100253 (0.0045) [2024-06-27 23:31:33,053][06909] Updated weights for policy 0, policy_version 100263 (0.0038) [2024-06-27 23:31:33,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 1642725376. Throughput: 0: 44023.9. Samples: 1545649120. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2024-06-27 23:31:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:31:37,107][06909] Updated weights for policy 0, policy_version 100273 (0.0031) [2024-06-27 23:31:38,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 1642954752. Throughput: 0: 44125.0. Samples: 1545919420. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2024-06-27 23:31:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-27 23:31:40,583][06909] Updated weights for policy 0, policy_version 100283 (0.0027) [2024-06-27 23:31:43,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44236.9, 300 sec: 44209.0). Total num frames: 1643184128. Throughput: 0: 44265.0. Samples: 1546052740. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2024-06-27 23:31:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:31:44,450][06909] Updated weights for policy 0, policy_version 100293 (0.0036) [2024-06-27 23:31:47,780][06909] Updated weights for policy 0, policy_version 100303 (0.0040) [2024-06-27 23:31:48,850][06674] Fps is (10 sec: 44235.8, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 1643397120. Throughput: 0: 44192.8. Samples: 1546311360. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2024-06-27 23:31:48,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:31:48,855][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000100305_1643397120.pth... [2024-06-27 23:31:48,907][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000099658_1632796672.pth [2024-06-27 23:31:52,129][06909] Updated weights for policy 0, policy_version 100313 (0.0044) [2024-06-27 23:31:53,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 1643610112. Throughput: 0: 44224.0. Samples: 1546577280. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2024-06-27 23:31:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:31:55,493][06909] Updated weights for policy 0, policy_version 100323 (0.0036) [2024-06-27 23:31:58,850][06674] Fps is (10 sec: 44237.3, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 1643839488. Throughput: 0: 44190.1. Samples: 1546710540. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2024-06-27 23:31:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:31:59,557][06909] Updated weights for policy 0, policy_version 100333 (0.0033) [2024-06-27 23:32:02,682][06909] Updated weights for policy 0, policy_version 100343 (0.0026) [2024-06-27 23:32:03,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1644052480. Throughput: 0: 44328.6. Samples: 1546977920. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2024-06-27 23:32:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:32:06,858][06909] Updated weights for policy 0, policy_version 100353 (0.0040) [2024-06-27 23:32:08,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 1644281856. Throughput: 0: 44400.0. Samples: 1547248760. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2024-06-27 23:32:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:32:10,026][06909] Updated weights for policy 0, policy_version 100363 (0.0052) [2024-06-27 23:32:13,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 1644494848. Throughput: 0: 44349.4. Samples: 1547381720. Policy #0 lag: (min: 0.0, avg: 8.1, max: 19.0) [2024-06-27 23:32:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:32:13,965][06909] Updated weights for policy 0, policy_version 100373 (0.0042) [2024-06-27 23:32:17,386][06909] Updated weights for policy 0, policy_version 100383 (0.0027) [2024-06-27 23:32:18,851][06674] Fps is (10 sec: 42593.0, 60 sec: 43962.9, 300 sec: 44097.8). Total num frames: 1644707840. Throughput: 0: 44385.9. Samples: 1547646540. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 23:32:18,852][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:32:21,461][06909] Updated weights for policy 0, policy_version 100393 (0.0030) [2024-06-27 23:32:23,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 1644953600. Throughput: 0: 44233.7. Samples: 1547909940. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 23:32:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 23:32:24,632][06909] Updated weights for policy 0, policy_version 100403 (0.0028) [2024-06-27 23:32:28,850][06674] Fps is (10 sec: 44242.9, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 1645150208. Throughput: 0: 44455.1. Samples: 1548053220. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 23:32:28,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 23:32:28,960][06909] Updated weights for policy 0, policy_version 100413 (0.0037) [2024-06-27 23:32:31,978][06909] Updated weights for policy 0, policy_version 100423 (0.0038) [2024-06-27 23:32:33,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 1645379584. Throughput: 0: 44416.6. Samples: 1548310100. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 23:32:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:32:34,743][06887] Signal inference workers to stop experience collection... (22150 times) [2024-06-27 23:32:34,779][06909] InferenceWorker_p0-w0: stopping experience collection (22150 times) [2024-06-27 23:32:34,804][06887] Signal inference workers to resume experience collection... (22150 times) [2024-06-27 23:32:34,808][06909] InferenceWorker_p0-w0: resuming experience collection (22150 times) [2024-06-27 23:32:36,227][06909] Updated weights for policy 0, policy_version 100433 (0.0027) [2024-06-27 23:32:38,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44509.8, 300 sec: 44264.6). Total num frames: 1645625344. Throughput: 0: 44300.0. Samples: 1548570780. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 23:32:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-27 23:32:39,607][06909] Updated weights for policy 0, policy_version 100443 (0.0031) [2024-06-27 23:32:43,574][06909] Updated weights for policy 0, policy_version 100453 (0.0035) [2024-06-27 23:32:43,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43963.6, 300 sec: 44153.5). Total num frames: 1645821952. Throughput: 0: 44555.5. Samples: 1548715540. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 23:32:43,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-27 23:32:47,131][06909] Updated weights for policy 0, policy_version 100463 (0.0033) [2024-06-27 23:32:48,850][06674] Fps is (10 sec: 42598.2, 60 sec: 44236.9, 300 sec: 44098.3). Total num frames: 1646051328. Throughput: 0: 44377.8. Samples: 1548974920. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 23:32:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:32:50,865][06909] Updated weights for policy 0, policy_version 100473 (0.0046) [2024-06-27 23:32:53,856][06674] Fps is (10 sec: 44210.7, 60 sec: 44232.3, 300 sec: 44152.6). Total num frames: 1646264320. Throughput: 0: 44088.8. Samples: 1549233020. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 23:32:53,856][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:32:54,573][06909] Updated weights for policy 0, policy_version 100483 (0.0041) [2024-06-27 23:32:58,454][06909] Updated weights for policy 0, policy_version 100493 (0.0035) [2024-06-27 23:32:58,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1646493696. Throughput: 0: 44283.0. Samples: 1549374460. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 23:32:58,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:33:02,284][06909] Updated weights for policy 0, policy_version 100503 (0.0032) [2024-06-27 23:33:03,850][06674] Fps is (10 sec: 44263.4, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1646706688. Throughput: 0: 44126.6. Samples: 1549632180. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 23:33:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:33:05,989][06909] Updated weights for policy 0, policy_version 100513 (0.0035) [2024-06-27 23:33:08,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 1646936064. Throughput: 0: 44055.9. Samples: 1549892460. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 23:33:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:33:09,557][06909] Updated weights for policy 0, policy_version 100523 (0.0027) [2024-06-27 23:33:13,447][06909] Updated weights for policy 0, policy_version 100533 (0.0034) [2024-06-27 23:33:13,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 1647149056. Throughput: 0: 43936.4. Samples: 1550030360. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-27 23:33:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:33:16,996][06909] Updated weights for policy 0, policy_version 100543 (0.0032) [2024-06-27 23:33:18,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44237.7, 300 sec: 44097.9). Total num frames: 1647362048. Throughput: 0: 44006.1. Samples: 1550290380. Policy #0 lag: (min: 0.0, avg: 11.6, max: 24.0) [2024-06-27 23:33:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:33:20,905][06909] Updated weights for policy 0, policy_version 100553 (0.0024) [2024-06-27 23:33:23,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 1647591424. Throughput: 0: 44128.4. Samples: 1550556560. Policy #0 lag: (min: 0.0, avg: 11.6, max: 24.0) [2024-06-27 23:33:23,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:33:24,594][06909] Updated weights for policy 0, policy_version 100563 (0.0037) [2024-06-27 23:33:28,144][06909] Updated weights for policy 0, policy_version 100573 (0.0029) [2024-06-27 23:33:28,856][06674] Fps is (10 sec: 45848.0, 60 sec: 44505.4, 300 sec: 44152.6). Total num frames: 1647820800. Throughput: 0: 43977.3. Samples: 1550694780. Policy #0 lag: (min: 0.0, avg: 11.6, max: 24.0) [2024-06-27 23:33:28,856][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:33:31,748][06909] Updated weights for policy 0, policy_version 100583 (0.0033) [2024-06-27 23:33:33,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 1648017408. Throughput: 0: 44090.7. Samples: 1550959000. Policy #0 lag: (min: 0.0, avg: 11.6, max: 24.0) [2024-06-27 23:33:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:33:35,586][06909] Updated weights for policy 0, policy_version 100593 (0.0029) [2024-06-27 23:33:38,850][06674] Fps is (10 sec: 42624.5, 60 sec: 43690.7, 300 sec: 44209.0). Total num frames: 1648246784. Throughput: 0: 44142.4. Samples: 1551219160. Policy #0 lag: (min: 0.0, avg: 11.6, max: 24.0) [2024-06-27 23:33:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:33:39,343][06909] Updated weights for policy 0, policy_version 100603 (0.0031) [2024-06-27 23:33:43,001][06909] Updated weights for policy 0, policy_version 100613 (0.0031) [2024-06-27 23:33:43,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 1648476160. Throughput: 0: 44042.3. Samples: 1551356360. Policy #0 lag: (min: 0.0, avg: 11.6, max: 24.0) [2024-06-27 23:33:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:33:46,840][06909] Updated weights for policy 0, policy_version 100623 (0.0032) [2024-06-27 23:33:48,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 1648689152. Throughput: 0: 44232.9. Samples: 1551622660. Policy #0 lag: (min: 0.0, avg: 11.6, max: 24.0) [2024-06-27 23:33:48,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 23:33:48,967][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000100629_1648705536.pth... [2024-06-27 23:33:49,029][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000099983_1638121472.pth [2024-06-27 23:33:50,121][06909] Updated weights for policy 0, policy_version 100633 (0.0032) [2024-06-27 23:33:53,850][06674] Fps is (10 sec: 44236.2, 60 sec: 44241.2, 300 sec: 44209.0). Total num frames: 1648918528. Throughput: 0: 44358.2. Samples: 1551888580. Policy #0 lag: (min: 0.0, avg: 11.6, max: 24.0) [2024-06-27 23:33:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:33:54,081][06909] Updated weights for policy 0, policy_version 100643 (0.0027) [2024-06-27 23:33:56,937][06887] Signal inference workers to stop experience collection... (22200 times) [2024-06-27 23:33:56,938][06887] Signal inference workers to resume experience collection... (22200 times) [2024-06-27 23:33:56,949][06909] InferenceWorker_p0-w0: stopping experience collection (22200 times) [2024-06-27 23:33:56,950][06909] InferenceWorker_p0-w0: resuming experience collection (22200 times) [2024-06-27 23:33:57,870][06909] Updated weights for policy 0, policy_version 100653 (0.0035) [2024-06-27 23:33:58,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 1649147904. Throughput: 0: 44341.8. Samples: 1552025740. Policy #0 lag: (min: 0.0, avg: 11.6, max: 24.0) [2024-06-27 23:33:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:34:01,679][06909] Updated weights for policy 0, policy_version 100663 (0.0022) [2024-06-27 23:34:03,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 1649344512. Throughput: 0: 44371.2. Samples: 1552287080. Policy #0 lag: (min: 0.0, avg: 11.6, max: 24.0) [2024-06-27 23:34:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:34:05,269][06909] Updated weights for policy 0, policy_version 100673 (0.0028) [2024-06-27 23:34:08,814][06909] Updated weights for policy 0, policy_version 100683 (0.0034) [2024-06-27 23:34:08,852][06674] Fps is (10 sec: 44227.7, 60 sec: 44235.4, 300 sec: 44208.8). Total num frames: 1649590272. Throughput: 0: 44306.4. Samples: 1552550440. Policy #0 lag: (min: 0.0, avg: 11.6, max: 24.0) [2024-06-27 23:34:08,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:34:12,675][06909] Updated weights for policy 0, policy_version 100693 (0.0038) [2024-06-27 23:34:13,850][06674] Fps is (10 sec: 47513.1, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 1649819648. Throughput: 0: 44208.5. Samples: 1552683900. Policy #0 lag: (min: 0.0, avg: 11.6, max: 24.0) [2024-06-27 23:34:13,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:34:16,551][06909] Updated weights for policy 0, policy_version 100703 (0.0025) [2024-06-27 23:34:18,850][06674] Fps is (10 sec: 44245.2, 60 sec: 44509.8, 300 sec: 44264.9). Total num frames: 1650032640. Throughput: 0: 44242.9. Samples: 1552949940. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 23:34:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:34:19,960][06909] Updated weights for policy 0, policy_version 100713 (0.0027) [2024-06-27 23:34:23,739][06909] Updated weights for policy 0, policy_version 100723 (0.0030) [2024-06-27 23:34:23,850][06674] Fps is (10 sec: 42599.1, 60 sec: 44236.9, 300 sec: 44209.0). Total num frames: 1650245632. Throughput: 0: 44334.7. Samples: 1553214220. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 23:34:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:34:27,251][06909] Updated weights for policy 0, policy_version 100733 (0.0026) [2024-06-27 23:34:28,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44514.3, 300 sec: 44209.0). Total num frames: 1650491392. Throughput: 0: 44240.3. Samples: 1553347180. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 23:34:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:34:30,873][06909] Updated weights for policy 0, policy_version 100743 (0.0041) [2024-06-27 23:34:33,850][06674] Fps is (10 sec: 44235.8, 60 sec: 44509.7, 300 sec: 44153.8). Total num frames: 1650688000. Throughput: 0: 44321.2. Samples: 1553617120. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 23:34:33,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:34:34,775][06909] Updated weights for policy 0, policy_version 100753 (0.0026) [2024-06-27 23:34:38,194][06909] Updated weights for policy 0, policy_version 100763 (0.0026) [2024-06-27 23:34:38,850][06674] Fps is (10 sec: 40959.7, 60 sec: 44236.6, 300 sec: 44153.5). Total num frames: 1650900992. Throughput: 0: 44210.2. Samples: 1553878040. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 23:34:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:34:42,456][06909] Updated weights for policy 0, policy_version 100773 (0.0030) [2024-06-27 23:34:43,850][06674] Fps is (10 sec: 45876.0, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 1651146752. Throughput: 0: 44074.2. Samples: 1554009080. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 23:34:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:34:45,630][06909] Updated weights for policy 0, policy_version 100783 (0.0040) [2024-06-27 23:34:48,856][06674] Fps is (10 sec: 44210.4, 60 sec: 44232.3, 300 sec: 44208.1). Total num frames: 1651343360. Throughput: 0: 44170.4. Samples: 1554275020. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 23:34:48,857][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:34:49,665][06909] Updated weights for policy 0, policy_version 100793 (0.0030) [2024-06-27 23:34:53,003][06909] Updated weights for policy 0, policy_version 100803 (0.0032) [2024-06-27 23:34:53,850][06674] Fps is (10 sec: 42598.1, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 1651572736. Throughput: 0: 44127.3. Samples: 1554536080. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 23:34:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-27 23:34:56,989][06909] Updated weights for policy 0, policy_version 100813 (0.0025) [2024-06-27 23:34:58,850][06674] Fps is (10 sec: 47542.7, 60 sec: 44509.9, 300 sec: 44209.0). Total num frames: 1651818496. Throughput: 0: 44221.9. Samples: 1554673880. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 23:34:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:35:00,386][06909] Updated weights for policy 0, policy_version 100823 (0.0035) [2024-06-27 23:35:03,850][06674] Fps is (10 sec: 44235.0, 60 sec: 44509.5, 300 sec: 44209.0). Total num frames: 1652015104. Throughput: 0: 44161.5. Samples: 1554937220. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 23:35:03,851][06674] Avg episode reward: [(0, '0.401')] [2024-06-27 23:35:04,305][06909] Updated weights for policy 0, policy_version 100833 (0.0035) [2024-06-27 23:35:08,069][06909] Updated weights for policy 0, policy_version 100843 (0.0041) [2024-06-27 23:35:08,850][06674] Fps is (10 sec: 39321.2, 60 sec: 43692.1, 300 sec: 44097.9). Total num frames: 1652211712. Throughput: 0: 44142.9. Samples: 1555200660. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 23:35:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:35:12,076][06909] Updated weights for policy 0, policy_version 100853 (0.0029) [2024-06-27 23:35:13,013][06887] Signal inference workers to stop experience collection... (22250 times) [2024-06-27 23:35:13,013][06887] Signal inference workers to resume experience collection... (22250 times) [2024-06-27 23:35:13,046][06909] InferenceWorker_p0-w0: stopping experience collection (22250 times) [2024-06-27 23:35:13,046][06909] InferenceWorker_p0-w0: resuming experience collection (22250 times) [2024-06-27 23:35:13,850][06674] Fps is (10 sec: 47515.2, 60 sec: 44509.9, 300 sec: 44264.6). Total num frames: 1652490240. Throughput: 0: 44131.5. Samples: 1555333100. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-27 23:35:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:35:15,157][06909] Updated weights for policy 0, policy_version 100863 (0.0036) [2024-06-27 23:35:18,850][06674] Fps is (10 sec: 45875.8, 60 sec: 43963.9, 300 sec: 44153.5). Total num frames: 1652670464. Throughput: 0: 44230.4. Samples: 1555607480. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 23:35:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:35:19,287][06909] Updated weights for policy 0, policy_version 100873 (0.0036) [2024-06-27 23:35:22,396][06909] Updated weights for policy 0, policy_version 100883 (0.0026) [2024-06-27 23:35:23,850][06674] Fps is (10 sec: 40959.8, 60 sec: 44236.6, 300 sec: 44153.5). Total num frames: 1652899840. Throughput: 0: 44189.3. Samples: 1555866560. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 23:35:23,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:35:26,736][06909] Updated weights for policy 0, policy_version 100893 (0.0025) [2024-06-27 23:35:28,850][06674] Fps is (10 sec: 47513.1, 60 sec: 44236.8, 300 sec: 44264.6). Total num frames: 1653145600. Throughput: 0: 44261.7. Samples: 1556000860. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 23:35:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:35:30,095][06909] Updated weights for policy 0, policy_version 100903 (0.0033) [2024-06-27 23:35:33,850][06674] Fps is (10 sec: 40960.9, 60 sec: 43690.8, 300 sec: 44098.0). Total num frames: 1653309440. Throughput: 0: 44222.1. Samples: 1556264740. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 23:35:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:35:34,320][06909] Updated weights for policy 0, policy_version 100913 (0.0036) [2024-06-27 23:35:37,270][06909] Updated weights for policy 0, policy_version 100923 (0.0042) [2024-06-27 23:35:38,850][06674] Fps is (10 sec: 40960.3, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 1653555200. Throughput: 0: 44266.3. Samples: 1556528060. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 23:35:38,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:35:41,901][06909] Updated weights for policy 0, policy_version 100933 (0.0023) [2024-06-27 23:35:43,850][06674] Fps is (10 sec: 49151.8, 60 sec: 44236.8, 300 sec: 44264.6). Total num frames: 1653800960. Throughput: 0: 44120.0. Samples: 1556659280. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 23:35:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:35:44,910][06909] Updated weights for policy 0, policy_version 100943 (0.0027) [2024-06-27 23:35:48,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44241.3, 300 sec: 44153.5). Total num frames: 1653997568. Throughput: 0: 44196.4. Samples: 1556926040. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 23:35:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:35:48,869][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000100952_1653997568.pth... [2024-06-27 23:35:48,925][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000100305_1643397120.pth [2024-06-27 23:35:49,415][06909] Updated weights for policy 0, policy_version 100953 (0.0030) [2024-06-27 23:35:52,344][06909] Updated weights for policy 0, policy_version 100963 (0.0033) [2024-06-27 23:35:53,850][06674] Fps is (10 sec: 42598.0, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 1654226944. Throughput: 0: 44259.6. Samples: 1557192340. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 23:35:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:35:56,687][06909] Updated weights for policy 0, policy_version 100973 (0.0031) [2024-06-27 23:35:58,850][06674] Fps is (10 sec: 47513.8, 60 sec: 44236.8, 300 sec: 44320.1). Total num frames: 1654472704. Throughput: 0: 44300.1. Samples: 1557326600. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 23:35:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:35:59,528][06909] Updated weights for policy 0, policy_version 100983 (0.0032) [2024-06-27 23:36:03,852][06674] Fps is (10 sec: 42589.9, 60 sec: 43962.6, 300 sec: 44153.2). Total num frames: 1654652928. Throughput: 0: 43954.4. Samples: 1557585520. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 23:36:03,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:36:04,058][06909] Updated weights for policy 0, policy_version 100993 (0.0038) [2024-06-27 23:36:07,320][06909] Updated weights for policy 0, policy_version 101003 (0.0036) [2024-06-27 23:36:08,850][06674] Fps is (10 sec: 39321.7, 60 sec: 44236.9, 300 sec: 44097.9). Total num frames: 1654865920. Throughput: 0: 43986.3. Samples: 1557845940. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 23:36:08,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:36:11,526][06909] Updated weights for policy 0, policy_version 101013 (0.0040) [2024-06-27 23:36:13,850][06674] Fps is (10 sec: 45884.5, 60 sec: 43690.7, 300 sec: 44209.0). Total num frames: 1655111680. Throughput: 0: 43923.2. Samples: 1557977400. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 23:36:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:36:14,741][06909] Updated weights for policy 0, policy_version 101023 (0.0023) [2024-06-27 23:36:18,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 1655308288. Throughput: 0: 43983.0. Samples: 1558243980. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 23:36:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:36:19,053][06909] Updated weights for policy 0, policy_version 101033 (0.0031) [2024-06-27 23:36:22,216][06909] Updated weights for policy 0, policy_version 101043 (0.0032) [2024-06-27 23:36:23,852][06674] Fps is (10 sec: 40951.6, 60 sec: 43689.3, 300 sec: 44097.6). Total num frames: 1655521280. Throughput: 0: 43876.6. Samples: 1558502600. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 23:36:23,853][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:36:26,509][06909] Updated weights for policy 0, policy_version 101053 (0.0038) [2024-06-27 23:36:28,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43690.7, 300 sec: 44209.0). Total num frames: 1655767040. Throughput: 0: 43935.9. Samples: 1558636400. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 23:36:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:36:29,587][06909] Updated weights for policy 0, policy_version 101063 (0.0021) [2024-06-27 23:36:32,443][06887] Signal inference workers to stop experience collection... (22300 times) [2024-06-27 23:36:32,477][06909] InferenceWorker_p0-w0: stopping experience collection (22300 times) [2024-06-27 23:36:32,501][06887] Signal inference workers to resume experience collection... (22300 times) [2024-06-27 23:36:32,502][06909] InferenceWorker_p0-w0: resuming experience collection (22300 times) [2024-06-27 23:36:33,850][06674] Fps is (10 sec: 44245.7, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 1655963648. Throughput: 0: 43898.7. Samples: 1558901480. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 23:36:33,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:36:34,098][06909] Updated weights for policy 0, policy_version 101073 (0.0037) [2024-06-27 23:36:36,934][06909] Updated weights for policy 0, policy_version 101083 (0.0029) [2024-06-27 23:36:38,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 1656193024. Throughput: 0: 43676.0. Samples: 1559157760. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 23:36:38,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:36:41,216][06909] Updated weights for policy 0, policy_version 101093 (0.0020) [2024-06-27 23:36:43,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 1656422400. Throughput: 0: 43645.0. Samples: 1559290620. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 23:36:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:36:44,743][06909] Updated weights for policy 0, policy_version 101103 (0.0038) [2024-06-27 23:36:48,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.7, 300 sec: 44097.9). Total num frames: 1656619008. Throughput: 0: 43841.5. Samples: 1559558300. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 23:36:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:36:48,867][06909] Updated weights for policy 0, policy_version 101113 (0.0036) [2024-06-27 23:36:52,090][06909] Updated weights for policy 0, policy_version 101123 (0.0033) [2024-06-27 23:36:53,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 1656864768. Throughput: 0: 43837.4. Samples: 1559818620. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 23:36:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:36:56,145][06909] Updated weights for policy 0, policy_version 101133 (0.0039) [2024-06-27 23:36:58,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43417.5, 300 sec: 44153.5). Total num frames: 1657077760. Throughput: 0: 43910.1. Samples: 1559953360. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 23:36:58,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:36:59,606][06909] Updated weights for policy 0, policy_version 101143 (0.0030) [2024-06-27 23:37:03,405][06909] Updated weights for policy 0, policy_version 101153 (0.0040) [2024-06-27 23:37:03,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43965.3, 300 sec: 44098.0). Total num frames: 1657290752. Throughput: 0: 43868.5. Samples: 1560218060. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 23:37:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:37:06,809][06909] Updated weights for policy 0, policy_version 101163 (0.0043) [2024-06-27 23:37:08,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 1657503744. Throughput: 0: 44026.8. Samples: 1560483720. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 23:37:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:37:11,026][06909] Updated weights for policy 0, policy_version 101173 (0.0031) [2024-06-27 23:37:13,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.7, 300 sec: 44153.7). Total num frames: 1657733120. Throughput: 0: 43939.6. Samples: 1560613680. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-27 23:37:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:37:14,420][06909] Updated weights for policy 0, policy_version 101183 (0.0030) [2024-06-27 23:37:18,293][06909] Updated weights for policy 0, policy_version 101193 (0.0038) [2024-06-27 23:37:18,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 1657962496. Throughput: 0: 43930.6. Samples: 1560878360. Policy #0 lag: (min: 1.0, avg: 10.3, max: 23.0) [2024-06-27 23:37:18,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 23:37:21,863][06909] Updated weights for policy 0, policy_version 101203 (0.0030) [2024-06-27 23:37:23,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43965.3, 300 sec: 44098.0). Total num frames: 1658159104. Throughput: 0: 44000.1. Samples: 1561137760. Policy #0 lag: (min: 1.0, avg: 10.3, max: 23.0) [2024-06-27 23:37:23,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 23:37:26,265][06909] Updated weights for policy 0, policy_version 101213 (0.0034) [2024-06-27 23:37:28,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43690.7, 300 sec: 44097.9). Total num frames: 1658388480. Throughput: 0: 43936.4. Samples: 1561267760. Policy #0 lag: (min: 1.0, avg: 10.3, max: 23.0) [2024-06-27 23:37:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:37:29,330][06909] Updated weights for policy 0, policy_version 101223 (0.0031) [2024-06-27 23:37:33,644][06909] Updated weights for policy 0, policy_version 101233 (0.0026) [2024-06-27 23:37:33,850][06674] Fps is (10 sec: 45874.4, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 1658617856. Throughput: 0: 43858.1. Samples: 1561531920. Policy #0 lag: (min: 1.0, avg: 10.3, max: 23.0) [2024-06-27 23:37:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:37:36,972][06909] Updated weights for policy 0, policy_version 101243 (0.0031) [2024-06-27 23:37:38,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.8, 300 sec: 44042.4). Total num frames: 1658814464. Throughput: 0: 43822.7. Samples: 1561790640. Policy #0 lag: (min: 1.0, avg: 10.3, max: 23.0) [2024-06-27 23:37:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:37:40,966][06909] Updated weights for policy 0, policy_version 101253 (0.0030) [2024-06-27 23:37:43,850][06674] Fps is (10 sec: 44237.6, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 1659060224. Throughput: 0: 43797.5. Samples: 1561924240. Policy #0 lag: (min: 1.0, avg: 10.3, max: 23.0) [2024-06-27 23:37:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:37:44,845][06909] Updated weights for policy 0, policy_version 101263 (0.0031) [2024-06-27 23:37:48,509][06909] Updated weights for policy 0, policy_version 101273 (0.0036) [2024-06-27 23:37:48,850][06674] Fps is (10 sec: 47513.1, 60 sec: 44509.9, 300 sec: 44154.4). Total num frames: 1659289600. Throughput: 0: 43855.5. Samples: 1562191560. Policy #0 lag: (min: 1.0, avg: 10.3, max: 23.0) [2024-06-27 23:37:48,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:37:48,863][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000101275_1659289600.pth... [2024-06-27 23:37:48,923][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000100629_1648705536.pth [2024-06-27 23:37:52,258][06909] Updated weights for policy 0, policy_version 101283 (0.0037) [2024-06-27 23:37:53,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1659486208. Throughput: 0: 43863.2. Samples: 1562457560. Policy #0 lag: (min: 1.0, avg: 10.3, max: 23.0) [2024-06-27 23:37:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:37:55,707][06909] Updated weights for policy 0, policy_version 101293 (0.0031) [2024-06-27 23:37:56,985][06887] Signal inference workers to stop experience collection... (22350 times) [2024-06-27 23:37:57,039][06909] InferenceWorker_p0-w0: stopping experience collection (22350 times) [2024-06-27 23:37:57,039][06887] Signal inference workers to resume experience collection... (22350 times) [2024-06-27 23:37:57,052][06909] InferenceWorker_p0-w0: resuming experience collection (22350 times) [2024-06-27 23:37:58,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1659731968. Throughput: 0: 43733.3. Samples: 1562581680. Policy #0 lag: (min: 1.0, avg: 10.3, max: 23.0) [2024-06-27 23:37:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:37:59,608][06909] Updated weights for policy 0, policy_version 101303 (0.0039) [2024-06-27 23:38:03,369][06909] Updated weights for policy 0, policy_version 101313 (0.0022) [2024-06-27 23:38:03,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1659928576. Throughput: 0: 43972.5. Samples: 1562857120. Policy #0 lag: (min: 1.0, avg: 10.3, max: 23.0) [2024-06-27 23:38:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:38:06,982][06909] Updated weights for policy 0, policy_version 101323 (0.0029) [2024-06-27 23:38:08,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1660141568. Throughput: 0: 43879.0. Samples: 1563112320. Policy #0 lag: (min: 1.0, avg: 10.3, max: 23.0) [2024-06-27 23:38:08,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 23:38:10,718][06909] Updated weights for policy 0, policy_version 101333 (0.0026) [2024-06-27 23:38:13,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1660387328. Throughput: 0: 43915.5. Samples: 1563243960. Policy #0 lag: (min: 1.0, avg: 10.3, max: 23.0) [2024-06-27 23:38:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:38:14,442][06909] Updated weights for policy 0, policy_version 101343 (0.0039) [2024-06-27 23:38:18,391][06909] Updated weights for policy 0, policy_version 101353 (0.0027) [2024-06-27 23:38:18,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 1660600320. Throughput: 0: 43956.1. Samples: 1563509940. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2024-06-27 23:38:18,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:38:22,119][06909] Updated weights for policy 0, policy_version 101363 (0.0037) [2024-06-27 23:38:23,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43963.6, 300 sec: 43987.8). Total num frames: 1660796928. Throughput: 0: 44152.7. Samples: 1563777520. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2024-06-27 23:38:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:38:25,599][06909] Updated weights for policy 0, policy_version 101373 (0.0032) [2024-06-27 23:38:28,850][06674] Fps is (10 sec: 44237.3, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 1661042688. Throughput: 0: 44030.3. Samples: 1563905600. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2024-06-27 23:38:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:38:29,386][06909] Updated weights for policy 0, policy_version 101383 (0.0026) [2024-06-27 23:38:33,067][06909] Updated weights for policy 0, policy_version 101393 (0.0034) [2024-06-27 23:38:33,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 1661255680. Throughput: 0: 43962.7. Samples: 1564169880. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2024-06-27 23:38:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:38:36,970][06909] Updated weights for policy 0, policy_version 101403 (0.0040) [2024-06-27 23:38:38,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1661452288. Throughput: 0: 43847.5. Samples: 1564430700. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2024-06-27 23:38:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:38:40,664][06909] Updated weights for policy 0, policy_version 101413 (0.0033) [2024-06-27 23:38:43,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.6, 300 sec: 44097.9). Total num frames: 1661698048. Throughput: 0: 44024.4. Samples: 1564562780. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2024-06-27 23:38:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-27 23:38:44,487][06909] Updated weights for policy 0, policy_version 101423 (0.0031) [2024-06-27 23:38:48,076][06909] Updated weights for policy 0, policy_version 101433 (0.0032) [2024-06-27 23:38:48,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1661911040. Throughput: 0: 43779.5. Samples: 1564827200. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2024-06-27 23:38:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:38:51,892][06909] Updated weights for policy 0, policy_version 101443 (0.0042) [2024-06-27 23:38:53,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 1662107648. Throughput: 0: 43873.8. Samples: 1565086640. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2024-06-27 23:38:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:38:55,514][06909] Updated weights for policy 0, policy_version 101453 (0.0038) [2024-06-27 23:38:58,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.7, 300 sec: 44097.9). Total num frames: 1662353408. Throughput: 0: 43857.3. Samples: 1565217540. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2024-06-27 23:38:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:38:59,594][06909] Updated weights for policy 0, policy_version 101463 (0.0036) [2024-06-27 23:39:02,815][06909] Updated weights for policy 0, policy_version 101473 (0.0037) [2024-06-27 23:39:03,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.7, 300 sec: 43987.2). Total num frames: 1662566400. Throughput: 0: 43972.9. Samples: 1565488720. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2024-06-27 23:39:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:39:06,789][06909] Updated weights for policy 0, policy_version 101483 (0.0037) [2024-06-27 23:39:08,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 1662779392. Throughput: 0: 43808.5. Samples: 1565748900. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2024-06-27 23:39:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-27 23:39:10,287][06909] Updated weights for policy 0, policy_version 101493 (0.0031) [2024-06-27 23:39:13,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1663008768. Throughput: 0: 43867.1. Samples: 1565879620. Policy #0 lag: (min: 0.0, avg: 9.6, max: 19.0) [2024-06-27 23:39:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 23:39:14,561][06909] Updated weights for policy 0, policy_version 101503 (0.0035) [2024-06-27 23:39:17,961][06909] Updated weights for policy 0, policy_version 101513 (0.0040) [2024-06-27 23:39:18,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1663238144. Throughput: 0: 43908.5. Samples: 1566145760. Policy #0 lag: (min: 0.0, avg: 10.9, max: 24.0) [2024-06-27 23:39:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:39:22,010][06909] Updated weights for policy 0, policy_version 101523 (0.0028) [2024-06-27 23:39:23,852][06674] Fps is (10 sec: 42589.4, 60 sec: 43962.3, 300 sec: 43875.5). Total num frames: 1663434752. Throughput: 0: 43914.9. Samples: 1566406960. Policy #0 lag: (min: 0.0, avg: 10.9, max: 24.0) [2024-06-27 23:39:23,853][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:39:25,322][06909] Updated weights for policy 0, policy_version 101533 (0.0027) [2024-06-27 23:39:28,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 1663664128. Throughput: 0: 43838.3. Samples: 1566535500. Policy #0 lag: (min: 0.0, avg: 10.9, max: 24.0) [2024-06-27 23:39:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 23:39:29,607][06909] Updated weights for policy 0, policy_version 101543 (0.0023) [2024-06-27 23:39:32,976][06909] Updated weights for policy 0, policy_version 101553 (0.0031) [2024-06-27 23:39:33,824][06887] Signal inference workers to stop experience collection... (22400 times) [2024-06-27 23:39:33,824][06887] Signal inference workers to resume experience collection... (22400 times) [2024-06-27 23:39:33,850][06674] Fps is (10 sec: 45885.1, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1663893504. Throughput: 0: 43893.5. Samples: 1566802400. Policy #0 lag: (min: 0.0, avg: 10.9, max: 24.0) [2024-06-27 23:39:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 23:39:33,865][06909] InferenceWorker_p0-w0: stopping experience collection (22400 times) [2024-06-27 23:39:33,865][06909] InferenceWorker_p0-w0: resuming experience collection (22400 times) [2024-06-27 23:39:36,874][06909] Updated weights for policy 0, policy_version 101563 (0.0026) [2024-06-27 23:39:38,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 1664090112. Throughput: 0: 44104.0. Samples: 1567071320. Policy #0 lag: (min: 0.0, avg: 10.9, max: 24.0) [2024-06-27 23:39:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:39:40,158][06909] Updated weights for policy 0, policy_version 101573 (0.0028) [2024-06-27 23:39:43,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43690.7, 300 sec: 43987.8). Total num frames: 1664319488. Throughput: 0: 44061.3. Samples: 1567200300. Policy #0 lag: (min: 0.0, avg: 10.9, max: 24.0) [2024-06-27 23:39:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:39:44,077][06909] Updated weights for policy 0, policy_version 101583 (0.0035) [2024-06-27 23:39:47,600][06909] Updated weights for policy 0, policy_version 101593 (0.0036) [2024-06-27 23:39:48,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1664548864. Throughput: 0: 43877.8. Samples: 1567463220. Policy #0 lag: (min: 0.0, avg: 10.9, max: 24.0) [2024-06-27 23:39:48,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 23:39:48,856][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000101596_1664548864.pth... [2024-06-27 23:39:48,920][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000100952_1653997568.pth [2024-06-27 23:39:51,690][06909] Updated weights for policy 0, policy_version 101603 (0.0029) [2024-06-27 23:39:53,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44236.8, 300 sec: 43875.8). Total num frames: 1664761856. Throughput: 0: 44169.4. Samples: 1567736520. Policy #0 lag: (min: 0.0, avg: 10.9, max: 24.0) [2024-06-27 23:39:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:39:55,066][06909] Updated weights for policy 0, policy_version 101613 (0.0037) [2024-06-27 23:39:58,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43690.8, 300 sec: 43931.4). Total num frames: 1664974848. Throughput: 0: 44035.1. Samples: 1567861200. Policy #0 lag: (min: 0.0, avg: 10.9, max: 24.0) [2024-06-27 23:39:58,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-27 23:39:58,944][06909] Updated weights for policy 0, policy_version 101623 (0.0040) [2024-06-27 23:40:02,374][06909] Updated weights for policy 0, policy_version 101633 (0.0030) [2024-06-27 23:40:03,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 1665220608. Throughput: 0: 43979.6. Samples: 1568124840. Policy #0 lag: (min: 0.0, avg: 10.9, max: 24.0) [2024-06-27 23:40:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:40:06,535][06909] Updated weights for policy 0, policy_version 101643 (0.0027) [2024-06-27 23:40:08,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.8, 300 sec: 43820.3). Total num frames: 1665417216. Throughput: 0: 44103.4. Samples: 1568391520. Policy #0 lag: (min: 0.0, avg: 10.9, max: 24.0) [2024-06-27 23:40:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:40:10,025][06909] Updated weights for policy 0, policy_version 101653 (0.0037) [2024-06-27 23:40:13,850][06674] Fps is (10 sec: 40959.1, 60 sec: 43690.5, 300 sec: 43931.3). Total num frames: 1665630208. Throughput: 0: 44027.8. Samples: 1568516760. Policy #0 lag: (min: 0.0, avg: 10.9, max: 24.0) [2024-06-27 23:40:13,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:40:13,991][06909] Updated weights for policy 0, policy_version 101663 (0.0041) [2024-06-27 23:40:17,290][06909] Updated weights for policy 0, policy_version 101673 (0.0026) [2024-06-27 23:40:18,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1665875968. Throughput: 0: 43893.6. Samples: 1568777620. Policy #0 lag: (min: 0.0, avg: 10.9, max: 24.0) [2024-06-27 23:40:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:40:21,318][06909] Updated weights for policy 0, policy_version 101683 (0.0047) [2024-06-27 23:40:23,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43965.3, 300 sec: 43820.3). Total num frames: 1666072576. Throughput: 0: 43844.1. Samples: 1569044300. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-27 23:40:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:40:24,848][06909] Updated weights for policy 0, policy_version 101693 (0.0036) [2024-06-27 23:40:28,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 1666285568. Throughput: 0: 43775.6. Samples: 1569170200. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-27 23:40:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:40:29,192][06909] Updated weights for policy 0, policy_version 101703 (0.0034) [2024-06-27 23:40:32,167][06909] Updated weights for policy 0, policy_version 101713 (0.0028) [2024-06-27 23:40:33,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 1666514944. Throughput: 0: 43709.9. Samples: 1569430160. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-27 23:40:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:40:36,573][06909] Updated weights for policy 0, policy_version 101723 (0.0032) [2024-06-27 23:40:38,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.8, 300 sec: 43820.2). Total num frames: 1666727936. Throughput: 0: 43695.1. Samples: 1569702800. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-27 23:40:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:40:39,693][06909] Updated weights for policy 0, policy_version 101733 (0.0037) [2024-06-27 23:40:43,758][06909] Updated weights for policy 0, policy_version 101743 (0.0030) [2024-06-27 23:40:43,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 1666957312. Throughput: 0: 43766.2. Samples: 1569830680. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-27 23:40:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:40:47,406][06909] Updated weights for policy 0, policy_version 101753 (0.0028) [2024-06-27 23:40:48,850][06674] Fps is (10 sec: 47513.3, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1667203072. Throughput: 0: 43836.8. Samples: 1570097500. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-27 23:40:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 23:40:51,506][06909] Updated weights for policy 0, policy_version 101763 (0.0034) [2024-06-27 23:40:53,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 43820.3). Total num frames: 1667399680. Throughput: 0: 43842.7. Samples: 1570364440. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-27 23:40:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:40:54,759][06909] Updated weights for policy 0, policy_version 101773 (0.0036) [2024-06-27 23:40:58,736][06909] Updated weights for policy 0, policy_version 101783 (0.0043) [2024-06-27 23:40:58,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43963.7, 300 sec: 43931.6). Total num frames: 1667612672. Throughput: 0: 43918.9. Samples: 1570493100. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-27 23:40:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:41:01,989][06909] Updated weights for policy 0, policy_version 101793 (0.0029) [2024-06-27 23:41:03,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1667842048. Throughput: 0: 44053.5. Samples: 1570760020. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-27 23:41:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:41:06,415][06909] Updated weights for policy 0, policy_version 101803 (0.0045) [2024-06-27 23:41:08,850][06674] Fps is (10 sec: 45874.1, 60 sec: 44236.6, 300 sec: 43931.3). Total num frames: 1668071424. Throughput: 0: 44050.0. Samples: 1571026560. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-27 23:41:08,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:41:09,481][06909] Updated weights for policy 0, policy_version 101813 (0.0033) [2024-06-27 23:41:13,619][06887] Signal inference workers to stop experience collection... (22450 times) [2024-06-27 23:41:13,620][06887] Signal inference workers to resume experience collection... (22450 times) [2024-06-27 23:41:13,627][06909] Updated weights for policy 0, policy_version 101823 (0.0037) [2024-06-27 23:41:13,645][06909] InferenceWorker_p0-w0: stopping experience collection (22450 times) [2024-06-27 23:41:13,645][06909] InferenceWorker_p0-w0: resuming experience collection (22450 times) [2024-06-27 23:41:13,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 1668284416. Throughput: 0: 44089.4. Samples: 1571154220. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-27 23:41:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:41:17,198][06909] Updated weights for policy 0, policy_version 101833 (0.0040) [2024-06-27 23:41:18,850][06674] Fps is (10 sec: 44237.8, 60 sec: 43963.8, 300 sec: 44042.7). Total num frames: 1668513792. Throughput: 0: 44205.3. Samples: 1571419400. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-27 23:41:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:41:21,419][06909] Updated weights for policy 0, policy_version 101843 (0.0037) [2024-06-27 23:41:23,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 1668710400. Throughput: 0: 44023.6. Samples: 1571683860. Policy #0 lag: (min: 1.0, avg: 11.4, max: 21.0) [2024-06-27 23:41:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 23:41:24,487][06909] Updated weights for policy 0, policy_version 101853 (0.0028) [2024-06-27 23:41:28,542][06909] Updated weights for policy 0, policy_version 101863 (0.0036) [2024-06-27 23:41:28,850][06674] Fps is (10 sec: 42598.0, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1668939776. Throughput: 0: 44092.8. Samples: 1571814860. Policy #0 lag: (min: 1.0, avg: 11.4, max: 21.0) [2024-06-27 23:41:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:41:31,570][06909] Updated weights for policy 0, policy_version 101873 (0.0029) [2024-06-27 23:41:33,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 1669152768. Throughput: 0: 44230.3. Samples: 1572087860. Policy #0 lag: (min: 1.0, avg: 11.4, max: 21.0) [2024-06-27 23:41:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:41:35,798][06909] Updated weights for policy 0, policy_version 101883 (0.0030) [2024-06-27 23:41:38,850][06674] Fps is (10 sec: 45875.9, 60 sec: 44509.9, 300 sec: 43986.9). Total num frames: 1669398528. Throughput: 0: 44052.1. Samples: 1572346780. Policy #0 lag: (min: 1.0, avg: 11.4, max: 21.0) [2024-06-27 23:41:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:41:38,921][06909] Updated weights for policy 0, policy_version 101893 (0.0029) [2024-06-27 23:41:43,490][06909] Updated weights for policy 0, policy_version 101903 (0.0039) [2024-06-27 23:41:43,852][06674] Fps is (10 sec: 44228.0, 60 sec: 43962.2, 300 sec: 43986.6). Total num frames: 1669595136. Throughput: 0: 44126.4. Samples: 1572478880. Policy #0 lag: (min: 1.0, avg: 11.4, max: 21.0) [2024-06-27 23:41:43,852][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:41:46,513][06909] Updated weights for policy 0, policy_version 101913 (0.0033) [2024-06-27 23:41:48,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1669840896. Throughput: 0: 44279.9. Samples: 1572752620. Policy #0 lag: (min: 1.0, avg: 11.4, max: 21.0) [2024-06-27 23:41:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:41:48,862][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000101919_1669840896.pth... [2024-06-27 23:41:48,905][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000101275_1659289600.pth [2024-06-27 23:41:50,843][06909] Updated weights for policy 0, policy_version 101923 (0.0039) [2024-06-27 23:41:53,850][06674] Fps is (10 sec: 45884.7, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1670053888. Throughput: 0: 43958.9. Samples: 1573004700. Policy #0 lag: (min: 1.0, avg: 11.4, max: 21.0) [2024-06-27 23:41:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:41:54,268][06909] Updated weights for policy 0, policy_version 101933 (0.0029) [2024-06-27 23:41:58,267][06909] Updated weights for policy 0, policy_version 101943 (0.0022) [2024-06-27 23:41:58,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 1670250496. Throughput: 0: 44136.4. Samples: 1573140360. Policy #0 lag: (min: 1.0, avg: 11.4, max: 21.0) [2024-06-27 23:41:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:42:01,733][06909] Updated weights for policy 0, policy_version 101953 (0.0026) [2024-06-27 23:42:03,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 1670496256. Throughput: 0: 44039.9. Samples: 1573401200. Policy #0 lag: (min: 1.0, avg: 11.4, max: 21.0) [2024-06-27 23:42:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:42:05,850][06909] Updated weights for policy 0, policy_version 101963 (0.0029) [2024-06-27 23:42:08,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1670709248. Throughput: 0: 44005.7. Samples: 1573664120. Policy #0 lag: (min: 1.0, avg: 11.4, max: 21.0) [2024-06-27 23:42:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:42:08,966][06909] Updated weights for policy 0, policy_version 101973 (0.0028) [2024-06-27 23:42:13,170][06909] Updated weights for policy 0, policy_version 101983 (0.0028) [2024-06-27 23:42:13,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 1670905856. Throughput: 0: 44105.4. Samples: 1573799600. Policy #0 lag: (min: 1.0, avg: 11.4, max: 21.0) [2024-06-27 23:42:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:42:16,297][06909] Updated weights for policy 0, policy_version 101993 (0.0035) [2024-06-27 23:42:18,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 1671135232. Throughput: 0: 43844.4. Samples: 1574060860. Policy #0 lag: (min: 1.0, avg: 11.4, max: 21.0) [2024-06-27 23:42:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:42:20,980][06909] Updated weights for policy 0, policy_version 102003 (0.0030) [2024-06-27 23:42:23,687][06909] Updated weights for policy 0, policy_version 102013 (0.0037) [2024-06-27 23:42:23,850][06674] Fps is (10 sec: 47513.2, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 1671380992. Throughput: 0: 43871.9. Samples: 1574321020. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 23:42:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:42:28,116][06909] Updated weights for policy 0, policy_version 102023 (0.0033) [2024-06-27 23:42:28,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 1671593984. Throughput: 0: 44012.2. Samples: 1574459340. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 23:42:28,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:42:31,913][06909] Updated weights for policy 0, policy_version 102033 (0.0027) [2024-06-27 23:42:33,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44509.9, 300 sec: 44097.9). Total num frames: 1671823360. Throughput: 0: 43752.9. Samples: 1574721500. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 23:42:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 23:42:35,616][06909] Updated weights for policy 0, policy_version 102043 (0.0035) [2024-06-27 23:42:38,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 1672019968. Throughput: 0: 44163.9. Samples: 1574992080. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 23:42:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:42:39,008][06909] Updated weights for policy 0, policy_version 102053 (0.0035) [2024-06-27 23:42:41,308][06887] Signal inference workers to stop experience collection... (22500 times) [2024-06-27 23:42:41,312][06887] Signal inference workers to resume experience collection... (22500 times) [2024-06-27 23:42:41,354][06909] InferenceWorker_p0-w0: stopping experience collection (22500 times) [2024-06-27 23:42:41,354][06909] InferenceWorker_p0-w0: resuming experience collection (22500 times) [2024-06-27 23:42:42,767][06909] Updated weights for policy 0, policy_version 102063 (0.0028) [2024-06-27 23:42:43,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44511.3, 300 sec: 43986.9). Total num frames: 1672265728. Throughput: 0: 44167.1. Samples: 1575127880. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 23:42:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:42:46,287][06909] Updated weights for policy 0, policy_version 102073 (0.0037) [2024-06-27 23:42:48,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1672478720. Throughput: 0: 44266.1. Samples: 1575393180. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 23:42:48,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:42:50,041][06909] Updated weights for policy 0, policy_version 102083 (0.0038) [2024-06-27 23:42:53,491][06909] Updated weights for policy 0, policy_version 102093 (0.0028) [2024-06-27 23:42:53,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 1672691712. Throughput: 0: 44332.5. Samples: 1575659080. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 23:42:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:42:57,392][06909] Updated weights for policy 0, policy_version 102103 (0.0035) [2024-06-27 23:42:58,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44782.9, 300 sec: 44097.9). Total num frames: 1672937472. Throughput: 0: 44335.9. Samples: 1575794720. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 23:42:58,853][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:43:00,710][06909] Updated weights for policy 0, policy_version 102113 (0.0036) [2024-06-27 23:43:03,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1673134080. Throughput: 0: 44379.1. Samples: 1576057920. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 23:43:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:43:04,907][06909] Updated weights for policy 0, policy_version 102123 (0.0028) [2024-06-27 23:43:08,538][06909] Updated weights for policy 0, policy_version 102133 (0.0026) [2024-06-27 23:43:08,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1673363456. Throughput: 0: 44576.4. Samples: 1576326960. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 23:43:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:43:12,387][06909] Updated weights for policy 0, policy_version 102143 (0.0036) [2024-06-27 23:43:13,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44509.8, 300 sec: 43986.9). Total num frames: 1673576448. Throughput: 0: 44321.7. Samples: 1576453820. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 23:43:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:43:16,019][06909] Updated weights for policy 0, policy_version 102153 (0.0038) [2024-06-27 23:43:18,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1673789440. Throughput: 0: 44208.8. Samples: 1576710900. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 23:43:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:43:19,803][06909] Updated weights for policy 0, policy_version 102163 (0.0028) [2024-06-27 23:43:23,480][06909] Updated weights for policy 0, policy_version 102173 (0.0031) [2024-06-27 23:43:23,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 43986.8). Total num frames: 1674018816. Throughput: 0: 44179.5. Samples: 1576980160. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-27 23:43:23,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:43:27,566][06909] Updated weights for policy 0, policy_version 102183 (0.0038) [2024-06-27 23:43:28,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1674231808. Throughput: 0: 43984.5. Samples: 1577107180. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-27 23:43:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 23:43:30,843][06909] Updated weights for policy 0, policy_version 102193 (0.0035) [2024-06-27 23:43:33,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 1674444800. Throughput: 0: 43808.0. Samples: 1577364540. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-27 23:43:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:43:34,856][06909] Updated weights for policy 0, policy_version 102203 (0.0030) [2024-06-27 23:43:38,617][06909] Updated weights for policy 0, policy_version 102213 (0.0042) [2024-06-27 23:43:38,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1674674176. Throughput: 0: 43903.9. Samples: 1577634760. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-27 23:43:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:43:42,210][06909] Updated weights for policy 0, policy_version 102223 (0.0033) [2024-06-27 23:43:43,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1674903552. Throughput: 0: 43892.0. Samples: 1577769860. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-27 23:43:43,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:43:45,889][06909] Updated weights for policy 0, policy_version 102233 (0.0038) [2024-06-27 23:43:48,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 1675116544. Throughput: 0: 43752.1. Samples: 1578026760. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-27 23:43:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:43:48,861][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000102241_1675116544.pth... [2024-06-27 23:43:48,909][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000101596_1664548864.pth [2024-06-27 23:43:49,636][06909] Updated weights for policy 0, policy_version 102243 (0.0023) [2024-06-27 23:43:53,411][06909] Updated weights for policy 0, policy_version 102253 (0.0032) [2024-06-27 23:43:53,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1675329536. Throughput: 0: 43800.6. Samples: 1578297980. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-27 23:43:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:43:57,059][06909] Updated weights for policy 0, policy_version 102263 (0.0030) [2024-06-27 23:43:58,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1675558912. Throughput: 0: 43968.9. Samples: 1578432420. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-27 23:43:58,850][06674] Avg episode reward: [(0, '0.408')] [2024-06-27 23:44:00,724][06909] Updated weights for policy 0, policy_version 102273 (0.0033) [2024-06-27 23:44:03,852][06674] Fps is (10 sec: 42589.6, 60 sec: 43689.2, 300 sec: 43986.6). Total num frames: 1675755520. Throughput: 0: 43946.5. Samples: 1578688580. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-27 23:44:03,852][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:44:04,487][06909] Updated weights for policy 0, policy_version 102283 (0.0036) [2024-06-27 23:44:07,928][06909] Updated weights for policy 0, policy_version 102293 (0.0037) [2024-06-27 23:44:08,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1675984896. Throughput: 0: 43959.2. Samples: 1578958320. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-27 23:44:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:44:12,264][06887] Signal inference workers to stop experience collection... (22550 times) [2024-06-27 23:44:12,314][06887] Signal inference workers to resume experience collection... (22550 times) [2024-06-27 23:44:12,314][06909] InferenceWorker_p0-w0: stopping experience collection (22550 times) [2024-06-27 23:44:12,317][06909] Updated weights for policy 0, policy_version 102303 (0.0031) [2024-06-27 23:44:12,340][06909] InferenceWorker_p0-w0: resuming experience collection (22550 times) [2024-06-27 23:44:13,850][06674] Fps is (10 sec: 45884.5, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1676214272. Throughput: 0: 44098.3. Samples: 1579091600. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-27 23:44:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:44:15,687][06909] Updated weights for policy 0, policy_version 102313 (0.0030) [2024-06-27 23:44:18,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.8, 300 sec: 44042.7). Total num frames: 1676427264. Throughput: 0: 44066.7. Samples: 1579347540. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-27 23:44:18,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:44:19,470][06909] Updated weights for policy 0, policy_version 102323 (0.0031) [2024-06-27 23:44:23,033][06909] Updated weights for policy 0, policy_version 102333 (0.0040) [2024-06-27 23:44:23,852][06674] Fps is (10 sec: 45865.8, 60 sec: 44235.3, 300 sec: 44097.6). Total num frames: 1676673024. Throughput: 0: 44080.7. Samples: 1579618480. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-27 23:44:23,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:44:26,603][06909] Updated weights for policy 0, policy_version 102343 (0.0032) [2024-06-27 23:44:28,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1676869632. Throughput: 0: 44025.9. Samples: 1579751020. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-27 23:44:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:44:30,465][06909] Updated weights for policy 0, policy_version 102353 (0.0027) [2024-06-27 23:44:33,850][06674] Fps is (10 sec: 42607.0, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1677099008. Throughput: 0: 44185.3. Samples: 1580015100. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-27 23:44:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:44:34,044][06909] Updated weights for policy 0, policy_version 102363 (0.0031) [2024-06-27 23:44:37,712][06909] Updated weights for policy 0, policy_version 102373 (0.0025) [2024-06-27 23:44:38,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1677328384. Throughput: 0: 44031.9. Samples: 1580279420. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-27 23:44:38,856][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:44:41,562][06909] Updated weights for policy 0, policy_version 102383 (0.0038) [2024-06-27 23:44:43,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1677524992. Throughput: 0: 44076.5. Samples: 1580415860. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-27 23:44:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:44:45,036][06909] Updated weights for policy 0, policy_version 102393 (0.0022) [2024-06-27 23:44:48,852][06674] Fps is (10 sec: 42589.8, 60 sec: 43962.2, 300 sec: 44042.1). Total num frames: 1677754368. Throughput: 0: 44202.2. Samples: 1580677680. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-27 23:44:48,852][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:44:49,170][06909] Updated weights for policy 0, policy_version 102403 (0.0046) [2024-06-27 23:44:52,757][06909] Updated weights for policy 0, policy_version 102413 (0.0030) [2024-06-27 23:44:53,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1677983744. Throughput: 0: 44033.8. Samples: 1580939840. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-27 23:44:53,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:44:56,722][06909] Updated weights for policy 0, policy_version 102423 (0.0040) [2024-06-27 23:44:58,850][06674] Fps is (10 sec: 42606.8, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 1678180352. Throughput: 0: 43985.7. Samples: 1581070960. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-27 23:44:58,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:45:00,338][06909] Updated weights for policy 0, policy_version 102433 (0.0033) [2024-06-27 23:45:03,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44238.3, 300 sec: 44042.4). Total num frames: 1678409728. Throughput: 0: 44153.8. Samples: 1581334460. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-27 23:45:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:45:04,006][06909] Updated weights for policy 0, policy_version 102443 (0.0030) [2024-06-27 23:45:07,638][06909] Updated weights for policy 0, policy_version 102453 (0.0041) [2024-06-27 23:45:08,850][06674] Fps is (10 sec: 49152.2, 60 sec: 44782.9, 300 sec: 44209.0). Total num frames: 1678671872. Throughput: 0: 44007.3. Samples: 1581598720. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-27 23:45:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:45:11,444][06909] Updated weights for policy 0, policy_version 102463 (0.0035) [2024-06-27 23:45:13,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 1678835712. Throughput: 0: 44095.5. Samples: 1581735320. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-27 23:45:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:45:14,878][06909] Updated weights for policy 0, policy_version 102473 (0.0027) [2024-06-27 23:45:18,851][06674] Fps is (10 sec: 39318.6, 60 sec: 43963.2, 300 sec: 44042.3). Total num frames: 1679065088. Throughput: 0: 43924.1. Samples: 1581991720. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-27 23:45:18,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:45:19,134][06909] Updated weights for policy 0, policy_version 102483 (0.0037) [2024-06-27 23:45:22,546][06909] Updated weights for policy 0, policy_version 102493 (0.0031) [2024-06-27 23:45:23,850][06674] Fps is (10 sec: 47513.6, 60 sec: 43965.2, 300 sec: 44153.5). Total num frames: 1679310848. Throughput: 0: 44017.8. Samples: 1582260220. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-27 23:45:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:45:26,481][06909] Updated weights for policy 0, policy_version 102503 (0.0029) [2024-06-27 23:45:28,850][06674] Fps is (10 sec: 42601.9, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 1679491072. Throughput: 0: 43962.2. Samples: 1582394160. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-27 23:45:28,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:45:30,050][06909] Updated weights for policy 0, policy_version 102513 (0.0032) [2024-06-27 23:45:33,705][06909] Updated weights for policy 0, policy_version 102523 (0.0045) [2024-06-27 23:45:33,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 1679736832. Throughput: 0: 43889.1. Samples: 1582652600. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-27 23:45:33,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 23:45:37,320][06909] Updated weights for policy 0, policy_version 102533 (0.0037) [2024-06-27 23:45:37,334][06887] Signal inference workers to stop experience collection... (22600 times) [2024-06-27 23:45:37,334][06887] Signal inference workers to resume experience collection... (22600 times) [2024-06-27 23:45:37,374][06909] InferenceWorker_p0-w0: stopping experience collection (22600 times) [2024-06-27 23:45:37,374][06909] InferenceWorker_p0-w0: resuming experience collection (22600 times) [2024-06-27 23:45:38,850][06674] Fps is (10 sec: 49151.9, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1679982592. Throughput: 0: 43956.4. Samples: 1582917880. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-27 23:45:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:45:40,836][06909] Updated weights for policy 0, policy_version 102543 (0.0041) [2024-06-27 23:45:43,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 1680146432. Throughput: 0: 44052.0. Samples: 1583053300. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-27 23:45:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:45:44,618][06909] Updated weights for policy 0, policy_version 102553 (0.0037) [2024-06-27 23:45:48,010][06909] Updated weights for policy 0, policy_version 102563 (0.0023) [2024-06-27 23:45:48,850][06674] Fps is (10 sec: 42598.1, 60 sec: 44238.3, 300 sec: 44097.9). Total num frames: 1680408576. Throughput: 0: 44141.2. Samples: 1583320820. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-27 23:45:48,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:45:48,860][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000102564_1680408576.pth... [2024-06-27 23:45:48,937][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000101919_1669840896.pth [2024-06-27 23:45:52,030][06909] Updated weights for policy 0, policy_version 102573 (0.0043) [2024-06-27 23:45:53,850][06674] Fps is (10 sec: 49152.5, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 1680637952. Throughput: 0: 43973.4. Samples: 1583577520. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-27 23:45:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:45:55,905][06909] Updated weights for policy 0, policy_version 102583 (0.0036) [2024-06-27 23:45:58,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1680834560. Throughput: 0: 44035.1. Samples: 1583716900. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-27 23:45:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:45:59,735][06909] Updated weights for policy 0, policy_version 102593 (0.0028) [2024-06-27 23:46:03,818][06909] Updated weights for policy 0, policy_version 102603 (0.0032) [2024-06-27 23:46:03,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1681047552. Throughput: 0: 44019.0. Samples: 1583972540. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-27 23:46:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:46:07,277][06909] Updated weights for policy 0, policy_version 102613 (0.0032) [2024-06-27 23:46:08,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43417.6, 300 sec: 44042.4). Total num frames: 1681276928. Throughput: 0: 43808.9. Samples: 1584231620. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-27 23:46:08,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:46:11,040][06909] Updated weights for policy 0, policy_version 102623 (0.0030) [2024-06-27 23:46:13,850][06674] Fps is (10 sec: 40960.6, 60 sec: 43690.8, 300 sec: 43875.8). Total num frames: 1681457152. Throughput: 0: 43972.1. Samples: 1584372900. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-27 23:46:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:46:14,800][06909] Updated weights for policy 0, policy_version 102633 (0.0035) [2024-06-27 23:46:18,332][06909] Updated weights for policy 0, policy_version 102643 (0.0026) [2024-06-27 23:46:18,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44510.5, 300 sec: 44153.5). Total num frames: 1681735680. Throughput: 0: 44051.6. Samples: 1584634920. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-27 23:46:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:46:21,905][06909] Updated weights for policy 0, policy_version 102653 (0.0041) [2024-06-27 23:46:23,850][06674] Fps is (10 sec: 47512.9, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1681932288. Throughput: 0: 44152.9. Samples: 1584904760. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-27 23:46:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:46:25,720][06909] Updated weights for policy 0, policy_version 102663 (0.0031) [2024-06-27 23:46:28,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 1682161664. Throughput: 0: 44119.3. Samples: 1585038660. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-27 23:46:28,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:46:29,472][06909] Updated weights for policy 0, policy_version 102673 (0.0031) [2024-06-27 23:46:33,340][06909] Updated weights for policy 0, policy_version 102683 (0.0020) [2024-06-27 23:46:33,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1682391040. Throughput: 0: 44100.1. Samples: 1585305320. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-27 23:46:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:46:36,598][06909] Updated weights for policy 0, policy_version 102693 (0.0027) [2024-06-27 23:46:38,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43690.6, 300 sec: 44098.2). Total num frames: 1682604032. Throughput: 0: 44370.6. Samples: 1585574200. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-27 23:46:38,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:46:40,472][06909] Updated weights for policy 0, policy_version 102703 (0.0031) [2024-06-27 23:46:43,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44783.0, 300 sec: 44042.4). Total num frames: 1682833408. Throughput: 0: 44236.0. Samples: 1585707520. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-27 23:46:43,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:46:44,202][06909] Updated weights for policy 0, policy_version 102713 (0.0039) [2024-06-27 23:46:47,686][06909] Updated weights for policy 0, policy_version 102723 (0.0032) [2024-06-27 23:46:48,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 1683062784. Throughput: 0: 44335.0. Samples: 1585967620. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-27 23:46:48,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:46:51,455][06909] Updated weights for policy 0, policy_version 102733 (0.0036) [2024-06-27 23:46:52,413][06887] Signal inference workers to stop experience collection... (22650 times) [2024-06-27 23:46:52,463][06909] InferenceWorker_p0-w0: stopping experience collection (22650 times) [2024-06-27 23:46:52,465][06887] Signal inference workers to resume experience collection... (22650 times) [2024-06-27 23:46:52,473][06909] InferenceWorker_p0-w0: resuming experience collection (22650 times) [2024-06-27 23:46:53,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 1683275776. Throughput: 0: 44639.1. Samples: 1586240380. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-27 23:46:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:46:54,956][06909] Updated weights for policy 0, policy_version 102743 (0.0029) [2024-06-27 23:46:58,708][06909] Updated weights for policy 0, policy_version 102753 (0.0043) [2024-06-27 23:46:58,850][06674] Fps is (10 sec: 44237.5, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 1683505152. Throughput: 0: 44402.6. Samples: 1586371020. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-27 23:46:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 23:47:02,395][06909] Updated weights for policy 0, policy_version 102763 (0.0027) [2024-06-27 23:47:03,850][06674] Fps is (10 sec: 42598.7, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1683701760. Throughput: 0: 44472.8. Samples: 1586636200. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-27 23:47:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:47:06,266][06909] Updated weights for policy 0, policy_version 102773 (0.0026) [2024-06-27 23:47:08,850][06674] Fps is (10 sec: 42598.0, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1683931136. Throughput: 0: 44339.1. Samples: 1586900020. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-27 23:47:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:47:10,044][06909] Updated weights for policy 0, policy_version 102783 (0.0037) [2024-06-27 23:47:13,581][06909] Updated weights for policy 0, policy_version 102793 (0.0045) [2024-06-27 23:47:13,850][06674] Fps is (10 sec: 45875.0, 60 sec: 45055.9, 300 sec: 44153.5). Total num frames: 1684160512. Throughput: 0: 44298.1. Samples: 1587032080. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-27 23:47:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 23:47:17,298][06909] Updated weights for policy 0, policy_version 102803 (0.0027) [2024-06-27 23:47:18,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1684373504. Throughput: 0: 44131.6. Samples: 1587291240. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-27 23:47:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:47:21,316][06909] Updated weights for policy 0, policy_version 102813 (0.0037) [2024-06-27 23:47:23,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1684586496. Throughput: 0: 44129.4. Samples: 1587560020. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-27 23:47:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 23:47:24,803][06909] Updated weights for policy 0, policy_version 102823 (0.0027) [2024-06-27 23:47:28,639][06909] Updated weights for policy 0, policy_version 102833 (0.0033) [2024-06-27 23:47:28,850][06674] Fps is (10 sec: 44235.6, 60 sec: 44236.6, 300 sec: 44042.4). Total num frames: 1684815872. Throughput: 0: 44045.2. Samples: 1587689560. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-27 23:47:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:47:32,182][06909] Updated weights for policy 0, policy_version 102843 (0.0026) [2024-06-27 23:47:33,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 1685045248. Throughput: 0: 44157.8. Samples: 1587954720. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-27 23:47:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:47:35,963][06909] Updated weights for policy 0, policy_version 102853 (0.0025) [2024-06-27 23:47:38,850][06674] Fps is (10 sec: 45876.7, 60 sec: 44510.0, 300 sec: 44098.0). Total num frames: 1685274624. Throughput: 0: 44031.3. Samples: 1588221780. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-27 23:47:38,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 23:47:40,057][06909] Updated weights for policy 0, policy_version 102863 (0.0034) [2024-06-27 23:47:43,395][06909] Updated weights for policy 0, policy_version 102873 (0.0030) [2024-06-27 23:47:43,850][06674] Fps is (10 sec: 44237.3, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 1685487616. Throughput: 0: 44034.2. Samples: 1588352560. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-27 23:47:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:47:47,320][06909] Updated weights for policy 0, policy_version 102883 (0.0027) [2024-06-27 23:47:48,850][06674] Fps is (10 sec: 40959.3, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1685684224. Throughput: 0: 44080.8. Samples: 1588619840. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-27 23:47:48,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:47:48,880][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000102886_1685684224.pth... [2024-06-27 23:47:48,936][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000102241_1675116544.pth [2024-06-27 23:47:50,606][06909] Updated weights for policy 0, policy_version 102893 (0.0033) [2024-06-27 23:47:53,082][06887] Signal inference workers to stop experience collection... (22700 times) [2024-06-27 23:47:53,082][06887] Signal inference workers to resume experience collection... (22700 times) [2024-06-27 23:47:53,132][06909] InferenceWorker_p0-w0: stopping experience collection (22700 times) [2024-06-27 23:47:53,132][06909] InferenceWorker_p0-w0: resuming experience collection (22700 times) [2024-06-27 23:47:53,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1685913600. Throughput: 0: 44072.9. Samples: 1588883300. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-27 23:47:53,851][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 23:47:54,646][06909] Updated weights for policy 0, policy_version 102903 (0.0033) [2024-06-27 23:47:58,112][06909] Updated weights for policy 0, policy_version 102913 (0.0037) [2024-06-27 23:47:58,850][06674] Fps is (10 sec: 47513.9, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1686159360. Throughput: 0: 44178.3. Samples: 1589020100. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-27 23:47:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:48:02,031][06909] Updated weights for policy 0, policy_version 102923 (0.0028) [2024-06-27 23:48:03,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1686339584. Throughput: 0: 44292.5. Samples: 1589284400. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-27 23:48:03,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 23:48:05,628][06909] Updated weights for policy 0, policy_version 102933 (0.0043) [2024-06-27 23:48:08,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1686568960. Throughput: 0: 44081.4. Samples: 1589543680. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-27 23:48:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:48:09,694][06909] Updated weights for policy 0, policy_version 102943 (0.0029) [2024-06-27 23:48:12,994][06909] Updated weights for policy 0, policy_version 102953 (0.0036) [2024-06-27 23:48:13,850][06674] Fps is (10 sec: 49151.8, 60 sec: 44509.9, 300 sec: 44209.0). Total num frames: 1686831104. Throughput: 0: 44242.0. Samples: 1589680440. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-27 23:48:13,856][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:48:16,998][06909] Updated weights for policy 0, policy_version 102963 (0.0026) [2024-06-27 23:48:18,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1687011328. Throughput: 0: 44043.1. Samples: 1589936660. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-27 23:48:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:48:20,493][06909] Updated weights for policy 0, policy_version 102973 (0.0030) [2024-06-27 23:48:23,850][06674] Fps is (10 sec: 39321.6, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1687224320. Throughput: 0: 44043.9. Samples: 1590203760. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-27 23:48:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:48:24,394][06909] Updated weights for policy 0, policy_version 102983 (0.0032) [2024-06-27 23:48:27,848][06909] Updated weights for policy 0, policy_version 102993 (0.0029) [2024-06-27 23:48:28,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44237.0, 300 sec: 44153.5). Total num frames: 1687470080. Throughput: 0: 44147.6. Samples: 1590339200. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-27 23:48:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:48:32,130][06909] Updated weights for policy 0, policy_version 103003 (0.0037) [2024-06-27 23:48:33,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1687666688. Throughput: 0: 44004.5. Samples: 1590600040. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-27 23:48:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:48:35,319][06909] Updated weights for policy 0, policy_version 103013 (0.0038) [2024-06-27 23:48:38,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 1687896064. Throughput: 0: 43874.7. Samples: 1590857660. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-27 23:48:38,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:48:39,700][06909] Updated weights for policy 0, policy_version 103023 (0.0033) [2024-06-27 23:48:42,990][06909] Updated weights for policy 0, policy_version 103033 (0.0028) [2024-06-27 23:48:43,850][06674] Fps is (10 sec: 47513.2, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 1688141824. Throughput: 0: 43980.4. Samples: 1590999220. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-27 23:48:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:48:47,056][06909] Updated weights for policy 0, policy_version 103043 (0.0040) [2024-06-27 23:48:48,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1688305664. Throughput: 0: 43781.7. Samples: 1591254580. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-27 23:48:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:48:50,325][06909] Updated weights for policy 0, policy_version 103053 (0.0031) [2024-06-27 23:48:53,850][06674] Fps is (10 sec: 39321.6, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1688535040. Throughput: 0: 43835.5. Samples: 1591516280. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-27 23:48:53,860][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:48:54,424][06909] Updated weights for policy 0, policy_version 103063 (0.0037) [2024-06-27 23:48:57,585][06909] Updated weights for policy 0, policy_version 103073 (0.0031) [2024-06-27 23:48:58,850][06674] Fps is (10 sec: 47513.2, 60 sec: 43690.6, 300 sec: 44153.8). Total num frames: 1688780800. Throughput: 0: 43890.5. Samples: 1591655520. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-27 23:48:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-27 23:49:01,669][06909] Updated weights for policy 0, policy_version 103083 (0.0028) [2024-06-27 23:49:03,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1688977408. Throughput: 0: 44033.8. Samples: 1591918180. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-27 23:49:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:49:05,123][06887] Signal inference workers to stop experience collection... (22750 times) [2024-06-27 23:49:05,125][06887] Signal inference workers to resume experience collection... (22750 times) [2024-06-27 23:49:05,144][06909] InferenceWorker_p0-w0: stopping experience collection (22750 times) [2024-06-27 23:49:05,144][06909] InferenceWorker_p0-w0: resuming experience collection (22750 times) [2024-06-27 23:49:05,301][06909] Updated weights for policy 0, policy_version 103093 (0.0027) [2024-06-27 23:49:08,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 1689223168. Throughput: 0: 43867.0. Samples: 1592177780. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-27 23:49:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:49:09,104][06909] Updated weights for policy 0, policy_version 103103 (0.0039) [2024-06-27 23:49:12,725][06909] Updated weights for policy 0, policy_version 103113 (0.0035) [2024-06-27 23:49:13,850][06674] Fps is (10 sec: 47513.2, 60 sec: 43690.6, 300 sec: 44153.5). Total num frames: 1689452544. Throughput: 0: 43872.4. Samples: 1592313460. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-27 23:49:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:49:16,879][06909] Updated weights for policy 0, policy_version 103123 (0.0037) [2024-06-27 23:49:18,856][06674] Fps is (10 sec: 40934.9, 60 sec: 43686.2, 300 sec: 43930.7). Total num frames: 1689632768. Throughput: 0: 43965.9. Samples: 1592578780. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-27 23:49:18,857][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:49:20,246][06909] Updated weights for policy 0, policy_version 103133 (0.0031) [2024-06-27 23:49:23,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 1689878528. Throughput: 0: 43944.4. Samples: 1592835160. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-27 23:49:23,850][06674] Avg episode reward: [(0, '0.397')] [2024-06-27 23:49:24,163][06909] Updated weights for policy 0, policy_version 103143 (0.0037) [2024-06-27 23:49:27,757][06909] Updated weights for policy 0, policy_version 103153 (0.0032) [2024-06-27 23:49:28,850][06674] Fps is (10 sec: 45903.5, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 1690091520. Throughput: 0: 43826.3. Samples: 1592971400. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-27 23:49:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 23:49:31,436][06909] Updated weights for policy 0, policy_version 103163 (0.0042) [2024-06-27 23:49:33,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 1690288128. Throughput: 0: 43837.8. Samples: 1593227280. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-27 23:49:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:49:35,155][06909] Updated weights for policy 0, policy_version 103173 (0.0037) [2024-06-27 23:49:38,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 1690533888. Throughput: 0: 43891.2. Samples: 1593491380. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-27 23:49:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:49:39,233][06909] Updated weights for policy 0, policy_version 103183 (0.0035) [2024-06-27 23:49:42,851][06909] Updated weights for policy 0, policy_version 103193 (0.0034) [2024-06-27 23:49:43,850][06674] Fps is (10 sec: 47513.7, 60 sec: 43690.7, 300 sec: 44098.3). Total num frames: 1690763264. Throughput: 0: 43755.2. Samples: 1593624500. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-27 23:49:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:49:46,731][06909] Updated weights for policy 0, policy_version 103203 (0.0035) [2024-06-27 23:49:48,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 1690959872. Throughput: 0: 43825.3. Samples: 1593890320. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-27 23:49:48,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 23:49:49,015][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000103209_1690976256.pth... [2024-06-27 23:49:49,075][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000102564_1680408576.pth [2024-06-27 23:49:50,001][06909] Updated weights for policy 0, policy_version 103213 (0.0027) [2024-06-27 23:49:53,852][06674] Fps is (10 sec: 42589.8, 60 sec: 44235.3, 300 sec: 44097.7). Total num frames: 1691189248. Throughput: 0: 43851.8. Samples: 1594151200. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-27 23:49:53,852][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:49:54,000][06909] Updated weights for policy 0, policy_version 103223 (0.0026) [2024-06-27 23:49:57,719][06909] Updated weights for policy 0, policy_version 103233 (0.0023) [2024-06-27 23:49:58,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.9, 300 sec: 44098.0). Total num frames: 1691418624. Throughput: 0: 43969.0. Samples: 1594292060. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-27 23:49:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:50:01,213][06909] Updated weights for policy 0, policy_version 103243 (0.0034) [2024-06-27 23:50:03,850][06674] Fps is (10 sec: 42607.3, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 1691615232. Throughput: 0: 43896.7. Samples: 1594553860. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-27 23:50:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:50:04,973][06909] Updated weights for policy 0, policy_version 103253 (0.0031) [2024-06-27 23:50:08,794][06909] Updated weights for policy 0, policy_version 103263 (0.0034) [2024-06-27 23:50:08,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 1691860992. Throughput: 0: 43893.3. Samples: 1594810360. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-27 23:50:08,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:50:11,199][06887] Signal inference workers to stop experience collection... (22800 times) [2024-06-27 23:50:11,235][06909] InferenceWorker_p0-w0: stopping experience collection (22800 times) [2024-06-27 23:50:11,257][06887] Signal inference workers to resume experience collection... (22800 times) [2024-06-27 23:50:11,258][06909] InferenceWorker_p0-w0: resuming experience collection (22800 times) [2024-06-27 23:50:12,175][06909] Updated weights for policy 0, policy_version 103273 (0.0048) [2024-06-27 23:50:13,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43690.7, 300 sec: 44098.1). Total num frames: 1692073984. Throughput: 0: 43880.9. Samples: 1594946040. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-27 23:50:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:50:16,186][06909] Updated weights for policy 0, policy_version 103283 (0.0029) [2024-06-27 23:50:18,850][06674] Fps is (10 sec: 42598.2, 60 sec: 44241.3, 300 sec: 43986.9). Total num frames: 1692286976. Throughput: 0: 44107.5. Samples: 1595212120. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-27 23:50:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:50:19,891][06909] Updated weights for policy 0, policy_version 103293 (0.0029) [2024-06-27 23:50:23,625][06909] Updated weights for policy 0, policy_version 103303 (0.0036) [2024-06-27 23:50:23,852][06674] Fps is (10 sec: 44226.1, 60 sec: 43962.0, 300 sec: 44153.1). Total num frames: 1692516352. Throughput: 0: 44140.7. Samples: 1595477820. Policy #0 lag: (min: 1.0, avg: 10.5, max: 21.0) [2024-06-27 23:50:23,853][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:50:27,532][06909] Updated weights for policy 0, policy_version 103313 (0.0033) [2024-06-27 23:50:28,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 1692745728. Throughput: 0: 43976.8. Samples: 1595603460. Policy #0 lag: (min: 1.0, avg: 10.5, max: 21.0) [2024-06-27 23:50:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:50:30,921][06909] Updated weights for policy 0, policy_version 103323 (0.0033) [2024-06-27 23:50:33,850][06674] Fps is (10 sec: 44247.1, 60 sec: 44509.9, 300 sec: 43986.9). Total num frames: 1692958720. Throughput: 0: 44224.4. Samples: 1595880420. Policy #0 lag: (min: 1.0, avg: 10.5, max: 21.0) [2024-06-27 23:50:33,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:50:34,682][06909] Updated weights for policy 0, policy_version 103333 (0.0039) [2024-06-27 23:50:38,132][06909] Updated weights for policy 0, policy_version 103343 (0.0026) [2024-06-27 23:50:38,856][06674] Fps is (10 sec: 42572.9, 60 sec: 43959.3, 300 sec: 44152.6). Total num frames: 1693171712. Throughput: 0: 44140.1. Samples: 1596137680. Policy #0 lag: (min: 1.0, avg: 10.5, max: 21.0) [2024-06-27 23:50:38,856][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:50:42,166][06909] Updated weights for policy 0, policy_version 103353 (0.0031) [2024-06-27 23:50:43,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1693401088. Throughput: 0: 43975.5. Samples: 1596270960. Policy #0 lag: (min: 1.0, avg: 10.5, max: 21.0) [2024-06-27 23:50:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:50:46,322][06909] Updated weights for policy 0, policy_version 103363 (0.0032) [2024-06-27 23:50:48,850][06674] Fps is (10 sec: 44263.7, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1693614080. Throughput: 0: 44159.5. Samples: 1596541040. Policy #0 lag: (min: 1.0, avg: 10.5, max: 21.0) [2024-06-27 23:50:48,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:50:49,607][06909] Updated weights for policy 0, policy_version 103373 (0.0041) [2024-06-27 23:50:53,656][06909] Updated weights for policy 0, policy_version 103383 (0.0030) [2024-06-27 23:50:53,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43965.2, 300 sec: 44042.4). Total num frames: 1693827072. Throughput: 0: 44197.8. Samples: 1596799260. Policy #0 lag: (min: 1.0, avg: 10.5, max: 21.0) [2024-06-27 23:50:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:50:57,136][06909] Updated weights for policy 0, policy_version 103393 (0.0032) [2024-06-27 23:50:58,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 1694056448. Throughput: 0: 44120.4. Samples: 1596931460. Policy #0 lag: (min: 1.0, avg: 10.5, max: 21.0) [2024-06-27 23:50:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:51:00,824][06909] Updated weights for policy 0, policy_version 103403 (0.0030) [2024-06-27 23:51:03,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 1694269440. Throughput: 0: 44264.9. Samples: 1597204040. Policy #0 lag: (min: 1.0, avg: 10.5, max: 21.0) [2024-06-27 23:51:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 23:51:04,534][06909] Updated weights for policy 0, policy_version 103413 (0.0037) [2024-06-27 23:51:08,514][06909] Updated weights for policy 0, policy_version 103423 (0.0025) [2024-06-27 23:51:08,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 1694482432. Throughput: 0: 44152.1. Samples: 1597464560. Policy #0 lag: (min: 1.0, avg: 10.5, max: 21.0) [2024-06-27 23:51:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:51:11,749][06909] Updated weights for policy 0, policy_version 103433 (0.0039) [2024-06-27 23:51:13,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1694711808. Throughput: 0: 44217.9. Samples: 1597593260. Policy #0 lag: (min: 1.0, avg: 10.5, max: 21.0) [2024-06-27 23:51:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:51:15,941][06909] Updated weights for policy 0, policy_version 103443 (0.0033) [2024-06-27 23:51:18,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 1694941184. Throughput: 0: 44034.7. Samples: 1597861980. Policy #0 lag: (min: 1.0, avg: 10.5, max: 21.0) [2024-06-27 23:51:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:51:19,307][06909] Updated weights for policy 0, policy_version 103453 (0.0045) [2024-06-27 23:51:23,493][06909] Updated weights for policy 0, policy_version 103463 (0.0041) [2024-06-27 23:51:23,850][06674] Fps is (10 sec: 42597.5, 60 sec: 43692.3, 300 sec: 43986.8). Total num frames: 1695137792. Throughput: 0: 43936.9. Samples: 1598114580. Policy #0 lag: (min: 1.0, avg: 10.5, max: 21.0) [2024-06-27 23:51:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:51:27,135][06909] Updated weights for policy 0, policy_version 103473 (0.0034) [2024-06-27 23:51:28,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1695367168. Throughput: 0: 43846.2. Samples: 1598244040. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-27 23:51:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:51:30,634][06909] Updated weights for policy 0, policy_version 103483 (0.0037) [2024-06-27 23:51:33,850][06674] Fps is (10 sec: 45876.1, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1695596544. Throughput: 0: 43881.8. Samples: 1598515720. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-27 23:51:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:51:34,581][06909] Updated weights for policy 0, policy_version 103493 (0.0034) [2024-06-27 23:51:36,424][06887] Signal inference workers to stop experience collection... (22850 times) [2024-06-27 23:51:36,424][06887] Signal inference workers to resume experience collection... (22850 times) [2024-06-27 23:51:36,443][06909] InferenceWorker_p0-w0: stopping experience collection (22850 times) [2024-06-27 23:51:36,443][06909] InferenceWorker_p0-w0: resuming experience collection (22850 times) [2024-06-27 23:51:37,993][06909] Updated weights for policy 0, policy_version 103503 (0.0032) [2024-06-27 23:51:38,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43968.1, 300 sec: 43986.9). Total num frames: 1695809536. Throughput: 0: 44033.6. Samples: 1598780780. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-27 23:51:38,856][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:51:41,701][06909] Updated weights for policy 0, policy_version 103513 (0.0039) [2024-06-27 23:51:43,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1696038912. Throughput: 0: 44001.3. Samples: 1598911520. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-27 23:51:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:51:45,673][06909] Updated weights for policy 0, policy_version 103523 (0.0044) [2024-06-27 23:51:48,852][06674] Fps is (10 sec: 45866.4, 60 sec: 44235.2, 300 sec: 44042.1). Total num frames: 1696268288. Throughput: 0: 43860.3. Samples: 1599177840. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-27 23:51:48,853][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:51:49,002][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000103533_1696284672.pth... [2024-06-27 23:51:49,004][06909] Updated weights for policy 0, policy_version 103533 (0.0031) [2024-06-27 23:51:49,053][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000102886_1685684224.pth [2024-06-27 23:51:53,446][06909] Updated weights for policy 0, policy_version 103543 (0.0043) [2024-06-27 23:51:53,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43963.6, 300 sec: 43931.3). Total num frames: 1696464896. Throughput: 0: 43938.1. Samples: 1599441780. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-27 23:51:53,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 23:51:56,610][06909] Updated weights for policy 0, policy_version 103553 (0.0023) [2024-06-27 23:51:58,852][06674] Fps is (10 sec: 40960.0, 60 sec: 43689.2, 300 sec: 43986.6). Total num frames: 1696677888. Throughput: 0: 43803.7. Samples: 1599564520. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-27 23:51:58,852][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:52:00,678][06909] Updated weights for policy 0, policy_version 103563 (0.0024) [2024-06-27 23:52:03,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 1696923648. Throughput: 0: 43792.8. Samples: 1599832660. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-27 23:52:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:52:04,071][06909] Updated weights for policy 0, policy_version 103573 (0.0029) [2024-06-27 23:52:07,915][06909] Updated weights for policy 0, policy_version 103583 (0.0030) [2024-06-27 23:52:08,850][06674] Fps is (10 sec: 44246.1, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 1697120256. Throughput: 0: 44025.5. Samples: 1600095720. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-27 23:52:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:52:11,895][06909] Updated weights for policy 0, policy_version 103593 (0.0028) [2024-06-27 23:52:13,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1697349632. Throughput: 0: 44061.4. Samples: 1600226800. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-27 23:52:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:52:15,126][06909] Updated weights for policy 0, policy_version 103603 (0.0039) [2024-06-27 23:52:18,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1697579008. Throughput: 0: 43998.7. Samples: 1600495660. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-27 23:52:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:52:19,191][06909] Updated weights for policy 0, policy_version 103613 (0.0028) [2024-06-27 23:52:23,485][06909] Updated weights for policy 0, policy_version 103623 (0.0029) [2024-06-27 23:52:23,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43963.8, 300 sec: 43931.4). Total num frames: 1697775616. Throughput: 0: 44058.3. Samples: 1600763400. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-27 23:52:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:52:26,407][06909] Updated weights for policy 0, policy_version 103633 (0.0030) [2024-06-27 23:52:28,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 1698004992. Throughput: 0: 43948.0. Samples: 1600889180. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 23:52:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:52:30,707][06909] Updated weights for policy 0, policy_version 103643 (0.0033) [2024-06-27 23:52:33,795][06909] Updated weights for policy 0, policy_version 103653 (0.0031) [2024-06-27 23:52:33,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44236.7, 300 sec: 43986.8). Total num frames: 1698250752. Throughput: 0: 43909.0. Samples: 1601153660. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 23:52:33,854][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:52:37,862][06909] Updated weights for policy 0, policy_version 103663 (0.0033) [2024-06-27 23:52:38,850][06674] Fps is (10 sec: 45875.7, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 1698463744. Throughput: 0: 44047.3. Samples: 1601423900. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 23:52:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 23:52:41,469][06909] Updated weights for policy 0, policy_version 103673 (0.0038) [2024-06-27 23:52:43,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1698660352. Throughput: 0: 44154.5. Samples: 1601551380. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 23:52:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:52:45,618][06909] Updated weights for policy 0, policy_version 103683 (0.0028) [2024-06-27 23:52:48,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43692.2, 300 sec: 43986.9). Total num frames: 1698889728. Throughput: 0: 43936.1. Samples: 1601809780. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 23:52:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:52:49,159][06909] Updated weights for policy 0, policy_version 103693 (0.0040) [2024-06-27 23:52:53,055][06909] Updated weights for policy 0, policy_version 103703 (0.0028) [2024-06-27 23:52:53,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44236.9, 300 sec: 43931.3). Total num frames: 1699119104. Throughput: 0: 44187.5. Samples: 1602084160. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 23:52:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:52:56,396][06909] Updated weights for policy 0, policy_version 103713 (0.0038) [2024-06-27 23:52:58,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44511.4, 300 sec: 44097.9). Total num frames: 1699348480. Throughput: 0: 44189.3. Samples: 1602215320. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 23:52:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:53:00,704][06909] Updated weights for policy 0, policy_version 103723 (0.0044) [2024-06-27 23:53:01,466][06887] Signal inference workers to stop experience collection... (22900 times) [2024-06-27 23:53:01,504][06909] InferenceWorker_p0-w0: stopping experience collection (22900 times) [2024-06-27 23:53:01,532][06887] Signal inference workers to resume experience collection... (22900 times) [2024-06-27 23:53:01,535][06909] InferenceWorker_p0-w0: resuming experience collection (22900 times) [2024-06-27 23:53:03,626][06909] Updated weights for policy 0, policy_version 103733 (0.0035) [2024-06-27 23:53:03,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.9, 300 sec: 44097.9). Total num frames: 1699577856. Throughput: 0: 43982.6. Samples: 1602474880. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 23:53:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:53:08,089][06909] Updated weights for policy 0, policy_version 103743 (0.0031) [2024-06-27 23:53:08,850][06674] Fps is (10 sec: 44237.3, 60 sec: 44509.9, 300 sec: 43931.3). Total num frames: 1699790848. Throughput: 0: 44051.7. Samples: 1602745720. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 23:53:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:53:10,838][06909] Updated weights for policy 0, policy_version 103753 (0.0034) [2024-06-27 23:53:13,852][06674] Fps is (10 sec: 42589.8, 60 sec: 44235.3, 300 sec: 44042.1). Total num frames: 1700003840. Throughput: 0: 44010.5. Samples: 1602869740. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 23:53:13,852][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:53:15,444][06909] Updated weights for policy 0, policy_version 103763 (0.0027) [2024-06-27 23:53:18,650][06909] Updated weights for policy 0, policy_version 103773 (0.0042) [2024-06-27 23:53:18,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 1700233216. Throughput: 0: 44061.4. Samples: 1603136420. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 23:53:18,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 23:53:23,098][06909] Updated weights for policy 0, policy_version 103783 (0.0034) [2024-06-27 23:53:23,850][06674] Fps is (10 sec: 42607.5, 60 sec: 44236.9, 300 sec: 43931.3). Total num frames: 1700429824. Throughput: 0: 43809.8. Samples: 1603395340. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-27 23:53:23,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 23:53:26,418][06909] Updated weights for policy 0, policy_version 103793 (0.0042) [2024-06-27 23:53:28,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1700659200. Throughput: 0: 43937.8. Samples: 1603528580. Policy #0 lag: (min: 0.0, avg: 10.9, max: 27.0) [2024-06-27 23:53:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 23:53:30,360][06909] Updated weights for policy 0, policy_version 103803 (0.0028) [2024-06-27 23:53:33,771][06909] Updated weights for policy 0, policy_version 103813 (0.0038) [2024-06-27 23:53:33,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1700872192. Throughput: 0: 43905.8. Samples: 1603785540. Policy #0 lag: (min: 0.0, avg: 10.9, max: 27.0) [2024-06-27 23:53:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:53:37,957][06909] Updated weights for policy 0, policy_version 103823 (0.0044) [2024-06-27 23:53:38,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 1701085184. Throughput: 0: 43904.0. Samples: 1604059840. Policy #0 lag: (min: 0.0, avg: 10.9, max: 27.0) [2024-06-27 23:53:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:53:40,888][06909] Updated weights for policy 0, policy_version 103833 (0.0027) [2024-06-27 23:53:43,852][06674] Fps is (10 sec: 44227.7, 60 sec: 44235.3, 300 sec: 44097.7). Total num frames: 1701314560. Throughput: 0: 43810.5. Samples: 1604186880. Policy #0 lag: (min: 0.0, avg: 10.9, max: 27.0) [2024-06-27 23:53:43,852][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:53:45,360][06909] Updated weights for policy 0, policy_version 103843 (0.0036) [2024-06-27 23:53:48,388][06909] Updated weights for policy 0, policy_version 103853 (0.0035) [2024-06-27 23:53:48,850][06674] Fps is (10 sec: 47513.4, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 1701560320. Throughput: 0: 43876.4. Samples: 1604449320. Policy #0 lag: (min: 0.0, avg: 10.9, max: 27.0) [2024-06-27 23:53:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:53:48,856][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000103855_1701560320.pth... [2024-06-27 23:53:48,909][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000103209_1690976256.pth [2024-06-27 23:53:52,903][06909] Updated weights for policy 0, policy_version 103863 (0.0047) [2024-06-27 23:53:53,850][06674] Fps is (10 sec: 42607.2, 60 sec: 43690.7, 300 sec: 43931.4). Total num frames: 1701740544. Throughput: 0: 43691.5. Samples: 1604711840. Policy #0 lag: (min: 0.0, avg: 10.9, max: 27.0) [2024-06-27 23:53:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:53:56,002][06909] Updated weights for policy 0, policy_version 103873 (0.0030) [2024-06-27 23:53:58,856][06674] Fps is (10 sec: 40935.3, 60 sec: 43686.2, 300 sec: 44041.5). Total num frames: 1701969920. Throughput: 0: 43845.4. Samples: 1604842960. Policy #0 lag: (min: 0.0, avg: 10.9, max: 27.0) [2024-06-27 23:53:58,856][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:54:00,166][06909] Updated weights for policy 0, policy_version 103883 (0.0033) [2024-06-27 23:54:03,822][06909] Updated weights for policy 0, policy_version 103893 (0.0030) [2024-06-27 23:54:03,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43417.6, 300 sec: 43931.3). Total num frames: 1702182912. Throughput: 0: 43626.6. Samples: 1605099620. Policy #0 lag: (min: 0.0, avg: 10.9, max: 27.0) [2024-06-27 23:54:03,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:54:07,370][06909] Updated weights for policy 0, policy_version 103903 (0.0042) [2024-06-27 23:54:08,850][06674] Fps is (10 sec: 44263.6, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 1702412288. Throughput: 0: 43953.2. Samples: 1605373240. Policy #0 lag: (min: 0.0, avg: 10.9, max: 27.0) [2024-06-27 23:54:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:54:10,979][06909] Updated weights for policy 0, policy_version 103913 (0.0025) [2024-06-27 23:54:11,233][06887] Signal inference workers to stop experience collection... (22950 times) [2024-06-27 23:54:11,276][06909] InferenceWorker_p0-w0: stopping experience collection (22950 times) [2024-06-27 23:54:11,290][06887] Signal inference workers to resume experience collection... (22950 times) [2024-06-27 23:54:11,292][06909] InferenceWorker_p0-w0: resuming experience collection (22950 times) [2024-06-27 23:54:13,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43692.1, 300 sec: 44043.3). Total num frames: 1702625280. Throughput: 0: 43779.5. Samples: 1605498660. Policy #0 lag: (min: 0.0, avg: 10.9, max: 27.0) [2024-06-27 23:54:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:54:15,283][06909] Updated weights for policy 0, policy_version 103923 (0.0027) [2024-06-27 23:54:18,196][06909] Updated weights for policy 0, policy_version 103933 (0.0034) [2024-06-27 23:54:18,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1702871040. Throughput: 0: 44090.7. Samples: 1605769620. Policy #0 lag: (min: 0.0, avg: 10.9, max: 27.0) [2024-06-27 23:54:18,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:54:22,452][06909] Updated weights for policy 0, policy_version 103943 (0.0032) [2024-06-27 23:54:23,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1703067648. Throughput: 0: 43927.1. Samples: 1606036560. Policy #0 lag: (min: 0.0, avg: 10.9, max: 27.0) [2024-06-27 23:54:23,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 23:54:25,683][06909] Updated weights for policy 0, policy_version 103953 (0.0023) [2024-06-27 23:54:28,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 1703297024. Throughput: 0: 43885.1. Samples: 1606161620. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-27 23:54:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:54:29,932][06909] Updated weights for policy 0, policy_version 103963 (0.0042) [2024-06-27 23:54:33,086][06909] Updated weights for policy 0, policy_version 103973 (0.0027) [2024-06-27 23:54:33,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1703526400. Throughput: 0: 44120.6. Samples: 1606434740. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-27 23:54:33,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 23:54:37,129][06909] Updated weights for policy 0, policy_version 103983 (0.0030) [2024-06-27 23:54:38,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 1703723008. Throughput: 0: 44026.6. Samples: 1606693040. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-27 23:54:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:54:41,008][06909] Updated weights for policy 0, policy_version 103993 (0.0034) [2024-06-27 23:54:43,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43965.2, 300 sec: 44042.4). Total num frames: 1703952384. Throughput: 0: 44045.9. Samples: 1606824760. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-27 23:54:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:54:44,459][06909] Updated weights for policy 0, policy_version 104003 (0.0034) [2024-06-27 23:54:48,231][06909] Updated weights for policy 0, policy_version 104013 (0.0023) [2024-06-27 23:54:48,852][06674] Fps is (10 sec: 44228.1, 60 sec: 43416.2, 300 sec: 43986.9). Total num frames: 1704165376. Throughput: 0: 44269.6. Samples: 1607091840. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-27 23:54:48,852][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 23:54:52,370][06909] Updated weights for policy 0, policy_version 104023 (0.0029) [2024-06-27 23:54:53,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1704394752. Throughput: 0: 44072.0. Samples: 1607356480. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-27 23:54:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:54:55,655][06909] Updated weights for policy 0, policy_version 104033 (0.0029) [2024-06-27 23:54:58,850][06674] Fps is (10 sec: 45884.3, 60 sec: 44241.2, 300 sec: 44097.9). Total num frames: 1704624128. Throughput: 0: 44191.6. Samples: 1607487280. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-27 23:54:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:54:59,497][06909] Updated weights for policy 0, policy_version 104043 (0.0030) [2024-06-27 23:55:02,763][06909] Updated weights for policy 0, policy_version 104053 (0.0037) [2024-06-27 23:55:03,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 1704837120. Throughput: 0: 44195.1. Samples: 1607758400. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-27 23:55:03,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:55:07,099][06909] Updated weights for policy 0, policy_version 104063 (0.0029) [2024-06-27 23:55:08,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1705050112. Throughput: 0: 43998.7. Samples: 1608016500. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-27 23:55:08,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:55:10,434][06909] Updated weights for policy 0, policy_version 104073 (0.0026) [2024-06-27 23:55:13,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 1705279488. Throughput: 0: 44194.2. Samples: 1608150360. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-27 23:55:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:55:14,381][06909] Updated weights for policy 0, policy_version 104083 (0.0034) [2024-06-27 23:55:18,002][06909] Updated weights for policy 0, policy_version 104093 (0.0040) [2024-06-27 23:55:18,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43963.8, 300 sec: 44042.8). Total num frames: 1705508864. Throughput: 0: 44080.1. Samples: 1608418340. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-27 23:55:18,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-27 23:55:21,582][06909] Updated weights for policy 0, policy_version 104103 (0.0026) [2024-06-27 23:55:23,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 1705705472. Throughput: 0: 44117.4. Samples: 1608678320. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-27 23:55:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:55:25,182][06909] Updated weights for policy 0, policy_version 104113 (0.0031) [2024-06-27 23:55:28,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1705934848. Throughput: 0: 44184.9. Samples: 1608813080. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-27 23:55:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:55:29,246][06909] Updated weights for policy 0, policy_version 104123 (0.0026) [2024-06-27 23:55:32,606][06909] Updated weights for policy 0, policy_version 104133 (0.0022) [2024-06-27 23:55:33,852][06674] Fps is (10 sec: 47504.3, 60 sec: 44235.3, 300 sec: 44098.6). Total num frames: 1706180608. Throughput: 0: 44223.1. Samples: 1609081880. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-27 23:55:33,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:55:36,414][06909] Updated weights for policy 0, policy_version 104143 (0.0021) [2024-06-27 23:55:38,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 1706360832. Throughput: 0: 44344.4. Samples: 1609351980. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-27 23:55:38,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:55:39,848][06909] Updated weights for policy 0, policy_version 104153 (0.0036) [2024-06-27 23:55:43,850][06674] Fps is (10 sec: 40968.6, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1706590208. Throughput: 0: 44243.3. Samples: 1609478220. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-27 23:55:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:55:43,934][06909] Updated weights for policy 0, policy_version 104163 (0.0041) [2024-06-27 23:55:47,519][06909] Updated weights for policy 0, policy_version 104173 (0.0033) [2024-06-27 23:55:48,851][06674] Fps is (10 sec: 47506.2, 60 sec: 44510.2, 300 sec: 44097.7). Total num frames: 1706835968. Throughput: 0: 44197.1. Samples: 1609747340. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-27 23:55:48,852][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:55:48,858][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000104177_1706835968.pth... [2024-06-27 23:55:48,915][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000103533_1696284672.pth [2024-06-27 23:55:51,462][06909] Updated weights for policy 0, policy_version 104183 (0.0025) [2024-06-27 23:55:53,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1707032576. Throughput: 0: 44270.2. Samples: 1610008660. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-27 23:55:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:55:54,958][06909] Updated weights for policy 0, policy_version 104193 (0.0028) [2024-06-27 23:55:55,353][06887] Signal inference workers to stop experience collection... (23000 times) [2024-06-27 23:55:55,402][06909] InferenceWorker_p0-w0: stopping experience collection (23000 times) [2024-06-27 23:55:55,413][06887] Signal inference workers to resume experience collection... (23000 times) [2024-06-27 23:55:55,419][06909] InferenceWorker_p0-w0: resuming experience collection (23000 times) [2024-06-27 23:55:58,842][06909] Updated weights for policy 0, policy_version 104203 (0.0039) [2024-06-27 23:55:58,850][06674] Fps is (10 sec: 42605.2, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1707261952. Throughput: 0: 44157.8. Samples: 1610137460. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-27 23:55:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:56:02,612][06909] Updated weights for policy 0, policy_version 104213 (0.0028) [2024-06-27 23:56:03,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1707491328. Throughput: 0: 44209.7. Samples: 1610407780. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-27 23:56:03,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 23:56:06,064][06909] Updated weights for policy 0, policy_version 104223 (0.0031) [2024-06-27 23:56:08,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1707704320. Throughput: 0: 44385.9. Samples: 1610675680. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-27 23:56:08,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:56:09,745][06909] Updated weights for policy 0, policy_version 104233 (0.0037) [2024-06-27 23:56:13,546][06909] Updated weights for policy 0, policy_version 104243 (0.0045) [2024-06-27 23:56:13,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1707917312. Throughput: 0: 44199.6. Samples: 1610802060. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-27 23:56:13,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:56:16,979][06909] Updated weights for policy 0, policy_version 104253 (0.0025) [2024-06-27 23:56:18,850][06674] Fps is (10 sec: 44235.8, 60 sec: 43963.5, 300 sec: 44097.9). Total num frames: 1708146688. Throughput: 0: 44254.2. Samples: 1611073240. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-27 23:56:18,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:56:21,129][06909] Updated weights for policy 0, policy_version 104263 (0.0030) [2024-06-27 23:56:23,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1708343296. Throughput: 0: 44131.6. Samples: 1611337900. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-27 23:56:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:56:24,468][06909] Updated weights for policy 0, policy_version 104273 (0.0029) [2024-06-27 23:56:28,434][06909] Updated weights for policy 0, policy_version 104283 (0.0046) [2024-06-27 23:56:28,850][06674] Fps is (10 sec: 44237.8, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 1708589056. Throughput: 0: 44219.5. Samples: 1611468100. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 23:56:28,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:56:31,928][06909] Updated weights for policy 0, policy_version 104293 (0.0025) [2024-06-27 23:56:33,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43692.1, 300 sec: 44042.4). Total num frames: 1708802048. Throughput: 0: 44189.5. Samples: 1611735800. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 23:56:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:56:35,797][06909] Updated weights for policy 0, policy_version 104303 (0.0025) [2024-06-27 23:56:38,852][06674] Fps is (10 sec: 44227.4, 60 sec: 44508.4, 300 sec: 44042.1). Total num frames: 1709031424. Throughput: 0: 44285.5. Samples: 1612001600. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 23:56:38,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:56:39,721][06909] Updated weights for policy 0, policy_version 104313 (0.0042) [2024-06-27 23:56:43,378][06909] Updated weights for policy 0, policy_version 104323 (0.0028) [2024-06-27 23:56:43,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.8, 300 sec: 43987.2). Total num frames: 1709244416. Throughput: 0: 44291.6. Samples: 1612130580. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 23:56:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:56:47,117][06909] Updated weights for policy 0, policy_version 104333 (0.0048) [2024-06-27 23:56:48,850][06674] Fps is (10 sec: 44245.8, 60 sec: 43964.9, 300 sec: 44098.0). Total num frames: 1709473792. Throughput: 0: 44096.8. Samples: 1612392140. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 23:56:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:56:50,980][06909] Updated weights for policy 0, policy_version 104343 (0.0032) [2024-06-27 23:56:53,850][06674] Fps is (10 sec: 44236.2, 60 sec: 44236.7, 300 sec: 44098.2). Total num frames: 1709686784. Throughput: 0: 43951.9. Samples: 1612653520. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 23:56:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:56:54,410][06909] Updated weights for policy 0, policy_version 104353 (0.0036) [2024-06-27 23:56:58,303][06909] Updated weights for policy 0, policy_version 104363 (0.0045) [2024-06-27 23:56:58,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1709916160. Throughput: 0: 44093.2. Samples: 1612786260. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 23:56:58,853][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:57:01,734][06909] Updated weights for policy 0, policy_version 104373 (0.0025) [2024-06-27 23:57:03,852][06674] Fps is (10 sec: 44227.3, 60 sec: 43962.1, 300 sec: 44097.6). Total num frames: 1710129152. Throughput: 0: 43877.1. Samples: 1613047800. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 23:57:03,853][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:57:05,555][06909] Updated weights for policy 0, policy_version 104383 (0.0039) [2024-06-27 23:57:08,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1710342144. Throughput: 0: 44049.9. Samples: 1613320140. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 23:57:08,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:57:09,138][06909] Updated weights for policy 0, policy_version 104393 (0.0039) [2024-06-27 23:57:12,999][06909] Updated weights for policy 0, policy_version 104403 (0.0024) [2024-06-27 23:57:13,850][06674] Fps is (10 sec: 44246.6, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 1710571520. Throughput: 0: 44032.8. Samples: 1613449580. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 23:57:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:57:16,815][06909] Updated weights for policy 0, policy_version 104413 (0.0038) [2024-06-27 23:57:17,617][06887] Signal inference workers to stop experience collection... (23050 times) [2024-06-27 23:57:17,619][06887] Signal inference workers to resume experience collection... (23050 times) [2024-06-27 23:57:17,652][06909] InferenceWorker_p0-w0: stopping experience collection (23050 times) [2024-06-27 23:57:17,653][06909] InferenceWorker_p0-w0: resuming experience collection (23050 times) [2024-06-27 23:57:18,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.9, 300 sec: 44098.0). Total num frames: 1710784512. Throughput: 0: 44061.9. Samples: 1613718580. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 23:57:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:57:20,394][06909] Updated weights for policy 0, policy_version 104423 (0.0034) [2024-06-27 23:57:23,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 1711013888. Throughput: 0: 43903.4. Samples: 1613977160. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-27 23:57:23,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:57:23,991][06909] Updated weights for policy 0, policy_version 104433 (0.0033) [2024-06-27 23:57:27,989][06909] Updated weights for policy 0, policy_version 104443 (0.0040) [2024-06-27 23:57:28,850][06674] Fps is (10 sec: 47513.1, 60 sec: 44509.8, 300 sec: 44098.0). Total num frames: 1711259648. Throughput: 0: 44122.6. Samples: 1614116100. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2024-06-27 23:57:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:57:31,234][06909] Updated weights for policy 0, policy_version 104453 (0.0027) [2024-06-27 23:57:33,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1711439872. Throughput: 0: 44148.0. Samples: 1614378800. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2024-06-27 23:57:33,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 23:57:35,294][06909] Updated weights for policy 0, policy_version 104463 (0.0037) [2024-06-27 23:57:38,618][06909] Updated weights for policy 0, policy_version 104473 (0.0035) [2024-06-27 23:57:38,850][06674] Fps is (10 sec: 42598.0, 60 sec: 44238.2, 300 sec: 44153.5). Total num frames: 1711685632. Throughput: 0: 44195.0. Samples: 1614642300. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2024-06-27 23:57:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:57:42,739][06909] Updated weights for policy 0, policy_version 104483 (0.0030) [2024-06-27 23:57:43,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 1711915008. Throughput: 0: 44246.7. Samples: 1614777360. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2024-06-27 23:57:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:57:46,319][06909] Updated weights for policy 0, policy_version 104493 (0.0039) [2024-06-27 23:57:48,850][06674] Fps is (10 sec: 42599.2, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1712111616. Throughput: 0: 44224.9. Samples: 1615037820. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2024-06-27 23:57:48,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-27 23:57:48,863][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000104499_1712111616.pth... [2024-06-27 23:57:48,922][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000103855_1701560320.pth [2024-06-27 23:57:49,867][06909] Updated weights for policy 0, policy_version 104503 (0.0025) [2024-06-27 23:57:53,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1712324608. Throughput: 0: 44107.5. Samples: 1615304980. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2024-06-27 23:57:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:57:54,094][06909] Updated weights for policy 0, policy_version 104513 (0.0033) [2024-06-27 23:57:57,752][06909] Updated weights for policy 0, policy_version 104523 (0.0037) [2024-06-27 23:57:58,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1712570368. Throughput: 0: 44188.0. Samples: 1615438040. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2024-06-27 23:57:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:58:01,277][06909] Updated weights for policy 0, policy_version 104533 (0.0026) [2024-06-27 23:58:03,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43965.4, 300 sec: 43986.9). Total num frames: 1712766976. Throughput: 0: 43956.0. Samples: 1615696600. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2024-06-27 23:58:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:58:04,978][06909] Updated weights for policy 0, policy_version 104543 (0.0038) [2024-06-27 23:58:08,367][06909] Updated weights for policy 0, policy_version 104553 (0.0033) [2024-06-27 23:58:08,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44782.8, 300 sec: 44153.8). Total num frames: 1713029120. Throughput: 0: 44256.3. Samples: 1615968700. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2024-06-27 23:58:08,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:58:12,339][06909] Updated weights for policy 0, policy_version 104563 (0.0027) [2024-06-27 23:58:13,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1713225728. Throughput: 0: 44192.0. Samples: 1616104740. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2024-06-27 23:58:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-27 23:58:15,759][06909] Updated weights for policy 0, policy_version 104573 (0.0026) [2024-06-27 23:58:18,850][06674] Fps is (10 sec: 39321.9, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1713422336. Throughput: 0: 43976.0. Samples: 1616357720. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2024-06-27 23:58:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:58:19,615][06909] Updated weights for policy 0, policy_version 104583 (0.0025) [2024-06-27 23:58:23,585][06909] Updated weights for policy 0, policy_version 104593 (0.0033) [2024-06-27 23:58:23,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1713668096. Throughput: 0: 44176.2. Samples: 1616630220. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2024-06-27 23:58:23,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:58:27,098][06909] Updated weights for policy 0, policy_version 104603 (0.0033) [2024-06-27 23:58:28,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43690.7, 300 sec: 44097.9). Total num frames: 1713881088. Throughput: 0: 44169.3. Samples: 1616764980. Policy #0 lag: (min: 0.0, avg: 9.0, max: 19.0) [2024-06-27 23:58:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:58:31,100][06909] Updated weights for policy 0, policy_version 104613 (0.0029) [2024-06-27 23:58:33,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1714094080. Throughput: 0: 44141.8. Samples: 1617024200. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 23:58:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:58:34,846][06909] Updated weights for policy 0, policy_version 104623 (0.0026) [2024-06-27 23:58:34,930][06887] Signal inference workers to stop experience collection... (23100 times) [2024-06-27 23:58:34,984][06909] InferenceWorker_p0-w0: stopping experience collection (23100 times) [2024-06-27 23:58:35,043][06887] Signal inference workers to resume experience collection... (23100 times) [2024-06-27 23:58:35,044][06909] InferenceWorker_p0-w0: resuming experience collection (23100 times) [2024-06-27 23:58:38,310][06909] Updated weights for policy 0, policy_version 104633 (0.0031) [2024-06-27 23:58:38,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.9, 300 sec: 44153.8). Total num frames: 1714339840. Throughput: 0: 44073.3. Samples: 1617288280. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 23:58:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:58:42,211][06909] Updated weights for policy 0, policy_version 104643 (0.0038) [2024-06-27 23:58:43,852][06674] Fps is (10 sec: 45865.6, 60 sec: 43962.2, 300 sec: 44042.1). Total num frames: 1714552832. Throughput: 0: 44256.3. Samples: 1617429660. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 23:58:43,852][06674] Avg episode reward: [(0, '0.417')] [2024-06-27 23:58:45,474][06909] Updated weights for policy 0, policy_version 104653 (0.0032) [2024-06-27 23:58:48,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 1714749440. Throughput: 0: 44297.3. Samples: 1617689980. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 23:58:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:58:49,386][06909] Updated weights for policy 0, policy_version 104663 (0.0030) [2024-06-27 23:58:52,769][06909] Updated weights for policy 0, policy_version 104673 (0.0031) [2024-06-27 23:58:53,850][06674] Fps is (10 sec: 44246.2, 60 sec: 44509.9, 300 sec: 44154.4). Total num frames: 1714995200. Throughput: 0: 43987.7. Samples: 1617948140. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 23:58:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-27 23:58:56,917][06909] Updated weights for policy 0, policy_version 104683 (0.0039) [2024-06-27 23:58:58,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 1715208192. Throughput: 0: 44142.7. Samples: 1618091160. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 23:58:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:59:00,307][06909] Updated weights for policy 0, policy_version 104693 (0.0034) [2024-06-27 23:59:03,850][06674] Fps is (10 sec: 42597.8, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 1715421184. Throughput: 0: 44298.6. Samples: 1618351160. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 23:59:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:59:04,150][06909] Updated weights for policy 0, policy_version 104703 (0.0034) [2024-06-27 23:59:08,161][06909] Updated weights for policy 0, policy_version 104713 (0.0026) [2024-06-27 23:59:08,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.8, 300 sec: 44209.0). Total num frames: 1715666944. Throughput: 0: 44142.6. Samples: 1618616640. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 23:59:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-27 23:59:11,645][06909] Updated weights for policy 0, policy_version 104723 (0.0032) [2024-06-27 23:59:13,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1715863552. Throughput: 0: 44139.5. Samples: 1618751260. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 23:59:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:59:15,423][06909] Updated weights for policy 0, policy_version 104733 (0.0028) [2024-06-27 23:59:18,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 1716092928. Throughput: 0: 44273.3. Samples: 1619016500. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 23:59:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:59:19,074][06909] Updated weights for policy 0, policy_version 104743 (0.0035) [2024-06-27 23:59:22,742][06909] Updated weights for policy 0, policy_version 104753 (0.0025) [2024-06-27 23:59:23,850][06674] Fps is (10 sec: 47513.7, 60 sec: 44509.9, 300 sec: 44209.0). Total num frames: 1716338688. Throughput: 0: 44105.7. Samples: 1619273040. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 23:59:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:59:26,326][06909] Updated weights for policy 0, policy_version 104763 (0.0033) [2024-06-27 23:59:28,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 1716535296. Throughput: 0: 44042.5. Samples: 1619411480. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-27 23:59:28,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:59:30,247][06909] Updated weights for policy 0, policy_version 104773 (0.0036) [2024-06-27 23:59:33,850][06674] Fps is (10 sec: 40960.3, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1716748288. Throughput: 0: 44116.1. Samples: 1619675200. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 23:59:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-27 23:59:33,976][06909] Updated weights for policy 0, policy_version 104783 (0.0029) [2024-06-27 23:59:37,441][06909] Updated weights for policy 0, policy_version 104793 (0.0028) [2024-06-27 23:59:38,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 1716977664. Throughput: 0: 44177.7. Samples: 1619936140. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 23:59:38,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-27 23:59:41,141][06909] Updated weights for policy 0, policy_version 104803 (0.0032) [2024-06-27 23:59:43,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44238.4, 300 sec: 44209.3). Total num frames: 1717207040. Throughput: 0: 44086.8. Samples: 1620075060. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 23:59:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-27 23:59:45,023][06909] Updated weights for policy 0, policy_version 104813 (0.0038) [2024-06-27 23:59:48,800][06887] Signal inference workers to stop experience collection... (23150 times) [2024-06-27 23:59:48,800][06887] Signal inference workers to resume experience collection... (23150 times) [2024-06-27 23:59:48,810][06909] Updated weights for policy 0, policy_version 104823 (0.0038) [2024-06-27 23:59:48,829][06909] InferenceWorker_p0-w0: stopping experience collection (23150 times) [2024-06-27 23:59:48,830][06909] InferenceWorker_p0-w0: resuming experience collection (23150 times) [2024-06-27 23:59:48,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 1717420032. Throughput: 0: 44297.9. Samples: 1620344560. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 23:59:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:59:49,081][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000104825_1717452800.pth... [2024-06-27 23:59:49,129][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000104177_1706835968.pth [2024-06-27 23:59:52,346][06909] Updated weights for policy 0, policy_version 104833 (0.0035) [2024-06-27 23:59:53,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 1717649408. Throughput: 0: 44061.3. Samples: 1620599400. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 23:59:53,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:59:56,138][06909] Updated weights for policy 0, policy_version 104843 (0.0039) [2024-06-27 23:59:58,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 1717846016. Throughput: 0: 44088.9. Samples: 1620735260. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-27 23:59:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-27 23:59:59,584][06909] Updated weights for policy 0, policy_version 104853 (0.0029) [2024-06-28 00:00:03,316][06909] Updated weights for policy 0, policy_version 104863 (0.0041) [2024-06-28 00:00:03,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44509.8, 300 sec: 44209.0). Total num frames: 1718091776. Throughput: 0: 44270.1. Samples: 1621008660. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 00:00:03,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 00:00:06,985][06909] Updated weights for policy 0, policy_version 104873 (0.0022) [2024-06-28 00:00:08,850][06674] Fps is (10 sec: 45874.2, 60 sec: 43963.6, 300 sec: 44153.5). Total num frames: 1718304768. Throughput: 0: 44246.4. Samples: 1621264140. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 00:00:08,856][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 00:00:11,087][06909] Updated weights for policy 0, policy_version 104883 (0.0032) [2024-06-28 00:00:13,850][06674] Fps is (10 sec: 42599.3, 60 sec: 44236.9, 300 sec: 44097.9). Total num frames: 1718517760. Throughput: 0: 44121.8. Samples: 1621396960. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 00:00:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:00:14,608][06909] Updated weights for policy 0, policy_version 104893 (0.0037) [2024-06-28 00:00:18,519][06909] Updated weights for policy 0, policy_version 104903 (0.0031) [2024-06-28 00:00:18,850][06674] Fps is (10 sec: 44237.8, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 1718747136. Throughput: 0: 44114.1. Samples: 1621660340. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 00:00:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:00:22,426][06909] Updated weights for policy 0, policy_version 104913 (0.0034) [2024-06-28 00:00:23,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 1718960128. Throughput: 0: 44096.0. Samples: 1621920460. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 00:00:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:00:25,890][06909] Updated weights for policy 0, policy_version 104923 (0.0031) [2024-06-28 00:00:28,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.7, 300 sec: 44042.7). Total num frames: 1719173120. Throughput: 0: 44059.5. Samples: 1622057740. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 00:00:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:00:29,636][06909] Updated weights for policy 0, policy_version 104933 (0.0025) [2024-06-28 00:00:33,089][06909] Updated weights for policy 0, policy_version 104943 (0.0027) [2024-06-28 00:00:33,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.7, 300 sec: 44209.0). Total num frames: 1719402496. Throughput: 0: 44006.6. Samples: 1622324860. Policy #0 lag: (min: 0.0, avg: 11.8, max: 23.0) [2024-06-28 00:00:33,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 00:00:36,960][06909] Updated weights for policy 0, policy_version 104953 (0.0040) [2024-06-28 00:00:38,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 1719615488. Throughput: 0: 44158.3. Samples: 1622586520. Policy #0 lag: (min: 0.0, avg: 11.8, max: 23.0) [2024-06-28 00:00:38,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:00:40,559][06909] Updated weights for policy 0, policy_version 104963 (0.0026) [2024-06-28 00:00:43,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.6, 300 sec: 44042.6). Total num frames: 1719828480. Throughput: 0: 44125.3. Samples: 1622720900. Policy #0 lag: (min: 0.0, avg: 11.8, max: 23.0) [2024-06-28 00:00:43,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 00:00:44,666][06909] Updated weights for policy 0, policy_version 104973 (0.0031) [2024-06-28 00:00:48,393][06909] Updated weights for policy 0, policy_version 104983 (0.0030) [2024-06-28 00:00:48,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 1720057856. Throughput: 0: 43797.5. Samples: 1622979540. Policy #0 lag: (min: 0.0, avg: 11.8, max: 23.0) [2024-06-28 00:00:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:00:52,425][06909] Updated weights for policy 0, policy_version 104993 (0.0028) [2024-06-28 00:00:53,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43690.8, 300 sec: 44098.0). Total num frames: 1720270848. Throughput: 0: 43927.9. Samples: 1623240880. Policy #0 lag: (min: 0.0, avg: 11.8, max: 23.0) [2024-06-28 00:00:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:00:55,658][06909] Updated weights for policy 0, policy_version 105003 (0.0041) [2024-06-28 00:00:58,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1720483840. Throughput: 0: 43955.9. Samples: 1623374980. Policy #0 lag: (min: 0.0, avg: 11.8, max: 23.0) [2024-06-28 00:00:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:00:59,689][06909] Updated weights for policy 0, policy_version 105013 (0.0026) [2024-06-28 00:01:03,280][06909] Updated weights for policy 0, policy_version 105023 (0.0037) [2024-06-28 00:01:03,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.9, 300 sec: 44153.5). Total num frames: 1720729600. Throughput: 0: 43919.6. Samples: 1623636720. Policy #0 lag: (min: 0.0, avg: 11.8, max: 23.0) [2024-06-28 00:01:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 00:01:06,995][06909] Updated weights for policy 0, policy_version 105033 (0.0035) [2024-06-28 00:01:08,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43690.9, 300 sec: 44098.0). Total num frames: 1720926208. Throughput: 0: 44026.7. Samples: 1623901660. Policy #0 lag: (min: 0.0, avg: 11.8, max: 23.0) [2024-06-28 00:01:08,856][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:01:09,983][06887] Signal inference workers to stop experience collection... (23200 times) [2024-06-28 00:01:09,984][06887] Signal inference workers to resume experience collection... (23200 times) [2024-06-28 00:01:10,021][06909] InferenceWorker_p0-w0: stopping experience collection (23200 times) [2024-06-28 00:01:10,021][06909] InferenceWorker_p0-w0: resuming experience collection (23200 times) [2024-06-28 00:01:10,555][06909] Updated weights for policy 0, policy_version 105043 (0.0033) [2024-06-28 00:01:13,850][06674] Fps is (10 sec: 42597.7, 60 sec: 43963.6, 300 sec: 44098.0). Total num frames: 1721155584. Throughput: 0: 43883.4. Samples: 1624032500. Policy #0 lag: (min: 0.0, avg: 11.8, max: 23.0) [2024-06-28 00:01:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:01:14,186][06909] Updated weights for policy 0, policy_version 105053 (0.0032) [2024-06-28 00:01:17,819][06909] Updated weights for policy 0, policy_version 105063 (0.0033) [2024-06-28 00:01:18,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 1721384960. Throughput: 0: 43937.3. Samples: 1624302040. Policy #0 lag: (min: 0.0, avg: 11.8, max: 23.0) [2024-06-28 00:01:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:01:21,725][06909] Updated weights for policy 0, policy_version 105073 (0.0024) [2024-06-28 00:01:23,850][06674] Fps is (10 sec: 42599.2, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1721581568. Throughput: 0: 43835.1. Samples: 1624559100. Policy #0 lag: (min: 0.0, avg: 11.8, max: 23.0) [2024-06-28 00:01:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:01:25,185][06909] Updated weights for policy 0, policy_version 105083 (0.0036) [2024-06-28 00:01:28,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 1721827328. Throughput: 0: 43851.1. Samples: 1624694200. Policy #0 lag: (min: 0.0, avg: 11.8, max: 23.0) [2024-06-28 00:01:28,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 00:01:29,599][06909] Updated weights for policy 0, policy_version 105093 (0.0034) [2024-06-28 00:01:32,602][06909] Updated weights for policy 0, policy_version 105103 (0.0027) [2024-06-28 00:01:33,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.8, 300 sec: 44098.3). Total num frames: 1722040320. Throughput: 0: 43921.0. Samples: 1624955980. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 00:01:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:01:36,891][06909] Updated weights for policy 0, policy_version 105113 (0.0034) [2024-06-28 00:01:38,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43963.6, 300 sec: 44097.9). Total num frames: 1722253312. Throughput: 0: 44110.5. Samples: 1625225860. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 00:01:38,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:01:40,215][06909] Updated weights for policy 0, policy_version 105123 (0.0029) [2024-06-28 00:01:43,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1722466304. Throughput: 0: 44067.1. Samples: 1625358000. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 00:01:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:01:44,054][06909] Updated weights for policy 0, policy_version 105133 (0.0024) [2024-06-28 00:01:47,591][06909] Updated weights for policy 0, policy_version 105143 (0.0038) [2024-06-28 00:01:48,850][06674] Fps is (10 sec: 44237.8, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 1722695680. Throughput: 0: 44103.6. Samples: 1625621380. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 00:01:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:01:48,944][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000105146_1722712064.pth... [2024-06-28 00:01:49,006][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000104499_1712111616.pth [2024-06-28 00:01:51,392][06909] Updated weights for policy 0, policy_version 105153 (0.0043) [2024-06-28 00:01:53,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 1722925056. Throughput: 0: 44227.8. Samples: 1625891920. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 00:01:53,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:01:54,895][06909] Updated weights for policy 0, policy_version 105163 (0.0033) [2024-06-28 00:01:58,850][06674] Fps is (10 sec: 44236.1, 60 sec: 44236.8, 300 sec: 44098.3). Total num frames: 1723138048. Throughput: 0: 44203.1. Samples: 1626021640. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 00:01:58,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:01:59,058][06909] Updated weights for policy 0, policy_version 105173 (0.0029) [2024-06-28 00:02:02,724][06909] Updated weights for policy 0, policy_version 105183 (0.0027) [2024-06-28 00:02:03,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.6, 300 sec: 44153.5). Total num frames: 1723367424. Throughput: 0: 43925.7. Samples: 1626278700. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 00:02:03,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:02:06,270][06909] Updated weights for policy 0, policy_version 105193 (0.0039) [2024-06-28 00:02:08,850][06674] Fps is (10 sec: 44237.5, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1723580416. Throughput: 0: 44323.1. Samples: 1626553640. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 00:02:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:02:09,846][06909] Updated weights for policy 0, policy_version 105203 (0.0034) [2024-06-28 00:02:13,850][06674] Fps is (10 sec: 42599.2, 60 sec: 43963.9, 300 sec: 44098.0). Total num frames: 1723793408. Throughput: 0: 44144.5. Samples: 1626680700. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 00:02:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 00:02:13,921][06909] Updated weights for policy 0, policy_version 105213 (0.0034) [2024-06-28 00:02:17,465][06909] Updated weights for policy 0, policy_version 105223 (0.0041) [2024-06-28 00:02:18,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1724006400. Throughput: 0: 44138.2. Samples: 1626942200. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 00:02:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:02:21,035][06909] Updated weights for policy 0, policy_version 105233 (0.0024) [2024-06-28 00:02:23,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 1724252160. Throughput: 0: 44141.5. Samples: 1627212220. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 00:02:23,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-28 00:02:24,866][06909] Updated weights for policy 0, policy_version 105243 (0.0032) [2024-06-28 00:02:28,398][06909] Updated weights for policy 0, policy_version 105253 (0.0026) [2024-06-28 00:02:28,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 1724465152. Throughput: 0: 44100.4. Samples: 1627342520. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 00:02:28,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 00:02:31,674][06887] Signal inference workers to stop experience collection... (23250 times) [2024-06-28 00:02:31,674][06887] Signal inference workers to resume experience collection... (23250 times) [2024-06-28 00:02:31,720][06909] InferenceWorker_p0-w0: stopping experience collection (23250 times) [2024-06-28 00:02:31,720][06909] InferenceWorker_p0-w0: resuming experience collection (23250 times) [2024-06-28 00:02:32,121][06909] Updated weights for policy 0, policy_version 105263 (0.0042) [2024-06-28 00:02:33,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1724678144. Throughput: 0: 44132.0. Samples: 1627607320. Policy #0 lag: (min: 1.0, avg: 10.6, max: 22.0) [2024-06-28 00:02:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:02:36,145][06909] Updated weights for policy 0, policy_version 105273 (0.0039) [2024-06-28 00:02:38,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43963.7, 300 sec: 43986.8). Total num frames: 1724891136. Throughput: 0: 43819.9. Samples: 1627863820. Policy #0 lag: (min: 1.0, avg: 10.6, max: 22.0) [2024-06-28 00:02:38,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:02:39,868][06909] Updated weights for policy 0, policy_version 105283 (0.0034) [2024-06-28 00:02:43,541][06909] Updated weights for policy 0, policy_version 105293 (0.0033) [2024-06-28 00:02:43,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 1725120512. Throughput: 0: 43793.4. Samples: 1627992340. Policy #0 lag: (min: 1.0, avg: 10.6, max: 22.0) [2024-06-28 00:02:43,856][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:02:47,258][06909] Updated weights for policy 0, policy_version 105303 (0.0033) [2024-06-28 00:02:48,850][06674] Fps is (10 sec: 45876.3, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1725349888. Throughput: 0: 44104.1. Samples: 1628263380. Policy #0 lag: (min: 1.0, avg: 10.6, max: 22.0) [2024-06-28 00:02:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:02:50,905][06909] Updated weights for policy 0, policy_version 105313 (0.0033) [2024-06-28 00:02:53,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1725562880. Throughput: 0: 43832.8. Samples: 1628526120. Policy #0 lag: (min: 1.0, avg: 10.6, max: 22.0) [2024-06-28 00:02:53,856][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:02:54,669][06909] Updated weights for policy 0, policy_version 105323 (0.0025) [2024-06-28 00:02:58,008][06909] Updated weights for policy 0, policy_version 105333 (0.0036) [2024-06-28 00:02:58,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 1725792256. Throughput: 0: 44029.3. Samples: 1628662020. Policy #0 lag: (min: 1.0, avg: 10.6, max: 22.0) [2024-06-28 00:02:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:03:01,902][06909] Updated weights for policy 0, policy_version 105343 (0.0030) [2024-06-28 00:03:03,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.9, 300 sec: 43986.9). Total num frames: 1726005248. Throughput: 0: 44104.1. Samples: 1628926880. Policy #0 lag: (min: 1.0, avg: 10.6, max: 22.0) [2024-06-28 00:03:03,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-28 00:03:05,357][06909] Updated weights for policy 0, policy_version 105353 (0.0035) [2024-06-28 00:03:08,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1726218240. Throughput: 0: 44245.8. Samples: 1629203280. Policy #0 lag: (min: 1.0, avg: 10.6, max: 22.0) [2024-06-28 00:03:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:03:09,357][06909] Updated weights for policy 0, policy_version 105363 (0.0030) [2024-06-28 00:03:13,053][06909] Updated weights for policy 0, policy_version 105373 (0.0035) [2024-06-28 00:03:13,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44509.9, 300 sec: 44209.0). Total num frames: 1726464000. Throughput: 0: 44331.3. Samples: 1629337420. Policy #0 lag: (min: 1.0, avg: 10.6, max: 22.0) [2024-06-28 00:03:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:03:16,799][06909] Updated weights for policy 0, policy_version 105383 (0.0044) [2024-06-28 00:03:18,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44509.8, 300 sec: 44097.9). Total num frames: 1726676992. Throughput: 0: 44201.3. Samples: 1629596380. Policy #0 lag: (min: 1.0, avg: 10.6, max: 22.0) [2024-06-28 00:03:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:03:20,356][06909] Updated weights for policy 0, policy_version 105393 (0.0025) [2024-06-28 00:03:23,856][06674] Fps is (10 sec: 44211.4, 60 sec: 44232.5, 300 sec: 44152.6). Total num frames: 1726906368. Throughput: 0: 44331.5. Samples: 1629858980. Policy #0 lag: (min: 1.0, avg: 10.6, max: 22.0) [2024-06-28 00:03:23,856][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:03:24,074][06909] Updated weights for policy 0, policy_version 105403 (0.0030) [2024-06-28 00:03:27,800][06909] Updated weights for policy 0, policy_version 105413 (0.0025) [2024-06-28 00:03:28,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 1727102976. Throughput: 0: 44396.4. Samples: 1629990180. Policy #0 lag: (min: 1.0, avg: 10.6, max: 22.0) [2024-06-28 00:03:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:03:32,025][06909] Updated weights for policy 0, policy_version 105423 (0.0023) [2024-06-28 00:03:33,850][06674] Fps is (10 sec: 42622.9, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1727332352. Throughput: 0: 44050.3. Samples: 1630245640. Policy #0 lag: (min: 1.0, avg: 10.1, max: 25.0) [2024-06-28 00:03:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:03:35,317][06909] Updated weights for policy 0, policy_version 105433 (0.0035) [2024-06-28 00:03:38,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.8, 300 sec: 43987.2). Total num frames: 1727528960. Throughput: 0: 44250.1. Samples: 1630517380. Policy #0 lag: (min: 1.0, avg: 10.1, max: 25.0) [2024-06-28 00:03:38,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:03:39,233][06909] Updated weights for policy 0, policy_version 105443 (0.0027) [2024-06-28 00:03:42,819][06909] Updated weights for policy 0, policy_version 105453 (0.0030) [2024-06-28 00:03:43,850][06674] Fps is (10 sec: 44235.9, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 1727774720. Throughput: 0: 44159.9. Samples: 1630649220. Policy #0 lag: (min: 1.0, avg: 10.1, max: 25.0) [2024-06-28 00:03:43,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:03:46,582][06909] Updated weights for policy 0, policy_version 105463 (0.0037) [2024-06-28 00:03:48,850][06674] Fps is (10 sec: 47513.8, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 1728004096. Throughput: 0: 44123.9. Samples: 1630912460. Policy #0 lag: (min: 1.0, avg: 10.1, max: 25.0) [2024-06-28 00:03:48,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:03:48,877][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000105469_1728004096.pth... [2024-06-28 00:03:48,965][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000104825_1717452800.pth [2024-06-28 00:03:50,269][06909] Updated weights for policy 0, policy_version 105473 (0.0056) [2024-06-28 00:03:53,850][06674] Fps is (10 sec: 44237.4, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1728217088. Throughput: 0: 43901.3. Samples: 1631178840. Policy #0 lag: (min: 1.0, avg: 10.1, max: 25.0) [2024-06-28 00:03:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:03:54,103][06909] Updated weights for policy 0, policy_version 105483 (0.0034) [2024-06-28 00:03:58,169][06909] Updated weights for policy 0, policy_version 105493 (0.0039) [2024-06-28 00:03:58,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 1728430080. Throughput: 0: 43790.2. Samples: 1631307980. Policy #0 lag: (min: 1.0, avg: 10.1, max: 25.0) [2024-06-28 00:03:58,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-28 00:04:01,489][06909] Updated weights for policy 0, policy_version 105503 (0.0039) [2024-06-28 00:04:03,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1728643072. Throughput: 0: 43912.0. Samples: 1631572420. Policy #0 lag: (min: 1.0, avg: 10.1, max: 25.0) [2024-06-28 00:04:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:04:05,317][06909] Updated weights for policy 0, policy_version 105513 (0.0029) [2024-06-28 00:04:08,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.7, 300 sec: 44098.0). Total num frames: 1728872448. Throughput: 0: 44128.7. Samples: 1631844520. Policy #0 lag: (min: 1.0, avg: 10.1, max: 25.0) [2024-06-28 00:04:08,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 00:04:09,057][06909] Updated weights for policy 0, policy_version 105523 (0.0034) [2024-06-28 00:04:12,761][06909] Updated weights for policy 0, policy_version 105533 (0.0031) [2024-06-28 00:04:13,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43690.5, 300 sec: 44042.4). Total num frames: 1729085440. Throughput: 0: 44157.7. Samples: 1631977280. Policy #0 lag: (min: 1.0, avg: 10.1, max: 25.0) [2024-06-28 00:04:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:04:13,872][06887] Signal inference workers to stop experience collection... (23300 times) [2024-06-28 00:04:13,928][06909] InferenceWorker_p0-w0: stopping experience collection (23300 times) [2024-06-28 00:04:13,978][06887] Signal inference workers to resume experience collection... (23300 times) [2024-06-28 00:04:13,978][06909] InferenceWorker_p0-w0: resuming experience collection (23300 times) [2024-06-28 00:04:16,406][06909] Updated weights for policy 0, policy_version 105543 (0.0022) [2024-06-28 00:04:18,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 1729298432. Throughput: 0: 44238.6. Samples: 1632236380. Policy #0 lag: (min: 1.0, avg: 10.1, max: 25.0) [2024-06-28 00:04:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:04:20,487][06909] Updated weights for policy 0, policy_version 105553 (0.0036) [2024-06-28 00:04:23,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43694.8, 300 sec: 44042.4). Total num frames: 1729527808. Throughput: 0: 44131.7. Samples: 1632503300. Policy #0 lag: (min: 1.0, avg: 10.1, max: 25.0) [2024-06-28 00:04:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:04:23,975][06909] Updated weights for policy 0, policy_version 105563 (0.0043) [2024-06-28 00:04:27,863][06909] Updated weights for policy 0, policy_version 105573 (0.0026) [2024-06-28 00:04:28,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 1729757184. Throughput: 0: 43996.1. Samples: 1632629040. Policy #0 lag: (min: 1.0, avg: 10.1, max: 25.0) [2024-06-28 00:04:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:04:31,605][06909] Updated weights for policy 0, policy_version 105583 (0.0042) [2024-06-28 00:04:33,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 1729953792. Throughput: 0: 43958.8. Samples: 1632890600. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 00:04:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:04:35,653][06909] Updated weights for policy 0, policy_version 105593 (0.0037) [2024-06-28 00:04:38,791][06909] Updated weights for policy 0, policy_version 105603 (0.0027) [2024-06-28 00:04:38,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44510.0, 300 sec: 44042.4). Total num frames: 1730199552. Throughput: 0: 43932.9. Samples: 1633155820. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 00:04:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:04:42,889][06909] Updated weights for policy 0, policy_version 105613 (0.0046) [2024-06-28 00:04:43,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43963.9, 300 sec: 44042.4). Total num frames: 1730412544. Throughput: 0: 44100.5. Samples: 1633292500. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 00:04:43,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-28 00:04:46,087][06909] Updated weights for policy 0, policy_version 105623 (0.0037) [2024-06-28 00:04:48,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.8, 300 sec: 43986.9). Total num frames: 1730625536. Throughput: 0: 44058.7. Samples: 1633555060. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 00:04:48,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 00:04:50,039][06909] Updated weights for policy 0, policy_version 105633 (0.0041) [2024-06-28 00:04:53,633][06909] Updated weights for policy 0, policy_version 105643 (0.0033) [2024-06-28 00:04:53,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 1730854912. Throughput: 0: 43869.8. Samples: 1633818660. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 00:04:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:04:58,009][06909] Updated weights for policy 0, policy_version 105653 (0.0041) [2024-06-28 00:04:58,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1731067904. Throughput: 0: 43781.9. Samples: 1633947460. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 00:04:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:05:01,047][06909] Updated weights for policy 0, policy_version 105663 (0.0032) [2024-06-28 00:05:03,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1731280896. Throughput: 0: 43900.8. Samples: 1634211920. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 00:05:03,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:05:05,197][06909] Updated weights for policy 0, policy_version 105673 (0.0031) [2024-06-28 00:05:08,361][06909] Updated weights for policy 0, policy_version 105683 (0.0042) [2024-06-28 00:05:08,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 1731526656. Throughput: 0: 43828.1. Samples: 1634475560. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 00:05:08,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-28 00:05:12,799][06909] Updated weights for policy 0, policy_version 105693 (0.0032) [2024-06-28 00:05:13,850][06674] Fps is (10 sec: 45875.7, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 1731739648. Throughput: 0: 44261.9. Samples: 1634620820. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 00:05:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 00:05:15,571][06909] Updated weights for policy 0, policy_version 105703 (0.0039) [2024-06-28 00:05:18,850][06674] Fps is (10 sec: 39321.5, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 1731919872. Throughput: 0: 44122.7. Samples: 1634876120. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 00:05:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:05:20,030][06909] Updated weights for policy 0, policy_version 105713 (0.0027) [2024-06-28 00:05:22,496][06887] Signal inference workers to stop experience collection... (23350 times) [2024-06-28 00:05:22,550][06909] InferenceWorker_p0-w0: stopping experience collection (23350 times) [2024-06-28 00:05:22,550][06887] Signal inference workers to resume experience collection... (23350 times) [2024-06-28 00:05:22,563][06909] InferenceWorker_p0-w0: resuming experience collection (23350 times) [2024-06-28 00:05:23,065][06909] Updated weights for policy 0, policy_version 105723 (0.0025) [2024-06-28 00:05:23,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 1732198400. Throughput: 0: 44055.1. Samples: 1635138300. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 00:05:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:05:27,245][06909] Updated weights for policy 0, policy_version 105733 (0.0046) [2024-06-28 00:05:28,850][06674] Fps is (10 sec: 47512.9, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1732395008. Throughput: 0: 44115.9. Samples: 1635277720. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 00:05:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:05:30,539][06909] Updated weights for policy 0, policy_version 105743 (0.0037) [2024-06-28 00:05:33,850][06674] Fps is (10 sec: 40960.4, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 1732608000. Throughput: 0: 44047.1. Samples: 1635537180. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 00:05:33,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-28 00:05:34,931][06909] Updated weights for policy 0, policy_version 105753 (0.0027) [2024-06-28 00:05:38,183][06909] Updated weights for policy 0, policy_version 105763 (0.0037) [2024-06-28 00:05:38,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1732853760. Throughput: 0: 44046.7. Samples: 1635800760. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 00:05:38,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 00:05:42,210][06909] Updated weights for policy 0, policy_version 105773 (0.0043) [2024-06-28 00:05:43,850][06674] Fps is (10 sec: 45874.6, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 1733066752. Throughput: 0: 44190.6. Samples: 1635936040. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 00:05:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:05:45,386][06909] Updated weights for policy 0, policy_version 105783 (0.0043) [2024-06-28 00:05:48,850][06674] Fps is (10 sec: 39321.6, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 1733246976. Throughput: 0: 44117.0. Samples: 1636197180. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 00:05:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:05:48,878][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000105790_1733263360.pth... [2024-06-28 00:05:48,925][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000105146_1722712064.pth [2024-06-28 00:05:49,901][06909] Updated weights for policy 0, policy_version 105793 (0.0027) [2024-06-28 00:05:52,768][06909] Updated weights for policy 0, policy_version 105803 (0.0030) [2024-06-28 00:05:53,850][06674] Fps is (10 sec: 44237.4, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 1733509120. Throughput: 0: 43861.7. Samples: 1636449340. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 00:05:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:05:57,263][06909] Updated weights for policy 0, policy_version 105813 (0.0032) [2024-06-28 00:05:58,852][06674] Fps is (10 sec: 47504.1, 60 sec: 44235.3, 300 sec: 44042.1). Total num frames: 1733722112. Throughput: 0: 43846.9. Samples: 1636594020. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 00:05:58,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:06:00,403][06909] Updated weights for policy 0, policy_version 105823 (0.0020) [2024-06-28 00:06:03,850][06674] Fps is (10 sec: 42598.2, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 1733935104. Throughput: 0: 44023.5. Samples: 1636857180. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 00:06:03,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 00:06:04,514][06909] Updated weights for policy 0, policy_version 105833 (0.0029) [2024-06-28 00:06:07,821][06909] Updated weights for policy 0, policy_version 105843 (0.0040) [2024-06-28 00:06:08,850][06674] Fps is (10 sec: 44246.2, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 1734164480. Throughput: 0: 44026.3. Samples: 1637119480. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 00:06:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:06:12,228][06909] Updated weights for policy 0, policy_version 105853 (0.0041) [2024-06-28 00:06:13,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1734377472. Throughput: 0: 44051.7. Samples: 1637260040. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 00:06:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 00:06:15,121][06909] Updated weights for policy 0, policy_version 105863 (0.0045) [2024-06-28 00:06:18,850][06674] Fps is (10 sec: 42597.8, 60 sec: 44509.8, 300 sec: 44097.9). Total num frames: 1734590464. Throughput: 0: 44156.3. Samples: 1637524220. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 00:06:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:06:19,673][06909] Updated weights for policy 0, policy_version 105873 (0.0036) [2024-06-28 00:06:22,694][06909] Updated weights for policy 0, policy_version 105883 (0.0022) [2024-06-28 00:06:23,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 1734836224. Throughput: 0: 43931.1. Samples: 1637777660. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 00:06:23,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:06:27,259][06909] Updated weights for policy 0, policy_version 105893 (0.0024) [2024-06-28 00:06:28,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1735032832. Throughput: 0: 44114.8. Samples: 1637921200. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 00:06:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:06:30,039][06909] Updated weights for policy 0, policy_version 105903 (0.0032) [2024-06-28 00:06:33,856][06674] Fps is (10 sec: 40935.5, 60 sec: 43959.3, 300 sec: 44041.5). Total num frames: 1735245824. Throughput: 0: 44091.5. Samples: 1638181560. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 00:06:33,856][06674] Avg episode reward: [(0, '0.418')] [2024-06-28 00:06:34,515][06909] Updated weights for policy 0, policy_version 105913 (0.0037) [2024-06-28 00:06:37,387][06909] Updated weights for policy 0, policy_version 105923 (0.0032) [2024-06-28 00:06:38,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 1735491584. Throughput: 0: 44110.2. Samples: 1638434300. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 00:06:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:06:41,861][06909] Updated weights for policy 0, policy_version 105933 (0.0034) [2024-06-28 00:06:43,850][06674] Fps is (10 sec: 45902.8, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 1735704576. Throughput: 0: 44102.4. Samples: 1638578540. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 00:06:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:06:44,979][06909] Updated weights for policy 0, policy_version 105943 (0.0029) [2024-06-28 00:06:48,851][06674] Fps is (10 sec: 40954.0, 60 sec: 44235.8, 300 sec: 43986.7). Total num frames: 1735901184. Throughput: 0: 44043.5. Samples: 1638839200. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 00:06:48,852][06674] Avg episode reward: [(0, '0.418')] [2024-06-28 00:06:49,566][06909] Updated weights for policy 0, policy_version 105953 (0.0042) [2024-06-28 00:06:50,114][06887] Signal inference workers to stop experience collection... (23400 times) [2024-06-28 00:06:50,114][06887] Signal inference workers to resume experience collection... (23400 times) [2024-06-28 00:06:50,156][06909] InferenceWorker_p0-w0: stopping experience collection (23400 times) [2024-06-28 00:06:50,157][06909] InferenceWorker_p0-w0: resuming experience collection (23400 times) [2024-06-28 00:06:52,427][06909] Updated weights for policy 0, policy_version 105963 (0.0026) [2024-06-28 00:06:53,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 1736163328. Throughput: 0: 43936.7. Samples: 1639096640. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 00:06:53,851][06674] Avg episode reward: [(0, '0.418')] [2024-06-28 00:06:56,850][06909] Updated weights for policy 0, policy_version 105973 (0.0039) [2024-06-28 00:06:58,850][06674] Fps is (10 sec: 44243.3, 60 sec: 43692.2, 300 sec: 43986.9). Total num frames: 1736343552. Throughput: 0: 44006.2. Samples: 1639240320. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 00:06:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:06:59,853][06909] Updated weights for policy 0, policy_version 105983 (0.0037) [2024-06-28 00:07:03,850][06674] Fps is (10 sec: 39322.3, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1736556544. Throughput: 0: 43937.9. Samples: 1639501420. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 00:07:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:07:04,226][06909] Updated weights for policy 0, policy_version 105993 (0.0030) [2024-06-28 00:07:07,398][06909] Updated weights for policy 0, policy_version 106003 (0.0030) [2024-06-28 00:07:08,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1736818688. Throughput: 0: 43985.9. Samples: 1639757020. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 00:07:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:07:11,535][06909] Updated weights for policy 0, policy_version 106013 (0.0037) [2024-06-28 00:07:13,850][06674] Fps is (10 sec: 47513.1, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 1737031680. Throughput: 0: 43873.2. Samples: 1639895500. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 00:07:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:07:14,779][06909] Updated weights for policy 0, policy_version 106023 (0.0031) [2024-06-28 00:07:18,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1737228288. Throughput: 0: 43976.9. Samples: 1640160260. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 00:07:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:07:19,066][06909] Updated weights for policy 0, policy_version 106033 (0.0038) [2024-06-28 00:07:22,233][06909] Updated weights for policy 0, policy_version 106043 (0.0040) [2024-06-28 00:07:23,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 1737474048. Throughput: 0: 44146.2. Samples: 1640420880. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 00:07:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:07:26,635][06909] Updated weights for policy 0, policy_version 106053 (0.0040) [2024-06-28 00:07:28,850][06674] Fps is (10 sec: 45875.9, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1737687040. Throughput: 0: 44125.9. Samples: 1640564200. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 00:07:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:07:29,494][06909] Updated weights for policy 0, policy_version 106063 (0.0028) [2024-06-28 00:07:33,761][06909] Updated weights for policy 0, policy_version 106073 (0.0027) [2024-06-28 00:07:33,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44241.3, 300 sec: 44098.0). Total num frames: 1737900032. Throughput: 0: 44261.0. Samples: 1640830880. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 00:07:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:07:36,731][06909] Updated weights for policy 0, policy_version 106083 (0.0026) [2024-06-28 00:07:38,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 1738129408. Throughput: 0: 44297.9. Samples: 1641090040. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 00:07:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:07:40,870][06909] Updated weights for policy 0, policy_version 106093 (0.0027) [2024-06-28 00:07:43,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 1738358784. Throughput: 0: 44162.1. Samples: 1641227620. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 00:07:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:07:44,215][06909] Updated weights for policy 0, policy_version 106103 (0.0026) [2024-06-28 00:07:48,197][06909] Updated weights for policy 0, policy_version 106113 (0.0040) [2024-06-28 00:07:48,850][06674] Fps is (10 sec: 44236.2, 60 sec: 44510.8, 300 sec: 44097.9). Total num frames: 1738571776. Throughput: 0: 44410.0. Samples: 1641499880. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 00:07:48,853][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:07:48,864][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000106114_1738571776.pth... [2024-06-28 00:07:48,925][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000105469_1728004096.pth [2024-06-28 00:07:51,491][06909] Updated weights for policy 0, policy_version 106123 (0.0037) [2024-06-28 00:07:53,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 1738801152. Throughput: 0: 44454.6. Samples: 1641757480. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 00:07:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:07:55,941][06909] Updated weights for policy 0, policy_version 106133 (0.0029) [2024-06-28 00:07:58,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44782.9, 300 sec: 44153.5). Total num frames: 1739030528. Throughput: 0: 44400.4. Samples: 1641893520. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 00:07:58,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:07:59,160][06909] Updated weights for policy 0, policy_version 106143 (0.0038) [2024-06-28 00:08:03,650][06909] Updated weights for policy 0, policy_version 106153 (0.0030) [2024-06-28 00:08:03,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44509.8, 300 sec: 44097.9). Total num frames: 1739227136. Throughput: 0: 44320.5. Samples: 1642154680. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 00:08:03,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 00:08:06,696][06909] Updated weights for policy 0, policy_version 106163 (0.0028) [2024-06-28 00:08:08,851][06674] Fps is (10 sec: 42594.2, 60 sec: 43962.9, 300 sec: 44042.3). Total num frames: 1739456512. Throughput: 0: 44394.0. Samples: 1642418660. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 00:08:08,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:08:10,874][06909] Updated weights for policy 0, policy_version 106173 (0.0026) [2024-06-28 00:08:13,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 1739685888. Throughput: 0: 44120.3. Samples: 1642549620. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 00:08:13,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:08:13,947][06909] Updated weights for policy 0, policy_version 106183 (0.0036) [2024-06-28 00:08:18,153][06909] Updated weights for policy 0, policy_version 106193 (0.0033) [2024-06-28 00:08:18,850][06674] Fps is (10 sec: 42602.5, 60 sec: 44236.8, 300 sec: 43987.7). Total num frames: 1739882496. Throughput: 0: 44182.6. Samples: 1642819100. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 00:08:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:08:21,389][06909] Updated weights for policy 0, policy_version 106203 (0.0035) [2024-06-28 00:08:23,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.6, 300 sec: 44097.9). Total num frames: 1740111872. Throughput: 0: 44293.2. Samples: 1643083240. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 00:08:23,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:08:25,277][06909] Updated weights for policy 0, policy_version 106213 (0.0023) [2024-06-28 00:08:26,569][06887] Signal inference workers to stop experience collection... (23450 times) [2024-06-28 00:08:26,625][06887] Signal inference workers to resume experience collection... (23450 times) [2024-06-28 00:08:26,625][06909] InferenceWorker_p0-w0: stopping experience collection (23450 times) [2024-06-28 00:08:26,647][06909] InferenceWorker_p0-w0: resuming experience collection (23450 times) [2024-06-28 00:08:28,720][06909] Updated weights for policy 0, policy_version 106223 (0.0025) [2024-06-28 00:08:28,850][06674] Fps is (10 sec: 49152.4, 60 sec: 44782.9, 300 sec: 44209.0). Total num frames: 1740374016. Throughput: 0: 44272.5. Samples: 1643219880. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 00:08:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:08:33,012][06909] Updated weights for policy 0, policy_version 106233 (0.0036) [2024-06-28 00:08:33,850][06674] Fps is (10 sec: 44234.8, 60 sec: 44236.4, 300 sec: 44153.4). Total num frames: 1740554240. Throughput: 0: 44025.4. Samples: 1643481040. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 00:08:33,851][06674] Avg episode reward: [(0, '0.416')] [2024-06-28 00:08:36,425][06909] Updated weights for policy 0, policy_version 106243 (0.0033) [2024-06-28 00:08:38,850][06674] Fps is (10 sec: 39321.7, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1740767232. Throughput: 0: 44112.5. Samples: 1643742540. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 00:08:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:08:40,457][06909] Updated weights for policy 0, policy_version 106253 (0.0026) [2024-06-28 00:08:43,610][06909] Updated weights for policy 0, policy_version 106263 (0.0025) [2024-06-28 00:08:43,850][06674] Fps is (10 sec: 45877.8, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 1741012992. Throughput: 0: 44145.4. Samples: 1643880060. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 00:08:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:08:47,939][06909] Updated weights for policy 0, policy_version 106273 (0.0036) [2024-06-28 00:08:48,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1741209600. Throughput: 0: 44267.5. Samples: 1644146720. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 00:08:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:08:50,929][06909] Updated weights for policy 0, policy_version 106283 (0.0039) [2024-06-28 00:08:53,851][06674] Fps is (10 sec: 40956.3, 60 sec: 43690.0, 300 sec: 44042.3). Total num frames: 1741422592. Throughput: 0: 44257.9. Samples: 1644410260. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 00:08:53,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:08:55,215][06909] Updated weights for policy 0, policy_version 106293 (0.0038) [2024-06-28 00:08:58,457][06909] Updated weights for policy 0, policy_version 106303 (0.0030) [2024-06-28 00:08:58,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 1741668352. Throughput: 0: 44253.0. Samples: 1644541000. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 00:08:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:09:02,598][06909] Updated weights for policy 0, policy_version 106313 (0.0038) [2024-06-28 00:09:03,850][06674] Fps is (10 sec: 45879.1, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1741881344. Throughput: 0: 44240.5. Samples: 1644809920. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 00:09:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:09:05,788][06909] Updated weights for policy 0, policy_version 106323 (0.0032) [2024-06-28 00:09:08,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43691.5, 300 sec: 44042.4). Total num frames: 1742077952. Throughput: 0: 44153.5. Samples: 1645070140. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 00:09:08,860][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:09:10,135][06909] Updated weights for policy 0, policy_version 106333 (0.0032) [2024-06-28 00:09:13,388][06909] Updated weights for policy 0, policy_version 106343 (0.0037) [2024-06-28 00:09:13,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 1742323712. Throughput: 0: 44142.7. Samples: 1645206300. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 00:09:13,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:09:17,847][06909] Updated weights for policy 0, policy_version 106353 (0.0032) [2024-06-28 00:09:18,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 1742536704. Throughput: 0: 44074.7. Samples: 1645464380. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 00:09:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:09:20,632][06909] Updated weights for policy 0, policy_version 106363 (0.0024) [2024-06-28 00:09:23,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1742749696. Throughput: 0: 44188.9. Samples: 1645731040. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 00:09:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:09:25,103][06909] Updated weights for policy 0, policy_version 106373 (0.0036) [2024-06-28 00:09:28,107][06909] Updated weights for policy 0, policy_version 106383 (0.0021) [2024-06-28 00:09:28,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43690.6, 300 sec: 44209.0). Total num frames: 1742995456. Throughput: 0: 44077.2. Samples: 1645863540. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 00:09:28,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:09:32,327][06909] Updated weights for policy 0, policy_version 106393 (0.0030) [2024-06-28 00:09:33,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44237.2, 300 sec: 44098.0). Total num frames: 1743208448. Throughput: 0: 43993.8. Samples: 1646126440. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 00:09:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:09:35,574][06909] Updated weights for policy 0, policy_version 106403 (0.0031) [2024-06-28 00:09:38,852][06674] Fps is (10 sec: 40951.5, 60 sec: 43962.1, 300 sec: 44042.1). Total num frames: 1743405056. Throughput: 0: 44144.1. Samples: 1646396800. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 00:09:38,853][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:09:39,565][06909] Updated weights for policy 0, policy_version 106413 (0.0038) [2024-06-28 00:09:42,922][06909] Updated weights for policy 0, policy_version 106423 (0.0034) [2024-06-28 00:09:43,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 1743667200. Throughput: 0: 44100.9. Samples: 1646525540. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 00:09:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 00:09:47,213][06909] Updated weights for policy 0, policy_version 106433 (0.0034) [2024-06-28 00:09:48,850][06674] Fps is (10 sec: 45884.7, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 1743863808. Throughput: 0: 44011.0. Samples: 1646790420. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 00:09:48,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:09:48,870][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000106437_1743863808.pth... [2024-06-28 00:09:48,947][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000105790_1733263360.pth [2024-06-28 00:09:50,395][06909] Updated weights for policy 0, policy_version 106443 (0.0031) [2024-06-28 00:09:53,850][06674] Fps is (10 sec: 40960.3, 60 sec: 44237.5, 300 sec: 44098.0). Total num frames: 1744076800. Throughput: 0: 44116.0. Samples: 1647055360. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 00:09:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:09:54,986][06909] Updated weights for policy 0, policy_version 106453 (0.0036) [2024-06-28 00:09:56,329][06887] Signal inference workers to stop experience collection... (23500 times) [2024-06-28 00:09:56,329][06887] Signal inference workers to resume experience collection... (23500 times) [2024-06-28 00:09:56,370][06909] InferenceWorker_p0-w0: stopping experience collection (23500 times) [2024-06-28 00:09:56,370][06909] InferenceWorker_p0-w0: resuming experience collection (23500 times) [2024-06-28 00:09:57,750][06909] Updated weights for policy 0, policy_version 106463 (0.0023) [2024-06-28 00:09:58,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.7, 300 sec: 44209.0). Total num frames: 1744322560. Throughput: 0: 43887.4. Samples: 1647181240. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 00:09:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:10:02,288][06909] Updated weights for policy 0, policy_version 106473 (0.0034) [2024-06-28 00:10:03,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1744519168. Throughput: 0: 44064.8. Samples: 1647447300. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 00:10:03,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:10:05,194][06909] Updated weights for policy 0, policy_version 106483 (0.0021) [2024-06-28 00:10:08,850][06674] Fps is (10 sec: 40960.1, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 1744732160. Throughput: 0: 43951.5. Samples: 1647708860. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 00:10:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:10:09,733][06909] Updated weights for policy 0, policy_version 106493 (0.0039) [2024-06-28 00:10:12,827][06909] Updated weights for policy 0, policy_version 106503 (0.0036) [2024-06-28 00:10:13,850][06674] Fps is (10 sec: 45875.8, 60 sec: 44236.8, 300 sec: 44264.6). Total num frames: 1744977920. Throughput: 0: 44038.8. Samples: 1647845280. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 00:10:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:10:17,045][06909] Updated weights for policy 0, policy_version 106513 (0.0029) [2024-06-28 00:10:18,852][06674] Fps is (10 sec: 44227.9, 60 sec: 43962.2, 300 sec: 43986.6). Total num frames: 1745174528. Throughput: 0: 43852.2. Samples: 1648099880. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 00:10:18,852][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:10:20,185][06909] Updated weights for policy 0, policy_version 106523 (0.0025) [2024-06-28 00:10:23,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1745387520. Throughput: 0: 43654.6. Samples: 1648361160. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 00:10:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:10:24,970][06909] Updated weights for policy 0, policy_version 106533 (0.0028) [2024-06-28 00:10:27,466][06909] Updated weights for policy 0, policy_version 106543 (0.0038) [2024-06-28 00:10:28,850][06674] Fps is (10 sec: 44246.4, 60 sec: 43690.8, 300 sec: 44098.0). Total num frames: 1745616896. Throughput: 0: 43841.9. Samples: 1648498420. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 00:10:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:10:32,259][06909] Updated weights for policy 0, policy_version 106553 (0.0040) [2024-06-28 00:10:33,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1745846272. Throughput: 0: 43786.4. Samples: 1648760800. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 00:10:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:10:35,327][06909] Updated weights for policy 0, policy_version 106563 (0.0027) [2024-06-28 00:10:38,850][06674] Fps is (10 sec: 42597.7, 60 sec: 43965.3, 300 sec: 43986.9). Total num frames: 1746042880. Throughput: 0: 43734.1. Samples: 1649023400. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 00:10:38,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-28 00:10:39,534][06909] Updated weights for policy 0, policy_version 106573 (0.0033) [2024-06-28 00:10:42,735][06909] Updated weights for policy 0, policy_version 106583 (0.0022) [2024-06-28 00:10:43,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.7, 300 sec: 44209.0). Total num frames: 1746288640. Throughput: 0: 43912.1. Samples: 1649157280. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 00:10:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:10:46,892][06909] Updated weights for policy 0, policy_version 106593 (0.0028) [2024-06-28 00:10:48,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1746501632. Throughput: 0: 43844.0. Samples: 1649420280. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 00:10:48,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:10:50,143][06909] Updated weights for policy 0, policy_version 106603 (0.0035) [2024-06-28 00:10:53,852][06674] Fps is (10 sec: 44227.8, 60 sec: 44235.3, 300 sec: 44098.0). Total num frames: 1746731008. Throughput: 0: 44095.0. Samples: 1649693220. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 00:10:53,852][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:10:54,036][06909] Updated weights for policy 0, policy_version 106613 (0.0038) [2024-06-28 00:10:57,506][06909] Updated weights for policy 0, policy_version 106623 (0.0035) [2024-06-28 00:10:58,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.7, 300 sec: 44097.9). Total num frames: 1746944000. Throughput: 0: 43942.1. Samples: 1649822680. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 00:10:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:11:01,714][06909] Updated weights for policy 0, policy_version 106633 (0.0026) [2024-06-28 00:11:03,850][06674] Fps is (10 sec: 44245.5, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 1747173376. Throughput: 0: 44142.9. Samples: 1650086220. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 00:11:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:11:05,012][06909] Updated weights for policy 0, policy_version 106643 (0.0031) [2024-06-28 00:11:08,528][06887] Signal inference workers to stop experience collection... (23550 times) [2024-06-28 00:11:08,535][06887] Signal inference workers to resume experience collection... (23550 times) [2024-06-28 00:11:08,566][06909] InferenceWorker_p0-w0: stopping experience collection (23550 times) [2024-06-28 00:11:08,566][06909] InferenceWorker_p0-w0: resuming experience collection (23550 times) [2024-06-28 00:11:08,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44236.9, 300 sec: 44097.9). Total num frames: 1747386368. Throughput: 0: 44264.9. Samples: 1650353080. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 00:11:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:11:09,286][06909] Updated weights for policy 0, policy_version 106653 (0.0036) [2024-06-28 00:11:12,562][06909] Updated weights for policy 0, policy_version 106663 (0.0027) [2024-06-28 00:11:13,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 1747615744. Throughput: 0: 43903.0. Samples: 1650474060. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 00:11:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:11:16,701][06909] Updated weights for policy 0, policy_version 106673 (0.0041) [2024-06-28 00:11:18,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43965.2, 300 sec: 43986.9). Total num frames: 1747812352. Throughput: 0: 44022.6. Samples: 1650741820. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 00:11:18,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-28 00:11:20,012][06909] Updated weights for policy 0, policy_version 106683 (0.0029) [2024-06-28 00:11:23,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1748025344. Throughput: 0: 44035.7. Samples: 1651005000. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 00:11:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:11:24,102][06909] Updated weights for policy 0, policy_version 106693 (0.0036) [2024-06-28 00:11:27,318][06909] Updated weights for policy 0, policy_version 106703 (0.0028) [2024-06-28 00:11:28,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43963.6, 300 sec: 44098.8). Total num frames: 1748254720. Throughput: 0: 44025.6. Samples: 1651138440. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 00:11:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:11:31,716][06909] Updated weights for policy 0, policy_version 106713 (0.0027) [2024-06-28 00:11:33,850][06674] Fps is (10 sec: 47513.0, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 1748500480. Throughput: 0: 44035.5. Samples: 1651401880. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 00:11:33,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:11:34,961][06909] Updated weights for policy 0, policy_version 106723 (0.0020) [2024-06-28 00:11:38,850][06674] Fps is (10 sec: 44237.6, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 1748697088. Throughput: 0: 44017.1. Samples: 1651673900. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 00:11:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:11:38,976][06909] Updated weights for policy 0, policy_version 106733 (0.0025) [2024-06-28 00:11:42,201][06909] Updated weights for policy 0, policy_version 106743 (0.0038) [2024-06-28 00:11:43,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.6, 300 sec: 44153.7). Total num frames: 1748926464. Throughput: 0: 43754.2. Samples: 1651791620. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 00:11:43,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:11:46,688][06909] Updated weights for policy 0, policy_version 106753 (0.0036) [2024-06-28 00:11:48,850][06674] Fps is (10 sec: 47513.1, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 1749172224. Throughput: 0: 44027.5. Samples: 1652067460. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 00:11:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:11:48,874][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000106761_1749172224.pth... [2024-06-28 00:11:48,921][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000106114_1738571776.pth [2024-06-28 00:11:49,943][06909] Updated weights for policy 0, policy_version 106763 (0.0030) [2024-06-28 00:11:53,852][06674] Fps is (10 sec: 42590.2, 60 sec: 43690.7, 300 sec: 44097.6). Total num frames: 1749352448. Throughput: 0: 43965.6. Samples: 1652331620. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 00:11:53,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 00:11:53,992][06909] Updated weights for policy 0, policy_version 106773 (0.0035) [2024-06-28 00:11:57,360][06909] Updated weights for policy 0, policy_version 106783 (0.0036) [2024-06-28 00:11:58,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 1749598208. Throughput: 0: 44047.1. Samples: 1652456180. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 00:11:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:12:01,221][06909] Updated weights for policy 0, policy_version 106793 (0.0036) [2024-06-28 00:12:03,853][06674] Fps is (10 sec: 45872.1, 60 sec: 43961.8, 300 sec: 44042.0). Total num frames: 1749811200. Throughput: 0: 44067.2. Samples: 1652724960. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 00:12:03,853][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:12:04,563][06909] Updated weights for policy 0, policy_version 106803 (0.0037) [2024-06-28 00:12:08,850][06674] Fps is (10 sec: 40960.7, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1750007808. Throughput: 0: 44063.2. Samples: 1652987840. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 00:12:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:12:08,920][06909] Updated weights for policy 0, policy_version 106813 (0.0033) [2024-06-28 00:12:12,348][06909] Updated weights for policy 0, policy_version 106823 (0.0031) [2024-06-28 00:12:13,850][06674] Fps is (10 sec: 44249.0, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 1750253568. Throughput: 0: 43947.3. Samples: 1653116060. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 00:12:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 00:12:16,313][06909] Updated weights for policy 0, policy_version 106833 (0.0035) [2024-06-28 00:12:18,850][06674] Fps is (10 sec: 49150.9, 60 sec: 44782.8, 300 sec: 44153.5). Total num frames: 1750499328. Throughput: 0: 44130.6. Samples: 1653387760. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 00:12:18,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 00:12:19,981][06909] Updated weights for policy 0, policy_version 106843 (0.0034) [2024-06-28 00:12:23,799][06909] Updated weights for policy 0, policy_version 106853 (0.0038) [2024-06-28 00:12:23,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1750679552. Throughput: 0: 44036.0. Samples: 1653655520. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 00:12:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:12:27,180][06909] Updated weights for policy 0, policy_version 106863 (0.0045) [2024-06-28 00:12:28,850][06674] Fps is (10 sec: 40960.7, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 1750908928. Throughput: 0: 44191.7. Samples: 1653780240. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 00:12:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:12:31,113][06909] Updated weights for policy 0, policy_version 106873 (0.0024) [2024-06-28 00:12:31,876][06887] Signal inference workers to stop experience collection... (23600 times) [2024-06-28 00:12:31,877][06887] Signal inference workers to resume experience collection... (23600 times) [2024-06-28 00:12:31,893][06909] InferenceWorker_p0-w0: stopping experience collection (23600 times) [2024-06-28 00:12:31,893][06909] InferenceWorker_p0-w0: resuming experience collection (23600 times) [2024-06-28 00:12:33,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 1751138304. Throughput: 0: 43994.8. Samples: 1654047220. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 00:12:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:12:34,393][06909] Updated weights for policy 0, policy_version 106883 (0.0032) [2024-06-28 00:12:38,324][06909] Updated weights for policy 0, policy_version 106893 (0.0034) [2024-06-28 00:12:38,850][06674] Fps is (10 sec: 42597.6, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 1751334912. Throughput: 0: 44001.0. Samples: 1654311580. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 00:12:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:12:41,583][06909] Updated weights for policy 0, policy_version 106903 (0.0021) [2024-06-28 00:12:43,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 1751580672. Throughput: 0: 44138.8. Samples: 1654442420. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 00:12:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:12:45,883][06909] Updated weights for policy 0, policy_version 106913 (0.0039) [2024-06-28 00:12:48,850][06674] Fps is (10 sec: 47514.4, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 1751810048. Throughput: 0: 44173.3. Samples: 1654712640. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 00:12:48,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:12:48,939][06909] Updated weights for policy 0, policy_version 106923 (0.0035) [2024-06-28 00:12:53,226][06909] Updated weights for policy 0, policy_version 106933 (0.0030) [2024-06-28 00:12:53,850][06674] Fps is (10 sec: 40958.9, 60 sec: 43965.0, 300 sec: 43931.3). Total num frames: 1751990272. Throughput: 0: 44021.9. Samples: 1654968840. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 00:12:53,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:12:57,086][06909] Updated weights for policy 0, policy_version 106943 (0.0034) [2024-06-28 00:12:58,852][06674] Fps is (10 sec: 42589.4, 60 sec: 43962.3, 300 sec: 44097.6). Total num frames: 1752236032. Throughput: 0: 44113.9. Samples: 1655101280. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 00:12:58,853][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:13:00,774][06909] Updated weights for policy 0, policy_version 106953 (0.0036) [2024-06-28 00:13:03,850][06674] Fps is (10 sec: 47514.7, 60 sec: 44238.8, 300 sec: 44098.1). Total num frames: 1752465408. Throughput: 0: 44041.4. Samples: 1655369620. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 00:13:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:13:04,282][06909] Updated weights for policy 0, policy_version 106963 (0.0027) [2024-06-28 00:13:07,972][06909] Updated weights for policy 0, policy_version 106973 (0.0031) [2024-06-28 00:13:08,850][06674] Fps is (10 sec: 40967.9, 60 sec: 43963.6, 300 sec: 43931.3). Total num frames: 1752645632. Throughput: 0: 43997.6. Samples: 1655635420. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 00:13:08,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:13:11,542][06909] Updated weights for policy 0, policy_version 106983 (0.0029) [2024-06-28 00:13:13,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43963.6, 300 sec: 44097.9). Total num frames: 1752891392. Throughput: 0: 44136.7. Samples: 1655766400. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 00:13:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 00:13:15,127][06909] Updated weights for policy 0, policy_version 106993 (0.0028) [2024-06-28 00:13:18,850][06674] Fps is (10 sec: 47514.5, 60 sec: 43690.8, 300 sec: 44098.0). Total num frames: 1753120768. Throughput: 0: 44099.5. Samples: 1656031700. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 00:13:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:13:18,881][06909] Updated weights for policy 0, policy_version 107003 (0.0031) [2024-06-28 00:13:23,038][06909] Updated weights for policy 0, policy_version 107013 (0.0035) [2024-06-28 00:13:23,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43690.5, 300 sec: 43820.2). Total num frames: 1753300992. Throughput: 0: 44125.3. Samples: 1656297220. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 00:13:23,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:13:26,902][06909] Updated weights for policy 0, policy_version 107023 (0.0045) [2024-06-28 00:13:28,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43963.7, 300 sec: 44042.5). Total num frames: 1753546752. Throughput: 0: 44064.0. Samples: 1656425300. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 00:13:28,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:13:30,248][06909] Updated weights for policy 0, policy_version 107033 (0.0031) [2024-06-28 00:13:33,850][06674] Fps is (10 sec: 47514.7, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 1753776128. Throughput: 0: 43928.9. Samples: 1656689440. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 00:13:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:13:34,305][06909] Updated weights for policy 0, policy_version 107043 (0.0033) [2024-06-28 00:13:37,831][06909] Updated weights for policy 0, policy_version 107053 (0.0035) [2024-06-28 00:13:38,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43963.9, 300 sec: 43931.3). Total num frames: 1753972736. Throughput: 0: 44337.2. Samples: 1656964000. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 00:13:38,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 00:13:38,960][06887] Signal inference workers to stop experience collection... (23650 times) [2024-06-28 00:13:38,984][06909] InferenceWorker_p0-w0: stopping experience collection (23650 times) [2024-06-28 00:13:39,023][06887] Signal inference workers to resume experience collection... (23650 times) [2024-06-28 00:13:39,024][06909] InferenceWorker_p0-w0: resuming experience collection (23650 times) [2024-06-28 00:13:41,474][06909] Updated weights for policy 0, policy_version 107063 (0.0034) [2024-06-28 00:13:43,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 1754202112. Throughput: 0: 44249.1. Samples: 1657092400. Policy #0 lag: (min: 2.0, avg: 9.2, max: 21.0) [2024-06-28 00:13:43,853][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:13:45,057][06909] Updated weights for policy 0, policy_version 107073 (0.0028) [2024-06-28 00:13:48,815][06909] Updated weights for policy 0, policy_version 107083 (0.0032) [2024-06-28 00:13:48,851][06674] Fps is (10 sec: 47509.8, 60 sec: 43963.2, 300 sec: 44153.5). Total num frames: 1754447872. Throughput: 0: 44155.3. Samples: 1657356640. Policy #0 lag: (min: 2.0, avg: 9.2, max: 21.0) [2024-06-28 00:13:48,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:13:48,857][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000107083_1754447872.pth... [2024-06-28 00:13:48,915][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000106437_1743863808.pth [2024-06-28 00:13:52,651][06909] Updated weights for policy 0, policy_version 107093 (0.0037) [2024-06-28 00:13:53,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44237.0, 300 sec: 43986.9). Total num frames: 1754644480. Throughput: 0: 44048.6. Samples: 1657617600. Policy #0 lag: (min: 2.0, avg: 9.2, max: 21.0) [2024-06-28 00:13:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 00:13:56,142][06909] Updated weights for policy 0, policy_version 107103 (0.0032) [2024-06-28 00:13:58,850][06674] Fps is (10 sec: 40963.3, 60 sec: 43692.2, 300 sec: 43986.9). Total num frames: 1754857472. Throughput: 0: 43940.7. Samples: 1657743720. Policy #0 lag: (min: 2.0, avg: 9.2, max: 21.0) [2024-06-28 00:13:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:14:00,165][06909] Updated weights for policy 0, policy_version 107113 (0.0043) [2024-06-28 00:14:03,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43690.7, 300 sec: 44097.9). Total num frames: 1755086848. Throughput: 0: 44067.9. Samples: 1658014760. Policy #0 lag: (min: 2.0, avg: 9.2, max: 21.0) [2024-06-28 00:14:03,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:14:03,934][06909] Updated weights for policy 0, policy_version 107123 (0.0031) [2024-06-28 00:14:07,364][06909] Updated weights for policy 0, policy_version 107133 (0.0028) [2024-06-28 00:14:08,850][06674] Fps is (10 sec: 44236.0, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1755299840. Throughput: 0: 44103.7. Samples: 1658281880. Policy #0 lag: (min: 2.0, avg: 9.2, max: 21.0) [2024-06-28 00:14:08,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 00:14:11,444][06909] Updated weights for policy 0, policy_version 107143 (0.0040) [2024-06-28 00:14:13,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 1755545600. Throughput: 0: 44381.3. Samples: 1658422460. Policy #0 lag: (min: 2.0, avg: 9.2, max: 21.0) [2024-06-28 00:14:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:14:14,904][06909] Updated weights for policy 0, policy_version 107153 (0.0026) [2024-06-28 00:14:18,537][06909] Updated weights for policy 0, policy_version 107163 (0.0044) [2024-06-28 00:14:18,850][06674] Fps is (10 sec: 47513.7, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 1755774976. Throughput: 0: 44274.1. Samples: 1658681780. Policy #0 lag: (min: 2.0, avg: 9.2, max: 21.0) [2024-06-28 00:14:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:14:22,327][06909] Updated weights for policy 0, policy_version 107173 (0.0029) [2024-06-28 00:14:23,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44783.1, 300 sec: 44042.4). Total num frames: 1755987968. Throughput: 0: 44104.8. Samples: 1658948720. Policy #0 lag: (min: 2.0, avg: 9.2, max: 21.0) [2024-06-28 00:14:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:14:25,882][06909] Updated weights for policy 0, policy_version 107183 (0.0035) [2024-06-28 00:14:28,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44509.9, 300 sec: 44097.9). Total num frames: 1756217344. Throughput: 0: 44227.6. Samples: 1659082640. Policy #0 lag: (min: 2.0, avg: 9.2, max: 21.0) [2024-06-28 00:14:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:14:30,067][06909] Updated weights for policy 0, policy_version 107193 (0.0036) [2024-06-28 00:14:33,450][06909] Updated weights for policy 0, policy_version 107203 (0.0038) [2024-06-28 00:14:33,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.8, 300 sec: 44098.3). Total num frames: 1756413952. Throughput: 0: 44187.9. Samples: 1659345060. Policy #0 lag: (min: 2.0, avg: 9.2, max: 21.0) [2024-06-28 00:14:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:14:37,278][06909] Updated weights for policy 0, policy_version 107213 (0.0039) [2024-06-28 00:14:38,855][06674] Fps is (10 sec: 44214.0, 60 sec: 44779.0, 300 sec: 44041.6). Total num frames: 1756659712. Throughput: 0: 44246.9. Samples: 1659608940. Policy #0 lag: (min: 2.0, avg: 9.2, max: 21.0) [2024-06-28 00:14:38,855][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:14:40,925][06909] Updated weights for policy 0, policy_version 107223 (0.0039) [2024-06-28 00:14:43,850][06674] Fps is (10 sec: 44236.2, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1756856320. Throughput: 0: 44448.3. Samples: 1659743900. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 00:14:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:14:44,480][06909] Updated weights for policy 0, policy_version 107233 (0.0034) [2024-06-28 00:14:48,518][06909] Updated weights for policy 0, policy_version 107243 (0.0021) [2024-06-28 00:14:48,851][06674] Fps is (10 sec: 42614.4, 60 sec: 43963.2, 300 sec: 44097.7). Total num frames: 1757085696. Throughput: 0: 44186.6. Samples: 1660003220. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 00:14:48,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:14:52,353][06909] Updated weights for policy 0, policy_version 107253 (0.0030) [2024-06-28 00:14:53,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44509.7, 300 sec: 44042.4). Total num frames: 1757315072. Throughput: 0: 44035.5. Samples: 1660263480. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 00:14:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:14:55,882][06909] Updated weights for policy 0, policy_version 107263 (0.0028) [2024-06-28 00:14:58,852][06674] Fps is (10 sec: 42595.9, 60 sec: 44235.2, 300 sec: 44042.1). Total num frames: 1757511680. Throughput: 0: 43944.7. Samples: 1660400060. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 00:14:58,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:14:59,661][06909] Updated weights for policy 0, policy_version 107273 (0.0031) [2024-06-28 00:15:03,092][06909] Updated weights for policy 0, policy_version 107283 (0.0031) [2024-06-28 00:15:03,850][06674] Fps is (10 sec: 42599.3, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 1757741056. Throughput: 0: 44036.1. Samples: 1660663400. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 00:15:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:15:07,268][06909] Updated weights for policy 0, policy_version 107293 (0.0030) [2024-06-28 00:15:08,850][06674] Fps is (10 sec: 45884.7, 60 sec: 44510.0, 300 sec: 44042.4). Total num frames: 1757970432. Throughput: 0: 43911.1. Samples: 1660924720. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 00:15:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:15:09,823][06887] Signal inference workers to stop experience collection... (23700 times) [2024-06-28 00:15:09,829][06887] Signal inference workers to resume experience collection... (23700 times) [2024-06-28 00:15:09,844][06909] InferenceWorker_p0-w0: stopping experience collection (23700 times) [2024-06-28 00:15:09,845][06909] InferenceWorker_p0-w0: resuming experience collection (23700 times) [2024-06-28 00:15:10,408][06909] Updated weights for policy 0, policy_version 107303 (0.0039) [2024-06-28 00:15:13,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.9, 300 sec: 44098.3). Total num frames: 1758183424. Throughput: 0: 44138.4. Samples: 1661068860. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 00:15:13,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:15:14,478][06909] Updated weights for policy 0, policy_version 107313 (0.0039) [2024-06-28 00:15:18,174][06909] Updated weights for policy 0, policy_version 107323 (0.0036) [2024-06-28 00:15:18,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43417.7, 300 sec: 44042.4). Total num frames: 1758380032. Throughput: 0: 43927.1. Samples: 1661321780. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 00:15:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:15:22,094][06909] Updated weights for policy 0, policy_version 107333 (0.0037) [2024-06-28 00:15:23,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1758642176. Throughput: 0: 43900.2. Samples: 1661584220. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 00:15:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:15:25,845][06909] Updated weights for policy 0, policy_version 107343 (0.0022) [2024-06-28 00:15:28,853][06674] Fps is (10 sec: 45861.3, 60 sec: 43688.5, 300 sec: 44042.0). Total num frames: 1758838784. Throughput: 0: 43948.7. Samples: 1661721720. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 00:15:28,853][06674] Avg episode reward: [(0, '0.413')] [2024-06-28 00:15:29,308][06909] Updated weights for policy 0, policy_version 107353 (0.0028) [2024-06-28 00:15:33,036][06909] Updated weights for policy 0, policy_version 107363 (0.0032) [2024-06-28 00:15:33,850][06674] Fps is (10 sec: 42598.0, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 1759068160. Throughput: 0: 43994.7. Samples: 1661982920. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 00:15:33,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:15:36,865][06909] Updated weights for policy 0, policy_version 107373 (0.0037) [2024-06-28 00:15:38,850][06674] Fps is (10 sec: 45888.5, 60 sec: 43967.5, 300 sec: 44097.9). Total num frames: 1759297536. Throughput: 0: 44038.7. Samples: 1662245220. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 00:15:38,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-28 00:15:40,441][06909] Updated weights for policy 0, policy_version 107383 (0.0034) [2024-06-28 00:15:43,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 1759510528. Throughput: 0: 43988.0. Samples: 1662379440. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 00:15:43,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:15:44,286][06909] Updated weights for policy 0, policy_version 107393 (0.0037) [2024-06-28 00:15:47,563][06909] Updated weights for policy 0, policy_version 107403 (0.0034) [2024-06-28 00:15:48,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43964.8, 300 sec: 44042.7). Total num frames: 1759723520. Throughput: 0: 44081.3. Samples: 1662647060. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 00:15:48,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:15:48,872][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000107405_1759723520.pth... [2024-06-28 00:15:48,931][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000106761_1749172224.pth [2024-06-28 00:15:51,502][06909] Updated weights for policy 0, policy_version 107413 (0.0037) [2024-06-28 00:15:53,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 1759952896. Throughput: 0: 44155.4. Samples: 1662911720. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 00:15:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:15:55,461][06909] Updated weights for policy 0, policy_version 107423 (0.0032) [2024-06-28 00:15:58,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44238.3, 300 sec: 44042.4). Total num frames: 1760165888. Throughput: 0: 43914.9. Samples: 1663045040. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 00:15:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:15:59,077][06909] Updated weights for policy 0, policy_version 107433 (0.0038) [2024-06-28 00:16:03,138][06909] Updated weights for policy 0, policy_version 107443 (0.0035) [2024-06-28 00:16:03,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 1760395264. Throughput: 0: 44069.2. Samples: 1663304900. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 00:16:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:16:06,432][06909] Updated weights for policy 0, policy_version 107453 (0.0037) [2024-06-28 00:16:08,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1760608256. Throughput: 0: 44081.8. Samples: 1663567900. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 00:16:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:16:10,359][06909] Updated weights for policy 0, policy_version 107463 (0.0033) [2024-06-28 00:16:13,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 1760821248. Throughput: 0: 43887.4. Samples: 1663696520. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 00:16:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 00:16:14,288][06909] Updated weights for policy 0, policy_version 107473 (0.0034) [2024-06-28 00:16:17,679][06909] Updated weights for policy 0, policy_version 107483 (0.0039) [2024-06-28 00:16:18,850][06674] Fps is (10 sec: 42597.8, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 1761034240. Throughput: 0: 44136.0. Samples: 1663969040. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 00:16:18,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:16:21,669][06909] Updated weights for policy 0, policy_version 107493 (0.0038) [2024-06-28 00:16:23,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 1761263616. Throughput: 0: 44041.5. Samples: 1664227080. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 00:16:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:16:25,405][06909] Updated weights for policy 0, policy_version 107503 (0.0037) [2024-06-28 00:16:28,850][06674] Fps is (10 sec: 44237.6, 60 sec: 43966.0, 300 sec: 43986.9). Total num frames: 1761476608. Throughput: 0: 44086.9. Samples: 1664363340. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 00:16:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:16:28,924][06909] Updated weights for policy 0, policy_version 107513 (0.0036) [2024-06-28 00:16:32,875][06909] Updated weights for policy 0, policy_version 107523 (0.0032) [2024-06-28 00:16:33,850][06674] Fps is (10 sec: 44235.7, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 1761705984. Throughput: 0: 44042.5. Samples: 1664628980. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 00:16:33,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:16:36,125][06909] Updated weights for policy 0, policy_version 107533 (0.0032) [2024-06-28 00:16:38,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 1761935360. Throughput: 0: 43880.5. Samples: 1664886340. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 00:16:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:16:40,243][06909] Updated weights for policy 0, policy_version 107543 (0.0047) [2024-06-28 00:16:43,401][06909] Updated weights for policy 0, policy_version 107553 (0.0027) [2024-06-28 00:16:43,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1762148352. Throughput: 0: 43898.7. Samples: 1665020480. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 00:16:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:16:47,469][06909] Updated weights for policy 0, policy_version 107563 (0.0031) [2024-06-28 00:16:48,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.7, 300 sec: 44098.3). Total num frames: 1762361344. Throughput: 0: 44061.3. Samples: 1665287660. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 00:16:48,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-28 00:16:51,823][06909] Updated weights for policy 0, policy_version 107573 (0.0032) [2024-06-28 00:16:53,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1762590720. Throughput: 0: 44020.4. Samples: 1665548820. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 00:16:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:16:54,943][06909] Updated weights for policy 0, policy_version 107583 (0.0022) [2024-06-28 00:16:55,117][06887] Signal inference workers to stop experience collection... (23750 times) [2024-06-28 00:16:55,145][06909] InferenceWorker_p0-w0: stopping experience collection (23750 times) [2024-06-28 00:16:55,181][06887] Signal inference workers to resume experience collection... (23750 times) [2024-06-28 00:16:55,182][06909] InferenceWorker_p0-w0: resuming experience collection (23750 times) [2024-06-28 00:16:58,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 43987.3). Total num frames: 1762787328. Throughput: 0: 44195.9. Samples: 1665685340. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 00:16:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 00:16:59,026][06909] Updated weights for policy 0, policy_version 107593 (0.0047) [2024-06-28 00:17:02,585][06909] Updated weights for policy 0, policy_version 107603 (0.0023) [2024-06-28 00:17:03,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 1763033088. Throughput: 0: 44157.4. Samples: 1665956120. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 00:17:03,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-28 00:17:06,283][06909] Updated weights for policy 0, policy_version 107613 (0.0036) [2024-06-28 00:17:08,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1763246080. Throughput: 0: 44124.8. Samples: 1666212700. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 00:17:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:17:10,080][06909] Updated weights for policy 0, policy_version 107623 (0.0029) [2024-06-28 00:17:13,397][06909] Updated weights for policy 0, policy_version 107633 (0.0038) [2024-06-28 00:17:13,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1763475456. Throughput: 0: 44156.8. Samples: 1666350400. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 00:17:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:17:17,544][06909] Updated weights for policy 0, policy_version 107643 (0.0027) [2024-06-28 00:17:18,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 1763688448. Throughput: 0: 44117.1. Samples: 1666614240. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 00:17:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:17:20,865][06909] Updated weights for policy 0, policy_version 107653 (0.0031) [2024-06-28 00:17:23,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 1763917824. Throughput: 0: 44172.9. Samples: 1666874120. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 00:17:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:17:24,801][06909] Updated weights for policy 0, policy_version 107663 (0.0038) [2024-06-28 00:17:28,492][06909] Updated weights for policy 0, policy_version 107673 (0.0031) [2024-06-28 00:17:28,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1764114432. Throughput: 0: 44168.5. Samples: 1667008060. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 00:17:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:17:32,307][06909] Updated weights for policy 0, policy_version 107683 (0.0034) [2024-06-28 00:17:33,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.9, 300 sec: 44098.0). Total num frames: 1764343808. Throughput: 0: 44111.6. Samples: 1667272680. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 00:17:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 00:17:35,912][06909] Updated weights for policy 0, policy_version 107693 (0.0044) [2024-06-28 00:17:38,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1764573184. Throughput: 0: 44129.9. Samples: 1667534660. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 00:17:38,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 00:17:39,855][06909] Updated weights for policy 0, policy_version 107703 (0.0033) [2024-06-28 00:17:43,577][06909] Updated weights for policy 0, policy_version 107713 (0.0031) [2024-06-28 00:17:43,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1764786176. Throughput: 0: 44108.1. Samples: 1667670200. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 00:17:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:17:47,231][06909] Updated weights for policy 0, policy_version 107723 (0.0031) [2024-06-28 00:17:48,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1765015552. Throughput: 0: 44058.7. Samples: 1667938760. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 00:17:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:17:48,861][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000107728_1765015552.pth... [2024-06-28 00:17:48,916][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000107083_1754447872.pth [2024-06-28 00:17:50,763][06909] Updated weights for policy 0, policy_version 107733 (0.0032) [2024-06-28 00:17:50,974][06887] Signal inference workers to stop experience collection... (23800 times) [2024-06-28 00:17:51,010][06909] InferenceWorker_p0-w0: stopping experience collection (23800 times) [2024-06-28 00:17:51,035][06887] Signal inference workers to resume experience collection... (23800 times) [2024-06-28 00:17:51,036][06909] InferenceWorker_p0-w0: resuming experience collection (23800 times) [2024-06-28 00:17:53,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.7, 300 sec: 44042.7). Total num frames: 1765228544. Throughput: 0: 44000.0. Samples: 1668192700. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 00:17:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:17:54,631][06909] Updated weights for policy 0, policy_version 107743 (0.0032) [2024-06-28 00:17:58,213][06909] Updated weights for policy 0, policy_version 107753 (0.0035) [2024-06-28 00:17:58,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 1765457920. Throughput: 0: 43819.6. Samples: 1668322280. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 00:17:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:18:02,172][06909] Updated weights for policy 0, policy_version 107763 (0.0051) [2024-06-28 00:18:03,851][06674] Fps is (10 sec: 44229.8, 60 sec: 43962.6, 300 sec: 44153.3). Total num frames: 1765670912. Throughput: 0: 43956.6. Samples: 1668592360. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 00:18:03,852][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:18:05,853][06909] Updated weights for policy 0, policy_version 107773 (0.0031) [2024-06-28 00:18:08,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44236.7, 300 sec: 44098.0). Total num frames: 1765900288. Throughput: 0: 43989.3. Samples: 1668853640. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 00:18:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:18:09,503][06909] Updated weights for policy 0, policy_version 107783 (0.0021) [2024-06-28 00:18:13,028][06909] Updated weights for policy 0, policy_version 107793 (0.0027) [2024-06-28 00:18:13,850][06674] Fps is (10 sec: 45883.0, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 1766129664. Throughput: 0: 44099.2. Samples: 1668992520. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 00:18:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:18:16,847][06909] Updated weights for policy 0, policy_version 107803 (0.0026) [2024-06-28 00:18:18,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 1766326272. Throughput: 0: 44206.1. Samples: 1669261960. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 00:18:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:18:20,612][06909] Updated weights for policy 0, policy_version 107813 (0.0025) [2024-06-28 00:18:23,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1766539264. Throughput: 0: 44026.1. Samples: 1669515840. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 00:18:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:18:24,407][06909] Updated weights for policy 0, policy_version 107823 (0.0035) [2024-06-28 00:18:27,999][06909] Updated weights for policy 0, policy_version 107833 (0.0030) [2024-06-28 00:18:28,850][06674] Fps is (10 sec: 45874.5, 60 sec: 44509.7, 300 sec: 44097.9). Total num frames: 1766785024. Throughput: 0: 44045.0. Samples: 1669652240. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 00:18:28,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:18:31,791][06909] Updated weights for policy 0, policy_version 107843 (0.0026) [2024-06-28 00:18:33,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 1766981632. Throughput: 0: 43977.3. Samples: 1669917740. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 00:18:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:18:35,457][06909] Updated weights for policy 0, policy_version 107853 (0.0035) [2024-06-28 00:18:38,850][06674] Fps is (10 sec: 42599.2, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 1767211008. Throughput: 0: 44153.8. Samples: 1670179620. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 00:18:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:18:39,042][06909] Updated weights for policy 0, policy_version 107863 (0.0033) [2024-06-28 00:18:42,818][06909] Updated weights for policy 0, policy_version 107873 (0.0030) [2024-06-28 00:18:43,850][06674] Fps is (10 sec: 47513.3, 60 sec: 44509.7, 300 sec: 44098.1). Total num frames: 1767456768. Throughput: 0: 44293.3. Samples: 1670315480. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 00:18:43,859][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:18:46,411][06909] Updated weights for policy 0, policy_version 107883 (0.0024) [2024-06-28 00:18:48,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43963.6, 300 sec: 44097.9). Total num frames: 1767653376. Throughput: 0: 44250.7. Samples: 1670583580. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 00:18:48,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:18:50,150][06909] Updated weights for policy 0, policy_version 107893 (0.0037) [2024-06-28 00:18:53,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 1767866368. Throughput: 0: 44365.8. Samples: 1670850100. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 00:18:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:18:53,893][06909] Updated weights for policy 0, policy_version 107903 (0.0043) [2024-06-28 00:18:57,563][06909] Updated weights for policy 0, policy_version 107913 (0.0026) [2024-06-28 00:18:58,850][06674] Fps is (10 sec: 47514.5, 60 sec: 44509.9, 300 sec: 44209.0). Total num frames: 1768128512. Throughput: 0: 44307.9. Samples: 1670986380. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 00:18:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:19:01,218][06909] Updated weights for policy 0, policy_version 107923 (0.0032) [2024-06-28 00:19:03,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44238.0, 300 sec: 44153.5). Total num frames: 1768325120. Throughput: 0: 44205.4. Samples: 1671251200. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 00:19:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:19:04,861][06909] Updated weights for policy 0, policy_version 107933 (0.0038) [2024-06-28 00:19:08,531][06909] Updated weights for policy 0, policy_version 107943 (0.0036) [2024-06-28 00:19:08,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1768538112. Throughput: 0: 44401.3. Samples: 1671513900. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 00:19:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:19:12,222][06909] Updated weights for policy 0, policy_version 107953 (0.0027) [2024-06-28 00:19:13,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 1768767488. Throughput: 0: 44256.6. Samples: 1671643780. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 00:19:13,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:19:15,125][06887] Signal inference workers to stop experience collection... (23850 times) [2024-06-28 00:19:15,166][06909] InferenceWorker_p0-w0: stopping experience collection (23850 times) [2024-06-28 00:19:15,187][06887] Signal inference workers to resume experience collection... (23850 times) [2024-06-28 00:19:15,189][06909] InferenceWorker_p0-w0: resuming experience collection (23850 times) [2024-06-28 00:19:16,136][06909] Updated weights for policy 0, policy_version 107963 (0.0038) [2024-06-28 00:19:18,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1768980480. Throughput: 0: 44183.0. Samples: 1671905980. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 00:19:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:19:19,861][06909] Updated weights for policy 0, policy_version 107973 (0.0041) [2024-06-28 00:19:23,473][06909] Updated weights for policy 0, policy_version 107983 (0.0042) [2024-06-28 00:19:23,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 1769209856. Throughput: 0: 44256.0. Samples: 1672171140. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 00:19:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:19:27,297][06909] Updated weights for policy 0, policy_version 107993 (0.0034) [2024-06-28 00:19:28,850][06674] Fps is (10 sec: 45875.8, 60 sec: 44237.0, 300 sec: 44153.5). Total num frames: 1769439232. Throughput: 0: 44179.2. Samples: 1672303540. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 00:19:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:19:30,861][06909] Updated weights for policy 0, policy_version 108003 (0.0023) [2024-06-28 00:19:33,850][06674] Fps is (10 sec: 42598.2, 60 sec: 44236.8, 300 sec: 43987.6). Total num frames: 1769635840. Throughput: 0: 44114.3. Samples: 1672568720. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 00:19:33,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 00:19:34,696][06909] Updated weights for policy 0, policy_version 108013 (0.0037) [2024-06-28 00:19:38,131][06909] Updated weights for policy 0, policy_version 108023 (0.0035) [2024-06-28 00:19:38,850][06674] Fps is (10 sec: 40959.4, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 1769848832. Throughput: 0: 44121.7. Samples: 1672835580. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 00:19:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:19:42,121][06909] Updated weights for policy 0, policy_version 108033 (0.0036) [2024-06-28 00:19:43,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.8, 300 sec: 44098.2). Total num frames: 1770094592. Throughput: 0: 44053.7. Samples: 1672968800. Policy #0 lag: (min: 1.0, avg: 11.4, max: 22.0) [2024-06-28 00:19:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:19:45,804][06909] Updated weights for policy 0, policy_version 108043 (0.0042) [2024-06-28 00:19:48,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43963.9, 300 sec: 43986.9). Total num frames: 1770291200. Throughput: 0: 43814.7. Samples: 1673222860. Policy #0 lag: (min: 1.0, avg: 11.4, max: 22.0) [2024-06-28 00:19:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 00:19:48,859][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000108050_1770291200.pth... [2024-06-28 00:19:48,909][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000107405_1759723520.pth [2024-06-28 00:19:49,644][06909] Updated weights for policy 0, policy_version 108053 (0.0022) [2024-06-28 00:19:53,434][06909] Updated weights for policy 0, policy_version 108063 (0.0030) [2024-06-28 00:19:53,850][06674] Fps is (10 sec: 42598.8, 60 sec: 44236.9, 300 sec: 44098.3). Total num frames: 1770520576. Throughput: 0: 44073.5. Samples: 1673497200. Policy #0 lag: (min: 1.0, avg: 11.4, max: 22.0) [2024-06-28 00:19:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:19:57,126][06909] Updated weights for policy 0, policy_version 108073 (0.0032) [2024-06-28 00:19:58,856][06674] Fps is (10 sec: 45847.3, 60 sec: 43686.2, 300 sec: 44097.0). Total num frames: 1770749952. Throughput: 0: 44102.6. Samples: 1673628660. Policy #0 lag: (min: 1.0, avg: 11.4, max: 22.0) [2024-06-28 00:19:58,856][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:20:00,658][06909] Updated weights for policy 0, policy_version 108083 (0.0031) [2024-06-28 00:20:03,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 1770946560. Throughput: 0: 44052.5. Samples: 1673888340. Policy #0 lag: (min: 1.0, avg: 11.4, max: 22.0) [2024-06-28 00:20:03,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-28 00:20:04,602][06909] Updated weights for policy 0, policy_version 108093 (0.0029) [2024-06-28 00:20:07,918][06909] Updated weights for policy 0, policy_version 108103 (0.0033) [2024-06-28 00:20:08,850][06674] Fps is (10 sec: 44263.0, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 1771192320. Throughput: 0: 44055.9. Samples: 1674153660. Policy #0 lag: (min: 1.0, avg: 11.4, max: 22.0) [2024-06-28 00:20:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:20:11,749][06909] Updated weights for policy 0, policy_version 108113 (0.0021) [2024-06-28 00:20:13,850][06674] Fps is (10 sec: 47513.9, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 1771421696. Throughput: 0: 44167.1. Samples: 1674291060. Policy #0 lag: (min: 1.0, avg: 11.4, max: 22.0) [2024-06-28 00:20:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 00:20:15,388][06909] Updated weights for policy 0, policy_version 108123 (0.0033) [2024-06-28 00:20:18,850][06674] Fps is (10 sec: 42599.2, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1771618304. Throughput: 0: 44059.6. Samples: 1674551400. Policy #0 lag: (min: 1.0, avg: 11.4, max: 22.0) [2024-06-28 00:20:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:20:19,242][06909] Updated weights for policy 0, policy_version 108133 (0.0028) [2024-06-28 00:20:23,226][06909] Updated weights for policy 0, policy_version 108143 (0.0026) [2024-06-28 00:20:23,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.8, 300 sec: 44098.4). Total num frames: 1771847680. Throughput: 0: 44038.4. Samples: 1674817300. Policy #0 lag: (min: 1.0, avg: 11.4, max: 22.0) [2024-06-28 00:20:23,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:20:26,800][06909] Updated weights for policy 0, policy_version 108153 (0.0041) [2024-06-28 00:20:28,853][06674] Fps is (10 sec: 44222.5, 60 sec: 43688.3, 300 sec: 44041.9). Total num frames: 1772060672. Throughput: 0: 43862.2. Samples: 1674942740. Policy #0 lag: (min: 1.0, avg: 11.4, max: 22.0) [2024-06-28 00:20:28,854][06674] Avg episode reward: [(0, '0.459')] [2024-06-28 00:20:30,567][06909] Updated weights for policy 0, policy_version 108163 (0.0035) [2024-06-28 00:20:33,856][06674] Fps is (10 sec: 44209.9, 60 sec: 44232.4, 300 sec: 44041.5). Total num frames: 1772290048. Throughput: 0: 44150.9. Samples: 1675209920. Policy #0 lag: (min: 1.0, avg: 11.4, max: 22.0) [2024-06-28 00:20:33,856][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:20:34,317][06909] Updated weights for policy 0, policy_version 108173 (0.0027) [2024-06-28 00:20:38,021][06909] Updated weights for policy 0, policy_version 108183 (0.0033) [2024-06-28 00:20:38,850][06674] Fps is (10 sec: 45889.6, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 1772519424. Throughput: 0: 43875.0. Samples: 1675471580. Policy #0 lag: (min: 1.0, avg: 11.4, max: 22.0) [2024-06-28 00:20:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:20:41,690][06909] Updated weights for policy 0, policy_version 108193 (0.0029) [2024-06-28 00:20:41,851][06887] Signal inference workers to stop experience collection... (23900 times) [2024-06-28 00:20:41,852][06887] Signal inference workers to resume experience collection... (23900 times) [2024-06-28 00:20:41,870][06909] InferenceWorker_p0-w0: stopping experience collection (23900 times) [2024-06-28 00:20:41,870][06909] InferenceWorker_p0-w0: resuming experience collection (23900 times) [2024-06-28 00:20:43,850][06674] Fps is (10 sec: 44263.5, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 1772732416. Throughput: 0: 44089.0. Samples: 1675612400. Policy #0 lag: (min: 1.0, avg: 11.4, max: 22.0) [2024-06-28 00:20:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:20:45,199][06909] Updated weights for policy 0, policy_version 108203 (0.0035) [2024-06-28 00:20:48,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1772945408. Throughput: 0: 44124.5. Samples: 1675873940. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 00:20:48,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:20:48,873][06909] Updated weights for policy 0, policy_version 108213 (0.0035) [2024-06-28 00:20:52,951][06909] Updated weights for policy 0, policy_version 108223 (0.0027) [2024-06-28 00:20:53,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1773174784. Throughput: 0: 43990.0. Samples: 1676133200. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 00:20:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:20:56,459][06909] Updated weights for policy 0, policy_version 108233 (0.0028) [2024-06-28 00:20:58,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43968.2, 300 sec: 44042.4). Total num frames: 1773387776. Throughput: 0: 43919.1. Samples: 1676267420. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 00:20:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:21:00,445][06909] Updated weights for policy 0, policy_version 108243 (0.0035) [2024-06-28 00:21:03,850][06674] Fps is (10 sec: 42598.2, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1773600768. Throughput: 0: 43978.2. Samples: 1676530420. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 00:21:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:21:03,941][06909] Updated weights for policy 0, policy_version 108253 (0.0034) [2024-06-28 00:21:07,797][06909] Updated weights for policy 0, policy_version 108263 (0.0039) [2024-06-28 00:21:08,852][06674] Fps is (10 sec: 44227.8, 60 sec: 43962.3, 300 sec: 44097.6). Total num frames: 1773830144. Throughput: 0: 43901.1. Samples: 1676792940. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 00:21:08,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:21:11,562][06909] Updated weights for policy 0, policy_version 108273 (0.0037) [2024-06-28 00:21:13,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43690.6, 300 sec: 44098.0). Total num frames: 1774043136. Throughput: 0: 44132.4. Samples: 1676928560. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 00:21:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:21:14,965][06909] Updated weights for policy 0, policy_version 108283 (0.0024) [2024-06-28 00:21:18,850][06674] Fps is (10 sec: 42607.1, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1774256128. Throughput: 0: 44109.9. Samples: 1677194600. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 00:21:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:21:18,918][06909] Updated weights for policy 0, policy_version 108293 (0.0028) [2024-06-28 00:21:22,211][06909] Updated weights for policy 0, policy_version 108303 (0.0039) [2024-06-28 00:21:23,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 1774501888. Throughput: 0: 44159.1. Samples: 1677458740. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 00:21:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:21:26,129][06909] Updated weights for policy 0, policy_version 108313 (0.0045) [2024-06-28 00:21:28,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43966.1, 300 sec: 44042.4). Total num frames: 1774698496. Throughput: 0: 43868.9. Samples: 1677586500. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 00:21:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:21:29,862][06909] Updated weights for policy 0, policy_version 108323 (0.0025) [2024-06-28 00:21:33,620][06909] Updated weights for policy 0, policy_version 108333 (0.0034) [2024-06-28 00:21:33,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44241.2, 300 sec: 44097.9). Total num frames: 1774944256. Throughput: 0: 44056.4. Samples: 1677856480. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 00:21:33,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 00:21:37,383][06909] Updated weights for policy 0, policy_version 108343 (0.0038) [2024-06-28 00:21:38,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 1775157248. Throughput: 0: 44059.0. Samples: 1678115860. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 00:21:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:21:41,107][06909] Updated weights for policy 0, policy_version 108353 (0.0027) [2024-06-28 00:21:43,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1775353856. Throughput: 0: 44096.9. Samples: 1678251780. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 00:21:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:21:44,810][06909] Updated weights for policy 0, policy_version 108363 (0.0041) [2024-06-28 00:21:48,394][06909] Updated weights for policy 0, policy_version 108373 (0.0028) [2024-06-28 00:21:48,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1775599616. Throughput: 0: 44001.8. Samples: 1678510500. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2024-06-28 00:21:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:21:48,968][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000108375_1775616000.pth... [2024-06-28 00:21:49,034][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000107728_1765015552.pth [2024-06-28 00:21:52,149][06909] Updated weights for policy 0, policy_version 108383 (0.0023) [2024-06-28 00:21:53,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 1775828992. Throughput: 0: 44027.3. Samples: 1678774080. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2024-06-28 00:21:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:21:55,933][06909] Updated weights for policy 0, policy_version 108393 (0.0036) [2024-06-28 00:21:58,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1776009216. Throughput: 0: 43950.3. Samples: 1678906320. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2024-06-28 00:21:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:21:59,655][06909] Updated weights for policy 0, policy_version 108403 (0.0041) [2024-06-28 00:22:03,241][06909] Updated weights for policy 0, policy_version 108413 (0.0030) [2024-06-28 00:22:03,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 1776271360. Throughput: 0: 44138.1. Samples: 1679180820. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2024-06-28 00:22:03,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:22:06,987][06909] Updated weights for policy 0, policy_version 108423 (0.0044) [2024-06-28 00:22:08,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43965.2, 300 sec: 44042.4). Total num frames: 1776467968. Throughput: 0: 44065.8. Samples: 1679441700. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2024-06-28 00:22:08,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:22:10,829][06909] Updated weights for policy 0, policy_version 108433 (0.0027) [2024-06-28 00:22:13,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1776680960. Throughput: 0: 44089.4. Samples: 1679570520. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2024-06-28 00:22:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:22:14,515][06909] Updated weights for policy 0, policy_version 108443 (0.0031) [2024-06-28 00:22:18,181][06909] Updated weights for policy 0, policy_version 108453 (0.0034) [2024-06-28 00:22:18,484][06887] Signal inference workers to stop experience collection... (23950 times) [2024-06-28 00:22:18,520][06909] InferenceWorker_p0-w0: stopping experience collection (23950 times) [2024-06-28 00:22:18,532][06887] Signal inference workers to resume experience collection... (23950 times) [2024-06-28 00:22:18,539][06909] InferenceWorker_p0-w0: resuming experience collection (23950 times) [2024-06-28 00:22:18,850][06674] Fps is (10 sec: 47513.8, 60 sec: 44782.9, 300 sec: 44153.5). Total num frames: 1776943104. Throughput: 0: 44108.0. Samples: 1679841340. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2024-06-28 00:22:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:22:21,979][06909] Updated weights for policy 0, policy_version 108463 (0.0029) [2024-06-28 00:22:23,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 1777139712. Throughput: 0: 44076.5. Samples: 1680099300. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2024-06-28 00:22:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:22:25,706][06909] Updated weights for policy 0, policy_version 108473 (0.0035) [2024-06-28 00:22:28,852][06674] Fps is (10 sec: 39313.5, 60 sec: 43962.2, 300 sec: 44042.1). Total num frames: 1777336320. Throughput: 0: 43959.3. Samples: 1680230040. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2024-06-28 00:22:28,852][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:22:29,614][06909] Updated weights for policy 0, policy_version 108483 (0.0027) [2024-06-28 00:22:32,959][06909] Updated weights for policy 0, policy_version 108493 (0.0043) [2024-06-28 00:22:33,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 1777582080. Throughput: 0: 44114.2. Samples: 1680495640. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2024-06-28 00:22:33,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 00:22:36,899][06909] Updated weights for policy 0, policy_version 108503 (0.0032) [2024-06-28 00:22:38,852][06674] Fps is (10 sec: 45875.2, 60 sec: 43962.2, 300 sec: 44097.6). Total num frames: 1777795072. Throughput: 0: 44271.3. Samples: 1680766380. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2024-06-28 00:22:38,853][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:22:40,386][06909] Updated weights for policy 0, policy_version 108513 (0.0038) [2024-06-28 00:22:43,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44509.8, 300 sec: 44097.9). Total num frames: 1778024448. Throughput: 0: 44236.8. Samples: 1680896980. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2024-06-28 00:22:43,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:22:44,027][06909] Updated weights for policy 0, policy_version 108523 (0.0037) [2024-06-28 00:22:47,771][06909] Updated weights for policy 0, policy_version 108533 (0.0032) [2024-06-28 00:22:48,850][06674] Fps is (10 sec: 45884.7, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1778253824. Throughput: 0: 44177.0. Samples: 1681168780. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-28 00:22:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:22:51,609][06909] Updated weights for policy 0, policy_version 108543 (0.0033) [2024-06-28 00:22:53,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 1778466816. Throughput: 0: 44149.4. Samples: 1681428420. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-28 00:22:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:22:55,259][06909] Updated weights for policy 0, policy_version 108553 (0.0032) [2024-06-28 00:22:58,798][06909] Updated weights for policy 0, policy_version 108563 (0.0036) [2024-06-28 00:22:58,853][06674] Fps is (10 sec: 44220.6, 60 sec: 44780.2, 300 sec: 44153.2). Total num frames: 1778696192. Throughput: 0: 44175.0. Samples: 1681558560. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-28 00:22:58,854][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:23:02,570][06909] Updated weights for policy 0, policy_version 108573 (0.0027) [2024-06-28 00:23:03,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 1778909184. Throughput: 0: 44220.5. Samples: 1681831260. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-28 00:23:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:23:06,194][06909] Updated weights for policy 0, policy_version 108583 (0.0029) [2024-06-28 00:23:08,850][06674] Fps is (10 sec: 44252.6, 60 sec: 44509.8, 300 sec: 44097.9). Total num frames: 1779138560. Throughput: 0: 44268.8. Samples: 1682091400. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-28 00:23:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:23:09,891][06909] Updated weights for policy 0, policy_version 108593 (0.0031) [2024-06-28 00:23:13,707][06909] Updated weights for policy 0, policy_version 108603 (0.0038) [2024-06-28 00:23:13,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 1779351552. Throughput: 0: 44198.1. Samples: 1682218860. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-28 00:23:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:23:17,451][06909] Updated weights for policy 0, policy_version 108613 (0.0043) [2024-06-28 00:23:18,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 1779580928. Throughput: 0: 44310.7. Samples: 1682489620. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-28 00:23:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:23:20,953][06909] Updated weights for policy 0, policy_version 108623 (0.0042) [2024-06-28 00:23:23,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1779793920. Throughput: 0: 44096.7. Samples: 1682750640. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-28 00:23:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:23:24,934][06909] Updated weights for policy 0, policy_version 108633 (0.0032) [2024-06-28 00:23:28,785][06909] Updated weights for policy 0, policy_version 108643 (0.0035) [2024-06-28 00:23:28,850][06674] Fps is (10 sec: 42598.0, 60 sec: 44511.3, 300 sec: 44153.5). Total num frames: 1780006912. Throughput: 0: 43987.6. Samples: 1682876420. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-28 00:23:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 00:23:32,187][06909] Updated weights for policy 0, policy_version 108653 (0.0033) [2024-06-28 00:23:33,855][06674] Fps is (10 sec: 45850.3, 60 sec: 44505.8, 300 sec: 44208.2). Total num frames: 1780252672. Throughput: 0: 43916.1. Samples: 1683145240. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-28 00:23:33,856][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:23:35,979][06909] Updated weights for policy 0, policy_version 108663 (0.0025) [2024-06-28 00:23:38,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44238.3, 300 sec: 44042.4). Total num frames: 1780449280. Throughput: 0: 44123.4. Samples: 1683413980. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-28 00:23:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:23:39,838][06909] Updated weights for policy 0, policy_version 108673 (0.0040) [2024-06-28 00:23:43,469][06909] Updated weights for policy 0, policy_version 108683 (0.0026) [2024-06-28 00:23:43,850][06674] Fps is (10 sec: 42621.2, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1780678656. Throughput: 0: 44138.6. Samples: 1683544640. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-28 00:23:43,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-28 00:23:46,900][06887] Signal inference workers to stop experience collection... (24000 times) [2024-06-28 00:23:46,950][06909] InferenceWorker_p0-w0: stopping experience collection (24000 times) [2024-06-28 00:23:47,017][06887] Signal inference workers to resume experience collection... (24000 times) [2024-06-28 00:23:47,018][06909] InferenceWorker_p0-w0: resuming experience collection (24000 times) [2024-06-28 00:23:47,179][06909] Updated weights for policy 0, policy_version 108693 (0.0036) [2024-06-28 00:23:48,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.6, 300 sec: 44153.5). Total num frames: 1780891648. Throughput: 0: 43939.4. Samples: 1683808540. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 00:23:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:23:48,978][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000108698_1780908032.pth... [2024-06-28 00:23:49,027][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000108050_1770291200.pth [2024-06-28 00:23:50,518][06909] Updated weights for policy 0, policy_version 108703 (0.0040) [2024-06-28 00:23:53,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 1781104640. Throughput: 0: 44240.0. Samples: 1684082200. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 00:23:53,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:23:54,419][06909] Updated weights for policy 0, policy_version 108713 (0.0029) [2024-06-28 00:23:57,953][06909] Updated weights for policy 0, policy_version 108723 (0.0037) [2024-06-28 00:23:58,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43966.4, 300 sec: 44098.0). Total num frames: 1781334016. Throughput: 0: 44369.3. Samples: 1684215480. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 00:23:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:24:01,911][06909] Updated weights for policy 0, policy_version 108733 (0.0024) [2024-06-28 00:24:03,850][06674] Fps is (10 sec: 45875.8, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1781563392. Throughput: 0: 44048.9. Samples: 1684471820. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 00:24:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:24:05,771][06909] Updated weights for policy 0, policy_version 108743 (0.0032) [2024-06-28 00:24:08,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1781760000. Throughput: 0: 44271.6. Samples: 1684742860. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 00:24:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:24:09,247][06909] Updated weights for policy 0, policy_version 108753 (0.0026) [2024-06-28 00:24:12,881][06909] Updated weights for policy 0, policy_version 108763 (0.0028) [2024-06-28 00:24:13,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 1782005760. Throughput: 0: 44449.8. Samples: 1684876660. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 00:24:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:24:16,724][06909] Updated weights for policy 0, policy_version 108773 (0.0030) [2024-06-28 00:24:18,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 1782218752. Throughput: 0: 44215.1. Samples: 1685134680. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 00:24:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:24:20,363][06909] Updated weights for policy 0, policy_version 108783 (0.0028) [2024-06-28 00:24:23,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 1782415360. Throughput: 0: 44326.7. Samples: 1685408680. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 00:24:23,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:24:24,218][06909] Updated weights for policy 0, policy_version 108793 (0.0035) [2024-06-28 00:24:27,917][06909] Updated weights for policy 0, policy_version 108803 (0.0026) [2024-06-28 00:24:28,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1782661120. Throughput: 0: 44229.8. Samples: 1685534980. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 00:24:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:24:31,718][06909] Updated weights for policy 0, policy_version 108813 (0.0041) [2024-06-28 00:24:33,850][06674] Fps is (10 sec: 47514.1, 60 sec: 43967.7, 300 sec: 44209.1). Total num frames: 1782890496. Throughput: 0: 44229.5. Samples: 1685798860. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 00:24:33,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-28 00:24:35,119][06909] Updated weights for policy 0, policy_version 108823 (0.0035) [2024-06-28 00:24:38,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1783087104. Throughput: 0: 44112.5. Samples: 1686067260. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 00:24:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:24:39,042][06909] Updated weights for policy 0, policy_version 108833 (0.0033) [2024-06-28 00:24:42,666][06909] Updated weights for policy 0, policy_version 108843 (0.0038) [2024-06-28 00:24:43,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 1783316480. Throughput: 0: 44009.2. Samples: 1686195900. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 00:24:43,851][06674] Avg episode reward: [(0, '0.434')] [2024-06-28 00:24:46,564][06909] Updated weights for policy 0, policy_version 108853 (0.0039) [2024-06-28 00:24:48,850][06674] Fps is (10 sec: 47513.4, 60 sec: 44509.9, 300 sec: 44209.0). Total num frames: 1783562240. Throughput: 0: 44131.9. Samples: 1686457760. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:24:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 00:24:50,083][06909] Updated weights for policy 0, policy_version 108863 (0.0036) [2024-06-28 00:24:53,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.9, 300 sec: 44098.9). Total num frames: 1783758848. Throughput: 0: 44111.5. Samples: 1686727880. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:24:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:24:54,104][06909] Updated weights for policy 0, policy_version 108873 (0.0031) [2024-06-28 00:24:57,428][06909] Updated weights for policy 0, policy_version 108883 (0.0028) [2024-06-28 00:24:58,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 1783971840. Throughput: 0: 43953.0. Samples: 1686854540. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:24:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:25:01,554][06909] Updated weights for policy 0, policy_version 108893 (0.0049) [2024-06-28 00:25:02,391][06887] Signal inference workers to stop experience collection... (24050 times) [2024-06-28 00:25:02,439][06909] InferenceWorker_p0-w0: stopping experience collection (24050 times) [2024-06-28 00:25:02,446][06887] Signal inference workers to resume experience collection... (24050 times) [2024-06-28 00:25:02,456][06909] InferenceWorker_p0-w0: resuming experience collection (24050 times) [2024-06-28 00:25:03,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 1784201216. Throughput: 0: 44156.8. Samples: 1687121740. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:25:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:25:04,779][06909] Updated weights for policy 0, policy_version 108903 (0.0024) [2024-06-28 00:25:08,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1784414208. Throughput: 0: 44125.4. Samples: 1687394320. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:25:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:25:08,913][06909] Updated weights for policy 0, policy_version 108913 (0.0028) [2024-06-28 00:25:12,052][06909] Updated weights for policy 0, policy_version 108923 (0.0037) [2024-06-28 00:25:13,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.7, 300 sec: 44097.9). Total num frames: 1784627200. Throughput: 0: 44255.6. Samples: 1687526480. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:25:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 00:25:16,090][06909] Updated weights for policy 0, policy_version 108933 (0.0032) [2024-06-28 00:25:18,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 1784872960. Throughput: 0: 44256.8. Samples: 1687790420. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:25:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:25:19,465][06909] Updated weights for policy 0, policy_version 108943 (0.0039) [2024-06-28 00:25:23,414][06909] Updated weights for policy 0, policy_version 108953 (0.0029) [2024-06-28 00:25:23,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44509.8, 300 sec: 44154.0). Total num frames: 1785085952. Throughput: 0: 44135.5. Samples: 1688053360. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:25:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:25:26,899][06909] Updated weights for policy 0, policy_version 108963 (0.0027) [2024-06-28 00:25:28,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 44098.8). Total num frames: 1785298944. Throughput: 0: 44142.7. Samples: 1688182320. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:25:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:25:31,209][06909] Updated weights for policy 0, policy_version 108973 (0.0035) [2024-06-28 00:25:33,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1785544704. Throughput: 0: 44205.4. Samples: 1688447000. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:25:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:25:34,505][06909] Updated weights for policy 0, policy_version 108983 (0.0035) [2024-06-28 00:25:38,481][06909] Updated weights for policy 0, policy_version 108993 (0.0030) [2024-06-28 00:25:38,855][06674] Fps is (10 sec: 45850.3, 60 sec: 44505.8, 300 sec: 44152.7). Total num frames: 1785757696. Throughput: 0: 44199.1. Samples: 1688717080. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:25:38,856][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:25:41,679][06909] Updated weights for policy 0, policy_version 109003 (0.0032) [2024-06-28 00:25:43,850][06674] Fps is (10 sec: 42598.0, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1785970688. Throughput: 0: 44221.6. Samples: 1688844520. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:25:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:25:45,830][06909] Updated weights for policy 0, policy_version 109013 (0.0027) [2024-06-28 00:25:48,850][06674] Fps is (10 sec: 45900.2, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 1786216448. Throughput: 0: 44259.6. Samples: 1689113420. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-28 00:25:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:25:48,860][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000109022_1786216448.pth... [2024-06-28 00:25:48,919][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000108375_1775616000.pth [2024-06-28 00:25:49,178][06909] Updated weights for policy 0, policy_version 109023 (0.0022) [2024-06-28 00:25:53,216][06909] Updated weights for policy 0, policy_version 109033 (0.0020) [2024-06-28 00:25:53,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 1786396672. Throughput: 0: 44122.6. Samples: 1689379840. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-28 00:25:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:25:56,399][06909] Updated weights for policy 0, policy_version 109043 (0.0043) [2024-06-28 00:25:58,850][06674] Fps is (10 sec: 40959.5, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 1786626048. Throughput: 0: 44014.5. Samples: 1689507140. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-28 00:25:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:26:00,497][06909] Updated weights for policy 0, policy_version 109053 (0.0030) [2024-06-28 00:26:03,834][06909] Updated weights for policy 0, policy_version 109063 (0.0036) [2024-06-28 00:26:03,850][06674] Fps is (10 sec: 49151.8, 60 sec: 44782.9, 300 sec: 44264.9). Total num frames: 1786888192. Throughput: 0: 44134.2. Samples: 1689776460. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-28 00:26:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:26:08,212][06909] Updated weights for policy 0, policy_version 109073 (0.0036) [2024-06-28 00:26:08,852][06674] Fps is (10 sec: 45866.5, 60 sec: 44508.3, 300 sec: 44208.7). Total num frames: 1787084800. Throughput: 0: 44288.3. Samples: 1690046420. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-28 00:26:08,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:26:11,322][06909] Updated weights for policy 0, policy_version 109083 (0.0031) [2024-06-28 00:26:13,850][06674] Fps is (10 sec: 39321.8, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1787281408. Throughput: 0: 44142.7. Samples: 1690168740. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-28 00:26:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 00:26:15,746][06909] Updated weights for policy 0, policy_version 109093 (0.0028) [2024-06-28 00:26:18,783][06909] Updated weights for policy 0, policy_version 109103 (0.0025) [2024-06-28 00:26:18,850][06674] Fps is (10 sec: 45883.9, 60 sec: 44509.8, 300 sec: 44209.0). Total num frames: 1787543552. Throughput: 0: 44324.8. Samples: 1690441620. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-28 00:26:18,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 00:26:23,010][06909] Updated weights for policy 0, policy_version 109113 (0.0039) [2024-06-28 00:26:23,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 1787723776. Throughput: 0: 44195.2. Samples: 1690705620. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-28 00:26:23,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:26:26,185][06909] Updated weights for policy 0, policy_version 109123 (0.0024) [2024-06-28 00:26:28,850][06674] Fps is (10 sec: 40960.5, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1787953152. Throughput: 0: 44069.4. Samples: 1690827640. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-28 00:26:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:26:30,313][06909] Updated weights for policy 0, policy_version 109133 (0.0036) [2024-06-28 00:26:33,607][06909] Updated weights for policy 0, policy_version 109143 (0.0034) [2024-06-28 00:26:33,856][06674] Fps is (10 sec: 47484.9, 60 sec: 44232.4, 300 sec: 44208.1). Total num frames: 1788198912. Throughput: 0: 44020.3. Samples: 1691094600. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-28 00:26:33,856][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 00:26:37,436][06887] Signal inference workers to stop experience collection... (24100 times) [2024-06-28 00:26:37,437][06887] Signal inference workers to resume experience collection... (24100 times) [2024-06-28 00:26:37,498][06909] InferenceWorker_p0-w0: stopping experience collection (24100 times) [2024-06-28 00:26:37,498][06909] InferenceWorker_p0-w0: resuming experience collection (24100 times) [2024-06-28 00:26:38,590][06909] Updated weights for policy 0, policy_version 109153 (0.0038) [2024-06-28 00:26:38,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43694.6, 300 sec: 44153.5). Total num frames: 1788379136. Throughput: 0: 44035.1. Samples: 1691361420. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-28 00:26:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:26:41,267][06909] Updated weights for policy 0, policy_version 109163 (0.0024) [2024-06-28 00:26:43,850][06674] Fps is (10 sec: 40984.3, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 1788608512. Throughput: 0: 43953.8. Samples: 1691485060. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-28 00:26:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:26:45,811][06909] Updated weights for policy 0, policy_version 109173 (0.0042) [2024-06-28 00:26:48,674][06909] Updated weights for policy 0, policy_version 109183 (0.0023) [2024-06-28 00:26:48,850][06674] Fps is (10 sec: 47513.8, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 1788854272. Throughput: 0: 43944.5. Samples: 1691753960. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-28 00:26:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:26:53,010][06909] Updated weights for policy 0, policy_version 109193 (0.0035) [2024-06-28 00:26:53,850][06674] Fps is (10 sec: 44237.6, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 1789050880. Throughput: 0: 43949.6. Samples: 1692024060. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-28 00:26:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:26:56,192][06909] Updated weights for policy 0, policy_version 109203 (0.0040) [2024-06-28 00:26:58,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43963.9, 300 sec: 44042.4). Total num frames: 1789263872. Throughput: 0: 44088.5. Samples: 1692152720. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-28 00:26:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:27:00,281][06909] Updated weights for policy 0, policy_version 109213 (0.0037) [2024-06-28 00:27:03,749][06909] Updated weights for policy 0, policy_version 109223 (0.0029) [2024-06-28 00:27:03,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43690.7, 300 sec: 44209.0). Total num frames: 1789509632. Throughput: 0: 44019.8. Samples: 1692422500. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-28 00:27:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:27:07,476][06909] Updated weights for policy 0, policy_version 109233 (0.0033) [2024-06-28 00:27:08,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43692.2, 300 sec: 44153.5). Total num frames: 1789706240. Throughput: 0: 44073.9. Samples: 1692688940. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-28 00:27:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:27:11,109][06909] Updated weights for policy 0, policy_version 109243 (0.0028) [2024-06-28 00:27:13,850][06674] Fps is (10 sec: 42597.7, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 1789935616. Throughput: 0: 44224.8. Samples: 1692817760. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-28 00:27:13,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:27:15,454][06909] Updated weights for policy 0, policy_version 109253 (0.0035) [2024-06-28 00:27:18,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43417.7, 300 sec: 44098.0). Total num frames: 1790148608. Throughput: 0: 44172.1. Samples: 1693082080. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-28 00:27:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:27:18,913][06909] Updated weights for policy 0, policy_version 109263 (0.0035) [2024-06-28 00:27:22,759][06909] Updated weights for policy 0, policy_version 109273 (0.0038) [2024-06-28 00:27:23,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.8, 300 sec: 44209.3). Total num frames: 1790377984. Throughput: 0: 44098.2. Samples: 1693345840. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-28 00:27:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 00:27:26,304][06909] Updated weights for policy 0, policy_version 109283 (0.0032) [2024-06-28 00:27:28,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 1790590976. Throughput: 0: 44238.7. Samples: 1693475800. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-28 00:27:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:27:29,899][06909] Updated weights for policy 0, policy_version 109293 (0.0031) [2024-06-28 00:27:33,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43421.9, 300 sec: 44098.3). Total num frames: 1790803968. Throughput: 0: 44085.2. Samples: 1693737800. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-28 00:27:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:27:33,895][06909] Updated weights for policy 0, policy_version 109303 (0.0034) [2024-06-28 00:27:37,516][06909] Updated weights for policy 0, policy_version 109313 (0.0028) [2024-06-28 00:27:38,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 1791049728. Throughput: 0: 44048.8. Samples: 1694006260. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-28 00:27:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:27:41,074][06909] Updated weights for policy 0, policy_version 109323 (0.0020) [2024-06-28 00:27:43,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 1791262720. Throughput: 0: 44198.1. Samples: 1694141640. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-28 00:27:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 00:27:44,660][06909] Updated weights for policy 0, policy_version 109333 (0.0040) [2024-06-28 00:27:48,262][06909] Updated weights for policy 0, policy_version 109343 (0.0032) [2024-06-28 00:27:48,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.6, 300 sec: 44097.9). Total num frames: 1791475712. Throughput: 0: 44063.9. Samples: 1694405380. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-28 00:27:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:27:48,995][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000109344_1791492096.pth... [2024-06-28 00:27:49,045][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000108698_1780908032.pth [2024-06-28 00:27:52,425][06909] Updated weights for policy 0, policy_version 109353 (0.0022) [2024-06-28 00:27:53,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.7, 300 sec: 44098.5). Total num frames: 1791705088. Throughput: 0: 44144.3. Samples: 1694675440. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 00:27:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:27:55,370][06909] Updated weights for policy 0, policy_version 109363 (0.0031) [2024-06-28 00:27:58,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1791918080. Throughput: 0: 44297.5. Samples: 1694811140. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 00:27:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:27:59,552][06909] Updated weights for policy 0, policy_version 109373 (0.0027) [2024-06-28 00:28:03,393][06909] Updated weights for policy 0, policy_version 109383 (0.0031) [2024-06-28 00:28:03,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 1792131072. Throughput: 0: 44188.9. Samples: 1695070580. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 00:28:03,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 00:28:04,381][06887] Signal inference workers to stop experience collection... (24150 times) [2024-06-28 00:28:04,382][06887] Signal inference workers to resume experience collection... (24150 times) [2024-06-28 00:28:04,425][06909] InferenceWorker_p0-w0: stopping experience collection (24150 times) [2024-06-28 00:28:04,425][06909] InferenceWorker_p0-w0: resuming experience collection (24150 times) [2024-06-28 00:28:06,650][06909] Updated weights for policy 0, policy_version 109393 (0.0038) [2024-06-28 00:28:08,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 1792376832. Throughput: 0: 44365.0. Samples: 1695342260. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 00:28:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:28:10,828][06909] Updated weights for policy 0, policy_version 109403 (0.0037) [2024-06-28 00:28:13,850][06674] Fps is (10 sec: 47513.7, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 1792606208. Throughput: 0: 44473.4. Samples: 1695477100. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 00:28:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:28:14,311][06909] Updated weights for policy 0, policy_version 109413 (0.0033) [2024-06-28 00:28:17,986][06909] Updated weights for policy 0, policy_version 109423 (0.0031) [2024-06-28 00:28:18,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 1792819200. Throughput: 0: 44454.3. Samples: 1695738240. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 00:28:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 00:28:21,583][06909] Updated weights for policy 0, policy_version 109433 (0.0035) [2024-06-28 00:28:23,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44509.8, 300 sec: 44209.0). Total num frames: 1793048576. Throughput: 0: 44423.0. Samples: 1696005300. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 00:28:23,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:28:25,297][06909] Updated weights for policy 0, policy_version 109443 (0.0026) [2024-06-28 00:28:28,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44509.9, 300 sec: 44098.8). Total num frames: 1793261568. Throughput: 0: 44456.5. Samples: 1696142180. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 00:28:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:28:29,268][06909] Updated weights for policy 0, policy_version 109453 (0.0041) [2024-06-28 00:28:32,869][06909] Updated weights for policy 0, policy_version 109463 (0.0028) [2024-06-28 00:28:33,850][06674] Fps is (10 sec: 42598.8, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 1793474560. Throughput: 0: 44452.4. Samples: 1696405740. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 00:28:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:28:36,481][06909] Updated weights for policy 0, policy_version 109473 (0.0040) [2024-06-28 00:28:38,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44509.9, 300 sec: 44209.0). Total num frames: 1793720320. Throughput: 0: 44276.9. Samples: 1696667900. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 00:28:38,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 00:28:40,656][06909] Updated weights for policy 0, policy_version 109483 (0.0037) [2024-06-28 00:28:43,675][06909] Updated weights for policy 0, policy_version 109493 (0.0026) [2024-06-28 00:28:43,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44509.8, 300 sec: 44209.0). Total num frames: 1793933312. Throughput: 0: 44389.6. Samples: 1696808680. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 00:28:43,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-28 00:28:47,886][06909] Updated weights for policy 0, policy_version 109503 (0.0030) [2024-06-28 00:28:48,850][06674] Fps is (10 sec: 42598.7, 60 sec: 44509.9, 300 sec: 44209.1). Total num frames: 1794146304. Throughput: 0: 44417.4. Samples: 1697069360. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 00:28:48,855][06674] Avg episode reward: [(0, '0.418')] [2024-06-28 00:28:50,853][06909] Updated weights for policy 0, policy_version 109513 (0.0034) [2024-06-28 00:28:53,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44509.9, 300 sec: 44209.0). Total num frames: 1794375680. Throughput: 0: 44208.8. Samples: 1697331660. Policy #0 lag: (min: 0.0, avg: 11.7, max: 21.0) [2024-06-28 00:28:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:28:55,462][06909] Updated weights for policy 0, policy_version 109523 (0.0031) [2024-06-28 00:28:58,609][06909] Updated weights for policy 0, policy_version 109533 (0.0029) [2024-06-28 00:28:58,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 1794588672. Throughput: 0: 43969.8. Samples: 1697455740. Policy #0 lag: (min: 0.0, avg: 11.7, max: 21.0) [2024-06-28 00:28:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:29:02,830][06909] Updated weights for policy 0, policy_version 109543 (0.0031) [2024-06-28 00:29:03,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44509.9, 300 sec: 44209.0). Total num frames: 1794801664. Throughput: 0: 44197.3. Samples: 1697727120. Policy #0 lag: (min: 0.0, avg: 11.7, max: 21.0) [2024-06-28 00:29:03,850][06674] Avg episode reward: [(0, '0.506')] [2024-06-28 00:29:03,923][06887] Saving new best policy, reward=0.506! [2024-06-28 00:29:06,055][06909] Updated weights for policy 0, policy_version 109553 (0.0034) [2024-06-28 00:29:08,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44509.9, 300 sec: 44209.0). Total num frames: 1795047424. Throughput: 0: 44120.2. Samples: 1697990700. Policy #0 lag: (min: 0.0, avg: 11.7, max: 21.0) [2024-06-28 00:29:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:29:10,328][06909] Updated weights for policy 0, policy_version 109563 (0.0032) [2024-06-28 00:29:13,497][06909] Updated weights for policy 0, policy_version 109573 (0.0023) [2024-06-28 00:29:13,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 1795244032. Throughput: 0: 44047.2. Samples: 1698124300. Policy #0 lag: (min: 0.0, avg: 11.7, max: 21.0) [2024-06-28 00:29:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:29:17,506][06909] Updated weights for policy 0, policy_version 109583 (0.0025) [2024-06-28 00:29:18,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43963.8, 300 sec: 44209.0). Total num frames: 1795457024. Throughput: 0: 44048.1. Samples: 1698387900. Policy #0 lag: (min: 0.0, avg: 11.7, max: 21.0) [2024-06-28 00:29:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:29:20,923][06909] Updated weights for policy 0, policy_version 109593 (0.0028) [2024-06-28 00:29:23,850][06674] Fps is (10 sec: 44236.0, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 1795686400. Throughput: 0: 44048.8. Samples: 1698650100. Policy #0 lag: (min: 0.0, avg: 11.7, max: 21.0) [2024-06-28 00:29:23,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:29:25,001][06909] Updated weights for policy 0, policy_version 109603 (0.0044) [2024-06-28 00:29:28,316][06909] Updated weights for policy 0, policy_version 109613 (0.0027) [2024-06-28 00:29:28,850][06674] Fps is (10 sec: 47513.2, 60 sec: 44509.8, 300 sec: 44209.0). Total num frames: 1795932160. Throughput: 0: 43890.3. Samples: 1698783740. Policy #0 lag: (min: 0.0, avg: 11.7, max: 21.0) [2024-06-28 00:29:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:29:32,509][06909] Updated weights for policy 0, policy_version 109623 (0.0050) [2024-06-28 00:29:33,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 1796112384. Throughput: 0: 43980.4. Samples: 1699048480. Policy #0 lag: (min: 0.0, avg: 11.7, max: 21.0) [2024-06-28 00:29:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:29:35,935][06909] Updated weights for policy 0, policy_version 109633 (0.0032) [2024-06-28 00:29:38,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 1796341760. Throughput: 0: 43940.5. Samples: 1699308980. Policy #0 lag: (min: 0.0, avg: 11.7, max: 21.0) [2024-06-28 00:29:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:29:39,740][06909] Updated weights for policy 0, policy_version 109643 (0.0038) [2024-06-28 00:29:40,071][06887] Signal inference workers to stop experience collection... (24200 times) [2024-06-28 00:29:40,074][06887] Signal inference workers to resume experience collection... (24200 times) [2024-06-28 00:29:40,105][06909] InferenceWorker_p0-w0: stopping experience collection (24200 times) [2024-06-28 00:29:40,105][06909] InferenceWorker_p0-w0: resuming experience collection (24200 times) [2024-06-28 00:29:43,437][06909] Updated weights for policy 0, policy_version 109653 (0.0032) [2024-06-28 00:29:43,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 1796571136. Throughput: 0: 44253.7. Samples: 1699447160. Policy #0 lag: (min: 0.0, avg: 11.7, max: 21.0) [2024-06-28 00:29:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:29:47,097][06909] Updated weights for policy 0, policy_version 109663 (0.0031) [2024-06-28 00:29:48,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 1796784128. Throughput: 0: 44048.9. Samples: 1699709320. Policy #0 lag: (min: 0.0, avg: 11.7, max: 21.0) [2024-06-28 00:29:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:29:48,866][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000109667_1796784128.pth... [2024-06-28 00:29:48,925][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000109022_1786216448.pth [2024-06-28 00:29:51,017][06909] Updated weights for policy 0, policy_version 109673 (0.0036) [2024-06-28 00:29:53,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.8, 300 sec: 44209.0). Total num frames: 1797013504. Throughput: 0: 43964.9. Samples: 1699969120. Policy #0 lag: (min: 1.0, avg: 11.6, max: 21.0) [2024-06-28 00:29:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:29:54,743][06909] Updated weights for policy 0, policy_version 109683 (0.0031) [2024-06-28 00:29:58,239][06909] Updated weights for policy 0, policy_version 109693 (0.0044) [2024-06-28 00:29:58,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 1797242880. Throughput: 0: 44022.6. Samples: 1700105320. Policy #0 lag: (min: 1.0, avg: 11.6, max: 21.0) [2024-06-28 00:29:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 00:30:02,213][06909] Updated weights for policy 0, policy_version 109703 (0.0028) [2024-06-28 00:30:03,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 1797439488. Throughput: 0: 44183.1. Samples: 1700376140. Policy #0 lag: (min: 1.0, avg: 11.6, max: 21.0) [2024-06-28 00:30:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:30:05,338][06909] Updated weights for policy 0, policy_version 109713 (0.0033) [2024-06-28 00:30:08,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.6, 300 sec: 44209.0). Total num frames: 1797668864. Throughput: 0: 44133.4. Samples: 1700636100. Policy #0 lag: (min: 1.0, avg: 11.6, max: 21.0) [2024-06-28 00:30:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:30:09,625][06909] Updated weights for policy 0, policy_version 109723 (0.0031) [2024-06-28 00:30:13,051][06909] Updated weights for policy 0, policy_version 109733 (0.0026) [2024-06-28 00:30:13,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 1797898240. Throughput: 0: 44187.6. Samples: 1700772180. Policy #0 lag: (min: 1.0, avg: 11.6, max: 21.0) [2024-06-28 00:30:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:30:16,802][06909] Updated weights for policy 0, policy_version 109743 (0.0029) [2024-06-28 00:30:18,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.6, 300 sec: 44098.0). Total num frames: 1798094848. Throughput: 0: 44171.1. Samples: 1701036180. Policy #0 lag: (min: 1.0, avg: 11.6, max: 21.0) [2024-06-28 00:30:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:30:20,553][06909] Updated weights for policy 0, policy_version 109753 (0.0022) [2024-06-28 00:30:23,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 1798340608. Throughput: 0: 44338.6. Samples: 1701304220. Policy #0 lag: (min: 1.0, avg: 11.6, max: 21.0) [2024-06-28 00:30:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:30:24,171][06909] Updated weights for policy 0, policy_version 109763 (0.0030) [2024-06-28 00:30:27,911][06909] Updated weights for policy 0, policy_version 109773 (0.0039) [2024-06-28 00:30:28,850][06674] Fps is (10 sec: 47514.3, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 1798569984. Throughput: 0: 44224.1. Samples: 1701437240. Policy #0 lag: (min: 1.0, avg: 11.6, max: 21.0) [2024-06-28 00:30:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:30:31,640][06909] Updated weights for policy 0, policy_version 109783 (0.0030) [2024-06-28 00:30:33,850][06674] Fps is (10 sec: 42598.9, 60 sec: 44236.9, 300 sec: 44098.8). Total num frames: 1798766592. Throughput: 0: 44327.2. Samples: 1701704040. Policy #0 lag: (min: 1.0, avg: 11.6, max: 21.0) [2024-06-28 00:30:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:30:35,161][06909] Updated weights for policy 0, policy_version 109793 (0.0040) [2024-06-28 00:30:38,850][06674] Fps is (10 sec: 42597.6, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 1798995968. Throughput: 0: 44378.5. Samples: 1701966160. Policy #0 lag: (min: 1.0, avg: 11.6, max: 21.0) [2024-06-28 00:30:38,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:30:39,207][06909] Updated weights for policy 0, policy_version 109803 (0.0031) [2024-06-28 00:30:42,543][06909] Updated weights for policy 0, policy_version 109813 (0.0028) [2024-06-28 00:30:43,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 1799225344. Throughput: 0: 44331.6. Samples: 1702100240. Policy #0 lag: (min: 1.0, avg: 11.6, max: 21.0) [2024-06-28 00:30:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 00:30:46,650][06909] Updated weights for policy 0, policy_version 109823 (0.0032) [2024-06-28 00:30:48,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 1799421952. Throughput: 0: 44103.6. Samples: 1702360800. Policy #0 lag: (min: 1.0, avg: 11.6, max: 21.0) [2024-06-28 00:30:48,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-28 00:30:50,162][06909] Updated weights for policy 0, policy_version 109833 (0.0033) [2024-06-28 00:30:53,848][06909] Updated weights for policy 0, policy_version 109843 (0.0038) [2024-06-28 00:30:53,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44236.7, 300 sec: 44209.0). Total num frames: 1799667712. Throughput: 0: 44167.1. Samples: 1702623620. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:30:53,853][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:30:57,458][06909] Updated weights for policy 0, policy_version 109853 (0.0033) [2024-06-28 00:30:58,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1799880704. Throughput: 0: 44213.4. Samples: 1702761780. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:30:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:31:01,598][06909] Updated weights for policy 0, policy_version 109863 (0.0026) [2024-06-28 00:31:03,850][06674] Fps is (10 sec: 42598.9, 60 sec: 44236.8, 300 sec: 44098.3). Total num frames: 1800093696. Throughput: 0: 44121.9. Samples: 1703021660. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:31:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:31:05,134][06909] Updated weights for policy 0, policy_version 109873 (0.0031) [2024-06-28 00:31:08,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 1800306688. Throughput: 0: 44065.0. Samples: 1703287140. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:31:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:31:08,940][06909] Updated weights for policy 0, policy_version 109883 (0.0034) [2024-06-28 00:31:12,315][06909] Updated weights for policy 0, policy_version 109893 (0.0041) [2024-06-28 00:31:13,850][06674] Fps is (10 sec: 45874.6, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1800552448. Throughput: 0: 44059.0. Samples: 1703419900. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:31:13,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:31:16,324][06909] Updated weights for policy 0, policy_version 109903 (0.0044) [2024-06-28 00:31:18,850][06674] Fps is (10 sec: 47513.7, 60 sec: 44783.0, 300 sec: 44264.6). Total num frames: 1800781824. Throughput: 0: 44134.2. Samples: 1703690080. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:31:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:31:19,727][06909] Updated weights for policy 0, policy_version 109913 (0.0032) [2024-06-28 00:31:23,622][06909] Updated weights for policy 0, policy_version 109923 (0.0028) [2024-06-28 00:31:23,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 1800978432. Throughput: 0: 44149.4. Samples: 1703952880. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:31:23,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 00:31:26,962][06909] Updated weights for policy 0, policy_version 109933 (0.0033) [2024-06-28 00:31:28,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.7, 300 sec: 44098.9). Total num frames: 1801207808. Throughput: 0: 44131.1. Samples: 1704086140. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:31:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:31:30,874][06909] Updated weights for policy 0, policy_version 109943 (0.0024) [2024-06-28 00:31:33,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44509.8, 300 sec: 44264.6). Total num frames: 1801437184. Throughput: 0: 44270.6. Samples: 1704352980. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:31:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:31:34,418][06909] Updated weights for policy 0, policy_version 109953 (0.0040) [2024-06-28 00:31:34,643][06887] Signal inference workers to stop experience collection... (24250 times) [2024-06-28 00:31:34,644][06887] Signal inference workers to resume experience collection... (24250 times) [2024-06-28 00:31:34,688][06909] InferenceWorker_p0-w0: stopping experience collection (24250 times) [2024-06-28 00:31:34,688][06909] InferenceWorker_p0-w0: resuming experience collection (24250 times) [2024-06-28 00:31:38,502][06909] Updated weights for policy 0, policy_version 109963 (0.0044) [2024-06-28 00:31:38,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 1801633792. Throughput: 0: 44335.2. Samples: 1704618700. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:31:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:31:41,698][06909] Updated weights for policy 0, policy_version 109973 (0.0041) [2024-06-28 00:31:43,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 1801879552. Throughput: 0: 44109.7. Samples: 1704746720. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:31:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:31:45,897][06909] Updated weights for policy 0, policy_version 109983 (0.0027) [2024-06-28 00:31:48,850][06674] Fps is (10 sec: 47513.2, 60 sec: 44782.9, 300 sec: 44264.6). Total num frames: 1802108928. Throughput: 0: 44255.9. Samples: 1705013180. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:31:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:31:48,865][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000109992_1802108928.pth... [2024-06-28 00:31:48,943][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000109344_1791492096.pth [2024-06-28 00:31:49,083][06909] Updated weights for policy 0, policy_version 109993 (0.0040) [2024-06-28 00:31:53,480][06909] Updated weights for policy 0, policy_version 110003 (0.0041) [2024-06-28 00:31:53,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.8, 300 sec: 44209.0). Total num frames: 1802305536. Throughput: 0: 44307.5. Samples: 1705280980. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:31:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:31:56,474][06909] Updated weights for policy 0, policy_version 110013 (0.0030) [2024-06-28 00:31:58,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 1802518528. Throughput: 0: 44065.8. Samples: 1705402860. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 00:31:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:32:00,739][06909] Updated weights for policy 0, policy_version 110023 (0.0027) [2024-06-28 00:32:03,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44509.8, 300 sec: 44264.5). Total num frames: 1802764288. Throughput: 0: 44027.4. Samples: 1705671320. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 00:32:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:32:04,389][06909] Updated weights for policy 0, policy_version 110033 (0.0028) [2024-06-28 00:32:08,229][06909] Updated weights for policy 0, policy_version 110043 (0.0024) [2024-06-28 00:32:08,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1802960896. Throughput: 0: 44002.3. Samples: 1705932980. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 00:32:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:32:11,710][06909] Updated weights for policy 0, policy_version 110053 (0.0025) [2024-06-28 00:32:13,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 1803190272. Throughput: 0: 43980.8. Samples: 1706065280. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 00:32:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:32:15,909][06909] Updated weights for policy 0, policy_version 110063 (0.0026) [2024-06-28 00:32:18,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 1803419648. Throughput: 0: 43932.0. Samples: 1706329920. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 00:32:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:32:18,981][06909] Updated weights for policy 0, policy_version 110073 (0.0034) [2024-06-28 00:32:23,628][06909] Updated weights for policy 0, policy_version 110083 (0.0029) [2024-06-28 00:32:23,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 1803616256. Throughput: 0: 43905.6. Samples: 1706594460. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 00:32:23,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:32:26,225][06909] Updated weights for policy 0, policy_version 110093 (0.0036) [2024-06-28 00:32:28,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 1803845632. Throughput: 0: 43718.7. Samples: 1706714060. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 00:32:28,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-28 00:32:30,978][06909] Updated weights for policy 0, policy_version 110103 (0.0034) [2024-06-28 00:32:33,717][06909] Updated weights for policy 0, policy_version 110113 (0.0032) [2024-06-28 00:32:33,850][06674] Fps is (10 sec: 47513.7, 60 sec: 44236.7, 300 sec: 44209.0). Total num frames: 1804091392. Throughput: 0: 43890.6. Samples: 1706988260. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 00:32:33,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:32:38,152][06909] Updated weights for policy 0, policy_version 110123 (0.0030) [2024-06-28 00:32:38,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.6, 300 sec: 44097.9). Total num frames: 1804271616. Throughput: 0: 43807.1. Samples: 1707252300. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 00:32:38,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:32:41,579][06909] Updated weights for policy 0, policy_version 110133 (0.0031) [2024-06-28 00:32:43,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43690.6, 300 sec: 44153.5). Total num frames: 1804500992. Throughput: 0: 43971.5. Samples: 1707381580. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 00:32:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:32:46,071][06909] Updated weights for policy 0, policy_version 110143 (0.0025) [2024-06-28 00:32:47,520][06887] Signal inference workers to stop experience collection... (24300 times) [2024-06-28 00:32:47,556][06909] InferenceWorker_p0-w0: stopping experience collection (24300 times) [2024-06-28 00:32:47,578][06887] Signal inference workers to resume experience collection... (24300 times) [2024-06-28 00:32:47,584][06909] InferenceWorker_p0-w0: resuming experience collection (24300 times) [2024-06-28 00:32:48,742][06909] Updated weights for policy 0, policy_version 110153 (0.0033) [2024-06-28 00:32:48,850][06674] Fps is (10 sec: 47513.9, 60 sec: 43963.8, 300 sec: 44209.0). Total num frames: 1804746752. Throughput: 0: 43974.7. Samples: 1707650180. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 00:32:48,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 00:32:53,406][06909] Updated weights for policy 0, policy_version 110163 (0.0038) [2024-06-28 00:32:53,850][06674] Fps is (10 sec: 42599.4, 60 sec: 43690.8, 300 sec: 44098.0). Total num frames: 1804926976. Throughput: 0: 44107.7. Samples: 1707917820. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 00:32:53,850][06674] Avg episode reward: [(0, '0.473')] [2024-06-28 00:32:56,052][06909] Updated weights for policy 0, policy_version 110173 (0.0031) [2024-06-28 00:32:58,850][06674] Fps is (10 sec: 42597.9, 60 sec: 44236.7, 300 sec: 44209.0). Total num frames: 1805172736. Throughput: 0: 43869.7. Samples: 1708039420. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 00:32:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 00:33:00,687][06909] Updated weights for policy 0, policy_version 110183 (0.0042) [2024-06-28 00:33:03,293][06909] Updated weights for policy 0, policy_version 110193 (0.0043) [2024-06-28 00:33:03,850][06674] Fps is (10 sec: 47512.7, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 1805402112. Throughput: 0: 44110.1. Samples: 1708314880. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 00:33:03,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:33:07,908][06909] Updated weights for policy 0, policy_version 110203 (0.0031) [2024-06-28 00:33:08,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1805582336. Throughput: 0: 44229.4. Samples: 1708584780. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 00:33:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:33:10,813][06909] Updated weights for policy 0, policy_version 110213 (0.0025) [2024-06-28 00:33:13,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 1805828096. Throughput: 0: 44288.0. Samples: 1708707020. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 00:33:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:33:15,542][06909] Updated weights for policy 0, policy_version 110223 (0.0024) [2024-06-28 00:33:18,514][06909] Updated weights for policy 0, policy_version 110233 (0.0030) [2024-06-28 00:33:18,850][06674] Fps is (10 sec: 47513.4, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 1806057472. Throughput: 0: 44102.2. Samples: 1708972860. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 00:33:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:33:23,170][06909] Updated weights for policy 0, policy_version 110243 (0.0039) [2024-06-28 00:33:23,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1806254080. Throughput: 0: 44240.5. Samples: 1709243120. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 00:33:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:33:26,026][06909] Updated weights for policy 0, policy_version 110253 (0.0039) [2024-06-28 00:33:28,850][06674] Fps is (10 sec: 44237.4, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 1806499840. Throughput: 0: 44044.2. Samples: 1709363560. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 00:33:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:33:30,361][06909] Updated weights for policy 0, policy_version 110263 (0.0030) [2024-06-28 00:33:33,185][06909] Updated weights for policy 0, policy_version 110273 (0.0025) [2024-06-28 00:33:33,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43690.8, 300 sec: 44042.4). Total num frames: 1806712832. Throughput: 0: 44106.7. Samples: 1709634980. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 00:33:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:33:37,640][06909] Updated weights for policy 0, policy_version 110283 (0.0041) [2024-06-28 00:33:38,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1806909440. Throughput: 0: 44207.5. Samples: 1709907160. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 00:33:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:33:40,415][06909] Updated weights for policy 0, policy_version 110293 (0.0024) [2024-06-28 00:33:43,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.9, 300 sec: 44097.9). Total num frames: 1807155200. Throughput: 0: 44265.0. Samples: 1710031340. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 00:33:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 00:33:44,901][06909] Updated weights for policy 0, policy_version 110303 (0.0020) [2024-06-28 00:33:48,360][06909] Updated weights for policy 0, policy_version 110313 (0.0044) [2024-06-28 00:33:48,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1807368192. Throughput: 0: 44040.6. Samples: 1710296700. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 00:33:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:33:49,044][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000110315_1807400960.pth... [2024-06-28 00:33:49,082][06887] Signal inference workers to stop experience collection... (24350 times) [2024-06-28 00:33:49,082][06887] Signal inference workers to resume experience collection... (24350 times) [2024-06-28 00:33:49,098][06909] InferenceWorker_p0-w0: stopping experience collection (24350 times) [2024-06-28 00:33:49,099][06909] InferenceWorker_p0-w0: resuming experience collection (24350 times) [2024-06-28 00:33:49,104][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000109667_1796784128.pth [2024-06-28 00:33:52,598][06909] Updated weights for policy 0, policy_version 110323 (0.0029) [2024-06-28 00:33:53,850][06674] Fps is (10 sec: 42598.7, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1807581184. Throughput: 0: 44093.4. Samples: 1710568980. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 00:33:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:33:55,701][06909] Updated weights for policy 0, policy_version 110333 (0.0032) [2024-06-28 00:33:58,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 1807810560. Throughput: 0: 44010.6. Samples: 1710687500. Policy #0 lag: (min: 1.0, avg: 11.2, max: 21.0) [2024-06-28 00:33:58,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:33:59,980][06909] Updated weights for policy 0, policy_version 110343 (0.0032) [2024-06-28 00:34:02,920][06909] Updated weights for policy 0, policy_version 110353 (0.0024) [2024-06-28 00:34:03,850][06674] Fps is (10 sec: 49151.6, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 1808072704. Throughput: 0: 44222.3. Samples: 1710962860. Policy #0 lag: (min: 1.0, avg: 11.2, max: 21.0) [2024-06-28 00:34:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:34:07,413][06909] Updated weights for policy 0, policy_version 110363 (0.0029) [2024-06-28 00:34:08,850][06674] Fps is (10 sec: 44237.3, 60 sec: 44509.9, 300 sec: 44097.9). Total num frames: 1808252928. Throughput: 0: 44285.4. Samples: 1711235960. Policy #0 lag: (min: 1.0, avg: 11.2, max: 21.0) [2024-06-28 00:34:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:34:10,166][06909] Updated weights for policy 0, policy_version 110373 (0.0027) [2024-06-28 00:34:13,850][06674] Fps is (10 sec: 39321.5, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 1808465920. Throughput: 0: 44496.8. Samples: 1711365920. Policy #0 lag: (min: 1.0, avg: 11.2, max: 21.0) [2024-06-28 00:34:13,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:34:14,632][06909] Updated weights for policy 0, policy_version 110383 (0.0029) [2024-06-28 00:34:17,889][06909] Updated weights for policy 0, policy_version 110393 (0.0029) [2024-06-28 00:34:18,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44509.9, 300 sec: 44209.0). Total num frames: 1808728064. Throughput: 0: 44238.6. Samples: 1711625720. Policy #0 lag: (min: 1.0, avg: 11.2, max: 21.0) [2024-06-28 00:34:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:34:21,892][06909] Updated weights for policy 0, policy_version 110403 (0.0028) [2024-06-28 00:34:23,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 1808924672. Throughput: 0: 44195.5. Samples: 1711895960. Policy #0 lag: (min: 1.0, avg: 11.2, max: 21.0) [2024-06-28 00:34:23,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:34:25,167][06909] Updated weights for policy 0, policy_version 110413 (0.0049) [2024-06-28 00:34:28,853][06674] Fps is (10 sec: 39308.1, 60 sec: 43688.1, 300 sec: 44097.4). Total num frames: 1809121280. Throughput: 0: 43944.2. Samples: 1712008980. Policy #0 lag: (min: 1.0, avg: 11.2, max: 21.0) [2024-06-28 00:34:28,854][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:34:29,621][06909] Updated weights for policy 0, policy_version 110423 (0.0036) [2024-06-28 00:34:32,783][06909] Updated weights for policy 0, policy_version 110433 (0.0031) [2024-06-28 00:34:33,850][06674] Fps is (10 sec: 47513.4, 60 sec: 44782.9, 300 sec: 44264.6). Total num frames: 1809399808. Throughput: 0: 44091.9. Samples: 1712280840. Policy #0 lag: (min: 1.0, avg: 11.2, max: 21.0) [2024-06-28 00:34:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:34:37,109][06909] Updated weights for policy 0, policy_version 110443 (0.0031) [2024-06-28 00:34:38,850][06674] Fps is (10 sec: 45890.5, 60 sec: 44509.8, 300 sec: 44097.9). Total num frames: 1809580032. Throughput: 0: 44005.2. Samples: 1712549220. Policy #0 lag: (min: 1.0, avg: 11.2, max: 21.0) [2024-06-28 00:34:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:34:40,209][06909] Updated weights for policy 0, policy_version 110453 (0.0033) [2024-06-28 00:34:43,850][06674] Fps is (10 sec: 37682.9, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 1809776640. Throughput: 0: 44135.1. Samples: 1712673580. Policy #0 lag: (min: 1.0, avg: 11.2, max: 21.0) [2024-06-28 00:34:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:34:44,385][06909] Updated weights for policy 0, policy_version 110463 (0.0038) [2024-06-28 00:34:47,643][06909] Updated weights for policy 0, policy_version 110473 (0.0029) [2024-06-28 00:34:48,850][06674] Fps is (10 sec: 47513.4, 60 sec: 44782.8, 300 sec: 44209.0). Total num frames: 1810055168. Throughput: 0: 43911.5. Samples: 1712938880. Policy #0 lag: (min: 1.0, avg: 11.2, max: 21.0) [2024-06-28 00:34:48,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:34:51,758][06909] Updated weights for policy 0, policy_version 110483 (0.0035) [2024-06-28 00:34:53,850][06674] Fps is (10 sec: 47514.6, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 1810251776. Throughput: 0: 43931.2. Samples: 1713212860. Policy #0 lag: (min: 1.0, avg: 11.2, max: 21.0) [2024-06-28 00:34:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:34:55,327][06909] Updated weights for policy 0, policy_version 110493 (0.0030) [2024-06-28 00:34:58,850][06674] Fps is (10 sec: 40960.5, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 1810464768. Throughput: 0: 43847.6. Samples: 1713339060. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 00:34:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:34:58,997][06909] Updated weights for policy 0, policy_version 110503 (0.0043) [2024-06-28 00:35:02,647][06909] Updated weights for policy 0, policy_version 110513 (0.0034) [2024-06-28 00:35:03,260][06887] Signal inference workers to stop experience collection... (24400 times) [2024-06-28 00:35:03,260][06887] Signal inference workers to resume experience collection... (24400 times) [2024-06-28 00:35:03,278][06909] InferenceWorker_p0-w0: stopping experience collection (24400 times) [2024-06-28 00:35:03,278][06909] InferenceWorker_p0-w0: resuming experience collection (24400 times) [2024-06-28 00:35:03,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.8, 300 sec: 44209.0). Total num frames: 1810710528. Throughput: 0: 44048.0. Samples: 1713607880. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 00:35:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:35:06,514][06909] Updated weights for policy 0, policy_version 110523 (0.0029) [2024-06-28 00:35:08,852][06674] Fps is (10 sec: 44227.7, 60 sec: 44235.3, 300 sec: 44097.6). Total num frames: 1810907136. Throughput: 0: 43826.0. Samples: 1713868220. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 00:35:08,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:35:10,406][06909] Updated weights for policy 0, policy_version 110533 (0.0043) [2024-06-28 00:35:13,850][06674] Fps is (10 sec: 40959.1, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 1811120128. Throughput: 0: 44083.6. Samples: 1713992600. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 00:35:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:35:14,290][06909] Updated weights for policy 0, policy_version 110543 (0.0030) [2024-06-28 00:35:17,715][06909] Updated weights for policy 0, policy_version 110553 (0.0036) [2024-06-28 00:35:18,850][06674] Fps is (10 sec: 45884.5, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 1811365888. Throughput: 0: 44040.0. Samples: 1714262640. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 00:35:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:35:21,493][06909] Updated weights for policy 0, policy_version 110563 (0.0040) [2024-06-28 00:35:23,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1811562496. Throughput: 0: 44031.6. Samples: 1714530640. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 00:35:23,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:35:25,082][06909] Updated weights for policy 0, policy_version 110573 (0.0036) [2024-06-28 00:35:28,816][06909] Updated weights for policy 0, policy_version 110583 (0.0034) [2024-06-28 00:35:28,850][06674] Fps is (10 sec: 42597.8, 60 sec: 44512.3, 300 sec: 44153.5). Total num frames: 1811791872. Throughput: 0: 44115.5. Samples: 1714658780. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 00:35:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:35:32,591][06909] Updated weights for policy 0, policy_version 110593 (0.0031) [2024-06-28 00:35:33,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 1812021248. Throughput: 0: 44198.7. Samples: 1714927820. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 00:35:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:35:36,197][06909] Updated weights for policy 0, policy_version 110603 (0.0032) [2024-06-28 00:35:38,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 1812234240. Throughput: 0: 43903.8. Samples: 1715188540. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 00:35:38,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-28 00:35:39,741][06909] Updated weights for policy 0, policy_version 110613 (0.0040) [2024-06-28 00:35:43,534][06909] Updated weights for policy 0, policy_version 110623 (0.0039) [2024-06-28 00:35:43,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 1812447232. Throughput: 0: 44063.9. Samples: 1715321940. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 00:35:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:35:47,498][06909] Updated weights for policy 0, policy_version 110633 (0.0035) [2024-06-28 00:35:48,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43417.7, 300 sec: 44042.4). Total num frames: 1812660224. Throughput: 0: 44029.3. Samples: 1715589200. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 00:35:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:35:48,893][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000110637_1812676608.pth... [2024-06-28 00:35:48,951][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000109992_1802108928.pth [2024-06-28 00:35:51,260][06909] Updated weights for policy 0, policy_version 110643 (0.0032) [2024-06-28 00:35:53,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 1812889600. Throughput: 0: 44219.8. Samples: 1715858020. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 00:35:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:35:54,831][06909] Updated weights for policy 0, policy_version 110653 (0.0033) [2024-06-28 00:35:58,444][06909] Updated weights for policy 0, policy_version 110663 (0.0031) [2024-06-28 00:35:58,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 1813102592. Throughput: 0: 44291.8. Samples: 1715985720. Policy #0 lag: (min: 1.0, avg: 11.0, max: 22.0) [2024-06-28 00:35:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 00:36:02,382][06909] Updated weights for policy 0, policy_version 110673 (0.0030) [2024-06-28 00:36:03,850][06674] Fps is (10 sec: 44236.0, 60 sec: 43690.5, 300 sec: 44153.5). Total num frames: 1813331968. Throughput: 0: 44227.5. Samples: 1716252880. Policy #0 lag: (min: 1.0, avg: 11.0, max: 22.0) [2024-06-28 00:36:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:36:05,974][06909] Updated weights for policy 0, policy_version 110683 (0.0029) [2024-06-28 00:36:08,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44238.3, 300 sec: 44098.0). Total num frames: 1813561344. Throughput: 0: 43946.3. Samples: 1716508220. Policy #0 lag: (min: 1.0, avg: 11.0, max: 22.0) [2024-06-28 00:36:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:36:09,725][06909] Updated weights for policy 0, policy_version 110693 (0.0023) [2024-06-28 00:36:13,468][06909] Updated weights for policy 0, policy_version 110703 (0.0035) [2024-06-28 00:36:13,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43963.9, 300 sec: 43986.9). Total num frames: 1813757952. Throughput: 0: 44100.6. Samples: 1716643300. Policy #0 lag: (min: 1.0, avg: 11.0, max: 22.0) [2024-06-28 00:36:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 00:36:16,970][06909] Updated weights for policy 0, policy_version 110713 (0.0035) [2024-06-28 00:36:18,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43690.6, 300 sec: 44098.0). Total num frames: 1813987328. Throughput: 0: 43987.1. Samples: 1716907240. Policy #0 lag: (min: 1.0, avg: 11.0, max: 22.0) [2024-06-28 00:36:18,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-28 00:36:20,817][06909] Updated weights for policy 0, policy_version 110723 (0.0039) [2024-06-28 00:36:23,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1814200320. Throughput: 0: 43955.6. Samples: 1717166540. Policy #0 lag: (min: 1.0, avg: 11.0, max: 22.0) [2024-06-28 00:36:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:36:24,655][06909] Updated weights for policy 0, policy_version 110733 (0.0032) [2024-06-28 00:36:28,354][06909] Updated weights for policy 0, policy_version 110743 (0.0021) [2024-06-28 00:36:28,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43963.9, 300 sec: 44042.4). Total num frames: 1814429696. Throughput: 0: 44078.8. Samples: 1717305480. Policy #0 lag: (min: 1.0, avg: 11.0, max: 22.0) [2024-06-28 00:36:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:36:31,207][06887] Signal inference workers to stop experience collection... (24450 times) [2024-06-28 00:36:31,209][06887] Signal inference workers to resume experience collection... (24450 times) [2024-06-28 00:36:31,223][06909] InferenceWorker_p0-w0: stopping experience collection (24450 times) [2024-06-28 00:36:31,257][06909] InferenceWorker_p0-w0: resuming experience collection (24450 times) [2024-06-28 00:36:31,995][06909] Updated weights for policy 0, policy_version 110753 (0.0034) [2024-06-28 00:36:33,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.7, 300 sec: 44097.9). Total num frames: 1814642688. Throughput: 0: 43826.6. Samples: 1717561400. Policy #0 lag: (min: 1.0, avg: 11.0, max: 22.0) [2024-06-28 00:36:33,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 00:36:35,757][06909] Updated weights for policy 0, policy_version 110763 (0.0024) [2024-06-28 00:36:38,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1814872064. Throughput: 0: 43812.0. Samples: 1717829560. Policy #0 lag: (min: 1.0, avg: 11.0, max: 22.0) [2024-06-28 00:36:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:36:39,252][06909] Updated weights for policy 0, policy_version 110773 (0.0033) [2024-06-28 00:36:43,353][06909] Updated weights for policy 0, policy_version 110783 (0.0040) [2024-06-28 00:36:43,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1815085056. Throughput: 0: 44039.6. Samples: 1717967500. Policy #0 lag: (min: 1.0, avg: 11.0, max: 22.0) [2024-06-28 00:36:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:36:46,979][06909] Updated weights for policy 0, policy_version 110793 (0.0028) [2024-06-28 00:36:48,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 1815298048. Throughput: 0: 43896.0. Samples: 1718228200. Policy #0 lag: (min: 1.0, avg: 11.0, max: 22.0) [2024-06-28 00:36:48,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:36:50,688][06909] Updated weights for policy 0, policy_version 110803 (0.0049) [2024-06-28 00:36:53,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 1815527424. Throughput: 0: 44073.7. Samples: 1718491540. Policy #0 lag: (min: 1.0, avg: 11.0, max: 22.0) [2024-06-28 00:36:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 00:36:54,317][06909] Updated weights for policy 0, policy_version 110813 (0.0036) [2024-06-28 00:36:58,259][06909] Updated weights for policy 0, policy_version 110823 (0.0035) [2024-06-28 00:36:58,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1815740416. Throughput: 0: 44126.7. Samples: 1718629000. Policy #0 lag: (min: 1.0, avg: 11.0, max: 22.0) [2024-06-28 00:36:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:37:01,509][06909] Updated weights for policy 0, policy_version 110833 (0.0033) [2024-06-28 00:37:03,852][06674] Fps is (10 sec: 42590.3, 60 sec: 43689.4, 300 sec: 44042.1). Total num frames: 1815953408. Throughput: 0: 44023.1. Samples: 1718888360. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 00:37:03,852][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:37:05,760][06909] Updated weights for policy 0, policy_version 110843 (0.0041) [2024-06-28 00:37:08,813][06909] Updated weights for policy 0, policy_version 110853 (0.0035) [2024-06-28 00:37:08,852][06674] Fps is (10 sec: 47503.4, 60 sec: 44235.2, 300 sec: 44153.2). Total num frames: 1816215552. Throughput: 0: 44180.2. Samples: 1719154740. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 00:37:08,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 00:37:13,166][06909] Updated weights for policy 0, policy_version 110863 (0.0031) [2024-06-28 00:37:13,850][06674] Fps is (10 sec: 47522.6, 60 sec: 44509.8, 300 sec: 44098.0). Total num frames: 1816428544. Throughput: 0: 44164.4. Samples: 1719292880. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 00:37:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:37:16,260][06909] Updated weights for policy 0, policy_version 110873 (0.0024) [2024-06-28 00:37:18,850][06674] Fps is (10 sec: 40968.4, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 1816625152. Throughput: 0: 44210.6. Samples: 1719550880. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 00:37:18,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:37:20,713][06909] Updated weights for policy 0, policy_version 110883 (0.0034) [2024-06-28 00:37:23,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1816854528. Throughput: 0: 44000.9. Samples: 1719809600. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 00:37:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:37:23,890][06909] Updated weights for policy 0, policy_version 110893 (0.0036) [2024-06-28 00:37:28,351][06909] Updated weights for policy 0, policy_version 110903 (0.0030) [2024-06-28 00:37:28,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 1817067520. Throughput: 0: 44054.0. Samples: 1719949940. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 00:37:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:37:31,322][06909] Updated weights for policy 0, policy_version 110913 (0.0036) [2024-06-28 00:37:33,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1817296896. Throughput: 0: 44105.8. Samples: 1720212960. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 00:37:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:37:35,527][06909] Updated weights for policy 0, policy_version 110923 (0.0037) [2024-06-28 00:37:38,716][06909] Updated weights for policy 0, policy_version 110933 (0.0027) [2024-06-28 00:37:38,850][06674] Fps is (10 sec: 45876.0, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1817526272. Throughput: 0: 44005.8. Samples: 1720471800. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 00:37:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:37:43,322][06909] Updated weights for policy 0, policy_version 110943 (0.0037) [2024-06-28 00:37:43,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1817722880. Throughput: 0: 43879.1. Samples: 1720603560. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 00:37:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:37:46,153][06909] Updated weights for policy 0, policy_version 110953 (0.0039) [2024-06-28 00:37:48,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 1817952256. Throughput: 0: 44129.9. Samples: 1720874120. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 00:37:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:37:48,953][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000110960_1817968640.pth... [2024-06-28 00:37:49,014][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000110315_1807400960.pth [2024-06-28 00:37:50,466][06909] Updated weights for policy 0, policy_version 110963 (0.0032) [2024-06-28 00:37:53,357][06909] Updated weights for policy 0, policy_version 110973 (0.0034) [2024-06-28 00:37:53,850][06674] Fps is (10 sec: 47513.3, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 1818198016. Throughput: 0: 44022.0. Samples: 1721135640. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 00:37:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:37:57,843][06909] Updated weights for policy 0, policy_version 110983 (0.0034) [2024-06-28 00:37:58,850][06674] Fps is (10 sec: 44236.1, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 1818394624. Throughput: 0: 44178.6. Samples: 1721280920. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 00:37:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:38:00,450][06909] Updated weights for policy 0, policy_version 110993 (0.0040) [2024-06-28 00:38:03,850][06674] Fps is (10 sec: 42598.7, 60 sec: 44511.3, 300 sec: 44209.0). Total num frames: 1818624000. Throughput: 0: 44260.9. Samples: 1721542620. Policy #0 lag: (min: 0.0, avg: 12.0, max: 21.0) [2024-06-28 00:38:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:38:05,403][06909] Updated weights for policy 0, policy_version 111003 (0.0031) [2024-06-28 00:38:08,334][06909] Updated weights for policy 0, policy_version 111013 (0.0045) [2024-06-28 00:38:08,850][06674] Fps is (10 sec: 47513.7, 60 sec: 44238.3, 300 sec: 44209.0). Total num frames: 1818869760. Throughput: 0: 44168.3. Samples: 1721797180. Policy #0 lag: (min: 0.0, avg: 12.0, max: 21.0) [2024-06-28 00:38:08,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:38:12,692][06909] Updated weights for policy 0, policy_version 111023 (0.0037) [2024-06-28 00:38:13,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1819049984. Throughput: 0: 44126.9. Samples: 1721935640. Policy #0 lag: (min: 0.0, avg: 12.0, max: 21.0) [2024-06-28 00:38:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:38:13,962][06887] Signal inference workers to stop experience collection... (24500 times) [2024-06-28 00:38:13,963][06887] Signal inference workers to resume experience collection... (24500 times) [2024-06-28 00:38:14,016][06909] InferenceWorker_p0-w0: stopping experience collection (24500 times) [2024-06-28 00:38:14,016][06909] InferenceWorker_p0-w0: resuming experience collection (24500 times) [2024-06-28 00:38:15,709][06909] Updated weights for policy 0, policy_version 111033 (0.0027) [2024-06-28 00:38:18,850][06674] Fps is (10 sec: 40960.5, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 1819279360. Throughput: 0: 44142.7. Samples: 1722199380. Policy #0 lag: (min: 0.0, avg: 12.0, max: 21.0) [2024-06-28 00:38:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:38:20,073][06909] Updated weights for policy 0, policy_version 111043 (0.0030) [2024-06-28 00:38:23,053][06909] Updated weights for policy 0, policy_version 111053 (0.0029) [2024-06-28 00:38:23,850][06674] Fps is (10 sec: 47513.1, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 1819525120. Throughput: 0: 44101.7. Samples: 1722456380. Policy #0 lag: (min: 0.0, avg: 12.0, max: 21.0) [2024-06-28 00:38:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:38:27,777][06909] Updated weights for policy 0, policy_version 111063 (0.0045) [2024-06-28 00:38:28,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1819688960. Throughput: 0: 44227.0. Samples: 1722593780. Policy #0 lag: (min: 0.0, avg: 12.0, max: 21.0) [2024-06-28 00:38:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:38:30,439][06909] Updated weights for policy 0, policy_version 111073 (0.0039) [2024-06-28 00:38:33,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 1819951104. Throughput: 0: 44161.2. Samples: 1722861380. Policy #0 lag: (min: 0.0, avg: 12.0, max: 21.0) [2024-06-28 00:38:33,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:38:34,932][06909] Updated weights for policy 0, policy_version 111083 (0.0037) [2024-06-28 00:38:38,075][06909] Updated weights for policy 0, policy_version 111093 (0.0037) [2024-06-28 00:38:38,850][06674] Fps is (10 sec: 47513.9, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 1820164096. Throughput: 0: 44016.9. Samples: 1723116400. Policy #0 lag: (min: 0.0, avg: 12.0, max: 21.0) [2024-06-28 00:38:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 00:38:42,379][06909] Updated weights for policy 0, policy_version 111103 (0.0040) [2024-06-28 00:38:43,850][06674] Fps is (10 sec: 42598.9, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1820377088. Throughput: 0: 43787.3. Samples: 1723251340. Policy #0 lag: (min: 0.0, avg: 12.0, max: 21.0) [2024-06-28 00:38:43,850][06674] Avg episode reward: [(0, '0.442')] [2024-06-28 00:38:45,619][06909] Updated weights for policy 0, policy_version 111113 (0.0034) [2024-06-28 00:38:48,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 1820590080. Throughput: 0: 43751.1. Samples: 1723511420. Policy #0 lag: (min: 0.0, avg: 12.0, max: 21.0) [2024-06-28 00:38:48,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:38:49,802][06909] Updated weights for policy 0, policy_version 111123 (0.0051) [2024-06-28 00:38:53,251][06909] Updated weights for policy 0, policy_version 111133 (0.0031) [2024-06-28 00:38:53,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 1820835840. Throughput: 0: 43875.6. Samples: 1723771580. Policy #0 lag: (min: 0.0, avg: 12.0, max: 21.0) [2024-06-28 00:38:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:38:57,181][06909] Updated weights for policy 0, policy_version 111143 (0.0029) [2024-06-28 00:38:58,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.8, 300 sec: 43875.8). Total num frames: 1821016064. Throughput: 0: 43858.2. Samples: 1723909260. Policy #0 lag: (min: 0.0, avg: 12.0, max: 21.0) [2024-06-28 00:38:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:39:00,414][06909] Updated weights for policy 0, policy_version 111153 (0.0036) [2024-06-28 00:39:03,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 1821245440. Throughput: 0: 43836.8. Samples: 1724172040. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:39:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:39:04,689][06909] Updated weights for policy 0, policy_version 111163 (0.0038) [2024-06-28 00:39:07,803][06909] Updated weights for policy 0, policy_version 111173 (0.0031) [2024-06-28 00:39:08,850][06674] Fps is (10 sec: 47513.7, 60 sec: 43690.8, 300 sec: 44153.5). Total num frames: 1821491200. Throughput: 0: 43951.2. Samples: 1724434180. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:39:08,850][06674] Avg episode reward: [(0, '0.475')] [2024-06-28 00:39:12,049][06909] Updated weights for policy 0, policy_version 111183 (0.0040) [2024-06-28 00:39:13,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 1821687808. Throughput: 0: 43972.9. Samples: 1724572560. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:39:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:39:15,296][06909] Updated weights for policy 0, policy_version 111193 (0.0050) [2024-06-28 00:39:18,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 1821900800. Throughput: 0: 43781.8. Samples: 1724831560. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:39:18,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:39:19,214][06887] Signal inference workers to stop experience collection... (24550 times) [2024-06-28 00:39:19,215][06887] Signal inference workers to resume experience collection... (24550 times) [2024-06-28 00:39:19,240][06909] InferenceWorker_p0-w0: stopping experience collection (24550 times) [2024-06-28 00:39:19,240][06909] InferenceWorker_p0-w0: resuming experience collection (24550 times) [2024-06-28 00:39:19,609][06909] Updated weights for policy 0, policy_version 111203 (0.0044) [2024-06-28 00:39:23,008][06909] Updated weights for policy 0, policy_version 111213 (0.0025) [2024-06-28 00:39:23,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43690.7, 300 sec: 44154.0). Total num frames: 1822146560. Throughput: 0: 43832.0. Samples: 1725088840. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:39:23,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:39:27,197][06909] Updated weights for policy 0, policy_version 111223 (0.0028) [2024-06-28 00:39:28,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.8, 300 sec: 43820.3). Total num frames: 1822326784. Throughput: 0: 43862.1. Samples: 1725225140. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:39:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:39:30,642][06909] Updated weights for policy 0, policy_version 111233 (0.0037) [2024-06-28 00:39:33,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43417.7, 300 sec: 43986.9). Total num frames: 1822556160. Throughput: 0: 43889.8. Samples: 1725486460. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:39:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:39:34,856][06909] Updated weights for policy 0, policy_version 111243 (0.0043) [2024-06-28 00:39:38,083][06909] Updated weights for policy 0, policy_version 111253 (0.0039) [2024-06-28 00:39:38,850][06674] Fps is (10 sec: 47513.9, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 1822801920. Throughput: 0: 43849.8. Samples: 1725744820. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:39:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:39:42,295][06909] Updated weights for policy 0, policy_version 111263 (0.0031) [2024-06-28 00:39:43,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43963.6, 300 sec: 43931.3). Total num frames: 1823014912. Throughput: 0: 43884.8. Samples: 1725884080. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:39:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:39:45,585][06909] Updated weights for policy 0, policy_version 111273 (0.0027) [2024-06-28 00:39:48,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43963.7, 300 sec: 43986.8). Total num frames: 1823227904. Throughput: 0: 43833.8. Samples: 1726144560. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:39:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:39:48,856][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000111281_1823227904.pth... [2024-06-28 00:39:48,916][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000110637_1812676608.pth [2024-06-28 00:39:49,735][06909] Updated weights for policy 0, policy_version 111283 (0.0025) [2024-06-28 00:39:53,174][06909] Updated weights for policy 0, policy_version 111293 (0.0041) [2024-06-28 00:39:53,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 1823457280. Throughput: 0: 43858.1. Samples: 1726407800. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:39:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 00:39:57,163][06909] Updated weights for policy 0, policy_version 111303 (0.0029) [2024-06-28 00:39:58,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.7, 300 sec: 43931.3). Total num frames: 1823670272. Throughput: 0: 43785.7. Samples: 1726542920. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:39:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:40:00,742][06909] Updated weights for policy 0, policy_version 111313 (0.0027) [2024-06-28 00:40:03,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 43987.2). Total num frames: 1823883264. Throughput: 0: 43738.2. Samples: 1726799780. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 00:40:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:40:04,971][06909] Updated weights for policy 0, policy_version 111323 (0.0038) [2024-06-28 00:40:08,140][06909] Updated weights for policy 0, policy_version 111333 (0.0033) [2024-06-28 00:40:08,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 1824112640. Throughput: 0: 43904.0. Samples: 1727064520. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 00:40:08,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 00:40:12,458][06909] Updated weights for policy 0, policy_version 111343 (0.0039) [2024-06-28 00:40:13,852][06674] Fps is (10 sec: 44228.1, 60 sec: 43962.3, 300 sec: 43931.0). Total num frames: 1824325632. Throughput: 0: 43818.5. Samples: 1727197060. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 00:40:13,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:40:15,878][06909] Updated weights for policy 0, policy_version 111353 (0.0028) [2024-06-28 00:40:18,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1824555008. Throughput: 0: 43967.5. Samples: 1727465000. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 00:40:18,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 00:40:19,821][06909] Updated weights for policy 0, policy_version 111363 (0.0032) [2024-06-28 00:40:23,227][06909] Updated weights for policy 0, policy_version 111373 (0.0025) [2024-06-28 00:40:23,850][06674] Fps is (10 sec: 44245.7, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1824768000. Throughput: 0: 44056.4. Samples: 1727727360. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 00:40:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:40:27,090][06909] Updated weights for policy 0, policy_version 111383 (0.0022) [2024-06-28 00:40:28,850][06674] Fps is (10 sec: 44237.4, 60 sec: 44509.9, 300 sec: 43986.9). Total num frames: 1824997376. Throughput: 0: 43850.4. Samples: 1727857340. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 00:40:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:40:30,411][06909] Updated weights for policy 0, policy_version 111393 (0.0037) [2024-06-28 00:40:33,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 1825193984. Throughput: 0: 43972.0. Samples: 1728123300. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 00:40:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:40:34,367][06909] Updated weights for policy 0, policy_version 111403 (0.0037) [2024-06-28 00:40:38,105][06909] Updated weights for policy 0, policy_version 111413 (0.0028) [2024-06-28 00:40:38,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1825423360. Throughput: 0: 44010.3. Samples: 1728388260. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 00:40:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 00:40:41,939][06909] Updated weights for policy 0, policy_version 111423 (0.0026) [2024-06-28 00:40:43,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43690.8, 300 sec: 43986.9). Total num frames: 1825636352. Throughput: 0: 43791.3. Samples: 1728513520. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 00:40:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:40:45,484][06909] Updated weights for policy 0, policy_version 111433 (0.0046) [2024-06-28 00:40:48,368][06887] Signal inference workers to stop experience collection... (24600 times) [2024-06-28 00:40:48,391][06909] InferenceWorker_p0-w0: stopping experience collection (24600 times) [2024-06-28 00:40:48,428][06887] Signal inference workers to resume experience collection... (24600 times) [2024-06-28 00:40:48,428][06909] InferenceWorker_p0-w0: resuming experience collection (24600 times) [2024-06-28 00:40:48,852][06674] Fps is (10 sec: 44227.5, 60 sec: 43962.3, 300 sec: 43986.6). Total num frames: 1825865728. Throughput: 0: 43989.2. Samples: 1728779380. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 00:40:48,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 00:40:49,552][06909] Updated weights for policy 0, policy_version 111443 (0.0033) [2024-06-28 00:40:52,807][06909] Updated weights for policy 0, policy_version 111453 (0.0032) [2024-06-28 00:40:53,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1826078720. Throughput: 0: 44114.7. Samples: 1729049680. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 00:40:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:40:56,928][06909] Updated weights for policy 0, policy_version 111463 (0.0030) [2024-06-28 00:40:58,850][06674] Fps is (10 sec: 42607.1, 60 sec: 43690.8, 300 sec: 43931.4). Total num frames: 1826291712. Throughput: 0: 44064.7. Samples: 1729179880. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 00:40:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:41:00,119][06909] Updated weights for policy 0, policy_version 111473 (0.0024) [2024-06-28 00:41:03,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 1826521088. Throughput: 0: 43951.0. Samples: 1729442800. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 00:41:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:41:04,253][06909] Updated weights for policy 0, policy_version 111483 (0.0044) [2024-06-28 00:41:07,933][06909] Updated weights for policy 0, policy_version 111493 (0.0028) [2024-06-28 00:41:08,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1826734080. Throughput: 0: 44058.7. Samples: 1729710000. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 00:41:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:41:11,520][06909] Updated weights for policy 0, policy_version 111503 (0.0032) [2024-06-28 00:41:13,850][06674] Fps is (10 sec: 44237.7, 60 sec: 43965.2, 300 sec: 43986.9). Total num frames: 1826963456. Throughput: 0: 44139.5. Samples: 1729843620. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 00:41:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:41:15,358][06909] Updated weights for policy 0, policy_version 111513 (0.0034) [2024-06-28 00:41:18,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1827176448. Throughput: 0: 43995.7. Samples: 1730103100. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 00:41:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:41:19,199][06909] Updated weights for policy 0, policy_version 111523 (0.0035) [2024-06-28 00:41:22,715][06909] Updated weights for policy 0, policy_version 111533 (0.0040) [2024-06-28 00:41:23,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 1827389440. Throughput: 0: 43968.4. Samples: 1730366840. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 00:41:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:41:26,796][06909] Updated weights for policy 0, policy_version 111543 (0.0039) [2024-06-28 00:41:28,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 1827618816. Throughput: 0: 44231.0. Samples: 1730503920. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 00:41:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:41:29,961][06909] Updated weights for policy 0, policy_version 111553 (0.0032) [2024-06-28 00:41:33,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.9, 300 sec: 43931.3). Total num frames: 1827831808. Throughput: 0: 44161.6. Samples: 1730766560. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 00:41:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:41:34,007][06909] Updated weights for policy 0, policy_version 111563 (0.0027) [2024-06-28 00:41:37,475][06909] Updated weights for policy 0, policy_version 111573 (0.0030) [2024-06-28 00:41:38,850][06674] Fps is (10 sec: 45874.6, 60 sec: 44236.6, 300 sec: 44042.4). Total num frames: 1828077568. Throughput: 0: 44146.1. Samples: 1731036260. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 00:41:38,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 00:41:41,181][06909] Updated weights for policy 0, policy_version 111583 (0.0037) [2024-06-28 00:41:43,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1828290560. Throughput: 0: 44256.9. Samples: 1731171440. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 00:41:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:41:44,898][06909] Updated weights for policy 0, policy_version 111593 (0.0036) [2024-06-28 00:41:48,262][06909] Updated weights for policy 0, policy_version 111603 (0.0038) [2024-06-28 00:41:48,850][06674] Fps is (10 sec: 42599.2, 60 sec: 43965.2, 300 sec: 43986.9). Total num frames: 1828503552. Throughput: 0: 44281.1. Samples: 1731435440. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 00:41:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:41:48,872][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000111603_1828503552.pth... [2024-06-28 00:41:48,929][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000110960_1817968640.pth [2024-06-28 00:41:52,223][06909] Updated weights for policy 0, policy_version 111613 (0.0027) [2024-06-28 00:41:53,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 1828732928. Throughput: 0: 44253.4. Samples: 1731701400. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 00:41:53,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-28 00:41:55,467][06909] Updated weights for policy 0, policy_version 111623 (0.0030) [2024-06-28 00:41:58,850][06674] Fps is (10 sec: 45874.6, 60 sec: 44509.8, 300 sec: 44098.2). Total num frames: 1828962304. Throughput: 0: 44312.7. Samples: 1731837700. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 00:41:58,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:41:59,284][06909] Updated weights for policy 0, policy_version 111633 (0.0039) [2024-06-28 00:42:03,520][06909] Updated weights for policy 0, policy_version 111643 (0.0026) [2024-06-28 00:42:03,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44237.0, 300 sec: 43931.7). Total num frames: 1829175296. Throughput: 0: 44373.8. Samples: 1732099920. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 00:42:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:42:07,040][06909] Updated weights for policy 0, policy_version 111653 (0.0042) [2024-06-28 00:42:08,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44509.8, 300 sec: 43986.9). Total num frames: 1829404672. Throughput: 0: 44389.3. Samples: 1732364360. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 00:42:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:42:10,713][06909] Updated weights for policy 0, policy_version 111663 (0.0030) [2024-06-28 00:42:13,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1829617664. Throughput: 0: 44356.5. Samples: 1732499960. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 00:42:13,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:42:14,187][06909] Updated weights for policy 0, policy_version 111673 (0.0039) [2024-06-28 00:42:18,067][06909] Updated weights for policy 0, policy_version 111683 (0.0028) [2024-06-28 00:42:18,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 1829847040. Throughput: 0: 44460.7. Samples: 1732767300. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 00:42:18,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:42:22,107][06909] Updated weights for policy 0, policy_version 111693 (0.0034) [2024-06-28 00:42:23,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 1830060032. Throughput: 0: 44326.4. Samples: 1733030940. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 00:42:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:42:25,305][06909] Updated weights for policy 0, policy_version 111703 (0.0024) [2024-06-28 00:42:28,850][06674] Fps is (10 sec: 42598.7, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1830273024. Throughput: 0: 44230.7. Samples: 1733161820. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 00:42:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:42:29,217][06909] Updated weights for policy 0, policy_version 111713 (0.0038) [2024-06-28 00:42:33,266][06909] Updated weights for policy 0, policy_version 111723 (0.0043) [2024-06-28 00:42:33,850][06674] Fps is (10 sec: 44236.1, 60 sec: 44509.7, 300 sec: 43986.9). Total num frames: 1830502400. Throughput: 0: 44132.8. Samples: 1733421420. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 00:42:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:42:36,463][06909] Updated weights for policy 0, policy_version 111733 (0.0041) [2024-06-28 00:42:36,861][06887] Signal inference workers to stop experience collection... (24650 times) [2024-06-28 00:42:36,904][06909] InferenceWorker_p0-w0: stopping experience collection (24650 times) [2024-06-28 00:42:36,915][06887] Signal inference workers to resume experience collection... (24650 times) [2024-06-28 00:42:36,926][06909] InferenceWorker_p0-w0: resuming experience collection (24650 times) [2024-06-28 00:42:38,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.9, 300 sec: 44042.4). Total num frames: 1830715392. Throughput: 0: 44193.3. Samples: 1733690100. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 00:42:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:42:40,570][06909] Updated weights for policy 0, policy_version 111743 (0.0031) [2024-06-28 00:42:43,850][06674] Fps is (10 sec: 44237.5, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1830944768. Throughput: 0: 44099.7. Samples: 1733822180. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 00:42:43,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-28 00:42:44,054][06909] Updated weights for policy 0, policy_version 111753 (0.0025) [2024-06-28 00:42:48,391][06909] Updated weights for policy 0, policy_version 111763 (0.0027) [2024-06-28 00:42:48,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 1831157760. Throughput: 0: 44180.4. Samples: 1734088040. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 00:42:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:42:51,248][06909] Updated weights for policy 0, policy_version 111773 (0.0026) [2024-06-28 00:42:53,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1831370752. Throughput: 0: 44149.9. Samples: 1734351100. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 00:42:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:42:55,565][06909] Updated weights for policy 0, policy_version 111783 (0.0029) [2024-06-28 00:42:58,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1831600128. Throughput: 0: 44040.4. Samples: 1734481780. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 00:42:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:42:59,127][06909] Updated weights for policy 0, policy_version 111793 (0.0043) [2024-06-28 00:43:03,014][06909] Updated weights for policy 0, policy_version 111803 (0.0037) [2024-06-28 00:43:03,850][06674] Fps is (10 sec: 45874.4, 60 sec: 44236.7, 300 sec: 43931.3). Total num frames: 1831829504. Throughput: 0: 43932.4. Samples: 1734744260. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 00:43:03,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:43:06,335][06909] Updated weights for policy 0, policy_version 111813 (0.0036) [2024-06-28 00:43:08,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 1832058880. Throughput: 0: 44008.3. Samples: 1735011320. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 00:43:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 00:43:10,431][06909] Updated weights for policy 0, policy_version 111823 (0.0033) [2024-06-28 00:43:13,645][06909] Updated weights for policy 0, policy_version 111833 (0.0039) [2024-06-28 00:43:13,850][06674] Fps is (10 sec: 44237.5, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1832271872. Throughput: 0: 44053.8. Samples: 1735144240. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 00:43:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:43:17,774][06909] Updated weights for policy 0, policy_version 111843 (0.0040) [2024-06-28 00:43:18,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1832501248. Throughput: 0: 44228.9. Samples: 1735411720. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 00:43:18,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:43:20,999][06909] Updated weights for policy 0, policy_version 111853 (0.0035) [2024-06-28 00:43:23,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 1832714240. Throughput: 0: 44135.0. Samples: 1735676180. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 00:43:23,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:43:25,353][06909] Updated weights for policy 0, policy_version 111863 (0.0035) [2024-06-28 00:43:28,384][06909] Updated weights for policy 0, policy_version 111873 (0.0023) [2024-06-28 00:43:28,850][06674] Fps is (10 sec: 42599.3, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 1832927232. Throughput: 0: 44169.8. Samples: 1735809820. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 00:43:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:43:32,551][06909] Updated weights for policy 0, policy_version 111883 (0.0042) [2024-06-28 00:43:33,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1833156608. Throughput: 0: 44243.0. Samples: 1736078980. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 00:43:33,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 00:43:35,925][06909] Updated weights for policy 0, policy_version 111893 (0.0039) [2024-06-28 00:43:38,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1833369600. Throughput: 0: 44073.8. Samples: 1736334420. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 00:43:38,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:43:40,021][06909] Updated weights for policy 0, policy_version 111903 (0.0029) [2024-06-28 00:43:43,445][06909] Updated weights for policy 0, policy_version 111913 (0.0035) [2024-06-28 00:43:43,850][06674] Fps is (10 sec: 44237.4, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1833598976. Throughput: 0: 44184.5. Samples: 1736470080. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 00:43:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:43:47,535][06909] Updated weights for policy 0, policy_version 111923 (0.0039) [2024-06-28 00:43:48,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1833811968. Throughput: 0: 44288.5. Samples: 1736737240. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 00:43:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:43:48,856][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000111927_1833811968.pth... [2024-06-28 00:43:48,912][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000111281_1823227904.pth [2024-06-28 00:43:50,716][06909] Updated weights for policy 0, policy_version 111933 (0.0039) [2024-06-28 00:43:53,850][06674] Fps is (10 sec: 42598.1, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 1834024960. Throughput: 0: 44048.9. Samples: 1736993520. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 00:43:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:43:54,735][06909] Updated weights for policy 0, policy_version 111943 (0.0029) [2024-06-28 00:43:58,297][06909] Updated weights for policy 0, policy_version 111953 (0.0036) [2024-06-28 00:43:58,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1834254336. Throughput: 0: 44104.4. Samples: 1737128940. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 00:43:58,859][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:44:02,305][06909] Updated weights for policy 0, policy_version 111963 (0.0041) [2024-06-28 00:44:03,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1834467328. Throughput: 0: 44032.1. Samples: 1737393160. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 00:44:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 00:44:04,605][06887] Signal inference workers to stop experience collection... (24700 times) [2024-06-28 00:44:04,605][06887] Signal inference workers to resume experience collection... (24700 times) [2024-06-28 00:44:04,647][06909] InferenceWorker_p0-w0: stopping experience collection (24700 times) [2024-06-28 00:44:04,647][06909] InferenceWorker_p0-w0: resuming experience collection (24700 times) [2024-06-28 00:44:05,608][06909] Updated weights for policy 0, policy_version 111973 (0.0033) [2024-06-28 00:44:08,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1834680320. Throughput: 0: 43884.1. Samples: 1737650960. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:44:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:44:09,521][06909] Updated weights for policy 0, policy_version 111983 (0.0035) [2024-06-28 00:44:13,204][06909] Updated weights for policy 0, policy_version 111993 (0.0028) [2024-06-28 00:44:13,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 1834909696. Throughput: 0: 44053.7. Samples: 1737792240. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:44:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:44:16,963][06909] Updated weights for policy 0, policy_version 112003 (0.0044) [2024-06-28 00:44:18,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.8, 300 sec: 43986.9). Total num frames: 1835122688. Throughput: 0: 43885.9. Samples: 1738053840. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:44:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:44:20,546][06909] Updated weights for policy 0, policy_version 112013 (0.0029) [2024-06-28 00:44:23,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 1835352064. Throughput: 0: 44049.3. Samples: 1738316640. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:44:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 00:44:24,568][06909] Updated weights for policy 0, policy_version 112023 (0.0039) [2024-06-28 00:44:28,052][06909] Updated weights for policy 0, policy_version 112033 (0.0025) [2024-06-28 00:44:28,856][06674] Fps is (10 sec: 45846.9, 60 sec: 44232.2, 300 sec: 44152.6). Total num frames: 1835581440. Throughput: 0: 44139.3. Samples: 1738456620. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:44:28,857][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:44:31,905][06909] Updated weights for policy 0, policy_version 112043 (0.0031) [2024-06-28 00:44:33,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 1835810816. Throughput: 0: 44196.5. Samples: 1738726080. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:44:33,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 00:44:35,405][06909] Updated weights for policy 0, policy_version 112053 (0.0040) [2024-06-28 00:44:38,852][06674] Fps is (10 sec: 44254.8, 60 sec: 44235.2, 300 sec: 44097.7). Total num frames: 1836023808. Throughput: 0: 44184.7. Samples: 1738981920. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:44:38,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:44:39,442][06909] Updated weights for policy 0, policy_version 112063 (0.0038) [2024-06-28 00:44:42,942][06909] Updated weights for policy 0, policy_version 112073 (0.0037) [2024-06-28 00:44:43,850][06674] Fps is (10 sec: 44236.1, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 1836253184. Throughput: 0: 44189.7. Samples: 1739117480. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:44:43,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:44:46,633][06909] Updated weights for policy 0, policy_version 112083 (0.0036) [2024-06-28 00:44:48,852][06674] Fps is (10 sec: 42598.4, 60 sec: 43962.3, 300 sec: 44042.1). Total num frames: 1836449792. Throughput: 0: 44237.6. Samples: 1739383940. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:44:48,852][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:44:50,522][06909] Updated weights for policy 0, policy_version 112093 (0.0036) [2024-06-28 00:44:53,850][06674] Fps is (10 sec: 42599.0, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1836679168. Throughput: 0: 44252.4. Samples: 1739642320. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:44:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 00:44:54,428][06909] Updated weights for policy 0, policy_version 112103 (0.0045) [2024-06-28 00:44:57,808][06909] Updated weights for policy 0, policy_version 112113 (0.0028) [2024-06-28 00:44:58,850][06674] Fps is (10 sec: 45884.0, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 1836908544. Throughput: 0: 44040.2. Samples: 1739774060. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:44:58,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:45:01,776][06909] Updated weights for policy 0, policy_version 112123 (0.0042) [2024-06-28 00:45:03,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 1837121536. Throughput: 0: 44056.9. Samples: 1740036400. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:45:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:45:05,251][06909] Updated weights for policy 0, policy_version 112133 (0.0038) [2024-06-28 00:45:08,850][06674] Fps is (10 sec: 42599.2, 60 sec: 44236.8, 300 sec: 44098.3). Total num frames: 1837334528. Throughput: 0: 44130.3. Samples: 1740302500. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:45:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:45:09,369][06909] Updated weights for policy 0, policy_version 112143 (0.0029) [2024-06-28 00:45:12,720][06909] Updated weights for policy 0, policy_version 112153 (0.0037) [2024-06-28 00:45:13,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 1837580288. Throughput: 0: 44027.8. Samples: 1740437600. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2024-06-28 00:45:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:45:16,502][06909] Updated weights for policy 0, policy_version 112163 (0.0032) [2024-06-28 00:45:18,856][06674] Fps is (10 sec: 44209.6, 60 sec: 44232.3, 300 sec: 44097.0). Total num frames: 1837776896. Throughput: 0: 43982.9. Samples: 1740705580. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2024-06-28 00:45:18,857][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:45:20,249][06909] Updated weights for policy 0, policy_version 112173 (0.0036) [2024-06-28 00:45:23,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1837989888. Throughput: 0: 44149.5. Samples: 1740968560. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2024-06-28 00:45:23,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:45:23,892][06909] Updated weights for policy 0, policy_version 112183 (0.0029) [2024-06-28 00:45:27,783][06909] Updated weights for policy 0, policy_version 112193 (0.0028) [2024-06-28 00:45:28,850][06674] Fps is (10 sec: 45903.7, 60 sec: 44241.4, 300 sec: 44209.1). Total num frames: 1838235648. Throughput: 0: 44126.4. Samples: 1741103160. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2024-06-28 00:45:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:45:31,285][06909] Updated weights for policy 0, policy_version 112203 (0.0030) [2024-06-28 00:45:33,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43417.6, 300 sec: 44042.4). Total num frames: 1838415872. Throughput: 0: 43991.4. Samples: 1741363460. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2024-06-28 00:45:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:45:35,163][06909] Updated weights for policy 0, policy_version 112213 (0.0037) [2024-06-28 00:45:38,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43692.1, 300 sec: 44097.9). Total num frames: 1838645248. Throughput: 0: 44178.2. Samples: 1741630340. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2024-06-28 00:45:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:45:38,946][06909] Updated weights for policy 0, policy_version 112223 (0.0023) [2024-06-28 00:45:42,524][06909] Updated weights for policy 0, policy_version 112233 (0.0029) [2024-06-28 00:45:43,850][06674] Fps is (10 sec: 47513.6, 60 sec: 43963.9, 300 sec: 44153.8). Total num frames: 1838891008. Throughput: 0: 44247.8. Samples: 1741765200. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2024-06-28 00:45:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:45:46,277][06909] Updated weights for policy 0, policy_version 112243 (0.0032) [2024-06-28 00:45:48,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44238.3, 300 sec: 44153.5). Total num frames: 1839104000. Throughput: 0: 44251.9. Samples: 1742027740. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2024-06-28 00:45:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:45:48,868][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000112250_1839104000.pth... [2024-06-28 00:45:48,916][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000111603_1828503552.pth [2024-06-28 00:45:49,894][06909] Updated weights for policy 0, policy_version 112253 (0.0022) [2024-06-28 00:45:51,519][06887] Signal inference workers to stop experience collection... (24750 times) [2024-06-28 00:45:51,563][06909] InferenceWorker_p0-w0: stopping experience collection (24750 times) [2024-06-28 00:45:51,573][06887] Signal inference workers to resume experience collection... (24750 times) [2024-06-28 00:45:51,584][06909] InferenceWorker_p0-w0: resuming experience collection (24750 times) [2024-06-28 00:45:53,521][06909] Updated weights for policy 0, policy_version 112263 (0.0041) [2024-06-28 00:45:53,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 1839333376. Throughput: 0: 44331.9. Samples: 1742297440. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2024-06-28 00:45:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:45:57,177][06909] Updated weights for policy 0, policy_version 112273 (0.0027) [2024-06-28 00:45:58,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 1839562752. Throughput: 0: 44405.1. Samples: 1742435840. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2024-06-28 00:45:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 00:46:00,874][06909] Updated weights for policy 0, policy_version 112283 (0.0029) [2024-06-28 00:46:03,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 1839759360. Throughput: 0: 44122.1. Samples: 1742690800. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2024-06-28 00:46:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:46:04,685][06909] Updated weights for policy 0, policy_version 112293 (0.0044) [2024-06-28 00:46:08,561][06909] Updated weights for policy 0, policy_version 112303 (0.0039) [2024-06-28 00:46:08,850][06674] Fps is (10 sec: 42598.8, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 1839988736. Throughput: 0: 44260.0. Samples: 1742960260. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2024-06-28 00:46:08,853][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:46:11,871][06909] Updated weights for policy 0, policy_version 112313 (0.0033) [2024-06-28 00:46:13,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 1840218112. Throughput: 0: 44246.6. Samples: 1743094260. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 00:46:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:46:15,805][06909] Updated weights for policy 0, policy_version 112323 (0.0029) [2024-06-28 00:46:18,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44241.3, 300 sec: 44209.0). Total num frames: 1840431104. Throughput: 0: 44197.3. Samples: 1743352340. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 00:46:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:46:19,485][06909] Updated weights for policy 0, policy_version 112333 (0.0034) [2024-06-28 00:46:23,276][06909] Updated weights for policy 0, policy_version 112343 (0.0040) [2024-06-28 00:46:23,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44509.9, 300 sec: 44209.0). Total num frames: 1840660480. Throughput: 0: 44264.4. Samples: 1743622240. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 00:46:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:46:26,901][06909] Updated weights for policy 0, policy_version 112353 (0.0026) [2024-06-28 00:46:28,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44236.8, 300 sec: 44264.6). Total num frames: 1840889856. Throughput: 0: 44165.3. Samples: 1743752640. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 00:46:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:46:30,444][06909] Updated weights for policy 0, policy_version 112363 (0.0031) [2024-06-28 00:46:33,850][06674] Fps is (10 sec: 40960.2, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1841070080. Throughput: 0: 44324.5. Samples: 1744022340. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 00:46:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:46:34,390][06909] Updated weights for policy 0, policy_version 112373 (0.0025) [2024-06-28 00:46:37,979][06909] Updated weights for policy 0, policy_version 112383 (0.0046) [2024-06-28 00:46:38,850][06674] Fps is (10 sec: 42598.0, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 1841315840. Throughput: 0: 44101.3. Samples: 1744282000. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 00:46:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:46:41,950][06909] Updated weights for policy 0, policy_version 112393 (0.0041) [2024-06-28 00:46:43,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 1841545216. Throughput: 0: 44010.4. Samples: 1744416300. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 00:46:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:46:45,231][06909] Updated weights for policy 0, policy_version 112403 (0.0027) [2024-06-28 00:46:48,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1841758208. Throughput: 0: 44399.5. Samples: 1744688780. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 00:46:48,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:46:48,964][06909] Updated weights for policy 0, policy_version 112413 (0.0024) [2024-06-28 00:46:52,795][06909] Updated weights for policy 0, policy_version 112423 (0.0034) [2024-06-28 00:46:53,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1841987584. Throughput: 0: 44212.9. Samples: 1744949840. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 00:46:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:46:56,434][06909] Updated weights for policy 0, policy_version 112433 (0.0043) [2024-06-28 00:46:58,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44236.9, 300 sec: 44209.0). Total num frames: 1842216960. Throughput: 0: 44136.0. Samples: 1745080380. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 00:46:58,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-28 00:47:00,031][06909] Updated weights for policy 0, policy_version 112443 (0.0025) [2024-06-28 00:47:03,850][06674] Fps is (10 sec: 42597.8, 60 sec: 44236.6, 300 sec: 44097.9). Total num frames: 1842413568. Throughput: 0: 44407.8. Samples: 1745350700. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 00:47:03,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:47:04,064][06909] Updated weights for policy 0, policy_version 112453 (0.0034) [2024-06-28 00:47:07,295][06909] Updated weights for policy 0, policy_version 112463 (0.0041) [2024-06-28 00:47:08,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44509.8, 300 sec: 44209.0). Total num frames: 1842659328. Throughput: 0: 44179.5. Samples: 1745610320. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 00:47:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:47:11,341][06909] Updated weights for policy 0, policy_version 112473 (0.0032) [2024-06-28 00:47:13,850][06674] Fps is (10 sec: 45876.3, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 1842872320. Throughput: 0: 44292.0. Samples: 1745745780. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 00:47:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:47:15,001][06909] Updated weights for policy 0, policy_version 112483 (0.0034) [2024-06-28 00:47:18,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 1843068928. Throughput: 0: 44220.0. Samples: 1746012240. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 00:47:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:47:18,984][06909] Updated weights for policy 0, policy_version 112493 (0.0036) [2024-06-28 00:47:22,334][06909] Updated weights for policy 0, policy_version 112503 (0.0023) [2024-06-28 00:47:22,887][06887] Signal inference workers to stop experience collection... (24800 times) [2024-06-28 00:47:22,887][06887] Signal inference workers to resume experience collection... (24800 times) [2024-06-28 00:47:22,939][06909] InferenceWorker_p0-w0: stopping experience collection (24800 times) [2024-06-28 00:47:22,940][06909] InferenceWorker_p0-w0: resuming experience collection (24800 times) [2024-06-28 00:47:23,852][06674] Fps is (10 sec: 44227.3, 60 sec: 44235.3, 300 sec: 44208.7). Total num frames: 1843314688. Throughput: 0: 44176.2. Samples: 1746270020. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 00:47:23,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:47:26,218][06909] Updated weights for policy 0, policy_version 112513 (0.0029) [2024-06-28 00:47:28,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 1843527680. Throughput: 0: 44221.4. Samples: 1746406260. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 00:47:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:47:29,620][06909] Updated weights for policy 0, policy_version 112523 (0.0028) [2024-06-28 00:47:33,420][06909] Updated weights for policy 0, policy_version 112533 (0.0035) [2024-06-28 00:47:33,850][06674] Fps is (10 sec: 42607.2, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 1843740672. Throughput: 0: 44149.8. Samples: 1746675520. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 00:47:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:47:37,446][06909] Updated weights for policy 0, policy_version 112543 (0.0025) [2024-06-28 00:47:38,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 1843970048. Throughput: 0: 44072.1. Samples: 1746933080. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 00:47:38,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:47:41,117][06909] Updated weights for policy 0, policy_version 112553 (0.0030) [2024-06-28 00:47:43,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.7, 300 sec: 44209.0). Total num frames: 1844199424. Throughput: 0: 44192.9. Samples: 1747069060. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 00:47:43,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:47:44,689][06909] Updated weights for policy 0, policy_version 112563 (0.0027) [2024-06-28 00:47:48,395][06909] Updated weights for policy 0, policy_version 112573 (0.0034) [2024-06-28 00:47:48,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 1844412416. Throughput: 0: 44177.1. Samples: 1747338660. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 00:47:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:47:48,886][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000112575_1844428800.pth... [2024-06-28 00:47:48,945][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000111927_1833811968.pth [2024-06-28 00:47:52,126][06909] Updated weights for policy 0, policy_version 112583 (0.0031) [2024-06-28 00:47:53,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 1844625408. Throughput: 0: 44017.5. Samples: 1747591100. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 00:47:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:47:55,904][06909] Updated weights for policy 0, policy_version 112593 (0.0032) [2024-06-28 00:47:58,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 1844854784. Throughput: 0: 43992.8. Samples: 1747725460. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 00:47:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 00:47:59,580][06909] Updated weights for policy 0, policy_version 112603 (0.0030) [2024-06-28 00:48:03,251][06909] Updated weights for policy 0, policy_version 112613 (0.0031) [2024-06-28 00:48:03,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44510.0, 300 sec: 44153.5). Total num frames: 1845084160. Throughput: 0: 44125.3. Samples: 1747997880. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 00:48:03,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 00:48:07,215][06909] Updated weights for policy 0, policy_version 112623 (0.0039) [2024-06-28 00:48:08,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 1845297152. Throughput: 0: 44216.7. Samples: 1748259680. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 00:48:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 00:48:10,474][06909] Updated weights for policy 0, policy_version 112633 (0.0025) [2024-06-28 00:48:13,851][06674] Fps is (10 sec: 42591.7, 60 sec: 43962.5, 300 sec: 44097.7). Total num frames: 1845510144. Throughput: 0: 44141.5. Samples: 1748392700. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 00:48:13,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:48:14,758][06909] Updated weights for policy 0, policy_version 112643 (0.0031) [2024-06-28 00:48:17,948][06909] Updated weights for policy 0, policy_version 112653 (0.0036) [2024-06-28 00:48:18,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44782.9, 300 sec: 44209.0). Total num frames: 1845755904. Throughput: 0: 44164.0. Samples: 1748662900. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 00:48:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:48:22,059][06909] Updated weights for policy 0, policy_version 112663 (0.0034) [2024-06-28 00:48:23,850][06674] Fps is (10 sec: 44243.5, 60 sec: 43965.2, 300 sec: 44153.5). Total num frames: 1845952512. Throughput: 0: 44224.3. Samples: 1748923180. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 00:48:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:48:25,328][06909] Updated weights for policy 0, policy_version 112673 (0.0035) [2024-06-28 00:48:28,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 1846181888. Throughput: 0: 44168.9. Samples: 1749056660. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 00:48:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:48:29,166][06909] Updated weights for policy 0, policy_version 112683 (0.0021) [2024-06-28 00:48:32,536][06909] Updated weights for policy 0, policy_version 112693 (0.0028) [2024-06-28 00:48:33,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44509.8, 300 sec: 44209.0). Total num frames: 1846411264. Throughput: 0: 44125.6. Samples: 1749324320. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 00:48:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:48:36,864][06909] Updated weights for policy 0, policy_version 112703 (0.0036) [2024-06-28 00:48:38,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 1846607872. Throughput: 0: 44405.7. Samples: 1749589360. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 00:48:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:48:40,185][06909] Updated weights for policy 0, policy_version 112713 (0.0028) [2024-06-28 00:48:43,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 1846837248. Throughput: 0: 44373.4. Samples: 1749722260. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 00:48:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 00:48:44,069][06909] Updated weights for policy 0, policy_version 112723 (0.0028) [2024-06-28 00:48:46,431][06887] Signal inference workers to stop experience collection... (24850 times) [2024-06-28 00:48:46,434][06887] Signal inference workers to resume experience collection... (24850 times) [2024-06-28 00:48:46,471][06909] InferenceWorker_p0-w0: stopping experience collection (24850 times) [2024-06-28 00:48:46,471][06909] InferenceWorker_p0-w0: resuming experience collection (24850 times) [2024-06-28 00:48:47,358][06909] Updated weights for policy 0, policy_version 112733 (0.0032) [2024-06-28 00:48:48,850][06674] Fps is (10 sec: 47513.7, 60 sec: 44509.8, 300 sec: 44264.6). Total num frames: 1847083008. Throughput: 0: 44250.6. Samples: 1749989160. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 00:48:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:48:51,633][06909] Updated weights for policy 0, policy_version 112743 (0.0038) [2024-06-28 00:48:53,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44509.9, 300 sec: 44209.0). Total num frames: 1847296000. Throughput: 0: 44335.1. Samples: 1750254760. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 00:48:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:48:54,959][06909] Updated weights for policy 0, policy_version 112753 (0.0028) [2024-06-28 00:48:58,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 1847492608. Throughput: 0: 44248.3. Samples: 1750383800. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 00:48:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:48:58,954][06909] Updated weights for policy 0, policy_version 112763 (0.0041) [2024-06-28 00:49:02,207][06909] Updated weights for policy 0, policy_version 112773 (0.0045) [2024-06-28 00:49:03,850][06674] Fps is (10 sec: 44236.2, 60 sec: 44236.8, 300 sec: 44264.6). Total num frames: 1847738368. Throughput: 0: 44135.5. Samples: 1750649000. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 00:49:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:49:06,210][06909] Updated weights for policy 0, policy_version 112783 (0.0039) [2024-06-28 00:49:08,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 1847918592. Throughput: 0: 44295.2. Samples: 1750916460. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 00:49:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:49:09,781][06909] Updated weights for policy 0, policy_version 112793 (0.0033) [2024-06-28 00:49:13,591][06909] Updated weights for policy 0, policy_version 112803 (0.0027) [2024-06-28 00:49:13,850][06674] Fps is (10 sec: 42598.9, 60 sec: 44238.0, 300 sec: 44209.0). Total num frames: 1848164352. Throughput: 0: 44092.5. Samples: 1751040820. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 00:49:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:49:17,551][06909] Updated weights for policy 0, policy_version 112813 (0.0033) [2024-06-28 00:49:18,850][06674] Fps is (10 sec: 49152.0, 60 sec: 44236.8, 300 sec: 44264.6). Total num frames: 1848410112. Throughput: 0: 44149.9. Samples: 1751311060. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 00:49:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:49:21,377][06909] Updated weights for policy 0, policy_version 112823 (0.0028) [2024-06-28 00:49:23,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.8, 300 sec: 44098.9). Total num frames: 1848590336. Throughput: 0: 44120.5. Samples: 1751574780. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 00:49:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:49:24,914][06909] Updated weights for policy 0, policy_version 112833 (0.0031) [2024-06-28 00:49:28,724][06909] Updated weights for policy 0, policy_version 112843 (0.0032) [2024-06-28 00:49:28,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 1848819712. Throughput: 0: 43869.3. Samples: 1751696380. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 00:49:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:49:32,473][06909] Updated weights for policy 0, policy_version 112853 (0.0031) [2024-06-28 00:49:33,850][06674] Fps is (10 sec: 47513.8, 60 sec: 44236.9, 300 sec: 44209.3). Total num frames: 1849065472. Throughput: 0: 43977.0. Samples: 1751968120. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 00:49:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:49:36,123][06909] Updated weights for policy 0, policy_version 112863 (0.0028) [2024-06-28 00:49:38,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.9, 300 sec: 44042.4). Total num frames: 1849245696. Throughput: 0: 44130.7. Samples: 1752240640. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 00:49:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:49:39,627][06909] Updated weights for policy 0, policy_version 112873 (0.0033) [2024-06-28 00:49:43,338][06909] Updated weights for policy 0, policy_version 112883 (0.0030) [2024-06-28 00:49:43,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43963.7, 300 sec: 44153.8). Total num frames: 1849475072. Throughput: 0: 43963.1. Samples: 1752362140. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 00:49:43,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-28 00:49:46,943][06909] Updated weights for policy 0, policy_version 112893 (0.0036) [2024-06-28 00:49:48,850][06674] Fps is (10 sec: 49151.2, 60 sec: 44236.8, 300 sec: 44264.6). Total num frames: 1849737216. Throughput: 0: 44030.2. Samples: 1752630360. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 00:49:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:49:48,870][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000112899_1849737216.pth... [2024-06-28 00:49:48,929][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000112250_1839104000.pth [2024-06-28 00:49:50,861][06909] Updated weights for policy 0, policy_version 112903 (0.0036) [2024-06-28 00:49:53,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43417.6, 300 sec: 44042.4). Total num frames: 1849901056. Throughput: 0: 44032.0. Samples: 1752897900. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 00:49:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:49:54,874][06909] Updated weights for policy 0, policy_version 112913 (0.0037) [2024-06-28 00:49:58,605][06909] Updated weights for policy 0, policy_version 112923 (0.0040) [2024-06-28 00:49:58,850][06674] Fps is (10 sec: 39321.4, 60 sec: 43963.6, 300 sec: 44097.9). Total num frames: 1850130432. Throughput: 0: 43963.8. Samples: 1753019200. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 00:49:58,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:49:58,930][06887] Signal inference workers to stop experience collection... (24900 times) [2024-06-28 00:49:58,931][06887] Signal inference workers to resume experience collection... (24900 times) [2024-06-28 00:49:58,959][06909] InferenceWorker_p0-w0: stopping experience collection (24900 times) [2024-06-28 00:49:58,960][06909] InferenceWorker_p0-w0: resuming experience collection (24900 times) [2024-06-28 00:50:02,078][06909] Updated weights for policy 0, policy_version 112933 (0.0029) [2024-06-28 00:50:03,850][06674] Fps is (10 sec: 49151.8, 60 sec: 44236.8, 300 sec: 44264.6). Total num frames: 1850392576. Throughput: 0: 43951.9. Samples: 1753288900. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 00:50:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:50:05,919][06909] Updated weights for policy 0, policy_version 112943 (0.0023) [2024-06-28 00:50:08,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 1850572800. Throughput: 0: 44019.9. Samples: 1753555680. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 00:50:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:50:09,499][06909] Updated weights for policy 0, policy_version 112953 (0.0028) [2024-06-28 00:50:13,232][06909] Updated weights for policy 0, policy_version 112963 (0.0036) [2024-06-28 00:50:13,850][06674] Fps is (10 sec: 42598.0, 60 sec: 44236.7, 300 sec: 44209.9). Total num frames: 1850818560. Throughput: 0: 44155.4. Samples: 1753683380. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 00:50:13,852][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:50:16,780][06909] Updated weights for policy 0, policy_version 112973 (0.0033) [2024-06-28 00:50:18,850][06674] Fps is (10 sec: 49152.1, 60 sec: 44236.7, 300 sec: 44320.1). Total num frames: 1851064320. Throughput: 0: 44006.1. Samples: 1753948400. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 00:50:18,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:50:20,551][06909] Updated weights for policy 0, policy_version 112983 (0.0032) [2024-06-28 00:50:23,850][06674] Fps is (10 sec: 42599.1, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 1851244544. Throughput: 0: 43999.1. Samples: 1754220600. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 00:50:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:50:24,142][06909] Updated weights for policy 0, policy_version 112993 (0.0030) [2024-06-28 00:50:28,216][06909] Updated weights for policy 0, policy_version 113003 (0.0038) [2024-06-28 00:50:28,850][06674] Fps is (10 sec: 40959.6, 60 sec: 44236.7, 300 sec: 44264.5). Total num frames: 1851473920. Throughput: 0: 43962.9. Samples: 1754340480. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 00:50:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:50:31,647][06909] Updated weights for policy 0, policy_version 113013 (0.0027) [2024-06-28 00:50:33,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43690.6, 300 sec: 44209.0). Total num frames: 1851686912. Throughput: 0: 43910.7. Samples: 1754606340. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 00:50:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:50:35,870][06909] Updated weights for policy 0, policy_version 113023 (0.0046) [2024-06-28 00:50:38,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44509.6, 300 sec: 44153.4). Total num frames: 1851916288. Throughput: 0: 43987.3. Samples: 1754877340. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 00:50:38,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:50:39,288][06909] Updated weights for policy 0, policy_version 113033 (0.0029) [2024-06-28 00:50:43,046][06909] Updated weights for policy 0, policy_version 113043 (0.0026) [2024-06-28 00:50:43,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1852129280. Throughput: 0: 44046.4. Samples: 1755001280. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 00:50:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:50:46,707][06909] Updated weights for policy 0, policy_version 113053 (0.0029) [2024-06-28 00:50:48,850][06674] Fps is (10 sec: 44238.2, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 1852358656. Throughput: 0: 43948.5. Samples: 1755266580. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 00:50:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:50:50,551][06909] Updated weights for policy 0, policy_version 113063 (0.0028) [2024-06-28 00:50:53,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44509.8, 300 sec: 44098.0). Total num frames: 1852571648. Throughput: 0: 44003.6. Samples: 1755535840. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 00:50:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:50:54,157][06909] Updated weights for policy 0, policy_version 113073 (0.0026) [2024-06-28 00:50:58,079][06909] Updated weights for policy 0, policy_version 113083 (0.0038) [2024-06-28 00:50:58,850][06674] Fps is (10 sec: 42598.0, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1852784640. Throughput: 0: 43933.4. Samples: 1755660380. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 00:50:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:51:01,266][06909] Updated weights for policy 0, policy_version 113093 (0.0020) [2024-06-28 00:51:03,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43417.6, 300 sec: 44098.0). Total num frames: 1852997632. Throughput: 0: 44065.4. Samples: 1755931340. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 00:51:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:51:05,315][06909] Updated weights for policy 0, policy_version 113103 (0.0031) [2024-06-28 00:51:08,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 1853227008. Throughput: 0: 43884.0. Samples: 1756195380. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 00:51:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:51:08,904][06909] Updated weights for policy 0, policy_version 113113 (0.0036) [2024-06-28 00:51:12,808][06909] Updated weights for policy 0, policy_version 113123 (0.0032) [2024-06-28 00:51:13,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 1853456384. Throughput: 0: 44147.7. Samples: 1756327120. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 00:51:13,850][06674] Avg episode reward: [(0, '0.488')] [2024-06-28 00:51:15,271][06887] Signal inference workers to stop experience collection... (24950 times) [2024-06-28 00:51:15,273][06887] Signal inference workers to resume experience collection... (24950 times) [2024-06-28 00:51:15,287][06909] InferenceWorker_p0-w0: stopping experience collection (24950 times) [2024-06-28 00:51:15,287][06909] InferenceWorker_p0-w0: resuming experience collection (24950 times) [2024-06-28 00:51:16,446][06909] Updated weights for policy 0, policy_version 113133 (0.0032) [2024-06-28 00:51:18,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 1853685760. Throughput: 0: 44097.3. Samples: 1756590720. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 00:51:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:51:20,003][06909] Updated weights for policy 0, policy_version 113143 (0.0030) [2024-06-28 00:51:23,806][06909] Updated weights for policy 0, policy_version 113153 (0.0038) [2024-06-28 00:51:23,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 1853898752. Throughput: 0: 43950.5. Samples: 1756855100. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 00:51:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:51:27,359][06909] Updated weights for policy 0, policy_version 113163 (0.0039) [2024-06-28 00:51:28,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 1854095360. Throughput: 0: 44060.3. Samples: 1756984000. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 00:51:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:51:31,255][06909] Updated weights for policy 0, policy_version 113173 (0.0026) [2024-06-28 00:51:33,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1854341120. Throughput: 0: 44156.8. Samples: 1757253640. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 00:51:33,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:51:35,021][06909] Updated weights for policy 0, policy_version 113183 (0.0037) [2024-06-28 00:51:38,467][06909] Updated weights for policy 0, policy_version 113193 (0.0035) [2024-06-28 00:51:38,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.9, 300 sec: 44097.9). Total num frames: 1854554112. Throughput: 0: 44045.8. Samples: 1757517900. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 00:51:38,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:51:42,410][06909] Updated weights for policy 0, policy_version 113203 (0.0028) [2024-06-28 00:51:43,850][06674] Fps is (10 sec: 40960.6, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1854750720. Throughput: 0: 44223.3. Samples: 1757650420. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 00:51:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 00:51:45,763][06909] Updated weights for policy 0, policy_version 113213 (0.0026) [2024-06-28 00:51:48,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.6, 300 sec: 44097.9). Total num frames: 1854996480. Throughput: 0: 44083.4. Samples: 1757915100. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 00:51:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 00:51:48,864][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000113220_1854996480.pth... [2024-06-28 00:51:48,914][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000112575_1844428800.pth [2024-06-28 00:51:49,923][06909] Updated weights for policy 0, policy_version 113223 (0.0028) [2024-06-28 00:51:53,500][06909] Updated weights for policy 0, policy_version 113233 (0.0024) [2024-06-28 00:51:53,852][06674] Fps is (10 sec: 49141.5, 60 sec: 44508.4, 300 sec: 44153.2). Total num frames: 1855242240. Throughput: 0: 44154.4. Samples: 1758182420. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 00:51:53,852][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:51:57,151][06909] Updated weights for policy 0, policy_version 113243 (0.0024) [2024-06-28 00:51:58,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1855406080. Throughput: 0: 44214.7. Samples: 1758316780. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 00:51:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:52:00,783][06909] Updated weights for policy 0, policy_version 113253 (0.0028) [2024-06-28 00:52:03,850][06674] Fps is (10 sec: 40968.6, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1855651840. Throughput: 0: 44085.5. Samples: 1758574560. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 00:52:03,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 00:52:04,497][06909] Updated weights for policy 0, policy_version 113263 (0.0041) [2024-06-28 00:52:08,520][06909] Updated weights for policy 0, policy_version 113273 (0.0033) [2024-06-28 00:52:08,850][06674] Fps is (10 sec: 49152.1, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 1855897600. Throughput: 0: 44175.1. Samples: 1758842980. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 00:52:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:52:12,030][06909] Updated weights for policy 0, policy_version 113283 (0.0040) [2024-06-28 00:52:13,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 1856094208. Throughput: 0: 44308.9. Samples: 1758977900. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 00:52:13,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:52:15,821][06909] Updated weights for policy 0, policy_version 113293 (0.0032) [2024-06-28 00:52:18,852][06674] Fps is (10 sec: 40951.6, 60 sec: 43689.2, 300 sec: 44042.4). Total num frames: 1856307200. Throughput: 0: 44074.5. Samples: 1759237080. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 00:52:18,852][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:52:19,592][06909] Updated weights for policy 0, policy_version 113303 (0.0032) [2024-06-28 00:52:23,019][06909] Updated weights for policy 0, policy_version 113313 (0.0042) [2024-06-28 00:52:23,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 1856536576. Throughput: 0: 44022.8. Samples: 1759498920. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 00:52:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:52:23,858][06887] Signal inference workers to stop experience collection... (25000 times) [2024-06-28 00:52:23,888][06909] InferenceWorker_p0-w0: stopping experience collection (25000 times) [2024-06-28 00:52:23,919][06887] Signal inference workers to resume experience collection... (25000 times) [2024-06-28 00:52:23,922][06909] InferenceWorker_p0-w0: resuming experience collection (25000 times) [2024-06-28 00:52:27,166][06909] Updated weights for policy 0, policy_version 113323 (0.0026) [2024-06-28 00:52:28,850][06674] Fps is (10 sec: 44245.1, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 1856749568. Throughput: 0: 44067.8. Samples: 1759633480. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 00:52:28,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:52:30,695][06909] Updated weights for policy 0, policy_version 113333 (0.0028) [2024-06-28 00:52:33,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1856962560. Throughput: 0: 44033.5. Samples: 1759896600. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 00:52:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:52:34,583][06909] Updated weights for policy 0, policy_version 113343 (0.0038) [2024-06-28 00:52:38,243][06909] Updated weights for policy 0, policy_version 113353 (0.0034) [2024-06-28 00:52:38,850][06674] Fps is (10 sec: 45875.7, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1857208320. Throughput: 0: 43809.1. Samples: 1760153740. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 00:52:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:52:42,277][06909] Updated weights for policy 0, policy_version 113363 (0.0033) [2024-06-28 00:52:43,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1857404928. Throughput: 0: 43878.3. Samples: 1760291300. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 00:52:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:52:45,655][06909] Updated weights for policy 0, policy_version 113373 (0.0035) [2024-06-28 00:52:48,852][06674] Fps is (10 sec: 42589.9, 60 sec: 43962.3, 300 sec: 44097.6). Total num frames: 1857634304. Throughput: 0: 43898.8. Samples: 1760550100. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 00:52:48,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:52:49,508][06909] Updated weights for policy 0, policy_version 113383 (0.0034) [2024-06-28 00:52:52,867][06909] Updated weights for policy 0, policy_version 113393 (0.0022) [2024-06-28 00:52:53,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43692.2, 300 sec: 44098.0). Total num frames: 1857863680. Throughput: 0: 43916.5. Samples: 1760819220. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 00:52:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:52:57,265][06909] Updated weights for policy 0, policy_version 113403 (0.0027) [2024-06-28 00:52:58,850][06674] Fps is (10 sec: 42606.6, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 1858060288. Throughput: 0: 43919.5. Samples: 1760954280. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 00:52:58,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:53:00,772][06909] Updated weights for policy 0, policy_version 113413 (0.0037) [2024-06-28 00:53:03,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 1858273280. Throughput: 0: 43782.4. Samples: 1761207200. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 00:53:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:53:04,808][06909] Updated weights for policy 0, policy_version 113423 (0.0031) [2024-06-28 00:53:08,116][06909] Updated weights for policy 0, policy_version 113433 (0.0031) [2024-06-28 00:53:08,850][06674] Fps is (10 sec: 49152.4, 60 sec: 44236.8, 300 sec: 44209.3). Total num frames: 1858551808. Throughput: 0: 43814.6. Samples: 1761470580. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 00:53:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 00:53:12,544][06909] Updated weights for policy 0, policy_version 113443 (0.0025) [2024-06-28 00:53:13,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 1858715648. Throughput: 0: 43971.2. Samples: 1761612180. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 00:53:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:53:15,457][06909] Updated weights for policy 0, policy_version 113453 (0.0042) [2024-06-28 00:53:18,850][06674] Fps is (10 sec: 39321.9, 60 sec: 43965.3, 300 sec: 44042.4). Total num frames: 1858945024. Throughput: 0: 43777.3. Samples: 1761866580. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 00:53:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:53:19,772][06909] Updated weights for policy 0, policy_version 113463 (0.0026) [2024-06-28 00:53:22,976][06909] Updated weights for policy 0, policy_version 113473 (0.0032) [2024-06-28 00:53:23,850][06674] Fps is (10 sec: 47513.2, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 1859190784. Throughput: 0: 44020.4. Samples: 1762134660. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 00:53:23,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:53:27,013][06909] Updated weights for policy 0, policy_version 113483 (0.0034) [2024-06-28 00:53:28,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1859387392. Throughput: 0: 43995.8. Samples: 1762271120. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 00:53:28,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:53:30,248][06909] Updated weights for policy 0, policy_version 113493 (0.0040) [2024-06-28 00:53:33,851][06674] Fps is (10 sec: 40957.1, 60 sec: 43963.1, 300 sec: 44042.3). Total num frames: 1859600384. Throughput: 0: 43991.9. Samples: 1762529680. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 00:53:33,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:53:34,660][06909] Updated weights for policy 0, policy_version 113503 (0.0027) [2024-06-28 00:53:38,099][06909] Updated weights for policy 0, policy_version 113513 (0.0029) [2024-06-28 00:53:38,850][06674] Fps is (10 sec: 47512.4, 60 sec: 44236.6, 300 sec: 44153.4). Total num frames: 1859862528. Throughput: 0: 43840.5. Samples: 1762792060. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 00:53:38,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:53:40,195][06887] Signal inference workers to stop experience collection... (25050 times) [2024-06-28 00:53:40,195][06887] Signal inference workers to resume experience collection... (25050 times) [2024-06-28 00:53:40,217][06909] InferenceWorker_p0-w0: stopping experience collection (25050 times) [2024-06-28 00:53:40,218][06909] InferenceWorker_p0-w0: resuming experience collection (25050 times) [2024-06-28 00:53:42,319][06909] Updated weights for policy 0, policy_version 113523 (0.0032) [2024-06-28 00:53:43,850][06674] Fps is (10 sec: 45879.2, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1860059136. Throughput: 0: 43933.5. Samples: 1762931280. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 00:53:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 00:53:45,274][06909] Updated weights for policy 0, policy_version 113533 (0.0029) [2024-06-28 00:53:48,850][06674] Fps is (10 sec: 40961.5, 60 sec: 43965.2, 300 sec: 43986.9). Total num frames: 1860272128. Throughput: 0: 44076.5. Samples: 1763190640. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 00:53:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 00:53:48,859][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000113542_1860272128.pth... [2024-06-28 00:53:48,915][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000112899_1849737216.pth [2024-06-28 00:53:49,527][06909] Updated weights for policy 0, policy_version 113543 (0.0031) [2024-06-28 00:53:52,838][06909] Updated weights for policy 0, policy_version 113553 (0.0029) [2024-06-28 00:53:53,850][06674] Fps is (10 sec: 45874.1, 60 sec: 44236.6, 300 sec: 44153.4). Total num frames: 1860517888. Throughput: 0: 43929.2. Samples: 1763447400. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 00:53:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 00:53:56,782][06909] Updated weights for policy 0, policy_version 113563 (0.0030) [2024-06-28 00:53:58,852][06674] Fps is (10 sec: 42589.4, 60 sec: 43962.3, 300 sec: 43931.0). Total num frames: 1860698112. Throughput: 0: 43895.3. Samples: 1763587560. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 00:53:58,853][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:54:00,085][06909] Updated weights for policy 0, policy_version 113573 (0.0041) [2024-06-28 00:54:03,850][06674] Fps is (10 sec: 40961.1, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 1860927488. Throughput: 0: 44028.9. Samples: 1763847880. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 00:54:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:54:03,967][06909] Updated weights for policy 0, policy_version 113583 (0.0025) [2024-06-28 00:54:07,631][06909] Updated weights for policy 0, policy_version 113593 (0.0036) [2024-06-28 00:54:08,850][06674] Fps is (10 sec: 47523.3, 60 sec: 43690.6, 300 sec: 44097.9). Total num frames: 1861173248. Throughput: 0: 43932.5. Samples: 1764111620. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 00:54:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 00:54:11,703][06909] Updated weights for policy 0, policy_version 113603 (0.0035) [2024-06-28 00:54:13,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44509.9, 300 sec: 43986.9). Total num frames: 1861386240. Throughput: 0: 43993.0. Samples: 1764250800. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 00:54:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:54:15,371][06909] Updated weights for policy 0, policy_version 113613 (0.0020) [2024-06-28 00:54:18,852][06674] Fps is (10 sec: 40950.7, 60 sec: 43962.0, 300 sec: 44042.1). Total num frames: 1861582848. Throughput: 0: 44090.1. Samples: 1764513800. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 00:54:18,853][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 00:54:19,205][06909] Updated weights for policy 0, policy_version 113623 (0.0021) [2024-06-28 00:54:22,553][06909] Updated weights for policy 0, policy_version 113633 (0.0040) [2024-06-28 00:54:23,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 1861828608. Throughput: 0: 43900.0. Samples: 1764767540. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:54:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:54:26,430][06909] Updated weights for policy 0, policy_version 113643 (0.0032) [2024-06-28 00:54:28,850][06674] Fps is (10 sec: 45886.0, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 1862041600. Throughput: 0: 44088.9. Samples: 1764915280. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:54:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:54:29,787][06909] Updated weights for policy 0, policy_version 113653 (0.0029) [2024-06-28 00:54:33,579][06909] Updated weights for policy 0, policy_version 113663 (0.0029) [2024-06-28 00:54:33,850][06674] Fps is (10 sec: 42598.1, 60 sec: 44237.4, 300 sec: 44097.9). Total num frames: 1862254592. Throughput: 0: 44171.1. Samples: 1765178340. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:54:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:54:37,150][06909] Updated weights for policy 0, policy_version 113673 (0.0028) [2024-06-28 00:54:38,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43964.0, 300 sec: 44153.5). Total num frames: 1862500352. Throughput: 0: 44226.9. Samples: 1765437600. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:54:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:54:40,496][06887] Signal inference workers to stop experience collection... (25100 times) [2024-06-28 00:54:40,529][06909] InferenceWorker_p0-w0: stopping experience collection (25100 times) [2024-06-28 00:54:40,552][06887] Signal inference workers to resume experience collection... (25100 times) [2024-06-28 00:54:40,559][06909] InferenceWorker_p0-w0: resuming experience collection (25100 times) [2024-06-28 00:54:40,843][06909] Updated weights for policy 0, policy_version 113683 (0.0019) [2024-06-28 00:54:43,852][06674] Fps is (10 sec: 44228.0, 60 sec: 43962.2, 300 sec: 43931.0). Total num frames: 1862696960. Throughput: 0: 44251.2. Samples: 1765578860. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:54:43,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:54:44,822][06909] Updated weights for policy 0, policy_version 113693 (0.0023) [2024-06-28 00:54:48,732][06909] Updated weights for policy 0, policy_version 113703 (0.0044) [2024-06-28 00:54:48,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 1862909952. Throughput: 0: 44255.1. Samples: 1765839360. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:54:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:54:52,267][06909] Updated weights for policy 0, policy_version 113713 (0.0038) [2024-06-28 00:54:53,850][06674] Fps is (10 sec: 45884.4, 60 sec: 43963.9, 300 sec: 44153.5). Total num frames: 1863155712. Throughput: 0: 44054.7. Samples: 1766094080. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:54:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:54:56,050][06909] Updated weights for policy 0, policy_version 113723 (0.0031) [2024-06-28 00:54:58,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44238.4, 300 sec: 43931.3). Total num frames: 1863352320. Throughput: 0: 44080.9. Samples: 1766234440. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:54:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:54:59,359][06909] Updated weights for policy 0, policy_version 113733 (0.0027) [2024-06-28 00:55:03,271][06909] Updated weights for policy 0, policy_version 113743 (0.0039) [2024-06-28 00:55:03,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 1863565312. Throughput: 0: 44137.7. Samples: 1766499900. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:55:03,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 00:55:06,957][06909] Updated weights for policy 0, policy_version 113753 (0.0039) [2024-06-28 00:55:08,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1863811072. Throughput: 0: 44160.8. Samples: 1766754780. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:55:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 00:55:10,830][06909] Updated weights for policy 0, policy_version 113763 (0.0027) [2024-06-28 00:55:13,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 1864007680. Throughput: 0: 44053.7. Samples: 1766897700. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:55:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:55:14,370][06909] Updated weights for policy 0, policy_version 113773 (0.0035) [2024-06-28 00:55:18,049][06909] Updated weights for policy 0, policy_version 113783 (0.0037) [2024-06-28 00:55:18,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43965.4, 300 sec: 43986.9). Total num frames: 1864220672. Throughput: 0: 44027.1. Samples: 1767159560. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 00:55:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:55:22,078][06909] Updated weights for policy 0, policy_version 113793 (0.0039) [2024-06-28 00:55:23,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1864466432. Throughput: 0: 43971.1. Samples: 1767416300. Policy #0 lag: (min: 0.0, avg: 12.2, max: 23.0) [2024-06-28 00:55:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 00:55:25,760][06909] Updated weights for policy 0, policy_version 113803 (0.0030) [2024-06-28 00:55:28,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1864679424. Throughput: 0: 43986.5. Samples: 1767558160. Policy #0 lag: (min: 0.0, avg: 12.2, max: 23.0) [2024-06-28 00:55:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:55:29,713][06909] Updated weights for policy 0, policy_version 113813 (0.0033) [2024-06-28 00:55:33,199][06909] Updated weights for policy 0, policy_version 113823 (0.0034) [2024-06-28 00:55:33,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43690.7, 300 sec: 43931.4). Total num frames: 1864876032. Throughput: 0: 43908.0. Samples: 1767815220. Policy #0 lag: (min: 0.0, avg: 12.2, max: 23.0) [2024-06-28 00:55:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:55:36,925][06909] Updated weights for policy 0, policy_version 113833 (0.0032) [2024-06-28 00:55:38,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1865121792. Throughput: 0: 44095.2. Samples: 1768078360. Policy #0 lag: (min: 0.0, avg: 12.2, max: 23.0) [2024-06-28 00:55:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:55:40,363][06909] Updated weights for policy 0, policy_version 113843 (0.0028) [2024-06-28 00:55:42,677][06887] Signal inference workers to stop experience collection... (25150 times) [2024-06-28 00:55:42,677][06887] Signal inference workers to resume experience collection... (25150 times) [2024-06-28 00:55:42,692][06909] InferenceWorker_p0-w0: stopping experience collection (25150 times) [2024-06-28 00:55:42,720][06909] InferenceWorker_p0-w0: resuming experience collection (25150 times) [2024-06-28 00:55:43,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43965.2, 300 sec: 43986.9). Total num frames: 1865334784. Throughput: 0: 44200.4. Samples: 1768223460. Policy #0 lag: (min: 0.0, avg: 12.2, max: 23.0) [2024-06-28 00:55:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 00:55:44,307][06909] Updated weights for policy 0, policy_version 113853 (0.0038) [2024-06-28 00:55:47,458][06909] Updated weights for policy 0, policy_version 113863 (0.0025) [2024-06-28 00:55:48,850][06674] Fps is (10 sec: 42597.7, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 1865547776. Throughput: 0: 44039.0. Samples: 1768481660. Policy #0 lag: (min: 0.0, avg: 12.2, max: 23.0) [2024-06-28 00:55:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:55:48,936][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000113865_1865564160.pth... [2024-06-28 00:55:48,986][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000113220_1854996480.pth [2024-06-28 00:55:51,655][06909] Updated weights for policy 0, policy_version 113873 (0.0045) [2024-06-28 00:55:53,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1865777152. Throughput: 0: 44220.5. Samples: 1768744700. Policy #0 lag: (min: 0.0, avg: 12.2, max: 23.0) [2024-06-28 00:55:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:55:54,913][06909] Updated weights for policy 0, policy_version 113883 (0.0032) [2024-06-28 00:55:58,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1865990144. Throughput: 0: 44072.4. Samples: 1768880960. Policy #0 lag: (min: 0.0, avg: 12.2, max: 23.0) [2024-06-28 00:55:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:55:59,131][06909] Updated weights for policy 0, policy_version 113893 (0.0028) [2024-06-28 00:56:02,981][06909] Updated weights for policy 0, policy_version 113903 (0.0029) [2024-06-28 00:56:03,850][06674] Fps is (10 sec: 44236.2, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1866219520. Throughput: 0: 44030.6. Samples: 1769140940. Policy #0 lag: (min: 0.0, avg: 12.2, max: 23.0) [2024-06-28 00:56:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:56:06,469][06909] Updated weights for policy 0, policy_version 113913 (0.0023) [2024-06-28 00:56:08,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1866448896. Throughput: 0: 44384.5. Samples: 1769413600. Policy #0 lag: (min: 0.0, avg: 12.2, max: 23.0) [2024-06-28 00:56:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:56:10,105][06909] Updated weights for policy 0, policy_version 113923 (0.0029) [2024-06-28 00:56:13,850][06674] Fps is (10 sec: 44237.4, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1866661888. Throughput: 0: 44190.2. Samples: 1769546720. Policy #0 lag: (min: 0.0, avg: 12.2, max: 23.0) [2024-06-28 00:56:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:56:13,891][06909] Updated weights for policy 0, policy_version 113933 (0.0027) [2024-06-28 00:56:17,260][06909] Updated weights for policy 0, policy_version 113943 (0.0035) [2024-06-28 00:56:18,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1866874880. Throughput: 0: 44283.5. Samples: 1769807980. Policy #0 lag: (min: 0.0, avg: 12.2, max: 23.0) [2024-06-28 00:56:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:56:21,110][06909] Updated weights for policy 0, policy_version 113953 (0.0033) [2024-06-28 00:56:23,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 1867104256. Throughput: 0: 44532.0. Samples: 1770082300. Policy #0 lag: (min: 0.0, avg: 12.2, max: 23.0) [2024-06-28 00:56:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:56:24,619][06909] Updated weights for policy 0, policy_version 113963 (0.0036) [2024-06-28 00:56:28,480][06909] Updated weights for policy 0, policy_version 113973 (0.0038) [2024-06-28 00:56:28,850][06674] Fps is (10 sec: 47513.3, 60 sec: 44509.8, 300 sec: 44098.0). Total num frames: 1867350016. Throughput: 0: 44340.4. Samples: 1770218780. Policy #0 lag: (min: 0.0, avg: 11.7, max: 24.0) [2024-06-28 00:56:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:56:32,126][06909] Updated weights for policy 0, policy_version 113983 (0.0031) [2024-06-28 00:56:33,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 1867546624. Throughput: 0: 44155.2. Samples: 1770468640. Policy #0 lag: (min: 0.0, avg: 11.7, max: 24.0) [2024-06-28 00:56:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:56:35,759][06887] Signal inference workers to stop experience collection... (25200 times) [2024-06-28 00:56:35,760][06887] Signal inference workers to resume experience collection... (25200 times) [2024-06-28 00:56:35,780][06909] InferenceWorker_p0-w0: stopping experience collection (25200 times) [2024-06-28 00:56:35,780][06909] InferenceWorker_p0-w0: resuming experience collection (25200 times) [2024-06-28 00:56:36,063][06909] Updated weights for policy 0, policy_version 113993 (0.0030) [2024-06-28 00:56:38,856][06674] Fps is (10 sec: 42573.0, 60 sec: 44232.3, 300 sec: 44152.6). Total num frames: 1867776000. Throughput: 0: 44381.1. Samples: 1770742120. Policy #0 lag: (min: 0.0, avg: 11.7, max: 24.0) [2024-06-28 00:56:38,857][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 00:56:40,129][06909] Updated weights for policy 0, policy_version 114003 (0.0024) [2024-06-28 00:56:43,737][06909] Updated weights for policy 0, policy_version 114013 (0.0034) [2024-06-28 00:56:43,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1867988992. Throughput: 0: 44212.9. Samples: 1770870540. Policy #0 lag: (min: 0.0, avg: 11.7, max: 24.0) [2024-06-28 00:56:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:56:47,341][06909] Updated weights for policy 0, policy_version 114023 (0.0029) [2024-06-28 00:56:48,850][06674] Fps is (10 sec: 42624.0, 60 sec: 44236.9, 300 sec: 43931.6). Total num frames: 1868201984. Throughput: 0: 44260.5. Samples: 1771132660. Policy #0 lag: (min: 0.0, avg: 11.7, max: 24.0) [2024-06-28 00:56:48,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-28 00:56:51,040][06909] Updated weights for policy 0, policy_version 114033 (0.0033) [2024-06-28 00:56:53,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 1868431360. Throughput: 0: 44011.0. Samples: 1771394100. Policy #0 lag: (min: 0.0, avg: 11.7, max: 24.0) [2024-06-28 00:56:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:56:54,638][06909] Updated weights for policy 0, policy_version 114043 (0.0041) [2024-06-28 00:56:58,418][06909] Updated weights for policy 0, policy_version 114053 (0.0029) [2024-06-28 00:56:58,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44509.9, 300 sec: 44097.9). Total num frames: 1868660736. Throughput: 0: 44044.9. Samples: 1771528740. Policy #0 lag: (min: 0.0, avg: 11.7, max: 24.0) [2024-06-28 00:56:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:57:01,964][06909] Updated weights for policy 0, policy_version 114063 (0.0036) [2024-06-28 00:57:03,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 1868857344. Throughput: 0: 44091.9. Samples: 1771792120. Policy #0 lag: (min: 0.0, avg: 11.7, max: 24.0) [2024-06-28 00:57:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 00:57:05,886][06909] Updated weights for policy 0, policy_version 114073 (0.0036) [2024-06-28 00:57:08,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1869086720. Throughput: 0: 43894.1. Samples: 1772057540. Policy #0 lag: (min: 0.0, avg: 11.7, max: 24.0) [2024-06-28 00:57:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:57:09,696][06909] Updated weights for policy 0, policy_version 114083 (0.0036) [2024-06-28 00:57:13,189][06909] Updated weights for policy 0, policy_version 114093 (0.0027) [2024-06-28 00:57:13,850][06674] Fps is (10 sec: 45875.9, 60 sec: 44236.8, 300 sec: 44098.3). Total num frames: 1869316096. Throughput: 0: 43701.9. Samples: 1772185360. Policy #0 lag: (min: 0.0, avg: 11.7, max: 24.0) [2024-06-28 00:57:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:57:17,119][06909] Updated weights for policy 0, policy_version 114103 (0.0033) [2024-06-28 00:57:18,850][06674] Fps is (10 sec: 44237.3, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1869529088. Throughput: 0: 44027.2. Samples: 1772449860. Policy #0 lag: (min: 0.0, avg: 11.7, max: 24.0) [2024-06-28 00:57:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 00:57:21,041][06909] Updated weights for policy 0, policy_version 114113 (0.0039) [2024-06-28 00:57:23,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1869742080. Throughput: 0: 43844.0. Samples: 1772714840. Policy #0 lag: (min: 0.0, avg: 11.7, max: 24.0) [2024-06-28 00:57:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 00:57:24,609][06909] Updated weights for policy 0, policy_version 114123 (0.0022) [2024-06-28 00:57:28,221][06909] Updated weights for policy 0, policy_version 114133 (0.0028) [2024-06-28 00:57:28,850][06674] Fps is (10 sec: 44235.8, 60 sec: 43690.6, 300 sec: 44097.9). Total num frames: 1869971456. Throughput: 0: 43902.9. Samples: 1772846180. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 00:57:28,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:57:32,063][06909] Updated weights for policy 0, policy_version 114143 (0.0025) [2024-06-28 00:57:33,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1870184448. Throughput: 0: 43908.0. Samples: 1773108520. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 00:57:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:57:35,516][06909] Updated weights for policy 0, policy_version 114153 (0.0023) [2024-06-28 00:57:38,850][06674] Fps is (10 sec: 42599.2, 60 sec: 43695.1, 300 sec: 44042.4). Total num frames: 1870397440. Throughput: 0: 44083.2. Samples: 1773377840. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 00:57:38,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:57:39,277][06909] Updated weights for policy 0, policy_version 114163 (0.0049) [2024-06-28 00:57:43,053][06909] Updated weights for policy 0, policy_version 114173 (0.0027) [2024-06-28 00:57:43,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.8, 300 sec: 44042.7). Total num frames: 1870626816. Throughput: 0: 43982.3. Samples: 1773507940. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 00:57:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:57:45,395][06887] Signal inference workers to stop experience collection... (25250 times) [2024-06-28 00:57:45,420][06909] InferenceWorker_p0-w0: stopping experience collection (25250 times) [2024-06-28 00:57:45,505][06887] Signal inference workers to resume experience collection... (25250 times) [2024-06-28 00:57:45,505][06909] InferenceWorker_p0-w0: resuming experience collection (25250 times) [2024-06-28 00:57:47,272][06909] Updated weights for policy 0, policy_version 114183 (0.0032) [2024-06-28 00:57:48,850][06674] Fps is (10 sec: 44236.0, 60 sec: 43963.6, 300 sec: 43986.8). Total num frames: 1870839808. Throughput: 0: 44043.9. Samples: 1773774100. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 00:57:48,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:57:48,909][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000114188_1870856192.pth... [2024-06-28 00:57:48,960][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000113542_1860272128.pth [2024-06-28 00:57:50,648][06909] Updated weights for policy 0, policy_version 114193 (0.0031) [2024-06-28 00:57:53,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1871085568. Throughput: 0: 43925.8. Samples: 1774034200. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 00:57:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:57:54,538][06909] Updated weights for policy 0, policy_version 114203 (0.0033) [2024-06-28 00:57:58,115][06909] Updated weights for policy 0, policy_version 114213 (0.0039) [2024-06-28 00:57:58,850][06674] Fps is (10 sec: 45876.0, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 1871298560. Throughput: 0: 44094.2. Samples: 1774169600. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 00:57:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 00:58:01,788][06909] Updated weights for policy 0, policy_version 114223 (0.0034) [2024-06-28 00:58:03,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 1871495168. Throughput: 0: 44108.7. Samples: 1774434760. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 00:58:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:58:05,328][06909] Updated weights for policy 0, policy_version 114233 (0.0035) [2024-06-28 00:58:08,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1871740928. Throughput: 0: 44092.9. Samples: 1774699020. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 00:58:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:58:09,431][06909] Updated weights for policy 0, policy_version 114243 (0.0031) [2024-06-28 00:58:12,615][06909] Updated weights for policy 0, policy_version 114253 (0.0039) [2024-06-28 00:58:13,850][06674] Fps is (10 sec: 47513.9, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 1871970304. Throughput: 0: 44150.8. Samples: 1774832960. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 00:58:13,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 00:58:16,641][06909] Updated weights for policy 0, policy_version 114263 (0.0038) [2024-06-28 00:58:18,853][06674] Fps is (10 sec: 42584.8, 60 sec: 43961.3, 300 sec: 43986.4). Total num frames: 1872166912. Throughput: 0: 44104.0. Samples: 1775093340. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 00:58:18,853][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 00:58:20,161][06909] Updated weights for policy 0, policy_version 114273 (0.0036) [2024-06-28 00:58:23,850][06674] Fps is (10 sec: 40960.6, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1872379904. Throughput: 0: 44013.4. Samples: 1775358440. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 00:58:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:58:24,271][06909] Updated weights for policy 0, policy_version 114283 (0.0044) [2024-06-28 00:58:27,445][06909] Updated weights for policy 0, policy_version 114293 (0.0037) [2024-06-28 00:58:28,850][06674] Fps is (10 sec: 45890.1, 60 sec: 44237.0, 300 sec: 44153.6). Total num frames: 1872625664. Throughput: 0: 44114.2. Samples: 1775493080. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 00:58:28,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 00:58:31,631][06909] Updated weights for policy 0, policy_version 114303 (0.0038) [2024-06-28 00:58:33,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1872838656. Throughput: 0: 44035.2. Samples: 1775755680. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 00:58:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 00:58:35,169][06909] Updated weights for policy 0, policy_version 114313 (0.0033) [2024-06-28 00:58:38,850][06674] Fps is (10 sec: 42597.9, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 1873051648. Throughput: 0: 44207.6. Samples: 1776023540. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 00:58:38,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 00:58:39,320][06909] Updated weights for policy 0, policy_version 114323 (0.0035) [2024-06-28 00:58:42,330][06909] Updated weights for policy 0, policy_version 114333 (0.0034) [2024-06-28 00:58:43,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 1873297408. Throughput: 0: 44107.1. Samples: 1776154420. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 00:58:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:58:46,599][06909] Updated weights for policy 0, policy_version 114343 (0.0040) [2024-06-28 00:58:48,856][06674] Fps is (10 sec: 44210.4, 60 sec: 44232.5, 300 sec: 43986.0). Total num frames: 1873494016. Throughput: 0: 44094.7. Samples: 1776419280. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 00:58:48,856][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:58:49,796][06909] Updated weights for policy 0, policy_version 114353 (0.0025) [2024-06-28 00:58:53,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43690.7, 300 sec: 44098.3). Total num frames: 1873707008. Throughput: 0: 44037.8. Samples: 1776680720. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 00:58:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 00:58:54,160][06909] Updated weights for policy 0, policy_version 114363 (0.0037) [2024-06-28 00:58:57,312][06909] Updated weights for policy 0, policy_version 114373 (0.0037) [2024-06-28 00:58:58,850][06674] Fps is (10 sec: 45901.1, 60 sec: 44236.5, 300 sec: 44153.4). Total num frames: 1873952768. Throughput: 0: 43988.6. Samples: 1776812460. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 00:58:58,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:59:01,503][06909] Updated weights for policy 0, policy_version 114383 (0.0033) [2024-06-28 00:59:03,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44510.0, 300 sec: 44042.4). Total num frames: 1874165760. Throughput: 0: 44288.1. Samples: 1777086160. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 00:59:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:59:04,439][06909] Updated weights for policy 0, policy_version 114393 (0.0041) [2024-06-28 00:59:08,852][06674] Fps is (10 sec: 40953.3, 60 sec: 43689.2, 300 sec: 43986.6). Total num frames: 1874362368. Throughput: 0: 44135.7. Samples: 1777344640. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 00:59:08,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:59:09,064][06909] Updated weights for policy 0, policy_version 114403 (0.0032) [2024-06-28 00:59:12,001][06909] Updated weights for policy 0, policy_version 114413 (0.0029) [2024-06-28 00:59:13,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 44153.8). Total num frames: 1874608128. Throughput: 0: 44103.1. Samples: 1777477720. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 00:59:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:59:14,602][06887] Signal inference workers to stop experience collection... (25300 times) [2024-06-28 00:59:14,644][06909] InferenceWorker_p0-w0: stopping experience collection (25300 times) [2024-06-28 00:59:14,653][06887] Signal inference workers to resume experience collection... (25300 times) [2024-06-28 00:59:14,657][06909] InferenceWorker_p0-w0: resuming experience collection (25300 times) [2024-06-28 00:59:16,385][06909] Updated weights for policy 0, policy_version 114423 (0.0032) [2024-06-28 00:59:18,850][06674] Fps is (10 sec: 47523.0, 60 sec: 44512.2, 300 sec: 44097.9). Total num frames: 1874837504. Throughput: 0: 44186.6. Samples: 1777744080. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 00:59:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:59:19,180][06909] Updated weights for policy 0, policy_version 114433 (0.0020) [2024-06-28 00:59:23,584][06909] Updated weights for policy 0, policy_version 114443 (0.0025) [2024-06-28 00:59:23,850][06674] Fps is (10 sec: 42598.2, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 1875034112. Throughput: 0: 44069.0. Samples: 1778006640. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 00:59:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 00:59:26,909][06909] Updated weights for policy 0, policy_version 114453 (0.0032) [2024-06-28 00:59:28,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 1875263488. Throughput: 0: 43965.3. Samples: 1778132860. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 00:59:28,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-28 00:59:31,229][06909] Updated weights for policy 0, policy_version 114463 (0.0024) [2024-06-28 00:59:33,850][06674] Fps is (10 sec: 47513.4, 60 sec: 44509.9, 300 sec: 44097.9). Total num frames: 1875509248. Throughput: 0: 44238.3. Samples: 1778409740. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 00:59:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 00:59:34,106][06909] Updated weights for policy 0, policy_version 114473 (0.0035) [2024-06-28 00:59:38,452][06909] Updated weights for policy 0, policy_version 114483 (0.0035) [2024-06-28 00:59:38,850][06674] Fps is (10 sec: 42597.6, 60 sec: 43963.7, 300 sec: 44042.7). Total num frames: 1875689472. Throughput: 0: 44332.3. Samples: 1778675680. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 00:59:38,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:59:41,631][06909] Updated weights for policy 0, policy_version 114493 (0.0037) [2024-06-28 00:59:43,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.6, 300 sec: 44097.9). Total num frames: 1875918848. Throughput: 0: 44027.9. Samples: 1778793700. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 00:59:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 00:59:46,181][06909] Updated weights for policy 0, policy_version 114503 (0.0031) [2024-06-28 00:59:48,850][06674] Fps is (10 sec: 47514.3, 60 sec: 44514.3, 300 sec: 44098.0). Total num frames: 1876164608. Throughput: 0: 44072.4. Samples: 1779069420. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 00:59:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 00:59:48,858][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000114512_1876164608.pth... [2024-06-28 00:59:48,906][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000113865_1865564160.pth [2024-06-28 00:59:49,073][06909] Updated weights for policy 0, policy_version 114513 (0.0036) [2024-06-28 00:59:53,412][06909] Updated weights for policy 0, policy_version 114523 (0.0040) [2024-06-28 00:59:53,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1876344832. Throughput: 0: 44242.0. Samples: 1779335440. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 00:59:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 00:59:56,238][06909] Updated weights for policy 0, policy_version 114533 (0.0030) [2024-06-28 00:59:58,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43690.9, 300 sec: 44098.0). Total num frames: 1876574208. Throughput: 0: 43982.1. Samples: 1779456920. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 00:59:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:00:00,951][06909] Updated weights for policy 0, policy_version 114543 (0.0037) [2024-06-28 01:00:03,850][06674] Fps is (10 sec: 47513.3, 60 sec: 44236.7, 300 sec: 44098.0). Total num frames: 1876819968. Throughput: 0: 44201.8. Samples: 1779733160. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 01:00:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:00:04,019][06909] Updated weights for policy 0, policy_version 114553 (0.0040) [2024-06-28 01:00:08,266][06909] Updated weights for policy 0, policy_version 114563 (0.0031) [2024-06-28 01:00:08,850][06674] Fps is (10 sec: 44236.0, 60 sec: 44238.1, 300 sec: 44097.9). Total num frames: 1877016576. Throughput: 0: 44193.5. Samples: 1779995360. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 01:00:08,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:00:11,210][06909] Updated weights for policy 0, policy_version 114573 (0.0027) [2024-06-28 01:00:13,856][06674] Fps is (10 sec: 42572.9, 60 sec: 43959.3, 300 sec: 44152.6). Total num frames: 1877245952. Throughput: 0: 44168.7. Samples: 1780120720. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 01:00:13,856][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:00:15,711][06909] Updated weights for policy 0, policy_version 114583 (0.0027) [2024-06-28 01:00:17,930][06887] Signal inference workers to stop experience collection... (25350 times) [2024-06-28 01:00:17,983][06909] InferenceWorker_p0-w0: stopping experience collection (25350 times) [2024-06-28 01:00:17,985][06887] Signal inference workers to resume experience collection... (25350 times) [2024-06-28 01:00:17,995][06909] InferenceWorker_p0-w0: resuming experience collection (25350 times) [2024-06-28 01:00:18,767][06909] Updated weights for policy 0, policy_version 114593 (0.0028) [2024-06-28 01:00:18,850][06674] Fps is (10 sec: 47514.9, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 1877491712. Throughput: 0: 44148.1. Samples: 1780396400. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 01:00:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:00:23,044][06909] Updated weights for policy 0, policy_version 114603 (0.0052) [2024-06-28 01:00:23,850][06674] Fps is (10 sec: 44263.2, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 1877688320. Throughput: 0: 44077.4. Samples: 1780659160. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 01:00:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:00:26,191][06909] Updated weights for policy 0, policy_version 114613 (0.0022) [2024-06-28 01:00:28,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 1877901312. Throughput: 0: 44266.8. Samples: 1780785700. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 01:00:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:00:30,409][06909] Updated weights for policy 0, policy_version 114623 (0.0024) [2024-06-28 01:00:33,763][06909] Updated weights for policy 0, policy_version 114633 (0.0033) [2024-06-28 01:00:33,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 1878147072. Throughput: 0: 44252.9. Samples: 1781060800. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-28 01:00:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:00:37,932][06909] Updated weights for policy 0, policy_version 114643 (0.0044) [2024-06-28 01:00:38,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 1878343680. Throughput: 0: 44138.7. Samples: 1781321680. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-28 01:00:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:00:41,405][06909] Updated weights for policy 0, policy_version 114653 (0.0031) [2024-06-28 01:00:43,851][06674] Fps is (10 sec: 42593.8, 60 sec: 44236.0, 300 sec: 44153.4). Total num frames: 1878573056. Throughput: 0: 44313.7. Samples: 1781451080. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-28 01:00:43,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:00:45,203][06909] Updated weights for policy 0, policy_version 114663 (0.0032) [2024-06-28 01:00:48,814][06909] Updated weights for policy 0, policy_version 114673 (0.0030) [2024-06-28 01:00:48,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 1878802432. Throughput: 0: 44207.6. Samples: 1781722500. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-28 01:00:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:00:52,757][06909] Updated weights for policy 0, policy_version 114683 (0.0028) [2024-06-28 01:00:53,850][06674] Fps is (10 sec: 44241.4, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 1879015424. Throughput: 0: 44113.1. Samples: 1781980440. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-28 01:00:53,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:00:55,963][06909] Updated weights for policy 0, policy_version 114693 (0.0025) [2024-06-28 01:00:58,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 1879228416. Throughput: 0: 44196.6. Samples: 1782109300. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-28 01:00:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:01:00,026][06909] Updated weights for policy 0, policy_version 114703 (0.0031) [2024-06-28 01:01:03,527][06909] Updated weights for policy 0, policy_version 114713 (0.0037) [2024-06-28 01:01:03,850][06674] Fps is (10 sec: 47513.8, 60 sec: 44509.9, 300 sec: 44209.0). Total num frames: 1879490560. Throughput: 0: 44116.4. Samples: 1782381640. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-28 01:01:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:01:07,284][06909] Updated weights for policy 0, policy_version 114723 (0.0035) [2024-06-28 01:01:08,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.9, 300 sec: 44042.4). Total num frames: 1879654400. Throughput: 0: 44272.1. Samples: 1782651400. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-28 01:01:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:01:11,122][06909] Updated weights for policy 0, policy_version 114733 (0.0031) [2024-06-28 01:01:13,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44514.3, 300 sec: 44209.0). Total num frames: 1879916544. Throughput: 0: 44314.6. Samples: 1782779860. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-28 01:01:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:01:14,741][06909] Updated weights for policy 0, policy_version 114743 (0.0039) [2024-06-28 01:01:18,349][06909] Updated weights for policy 0, policy_version 114753 (0.0022) [2024-06-28 01:01:18,850][06674] Fps is (10 sec: 49151.8, 60 sec: 44236.7, 300 sec: 44209.0). Total num frames: 1880145920. Throughput: 0: 44228.0. Samples: 1783051060. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-28 01:01:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:01:21,830][06909] Updated weights for policy 0, policy_version 114763 (0.0026) [2024-06-28 01:01:23,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1880326144. Throughput: 0: 44270.7. Samples: 1783313860. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-28 01:01:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:01:25,553][06909] Updated weights for policy 0, policy_version 114773 (0.0028) [2024-06-28 01:01:26,687][06887] Signal inference workers to stop experience collection... (25400 times) [2024-06-28 01:01:26,711][06909] InferenceWorker_p0-w0: stopping experience collection (25400 times) [2024-06-28 01:01:26,748][06887] Signal inference workers to resume experience collection... (25400 times) [2024-06-28 01:01:26,749][06909] InferenceWorker_p0-w0: resuming experience collection (25400 times) [2024-06-28 01:01:28,850][06674] Fps is (10 sec: 42598.7, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 1880571904. Throughput: 0: 44156.6. Samples: 1783438080. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-28 01:01:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:01:29,650][06909] Updated weights for policy 0, policy_version 114783 (0.0025) [2024-06-28 01:01:33,107][06909] Updated weights for policy 0, policy_version 114793 (0.0028) [2024-06-28 01:01:33,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44236.8, 300 sec: 44154.4). Total num frames: 1880801280. Throughput: 0: 44096.9. Samples: 1783706860. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 01:01:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:01:36,864][06909] Updated weights for policy 0, policy_version 114803 (0.0027) [2024-06-28 01:01:38,850][06674] Fps is (10 sec: 42596.9, 60 sec: 44236.6, 300 sec: 44097.9). Total num frames: 1880997888. Throughput: 0: 44468.2. Samples: 1783981520. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 01:01:38,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:01:40,403][06909] Updated weights for policy 0, policy_version 114813 (0.0031) [2024-06-28 01:01:43,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44510.7, 300 sec: 44209.0). Total num frames: 1881243648. Throughput: 0: 44367.2. Samples: 1784105820. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 01:01:43,850][06674] Avg episode reward: [(0, '0.401')] [2024-06-28 01:01:44,303][06909] Updated weights for policy 0, policy_version 114823 (0.0034) [2024-06-28 01:01:47,987][06909] Updated weights for policy 0, policy_version 114833 (0.0028) [2024-06-28 01:01:48,851][06674] Fps is (10 sec: 45870.3, 60 sec: 44235.8, 300 sec: 44153.3). Total num frames: 1881456640. Throughput: 0: 44194.7. Samples: 1784370460. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 01:01:48,852][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:01:48,870][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000114835_1881456640.pth... [2024-06-28 01:01:48,921][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000114188_1870856192.pth [2024-06-28 01:01:51,575][06909] Updated weights for policy 0, policy_version 114843 (0.0023) [2024-06-28 01:01:53,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1881653248. Throughput: 0: 44236.9. Samples: 1784642060. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 01:01:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:01:55,291][06909] Updated weights for policy 0, policy_version 114853 (0.0041) [2024-06-28 01:01:58,850][06674] Fps is (10 sec: 44242.7, 60 sec: 44509.8, 300 sec: 44209.0). Total num frames: 1881899008. Throughput: 0: 44119.5. Samples: 1784765240. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 01:01:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:01:59,396][06909] Updated weights for policy 0, policy_version 114863 (0.0036) [2024-06-28 01:02:02,731][06909] Updated weights for policy 0, policy_version 114873 (0.0023) [2024-06-28 01:02:03,852][06674] Fps is (10 sec: 47503.4, 60 sec: 43962.2, 300 sec: 44208.7). Total num frames: 1882128384. Throughput: 0: 43982.9. Samples: 1785030380. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 01:02:03,853][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 01:02:06,777][06909] Updated weights for policy 0, policy_version 114883 (0.0039) [2024-06-28 01:02:08,850][06674] Fps is (10 sec: 42598.7, 60 sec: 44509.9, 300 sec: 44097.9). Total num frames: 1882324992. Throughput: 0: 44096.9. Samples: 1785298220. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 01:02:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 01:02:10,353][06909] Updated weights for policy 0, policy_version 114893 (0.0045) [2024-06-28 01:02:13,850][06674] Fps is (10 sec: 42606.8, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 1882554368. Throughput: 0: 44239.0. Samples: 1785428840. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 01:02:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 01:02:13,983][06909] Updated weights for policy 0, policy_version 114903 (0.0031) [2024-06-28 01:02:17,561][06909] Updated weights for policy 0, policy_version 114913 (0.0036) [2024-06-28 01:02:18,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.8, 300 sec: 44209.0). Total num frames: 1882783744. Throughput: 0: 44101.3. Samples: 1785691420. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 01:02:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:02:21,538][06909] Updated weights for policy 0, policy_version 114923 (0.0039) [2024-06-28 01:02:23,850][06674] Fps is (10 sec: 42598.8, 60 sec: 44236.7, 300 sec: 44098.0). Total num frames: 1882980352. Throughput: 0: 44007.8. Samples: 1785961860. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 01:02:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:02:24,939][06909] Updated weights for policy 0, policy_version 114933 (0.0036) [2024-06-28 01:02:28,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 1883209728. Throughput: 0: 44088.9. Samples: 1786089820. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 01:02:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:02:28,883][06909] Updated weights for policy 0, policy_version 114943 (0.0034) [2024-06-28 01:02:32,360][06909] Updated weights for policy 0, policy_version 114953 (0.0041) [2024-06-28 01:02:33,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 1883439104. Throughput: 0: 43986.7. Samples: 1786349800. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 01:02:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:02:36,358][06909] Updated weights for policy 0, policy_version 114963 (0.0024) [2024-06-28 01:02:38,850][06674] Fps is (10 sec: 44236.2, 60 sec: 44237.0, 300 sec: 44153.5). Total num frames: 1883652096. Throughput: 0: 44067.0. Samples: 1786625080. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 01:02:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:02:40,047][06909] Updated weights for policy 0, policy_version 114973 (0.0033) [2024-06-28 01:02:43,558][06909] Updated weights for policy 0, policy_version 114983 (0.0034) [2024-06-28 01:02:43,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 1883881472. Throughput: 0: 44182.2. Samples: 1786753440. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 01:02:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 01:02:47,231][06909] Updated weights for policy 0, policy_version 114993 (0.0040) [2024-06-28 01:02:48,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43691.7, 300 sec: 44042.4). Total num frames: 1884078080. Throughput: 0: 44079.0. Samples: 1787013840. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 01:02:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:02:48,854][06887] Signal inference workers to stop experience collection... (25450 times) [2024-06-28 01:02:48,861][06887] Signal inference workers to resume experience collection... (25450 times) [2024-06-28 01:02:48,892][06909] InferenceWorker_p0-w0: stopping experience collection (25450 times) [2024-06-28 01:02:48,892][06909] InferenceWorker_p0-w0: resuming experience collection (25450 times) [2024-06-28 01:02:50,914][06909] Updated weights for policy 0, policy_version 115003 (0.0026) [2024-06-28 01:02:53,850][06674] Fps is (10 sec: 42598.7, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1884307456. Throughput: 0: 44101.8. Samples: 1787282800. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 01:02:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:02:54,671][06909] Updated weights for policy 0, policy_version 115013 (0.0037) [2024-06-28 01:02:58,463][06909] Updated weights for policy 0, policy_version 115023 (0.0029) [2024-06-28 01:02:58,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 1884536832. Throughput: 0: 44129.4. Samples: 1787414660. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 01:02:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 01:03:02,295][06909] Updated weights for policy 0, policy_version 115033 (0.0030) [2024-06-28 01:03:03,852][06674] Fps is (10 sec: 44227.5, 60 sec: 43690.7, 300 sec: 44097.6). Total num frames: 1884749824. Throughput: 0: 44255.7. Samples: 1787683020. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 01:03:03,852][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:03:06,059][06909] Updated weights for policy 0, policy_version 115043 (0.0023) [2024-06-28 01:03:08,850][06674] Fps is (10 sec: 44237.4, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 1884979200. Throughput: 0: 44101.0. Samples: 1787946400. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 01:03:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:03:09,594][06909] Updated weights for policy 0, policy_version 115053 (0.0040) [2024-06-28 01:03:13,753][06909] Updated weights for policy 0, policy_version 115063 (0.0031) [2024-06-28 01:03:13,850][06674] Fps is (10 sec: 44246.4, 60 sec: 43963.9, 300 sec: 44154.0). Total num frames: 1885192192. Throughput: 0: 44163.1. Samples: 1788077160. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 01:03:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:03:17,002][06909] Updated weights for policy 0, policy_version 115073 (0.0035) [2024-06-28 01:03:18,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 1885421568. Throughput: 0: 44313.3. Samples: 1788343900. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 01:03:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:03:21,028][06909] Updated weights for policy 0, policy_version 115083 (0.0037) [2024-06-28 01:03:23,852][06674] Fps is (10 sec: 45865.4, 60 sec: 44508.4, 300 sec: 44153.2). Total num frames: 1885650944. Throughput: 0: 43968.3. Samples: 1788603740. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 01:03:23,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:03:24,536][06909] Updated weights for policy 0, policy_version 115093 (0.0033) [2024-06-28 01:03:28,346][06909] Updated weights for policy 0, policy_version 115103 (0.0037) [2024-06-28 01:03:28,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1885863936. Throughput: 0: 44165.0. Samples: 1788740860. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 01:03:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:03:32,030][06909] Updated weights for policy 0, policy_version 115113 (0.0025) [2024-06-28 01:03:33,850][06674] Fps is (10 sec: 42607.3, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 1886076928. Throughput: 0: 44204.0. Samples: 1789003020. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 01:03:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:03:35,725][06909] Updated weights for policy 0, policy_version 115123 (0.0041) [2024-06-28 01:03:38,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 1886322688. Throughput: 0: 44096.8. Samples: 1789267160. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 01:03:38,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:03:39,710][06909] Updated weights for policy 0, policy_version 115133 (0.0039) [2024-06-28 01:03:43,089][06909] Updated weights for policy 0, policy_version 115143 (0.0034) [2024-06-28 01:03:43,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.7, 300 sec: 44154.4). Total num frames: 1886519296. Throughput: 0: 44148.4. Samples: 1789401340. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 01:03:43,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:03:46,938][06909] Updated weights for policy 0, policy_version 115153 (0.0019) [2024-06-28 01:03:48,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44509.9, 300 sec: 44209.0). Total num frames: 1886748672. Throughput: 0: 43994.5. Samples: 1789662680. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 01:03:48,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 01:03:48,877][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000115158_1886748672.pth... [2024-06-28 01:03:48,935][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000114512_1876164608.pth [2024-06-28 01:03:50,785][06909] Updated weights for policy 0, policy_version 115163 (0.0030) [2024-06-28 01:03:53,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1886961664. Throughput: 0: 44026.1. Samples: 1789927580. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 01:03:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:03:54,290][06909] Updated weights for policy 0, policy_version 115173 (0.0041) [2024-06-28 01:03:58,072][06909] Updated weights for policy 0, policy_version 115183 (0.0035) [2024-06-28 01:03:58,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 1887174656. Throughput: 0: 44028.3. Samples: 1790058440. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 01:03:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:04:01,650][06909] Updated weights for policy 0, policy_version 115193 (0.0033) [2024-06-28 01:04:03,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44238.4, 300 sec: 44209.3). Total num frames: 1887404032. Throughput: 0: 43977.0. Samples: 1790322860. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 01:04:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:04:05,481][06909] Updated weights for policy 0, policy_version 115203 (0.0039) [2024-06-28 01:04:08,853][06674] Fps is (10 sec: 45862.4, 60 sec: 44234.6, 300 sec: 44153.1). Total num frames: 1887633408. Throughput: 0: 43938.3. Samples: 1790581000. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 01:04:08,853][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:04:09,508][06909] Updated weights for policy 0, policy_version 115213 (0.0030) [2024-06-28 01:04:10,269][06887] Signal inference workers to stop experience collection... (25500 times) [2024-06-28 01:04:10,322][06909] InferenceWorker_p0-w0: stopping experience collection (25500 times) [2024-06-28 01:04:10,329][06887] Signal inference workers to resume experience collection... (25500 times) [2024-06-28 01:04:10,339][06909] InferenceWorker_p0-w0: resuming experience collection (25500 times) [2024-06-28 01:04:12,865][06909] Updated weights for policy 0, policy_version 115223 (0.0032) [2024-06-28 01:04:13,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.7, 300 sec: 44098.0). Total num frames: 1887846400. Throughput: 0: 43987.5. Samples: 1790720300. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 01:04:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:04:16,703][06909] Updated weights for policy 0, policy_version 115233 (0.0026) [2024-06-28 01:04:18,850][06674] Fps is (10 sec: 42610.6, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 1888059392. Throughput: 0: 44179.6. Samples: 1790991100. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 01:04:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 01:04:20,004][06909] Updated weights for policy 0, policy_version 115243 (0.0028) [2024-06-28 01:04:23,850][06674] Fps is (10 sec: 44236.0, 60 sec: 43965.1, 300 sec: 44153.5). Total num frames: 1888288768. Throughput: 0: 43954.1. Samples: 1791245100. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 01:04:23,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:04:24,244][06909] Updated weights for policy 0, policy_version 115253 (0.0038) [2024-06-28 01:04:27,812][06909] Updated weights for policy 0, policy_version 115263 (0.0040) [2024-06-28 01:04:28,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1888485376. Throughput: 0: 43935.7. Samples: 1791378440. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 01:04:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:04:31,538][06909] Updated weights for policy 0, policy_version 115273 (0.0025) [2024-06-28 01:04:33,853][06674] Fps is (10 sec: 42586.5, 60 sec: 43961.5, 300 sec: 44153.1). Total num frames: 1888714752. Throughput: 0: 43956.2. Samples: 1791640840. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 01:04:33,853][06674] Avg episode reward: [(0, '0.401')] [2024-06-28 01:04:35,199][06909] Updated weights for policy 0, policy_version 115283 (0.0039) [2024-06-28 01:04:38,796][06909] Updated weights for policy 0, policy_version 115293 (0.0030) [2024-06-28 01:04:38,850][06674] Fps is (10 sec: 47513.3, 60 sec: 43963.8, 300 sec: 44209.0). Total num frames: 1888960512. Throughput: 0: 43961.3. Samples: 1791905840. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 01:04:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:04:42,542][06909] Updated weights for policy 0, policy_version 115303 (0.0032) [2024-06-28 01:04:43,850][06674] Fps is (10 sec: 42610.9, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 1889140736. Throughput: 0: 43987.5. Samples: 1792037880. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 01:04:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:04:46,418][06909] Updated weights for policy 0, policy_version 115313 (0.0028) [2024-06-28 01:04:48,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 1889386496. Throughput: 0: 44080.8. Samples: 1792306500. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 01:04:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:04:50,110][06909] Updated weights for policy 0, policy_version 115323 (0.0028) [2024-06-28 01:04:53,674][06909] Updated weights for policy 0, policy_version 115333 (0.0031) [2024-06-28 01:04:53,850][06674] Fps is (10 sec: 47514.1, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 1889615872. Throughput: 0: 44045.0. Samples: 1792562900. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 01:04:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:04:57,595][06909] Updated weights for policy 0, policy_version 115343 (0.0027) [2024-06-28 01:04:58,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1889812480. Throughput: 0: 43904.9. Samples: 1792696020. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 01:04:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:05:01,715][06909] Updated weights for policy 0, policy_version 115353 (0.0024) [2024-06-28 01:05:03,856][06674] Fps is (10 sec: 42572.5, 60 sec: 43959.3, 300 sec: 44152.6). Total num frames: 1890041856. Throughput: 0: 43695.0. Samples: 1792957640. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 01:05:03,857][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:05:05,105][06909] Updated weights for policy 0, policy_version 115363 (0.0047) [2024-06-28 01:05:08,856][06674] Fps is (10 sec: 44210.4, 60 sec: 43688.3, 300 sec: 44098.0). Total num frames: 1890254848. Throughput: 0: 44040.5. Samples: 1793227180. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 01:05:08,857][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:05:08,981][06909] Updated weights for policy 0, policy_version 115373 (0.0032) [2024-06-28 01:05:12,264][06909] Updated weights for policy 0, policy_version 115383 (0.0031) [2024-06-28 01:05:13,850][06674] Fps is (10 sec: 42624.5, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1890467840. Throughput: 0: 44015.6. Samples: 1793359140. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 01:05:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:05:16,399][06909] Updated weights for policy 0, policy_version 115393 (0.0039) [2024-06-28 01:05:18,852][06674] Fps is (10 sec: 45893.5, 60 sec: 44235.3, 300 sec: 44153.2). Total num frames: 1890713600. Throughput: 0: 44036.5. Samples: 1793622440. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 01:05:18,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:05:19,594][06909] Updated weights for policy 0, policy_version 115403 (0.0023) [2024-06-28 01:05:23,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.9, 300 sec: 44097.9). Total num frames: 1890910208. Throughput: 0: 43973.0. Samples: 1793884620. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 01:05:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:05:23,977][06909] Updated weights for policy 0, policy_version 115413 (0.0024) [2024-06-28 01:05:26,801][06909] Updated weights for policy 0, policy_version 115423 (0.0021) [2024-06-28 01:05:28,850][06674] Fps is (10 sec: 40968.2, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1891123200. Throughput: 0: 44054.7. Samples: 1794020340. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 01:05:28,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:05:31,353][06909] Updated weights for policy 0, policy_version 115433 (0.0039) [2024-06-28 01:05:33,715][06887] Signal inference workers to stop experience collection... (25550 times) [2024-06-28 01:05:33,762][06909] InferenceWorker_p0-w0: stopping experience collection (25550 times) [2024-06-28 01:05:33,773][06887] Signal inference workers to resume experience collection... (25550 times) [2024-06-28 01:05:33,781][06909] InferenceWorker_p0-w0: resuming experience collection (25550 times) [2024-06-28 01:05:33,850][06674] Fps is (10 sec: 47513.7, 60 sec: 44512.2, 300 sec: 44209.0). Total num frames: 1891385344. Throughput: 0: 44053.5. Samples: 1794288900. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 01:05:33,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 01:05:34,388][06909] Updated weights for policy 0, policy_version 115443 (0.0032) [2024-06-28 01:05:38,677][06909] Updated weights for policy 0, policy_version 115453 (0.0036) [2024-06-28 01:05:38,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43690.7, 300 sec: 44098.1). Total num frames: 1891581952. Throughput: 0: 44227.5. Samples: 1794553140. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 01:05:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:05:41,722][06909] Updated weights for policy 0, policy_version 115463 (0.0033) [2024-06-28 01:05:43,850][06674] Fps is (10 sec: 40959.5, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1891794944. Throughput: 0: 44117.8. Samples: 1794681320. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 01:05:43,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:05:45,886][06909] Updated weights for policy 0, policy_version 115473 (0.0031) [2024-06-28 01:05:48,850][06674] Fps is (10 sec: 45874.5, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 1892040704. Throughput: 0: 44261.8. Samples: 1794949160. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 01:05:48,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:05:48,942][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000115482_1892057088.pth... [2024-06-28 01:05:48,989][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000114835_1881456640.pth [2024-06-28 01:05:49,134][06909] Updated weights for policy 0, policy_version 115483 (0.0037) [2024-06-28 01:05:53,705][06909] Updated weights for policy 0, policy_version 115493 (0.0034) [2024-06-28 01:05:53,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 1892237312. Throughput: 0: 44032.1. Samples: 1795208360. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 01:05:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:05:56,527][06909] Updated weights for policy 0, policy_version 115503 (0.0034) [2024-06-28 01:05:58,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 1892466688. Throughput: 0: 43939.8. Samples: 1795336440. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 01:05:58,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:06:01,003][06909] Updated weights for policy 0, policy_version 115513 (0.0031) [2024-06-28 01:06:03,850][06674] Fps is (10 sec: 47512.9, 60 sec: 44514.3, 300 sec: 44264.5). Total num frames: 1892712448. Throughput: 0: 44139.6. Samples: 1795608640. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 01:06:03,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:06:03,968][06909] Updated weights for policy 0, policy_version 115523 (0.0025) [2024-06-28 01:06:08,682][06909] Updated weights for policy 0, policy_version 115533 (0.0027) [2024-06-28 01:06:08,850][06674] Fps is (10 sec: 44237.4, 60 sec: 44241.2, 300 sec: 44042.4). Total num frames: 1892909056. Throughput: 0: 44215.5. Samples: 1795874320. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 01:06:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:06:11,368][06909] Updated weights for policy 0, policy_version 115543 (0.0029) [2024-06-28 01:06:13,850][06674] Fps is (10 sec: 42599.2, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 1893138432. Throughput: 0: 43997.9. Samples: 1796000240. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 01:06:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:06:15,693][06909] Updated weights for policy 0, policy_version 115553 (0.0041) [2024-06-28 01:06:18,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44238.3, 300 sec: 44209.0). Total num frames: 1893367808. Throughput: 0: 44140.8. Samples: 1796275240. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 01:06:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:06:19,052][06909] Updated weights for policy 0, policy_version 115563 (0.0046) [2024-06-28 01:06:22,957][06909] Updated weights for policy 0, policy_version 115573 (0.0038) [2024-06-28 01:06:23,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1893564416. Throughput: 0: 44112.9. Samples: 1796538220. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 01:06:23,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-28 01:06:26,439][06909] Updated weights for policy 0, policy_version 115583 (0.0037) [2024-06-28 01:06:28,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 1893793792. Throughput: 0: 44076.0. Samples: 1796664740. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 01:06:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:06:30,602][06909] Updated weights for policy 0, policy_version 115593 (0.0029) [2024-06-28 01:06:33,791][06909] Updated weights for policy 0, policy_version 115603 (0.0041) [2024-06-28 01:06:33,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44236.8, 300 sec: 44209.1). Total num frames: 1894039552. Throughput: 0: 44240.2. Samples: 1796939960. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 01:06:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:06:37,875][06909] Updated weights for policy 0, policy_version 115613 (0.0030) [2024-06-28 01:06:38,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1894219776. Throughput: 0: 44326.3. Samples: 1797203040. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 01:06:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:06:41,034][06909] Updated weights for policy 0, policy_version 115623 (0.0029) [2024-06-28 01:06:43,850][06674] Fps is (10 sec: 40959.7, 60 sec: 44236.8, 300 sec: 44042.6). Total num frames: 1894449152. Throughput: 0: 44397.9. Samples: 1797334340. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 01:06:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:06:45,350][06909] Updated weights for policy 0, policy_version 115633 (0.0037) [2024-06-28 01:06:48,025][06887] Signal inference workers to stop experience collection... (25600 times) [2024-06-28 01:06:48,048][06909] InferenceWorker_p0-w0: stopping experience collection (25600 times) [2024-06-28 01:06:48,080][06887] Signal inference workers to resume experience collection... (25600 times) [2024-06-28 01:06:48,081][06909] InferenceWorker_p0-w0: resuming experience collection (25600 times) [2024-06-28 01:06:48,706][06909] Updated weights for policy 0, policy_version 115643 (0.0032) [2024-06-28 01:06:48,850][06674] Fps is (10 sec: 47513.1, 60 sec: 44236.9, 300 sec: 44209.0). Total num frames: 1894694912. Throughput: 0: 44318.7. Samples: 1797602980. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 01:06:48,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:06:52,753][06909] Updated weights for policy 0, policy_version 115653 (0.0039) [2024-06-28 01:06:53,857][06674] Fps is (10 sec: 44205.7, 60 sec: 44231.6, 300 sec: 44041.4). Total num frames: 1894891520. Throughput: 0: 44218.0. Samples: 1797864440. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 01:06:53,857][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:06:55,946][06909] Updated weights for policy 0, policy_version 115663 (0.0030) [2024-06-28 01:06:58,850][06674] Fps is (10 sec: 42598.9, 60 sec: 44236.9, 300 sec: 44042.7). Total num frames: 1895120896. Throughput: 0: 44323.1. Samples: 1797994780. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 01:06:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:07:00,398][06909] Updated weights for policy 0, policy_version 115673 (0.0038) [2024-06-28 01:07:03,637][06909] Updated weights for policy 0, policy_version 115683 (0.0030) [2024-06-28 01:07:03,850][06674] Fps is (10 sec: 45907.4, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 1895350272. Throughput: 0: 44173.3. Samples: 1798263040. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 01:07:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:07:07,713][06909] Updated weights for policy 0, policy_version 115693 (0.0042) [2024-06-28 01:07:08,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1895546880. Throughput: 0: 44350.6. Samples: 1798534000. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 01:07:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:07:11,128][06909] Updated weights for policy 0, policy_version 115703 (0.0039) [2024-06-28 01:07:13,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1895792640. Throughput: 0: 44416.9. Samples: 1798663500. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 01:07:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:07:14,915][06909] Updated weights for policy 0, policy_version 115713 (0.0040) [2024-06-28 01:07:18,458][06909] Updated weights for policy 0, policy_version 115723 (0.0037) [2024-06-28 01:07:18,856][06674] Fps is (10 sec: 47485.6, 60 sec: 44232.5, 300 sec: 44208.2). Total num frames: 1896022016. Throughput: 0: 44121.3. Samples: 1798925680. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 01:07:18,856][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:07:22,607][06909] Updated weights for policy 0, policy_version 115733 (0.0033) [2024-06-28 01:07:23,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 1896235008. Throughput: 0: 44127.9. Samples: 1799188800. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 01:07:23,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:07:25,877][06909] Updated weights for policy 0, policy_version 115743 (0.0031) [2024-06-28 01:07:28,850][06674] Fps is (10 sec: 40983.8, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1896431616. Throughput: 0: 44140.0. Samples: 1799320640. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 01:07:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:07:30,126][06909] Updated weights for policy 0, policy_version 115753 (0.0042) [2024-06-28 01:07:33,782][06909] Updated weights for policy 0, policy_version 115763 (0.0034) [2024-06-28 01:07:33,852][06674] Fps is (10 sec: 42590.1, 60 sec: 43689.2, 300 sec: 44097.7). Total num frames: 1896660992. Throughput: 0: 43904.7. Samples: 1799578780. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 01:07:33,861][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:07:37,794][06909] Updated weights for policy 0, policy_version 115773 (0.0041) [2024-06-28 01:07:38,852][06674] Fps is (10 sec: 44228.0, 60 sec: 44235.3, 300 sec: 44042.1). Total num frames: 1896873984. Throughput: 0: 44004.0. Samples: 1799844400. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 01:07:38,861][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 01:07:41,110][06909] Updated weights for policy 0, policy_version 115783 (0.0029) [2024-06-28 01:07:43,850][06674] Fps is (10 sec: 44245.4, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1897103360. Throughput: 0: 44075.0. Samples: 1799978160. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 01:07:43,856][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:07:45,037][06909] Updated weights for policy 0, policy_version 115793 (0.0035) [2024-06-28 01:07:48,808][06909] Updated weights for policy 0, policy_version 115803 (0.0034) [2024-06-28 01:07:48,850][06674] Fps is (10 sec: 44245.5, 60 sec: 43690.7, 300 sec: 44097.9). Total num frames: 1897316352. Throughput: 0: 43955.1. Samples: 1800241020. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 01:07:48,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:07:48,869][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000115803_1897316352.pth... [2024-06-28 01:07:48,944][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000115158_1886748672.pth [2024-06-28 01:07:52,261][06909] Updated weights for policy 0, policy_version 115813 (0.0036) [2024-06-28 01:07:53,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44242.0, 300 sec: 44097.9). Total num frames: 1897545728. Throughput: 0: 43937.3. Samples: 1800511180. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 01:07:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 01:07:56,079][06909] Updated weights for policy 0, policy_version 115823 (0.0035) [2024-06-28 01:07:58,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43963.7, 300 sec: 44098.3). Total num frames: 1897758720. Throughput: 0: 44013.8. Samples: 1800644120. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 01:07:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:07:59,637][06909] Updated weights for policy 0, policy_version 115833 (0.0030) [2024-06-28 01:08:03,466][06909] Updated weights for policy 0, policy_version 115843 (0.0035) [2024-06-28 01:08:03,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1897971712. Throughput: 0: 43998.7. Samples: 1800905360. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 01:08:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:08:07,156][06909] Updated weights for policy 0, policy_version 115853 (0.0029) [2024-06-28 01:08:08,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 1898201088. Throughput: 0: 44101.5. Samples: 1801173360. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 01:08:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 01:08:10,967][06909] Updated weights for policy 0, policy_version 115863 (0.0032) [2024-06-28 01:08:13,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 1898430464. Throughput: 0: 44095.2. Samples: 1801304920. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 01:08:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:08:14,891][06909] Updated weights for policy 0, policy_version 115873 (0.0046) [2024-06-28 01:08:18,234][06909] Updated weights for policy 0, policy_version 115883 (0.0022) [2024-06-28 01:08:18,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43968.0, 300 sec: 44098.3). Total num frames: 1898659840. Throughput: 0: 44183.3. Samples: 1801566940. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 01:08:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:08:22,083][06909] Updated weights for policy 0, policy_version 115893 (0.0031) [2024-06-28 01:08:23,215][06887] Signal inference workers to stop experience collection... (25650 times) [2024-06-28 01:08:23,263][06909] InferenceWorker_p0-w0: stopping experience collection (25650 times) [2024-06-28 01:08:23,269][06887] Signal inference workers to resume experience collection... (25650 times) [2024-06-28 01:08:23,276][06909] InferenceWorker_p0-w0: resuming experience collection (25650 times) [2024-06-28 01:08:23,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1898889216. Throughput: 0: 44406.8. Samples: 1801842620. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 01:08:23,856][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:08:25,597][06909] Updated weights for policy 0, policy_version 115903 (0.0025) [2024-06-28 01:08:28,852][06674] Fps is (10 sec: 42589.8, 60 sec: 44235.3, 300 sec: 44097.6). Total num frames: 1899085824. Throughput: 0: 44342.5. Samples: 1801973660. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 01:08:28,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:08:29,330][06909] Updated weights for policy 0, policy_version 115913 (0.0035) [2024-06-28 01:08:33,061][06909] Updated weights for policy 0, policy_version 115923 (0.0034) [2024-06-28 01:08:33,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44238.2, 300 sec: 44042.4). Total num frames: 1899315200. Throughput: 0: 44319.1. Samples: 1802235380. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 01:08:33,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:08:36,894][06909] Updated weights for policy 0, policy_version 115933 (0.0032) [2024-06-28 01:08:38,850][06674] Fps is (10 sec: 44246.1, 60 sec: 44238.3, 300 sec: 44098.0). Total num frames: 1899528192. Throughput: 0: 44152.1. Samples: 1802498020. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 01:08:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 01:08:40,562][06909] Updated weights for policy 0, policy_version 115943 (0.0029) [2024-06-28 01:08:43,850][06674] Fps is (10 sec: 44237.6, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 1899757568. Throughput: 0: 44217.3. Samples: 1802633900. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 01:08:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 01:08:44,356][06909] Updated weights for policy 0, policy_version 115953 (0.0042) [2024-06-28 01:08:47,739][06909] Updated weights for policy 0, policy_version 115963 (0.0029) [2024-06-28 01:08:48,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 1899970560. Throughput: 0: 44325.3. Samples: 1802900000. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 01:08:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:08:51,811][06909] Updated weights for policy 0, policy_version 115973 (0.0033) [2024-06-28 01:08:53,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1900199936. Throughput: 0: 44178.6. Samples: 1803161400. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 01:08:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:08:55,066][06909] Updated weights for policy 0, policy_version 115983 (0.0034) [2024-06-28 01:08:58,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 1900412928. Throughput: 0: 44345.4. Samples: 1803300460. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 01:08:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:08:59,043][06909] Updated weights for policy 0, policy_version 115993 (0.0027) [2024-06-28 01:09:02,488][06909] Updated weights for policy 0, policy_version 116003 (0.0037) [2024-06-28 01:09:03,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44236.8, 300 sec: 44042.8). Total num frames: 1900625920. Throughput: 0: 44380.5. Samples: 1803564060. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 01:09:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:09:06,545][06909] Updated weights for policy 0, policy_version 116013 (0.0029) [2024-06-28 01:09:08,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 1900871680. Throughput: 0: 43964.5. Samples: 1803821020. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 01:09:08,862][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:09:09,951][06909] Updated weights for policy 0, policy_version 116023 (0.0036) [2024-06-28 01:09:13,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 1901068288. Throughput: 0: 44082.0. Samples: 1803957260. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 01:09:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:09:13,950][06909] Updated weights for policy 0, policy_version 116033 (0.0032) [2024-06-28 01:09:17,409][06909] Updated weights for policy 0, policy_version 116043 (0.0037) [2024-06-28 01:09:18,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 1901297664. Throughput: 0: 44079.2. Samples: 1804218940. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 01:09:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:09:21,389][06909] Updated weights for policy 0, policy_version 116053 (0.0031) [2024-06-28 01:09:23,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 1901527040. Throughput: 0: 44151.8. Samples: 1804484860. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 01:09:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:09:24,817][06909] Updated weights for policy 0, policy_version 116063 (0.0025) [2024-06-28 01:09:28,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43965.3, 300 sec: 44098.4). Total num frames: 1901723648. Throughput: 0: 44089.8. Samples: 1804617940. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 01:09:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:09:28,899][06909] Updated weights for policy 0, policy_version 116073 (0.0039) [2024-06-28 01:09:32,113][06909] Updated weights for policy 0, policy_version 116083 (0.0029) [2024-06-28 01:09:33,850][06674] Fps is (10 sec: 42599.2, 60 sec: 43963.9, 300 sec: 44042.4). Total num frames: 1901953024. Throughput: 0: 43957.8. Samples: 1804878100. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 01:09:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:09:36,150][06909] Updated weights for policy 0, policy_version 116093 (0.0028) [2024-06-28 01:09:38,850][06674] Fps is (10 sec: 45874.5, 60 sec: 44236.7, 300 sec: 44209.0). Total num frames: 1902182400. Throughput: 0: 44081.7. Samples: 1805145080. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 01:09:38,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:09:39,625][06909] Updated weights for policy 0, policy_version 116103 (0.0030) [2024-06-28 01:09:43,687][06909] Updated weights for policy 0, policy_version 116113 (0.0031) [2024-06-28 01:09:43,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 1902395392. Throughput: 0: 44013.7. Samples: 1805281080. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 01:09:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:09:47,298][06909] Updated weights for policy 0, policy_version 116123 (0.0034) [2024-06-28 01:09:48,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1902608384. Throughput: 0: 43905.7. Samples: 1805539820. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 01:09:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:09:48,932][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000116127_1902624768.pth... [2024-06-28 01:09:48,982][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000115482_1892057088.pth [2024-06-28 01:09:51,001][06909] Updated weights for policy 0, policy_version 116133 (0.0028) [2024-06-28 01:09:53,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 1902837760. Throughput: 0: 44105.4. Samples: 1805805760. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 01:09:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:09:54,678][06909] Updated weights for policy 0, policy_version 116143 (0.0034) [2024-06-28 01:09:58,700][06909] Updated weights for policy 0, policy_version 116153 (0.0029) [2024-06-28 01:09:58,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 44098.8). Total num frames: 1903050752. Throughput: 0: 44112.0. Samples: 1805942300. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 01:09:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:10:00,398][06887] Signal inference workers to stop experience collection... (25700 times) [2024-06-28 01:10:00,425][06909] InferenceWorker_p0-w0: stopping experience collection (25700 times) [2024-06-28 01:10:00,461][06887] Signal inference workers to resume experience collection... (25700 times) [2024-06-28 01:10:00,462][06909] InferenceWorker_p0-w0: resuming experience collection (25700 times) [2024-06-28 01:10:01,904][06909] Updated weights for policy 0, policy_version 116163 (0.0028) [2024-06-28 01:10:03,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.7, 300 sec: 44154.4). Total num frames: 1903280128. Throughput: 0: 44235.1. Samples: 1806209520. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 01:10:03,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:10:05,880][06909] Updated weights for policy 0, policy_version 116173 (0.0028) [2024-06-28 01:10:08,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44236.8, 300 sec: 44264.5). Total num frames: 1903525888. Throughput: 0: 44059.6. Samples: 1806467540. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 01:10:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:10:09,474][06909] Updated weights for policy 0, policy_version 116183 (0.0029) [2024-06-28 01:10:13,598][06909] Updated weights for policy 0, policy_version 116193 (0.0029) [2024-06-28 01:10:13,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43963.8, 300 sec: 44042.7). Total num frames: 1903706112. Throughput: 0: 44135.1. Samples: 1806604020. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 01:10:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:10:16,649][06909] Updated weights for policy 0, policy_version 116203 (0.0026) [2024-06-28 01:10:18,850][06674] Fps is (10 sec: 42598.2, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 1903951872. Throughput: 0: 44400.3. Samples: 1806876120. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 01:10:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:10:20,962][06909] Updated weights for policy 0, policy_version 116213 (0.0023) [2024-06-28 01:10:23,852][06674] Fps is (10 sec: 47503.7, 60 sec: 44235.4, 300 sec: 44264.3). Total num frames: 1904181248. Throughput: 0: 44092.3. Samples: 1807129320. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 01:10:23,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:10:24,421][06909] Updated weights for policy 0, policy_version 116223 (0.0025) [2024-06-28 01:10:28,297][06909] Updated weights for policy 0, policy_version 116233 (0.0035) [2024-06-28 01:10:28,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44509.8, 300 sec: 44097.9). Total num frames: 1904394240. Throughput: 0: 44240.0. Samples: 1807271880. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 01:10:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:10:31,719][06909] Updated weights for policy 0, policy_version 116243 (0.0037) [2024-06-28 01:10:33,850][06674] Fps is (10 sec: 40967.8, 60 sec: 43963.6, 300 sec: 44097.9). Total num frames: 1904590848. Throughput: 0: 44261.3. Samples: 1807531580. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 01:10:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:10:35,575][06909] Updated weights for policy 0, policy_version 116253 (0.0038) [2024-06-28 01:10:38,852][06674] Fps is (10 sec: 44227.9, 60 sec: 44235.3, 300 sec: 44208.7). Total num frames: 1904836608. Throughput: 0: 44118.4. Samples: 1807791180. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 01:10:38,852][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:10:38,945][06909] Updated weights for policy 0, policy_version 116263 (0.0040) [2024-06-28 01:10:43,164][06909] Updated weights for policy 0, policy_version 116273 (0.0028) [2024-06-28 01:10:43,850][06674] Fps is (10 sec: 45875.7, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1905049600. Throughput: 0: 44217.4. Samples: 1807932080. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 01:10:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:10:46,539][06909] Updated weights for policy 0, policy_version 116283 (0.0045) [2024-06-28 01:10:48,850][06674] Fps is (10 sec: 42607.3, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 1905262592. Throughput: 0: 44055.2. Samples: 1808192000. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-28 01:10:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:10:50,653][06909] Updated weights for policy 0, policy_version 116293 (0.0040) [2024-06-28 01:10:53,856][06909] Updated weights for policy 0, policy_version 116303 (0.0035) [2024-06-28 01:10:53,856][06674] Fps is (10 sec: 45847.6, 60 sec: 44505.4, 300 sec: 44208.1). Total num frames: 1905508352. Throughput: 0: 44052.9. Samples: 1808450180. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-28 01:10:53,857][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:10:58,080][06909] Updated weights for policy 0, policy_version 116313 (0.0031) [2024-06-28 01:10:58,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44510.0, 300 sec: 44098.0). Total num frames: 1905721344. Throughput: 0: 44143.1. Samples: 1808590460. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-28 01:10:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 01:11:01,566][06909] Updated weights for policy 0, policy_version 116323 (0.0030) [2024-06-28 01:11:03,850][06674] Fps is (10 sec: 39345.4, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1905901568. Throughput: 0: 43906.8. Samples: 1808851920. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-28 01:11:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:11:05,285][06909] Updated weights for policy 0, policy_version 116333 (0.0034) [2024-06-28 01:11:08,717][06909] Updated weights for policy 0, policy_version 116343 (0.0026) [2024-06-28 01:11:08,852][06674] Fps is (10 sec: 44227.4, 60 sec: 43962.3, 300 sec: 44153.2). Total num frames: 1906163712. Throughput: 0: 44134.2. Samples: 1809115360. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-28 01:11:08,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 01:11:12,891][06909] Updated weights for policy 0, policy_version 116353 (0.0033) [2024-06-28 01:11:13,850][06674] Fps is (10 sec: 49151.7, 60 sec: 44782.9, 300 sec: 44153.5). Total num frames: 1906393088. Throughput: 0: 44153.8. Samples: 1809258800. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-28 01:11:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:11:15,933][06909] Updated weights for policy 0, policy_version 116363 (0.0045) [2024-06-28 01:11:18,850][06674] Fps is (10 sec: 40968.5, 60 sec: 43690.8, 300 sec: 44097.9). Total num frames: 1906573312. Throughput: 0: 44152.1. Samples: 1809518420. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-28 01:11:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:11:20,429][06909] Updated weights for policy 0, policy_version 116373 (0.0039) [2024-06-28 01:11:23,739][06909] Updated weights for policy 0, policy_version 116383 (0.0026) [2024-06-28 01:11:23,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43965.1, 300 sec: 44153.5). Total num frames: 1906819072. Throughput: 0: 44181.0. Samples: 1809779240. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-28 01:11:23,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:11:27,841][06909] Updated weights for policy 0, policy_version 116393 (0.0034) [2024-06-28 01:11:28,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 1907048448. Throughput: 0: 43963.1. Samples: 1809910420. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-28 01:11:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:11:29,115][06887] Signal inference workers to stop experience collection... (25750 times) [2024-06-28 01:11:29,115][06887] Signal inference workers to resume experience collection... (25750 times) [2024-06-28 01:11:29,168][06909] InferenceWorker_p0-w0: stopping experience collection (25750 times) [2024-06-28 01:11:29,168][06909] InferenceWorker_p0-w0: resuming experience collection (25750 times) [2024-06-28 01:11:30,998][06909] Updated weights for policy 0, policy_version 116403 (0.0030) [2024-06-28 01:11:33,850][06674] Fps is (10 sec: 40960.8, 60 sec: 43963.9, 300 sec: 44098.0). Total num frames: 1907228672. Throughput: 0: 44129.8. Samples: 1810177840. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-28 01:11:33,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 01:11:35,246][06909] Updated weights for policy 0, policy_version 116413 (0.0028) [2024-06-28 01:11:38,652][06909] Updated weights for policy 0, policy_version 116423 (0.0026) [2024-06-28 01:11:38,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43965.2, 300 sec: 44153.5). Total num frames: 1907474432. Throughput: 0: 44203.2. Samples: 1810439060. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-28 01:11:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:11:42,587][06909] Updated weights for policy 0, policy_version 116433 (0.0038) [2024-06-28 01:11:43,850][06674] Fps is (10 sec: 49152.5, 60 sec: 44510.0, 300 sec: 44153.5). Total num frames: 1907720192. Throughput: 0: 44272.5. Samples: 1810582720. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-28 01:11:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:11:45,928][06909] Updated weights for policy 0, policy_version 116443 (0.0023) [2024-06-28 01:11:48,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.6, 300 sec: 44043.5). Total num frames: 1907884032. Throughput: 0: 44183.5. Samples: 1810840180. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-28 01:11:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:11:48,966][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000116449_1907900416.pth... [2024-06-28 01:11:49,014][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000115803_1897316352.pth [2024-06-28 01:11:49,985][06909] Updated weights for policy 0, policy_version 116453 (0.0031) [2024-06-28 01:11:53,121][06909] Updated weights for policy 0, policy_version 116463 (0.0038) [2024-06-28 01:11:53,850][06674] Fps is (10 sec: 40959.1, 60 sec: 43695.0, 300 sec: 44097.9). Total num frames: 1908129792. Throughput: 0: 44038.0. Samples: 1811096980. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-28 01:11:53,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:11:57,422][06909] Updated weights for policy 0, policy_version 116473 (0.0027) [2024-06-28 01:11:58,850][06674] Fps is (10 sec: 50790.8, 60 sec: 44509.8, 300 sec: 44209.0). Total num frames: 1908391936. Throughput: 0: 43948.1. Samples: 1811236460. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-28 01:11:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:12:00,762][06909] Updated weights for policy 0, policy_version 116483 (0.0026) [2024-06-28 01:12:03,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 1908572160. Throughput: 0: 44019.1. Samples: 1811499280. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-28 01:12:03,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:12:05,042][06909] Updated weights for policy 0, policy_version 116493 (0.0032) [2024-06-28 01:12:08,155][06909] Updated weights for policy 0, policy_version 116503 (0.0033) [2024-06-28 01:12:08,850][06674] Fps is (10 sec: 39320.9, 60 sec: 43692.1, 300 sec: 44042.4). Total num frames: 1908785152. Throughput: 0: 44046.2. Samples: 1811761320. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-28 01:12:08,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:12:12,311][06909] Updated weights for policy 0, policy_version 116513 (0.0022) [2024-06-28 01:12:13,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.7, 300 sec: 44043.3). Total num frames: 1909014528. Throughput: 0: 44197.8. Samples: 1811899320. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-28 01:12:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:12:15,679][06909] Updated weights for policy 0, policy_version 116523 (0.0034) [2024-06-28 01:12:18,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1909227520. Throughput: 0: 44071.0. Samples: 1812161040. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-28 01:12:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:12:19,674][06909] Updated weights for policy 0, policy_version 116533 (0.0042) [2024-06-28 01:12:23,026][06909] Updated weights for policy 0, policy_version 116543 (0.0037) [2024-06-28 01:12:23,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44237.0, 300 sec: 44209.1). Total num frames: 1909473280. Throughput: 0: 44175.7. Samples: 1812426960. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-28 01:12:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:12:27,203][06909] Updated weights for policy 0, policy_version 116553 (0.0033) [2024-06-28 01:12:28,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43963.8, 300 sec: 44153.8). Total num frames: 1909686272. Throughput: 0: 44009.7. Samples: 1812563160. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-28 01:12:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:12:30,426][06909] Updated weights for policy 0, policy_version 116563 (0.0032) [2024-06-28 01:12:33,850][06674] Fps is (10 sec: 40959.5, 60 sec: 44236.7, 300 sec: 44098.3). Total num frames: 1909882880. Throughput: 0: 43997.8. Samples: 1812820080. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-28 01:12:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:12:34,607][06909] Updated weights for policy 0, policy_version 116573 (0.0031) [2024-06-28 01:12:37,743][06909] Updated weights for policy 0, policy_version 116583 (0.0039) [2024-06-28 01:12:38,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1910128640. Throughput: 0: 44250.7. Samples: 1813088260. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-28 01:12:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:12:42,023][06909] Updated weights for policy 0, policy_version 116593 (0.0028) [2024-06-28 01:12:43,327][06887] Signal inference workers to stop experience collection... (25800 times) [2024-06-28 01:12:43,328][06887] Signal inference workers to resume experience collection... (25800 times) [2024-06-28 01:12:43,346][06909] InferenceWorker_p0-w0: stopping experience collection (25800 times) [2024-06-28 01:12:43,346][06909] InferenceWorker_p0-w0: resuming experience collection (25800 times) [2024-06-28 01:12:43,850][06674] Fps is (10 sec: 49151.8, 60 sec: 44236.6, 300 sec: 44264.6). Total num frames: 1910374400. Throughput: 0: 44135.9. Samples: 1813222580. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-28 01:12:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 01:12:45,479][06909] Updated weights for policy 0, policy_version 116603 (0.0024) [2024-06-28 01:12:48,850][06674] Fps is (10 sec: 40959.7, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1910538240. Throughput: 0: 44223.5. Samples: 1813489340. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 01:12:48,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 01:12:49,546][06909] Updated weights for policy 0, policy_version 116613 (0.0026) [2024-06-28 01:12:52,750][06909] Updated weights for policy 0, policy_version 116623 (0.0022) [2024-06-28 01:12:53,850][06674] Fps is (10 sec: 40960.5, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 1910784000. Throughput: 0: 44173.5. Samples: 1813749120. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 01:12:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:12:56,745][06909] Updated weights for policy 0, policy_version 116633 (0.0035) [2024-06-28 01:12:58,850][06674] Fps is (10 sec: 49152.4, 60 sec: 43963.7, 300 sec: 44264.6). Total num frames: 1911029760. Throughput: 0: 44198.6. Samples: 1813888260. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 01:12:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:13:00,405][06909] Updated weights for policy 0, policy_version 116643 (0.0030) [2024-06-28 01:13:03,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 1911209984. Throughput: 0: 44050.2. Samples: 1814143300. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 01:13:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:13:04,509][06909] Updated weights for policy 0, policy_version 116653 (0.0042) [2024-06-28 01:13:07,734][06909] Updated weights for policy 0, policy_version 116663 (0.0026) [2024-06-28 01:13:08,850][06674] Fps is (10 sec: 40959.8, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 1911439360. Throughput: 0: 44012.8. Samples: 1814407540. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 01:13:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:13:11,764][06909] Updated weights for policy 0, policy_version 116673 (0.0040) [2024-06-28 01:13:13,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1911668736. Throughput: 0: 43926.2. Samples: 1814539840. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 01:13:13,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 01:13:14,916][06909] Updated weights for policy 0, policy_version 116683 (0.0038) [2024-06-28 01:13:18,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1911881728. Throughput: 0: 44075.1. Samples: 1814803460. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 01:13:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:13:19,158][06909] Updated weights for policy 0, policy_version 116693 (0.0030) [2024-06-28 01:13:22,721][06909] Updated weights for policy 0, policy_version 116703 (0.0030) [2024-06-28 01:13:23,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.6, 300 sec: 44098.3). Total num frames: 1912094720. Throughput: 0: 43967.1. Samples: 1815066780. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 01:13:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:13:26,530][06909] Updated weights for policy 0, policy_version 116713 (0.0034) [2024-06-28 01:13:28,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 1912324096. Throughput: 0: 43876.1. Samples: 1815197000. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 01:13:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:13:29,926][06909] Updated weights for policy 0, policy_version 116723 (0.0028) [2024-06-28 01:13:33,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 1912537088. Throughput: 0: 43814.8. Samples: 1815461000. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 01:13:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:13:33,958][06909] Updated weights for policy 0, policy_version 116733 (0.0023) [2024-06-28 01:13:37,569][06909] Updated weights for policy 0, policy_version 116743 (0.0030) [2024-06-28 01:13:38,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 1912766464. Throughput: 0: 43873.2. Samples: 1815723420. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 01:13:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:13:41,474][06909] Updated weights for policy 0, policy_version 116753 (0.0041) [2024-06-28 01:13:43,851][06674] Fps is (10 sec: 45870.1, 60 sec: 43690.0, 300 sec: 44153.3). Total num frames: 1912995840. Throughput: 0: 43895.4. Samples: 1815863600. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 01:13:43,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:13:44,752][06909] Updated weights for policy 0, policy_version 116763 (0.0030) [2024-06-28 01:13:48,607][06909] Updated weights for policy 0, policy_version 116773 (0.0037) [2024-06-28 01:13:48,852][06674] Fps is (10 sec: 44227.0, 60 sec: 44508.2, 300 sec: 44097.6). Total num frames: 1913208832. Throughput: 0: 44240.1. Samples: 1816134200. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 01:13:48,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:13:48,862][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000116773_1913208832.pth... [2024-06-28 01:13:48,920][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000116127_1902624768.pth [2024-06-28 01:13:52,146][06909] Updated weights for policy 0, policy_version 116783 (0.0037) [2024-06-28 01:13:53,852][06674] Fps is (10 sec: 44232.3, 60 sec: 44235.2, 300 sec: 44153.2). Total num frames: 1913438208. Throughput: 0: 44160.2. Samples: 1816394840. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 01:13:53,852][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 01:13:56,167][06909] Updated weights for policy 0, policy_version 116793 (0.0029) [2024-06-28 01:13:58,850][06674] Fps is (10 sec: 44247.1, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 1913651200. Throughput: 0: 44085.4. Samples: 1816523680. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 01:13:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 01:13:59,749][06909] Updated weights for policy 0, policy_version 116803 (0.0028) [2024-06-28 01:14:03,638][06909] Updated weights for policy 0, policy_version 116813 (0.0031) [2024-06-28 01:14:03,856][06674] Fps is (10 sec: 42581.5, 60 sec: 44232.4, 300 sec: 44041.5). Total num frames: 1913864192. Throughput: 0: 44118.1. Samples: 1816789040. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 01:14:03,856][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:14:07,137][06909] Updated weights for policy 0, policy_version 116823 (0.0029) [2024-06-28 01:14:08,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 1914093568. Throughput: 0: 44253.3. Samples: 1817058180. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 01:14:08,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 01:14:10,831][06909] Updated weights for policy 0, policy_version 116833 (0.0026) [2024-06-28 01:14:11,498][06887] Signal inference workers to stop experience collection... (25850 times) [2024-06-28 01:14:11,499][06887] Signal inference workers to resume experience collection... (25850 times) [2024-06-28 01:14:11,520][06909] InferenceWorker_p0-w0: stopping experience collection (25850 times) [2024-06-28 01:14:11,520][06909] InferenceWorker_p0-w0: resuming experience collection (25850 times) [2024-06-28 01:14:13,850][06674] Fps is (10 sec: 44263.8, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 1914306560. Throughput: 0: 44344.0. Samples: 1817192480. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 01:14:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:14:14,525][06909] Updated weights for policy 0, policy_version 116843 (0.0041) [2024-06-28 01:14:18,411][06909] Updated weights for policy 0, policy_version 116853 (0.0027) [2024-06-28 01:14:18,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 1914535936. Throughput: 0: 44378.2. Samples: 1817458020. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 01:14:18,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-28 01:14:22,282][06909] Updated weights for policy 0, policy_version 116863 (0.0035) [2024-06-28 01:14:23,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1914748928. Throughput: 0: 44442.3. Samples: 1817723320. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 01:14:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:14:25,690][06909] Updated weights for policy 0, policy_version 116873 (0.0033) [2024-06-28 01:14:28,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1914978304. Throughput: 0: 44229.1. Samples: 1817853860. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 01:14:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:14:29,672][06909] Updated weights for policy 0, policy_version 116883 (0.0027) [2024-06-28 01:14:33,106][06909] Updated weights for policy 0, policy_version 116893 (0.0034) [2024-06-28 01:14:33,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1915191296. Throughput: 0: 44089.4. Samples: 1818118120. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 01:14:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:14:37,089][06909] Updated weights for policy 0, policy_version 116903 (0.0032) [2024-06-28 01:14:38,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 1915420672. Throughput: 0: 44166.0. Samples: 1818382220. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 01:14:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:14:40,585][06909] Updated weights for policy 0, policy_version 116913 (0.0034) [2024-06-28 01:14:43,850][06674] Fps is (10 sec: 45874.6, 60 sec: 44237.5, 300 sec: 44209.0). Total num frames: 1915650048. Throughput: 0: 44321.6. Samples: 1818518160. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 01:14:43,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:14:44,355][06909] Updated weights for policy 0, policy_version 116923 (0.0034) [2024-06-28 01:14:47,757][06909] Updated weights for policy 0, policy_version 116933 (0.0024) [2024-06-28 01:14:48,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44238.4, 300 sec: 44153.5). Total num frames: 1915863040. Throughput: 0: 44205.9. Samples: 1818778040. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 01:14:48,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:14:52,094][06909] Updated weights for policy 0, policy_version 116943 (0.0026) [2024-06-28 01:14:53,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43965.3, 300 sec: 44153.5). Total num frames: 1916076032. Throughput: 0: 44033.7. Samples: 1819039700. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-28 01:14:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:14:55,454][06909] Updated weights for policy 0, policy_version 116953 (0.0033) [2024-06-28 01:14:58,852][06674] Fps is (10 sec: 44227.9, 60 sec: 44235.2, 300 sec: 44153.2). Total num frames: 1916305408. Throughput: 0: 43914.4. Samples: 1819168720. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-28 01:14:58,853][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:14:59,377][06909] Updated weights for policy 0, policy_version 116963 (0.0036) [2024-06-28 01:15:03,091][06909] Updated weights for policy 0, policy_version 116973 (0.0048) [2024-06-28 01:15:03,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44514.4, 300 sec: 44098.0). Total num frames: 1916534784. Throughput: 0: 44037.3. Samples: 1819439700. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-28 01:15:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:15:07,080][06909] Updated weights for policy 0, policy_version 116983 (0.0027) [2024-06-28 01:15:08,850][06674] Fps is (10 sec: 42607.3, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 1916731392. Throughput: 0: 43922.3. Samples: 1819699820. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-28 01:15:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:15:10,438][06909] Updated weights for policy 0, policy_version 116993 (0.0048) [2024-06-28 01:15:13,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 1916944384. Throughput: 0: 43944.4. Samples: 1819831360. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-28 01:15:13,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 01:15:14,381][06909] Updated weights for policy 0, policy_version 117003 (0.0050) [2024-06-28 01:15:17,584][06909] Updated weights for policy 0, policy_version 117013 (0.0024) [2024-06-28 01:15:18,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.7, 300 sec: 44042.7). Total num frames: 1917173760. Throughput: 0: 43995.5. Samples: 1820097920. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-28 01:15:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:15:21,622][06909] Updated weights for policy 0, policy_version 117023 (0.0026) [2024-06-28 01:15:22,668][06887] Signal inference workers to stop experience collection... (25900 times) [2024-06-28 01:15:22,668][06887] Signal inference workers to resume experience collection... (25900 times) [2024-06-28 01:15:22,730][06909] InferenceWorker_p0-w0: stopping experience collection (25900 times) [2024-06-28 01:15:22,730][06909] InferenceWorker_p0-w0: resuming experience collection (25900 times) [2024-06-28 01:15:23,850][06674] Fps is (10 sec: 45875.9, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 1917403136. Throughput: 0: 44121.8. Samples: 1820367700. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-28 01:15:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:15:24,853][06909] Updated weights for policy 0, policy_version 117033 (0.0041) [2024-06-28 01:15:28,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 1917616128. Throughput: 0: 43925.5. Samples: 1820494800. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-28 01:15:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:15:29,262][06909] Updated weights for policy 0, policy_version 117043 (0.0027) [2024-06-28 01:15:32,717][06909] Updated weights for policy 0, policy_version 117053 (0.0032) [2024-06-28 01:15:33,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.8, 300 sec: 44098.3). Total num frames: 1917845504. Throughput: 0: 44077.9. Samples: 1820761540. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-28 01:15:33,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-28 01:15:36,894][06909] Updated weights for policy 0, policy_version 117063 (0.0036) [2024-06-28 01:15:38,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 1918058496. Throughput: 0: 44166.4. Samples: 1821027180. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-28 01:15:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:15:40,108][06909] Updated weights for policy 0, policy_version 117073 (0.0024) [2024-06-28 01:15:43,850][06674] Fps is (10 sec: 42596.8, 60 sec: 43690.5, 300 sec: 44097.9). Total num frames: 1918271488. Throughput: 0: 44087.5. Samples: 1821152580. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-28 01:15:43,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 01:15:44,047][06909] Updated weights for policy 0, policy_version 117083 (0.0029) [2024-06-28 01:15:47,378][06909] Updated weights for policy 0, policy_version 117093 (0.0020) [2024-06-28 01:15:48,850][06674] Fps is (10 sec: 45874.3, 60 sec: 44236.8, 300 sec: 44098.8). Total num frames: 1918517248. Throughput: 0: 44069.7. Samples: 1821422840. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-28 01:15:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:15:48,927][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000117098_1918533632.pth... [2024-06-28 01:15:48,987][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000116449_1907900416.pth [2024-06-28 01:15:51,476][06909] Updated weights for policy 0, policy_version 117103 (0.0021) [2024-06-28 01:15:53,850][06674] Fps is (10 sec: 45876.7, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 1918730240. Throughput: 0: 44306.6. Samples: 1821693620. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 01:15:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:15:55,046][06909] Updated weights for policy 0, policy_version 117113 (0.0035) [2024-06-28 01:15:58,850][06674] Fps is (10 sec: 40960.6, 60 sec: 43692.2, 300 sec: 44153.5). Total num frames: 1918926848. Throughput: 0: 44186.8. Samples: 1821819760. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 01:15:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:15:59,032][06909] Updated weights for policy 0, policy_version 117123 (0.0044) [2024-06-28 01:16:02,383][06909] Updated weights for policy 0, policy_version 117133 (0.0026) [2024-06-28 01:16:03,856][06674] Fps is (10 sec: 45847.7, 60 sec: 44232.4, 300 sec: 44152.9). Total num frames: 1919188992. Throughput: 0: 44296.8. Samples: 1822091540. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 01:16:03,856][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:16:06,397][06909] Updated weights for policy 0, policy_version 117143 (0.0027) [2024-06-28 01:16:08,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1919385600. Throughput: 0: 44167.9. Samples: 1822355260. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 01:16:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:16:09,767][06909] Updated weights for policy 0, policy_version 117153 (0.0025) [2024-06-28 01:16:13,714][06909] Updated weights for policy 0, policy_version 117163 (0.0030) [2024-06-28 01:16:13,850][06674] Fps is (10 sec: 40984.8, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 1919598592. Throughput: 0: 44021.3. Samples: 1822475760. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 01:16:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:16:17,262][06909] Updated weights for policy 0, policy_version 117173 (0.0049) [2024-06-28 01:16:18,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 1919844352. Throughput: 0: 44178.6. Samples: 1822749580. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 01:16:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:16:21,346][06909] Updated weights for policy 0, policy_version 117183 (0.0033) [2024-06-28 01:16:23,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1920040960. Throughput: 0: 44284.3. Samples: 1823019980. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 01:16:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:16:24,707][06909] Updated weights for policy 0, policy_version 117193 (0.0027) [2024-06-28 01:16:28,701][06909] Updated weights for policy 0, policy_version 117203 (0.0030) [2024-06-28 01:16:28,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43963.6, 300 sec: 44153.5). Total num frames: 1920253952. Throughput: 0: 44263.3. Samples: 1823144420. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 01:16:28,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 01:16:31,791][06909] Updated weights for policy 0, policy_version 117213 (0.0034) [2024-06-28 01:16:33,850][06674] Fps is (10 sec: 47513.7, 60 sec: 44509.9, 300 sec: 44209.0). Total num frames: 1920516096. Throughput: 0: 44341.0. Samples: 1823418180. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 01:16:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:16:36,196][06909] Updated weights for policy 0, policy_version 117223 (0.0030) [2024-06-28 01:16:38,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.5, 300 sec: 43986.8). Total num frames: 1920696320. Throughput: 0: 44106.5. Samples: 1823678420. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 01:16:38,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:16:39,549][06909] Updated weights for policy 0, policy_version 117233 (0.0032) [2024-06-28 01:16:43,396][06909] Updated weights for policy 0, policy_version 117243 (0.0026) [2024-06-28 01:16:43,850][06674] Fps is (10 sec: 40960.0, 60 sec: 44237.1, 300 sec: 44209.0). Total num frames: 1920925696. Throughput: 0: 44072.9. Samples: 1823803040. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 01:16:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:16:47,062][06909] Updated weights for policy 0, policy_version 117253 (0.0032) [2024-06-28 01:16:48,850][06674] Fps is (10 sec: 47514.8, 60 sec: 44236.9, 300 sec: 44209.1). Total num frames: 1921171456. Throughput: 0: 43997.1. Samples: 1824071140. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 01:16:48,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 01:16:50,861][06909] Updated weights for policy 0, policy_version 117263 (0.0029) [2024-06-28 01:16:53,603][06887] Signal inference workers to stop experience collection... (25950 times) [2024-06-28 01:16:53,639][06909] InferenceWorker_p0-w0: stopping experience collection (25950 times) [2024-06-28 01:16:53,719][06887] Signal inference workers to resume experience collection... (25950 times) [2024-06-28 01:16:53,720][06909] InferenceWorker_p0-w0: resuming experience collection (25950 times) [2024-06-28 01:16:53,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1921368064. Throughput: 0: 44157.8. Samples: 1824342360. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 01:16:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:16:54,298][06909] Updated weights for policy 0, policy_version 117273 (0.0034) [2024-06-28 01:16:58,334][06909] Updated weights for policy 0, policy_version 117283 (0.0034) [2024-06-28 01:16:58,856][06674] Fps is (10 sec: 40934.9, 60 sec: 44232.3, 300 sec: 44097.1). Total num frames: 1921581056. Throughput: 0: 44297.6. Samples: 1824469420. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 01:16:58,856][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:17:01,905][06909] Updated weights for policy 0, policy_version 117293 (0.0028) [2024-06-28 01:17:03,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43968.2, 300 sec: 44209.1). Total num frames: 1921826816. Throughput: 0: 44082.7. Samples: 1824733300. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 01:17:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:17:05,632][06909] Updated weights for policy 0, policy_version 117303 (0.0036) [2024-06-28 01:17:08,850][06674] Fps is (10 sec: 45902.7, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1922039808. Throughput: 0: 44020.8. Samples: 1825000920. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 01:17:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:17:09,150][06909] Updated weights for policy 0, policy_version 117313 (0.0038) [2024-06-28 01:17:12,883][06909] Updated weights for policy 0, policy_version 117323 (0.0031) [2024-06-28 01:17:13,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 1922236416. Throughput: 0: 44281.9. Samples: 1825137100. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 01:17:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:17:16,576][06909] Updated weights for policy 0, policy_version 117333 (0.0031) [2024-06-28 01:17:18,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 1922498560. Throughput: 0: 44020.3. Samples: 1825399100. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 01:17:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 01:17:20,063][06909] Updated weights for policy 0, policy_version 117343 (0.0038) [2024-06-28 01:17:23,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1922678784. Throughput: 0: 44074.8. Samples: 1825661780. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 01:17:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:17:24,097][06909] Updated weights for policy 0, policy_version 117353 (0.0023) [2024-06-28 01:17:27,974][06909] Updated weights for policy 0, policy_version 117363 (0.0030) [2024-06-28 01:17:28,850][06674] Fps is (10 sec: 40960.4, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 1922908160. Throughput: 0: 44147.0. Samples: 1825789660. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 01:17:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:17:31,615][06909] Updated weights for policy 0, policy_version 117373 (0.0028) [2024-06-28 01:17:33,850][06674] Fps is (10 sec: 45874.4, 60 sec: 43690.5, 300 sec: 44097.9). Total num frames: 1923137536. Throughput: 0: 44038.4. Samples: 1826052880. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 01:17:33,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:17:35,479][06909] Updated weights for policy 0, policy_version 117383 (0.0033) [2024-06-28 01:17:38,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1923350528. Throughput: 0: 43820.8. Samples: 1826314300. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 01:17:38,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:17:39,035][06909] Updated weights for policy 0, policy_version 117393 (0.0027) [2024-06-28 01:17:42,854][06909] Updated weights for policy 0, policy_version 117403 (0.0021) [2024-06-28 01:17:43,850][06674] Fps is (10 sec: 42599.5, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 1923563520. Throughput: 0: 44061.1. Samples: 1826451900. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 01:17:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:17:46,743][06909] Updated weights for policy 0, policy_version 117413 (0.0027) [2024-06-28 01:17:48,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43690.6, 300 sec: 44098.0). Total num frames: 1923792896. Throughput: 0: 43925.8. Samples: 1826709960. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 01:17:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:17:48,869][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000117420_1923809280.pth... [2024-06-28 01:17:48,928][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000116773_1913208832.pth [2024-06-28 01:17:50,108][06909] Updated weights for policy 0, policy_version 117423 (0.0038) [2024-06-28 01:17:53,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1924005888. Throughput: 0: 43882.8. Samples: 1826975640. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 01:17:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:17:53,930][06909] Updated weights for policy 0, policy_version 117433 (0.0040) [2024-06-28 01:17:57,323][06909] Updated weights for policy 0, policy_version 117443 (0.0027) [2024-06-28 01:17:58,852][06674] Fps is (10 sec: 44225.1, 60 sec: 44239.3, 300 sec: 44153.1). Total num frames: 1924235264. Throughput: 0: 43797.4. Samples: 1827108100. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 01:17:58,853][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:18:01,355][06909] Updated weights for policy 0, policy_version 117453 (0.0044) [2024-06-28 01:18:03,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 1924464640. Throughput: 0: 43814.7. Samples: 1827370760. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 01:18:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:18:05,162][06909] Updated weights for policy 0, policy_version 117463 (0.0022) [2024-06-28 01:18:08,513][06909] Updated weights for policy 0, policy_version 117473 (0.0024) [2024-06-28 01:18:08,850][06674] Fps is (10 sec: 44248.6, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 1924677632. Throughput: 0: 43953.0. Samples: 1827639660. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 01:18:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:18:12,636][06909] Updated weights for policy 0, policy_version 117483 (0.0035) [2024-06-28 01:18:13,850][06674] Fps is (10 sec: 42598.2, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 1924890624. Throughput: 0: 44034.6. Samples: 1827771220. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 01:18:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:18:15,181][06887] Signal inference workers to stop experience collection... (26000 times) [2024-06-28 01:18:15,182][06887] Signal inference workers to resume experience collection... (26000 times) [2024-06-28 01:18:15,204][06909] InferenceWorker_p0-w0: stopping experience collection (26000 times) [2024-06-28 01:18:15,204][06909] InferenceWorker_p0-w0: resuming experience collection (26000 times) [2024-06-28 01:18:16,065][06909] Updated weights for policy 0, policy_version 117493 (0.0032) [2024-06-28 01:18:18,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.8, 300 sec: 44153.5). Total num frames: 1925120000. Throughput: 0: 43927.8. Samples: 1828029620. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 01:18:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:18:20,010][06909] Updated weights for policy 0, policy_version 117503 (0.0020) [2024-06-28 01:18:23,547][06909] Updated weights for policy 0, policy_version 117513 (0.0021) [2024-06-28 01:18:23,850][06674] Fps is (10 sec: 44237.6, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 1925332992. Throughput: 0: 44056.6. Samples: 1828296840. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 01:18:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:18:27,482][06909] Updated weights for policy 0, policy_version 117523 (0.0046) [2024-06-28 01:18:28,850][06674] Fps is (10 sec: 44236.2, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1925562368. Throughput: 0: 43958.1. Samples: 1828430020. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 01:18:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:18:31,155][06909] Updated weights for policy 0, policy_version 117533 (0.0031) [2024-06-28 01:18:33,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43963.9, 300 sec: 44098.0). Total num frames: 1925775360. Throughput: 0: 43980.8. Samples: 1828689100. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 01:18:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 01:18:34,963][06909] Updated weights for policy 0, policy_version 117543 (0.0027) [2024-06-28 01:18:38,528][06909] Updated weights for policy 0, policy_version 117553 (0.0041) [2024-06-28 01:18:38,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.8, 300 sec: 44042.6). Total num frames: 1925988352. Throughput: 0: 43999.0. Samples: 1828955600. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 01:18:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:18:42,167][06909] Updated weights for policy 0, policy_version 117563 (0.0023) [2024-06-28 01:18:43,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.7, 300 sec: 44098.3). Total num frames: 1926217728. Throughput: 0: 44052.7. Samples: 1829090360. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 01:18:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:18:45,875][06909] Updated weights for policy 0, policy_version 117573 (0.0031) [2024-06-28 01:18:48,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 44042.7). Total num frames: 1926430720. Throughput: 0: 44058.2. Samples: 1829353380. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 01:18:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:18:49,871][06909] Updated weights for policy 0, policy_version 117583 (0.0026) [2024-06-28 01:18:53,235][06909] Updated weights for policy 0, policy_version 117593 (0.0041) [2024-06-28 01:18:53,850][06674] Fps is (10 sec: 44237.4, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1926660096. Throughput: 0: 43960.0. Samples: 1829617860. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 01:18:53,856][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:18:57,142][06909] Updated weights for policy 0, policy_version 117603 (0.0032) [2024-06-28 01:18:58,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43965.6, 300 sec: 44098.9). Total num frames: 1926873088. Throughput: 0: 44062.7. Samples: 1829754040. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 01:18:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:19:00,957][06909] Updated weights for policy 0, policy_version 117613 (0.0022) [2024-06-28 01:19:03,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1927086080. Throughput: 0: 44062.2. Samples: 1830012420. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 01:19:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:19:04,623][06909] Updated weights for policy 0, policy_version 117623 (0.0029) [2024-06-28 01:19:08,206][06909] Updated weights for policy 0, policy_version 117633 (0.0025) [2024-06-28 01:19:08,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1927331840. Throughput: 0: 44020.8. Samples: 1830277780. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 01:19:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:19:12,092][06909] Updated weights for policy 0, policy_version 117643 (0.0046) [2024-06-28 01:19:13,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 1927544832. Throughput: 0: 44089.4. Samples: 1830414040. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 01:19:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:19:15,724][06909] Updated weights for policy 0, policy_version 117653 (0.0027) [2024-06-28 01:19:18,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 1927741440. Throughput: 0: 44188.5. Samples: 1830677580. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 01:19:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:19:19,630][06909] Updated weights for policy 0, policy_version 117663 (0.0050) [2024-06-28 01:19:23,164][06909] Updated weights for policy 0, policy_version 117673 (0.0040) [2024-06-28 01:19:23,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 1927987200. Throughput: 0: 44097.8. Samples: 1830940000. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 01:19:23,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 01:19:27,017][06909] Updated weights for policy 0, policy_version 117683 (0.0038) [2024-06-28 01:19:28,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 1928200192. Throughput: 0: 44182.3. Samples: 1831078560. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 01:19:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:19:29,917][06887] Signal inference workers to stop experience collection... (26050 times) [2024-06-28 01:19:29,918][06887] Signal inference workers to resume experience collection... (26050 times) [2024-06-28 01:19:29,936][06909] InferenceWorker_p0-w0: stopping experience collection (26050 times) [2024-06-28 01:19:29,936][06909] InferenceWorker_p0-w0: resuming experience collection (26050 times) [2024-06-28 01:19:30,605][06909] Updated weights for policy 0, policy_version 117693 (0.0031) [2024-06-28 01:19:33,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1928413184. Throughput: 0: 44144.0. Samples: 1831339860. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 01:19:33,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-28 01:19:34,645][06909] Updated weights for policy 0, policy_version 117703 (0.0036) [2024-06-28 01:19:37,997][06909] Updated weights for policy 0, policy_version 117713 (0.0037) [2024-06-28 01:19:38,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 1928658944. Throughput: 0: 44144.8. Samples: 1831604380. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 01:19:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:19:42,046][06909] Updated weights for policy 0, policy_version 117723 (0.0031) [2024-06-28 01:19:43,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1928855552. Throughput: 0: 44069.4. Samples: 1831737160. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 01:19:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:19:45,469][06909] Updated weights for policy 0, policy_version 117733 (0.0030) [2024-06-28 01:19:48,856][06674] Fps is (10 sec: 40935.4, 60 sec: 43959.3, 300 sec: 44041.5). Total num frames: 1929068544. Throughput: 0: 44257.6. Samples: 1832004280. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 01:19:48,857][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:19:48,901][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000117742_1929084928.pth... [2024-06-28 01:19:48,957][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000117098_1918533632.pth [2024-06-28 01:19:49,162][06909] Updated weights for policy 0, policy_version 117743 (0.0043) [2024-06-28 01:19:52,615][06909] Updated weights for policy 0, policy_version 117753 (0.0038) [2024-06-28 01:19:53,852][06674] Fps is (10 sec: 47503.7, 60 sec: 44508.3, 300 sec: 44153.5). Total num frames: 1929330688. Throughput: 0: 44201.0. Samples: 1832266920. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 01:19:53,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:19:56,372][06909] Updated weights for policy 0, policy_version 117763 (0.0033) [2024-06-28 01:19:58,850][06674] Fps is (10 sec: 47541.9, 60 sec: 44509.8, 300 sec: 44097.9). Total num frames: 1929543680. Throughput: 0: 44210.1. Samples: 1832403500. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2024-06-28 01:19:58,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 01:19:59,894][06909] Updated weights for policy 0, policy_version 117773 (0.0040) [2024-06-28 01:20:03,856][06674] Fps is (10 sec: 40943.7, 60 sec: 44232.3, 300 sec: 44097.0). Total num frames: 1929740288. Throughput: 0: 44222.9. Samples: 1832667880. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2024-06-28 01:20:03,856][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:20:04,069][06909] Updated weights for policy 0, policy_version 117783 (0.0025) [2024-06-28 01:20:07,397][06909] Updated weights for policy 0, policy_version 117793 (0.0036) [2024-06-28 01:20:08,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.7, 300 sec: 44209.0). Total num frames: 1929986048. Throughput: 0: 44126.7. Samples: 1832925700. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2024-06-28 01:20:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:20:11,489][06909] Updated weights for policy 0, policy_version 117803 (0.0028) [2024-06-28 01:20:13,850][06674] Fps is (10 sec: 44263.6, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 1930182656. Throughput: 0: 44278.6. Samples: 1833071100. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2024-06-28 01:20:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:20:14,615][06909] Updated weights for policy 0, policy_version 117813 (0.0036) [2024-06-28 01:20:18,850][06674] Fps is (10 sec: 40959.8, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 1930395648. Throughput: 0: 44140.0. Samples: 1833326160. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2024-06-28 01:20:18,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:20:19,001][06909] Updated weights for policy 0, policy_version 117823 (0.0035) [2024-06-28 01:20:22,457][06909] Updated weights for policy 0, policy_version 117833 (0.0027) [2024-06-28 01:20:23,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1930641408. Throughput: 0: 44052.5. Samples: 1833586740. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2024-06-28 01:20:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:20:26,390][06909] Updated weights for policy 0, policy_version 117843 (0.0023) [2024-06-28 01:20:28,856][06674] Fps is (10 sec: 45847.7, 60 sec: 44232.3, 300 sec: 44097.0). Total num frames: 1930854400. Throughput: 0: 44138.9. Samples: 1833723680. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2024-06-28 01:20:28,856][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:20:29,678][06909] Updated weights for policy 0, policy_version 117853 (0.0035) [2024-06-28 01:20:33,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1931051008. Throughput: 0: 44086.8. Samples: 1833987920. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2024-06-28 01:20:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:20:33,883][06909] Updated weights for policy 0, policy_version 117863 (0.0028) [2024-06-28 01:20:37,058][06909] Updated weights for policy 0, policy_version 117873 (0.0024) [2024-06-28 01:20:38,850][06674] Fps is (10 sec: 42624.0, 60 sec: 43690.6, 300 sec: 44098.0). Total num frames: 1931280384. Throughput: 0: 44067.3. Samples: 1834249860. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2024-06-28 01:20:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:20:41,298][06909] Updated weights for policy 0, policy_version 117883 (0.0037) [2024-06-28 01:20:43,305][06887] Signal inference workers to stop experience collection... (26100 times) [2024-06-28 01:20:43,305][06887] Signal inference workers to resume experience collection... (26100 times) [2024-06-28 01:20:43,346][06909] InferenceWorker_p0-w0: stopping experience collection (26100 times) [2024-06-28 01:20:43,346][06909] InferenceWorker_p0-w0: resuming experience collection (26100 times) [2024-06-28 01:20:43,850][06674] Fps is (10 sec: 47513.1, 60 sec: 44509.8, 300 sec: 44098.0). Total num frames: 1931526144. Throughput: 0: 44058.2. Samples: 1834386120. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2024-06-28 01:20:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:20:44,528][06909] Updated weights for policy 0, policy_version 117893 (0.0030) [2024-06-28 01:20:48,852][06674] Fps is (10 sec: 42589.9, 60 sec: 43966.6, 300 sec: 43986.6). Total num frames: 1931706368. Throughput: 0: 44018.6. Samples: 1834648540. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2024-06-28 01:20:48,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:20:48,963][06909] Updated weights for policy 0, policy_version 117903 (0.0042) [2024-06-28 01:20:52,194][06909] Updated weights for policy 0, policy_version 117913 (0.0034) [2024-06-28 01:20:53,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43419.0, 300 sec: 44097.9). Total num frames: 1931935744. Throughput: 0: 44011.9. Samples: 1834906240. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2024-06-28 01:20:53,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:20:56,262][06909] Updated weights for policy 0, policy_version 117923 (0.0031) [2024-06-28 01:20:58,850][06674] Fps is (10 sec: 45884.4, 60 sec: 43690.7, 300 sec: 43987.8). Total num frames: 1932165120. Throughput: 0: 43842.6. Samples: 1835044020. Policy #0 lag: (min: 0.0, avg: 10.4, max: 24.0) [2024-06-28 01:20:58,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 01:20:59,409][06909] Updated weights for policy 0, policy_version 117933 (0.0027) [2024-06-28 01:21:03,600][06909] Updated weights for policy 0, policy_version 117943 (0.0034) [2024-06-28 01:21:03,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43968.2, 300 sec: 44042.4). Total num frames: 1932378112. Throughput: 0: 44011.7. Samples: 1835306680. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-28 01:21:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 01:21:06,847][06909] Updated weights for policy 0, policy_version 117953 (0.0039) [2024-06-28 01:21:08,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.7, 300 sec: 44097.9). Total num frames: 1932607488. Throughput: 0: 44205.7. Samples: 1835576000. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-28 01:21:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:21:10,709][06909] Updated weights for policy 0, policy_version 117963 (0.0032) [2024-06-28 01:21:13,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1932820480. Throughput: 0: 44234.1. Samples: 1835713940. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-28 01:21:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:21:14,239][06909] Updated weights for policy 0, policy_version 117973 (0.0027) [2024-06-28 01:21:18,253][06909] Updated weights for policy 0, policy_version 117983 (0.0038) [2024-06-28 01:21:18,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 1933066240. Throughput: 0: 44130.2. Samples: 1835973780. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-28 01:21:18,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-28 01:21:21,578][06909] Updated weights for policy 0, policy_version 117993 (0.0042) [2024-06-28 01:21:23,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43690.6, 300 sec: 44098.0). Total num frames: 1933262848. Throughput: 0: 44372.0. Samples: 1836246600. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-28 01:21:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:21:25,447][06909] Updated weights for policy 0, policy_version 118003 (0.0025) [2024-06-28 01:21:28,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44241.3, 300 sec: 44042.4). Total num frames: 1933508608. Throughput: 0: 44276.0. Samples: 1836378540. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-28 01:21:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:21:28,970][06909] Updated weights for policy 0, policy_version 118013 (0.0035) [2024-06-28 01:21:33,012][06909] Updated weights for policy 0, policy_version 118023 (0.0036) [2024-06-28 01:21:33,850][06674] Fps is (10 sec: 45875.8, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 1933721600. Throughput: 0: 44303.9. Samples: 1836642120. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-28 01:21:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:21:36,410][06909] Updated weights for policy 0, policy_version 118033 (0.0031) [2024-06-28 01:21:38,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 1933934592. Throughput: 0: 44502.3. Samples: 1836908840. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-28 01:21:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:21:40,200][06909] Updated weights for policy 0, policy_version 118043 (0.0033) [2024-06-28 01:21:43,643][06909] Updated weights for policy 0, policy_version 118053 (0.0030) [2024-06-28 01:21:43,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 1934180352. Throughput: 0: 44355.1. Samples: 1837040000. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-28 01:21:43,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 01:21:47,360][06909] Updated weights for policy 0, policy_version 118063 (0.0030) [2024-06-28 01:21:48,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44784.4, 300 sec: 44153.5). Total num frames: 1934393344. Throughput: 0: 44347.9. Samples: 1837302340. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-28 01:21:48,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:21:48,867][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000118066_1934393344.pth... [2024-06-28 01:21:48,924][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000117420_1923809280.pth [2024-06-28 01:21:51,299][06909] Updated weights for policy 0, policy_version 118073 (0.0036) [2024-06-28 01:21:53,850][06674] Fps is (10 sec: 40960.1, 60 sec: 44236.9, 300 sec: 44098.9). Total num frames: 1934589952. Throughput: 0: 44393.8. Samples: 1837573720. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-28 01:21:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:21:55,232][06909] Updated weights for policy 0, policy_version 118083 (0.0026) [2024-06-28 01:21:58,606][06909] Updated weights for policy 0, policy_version 118093 (0.0031) [2024-06-28 01:21:58,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44509.8, 300 sec: 44097.9). Total num frames: 1934835712. Throughput: 0: 44285.5. Samples: 1837706800. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-28 01:21:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:22:02,531][06909] Updated weights for policy 0, policy_version 118103 (0.0033) [2024-06-28 01:22:03,850][06674] Fps is (10 sec: 47513.4, 60 sec: 44782.9, 300 sec: 44153.5). Total num frames: 1935065088. Throughput: 0: 44283.9. Samples: 1837966560. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 01:22:03,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:22:05,941][06909] Updated weights for policy 0, policy_version 118113 (0.0043) [2024-06-28 01:22:08,850][06674] Fps is (10 sec: 40960.8, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 1935245312. Throughput: 0: 44322.3. Samples: 1838241100. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 01:22:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:22:10,066][06909] Updated weights for policy 0, policy_version 118123 (0.0034) [2024-06-28 01:22:13,554][06909] Updated weights for policy 0, policy_version 118133 (0.0046) [2024-06-28 01:22:13,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 1935491072. Throughput: 0: 44006.2. Samples: 1838358820. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 01:22:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 01:22:17,217][06909] Updated weights for policy 0, policy_version 118143 (0.0037) [2024-06-28 01:22:18,850][06674] Fps is (10 sec: 47513.1, 60 sec: 44236.7, 300 sec: 44209.0). Total num frames: 1935720448. Throughput: 0: 44105.6. Samples: 1838626880. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 01:22:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:22:19,599][06887] Signal inference workers to stop experience collection... (26150 times) [2024-06-28 01:22:19,600][06887] Signal inference workers to resume experience collection... (26150 times) [2024-06-28 01:22:19,642][06909] InferenceWorker_p0-w0: stopping experience collection (26150 times) [2024-06-28 01:22:19,642][06909] InferenceWorker_p0-w0: resuming experience collection (26150 times) [2024-06-28 01:22:20,678][06909] Updated weights for policy 0, policy_version 118153 (0.0036) [2024-06-28 01:22:23,856][06674] Fps is (10 sec: 42572.7, 60 sec: 44232.4, 300 sec: 44097.1). Total num frames: 1935917056. Throughput: 0: 44289.2. Samples: 1838902120. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 01:22:23,856][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:22:24,767][06909] Updated weights for policy 0, policy_version 118163 (0.0028) [2024-06-28 01:22:28,224][06909] Updated weights for policy 0, policy_version 118173 (0.0032) [2024-06-28 01:22:28,850][06674] Fps is (10 sec: 44237.4, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1936162816. Throughput: 0: 44209.4. Samples: 1839029420. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 01:22:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:22:32,230][06909] Updated weights for policy 0, policy_version 118183 (0.0029) [2024-06-28 01:22:33,850][06674] Fps is (10 sec: 47542.3, 60 sec: 44509.8, 300 sec: 44209.0). Total num frames: 1936392192. Throughput: 0: 44179.2. Samples: 1839290400. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 01:22:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:22:35,664][06909] Updated weights for policy 0, policy_version 118193 (0.0030) [2024-06-28 01:22:38,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1936588800. Throughput: 0: 44201.4. Samples: 1839562780. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 01:22:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:22:39,633][06909] Updated weights for policy 0, policy_version 118203 (0.0037) [2024-06-28 01:22:42,770][06909] Updated weights for policy 0, policy_version 118213 (0.0037) [2024-06-28 01:22:43,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 1936818176. Throughput: 0: 44135.6. Samples: 1839692900. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 01:22:43,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:22:46,941][06909] Updated weights for policy 0, policy_version 118223 (0.0024) [2024-06-28 01:22:48,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.9, 300 sec: 44153.5). Total num frames: 1937031168. Throughput: 0: 44157.0. Samples: 1839953620. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 01:22:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:22:50,230][06909] Updated weights for policy 0, policy_version 118233 (0.0043) [2024-06-28 01:22:53,850][06674] Fps is (10 sec: 42599.1, 60 sec: 44236.8, 300 sec: 44098.3). Total num frames: 1937244160. Throughput: 0: 44240.9. Samples: 1840231940. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 01:22:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:22:54,355][06909] Updated weights for policy 0, policy_version 118243 (0.0034) [2024-06-28 01:22:57,644][06909] Updated weights for policy 0, policy_version 118253 (0.0024) [2024-06-28 01:22:58,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.9, 300 sec: 44098.0). Total num frames: 1937473536. Throughput: 0: 44391.1. Samples: 1840356420. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 01:22:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:23:01,874][06909] Updated weights for policy 0, policy_version 118263 (0.0036) [2024-06-28 01:23:03,856][06674] Fps is (10 sec: 47484.7, 60 sec: 44232.4, 300 sec: 44208.1). Total num frames: 1937719296. Throughput: 0: 44167.5. Samples: 1840614680. Policy #0 lag: (min: 0.0, avg: 11.8, max: 25.0) [2024-06-28 01:23:03,856][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:23:05,221][06909] Updated weights for policy 0, policy_version 118273 (0.0038) [2024-06-28 01:23:08,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 1937915904. Throughput: 0: 44350.8. Samples: 1840897640. Policy #0 lag: (min: 0.0, avg: 11.8, max: 25.0) [2024-06-28 01:23:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:23:09,179][06909] Updated weights for policy 0, policy_version 118283 (0.0027) [2024-06-28 01:23:12,459][06909] Updated weights for policy 0, policy_version 118293 (0.0031) [2024-06-28 01:23:13,850][06674] Fps is (10 sec: 40984.7, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 1938128896. Throughput: 0: 44226.6. Samples: 1841019620. Policy #0 lag: (min: 0.0, avg: 11.8, max: 25.0) [2024-06-28 01:23:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:23:16,610][06909] Updated weights for policy 0, policy_version 118303 (0.0030) [2024-06-28 01:23:18,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 1938358272. Throughput: 0: 44306.2. Samples: 1841284180. Policy #0 lag: (min: 0.0, avg: 11.8, max: 25.0) [2024-06-28 01:23:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:23:19,902][06909] Updated weights for policy 0, policy_version 118313 (0.0026) [2024-06-28 01:23:23,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44514.4, 300 sec: 44153.5). Total num frames: 1938587648. Throughput: 0: 44288.5. Samples: 1841555760. Policy #0 lag: (min: 0.0, avg: 11.8, max: 25.0) [2024-06-28 01:23:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:23:24,041][06909] Updated weights for policy 0, policy_version 118323 (0.0020) [2024-06-28 01:23:24,214][06887] Signal inference workers to stop experience collection... (26200 times) [2024-06-28 01:23:24,249][06909] InferenceWorker_p0-w0: stopping experience collection (26200 times) [2024-06-28 01:23:24,274][06887] Signal inference workers to resume experience collection... (26200 times) [2024-06-28 01:23:24,280][06909] InferenceWorker_p0-w0: resuming experience collection (26200 times) [2024-06-28 01:23:27,082][06909] Updated weights for policy 0, policy_version 118333 (0.0039) [2024-06-28 01:23:28,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 1938784256. Throughput: 0: 44260.6. Samples: 1841684620. Policy #0 lag: (min: 0.0, avg: 11.8, max: 25.0) [2024-06-28 01:23:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:23:31,577][06909] Updated weights for policy 0, policy_version 118343 (0.0026) [2024-06-28 01:23:33,850][06674] Fps is (10 sec: 45874.5, 60 sec: 44236.7, 300 sec: 44264.6). Total num frames: 1939046400. Throughput: 0: 44343.0. Samples: 1841949060. Policy #0 lag: (min: 0.0, avg: 11.8, max: 25.0) [2024-06-28 01:23:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:23:35,146][06909] Updated weights for policy 0, policy_version 118353 (0.0023) [2024-06-28 01:23:38,850][06674] Fps is (10 sec: 45874.6, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 1939243008. Throughput: 0: 44242.5. Samples: 1842222860. Policy #0 lag: (min: 0.0, avg: 11.8, max: 25.0) [2024-06-28 01:23:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:23:39,101][06909] Updated weights for policy 0, policy_version 118363 (0.0035) [2024-06-28 01:23:42,238][06909] Updated weights for policy 0, policy_version 118373 (0.0035) [2024-06-28 01:23:43,850][06674] Fps is (10 sec: 42598.9, 60 sec: 44236.9, 300 sec: 44209.0). Total num frames: 1939472384. Throughput: 0: 44238.3. Samples: 1842347140. Policy #0 lag: (min: 0.0, avg: 11.8, max: 25.0) [2024-06-28 01:23:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:23:46,414][06909] Updated weights for policy 0, policy_version 118383 (0.0034) [2024-06-28 01:23:48,850][06674] Fps is (10 sec: 45875.9, 60 sec: 44509.8, 300 sec: 44209.0). Total num frames: 1939701760. Throughput: 0: 44422.0. Samples: 1842613400. Policy #0 lag: (min: 0.0, avg: 11.8, max: 25.0) [2024-06-28 01:23:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:23:48,872][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000118390_1939701760.pth... [2024-06-28 01:23:48,923][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000117742_1929084928.pth [2024-06-28 01:23:49,477][06909] Updated weights for policy 0, policy_version 118393 (0.0037) [2024-06-28 01:23:53,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1939898368. Throughput: 0: 44173.4. Samples: 1842885440. Policy #0 lag: (min: 0.0, avg: 11.8, max: 25.0) [2024-06-28 01:23:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:23:53,977][06909] Updated weights for policy 0, policy_version 118403 (0.0029) [2024-06-28 01:23:56,718][06909] Updated weights for policy 0, policy_version 118413 (0.0043) [2024-06-28 01:23:58,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 1940127744. Throughput: 0: 44213.8. Samples: 1843009240. Policy #0 lag: (min: 0.0, avg: 11.8, max: 25.0) [2024-06-28 01:23:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:24:01,252][06909] Updated weights for policy 0, policy_version 118423 (0.0031) [2024-06-28 01:24:03,850][06674] Fps is (10 sec: 49151.8, 60 sec: 44514.4, 300 sec: 44264.6). Total num frames: 1940389888. Throughput: 0: 44270.3. Samples: 1843276340. Policy #0 lag: (min: 0.0, avg: 11.8, max: 25.0) [2024-06-28 01:24:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:24:04,110][06909] Updated weights for policy 0, policy_version 118433 (0.0036) [2024-06-28 01:24:08,766][06909] Updated weights for policy 0, policy_version 118443 (0.0022) [2024-06-28 01:24:08,852][06674] Fps is (10 sec: 44227.5, 60 sec: 44235.3, 300 sec: 44153.2). Total num frames: 1940570112. Throughput: 0: 44149.4. Samples: 1843542580. Policy #0 lag: (min: 0.0, avg: 12.2, max: 23.0) [2024-06-28 01:24:08,853][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 01:24:12,173][06909] Updated weights for policy 0, policy_version 118453 (0.0031) [2024-06-28 01:24:13,850][06674] Fps is (10 sec: 39321.9, 60 sec: 44236.9, 300 sec: 44209.0). Total num frames: 1940783104. Throughput: 0: 44004.1. Samples: 1843664800. Policy #0 lag: (min: 0.0, avg: 12.2, max: 23.0) [2024-06-28 01:24:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:24:16,273][06909] Updated weights for policy 0, policy_version 118463 (0.0021) [2024-06-28 01:24:18,850][06674] Fps is (10 sec: 47523.2, 60 sec: 44782.9, 300 sec: 44264.6). Total num frames: 1941045248. Throughput: 0: 44162.2. Samples: 1843936360. Policy #0 lag: (min: 0.0, avg: 12.2, max: 23.0) [2024-06-28 01:24:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:24:19,355][06909] Updated weights for policy 0, policy_version 118473 (0.0031) [2024-06-28 01:24:23,793][06909] Updated weights for policy 0, policy_version 118483 (0.0027) [2024-06-28 01:24:23,852][06674] Fps is (10 sec: 44227.3, 60 sec: 43962.2, 300 sec: 44153.2). Total num frames: 1941225472. Throughput: 0: 44115.0. Samples: 1844208120. Policy #0 lag: (min: 0.0, avg: 12.2, max: 23.0) [2024-06-28 01:24:23,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:24:26,542][06909] Updated weights for policy 0, policy_version 118493 (0.0031) [2024-06-28 01:24:28,850][06674] Fps is (10 sec: 40959.8, 60 sec: 44509.8, 300 sec: 44209.0). Total num frames: 1941454848. Throughput: 0: 44093.2. Samples: 1844331340. Policy #0 lag: (min: 0.0, avg: 12.2, max: 23.0) [2024-06-28 01:24:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 01:24:30,956][06909] Updated weights for policy 0, policy_version 118503 (0.0035) [2024-06-28 01:24:33,806][06909] Updated weights for policy 0, policy_version 118513 (0.0019) [2024-06-28 01:24:33,850][06674] Fps is (10 sec: 49161.4, 60 sec: 44509.8, 300 sec: 44264.6). Total num frames: 1941716992. Throughput: 0: 44235.4. Samples: 1844604000. Policy #0 lag: (min: 0.0, avg: 12.2, max: 23.0) [2024-06-28 01:24:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:24:38,167][06909] Updated weights for policy 0, policy_version 118523 (0.0031) [2024-06-28 01:24:38,850][06674] Fps is (10 sec: 42599.3, 60 sec: 43963.9, 300 sec: 44153.5). Total num frames: 1941880832. Throughput: 0: 44148.0. Samples: 1844872100. Policy #0 lag: (min: 0.0, avg: 12.2, max: 23.0) [2024-06-28 01:24:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:24:40,216][06887] Signal inference workers to stop experience collection... (26250 times) [2024-06-28 01:24:40,216][06887] Signal inference workers to resume experience collection... (26250 times) [2024-06-28 01:24:40,238][06909] InferenceWorker_p0-w0: stopping experience collection (26250 times) [2024-06-28 01:24:40,238][06909] InferenceWorker_p0-w0: resuming experience collection (26250 times) [2024-06-28 01:24:41,173][06909] Updated weights for policy 0, policy_version 118533 (0.0040) [2024-06-28 01:24:43,850][06674] Fps is (10 sec: 40960.2, 60 sec: 44236.7, 300 sec: 44265.5). Total num frames: 1942126592. Throughput: 0: 44137.7. Samples: 1844995440. Policy #0 lag: (min: 0.0, avg: 12.2, max: 23.0) [2024-06-28 01:24:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:24:45,663][06909] Updated weights for policy 0, policy_version 118543 (0.0027) [2024-06-28 01:24:48,762][06909] Updated weights for policy 0, policy_version 118553 (0.0035) [2024-06-28 01:24:48,850][06674] Fps is (10 sec: 49151.6, 60 sec: 44509.8, 300 sec: 44209.3). Total num frames: 1942372352. Throughput: 0: 44264.0. Samples: 1845268220. Policy #0 lag: (min: 0.0, avg: 12.2, max: 23.0) [2024-06-28 01:24:48,856][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 01:24:52,985][06909] Updated weights for policy 0, policy_version 118563 (0.0025) [2024-06-28 01:24:53,850][06674] Fps is (10 sec: 44237.4, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 1942568960. Throughput: 0: 44473.2. Samples: 1845543780. Policy #0 lag: (min: 0.0, avg: 12.2, max: 23.0) [2024-06-28 01:24:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:24:56,120][06909] Updated weights for policy 0, policy_version 118573 (0.0031) [2024-06-28 01:24:58,850][06674] Fps is (10 sec: 42598.1, 60 sec: 44509.8, 300 sec: 44265.5). Total num frames: 1942798336. Throughput: 0: 44636.3. Samples: 1845673440. Policy #0 lag: (min: 0.0, avg: 12.2, max: 23.0) [2024-06-28 01:24:58,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:25:00,537][06909] Updated weights for policy 0, policy_version 118583 (0.0032) [2024-06-28 01:25:03,437][06909] Updated weights for policy 0, policy_version 118593 (0.0037) [2024-06-28 01:25:03,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 1943027712. Throughput: 0: 44628.1. Samples: 1845944620. Policy #0 lag: (min: 0.0, avg: 12.2, max: 23.0) [2024-06-28 01:25:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:25:08,038][06909] Updated weights for policy 0, policy_version 118603 (0.0033) [2024-06-28 01:25:08,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44511.4, 300 sec: 44264.6). Total num frames: 1943240704. Throughput: 0: 44248.7. Samples: 1846199220. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 01:25:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:25:10,660][06909] Updated weights for policy 0, policy_version 118613 (0.0033) [2024-06-28 01:25:13,850][06674] Fps is (10 sec: 42598.2, 60 sec: 44509.7, 300 sec: 44264.6). Total num frames: 1943453696. Throughput: 0: 44478.7. Samples: 1846332880. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 01:25:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:25:15,217][06909] Updated weights for policy 0, policy_version 118623 (0.0029) [2024-06-28 01:25:18,134][06909] Updated weights for policy 0, policy_version 118633 (0.0025) [2024-06-28 01:25:18,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 1943683072. Throughput: 0: 44384.9. Samples: 1846601320. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 01:25:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:25:22,700][06909] Updated weights for policy 0, policy_version 118643 (0.0026) [2024-06-28 01:25:23,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44511.3, 300 sec: 44209.9). Total num frames: 1943896064. Throughput: 0: 44230.4. Samples: 1846862480. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 01:25:23,856][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:25:25,780][06909] Updated weights for policy 0, policy_version 118653 (0.0029) [2024-06-28 01:25:28,852][06674] Fps is (10 sec: 42590.1, 60 sec: 44235.4, 300 sec: 44264.3). Total num frames: 1944109056. Throughput: 0: 44459.8. Samples: 1846996220. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 01:25:28,852][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:25:29,992][06909] Updated weights for policy 0, policy_version 118663 (0.0046) [2024-06-28 01:25:33,506][06909] Updated weights for policy 0, policy_version 118673 (0.0042) [2024-06-28 01:25:33,852][06674] Fps is (10 sec: 44228.4, 60 sec: 43689.3, 300 sec: 44264.3). Total num frames: 1944338432. Throughput: 0: 44264.6. Samples: 1847260220. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 01:25:33,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:25:37,650][06909] Updated weights for policy 0, policy_version 118683 (0.0029) [2024-06-28 01:25:38,850][06674] Fps is (10 sec: 45884.3, 60 sec: 44782.8, 300 sec: 44209.0). Total num frames: 1944567808. Throughput: 0: 43991.4. Samples: 1847523400. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 01:25:38,850][06674] Avg episode reward: [(0, '0.400')] [2024-06-28 01:25:40,991][06909] Updated weights for policy 0, policy_version 118693 (0.0036) [2024-06-28 01:25:43,850][06674] Fps is (10 sec: 42605.7, 60 sec: 43963.5, 300 sec: 44264.8). Total num frames: 1944764416. Throughput: 0: 43984.2. Samples: 1847652740. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 01:25:43,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:25:45,201][06909] Updated weights for policy 0, policy_version 118703 (0.0025) [2024-06-28 01:25:48,551][06909] Updated weights for policy 0, policy_version 118713 (0.0025) [2024-06-28 01:25:48,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43690.7, 300 sec: 44264.6). Total num frames: 1944993792. Throughput: 0: 43556.5. Samples: 1847904660. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 01:25:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:25:48,859][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000118713_1944993792.pth... [2024-06-28 01:25:48,920][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000118066_1934393344.pth [2024-06-28 01:25:52,610][06909] Updated weights for policy 0, policy_version 118723 (0.0034) [2024-06-28 01:25:53,852][06674] Fps is (10 sec: 44229.2, 60 sec: 43962.2, 300 sec: 44208.7). Total num frames: 1945206784. Throughput: 0: 43898.9. Samples: 1848174760. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 01:25:53,853][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:25:56,074][06909] Updated weights for policy 0, policy_version 118733 (0.0027) [2024-06-28 01:25:58,414][06887] Signal inference workers to stop experience collection... (26300 times) [2024-06-28 01:25:58,414][06887] Signal inference workers to resume experience collection... (26300 times) [2024-06-28 01:25:58,454][06909] InferenceWorker_p0-w0: stopping experience collection (26300 times) [2024-06-28 01:25:58,454][06909] InferenceWorker_p0-w0: resuming experience collection (26300 times) [2024-06-28 01:25:58,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.7, 300 sec: 44209.0). Total num frames: 1945419776. Throughput: 0: 43768.5. Samples: 1848302460. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 01:25:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:26:00,171][06909] Updated weights for policy 0, policy_version 118743 (0.0032) [2024-06-28 01:26:03,645][06909] Updated weights for policy 0, policy_version 118753 (0.0049) [2024-06-28 01:26:03,856][06674] Fps is (10 sec: 44219.0, 60 sec: 43686.2, 300 sec: 44208.1). Total num frames: 1945649152. Throughput: 0: 43720.0. Samples: 1848568980. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 01:26:03,857][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:26:07,597][06909] Updated weights for policy 0, policy_version 118763 (0.0036) [2024-06-28 01:26:08,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.8, 300 sec: 44264.6). Total num frames: 1945878528. Throughput: 0: 43718.4. Samples: 1848829800. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2024-06-28 01:26:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:26:11,408][06909] Updated weights for policy 0, policy_version 118773 (0.0038) [2024-06-28 01:26:13,850][06674] Fps is (10 sec: 44263.3, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 1946091520. Throughput: 0: 43881.9. Samples: 1848970820. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2024-06-28 01:26:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 01:26:14,739][06909] Updated weights for policy 0, policy_version 118783 (0.0027) [2024-06-28 01:26:18,850][06674] Fps is (10 sec: 40959.3, 60 sec: 43417.6, 300 sec: 44153.5). Total num frames: 1946288128. Throughput: 0: 43778.8. Samples: 1849230180. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2024-06-28 01:26:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:26:19,079][06909] Updated weights for policy 0, policy_version 118793 (0.0038) [2024-06-28 01:26:22,177][06909] Updated weights for policy 0, policy_version 118803 (0.0044) [2024-06-28 01:26:23,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 1946533888. Throughput: 0: 43804.0. Samples: 1849494580. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2024-06-28 01:26:23,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:26:26,490][06909] Updated weights for policy 0, policy_version 118813 (0.0036) [2024-06-28 01:26:28,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43965.1, 300 sec: 44153.5). Total num frames: 1946746880. Throughput: 0: 44033.5. Samples: 1849634240. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2024-06-28 01:26:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:26:29,857][06909] Updated weights for policy 0, policy_version 118823 (0.0032) [2024-06-28 01:26:33,802][06909] Updated weights for policy 0, policy_version 118833 (0.0031) [2024-06-28 01:26:33,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43692.1, 300 sec: 44153.5). Total num frames: 1946959872. Throughput: 0: 44019.9. Samples: 1849885560. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2024-06-28 01:26:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:26:37,216][06909] Updated weights for policy 0, policy_version 118843 (0.0030) [2024-06-28 01:26:38,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 1947189248. Throughput: 0: 43874.4. Samples: 1850149020. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2024-06-28 01:26:38,853][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:26:41,622][06909] Updated weights for policy 0, policy_version 118853 (0.0030) [2024-06-28 01:26:43,852][06674] Fps is (10 sec: 44226.0, 60 sec: 43962.1, 300 sec: 44097.6). Total num frames: 1947402240. Throughput: 0: 44156.6. Samples: 1850289620. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2024-06-28 01:26:43,853][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:26:44,798][06909] Updated weights for policy 0, policy_version 118863 (0.0035) [2024-06-28 01:26:48,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43417.6, 300 sec: 44098.0). Total num frames: 1947598848. Throughput: 0: 43962.9. Samples: 1850547040. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2024-06-28 01:26:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:26:48,998][06909] Updated weights for policy 0, policy_version 118873 (0.0030) [2024-06-28 01:26:52,158][06909] Updated weights for policy 0, policy_version 118883 (0.0031) [2024-06-28 01:26:53,850][06674] Fps is (10 sec: 45886.5, 60 sec: 44238.3, 300 sec: 44153.5). Total num frames: 1947860992. Throughput: 0: 43889.2. Samples: 1850804820. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2024-06-28 01:26:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:26:56,268][06909] Updated weights for policy 0, policy_version 118893 (0.0033) [2024-06-28 01:26:58,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1948057600. Throughput: 0: 43770.8. Samples: 1850940500. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2024-06-28 01:26:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:26:59,409][06909] Updated weights for policy 0, policy_version 118903 (0.0043) [2024-06-28 01:27:03,723][06909] Updated weights for policy 0, policy_version 118913 (0.0023) [2024-06-28 01:27:03,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43695.0, 300 sec: 44153.5). Total num frames: 1948270592. Throughput: 0: 43810.7. Samples: 1851201660. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2024-06-28 01:27:03,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:27:07,051][06909] Updated weights for policy 0, policy_version 118923 (0.0037) [2024-06-28 01:27:08,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 1948499968. Throughput: 0: 43922.3. Samples: 1851471080. Policy #0 lag: (min: 0.0, avg: 11.2, max: 24.0) [2024-06-28 01:27:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:27:11,202][06909] Updated weights for policy 0, policy_version 118933 (0.0031) [2024-06-28 01:27:13,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 1948729344. Throughput: 0: 43764.5. Samples: 1851603640. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 01:27:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:27:14,352][06909] Updated weights for policy 0, policy_version 118943 (0.0039) [2024-06-28 01:27:18,406][06909] Updated weights for policy 0, policy_version 118953 (0.0038) [2024-06-28 01:27:18,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43963.8, 300 sec: 44098.8). Total num frames: 1948925952. Throughput: 0: 43948.5. Samples: 1851863240. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 01:27:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:27:21,863][06909] Updated weights for policy 0, policy_version 118963 (0.0020) [2024-06-28 01:27:23,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 1949171712. Throughput: 0: 43939.0. Samples: 1852126280. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 01:27:23,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:27:25,779][06909] Updated weights for policy 0, policy_version 118973 (0.0035) [2024-06-28 01:27:28,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1949384704. Throughput: 0: 43925.1. Samples: 1852266140. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 01:27:28,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 01:27:29,366][06909] Updated weights for policy 0, policy_version 118983 (0.0033) [2024-06-28 01:27:30,415][06887] Signal inference workers to stop experience collection... (26350 times) [2024-06-28 01:27:30,416][06887] Signal inference workers to resume experience collection... (26350 times) [2024-06-28 01:27:30,456][06909] InferenceWorker_p0-w0: stopping experience collection (26350 times) [2024-06-28 01:27:30,456][06909] InferenceWorker_p0-w0: resuming experience collection (26350 times) [2024-06-28 01:27:33,319][06909] Updated weights for policy 0, policy_version 118993 (0.0035) [2024-06-28 01:27:33,850][06674] Fps is (10 sec: 40960.7, 60 sec: 43690.8, 300 sec: 44042.4). Total num frames: 1949581312. Throughput: 0: 43867.6. Samples: 1852521080. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 01:27:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:27:37,021][06909] Updated weights for policy 0, policy_version 119003 (0.0040) [2024-06-28 01:27:38,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 1949827072. Throughput: 0: 43870.7. Samples: 1852779000. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 01:27:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:27:41,050][06909] Updated weights for policy 0, policy_version 119013 (0.0041) [2024-06-28 01:27:43,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43965.6, 300 sec: 44097.9). Total num frames: 1950040064. Throughput: 0: 43957.3. Samples: 1852918580. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 01:27:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:27:44,245][06909] Updated weights for policy 0, policy_version 119023 (0.0035) [2024-06-28 01:27:48,581][06909] Updated weights for policy 0, policy_version 119033 (0.0030) [2024-06-28 01:27:48,850][06674] Fps is (10 sec: 42598.8, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 1950253056. Throughput: 0: 44041.9. Samples: 1853183540. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 01:27:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:27:48,936][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000119035_1950269440.pth... [2024-06-28 01:27:48,987][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000118390_1939701760.pth [2024-06-28 01:27:51,469][06909] Updated weights for policy 0, policy_version 119043 (0.0037) [2024-06-28 01:27:53,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.7, 300 sec: 44097.9). Total num frames: 1950482432. Throughput: 0: 43816.8. Samples: 1853442840. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 01:27:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:27:55,978][06909] Updated weights for policy 0, policy_version 119053 (0.0036) [2024-06-28 01:27:58,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44236.8, 300 sec: 44043.3). Total num frames: 1950711808. Throughput: 0: 43932.5. Samples: 1853580600. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 01:27:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:27:59,125][06909] Updated weights for policy 0, policy_version 119063 (0.0036) [2024-06-28 01:28:03,371][06909] Updated weights for policy 0, policy_version 119073 (0.0029) [2024-06-28 01:28:03,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1950908416. Throughput: 0: 43990.7. Samples: 1853842820. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 01:28:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:28:06,844][06909] Updated weights for policy 0, policy_version 119083 (0.0030) [2024-06-28 01:28:08,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1951154176. Throughput: 0: 43938.8. Samples: 1854103520. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 01:28:08,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 01:28:10,725][06909] Updated weights for policy 0, policy_version 119093 (0.0034) [2024-06-28 01:28:13,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1951350784. Throughput: 0: 43840.1. Samples: 1854238940. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 01:28:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:28:14,170][06909] Updated weights for policy 0, policy_version 119103 (0.0040) [2024-06-28 01:28:18,235][06909] Updated weights for policy 0, policy_version 119113 (0.0044) [2024-06-28 01:28:18,850][06674] Fps is (10 sec: 42597.7, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1951580160. Throughput: 0: 44024.7. Samples: 1854502200. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 01:28:18,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:28:21,671][06909] Updated weights for policy 0, policy_version 119123 (0.0034) [2024-06-28 01:28:23,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 1951809536. Throughput: 0: 44066.7. Samples: 1854762000. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 01:28:23,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:28:25,698][06909] Updated weights for policy 0, policy_version 119133 (0.0031) [2024-06-28 01:28:28,837][06909] Updated weights for policy 0, policy_version 119143 (0.0034) [2024-06-28 01:28:28,850][06674] Fps is (10 sec: 45875.8, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1952038912. Throughput: 0: 44114.7. Samples: 1854903740. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 01:28:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:28:33,277][06909] Updated weights for policy 0, policy_version 119153 (0.0027) [2024-06-28 01:28:33,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1952219136. Throughput: 0: 43982.2. Samples: 1855162740. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 01:28:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:28:36,365][06909] Updated weights for policy 0, policy_version 119163 (0.0039) [2024-06-28 01:28:38,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 1952481280. Throughput: 0: 44027.2. Samples: 1855424060. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 01:28:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:28:40,606][06909] Updated weights for policy 0, policy_version 119173 (0.0026) [2024-06-28 01:28:43,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 1952661504. Throughput: 0: 43860.5. Samples: 1855554320. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 01:28:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:28:44,040][06909] Updated weights for policy 0, policy_version 119183 (0.0039) [2024-06-28 01:28:48,169][06909] Updated weights for policy 0, policy_version 119193 (0.0032) [2024-06-28 01:28:48,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1952890880. Throughput: 0: 43952.5. Samples: 1855820680. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 01:28:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:28:51,415][06909] Updated weights for policy 0, policy_version 119203 (0.0026) [2024-06-28 01:28:51,871][06887] Signal inference workers to stop experience collection... (26400 times) [2024-06-28 01:28:51,871][06887] Signal inference workers to resume experience collection... (26400 times) [2024-06-28 01:28:51,904][06909] InferenceWorker_p0-w0: stopping experience collection (26400 times) [2024-06-28 01:28:51,904][06909] InferenceWorker_p0-w0: resuming experience collection (26400 times) [2024-06-28 01:28:53,850][06674] Fps is (10 sec: 47513.4, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 1953136640. Throughput: 0: 43905.3. Samples: 1856079260. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 01:28:53,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:28:55,370][06909] Updated weights for policy 0, policy_version 119213 (0.0027) [2024-06-28 01:28:58,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 1953333248. Throughput: 0: 44050.6. Samples: 1856221220. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 01:28:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:28:58,878][06909] Updated weights for policy 0, policy_version 119223 (0.0026) [2024-06-28 01:29:03,212][06909] Updated weights for policy 0, policy_version 119233 (0.0035) [2024-06-28 01:29:03,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43963.8, 300 sec: 43987.2). Total num frames: 1953546240. Throughput: 0: 43996.6. Samples: 1856482040. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 01:29:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 01:29:06,059][06909] Updated weights for policy 0, policy_version 119243 (0.0036) [2024-06-28 01:29:08,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 1953792000. Throughput: 0: 44064.4. Samples: 1856744900. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 01:29:08,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:29:10,346][06909] Updated weights for policy 0, policy_version 119253 (0.0040) [2024-06-28 01:29:13,435][06909] Updated weights for policy 0, policy_version 119263 (0.0028) [2024-06-28 01:29:13,852][06674] Fps is (10 sec: 47502.0, 60 sec: 44508.1, 300 sec: 43986.5). Total num frames: 1954021376. Throughput: 0: 44097.7. Samples: 1856888240. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 01:29:13,853][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:29:17,894][06909] Updated weights for policy 0, policy_version 119273 (0.0027) [2024-06-28 01:29:18,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43690.8, 300 sec: 43987.2). Total num frames: 1954201600. Throughput: 0: 44186.2. Samples: 1857151120. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 01:29:18,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:29:20,906][06909] Updated weights for policy 0, policy_version 119283 (0.0038) [2024-06-28 01:29:23,850][06674] Fps is (10 sec: 44247.4, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1954463744. Throughput: 0: 44002.6. Samples: 1857404180. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 01:29:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:29:25,148][06909] Updated weights for policy 0, policy_version 119293 (0.0022) [2024-06-28 01:29:28,371][06909] Updated weights for policy 0, policy_version 119303 (0.0028) [2024-06-28 01:29:28,850][06674] Fps is (10 sec: 47513.5, 60 sec: 43963.7, 300 sec: 43931.4). Total num frames: 1954676736. Throughput: 0: 44399.6. Samples: 1857552300. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 01:29:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:29:32,310][06909] Updated weights for policy 0, policy_version 119313 (0.0022) [2024-06-28 01:29:33,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44509.9, 300 sec: 44097.9). Total num frames: 1954889728. Throughput: 0: 44258.3. Samples: 1857812300. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 01:29:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:29:35,578][06909] Updated weights for policy 0, policy_version 119323 (0.0042) [2024-06-28 01:29:38,852][06674] Fps is (10 sec: 44227.6, 60 sec: 43962.2, 300 sec: 44042.1). Total num frames: 1955119104. Throughput: 0: 44300.2. Samples: 1858072860. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 01:29:38,853][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:29:40,004][06909] Updated weights for policy 0, policy_version 119333 (0.0032) [2024-06-28 01:29:42,813][06909] Updated weights for policy 0, policy_version 119343 (0.0047) [2024-06-28 01:29:43,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44782.9, 300 sec: 43986.9). Total num frames: 1955348480. Throughput: 0: 44186.2. Samples: 1858209600. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 01:29:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:29:47,133][06909] Updated weights for policy 0, policy_version 119353 (0.0031) [2024-06-28 01:29:48,850][06674] Fps is (10 sec: 42607.0, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1955545088. Throughput: 0: 44335.0. Samples: 1858477120. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 01:29:48,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:29:48,861][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000119357_1955545088.pth... [2024-06-28 01:29:48,917][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000118713_1944993792.pth [2024-06-28 01:29:50,274][06909] Updated weights for policy 0, policy_version 119363 (0.0029) [2024-06-28 01:29:53,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43690.7, 300 sec: 43931.4). Total num frames: 1955758080. Throughput: 0: 44245.0. Samples: 1858735920. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 01:29:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:29:54,684][06909] Updated weights for policy 0, policy_version 119373 (0.0027) [2024-06-28 01:29:57,922][06909] Updated weights for policy 0, policy_version 119383 (0.0028) [2024-06-28 01:29:58,850][06674] Fps is (10 sec: 47513.8, 60 sec: 44782.9, 300 sec: 44042.4). Total num frames: 1956020224. Throughput: 0: 44111.7. Samples: 1858873160. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 01:29:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:30:02,233][06909] Updated weights for policy 0, policy_version 119393 (0.0039) [2024-06-28 01:30:03,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44236.7, 300 sec: 43931.3). Total num frames: 1956200448. Throughput: 0: 44096.8. Samples: 1859135480. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 01:30:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:30:05,178][06909] Updated weights for policy 0, policy_version 119403 (0.0035) [2024-06-28 01:30:08,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1956429824. Throughput: 0: 44303.9. Samples: 1859397860. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 01:30:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:30:09,432][06909] Updated weights for policy 0, policy_version 119413 (0.0035) [2024-06-28 01:30:12,649][06909] Updated weights for policy 0, policy_version 119423 (0.0036) [2024-06-28 01:30:13,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43965.5, 300 sec: 43986.9). Total num frames: 1956659200. Throughput: 0: 43937.3. Samples: 1859529480. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 01:30:13,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:30:17,321][06909] Updated weights for policy 0, policy_version 119433 (0.0033) [2024-06-28 01:30:18,850][06674] Fps is (10 sec: 42598.8, 60 sec: 44236.8, 300 sec: 43931.4). Total num frames: 1956855808. Throughput: 0: 44092.4. Samples: 1859796460. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 01:30:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:30:19,897][06909] Updated weights for policy 0, policy_version 119443 (0.0033) [2024-06-28 01:30:23,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 44042.7). Total num frames: 1957101568. Throughput: 0: 44094.0. Samples: 1860057000. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 01:30:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 01:30:24,512][06909] Updated weights for policy 0, policy_version 119453 (0.0039) [2024-06-28 01:30:26,181][06887] Signal inference workers to stop experience collection... (26450 times) [2024-06-28 01:30:26,183][06887] Signal inference workers to resume experience collection... (26450 times) [2024-06-28 01:30:26,203][06909] InferenceWorker_p0-w0: stopping experience collection (26450 times) [2024-06-28 01:30:26,203][06909] InferenceWorker_p0-w0: resuming experience collection (26450 times) [2024-06-28 01:30:27,553][06909] Updated weights for policy 0, policy_version 119463 (0.0026) [2024-06-28 01:30:28,852][06674] Fps is (10 sec: 47503.7, 60 sec: 44235.3, 300 sec: 44042.4). Total num frames: 1957330944. Throughput: 0: 43943.7. Samples: 1860187160. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 01:30:28,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:30:32,044][06909] Updated weights for policy 0, policy_version 119473 (0.0041) [2024-06-28 01:30:33,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 1957511168. Throughput: 0: 43960.1. Samples: 1860455320. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 01:30:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:30:35,107][06909] Updated weights for policy 0, policy_version 119483 (0.0038) [2024-06-28 01:30:38,850][06674] Fps is (10 sec: 40968.2, 60 sec: 43692.1, 300 sec: 43986.9). Total num frames: 1957740544. Throughput: 0: 44021.2. Samples: 1860716880. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 01:30:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:30:39,405][06909] Updated weights for policy 0, policy_version 119493 (0.0039) [2024-06-28 01:30:42,517][06909] Updated weights for policy 0, policy_version 119503 (0.0023) [2024-06-28 01:30:43,850][06674] Fps is (10 sec: 47513.0, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1957986304. Throughput: 0: 43956.4. Samples: 1860851200. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 01:30:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 01:30:46,584][06909] Updated weights for policy 0, policy_version 119513 (0.0031) [2024-06-28 01:30:48,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44509.9, 300 sec: 44098.3). Total num frames: 1958215680. Throughput: 0: 44211.6. Samples: 1861125000. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 01:30:48,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:30:49,763][06909] Updated weights for policy 0, policy_version 119523 (0.0031) [2024-06-28 01:30:53,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1958412288. Throughput: 0: 44005.4. Samples: 1861378100. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 01:30:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:30:54,195][06909] Updated weights for policy 0, policy_version 119533 (0.0028) [2024-06-28 01:30:57,467][06909] Updated weights for policy 0, policy_version 119543 (0.0029) [2024-06-28 01:30:58,852][06674] Fps is (10 sec: 42591.5, 60 sec: 43689.4, 300 sec: 44043.1). Total num frames: 1958641664. Throughput: 0: 44022.4. Samples: 1861510560. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 01:30:58,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:31:01,751][06909] Updated weights for policy 0, policy_version 119553 (0.0036) [2024-06-28 01:31:03,850][06674] Fps is (10 sec: 44236.2, 60 sec: 44236.8, 300 sec: 43986.8). Total num frames: 1958854656. Throughput: 0: 44260.3. Samples: 1861788180. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 01:31:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:31:04,824][06909] Updated weights for policy 0, policy_version 119563 (0.0033) [2024-06-28 01:31:08,850][06674] Fps is (10 sec: 40966.9, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 1959051264. Throughput: 0: 44149.4. Samples: 1862043720. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 01:31:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:31:09,502][06909] Updated weights for policy 0, policy_version 119573 (0.0033) [2024-06-28 01:31:12,473][06909] Updated weights for policy 0, policy_version 119583 (0.0023) [2024-06-28 01:31:13,850][06674] Fps is (10 sec: 45876.0, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1959313408. Throughput: 0: 44180.7. Samples: 1862175200. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 01:31:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:31:16,840][06909] Updated weights for policy 0, policy_version 119593 (0.0041) [2024-06-28 01:31:18,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1959510016. Throughput: 0: 44041.8. Samples: 1862437200. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-28 01:31:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:31:19,738][06909] Updated weights for policy 0, policy_version 119603 (0.0036) [2024-06-28 01:31:23,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1959723008. Throughput: 0: 44148.5. Samples: 1862703560. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-28 01:31:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:31:24,532][06909] Updated weights for policy 0, policy_version 119613 (0.0024) [2024-06-28 01:31:27,271][06909] Updated weights for policy 0, policy_version 119623 (0.0025) [2024-06-28 01:31:28,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43965.2, 300 sec: 44098.0). Total num frames: 1959968768. Throughput: 0: 43936.5. Samples: 1862828340. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-28 01:31:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:31:31,740][06909] Updated weights for policy 0, policy_version 119633 (0.0040) [2024-06-28 01:31:33,852][06674] Fps is (10 sec: 45865.8, 60 sec: 44508.3, 300 sec: 44042.1). Total num frames: 1960181760. Throughput: 0: 43913.2. Samples: 1863101180. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-28 01:31:33,852][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:31:34,647][06909] Updated weights for policy 0, policy_version 119643 (0.0021) [2024-06-28 01:31:38,850][06674] Fps is (10 sec: 39322.1, 60 sec: 43690.8, 300 sec: 43931.7). Total num frames: 1960361984. Throughput: 0: 44267.2. Samples: 1863370120. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-28 01:31:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:31:39,071][06909] Updated weights for policy 0, policy_version 119653 (0.0034) [2024-06-28 01:31:42,103][06909] Updated weights for policy 0, policy_version 119663 (0.0045) [2024-06-28 01:31:43,850][06674] Fps is (10 sec: 44246.3, 60 sec: 43963.9, 300 sec: 44153.5). Total num frames: 1960624128. Throughput: 0: 44014.6. Samples: 1863491140. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-28 01:31:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 01:31:46,798][06909] Updated weights for policy 0, policy_version 119673 (0.0027) [2024-06-28 01:31:48,850][06674] Fps is (10 sec: 47513.4, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1960837120. Throughput: 0: 43838.8. Samples: 1863760920. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-28 01:31:48,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 01:31:48,908][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000119681_1960853504.pth... [2024-06-28 01:31:48,956][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000119035_1950269440.pth [2024-06-28 01:31:49,522][06909] Updated weights for policy 0, policy_version 119683 (0.0038) [2024-06-28 01:31:53,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1961033728. Throughput: 0: 43990.2. Samples: 1864023280. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-28 01:31:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:31:54,286][06909] Updated weights for policy 0, policy_version 119693 (0.0038) [2024-06-28 01:31:56,929][06909] Updated weights for policy 0, policy_version 119703 (0.0031) [2024-06-28 01:31:58,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43965.0, 300 sec: 44098.0). Total num frames: 1961279488. Throughput: 0: 43887.1. Samples: 1864150120. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-28 01:31:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:31:59,253][06887] Signal inference workers to stop experience collection... (26500 times) [2024-06-28 01:31:59,253][06887] Signal inference workers to resume experience collection... (26500 times) [2024-06-28 01:31:59,300][06909] InferenceWorker_p0-w0: stopping experience collection (26500 times) [2024-06-28 01:31:59,300][06909] InferenceWorker_p0-w0: resuming experience collection (26500 times) [2024-06-28 01:32:01,642][06909] Updated weights for policy 0, policy_version 119713 (0.0032) [2024-06-28 01:32:03,850][06674] Fps is (10 sec: 47513.9, 60 sec: 44237.0, 300 sec: 44098.0). Total num frames: 1961508864. Throughput: 0: 44103.1. Samples: 1864421840. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-28 01:32:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 01:32:04,245][06909] Updated weights for policy 0, policy_version 119723 (0.0032) [2024-06-28 01:32:08,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 1961689088. Throughput: 0: 43939.0. Samples: 1864680820. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-28 01:32:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 01:32:08,999][06909] Updated weights for policy 0, policy_version 119733 (0.0042) [2024-06-28 01:32:11,880][06909] Updated weights for policy 0, policy_version 119743 (0.0025) [2024-06-28 01:32:13,850][06674] Fps is (10 sec: 44235.7, 60 sec: 43963.6, 300 sec: 44153.5). Total num frames: 1961951232. Throughput: 0: 43938.1. Samples: 1864805560. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-28 01:32:13,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:32:16,688][06909] Updated weights for policy 0, policy_version 119753 (0.0030) [2024-06-28 01:32:18,850][06674] Fps is (10 sec: 49152.5, 60 sec: 44509.8, 300 sec: 44098.0). Total num frames: 1962180608. Throughput: 0: 43962.0. Samples: 1865079380. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-28 01:32:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:32:19,335][06909] Updated weights for policy 0, policy_version 119763 (0.0036) [2024-06-28 01:32:23,850][06674] Fps is (10 sec: 39322.1, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 1962344448. Throughput: 0: 43776.3. Samples: 1865340060. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-28 01:32:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:32:23,916][06909] Updated weights for policy 0, policy_version 119773 (0.0027) [2024-06-28 01:32:26,944][06909] Updated weights for policy 0, policy_version 119783 (0.0030) [2024-06-28 01:32:28,850][06674] Fps is (10 sec: 44236.2, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 1962622976. Throughput: 0: 43980.3. Samples: 1865470260. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-28 01:32:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:32:31,484][06909] Updated weights for policy 0, policy_version 119793 (0.0035) [2024-06-28 01:32:33,850][06674] Fps is (10 sec: 49151.7, 60 sec: 44238.2, 300 sec: 44097.9). Total num frames: 1962835968. Throughput: 0: 43985.6. Samples: 1865740280. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-28 01:32:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:32:34,299][06909] Updated weights for policy 0, policy_version 119803 (0.0030) [2024-06-28 01:32:38,643][06909] Updated weights for policy 0, policy_version 119813 (0.0023) [2024-06-28 01:32:38,850][06674] Fps is (10 sec: 39322.2, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1963016192. Throughput: 0: 43934.3. Samples: 1866000320. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-28 01:32:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:32:41,551][06909] Updated weights for policy 0, policy_version 119823 (0.0037) [2024-06-28 01:32:43,850][06674] Fps is (10 sec: 44237.5, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 1963278336. Throughput: 0: 43874.6. Samples: 1866124480. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-28 01:32:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:32:46,589][06909] Updated weights for policy 0, policy_version 119833 (0.0032) [2024-06-28 01:32:48,850][06674] Fps is (10 sec: 47513.2, 60 sec: 44236.7, 300 sec: 44098.0). Total num frames: 1963491328. Throughput: 0: 43904.8. Samples: 1866397560. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-28 01:32:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:32:49,135][06909] Updated weights for policy 0, policy_version 119843 (0.0022) [2024-06-28 01:32:53,856][06674] Fps is (10 sec: 37660.4, 60 sec: 43686.3, 300 sec: 43874.9). Total num frames: 1963655168. Throughput: 0: 43956.4. Samples: 1866659120. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-28 01:32:53,856][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:32:53,981][06909] Updated weights for policy 0, policy_version 119853 (0.0032) [2024-06-28 01:32:56,560][06909] Updated weights for policy 0, policy_version 119863 (0.0034) [2024-06-28 01:32:58,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 1963917312. Throughput: 0: 43962.8. Samples: 1866783880. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-28 01:32:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:33:01,234][06909] Updated weights for policy 0, policy_version 119873 (0.0031) [2024-06-28 01:33:03,850][06674] Fps is (10 sec: 49181.8, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1964146688. Throughput: 0: 43944.4. Samples: 1867056880. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-28 01:33:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:33:04,030][06909] Updated weights for policy 0, policy_version 119883 (0.0037) [2024-06-28 01:33:08,633][06909] Updated weights for policy 0, policy_version 119893 (0.0029) [2024-06-28 01:33:08,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 1964326912. Throughput: 0: 44047.7. Samples: 1867322200. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-28 01:33:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:33:11,409][06909] Updated weights for policy 0, policy_version 119903 (0.0029) [2024-06-28 01:33:13,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.9, 300 sec: 44098.0). Total num frames: 1964589056. Throughput: 0: 43939.2. Samples: 1867447520. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-28 01:33:13,850][06674] Avg episode reward: [(0, '0.447')] [2024-06-28 01:33:15,832][06909] Updated weights for policy 0, policy_version 119913 (0.0024) [2024-06-28 01:33:18,850][06674] Fps is (10 sec: 47513.6, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1964802048. Throughput: 0: 43782.0. Samples: 1867710460. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-28 01:33:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:33:19,057][06909] Updated weights for policy 0, policy_version 119923 (0.0051) [2024-06-28 01:33:20,210][06887] Signal inference workers to stop experience collection... (26550 times) [2024-06-28 01:33:20,210][06887] Signal inference workers to resume experience collection... (26550 times) [2024-06-28 01:33:20,240][06909] InferenceWorker_p0-w0: stopping experience collection (26550 times) [2024-06-28 01:33:20,244][06909] InferenceWorker_p0-w0: resuming experience collection (26550 times) [2024-06-28 01:33:23,850][06674] Fps is (10 sec: 37683.4, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 1964965888. Throughput: 0: 43976.5. Samples: 1867979260. Policy #0 lag: (min: 1.0, avg: 11.1, max: 21.0) [2024-06-28 01:33:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:33:23,893][06909] Updated weights for policy 0, policy_version 119933 (0.0019) [2024-06-28 01:33:26,401][06909] Updated weights for policy 0, policy_version 119943 (0.0025) [2024-06-28 01:33:28,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 1965244416. Throughput: 0: 43962.7. Samples: 1868102800. Policy #0 lag: (min: 1.0, avg: 11.1, max: 21.0) [2024-06-28 01:33:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:33:31,042][06909] Updated weights for policy 0, policy_version 119953 (0.0021) [2024-06-28 01:33:33,714][06909] Updated weights for policy 0, policy_version 119963 (0.0028) [2024-06-28 01:33:33,850][06674] Fps is (10 sec: 50789.7, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1965473792. Throughput: 0: 44009.7. Samples: 1868378000. Policy #0 lag: (min: 1.0, avg: 11.1, max: 21.0) [2024-06-28 01:33:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:33:38,281][06909] Updated weights for policy 0, policy_version 119973 (0.0031) [2024-06-28 01:33:38,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1965654016. Throughput: 0: 44141.1. Samples: 1868645200. Policy #0 lag: (min: 1.0, avg: 11.1, max: 21.0) [2024-06-28 01:33:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:33:41,548][06909] Updated weights for policy 0, policy_version 119983 (0.0049) [2024-06-28 01:33:43,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 1965916160. Throughput: 0: 44155.1. Samples: 1868770860. Policy #0 lag: (min: 1.0, avg: 11.1, max: 21.0) [2024-06-28 01:33:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:33:45,589][06909] Updated weights for policy 0, policy_version 119993 (0.0032) [2024-06-28 01:33:48,773][06909] Updated weights for policy 0, policy_version 120003 (0.0041) [2024-06-28 01:33:48,850][06674] Fps is (10 sec: 47513.3, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1966129152. Throughput: 0: 44054.6. Samples: 1869039340. Policy #0 lag: (min: 1.0, avg: 11.1, max: 21.0) [2024-06-28 01:33:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:33:48,862][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000120003_1966129152.pth... [2024-06-28 01:33:48,915][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000119357_1955545088.pth [2024-06-28 01:33:53,180][06909] Updated weights for policy 0, policy_version 120013 (0.0045) [2024-06-28 01:33:53,850][06674] Fps is (10 sec: 37682.8, 60 sec: 43968.1, 300 sec: 43931.3). Total num frames: 1966292992. Throughput: 0: 44043.8. Samples: 1869304180. Policy #0 lag: (min: 1.0, avg: 11.1, max: 21.0) [2024-06-28 01:33:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:33:56,276][06909] Updated weights for policy 0, policy_version 120023 (0.0035) [2024-06-28 01:33:58,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 1966571520. Throughput: 0: 43943.0. Samples: 1869424960. Policy #0 lag: (min: 1.0, avg: 11.1, max: 21.0) [2024-06-28 01:33:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:34:01,125][06909] Updated weights for policy 0, policy_version 120033 (0.0034) [2024-06-28 01:34:03,592][06909] Updated weights for policy 0, policy_version 120043 (0.0030) [2024-06-28 01:34:03,850][06674] Fps is (10 sec: 49152.0, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 1966784512. Throughput: 0: 44025.6. Samples: 1869691620. Policy #0 lag: (min: 1.0, avg: 11.1, max: 21.0) [2024-06-28 01:34:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:34:08,391][06909] Updated weights for policy 0, policy_version 120053 (0.0049) [2024-06-28 01:34:08,850][06674] Fps is (10 sec: 40960.6, 60 sec: 44236.8, 300 sec: 43931.7). Total num frames: 1966981120. Throughput: 0: 44056.4. Samples: 1869961800. Policy #0 lag: (min: 1.0, avg: 11.1, max: 21.0) [2024-06-28 01:34:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:34:11,351][06909] Updated weights for policy 0, policy_version 120063 (0.0039) [2024-06-28 01:34:13,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 1967210496. Throughput: 0: 43998.7. Samples: 1870082740. Policy #0 lag: (min: 1.0, avg: 11.1, max: 21.0) [2024-06-28 01:34:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:34:15,680][06909] Updated weights for policy 0, policy_version 120073 (0.0022) [2024-06-28 01:34:18,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 1967423488. Throughput: 0: 43784.6. Samples: 1870348300. Policy #0 lag: (min: 1.0, avg: 11.1, max: 21.0) [2024-06-28 01:34:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:34:19,260][06909] Updated weights for policy 0, policy_version 120083 (0.0033) [2024-06-28 01:34:22,949][06909] Updated weights for policy 0, policy_version 120093 (0.0029) [2024-06-28 01:34:23,850][06674] Fps is (10 sec: 39321.2, 60 sec: 43963.7, 300 sec: 43820.3). Total num frames: 1967603712. Throughput: 0: 43774.6. Samples: 1870615060. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 01:34:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:34:26,540][06909] Updated weights for policy 0, policy_version 120103 (0.0034) [2024-06-28 01:34:28,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1967882240. Throughput: 0: 43727.5. Samples: 1870738600. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 01:34:28,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:34:30,797][06909] Updated weights for policy 0, policy_version 120113 (0.0040) [2024-06-28 01:34:33,814][06909] Updated weights for policy 0, policy_version 120123 (0.0033) [2024-06-28 01:34:33,850][06674] Fps is (10 sec: 49152.3, 60 sec: 43690.7, 300 sec: 43987.2). Total num frames: 1968095232. Throughput: 0: 43613.8. Samples: 1871001960. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 01:34:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:34:38,595][06909] Updated weights for policy 0, policy_version 120133 (0.0028) [2024-06-28 01:34:38,850][06674] Fps is (10 sec: 39321.1, 60 sec: 43690.5, 300 sec: 43820.2). Total num frames: 1968275456. Throughput: 0: 43815.0. Samples: 1871275860. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 01:34:38,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:34:41,329][06909] Updated weights for policy 0, policy_version 120143 (0.0043) [2024-06-28 01:34:41,610][06887] Signal inference workers to stop experience collection... (26600 times) [2024-06-28 01:34:41,654][06909] InferenceWorker_p0-w0: stopping experience collection (26600 times) [2024-06-28 01:34:41,727][06887] Signal inference workers to resume experience collection... (26600 times) [2024-06-28 01:34:41,727][06909] InferenceWorker_p0-w0: resuming experience collection (26600 times) [2024-06-28 01:34:43,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1968537600. Throughput: 0: 43824.1. Samples: 1871397040. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 01:34:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:34:45,892][06909] Updated weights for policy 0, policy_version 120153 (0.0027) [2024-06-28 01:34:48,710][06909] Updated weights for policy 0, policy_version 120163 (0.0036) [2024-06-28 01:34:48,850][06674] Fps is (10 sec: 47514.8, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1968750592. Throughput: 0: 43852.2. Samples: 1871664960. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 01:34:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:34:53,041][06909] Updated weights for policy 0, policy_version 120173 (0.0024) [2024-06-28 01:34:53,850][06674] Fps is (10 sec: 40959.8, 60 sec: 44236.9, 300 sec: 43820.3). Total num frames: 1968947200. Throughput: 0: 43703.9. Samples: 1871928480. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 01:34:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:34:56,269][06909] Updated weights for policy 0, policy_version 120183 (0.0033) [2024-06-28 01:34:58,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.8, 300 sec: 44042.4). Total num frames: 1969192960. Throughput: 0: 43930.6. Samples: 1872059620. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 01:34:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:35:00,399][06909] Updated weights for policy 0, policy_version 120193 (0.0037) [2024-06-28 01:35:03,454][06909] Updated weights for policy 0, policy_version 120203 (0.0030) [2024-06-28 01:35:03,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1969405952. Throughput: 0: 43806.1. Samples: 1872319580. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 01:35:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:35:08,106][06909] Updated weights for policy 0, policy_version 120213 (0.0038) [2024-06-28 01:35:08,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 1969602560. Throughput: 0: 43960.5. Samples: 1872593280. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 01:35:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:35:10,873][06909] Updated weights for policy 0, policy_version 120223 (0.0035) [2024-06-28 01:35:13,852][06674] Fps is (10 sec: 44228.1, 60 sec: 43962.2, 300 sec: 44042.1). Total num frames: 1969848320. Throughput: 0: 44024.7. Samples: 1872719800. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 01:35:13,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:35:15,621][06909] Updated weights for policy 0, policy_version 120233 (0.0026) [2024-06-28 01:35:18,476][06909] Updated weights for policy 0, policy_version 120243 (0.0031) [2024-06-28 01:35:18,850][06674] Fps is (10 sec: 47513.7, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1970077696. Throughput: 0: 44005.3. Samples: 1872982200. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 01:35:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 01:35:22,901][06909] Updated weights for policy 0, policy_version 120253 (0.0029) [2024-06-28 01:35:23,850][06674] Fps is (10 sec: 42607.2, 60 sec: 44509.9, 300 sec: 43876.1). Total num frames: 1970274304. Throughput: 0: 44002.4. Samples: 1873255960. Policy #0 lag: (min: 1.0, avg: 8.3, max: 20.0) [2024-06-28 01:35:23,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 01:35:25,686][06909] Updated weights for policy 0, policy_version 120263 (0.0028) [2024-06-28 01:35:28,852][06674] Fps is (10 sec: 44227.8, 60 sec: 43962.3, 300 sec: 44097.6). Total num frames: 1970520064. Throughput: 0: 44110.9. Samples: 1873382120. Policy #0 lag: (min: 1.0, avg: 8.3, max: 20.0) [2024-06-28 01:35:28,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:35:30,083][06909] Updated weights for policy 0, policy_version 120273 (0.0041) [2024-06-28 01:35:33,337][06909] Updated weights for policy 0, policy_version 120283 (0.0034) [2024-06-28 01:35:33,850][06674] Fps is (10 sec: 47513.8, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1970749440. Throughput: 0: 44032.9. Samples: 1873646440. Policy #0 lag: (min: 1.0, avg: 8.3, max: 20.0) [2024-06-28 01:35:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:35:37,241][06909] Updated weights for policy 0, policy_version 120293 (0.0045) [2024-06-28 01:35:38,850][06674] Fps is (10 sec: 42607.2, 60 sec: 44510.0, 300 sec: 43931.4). Total num frames: 1970946048. Throughput: 0: 44162.7. Samples: 1873915800. Policy #0 lag: (min: 1.0, avg: 8.3, max: 20.0) [2024-06-28 01:35:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:35:40,719][06909] Updated weights for policy 0, policy_version 120303 (0.0033) [2024-06-28 01:35:43,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 1971159040. Throughput: 0: 44078.7. Samples: 1874043160. Policy #0 lag: (min: 1.0, avg: 8.3, max: 20.0) [2024-06-28 01:35:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:35:45,074][06909] Updated weights for policy 0, policy_version 120313 (0.0025) [2024-06-28 01:35:47,954][06909] Updated weights for policy 0, policy_version 120323 (0.0036) [2024-06-28 01:35:48,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1971388416. Throughput: 0: 43993.4. Samples: 1874299280. Policy #0 lag: (min: 1.0, avg: 8.3, max: 20.0) [2024-06-28 01:35:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:35:48,864][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000120324_1971388416.pth... [2024-06-28 01:35:48,923][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000119681_1960853504.pth [2024-06-28 01:35:52,625][06909] Updated weights for policy 0, policy_version 120333 (0.0039) [2024-06-28 01:35:53,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.8, 300 sec: 43931.6). Total num frames: 1971601408. Throughput: 0: 43988.4. Samples: 1874572760. Policy #0 lag: (min: 1.0, avg: 8.3, max: 20.0) [2024-06-28 01:35:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:35:55,556][06909] Updated weights for policy 0, policy_version 120343 (0.0026) [2024-06-28 01:35:58,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 1971814400. Throughput: 0: 44000.5. Samples: 1874699740. Policy #0 lag: (min: 1.0, avg: 8.3, max: 20.0) [2024-06-28 01:35:58,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:35:59,952][06909] Updated weights for policy 0, policy_version 120353 (0.0037) [2024-06-28 01:36:02,867][06909] Updated weights for policy 0, policy_version 120363 (0.0038) [2024-06-28 01:36:03,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1972043776. Throughput: 0: 43996.9. Samples: 1874962060. Policy #0 lag: (min: 1.0, avg: 8.3, max: 20.0) [2024-06-28 01:36:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:36:06,292][06887] Signal inference workers to stop experience collection... (26650 times) [2024-06-28 01:36:06,292][06887] Signal inference workers to resume experience collection... (26650 times) [2024-06-28 01:36:06,305][06909] InferenceWorker_p0-w0: stopping experience collection (26650 times) [2024-06-28 01:36:06,306][06909] InferenceWorker_p0-w0: resuming experience collection (26650 times) [2024-06-28 01:36:07,303][06909] Updated weights for policy 0, policy_version 120373 (0.0038) [2024-06-28 01:36:08,850][06674] Fps is (10 sec: 45875.8, 60 sec: 44509.9, 300 sec: 43931.3). Total num frames: 1972273152. Throughput: 0: 43822.6. Samples: 1875227980. Policy #0 lag: (min: 1.0, avg: 8.3, max: 20.0) [2024-06-28 01:36:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:36:10,687][06909] Updated weights for policy 0, policy_version 120383 (0.0027) [2024-06-28 01:36:13,850][06674] Fps is (10 sec: 40959.3, 60 sec: 43419.0, 300 sec: 43875.8). Total num frames: 1972453376. Throughput: 0: 43846.3. Samples: 1875355120. Policy #0 lag: (min: 1.0, avg: 8.3, max: 20.0) [2024-06-28 01:36:13,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 01:36:14,533][06909] Updated weights for policy 0, policy_version 120393 (0.0025) [2024-06-28 01:36:17,949][06909] Updated weights for policy 0, policy_version 120403 (0.0048) [2024-06-28 01:36:18,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1972715520. Throughput: 0: 43765.3. Samples: 1875615880. Policy #0 lag: (min: 1.0, avg: 8.3, max: 20.0) [2024-06-28 01:36:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:36:22,151][06909] Updated weights for policy 0, policy_version 120413 (0.0032) [2024-06-28 01:36:23,850][06674] Fps is (10 sec: 47514.5, 60 sec: 44236.8, 300 sec: 43931.4). Total num frames: 1972928512. Throughput: 0: 43821.8. Samples: 1875887780. Policy #0 lag: (min: 1.0, avg: 8.3, max: 20.0) [2024-06-28 01:36:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:36:25,180][06909] Updated weights for policy 0, policy_version 120423 (0.0035) [2024-06-28 01:36:28,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43419.0, 300 sec: 43876.1). Total num frames: 1973125120. Throughput: 0: 43934.6. Samples: 1876020220. Policy #0 lag: (min: 0.0, avg: 12.4, max: 21.0) [2024-06-28 01:36:28,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-28 01:36:29,708][06909] Updated weights for policy 0, policy_version 120433 (0.0026) [2024-06-28 01:36:32,719][06909] Updated weights for policy 0, policy_version 120443 (0.0032) [2024-06-28 01:36:33,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 1973387264. Throughput: 0: 44104.5. Samples: 1876283980. Policy #0 lag: (min: 0.0, avg: 12.4, max: 21.0) [2024-06-28 01:36:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 01:36:37,157][06909] Updated weights for policy 0, policy_version 120453 (0.0035) [2024-06-28 01:36:38,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 1973600256. Throughput: 0: 43934.2. Samples: 1876549800. Policy #0 lag: (min: 0.0, avg: 12.4, max: 21.0) [2024-06-28 01:36:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:36:40,139][06909] Updated weights for policy 0, policy_version 120463 (0.0051) [2024-06-28 01:36:43,850][06674] Fps is (10 sec: 39321.1, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 1973780480. Throughput: 0: 43994.7. Samples: 1876679500. Policy #0 lag: (min: 0.0, avg: 12.4, max: 21.0) [2024-06-28 01:36:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:36:44,471][06909] Updated weights for policy 0, policy_version 120473 (0.0032) [2024-06-28 01:36:47,513][06909] Updated weights for policy 0, policy_version 120483 (0.0025) [2024-06-28 01:36:48,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1974026240. Throughput: 0: 44078.5. Samples: 1876945600. Policy #0 lag: (min: 0.0, avg: 12.4, max: 21.0) [2024-06-28 01:36:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:36:52,164][06909] Updated weights for policy 0, policy_version 120493 (0.0027) [2024-06-28 01:36:53,850][06674] Fps is (10 sec: 47514.2, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 1974255616. Throughput: 0: 43987.2. Samples: 1877207400. Policy #0 lag: (min: 0.0, avg: 12.4, max: 21.0) [2024-06-28 01:36:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:36:55,140][06909] Updated weights for policy 0, policy_version 120503 (0.0026) [2024-06-28 01:36:58,850][06674] Fps is (10 sec: 40960.8, 60 sec: 43690.8, 300 sec: 43820.3). Total num frames: 1974435840. Throughput: 0: 44066.0. Samples: 1877338080. Policy #0 lag: (min: 0.0, avg: 12.4, max: 21.0) [2024-06-28 01:36:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:36:59,658][06909] Updated weights for policy 0, policy_version 120513 (0.0033) [2024-06-28 01:37:02,340][06909] Updated weights for policy 0, policy_version 120523 (0.0031) [2024-06-28 01:37:03,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1974681600. Throughput: 0: 44075.0. Samples: 1877599260. Policy #0 lag: (min: 0.0, avg: 12.4, max: 21.0) [2024-06-28 01:37:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:37:07,159][06909] Updated weights for policy 0, policy_version 120533 (0.0033) [2024-06-28 01:37:08,850][06674] Fps is (10 sec: 49151.5, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 1974927360. Throughput: 0: 43965.3. Samples: 1877866220. Policy #0 lag: (min: 0.0, avg: 12.4, max: 21.0) [2024-06-28 01:37:08,854][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:37:09,790][06909] Updated weights for policy 0, policy_version 120543 (0.0025) [2024-06-28 01:37:13,850][06674] Fps is (10 sec: 42598.8, 60 sec: 44236.9, 300 sec: 43820.3). Total num frames: 1975107584. Throughput: 0: 44052.1. Samples: 1878002560. Policy #0 lag: (min: 0.0, avg: 12.4, max: 21.0) [2024-06-28 01:37:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:37:14,542][06909] Updated weights for policy 0, policy_version 120553 (0.0031) [2024-06-28 01:37:17,107][06909] Updated weights for policy 0, policy_version 120563 (0.0033) [2024-06-28 01:37:18,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 1975353344. Throughput: 0: 44040.4. Samples: 1878265800. Policy #0 lag: (min: 0.0, avg: 12.4, max: 21.0) [2024-06-28 01:37:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:37:21,635][06909] Updated weights for policy 0, policy_version 120573 (0.0032) [2024-06-28 01:37:22,223][06887] Signal inference workers to stop experience collection... (26700 times) [2024-06-28 01:37:22,267][06909] InferenceWorker_p0-w0: stopping experience collection (26700 times) [2024-06-28 01:37:22,277][06887] Signal inference workers to resume experience collection... (26700 times) [2024-06-28 01:37:22,279][06909] InferenceWorker_p0-w0: resuming experience collection (26700 times) [2024-06-28 01:37:23,850][06674] Fps is (10 sec: 47512.9, 60 sec: 44236.7, 300 sec: 43931.3). Total num frames: 1975582720. Throughput: 0: 44173.7. Samples: 1878537620. Policy #0 lag: (min: 0.0, avg: 12.4, max: 21.0) [2024-06-28 01:37:23,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:37:24,404][06909] Updated weights for policy 0, policy_version 120583 (0.0035) [2024-06-28 01:37:28,850][06674] Fps is (10 sec: 42597.6, 60 sec: 44236.7, 300 sec: 43875.8). Total num frames: 1975779328. Throughput: 0: 44169.7. Samples: 1878667140. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 01:37:28,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:37:29,282][06909] Updated weights for policy 0, policy_version 120593 (0.0029) [2024-06-28 01:37:31,793][06909] Updated weights for policy 0, policy_version 120603 (0.0036) [2024-06-28 01:37:33,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1976008704. Throughput: 0: 44054.4. Samples: 1878928040. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 01:37:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:37:36,919][06909] Updated weights for policy 0, policy_version 120613 (0.0039) [2024-06-28 01:37:38,850][06674] Fps is (10 sec: 47514.7, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 1976254464. Throughput: 0: 44055.6. Samples: 1879189900. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 01:37:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:37:39,300][06909] Updated weights for policy 0, policy_version 120623 (0.0038) [2024-06-28 01:37:43,850][06674] Fps is (10 sec: 39321.7, 60 sec: 43690.8, 300 sec: 43764.7). Total num frames: 1976401920. Throughput: 0: 44290.6. Samples: 1879331160. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 01:37:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:37:44,301][06909] Updated weights for policy 0, policy_version 120633 (0.0047) [2024-06-28 01:37:46,731][06909] Updated weights for policy 0, policy_version 120643 (0.0032) [2024-06-28 01:37:48,850][06674] Fps is (10 sec: 42598.0, 60 sec: 44236.9, 300 sec: 44154.4). Total num frames: 1976680448. Throughput: 0: 44253.8. Samples: 1879590680. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 01:37:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:37:48,864][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000120647_1976680448.pth... [2024-06-28 01:37:48,915][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000120003_1966129152.pth [2024-06-28 01:37:51,549][06909] Updated weights for policy 0, policy_version 120653 (0.0023) [2024-06-28 01:37:53,852][06674] Fps is (10 sec: 52417.7, 60 sec: 44508.3, 300 sec: 44097.6). Total num frames: 1976926208. Throughput: 0: 44299.8. Samples: 1879859800. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 01:37:53,853][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:37:54,217][06909] Updated weights for policy 0, policy_version 120663 (0.0036) [2024-06-28 01:37:58,850][06674] Fps is (10 sec: 40960.3, 60 sec: 44236.8, 300 sec: 43875.8). Total num frames: 1977090048. Throughput: 0: 44229.3. Samples: 1879992880. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 01:37:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:37:58,861][06909] Updated weights for policy 0, policy_version 120673 (0.0028) [2024-06-28 01:38:01,407][06909] Updated weights for policy 0, policy_version 120683 (0.0030) [2024-06-28 01:38:03,850][06674] Fps is (10 sec: 40968.6, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 1977335808. Throughput: 0: 44191.1. Samples: 1880254400. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 01:38:03,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-28 01:38:06,608][06909] Updated weights for policy 0, policy_version 120693 (0.0030) [2024-06-28 01:38:08,850][06674] Fps is (10 sec: 49151.3, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1977581568. Throughput: 0: 43969.3. Samples: 1880516240. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 01:38:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:38:09,034][06909] Updated weights for policy 0, policy_version 120703 (0.0046) [2024-06-28 01:38:13,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 1977745408. Throughput: 0: 43991.7. Samples: 1880646760. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 01:38:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:38:14,254][06909] Updated weights for policy 0, policy_version 120713 (0.0026) [2024-06-28 01:38:16,683][06909] Updated weights for policy 0, policy_version 120723 (0.0040) [2024-06-28 01:38:18,850][06674] Fps is (10 sec: 42598.1, 60 sec: 44236.7, 300 sec: 44209.0). Total num frames: 1978007552. Throughput: 0: 43984.7. Samples: 1880907360. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 01:38:18,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:38:19,761][06887] Signal inference workers to stop experience collection... (26750 times) [2024-06-28 01:38:19,822][06887] Signal inference workers to resume experience collection... (26750 times) [2024-06-28 01:38:19,823][06909] InferenceWorker_p0-w0: stopping experience collection (26750 times) [2024-06-28 01:38:19,834][06909] InferenceWorker_p0-w0: resuming experience collection (26750 times) [2024-06-28 01:38:21,513][06909] Updated weights for policy 0, policy_version 120733 (0.0041) [2024-06-28 01:38:23,850][06674] Fps is (10 sec: 49152.1, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 1978236928. Throughput: 0: 44044.0. Samples: 1881171880. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 01:38:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:38:23,945][06909] Updated weights for policy 0, policy_version 120743 (0.0030) [2024-06-28 01:38:28,831][06909] Updated weights for policy 0, policy_version 120753 (0.0025) [2024-06-28 01:38:28,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 1978417152. Throughput: 0: 43951.9. Samples: 1881309000. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2024-06-28 01:38:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:38:31,630][06909] Updated weights for policy 0, policy_version 120763 (0.0032) [2024-06-28 01:38:33,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1978646528. Throughput: 0: 43996.5. Samples: 1881570520. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2024-06-28 01:38:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:38:36,170][06909] Updated weights for policy 0, policy_version 120773 (0.0024) [2024-06-28 01:38:38,850][06674] Fps is (10 sec: 47513.5, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 1978892288. Throughput: 0: 44084.2. Samples: 1881843500. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2024-06-28 01:38:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:38:39,077][06909] Updated weights for policy 0, policy_version 120783 (0.0046) [2024-06-28 01:38:43,487][06909] Updated weights for policy 0, policy_version 120793 (0.0036) [2024-06-28 01:38:43,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44509.9, 300 sec: 43875.8). Total num frames: 1979072512. Throughput: 0: 44118.3. Samples: 1881978200. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2024-06-28 01:38:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:38:46,414][06909] Updated weights for policy 0, policy_version 120803 (0.0034) [2024-06-28 01:38:48,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 1979318272. Throughput: 0: 44011.6. Samples: 1882234920. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2024-06-28 01:38:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:38:50,797][06909] Updated weights for policy 0, policy_version 120813 (0.0027) [2024-06-28 01:38:53,602][06909] Updated weights for policy 0, policy_version 120823 (0.0024) [2024-06-28 01:38:53,850][06674] Fps is (10 sec: 50790.0, 60 sec: 44238.3, 300 sec: 44098.0). Total num frames: 1979580416. Throughput: 0: 44074.3. Samples: 1882499580. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2024-06-28 01:38:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:38:58,357][06909] Updated weights for policy 0, policy_version 120833 (0.0040) [2024-06-28 01:38:58,850][06674] Fps is (10 sec: 42598.0, 60 sec: 44236.7, 300 sec: 43931.3). Total num frames: 1979744256. Throughput: 0: 44322.6. Samples: 1882641280. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2024-06-28 01:38:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:39:01,137][06909] Updated weights for policy 0, policy_version 120843 (0.0032) [2024-06-28 01:39:03,850][06674] Fps is (10 sec: 39321.4, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1979973632. Throughput: 0: 44313.9. Samples: 1882901480. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2024-06-28 01:39:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:39:05,753][06909] Updated weights for policy 0, policy_version 120853 (0.0035) [2024-06-28 01:39:08,426][06909] Updated weights for policy 0, policy_version 120863 (0.0027) [2024-06-28 01:39:08,850][06674] Fps is (10 sec: 49152.0, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 1980235776. Throughput: 0: 44415.5. Samples: 1883170580. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2024-06-28 01:39:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:39:12,892][06909] Updated weights for policy 0, policy_version 120873 (0.0039) [2024-06-28 01:39:13,851][06674] Fps is (10 sec: 44232.7, 60 sec: 44509.2, 300 sec: 44042.3). Total num frames: 1980416000. Throughput: 0: 44432.0. Samples: 1883308480. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2024-06-28 01:39:13,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:39:15,746][06909] Updated weights for policy 0, policy_version 120883 (0.0028) [2024-06-28 01:39:18,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43963.8, 300 sec: 44209.0). Total num frames: 1980645376. Throughput: 0: 44455.3. Samples: 1883571020. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2024-06-28 01:39:18,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:39:20,452][06909] Updated weights for policy 0, policy_version 120893 (0.0037) [2024-06-28 01:39:23,398][06909] Updated weights for policy 0, policy_version 120903 (0.0031) [2024-06-28 01:39:23,850][06674] Fps is (10 sec: 47518.1, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1980891136. Throughput: 0: 44171.6. Samples: 1883831220. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2024-06-28 01:39:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:39:27,993][06909] Updated weights for policy 0, policy_version 120913 (0.0036) [2024-06-28 01:39:28,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 1981087744. Throughput: 0: 44192.3. Samples: 1883966860. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2024-06-28 01:39:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:39:31,106][06909] Updated weights for policy 0, policy_version 120923 (0.0040) [2024-06-28 01:39:33,852][06674] Fps is (10 sec: 40951.5, 60 sec: 44235.2, 300 sec: 44153.2). Total num frames: 1981300736. Throughput: 0: 44111.3. Samples: 1884220020. Policy #0 lag: (min: 1.0, avg: 10.1, max: 20.0) [2024-06-28 01:39:33,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:39:35,302][06909] Updated weights for policy 0, policy_version 120933 (0.0031) [2024-06-28 01:39:38,451][06909] Updated weights for policy 0, policy_version 120943 (0.0027) [2024-06-28 01:39:38,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 1981546496. Throughput: 0: 44254.2. Samples: 1884491020. Policy #0 lag: (min: 1.0, avg: 10.1, max: 20.0) [2024-06-28 01:39:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:39:42,818][06909] Updated weights for policy 0, policy_version 120953 (0.0036) [2024-06-28 01:39:43,850][06674] Fps is (10 sec: 45884.9, 60 sec: 44782.9, 300 sec: 44097.9). Total num frames: 1981759488. Throughput: 0: 44208.1. Samples: 1884630640. Policy #0 lag: (min: 1.0, avg: 10.1, max: 20.0) [2024-06-28 01:39:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:39:45,746][06909] Updated weights for policy 0, policy_version 120963 (0.0027) [2024-06-28 01:39:48,850][06674] Fps is (10 sec: 40959.3, 60 sec: 43963.6, 300 sec: 44097.9). Total num frames: 1981956096. Throughput: 0: 44182.5. Samples: 1884889700. Policy #0 lag: (min: 1.0, avg: 10.1, max: 20.0) [2024-06-28 01:39:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:39:48,914][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000120970_1981972480.pth... [2024-06-28 01:39:48,954][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000120324_1971388416.pth [2024-06-28 01:39:50,387][06909] Updated weights for policy 0, policy_version 120973 (0.0033) [2024-06-28 01:39:53,297][06909] Updated weights for policy 0, policy_version 120983 (0.0036) [2024-06-28 01:39:53,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.6, 300 sec: 44097.9). Total num frames: 1982201856. Throughput: 0: 44092.5. Samples: 1885154740. Policy #0 lag: (min: 1.0, avg: 10.1, max: 20.0) [2024-06-28 01:39:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:39:57,653][06909] Updated weights for policy 0, policy_version 120993 (0.0028) [2024-06-28 01:39:58,840][06887] Signal inference workers to stop experience collection... (26800 times) [2024-06-28 01:39:58,840][06887] Signal inference workers to resume experience collection... (26800 times) [2024-06-28 01:39:58,850][06674] Fps is (10 sec: 45875.9, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 1982414848. Throughput: 0: 43892.0. Samples: 1885283580. Policy #0 lag: (min: 1.0, avg: 10.1, max: 20.0) [2024-06-28 01:39:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:39:58,871][06909] InferenceWorker_p0-w0: stopping experience collection (26800 times) [2024-06-28 01:39:58,871][06909] InferenceWorker_p0-w0: resuming experience collection (26800 times) [2024-06-28 01:40:00,878][06909] Updated weights for policy 0, policy_version 121003 (0.0039) [2024-06-28 01:40:03,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1982627840. Throughput: 0: 43903.3. Samples: 1885546660. Policy #0 lag: (min: 1.0, avg: 10.1, max: 20.0) [2024-06-28 01:40:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:40:05,001][06909] Updated weights for policy 0, policy_version 121013 (0.0020) [2024-06-28 01:40:08,022][06909] Updated weights for policy 0, policy_version 121023 (0.0033) [2024-06-28 01:40:08,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43417.6, 300 sec: 44042.7). Total num frames: 1982840832. Throughput: 0: 43985.7. Samples: 1885810580. Policy #0 lag: (min: 1.0, avg: 10.1, max: 20.0) [2024-06-28 01:40:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:40:12,292][06909] Updated weights for policy 0, policy_version 121033 (0.0029) [2024-06-28 01:40:13,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44237.5, 300 sec: 44042.4). Total num frames: 1983070208. Throughput: 0: 44015.6. Samples: 1885947560. Policy #0 lag: (min: 1.0, avg: 10.1, max: 20.0) [2024-06-28 01:40:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:40:15,506][06909] Updated weights for policy 0, policy_version 121043 (0.0026) [2024-06-28 01:40:18,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 1983283200. Throughput: 0: 44298.9. Samples: 1886213380. Policy #0 lag: (min: 1.0, avg: 10.1, max: 20.0) [2024-06-28 01:40:18,854][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:40:19,791][06909] Updated weights for policy 0, policy_version 121053 (0.0031) [2024-06-28 01:40:22,877][06909] Updated weights for policy 0, policy_version 121063 (0.0023) [2024-06-28 01:40:23,852][06674] Fps is (10 sec: 44228.0, 60 sec: 43689.2, 300 sec: 44042.4). Total num frames: 1983512576. Throughput: 0: 44188.7. Samples: 1886479600. Policy #0 lag: (min: 1.0, avg: 10.1, max: 20.0) [2024-06-28 01:40:23,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:40:27,535][06909] Updated weights for policy 0, policy_version 121073 (0.0029) [2024-06-28 01:40:28,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 1983741952. Throughput: 0: 44018.9. Samples: 1886611500. Policy #0 lag: (min: 1.0, avg: 10.1, max: 20.0) [2024-06-28 01:40:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:40:30,309][06909] Updated weights for policy 0, policy_version 121083 (0.0028) [2024-06-28 01:40:33,850][06674] Fps is (10 sec: 45884.7, 60 sec: 44511.4, 300 sec: 44153.5). Total num frames: 1983971328. Throughput: 0: 44223.3. Samples: 1886879740. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 01:40:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:40:34,805][06909] Updated weights for policy 0, policy_version 121093 (0.0029) [2024-06-28 01:40:38,084][06909] Updated weights for policy 0, policy_version 121103 (0.0033) [2024-06-28 01:40:38,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43417.5, 300 sec: 44042.4). Total num frames: 1984151552. Throughput: 0: 44074.1. Samples: 1887138080. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 01:40:38,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:40:42,239][06909] Updated weights for policy 0, policy_version 121113 (0.0027) [2024-06-28 01:40:43,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1984380928. Throughput: 0: 44131.6. Samples: 1887269500. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 01:40:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:40:45,343][06909] Updated weights for policy 0, policy_version 121123 (0.0025) [2024-06-28 01:40:48,850][06674] Fps is (10 sec: 45876.0, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 1984610304. Throughput: 0: 44215.5. Samples: 1887536360. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 01:40:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:40:49,760][06909] Updated weights for policy 0, policy_version 121133 (0.0033) [2024-06-28 01:40:52,854][06909] Updated weights for policy 0, policy_version 121143 (0.0026) [2024-06-28 01:40:53,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.6, 300 sec: 44098.0). Total num frames: 1984823296. Throughput: 0: 44050.3. Samples: 1887792840. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 01:40:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:40:57,296][06909] Updated weights for policy 0, policy_version 121153 (0.0031) [2024-06-28 01:40:58,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 1985052672. Throughput: 0: 43863.6. Samples: 1887921420. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 01:40:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:41:00,340][06909] Updated weights for policy 0, policy_version 121163 (0.0036) [2024-06-28 01:41:03,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1985282048. Throughput: 0: 43925.9. Samples: 1888190040. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 01:41:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 01:41:04,670][06909] Updated weights for policy 0, policy_version 121173 (0.0038) [2024-06-28 01:41:07,589][06909] Updated weights for policy 0, policy_version 121183 (0.0032) [2024-06-28 01:41:08,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 1985478656. Throughput: 0: 43936.2. Samples: 1888456640. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 01:41:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:41:12,038][06909] Updated weights for policy 0, policy_version 121193 (0.0033) [2024-06-28 01:41:13,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 1985724416. Throughput: 0: 43919.6. Samples: 1888587880. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 01:41:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:41:15,265][06909] Updated weights for policy 0, policy_version 121203 (0.0038) [2024-06-28 01:41:18,850][06674] Fps is (10 sec: 44235.9, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 1985921024. Throughput: 0: 43922.5. Samples: 1888856260. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 01:41:18,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:41:19,239][06909] Updated weights for policy 0, policy_version 121213 (0.0035) [2024-06-28 01:41:22,525][06909] Updated weights for policy 0, policy_version 121223 (0.0030) [2024-06-28 01:41:23,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43692.1, 300 sec: 44098.0). Total num frames: 1986134016. Throughput: 0: 44005.9. Samples: 1889118340. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 01:41:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:41:26,755][06909] Updated weights for policy 0, policy_version 121233 (0.0035) [2024-06-28 01:41:28,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1986379776. Throughput: 0: 43928.4. Samples: 1889246280. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 01:41:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:41:30,086][06909] Updated weights for policy 0, policy_version 121243 (0.0037) [2024-06-28 01:41:30,851][06887] Signal inference workers to stop experience collection... (26850 times) [2024-06-28 01:41:30,901][06887] Signal inference workers to resume experience collection... (26850 times) [2024-06-28 01:41:30,902][06909] InferenceWorker_p0-w0: stopping experience collection (26850 times) [2024-06-28 01:41:30,913][06909] InferenceWorker_p0-w0: resuming experience collection (26850 times) [2024-06-28 01:41:33,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1986592768. Throughput: 0: 44007.6. Samples: 1889516700. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 01:41:33,856][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:41:34,293][06909] Updated weights for policy 0, policy_version 121253 (0.0026) [2024-06-28 01:41:37,540][06909] Updated weights for policy 0, policy_version 121263 (0.0031) [2024-06-28 01:41:38,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 1986805760. Throughput: 0: 44008.4. Samples: 1889773220. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 01:41:38,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:41:41,883][06909] Updated weights for policy 0, policy_version 121273 (0.0024) [2024-06-28 01:41:43,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1987035136. Throughput: 0: 44166.6. Samples: 1889908920. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 01:41:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:41:44,947][06909] Updated weights for policy 0, policy_version 121283 (0.0027) [2024-06-28 01:41:48,856][06674] Fps is (10 sec: 44210.0, 60 sec: 43959.2, 300 sec: 44041.5). Total num frames: 1987248128. Throughput: 0: 44101.1. Samples: 1890174860. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 01:41:48,857][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:41:48,883][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000121292_1987248128.pth... [2024-06-28 01:41:48,952][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000120647_1976680448.pth [2024-06-28 01:41:49,114][06909] Updated weights for policy 0, policy_version 121293 (0.0046) [2024-06-28 01:41:52,874][06909] Updated weights for policy 0, policy_version 121303 (0.0037) [2024-06-28 01:41:53,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.7, 300 sec: 44209.0). Total num frames: 1987477504. Throughput: 0: 44071.0. Samples: 1890439840. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 01:41:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:41:56,533][06909] Updated weights for policy 0, policy_version 121313 (0.0027) [2024-06-28 01:41:58,850][06674] Fps is (10 sec: 45903.5, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1987706880. Throughput: 0: 44078.3. Samples: 1890571400. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 01:41:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:42:00,026][06909] Updated weights for policy 0, policy_version 121323 (0.0033) [2024-06-28 01:42:03,850][06674] Fps is (10 sec: 40960.8, 60 sec: 43417.6, 300 sec: 43931.4). Total num frames: 1987887104. Throughput: 0: 43951.8. Samples: 1890834080. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 01:42:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:42:04,200][06909] Updated weights for policy 0, policy_version 121333 (0.0035) [2024-06-28 01:42:07,707][06909] Updated weights for policy 0, policy_version 121343 (0.0025) [2024-06-28 01:42:08,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 1988116480. Throughput: 0: 43860.9. Samples: 1891092080. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 01:42:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:42:12,008][06909] Updated weights for policy 0, policy_version 121353 (0.0041) [2024-06-28 01:42:13,850][06674] Fps is (10 sec: 47513.3, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 1988362240. Throughput: 0: 44063.6. Samples: 1891229140. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 01:42:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:42:15,021][06909] Updated weights for policy 0, policy_version 121363 (0.0036) [2024-06-28 01:42:18,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.9, 300 sec: 43986.9). Total num frames: 1988558848. Throughput: 0: 43886.7. Samples: 1891491600. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 01:42:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:42:19,115][06909] Updated weights for policy 0, policy_version 121373 (0.0040) [2024-06-28 01:42:22,221][06909] Updated weights for policy 0, policy_version 121383 (0.0032) [2024-06-28 01:42:23,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1988788224. Throughput: 0: 44189.0. Samples: 1891761720. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 01:42:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:42:26,274][06909] Updated weights for policy 0, policy_version 121393 (0.0027) [2024-06-28 01:42:28,850][06674] Fps is (10 sec: 47513.1, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 1989033984. Throughput: 0: 44059.1. Samples: 1891891580. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 01:42:28,859][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:42:30,030][06909] Updated weights for policy 0, policy_version 121403 (0.0031) [2024-06-28 01:42:33,774][06909] Updated weights for policy 0, policy_version 121413 (0.0033) [2024-06-28 01:42:33,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1989230592. Throughput: 0: 44058.9. Samples: 1892157240. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 01:42:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:42:37,251][06909] Updated weights for policy 0, policy_version 121423 (0.0037) [2024-06-28 01:42:38,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.8, 300 sec: 44264.6). Total num frames: 1989459968. Throughput: 0: 44019.1. Samples: 1892420700. Policy #0 lag: (min: 0.0, avg: 11.2, max: 20.0) [2024-06-28 01:42:38,850][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 01:42:41,185][06909] Updated weights for policy 0, policy_version 121433 (0.0043) [2024-06-28 01:42:43,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1989689344. Throughput: 0: 44055.5. Samples: 1892553900. Policy #0 lag: (min: 0.0, avg: 11.2, max: 20.0) [2024-06-28 01:42:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:42:44,862][06909] Updated weights for policy 0, policy_version 121443 (0.0026) [2024-06-28 01:42:48,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43695.1, 300 sec: 43876.1). Total num frames: 1989869568. Throughput: 0: 43959.9. Samples: 1892812280. Policy #0 lag: (min: 0.0, avg: 11.2, max: 20.0) [2024-06-28 01:42:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 01:42:49,271][06909] Updated weights for policy 0, policy_version 121453 (0.0040) [2024-06-28 01:42:52,386][06909] Updated weights for policy 0, policy_version 121463 (0.0038) [2024-06-28 01:42:52,546][06887] Signal inference workers to stop experience collection... (26900 times) [2024-06-28 01:42:52,548][06887] Signal inference workers to resume experience collection... (26900 times) [2024-06-28 01:42:52,570][06909] InferenceWorker_p0-w0: stopping experience collection (26900 times) [2024-06-28 01:42:52,570][06909] InferenceWorker_p0-w0: resuming experience collection (26900 times) [2024-06-28 01:42:53,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43690.7, 300 sec: 44097.9). Total num frames: 1990098944. Throughput: 0: 44142.3. Samples: 1893078480. Policy #0 lag: (min: 0.0, avg: 11.2, max: 20.0) [2024-06-28 01:42:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:42:56,462][06909] Updated weights for policy 0, policy_version 121473 (0.0031) [2024-06-28 01:42:58,850][06674] Fps is (10 sec: 47513.6, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 1990344704. Throughput: 0: 44127.5. Samples: 1893214880. Policy #0 lag: (min: 0.0, avg: 11.2, max: 20.0) [2024-06-28 01:42:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:43:00,142][06909] Updated weights for policy 0, policy_version 121483 (0.0027) [2024-06-28 01:43:03,777][06909] Updated weights for policy 0, policy_version 121493 (0.0027) [2024-06-28 01:43:03,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.8, 300 sec: 43931.4). Total num frames: 1990541312. Throughput: 0: 43972.5. Samples: 1893470360. Policy #0 lag: (min: 0.0, avg: 11.2, max: 20.0) [2024-06-28 01:43:03,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 01:43:07,238][06909] Updated weights for policy 0, policy_version 121503 (0.0043) [2024-06-28 01:43:08,850][06674] Fps is (10 sec: 42598.1, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 1990770688. Throughput: 0: 43843.8. Samples: 1893734700. Policy #0 lag: (min: 0.0, avg: 11.2, max: 20.0) [2024-06-28 01:43:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:43:10,891][06909] Updated weights for policy 0, policy_version 121513 (0.0034) [2024-06-28 01:43:13,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1991000064. Throughput: 0: 44065.8. Samples: 1893874540. Policy #0 lag: (min: 0.0, avg: 11.2, max: 20.0) [2024-06-28 01:43:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 01:43:14,496][06909] Updated weights for policy 0, policy_version 121523 (0.0033) [2024-06-28 01:43:18,507][06909] Updated weights for policy 0, policy_version 121533 (0.0037) [2024-06-28 01:43:18,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 1991213056. Throughput: 0: 43980.8. Samples: 1894136380. Policy #0 lag: (min: 0.0, avg: 11.2, max: 20.0) [2024-06-28 01:43:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:43:22,173][06909] Updated weights for policy 0, policy_version 121543 (0.0032) [2024-06-28 01:43:23,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 1991442432. Throughput: 0: 43976.4. Samples: 1894399640. Policy #0 lag: (min: 0.0, avg: 11.2, max: 20.0) [2024-06-28 01:43:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:43:25,741][06909] Updated weights for policy 0, policy_version 121553 (0.0037) [2024-06-28 01:43:28,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43690.8, 300 sec: 44098.0). Total num frames: 1991655424. Throughput: 0: 44028.0. Samples: 1894535160. Policy #0 lag: (min: 0.0, avg: 11.2, max: 20.0) [2024-06-28 01:43:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:43:29,287][06909] Updated weights for policy 0, policy_version 121563 (0.0031) [2024-06-28 01:43:33,342][06909] Updated weights for policy 0, policy_version 121573 (0.0038) [2024-06-28 01:43:33,852][06674] Fps is (10 sec: 44228.4, 60 sec: 44235.3, 300 sec: 44042.1). Total num frames: 1991884800. Throughput: 0: 44155.4. Samples: 1894799360. Policy #0 lag: (min: 0.0, avg: 11.2, max: 20.0) [2024-06-28 01:43:33,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:43:36,966][06909] Updated weights for policy 0, policy_version 121583 (0.0034) [2024-06-28 01:43:38,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 1992097792. Throughput: 0: 44062.2. Samples: 1895061280. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 01:43:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:43:40,614][06909] Updated weights for policy 0, policy_version 121593 (0.0039) [2024-06-28 01:43:43,850][06674] Fps is (10 sec: 44245.9, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 1992327168. Throughput: 0: 43976.6. Samples: 1895193820. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 01:43:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 01:43:44,243][06909] Updated weights for policy 0, policy_version 121603 (0.0040) [2024-06-28 01:43:47,929][06909] Updated weights for policy 0, policy_version 121613 (0.0031) [2024-06-28 01:43:48,850][06674] Fps is (10 sec: 44236.2, 60 sec: 44509.8, 300 sec: 43931.3). Total num frames: 1992540160. Throughput: 0: 44349.1. Samples: 1895466080. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 01:43:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:43:48,871][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000121615_1992540160.pth... [2024-06-28 01:43:48,928][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000120970_1981972480.pth [2024-06-28 01:43:51,494][06909] Updated weights for policy 0, policy_version 121623 (0.0028) [2024-06-28 01:43:53,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 1992753152. Throughput: 0: 44214.4. Samples: 1895724340. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 01:43:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:43:55,583][06909] Updated weights for policy 0, policy_version 121633 (0.0039) [2024-06-28 01:43:58,850][06674] Fps is (10 sec: 42599.4, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 1992966144. Throughput: 0: 43973.4. Samples: 1895853340. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 01:43:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:43:59,202][06909] Updated weights for policy 0, policy_version 121643 (0.0033) [2024-06-28 01:44:02,766][06909] Updated weights for policy 0, policy_version 121653 (0.0026) [2024-06-28 01:44:03,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44782.9, 300 sec: 44042.4). Total num frames: 1993228288. Throughput: 0: 44178.3. Samples: 1896124400. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 01:44:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:44:06,445][06909] Updated weights for policy 0, policy_version 121663 (0.0025) [2024-06-28 01:44:08,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.9, 300 sec: 44098.1). Total num frames: 1993424896. Throughput: 0: 44115.2. Samples: 1896384820. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 01:44:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:44:10,258][06909] Updated weights for policy 0, policy_version 121673 (0.0032) [2024-06-28 01:44:13,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 1993637888. Throughput: 0: 43975.4. Samples: 1896514060. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 01:44:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:44:14,101][06909] Updated weights for policy 0, policy_version 121683 (0.0025) [2024-06-28 01:44:17,551][06909] Updated weights for policy 0, policy_version 121693 (0.0034) [2024-06-28 01:44:18,850][06674] Fps is (10 sec: 45874.1, 60 sec: 44509.7, 300 sec: 44042.4). Total num frames: 1993883648. Throughput: 0: 44208.9. Samples: 1896788680. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 01:44:18,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:44:21,319][06909] Updated weights for policy 0, policy_version 121703 (0.0028) [2024-06-28 01:44:22,961][06887] Signal inference workers to stop experience collection... (26950 times) [2024-06-28 01:44:23,013][06887] Signal inference workers to resume experience collection... (26950 times) [2024-06-28 01:44:23,014][06909] InferenceWorker_p0-w0: stopping experience collection (26950 times) [2024-06-28 01:44:23,040][06909] InferenceWorker_p0-w0: resuming experience collection (26950 times) [2024-06-28 01:44:23,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 1994080256. Throughput: 0: 44134.6. Samples: 1897047340. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 01:44:23,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 01:44:25,053][06909] Updated weights for policy 0, policy_version 121713 (0.0033) [2024-06-28 01:44:28,850][06674] Fps is (10 sec: 40961.0, 60 sec: 43963.7, 300 sec: 44042.7). Total num frames: 1994293248. Throughput: 0: 44123.5. Samples: 1897179380. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 01:44:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:44:29,105][06909] Updated weights for policy 0, policy_version 121723 (0.0036) [2024-06-28 01:44:32,456][06909] Updated weights for policy 0, policy_version 121733 (0.0034) [2024-06-28 01:44:33,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44238.3, 300 sec: 44042.4). Total num frames: 1994539008. Throughput: 0: 43911.3. Samples: 1897442080. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 01:44:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:44:36,397][06909] Updated weights for policy 0, policy_version 121743 (0.0034) [2024-06-28 01:44:38,850][06674] Fps is (10 sec: 45874.4, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 1994752000. Throughput: 0: 44217.6. Samples: 1897714140. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 01:44:38,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:44:40,011][06909] Updated weights for policy 0, policy_version 121753 (0.0031) [2024-06-28 01:44:43,637][06909] Updated weights for policy 0, policy_version 121763 (0.0040) [2024-06-28 01:44:43,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 1994964992. Throughput: 0: 44134.2. Samples: 1897839380. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 01:44:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:44:47,278][06909] Updated weights for policy 0, policy_version 121773 (0.0043) [2024-06-28 01:44:48,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 1995194368. Throughput: 0: 44122.1. Samples: 1898109900. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 01:44:48,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:44:51,194][06909] Updated weights for policy 0, policy_version 121783 (0.0030) [2024-06-28 01:44:53,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 1995407360. Throughput: 0: 44157.3. Samples: 1898371900. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 01:44:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:44:54,950][06909] Updated weights for policy 0, policy_version 121793 (0.0030) [2024-06-28 01:44:58,683][06909] Updated weights for policy 0, policy_version 121803 (0.0029) [2024-06-28 01:44:58,850][06674] Fps is (10 sec: 42598.2, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 1995620352. Throughput: 0: 44068.4. Samples: 1898497140. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 01:44:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:45:02,250][06909] Updated weights for policy 0, policy_version 121813 (0.0030) [2024-06-28 01:45:03,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 1995866112. Throughput: 0: 43853.1. Samples: 1898762060. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 01:45:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:45:06,357][06909] Updated weights for policy 0, policy_version 121823 (0.0028) [2024-06-28 01:45:08,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 1996079104. Throughput: 0: 44075.1. Samples: 1899030720. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 01:45:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:45:09,682][06909] Updated weights for policy 0, policy_version 121833 (0.0027) [2024-06-28 01:45:13,850][06674] Fps is (10 sec: 39322.0, 60 sec: 43690.8, 300 sec: 43986.9). Total num frames: 1996259328. Throughput: 0: 43864.9. Samples: 1899153300. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 01:45:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:45:13,930][06909] Updated weights for policy 0, policy_version 121843 (0.0030) [2024-06-28 01:45:17,166][06909] Updated weights for policy 0, policy_version 121853 (0.0027) [2024-06-28 01:45:18,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.8, 300 sec: 44098.2). Total num frames: 1996521472. Throughput: 0: 44018.1. Samples: 1899422900. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 01:45:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:45:21,407][06909] Updated weights for policy 0, policy_version 121863 (0.0028) [2024-06-28 01:45:23,856][06674] Fps is (10 sec: 47484.4, 60 sec: 44232.4, 300 sec: 44041.5). Total num frames: 1996734464. Throughput: 0: 43799.5. Samples: 1899685380. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 01:45:23,857][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:45:24,762][06909] Updated weights for policy 0, policy_version 121873 (0.0037) [2024-06-28 01:45:28,850][06674] Fps is (10 sec: 39321.9, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 1996914688. Throughput: 0: 43963.5. Samples: 1899817740. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 01:45:28,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:45:29,006][06909] Updated weights for policy 0, policy_version 121883 (0.0034) [2024-06-28 01:45:31,898][06909] Updated weights for policy 0, policy_version 121893 (0.0036) [2024-06-28 01:45:33,850][06674] Fps is (10 sec: 44263.0, 60 sec: 43963.6, 300 sec: 44153.5). Total num frames: 1997176832. Throughput: 0: 43811.0. Samples: 1900081400. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 01:45:33,856][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:45:36,219][06909] Updated weights for policy 0, policy_version 121903 (0.0030) [2024-06-28 01:45:38,850][06674] Fps is (10 sec: 47513.8, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 1997389824. Throughput: 0: 44006.7. Samples: 1900352200. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 01:45:38,856][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:45:39,236][06909] Updated weights for policy 0, policy_version 121913 (0.0035) [2024-06-28 01:45:43,506][06909] Updated weights for policy 0, policy_version 121923 (0.0025) [2024-06-28 01:45:43,850][06674] Fps is (10 sec: 40960.7, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 1997586432. Throughput: 0: 43933.5. Samples: 1900474140. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 01:45:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:45:47,001][06909] Updated weights for policy 0, policy_version 121933 (0.0033) [2024-06-28 01:45:48,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 1997832192. Throughput: 0: 43929.4. Samples: 1900738880. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 01:45:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:45:48,864][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000121938_1997832192.pth... [2024-06-28 01:45:48,930][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000121292_1987248128.pth [2024-06-28 01:45:50,970][06909] Updated weights for policy 0, policy_version 121943 (0.0035) [2024-06-28 01:45:53,851][06674] Fps is (10 sec: 45869.1, 60 sec: 43962.8, 300 sec: 44042.2). Total num frames: 1998045184. Throughput: 0: 43861.5. Samples: 1901004540. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 01:45:53,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:45:54,482][06909] Updated weights for policy 0, policy_version 121953 (0.0041) [2024-06-28 01:45:55,071][06887] Signal inference workers to stop experience collection... (27000 times) [2024-06-28 01:45:55,071][06887] Signal inference workers to resume experience collection... (27000 times) [2024-06-28 01:45:55,119][06909] InferenceWorker_p0-w0: stopping experience collection (27000 times) [2024-06-28 01:45:55,120][06909] InferenceWorker_p0-w0: resuming experience collection (27000 times) [2024-06-28 01:45:58,573][06909] Updated weights for policy 0, policy_version 121963 (0.0031) [2024-06-28 01:45:58,850][06674] Fps is (10 sec: 40959.1, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 1998241792. Throughput: 0: 44089.6. Samples: 1901137340. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 01:45:58,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:46:01,755][06909] Updated weights for policy 0, policy_version 121973 (0.0030) [2024-06-28 01:46:03,850][06674] Fps is (10 sec: 44242.5, 60 sec: 43690.7, 300 sec: 44097.9). Total num frames: 1998487552. Throughput: 0: 43888.6. Samples: 1901397880. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 01:46:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:46:06,047][06909] Updated weights for policy 0, policy_version 121983 (0.0020) [2024-06-28 01:46:08,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43417.6, 300 sec: 43931.3). Total num frames: 1998684160. Throughput: 0: 44105.9. Samples: 1901669880. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 01:46:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:46:09,404][06909] Updated weights for policy 0, policy_version 121993 (0.0033) [2024-06-28 01:46:13,270][06909] Updated weights for policy 0, policy_version 122003 (0.0039) [2024-06-28 01:46:13,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 1998897152. Throughput: 0: 43989.4. Samples: 1901797260. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 01:46:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:46:16,683][06909] Updated weights for policy 0, policy_version 122013 (0.0029) [2024-06-28 01:46:18,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43690.8, 300 sec: 44098.0). Total num frames: 1999142912. Throughput: 0: 43930.8. Samples: 1902058280. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 01:46:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:46:20,974][06909] Updated weights for policy 0, policy_version 122023 (0.0024) [2024-06-28 01:46:23,852][06674] Fps is (10 sec: 47503.9, 60 sec: 43966.7, 300 sec: 44042.1). Total num frames: 1999372288. Throughput: 0: 43753.6. Samples: 1902321200. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 01:46:23,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 01:46:24,363][06909] Updated weights for policy 0, policy_version 122033 (0.0039) [2024-06-28 01:46:28,182][06909] Updated weights for policy 0, policy_version 122043 (0.0026) [2024-06-28 01:46:28,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 1999585280. Throughput: 0: 44032.9. Samples: 1902455620. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 01:46:28,855][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:46:31,518][06909] Updated weights for policy 0, policy_version 122053 (0.0026) [2024-06-28 01:46:33,850][06674] Fps is (10 sec: 44245.5, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 1999814656. Throughput: 0: 44134.1. Samples: 1902724920. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 01:46:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:46:35,646][06909] Updated weights for policy 0, policy_version 122063 (0.0041) [2024-06-28 01:46:38,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2000027648. Throughput: 0: 44158.5. Samples: 1902991620. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 01:46:38,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 01:46:38,890][06909] Updated weights for policy 0, policy_version 122073 (0.0033) [2024-06-28 01:46:43,034][06909] Updated weights for policy 0, policy_version 122083 (0.0035) [2024-06-28 01:46:43,850][06674] Fps is (10 sec: 42599.0, 60 sec: 44236.8, 300 sec: 44043.3). Total num frames: 2000240640. Throughput: 0: 44098.0. Samples: 1903121740. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 01:46:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:46:46,405][06909] Updated weights for policy 0, policy_version 122093 (0.0042) [2024-06-28 01:46:48,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.7, 300 sec: 44098.0). Total num frames: 2000486400. Throughput: 0: 44189.7. Samples: 1903386420. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 01:46:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:46:50,624][06909] Updated weights for policy 0, policy_version 122103 (0.0026) [2024-06-28 01:46:53,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43964.7, 300 sec: 43986.9). Total num frames: 2000683008. Throughput: 0: 44025.9. Samples: 1903651040. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 01:46:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:46:53,981][06909] Updated weights for policy 0, policy_version 122113 (0.0027) [2024-06-28 01:46:57,930][06909] Updated weights for policy 0, policy_version 122123 (0.0032) [2024-06-28 01:46:58,850][06674] Fps is (10 sec: 42598.9, 60 sec: 44510.0, 300 sec: 44153.5). Total num frames: 2000912384. Throughput: 0: 44142.7. Samples: 1903783680. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 01:46:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:47:01,170][06909] Updated weights for policy 0, policy_version 122133 (0.0034) [2024-06-28 01:47:03,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2001108992. Throughput: 0: 44137.4. Samples: 1904044460. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 01:47:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:47:05,152][06909] Updated weights for policy 0, policy_version 122143 (0.0040) [2024-06-28 01:47:08,465][06909] Updated weights for policy 0, policy_version 122153 (0.0031) [2024-06-28 01:47:08,850][06674] Fps is (10 sec: 44236.2, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 2001354752. Throughput: 0: 44418.8. Samples: 1904319960. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 01:47:08,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 01:47:12,407][06909] Updated weights for policy 0, policy_version 122163 (0.0028) [2024-06-28 01:47:13,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44783.0, 300 sec: 44153.5). Total num frames: 2001584128. Throughput: 0: 44424.5. Samples: 1904454720. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 01:47:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:47:16,142][06909] Updated weights for policy 0, policy_version 122173 (0.0023) [2024-06-28 01:47:16,428][06887] Signal inference workers to stop experience collection... (27050 times) [2024-06-28 01:47:16,487][06887] Signal inference workers to resume experience collection... (27050 times) [2024-06-28 01:47:16,488][06909] InferenceWorker_p0-w0: stopping experience collection (27050 times) [2024-06-28 01:47:16,501][06909] InferenceWorker_p0-w0: resuming experience collection (27050 times) [2024-06-28 01:47:18,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2001797120. Throughput: 0: 44217.4. Samples: 1904714700. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 01:47:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:47:19,962][06909] Updated weights for policy 0, policy_version 122183 (0.0036) [2024-06-28 01:47:23,505][06909] Updated weights for policy 0, policy_version 122193 (0.0025) [2024-06-28 01:47:23,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43965.2, 300 sec: 43986.9). Total num frames: 2002010112. Throughput: 0: 44255.2. Samples: 1904983100. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 01:47:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:47:27,473][06909] Updated weights for policy 0, policy_version 122203 (0.0042) [2024-06-28 01:47:28,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2002239488. Throughput: 0: 44177.3. Samples: 1905109720. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 01:47:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 01:47:31,362][06909] Updated weights for policy 0, policy_version 122213 (0.0033) [2024-06-28 01:47:33,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2002452480. Throughput: 0: 44047.6. Samples: 1905368560. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 01:47:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:47:34,794][06909] Updated weights for policy 0, policy_version 122223 (0.0026) [2024-06-28 01:47:38,442][06909] Updated weights for policy 0, policy_version 122233 (0.0028) [2024-06-28 01:47:38,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.9, 300 sec: 43986.9). Total num frames: 2002665472. Throughput: 0: 44238.7. Samples: 1905641780. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 01:47:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:47:42,358][06909] Updated weights for policy 0, policy_version 122243 (0.0029) [2024-06-28 01:47:43,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44509.8, 300 sec: 44209.0). Total num frames: 2002911232. Throughput: 0: 44268.3. Samples: 1905775760. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 01:47:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 01:47:45,670][06909] Updated weights for policy 0, policy_version 122253 (0.0028) [2024-06-28 01:47:48,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43690.7, 300 sec: 44097.9). Total num frames: 2003107840. Throughput: 0: 44181.7. Samples: 1906032640. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 01:47:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:47:48,863][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000122260_2003107840.pth... [2024-06-28 01:47:48,929][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000121615_1992540160.pth [2024-06-28 01:47:49,667][06909] Updated weights for policy 0, policy_version 122263 (0.0031) [2024-06-28 01:47:53,410][06909] Updated weights for policy 0, policy_version 122273 (0.0044) [2024-06-28 01:47:53,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2003320832. Throughput: 0: 44022.8. Samples: 1906300980. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 01:47:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 01:47:57,054][06909] Updated weights for policy 0, policy_version 122283 (0.0052) [2024-06-28 01:47:58,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 2003566592. Throughput: 0: 44006.5. Samples: 1906435020. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 01:47:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:48:00,649][06909] Updated weights for policy 0, policy_version 122293 (0.0022) [2024-06-28 01:48:03,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 2003779584. Throughput: 0: 44139.1. Samples: 1906700960. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 01:48:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:48:04,542][06909] Updated weights for policy 0, policy_version 122303 (0.0030) [2024-06-28 01:48:08,187][06909] Updated weights for policy 0, policy_version 122313 (0.0036) [2024-06-28 01:48:08,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2003992576. Throughput: 0: 43976.0. Samples: 1906962020. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 01:48:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:48:11,971][06909] Updated weights for policy 0, policy_version 122323 (0.0028) [2024-06-28 01:48:13,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2004221952. Throughput: 0: 44133.8. Samples: 1907095740. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 01:48:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:48:15,581][06909] Updated weights for policy 0, policy_version 122333 (0.0037) [2024-06-28 01:48:18,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2004434944. Throughput: 0: 44200.0. Samples: 1907357560. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 01:48:18,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-28 01:48:19,611][06909] Updated weights for policy 0, policy_version 122343 (0.0033) [2024-06-28 01:48:22,785][06909] Updated weights for policy 0, policy_version 122353 (0.0021) [2024-06-28 01:48:23,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2004664320. Throughput: 0: 44082.1. Samples: 1907625480. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 01:48:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:48:26,775][06909] Updated weights for policy 0, policy_version 122363 (0.0034) [2024-06-28 01:48:28,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.8, 300 sec: 44098.2). Total num frames: 2004893696. Throughput: 0: 44036.5. Samples: 1907757400. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 01:48:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:48:30,501][06909] Updated weights for policy 0, policy_version 122373 (0.0037) [2024-06-28 01:48:33,850][06674] Fps is (10 sec: 42597.7, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 2005090304. Throughput: 0: 44260.8. Samples: 1908024380. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 01:48:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:48:34,260][06909] Updated weights for policy 0, policy_version 122383 (0.0028) [2024-06-28 01:48:37,707][06909] Updated weights for policy 0, policy_version 122393 (0.0031) [2024-06-28 01:48:38,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2005303296. Throughput: 0: 44085.7. Samples: 1908284840. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 01:48:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:48:39,451][06887] Signal inference workers to stop experience collection... (27100 times) [2024-06-28 01:48:39,452][06887] Signal inference workers to resume experience collection... (27100 times) [2024-06-28 01:48:39,485][06909] InferenceWorker_p0-w0: stopping experience collection (27100 times) [2024-06-28 01:48:39,485][06909] InferenceWorker_p0-w0: resuming experience collection (27100 times) [2024-06-28 01:48:41,763][06909] Updated weights for policy 0, policy_version 122403 (0.0036) [2024-06-28 01:48:43,850][06674] Fps is (10 sec: 45875.9, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2005549056. Throughput: 0: 43999.6. Samples: 1908415000. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 01:48:43,856][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:48:45,460][06909] Updated weights for policy 0, policy_version 122413 (0.0042) [2024-06-28 01:48:48,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2005745664. Throughput: 0: 43977.7. Samples: 1908679960. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 01:48:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:48:49,167][06909] Updated weights for policy 0, policy_version 122423 (0.0036) [2024-06-28 01:48:52,696][06909] Updated weights for policy 0, policy_version 122433 (0.0030) [2024-06-28 01:48:53,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2005975040. Throughput: 0: 44027.9. Samples: 1908943280. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 01:48:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:48:56,865][06909] Updated weights for policy 0, policy_version 122443 (0.0030) [2024-06-28 01:48:58,852][06674] Fps is (10 sec: 45865.9, 60 sec: 43962.3, 300 sec: 43986.6). Total num frames: 2006204416. Throughput: 0: 44098.8. Samples: 1909080280. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 01:48:58,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:49:00,219][06909] Updated weights for policy 0, policy_version 122453 (0.0033) [2024-06-28 01:49:03,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43417.5, 300 sec: 43931.3). Total num frames: 2006384640. Throughput: 0: 44014.2. Samples: 1909338200. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 01:49:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:49:04,360][06909] Updated weights for policy 0, policy_version 122463 (0.0046) [2024-06-28 01:49:07,835][06909] Updated weights for policy 0, policy_version 122473 (0.0028) [2024-06-28 01:49:08,850][06674] Fps is (10 sec: 42607.4, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2006630400. Throughput: 0: 43806.7. Samples: 1909596780. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 01:49:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:49:11,743][06909] Updated weights for policy 0, policy_version 122483 (0.0035) [2024-06-28 01:49:13,850][06674] Fps is (10 sec: 47513.7, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2006859776. Throughput: 0: 43961.4. Samples: 1909735660. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 01:49:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:49:15,098][06909] Updated weights for policy 0, policy_version 122493 (0.0028) [2024-06-28 01:49:18,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2007056384. Throughput: 0: 43770.8. Samples: 1909994060. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 01:49:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:49:19,139][06909] Updated weights for policy 0, policy_version 122503 (0.0032) [2024-06-28 01:49:22,514][06909] Updated weights for policy 0, policy_version 122513 (0.0032) [2024-06-28 01:49:23,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2007285760. Throughput: 0: 43839.6. Samples: 1910257620. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 01:49:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:49:26,539][06909] Updated weights for policy 0, policy_version 122523 (0.0025) [2024-06-28 01:49:28,850][06674] Fps is (10 sec: 47513.3, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2007531520. Throughput: 0: 43983.0. Samples: 1910394240. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 01:49:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:49:30,121][06909] Updated weights for policy 0, policy_version 122533 (0.0032) [2024-06-28 01:49:33,852][06674] Fps is (10 sec: 44227.5, 60 sec: 43962.3, 300 sec: 43986.6). Total num frames: 2007728128. Throughput: 0: 44063.3. Samples: 1910662900. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 01:49:33,852][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 01:49:34,025][06909] Updated weights for policy 0, policy_version 122543 (0.0046) [2024-06-28 01:49:37,264][06909] Updated weights for policy 0, policy_version 122553 (0.0027) [2024-06-28 01:49:38,850][06674] Fps is (10 sec: 42599.0, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2007957504. Throughput: 0: 43821.4. Samples: 1910915240. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 01:49:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:49:41,606][06909] Updated weights for policy 0, policy_version 122563 (0.0029) [2024-06-28 01:49:43,850][06674] Fps is (10 sec: 45884.4, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2008186880. Throughput: 0: 43896.6. Samples: 1911055540. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 01:49:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:49:44,847][06909] Updated weights for policy 0, policy_version 122573 (0.0033) [2024-06-28 01:49:48,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2008383488. Throughput: 0: 44115.7. Samples: 1911323400. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 01:49:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 01:49:48,873][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000122582_2008383488.pth... [2024-06-28 01:49:48,941][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000121938_1997832192.pth [2024-06-28 01:49:49,084][06909] Updated weights for policy 0, policy_version 122583 (0.0021) [2024-06-28 01:49:52,232][06909] Updated weights for policy 0, policy_version 122593 (0.0032) [2024-06-28 01:49:53,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2008596480. Throughput: 0: 44136.4. Samples: 1911582920. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 01:49:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:49:56,306][06909] Updated weights for policy 0, policy_version 122603 (0.0032) [2024-06-28 01:49:58,850][06674] Fps is (10 sec: 45874.3, 60 sec: 43965.1, 300 sec: 43986.9). Total num frames: 2008842240. Throughput: 0: 43971.9. Samples: 1911714400. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 01:49:58,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:49:59,560][06909] Updated weights for policy 0, policy_version 122613 (0.0039) [2024-06-28 01:50:03,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 2009038848. Throughput: 0: 44298.2. Samples: 1911987480. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 01:50:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:50:04,049][06909] Updated weights for policy 0, policy_version 122623 (0.0039) [2024-06-28 01:50:04,144][06887] Signal inference workers to stop experience collection... (27150 times) [2024-06-28 01:50:04,166][06909] InferenceWorker_p0-w0: stopping experience collection (27150 times) [2024-06-28 01:50:04,204][06887] Signal inference workers to resume experience collection... (27150 times) [2024-06-28 01:50:04,204][06909] InferenceWorker_p0-w0: resuming experience collection (27150 times) [2024-06-28 01:50:06,844][06909] Updated weights for policy 0, policy_version 122633 (0.0031) [2024-06-28 01:50:08,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2009268224. Throughput: 0: 44263.1. Samples: 1912249460. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 01:50:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:50:11,307][06909] Updated weights for policy 0, policy_version 122643 (0.0028) [2024-06-28 01:50:13,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2009497600. Throughput: 0: 44162.4. Samples: 1912381540. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 01:50:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:50:14,197][06909] Updated weights for policy 0, policy_version 122653 (0.0028) [2024-06-28 01:50:18,687][06909] Updated weights for policy 0, policy_version 122663 (0.0043) [2024-06-28 01:50:18,850][06674] Fps is (10 sec: 44236.1, 60 sec: 44236.7, 300 sec: 43987.8). Total num frames: 2009710592. Throughput: 0: 44106.3. Samples: 1912647600. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 01:50:18,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:50:21,633][06909] Updated weights for policy 0, policy_version 122673 (0.0020) [2024-06-28 01:50:23,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2009923584. Throughput: 0: 44262.2. Samples: 1912907040. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 01:50:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:50:26,170][06909] Updated weights for policy 0, policy_version 122683 (0.0032) [2024-06-28 01:50:28,850][06674] Fps is (10 sec: 45875.8, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2010169344. Throughput: 0: 44172.1. Samples: 1913043280. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 01:50:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:50:29,046][06909] Updated weights for policy 0, policy_version 122693 (0.0028) [2024-06-28 01:50:33,698][06909] Updated weights for policy 0, policy_version 122703 (0.0042) [2024-06-28 01:50:33,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43965.2, 300 sec: 43986.9). Total num frames: 2010365952. Throughput: 0: 43922.6. Samples: 1913299920. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 01:50:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 01:50:36,702][06909] Updated weights for policy 0, policy_version 122713 (0.0028) [2024-06-28 01:50:38,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.6, 300 sec: 44097.9). Total num frames: 2010595328. Throughput: 0: 43933.7. Samples: 1913559940. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 01:50:38,853][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:50:41,279][06909] Updated weights for policy 0, policy_version 122723 (0.0035) [2024-06-28 01:50:43,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2010824704. Throughput: 0: 43964.6. Samples: 1913692800. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 01:50:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:50:44,140][06909] Updated weights for policy 0, policy_version 122733 (0.0036) [2024-06-28 01:50:48,730][06909] Updated weights for policy 0, policy_version 122743 (0.0027) [2024-06-28 01:50:48,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43963.7, 300 sec: 43987.1). Total num frames: 2011021312. Throughput: 0: 43804.5. Samples: 1913958680. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 01:50:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:50:51,519][06909] Updated weights for policy 0, policy_version 122753 (0.0023) [2024-06-28 01:50:53,850][06674] Fps is (10 sec: 42598.0, 60 sec: 44236.7, 300 sec: 44098.0). Total num frames: 2011250688. Throughput: 0: 43792.8. Samples: 1914220140. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 01:50:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:50:56,183][06909] Updated weights for policy 0, policy_version 122763 (0.0031) [2024-06-28 01:50:58,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2011496448. Throughput: 0: 43906.7. Samples: 1914357340. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 01:50:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:50:59,146][06909] Updated weights for policy 0, policy_version 122773 (0.0023) [2024-06-28 01:51:03,391][06909] Updated weights for policy 0, policy_version 122783 (0.0031) [2024-06-28 01:51:03,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2011693056. Throughput: 0: 43917.9. Samples: 1914623900. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 01:51:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:51:06,624][06909] Updated weights for policy 0, policy_version 122793 (0.0034) [2024-06-28 01:51:08,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2011922432. Throughput: 0: 43887.6. Samples: 1914881980. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 01:51:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 01:51:10,792][06909] Updated weights for policy 0, policy_version 122803 (0.0021) [2024-06-28 01:51:13,175][06887] Signal inference workers to stop experience collection... (27200 times) [2024-06-28 01:51:13,176][06887] Signal inference workers to resume experience collection... (27200 times) [2024-06-28 01:51:13,187][06909] InferenceWorker_p0-w0: stopping experience collection (27200 times) [2024-06-28 01:51:13,187][06909] InferenceWorker_p0-w0: resuming experience collection (27200 times) [2024-06-28 01:51:13,769][06909] Updated weights for policy 0, policy_version 122813 (0.0034) [2024-06-28 01:51:13,850][06674] Fps is (10 sec: 47513.3, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 2012168192. Throughput: 0: 43767.9. Samples: 1915012840. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 01:51:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:51:18,160][06909] Updated weights for policy 0, policy_version 122823 (0.0030) [2024-06-28 01:51:18,850][06674] Fps is (10 sec: 45874.6, 60 sec: 44509.9, 300 sec: 44098.2). Total num frames: 2012381184. Throughput: 0: 44136.4. Samples: 1915286060. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 01:51:18,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:51:21,091][06909] Updated weights for policy 0, policy_version 122833 (0.0027) [2024-06-28 01:51:23,850][06674] Fps is (10 sec: 40960.1, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2012577792. Throughput: 0: 44296.0. Samples: 1915553260. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 01:51:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:51:25,470][06909] Updated weights for policy 0, policy_version 122843 (0.0033) [2024-06-28 01:51:28,401][06909] Updated weights for policy 0, policy_version 122853 (0.0050) [2024-06-28 01:51:28,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2012823552. Throughput: 0: 44059.0. Samples: 1915675460. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 01:51:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:51:33,031][06909] Updated weights for policy 0, policy_version 122863 (0.0047) [2024-06-28 01:51:33,850][06674] Fps is (10 sec: 45875.8, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 2013036544. Throughput: 0: 44313.8. Samples: 1915952800. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 01:51:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:51:35,969][06909] Updated weights for policy 0, policy_version 122873 (0.0034) [2024-06-28 01:51:38,852][06674] Fps is (10 sec: 42589.9, 60 sec: 44235.4, 300 sec: 44097.6). Total num frames: 2013249536. Throughput: 0: 44219.4. Samples: 1916210100. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 01:51:38,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:51:40,585][06909] Updated weights for policy 0, policy_version 122883 (0.0026) [2024-06-28 01:51:43,475][06909] Updated weights for policy 0, policy_version 122893 (0.0036) [2024-06-28 01:51:43,853][06674] Fps is (10 sec: 44223.5, 60 sec: 44234.6, 300 sec: 44042.0). Total num frames: 2013478912. Throughput: 0: 43934.9. Samples: 1916334540. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 01:51:43,853][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:51:47,930][06909] Updated weights for policy 0, policy_version 122903 (0.0034) [2024-06-28 01:51:48,850][06674] Fps is (10 sec: 42607.5, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2013675520. Throughput: 0: 44069.0. Samples: 1916607000. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 01:51:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:51:48,885][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000122906_2013691904.pth... [2024-06-28 01:51:48,929][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000122260_2003107840.pth [2024-06-28 01:51:51,499][06909] Updated weights for policy 0, policy_version 122913 (0.0038) [2024-06-28 01:51:53,850][06674] Fps is (10 sec: 42610.4, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2013904896. Throughput: 0: 44116.3. Samples: 1916867220. Policy #0 lag: (min: 1.0, avg: 9.7, max: 20.0) [2024-06-28 01:51:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 01:51:55,300][06909] Updated weights for policy 0, policy_version 122923 (0.0041) [2024-06-28 01:51:58,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43690.6, 300 sec: 44097.9). Total num frames: 2014117888. Throughput: 0: 43960.0. Samples: 1916991040. Policy #0 lag: (min: 1.0, avg: 9.7, max: 20.0) [2024-06-28 01:51:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 01:51:58,953][06909] Updated weights for policy 0, policy_version 122933 (0.0040) [2024-06-28 01:52:02,695][06909] Updated weights for policy 0, policy_version 122943 (0.0035) [2024-06-28 01:52:03,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44509.8, 300 sec: 44097.9). Total num frames: 2014363648. Throughput: 0: 44044.0. Samples: 1917268040. Policy #0 lag: (min: 1.0, avg: 9.7, max: 20.0) [2024-06-28 01:52:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:52:06,337][06909] Updated weights for policy 0, policy_version 122953 (0.0031) [2024-06-28 01:52:08,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 2014560256. Throughput: 0: 44107.1. Samples: 1917538080. Policy #0 lag: (min: 1.0, avg: 9.7, max: 20.0) [2024-06-28 01:52:08,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:52:09,977][06909] Updated weights for policy 0, policy_version 122963 (0.0033) [2024-06-28 01:52:13,638][06909] Updated weights for policy 0, policy_version 122973 (0.0034) [2024-06-28 01:52:13,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2014789632. Throughput: 0: 44290.7. Samples: 1917668540. Policy #0 lag: (min: 1.0, avg: 9.7, max: 20.0) [2024-06-28 01:52:13,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:52:17,446][06909] Updated weights for policy 0, policy_version 122983 (0.0027) [2024-06-28 01:52:18,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43690.8, 300 sec: 44042.4). Total num frames: 2015002624. Throughput: 0: 43880.9. Samples: 1917927440. Policy #0 lag: (min: 1.0, avg: 9.7, max: 20.0) [2024-06-28 01:52:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:52:20,901][06909] Updated weights for policy 0, policy_version 122993 (0.0034) [2024-06-28 01:52:22,204][06887] Signal inference workers to stop experience collection... (27250 times) [2024-06-28 01:52:22,204][06887] Signal inference workers to resume experience collection... (27250 times) [2024-06-28 01:52:22,243][06909] InferenceWorker_p0-w0: stopping experience collection (27250 times) [2024-06-28 01:52:22,243][06909] InferenceWorker_p0-w0: resuming experience collection (27250 times) [2024-06-28 01:52:23,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2015215616. Throughput: 0: 44075.8. Samples: 1918193420. Policy #0 lag: (min: 1.0, avg: 9.7, max: 20.0) [2024-06-28 01:52:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:52:24,937][06909] Updated weights for policy 0, policy_version 123003 (0.0036) [2024-06-28 01:52:28,572][06909] Updated weights for policy 0, policy_version 123013 (0.0038) [2024-06-28 01:52:28,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2015444992. Throughput: 0: 44131.3. Samples: 1918320320. Policy #0 lag: (min: 1.0, avg: 9.7, max: 20.0) [2024-06-28 01:52:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:52:32,397][06909] Updated weights for policy 0, policy_version 123023 (0.0043) [2024-06-28 01:52:33,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2015674368. Throughput: 0: 44071.5. Samples: 1918590220. Policy #0 lag: (min: 1.0, avg: 9.7, max: 20.0) [2024-06-28 01:52:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 01:52:36,367][06909] Updated weights for policy 0, policy_version 123033 (0.0032) [2024-06-28 01:52:38,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43965.2, 300 sec: 43986.9). Total num frames: 2015887360. Throughput: 0: 44129.9. Samples: 1918853060. Policy #0 lag: (min: 1.0, avg: 9.7, max: 20.0) [2024-06-28 01:52:38,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:52:39,769][06909] Updated weights for policy 0, policy_version 123043 (0.0048) [2024-06-28 01:52:43,579][06909] Updated weights for policy 0, policy_version 123053 (0.0035) [2024-06-28 01:52:43,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43965.9, 300 sec: 44098.0). Total num frames: 2016116736. Throughput: 0: 44406.7. Samples: 1918989340. Policy #0 lag: (min: 1.0, avg: 9.7, max: 20.0) [2024-06-28 01:52:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:52:46,997][06909] Updated weights for policy 0, policy_version 123063 (0.0041) [2024-06-28 01:52:48,850][06674] Fps is (10 sec: 44236.2, 60 sec: 44236.6, 300 sec: 44097.9). Total num frames: 2016329728. Throughput: 0: 44152.4. Samples: 1919254900. Policy #0 lag: (min: 1.0, avg: 9.7, max: 20.0) [2024-06-28 01:52:48,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:52:50,712][06909] Updated weights for policy 0, policy_version 123073 (0.0034) [2024-06-28 01:52:53,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.9, 300 sec: 43986.9). Total num frames: 2016542720. Throughput: 0: 43993.9. Samples: 1919517800. Policy #0 lag: (min: 1.0, avg: 9.7, max: 20.0) [2024-06-28 01:52:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:52:54,674][06909] Updated weights for policy 0, policy_version 123083 (0.0035) [2024-06-28 01:52:58,355][06909] Updated weights for policy 0, policy_version 123093 (0.0037) [2024-06-28 01:52:58,852][06674] Fps is (10 sec: 42590.4, 60 sec: 43962.3, 300 sec: 43986.6). Total num frames: 2016755712. Throughput: 0: 44047.4. Samples: 1919650760. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-28 01:52:58,852][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:53:01,758][06909] Updated weights for policy 0, policy_version 123103 (0.0025) [2024-06-28 01:53:03,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 2017001472. Throughput: 0: 44059.5. Samples: 1919910120. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-28 01:53:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:53:05,800][06909] Updated weights for policy 0, policy_version 123113 (0.0034) [2024-06-28 01:53:08,850][06674] Fps is (10 sec: 44246.0, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2017198080. Throughput: 0: 44088.0. Samples: 1920177380. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-28 01:53:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:53:09,501][06909] Updated weights for policy 0, policy_version 123123 (0.0040) [2024-06-28 01:53:13,162][06909] Updated weights for policy 0, policy_version 123133 (0.0030) [2024-06-28 01:53:13,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2017427456. Throughput: 0: 44202.2. Samples: 1920309420. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-28 01:53:13,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:53:16,880][06909] Updated weights for policy 0, policy_version 123143 (0.0041) [2024-06-28 01:53:18,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2017656832. Throughput: 0: 44010.3. Samples: 1920570680. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-28 01:53:18,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 01:53:20,677][06909] Updated weights for policy 0, policy_version 123153 (0.0034) [2024-06-28 01:53:23,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 2017869824. Throughput: 0: 44243.5. Samples: 1920844020. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-28 01:53:23,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:53:24,238][06909] Updated weights for policy 0, policy_version 123163 (0.0032) [2024-06-28 01:53:24,416][06887] Signal inference workers to stop experience collection... (27300 times) [2024-06-28 01:53:24,417][06887] Signal inference workers to resume experience collection... (27300 times) [2024-06-28 01:53:24,446][06909] InferenceWorker_p0-w0: stopping experience collection (27300 times) [2024-06-28 01:53:24,447][06909] InferenceWorker_p0-w0: resuming experience collection (27300 times) [2024-06-28 01:53:28,141][06909] Updated weights for policy 0, policy_version 123173 (0.0028) [2024-06-28 01:53:28,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2018066432. Throughput: 0: 44039.1. Samples: 1920971100. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-28 01:53:28,850][06674] Avg episode reward: [(0, '0.402')] [2024-06-28 01:53:31,666][06909] Updated weights for policy 0, policy_version 123183 (0.0044) [2024-06-28 01:53:33,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2018312192. Throughput: 0: 43898.4. Samples: 1921230320. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-28 01:53:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:53:35,984][06909] Updated weights for policy 0, policy_version 123193 (0.0040) [2024-06-28 01:53:38,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2018525184. Throughput: 0: 43959.4. Samples: 1921495980. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-28 01:53:38,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:53:39,319][06909] Updated weights for policy 0, policy_version 123203 (0.0035) [2024-06-28 01:53:43,185][06909] Updated weights for policy 0, policy_version 123213 (0.0054) [2024-06-28 01:53:43,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2018738176. Throughput: 0: 43806.9. Samples: 1921621980. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-28 01:53:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:53:46,705][06909] Updated weights for policy 0, policy_version 123223 (0.0024) [2024-06-28 01:53:48,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44236.9, 300 sec: 44097.9). Total num frames: 2018983936. Throughput: 0: 43833.7. Samples: 1921882640. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-28 01:53:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:53:48,868][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000123229_2018983936.pth... [2024-06-28 01:53:48,921][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000122582_2008383488.pth [2024-06-28 01:53:50,395][06909] Updated weights for policy 0, policy_version 123233 (0.0032) [2024-06-28 01:53:53,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44236.7, 300 sec: 44042.7). Total num frames: 2019196928. Throughput: 0: 44090.6. Samples: 1922161460. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-28 01:53:53,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:53:53,936][06909] Updated weights for policy 0, policy_version 123243 (0.0022) [2024-06-28 01:53:57,568][06909] Updated weights for policy 0, policy_version 123253 (0.0033) [2024-06-28 01:53:58,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44238.2, 300 sec: 44153.5). Total num frames: 2019409920. Throughput: 0: 43982.6. Samples: 1922288640. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 01:53:58,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:54:01,226][06909] Updated weights for policy 0, policy_version 123263 (0.0035) [2024-06-28 01:54:03,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2019622912. Throughput: 0: 44113.8. Samples: 1922555800. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 01:54:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:54:05,785][06909] Updated weights for policy 0, policy_version 123273 (0.0021) [2024-06-28 01:54:08,633][06909] Updated weights for policy 0, policy_version 123283 (0.0027) [2024-06-28 01:54:08,850][06674] Fps is (10 sec: 45875.8, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 2019868672. Throughput: 0: 44025.9. Samples: 1922825180. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 01:54:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 01:54:13,208][06909] Updated weights for policy 0, policy_version 123293 (0.0040) [2024-06-28 01:54:13,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2020065280. Throughput: 0: 44061.4. Samples: 1922953860. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 01:54:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 01:54:16,378][06909] Updated weights for policy 0, policy_version 123303 (0.0027) [2024-06-28 01:54:18,850][06674] Fps is (10 sec: 42597.6, 60 sec: 43963.6, 300 sec: 44097.9). Total num frames: 2020294656. Throughput: 0: 43965.6. Samples: 1923208780. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 01:54:18,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:54:20,373][06909] Updated weights for policy 0, policy_version 123313 (0.0031) [2024-06-28 01:54:23,767][06909] Updated weights for policy 0, policy_version 123323 (0.0038) [2024-06-28 01:54:23,850][06674] Fps is (10 sec: 45874.5, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2020524032. Throughput: 0: 44065.3. Samples: 1923478920. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 01:54:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:54:27,635][06909] Updated weights for policy 0, policy_version 123333 (0.0036) [2024-06-28 01:54:28,852][06674] Fps is (10 sec: 40952.2, 60 sec: 43962.2, 300 sec: 43986.9). Total num frames: 2020704256. Throughput: 0: 44041.6. Samples: 1923603940. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 01:54:28,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:54:31,126][06909] Updated weights for policy 0, policy_version 123343 (0.0030) [2024-06-28 01:54:33,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2020966400. Throughput: 0: 44312.0. Samples: 1923876680. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 01:54:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:54:34,885][06909] Updated weights for policy 0, policy_version 123353 (0.0032) [2024-06-28 01:54:38,607][06909] Updated weights for policy 0, policy_version 123363 (0.0029) [2024-06-28 01:54:38,850][06674] Fps is (10 sec: 47523.5, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 2021179392. Throughput: 0: 43874.8. Samples: 1924135820. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 01:54:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:54:40,590][06887] Signal inference workers to stop experience collection... (27350 times) [2024-06-28 01:54:40,590][06887] Signal inference workers to resume experience collection... (27350 times) [2024-06-28 01:54:40,612][06909] InferenceWorker_p0-w0: stopping experience collection (27350 times) [2024-06-28 01:54:40,612][06909] InferenceWorker_p0-w0: resuming experience collection (27350 times) [2024-06-28 01:54:43,142][06909] Updated weights for policy 0, policy_version 123373 (0.0029) [2024-06-28 01:54:43,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2021376000. Throughput: 0: 43852.5. Samples: 1924262000. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 01:54:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 01:54:46,066][06909] Updated weights for policy 0, policy_version 123383 (0.0023) [2024-06-28 01:54:48,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2021621760. Throughput: 0: 43821.7. Samples: 1924527780. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 01:54:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:54:50,520][06909] Updated weights for policy 0, policy_version 123393 (0.0042) [2024-06-28 01:54:53,724][06909] Updated weights for policy 0, policy_version 123403 (0.0044) [2024-06-28 01:54:53,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2021834752. Throughput: 0: 43684.0. Samples: 1924790960. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 01:54:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:54:57,750][06909] Updated weights for policy 0, policy_version 123413 (0.0048) [2024-06-28 01:54:58,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2022031360. Throughput: 0: 43670.2. Samples: 1924919020. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 01:54:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 01:55:01,130][06909] Updated weights for policy 0, policy_version 123423 (0.0029) [2024-06-28 01:55:03,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2022277120. Throughput: 0: 43950.8. Samples: 1925186560. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 01:55:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:55:04,950][06909] Updated weights for policy 0, policy_version 123433 (0.0025) [2024-06-28 01:55:08,391][06909] Updated weights for policy 0, policy_version 123443 (0.0043) [2024-06-28 01:55:08,850][06674] Fps is (10 sec: 49151.6, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 2022522880. Throughput: 0: 43842.7. Samples: 1925451840. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 01:55:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:55:12,182][06909] Updated weights for policy 0, policy_version 123453 (0.0023) [2024-06-28 01:55:13,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2022703104. Throughput: 0: 44078.5. Samples: 1925587380. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 01:55:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:55:15,877][06909] Updated weights for policy 0, policy_version 123463 (0.0027) [2024-06-28 01:55:18,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2022932480. Throughput: 0: 43849.2. Samples: 1925849900. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 01:55:18,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:55:20,284][06909] Updated weights for policy 0, policy_version 123473 (0.0030) [2024-06-28 01:55:23,402][06909] Updated weights for policy 0, policy_version 123483 (0.0034) [2024-06-28 01:55:23,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.9, 300 sec: 44042.4). Total num frames: 2023161856. Throughput: 0: 43888.0. Samples: 1926110780. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 01:55:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:55:27,515][06909] Updated weights for policy 0, policy_version 123493 (0.0031) [2024-06-28 01:55:28,850][06674] Fps is (10 sec: 42599.6, 60 sec: 44238.4, 300 sec: 44042.4). Total num frames: 2023358464. Throughput: 0: 43992.1. Samples: 1926241640. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 01:55:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:55:30,959][06909] Updated weights for policy 0, policy_version 123503 (0.0028) [2024-06-28 01:55:33,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2023587840. Throughput: 0: 43886.2. Samples: 1926502660. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 01:55:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:55:34,855][06909] Updated weights for policy 0, policy_version 123513 (0.0033) [2024-06-28 01:55:38,279][06909] Updated weights for policy 0, policy_version 123523 (0.0031) [2024-06-28 01:55:38,850][06674] Fps is (10 sec: 45874.3, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 2023817216. Throughput: 0: 43912.7. Samples: 1926767040. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 01:55:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:55:42,060][06909] Updated weights for policy 0, policy_version 123533 (0.0037) [2024-06-28 01:55:43,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2024030208. Throughput: 0: 44176.5. Samples: 1926906960. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 01:55:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:55:45,494][06909] Updated weights for policy 0, policy_version 123543 (0.0026) [2024-06-28 01:55:48,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2024243200. Throughput: 0: 44091.1. Samples: 1927170660. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 01:55:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:55:48,900][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000123551_2024259584.pth... [2024-06-28 01:55:48,950][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000122906_2013691904.pth [2024-06-28 01:55:49,379][06909] Updated weights for policy 0, policy_version 123553 (0.0028) [2024-06-28 01:55:53,065][06909] Updated weights for policy 0, policy_version 123563 (0.0042) [2024-06-28 01:55:53,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2024488960. Throughput: 0: 44124.9. Samples: 1927437460. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 01:55:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:55:57,222][06909] Updated weights for policy 0, policy_version 123573 (0.0042) [2024-06-28 01:55:58,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 2024701952. Throughput: 0: 44123.6. Samples: 1927572940. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 01:55:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:55:59,917][06887] Signal inference workers to stop experience collection... (27400 times) [2024-06-28 01:55:59,956][06909] InferenceWorker_p0-w0: stopping experience collection (27400 times) [2024-06-28 01:55:59,963][06887] Signal inference workers to resume experience collection... (27400 times) [2024-06-28 01:55:59,978][06909] InferenceWorker_p0-w0: resuming experience collection (27400 times) [2024-06-28 01:56:00,316][06909] Updated weights for policy 0, policy_version 123583 (0.0033) [2024-06-28 01:56:03,850][06674] Fps is (10 sec: 40960.7, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2024898560. Throughput: 0: 44122.5. Samples: 1927835400. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 01:56:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:56:04,713][06909] Updated weights for policy 0, policy_version 123593 (0.0037) [2024-06-28 01:56:07,846][06909] Updated weights for policy 0, policy_version 123603 (0.0033) [2024-06-28 01:56:08,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.8, 300 sec: 43986.9). Total num frames: 2025144320. Throughput: 0: 44236.5. Samples: 1928101420. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 01:56:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:56:11,921][06909] Updated weights for policy 0, policy_version 123613 (0.0032) [2024-06-28 01:56:13,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2025357312. Throughput: 0: 44425.7. Samples: 1928240800. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 01:56:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:56:15,219][06909] Updated weights for policy 0, policy_version 123623 (0.0028) [2024-06-28 01:56:18,852][06674] Fps is (10 sec: 42589.4, 60 sec: 43962.4, 300 sec: 44042.1). Total num frames: 2025570304. Throughput: 0: 44338.0. Samples: 1928497960. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 01:56:18,852][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 01:56:19,324][06909] Updated weights for policy 0, policy_version 123633 (0.0030) [2024-06-28 01:56:22,482][06909] Updated weights for policy 0, policy_version 123643 (0.0027) [2024-06-28 01:56:23,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2025799680. Throughput: 0: 44384.2. Samples: 1928764320. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 01:56:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:56:26,621][06909] Updated weights for policy 0, policy_version 123653 (0.0038) [2024-06-28 01:56:28,850][06674] Fps is (10 sec: 47523.0, 60 sec: 44782.8, 300 sec: 44097.9). Total num frames: 2026045440. Throughput: 0: 44276.8. Samples: 1928899420. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 01:56:28,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:56:30,095][06909] Updated weights for policy 0, policy_version 123663 (0.0028) [2024-06-28 01:56:33,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.9, 300 sec: 44042.7). Total num frames: 2026242048. Throughput: 0: 44333.8. Samples: 1929165680. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 01:56:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:56:34,165][06909] Updated weights for policy 0, policy_version 123673 (0.0021) [2024-06-28 01:56:37,734][06909] Updated weights for policy 0, policy_version 123683 (0.0029) [2024-06-28 01:56:38,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44509.9, 300 sec: 44098.4). Total num frames: 2026487808. Throughput: 0: 44117.8. Samples: 1929422760. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 01:56:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:56:41,954][06909] Updated weights for policy 0, policy_version 123693 (0.0033) [2024-06-28 01:56:43,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2026668032. Throughput: 0: 44113.2. Samples: 1929558040. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 01:56:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 01:56:44,946][06909] Updated weights for policy 0, policy_version 123703 (0.0040) [2024-06-28 01:56:48,850][06674] Fps is (10 sec: 39322.0, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2026881024. Throughput: 0: 44135.9. Samples: 1929821520. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 01:56:48,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 01:56:49,298][06909] Updated weights for policy 0, policy_version 123713 (0.0041) [2024-06-28 01:56:52,339][06909] Updated weights for policy 0, policy_version 123723 (0.0022) [2024-06-28 01:56:53,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2027126784. Throughput: 0: 44092.9. Samples: 1930085600. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 01:56:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:56:56,769][06909] Updated weights for policy 0, policy_version 123733 (0.0055) [2024-06-28 01:56:58,850][06674] Fps is (10 sec: 47512.9, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2027356160. Throughput: 0: 44044.8. Samples: 1930222820. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 01:56:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:56:59,894][06909] Updated weights for policy 0, policy_version 123743 (0.0031) [2024-06-28 01:57:03,850][06674] Fps is (10 sec: 42597.9, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2027552768. Throughput: 0: 44282.8. Samples: 1930490600. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 01:57:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:57:04,371][06909] Updated weights for policy 0, policy_version 123753 (0.0038) [2024-06-28 01:57:07,323][06909] Updated weights for policy 0, policy_version 123763 (0.0029) [2024-06-28 01:57:08,850][06674] Fps is (10 sec: 44237.4, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2027798528. Throughput: 0: 44099.5. Samples: 1930748800. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 01:57:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:57:11,698][06909] Updated weights for policy 0, policy_version 123773 (0.0031) [2024-06-28 01:57:13,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2028011520. Throughput: 0: 44036.9. Samples: 1930881080. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 01:57:13,852][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 01:57:14,846][06909] Updated weights for policy 0, policy_version 123783 (0.0028) [2024-06-28 01:57:18,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43965.3, 300 sec: 44042.4). Total num frames: 2028208128. Throughput: 0: 43913.8. Samples: 1931141800. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 01:57:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:57:19,098][06909] Updated weights for policy 0, policy_version 123793 (0.0027) [2024-06-28 01:57:22,253][06909] Updated weights for policy 0, policy_version 123803 (0.0024) [2024-06-28 01:57:23,850][06674] Fps is (10 sec: 44237.5, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2028453888. Throughput: 0: 44070.0. Samples: 1931405900. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 01:57:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:57:23,881][06887] Signal inference workers to stop experience collection... (27450 times) [2024-06-28 01:57:23,881][06887] Signal inference workers to resume experience collection... (27450 times) [2024-06-28 01:57:23,920][06909] InferenceWorker_p0-w0: stopping experience collection (27450 times) [2024-06-28 01:57:23,920][06909] InferenceWorker_p0-w0: resuming experience collection (27450 times) [2024-06-28 01:57:26,450][06909] Updated weights for policy 0, policy_version 123813 (0.0024) [2024-06-28 01:57:28,850][06674] Fps is (10 sec: 47513.9, 60 sec: 43963.9, 300 sec: 44098.0). Total num frames: 2028683264. Throughput: 0: 44129.0. Samples: 1931543840. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 01:57:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:57:29,524][06909] Updated weights for policy 0, policy_version 123823 (0.0033) [2024-06-28 01:57:33,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 2028863488. Throughput: 0: 44190.6. Samples: 1931810100. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 01:57:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:57:33,857][06909] Updated weights for policy 0, policy_version 123833 (0.0039) [2024-06-28 01:57:37,146][06909] Updated weights for policy 0, policy_version 123843 (0.0023) [2024-06-28 01:57:38,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2029109248. Throughput: 0: 44173.7. Samples: 1932073420. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 01:57:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:57:41,332][06909] Updated weights for policy 0, policy_version 123853 (0.0037) [2024-06-28 01:57:43,850][06674] Fps is (10 sec: 47513.9, 60 sec: 44510.0, 300 sec: 44098.0). Total num frames: 2029338624. Throughput: 0: 44118.8. Samples: 1932208160. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 01:57:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:57:44,402][06909] Updated weights for policy 0, policy_version 123863 (0.0022) [2024-06-28 01:57:48,680][06909] Updated weights for policy 0, policy_version 123873 (0.0034) [2024-06-28 01:57:48,855][06674] Fps is (10 sec: 42576.8, 60 sec: 44233.0, 300 sec: 44041.6). Total num frames: 2029535232. Throughput: 0: 43829.3. Samples: 1932463140. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 01:57:48,855][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:57:48,876][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000123873_2029535232.pth... [2024-06-28 01:57:48,925][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000123229_2018983936.pth [2024-06-28 01:57:52,211][06909] Updated weights for policy 0, policy_version 123883 (0.0025) [2024-06-28 01:57:53,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 44098.3). Total num frames: 2029764608. Throughput: 0: 43969.8. Samples: 1932727440. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 01:57:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:57:56,276][06909] Updated weights for policy 0, policy_version 123893 (0.0021) [2024-06-28 01:57:58,850][06674] Fps is (10 sec: 47538.1, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2030010368. Throughput: 0: 44111.2. Samples: 1932866080. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 01:57:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:57:59,370][06909] Updated weights for policy 0, policy_version 123903 (0.0032) [2024-06-28 01:58:03,623][06909] Updated weights for policy 0, policy_version 123913 (0.0035) [2024-06-28 01:58:03,850][06674] Fps is (10 sec: 42597.7, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2030190592. Throughput: 0: 44168.3. Samples: 1933129380. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 01:58:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:58:06,540][06909] Updated weights for policy 0, policy_version 123923 (0.0036) [2024-06-28 01:58:08,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2030436352. Throughput: 0: 44124.7. Samples: 1933391520. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 01:58:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:58:11,163][06909] Updated weights for policy 0, policy_version 123933 (0.0039) [2024-06-28 01:58:13,850][06674] Fps is (10 sec: 47514.3, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2030665728. Throughput: 0: 44180.8. Samples: 1933531980. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 01:58:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:58:14,418][06909] Updated weights for policy 0, policy_version 123943 (0.0044) [2024-06-28 01:58:18,389][06909] Updated weights for policy 0, policy_version 123953 (0.0035) [2024-06-28 01:58:18,850][06674] Fps is (10 sec: 40958.5, 60 sec: 43963.4, 300 sec: 43986.8). Total num frames: 2030845952. Throughput: 0: 44242.2. Samples: 1933801020. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 01:58:18,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:58:21,542][06909] Updated weights for policy 0, policy_version 123963 (0.0031) [2024-06-28 01:58:23,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2031091712. Throughput: 0: 44009.0. Samples: 1934053820. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 01:58:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:58:25,906][06909] Updated weights for policy 0, policy_version 123973 (0.0024) [2024-06-28 01:58:28,850][06674] Fps is (10 sec: 47515.1, 60 sec: 43963.6, 300 sec: 44097.9). Total num frames: 2031321088. Throughput: 0: 44013.6. Samples: 1934188780. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 01:58:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:58:29,211][06909] Updated weights for policy 0, policy_version 123983 (0.0031) [2024-06-28 01:58:33,166][06909] Updated weights for policy 0, policy_version 123993 (0.0029) [2024-06-28 01:58:33,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 2031534080. Throughput: 0: 44298.8. Samples: 1934456360. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 01:58:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:58:36,512][06909] Updated weights for policy 0, policy_version 124003 (0.0031) [2024-06-28 01:58:38,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2031747072. Throughput: 0: 44142.2. Samples: 1934713840. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 01:58:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:58:40,659][06909] Updated weights for policy 0, policy_version 124013 (0.0034) [2024-06-28 01:58:43,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2031976448. Throughput: 0: 44074.2. Samples: 1934849420. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 01:58:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:58:43,891][06909] Updated weights for policy 0, policy_version 124023 (0.0028) [2024-06-28 01:58:48,077][06909] Updated weights for policy 0, policy_version 124033 (0.0032) [2024-06-28 01:58:48,856][06674] Fps is (10 sec: 44211.2, 60 sec: 44236.3, 300 sec: 44041.6). Total num frames: 2032189440. Throughput: 0: 44181.6. Samples: 1935117800. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 01:58:48,856][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:58:51,441][06909] Updated weights for policy 0, policy_version 124043 (0.0028) [2024-06-28 01:58:53,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 2032402432. Throughput: 0: 44142.7. Samples: 1935377940. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 01:58:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:58:55,403][06909] Updated weights for policy 0, policy_version 124053 (0.0031) [2024-06-28 01:58:58,772][06909] Updated weights for policy 0, policy_version 124063 (0.0037) [2024-06-28 01:58:58,856][06674] Fps is (10 sec: 45874.0, 60 sec: 43959.3, 300 sec: 44152.6). Total num frames: 2032648192. Throughput: 0: 43948.7. Samples: 1935509940. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 01:58:58,856][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:59:03,159][06909] Updated weights for policy 0, policy_version 124073 (0.0039) [2024-06-28 01:59:03,800][06887] Signal inference workers to stop experience collection... (27500 times) [2024-06-28 01:59:03,800][06887] Signal inference workers to resume experience collection... (27500 times) [2024-06-28 01:59:03,849][06909] InferenceWorker_p0-w0: stopping experience collection (27500 times) [2024-06-28 01:59:03,849][06909] InferenceWorker_p0-w0: resuming experience collection (27500 times) [2024-06-28 01:59:03,852][06674] Fps is (10 sec: 44227.2, 60 sec: 44235.2, 300 sec: 43986.5). Total num frames: 2032844800. Throughput: 0: 43742.3. Samples: 1935769500. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 01:59:03,853][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:59:06,620][06909] Updated weights for policy 0, policy_version 124083 (0.0032) [2024-06-28 01:59:08,850][06674] Fps is (10 sec: 40984.1, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2033057792. Throughput: 0: 43932.7. Samples: 1936030800. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 01:59:08,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:59:10,516][06909] Updated weights for policy 0, policy_version 124093 (0.0038) [2024-06-28 01:59:13,853][06674] Fps is (10 sec: 44230.8, 60 sec: 43688.0, 300 sec: 44041.9). Total num frames: 2033287168. Throughput: 0: 43838.0. Samples: 1936161640. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 01:59:13,854][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:59:14,037][06909] Updated weights for policy 0, policy_version 124103 (0.0039) [2024-06-28 01:59:17,935][06909] Updated weights for policy 0, policy_version 124113 (0.0030) [2024-06-28 01:59:18,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44510.1, 300 sec: 44042.4). Total num frames: 2033516544. Throughput: 0: 44042.6. Samples: 1936438280. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 01:59:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 01:59:21,272][06909] Updated weights for policy 0, policy_version 124123 (0.0043) [2024-06-28 01:59:23,850][06674] Fps is (10 sec: 44252.9, 60 sec: 43963.7, 300 sec: 44153.8). Total num frames: 2033729536. Throughput: 0: 44026.7. Samples: 1936695040. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 01:59:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 01:59:25,367][06909] Updated weights for policy 0, policy_version 124133 (0.0032) [2024-06-28 01:59:28,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2033942528. Throughput: 0: 43983.1. Samples: 1936828660. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 01:59:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 01:59:28,892][06909] Updated weights for policy 0, policy_version 124143 (0.0026) [2024-06-28 01:59:32,535][06909] Updated weights for policy 0, policy_version 124153 (0.0036) [2024-06-28 01:59:33,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2034171904. Throughput: 0: 43961.7. Samples: 1937095820. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 01:59:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:59:36,035][06909] Updated weights for policy 0, policy_version 124163 (0.0026) [2024-06-28 01:59:38,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43963.6, 300 sec: 44097.9). Total num frames: 2034384896. Throughput: 0: 43978.5. Samples: 1937356980. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 01:59:38,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 01:59:40,281][06909] Updated weights for policy 0, policy_version 124173 (0.0036) [2024-06-28 01:59:43,663][06909] Updated weights for policy 0, policy_version 124183 (0.0039) [2024-06-28 01:59:43,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2034614272. Throughput: 0: 43918.4. Samples: 1937486000. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 01:59:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 01:59:47,464][06909] Updated weights for policy 0, policy_version 124193 (0.0027) [2024-06-28 01:59:48,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44241.0, 300 sec: 44097.9). Total num frames: 2034843648. Throughput: 0: 44148.3. Samples: 1937756080. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 01:59:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:59:48,861][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000124197_2034843648.pth... [2024-06-28 01:59:48,924][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000123551_2024259584.pth [2024-06-28 01:59:51,093][06909] Updated weights for policy 0, policy_version 124203 (0.0031) [2024-06-28 01:59:53,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2035056640. Throughput: 0: 44254.8. Samples: 1938022260. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 01:59:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 01:59:55,214][06909] Updated weights for policy 0, policy_version 124213 (0.0038) [2024-06-28 01:59:58,566][06909] Updated weights for policy 0, policy_version 124223 (0.0031) [2024-06-28 01:59:58,850][06674] Fps is (10 sec: 44237.7, 60 sec: 43968.2, 300 sec: 44098.0). Total num frames: 2035286016. Throughput: 0: 44140.1. Samples: 1938147780. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 01:59:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:00:02,413][06909] Updated weights for policy 0, policy_version 124233 (0.0031) [2024-06-28 02:00:03,850][06674] Fps is (10 sec: 44237.3, 60 sec: 44238.5, 300 sec: 43986.9). Total num frames: 2035499008. Throughput: 0: 44051.3. Samples: 1938420580. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 02:00:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:00:06,006][06909] Updated weights for policy 0, policy_version 124243 (0.0027) [2024-06-28 02:00:08,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43963.9, 300 sec: 44042.4). Total num frames: 2035695616. Throughput: 0: 44176.4. Samples: 1938682980. Policy #0 lag: (min: 1.0, avg: 11.0, max: 21.0) [2024-06-28 02:00:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:00:09,806][06909] Updated weights for policy 0, policy_version 124253 (0.0030) [2024-06-28 02:00:13,544][06909] Updated weights for policy 0, policy_version 124263 (0.0036) [2024-06-28 02:00:13,794][06887] Signal inference workers to stop experience collection... (27550 times) [2024-06-28 02:00:13,798][06887] Signal inference workers to resume experience collection... (27550 times) [2024-06-28 02:00:13,845][06909] InferenceWorker_p0-w0: stopping experience collection (27550 times) [2024-06-28 02:00:13,846][06909] InferenceWorker_p0-w0: resuming experience collection (27550 times) [2024-06-28 02:00:13,852][06674] Fps is (10 sec: 44227.8, 60 sec: 44238.0, 300 sec: 44097.7). Total num frames: 2035941376. Throughput: 0: 44138.5. Samples: 1938814980. Policy #0 lag: (min: 1.0, avg: 11.0, max: 21.0) [2024-06-28 02:00:13,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:00:17,367][06909] Updated weights for policy 0, policy_version 124273 (0.0033) [2024-06-28 02:00:18,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2036154368. Throughput: 0: 44029.3. Samples: 1939077140. Policy #0 lag: (min: 1.0, avg: 11.0, max: 21.0) [2024-06-28 02:00:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:00:20,836][06909] Updated weights for policy 0, policy_version 124283 (0.0035) [2024-06-28 02:00:23,852][06674] Fps is (10 sec: 42598.2, 60 sec: 43962.2, 300 sec: 44097.6). Total num frames: 2036367360. Throughput: 0: 44176.4. Samples: 1939345000. Policy #0 lag: (min: 1.0, avg: 11.0, max: 21.0) [2024-06-28 02:00:23,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:00:24,767][06909] Updated weights for policy 0, policy_version 124293 (0.0030) [2024-06-28 02:00:28,275][06909] Updated weights for policy 0, policy_version 124303 (0.0039) [2024-06-28 02:00:28,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2036596736. Throughput: 0: 44197.7. Samples: 1939474900. Policy #0 lag: (min: 1.0, avg: 11.0, max: 21.0) [2024-06-28 02:00:28,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:00:32,189][06909] Updated weights for policy 0, policy_version 124313 (0.0041) [2024-06-28 02:00:33,850][06674] Fps is (10 sec: 44245.7, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2036809728. Throughput: 0: 44041.8. Samples: 1939737960. Policy #0 lag: (min: 1.0, avg: 11.0, max: 21.0) [2024-06-28 02:00:33,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:00:35,812][06909] Updated weights for policy 0, policy_version 124323 (0.0039) [2024-06-28 02:00:38,852][06674] Fps is (10 sec: 42589.8, 60 sec: 43962.4, 300 sec: 44042.1). Total num frames: 2037022720. Throughput: 0: 44016.7. Samples: 1940003100. Policy #0 lag: (min: 1.0, avg: 11.0, max: 21.0) [2024-06-28 02:00:38,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:00:39,358][06909] Updated weights for policy 0, policy_version 124333 (0.0040) [2024-06-28 02:00:43,035][06909] Updated weights for policy 0, policy_version 124343 (0.0035) [2024-06-28 02:00:43,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2037268480. Throughput: 0: 44263.0. Samples: 1940139620. Policy #0 lag: (min: 1.0, avg: 11.0, max: 21.0) [2024-06-28 02:00:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:00:46,923][06909] Updated weights for policy 0, policy_version 124353 (0.0042) [2024-06-28 02:00:48,850][06674] Fps is (10 sec: 45884.4, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2037481472. Throughput: 0: 43871.0. Samples: 1940394780. Policy #0 lag: (min: 1.0, avg: 11.0, max: 21.0) [2024-06-28 02:00:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:00:50,607][06909] Updated weights for policy 0, policy_version 124363 (0.0030) [2024-06-28 02:00:53,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2037694464. Throughput: 0: 43982.6. Samples: 1940662200. Policy #0 lag: (min: 1.0, avg: 11.0, max: 21.0) [2024-06-28 02:00:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:00:54,509][06909] Updated weights for policy 0, policy_version 124373 (0.0036) [2024-06-28 02:00:57,983][06909] Updated weights for policy 0, policy_version 124383 (0.0030) [2024-06-28 02:00:58,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43963.5, 300 sec: 44153.5). Total num frames: 2037923840. Throughput: 0: 43925.8. Samples: 1940791560. Policy #0 lag: (min: 1.0, avg: 11.0, max: 21.0) [2024-06-28 02:00:58,859][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:01:02,138][06909] Updated weights for policy 0, policy_version 124393 (0.0038) [2024-06-28 02:01:03,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2038136832. Throughput: 0: 43944.0. Samples: 1941054620. Policy #0 lag: (min: 1.0, avg: 11.0, max: 21.0) [2024-06-28 02:01:03,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 02:01:05,408][06909] Updated weights for policy 0, policy_version 124403 (0.0035) [2024-06-28 02:01:08,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.6, 300 sec: 44042.4). Total num frames: 2038349824. Throughput: 0: 43859.2. Samples: 1941318580. Policy #0 lag: (min: 1.0, avg: 11.0, max: 21.0) [2024-06-28 02:01:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:01:09,424][06909] Updated weights for policy 0, policy_version 124413 (0.0027) [2024-06-28 02:01:13,255][06909] Updated weights for policy 0, policy_version 124423 (0.0041) [2024-06-28 02:01:13,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43965.2, 300 sec: 44098.3). Total num frames: 2038579200. Throughput: 0: 43900.0. Samples: 1941450400. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 02:01:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 02:01:16,630][06909] Updated weights for policy 0, policy_version 124433 (0.0027) [2024-06-28 02:01:18,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 2038792192. Throughput: 0: 43918.6. Samples: 1941714300. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 02:01:18,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:01:20,456][06909] Updated weights for policy 0, policy_version 124443 (0.0023) [2024-06-28 02:01:23,851][06674] Fps is (10 sec: 44231.8, 60 sec: 44237.5, 300 sec: 43986.7). Total num frames: 2039021568. Throughput: 0: 43972.0. Samples: 1941981800. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 02:01:23,852][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 02:01:24,022][06909] Updated weights for policy 0, policy_version 124453 (0.0040) [2024-06-28 02:01:27,831][06909] Updated weights for policy 0, policy_version 124463 (0.0027) [2024-06-28 02:01:28,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2039234560. Throughput: 0: 43912.0. Samples: 1942115660. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 02:01:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:01:31,417][06909] Updated weights for policy 0, policy_version 124473 (0.0025) [2024-06-28 02:01:33,850][06674] Fps is (10 sec: 44242.3, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 2039463936. Throughput: 0: 44189.4. Samples: 1942383300. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 02:01:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:01:35,086][06909] Updated weights for policy 0, policy_version 124483 (0.0035) [2024-06-28 02:01:38,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44238.3, 300 sec: 44098.0). Total num frames: 2039676928. Throughput: 0: 44140.9. Samples: 1942648540. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 02:01:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:01:38,951][06909] Updated weights for policy 0, policy_version 124493 (0.0040) [2024-06-28 02:01:39,240][06887] Signal inference workers to stop experience collection... (27600 times) [2024-06-28 02:01:39,241][06887] Signal inference workers to resume experience collection... (27600 times) [2024-06-28 02:01:39,262][06909] InferenceWorker_p0-w0: stopping experience collection (27600 times) [2024-06-28 02:01:39,262][06909] InferenceWorker_p0-w0: resuming experience collection (27600 times) [2024-06-28 02:01:42,323][06909] Updated weights for policy 0, policy_version 124503 (0.0032) [2024-06-28 02:01:43,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2039906304. Throughput: 0: 44075.6. Samples: 1942774960. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 02:01:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:01:46,403][06909] Updated weights for policy 0, policy_version 124513 (0.0042) [2024-06-28 02:01:48,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2040119296. Throughput: 0: 44064.0. Samples: 1943037500. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 02:01:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:01:48,952][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000124520_2040135680.pth... [2024-06-28 02:01:49,003][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000123873_2029535232.pth [2024-06-28 02:01:50,537][06909] Updated weights for policy 0, policy_version 124523 (0.0037) [2024-06-28 02:01:53,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2040332288. Throughput: 0: 44025.1. Samples: 1943299700. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 02:01:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:01:53,910][06909] Updated weights for policy 0, policy_version 124533 (0.0032) [2024-06-28 02:01:57,881][06909] Updated weights for policy 0, policy_version 124543 (0.0031) [2024-06-28 02:01:58,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2040561664. Throughput: 0: 44076.4. Samples: 1943433840. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 02:01:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:02:01,171][06909] Updated weights for policy 0, policy_version 124553 (0.0038) [2024-06-28 02:02:03,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2040774656. Throughput: 0: 44051.2. Samples: 1943696600. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 02:02:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:02:05,268][06909] Updated weights for policy 0, policy_version 124563 (0.0038) [2024-06-28 02:02:08,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.9, 300 sec: 43986.9). Total num frames: 2040987648. Throughput: 0: 44037.2. Samples: 1943963420. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 02:02:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:02:08,956][06909] Updated weights for policy 0, policy_version 124573 (0.0033) [2024-06-28 02:02:12,458][06909] Updated weights for policy 0, policy_version 124583 (0.0034) [2024-06-28 02:02:13,852][06674] Fps is (10 sec: 45863.9, 60 sec: 44235.0, 300 sec: 44153.1). Total num frames: 2041233408. Throughput: 0: 43871.9. Samples: 1944090000. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 02:02:13,853][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:02:16,335][06909] Updated weights for policy 0, policy_version 124593 (0.0025) [2024-06-28 02:02:18,852][06674] Fps is (10 sec: 44227.5, 60 sec: 43962.3, 300 sec: 43986.6). Total num frames: 2041430016. Throughput: 0: 43925.9. Samples: 1944360060. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 02:02:18,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:02:19,947][06909] Updated weights for policy 0, policy_version 124603 (0.0037) [2024-06-28 02:02:23,482][06909] Updated weights for policy 0, policy_version 124613 (0.0028) [2024-06-28 02:02:23,850][06674] Fps is (10 sec: 44248.0, 60 sec: 44237.7, 300 sec: 44042.4). Total num frames: 2041675776. Throughput: 0: 44041.9. Samples: 1944630420. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 02:02:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:02:27,514][06909] Updated weights for policy 0, policy_version 124623 (0.0026) [2024-06-28 02:02:28,850][06674] Fps is (10 sec: 45884.3, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2041888768. Throughput: 0: 44172.4. Samples: 1944762720. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 02:02:28,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:02:30,948][06909] Updated weights for policy 0, policy_version 124633 (0.0028) [2024-06-28 02:02:33,852][06674] Fps is (10 sec: 42589.5, 60 sec: 43962.2, 300 sec: 44042.1). Total num frames: 2042101760. Throughput: 0: 44047.8. Samples: 1945019740. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 02:02:33,853][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 02:02:34,821][06909] Updated weights for policy 0, policy_version 124643 (0.0032) [2024-06-28 02:02:38,337][06909] Updated weights for policy 0, policy_version 124653 (0.0042) [2024-06-28 02:02:38,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2042331136. Throughput: 0: 44182.6. Samples: 1945287920. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 02:02:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:02:42,043][06909] Updated weights for policy 0, policy_version 124663 (0.0023) [2024-06-28 02:02:43,850][06674] Fps is (10 sec: 44245.8, 60 sec: 43963.8, 300 sec: 44098.7). Total num frames: 2042544128. Throughput: 0: 44332.0. Samples: 1945428780. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 02:02:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:02:45,609][06909] Updated weights for policy 0, policy_version 124673 (0.0029) [2024-06-28 02:02:48,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2042757120. Throughput: 0: 44333.4. Samples: 1945691600. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 02:02:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:02:49,401][06909] Updated weights for policy 0, policy_version 124683 (0.0033) [2024-06-28 02:02:53,183][06909] Updated weights for policy 0, policy_version 124693 (0.0032) [2024-06-28 02:02:53,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 2043002880. Throughput: 0: 44315.4. Samples: 1945957620. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 02:02:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:02:57,036][06909] Updated weights for policy 0, policy_version 124703 (0.0034) [2024-06-28 02:02:58,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2043215872. Throughput: 0: 44427.4. Samples: 1946089120. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 02:02:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:03:00,723][06909] Updated weights for policy 0, policy_version 124713 (0.0037) [2024-06-28 02:03:03,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2043428864. Throughput: 0: 44349.1. Samples: 1946355680. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 02:03:03,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:03:04,491][06909] Updated weights for policy 0, policy_version 124723 (0.0043) [2024-06-28 02:03:08,141][06909] Updated weights for policy 0, policy_version 124733 (0.0031) [2024-06-28 02:03:08,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 2043658240. Throughput: 0: 44167.1. Samples: 1946617940. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 02:03:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:03:11,721][06909] Updated weights for policy 0, policy_version 124743 (0.0036) [2024-06-28 02:03:13,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43692.5, 300 sec: 44098.0). Total num frames: 2043854848. Throughput: 0: 44006.7. Samples: 1946743020. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-28 02:03:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:03:15,428][06909] Updated weights for policy 0, policy_version 124753 (0.0044) [2024-06-28 02:03:18,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44511.4, 300 sec: 44097.9). Total num frames: 2044100608. Throughput: 0: 44226.5. Samples: 1947009840. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-28 02:03:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:03:18,985][06909] Updated weights for policy 0, policy_version 124763 (0.0030) [2024-06-28 02:03:23,107][06909] Updated weights for policy 0, policy_version 124773 (0.0031) [2024-06-28 02:03:23,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 2044297216. Throughput: 0: 44181.8. Samples: 1947276100. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-28 02:03:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:03:26,602][06909] Updated weights for policy 0, policy_version 124783 (0.0028) [2024-06-28 02:03:28,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2044526592. Throughput: 0: 44049.8. Samples: 1947411020. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-28 02:03:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:03:30,298][06909] Updated weights for policy 0, policy_version 124793 (0.0034) [2024-06-28 02:03:31,590][06887] Signal inference workers to stop experience collection... (27650 times) [2024-06-28 02:03:31,590][06887] Signal inference workers to resume experience collection... (27650 times) [2024-06-28 02:03:31,632][06909] InferenceWorker_p0-w0: stopping experience collection (27650 times) [2024-06-28 02:03:31,636][06909] InferenceWorker_p0-w0: resuming experience collection (27650 times) [2024-06-28 02:03:33,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44238.3, 300 sec: 44097.9). Total num frames: 2044755968. Throughput: 0: 44086.6. Samples: 1947675500. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-28 02:03:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:03:34,449][06909] Updated weights for policy 0, policy_version 124803 (0.0048) [2024-06-28 02:03:37,817][06909] Updated weights for policy 0, policy_version 124813 (0.0033) [2024-06-28 02:03:38,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2044968960. Throughput: 0: 44121.8. Samples: 1947943100. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-28 02:03:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 02:03:41,737][06909] Updated weights for policy 0, policy_version 124823 (0.0038) [2024-06-28 02:03:43,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43963.6, 300 sec: 44043.3). Total num frames: 2045181952. Throughput: 0: 44067.9. Samples: 1948072180. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-28 02:03:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:03:45,103][06909] Updated weights for policy 0, policy_version 124833 (0.0037) [2024-06-28 02:03:48,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.7, 300 sec: 44098.0). Total num frames: 2045411328. Throughput: 0: 44083.1. Samples: 1948339420. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-28 02:03:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:03:48,914][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000124843_2045427712.pth... [2024-06-28 02:03:48,916][06909] Updated weights for policy 0, policy_version 124843 (0.0031) [2024-06-28 02:03:48,973][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000124197_2034843648.pth [2024-06-28 02:03:52,290][06909] Updated weights for policy 0, policy_version 124853 (0.0029) [2024-06-28 02:03:53,850][06674] Fps is (10 sec: 47514.0, 60 sec: 44236.8, 300 sec: 44098.8). Total num frames: 2045657088. Throughput: 0: 44096.8. Samples: 1948602300. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-28 02:03:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:03:56,298][06909] Updated weights for policy 0, policy_version 124863 (0.0022) [2024-06-28 02:03:58,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.8, 300 sec: 44153.8). Total num frames: 2045870080. Throughput: 0: 44335.1. Samples: 1948738100. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-28 02:03:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:03:59,837][06909] Updated weights for policy 0, policy_version 124873 (0.0039) [2024-06-28 02:04:03,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2046066688. Throughput: 0: 44453.8. Samples: 1949010260. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-28 02:04:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:04:04,037][06909] Updated weights for policy 0, policy_version 124883 (0.0032) [2024-06-28 02:04:07,126][06909] Updated weights for policy 0, policy_version 124893 (0.0036) [2024-06-28 02:04:08,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.7, 300 sec: 44154.0). Total num frames: 2046312448. Throughput: 0: 44420.3. Samples: 1949275020. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-28 02:04:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:04:11,348][06909] Updated weights for policy 0, policy_version 124903 (0.0034) [2024-06-28 02:04:13,850][06674] Fps is (10 sec: 47513.8, 60 sec: 44783.0, 300 sec: 44153.5). Total num frames: 2046541824. Throughput: 0: 44421.8. Samples: 1949410000. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-28 02:04:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:04:14,693][06909] Updated weights for policy 0, policy_version 124913 (0.0035) [2024-06-28 02:04:18,637][06909] Updated weights for policy 0, policy_version 124923 (0.0029) [2024-06-28 02:04:18,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2046738432. Throughput: 0: 44307.9. Samples: 1949669360. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 02:04:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:04:22,163][06909] Updated weights for policy 0, policy_version 124933 (0.0039) [2024-06-28 02:04:23,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44783.0, 300 sec: 44209.0). Total num frames: 2046984192. Throughput: 0: 44001.0. Samples: 1949923140. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 02:04:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:04:25,979][06909] Updated weights for policy 0, policy_version 124943 (0.0043) [2024-06-28 02:04:28,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 2047197184. Throughput: 0: 44294.8. Samples: 1950065440. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 02:04:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:04:29,594][06909] Updated weights for policy 0, policy_version 124953 (0.0033) [2024-06-28 02:04:33,434][06909] Updated weights for policy 0, policy_version 124963 (0.0035) [2024-06-28 02:04:33,850][06674] Fps is (10 sec: 42598.0, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2047410176. Throughput: 0: 44352.5. Samples: 1950335280. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 02:04:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:04:37,000][06909] Updated weights for policy 0, policy_version 124973 (0.0027) [2024-06-28 02:04:38,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 2047639552. Throughput: 0: 44270.6. Samples: 1950594480. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 02:04:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:04:40,957][06909] Updated weights for policy 0, policy_version 124983 (0.0029) [2024-06-28 02:04:43,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44510.0, 300 sec: 44098.0). Total num frames: 2047852544. Throughput: 0: 44260.5. Samples: 1950729820. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 02:04:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 02:04:44,213][06909] Updated weights for policy 0, policy_version 124993 (0.0038) [2024-06-28 02:04:48,242][06909] Updated weights for policy 0, policy_version 125003 (0.0031) [2024-06-28 02:04:48,850][06674] Fps is (10 sec: 44237.5, 60 sec: 44510.0, 300 sec: 44153.5). Total num frames: 2048081920. Throughput: 0: 44218.3. Samples: 1951000080. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 02:04:48,850][06674] Avg episode reward: [(0, '0.402')] [2024-06-28 02:04:51,449][06909] Updated weights for policy 0, policy_version 125013 (0.0050) [2024-06-28 02:04:53,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2048294912. Throughput: 0: 44061.3. Samples: 1951257780. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 02:04:53,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:04:55,882][06909] Updated weights for policy 0, policy_version 125023 (0.0037) [2024-06-28 02:04:58,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2048524288. Throughput: 0: 43996.8. Samples: 1951389860. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 02:04:58,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-28 02:04:59,235][06909] Updated weights for policy 0, policy_version 125033 (0.0040) [2024-06-28 02:05:03,229][06909] Updated weights for policy 0, policy_version 125043 (0.0039) [2024-06-28 02:05:03,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2048720896. Throughput: 0: 44146.3. Samples: 1951655940. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 02:05:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:05:06,727][06909] Updated weights for policy 0, policy_version 125053 (0.0037) [2024-06-28 02:05:08,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.8, 300 sec: 44098.2). Total num frames: 2048950272. Throughput: 0: 44340.8. Samples: 1951918480. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 02:05:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:05:10,280][06887] Signal inference workers to stop experience collection... (27700 times) [2024-06-28 02:05:10,329][06909] InferenceWorker_p0-w0: stopping experience collection (27700 times) [2024-06-28 02:05:10,393][06887] Signal inference workers to resume experience collection... (27700 times) [2024-06-28 02:05:10,394][06909] InferenceWorker_p0-w0: resuming experience collection (27700 times) [2024-06-28 02:05:10,535][06909] Updated weights for policy 0, policy_version 125063 (0.0023) [2024-06-28 02:05:13,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2049179648. Throughput: 0: 44235.2. Samples: 1952056020. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 02:05:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:05:13,981][06909] Updated weights for policy 0, policy_version 125073 (0.0027) [2024-06-28 02:05:17,978][06909] Updated weights for policy 0, policy_version 125083 (0.0043) [2024-06-28 02:05:18,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.8, 300 sec: 44098.3). Total num frames: 2049376256. Throughput: 0: 44062.7. Samples: 1952318100. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 02:05:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:05:21,632][06909] Updated weights for policy 0, policy_version 125093 (0.0025) [2024-06-28 02:05:23,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43963.6, 300 sec: 44153.5). Total num frames: 2049622016. Throughput: 0: 44041.3. Samples: 1952576340. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 02:05:23,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:05:25,599][06909] Updated weights for policy 0, policy_version 125103 (0.0031) [2024-06-28 02:05:28,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43690.6, 300 sec: 44097.9). Total num frames: 2049818624. Throughput: 0: 44081.3. Samples: 1952713480. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 02:05:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:05:29,043][06909] Updated weights for policy 0, policy_version 125113 (0.0023) [2024-06-28 02:05:33,002][06909] Updated weights for policy 0, policy_version 125123 (0.0038) [2024-06-28 02:05:33,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.6, 300 sec: 44098.2). Total num frames: 2050031616. Throughput: 0: 43904.2. Samples: 1952975780. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 02:05:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:05:36,501][06909] Updated weights for policy 0, policy_version 125133 (0.0032) [2024-06-28 02:05:38,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2050260992. Throughput: 0: 43863.6. Samples: 1953231640. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 02:05:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:05:40,439][06909] Updated weights for policy 0, policy_version 125143 (0.0035) [2024-06-28 02:05:43,850][06674] Fps is (10 sec: 45876.2, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2050490368. Throughput: 0: 44111.7. Samples: 1953374880. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 02:05:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:05:43,891][06909] Updated weights for policy 0, policy_version 125153 (0.0025) [2024-06-28 02:05:47,921][06909] Updated weights for policy 0, policy_version 125163 (0.0036) [2024-06-28 02:05:48,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43417.5, 300 sec: 44042.4). Total num frames: 2050686976. Throughput: 0: 44044.0. Samples: 1953637920. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 02:05:48,853][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:05:48,903][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000125165_2050703360.pth... [2024-06-28 02:05:48,958][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000124520_2040135680.pth [2024-06-28 02:05:51,245][06909] Updated weights for policy 0, policy_version 125173 (0.0031) [2024-06-28 02:05:53,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2050932736. Throughput: 0: 44014.7. Samples: 1953899140. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 02:05:53,856][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:05:55,469][06909] Updated weights for policy 0, policy_version 125183 (0.0026) [2024-06-28 02:05:58,501][06909] Updated weights for policy 0, policy_version 125193 (0.0030) [2024-06-28 02:05:58,850][06674] Fps is (10 sec: 49152.5, 60 sec: 44236.9, 300 sec: 44209.0). Total num frames: 2051178496. Throughput: 0: 44062.2. Samples: 1954038820. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 02:05:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:06:02,628][06909] Updated weights for policy 0, policy_version 125203 (0.0022) [2024-06-28 02:06:03,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2051358720. Throughput: 0: 44092.9. Samples: 1954302280. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 02:06:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:06:06,110][06909] Updated weights for policy 0, policy_version 125213 (0.0026) [2024-06-28 02:06:08,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2051588096. Throughput: 0: 44107.2. Samples: 1954561160. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 02:06:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:06:10,149][06909] Updated weights for policy 0, policy_version 125223 (0.0048) [2024-06-28 02:06:13,601][06909] Updated weights for policy 0, policy_version 125233 (0.0036) [2024-06-28 02:06:13,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2051817472. Throughput: 0: 44050.3. Samples: 1954695740. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 02:06:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:06:17,467][06909] Updated weights for policy 0, policy_version 125243 (0.0028) [2024-06-28 02:06:18,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 44042.6). Total num frames: 2052014080. Throughput: 0: 44032.1. Samples: 1954957220. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 02:06:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:06:20,986][06909] Updated weights for policy 0, policy_version 125253 (0.0042) [2024-06-28 02:06:23,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2052259840. Throughput: 0: 44340.0. Samples: 1955226940. Policy #0 lag: (min: 1.0, avg: 9.4, max: 20.0) [2024-06-28 02:06:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:06:24,687][06909] Updated weights for policy 0, policy_version 125263 (0.0029) [2024-06-28 02:06:28,480][06909] Updated weights for policy 0, policy_version 125273 (0.0034) [2024-06-28 02:06:28,850][06674] Fps is (10 sec: 49152.0, 60 sec: 44783.0, 300 sec: 44209.0). Total num frames: 2052505600. Throughput: 0: 44174.1. Samples: 1955362720. Policy #0 lag: (min: 1.0, avg: 9.4, max: 20.0) [2024-06-28 02:06:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:06:32,665][06909] Updated weights for policy 0, policy_version 125283 (0.0041) [2024-06-28 02:06:33,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2052685824. Throughput: 0: 44105.4. Samples: 1955622660. Policy #0 lag: (min: 1.0, avg: 9.4, max: 20.0) [2024-06-28 02:06:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:06:35,899][06909] Updated weights for policy 0, policy_version 125293 (0.0037) [2024-06-28 02:06:36,148][06887] Signal inference workers to stop experience collection... (27750 times) [2024-06-28 02:06:36,188][06909] InferenceWorker_p0-w0: stopping experience collection (27750 times) [2024-06-28 02:06:36,204][06887] Signal inference workers to resume experience collection... (27750 times) [2024-06-28 02:06:36,207][06909] InferenceWorker_p0-w0: resuming experience collection (27750 times) [2024-06-28 02:06:38,850][06674] Fps is (10 sec: 40960.5, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2052915200. Throughput: 0: 44078.8. Samples: 1955882680. Policy #0 lag: (min: 1.0, avg: 9.4, max: 20.0) [2024-06-28 02:06:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 02:06:39,886][06909] Updated weights for policy 0, policy_version 125303 (0.0034) [2024-06-28 02:06:43,516][06909] Updated weights for policy 0, policy_version 125313 (0.0040) [2024-06-28 02:06:43,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44509.8, 300 sec: 44209.0). Total num frames: 2053160960. Throughput: 0: 43865.7. Samples: 1956012780. Policy #0 lag: (min: 1.0, avg: 9.4, max: 20.0) [2024-06-28 02:06:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:06:47,434][06909] Updated weights for policy 0, policy_version 125323 (0.0040) [2024-06-28 02:06:48,850][06674] Fps is (10 sec: 44236.1, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 2053357568. Throughput: 0: 43934.6. Samples: 1956279340. Policy #0 lag: (min: 1.0, avg: 9.4, max: 20.0) [2024-06-28 02:06:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:06:51,011][06909] Updated weights for policy 0, policy_version 125333 (0.0043) [2024-06-28 02:06:53,852][06674] Fps is (10 sec: 40951.8, 60 sec: 43962.2, 300 sec: 44097.7). Total num frames: 2053570560. Throughput: 0: 43911.8. Samples: 1956537280. Policy #0 lag: (min: 1.0, avg: 9.4, max: 20.0) [2024-06-28 02:06:53,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:06:54,711][06909] Updated weights for policy 0, policy_version 125343 (0.0027) [2024-06-28 02:06:58,498][06909] Updated weights for policy 0, policy_version 125353 (0.0040) [2024-06-28 02:06:58,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 2053816320. Throughput: 0: 43862.3. Samples: 1956669540. Policy #0 lag: (min: 1.0, avg: 9.4, max: 20.0) [2024-06-28 02:06:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:07:02,186][06909] Updated weights for policy 0, policy_version 125363 (0.0027) [2024-06-28 02:07:03,850][06674] Fps is (10 sec: 44246.1, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2054012928. Throughput: 0: 43932.5. Samples: 1956934180. Policy #0 lag: (min: 1.0, avg: 9.4, max: 20.0) [2024-06-28 02:07:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:07:05,829][06909] Updated weights for policy 0, policy_version 125373 (0.0031) [2024-06-28 02:07:08,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43963.7, 300 sec: 44042.8). Total num frames: 2054225920. Throughput: 0: 43714.2. Samples: 1957194080. Policy #0 lag: (min: 1.0, avg: 9.4, max: 20.0) [2024-06-28 02:07:08,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:07:09,623][06909] Updated weights for policy 0, policy_version 125383 (0.0039) [2024-06-28 02:07:13,423][06909] Updated weights for policy 0, policy_version 125393 (0.0030) [2024-06-28 02:07:13,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.9, 300 sec: 44209.3). Total num frames: 2054471680. Throughput: 0: 43752.9. Samples: 1957331600. Policy #0 lag: (min: 1.0, avg: 9.4, max: 20.0) [2024-06-28 02:07:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:07:17,203][06909] Updated weights for policy 0, policy_version 125403 (0.0027) [2024-06-28 02:07:18,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44509.9, 300 sec: 44097.9). Total num frames: 2054684672. Throughput: 0: 43785.8. Samples: 1957593020. Policy #0 lag: (min: 1.0, avg: 9.4, max: 20.0) [2024-06-28 02:07:18,859][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:07:20,877][06909] Updated weights for policy 0, policy_version 125413 (0.0030) [2024-06-28 02:07:23,850][06674] Fps is (10 sec: 40959.4, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2054881280. Throughput: 0: 43846.4. Samples: 1957855780. Policy #0 lag: (min: 1.0, avg: 10.3, max: 20.0) [2024-06-28 02:07:23,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 02:07:24,638][06909] Updated weights for policy 0, policy_version 125423 (0.0032) [2024-06-28 02:07:28,435][06909] Updated weights for policy 0, policy_version 125433 (0.0037) [2024-06-28 02:07:28,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43417.7, 300 sec: 44098.3). Total num frames: 2055110656. Throughput: 0: 43856.5. Samples: 1957986320. Policy #0 lag: (min: 1.0, avg: 10.3, max: 20.0) [2024-06-28 02:07:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:07:31,842][06909] Updated weights for policy 0, policy_version 125443 (0.0037) [2024-06-28 02:07:33,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2055340032. Throughput: 0: 43883.0. Samples: 1958254080. Policy #0 lag: (min: 1.0, avg: 10.3, max: 20.0) [2024-06-28 02:07:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 02:07:35,740][06909] Updated weights for policy 0, policy_version 125453 (0.0052) [2024-06-28 02:07:38,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2055536640. Throughput: 0: 43920.3. Samples: 1958513600. Policy #0 lag: (min: 1.0, avg: 10.3, max: 20.0) [2024-06-28 02:07:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:07:39,377][06909] Updated weights for policy 0, policy_version 125463 (0.0024) [2024-06-28 02:07:43,143][06909] Updated weights for policy 0, policy_version 125473 (0.0038) [2024-06-28 02:07:43,850][06674] Fps is (10 sec: 45875.8, 60 sec: 43963.8, 300 sec: 44209.0). Total num frames: 2055798784. Throughput: 0: 44083.6. Samples: 1958653300. Policy #0 lag: (min: 1.0, avg: 10.3, max: 20.0) [2024-06-28 02:07:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:07:46,942][06909] Updated weights for policy 0, policy_version 125483 (0.0041) [2024-06-28 02:07:48,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2055995392. Throughput: 0: 44148.8. Samples: 1958920880. Policy #0 lag: (min: 1.0, avg: 10.3, max: 20.0) [2024-06-28 02:07:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:07:48,858][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000125488_2055995392.pth... [2024-06-28 02:07:48,903][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000124843_2045427712.pth [2024-06-28 02:07:50,620][06887] Signal inference workers to stop experience collection... (27800 times) [2024-06-28 02:07:50,662][06909] InferenceWorker_p0-w0: stopping experience collection (27800 times) [2024-06-28 02:07:50,674][06887] Signal inference workers to resume experience collection... (27800 times) [2024-06-28 02:07:50,681][06909] InferenceWorker_p0-w0: resuming experience collection (27800 times) [2024-06-28 02:07:50,688][06909] Updated weights for policy 0, policy_version 125493 (0.0041) [2024-06-28 02:07:53,850][06674] Fps is (10 sec: 42599.0, 60 sec: 44238.4, 300 sec: 44098.0). Total num frames: 2056224768. Throughput: 0: 44206.0. Samples: 1959183340. Policy #0 lag: (min: 1.0, avg: 10.3, max: 20.0) [2024-06-28 02:07:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:07:54,308][06909] Updated weights for policy 0, policy_version 125503 (0.0030) [2024-06-28 02:07:57,756][06909] Updated weights for policy 0, policy_version 125513 (0.0040) [2024-06-28 02:07:58,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2056454144. Throughput: 0: 44209.3. Samples: 1959321020. Policy #0 lag: (min: 1.0, avg: 10.3, max: 20.0) [2024-06-28 02:07:58,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:08:01,799][06909] Updated weights for policy 0, policy_version 125523 (0.0024) [2024-06-28 02:08:03,850][06674] Fps is (10 sec: 44235.8, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2056667136. Throughput: 0: 44311.9. Samples: 1959587060. Policy #0 lag: (min: 1.0, avg: 10.3, max: 20.0) [2024-06-28 02:08:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:08:05,232][06909] Updated weights for policy 0, policy_version 125533 (0.0021) [2024-06-28 02:08:08,850][06674] Fps is (10 sec: 42597.7, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 2056880128. Throughput: 0: 44376.0. Samples: 1959852700. Policy #0 lag: (min: 1.0, avg: 10.3, max: 20.0) [2024-06-28 02:08:08,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:08:08,878][06909] Updated weights for policy 0, policy_version 125543 (0.0033) [2024-06-28 02:08:12,554][06909] Updated weights for policy 0, policy_version 125553 (0.0036) [2024-06-28 02:08:13,853][06674] Fps is (10 sec: 45863.3, 60 sec: 44234.8, 300 sec: 44153.1). Total num frames: 2057125888. Throughput: 0: 44552.4. Samples: 1959991300. Policy #0 lag: (min: 1.0, avg: 10.3, max: 20.0) [2024-06-28 02:08:13,853][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:08:16,130][06909] Updated weights for policy 0, policy_version 125563 (0.0029) [2024-06-28 02:08:18,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2057322496. Throughput: 0: 44392.0. Samples: 1960251720. Policy #0 lag: (min: 1.0, avg: 10.3, max: 20.0) [2024-06-28 02:08:18,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:08:20,019][06909] Updated weights for policy 0, policy_version 125573 (0.0028) [2024-06-28 02:08:23,801][06909] Updated weights for policy 0, policy_version 125583 (0.0027) [2024-06-28 02:08:23,850][06674] Fps is (10 sec: 42609.9, 60 sec: 44510.0, 300 sec: 44153.5). Total num frames: 2057551872. Throughput: 0: 44523.5. Samples: 1960517160. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 02:08:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:08:27,347][06909] Updated weights for policy 0, policy_version 125593 (0.0037) [2024-06-28 02:08:28,850][06674] Fps is (10 sec: 44237.3, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2057764864. Throughput: 0: 44321.4. Samples: 1960647760. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 02:08:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:08:31,170][06909] Updated weights for policy 0, policy_version 125603 (0.0025) [2024-06-28 02:08:33,856][06674] Fps is (10 sec: 44209.8, 60 sec: 44232.4, 300 sec: 44152.6). Total num frames: 2057994240. Throughput: 0: 44205.1. Samples: 1960910380. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 02:08:33,856][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:08:34,614][06909] Updated weights for policy 0, policy_version 125613 (0.0041) [2024-06-28 02:08:38,808][06909] Updated weights for policy 0, policy_version 125623 (0.0035) [2024-06-28 02:08:38,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 2058207232. Throughput: 0: 44398.6. Samples: 1961181280. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 02:08:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:08:42,151][06909] Updated weights for policy 0, policy_version 125633 (0.0039) [2024-06-28 02:08:43,850][06674] Fps is (10 sec: 44263.0, 60 sec: 43963.6, 300 sec: 44153.5). Total num frames: 2058436608. Throughput: 0: 44302.5. Samples: 1961314640. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 02:08:43,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 02:08:45,952][06909] Updated weights for policy 0, policy_version 125643 (0.0031) [2024-06-28 02:08:48,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2058649600. Throughput: 0: 44108.5. Samples: 1961571940. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 02:08:48,862][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 02:08:49,622][06909] Updated weights for policy 0, policy_version 125653 (0.0034) [2024-06-28 02:08:53,216][06909] Updated weights for policy 0, policy_version 125663 (0.0035) [2024-06-28 02:08:53,850][06674] Fps is (10 sec: 42599.4, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2058862592. Throughput: 0: 44312.7. Samples: 1961846760. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 02:08:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 02:08:56,770][06909] Updated weights for policy 0, policy_version 125673 (0.0028) [2024-06-28 02:08:58,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2059091968. Throughput: 0: 44221.8. Samples: 1961981160. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 02:08:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:09:00,806][06909] Updated weights for policy 0, policy_version 125683 (0.0032) [2024-06-28 02:09:03,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2059321344. Throughput: 0: 44197.1. Samples: 1962240580. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 02:09:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:09:04,277][06909] Updated weights for policy 0, policy_version 125693 (0.0039) [2024-06-28 02:09:08,218][06909] Updated weights for policy 0, policy_version 125703 (0.0027) [2024-06-28 02:09:08,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44510.0, 300 sec: 44098.0). Total num frames: 2059550720. Throughput: 0: 44346.7. Samples: 1962512760. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 02:09:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 02:09:11,374][06909] Updated weights for policy 0, policy_version 125713 (0.0039) [2024-06-28 02:09:13,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43965.7, 300 sec: 44153.5). Total num frames: 2059763712. Throughput: 0: 44323.0. Samples: 1962642300. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 02:09:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:09:15,540][06909] Updated weights for policy 0, policy_version 125723 (0.0029) [2024-06-28 02:09:18,850][06674] Fps is (10 sec: 44236.2, 60 sec: 44509.9, 300 sec: 44097.9). Total num frames: 2059993088. Throughput: 0: 44287.7. Samples: 1962903060. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 02:09:18,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:09:18,988][06909] Updated weights for policy 0, policy_version 125733 (0.0033) [2024-06-28 02:09:22,892][06909] Updated weights for policy 0, policy_version 125743 (0.0038) [2024-06-28 02:09:23,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 2060222464. Throughput: 0: 44303.1. Samples: 1963174920. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 02:09:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:09:26,449][06909] Updated weights for policy 0, policy_version 125753 (0.0033) [2024-06-28 02:09:27,913][06887] Signal inference workers to stop experience collection... (27850 times) [2024-06-28 02:09:27,914][06887] Signal inference workers to resume experience collection... (27850 times) [2024-06-28 02:09:27,929][06909] InferenceWorker_p0-w0: stopping experience collection (27850 times) [2024-06-28 02:09:27,957][06909] InferenceWorker_p0-w0: resuming experience collection (27850 times) [2024-06-28 02:09:28,850][06674] Fps is (10 sec: 42598.1, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2060419072. Throughput: 0: 44369.4. Samples: 1963311260. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 02:09:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:09:30,339][06909] Updated weights for policy 0, policy_version 125763 (0.0026) [2024-06-28 02:09:33,850][06674] Fps is (10 sec: 42598.8, 60 sec: 44241.3, 300 sec: 44098.0). Total num frames: 2060648448. Throughput: 0: 44528.1. Samples: 1963575700. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 02:09:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:09:33,931][06909] Updated weights for policy 0, policy_version 125773 (0.0028) [2024-06-28 02:09:37,871][06909] Updated weights for policy 0, policy_version 125783 (0.0027) [2024-06-28 02:09:38,850][06674] Fps is (10 sec: 47514.1, 60 sec: 44782.9, 300 sec: 44209.0). Total num frames: 2060894208. Throughput: 0: 44252.3. Samples: 1963838120. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 02:09:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:09:41,146][06909] Updated weights for policy 0, policy_version 125793 (0.0036) [2024-06-28 02:09:43,850][06674] Fps is (10 sec: 42597.5, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2061074432. Throughput: 0: 44219.8. Samples: 1963971060. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 02:09:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 02:09:45,324][06909] Updated weights for policy 0, policy_version 125803 (0.0025) [2024-06-28 02:09:48,477][06909] Updated weights for policy 0, policy_version 125813 (0.0042) [2024-06-28 02:09:48,850][06674] Fps is (10 sec: 42598.1, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 2061320192. Throughput: 0: 44387.8. Samples: 1964238040. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 02:09:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 02:09:48,901][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000125814_2061336576.pth... [2024-06-28 02:09:48,951][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000125165_2050703360.pth [2024-06-28 02:09:52,978][06909] Updated weights for policy 0, policy_version 125823 (0.0023) [2024-06-28 02:09:53,850][06674] Fps is (10 sec: 45876.4, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 2061533184. Throughput: 0: 44027.2. Samples: 1964493980. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 02:09:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:09:56,090][06909] Updated weights for policy 0, policy_version 125833 (0.0039) [2024-06-28 02:09:58,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2061729792. Throughput: 0: 43997.4. Samples: 1964622180. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 02:09:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:10:00,195][06909] Updated weights for policy 0, policy_version 125843 (0.0031) [2024-06-28 02:10:03,626][06909] Updated weights for policy 0, policy_version 125853 (0.0026) [2024-06-28 02:10:03,850][06674] Fps is (10 sec: 44235.7, 60 sec: 44236.6, 300 sec: 44153.5). Total num frames: 2061975552. Throughput: 0: 44165.3. Samples: 1964890500. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 02:10:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 02:10:07,679][06909] Updated weights for policy 0, policy_version 125863 (0.0029) [2024-06-28 02:10:08,850][06674] Fps is (10 sec: 47512.9, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 2062204928. Throughput: 0: 43911.5. Samples: 1965150940. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 02:10:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:10:10,854][06909] Updated weights for policy 0, policy_version 125873 (0.0026) [2024-06-28 02:10:13,850][06674] Fps is (10 sec: 40960.6, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 2062385152. Throughput: 0: 43870.4. Samples: 1965285420. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 02:10:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:10:15,150][06909] Updated weights for policy 0, policy_version 125883 (0.0028) [2024-06-28 02:10:18,520][06909] Updated weights for policy 0, policy_version 125893 (0.0039) [2024-06-28 02:10:18,850][06674] Fps is (10 sec: 44237.3, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2062647296. Throughput: 0: 43845.3. Samples: 1965548740. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 02:10:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:10:22,553][06909] Updated weights for policy 0, policy_version 125903 (0.0035) [2024-06-28 02:10:23,850][06674] Fps is (10 sec: 49151.9, 60 sec: 44236.8, 300 sec: 44264.6). Total num frames: 2062876672. Throughput: 0: 43962.7. Samples: 1965816440. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 02:10:23,856][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:10:25,701][06909] Updated weights for policy 0, policy_version 125913 (0.0022) [2024-06-28 02:10:28,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2063056896. Throughput: 0: 43964.5. Samples: 1965949460. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 02:10:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:10:30,043][06909] Updated weights for policy 0, policy_version 125923 (0.0030) [2024-06-28 02:10:33,124][06909] Updated weights for policy 0, policy_version 125933 (0.0039) [2024-06-28 02:10:33,850][06674] Fps is (10 sec: 44236.1, 60 sec: 44509.7, 300 sec: 44264.6). Total num frames: 2063319040. Throughput: 0: 43800.4. Samples: 1966209060. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 02:10:33,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:10:37,563][06909] Updated weights for policy 0, policy_version 125943 (0.0029) [2024-06-28 02:10:38,850][06674] Fps is (10 sec: 45875.8, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 2063515648. Throughput: 0: 43909.7. Samples: 1966469920. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 02:10:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:10:39,765][06887] Signal inference workers to stop experience collection... (27900 times) [2024-06-28 02:10:39,766][06887] Signal inference workers to resume experience collection... (27900 times) [2024-06-28 02:10:39,810][06909] InferenceWorker_p0-w0: stopping experience collection (27900 times) [2024-06-28 02:10:39,810][06909] InferenceWorker_p0-w0: resuming experience collection (27900 times) [2024-06-28 02:10:40,737][06909] Updated weights for policy 0, policy_version 125953 (0.0032) [2024-06-28 02:10:43,850][06674] Fps is (10 sec: 39322.1, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2063712256. Throughput: 0: 44054.6. Samples: 1966604640. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 02:10:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 02:10:44,970][06909] Updated weights for policy 0, policy_version 125963 (0.0028) [2024-06-28 02:10:47,997][06909] Updated weights for policy 0, policy_version 125973 (0.0032) [2024-06-28 02:10:48,852][06674] Fps is (10 sec: 45865.7, 60 sec: 44235.3, 300 sec: 44208.7). Total num frames: 2063974400. Throughput: 0: 44059.4. Samples: 1966873260. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 02:10:48,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:10:52,240][06909] Updated weights for policy 0, policy_version 125983 (0.0037) [2024-06-28 02:10:53,850][06674] Fps is (10 sec: 47513.8, 60 sec: 44236.7, 300 sec: 44098.0). Total num frames: 2064187392. Throughput: 0: 44190.8. Samples: 1967139520. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 02:10:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 02:10:55,458][06909] Updated weights for policy 0, policy_version 125993 (0.0026) [2024-06-28 02:10:58,852][06674] Fps is (10 sec: 40960.1, 60 sec: 44235.3, 300 sec: 44153.2). Total num frames: 2064384000. Throughput: 0: 44304.6. Samples: 1967279220. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 02:10:58,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:10:59,323][06909] Updated weights for policy 0, policy_version 126003 (0.0027) [2024-06-28 02:11:02,695][06909] Updated weights for policy 0, policy_version 126013 (0.0025) [2024-06-28 02:11:03,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 2064629760. Throughput: 0: 44322.6. Samples: 1967543260. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 02:11:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:11:07,183][06909] Updated weights for policy 0, policy_version 126023 (0.0039) [2024-06-28 02:11:08,850][06674] Fps is (10 sec: 45884.0, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2064842752. Throughput: 0: 44197.6. Samples: 1967805340. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 02:11:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:11:09,984][06909] Updated weights for policy 0, policy_version 126033 (0.0038) [2024-06-28 02:11:13,850][06674] Fps is (10 sec: 40960.4, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2065039360. Throughput: 0: 44169.0. Samples: 1967937060. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 02:11:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:11:14,451][06909] Updated weights for policy 0, policy_version 126043 (0.0030) [2024-06-28 02:11:17,797][06909] Updated weights for policy 0, policy_version 126053 (0.0030) [2024-06-28 02:11:18,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2065285120. Throughput: 0: 44230.3. Samples: 1968199420. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 02:11:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:11:21,879][06909] Updated weights for policy 0, policy_version 126063 (0.0031) [2024-06-28 02:11:23,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2065498112. Throughput: 0: 44392.5. Samples: 1968467580. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 02:11:23,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-28 02:11:25,104][06909] Updated weights for policy 0, policy_version 126073 (0.0029) [2024-06-28 02:11:28,850][06674] Fps is (10 sec: 42598.9, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 2065711104. Throughput: 0: 44290.7. Samples: 1968597720. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 02:11:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:11:29,151][06909] Updated weights for policy 0, policy_version 126083 (0.0028) [2024-06-28 02:11:32,756][06909] Updated weights for policy 0, policy_version 126093 (0.0031) [2024-06-28 02:11:33,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 2065940480. Throughput: 0: 44175.3. Samples: 1968861060. Policy #0 lag: (min: 0.0, avg: 11.2, max: 23.0) [2024-06-28 02:11:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:11:36,511][06909] Updated weights for policy 0, policy_version 126103 (0.0026) [2024-06-28 02:11:38,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2066169856. Throughput: 0: 44271.0. Samples: 1969131720. Policy #0 lag: (min: 0.0, avg: 11.2, max: 23.0) [2024-06-28 02:11:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:11:39,995][06909] Updated weights for policy 0, policy_version 126113 (0.0039) [2024-06-28 02:11:43,850][06674] Fps is (10 sec: 42598.8, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2066366464. Throughput: 0: 43972.2. Samples: 1969257880. Policy #0 lag: (min: 0.0, avg: 11.2, max: 23.0) [2024-06-28 02:11:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:11:44,268][06909] Updated weights for policy 0, policy_version 126123 (0.0031) [2024-06-28 02:11:47,294][06909] Updated weights for policy 0, policy_version 126133 (0.0022) [2024-06-28 02:11:48,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43692.2, 300 sec: 44153.8). Total num frames: 2066595840. Throughput: 0: 43844.5. Samples: 1969516260. Policy #0 lag: (min: 0.0, avg: 11.2, max: 23.0) [2024-06-28 02:11:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:11:48,978][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000126136_2066612224.pth... [2024-06-28 02:11:49,031][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000125488_2055995392.pth [2024-06-28 02:11:51,600][06909] Updated weights for policy 0, policy_version 126143 (0.0036) [2024-06-28 02:11:52,720][06887] Signal inference workers to stop experience collection... (27950 times) [2024-06-28 02:11:52,720][06887] Signal inference workers to resume experience collection... (27950 times) [2024-06-28 02:11:52,733][06909] InferenceWorker_p0-w0: stopping experience collection (27950 times) [2024-06-28 02:11:52,733][06909] InferenceWorker_p0-w0: resuming experience collection (27950 times) [2024-06-28 02:11:53,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2066808832. Throughput: 0: 43958.3. Samples: 1969783460. Policy #0 lag: (min: 0.0, avg: 11.2, max: 23.0) [2024-06-28 02:11:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:11:55,209][06909] Updated weights for policy 0, policy_version 126153 (0.0031) [2024-06-28 02:11:58,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44238.3, 300 sec: 44153.5). Total num frames: 2067038208. Throughput: 0: 44024.9. Samples: 1969918180. Policy #0 lag: (min: 0.0, avg: 11.2, max: 23.0) [2024-06-28 02:11:58,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-28 02:11:59,020][06909] Updated weights for policy 0, policy_version 126163 (0.0029) [2024-06-28 02:12:02,606][06909] Updated weights for policy 0, policy_version 126173 (0.0037) [2024-06-28 02:12:03,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43690.8, 300 sec: 44153.5). Total num frames: 2067251200. Throughput: 0: 44056.6. Samples: 1970181960. Policy #0 lag: (min: 0.0, avg: 11.2, max: 23.0) [2024-06-28 02:12:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:12:06,319][06909] Updated weights for policy 0, policy_version 126183 (0.0041) [2024-06-28 02:12:08,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 2067496960. Throughput: 0: 44019.1. Samples: 1970448440. Policy #0 lag: (min: 0.0, avg: 11.2, max: 23.0) [2024-06-28 02:12:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:12:09,802][06909] Updated weights for policy 0, policy_version 126193 (0.0026) [2024-06-28 02:12:13,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2067693568. Throughput: 0: 44206.2. Samples: 1970587000. Policy #0 lag: (min: 0.0, avg: 11.2, max: 23.0) [2024-06-28 02:12:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:12:13,875][06909] Updated weights for policy 0, policy_version 126203 (0.0040) [2024-06-28 02:12:17,375][06909] Updated weights for policy 0, policy_version 126213 (0.0043) [2024-06-28 02:12:18,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 2067906560. Throughput: 0: 44183.2. Samples: 1970849300. Policy #0 lag: (min: 0.0, avg: 11.2, max: 23.0) [2024-06-28 02:12:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:12:21,349][06909] Updated weights for policy 0, policy_version 126223 (0.0024) [2024-06-28 02:12:23,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 2068152320. Throughput: 0: 44000.6. Samples: 1971111740. Policy #0 lag: (min: 0.0, avg: 11.2, max: 23.0) [2024-06-28 02:12:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:12:24,832][06909] Updated weights for policy 0, policy_version 126233 (0.0044) [2024-06-28 02:12:28,475][06909] Updated weights for policy 0, policy_version 126243 (0.0030) [2024-06-28 02:12:28,850][06674] Fps is (10 sec: 47513.7, 60 sec: 44509.9, 300 sec: 44209.1). Total num frames: 2068381696. Throughput: 0: 44289.8. Samples: 1971250920. Policy #0 lag: (min: 0.0, avg: 11.2, max: 23.0) [2024-06-28 02:12:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:12:32,178][06909] Updated weights for policy 0, policy_version 126253 (0.0036) [2024-06-28 02:12:33,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.8, 300 sec: 44209.0). Total num frames: 2068578304. Throughput: 0: 44238.2. Samples: 1971506980. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-28 02:12:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:12:35,842][06909] Updated weights for policy 0, policy_version 126263 (0.0037) [2024-06-28 02:12:38,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 2068824064. Throughput: 0: 44134.3. Samples: 1971769500. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-28 02:12:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:12:39,822][06909] Updated weights for policy 0, policy_version 126273 (0.0034) [2024-06-28 02:12:43,124][06909] Updated weights for policy 0, policy_version 126283 (0.0036) [2024-06-28 02:12:43,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44782.9, 300 sec: 44264.6). Total num frames: 2069053440. Throughput: 0: 44355.9. Samples: 1971914200. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-28 02:12:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:12:47,038][06909] Updated weights for policy 0, policy_version 126293 (0.0026) [2024-06-28 02:12:48,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2069250048. Throughput: 0: 44445.8. Samples: 1972182020. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-28 02:12:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:12:50,706][06909] Updated weights for policy 0, policy_version 126303 (0.0038) [2024-06-28 02:12:53,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44782.9, 300 sec: 44209.0). Total num frames: 2069495808. Throughput: 0: 44172.8. Samples: 1972436220. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-28 02:12:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:12:54,303][06909] Updated weights for policy 0, policy_version 126313 (0.0034) [2024-06-28 02:12:58,332][06909] Updated weights for policy 0, policy_version 126323 (0.0022) [2024-06-28 02:12:58,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44509.8, 300 sec: 44209.0). Total num frames: 2069708800. Throughput: 0: 44159.5. Samples: 1972574180. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-28 02:12:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:13:01,674][06909] Updated weights for policy 0, policy_version 126333 (0.0027) [2024-06-28 02:13:03,850][06674] Fps is (10 sec: 40960.3, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2069905408. Throughput: 0: 44204.4. Samples: 1972838500. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-28 02:13:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:13:05,598][06909] Updated weights for policy 0, policy_version 126343 (0.0028) [2024-06-28 02:13:08,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 44098.3). Total num frames: 2070134784. Throughput: 0: 44154.1. Samples: 1973098680. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-28 02:13:08,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:13:09,344][06909] Updated weights for policy 0, policy_version 126353 (0.0032) [2024-06-28 02:13:12,796][06909] Updated weights for policy 0, policy_version 126363 (0.0027) [2024-06-28 02:13:13,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44509.8, 300 sec: 44209.0). Total num frames: 2070364160. Throughput: 0: 44142.6. Samples: 1973237340. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-28 02:13:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:13:16,874][06909] Updated weights for policy 0, policy_version 126373 (0.0031) [2024-06-28 02:13:18,850][06674] Fps is (10 sec: 44237.3, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 2070577152. Throughput: 0: 44394.3. Samples: 1973504720. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-28 02:13:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:13:19,540][06887] Signal inference workers to stop experience collection... (28000 times) [2024-06-28 02:13:19,540][06887] Signal inference workers to resume experience collection... (28000 times) [2024-06-28 02:13:19,557][06909] InferenceWorker_p0-w0: stopping experience collection (28000 times) [2024-06-28 02:13:19,557][06909] InferenceWorker_p0-w0: resuming experience collection (28000 times) [2024-06-28 02:13:20,224][06909] Updated weights for policy 0, policy_version 126383 (0.0030) [2024-06-28 02:13:23,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.7, 300 sec: 44209.0). Total num frames: 2070806528. Throughput: 0: 44298.2. Samples: 1973762920. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-28 02:13:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:13:24,237][06909] Updated weights for policy 0, policy_version 126393 (0.0034) [2024-06-28 02:13:27,676][06909] Updated weights for policy 0, policy_version 126403 (0.0042) [2024-06-28 02:13:28,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.7, 300 sec: 44154.4). Total num frames: 2071019520. Throughput: 0: 44106.2. Samples: 1973898980. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-28 02:13:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:13:31,460][06909] Updated weights for policy 0, policy_version 126413 (0.0025) [2024-06-28 02:13:33,852][06674] Fps is (10 sec: 42589.7, 60 sec: 44235.3, 300 sec: 44153.2). Total num frames: 2071232512. Throughput: 0: 43996.6. Samples: 1974161960. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-28 02:13:33,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:13:35,303][06909] Updated weights for policy 0, policy_version 126423 (0.0036) [2024-06-28 02:13:38,843][06909] Updated weights for policy 0, policy_version 126433 (0.0046) [2024-06-28 02:13:38,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44236.8, 300 sec: 44209.1). Total num frames: 2071478272. Throughput: 0: 44126.8. Samples: 1974421920. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-28 02:13:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:13:42,643][06909] Updated weights for policy 0, policy_version 126443 (0.0032) [2024-06-28 02:13:43,850][06674] Fps is (10 sec: 44246.1, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 2071674880. Throughput: 0: 44149.8. Samples: 1974560920. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-28 02:13:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:13:46,617][06909] Updated weights for policy 0, policy_version 126453 (0.0045) [2024-06-28 02:13:48,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2071887872. Throughput: 0: 44077.4. Samples: 1974821980. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-28 02:13:48,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 02:13:48,897][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000126459_2071904256.pth... [2024-06-28 02:13:48,944][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000125814_2061336576.pth [2024-06-28 02:13:49,997][06909] Updated weights for policy 0, policy_version 126463 (0.0032) [2024-06-28 02:13:53,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43690.6, 300 sec: 44153.5). Total num frames: 2072117248. Throughput: 0: 44202.6. Samples: 1975087800. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-28 02:13:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:13:53,963][06909] Updated weights for policy 0, policy_version 126473 (0.0044) [2024-06-28 02:13:57,398][06909] Updated weights for policy 0, policy_version 126483 (0.0036) [2024-06-28 02:13:58,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2072346624. Throughput: 0: 44220.4. Samples: 1975227260. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-28 02:13:58,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:14:01,152][06909] Updated weights for policy 0, policy_version 126493 (0.0033) [2024-06-28 02:14:03,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2072543232. Throughput: 0: 44042.6. Samples: 1975486640. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-28 02:14:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:14:04,933][06909] Updated weights for policy 0, policy_version 126503 (0.0042) [2024-06-28 02:14:08,334][06909] Updated weights for policy 0, policy_version 126513 (0.0021) [2024-06-28 02:14:08,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 2072788992. Throughput: 0: 44078.7. Samples: 1975746460. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-28 02:14:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:14:12,375][06909] Updated weights for policy 0, policy_version 126523 (0.0028) [2024-06-28 02:14:13,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2073001984. Throughput: 0: 44187.6. Samples: 1975887420. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-28 02:14:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:14:15,892][06909] Updated weights for policy 0, policy_version 126533 (0.0031) [2024-06-28 02:14:18,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2073231360. Throughput: 0: 44295.8. Samples: 1976155180. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-28 02:14:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:14:19,532][06909] Updated weights for policy 0, policy_version 126543 (0.0034) [2024-06-28 02:14:23,578][06909] Updated weights for policy 0, policy_version 126553 (0.0043) [2024-06-28 02:14:23,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.9, 300 sec: 44209.1). Total num frames: 2073460736. Throughput: 0: 44296.9. Samples: 1976415280. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-28 02:14:23,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-28 02:14:27,035][06909] Updated weights for policy 0, policy_version 126563 (0.0027) [2024-06-28 02:14:28,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2073673728. Throughput: 0: 44132.8. Samples: 1976546900. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-28 02:14:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:14:30,980][06909] Updated weights for policy 0, policy_version 126573 (0.0034) [2024-06-28 02:14:33,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44511.4, 300 sec: 44098.0). Total num frames: 2073903104. Throughput: 0: 44212.4. Samples: 1976811540. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-28 02:14:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:14:34,305][06909] Updated weights for policy 0, policy_version 126583 (0.0027) [2024-06-28 02:14:38,166][06909] Updated weights for policy 0, policy_version 126593 (0.0031) [2024-06-28 02:14:38,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 2074116096. Throughput: 0: 44168.5. Samples: 1977075380. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 02:14:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:14:42,091][06909] Updated weights for policy 0, policy_version 126603 (0.0031) [2024-06-28 02:14:43,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2074329088. Throughput: 0: 44158.8. Samples: 1977214400. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 02:14:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:14:45,497][06909] Updated weights for policy 0, policy_version 126613 (0.0031) [2024-06-28 02:14:48,855][06674] Fps is (10 sec: 42574.5, 60 sec: 44232.6, 300 sec: 44097.1). Total num frames: 2074542080. Throughput: 0: 44076.2. Samples: 1977470320. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 02:14:48,856][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:14:49,425][06909] Updated weights for policy 0, policy_version 126623 (0.0027) [2024-06-28 02:14:50,808][06887] Signal inference workers to stop experience collection... (28050 times) [2024-06-28 02:14:50,810][06887] Signal inference workers to resume experience collection... (28050 times) [2024-06-28 02:14:50,825][06909] InferenceWorker_p0-w0: stopping experience collection (28050 times) [2024-06-28 02:14:50,826][06909] InferenceWorker_p0-w0: resuming experience collection (28050 times) [2024-06-28 02:14:53,265][06909] Updated weights for policy 0, policy_version 126633 (0.0033) [2024-06-28 02:14:53,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 2074771456. Throughput: 0: 44027.9. Samples: 1977727720. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 02:14:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:14:56,795][06909] Updated weights for policy 0, policy_version 126643 (0.0036) [2024-06-28 02:14:58,852][06674] Fps is (10 sec: 45891.4, 60 sec: 44235.3, 300 sec: 44153.2). Total num frames: 2075000832. Throughput: 0: 43919.7. Samples: 1977863900. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 02:14:58,853][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:15:00,924][06909] Updated weights for policy 0, policy_version 126653 (0.0024) [2024-06-28 02:15:03,850][06674] Fps is (10 sec: 44237.3, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 2075213824. Throughput: 0: 43936.1. Samples: 1978132300. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 02:15:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:15:04,286][06909] Updated weights for policy 0, policy_version 126663 (0.0027) [2024-06-28 02:15:08,256][06909] Updated weights for policy 0, policy_version 126673 (0.0025) [2024-06-28 02:15:08,856][06674] Fps is (10 sec: 42581.6, 60 sec: 43959.3, 300 sec: 44208.1). Total num frames: 2075426816. Throughput: 0: 43877.1. Samples: 1978390020. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 02:15:08,856][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:15:11,943][06909] Updated weights for policy 0, policy_version 126683 (0.0032) [2024-06-28 02:15:13,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2075656192. Throughput: 0: 43848.0. Samples: 1978520060. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 02:15:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:15:15,426][06909] Updated weights for policy 0, policy_version 126693 (0.0032) [2024-06-28 02:15:18,850][06674] Fps is (10 sec: 44263.6, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2075869184. Throughput: 0: 43989.7. Samples: 1978791080. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 02:15:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:15:19,441][06909] Updated weights for policy 0, policy_version 126703 (0.0029) [2024-06-28 02:15:22,879][06909] Updated weights for policy 0, policy_version 126713 (0.0022) [2024-06-28 02:15:23,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.6, 300 sec: 44209.0). Total num frames: 2076098560. Throughput: 0: 44071.9. Samples: 1979058620. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 02:15:23,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:15:26,946][06909] Updated weights for policy 0, policy_version 126723 (0.0036) [2024-06-28 02:15:28,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2076327936. Throughput: 0: 43833.8. Samples: 1979186920. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 02:15:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:15:30,767][06909] Updated weights for policy 0, policy_version 126733 (0.0030) [2024-06-28 02:15:33,852][06674] Fps is (10 sec: 44228.0, 60 sec: 43962.2, 300 sec: 44153.2). Total num frames: 2076540928. Throughput: 0: 44066.2. Samples: 1979453140. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 02:15:33,852][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 02:15:34,403][06909] Updated weights for policy 0, policy_version 126743 (0.0042) [2024-06-28 02:15:38,265][06909] Updated weights for policy 0, policy_version 126753 (0.0041) [2024-06-28 02:15:38,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.8, 300 sec: 44209.0). Total num frames: 2076753920. Throughput: 0: 44166.8. Samples: 1979715220. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 02:15:38,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 02:15:41,796][06909] Updated weights for policy 0, policy_version 126763 (0.0031) [2024-06-28 02:15:43,850][06674] Fps is (10 sec: 42607.4, 60 sec: 43963.7, 300 sec: 44042.7). Total num frames: 2076966912. Throughput: 0: 44007.0. Samples: 1979844120. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 02:15:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:15:45,395][06909] Updated weights for policy 0, policy_version 126773 (0.0034) [2024-06-28 02:15:48,850][06674] Fps is (10 sec: 44236.2, 60 sec: 44240.9, 300 sec: 44097.9). Total num frames: 2077196288. Throughput: 0: 44083.9. Samples: 1980116080. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 02:15:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:15:48,983][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000126783_2077212672.pth... [2024-06-28 02:15:48,990][06909] Updated weights for policy 0, policy_version 126783 (0.0031) [2024-06-28 02:15:49,037][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000126136_2066612224.pth [2024-06-28 02:15:52,665][06909] Updated weights for policy 0, policy_version 126793 (0.0028) [2024-06-28 02:15:53,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.8, 300 sec: 44153.8). Total num frames: 2077409280. Throughput: 0: 44158.8. Samples: 1980376900. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 02:15:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:15:56,704][06909] Updated weights for policy 0, policy_version 126803 (0.0046) [2024-06-28 02:15:58,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43965.3, 300 sec: 44098.0). Total num frames: 2077638656. Throughput: 0: 44129.3. Samples: 1980505880. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 02:15:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:16:00,307][06909] Updated weights for policy 0, policy_version 126813 (0.0031) [2024-06-28 02:16:03,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2077851648. Throughput: 0: 43978.7. Samples: 1980770120. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 02:16:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:16:04,309][06909] Updated weights for policy 0, policy_version 126823 (0.0028) [2024-06-28 02:16:07,927][06909] Updated weights for policy 0, policy_version 126833 (0.0036) [2024-06-28 02:16:08,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43968.1, 300 sec: 44153.5). Total num frames: 2078064640. Throughput: 0: 43977.3. Samples: 1981037600. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 02:16:08,853][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:16:11,446][06909] Updated weights for policy 0, policy_version 126843 (0.0027) [2024-06-28 02:16:13,851][06674] Fps is (10 sec: 44232.7, 60 sec: 43963.1, 300 sec: 44097.8). Total num frames: 2078294016. Throughput: 0: 44092.0. Samples: 1981171100. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 02:16:13,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:16:15,241][06909] Updated weights for policy 0, policy_version 126853 (0.0033) [2024-06-28 02:16:18,801][06909] Updated weights for policy 0, policy_version 126863 (0.0044) [2024-06-28 02:16:18,850][06674] Fps is (10 sec: 45876.0, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2078523392. Throughput: 0: 44105.6. Samples: 1981437800. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 02:16:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:16:19,784][06887] Signal inference workers to stop experience collection... (28100 times) [2024-06-28 02:16:19,840][06909] InferenceWorker_p0-w0: stopping experience collection (28100 times) [2024-06-28 02:16:19,841][06887] Signal inference workers to resume experience collection... (28100 times) [2024-06-28 02:16:19,856][06909] InferenceWorker_p0-w0: resuming experience collection (28100 times) [2024-06-28 02:16:22,568][06909] Updated weights for policy 0, policy_version 126873 (0.0026) [2024-06-28 02:16:23,850][06674] Fps is (10 sec: 42602.4, 60 sec: 43690.8, 300 sec: 44098.0). Total num frames: 2078720000. Throughput: 0: 44228.9. Samples: 1981705520. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 02:16:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:16:26,079][06909] Updated weights for policy 0, policy_version 126883 (0.0042) [2024-06-28 02:16:28,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43690.6, 300 sec: 44098.0). Total num frames: 2078949376. Throughput: 0: 44259.0. Samples: 1981835780. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 02:16:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:16:29,863][06909] Updated weights for policy 0, policy_version 126893 (0.0035) [2024-06-28 02:16:33,533][06909] Updated weights for policy 0, policy_version 126903 (0.0025) [2024-06-28 02:16:33,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43965.3, 300 sec: 44098.0). Total num frames: 2079178752. Throughput: 0: 44168.2. Samples: 1982103640. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 02:16:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 02:16:37,633][06909] Updated weights for policy 0, policy_version 126913 (0.0045) [2024-06-28 02:16:38,850][06674] Fps is (10 sec: 47513.7, 60 sec: 44509.8, 300 sec: 44264.6). Total num frames: 2079424512. Throughput: 0: 44300.8. Samples: 1982370440. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 02:16:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:16:40,998][06909] Updated weights for policy 0, policy_version 126923 (0.0046) [2024-06-28 02:16:43,850][06674] Fps is (10 sec: 44236.0, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 2079621120. Throughput: 0: 44297.3. Samples: 1982499260. Policy #0 lag: (min: 1.0, avg: 9.7, max: 19.0) [2024-06-28 02:16:43,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:16:44,996][06909] Updated weights for policy 0, policy_version 126933 (0.0027) [2024-06-28 02:16:48,129][06909] Updated weights for policy 0, policy_version 126943 (0.0035) [2024-06-28 02:16:48,852][06674] Fps is (10 sec: 44227.8, 60 sec: 44508.4, 300 sec: 44264.3). Total num frames: 2079866880. Throughput: 0: 44381.5. Samples: 1982767380. Policy #0 lag: (min: 1.0, avg: 9.7, max: 19.0) [2024-06-28 02:16:48,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:16:52,344][06909] Updated weights for policy 0, policy_version 126953 (0.0038) [2024-06-28 02:16:53,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44509.9, 300 sec: 44209.0). Total num frames: 2080079872. Throughput: 0: 44350.8. Samples: 1983033380. Policy #0 lag: (min: 1.0, avg: 9.7, max: 19.0) [2024-06-28 02:16:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:16:55,506][06909] Updated weights for policy 0, policy_version 126963 (0.0028) [2024-06-28 02:16:58,850][06674] Fps is (10 sec: 40968.7, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2080276480. Throughput: 0: 44433.8. Samples: 1983170580. Policy #0 lag: (min: 1.0, avg: 9.7, max: 19.0) [2024-06-28 02:16:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:16:59,926][06909] Updated weights for policy 0, policy_version 126973 (0.0021) [2024-06-28 02:17:03,105][06909] Updated weights for policy 0, policy_version 126983 (0.0037) [2024-06-28 02:17:03,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 2080522240. Throughput: 0: 44328.9. Samples: 1983432600. Policy #0 lag: (min: 1.0, avg: 9.7, max: 19.0) [2024-06-28 02:17:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:17:07,030][06909] Updated weights for policy 0, policy_version 126993 (0.0027) [2024-06-28 02:17:08,850][06674] Fps is (10 sec: 47513.2, 60 sec: 44783.0, 300 sec: 44264.6). Total num frames: 2080751616. Throughput: 0: 44354.6. Samples: 1983701480. Policy #0 lag: (min: 1.0, avg: 9.7, max: 19.0) [2024-06-28 02:17:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:17:10,239][06909] Updated weights for policy 0, policy_version 127003 (0.0029) [2024-06-28 02:17:13,850][06674] Fps is (10 sec: 44236.2, 60 sec: 44510.5, 300 sec: 44264.6). Total num frames: 2080964608. Throughput: 0: 44423.1. Samples: 1983834820. Policy #0 lag: (min: 1.0, avg: 9.7, max: 19.0) [2024-06-28 02:17:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:17:14,392][06909] Updated weights for policy 0, policy_version 127013 (0.0035) [2024-06-28 02:17:17,831][06909] Updated weights for policy 0, policy_version 127023 (0.0034) [2024-06-28 02:17:18,850][06674] Fps is (10 sec: 42598.7, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2081177600. Throughput: 0: 44328.4. Samples: 1984098420. Policy #0 lag: (min: 1.0, avg: 9.7, max: 19.0) [2024-06-28 02:17:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:17:21,663][06909] Updated weights for policy 0, policy_version 127033 (0.0034) [2024-06-28 02:17:23,850][06674] Fps is (10 sec: 44237.4, 60 sec: 44782.9, 300 sec: 44153.5). Total num frames: 2081406976. Throughput: 0: 44424.1. Samples: 1984369520. Policy #0 lag: (min: 1.0, avg: 9.7, max: 19.0) [2024-06-28 02:17:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:17:24,923][06909] Updated weights for policy 0, policy_version 127043 (0.0035) [2024-06-28 02:17:28,850][06674] Fps is (10 sec: 42598.1, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2081603584. Throughput: 0: 44464.9. Samples: 1984500180. Policy #0 lag: (min: 1.0, avg: 9.7, max: 19.0) [2024-06-28 02:17:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:17:29,245][06909] Updated weights for policy 0, policy_version 127053 (0.0031) [2024-06-28 02:17:32,618][06909] Updated weights for policy 0, policy_version 127063 (0.0031) [2024-06-28 02:17:33,850][06674] Fps is (10 sec: 45874.5, 60 sec: 44782.8, 300 sec: 44209.0). Total num frames: 2081865728. Throughput: 0: 44462.4. Samples: 1984768100. Policy #0 lag: (min: 1.0, avg: 9.7, max: 19.0) [2024-06-28 02:17:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:17:36,558][06909] Updated weights for policy 0, policy_version 127073 (0.0028) [2024-06-28 02:17:38,850][06674] Fps is (10 sec: 47514.0, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 2082078720. Throughput: 0: 44502.7. Samples: 1985036000. Policy #0 lag: (min: 1.0, avg: 9.7, max: 19.0) [2024-06-28 02:17:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:17:39,942][06909] Updated weights for policy 0, policy_version 127083 (0.0023) [2024-06-28 02:17:40,526][06887] Signal inference workers to stop experience collection... (28150 times) [2024-06-28 02:17:40,526][06887] Signal inference workers to resume experience collection... (28150 times) [2024-06-28 02:17:40,543][06909] InferenceWorker_p0-w0: stopping experience collection (28150 times) [2024-06-28 02:17:40,543][06909] InferenceWorker_p0-w0: resuming experience collection (28150 times) [2024-06-28 02:17:43,783][06909] Updated weights for policy 0, policy_version 127093 (0.0030) [2024-06-28 02:17:43,852][06674] Fps is (10 sec: 42590.1, 60 sec: 44508.4, 300 sec: 44208.7). Total num frames: 2082291712. Throughput: 0: 44355.7. Samples: 1985166680. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 02:17:43,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:17:47,116][06909] Updated weights for policy 0, policy_version 127103 (0.0026) [2024-06-28 02:17:48,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43965.2, 300 sec: 44098.0). Total num frames: 2082504704. Throughput: 0: 44427.4. Samples: 1985431840. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 02:17:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:17:48,986][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000127107_2082521088.pth... [2024-06-28 02:17:49,035][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000126459_2071904256.pth [2024-06-28 02:17:51,202][06909] Updated weights for policy 0, policy_version 127113 (0.0028) [2024-06-28 02:17:53,850][06674] Fps is (10 sec: 44245.7, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2082734080. Throughput: 0: 44276.0. Samples: 1985693900. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 02:17:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:17:54,704][06909] Updated weights for policy 0, policy_version 127123 (0.0034) [2024-06-28 02:17:58,633][06909] Updated weights for policy 0, policy_version 127133 (0.0029) [2024-06-28 02:17:58,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44509.8, 300 sec: 44209.0). Total num frames: 2082947072. Throughput: 0: 44319.6. Samples: 1985829200. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 02:17:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:18:01,991][06909] Updated weights for policy 0, policy_version 127143 (0.0023) [2024-06-28 02:18:03,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44509.8, 300 sec: 44264.6). Total num frames: 2083192832. Throughput: 0: 44167.5. Samples: 1986085960. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 02:18:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:18:06,114][06909] Updated weights for policy 0, policy_version 127153 (0.0041) [2024-06-28 02:18:08,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44236.9, 300 sec: 44209.0). Total num frames: 2083405824. Throughput: 0: 43984.9. Samples: 1986348840. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 02:18:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:18:09,600][06909] Updated weights for policy 0, policy_version 127163 (0.0033) [2024-06-28 02:18:13,563][06909] Updated weights for policy 0, policy_version 127173 (0.0026) [2024-06-28 02:18:13,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2083602432. Throughput: 0: 43936.9. Samples: 1986477340. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 02:18:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:18:17,050][06909] Updated weights for policy 0, policy_version 127183 (0.0030) [2024-06-28 02:18:18,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2083831808. Throughput: 0: 44016.1. Samples: 1986748820. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 02:18:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:18:21,137][06909] Updated weights for policy 0, policy_version 127193 (0.0028) [2024-06-28 02:18:23,850][06674] Fps is (10 sec: 45874.6, 60 sec: 44236.7, 300 sec: 44209.0). Total num frames: 2084061184. Throughput: 0: 43746.1. Samples: 1987004580. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 02:18:23,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:18:24,467][06909] Updated weights for policy 0, policy_version 127203 (0.0029) [2024-06-28 02:18:28,539][06909] Updated weights for policy 0, policy_version 127213 (0.0030) [2024-06-28 02:18:28,850][06674] Fps is (10 sec: 44236.1, 60 sec: 44509.8, 300 sec: 44209.3). Total num frames: 2084274176. Throughput: 0: 43854.8. Samples: 1987140060. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 02:18:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:18:32,064][06909] Updated weights for policy 0, policy_version 127223 (0.0037) [2024-06-28 02:18:33,850][06674] Fps is (10 sec: 42599.3, 60 sec: 43690.8, 300 sec: 44098.0). Total num frames: 2084487168. Throughput: 0: 43949.9. Samples: 1987409580. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 02:18:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:18:35,817][06909] Updated weights for policy 0, policy_version 127233 (0.0031) [2024-06-28 02:18:38,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.7, 300 sec: 44264.6). Total num frames: 2084732928. Throughput: 0: 43951.5. Samples: 1987671720. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 02:18:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:18:39,683][06909] Updated weights for policy 0, policy_version 127243 (0.0037) [2024-06-28 02:18:43,370][06909] Updated weights for policy 0, policy_version 127253 (0.0038) [2024-06-28 02:18:43,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43692.1, 300 sec: 44153.5). Total num frames: 2084913152. Throughput: 0: 43911.1. Samples: 1987805200. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 02:18:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:18:47,406][06909] Updated weights for policy 0, policy_version 127263 (0.0029) [2024-06-28 02:18:47,691][06887] Signal inference workers to stop experience collection... (28200 times) [2024-06-28 02:18:47,692][06887] Signal inference workers to resume experience collection... (28200 times) [2024-06-28 02:18:47,711][06909] InferenceWorker_p0-w0: stopping experience collection (28200 times) [2024-06-28 02:18:47,712][06909] InferenceWorker_p0-w0: resuming experience collection (28200 times) [2024-06-28 02:18:48,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2085142528. Throughput: 0: 44112.9. Samples: 1988071040. Policy #0 lag: (min: 1.0, avg: 10.3, max: 24.0) [2024-06-28 02:18:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:18:50,892][06909] Updated weights for policy 0, policy_version 127273 (0.0027) [2024-06-28 02:18:53,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2085371904. Throughput: 0: 44098.9. Samples: 1988333300. Policy #0 lag: (min: 1.0, avg: 10.3, max: 24.0) [2024-06-28 02:18:53,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:18:54,643][06909] Updated weights for policy 0, policy_version 127283 (0.0031) [2024-06-28 02:18:58,202][06909] Updated weights for policy 0, policy_version 127293 (0.0034) [2024-06-28 02:18:58,850][06674] Fps is (10 sec: 45875.8, 60 sec: 44236.8, 300 sec: 44264.6). Total num frames: 2085601280. Throughput: 0: 44291.1. Samples: 1988470440. Policy #0 lag: (min: 1.0, avg: 10.3, max: 24.0) [2024-06-28 02:18:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:19:01,707][06909] Updated weights for policy 0, policy_version 127303 (0.0036) [2024-06-28 02:19:03,852][06674] Fps is (10 sec: 44228.2, 60 sec: 43689.2, 300 sec: 44153.2). Total num frames: 2085814272. Throughput: 0: 44072.2. Samples: 1988732160. Policy #0 lag: (min: 1.0, avg: 10.3, max: 24.0) [2024-06-28 02:19:03,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:19:05,459][06909] Updated weights for policy 0, policy_version 127313 (0.0030) [2024-06-28 02:19:08,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 2086043648. Throughput: 0: 44336.6. Samples: 1988999720. Policy #0 lag: (min: 1.0, avg: 10.3, max: 24.0) [2024-06-28 02:19:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:19:08,923][06909] Updated weights for policy 0, policy_version 127323 (0.0035) [2024-06-28 02:19:12,886][06909] Updated weights for policy 0, policy_version 127333 (0.0028) [2024-06-28 02:19:13,856][06674] Fps is (10 sec: 44219.2, 60 sec: 44232.3, 300 sec: 44152.6). Total num frames: 2086256640. Throughput: 0: 44383.5. Samples: 1989137580. Policy #0 lag: (min: 1.0, avg: 10.3, max: 24.0) [2024-06-28 02:19:13,856][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 02:19:16,521][06909] Updated weights for policy 0, policy_version 127343 (0.0042) [2024-06-28 02:19:18,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2086469632. Throughput: 0: 44151.4. Samples: 1989396400. Policy #0 lag: (min: 1.0, avg: 10.3, max: 24.0) [2024-06-28 02:19:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:19:20,362][06909] Updated weights for policy 0, policy_version 127353 (0.0026) [2024-06-28 02:19:23,850][06674] Fps is (10 sec: 44263.6, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2086699008. Throughput: 0: 44048.5. Samples: 1989653900. Policy #0 lag: (min: 1.0, avg: 10.3, max: 24.0) [2024-06-28 02:19:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:19:24,299][06909] Updated weights for policy 0, policy_version 127363 (0.0032) [2024-06-28 02:19:28,040][06909] Updated weights for policy 0, policy_version 127373 (0.0022) [2024-06-28 02:19:28,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2086895616. Throughput: 0: 44132.3. Samples: 1989791160. Policy #0 lag: (min: 1.0, avg: 10.3, max: 24.0) [2024-06-28 02:19:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:19:31,662][06909] Updated weights for policy 0, policy_version 127383 (0.0029) [2024-06-28 02:19:33,850][06674] Fps is (10 sec: 40959.4, 60 sec: 43690.5, 300 sec: 44042.4). Total num frames: 2087108608. Throughput: 0: 43940.8. Samples: 1990048380. Policy #0 lag: (min: 1.0, avg: 10.3, max: 24.0) [2024-06-28 02:19:33,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:19:35,324][06909] Updated weights for policy 0, policy_version 127393 (0.0039) [2024-06-28 02:19:38,848][06909] Updated weights for policy 0, policy_version 127403 (0.0035) [2024-06-28 02:19:38,850][06674] Fps is (10 sec: 47514.4, 60 sec: 43963.8, 300 sec: 44209.0). Total num frames: 2087370752. Throughput: 0: 44036.2. Samples: 1990314920. Policy #0 lag: (min: 1.0, avg: 10.3, max: 24.0) [2024-06-28 02:19:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:19:42,762][06909] Updated weights for policy 0, policy_version 127413 (0.0029) [2024-06-28 02:19:43,850][06674] Fps is (10 sec: 45876.1, 60 sec: 44236.8, 300 sec: 44154.3). Total num frames: 2087567360. Throughput: 0: 44131.6. Samples: 1990456360. Policy #0 lag: (min: 1.0, avg: 10.3, max: 24.0) [2024-06-28 02:19:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:19:45,982][06909] Updated weights for policy 0, policy_version 127423 (0.0040) [2024-06-28 02:19:48,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2087780352. Throughput: 0: 44068.3. Samples: 1990715140. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 02:19:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:19:48,990][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000127429_2087796736.pth... [2024-06-28 02:19:49,042][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000126783_2077212672.pth [2024-06-28 02:19:50,182][06909] Updated weights for policy 0, policy_version 127433 (0.0031) [2024-06-28 02:19:53,727][06909] Updated weights for policy 0, policy_version 127443 (0.0035) [2024-06-28 02:19:53,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.9, 300 sec: 44153.8). Total num frames: 2088026112. Throughput: 0: 44212.9. Samples: 1990989300. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 02:19:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:19:57,322][06909] Updated weights for policy 0, policy_version 127453 (0.0032) [2024-06-28 02:19:58,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.6, 300 sec: 44097.9). Total num frames: 2088222720. Throughput: 0: 44084.6. Samples: 1991121120. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 02:19:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:20:01,184][06909] Updated weights for policy 0, policy_version 127463 (0.0037) [2024-06-28 02:20:01,773][06887] Signal inference workers to stop experience collection... (28250 times) [2024-06-28 02:20:01,814][06909] InferenceWorker_p0-w0: stopping experience collection (28250 times) [2024-06-28 02:20:01,823][06887] Signal inference workers to resume experience collection... (28250 times) [2024-06-28 02:20:01,833][06909] InferenceWorker_p0-w0: resuming experience collection (28250 times) [2024-06-28 02:20:03,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43692.2, 300 sec: 44098.9). Total num frames: 2088435712. Throughput: 0: 43996.6. Samples: 1991376240. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 02:20:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:20:04,952][06909] Updated weights for policy 0, policy_version 127473 (0.0040) [2024-06-28 02:20:08,833][06909] Updated weights for policy 0, policy_version 127483 (0.0037) [2024-06-28 02:20:08,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2088681472. Throughput: 0: 44207.0. Samples: 1991643220. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 02:20:08,856][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:20:12,326][06909] Updated weights for policy 0, policy_version 127493 (0.0035) [2024-06-28 02:20:13,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43968.2, 300 sec: 44153.5). Total num frames: 2088894464. Throughput: 0: 44179.3. Samples: 1991779220. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 02:20:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:20:16,116][06909] Updated weights for policy 0, policy_version 127503 (0.0031) [2024-06-28 02:20:18,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2089107456. Throughput: 0: 44258.9. Samples: 1992040020. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 02:20:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:20:19,942][06909] Updated weights for policy 0, policy_version 127513 (0.0029) [2024-06-28 02:20:23,667][06909] Updated weights for policy 0, policy_version 127523 (0.0041) [2024-06-28 02:20:23,850][06674] Fps is (10 sec: 44235.0, 60 sec: 43963.5, 300 sec: 44097.9). Total num frames: 2089336832. Throughput: 0: 44166.7. Samples: 1992302440. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 02:20:23,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:20:27,322][06909] Updated weights for policy 0, policy_version 127533 (0.0031) [2024-06-28 02:20:28,850][06674] Fps is (10 sec: 45874.6, 60 sec: 44509.9, 300 sec: 44153.8). Total num frames: 2089566208. Throughput: 0: 44095.4. Samples: 1992440660. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 02:20:28,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:20:31,210][06909] Updated weights for policy 0, policy_version 127543 (0.0028) [2024-06-28 02:20:33,850][06674] Fps is (10 sec: 40961.8, 60 sec: 43963.9, 300 sec: 44042.4). Total num frames: 2089746432. Throughput: 0: 44030.7. Samples: 1992696520. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 02:20:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:20:34,587][06909] Updated weights for policy 0, policy_version 127553 (0.0026) [2024-06-28 02:20:38,608][06909] Updated weights for policy 0, policy_version 127563 (0.0023) [2024-06-28 02:20:38,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 2090008576. Throughput: 0: 43888.9. Samples: 1992964300. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 02:20:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:20:42,160][06909] Updated weights for policy 0, policy_version 127573 (0.0037) [2024-06-28 02:20:43,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2090221568. Throughput: 0: 43907.2. Samples: 1993096940. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 02:20:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:20:45,923][06909] Updated weights for policy 0, policy_version 127583 (0.0032) [2024-06-28 02:20:48,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2090418176. Throughput: 0: 44014.2. Samples: 1993356880. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-28 02:20:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:20:49,560][06909] Updated weights for policy 0, policy_version 127593 (0.0041) [2024-06-28 02:20:53,037][06909] Updated weights for policy 0, policy_version 127603 (0.0026) [2024-06-28 02:20:53,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2090663936. Throughput: 0: 44032.5. Samples: 1993624680. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-28 02:20:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:20:57,022][06909] Updated weights for policy 0, policy_version 127613 (0.0031) [2024-06-28 02:20:58,850][06674] Fps is (10 sec: 47513.2, 60 sec: 44509.8, 300 sec: 44209.0). Total num frames: 2090893312. Throughput: 0: 44146.6. Samples: 1993765820. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-28 02:20:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:21:00,436][06909] Updated weights for policy 0, policy_version 127623 (0.0034) [2024-06-28 02:21:03,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44509.9, 300 sec: 44209.1). Total num frames: 2091106304. Throughput: 0: 44166.2. Samples: 1994027500. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-28 02:21:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:21:04,613][06909] Updated weights for policy 0, policy_version 127633 (0.0027) [2024-06-28 02:21:08,325][06909] Updated weights for policy 0, policy_version 127643 (0.0035) [2024-06-28 02:21:08,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.9, 300 sec: 44209.2). Total num frames: 2091335680. Throughput: 0: 44251.1. Samples: 1994293720. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-28 02:21:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:21:11,854][06909] Updated weights for policy 0, policy_version 127653 (0.0032) [2024-06-28 02:21:13,852][06674] Fps is (10 sec: 44227.6, 60 sec: 44235.3, 300 sec: 44153.2). Total num frames: 2091548672. Throughput: 0: 43995.4. Samples: 1994420540. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-28 02:21:13,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:21:15,635][06909] Updated weights for policy 0, policy_version 127663 (0.0034) [2024-06-28 02:21:18,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2091745280. Throughput: 0: 44153.8. Samples: 1994683440. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-28 02:21:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 02:21:19,322][06909] Updated weights for policy 0, policy_version 127673 (0.0029) [2024-06-28 02:21:21,484][06887] Signal inference workers to stop experience collection... (28300 times) [2024-06-28 02:21:21,485][06887] Signal inference workers to resume experience collection... (28300 times) [2024-06-28 02:21:21,511][06909] InferenceWorker_p0-w0: stopping experience collection (28300 times) [2024-06-28 02:21:21,511][06909] InferenceWorker_p0-w0: resuming experience collection (28300 times) [2024-06-28 02:21:23,029][06909] Updated weights for policy 0, policy_version 127683 (0.0039) [2024-06-28 02:21:23,850][06674] Fps is (10 sec: 44245.7, 60 sec: 44237.1, 300 sec: 44209.0). Total num frames: 2091991040. Throughput: 0: 44101.3. Samples: 1994948860. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-28 02:21:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:21:26,715][06909] Updated weights for policy 0, policy_version 127693 (0.0029) [2024-06-28 02:21:28,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2092204032. Throughput: 0: 44108.3. Samples: 1995081820. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-28 02:21:28,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:21:30,426][06909] Updated weights for policy 0, policy_version 127703 (0.0031) [2024-06-28 02:21:33,850][06674] Fps is (10 sec: 42598.7, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 2092417024. Throughput: 0: 44128.4. Samples: 1995342660. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-28 02:21:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:21:34,093][06909] Updated weights for policy 0, policy_version 127713 (0.0029) [2024-06-28 02:21:38,092][06909] Updated weights for policy 0, policy_version 127723 (0.0032) [2024-06-28 02:21:38,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2092646400. Throughput: 0: 44113.3. Samples: 1995609780. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-28 02:21:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:21:41,625][06909] Updated weights for policy 0, policy_version 127733 (0.0031) [2024-06-28 02:21:43,850][06674] Fps is (10 sec: 45873.8, 60 sec: 44236.6, 300 sec: 44098.2). Total num frames: 2092875776. Throughput: 0: 43844.2. Samples: 1995738820. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-28 02:21:43,851][06674] Avg episode reward: [(0, '0.408')] [2024-06-28 02:21:45,645][06909] Updated weights for policy 0, policy_version 127743 (0.0032) [2024-06-28 02:21:48,850][06674] Fps is (10 sec: 42598.9, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2093072384. Throughput: 0: 43901.4. Samples: 1996003060. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-28 02:21:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:21:48,936][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000127752_2093088768.pth... [2024-06-28 02:21:48,982][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000127107_2082521088.pth [2024-06-28 02:21:49,261][06909] Updated weights for policy 0, policy_version 127753 (0.0032) [2024-06-28 02:21:53,003][06909] Updated weights for policy 0, policy_version 127763 (0.0035) [2024-06-28 02:21:53,850][06674] Fps is (10 sec: 42599.3, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2093301760. Throughput: 0: 43699.9. Samples: 1996260220. Policy #0 lag: (min: 0.0, avg: 10.5, max: 19.0) [2024-06-28 02:21:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:21:56,582][06909] Updated weights for policy 0, policy_version 127773 (0.0028) [2024-06-28 02:21:58,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 2093531136. Throughput: 0: 43860.2. Samples: 1996394160. Policy #0 lag: (min: 0.0, avg: 10.5, max: 19.0) [2024-06-28 02:21:58,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 02:22:00,178][06909] Updated weights for policy 0, policy_version 127783 (0.0025) [2024-06-28 02:22:03,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2093727744. Throughput: 0: 43904.4. Samples: 1996659140. Policy #0 lag: (min: 0.0, avg: 10.5, max: 19.0) [2024-06-28 02:22:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:22:04,051][06909] Updated weights for policy 0, policy_version 127793 (0.0036) [2024-06-28 02:22:07,479][06909] Updated weights for policy 0, policy_version 127803 (0.0020) [2024-06-28 02:22:08,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2093957120. Throughput: 0: 44016.4. Samples: 1996929600. Policy #0 lag: (min: 0.0, avg: 10.5, max: 19.0) [2024-06-28 02:22:08,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:22:11,321][06909] Updated weights for policy 0, policy_version 127813 (0.0035) [2024-06-28 02:22:13,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43692.2, 300 sec: 44042.4). Total num frames: 2094170112. Throughput: 0: 43794.3. Samples: 1997052560. Policy #0 lag: (min: 0.0, avg: 10.5, max: 19.0) [2024-06-28 02:22:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:22:15,070][06909] Updated weights for policy 0, policy_version 127823 (0.0027) [2024-06-28 02:22:18,670][06909] Updated weights for policy 0, policy_version 127833 (0.0032) [2024-06-28 02:22:18,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44509.9, 300 sec: 44097.9). Total num frames: 2094415872. Throughput: 0: 43980.0. Samples: 1997321760. Policy #0 lag: (min: 0.0, avg: 10.5, max: 19.0) [2024-06-28 02:22:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:22:22,833][06909] Updated weights for policy 0, policy_version 127843 (0.0027) [2024-06-28 02:22:23,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43417.7, 300 sec: 44042.4). Total num frames: 2094596096. Throughput: 0: 43828.1. Samples: 1997582040. Policy #0 lag: (min: 0.0, avg: 10.5, max: 19.0) [2024-06-28 02:22:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:22:26,403][06909] Updated weights for policy 0, policy_version 127853 (0.0026) [2024-06-28 02:22:28,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2094858240. Throughput: 0: 43879.7. Samples: 1997713400. Policy #0 lag: (min: 0.0, avg: 10.5, max: 19.0) [2024-06-28 02:22:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:22:29,952][06909] Updated weights for policy 0, policy_version 127863 (0.0037) [2024-06-28 02:22:33,674][06909] Updated weights for policy 0, policy_version 127873 (0.0039) [2024-06-28 02:22:33,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2095071232. Throughput: 0: 44039.5. Samples: 1997984840. Policy #0 lag: (min: 0.0, avg: 10.5, max: 19.0) [2024-06-28 02:22:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:22:37,251][06909] Updated weights for policy 0, policy_version 127883 (0.0029) [2024-06-28 02:22:38,850][06674] Fps is (10 sec: 39322.0, 60 sec: 43417.7, 300 sec: 43931.6). Total num frames: 2095251456. Throughput: 0: 44181.0. Samples: 1998248360. Policy #0 lag: (min: 0.0, avg: 10.5, max: 19.0) [2024-06-28 02:22:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:22:41,316][06909] Updated weights for policy 0, policy_version 127893 (0.0036) [2024-06-28 02:22:43,852][06674] Fps is (10 sec: 44227.5, 60 sec: 43962.4, 300 sec: 44097.7). Total num frames: 2095513600. Throughput: 0: 43946.0. Samples: 1998371820. Policy #0 lag: (min: 0.0, avg: 10.5, max: 19.0) [2024-06-28 02:22:43,852][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 02:22:44,532][06909] Updated weights for policy 0, policy_version 127903 (0.0040) [2024-06-28 02:22:48,590][06909] Updated weights for policy 0, policy_version 127913 (0.0028) [2024-06-28 02:22:48,850][06674] Fps is (10 sec: 47513.4, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2095726592. Throughput: 0: 44169.7. Samples: 1998646780. Policy #0 lag: (min: 0.0, avg: 10.5, max: 19.0) [2024-06-28 02:22:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:22:52,418][06909] Updated weights for policy 0, policy_version 127923 (0.0035) [2024-06-28 02:22:53,850][06674] Fps is (10 sec: 40968.5, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2095923200. Throughput: 0: 43903.6. Samples: 1998905260. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 02:22:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:22:54,926][06887] Signal inference workers to stop experience collection... (28350 times) [2024-06-28 02:22:54,980][06909] InferenceWorker_p0-w0: stopping experience collection (28350 times) [2024-06-28 02:22:55,044][06887] Signal inference workers to resume experience collection... (28350 times) [2024-06-28 02:22:55,044][06909] InferenceWorker_p0-w0: resuming experience collection (28350 times) [2024-06-28 02:22:56,073][06909] Updated weights for policy 0, policy_version 127933 (0.0035) [2024-06-28 02:22:58,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2096168960. Throughput: 0: 44056.4. Samples: 1999035100. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 02:22:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:23:00,142][06909] Updated weights for policy 0, policy_version 127943 (0.0036) [2024-06-28 02:23:03,530][06909] Updated weights for policy 0, policy_version 127953 (0.0024) [2024-06-28 02:23:03,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 2096381952. Throughput: 0: 44130.5. Samples: 1999307640. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 02:23:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:23:07,371][06909] Updated weights for policy 0, policy_version 127963 (0.0031) [2024-06-28 02:23:08,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2096578560. Throughput: 0: 44062.6. Samples: 1999564860. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 02:23:08,850][06674] Avg episode reward: [(0, '0.462')] [2024-06-28 02:23:10,916][06909] Updated weights for policy 0, policy_version 127973 (0.0037) [2024-06-28 02:23:13,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2096824320. Throughput: 0: 43950.2. Samples: 1999691160. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 02:23:13,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:23:14,589][06909] Updated weights for policy 0, policy_version 127983 (0.0033) [2024-06-28 02:23:18,594][06909] Updated weights for policy 0, policy_version 127993 (0.0029) [2024-06-28 02:23:18,850][06674] Fps is (10 sec: 47513.9, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2097053696. Throughput: 0: 44015.1. Samples: 1999965520. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 02:23:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:23:22,078][06909] Updated weights for policy 0, policy_version 128003 (0.0032) [2024-06-28 02:23:23,850][06674] Fps is (10 sec: 40960.7, 60 sec: 43963.8, 300 sec: 43931.4). Total num frames: 2097233920. Throughput: 0: 44005.0. Samples: 2000228580. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 02:23:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:23:25,829][06909] Updated weights for policy 0, policy_version 128013 (0.0045) [2024-06-28 02:23:28,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2097496064. Throughput: 0: 44185.5. Samples: 2000360080. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 02:23:28,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:23:30,103][06909] Updated weights for policy 0, policy_version 128023 (0.0032) [2024-06-28 02:23:33,203][06909] Updated weights for policy 0, policy_version 128033 (0.0030) [2024-06-28 02:23:33,852][06674] Fps is (10 sec: 45864.9, 60 sec: 43689.1, 300 sec: 43931.0). Total num frames: 2097692672. Throughput: 0: 44068.1. Samples: 2000629940. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 02:23:33,861][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:23:37,235][06909] Updated weights for policy 0, policy_version 128043 (0.0030) [2024-06-28 02:23:38,850][06674] Fps is (10 sec: 40960.1, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2097905664. Throughput: 0: 44223.5. Samples: 2000895320. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 02:23:38,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:23:40,869][06909] Updated weights for policy 0, policy_version 128053 (0.0021) [2024-06-28 02:23:43,850][06674] Fps is (10 sec: 47524.5, 60 sec: 44238.4, 300 sec: 44153.5). Total num frames: 2098167808. Throughput: 0: 44098.3. Samples: 2001019520. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 02:23:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:23:44,414][06909] Updated weights for policy 0, policy_version 128063 (0.0026) [2024-06-28 02:23:48,101][06909] Updated weights for policy 0, policy_version 128073 (0.0033) [2024-06-28 02:23:48,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2098380800. Throughput: 0: 44078.7. Samples: 2001291180. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 02:23:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:23:48,859][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000128075_2098380800.pth... [2024-06-28 02:23:48,907][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000127429_2087796736.pth [2024-06-28 02:23:52,143][06909] Updated weights for policy 0, policy_version 128083 (0.0022) [2024-06-28 02:23:53,850][06674] Fps is (10 sec: 39320.9, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 2098561024. Throughput: 0: 44412.0. Samples: 2001563400. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 02:23:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 02:23:55,619][06909] Updated weights for policy 0, policy_version 128093 (0.0036) [2024-06-28 02:23:58,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.7, 300 sec: 44098.3). Total num frames: 2098823168. Throughput: 0: 44295.6. Samples: 2001684460. Policy #0 lag: (min: 0.0, avg: 8.6, max: 24.0) [2024-06-28 02:23:58,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 02:23:59,288][06909] Updated weights for policy 0, policy_version 128103 (0.0027) [2024-06-28 02:24:02,853][06909] Updated weights for policy 0, policy_version 128113 (0.0025) [2024-06-28 02:24:03,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2099036160. Throughput: 0: 44392.4. Samples: 2001963180. Policy #0 lag: (min: 0.0, avg: 8.6, max: 24.0) [2024-06-28 02:24:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:24:03,921][06887] Signal inference workers to stop experience collection... (28400 times) [2024-06-28 02:24:03,948][06909] InferenceWorker_p0-w0: stopping experience collection (28400 times) [2024-06-28 02:24:04,038][06887] Signal inference workers to resume experience collection... (28400 times) [2024-06-28 02:24:04,038][06909] InferenceWorker_p0-w0: resuming experience collection (28400 times) [2024-06-28 02:24:07,169][06909] Updated weights for policy 0, policy_version 128123 (0.0020) [2024-06-28 02:24:08,850][06674] Fps is (10 sec: 40960.1, 60 sec: 44236.8, 300 sec: 43987.8). Total num frames: 2099232768. Throughput: 0: 44416.3. Samples: 2002227320. Policy #0 lag: (min: 0.0, avg: 8.6, max: 24.0) [2024-06-28 02:24:08,852][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 02:24:10,494][06909] Updated weights for policy 0, policy_version 128133 (0.0028) [2024-06-28 02:24:13,852][06674] Fps is (10 sec: 44227.9, 60 sec: 44235.4, 300 sec: 44097.7). Total num frames: 2099478528. Throughput: 0: 44130.5. Samples: 2002346040. Policy #0 lag: (min: 0.0, avg: 8.6, max: 24.0) [2024-06-28 02:24:13,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:24:14,372][06909] Updated weights for policy 0, policy_version 128143 (0.0036) [2024-06-28 02:24:17,844][06909] Updated weights for policy 0, policy_version 128153 (0.0042) [2024-06-28 02:24:18,850][06674] Fps is (10 sec: 47513.7, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2099707904. Throughput: 0: 44203.9. Samples: 2002619020. Policy #0 lag: (min: 0.0, avg: 8.6, max: 24.0) [2024-06-28 02:24:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:24:21,540][06909] Updated weights for policy 0, policy_version 128163 (0.0025) [2024-06-28 02:24:23,850][06674] Fps is (10 sec: 40968.7, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2099888128. Throughput: 0: 44189.9. Samples: 2002883860. Policy #0 lag: (min: 0.0, avg: 8.6, max: 24.0) [2024-06-28 02:24:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:24:25,344][06909] Updated weights for policy 0, policy_version 128173 (0.0035) [2024-06-28 02:24:28,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2100133888. Throughput: 0: 44268.8. Samples: 2003011620. Policy #0 lag: (min: 0.0, avg: 8.6, max: 24.0) [2024-06-28 02:24:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:24:29,275][06909] Updated weights for policy 0, policy_version 128183 (0.0026) [2024-06-28 02:24:32,741][06909] Updated weights for policy 0, policy_version 128193 (0.0040) [2024-06-28 02:24:33,850][06674] Fps is (10 sec: 49151.7, 60 sec: 44784.6, 300 sec: 44098.0). Total num frames: 2100379648. Throughput: 0: 44151.2. Samples: 2003277980. Policy #0 lag: (min: 0.0, avg: 8.6, max: 24.0) [2024-06-28 02:24:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:24:36,388][06909] Updated weights for policy 0, policy_version 128203 (0.0029) [2024-06-28 02:24:38,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44509.9, 300 sec: 44097.9). Total num frames: 2100576256. Throughput: 0: 44092.0. Samples: 2003547540. Policy #0 lag: (min: 0.0, avg: 8.6, max: 24.0) [2024-06-28 02:24:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:24:40,221][06909] Updated weights for policy 0, policy_version 128213 (0.0038) [2024-06-28 02:24:43,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.6, 300 sec: 44098.0). Total num frames: 2100789248. Throughput: 0: 44077.0. Samples: 2003667920. Policy #0 lag: (min: 0.0, avg: 8.6, max: 24.0) [2024-06-28 02:24:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:24:44,341][06909] Updated weights for policy 0, policy_version 128223 (0.0031) [2024-06-28 02:24:47,614][06909] Updated weights for policy 0, policy_version 128233 (0.0035) [2024-06-28 02:24:48,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2101035008. Throughput: 0: 43985.9. Samples: 2003942540. Policy #0 lag: (min: 0.0, avg: 8.6, max: 24.0) [2024-06-28 02:24:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:24:51,677][06909] Updated weights for policy 0, policy_version 128243 (0.0037) [2024-06-28 02:24:53,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2101198848. Throughput: 0: 44000.8. Samples: 2004207360. Policy #0 lag: (min: 0.0, avg: 8.6, max: 24.0) [2024-06-28 02:24:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 02:24:55,009][06909] Updated weights for policy 0, policy_version 128253 (0.0026) [2024-06-28 02:24:58,850][06674] Fps is (10 sec: 40959.3, 60 sec: 43690.6, 300 sec: 44097.9). Total num frames: 2101444608. Throughput: 0: 44064.1. Samples: 2004328840. Policy #0 lag: (min: 0.0, avg: 13.0, max: 23.0) [2024-06-28 02:24:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:24:58,943][06909] Updated weights for policy 0, policy_version 128263 (0.0026) [2024-06-28 02:25:02,351][06909] Updated weights for policy 0, policy_version 128273 (0.0032) [2024-06-28 02:25:03,850][06674] Fps is (10 sec: 47514.4, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2101673984. Throughput: 0: 43770.3. Samples: 2004588680. Policy #0 lag: (min: 0.0, avg: 13.0, max: 23.0) [2024-06-28 02:25:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:25:06,249][06909] Updated weights for policy 0, policy_version 128283 (0.0033) [2024-06-28 02:25:08,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2101870592. Throughput: 0: 43975.0. Samples: 2004862740. Policy #0 lag: (min: 0.0, avg: 13.0, max: 23.0) [2024-06-28 02:25:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:25:09,859][06909] Updated weights for policy 0, policy_version 128293 (0.0025) [2024-06-28 02:25:13,850][06674] Fps is (10 sec: 42597.7, 60 sec: 43692.1, 300 sec: 44042.4). Total num frames: 2102099968. Throughput: 0: 43786.1. Samples: 2004982000. Policy #0 lag: (min: 0.0, avg: 13.0, max: 23.0) [2024-06-28 02:25:13,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:25:13,975][06909] Updated weights for policy 0, policy_version 128303 (0.0033) [2024-06-28 02:25:17,851][06909] Updated weights for policy 0, policy_version 128313 (0.0028) [2024-06-28 02:25:18,852][06674] Fps is (10 sec: 47504.2, 60 sec: 43962.2, 300 sec: 44097.7). Total num frames: 2102345728. Throughput: 0: 43796.2. Samples: 2005248900. Policy #0 lag: (min: 0.0, avg: 13.0, max: 23.0) [2024-06-28 02:25:18,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:25:21,846][06909] Updated weights for policy 0, policy_version 128323 (0.0041) [2024-06-28 02:25:23,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43963.7, 300 sec: 43931.4). Total num frames: 2102525952. Throughput: 0: 43673.9. Samples: 2005512860. Policy #0 lag: (min: 0.0, avg: 13.0, max: 23.0) [2024-06-28 02:25:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:25:25,153][06909] Updated weights for policy 0, policy_version 128333 (0.0038) [2024-06-28 02:25:26,169][06887] Signal inference workers to stop experience collection... (28450 times) [2024-06-28 02:25:26,170][06887] Signal inference workers to resume experience collection... (28450 times) [2024-06-28 02:25:26,185][06909] InferenceWorker_p0-w0: stopping experience collection (28450 times) [2024-06-28 02:25:26,185][06909] InferenceWorker_p0-w0: resuming experience collection (28450 times) [2024-06-28 02:25:28,851][06674] Fps is (10 sec: 40961.9, 60 sec: 43689.5, 300 sec: 44097.7). Total num frames: 2102755328. Throughput: 0: 43782.9. Samples: 2005638220. Policy #0 lag: (min: 0.0, avg: 13.0, max: 23.0) [2024-06-28 02:25:28,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:25:29,059][06909] Updated weights for policy 0, policy_version 128343 (0.0026) [2024-06-28 02:25:32,307][06909] Updated weights for policy 0, policy_version 128353 (0.0027) [2024-06-28 02:25:33,850][06674] Fps is (10 sec: 47512.7, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2103001088. Throughput: 0: 43664.7. Samples: 2005907460. Policy #0 lag: (min: 0.0, avg: 13.0, max: 23.0) [2024-06-28 02:25:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:25:36,133][06909] Updated weights for policy 0, policy_version 128363 (0.0028) [2024-06-28 02:25:38,850][06674] Fps is (10 sec: 44244.1, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2103197696. Throughput: 0: 43893.0. Samples: 2006182540. Policy #0 lag: (min: 0.0, avg: 13.0, max: 23.0) [2024-06-28 02:25:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:25:39,822][06909] Updated weights for policy 0, policy_version 128373 (0.0034) [2024-06-28 02:25:43,289][06909] Updated weights for policy 0, policy_version 128383 (0.0035) [2024-06-28 02:25:43,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2103427072. Throughput: 0: 43988.2. Samples: 2006308300. Policy #0 lag: (min: 0.0, avg: 13.0, max: 23.0) [2024-06-28 02:25:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:25:46,920][06909] Updated weights for policy 0, policy_version 128393 (0.0036) [2024-06-28 02:25:48,850][06674] Fps is (10 sec: 47512.9, 60 sec: 43963.6, 300 sec: 44097.9). Total num frames: 2103672832. Throughput: 0: 44168.7. Samples: 2006576280. Policy #0 lag: (min: 0.0, avg: 13.0, max: 23.0) [2024-06-28 02:25:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 02:25:49,016][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000128399_2103689216.pth... [2024-06-28 02:25:49,062][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000127752_2093088768.pth [2024-06-28 02:25:51,032][06909] Updated weights for policy 0, policy_version 128403 (0.0022) [2024-06-28 02:25:53,852][06674] Fps is (10 sec: 44227.7, 60 sec: 44508.4, 300 sec: 43986.6). Total num frames: 2103869440. Throughput: 0: 43853.2. Samples: 2006836220. Policy #0 lag: (min: 0.0, avg: 13.0, max: 23.0) [2024-06-28 02:25:53,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:25:54,758][06909] Updated weights for policy 0, policy_version 128413 (0.0032) [2024-06-28 02:25:58,690][06909] Updated weights for policy 0, policy_version 128423 (0.0025) [2024-06-28 02:25:58,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2104082432. Throughput: 0: 43980.9. Samples: 2006961140. Policy #0 lag: (min: 0.0, avg: 13.0, max: 23.0) [2024-06-28 02:25:58,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:26:02,110][06909] Updated weights for policy 0, policy_version 128433 (0.0030) [2024-06-28 02:26:03,850][06674] Fps is (10 sec: 45884.4, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2104328192. Throughput: 0: 44109.6. Samples: 2007233740. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-28 02:26:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 02:26:05,985][06909] Updated weights for policy 0, policy_version 128443 (0.0038) [2024-06-28 02:26:08,850][06674] Fps is (10 sec: 44237.6, 60 sec: 44236.9, 300 sec: 43987.2). Total num frames: 2104524800. Throughput: 0: 44169.3. Samples: 2007500480. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-28 02:26:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:26:09,417][06909] Updated weights for policy 0, policy_version 128453 (0.0028) [2024-06-28 02:26:13,151][06909] Updated weights for policy 0, policy_version 128463 (0.0035) [2024-06-28 02:26:13,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2104737792. Throughput: 0: 44126.9. Samples: 2007623860. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-28 02:26:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:26:17,497][06909] Updated weights for policy 0, policy_version 128473 (0.0029) [2024-06-28 02:26:18,850][06674] Fps is (10 sec: 47512.7, 60 sec: 44238.2, 300 sec: 44097.9). Total num frames: 2104999936. Throughput: 0: 44195.6. Samples: 2007896260. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-28 02:26:18,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:26:20,778][06909] Updated weights for policy 0, policy_version 128483 (0.0045) [2024-06-28 02:26:23,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 2105196544. Throughput: 0: 43842.6. Samples: 2008155460. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-28 02:26:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:26:25,067][06909] Updated weights for policy 0, policy_version 128493 (0.0029) [2024-06-28 02:26:26,258][06887] Signal inference workers to stop experience collection... (28500 times) [2024-06-28 02:26:26,278][06909] InferenceWorker_p0-w0: stopping experience collection (28500 times) [2024-06-28 02:26:26,374][06887] Signal inference workers to resume experience collection... (28500 times) [2024-06-28 02:26:26,374][06909] InferenceWorker_p0-w0: resuming experience collection (28500 times) [2024-06-28 02:26:27,954][06909] Updated weights for policy 0, policy_version 128503 (0.0034) [2024-06-28 02:26:28,850][06674] Fps is (10 sec: 39322.0, 60 sec: 43964.9, 300 sec: 43986.9). Total num frames: 2105393152. Throughput: 0: 43788.0. Samples: 2008278760. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-28 02:26:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:26:32,589][06909] Updated weights for policy 0, policy_version 128513 (0.0033) [2024-06-28 02:26:33,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2105638912. Throughput: 0: 43947.6. Samples: 2008553920. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-28 02:26:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 02:26:36,117][06909] Updated weights for policy 0, policy_version 128523 (0.0033) [2024-06-28 02:26:38,850][06674] Fps is (10 sec: 47513.2, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 2105868288. Throughput: 0: 43998.3. Samples: 2008816060. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-28 02:26:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:26:39,741][06909] Updated weights for policy 0, policy_version 128533 (0.0031) [2024-06-28 02:26:43,339][06909] Updated weights for policy 0, policy_version 128543 (0.0031) [2024-06-28 02:26:43,850][06674] Fps is (10 sec: 44236.1, 60 sec: 44236.6, 300 sec: 44097.9). Total num frames: 2106081280. Throughput: 0: 44032.8. Samples: 2008942620. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-28 02:26:43,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:26:47,332][06909] Updated weights for policy 0, policy_version 128553 (0.0031) [2024-06-28 02:26:48,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2106327040. Throughput: 0: 44123.5. Samples: 2009219300. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-28 02:26:48,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:26:50,658][06909] Updated weights for policy 0, policy_version 128563 (0.0033) [2024-06-28 02:26:53,850][06674] Fps is (10 sec: 44237.3, 60 sec: 44238.2, 300 sec: 44042.4). Total num frames: 2106523648. Throughput: 0: 43858.1. Samples: 2009474100. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-28 02:26:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:26:54,570][06909] Updated weights for policy 0, policy_version 128573 (0.0035) [2024-06-28 02:26:58,446][06909] Updated weights for policy 0, policy_version 128583 (0.0041) [2024-06-28 02:26:58,850][06674] Fps is (10 sec: 39321.9, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2106720256. Throughput: 0: 43860.0. Samples: 2009597560. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-28 02:26:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 02:27:02,088][06909] Updated weights for policy 0, policy_version 128593 (0.0034) [2024-06-28 02:27:03,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2106982400. Throughput: 0: 43971.2. Samples: 2009874960. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 02:27:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:27:06,103][06909] Updated weights for policy 0, policy_version 128603 (0.0033) [2024-06-28 02:27:08,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2107162624. Throughput: 0: 44003.7. Samples: 2010135620. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 02:27:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:27:09,475][06909] Updated weights for policy 0, policy_version 128613 (0.0028) [2024-06-28 02:27:13,546][06909] Updated weights for policy 0, policy_version 128623 (0.0032) [2024-06-28 02:27:13,850][06674] Fps is (10 sec: 39321.6, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 2107375616. Throughput: 0: 44037.3. Samples: 2010260440. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 02:27:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:27:16,813][06909] Updated weights for policy 0, policy_version 128633 (0.0036) [2024-06-28 02:27:18,850][06674] Fps is (10 sec: 47513.2, 60 sec: 43963.8, 300 sec: 44209.0). Total num frames: 2107637760. Throughput: 0: 43951.1. Samples: 2010531720. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 02:27:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:27:20,681][06909] Updated weights for policy 0, policy_version 128643 (0.0034) [2024-06-28 02:27:23,332][06887] Signal inference workers to stop experience collection... (28550 times) [2024-06-28 02:27:23,333][06887] Signal inference workers to resume experience collection... (28550 times) [2024-06-28 02:27:23,351][06909] InferenceWorker_p0-w0: stopping experience collection (28550 times) [2024-06-28 02:27:23,383][06909] InferenceWorker_p0-w0: resuming experience collection (28550 times) [2024-06-28 02:27:23,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2107834368. Throughput: 0: 44034.3. Samples: 2010797600. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 02:27:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:27:24,146][06909] Updated weights for policy 0, policy_version 128653 (0.0029) [2024-06-28 02:27:28,233][06909] Updated weights for policy 0, policy_version 128663 (0.0036) [2024-06-28 02:27:28,856][06674] Fps is (10 sec: 40935.6, 60 sec: 44232.4, 300 sec: 43986.0). Total num frames: 2108047360. Throughput: 0: 44069.8. Samples: 2010926020. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 02:27:28,856][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:27:31,922][06909] Updated weights for policy 0, policy_version 128673 (0.0039) [2024-06-28 02:27:33,850][06674] Fps is (10 sec: 45872.8, 60 sec: 44236.4, 300 sec: 44208.9). Total num frames: 2108293120. Throughput: 0: 43839.1. Samples: 2011192080. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 02:27:33,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:27:35,410][06909] Updated weights for policy 0, policy_version 128683 (0.0030) [2024-06-28 02:27:38,850][06674] Fps is (10 sec: 44263.5, 60 sec: 43690.7, 300 sec: 43987.2). Total num frames: 2108489728. Throughput: 0: 44063.6. Samples: 2011456960. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 02:27:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:27:39,140][06909] Updated weights for policy 0, policy_version 128693 (0.0034) [2024-06-28 02:27:42,915][06909] Updated weights for policy 0, policy_version 128703 (0.0032) [2024-06-28 02:27:43,850][06674] Fps is (10 sec: 40962.0, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2108702720. Throughput: 0: 44339.5. Samples: 2011592840. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 02:27:43,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:27:46,563][06909] Updated weights for policy 0, policy_version 128713 (0.0031) [2024-06-28 02:27:48,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 2108948480. Throughput: 0: 43881.8. Samples: 2011849640. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 02:27:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:27:48,856][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000128720_2108948480.pth... [2024-06-28 02:27:48,913][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000128075_2098380800.pth [2024-06-28 02:27:50,378][06909] Updated weights for policy 0, policy_version 128723 (0.0039) [2024-06-28 02:27:53,807][06909] Updated weights for policy 0, policy_version 128733 (0.0039) [2024-06-28 02:27:53,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2109161472. Throughput: 0: 44052.8. Samples: 2012118000. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 02:27:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:27:57,840][06909] Updated weights for policy 0, policy_version 128743 (0.0022) [2024-06-28 02:27:58,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2109358080. Throughput: 0: 44297.3. Samples: 2012253820. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 02:27:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 02:28:01,459][06909] Updated weights for policy 0, policy_version 128753 (0.0025) [2024-06-28 02:28:03,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.8, 300 sec: 44209.0). Total num frames: 2109620224. Throughput: 0: 44078.3. Samples: 2012515240. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 02:28:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:28:05,133][06909] Updated weights for policy 0, policy_version 128763 (0.0029) [2024-06-28 02:28:08,577][06909] Updated weights for policy 0, policy_version 128773 (0.0044) [2024-06-28 02:28:08,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2109816832. Throughput: 0: 44150.7. Samples: 2012784380. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 02:28:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:28:12,567][06909] Updated weights for policy 0, policy_version 128783 (0.0039) [2024-06-28 02:28:13,850][06674] Fps is (10 sec: 40960.0, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2110029824. Throughput: 0: 44218.4. Samples: 2012915580. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 02:28:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:28:16,362][06909] Updated weights for policy 0, policy_version 128793 (0.0023) [2024-06-28 02:28:18,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 2110259200. Throughput: 0: 44107.3. Samples: 2013176880. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 02:28:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:28:20,382][06909] Updated weights for policy 0, policy_version 128803 (0.0035) [2024-06-28 02:28:23,526][06909] Updated weights for policy 0, policy_version 128813 (0.0034) [2024-06-28 02:28:23,850][06674] Fps is (10 sec: 45874.6, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2110488576. Throughput: 0: 44218.1. Samples: 2013446780. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 02:28:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:28:27,719][06909] Updated weights for policy 0, policy_version 128823 (0.0027) [2024-06-28 02:28:28,852][06674] Fps is (10 sec: 42589.3, 60 sec: 43966.6, 300 sec: 44042.4). Total num frames: 2110685184. Throughput: 0: 44114.0. Samples: 2013578060. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 02:28:28,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:28:30,796][06909] Updated weights for policy 0, policy_version 128833 (0.0031) [2024-06-28 02:28:33,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43691.1, 300 sec: 44098.0). Total num frames: 2110914560. Throughput: 0: 44111.2. Samples: 2013834640. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 02:28:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:28:34,969][06909] Updated weights for policy 0, policy_version 128843 (0.0048) [2024-06-28 02:28:38,301][06909] Updated weights for policy 0, policy_version 128853 (0.0029) [2024-06-28 02:28:38,850][06674] Fps is (10 sec: 47523.2, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 2111160320. Throughput: 0: 44177.3. Samples: 2014105980. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 02:28:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:28:42,196][06909] Updated weights for policy 0, policy_version 128863 (0.0035) [2024-06-28 02:28:43,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2111356928. Throughput: 0: 44183.6. Samples: 2014242080. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 02:28:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:28:45,671][06909] Updated weights for policy 0, policy_version 128873 (0.0047) [2024-06-28 02:28:48,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43690.6, 300 sec: 44097.9). Total num frames: 2111569920. Throughput: 0: 44036.3. Samples: 2014496880. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 02:28:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:28:50,127][06909] Updated weights for policy 0, policy_version 128883 (0.0032) [2024-06-28 02:28:51,908][06887] Signal inference workers to stop experience collection... (28600 times) [2024-06-28 02:28:51,908][06887] Signal inference workers to resume experience collection... (28600 times) [2024-06-28 02:28:51,926][06909] InferenceWorker_p0-w0: stopping experience collection (28600 times) [2024-06-28 02:28:51,927][06909] InferenceWorker_p0-w0: resuming experience collection (28600 times) [2024-06-28 02:28:53,326][06909] Updated weights for policy 0, policy_version 128893 (0.0042) [2024-06-28 02:28:53,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2111815680. Throughput: 0: 44030.7. Samples: 2014765760. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 02:28:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:28:57,682][06909] Updated weights for policy 0, policy_version 128903 (0.0031) [2024-06-28 02:28:58,850][06674] Fps is (10 sec: 44237.5, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 2112012288. Throughput: 0: 43994.7. Samples: 2014895340. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 02:28:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:29:00,642][06909] Updated weights for policy 0, policy_version 128913 (0.0030) [2024-06-28 02:29:03,850][06674] Fps is (10 sec: 39321.7, 60 sec: 43144.5, 300 sec: 43986.9). Total num frames: 2112208896. Throughput: 0: 43897.3. Samples: 2015152260. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 02:29:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:29:05,155][06909] Updated weights for policy 0, policy_version 128923 (0.0038) [2024-06-28 02:29:07,816][06909] Updated weights for policy 0, policy_version 128933 (0.0032) [2024-06-28 02:29:08,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.8, 300 sec: 44042.7). Total num frames: 2112471040. Throughput: 0: 43864.1. Samples: 2015420660. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 02:29:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:29:12,353][06909] Updated weights for policy 0, policy_version 128943 (0.0031) [2024-06-28 02:29:13,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43963.6, 300 sec: 43931.3). Total num frames: 2112667648. Throughput: 0: 44027.3. Samples: 2015559200. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 02:29:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:29:15,631][06909] Updated weights for policy 0, policy_version 128953 (0.0032) [2024-06-28 02:29:18,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2112880640. Throughput: 0: 43994.1. Samples: 2015814380. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 02:29:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 02:29:20,058][06909] Updated weights for policy 0, policy_version 128963 (0.0039) [2024-06-28 02:29:22,955][06909] Updated weights for policy 0, policy_version 128973 (0.0035) [2024-06-28 02:29:23,850][06674] Fps is (10 sec: 47513.9, 60 sec: 44236.9, 300 sec: 44097.9). Total num frames: 2113142784. Throughput: 0: 43896.0. Samples: 2016081300. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 02:29:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:29:27,574][06909] Updated weights for policy 0, policy_version 128983 (0.0046) [2024-06-28 02:29:28,850][06674] Fps is (10 sec: 45875.7, 60 sec: 44238.4, 300 sec: 43931.3). Total num frames: 2113339392. Throughput: 0: 44012.5. Samples: 2016222640. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 02:29:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 02:29:30,396][06909] Updated weights for policy 0, policy_version 128993 (0.0036) [2024-06-28 02:29:33,850][06674] Fps is (10 sec: 39321.7, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 2113536000. Throughput: 0: 44035.7. Samples: 2016478480. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 02:29:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:29:34,992][06909] Updated weights for policy 0, policy_version 129003 (0.0030) [2024-06-28 02:29:37,576][06909] Updated weights for policy 0, policy_version 129013 (0.0034) [2024-06-28 02:29:38,850][06674] Fps is (10 sec: 47513.1, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2113814528. Throughput: 0: 43896.8. Samples: 2016741120. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 02:29:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 02:29:42,327][06909] Updated weights for policy 0, policy_version 129023 (0.0024) [2024-06-28 02:29:43,850][06674] Fps is (10 sec: 49151.5, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 2114027520. Throughput: 0: 44356.3. Samples: 2016891380. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 02:29:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:29:44,802][06909] Updated weights for policy 0, policy_version 129033 (0.0026) [2024-06-28 02:29:48,850][06674] Fps is (10 sec: 37683.2, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2114191360. Throughput: 0: 44308.3. Samples: 2017146140. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 02:29:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:29:48,868][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000129040_2114191360.pth... [2024-06-28 02:29:48,914][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000128399_2103689216.pth [2024-06-28 02:29:49,563][06909] Updated weights for policy 0, policy_version 129043 (0.0044) [2024-06-28 02:29:52,510][06909] Updated weights for policy 0, policy_version 129053 (0.0028) [2024-06-28 02:29:53,850][06674] Fps is (10 sec: 44237.5, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2114469888. Throughput: 0: 44079.6. Samples: 2017404240. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 02:29:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:29:56,926][06909] Updated weights for policy 0, policy_version 129063 (0.0034) [2024-06-28 02:29:58,850][06674] Fps is (10 sec: 49152.4, 60 sec: 44509.9, 300 sec: 44097.9). Total num frames: 2114682880. Throughput: 0: 44414.8. Samples: 2017557860. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 02:29:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:29:59,582][06887] Signal inference workers to stop experience collection... (28650 times) [2024-06-28 02:29:59,582][06887] Signal inference workers to resume experience collection... (28650 times) [2024-06-28 02:29:59,608][06909] InferenceWorker_p0-w0: stopping experience collection (28650 times) [2024-06-28 02:29:59,608][06909] InferenceWorker_p0-w0: resuming experience collection (28650 times) [2024-06-28 02:29:59,717][06909] Updated weights for policy 0, policy_version 129073 (0.0032) [2024-06-28 02:30:03,850][06674] Fps is (10 sec: 39320.9, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2114863104. Throughput: 0: 44435.1. Samples: 2017813960. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 02:30:03,854][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:30:04,162][06909] Updated weights for policy 0, policy_version 129083 (0.0031) [2024-06-28 02:30:07,170][06909] Updated weights for policy 0, policy_version 129093 (0.0034) [2024-06-28 02:30:08,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2115125248. Throughput: 0: 44310.3. Samples: 2018075260. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 02:30:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:30:12,248][06909] Updated weights for policy 0, policy_version 129103 (0.0025) [2024-06-28 02:30:13,850][06674] Fps is (10 sec: 47514.1, 60 sec: 44509.9, 300 sec: 44042.7). Total num frames: 2115338240. Throughput: 0: 44360.4. Samples: 2018218860. Policy #0 lag: (min: 0.0, avg: 13.1, max: 22.0) [2024-06-28 02:30:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:30:14,389][06909] Updated weights for policy 0, policy_version 129113 (0.0040) [2024-06-28 02:30:18,850][06674] Fps is (10 sec: 37683.2, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2115502080. Throughput: 0: 44307.6. Samples: 2018472320. Policy #0 lag: (min: 0.0, avg: 13.1, max: 22.0) [2024-06-28 02:30:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:30:19,579][06909] Updated weights for policy 0, policy_version 129123 (0.0034) [2024-06-28 02:30:21,781][06909] Updated weights for policy 0, policy_version 129133 (0.0042) [2024-06-28 02:30:23,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.8, 300 sec: 44209.3). Total num frames: 2115796992. Throughput: 0: 44199.2. Samples: 2018730080. Policy #0 lag: (min: 0.0, avg: 13.1, max: 22.0) [2024-06-28 02:30:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:30:26,766][06909] Updated weights for policy 0, policy_version 129143 (0.0037) [2024-06-28 02:30:28,850][06674] Fps is (10 sec: 49151.7, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2115993600. Throughput: 0: 44302.7. Samples: 2018885000. Policy #0 lag: (min: 0.0, avg: 13.1, max: 22.0) [2024-06-28 02:30:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:30:29,456][06909] Updated weights for policy 0, policy_version 129153 (0.0042) [2024-06-28 02:30:33,850][06674] Fps is (10 sec: 37683.3, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2116173824. Throughput: 0: 44337.9. Samples: 2019141340. Policy #0 lag: (min: 0.0, avg: 13.1, max: 22.0) [2024-06-28 02:30:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:30:34,098][06909] Updated weights for policy 0, policy_version 129163 (0.0031) [2024-06-28 02:30:36,897][06909] Updated weights for policy 0, policy_version 129173 (0.0027) [2024-06-28 02:30:38,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2116452352. Throughput: 0: 44288.7. Samples: 2019397240. Policy #0 lag: (min: 0.0, avg: 13.1, max: 22.0) [2024-06-28 02:30:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:30:41,695][06909] Updated weights for policy 0, policy_version 129183 (0.0038) [2024-06-28 02:30:43,850][06674] Fps is (10 sec: 47513.2, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2116648960. Throughput: 0: 44062.2. Samples: 2019540660. Policy #0 lag: (min: 0.0, avg: 13.1, max: 22.0) [2024-06-28 02:30:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:30:44,411][06909] Updated weights for policy 0, policy_version 129193 (0.0030) [2024-06-28 02:30:48,852][06674] Fps is (10 sec: 39313.7, 60 sec: 44235.3, 300 sec: 43986.9). Total num frames: 2116845568. Throughput: 0: 44075.0. Samples: 2019797420. Policy #0 lag: (min: 0.0, avg: 13.1, max: 22.0) [2024-06-28 02:30:48,852][06674] Avg episode reward: [(0, '0.418')] [2024-06-28 02:30:49,356][06909] Updated weights for policy 0, policy_version 129203 (0.0040) [2024-06-28 02:30:51,629][06909] Updated weights for policy 0, policy_version 129213 (0.0033) [2024-06-28 02:30:53,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2117107712. Throughput: 0: 43906.2. Samples: 2020051040. Policy #0 lag: (min: 0.0, avg: 13.1, max: 22.0) [2024-06-28 02:30:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:30:56,635][06909] Updated weights for policy 0, policy_version 129223 (0.0040) [2024-06-28 02:30:58,850][06674] Fps is (10 sec: 49162.3, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2117337088. Throughput: 0: 44119.1. Samples: 2020204220. Policy #0 lag: (min: 0.0, avg: 13.1, max: 22.0) [2024-06-28 02:30:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:30:58,987][06909] Updated weights for policy 0, policy_version 129233 (0.0038) [2024-06-28 02:31:03,765][06909] Updated weights for policy 0, policy_version 129243 (0.0032) [2024-06-28 02:31:03,850][06674] Fps is (10 sec: 40959.8, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 2117517312. Throughput: 0: 44288.0. Samples: 2020465280. Policy #0 lag: (min: 0.0, avg: 13.1, max: 22.0) [2024-06-28 02:31:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:31:06,664][06909] Updated weights for policy 0, policy_version 129253 (0.0039) [2024-06-28 02:31:08,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2117763072. Throughput: 0: 44326.6. Samples: 2020724780. Policy #0 lag: (min: 0.0, avg: 13.1, max: 22.0) [2024-06-28 02:31:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:31:10,902][06909] Updated weights for policy 0, policy_version 129263 (0.0029) [2024-06-28 02:31:13,020][06887] Signal inference workers to stop experience collection... (28700 times) [2024-06-28 02:31:13,022][06887] Signal inference workers to resume experience collection... (28700 times) [2024-06-28 02:31:13,037][06909] InferenceWorker_p0-w0: stopping experience collection (28700 times) [2024-06-28 02:31:13,037][06909] InferenceWorker_p0-w0: resuming experience collection (28700 times) [2024-06-28 02:31:13,850][06674] Fps is (10 sec: 47513.4, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2117992448. Throughput: 0: 44022.7. Samples: 2020866020. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 02:31:13,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:31:13,987][06909] Updated weights for policy 0, policy_version 129273 (0.0036) [2024-06-28 02:31:18,146][06909] Updated weights for policy 0, policy_version 129283 (0.0030) [2024-06-28 02:31:18,850][06674] Fps is (10 sec: 42598.7, 60 sec: 44782.9, 300 sec: 44042.4). Total num frames: 2118189056. Throughput: 0: 44146.2. Samples: 2021127920. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 02:31:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:31:21,188][06909] Updated weights for policy 0, policy_version 129293 (0.0041) [2024-06-28 02:31:23,852][06674] Fps is (10 sec: 42589.8, 60 sec: 43689.1, 300 sec: 44153.2). Total num frames: 2118418432. Throughput: 0: 44151.8. Samples: 2021384160. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 02:31:23,853][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:31:26,158][06909] Updated weights for policy 0, policy_version 129303 (0.0024) [2024-06-28 02:31:28,536][06909] Updated weights for policy 0, policy_version 129313 (0.0024) [2024-06-28 02:31:28,850][06674] Fps is (10 sec: 47514.0, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 2118664192. Throughput: 0: 44141.4. Samples: 2021527020. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 02:31:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 02:31:33,401][06909] Updated weights for policy 0, policy_version 129323 (0.0025) [2024-06-28 02:31:33,850][06674] Fps is (10 sec: 42607.3, 60 sec: 44509.9, 300 sec: 43986.9). Total num frames: 2118844416. Throughput: 0: 44408.3. Samples: 2021795700. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 02:31:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:31:35,936][06909] Updated weights for policy 0, policy_version 129333 (0.0041) [2024-06-28 02:31:38,850][06674] Fps is (10 sec: 40959.4, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2119073792. Throughput: 0: 44397.2. Samples: 2022048920. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 02:31:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:31:40,717][06909] Updated weights for policy 0, policy_version 129343 (0.0039) [2024-06-28 02:31:43,665][06909] Updated weights for policy 0, policy_version 129353 (0.0031) [2024-06-28 02:31:43,850][06674] Fps is (10 sec: 49151.3, 60 sec: 44782.9, 300 sec: 44098.0). Total num frames: 2119335936. Throughput: 0: 44042.5. Samples: 2022186140. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 02:31:43,855][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:31:47,956][06909] Updated weights for policy 0, policy_version 129363 (0.0035) [2024-06-28 02:31:48,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44511.3, 300 sec: 44042.4). Total num frames: 2119516160. Throughput: 0: 44205.3. Samples: 2022454520. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 02:31:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 02:31:48,856][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000129365_2119516160.pth... [2024-06-28 02:31:48,913][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000128720_2108948480.pth [2024-06-28 02:31:50,930][06909] Updated weights for policy 0, policy_version 129373 (0.0034) [2024-06-28 02:31:53,850][06674] Fps is (10 sec: 39321.8, 60 sec: 43690.6, 300 sec: 44098.0). Total num frames: 2119729152. Throughput: 0: 44174.2. Samples: 2022712620. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 02:31:53,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:31:55,558][06909] Updated weights for policy 0, policy_version 129383 (0.0048) [2024-06-28 02:31:58,214][06909] Updated weights for policy 0, policy_version 129393 (0.0032) [2024-06-28 02:31:58,852][06674] Fps is (10 sec: 47504.3, 60 sec: 44235.3, 300 sec: 44097.6). Total num frames: 2119991296. Throughput: 0: 43986.1. Samples: 2022845480. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 02:31:58,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:32:03,378][06909] Updated weights for policy 0, policy_version 129403 (0.0038) [2024-06-28 02:32:03,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2120171520. Throughput: 0: 44156.8. Samples: 2023114980. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 02:32:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:32:05,653][06909] Updated weights for policy 0, policy_version 129413 (0.0038) [2024-06-28 02:32:08,850][06674] Fps is (10 sec: 39329.3, 60 sec: 43690.7, 300 sec: 44097.9). Total num frames: 2120384512. Throughput: 0: 44067.7. Samples: 2023367120. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 02:32:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:32:10,631][06909] Updated weights for policy 0, policy_version 129423 (0.0029) [2024-06-28 02:32:13,260][06909] Updated weights for policy 0, policy_version 129433 (0.0036) [2024-06-28 02:32:13,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2120646656. Throughput: 0: 43926.5. Samples: 2023503720. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 02:32:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:32:17,967][06909] Updated weights for policy 0, policy_version 129443 (0.0035) [2024-06-28 02:32:18,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2120826880. Throughput: 0: 43937.7. Samples: 2023772900. Policy #0 lag: (min: 0.0, avg: 12.8, max: 25.0) [2024-06-28 02:32:18,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:32:20,034][06887] Signal inference workers to stop experience collection... (28750 times) [2024-06-28 02:32:20,039][06887] Signal inference workers to resume experience collection... (28750 times) [2024-06-28 02:32:20,084][06909] InferenceWorker_p0-w0: stopping experience collection (28750 times) [2024-06-28 02:32:20,084][06909] InferenceWorker_p0-w0: resuming experience collection (28750 times) [2024-06-28 02:32:20,904][06909] Updated weights for policy 0, policy_version 129453 (0.0031) [2024-06-28 02:32:23,850][06674] Fps is (10 sec: 39322.2, 60 sec: 43692.2, 300 sec: 44043.3). Total num frames: 2121039872. Throughput: 0: 43887.7. Samples: 2024023860. Policy #0 lag: (min: 0.0, avg: 12.8, max: 25.0) [2024-06-28 02:32:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:32:25,204][06909] Updated weights for policy 0, policy_version 129463 (0.0026) [2024-06-28 02:32:28,273][06909] Updated weights for policy 0, policy_version 129473 (0.0036) [2024-06-28 02:32:28,850][06674] Fps is (10 sec: 47514.0, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2121302016. Throughput: 0: 43918.8. Samples: 2024162480. Policy #0 lag: (min: 0.0, avg: 12.8, max: 25.0) [2024-06-28 02:32:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:32:33,085][06909] Updated weights for policy 0, policy_version 129483 (0.0026) [2024-06-28 02:32:33,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2121482240. Throughput: 0: 43892.4. Samples: 2024429680. Policy #0 lag: (min: 0.0, avg: 12.8, max: 25.0) [2024-06-28 02:32:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:32:35,786][06909] Updated weights for policy 0, policy_version 129493 (0.0026) [2024-06-28 02:32:38,850][06674] Fps is (10 sec: 39321.3, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2121695232. Throughput: 0: 43752.0. Samples: 2024681460. Policy #0 lag: (min: 0.0, avg: 12.8, max: 25.0) [2024-06-28 02:32:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:32:40,457][06909] Updated weights for policy 0, policy_version 129503 (0.0035) [2024-06-28 02:32:42,962][06909] Updated weights for policy 0, policy_version 129513 (0.0041) [2024-06-28 02:32:43,850][06674] Fps is (10 sec: 47513.6, 60 sec: 43690.7, 300 sec: 44097.9). Total num frames: 2121957376. Throughput: 0: 43814.8. Samples: 2024817060. Policy #0 lag: (min: 0.0, avg: 12.8, max: 25.0) [2024-06-28 02:32:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:32:47,923][06909] Updated weights for policy 0, policy_version 129523 (0.0032) [2024-06-28 02:32:48,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2122153984. Throughput: 0: 43833.0. Samples: 2025087460. Policy #0 lag: (min: 0.0, avg: 12.8, max: 25.0) [2024-06-28 02:32:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:32:50,754][06909] Updated weights for policy 0, policy_version 129533 (0.0027) [2024-06-28 02:32:53,850][06674] Fps is (10 sec: 39322.3, 60 sec: 43690.8, 300 sec: 44042.4). Total num frames: 2122350592. Throughput: 0: 43966.8. Samples: 2025345620. Policy #0 lag: (min: 0.0, avg: 12.8, max: 25.0) [2024-06-28 02:32:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:32:55,119][06909] Updated weights for policy 0, policy_version 129543 (0.0026) [2024-06-28 02:32:58,240][06909] Updated weights for policy 0, policy_version 129553 (0.0039) [2024-06-28 02:32:58,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43692.2, 300 sec: 44042.4). Total num frames: 2122612736. Throughput: 0: 43891.2. Samples: 2025478820. Policy #0 lag: (min: 0.0, avg: 12.8, max: 25.0) [2024-06-28 02:32:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 02:33:02,329][06909] Updated weights for policy 0, policy_version 129563 (0.0033) [2024-06-28 02:33:03,850][06674] Fps is (10 sec: 47513.0, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2122825728. Throughput: 0: 43764.1. Samples: 2025742280. Policy #0 lag: (min: 0.0, avg: 12.8, max: 25.0) [2024-06-28 02:33:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:33:05,492][06909] Updated weights for policy 0, policy_version 129573 (0.0033) [2024-06-28 02:33:08,850][06674] Fps is (10 sec: 39321.3, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2123005952. Throughput: 0: 44130.1. Samples: 2026009720. Policy #0 lag: (min: 0.0, avg: 12.8, max: 25.0) [2024-06-28 02:33:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:33:10,167][06909] Updated weights for policy 0, policy_version 129583 (0.0034) [2024-06-28 02:33:12,857][06909] Updated weights for policy 0, policy_version 129593 (0.0035) [2024-06-28 02:33:13,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2123284480. Throughput: 0: 43843.6. Samples: 2026135440. Policy #0 lag: (min: 0.0, avg: 12.8, max: 25.0) [2024-06-28 02:33:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:33:17,321][06909] Updated weights for policy 0, policy_version 129603 (0.0026) [2024-06-28 02:33:18,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2123481088. Throughput: 0: 43929.8. Samples: 2026406520. Policy #0 lag: (min: 0.0, avg: 7.4, max: 21.0) [2024-06-28 02:33:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:33:20,060][06909] Updated weights for policy 0, policy_version 129613 (0.0030) [2024-06-28 02:33:23,850][06674] Fps is (10 sec: 37683.1, 60 sec: 43690.6, 300 sec: 43987.2). Total num frames: 2123661312. Throughput: 0: 44355.2. Samples: 2026677440. Policy #0 lag: (min: 0.0, avg: 7.4, max: 21.0) [2024-06-28 02:33:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:33:24,698][06909] Updated weights for policy 0, policy_version 129623 (0.0024) [2024-06-28 02:33:27,690][06909] Updated weights for policy 0, policy_version 129633 (0.0026) [2024-06-28 02:33:28,852][06674] Fps is (10 sec: 47504.2, 60 sec: 44235.3, 300 sec: 44208.7). Total num frames: 2123956224. Throughput: 0: 44212.7. Samples: 2026806720. Policy #0 lag: (min: 0.0, avg: 7.4, max: 21.0) [2024-06-28 02:33:28,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:33:31,798][06887] Signal inference workers to stop experience collection... (28800 times) [2024-06-28 02:33:31,798][06887] Signal inference workers to resume experience collection... (28800 times) [2024-06-28 02:33:31,843][06909] InferenceWorker_p0-w0: stopping experience collection (28800 times) [2024-06-28 02:33:31,843][06909] InferenceWorker_p0-w0: resuming experience collection (28800 times) [2024-06-28 02:33:31,944][06909] Updated weights for policy 0, policy_version 129643 (0.0033) [2024-06-28 02:33:33,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2124136448. Throughput: 0: 44060.4. Samples: 2027070180. Policy #0 lag: (min: 0.0, avg: 7.4, max: 21.0) [2024-06-28 02:33:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:33:35,290][06909] Updated weights for policy 0, policy_version 129653 (0.0032) [2024-06-28 02:33:38,850][06674] Fps is (10 sec: 39329.4, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2124349440. Throughput: 0: 44292.3. Samples: 2027338780. Policy #0 lag: (min: 0.0, avg: 7.4, max: 21.0) [2024-06-28 02:33:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:33:39,742][06909] Updated weights for policy 0, policy_version 129663 (0.0030) [2024-06-28 02:33:42,493][06909] Updated weights for policy 0, policy_version 129673 (0.0036) [2024-06-28 02:33:43,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2124595200. Throughput: 0: 44143.0. Samples: 2027465260. Policy #0 lag: (min: 0.0, avg: 7.4, max: 21.0) [2024-06-28 02:33:43,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 02:33:47,357][06909] Updated weights for policy 0, policy_version 129683 (0.0031) [2024-06-28 02:33:48,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2124808192. Throughput: 0: 44238.7. Samples: 2027733020. Policy #0 lag: (min: 0.0, avg: 7.4, max: 21.0) [2024-06-28 02:33:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:33:48,977][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000129689_2124824576.pth... [2024-06-28 02:33:49,040][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000129040_2114191360.pth [2024-06-28 02:33:49,792][06909] Updated weights for policy 0, policy_version 129693 (0.0033) [2024-06-28 02:33:53,850][06674] Fps is (10 sec: 40959.8, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2125004800. Throughput: 0: 44347.1. Samples: 2028005340. Policy #0 lag: (min: 0.0, avg: 7.4, max: 21.0) [2024-06-28 02:33:53,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:33:54,500][06909] Updated weights for policy 0, policy_version 129703 (0.0027) [2024-06-28 02:33:57,034][06909] Updated weights for policy 0, policy_version 129713 (0.0035) [2024-06-28 02:33:58,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44509.9, 300 sec: 44320.1). Total num frames: 2125283328. Throughput: 0: 44345.3. Samples: 2028130980. Policy #0 lag: (min: 0.0, avg: 7.4, max: 21.0) [2024-06-28 02:33:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:34:02,078][06909] Updated weights for policy 0, policy_version 129723 (0.0034) [2024-06-28 02:34:03,850][06674] Fps is (10 sec: 47514.2, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2125479936. Throughput: 0: 44257.8. Samples: 2028398120. Policy #0 lag: (min: 0.0, avg: 7.4, max: 21.0) [2024-06-28 02:34:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:34:04,674][06909] Updated weights for policy 0, policy_version 129733 (0.0030) [2024-06-28 02:34:08,850][06674] Fps is (10 sec: 37682.9, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2125660160. Throughput: 0: 44182.2. Samples: 2028665640. Policy #0 lag: (min: 0.0, avg: 7.4, max: 21.0) [2024-06-28 02:34:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:34:09,370][06909] Updated weights for policy 0, policy_version 129743 (0.0032) [2024-06-28 02:34:12,265][06909] Updated weights for policy 0, policy_version 129753 (0.0037) [2024-06-28 02:34:13,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.8, 300 sec: 44264.6). Total num frames: 2125938688. Throughput: 0: 44149.1. Samples: 2028793340. Policy #0 lag: (min: 0.0, avg: 7.4, max: 21.0) [2024-06-28 02:34:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:34:17,027][06909] Updated weights for policy 0, policy_version 129763 (0.0040) [2024-06-28 02:34:18,850][06674] Fps is (10 sec: 47513.9, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2126135296. Throughput: 0: 44145.8. Samples: 2029056740. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 02:34:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:34:19,581][06909] Updated weights for policy 0, policy_version 129773 (0.0030) [2024-06-28 02:34:23,850][06674] Fps is (10 sec: 40959.6, 60 sec: 44782.9, 300 sec: 44097.9). Total num frames: 2126348288. Throughput: 0: 44102.6. Samples: 2029323400. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 02:34:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:34:24,252][06909] Updated weights for policy 0, policy_version 129783 (0.0033) [2024-06-28 02:34:26,813][06909] Updated weights for policy 0, policy_version 129793 (0.0030) [2024-06-28 02:34:28,852][06674] Fps is (10 sec: 45865.8, 60 sec: 43963.7, 300 sec: 44264.3). Total num frames: 2126594048. Throughput: 0: 44072.3. Samples: 2029448600. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 02:34:28,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:34:30,988][06887] Signal inference workers to stop experience collection... (28850 times) [2024-06-28 02:34:31,018][06909] InferenceWorker_p0-w0: stopping experience collection (28850 times) [2024-06-28 02:34:31,110][06887] Signal inference workers to resume experience collection... (28850 times) [2024-06-28 02:34:31,110][06909] InferenceWorker_p0-w0: resuming experience collection (28850 times) [2024-06-28 02:34:31,401][06909] Updated weights for policy 0, policy_version 129803 (0.0028) [2024-06-28 02:34:33,850][06674] Fps is (10 sec: 45875.7, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 2126807040. Throughput: 0: 44190.2. Samples: 2029721580. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 02:34:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:34:34,366][06909] Updated weights for policy 0, policy_version 129813 (0.0030) [2024-06-28 02:34:38,850][06674] Fps is (10 sec: 40968.0, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2127003648. Throughput: 0: 44193.8. Samples: 2029994060. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 02:34:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:34:38,916][06909] Updated weights for policy 0, policy_version 129823 (0.0027) [2024-06-28 02:34:41,752][06909] Updated weights for policy 0, policy_version 129833 (0.0028) [2024-06-28 02:34:43,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.9, 300 sec: 44264.6). Total num frames: 2127249408. Throughput: 0: 44092.0. Samples: 2030115120. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 02:34:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:34:46,491][06909] Updated weights for policy 0, policy_version 129843 (0.0029) [2024-06-28 02:34:48,850][06674] Fps is (10 sec: 47514.1, 60 sec: 44509.9, 300 sec: 44097.9). Total num frames: 2127478784. Throughput: 0: 44125.3. Samples: 2030383760. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 02:34:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:34:49,300][06909] Updated weights for policy 0, policy_version 129853 (0.0036) [2024-06-28 02:34:53,850][06674] Fps is (10 sec: 40959.7, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2127659008. Throughput: 0: 43978.7. Samples: 2030644680. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 02:34:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:34:53,958][06909] Updated weights for policy 0, policy_version 129863 (0.0025) [2024-06-28 02:34:56,841][06909] Updated weights for policy 0, policy_version 129873 (0.0034) [2024-06-28 02:34:58,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.6, 300 sec: 44209.0). Total num frames: 2127904768. Throughput: 0: 43827.1. Samples: 2030765560. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 02:34:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:35:01,449][06909] Updated weights for policy 0, policy_version 129883 (0.0025) [2024-06-28 02:35:03,850][06674] Fps is (10 sec: 47513.8, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2128134144. Throughput: 0: 44016.9. Samples: 2031037500. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 02:35:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:35:04,238][06909] Updated weights for policy 0, policy_version 129893 (0.0036) [2024-06-28 02:35:08,853][06674] Fps is (10 sec: 40947.2, 60 sec: 44234.5, 300 sec: 43986.4). Total num frames: 2128314368. Throughput: 0: 44061.9. Samples: 2031306320. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 02:35:08,853][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:35:08,864][06909] Updated weights for policy 0, policy_version 129903 (0.0029) [2024-06-28 02:35:11,901][06909] Updated weights for policy 0, policy_version 129913 (0.0033) [2024-06-28 02:35:13,851][06674] Fps is (10 sec: 42593.5, 60 sec: 43689.8, 300 sec: 44264.4). Total num frames: 2128560128. Throughput: 0: 43997.8. Samples: 2031428460. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 02:35:13,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:35:16,291][06909] Updated weights for policy 0, policy_version 129923 (0.0034) [2024-06-28 02:35:18,850][06674] Fps is (10 sec: 47528.1, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2128789504. Throughput: 0: 43803.0. Samples: 2031692720. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 02:35:18,851][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 02:35:19,407][06909] Updated weights for policy 0, policy_version 129933 (0.0037) [2024-06-28 02:35:23,852][06674] Fps is (10 sec: 40956.1, 60 sec: 43689.2, 300 sec: 43986.6). Total num frames: 2128969728. Throughput: 0: 43672.7. Samples: 2031959420. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 02:35:23,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:35:23,998][06909] Updated weights for policy 0, policy_version 129943 (0.0021) [2024-06-28 02:35:26,913][06909] Updated weights for policy 0, policy_version 129953 (0.0034) [2024-06-28 02:35:28,850][06674] Fps is (10 sec: 42599.2, 60 sec: 43692.2, 300 sec: 44209.0). Total num frames: 2129215488. Throughput: 0: 43689.0. Samples: 2032081120. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 02:35:28,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:35:31,411][06909] Updated weights for policy 0, policy_version 129963 (0.0031) [2024-06-28 02:35:33,850][06674] Fps is (10 sec: 45884.4, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 2129428480. Throughput: 0: 43650.6. Samples: 2032348040. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 02:35:33,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:35:34,442][06909] Updated weights for policy 0, policy_version 129973 (0.0037) [2024-06-28 02:35:38,389][06887] Signal inference workers to stop experience collection... (28900 times) [2024-06-28 02:35:38,446][06909] InferenceWorker_p0-w0: stopping experience collection (28900 times) [2024-06-28 02:35:38,452][06887] Signal inference workers to resume experience collection... (28900 times) [2024-06-28 02:35:38,464][06909] InferenceWorker_p0-w0: resuming experience collection (28900 times) [2024-06-28 02:35:38,591][06909] Updated weights for policy 0, policy_version 129983 (0.0036) [2024-06-28 02:35:38,852][06674] Fps is (10 sec: 42589.3, 60 sec: 43962.3, 300 sec: 44042.1). Total num frames: 2129641472. Throughput: 0: 43898.0. Samples: 2032620180. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 02:35:38,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:35:41,592][06909] Updated weights for policy 0, policy_version 129993 (0.0036) [2024-06-28 02:35:43,852][06674] Fps is (10 sec: 44228.0, 60 sec: 43689.1, 300 sec: 44153.5). Total num frames: 2129870848. Throughput: 0: 43953.6. Samples: 2032743560. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 02:35:43,852][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 02:35:46,005][06909] Updated weights for policy 0, policy_version 130003 (0.0026) [2024-06-28 02:35:48,850][06674] Fps is (10 sec: 47523.0, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2130116608. Throughput: 0: 43931.5. Samples: 2033014420. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 02:35:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:35:48,871][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000130012_2130116608.pth... [2024-06-28 02:35:48,947][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000129365_2119516160.pth [2024-06-28 02:35:49,119][06909] Updated weights for policy 0, policy_version 130013 (0.0035) [2024-06-28 02:35:53,478][06909] Updated weights for policy 0, policy_version 130023 (0.0024) [2024-06-28 02:35:53,850][06674] Fps is (10 sec: 44245.9, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2130313216. Throughput: 0: 43856.4. Samples: 2033279720. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 02:35:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:35:56,524][06909] Updated weights for policy 0, policy_version 130033 (0.0026) [2024-06-28 02:35:58,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43690.6, 300 sec: 44097.9). Total num frames: 2130526208. Throughput: 0: 43922.8. Samples: 2033404940. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 02:35:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:36:01,114][06909] Updated weights for policy 0, policy_version 130043 (0.0035) [2024-06-28 02:36:03,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2130771968. Throughput: 0: 44130.2. Samples: 2033678580. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 02:36:03,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:36:04,042][06909] Updated weights for policy 0, policy_version 130053 (0.0024) [2024-06-28 02:36:08,269][06909] Updated weights for policy 0, policy_version 130063 (0.0044) [2024-06-28 02:36:08,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43965.9, 300 sec: 43931.3). Total num frames: 2130952192. Throughput: 0: 44051.2. Samples: 2033941640. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 02:36:08,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:36:11,551][06909] Updated weights for policy 0, policy_version 130073 (0.0033) [2024-06-28 02:36:13,852][06674] Fps is (10 sec: 40951.8, 60 sec: 43690.0, 300 sec: 44042.1). Total num frames: 2131181568. Throughput: 0: 44162.8. Samples: 2034068540. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 02:36:13,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:36:15,766][06909] Updated weights for policy 0, policy_version 130083 (0.0035) [2024-06-28 02:36:18,850][06674] Fps is (10 sec: 47514.7, 60 sec: 43963.9, 300 sec: 44098.3). Total num frames: 2131427328. Throughput: 0: 44085.1. Samples: 2034331860. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 02:36:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:36:18,910][06909] Updated weights for policy 0, policy_version 130093 (0.0027) [2024-06-28 02:36:23,198][06909] Updated weights for policy 0, policy_version 130103 (0.0043) [2024-06-28 02:36:23,850][06674] Fps is (10 sec: 44245.7, 60 sec: 44238.3, 300 sec: 43931.3). Total num frames: 2131623936. Throughput: 0: 43941.5. Samples: 2034597460. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 02:36:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:36:26,207][06909] Updated weights for policy 0, policy_version 130113 (0.0038) [2024-06-28 02:36:28,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2131836928. Throughput: 0: 43935.3. Samples: 2034720560. Policy #0 lag: (min: 1.0, avg: 10.8, max: 23.0) [2024-06-28 02:36:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:36:30,668][06909] Updated weights for policy 0, policy_version 130123 (0.0040) [2024-06-28 02:36:33,674][06909] Updated weights for policy 0, policy_version 130133 (0.0028) [2024-06-28 02:36:33,850][06674] Fps is (10 sec: 47513.8, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 2132099072. Throughput: 0: 44014.7. Samples: 2034995080. Policy #0 lag: (min: 1.0, avg: 10.8, max: 23.0) [2024-06-28 02:36:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:36:38,448][06909] Updated weights for policy 0, policy_version 130143 (0.0035) [2024-06-28 02:36:38,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43965.2, 300 sec: 43875.8). Total num frames: 2132279296. Throughput: 0: 43892.9. Samples: 2035254900. Policy #0 lag: (min: 1.0, avg: 10.8, max: 23.0) [2024-06-28 02:36:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:36:41,489][06909] Updated weights for policy 0, policy_version 130153 (0.0035) [2024-06-28 02:36:43,850][06674] Fps is (10 sec: 37683.3, 60 sec: 43419.1, 300 sec: 43931.4). Total num frames: 2132475904. Throughput: 0: 43853.0. Samples: 2035378320. Policy #0 lag: (min: 1.0, avg: 10.8, max: 23.0) [2024-06-28 02:36:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:36:45,729][06909] Updated weights for policy 0, policy_version 130163 (0.0042) [2024-06-28 02:36:48,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 2132738048. Throughput: 0: 43662.7. Samples: 2035643400. Policy #0 lag: (min: 1.0, avg: 10.8, max: 23.0) [2024-06-28 02:36:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:36:48,923][06909] Updated weights for policy 0, policy_version 130173 (0.0035) [2024-06-28 02:36:53,122][06909] Updated weights for policy 0, policy_version 130183 (0.0028) [2024-06-28 02:36:53,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43417.6, 300 sec: 43820.6). Total num frames: 2132918272. Throughput: 0: 43686.3. Samples: 2035907520. Policy #0 lag: (min: 1.0, avg: 10.8, max: 23.0) [2024-06-28 02:36:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:36:56,280][06909] Updated weights for policy 0, policy_version 130193 (0.0030) [2024-06-28 02:36:58,850][06674] Fps is (10 sec: 39321.4, 60 sec: 43417.6, 300 sec: 43931.3). Total num frames: 2133131264. Throughput: 0: 43549.0. Samples: 2036028160. Policy #0 lag: (min: 1.0, avg: 10.8, max: 23.0) [2024-06-28 02:36:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:37:00,708][06909] Updated weights for policy 0, policy_version 130203 (0.0047) [2024-06-28 02:37:02,935][06887] Signal inference workers to stop experience collection... (28950 times) [2024-06-28 02:37:02,936][06887] Signal inference workers to resume experience collection... (28950 times) [2024-06-28 02:37:02,979][06909] InferenceWorker_p0-w0: stopping experience collection (28950 times) [2024-06-28 02:37:02,980][06909] InferenceWorker_p0-w0: resuming experience collection (28950 times) [2024-06-28 02:37:03,600][06909] Updated weights for policy 0, policy_version 130213 (0.0031) [2024-06-28 02:37:03,850][06674] Fps is (10 sec: 49152.1, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2133409792. Throughput: 0: 43731.4. Samples: 2036299780. Policy #0 lag: (min: 1.0, avg: 10.8, max: 23.0) [2024-06-28 02:37:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:37:07,974][06909] Updated weights for policy 0, policy_version 130223 (0.0036) [2024-06-28 02:37:08,850][06674] Fps is (10 sec: 47514.4, 60 sec: 44237.0, 300 sec: 43931.4). Total num frames: 2133606400. Throughput: 0: 43879.7. Samples: 2036572040. Policy #0 lag: (min: 1.0, avg: 10.8, max: 23.0) [2024-06-28 02:37:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:37:11,122][06909] Updated weights for policy 0, policy_version 130233 (0.0021) [2024-06-28 02:37:13,850][06674] Fps is (10 sec: 39321.4, 60 sec: 43692.1, 300 sec: 43986.9). Total num frames: 2133803008. Throughput: 0: 43894.1. Samples: 2036695800. Policy #0 lag: (min: 1.0, avg: 10.8, max: 23.0) [2024-06-28 02:37:13,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:37:15,431][06909] Updated weights for policy 0, policy_version 130243 (0.0024) [2024-06-28 02:37:18,514][06909] Updated weights for policy 0, policy_version 130253 (0.0025) [2024-06-28 02:37:18,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2134065152. Throughput: 0: 43848.0. Samples: 2036968240. Policy #0 lag: (min: 1.0, avg: 10.8, max: 23.0) [2024-06-28 02:37:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:37:22,893][06909] Updated weights for policy 0, policy_version 130263 (0.0037) [2024-06-28 02:37:23,850][06674] Fps is (10 sec: 45875.9, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 2134261760. Throughput: 0: 43832.1. Samples: 2037227340. Policy #0 lag: (min: 1.0, avg: 10.8, max: 23.0) [2024-06-28 02:37:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:37:26,129][06909] Updated weights for policy 0, policy_version 130273 (0.0031) [2024-06-28 02:37:28,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2134474752. Throughput: 0: 43946.5. Samples: 2037355920. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-28 02:37:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:37:30,159][06909] Updated weights for policy 0, policy_version 130283 (0.0045) [2024-06-28 02:37:33,419][06909] Updated weights for policy 0, policy_version 130293 (0.0023) [2024-06-28 02:37:33,850][06674] Fps is (10 sec: 47513.2, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 2134736896. Throughput: 0: 44198.7. Samples: 2037632340. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-28 02:37:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:37:37,738][06909] Updated weights for policy 0, policy_version 130303 (0.0021) [2024-06-28 02:37:38,850][06674] Fps is (10 sec: 47514.0, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 2134949888. Throughput: 0: 44143.6. Samples: 2037893980. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-28 02:37:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:37:40,706][06909] Updated weights for policy 0, policy_version 130313 (0.0030) [2024-06-28 02:37:43,850][06674] Fps is (10 sec: 39321.7, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2135130112. Throughput: 0: 44253.4. Samples: 2038019560. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-28 02:37:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:37:45,023][06909] Updated weights for policy 0, policy_version 130323 (0.0044) [2024-06-28 02:37:48,278][06909] Updated weights for policy 0, policy_version 130333 (0.0026) [2024-06-28 02:37:48,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44509.9, 300 sec: 44264.6). Total num frames: 2135408640. Throughput: 0: 44258.7. Samples: 2038291420. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-28 02:37:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:37:48,863][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000130335_2135408640.pth... [2024-06-28 02:37:48,911][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000129689_2124824576.pth [2024-06-28 02:37:52,623][06909] Updated weights for policy 0, policy_version 130343 (0.0045) [2024-06-28 02:37:53,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44509.9, 300 sec: 43986.9). Total num frames: 2135588864. Throughput: 0: 43995.1. Samples: 2038551820. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-28 02:37:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:37:55,616][06909] Updated weights for policy 0, policy_version 130353 (0.0030) [2024-06-28 02:37:58,850][06674] Fps is (10 sec: 37683.2, 60 sec: 44236.9, 300 sec: 43931.3). Total num frames: 2135785472. Throughput: 0: 44034.3. Samples: 2038677340. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-28 02:37:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:38:00,100][06909] Updated weights for policy 0, policy_version 130363 (0.0042) [2024-06-28 02:38:02,801][06887] Signal inference workers to stop experience collection... (29000 times) [2024-06-28 02:38:02,808][06887] Signal inference workers to resume experience collection... (29000 times) [2024-06-28 02:38:02,816][06909] InferenceWorker_p0-w0: stopping experience collection (29000 times) [2024-06-28 02:38:02,850][06909] InferenceWorker_p0-w0: resuming experience collection (29000 times) [2024-06-28 02:38:03,174][06909] Updated weights for policy 0, policy_version 130373 (0.0027) [2024-06-28 02:38:03,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 2136047616. Throughput: 0: 44051.9. Samples: 2038950580. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-28 02:38:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:38:07,463][06909] Updated weights for policy 0, policy_version 130383 (0.0030) [2024-06-28 02:38:08,850][06674] Fps is (10 sec: 47513.9, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2136260608. Throughput: 0: 44084.0. Samples: 2039211120. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-28 02:38:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:38:10,601][06909] Updated weights for policy 0, policy_version 130393 (0.0031) [2024-06-28 02:38:13,850][06674] Fps is (10 sec: 36045.1, 60 sec: 43417.7, 300 sec: 43820.3). Total num frames: 2136408064. Throughput: 0: 44056.1. Samples: 2039338440. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-28 02:38:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:38:15,037][06909] Updated weights for policy 0, policy_version 130403 (0.0036) [2024-06-28 02:38:18,164][06909] Updated weights for policy 0, policy_version 130413 (0.0027) [2024-06-28 02:38:18,850][06674] Fps is (10 sec: 45874.6, 60 sec: 44236.7, 300 sec: 44264.6). Total num frames: 2136719360. Throughput: 0: 43785.7. Samples: 2039602700. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-28 02:38:18,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:38:22,516][06909] Updated weights for policy 0, policy_version 130423 (0.0037) [2024-06-28 02:38:23,850][06674] Fps is (10 sec: 50790.0, 60 sec: 44236.7, 300 sec: 43931.6). Total num frames: 2136915968. Throughput: 0: 43946.1. Samples: 2039871560. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-28 02:38:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:38:25,470][06909] Updated weights for policy 0, policy_version 130433 (0.0039) [2024-06-28 02:38:28,850][06674] Fps is (10 sec: 37683.0, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 2137096192. Throughput: 0: 43995.4. Samples: 2039999360. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-28 02:38:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 02:38:29,923][06909] Updated weights for policy 0, policy_version 130443 (0.0029) [2024-06-28 02:38:33,047][06909] Updated weights for policy 0, policy_version 130453 (0.0037) [2024-06-28 02:38:33,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2137374720. Throughput: 0: 43853.7. Samples: 2040264840. Policy #0 lag: (min: 0.0, avg: 8.2, max: 22.0) [2024-06-28 02:38:33,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:38:37,285][06909] Updated weights for policy 0, policy_version 130463 (0.0033) [2024-06-28 02:38:38,850][06674] Fps is (10 sec: 49152.2, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2137587712. Throughput: 0: 43902.6. Samples: 2040527440. Policy #0 lag: (min: 0.0, avg: 8.2, max: 22.0) [2024-06-28 02:38:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:38:40,303][06909] Updated weights for policy 0, policy_version 130473 (0.0026) [2024-06-28 02:38:43,850][06674] Fps is (10 sec: 37683.2, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 2137751552. Throughput: 0: 44087.1. Samples: 2040661260. Policy #0 lag: (min: 0.0, avg: 8.2, max: 22.0) [2024-06-28 02:38:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:38:44,842][06909] Updated weights for policy 0, policy_version 130483 (0.0041) [2024-06-28 02:38:47,172][06887] Signal inference workers to stop experience collection... (29050 times) [2024-06-28 02:38:47,173][06887] Signal inference workers to resume experience collection... (29050 times) [2024-06-28 02:38:47,189][06909] InferenceWorker_p0-w0: stopping experience collection (29050 times) [2024-06-28 02:38:47,189][06909] InferenceWorker_p0-w0: resuming experience collection (29050 times) [2024-06-28 02:38:47,645][06909] Updated weights for policy 0, policy_version 130493 (0.0032) [2024-06-28 02:38:48,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 2138030080. Throughput: 0: 43907.7. Samples: 2040926420. Policy #0 lag: (min: 0.0, avg: 8.2, max: 22.0) [2024-06-28 02:38:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:38:52,405][06909] Updated weights for policy 0, policy_version 130503 (0.0034) [2024-06-28 02:38:53,853][06674] Fps is (10 sec: 49134.9, 60 sec: 44234.2, 300 sec: 43930.8). Total num frames: 2138243072. Throughput: 0: 44012.1. Samples: 2041191820. Policy #0 lag: (min: 0.0, avg: 8.2, max: 22.0) [2024-06-28 02:38:53,854][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:38:55,034][06909] Updated weights for policy 0, policy_version 130513 (0.0021) [2024-06-28 02:38:58,850][06674] Fps is (10 sec: 40960.1, 60 sec: 44236.9, 300 sec: 43931.3). Total num frames: 2138439680. Throughput: 0: 44231.6. Samples: 2041328860. Policy #0 lag: (min: 0.0, avg: 8.2, max: 22.0) [2024-06-28 02:38:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:38:59,646][06909] Updated weights for policy 0, policy_version 130523 (0.0024) [2024-06-28 02:39:02,362][06909] Updated weights for policy 0, policy_version 130533 (0.0031) [2024-06-28 02:39:03,850][06674] Fps is (10 sec: 44251.9, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2138685440. Throughput: 0: 44131.5. Samples: 2041588620. Policy #0 lag: (min: 0.0, avg: 8.2, max: 22.0) [2024-06-28 02:39:03,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:39:07,240][06909] Updated weights for policy 0, policy_version 130543 (0.0031) [2024-06-28 02:39:08,850][06674] Fps is (10 sec: 45874.3, 60 sec: 43963.6, 300 sec: 43931.3). Total num frames: 2138898432. Throughput: 0: 44094.2. Samples: 2041855800. Policy #0 lag: (min: 0.0, avg: 8.2, max: 22.0) [2024-06-28 02:39:08,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:39:09,750][06909] Updated weights for policy 0, policy_version 130553 (0.0037) [2024-06-28 02:39:13,856][06674] Fps is (10 sec: 40935.6, 60 sec: 44778.4, 300 sec: 43930.4). Total num frames: 2139095040. Throughput: 0: 44289.7. Samples: 2041992660. Policy #0 lag: (min: 0.0, avg: 8.2, max: 22.0) [2024-06-28 02:39:13,856][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 02:39:14,468][06909] Updated weights for policy 0, policy_version 130563 (0.0043) [2024-06-28 02:39:17,194][06909] Updated weights for policy 0, policy_version 130573 (0.0031) [2024-06-28 02:39:18,852][06674] Fps is (10 sec: 44228.8, 60 sec: 43689.3, 300 sec: 44042.1). Total num frames: 2139340800. Throughput: 0: 44079.0. Samples: 2042248480. Policy #0 lag: (min: 0.0, avg: 8.2, max: 22.0) [2024-06-28 02:39:18,852][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 02:39:22,123][06909] Updated weights for policy 0, policy_version 130583 (0.0041) [2024-06-28 02:39:23,850][06674] Fps is (10 sec: 49181.6, 60 sec: 44509.9, 300 sec: 44042.7). Total num frames: 2139586560. Throughput: 0: 44248.9. Samples: 2042518640. Policy #0 lag: (min: 0.0, avg: 8.2, max: 22.0) [2024-06-28 02:39:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:39:24,610][06909] Updated weights for policy 0, policy_version 130593 (0.0019) [2024-06-28 02:39:28,850][06674] Fps is (10 sec: 40967.8, 60 sec: 44236.9, 300 sec: 43875.8). Total num frames: 2139750400. Throughput: 0: 44145.3. Samples: 2042647800. Policy #0 lag: (min: 0.0, avg: 8.2, max: 22.0) [2024-06-28 02:39:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:39:29,815][06909] Updated weights for policy 0, policy_version 130603 (0.0028) [2024-06-28 02:39:32,438][06909] Updated weights for policy 0, policy_version 130613 (0.0030) [2024-06-28 02:39:33,851][06674] Fps is (10 sec: 40955.8, 60 sec: 43689.9, 300 sec: 44042.3). Total num frames: 2139996160. Throughput: 0: 43986.5. Samples: 2042905860. Policy #0 lag: (min: 1.0, avg: 11.4, max: 19.0) [2024-06-28 02:39:33,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:39:37,061][06909] Updated weights for policy 0, policy_version 130623 (0.0029) [2024-06-28 02:39:38,856][06674] Fps is (10 sec: 47484.9, 60 sec: 43959.3, 300 sec: 43986.0). Total num frames: 2140225536. Throughput: 0: 44187.3. Samples: 2043180360. Policy #0 lag: (min: 1.0, avg: 11.4, max: 19.0) [2024-06-28 02:39:38,856][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:39:39,616][06909] Updated weights for policy 0, policy_version 130633 (0.0029) [2024-06-28 02:39:43,850][06674] Fps is (10 sec: 40964.4, 60 sec: 44236.8, 300 sec: 43820.3). Total num frames: 2140405760. Throughput: 0: 44016.4. Samples: 2043309600. Policy #0 lag: (min: 1.0, avg: 11.4, max: 19.0) [2024-06-28 02:39:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:39:44,403][06909] Updated weights for policy 0, policy_version 130643 (0.0026) [2024-06-28 02:39:46,987][06909] Updated weights for policy 0, policy_version 130653 (0.0020) [2024-06-28 02:39:48,850][06674] Fps is (10 sec: 42623.5, 60 sec: 43690.5, 300 sec: 44042.4). Total num frames: 2140651520. Throughput: 0: 43959.5. Samples: 2043566800. Policy #0 lag: (min: 1.0, avg: 11.4, max: 19.0) [2024-06-28 02:39:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:39:48,867][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000130655_2140651520.pth... [2024-06-28 02:39:48,928][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000130012_2130116608.pth [2024-06-28 02:39:51,903][06909] Updated weights for policy 0, policy_version 130663 (0.0033) [2024-06-28 02:39:52,277][06887] Signal inference workers to stop experience collection... (29100 times) [2024-06-28 02:39:52,277][06887] Signal inference workers to resume experience collection... (29100 times) [2024-06-28 02:39:52,291][06909] InferenceWorker_p0-w0: stopping experience collection (29100 times) [2024-06-28 02:39:52,291][06909] InferenceWorker_p0-w0: resuming experience collection (29100 times) [2024-06-28 02:39:53,856][06674] Fps is (10 sec: 47484.7, 60 sec: 43961.9, 300 sec: 43986.0). Total num frames: 2140880896. Throughput: 0: 43945.8. Samples: 2043833620. Policy #0 lag: (min: 1.0, avg: 11.4, max: 19.0) [2024-06-28 02:39:53,857][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:39:54,417][06909] Updated weights for policy 0, policy_version 130673 (0.0043) [2024-06-28 02:39:58,850][06674] Fps is (10 sec: 40960.7, 60 sec: 43690.6, 300 sec: 43820.3). Total num frames: 2141061120. Throughput: 0: 43848.1. Samples: 2043965560. Policy #0 lag: (min: 1.0, avg: 11.4, max: 19.0) [2024-06-28 02:39:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:39:59,703][06909] Updated weights for policy 0, policy_version 130683 (0.0021) [2024-06-28 02:40:02,225][06909] Updated weights for policy 0, policy_version 130693 (0.0041) [2024-06-28 02:40:03,850][06674] Fps is (10 sec: 42624.1, 60 sec: 43690.7, 300 sec: 44042.9). Total num frames: 2141306880. Throughput: 0: 43755.2. Samples: 2044217380. Policy #0 lag: (min: 1.0, avg: 11.4, max: 19.0) [2024-06-28 02:40:03,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 02:40:07,026][06909] Updated weights for policy 0, policy_version 130703 (0.0039) [2024-06-28 02:40:08,850][06674] Fps is (10 sec: 49152.5, 60 sec: 44236.9, 300 sec: 44042.6). Total num frames: 2141552640. Throughput: 0: 43863.2. Samples: 2044492480. Policy #0 lag: (min: 1.0, avg: 11.4, max: 19.0) [2024-06-28 02:40:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:40:09,668][06909] Updated weights for policy 0, policy_version 130713 (0.0022) [2024-06-28 02:40:13,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43695.1, 300 sec: 43820.3). Total num frames: 2141716480. Throughput: 0: 44006.3. Samples: 2044628080. Policy #0 lag: (min: 1.0, avg: 11.4, max: 19.0) [2024-06-28 02:40:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:40:14,388][06909] Updated weights for policy 0, policy_version 130723 (0.0029) [2024-06-28 02:40:16,940][06909] Updated weights for policy 0, policy_version 130733 (0.0050) [2024-06-28 02:40:18,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43692.1, 300 sec: 44042.7). Total num frames: 2141962240. Throughput: 0: 43985.5. Samples: 2044885160. Policy #0 lag: (min: 1.0, avg: 11.4, max: 19.0) [2024-06-28 02:40:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:40:21,538][06909] Updated weights for policy 0, policy_version 130743 (0.0035) [2024-06-28 02:40:23,850][06674] Fps is (10 sec: 49151.8, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2142208000. Throughput: 0: 43896.1. Samples: 2045155420. Policy #0 lag: (min: 1.0, avg: 11.4, max: 19.0) [2024-06-28 02:40:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:40:24,435][06909] Updated weights for policy 0, policy_version 130753 (0.0029) [2024-06-28 02:40:28,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 2142404608. Throughput: 0: 43988.1. Samples: 2045289060. Policy #0 lag: (min: 1.0, avg: 11.4, max: 19.0) [2024-06-28 02:40:28,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:40:28,976][06909] Updated weights for policy 0, policy_version 130763 (0.0042) [2024-06-28 02:40:32,118][06909] Updated weights for policy 0, policy_version 130773 (0.0040) [2024-06-28 02:40:33,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43964.5, 300 sec: 44042.7). Total num frames: 2142633984. Throughput: 0: 43994.0. Samples: 2045546520. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 02:40:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:40:36,398][06909] Updated weights for policy 0, policy_version 130783 (0.0026) [2024-06-28 02:40:38,850][06674] Fps is (10 sec: 47513.1, 60 sec: 44241.3, 300 sec: 44098.3). Total num frames: 2142879744. Throughput: 0: 44112.6. Samples: 2045818420. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 02:40:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:40:39,398][06909] Updated weights for policy 0, policy_version 130793 (0.0040) [2024-06-28 02:40:43,850][06674] Fps is (10 sec: 42597.9, 60 sec: 44236.7, 300 sec: 43875.8). Total num frames: 2143059968. Throughput: 0: 44135.0. Samples: 2045951640. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 02:40:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:40:44,013][06909] Updated weights for policy 0, policy_version 130803 (0.0030) [2024-06-28 02:40:46,708][06909] Updated weights for policy 0, policy_version 130813 (0.0036) [2024-06-28 02:40:48,850][06674] Fps is (10 sec: 39321.5, 60 sec: 43690.8, 300 sec: 43931.3). Total num frames: 2143272960. Throughput: 0: 44236.9. Samples: 2046208040. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 02:40:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:40:51,554][06909] Updated weights for policy 0, policy_version 130823 (0.0028) [2024-06-28 02:40:53,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44241.2, 300 sec: 44097.9). Total num frames: 2143535104. Throughput: 0: 44106.1. Samples: 2046477260. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 02:40:53,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 02:40:54,322][06909] Updated weights for policy 0, policy_version 130833 (0.0032) [2024-06-28 02:40:58,730][06909] Updated weights for policy 0, policy_version 130843 (0.0045) [2024-06-28 02:40:58,850][06674] Fps is (10 sec: 45875.8, 60 sec: 44509.9, 300 sec: 43931.4). Total num frames: 2143731712. Throughput: 0: 44120.5. Samples: 2046613500. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 02:40:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:41:00,708][06887] Signal inference workers to stop experience collection... (29150 times) [2024-06-28 02:41:00,708][06887] Signal inference workers to resume experience collection... (29150 times) [2024-06-28 02:41:00,748][06909] InferenceWorker_p0-w0: stopping experience collection (29150 times) [2024-06-28 02:41:00,749][06909] InferenceWorker_p0-w0: resuming experience collection (29150 times) [2024-06-28 02:41:01,927][06909] Updated weights for policy 0, policy_version 130853 (0.0028) [2024-06-28 02:41:03,850][06674] Fps is (10 sec: 42598.7, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2143961088. Throughput: 0: 44209.7. Samples: 2046874600. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 02:41:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:41:06,019][06909] Updated weights for policy 0, policy_version 130863 (0.0028) [2024-06-28 02:41:08,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.7, 300 sec: 44098.3). Total num frames: 2144190464. Throughput: 0: 43991.6. Samples: 2047135040. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 02:41:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:41:09,515][06909] Updated weights for policy 0, policy_version 130873 (0.0027) [2024-06-28 02:41:13,315][06909] Updated weights for policy 0, policy_version 130883 (0.0022) [2024-06-28 02:41:13,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44782.9, 300 sec: 43986.8). Total num frames: 2144403456. Throughput: 0: 44134.1. Samples: 2047275100. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 02:41:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:41:16,727][06909] Updated weights for policy 0, policy_version 130893 (0.0046) [2024-06-28 02:41:18,850][06674] Fps is (10 sec: 42598.0, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2144616448. Throughput: 0: 44231.4. Samples: 2047536940. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 02:41:18,854][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:41:21,391][06909] Updated weights for policy 0, policy_version 130903 (0.0035) [2024-06-28 02:41:23,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2144845824. Throughput: 0: 43921.8. Samples: 2047794900. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 02:41:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:41:24,390][06909] Updated weights for policy 0, policy_version 130913 (0.0035) [2024-06-28 02:41:28,580][06909] Updated weights for policy 0, policy_version 130923 (0.0036) [2024-06-28 02:41:28,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 2145042432. Throughput: 0: 44035.2. Samples: 2047933220. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 02:41:28,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 02:41:31,860][06909] Updated weights for policy 0, policy_version 130933 (0.0034) [2024-06-28 02:41:33,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 2145271808. Throughput: 0: 44204.9. Samples: 2048197260. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 02:41:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:41:35,785][06909] Updated weights for policy 0, policy_version 130943 (0.0026) [2024-06-28 02:41:38,850][06674] Fps is (10 sec: 47513.0, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 2145517568. Throughput: 0: 43985.3. Samples: 2048456600. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 02:41:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:41:39,269][06909] Updated weights for policy 0, policy_version 130953 (0.0036) [2024-06-28 02:41:43,199][06909] Updated weights for policy 0, policy_version 130963 (0.0040) [2024-06-28 02:41:43,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 2145730560. Throughput: 0: 44074.6. Samples: 2048596860. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 02:41:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:41:46,581][06909] Updated weights for policy 0, policy_version 130973 (0.0032) [2024-06-28 02:41:48,850][06674] Fps is (10 sec: 40959.9, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2145927168. Throughput: 0: 44165.7. Samples: 2048862060. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 02:41:48,853][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:41:48,865][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000130977_2145927168.pth... [2024-06-28 02:41:48,917][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000130335_2135408640.pth [2024-06-28 02:41:50,507][06909] Updated weights for policy 0, policy_version 130983 (0.0032) [2024-06-28 02:41:53,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.8, 300 sec: 44209.0). Total num frames: 2146172928. Throughput: 0: 44151.5. Samples: 2049121860. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 02:41:53,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 02:41:54,222][06909] Updated weights for policy 0, policy_version 130993 (0.0023) [2024-06-28 02:41:58,185][06909] Updated weights for policy 0, policy_version 131003 (0.0045) [2024-06-28 02:41:58,850][06674] Fps is (10 sec: 45875.9, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2146385920. Throughput: 0: 44133.0. Samples: 2049261080. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 02:41:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:42:02,106][06909] Updated weights for policy 0, policy_version 131013 (0.0032) [2024-06-28 02:42:03,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2146582528. Throughput: 0: 44107.2. Samples: 2049521760. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 02:42:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:42:05,515][06909] Updated weights for policy 0, policy_version 131023 (0.0036) [2024-06-28 02:42:08,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2146828288. Throughput: 0: 44087.9. Samples: 2049778860. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 02:42:08,851][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 02:42:09,570][06909] Updated weights for policy 0, policy_version 131033 (0.0031) [2024-06-28 02:42:12,985][06909] Updated weights for policy 0, policy_version 131043 (0.0031) [2024-06-28 02:42:13,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2147041280. Throughput: 0: 44101.3. Samples: 2049917780. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 02:42:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:42:14,949][06887] Signal inference workers to stop experience collection... (29200 times) [2024-06-28 02:42:14,950][06887] Signal inference workers to resume experience collection... (29200 times) [2024-06-28 02:42:14,994][06909] InferenceWorker_p0-w0: stopping experience collection (29200 times) [2024-06-28 02:42:14,994][06909] InferenceWorker_p0-w0: resuming experience collection (29200 times) [2024-06-28 02:42:16,920][06909] Updated weights for policy 0, policy_version 131053 (0.0043) [2024-06-28 02:42:18,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2147237888. Throughput: 0: 44033.0. Samples: 2050178740. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 02:42:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:42:20,434][06909] Updated weights for policy 0, policy_version 131063 (0.0041) [2024-06-28 02:42:23,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2147467264. Throughput: 0: 44027.2. Samples: 2050437820. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 02:42:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:42:24,395][06909] Updated weights for policy 0, policy_version 131073 (0.0042) [2024-06-28 02:42:27,917][06909] Updated weights for policy 0, policy_version 131083 (0.0044) [2024-06-28 02:42:28,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 2147696640. Throughput: 0: 43993.4. Samples: 2050576560. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 02:42:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:42:31,879][06909] Updated weights for policy 0, policy_version 131093 (0.0032) [2024-06-28 02:42:33,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 2147909632. Throughput: 0: 43963.7. Samples: 2050840420. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 02:42:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:42:35,536][06909] Updated weights for policy 0, policy_version 131103 (0.0038) [2024-06-28 02:42:38,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43690.7, 300 sec: 44097.9). Total num frames: 2148139008. Throughput: 0: 43909.3. Samples: 2051097780. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 02:42:38,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 02:42:39,293][06909] Updated weights for policy 0, policy_version 131113 (0.0034) [2024-06-28 02:42:42,816][06909] Updated weights for policy 0, policy_version 131123 (0.0036) [2024-06-28 02:42:43,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 2148352000. Throughput: 0: 43872.9. Samples: 2051235360. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 02:42:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:42:46,822][06909] Updated weights for policy 0, policy_version 131133 (0.0030) [2024-06-28 02:42:48,850][06674] Fps is (10 sec: 40960.6, 60 sec: 43690.8, 300 sec: 43931.3). Total num frames: 2148548608. Throughput: 0: 43876.0. Samples: 2051496180. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 02:42:48,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 02:42:50,150][06909] Updated weights for policy 0, policy_version 131143 (0.0037) [2024-06-28 02:42:53,852][06674] Fps is (10 sec: 44227.2, 60 sec: 43689.1, 300 sec: 44097.6). Total num frames: 2148794368. Throughput: 0: 44001.1. Samples: 2051759000. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 02:42:53,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:42:54,493][06909] Updated weights for policy 0, policy_version 131153 (0.0049) [2024-06-28 02:42:57,565][06909] Updated weights for policy 0, policy_version 131163 (0.0043) [2024-06-28 02:42:58,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 2149007360. Throughput: 0: 43972.0. Samples: 2051896520. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 02:42:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:43:01,737][06909] Updated weights for policy 0, policy_version 131173 (0.0032) [2024-06-28 02:43:03,850][06674] Fps is (10 sec: 40968.9, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 2149203968. Throughput: 0: 43924.9. Samples: 2052155360. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 02:43:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:43:05,351][06909] Updated weights for policy 0, policy_version 131183 (0.0033) [2024-06-28 02:43:08,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.7, 300 sec: 44209.0). Total num frames: 2149449728. Throughput: 0: 43798.2. Samples: 2052408740. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 02:43:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:43:09,286][06909] Updated weights for policy 0, policy_version 131193 (0.0039) [2024-06-28 02:43:12,499][06909] Updated weights for policy 0, policy_version 131203 (0.0028) [2024-06-28 02:43:13,850][06674] Fps is (10 sec: 47513.3, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 2149679104. Throughput: 0: 43837.3. Samples: 2052549240. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 02:43:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:43:16,521][06909] Updated weights for policy 0, policy_version 131213 (0.0031) [2024-06-28 02:43:18,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 2149859328. Throughput: 0: 43908.0. Samples: 2052816280. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 02:43:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:43:20,145][06909] Updated weights for policy 0, policy_version 131223 (0.0027) [2024-06-28 02:43:23,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2150105088. Throughput: 0: 43878.2. Samples: 2053072300. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 02:43:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:43:24,437][06909] Updated weights for policy 0, policy_version 131233 (0.0040) [2024-06-28 02:43:25,622][06887] Signal inference workers to stop experience collection... (29250 times) [2024-06-28 02:43:25,678][06909] InferenceWorker_p0-w0: stopping experience collection (29250 times) [2024-06-28 02:43:25,686][06887] Signal inference workers to resume experience collection... (29250 times) [2024-06-28 02:43:25,687][06909] InferenceWorker_p0-w0: resuming experience collection (29250 times) [2024-06-28 02:43:27,605][06909] Updated weights for policy 0, policy_version 131243 (0.0036) [2024-06-28 02:43:28,850][06674] Fps is (10 sec: 49151.2, 60 sec: 44236.6, 300 sec: 43986.9). Total num frames: 2150350848. Throughput: 0: 43889.1. Samples: 2053210380. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 02:43:28,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:43:31,677][06909] Updated weights for policy 0, policy_version 131253 (0.0038) [2024-06-28 02:43:33,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 2150531072. Throughput: 0: 43961.6. Samples: 2053474460. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 02:43:33,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:43:34,873][06909] Updated weights for policy 0, policy_version 131263 (0.0046) [2024-06-28 02:43:38,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.6, 300 sec: 44097.9). Total num frames: 2150760448. Throughput: 0: 43974.8. Samples: 2053737780. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 02:43:38,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:43:38,924][06909] Updated weights for policy 0, policy_version 131273 (0.0040) [2024-06-28 02:43:42,347][06909] Updated weights for policy 0, policy_version 131283 (0.0031) [2024-06-28 02:43:43,850][06674] Fps is (10 sec: 49152.1, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 2151022592. Throughput: 0: 44083.5. Samples: 2053880280. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 02:43:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:43:46,347][06909] Updated weights for policy 0, policy_version 131293 (0.0035) [2024-06-28 02:43:48,850][06674] Fps is (10 sec: 44237.5, 60 sec: 44236.7, 300 sec: 43931.9). Total num frames: 2151202816. Throughput: 0: 44183.5. Samples: 2054143620. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 02:43:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:43:48,924][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000131300_2151219200.pth... [2024-06-28 02:43:48,972][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000130655_2140651520.pth [2024-06-28 02:43:49,897][06909] Updated weights for policy 0, policy_version 131303 (0.0035) [2024-06-28 02:43:53,646][06909] Updated weights for policy 0, policy_version 131313 (0.0043) [2024-06-28 02:43:53,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43965.2, 300 sec: 44042.4). Total num frames: 2151432192. Throughput: 0: 44195.5. Samples: 2054397540. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 02:43:53,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:43:57,412][06909] Updated weights for policy 0, policy_version 131323 (0.0041) [2024-06-28 02:43:58,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 2151677952. Throughput: 0: 44086.6. Samples: 2054533140. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 02:43:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:44:01,241][06909] Updated weights for policy 0, policy_version 131333 (0.0030) [2024-06-28 02:44:03,850][06674] Fps is (10 sec: 42598.8, 60 sec: 44236.8, 300 sec: 43931.4). Total num frames: 2151858176. Throughput: 0: 44193.4. Samples: 2054804980. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 02:44:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:44:04,730][06909] Updated weights for policy 0, policy_version 131343 (0.0027) [2024-06-28 02:44:08,684][06909] Updated weights for policy 0, policy_version 131353 (0.0034) [2024-06-28 02:44:08,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43963.8, 300 sec: 44043.3). Total num frames: 2152087552. Throughput: 0: 44323.2. Samples: 2055066840. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 02:44:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:44:12,357][06909] Updated weights for policy 0, policy_version 131363 (0.0032) [2024-06-28 02:44:13,850][06674] Fps is (10 sec: 49151.4, 60 sec: 44509.8, 300 sec: 44098.2). Total num frames: 2152349696. Throughput: 0: 44174.8. Samples: 2055198240. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 02:44:13,850][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 02:44:16,009][06909] Updated weights for policy 0, policy_version 131373 (0.0041) [2024-06-28 02:44:18,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.8, 300 sec: 43820.3). Total num frames: 2152513536. Throughput: 0: 44200.5. Samples: 2055463480. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 02:44:18,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 02:44:19,871][06909] Updated weights for policy 0, policy_version 131383 (0.0022) [2024-06-28 02:44:23,701][06909] Updated weights for policy 0, policy_version 131393 (0.0035) [2024-06-28 02:44:23,850][06674] Fps is (10 sec: 39322.2, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2152742912. Throughput: 0: 44100.7. Samples: 2055722300. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 02:44:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:44:27,371][06909] Updated weights for policy 0, policy_version 131403 (0.0039) [2024-06-28 02:44:28,850][06674] Fps is (10 sec: 49152.0, 60 sec: 44237.0, 300 sec: 44098.1). Total num frames: 2153005056. Throughput: 0: 43967.6. Samples: 2055858820. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 02:44:28,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:44:30,791][06909] Updated weights for policy 0, policy_version 131413 (0.0025) [2024-06-28 02:44:31,380][06887] Signal inference workers to stop experience collection... (29300 times) [2024-06-28 02:44:31,427][06887] Signal inference workers to resume experience collection... (29300 times) [2024-06-28 02:44:31,428][06909] InferenceWorker_p0-w0: stopping experience collection (29300 times) [2024-06-28 02:44:31,445][06909] InferenceWorker_p0-w0: resuming experience collection (29300 times) [2024-06-28 02:44:33,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.8, 300 sec: 43932.2). Total num frames: 2153185280. Throughput: 0: 44028.5. Samples: 2056124900. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 02:44:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:44:34,534][06909] Updated weights for policy 0, policy_version 131423 (0.0034) [2024-06-28 02:44:38,384][06909] Updated weights for policy 0, policy_version 131433 (0.0029) [2024-06-28 02:44:38,856][06674] Fps is (10 sec: 42572.6, 60 sec: 44505.5, 300 sec: 44152.6). Total num frames: 2153431040. Throughput: 0: 44343.0. Samples: 2056393240. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 02:44:38,856][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 02:44:41,702][06909] Updated weights for policy 0, policy_version 131443 (0.0036) [2024-06-28 02:44:43,850][06674] Fps is (10 sec: 49151.9, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2153676800. Throughput: 0: 44272.0. Samples: 2056525380. Policy #0 lag: (min: 1.0, avg: 11.7, max: 22.0) [2024-06-28 02:44:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:44:45,609][06909] Updated weights for policy 0, policy_version 131453 (0.0026) [2024-06-28 02:44:48,850][06674] Fps is (10 sec: 42623.8, 60 sec: 44236.8, 300 sec: 43987.8). Total num frames: 2153857024. Throughput: 0: 44187.5. Samples: 2056793420. Policy #0 lag: (min: 1.0, avg: 11.7, max: 22.0) [2024-06-28 02:44:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:44:49,347][06909] Updated weights for policy 0, policy_version 131463 (0.0019) [2024-06-28 02:44:52,803][06909] Updated weights for policy 0, policy_version 131473 (0.0037) [2024-06-28 02:44:53,850][06674] Fps is (10 sec: 40959.6, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2154086400. Throughput: 0: 44187.8. Samples: 2057055300. Policy #0 lag: (min: 1.0, avg: 11.7, max: 22.0) [2024-06-28 02:44:53,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:44:57,023][06909] Updated weights for policy 0, policy_version 131483 (0.0039) [2024-06-28 02:44:58,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2154332160. Throughput: 0: 44177.8. Samples: 2057186240. Policy #0 lag: (min: 1.0, avg: 11.7, max: 22.0) [2024-06-28 02:44:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:45:00,596][06909] Updated weights for policy 0, policy_version 131493 (0.0040) [2024-06-28 02:45:03,850][06674] Fps is (10 sec: 40960.6, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 2154496000. Throughput: 0: 44127.5. Samples: 2057449220. Policy #0 lag: (min: 1.0, avg: 11.7, max: 22.0) [2024-06-28 02:45:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:45:04,394][06909] Updated weights for policy 0, policy_version 131503 (0.0037) [2024-06-28 02:45:07,889][06909] Updated weights for policy 0, policy_version 131513 (0.0031) [2024-06-28 02:45:08,850][06674] Fps is (10 sec: 39322.1, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2154725376. Throughput: 0: 44175.1. Samples: 2057710180. Policy #0 lag: (min: 1.0, avg: 11.7, max: 22.0) [2024-06-28 02:45:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:45:11,790][06909] Updated weights for policy 0, policy_version 131523 (0.0031) [2024-06-28 02:45:13,850][06674] Fps is (10 sec: 49151.7, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2154987520. Throughput: 0: 44180.8. Samples: 2057846960. Policy #0 lag: (min: 1.0, avg: 11.7, max: 22.0) [2024-06-28 02:45:13,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:45:15,397][06909] Updated weights for policy 0, policy_version 131533 (0.0026) [2024-06-28 02:45:18,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44509.8, 300 sec: 43986.9). Total num frames: 2155184128. Throughput: 0: 44178.2. Samples: 2058112920. Policy #0 lag: (min: 1.0, avg: 11.7, max: 22.0) [2024-06-28 02:45:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:45:19,188][06909] Updated weights for policy 0, policy_version 131543 (0.0037) [2024-06-28 02:45:22,706][06909] Updated weights for policy 0, policy_version 131553 (0.0034) [2024-06-28 02:45:23,850][06674] Fps is (10 sec: 40960.0, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2155397120. Throughput: 0: 44042.7. Samples: 2058374900. Policy #0 lag: (min: 1.0, avg: 11.7, max: 22.0) [2024-06-28 02:45:23,854][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:45:26,538][06909] Updated weights for policy 0, policy_version 131563 (0.0030) [2024-06-28 02:45:28,850][06674] Fps is (10 sec: 47513.0, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 2155659264. Throughput: 0: 44143.5. Samples: 2058511840. Policy #0 lag: (min: 1.0, avg: 11.7, max: 22.0) [2024-06-28 02:45:28,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:45:29,885][06909] Updated weights for policy 0, policy_version 131573 (0.0032) [2024-06-28 02:45:33,809][06909] Updated weights for policy 0, policy_version 131583 (0.0028) [2024-06-28 02:45:33,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44509.9, 300 sec: 43986.9). Total num frames: 2155855872. Throughput: 0: 44153.4. Samples: 2058780320. Policy #0 lag: (min: 1.0, avg: 11.7, max: 22.0) [2024-06-28 02:45:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:45:37,533][06909] Updated weights for policy 0, policy_version 131593 (0.0037) [2024-06-28 02:45:38,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43968.1, 300 sec: 44098.0). Total num frames: 2156068864. Throughput: 0: 44209.0. Samples: 2059044700. Policy #0 lag: (min: 1.0, avg: 11.7, max: 22.0) [2024-06-28 02:45:38,853][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:45:41,239][06909] Updated weights for policy 0, policy_version 131603 (0.0038) [2024-06-28 02:45:43,852][06674] Fps is (10 sec: 45865.7, 60 sec: 43962.2, 300 sec: 44208.7). Total num frames: 2156314624. Throughput: 0: 44239.8. Samples: 2059177120. Policy #0 lag: (min: 1.0, avg: 11.7, max: 22.0) [2024-06-28 02:45:43,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:45:44,768][06909] Updated weights for policy 0, policy_version 131613 (0.0035) [2024-06-28 02:45:48,708][06909] Updated weights for policy 0, policy_version 131623 (0.0026) [2024-06-28 02:45:48,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 2156527616. Throughput: 0: 44263.5. Samples: 2059441080. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 02:45:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:45:48,865][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000131624_2156527616.pth... [2024-06-28 02:45:48,912][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000130977_2145927168.pth [2024-06-28 02:45:52,326][06909] Updated weights for policy 0, policy_version 131633 (0.0031) [2024-06-28 02:45:53,754][06887] Signal inference workers to stop experience collection... (29350 times) [2024-06-28 02:45:53,754][06887] Signal inference workers to resume experience collection... (29350 times) [2024-06-28 02:45:53,798][06909] InferenceWorker_p0-w0: stopping experience collection (29350 times) [2024-06-28 02:45:53,798][06909] InferenceWorker_p0-w0: resuming experience collection (29350 times) [2024-06-28 02:45:53,850][06674] Fps is (10 sec: 40968.8, 60 sec: 43963.9, 300 sec: 44042.4). Total num frames: 2156724224. Throughput: 0: 44303.1. Samples: 2059703820. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 02:45:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:45:56,081][06909] Updated weights for policy 0, policy_version 131643 (0.0029) [2024-06-28 02:45:58,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 2156969984. Throughput: 0: 44088.0. Samples: 2059830920. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 02:45:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:45:59,750][06909] Updated weights for policy 0, policy_version 131653 (0.0033) [2024-06-28 02:46:03,331][06909] Updated weights for policy 0, policy_version 131663 (0.0025) [2024-06-28 02:46:03,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44782.9, 300 sec: 44042.4). Total num frames: 2157182976. Throughput: 0: 44208.9. Samples: 2060102320. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 02:46:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:46:07,145][06909] Updated weights for policy 0, policy_version 131673 (0.0035) [2024-06-28 02:46:08,850][06674] Fps is (10 sec: 42598.8, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 2157395968. Throughput: 0: 44237.0. Samples: 2060365560. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 02:46:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:46:10,990][06909] Updated weights for policy 0, policy_version 131683 (0.0034) [2024-06-28 02:46:13,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2157625344. Throughput: 0: 44049.0. Samples: 2060494040. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 02:46:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 02:46:14,611][06909] Updated weights for policy 0, policy_version 131693 (0.0034) [2024-06-28 02:46:18,527][06909] Updated weights for policy 0, policy_version 131703 (0.0027) [2024-06-28 02:46:18,856][06674] Fps is (10 sec: 44210.1, 60 sec: 44232.4, 300 sec: 44041.5). Total num frames: 2157838336. Throughput: 0: 43899.5. Samples: 2060756060. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 02:46:18,856][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:46:22,280][06909] Updated weights for policy 0, policy_version 131713 (0.0035) [2024-06-28 02:46:23,852][06674] Fps is (10 sec: 44227.7, 60 sec: 44508.4, 300 sec: 44153.2). Total num frames: 2158067712. Throughput: 0: 44034.5. Samples: 2061026340. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 02:46:23,853][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:46:25,704][06909] Updated weights for policy 0, policy_version 131723 (0.0021) [2024-06-28 02:46:28,850][06674] Fps is (10 sec: 42624.3, 60 sec: 43417.8, 300 sec: 44042.4). Total num frames: 2158264320. Throughput: 0: 43990.6. Samples: 2061156600. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 02:46:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:46:29,538][06909] Updated weights for policy 0, policy_version 131733 (0.0026) [2024-06-28 02:46:33,063][06909] Updated weights for policy 0, policy_version 131743 (0.0034) [2024-06-28 02:46:33,850][06674] Fps is (10 sec: 44245.5, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2158510080. Throughput: 0: 44031.5. Samples: 2061422500. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 02:46:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:46:37,288][06909] Updated weights for policy 0, policy_version 131753 (0.0042) [2024-06-28 02:46:38,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2158706688. Throughput: 0: 44081.3. Samples: 2061687480. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 02:46:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:46:40,462][06909] Updated weights for policy 0, policy_version 131763 (0.0032) [2024-06-28 02:46:43,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43965.2, 300 sec: 44153.5). Total num frames: 2158952448. Throughput: 0: 44055.6. Samples: 2061813420. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 02:46:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:46:44,447][06909] Updated weights for policy 0, policy_version 131773 (0.0036) [2024-06-28 02:46:47,977][06909] Updated weights for policy 0, policy_version 131783 (0.0040) [2024-06-28 02:46:48,850][06674] Fps is (10 sec: 45874.2, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 2159165440. Throughput: 0: 43999.8. Samples: 2062082320. Policy #0 lag: (min: 0.0, avg: 12.1, max: 25.0) [2024-06-28 02:46:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:46:51,639][06909] Updated weights for policy 0, policy_version 131793 (0.0043) [2024-06-28 02:46:53,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44509.8, 300 sec: 44097.9). Total num frames: 2159394816. Throughput: 0: 44095.5. Samples: 2062349860. Policy #0 lag: (min: 0.0, avg: 12.1, max: 25.0) [2024-06-28 02:46:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:46:55,422][06909] Updated weights for policy 0, policy_version 131803 (0.0027) [2024-06-28 02:46:58,850][06674] Fps is (10 sec: 44237.7, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2159607808. Throughput: 0: 44096.9. Samples: 2062478400. Policy #0 lag: (min: 0.0, avg: 12.1, max: 25.0) [2024-06-28 02:46:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 02:46:59,374][06909] Updated weights for policy 0, policy_version 131813 (0.0028) [2024-06-28 02:47:02,694][06909] Updated weights for policy 0, policy_version 131823 (0.0025) [2024-06-28 02:47:03,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2159820800. Throughput: 0: 44160.2. Samples: 2062743000. Policy #0 lag: (min: 0.0, avg: 12.1, max: 25.0) [2024-06-28 02:47:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:47:06,659][06909] Updated weights for policy 0, policy_version 131833 (0.0038) [2024-06-28 02:47:08,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2160050176. Throughput: 0: 44168.8. Samples: 2063013840. Policy #0 lag: (min: 0.0, avg: 12.1, max: 25.0) [2024-06-28 02:47:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:47:09,975][06909] Updated weights for policy 0, policy_version 131843 (0.0030) [2024-06-28 02:47:13,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2160263168. Throughput: 0: 44038.2. Samples: 2063138320. Policy #0 lag: (min: 0.0, avg: 12.1, max: 25.0) [2024-06-28 02:47:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:47:14,109][06909] Updated weights for policy 0, policy_version 131853 (0.0041) [2024-06-28 02:47:17,271][06909] Updated weights for policy 0, policy_version 131863 (0.0042) [2024-06-28 02:47:18,850][06674] Fps is (10 sec: 44235.9, 60 sec: 44241.1, 300 sec: 44153.5). Total num frames: 2160492544. Throughput: 0: 43958.6. Samples: 2063400640. Policy #0 lag: (min: 0.0, avg: 12.1, max: 25.0) [2024-06-28 02:47:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:47:19,439][06887] Signal inference workers to stop experience collection... (29400 times) [2024-06-28 02:47:19,439][06887] Signal inference workers to resume experience collection... (29400 times) [2024-06-28 02:47:19,470][06909] InferenceWorker_p0-w0: stopping experience collection (29400 times) [2024-06-28 02:47:19,470][06909] InferenceWorker_p0-w0: resuming experience collection (29400 times) [2024-06-28 02:47:21,328][06909] Updated weights for policy 0, policy_version 131873 (0.0044) [2024-06-28 02:47:23,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43965.3, 300 sec: 44098.0). Total num frames: 2160705536. Throughput: 0: 44230.7. Samples: 2063677860. Policy #0 lag: (min: 0.0, avg: 12.1, max: 25.0) [2024-06-28 02:47:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:47:24,970][06909] Updated weights for policy 0, policy_version 131883 (0.0036) [2024-06-28 02:47:28,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2160918528. Throughput: 0: 44308.4. Samples: 2063807300. Policy #0 lag: (min: 0.0, avg: 12.1, max: 25.0) [2024-06-28 02:47:28,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:47:29,019][06909] Updated weights for policy 0, policy_version 131893 (0.0037) [2024-06-28 02:47:32,434][06909] Updated weights for policy 0, policy_version 131903 (0.0036) [2024-06-28 02:47:33,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2161147904. Throughput: 0: 44173.6. Samples: 2064070120. Policy #0 lag: (min: 0.0, avg: 12.1, max: 25.0) [2024-06-28 02:47:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:47:36,466][06909] Updated weights for policy 0, policy_version 131913 (0.0038) [2024-06-28 02:47:38,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 2161377280. Throughput: 0: 44013.3. Samples: 2064330460. Policy #0 lag: (min: 0.0, avg: 12.1, max: 25.0) [2024-06-28 02:47:38,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:47:39,681][06909] Updated weights for policy 0, policy_version 131923 (0.0045) [2024-06-28 02:47:43,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 2161573888. Throughput: 0: 44042.6. Samples: 2064460320. Policy #0 lag: (min: 0.0, avg: 12.1, max: 25.0) [2024-06-28 02:47:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:47:44,042][06909] Updated weights for policy 0, policy_version 131933 (0.0034) [2024-06-28 02:47:47,236][06909] Updated weights for policy 0, policy_version 131943 (0.0035) [2024-06-28 02:47:48,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.9, 300 sec: 44153.8). Total num frames: 2161819648. Throughput: 0: 43955.1. Samples: 2064720980. Policy #0 lag: (min: 0.0, avg: 12.1, max: 25.0) [2024-06-28 02:47:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:47:48,870][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000131947_2161819648.pth... [2024-06-28 02:47:48,937][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000131300_2151219200.pth [2024-06-28 02:47:51,381][06909] Updated weights for policy 0, policy_version 131953 (0.0034) [2024-06-28 02:47:53,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2162032640. Throughput: 0: 43952.7. Samples: 2064991720. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 02:47:53,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:47:54,557][06909] Updated weights for policy 0, policy_version 131963 (0.0025) [2024-06-28 02:47:58,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43690.6, 300 sec: 44153.5). Total num frames: 2162229248. Throughput: 0: 44158.6. Samples: 2065125460. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 02:47:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:47:58,988][06909] Updated weights for policy 0, policy_version 131973 (0.0038) [2024-06-28 02:48:01,772][06909] Updated weights for policy 0, policy_version 131983 (0.0033) [2024-06-28 02:48:03,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 2162475008. Throughput: 0: 44201.3. Samples: 2065389700. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 02:48:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:48:06,328][06909] Updated weights for policy 0, policy_version 131993 (0.0030) [2024-06-28 02:48:08,850][06674] Fps is (10 sec: 49152.0, 60 sec: 44509.8, 300 sec: 44209.0). Total num frames: 2162720768. Throughput: 0: 43951.9. Samples: 2065655700. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 02:48:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:48:09,618][06909] Updated weights for policy 0, policy_version 132003 (0.0026) [2024-06-28 02:48:13,528][06909] Updated weights for policy 0, policy_version 132013 (0.0022) [2024-06-28 02:48:13,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 2162900992. Throughput: 0: 44109.3. Samples: 2065792220. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 02:48:13,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:48:17,043][06909] Updated weights for policy 0, policy_version 132023 (0.0037) [2024-06-28 02:48:18,852][06674] Fps is (10 sec: 42589.8, 60 sec: 44235.4, 300 sec: 44208.7). Total num frames: 2163146752. Throughput: 0: 44045.0. Samples: 2066052240. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 02:48:18,853][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:48:20,969][06909] Updated weights for policy 0, policy_version 132033 (0.0038) [2024-06-28 02:48:23,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.7, 300 sec: 44098.0). Total num frames: 2163359744. Throughput: 0: 44258.7. Samples: 2066322100. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 02:48:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:48:24,310][06909] Updated weights for policy 0, policy_version 132043 (0.0036) [2024-06-28 02:48:28,267][06909] Updated weights for policy 0, policy_version 132053 (0.0031) [2024-06-28 02:48:28,852][06674] Fps is (10 sec: 40959.9, 60 sec: 43962.3, 300 sec: 44153.2). Total num frames: 2163556352. Throughput: 0: 44259.7. Samples: 2066452100. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 02:48:28,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:48:31,464][06909] Updated weights for policy 0, policy_version 132063 (0.0024) [2024-06-28 02:48:33,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2163785728. Throughput: 0: 44298.2. Samples: 2066714400. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 02:48:33,850][06674] Avg episode reward: [(0, '0.410')] [2024-06-28 02:48:35,791][06909] Updated weights for policy 0, policy_version 132073 (0.0034) [2024-06-28 02:48:38,850][06674] Fps is (10 sec: 47523.4, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2164031488. Throughput: 0: 44245.9. Samples: 2066982780. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 02:48:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:48:39,104][06909] Updated weights for policy 0, policy_version 132083 (0.0031) [2024-06-28 02:48:43,191][06909] Updated weights for policy 0, policy_version 132093 (0.0035) [2024-06-28 02:48:43,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2164211712. Throughput: 0: 44251.2. Samples: 2067116760. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 02:48:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:48:46,538][06909] Updated weights for policy 0, policy_version 132103 (0.0029) [2024-06-28 02:48:48,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2164457472. Throughput: 0: 44175.6. Samples: 2067377600. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 02:48:48,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:48:50,595][06909] Updated weights for policy 0, policy_version 132113 (0.0026) [2024-06-28 02:48:52,123][06887] Signal inference workers to stop experience collection... (29450 times) [2024-06-28 02:48:52,148][06909] InferenceWorker_p0-w0: stopping experience collection (29450 times) [2024-06-28 02:48:52,182][06887] Signal inference workers to resume experience collection... (29450 times) [2024-06-28 02:48:52,183][06909] InferenceWorker_p0-w0: resuming experience collection (29450 times) [2024-06-28 02:48:53,850][06674] Fps is (10 sec: 47513.2, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2164686848. Throughput: 0: 44109.8. Samples: 2067640640. Policy #0 lag: (min: 0.0, avg: 10.9, max: 27.0) [2024-06-28 02:48:53,856][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:48:53,994][06909] Updated weights for policy 0, policy_version 132123 (0.0024) [2024-06-28 02:48:58,048][06909] Updated weights for policy 0, policy_version 132133 (0.0037) [2024-06-28 02:48:58,850][06674] Fps is (10 sec: 42597.8, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 2164883456. Throughput: 0: 43988.8. Samples: 2067771720. Policy #0 lag: (min: 0.0, avg: 10.9, max: 27.0) [2024-06-28 02:48:58,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:49:01,479][06909] Updated weights for policy 0, policy_version 132143 (0.0032) [2024-06-28 02:49:03,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.9, 300 sec: 44209.0). Total num frames: 2165129216. Throughput: 0: 44078.5. Samples: 2068035680. Policy #0 lag: (min: 0.0, avg: 10.9, max: 27.0) [2024-06-28 02:49:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:49:05,741][06909] Updated weights for policy 0, policy_version 132153 (0.0040) [2024-06-28 02:49:08,850][06674] Fps is (10 sec: 45876.0, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2165342208. Throughput: 0: 43912.9. Samples: 2068298180. Policy #0 lag: (min: 0.0, avg: 10.9, max: 27.0) [2024-06-28 02:49:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 02:49:09,003][06909] Updated weights for policy 0, policy_version 132163 (0.0030) [2024-06-28 02:49:12,997][06909] Updated weights for policy 0, policy_version 132173 (0.0032) [2024-06-28 02:49:13,850][06674] Fps is (10 sec: 39321.2, 60 sec: 43690.7, 300 sec: 44097.9). Total num frames: 2165522432. Throughput: 0: 44070.4. Samples: 2068435180. Policy #0 lag: (min: 0.0, avg: 10.9, max: 27.0) [2024-06-28 02:49:13,853][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:49:16,472][06909] Updated weights for policy 0, policy_version 132183 (0.0038) [2024-06-28 02:49:18,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43965.1, 300 sec: 44209.0). Total num frames: 2165784576. Throughput: 0: 44155.9. Samples: 2068701420. Policy #0 lag: (min: 0.0, avg: 10.9, max: 27.0) [2024-06-28 02:49:18,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:49:20,667][06909] Updated weights for policy 0, policy_version 132193 (0.0023) [2024-06-28 02:49:23,852][06674] Fps is (10 sec: 47504.1, 60 sec: 43962.3, 300 sec: 44042.1). Total num frames: 2165997568. Throughput: 0: 43901.1. Samples: 2068958420. Policy #0 lag: (min: 0.0, avg: 10.9, max: 27.0) [2024-06-28 02:49:23,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:49:23,970][06909] Updated weights for policy 0, policy_version 132203 (0.0039) [2024-06-28 02:49:27,883][06909] Updated weights for policy 0, policy_version 132213 (0.0022) [2024-06-28 02:49:28,850][06674] Fps is (10 sec: 42598.7, 60 sec: 44238.3, 300 sec: 44153.5). Total num frames: 2166210560. Throughput: 0: 43947.8. Samples: 2069094420. Policy #0 lag: (min: 0.0, avg: 10.9, max: 27.0) [2024-06-28 02:49:28,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:49:31,298][06909] Updated weights for policy 0, policy_version 132223 (0.0035) [2024-06-28 02:49:33,850][06674] Fps is (10 sec: 44245.6, 60 sec: 44236.8, 300 sec: 44098.8). Total num frames: 2166439936. Throughput: 0: 44081.3. Samples: 2069361260. Policy #0 lag: (min: 0.0, avg: 10.9, max: 27.0) [2024-06-28 02:49:33,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 02:49:35,168][06909] Updated weights for policy 0, policy_version 132233 (0.0038) [2024-06-28 02:49:38,699][06909] Updated weights for policy 0, policy_version 132243 (0.0027) [2024-06-28 02:49:38,852][06674] Fps is (10 sec: 45866.0, 60 sec: 43962.2, 300 sec: 44042.1). Total num frames: 2166669312. Throughput: 0: 44013.1. Samples: 2069621320. Policy #0 lag: (min: 0.0, avg: 10.9, max: 27.0) [2024-06-28 02:49:38,853][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 02:49:42,862][06909] Updated weights for policy 0, policy_version 132253 (0.0029) [2024-06-28 02:49:43,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 2166882304. Throughput: 0: 44076.3. Samples: 2069755140. Policy #0 lag: (min: 0.0, avg: 10.9, max: 27.0) [2024-06-28 02:49:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 02:49:46,096][06909] Updated weights for policy 0, policy_version 132263 (0.0039) [2024-06-28 02:49:48,850][06674] Fps is (10 sec: 44246.3, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 2167111680. Throughput: 0: 44131.1. Samples: 2070021580. Policy #0 lag: (min: 0.0, avg: 10.9, max: 27.0) [2024-06-28 02:49:48,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 02:49:48,862][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000132270_2167111680.pth... [2024-06-28 02:49:48,919][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000131624_2156527616.pth [2024-06-28 02:49:49,999][06909] Updated weights for policy 0, policy_version 132273 (0.0030) [2024-06-28 02:49:53,536][06909] Updated weights for policy 0, policy_version 132283 (0.0036) [2024-06-28 02:49:53,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2167324672. Throughput: 0: 44238.2. Samples: 2070288900. Policy #0 lag: (min: 0.0, avg: 10.9, max: 27.0) [2024-06-28 02:49:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 02:49:57,271][06909] Updated weights for policy 0, policy_version 132293 (0.0022) [2024-06-28 02:49:58,850][06674] Fps is (10 sec: 42597.8, 60 sec: 44236.9, 300 sec: 44209.0). Total num frames: 2167537664. Throughput: 0: 44094.2. Samples: 2070419420. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-28 02:49:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:50:00,983][06909] Updated weights for policy 0, policy_version 132303 (0.0035) [2024-06-28 02:50:03,851][06674] Fps is (10 sec: 45870.9, 60 sec: 44236.0, 300 sec: 44264.4). Total num frames: 2167783424. Throughput: 0: 44027.6. Samples: 2070682700. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-28 02:50:03,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 02:50:04,802][06909] Updated weights for policy 0, policy_version 132313 (0.0029) [2024-06-28 02:50:08,371][06909] Updated weights for policy 0, policy_version 132323 (0.0039) [2024-06-28 02:50:08,850][06674] Fps is (10 sec: 45874.5, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2167996416. Throughput: 0: 44175.1. Samples: 2070946220. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-28 02:50:08,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 02:50:12,125][06909] Updated weights for policy 0, policy_version 132333 (0.0027) [2024-06-28 02:50:13,850][06674] Fps is (10 sec: 42603.0, 60 sec: 44783.0, 300 sec: 44153.5). Total num frames: 2168209408. Throughput: 0: 44188.2. Samples: 2071082880. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-28 02:50:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:50:15,991][06887] Signal inference workers to stop experience collection... (29500 times) [2024-06-28 02:50:16,019][06909] InferenceWorker_p0-w0: stopping experience collection (29500 times) [2024-06-28 02:50:16,051][06887] Signal inference workers to resume experience collection... (29500 times) [2024-06-28 02:50:16,052][06909] InferenceWorker_p0-w0: resuming experience collection (29500 times) [2024-06-28 02:50:16,055][06909] Updated weights for policy 0, policy_version 132343 (0.0028) [2024-06-28 02:50:18,850][06674] Fps is (10 sec: 42599.2, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2168422400. Throughput: 0: 44149.8. Samples: 2071348000. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-28 02:50:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 02:50:19,903][06909] Updated weights for policy 0, policy_version 132353 (0.0021) [2024-06-28 02:50:23,233][06909] Updated weights for policy 0, policy_version 132363 (0.0026) [2024-06-28 02:50:23,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44238.3, 300 sec: 44042.4). Total num frames: 2168651776. Throughput: 0: 44243.4. Samples: 2071612180. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-28 02:50:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 02:50:27,064][06909] Updated weights for policy 0, policy_version 132373 (0.0036) [2024-06-28 02:50:28,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2168864768. Throughput: 0: 44190.2. Samples: 2071743700. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-28 02:50:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 02:50:30,698][06909] Updated weights for policy 0, policy_version 132383 (0.0040) [2024-06-28 02:50:33,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 2169094144. Throughput: 0: 44255.5. Samples: 2072013080. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-28 02:50:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:50:34,487][06909] Updated weights for policy 0, policy_version 132393 (0.0028) [2024-06-28 02:50:38,087][06909] Updated weights for policy 0, policy_version 132403 (0.0030) [2024-06-28 02:50:38,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44238.3, 300 sec: 44098.3). Total num frames: 2169323520. Throughput: 0: 44223.2. Samples: 2072278940. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-28 02:50:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 02:50:41,781][06909] Updated weights for policy 0, policy_version 132413 (0.0028) [2024-06-28 02:50:43,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2169536512. Throughput: 0: 44272.6. Samples: 2072411680. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-28 02:50:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 02:50:45,600][06909] Updated weights for policy 0, policy_version 132423 (0.0054) [2024-06-28 02:50:48,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2169749504. Throughput: 0: 44316.2. Samples: 2072676880. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-28 02:50:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:50:49,067][06909] Updated weights for policy 0, policy_version 132433 (0.0031) [2024-06-28 02:50:53,287][06909] Updated weights for policy 0, policy_version 132443 (0.0030) [2024-06-28 02:50:53,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2169962496. Throughput: 0: 44153.6. Samples: 2072933120. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-28 02:50:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 02:50:56,878][06909] Updated weights for policy 0, policy_version 132453 (0.0042) [2024-06-28 02:50:58,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2170175488. Throughput: 0: 44022.2. Samples: 2073063880. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-28 02:50:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:51:00,700][06909] Updated weights for policy 0, policy_version 132463 (0.0030) [2024-06-28 02:51:03,852][06674] Fps is (10 sec: 45865.6, 60 sec: 43963.0, 300 sec: 44153.2). Total num frames: 2170421248. Throughput: 0: 43959.8. Samples: 2073326280. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-28 02:51:03,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:51:04,033][06909] Updated weights for policy 0, policy_version 132473 (0.0031) [2024-06-28 02:51:08,262][06909] Updated weights for policy 0, policy_version 132483 (0.0035) [2024-06-28 02:51:08,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.9, 300 sec: 44098.0). Total num frames: 2170634240. Throughput: 0: 44163.1. Samples: 2073599520. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-28 02:51:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:51:11,641][06909] Updated weights for policy 0, policy_version 132493 (0.0035) [2024-06-28 02:51:13,850][06674] Fps is (10 sec: 42606.5, 60 sec: 43963.6, 300 sec: 44098.8). Total num frames: 2170847232. Throughput: 0: 44070.1. Samples: 2073726860. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-28 02:51:13,855][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 02:51:15,666][06909] Updated weights for policy 0, policy_version 132503 (0.0036) [2024-06-28 02:51:18,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.8, 300 sec: 44098.3). Total num frames: 2171076608. Throughput: 0: 43925.7. Samples: 2073989740. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-28 02:51:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:51:18,879][06909] Updated weights for policy 0, policy_version 132513 (0.0029) [2024-06-28 02:51:22,787][06909] Updated weights for policy 0, policy_version 132523 (0.0031) [2024-06-28 02:51:23,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.5, 300 sec: 44097.9). Total num frames: 2171273216. Throughput: 0: 44067.5. Samples: 2074261980. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-28 02:51:23,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 02:51:26,456][06909] Updated weights for policy 0, policy_version 132533 (0.0028) [2024-06-28 02:51:28,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2171502592. Throughput: 0: 43981.8. Samples: 2074390860. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-28 02:51:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 02:51:30,346][06909] Updated weights for policy 0, policy_version 132543 (0.0037) [2024-06-28 02:51:33,852][06674] Fps is (10 sec: 45866.5, 60 sec: 43962.2, 300 sec: 44153.2). Total num frames: 2171731968. Throughput: 0: 43877.1. Samples: 2074651440. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-28 02:51:33,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:51:34,021][06909] Updated weights for policy 0, policy_version 132553 (0.0029) [2024-06-28 02:51:36,642][06887] Signal inference workers to stop experience collection... (29550 times) [2024-06-28 02:51:36,675][06909] InferenceWorker_p0-w0: stopping experience collection (29550 times) [2024-06-28 02:51:36,700][06887] Signal inference workers to resume experience collection... (29550 times) [2024-06-28 02:51:36,701][06909] InferenceWorker_p0-w0: resuming experience collection (29550 times) [2024-06-28 02:51:37,478][06909] Updated weights for policy 0, policy_version 132563 (0.0027) [2024-06-28 02:51:38,850][06674] Fps is (10 sec: 47513.2, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2171977728. Throughput: 0: 44320.8. Samples: 2074927560. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-28 02:51:38,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 02:51:41,465][06909] Updated weights for policy 0, policy_version 132573 (0.0036) [2024-06-28 02:51:43,850][06674] Fps is (10 sec: 44246.0, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2172174336. Throughput: 0: 44412.0. Samples: 2075062420. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-28 02:51:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:51:45,070][06909] Updated weights for policy 0, policy_version 132583 (0.0046) [2024-06-28 02:51:48,687][06909] Updated weights for policy 0, policy_version 132593 (0.0023) [2024-06-28 02:51:48,850][06674] Fps is (10 sec: 42599.0, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2172403712. Throughput: 0: 44292.7. Samples: 2075319360. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-28 02:51:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:51:48,855][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000132593_2172403712.pth... [2024-06-28 02:51:48,911][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000131947_2161819648.pth [2024-06-28 02:51:52,383][06909] Updated weights for policy 0, policy_version 132603 (0.0031) [2024-06-28 02:51:53,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2172616704. Throughput: 0: 44192.4. Samples: 2075588180. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-28 02:51:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:51:56,261][06909] Updated weights for policy 0, policy_version 132613 (0.0030) [2024-06-28 02:51:58,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2172829696. Throughput: 0: 44372.2. Samples: 2075723600. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-28 02:51:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 02:51:59,766][06909] Updated weights for policy 0, policy_version 132623 (0.0031) [2024-06-28 02:52:03,441][06909] Updated weights for policy 0, policy_version 132633 (0.0029) [2024-06-28 02:52:03,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43965.2, 300 sec: 44097.9). Total num frames: 2173059072. Throughput: 0: 44395.1. Samples: 2075987520. Policy #0 lag: (min: 1.0, avg: 10.1, max: 22.0) [2024-06-28 02:52:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:52:07,121][06909] Updated weights for policy 0, policy_version 132643 (0.0049) [2024-06-28 02:52:08,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 2173288448. Throughput: 0: 44233.9. Samples: 2076252500. Policy #0 lag: (min: 1.0, avg: 10.1, max: 22.0) [2024-06-28 02:52:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:52:10,983][06909] Updated weights for policy 0, policy_version 132653 (0.0029) [2024-06-28 02:52:13,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2173501440. Throughput: 0: 44349.8. Samples: 2076386600. Policy #0 lag: (min: 1.0, avg: 10.1, max: 22.0) [2024-06-28 02:52:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:52:14,514][06909] Updated weights for policy 0, policy_version 132663 (0.0030) [2024-06-28 02:52:18,501][06909] Updated weights for policy 0, policy_version 132673 (0.0041) [2024-06-28 02:52:18,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2173714432. Throughput: 0: 44414.5. Samples: 2076650000. Policy #0 lag: (min: 1.0, avg: 10.1, max: 22.0) [2024-06-28 02:52:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:52:22,083][06909] Updated weights for policy 0, policy_version 132683 (0.0043) [2024-06-28 02:52:23,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44510.0, 300 sec: 44153.5). Total num frames: 2173943808. Throughput: 0: 44138.8. Samples: 2076913800. Policy #0 lag: (min: 1.0, avg: 10.1, max: 22.0) [2024-06-28 02:52:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:52:25,651][06909] Updated weights for policy 0, policy_version 132693 (0.0024) [2024-06-28 02:52:28,850][06674] Fps is (10 sec: 45874.4, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 2174173184. Throughput: 0: 44109.2. Samples: 2077047340. Policy #0 lag: (min: 1.0, avg: 10.1, max: 22.0) [2024-06-28 02:52:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 02:52:29,325][06909] Updated weights for policy 0, policy_version 132703 (0.0024) [2024-06-28 02:52:33,028][06909] Updated weights for policy 0, policy_version 132713 (0.0030) [2024-06-28 02:52:33,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44511.4, 300 sec: 44153.5). Total num frames: 2174402560. Throughput: 0: 44389.7. Samples: 2077316900. Policy #0 lag: (min: 1.0, avg: 10.1, max: 22.0) [2024-06-28 02:52:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:52:36,810][06909] Updated weights for policy 0, policy_version 132723 (0.0026) [2024-06-28 02:52:38,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.8, 300 sec: 44209.0). Total num frames: 2174615552. Throughput: 0: 44370.2. Samples: 2077584840. Policy #0 lag: (min: 1.0, avg: 10.1, max: 22.0) [2024-06-28 02:52:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:52:40,288][06909] Updated weights for policy 0, policy_version 132733 (0.0030) [2024-06-28 02:52:43,852][06674] Fps is (10 sec: 44226.7, 60 sec: 44508.2, 300 sec: 44153.2). Total num frames: 2174844928. Throughput: 0: 44282.2. Samples: 2077716400. Policy #0 lag: (min: 1.0, avg: 10.1, max: 22.0) [2024-06-28 02:52:43,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:52:44,153][06909] Updated weights for policy 0, policy_version 132743 (0.0032) [2024-06-28 02:52:47,938][06909] Updated weights for policy 0, policy_version 132753 (0.0029) [2024-06-28 02:52:48,852][06674] Fps is (10 sec: 42588.4, 60 sec: 43961.9, 300 sec: 44097.6). Total num frames: 2175041536. Throughput: 0: 44373.2. Samples: 2077984420. Policy #0 lag: (min: 1.0, avg: 10.1, max: 22.0) [2024-06-28 02:52:48,853][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:52:51,667][06909] Updated weights for policy 0, policy_version 132763 (0.0030) [2024-06-28 02:52:53,850][06674] Fps is (10 sec: 42607.9, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 2175270912. Throughput: 0: 44188.5. Samples: 2078240980. Policy #0 lag: (min: 1.0, avg: 10.1, max: 22.0) [2024-06-28 02:52:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:52:55,492][06909] Updated weights for policy 0, policy_version 132773 (0.0034) [2024-06-28 02:52:58,850][06674] Fps is (10 sec: 45886.3, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 2175500288. Throughput: 0: 44303.6. Samples: 2078380260. Policy #0 lag: (min: 1.0, avg: 10.1, max: 22.0) [2024-06-28 02:52:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 02:52:59,236][06909] Updated weights for policy 0, policy_version 132783 (0.0029) [2024-06-28 02:53:02,775][06909] Updated weights for policy 0, policy_version 132793 (0.0032) [2024-06-28 02:53:03,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2175713280. Throughput: 0: 44187.4. Samples: 2078638440. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 02:53:03,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:53:06,578][06909] Updated weights for policy 0, policy_version 132803 (0.0034) [2024-06-28 02:53:08,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2175926272. Throughput: 0: 44305.8. Samples: 2078907560. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 02:53:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:53:10,094][06909] Updated weights for policy 0, policy_version 132813 (0.0031) [2024-06-28 02:53:12,671][06887] Signal inference workers to stop experience collection... (29600 times) [2024-06-28 02:53:12,671][06887] Signal inference workers to resume experience collection... (29600 times) [2024-06-28 02:53:12,683][06909] InferenceWorker_p0-w0: stopping experience collection (29600 times) [2024-06-28 02:53:12,683][06909] InferenceWorker_p0-w0: resuming experience collection (29600 times) [2024-06-28 02:53:13,770][06909] Updated weights for policy 0, policy_version 132823 (0.0028) [2024-06-28 02:53:13,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44509.8, 300 sec: 44153.8). Total num frames: 2176172032. Throughput: 0: 44139.2. Samples: 2079033600. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 02:53:13,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:53:17,725][06909] Updated weights for policy 0, policy_version 132833 (0.0027) [2024-06-28 02:53:18,850][06674] Fps is (10 sec: 47513.4, 60 sec: 44782.9, 300 sec: 44209.0). Total num frames: 2176401408. Throughput: 0: 44190.2. Samples: 2079305460. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 02:53:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:53:21,371][06909] Updated weights for policy 0, policy_version 132843 (0.0036) [2024-06-28 02:53:23,852][06674] Fps is (10 sec: 42589.9, 60 sec: 44235.3, 300 sec: 44209.0). Total num frames: 2176598016. Throughput: 0: 44000.3. Samples: 2079564940. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 02:53:23,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:53:25,119][06909] Updated weights for policy 0, policy_version 132853 (0.0038) [2024-06-28 02:53:28,638][06909] Updated weights for policy 0, policy_version 132863 (0.0035) [2024-06-28 02:53:28,850][06674] Fps is (10 sec: 42598.2, 60 sec: 44236.9, 300 sec: 44209.0). Total num frames: 2176827392. Throughput: 0: 43991.9. Samples: 2079695940. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 02:53:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:53:32,843][06909] Updated weights for policy 0, policy_version 132873 (0.0024) [2024-06-28 02:53:33,850][06674] Fps is (10 sec: 45884.6, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2177056768. Throughput: 0: 44104.1. Samples: 2079969000. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 02:53:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:53:36,327][06909] Updated weights for policy 0, policy_version 132883 (0.0029) [2024-06-28 02:53:38,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44236.9, 300 sec: 44264.6). Total num frames: 2177269760. Throughput: 0: 44242.7. Samples: 2080231900. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 02:53:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:53:40,086][06909] Updated weights for policy 0, policy_version 132893 (0.0029) [2024-06-28 02:53:43,607][06909] Updated weights for policy 0, policy_version 132903 (0.0034) [2024-06-28 02:53:43,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43965.3, 300 sec: 44153.5). Total num frames: 2177482752. Throughput: 0: 44058.6. Samples: 2080362900. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 02:53:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:53:47,259][06909] Updated weights for policy 0, policy_version 132913 (0.0030) [2024-06-28 02:53:48,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44784.7, 300 sec: 44209.0). Total num frames: 2177728512. Throughput: 0: 44326.3. Samples: 2080633120. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 02:53:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:53:48,857][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000132918_2177728512.pth... [2024-06-28 02:53:48,932][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000132270_2167111680.pth [2024-06-28 02:53:51,153][06909] Updated weights for policy 0, policy_version 132923 (0.0039) [2024-06-28 02:53:53,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 44209.1). Total num frames: 2177925120. Throughput: 0: 44163.0. Samples: 2080894900. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 02:53:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:53:54,940][06909] Updated weights for policy 0, policy_version 132933 (0.0035) [2024-06-28 02:53:58,541][06909] Updated weights for policy 0, policy_version 132943 (0.0040) [2024-06-28 02:53:58,850][06674] Fps is (10 sec: 42598.0, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 2178154496. Throughput: 0: 44260.8. Samples: 2081025340. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 02:53:58,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:54:02,560][06909] Updated weights for policy 0, policy_version 132953 (0.0035) [2024-06-28 02:54:03,854][06674] Fps is (10 sec: 44220.4, 60 sec: 44234.1, 300 sec: 44152.9). Total num frames: 2178367488. Throughput: 0: 44254.5. Samples: 2081297080. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 02:54:03,854][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:54:06,120][06909] Updated weights for policy 0, policy_version 132963 (0.0037) [2024-06-28 02:54:08,850][06674] Fps is (10 sec: 42598.7, 60 sec: 44236.7, 300 sec: 44264.6). Total num frames: 2178580480. Throughput: 0: 44090.3. Samples: 2081548920. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2024-06-28 02:54:08,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:54:09,859][06909] Updated weights for policy 0, policy_version 132973 (0.0032) [2024-06-28 02:54:13,410][06909] Updated weights for policy 0, policy_version 132983 (0.0037) [2024-06-28 02:54:13,850][06674] Fps is (10 sec: 44253.5, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2178809856. Throughput: 0: 44174.7. Samples: 2081683800. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2024-06-28 02:54:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:54:16,999][06909] Updated weights for policy 0, policy_version 132993 (0.0033) [2024-06-28 02:54:18,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.7, 300 sec: 44209.3). Total num frames: 2179039232. Throughput: 0: 44158.1. Samples: 2081956120. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2024-06-28 02:54:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:54:20,526][06909] Updated weights for policy 0, policy_version 133003 (0.0026) [2024-06-28 02:54:23,850][06674] Fps is (10 sec: 42597.5, 60 sec: 43965.1, 300 sec: 44153.5). Total num frames: 2179235840. Throughput: 0: 44115.7. Samples: 2082217120. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2024-06-28 02:54:23,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:54:24,539][06909] Updated weights for policy 0, policy_version 133013 (0.0036) [2024-06-28 02:54:28,346][06909] Updated weights for policy 0, policy_version 133023 (0.0036) [2024-06-28 02:54:28,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 2179481600. Throughput: 0: 44188.5. Samples: 2082351380. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2024-06-28 02:54:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:54:32,350][06909] Updated weights for policy 0, policy_version 133033 (0.0032) [2024-06-28 02:54:33,850][06674] Fps is (10 sec: 45876.1, 60 sec: 43963.7, 300 sec: 44153.8). Total num frames: 2179694592. Throughput: 0: 44002.7. Samples: 2082613240. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2024-06-28 02:54:33,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:54:35,607][06909] Updated weights for policy 0, policy_version 133043 (0.0024) [2024-06-28 02:54:38,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43690.6, 300 sec: 44097.9). Total num frames: 2179891200. Throughput: 0: 43984.8. Samples: 2082874220. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2024-06-28 02:54:38,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:54:39,773][06909] Updated weights for policy 0, policy_version 133053 (0.0032) [2024-06-28 02:54:43,087][06909] Updated weights for policy 0, policy_version 133063 (0.0048) [2024-06-28 02:54:43,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 2180136960. Throughput: 0: 44028.6. Samples: 2083006620. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2024-06-28 02:54:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:54:44,437][06887] Signal inference workers to stop experience collection... (29650 times) [2024-06-28 02:54:44,474][06909] InferenceWorker_p0-w0: stopping experience collection (29650 times) [2024-06-28 02:54:44,491][06887] Signal inference workers to resume experience collection... (29650 times) [2024-06-28 02:54:44,492][06909] InferenceWorker_p0-w0: resuming experience collection (29650 times) [2024-06-28 02:54:47,200][06909] Updated weights for policy 0, policy_version 133073 (0.0025) [2024-06-28 02:54:48,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43690.6, 300 sec: 44153.5). Total num frames: 2180349952. Throughput: 0: 43871.1. Samples: 2083271120. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2024-06-28 02:54:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:54:50,291][06909] Updated weights for policy 0, policy_version 133083 (0.0028) [2024-06-28 02:54:53,850][06674] Fps is (10 sec: 44235.9, 60 sec: 44236.7, 300 sec: 44209.0). Total num frames: 2180579328. Throughput: 0: 44370.2. Samples: 2083545580. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2024-06-28 02:54:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:54:54,245][06909] Updated weights for policy 0, policy_version 133093 (0.0037) [2024-06-28 02:54:57,770][06909] Updated weights for policy 0, policy_version 133103 (0.0035) [2024-06-28 02:54:58,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44236.9, 300 sec: 44153.6). Total num frames: 2180808704. Throughput: 0: 44299.5. Samples: 2083677280. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2024-06-28 02:54:58,850][06674] Avg episode reward: [(0, '0.405')] [2024-06-28 02:55:01,532][06909] Updated weights for policy 0, policy_version 133113 (0.0029) [2024-06-28 02:55:03,850][06674] Fps is (10 sec: 44237.3, 60 sec: 44239.5, 300 sec: 44153.5). Total num frames: 2181021696. Throughput: 0: 44156.0. Samples: 2083943140. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2024-06-28 02:55:03,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:55:04,946][06909] Updated weights for policy 0, policy_version 133123 (0.0029) [2024-06-28 02:55:08,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2181218304. Throughput: 0: 44287.6. Samples: 2084210060. Policy #0 lag: (min: 0.0, avg: 8.9, max: 19.0) [2024-06-28 02:55:08,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:55:09,182][06909] Updated weights for policy 0, policy_version 133133 (0.0028) [2024-06-28 02:55:12,607][06909] Updated weights for policy 0, policy_version 133143 (0.0030) [2024-06-28 02:55:13,853][06674] Fps is (10 sec: 42583.9, 60 sec: 43961.2, 300 sec: 44153.0). Total num frames: 2181447680. Throughput: 0: 44058.4. Samples: 2084334160. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 02:55:13,854][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:55:16,596][06909] Updated weights for policy 0, policy_version 133153 (0.0039) [2024-06-28 02:55:18,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2181677056. Throughput: 0: 44152.8. Samples: 2084600120. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 02:55:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:55:19,963][06909] Updated weights for policy 0, policy_version 133163 (0.0046) [2024-06-28 02:55:23,850][06674] Fps is (10 sec: 44252.3, 60 sec: 44237.0, 300 sec: 44153.5). Total num frames: 2181890048. Throughput: 0: 44297.5. Samples: 2084867600. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 02:55:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:55:23,875][06909] Updated weights for policy 0, policy_version 133173 (0.0028) [2024-06-28 02:55:27,543][06909] Updated weights for policy 0, policy_version 133183 (0.0035) [2024-06-28 02:55:28,852][06674] Fps is (10 sec: 45866.3, 60 sec: 44235.3, 300 sec: 44208.7). Total num frames: 2182135808. Throughput: 0: 44375.2. Samples: 2085003600. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 02:55:28,852][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 02:55:31,340][06909] Updated weights for policy 0, policy_version 133193 (0.0037) [2024-06-28 02:55:33,852][06674] Fps is (10 sec: 45865.6, 60 sec: 44235.3, 300 sec: 44153.2). Total num frames: 2182348800. Throughput: 0: 44283.0. Samples: 2085263940. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 02:55:33,852][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 02:55:34,948][06909] Updated weights for policy 0, policy_version 133203 (0.0046) [2024-06-28 02:55:38,497][06909] Updated weights for policy 0, policy_version 133213 (0.0040) [2024-06-28 02:55:38,850][06674] Fps is (10 sec: 42607.1, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 2182561792. Throughput: 0: 44274.8. Samples: 2085537940. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 02:55:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:55:42,397][06909] Updated weights for policy 0, policy_version 133223 (0.0029) [2024-06-28 02:55:43,850][06674] Fps is (10 sec: 44245.7, 60 sec: 44236.7, 300 sec: 44209.0). Total num frames: 2182791168. Throughput: 0: 44241.8. Samples: 2085668160. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 02:55:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:55:46,247][06909] Updated weights for policy 0, policy_version 133233 (0.0025) [2024-06-28 02:55:48,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.9, 300 sec: 44209.0). Total num frames: 2183004160. Throughput: 0: 44116.5. Samples: 2085928380. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 02:55:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:55:48,864][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000133240_2183004160.pth... [2024-06-28 02:55:48,914][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000132593_2172403712.pth [2024-06-28 02:55:49,754][06909] Updated weights for policy 0, policy_version 133243 (0.0042) [2024-06-28 02:55:53,630][06909] Updated weights for policy 0, policy_version 133253 (0.0045) [2024-06-28 02:55:53,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.9, 300 sec: 44209.0). Total num frames: 2183217152. Throughput: 0: 44065.1. Samples: 2086192980. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 02:55:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:55:57,254][06909] Updated weights for policy 0, policy_version 133263 (0.0032) [2024-06-28 02:55:58,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.8, 300 sec: 44153.8). Total num frames: 2183446528. Throughput: 0: 44237.7. Samples: 2086324700. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 02:55:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:55:59,357][06887] Signal inference workers to stop experience collection... (29700 times) [2024-06-28 02:55:59,392][06909] InferenceWorker_p0-w0: stopping experience collection (29700 times) [2024-06-28 02:55:59,417][06887] Signal inference workers to resume experience collection... (29700 times) [2024-06-28 02:55:59,417][06909] InferenceWorker_p0-w0: resuming experience collection (29700 times) [2024-06-28 02:56:00,794][06909] Updated weights for policy 0, policy_version 133273 (0.0033) [2024-06-28 02:56:03,850][06674] Fps is (10 sec: 45874.5, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 2183675904. Throughput: 0: 44267.6. Samples: 2086592160. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 02:56:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:56:04,547][06909] Updated weights for policy 0, policy_version 133283 (0.0042) [2024-06-28 02:56:08,287][06909] Updated weights for policy 0, policy_version 133293 (0.0026) [2024-06-28 02:56:08,856][06674] Fps is (10 sec: 45847.0, 60 sec: 44778.5, 300 sec: 44263.7). Total num frames: 2183905280. Throughput: 0: 44236.6. Samples: 2086858520. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 02:56:08,857][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:56:11,933][06909] Updated weights for policy 0, policy_version 133303 (0.0021) [2024-06-28 02:56:13,852][06674] Fps is (10 sec: 44228.2, 60 sec: 44510.9, 300 sec: 44208.7). Total num frames: 2184118272. Throughput: 0: 44216.5. Samples: 2086993340. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 02:56:13,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:56:15,581][06909] Updated weights for policy 0, policy_version 133313 (0.0029) [2024-06-28 02:56:18,850][06674] Fps is (10 sec: 42623.9, 60 sec: 44236.8, 300 sec: 44264.6). Total num frames: 2184331264. Throughput: 0: 44333.9. Samples: 2087258880. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 02:56:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:56:19,260][06909] Updated weights for policy 0, policy_version 133323 (0.0035) [2024-06-28 02:56:23,097][06909] Updated weights for policy 0, policy_version 133333 (0.0038) [2024-06-28 02:56:23,850][06674] Fps is (10 sec: 44245.6, 60 sec: 44509.8, 300 sec: 44264.6). Total num frames: 2184560640. Throughput: 0: 44064.0. Samples: 2087520820. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 02:56:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:56:26,754][06909] Updated weights for policy 0, policy_version 133343 (0.0031) [2024-06-28 02:56:28,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43965.3, 300 sec: 44209.3). Total num frames: 2184773632. Throughput: 0: 44218.7. Samples: 2087658000. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 02:56:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:56:30,597][06909] Updated weights for policy 0, policy_version 133353 (0.0033) [2024-06-28 02:56:33,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44238.3, 300 sec: 44153.5). Total num frames: 2185003008. Throughput: 0: 44150.3. Samples: 2087915140. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 02:56:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:56:34,274][06909] Updated weights for policy 0, policy_version 133363 (0.0022) [2024-06-28 02:56:37,956][06909] Updated weights for policy 0, policy_version 133373 (0.0032) [2024-06-28 02:56:38,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44509.9, 300 sec: 44264.6). Total num frames: 2185232384. Throughput: 0: 44134.2. Samples: 2088179020. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 02:56:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:56:41,831][06909] Updated weights for policy 0, policy_version 133383 (0.0042) [2024-06-28 02:56:43,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2185428992. Throughput: 0: 44260.8. Samples: 2088316440. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 02:56:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:56:45,242][06909] Updated weights for policy 0, policy_version 133393 (0.0027) [2024-06-28 02:56:48,850][06674] Fps is (10 sec: 42598.8, 60 sec: 44236.9, 300 sec: 44209.0). Total num frames: 2185658368. Throughput: 0: 44202.0. Samples: 2088581240. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 02:56:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:56:48,975][06909] Updated weights for policy 0, policy_version 133403 (0.0032) [2024-06-28 02:56:52,592][06909] Updated weights for policy 0, policy_version 133413 (0.0020) [2024-06-28 02:56:53,850][06674] Fps is (10 sec: 44237.6, 60 sec: 44236.9, 300 sec: 44209.0). Total num frames: 2185871360. Throughput: 0: 44183.0. Samples: 2088846480. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 02:56:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:56:56,400][06909] Updated weights for policy 0, policy_version 133423 (0.0037) [2024-06-28 02:56:58,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 2186100736. Throughput: 0: 44158.9. Samples: 2088980400. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 02:56:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:57:00,062][06909] Updated weights for policy 0, policy_version 133433 (0.0042) [2024-06-28 02:57:03,744][06909] Updated weights for policy 0, policy_version 133443 (0.0030) [2024-06-28 02:57:03,852][06674] Fps is (10 sec: 45865.2, 60 sec: 44235.4, 300 sec: 44208.7). Total num frames: 2186330112. Throughput: 0: 44104.8. Samples: 2089243680. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 02:57:03,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:57:07,667][06909] Updated weights for policy 0, policy_version 133453 (0.0031) [2024-06-28 02:57:08,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43968.2, 300 sec: 44209.0). Total num frames: 2186543104. Throughput: 0: 44200.5. Samples: 2089509840. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 02:57:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:57:11,156][06909] Updated weights for policy 0, policy_version 133463 (0.0043) [2024-06-28 02:57:13,850][06674] Fps is (10 sec: 44246.0, 60 sec: 44238.4, 300 sec: 44264.6). Total num frames: 2186772480. Throughput: 0: 44079.6. Samples: 2089641580. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 02:57:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:57:14,930][06909] Updated weights for policy 0, policy_version 133473 (0.0030) [2024-06-28 02:57:18,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.9, 300 sec: 44153.5). Total num frames: 2186969088. Throughput: 0: 44332.1. Samples: 2089910080. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 02:57:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:57:18,931][06909] Updated weights for policy 0, policy_version 133483 (0.0031) [2024-06-28 02:57:22,316][06909] Updated weights for policy 0, policy_version 133493 (0.0025) [2024-06-28 02:57:23,850][06674] Fps is (10 sec: 44236.2, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 2187214848. Throughput: 0: 44163.5. Samples: 2090166380. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 02:57:23,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 02:57:26,235][06909] Updated weights for policy 0, policy_version 133503 (0.0020) [2024-06-28 02:57:28,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2187411456. Throughput: 0: 44064.5. Samples: 2090299340. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 02:57:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:57:29,835][06909] Updated weights for policy 0, policy_version 133513 (0.0032) [2024-06-28 02:57:30,908][06887] Signal inference workers to stop experience collection... (29750 times) [2024-06-28 02:57:30,908][06887] Signal inference workers to resume experience collection... (29750 times) [2024-06-28 02:57:30,949][06909] InferenceWorker_p0-w0: stopping experience collection (29750 times) [2024-06-28 02:57:30,949][06909] InferenceWorker_p0-w0: resuming experience collection (29750 times) [2024-06-28 02:57:33,617][06909] Updated weights for policy 0, policy_version 133523 (0.0028) [2024-06-28 02:57:33,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.7, 300 sec: 44209.0). Total num frames: 2187657216. Throughput: 0: 44066.4. Samples: 2090564240. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 02:57:33,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:57:37,232][06909] Updated weights for policy 0, policy_version 133533 (0.0040) [2024-06-28 02:57:38,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43963.7, 300 sec: 44153.8). Total num frames: 2187870208. Throughput: 0: 44050.5. Samples: 2090828760. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 02:57:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:57:41,024][06909] Updated weights for policy 0, policy_version 133543 (0.0028) [2024-06-28 02:57:43,850][06674] Fps is (10 sec: 40960.6, 60 sec: 43963.8, 300 sec: 44153.8). Total num frames: 2188066816. Throughput: 0: 43869.8. Samples: 2090954540. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 02:57:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:57:44,889][06909] Updated weights for policy 0, policy_version 133553 (0.0044) [2024-06-28 02:57:48,507][06909] Updated weights for policy 0, policy_version 133563 (0.0026) [2024-06-28 02:57:48,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.6, 300 sec: 44153.5). Total num frames: 2188296192. Throughput: 0: 43925.9. Samples: 2091220260. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 02:57:48,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:57:48,957][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000133564_2188312576.pth... [2024-06-28 02:57:48,997][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000132918_2177728512.pth [2024-06-28 02:57:52,178][06909] Updated weights for policy 0, policy_version 133573 (0.0020) [2024-06-28 02:57:53,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44236.6, 300 sec: 44153.5). Total num frames: 2188525568. Throughput: 0: 43831.9. Samples: 2091482280. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 02:57:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:57:56,166][06909] Updated weights for policy 0, policy_version 133583 (0.0038) [2024-06-28 02:57:58,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2188738560. Throughput: 0: 43774.9. Samples: 2091611460. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 02:57:58,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:57:59,798][06909] Updated weights for policy 0, policy_version 133593 (0.0023) [2024-06-28 02:58:03,387][06909] Updated weights for policy 0, policy_version 133603 (0.0033) [2024-06-28 02:58:03,850][06674] Fps is (10 sec: 44237.6, 60 sec: 43965.3, 300 sec: 44209.0). Total num frames: 2188967936. Throughput: 0: 43876.5. Samples: 2091884520. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 02:58:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:58:06,982][06909] Updated weights for policy 0, policy_version 133613 (0.0029) [2024-06-28 02:58:08,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 2189197312. Throughput: 0: 44082.2. Samples: 2092150080. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 02:58:08,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:58:10,633][06909] Updated weights for policy 0, policy_version 133623 (0.0039) [2024-06-28 02:58:13,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2189393920. Throughput: 0: 44003.9. Samples: 2092279520. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 02:58:13,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 02:58:14,263][06909] Updated weights for policy 0, policy_version 133633 (0.0046) [2024-06-28 02:58:18,348][06909] Updated weights for policy 0, policy_version 133643 (0.0030) [2024-06-28 02:58:18,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44509.7, 300 sec: 44209.3). Total num frames: 2189639680. Throughput: 0: 44179.1. Samples: 2092552300. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-28 02:58:18,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 02:58:22,048][06909] Updated weights for policy 0, policy_version 133653 (0.0039) [2024-06-28 02:58:23,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.7, 300 sec: 44097.9). Total num frames: 2189836288. Throughput: 0: 44125.3. Samples: 2092814400. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-28 02:58:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:58:25,567][06909] Updated weights for policy 0, policy_version 133663 (0.0038) [2024-06-28 02:58:28,850][06674] Fps is (10 sec: 42599.6, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2190065664. Throughput: 0: 44164.1. Samples: 2092941920. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-28 02:58:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:58:29,239][06909] Updated weights for policy 0, policy_version 133673 (0.0034) [2024-06-28 02:58:33,205][06909] Updated weights for policy 0, policy_version 133683 (0.0045) [2024-06-28 02:58:33,585][06887] Signal inference workers to stop experience collection... (29800 times) [2024-06-28 02:58:33,631][06909] InferenceWorker_p0-w0: stopping experience collection (29800 times) [2024-06-28 02:58:33,637][06887] Signal inference workers to resume experience collection... (29800 times) [2024-06-28 02:58:33,644][06909] InferenceWorker_p0-w0: resuming experience collection (29800 times) [2024-06-28 02:58:33,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43963.9, 300 sec: 44153.5). Total num frames: 2190295040. Throughput: 0: 44145.0. Samples: 2093206780. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-28 02:58:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:58:36,878][06909] Updated weights for policy 0, policy_version 133693 (0.0031) [2024-06-28 02:58:38,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2190508032. Throughput: 0: 44168.1. Samples: 2093469840. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-28 02:58:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:58:40,427][06909] Updated weights for policy 0, policy_version 133703 (0.0043) [2024-06-28 02:58:43,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 2190737408. Throughput: 0: 44201.0. Samples: 2093600500. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-28 02:58:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 02:58:44,109][06909] Updated weights for policy 0, policy_version 133713 (0.0034) [2024-06-28 02:58:48,017][06909] Updated weights for policy 0, policy_version 133723 (0.0031) [2024-06-28 02:58:48,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44509.9, 300 sec: 44209.0). Total num frames: 2190966784. Throughput: 0: 44092.8. Samples: 2093868700. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-28 02:58:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:58:52,130][06909] Updated weights for policy 0, policy_version 133733 (0.0033) [2024-06-28 02:58:53,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2191163392. Throughput: 0: 43903.7. Samples: 2094125740. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-28 02:58:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:58:55,584][06909] Updated weights for policy 0, policy_version 133743 (0.0031) [2024-06-28 02:58:58,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43963.9, 300 sec: 44098.5). Total num frames: 2191376384. Throughput: 0: 43937.9. Samples: 2094256720. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-28 02:58:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:58:59,190][06909] Updated weights for policy 0, policy_version 133753 (0.0023) [2024-06-28 02:59:02,840][06909] Updated weights for policy 0, policy_version 133763 (0.0033) [2024-06-28 02:59:03,850][06674] Fps is (10 sec: 45874.5, 60 sec: 44236.7, 300 sec: 44209.0). Total num frames: 2191622144. Throughput: 0: 43976.5. Samples: 2094531240. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-28 02:59:03,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 02:59:06,518][06909] Updated weights for policy 0, policy_version 133773 (0.0032) [2024-06-28 02:59:08,851][06674] Fps is (10 sec: 45867.7, 60 sec: 43962.7, 300 sec: 44153.3). Total num frames: 2191835136. Throughput: 0: 43898.1. Samples: 2094789880. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-28 02:59:08,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:59:10,489][06909] Updated weights for policy 0, policy_version 133783 (0.0030) [2024-06-28 02:59:13,850][06674] Fps is (10 sec: 42598.8, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2192048128. Throughput: 0: 43999.4. Samples: 2094921900. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-28 02:59:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:59:14,135][06909] Updated weights for policy 0, policy_version 133793 (0.0036) [2024-06-28 02:59:17,580][06909] Updated weights for policy 0, policy_version 133803 (0.0034) [2024-06-28 02:59:18,850][06674] Fps is (10 sec: 44243.5, 60 sec: 43963.8, 300 sec: 44209.1). Total num frames: 2192277504. Throughput: 0: 44045.7. Samples: 2095188840. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-28 02:59:18,853][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 02:59:21,285][06909] Updated weights for policy 0, policy_version 133813 (0.0032) [2024-06-28 02:59:23,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2192474112. Throughput: 0: 44162.3. Samples: 2095457140. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 02:59:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:59:24,770][06909] Updated weights for policy 0, policy_version 133823 (0.0031) [2024-06-28 02:59:28,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2192703488. Throughput: 0: 44235.1. Samples: 2095591080. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 02:59:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:59:29,154][06909] Updated weights for policy 0, policy_version 133833 (0.0032) [2024-06-28 02:59:32,349][06909] Updated weights for policy 0, policy_version 133843 (0.0033) [2024-06-28 02:59:33,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 2192932864. Throughput: 0: 43996.0. Samples: 2095848520. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 02:59:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:59:36,244][06909] Updated weights for policy 0, policy_version 133853 (0.0036) [2024-06-28 02:59:38,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 2193162240. Throughput: 0: 44513.4. Samples: 2096128840. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 02:59:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 02:59:39,388][06909] Updated weights for policy 0, policy_version 133863 (0.0030) [2024-06-28 02:59:43,375][06909] Updated weights for policy 0, policy_version 133873 (0.0032) [2024-06-28 02:59:43,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.7, 300 sec: 44209.0). Total num frames: 2193391616. Throughput: 0: 44538.6. Samples: 2096260960. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 02:59:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:59:47,196][06909] Updated weights for policy 0, policy_version 133883 (0.0029) [2024-06-28 02:59:48,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2193604608. Throughput: 0: 44175.6. Samples: 2096519140. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 02:59:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 02:59:48,857][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000133887_2193604608.pth... [2024-06-28 02:59:48,896][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000133240_2183004160.pth [2024-06-28 02:59:50,833][06909] Updated weights for policy 0, policy_version 133893 (0.0044) [2024-06-28 02:59:53,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 2193833984. Throughput: 0: 44386.9. Samples: 2096787220. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 02:59:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 02:59:54,367][06909] Updated weights for policy 0, policy_version 133903 (0.0030) [2024-06-28 02:59:58,368][06909] Updated weights for policy 0, policy_version 133913 (0.0035) [2024-06-28 02:59:58,850][06674] Fps is (10 sec: 42599.1, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2194030592. Throughput: 0: 44393.4. Samples: 2096919600. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 02:59:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:00:01,849][06909] Updated weights for policy 0, policy_version 133923 (0.0043) [2024-06-28 03:00:03,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.9, 300 sec: 44209.1). Total num frames: 2194259968. Throughput: 0: 44262.8. Samples: 2097180660. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 03:00:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:00:05,922][06909] Updated weights for policy 0, policy_version 133933 (0.0038) [2024-06-28 03:00:08,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44238.0, 300 sec: 44209.6). Total num frames: 2194489344. Throughput: 0: 44256.0. Samples: 2097448660. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 03:00:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 03:00:09,247][06909] Updated weights for policy 0, policy_version 133943 (0.0027) [2024-06-28 03:00:13,196][06909] Updated weights for policy 0, policy_version 133953 (0.0030) [2024-06-28 03:00:13,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44509.9, 300 sec: 44209.0). Total num frames: 2194718720. Throughput: 0: 44216.4. Samples: 2097580820. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 03:00:13,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:00:14,828][06887] Signal inference workers to stop experience collection... (29850 times) [2024-06-28 03:00:14,828][06887] Signal inference workers to resume experience collection... (29850 times) [2024-06-28 03:00:14,863][06909] InferenceWorker_p0-w0: stopping experience collection (29850 times) [2024-06-28 03:00:14,863][06909] InferenceWorker_p0-w0: resuming experience collection (29850 times) [2024-06-28 03:00:16,740][06909] Updated weights for policy 0, policy_version 133963 (0.0026) [2024-06-28 03:00:18,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.9, 300 sec: 44209.0). Total num frames: 2194931712. Throughput: 0: 44424.5. Samples: 2097847620. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 03:00:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:00:20,476][06909] Updated weights for policy 0, policy_version 133973 (0.0025) [2024-06-28 03:00:23,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44782.9, 300 sec: 44153.8). Total num frames: 2195161088. Throughput: 0: 44103.9. Samples: 2098113520. Policy #0 lag: (min: 1.0, avg: 9.3, max: 22.0) [2024-06-28 03:00:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 03:00:24,005][06909] Updated weights for policy 0, policy_version 133983 (0.0042) [2024-06-28 03:00:27,625][06909] Updated weights for policy 0, policy_version 133993 (0.0026) [2024-06-28 03:00:28,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44509.8, 300 sec: 44153.8). Total num frames: 2195374080. Throughput: 0: 44192.0. Samples: 2098249600. Policy #0 lag: (min: 1.0, avg: 9.3, max: 22.0) [2024-06-28 03:00:28,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 03:00:31,407][06909] Updated weights for policy 0, policy_version 134003 (0.0028) [2024-06-28 03:00:33,850][06674] Fps is (10 sec: 42598.9, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 2195587072. Throughput: 0: 44320.6. Samples: 2098513560. Policy #0 lag: (min: 1.0, avg: 9.3, max: 22.0) [2024-06-28 03:00:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 03:00:35,227][06909] Updated weights for policy 0, policy_version 134013 (0.0037) [2024-06-28 03:00:38,739][06909] Updated weights for policy 0, policy_version 134023 (0.0024) [2024-06-28 03:00:38,851][06674] Fps is (10 sec: 45868.0, 60 sec: 44508.6, 300 sec: 44208.8). Total num frames: 2195832832. Throughput: 0: 44221.5. Samples: 2098777260. Policy #0 lag: (min: 1.0, avg: 9.3, max: 22.0) [2024-06-28 03:00:38,852][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 03:00:42,820][06909] Updated weights for policy 0, policy_version 134033 (0.0029) [2024-06-28 03:00:43,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 2196045824. Throughput: 0: 44272.4. Samples: 2098911860. Policy #0 lag: (min: 1.0, avg: 9.3, max: 22.0) [2024-06-28 03:00:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 03:00:46,353][06909] Updated weights for policy 0, policy_version 134043 (0.0050) [2024-06-28 03:00:48,850][06674] Fps is (10 sec: 40966.9, 60 sec: 43963.9, 300 sec: 44153.5). Total num frames: 2196242432. Throughput: 0: 44323.6. Samples: 2099175220. Policy #0 lag: (min: 1.0, avg: 9.3, max: 22.0) [2024-06-28 03:00:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:00:50,349][06909] Updated weights for policy 0, policy_version 134053 (0.0042) [2024-06-28 03:00:53,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2196471808. Throughput: 0: 44178.6. Samples: 2099436700. Policy #0 lag: (min: 1.0, avg: 9.3, max: 22.0) [2024-06-28 03:00:53,856][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 03:00:54,025][06909] Updated weights for policy 0, policy_version 134063 (0.0033) [2024-06-28 03:00:57,550][06909] Updated weights for policy 0, policy_version 134073 (0.0023) [2024-06-28 03:00:58,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2196684800. Throughput: 0: 44167.6. Samples: 2099568360. Policy #0 lag: (min: 1.0, avg: 9.3, max: 22.0) [2024-06-28 03:00:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:01:01,159][06909] Updated weights for policy 0, policy_version 134083 (0.0029) [2024-06-28 03:01:03,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44509.8, 300 sec: 44154.4). Total num frames: 2196930560. Throughput: 0: 44257.3. Samples: 2099839200. Policy #0 lag: (min: 1.0, avg: 9.3, max: 22.0) [2024-06-28 03:01:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:01:04,779][06909] Updated weights for policy 0, policy_version 134093 (0.0036) [2024-06-28 03:01:08,653][06909] Updated weights for policy 0, policy_version 134103 (0.0030) [2024-06-28 03:01:08,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44236.7, 300 sec: 44153.8). Total num frames: 2197143552. Throughput: 0: 44189.3. Samples: 2100102040. Policy #0 lag: (min: 1.0, avg: 9.3, max: 22.0) [2024-06-28 03:01:08,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:01:12,311][06909] Updated weights for policy 0, policy_version 134113 (0.0032) [2024-06-28 03:01:13,853][06674] Fps is (10 sec: 42584.5, 60 sec: 43961.3, 300 sec: 44153.0). Total num frames: 2197356544. Throughput: 0: 44181.7. Samples: 2100237920. Policy #0 lag: (min: 1.0, avg: 9.3, max: 22.0) [2024-06-28 03:01:13,854][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:01:15,934][06909] Updated weights for policy 0, policy_version 134123 (0.0042) [2024-06-28 03:01:18,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 2197585920. Throughput: 0: 44268.8. Samples: 2100505660. Policy #0 lag: (min: 1.0, avg: 9.3, max: 22.0) [2024-06-28 03:01:18,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:01:20,073][06909] Updated weights for policy 0, policy_version 134133 (0.0024) [2024-06-28 03:01:23,502][06909] Updated weights for policy 0, policy_version 134143 (0.0030) [2024-06-28 03:01:23,850][06674] Fps is (10 sec: 44251.9, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2197798912. Throughput: 0: 43948.4. Samples: 2100754860. Policy #0 lag: (min: 1.0, avg: 9.3, max: 22.0) [2024-06-28 03:01:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:01:27,594][06909] Updated weights for policy 0, policy_version 134153 (0.0029) [2024-06-28 03:01:28,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2197995520. Throughput: 0: 43942.2. Samples: 2100889260. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 03:01:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:01:29,580][06887] Signal inference workers to stop experience collection... (29900 times) [2024-06-28 03:01:29,580][06887] Signal inference workers to resume experience collection... (29900 times) [2024-06-28 03:01:29,613][06909] InferenceWorker_p0-w0: stopping experience collection (29900 times) [2024-06-28 03:01:29,613][06909] InferenceWorker_p0-w0: resuming experience collection (29900 times) [2024-06-28 03:01:31,070][06909] Updated weights for policy 0, policy_version 134163 (0.0035) [2024-06-28 03:01:33,856][06674] Fps is (10 sec: 44209.6, 60 sec: 44232.3, 300 sec: 44097.0). Total num frames: 2198241280. Throughput: 0: 43982.5. Samples: 2101154700. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 03:01:33,857][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:01:34,818][06909] Updated weights for policy 0, policy_version 134173 (0.0029) [2024-06-28 03:01:38,186][06909] Updated weights for policy 0, policy_version 134183 (0.0033) [2024-06-28 03:01:38,852][06674] Fps is (10 sec: 45866.0, 60 sec: 43690.4, 300 sec: 44153.2). Total num frames: 2198454272. Throughput: 0: 43940.2. Samples: 2101414100. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 03:01:38,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:01:42,294][06909] Updated weights for policy 0, policy_version 134193 (0.0030) [2024-06-28 03:01:43,850][06674] Fps is (10 sec: 42624.3, 60 sec: 43690.7, 300 sec: 44097.9). Total num frames: 2198667264. Throughput: 0: 44002.7. Samples: 2101548480. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 03:01:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:01:45,757][06909] Updated weights for policy 0, policy_version 134203 (0.0038) [2024-06-28 03:01:48,850][06674] Fps is (10 sec: 44245.3, 60 sec: 44236.7, 300 sec: 44153.4). Total num frames: 2198896640. Throughput: 0: 43989.7. Samples: 2101818740. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 03:01:48,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:01:48,875][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000134211_2198913024.pth... [2024-06-28 03:01:48,930][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000133564_2188312576.pth [2024-06-28 03:01:49,946][06909] Updated weights for policy 0, policy_version 134213 (0.0029) [2024-06-28 03:01:52,906][06909] Updated weights for policy 0, policy_version 134223 (0.0039) [2024-06-28 03:01:53,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2199109632. Throughput: 0: 43951.7. Samples: 2102079860. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 03:01:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:01:57,594][06909] Updated weights for policy 0, policy_version 134233 (0.0037) [2024-06-28 03:01:58,850][06674] Fps is (10 sec: 44237.6, 60 sec: 44236.8, 300 sec: 44098.3). Total num frames: 2199339008. Throughput: 0: 43837.5. Samples: 2102210460. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 03:01:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:02:00,794][06909] Updated weights for policy 0, policy_version 134243 (0.0027) [2024-06-28 03:02:03,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2199568384. Throughput: 0: 43922.8. Samples: 2102482180. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 03:02:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:02:04,677][06909] Updated weights for policy 0, policy_version 134253 (0.0027) [2024-06-28 03:02:08,177][06909] Updated weights for policy 0, policy_version 134263 (0.0028) [2024-06-28 03:02:08,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2199764992. Throughput: 0: 44176.3. Samples: 2102742800. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 03:02:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:02:11,847][06909] Updated weights for policy 0, policy_version 134273 (0.0027) [2024-06-28 03:02:13,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44239.2, 300 sec: 44209.0). Total num frames: 2200010752. Throughput: 0: 44087.6. Samples: 2102873200. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 03:02:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:02:15,424][06909] Updated weights for policy 0, policy_version 134283 (0.0035) [2024-06-28 03:02:18,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.8, 300 sec: 44042.4). Total num frames: 2200207360. Throughput: 0: 44175.3. Samples: 2103142320. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 03:02:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:02:19,423][06909] Updated weights for policy 0, policy_version 134293 (0.0027) [2024-06-28 03:02:22,612][06909] Updated weights for policy 0, policy_version 134303 (0.0031) [2024-06-28 03:02:23,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43690.5, 300 sec: 44097.9). Total num frames: 2200420352. Throughput: 0: 44250.4. Samples: 2103405280. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 03:02:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 03:02:26,974][06909] Updated weights for policy 0, policy_version 134313 (0.0048) [2024-06-28 03:02:28,856][06674] Fps is (10 sec: 45849.1, 60 sec: 44505.7, 300 sec: 44097.1). Total num frames: 2200666112. Throughput: 0: 44251.7. Samples: 2103540060. Policy #0 lag: (min: 1.0, avg: 9.6, max: 21.0) [2024-06-28 03:02:28,856][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 03:02:30,049][06909] Updated weights for policy 0, policy_version 134323 (0.0025) [2024-06-28 03:02:32,177][06887] Signal inference workers to stop experience collection... (29950 times) [2024-06-28 03:02:32,226][06887] Signal inference workers to resume experience collection... (29950 times) [2024-06-28 03:02:32,227][06909] InferenceWorker_p0-w0: stopping experience collection (29950 times) [2024-06-28 03:02:32,242][06909] InferenceWorker_p0-w0: resuming experience collection (29950 times) [2024-06-28 03:02:33,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43968.2, 300 sec: 44098.0). Total num frames: 2200879104. Throughput: 0: 44186.4. Samples: 2103807120. Policy #0 lag: (min: 1.0, avg: 9.6, max: 21.0) [2024-06-28 03:02:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:02:34,333][06909] Updated weights for policy 0, policy_version 134333 (0.0039) [2024-06-28 03:02:37,727][06909] Updated weights for policy 0, policy_version 134343 (0.0034) [2024-06-28 03:02:38,850][06674] Fps is (10 sec: 40983.0, 60 sec: 43692.1, 300 sec: 44097.9). Total num frames: 2201075712. Throughput: 0: 44097.2. Samples: 2104064240. Policy #0 lag: (min: 1.0, avg: 9.6, max: 21.0) [2024-06-28 03:02:38,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 03:02:41,596][06909] Updated weights for policy 0, policy_version 134353 (0.0025) [2024-06-28 03:02:43,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2201321472. Throughput: 0: 44052.4. Samples: 2104192820. Policy #0 lag: (min: 1.0, avg: 9.6, max: 21.0) [2024-06-28 03:02:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:02:45,612][06909] Updated weights for policy 0, policy_version 134363 (0.0028) [2024-06-28 03:02:48,852][06674] Fps is (10 sec: 47504.2, 60 sec: 44235.4, 300 sec: 44153.2). Total num frames: 2201550848. Throughput: 0: 44012.6. Samples: 2104462840. Policy #0 lag: (min: 1.0, avg: 9.6, max: 21.0) [2024-06-28 03:02:48,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 03:02:48,995][06909] Updated weights for policy 0, policy_version 134373 (0.0025) [2024-06-28 03:02:52,819][06909] Updated weights for policy 0, policy_version 134383 (0.0035) [2024-06-28 03:02:53,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 2201763840. Throughput: 0: 44212.8. Samples: 2104732380. Policy #0 lag: (min: 1.0, avg: 9.6, max: 21.0) [2024-06-28 03:02:53,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:02:56,347][06909] Updated weights for policy 0, policy_version 134393 (0.0034) [2024-06-28 03:02:58,850][06674] Fps is (10 sec: 44245.6, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 2201993216. Throughput: 0: 44112.0. Samples: 2104858240. Policy #0 lag: (min: 1.0, avg: 9.6, max: 21.0) [2024-06-28 03:02:58,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:02:59,916][06909] Updated weights for policy 0, policy_version 134403 (0.0045) [2024-06-28 03:03:03,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.5, 300 sec: 44097.9). Total num frames: 2202206208. Throughput: 0: 44231.3. Samples: 2105132740. Policy #0 lag: (min: 1.0, avg: 9.6, max: 21.0) [2024-06-28 03:03:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:03:04,107][06909] Updated weights for policy 0, policy_version 134413 (0.0024) [2024-06-28 03:03:07,485][06909] Updated weights for policy 0, policy_version 134423 (0.0034) [2024-06-28 03:03:08,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2202402816. Throughput: 0: 44145.5. Samples: 2105391820. Policy #0 lag: (min: 1.0, avg: 9.6, max: 21.0) [2024-06-28 03:03:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:03:11,389][06909] Updated weights for policy 0, policy_version 134433 (0.0033) [2024-06-28 03:03:13,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.6, 300 sec: 44098.0). Total num frames: 2202648576. Throughput: 0: 44082.7. Samples: 2105523540. Policy #0 lag: (min: 1.0, avg: 9.6, max: 21.0) [2024-06-28 03:03:13,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:03:14,660][06909] Updated weights for policy 0, policy_version 134443 (0.0041) [2024-06-28 03:03:18,646][06909] Updated weights for policy 0, policy_version 134453 (0.0031) [2024-06-28 03:03:18,850][06674] Fps is (10 sec: 47513.1, 60 sec: 44509.8, 300 sec: 44209.0). Total num frames: 2202877952. Throughput: 0: 43969.3. Samples: 2105785740. Policy #0 lag: (min: 1.0, avg: 9.6, max: 21.0) [2024-06-28 03:03:18,853][06674] Avg episode reward: [(0, '0.406')] [2024-06-28 03:03:22,275][06909] Updated weights for policy 0, policy_version 134463 (0.0028) [2024-06-28 03:03:23,850][06674] Fps is (10 sec: 42599.2, 60 sec: 44236.9, 300 sec: 44097.9). Total num frames: 2203074560. Throughput: 0: 44226.3. Samples: 2106054420. Policy #0 lag: (min: 1.0, avg: 9.6, max: 21.0) [2024-06-28 03:03:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:03:26,338][06909] Updated weights for policy 0, policy_version 134473 (0.0031) [2024-06-28 03:03:28,850][06674] Fps is (10 sec: 44237.3, 60 sec: 44241.0, 300 sec: 44153.5). Total num frames: 2203320320. Throughput: 0: 44283.2. Samples: 2106185560. Policy #0 lag: (min: 1.0, avg: 9.6, max: 21.0) [2024-06-28 03:03:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:03:29,439][06909] Updated weights for policy 0, policy_version 134483 (0.0028) [2024-06-28 03:03:33,667][06909] Updated weights for policy 0, policy_version 134493 (0.0023) [2024-06-28 03:03:33,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 2203533312. Throughput: 0: 44275.7. Samples: 2106455160. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2024-06-28 03:03:33,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:03:36,621][06909] Updated weights for policy 0, policy_version 134503 (0.0035) [2024-06-28 03:03:38,850][06674] Fps is (10 sec: 40960.0, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 2203729920. Throughput: 0: 44145.5. Samples: 2106718920. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2024-06-28 03:03:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:03:41,113][06909] Updated weights for policy 0, policy_version 134513 (0.0031) [2024-06-28 03:03:43,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 2203992064. Throughput: 0: 44379.1. Samples: 2106855300. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2024-06-28 03:03:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:03:44,220][06909] Updated weights for policy 0, policy_version 134523 (0.0034) [2024-06-28 03:03:48,285][06887] Signal inference workers to stop experience collection... (30000 times) [2024-06-28 03:03:48,285][06887] Signal inference workers to resume experience collection... (30000 times) [2024-06-28 03:03:48,320][06909] InferenceWorker_p0-w0: stopping experience collection (30000 times) [2024-06-28 03:03:48,321][06909] InferenceWorker_p0-w0: resuming experience collection (30000 times) [2024-06-28 03:03:48,426][06909] Updated weights for policy 0, policy_version 134533 (0.0029) [2024-06-28 03:03:48,850][06674] Fps is (10 sec: 45874.5, 60 sec: 43965.2, 300 sec: 44153.5). Total num frames: 2204188672. Throughput: 0: 44042.3. Samples: 2107114640. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2024-06-28 03:03:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 03:03:48,902][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000134534_2204205056.pth... [2024-06-28 03:03:48,963][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000133887_2193604608.pth [2024-06-28 03:03:52,488][06909] Updated weights for policy 0, policy_version 134543 (0.0031) [2024-06-28 03:03:53,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2204401664. Throughput: 0: 44075.5. Samples: 2107375220. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2024-06-28 03:03:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 03:03:55,835][06909] Updated weights for policy 0, policy_version 134553 (0.0025) [2024-06-28 03:03:58,850][06674] Fps is (10 sec: 45875.7, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2204647424. Throughput: 0: 44072.1. Samples: 2107506780. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2024-06-28 03:03:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:03:59,862][06909] Updated weights for policy 0, policy_version 134563 (0.0038) [2024-06-28 03:04:03,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.9, 300 sec: 44098.2). Total num frames: 2204844032. Throughput: 0: 44228.6. Samples: 2107776020. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2024-06-28 03:04:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:04:03,858][06909] Updated weights for policy 0, policy_version 134573 (0.0035) [2024-06-28 03:04:07,216][06909] Updated weights for policy 0, policy_version 134583 (0.0041) [2024-06-28 03:04:08,852][06674] Fps is (10 sec: 42589.8, 60 sec: 44508.3, 300 sec: 44153.2). Total num frames: 2205073408. Throughput: 0: 44067.8. Samples: 2108037560. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2024-06-28 03:04:08,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:04:11,579][06909] Updated weights for policy 0, policy_version 134593 (0.0046) [2024-06-28 03:04:13,850][06674] Fps is (10 sec: 47513.2, 60 sec: 44510.0, 300 sec: 44209.0). Total num frames: 2205319168. Throughput: 0: 44179.9. Samples: 2108173660. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2024-06-28 03:04:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 03:04:14,490][06909] Updated weights for policy 0, policy_version 134603 (0.0032) [2024-06-28 03:04:18,755][06909] Updated weights for policy 0, policy_version 134613 (0.0038) [2024-06-28 03:04:18,850][06674] Fps is (10 sec: 42607.2, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 2205499392. Throughput: 0: 44120.1. Samples: 2108440560. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2024-06-28 03:04:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:04:22,194][06909] Updated weights for policy 0, policy_version 134623 (0.0022) [2024-06-28 03:04:23,850][06674] Fps is (10 sec: 39321.8, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2205712384. Throughput: 0: 43994.2. Samples: 2108698660. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2024-06-28 03:04:23,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 03:04:26,015][06909] Updated weights for policy 0, policy_version 134633 (0.0038) [2024-06-28 03:04:28,850][06674] Fps is (10 sec: 47513.2, 60 sec: 44236.7, 300 sec: 44209.0). Total num frames: 2205974528. Throughput: 0: 43916.4. Samples: 2108831540. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2024-06-28 03:04:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:04:29,553][06909] Updated weights for policy 0, policy_version 134643 (0.0038) [2024-06-28 03:04:33,178][06909] Updated weights for policy 0, policy_version 134653 (0.0027) [2024-06-28 03:04:33,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 2206171136. Throughput: 0: 44040.1. Samples: 2109096440. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 03:04:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 03:04:37,174][06909] Updated weights for policy 0, policy_version 134663 (0.0031) [2024-06-28 03:04:38,850][06674] Fps is (10 sec: 40960.4, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2206384128. Throughput: 0: 44092.9. Samples: 2109359400. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 03:04:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 03:04:40,939][06909] Updated weights for policy 0, policy_version 134673 (0.0030) [2024-06-28 03:04:43,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2206629888. Throughput: 0: 44155.5. Samples: 2109493780. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 03:04:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:04:44,389][06909] Updated weights for policy 0, policy_version 134683 (0.0034) [2024-06-28 03:04:48,331][06909] Updated weights for policy 0, policy_version 134693 (0.0024) [2024-06-28 03:04:48,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.8, 300 sec: 43986.9). Total num frames: 2206810112. Throughput: 0: 44041.8. Samples: 2109757900. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 03:04:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:04:51,687][06909] Updated weights for policy 0, policy_version 134703 (0.0024) [2024-06-28 03:04:53,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2207039488. Throughput: 0: 44093.5. Samples: 2110021680. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 03:04:53,851][06674] Avg episode reward: [(0, '0.401')] [2024-06-28 03:04:55,802][06909] Updated weights for policy 0, policy_version 134713 (0.0027) [2024-06-28 03:04:58,850][06674] Fps is (10 sec: 47512.9, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2207285248. Throughput: 0: 44012.4. Samples: 2110154220. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 03:04:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 03:04:59,020][06909] Updated weights for policy 0, policy_version 134723 (0.0033) [2024-06-28 03:05:03,233][06909] Updated weights for policy 0, policy_version 134733 (0.0043) [2024-06-28 03:05:03,850][06674] Fps is (10 sec: 45873.5, 60 sec: 44236.4, 300 sec: 44097.9). Total num frames: 2207498240. Throughput: 0: 43873.7. Samples: 2110414900. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 03:05:03,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:05:05,099][06887] Signal inference workers to stop experience collection... (30050 times) [2024-06-28 03:05:05,136][06909] InferenceWorker_p0-w0: stopping experience collection (30050 times) [2024-06-28 03:05:05,218][06887] Signal inference workers to resume experience collection... (30050 times) [2024-06-28 03:05:05,219][06909] InferenceWorker_p0-w0: resuming experience collection (30050 times) [2024-06-28 03:05:07,175][06909] Updated weights for policy 0, policy_version 134743 (0.0029) [2024-06-28 03:05:08,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43965.2, 300 sec: 44042.4). Total num frames: 2207711232. Throughput: 0: 43949.8. Samples: 2110676400. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 03:05:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 03:05:10,473][06909] Updated weights for policy 0, policy_version 134753 (0.0031) [2024-06-28 03:05:13,850][06674] Fps is (10 sec: 44238.6, 60 sec: 43690.6, 300 sec: 44097.9). Total num frames: 2207940608. Throughput: 0: 43907.1. Samples: 2110807360. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 03:05:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:05:14,366][06909] Updated weights for policy 0, policy_version 134763 (0.0022) [2024-06-28 03:05:18,202][06909] Updated weights for policy 0, policy_version 134773 (0.0034) [2024-06-28 03:05:18,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2208153600. Throughput: 0: 43927.5. Samples: 2111073180. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 03:05:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 03:05:21,620][06909] Updated weights for policy 0, policy_version 134783 (0.0032) [2024-06-28 03:05:23,856][06674] Fps is (10 sec: 42573.1, 60 sec: 44232.4, 300 sec: 44041.5). Total num frames: 2208366592. Throughput: 0: 43969.6. Samples: 2111338300. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 03:05:23,856][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 03:05:26,008][06909] Updated weights for policy 0, policy_version 134793 (0.0039) [2024-06-28 03:05:28,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.7, 300 sec: 44097.9). Total num frames: 2208595968. Throughput: 0: 43793.4. Samples: 2111464480. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 03:05:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:05:28,896][06909] Updated weights for policy 0, policy_version 134803 (0.0027) [2024-06-28 03:05:33,193][06909] Updated weights for policy 0, policy_version 134813 (0.0045) [2024-06-28 03:05:33,850][06674] Fps is (10 sec: 42623.7, 60 sec: 43690.6, 300 sec: 43931.6). Total num frames: 2208792576. Throughput: 0: 43926.1. Samples: 2111734580. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 03:05:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:05:36,654][06909] Updated weights for policy 0, policy_version 134823 (0.0032) [2024-06-28 03:05:38,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 2209021952. Throughput: 0: 43696.4. Samples: 2111988020. Policy #0 lag: (min: 0.0, avg: 12.8, max: 23.0) [2024-06-28 03:05:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:05:40,449][06909] Updated weights for policy 0, policy_version 134833 (0.0025) [2024-06-28 03:05:43,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43690.7, 300 sec: 44097.9). Total num frames: 2209251328. Throughput: 0: 43786.7. Samples: 2112124620. Policy #0 lag: (min: 0.0, avg: 12.8, max: 23.0) [2024-06-28 03:05:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:05:44,265][06909] Updated weights for policy 0, policy_version 134843 (0.0036) [2024-06-28 03:05:47,661][06909] Updated weights for policy 0, policy_version 134853 (0.0041) [2024-06-28 03:05:48,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.6, 300 sec: 44042.4). Total num frames: 2209464320. Throughput: 0: 43927.8. Samples: 2112391640. Policy #0 lag: (min: 0.0, avg: 12.8, max: 23.0) [2024-06-28 03:05:48,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:05:48,858][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000134855_2209464320.pth... [2024-06-28 03:05:48,936][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000134211_2198913024.pth [2024-06-28 03:05:51,654][06909] Updated weights for policy 0, policy_version 134863 (0.0031) [2024-06-28 03:05:53,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2209677312. Throughput: 0: 44005.2. Samples: 2112656640. Policy #0 lag: (min: 0.0, avg: 12.8, max: 23.0) [2024-06-28 03:05:53,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 03:05:55,240][06909] Updated weights for policy 0, policy_version 134873 (0.0040) [2024-06-28 03:05:58,846][06909] Updated weights for policy 0, policy_version 134883 (0.0032) [2024-06-28 03:05:58,850][06674] Fps is (10 sec: 45876.2, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2209923072. Throughput: 0: 44117.0. Samples: 2112792620. Policy #0 lag: (min: 0.0, avg: 12.8, max: 23.0) [2024-06-28 03:05:58,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-28 03:06:03,247][06909] Updated weights for policy 0, policy_version 134893 (0.0035) [2024-06-28 03:06:03,850][06674] Fps is (10 sec: 44237.6, 60 sec: 43691.0, 300 sec: 43986.9). Total num frames: 2210119680. Throughput: 0: 44031.2. Samples: 2113054580. Policy #0 lag: (min: 0.0, avg: 12.8, max: 23.0) [2024-06-28 03:06:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:06:06,319][06909] Updated weights for policy 0, policy_version 134903 (0.0036) [2024-06-28 03:06:08,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.7, 300 sec: 43987.4). Total num frames: 2210332672. Throughput: 0: 43987.2. Samples: 2113317460. Policy #0 lag: (min: 0.0, avg: 12.8, max: 23.0) [2024-06-28 03:06:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:06:10,415][06909] Updated weights for policy 0, policy_version 134913 (0.0038) [2024-06-28 03:06:13,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2210562048. Throughput: 0: 44060.8. Samples: 2113447220. Policy #0 lag: (min: 0.0, avg: 12.8, max: 23.0) [2024-06-28 03:06:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:06:14,144][06909] Updated weights for policy 0, policy_version 134923 (0.0037) [2024-06-28 03:06:17,653][06909] Updated weights for policy 0, policy_version 134933 (0.0043) [2024-06-28 03:06:18,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2210775040. Throughput: 0: 43888.9. Samples: 2113709580. Policy #0 lag: (min: 0.0, avg: 12.8, max: 23.0) [2024-06-28 03:06:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:06:19,219][06887] Signal inference workers to stop experience collection... (30100 times) [2024-06-28 03:06:19,272][06909] InferenceWorker_p0-w0: stopping experience collection (30100 times) [2024-06-28 03:06:19,271][06887] Signal inference workers to resume experience collection... (30100 times) [2024-06-28 03:06:19,285][06909] InferenceWorker_p0-w0: resuming experience collection (30100 times) [2024-06-28 03:06:21,488][06909] Updated weights for policy 0, policy_version 134943 (0.0039) [2024-06-28 03:06:23,856][06674] Fps is (10 sec: 44210.6, 60 sec: 43963.7, 300 sec: 44097.1). Total num frames: 2211004416. Throughput: 0: 44102.6. Samples: 2113972900. Policy #0 lag: (min: 0.0, avg: 12.8, max: 23.0) [2024-06-28 03:06:23,856][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:06:24,950][06909] Updated weights for policy 0, policy_version 134953 (0.0032) [2024-06-28 03:06:28,753][06909] Updated weights for policy 0, policy_version 134963 (0.0025) [2024-06-28 03:06:28,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.7, 300 sec: 44043.3). Total num frames: 2211233792. Throughput: 0: 44031.1. Samples: 2114106020. Policy #0 lag: (min: 0.0, avg: 12.8, max: 23.0) [2024-06-28 03:06:28,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:06:32,449][06909] Updated weights for policy 0, policy_version 134973 (0.0034) [2024-06-28 03:06:33,850][06674] Fps is (10 sec: 44263.6, 60 sec: 44236.9, 300 sec: 44042.7). Total num frames: 2211446784. Throughput: 0: 44030.5. Samples: 2114373000. Policy #0 lag: (min: 0.0, avg: 12.8, max: 23.0) [2024-06-28 03:06:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:06:35,934][06909] Updated weights for policy 0, policy_version 134983 (0.0022) [2024-06-28 03:06:38,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2211643392. Throughput: 0: 44061.4. Samples: 2114639400. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 03:06:38,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 03:06:40,148][06909] Updated weights for policy 0, policy_version 134993 (0.0047) [2024-06-28 03:06:43,539][06909] Updated weights for policy 0, policy_version 135003 (0.0028) [2024-06-28 03:06:43,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2211889152. Throughput: 0: 43842.2. Samples: 2114765520. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 03:06:43,853][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:06:47,423][06909] Updated weights for policy 0, policy_version 135013 (0.0038) [2024-06-28 03:06:48,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43963.9, 300 sec: 44042.4). Total num frames: 2212102144. Throughput: 0: 43898.7. Samples: 2115030020. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 03:06:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:06:51,306][06909] Updated weights for policy 0, policy_version 135023 (0.0031) [2024-06-28 03:06:53,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 2212331520. Throughput: 0: 44032.9. Samples: 2115298940. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 03:06:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:06:54,725][06909] Updated weights for policy 0, policy_version 135033 (0.0028) [2024-06-28 03:06:58,495][06909] Updated weights for policy 0, policy_version 135043 (0.0048) [2024-06-28 03:06:58,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 2212544512. Throughput: 0: 43957.0. Samples: 2115425280. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 03:06:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:07:01,979][06909] Updated weights for policy 0, policy_version 135053 (0.0040) [2024-06-28 03:07:03,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2212773888. Throughput: 0: 44105.4. Samples: 2115694320. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 03:07:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:07:05,654][06909] Updated weights for policy 0, policy_version 135063 (0.0037) [2024-06-28 03:07:08,850][06674] Fps is (10 sec: 45875.8, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 2213003264. Throughput: 0: 44293.1. Samples: 2115965820. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 03:07:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:07:09,821][06909] Updated weights for policy 0, policy_version 135073 (0.0030) [2024-06-28 03:07:12,857][06909] Updated weights for policy 0, policy_version 135083 (0.0036) [2024-06-28 03:07:13,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2213199872. Throughput: 0: 44216.9. Samples: 2116095780. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 03:07:13,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:07:17,318][06909] Updated weights for policy 0, policy_version 135093 (0.0031) [2024-06-28 03:07:18,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 2213445632. Throughput: 0: 44123.5. Samples: 2116358560. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 03:07:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:07:20,119][06909] Updated weights for policy 0, policy_version 135103 (0.0033) [2024-06-28 03:07:23,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43968.0, 300 sec: 43987.7). Total num frames: 2213642240. Throughput: 0: 44012.4. Samples: 2116619960. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 03:07:23,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 03:07:24,579][06909] Updated weights for policy 0, policy_version 135113 (0.0029) [2024-06-28 03:07:28,130][06909] Updated weights for policy 0, policy_version 135123 (0.0034) [2024-06-28 03:07:28,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2213871616. Throughput: 0: 44016.9. Samples: 2116746280. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 03:07:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:07:32,053][06909] Updated weights for policy 0, policy_version 135133 (0.0031) [2024-06-28 03:07:33,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2214068224. Throughput: 0: 43977.7. Samples: 2117009020. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 03:07:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:07:34,182][06887] Signal inference workers to stop experience collection... (30150 times) [2024-06-28 03:07:34,204][06909] InferenceWorker_p0-w0: stopping experience collection (30150 times) [2024-06-28 03:07:34,242][06887] Signal inference workers to resume experience collection... (30150 times) [2024-06-28 03:07:34,242][06909] InferenceWorker_p0-w0: resuming experience collection (30150 times) [2024-06-28 03:07:35,465][06909] Updated weights for policy 0, policy_version 135143 (0.0025) [2024-06-28 03:07:38,856][06674] Fps is (10 sec: 42572.5, 60 sec: 44232.4, 300 sec: 43986.0). Total num frames: 2214297600. Throughput: 0: 44030.9. Samples: 2117280600. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 03:07:38,857][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:07:39,295][06909] Updated weights for policy 0, policy_version 135153 (0.0037) [2024-06-28 03:07:42,841][06909] Updated weights for policy 0, policy_version 135163 (0.0038) [2024-06-28 03:07:43,855][06674] Fps is (10 sec: 45849.7, 60 sec: 43959.7, 300 sec: 43986.3). Total num frames: 2214526976. Throughput: 0: 44131.9. Samples: 2117411460. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 03:07:43,856][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:07:46,835][06909] Updated weights for policy 0, policy_version 135173 (0.0034) [2024-06-28 03:07:48,850][06674] Fps is (10 sec: 45903.0, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2214756352. Throughput: 0: 44029.7. Samples: 2117675660. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 03:07:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:07:48,962][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000135179_2214772736.pth... [2024-06-28 03:07:49,008][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000134534_2204205056.pth [2024-06-28 03:07:50,095][06909] Updated weights for policy 0, policy_version 135183 (0.0029) [2024-06-28 03:07:53,850][06674] Fps is (10 sec: 45901.0, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2214985728. Throughput: 0: 43845.7. Samples: 2117938880. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 03:07:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:07:54,369][06909] Updated weights for policy 0, policy_version 135193 (0.0040) [2024-06-28 03:07:57,663][06909] Updated weights for policy 0, policy_version 135203 (0.0035) [2024-06-28 03:07:58,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2215182336. Throughput: 0: 43817.8. Samples: 2118067580. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 03:07:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:08:01,691][06909] Updated weights for policy 0, policy_version 135213 (0.0032) [2024-06-28 03:08:03,850][06674] Fps is (10 sec: 42597.7, 60 sec: 43963.6, 300 sec: 44097.9). Total num frames: 2215411712. Throughput: 0: 43770.9. Samples: 2118328260. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 03:08:03,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:08:05,379][06909] Updated weights for policy 0, policy_version 135223 (0.0028) [2024-06-28 03:08:08,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2215641088. Throughput: 0: 43885.1. Samples: 2118594780. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 03:08:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 03:08:09,045][06909] Updated weights for policy 0, policy_version 135233 (0.0033) [2024-06-28 03:08:12,618][06909] Updated weights for policy 0, policy_version 135243 (0.0028) [2024-06-28 03:08:13,850][06674] Fps is (10 sec: 45875.9, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 2215870464. Throughput: 0: 44120.0. Samples: 2118731680. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 03:08:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:08:16,297][06909] Updated weights for policy 0, policy_version 135253 (0.0036) [2024-06-28 03:08:18,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2216067072. Throughput: 0: 44204.0. Samples: 2118998200. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 03:08:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 03:08:20,009][06909] Updated weights for policy 0, policy_version 135263 (0.0027) [2024-06-28 03:08:23,850][06674] Fps is (10 sec: 42596.3, 60 sec: 44236.6, 300 sec: 43986.8). Total num frames: 2216296448. Throughput: 0: 44157.5. Samples: 2119267440. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 03:08:23,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:08:23,951][06909] Updated weights for policy 0, policy_version 135273 (0.0030) [2024-06-28 03:08:27,415][06909] Updated weights for policy 0, policy_version 135283 (0.0026) [2024-06-28 03:08:28,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2216525824. Throughput: 0: 44029.5. Samples: 2119392540. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 03:08:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:08:31,471][06909] Updated weights for policy 0, policy_version 135293 (0.0038) [2024-06-28 03:08:33,850][06674] Fps is (10 sec: 44238.7, 60 sec: 44509.9, 300 sec: 44097.9). Total num frames: 2216738816. Throughput: 0: 43983.1. Samples: 2119654900. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 03:08:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:08:34,838][06909] Updated weights for policy 0, policy_version 135303 (0.0030) [2024-06-28 03:08:38,850][06674] Fps is (10 sec: 42597.8, 60 sec: 44241.2, 300 sec: 43931.3). Total num frames: 2216951808. Throughput: 0: 44022.1. Samples: 2119919880. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 03:08:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:08:39,216][06909] Updated weights for policy 0, policy_version 135313 (0.0032) [2024-06-28 03:08:42,475][06909] Updated weights for policy 0, policy_version 135323 (0.0035) [2024-06-28 03:08:43,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44240.9, 300 sec: 44042.4). Total num frames: 2217181184. Throughput: 0: 44031.9. Samples: 2120049020. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 03:08:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:08:46,515][06909] Updated weights for policy 0, policy_version 135333 (0.0029) [2024-06-28 03:08:48,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2217394176. Throughput: 0: 44177.8. Samples: 2120316260. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 03:08:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:08:49,916][06909] Updated weights for policy 0, policy_version 135343 (0.0029) [2024-06-28 03:08:53,671][06909] Updated weights for policy 0, policy_version 135353 (0.0032) [2024-06-28 03:08:53,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2217623552. Throughput: 0: 44105.8. Samples: 2120579540. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 03:08:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:08:57,260][06909] Updated weights for policy 0, policy_version 135363 (0.0027) [2024-06-28 03:08:58,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44509.8, 300 sec: 44097.9). Total num frames: 2217852928. Throughput: 0: 44035.5. Samples: 2120713280. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 03:08:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:09:00,823][06909] Updated weights for policy 0, policy_version 135373 (0.0036) [2024-06-28 03:09:03,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.9, 300 sec: 44042.7). Total num frames: 2218065920. Throughput: 0: 44238.2. Samples: 2120988920. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 03:09:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 03:09:04,889][06909] Updated weights for policy 0, policy_version 135383 (0.0035) [2024-06-28 03:09:08,545][06909] Updated weights for policy 0, policy_version 135393 (0.0042) [2024-06-28 03:09:08,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43963.6, 300 sec: 43931.3). Total num frames: 2218278912. Throughput: 0: 43968.8. Samples: 2121246020. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 03:09:08,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:09:09,648][06887] Signal inference workers to stop experience collection... (30200 times) [2024-06-28 03:09:09,649][06887] Signal inference workers to resume experience collection... (30200 times) [2024-06-28 03:09:09,670][06909] InferenceWorker_p0-w0: stopping experience collection (30200 times) [2024-06-28 03:09:09,671][06909] InferenceWorker_p0-w0: resuming experience collection (30200 times) [2024-06-28 03:09:12,176][06909] Updated weights for policy 0, policy_version 135403 (0.0027) [2024-06-28 03:09:13,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2218508288. Throughput: 0: 44097.7. Samples: 2121376940. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 03:09:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:09:15,922][06909] Updated weights for policy 0, policy_version 135413 (0.0030) [2024-06-28 03:09:18,850][06674] Fps is (10 sec: 44237.3, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2218721280. Throughput: 0: 44246.7. Samples: 2121646000. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 03:09:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:09:19,839][06909] Updated weights for policy 0, policy_version 135423 (0.0040) [2024-06-28 03:09:23,266][06909] Updated weights for policy 0, policy_version 135433 (0.0041) [2024-06-28 03:09:23,852][06674] Fps is (10 sec: 44227.9, 60 sec: 44235.6, 300 sec: 43986.6). Total num frames: 2218950656. Throughput: 0: 44227.0. Samples: 2121910180. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 03:09:23,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:09:27,146][06909] Updated weights for policy 0, policy_version 135443 (0.0035) [2024-06-28 03:09:28,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2219163648. Throughput: 0: 44429.0. Samples: 2122048320. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 03:09:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:09:30,420][06909] Updated weights for policy 0, policy_version 135453 (0.0030) [2024-06-28 03:09:33,856][06674] Fps is (10 sec: 45858.3, 60 sec: 44505.6, 300 sec: 44152.6). Total num frames: 2219409408. Throughput: 0: 44546.0. Samples: 2122321080. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 03:09:33,856][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:09:34,572][06909] Updated weights for policy 0, policy_version 135463 (0.0028) [2024-06-28 03:09:37,699][06909] Updated weights for policy 0, policy_version 135473 (0.0038) [2024-06-28 03:09:38,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2219606016. Throughput: 0: 44438.6. Samples: 2122579280. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 03:09:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:09:41,992][06909] Updated weights for policy 0, policy_version 135483 (0.0041) [2024-06-28 03:09:43,850][06674] Fps is (10 sec: 40983.7, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2219819008. Throughput: 0: 44332.5. Samples: 2122708240. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 03:09:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:09:45,545][06909] Updated weights for policy 0, policy_version 135493 (0.0031) [2024-06-28 03:09:48,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2220048384. Throughput: 0: 44004.9. Samples: 2122969140. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 03:09:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 03:09:48,899][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000135502_2220064768.pth... [2024-06-28 03:09:48,942][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000134855_2209464320.pth [2024-06-28 03:09:49,471][06909] Updated weights for policy 0, policy_version 135503 (0.0032) [2024-06-28 03:09:52,985][06909] Updated weights for policy 0, policy_version 135513 (0.0022) [2024-06-28 03:09:53,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2220277760. Throughput: 0: 44160.1. Samples: 2123233220. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 03:09:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:09:57,090][06909] Updated weights for policy 0, policy_version 135523 (0.0025) [2024-06-28 03:09:58,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.7, 300 sec: 43987.0). Total num frames: 2220474368. Throughput: 0: 44226.4. Samples: 2123367120. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 03:09:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:10:00,467][06909] Updated weights for policy 0, policy_version 135533 (0.0028) [2024-06-28 03:10:03,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2220720128. Throughput: 0: 44189.8. Samples: 2123634540. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 03:10:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:10:04,517][06909] Updated weights for policy 0, policy_version 135543 (0.0031) [2024-06-28 03:10:07,776][06909] Updated weights for policy 0, policy_version 135553 (0.0035) [2024-06-28 03:10:08,850][06674] Fps is (10 sec: 45874.5, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 2220933120. Throughput: 0: 44128.2. Samples: 2123895860. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 03:10:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:10:12,007][06909] Updated weights for policy 0, policy_version 135563 (0.0034) [2024-06-28 03:10:13,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2221146112. Throughput: 0: 43992.0. Samples: 2124027960. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 03:10:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:10:15,495][06909] Updated weights for policy 0, policy_version 135573 (0.0027) [2024-06-28 03:10:18,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.7, 300 sec: 44043.3). Total num frames: 2221359104. Throughput: 0: 43630.3. Samples: 2124284200. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 03:10:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:10:19,528][06909] Updated weights for policy 0, policy_version 135583 (0.0047) [2024-06-28 03:10:22,884][06909] Updated weights for policy 0, policy_version 135593 (0.0041) [2024-06-28 03:10:23,850][06674] Fps is (10 sec: 44236.0, 60 sec: 43965.1, 300 sec: 44042.4). Total num frames: 2221588480. Throughput: 0: 43735.9. Samples: 2124547400. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 03:10:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:10:27,274][06909] Updated weights for policy 0, policy_version 135603 (0.0035) [2024-06-28 03:10:28,682][06887] Signal inference workers to stop experience collection... (30250 times) [2024-06-28 03:10:28,727][06909] InferenceWorker_p0-w0: stopping experience collection (30250 times) [2024-06-28 03:10:28,796][06887] Signal inference workers to resume experience collection... (30250 times) [2024-06-28 03:10:28,796][06909] InferenceWorker_p0-w0: resuming experience collection (30250 times) [2024-06-28 03:10:28,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2221785088. Throughput: 0: 43740.0. Samples: 2124676540. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 03:10:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 03:10:30,165][06909] Updated weights for policy 0, policy_version 135613 (0.0032) [2024-06-28 03:10:33,856][06674] Fps is (10 sec: 44210.8, 60 sec: 43690.5, 300 sec: 44097.1). Total num frames: 2222030848. Throughput: 0: 43815.5. Samples: 2124941100. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 03:10:33,856][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:10:34,639][06909] Updated weights for policy 0, policy_version 135623 (0.0033) [2024-06-28 03:10:37,873][06909] Updated weights for policy 0, policy_version 135633 (0.0024) [2024-06-28 03:10:38,850][06674] Fps is (10 sec: 45874.3, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 2222243840. Throughput: 0: 43833.2. Samples: 2125205720. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 03:10:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:10:41,971][06909] Updated weights for policy 0, policy_version 135643 (0.0029) [2024-06-28 03:10:43,850][06674] Fps is (10 sec: 44263.9, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2222473216. Throughput: 0: 43885.8. Samples: 2125341980. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 03:10:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 03:10:45,106][06909] Updated weights for policy 0, policy_version 135653 (0.0038) [2024-06-28 03:10:48,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.6, 300 sec: 44097.9). Total num frames: 2222686208. Throughput: 0: 43933.1. Samples: 2125611540. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 03:10:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 03:10:49,409][06909] Updated weights for policy 0, policy_version 135663 (0.0041) [2024-06-28 03:10:52,203][06909] Updated weights for policy 0, policy_version 135673 (0.0035) [2024-06-28 03:10:53,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2222915584. Throughput: 0: 44034.3. Samples: 2125877400. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2024-06-28 03:10:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 03:10:56,614][06909] Updated weights for policy 0, policy_version 135683 (0.0042) [2024-06-28 03:10:58,850][06674] Fps is (10 sec: 44238.0, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2223128576. Throughput: 0: 43989.8. Samples: 2126007500. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2024-06-28 03:10:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:10:59,733][06909] Updated weights for policy 0, policy_version 135693 (0.0029) [2024-06-28 03:11:03,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 2223341568. Throughput: 0: 44071.2. Samples: 2126267400. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2024-06-28 03:11:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:11:04,247][06909] Updated weights for policy 0, policy_version 135703 (0.0023) [2024-06-28 03:11:07,134][06909] Updated weights for policy 0, policy_version 135713 (0.0025) [2024-06-28 03:11:08,854][06674] Fps is (10 sec: 44219.0, 60 sec: 43960.9, 300 sec: 44097.4). Total num frames: 2223570944. Throughput: 0: 44144.7. Samples: 2126534080. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2024-06-28 03:11:08,854][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 03:11:11,451][06909] Updated weights for policy 0, policy_version 135723 (0.0029) [2024-06-28 03:11:13,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 2223800320. Throughput: 0: 44406.2. Samples: 2126674820. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2024-06-28 03:11:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:11:14,823][06909] Updated weights for policy 0, policy_version 135733 (0.0034) [2024-06-28 03:11:18,850][06674] Fps is (10 sec: 42615.2, 60 sec: 43963.8, 300 sec: 44043.3). Total num frames: 2223996928. Throughput: 0: 44274.4. Samples: 2126933180. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2024-06-28 03:11:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:11:19,382][06909] Updated weights for policy 0, policy_version 135743 (0.0022) [2024-06-28 03:11:22,086][06909] Updated weights for policy 0, policy_version 135753 (0.0025) [2024-06-28 03:11:23,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2224242688. Throughput: 0: 44391.7. Samples: 2127203340. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2024-06-28 03:11:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 03:11:26,599][06909] Updated weights for policy 0, policy_version 135763 (0.0028) [2024-06-28 03:11:28,850][06674] Fps is (10 sec: 47513.3, 60 sec: 44782.9, 300 sec: 44153.5). Total num frames: 2224472064. Throughput: 0: 44430.5. Samples: 2127341360. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2024-06-28 03:11:28,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:11:29,203][06909] Updated weights for policy 0, policy_version 135773 (0.0044) [2024-06-28 03:11:30,408][06887] Signal inference workers to stop experience collection... (30300 times) [2024-06-28 03:11:30,456][06909] InferenceWorker_p0-w0: stopping experience collection (30300 times) [2024-06-28 03:11:30,458][06887] Signal inference workers to resume experience collection... (30300 times) [2024-06-28 03:11:30,467][06909] InferenceWorker_p0-w0: resuming experience collection (30300 times) [2024-06-28 03:11:33,852][06674] Fps is (10 sec: 40950.9, 60 sec: 43693.4, 300 sec: 44097.6). Total num frames: 2224652288. Throughput: 0: 44020.6. Samples: 2127592560. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2024-06-28 03:11:33,853][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:11:34,414][06909] Updated weights for policy 0, policy_version 135783 (0.0027) [2024-06-28 03:11:36,937][06909] Updated weights for policy 0, policy_version 135793 (0.0036) [2024-06-28 03:11:38,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2224898048. Throughput: 0: 43960.4. Samples: 2127855620. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2024-06-28 03:11:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:11:41,504][06909] Updated weights for policy 0, policy_version 135803 (0.0036) [2024-06-28 03:11:43,850][06674] Fps is (10 sec: 47524.5, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2225127424. Throughput: 0: 44245.3. Samples: 2127998540. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2024-06-28 03:11:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:11:44,135][06909] Updated weights for policy 0, policy_version 135813 (0.0029) [2024-06-28 03:11:48,757][06909] Updated weights for policy 0, policy_version 135823 (0.0032) [2024-06-28 03:11:48,852][06674] Fps is (10 sec: 42589.9, 60 sec: 43962.4, 300 sec: 44042.1). Total num frames: 2225324032. Throughput: 0: 44154.4. Samples: 2128254440. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2024-06-28 03:11:48,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:11:48,870][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000135823_2225324032.pth... [2024-06-28 03:11:48,931][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000135179_2214772736.pth [2024-06-28 03:11:51,477][06909] Updated weights for policy 0, policy_version 135833 (0.0037) [2024-06-28 03:11:53,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2225553408. Throughput: 0: 44217.7. Samples: 2128523700. Policy #0 lag: (min: 0.0, avg: 11.5, max: 23.0) [2024-06-28 03:11:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 03:11:56,463][06909] Updated weights for policy 0, policy_version 135843 (0.0025) [2024-06-28 03:11:58,850][06674] Fps is (10 sec: 47523.3, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 2225799168. Throughput: 0: 44272.5. Samples: 2128667080. Policy #0 lag: (min: 0.0, avg: 11.5, max: 23.0) [2024-06-28 03:11:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:11:59,001][06909] Updated weights for policy 0, policy_version 135853 (0.0037) [2024-06-28 03:12:03,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 2225963008. Throughput: 0: 44114.7. Samples: 2128918340. Policy #0 lag: (min: 0.0, avg: 11.5, max: 23.0) [2024-06-28 03:12:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:12:03,963][06909] Updated weights for policy 0, policy_version 135863 (0.0031) [2024-06-28 03:12:06,644][06909] Updated weights for policy 0, policy_version 135873 (0.0040) [2024-06-28 03:12:08,852][06674] Fps is (10 sec: 42589.6, 60 sec: 44238.2, 300 sec: 44153.2). Total num frames: 2226225152. Throughput: 0: 43898.5. Samples: 2129178860. Policy #0 lag: (min: 0.0, avg: 11.5, max: 23.0) [2024-06-28 03:12:08,861][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:12:11,613][06909] Updated weights for policy 0, policy_version 135883 (0.0044) [2024-06-28 03:12:13,853][06674] Fps is (10 sec: 49135.8, 60 sec: 44234.4, 300 sec: 44097.5). Total num frames: 2226454528. Throughput: 0: 44044.0. Samples: 2129323480. Policy #0 lag: (min: 0.0, avg: 11.5, max: 23.0) [2024-06-28 03:12:13,854][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:12:14,288][06909] Updated weights for policy 0, policy_version 135893 (0.0039) [2024-06-28 03:12:18,850][06674] Fps is (10 sec: 39329.9, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2226618368. Throughput: 0: 43956.5. Samples: 2129570500. Policy #0 lag: (min: 0.0, avg: 11.5, max: 23.0) [2024-06-28 03:12:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:12:18,907][06909] Updated weights for policy 0, policy_version 135903 (0.0033) [2024-06-28 03:12:22,150][06909] Updated weights for policy 0, policy_version 135913 (0.0030) [2024-06-28 03:12:23,394][06887] Signal inference workers to stop experience collection... (30350 times) [2024-06-28 03:12:23,394][06887] Signal inference workers to resume experience collection... (30350 times) [2024-06-28 03:12:23,412][06909] InferenceWorker_p0-w0: stopping experience collection (30350 times) [2024-06-28 03:12:23,412][06909] InferenceWorker_p0-w0: resuming experience collection (30350 times) [2024-06-28 03:12:23,850][06674] Fps is (10 sec: 44251.2, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2226896896. Throughput: 0: 44136.5. Samples: 2129841760. Policy #0 lag: (min: 0.0, avg: 11.5, max: 23.0) [2024-06-28 03:12:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:12:26,096][06909] Updated weights for policy 0, policy_version 135923 (0.0028) [2024-06-28 03:12:28,850][06674] Fps is (10 sec: 49151.9, 60 sec: 43963.8, 300 sec: 44209.0). Total num frames: 2227109888. Throughput: 0: 43957.8. Samples: 2129976640. Policy #0 lag: (min: 0.0, avg: 11.5, max: 23.0) [2024-06-28 03:12:28,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 03:12:29,398][06909] Updated weights for policy 0, policy_version 135933 (0.0034) [2024-06-28 03:12:33,693][06909] Updated weights for policy 0, policy_version 135943 (0.0046) [2024-06-28 03:12:33,850][06674] Fps is (10 sec: 39321.4, 60 sec: 43965.4, 300 sec: 44043.3). Total num frames: 2227290112. Throughput: 0: 43953.1. Samples: 2130232240. Policy #0 lag: (min: 0.0, avg: 11.5, max: 23.0) [2024-06-28 03:12:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:12:36,547][06909] Updated weights for policy 0, policy_version 135953 (0.0045) [2024-06-28 03:12:38,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.9, 300 sec: 44154.3). Total num frames: 2227552256. Throughput: 0: 43880.5. Samples: 2130498320. Policy #0 lag: (min: 0.0, avg: 11.5, max: 23.0) [2024-06-28 03:12:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:12:41,075][06909] Updated weights for policy 0, policy_version 135963 (0.0024) [2024-06-28 03:12:43,850][06674] Fps is (10 sec: 47514.0, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2227765248. Throughput: 0: 43814.3. Samples: 2130638720. Policy #0 lag: (min: 0.0, avg: 11.5, max: 23.0) [2024-06-28 03:12:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 03:12:44,694][06909] Updated weights for policy 0, policy_version 135973 (0.0021) [2024-06-28 03:12:48,531][06909] Updated weights for policy 0, policy_version 135983 (0.0039) [2024-06-28 03:12:48,850][06674] Fps is (10 sec: 39321.4, 60 sec: 43692.2, 300 sec: 43931.3). Total num frames: 2227945472. Throughput: 0: 43829.7. Samples: 2130890680. Policy #0 lag: (min: 0.0, avg: 11.5, max: 23.0) [2024-06-28 03:12:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:12:52,015][06909] Updated weights for policy 0, policy_version 135993 (0.0028) [2024-06-28 03:12:53,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44509.9, 300 sec: 44209.0). Total num frames: 2228224000. Throughput: 0: 43819.8. Samples: 2131150660. Policy #0 lag: (min: 0.0, avg: 11.5, max: 23.0) [2024-06-28 03:12:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:12:55,833][06909] Updated weights for policy 0, policy_version 136003 (0.0043) [2024-06-28 03:12:58,850][06674] Fps is (10 sec: 47513.7, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 2228420608. Throughput: 0: 43761.9. Samples: 2131292620. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 03:12:58,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 03:12:59,174][06909] Updated weights for policy 0, policy_version 136013 (0.0033) [2024-06-28 03:13:03,259][06909] Updated weights for policy 0, policy_version 136023 (0.0035) [2024-06-28 03:13:03,850][06674] Fps is (10 sec: 40960.0, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 2228633600. Throughput: 0: 43930.7. Samples: 2131547380. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 03:13:03,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 03:13:06,623][06909] Updated weights for policy 0, policy_version 136033 (0.0036) [2024-06-28 03:13:08,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44511.4, 300 sec: 44153.5). Total num frames: 2228895744. Throughput: 0: 43790.7. Samples: 2131812340. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 03:13:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:13:10,713][06909] Updated weights for policy 0, policy_version 136043 (0.0026) [2024-06-28 03:13:13,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43693.1, 300 sec: 44098.0). Total num frames: 2229075968. Throughput: 0: 44008.9. Samples: 2131957040. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 03:13:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:13:13,962][06909] Updated weights for policy 0, policy_version 136053 (0.0043) [2024-06-28 03:13:18,334][06909] Updated weights for policy 0, policy_version 136063 (0.0027) [2024-06-28 03:13:18,850][06674] Fps is (10 sec: 37682.9, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 2229272576. Throughput: 0: 43976.4. Samples: 2132211180. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 03:13:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:13:21,810][06909] Updated weights for policy 0, policy_version 136073 (0.0034) [2024-06-28 03:13:23,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2229534720. Throughput: 0: 43820.0. Samples: 2132470220. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 03:13:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:13:25,626][06909] Updated weights for policy 0, policy_version 136083 (0.0031) [2024-06-28 03:13:28,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2229731328. Throughput: 0: 43923.8. Samples: 2132615300. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 03:13:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 03:13:29,104][06909] Updated weights for policy 0, policy_version 136093 (0.0035) [2024-06-28 03:13:32,827][06909] Updated weights for policy 0, policy_version 136103 (0.0032) [2024-06-28 03:13:33,850][06674] Fps is (10 sec: 40959.7, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2229944320. Throughput: 0: 44216.9. Samples: 2132880440. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 03:13:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:13:36,398][06909] Updated weights for policy 0, policy_version 136113 (0.0027) [2024-06-28 03:13:38,344][06887] Signal inference workers to stop experience collection... (30400 times) [2024-06-28 03:13:38,392][06909] InferenceWorker_p0-w0: stopping experience collection (30400 times) [2024-06-28 03:13:38,399][06887] Signal inference workers to resume experience collection... (30400 times) [2024-06-28 03:13:38,404][06909] InferenceWorker_p0-w0: resuming experience collection (30400 times) [2024-06-28 03:13:38,850][06674] Fps is (10 sec: 47514.1, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 2230206464. Throughput: 0: 44203.1. Samples: 2133139800. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 03:13:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:13:40,498][06909] Updated weights for policy 0, policy_version 136123 (0.0035) [2024-06-28 03:13:43,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43417.5, 300 sec: 43986.9). Total num frames: 2230370304. Throughput: 0: 44105.7. Samples: 2133277380. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 03:13:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:13:44,205][06909] Updated weights for policy 0, policy_version 136133 (0.0029) [2024-06-28 03:13:47,929][06909] Updated weights for policy 0, policy_version 136143 (0.0037) [2024-06-28 03:13:48,850][06674] Fps is (10 sec: 39321.9, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2230599680. Throughput: 0: 44161.8. Samples: 2133534660. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 03:13:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:13:48,959][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000136146_2230616064.pth... [2024-06-28 03:13:49,011][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000135502_2220064768.pth [2024-06-28 03:13:51,658][06909] Updated weights for policy 0, policy_version 136153 (0.0035) [2024-06-28 03:13:53,850][06674] Fps is (10 sec: 49151.9, 60 sec: 43963.6, 300 sec: 44097.9). Total num frames: 2230861824. Throughput: 0: 44014.1. Samples: 2133792980. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 03:13:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:13:55,656][06909] Updated weights for policy 0, policy_version 136163 (0.0042) [2024-06-28 03:13:58,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43417.5, 300 sec: 43931.3). Total num frames: 2231025664. Throughput: 0: 43917.6. Samples: 2133933340. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 03:13:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:13:59,430][06909] Updated weights for policy 0, policy_version 136173 (0.0036) [2024-06-28 03:14:02,974][06909] Updated weights for policy 0, policy_version 136183 (0.0027) [2024-06-28 03:14:03,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 2231271424. Throughput: 0: 43952.4. Samples: 2134189040. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 03:14:03,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:14:06,714][06909] Updated weights for policy 0, policy_version 136193 (0.0026) [2024-06-28 03:14:08,850][06674] Fps is (10 sec: 49152.2, 60 sec: 43690.6, 300 sec: 44098.0). Total num frames: 2231517184. Throughput: 0: 44027.5. Samples: 2134451460. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 03:14:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:14:10,189][06909] Updated weights for policy 0, policy_version 136203 (0.0029) [2024-06-28 03:14:13,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 2231697408. Throughput: 0: 43944.1. Samples: 2134592780. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 03:14:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:14:14,110][06909] Updated weights for policy 0, policy_version 136213 (0.0041) [2024-06-28 03:14:18,040][06909] Updated weights for policy 0, policy_version 136223 (0.0031) [2024-06-28 03:14:18,850][06674] Fps is (10 sec: 39321.1, 60 sec: 43963.6, 300 sec: 43931.6). Total num frames: 2231910400. Throughput: 0: 43763.0. Samples: 2134849780. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 03:14:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:14:21,775][06909] Updated weights for policy 0, policy_version 136233 (0.0026) [2024-06-28 03:14:23,850][06674] Fps is (10 sec: 47513.1, 60 sec: 43963.6, 300 sec: 44097.9). Total num frames: 2232172544. Throughput: 0: 43644.8. Samples: 2135103820. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 03:14:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:14:25,266][06909] Updated weights for policy 0, policy_version 136243 (0.0026) [2024-06-28 03:14:28,850][06674] Fps is (10 sec: 44237.8, 60 sec: 43690.8, 300 sec: 43876.7). Total num frames: 2232352768. Throughput: 0: 43743.2. Samples: 2135245820. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 03:14:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:14:28,966][06909] Updated weights for policy 0, policy_version 136253 (0.0036) [2024-06-28 03:14:32,485][06909] Updated weights for policy 0, policy_version 136263 (0.0041) [2024-06-28 03:14:33,850][06674] Fps is (10 sec: 40960.6, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2232582144. Throughput: 0: 43984.8. Samples: 2135513980. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 03:14:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:14:36,555][06909] Updated weights for policy 0, policy_version 136273 (0.0032) [2024-06-28 03:14:38,852][06674] Fps is (10 sec: 47503.5, 60 sec: 43689.2, 300 sec: 44097.6). Total num frames: 2232827904. Throughput: 0: 43825.6. Samples: 2135765220. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 03:14:38,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:14:40,218][06909] Updated weights for policy 0, policy_version 136283 (0.0026) [2024-06-28 03:14:43,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 2233008128. Throughput: 0: 43748.5. Samples: 2135902020. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 03:14:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:14:43,939][06909] Updated weights for policy 0, policy_version 136293 (0.0040) [2024-06-28 03:14:44,906][06887] Signal inference workers to stop experience collection... (30450 times) [2024-06-28 03:14:44,906][06887] Signal inference workers to resume experience collection... (30450 times) [2024-06-28 03:14:44,920][06909] InferenceWorker_p0-w0: stopping experience collection (30450 times) [2024-06-28 03:14:44,920][06909] InferenceWorker_p0-w0: resuming experience collection (30450 times) [2024-06-28 03:14:47,518][06909] Updated weights for policy 0, policy_version 136303 (0.0032) [2024-06-28 03:14:48,851][06674] Fps is (10 sec: 40964.7, 60 sec: 43963.0, 300 sec: 43931.2). Total num frames: 2233237504. Throughput: 0: 44033.4. Samples: 2136170580. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 03:14:48,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:14:51,390][06909] Updated weights for policy 0, policy_version 136313 (0.0026) [2024-06-28 03:14:53,852][06674] Fps is (10 sec: 47504.4, 60 sec: 43689.3, 300 sec: 44097.6). Total num frames: 2233483264. Throughput: 0: 43889.6. Samples: 2136426580. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 03:14:53,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:14:55,155][06909] Updated weights for policy 0, policy_version 136323 (0.0032) [2024-06-28 03:14:58,850][06674] Fps is (10 sec: 42602.5, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 2233663488. Throughput: 0: 43827.6. Samples: 2136565020. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 03:14:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:14:58,991][06909] Updated weights for policy 0, policy_version 136333 (0.0031) [2024-06-28 03:15:02,643][06909] Updated weights for policy 0, policy_version 136343 (0.0027) [2024-06-28 03:15:03,850][06674] Fps is (10 sec: 42607.2, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2233909248. Throughput: 0: 43982.0. Samples: 2136828960. Policy #0 lag: (min: 0.0, avg: 12.5, max: 23.0) [2024-06-28 03:15:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:15:06,261][06909] Updated weights for policy 0, policy_version 136353 (0.0042) [2024-06-28 03:15:08,850][06674] Fps is (10 sec: 47513.3, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2234138624. Throughput: 0: 44031.7. Samples: 2137085240. Policy #0 lag: (min: 0.0, avg: 12.5, max: 23.0) [2024-06-28 03:15:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:15:10,319][06909] Updated weights for policy 0, policy_version 136363 (0.0026) [2024-06-28 03:15:13,710][06909] Updated weights for policy 0, policy_version 136373 (0.0040) [2024-06-28 03:15:13,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2234335232. Throughput: 0: 43913.6. Samples: 2137221940. Policy #0 lag: (min: 0.0, avg: 12.5, max: 23.0) [2024-06-28 03:15:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:15:17,580][06909] Updated weights for policy 0, policy_version 136383 (0.0025) [2024-06-28 03:15:18,850][06674] Fps is (10 sec: 42598.8, 60 sec: 44237.0, 300 sec: 43986.9). Total num frames: 2234564608. Throughput: 0: 43895.6. Samples: 2137489280. Policy #0 lag: (min: 0.0, avg: 12.5, max: 23.0) [2024-06-28 03:15:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 03:15:21,147][06909] Updated weights for policy 0, policy_version 136393 (0.0028) [2024-06-28 03:15:23,850][06674] Fps is (10 sec: 47513.8, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2234810368. Throughput: 0: 44094.9. Samples: 2137749400. Policy #0 lag: (min: 0.0, avg: 12.5, max: 23.0) [2024-06-28 03:15:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:15:24,948][06909] Updated weights for policy 0, policy_version 136403 (0.0034) [2024-06-28 03:15:28,340][06909] Updated weights for policy 0, policy_version 136413 (0.0038) [2024-06-28 03:15:28,850][06674] Fps is (10 sec: 42597.3, 60 sec: 43963.5, 300 sec: 43932.2). Total num frames: 2234990592. Throughput: 0: 44040.3. Samples: 2137883840. Policy #0 lag: (min: 0.0, avg: 12.5, max: 23.0) [2024-06-28 03:15:28,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:15:32,349][06909] Updated weights for policy 0, policy_version 136423 (0.0036) [2024-06-28 03:15:33,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2235236352. Throughput: 0: 43958.2. Samples: 2138148660. Policy #0 lag: (min: 0.0, avg: 12.5, max: 23.0) [2024-06-28 03:15:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:15:35,946][06909] Updated weights for policy 0, policy_version 136433 (0.0037) [2024-06-28 03:15:38,852][06674] Fps is (10 sec: 45867.1, 60 sec: 43690.7, 300 sec: 43986.6). Total num frames: 2235449344. Throughput: 0: 44013.4. Samples: 2138407180. Policy #0 lag: (min: 0.0, avg: 12.5, max: 23.0) [2024-06-28 03:15:38,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:15:39,891][06909] Updated weights for policy 0, policy_version 136443 (0.0038) [2024-06-28 03:15:43,072][06909] Updated weights for policy 0, policy_version 136453 (0.0033) [2024-06-28 03:15:43,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43963.8, 300 sec: 43931.4). Total num frames: 2235645952. Throughput: 0: 43937.3. Samples: 2138542200. Policy #0 lag: (min: 0.0, avg: 12.5, max: 23.0) [2024-06-28 03:15:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:15:47,408][06909] Updated weights for policy 0, policy_version 136463 (0.0040) [2024-06-28 03:15:48,850][06674] Fps is (10 sec: 45883.8, 60 sec: 44510.5, 300 sec: 44042.4). Total num frames: 2235908096. Throughput: 0: 44136.8. Samples: 2138815120. Policy #0 lag: (min: 0.0, avg: 12.5, max: 23.0) [2024-06-28 03:15:48,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:15:48,874][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000136469_2235908096.pth... [2024-06-28 03:15:48,948][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000135823_2225324032.pth [2024-06-28 03:15:50,754][06909] Updated weights for policy 0, policy_version 136473 (0.0031) [2024-06-28 03:15:53,850][06674] Fps is (10 sec: 47513.7, 60 sec: 43965.2, 300 sec: 44042.4). Total num frames: 2236121088. Throughput: 0: 44214.3. Samples: 2139074880. Policy #0 lag: (min: 0.0, avg: 12.5, max: 23.0) [2024-06-28 03:15:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:15:54,618][06909] Updated weights for policy 0, policy_version 136483 (0.0030) [2024-06-28 03:15:58,063][06909] Updated weights for policy 0, policy_version 136493 (0.0041) [2024-06-28 03:15:58,850][06674] Fps is (10 sec: 40960.5, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2236317696. Throughput: 0: 44057.9. Samples: 2139204540. Policy #0 lag: (min: 0.0, avg: 12.5, max: 23.0) [2024-06-28 03:15:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:16:02,134][06909] Updated weights for policy 0, policy_version 136503 (0.0024) [2024-06-28 03:16:03,379][06887] Signal inference workers to stop experience collection... (30500 times) [2024-06-28 03:16:03,410][06909] InferenceWorker_p0-w0: stopping experience collection (30500 times) [2024-06-28 03:16:03,433][06887] Signal inference workers to resume experience collection... (30500 times) [2024-06-28 03:16:03,433][06909] InferenceWorker_p0-w0: resuming experience collection (30500 times) [2024-06-28 03:16:03,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44509.9, 300 sec: 44098.6). Total num frames: 2236579840. Throughput: 0: 44158.2. Samples: 2139476400. Policy #0 lag: (min: 0.0, avg: 11.9, max: 24.0) [2024-06-28 03:16:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:16:05,801][06909] Updated weights for policy 0, policy_version 136513 (0.0040) [2024-06-28 03:16:08,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2236776448. Throughput: 0: 44261.0. Samples: 2139741140. Policy #0 lag: (min: 0.0, avg: 11.9, max: 24.0) [2024-06-28 03:16:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:16:09,295][06909] Updated weights for policy 0, policy_version 136523 (0.0032) [2024-06-28 03:16:12,997][06909] Updated weights for policy 0, policy_version 136533 (0.0029) [2024-06-28 03:16:13,850][06674] Fps is (10 sec: 40959.2, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2236989440. Throughput: 0: 43973.4. Samples: 2139862640. Policy #0 lag: (min: 0.0, avg: 11.9, max: 24.0) [2024-06-28 03:16:13,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 03:16:16,537][06909] Updated weights for policy 0, policy_version 136543 (0.0036) [2024-06-28 03:16:18,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 2237235200. Throughput: 0: 44191.1. Samples: 2140137260. Policy #0 lag: (min: 0.0, avg: 11.9, max: 24.0) [2024-06-28 03:16:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:16:20,373][06909] Updated weights for policy 0, policy_version 136553 (0.0028) [2024-06-28 03:16:23,850][06674] Fps is (10 sec: 42599.2, 60 sec: 43417.7, 300 sec: 43875.8). Total num frames: 2237415424. Throughput: 0: 44290.9. Samples: 2140400180. Policy #0 lag: (min: 0.0, avg: 11.9, max: 24.0) [2024-06-28 03:16:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:16:24,496][06909] Updated weights for policy 0, policy_version 136563 (0.0035) [2024-06-28 03:16:27,586][06909] Updated weights for policy 0, policy_version 136573 (0.0031) [2024-06-28 03:16:28,850][06674] Fps is (10 sec: 40960.5, 60 sec: 44237.0, 300 sec: 44042.8). Total num frames: 2237644800. Throughput: 0: 44047.2. Samples: 2140524320. Policy #0 lag: (min: 0.0, avg: 11.9, max: 24.0) [2024-06-28 03:16:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:16:31,943][06909] Updated weights for policy 0, policy_version 136583 (0.0036) [2024-06-28 03:16:33,852][06674] Fps is (10 sec: 45867.1, 60 sec: 43962.5, 300 sec: 43986.6). Total num frames: 2237874176. Throughput: 0: 43914.5. Samples: 2140791340. Policy #0 lag: (min: 0.0, avg: 11.9, max: 24.0) [2024-06-28 03:16:33,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:16:35,495][06909] Updated weights for policy 0, policy_version 136593 (0.0023) [2024-06-28 03:16:38,850][06674] Fps is (10 sec: 44235.8, 60 sec: 43965.1, 300 sec: 43931.3). Total num frames: 2238087168. Throughput: 0: 44115.8. Samples: 2141060100. Policy #0 lag: (min: 0.0, avg: 11.9, max: 24.0) [2024-06-28 03:16:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:16:39,259][06909] Updated weights for policy 0, policy_version 136603 (0.0034) [2024-06-28 03:16:42,879][06909] Updated weights for policy 0, policy_version 136613 (0.0040) [2024-06-28 03:16:43,850][06674] Fps is (10 sec: 42605.5, 60 sec: 44236.8, 300 sec: 43987.2). Total num frames: 2238300160. Throughput: 0: 44052.0. Samples: 2141186880. Policy #0 lag: (min: 0.0, avg: 11.9, max: 24.0) [2024-06-28 03:16:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:16:46,782][06909] Updated weights for policy 0, policy_version 136623 (0.0030) [2024-06-28 03:16:48,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2238545920. Throughput: 0: 43959.4. Samples: 2141454580. Policy #0 lag: (min: 0.0, avg: 11.9, max: 24.0) [2024-06-28 03:16:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 03:16:50,221][06909] Updated weights for policy 0, policy_version 136633 (0.0039) [2024-06-28 03:16:53,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43417.5, 300 sec: 43820.2). Total num frames: 2238726144. Throughput: 0: 43851.5. Samples: 2141714460. Policy #0 lag: (min: 0.0, avg: 11.9, max: 24.0) [2024-06-28 03:16:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:16:54,240][06909] Updated weights for policy 0, policy_version 136643 (0.0027) [2024-06-28 03:16:57,538][06909] Updated weights for policy 0, policy_version 136653 (0.0032) [2024-06-28 03:16:58,850][06674] Fps is (10 sec: 44237.4, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 2238988288. Throughput: 0: 44027.7. Samples: 2141843880. Policy #0 lag: (min: 0.0, avg: 11.9, max: 24.0) [2024-06-28 03:16:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:17:01,676][06909] Updated weights for policy 0, policy_version 136663 (0.0030) [2024-06-28 03:17:03,850][06674] Fps is (10 sec: 47513.4, 60 sec: 43690.5, 300 sec: 43987.2). Total num frames: 2239201280. Throughput: 0: 43974.2. Samples: 2142116100. Policy #0 lag: (min: 0.0, avg: 11.9, max: 24.0) [2024-06-28 03:17:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:17:05,040][06909] Updated weights for policy 0, policy_version 136673 (0.0025) [2024-06-28 03:17:08,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43690.6, 300 sec: 43876.3). Total num frames: 2239397888. Throughput: 0: 44009.7. Samples: 2142380620. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-28 03:17:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 03:17:09,110][06909] Updated weights for policy 0, policy_version 136683 (0.0034) [2024-06-28 03:17:12,330][06909] Updated weights for policy 0, policy_version 136693 (0.0031) [2024-06-28 03:17:13,850][06674] Fps is (10 sec: 44237.4, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 2239643648. Throughput: 0: 44131.5. Samples: 2142510240. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-28 03:17:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:17:16,439][06909] Updated weights for policy 0, policy_version 136703 (0.0032) [2024-06-28 03:17:18,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 2239856640. Throughput: 0: 44165.2. Samples: 2142778700. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-28 03:17:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:17:20,037][06909] Updated weights for policy 0, policy_version 136713 (0.0037) [2024-06-28 03:17:20,356][06887] Signal inference workers to stop experience collection... (30550 times) [2024-06-28 03:17:20,381][06909] InferenceWorker_p0-w0: stopping experience collection (30550 times) [2024-06-28 03:17:20,411][06887] Signal inference workers to resume experience collection... (30550 times) [2024-06-28 03:17:20,420][06909] InferenceWorker_p0-w0: resuming experience collection (30550 times) [2024-06-28 03:17:23,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 2240053248. Throughput: 0: 44060.2. Samples: 2143042800. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-28 03:17:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:17:23,877][06909] Updated weights for policy 0, policy_version 136723 (0.0032) [2024-06-28 03:17:27,540][06909] Updated weights for policy 0, policy_version 136733 (0.0041) [2024-06-28 03:17:28,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2240299008. Throughput: 0: 44120.1. Samples: 2143172280. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-28 03:17:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 03:17:31,423][06909] Updated weights for policy 0, policy_version 136743 (0.0039) [2024-06-28 03:17:33,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43965.0, 300 sec: 43931.3). Total num frames: 2240512000. Throughput: 0: 43988.2. Samples: 2143434040. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-28 03:17:33,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-28 03:17:34,806][06909] Updated weights for policy 0, policy_version 136753 (0.0044) [2024-06-28 03:17:38,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43690.8, 300 sec: 43875.8). Total num frames: 2240708608. Throughput: 0: 44262.3. Samples: 2143706260. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-28 03:17:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:17:38,927][06909] Updated weights for policy 0, policy_version 136763 (0.0031) [2024-06-28 03:17:42,286][06909] Updated weights for policy 0, policy_version 136773 (0.0025) [2024-06-28 03:17:43,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 2240970752. Throughput: 0: 44165.3. Samples: 2143831320. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-28 03:17:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:17:46,160][06909] Updated weights for policy 0, policy_version 136783 (0.0038) [2024-06-28 03:17:48,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43690.8, 300 sec: 43875.8). Total num frames: 2241167360. Throughput: 0: 43968.1. Samples: 2144094660. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-28 03:17:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:17:48,861][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000136790_2241167360.pth... [2024-06-28 03:17:48,921][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000136146_2230616064.pth [2024-06-28 03:17:49,710][06909] Updated weights for policy 0, policy_version 136793 (0.0024) [2024-06-28 03:17:53,490][06909] Updated weights for policy 0, policy_version 136803 (0.0026) [2024-06-28 03:17:53,850][06674] Fps is (10 sec: 40960.0, 60 sec: 44236.9, 300 sec: 43931.3). Total num frames: 2241380352. Throughput: 0: 44073.4. Samples: 2144363920. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-28 03:17:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:17:57,200][06909] Updated weights for policy 0, policy_version 136813 (0.0033) [2024-06-28 03:17:58,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2241626112. Throughput: 0: 44063.0. Samples: 2144493080. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-28 03:17:58,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 03:18:01,027][06909] Updated weights for policy 0, policy_version 136823 (0.0030) [2024-06-28 03:18:03,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43690.7, 300 sec: 43820.2). Total num frames: 2241822720. Throughput: 0: 43960.7. Samples: 2144756940. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-28 03:18:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:18:04,675][06909] Updated weights for policy 0, policy_version 136833 (0.0032) [2024-06-28 03:18:08,287][06909] Updated weights for policy 0, policy_version 136843 (0.0031) [2024-06-28 03:18:08,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 2242068480. Throughput: 0: 43954.9. Samples: 2145020780. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-28 03:18:08,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:18:12,126][06909] Updated weights for policy 0, policy_version 136853 (0.0044) [2024-06-28 03:18:13,850][06674] Fps is (10 sec: 45876.1, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2242281472. Throughput: 0: 44049.8. Samples: 2145154520. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-28 03:18:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:18:15,814][06909] Updated weights for policy 0, policy_version 136863 (0.0035) [2024-06-28 03:18:18,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 2242478080. Throughput: 0: 44063.4. Samples: 2145416900. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-28 03:18:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:18:19,237][06909] Updated weights for policy 0, policy_version 136873 (0.0039) [2024-06-28 03:18:23,063][06909] Updated weights for policy 0, policy_version 136883 (0.0027) [2024-06-28 03:18:23,850][06674] Fps is (10 sec: 44236.2, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 2242723840. Throughput: 0: 43997.3. Samples: 2145686140. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-28 03:18:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:18:26,846][06909] Updated weights for policy 0, policy_version 136893 (0.0033) [2024-06-28 03:18:28,850][06674] Fps is (10 sec: 45876.0, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2242936832. Throughput: 0: 44094.3. Samples: 2145815560. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-28 03:18:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:18:30,596][06909] Updated weights for policy 0, policy_version 136903 (0.0031) [2024-06-28 03:18:33,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 2243149824. Throughput: 0: 43992.9. Samples: 2146074340. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-28 03:18:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:18:34,348][06909] Updated weights for policy 0, policy_version 136913 (0.0030) [2024-06-28 03:18:37,312][06887] Signal inference workers to stop experience collection... (30600 times) [2024-06-28 03:18:37,365][06909] InferenceWorker_p0-w0: stopping experience collection (30600 times) [2024-06-28 03:18:37,423][06887] Signal inference workers to resume experience collection... (30600 times) [2024-06-28 03:18:37,423][06909] InferenceWorker_p0-w0: resuming experience collection (30600 times) [2024-06-28 03:18:38,167][06909] Updated weights for policy 0, policy_version 136923 (0.0033) [2024-06-28 03:18:38,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 2243379200. Throughput: 0: 43920.0. Samples: 2146340320. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-28 03:18:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:18:41,890][06909] Updated weights for policy 0, policy_version 136933 (0.0039) [2024-06-28 03:18:43,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2243592192. Throughput: 0: 44022.7. Samples: 2146474100. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-28 03:18:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 03:18:45,349][06909] Updated weights for policy 0, policy_version 136943 (0.0039) [2024-06-28 03:18:48,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43963.6, 300 sec: 43875.8). Total num frames: 2243805184. Throughput: 0: 43998.7. Samples: 2146736880. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-28 03:18:48,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:18:49,202][06909] Updated weights for policy 0, policy_version 136953 (0.0045) [2024-06-28 03:18:52,907][06909] Updated weights for policy 0, policy_version 136963 (0.0040) [2024-06-28 03:18:53,850][06674] Fps is (10 sec: 45875.7, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 2244050944. Throughput: 0: 44182.8. Samples: 2147009000. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-28 03:18:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:18:56,656][06909] Updated weights for policy 0, policy_version 136973 (0.0031) [2024-06-28 03:18:58,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2244263936. Throughput: 0: 44211.9. Samples: 2147144060. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-28 03:18:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:19:00,152][06909] Updated weights for policy 0, policy_version 136983 (0.0039) [2024-06-28 03:19:03,850][06674] Fps is (10 sec: 42597.6, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 2244476928. Throughput: 0: 44216.4. Samples: 2147406640. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-28 03:19:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:19:04,093][06909] Updated weights for policy 0, policy_version 136993 (0.0030) [2024-06-28 03:19:07,642][06909] Updated weights for policy 0, policy_version 137003 (0.0030) [2024-06-28 03:19:08,850][06674] Fps is (10 sec: 45875.8, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 2244722688. Throughput: 0: 44124.1. Samples: 2147671720. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-28 03:19:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:19:11,301][06909] Updated weights for policy 0, policy_version 137013 (0.0046) [2024-06-28 03:19:13,850][06674] Fps is (10 sec: 45876.1, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2244935680. Throughput: 0: 44302.2. Samples: 2147809160. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 03:19:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:19:14,831][06909] Updated weights for policy 0, policy_version 137023 (0.0030) [2024-06-28 03:19:18,591][06909] Updated weights for policy 0, policy_version 137033 (0.0022) [2024-06-28 03:19:18,852][06674] Fps is (10 sec: 42589.2, 60 sec: 44508.4, 300 sec: 43986.6). Total num frames: 2245148672. Throughput: 0: 44585.4. Samples: 2148080780. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 03:19:18,853][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:19:22,240][06909] Updated weights for policy 0, policy_version 137043 (0.0030) [2024-06-28 03:19:23,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44509.9, 300 sec: 44209.0). Total num frames: 2245394432. Throughput: 0: 44419.1. Samples: 2148339180. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 03:19:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:19:26,113][06909] Updated weights for policy 0, policy_version 137053 (0.0036) [2024-06-28 03:19:28,850][06674] Fps is (10 sec: 44245.9, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2245591040. Throughput: 0: 44510.7. Samples: 2148477080. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 03:19:28,859][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:19:29,378][06909] Updated weights for policy 0, policy_version 137063 (0.0025) [2024-06-28 03:19:33,485][06909] Updated weights for policy 0, policy_version 137073 (0.0033) [2024-06-28 03:19:33,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44509.8, 300 sec: 44042.7). Total num frames: 2245820416. Throughput: 0: 44553.0. Samples: 2148741760. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 03:19:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:19:37,205][06909] Updated weights for policy 0, policy_version 137083 (0.0031) [2024-06-28 03:19:38,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44509.8, 300 sec: 44209.0). Total num frames: 2246049792. Throughput: 0: 44212.3. Samples: 2148998560. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 03:19:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:19:40,839][06909] Updated weights for policy 0, policy_version 137093 (0.0030) [2024-06-28 03:19:43,850][06674] Fps is (10 sec: 42598.8, 60 sec: 44236.9, 300 sec: 44098.1). Total num frames: 2246246400. Throughput: 0: 44212.6. Samples: 2149133620. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 03:19:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 03:19:44,359][06909] Updated weights for policy 0, policy_version 137103 (0.0034) [2024-06-28 03:19:48,194][06909] Updated weights for policy 0, policy_version 137113 (0.0031) [2024-06-28 03:19:48,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44782.9, 300 sec: 44098.2). Total num frames: 2246492160. Throughput: 0: 44341.8. Samples: 2149402020. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 03:19:48,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:19:48,868][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000137115_2246492160.pth... [2024-06-28 03:19:48,940][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000136469_2235908096.pth [2024-06-28 03:19:52,063][06909] Updated weights for policy 0, policy_version 137123 (0.0033) [2024-06-28 03:19:53,850][06674] Fps is (10 sec: 45874.3, 60 sec: 44236.7, 300 sec: 44209.0). Total num frames: 2246705152. Throughput: 0: 44146.9. Samples: 2149658340. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 03:19:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:19:55,683][06909] Updated weights for policy 0, policy_version 137133 (0.0028) [2024-06-28 03:19:58,850][06674] Fps is (10 sec: 40960.8, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2246901760. Throughput: 0: 44114.7. Samples: 2149794320. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 03:19:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:19:59,431][06909] Updated weights for policy 0, policy_version 137143 (0.0050) [2024-06-28 03:20:03,192][06887] Signal inference workers to stop experience collection... (30650 times) [2024-06-28 03:20:03,193][06887] Signal inference workers to resume experience collection... (30650 times) [2024-06-28 03:20:03,209][06909] InferenceWorker_p0-w0: stopping experience collection (30650 times) [2024-06-28 03:20:03,211][06909] Updated weights for policy 0, policy_version 137153 (0.0027) [2024-06-28 03:20:03,242][06909] InferenceWorker_p0-w0: resuming experience collection (30650 times) [2024-06-28 03:20:03,850][06674] Fps is (10 sec: 44237.6, 60 sec: 44510.0, 300 sec: 44098.0). Total num frames: 2247147520. Throughput: 0: 43999.0. Samples: 2150060640. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 03:20:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:20:06,891][06909] Updated weights for policy 0, policy_version 137163 (0.0037) [2024-06-28 03:20:08,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.6, 300 sec: 44153.5). Total num frames: 2247360512. Throughput: 0: 43978.6. Samples: 2150318220. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 03:20:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 03:20:10,456][06909] Updated weights for policy 0, policy_version 137173 (0.0042) [2024-06-28 03:20:13,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2247573504. Throughput: 0: 43889.4. Samples: 2150452100. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 03:20:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:20:14,259][06909] Updated weights for policy 0, policy_version 137183 (0.0039) [2024-06-28 03:20:17,977][06909] Updated weights for policy 0, policy_version 137193 (0.0027) [2024-06-28 03:20:18,852][06674] Fps is (10 sec: 45866.0, 60 sec: 44509.9, 300 sec: 44097.7). Total num frames: 2247819264. Throughput: 0: 44042.5. Samples: 2150723760. Policy #0 lag: (min: 1.0, avg: 10.6, max: 22.0) [2024-06-28 03:20:18,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:20:21,582][06909] Updated weights for policy 0, policy_version 137203 (0.0034) [2024-06-28 03:20:23,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 2248032256. Throughput: 0: 44200.4. Samples: 2150987580. Policy #0 lag: (min: 1.0, avg: 10.6, max: 22.0) [2024-06-28 03:20:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:20:25,401][06909] Updated weights for policy 0, policy_version 137213 (0.0037) [2024-06-28 03:20:28,850][06674] Fps is (10 sec: 42607.2, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2248245248. Throughput: 0: 44200.4. Samples: 2151122640. Policy #0 lag: (min: 1.0, avg: 10.6, max: 22.0) [2024-06-28 03:20:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 03:20:28,965][06909] Updated weights for policy 0, policy_version 137223 (0.0036) [2024-06-28 03:20:32,915][06909] Updated weights for policy 0, policy_version 137233 (0.0035) [2024-06-28 03:20:33,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.7, 300 sec: 44098.2). Total num frames: 2248458240. Throughput: 0: 44127.2. Samples: 2151387740. Policy #0 lag: (min: 1.0, avg: 10.6, max: 22.0) [2024-06-28 03:20:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:20:36,834][06909] Updated weights for policy 0, policy_version 137243 (0.0040) [2024-06-28 03:20:38,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.8, 300 sec: 44153.5). Total num frames: 2248671232. Throughput: 0: 44079.3. Samples: 2151641900. Policy #0 lag: (min: 1.0, avg: 10.6, max: 22.0) [2024-06-28 03:20:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 03:20:40,274][06909] Updated weights for policy 0, policy_version 137253 (0.0030) [2024-06-28 03:20:43,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2248900608. Throughput: 0: 44041.3. Samples: 2151776180. Policy #0 lag: (min: 1.0, avg: 10.6, max: 22.0) [2024-06-28 03:20:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:20:44,110][06909] Updated weights for policy 0, policy_version 137263 (0.0020) [2024-06-28 03:20:47,424][06909] Updated weights for policy 0, policy_version 137273 (0.0031) [2024-06-28 03:20:48,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 2249129984. Throughput: 0: 44067.0. Samples: 2152043660. Policy #0 lag: (min: 1.0, avg: 10.6, max: 22.0) [2024-06-28 03:20:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:20:51,453][06909] Updated weights for policy 0, policy_version 137283 (0.0036) [2024-06-28 03:20:53,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44236.9, 300 sec: 44209.0). Total num frames: 2249359360. Throughput: 0: 44203.1. Samples: 2152307360. Policy #0 lag: (min: 1.0, avg: 10.6, max: 22.0) [2024-06-28 03:20:53,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:20:54,888][06909] Updated weights for policy 0, policy_version 137293 (0.0028) [2024-06-28 03:20:58,809][06909] Updated weights for policy 0, policy_version 137303 (0.0033) [2024-06-28 03:20:58,850][06674] Fps is (10 sec: 44237.3, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 2249572352. Throughput: 0: 44161.8. Samples: 2152439380. Policy #0 lag: (min: 1.0, avg: 10.6, max: 22.0) [2024-06-28 03:20:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:21:02,390][06909] Updated weights for policy 0, policy_version 137313 (0.0026) [2024-06-28 03:21:03,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2249768960. Throughput: 0: 43964.1. Samples: 2152702060. Policy #0 lag: (min: 1.0, avg: 10.6, max: 22.0) [2024-06-28 03:21:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:21:06,175][06909] Updated weights for policy 0, policy_version 137323 (0.0029) [2024-06-28 03:21:08,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2249998336. Throughput: 0: 44036.1. Samples: 2152969200. Policy #0 lag: (min: 1.0, avg: 10.6, max: 22.0) [2024-06-28 03:21:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:21:10,120][06909] Updated weights for policy 0, policy_version 137333 (0.0036) [2024-06-28 03:21:13,803][06909] Updated weights for policy 0, policy_version 137343 (0.0040) [2024-06-28 03:21:13,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2250227712. Throughput: 0: 43931.0. Samples: 2153099540. Policy #0 lag: (min: 1.0, avg: 10.6, max: 22.0) [2024-06-28 03:21:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 03:21:17,402][06909] Updated weights for policy 0, policy_version 137353 (0.0036) [2024-06-28 03:21:18,852][06674] Fps is (10 sec: 45865.6, 60 sec: 43963.7, 300 sec: 44208.7). Total num frames: 2250457088. Throughput: 0: 43842.0. Samples: 2153360720. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 03:21:18,853][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:21:21,387][06909] Updated weights for policy 0, policy_version 137363 (0.0026) [2024-06-28 03:21:23,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2250670080. Throughput: 0: 44031.0. Samples: 2153623300. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 03:21:23,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:21:24,564][06909] Updated weights for policy 0, policy_version 137373 (0.0022) [2024-06-28 03:21:28,828][06909] Updated weights for policy 0, policy_version 137383 (0.0039) [2024-06-28 03:21:28,850][06674] Fps is (10 sec: 42607.3, 60 sec: 43963.7, 300 sec: 44098.2). Total num frames: 2250883072. Throughput: 0: 44011.9. Samples: 2153756720. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 03:21:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 03:21:32,401][06909] Updated weights for policy 0, policy_version 137393 (0.0027) [2024-06-28 03:21:33,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2251096064. Throughput: 0: 43945.7. Samples: 2154021220. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 03:21:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:21:36,342][06909] Updated weights for policy 0, policy_version 137403 (0.0040) [2024-06-28 03:21:38,852][06674] Fps is (10 sec: 44228.8, 60 sec: 44235.4, 300 sec: 44153.2). Total num frames: 2251325440. Throughput: 0: 43926.2. Samples: 2154284120. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 03:21:38,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:21:39,776][06909] Updated weights for policy 0, policy_version 137413 (0.0036) [2024-06-28 03:21:43,595][06909] Updated weights for policy 0, policy_version 137423 (0.0034) [2024-06-28 03:21:43,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 2251538432. Throughput: 0: 44004.3. Samples: 2154419580. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 03:21:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:21:47,416][06909] Updated weights for policy 0, policy_version 137433 (0.0030) [2024-06-28 03:21:48,850][06674] Fps is (10 sec: 44244.7, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 2251767808. Throughput: 0: 44057.8. Samples: 2154684660. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 03:21:48,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:21:48,860][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000137437_2251767808.pth... [2024-06-28 03:21:48,917][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000136790_2241167360.pth [2024-06-28 03:21:51,208][06909] Updated weights for policy 0, policy_version 137443 (0.0033) [2024-06-28 03:21:52,173][06887] Signal inference workers to stop experience collection... (30700 times) [2024-06-28 03:21:52,212][06909] InferenceWorker_p0-w0: stopping experience collection (30700 times) [2024-06-28 03:21:52,235][06887] Signal inference workers to resume experience collection... (30700 times) [2024-06-28 03:21:52,237][06909] InferenceWorker_p0-w0: resuming experience collection (30700 times) [2024-06-28 03:21:53,852][06674] Fps is (10 sec: 42590.0, 60 sec: 43416.1, 300 sec: 43986.6). Total num frames: 2251964416. Throughput: 0: 43970.0. Samples: 2154947940. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 03:21:53,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 03:21:54,631][06909] Updated weights for policy 0, policy_version 137453 (0.0021) [2024-06-28 03:21:58,562][06909] Updated weights for policy 0, policy_version 137463 (0.0023) [2024-06-28 03:21:58,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2252193792. Throughput: 0: 43998.0. Samples: 2155079440. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 03:21:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:22:01,918][06909] Updated weights for policy 0, policy_version 137473 (0.0023) [2024-06-28 03:22:03,850][06674] Fps is (10 sec: 45884.6, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 2252423168. Throughput: 0: 44107.4. Samples: 2155345460. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 03:22:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:22:06,084][06909] Updated weights for policy 0, policy_version 137483 (0.0034) [2024-06-28 03:22:08,850][06674] Fps is (10 sec: 47512.7, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 2252668928. Throughput: 0: 44161.8. Samples: 2155610580. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 03:22:08,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:22:09,429][06909] Updated weights for policy 0, policy_version 137493 (0.0028) [2024-06-28 03:22:13,230][06909] Updated weights for policy 0, policy_version 137503 (0.0038) [2024-06-28 03:22:13,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2252881920. Throughput: 0: 44211.9. Samples: 2155746260. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 03:22:13,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:22:17,078][06909] Updated weights for policy 0, policy_version 137513 (0.0032) [2024-06-28 03:22:18,852][06674] Fps is (10 sec: 40951.9, 60 sec: 43690.7, 300 sec: 44153.2). Total num frames: 2253078528. Throughput: 0: 44155.0. Samples: 2156008280. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 03:22:18,853][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:22:20,589][06909] Updated weights for policy 0, policy_version 137523 (0.0037) [2024-06-28 03:22:23,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2253291520. Throughput: 0: 44115.1. Samples: 2156269220. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 03:22:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:22:24,525][06909] Updated weights for policy 0, policy_version 137533 (0.0037) [2024-06-28 03:22:27,887][06909] Updated weights for policy 0, policy_version 137543 (0.0032) [2024-06-28 03:22:28,850][06674] Fps is (10 sec: 45884.7, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2253537280. Throughput: 0: 44129.9. Samples: 2156405420. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 03:22:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:22:31,736][06909] Updated weights for policy 0, policy_version 137553 (0.0036) [2024-06-28 03:22:33,852][06674] Fps is (10 sec: 44227.6, 60 sec: 43962.3, 300 sec: 44153.2). Total num frames: 2253733888. Throughput: 0: 44038.5. Samples: 2156666480. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 03:22:33,861][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:22:35,559][06909] Updated weights for policy 0, policy_version 137563 (0.0030) [2024-06-28 03:22:38,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43965.1, 300 sec: 44042.4). Total num frames: 2253963264. Throughput: 0: 44191.8. Samples: 2156936480. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 03:22:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:22:39,100][06909] Updated weights for policy 0, policy_version 137573 (0.0030) [2024-06-28 03:22:42,954][06909] Updated weights for policy 0, policy_version 137583 (0.0029) [2024-06-28 03:22:43,850][06674] Fps is (10 sec: 45884.6, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2254192640. Throughput: 0: 44304.3. Samples: 2157073140. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 03:22:43,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-28 03:22:46,789][06909] Updated weights for policy 0, policy_version 137593 (0.0026) [2024-06-28 03:22:48,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2254405632. Throughput: 0: 44130.7. Samples: 2157331340. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 03:22:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 03:22:50,163][06909] Updated weights for policy 0, policy_version 137603 (0.0033) [2024-06-28 03:22:53,850][06674] Fps is (10 sec: 42598.7, 60 sec: 44238.3, 300 sec: 44042.4). Total num frames: 2254618624. Throughput: 0: 44086.8. Samples: 2157594480. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 03:22:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:22:54,406][06909] Updated weights for policy 0, policy_version 137613 (0.0037) [2024-06-28 03:22:57,677][06909] Updated weights for policy 0, policy_version 137623 (0.0039) [2024-06-28 03:22:58,850][06674] Fps is (10 sec: 47513.3, 60 sec: 44782.8, 300 sec: 44264.6). Total num frames: 2254880768. Throughput: 0: 44068.5. Samples: 2157729340. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 03:22:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:23:01,746][06909] Updated weights for policy 0, policy_version 137633 (0.0030) [2024-06-28 03:23:03,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 2255060992. Throughput: 0: 43953.9. Samples: 2157986120. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 03:23:03,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:23:05,293][06909] Updated weights for policy 0, policy_version 137643 (0.0032) [2024-06-28 03:23:08,850][06674] Fps is (10 sec: 37683.6, 60 sec: 43144.6, 300 sec: 43986.9). Total num frames: 2255257600. Throughput: 0: 44094.7. Samples: 2158253480. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 03:23:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 03:23:09,411][06909] Updated weights for policy 0, policy_version 137653 (0.0032) [2024-06-28 03:23:12,675][06909] Updated weights for policy 0, policy_version 137663 (0.0041) [2024-06-28 03:23:13,811][06887] Signal inference workers to stop experience collection... (30750 times) [2024-06-28 03:23:13,812][06887] Signal inference workers to resume experience collection... (30750 times) [2024-06-28 03:23:13,850][06674] Fps is (10 sec: 47514.4, 60 sec: 44236.9, 300 sec: 44264.6). Total num frames: 2255536128. Throughput: 0: 43948.5. Samples: 2158383100. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 03:23:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 03:23:13,854][06909] InferenceWorker_p0-w0: stopping experience collection (30750 times) [2024-06-28 03:23:13,854][06909] InferenceWorker_p0-w0: resuming experience collection (30750 times) [2024-06-28 03:23:16,636][06909] Updated weights for policy 0, policy_version 137673 (0.0038) [2024-06-28 03:23:18,852][06674] Fps is (10 sec: 45865.6, 60 sec: 43963.7, 300 sec: 44042.1). Total num frames: 2255716352. Throughput: 0: 44008.0. Samples: 2158646840. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 03:23:18,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:23:20,198][06909] Updated weights for policy 0, policy_version 137683 (0.0040) [2024-06-28 03:23:23,850][06674] Fps is (10 sec: 39321.4, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2255929344. Throughput: 0: 43982.6. Samples: 2158915700. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-28 03:23:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:23:24,181][06909] Updated weights for policy 0, policy_version 137693 (0.0023) [2024-06-28 03:23:27,500][06909] Updated weights for policy 0, policy_version 137703 (0.0045) [2024-06-28 03:23:28,852][06674] Fps is (10 sec: 47513.5, 60 sec: 44235.3, 300 sec: 44208.7). Total num frames: 2256191488. Throughput: 0: 43871.8. Samples: 2159047460. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-28 03:23:28,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:23:31,769][06909] Updated weights for policy 0, policy_version 137713 (0.0032) [2024-06-28 03:23:33,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43965.2, 300 sec: 44042.4). Total num frames: 2256371712. Throughput: 0: 44052.8. Samples: 2159313720. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-28 03:23:33,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:23:35,014][06909] Updated weights for policy 0, policy_version 137723 (0.0035) [2024-06-28 03:23:38,852][06674] Fps is (10 sec: 39321.5, 60 sec: 43689.1, 300 sec: 44042.1). Total num frames: 2256584704. Throughput: 0: 44065.0. Samples: 2159577500. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-28 03:23:38,853][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 03:23:39,151][06909] Updated weights for policy 0, policy_version 137733 (0.0031) [2024-06-28 03:23:42,540][06909] Updated weights for policy 0, policy_version 137743 (0.0032) [2024-06-28 03:23:43,850][06674] Fps is (10 sec: 49152.6, 60 sec: 44509.9, 300 sec: 44264.6). Total num frames: 2256863232. Throughput: 0: 43889.9. Samples: 2159704380. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-28 03:23:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:23:46,747][06909] Updated weights for policy 0, policy_version 137753 (0.0029) [2024-06-28 03:23:48,850][06674] Fps is (10 sec: 44246.2, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2257027072. Throughput: 0: 44139.7. Samples: 2159972400. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-28 03:23:48,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 03:23:48,859][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000137758_2257027072.pth... [2024-06-28 03:23:48,927][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000137115_2246492160.pth [2024-06-28 03:23:49,846][06909] Updated weights for policy 0, policy_version 137763 (0.0045) [2024-06-28 03:23:53,850][06674] Fps is (10 sec: 37683.2, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2257240064. Throughput: 0: 44043.1. Samples: 2160235420. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-28 03:23:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 03:23:54,247][06909] Updated weights for policy 0, policy_version 137773 (0.0025) [2024-06-28 03:23:57,431][06909] Updated weights for policy 0, policy_version 137783 (0.0026) [2024-06-28 03:23:58,850][06674] Fps is (10 sec: 49151.7, 60 sec: 43963.8, 300 sec: 44209.0). Total num frames: 2257518592. Throughput: 0: 44016.8. Samples: 2160363860. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-28 03:23:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:24:01,790][06909] Updated weights for policy 0, policy_version 137793 (0.0029) [2024-06-28 03:24:03,850][06674] Fps is (10 sec: 45874.4, 60 sec: 43963.7, 300 sec: 43986.8). Total num frames: 2257698816. Throughput: 0: 44041.9. Samples: 2160628640. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-28 03:24:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:24:04,791][06909] Updated weights for policy 0, policy_version 137803 (0.0036) [2024-06-28 03:24:08,850][06674] Fps is (10 sec: 39321.5, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 2257911808. Throughput: 0: 43955.9. Samples: 2160893720. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-28 03:24:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:24:09,549][06909] Updated weights for policy 0, policy_version 137813 (0.0030) [2024-06-28 03:24:12,018][06909] Updated weights for policy 0, policy_version 137823 (0.0032) [2024-06-28 03:24:13,850][06674] Fps is (10 sec: 47514.1, 60 sec: 43963.7, 300 sec: 44153.8). Total num frames: 2258173952. Throughput: 0: 43774.4. Samples: 2161017220. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-28 03:24:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:24:16,688][06909] Updated weights for policy 0, policy_version 137833 (0.0031) [2024-06-28 03:24:18,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43965.2, 300 sec: 43931.3). Total num frames: 2258354176. Throughput: 0: 43972.5. Samples: 2161292480. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-28 03:24:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 03:24:19,828][06909] Updated weights for policy 0, policy_version 137843 (0.0026) [2024-06-28 03:24:23,852][06674] Fps is (10 sec: 37673.6, 60 sec: 43688.8, 300 sec: 43931.0). Total num frames: 2258550784. Throughput: 0: 43854.6. Samples: 2161550980. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-28 03:24:23,864][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:24:24,019][06909] Updated weights for policy 0, policy_version 137853 (0.0040) [2024-06-28 03:24:27,105][06909] Updated weights for policy 0, policy_version 137863 (0.0024) [2024-06-28 03:24:27,269][06887] Signal inference workers to stop experience collection... (30800 times) [2024-06-28 03:24:27,320][06909] InferenceWorker_p0-w0: stopping experience collection (30800 times) [2024-06-28 03:24:27,386][06887] Signal inference workers to resume experience collection... (30800 times) [2024-06-28 03:24:27,386][06909] InferenceWorker_p0-w0: resuming experience collection (30800 times) [2024-06-28 03:24:28,850][06674] Fps is (10 sec: 47513.7, 60 sec: 43965.2, 300 sec: 44098.0). Total num frames: 2258829312. Throughput: 0: 43956.4. Samples: 2161682420. Policy #0 lag: (min: 1.0, avg: 10.1, max: 21.0) [2024-06-28 03:24:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:24:31,173][06909] Updated weights for policy 0, policy_version 137873 (0.0031) [2024-06-28 03:24:33,850][06674] Fps is (10 sec: 45886.9, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 2259009536. Throughput: 0: 44055.1. Samples: 2161954880. Policy #0 lag: (min: 1.0, avg: 10.1, max: 21.0) [2024-06-28 03:24:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:24:34,434][06909] Updated weights for policy 0, policy_version 137883 (0.0045) [2024-06-28 03:24:38,814][06909] Updated weights for policy 0, policy_version 137893 (0.0032) [2024-06-28 03:24:38,850][06674] Fps is (10 sec: 40959.5, 60 sec: 44238.2, 300 sec: 44042.4). Total num frames: 2259238912. Throughput: 0: 44197.1. Samples: 2162224300. Policy #0 lag: (min: 1.0, avg: 10.1, max: 21.0) [2024-06-28 03:24:38,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:24:41,709][06909] Updated weights for policy 0, policy_version 137903 (0.0040) [2024-06-28 03:24:43,850][06674] Fps is (10 sec: 47513.5, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2259484672. Throughput: 0: 44096.0. Samples: 2162348180. Policy #0 lag: (min: 1.0, avg: 10.1, max: 21.0) [2024-06-28 03:24:43,853][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:24:46,270][06909] Updated weights for policy 0, policy_version 137913 (0.0039) [2024-06-28 03:24:48,850][06674] Fps is (10 sec: 47514.1, 60 sec: 44782.9, 300 sec: 44098.0). Total num frames: 2259714048. Throughput: 0: 44238.7. Samples: 2162619380. Policy #0 lag: (min: 1.0, avg: 10.1, max: 21.0) [2024-06-28 03:24:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:24:49,321][06909] Updated weights for policy 0, policy_version 137923 (0.0037) [2024-06-28 03:24:53,645][06909] Updated weights for policy 0, policy_version 137933 (0.0027) [2024-06-28 03:24:53,850][06674] Fps is (10 sec: 40960.4, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2259894272. Throughput: 0: 44225.0. Samples: 2162883840. Policy #0 lag: (min: 1.0, avg: 10.1, max: 21.0) [2024-06-28 03:24:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:24:56,603][06909] Updated weights for policy 0, policy_version 137943 (0.0027) [2024-06-28 03:24:58,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2260140032. Throughput: 0: 44361.8. Samples: 2163013500. Policy #0 lag: (min: 1.0, avg: 10.1, max: 21.0) [2024-06-28 03:24:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:25:00,682][06909] Updated weights for policy 0, policy_version 137953 (0.0040) [2024-06-28 03:25:03,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44510.0, 300 sec: 44098.0). Total num frames: 2260369408. Throughput: 0: 44299.6. Samples: 2163285960. Policy #0 lag: (min: 1.0, avg: 10.1, max: 21.0) [2024-06-28 03:25:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:25:03,968][06909] Updated weights for policy 0, policy_version 137963 (0.0036) [2024-06-28 03:25:08,240][06909] Updated weights for policy 0, policy_version 137973 (0.0040) [2024-06-28 03:25:08,850][06674] Fps is (10 sec: 42597.8, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2260566016. Throughput: 0: 44305.1. Samples: 2163544600. Policy #0 lag: (min: 1.0, avg: 10.1, max: 21.0) [2024-06-28 03:25:08,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:25:11,574][06909] Updated weights for policy 0, policy_version 137983 (0.0026) [2024-06-28 03:25:13,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.7, 300 sec: 43987.2). Total num frames: 2260795392. Throughput: 0: 44234.7. Samples: 2163672980. Policy #0 lag: (min: 1.0, avg: 10.1, max: 21.0) [2024-06-28 03:25:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:25:15,951][06909] Updated weights for policy 0, policy_version 137993 (0.0039) [2024-06-28 03:25:18,850][06674] Fps is (10 sec: 45876.0, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 2261024768. Throughput: 0: 44275.6. Samples: 2163947280. Policy #0 lag: (min: 1.0, avg: 10.1, max: 21.0) [2024-06-28 03:25:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:25:18,861][06909] Updated weights for policy 0, policy_version 138003 (0.0028) [2024-06-28 03:25:23,137][06909] Updated weights for policy 0, policy_version 138013 (0.0033) [2024-06-28 03:25:23,850][06674] Fps is (10 sec: 42597.9, 60 sec: 44511.7, 300 sec: 43986.9). Total num frames: 2261221376. Throughput: 0: 44169.4. Samples: 2164211920. Policy #0 lag: (min: 1.0, avg: 10.1, max: 21.0) [2024-06-28 03:25:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:25:26,188][06909] Updated weights for policy 0, policy_version 138023 (0.0028) [2024-06-28 03:25:28,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2261450752. Throughput: 0: 44088.5. Samples: 2164332160. Policy #0 lag: (min: 1.0, avg: 10.1, max: 21.0) [2024-06-28 03:25:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:25:30,505][06909] Updated weights for policy 0, policy_version 138033 (0.0032) [2024-06-28 03:25:31,933][06887] Signal inference workers to stop experience collection... (30850 times) [2024-06-28 03:25:31,934][06887] Signal inference workers to resume experience collection... (30850 times) [2024-06-28 03:25:31,960][06909] InferenceWorker_p0-w0: stopping experience collection (30850 times) [2024-06-28 03:25:31,961][06909] InferenceWorker_p0-w0: resuming experience collection (30850 times) [2024-06-28 03:25:33,723][06909] Updated weights for policy 0, policy_version 138043 (0.0031) [2024-06-28 03:25:33,850][06674] Fps is (10 sec: 47514.1, 60 sec: 44783.0, 300 sec: 44153.5). Total num frames: 2261696512. Throughput: 0: 44245.9. Samples: 2164610440. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 03:25:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:25:37,791][06909] Updated weights for policy 0, policy_version 138053 (0.0046) [2024-06-28 03:25:38,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 2261893120. Throughput: 0: 44259.4. Samples: 2164875520. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 03:25:38,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 03:25:41,018][06909] Updated weights for policy 0, policy_version 138063 (0.0038) [2024-06-28 03:25:43,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2262122496. Throughput: 0: 44151.1. Samples: 2165000300. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 03:25:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 03:25:45,068][06909] Updated weights for policy 0, policy_version 138073 (0.0041) [2024-06-28 03:25:48,385][06909] Updated weights for policy 0, policy_version 138083 (0.0037) [2024-06-28 03:25:48,850][06674] Fps is (10 sec: 50790.9, 60 sec: 44783.0, 300 sec: 44209.0). Total num frames: 2262401024. Throughput: 0: 44338.2. Samples: 2165281180. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 03:25:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:25:48,857][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000138086_2262401024.pth... [2024-06-28 03:25:48,899][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000137437_2251767808.pth [2024-06-28 03:25:52,618][06909] Updated weights for policy 0, policy_version 138093 (0.0027) [2024-06-28 03:25:53,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2262548480. Throughput: 0: 44589.0. Samples: 2165551100. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 03:25:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:25:55,656][06909] Updated weights for policy 0, policy_version 138103 (0.0029) [2024-06-28 03:25:58,850][06674] Fps is (10 sec: 39321.6, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2262794240. Throughput: 0: 44272.9. Samples: 2165665260. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 03:25:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:25:59,893][06909] Updated weights for policy 0, policy_version 138113 (0.0029) [2024-06-28 03:26:03,515][06909] Updated weights for policy 0, policy_version 138123 (0.0029) [2024-06-28 03:26:03,850][06674] Fps is (10 sec: 49151.7, 60 sec: 44509.8, 300 sec: 44209.0). Total num frames: 2263040000. Throughput: 0: 44160.4. Samples: 2165934500. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 03:26:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:26:07,292][06909] Updated weights for policy 0, policy_version 138133 (0.0030) [2024-06-28 03:26:08,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43963.9, 300 sec: 43986.9). Total num frames: 2263203840. Throughput: 0: 44231.2. Samples: 2166202320. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 03:26:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:26:10,771][06909] Updated weights for policy 0, policy_version 138143 (0.0031) [2024-06-28 03:26:13,850][06674] Fps is (10 sec: 40960.2, 60 sec: 44236.8, 300 sec: 44042.7). Total num frames: 2263449600. Throughput: 0: 44303.6. Samples: 2166325820. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 03:26:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:26:14,702][06909] Updated weights for policy 0, policy_version 138153 (0.0033) [2024-06-28 03:26:18,353][06909] Updated weights for policy 0, policy_version 138163 (0.0045) [2024-06-28 03:26:18,850][06674] Fps is (10 sec: 49152.0, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 2263695360. Throughput: 0: 44260.4. Samples: 2166602160. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 03:26:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 03:26:22,440][06909] Updated weights for policy 0, policy_version 138173 (0.0021) [2024-06-28 03:26:23,850][06674] Fps is (10 sec: 40959.2, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2263859200. Throughput: 0: 44128.3. Samples: 2166861300. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 03:26:23,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:26:25,690][06909] Updated weights for policy 0, policy_version 138183 (0.0030) [2024-06-28 03:26:26,002][06887] Signal inference workers to stop experience collection... (30900 times) [2024-06-28 03:26:26,002][06887] Signal inference workers to resume experience collection... (30900 times) [2024-06-28 03:26:26,047][06909] InferenceWorker_p0-w0: stopping experience collection (30900 times) [2024-06-28 03:26:26,047][06909] InferenceWorker_p0-w0: resuming experience collection (30900 times) [2024-06-28 03:26:28,850][06674] Fps is (10 sec: 40960.0, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2264104960. Throughput: 0: 44049.8. Samples: 2166982540. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 03:26:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:26:29,604][06909] Updated weights for policy 0, policy_version 138193 (0.0035) [2024-06-28 03:26:32,902][06909] Updated weights for policy 0, policy_version 138203 (0.0036) [2024-06-28 03:26:33,850][06674] Fps is (10 sec: 52429.9, 60 sec: 44782.9, 300 sec: 44264.9). Total num frames: 2264383488. Throughput: 0: 43951.1. Samples: 2167258980. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2024-06-28 03:26:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 03:26:36,741][06909] Updated weights for policy 0, policy_version 138213 (0.0030) [2024-06-28 03:26:38,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2264530944. Throughput: 0: 43777.7. Samples: 2167521100. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2024-06-28 03:26:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 03:26:40,662][06909] Updated weights for policy 0, policy_version 138223 (0.0034) [2024-06-28 03:26:43,850][06674] Fps is (10 sec: 39321.6, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2264776704. Throughput: 0: 43923.6. Samples: 2167641820. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2024-06-28 03:26:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:26:44,681][06909] Updated weights for policy 0, policy_version 138233 (0.0031) [2024-06-28 03:26:47,993][06909] Updated weights for policy 0, policy_version 138243 (0.0025) [2024-06-28 03:26:48,852][06674] Fps is (10 sec: 47504.1, 60 sec: 43416.1, 300 sec: 44209.0). Total num frames: 2265006080. Throughput: 0: 43900.2. Samples: 2167910100. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2024-06-28 03:26:48,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:26:52,287][06909] Updated weights for policy 0, policy_version 138253 (0.0034) [2024-06-28 03:26:53,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2265186304. Throughput: 0: 43937.7. Samples: 2168179520. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2024-06-28 03:26:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 03:26:55,483][06909] Updated weights for policy 0, policy_version 138263 (0.0037) [2024-06-28 03:26:58,852][06674] Fps is (10 sec: 42598.6, 60 sec: 43962.2, 300 sec: 44097.6). Total num frames: 2265432064. Throughput: 0: 43910.0. Samples: 2168301860. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2024-06-28 03:26:58,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:26:59,516][06909] Updated weights for policy 0, policy_version 138273 (0.0031) [2024-06-28 03:27:02,739][06909] Updated weights for policy 0, policy_version 138283 (0.0022) [2024-06-28 03:27:03,850][06674] Fps is (10 sec: 49151.7, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2265677824. Throughput: 0: 43884.3. Samples: 2168576960. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2024-06-28 03:27:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:27:06,942][06909] Updated weights for policy 0, policy_version 138293 (0.0031) [2024-06-28 03:27:08,850][06674] Fps is (10 sec: 42607.2, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2265858048. Throughput: 0: 44162.4. Samples: 2168848600. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2024-06-28 03:27:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:27:09,907][06909] Updated weights for policy 0, policy_version 138303 (0.0028) [2024-06-28 03:27:13,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43963.8, 300 sec: 44098.3). Total num frames: 2266087424. Throughput: 0: 44221.8. Samples: 2168972520. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2024-06-28 03:27:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:27:14,234][06909] Updated weights for policy 0, policy_version 138313 (0.0035) [2024-06-28 03:27:17,405][06909] Updated weights for policy 0, policy_version 138323 (0.0035) [2024-06-28 03:27:18,850][06674] Fps is (10 sec: 49151.7, 60 sec: 44236.7, 300 sec: 44264.6). Total num frames: 2266349568. Throughput: 0: 44127.0. Samples: 2169244700. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2024-06-28 03:27:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:27:21,348][06909] Updated weights for policy 0, policy_version 138333 (0.0035) [2024-06-28 03:27:23,850][06674] Fps is (10 sec: 47513.4, 60 sec: 45056.1, 300 sec: 44153.5). Total num frames: 2266562560. Throughput: 0: 44441.4. Samples: 2169520960. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2024-06-28 03:27:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:27:24,538][06909] Updated weights for policy 0, policy_version 138343 (0.0031) [2024-06-28 03:27:28,850][06674] Fps is (10 sec: 40960.4, 60 sec: 44236.8, 300 sec: 44153.8). Total num frames: 2266759168. Throughput: 0: 44628.5. Samples: 2169650100. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2024-06-28 03:27:28,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 03:27:28,968][06909] Updated weights for policy 0, policy_version 138353 (0.0043) [2024-06-28 03:27:32,364][06909] Updated weights for policy 0, policy_version 138363 (0.0041) [2024-06-28 03:27:33,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43417.6, 300 sec: 44153.5). Total num frames: 2266988544. Throughput: 0: 44456.3. Samples: 2169910540. Policy #0 lag: (min: 0.0, avg: 7.9, max: 19.0) [2024-06-28 03:27:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:27:36,837][06909] Updated weights for policy 0, policy_version 138373 (0.0036) [2024-06-28 03:27:37,689][06887] Signal inference workers to stop experience collection... (30950 times) [2024-06-28 03:27:37,689][06887] Signal inference workers to resume experience collection... (30950 times) [2024-06-28 03:27:37,735][06909] InferenceWorker_p0-w0: stopping experience collection (30950 times) [2024-06-28 03:27:37,735][06909] InferenceWorker_p0-w0: resuming experience collection (30950 times) [2024-06-28 03:27:38,850][06674] Fps is (10 sec: 44236.2, 60 sec: 44509.9, 300 sec: 44097.9). Total num frames: 2267201536. Throughput: 0: 44430.1. Samples: 2170178880. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 03:27:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:27:39,554][06909] Updated weights for policy 0, policy_version 138383 (0.0021) [2024-06-28 03:27:43,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2267414528. Throughput: 0: 44585.2. Samples: 2170308100. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 03:27:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:27:44,037][06909] Updated weights for policy 0, policy_version 138393 (0.0026) [2024-06-28 03:27:47,121][06909] Updated weights for policy 0, policy_version 138403 (0.0035) [2024-06-28 03:27:48,850][06674] Fps is (10 sec: 45875.7, 60 sec: 44238.3, 300 sec: 44209.0). Total num frames: 2267660288. Throughput: 0: 44178.7. Samples: 2170565000. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 03:27:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:27:48,862][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000138407_2267660288.pth... [2024-06-28 03:27:48,917][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000137758_2257027072.pth [2024-06-28 03:27:51,431][06909] Updated weights for policy 0, policy_version 138413 (0.0029) [2024-06-28 03:27:53,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44783.0, 300 sec: 44042.4). Total num frames: 2267873280. Throughput: 0: 44132.4. Samples: 2170834560. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 03:27:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:27:54,440][06909] Updated weights for policy 0, policy_version 138423 (0.0028) [2024-06-28 03:27:58,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43965.2, 300 sec: 44098.0). Total num frames: 2268069888. Throughput: 0: 44311.0. Samples: 2170966520. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 03:27:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:27:58,870][06909] Updated weights for policy 0, policy_version 138433 (0.0021) [2024-06-28 03:28:01,721][06909] Updated weights for policy 0, policy_version 138443 (0.0043) [2024-06-28 03:28:03,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.7, 300 sec: 44209.0). Total num frames: 2268299264. Throughput: 0: 44083.1. Samples: 2171228440. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 03:28:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:28:06,159][06909] Updated weights for policy 0, policy_version 138453 (0.0037) [2024-06-28 03:28:08,850][06674] Fps is (10 sec: 47514.0, 60 sec: 44783.0, 300 sec: 44098.0). Total num frames: 2268545024. Throughput: 0: 43940.9. Samples: 2171498300. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 03:28:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:28:09,234][06909] Updated weights for policy 0, policy_version 138463 (0.0026) [2024-06-28 03:28:13,640][06909] Updated weights for policy 0, policy_version 138473 (0.0044) [2024-06-28 03:28:13,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44236.8, 300 sec: 44153.8). Total num frames: 2268741632. Throughput: 0: 44027.2. Samples: 2171631320. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 03:28:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:28:16,556][06909] Updated weights for policy 0, policy_version 138483 (0.0041) [2024-06-28 03:28:18,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43690.6, 300 sec: 44209.0). Total num frames: 2268971008. Throughput: 0: 44038.6. Samples: 2171892280. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 03:28:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:28:21,404][06909] Updated weights for policy 0, policy_version 138493 (0.0030) [2024-06-28 03:28:23,850][06674] Fps is (10 sec: 47513.0, 60 sec: 44236.8, 300 sec: 44153.8). Total num frames: 2269216768. Throughput: 0: 44002.3. Samples: 2172158980. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 03:28:23,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 03:28:24,157][06909] Updated weights for policy 0, policy_version 138503 (0.0029) [2024-06-28 03:28:28,812][06909] Updated weights for policy 0, policy_version 138513 (0.0036) [2024-06-28 03:28:28,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.6, 300 sec: 44153.5). Total num frames: 2269396992. Throughput: 0: 44073.6. Samples: 2172291420. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 03:28:28,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:28:31,535][06909] Updated weights for policy 0, policy_version 138523 (0.0027) [2024-06-28 03:28:33,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43963.7, 300 sec: 44209.3). Total num frames: 2269626368. Throughput: 0: 44034.6. Samples: 2172546560. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 03:28:33,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:28:36,186][06909] Updated weights for policy 0, policy_version 138533 (0.0039) [2024-06-28 03:28:38,850][06674] Fps is (10 sec: 47514.7, 60 sec: 44510.0, 300 sec: 44098.0). Total num frames: 2269872128. Throughput: 0: 44178.3. Samples: 2172822580. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 03:28:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:28:38,882][06909] Updated weights for policy 0, policy_version 138543 (0.0036) [2024-06-28 03:28:43,551][06909] Updated weights for policy 0, policy_version 138553 (0.0037) [2024-06-28 03:28:43,851][06674] Fps is (10 sec: 44230.0, 60 sec: 44235.6, 300 sec: 44208.8). Total num frames: 2270068736. Throughput: 0: 44190.5. Samples: 2172955160. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 03:28:43,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:28:46,366][06909] Updated weights for policy 0, policy_version 138563 (0.0034) [2024-06-28 03:28:48,850][06674] Fps is (10 sec: 40959.0, 60 sec: 43690.5, 300 sec: 44209.0). Total num frames: 2270281728. Throughput: 0: 44094.5. Samples: 2173212700. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 03:28:48,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:28:51,242][06909] Updated weights for policy 0, policy_version 138573 (0.0031) [2024-06-28 03:28:53,826][06909] Updated weights for policy 0, policy_version 138583 (0.0025) [2024-06-28 03:28:53,850][06674] Fps is (10 sec: 47520.6, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 2270543872. Throughput: 0: 43967.4. Samples: 2173476840. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 03:28:53,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:28:57,612][06887] Signal inference workers to stop experience collection... (31000 times) [2024-06-28 03:28:57,648][06909] InferenceWorker_p0-w0: stopping experience collection (31000 times) [2024-06-28 03:28:57,671][06887] Signal inference workers to resume experience collection... (31000 times) [2024-06-28 03:28:57,672][06909] InferenceWorker_p0-w0: resuming experience collection (31000 times) [2024-06-28 03:28:58,518][06909] Updated weights for policy 0, policy_version 138593 (0.0028) [2024-06-28 03:28:58,850][06674] Fps is (10 sec: 44237.4, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2270724096. Throughput: 0: 44230.1. Samples: 2173621680. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 03:28:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:29:01,163][06909] Updated weights for policy 0, policy_version 138603 (0.0028) [2024-06-28 03:29:03,852][06674] Fps is (10 sec: 39314.1, 60 sec: 43962.3, 300 sec: 44153.2). Total num frames: 2270937088. Throughput: 0: 44012.8. Samples: 2173872940. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 03:29:03,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:29:05,920][06909] Updated weights for policy 0, policy_version 138613 (0.0032) [2024-06-28 03:29:08,409][06909] Updated weights for policy 0, policy_version 138623 (0.0029) [2024-06-28 03:29:08,850][06674] Fps is (10 sec: 50790.3, 60 sec: 44782.9, 300 sec: 44264.6). Total num frames: 2271232000. Throughput: 0: 44084.4. Samples: 2174142780. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 03:29:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:29:13,401][06909] Updated weights for policy 0, policy_version 138633 (0.0031) [2024-06-28 03:29:13,850][06674] Fps is (10 sec: 44245.7, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2271379456. Throughput: 0: 44242.3. Samples: 2174282320. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 03:29:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:29:15,806][06909] Updated weights for policy 0, policy_version 138643 (0.0033) [2024-06-28 03:29:18,850][06674] Fps is (10 sec: 37683.3, 60 sec: 43963.8, 300 sec: 44265.0). Total num frames: 2271608832. Throughput: 0: 44362.7. Samples: 2174542880. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 03:29:18,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 03:29:20,821][06909] Updated weights for policy 0, policy_version 138653 (0.0037) [2024-06-28 03:29:23,338][06909] Updated weights for policy 0, policy_version 138663 (0.0033) [2024-06-28 03:29:23,850][06674] Fps is (10 sec: 49151.6, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 2271870976. Throughput: 0: 43936.7. Samples: 2174799740. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 03:29:23,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:29:28,304][06909] Updated weights for policy 0, policy_version 138673 (0.0030) [2024-06-28 03:29:28,850][06674] Fps is (10 sec: 44235.6, 60 sec: 44236.7, 300 sec: 44209.0). Total num frames: 2272051200. Throughput: 0: 44155.5. Samples: 2174942100. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 03:29:28,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:29:30,739][06909] Updated weights for policy 0, policy_version 138683 (0.0027) [2024-06-28 03:29:33,850][06674] Fps is (10 sec: 39322.0, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2272264192. Throughput: 0: 44107.7. Samples: 2175197540. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 03:29:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:29:35,766][06909] Updated weights for policy 0, policy_version 138693 (0.0031) [2024-06-28 03:29:38,538][06909] Updated weights for policy 0, policy_version 138703 (0.0028) [2024-06-28 03:29:38,850][06674] Fps is (10 sec: 47514.7, 60 sec: 44236.7, 300 sec: 44209.0). Total num frames: 2272526336. Throughput: 0: 43933.8. Samples: 2175453860. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 03:29:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:29:43,081][06909] Updated weights for policy 0, policy_version 138713 (0.0029) [2024-06-28 03:29:43,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43964.9, 300 sec: 44042.4). Total num frames: 2272706560. Throughput: 0: 43883.1. Samples: 2175596420. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 03:29:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:29:45,738][06909] Updated weights for policy 0, policy_version 138723 (0.0032) [2024-06-28 03:29:48,850][06674] Fps is (10 sec: 39321.4, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2272919552. Throughput: 0: 44106.3. Samples: 2175857640. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 03:29:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 03:29:48,873][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000138728_2272919552.pth... [2024-06-28 03:29:48,921][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000138086_2262401024.pth [2024-06-28 03:29:50,605][06909] Updated weights for policy 0, policy_version 138733 (0.0035) [2024-06-28 03:29:53,095][06909] Updated weights for policy 0, policy_version 138743 (0.0029) [2024-06-28 03:29:53,850][06674] Fps is (10 sec: 49152.1, 60 sec: 44236.9, 300 sec: 44264.6). Total num frames: 2273198080. Throughput: 0: 43877.4. Samples: 2176117260. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 03:29:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:29:58,328][06909] Updated weights for policy 0, policy_version 138753 (0.0034) [2024-06-28 03:29:58,850][06674] Fps is (10 sec: 45875.8, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2273378304. Throughput: 0: 43902.7. Samples: 2176257940. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 03:29:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:29:58,900][06887] Signal inference workers to stop experience collection... (31050 times) [2024-06-28 03:29:58,935][06909] InferenceWorker_p0-w0: stopping experience collection (31050 times) [2024-06-28 03:29:58,959][06887] Signal inference workers to resume experience collection... (31050 times) [2024-06-28 03:29:58,960][06909] InferenceWorker_p0-w0: resuming experience collection (31050 times) [2024-06-28 03:30:00,764][06909] Updated weights for policy 0, policy_version 138763 (0.0025) [2024-06-28 03:30:03,856][06674] Fps is (10 sec: 37660.4, 60 sec: 43960.8, 300 sec: 44097.1). Total num frames: 2273574912. Throughput: 0: 43794.6. Samples: 2176513900. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 03:30:03,856][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:30:05,613][06909] Updated weights for policy 0, policy_version 138773 (0.0036) [2024-06-28 03:30:08,199][06909] Updated weights for policy 0, policy_version 138783 (0.0027) [2024-06-28 03:30:08,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43417.6, 300 sec: 44209.0). Total num frames: 2273837056. Throughput: 0: 43827.6. Samples: 2176771980. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 03:30:08,853][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:30:12,961][06909] Updated weights for policy 0, policy_version 138793 (0.0041) [2024-06-28 03:30:13,850][06674] Fps is (10 sec: 44263.3, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2274017280. Throughput: 0: 43729.1. Samples: 2176909900. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 03:30:13,854][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:30:15,860][06909] Updated weights for policy 0, policy_version 138803 (0.0032) [2024-06-28 03:30:18,850][06674] Fps is (10 sec: 39322.2, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 2274230272. Throughput: 0: 43848.5. Samples: 2177170720. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 03:30:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:30:20,191][06909] Updated weights for policy 0, policy_version 138813 (0.0030) [2024-06-28 03:30:23,066][06909] Updated weights for policy 0, policy_version 138823 (0.0031) [2024-06-28 03:30:23,850][06674] Fps is (10 sec: 49152.5, 60 sec: 43963.8, 300 sec: 44264.6). Total num frames: 2274508800. Throughput: 0: 43948.0. Samples: 2177431520. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 03:30:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:30:27,866][06909] Updated weights for policy 0, policy_version 138833 (0.0033) [2024-06-28 03:30:28,850][06674] Fps is (10 sec: 47512.6, 60 sec: 44236.9, 300 sec: 44097.9). Total num frames: 2274705408. Throughput: 0: 44126.5. Samples: 2177582120. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 03:30:28,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:30:30,391][06909] Updated weights for policy 0, policy_version 138843 (0.0035) [2024-06-28 03:30:33,852][06674] Fps is (10 sec: 37675.5, 60 sec: 43689.2, 300 sec: 44042.1). Total num frames: 2274885632. Throughput: 0: 43902.1. Samples: 2177833320. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 03:30:33,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:30:35,403][06909] Updated weights for policy 0, policy_version 138853 (0.0030) [2024-06-28 03:30:37,962][06909] Updated weights for policy 0, policy_version 138863 (0.0036) [2024-06-28 03:30:38,852][06674] Fps is (10 sec: 45866.4, 60 sec: 43962.2, 300 sec: 44208.7). Total num frames: 2275164160. Throughput: 0: 43970.4. Samples: 2178096020. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 03:30:38,853][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:30:42,554][06909] Updated weights for policy 0, policy_version 138873 (0.0031) [2024-06-28 03:30:43,850][06674] Fps is (10 sec: 47522.9, 60 sec: 44236.7, 300 sec: 43931.3). Total num frames: 2275360768. Throughput: 0: 44017.6. Samples: 2178238740. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2024-06-28 03:30:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:30:45,902][06909] Updated weights for policy 0, policy_version 138883 (0.0033) [2024-06-28 03:30:48,850][06674] Fps is (10 sec: 37690.8, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2275540992. Throughput: 0: 44013.0. Samples: 2178494220. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2024-06-28 03:30:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:30:50,054][06909] Updated weights for policy 0, policy_version 138893 (0.0036) [2024-06-28 03:30:53,209][06909] Updated weights for policy 0, policy_version 138903 (0.0031) [2024-06-28 03:30:53,850][06674] Fps is (10 sec: 45875.8, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 2275819520. Throughput: 0: 44011.7. Samples: 2178752500. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2024-06-28 03:30:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:30:57,802][06909] Updated weights for policy 0, policy_version 138913 (0.0036) [2024-06-28 03:30:58,850][06674] Fps is (10 sec: 47513.1, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 2276016128. Throughput: 0: 44147.0. Samples: 2178896520. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2024-06-28 03:30:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:31:00,363][06909] Updated weights for policy 0, policy_version 138923 (0.0032) [2024-06-28 03:31:03,850][06674] Fps is (10 sec: 40960.1, 60 sec: 44241.3, 300 sec: 44153.5). Total num frames: 2276229120. Throughput: 0: 44264.9. Samples: 2179162640. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2024-06-28 03:31:03,850][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 03:31:05,088][06909] Updated weights for policy 0, policy_version 138933 (0.0033) [2024-06-28 03:31:07,774][06909] Updated weights for policy 0, policy_version 138943 (0.0022) [2024-06-28 03:31:08,850][06674] Fps is (10 sec: 45876.1, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2276474880. Throughput: 0: 44005.8. Samples: 2179411780. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2024-06-28 03:31:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:31:12,330][06909] Updated weights for policy 0, policy_version 138953 (0.0036) [2024-06-28 03:31:13,174][06887] Signal inference workers to stop experience collection... (31100 times) [2024-06-28 03:31:13,228][06887] Signal inference workers to resume experience collection... (31100 times) [2024-06-28 03:31:13,232][06909] InferenceWorker_p0-w0: stopping experience collection (31100 times) [2024-06-28 03:31:13,242][06909] InferenceWorker_p0-w0: resuming experience collection (31100 times) [2024-06-28 03:31:13,850][06674] Fps is (10 sec: 44235.8, 60 sec: 44236.7, 300 sec: 43986.8). Total num frames: 2276671488. Throughput: 0: 43788.5. Samples: 2179552600. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2024-06-28 03:31:13,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:31:15,290][06909] Updated weights for policy 0, policy_version 138963 (0.0035) [2024-06-28 03:31:18,852][06674] Fps is (10 sec: 39313.4, 60 sec: 43962.2, 300 sec: 44097.7). Total num frames: 2276868096. Throughput: 0: 43972.0. Samples: 2179812060. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2024-06-28 03:31:18,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:31:20,031][06909] Updated weights for policy 0, policy_version 138973 (0.0027) [2024-06-28 03:31:22,735][06909] Updated weights for policy 0, policy_version 138983 (0.0025) [2024-06-28 03:31:23,850][06674] Fps is (10 sec: 45875.9, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 2277130240. Throughput: 0: 44117.1. Samples: 2180081200. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2024-06-28 03:31:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:31:27,244][06909] Updated weights for policy 0, policy_version 138993 (0.0031) [2024-06-28 03:31:28,850][06674] Fps is (10 sec: 49161.8, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 2277359616. Throughput: 0: 43916.0. Samples: 2180214960. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2024-06-28 03:31:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:31:30,264][06909] Updated weights for policy 0, policy_version 139003 (0.0024) [2024-06-28 03:31:33,850][06674] Fps is (10 sec: 40960.4, 60 sec: 44238.4, 300 sec: 44098.0). Total num frames: 2277539840. Throughput: 0: 44227.7. Samples: 2180484460. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2024-06-28 03:31:33,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 03:31:34,442][06909] Updated weights for policy 0, policy_version 139013 (0.0031) [2024-06-28 03:31:37,393][06909] Updated weights for policy 0, policy_version 139023 (0.0026) [2024-06-28 03:31:38,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43692.2, 300 sec: 44097.9). Total num frames: 2277785600. Throughput: 0: 44260.8. Samples: 2180744240. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2024-06-28 03:31:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:31:41,795][06909] Updated weights for policy 0, policy_version 139033 (0.0038) [2024-06-28 03:31:43,850][06674] Fps is (10 sec: 47513.1, 60 sec: 44236.9, 300 sec: 44098.3). Total num frames: 2278014976. Throughput: 0: 44135.7. Samples: 2180882620. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2024-06-28 03:31:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:31:44,915][06909] Updated weights for policy 0, policy_version 139043 (0.0035) [2024-06-28 03:31:48,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 2278211584. Throughput: 0: 44235.9. Samples: 2181153260. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 03:31:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:31:48,860][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000139052_2278227968.pth... [2024-06-28 03:31:48,907][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000138407_2267660288.pth [2024-06-28 03:31:49,308][06909] Updated weights for policy 0, policy_version 139053 (0.0041) [2024-06-28 03:31:52,530][06909] Updated weights for policy 0, policy_version 139063 (0.0028) [2024-06-28 03:31:53,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.7, 300 sec: 44153.8). Total num frames: 2278457344. Throughput: 0: 44360.0. Samples: 2181407980. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 03:31:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:31:56,956][06909] Updated weights for policy 0, policy_version 139073 (0.0032) [2024-06-28 03:31:58,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 2278670336. Throughput: 0: 44251.7. Samples: 2181543920. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 03:31:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:32:00,232][06909] Updated weights for policy 0, policy_version 139083 (0.0037) [2024-06-28 03:32:03,850][06674] Fps is (10 sec: 42597.9, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 2278883328. Throughput: 0: 44305.9. Samples: 2181805740. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 03:32:03,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:32:04,167][06909] Updated weights for policy 0, policy_version 139093 (0.0027) [2024-06-28 03:32:07,486][06909] Updated weights for policy 0, policy_version 139103 (0.0040) [2024-06-28 03:32:08,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.6, 300 sec: 44097.9). Total num frames: 2279096320. Throughput: 0: 44102.6. Samples: 2182065820. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 03:32:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:32:11,496][06909] Updated weights for policy 0, policy_version 139113 (0.0033) [2024-06-28 03:32:13,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44510.0, 300 sec: 44042.4). Total num frames: 2279342080. Throughput: 0: 44144.5. Samples: 2182201460. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 03:32:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:32:14,706][06909] Updated weights for policy 0, policy_version 139123 (0.0028) [2024-06-28 03:32:18,850][06674] Fps is (10 sec: 44237.4, 60 sec: 44511.5, 300 sec: 43986.9). Total num frames: 2279538688. Throughput: 0: 44208.9. Samples: 2182473860. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 03:32:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:32:18,871][06909] Updated weights for policy 0, policy_version 139133 (0.0033) [2024-06-28 03:32:21,874][06909] Updated weights for policy 0, policy_version 139143 (0.0032) [2024-06-28 03:32:23,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2279768064. Throughput: 0: 44220.9. Samples: 2182734180. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 03:32:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:32:26,533][06909] Updated weights for policy 0, policy_version 139153 (0.0032) [2024-06-28 03:32:28,850][06674] Fps is (10 sec: 47513.0, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2280013824. Throughput: 0: 44142.2. Samples: 2182869020. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 03:32:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:32:29,616][06909] Updated weights for policy 0, policy_version 139163 (0.0024) [2024-06-28 03:32:30,413][06887] Signal inference workers to stop experience collection... (31150 times) [2024-06-28 03:32:30,471][06909] InferenceWorker_p0-w0: stopping experience collection (31150 times) [2024-06-28 03:32:30,473][06887] Signal inference workers to resume experience collection... (31150 times) [2024-06-28 03:32:30,490][06909] InferenceWorker_p0-w0: resuming experience collection (31150 times) [2024-06-28 03:32:33,734][06909] Updated weights for policy 0, policy_version 139173 (0.0034) [2024-06-28 03:32:33,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44509.8, 300 sec: 44098.0). Total num frames: 2280210432. Throughput: 0: 44107.6. Samples: 2183138100. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 03:32:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:32:37,245][06909] Updated weights for policy 0, policy_version 139183 (0.0038) [2024-06-28 03:32:38,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2280423424. Throughput: 0: 44128.8. Samples: 2183393780. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 03:32:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:32:41,066][06909] Updated weights for policy 0, policy_version 139193 (0.0031) [2024-06-28 03:32:43,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2280652800. Throughput: 0: 44066.3. Samples: 2183526900. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 03:32:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:32:44,530][06909] Updated weights for policy 0, policy_version 139203 (0.0040) [2024-06-28 03:32:48,444][06909] Updated weights for policy 0, policy_version 139213 (0.0047) [2024-06-28 03:32:48,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44509.8, 300 sec: 44097.9). Total num frames: 2280882176. Throughput: 0: 44214.7. Samples: 2183795400. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 03:32:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:32:51,809][06909] Updated weights for policy 0, policy_version 139223 (0.0037) [2024-06-28 03:32:53,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 2281078784. Throughput: 0: 44187.2. Samples: 2184054240. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 03:32:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:32:55,965][06909] Updated weights for policy 0, policy_version 139233 (0.0035) [2024-06-28 03:32:58,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2281308160. Throughput: 0: 44061.8. Samples: 2184184240. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 03:32:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:32:59,264][06909] Updated weights for policy 0, policy_version 139243 (0.0029) [2024-06-28 03:33:03,706][06909] Updated weights for policy 0, policy_version 139253 (0.0030) [2024-06-28 03:33:03,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2281521152. Throughput: 0: 43963.0. Samples: 2184452200. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 03:33:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:33:07,000][06909] Updated weights for policy 0, policy_version 139263 (0.0037) [2024-06-28 03:33:08,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2281734144. Throughput: 0: 44011.1. Samples: 2184714680. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 03:33:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:33:11,097][06909] Updated weights for policy 0, policy_version 139273 (0.0036) [2024-06-28 03:33:13,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2281963520. Throughput: 0: 43895.2. Samples: 2184844300. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 03:33:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:33:14,459][06909] Updated weights for policy 0, policy_version 139283 (0.0036) [2024-06-28 03:33:18,364][06909] Updated weights for policy 0, policy_version 139293 (0.0051) [2024-06-28 03:33:18,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44236.6, 300 sec: 43986.9). Total num frames: 2282192896. Throughput: 0: 43864.7. Samples: 2185112020. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 03:33:18,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:33:21,735][06909] Updated weights for policy 0, policy_version 139303 (0.0036) [2024-06-28 03:33:23,852][06674] Fps is (10 sec: 44227.5, 60 sec: 43962.3, 300 sec: 44097.7). Total num frames: 2282405888. Throughput: 0: 44047.4. Samples: 2185376000. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 03:33:23,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:33:25,756][06909] Updated weights for policy 0, policy_version 139313 (0.0035) [2024-06-28 03:33:28,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 2282635264. Throughput: 0: 43874.2. Samples: 2185501240. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 03:33:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:33:29,103][06909] Updated weights for policy 0, policy_version 139323 (0.0035) [2024-06-28 03:33:33,234][06909] Updated weights for policy 0, policy_version 139333 (0.0022) [2024-06-28 03:33:33,850][06674] Fps is (10 sec: 42607.2, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 2282831872. Throughput: 0: 43949.8. Samples: 2185773140. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 03:33:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:33:36,648][06909] Updated weights for policy 0, policy_version 139343 (0.0045) [2024-06-28 03:33:38,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43690.6, 300 sec: 43987.1). Total num frames: 2283044864. Throughput: 0: 43998.1. Samples: 2186034160. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 03:33:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:33:40,647][06909] Updated weights for policy 0, policy_version 139353 (0.0031) [2024-06-28 03:33:43,852][06674] Fps is (10 sec: 47504.0, 60 sec: 44235.3, 300 sec: 44153.2). Total num frames: 2283307008. Throughput: 0: 43988.7. Samples: 2186163820. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 03:33:43,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:33:44,267][06909] Updated weights for policy 0, policy_version 139363 (0.0039) [2024-06-28 03:33:48,341][06909] Updated weights for policy 0, policy_version 139373 (0.0028) [2024-06-28 03:33:48,850][06674] Fps is (10 sec: 45875.9, 60 sec: 43690.8, 300 sec: 43931.4). Total num frames: 2283503616. Throughput: 0: 43870.8. Samples: 2186426380. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 03:33:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:33:48,855][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000139374_2283503616.pth... [2024-06-28 03:33:48,918][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000138728_2272919552.pth [2024-06-28 03:33:52,082][06909] Updated weights for policy 0, policy_version 139383 (0.0035) [2024-06-28 03:33:53,850][06674] Fps is (10 sec: 40967.8, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 2283716608. Throughput: 0: 43963.1. Samples: 2186693020. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 03:33:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:33:55,545][06909] Updated weights for policy 0, policy_version 139393 (0.0033) [2024-06-28 03:33:58,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44236.8, 300 sec: 44153.8). Total num frames: 2283962368. Throughput: 0: 43889.3. Samples: 2186819320. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 03:33:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 03:33:59,199][06909] Updated weights for policy 0, policy_version 139403 (0.0022) [2024-06-28 03:34:02,464][06887] Signal inference workers to stop experience collection... (31200 times) [2024-06-28 03:34:02,519][06909] InferenceWorker_p0-w0: stopping experience collection (31200 times) [2024-06-28 03:34:02,525][06887] Signal inference workers to resume experience collection... (31200 times) [2024-06-28 03:34:02,538][06909] InferenceWorker_p0-w0: resuming experience collection (31200 times) [2024-06-28 03:34:02,913][06909] Updated weights for policy 0, policy_version 139413 (0.0027) [2024-06-28 03:34:03,852][06674] Fps is (10 sec: 44228.6, 60 sec: 43962.3, 300 sec: 43820.0). Total num frames: 2284158976. Throughput: 0: 44064.9. Samples: 2187095020. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 03:34:03,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:34:06,364][06909] Updated weights for policy 0, policy_version 139423 (0.0032) [2024-06-28 03:34:08,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2284371968. Throughput: 0: 44118.9. Samples: 2187361260. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 03:34:08,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 03:34:10,482][06909] Updated weights for policy 0, policy_version 139433 (0.0036) [2024-06-28 03:34:13,850][06674] Fps is (10 sec: 45883.6, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2284617728. Throughput: 0: 44159.9. Samples: 2187488440. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 03:34:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:34:13,953][06909] Updated weights for policy 0, policy_version 139443 (0.0036) [2024-06-28 03:34:17,882][06909] Updated weights for policy 0, policy_version 139453 (0.0039) [2024-06-28 03:34:18,854][06674] Fps is (10 sec: 45858.1, 60 sec: 43961.2, 300 sec: 43930.8). Total num frames: 2284830720. Throughput: 0: 43984.8. Samples: 2187752620. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 03:34:18,854][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:34:21,688][06909] Updated weights for policy 0, policy_version 139463 (0.0038) [2024-06-28 03:34:23,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43692.2, 300 sec: 43986.9). Total num frames: 2285027328. Throughput: 0: 44118.3. Samples: 2188019480. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 03:34:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:34:25,469][06909] Updated weights for policy 0, policy_version 139473 (0.0027) [2024-06-28 03:34:28,852][06674] Fps is (10 sec: 44244.0, 60 sec: 43962.2, 300 sec: 44097.6). Total num frames: 2285273088. Throughput: 0: 43870.2. Samples: 2188137980. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 03:34:28,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:34:29,041][06909] Updated weights for policy 0, policy_version 139483 (0.0042) [2024-06-28 03:34:32,841][06909] Updated weights for policy 0, policy_version 139493 (0.0033) [2024-06-28 03:34:33,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44236.7, 300 sec: 43931.3). Total num frames: 2285486080. Throughput: 0: 44120.7. Samples: 2188411820. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 03:34:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:34:36,351][06909] Updated weights for policy 0, policy_version 139503 (0.0030) [2024-06-28 03:34:38,850][06674] Fps is (10 sec: 40968.2, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2285682688. Throughput: 0: 44026.7. Samples: 2188674220. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 03:34:38,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:34:40,290][06909] Updated weights for policy 0, policy_version 139513 (0.0038) [2024-06-28 03:34:43,698][06909] Updated weights for policy 0, policy_version 139523 (0.0049) [2024-06-28 03:34:43,850][06674] Fps is (10 sec: 45875.8, 60 sec: 43965.2, 300 sec: 44153.5). Total num frames: 2285944832. Throughput: 0: 44182.7. Samples: 2188807540. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 03:34:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:34:48,119][06909] Updated weights for policy 0, policy_version 139533 (0.0031) [2024-06-28 03:34:48,850][06674] Fps is (10 sec: 49152.5, 60 sec: 44509.9, 300 sec: 43986.9). Total num frames: 2286174208. Throughput: 0: 43982.0. Samples: 2189074120. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 03:34:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:34:50,988][06909] Updated weights for policy 0, policy_version 139543 (0.0040) [2024-06-28 03:34:53,852][06674] Fps is (10 sec: 40951.5, 60 sec: 43962.3, 300 sec: 43986.6). Total num frames: 2286354432. Throughput: 0: 44003.7. Samples: 2189341520. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 03:34:53,852][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 03:34:55,374][06909] Updated weights for policy 0, policy_version 139553 (0.0033) [2024-06-28 03:34:58,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43690.7, 300 sec: 44098.9). Total num frames: 2286583808. Throughput: 0: 43841.9. Samples: 2189461320. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 03:34:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:34:59,318][06909] Updated weights for policy 0, policy_version 139563 (0.0039) [2024-06-28 03:35:02,835][06909] Updated weights for policy 0, policy_version 139573 (0.0036) [2024-06-28 03:35:03,850][06674] Fps is (10 sec: 45884.5, 60 sec: 44238.2, 300 sec: 43986.9). Total num frames: 2286813184. Throughput: 0: 43936.0. Samples: 2189729580. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 03:35:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:35:06,709][06909] Updated weights for policy 0, policy_version 139583 (0.0041) [2024-06-28 03:35:08,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2287009792. Throughput: 0: 43826.7. Samples: 2189991680. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 03:35:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:35:10,147][06909] Updated weights for policy 0, policy_version 139593 (0.0031) [2024-06-28 03:35:13,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.8, 300 sec: 44097.9). Total num frames: 2287239168. Throughput: 0: 44037.2. Samples: 2190119560. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 03:35:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:35:13,866][06909] Updated weights for policy 0, policy_version 139603 (0.0039) [2024-06-28 03:35:17,753][06909] Updated weights for policy 0, policy_version 139613 (0.0037) [2024-06-28 03:35:18,087][06887] Signal inference workers to stop experience collection... (31250 times) [2024-06-28 03:35:18,088][06887] Signal inference workers to resume experience collection... (31250 times) [2024-06-28 03:35:18,136][06909] InferenceWorker_p0-w0: stopping experience collection (31250 times) [2024-06-28 03:35:18,136][06909] InferenceWorker_p0-w0: resuming experience collection (31250 times) [2024-06-28 03:35:18,850][06674] Fps is (10 sec: 47512.8, 60 sec: 44239.4, 300 sec: 43986.9). Total num frames: 2287484928. Throughput: 0: 43926.6. Samples: 2190388520. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 03:35:18,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:35:21,187][06909] Updated weights for policy 0, policy_version 139623 (0.0031) [2024-06-28 03:35:23,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.8, 300 sec: 43931.4). Total num frames: 2287665152. Throughput: 0: 44123.2. Samples: 2190659760. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 03:35:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:35:25,195][06909] Updated weights for policy 0, policy_version 139633 (0.0026) [2024-06-28 03:35:28,510][06909] Updated weights for policy 0, policy_version 139643 (0.0036) [2024-06-28 03:35:28,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43965.2, 300 sec: 44153.8). Total num frames: 2287910912. Throughput: 0: 43921.7. Samples: 2190784020. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 03:35:28,853][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:35:32,423][06909] Updated weights for policy 0, policy_version 139653 (0.0028) [2024-06-28 03:35:33,852][06674] Fps is (10 sec: 47501.9, 60 sec: 44235.1, 300 sec: 43986.8). Total num frames: 2288140288. Throughput: 0: 43835.0. Samples: 2191046800. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 03:35:33,853][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 03:35:36,190][06909] Updated weights for policy 0, policy_version 139663 (0.0037) [2024-06-28 03:35:38,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 2288353280. Throughput: 0: 43760.7. Samples: 2191310660. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 03:35:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:35:40,117][06909] Updated weights for policy 0, policy_version 139673 (0.0033) [2024-06-28 03:35:43,822][06909] Updated weights for policy 0, policy_version 139683 (0.0033) [2024-06-28 03:35:43,850][06674] Fps is (10 sec: 42608.7, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 2288566272. Throughput: 0: 43858.3. Samples: 2191434940. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 03:35:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:35:47,399][06909] Updated weights for policy 0, policy_version 139693 (0.0028) [2024-06-28 03:35:48,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 2288795648. Throughput: 0: 43959.6. Samples: 2191707760. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 03:35:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:35:48,954][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000139698_2288812032.pth... [2024-06-28 03:35:49,003][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000139052_2278227968.pth [2024-06-28 03:35:51,214][06909] Updated weights for policy 0, policy_version 139703 (0.0033) [2024-06-28 03:35:53,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43965.2, 300 sec: 43986.9). Total num frames: 2288992256. Throughput: 0: 44044.9. Samples: 2191973700. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 03:35:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:35:55,221][06909] Updated weights for policy 0, policy_version 139713 (0.0042) [2024-06-28 03:35:58,843][06909] Updated weights for policy 0, policy_version 139723 (0.0027) [2024-06-28 03:35:58,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2289221632. Throughput: 0: 43930.3. Samples: 2192096420. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 03:35:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:36:02,569][06909] Updated weights for policy 0, policy_version 139733 (0.0031) [2024-06-28 03:36:03,851][06674] Fps is (10 sec: 47506.7, 60 sec: 44235.7, 300 sec: 44042.2). Total num frames: 2289467392. Throughput: 0: 44047.6. Samples: 2192370720. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 03:36:03,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:36:06,365][06909] Updated weights for policy 0, policy_version 139743 (0.0035) [2024-06-28 03:36:08,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2289664000. Throughput: 0: 43757.7. Samples: 2192628860. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 03:36:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:36:09,819][06909] Updated weights for policy 0, policy_version 139753 (0.0030) [2024-06-28 03:36:13,850][06674] Fps is (10 sec: 39327.2, 60 sec: 43690.6, 300 sec: 44042.7). Total num frames: 2289860608. Throughput: 0: 43748.9. Samples: 2192752720. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 03:36:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:36:14,043][06909] Updated weights for policy 0, policy_version 139763 (0.0043) [2024-06-28 03:36:17,212][06909] Updated weights for policy 0, policy_version 139773 (0.0035) [2024-06-28 03:36:18,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2290122752. Throughput: 0: 43940.9. Samples: 2193024040. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 03:36:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:36:21,221][06909] Updated weights for policy 0, policy_version 139783 (0.0029) [2024-06-28 03:36:23,850][06674] Fps is (10 sec: 45875.9, 60 sec: 44236.8, 300 sec: 43931.4). Total num frames: 2290319360. Throughput: 0: 44016.1. Samples: 2193291380. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 03:36:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:36:24,534][06909] Updated weights for policy 0, policy_version 139793 (0.0037) [2024-06-28 03:36:28,405][06909] Updated weights for policy 0, policy_version 139803 (0.0029) [2024-06-28 03:36:28,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2290548736. Throughput: 0: 44188.9. Samples: 2193423440. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 03:36:28,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:36:31,955][06909] Updated weights for policy 0, policy_version 139813 (0.0036) [2024-06-28 03:36:33,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44238.6, 300 sec: 44098.0). Total num frames: 2290794496. Throughput: 0: 44137.0. Samples: 2193693920. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 03:36:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:36:35,888][06909] Updated weights for policy 0, policy_version 139823 (0.0038) [2024-06-28 03:36:38,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2290991104. Throughput: 0: 44172.5. Samples: 2193961460. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 03:36:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:36:39,285][06909] Updated weights for policy 0, policy_version 139833 (0.0026) [2024-06-28 03:36:43,360][06909] Updated weights for policy 0, policy_version 139843 (0.0037) [2024-06-28 03:36:43,850][06674] Fps is (10 sec: 39321.3, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2291187712. Throughput: 0: 44222.6. Samples: 2194086440. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 03:36:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:36:47,143][06909] Updated weights for policy 0, policy_version 139853 (0.0029) [2024-06-28 03:36:48,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2291433472. Throughput: 0: 43905.4. Samples: 2194346400. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 03:36:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:36:51,066][06909] Updated weights for policy 0, policy_version 139863 (0.0029) [2024-06-28 03:36:53,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 2291630080. Throughput: 0: 43933.8. Samples: 2194605880. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 03:36:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:36:54,577][06909] Updated weights for policy 0, policy_version 139873 (0.0047) [2024-06-28 03:36:56,617][06887] Signal inference workers to stop experience collection... (31300 times) [2024-06-28 03:36:56,617][06887] Signal inference workers to resume experience collection... (31300 times) [2024-06-28 03:36:56,657][06909] InferenceWorker_p0-w0: stopping experience collection (31300 times) [2024-06-28 03:36:56,657][06909] InferenceWorker_p0-w0: resuming experience collection (31300 times) [2024-06-28 03:36:58,328][06909] Updated weights for policy 0, policy_version 139883 (0.0024) [2024-06-28 03:36:58,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 2291843072. Throughput: 0: 44104.9. Samples: 2194737440. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 03:36:58,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 03:37:02,091][06909] Updated weights for policy 0, policy_version 139893 (0.0035) [2024-06-28 03:37:03,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43691.7, 300 sec: 44042.4). Total num frames: 2292088832. Throughput: 0: 43989.4. Samples: 2195003560. Policy #0 lag: (min: 1.0, avg: 9.7, max: 22.0) [2024-06-28 03:37:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:37:05,958][06909] Updated weights for policy 0, policy_version 139903 (0.0027) [2024-06-28 03:37:08,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 2292285440. Throughput: 0: 44008.9. Samples: 2195271780. Policy #0 lag: (min: 1.0, avg: 9.7, max: 22.0) [2024-06-28 03:37:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:37:09,436][06909] Updated weights for policy 0, policy_version 139913 (0.0033) [2024-06-28 03:37:13,300][06909] Updated weights for policy 0, policy_version 139923 (0.0034) [2024-06-28 03:37:13,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2292514816. Throughput: 0: 43885.7. Samples: 2195398300. Policy #0 lag: (min: 1.0, avg: 9.7, max: 22.0) [2024-06-28 03:37:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:37:17,011][06909] Updated weights for policy 0, policy_version 139933 (0.0029) [2024-06-28 03:37:18,850][06674] Fps is (10 sec: 47510.7, 60 sec: 43963.4, 300 sec: 44042.3). Total num frames: 2292760576. Throughput: 0: 43666.5. Samples: 2195658940. Policy #0 lag: (min: 1.0, avg: 9.7, max: 22.0) [2024-06-28 03:37:18,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:37:21,238][06909] Updated weights for policy 0, policy_version 139943 (0.0035) [2024-06-28 03:37:23,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.6, 300 sec: 43820.3). Total num frames: 2292940800. Throughput: 0: 43510.7. Samples: 2195919440. Policy #0 lag: (min: 1.0, avg: 9.7, max: 22.0) [2024-06-28 03:37:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:37:24,649][06909] Updated weights for policy 0, policy_version 139953 (0.0027) [2024-06-28 03:37:28,598][06909] Updated weights for policy 0, policy_version 139963 (0.0031) [2024-06-28 03:37:28,850][06674] Fps is (10 sec: 39323.7, 60 sec: 43417.5, 300 sec: 43875.8). Total num frames: 2293153792. Throughput: 0: 43628.8. Samples: 2196049740. Policy #0 lag: (min: 1.0, avg: 9.7, max: 22.0) [2024-06-28 03:37:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:37:31,915][06909] Updated weights for policy 0, policy_version 139973 (0.0040) [2024-06-28 03:37:33,850][06674] Fps is (10 sec: 47513.4, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2293415936. Throughput: 0: 43660.0. Samples: 2196311100. Policy #0 lag: (min: 1.0, avg: 9.7, max: 22.0) [2024-06-28 03:37:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:37:35,817][06909] Updated weights for policy 0, policy_version 139983 (0.0039) [2024-06-28 03:37:38,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43417.6, 300 sec: 43875.8). Total num frames: 2293596160. Throughput: 0: 43948.9. Samples: 2196583580. Policy #0 lag: (min: 1.0, avg: 9.7, max: 22.0) [2024-06-28 03:37:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:37:39,214][06909] Updated weights for policy 0, policy_version 139993 (0.0026) [2024-06-28 03:37:42,952][06909] Updated weights for policy 0, policy_version 140003 (0.0039) [2024-06-28 03:37:43,850][06674] Fps is (10 sec: 42598.7, 60 sec: 44236.8, 300 sec: 43931.4). Total num frames: 2293841920. Throughput: 0: 43826.7. Samples: 2196709640. Policy #0 lag: (min: 1.0, avg: 9.7, max: 22.0) [2024-06-28 03:37:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:37:46,754][06909] Updated weights for policy 0, policy_version 140013 (0.0032) [2024-06-28 03:37:48,850][06674] Fps is (10 sec: 49151.3, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2294087680. Throughput: 0: 44047.9. Samples: 2196985720. Policy #0 lag: (min: 1.0, avg: 9.7, max: 22.0) [2024-06-28 03:37:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:37:48,856][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000140020_2294087680.pth... [2024-06-28 03:37:48,904][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000139374_2283503616.pth [2024-06-28 03:37:50,405][06909] Updated weights for policy 0, policy_version 140023 (0.0039) [2024-06-28 03:37:53,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2294284288. Throughput: 0: 43974.2. Samples: 2197250620. Policy #0 lag: (min: 1.0, avg: 9.7, max: 22.0) [2024-06-28 03:37:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:37:54,058][06909] Updated weights for policy 0, policy_version 140033 (0.0029) [2024-06-28 03:37:58,475][06909] Updated weights for policy 0, policy_version 140043 (0.0032) [2024-06-28 03:37:58,850][06674] Fps is (10 sec: 37683.3, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 2294464512. Throughput: 0: 43871.1. Samples: 2197372500. Policy #0 lag: (min: 1.0, avg: 9.7, max: 22.0) [2024-06-28 03:37:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:38:01,753][06909] Updated weights for policy 0, policy_version 140053 (0.0040) [2024-06-28 03:38:03,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2294743040. Throughput: 0: 43889.9. Samples: 2197633960. Policy #0 lag: (min: 1.0, avg: 9.7, max: 22.0) [2024-06-28 03:38:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 03:38:05,779][06909] Updated weights for policy 0, policy_version 140063 (0.0025) [2024-06-28 03:38:08,850][06674] Fps is (10 sec: 47513.8, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2294939648. Throughput: 0: 44137.3. Samples: 2197905620. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2024-06-28 03:38:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:38:09,083][06909] Updated weights for policy 0, policy_version 140073 (0.0029) [2024-06-28 03:38:12,935][06909] Updated weights for policy 0, policy_version 140083 (0.0034) [2024-06-28 03:38:13,815][06887] Signal inference workers to stop experience collection... (31350 times) [2024-06-28 03:38:13,815][06887] Signal inference workers to resume experience collection... (31350 times) [2024-06-28 03:38:13,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43963.8, 300 sec: 43931.4). Total num frames: 2295152640. Throughput: 0: 44066.3. Samples: 2198032720. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2024-06-28 03:38:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:38:13,868][06909] InferenceWorker_p0-w0: stopping experience collection (31350 times) [2024-06-28 03:38:13,868][06909] InferenceWorker_p0-w0: resuming experience collection (31350 times) [2024-06-28 03:38:16,663][06909] Updated weights for policy 0, policy_version 140093 (0.0038) [2024-06-28 03:38:18,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43964.2, 300 sec: 44042.7). Total num frames: 2295398400. Throughput: 0: 44164.5. Samples: 2198298500. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2024-06-28 03:38:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:38:20,181][06909] Updated weights for policy 0, policy_version 140103 (0.0028) [2024-06-28 03:38:23,791][06909] Updated weights for policy 0, policy_version 140113 (0.0036) [2024-06-28 03:38:23,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44509.8, 300 sec: 43986.9). Total num frames: 2295611392. Throughput: 0: 44287.9. Samples: 2198576540. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2024-06-28 03:38:23,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 03:38:27,966][06909] Updated weights for policy 0, policy_version 140123 (0.0036) [2024-06-28 03:38:28,851][06674] Fps is (10 sec: 40954.3, 60 sec: 44235.8, 300 sec: 43986.7). Total num frames: 2295808000. Throughput: 0: 44272.8. Samples: 2198701980. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2024-06-28 03:38:28,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:38:31,232][06909] Updated weights for policy 0, policy_version 140133 (0.0040) [2024-06-28 03:38:33,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2296053760. Throughput: 0: 43868.0. Samples: 2198959780. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2024-06-28 03:38:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:38:35,469][06909] Updated weights for policy 0, policy_version 140143 (0.0023) [2024-06-28 03:38:38,716][06909] Updated weights for policy 0, policy_version 140153 (0.0030) [2024-06-28 03:38:38,850][06674] Fps is (10 sec: 45881.7, 60 sec: 44509.9, 300 sec: 43931.6). Total num frames: 2296266752. Throughput: 0: 43990.7. Samples: 2199230200. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2024-06-28 03:38:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:38:42,682][06909] Updated weights for policy 0, policy_version 140163 (0.0033) [2024-06-28 03:38:43,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2296496128. Throughput: 0: 44193.8. Samples: 2199361220. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2024-06-28 03:38:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:38:45,987][06909] Updated weights for policy 0, policy_version 140173 (0.0034) [2024-06-28 03:38:48,852][06674] Fps is (10 sec: 44227.5, 60 sec: 43689.2, 300 sec: 44042.1). Total num frames: 2296709120. Throughput: 0: 44254.4. Samples: 2199625500. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2024-06-28 03:38:48,852][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 03:38:49,917][06909] Updated weights for policy 0, policy_version 140183 (0.0037) [2024-06-28 03:38:53,510][06909] Updated weights for policy 0, policy_version 140193 (0.0027) [2024-06-28 03:38:53,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 2296922112. Throughput: 0: 44136.0. Samples: 2199891740. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2024-06-28 03:38:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:38:57,429][06909] Updated weights for policy 0, policy_version 140203 (0.0040) [2024-06-28 03:38:58,850][06674] Fps is (10 sec: 44245.2, 60 sec: 44782.9, 300 sec: 44042.7). Total num frames: 2297151488. Throughput: 0: 44194.9. Samples: 2200021500. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2024-06-28 03:38:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:39:01,004][06909] Updated weights for policy 0, policy_version 140213 (0.0025) [2024-06-28 03:39:03,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2297364480. Throughput: 0: 44109.4. Samples: 2200283420. Policy #0 lag: (min: 0.0, avg: 8.5, max: 20.0) [2024-06-28 03:39:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:39:05,011][06909] Updated weights for policy 0, policy_version 140223 (0.0028) [2024-06-28 03:39:08,359][06909] Updated weights for policy 0, policy_version 140233 (0.0035) [2024-06-28 03:39:08,850][06674] Fps is (10 sec: 45876.1, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 2297610240. Throughput: 0: 43861.4. Samples: 2200550300. Policy #0 lag: (min: 0.0, avg: 11.5, max: 25.0) [2024-06-28 03:39:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:39:12,289][06909] Updated weights for policy 0, policy_version 140243 (0.0040) [2024-06-28 03:39:13,850][06674] Fps is (10 sec: 44235.5, 60 sec: 44236.6, 300 sec: 43987.4). Total num frames: 2297806848. Throughput: 0: 44122.9. Samples: 2200687460. Policy #0 lag: (min: 0.0, avg: 11.5, max: 25.0) [2024-06-28 03:39:13,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:39:15,791][06909] Updated weights for policy 0, policy_version 140253 (0.0027) [2024-06-28 03:39:18,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2298019840. Throughput: 0: 44186.4. Samples: 2200948160. Policy #0 lag: (min: 0.0, avg: 11.5, max: 25.0) [2024-06-28 03:39:18,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 03:39:19,494][06909] Updated weights for policy 0, policy_version 140263 (0.0024) [2024-06-28 03:39:23,196][06909] Updated weights for policy 0, policy_version 140273 (0.0032) [2024-06-28 03:39:23,850][06674] Fps is (10 sec: 45876.1, 60 sec: 44236.8, 300 sec: 44042.7). Total num frames: 2298265600. Throughput: 0: 44043.9. Samples: 2201212180. Policy #0 lag: (min: 0.0, avg: 11.5, max: 25.0) [2024-06-28 03:39:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:39:27,108][06909] Updated weights for policy 0, policy_version 140283 (0.0026) [2024-06-28 03:39:28,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44237.8, 300 sec: 43986.9). Total num frames: 2298462208. Throughput: 0: 44298.3. Samples: 2201354640. Policy #0 lag: (min: 0.0, avg: 11.5, max: 25.0) [2024-06-28 03:39:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:39:30,629][06887] Signal inference workers to stop experience collection... (31400 times) [2024-06-28 03:39:30,629][06887] Signal inference workers to resume experience collection... (31400 times) [2024-06-28 03:39:30,657][06909] InferenceWorker_p0-w0: stopping experience collection (31400 times) [2024-06-28 03:39:30,658][06909] InferenceWorker_p0-w0: resuming experience collection (31400 times) [2024-06-28 03:39:30,794][06909] Updated weights for policy 0, policy_version 140293 (0.0030) [2024-06-28 03:39:33,852][06674] Fps is (10 sec: 40951.8, 60 sec: 43689.2, 300 sec: 44042.1). Total num frames: 2298675200. Throughput: 0: 43940.4. Samples: 2201602820. Policy #0 lag: (min: 0.0, avg: 11.5, max: 25.0) [2024-06-28 03:39:33,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:39:34,570][06909] Updated weights for policy 0, policy_version 140303 (0.0029) [2024-06-28 03:39:38,143][06909] Updated weights for policy 0, policy_version 140313 (0.0027) [2024-06-28 03:39:38,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2298920960. Throughput: 0: 44076.1. Samples: 2201875160. Policy #0 lag: (min: 0.0, avg: 11.5, max: 25.0) [2024-06-28 03:39:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:39:41,959][06909] Updated weights for policy 0, policy_version 140323 (0.0033) [2024-06-28 03:39:43,850][06674] Fps is (10 sec: 45884.7, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 2299133952. Throughput: 0: 44401.5. Samples: 2202019560. Policy #0 lag: (min: 0.0, avg: 11.5, max: 25.0) [2024-06-28 03:39:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:39:45,478][06909] Updated weights for policy 0, policy_version 140333 (0.0024) [2024-06-28 03:39:48,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43692.2, 300 sec: 43987.2). Total num frames: 2299330560. Throughput: 0: 44081.3. Samples: 2202267080. Policy #0 lag: (min: 0.0, avg: 11.5, max: 25.0) [2024-06-28 03:39:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:39:48,901][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000140341_2299346944.pth... [2024-06-28 03:39:48,958][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000139698_2288812032.pth [2024-06-28 03:39:49,347][06909] Updated weights for policy 0, policy_version 140343 (0.0022) [2024-06-28 03:39:52,767][06909] Updated weights for policy 0, policy_version 140353 (0.0048) [2024-06-28 03:39:53,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2299576320. Throughput: 0: 44163.5. Samples: 2202537660. Policy #0 lag: (min: 0.0, avg: 11.5, max: 25.0) [2024-06-28 03:39:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:39:56,695][06909] Updated weights for policy 0, policy_version 140363 (0.0038) [2024-06-28 03:39:58,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2299789312. Throughput: 0: 44282.4. Samples: 2202680160. Policy #0 lag: (min: 0.0, avg: 11.5, max: 25.0) [2024-06-28 03:39:58,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:40:00,246][06909] Updated weights for policy 0, policy_version 140373 (0.0029) [2024-06-28 03:40:03,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2300002304. Throughput: 0: 44239.5. Samples: 2202938940. Policy #0 lag: (min: 0.0, avg: 11.5, max: 25.0) [2024-06-28 03:40:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:40:04,204][06909] Updated weights for policy 0, policy_version 140383 (0.0037) [2024-06-28 03:40:07,921][06909] Updated weights for policy 0, policy_version 140393 (0.0030) [2024-06-28 03:40:08,850][06674] Fps is (10 sec: 45875.9, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2300248064. Throughput: 0: 44171.7. Samples: 2203199900. Policy #0 lag: (min: 0.0, avg: 11.5, max: 25.0) [2024-06-28 03:40:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 03:40:11,566][06909] Updated weights for policy 0, policy_version 140403 (0.0037) [2024-06-28 03:40:13,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.9, 300 sec: 43931.4). Total num frames: 2300444672. Throughput: 0: 44074.2. Samples: 2203337980. Policy #0 lag: (min: 1.0, avg: 9.6, max: 21.0) [2024-06-28 03:40:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:40:15,248][06909] Updated weights for policy 0, policy_version 140413 (0.0034) [2024-06-28 03:40:18,850][06674] Fps is (10 sec: 39321.7, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2300641280. Throughput: 0: 44223.4. Samples: 2203592780. Policy #0 lag: (min: 1.0, avg: 9.6, max: 21.0) [2024-06-28 03:40:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:40:19,213][06909] Updated weights for policy 0, policy_version 140423 (0.0031) [2024-06-28 03:40:22,831][06909] Updated weights for policy 0, policy_version 140433 (0.0038) [2024-06-28 03:40:23,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 2300887040. Throughput: 0: 44071.4. Samples: 2203858380. Policy #0 lag: (min: 1.0, avg: 9.6, max: 21.0) [2024-06-28 03:40:23,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:40:26,398][06909] Updated weights for policy 0, policy_version 140443 (0.0026) [2024-06-28 03:40:28,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43963.7, 300 sec: 43931.7). Total num frames: 2301100032. Throughput: 0: 43951.5. Samples: 2203997380. Policy #0 lag: (min: 1.0, avg: 9.6, max: 21.0) [2024-06-28 03:40:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:40:30,073][06909] Updated weights for policy 0, policy_version 140453 (0.0031) [2024-06-28 03:40:31,989][06887] Signal inference workers to stop experience collection... (31450 times) [2024-06-28 03:40:31,991][06887] Signal inference workers to resume experience collection... (31450 times) [2024-06-28 03:40:32,037][06909] InferenceWorker_p0-w0: stopping experience collection (31450 times) [2024-06-28 03:40:32,038][06909] InferenceWorker_p0-w0: resuming experience collection (31450 times) [2024-06-28 03:40:33,850][06674] Fps is (10 sec: 44237.5, 60 sec: 44238.3, 300 sec: 43986.9). Total num frames: 2301329408. Throughput: 0: 44237.8. Samples: 2204257780. Policy #0 lag: (min: 1.0, avg: 9.6, max: 21.0) [2024-06-28 03:40:33,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 03:40:33,879][06909] Updated weights for policy 0, policy_version 140463 (0.0042) [2024-06-28 03:40:37,246][06909] Updated weights for policy 0, policy_version 140473 (0.0029) [2024-06-28 03:40:38,850][06674] Fps is (10 sec: 47513.2, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2301575168. Throughput: 0: 44184.4. Samples: 2204525960. Policy #0 lag: (min: 1.0, avg: 9.6, max: 21.0) [2024-06-28 03:40:38,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:40:41,321][06909] Updated weights for policy 0, policy_version 140483 (0.0046) [2024-06-28 03:40:43,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 2301771776. Throughput: 0: 43957.8. Samples: 2204658260. Policy #0 lag: (min: 1.0, avg: 9.6, max: 21.0) [2024-06-28 03:40:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:40:44,954][06909] Updated weights for policy 0, policy_version 140493 (0.0032) [2024-06-28 03:40:48,856][06674] Fps is (10 sec: 40935.5, 60 sec: 44232.3, 300 sec: 44041.5). Total num frames: 2301984768. Throughput: 0: 44085.1. Samples: 2204923040. Policy #0 lag: (min: 1.0, avg: 9.6, max: 21.0) [2024-06-28 03:40:48,857][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:40:48,984][06909] Updated weights for policy 0, policy_version 140503 (0.0030) [2024-06-28 03:40:52,381][06909] Updated weights for policy 0, policy_version 140513 (0.0028) [2024-06-28 03:40:53,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2302230528. Throughput: 0: 44054.6. Samples: 2205182360. Policy #0 lag: (min: 1.0, avg: 9.6, max: 21.0) [2024-06-28 03:40:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:40:56,402][06909] Updated weights for policy 0, policy_version 140523 (0.0029) [2024-06-28 03:40:58,852][06674] Fps is (10 sec: 44254.6, 60 sec: 43962.3, 300 sec: 43931.2). Total num frames: 2302427136. Throughput: 0: 43990.4. Samples: 2205317640. Policy #0 lag: (min: 1.0, avg: 9.6, max: 21.0) [2024-06-28 03:40:58,853][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:41:00,014][06909] Updated weights for policy 0, policy_version 140533 (0.0037) [2024-06-28 03:41:03,785][06909] Updated weights for policy 0, policy_version 140543 (0.0031) [2024-06-28 03:41:03,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2302656512. Throughput: 0: 44182.2. Samples: 2205580980. Policy #0 lag: (min: 1.0, avg: 9.6, max: 21.0) [2024-06-28 03:41:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:41:07,299][06909] Updated weights for policy 0, policy_version 140553 (0.0027) [2024-06-28 03:41:08,857][06674] Fps is (10 sec: 45851.6, 60 sec: 43958.4, 300 sec: 44152.4). Total num frames: 2302885888. Throughput: 0: 44097.0. Samples: 2205843060. Policy #0 lag: (min: 1.0, avg: 9.6, max: 21.0) [2024-06-28 03:41:08,857][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:41:11,227][06909] Updated weights for policy 0, policy_version 140563 (0.0026) [2024-06-28 03:41:13,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2303098880. Throughput: 0: 43982.3. Samples: 2205976580. Policy #0 lag: (min: 1.0, avg: 9.6, max: 21.0) [2024-06-28 03:41:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:41:14,573][06909] Updated weights for policy 0, policy_version 140573 (0.0038) [2024-06-28 03:41:18,473][06909] Updated weights for policy 0, policy_version 140583 (0.0040) [2024-06-28 03:41:18,850][06674] Fps is (10 sec: 44268.0, 60 sec: 44782.7, 300 sec: 44097.9). Total num frames: 2303328256. Throughput: 0: 44147.8. Samples: 2206244440. Policy #0 lag: (min: 1.0, avg: 10.5, max: 20.0) [2024-06-28 03:41:18,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:41:22,172][06909] Updated weights for policy 0, policy_version 140593 (0.0032) [2024-06-28 03:41:23,850][06674] Fps is (10 sec: 45874.6, 60 sec: 44509.9, 300 sec: 44097.9). Total num frames: 2303557632. Throughput: 0: 43956.9. Samples: 2206504020. Policy #0 lag: (min: 1.0, avg: 10.5, max: 20.0) [2024-06-28 03:41:23,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 03:41:26,021][06909] Updated weights for policy 0, policy_version 140603 (0.0024) [2024-06-28 03:41:28,850][06674] Fps is (10 sec: 42599.5, 60 sec: 44236.9, 300 sec: 43931.3). Total num frames: 2303754240. Throughput: 0: 44087.7. Samples: 2206642200. Policy #0 lag: (min: 1.0, avg: 10.5, max: 20.0) [2024-06-28 03:41:28,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:41:29,541][06909] Updated weights for policy 0, policy_version 140613 (0.0029) [2024-06-28 03:41:33,516][06909] Updated weights for policy 0, policy_version 140623 (0.0038) [2024-06-28 03:41:33,850][06674] Fps is (10 sec: 44237.7, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 2304000000. Throughput: 0: 44165.6. Samples: 2206910220. Policy #0 lag: (min: 1.0, avg: 10.5, max: 20.0) [2024-06-28 03:41:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:41:35,748][06887] Signal inference workers to stop experience collection... (31500 times) [2024-06-28 03:41:35,752][06887] Signal inference workers to resume experience collection... (31500 times) [2024-06-28 03:41:35,760][06909] InferenceWorker_p0-w0: stopping experience collection (31500 times) [2024-06-28 03:41:35,791][06909] InferenceWorker_p0-w0: resuming experience collection (31500 times) [2024-06-28 03:41:36,858][06909] Updated weights for policy 0, policy_version 140633 (0.0031) [2024-06-28 03:41:38,852][06674] Fps is (10 sec: 45865.5, 60 sec: 43962.3, 300 sec: 44153.2). Total num frames: 2304212992. Throughput: 0: 44117.1. Samples: 2207167720. Policy #0 lag: (min: 1.0, avg: 10.5, max: 20.0) [2024-06-28 03:41:38,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:41:40,860][06909] Updated weights for policy 0, policy_version 140643 (0.0032) [2024-06-28 03:41:43,850][06674] Fps is (10 sec: 42597.7, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2304425984. Throughput: 0: 43953.0. Samples: 2207295440. Policy #0 lag: (min: 1.0, avg: 10.5, max: 20.0) [2024-06-28 03:41:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:41:44,284][06909] Updated weights for policy 0, policy_version 140653 (0.0039) [2024-06-28 03:41:48,518][06909] Updated weights for policy 0, policy_version 140663 (0.0033) [2024-06-28 03:41:48,850][06674] Fps is (10 sec: 42607.2, 60 sec: 44241.3, 300 sec: 44097.9). Total num frames: 2304638976. Throughput: 0: 44110.2. Samples: 2207565940. Policy #0 lag: (min: 1.0, avg: 10.5, max: 20.0) [2024-06-28 03:41:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:41:48,951][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000140665_2304655360.pth... [2024-06-28 03:41:49,002][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000140020_2294087680.pth [2024-06-28 03:41:52,008][06909] Updated weights for policy 0, policy_version 140673 (0.0034) [2024-06-28 03:41:53,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.6, 300 sec: 44153.5). Total num frames: 2304868352. Throughput: 0: 44085.6. Samples: 2207826600. Policy #0 lag: (min: 1.0, avg: 10.5, max: 20.0) [2024-06-28 03:41:53,851][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 03:41:55,989][06909] Updated weights for policy 0, policy_version 140683 (0.0039) [2024-06-28 03:41:58,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43965.3, 300 sec: 43986.9). Total num frames: 2305064960. Throughput: 0: 44106.8. Samples: 2207961380. Policy #0 lag: (min: 1.0, avg: 10.5, max: 20.0) [2024-06-28 03:41:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:41:59,245][06909] Updated weights for policy 0, policy_version 140693 (0.0036) [2024-06-28 03:42:03,352][06909] Updated weights for policy 0, policy_version 140703 (0.0037) [2024-06-28 03:42:03,850][06674] Fps is (10 sec: 42599.2, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2305294336. Throughput: 0: 44007.3. Samples: 2208224760. Policy #0 lag: (min: 1.0, avg: 10.5, max: 20.0) [2024-06-28 03:42:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:42:06,947][06909] Updated weights for policy 0, policy_version 140713 (0.0027) [2024-06-28 03:42:08,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43969.1, 300 sec: 44098.0). Total num frames: 2305523712. Throughput: 0: 43955.7. Samples: 2208482020. Policy #0 lag: (min: 1.0, avg: 10.5, max: 20.0) [2024-06-28 03:42:08,850][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 03:42:10,705][06909] Updated weights for policy 0, policy_version 140723 (0.0023) [2024-06-28 03:42:13,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44236.8, 300 sec: 44042.5). Total num frames: 2305753088. Throughput: 0: 43834.1. Samples: 2208614740. Policy #0 lag: (min: 1.0, avg: 10.5, max: 20.0) [2024-06-28 03:42:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:42:14,256][06909] Updated weights for policy 0, policy_version 140733 (0.0027) [2024-06-28 03:42:18,198][06909] Updated weights for policy 0, policy_version 140743 (0.0040) [2024-06-28 03:42:18,850][06674] Fps is (10 sec: 44235.8, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2305966080. Throughput: 0: 43897.5. Samples: 2208885620. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 03:42:18,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:42:21,529][06909] Updated weights for policy 0, policy_version 140753 (0.0024) [2024-06-28 03:42:23,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 2306179072. Throughput: 0: 43863.7. Samples: 2209141500. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 03:42:23,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 03:42:25,756][06909] Updated weights for policy 0, policy_version 140763 (0.0035) [2024-06-28 03:42:28,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 2306392064. Throughput: 0: 43892.5. Samples: 2209270600. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 03:42:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:42:29,159][06909] Updated weights for policy 0, policy_version 140773 (0.0031) [2024-06-28 03:42:32,948][06909] Updated weights for policy 0, policy_version 140783 (0.0032) [2024-06-28 03:42:33,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43417.6, 300 sec: 44098.0). Total num frames: 2306605056. Throughput: 0: 43901.8. Samples: 2209541520. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 03:42:33,850][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 03:42:36,706][06909] Updated weights for policy 0, policy_version 140793 (0.0031) [2024-06-28 03:42:38,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43692.2, 300 sec: 44042.4). Total num frames: 2306834432. Throughput: 0: 43828.6. Samples: 2209798880. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 03:42:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:42:40,599][06909] Updated weights for policy 0, policy_version 140803 (0.0036) [2024-06-28 03:42:43,850][06674] Fps is (10 sec: 45874.5, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2307063808. Throughput: 0: 43767.4. Samples: 2209930920. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 03:42:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:42:44,165][06909] Updated weights for policy 0, policy_version 140813 (0.0031) [2024-06-28 03:42:48,010][06909] Updated weights for policy 0, policy_version 140823 (0.0020) [2024-06-28 03:42:48,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2307276800. Throughput: 0: 43989.2. Samples: 2210204280. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 03:42:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:42:51,345][06909] Updated weights for policy 0, policy_version 140833 (0.0031) [2024-06-28 03:42:53,853][06674] Fps is (10 sec: 42585.7, 60 sec: 43688.5, 300 sec: 44153.0). Total num frames: 2307489792. Throughput: 0: 44072.5. Samples: 2210465420. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 03:42:53,853][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:42:55,303][06887] Signal inference workers to stop experience collection... (31550 times) [2024-06-28 03:42:55,339][06909] InferenceWorker_p0-w0: stopping experience collection (31550 times) [2024-06-28 03:42:55,417][06887] Signal inference workers to resume experience collection... (31550 times) [2024-06-28 03:42:55,417][06909] InferenceWorker_p0-w0: resuming experience collection (31550 times) [2024-06-28 03:42:55,418][06909] Updated weights for policy 0, policy_version 140843 (0.0035) [2024-06-28 03:42:58,850][06674] Fps is (10 sec: 44237.3, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2307719168. Throughput: 0: 44113.0. Samples: 2210599820. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 03:42:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:42:58,933][06909] Updated weights for policy 0, policy_version 140853 (0.0043) [2024-06-28 03:43:02,968][06909] Updated weights for policy 0, policy_version 140863 (0.0023) [2024-06-28 03:43:03,850][06674] Fps is (10 sec: 44249.3, 60 sec: 43963.5, 300 sec: 44042.4). Total num frames: 2307932160. Throughput: 0: 43981.7. Samples: 2210864800. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 03:43:03,851][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 03:43:06,162][06909] Updated weights for policy 0, policy_version 140873 (0.0033) [2024-06-28 03:43:08,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2308145152. Throughput: 0: 44188.5. Samples: 2211129980. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 03:43:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:43:10,269][06909] Updated weights for policy 0, policy_version 140883 (0.0034) [2024-06-28 03:43:13,741][06909] Updated weights for policy 0, policy_version 140893 (0.0037) [2024-06-28 03:43:13,850][06674] Fps is (10 sec: 45876.3, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2308390912. Throughput: 0: 44176.9. Samples: 2211258560. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 03:43:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:43:17,432][06909] Updated weights for policy 0, policy_version 140903 (0.0033) [2024-06-28 03:43:18,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44236.9, 300 sec: 44097.9). Total num frames: 2308620288. Throughput: 0: 44071.9. Samples: 2211524760. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 03:43:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 03:43:21,210][06909] Updated weights for policy 0, policy_version 140913 (0.0021) [2024-06-28 03:43:23,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43690.8, 300 sec: 44042.6). Total num frames: 2308800512. Throughput: 0: 44320.5. Samples: 2211793300. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2024-06-28 03:43:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:43:24,618][06909] Updated weights for policy 0, policy_version 140923 (0.0023) [2024-06-28 03:43:28,483][06909] Updated weights for policy 0, policy_version 140933 (0.0032) [2024-06-28 03:43:28,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 2309062656. Throughput: 0: 44385.4. Samples: 2211928260. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2024-06-28 03:43:28,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 03:43:32,345][06909] Updated weights for policy 0, policy_version 140943 (0.0041) [2024-06-28 03:43:33,852][06674] Fps is (10 sec: 47503.8, 60 sec: 44508.3, 300 sec: 44097.6). Total num frames: 2309275648. Throughput: 0: 44123.8. Samples: 2212189940. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2024-06-28 03:43:33,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:43:35,912][06909] Updated weights for policy 0, policy_version 140953 (0.0039) [2024-06-28 03:43:38,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2309472256. Throughput: 0: 44275.0. Samples: 2212457660. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2024-06-28 03:43:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:43:39,883][06909] Updated weights for policy 0, policy_version 140963 (0.0026) [2024-06-28 03:43:43,154][06909] Updated weights for policy 0, policy_version 140973 (0.0027) [2024-06-28 03:43:43,852][06674] Fps is (10 sec: 45875.2, 60 sec: 44508.4, 300 sec: 44153.5). Total num frames: 2309734400. Throughput: 0: 44239.7. Samples: 2212590700. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2024-06-28 03:43:43,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:43:47,135][06909] Updated weights for policy 0, policy_version 140983 (0.0025) [2024-06-28 03:43:48,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2309931008. Throughput: 0: 44383.3. Samples: 2212862040. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2024-06-28 03:43:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:43:49,024][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000140988_2309947392.pth... [2024-06-28 03:43:49,075][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000140341_2299346944.pth [2024-06-28 03:43:50,886][06909] Updated weights for policy 0, policy_version 140993 (0.0040) [2024-06-28 03:43:53,852][06674] Fps is (10 sec: 40959.8, 60 sec: 44237.5, 300 sec: 44042.1). Total num frames: 2310144000. Throughput: 0: 44331.8. Samples: 2213125000. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2024-06-28 03:43:53,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:43:54,641][06909] Updated weights for policy 0, policy_version 141003 (0.0022) [2024-06-28 03:43:58,279][06909] Updated weights for policy 0, policy_version 141013 (0.0033) [2024-06-28 03:43:58,850][06674] Fps is (10 sec: 44237.3, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2310373376. Throughput: 0: 44373.8. Samples: 2213255380. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2024-06-28 03:43:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:44:01,961][06909] Updated weights for policy 0, policy_version 141023 (0.0030) [2024-06-28 03:44:03,850][06674] Fps is (10 sec: 45884.3, 60 sec: 44510.0, 300 sec: 44042.4). Total num frames: 2310602752. Throughput: 0: 44384.4. Samples: 2213522060. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2024-06-28 03:44:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 03:44:05,480][06909] Updated weights for policy 0, policy_version 141033 (0.0026) [2024-06-28 03:44:08,850][06674] Fps is (10 sec: 42598.0, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2310799360. Throughput: 0: 44245.7. Samples: 2213784360. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2024-06-28 03:44:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:44:09,903][06909] Updated weights for policy 0, policy_version 141043 (0.0025) [2024-06-28 03:44:12,911][06909] Updated weights for policy 0, policy_version 141053 (0.0024) [2024-06-28 03:44:13,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2311045120. Throughput: 0: 44190.7. Samples: 2213916840. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2024-06-28 03:44:13,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:44:17,108][06909] Updated weights for policy 0, policy_version 141063 (0.0033) [2024-06-28 03:44:18,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2311258112. Throughput: 0: 44296.6. Samples: 2214183200. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2024-06-28 03:44:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:44:19,711][06887] Signal inference workers to stop experience collection... (31600 times) [2024-06-28 03:44:19,712][06887] Signal inference workers to resume experience collection... (31600 times) [2024-06-28 03:44:19,739][06909] InferenceWorker_p0-w0: stopping experience collection (31600 times) [2024-06-28 03:44:19,739][06909] InferenceWorker_p0-w0: resuming experience collection (31600 times) [2024-06-28 03:44:20,051][06909] Updated weights for policy 0, policy_version 141073 (0.0026) [2024-06-28 03:44:23,850][06674] Fps is (10 sec: 40960.0, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2311454720. Throughput: 0: 44199.6. Samples: 2214446640. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2024-06-28 03:44:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:44:24,335][06909] Updated weights for policy 0, policy_version 141083 (0.0029) [2024-06-28 03:44:27,877][06909] Updated weights for policy 0, policy_version 141093 (0.0041) [2024-06-28 03:44:28,850][06674] Fps is (10 sec: 45875.7, 60 sec: 44236.9, 300 sec: 44209.3). Total num frames: 2311716864. Throughput: 0: 44194.5. Samples: 2214579360. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 03:44:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:44:32,047][06909] Updated weights for policy 0, policy_version 141103 (0.0025) [2024-06-28 03:44:33,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43965.3, 300 sec: 44042.4). Total num frames: 2311913472. Throughput: 0: 43939.3. Samples: 2214839300. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 03:44:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:44:35,410][06909] Updated weights for policy 0, policy_version 141113 (0.0034) [2024-06-28 03:44:38,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 2312142848. Throughput: 0: 44093.2. Samples: 2215109100. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 03:44:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:44:39,269][06909] Updated weights for policy 0, policy_version 141123 (0.0033) [2024-06-28 03:44:42,935][06909] Updated weights for policy 0, policy_version 141133 (0.0033) [2024-06-28 03:44:43,850][06674] Fps is (10 sec: 47512.5, 60 sec: 44238.2, 300 sec: 44264.5). Total num frames: 2312388608. Throughput: 0: 44095.8. Samples: 2215239700. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 03:44:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:44:47,111][06909] Updated weights for policy 0, policy_version 141143 (0.0028) [2024-06-28 03:44:48,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2312585216. Throughput: 0: 44021.8. Samples: 2215503040. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 03:44:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:44:50,285][06909] Updated weights for policy 0, policy_version 141153 (0.0027) [2024-06-28 03:44:53,850][06674] Fps is (10 sec: 40960.6, 60 sec: 44238.3, 300 sec: 44098.0). Total num frames: 2312798208. Throughput: 0: 44061.8. Samples: 2215767140. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 03:44:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:44:54,286][06909] Updated weights for policy 0, policy_version 141163 (0.0034) [2024-06-28 03:44:57,613][06909] Updated weights for policy 0, policy_version 141173 (0.0036) [2024-06-28 03:44:58,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 2313027584. Throughput: 0: 44097.3. Samples: 2215901220. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 03:44:58,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:45:01,534][06909] Updated weights for policy 0, policy_version 141183 (0.0029) [2024-06-28 03:45:03,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2313256960. Throughput: 0: 44034.3. Samples: 2216164740. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 03:45:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:45:05,232][06909] Updated weights for policy 0, policy_version 141193 (0.0031) [2024-06-28 03:45:08,795][06909] Updated weights for policy 0, policy_version 141203 (0.0037) [2024-06-28 03:45:08,856][06674] Fps is (10 sec: 44210.2, 60 sec: 44505.4, 300 sec: 44152.6). Total num frames: 2313469952. Throughput: 0: 43906.1. Samples: 2216422680. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 03:45:08,857][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:45:12,712][06909] Updated weights for policy 0, policy_version 141213 (0.0023) [2024-06-28 03:45:13,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.8, 300 sec: 44264.6). Total num frames: 2313699328. Throughput: 0: 43892.8. Samples: 2216554540. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 03:45:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 03:45:16,649][06909] Updated weights for policy 0, policy_version 141223 (0.0026) [2024-06-28 03:45:18,850][06674] Fps is (10 sec: 44263.9, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 2313912320. Throughput: 0: 44267.5. Samples: 2216831340. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 03:45:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:45:19,906][06909] Updated weights for policy 0, policy_version 141233 (0.0027) [2024-06-28 03:45:23,850][06674] Fps is (10 sec: 40960.2, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2314108928. Throughput: 0: 43817.3. Samples: 2217080880. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 03:45:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:45:24,490][06909] Updated weights for policy 0, policy_version 141243 (0.0027) [2024-06-28 03:45:27,609][06909] Updated weights for policy 0, policy_version 141253 (0.0040) [2024-06-28 03:45:28,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43690.6, 300 sec: 44097.9). Total num frames: 2314338304. Throughput: 0: 43778.8. Samples: 2217209740. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 03:45:28,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:45:31,862][06909] Updated weights for policy 0, policy_version 141263 (0.0029) [2024-06-28 03:45:33,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2314551296. Throughput: 0: 43811.7. Samples: 2217474560. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 03:45:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:45:35,260][06909] Updated weights for policy 0, policy_version 141273 (0.0027) [2024-06-28 03:45:38,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2314764288. Throughput: 0: 43834.2. Samples: 2217739680. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 03:45:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:45:39,113][06909] Updated weights for policy 0, policy_version 141283 (0.0028) [2024-06-28 03:45:42,445][06909] Updated weights for policy 0, policy_version 141293 (0.0032) [2024-06-28 03:45:43,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43690.8, 300 sec: 44154.4). Total num frames: 2315010048. Throughput: 0: 43718.8. Samples: 2217868560. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 03:45:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:45:46,788][06909] Updated weights for policy 0, policy_version 141303 (0.0027) [2024-06-28 03:45:48,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2315223040. Throughput: 0: 43780.4. Samples: 2218134860. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 03:45:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:45:48,857][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000141310_2315223040.pth... [2024-06-28 03:45:48,927][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000140665_2304655360.pth [2024-06-28 03:45:50,071][06909] Updated weights for policy 0, policy_version 141313 (0.0036) [2024-06-28 03:45:53,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43690.6, 300 sec: 44042.7). Total num frames: 2315419648. Throughput: 0: 43933.4. Samples: 2218399420. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 03:45:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:45:54,006][06909] Updated weights for policy 0, policy_version 141323 (0.0025) [2024-06-28 03:45:57,297][06909] Updated weights for policy 0, policy_version 141333 (0.0027) [2024-06-28 03:45:58,214][06887] Signal inference workers to stop experience collection... (31650 times) [2024-06-28 03:45:58,262][06909] InferenceWorker_p0-w0: stopping experience collection (31650 times) [2024-06-28 03:45:58,270][06887] Signal inference workers to resume experience collection... (31650 times) [2024-06-28 03:45:58,283][06909] InferenceWorker_p0-w0: resuming experience collection (31650 times) [2024-06-28 03:45:58,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2315665408. Throughput: 0: 43944.4. Samples: 2218532040. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 03:45:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:46:01,577][06909] Updated weights for policy 0, policy_version 141343 (0.0035) [2024-06-28 03:46:03,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43690.7, 300 sec: 44043.5). Total num frames: 2315878400. Throughput: 0: 43660.4. Samples: 2218796060. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 03:46:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 03:46:04,677][06909] Updated weights for policy 0, policy_version 141353 (0.0033) [2024-06-28 03:46:08,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43422.0, 300 sec: 43986.9). Total num frames: 2316075008. Throughput: 0: 43965.3. Samples: 2219059320. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 03:46:08,850][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 03:46:09,066][06909] Updated weights for policy 0, policy_version 141363 (0.0039) [2024-06-28 03:46:12,255][06909] Updated weights for policy 0, policy_version 141373 (0.0036) [2024-06-28 03:46:13,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2316320768. Throughput: 0: 44059.6. Samples: 2219192420. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 03:46:13,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:46:16,314][06909] Updated weights for policy 0, policy_version 141383 (0.0041) [2024-06-28 03:46:18,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43417.6, 300 sec: 43931.4). Total num frames: 2316517376. Throughput: 0: 44030.6. Samples: 2219455940. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 03:46:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:46:19,587][06909] Updated weights for policy 0, policy_version 141393 (0.0029) [2024-06-28 03:46:23,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.6, 300 sec: 43986.8). Total num frames: 2316730368. Throughput: 0: 43915.5. Samples: 2219715880. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 03:46:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:46:24,135][06909] Updated weights for policy 0, policy_version 141403 (0.0038) [2024-06-28 03:46:26,996][06909] Updated weights for policy 0, policy_version 141413 (0.0036) [2024-06-28 03:46:28,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2316976128. Throughput: 0: 43982.2. Samples: 2219847760. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 03:46:28,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:46:31,498][06909] Updated weights for policy 0, policy_version 141423 (0.0034) [2024-06-28 03:46:33,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.6, 300 sec: 43987.2). Total num frames: 2317189120. Throughput: 0: 44025.8. Samples: 2220116020. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-28 03:46:33,853][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:46:34,726][06909] Updated weights for policy 0, policy_version 141433 (0.0045) [2024-06-28 03:46:38,783][06909] Updated weights for policy 0, policy_version 141443 (0.0024) [2024-06-28 03:46:38,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 2317402112. Throughput: 0: 44208.8. Samples: 2220388820. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-28 03:46:38,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 03:46:42,105][06909] Updated weights for policy 0, policy_version 141453 (0.0044) [2024-06-28 03:46:43,850][06674] Fps is (10 sec: 47513.8, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2317664256. Throughput: 0: 44122.3. Samples: 2220517540. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-28 03:46:43,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 03:46:46,216][06909] Updated weights for policy 0, policy_version 141463 (0.0034) [2024-06-28 03:46:48,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2317860864. Throughput: 0: 44086.2. Samples: 2220779940. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-28 03:46:48,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:46:49,309][06909] Updated weights for policy 0, policy_version 141473 (0.0038) [2024-06-28 03:46:53,674][06909] Updated weights for policy 0, policy_version 141483 (0.0029) [2024-06-28 03:46:53,850][06674] Fps is (10 sec: 39321.9, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2318057472. Throughput: 0: 44198.3. Samples: 2221048240. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-28 03:46:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:46:56,862][06909] Updated weights for policy 0, policy_version 141493 (0.0029) [2024-06-28 03:46:58,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2318303232. Throughput: 0: 44043.1. Samples: 2221174360. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-28 03:46:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:47:00,941][06909] Updated weights for policy 0, policy_version 141503 (0.0027) [2024-06-28 03:47:03,852][06674] Fps is (10 sec: 45865.7, 60 sec: 43962.3, 300 sec: 44042.1). Total num frames: 2318516224. Throughput: 0: 44035.3. Samples: 2221437620. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-28 03:47:03,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:47:04,291][06909] Updated weights for policy 0, policy_version 141513 (0.0032) [2024-06-28 03:47:08,558][06909] Updated weights for policy 0, policy_version 141523 (0.0023) [2024-06-28 03:47:08,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2318729216. Throughput: 0: 44327.2. Samples: 2221710600. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-28 03:47:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:47:11,936][06909] Updated weights for policy 0, policy_version 141533 (0.0036) [2024-06-28 03:47:13,850][06674] Fps is (10 sec: 47523.0, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 2318991360. Throughput: 0: 44208.0. Samples: 2221837120. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-28 03:47:13,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:47:15,696][06909] Updated weights for policy 0, policy_version 141543 (0.0034) [2024-06-28 03:47:18,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44509.8, 300 sec: 44098.0). Total num frames: 2319187968. Throughput: 0: 44208.0. Samples: 2222105380. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-28 03:47:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:47:19,247][06909] Updated weights for policy 0, policy_version 141553 (0.0033) [2024-06-28 03:47:20,349][06887] Signal inference workers to stop experience collection... (31700 times) [2024-06-28 03:47:20,349][06887] Signal inference workers to resume experience collection... (31700 times) [2024-06-28 03:47:20,395][06909] InferenceWorker_p0-w0: stopping experience collection (31700 times) [2024-06-28 03:47:20,395][06909] InferenceWorker_p0-w0: resuming experience collection (31700 times) [2024-06-28 03:47:23,322][06909] Updated weights for policy 0, policy_version 141563 (0.0030) [2024-06-28 03:47:23,850][06674] Fps is (10 sec: 40960.3, 60 sec: 44510.0, 300 sec: 44098.0). Total num frames: 2319400960. Throughput: 0: 44014.0. Samples: 2222369440. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-28 03:47:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:47:26,458][06909] Updated weights for policy 0, policy_version 141573 (0.0021) [2024-06-28 03:47:28,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2319613952. Throughput: 0: 44009.8. Samples: 2222497980. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-28 03:47:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 03:47:30,978][06909] Updated weights for policy 0, policy_version 141583 (0.0041) [2024-06-28 03:47:33,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2319843328. Throughput: 0: 43932.0. Samples: 2222756880. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 03:47:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:47:34,095][06909] Updated weights for policy 0, policy_version 141593 (0.0038) [2024-06-28 03:47:38,348][06909] Updated weights for policy 0, policy_version 141603 (0.0028) [2024-06-28 03:47:38,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2320039936. Throughput: 0: 43895.5. Samples: 2223023540. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 03:47:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:47:41,594][06909] Updated weights for policy 0, policy_version 141613 (0.0025) [2024-06-28 03:47:43,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 2320285696. Throughput: 0: 43956.1. Samples: 2223152380. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 03:47:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:47:46,030][06909] Updated weights for policy 0, policy_version 141623 (0.0022) [2024-06-28 03:47:48,852][06674] Fps is (10 sec: 44227.7, 60 sec: 43689.3, 300 sec: 44042.6). Total num frames: 2320482304. Throughput: 0: 43923.5. Samples: 2223414180. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 03:47:48,853][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:47:48,862][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000141631_2320482304.pth... [2024-06-28 03:47:48,931][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000140988_2309947392.pth [2024-06-28 03:47:49,306][06909] Updated weights for policy 0, policy_version 141633 (0.0027) [2024-06-28 03:47:53,410][06909] Updated weights for policy 0, policy_version 141643 (0.0037) [2024-06-28 03:47:53,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2320711680. Throughput: 0: 43841.8. Samples: 2223683480. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 03:47:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:47:56,595][06909] Updated weights for policy 0, policy_version 141653 (0.0026) [2024-06-28 03:47:58,850][06674] Fps is (10 sec: 44246.0, 60 sec: 43690.8, 300 sec: 44042.5). Total num frames: 2320924672. Throughput: 0: 43873.4. Samples: 2223811420. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 03:47:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:48:00,798][06909] Updated weights for policy 0, policy_version 141663 (0.0036) [2024-06-28 03:48:03,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43965.2, 300 sec: 44098.0). Total num frames: 2321154048. Throughput: 0: 43734.3. Samples: 2224073420. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 03:48:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:48:03,862][06909] Updated weights for policy 0, policy_version 141673 (0.0034) [2024-06-28 03:48:08,245][06909] Updated weights for policy 0, policy_version 141683 (0.0026) [2024-06-28 03:48:08,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2321367040. Throughput: 0: 43851.6. Samples: 2224342760. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 03:48:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:48:11,244][06909] Updated weights for policy 0, policy_version 141693 (0.0031) [2024-06-28 03:48:13,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43417.7, 300 sec: 43986.9). Total num frames: 2321596416. Throughput: 0: 43780.1. Samples: 2224468080. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 03:48:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:48:15,609][06909] Updated weights for policy 0, policy_version 141703 (0.0036) [2024-06-28 03:48:18,707][06909] Updated weights for policy 0, policy_version 141713 (0.0029) [2024-06-28 03:48:18,856][06674] Fps is (10 sec: 45847.1, 60 sec: 43959.3, 300 sec: 44152.6). Total num frames: 2321825792. Throughput: 0: 43883.0. Samples: 2224731880. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 03:48:18,856][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:48:23,162][06909] Updated weights for policy 0, policy_version 141723 (0.0028) [2024-06-28 03:48:23,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 2322022400. Throughput: 0: 44104.0. Samples: 2225008220. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 03:48:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:48:26,514][06909] Updated weights for policy 0, policy_version 141733 (0.0042) [2024-06-28 03:48:28,850][06674] Fps is (10 sec: 44263.8, 60 sec: 44236.8, 300 sec: 44042.7). Total num frames: 2322268160. Throughput: 0: 44023.6. Samples: 2225133440. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 03:48:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:48:30,411][06909] Updated weights for policy 0, policy_version 141743 (0.0030) [2024-06-28 03:48:33,691][06909] Updated weights for policy 0, policy_version 141753 (0.0043) [2024-06-28 03:48:33,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2322481152. Throughput: 0: 44105.6. Samples: 2225398840. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 03:48:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:48:37,986][06909] Updated weights for policy 0, policy_version 141763 (0.0036) [2024-06-28 03:48:38,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44509.9, 300 sec: 43987.2). Total num frames: 2322710528. Throughput: 0: 44024.4. Samples: 2225664580. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 03:48:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:48:40,873][06909] Updated weights for policy 0, policy_version 141773 (0.0030) [2024-06-28 03:48:42,188][06887] Signal inference workers to stop experience collection... (31750 times) [2024-06-28 03:48:42,190][06887] Signal inference workers to resume experience collection... (31750 times) [2024-06-28 03:48:42,211][06909] InferenceWorker_p0-w0: stopping experience collection (31750 times) [2024-06-28 03:48:42,211][06909] InferenceWorker_p0-w0: resuming experience collection (31750 times) [2024-06-28 03:48:43,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2322923520. Throughput: 0: 44028.5. Samples: 2225792700. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 03:48:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 03:48:45,382][06909] Updated weights for policy 0, policy_version 141783 (0.0041) [2024-06-28 03:48:48,609][06909] Updated weights for policy 0, policy_version 141793 (0.0035) [2024-06-28 03:48:48,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44511.3, 300 sec: 44098.2). Total num frames: 2323152896. Throughput: 0: 44077.3. Samples: 2226056900. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 03:48:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:48:52,568][06909] Updated weights for policy 0, policy_version 141803 (0.0024) [2024-06-28 03:48:53,852][06674] Fps is (10 sec: 42589.4, 60 sec: 43962.2, 300 sec: 43986.6). Total num frames: 2323349504. Throughput: 0: 44036.2. Samples: 2226324480. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 03:48:53,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:48:55,838][06909] Updated weights for policy 0, policy_version 141813 (0.0047) [2024-06-28 03:48:58,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 2323578880. Throughput: 0: 44063.5. Samples: 2226450940. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 03:48:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 03:49:00,201][06909] Updated weights for policy 0, policy_version 141823 (0.0033) [2024-06-28 03:49:03,610][06909] Updated weights for policy 0, policy_version 141833 (0.0040) [2024-06-28 03:49:03,850][06674] Fps is (10 sec: 44245.8, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2323791872. Throughput: 0: 44087.3. Samples: 2226715540. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 03:49:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:49:07,514][06909] Updated weights for policy 0, policy_version 141843 (0.0032) [2024-06-28 03:49:08,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 2324037632. Throughput: 0: 43794.7. Samples: 2226978980. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 03:49:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:49:11,075][06909] Updated weights for policy 0, policy_version 141853 (0.0033) [2024-06-28 03:49:13,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2324234240. Throughput: 0: 43971.1. Samples: 2227112140. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 03:49:13,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:49:15,253][06909] Updated weights for policy 0, policy_version 141863 (0.0027) [2024-06-28 03:49:18,272][06909] Updated weights for policy 0, policy_version 141873 (0.0040) [2024-06-28 03:49:18,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43968.2, 300 sec: 44098.0). Total num frames: 2324463616. Throughput: 0: 43948.4. Samples: 2227376520. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 03:49:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:49:22,424][06909] Updated weights for policy 0, policy_version 141883 (0.0020) [2024-06-28 03:49:23,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 2324676608. Throughput: 0: 43927.5. Samples: 2227641320. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 03:49:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:49:25,755][06909] Updated weights for policy 0, policy_version 141893 (0.0038) [2024-06-28 03:49:28,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2324889600. Throughput: 0: 43888.4. Samples: 2227767680. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 03:49:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:49:30,283][06909] Updated weights for policy 0, policy_version 141903 (0.0039) [2024-06-28 03:49:33,174][06909] Updated weights for policy 0, policy_version 141913 (0.0029) [2024-06-28 03:49:33,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 2325118976. Throughput: 0: 44056.9. Samples: 2228039460. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 03:49:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 03:49:37,410][06909] Updated weights for policy 0, policy_version 141923 (0.0028) [2024-06-28 03:49:38,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.8, 300 sec: 43931.4). Total num frames: 2325348352. Throughput: 0: 44005.6. Samples: 2228304640. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 03:49:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:49:40,707][06909] Updated weights for policy 0, policy_version 141933 (0.0039) [2024-06-28 03:49:43,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43690.5, 300 sec: 43931.3). Total num frames: 2325544960. Throughput: 0: 44009.2. Samples: 2228431360. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 03:49:43,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:49:45,149][06909] Updated weights for policy 0, policy_version 141943 (0.0039) [2024-06-28 03:49:48,103][06909] Updated weights for policy 0, policy_version 141953 (0.0028) [2024-06-28 03:49:48,852][06674] Fps is (10 sec: 44227.6, 60 sec: 43962.3, 300 sec: 44042.1). Total num frames: 2325790720. Throughput: 0: 44031.3. Samples: 2228697040. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 03:49:48,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:49:48,864][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000141955_2325790720.pth... [2024-06-28 03:49:48,922][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000141310_2315223040.pth [2024-06-28 03:49:52,434][06909] Updated weights for policy 0, policy_version 141963 (0.0025) [2024-06-28 03:49:53,850][06674] Fps is (10 sec: 45876.2, 60 sec: 44238.3, 300 sec: 43986.9). Total num frames: 2326003712. Throughput: 0: 43962.6. Samples: 2228957300. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 03:49:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:49:55,473][06909] Updated weights for policy 0, policy_version 141973 (0.0029) [2024-06-28 03:49:58,850][06674] Fps is (10 sec: 40968.5, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 2326200320. Throughput: 0: 43962.3. Samples: 2229090440. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 03:49:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:49:59,981][06909] Updated weights for policy 0, policy_version 141983 (0.0043) [2024-06-28 03:50:02,982][06909] Updated weights for policy 0, policy_version 141993 (0.0047) [2024-06-28 03:50:03,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.8, 300 sec: 43987.8). Total num frames: 2326446080. Throughput: 0: 43980.9. Samples: 2229355660. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 03:50:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 03:50:07,390][06909] Updated weights for policy 0, policy_version 142003 (0.0025) [2024-06-28 03:50:07,396][06887] Signal inference workers to stop experience collection... (31800 times) [2024-06-28 03:50:07,396][06887] Signal inference workers to resume experience collection... (31800 times) [2024-06-28 03:50:07,435][06909] InferenceWorker_p0-w0: stopping experience collection (31800 times) [2024-06-28 03:50:07,435][06909] InferenceWorker_p0-w0: resuming experience collection (31800 times) [2024-06-28 03:50:08,850][06674] Fps is (10 sec: 47513.2, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2326675456. Throughput: 0: 43959.1. Samples: 2229619480. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 03:50:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 03:50:10,282][06909] Updated weights for policy 0, policy_version 142013 (0.0031) [2024-06-28 03:50:13,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 2326855680. Throughput: 0: 44063.4. Samples: 2229750540. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 03:50:13,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:50:14,850][06909] Updated weights for policy 0, policy_version 142023 (0.0035) [2024-06-28 03:50:18,003][06909] Updated weights for policy 0, policy_version 142033 (0.0044) [2024-06-28 03:50:18,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2327117824. Throughput: 0: 43969.0. Samples: 2230018060. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 03:50:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 03:50:22,303][06909] Updated weights for policy 0, policy_version 142043 (0.0036) [2024-06-28 03:50:23,850][06674] Fps is (10 sec: 45876.2, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2327314432. Throughput: 0: 43902.7. Samples: 2230280260. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 03:50:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:50:25,425][06909] Updated weights for policy 0, policy_version 142053 (0.0030) [2024-06-28 03:50:28,850][06674] Fps is (10 sec: 39321.8, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 2327511040. Throughput: 0: 44063.3. Samples: 2230414200. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 03:50:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:50:29,453][06909] Updated weights for policy 0, policy_version 142063 (0.0036) [2024-06-28 03:50:32,554][06909] Updated weights for policy 0, policy_version 142073 (0.0027) [2024-06-28 03:50:33,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2327773184. Throughput: 0: 44146.1. Samples: 2230683520. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 03:50:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:50:36,812][06909] Updated weights for policy 0, policy_version 142083 (0.0030) [2024-06-28 03:50:38,850][06674] Fps is (10 sec: 47513.6, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2327986176. Throughput: 0: 44156.9. Samples: 2230944360. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 03:50:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:50:40,174][06909] Updated weights for policy 0, policy_version 142093 (0.0029) [2024-06-28 03:50:43,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43963.9, 300 sec: 43931.4). Total num frames: 2328182784. Throughput: 0: 44167.1. Samples: 2231077960. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 03:50:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:50:44,448][06909] Updated weights for policy 0, policy_version 142103 (0.0026) [2024-06-28 03:50:47,715][06909] Updated weights for policy 0, policy_version 142113 (0.0028) [2024-06-28 03:50:48,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44238.3, 300 sec: 44153.5). Total num frames: 2328444928. Throughput: 0: 44220.0. Samples: 2231345560. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 03:50:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:50:51,741][06909] Updated weights for policy 0, policy_version 142123 (0.0026) [2024-06-28 03:50:53,850][06674] Fps is (10 sec: 47513.3, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2328657920. Throughput: 0: 44190.3. Samples: 2231608040. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 03:50:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:50:55,013][06909] Updated weights for policy 0, policy_version 142133 (0.0041) [2024-06-28 03:50:58,850][06674] Fps is (10 sec: 40960.4, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2328854528. Throughput: 0: 44231.8. Samples: 2231740960. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 03:50:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:50:59,443][06909] Updated weights for policy 0, policy_version 142143 (0.0026) [2024-06-28 03:51:02,279][06909] Updated weights for policy 0, policy_version 142153 (0.0026) [2024-06-28 03:51:03,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 2329100288. Throughput: 0: 44131.2. Samples: 2232003960. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 03:51:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:51:06,607][06909] Updated weights for policy 0, policy_version 142163 (0.0028) [2024-06-28 03:51:08,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2329313280. Throughput: 0: 44244.3. Samples: 2232271260. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 03:51:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:51:09,788][06909] Updated weights for policy 0, policy_version 142173 (0.0034) [2024-06-28 03:51:13,840][06909] Updated weights for policy 0, policy_version 142183 (0.0031) [2024-06-28 03:51:13,850][06674] Fps is (10 sec: 42598.1, 60 sec: 44510.0, 300 sec: 44097.9). Total num frames: 2329526272. Throughput: 0: 44212.8. Samples: 2232403780. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 03:51:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:51:17,180][06909] Updated weights for policy 0, policy_version 142193 (0.0032) [2024-06-28 03:51:18,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2329755648. Throughput: 0: 43949.3. Samples: 2232661240. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 03:51:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:51:21,806][06909] Updated weights for policy 0, policy_version 142203 (0.0033) [2024-06-28 03:51:23,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2329968640. Throughput: 0: 43894.3. Samples: 2232919600. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 03:51:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:51:24,913][06909] Updated weights for policy 0, policy_version 142213 (0.0032) [2024-06-28 03:51:28,850][06674] Fps is (10 sec: 40959.7, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 2330165248. Throughput: 0: 43925.2. Samples: 2233054600. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 03:51:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:51:29,132][06909] Updated weights for policy 0, policy_version 142223 (0.0033) [2024-06-28 03:51:29,824][06887] Signal inference workers to stop experience collection... (31850 times) [2024-06-28 03:51:29,828][06887] Signal inference workers to resume experience collection... (31850 times) [2024-06-28 03:51:29,879][06909] InferenceWorker_p0-w0: stopping experience collection (31850 times) [2024-06-28 03:51:29,879][06909] InferenceWorker_p0-w0: resuming experience collection (31850 times) [2024-06-28 03:51:32,475][06909] Updated weights for policy 0, policy_version 142233 (0.0039) [2024-06-28 03:51:33,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43963.6, 300 sec: 44098.0). Total num frames: 2330411008. Throughput: 0: 43948.0. Samples: 2233323220. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 03:51:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:51:36,417][06909] Updated weights for policy 0, policy_version 142243 (0.0024) [2024-06-28 03:51:38,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 2330607616. Throughput: 0: 43896.8. Samples: 2233583400. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 03:51:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:51:39,665][06909] Updated weights for policy 0, policy_version 142253 (0.0041) [2024-06-28 03:51:43,815][06909] Updated weights for policy 0, policy_version 142263 (0.0028) [2024-06-28 03:51:43,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 2330836992. Throughput: 0: 43933.2. Samples: 2233717960. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 03:51:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:51:47,142][06909] Updated weights for policy 0, policy_version 142273 (0.0036) [2024-06-28 03:51:48,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43690.7, 300 sec: 44097.9). Total num frames: 2331066368. Throughput: 0: 44022.6. Samples: 2233984980. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 03:51:48,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:51:48,875][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000142277_2331066368.pth... [2024-06-28 03:51:48,944][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000141631_2320482304.pth [2024-06-28 03:51:51,177][06909] Updated weights for policy 0, policy_version 142283 (0.0037) [2024-06-28 03:51:53,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 2331279360. Throughput: 0: 43936.9. Samples: 2234248420. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 03:51:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:51:54,533][06909] Updated weights for policy 0, policy_version 142293 (0.0036) [2024-06-28 03:51:58,699][06909] Updated weights for policy 0, policy_version 142303 (0.0035) [2024-06-28 03:51:58,852][06674] Fps is (10 sec: 42590.1, 60 sec: 43962.2, 300 sec: 43986.9). Total num frames: 2331492352. Throughput: 0: 43893.6. Samples: 2234379080. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 03:51:58,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:52:01,931][06909] Updated weights for policy 0, policy_version 142313 (0.0023) [2024-06-28 03:52:03,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2331721728. Throughput: 0: 44046.6. Samples: 2234643340. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 03:52:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:52:06,151][06909] Updated weights for policy 0, policy_version 142323 (0.0036) [2024-06-28 03:52:08,850][06674] Fps is (10 sec: 44245.4, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 2331934720. Throughput: 0: 44127.9. Samples: 2234905360. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 03:52:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:52:09,607][06909] Updated weights for policy 0, policy_version 142333 (0.0041) [2024-06-28 03:52:13,527][06909] Updated weights for policy 0, policy_version 142343 (0.0032) [2024-06-28 03:52:13,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.7, 300 sec: 43931.4). Total num frames: 2332147712. Throughput: 0: 44113.9. Samples: 2235039720. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 03:52:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:52:16,778][06909] Updated weights for policy 0, policy_version 142353 (0.0036) [2024-06-28 03:52:18,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2332393472. Throughput: 0: 44052.9. Samples: 2235305600. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 03:52:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:52:20,753][06909] Updated weights for policy 0, policy_version 142363 (0.0031) [2024-06-28 03:52:23,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2332590080. Throughput: 0: 44103.2. Samples: 2235568040. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 03:52:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:52:24,457][06909] Updated weights for policy 0, policy_version 142373 (0.0035) [2024-06-28 03:52:28,415][06909] Updated weights for policy 0, policy_version 142383 (0.0036) [2024-06-28 03:52:28,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 2332803072. Throughput: 0: 43934.7. Samples: 2235695020. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 03:52:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:52:31,865][06909] Updated weights for policy 0, policy_version 142393 (0.0029) [2024-06-28 03:52:33,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 2333048832. Throughput: 0: 44005.8. Samples: 2235965240. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 03:52:33,854][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:52:35,914][06909] Updated weights for policy 0, policy_version 142403 (0.0025) [2024-06-28 03:52:38,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 2333261824. Throughput: 0: 43978.3. Samples: 2236227440. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 03:52:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:52:39,327][06909] Updated weights for policy 0, policy_version 142413 (0.0026) [2024-06-28 03:52:43,042][06909] Updated weights for policy 0, policy_version 142423 (0.0027) [2024-06-28 03:52:43,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 44098.3). Total num frames: 2333491200. Throughput: 0: 44166.9. Samples: 2236366500. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 03:52:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:52:46,528][06909] Updated weights for policy 0, policy_version 142433 (0.0019) [2024-06-28 03:52:48,852][06674] Fps is (10 sec: 45865.7, 60 sec: 44235.4, 300 sec: 44097.6). Total num frames: 2333720576. Throughput: 0: 44226.1. Samples: 2236633600. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 03:52:48,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:52:50,438][06909] Updated weights for policy 0, policy_version 142443 (0.0032) [2024-06-28 03:52:53,654][06909] Updated weights for policy 0, policy_version 142453 (0.0035) [2024-06-28 03:52:53,850][06674] Fps is (10 sec: 45874.5, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 2333949952. Throughput: 0: 44425.7. Samples: 2236904520. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 03:52:53,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:52:57,645][06909] Updated weights for policy 0, policy_version 142463 (0.0033) [2024-06-28 03:52:58,850][06674] Fps is (10 sec: 42607.1, 60 sec: 44238.3, 300 sec: 44042.4). Total num frames: 2334146560. Throughput: 0: 44375.5. Samples: 2237036620. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 03:52:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:53:00,467][06887] Signal inference workers to stop experience collection... (31900 times) [2024-06-28 03:53:00,515][06909] InferenceWorker_p0-w0: stopping experience collection (31900 times) [2024-06-28 03:53:00,581][06887] Signal inference workers to resume experience collection... (31900 times) [2024-06-28 03:53:00,582][06909] InferenceWorker_p0-w0: resuming experience collection (31900 times) [2024-06-28 03:53:01,174][06909] Updated weights for policy 0, policy_version 142473 (0.0034) [2024-06-28 03:53:03,850][06674] Fps is (10 sec: 44237.4, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 2334392320. Throughput: 0: 44352.5. Samples: 2237301460. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 03:53:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:53:05,370][06909] Updated weights for policy 0, policy_version 142483 (0.0030) [2024-06-28 03:53:08,683][06909] Updated weights for policy 0, policy_version 142493 (0.0033) [2024-06-28 03:53:08,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44509.9, 300 sec: 44097.9). Total num frames: 2334605312. Throughput: 0: 44326.1. Samples: 2237562720. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 03:53:08,853][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 03:53:12,669][06909] Updated weights for policy 0, policy_version 142503 (0.0028) [2024-06-28 03:53:13,850][06674] Fps is (10 sec: 42598.9, 60 sec: 44509.9, 300 sec: 44043.3). Total num frames: 2334818304. Throughput: 0: 44477.5. Samples: 2237696500. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 03:53:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:53:16,163][06909] Updated weights for policy 0, policy_version 142513 (0.0035) [2024-06-28 03:53:18,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2335047680. Throughput: 0: 44347.1. Samples: 2237960860. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 03:53:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:53:20,004][06909] Updated weights for policy 0, policy_version 142523 (0.0040) [2024-06-28 03:53:23,382][06909] Updated weights for policy 0, policy_version 142533 (0.0030) [2024-06-28 03:53:23,850][06674] Fps is (10 sec: 45874.5, 60 sec: 44782.8, 300 sec: 44097.9). Total num frames: 2335277056. Throughput: 0: 44392.3. Samples: 2238225100. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 03:53:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:53:27,262][06909] Updated weights for policy 0, policy_version 142543 (0.0033) [2024-06-28 03:53:28,850][06674] Fps is (10 sec: 42599.0, 60 sec: 44510.0, 300 sec: 44042.4). Total num frames: 2335473664. Throughput: 0: 44316.6. Samples: 2238360740. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 03:53:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:53:30,682][06909] Updated weights for policy 0, policy_version 142553 (0.0037) [2024-06-28 03:53:33,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2335703040. Throughput: 0: 44298.0. Samples: 2238626920. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 03:53:33,850][06674] Avg episode reward: [(0, '0.404')] [2024-06-28 03:53:34,518][06909] Updated weights for policy 0, policy_version 142563 (0.0046) [2024-06-28 03:53:38,222][06909] Updated weights for policy 0, policy_version 142573 (0.0038) [2024-06-28 03:53:38,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44509.9, 300 sec: 44097.9). Total num frames: 2335932416. Throughput: 0: 44079.8. Samples: 2238888100. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 03:53:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:53:41,871][06909] Updated weights for policy 0, policy_version 142583 (0.0026) [2024-06-28 03:53:43,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 2336145408. Throughput: 0: 44079.6. Samples: 2239020200. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 03:53:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:53:45,900][06909] Updated weights for policy 0, policy_version 142593 (0.0042) [2024-06-28 03:53:48,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43965.3, 300 sec: 44098.3). Total num frames: 2336358400. Throughput: 0: 44050.3. Samples: 2239283720. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 03:53:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:53:48,938][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000142601_2336374784.pth... [2024-06-28 03:53:49,000][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000141955_2325790720.pth [2024-06-28 03:53:49,520][06909] Updated weights for policy 0, policy_version 142603 (0.0033) [2024-06-28 03:53:53,286][06909] Updated weights for policy 0, policy_version 142613 (0.0027) [2024-06-28 03:53:53,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 2336604160. Throughput: 0: 44076.1. Samples: 2239546140. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 03:53:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:53:57,085][06909] Updated weights for policy 0, policy_version 142623 (0.0039) [2024-06-28 03:53:58,850][06674] Fps is (10 sec: 44235.8, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2336800768. Throughput: 0: 44181.1. Samples: 2239684660. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 03:53:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:54:00,574][06909] Updated weights for policy 0, policy_version 142633 (0.0046) [2024-06-28 03:54:03,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2337013760. Throughput: 0: 44160.9. Samples: 2239948100. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 03:54:03,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-28 03:54:04,419][06909] Updated weights for policy 0, policy_version 142643 (0.0020) [2024-06-28 03:54:08,076][06909] Updated weights for policy 0, policy_version 142653 (0.0029) [2024-06-28 03:54:08,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2337243136. Throughput: 0: 44014.3. Samples: 2240205740. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 03:54:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:54:11,810][06909] Updated weights for policy 0, policy_version 142663 (0.0032) [2024-06-28 03:54:13,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 2337456128. Throughput: 0: 44117.2. Samples: 2240346020. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 03:54:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:54:15,425][06909] Updated weights for policy 0, policy_version 142673 (0.0030) [2024-06-28 03:54:18,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2337701888. Throughput: 0: 44109.8. Samples: 2240611860. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 03:54:18,850][06674] Avg episode reward: [(0, '0.457')] [2024-06-28 03:54:19,158][06909] Updated weights for policy 0, policy_version 142683 (0.0025) [2024-06-28 03:54:22,695][06909] Updated weights for policy 0, policy_version 142693 (0.0030) [2024-06-28 03:54:23,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2337914880. Throughput: 0: 44081.3. Samples: 2240871760. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 03:54:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:54:26,714][06909] Updated weights for policy 0, policy_version 142703 (0.0031) [2024-06-28 03:54:28,850][06674] Fps is (10 sec: 42598.2, 60 sec: 44236.7, 300 sec: 44098.0). Total num frames: 2338127872. Throughput: 0: 44026.6. Samples: 2241001400. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 03:54:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:54:30,318][06909] Updated weights for policy 0, policy_version 142713 (0.0028) [2024-06-28 03:54:33,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2338340864. Throughput: 0: 44087.1. Samples: 2241267640. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 03:54:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:54:34,248][06909] Updated weights for policy 0, policy_version 142723 (0.0033) [2024-06-28 03:54:36,180][06887] Signal inference workers to stop experience collection... (31950 times) [2024-06-28 03:54:36,181][06887] Signal inference workers to resume experience collection... (31950 times) [2024-06-28 03:54:36,215][06909] InferenceWorker_p0-w0: stopping experience collection (31950 times) [2024-06-28 03:54:36,215][06909] InferenceWorker_p0-w0: resuming experience collection (31950 times) [2024-06-28 03:54:37,801][06909] Updated weights for policy 0, policy_version 142733 (0.0034) [2024-06-28 03:54:38,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2338570240. Throughput: 0: 44066.7. Samples: 2241529140. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 03:54:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:54:41,703][06909] Updated weights for policy 0, policy_version 142743 (0.0023) [2024-06-28 03:54:43,852][06674] Fps is (10 sec: 42589.3, 60 sec: 43689.1, 300 sec: 43986.9). Total num frames: 2338766848. Throughput: 0: 43951.0. Samples: 2241662540. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 03:54:43,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:54:45,179][06909] Updated weights for policy 0, policy_version 142753 (0.0026) [2024-06-28 03:54:48,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2338996224. Throughput: 0: 43890.8. Samples: 2241923180. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 03:54:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:54:49,117][06909] Updated weights for policy 0, policy_version 142763 (0.0027) [2024-06-28 03:54:52,695][06909] Updated weights for policy 0, policy_version 142773 (0.0028) [2024-06-28 03:54:53,850][06674] Fps is (10 sec: 47523.3, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 2339241984. Throughput: 0: 43949.8. Samples: 2242183480. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 03:54:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:54:56,930][06909] Updated weights for policy 0, policy_version 142783 (0.0034) [2024-06-28 03:54:58,850][06674] Fps is (10 sec: 42596.6, 60 sec: 43690.5, 300 sec: 43986.8). Total num frames: 2339422208. Throughput: 0: 43764.2. Samples: 2242315420. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 03:54:58,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:55:00,060][06909] Updated weights for policy 0, policy_version 142793 (0.0028) [2024-06-28 03:55:03,852][06674] Fps is (10 sec: 42589.5, 60 sec: 44235.3, 300 sec: 44042.1). Total num frames: 2339667968. Throughput: 0: 43903.3. Samples: 2242587600. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 03:55:03,853][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:55:04,156][06909] Updated weights for policy 0, policy_version 142803 (0.0037) [2024-06-28 03:55:07,564][06909] Updated weights for policy 0, policy_version 142813 (0.0024) [2024-06-28 03:55:08,850][06674] Fps is (10 sec: 47515.2, 60 sec: 44236.8, 300 sec: 44209.1). Total num frames: 2339897344. Throughput: 0: 43837.8. Samples: 2242844460. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 03:55:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:55:11,671][06909] Updated weights for policy 0, policy_version 142823 (0.0036) [2024-06-28 03:55:13,850][06674] Fps is (10 sec: 44245.8, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2340110336. Throughput: 0: 43964.9. Samples: 2242979820. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 03:55:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:55:14,958][06909] Updated weights for policy 0, policy_version 142833 (0.0026) [2024-06-28 03:55:18,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 2340323328. Throughput: 0: 43935.5. Samples: 2243244740. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 03:55:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:55:18,911][06909] Updated weights for policy 0, policy_version 142843 (0.0035) [2024-06-28 03:55:22,570][06909] Updated weights for policy 0, policy_version 142853 (0.0022) [2024-06-28 03:55:23,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 2340552704. Throughput: 0: 43938.6. Samples: 2243506380. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 03:55:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 03:55:26,315][06909] Updated weights for policy 0, policy_version 142863 (0.0041) [2024-06-28 03:55:28,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2340749312. Throughput: 0: 43866.4. Samples: 2243636440. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 03:55:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:55:29,899][06909] Updated weights for policy 0, policy_version 142873 (0.0029) [2024-06-28 03:55:33,825][06909] Updated weights for policy 0, policy_version 142883 (0.0033) [2024-06-28 03:55:33,850][06674] Fps is (10 sec: 44237.4, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2340995072. Throughput: 0: 44052.4. Samples: 2243905540. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 03:55:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:55:37,591][06909] Updated weights for policy 0, policy_version 142893 (0.0032) [2024-06-28 03:55:38,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2341208064. Throughput: 0: 44059.9. Samples: 2244166180. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 03:55:38,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:55:41,356][06909] Updated weights for policy 0, policy_version 142903 (0.0031) [2024-06-28 03:55:43,856][06674] Fps is (10 sec: 42572.4, 60 sec: 44233.8, 300 sec: 43986.0). Total num frames: 2341421056. Throughput: 0: 44050.0. Samples: 2244297920. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 03:55:43,856][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 03:55:44,971][06909] Updated weights for policy 0, policy_version 142913 (0.0028) [2024-06-28 03:55:48,759][06909] Updated weights for policy 0, policy_version 142923 (0.0028) [2024-06-28 03:55:48,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2341650432. Throughput: 0: 43896.6. Samples: 2244562860. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 03:55:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:55:48,867][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000142923_2341650432.pth... [2024-06-28 03:55:48,924][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000142277_2331066368.pth [2024-06-28 03:55:52,571][06909] Updated weights for policy 0, policy_version 142933 (0.0026) [2024-06-28 03:55:53,850][06674] Fps is (10 sec: 45902.4, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2341879808. Throughput: 0: 43906.6. Samples: 2244820260. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 03:55:53,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:55:56,156][06909] Updated weights for policy 0, policy_version 142943 (0.0023) [2024-06-28 03:55:58,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44237.0, 300 sec: 43986.9). Total num frames: 2342076416. Throughput: 0: 43846.2. Samples: 2244952900. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 03:55:58,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:56:00,317][06909] Updated weights for policy 0, policy_version 142953 (0.0030) [2024-06-28 03:56:03,629][06909] Updated weights for policy 0, policy_version 142963 (0.0031) [2024-06-28 03:56:03,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43965.2, 300 sec: 44042.4). Total num frames: 2342305792. Throughput: 0: 43927.8. Samples: 2245221500. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 03:56:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 03:56:07,529][06909] Updated weights for policy 0, policy_version 142973 (0.0043) [2024-06-28 03:56:08,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2342518784. Throughput: 0: 43886.8. Samples: 2245481280. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 03:56:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:56:11,134][06909] Updated weights for policy 0, policy_version 142983 (0.0040) [2024-06-28 03:56:13,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2342748160. Throughput: 0: 43904.0. Samples: 2245612120. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 03:56:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:56:14,280][06887] Signal inference workers to stop experience collection... (32000 times) [2024-06-28 03:56:14,281][06887] Signal inference workers to resume experience collection... (32000 times) [2024-06-28 03:56:14,328][06909] InferenceWorker_p0-w0: stopping experience collection (32000 times) [2024-06-28 03:56:14,328][06909] InferenceWorker_p0-w0: resuming experience collection (32000 times) [2024-06-28 03:56:15,053][06909] Updated weights for policy 0, policy_version 142993 (0.0021) [2024-06-28 03:56:18,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2342944768. Throughput: 0: 43804.0. Samples: 2245876720. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 03:56:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 03:56:18,943][06909] Updated weights for policy 0, policy_version 143003 (0.0041) [2024-06-28 03:56:22,394][06909] Updated weights for policy 0, policy_version 143013 (0.0032) [2024-06-28 03:56:23,851][06674] Fps is (10 sec: 44230.4, 60 sec: 43962.7, 300 sec: 44153.3). Total num frames: 2343190528. Throughput: 0: 43835.5. Samples: 2246138840. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 03:56:23,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:56:26,177][06909] Updated weights for policy 0, policy_version 143023 (0.0039) [2024-06-28 03:56:28,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2343403520. Throughput: 0: 43867.6. Samples: 2246271700. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 03:56:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:56:29,571][06909] Updated weights for policy 0, policy_version 143033 (0.0027) [2024-06-28 03:56:33,763][06909] Updated weights for policy 0, policy_version 143043 (0.0028) [2024-06-28 03:56:33,850][06674] Fps is (10 sec: 42604.6, 60 sec: 43690.6, 300 sec: 44098.0). Total num frames: 2343616512. Throughput: 0: 43853.8. Samples: 2246536280. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 03:56:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:56:37,357][06909] Updated weights for policy 0, policy_version 143053 (0.0037) [2024-06-28 03:56:38,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2343845888. Throughput: 0: 44011.2. Samples: 2246800760. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 03:56:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:56:40,932][06909] Updated weights for policy 0, policy_version 143063 (0.0031) [2024-06-28 03:56:43,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43968.2, 300 sec: 44042.4). Total num frames: 2344058880. Throughput: 0: 44046.3. Samples: 2246934980. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 03:56:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:56:44,502][06909] Updated weights for policy 0, policy_version 143073 (0.0027) [2024-06-28 03:56:48,116][06909] Updated weights for policy 0, policy_version 143083 (0.0045) [2024-06-28 03:56:48,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.8, 300 sec: 44042.4). Total num frames: 2344271872. Throughput: 0: 44009.5. Samples: 2247201920. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 03:56:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:56:52,158][06909] Updated weights for policy 0, policy_version 143093 (0.0035) [2024-06-28 03:56:53,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43690.7, 300 sec: 44098.2). Total num frames: 2344501248. Throughput: 0: 44089.7. Samples: 2247465320. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 03:56:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:56:55,989][06909] Updated weights for policy 0, policy_version 143103 (0.0047) [2024-06-28 03:56:58,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2344730624. Throughput: 0: 44171.5. Samples: 2247599840. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 03:56:58,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:56:59,221][06909] Updated weights for policy 0, policy_version 143113 (0.0026) [2024-06-28 03:57:03,139][06909] Updated weights for policy 0, policy_version 143123 (0.0027) [2024-06-28 03:57:03,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43963.9, 300 sec: 44098.0). Total num frames: 2344943616. Throughput: 0: 44252.4. Samples: 2247868080. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-28 03:57:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:57:07,078][06909] Updated weights for policy 0, policy_version 143133 (0.0041) [2024-06-28 03:57:08,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2345172992. Throughput: 0: 44171.7. Samples: 2248126500. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-28 03:57:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:57:10,709][06909] Updated weights for policy 0, policy_version 143143 (0.0034) [2024-06-28 03:57:13,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2345385984. Throughput: 0: 44280.0. Samples: 2248264300. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-28 03:57:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:57:14,291][06909] Updated weights for policy 0, policy_version 143153 (0.0022) [2024-06-28 03:57:17,828][06909] Updated weights for policy 0, policy_version 143163 (0.0036) [2024-06-28 03:57:18,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2345598976. Throughput: 0: 44200.9. Samples: 2248525320. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-28 03:57:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:57:21,806][06909] Updated weights for policy 0, policy_version 143173 (0.0037) [2024-06-28 03:57:23,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43964.8, 300 sec: 44153.5). Total num frames: 2345828352. Throughput: 0: 44151.9. Samples: 2248787600. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-28 03:57:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:57:25,231][06909] Updated weights for policy 0, policy_version 143183 (0.0034) [2024-06-28 03:57:28,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2346041344. Throughput: 0: 44221.8. Samples: 2248924960. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-28 03:57:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:57:29,135][06909] Updated weights for policy 0, policy_version 143193 (0.0040) [2024-06-28 03:57:32,577][06909] Updated weights for policy 0, policy_version 143203 (0.0027) [2024-06-28 03:57:33,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2346270720. Throughput: 0: 44128.8. Samples: 2249187720. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-28 03:57:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:57:35,315][06887] Signal inference workers to stop experience collection... (32050 times) [2024-06-28 03:57:35,315][06887] Signal inference workers to resume experience collection... (32050 times) [2024-06-28 03:57:35,355][06909] InferenceWorker_p0-w0: stopping experience collection (32050 times) [2024-06-28 03:57:35,355][06909] InferenceWorker_p0-w0: resuming experience collection (32050 times) [2024-06-28 03:57:36,332][06909] Updated weights for policy 0, policy_version 143213 (0.0032) [2024-06-28 03:57:38,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 2346467328. Throughput: 0: 44215.5. Samples: 2249455020. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-28 03:57:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:57:40,198][06909] Updated weights for policy 0, policy_version 143223 (0.0032) [2024-06-28 03:57:43,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.8, 300 sec: 44042.7). Total num frames: 2346713088. Throughput: 0: 44235.2. Samples: 2249590420. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-28 03:57:43,850][06674] Avg episode reward: [(0, '0.441')] [2024-06-28 03:57:43,941][06909] Updated weights for policy 0, policy_version 143233 (0.0028) [2024-06-28 03:57:47,687][06909] Updated weights for policy 0, policy_version 143243 (0.0034) [2024-06-28 03:57:48,850][06674] Fps is (10 sec: 45875.7, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2346926080. Throughput: 0: 43942.2. Samples: 2249845480. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-28 03:57:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:57:48,869][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000143245_2346926080.pth... [2024-06-28 03:57:48,918][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000142601_2336374784.pth [2024-06-28 03:57:51,537][06909] Updated weights for policy 0, policy_version 143253 (0.0026) [2024-06-28 03:57:53,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43690.8, 300 sec: 43986.9). Total num frames: 2347122688. Throughput: 0: 44117.0. Samples: 2250111760. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-28 03:57:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:57:54,874][06909] Updated weights for policy 0, policy_version 143263 (0.0028) [2024-06-28 03:57:58,737][06909] Updated weights for policy 0, policy_version 143273 (0.0036) [2024-06-28 03:57:58,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2347384832. Throughput: 0: 44047.6. Samples: 2250246440. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-28 03:57:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:58:02,569][06909] Updated weights for policy 0, policy_version 143283 (0.0028) [2024-06-28 03:58:03,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2347581440. Throughput: 0: 43959.6. Samples: 2250503500. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-28 03:58:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:58:06,174][06909] Updated weights for policy 0, policy_version 143293 (0.0031) [2024-06-28 03:58:08,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 2347794432. Throughput: 0: 44159.1. Samples: 2250774760. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 03:58:08,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:58:10,167][06909] Updated weights for policy 0, policy_version 143303 (0.0030) [2024-06-28 03:58:13,616][06909] Updated weights for policy 0, policy_version 143313 (0.0041) [2024-06-28 03:58:13,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 2348040192. Throughput: 0: 44020.4. Samples: 2250905880. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 03:58:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:58:17,601][06909] Updated weights for policy 0, policy_version 143323 (0.0033) [2024-06-28 03:58:18,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.7, 300 sec: 43931.4). Total num frames: 2348236800. Throughput: 0: 43919.2. Samples: 2251164080. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 03:58:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 03:58:21,215][06909] Updated weights for policy 0, policy_version 143333 (0.0038) [2024-06-28 03:58:23,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2348449792. Throughput: 0: 43944.0. Samples: 2251432500. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 03:58:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 03:58:25,029][06909] Updated weights for policy 0, policy_version 143343 (0.0040) [2024-06-28 03:58:28,679][06909] Updated weights for policy 0, policy_version 143353 (0.0029) [2024-06-28 03:58:28,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2348695552. Throughput: 0: 43798.2. Samples: 2251561340. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 03:58:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:58:32,738][06909] Updated weights for policy 0, policy_version 143363 (0.0037) [2024-06-28 03:58:33,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 2348892160. Throughput: 0: 43875.6. Samples: 2251819880. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 03:58:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:58:36,094][06909] Updated weights for policy 0, policy_version 143373 (0.0032) [2024-06-28 03:58:38,850][06674] Fps is (10 sec: 42598.2, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2349121536. Throughput: 0: 44115.0. Samples: 2252096940. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 03:58:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:58:40,110][06909] Updated weights for policy 0, policy_version 143383 (0.0034) [2024-06-28 03:58:43,338][06909] Updated weights for policy 0, policy_version 143393 (0.0029) [2024-06-28 03:58:43,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2349350912. Throughput: 0: 44035.6. Samples: 2252228040. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 03:58:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 03:58:47,430][06909] Updated weights for policy 0, policy_version 143403 (0.0037) [2024-06-28 03:58:48,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 2349563904. Throughput: 0: 44245.8. Samples: 2252494560. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 03:58:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:58:50,917][06909] Updated weights for policy 0, policy_version 143413 (0.0027) [2024-06-28 03:58:53,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2349776896. Throughput: 0: 44096.1. Samples: 2252759080. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 03:58:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:58:55,061][06909] Updated weights for policy 0, policy_version 143423 (0.0037) [2024-06-28 03:58:56,888][06887] Signal inference workers to stop experience collection... (32100 times) [2024-06-28 03:58:56,931][06909] InferenceWorker_p0-w0: stopping experience collection (32100 times) [2024-06-28 03:58:56,940][06887] Signal inference workers to resume experience collection... (32100 times) [2024-06-28 03:58:56,952][06909] InferenceWorker_p0-w0: resuming experience collection (32100 times) [2024-06-28 03:58:58,068][06909] Updated weights for policy 0, policy_version 143433 (0.0037) [2024-06-28 03:58:58,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2350022656. Throughput: 0: 44123.6. Samples: 2252891440. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 03:58:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:59:02,368][06909] Updated weights for policy 0, policy_version 143443 (0.0037) [2024-06-28 03:59:03,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2350235648. Throughput: 0: 44162.7. Samples: 2253151400. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 03:59:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:59:05,749][06909] Updated weights for policy 0, policy_version 143453 (0.0035) [2024-06-28 03:59:08,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2350448640. Throughput: 0: 44068.0. Samples: 2253415560. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2024-06-28 03:59:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 03:59:10,118][06909] Updated weights for policy 0, policy_version 143463 (0.0042) [2024-06-28 03:59:13,166][06909] Updated weights for policy 0, policy_version 143473 (0.0027) [2024-06-28 03:59:13,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 2350661632. Throughput: 0: 44053.3. Samples: 2253543740. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2024-06-28 03:59:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:59:17,465][06909] Updated weights for policy 0, policy_version 143483 (0.0027) [2024-06-28 03:59:18,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2350891008. Throughput: 0: 44310.7. Samples: 2253813860. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2024-06-28 03:59:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:59:20,588][06909] Updated weights for policy 0, policy_version 143493 (0.0022) [2024-06-28 03:59:23,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 2351087616. Throughput: 0: 43963.1. Samples: 2254075280. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2024-06-28 03:59:23,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:59:25,041][06909] Updated weights for policy 0, policy_version 143503 (0.0052) [2024-06-28 03:59:28,120][06909] Updated weights for policy 0, policy_version 143513 (0.0026) [2024-06-28 03:59:28,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2351349760. Throughput: 0: 43819.0. Samples: 2254199900. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2024-06-28 03:59:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:59:32,471][06909] Updated weights for policy 0, policy_version 143523 (0.0034) [2024-06-28 03:59:33,850][06674] Fps is (10 sec: 47513.8, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 2351562752. Throughput: 0: 43866.1. Samples: 2254468540. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2024-06-28 03:59:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:59:35,385][06909] Updated weights for policy 0, policy_version 143533 (0.0043) [2024-06-28 03:59:38,850][06674] Fps is (10 sec: 39321.9, 60 sec: 43690.7, 300 sec: 43987.2). Total num frames: 2351742976. Throughput: 0: 43887.1. Samples: 2254734000. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2024-06-28 03:59:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:59:39,838][06909] Updated weights for policy 0, policy_version 143543 (0.0032) [2024-06-28 03:59:42,687][06909] Updated weights for policy 0, policy_version 143553 (0.0032) [2024-06-28 03:59:43,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2352005120. Throughput: 0: 43694.2. Samples: 2254857680. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2024-06-28 03:59:43,850][06674] Avg episode reward: [(0, '0.434')] [2024-06-28 03:59:47,230][06909] Updated weights for policy 0, policy_version 143563 (0.0041) [2024-06-28 03:59:48,850][06674] Fps is (10 sec: 47513.0, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 2352218112. Throughput: 0: 43984.3. Samples: 2255130700. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2024-06-28 03:59:48,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 03:59:48,864][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000143568_2352218112.pth... [2024-06-28 03:59:48,919][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000142923_2341650432.pth [2024-06-28 03:59:50,358][06909] Updated weights for policy 0, policy_version 143573 (0.0032) [2024-06-28 03:59:53,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43963.7, 300 sec: 44042.5). Total num frames: 2352414720. Throughput: 0: 43864.8. Samples: 2255389480. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2024-06-28 03:59:53,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 03:59:54,615][06909] Updated weights for policy 0, policy_version 143583 (0.0022) [2024-06-28 03:59:57,679][06909] Updated weights for policy 0, policy_version 143593 (0.0044) [2024-06-28 03:59:58,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.7, 300 sec: 44042.7). Total num frames: 2352660480. Throughput: 0: 43981.7. Samples: 2255522920. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2024-06-28 03:59:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:00:02,082][06909] Updated weights for policy 0, policy_version 143603 (0.0043) [2024-06-28 04:00:03,851][06674] Fps is (10 sec: 45869.9, 60 sec: 43962.8, 300 sec: 43986.7). Total num frames: 2352873472. Throughput: 0: 43849.9. Samples: 2255787160. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2024-06-28 04:00:03,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:00:05,359][06909] Updated weights for policy 0, policy_version 143613 (0.0034) [2024-06-28 04:00:08,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 2353070080. Throughput: 0: 44107.2. Samples: 2256060100. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2024-06-28 04:00:08,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:00:09,490][06909] Updated weights for policy 0, policy_version 143623 (0.0025) [2024-06-28 04:00:11,839][06887] Signal inference workers to stop experience collection... (32150 times) [2024-06-28 04:00:11,847][06887] Signal inference workers to resume experience collection... (32150 times) [2024-06-28 04:00:11,853][06909] InferenceWorker_p0-w0: stopping experience collection (32150 times) [2024-06-28 04:00:11,877][06909] InferenceWorker_p0-w0: resuming experience collection (32150 times) [2024-06-28 04:00:12,676][06909] Updated weights for policy 0, policy_version 143633 (0.0046) [2024-06-28 04:00:13,850][06674] Fps is (10 sec: 44242.6, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 2353315840. Throughput: 0: 44090.4. Samples: 2256183960. Policy #0 lag: (min: 0.0, avg: 12.1, max: 23.0) [2024-06-28 04:00:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:00:17,288][06909] Updated weights for policy 0, policy_version 143643 (0.0019) [2024-06-28 04:00:18,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2353528832. Throughput: 0: 44189.4. Samples: 2256457060. Policy #0 lag: (min: 0.0, avg: 12.1, max: 23.0) [2024-06-28 04:00:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:00:19,805][06909] Updated weights for policy 0, policy_version 143653 (0.0033) [2024-06-28 04:00:23,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44509.9, 300 sec: 44097.9). Total num frames: 2353758208. Throughput: 0: 44103.5. Samples: 2256718660. Policy #0 lag: (min: 0.0, avg: 12.1, max: 23.0) [2024-06-28 04:00:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:00:24,670][06909] Updated weights for policy 0, policy_version 143663 (0.0028) [2024-06-28 04:00:27,405][06909] Updated weights for policy 0, policy_version 143673 (0.0047) [2024-06-28 04:00:28,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2353971200. Throughput: 0: 44266.3. Samples: 2256849660. Policy #0 lag: (min: 0.0, avg: 12.1, max: 23.0) [2024-06-28 04:00:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:00:31,911][06909] Updated weights for policy 0, policy_version 143683 (0.0032) [2024-06-28 04:00:33,852][06674] Fps is (10 sec: 42589.8, 60 sec: 43689.2, 300 sec: 43986.6). Total num frames: 2354184192. Throughput: 0: 44070.1. Samples: 2257113940. Policy #0 lag: (min: 0.0, avg: 12.1, max: 23.0) [2024-06-28 04:00:33,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:00:34,777][06909] Updated weights for policy 0, policy_version 143693 (0.0031) [2024-06-28 04:00:38,850][06674] Fps is (10 sec: 42597.8, 60 sec: 44236.7, 300 sec: 43987.8). Total num frames: 2354397184. Throughput: 0: 44203.0. Samples: 2257378620. Policy #0 lag: (min: 0.0, avg: 12.1, max: 23.0) [2024-06-28 04:00:38,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:00:39,116][06909] Updated weights for policy 0, policy_version 143703 (0.0029) [2024-06-28 04:00:42,464][06909] Updated weights for policy 0, policy_version 143713 (0.0036) [2024-06-28 04:00:43,850][06674] Fps is (10 sec: 44245.9, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2354626560. Throughput: 0: 44144.9. Samples: 2257509440. Policy #0 lag: (min: 0.0, avg: 12.1, max: 23.0) [2024-06-28 04:00:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 04:00:46,782][06909] Updated weights for policy 0, policy_version 143723 (0.0028) [2024-06-28 04:00:48,852][06674] Fps is (10 sec: 47504.5, 60 sec: 44235.4, 300 sec: 44042.1). Total num frames: 2354872320. Throughput: 0: 44152.5. Samples: 2257774060. Policy #0 lag: (min: 0.0, avg: 12.1, max: 23.0) [2024-06-28 04:00:48,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:00:49,742][06909] Updated weights for policy 0, policy_version 143733 (0.0028) [2024-06-28 04:00:53,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2355068928. Throughput: 0: 44003.5. Samples: 2258040260. Policy #0 lag: (min: 0.0, avg: 12.1, max: 23.0) [2024-06-28 04:00:53,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:00:54,414][06909] Updated weights for policy 0, policy_version 143743 (0.0032) [2024-06-28 04:00:57,520][06909] Updated weights for policy 0, policy_version 143753 (0.0030) [2024-06-28 04:00:58,850][06674] Fps is (10 sec: 42607.0, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2355298304. Throughput: 0: 44097.2. Samples: 2258168340. Policy #0 lag: (min: 0.0, avg: 12.1, max: 23.0) [2024-06-28 04:00:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:01:01,606][06909] Updated weights for policy 0, policy_version 143763 (0.0027) [2024-06-28 04:01:03,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43691.5, 300 sec: 43986.9). Total num frames: 2355494912. Throughput: 0: 43963.9. Samples: 2258435440. Policy #0 lag: (min: 0.0, avg: 12.1, max: 23.0) [2024-06-28 04:01:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 04:01:04,818][06909] Updated weights for policy 0, policy_version 143773 (0.0049) [2024-06-28 04:01:08,850][06674] Fps is (10 sec: 42598.2, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2355724288. Throughput: 0: 43962.2. Samples: 2258696960. Policy #0 lag: (min: 0.0, avg: 12.1, max: 23.0) [2024-06-28 04:01:08,854][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:01:09,010][06909] Updated weights for policy 0, policy_version 143783 (0.0031) [2024-06-28 04:01:12,297][06909] Updated weights for policy 0, policy_version 143793 (0.0049) [2024-06-28 04:01:13,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.6, 300 sec: 44097.9). Total num frames: 2355953664. Throughput: 0: 43872.3. Samples: 2258823920. Policy #0 lag: (min: 0.0, avg: 12.1, max: 23.0) [2024-06-28 04:01:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:01:16,301][06909] Updated weights for policy 0, policy_version 143803 (0.0027) [2024-06-28 04:01:18,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44236.8, 300 sec: 44042.6). Total num frames: 2356183040. Throughput: 0: 43868.2. Samples: 2259087920. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 04:01:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:01:19,915][06909] Updated weights for policy 0, policy_version 143813 (0.0027) [2024-06-28 04:01:23,850][06674] Fps is (10 sec: 42596.3, 60 sec: 43690.2, 300 sec: 43986.8). Total num frames: 2356379648. Throughput: 0: 43856.0. Samples: 2259352160. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 04:01:23,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:01:24,123][06909] Updated weights for policy 0, policy_version 143823 (0.0035) [2024-06-28 04:01:27,073][06909] Updated weights for policy 0, policy_version 143833 (0.0027) [2024-06-28 04:01:28,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 2356609024. Throughput: 0: 43832.3. Samples: 2259481900. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 04:01:28,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:01:31,640][06909] Updated weights for policy 0, policy_version 143843 (0.0034) [2024-06-28 04:01:33,852][06674] Fps is (10 sec: 45868.6, 60 sec: 44236.8, 300 sec: 44042.1). Total num frames: 2356838400. Throughput: 0: 43956.4. Samples: 2259752100. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 04:01:33,852][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 04:01:34,723][06909] Updated weights for policy 0, policy_version 143853 (0.0038) [2024-06-28 04:01:38,759][06909] Updated weights for policy 0, policy_version 143863 (0.0041) [2024-06-28 04:01:38,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2357051392. Throughput: 0: 43932.9. Samples: 2260017240. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 04:01:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:01:41,709][06887] Signal inference workers to stop experience collection... (32200 times) [2024-06-28 04:01:41,762][06909] InferenceWorker_p0-w0: stopping experience collection (32200 times) [2024-06-28 04:01:41,763][06887] Signal inference workers to resume experience collection... (32200 times) [2024-06-28 04:01:41,778][06909] InferenceWorker_p0-w0: resuming experience collection (32200 times) [2024-06-28 04:01:41,903][06909] Updated weights for policy 0, policy_version 143873 (0.0028) [2024-06-28 04:01:43,850][06674] Fps is (10 sec: 44245.7, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2357280768. Throughput: 0: 43964.9. Samples: 2260146760. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 04:01:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:01:46,046][06909] Updated weights for policy 0, policy_version 143883 (0.0039) [2024-06-28 04:01:48,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43965.3, 300 sec: 44098.0). Total num frames: 2357510144. Throughput: 0: 44080.5. Samples: 2260419060. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 04:01:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:01:48,955][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000143892_2357526528.pth... [2024-06-28 04:01:49,018][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000143245_2346926080.pth [2024-06-28 04:01:49,364][06909] Updated weights for policy 0, policy_version 143893 (0.0032) [2024-06-28 04:01:53,372][06909] Updated weights for policy 0, policy_version 143903 (0.0027) [2024-06-28 04:01:53,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2357723136. Throughput: 0: 44103.6. Samples: 2260681620. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 04:01:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:01:56,781][06909] Updated weights for policy 0, policy_version 143913 (0.0036) [2024-06-28 04:01:58,856][06674] Fps is (10 sec: 42572.5, 60 sec: 43959.3, 300 sec: 44041.5). Total num frames: 2357936128. Throughput: 0: 44235.9. Samples: 2260814800. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 04:01:58,856][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:02:00,893][06909] Updated weights for policy 0, policy_version 143923 (0.0032) [2024-06-28 04:02:03,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44783.0, 300 sec: 44098.0). Total num frames: 2358181888. Throughput: 0: 44336.4. Samples: 2261083060. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 04:02:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:02:04,241][06909] Updated weights for policy 0, policy_version 143933 (0.0028) [2024-06-28 04:02:08,343][06909] Updated weights for policy 0, policy_version 143943 (0.0025) [2024-06-28 04:02:08,850][06674] Fps is (10 sec: 44263.8, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 2358378496. Throughput: 0: 44365.5. Samples: 2261348580. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 04:02:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:02:11,509][06909] Updated weights for policy 0, policy_version 143953 (0.0025) [2024-06-28 04:02:13,850][06674] Fps is (10 sec: 42598.1, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2358607872. Throughput: 0: 44448.5. Samples: 2261482080. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 04:02:13,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:02:15,564][06909] Updated weights for policy 0, policy_version 143963 (0.0030) [2024-06-28 04:02:18,820][06909] Updated weights for policy 0, policy_version 143973 (0.0025) [2024-06-28 04:02:18,850][06674] Fps is (10 sec: 47513.3, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 2358853632. Throughput: 0: 44563.8. Samples: 2261757380. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 04:02:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:02:22,935][06909] Updated weights for policy 0, policy_version 143983 (0.0027) [2024-06-28 04:02:23,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44237.2, 300 sec: 44042.4). Total num frames: 2359033856. Throughput: 0: 44207.6. Samples: 2262006580. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 04:02:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:02:26,449][06909] Updated weights for policy 0, policy_version 143993 (0.0039) [2024-06-28 04:02:28,850][06674] Fps is (10 sec: 40959.7, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2359263232. Throughput: 0: 44264.8. Samples: 2262138680. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 04:02:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:02:30,460][06909] Updated weights for policy 0, policy_version 144003 (0.0030) [2024-06-28 04:02:33,689][06909] Updated weights for policy 0, policy_version 144013 (0.0036) [2024-06-28 04:02:33,850][06674] Fps is (10 sec: 47513.2, 60 sec: 44511.3, 300 sec: 44209.0). Total num frames: 2359508992. Throughput: 0: 44253.2. Samples: 2262410460. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 04:02:33,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:02:37,704][06909] Updated weights for policy 0, policy_version 144023 (0.0038) [2024-06-28 04:02:38,850][06674] Fps is (10 sec: 44237.4, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 2359705600. Throughput: 0: 44408.5. Samples: 2262680000. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 04:02:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:02:41,212][06909] Updated weights for policy 0, policy_version 144033 (0.0036) [2024-06-28 04:02:43,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2359934976. Throughput: 0: 44248.1. Samples: 2262805700. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 04:02:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:02:45,212][06909] Updated weights for policy 0, policy_version 144043 (0.0030) [2024-06-28 04:02:48,409][06909] Updated weights for policy 0, policy_version 144053 (0.0024) [2024-06-28 04:02:48,850][06674] Fps is (10 sec: 47513.2, 60 sec: 44509.8, 300 sec: 44264.5). Total num frames: 2360180736. Throughput: 0: 44486.2. Samples: 2263084940. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 04:02:48,853][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:02:52,431][06909] Updated weights for policy 0, policy_version 144063 (0.0022) [2024-06-28 04:02:53,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2360377344. Throughput: 0: 44393.7. Samples: 2263346300. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 04:02:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:02:56,070][06909] Updated weights for policy 0, policy_version 144073 (0.0043) [2024-06-28 04:02:58,850][06674] Fps is (10 sec: 40960.2, 60 sec: 44241.3, 300 sec: 44097.9). Total num frames: 2360590336. Throughput: 0: 44236.9. Samples: 2263472740. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 04:02:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:02:59,859][06909] Updated weights for policy 0, policy_version 144083 (0.0033) [2024-06-28 04:03:03,470][06909] Updated weights for policy 0, policy_version 144093 (0.0028) [2024-06-28 04:03:03,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 2360836096. Throughput: 0: 44057.3. Samples: 2263739960. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 04:03:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:03:07,256][06909] Updated weights for policy 0, policy_version 144103 (0.0034) [2024-06-28 04:03:08,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2361032704. Throughput: 0: 44391.0. Samples: 2264004180. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 04:03:08,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 04:03:10,962][06909] Updated weights for policy 0, policy_version 144113 (0.0036) [2024-06-28 04:03:13,850][06674] Fps is (10 sec: 40959.2, 60 sec: 43963.6, 300 sec: 44097.9). Total num frames: 2361245696. Throughput: 0: 44432.3. Samples: 2264138140. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 04:03:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 04:03:14,763][06909] Updated weights for policy 0, policy_version 144123 (0.0030) [2024-06-28 04:03:17,411][06887] Signal inference workers to stop experience collection... (32250 times) [2024-06-28 04:03:17,464][06887] Signal inference workers to resume experience collection... (32250 times) [2024-06-28 04:03:17,465][06909] InferenceWorker_p0-w0: stopping experience collection (32250 times) [2024-06-28 04:03:17,480][06909] InferenceWorker_p0-w0: resuming experience collection (32250 times) [2024-06-28 04:03:18,418][06909] Updated weights for policy 0, policy_version 144133 (0.0041) [2024-06-28 04:03:18,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 2361491456. Throughput: 0: 44292.0. Samples: 2264403600. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 04:03:18,861][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:03:22,403][06909] Updated weights for policy 0, policy_version 144143 (0.0028) [2024-06-28 04:03:23,850][06674] Fps is (10 sec: 45876.3, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 2361704448. Throughput: 0: 44082.7. Samples: 2264663720. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 04:03:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:03:25,728][06909] Updated weights for policy 0, policy_version 144153 (0.0033) [2024-06-28 04:03:28,852][06674] Fps is (10 sec: 42590.3, 60 sec: 44235.4, 300 sec: 44153.2). Total num frames: 2361917440. Throughput: 0: 44287.9. Samples: 2264798740. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 04:03:28,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:03:29,660][06909] Updated weights for policy 0, policy_version 144163 (0.0030) [2024-06-28 04:03:33,246][06909] Updated weights for policy 0, policy_version 144173 (0.0038) [2024-06-28 04:03:33,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 2362163200. Throughput: 0: 43960.4. Samples: 2265063160. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 04:03:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:03:37,117][06909] Updated weights for policy 0, policy_version 144183 (0.0032) [2024-06-28 04:03:38,850][06674] Fps is (10 sec: 44245.9, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2362359808. Throughput: 0: 44016.0. Samples: 2265327020. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 04:03:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:03:40,615][06909] Updated weights for policy 0, policy_version 144193 (0.0041) [2024-06-28 04:03:43,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2362572800. Throughput: 0: 43990.3. Samples: 2265452300. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 04:03:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:03:44,647][06909] Updated weights for policy 0, policy_version 144203 (0.0034) [2024-06-28 04:03:47,933][06909] Updated weights for policy 0, policy_version 144213 (0.0019) [2024-06-28 04:03:48,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 2362802176. Throughput: 0: 44152.0. Samples: 2265726800. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 04:03:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:03:48,868][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000144214_2362802176.pth... [2024-06-28 04:03:48,943][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000143568_2352218112.pth [2024-06-28 04:03:52,074][06909] Updated weights for policy 0, policy_version 144223 (0.0034) [2024-06-28 04:03:53,850][06674] Fps is (10 sec: 45874.6, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2363031552. Throughput: 0: 44130.7. Samples: 2265990060. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 04:03:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:03:55,433][06909] Updated weights for policy 0, policy_version 144233 (0.0039) [2024-06-28 04:03:58,852][06674] Fps is (10 sec: 44227.7, 60 sec: 44235.3, 300 sec: 44097.6). Total num frames: 2363244544. Throughput: 0: 44052.0. Samples: 2266120560. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 04:03:58,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:03:59,431][06909] Updated weights for policy 0, policy_version 144243 (0.0032) [2024-06-28 04:04:02,712][06909] Updated weights for policy 0, policy_version 144253 (0.0031) [2024-06-28 04:04:03,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2363473920. Throughput: 0: 44151.7. Samples: 2266390420. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 04:04:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:04:06,653][06909] Updated weights for policy 0, policy_version 144263 (0.0036) [2024-06-28 04:04:08,850][06674] Fps is (10 sec: 44245.3, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2363686912. Throughput: 0: 44296.3. Samples: 2266657060. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 04:04:08,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:04:10,319][06909] Updated weights for policy 0, policy_version 144273 (0.0039) [2024-06-28 04:04:13,850][06674] Fps is (10 sec: 42598.0, 60 sec: 44236.9, 300 sec: 44097.9). Total num frames: 2363899904. Throughput: 0: 44177.9. Samples: 2266786660. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 04:04:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:04:14,356][06909] Updated weights for policy 0, policy_version 144283 (0.0024) [2024-06-28 04:04:17,545][06909] Updated weights for policy 0, policy_version 144293 (0.0049) [2024-06-28 04:04:18,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.8, 300 sec: 44209.0). Total num frames: 2364129280. Throughput: 0: 44079.6. Samples: 2267046740. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 04:04:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:04:21,592][06909] Updated weights for policy 0, policy_version 144303 (0.0034) [2024-06-28 04:04:23,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2364342272. Throughput: 0: 44013.7. Samples: 2267307640. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 04:04:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:04:25,375][06909] Updated weights for policy 0, policy_version 144313 (0.0033) [2024-06-28 04:04:28,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43965.2, 300 sec: 44042.4). Total num frames: 2364555264. Throughput: 0: 44377.7. Samples: 2267449300. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 04:04:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:04:29,282][06909] Updated weights for policy 0, policy_version 144323 (0.0025) [2024-06-28 04:04:32,571][06909] Updated weights for policy 0, policy_version 144333 (0.0032) [2024-06-28 04:04:33,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.7, 300 sec: 44209.0). Total num frames: 2364784640. Throughput: 0: 44100.5. Samples: 2267711320. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 04:04:33,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 04:04:36,660][06909] Updated weights for policy 0, policy_version 144343 (0.0033) [2024-06-28 04:04:38,850][06674] Fps is (10 sec: 47513.4, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 2365030400. Throughput: 0: 44132.0. Samples: 2267976000. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 04:04:38,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:04:39,782][06909] Updated weights for policy 0, policy_version 144353 (0.0029) [2024-06-28 04:04:43,852][06674] Fps is (10 sec: 44227.9, 60 sec: 44235.3, 300 sec: 44097.7). Total num frames: 2365227008. Throughput: 0: 44162.7. Samples: 2268107880. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 04:04:43,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:04:44,103][06909] Updated weights for policy 0, policy_version 144363 (0.0033) [2024-06-28 04:04:47,626][06909] Updated weights for policy 0, policy_version 144373 (0.0042) [2024-06-28 04:04:48,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2365440000. Throughput: 0: 43990.7. Samples: 2268370000. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 04:04:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:04:51,581][06909] Updated weights for policy 0, policy_version 144383 (0.0035) [2024-06-28 04:04:52,111][06887] Signal inference workers to stop experience collection... (32300 times) [2024-06-28 04:04:52,116][06887] Signal inference workers to resume experience collection... (32300 times) [2024-06-28 04:04:52,145][06909] InferenceWorker_p0-w0: stopping experience collection (32300 times) [2024-06-28 04:04:52,145][06909] InferenceWorker_p0-w0: resuming experience collection (32300 times) [2024-06-28 04:04:53,850][06674] Fps is (10 sec: 44245.7, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2365669376. Throughput: 0: 43851.3. Samples: 2268630360. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 04:04:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:04:54,915][06909] Updated weights for policy 0, policy_version 144393 (0.0031) [2024-06-28 04:04:58,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43965.2, 300 sec: 44098.1). Total num frames: 2365882368. Throughput: 0: 44082.7. Samples: 2268770380. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 04:04:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:04:58,934][06909] Updated weights for policy 0, policy_version 144403 (0.0028) [2024-06-28 04:05:02,275][06909] Updated weights for policy 0, policy_version 144413 (0.0034) [2024-06-28 04:05:03,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 2366095360. Throughput: 0: 43964.1. Samples: 2269025120. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 04:05:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:05:06,501][06909] Updated weights for policy 0, policy_version 144423 (0.0035) [2024-06-28 04:05:08,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2366341120. Throughput: 0: 44112.4. Samples: 2269292700. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 04:05:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 04:05:10,033][06909] Updated weights for policy 0, policy_version 144433 (0.0032) [2024-06-28 04:05:13,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2366537728. Throughput: 0: 43883.2. Samples: 2269424040. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 04:05:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:05:13,871][06909] Updated weights for policy 0, policy_version 144443 (0.0030) [2024-06-28 04:05:17,265][06909] Updated weights for policy 0, policy_version 144453 (0.0044) [2024-06-28 04:05:18,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2366767104. Throughput: 0: 43959.1. Samples: 2269689480. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 04:05:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:05:21,174][06909] Updated weights for policy 0, policy_version 144463 (0.0031) [2024-06-28 04:05:23,850][06674] Fps is (10 sec: 47513.3, 60 sec: 44509.9, 300 sec: 44209.0). Total num frames: 2367012864. Throughput: 0: 44005.4. Samples: 2269956240. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 04:05:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:05:24,528][06909] Updated weights for policy 0, policy_version 144473 (0.0033) [2024-06-28 04:05:28,802][06909] Updated weights for policy 0, policy_version 144483 (0.0029) [2024-06-28 04:05:28,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.8, 300 sec: 44153.8). Total num frames: 2367209472. Throughput: 0: 44078.3. Samples: 2270091320. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 04:05:28,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:05:32,259][06909] Updated weights for policy 0, policy_version 144493 (0.0041) [2024-06-28 04:05:33,850][06674] Fps is (10 sec: 42597.9, 60 sec: 44236.7, 300 sec: 44209.0). Total num frames: 2367438848. Throughput: 0: 43935.8. Samples: 2270347120. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 04:05:33,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:05:36,201][06909] Updated weights for policy 0, policy_version 144503 (0.0033) [2024-06-28 04:05:38,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 2367651840. Throughput: 0: 44114.7. Samples: 2270615520. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 04:05:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:05:39,929][06909] Updated weights for policy 0, policy_version 144513 (0.0035) [2024-06-28 04:05:43,588][06909] Updated weights for policy 0, policy_version 144523 (0.0034) [2024-06-28 04:05:43,850][06674] Fps is (10 sec: 42599.2, 60 sec: 43965.2, 300 sec: 44042.7). Total num frames: 2367864832. Throughput: 0: 43929.0. Samples: 2270747180. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 04:05:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:05:47,105][06909] Updated weights for policy 0, policy_version 144533 (0.0029) [2024-06-28 04:05:48,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2368094208. Throughput: 0: 44205.8. Samples: 2271014380. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 04:05:48,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:05:48,912][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000144538_2368110592.pth... [2024-06-28 04:05:48,978][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000143892_2357526528.pth [2024-06-28 04:05:50,982][06909] Updated weights for policy 0, policy_version 144543 (0.0032) [2024-06-28 04:05:53,851][06674] Fps is (10 sec: 45867.6, 60 sec: 44235.6, 300 sec: 44153.3). Total num frames: 2368323584. Throughput: 0: 44198.0. Samples: 2271281680. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 04:05:53,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:05:54,341][06909] Updated weights for policy 0, policy_version 144553 (0.0029) [2024-06-28 04:05:58,251][06909] Updated weights for policy 0, policy_version 144563 (0.0022) [2024-06-28 04:05:58,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 2368536576. Throughput: 0: 44299.6. Samples: 2271417520. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 04:05:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:06:01,607][06909] Updated weights for policy 0, policy_version 144573 (0.0031) [2024-06-28 04:06:03,856][06674] Fps is (10 sec: 42579.6, 60 sec: 44232.3, 300 sec: 44152.6). Total num frames: 2368749568. Throughput: 0: 44287.0. Samples: 2271682660. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 04:06:03,856][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:06:05,584][06909] Updated weights for policy 0, policy_version 144583 (0.0032) [2024-06-28 04:06:08,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2368978944. Throughput: 0: 44028.9. Samples: 2271937540. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 04:06:08,853][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:06:09,371][06909] Updated weights for policy 0, policy_version 144593 (0.0040) [2024-06-28 04:06:13,154][06909] Updated weights for policy 0, policy_version 144603 (0.0032) [2024-06-28 04:06:13,850][06674] Fps is (10 sec: 44263.6, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2369191936. Throughput: 0: 44096.5. Samples: 2272075660. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 04:06:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:06:17,246][06909] Updated weights for policy 0, policy_version 144613 (0.0027) [2024-06-28 04:06:18,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.8, 300 sec: 44153.6). Total num frames: 2369404928. Throughput: 0: 44101.9. Samples: 2272331700. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 04:06:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:06:20,870][06909] Updated weights for policy 0, policy_version 144623 (0.0039) [2024-06-28 04:06:23,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 2369634304. Throughput: 0: 44058.3. Samples: 2272598140. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 04:06:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:06:24,447][06909] Updated weights for policy 0, policy_version 144633 (0.0025) [2024-06-28 04:06:28,096][06909] Updated weights for policy 0, policy_version 144643 (0.0027) [2024-06-28 04:06:28,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43963.7, 300 sec: 44098.2). Total num frames: 2369847296. Throughput: 0: 44030.5. Samples: 2272728560. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 04:06:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:06:32,098][06909] Updated weights for policy 0, policy_version 144653 (0.0038) [2024-06-28 04:06:33,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2370076672. Throughput: 0: 44170.2. Samples: 2273002040. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-28 04:06:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 04:06:33,878][06887] Signal inference workers to stop experience collection... (32350 times) [2024-06-28 04:06:33,936][06909] InferenceWorker_p0-w0: stopping experience collection (32350 times) [2024-06-28 04:06:33,990][06887] Signal inference workers to resume experience collection... (32350 times) [2024-06-28 04:06:33,990][06909] InferenceWorker_p0-w0: resuming experience collection (32350 times) [2024-06-28 04:06:35,313][06909] Updated weights for policy 0, policy_version 144663 (0.0025) [2024-06-28 04:06:38,850][06674] Fps is (10 sec: 45876.0, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2370306048. Throughput: 0: 44071.0. Samples: 2273264800. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-28 04:06:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:06:39,453][06909] Updated weights for policy 0, policy_version 144673 (0.0030) [2024-06-28 04:06:42,625][06909] Updated weights for policy 0, policy_version 144683 (0.0025) [2024-06-28 04:06:43,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2370519040. Throughput: 0: 44077.8. Samples: 2273401020. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-28 04:06:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:06:47,018][06909] Updated weights for policy 0, policy_version 144693 (0.0035) [2024-06-28 04:06:48,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2370748416. Throughput: 0: 44050.8. Samples: 2273664680. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-28 04:06:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:06:49,914][06909] Updated weights for policy 0, policy_version 144703 (0.0039) [2024-06-28 04:06:53,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43691.8, 300 sec: 44098.9). Total num frames: 2370945024. Throughput: 0: 44095.1. Samples: 2273921820. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-28 04:06:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:06:54,467][06909] Updated weights for policy 0, policy_version 144713 (0.0035) [2024-06-28 04:06:57,984][06909] Updated weights for policy 0, policy_version 144723 (0.0025) [2024-06-28 04:06:58,852][06674] Fps is (10 sec: 42589.4, 60 sec: 43962.2, 300 sec: 44042.1). Total num frames: 2371174400. Throughput: 0: 43938.0. Samples: 2274052960. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-28 04:06:58,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:07:01,843][06909] Updated weights for policy 0, policy_version 144733 (0.0041) [2024-06-28 04:07:03,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44241.2, 300 sec: 44153.5). Total num frames: 2371403776. Throughput: 0: 44238.6. Samples: 2274322440. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-28 04:07:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:07:05,215][06909] Updated weights for policy 0, policy_version 144743 (0.0024) [2024-06-28 04:07:08,850][06674] Fps is (10 sec: 44245.9, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2371616768. Throughput: 0: 44191.1. Samples: 2274586740. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-28 04:07:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 04:07:09,324][06909] Updated weights for policy 0, policy_version 144753 (0.0031) [2024-06-28 04:07:12,364][06909] Updated weights for policy 0, policy_version 144763 (0.0031) [2024-06-28 04:07:13,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2371846144. Throughput: 0: 44351.1. Samples: 2274724360. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-28 04:07:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:07:17,093][06909] Updated weights for policy 0, policy_version 144773 (0.0033) [2024-06-28 04:07:18,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2372042752. Throughput: 0: 44242.6. Samples: 2274992960. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-28 04:07:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:07:19,642][06909] Updated weights for policy 0, policy_version 144783 (0.0030) [2024-06-28 04:07:23,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2372272128. Throughput: 0: 43859.1. Samples: 2275238460. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-28 04:07:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:07:24,356][06909] Updated weights for policy 0, policy_version 144793 (0.0043) [2024-06-28 04:07:27,364][06909] Updated weights for policy 0, policy_version 144803 (0.0039) [2024-06-28 04:07:28,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 2372517888. Throughput: 0: 43884.8. Samples: 2275375840. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-28 04:07:28,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:07:31,681][06909] Updated weights for policy 0, policy_version 144813 (0.0027) [2024-06-28 04:07:33,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2372714496. Throughput: 0: 43921.8. Samples: 2275641160. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-28 04:07:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:07:34,833][06909] Updated weights for policy 0, policy_version 144823 (0.0041) [2024-06-28 04:07:38,853][06674] Fps is (10 sec: 40945.4, 60 sec: 43688.0, 300 sec: 44041.9). Total num frames: 2372927488. Throughput: 0: 43900.1. Samples: 2275897480. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 04:07:38,854][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:07:39,104][06909] Updated weights for policy 0, policy_version 144833 (0.0033) [2024-06-28 04:07:42,104][06909] Updated weights for policy 0, policy_version 144843 (0.0040) [2024-06-28 04:07:43,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2373173248. Throughput: 0: 44049.5. Samples: 2276035100. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 04:07:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:07:46,262][06909] Updated weights for policy 0, policy_version 144853 (0.0020) [2024-06-28 04:07:48,850][06674] Fps is (10 sec: 45892.0, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2373386240. Throughput: 0: 44228.1. Samples: 2276312700. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 04:07:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:07:48,956][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000144861_2373402624.pth... [2024-06-28 04:07:49,014][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000144214_2362802176.pth [2024-06-28 04:07:49,446][06909] Updated weights for policy 0, policy_version 144863 (0.0030) [2024-06-28 04:07:53,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2373582848. Throughput: 0: 44057.8. Samples: 2276569340. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 04:07:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:07:53,934][06909] Updated weights for policy 0, policy_version 144873 (0.0031) [2024-06-28 04:07:56,955][06909] Updated weights for policy 0, policy_version 144883 (0.0022) [2024-06-28 04:07:58,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44238.3, 300 sec: 44042.4). Total num frames: 2373828608. Throughput: 0: 43897.4. Samples: 2276699740. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 04:07:58,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:08:01,739][06909] Updated weights for policy 0, policy_version 144893 (0.0035) [2024-06-28 04:08:03,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2374025216. Throughput: 0: 43700.4. Samples: 2276959480. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 04:08:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:08:04,715][06909] Updated weights for policy 0, policy_version 144903 (0.0033) [2024-06-28 04:08:05,836][06887] Signal inference workers to stop experience collection... (32400 times) [2024-06-28 04:08:05,837][06887] Signal inference workers to resume experience collection... (32400 times) [2024-06-28 04:08:05,877][06909] InferenceWorker_p0-w0: stopping experience collection (32400 times) [2024-06-28 04:08:05,878][06909] InferenceWorker_p0-w0: resuming experience collection (32400 times) [2024-06-28 04:08:08,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2374238208. Throughput: 0: 44056.0. Samples: 2277220980. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 04:08:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:08:08,906][06909] Updated weights for policy 0, policy_version 144913 (0.0035) [2024-06-28 04:08:12,207][06909] Updated weights for policy 0, policy_version 144923 (0.0039) [2024-06-28 04:08:13,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2374483968. Throughput: 0: 43969.4. Samples: 2277354460. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 04:08:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:08:16,363][06909] Updated weights for policy 0, policy_version 144933 (0.0026) [2024-06-28 04:08:18,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2374696960. Throughput: 0: 44033.7. Samples: 2277622680. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 04:08:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:08:19,614][06909] Updated weights for policy 0, policy_version 144943 (0.0025) [2024-06-28 04:08:23,572][06909] Updated weights for policy 0, policy_version 144953 (0.0027) [2024-06-28 04:08:23,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43963.6, 300 sec: 44042.7). Total num frames: 2374909952. Throughput: 0: 44248.4. Samples: 2277888500. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 04:08:23,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:08:26,805][06909] Updated weights for policy 0, policy_version 144963 (0.0022) [2024-06-28 04:08:28,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2375155712. Throughput: 0: 44090.6. Samples: 2278019180. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 04:08:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:08:30,877][06909] Updated weights for policy 0, policy_version 144973 (0.0029) [2024-06-28 04:08:33,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2375352320. Throughput: 0: 43814.1. Samples: 2278284340. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 04:08:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:08:34,450][06909] Updated weights for policy 0, policy_version 144983 (0.0028) [2024-06-28 04:08:38,850][06674] Fps is (10 sec: 39321.6, 60 sec: 43693.2, 300 sec: 43986.9). Total num frames: 2375548928. Throughput: 0: 43737.2. Samples: 2278537520. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 04:08:38,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:08:39,023][06909] Updated weights for policy 0, policy_version 144993 (0.0033) [2024-06-28 04:08:42,201][06909] Updated weights for policy 0, policy_version 145003 (0.0036) [2024-06-28 04:08:43,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2375811072. Throughput: 0: 43897.7. Samples: 2278675140. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 04:08:43,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:08:46,163][06909] Updated weights for policy 0, policy_version 145013 (0.0046) [2024-06-28 04:08:48,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 2376007680. Throughput: 0: 43971.6. Samples: 2278938200. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 04:08:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:08:49,738][06909] Updated weights for policy 0, policy_version 145023 (0.0037) [2024-06-28 04:08:53,354][06909] Updated weights for policy 0, policy_version 145033 (0.0032) [2024-06-28 04:08:53,850][06674] Fps is (10 sec: 40960.6, 60 sec: 43963.7, 300 sec: 43987.2). Total num frames: 2376220672. Throughput: 0: 44053.3. Samples: 2279203380. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 04:08:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:08:56,960][06909] Updated weights for policy 0, policy_version 145043 (0.0031) [2024-06-28 04:08:58,850][06674] Fps is (10 sec: 47513.9, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2376482816. Throughput: 0: 44219.6. Samples: 2279344340. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 04:08:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:09:00,670][06909] Updated weights for policy 0, policy_version 145053 (0.0031) [2024-06-28 04:09:03,852][06674] Fps is (10 sec: 44227.8, 60 sec: 43962.3, 300 sec: 43986.6). Total num frames: 2376663040. Throughput: 0: 44225.6. Samples: 2279612920. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 04:09:03,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:09:04,361][06909] Updated weights for policy 0, policy_version 145063 (0.0040) [2024-06-28 04:09:07,986][06909] Updated weights for policy 0, policy_version 145073 (0.0036) [2024-06-28 04:09:08,850][06674] Fps is (10 sec: 40960.0, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2376892416. Throughput: 0: 44180.1. Samples: 2279876600. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 04:09:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:09:11,757][06909] Updated weights for policy 0, policy_version 145083 (0.0041) [2024-06-28 04:09:13,850][06674] Fps is (10 sec: 49161.4, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 2377154560. Throughput: 0: 44113.7. Samples: 2280004300. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 04:09:13,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 04:09:15,689][06909] Updated weights for policy 0, policy_version 145093 (0.0034) [2024-06-28 04:09:18,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2377334784. Throughput: 0: 44123.2. Samples: 2280269880. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 04:09:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:09:19,153][06909] Updated weights for policy 0, policy_version 145103 (0.0037) [2024-06-28 04:09:22,845][06909] Updated weights for policy 0, policy_version 145113 (0.0035) [2024-06-28 04:09:23,850][06674] Fps is (10 sec: 40960.3, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2377564160. Throughput: 0: 44449.4. Samples: 2280537740. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 04:09:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:09:26,749][06909] Updated weights for policy 0, policy_version 145123 (0.0027) [2024-06-28 04:09:28,850][06674] Fps is (10 sec: 47513.2, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2377809920. Throughput: 0: 44330.3. Samples: 2280670000. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 04:09:28,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 04:09:30,482][06909] Updated weights for policy 0, policy_version 145133 (0.0036) [2024-06-28 04:09:33,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 2377990144. Throughput: 0: 44241.7. Samples: 2280929080. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 04:09:33,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:09:34,102][06909] Updated weights for policy 0, policy_version 145143 (0.0034) [2024-06-28 04:09:37,759][06909] Updated weights for policy 0, policy_version 145153 (0.0036) [2024-06-28 04:09:38,850][06674] Fps is (10 sec: 40960.0, 60 sec: 44509.9, 300 sec: 44042.7). Total num frames: 2378219520. Throughput: 0: 44297.2. Samples: 2281196760. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 04:09:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 04:09:41,754][06909] Updated weights for policy 0, policy_version 145163 (0.0035) [2024-06-28 04:09:43,412][06887] Signal inference workers to stop experience collection... (32450 times) [2024-06-28 04:09:43,412][06887] Signal inference workers to resume experience collection... (32450 times) [2024-06-28 04:09:43,435][06909] InferenceWorker_p0-w0: stopping experience collection (32450 times) [2024-06-28 04:09:43,435][06909] InferenceWorker_p0-w0: resuming experience collection (32450 times) [2024-06-28 04:09:43,850][06674] Fps is (10 sec: 47514.4, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 2378465280. Throughput: 0: 44029.8. Samples: 2281325680. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2024-06-28 04:09:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:09:45,259][06909] Updated weights for policy 0, policy_version 145173 (0.0027) [2024-06-28 04:09:48,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2378661888. Throughput: 0: 43970.9. Samples: 2281591520. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2024-06-28 04:09:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:09:48,879][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000145182_2378661888.pth... [2024-06-28 04:09:48,927][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000144538_2368110592.pth [2024-06-28 04:09:49,255][06909] Updated weights for policy 0, policy_version 145183 (0.0037) [2024-06-28 04:09:52,740][06909] Updated weights for policy 0, policy_version 145193 (0.0030) [2024-06-28 04:09:53,850][06674] Fps is (10 sec: 42598.0, 60 sec: 44509.8, 300 sec: 44097.9). Total num frames: 2378891264. Throughput: 0: 43787.0. Samples: 2281847020. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2024-06-28 04:09:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:09:56,534][06909] Updated weights for policy 0, policy_version 145203 (0.0026) [2024-06-28 04:09:58,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2379120640. Throughput: 0: 43950.7. Samples: 2281982080. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2024-06-28 04:09:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:10:00,010][06909] Updated weights for policy 0, policy_version 145213 (0.0031) [2024-06-28 04:10:03,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43965.2, 300 sec: 43931.4). Total num frames: 2379300864. Throughput: 0: 43902.2. Samples: 2282245480. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2024-06-28 04:10:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:10:04,167][06909] Updated weights for policy 0, policy_version 145223 (0.0032) [2024-06-28 04:10:07,414][06909] Updated weights for policy 0, policy_version 145233 (0.0030) [2024-06-28 04:10:08,850][06674] Fps is (10 sec: 42598.8, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2379546624. Throughput: 0: 43897.4. Samples: 2282513120. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2024-06-28 04:10:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:10:11,577][06909] Updated weights for policy 0, policy_version 145243 (0.0037) [2024-06-28 04:10:13,850][06674] Fps is (10 sec: 47513.4, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 2379776000. Throughput: 0: 44085.4. Samples: 2282653840. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2024-06-28 04:10:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:10:14,845][06909] Updated weights for policy 0, policy_version 145253 (0.0027) [2024-06-28 04:10:18,813][06909] Updated weights for policy 0, policy_version 145263 (0.0047) [2024-06-28 04:10:18,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2379988992. Throughput: 0: 44040.1. Samples: 2282910880. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2024-06-28 04:10:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:10:22,442][06909] Updated weights for policy 0, policy_version 145273 (0.0022) [2024-06-28 04:10:23,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2380218368. Throughput: 0: 43961.9. Samples: 2283175040. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2024-06-28 04:10:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:10:26,336][06909] Updated weights for policy 0, policy_version 145283 (0.0030) [2024-06-28 04:10:28,856][06674] Fps is (10 sec: 44209.8, 60 sec: 43686.3, 300 sec: 44041.5). Total num frames: 2380431360. Throughput: 0: 43974.9. Samples: 2283304820. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2024-06-28 04:10:28,856][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:10:29,921][06909] Updated weights for policy 0, policy_version 145293 (0.0036) [2024-06-28 04:10:33,848][06909] Updated weights for policy 0, policy_version 145303 (0.0029) [2024-06-28 04:10:33,850][06674] Fps is (10 sec: 42598.1, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 2380644352. Throughput: 0: 43830.2. Samples: 2283563880. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2024-06-28 04:10:33,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:10:37,379][06909] Updated weights for policy 0, policy_version 145313 (0.0041) [2024-06-28 04:10:38,850][06674] Fps is (10 sec: 42623.9, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2380857344. Throughput: 0: 44057.3. Samples: 2283829600. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2024-06-28 04:10:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:10:41,209][06909] Updated weights for policy 0, policy_version 145323 (0.0029) [2024-06-28 04:10:43,850][06674] Fps is (10 sec: 42597.7, 60 sec: 43417.4, 300 sec: 43986.8). Total num frames: 2381070336. Throughput: 0: 44010.1. Samples: 2283962540. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2024-06-28 04:10:43,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:10:44,790][06909] Updated weights for policy 0, policy_version 145333 (0.0040) [2024-06-28 04:10:48,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.6, 300 sec: 43931.6). Total num frames: 2381283328. Throughput: 0: 43878.6. Samples: 2284220020. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 04:10:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:10:48,874][06909] Updated weights for policy 0, policy_version 145343 (0.0034) [2024-06-28 04:10:52,287][06909] Updated weights for policy 0, policy_version 145353 (0.0037) [2024-06-28 04:10:53,850][06674] Fps is (10 sec: 45876.3, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2381529088. Throughput: 0: 43885.3. Samples: 2284487960. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 04:10:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:10:56,023][06909] Updated weights for policy 0, policy_version 145363 (0.0024) [2024-06-28 04:10:58,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43417.7, 300 sec: 43987.8). Total num frames: 2381725696. Throughput: 0: 43710.7. Samples: 2284620820. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 04:10:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:10:59,820][06909] Updated weights for policy 0, policy_version 145373 (0.0044) [2024-06-28 04:11:03,472][06909] Updated weights for policy 0, policy_version 145383 (0.0036) [2024-06-28 04:11:03,850][06674] Fps is (10 sec: 42597.9, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 2381955072. Throughput: 0: 43663.4. Samples: 2284875740. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 04:11:03,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:11:07,555][06909] Updated weights for policy 0, policy_version 145393 (0.0041) [2024-06-28 04:11:08,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 2382168064. Throughput: 0: 43721.2. Samples: 2285142500. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 04:11:08,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 04:11:11,023][06909] Updated weights for policy 0, policy_version 145403 (0.0036) [2024-06-28 04:11:13,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43417.6, 300 sec: 43986.9). Total num frames: 2382381056. Throughput: 0: 43759.6. Samples: 2285273740. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 04:11:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:11:14,762][06909] Updated weights for policy 0, policy_version 145413 (0.0038) [2024-06-28 04:11:18,662][06909] Updated weights for policy 0, policy_version 145423 (0.0045) [2024-06-28 04:11:18,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 2382610432. Throughput: 0: 43895.5. Samples: 2285539180. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 04:11:18,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:11:22,483][06909] Updated weights for policy 0, policy_version 145433 (0.0031) [2024-06-28 04:11:23,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2382839808. Throughput: 0: 43856.5. Samples: 2285803140. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 04:11:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 04:11:25,832][06909] Updated weights for policy 0, policy_version 145443 (0.0024) [2024-06-28 04:11:28,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43695.1, 300 sec: 43986.9). Total num frames: 2383052800. Throughput: 0: 44019.7. Samples: 2285943420. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 04:11:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:11:29,973][06909] Updated weights for policy 0, policy_version 145453 (0.0028) [2024-06-28 04:11:30,689][06887] Signal inference workers to stop experience collection... (32500 times) [2024-06-28 04:11:30,690][06887] Signal inference workers to resume experience collection... (32500 times) [2024-06-28 04:11:30,709][06909] InferenceWorker_p0-w0: stopping experience collection (32500 times) [2024-06-28 04:11:30,709][06909] InferenceWorker_p0-w0: resuming experience collection (32500 times) [2024-06-28 04:11:32,993][06909] Updated weights for policy 0, policy_version 145463 (0.0025) [2024-06-28 04:11:33,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2383282176. Throughput: 0: 44110.3. Samples: 2286204980. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 04:11:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 04:11:37,213][06909] Updated weights for policy 0, policy_version 145473 (0.0034) [2024-06-28 04:11:38,850][06674] Fps is (10 sec: 45875.7, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 2383511552. Throughput: 0: 43944.9. Samples: 2286465480. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 04:11:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:11:40,275][06909] Updated weights for policy 0, policy_version 145483 (0.0038) [2024-06-28 04:11:43,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 2383708160. Throughput: 0: 43994.1. Samples: 2286600560. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 04:11:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:11:44,407][06909] Updated weights for policy 0, policy_version 145493 (0.0030) [2024-06-28 04:11:48,044][06909] Updated weights for policy 0, policy_version 145503 (0.0032) [2024-06-28 04:11:48,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 2383953920. Throughput: 0: 44117.0. Samples: 2286861000. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-28 04:11:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:11:48,857][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000145505_2383953920.pth... [2024-06-28 04:11:48,907][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000144861_2373402624.pth [2024-06-28 04:11:52,106][06909] Updated weights for policy 0, policy_version 145513 (0.0024) [2024-06-28 04:11:53,852][06674] Fps is (10 sec: 45866.3, 60 sec: 43962.2, 300 sec: 44042.4). Total num frames: 2384166912. Throughput: 0: 44020.8. Samples: 2287123520. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-28 04:11:53,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:11:55,492][06909] Updated weights for policy 0, policy_version 145523 (0.0026) [2024-06-28 04:11:58,850][06674] Fps is (10 sec: 42598.1, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 2384379904. Throughput: 0: 44218.7. Samples: 2287263580. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-28 04:11:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:11:59,360][06909] Updated weights for policy 0, policy_version 145533 (0.0038) [2024-06-28 04:12:03,246][06909] Updated weights for policy 0, policy_version 145543 (0.0030) [2024-06-28 04:12:03,850][06674] Fps is (10 sec: 42607.2, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2384592896. Throughput: 0: 43993.0. Samples: 2287518860. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-28 04:12:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:12:07,017][06909] Updated weights for policy 0, policy_version 145553 (0.0030) [2024-06-28 04:12:08,850][06674] Fps is (10 sec: 44235.8, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 2384822272. Throughput: 0: 43953.2. Samples: 2287781040. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-28 04:12:08,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:12:10,467][06909] Updated weights for policy 0, policy_version 145563 (0.0029) [2024-06-28 04:12:13,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 2385035264. Throughput: 0: 43874.3. Samples: 2287917760. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-28 04:12:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:12:14,293][06909] Updated weights for policy 0, policy_version 145573 (0.0028) [2024-06-28 04:12:17,790][06909] Updated weights for policy 0, policy_version 145583 (0.0024) [2024-06-28 04:12:18,850][06674] Fps is (10 sec: 44237.9, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2385264640. Throughput: 0: 43872.9. Samples: 2288179260. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-28 04:12:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:12:21,550][06909] Updated weights for policy 0, policy_version 145593 (0.0025) [2024-06-28 04:12:23,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.8, 300 sec: 43875.8). Total num frames: 2385461248. Throughput: 0: 43963.1. Samples: 2288443820. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-28 04:12:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 04:12:25,282][06909] Updated weights for policy 0, policy_version 145603 (0.0027) [2024-06-28 04:12:28,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2385707008. Throughput: 0: 43925.4. Samples: 2288577200. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-28 04:12:28,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 04:12:29,006][06909] Updated weights for policy 0, policy_version 145613 (0.0033) [2024-06-28 04:12:32,611][06909] Updated weights for policy 0, policy_version 145623 (0.0037) [2024-06-28 04:12:33,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.7, 300 sec: 44043.0). Total num frames: 2385920000. Throughput: 0: 44030.7. Samples: 2288842380. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-28 04:12:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:12:36,638][06909] Updated weights for policy 0, policy_version 145633 (0.0038) [2024-06-28 04:12:38,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2386149376. Throughput: 0: 44094.9. Samples: 2289107700. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-28 04:12:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:12:40,164][06909] Updated weights for policy 0, policy_version 145643 (0.0025) [2024-06-28 04:12:43,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 2386362368. Throughput: 0: 43944.5. Samples: 2289241080. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-28 04:12:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:12:43,952][06909] Updated weights for policy 0, policy_version 145653 (0.0032) [2024-06-28 04:12:47,312][06909] Updated weights for policy 0, policy_version 145663 (0.0032) [2024-06-28 04:12:48,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2386575360. Throughput: 0: 44017.7. Samples: 2289499660. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-28 04:12:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:12:51,549][06909] Updated weights for policy 0, policy_version 145673 (0.0030) [2024-06-28 04:12:53,778][06887] Signal inference workers to stop experience collection... (32550 times) [2024-06-28 04:12:53,779][06887] Signal inference workers to resume experience collection... (32550 times) [2024-06-28 04:12:53,798][06909] InferenceWorker_p0-w0: stopping experience collection (32550 times) [2024-06-28 04:12:53,798][06909] InferenceWorker_p0-w0: resuming experience collection (32550 times) [2024-06-28 04:12:53,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43965.3, 300 sec: 43986.9). Total num frames: 2386804736. Throughput: 0: 44225.6. Samples: 2289771180. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 04:12:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:12:54,514][06909] Updated weights for policy 0, policy_version 145683 (0.0029) [2024-06-28 04:12:58,703][06909] Updated weights for policy 0, policy_version 145693 (0.0030) [2024-06-28 04:12:58,852][06674] Fps is (10 sec: 45866.3, 60 sec: 44235.3, 300 sec: 44097.7). Total num frames: 2387034112. Throughput: 0: 43984.6. Samples: 2289897160. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 04:12:58,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:13:02,555][06909] Updated weights for policy 0, policy_version 145703 (0.0025) [2024-06-28 04:13:03,850][06674] Fps is (10 sec: 44236.1, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2387247104. Throughput: 0: 44233.2. Samples: 2290169760. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 04:13:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:13:06,023][06909] Updated weights for policy 0, policy_version 145713 (0.0031) [2024-06-28 04:13:08,850][06674] Fps is (10 sec: 42606.9, 60 sec: 43963.9, 300 sec: 43986.9). Total num frames: 2387460096. Throughput: 0: 44203.0. Samples: 2290432960. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 04:13:08,859][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:13:10,391][06909] Updated weights for policy 0, policy_version 145723 (0.0030) [2024-06-28 04:13:13,747][06909] Updated weights for policy 0, policy_version 145733 (0.0027) [2024-06-28 04:13:13,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2387689472. Throughput: 0: 43958.1. Samples: 2290555320. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 04:13:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:13:17,751][06909] Updated weights for policy 0, policy_version 145743 (0.0042) [2024-06-28 04:13:18,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2387902464. Throughput: 0: 44016.8. Samples: 2290823140. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 04:13:18,860][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:13:21,413][06909] Updated weights for policy 0, policy_version 145753 (0.0021) [2024-06-28 04:13:23,850][06674] Fps is (10 sec: 44237.3, 60 sec: 44509.8, 300 sec: 43986.9). Total num frames: 2388131840. Throughput: 0: 43935.1. Samples: 2291084780. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 04:13:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:13:25,049][06909] Updated weights for policy 0, policy_version 145763 (0.0032) [2024-06-28 04:13:28,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2388328448. Throughput: 0: 43956.4. Samples: 2291219120. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 04:13:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:13:28,903][06909] Updated weights for policy 0, policy_version 145773 (0.0029) [2024-06-28 04:13:32,629][06909] Updated weights for policy 0, policy_version 145783 (0.0026) [2024-06-28 04:13:33,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2388557824. Throughput: 0: 43944.9. Samples: 2291477180. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 04:13:33,862][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:13:36,213][06909] Updated weights for policy 0, policy_version 145793 (0.0028) [2024-06-28 04:13:38,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2388787200. Throughput: 0: 43913.2. Samples: 2291747280. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 04:13:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 04:13:40,048][06909] Updated weights for policy 0, policy_version 145803 (0.0040) [2024-06-28 04:13:43,573][06909] Updated weights for policy 0, policy_version 145813 (0.0033) [2024-06-28 04:13:43,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2389000192. Throughput: 0: 44022.5. Samples: 2291878080. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 04:13:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:13:47,754][06909] Updated weights for policy 0, policy_version 145823 (0.0032) [2024-06-28 04:13:48,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2389213184. Throughput: 0: 43873.9. Samples: 2292144080. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 04:13:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 04:13:48,960][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000145827_2389229568.pth... [2024-06-28 04:13:49,004][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000145182_2378661888.pth [2024-06-28 04:13:51,220][06909] Updated weights for policy 0, policy_version 145833 (0.0042) [2024-06-28 04:13:53,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 2389442560. Throughput: 0: 43727.2. Samples: 2292400680. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 04:13:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:13:55,185][06909] Updated weights for policy 0, policy_version 145843 (0.0041) [2024-06-28 04:13:58,852][06674] Fps is (10 sec: 42589.3, 60 sec: 43417.6, 300 sec: 43986.9). Total num frames: 2389639168. Throughput: 0: 43834.1. Samples: 2292527940. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 04:13:58,853][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:13:58,976][06909] Updated weights for policy 0, policy_version 145853 (0.0029) [2024-06-28 04:14:02,479][06909] Updated weights for policy 0, policy_version 145863 (0.0028) [2024-06-28 04:14:03,852][06674] Fps is (10 sec: 44227.5, 60 sec: 43962.3, 300 sec: 44042.1). Total num frames: 2389884928. Throughput: 0: 43793.6. Samples: 2292793940. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 04:14:03,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:14:06,285][06909] Updated weights for policy 0, policy_version 145873 (0.0032) [2024-06-28 04:14:08,850][06674] Fps is (10 sec: 45884.4, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 2390097920. Throughput: 0: 43826.2. Samples: 2293056960. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 04:14:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:14:10,087][06909] Updated weights for policy 0, policy_version 145883 (0.0033) [2024-06-28 04:14:13,512][06909] Updated weights for policy 0, policy_version 145893 (0.0047) [2024-06-28 04:14:13,833][06887] Signal inference workers to stop experience collection... (32600 times) [2024-06-28 04:14:13,833][06887] Signal inference workers to resume experience collection... (32600 times) [2024-06-28 04:14:13,843][06909] InferenceWorker_p0-w0: stopping experience collection (32600 times) [2024-06-28 04:14:13,843][06909] InferenceWorker_p0-w0: resuming experience collection (32600 times) [2024-06-28 04:14:13,850][06674] Fps is (10 sec: 44246.2, 60 sec: 43963.9, 300 sec: 44042.4). Total num frames: 2390327296. Throughput: 0: 43897.4. Samples: 2293194500. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 04:14:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 04:14:17,214][06909] Updated weights for policy 0, policy_version 145903 (0.0033) [2024-06-28 04:14:18,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2390540288. Throughput: 0: 44059.6. Samples: 2293459860. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 04:14:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:14:20,731][06909] Updated weights for policy 0, policy_version 145913 (0.0032) [2024-06-28 04:14:23,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 2390753280. Throughput: 0: 43979.1. Samples: 2293726340. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 04:14:23,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:14:24,888][06909] Updated weights for policy 0, policy_version 145923 (0.0034) [2024-06-28 04:14:28,170][06909] Updated weights for policy 0, policy_version 145933 (0.0033) [2024-06-28 04:14:28,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 2390966272. Throughput: 0: 44007.4. Samples: 2293858420. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 04:14:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:14:32,346][06909] Updated weights for policy 0, policy_version 145943 (0.0022) [2024-06-28 04:14:33,852][06674] Fps is (10 sec: 44228.1, 60 sec: 43962.3, 300 sec: 43986.6). Total num frames: 2391195648. Throughput: 0: 44033.5. Samples: 2294125680. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 04:14:33,852][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 04:14:36,239][06909] Updated weights for policy 0, policy_version 145953 (0.0025) [2024-06-28 04:14:38,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 2391425024. Throughput: 0: 43954.6. Samples: 2294378640. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 04:14:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:14:39,776][06909] Updated weights for policy 0, policy_version 145963 (0.0041) [2024-06-28 04:14:43,491][06909] Updated weights for policy 0, policy_version 145973 (0.0026) [2024-06-28 04:14:43,850][06674] Fps is (10 sec: 45884.0, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2391654400. Throughput: 0: 44214.8. Samples: 2294517520. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 04:14:43,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:14:47,064][06909] Updated weights for policy 0, policy_version 145983 (0.0028) [2024-06-28 04:14:48,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 2391851008. Throughput: 0: 44359.8. Samples: 2294790040. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 04:14:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:14:50,702][06909] Updated weights for policy 0, policy_version 145993 (0.0034) [2024-06-28 04:14:53,856][06674] Fps is (10 sec: 40935.7, 60 sec: 43686.3, 300 sec: 43874.9). Total num frames: 2392064000. Throughput: 0: 44242.6. Samples: 2295048140. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 04:14:53,857][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 04:14:54,728][06909] Updated weights for policy 0, policy_version 146003 (0.0024) [2024-06-28 04:14:57,988][06909] Updated weights for policy 0, policy_version 146013 (0.0026) [2024-06-28 04:14:58,850][06674] Fps is (10 sec: 47513.0, 60 sec: 44784.4, 300 sec: 44153.5). Total num frames: 2392326144. Throughput: 0: 44167.9. Samples: 2295182060. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 04:14:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:15:01,983][06909] Updated weights for policy 0, policy_version 146023 (0.0029) [2024-06-28 04:15:03,852][06674] Fps is (10 sec: 45893.6, 60 sec: 43963.8, 300 sec: 43986.6). Total num frames: 2392522752. Throughput: 0: 44254.9. Samples: 2295451420. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-28 04:15:03,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:15:05,233][06909] Updated weights for policy 0, policy_version 146033 (0.0026) [2024-06-28 04:15:08,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 2392735744. Throughput: 0: 44100.6. Samples: 2295710860. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-28 04:15:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:15:09,314][06909] Updated weights for policy 0, policy_version 146043 (0.0028) [2024-06-28 04:15:12,984][06909] Updated weights for policy 0, policy_version 146053 (0.0041) [2024-06-28 04:15:13,850][06674] Fps is (10 sec: 45884.4, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2392981504. Throughput: 0: 44231.6. Samples: 2295848840. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-28 04:15:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:15:16,610][06909] Updated weights for policy 0, policy_version 146063 (0.0032) [2024-06-28 04:15:18,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 2393178112. Throughput: 0: 44069.0. Samples: 2296108700. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-28 04:15:18,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:15:20,362][06909] Updated weights for policy 0, policy_version 146073 (0.0027) [2024-06-28 04:15:23,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.8, 300 sec: 43987.8). Total num frames: 2393407488. Throughput: 0: 44320.4. Samples: 2296373060. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-28 04:15:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:15:24,170][06909] Updated weights for policy 0, policy_version 146083 (0.0031) [2024-06-28 04:15:27,845][06909] Updated weights for policy 0, policy_version 146093 (0.0040) [2024-06-28 04:15:28,850][06674] Fps is (10 sec: 45876.0, 60 sec: 44510.0, 300 sec: 44042.4). Total num frames: 2393636864. Throughput: 0: 44131.3. Samples: 2296503420. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-28 04:15:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:15:31,550][06909] Updated weights for policy 0, policy_version 146103 (0.0038) [2024-06-28 04:15:31,886][06887] Signal inference workers to stop experience collection... (32650 times) [2024-06-28 04:15:31,886][06887] Signal inference workers to resume experience collection... (32650 times) [2024-06-28 04:15:31,925][06909] InferenceWorker_p0-w0: stopping experience collection (32650 times) [2024-06-28 04:15:31,926][06909] InferenceWorker_p0-w0: resuming experience collection (32650 times) [2024-06-28 04:15:33,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44511.4, 300 sec: 44098.0). Total num frames: 2393866240. Throughput: 0: 43944.4. Samples: 2296767540. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-28 04:15:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:15:35,055][06909] Updated weights for policy 0, policy_version 146113 (0.0037) [2024-06-28 04:15:38,850][06674] Fps is (10 sec: 42597.5, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 2394062848. Throughput: 0: 44067.5. Samples: 2297030920. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-28 04:15:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:15:39,022][06909] Updated weights for policy 0, policy_version 146123 (0.0038) [2024-06-28 04:15:42,718][06909] Updated weights for policy 0, policy_version 146133 (0.0037) [2024-06-28 04:15:43,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2394308608. Throughput: 0: 44073.8. Samples: 2297165380. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-28 04:15:43,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:15:46,458][06909] Updated weights for policy 0, policy_version 146143 (0.0036) [2024-06-28 04:15:48,850][06674] Fps is (10 sec: 44237.7, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2394505216. Throughput: 0: 43988.3. Samples: 2297430800. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-28 04:15:48,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 04:15:48,873][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000146149_2394505216.pth... [2024-06-28 04:15:48,920][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000145505_2383953920.pth [2024-06-28 04:15:50,435][06909] Updated weights for policy 0, policy_version 146153 (0.0028) [2024-06-28 04:15:53,850][06674] Fps is (10 sec: 40960.5, 60 sec: 44241.3, 300 sec: 44042.4). Total num frames: 2394718208. Throughput: 0: 43929.8. Samples: 2297687700. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-28 04:15:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:15:53,919][06909] Updated weights for policy 0, policy_version 146163 (0.0036) [2024-06-28 04:15:57,568][06909] Updated weights for policy 0, policy_version 146173 (0.0024) [2024-06-28 04:15:58,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2394963968. Throughput: 0: 43963.9. Samples: 2297827220. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-28 04:15:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:16:00,971][06909] Updated weights for policy 0, policy_version 146183 (0.0031) [2024-06-28 04:16:03,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44238.3, 300 sec: 44098.0). Total num frames: 2395176960. Throughput: 0: 44270.8. Samples: 2298100880. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 04:16:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:16:04,948][06909] Updated weights for policy 0, policy_version 146193 (0.0028) [2024-06-28 04:16:08,672][06909] Updated weights for policy 0, policy_version 146203 (0.0039) [2024-06-28 04:16:08,850][06674] Fps is (10 sec: 42598.8, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2395389952. Throughput: 0: 44188.5. Samples: 2298361540. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 04:16:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:16:12,159][06909] Updated weights for policy 0, policy_version 146213 (0.0041) [2024-06-28 04:16:13,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2395619328. Throughput: 0: 44312.3. Samples: 2298497480. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 04:16:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:16:16,275][06909] Updated weights for policy 0, policy_version 146223 (0.0022) [2024-06-28 04:16:18,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 2395832320. Throughput: 0: 44115.1. Samples: 2298752720. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 04:16:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:16:19,848][06909] Updated weights for policy 0, policy_version 146233 (0.0041) [2024-06-28 04:16:23,560][06909] Updated weights for policy 0, policy_version 146243 (0.0024) [2024-06-28 04:16:23,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2396045312. Throughput: 0: 44192.6. Samples: 2299019580. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 04:16:23,859][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 04:16:27,465][06909] Updated weights for policy 0, policy_version 146253 (0.0027) [2024-06-28 04:16:28,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2396274688. Throughput: 0: 44078.8. Samples: 2299148920. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 04:16:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:16:30,856][06909] Updated weights for policy 0, policy_version 146263 (0.0029) [2024-06-28 04:16:33,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 2396487680. Throughput: 0: 44187.5. Samples: 2299419240. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 04:16:33,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 04:16:34,740][06909] Updated weights for policy 0, policy_version 146273 (0.0037) [2024-06-28 04:16:38,213][06909] Updated weights for policy 0, policy_version 146283 (0.0032) [2024-06-28 04:16:38,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44237.0, 300 sec: 44098.0). Total num frames: 2396717056. Throughput: 0: 44218.7. Samples: 2299677540. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 04:16:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 04:16:42,101][06909] Updated weights for policy 0, policy_version 146293 (0.0035) [2024-06-28 04:16:43,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43690.6, 300 sec: 43986.8). Total num frames: 2396930048. Throughput: 0: 44131.0. Samples: 2299813120. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 04:16:43,851][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 04:16:45,872][06909] Updated weights for policy 0, policy_version 146303 (0.0031) [2024-06-28 04:16:48,850][06674] Fps is (10 sec: 44236.0, 60 sec: 44236.7, 300 sec: 44042.7). Total num frames: 2397159424. Throughput: 0: 43812.8. Samples: 2300072460. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 04:16:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:16:49,753][06909] Updated weights for policy 0, policy_version 146313 (0.0032) [2024-06-28 04:16:53,400][06909] Updated weights for policy 0, policy_version 146323 (0.0027) [2024-06-28 04:16:53,850][06674] Fps is (10 sec: 44237.9, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2397372416. Throughput: 0: 43976.5. Samples: 2300340480. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 04:16:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:16:54,398][06887] Signal inference workers to stop experience collection... (32700 times) [2024-06-28 04:16:54,432][06909] InferenceWorker_p0-w0: stopping experience collection (32700 times) [2024-06-28 04:16:54,455][06887] Signal inference workers to resume experience collection... (32700 times) [2024-06-28 04:16:54,456][06909] InferenceWorker_p0-w0: resuming experience collection (32700 times) [2024-06-28 04:16:57,395][06909] Updated weights for policy 0, policy_version 146333 (0.0028) [2024-06-28 04:16:58,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2397585408. Throughput: 0: 43881.7. Samples: 2300472160. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 04:16:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:17:00,684][06909] Updated weights for policy 0, policy_version 146343 (0.0031) [2024-06-28 04:17:03,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2397798400. Throughput: 0: 44104.9. Samples: 2300737440. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 04:17:03,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:17:04,639][06909] Updated weights for policy 0, policy_version 146353 (0.0033) [2024-06-28 04:17:07,944][06909] Updated weights for policy 0, policy_version 146363 (0.0034) [2024-06-28 04:17:08,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2398027776. Throughput: 0: 44070.2. Samples: 2301002740. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 04:17:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 04:17:11,913][06909] Updated weights for policy 0, policy_version 146373 (0.0032) [2024-06-28 04:17:13,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2398257152. Throughput: 0: 44132.9. Samples: 2301134900. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 04:17:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:17:15,183][06909] Updated weights for policy 0, policy_version 146383 (0.0033) [2024-06-28 04:17:18,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2398470144. Throughput: 0: 44056.5. Samples: 2301401780. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 04:17:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:17:19,082][06909] Updated weights for policy 0, policy_version 146393 (0.0031) [2024-06-28 04:17:23,080][06909] Updated weights for policy 0, policy_version 146403 (0.0035) [2024-06-28 04:17:23,850][06674] Fps is (10 sec: 44236.2, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2398699520. Throughput: 0: 44182.0. Samples: 2301665740. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 04:17:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:17:26,729][06909] Updated weights for policy 0, policy_version 146413 (0.0039) [2024-06-28 04:17:28,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2398912512. Throughput: 0: 44028.2. Samples: 2301794380. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 04:17:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:17:30,376][06909] Updated weights for policy 0, policy_version 146423 (0.0042) [2024-06-28 04:17:33,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2399125504. Throughput: 0: 44167.2. Samples: 2302059980. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 04:17:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:17:34,313][06909] Updated weights for policy 0, policy_version 146433 (0.0037) [2024-06-28 04:17:38,013][06909] Updated weights for policy 0, policy_version 146443 (0.0039) [2024-06-28 04:17:38,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2399354880. Throughput: 0: 44052.0. Samples: 2302322820. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 04:17:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:17:41,815][06909] Updated weights for policy 0, policy_version 146453 (0.0036) [2024-06-28 04:17:43,852][06674] Fps is (10 sec: 45865.7, 60 sec: 44235.4, 300 sec: 44097.7). Total num frames: 2399584256. Throughput: 0: 44014.0. Samples: 2302452880. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 04:17:43,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:17:45,616][06909] Updated weights for policy 0, policy_version 146463 (0.0041) [2024-06-28 04:17:48,852][06674] Fps is (10 sec: 44227.5, 60 sec: 43962.3, 300 sec: 44042.1). Total num frames: 2399797248. Throughput: 0: 43913.5. Samples: 2302713640. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 04:17:48,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:17:48,863][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000146472_2399797248.pth... [2024-06-28 04:17:48,923][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000145827_2389229568.pth [2024-06-28 04:17:49,073][06909] Updated weights for policy 0, policy_version 146473 (0.0037) [2024-06-28 04:17:52,901][06909] Updated weights for policy 0, policy_version 146483 (0.0037) [2024-06-28 04:17:53,850][06674] Fps is (10 sec: 42607.5, 60 sec: 43963.7, 300 sec: 43987.2). Total num frames: 2400010240. Throughput: 0: 43935.7. Samples: 2302979840. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 04:17:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:17:56,294][06909] Updated weights for policy 0, policy_version 146493 (0.0028) [2024-06-28 04:17:58,850][06674] Fps is (10 sec: 40968.4, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 2400206848. Throughput: 0: 43689.3. Samples: 2303100920. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 04:17:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:18:00,713][06909] Updated weights for policy 0, policy_version 146503 (0.0033) [2024-06-28 04:18:03,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2400436224. Throughput: 0: 43727.0. Samples: 2303369500. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 04:18:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:18:04,073][06909] Updated weights for policy 0, policy_version 146513 (0.0029) [2024-06-28 04:18:08,012][06909] Updated weights for policy 0, policy_version 146523 (0.0027) [2024-06-28 04:18:08,850][06674] Fps is (10 sec: 45875.8, 60 sec: 43963.9, 300 sec: 43986.9). Total num frames: 2400665600. Throughput: 0: 43843.3. Samples: 2303638680. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 04:18:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:18:11,576][06909] Updated weights for policy 0, policy_version 146533 (0.0032) [2024-06-28 04:18:13,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2400894976. Throughput: 0: 43757.3. Samples: 2303763460. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 04:18:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:18:15,561][06909] Updated weights for policy 0, policy_version 146543 (0.0026) [2024-06-28 04:18:18,850][06674] Fps is (10 sec: 44235.7, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 2401107968. Throughput: 0: 43713.2. Samples: 2304027080. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 04:18:18,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 04:18:19,020][06909] Updated weights for policy 0, policy_version 146553 (0.0039) [2024-06-28 04:18:22,999][06909] Updated weights for policy 0, policy_version 146563 (0.0029) [2024-06-28 04:18:23,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2401320960. Throughput: 0: 43870.5. Samples: 2304297000. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 04:18:23,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:18:26,196][06909] Updated weights for policy 0, policy_version 146573 (0.0023) [2024-06-28 04:18:28,852][06674] Fps is (10 sec: 44228.3, 60 sec: 43962.2, 300 sec: 44042.1). Total num frames: 2401550336. Throughput: 0: 43840.0. Samples: 2304425680. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 04:18:28,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:18:30,554][06909] Updated weights for policy 0, policy_version 146583 (0.0042) [2024-06-28 04:18:33,582][06909] Updated weights for policy 0, policy_version 146593 (0.0034) [2024-06-28 04:18:33,850][06674] Fps is (10 sec: 45875.8, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2401779712. Throughput: 0: 43863.4. Samples: 2304687400. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 04:18:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:18:37,905][06909] Updated weights for policy 0, policy_version 146603 (0.0030) [2024-06-28 04:18:38,850][06674] Fps is (10 sec: 42607.5, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2401976320. Throughput: 0: 43960.9. Samples: 2304958080. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 04:18:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:18:41,152][06909] Updated weights for policy 0, policy_version 146613 (0.0040) [2024-06-28 04:18:43,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43692.2, 300 sec: 44042.4). Total num frames: 2402205696. Throughput: 0: 44108.5. Samples: 2305085800. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 04:18:43,856][06674] Avg episode reward: [(0, '0.418')] [2024-06-28 04:18:45,571][06909] Updated weights for policy 0, policy_version 146623 (0.0035) [2024-06-28 04:18:45,741][06887] Signal inference workers to stop experience collection... (32750 times) [2024-06-28 04:18:45,787][06909] InferenceWorker_p0-w0: stopping experience collection (32750 times) [2024-06-28 04:18:45,794][06887] Signal inference workers to resume experience collection... (32750 times) [2024-06-28 04:18:45,804][06909] InferenceWorker_p0-w0: resuming experience collection (32750 times) [2024-06-28 04:18:48,555][06909] Updated weights for policy 0, policy_version 146633 (0.0023) [2024-06-28 04:18:48,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43965.2, 300 sec: 44042.4). Total num frames: 2402435072. Throughput: 0: 43904.0. Samples: 2305345180. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 04:18:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 04:18:52,901][06909] Updated weights for policy 0, policy_version 146643 (0.0035) [2024-06-28 04:18:53,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 44098.3). Total num frames: 2402648064. Throughput: 0: 43926.1. Samples: 2305615360. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 04:18:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:18:56,047][06909] Updated weights for policy 0, policy_version 146653 (0.0034) [2024-06-28 04:18:58,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44236.8, 300 sec: 43987.2). Total num frames: 2402861056. Throughput: 0: 43913.8. Samples: 2305739580. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 04:18:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 04:19:00,036][06909] Updated weights for policy 0, policy_version 146663 (0.0021) [2024-06-28 04:19:03,306][06909] Updated weights for policy 0, policy_version 146673 (0.0035) [2024-06-28 04:19:03,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 2403106816. Throughput: 0: 44020.5. Samples: 2306008000. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 04:19:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 04:19:07,810][06909] Updated weights for policy 0, policy_version 146683 (0.0030) [2024-06-28 04:19:08,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2403303424. Throughput: 0: 44011.7. Samples: 2306277520. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 04:19:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:19:10,847][06909] Updated weights for policy 0, policy_version 146693 (0.0027) [2024-06-28 04:19:13,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2403532800. Throughput: 0: 43896.2. Samples: 2306400920. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 04:19:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:19:15,027][06909] Updated weights for policy 0, policy_version 146703 (0.0034) [2024-06-28 04:19:18,326][06909] Updated weights for policy 0, policy_version 146713 (0.0031) [2024-06-28 04:19:18,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44510.0, 300 sec: 44153.5). Total num frames: 2403778560. Throughput: 0: 44046.2. Samples: 2306669480. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 04:19:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:19:22,639][06909] Updated weights for policy 0, policy_version 146723 (0.0026) [2024-06-28 04:19:23,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2403958784. Throughput: 0: 43957.7. Samples: 2306936180. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 04:19:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 04:19:25,634][06909] Updated weights for policy 0, policy_version 146733 (0.0038) [2024-06-28 04:19:28,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43965.3, 300 sec: 44042.7). Total num frames: 2404188160. Throughput: 0: 43893.0. Samples: 2307060980. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 04:19:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:19:30,053][06909] Updated weights for policy 0, policy_version 146743 (0.0027) [2024-06-28 04:19:32,956][06909] Updated weights for policy 0, policy_version 146753 (0.0034) [2024-06-28 04:19:33,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2404417536. Throughput: 0: 44042.7. Samples: 2307327100. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 04:19:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:19:37,330][06909] Updated weights for policy 0, policy_version 146763 (0.0027) [2024-06-28 04:19:38,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2404630528. Throughput: 0: 44242.7. Samples: 2307606280. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 04:19:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:19:40,278][06909] Updated weights for policy 0, policy_version 146773 (0.0036) [2024-06-28 04:19:43,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2404843520. Throughput: 0: 44131.9. Samples: 2307725520. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 04:19:43,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 04:19:44,648][06909] Updated weights for policy 0, policy_version 146783 (0.0045) [2024-06-28 04:19:47,829][06909] Updated weights for policy 0, policy_version 146793 (0.0030) [2024-06-28 04:19:48,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.8, 300 sec: 44154.4). Total num frames: 2405089280. Throughput: 0: 44086.3. Samples: 2307991880. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 04:19:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:19:48,863][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000146795_2405089280.pth... [2024-06-28 04:19:48,919][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000146149_2394505216.pth [2024-06-28 04:19:52,700][06909] Updated weights for policy 0, policy_version 146803 (0.0030) [2024-06-28 04:19:53,850][06674] Fps is (10 sec: 45875.7, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2405302272. Throughput: 0: 44144.0. Samples: 2308264000. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 04:19:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:19:55,285][06909] Updated weights for policy 0, policy_version 146813 (0.0031) [2024-06-28 04:19:58,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43963.7, 300 sec: 43987.2). Total num frames: 2405498880. Throughput: 0: 44292.9. Samples: 2308394100. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 04:19:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:19:59,836][06909] Updated weights for policy 0, policy_version 146823 (0.0041) [2024-06-28 04:20:02,728][06909] Updated weights for policy 0, policy_version 146833 (0.0031) [2024-06-28 04:20:03,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2405744640. Throughput: 0: 44119.9. Samples: 2308654880. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 04:20:03,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:20:07,027][06909] Updated weights for policy 0, policy_version 146843 (0.0042) [2024-06-28 04:20:08,852][06674] Fps is (10 sec: 45865.7, 60 sec: 44235.3, 300 sec: 43986.6). Total num frames: 2405957632. Throughput: 0: 44248.7. Samples: 2308927460. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 04:20:08,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 04:20:09,489][06887] Signal inference workers to stop experience collection... (32800 times) [2024-06-28 04:20:09,519][06909] InferenceWorker_p0-w0: stopping experience collection (32800 times) [2024-06-28 04:20:09,601][06887] Signal inference workers to resume experience collection... (32800 times) [2024-06-28 04:20:09,602][06909] InferenceWorker_p0-w0: resuming experience collection (32800 times) [2024-06-28 04:20:09,972][06909] Updated weights for policy 0, policy_version 146853 (0.0029) [2024-06-28 04:20:13,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2406187008. Throughput: 0: 44360.3. Samples: 2309057200. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 04:20:13,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:20:14,226][06909] Updated weights for policy 0, policy_version 146863 (0.0041) [2024-06-28 04:20:17,277][06909] Updated weights for policy 0, policy_version 146873 (0.0022) [2024-06-28 04:20:18,850][06674] Fps is (10 sec: 44245.6, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2406400000. Throughput: 0: 44209.7. Samples: 2309316540. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2024-06-28 04:20:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:20:21,846][06909] Updated weights for policy 0, policy_version 146883 (0.0037) [2024-06-28 04:20:23,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 2406629376. Throughput: 0: 44089.3. Samples: 2309590300. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2024-06-28 04:20:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:20:24,829][06909] Updated weights for policy 0, policy_version 146893 (0.0027) [2024-06-28 04:20:28,854][06674] Fps is (10 sec: 44220.0, 60 sec: 44233.9, 300 sec: 43986.3). Total num frames: 2406842368. Throughput: 0: 44407.8. Samples: 2309724040. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2024-06-28 04:20:28,854][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:20:29,408][06909] Updated weights for policy 0, policy_version 146903 (0.0032) [2024-06-28 04:20:32,443][06909] Updated weights for policy 0, policy_version 146913 (0.0035) [2024-06-28 04:20:33,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 2407088128. Throughput: 0: 44300.9. Samples: 2309985420. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2024-06-28 04:20:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:20:36,875][06909] Updated weights for policy 0, policy_version 146923 (0.0037) [2024-06-28 04:20:38,850][06674] Fps is (10 sec: 44253.6, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 2407284736. Throughput: 0: 44127.0. Samples: 2310249720. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2024-06-28 04:20:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:20:39,973][06909] Updated weights for policy 0, policy_version 146933 (0.0026) [2024-06-28 04:20:43,850][06674] Fps is (10 sec: 40959.9, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2407497728. Throughput: 0: 44158.2. Samples: 2310381220. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2024-06-28 04:20:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:20:44,075][06909] Updated weights for policy 0, policy_version 146943 (0.0041) [2024-06-28 04:20:47,143][06909] Updated weights for policy 0, policy_version 146953 (0.0026) [2024-06-28 04:20:48,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2407710720. Throughput: 0: 44123.6. Samples: 2310640440. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2024-06-28 04:20:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:20:51,282][06909] Updated weights for policy 0, policy_version 146963 (0.0037) [2024-06-28 04:20:53,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2407956480. Throughput: 0: 44102.9. Samples: 2310912000. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2024-06-28 04:20:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:20:54,410][06909] Updated weights for policy 0, policy_version 146973 (0.0026) [2024-06-28 04:20:58,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 2408153088. Throughput: 0: 44213.3. Samples: 2311046800. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2024-06-28 04:20:58,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:20:59,029][06909] Updated weights for policy 0, policy_version 146983 (0.0039) [2024-06-28 04:21:02,160][06909] Updated weights for policy 0, policy_version 146993 (0.0027) [2024-06-28 04:21:03,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2408382464. Throughput: 0: 44077.4. Samples: 2311300020. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2024-06-28 04:21:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:21:06,246][06909] Updated weights for policy 0, policy_version 147003 (0.0028) [2024-06-28 04:21:08,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44238.3, 300 sec: 44042.4). Total num frames: 2408611840. Throughput: 0: 44014.1. Samples: 2311570940. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2024-06-28 04:21:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:21:09,355][06909] Updated weights for policy 0, policy_version 147013 (0.0032) [2024-06-28 04:21:13,852][06674] Fps is (10 sec: 42589.7, 60 sec: 43689.2, 300 sec: 43986.6). Total num frames: 2408808448. Throughput: 0: 43829.4. Samples: 2311696280. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2024-06-28 04:21:13,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:21:14,046][06909] Updated weights for policy 0, policy_version 147023 (0.0034) [2024-06-28 04:21:16,040][06887] Signal inference workers to stop experience collection... (32850 times) [2024-06-28 04:21:16,045][06887] Signal inference workers to resume experience collection... (32850 times) [2024-06-28 04:21:16,069][06909] InferenceWorker_p0-w0: stopping experience collection (32850 times) [2024-06-28 04:21:16,076][06909] InferenceWorker_p0-w0: resuming experience collection (32850 times) [2024-06-28 04:21:17,211][06909] Updated weights for policy 0, policy_version 147033 (0.0030) [2024-06-28 04:21:18,852][06674] Fps is (10 sec: 44227.7, 60 sec: 44235.3, 300 sec: 44097.6). Total num frames: 2409054208. Throughput: 0: 43917.1. Samples: 2311961780. Policy #0 lag: (min: 0.0, avg: 10.5, max: 24.0) [2024-06-28 04:21:18,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:21:21,286][06909] Updated weights for policy 0, policy_version 147043 (0.0032) [2024-06-28 04:21:23,850][06674] Fps is (10 sec: 47523.1, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2409283584. Throughput: 0: 43947.2. Samples: 2312227340. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 04:21:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:21:24,422][06909] Updated weights for policy 0, policy_version 147053 (0.0028) [2024-06-28 04:21:28,464][06909] Updated weights for policy 0, policy_version 147063 (0.0028) [2024-06-28 04:21:28,850][06674] Fps is (10 sec: 42607.0, 60 sec: 43966.5, 300 sec: 44042.4). Total num frames: 2409480192. Throughput: 0: 44018.2. Samples: 2312362040. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 04:21:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:21:31,657][06909] Updated weights for policy 0, policy_version 147073 (0.0043) [2024-06-28 04:21:33,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43417.6, 300 sec: 43986.9). Total num frames: 2409693184. Throughput: 0: 44122.4. Samples: 2312625940. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 04:21:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:21:35,817][06909] Updated weights for policy 0, policy_version 147083 (0.0035) [2024-06-28 04:21:38,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2409938944. Throughput: 0: 43899.1. Samples: 2312887460. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 04:21:38,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:21:39,337][06909] Updated weights for policy 0, policy_version 147093 (0.0024) [2024-06-28 04:21:43,358][06909] Updated weights for policy 0, policy_version 147103 (0.0042) [2024-06-28 04:21:43,850][06674] Fps is (10 sec: 44235.8, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 2410135552. Throughput: 0: 43826.1. Samples: 2313018980. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 04:21:43,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:21:46,934][06909] Updated weights for policy 0, policy_version 147113 (0.0031) [2024-06-28 04:21:48,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 2410364928. Throughput: 0: 44244.9. Samples: 2313291040. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 04:21:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:21:48,862][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000147118_2410381312.pth... [2024-06-28 04:21:48,910][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000146472_2399797248.pth [2024-06-28 04:21:50,886][06909] Updated weights for policy 0, policy_version 147123 (0.0029) [2024-06-28 04:21:53,850][06674] Fps is (10 sec: 45876.1, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2410594304. Throughput: 0: 44013.9. Samples: 2313551560. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 04:21:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:21:54,132][06909] Updated weights for policy 0, policy_version 147133 (0.0038) [2024-06-28 04:21:58,115][06909] Updated weights for policy 0, policy_version 147143 (0.0028) [2024-06-28 04:21:58,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2410790912. Throughput: 0: 44133.5. Samples: 2313682200. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 04:21:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:22:01,714][06909] Updated weights for policy 0, policy_version 147153 (0.0028) [2024-06-28 04:22:03,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.7, 300 sec: 44098.0). Total num frames: 2411036672. Throughput: 0: 44218.0. Samples: 2313951500. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 04:22:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:22:05,259][06909] Updated weights for policy 0, policy_version 147163 (0.0035) [2024-06-28 04:22:08,835][06909] Updated weights for policy 0, policy_version 147173 (0.0043) [2024-06-28 04:22:08,850][06674] Fps is (10 sec: 49151.8, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 2411282432. Throughput: 0: 44232.4. Samples: 2314217800. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 04:22:08,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:22:12,763][06909] Updated weights for policy 0, policy_version 147183 (0.0039) [2024-06-28 04:22:13,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43965.2, 300 sec: 43986.9). Total num frames: 2411446272. Throughput: 0: 44207.6. Samples: 2314351380. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 04:22:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:22:16,171][06909] Updated weights for policy 0, policy_version 147193 (0.0029) [2024-06-28 04:22:18,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44238.3, 300 sec: 44098.0). Total num frames: 2411708416. Throughput: 0: 44243.0. Samples: 2314616880. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 04:22:18,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:22:20,014][06909] Updated weights for policy 0, policy_version 147203 (0.0034) [2024-06-28 04:22:23,850][06674] Fps is (10 sec: 47513.1, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2411921408. Throughput: 0: 44155.1. Samples: 2314874440. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 04:22:23,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:22:24,215][06909] Updated weights for policy 0, policy_version 147213 (0.0038) [2024-06-28 04:22:27,687][06909] Updated weights for policy 0, policy_version 147223 (0.0037) [2024-06-28 04:22:28,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2412118016. Throughput: 0: 44221.9. Samples: 2315008960. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 04:22:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:22:29,381][06887] Signal inference workers to stop experience collection... (32900 times) [2024-06-28 04:22:29,420][06909] InferenceWorker_p0-w0: stopping experience collection (32900 times) [2024-06-28 04:22:29,494][06887] Signal inference workers to resume experience collection... (32900 times) [2024-06-28 04:22:29,494][06909] InferenceWorker_p0-w0: resuming experience collection (32900 times) [2024-06-28 04:22:31,336][06909] Updated weights for policy 0, policy_version 147233 (0.0034) [2024-06-28 04:22:33,850][06674] Fps is (10 sec: 44237.5, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 2412363776. Throughput: 0: 44297.4. Samples: 2315284420. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 04:22:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:22:34,874][06909] Updated weights for policy 0, policy_version 147243 (0.0035) [2024-06-28 04:22:38,416][06909] Updated weights for policy 0, policy_version 147253 (0.0034) [2024-06-28 04:22:38,850][06674] Fps is (10 sec: 47513.1, 60 sec: 44236.7, 300 sec: 44098.2). Total num frames: 2412593152. Throughput: 0: 44274.5. Samples: 2315543920. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 04:22:38,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:22:42,240][06909] Updated weights for policy 0, policy_version 147263 (0.0036) [2024-06-28 04:22:43,852][06674] Fps is (10 sec: 42588.2, 60 sec: 44235.2, 300 sec: 44042.4). Total num frames: 2412789760. Throughput: 0: 44365.7. Samples: 2315678760. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 04:22:43,853][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:22:45,981][06909] Updated weights for policy 0, policy_version 147273 (0.0024) [2024-06-28 04:22:48,850][06674] Fps is (10 sec: 42599.1, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2413019136. Throughput: 0: 44353.0. Samples: 2315947380. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 04:22:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:22:49,895][06909] Updated weights for policy 0, policy_version 147283 (0.0032) [2024-06-28 04:22:53,382][06909] Updated weights for policy 0, policy_version 147293 (0.0030) [2024-06-28 04:22:53,850][06674] Fps is (10 sec: 45885.7, 60 sec: 44236.7, 300 sec: 44209.0). Total num frames: 2413248512. Throughput: 0: 44011.1. Samples: 2316198300. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 04:22:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:22:57,525][06909] Updated weights for policy 0, policy_version 147303 (0.0021) [2024-06-28 04:22:58,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2413445120. Throughput: 0: 43993.3. Samples: 2316331080. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 04:22:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:23:01,003][06909] Updated weights for policy 0, policy_version 147313 (0.0044) [2024-06-28 04:23:03,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2413690880. Throughput: 0: 44193.8. Samples: 2316605600. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 04:23:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:23:04,663][06909] Updated weights for policy 0, policy_version 147323 (0.0029) [2024-06-28 04:23:08,317][06909] Updated weights for policy 0, policy_version 147333 (0.0026) [2024-06-28 04:23:08,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43690.7, 300 sec: 44097.9). Total num frames: 2413903872. Throughput: 0: 44226.8. Samples: 2316864640. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 04:23:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:23:11,894][06909] Updated weights for policy 0, policy_version 147343 (0.0035) [2024-06-28 04:23:13,850][06674] Fps is (10 sec: 42598.7, 60 sec: 44509.8, 300 sec: 44098.0). Total num frames: 2414116864. Throughput: 0: 44159.6. Samples: 2316996140. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 04:23:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:23:16,059][06909] Updated weights for policy 0, policy_version 147353 (0.0036) [2024-06-28 04:23:18,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2414346240. Throughput: 0: 44070.5. Samples: 2317267600. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 04:23:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:23:19,223][06909] Updated weights for policy 0, policy_version 147363 (0.0031) [2024-06-28 04:23:23,535][06909] Updated weights for policy 0, policy_version 147373 (0.0041) [2024-06-28 04:23:23,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.7, 300 sec: 44098.3). Total num frames: 2414559232. Throughput: 0: 44105.4. Samples: 2317528660. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 04:23:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:23:27,317][06909] Updated weights for policy 0, policy_version 147383 (0.0032) [2024-06-28 04:23:28,850][06674] Fps is (10 sec: 42599.0, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2414772224. Throughput: 0: 44040.1. Samples: 2317660460. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 04:23:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:23:30,906][06909] Updated weights for policy 0, policy_version 147393 (0.0045) [2024-06-28 04:23:33,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.6, 300 sec: 44153.5). Total num frames: 2415001600. Throughput: 0: 44009.6. Samples: 2317927820. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 04:23:33,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:23:34,671][06909] Updated weights for policy 0, policy_version 147403 (0.0023) [2024-06-28 04:23:38,482][06909] Updated weights for policy 0, policy_version 147413 (0.0022) [2024-06-28 04:23:38,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.8, 300 sec: 44098.0). Total num frames: 2415214592. Throughput: 0: 44213.0. Samples: 2318187880. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 04:23:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:23:40,299][06887] Signal inference workers to stop experience collection... (32950 times) [2024-06-28 04:23:40,299][06887] Signal inference workers to resume experience collection... (32950 times) [2024-06-28 04:23:40,335][06909] InferenceWorker_p0-w0: stopping experience collection (32950 times) [2024-06-28 04:23:40,335][06909] InferenceWorker_p0-w0: resuming experience collection (32950 times) [2024-06-28 04:23:42,122][06909] Updated weights for policy 0, policy_version 147423 (0.0042) [2024-06-28 04:23:43,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44238.5, 300 sec: 44097.9). Total num frames: 2415443968. Throughput: 0: 44140.8. Samples: 2318317420. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 04:23:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:23:45,664][06909] Updated weights for policy 0, policy_version 147433 (0.0031) [2024-06-28 04:23:48,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2415673344. Throughput: 0: 44140.1. Samples: 2318591900. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 04:23:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:23:48,874][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000147442_2415689728.pth... [2024-06-28 04:23:48,931][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000146795_2405089280.pth [2024-06-28 04:23:49,600][06909] Updated weights for policy 0, policy_version 147443 (0.0033) [2024-06-28 04:23:53,679][06909] Updated weights for policy 0, policy_version 147453 (0.0036) [2024-06-28 04:23:53,856][06674] Fps is (10 sec: 42572.6, 60 sec: 43686.3, 300 sec: 44097.0). Total num frames: 2415869952. Throughput: 0: 44091.8. Samples: 2318849040. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 04:23:53,856][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:23:57,219][06909] Updated weights for policy 0, policy_version 147463 (0.0031) [2024-06-28 04:23:58,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44782.9, 300 sec: 44153.5). Total num frames: 2416132096. Throughput: 0: 44083.6. Samples: 2318979900. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 04:23:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:24:01,046][06909] Updated weights for policy 0, policy_version 147473 (0.0028) [2024-06-28 04:24:03,850][06674] Fps is (10 sec: 45903.6, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2416328704. Throughput: 0: 44057.1. Samples: 2319250160. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 04:24:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:24:04,818][06909] Updated weights for policy 0, policy_version 147483 (0.0034) [2024-06-28 04:24:08,347][06909] Updated weights for policy 0, policy_version 147493 (0.0033) [2024-06-28 04:24:08,852][06674] Fps is (10 sec: 40951.6, 60 sec: 43962.2, 300 sec: 44097.7). Total num frames: 2416541696. Throughput: 0: 44060.8. Samples: 2319511480. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 04:24:08,852][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 04:24:12,561][06909] Updated weights for policy 0, policy_version 147503 (0.0036) [2024-06-28 04:24:13,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2416771072. Throughput: 0: 44030.7. Samples: 2319641840. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 04:24:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:24:15,778][06909] Updated weights for policy 0, policy_version 147513 (0.0036) [2024-06-28 04:24:18,850][06674] Fps is (10 sec: 44245.6, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2416984064. Throughput: 0: 43927.7. Samples: 2319904560. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 04:24:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:24:20,073][06909] Updated weights for policy 0, policy_version 147523 (0.0039) [2024-06-28 04:24:23,114][06909] Updated weights for policy 0, policy_version 147533 (0.0021) [2024-06-28 04:24:23,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2417180672. Throughput: 0: 43893.3. Samples: 2320163080. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 04:24:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:24:27,330][06909] Updated weights for policy 0, policy_version 147543 (0.0033) [2024-06-28 04:24:28,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2417426432. Throughput: 0: 43877.8. Samples: 2320291920. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 04:24:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:24:31,274][06909] Updated weights for policy 0, policy_version 147553 (0.0030) [2024-06-28 04:24:33,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 2417639424. Throughput: 0: 43762.5. Samples: 2320561220. Policy #0 lag: (min: 0.0, avg: 11.1, max: 25.0) [2024-06-28 04:24:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:24:34,990][06909] Updated weights for policy 0, policy_version 147563 (0.0031) [2024-06-28 04:24:38,514][06909] Updated weights for policy 0, policy_version 147573 (0.0024) [2024-06-28 04:24:38,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2417836032. Throughput: 0: 43873.5. Samples: 2320823080. Policy #0 lag: (min: 0.0, avg: 11.1, max: 25.0) [2024-06-28 04:24:38,853][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:24:42,201][06909] Updated weights for policy 0, policy_version 147583 (0.0030) [2024-06-28 04:24:43,511][06887] Signal inference workers to stop experience collection... (33000 times) [2024-06-28 04:24:43,516][06887] Signal inference workers to resume experience collection... (33000 times) [2024-06-28 04:24:43,541][06909] InferenceWorker_p0-w0: stopping experience collection (33000 times) [2024-06-28 04:24:43,542][06909] InferenceWorker_p0-w0: resuming experience collection (33000 times) [2024-06-28 04:24:43,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2418098176. Throughput: 0: 43936.9. Samples: 2320957060. Policy #0 lag: (min: 0.0, avg: 11.1, max: 25.0) [2024-06-28 04:24:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:24:45,698][06909] Updated weights for policy 0, policy_version 147593 (0.0032) [2024-06-28 04:24:48,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43417.6, 300 sec: 43986.9). Total num frames: 2418278400. Throughput: 0: 43667.5. Samples: 2321215200. Policy #0 lag: (min: 0.0, avg: 11.1, max: 25.0) [2024-06-28 04:24:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:24:49,901][06909] Updated weights for policy 0, policy_version 147603 (0.0045) [2024-06-28 04:24:53,020][06909] Updated weights for policy 0, policy_version 147613 (0.0025) [2024-06-28 04:24:53,850][06674] Fps is (10 sec: 39321.3, 60 sec: 43695.1, 300 sec: 44042.4). Total num frames: 2418491392. Throughput: 0: 43565.9. Samples: 2321471860. Policy #0 lag: (min: 0.0, avg: 11.1, max: 25.0) [2024-06-28 04:24:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:24:57,280][06909] Updated weights for policy 0, policy_version 147623 (0.0037) [2024-06-28 04:24:58,850][06674] Fps is (10 sec: 47513.1, 60 sec: 43690.6, 300 sec: 44098.0). Total num frames: 2418753536. Throughput: 0: 43737.7. Samples: 2321610040. Policy #0 lag: (min: 0.0, avg: 11.1, max: 25.0) [2024-06-28 04:24:58,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:25:00,219][06909] Updated weights for policy 0, policy_version 147633 (0.0022) [2024-06-28 04:25:03,856][06674] Fps is (10 sec: 44210.1, 60 sec: 43413.1, 300 sec: 43986.3). Total num frames: 2418933760. Throughput: 0: 43709.2. Samples: 2321871740. Policy #0 lag: (min: 0.0, avg: 11.1, max: 25.0) [2024-06-28 04:25:03,856][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:25:04,704][06909] Updated weights for policy 0, policy_version 147643 (0.0033) [2024-06-28 04:25:08,368][06909] Updated weights for policy 0, policy_version 147653 (0.0032) [2024-06-28 04:25:08,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43692.1, 300 sec: 43986.9). Total num frames: 2419163136. Throughput: 0: 43719.5. Samples: 2322130460. Policy #0 lag: (min: 0.0, avg: 11.1, max: 25.0) [2024-06-28 04:25:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:25:12,635][06909] Updated weights for policy 0, policy_version 147663 (0.0037) [2024-06-28 04:25:13,850][06674] Fps is (10 sec: 49180.7, 60 sec: 44236.6, 300 sec: 44153.5). Total num frames: 2419425280. Throughput: 0: 43862.5. Samples: 2322265740. Policy #0 lag: (min: 0.0, avg: 11.1, max: 25.0) [2024-06-28 04:25:13,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:25:15,468][06909] Updated weights for policy 0, policy_version 147673 (0.0030) [2024-06-28 04:25:18,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 2419605504. Throughput: 0: 43670.6. Samples: 2322526400. Policy #0 lag: (min: 0.0, avg: 11.1, max: 25.0) [2024-06-28 04:25:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:25:19,951][06909] Updated weights for policy 0, policy_version 147683 (0.0023) [2024-06-28 04:25:22,667][06909] Updated weights for policy 0, policy_version 147693 (0.0025) [2024-06-28 04:25:23,850][06674] Fps is (10 sec: 37684.3, 60 sec: 43690.7, 300 sec: 43931.9). Total num frames: 2419802112. Throughput: 0: 43699.2. Samples: 2322789540. Policy #0 lag: (min: 0.0, avg: 11.1, max: 25.0) [2024-06-28 04:25:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:25:27,301][06909] Updated weights for policy 0, policy_version 147703 (0.0033) [2024-06-28 04:25:28,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2420064256. Throughput: 0: 43692.0. Samples: 2322923200. Policy #0 lag: (min: 0.0, avg: 11.1, max: 25.0) [2024-06-28 04:25:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:25:30,053][06909] Updated weights for policy 0, policy_version 147713 (0.0039) [2024-06-28 04:25:33,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43417.6, 300 sec: 43931.3). Total num frames: 2420244480. Throughput: 0: 43741.7. Samples: 2323183580. Policy #0 lag: (min: 0.0, avg: 11.1, max: 25.0) [2024-06-28 04:25:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:25:34,661][06909] Updated weights for policy 0, policy_version 147723 (0.0030) [2024-06-28 04:25:37,566][06909] Updated weights for policy 0, policy_version 147733 (0.0025) [2024-06-28 04:25:38,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2420473856. Throughput: 0: 43958.3. Samples: 2323449980. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 04:25:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:25:42,046][06909] Updated weights for policy 0, policy_version 147743 (0.0023) [2024-06-28 04:25:43,850][06674] Fps is (10 sec: 49152.6, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2420736000. Throughput: 0: 43741.5. Samples: 2323578400. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 04:25:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:25:45,631][06909] Updated weights for policy 0, policy_version 147753 (0.0023) [2024-06-28 04:25:48,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 2420916224. Throughput: 0: 43890.3. Samples: 2323846540. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 04:25:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:25:48,873][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000147761_2420916224.pth... [2024-06-28 04:25:48,918][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000147118_2410381312.pth [2024-06-28 04:25:49,680][06909] Updated weights for policy 0, policy_version 147763 (0.0037) [2024-06-28 04:25:52,813][06909] Updated weights for policy 0, policy_version 147773 (0.0033) [2024-06-28 04:25:53,852][06674] Fps is (10 sec: 39312.1, 60 sec: 43962.0, 300 sec: 43986.5). Total num frames: 2421129216. Throughput: 0: 44080.4. Samples: 2324114180. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 04:25:53,853][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:25:57,087][06909] Updated weights for policy 0, policy_version 147783 (0.0040) [2024-06-28 04:25:58,751][06887] Signal inference workers to stop experience collection... (33050 times) [2024-06-28 04:25:58,751][06887] Signal inference workers to resume experience collection... (33050 times) [2024-06-28 04:25:58,804][06909] InferenceWorker_p0-w0: stopping experience collection (33050 times) [2024-06-28 04:25:58,804][06909] InferenceWorker_p0-w0: resuming experience collection (33050 times) [2024-06-28 04:25:58,850][06674] Fps is (10 sec: 47513.8, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 2421391360. Throughput: 0: 44038.9. Samples: 2324247480. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 04:25:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:26:00,758][06909] Updated weights for policy 0, policy_version 147793 (0.0033) [2024-06-28 04:26:03,850][06674] Fps is (10 sec: 44247.0, 60 sec: 43968.1, 300 sec: 43931.3). Total num frames: 2421571584. Throughput: 0: 44077.8. Samples: 2324509900. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 04:26:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:26:04,351][06909] Updated weights for policy 0, policy_version 147803 (0.0034) [2024-06-28 04:26:08,334][06909] Updated weights for policy 0, policy_version 147813 (0.0025) [2024-06-28 04:26:08,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43963.7, 300 sec: 44042.7). Total num frames: 2421800960. Throughput: 0: 44265.7. Samples: 2324781500. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 04:26:08,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:26:11,574][06909] Updated weights for policy 0, policy_version 147823 (0.0030) [2024-06-28 04:26:13,850][06674] Fps is (10 sec: 49152.4, 60 sec: 43963.9, 300 sec: 44098.3). Total num frames: 2422063104. Throughput: 0: 44128.5. Samples: 2324908980. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 04:26:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:26:15,542][06909] Updated weights for policy 0, policy_version 147833 (0.0034) [2024-06-28 04:26:18,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 2422243328. Throughput: 0: 44092.5. Samples: 2325167740. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 04:26:18,859][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:26:19,350][06909] Updated weights for policy 0, policy_version 147843 (0.0032) [2024-06-28 04:26:22,862][06909] Updated weights for policy 0, policy_version 147853 (0.0036) [2024-06-28 04:26:23,852][06674] Fps is (10 sec: 39313.5, 60 sec: 44235.3, 300 sec: 43986.6). Total num frames: 2422456320. Throughput: 0: 44159.3. Samples: 2325437240. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 04:26:23,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:26:26,919][06909] Updated weights for policy 0, policy_version 147863 (0.0039) [2024-06-28 04:26:28,850][06674] Fps is (10 sec: 47513.9, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2422718464. Throughput: 0: 44193.7. Samples: 2325567120. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 04:26:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:26:30,780][06909] Updated weights for policy 0, policy_version 147873 (0.0038) [2024-06-28 04:26:33,850][06674] Fps is (10 sec: 44245.5, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 2422898688. Throughput: 0: 44055.5. Samples: 2325829040. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 04:26:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:26:34,119][06909] Updated weights for policy 0, policy_version 147883 (0.0039) [2024-06-28 04:26:38,243][06909] Updated weights for policy 0, policy_version 147893 (0.0028) [2024-06-28 04:26:38,850][06674] Fps is (10 sec: 39321.6, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2423111680. Throughput: 0: 44097.5. Samples: 2326098460. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 04:26:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:26:41,397][06909] Updated weights for policy 0, policy_version 147903 (0.0033) [2024-06-28 04:26:43,850][06674] Fps is (10 sec: 49152.9, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2423390208. Throughput: 0: 43980.1. Samples: 2326226580. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 04:26:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:26:45,642][06909] Updated weights for policy 0, policy_version 147913 (0.0031) [2024-06-28 04:26:48,584][06909] Updated weights for policy 0, policy_version 147923 (0.0034) [2024-06-28 04:26:48,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2423570432. Throughput: 0: 44059.6. Samples: 2326492580. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 04:26:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:26:52,871][06909] Updated weights for policy 0, policy_version 147933 (0.0039) [2024-06-28 04:26:53,850][06674] Fps is (10 sec: 37682.7, 60 sec: 43965.4, 300 sec: 43986.9). Total num frames: 2423767040. Throughput: 0: 43886.7. Samples: 2326756400. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 04:26:53,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:26:56,341][06909] Updated weights for policy 0, policy_version 147943 (0.0033) [2024-06-28 04:26:58,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2424029184. Throughput: 0: 44004.0. Samples: 2326889160. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 04:26:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:27:00,042][06909] Updated weights for policy 0, policy_version 147953 (0.0030) [2024-06-28 04:27:03,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.8, 300 sec: 43820.3). Total num frames: 2424209408. Throughput: 0: 44177.3. Samples: 2327155720. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 04:27:03,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:27:03,874][06909] Updated weights for policy 0, policy_version 147963 (0.0028) [2024-06-28 04:27:04,283][06887] Signal inference workers to stop experience collection... (33100 times) [2024-06-28 04:27:04,303][06909] InferenceWorker_p0-w0: stopping experience collection (33100 times) [2024-06-28 04:27:04,343][06887] Signal inference workers to resume experience collection... (33100 times) [2024-06-28 04:27:04,343][06909] InferenceWorker_p0-w0: resuming experience collection (33100 times) [2024-06-28 04:27:07,797][06909] Updated weights for policy 0, policy_version 147973 (0.0027) [2024-06-28 04:27:08,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2424438784. Throughput: 0: 44174.4. Samples: 2327425000. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 04:27:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:27:11,039][06909] Updated weights for policy 0, policy_version 147983 (0.0036) [2024-06-28 04:27:13,850][06674] Fps is (10 sec: 47513.7, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 2424684544. Throughput: 0: 44098.2. Samples: 2327551540. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 04:27:13,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:27:14,901][06909] Updated weights for policy 0, policy_version 147993 (0.0031) [2024-06-28 04:27:18,237][06909] Updated weights for policy 0, policy_version 148003 (0.0038) [2024-06-28 04:27:18,850][06674] Fps is (10 sec: 44237.6, 60 sec: 43963.8, 300 sec: 43931.4). Total num frames: 2424881152. Throughput: 0: 44321.9. Samples: 2327823520. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 04:27:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:27:22,356][06909] Updated weights for policy 0, policy_version 148013 (0.0032) [2024-06-28 04:27:23,850][06674] Fps is (10 sec: 42598.7, 60 sec: 44238.4, 300 sec: 44042.4). Total num frames: 2425110528. Throughput: 0: 44240.0. Samples: 2328089260. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 04:27:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:27:25,659][06909] Updated weights for policy 0, policy_version 148023 (0.0034) [2024-06-28 04:27:28,850][06674] Fps is (10 sec: 47513.5, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2425356288. Throughput: 0: 44190.2. Samples: 2328215140. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 04:27:28,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-28 04:27:29,670][06909] Updated weights for policy 0, policy_version 148033 (0.0034) [2024-06-28 04:27:33,506][06909] Updated weights for policy 0, policy_version 148043 (0.0032) [2024-06-28 04:27:33,850][06674] Fps is (10 sec: 44236.1, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 2425552896. Throughput: 0: 43978.6. Samples: 2328471620. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 04:27:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:27:36,995][06909] Updated weights for policy 0, policy_version 148053 (0.0026) [2024-06-28 04:27:38,850][06674] Fps is (10 sec: 40959.9, 60 sec: 44236.8, 300 sec: 43987.2). Total num frames: 2425765888. Throughput: 0: 44166.3. Samples: 2328743880. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 04:27:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:27:40,750][06909] Updated weights for policy 0, policy_version 148063 (0.0026) [2024-06-28 04:27:43,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43690.5, 300 sec: 44042.4). Total num frames: 2426011648. Throughput: 0: 44112.4. Samples: 2328874220. Policy #0 lag: (min: 2.0, avg: 10.5, max: 23.0) [2024-06-28 04:27:43,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:27:45,019][06909] Updated weights for policy 0, policy_version 148073 (0.0040) [2024-06-28 04:27:48,489][06909] Updated weights for policy 0, policy_version 148083 (0.0030) [2024-06-28 04:27:48,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2426224640. Throughput: 0: 43991.6. Samples: 2329135340. Policy #0 lag: (min: 2.0, avg: 10.5, max: 23.0) [2024-06-28 04:27:48,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 04:27:48,983][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000148086_2426241024.pth... [2024-06-28 04:27:49,036][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000147442_2415689728.pth [2024-06-28 04:27:52,252][06909] Updated weights for policy 0, policy_version 148093 (0.0035) [2024-06-28 04:27:53,850][06674] Fps is (10 sec: 40960.1, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2426421248. Throughput: 0: 43918.7. Samples: 2329401340. Policy #0 lag: (min: 2.0, avg: 10.5, max: 23.0) [2024-06-28 04:27:53,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:27:55,767][06909] Updated weights for policy 0, policy_version 148103 (0.0040) [2024-06-28 04:27:56,790][06887] Signal inference workers to stop experience collection... (33150 times) [2024-06-28 04:27:56,791][06887] Signal inference workers to resume experience collection... (33150 times) [2024-06-28 04:27:56,831][06909] InferenceWorker_p0-w0: stopping experience collection (33150 times) [2024-06-28 04:27:56,831][06909] InferenceWorker_p0-w0: resuming experience collection (33150 times) [2024-06-28 04:27:58,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.7, 300 sec: 43931.4). Total num frames: 2426650624. Throughput: 0: 44002.7. Samples: 2329531660. Policy #0 lag: (min: 2.0, avg: 10.5, max: 23.0) [2024-06-28 04:27:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:27:59,659][06909] Updated weights for policy 0, policy_version 148113 (0.0034) [2024-06-28 04:28:03,186][06909] Updated weights for policy 0, policy_version 148123 (0.0039) [2024-06-28 04:28:03,850][06674] Fps is (10 sec: 45875.9, 60 sec: 44509.9, 300 sec: 43986.9). Total num frames: 2426880000. Throughput: 0: 43764.4. Samples: 2329792920. Policy #0 lag: (min: 2.0, avg: 10.5, max: 23.0) [2024-06-28 04:28:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:28:06,777][06909] Updated weights for policy 0, policy_version 148133 (0.0024) [2024-06-28 04:28:08,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 2427076608. Throughput: 0: 43801.7. Samples: 2330060340. Policy #0 lag: (min: 2.0, avg: 10.5, max: 23.0) [2024-06-28 04:28:08,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 04:28:10,619][06909] Updated weights for policy 0, policy_version 148143 (0.0029) [2024-06-28 04:28:13,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 43931.4). Total num frames: 2427305984. Throughput: 0: 43947.6. Samples: 2330192780. Policy #0 lag: (min: 2.0, avg: 10.5, max: 23.0) [2024-06-28 04:28:13,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 04:28:14,362][06909] Updated weights for policy 0, policy_version 148153 (0.0025) [2024-06-28 04:28:17,904][06909] Updated weights for policy 0, policy_version 148163 (0.0032) [2024-06-28 04:28:18,850][06674] Fps is (10 sec: 49152.3, 60 sec: 44782.9, 300 sec: 44098.0). Total num frames: 2427568128. Throughput: 0: 44306.8. Samples: 2330465420. Policy #0 lag: (min: 2.0, avg: 10.5, max: 23.0) [2024-06-28 04:28:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:28:21,830][06909] Updated weights for policy 0, policy_version 148173 (0.0030) [2024-06-28 04:28:23,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2427748352. Throughput: 0: 44104.9. Samples: 2330728600. Policy #0 lag: (min: 2.0, avg: 10.5, max: 23.0) [2024-06-28 04:28:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:28:25,378][06909] Updated weights for policy 0, policy_version 148183 (0.0026) [2024-06-28 04:28:28,850][06674] Fps is (10 sec: 39321.3, 60 sec: 43417.5, 300 sec: 43931.4). Total num frames: 2427961344. Throughput: 0: 43929.4. Samples: 2330851040. Policy #0 lag: (min: 2.0, avg: 10.5, max: 23.0) [2024-06-28 04:28:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:28:29,562][06909] Updated weights for policy 0, policy_version 148193 (0.0033) [2024-06-28 04:28:32,821][06909] Updated weights for policy 0, policy_version 148203 (0.0038) [2024-06-28 04:28:33,850][06674] Fps is (10 sec: 47513.1, 60 sec: 44509.9, 300 sec: 44097.9). Total num frames: 2428223488. Throughput: 0: 44108.4. Samples: 2331120220. Policy #0 lag: (min: 2.0, avg: 10.5, max: 23.0) [2024-06-28 04:28:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:28:36,948][06909] Updated weights for policy 0, policy_version 148213 (0.0031) [2024-06-28 04:28:38,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 2428403712. Throughput: 0: 44112.5. Samples: 2331386400. Policy #0 lag: (min: 2.0, avg: 10.5, max: 23.0) [2024-06-28 04:28:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:28:40,089][06909] Updated weights for policy 0, policy_version 148223 (0.0030) [2024-06-28 04:28:43,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 2428633088. Throughput: 0: 44030.9. Samples: 2331513060. Policy #0 lag: (min: 2.0, avg: 10.5, max: 23.0) [2024-06-28 04:28:43,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:28:44,579][06909] Updated weights for policy 0, policy_version 148233 (0.0034) [2024-06-28 04:28:47,592][06909] Updated weights for policy 0, policy_version 148243 (0.0036) [2024-06-28 04:28:48,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44236.7, 300 sec: 44098.9). Total num frames: 2428878848. Throughput: 0: 44163.9. Samples: 2331780300. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2024-06-28 04:28:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:28:51,758][06909] Updated weights for policy 0, policy_version 148253 (0.0032) [2024-06-28 04:28:53,850][06674] Fps is (10 sec: 44237.6, 60 sec: 44236.9, 300 sec: 43875.8). Total num frames: 2429075456. Throughput: 0: 44184.5. Samples: 2332048640. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2024-06-28 04:28:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:28:55,094][06909] Updated weights for policy 0, policy_version 148263 (0.0028) [2024-06-28 04:28:58,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 2429288448. Throughput: 0: 43949.2. Samples: 2332170500. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2024-06-28 04:28:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:28:59,103][06909] Updated weights for policy 0, policy_version 148273 (0.0028) [2024-06-28 04:29:02,392][06909] Updated weights for policy 0, policy_version 148283 (0.0028) [2024-06-28 04:29:03,850][06674] Fps is (10 sec: 49152.0, 60 sec: 44782.9, 300 sec: 44153.8). Total num frames: 2429566976. Throughput: 0: 43961.3. Samples: 2332443680. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2024-06-28 04:29:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:29:06,780][06909] Updated weights for policy 0, policy_version 148293 (0.0031) [2024-06-28 04:29:07,837][06887] Signal inference workers to stop experience collection... (33200 times) [2024-06-28 04:29:07,840][06887] Signal inference workers to resume experience collection... (33200 times) [2024-06-28 04:29:07,867][06909] InferenceWorker_p0-w0: stopping experience collection (33200 times) [2024-06-28 04:29:07,868][06909] InferenceWorker_p0-w0: resuming experience collection (33200 times) [2024-06-28 04:29:08,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 2429730816. Throughput: 0: 43976.9. Samples: 2332707560. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2024-06-28 04:29:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:29:09,915][06909] Updated weights for policy 0, policy_version 148303 (0.0038) [2024-06-28 04:29:13,850][06674] Fps is (10 sec: 37683.1, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 2429943808. Throughput: 0: 44005.4. Samples: 2332831280. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2024-06-28 04:29:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:29:14,233][06909] Updated weights for policy 0, policy_version 148313 (0.0035) [2024-06-28 04:29:17,455][06909] Updated weights for policy 0, policy_version 148323 (0.0031) [2024-06-28 04:29:18,850][06674] Fps is (10 sec: 49151.6, 60 sec: 44236.7, 300 sec: 44209.0). Total num frames: 2430222336. Throughput: 0: 44122.7. Samples: 2333105740. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2024-06-28 04:29:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:29:21,618][06909] Updated weights for policy 0, policy_version 148333 (0.0025) [2024-06-28 04:29:23,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 2430402560. Throughput: 0: 44130.2. Samples: 2333372260. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2024-06-28 04:29:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:29:24,833][06909] Updated weights for policy 0, policy_version 148343 (0.0036) [2024-06-28 04:29:28,850][06674] Fps is (10 sec: 37683.5, 60 sec: 43963.8, 300 sec: 43931.4). Total num frames: 2430599168. Throughput: 0: 43939.3. Samples: 2333490320. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2024-06-28 04:29:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:29:28,895][06909] Updated weights for policy 0, policy_version 148353 (0.0036) [2024-06-28 04:29:32,136][06909] Updated weights for policy 0, policy_version 148363 (0.0038) [2024-06-28 04:29:33,850][06674] Fps is (10 sec: 47513.9, 60 sec: 44236.9, 300 sec: 44209.0). Total num frames: 2430877696. Throughput: 0: 44097.4. Samples: 2333764680. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2024-06-28 04:29:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:29:36,154][06909] Updated weights for policy 0, policy_version 148373 (0.0036) [2024-06-28 04:29:38,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 2431041536. Throughput: 0: 44162.2. Samples: 2334035940. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2024-06-28 04:29:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:29:39,389][06909] Updated weights for policy 0, policy_version 148383 (0.0035) [2024-06-28 04:29:43,733][06909] Updated weights for policy 0, policy_version 148393 (0.0044) [2024-06-28 04:29:43,850][06674] Fps is (10 sec: 39321.7, 60 sec: 43963.9, 300 sec: 44042.4). Total num frames: 2431270912. Throughput: 0: 44088.1. Samples: 2334154460. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2024-06-28 04:29:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:29:47,081][06909] Updated weights for policy 0, policy_version 148403 (0.0030) [2024-06-28 04:29:48,850][06674] Fps is (10 sec: 49151.7, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 2431533056. Throughput: 0: 44087.0. Samples: 2334427600. Policy #0 lag: (min: 0.0, avg: 12.8, max: 25.0) [2024-06-28 04:29:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:29:48,898][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000148410_2431549440.pth... [2024-06-28 04:29:48,947][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000147761_2420916224.pth [2024-06-28 04:29:51,358][06909] Updated weights for policy 0, policy_version 148413 (0.0039) [2024-06-28 04:29:53,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.8, 300 sec: 43931.4). Total num frames: 2431713280. Throughput: 0: 44089.8. Samples: 2334691600. Policy #0 lag: (min: 0.0, avg: 12.8, max: 25.0) [2024-06-28 04:29:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:29:54,665][06909] Updated weights for policy 0, policy_version 148423 (0.0037) [2024-06-28 04:29:58,706][06909] Updated weights for policy 0, policy_version 148433 (0.0029) [2024-06-28 04:29:58,850][06674] Fps is (10 sec: 39321.8, 60 sec: 43963.8, 300 sec: 44043.3). Total num frames: 2431926272. Throughput: 0: 43998.7. Samples: 2334811220. Policy #0 lag: (min: 0.0, avg: 12.8, max: 25.0) [2024-06-28 04:29:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:30:01,942][06909] Updated weights for policy 0, policy_version 148443 (0.0028) [2024-06-28 04:30:03,850][06674] Fps is (10 sec: 49152.1, 60 sec: 43963.8, 300 sec: 44209.0). Total num frames: 2432204800. Throughput: 0: 44009.4. Samples: 2335086160. Policy #0 lag: (min: 0.0, avg: 12.8, max: 25.0) [2024-06-28 04:30:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:30:06,117][06909] Updated weights for policy 0, policy_version 148453 (0.0036) [2024-06-28 04:30:08,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43963.6, 300 sec: 43875.8). Total num frames: 2432368640. Throughput: 0: 44242.6. Samples: 2335363180. Policy #0 lag: (min: 0.0, avg: 12.8, max: 25.0) [2024-06-28 04:30:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:30:09,064][06887] Signal inference workers to stop experience collection... (33250 times) [2024-06-28 04:30:09,064][06887] Signal inference workers to resume experience collection... (33250 times) [2024-06-28 04:30:09,095][06909] InferenceWorker_p0-w0: stopping experience collection (33250 times) [2024-06-28 04:30:09,095][06909] InferenceWorker_p0-w0: resuming experience collection (33250 times) [2024-06-28 04:30:09,241][06909] Updated weights for policy 0, policy_version 148463 (0.0034) [2024-06-28 04:30:13,308][06909] Updated weights for policy 0, policy_version 148473 (0.0025) [2024-06-28 04:30:13,852][06674] Fps is (10 sec: 37675.3, 60 sec: 43962.2, 300 sec: 43986.6). Total num frames: 2432581632. Throughput: 0: 44118.9. Samples: 2335475760. Policy #0 lag: (min: 0.0, avg: 12.8, max: 25.0) [2024-06-28 04:30:13,861][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:30:16,620][06909] Updated weights for policy 0, policy_version 148483 (0.0025) [2024-06-28 04:30:18,850][06674] Fps is (10 sec: 49152.7, 60 sec: 43963.8, 300 sec: 44264.6). Total num frames: 2432860160. Throughput: 0: 44054.2. Samples: 2335747120. Policy #0 lag: (min: 0.0, avg: 12.8, max: 25.0) [2024-06-28 04:30:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:30:20,947][06909] Updated weights for policy 0, policy_version 148493 (0.0031) [2024-06-28 04:30:23,850][06674] Fps is (10 sec: 47522.8, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2433056768. Throughput: 0: 44005.7. Samples: 2336016200. Policy #0 lag: (min: 0.0, avg: 12.8, max: 25.0) [2024-06-28 04:30:23,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:30:24,123][06909] Updated weights for policy 0, policy_version 148503 (0.0031) [2024-06-28 04:30:28,493][06909] Updated weights for policy 0, policy_version 148513 (0.0026) [2024-06-28 04:30:28,850][06674] Fps is (10 sec: 39321.8, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2433253376. Throughput: 0: 44057.4. Samples: 2336137040. Policy #0 lag: (min: 0.0, avg: 12.8, max: 25.0) [2024-06-28 04:30:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:30:31,708][06909] Updated weights for policy 0, policy_version 148523 (0.0041) [2024-06-28 04:30:33,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.6, 300 sec: 44209.0). Total num frames: 2433515520. Throughput: 0: 44006.2. Samples: 2336407880. Policy #0 lag: (min: 0.0, avg: 12.8, max: 25.0) [2024-06-28 04:30:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:30:35,807][06909] Updated weights for policy 0, policy_version 148533 (0.0023) [2024-06-28 04:30:38,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44509.9, 300 sec: 43986.9). Total num frames: 2433712128. Throughput: 0: 44252.9. Samples: 2336682980. Policy #0 lag: (min: 0.0, avg: 12.8, max: 25.0) [2024-06-28 04:30:38,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 04:30:38,890][06909] Updated weights for policy 0, policy_version 148543 (0.0033) [2024-06-28 04:30:43,091][06909] Updated weights for policy 0, policy_version 148553 (0.0039) [2024-06-28 04:30:43,850][06674] Fps is (10 sec: 39321.7, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2433908736. Throughput: 0: 44455.5. Samples: 2336811720. Policy #0 lag: (min: 0.0, avg: 12.8, max: 25.0) [2024-06-28 04:30:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:30:46,612][06909] Updated weights for policy 0, policy_version 148563 (0.0040) [2024-06-28 04:30:48,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.8, 300 sec: 44209.4). Total num frames: 2434170880. Throughput: 0: 44003.5. Samples: 2337066320. Policy #0 lag: (min: 0.0, avg: 12.8, max: 25.0) [2024-06-28 04:30:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:30:50,487][06909] Updated weights for policy 0, policy_version 148573 (0.0032) [2024-06-28 04:30:53,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2434367488. Throughput: 0: 43813.5. Samples: 2337334780. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2024-06-28 04:30:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:30:53,938][06909] Updated weights for policy 0, policy_version 148583 (0.0030) [2024-06-28 04:30:58,059][06909] Updated weights for policy 0, policy_version 148593 (0.0029) [2024-06-28 04:30:58,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 2434596864. Throughput: 0: 44166.9. Samples: 2337463180. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2024-06-28 04:30:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:31:01,228][06909] Updated weights for policy 0, policy_version 148603 (0.0030) [2024-06-28 04:31:03,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43690.6, 300 sec: 44153.5). Total num frames: 2434826240. Throughput: 0: 44237.8. Samples: 2337737820. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2024-06-28 04:31:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:31:05,665][06909] Updated weights for policy 0, policy_version 148613 (0.0028) [2024-06-28 04:31:08,815][06909] Updated weights for policy 0, policy_version 148623 (0.0039) [2024-06-28 04:31:08,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44510.0, 300 sec: 43986.9). Total num frames: 2435039232. Throughput: 0: 44305.9. Samples: 2338009960. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2024-06-28 04:31:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:31:12,860][06909] Updated weights for policy 0, policy_version 148633 (0.0035) [2024-06-28 04:31:13,850][06674] Fps is (10 sec: 42597.7, 60 sec: 44511.3, 300 sec: 44097.9). Total num frames: 2435252224. Throughput: 0: 44449.1. Samples: 2338137260. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2024-06-28 04:31:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:31:16,004][06909] Updated weights for policy 0, policy_version 148643 (0.0033) [2024-06-28 04:31:16,334][06887] Signal inference workers to stop experience collection... (33300 times) [2024-06-28 04:31:16,335][06887] Signal inference workers to resume experience collection... (33300 times) [2024-06-28 04:31:16,356][06909] InferenceWorker_p0-w0: stopping experience collection (33300 times) [2024-06-28 04:31:16,356][06909] InferenceWorker_p0-w0: resuming experience collection (33300 times) [2024-06-28 04:31:18,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.7, 300 sec: 44153.8). Total num frames: 2435481600. Throughput: 0: 44276.1. Samples: 2338400300. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2024-06-28 04:31:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:31:20,217][06909] Updated weights for policy 0, policy_version 148653 (0.0025) [2024-06-28 04:31:23,476][06909] Updated weights for policy 0, policy_version 148663 (0.0031) [2024-06-28 04:31:23,850][06674] Fps is (10 sec: 45876.0, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 2435710976. Throughput: 0: 44140.4. Samples: 2338669300. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2024-06-28 04:31:23,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-28 04:31:27,748][06909] Updated weights for policy 0, policy_version 148673 (0.0026) [2024-06-28 04:31:28,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 2435923968. Throughput: 0: 44274.7. Samples: 2338804080. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2024-06-28 04:31:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:31:30,792][06909] Updated weights for policy 0, policy_version 148683 (0.0030) [2024-06-28 04:31:33,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 2436136960. Throughput: 0: 44349.8. Samples: 2339062060. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2024-06-28 04:31:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:31:35,007][06909] Updated weights for policy 0, policy_version 148693 (0.0034) [2024-06-28 04:31:38,348][06909] Updated weights for policy 0, policy_version 148703 (0.0040) [2024-06-28 04:31:38,850][06674] Fps is (10 sec: 44236.1, 60 sec: 44236.7, 300 sec: 43986.8). Total num frames: 2436366336. Throughput: 0: 44443.4. Samples: 2339334740. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2024-06-28 04:31:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:31:42,158][06909] Updated weights for policy 0, policy_version 148713 (0.0028) [2024-06-28 04:31:43,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44783.0, 300 sec: 44153.5). Total num frames: 2436595712. Throughput: 0: 44704.5. Samples: 2339474880. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2024-06-28 04:31:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:31:45,842][06909] Updated weights for policy 0, policy_version 148723 (0.0036) [2024-06-28 04:31:48,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43690.6, 300 sec: 44153.5). Total num frames: 2436792320. Throughput: 0: 44217.7. Samples: 2339727620. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2024-06-28 04:31:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:31:48,910][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000148731_2436808704.pth... [2024-06-28 04:31:48,967][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000148086_2426241024.pth [2024-06-28 04:31:49,553][06909] Updated weights for policy 0, policy_version 148733 (0.0033) [2024-06-28 04:31:53,143][06909] Updated weights for policy 0, policy_version 148743 (0.0035) [2024-06-28 04:31:53,850][06674] Fps is (10 sec: 47513.5, 60 sec: 45056.0, 300 sec: 44209.0). Total num frames: 2437070848. Throughput: 0: 44200.0. Samples: 2339998960. Policy #0 lag: (min: 0.0, avg: 9.7, max: 23.0) [2024-06-28 04:31:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:31:56,808][06909] Updated weights for policy 0, policy_version 148753 (0.0036) [2024-06-28 04:31:58,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44509.8, 300 sec: 44264.6). Total num frames: 2437267456. Throughput: 0: 44513.9. Samples: 2340140380. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 04:31:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:32:00,365][06909] Updated weights for policy 0, policy_version 148763 (0.0034) [2024-06-28 04:32:03,850][06674] Fps is (10 sec: 39321.8, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2437464064. Throughput: 0: 44267.6. Samples: 2340392340. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 04:32:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 04:32:04,136][06909] Updated weights for policy 0, policy_version 148773 (0.0036) [2024-06-28 04:32:08,016][06909] Updated weights for policy 0, policy_version 148783 (0.0036) [2024-06-28 04:32:08,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2437693440. Throughput: 0: 44174.6. Samples: 2340657160. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 04:32:08,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:32:11,930][06909] Updated weights for policy 0, policy_version 148793 (0.0026) [2024-06-28 04:32:13,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44509.9, 300 sec: 44209.0). Total num frames: 2437922816. Throughput: 0: 44161.3. Samples: 2340791340. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 04:32:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:32:15,679][06909] Updated weights for policy 0, policy_version 148803 (0.0035) [2024-06-28 04:32:18,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 2438135808. Throughput: 0: 44182.6. Samples: 2341050280. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 04:32:18,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:32:19,106][06909] Updated weights for policy 0, policy_version 148813 (0.0038) [2024-06-28 04:32:23,086][06909] Updated weights for policy 0, policy_version 148823 (0.0041) [2024-06-28 04:32:23,108][06887] Signal inference workers to stop experience collection... (33350 times) [2024-06-28 04:32:23,109][06887] Signal inference workers to resume experience collection... (33350 times) [2024-06-28 04:32:23,126][06909] InferenceWorker_p0-w0: stopping experience collection (33350 times) [2024-06-28 04:32:23,126][06909] InferenceWorker_p0-w0: resuming experience collection (33350 times) [2024-06-28 04:32:23,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 2438381568. Throughput: 0: 43942.8. Samples: 2341312160. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 04:32:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:32:26,896][06909] Updated weights for policy 0, policy_version 148833 (0.0054) [2024-06-28 04:32:28,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2438578176. Throughput: 0: 43873.3. Samples: 2341449180. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 04:32:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:32:30,300][06909] Updated weights for policy 0, policy_version 148843 (0.0035) [2024-06-28 04:32:33,850][06674] Fps is (10 sec: 40960.2, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2438791168. Throughput: 0: 44045.4. Samples: 2341709660. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 04:32:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:32:34,046][06909] Updated weights for policy 0, policy_version 148853 (0.0033) [2024-06-28 04:32:37,638][06909] Updated weights for policy 0, policy_version 148863 (0.0039) [2024-06-28 04:32:38,852][06674] Fps is (10 sec: 45865.8, 60 sec: 44508.4, 300 sec: 44153.2). Total num frames: 2439036928. Throughput: 0: 43856.7. Samples: 2341972600. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 04:32:38,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:32:41,734][06909] Updated weights for policy 0, policy_version 148873 (0.0029) [2024-06-28 04:32:43,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2439233536. Throughput: 0: 43790.7. Samples: 2342110960. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 04:32:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:32:45,204][06909] Updated weights for policy 0, policy_version 148883 (0.0034) [2024-06-28 04:32:48,850][06674] Fps is (10 sec: 40968.6, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2439446528. Throughput: 0: 44053.8. Samples: 2342374760. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 04:32:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:32:49,033][06909] Updated weights for policy 0, policy_version 148893 (0.0030) [2024-06-28 04:32:52,616][06909] Updated weights for policy 0, policy_version 148903 (0.0034) [2024-06-28 04:32:53,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43690.6, 300 sec: 44209.0). Total num frames: 2439692288. Throughput: 0: 44093.8. Samples: 2342641380. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 04:32:53,851][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 04:32:56,231][06909] Updated weights for policy 0, policy_version 148913 (0.0039) [2024-06-28 04:32:58,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2439905280. Throughput: 0: 44046.2. Samples: 2342773420. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 04:32:58,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:33:00,054][06909] Updated weights for policy 0, policy_version 148923 (0.0040) [2024-06-28 04:33:03,850][06674] Fps is (10 sec: 42598.9, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 2440118272. Throughput: 0: 44337.9. Samples: 2343045480. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 04:33:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:33:03,854][06909] Updated weights for policy 0, policy_version 148933 (0.0028) [2024-06-28 04:33:07,490][06909] Updated weights for policy 0, policy_version 148943 (0.0030) [2024-06-28 04:33:08,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44509.9, 300 sec: 44264.6). Total num frames: 2440364032. Throughput: 0: 44039.6. Samples: 2343293940. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 04:33:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:33:11,181][06909] Updated weights for policy 0, policy_version 148953 (0.0028) [2024-06-28 04:33:13,852][06674] Fps is (10 sec: 44227.4, 60 sec: 43962.3, 300 sec: 44042.1). Total num frames: 2440560640. Throughput: 0: 44110.4. Samples: 2343434240. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 04:33:13,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:33:14,813][06909] Updated weights for policy 0, policy_version 148963 (0.0021) [2024-06-28 04:33:18,764][06909] Updated weights for policy 0, policy_version 148973 (0.0034) [2024-06-28 04:33:18,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2440773632. Throughput: 0: 44253.3. Samples: 2343701060. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 04:33:18,856][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:33:22,562][06909] Updated weights for policy 0, policy_version 148983 (0.0032) [2024-06-28 04:33:23,850][06674] Fps is (10 sec: 45884.2, 60 sec: 43963.7, 300 sec: 44264.6). Total num frames: 2441019392. Throughput: 0: 44268.1. Samples: 2343964580. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 04:33:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:33:26,061][06909] Updated weights for policy 0, policy_version 148993 (0.0032) [2024-06-28 04:33:28,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2441216000. Throughput: 0: 44176.8. Samples: 2344098920. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 04:33:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:33:29,772][06909] Updated weights for policy 0, policy_version 149003 (0.0029) [2024-06-28 04:33:33,345][06909] Updated weights for policy 0, policy_version 149013 (0.0035) [2024-06-28 04:33:33,852][06674] Fps is (10 sec: 42590.1, 60 sec: 44235.3, 300 sec: 44208.7). Total num frames: 2441445376. Throughput: 0: 44190.4. Samples: 2344363420. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 04:33:33,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:33:37,219][06909] Updated weights for policy 0, policy_version 149023 (0.0035) [2024-06-28 04:33:37,788][06887] Signal inference workers to stop experience collection... (33400 times) [2024-06-28 04:33:37,788][06887] Signal inference workers to resume experience collection... (33400 times) [2024-06-28 04:33:37,830][06909] InferenceWorker_p0-w0: stopping experience collection (33400 times) [2024-06-28 04:33:37,830][06909] InferenceWorker_p0-w0: resuming experience collection (33400 times) [2024-06-28 04:33:38,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43965.3, 300 sec: 44209.1). Total num frames: 2441674752. Throughput: 0: 44067.6. Samples: 2344624420. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 04:33:38,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:33:41,023][06909] Updated weights for policy 0, policy_version 149033 (0.0029) [2024-06-28 04:33:43,850][06674] Fps is (10 sec: 42607.3, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2441871360. Throughput: 0: 44080.6. Samples: 2344757040. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 04:33:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:33:44,664][06909] Updated weights for policy 0, policy_version 149043 (0.0042) [2024-06-28 04:33:48,344][06909] Updated weights for policy 0, policy_version 149053 (0.0034) [2024-06-28 04:33:48,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2442100736. Throughput: 0: 44086.6. Samples: 2345029380. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 04:33:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:33:48,923][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000149055_2442117120.pth... [2024-06-28 04:33:48,974][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000148410_2431549440.pth [2024-06-28 04:33:51,998][06909] Updated weights for policy 0, policy_version 149063 (0.0042) [2024-06-28 04:33:53,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 2442330112. Throughput: 0: 44242.6. Samples: 2345284860. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 04:33:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:33:55,909][06909] Updated weights for policy 0, policy_version 149073 (0.0030) [2024-06-28 04:33:58,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 2442526720. Throughput: 0: 44156.2. Samples: 2345421180. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 04:33:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 04:33:59,585][06909] Updated weights for policy 0, policy_version 149083 (0.0036) [2024-06-28 04:34:03,556][06909] Updated weights for policy 0, policy_version 149093 (0.0041) [2024-06-28 04:34:03,852][06674] Fps is (10 sec: 42588.2, 60 sec: 43961.9, 300 sec: 44153.1). Total num frames: 2442756096. Throughput: 0: 44025.6. Samples: 2345682320. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 04:34:03,853][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 04:34:06,982][06909] Updated weights for policy 0, policy_version 149103 (0.0027) [2024-06-28 04:34:08,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43690.7, 300 sec: 44209.0). Total num frames: 2442985472. Throughput: 0: 44031.2. Samples: 2345945980. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 04:34:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:34:10,813][06909] Updated weights for policy 0, policy_version 149113 (0.0029) [2024-06-28 04:34:13,850][06674] Fps is (10 sec: 42608.8, 60 sec: 43692.2, 300 sec: 43931.3). Total num frames: 2443182080. Throughput: 0: 43961.0. Samples: 2346077160. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 04:34:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 04:34:14,470][06909] Updated weights for policy 0, policy_version 149123 (0.0021) [2024-06-28 04:34:18,065][06909] Updated weights for policy 0, policy_version 149133 (0.0030) [2024-06-28 04:34:18,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2443411456. Throughput: 0: 44048.6. Samples: 2346345520. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 04:34:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:34:21,764][06909] Updated weights for policy 0, policy_version 149143 (0.0030) [2024-06-28 04:34:23,850][06674] Fps is (10 sec: 47514.0, 60 sec: 43963.9, 300 sec: 44264.6). Total num frames: 2443657216. Throughput: 0: 44105.0. Samples: 2346609140. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 04:34:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:34:25,435][06909] Updated weights for policy 0, policy_version 149153 (0.0027) [2024-06-28 04:34:28,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2443853824. Throughput: 0: 44140.8. Samples: 2346743380. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 04:34:28,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:34:29,262][06909] Updated weights for policy 0, policy_version 149163 (0.0032) [2024-06-28 04:34:32,823][06909] Updated weights for policy 0, policy_version 149173 (0.0039) [2024-06-28 04:34:33,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43965.2, 300 sec: 44209.0). Total num frames: 2444083200. Throughput: 0: 43981.8. Samples: 2347008560. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 04:34:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:34:36,627][06909] Updated weights for policy 0, policy_version 149183 (0.0040) [2024-06-28 04:34:38,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 2444312576. Throughput: 0: 44142.6. Samples: 2347271280. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 04:34:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:34:40,364][06909] Updated weights for policy 0, policy_version 149193 (0.0026) [2024-06-28 04:34:43,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2444525568. Throughput: 0: 44130.3. Samples: 2347407040. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 04:34:43,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-28 04:34:43,943][06909] Updated weights for policy 0, policy_version 149203 (0.0046) [2024-06-28 04:34:47,777][06909] Updated weights for policy 0, policy_version 149213 (0.0028) [2024-06-28 04:34:48,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2444738560. Throughput: 0: 44179.3. Samples: 2347670280. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 04:34:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:34:51,605][06909] Updated weights for policy 0, policy_version 149223 (0.0033) [2024-06-28 04:34:53,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 2444967936. Throughput: 0: 44176.8. Samples: 2347933940. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 04:34:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:34:55,191][06909] Updated weights for policy 0, policy_version 149233 (0.0027) [2024-06-28 04:34:58,521][06887] Signal inference workers to stop experience collection... (33450 times) [2024-06-28 04:34:58,577][06909] InferenceWorker_p0-w0: stopping experience collection (33450 times) [2024-06-28 04:34:58,585][06887] Signal inference workers to resume experience collection... (33450 times) [2024-06-28 04:34:58,590][06909] InferenceWorker_p0-w0: resuming experience collection (33450 times) [2024-06-28 04:34:58,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.7, 300 sec: 43986.8). Total num frames: 2445180928. Throughput: 0: 44239.9. Samples: 2348067960. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 04:34:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:34:58,892][06909] Updated weights for policy 0, policy_version 149243 (0.0038) [2024-06-28 04:35:02,731][06909] Updated weights for policy 0, policy_version 149253 (0.0030) [2024-06-28 04:35:03,852][06674] Fps is (10 sec: 44228.0, 60 sec: 44237.1, 300 sec: 44208.7). Total num frames: 2445410304. Throughput: 0: 44029.6. Samples: 2348326940. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 04:35:03,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:35:06,405][06909] Updated weights for policy 0, policy_version 149263 (0.0043) [2024-06-28 04:35:08,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43963.7, 300 sec: 44209.3). Total num frames: 2445623296. Throughput: 0: 44111.9. Samples: 2348594180. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2024-06-28 04:35:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:35:10,135][06909] Updated weights for policy 0, policy_version 149273 (0.0029) [2024-06-28 04:35:13,811][06909] Updated weights for policy 0, policy_version 149283 (0.0041) [2024-06-28 04:35:13,850][06674] Fps is (10 sec: 44245.7, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 2445852672. Throughput: 0: 44045.3. Samples: 2348725420. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2024-06-28 04:35:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:35:17,615][06909] Updated weights for policy 0, policy_version 149293 (0.0033) [2024-06-28 04:35:18,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 2446082048. Throughput: 0: 44009.2. Samples: 2348988980. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2024-06-28 04:35:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:35:21,191][06909] Updated weights for policy 0, policy_version 149303 (0.0036) [2024-06-28 04:35:23,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.5, 300 sec: 44153.5). Total num frames: 2446278656. Throughput: 0: 44088.4. Samples: 2349255260. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2024-06-28 04:35:23,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:35:25,013][06909] Updated weights for policy 0, policy_version 149313 (0.0040) [2024-06-28 04:35:28,570][06909] Updated weights for policy 0, policy_version 149323 (0.0028) [2024-06-28 04:35:28,850][06674] Fps is (10 sec: 42598.8, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 2446508032. Throughput: 0: 43880.0. Samples: 2349381640. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2024-06-28 04:35:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:35:32,270][06909] Updated weights for policy 0, policy_version 149333 (0.0029) [2024-06-28 04:35:33,850][06674] Fps is (10 sec: 45875.8, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2446737408. Throughput: 0: 43893.3. Samples: 2349645480. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2024-06-28 04:35:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:35:35,958][06909] Updated weights for policy 0, policy_version 149343 (0.0035) [2024-06-28 04:35:38,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 2446934016. Throughput: 0: 43989.0. Samples: 2349913440. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2024-06-28 04:35:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:35:39,750][06909] Updated weights for policy 0, policy_version 149353 (0.0022) [2024-06-28 04:35:43,669][06909] Updated weights for policy 0, policy_version 149363 (0.0029) [2024-06-28 04:35:43,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 2447163392. Throughput: 0: 43902.6. Samples: 2350043580. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2024-06-28 04:35:43,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-28 04:35:47,206][06909] Updated weights for policy 0, policy_version 149373 (0.0031) [2024-06-28 04:35:48,850][06674] Fps is (10 sec: 47513.3, 60 sec: 44509.8, 300 sec: 44209.0). Total num frames: 2447409152. Throughput: 0: 44194.0. Samples: 2350315580. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2024-06-28 04:35:48,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-28 04:35:48,876][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000149378_2447409152.pth... [2024-06-28 04:35:48,937][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000148731_2436808704.pth [2024-06-28 04:35:51,100][06909] Updated weights for policy 0, policy_version 149383 (0.0028) [2024-06-28 04:35:53,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2447605760. Throughput: 0: 44158.6. Samples: 2350581320. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2024-06-28 04:35:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:35:54,580][06909] Updated weights for policy 0, policy_version 149393 (0.0029) [2024-06-28 04:35:58,317][06909] Updated weights for policy 0, policy_version 149403 (0.0022) [2024-06-28 04:35:58,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44510.0, 300 sec: 44153.5). Total num frames: 2447851520. Throughput: 0: 44140.1. Samples: 2350711720. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2024-06-28 04:35:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:36:02,062][06909] Updated weights for policy 0, policy_version 149413 (0.0026) [2024-06-28 04:36:03,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44238.3, 300 sec: 44153.5). Total num frames: 2448064512. Throughput: 0: 44232.5. Samples: 2350979440. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2024-06-28 04:36:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:36:05,630][06909] Updated weights for policy 0, policy_version 149423 (0.0029) [2024-06-28 04:36:08,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2448277504. Throughput: 0: 44298.0. Samples: 2351248660. Policy #0 lag: (min: 0.0, avg: 10.7, max: 24.0) [2024-06-28 04:36:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:36:09,294][06909] Updated weights for policy 0, policy_version 149433 (0.0032) [2024-06-28 04:36:13,064][06909] Updated weights for policy 0, policy_version 149443 (0.0031) [2024-06-28 04:36:13,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2448506880. Throughput: 0: 44332.9. Samples: 2351376620. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 04:36:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:36:16,848][06909] Updated weights for policy 0, policy_version 149453 (0.0025) [2024-06-28 04:36:18,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 2448719872. Throughput: 0: 44300.9. Samples: 2351639020. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 04:36:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:36:20,386][06887] Signal inference workers to stop experience collection... (33500 times) [2024-06-28 04:36:20,387][06887] Signal inference workers to resume experience collection... (33500 times) [2024-06-28 04:36:20,400][06909] InferenceWorker_p0-w0: stopping experience collection (33500 times) [2024-06-28 04:36:20,401][06909] InferenceWorker_p0-w0: resuming experience collection (33500 times) [2024-06-28 04:36:20,529][06909] Updated weights for policy 0, policy_version 149463 (0.0031) [2024-06-28 04:36:23,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2448932864. Throughput: 0: 44369.3. Samples: 2351910060. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 04:36:23,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 04:36:24,173][06909] Updated weights for policy 0, policy_version 149473 (0.0026) [2024-06-28 04:36:27,951][06909] Updated weights for policy 0, policy_version 149483 (0.0035) [2024-06-28 04:36:28,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 2449162240. Throughput: 0: 44347.2. Samples: 2352039200. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 04:36:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:36:31,958][06909] Updated weights for policy 0, policy_version 149493 (0.0036) [2024-06-28 04:36:33,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2449391616. Throughput: 0: 44123.2. Samples: 2352301120. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 04:36:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:36:35,363][06909] Updated weights for policy 0, policy_version 149503 (0.0036) [2024-06-28 04:36:38,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2449571840. Throughput: 0: 44244.9. Samples: 2352572340. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 04:36:38,859][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:36:39,217][06909] Updated weights for policy 0, policy_version 149513 (0.0023) [2024-06-28 04:36:42,664][06909] Updated weights for policy 0, policy_version 149523 (0.0026) [2024-06-28 04:36:43,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44510.0, 300 sec: 44209.0). Total num frames: 2449833984. Throughput: 0: 44202.2. Samples: 2352700820. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 04:36:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:36:46,442][06909] Updated weights for policy 0, policy_version 149533 (0.0025) [2024-06-28 04:36:48,850][06674] Fps is (10 sec: 47513.3, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2450046976. Throughput: 0: 44179.5. Samples: 2352967520. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 04:36:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:36:50,042][06909] Updated weights for policy 0, policy_version 149543 (0.0026) [2024-06-28 04:36:53,850][06674] Fps is (10 sec: 42598.1, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2450259968. Throughput: 0: 44059.5. Samples: 2353231340. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 04:36:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:36:54,126][06909] Updated weights for policy 0, policy_version 149553 (0.0035) [2024-06-28 04:36:57,645][06909] Updated weights for policy 0, policy_version 149563 (0.0037) [2024-06-28 04:36:58,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 2450505728. Throughput: 0: 44130.2. Samples: 2353362480. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 04:36:58,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-28 04:37:01,406][06909] Updated weights for policy 0, policy_version 149573 (0.0032) [2024-06-28 04:37:03,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2450702336. Throughput: 0: 44133.8. Samples: 2353625040. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 04:37:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:37:04,899][06909] Updated weights for policy 0, policy_version 149583 (0.0036) [2024-06-28 04:37:08,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2450915328. Throughput: 0: 44250.7. Samples: 2353901340. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 04:37:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:37:08,948][06909] Updated weights for policy 0, policy_version 149593 (0.0029) [2024-06-28 04:37:12,278][06909] Updated weights for policy 0, policy_version 149603 (0.0026) [2024-06-28 04:37:13,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2451161088. Throughput: 0: 44337.4. Samples: 2354034380. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 04:37:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:37:16,289][06909] Updated weights for policy 0, policy_version 149613 (0.0047) [2024-06-28 04:37:18,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2451357696. Throughput: 0: 44316.4. Samples: 2354295360. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 04:37:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:37:19,558][06909] Updated weights for policy 0, policy_version 149623 (0.0028) [2024-06-28 04:37:23,552][06909] Updated weights for policy 0, policy_version 149633 (0.0029) [2024-06-28 04:37:23,856][06674] Fps is (10 sec: 44210.2, 60 sec: 44505.4, 300 sec: 44152.6). Total num frames: 2451603456. Throughput: 0: 44265.2. Samples: 2354564540. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 04:37:23,856][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 04:37:27,079][06909] Updated weights for policy 0, policy_version 149643 (0.0034) [2024-06-28 04:37:28,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 2451816448. Throughput: 0: 44228.4. Samples: 2354691100. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 04:37:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 04:37:31,138][06909] Updated weights for policy 0, policy_version 149653 (0.0037) [2024-06-28 04:37:33,850][06674] Fps is (10 sec: 40985.0, 60 sec: 43690.7, 300 sec: 43987.2). Total num frames: 2452013056. Throughput: 0: 44089.0. Samples: 2354951520. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 04:37:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 04:37:34,213][06887] Signal inference workers to stop experience collection... (33550 times) [2024-06-28 04:37:34,215][06887] Signal inference workers to resume experience collection... (33550 times) [2024-06-28 04:37:34,270][06909] InferenceWorker_p0-w0: stopping experience collection (33550 times) [2024-06-28 04:37:34,270][06909] InferenceWorker_p0-w0: resuming experience collection (33550 times) [2024-06-28 04:37:34,547][06909] Updated weights for policy 0, policy_version 149663 (0.0047) [2024-06-28 04:37:38,380][06909] Updated weights for policy 0, policy_version 149673 (0.0029) [2024-06-28 04:37:38,850][06674] Fps is (10 sec: 45874.7, 60 sec: 45055.9, 300 sec: 44209.0). Total num frames: 2452275200. Throughput: 0: 44222.6. Samples: 2355221360. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 04:37:38,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:37:41,777][06909] Updated weights for policy 0, policy_version 149683 (0.0045) [2024-06-28 04:37:43,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2452471808. Throughput: 0: 44297.4. Samples: 2355355860. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 04:37:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:37:45,939][06909] Updated weights for policy 0, policy_version 149693 (0.0035) [2024-06-28 04:37:48,850][06674] Fps is (10 sec: 42599.1, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2452701184. Throughput: 0: 44214.7. Samples: 2355614700. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 04:37:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:37:48,976][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000149702_2452717568.pth... [2024-06-28 04:37:49,036][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000149055_2442117120.pth [2024-06-28 04:37:49,216][06909] Updated weights for policy 0, policy_version 149703 (0.0033) [2024-06-28 04:37:53,366][06909] Updated weights for policy 0, policy_version 149713 (0.0023) [2024-06-28 04:37:53,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44510.0, 300 sec: 44153.5). Total num frames: 2452930560. Throughput: 0: 43994.3. Samples: 2355881080. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 04:37:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:37:56,844][06909] Updated weights for policy 0, policy_version 149723 (0.0025) [2024-06-28 04:37:58,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43690.6, 300 sec: 44097.9). Total num frames: 2453127168. Throughput: 0: 43946.7. Samples: 2356011980. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 04:37:58,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:38:00,748][06909] Updated weights for policy 0, policy_version 149733 (0.0043) [2024-06-28 04:38:03,850][06674] Fps is (10 sec: 42598.0, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2453356544. Throughput: 0: 44025.8. Samples: 2356276520. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 04:38:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 04:38:04,419][06909] Updated weights for policy 0, policy_version 149743 (0.0026) [2024-06-28 04:38:08,202][06909] Updated weights for policy 0, policy_version 149753 (0.0025) [2024-06-28 04:38:08,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44509.9, 300 sec: 44153.8). Total num frames: 2453585920. Throughput: 0: 43997.0. Samples: 2356544140. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 04:38:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:38:11,896][06909] Updated weights for policy 0, policy_version 149763 (0.0027) [2024-06-28 04:38:13,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 2453782528. Throughput: 0: 44087.6. Samples: 2356675040. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 04:38:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:38:15,469][06909] Updated weights for policy 0, policy_version 149773 (0.0042) [2024-06-28 04:38:18,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 2454011904. Throughput: 0: 44101.8. Samples: 2356936100. Policy #0 lag: (min: 1.0, avg: 9.5, max: 21.0) [2024-06-28 04:38:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:38:19,116][06909] Updated weights for policy 0, policy_version 149783 (0.0038) [2024-06-28 04:38:23,078][06909] Updated weights for policy 0, policy_version 149793 (0.0034) [2024-06-28 04:38:23,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43968.2, 300 sec: 44153.5). Total num frames: 2454241280. Throughput: 0: 44013.0. Samples: 2357201940. Policy #0 lag: (min: 1.0, avg: 9.5, max: 21.0) [2024-06-28 04:38:23,850][06674] Avg episode reward: [(0, '0.400')] [2024-06-28 04:38:26,747][06909] Updated weights for policy 0, policy_version 149803 (0.0026) [2024-06-28 04:38:28,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.7, 300 sec: 44098.3). Total num frames: 2454454272. Throughput: 0: 43938.1. Samples: 2357333080. Policy #0 lag: (min: 1.0, avg: 9.5, max: 21.0) [2024-06-28 04:38:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:38:30,453][06909] Updated weights for policy 0, policy_version 149813 (0.0039) [2024-06-28 04:38:33,852][06674] Fps is (10 sec: 44227.6, 60 sec: 44508.3, 300 sec: 44097.6). Total num frames: 2454683648. Throughput: 0: 44057.5. Samples: 2357597380. Policy #0 lag: (min: 1.0, avg: 9.5, max: 21.0) [2024-06-28 04:38:33,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:38:34,112][06909] Updated weights for policy 0, policy_version 149823 (0.0025) [2024-06-28 04:38:37,948][06909] Updated weights for policy 0, policy_version 149833 (0.0039) [2024-06-28 04:38:38,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.8, 300 sec: 44153.5). Total num frames: 2454896640. Throughput: 0: 44073.3. Samples: 2357864380. Policy #0 lag: (min: 1.0, avg: 9.5, max: 21.0) [2024-06-28 04:38:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:38:41,899][06909] Updated weights for policy 0, policy_version 149843 (0.0031) [2024-06-28 04:38:43,850][06674] Fps is (10 sec: 44245.6, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 2455126016. Throughput: 0: 44028.4. Samples: 2357993260. Policy #0 lag: (min: 1.0, avg: 9.5, max: 21.0) [2024-06-28 04:38:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:38:45,186][06909] Updated weights for policy 0, policy_version 149853 (0.0031) [2024-06-28 04:38:48,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43963.6, 300 sec: 44097.9). Total num frames: 2455339008. Throughput: 0: 44183.0. Samples: 2358264760. Policy #0 lag: (min: 1.0, avg: 9.5, max: 21.0) [2024-06-28 04:38:48,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:38:49,063][06909] Updated weights for policy 0, policy_version 149863 (0.0027) [2024-06-28 04:38:52,589][06909] Updated weights for policy 0, policy_version 149873 (0.0036) [2024-06-28 04:38:53,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43690.6, 300 sec: 44153.5). Total num frames: 2455552000. Throughput: 0: 44022.2. Samples: 2358525140. Policy #0 lag: (min: 1.0, avg: 9.5, max: 21.0) [2024-06-28 04:38:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:38:56,355][06909] Updated weights for policy 0, policy_version 149883 (0.0042) [2024-06-28 04:38:58,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.8, 300 sec: 44153.8). Total num frames: 2455781376. Throughput: 0: 44004.8. Samples: 2358655260. Policy #0 lag: (min: 1.0, avg: 9.5, max: 21.0) [2024-06-28 04:38:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:39:00,183][06909] Updated weights for policy 0, policy_version 149893 (0.0033) [2024-06-28 04:39:03,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2455994368. Throughput: 0: 44137.4. Samples: 2358922280. Policy #0 lag: (min: 1.0, avg: 9.5, max: 21.0) [2024-06-28 04:39:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:39:03,946][06909] Updated weights for policy 0, policy_version 149903 (0.0040) [2024-06-28 04:39:04,162][06887] Signal inference workers to stop experience collection... (33600 times) [2024-06-28 04:39:04,213][06887] Signal inference workers to resume experience collection... (33600 times) [2024-06-28 04:39:04,213][06909] InferenceWorker_p0-w0: stopping experience collection (33600 times) [2024-06-28 04:39:04,235][06909] InferenceWorker_p0-w0: resuming experience collection (33600 times) [2024-06-28 04:39:07,430][06909] Updated weights for policy 0, policy_version 149913 (0.0032) [2024-06-28 04:39:08,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 2456207360. Throughput: 0: 44207.6. Samples: 2359191280. Policy #0 lag: (min: 1.0, avg: 9.5, max: 21.0) [2024-06-28 04:39:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:39:11,311][06909] Updated weights for policy 0, policy_version 149923 (0.0031) [2024-06-28 04:39:13,856][06674] Fps is (10 sec: 44209.2, 60 sec: 44232.3, 300 sec: 44152.6). Total num frames: 2456436736. Throughput: 0: 44108.2. Samples: 2359318220. Policy #0 lag: (min: 1.0, avg: 9.5, max: 21.0) [2024-06-28 04:39:13,857][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:39:15,302][06909] Updated weights for policy 0, policy_version 149933 (0.0031) [2024-06-28 04:39:18,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 2456633344. Throughput: 0: 44117.1. Samples: 2359582560. Policy #0 lag: (min: 1.0, avg: 9.5, max: 21.0) [2024-06-28 04:39:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:39:19,006][06909] Updated weights for policy 0, policy_version 149943 (0.0036) [2024-06-28 04:39:22,486][06909] Updated weights for policy 0, policy_version 149953 (0.0036) [2024-06-28 04:39:23,850][06674] Fps is (10 sec: 42624.8, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 2456862720. Throughput: 0: 44080.9. Samples: 2359848020. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 04:39:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:39:26,122][06909] Updated weights for policy 0, policy_version 149963 (0.0031) [2024-06-28 04:39:28,850][06674] Fps is (10 sec: 49152.4, 60 sec: 44509.9, 300 sec: 44209.0). Total num frames: 2457124864. Throughput: 0: 44244.1. Samples: 2359984240. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 04:39:28,859][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:39:29,619][06909] Updated weights for policy 0, policy_version 149973 (0.0027) [2024-06-28 04:39:33,330][06909] Updated weights for policy 0, policy_version 149983 (0.0033) [2024-06-28 04:39:33,850][06674] Fps is (10 sec: 47513.1, 60 sec: 44238.3, 300 sec: 44153.5). Total num frames: 2457337856. Throughput: 0: 44207.1. Samples: 2360254080. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 04:39:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:39:37,238][06909] Updated weights for policy 0, policy_version 149993 (0.0036) [2024-06-28 04:39:38,856][06674] Fps is (10 sec: 40935.4, 60 sec: 43959.3, 300 sec: 44097.1). Total num frames: 2457534464. Throughput: 0: 44186.1. Samples: 2360513780. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 04:39:38,864][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:39:40,919][06909] Updated weights for policy 0, policy_version 150003 (0.0025) [2024-06-28 04:39:43,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2457763840. Throughput: 0: 44093.5. Samples: 2360639460. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 04:39:43,856][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:39:44,557][06909] Updated weights for policy 0, policy_version 150013 (0.0024) [2024-06-28 04:39:48,274][06909] Updated weights for policy 0, policy_version 150023 (0.0038) [2024-06-28 04:39:48,850][06674] Fps is (10 sec: 45902.5, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 2457993216. Throughput: 0: 44287.4. Samples: 2360915220. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 04:39:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:39:48,865][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000150024_2457993216.pth... [2024-06-28 04:39:48,921][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000149378_2447409152.pth [2024-06-28 04:39:52,005][06909] Updated weights for policy 0, policy_version 150033 (0.0026) [2024-06-28 04:39:53,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2458206208. Throughput: 0: 44190.2. Samples: 2361179840. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 04:39:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:39:55,878][06909] Updated weights for policy 0, policy_version 150043 (0.0045) [2024-06-28 04:39:58,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44510.0, 300 sec: 44209.3). Total num frames: 2458451968. Throughput: 0: 44150.5. Samples: 2361304720. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 04:39:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 04:39:59,335][06909] Updated weights for policy 0, policy_version 150053 (0.0036) [2024-06-28 04:40:03,266][06909] Updated weights for policy 0, policy_version 150063 (0.0033) [2024-06-28 04:40:03,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44509.8, 300 sec: 44209.0). Total num frames: 2458664960. Throughput: 0: 44343.1. Samples: 2361578000. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 04:40:03,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:40:06,686][06909] Updated weights for policy 0, policy_version 150073 (0.0031) [2024-06-28 04:40:08,850][06674] Fps is (10 sec: 40959.1, 60 sec: 44236.6, 300 sec: 44097.9). Total num frames: 2458861568. Throughput: 0: 44266.0. Samples: 2361840000. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 04:40:08,856][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:40:10,568][06909] Updated weights for policy 0, policy_version 150083 (0.0032) [2024-06-28 04:40:13,852][06674] Fps is (10 sec: 44228.4, 60 sec: 44513.0, 300 sec: 44153.2). Total num frames: 2459107328. Throughput: 0: 44037.2. Samples: 2361966000. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 04:40:13,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:40:14,527][06909] Updated weights for policy 0, policy_version 150093 (0.0039) [2024-06-28 04:40:18,152][06909] Updated weights for policy 0, policy_version 150103 (0.0035) [2024-06-28 04:40:18,852][06674] Fps is (10 sec: 45866.5, 60 sec: 44781.4, 300 sec: 44208.7). Total num frames: 2459320320. Throughput: 0: 44004.3. Samples: 2362234360. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 04:40:18,853][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:40:20,799][06887] Signal inference workers to stop experience collection... (33650 times) [2024-06-28 04:40:20,799][06887] Signal inference workers to resume experience collection... (33650 times) [2024-06-28 04:40:20,815][06909] InferenceWorker_p0-w0: stopping experience collection (33650 times) [2024-06-28 04:40:20,816][06909] InferenceWorker_p0-w0: resuming experience collection (33650 times) [2024-06-28 04:40:21,750][06909] Updated weights for policy 0, policy_version 150113 (0.0027) [2024-06-28 04:40:23,850][06674] Fps is (10 sec: 42606.7, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 2459533312. Throughput: 0: 44062.8. Samples: 2362496340. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 04:40:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:40:25,582][06909] Updated weights for policy 0, policy_version 150123 (0.0030) [2024-06-28 04:40:28,856][06674] Fps is (10 sec: 44218.5, 60 sec: 43959.2, 300 sec: 44152.6). Total num frames: 2459762688. Throughput: 0: 44267.1. Samples: 2362631760. Policy #0 lag: (min: 1.0, avg: 10.9, max: 22.0) [2024-06-28 04:40:28,857][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:40:28,918][06909] Updated weights for policy 0, policy_version 150133 (0.0021) [2024-06-28 04:40:33,147][06909] Updated weights for policy 0, policy_version 150143 (0.0024) [2024-06-28 04:40:33,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 2459975680. Throughput: 0: 44093.3. Samples: 2362899420. Policy #0 lag: (min: 1.0, avg: 10.9, max: 22.0) [2024-06-28 04:40:33,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:40:36,051][06909] Updated weights for policy 0, policy_version 150153 (0.0023) [2024-06-28 04:40:38,850][06674] Fps is (10 sec: 42624.4, 60 sec: 44241.1, 300 sec: 44153.5). Total num frames: 2460188672. Throughput: 0: 44044.8. Samples: 2363161860. Policy #0 lag: (min: 1.0, avg: 10.9, max: 22.0) [2024-06-28 04:40:38,850][06674] Avg episode reward: [(0, '0.416')] [2024-06-28 04:40:40,281][06909] Updated weights for policy 0, policy_version 150163 (0.0031) [2024-06-28 04:40:43,716][06909] Updated weights for policy 0, policy_version 150173 (0.0026) [2024-06-28 04:40:43,850][06674] Fps is (10 sec: 45875.7, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 2460434432. Throughput: 0: 44272.0. Samples: 2363296960. Policy #0 lag: (min: 1.0, avg: 10.9, max: 22.0) [2024-06-28 04:40:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:40:47,802][06909] Updated weights for policy 0, policy_version 150183 (0.0028) [2024-06-28 04:40:48,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2460631040. Throughput: 0: 44032.9. Samples: 2363559480. Policy #0 lag: (min: 1.0, avg: 10.9, max: 22.0) [2024-06-28 04:40:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:40:51,238][06909] Updated weights for policy 0, policy_version 150193 (0.0034) [2024-06-28 04:40:53,850][06674] Fps is (10 sec: 40959.0, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 2460844032. Throughput: 0: 43944.0. Samples: 2363817480. Policy #0 lag: (min: 1.0, avg: 10.9, max: 22.0) [2024-06-28 04:40:53,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:40:55,216][06909] Updated weights for policy 0, policy_version 150203 (0.0022) [2024-06-28 04:40:58,762][06909] Updated weights for policy 0, policy_version 150213 (0.0035) [2024-06-28 04:40:58,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2461089792. Throughput: 0: 44210.8. Samples: 2363955400. Policy #0 lag: (min: 1.0, avg: 10.9, max: 22.0) [2024-06-28 04:40:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:41:02,765][06909] Updated weights for policy 0, policy_version 150223 (0.0037) [2024-06-28 04:41:03,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43690.7, 300 sec: 44097.9). Total num frames: 2461286400. Throughput: 0: 44038.9. Samples: 2364216020. Policy #0 lag: (min: 1.0, avg: 10.9, max: 22.0) [2024-06-28 04:41:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:41:05,934][06909] Updated weights for policy 0, policy_version 150233 (0.0040) [2024-06-28 04:41:08,850][06674] Fps is (10 sec: 42598.2, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2461515776. Throughput: 0: 44278.6. Samples: 2364488880. Policy #0 lag: (min: 1.0, avg: 10.9, max: 22.0) [2024-06-28 04:41:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:41:10,332][06909] Updated weights for policy 0, policy_version 150243 (0.0030) [2024-06-28 04:41:13,379][06909] Updated weights for policy 0, policy_version 150253 (0.0049) [2024-06-28 04:41:13,850][06674] Fps is (10 sec: 47514.0, 60 sec: 44238.3, 300 sec: 44209.0). Total num frames: 2461761536. Throughput: 0: 44243.9. Samples: 2364622460. Policy #0 lag: (min: 1.0, avg: 10.9, max: 22.0) [2024-06-28 04:41:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:41:17,459][06909] Updated weights for policy 0, policy_version 150263 (0.0045) [2024-06-28 04:41:18,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43965.3, 300 sec: 44153.5). Total num frames: 2461958144. Throughput: 0: 44253.4. Samples: 2364890820. Policy #0 lag: (min: 1.0, avg: 10.9, max: 22.0) [2024-06-28 04:41:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:41:20,912][06909] Updated weights for policy 0, policy_version 150273 (0.0035) [2024-06-28 04:41:23,850][06674] Fps is (10 sec: 42597.8, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 2462187520. Throughput: 0: 44221.8. Samples: 2365151840. Policy #0 lag: (min: 1.0, avg: 10.9, max: 22.0) [2024-06-28 04:41:23,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:41:24,691][06909] Updated weights for policy 0, policy_version 150283 (0.0028) [2024-06-28 04:41:28,341][06909] Updated weights for policy 0, policy_version 150293 (0.0038) [2024-06-28 04:41:28,850][06674] Fps is (10 sec: 47512.9, 60 sec: 44514.4, 300 sec: 44209.0). Total num frames: 2462433280. Throughput: 0: 44327.8. Samples: 2365291720. Policy #0 lag: (min: 1.0, avg: 10.9, max: 22.0) [2024-06-28 04:41:28,850][06674] Avg episode reward: [(0, '0.469')] [2024-06-28 04:41:31,946][06909] Updated weights for policy 0, policy_version 150303 (0.0033) [2024-06-28 04:41:33,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.8, 300 sec: 44264.6). Total num frames: 2462629888. Throughput: 0: 44284.9. Samples: 2365552300. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 04:41:33,854][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:41:35,783][06909] Updated weights for policy 0, policy_version 150313 (0.0037) [2024-06-28 04:41:38,850][06674] Fps is (10 sec: 40960.5, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2462842880. Throughput: 0: 44275.8. Samples: 2365809880. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 04:41:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:41:39,929][06887] Signal inference workers to stop experience collection... (33700 times) [2024-06-28 04:41:39,930][06887] Signal inference workers to resume experience collection... (33700 times) [2024-06-28 04:41:39,935][06909] Updated weights for policy 0, policy_version 150323 (0.0039) [2024-06-28 04:41:39,980][06909] InferenceWorker_p0-w0: stopping experience collection (33700 times) [2024-06-28 04:41:39,980][06909] InferenceWorker_p0-w0: resuming experience collection (33700 times) [2024-06-28 04:41:43,449][06909] Updated weights for policy 0, policy_version 150333 (0.0026) [2024-06-28 04:41:43,850][06674] Fps is (10 sec: 44234.8, 60 sec: 43963.4, 300 sec: 44153.4). Total num frames: 2463072256. Throughput: 0: 44174.6. Samples: 2365943280. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 04:41:43,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:41:47,330][06909] Updated weights for policy 0, policy_version 150343 (0.0031) [2024-06-28 04:41:48,850][06674] Fps is (10 sec: 44236.1, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 2463285248. Throughput: 0: 44288.8. Samples: 2366209020. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 04:41:48,859][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:41:48,872][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000150348_2463301632.pth... [2024-06-28 04:41:48,928][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000149702_2452717568.pth [2024-06-28 04:41:50,679][06909] Updated weights for policy 0, policy_version 150353 (0.0030) [2024-06-28 04:41:53,850][06674] Fps is (10 sec: 42600.5, 60 sec: 44237.0, 300 sec: 44042.4). Total num frames: 2463498240. Throughput: 0: 44193.4. Samples: 2366477580. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 04:41:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:41:54,494][06909] Updated weights for policy 0, policy_version 150363 (0.0025) [2024-06-28 04:41:57,939][06909] Updated weights for policy 0, policy_version 150373 (0.0028) [2024-06-28 04:41:58,850][06674] Fps is (10 sec: 44237.6, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2463727616. Throughput: 0: 44144.4. Samples: 2366608960. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 04:41:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 04:42:01,626][06909] Updated weights for policy 0, policy_version 150383 (0.0028) [2024-06-28 04:42:03,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44509.9, 300 sec: 44209.0). Total num frames: 2463956992. Throughput: 0: 43996.8. Samples: 2366870680. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 04:42:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:42:05,283][06909] Updated weights for policy 0, policy_version 150393 (0.0032) [2024-06-28 04:42:08,850][06674] Fps is (10 sec: 45874.5, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 2464186368. Throughput: 0: 44127.5. Samples: 2367137580. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 04:42:08,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:42:09,150][06909] Updated weights for policy 0, policy_version 150403 (0.0033) [2024-06-28 04:42:12,915][06909] Updated weights for policy 0, policy_version 150413 (0.0034) [2024-06-28 04:42:13,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 2464382976. Throughput: 0: 43928.6. Samples: 2367268500. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 04:42:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:42:16,955][06909] Updated weights for policy 0, policy_version 150423 (0.0037) [2024-06-28 04:42:18,853][06674] Fps is (10 sec: 42583.4, 60 sec: 44234.1, 300 sec: 44098.3). Total num frames: 2464612352. Throughput: 0: 43874.2. Samples: 2367526800. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 04:42:18,854][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:42:20,475][06909] Updated weights for policy 0, policy_version 150433 (0.0031) [2024-06-28 04:42:23,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2464808960. Throughput: 0: 44049.3. Samples: 2367792100. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 04:42:23,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:42:24,581][06909] Updated weights for policy 0, policy_version 150443 (0.0035) [2024-06-28 04:42:28,030][06909] Updated weights for policy 0, policy_version 150453 (0.0034) [2024-06-28 04:42:28,850][06674] Fps is (10 sec: 42614.1, 60 sec: 43417.7, 300 sec: 44153.5). Total num frames: 2465038336. Throughput: 0: 44006.7. Samples: 2367923560. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 04:42:28,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 04:42:31,851][06909] Updated weights for policy 0, policy_version 150463 (0.0027) [2024-06-28 04:42:33,852][06674] Fps is (10 sec: 47503.9, 60 sec: 44235.3, 300 sec: 44097.7). Total num frames: 2465284096. Throughput: 0: 43945.7. Samples: 2368186660. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 04:42:33,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:42:35,277][06909] Updated weights for policy 0, policy_version 150473 (0.0030) [2024-06-28 04:42:38,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2465480704. Throughput: 0: 43865.7. Samples: 2368451540. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 04:42:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:42:39,338][06909] Updated weights for policy 0, policy_version 150483 (0.0037) [2024-06-28 04:42:43,063][06909] Updated weights for policy 0, policy_version 150493 (0.0034) [2024-06-28 04:42:43,850][06674] Fps is (10 sec: 42606.9, 60 sec: 43964.0, 300 sec: 44097.9). Total num frames: 2465710080. Throughput: 0: 43938.2. Samples: 2368586180. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 04:42:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:42:47,099][06909] Updated weights for policy 0, policy_version 150503 (0.0027) [2024-06-28 04:42:48,850][06674] Fps is (10 sec: 45875.9, 60 sec: 44237.0, 300 sec: 44098.0). Total num frames: 2465939456. Throughput: 0: 43986.4. Samples: 2368850060. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 04:42:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:42:50,650][06909] Updated weights for policy 0, policy_version 150513 (0.0041) [2024-06-28 04:42:53,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2466136064. Throughput: 0: 43829.4. Samples: 2369109900. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 04:42:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:42:54,386][06909] Updated weights for policy 0, policy_version 150523 (0.0036) [2024-06-28 04:42:57,898][06909] Updated weights for policy 0, policy_version 150533 (0.0028) [2024-06-28 04:42:58,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2466365440. Throughput: 0: 43716.8. Samples: 2369235760. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 04:42:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:43:01,642][06909] Updated weights for policy 0, policy_version 150543 (0.0032) [2024-06-28 04:43:02,279][06887] Signal inference workers to stop experience collection... (33750 times) [2024-06-28 04:43:02,279][06887] Signal inference workers to resume experience collection... (33750 times) [2024-06-28 04:43:02,303][06909] InferenceWorker_p0-w0: stopping experience collection (33750 times) [2024-06-28 04:43:02,304][06909] InferenceWorker_p0-w0: resuming experience collection (33750 times) [2024-06-28 04:43:03,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2466594816. Throughput: 0: 43921.4. Samples: 2369503100. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 04:43:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:43:05,385][06909] Updated weights for policy 0, policy_version 150553 (0.0027) [2024-06-28 04:43:08,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 2466807808. Throughput: 0: 43844.0. Samples: 2369765080. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 04:43:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:43:08,972][06909] Updated weights for policy 0, policy_version 150563 (0.0045) [2024-06-28 04:43:13,022][06909] Updated weights for policy 0, policy_version 150573 (0.0033) [2024-06-28 04:43:13,852][06674] Fps is (10 sec: 40951.6, 60 sec: 43689.2, 300 sec: 44042.1). Total num frames: 2467004416. Throughput: 0: 43829.1. Samples: 2369895960. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 04:43:13,852][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:43:16,600][06909] Updated weights for policy 0, policy_version 150583 (0.0030) [2024-06-28 04:43:18,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43966.4, 300 sec: 44097.9). Total num frames: 2467250176. Throughput: 0: 43857.6. Samples: 2370160160. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 04:43:18,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 04:43:20,220][06909] Updated weights for policy 0, policy_version 150593 (0.0032) [2024-06-28 04:43:23,850][06674] Fps is (10 sec: 44244.8, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 2467446784. Throughput: 0: 43842.0. Samples: 2370424440. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 04:43:23,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:43:24,373][06909] Updated weights for policy 0, policy_version 150603 (0.0027) [2024-06-28 04:43:28,036][06909] Updated weights for policy 0, policy_version 150613 (0.0033) [2024-06-28 04:43:28,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43690.7, 300 sec: 43987.2). Total num frames: 2467659776. Throughput: 0: 43658.3. Samples: 2370550800. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 04:43:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:43:31,683][06909] Updated weights for policy 0, policy_version 150623 (0.0021) [2024-06-28 04:43:33,850][06674] Fps is (10 sec: 47514.6, 60 sec: 43965.2, 300 sec: 44153.5). Total num frames: 2467921920. Throughput: 0: 43853.2. Samples: 2370823460. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 04:43:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:43:35,361][06909] Updated weights for policy 0, policy_version 150633 (0.0035) [2024-06-28 04:43:38,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2468118528. Throughput: 0: 43913.3. Samples: 2371086000. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 04:43:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:43:38,976][06909] Updated weights for policy 0, policy_version 150643 (0.0028) [2024-06-28 04:43:42,782][06909] Updated weights for policy 0, policy_version 150653 (0.0028) [2024-06-28 04:43:43,850][06674] Fps is (10 sec: 39321.7, 60 sec: 43417.6, 300 sec: 43986.9). Total num frames: 2468315136. Throughput: 0: 43961.8. Samples: 2371214040. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 04:43:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:43:46,214][06909] Updated weights for policy 0, policy_version 150663 (0.0043) [2024-06-28 04:43:48,856][06674] Fps is (10 sec: 45847.6, 60 sec: 43959.2, 300 sec: 44152.6). Total num frames: 2468577280. Throughput: 0: 43964.7. Samples: 2371481780. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 04:43:48,857][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:43:48,869][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000150670_2468577280.pth... [2024-06-28 04:43:48,909][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000150024_2457993216.pth [2024-06-28 04:43:50,184][06909] Updated weights for policy 0, policy_version 150673 (0.0030) [2024-06-28 04:43:53,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2468773888. Throughput: 0: 43980.0. Samples: 2371744180. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 04:43:53,856][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:43:53,878][06909] Updated weights for policy 0, policy_version 150683 (0.0031) [2024-06-28 04:43:58,066][06909] Updated weights for policy 0, policy_version 150693 (0.0031) [2024-06-28 04:43:58,850][06674] Fps is (10 sec: 40984.8, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2468986880. Throughput: 0: 43945.1. Samples: 2371873400. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 04:43:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:44:01,384][06909] Updated weights for policy 0, policy_version 150703 (0.0029) [2024-06-28 04:44:03,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2469232640. Throughput: 0: 44072.0. Samples: 2372143400. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 04:44:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:44:05,307][06909] Updated weights for policy 0, policy_version 150713 (0.0024) [2024-06-28 04:44:08,747][06909] Updated weights for policy 0, policy_version 150723 (0.0028) [2024-06-28 04:44:08,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.8, 300 sec: 44098.9). Total num frames: 2469445632. Throughput: 0: 44098.0. Samples: 2372408840. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 04:44:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:44:12,705][06909] Updated weights for policy 0, policy_version 150733 (0.0039) [2024-06-28 04:44:13,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43965.3, 300 sec: 44098.0). Total num frames: 2469642240. Throughput: 0: 44095.6. Samples: 2372535100. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 04:44:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:44:16,527][06909] Updated weights for policy 0, policy_version 150743 (0.0027) [2024-06-28 04:44:16,778][06887] Signal inference workers to stop experience collection... (33800 times) [2024-06-28 04:44:16,782][06887] Signal inference workers to resume experience collection... (33800 times) [2024-06-28 04:44:16,792][06909] InferenceWorker_p0-w0: stopping experience collection (33800 times) [2024-06-28 04:44:16,792][06909] InferenceWorker_p0-w0: resuming experience collection (33800 times) [2024-06-28 04:44:18,850][06674] Fps is (10 sec: 44235.4, 60 sec: 43963.5, 300 sec: 44153.4). Total num frames: 2469888000. Throughput: 0: 43917.1. Samples: 2372799740. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 04:44:18,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:44:19,954][06909] Updated weights for policy 0, policy_version 150753 (0.0034) [2024-06-28 04:44:23,778][06909] Updated weights for policy 0, policy_version 150763 (0.0033) [2024-06-28 04:44:23,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44237.0, 300 sec: 43986.9). Total num frames: 2470100992. Throughput: 0: 43907.2. Samples: 2373061820. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 04:44:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:44:27,665][06909] Updated weights for policy 0, policy_version 150773 (0.0032) [2024-06-28 04:44:28,850][06674] Fps is (10 sec: 42599.8, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2470313984. Throughput: 0: 44007.2. Samples: 2373194360. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 04:44:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 04:44:31,309][06909] Updated weights for policy 0, policy_version 150783 (0.0044) [2024-06-28 04:44:33,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.7, 300 sec: 44098.9). Total num frames: 2470543360. Throughput: 0: 43897.1. Samples: 2373456880. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 04:44:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:44:34,951][06909] Updated weights for policy 0, policy_version 150793 (0.0029) [2024-06-28 04:44:38,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 2470739968. Throughput: 0: 44034.6. Samples: 2373725740. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 04:44:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:44:38,861][06909] Updated weights for policy 0, policy_version 150803 (0.0029) [2024-06-28 04:44:42,526][06909] Updated weights for policy 0, policy_version 150813 (0.0030) [2024-06-28 04:44:43,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 2470985728. Throughput: 0: 44022.2. Samples: 2373854400. Policy #0 lag: (min: 0.0, avg: 13.2, max: 25.0) [2024-06-28 04:44:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:44:46,049][06909] Updated weights for policy 0, policy_version 150823 (0.0028) [2024-06-28 04:44:48,850][06674] Fps is (10 sec: 47514.1, 60 sec: 43968.2, 300 sec: 44098.0). Total num frames: 2471215104. Throughput: 0: 43832.0. Samples: 2374115840. Policy #0 lag: (min: 0.0, avg: 13.2, max: 25.0) [2024-06-28 04:44:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:44:49,882][06909] Updated weights for policy 0, policy_version 150833 (0.0024) [2024-06-28 04:44:53,665][06909] Updated weights for policy 0, policy_version 150843 (0.0036) [2024-06-28 04:44:53,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 2471411712. Throughput: 0: 43983.6. Samples: 2374388100. Policy #0 lag: (min: 0.0, avg: 13.2, max: 25.0) [2024-06-28 04:44:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 04:44:57,222][06909] Updated weights for policy 0, policy_version 150853 (0.0035) [2024-06-28 04:44:58,852][06674] Fps is (10 sec: 42589.6, 60 sec: 44235.3, 300 sec: 43986.6). Total num frames: 2471641088. Throughput: 0: 44131.3. Samples: 2374521100. Policy #0 lag: (min: 0.0, avg: 13.2, max: 25.0) [2024-06-28 04:44:58,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:45:00,965][06909] Updated weights for policy 0, policy_version 150863 (0.0030) [2024-06-28 04:45:03,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2471854080. Throughput: 0: 43968.2. Samples: 2374778300. Policy #0 lag: (min: 0.0, avg: 13.2, max: 25.0) [2024-06-28 04:45:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:45:04,866][06909] Updated weights for policy 0, policy_version 150873 (0.0030) [2024-06-28 04:45:08,428][06909] Updated weights for policy 0, policy_version 150883 (0.0023) [2024-06-28 04:45:08,850][06674] Fps is (10 sec: 44246.1, 60 sec: 43963.7, 300 sec: 43987.2). Total num frames: 2472083456. Throughput: 0: 44182.2. Samples: 2375050020. Policy #0 lag: (min: 0.0, avg: 13.2, max: 25.0) [2024-06-28 04:45:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:45:12,098][06909] Updated weights for policy 0, policy_version 150893 (0.0028) [2024-06-28 04:45:13,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44509.8, 300 sec: 44042.7). Total num frames: 2472312832. Throughput: 0: 44198.6. Samples: 2375183300. Policy #0 lag: (min: 0.0, avg: 13.2, max: 25.0) [2024-06-28 04:45:13,853][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:45:15,900][06909] Updated weights for policy 0, policy_version 150903 (0.0047) [2024-06-28 04:45:18,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.9, 300 sec: 44042.4). Total num frames: 2472525824. Throughput: 0: 44156.9. Samples: 2375443940. Policy #0 lag: (min: 0.0, avg: 13.2, max: 25.0) [2024-06-28 04:45:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:45:19,544][06909] Updated weights for policy 0, policy_version 150913 (0.0028) [2024-06-28 04:45:23,168][06909] Updated weights for policy 0, policy_version 150923 (0.0041) [2024-06-28 04:45:23,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.8, 300 sec: 44043.3). Total num frames: 2472755200. Throughput: 0: 44110.8. Samples: 2375710720. Policy #0 lag: (min: 0.0, avg: 13.2, max: 25.0) [2024-06-28 04:45:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:45:27,050][06909] Updated weights for policy 0, policy_version 150933 (0.0033) [2024-06-28 04:45:28,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2472951808. Throughput: 0: 44327.1. Samples: 2375849120. Policy #0 lag: (min: 0.0, avg: 13.2, max: 25.0) [2024-06-28 04:45:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:45:30,603][06909] Updated weights for policy 0, policy_version 150943 (0.0034) [2024-06-28 04:45:33,850][06674] Fps is (10 sec: 42597.7, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 2473181184. Throughput: 0: 44093.2. Samples: 2376100040. Policy #0 lag: (min: 0.0, avg: 13.2, max: 25.0) [2024-06-28 04:45:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:45:34,456][06909] Updated weights for policy 0, policy_version 150953 (0.0039) [2024-06-28 04:45:38,197][06909] Updated weights for policy 0, policy_version 150963 (0.0042) [2024-06-28 04:45:38,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44510.0, 300 sec: 43986.9). Total num frames: 2473410560. Throughput: 0: 44029.8. Samples: 2376369440. Policy #0 lag: (min: 0.0, avg: 13.2, max: 25.0) [2024-06-28 04:45:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:45:41,886][06909] Updated weights for policy 0, policy_version 150973 (0.0025) [2024-06-28 04:45:43,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 2473607168. Throughput: 0: 44014.8. Samples: 2376501680. Policy #0 lag: (min: 0.0, avg: 13.2, max: 25.0) [2024-06-28 04:45:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:45:45,548][06909] Updated weights for policy 0, policy_version 150983 (0.0045) [2024-06-28 04:45:48,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2473836544. Throughput: 0: 44112.0. Samples: 2376763340. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-28 04:45:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:45:48,864][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000150991_2473836544.pth... [2024-06-28 04:45:48,928][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000150348_2463301632.pth [2024-06-28 04:45:49,487][06909] Updated weights for policy 0, policy_version 150993 (0.0035) [2024-06-28 04:45:53,057][06909] Updated weights for policy 0, policy_version 151003 (0.0032) [2024-06-28 04:45:53,793][06887] Signal inference workers to stop experience collection... (33850 times) [2024-06-28 04:45:53,842][06887] Signal inference workers to resume experience collection... (33850 times) [2024-06-28 04:45:53,842][06909] InferenceWorker_p0-w0: stopping experience collection (33850 times) [2024-06-28 04:45:53,850][06674] Fps is (10 sec: 47514.3, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 2474082304. Throughput: 0: 44062.7. Samples: 2377032840. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-28 04:45:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:45:53,856][06909] InferenceWorker_p0-w0: resuming experience collection (33850 times) [2024-06-28 04:45:56,864][06909] Updated weights for policy 0, policy_version 151013 (0.0026) [2024-06-28 04:45:58,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43965.3, 300 sec: 44042.4). Total num frames: 2474278912. Throughput: 0: 44061.4. Samples: 2377166060. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-28 04:45:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:46:00,554][06909] Updated weights for policy 0, policy_version 151023 (0.0041) [2024-06-28 04:46:03,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2474491904. Throughput: 0: 43890.2. Samples: 2377419000. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-28 04:46:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:46:04,632][06909] Updated weights for policy 0, policy_version 151033 (0.0029) [2024-06-28 04:46:07,832][06909] Updated weights for policy 0, policy_version 151043 (0.0041) [2024-06-28 04:46:08,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2474737664. Throughput: 0: 43896.9. Samples: 2377686080. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-28 04:46:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:46:11,835][06909] Updated weights for policy 0, policy_version 151053 (0.0032) [2024-06-28 04:46:13,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2474934272. Throughput: 0: 43864.4. Samples: 2377823020. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-28 04:46:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:46:15,329][06909] Updated weights for policy 0, policy_version 151063 (0.0025) [2024-06-28 04:46:18,851][06674] Fps is (10 sec: 42592.9, 60 sec: 43962.8, 300 sec: 43986.7). Total num frames: 2475163648. Throughput: 0: 43970.4. Samples: 2378078760. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-28 04:46:18,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:46:19,059][06909] Updated weights for policy 0, policy_version 151073 (0.0036) [2024-06-28 04:46:23,011][06909] Updated weights for policy 0, policy_version 151083 (0.0032) [2024-06-28 04:46:23,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43963.6, 300 sec: 43931.3). Total num frames: 2475393024. Throughput: 0: 43925.2. Samples: 2378346080. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-28 04:46:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:46:26,672][06909] Updated weights for policy 0, policy_version 151093 (0.0025) [2024-06-28 04:46:28,850][06674] Fps is (10 sec: 44242.2, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 2475606016. Throughput: 0: 44077.4. Samples: 2378485160. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-28 04:46:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:46:30,224][06909] Updated weights for policy 0, policy_version 151103 (0.0037) [2024-06-28 04:46:33,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43963.9, 300 sec: 43986.9). Total num frames: 2475819008. Throughput: 0: 44104.5. Samples: 2378748040. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-28 04:46:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:46:33,936][06909] Updated weights for policy 0, policy_version 151113 (0.0035) [2024-06-28 04:46:37,283][06909] Updated weights for policy 0, policy_version 151123 (0.0027) [2024-06-28 04:46:38,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44236.7, 300 sec: 44042.5). Total num frames: 2476064768. Throughput: 0: 44146.1. Samples: 2379019420. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-28 04:46:38,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:46:41,560][06909] Updated weights for policy 0, policy_version 151133 (0.0029) [2024-06-28 04:46:43,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 2476261376. Throughput: 0: 44272.4. Samples: 2379158320. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-28 04:46:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:46:44,501][06909] Updated weights for policy 0, policy_version 151143 (0.0026) [2024-06-28 04:46:48,701][06909] Updated weights for policy 0, policy_version 151153 (0.0030) [2024-06-28 04:46:48,850][06674] Fps is (10 sec: 42599.0, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2476490752. Throughput: 0: 44474.3. Samples: 2379420340. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-28 04:46:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:46:52,086][06909] Updated weights for policy 0, policy_version 151163 (0.0030) [2024-06-28 04:46:53,850][06674] Fps is (10 sec: 47513.4, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2476736512. Throughput: 0: 44433.7. Samples: 2379685600. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-28 04:46:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:46:56,130][06909] Updated weights for policy 0, policy_version 151173 (0.0025) [2024-06-28 04:46:58,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 2476949504. Throughput: 0: 44465.8. Samples: 2379823980. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-28 04:46:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:46:59,275][06909] Updated weights for policy 0, policy_version 151183 (0.0037) [2024-06-28 04:47:03,764][06909] Updated weights for policy 0, policy_version 151193 (0.0036) [2024-06-28 04:47:03,850][06674] Fps is (10 sec: 40960.0, 60 sec: 44236.8, 300 sec: 43931.4). Total num frames: 2477146112. Throughput: 0: 44506.1. Samples: 2380081480. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-28 04:47:03,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:47:07,019][06909] Updated weights for policy 0, policy_version 151203 (0.0030) [2024-06-28 04:47:08,852][06674] Fps is (10 sec: 44227.3, 60 sec: 44235.3, 300 sec: 44097.6). Total num frames: 2477391872. Throughput: 0: 44305.6. Samples: 2380339920. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-28 04:47:08,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:47:11,278][06909] Updated weights for policy 0, policy_version 151213 (0.0031) [2024-06-28 04:47:13,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44236.8, 300 sec: 43987.4). Total num frames: 2477588480. Throughput: 0: 44373.9. Samples: 2380481980. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-28 04:47:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:47:14,297][06909] Updated weights for policy 0, policy_version 151223 (0.0027) [2024-06-28 04:47:18,738][06909] Updated weights for policy 0, policy_version 151233 (0.0023) [2024-06-28 04:47:18,850][06674] Fps is (10 sec: 40968.3, 60 sec: 43964.6, 300 sec: 44042.4). Total num frames: 2477801472. Throughput: 0: 44303.5. Samples: 2380741700. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-28 04:47:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:47:21,551][06909] Updated weights for policy 0, policy_version 151243 (0.0026) [2024-06-28 04:47:23,850][06674] Fps is (10 sec: 45874.1, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2478047232. Throughput: 0: 44063.1. Samples: 2381002260. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-28 04:47:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:47:25,899][06909] Updated weights for policy 0, policy_version 151253 (0.0035) [2024-06-28 04:47:28,534][06887] Signal inference workers to stop experience collection... (33900 times) [2024-06-28 04:47:28,583][06909] InferenceWorker_p0-w0: stopping experience collection (33900 times) [2024-06-28 04:47:28,585][06887] Signal inference workers to resume experience collection... (33900 times) [2024-06-28 04:47:28,598][06909] InferenceWorker_p0-w0: resuming experience collection (33900 times) [2024-06-28 04:47:28,850][06674] Fps is (10 sec: 47514.0, 60 sec: 44509.9, 300 sec: 44042.7). Total num frames: 2478276608. Throughput: 0: 44205.4. Samples: 2381147560. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-28 04:47:28,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 04:47:28,890][06909] Updated weights for policy 0, policy_version 151263 (0.0040) [2024-06-28 04:47:33,523][06909] Updated weights for policy 0, policy_version 151273 (0.0045) [2024-06-28 04:47:33,850][06674] Fps is (10 sec: 42599.2, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2478473216. Throughput: 0: 44212.0. Samples: 2381409880. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-28 04:47:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:47:36,352][06909] Updated weights for policy 0, policy_version 151283 (0.0023) [2024-06-28 04:47:38,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.9, 300 sec: 44097.9). Total num frames: 2478718976. Throughput: 0: 44046.2. Samples: 2381667680. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-28 04:47:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:47:40,815][06909] Updated weights for policy 0, policy_version 151293 (0.0042) [2024-06-28 04:47:43,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 2478931968. Throughput: 0: 44045.7. Samples: 2381806040. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-28 04:47:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 04:47:43,891][06909] Updated weights for policy 0, policy_version 151303 (0.0041) [2024-06-28 04:47:48,501][06909] Updated weights for policy 0, policy_version 151313 (0.0021) [2024-06-28 04:47:48,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2479144960. Throughput: 0: 44229.7. Samples: 2382071820. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-28 04:47:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:47:48,864][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000151315_2479144960.pth... [2024-06-28 04:47:48,924][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000150670_2468577280.pth [2024-06-28 04:47:51,165][06909] Updated weights for policy 0, policy_version 151323 (0.0030) [2024-06-28 04:47:53,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2479374336. Throughput: 0: 44215.3. Samples: 2382329520. Policy #0 lag: (min: 0.0, avg: 13.6, max: 25.0) [2024-06-28 04:47:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:47:55,844][06909] Updated weights for policy 0, policy_version 151333 (0.0026) [2024-06-28 04:47:58,682][06909] Updated weights for policy 0, policy_version 151343 (0.0022) [2024-06-28 04:47:58,856][06674] Fps is (10 sec: 45847.9, 60 sec: 44232.3, 300 sec: 44097.0). Total num frames: 2479603712. Throughput: 0: 44090.5. Samples: 2382466320. Policy #0 lag: (min: 0.0, avg: 13.6, max: 25.0) [2024-06-28 04:47:58,856][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:48:03,268][06909] Updated weights for policy 0, policy_version 151353 (0.0031) [2024-06-28 04:48:03,850][06674] Fps is (10 sec: 44237.4, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 2479816704. Throughput: 0: 44280.5. Samples: 2382734320. Policy #0 lag: (min: 0.0, avg: 13.6, max: 25.0) [2024-06-28 04:48:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:48:06,151][06909] Updated weights for policy 0, policy_version 151363 (0.0022) [2024-06-28 04:48:08,850][06674] Fps is (10 sec: 42624.0, 60 sec: 43965.2, 300 sec: 44153.8). Total num frames: 2480029696. Throughput: 0: 44318.8. Samples: 2382996600. Policy #0 lag: (min: 0.0, avg: 13.6, max: 25.0) [2024-06-28 04:48:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:48:10,600][06909] Updated weights for policy 0, policy_version 151373 (0.0026) [2024-06-28 04:48:13,268][06909] Updated weights for policy 0, policy_version 151383 (0.0034) [2024-06-28 04:48:13,853][06674] Fps is (10 sec: 47499.5, 60 sec: 45053.7, 300 sec: 44208.6). Total num frames: 2480291840. Throughput: 0: 44198.0. Samples: 2383136600. Policy #0 lag: (min: 0.0, avg: 13.6, max: 25.0) [2024-06-28 04:48:13,853][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:48:17,762][06909] Updated weights for policy 0, policy_version 151393 (0.0035) [2024-06-28 04:48:18,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 2480472064. Throughput: 0: 44400.0. Samples: 2383407880. Policy #0 lag: (min: 0.0, avg: 13.6, max: 25.0) [2024-06-28 04:48:18,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-28 04:48:20,861][06909] Updated weights for policy 0, policy_version 151403 (0.0029) [2024-06-28 04:48:23,850][06674] Fps is (10 sec: 39332.7, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2480685056. Throughput: 0: 44400.4. Samples: 2383665700. Policy #0 lag: (min: 0.0, avg: 13.6, max: 25.0) [2024-06-28 04:48:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:48:25,351][06909] Updated weights for policy 0, policy_version 151413 (0.0026) [2024-06-28 04:48:28,187][06909] Updated weights for policy 0, policy_version 151423 (0.0030) [2024-06-28 04:48:28,850][06674] Fps is (10 sec: 47513.4, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 2480947200. Throughput: 0: 44298.2. Samples: 2383799460. Policy #0 lag: (min: 0.0, avg: 13.6, max: 25.0) [2024-06-28 04:48:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:48:32,757][06909] Updated weights for policy 0, policy_version 151433 (0.0031) [2024-06-28 04:48:33,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2481127424. Throughput: 0: 44429.8. Samples: 2384071160. Policy #0 lag: (min: 0.0, avg: 13.6, max: 25.0) [2024-06-28 04:48:33,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:48:35,634][06909] Updated weights for policy 0, policy_version 151443 (0.0033) [2024-06-28 04:48:38,856][06674] Fps is (10 sec: 40935.4, 60 sec: 43959.4, 300 sec: 44208.1). Total num frames: 2481356800. Throughput: 0: 44435.5. Samples: 2384329380. Policy #0 lag: (min: 0.0, avg: 13.6, max: 25.0) [2024-06-28 04:48:38,857][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:48:39,925][06909] Updated weights for policy 0, policy_version 151453 (0.0036) [2024-06-28 04:48:41,969][06887] Signal inference workers to stop experience collection... (33950 times) [2024-06-28 04:48:41,970][06887] Signal inference workers to resume experience collection... (33950 times) [2024-06-28 04:48:41,985][06909] InferenceWorker_p0-w0: stopping experience collection (33950 times) [2024-06-28 04:48:41,985][06909] InferenceWorker_p0-w0: resuming experience collection (33950 times) [2024-06-28 04:48:43,026][06909] Updated weights for policy 0, policy_version 151463 (0.0037) [2024-06-28 04:48:43,850][06674] Fps is (10 sec: 47513.8, 60 sec: 44509.8, 300 sec: 44154.4). Total num frames: 2481602560. Throughput: 0: 44458.8. Samples: 2384466700. Policy #0 lag: (min: 0.0, avg: 13.6, max: 25.0) [2024-06-28 04:48:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:48:47,162][06909] Updated weights for policy 0, policy_version 151473 (0.0031) [2024-06-28 04:48:48,850][06674] Fps is (10 sec: 42623.9, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 2481782784. Throughput: 0: 44299.0. Samples: 2384727780. Policy #0 lag: (min: 0.0, avg: 13.6, max: 25.0) [2024-06-28 04:48:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:48:50,462][06909] Updated weights for policy 0, policy_version 151483 (0.0037) [2024-06-28 04:48:53,852][06674] Fps is (10 sec: 40951.6, 60 sec: 43962.3, 300 sec: 44153.2). Total num frames: 2482012160. Throughput: 0: 44378.9. Samples: 2384993740. Policy #0 lag: (min: 0.0, avg: 13.6, max: 25.0) [2024-06-28 04:48:53,853][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:48:54,867][06909] Updated weights for policy 0, policy_version 151493 (0.0033) [2024-06-28 04:48:58,069][06909] Updated weights for policy 0, policy_version 151503 (0.0032) [2024-06-28 04:48:58,850][06674] Fps is (10 sec: 47514.3, 60 sec: 44241.3, 300 sec: 44153.5). Total num frames: 2482257920. Throughput: 0: 44191.8. Samples: 2385125100. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2024-06-28 04:48:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:49:02,390][06909] Updated weights for policy 0, policy_version 151513 (0.0027) [2024-06-28 04:49:03,850][06674] Fps is (10 sec: 44245.6, 60 sec: 43963.6, 300 sec: 44097.9). Total num frames: 2482454528. Throughput: 0: 43998.6. Samples: 2385387820. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2024-06-28 04:49:03,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:49:05,455][06909] Updated weights for policy 0, policy_version 151523 (0.0035) [2024-06-28 04:49:08,850][06674] Fps is (10 sec: 40959.4, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2482667520. Throughput: 0: 44020.0. Samples: 2385646600. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2024-06-28 04:49:08,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:49:09,844][06909] Updated weights for policy 0, policy_version 151533 (0.0030) [2024-06-28 04:49:12,870][06909] Updated weights for policy 0, policy_version 151543 (0.0045) [2024-06-28 04:49:13,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43692.8, 300 sec: 44153.5). Total num frames: 2482913280. Throughput: 0: 44062.3. Samples: 2385782260. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2024-06-28 04:49:13,859][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:49:17,119][06909] Updated weights for policy 0, policy_version 151553 (0.0033) [2024-06-28 04:49:18,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2483109888. Throughput: 0: 43871.2. Samples: 2386045360. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2024-06-28 04:49:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:49:20,254][06909] Updated weights for policy 0, policy_version 151563 (0.0025) [2024-06-28 04:49:23,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2483322880. Throughput: 0: 44085.0. Samples: 2386312940. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2024-06-28 04:49:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:49:24,461][06909] Updated weights for policy 0, policy_version 151573 (0.0026) [2024-06-28 04:49:27,663][06909] Updated weights for policy 0, policy_version 151583 (0.0023) [2024-06-28 04:49:28,852][06674] Fps is (10 sec: 47503.9, 60 sec: 43962.3, 300 sec: 44208.7). Total num frames: 2483585024. Throughput: 0: 44052.7. Samples: 2386449160. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2024-06-28 04:49:28,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:49:31,840][06909] Updated weights for policy 0, policy_version 151593 (0.0029) [2024-06-28 04:49:33,850][06674] Fps is (10 sec: 44237.6, 60 sec: 43963.9, 300 sec: 44153.5). Total num frames: 2483765248. Throughput: 0: 43963.7. Samples: 2386706140. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2024-06-28 04:49:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:49:35,167][06909] Updated weights for policy 0, policy_version 151603 (0.0023) [2024-06-28 04:49:38,850][06674] Fps is (10 sec: 40968.1, 60 sec: 43968.1, 300 sec: 44097.9). Total num frames: 2483994624. Throughput: 0: 44008.6. Samples: 2386974040. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2024-06-28 04:49:38,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:49:39,289][06909] Updated weights for policy 0, policy_version 151613 (0.0041) [2024-06-28 04:49:42,523][06909] Updated weights for policy 0, policy_version 151623 (0.0027) [2024-06-28 04:49:43,850][06674] Fps is (10 sec: 49151.2, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 2484256768. Throughput: 0: 44155.9. Samples: 2387112120. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2024-06-28 04:49:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:49:46,709][06909] Updated weights for policy 0, policy_version 151633 (0.0031) [2024-06-28 04:49:48,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2484420608. Throughput: 0: 44097.5. Samples: 2387372200. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2024-06-28 04:49:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 04:49:48,858][06887] Signal inference workers to stop experience collection... (34000 times) [2024-06-28 04:49:48,906][06909] InferenceWorker_p0-w0: stopping experience collection (34000 times) [2024-06-28 04:49:48,915][06887] Signal inference workers to resume experience collection... (34000 times) [2024-06-28 04:49:48,920][06909] InferenceWorker_p0-w0: resuming experience collection (34000 times) [2024-06-28 04:49:49,079][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000151639_2484453376.pth... [2024-06-28 04:49:49,127][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000150991_2473836544.pth [2024-06-28 04:49:50,144][06909] Updated weights for policy 0, policy_version 151643 (0.0038) [2024-06-28 04:49:53,850][06674] Fps is (10 sec: 40959.8, 60 sec: 44238.3, 300 sec: 44153.8). Total num frames: 2484666368. Throughput: 0: 44137.7. Samples: 2387632800. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2024-06-28 04:49:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:49:54,311][06909] Updated weights for policy 0, policy_version 151653 (0.0028) [2024-06-28 04:49:57,377][06909] Updated weights for policy 0, policy_version 151663 (0.0021) [2024-06-28 04:49:58,850][06674] Fps is (10 sec: 47513.4, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 2484895744. Throughput: 0: 44198.7. Samples: 2387771200. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2024-06-28 04:49:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:50:01,579][06909] Updated weights for policy 0, policy_version 151673 (0.0030) [2024-06-28 04:50:03,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2485092352. Throughput: 0: 44241.7. Samples: 2388036240. Policy #0 lag: (min: 0.0, avg: 11.3, max: 26.0) [2024-06-28 04:50:03,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:50:05,225][06909] Updated weights for policy 0, policy_version 151683 (0.0027) [2024-06-28 04:50:08,783][06909] Updated weights for policy 0, policy_version 151693 (0.0025) [2024-06-28 04:50:08,850][06674] Fps is (10 sec: 44236.0, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 2485338112. Throughput: 0: 44088.4. Samples: 2388296920. Policy #0 lag: (min: 0.0, avg: 11.3, max: 26.0) [2024-06-28 04:50:08,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:50:12,569][06909] Updated weights for policy 0, policy_version 151703 (0.0038) [2024-06-28 04:50:13,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44236.7, 300 sec: 44209.0). Total num frames: 2485567488. Throughput: 0: 44054.3. Samples: 2388431520. Policy #0 lag: (min: 0.0, avg: 11.3, max: 26.0) [2024-06-28 04:50:13,854][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:50:16,409][06909] Updated weights for policy 0, policy_version 151713 (0.0036) [2024-06-28 04:50:18,850][06674] Fps is (10 sec: 42599.3, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2485764096. Throughput: 0: 44147.9. Samples: 2388692800. Policy #0 lag: (min: 0.0, avg: 11.3, max: 26.0) [2024-06-28 04:50:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:50:19,913][06909] Updated weights for policy 0, policy_version 151723 (0.0040) [2024-06-28 04:50:23,850][06674] Fps is (10 sec: 40960.2, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2485977088. Throughput: 0: 44090.7. Samples: 2388958120. Policy #0 lag: (min: 0.0, avg: 11.3, max: 26.0) [2024-06-28 04:50:23,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:50:24,080][06909] Updated weights for policy 0, policy_version 151733 (0.0026) [2024-06-28 04:50:27,543][06909] Updated weights for policy 0, policy_version 151743 (0.0034) [2024-06-28 04:50:28,850][06674] Fps is (10 sec: 45873.8, 60 sec: 43965.0, 300 sec: 44209.0). Total num frames: 2486222848. Throughput: 0: 43937.1. Samples: 2389089300. Policy #0 lag: (min: 0.0, avg: 11.3, max: 26.0) [2024-06-28 04:50:28,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:50:31,407][06909] Updated weights for policy 0, policy_version 151753 (0.0026) [2024-06-28 04:50:33,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 2486435840. Throughput: 0: 43991.5. Samples: 2389351820. Policy #0 lag: (min: 0.0, avg: 11.3, max: 26.0) [2024-06-28 04:50:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:50:34,804][06909] Updated weights for policy 0, policy_version 151763 (0.0021) [2024-06-28 04:50:38,624][06909] Updated weights for policy 0, policy_version 151773 (0.0030) [2024-06-28 04:50:38,850][06674] Fps is (10 sec: 42599.3, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 2486648832. Throughput: 0: 44096.0. Samples: 2389617120. Policy #0 lag: (min: 0.0, avg: 11.3, max: 26.0) [2024-06-28 04:50:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:50:42,557][06909] Updated weights for policy 0, policy_version 151783 (0.0040) [2024-06-28 04:50:43,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43690.7, 300 sec: 44209.0). Total num frames: 2486878208. Throughput: 0: 43900.5. Samples: 2389746720. Policy #0 lag: (min: 0.0, avg: 11.3, max: 26.0) [2024-06-28 04:50:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:50:46,275][06909] Updated weights for policy 0, policy_version 151793 (0.0032) [2024-06-28 04:50:48,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44509.8, 300 sec: 44097.9). Total num frames: 2487091200. Throughput: 0: 43805.9. Samples: 2390007500. Policy #0 lag: (min: 0.0, avg: 11.3, max: 26.0) [2024-06-28 04:50:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:50:49,721][06909] Updated weights for policy 0, policy_version 151803 (0.0034) [2024-06-28 04:50:53,754][06909] Updated weights for policy 0, policy_version 151813 (0.0033) [2024-06-28 04:50:53,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2487304192. Throughput: 0: 43877.9. Samples: 2390271420. Policy #0 lag: (min: 0.0, avg: 11.3, max: 26.0) [2024-06-28 04:50:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:50:57,169][06909] Updated weights for policy 0, policy_version 151823 (0.0031) [2024-06-28 04:50:58,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 2487533568. Throughput: 0: 43840.5. Samples: 2390404340. Policy #0 lag: (min: 0.0, avg: 11.3, max: 26.0) [2024-06-28 04:50:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:50:59,697][06887] Signal inference workers to stop experience collection... (34050 times) [2024-06-28 04:50:59,748][06909] InferenceWorker_p0-w0: stopping experience collection (34050 times) [2024-06-28 04:50:59,755][06887] Signal inference workers to resume experience collection... (34050 times) [2024-06-28 04:50:59,766][06909] InferenceWorker_p0-w0: resuming experience collection (34050 times) [2024-06-28 04:51:01,215][06909] Updated weights for policy 0, policy_version 151833 (0.0030) [2024-06-28 04:51:03,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.9, 300 sec: 44097.9). Total num frames: 2487746560. Throughput: 0: 43901.3. Samples: 2390668360. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 04:51:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:51:04,728][06909] Updated weights for policy 0, policy_version 151843 (0.0031) [2024-06-28 04:51:08,661][06909] Updated weights for policy 0, policy_version 151853 (0.0025) [2024-06-28 04:51:08,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.8, 300 sec: 44153.5). Total num frames: 2487959552. Throughput: 0: 43738.7. Samples: 2390926360. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 04:51:08,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:51:12,070][06909] Updated weights for policy 0, policy_version 151863 (0.0041) [2024-06-28 04:51:13,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.8, 300 sec: 44153.7). Total num frames: 2488188928. Throughput: 0: 43727.4. Samples: 2391057020. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 04:51:13,850][06674] Avg episode reward: [(0, '0.454')] [2024-06-28 04:51:15,872][06909] Updated weights for policy 0, policy_version 151873 (0.0027) [2024-06-28 04:51:18,852][06674] Fps is (10 sec: 44228.1, 60 sec: 43962.2, 300 sec: 44097.7). Total num frames: 2488401920. Throughput: 0: 43737.6. Samples: 2391320100. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 04:51:18,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:51:19,775][06909] Updated weights for policy 0, policy_version 151883 (0.0031) [2024-06-28 04:51:23,726][06909] Updated weights for policy 0, policy_version 151893 (0.0029) [2024-06-28 04:51:23,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2488614912. Throughput: 0: 43760.1. Samples: 2391586320. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 04:51:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:51:26,943][06909] Updated weights for policy 0, policy_version 151903 (0.0036) [2024-06-28 04:51:28,850][06674] Fps is (10 sec: 44245.8, 60 sec: 43690.9, 300 sec: 44153.5). Total num frames: 2488844288. Throughput: 0: 43832.4. Samples: 2391719180. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 04:51:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:51:31,005][06909] Updated weights for policy 0, policy_version 151913 (0.0034) [2024-06-28 04:51:33,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2489057280. Throughput: 0: 43995.5. Samples: 2391987300. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 04:51:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:51:34,410][06909] Updated weights for policy 0, policy_version 151923 (0.0038) [2024-06-28 04:51:38,200][06909] Updated weights for policy 0, policy_version 151933 (0.0027) [2024-06-28 04:51:38,853][06674] Fps is (10 sec: 45860.3, 60 sec: 44234.5, 300 sec: 44208.5). Total num frames: 2489303040. Throughput: 0: 43951.5. Samples: 2392249380. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 04:51:38,853][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:51:41,695][06909] Updated weights for policy 0, policy_version 151943 (0.0034) [2024-06-28 04:51:43,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.5, 300 sec: 44097.9). Total num frames: 2489499648. Throughput: 0: 43876.3. Samples: 2392378780. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 04:51:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:51:45,687][06909] Updated weights for policy 0, policy_version 151953 (0.0044) [2024-06-28 04:51:48,850][06674] Fps is (10 sec: 44251.2, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2489745408. Throughput: 0: 44074.3. Samples: 2392651700. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 04:51:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:51:48,860][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000151963_2489761792.pth... [2024-06-28 04:51:48,860][06909] Updated weights for policy 0, policy_version 151963 (0.0026) [2024-06-28 04:51:48,913][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000151315_2479144960.pth [2024-06-28 04:51:53,016][06909] Updated weights for policy 0, policy_version 151973 (0.0044) [2024-06-28 04:51:53,850][06674] Fps is (10 sec: 45875.8, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2489958400. Throughput: 0: 44151.1. Samples: 2392913160. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 04:51:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:51:56,462][06909] Updated weights for policy 0, policy_version 151983 (0.0039) [2024-06-28 04:51:58,856][06674] Fps is (10 sec: 40935.2, 60 sec: 43686.3, 300 sec: 44097.1). Total num frames: 2490155008. Throughput: 0: 44223.4. Samples: 2393047340. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 04:51:58,856][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:52:00,554][06909] Updated weights for policy 0, policy_version 151993 (0.0027) [2024-06-28 04:52:03,744][06909] Updated weights for policy 0, policy_version 152003 (0.0036) [2024-06-28 04:52:03,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44509.8, 300 sec: 44153.8). Total num frames: 2490417152. Throughput: 0: 44337.4. Samples: 2393315200. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 04:52:03,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:52:08,077][06909] Updated weights for policy 0, policy_version 152013 (0.0034) [2024-06-28 04:52:08,850][06674] Fps is (10 sec: 45902.6, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2490613760. Throughput: 0: 44102.1. Samples: 2393570920. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-28 04:52:08,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:52:11,151][06909] Updated weights for policy 0, policy_version 152023 (0.0027) [2024-06-28 04:52:13,850][06674] Fps is (10 sec: 39321.9, 60 sec: 43690.6, 300 sec: 44098.0). Total num frames: 2490810368. Throughput: 0: 44027.5. Samples: 2393700420. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-28 04:52:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:52:15,276][06909] Updated weights for policy 0, policy_version 152033 (0.0034) [2024-06-28 04:52:17,836][06887] Signal inference workers to stop experience collection... (34100 times) [2024-06-28 04:52:17,836][06887] Signal inference workers to resume experience collection... (34100 times) [2024-06-28 04:52:17,861][06909] InferenceWorker_p0-w0: stopping experience collection (34100 times) [2024-06-28 04:52:17,862][06909] InferenceWorker_p0-w0: resuming experience collection (34100 times) [2024-06-28 04:52:18,834][06909] Updated weights for policy 0, policy_version 152043 (0.0034) [2024-06-28 04:52:18,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44511.3, 300 sec: 44153.5). Total num frames: 2491072512. Throughput: 0: 44108.9. Samples: 2393972200. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-28 04:52:18,852][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:52:22,930][06909] Updated weights for policy 0, policy_version 152053 (0.0034) [2024-06-28 04:52:23,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2491269120. Throughput: 0: 44027.2. Samples: 2394230460. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-28 04:52:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:52:26,066][06909] Updated weights for policy 0, policy_version 152063 (0.0025) [2024-06-28 04:52:28,850][06674] Fps is (10 sec: 39321.8, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2491465728. Throughput: 0: 44018.4. Samples: 2394359600. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-28 04:52:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:52:30,441][06909] Updated weights for policy 0, policy_version 152073 (0.0040) [2024-06-28 04:52:33,713][06909] Updated weights for policy 0, policy_version 152083 (0.0028) [2024-06-28 04:52:33,852][06674] Fps is (10 sec: 45865.8, 60 sec: 44508.4, 300 sec: 44097.7). Total num frames: 2491727872. Throughput: 0: 43928.2. Samples: 2394628560. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-28 04:52:33,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:52:37,879][06909] Updated weights for policy 0, policy_version 152093 (0.0036) [2024-06-28 04:52:38,852][06674] Fps is (10 sec: 45865.7, 60 sec: 43691.5, 300 sec: 44042.1). Total num frames: 2491924480. Throughput: 0: 43899.3. Samples: 2394888720. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-28 04:52:38,852][06674] Avg episode reward: [(0, '0.417')] [2024-06-28 04:52:41,110][06909] Updated weights for policy 0, policy_version 152103 (0.0038) [2024-06-28 04:52:43,850][06674] Fps is (10 sec: 39329.7, 60 sec: 43690.8, 300 sec: 43986.9). Total num frames: 2492121088. Throughput: 0: 43916.6. Samples: 2395023320. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-28 04:52:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:52:45,218][06909] Updated weights for policy 0, policy_version 152113 (0.0032) [2024-06-28 04:52:48,378][06909] Updated weights for policy 0, policy_version 152123 (0.0036) [2024-06-28 04:52:48,850][06674] Fps is (10 sec: 45884.4, 60 sec: 43963.6, 300 sec: 44098.0). Total num frames: 2492383232. Throughput: 0: 43901.8. Samples: 2395290780. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-28 04:52:48,854][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:52:52,723][06909] Updated weights for policy 0, policy_version 152133 (0.0029) [2024-06-28 04:52:53,850][06674] Fps is (10 sec: 45874.5, 60 sec: 43690.6, 300 sec: 43987.8). Total num frames: 2492579840. Throughput: 0: 44019.5. Samples: 2395551800. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-28 04:52:53,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:52:55,743][06909] Updated weights for policy 0, policy_version 152143 (0.0032) [2024-06-28 04:52:58,850][06674] Fps is (10 sec: 39322.0, 60 sec: 43695.1, 300 sec: 43931.3). Total num frames: 2492776448. Throughput: 0: 44025.4. Samples: 2395681560. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-28 04:52:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:53:00,142][06909] Updated weights for policy 0, policy_version 152153 (0.0028) [2024-06-28 04:53:03,213][06909] Updated weights for policy 0, policy_version 152163 (0.0030) [2024-06-28 04:53:03,850][06674] Fps is (10 sec: 47514.2, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2493054976. Throughput: 0: 43965.4. Samples: 2395950640. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-28 04:53:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 04:53:07,626][06909] Updated weights for policy 0, policy_version 152173 (0.0022) [2024-06-28 04:53:08,850][06674] Fps is (10 sec: 47513.7, 60 sec: 43963.8, 300 sec: 43931.8). Total num frames: 2493251584. Throughput: 0: 44186.7. Samples: 2396218860. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-28 04:53:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:53:10,710][06909] Updated weights for policy 0, policy_version 152183 (0.0036) [2024-06-28 04:53:13,850][06674] Fps is (10 sec: 39321.8, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2493448192. Throughput: 0: 44043.1. Samples: 2396341540. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 04:53:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:53:15,285][06909] Updated weights for policy 0, policy_version 152193 (0.0041) [2024-06-28 04:53:17,853][06887] Signal inference workers to stop experience collection... (34150 times) [2024-06-28 04:53:17,882][06909] InferenceWorker_p0-w0: stopping experience collection (34150 times) [2024-06-28 04:53:17,907][06887] Signal inference workers to resume experience collection... (34150 times) [2024-06-28 04:53:17,909][06909] InferenceWorker_p0-w0: resuming experience collection (34150 times) [2024-06-28 04:53:18,043][06909] Updated weights for policy 0, policy_version 152203 (0.0028) [2024-06-28 04:53:18,850][06674] Fps is (10 sec: 47513.1, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 2493726720. Throughput: 0: 44146.8. Samples: 2396615080. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 04:53:18,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:53:22,661][06909] Updated weights for policy 0, policy_version 152213 (0.0033) [2024-06-28 04:53:23,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 2493890560. Throughput: 0: 44062.9. Samples: 2396871460. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 04:53:23,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 04:53:25,534][06909] Updated weights for policy 0, policy_version 152223 (0.0023) [2024-06-28 04:53:28,850][06674] Fps is (10 sec: 39321.6, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2494119936. Throughput: 0: 43903.9. Samples: 2396999000. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 04:53:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:53:29,950][06909] Updated weights for policy 0, policy_version 152233 (0.0028) [2024-06-28 04:53:33,046][06909] Updated weights for policy 0, policy_version 152243 (0.0034) [2024-06-28 04:53:33,850][06674] Fps is (10 sec: 49151.7, 60 sec: 44238.2, 300 sec: 44154.4). Total num frames: 2494382080. Throughput: 0: 43999.1. Samples: 2397270740. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 04:53:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:53:37,218][06909] Updated weights for policy 0, policy_version 152253 (0.0043) [2024-06-28 04:53:38,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43692.2, 300 sec: 43875.8). Total num frames: 2494545920. Throughput: 0: 44197.9. Samples: 2397540700. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 04:53:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:53:40,367][06909] Updated weights for policy 0, policy_version 152263 (0.0025) [2024-06-28 04:53:43,850][06674] Fps is (10 sec: 37683.5, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2494758912. Throughput: 0: 43978.2. Samples: 2397660580. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 04:53:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:53:44,692][06909] Updated weights for policy 0, policy_version 152273 (0.0031) [2024-06-28 04:53:47,947][06909] Updated weights for policy 0, policy_version 152283 (0.0018) [2024-06-28 04:53:48,850][06674] Fps is (10 sec: 49151.8, 60 sec: 44236.8, 300 sec: 44153.8). Total num frames: 2495037440. Throughput: 0: 43942.6. Samples: 2397928060. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 04:53:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:53:48,858][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000152285_2495037440.pth... [2024-06-28 04:53:48,923][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000151639_2484453376.pth [2024-06-28 04:53:52,258][06909] Updated weights for policy 0, policy_version 152293 (0.0034) [2024-06-28 04:53:53,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43690.8, 300 sec: 43875.8). Total num frames: 2495201280. Throughput: 0: 43965.4. Samples: 2398197300. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 04:53:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:53:55,525][06909] Updated weights for policy 0, policy_version 152303 (0.0040) [2024-06-28 04:53:58,852][06674] Fps is (10 sec: 40951.5, 60 sec: 44508.3, 300 sec: 44042.1). Total num frames: 2495447040. Throughput: 0: 43908.5. Samples: 2398317520. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 04:53:58,853][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:53:59,872][06909] Updated weights for policy 0, policy_version 152313 (0.0026) [2024-06-28 04:54:02,904][06909] Updated weights for policy 0, policy_version 152323 (0.0039) [2024-06-28 04:54:03,850][06674] Fps is (10 sec: 49151.8, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2495692800. Throughput: 0: 43792.1. Samples: 2398585720. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 04:54:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:54:07,208][06909] Updated weights for policy 0, policy_version 152333 (0.0035) [2024-06-28 04:54:08,850][06674] Fps is (10 sec: 42606.5, 60 sec: 43690.5, 300 sec: 43931.3). Total num frames: 2495873024. Throughput: 0: 44154.9. Samples: 2398858440. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 04:54:08,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:54:10,397][06909] Updated weights for policy 0, policy_version 152343 (0.0033) [2024-06-28 04:54:13,850][06674] Fps is (10 sec: 39321.7, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2496086016. Throughput: 0: 43989.0. Samples: 2398978500. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 04:54:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:54:14,562][06909] Updated weights for policy 0, policy_version 152353 (0.0032) [2024-06-28 04:54:17,924][06909] Updated weights for policy 0, policy_version 152363 (0.0033) [2024-06-28 04:54:18,850][06674] Fps is (10 sec: 47515.0, 60 sec: 43690.8, 300 sec: 44153.5). Total num frames: 2496348160. Throughput: 0: 43805.0. Samples: 2399241960. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-28 04:54:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:54:22,211][06909] Updated weights for policy 0, policy_version 152373 (0.0031) [2024-06-28 04:54:23,810][06887] Signal inference workers to stop experience collection... (34200 times) [2024-06-28 04:54:23,816][06887] Signal inference workers to resume experience collection... (34200 times) [2024-06-28 04:54:23,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.8, 300 sec: 43876.1). Total num frames: 2496528384. Throughput: 0: 43894.3. Samples: 2399515940. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-28 04:54:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:54:23,861][06909] InferenceWorker_p0-w0: stopping experience collection (34200 times) [2024-06-28 04:54:23,862][06909] InferenceWorker_p0-w0: resuming experience collection (34200 times) [2024-06-28 04:54:25,475][06909] Updated weights for policy 0, policy_version 152383 (0.0033) [2024-06-28 04:54:28,850][06674] Fps is (10 sec: 39320.9, 60 sec: 43690.6, 300 sec: 43986.8). Total num frames: 2496741376. Throughput: 0: 43907.5. Samples: 2399636420. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-28 04:54:28,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:54:29,765][06909] Updated weights for policy 0, policy_version 152393 (0.0037) [2024-06-28 04:54:32,879][06909] Updated weights for policy 0, policy_version 152403 (0.0039) [2024-06-28 04:54:33,850][06674] Fps is (10 sec: 47513.1, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 2497003520. Throughput: 0: 43792.9. Samples: 2399898740. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-28 04:54:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:54:37,096][06909] Updated weights for policy 0, policy_version 152413 (0.0026) [2024-06-28 04:54:38,850][06674] Fps is (10 sec: 44237.7, 60 sec: 43963.8, 300 sec: 43820.3). Total num frames: 2497183744. Throughput: 0: 43946.2. Samples: 2400174880. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-28 04:54:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:54:40,259][06909] Updated weights for policy 0, policy_version 152423 (0.0029) [2024-06-28 04:54:43,850][06674] Fps is (10 sec: 40960.3, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2497413120. Throughput: 0: 44034.1. Samples: 2400298960. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-28 04:54:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:54:44,611][06909] Updated weights for policy 0, policy_version 152433 (0.0024) [2024-06-28 04:54:47,368][06909] Updated weights for policy 0, policy_version 152443 (0.0039) [2024-06-28 04:54:48,850][06674] Fps is (10 sec: 47513.1, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2497658880. Throughput: 0: 43946.6. Samples: 2400563320. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-28 04:54:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:54:52,103][06909] Updated weights for policy 0, policy_version 152453 (0.0029) [2024-06-28 04:54:53,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44236.8, 300 sec: 43931.4). Total num frames: 2497855488. Throughput: 0: 44096.8. Samples: 2400842780. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-28 04:54:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:54:54,912][06909] Updated weights for policy 0, policy_version 152463 (0.0029) [2024-06-28 04:54:58,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43692.2, 300 sec: 43986.9). Total num frames: 2498068480. Throughput: 0: 44224.3. Samples: 2400968600. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-28 04:54:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:54:59,386][06909] Updated weights for policy 0, policy_version 152473 (0.0034) [2024-06-28 04:55:02,324][06909] Updated weights for policy 0, policy_version 152483 (0.0031) [2024-06-28 04:55:03,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2498314240. Throughput: 0: 44011.6. Samples: 2401222480. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-28 04:55:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:55:06,736][06909] Updated weights for policy 0, policy_version 152493 (0.0037) [2024-06-28 04:55:08,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44237.0, 300 sec: 43931.3). Total num frames: 2498527232. Throughput: 0: 44215.9. Samples: 2401505660. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-28 04:55:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:55:09,892][06909] Updated weights for policy 0, policy_version 152503 (0.0026) [2024-06-28 04:55:13,850][06674] Fps is (10 sec: 44235.4, 60 sec: 44509.6, 300 sec: 44042.4). Total num frames: 2498756608. Throughput: 0: 44287.8. Samples: 2401629380. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-28 04:55:13,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:55:14,201][06909] Updated weights for policy 0, policy_version 152513 (0.0045) [2024-06-28 04:55:16,567][06887] Signal inference workers to stop experience collection... (34250 times) [2024-06-28 04:55:16,618][06909] InferenceWorker_p0-w0: stopping experience collection (34250 times) [2024-06-28 04:55:16,682][06887] Signal inference workers to resume experience collection... (34250 times) [2024-06-28 04:55:16,683][06909] InferenceWorker_p0-w0: resuming experience collection (34250 times) [2024-06-28 04:55:17,125][06909] Updated weights for policy 0, policy_version 152523 (0.0024) [2024-06-28 04:55:18,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2498969600. Throughput: 0: 44217.3. Samples: 2401888520. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-28 04:55:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:55:22,062][06909] Updated weights for policy 0, policy_version 152533 (0.0027) [2024-06-28 04:55:23,850][06674] Fps is (10 sec: 44238.0, 60 sec: 44509.8, 300 sec: 43986.9). Total num frames: 2499198976. Throughput: 0: 44132.8. Samples: 2402160860. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 04:55:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:55:24,457][06909] Updated weights for policy 0, policy_version 152543 (0.0021) [2024-06-28 04:55:28,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44509.9, 300 sec: 43986.9). Total num frames: 2499411968. Throughput: 0: 44318.6. Samples: 2402293300. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 04:55:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:55:29,098][06909] Updated weights for policy 0, policy_version 152553 (0.0028) [2024-06-28 04:55:31,805][06909] Updated weights for policy 0, policy_version 152563 (0.0051) [2024-06-28 04:55:33,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2499641344. Throughput: 0: 44262.7. Samples: 2402555140. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 04:55:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:55:36,554][06909] Updated weights for policy 0, policy_version 152573 (0.0039) [2024-06-28 04:55:38,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44782.8, 300 sec: 44042.4). Total num frames: 2499870720. Throughput: 0: 43972.7. Samples: 2402821560. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 04:55:38,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:55:39,776][06909] Updated weights for policy 0, policy_version 152583 (0.0038) [2024-06-28 04:55:43,844][06909] Updated weights for policy 0, policy_version 152593 (0.0041) [2024-06-28 04:55:43,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 2500083712. Throughput: 0: 44077.0. Samples: 2402952060. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 04:55:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:55:47,188][06909] Updated weights for policy 0, policy_version 152603 (0.0022) [2024-06-28 04:55:48,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2500280320. Throughput: 0: 44080.3. Samples: 2403206100. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 04:55:48,856][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:55:48,869][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000152605_2500280320.pth... [2024-06-28 04:55:48,923][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000151963_2489761792.pth [2024-06-28 04:55:51,180][06909] Updated weights for policy 0, policy_version 152613 (0.0037) [2024-06-28 04:55:53,850][06674] Fps is (10 sec: 45874.3, 60 sec: 44782.7, 300 sec: 44097.9). Total num frames: 2500542464. Throughput: 0: 43823.9. Samples: 2403477740. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 04:55:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:55:54,461][06909] Updated weights for policy 0, policy_version 152623 (0.0045) [2024-06-28 04:55:58,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 2500722688. Throughput: 0: 44066.0. Samples: 2403612340. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 04:55:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:55:59,297][06909] Updated weights for policy 0, policy_version 152633 (0.0033) [2024-06-28 04:56:01,791][06909] Updated weights for policy 0, policy_version 152643 (0.0028) [2024-06-28 04:56:03,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2500952064. Throughput: 0: 44057.8. Samples: 2403871120. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 04:56:03,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:56:06,372][06909] Updated weights for policy 0, policy_version 152653 (0.0037) [2024-06-28 04:56:08,850][06674] Fps is (10 sec: 49152.2, 60 sec: 44783.0, 300 sec: 44153.5). Total num frames: 2501214208. Throughput: 0: 44030.2. Samples: 2404142220. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 04:56:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:56:09,198][06909] Updated weights for policy 0, policy_version 152663 (0.0030) [2024-06-28 04:56:13,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.8, 300 sec: 43987.2). Total num frames: 2501378048. Throughput: 0: 44146.2. Samples: 2404279880. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 04:56:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:56:13,914][06909] Updated weights for policy 0, policy_version 152673 (0.0031) [2024-06-28 04:56:16,755][06909] Updated weights for policy 0, policy_version 152683 (0.0032) [2024-06-28 04:56:18,850][06674] Fps is (10 sec: 39321.7, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2501607424. Throughput: 0: 43881.3. Samples: 2404529800. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 04:56:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:56:21,126][06909] Updated weights for policy 0, policy_version 152693 (0.0031) [2024-06-28 04:56:23,850][06674] Fps is (10 sec: 47513.8, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2501853184. Throughput: 0: 43950.2. Samples: 2404799320. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 04:56:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:56:24,238][06909] Updated weights for policy 0, policy_version 152703 (0.0028) [2024-06-28 04:56:28,331][06909] Updated weights for policy 0, policy_version 152713 (0.0020) [2024-06-28 04:56:28,852][06674] Fps is (10 sec: 44227.6, 60 sec: 43962.3, 300 sec: 44042.1). Total num frames: 2502049792. Throughput: 0: 44169.5. Samples: 2404939780. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-28 04:56:28,852][06674] Avg episode reward: [(0, '0.402')] [2024-06-28 04:56:31,777][06909] Updated weights for policy 0, policy_version 152723 (0.0034) [2024-06-28 04:56:33,852][06674] Fps is (10 sec: 40951.9, 60 sec: 43689.2, 300 sec: 43931.5). Total num frames: 2502262784. Throughput: 0: 44205.6. Samples: 2405195440. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-28 04:56:33,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:56:36,274][06909] Updated weights for policy 0, policy_version 152733 (0.0034) [2024-06-28 04:56:37,590][06887] Signal inference workers to stop experience collection... (34300 times) [2024-06-28 04:56:37,619][06909] InferenceWorker_p0-w0: stopping experience collection (34300 times) [2024-06-28 04:56:37,644][06887] Signal inference workers to resume experience collection... (34300 times) [2024-06-28 04:56:37,644][06909] InferenceWorker_p0-w0: resuming experience collection (34300 times) [2024-06-28 04:56:38,856][06674] Fps is (10 sec: 47494.8, 60 sec: 44232.4, 300 sec: 44152.6). Total num frames: 2502524928. Throughput: 0: 44139.6. Samples: 2405464280. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-28 04:56:38,856][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:56:39,015][06909] Updated weights for policy 0, policy_version 152743 (0.0027) [2024-06-28 04:56:43,441][06909] Updated weights for policy 0, policy_version 152753 (0.0028) [2024-06-28 04:56:43,850][06674] Fps is (10 sec: 45884.6, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2502721536. Throughput: 0: 44344.9. Samples: 2405607860. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-28 04:56:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:56:46,315][06909] Updated weights for policy 0, policy_version 152763 (0.0031) [2024-06-28 04:56:48,850][06674] Fps is (10 sec: 40984.7, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 2502934528. Throughput: 0: 44393.0. Samples: 2405868800. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-28 04:56:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:56:50,783][06909] Updated weights for policy 0, policy_version 152773 (0.0039) [2024-06-28 04:56:53,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.9, 300 sec: 44154.4). Total num frames: 2503180288. Throughput: 0: 44019.1. Samples: 2406123080. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-28 04:56:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:56:54,050][06909] Updated weights for policy 0, policy_version 152783 (0.0034) [2024-06-28 04:56:58,206][06909] Updated weights for policy 0, policy_version 152793 (0.0039) [2024-06-28 04:56:58,850][06674] Fps is (10 sec: 44236.2, 60 sec: 44236.7, 300 sec: 43931.3). Total num frames: 2503376896. Throughput: 0: 44076.9. Samples: 2406263340. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-28 04:56:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:57:01,603][06909] Updated weights for policy 0, policy_version 152803 (0.0024) [2024-06-28 04:57:03,852][06674] Fps is (10 sec: 40951.7, 60 sec: 43962.3, 300 sec: 43986.6). Total num frames: 2503589888. Throughput: 0: 44245.1. Samples: 2406520920. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-28 04:57:03,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:57:06,169][06909] Updated weights for policy 0, policy_version 152813 (0.0036) [2024-06-28 04:57:08,850][06674] Fps is (10 sec: 45875.9, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 2503835648. Throughput: 0: 44033.9. Samples: 2406780840. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-28 04:57:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:57:08,915][06909] Updated weights for policy 0, policy_version 152823 (0.0035) [2024-06-28 04:57:13,481][06909] Updated weights for policy 0, policy_version 152833 (0.0032) [2024-06-28 04:57:13,850][06674] Fps is (10 sec: 44245.8, 60 sec: 44236.9, 300 sec: 43931.3). Total num frames: 2504032256. Throughput: 0: 43973.1. Samples: 2406918480. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-28 04:57:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:57:16,155][06909] Updated weights for policy 0, policy_version 152843 (0.0030) [2024-06-28 04:57:18,850][06674] Fps is (10 sec: 42598.0, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2504261632. Throughput: 0: 44189.9. Samples: 2407183900. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-28 04:57:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:57:20,682][06909] Updated weights for policy 0, policy_version 152853 (0.0031) [2024-06-28 04:57:23,707][06909] Updated weights for policy 0, policy_version 152863 (0.0038) [2024-06-28 04:57:23,850][06674] Fps is (10 sec: 47513.0, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 2504507392. Throughput: 0: 44040.0. Samples: 2407445820. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-28 04:57:23,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:57:28,067][06909] Updated weights for policy 0, policy_version 152873 (0.0031) [2024-06-28 04:57:28,852][06674] Fps is (10 sec: 44228.0, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2504704000. Throughput: 0: 43898.0. Samples: 2407583360. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 04:57:28,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:57:31,055][06909] Updated weights for policy 0, policy_version 152883 (0.0032) [2024-06-28 04:57:33,850][06674] Fps is (10 sec: 40960.3, 60 sec: 44238.3, 300 sec: 44042.7). Total num frames: 2504916992. Throughput: 0: 43956.4. Samples: 2407846840. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 04:57:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:57:35,735][06909] Updated weights for policy 0, policy_version 152893 (0.0036) [2024-06-28 04:57:38,324][06909] Updated weights for policy 0, policy_version 152903 (0.0030) [2024-06-28 04:57:38,850][06674] Fps is (10 sec: 45884.6, 60 sec: 43968.1, 300 sec: 44209.0). Total num frames: 2505162752. Throughput: 0: 44008.9. Samples: 2408103480. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 04:57:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 04:57:43,515][06909] Updated weights for policy 0, policy_version 152913 (0.0039) [2024-06-28 04:57:43,665][06887] Signal inference workers to stop experience collection... (34350 times) [2024-06-28 04:57:43,665][06887] Signal inference workers to resume experience collection... (34350 times) [2024-06-28 04:57:43,691][06909] InferenceWorker_p0-w0: stopping experience collection (34350 times) [2024-06-28 04:57:43,691][06909] InferenceWorker_p0-w0: resuming experience collection (34350 times) [2024-06-28 04:57:43,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2505359360. Throughput: 0: 44072.5. Samples: 2408246600. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 04:57:43,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:57:45,749][06909] Updated weights for policy 0, policy_version 152923 (0.0036) [2024-06-28 04:57:48,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2505572352. Throughput: 0: 44131.8. Samples: 2408506760. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 04:57:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:57:48,861][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000152928_2505572352.pth... [2024-06-28 04:57:48,928][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000152285_2495037440.pth [2024-06-28 04:57:50,814][06909] Updated weights for policy 0, policy_version 152933 (0.0027) [2024-06-28 04:57:53,343][06909] Updated weights for policy 0, policy_version 152943 (0.0032) [2024-06-28 04:57:53,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 2505818112. Throughput: 0: 44007.9. Samples: 2408761200. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 04:57:53,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:57:58,125][06909] Updated weights for policy 0, policy_version 152953 (0.0029) [2024-06-28 04:57:58,850][06674] Fps is (10 sec: 44235.9, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 2506014720. Throughput: 0: 44198.9. Samples: 2408907440. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 04:57:58,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:58:00,791][06909] Updated weights for policy 0, policy_version 152963 (0.0028) [2024-06-28 04:58:03,850][06674] Fps is (10 sec: 40959.3, 60 sec: 43965.0, 300 sec: 43986.8). Total num frames: 2506227712. Throughput: 0: 44025.6. Samples: 2409165060. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 04:58:03,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:58:05,313][06909] Updated weights for policy 0, policy_version 152973 (0.0025) [2024-06-28 04:58:08,563][06909] Updated weights for policy 0, policy_version 152983 (0.0043) [2024-06-28 04:58:08,850][06674] Fps is (10 sec: 45876.3, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2506473472. Throughput: 0: 43887.7. Samples: 2409420760. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 04:58:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:58:13,078][06909] Updated weights for policy 0, policy_version 152993 (0.0037) [2024-06-28 04:58:13,852][06674] Fps is (10 sec: 45867.5, 60 sec: 44235.4, 300 sec: 43931.1). Total num frames: 2506686464. Throughput: 0: 43895.2. Samples: 2409558640. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 04:58:13,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:58:15,915][06909] Updated weights for policy 0, policy_version 153003 (0.0032) [2024-06-28 04:58:18,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2506883072. Throughput: 0: 43858.3. Samples: 2409820460. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 04:58:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 04:58:20,352][06909] Updated weights for policy 0, policy_version 153013 (0.0037) [2024-06-28 04:58:23,548][06909] Updated weights for policy 0, policy_version 153023 (0.0032) [2024-06-28 04:58:23,850][06674] Fps is (10 sec: 44245.0, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 2507128832. Throughput: 0: 44002.6. Samples: 2410083600. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 04:58:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:58:27,813][06909] Updated weights for policy 0, policy_version 153033 (0.0029) [2024-06-28 04:58:28,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43965.2, 300 sec: 43931.3). Total num frames: 2507341824. Throughput: 0: 43958.7. Samples: 2410224740. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 04:58:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:58:30,683][06909] Updated weights for policy 0, policy_version 153043 (0.0030) [2024-06-28 04:58:33,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2507554816. Throughput: 0: 44201.4. Samples: 2410495820. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 04:58:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:58:34,904][06909] Updated weights for policy 0, policy_version 153053 (0.0038) [2024-06-28 04:58:38,063][06909] Updated weights for policy 0, policy_version 153063 (0.0030) [2024-06-28 04:58:38,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 2507800576. Throughput: 0: 44193.8. Samples: 2410749920. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 04:58:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:58:42,294][06909] Updated weights for policy 0, policy_version 153073 (0.0025) [2024-06-28 04:58:43,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 2508013568. Throughput: 0: 44166.0. Samples: 2410894900. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 04:58:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:58:45,433][06909] Updated weights for policy 0, policy_version 153083 (0.0038) [2024-06-28 04:58:48,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2508210176. Throughput: 0: 44084.7. Samples: 2411148860. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 04:58:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 04:58:50,029][06909] Updated weights for policy 0, policy_version 153093 (0.0032) [2024-06-28 04:58:53,077][06909] Updated weights for policy 0, policy_version 153103 (0.0036) [2024-06-28 04:58:53,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.8, 300 sec: 44098.3). Total num frames: 2508455936. Throughput: 0: 44177.2. Samples: 2411408740. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 04:58:53,854][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:58:57,286][06909] Updated weights for policy 0, policy_version 153113 (0.0025) [2024-06-28 04:58:58,850][06674] Fps is (10 sec: 45874.5, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2508668928. Throughput: 0: 44248.4. Samples: 2411549740. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 04:58:58,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:59:00,569][06909] Updated weights for policy 0, policy_version 153123 (0.0027) [2024-06-28 04:59:03,852][06674] Fps is (10 sec: 42590.0, 60 sec: 44235.5, 300 sec: 44097.7). Total num frames: 2508881920. Throughput: 0: 44308.7. Samples: 2411814440. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 04:59:03,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:59:04,863][06909] Updated weights for policy 0, policy_version 153133 (0.0025) [2024-06-28 04:59:08,257][06909] Updated weights for policy 0, policy_version 153143 (0.0039) [2024-06-28 04:59:08,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2509111296. Throughput: 0: 44315.2. Samples: 2412077780. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 04:59:08,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 04:59:12,135][06909] Updated weights for policy 0, policy_version 153153 (0.0042) [2024-06-28 04:59:13,850][06674] Fps is (10 sec: 45884.3, 60 sec: 44238.2, 300 sec: 44042.4). Total num frames: 2509340672. Throughput: 0: 44260.4. Samples: 2412216460. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 04:59:13,853][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:59:15,534][06909] Updated weights for policy 0, policy_version 153163 (0.0023) [2024-06-28 04:59:18,850][06674] Fps is (10 sec: 42598.7, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2509537280. Throughput: 0: 43977.3. Samples: 2412474800. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 04:59:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:59:19,829][06909] Updated weights for policy 0, policy_version 153173 (0.0040) [2024-06-28 04:59:20,345][06887] Signal inference workers to stop experience collection... (34400 times) [2024-06-28 04:59:20,353][06887] Signal inference workers to resume experience collection... (34400 times) [2024-06-28 04:59:20,371][06909] InferenceWorker_p0-w0: stopping experience collection (34400 times) [2024-06-28 04:59:20,371][06909] InferenceWorker_p0-w0: resuming experience collection (34400 times) [2024-06-28 04:59:22,982][06909] Updated weights for policy 0, policy_version 153183 (0.0023) [2024-06-28 04:59:23,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 2509750272. Throughput: 0: 43994.2. Samples: 2412729660. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 04:59:23,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 04:59:27,315][06909] Updated weights for policy 0, policy_version 153193 (0.0036) [2024-06-28 04:59:28,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2509979648. Throughput: 0: 43795.1. Samples: 2412865680. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 04:59:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 04:59:30,611][06909] Updated weights for policy 0, policy_version 153203 (0.0041) [2024-06-28 04:59:33,856][06674] Fps is (10 sec: 45848.1, 60 sec: 44232.4, 300 sec: 44152.6). Total num frames: 2510209024. Throughput: 0: 44157.7. Samples: 2413136220. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 04:59:33,865][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:59:34,586][06909] Updated weights for policy 0, policy_version 153213 (0.0031) [2024-06-28 04:59:38,087][06909] Updated weights for policy 0, policy_version 153223 (0.0029) [2024-06-28 04:59:38,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.7, 300 sec: 44097.9). Total num frames: 2510422016. Throughput: 0: 44221.4. Samples: 2413398700. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 04:59:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:59:42,003][06909] Updated weights for policy 0, policy_version 153233 (0.0040) [2024-06-28 04:59:43,850][06674] Fps is (10 sec: 45902.2, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2510667776. Throughput: 0: 44097.0. Samples: 2413534100. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 04:59:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 04:59:45,350][06909] Updated weights for policy 0, policy_version 153243 (0.0034) [2024-06-28 04:59:48,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 2510848000. Throughput: 0: 44014.4. Samples: 2413795000. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 04:59:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 04:59:49,040][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000153252_2510880768.pth... [2024-06-28 04:59:49,096][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000152605_2500280320.pth [2024-06-28 04:59:49,447][06909] Updated weights for policy 0, policy_version 153253 (0.0028) [2024-06-28 04:59:52,545][06909] Updated weights for policy 0, policy_version 153263 (0.0044) [2024-06-28 04:59:53,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2511093760. Throughput: 0: 43997.2. Samples: 2414057660. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 04:59:53,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 04:59:56,913][06909] Updated weights for policy 0, policy_version 153273 (0.0035) [2024-06-28 04:59:58,850][06674] Fps is (10 sec: 47513.3, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2511323136. Throughput: 0: 43799.5. Samples: 2414187440. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 04:59:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:00:00,086][06909] Updated weights for policy 0, policy_version 153283 (0.0023) [2024-06-28 05:00:03,850][06674] Fps is (10 sec: 40960.6, 60 sec: 43692.2, 300 sec: 43986.9). Total num frames: 2511503360. Throughput: 0: 43944.4. Samples: 2414452300. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 05:00:03,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 05:00:04,376][06909] Updated weights for policy 0, policy_version 153293 (0.0034) [2024-06-28 05:00:08,160][06909] Updated weights for policy 0, policy_version 153303 (0.0024) [2024-06-28 05:00:08,850][06674] Fps is (10 sec: 40960.7, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2511732736. Throughput: 0: 44111.2. Samples: 2414714660. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 05:00:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:00:11,770][06909] Updated weights for policy 0, policy_version 153313 (0.0026) [2024-06-28 05:00:13,850][06674] Fps is (10 sec: 49152.3, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 2511994880. Throughput: 0: 44070.3. Samples: 2414848840. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 05:00:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:00:15,370][06909] Updated weights for policy 0, policy_version 153323 (0.0025) [2024-06-28 05:00:18,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2512175104. Throughput: 0: 43878.3. Samples: 2415110480. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 05:00:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:00:19,163][06909] Updated weights for policy 0, policy_version 153333 (0.0033) [2024-06-28 05:00:22,702][06909] Updated weights for policy 0, policy_version 153343 (0.0034) [2024-06-28 05:00:23,850][06674] Fps is (10 sec: 39320.9, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2512388096. Throughput: 0: 43965.7. Samples: 2415377160. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 05:00:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:00:26,653][06909] Updated weights for policy 0, policy_version 153353 (0.0031) [2024-06-28 05:00:28,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 2512650240. Throughput: 0: 43788.9. Samples: 2415504600. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 05:00:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:00:29,840][06909] Updated weights for policy 0, policy_version 153363 (0.0038) [2024-06-28 05:00:33,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43694.9, 300 sec: 43931.3). Total num frames: 2512830464. Throughput: 0: 43971.5. Samples: 2415773720. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 05:00:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:00:34,137][06909] Updated weights for policy 0, policy_version 153373 (0.0027) [2024-06-28 05:00:37,555][06909] Updated weights for policy 0, policy_version 153383 (0.0038) [2024-06-28 05:00:38,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2513059840. Throughput: 0: 44024.5. Samples: 2416038760. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 05:00:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:00:41,352][06909] Updated weights for policy 0, policy_version 153393 (0.0034) [2024-06-28 05:00:43,350][06887] Signal inference workers to stop experience collection... (34450 times) [2024-06-28 05:00:43,383][06909] InferenceWorker_p0-w0: stopping experience collection (34450 times) [2024-06-28 05:00:43,401][06887] Signal inference workers to resume experience collection... (34450 times) [2024-06-28 05:00:43,402][06909] InferenceWorker_p0-w0: resuming experience collection (34450 times) [2024-06-28 05:00:43,850][06674] Fps is (10 sec: 49152.8, 60 sec: 44236.9, 300 sec: 44209.0). Total num frames: 2513321984. Throughput: 0: 43988.2. Samples: 2416166900. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 05:00:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:00:45,017][06909] Updated weights for policy 0, policy_version 153403 (0.0029) [2024-06-28 05:00:48,651][06909] Updated weights for policy 0, policy_version 153413 (0.0044) [2024-06-28 05:00:48,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44509.8, 300 sec: 43986.9). Total num frames: 2513518592. Throughput: 0: 44107.4. Samples: 2416437140. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 05:00:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:00:52,367][06909] Updated weights for policy 0, policy_version 153423 (0.0039) [2024-06-28 05:00:53,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2513731584. Throughput: 0: 44279.5. Samples: 2416707240. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 05:00:53,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:00:56,039][06909] Updated weights for policy 0, policy_version 153433 (0.0048) [2024-06-28 05:00:58,852][06674] Fps is (10 sec: 45866.4, 60 sec: 44235.4, 300 sec: 44153.2). Total num frames: 2513977344. Throughput: 0: 44109.9. Samples: 2416833880. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 05:00:58,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:00:59,766][06909] Updated weights for policy 0, policy_version 153443 (0.0029) [2024-06-28 05:01:03,669][06909] Updated weights for policy 0, policy_version 153453 (0.0042) [2024-06-28 05:01:03,850][06674] Fps is (10 sec: 45874.6, 60 sec: 44782.8, 300 sec: 43986.9). Total num frames: 2514190336. Throughput: 0: 44281.6. Samples: 2417103160. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 05:01:03,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:01:07,182][06909] Updated weights for policy 0, policy_version 153463 (0.0033) [2024-06-28 05:01:08,850][06674] Fps is (10 sec: 39329.3, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 2514370560. Throughput: 0: 44205.8. Samples: 2417366420. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 05:01:08,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:01:11,043][06909] Updated weights for policy 0, policy_version 153473 (0.0042) [2024-06-28 05:01:13,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44236.7, 300 sec: 44209.0). Total num frames: 2514649088. Throughput: 0: 44147.9. Samples: 2417491260. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 05:01:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:01:14,951][06909] Updated weights for policy 0, policy_version 153483 (0.0028) [2024-06-28 05:01:18,720][06909] Updated weights for policy 0, policy_version 153493 (0.0053) [2024-06-28 05:01:18,850][06674] Fps is (10 sec: 47514.4, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 2514845696. Throughput: 0: 44054.4. Samples: 2417756160. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 05:01:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:01:22,717][06909] Updated weights for policy 0, policy_version 153503 (0.0035) [2024-06-28 05:01:23,852][06674] Fps is (10 sec: 37675.5, 60 sec: 43962.3, 300 sec: 43986.9). Total num frames: 2515025920. Throughput: 0: 43946.0. Samples: 2418016420. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 05:01:23,853][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:01:25,985][06909] Updated weights for policy 0, policy_version 153513 (0.0041) [2024-06-28 05:01:28,850][06674] Fps is (10 sec: 45874.4, 60 sec: 44236.7, 300 sec: 44209.3). Total num frames: 2515304448. Throughput: 0: 44015.0. Samples: 2418147580. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 05:01:28,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:01:30,040][06909] Updated weights for policy 0, policy_version 153523 (0.0023) [2024-06-28 05:01:33,499][06909] Updated weights for policy 0, policy_version 153533 (0.0043) [2024-06-28 05:01:33,850][06674] Fps is (10 sec: 47523.6, 60 sec: 44510.0, 300 sec: 43987.8). Total num frames: 2515501056. Throughput: 0: 44088.6. Samples: 2418421120. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 05:01:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:01:37,584][06909] Updated weights for policy 0, policy_version 153543 (0.0028) [2024-06-28 05:01:38,856][06674] Fps is (10 sec: 37660.7, 60 sec: 43686.3, 300 sec: 43930.4). Total num frames: 2515681280. Throughput: 0: 43931.9. Samples: 2418684440. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 05:01:38,857][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:01:40,874][06909] Updated weights for policy 0, policy_version 153553 (0.0038) [2024-06-28 05:01:43,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2515959808. Throughput: 0: 43885.6. Samples: 2418808640. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 05:01:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:01:44,841][06909] Updated weights for policy 0, policy_version 153563 (0.0036) [2024-06-28 05:01:48,014][06909] Updated weights for policy 0, policy_version 153573 (0.0034) [2024-06-28 05:01:48,850][06674] Fps is (10 sec: 49181.9, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 2516172800. Throughput: 0: 44120.1. Samples: 2419088560. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 05:01:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:01:48,863][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000153575_2516172800.pth... [2024-06-28 05:01:48,913][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000152928_2505572352.pth [2024-06-28 05:01:52,260][06909] Updated weights for policy 0, policy_version 153583 (0.0035) [2024-06-28 05:01:53,850][06674] Fps is (10 sec: 39321.9, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2516353024. Throughput: 0: 44034.8. Samples: 2419347980. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 05:01:53,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 05:01:55,603][06909] Updated weights for policy 0, policy_version 153593 (0.0024) [2024-06-28 05:01:58,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43965.2, 300 sec: 44153.8). Total num frames: 2516615168. Throughput: 0: 44060.9. Samples: 2419474000. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 05:01:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:01:59,325][06909] Updated weights for policy 0, policy_version 153603 (0.0029) [2024-06-28 05:02:02,934][06909] Updated weights for policy 0, policy_version 153613 (0.0034) [2024-06-28 05:02:03,850][06674] Fps is (10 sec: 49151.6, 60 sec: 44236.9, 300 sec: 44097.9). Total num frames: 2516844544. Throughput: 0: 44447.0. Samples: 2419756280. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 05:02:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:02:06,551][06909] Updated weights for policy 0, policy_version 153623 (0.0032) [2024-06-28 05:02:08,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 2517041152. Throughput: 0: 44493.6. Samples: 2420018540. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 05:02:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:02:10,571][06909] Updated weights for policy 0, policy_version 153633 (0.0029) [2024-06-28 05:02:10,969][06887] Signal inference workers to stop experience collection... (34500 times) [2024-06-28 05:02:10,970][06887] Signal inference workers to resume experience collection... (34500 times) [2024-06-28 05:02:10,983][06909] InferenceWorker_p0-w0: stopping experience collection (34500 times) [2024-06-28 05:02:11,007][06909] InferenceWorker_p0-w0: resuming experience collection (34500 times) [2024-06-28 05:02:13,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43690.6, 300 sec: 44097.9). Total num frames: 2517270528. Throughput: 0: 44390.2. Samples: 2420145140. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 05:02:13,854][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:02:14,109][06909] Updated weights for policy 0, policy_version 153643 (0.0031) [2024-06-28 05:02:17,749][06909] Updated weights for policy 0, policy_version 153653 (0.0037) [2024-06-28 05:02:18,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2517499904. Throughput: 0: 44336.9. Samples: 2420416280. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 05:02:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:02:21,322][06909] Updated weights for policy 0, policy_version 153663 (0.0025) [2024-06-28 05:02:23,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44511.4, 300 sec: 44042.7). Total num frames: 2517696512. Throughput: 0: 44445.9. Samples: 2420684240. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 05:02:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:02:25,004][06909] Updated weights for policy 0, policy_version 153673 (0.0041) [2024-06-28 05:02:28,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43690.6, 300 sec: 44097.9). Total num frames: 2517925888. Throughput: 0: 44344.3. Samples: 2420804140. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 05:02:28,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:02:29,000][06909] Updated weights for policy 0, policy_version 153683 (0.0041) [2024-06-28 05:02:32,574][06909] Updated weights for policy 0, policy_version 153693 (0.0031) [2024-06-28 05:02:33,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2518155264. Throughput: 0: 43966.2. Samples: 2421067040. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 05:02:33,860][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:02:36,316][06909] Updated weights for policy 0, policy_version 153703 (0.0028) [2024-06-28 05:02:38,850][06674] Fps is (10 sec: 42599.1, 60 sec: 44514.4, 300 sec: 44042.4). Total num frames: 2518351872. Throughput: 0: 44187.9. Samples: 2421336440. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 05:02:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:02:40,373][06909] Updated weights for policy 0, policy_version 153713 (0.0029) [2024-06-28 05:02:43,496][06909] Updated weights for policy 0, policy_version 153723 (0.0024) [2024-06-28 05:02:43,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2518597632. Throughput: 0: 44240.5. Samples: 2421464820. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 05:02:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:02:47,590][06909] Updated weights for policy 0, policy_version 153733 (0.0040) [2024-06-28 05:02:48,850][06674] Fps is (10 sec: 47512.6, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2518827008. Throughput: 0: 43879.4. Samples: 2421730860. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 05:02:48,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:02:51,466][06909] Updated weights for policy 0, policy_version 153743 (0.0029) [2024-06-28 05:02:53,850][06674] Fps is (10 sec: 40960.0, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2519007232. Throughput: 0: 44023.6. Samples: 2421999600. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 05:02:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:02:54,799][06909] Updated weights for policy 0, policy_version 153753 (0.0035) [2024-06-28 05:02:58,637][06909] Updated weights for policy 0, policy_version 153763 (0.0029) [2024-06-28 05:02:58,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.6, 300 sec: 44153.5). Total num frames: 2519252992. Throughput: 0: 44135.4. Samples: 2422131240. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 05:02:58,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:03:01,983][06909] Updated weights for policy 0, policy_version 153773 (0.0040) [2024-06-28 05:03:03,850][06674] Fps is (10 sec: 47513.7, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2519482368. Throughput: 0: 43999.6. Samples: 2422396260. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 05:03:03,856][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:03:06,157][06909] Updated weights for policy 0, policy_version 153783 (0.0034) [2024-06-28 05:03:08,850][06674] Fps is (10 sec: 44238.1, 60 sec: 44236.9, 300 sec: 44098.2). Total num frames: 2519695360. Throughput: 0: 43988.6. Samples: 2422663720. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 05:03:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:03:09,913][06909] Updated weights for policy 0, policy_version 153793 (0.0029) [2024-06-28 05:03:13,326][06909] Updated weights for policy 0, policy_version 153803 (0.0039) [2024-06-28 05:03:13,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2519908352. Throughput: 0: 44231.2. Samples: 2422794540. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 05:03:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:03:17,266][06909] Updated weights for policy 0, policy_version 153813 (0.0039) [2024-06-28 05:03:18,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2520154112. Throughput: 0: 44350.3. Samples: 2423062800. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 05:03:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:03:20,980][06909] Updated weights for policy 0, policy_version 153823 (0.0022) [2024-06-28 05:03:23,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 2520367104. Throughput: 0: 44269.2. Samples: 2423328560. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 05:03:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:03:24,552][06909] Updated weights for policy 0, policy_version 153833 (0.0030) [2024-06-28 05:03:28,175][06909] Updated weights for policy 0, policy_version 153843 (0.0029) [2024-06-28 05:03:28,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 2520563712. Throughput: 0: 44163.1. Samples: 2423452160. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 05:03:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:03:29,903][06887] Signal inference workers to stop experience collection... (34550 times) [2024-06-28 05:03:29,944][06909] InferenceWorker_p0-w0: stopping experience collection (34550 times) [2024-06-28 05:03:29,958][06887] Signal inference workers to resume experience collection... (34550 times) [2024-06-28 05:03:29,960][06909] InferenceWorker_p0-w0: resuming experience collection (34550 times) [2024-06-28 05:03:31,819][06909] Updated weights for policy 0, policy_version 153853 (0.0027) [2024-06-28 05:03:33,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2520809472. Throughput: 0: 44292.1. Samples: 2423724000. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 05:03:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:03:35,482][06909] Updated weights for policy 0, policy_version 153863 (0.0044) [2024-06-28 05:03:38,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 2521022464. Throughput: 0: 44232.1. Samples: 2423990040. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 05:03:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:03:39,188][06909] Updated weights for policy 0, policy_version 153873 (0.0027) [2024-06-28 05:03:43,285][06909] Updated weights for policy 0, policy_version 153883 (0.0040) [2024-06-28 05:03:43,856][06674] Fps is (10 sec: 40935.2, 60 sec: 43686.2, 300 sec: 44097.0). Total num frames: 2521219072. Throughput: 0: 44143.1. Samples: 2424117940. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 05:03:43,857][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:03:46,880][06909] Updated weights for policy 0, policy_version 153893 (0.0043) [2024-06-28 05:03:48,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.9, 300 sec: 44098.0). Total num frames: 2521464832. Throughput: 0: 44119.1. Samples: 2424381620. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 05:03:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:03:48,878][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000153899_2521481216.pth... [2024-06-28 05:03:48,943][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000153252_2510880768.pth [2024-06-28 05:03:50,484][06909] Updated weights for policy 0, policy_version 153903 (0.0036) [2024-06-28 05:03:53,850][06674] Fps is (10 sec: 45902.9, 60 sec: 44509.8, 300 sec: 44098.0). Total num frames: 2521677824. Throughput: 0: 44069.6. Samples: 2424646860. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-28 05:03:53,853][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:03:54,444][06909] Updated weights for policy 0, policy_version 153913 (0.0033) [2024-06-28 05:03:58,304][06909] Updated weights for policy 0, policy_version 153923 (0.0027) [2024-06-28 05:03:58,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43963.9, 300 sec: 44098.2). Total num frames: 2521890816. Throughput: 0: 43950.2. Samples: 2424772300. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-28 05:03:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:04:01,856][06909] Updated weights for policy 0, policy_version 153933 (0.0030) [2024-06-28 05:04:03,850][06674] Fps is (10 sec: 47513.7, 60 sec: 44509.8, 300 sec: 44209.0). Total num frames: 2522152960. Throughput: 0: 43887.4. Samples: 2425037740. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-28 05:04:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:04:05,512][06909] Updated weights for policy 0, policy_version 153943 (0.0023) [2024-06-28 05:04:08,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2522349568. Throughput: 0: 44044.0. Samples: 2425310540. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-28 05:04:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:04:09,057][06909] Updated weights for policy 0, policy_version 153953 (0.0033) [2024-06-28 05:04:12,703][06909] Updated weights for policy 0, policy_version 153963 (0.0034) [2024-06-28 05:04:13,850][06674] Fps is (10 sec: 39321.4, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2522546176. Throughput: 0: 44181.2. Samples: 2425440320. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-28 05:04:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:04:16,776][06909] Updated weights for policy 0, policy_version 153973 (0.0048) [2024-06-28 05:04:18,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.7, 300 sec: 44264.6). Total num frames: 2522808320. Throughput: 0: 44002.7. Samples: 2425704120. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-28 05:04:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:04:20,448][06909] Updated weights for policy 0, policy_version 153983 (0.0040) [2024-06-28 05:04:23,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 2522988544. Throughput: 0: 43993.3. Samples: 2425969740. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-28 05:04:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:04:24,431][06909] Updated weights for policy 0, policy_version 153993 (0.0034) [2024-06-28 05:04:27,770][06909] Updated weights for policy 0, policy_version 154003 (0.0036) [2024-06-28 05:04:28,850][06674] Fps is (10 sec: 40960.3, 60 sec: 44236.8, 300 sec: 44098.8). Total num frames: 2523217920. Throughput: 0: 43956.2. Samples: 2426095700. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-28 05:04:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:04:31,618][06909] Updated weights for policy 0, policy_version 154013 (0.0035) [2024-06-28 05:04:33,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44236.9, 300 sec: 44209.0). Total num frames: 2523463680. Throughput: 0: 44065.3. Samples: 2426364560. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-28 05:04:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:04:35,361][06909] Updated weights for policy 0, policy_version 154023 (0.0042) [2024-06-28 05:04:38,852][06674] Fps is (10 sec: 44227.5, 60 sec: 43962.2, 300 sec: 44042.1). Total num frames: 2523660288. Throughput: 0: 44095.4. Samples: 2426631240. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-28 05:04:38,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:04:39,095][06909] Updated weights for policy 0, policy_version 154033 (0.0030) [2024-06-28 05:04:42,510][06909] Updated weights for policy 0, policy_version 154043 (0.0027) [2024-06-28 05:04:43,850][06674] Fps is (10 sec: 40959.7, 60 sec: 44241.3, 300 sec: 44153.5). Total num frames: 2523873280. Throughput: 0: 44278.7. Samples: 2426764840. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-28 05:04:43,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 05:04:46,270][06909] Updated weights for policy 0, policy_version 154053 (0.0033) [2024-06-28 05:04:48,850][06674] Fps is (10 sec: 45884.8, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2524119040. Throughput: 0: 44284.5. Samples: 2427030540. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-28 05:04:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:04:50,207][06909] Updated weights for policy 0, policy_version 154063 (0.0036) [2024-06-28 05:04:53,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43963.9, 300 sec: 44042.4). Total num frames: 2524315648. Throughput: 0: 44041.1. Samples: 2427292380. Policy #0 lag: (min: 0.0, avg: 9.3, max: 23.0) [2024-06-28 05:04:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:04:54,112][06909] Updated weights for policy 0, policy_version 154073 (0.0037) [2024-06-28 05:04:57,850][06909] Updated weights for policy 0, policy_version 154083 (0.0037) [2024-06-28 05:04:58,850][06674] Fps is (10 sec: 44236.2, 60 sec: 44509.8, 300 sec: 44264.6). Total num frames: 2524561408. Throughput: 0: 44060.4. Samples: 2427423040. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 05:04:58,851][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 05:05:01,274][06909] Updated weights for policy 0, policy_version 154093 (0.0031) [2024-06-28 05:05:02,981][06887] Signal inference workers to stop experience collection... (34600 times) [2024-06-28 05:05:02,982][06887] Signal inference workers to resume experience collection... (34600 times) [2024-06-28 05:05:03,036][06909] InferenceWorker_p0-w0: stopping experience collection (34600 times) [2024-06-28 05:05:03,036][06909] InferenceWorker_p0-w0: resuming experience collection (34600 times) [2024-06-28 05:05:03,852][06674] Fps is (10 sec: 47503.5, 60 sec: 43962.3, 300 sec: 44264.3). Total num frames: 2524790784. Throughput: 0: 44249.1. Samples: 2427695420. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 05:05:03,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:05:05,182][06909] Updated weights for policy 0, policy_version 154103 (0.0051) [2024-06-28 05:05:08,550][06909] Updated weights for policy 0, policy_version 154113 (0.0034) [2024-06-28 05:05:08,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2524987392. Throughput: 0: 44190.2. Samples: 2427958300. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 05:05:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:05:12,485][06909] Updated weights for policy 0, policy_version 154123 (0.0034) [2024-06-28 05:05:13,850][06674] Fps is (10 sec: 40968.2, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2525200384. Throughput: 0: 44221.3. Samples: 2428085660. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 05:05:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:05:16,073][06909] Updated weights for policy 0, policy_version 154133 (0.0034) [2024-06-28 05:05:18,850][06674] Fps is (10 sec: 45874.4, 60 sec: 43963.6, 300 sec: 44264.6). Total num frames: 2525446144. Throughput: 0: 44202.0. Samples: 2428353660. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 05:05:18,850][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 05:05:19,700][06909] Updated weights for policy 0, policy_version 154143 (0.0025) [2024-06-28 05:05:23,317][06909] Updated weights for policy 0, policy_version 154153 (0.0040) [2024-06-28 05:05:23,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44509.8, 300 sec: 44097.9). Total num frames: 2525659136. Throughput: 0: 44180.6. Samples: 2428619280. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 05:05:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:05:27,301][06909] Updated weights for policy 0, policy_version 154163 (0.0033) [2024-06-28 05:05:28,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44509.7, 300 sec: 44264.6). Total num frames: 2525888512. Throughput: 0: 44142.1. Samples: 2428751240. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 05:05:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:05:30,958][06909] Updated weights for policy 0, policy_version 154173 (0.0041) [2024-06-28 05:05:33,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.7, 300 sec: 44264.6). Total num frames: 2526117888. Throughput: 0: 44213.7. Samples: 2429020160. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 05:05:33,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:05:34,928][06909] Updated weights for policy 0, policy_version 154183 (0.0037) [2024-06-28 05:05:38,334][06909] Updated weights for policy 0, policy_version 154193 (0.0027) [2024-06-28 05:05:38,850][06674] Fps is (10 sec: 44237.4, 60 sec: 44511.4, 300 sec: 44097.9). Total num frames: 2526330880. Throughput: 0: 44223.9. Samples: 2429282460. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 05:05:38,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:05:42,157][06909] Updated weights for policy 0, policy_version 154203 (0.0031) [2024-06-28 05:05:43,850][06674] Fps is (10 sec: 40960.2, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2526527488. Throughput: 0: 44145.9. Samples: 2429409600. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 05:05:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:05:45,637][06909] Updated weights for policy 0, policy_version 154213 (0.0023) [2024-06-28 05:05:48,852][06674] Fps is (10 sec: 42589.6, 60 sec: 43962.2, 300 sec: 44153.2). Total num frames: 2526756864. Throughput: 0: 43942.1. Samples: 2429672820. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 05:05:48,853][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:05:48,860][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000154222_2526773248.pth... [2024-06-28 05:05:48,910][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000153575_2516172800.pth [2024-06-28 05:05:49,848][06909] Updated weights for policy 0, policy_version 154223 (0.0039) [2024-06-28 05:05:53,249][06909] Updated weights for policy 0, policy_version 154233 (0.0035) [2024-06-28 05:05:53,856][06674] Fps is (10 sec: 45847.4, 60 sec: 44505.3, 300 sec: 44097.4). Total num frames: 2526986240. Throughput: 0: 44061.6. Samples: 2429941340. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 05:05:53,857][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:05:57,138][06909] Updated weights for policy 0, policy_version 154243 (0.0035) [2024-06-28 05:05:58,850][06674] Fps is (10 sec: 42607.2, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2527182848. Throughput: 0: 44060.9. Samples: 2430068400. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 05:05:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:06:00,455][06909] Updated weights for policy 0, policy_version 154253 (0.0031) [2024-06-28 05:06:03,850][06674] Fps is (10 sec: 42624.4, 60 sec: 43692.2, 300 sec: 44209.1). Total num frames: 2527412224. Throughput: 0: 43994.9. Samples: 2430333420. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-28 05:06:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:06:04,784][06909] Updated weights for policy 0, policy_version 154263 (0.0045) [2024-06-28 05:06:08,014][06909] Updated weights for policy 0, policy_version 154273 (0.0043) [2024-06-28 05:06:08,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2527625216. Throughput: 0: 43939.7. Samples: 2430596560. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-28 05:06:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:06:12,062][06909] Updated weights for policy 0, policy_version 154283 (0.0033) [2024-06-28 05:06:13,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2527854592. Throughput: 0: 43890.9. Samples: 2430726320. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-28 05:06:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:06:15,479][06909] Updated weights for policy 0, policy_version 154293 (0.0032) [2024-06-28 05:06:18,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.9, 300 sec: 44264.9). Total num frames: 2528083968. Throughput: 0: 43604.1. Samples: 2430982340. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-28 05:06:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 05:06:19,303][06909] Updated weights for policy 0, policy_version 154303 (0.0038) [2024-06-28 05:06:23,043][06909] Updated weights for policy 0, policy_version 154313 (0.0028) [2024-06-28 05:06:23,857][06674] Fps is (10 sec: 44205.6, 60 sec: 43958.6, 300 sec: 44041.4). Total num frames: 2528296960. Throughput: 0: 43803.9. Samples: 2431253940. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-28 05:06:23,857][06674] Avg episode reward: [(0, '0.418')] [2024-06-28 05:06:26,968][06909] Updated weights for policy 0, policy_version 154323 (0.0027) [2024-06-28 05:06:28,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.8, 300 sec: 44097.9). Total num frames: 2528509952. Throughput: 0: 44005.8. Samples: 2431389860. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-28 05:06:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 05:06:30,470][06909] Updated weights for policy 0, policy_version 154333 (0.0029) [2024-06-28 05:06:33,850][06674] Fps is (10 sec: 42628.4, 60 sec: 43417.6, 300 sec: 44209.9). Total num frames: 2528722944. Throughput: 0: 44047.4. Samples: 2431654860. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-28 05:06:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:06:34,208][06909] Updated weights for policy 0, policy_version 154343 (0.0028) [2024-06-28 05:06:37,873][06909] Updated weights for policy 0, policy_version 154353 (0.0020) [2024-06-28 05:06:38,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2528952320. Throughput: 0: 43984.0. Samples: 2431920360. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-28 05:06:38,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:06:41,928][06909] Updated weights for policy 0, policy_version 154363 (0.0021) [2024-06-28 05:06:43,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2529181696. Throughput: 0: 44251.2. Samples: 2432059700. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-28 05:06:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:06:45,284][06909] Updated weights for policy 0, policy_version 154373 (0.0036) [2024-06-28 05:06:47,300][06887] Signal inference workers to stop experience collection... (34650 times) [2024-06-28 05:06:47,352][06887] Signal inference workers to resume experience collection... (34650 times) [2024-06-28 05:06:47,353][06909] InferenceWorker_p0-w0: stopping experience collection (34650 times) [2024-06-28 05:06:47,371][06909] InferenceWorker_p0-w0: resuming experience collection (34650 times) [2024-06-28 05:06:48,850][06674] Fps is (10 sec: 40960.7, 60 sec: 43419.1, 300 sec: 44097.9). Total num frames: 2529361920. Throughput: 0: 43962.2. Samples: 2432311720. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-28 05:06:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:06:49,212][06909] Updated weights for policy 0, policy_version 154383 (0.0045) [2024-06-28 05:06:52,666][06909] Updated weights for policy 0, policy_version 154393 (0.0040) [2024-06-28 05:06:53,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43695.0, 300 sec: 44042.4). Total num frames: 2529607680. Throughput: 0: 43866.1. Samples: 2432570540. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-28 05:06:53,856][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:06:56,565][06909] Updated weights for policy 0, policy_version 154403 (0.0029) [2024-06-28 05:06:58,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2529820672. Throughput: 0: 43959.5. Samples: 2432704500. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-28 05:06:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:07:00,279][06909] Updated weights for policy 0, policy_version 154413 (0.0034) [2024-06-28 05:07:03,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2530033664. Throughput: 0: 44130.7. Samples: 2432968220. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-28 05:07:03,856][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:07:04,472][06909] Updated weights for policy 0, policy_version 154423 (0.0036) [2024-06-28 05:07:07,556][06909] Updated weights for policy 0, policy_version 154433 (0.0038) [2024-06-28 05:07:08,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2530279424. Throughput: 0: 44063.3. Samples: 2433236480. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 05:07:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:07:11,606][06909] Updated weights for policy 0, policy_version 154443 (0.0027) [2024-06-28 05:07:13,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2530492416. Throughput: 0: 44048.5. Samples: 2433372040. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 05:07:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:07:14,797][06909] Updated weights for policy 0, policy_version 154453 (0.0036) [2024-06-28 05:07:18,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 2530705408. Throughput: 0: 43942.2. Samples: 2433632260. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 05:07:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:07:19,139][06909] Updated weights for policy 0, policy_version 154463 (0.0026) [2024-06-28 05:07:22,202][06909] Updated weights for policy 0, policy_version 154473 (0.0034) [2024-06-28 05:07:23,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43968.9, 300 sec: 44098.0). Total num frames: 2530934784. Throughput: 0: 43976.2. Samples: 2433899280. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 05:07:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:07:26,437][06909] Updated weights for policy 0, policy_version 154483 (0.0036) [2024-06-28 05:07:28,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2531164160. Throughput: 0: 43816.4. Samples: 2434031440. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 05:07:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:07:30,217][06909] Updated weights for policy 0, policy_version 154493 (0.0027) [2024-06-28 05:07:33,673][06909] Updated weights for policy 0, policy_version 154503 (0.0024) [2024-06-28 05:07:33,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2531377152. Throughput: 0: 44075.6. Samples: 2434295120. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 05:07:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:07:37,372][06909] Updated weights for policy 0, policy_version 154513 (0.0032) [2024-06-28 05:07:38,850][06674] Fps is (10 sec: 45875.7, 60 sec: 44510.0, 300 sec: 44153.5). Total num frames: 2531622912. Throughput: 0: 44147.3. Samples: 2434557160. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 05:07:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:07:41,242][06909] Updated weights for policy 0, policy_version 154523 (0.0031) [2024-06-28 05:07:43,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2531835904. Throughput: 0: 44289.8. Samples: 2434697540. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 05:07:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:07:44,492][06909] Updated weights for policy 0, policy_version 154533 (0.0024) [2024-06-28 05:07:48,409][06909] Updated weights for policy 0, policy_version 154543 (0.0034) [2024-06-28 05:07:48,850][06674] Fps is (10 sec: 40959.6, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 2532032512. Throughput: 0: 44268.9. Samples: 2434960320. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 05:07:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:07:48,863][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000154543_2532032512.pth... [2024-06-28 05:07:48,914][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000153899_2521481216.pth [2024-06-28 05:07:52,026][06909] Updated weights for policy 0, policy_version 154553 (0.0030) [2024-06-28 05:07:53,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 2532278272. Throughput: 0: 44241.4. Samples: 2435227340. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 05:07:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:07:56,118][06909] Updated weights for policy 0, policy_version 154563 (0.0023) [2024-06-28 05:07:58,850][06674] Fps is (10 sec: 45874.4, 60 sec: 44509.7, 300 sec: 44097.9). Total num frames: 2532491264. Throughput: 0: 44429.5. Samples: 2435371380. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 05:07:58,856][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:07:59,564][06909] Updated weights for policy 0, policy_version 154573 (0.0040) [2024-06-28 05:08:03,381][06909] Updated weights for policy 0, policy_version 154583 (0.0033) [2024-06-28 05:08:03,852][06674] Fps is (10 sec: 40951.5, 60 sec: 44235.3, 300 sec: 44042.1). Total num frames: 2532687872. Throughput: 0: 44201.5. Samples: 2435621420. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 05:08:03,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:08:06,985][06909] Updated weights for policy 0, policy_version 154593 (0.0050) [2024-06-28 05:08:08,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 2532933632. Throughput: 0: 44123.8. Samples: 2435884860. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 05:08:08,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:08:11,115][06909] Updated weights for policy 0, policy_version 154603 (0.0026) [2024-06-28 05:08:13,856][06674] Fps is (10 sec: 45857.2, 60 sec: 44232.3, 300 sec: 44041.5). Total num frames: 2533146624. Throughput: 0: 44296.3. Samples: 2436025040. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 05:08:13,856][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:08:14,178][06909] Updated weights for policy 0, policy_version 154613 (0.0032) [2024-06-28 05:08:18,379][06909] Updated weights for policy 0, policy_version 154623 (0.0031) [2024-06-28 05:08:18,850][06674] Fps is (10 sec: 42598.9, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2533359616. Throughput: 0: 44210.1. Samples: 2436284580. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 05:08:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:08:19,810][06887] Signal inference workers to stop experience collection... (34700 times) [2024-06-28 05:08:19,812][06887] Signal inference workers to resume experience collection... (34700 times) [2024-06-28 05:08:19,833][06909] InferenceWorker_p0-w0: stopping experience collection (34700 times) [2024-06-28 05:08:19,833][06909] InferenceWorker_p0-w0: resuming experience collection (34700 times) [2024-06-28 05:08:21,398][06909] Updated weights for policy 0, policy_version 154633 (0.0038) [2024-06-28 05:08:23,850][06674] Fps is (10 sec: 44263.6, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2533588992. Throughput: 0: 44178.2. Samples: 2436545180. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 05:08:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:08:25,787][06909] Updated weights for policy 0, policy_version 154643 (0.0029) [2024-06-28 05:08:28,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2533818368. Throughput: 0: 44021.6. Samples: 2436678520. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 05:08:28,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:08:29,135][06909] Updated weights for policy 0, policy_version 154653 (0.0033) [2024-06-28 05:08:33,469][06909] Updated weights for policy 0, policy_version 154663 (0.0031) [2024-06-28 05:08:33,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2533998592. Throughput: 0: 44105.4. Samples: 2436945060. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 05:08:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:08:36,446][06909] Updated weights for policy 0, policy_version 154673 (0.0020) [2024-06-28 05:08:38,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.6, 300 sec: 44154.4). Total num frames: 2534244352. Throughput: 0: 43907.0. Samples: 2437203160. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 05:08:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:08:40,683][06909] Updated weights for policy 0, policy_version 154683 (0.0032) [2024-06-28 05:08:43,850][06674] Fps is (10 sec: 47512.9, 60 sec: 43963.6, 300 sec: 44097.9). Total num frames: 2534473728. Throughput: 0: 43878.8. Samples: 2437345920. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 05:08:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:08:44,287][06909] Updated weights for policy 0, policy_version 154693 (0.0044) [2024-06-28 05:08:48,294][06909] Updated weights for policy 0, policy_version 154703 (0.0028) [2024-06-28 05:08:48,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 2534653952. Throughput: 0: 44020.5. Samples: 2437602260. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 05:08:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:08:51,556][06909] Updated weights for policy 0, policy_version 154713 (0.0026) [2024-06-28 05:08:53,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2534916096. Throughput: 0: 43929.0. Samples: 2437861660. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 05:08:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:08:55,674][06909] Updated weights for policy 0, policy_version 154723 (0.0041) [2024-06-28 05:08:58,850][06674] Fps is (10 sec: 49153.0, 60 sec: 44237.0, 300 sec: 44042.4). Total num frames: 2535145472. Throughput: 0: 43843.7. Samples: 2437997740. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 05:08:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:08:58,852][06909] Updated weights for policy 0, policy_version 154733 (0.0025) [2024-06-28 05:09:03,214][06909] Updated weights for policy 0, policy_version 154743 (0.0030) [2024-06-28 05:09:03,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44238.3, 300 sec: 44042.4). Total num frames: 2535342080. Throughput: 0: 43893.0. Samples: 2438259760. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 05:09:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:09:06,452][06909] Updated weights for policy 0, policy_version 154753 (0.0030) [2024-06-28 05:09:08,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.9, 300 sec: 44153.5). Total num frames: 2535571456. Throughput: 0: 44034.2. Samples: 2438526720. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 05:09:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:09:10,458][06909] Updated weights for policy 0, policy_version 154763 (0.0038) [2024-06-28 05:09:13,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43968.2, 300 sec: 43986.9). Total num frames: 2535784448. Throughput: 0: 44097.5. Samples: 2438662900. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2024-06-28 05:09:13,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 05:09:13,883][06909] Updated weights for policy 0, policy_version 154773 (0.0025) [2024-06-28 05:09:17,834][06909] Updated weights for policy 0, policy_version 154783 (0.0032) [2024-06-28 05:09:18,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2535981056. Throughput: 0: 44005.3. Samples: 2438925300. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2024-06-28 05:09:18,856][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:09:21,546][06909] Updated weights for policy 0, policy_version 154793 (0.0028) [2024-06-28 05:09:23,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2536226816. Throughput: 0: 44046.3. Samples: 2439185240. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2024-06-28 05:09:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:09:25,317][06909] Updated weights for policy 0, policy_version 154803 (0.0034) [2024-06-28 05:09:28,841][06909] Updated weights for policy 0, policy_version 154813 (0.0029) [2024-06-28 05:09:28,850][06674] Fps is (10 sec: 47513.6, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2536456192. Throughput: 0: 43904.1. Samples: 2439321600. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2024-06-28 05:09:28,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 05:09:32,663][06909] Updated weights for policy 0, policy_version 154823 (0.0026) [2024-06-28 05:09:33,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44509.8, 300 sec: 44098.3). Total num frames: 2536669184. Throughput: 0: 44233.0. Samples: 2439592740. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2024-06-28 05:09:33,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:09:36,291][06909] Updated weights for policy 0, policy_version 154833 (0.0031) [2024-06-28 05:09:38,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 2536898560. Throughput: 0: 44401.0. Samples: 2439859700. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2024-06-28 05:09:38,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 05:09:40,068][06909] Updated weights for policy 0, policy_version 154843 (0.0028) [2024-06-28 05:09:43,348][06887] Signal inference workers to stop experience collection... (34750 times) [2024-06-28 05:09:43,351][06887] Signal inference workers to resume experience collection... (34750 times) [2024-06-28 05:09:43,373][06909] InferenceWorker_p0-w0: stopping experience collection (34750 times) [2024-06-28 05:09:43,373][06909] InferenceWorker_p0-w0: resuming experience collection (34750 times) [2024-06-28 05:09:43,499][06909] Updated weights for policy 0, policy_version 154853 (0.0026) [2024-06-28 05:09:43,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2537111552. Throughput: 0: 44362.2. Samples: 2439994040. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2024-06-28 05:09:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:09:47,572][06909] Updated weights for policy 0, policy_version 154863 (0.0034) [2024-06-28 05:09:48,850][06674] Fps is (10 sec: 42597.7, 60 sec: 44509.9, 300 sec: 44097.9). Total num frames: 2537324544. Throughput: 0: 44396.3. Samples: 2440257600. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2024-06-28 05:09:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:09:48,857][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000154866_2537324544.pth... [2024-06-28 05:09:48,911][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000154222_2526773248.pth [2024-06-28 05:09:51,178][06909] Updated weights for policy 0, policy_version 154873 (0.0032) [2024-06-28 05:09:53,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2537570304. Throughput: 0: 44218.7. Samples: 2440516560. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2024-06-28 05:09:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:09:55,215][06909] Updated weights for policy 0, policy_version 154883 (0.0035) [2024-06-28 05:09:58,579][06909] Updated weights for policy 0, policy_version 154893 (0.0033) [2024-06-28 05:09:58,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43690.6, 300 sec: 43987.2). Total num frames: 2537766912. Throughput: 0: 44129.7. Samples: 2440648740. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2024-06-28 05:09:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:10:02,386][06909] Updated weights for policy 0, policy_version 154903 (0.0039) [2024-06-28 05:10:03,850][06674] Fps is (10 sec: 42598.2, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2537996288. Throughput: 0: 44246.2. Samples: 2440916380. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2024-06-28 05:10:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:10:05,991][06909] Updated weights for policy 0, policy_version 154913 (0.0029) [2024-06-28 05:10:08,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 2538225664. Throughput: 0: 44336.3. Samples: 2441180380. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2024-06-28 05:10:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:10:09,547][06909] Updated weights for policy 0, policy_version 154923 (0.0023) [2024-06-28 05:10:13,384][06909] Updated weights for policy 0, policy_version 154933 (0.0038) [2024-06-28 05:10:13,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2538438656. Throughput: 0: 44365.4. Samples: 2441318040. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2024-06-28 05:10:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:10:17,411][06909] Updated weights for policy 0, policy_version 154943 (0.0027) [2024-06-28 05:10:18,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 2538651648. Throughput: 0: 44297.3. Samples: 2441586120. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 05:10:18,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:10:20,894][06909] Updated weights for policy 0, policy_version 154953 (0.0029) [2024-06-28 05:10:23,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44509.8, 300 sec: 44098.0). Total num frames: 2538897408. Throughput: 0: 44104.4. Samples: 2441844400. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 05:10:23,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:10:24,665][06909] Updated weights for policy 0, policy_version 154963 (0.0040) [2024-06-28 05:10:28,172][06909] Updated weights for policy 0, policy_version 154973 (0.0033) [2024-06-28 05:10:28,852][06674] Fps is (10 sec: 45865.7, 60 sec: 44235.2, 300 sec: 44042.1). Total num frames: 2539110400. Throughput: 0: 44127.3. Samples: 2441979860. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 05:10:28,853][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:10:32,241][06909] Updated weights for policy 0, policy_version 154983 (0.0030) [2024-06-28 05:10:33,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2539307008. Throughput: 0: 44189.0. Samples: 2442246100. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 05:10:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:10:35,712][06909] Updated weights for policy 0, policy_version 154993 (0.0049) [2024-06-28 05:10:38,853][06674] Fps is (10 sec: 44234.1, 60 sec: 44234.8, 300 sec: 44153.1). Total num frames: 2539552768. Throughput: 0: 44212.0. Samples: 2442506220. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 05:10:38,853][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:10:39,374][06909] Updated weights for policy 0, policy_version 155003 (0.0033) [2024-06-28 05:10:42,938][06909] Updated weights for policy 0, policy_version 155013 (0.0036) [2024-06-28 05:10:43,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.8, 300 sec: 44098.3). Total num frames: 2539765760. Throughput: 0: 44386.7. Samples: 2442646140. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 05:10:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:10:46,842][06909] Updated weights for policy 0, policy_version 155023 (0.0022) [2024-06-28 05:10:48,850][06674] Fps is (10 sec: 42610.1, 60 sec: 44236.9, 300 sec: 44043.3). Total num frames: 2539978752. Throughput: 0: 44290.3. Samples: 2442909440. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 05:10:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:10:50,536][06909] Updated weights for policy 0, policy_version 155033 (0.0033) [2024-06-28 05:10:53,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2540208128. Throughput: 0: 44330.7. Samples: 2443175260. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 05:10:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:10:54,088][06909] Updated weights for policy 0, policy_version 155043 (0.0032) [2024-06-28 05:10:58,098][06909] Updated weights for policy 0, policy_version 155053 (0.0040) [2024-06-28 05:10:58,850][06674] Fps is (10 sec: 45874.5, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 2540437504. Throughput: 0: 44192.7. Samples: 2443306720. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 05:10:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:11:01,589][06909] Updated weights for policy 0, policy_version 155063 (0.0034) [2024-06-28 05:11:03,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2540650496. Throughput: 0: 44053.3. Samples: 2443568520. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 05:11:03,853][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:11:05,333][06909] Updated weights for policy 0, policy_version 155073 (0.0031) [2024-06-28 05:11:08,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 2540863488. Throughput: 0: 44039.5. Samples: 2443826180. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 05:11:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:11:09,427][06909] Updated weights for policy 0, policy_version 155083 (0.0041) [2024-06-28 05:11:12,849][06909] Updated weights for policy 0, policy_version 155093 (0.0044) [2024-06-28 05:11:13,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2541092864. Throughput: 0: 44009.1. Samples: 2443960180. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 05:11:13,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:11:16,640][06909] Updated weights for policy 0, policy_version 155103 (0.0029) [2024-06-28 05:11:18,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.7, 300 sec: 44043.5). Total num frames: 2541289472. Throughput: 0: 43815.0. Samples: 2444217780. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 05:11:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:11:20,025][06909] Updated weights for policy 0, policy_version 155113 (0.0036) [2024-06-28 05:11:23,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.6, 300 sec: 44097.9). Total num frames: 2541518848. Throughput: 0: 43910.1. Samples: 2444482060. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 05:11:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:11:24,158][06909] Updated weights for policy 0, policy_version 155123 (0.0039) [2024-06-28 05:11:27,888][06909] Updated weights for policy 0, policy_version 155133 (0.0030) [2024-06-28 05:11:28,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43965.3, 300 sec: 44153.5). Total num frames: 2541748224. Throughput: 0: 43823.1. Samples: 2444618180. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 05:11:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:11:31,860][06909] Updated weights for policy 0, policy_version 155143 (0.0039) [2024-06-28 05:11:32,018][06887] Signal inference workers to stop experience collection... (34800 times) [2024-06-28 05:11:32,070][06909] InferenceWorker_p0-w0: stopping experience collection (34800 times) [2024-06-28 05:11:32,071][06887] Signal inference workers to resume experience collection... (34800 times) [2024-06-28 05:11:32,094][06909] InferenceWorker_p0-w0: resuming experience collection (34800 times) [2024-06-28 05:11:33,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2541944832. Throughput: 0: 43713.2. Samples: 2444876540. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 05:11:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:11:35,598][06909] Updated weights for policy 0, policy_version 155153 (0.0049) [2024-06-28 05:11:38,852][06674] Fps is (10 sec: 42589.1, 60 sec: 43691.1, 300 sec: 44042.1). Total num frames: 2542174208. Throughput: 0: 43561.1. Samples: 2445135600. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 05:11:38,853][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:11:39,281][06909] Updated weights for policy 0, policy_version 155163 (0.0035) [2024-06-28 05:11:42,774][06909] Updated weights for policy 0, policy_version 155173 (0.0026) [2024-06-28 05:11:43,850][06674] Fps is (10 sec: 47514.0, 60 sec: 44236.8, 300 sec: 44264.6). Total num frames: 2542419968. Throughput: 0: 43811.3. Samples: 2445278220. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 05:11:43,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 05:11:46,458][06909] Updated weights for policy 0, policy_version 155183 (0.0035) [2024-06-28 05:11:48,850][06674] Fps is (10 sec: 42607.5, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2542600192. Throughput: 0: 43885.4. Samples: 2445543360. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 05:11:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:11:48,933][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000155189_2542616576.pth... [2024-06-28 05:11:48,999][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000154543_2532032512.pth [2024-06-28 05:11:50,182][06909] Updated weights for policy 0, policy_version 155193 (0.0041) [2024-06-28 05:11:53,815][06909] Updated weights for policy 0, policy_version 155203 (0.0027) [2024-06-28 05:11:53,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2542845952. Throughput: 0: 43992.9. Samples: 2445805860. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 05:11:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:11:57,463][06909] Updated weights for policy 0, policy_version 155213 (0.0038) [2024-06-28 05:11:58,850][06674] Fps is (10 sec: 47513.0, 60 sec: 43963.8, 300 sec: 44209.0). Total num frames: 2543075328. Throughput: 0: 43935.1. Samples: 2445937260. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 05:11:58,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:12:01,457][06909] Updated weights for policy 0, policy_version 155223 (0.0036) [2024-06-28 05:12:03,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 2543288320. Throughput: 0: 44041.8. Samples: 2446199660. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 05:12:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:12:05,191][06909] Updated weights for policy 0, policy_version 155233 (0.0042) [2024-06-28 05:12:08,835][06909] Updated weights for policy 0, policy_version 155243 (0.0035) [2024-06-28 05:12:08,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2543501312. Throughput: 0: 43934.2. Samples: 2446459100. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 05:12:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:12:12,386][06909] Updated weights for policy 0, policy_version 155253 (0.0037) [2024-06-28 05:12:13,852][06674] Fps is (10 sec: 42589.9, 60 sec: 43689.2, 300 sec: 44097.6). Total num frames: 2543714304. Throughput: 0: 43903.7. Samples: 2446593940. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 05:12:13,853][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:12:16,317][06909] Updated weights for policy 0, policy_version 155263 (0.0034) [2024-06-28 05:12:18,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2543943680. Throughput: 0: 44040.9. Samples: 2446858380. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 05:12:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:12:20,044][06909] Updated weights for policy 0, policy_version 155273 (0.0039) [2024-06-28 05:12:23,512][06909] Updated weights for policy 0, policy_version 155283 (0.0035) [2024-06-28 05:12:23,856][06674] Fps is (10 sec: 44219.4, 60 sec: 43959.4, 300 sec: 44041.5). Total num frames: 2544156672. Throughput: 0: 44055.8. Samples: 2447118280. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 05:12:23,856][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:12:27,228][06909] Updated weights for policy 0, policy_version 155293 (0.0035) [2024-06-28 05:12:28,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.6, 300 sec: 44097.9). Total num frames: 2544386048. Throughput: 0: 43992.8. Samples: 2447257900. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 05:12:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:12:31,340][06909] Updated weights for policy 0, policy_version 155303 (0.0044) [2024-06-28 05:12:33,850][06674] Fps is (10 sec: 44263.4, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 2544599040. Throughput: 0: 44063.5. Samples: 2447526220. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 05:12:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:12:34,937][06909] Updated weights for policy 0, policy_version 155313 (0.0036) [2024-06-28 05:12:38,449][06909] Updated weights for policy 0, policy_version 155323 (0.0032) [2024-06-28 05:12:38,850][06674] Fps is (10 sec: 45875.8, 60 sec: 44511.5, 300 sec: 44098.0). Total num frames: 2544844800. Throughput: 0: 44107.3. Samples: 2447790680. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 05:12:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:12:42,288][06909] Updated weights for policy 0, policy_version 155333 (0.0040) [2024-06-28 05:12:43,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2545057792. Throughput: 0: 44241.5. Samples: 2447928120. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 05:12:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:12:44,464][06887] Signal inference workers to stop experience collection... (34850 times) [2024-06-28 05:12:44,464][06887] Signal inference workers to resume experience collection... (34850 times) [2024-06-28 05:12:44,484][06909] InferenceWorker_p0-w0: stopping experience collection (34850 times) [2024-06-28 05:12:44,484][06909] InferenceWorker_p0-w0: resuming experience collection (34850 times) [2024-06-28 05:12:45,602][06909] Updated weights for policy 0, policy_version 155343 (0.0038) [2024-06-28 05:12:48,850][06674] Fps is (10 sec: 40959.8, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2545254400. Throughput: 0: 44236.1. Samples: 2448190280. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 05:12:48,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 05:12:49,682][06909] Updated weights for policy 0, policy_version 155353 (0.0034) [2024-06-28 05:12:53,318][06909] Updated weights for policy 0, policy_version 155363 (0.0024) [2024-06-28 05:12:53,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2545500160. Throughput: 0: 44379.2. Samples: 2448456160. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 05:12:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:12:56,893][06909] Updated weights for policy 0, policy_version 155373 (0.0045) [2024-06-28 05:12:58,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.8, 300 sec: 44153.8). Total num frames: 2545713152. Throughput: 0: 44438.5. Samples: 2448593580. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 05:12:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:13:00,625][06909] Updated weights for policy 0, policy_version 155383 (0.0030) [2024-06-28 05:13:03,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2545926144. Throughput: 0: 44379.1. Samples: 2448855440. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 05:13:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:13:04,364][06909] Updated weights for policy 0, policy_version 155393 (0.0028) [2024-06-28 05:13:08,009][06909] Updated weights for policy 0, policy_version 155403 (0.0039) [2024-06-28 05:13:08,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.8, 300 sec: 44098.8). Total num frames: 2546155520. Throughput: 0: 44462.7. Samples: 2449118840. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 05:13:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:13:11,828][06909] Updated weights for policy 0, policy_version 155413 (0.0028) [2024-06-28 05:13:13,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44511.3, 300 sec: 44153.5). Total num frames: 2546384896. Throughput: 0: 44452.0. Samples: 2449258240. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 05:13:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:13:15,256][06909] Updated weights for policy 0, policy_version 155423 (0.0029) [2024-06-28 05:13:18,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2546597888. Throughput: 0: 44363.4. Samples: 2449522580. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 05:13:18,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:13:18,986][06909] Updated weights for policy 0, policy_version 155433 (0.0026) [2024-06-28 05:13:22,538][06909] Updated weights for policy 0, policy_version 155443 (0.0040) [2024-06-28 05:13:23,856][06674] Fps is (10 sec: 42572.9, 60 sec: 44236.7, 300 sec: 44041.5). Total num frames: 2546810880. Throughput: 0: 44315.7. Samples: 2449785160. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 05:13:23,856][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:13:26,285][06909] Updated weights for policy 0, policy_version 155453 (0.0026) [2024-06-28 05:13:28,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2547023872. Throughput: 0: 44145.8. Samples: 2449914680. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 05:13:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:13:30,233][06909] Updated weights for policy 0, policy_version 155463 (0.0031) [2024-06-28 05:13:33,653][06909] Updated weights for policy 0, policy_version 155473 (0.0038) [2024-06-28 05:13:33,852][06674] Fps is (10 sec: 45893.7, 60 sec: 44508.3, 300 sec: 44153.2). Total num frames: 2547269632. Throughput: 0: 44298.0. Samples: 2450183780. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 05:13:33,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:13:37,462][06909] Updated weights for policy 0, policy_version 155483 (0.0032) [2024-06-28 05:13:38,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2547482624. Throughput: 0: 44420.4. Samples: 2450455080. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 05:13:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:13:41,415][06909] Updated weights for policy 0, policy_version 155493 (0.0021) [2024-06-28 05:13:43,850][06674] Fps is (10 sec: 42607.4, 60 sec: 43963.8, 300 sec: 44209.1). Total num frames: 2547695616. Throughput: 0: 44309.0. Samples: 2450587480. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 05:13:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:13:45,078][06909] Updated weights for policy 0, policy_version 155503 (0.0027) [2024-06-28 05:13:48,775][06909] Updated weights for policy 0, policy_version 155513 (0.0036) [2024-06-28 05:13:48,850][06674] Fps is (10 sec: 44235.9, 60 sec: 44509.7, 300 sec: 44097.9). Total num frames: 2547924992. Throughput: 0: 44234.9. Samples: 2450846020. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 05:13:48,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:13:48,904][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000155514_2547941376.pth... [2024-06-28 05:13:48,950][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000154866_2537324544.pth [2024-06-28 05:13:52,339][06909] Updated weights for policy 0, policy_version 155523 (0.0036) [2024-06-28 05:13:53,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2548121600. Throughput: 0: 44195.2. Samples: 2451107620. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 05:13:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:13:56,064][06909] Updated weights for policy 0, policy_version 155533 (0.0040) [2024-06-28 05:13:58,850][06674] Fps is (10 sec: 42599.4, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2548350976. Throughput: 0: 44091.7. Samples: 2451242360. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 05:13:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:13:59,827][06909] Updated weights for policy 0, policy_version 155543 (0.0036) [2024-06-28 05:14:03,341][06909] Updated weights for policy 0, policy_version 155553 (0.0032) [2024-06-28 05:14:03,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2548580352. Throughput: 0: 43897.4. Samples: 2451497960. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 05:14:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:14:07,512][06909] Updated weights for policy 0, policy_version 155563 (0.0038) [2024-06-28 05:14:08,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2548793344. Throughput: 0: 43977.0. Samples: 2451763860. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 05:14:08,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:14:10,975][06909] Updated weights for policy 0, policy_version 155573 (0.0024) [2024-06-28 05:14:13,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 2549022720. Throughput: 0: 44083.4. Samples: 2451898440. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 05:14:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:14:14,991][06909] Updated weights for policy 0, policy_version 155583 (0.0043) [2024-06-28 05:14:18,752][06909] Updated weights for policy 0, policy_version 155593 (0.0030) [2024-06-28 05:14:18,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 2549235712. Throughput: 0: 43845.6. Samples: 2452156740. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 05:14:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:14:22,083][06887] Signal inference workers to stop experience collection... (34900 times) [2024-06-28 05:14:22,085][06887] Signal inference workers to resume experience collection... (34900 times) [2024-06-28 05:14:22,132][06909] InferenceWorker_p0-w0: stopping experience collection (34900 times) [2024-06-28 05:14:22,132][06909] InferenceWorker_p0-w0: resuming experience collection (34900 times) [2024-06-28 05:14:22,396][06909] Updated weights for policy 0, policy_version 155603 (0.0028) [2024-06-28 05:14:23,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44241.2, 300 sec: 44097.9). Total num frames: 2549465088. Throughput: 0: 43846.1. Samples: 2452428160. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 05:14:23,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:14:25,858][06909] Updated weights for policy 0, policy_version 155613 (0.0029) [2024-06-28 05:14:28,856][06674] Fps is (10 sec: 44209.8, 60 sec: 44232.3, 300 sec: 44097.1). Total num frames: 2549678080. Throughput: 0: 43852.2. Samples: 2452561100. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 05:14:28,856][06674] Avg episode reward: [(0, '0.479')] [2024-06-28 05:14:30,010][06909] Updated weights for policy 0, policy_version 155623 (0.0030) [2024-06-28 05:14:33,195][06909] Updated weights for policy 0, policy_version 155633 (0.0035) [2024-06-28 05:14:33,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44238.2, 300 sec: 44153.5). Total num frames: 2549923840. Throughput: 0: 43988.1. Samples: 2452825480. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 05:14:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:14:37,167][06909] Updated weights for policy 0, policy_version 155643 (0.0040) [2024-06-28 05:14:38,850][06674] Fps is (10 sec: 45902.8, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 2550136832. Throughput: 0: 44117.7. Samples: 2453092920. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 05:14:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 05:14:40,738][06909] Updated weights for policy 0, policy_version 155653 (0.0031) [2024-06-28 05:14:43,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2550333440. Throughput: 0: 43809.3. Samples: 2453213780. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 05:14:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:14:44,809][06909] Updated weights for policy 0, policy_version 155663 (0.0031) [2024-06-28 05:14:48,051][06909] Updated weights for policy 0, policy_version 155673 (0.0032) [2024-06-28 05:14:48,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2550562816. Throughput: 0: 44115.1. Samples: 2453483140. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 05:14:48,850][06674] Avg episode reward: [(0, '0.399')] [2024-06-28 05:14:52,387][06909] Updated weights for policy 0, policy_version 155683 (0.0032) [2024-06-28 05:14:53,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 2550792192. Throughput: 0: 44008.0. Samples: 2453744220. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 05:14:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:14:55,987][06909] Updated weights for policy 0, policy_version 155693 (0.0038) [2024-06-28 05:14:58,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2550988800. Throughput: 0: 43928.5. Samples: 2453875220. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 05:14:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 05:14:59,629][06909] Updated weights for policy 0, policy_version 155703 (0.0040) [2024-06-28 05:15:03,164][06909] Updated weights for policy 0, policy_version 155713 (0.0040) [2024-06-28 05:15:03,852][06674] Fps is (10 sec: 44228.2, 60 sec: 44235.3, 300 sec: 44097.7). Total num frames: 2551234560. Throughput: 0: 44224.6. Samples: 2454146940. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 05:15:03,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:15:06,975][06909] Updated weights for policy 0, policy_version 155723 (0.0035) [2024-06-28 05:15:08,850][06674] Fps is (10 sec: 49152.1, 60 sec: 44783.0, 300 sec: 44209.0). Total num frames: 2551480320. Throughput: 0: 43987.6. Samples: 2454407600. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 05:15:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:15:10,600][06909] Updated weights for policy 0, policy_version 155733 (0.0028) [2024-06-28 05:15:13,850][06674] Fps is (10 sec: 42607.1, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2551660544. Throughput: 0: 44006.8. Samples: 2454541140. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 05:15:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:15:14,339][06909] Updated weights for policy 0, policy_version 155743 (0.0034) [2024-06-28 05:15:18,038][06909] Updated weights for policy 0, policy_version 155753 (0.0023) [2024-06-28 05:15:18,850][06674] Fps is (10 sec: 40960.2, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2551889920. Throughput: 0: 43965.9. Samples: 2454803940. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 05:15:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:15:21,863][06909] Updated weights for policy 0, policy_version 155763 (0.0025) [2024-06-28 05:15:23,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44236.8, 300 sec: 44098.3). Total num frames: 2552119296. Throughput: 0: 43836.9. Samples: 2455065580. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 05:15:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:15:25,357][06909] Updated weights for policy 0, policy_version 155773 (0.0032) [2024-06-28 05:15:28,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43695.1, 300 sec: 44042.4). Total num frames: 2552299520. Throughput: 0: 44065.7. Samples: 2455196740. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 05:15:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:15:29,234][06909] Updated weights for policy 0, policy_version 155783 (0.0027) [2024-06-28 05:15:32,984][06909] Updated weights for policy 0, policy_version 155793 (0.0033) [2024-06-28 05:15:33,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.8, 300 sec: 44098.4). Total num frames: 2552561664. Throughput: 0: 44013.4. Samples: 2455463740. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 05:15:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:15:36,717][06909] Updated weights for policy 0, policy_version 155803 (0.0036) [2024-06-28 05:15:38,850][06674] Fps is (10 sec: 49152.7, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 2552791040. Throughput: 0: 44158.4. Samples: 2455731340. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 05:15:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 05:15:40,325][06909] Updated weights for policy 0, policy_version 155813 (0.0040) [2024-06-28 05:15:43,852][06674] Fps is (10 sec: 42589.8, 60 sec: 44235.3, 300 sec: 44097.6). Total num frames: 2552987648. Throughput: 0: 44150.1. Samples: 2455862060. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 05:15:43,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:15:44,492][06909] Updated weights for policy 0, policy_version 155823 (0.0031) [2024-06-28 05:15:47,741][06909] Updated weights for policy 0, policy_version 155833 (0.0029) [2024-06-28 05:15:48,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2553217024. Throughput: 0: 44026.5. Samples: 2456128040. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 05:15:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:15:48,893][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000155837_2553233408.pth... [2024-06-28 05:15:48,944][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000155189_2542616576.pth [2024-06-28 05:15:51,711][06909] Updated weights for policy 0, policy_version 155843 (0.0031) [2024-06-28 05:15:53,850][06674] Fps is (10 sec: 45884.4, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2553446400. Throughput: 0: 44068.9. Samples: 2456390700. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 05:15:53,853][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:15:55,161][06909] Updated weights for policy 0, policy_version 155853 (0.0028) [2024-06-28 05:15:58,850][06674] Fps is (10 sec: 42598.0, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 2553643008. Throughput: 0: 44027.6. Samples: 2456522380. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 05:15:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:15:59,244][06909] Updated weights for policy 0, policy_version 155863 (0.0028) [2024-06-28 05:16:00,295][06887] Signal inference workers to stop experience collection... (34950 times) [2024-06-28 05:16:00,343][06909] InferenceWorker_p0-w0: stopping experience collection (34950 times) [2024-06-28 05:16:00,349][06887] Signal inference workers to resume experience collection... (34950 times) [2024-06-28 05:16:00,363][06909] InferenceWorker_p0-w0: resuming experience collection (34950 times) [2024-06-28 05:16:02,754][06909] Updated weights for policy 0, policy_version 155873 (0.0030) [2024-06-28 05:16:03,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43965.1, 300 sec: 44097.9). Total num frames: 2553872384. Throughput: 0: 43975.4. Samples: 2456782840. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 05:16:03,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:16:06,610][06909] Updated weights for policy 0, policy_version 155883 (0.0036) [2024-06-28 05:16:08,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43417.7, 300 sec: 44042.4). Total num frames: 2554085376. Throughput: 0: 44144.6. Samples: 2457052080. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 05:16:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:16:10,054][06909] Updated weights for policy 0, policy_version 155893 (0.0040) [2024-06-28 05:16:13,850][06674] Fps is (10 sec: 42599.2, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2554298368. Throughput: 0: 44122.8. Samples: 2457182260. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 05:16:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:16:14,179][06909] Updated weights for policy 0, policy_version 155903 (0.0031) [2024-06-28 05:16:17,487][06909] Updated weights for policy 0, policy_version 155913 (0.0027) [2024-06-28 05:16:18,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2554511360. Throughput: 0: 44084.9. Samples: 2457447560. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 05:16:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:16:21,442][06909] Updated weights for policy 0, policy_version 155923 (0.0028) [2024-06-28 05:16:23,850][06674] Fps is (10 sec: 45874.5, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2554757120. Throughput: 0: 43974.5. Samples: 2457710200. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 05:16:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:16:24,869][06909] Updated weights for policy 0, policy_version 155933 (0.0030) [2024-06-28 05:16:28,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2554953728. Throughput: 0: 44020.7. Samples: 2457842900. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 05:16:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:16:28,964][06909] Updated weights for policy 0, policy_version 155943 (0.0031) [2024-06-28 05:16:32,493][06909] Updated weights for policy 0, policy_version 155953 (0.0036) [2024-06-28 05:16:33,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.7, 300 sec: 44153.8). Total num frames: 2555199488. Throughput: 0: 43999.0. Samples: 2458108000. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 05:16:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:16:36,333][06909] Updated weights for policy 0, policy_version 155963 (0.0037) [2024-06-28 05:16:38,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2555412480. Throughput: 0: 44022.7. Samples: 2458371720. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 05:16:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:16:39,831][06909] Updated weights for policy 0, policy_version 155973 (0.0028) [2024-06-28 05:16:43,722][06909] Updated weights for policy 0, policy_version 155983 (0.0036) [2024-06-28 05:16:43,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43965.2, 300 sec: 44153.5). Total num frames: 2555625472. Throughput: 0: 43998.6. Samples: 2458502320. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 05:16:43,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:16:47,189][06909] Updated weights for policy 0, policy_version 155993 (0.0036) [2024-06-28 05:16:48,850][06674] Fps is (10 sec: 44236.0, 60 sec: 43963.5, 300 sec: 44097.9). Total num frames: 2555854848. Throughput: 0: 44105.3. Samples: 2458767580. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 05:16:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:16:51,204][06909] Updated weights for policy 0, policy_version 156003 (0.0030) [2024-06-28 05:16:53,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2556067840. Throughput: 0: 44100.0. Samples: 2459036580. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 05:16:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:16:54,547][06909] Updated weights for policy 0, policy_version 156013 (0.0031) [2024-06-28 05:16:58,399][06909] Updated weights for policy 0, policy_version 156023 (0.0035) [2024-06-28 05:16:58,850][06674] Fps is (10 sec: 44237.3, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2556297216. Throughput: 0: 44237.2. Samples: 2459172940. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 05:16:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:17:01,991][06909] Updated weights for policy 0, policy_version 156033 (0.0038) [2024-06-28 05:17:03,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.9, 300 sec: 44098.0). Total num frames: 2556510208. Throughput: 0: 44178.7. Samples: 2459435600. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 05:17:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:17:06,003][06909] Updated weights for policy 0, policy_version 156043 (0.0025) [2024-06-28 05:17:08,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 44098.3). Total num frames: 2556723200. Throughput: 0: 44228.5. Samples: 2459700480. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 05:17:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:17:09,539][06909] Updated weights for policy 0, policy_version 156053 (0.0043) [2024-06-28 05:17:12,454][06887] Signal inference workers to stop experience collection... (35000 times) [2024-06-28 05:17:12,456][06887] Signal inference workers to resume experience collection... (35000 times) [2024-06-28 05:17:12,478][06909] InferenceWorker_p0-w0: stopping experience collection (35000 times) [2024-06-28 05:17:12,478][06909] InferenceWorker_p0-w0: resuming experience collection (35000 times) [2024-06-28 05:17:13,463][06909] Updated weights for policy 0, policy_version 156063 (0.0042) [2024-06-28 05:17:13,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2556952576. Throughput: 0: 44122.6. Samples: 2459828420. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 05:17:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:17:16,901][06909] Updated weights for policy 0, policy_version 156073 (0.0035) [2024-06-28 05:17:18,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44509.8, 300 sec: 44154.4). Total num frames: 2557181952. Throughput: 0: 44198.2. Samples: 2460096920. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 05:17:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:17:20,549][06909] Updated weights for policy 0, policy_version 156083 (0.0030) [2024-06-28 05:17:23,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2557394944. Throughput: 0: 44304.9. Samples: 2460365440. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 05:17:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:17:24,221][06909] Updated weights for policy 0, policy_version 156093 (0.0036) [2024-06-28 05:17:27,900][06909] Updated weights for policy 0, policy_version 156103 (0.0035) [2024-06-28 05:17:28,850][06674] Fps is (10 sec: 45875.8, 60 sec: 44782.9, 300 sec: 44209.0). Total num frames: 2557640704. Throughput: 0: 44385.9. Samples: 2460499680. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 05:17:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:17:31,563][06909] Updated weights for policy 0, policy_version 156113 (0.0021) [2024-06-28 05:17:33,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2557837312. Throughput: 0: 44359.8. Samples: 2460763760. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 05:17:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:17:35,366][06909] Updated weights for policy 0, policy_version 156123 (0.0035) [2024-06-28 05:17:38,850][06674] Fps is (10 sec: 42597.6, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2558066688. Throughput: 0: 44362.9. Samples: 2461032920. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 05:17:38,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:17:38,914][06909] Updated weights for policy 0, policy_version 156133 (0.0039) [2024-06-28 05:17:42,923][06909] Updated weights for policy 0, policy_version 156143 (0.0030) [2024-06-28 05:17:43,850][06674] Fps is (10 sec: 45874.5, 60 sec: 44509.8, 300 sec: 44209.0). Total num frames: 2558296064. Throughput: 0: 44304.4. Samples: 2461166640. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 05:17:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:17:46,562][06909] Updated weights for policy 0, policy_version 156153 (0.0028) [2024-06-28 05:17:48,850][06674] Fps is (10 sec: 45876.0, 60 sec: 44510.0, 300 sec: 44153.5). Total num frames: 2558525440. Throughput: 0: 44234.7. Samples: 2461426160. Policy #0 lag: (min: 1.0, avg: 9.4, max: 22.0) [2024-06-28 05:17:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:17:48,860][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000156160_2558525440.pth... [2024-06-28 05:17:48,912][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000155514_2547941376.pth [2024-06-28 05:17:50,223][06909] Updated weights for policy 0, policy_version 156163 (0.0034) [2024-06-28 05:17:53,850][06674] Fps is (10 sec: 42598.8, 60 sec: 44236.7, 300 sec: 44098.0). Total num frames: 2558722048. Throughput: 0: 44239.6. Samples: 2461691260. Policy #0 lag: (min: 1.0, avg: 9.4, max: 22.0) [2024-06-28 05:17:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:17:53,873][06909] Updated weights for policy 0, policy_version 156173 (0.0048) [2024-06-28 05:17:57,744][06909] Updated weights for policy 0, policy_version 156183 (0.0038) [2024-06-28 05:17:58,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 2558951424. Throughput: 0: 44340.5. Samples: 2461823740. Policy #0 lag: (min: 1.0, avg: 9.4, max: 22.0) [2024-06-28 05:17:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 05:18:01,479][06909] Updated weights for policy 0, policy_version 156193 (0.0027) [2024-06-28 05:18:03,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2559148032. Throughput: 0: 44097.8. Samples: 2462081320. Policy #0 lag: (min: 1.0, avg: 9.4, max: 22.0) [2024-06-28 05:18:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:18:05,133][06909] Updated weights for policy 0, policy_version 156203 (0.0030) [2024-06-28 05:18:08,850][06674] Fps is (10 sec: 42598.0, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2559377408. Throughput: 0: 44140.0. Samples: 2462351740. Policy #0 lag: (min: 1.0, avg: 9.4, max: 22.0) [2024-06-28 05:18:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:18:09,001][06909] Updated weights for policy 0, policy_version 156213 (0.0040) [2024-06-28 05:18:12,637][06909] Updated weights for policy 0, policy_version 156223 (0.0035) [2024-06-28 05:18:13,852][06674] Fps is (10 sec: 47503.9, 60 sec: 44508.3, 300 sec: 44153.2). Total num frames: 2559623168. Throughput: 0: 44182.4. Samples: 2462487980. Policy #0 lag: (min: 1.0, avg: 9.4, max: 22.0) [2024-06-28 05:18:13,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:18:16,195][06909] Updated weights for policy 0, policy_version 156233 (0.0035) [2024-06-28 05:18:18,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44236.9, 300 sec: 44154.4). Total num frames: 2559836160. Throughput: 0: 44021.7. Samples: 2462744740. Policy #0 lag: (min: 1.0, avg: 9.4, max: 22.0) [2024-06-28 05:18:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:18:19,994][06909] Updated weights for policy 0, policy_version 156243 (0.0036) [2024-06-28 05:18:23,785][06909] Updated weights for policy 0, policy_version 156253 (0.0032) [2024-06-28 05:18:23,850][06674] Fps is (10 sec: 42607.2, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2560049152. Throughput: 0: 44038.3. Samples: 2463014640. Policy #0 lag: (min: 1.0, avg: 9.4, max: 22.0) [2024-06-28 05:18:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:18:27,493][06909] Updated weights for policy 0, policy_version 156263 (0.0039) [2024-06-28 05:18:28,856][06674] Fps is (10 sec: 44209.7, 60 sec: 43959.2, 300 sec: 44097.3). Total num frames: 2560278528. Throughput: 0: 43995.0. Samples: 2463146680. Policy #0 lag: (min: 1.0, avg: 9.4, max: 22.0) [2024-06-28 05:18:28,857][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:18:30,391][06887] Signal inference workers to stop experience collection... (35050 times) [2024-06-28 05:18:30,412][06909] InferenceWorker_p0-w0: stopping experience collection (35050 times) [2024-06-28 05:18:30,449][06887] Signal inference workers to resume experience collection... (35050 times) [2024-06-28 05:18:30,450][06909] InferenceWorker_p0-w0: resuming experience collection (35050 times) [2024-06-28 05:18:31,376][06909] Updated weights for policy 0, policy_version 156273 (0.0035) [2024-06-28 05:18:33,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2560475136. Throughput: 0: 43984.5. Samples: 2463405460. Policy #0 lag: (min: 1.0, avg: 9.4, max: 22.0) [2024-06-28 05:18:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:18:34,692][06909] Updated weights for policy 0, policy_version 156283 (0.0034) [2024-06-28 05:18:38,850][06674] Fps is (10 sec: 40984.5, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2560688128. Throughput: 0: 44019.4. Samples: 2463672140. Policy #0 lag: (min: 1.0, avg: 9.4, max: 22.0) [2024-06-28 05:18:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:18:38,877][06909] Updated weights for policy 0, policy_version 156293 (0.0028) [2024-06-28 05:18:42,180][06909] Updated weights for policy 0, policy_version 156303 (0.0038) [2024-06-28 05:18:43,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2560933888. Throughput: 0: 44055.9. Samples: 2463806260. Policy #0 lag: (min: 1.0, avg: 9.4, max: 22.0) [2024-06-28 05:18:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:18:46,108][06909] Updated weights for policy 0, policy_version 156313 (0.0042) [2024-06-28 05:18:48,850][06674] Fps is (10 sec: 47514.1, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 2561163264. Throughput: 0: 44159.6. Samples: 2464068500. Policy #0 lag: (min: 1.0, avg: 9.4, max: 22.0) [2024-06-28 05:18:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:18:49,746][06909] Updated weights for policy 0, policy_version 156323 (0.0036) [2024-06-28 05:18:53,558][06909] Updated weights for policy 0, policy_version 156333 (0.0030) [2024-06-28 05:18:53,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2561376256. Throughput: 0: 44165.8. Samples: 2464339200. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 05:18:53,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:18:57,065][06909] Updated weights for policy 0, policy_version 156343 (0.0029) [2024-06-28 05:18:58,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2561589248. Throughput: 0: 44045.6. Samples: 2464469940. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 05:18:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:19:00,963][06909] Updated weights for policy 0, policy_version 156353 (0.0022) [2024-06-28 05:19:03,850][06674] Fps is (10 sec: 42598.8, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2561802240. Throughput: 0: 44137.8. Samples: 2464730940. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 05:19:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:19:04,558][06909] Updated weights for policy 0, policy_version 156363 (0.0037) [2024-06-28 05:19:08,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2562015232. Throughput: 0: 44065.3. Samples: 2464997580. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 05:19:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:19:08,854][06909] Updated weights for policy 0, policy_version 156373 (0.0031) [2024-06-28 05:19:11,855][06909] Updated weights for policy 0, policy_version 156383 (0.0036) [2024-06-28 05:19:13,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43692.2, 300 sec: 44098.0). Total num frames: 2562244608. Throughput: 0: 44044.7. Samples: 2465128420. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 05:19:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:19:15,986][06909] Updated weights for policy 0, policy_version 156393 (0.0042) [2024-06-28 05:19:18,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2562473984. Throughput: 0: 44106.1. Samples: 2465390240. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 05:19:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:19:19,635][06909] Updated weights for policy 0, policy_version 156403 (0.0050) [2024-06-28 05:19:23,165][06909] Updated weights for policy 0, policy_version 156413 (0.0027) [2024-06-28 05:19:23,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.8, 300 sec: 44098.9). Total num frames: 2562686976. Throughput: 0: 44225.5. Samples: 2465662280. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 05:19:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:19:26,813][06909] Updated weights for policy 0, policy_version 156423 (0.0037) [2024-06-28 05:19:28,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43695.2, 300 sec: 43986.9). Total num frames: 2562899968. Throughput: 0: 44236.5. Samples: 2465796900. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 05:19:28,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:19:30,364][06909] Updated weights for policy 0, policy_version 156433 (0.0026) [2024-06-28 05:19:33,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44509.8, 300 sec: 44098.0). Total num frames: 2563145728. Throughput: 0: 44373.3. Samples: 2466065300. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 05:19:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:19:34,039][06909] Updated weights for policy 0, policy_version 156443 (0.0027) [2024-06-28 05:19:37,783][06909] Updated weights for policy 0, policy_version 156453 (0.0030) [2024-06-28 05:19:38,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44510.0, 300 sec: 44153.5). Total num frames: 2563358720. Throughput: 0: 44079.2. Samples: 2466322760. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 05:19:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:19:41,682][06909] Updated weights for policy 0, policy_version 156463 (0.0030) [2024-06-28 05:19:43,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2563571712. Throughput: 0: 44174.5. Samples: 2466457800. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 05:19:43,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 05:19:45,632][06909] Updated weights for policy 0, policy_version 156473 (0.0034) [2024-06-28 05:19:48,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2563801088. Throughput: 0: 44235.0. Samples: 2466721520. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 05:19:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:19:48,982][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000156483_2563817472.pth... [2024-06-28 05:19:48,985][06909] Updated weights for policy 0, policy_version 156483 (0.0041) [2024-06-28 05:19:49,029][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000155837_2553233408.pth [2024-06-28 05:19:52,872][06909] Updated weights for policy 0, policy_version 156493 (0.0044) [2024-06-28 05:19:53,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2564014080. Throughput: 0: 44086.7. Samples: 2466981480. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 05:19:53,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:19:56,581][06909] Updated weights for policy 0, policy_version 156503 (0.0024) [2024-06-28 05:19:58,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.7, 300 sec: 44042.7). Total num frames: 2564227072. Throughput: 0: 44186.6. Samples: 2467116820. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 05:19:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:20:00,299][06909] Updated weights for policy 0, policy_version 156513 (0.0026) [2024-06-28 05:20:03,811][06909] Updated weights for policy 0, policy_version 156523 (0.0026) [2024-06-28 05:20:03,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 2564472832. Throughput: 0: 44382.6. Samples: 2467387460. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 05:20:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:20:07,482][06909] Updated weights for policy 0, policy_version 156533 (0.0025) [2024-06-28 05:20:08,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2564669440. Throughput: 0: 44175.4. Samples: 2467650180. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 05:20:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:20:08,866][06887] Signal inference workers to stop experience collection... (35100 times) [2024-06-28 05:20:08,906][06909] InferenceWorker_p0-w0: stopping experience collection (35100 times) [2024-06-28 05:20:08,914][06887] Signal inference workers to resume experience collection... (35100 times) [2024-06-28 05:20:08,925][06909] InferenceWorker_p0-w0: resuming experience collection (35100 times) [2024-06-28 05:20:11,163][06909] Updated weights for policy 0, policy_version 156543 (0.0028) [2024-06-28 05:20:13,853][06674] Fps is (10 sec: 42584.3, 60 sec: 44234.3, 300 sec: 44097.4). Total num frames: 2564898816. Throughput: 0: 44131.3. Samples: 2467782960. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 05:20:13,854][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:20:14,966][06909] Updated weights for policy 0, policy_version 156553 (0.0031) [2024-06-28 05:20:18,754][06909] Updated weights for policy 0, policy_version 156563 (0.0038) [2024-06-28 05:20:18,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2565128192. Throughput: 0: 43931.6. Samples: 2468042220. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 05:20:18,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:20:22,512][06909] Updated weights for policy 0, policy_version 156573 (0.0041) [2024-06-28 05:20:23,850][06674] Fps is (10 sec: 42612.4, 60 sec: 43963.6, 300 sec: 44153.5). Total num frames: 2565324800. Throughput: 0: 44096.3. Samples: 2468307100. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 05:20:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:20:26,285][06909] Updated weights for policy 0, policy_version 156583 (0.0027) [2024-06-28 05:20:28,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2565554176. Throughput: 0: 43888.5. Samples: 2468432780. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 05:20:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:20:29,983][06909] Updated weights for policy 0, policy_version 156593 (0.0039) [2024-06-28 05:20:33,713][06909] Updated weights for policy 0, policy_version 156603 (0.0030) [2024-06-28 05:20:33,850][06674] Fps is (10 sec: 45875.8, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2565783552. Throughput: 0: 44217.4. Samples: 2468711300. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 05:20:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:20:37,542][06909] Updated weights for policy 0, policy_version 156613 (0.0041) [2024-06-28 05:20:38,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.7, 300 sec: 44153.8). Total num frames: 2566012928. Throughput: 0: 44198.2. Samples: 2468970400. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 05:20:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:20:41,087][06909] Updated weights for policy 0, policy_version 156623 (0.0026) [2024-06-28 05:20:43,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.9, 300 sec: 44097.9). Total num frames: 2566225920. Throughput: 0: 44105.3. Samples: 2469101560. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 05:20:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:20:44,803][06909] Updated weights for policy 0, policy_version 156633 (0.0034) [2024-06-28 05:20:48,549][06909] Updated weights for policy 0, policy_version 156643 (0.0029) [2024-06-28 05:20:48,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2566455296. Throughput: 0: 44034.7. Samples: 2469369020. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 05:20:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:20:52,214][06909] Updated weights for policy 0, policy_version 156653 (0.0031) [2024-06-28 05:20:53,850][06674] Fps is (10 sec: 44236.1, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 2566668288. Throughput: 0: 44052.4. Samples: 2469632540. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 05:20:53,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:20:55,950][06909] Updated weights for policy 0, policy_version 156663 (0.0031) [2024-06-28 05:20:58,850][06674] Fps is (10 sec: 42598.1, 60 sec: 44236.7, 300 sec: 44098.0). Total num frames: 2566881280. Throughput: 0: 44030.7. Samples: 2469764200. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 05:20:58,856][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:20:59,443][06909] Updated weights for policy 0, policy_version 156673 (0.0029) [2024-06-28 05:21:03,495][06909] Updated weights for policy 0, policy_version 156683 (0.0032) [2024-06-28 05:21:03,852][06674] Fps is (10 sec: 44228.5, 60 sec: 43962.3, 300 sec: 44153.2). Total num frames: 2567110656. Throughput: 0: 44131.4. Samples: 2470028220. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 05:21:03,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:21:07,254][06909] Updated weights for policy 0, policy_version 156693 (0.0038) [2024-06-28 05:21:08,850][06674] Fps is (10 sec: 44237.4, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2567323648. Throughput: 0: 44077.0. Samples: 2470290560. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 05:21:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:21:10,781][06909] Updated weights for policy 0, policy_version 156703 (0.0022) [2024-06-28 05:21:13,850][06674] Fps is (10 sec: 42606.9, 60 sec: 43966.2, 300 sec: 44153.5). Total num frames: 2567536640. Throughput: 0: 44212.5. Samples: 2470422340. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 05:21:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:21:14,504][06909] Updated weights for policy 0, policy_version 156713 (0.0032) [2024-06-28 05:21:18,369][06909] Updated weights for policy 0, policy_version 156723 (0.0026) [2024-06-28 05:21:18,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2567782400. Throughput: 0: 44069.7. Samples: 2470694440. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 05:21:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:21:22,054][06909] Updated weights for policy 0, policy_version 156733 (0.0032) [2024-06-28 05:21:23,851][06674] Fps is (10 sec: 45871.9, 60 sec: 44509.4, 300 sec: 44208.9). Total num frames: 2567995392. Throughput: 0: 44025.5. Samples: 2470951580. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 05:21:23,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:21:25,796][06909] Updated weights for policy 0, policy_version 156743 (0.0047) [2024-06-28 05:21:28,856][06674] Fps is (10 sec: 42572.6, 60 sec: 44232.4, 300 sec: 44097.1). Total num frames: 2568208384. Throughput: 0: 44044.3. Samples: 2471083820. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 05:21:28,856][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:21:29,236][06909] Updated weights for policy 0, policy_version 156753 (0.0045) [2024-06-28 05:21:33,163][06909] Updated weights for policy 0, policy_version 156763 (0.0038) [2024-06-28 05:21:33,852][06674] Fps is (10 sec: 44231.1, 60 sec: 44235.3, 300 sec: 44153.2). Total num frames: 2568437760. Throughput: 0: 43935.0. Samples: 2471346180. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 05:21:33,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:21:36,288][06887] Signal inference workers to stop experience collection... (35150 times) [2024-06-28 05:21:36,320][06909] InferenceWorker_p0-w0: stopping experience collection (35150 times) [2024-06-28 05:21:36,342][06887] Signal inference workers to resume experience collection... (35150 times) [2024-06-28 05:21:36,344][06909] InferenceWorker_p0-w0: resuming experience collection (35150 times) [2024-06-28 05:21:36,830][06909] Updated weights for policy 0, policy_version 156773 (0.0027) [2024-06-28 05:21:38,850][06674] Fps is (10 sec: 44263.3, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2568650752. Throughput: 0: 43894.7. Samples: 2471607800. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 05:21:38,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:21:40,686][06909] Updated weights for policy 0, policy_version 156783 (0.0039) [2024-06-28 05:21:43,856][06674] Fps is (10 sec: 44219.1, 60 sec: 44232.3, 300 sec: 44152.6). Total num frames: 2568880128. Throughput: 0: 43908.0. Samples: 2471740320. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 05:21:43,856][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:21:44,216][06909] Updated weights for policy 0, policy_version 156793 (0.0029) [2024-06-28 05:21:47,982][06909] Updated weights for policy 0, policy_version 156803 (0.0028) [2024-06-28 05:21:48,850][06674] Fps is (10 sec: 45875.8, 60 sec: 44236.9, 300 sec: 44209.0). Total num frames: 2569109504. Throughput: 0: 44121.6. Samples: 2472013600. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 05:21:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:21:48,859][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000156806_2569109504.pth... [2024-06-28 05:21:48,914][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000156160_2558525440.pth [2024-06-28 05:21:51,855][06909] Updated weights for policy 0, policy_version 156813 (0.0037) [2024-06-28 05:21:53,850][06674] Fps is (10 sec: 42624.3, 60 sec: 43963.9, 300 sec: 44098.0). Total num frames: 2569306112. Throughput: 0: 44092.9. Samples: 2472274740. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 05:21:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:21:55,471][06909] Updated weights for policy 0, policy_version 156823 (0.0022) [2024-06-28 05:21:58,850][06674] Fps is (10 sec: 42597.8, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2569535488. Throughput: 0: 44008.8. Samples: 2472402740. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 05:21:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:21:59,282][06909] Updated weights for policy 0, policy_version 156833 (0.0036) [2024-06-28 05:22:03,005][06909] Updated weights for policy 0, policy_version 156843 (0.0025) [2024-06-28 05:22:03,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43965.3, 300 sec: 44153.5). Total num frames: 2569748480. Throughput: 0: 43989.8. Samples: 2472673980. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 05:22:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:22:06,659][06909] Updated weights for policy 0, policy_version 156853 (0.0035) [2024-06-28 05:22:08,852][06674] Fps is (10 sec: 42588.5, 60 sec: 43961.9, 300 sec: 44097.6). Total num frames: 2569961472. Throughput: 0: 43977.4. Samples: 2472930640. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 05:22:08,853][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:22:10,630][06909] Updated weights for policy 0, policy_version 156863 (0.0030) [2024-06-28 05:22:13,852][06674] Fps is (10 sec: 44227.6, 60 sec: 44235.3, 300 sec: 44097.7). Total num frames: 2570190848. Throughput: 0: 43957.3. Samples: 2473061720. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 05:22:13,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:22:14,074][06909] Updated weights for policy 0, policy_version 156873 (0.0042) [2024-06-28 05:22:17,915][06909] Updated weights for policy 0, policy_version 156883 (0.0032) [2024-06-28 05:22:18,850][06674] Fps is (10 sec: 44247.3, 60 sec: 43690.6, 300 sec: 44097.9). Total num frames: 2570403840. Throughput: 0: 44114.4. Samples: 2473331240. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 05:22:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:22:21,335][06909] Updated weights for policy 0, policy_version 156893 (0.0026) [2024-06-28 05:22:23,850][06674] Fps is (10 sec: 42606.8, 60 sec: 43691.2, 300 sec: 43986.9). Total num frames: 2570616832. Throughput: 0: 44108.9. Samples: 2473592700. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 05:22:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:22:25,301][06909] Updated weights for policy 0, policy_version 156903 (0.0028) [2024-06-28 05:22:28,844][06909] Updated weights for policy 0, policy_version 156913 (0.0032) [2024-06-28 05:22:28,850][06674] Fps is (10 sec: 45875.8, 60 sec: 44241.3, 300 sec: 44153.5). Total num frames: 2570862592. Throughput: 0: 44114.4. Samples: 2473725200. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 05:22:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:22:32,650][06909] Updated weights for policy 0, policy_version 156923 (0.0032) [2024-06-28 05:22:33,850][06674] Fps is (10 sec: 47514.1, 60 sec: 44238.3, 300 sec: 44153.5). Total num frames: 2571091968. Throughput: 0: 44040.0. Samples: 2473995400. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 05:22:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 05:22:36,066][06909] Updated weights for policy 0, policy_version 156933 (0.0026) [2024-06-28 05:22:38,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2571272192. Throughput: 0: 44133.2. Samples: 2474260740. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 05:22:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:22:40,413][06909] Updated weights for policy 0, policy_version 156943 (0.0031) [2024-06-28 05:22:43,779][06909] Updated weights for policy 0, policy_version 156953 (0.0038) [2024-06-28 05:22:43,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43968.1, 300 sec: 44042.4). Total num frames: 2571517952. Throughput: 0: 44117.9. Samples: 2474388040. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 05:22:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 05:22:47,686][06909] Updated weights for policy 0, policy_version 156963 (0.0028) [2024-06-28 05:22:48,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43690.6, 300 sec: 44097.9). Total num frames: 2571730944. Throughput: 0: 44038.5. Samples: 2474655720. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 05:22:48,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:22:51,148][06909] Updated weights for policy 0, policy_version 156973 (0.0032) [2024-06-28 05:22:53,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2571927552. Throughput: 0: 44220.7. Samples: 2474920460. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 05:22:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 05:22:55,057][06909] Updated weights for policy 0, policy_version 156983 (0.0031) [2024-06-28 05:22:58,624][06909] Updated weights for policy 0, policy_version 156993 (0.0035) [2024-06-28 05:22:58,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43963.9, 300 sec: 44153.5). Total num frames: 2572173312. Throughput: 0: 44074.0. Samples: 2475044960. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 05:22:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:23:02,258][06909] Updated weights for policy 0, policy_version 157003 (0.0024) [2024-06-28 05:23:03,850][06674] Fps is (10 sec: 47513.9, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2572402688. Throughput: 0: 44121.9. Samples: 2475316720. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 05:23:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:23:05,881][06909] Updated weights for policy 0, policy_version 157013 (0.0040) [2024-06-28 05:23:08,850][06674] Fps is (10 sec: 44236.0, 60 sec: 44238.5, 300 sec: 44042.7). Total num frames: 2572615680. Throughput: 0: 44534.6. Samples: 2475596760. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 05:23:08,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:23:09,279][06887] Signal inference workers to stop experience collection... (35200 times) [2024-06-28 05:23:09,279][06887] Signal inference workers to resume experience collection... (35200 times) [2024-06-28 05:23:09,293][06909] InferenceWorker_p0-w0: stopping experience collection (35200 times) [2024-06-28 05:23:09,293][06909] InferenceWorker_p0-w0: resuming experience collection (35200 times) [2024-06-28 05:23:09,766][06909] Updated weights for policy 0, policy_version 157023 (0.0030) [2024-06-28 05:23:13,444][06909] Updated weights for policy 0, policy_version 157033 (0.0031) [2024-06-28 05:23:13,851][06674] Fps is (10 sec: 44232.2, 60 sec: 44237.6, 300 sec: 44097.8). Total num frames: 2572845056. Throughput: 0: 44374.6. Samples: 2475722100. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 05:23:13,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 05:23:16,980][06909] Updated weights for policy 0, policy_version 157043 (0.0030) [2024-06-28 05:23:18,850][06674] Fps is (10 sec: 47514.2, 60 sec: 44783.0, 300 sec: 44209.0). Total num frames: 2573090816. Throughput: 0: 44345.7. Samples: 2475990960. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 05:23:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:23:20,564][06909] Updated weights for policy 0, policy_version 157053 (0.0033) [2024-06-28 05:23:23,852][06674] Fps is (10 sec: 42593.6, 60 sec: 44235.3, 300 sec: 44043.0). Total num frames: 2573271040. Throughput: 0: 44276.7. Samples: 2476253280. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 05:23:23,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:23:24,419][06909] Updated weights for policy 0, policy_version 157063 (0.0035) [2024-06-28 05:23:28,083][06909] Updated weights for policy 0, policy_version 157073 (0.0028) [2024-06-28 05:23:28,850][06674] Fps is (10 sec: 42598.7, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 2573516800. Throughput: 0: 44307.2. Samples: 2476381860. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 05:23:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:23:31,843][06909] Updated weights for policy 0, policy_version 157083 (0.0039) [2024-06-28 05:23:33,850][06674] Fps is (10 sec: 47523.6, 60 sec: 44236.8, 300 sec: 44264.6). Total num frames: 2573746176. Throughput: 0: 44356.6. Samples: 2476651760. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 05:23:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:23:35,416][06909] Updated weights for policy 0, policy_version 157093 (0.0038) [2024-06-28 05:23:38,853][06674] Fps is (10 sec: 44221.1, 60 sec: 44780.4, 300 sec: 44153.0). Total num frames: 2573959168. Throughput: 0: 44541.4. Samples: 2476924980. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 05:23:38,854][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:23:38,975][06909] Updated weights for policy 0, policy_version 157103 (0.0023) [2024-06-28 05:23:42,495][06909] Updated weights for policy 0, policy_version 157113 (0.0046) [2024-06-28 05:23:43,852][06674] Fps is (10 sec: 42589.5, 60 sec: 44235.3, 300 sec: 44097.7). Total num frames: 2574172160. Throughput: 0: 44565.5. Samples: 2477050500. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 05:23:43,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:23:46,719][06909] Updated weights for policy 0, policy_version 157123 (0.0025) [2024-06-28 05:23:48,850][06674] Fps is (10 sec: 45890.7, 60 sec: 44782.9, 300 sec: 44209.0). Total num frames: 2574417920. Throughput: 0: 44540.7. Samples: 2477321060. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 05:23:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:23:48,859][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000157130_2574417920.pth... [2024-06-28 05:23:48,916][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000156483_2563817472.pth [2024-06-28 05:23:50,125][06909] Updated weights for policy 0, policy_version 157133 (0.0032) [2024-06-28 05:23:53,850][06674] Fps is (10 sec: 44245.9, 60 sec: 44782.9, 300 sec: 44153.5). Total num frames: 2574614528. Throughput: 0: 44214.8. Samples: 2477586420. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 05:23:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:23:54,185][06909] Updated weights for policy 0, policy_version 157143 (0.0031) [2024-06-28 05:23:57,632][06909] Updated weights for policy 0, policy_version 157153 (0.0037) [2024-06-28 05:23:58,850][06674] Fps is (10 sec: 40960.7, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2574827520. Throughput: 0: 44191.2. Samples: 2477710660. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 05:23:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:24:01,530][06909] Updated weights for policy 0, policy_version 157163 (0.0031) [2024-06-28 05:24:03,856][06674] Fps is (10 sec: 45847.5, 60 sec: 44505.3, 300 sec: 44263.7). Total num frames: 2575073280. Throughput: 0: 44131.0. Samples: 2477977120. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 05:24:03,856][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:24:05,205][06909] Updated weights for policy 0, policy_version 157173 (0.0030) [2024-06-28 05:24:08,850][06674] Fps is (10 sec: 44236.2, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2575269888. Throughput: 0: 44030.8. Samples: 2478234580. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 05:24:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:24:09,028][06909] Updated weights for policy 0, policy_version 157183 (0.0024) [2024-06-28 05:24:12,521][06909] Updated weights for policy 0, policy_version 157193 (0.0037) [2024-06-28 05:24:13,850][06674] Fps is (10 sec: 40984.8, 60 sec: 43964.4, 300 sec: 44098.0). Total num frames: 2575482880. Throughput: 0: 43988.9. Samples: 2478361360. Policy #0 lag: (min: 1.0, avg: 10.0, max: 21.0) [2024-06-28 05:24:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:24:16,357][06909] Updated weights for policy 0, policy_version 157203 (0.0030) [2024-06-28 05:24:18,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 2575728640. Throughput: 0: 44084.3. Samples: 2478635560. Policy #0 lag: (min: 1.0, avg: 10.0, max: 21.0) [2024-06-28 05:24:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:24:19,685][06909] Updated weights for policy 0, policy_version 157213 (0.0026) [2024-06-28 05:24:23,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44238.3, 300 sec: 44153.5). Total num frames: 2575925248. Throughput: 0: 43865.2. Samples: 2478898760. Policy #0 lag: (min: 1.0, avg: 10.0, max: 21.0) [2024-06-28 05:24:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:24:24,026][06909] Updated weights for policy 0, policy_version 157223 (0.0039) [2024-06-28 05:24:27,419][06909] Updated weights for policy 0, policy_version 157233 (0.0043) [2024-06-28 05:24:27,424][06887] Signal inference workers to stop experience collection... (35250 times) [2024-06-28 05:24:27,425][06887] Signal inference workers to resume experience collection... (35250 times) [2024-06-28 05:24:27,447][06909] InferenceWorker_p0-w0: stopping experience collection (35250 times) [2024-06-28 05:24:27,447][06909] InferenceWorker_p0-w0: resuming experience collection (35250 times) [2024-06-28 05:24:28,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2576138240. Throughput: 0: 43962.5. Samples: 2479028720. Policy #0 lag: (min: 1.0, avg: 10.0, max: 21.0) [2024-06-28 05:24:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:24:31,279][06909] Updated weights for policy 0, policy_version 157243 (0.0035) [2024-06-28 05:24:33,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.6, 300 sec: 44153.5). Total num frames: 2576384000. Throughput: 0: 43848.5. Samples: 2479294240. Policy #0 lag: (min: 1.0, avg: 10.0, max: 21.0) [2024-06-28 05:24:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:24:35,061][06909] Updated weights for policy 0, policy_version 157253 (0.0031) [2024-06-28 05:24:38,635][06909] Updated weights for policy 0, policy_version 157263 (0.0046) [2024-06-28 05:24:38,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43966.3, 300 sec: 44153.5). Total num frames: 2576596992. Throughput: 0: 43651.5. Samples: 2479550740. Policy #0 lag: (min: 1.0, avg: 10.0, max: 21.0) [2024-06-28 05:24:38,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:24:42,239][06909] Updated weights for policy 0, policy_version 157273 (0.0028) [2024-06-28 05:24:43,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43965.2, 300 sec: 44098.0). Total num frames: 2576809984. Throughput: 0: 43883.9. Samples: 2479685440. Policy #0 lag: (min: 1.0, avg: 10.0, max: 21.0) [2024-06-28 05:24:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:24:45,895][06909] Updated weights for policy 0, policy_version 157283 (0.0030) [2024-06-28 05:24:48,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 2577039360. Throughput: 0: 43997.8. Samples: 2479956760. Policy #0 lag: (min: 1.0, avg: 10.0, max: 21.0) [2024-06-28 05:24:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:24:49,627][06909] Updated weights for policy 0, policy_version 157293 (0.0024) [2024-06-28 05:24:53,494][06909] Updated weights for policy 0, policy_version 157303 (0.0039) [2024-06-28 05:24:53,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2577252352. Throughput: 0: 44162.4. Samples: 2480221880. Policy #0 lag: (min: 1.0, avg: 10.0, max: 21.0) [2024-06-28 05:24:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:24:56,900][06909] Updated weights for policy 0, policy_version 157313 (0.0040) [2024-06-28 05:24:58,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2577481728. Throughput: 0: 44308.5. Samples: 2480355240. Policy #0 lag: (min: 1.0, avg: 10.0, max: 21.0) [2024-06-28 05:24:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:25:00,706][06909] Updated weights for policy 0, policy_version 157323 (0.0033) [2024-06-28 05:25:03,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43695.1, 300 sec: 44153.5). Total num frames: 2577694720. Throughput: 0: 44224.5. Samples: 2480625660. Policy #0 lag: (min: 1.0, avg: 10.0, max: 21.0) [2024-06-28 05:25:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:25:04,301][06909] Updated weights for policy 0, policy_version 157333 (0.0027) [2024-06-28 05:25:08,384][06909] Updated weights for policy 0, policy_version 157343 (0.0033) [2024-06-28 05:25:08,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.9, 300 sec: 44154.0). Total num frames: 2577924096. Throughput: 0: 44084.9. Samples: 2480882580. Policy #0 lag: (min: 1.0, avg: 10.0, max: 21.0) [2024-06-28 05:25:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:25:11,942][06909] Updated weights for policy 0, policy_version 157353 (0.0028) [2024-06-28 05:25:13,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2578137088. Throughput: 0: 44145.7. Samples: 2481015280. Policy #0 lag: (min: 1.0, avg: 10.0, max: 21.0) [2024-06-28 05:25:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 05:25:15,950][06909] Updated weights for policy 0, policy_version 157363 (0.0034) [2024-06-28 05:25:18,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.8, 300 sec: 44153.5). Total num frames: 2578350080. Throughput: 0: 44127.2. Samples: 2481279960. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 05:25:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:25:19,410][06909] Updated weights for policy 0, policy_version 157373 (0.0033) [2024-06-28 05:25:23,467][06909] Updated weights for policy 0, policy_version 157383 (0.0035) [2024-06-28 05:25:23,850][06674] Fps is (10 sec: 44237.4, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 2578579456. Throughput: 0: 44032.2. Samples: 2481532180. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 05:25:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:25:26,906][06909] Updated weights for policy 0, policy_version 157393 (0.0026) [2024-06-28 05:25:28,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2578792448. Throughput: 0: 44052.9. Samples: 2481667820. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 05:25:28,859][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:25:30,915][06909] Updated weights for policy 0, policy_version 157403 (0.0022) [2024-06-28 05:25:33,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.8, 300 sec: 44042.4). Total num frames: 2579005440. Throughput: 0: 43971.7. Samples: 2481935480. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 05:25:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:25:34,229][06909] Updated weights for policy 0, policy_version 157413 (0.0035) [2024-06-28 05:25:38,105][06909] Updated weights for policy 0, policy_version 157423 (0.0036) [2024-06-28 05:25:38,856][06674] Fps is (10 sec: 45847.4, 60 sec: 44232.3, 300 sec: 44152.6). Total num frames: 2579251200. Throughput: 0: 43988.6. Samples: 2482201640. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 05:25:38,857][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:25:41,715][06909] Updated weights for policy 0, policy_version 157433 (0.0040) [2024-06-28 05:25:43,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2579447808. Throughput: 0: 43929.4. Samples: 2482332060. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 05:25:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:25:45,805][06909] Updated weights for policy 0, policy_version 157443 (0.0034) [2024-06-28 05:25:48,748][06887] Signal inference workers to stop experience collection... (35300 times) [2024-06-28 05:25:48,802][06887] Signal inference workers to resume experience collection... (35300 times) [2024-06-28 05:25:48,803][06909] InferenceWorker_p0-w0: stopping experience collection (35300 times) [2024-06-28 05:25:48,830][06909] InferenceWorker_p0-w0: resuming experience collection (35300 times) [2024-06-28 05:25:48,850][06674] Fps is (10 sec: 42624.7, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2579677184. Throughput: 0: 43856.0. Samples: 2482599180. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 05:25:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:25:48,932][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000157452_2579693568.pth... [2024-06-28 05:25:48,980][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000156806_2569109504.pth [2024-06-28 05:25:49,279][06909] Updated weights for policy 0, policy_version 157453 (0.0026) [2024-06-28 05:25:53,273][06909] Updated weights for policy 0, policy_version 157463 (0.0033) [2024-06-28 05:25:53,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2579890176. Throughput: 0: 44040.0. Samples: 2482864380. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 05:25:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:25:56,830][06909] Updated weights for policy 0, policy_version 157473 (0.0033) [2024-06-28 05:25:58,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.7, 300 sec: 44042.7). Total num frames: 2580103168. Throughput: 0: 43883.1. Samples: 2482990020. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 05:25:58,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 05:26:00,817][06909] Updated weights for policy 0, policy_version 157483 (0.0028) [2024-06-28 05:26:03,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2580332544. Throughput: 0: 43847.5. Samples: 2483253100. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 05:26:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:26:04,153][06909] Updated weights for policy 0, policy_version 157493 (0.0035) [2024-06-28 05:26:08,032][06909] Updated weights for policy 0, policy_version 157503 (0.0042) [2024-06-28 05:26:08,852][06674] Fps is (10 sec: 45865.6, 60 sec: 43962.2, 300 sec: 44153.2). Total num frames: 2580561920. Throughput: 0: 44159.6. Samples: 2483519460. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 05:26:08,853][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:26:11,393][06909] Updated weights for policy 0, policy_version 157513 (0.0025) [2024-06-28 05:26:13,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43690.8, 300 sec: 43986.9). Total num frames: 2580758528. Throughput: 0: 44002.4. Samples: 2483647920. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 05:26:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:26:15,221][06909] Updated weights for policy 0, policy_version 157523 (0.0032) [2024-06-28 05:26:18,850][06674] Fps is (10 sec: 42607.1, 60 sec: 43963.7, 300 sec: 44042.5). Total num frames: 2580987904. Throughput: 0: 43926.1. Samples: 2483912160. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 05:26:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:26:19,136][06909] Updated weights for policy 0, policy_version 157533 (0.0030) [2024-06-28 05:26:23,062][06909] Updated weights for policy 0, policy_version 157543 (0.0044) [2024-06-28 05:26:23,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43963.6, 300 sec: 44098.9). Total num frames: 2581217280. Throughput: 0: 44007.3. Samples: 2484181700. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2024-06-28 05:26:23,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:26:26,542][06909] Updated weights for policy 0, policy_version 157553 (0.0027) [2024-06-28 05:26:28,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.7, 300 sec: 44042.7). Total num frames: 2581430272. Throughput: 0: 44011.8. Samples: 2484312600. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2024-06-28 05:26:28,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:26:30,411][06909] Updated weights for policy 0, policy_version 157563 (0.0034) [2024-06-28 05:26:33,793][06909] Updated weights for policy 0, policy_version 157573 (0.0031) [2024-06-28 05:26:33,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 2581676032. Throughput: 0: 43904.8. Samples: 2484574900. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2024-06-28 05:26:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:26:37,998][06909] Updated weights for policy 0, policy_version 157583 (0.0030) [2024-06-28 05:26:38,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43695.1, 300 sec: 44043.3). Total num frames: 2581872640. Throughput: 0: 43924.0. Samples: 2484840960. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2024-06-28 05:26:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:26:41,411][06909] Updated weights for policy 0, policy_version 157593 (0.0031) [2024-06-28 05:26:43,850][06674] Fps is (10 sec: 39321.5, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 2582069248. Throughput: 0: 44090.6. Samples: 2484974100. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2024-06-28 05:26:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:26:45,199][06909] Updated weights for policy 0, policy_version 157603 (0.0026) [2024-06-28 05:26:48,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2582315008. Throughput: 0: 44160.5. Samples: 2485240320. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2024-06-28 05:26:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:26:48,878][06909] Updated weights for policy 0, policy_version 157613 (0.0027) [2024-06-28 05:26:52,607][06909] Updated weights for policy 0, policy_version 157623 (0.0040) [2024-06-28 05:26:53,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44236.7, 300 sec: 44098.0). Total num frames: 2582544384. Throughput: 0: 44080.7. Samples: 2485503000. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2024-06-28 05:26:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:26:56,599][06909] Updated weights for policy 0, policy_version 157633 (0.0038) [2024-06-28 05:26:58,827][06887] Signal inference workers to stop experience collection... (35350 times) [2024-06-28 05:26:58,827][06887] Signal inference workers to resume experience collection... (35350 times) [2024-06-28 05:26:58,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 2582773760. Throughput: 0: 44203.5. Samples: 2485637080. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2024-06-28 05:26:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:26:58,868][06909] InferenceWorker_p0-w0: stopping experience collection (35350 times) [2024-06-28 05:26:58,868][06909] InferenceWorker_p0-w0: resuming experience collection (35350 times) [2024-06-28 05:27:00,123][06909] Updated weights for policy 0, policy_version 157643 (0.0036) [2024-06-28 05:27:03,705][06909] Updated weights for policy 0, policy_version 157653 (0.0030) [2024-06-28 05:27:03,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 44153.9). Total num frames: 2582986752. Throughput: 0: 44269.8. Samples: 2485904300. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2024-06-28 05:27:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:27:07,638][06909] Updated weights for policy 0, policy_version 157663 (0.0042) [2024-06-28 05:27:08,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43965.2, 300 sec: 44098.3). Total num frames: 2583199744. Throughput: 0: 43976.5. Samples: 2486160640. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2024-06-28 05:27:08,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:27:11,065][06909] Updated weights for policy 0, policy_version 157673 (0.0032) [2024-06-28 05:27:13,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2583412736. Throughput: 0: 44029.5. Samples: 2486293920. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2024-06-28 05:27:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:27:15,048][06909] Updated weights for policy 0, policy_version 157683 (0.0032) [2024-06-28 05:27:18,477][06909] Updated weights for policy 0, policy_version 157693 (0.0028) [2024-06-28 05:27:18,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 2583642112. Throughput: 0: 44103.0. Samples: 2486559540. Policy #0 lag: (min: 0.0, avg: 10.9, max: 23.0) [2024-06-28 05:27:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:27:22,279][06909] Updated weights for policy 0, policy_version 157703 (0.0033) [2024-06-28 05:27:23,850][06674] Fps is (10 sec: 45874.5, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2583871488. Throughput: 0: 44061.2. Samples: 2486823720. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 05:27:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:27:25,827][06909] Updated weights for policy 0, policy_version 157713 (0.0020) [2024-06-28 05:27:28,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2584084480. Throughput: 0: 44206.5. Samples: 2486963400. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 05:27:28,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:27:29,573][06909] Updated weights for policy 0, policy_version 157723 (0.0042) [2024-06-28 05:27:33,357][06909] Updated weights for policy 0, policy_version 157733 (0.0036) [2024-06-28 05:27:33,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 2584297472. Throughput: 0: 44164.0. Samples: 2487227700. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 05:27:33,850][06674] Avg episode reward: [(0, '0.436')] [2024-06-28 05:27:37,170][06909] Updated weights for policy 0, policy_version 157743 (0.0030) [2024-06-28 05:27:38,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2584526848. Throughput: 0: 44193.7. Samples: 2487491720. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 05:27:38,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:27:40,849][06909] Updated weights for policy 0, policy_version 157753 (0.0042) [2024-06-28 05:27:43,852][06674] Fps is (10 sec: 45865.8, 60 sec: 44781.4, 300 sec: 44153.2). Total num frames: 2584756224. Throughput: 0: 44109.1. Samples: 2487622080. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 05:27:43,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:27:44,638][06909] Updated weights for policy 0, policy_version 157763 (0.0032) [2024-06-28 05:27:48,179][06909] Updated weights for policy 0, policy_version 157773 (0.0034) [2024-06-28 05:27:48,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2584952832. Throughput: 0: 44126.7. Samples: 2487890000. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 05:27:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:27:48,910][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000157774_2584969216.pth... [2024-06-28 05:27:48,967][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000157130_2574417920.pth [2024-06-28 05:27:51,815][06909] Updated weights for policy 0, policy_version 157783 (0.0037) [2024-06-28 05:27:53,850][06674] Fps is (10 sec: 42607.2, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2585182208. Throughput: 0: 44305.4. Samples: 2488154380. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 05:27:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:27:55,870][06909] Updated weights for policy 0, policy_version 157793 (0.0047) [2024-06-28 05:27:58,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2585411584. Throughput: 0: 44187.5. Samples: 2488282360. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 05:27:58,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 05:27:59,134][06909] Updated weights for policy 0, policy_version 157803 (0.0040) [2024-06-28 05:28:03,093][06909] Updated weights for policy 0, policy_version 157813 (0.0040) [2024-06-28 05:28:03,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2585624576. Throughput: 0: 44222.3. Samples: 2488549540. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 05:28:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:28:06,666][06909] Updated weights for policy 0, policy_version 157823 (0.0043) [2024-06-28 05:28:08,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 44042.6). Total num frames: 2585837568. Throughput: 0: 44059.6. Samples: 2488806400. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 05:28:08,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 05:28:10,910][06909] Updated weights for policy 0, policy_version 157833 (0.0033) [2024-06-28 05:28:13,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 2586083328. Throughput: 0: 44095.7. Samples: 2488947700. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 05:28:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:28:14,259][06909] Updated weights for policy 0, policy_version 157843 (0.0035) [2024-06-28 05:28:18,141][06909] Updated weights for policy 0, policy_version 157853 (0.0029) [2024-06-28 05:28:18,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 44098.3). Total num frames: 2586279936. Throughput: 0: 44064.8. Samples: 2489210620. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 05:28:18,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:28:21,555][06909] Updated weights for policy 0, policy_version 157863 (0.0027) [2024-06-28 05:28:23,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2586509312. Throughput: 0: 43945.4. Samples: 2489469260. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 05:28:23,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:28:25,616][06909] Updated weights for policy 0, policy_version 157873 (0.0043) [2024-06-28 05:28:28,833][06909] Updated weights for policy 0, policy_version 157883 (0.0030) [2024-06-28 05:28:28,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44510.0, 300 sec: 44097.9). Total num frames: 2586755072. Throughput: 0: 44186.9. Samples: 2489610400. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 05:28:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:28:32,956][06909] Updated weights for policy 0, policy_version 157893 (0.0040) [2024-06-28 05:28:33,851][06674] Fps is (10 sec: 44229.9, 60 sec: 44235.6, 300 sec: 44042.7). Total num frames: 2586951680. Throughput: 0: 44127.7. Samples: 2489875820. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 05:28:33,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:28:36,190][06909] Updated weights for policy 0, policy_version 157903 (0.0037) [2024-06-28 05:28:38,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43963.8, 300 sec: 44042.7). Total num frames: 2587164672. Throughput: 0: 44172.8. Samples: 2490142160. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 05:28:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:28:40,322][06909] Updated weights for policy 0, policy_version 157913 (0.0026) [2024-06-28 05:28:43,665][06909] Updated weights for policy 0, policy_version 157923 (0.0033) [2024-06-28 05:28:43,850][06674] Fps is (10 sec: 45882.9, 60 sec: 44238.3, 300 sec: 44042.4). Total num frames: 2587410432. Throughput: 0: 44261.9. Samples: 2490274140. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 05:28:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:28:47,155][06887] Signal inference workers to stop experience collection... (35400 times) [2024-06-28 05:28:47,156][06887] Signal inference workers to resume experience collection... (35400 times) [2024-06-28 05:28:47,203][06909] InferenceWorker_p0-w0: stopping experience collection (35400 times) [2024-06-28 05:28:47,204][06909] InferenceWorker_p0-w0: resuming experience collection (35400 times) [2024-06-28 05:28:47,947][06909] Updated weights for policy 0, policy_version 157933 (0.0020) [2024-06-28 05:28:48,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2587607040. Throughput: 0: 44114.3. Samples: 2490534680. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 05:28:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:28:51,171][06909] Updated weights for policy 0, policy_version 157943 (0.0037) [2024-06-28 05:28:53,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 2587852800. Throughput: 0: 44404.9. Samples: 2490804620. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 05:28:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:28:55,156][06909] Updated weights for policy 0, policy_version 157953 (0.0037) [2024-06-28 05:28:58,471][06909] Updated weights for policy 0, policy_version 157963 (0.0030) [2024-06-28 05:28:58,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44236.8, 300 sec: 44043.3). Total num frames: 2588065792. Throughput: 0: 44324.8. Samples: 2490942320. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 05:28:58,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:29:02,815][06909] Updated weights for policy 0, policy_version 157973 (0.0028) [2024-06-28 05:29:03,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2588262400. Throughput: 0: 44333.8. Samples: 2491205640. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 05:29:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:29:05,728][06909] Updated weights for policy 0, policy_version 157983 (0.0027) [2024-06-28 05:29:08,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 2588508160. Throughput: 0: 44384.4. Samples: 2491466560. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 05:29:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:29:10,195][06909] Updated weights for policy 0, policy_version 157993 (0.0033) [2024-06-28 05:29:13,315][06909] Updated weights for policy 0, policy_version 158003 (0.0029) [2024-06-28 05:29:13,850][06674] Fps is (10 sec: 45874.5, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 2588721152. Throughput: 0: 44382.1. Samples: 2491607600. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 05:29:13,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 05:29:17,513][06909] Updated weights for policy 0, policy_version 158013 (0.0022) [2024-06-28 05:29:18,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2588917760. Throughput: 0: 44254.9. Samples: 2491867220. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 05:29:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:29:20,590][06909] Updated weights for policy 0, policy_version 158023 (0.0028) [2024-06-28 05:29:23,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2589163520. Throughput: 0: 44138.6. Samples: 2492128400. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 05:29:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:29:24,739][06909] Updated weights for policy 0, policy_version 158033 (0.0027) [2024-06-28 05:29:28,122][06909] Updated weights for policy 0, policy_version 158043 (0.0040) [2024-06-28 05:29:28,850][06674] Fps is (10 sec: 47514.0, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2589392896. Throughput: 0: 44236.4. Samples: 2492264780. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 05:29:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:29:32,767][06909] Updated weights for policy 0, policy_version 158053 (0.0032) [2024-06-28 05:29:33,850][06674] Fps is (10 sec: 44237.7, 60 sec: 44238.1, 300 sec: 44098.0). Total num frames: 2589605888. Throughput: 0: 44305.9. Samples: 2492528440. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 05:29:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:29:35,451][06909] Updated weights for policy 0, policy_version 158063 (0.0022) [2024-06-28 05:29:38,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2589818880. Throughput: 0: 44052.1. Samples: 2492786960. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 05:29:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:29:40,173][06909] Updated weights for policy 0, policy_version 158073 (0.0033) [2024-06-28 05:29:42,699][06909] Updated weights for policy 0, policy_version 158083 (0.0027) [2024-06-28 05:29:43,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2590031872. Throughput: 0: 44069.9. Samples: 2492925460. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 05:29:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:29:47,466][06909] Updated weights for policy 0, policy_version 158093 (0.0028) [2024-06-28 05:29:48,850][06674] Fps is (10 sec: 42597.7, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 2590244864. Throughput: 0: 44117.6. Samples: 2493190940. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 05:29:48,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:29:48,906][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000158097_2590261248.pth... [2024-06-28 05:29:48,987][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000157452_2579693568.pth [2024-06-28 05:29:50,392][06909] Updated weights for policy 0, policy_version 158103 (0.0034) [2024-06-28 05:29:53,753][06887] Signal inference workers to stop experience collection... (35450 times) [2024-06-28 05:29:53,791][06909] InferenceWorker_p0-w0: stopping experience collection (35450 times) [2024-06-28 05:29:53,812][06887] Signal inference workers to resume experience collection... (35450 times) [2024-06-28 05:29:53,813][06909] InferenceWorker_p0-w0: resuming experience collection (35450 times) [2024-06-28 05:29:53,856][06674] Fps is (10 sec: 45847.6, 60 sec: 43959.4, 300 sec: 44097.1). Total num frames: 2590490624. Throughput: 0: 44088.9. Samples: 2493450820. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 05:29:53,856][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:29:54,681][06909] Updated weights for policy 0, policy_version 158113 (0.0041) [2024-06-28 05:29:57,816][06909] Updated weights for policy 0, policy_version 158123 (0.0033) [2024-06-28 05:29:58,850][06674] Fps is (10 sec: 47513.8, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2590720000. Throughput: 0: 43820.5. Samples: 2493579520. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 05:29:58,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:30:01,866][06909] Updated weights for policy 0, policy_version 158133 (0.0040) [2024-06-28 05:30:03,850][06674] Fps is (10 sec: 40984.5, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2590900224. Throughput: 0: 44016.1. Samples: 2493847940. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 05:30:03,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-28 05:30:05,241][06909] Updated weights for policy 0, policy_version 158143 (0.0030) [2024-06-28 05:30:08,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2591145984. Throughput: 0: 43864.9. Samples: 2494102320. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 05:30:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:30:09,948][06909] Updated weights for policy 0, policy_version 158153 (0.0028) [2024-06-28 05:30:12,980][06909] Updated weights for policy 0, policy_version 158163 (0.0039) [2024-06-28 05:30:13,850][06674] Fps is (10 sec: 47513.3, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 2591375360. Throughput: 0: 43887.5. Samples: 2494239720. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 05:30:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:30:17,089][06909] Updated weights for policy 0, policy_version 158173 (0.0030) [2024-06-28 05:30:18,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2591571968. Throughput: 0: 43979.8. Samples: 2494507540. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 05:30:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:30:20,412][06909] Updated weights for policy 0, policy_version 158183 (0.0040) [2024-06-28 05:30:23,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 2591817728. Throughput: 0: 43889.3. Samples: 2494761980. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 05:30:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:30:24,644][06909] Updated weights for policy 0, policy_version 158193 (0.0037) [2024-06-28 05:30:27,783][06909] Updated weights for policy 0, policy_version 158203 (0.0023) [2024-06-28 05:30:28,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44236.7, 300 sec: 44209.0). Total num frames: 2592047104. Throughput: 0: 43931.0. Samples: 2494902360. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 05:30:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:30:31,799][06909] Updated weights for policy 0, policy_version 158213 (0.0036) [2024-06-28 05:30:33,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 44043.3). Total num frames: 2592243712. Throughput: 0: 43981.1. Samples: 2495170080. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 05:30:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:30:35,293][06909] Updated weights for policy 0, policy_version 158223 (0.0018) [2024-06-28 05:30:38,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2592473088. Throughput: 0: 43884.0. Samples: 2495425340. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 05:30:38,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 05:30:38,981][06909] Updated weights for policy 0, policy_version 158233 (0.0020) [2024-06-28 05:30:42,691][06909] Updated weights for policy 0, policy_version 158243 (0.0028) [2024-06-28 05:30:43,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2592686080. Throughput: 0: 44090.8. Samples: 2495563600. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 05:30:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:30:47,112][06909] Updated weights for policy 0, policy_version 158253 (0.0032) [2024-06-28 05:30:48,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44236.9, 300 sec: 44097.9). Total num frames: 2592899072. Throughput: 0: 44134.2. Samples: 2495833980. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 05:30:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:30:50,063][06909] Updated weights for policy 0, policy_version 158263 (0.0040) [2024-06-28 05:30:53,850][06674] Fps is (10 sec: 44236.0, 60 sec: 43968.0, 300 sec: 44153.5). Total num frames: 2593128448. Throughput: 0: 44047.0. Samples: 2496084440. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 05:30:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:30:54,367][06909] Updated weights for policy 0, policy_version 158273 (0.0033) [2024-06-28 05:30:57,802][06909] Updated weights for policy 0, policy_version 158283 (0.0032) [2024-06-28 05:30:58,856][06674] Fps is (10 sec: 45847.6, 60 sec: 43959.4, 300 sec: 44152.6). Total num frames: 2593357824. Throughput: 0: 44075.9. Samples: 2496223400. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 05:30:58,856][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:31:01,826][06909] Updated weights for policy 0, policy_version 158293 (0.0033) [2024-06-28 05:31:03,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44509.8, 300 sec: 44098.2). Total num frames: 2593570816. Throughput: 0: 44160.4. Samples: 2496494760. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 05:31:03,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:31:05,194][06909] Updated weights for policy 0, policy_version 158303 (0.0040) [2024-06-28 05:31:08,850][06674] Fps is (10 sec: 42623.7, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2593783808. Throughput: 0: 44190.1. Samples: 2496750540. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 05:31:08,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:31:09,059][06909] Updated weights for policy 0, policy_version 158313 (0.0039) [2024-06-28 05:31:10,586][06887] Signal inference workers to stop experience collection... (35500 times) [2024-06-28 05:31:10,587][06887] Signal inference workers to resume experience collection... (35500 times) [2024-06-28 05:31:10,632][06909] InferenceWorker_p0-w0: stopping experience collection (35500 times) [2024-06-28 05:31:10,632][06909] InferenceWorker_p0-w0: resuming experience collection (35500 times) [2024-06-28 05:31:12,928][06909] Updated weights for policy 0, policy_version 158323 (0.0030) [2024-06-28 05:31:13,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2594013184. Throughput: 0: 44004.0. Samples: 2496882540. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 05:31:13,860][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:31:16,334][06909] Updated weights for policy 0, policy_version 158333 (0.0031) [2024-06-28 05:31:18,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 2594242560. Throughput: 0: 44146.2. Samples: 2497156660. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 05:31:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:31:19,972][06909] Updated weights for policy 0, policy_version 158343 (0.0027) [2024-06-28 05:31:23,470][06909] Updated weights for policy 0, policy_version 158353 (0.0030) [2024-06-28 05:31:23,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2594455552. Throughput: 0: 44247.7. Samples: 2497416480. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 05:31:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:31:27,254][06909] Updated weights for policy 0, policy_version 158363 (0.0026) [2024-06-28 05:31:28,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 2594684928. Throughput: 0: 44141.3. Samples: 2497549960. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 05:31:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:31:30,649][06909] Updated weights for policy 0, policy_version 158373 (0.0026) [2024-06-28 05:31:33,852][06674] Fps is (10 sec: 44227.6, 60 sec: 44235.3, 300 sec: 44153.2). Total num frames: 2594897920. Throughput: 0: 44142.9. Samples: 2497820500. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 05:31:33,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:31:34,845][06909] Updated weights for policy 0, policy_version 158383 (0.0029) [2024-06-28 05:31:38,768][06909] Updated weights for policy 0, policy_version 158393 (0.0034) [2024-06-28 05:31:38,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 2595110912. Throughput: 0: 44380.1. Samples: 2498081540. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 05:31:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:31:41,999][06909] Updated weights for policy 0, policy_version 158403 (0.0027) [2024-06-28 05:31:43,850][06674] Fps is (10 sec: 44245.5, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 2595340288. Throughput: 0: 44226.8. Samples: 2498213340. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-28 05:31:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:31:46,044][06909] Updated weights for policy 0, policy_version 158413 (0.0032) [2024-06-28 05:31:48,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2595553280. Throughput: 0: 44069.8. Samples: 2498477900. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-28 05:31:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:31:48,858][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000158420_2595553280.pth... [2024-06-28 05:31:48,942][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000157774_2584969216.pth [2024-06-28 05:31:49,765][06909] Updated weights for policy 0, policy_version 158423 (0.0046) [2024-06-28 05:31:53,417][06909] Updated weights for policy 0, policy_version 158433 (0.0037) [2024-06-28 05:31:53,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2595766272. Throughput: 0: 44104.8. Samples: 2498735260. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-28 05:31:53,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:31:57,143][06909] Updated weights for policy 0, policy_version 158443 (0.0031) [2024-06-28 05:31:58,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43968.1, 300 sec: 44097.9). Total num frames: 2595995648. Throughput: 0: 44079.5. Samples: 2498866120. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-28 05:31:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:32:00,859][06909] Updated weights for policy 0, policy_version 158453 (0.0033) [2024-06-28 05:32:03,850][06674] Fps is (10 sec: 44237.9, 60 sec: 43963.9, 300 sec: 44098.0). Total num frames: 2596208640. Throughput: 0: 43978.4. Samples: 2499135680. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-28 05:32:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:32:04,380][06909] Updated weights for policy 0, policy_version 158463 (0.0022) [2024-06-28 05:32:08,415][06909] Updated weights for policy 0, policy_version 158473 (0.0020) [2024-06-28 05:32:08,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2596421632. Throughput: 0: 43925.3. Samples: 2499393120. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-28 05:32:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:32:12,252][06909] Updated weights for policy 0, policy_version 158483 (0.0028) [2024-06-28 05:32:13,850][06674] Fps is (10 sec: 44235.9, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2596651008. Throughput: 0: 43886.1. Samples: 2499524840. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-28 05:32:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:32:16,169][06909] Updated weights for policy 0, policy_version 158493 (0.0028) [2024-06-28 05:32:18,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2596880384. Throughput: 0: 43849.0. Samples: 2499793620. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-28 05:32:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:32:19,775][06909] Updated weights for policy 0, policy_version 158503 (0.0028) [2024-06-28 05:32:23,332][06909] Updated weights for policy 0, policy_version 158513 (0.0033) [2024-06-28 05:32:23,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.6, 300 sec: 44098.0). Total num frames: 2597093376. Throughput: 0: 43846.7. Samples: 2500054640. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-28 05:32:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:32:27,151][06909] Updated weights for policy 0, policy_version 158523 (0.0037) [2024-06-28 05:32:28,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2597322752. Throughput: 0: 43855.6. Samples: 2500186840. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-28 05:32:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:32:30,890][06909] Updated weights for policy 0, policy_version 158533 (0.0034) [2024-06-28 05:32:33,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43965.2, 300 sec: 44098.0). Total num frames: 2597535744. Throughput: 0: 43964.5. Samples: 2500456300. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-28 05:32:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:32:34,446][06909] Updated weights for policy 0, policy_version 158543 (0.0032) [2024-06-28 05:32:38,432][06909] Updated weights for policy 0, policy_version 158553 (0.0021) [2024-06-28 05:32:38,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43690.6, 300 sec: 43987.2). Total num frames: 2597732352. Throughput: 0: 44231.6. Samples: 2500725680. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-28 05:32:38,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:32:39,254][06887] Signal inference workers to stop experience collection... (35550 times) [2024-06-28 05:32:39,295][06909] InferenceWorker_p0-w0: stopping experience collection (35550 times) [2024-06-28 05:32:39,303][06887] Signal inference workers to resume experience collection... (35550 times) [2024-06-28 05:32:39,315][06909] InferenceWorker_p0-w0: resuming experience collection (35550 times) [2024-06-28 05:32:41,556][06909] Updated weights for policy 0, policy_version 158563 (0.0038) [2024-06-28 05:32:43,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2597978112. Throughput: 0: 44189.3. Samples: 2500854640. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-28 05:32:43,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:32:45,979][06909] Updated weights for policy 0, policy_version 158573 (0.0039) [2024-06-28 05:32:48,852][06674] Fps is (10 sec: 47504.3, 60 sec: 44235.3, 300 sec: 44153.2). Total num frames: 2598207488. Throughput: 0: 44047.7. Samples: 2501117920. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 05:32:48,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:32:49,033][06909] Updated weights for policy 0, policy_version 158583 (0.0027) [2024-06-28 05:32:53,355][06909] Updated weights for policy 0, policy_version 158593 (0.0026) [2024-06-28 05:32:53,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2598420480. Throughput: 0: 44371.1. Samples: 2501389820. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 05:32:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:32:56,509][06909] Updated weights for policy 0, policy_version 158603 (0.0050) [2024-06-28 05:32:58,853][06674] Fps is (10 sec: 40956.2, 60 sec: 43688.5, 300 sec: 44042.0). Total num frames: 2598617088. Throughput: 0: 44061.2. Samples: 2501507720. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 05:32:58,853][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:33:00,810][06909] Updated weights for policy 0, policy_version 158613 (0.0032) [2024-06-28 05:33:03,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 2598862848. Throughput: 0: 43982.7. Samples: 2501772840. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 05:33:03,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 05:33:04,422][06909] Updated weights for policy 0, policy_version 158623 (0.0026) [2024-06-28 05:33:07,957][06909] Updated weights for policy 0, policy_version 158633 (0.0046) [2024-06-28 05:33:08,850][06674] Fps is (10 sec: 42611.4, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 2599043072. Throughput: 0: 44019.7. Samples: 2502035520. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 05:33:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:33:11,493][06909] Updated weights for policy 0, policy_version 158643 (0.0037) [2024-06-28 05:33:13,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.9, 300 sec: 44098.0). Total num frames: 2599288832. Throughput: 0: 44043.6. Samples: 2502168800. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 05:33:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:33:15,949][06909] Updated weights for policy 0, policy_version 158653 (0.0031) [2024-06-28 05:33:18,850][06674] Fps is (10 sec: 47513.6, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2599518208. Throughput: 0: 43933.9. Samples: 2502433320. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 05:33:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:33:18,958][06909] Updated weights for policy 0, policy_version 158663 (0.0046) [2024-06-28 05:33:23,529][06909] Updated weights for policy 0, policy_version 158673 (0.0037) [2024-06-28 05:33:23,850][06674] Fps is (10 sec: 40959.2, 60 sec: 43417.5, 300 sec: 43875.8). Total num frames: 2599698432. Throughput: 0: 43888.9. Samples: 2502700680. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 05:33:23,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:33:26,497][06909] Updated weights for policy 0, policy_version 158683 (0.0025) [2024-06-28 05:33:28,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 44042.7). Total num frames: 2599944192. Throughput: 0: 43828.1. Samples: 2502826900. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 05:33:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:33:30,642][06909] Updated weights for policy 0, policy_version 158693 (0.0030) [2024-06-28 05:33:33,822][06909] Updated weights for policy 0, policy_version 158703 (0.0047) [2024-06-28 05:33:33,850][06674] Fps is (10 sec: 49152.0, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 2600189952. Throughput: 0: 43902.8. Samples: 2503093460. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 05:33:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:33:37,909][06909] Updated weights for policy 0, policy_version 158713 (0.0038) [2024-06-28 05:33:38,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 2600386560. Throughput: 0: 43777.8. Samples: 2503359820. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 05:33:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 05:33:41,346][06909] Updated weights for policy 0, policy_version 158723 (0.0023) [2024-06-28 05:33:43,850][06674] Fps is (10 sec: 40961.0, 60 sec: 43690.8, 300 sec: 44042.4). Total num frames: 2600599552. Throughput: 0: 44142.2. Samples: 2503493980. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 05:33:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:33:45,356][06909] Updated weights for policy 0, policy_version 158733 (0.0027) [2024-06-28 05:33:48,546][06909] Updated weights for policy 0, policy_version 158743 (0.0029) [2024-06-28 05:33:48,850][06674] Fps is (10 sec: 47513.0, 60 sec: 44238.3, 300 sec: 44097.9). Total num frames: 2600861696. Throughput: 0: 44180.8. Samples: 2503760980. Policy #0 lag: (min: 1.0, avg: 11.1, max: 23.0) [2024-06-28 05:33:48,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:33:48,869][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000158744_2600861696.pth... [2024-06-28 05:33:48,922][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000158097_2590261248.pth [2024-06-28 05:33:52,822][06909] Updated weights for policy 0, policy_version 158753 (0.0027) [2024-06-28 05:33:53,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2601058304. Throughput: 0: 44293.3. Samples: 2504028720. Policy #0 lag: (min: 1.0, avg: 11.1, max: 23.0) [2024-06-28 05:33:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:33:56,043][06909] Updated weights for policy 0, policy_version 158763 (0.0041) [2024-06-28 05:33:58,850][06674] Fps is (10 sec: 42598.8, 60 sec: 44512.1, 300 sec: 44153.5). Total num frames: 2601287680. Throughput: 0: 44155.1. Samples: 2504155780. Policy #0 lag: (min: 1.0, avg: 11.1, max: 23.0) [2024-06-28 05:33:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:34:00,540][06887] Signal inference workers to stop experience collection... (35600 times) [2024-06-28 05:34:00,582][06909] InferenceWorker_p0-w0: stopping experience collection (35600 times) [2024-06-28 05:34:00,589][06887] Signal inference workers to resume experience collection... (35600 times) [2024-06-28 05:34:00,605][06909] InferenceWorker_p0-w0: resuming experience collection (35600 times) [2024-06-28 05:34:00,613][06909] Updated weights for policy 0, policy_version 158773 (0.0033) [2024-06-28 05:34:03,649][06909] Updated weights for policy 0, policy_version 158783 (0.0039) [2024-06-28 05:34:03,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2601500672. Throughput: 0: 44073.2. Samples: 2504416620. Policy #0 lag: (min: 1.0, avg: 11.1, max: 23.0) [2024-06-28 05:34:03,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:34:07,834][06909] Updated weights for policy 0, policy_version 158793 (0.0032) [2024-06-28 05:34:08,852][06674] Fps is (10 sec: 44227.7, 60 sec: 44781.4, 300 sec: 44097.7). Total num frames: 2601730048. Throughput: 0: 44096.4. Samples: 2504685100. Policy #0 lag: (min: 1.0, avg: 11.1, max: 23.0) [2024-06-28 05:34:08,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:34:10,785][06909] Updated weights for policy 0, policy_version 158803 (0.0025) [2024-06-28 05:34:13,850][06674] Fps is (10 sec: 44237.5, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2601943040. Throughput: 0: 44201.7. Samples: 2504815980. Policy #0 lag: (min: 1.0, avg: 11.1, max: 23.0) [2024-06-28 05:34:13,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-28 05:34:15,198][06909] Updated weights for policy 0, policy_version 158813 (0.0036) [2024-06-28 05:34:18,291][06909] Updated weights for policy 0, policy_version 158823 (0.0031) [2024-06-28 05:34:18,850][06674] Fps is (10 sec: 45884.6, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 2602188800. Throughput: 0: 44229.5. Samples: 2505083780. Policy #0 lag: (min: 1.0, avg: 11.1, max: 23.0) [2024-06-28 05:34:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:34:22,698][06909] Updated weights for policy 0, policy_version 158833 (0.0040) [2024-06-28 05:34:23,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44783.1, 300 sec: 44042.4). Total num frames: 2602385408. Throughput: 0: 44204.5. Samples: 2505349020. Policy #0 lag: (min: 1.0, avg: 11.1, max: 23.0) [2024-06-28 05:34:23,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 05:34:25,808][06909] Updated weights for policy 0, policy_version 158843 (0.0033) [2024-06-28 05:34:28,850][06674] Fps is (10 sec: 40959.6, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2602598400. Throughput: 0: 43939.4. Samples: 2505471260. Policy #0 lag: (min: 1.0, avg: 11.1, max: 23.0) [2024-06-28 05:34:28,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:34:29,993][06909] Updated weights for policy 0, policy_version 158853 (0.0032) [2024-06-28 05:34:33,134][06909] Updated weights for policy 0, policy_version 158863 (0.0036) [2024-06-28 05:34:33,850][06674] Fps is (10 sec: 45874.6, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 2602844160. Throughput: 0: 44133.8. Samples: 2505747000. Policy #0 lag: (min: 1.0, avg: 11.1, max: 23.0) [2024-06-28 05:34:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:34:37,188][06909] Updated weights for policy 0, policy_version 158873 (0.0037) [2024-06-28 05:34:38,850][06674] Fps is (10 sec: 45875.8, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 2603057152. Throughput: 0: 44219.2. Samples: 2506018580. Policy #0 lag: (min: 1.0, avg: 11.1, max: 23.0) [2024-06-28 05:34:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:34:40,501][06909] Updated weights for policy 0, policy_version 158883 (0.0033) [2024-06-28 05:34:43,850][06674] Fps is (10 sec: 40960.2, 60 sec: 44236.7, 300 sec: 44098.0). Total num frames: 2603253760. Throughput: 0: 44267.6. Samples: 2506147820. Policy #0 lag: (min: 1.0, avg: 11.1, max: 23.0) [2024-06-28 05:34:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 05:34:44,577][06909] Updated weights for policy 0, policy_version 158893 (0.0037) [2024-06-28 05:34:47,798][06909] Updated weights for policy 0, policy_version 158903 (0.0030) [2024-06-28 05:34:48,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.8, 300 sec: 44098.8). Total num frames: 2603499520. Throughput: 0: 44438.3. Samples: 2506416340. Policy #0 lag: (min: 1.0, avg: 11.1, max: 23.0) [2024-06-28 05:34:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:34:51,834][06909] Updated weights for policy 0, policy_version 158913 (0.0041) [2024-06-28 05:34:53,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2603696128. Throughput: 0: 44358.4. Samples: 2506681140. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 05:34:53,854][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:34:55,453][06909] Updated weights for policy 0, policy_version 158923 (0.0034) [2024-06-28 05:34:58,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2603925504. Throughput: 0: 44229.8. Samples: 2506806320. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 05:34:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:34:59,482][06909] Updated weights for policy 0, policy_version 158933 (0.0040) [2024-06-28 05:35:02,812][06909] Updated weights for policy 0, policy_version 158943 (0.0026) [2024-06-28 05:35:03,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2604154880. Throughput: 0: 44257.2. Samples: 2507075360. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 05:35:03,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:35:06,695][06909] Updated weights for policy 0, policy_version 158953 (0.0038) [2024-06-28 05:35:08,850][06674] Fps is (10 sec: 44236.0, 60 sec: 43965.2, 300 sec: 44042.4). Total num frames: 2604367872. Throughput: 0: 44265.6. Samples: 2507340980. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 05:35:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 05:35:10,189][06909] Updated weights for policy 0, policy_version 158963 (0.0029) [2024-06-28 05:35:13,852][06674] Fps is (10 sec: 44228.0, 60 sec: 44235.2, 300 sec: 44153.2). Total num frames: 2604597248. Throughput: 0: 44492.2. Samples: 2507473500. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 05:35:13,853][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:35:14,071][06909] Updated weights for policy 0, policy_version 158973 (0.0036) [2024-06-28 05:35:17,571][06909] Updated weights for policy 0, policy_version 158983 (0.0031) [2024-06-28 05:35:18,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2604810240. Throughput: 0: 44206.2. Samples: 2507736280. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 05:35:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:35:21,733][06909] Updated weights for policy 0, policy_version 158993 (0.0030) [2024-06-28 05:35:23,850][06674] Fps is (10 sec: 40968.6, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 2605006848. Throughput: 0: 44073.3. Samples: 2508001880. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 05:35:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:35:25,267][06909] Updated weights for policy 0, policy_version 159003 (0.0032) [2024-06-28 05:35:28,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.9, 300 sec: 44097.9). Total num frames: 2605252608. Throughput: 0: 44078.2. Samples: 2508131340. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 05:35:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:35:28,936][06909] Updated weights for policy 0, policy_version 159013 (0.0029) [2024-06-28 05:35:32,852][06909] Updated weights for policy 0, policy_version 159023 (0.0035) [2024-06-28 05:35:33,850][06674] Fps is (10 sec: 47513.4, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2605481984. Throughput: 0: 43965.3. Samples: 2508394780. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 05:35:33,850][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 05:35:36,016][06887] Signal inference workers to stop experience collection... (35650 times) [2024-06-28 05:35:36,016][06887] Signal inference workers to resume experience collection... (35650 times) [2024-06-28 05:35:36,058][06909] InferenceWorker_p0-w0: stopping experience collection (35650 times) [2024-06-28 05:35:36,058][06909] InferenceWorker_p0-w0: resuming experience collection (35650 times) [2024-06-28 05:35:36,413][06909] Updated weights for policy 0, policy_version 159033 (0.0035) [2024-06-28 05:35:38,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2605678592. Throughput: 0: 43982.8. Samples: 2508660360. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 05:35:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:35:40,143][06909] Updated weights for policy 0, policy_version 159043 (0.0034) [2024-06-28 05:35:43,703][06909] Updated weights for policy 0, policy_version 159053 (0.0027) [2024-06-28 05:35:43,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 2605924352. Throughput: 0: 44120.8. Samples: 2508791760. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 05:35:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:35:47,497][06909] Updated weights for policy 0, policy_version 159063 (0.0037) [2024-06-28 05:35:48,850][06674] Fps is (10 sec: 47513.0, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2606153728. Throughput: 0: 44052.9. Samples: 2509057740. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 05:35:48,851][06674] Avg episode reward: [(0, '0.405')] [2024-06-28 05:35:48,877][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000159067_2606153728.pth... [2024-06-28 05:35:48,929][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000158420_2595553280.pth [2024-06-28 05:35:51,375][06909] Updated weights for policy 0, policy_version 159073 (0.0041) [2024-06-28 05:35:53,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44509.9, 300 sec: 44098.8). Total num frames: 2606366720. Throughput: 0: 44058.3. Samples: 2509323600. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 05:35:53,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:35:54,853][06909] Updated weights for policy 0, policy_version 159083 (0.0035) [2024-06-28 05:35:58,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 2606563328. Throughput: 0: 44063.8. Samples: 2509456280. Policy #0 lag: (min: 1.0, avg: 10.3, max: 21.0) [2024-06-28 05:35:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:35:58,917][06909] Updated weights for policy 0, policy_version 159093 (0.0027) [2024-06-28 05:36:02,436][06909] Updated weights for policy 0, policy_version 159103 (0.0030) [2024-06-28 05:36:03,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43963.9, 300 sec: 44098.0). Total num frames: 2606792704. Throughput: 0: 44050.8. Samples: 2509718560. Policy #0 lag: (min: 1.0, avg: 10.3, max: 21.0) [2024-06-28 05:36:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:36:06,221][06909] Updated weights for policy 0, policy_version 159113 (0.0026) [2024-06-28 05:36:08,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2607005696. Throughput: 0: 43958.2. Samples: 2509980000. Policy #0 lag: (min: 1.0, avg: 10.3, max: 21.0) [2024-06-28 05:36:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:36:09,999][06909] Updated weights for policy 0, policy_version 159123 (0.0030) [2024-06-28 05:36:13,467][06909] Updated weights for policy 0, policy_version 159133 (0.0029) [2024-06-28 05:36:13,850][06674] Fps is (10 sec: 45874.2, 60 sec: 44238.3, 300 sec: 44097.9). Total num frames: 2607251456. Throughput: 0: 44016.8. Samples: 2510112100. Policy #0 lag: (min: 1.0, avg: 10.3, max: 21.0) [2024-06-28 05:36:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:36:17,554][06909] Updated weights for policy 0, policy_version 159143 (0.0027) [2024-06-28 05:36:18,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2607448064. Throughput: 0: 44053.4. Samples: 2510377180. Policy #0 lag: (min: 1.0, avg: 10.3, max: 21.0) [2024-06-28 05:36:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:36:21,241][06909] Updated weights for policy 0, policy_version 159153 (0.0028) [2024-06-28 05:36:23,850][06674] Fps is (10 sec: 40960.5, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2607661056. Throughput: 0: 43949.3. Samples: 2510638080. Policy #0 lag: (min: 1.0, avg: 10.3, max: 21.0) [2024-06-28 05:36:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:36:24,768][06909] Updated weights for policy 0, policy_version 159163 (0.0027) [2024-06-28 05:36:28,583][06909] Updated weights for policy 0, policy_version 159173 (0.0041) [2024-06-28 05:36:28,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.7, 300 sec: 44042.7). Total num frames: 2607890432. Throughput: 0: 44043.0. Samples: 2510773700. Policy #0 lag: (min: 1.0, avg: 10.3, max: 21.0) [2024-06-28 05:36:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:36:32,135][06909] Updated weights for policy 0, policy_version 159183 (0.0029) [2024-06-28 05:36:33,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2608103424. Throughput: 0: 43903.6. Samples: 2511033400. Policy #0 lag: (min: 1.0, avg: 10.3, max: 21.0) [2024-06-28 05:36:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:36:35,954][06909] Updated weights for policy 0, policy_version 159193 (0.0037) [2024-06-28 05:36:38,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2608332800. Throughput: 0: 43944.9. Samples: 2511301120. Policy #0 lag: (min: 1.0, avg: 10.3, max: 21.0) [2024-06-28 05:36:38,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:36:39,747][06909] Updated weights for policy 0, policy_version 159203 (0.0039) [2024-06-28 05:36:43,260][06909] Updated weights for policy 0, policy_version 159213 (0.0038) [2024-06-28 05:36:43,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2608562176. Throughput: 0: 43921.4. Samples: 2511432740. Policy #0 lag: (min: 1.0, avg: 10.3, max: 21.0) [2024-06-28 05:36:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:36:46,934][06909] Updated weights for policy 0, policy_version 159223 (0.0026) [2024-06-28 05:36:48,856][06674] Fps is (10 sec: 44210.0, 60 sec: 43686.3, 300 sec: 44097.1). Total num frames: 2608775168. Throughput: 0: 43878.4. Samples: 2511693360. Policy #0 lag: (min: 1.0, avg: 10.3, max: 21.0) [2024-06-28 05:36:48,857][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:36:50,622][06909] Updated weights for policy 0, policy_version 159233 (0.0027) [2024-06-28 05:36:53,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2608988160. Throughput: 0: 44031.6. Samples: 2511961420. Policy #0 lag: (min: 1.0, avg: 10.3, max: 21.0) [2024-06-28 05:36:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:36:54,489][06909] Updated weights for policy 0, policy_version 159243 (0.0042) [2024-06-28 05:36:58,147][06909] Updated weights for policy 0, policy_version 159253 (0.0043) [2024-06-28 05:36:58,850][06674] Fps is (10 sec: 42623.6, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 2609201152. Throughput: 0: 43895.9. Samples: 2512087420. Policy #0 lag: (min: 1.0, avg: 10.3, max: 21.0) [2024-06-28 05:36:58,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:36:58,984][06887] Signal inference workers to stop experience collection... (35700 times) [2024-06-28 05:36:59,019][06909] InferenceWorker_p0-w0: stopping experience collection (35700 times) [2024-06-28 05:36:59,042][06887] Signal inference workers to resume experience collection... (35700 times) [2024-06-28 05:36:59,043][06909] InferenceWorker_p0-w0: resuming experience collection (35700 times) [2024-06-28 05:37:01,721][06909] Updated weights for policy 0, policy_version 159263 (0.0034) [2024-06-28 05:37:03,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2609430528. Throughput: 0: 43892.5. Samples: 2512352340. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 05:37:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:37:05,534][06909] Updated weights for policy 0, policy_version 159273 (0.0032) [2024-06-28 05:37:08,850][06674] Fps is (10 sec: 45876.0, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2609659904. Throughput: 0: 44135.5. Samples: 2512624180. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 05:37:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:37:09,319][06909] Updated weights for policy 0, policy_version 159283 (0.0025) [2024-06-28 05:37:13,002][06909] Updated weights for policy 0, policy_version 159293 (0.0039) [2024-06-28 05:37:13,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2609889280. Throughput: 0: 43932.1. Samples: 2512750640. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 05:37:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:37:16,962][06909] Updated weights for policy 0, policy_version 159303 (0.0033) [2024-06-28 05:37:18,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2610085888. Throughput: 0: 44094.7. Samples: 2513017660. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 05:37:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:37:20,412][06909] Updated weights for policy 0, policy_version 159313 (0.0022) [2024-06-28 05:37:23,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44509.8, 300 sec: 44098.0). Total num frames: 2610331648. Throughput: 0: 43998.2. Samples: 2513281040. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 05:37:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:37:24,200][06909] Updated weights for policy 0, policy_version 159323 (0.0030) [2024-06-28 05:37:28,239][06909] Updated weights for policy 0, policy_version 159333 (0.0034) [2024-06-28 05:37:28,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2610544640. Throughput: 0: 44052.3. Samples: 2513415100. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 05:37:28,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:37:31,659][06909] Updated weights for policy 0, policy_version 159343 (0.0028) [2024-06-28 05:37:33,850][06674] Fps is (10 sec: 42598.8, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2610757632. Throughput: 0: 44139.4. Samples: 2513679360. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 05:37:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:37:35,585][06909] Updated weights for policy 0, policy_version 159353 (0.0036) [2024-06-28 05:37:38,851][06674] Fps is (10 sec: 44233.8, 60 sec: 44236.3, 300 sec: 44097.9). Total num frames: 2610987008. Throughput: 0: 44052.6. Samples: 2513943820. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 05:37:38,851][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 05:37:38,891][06909] Updated weights for policy 0, policy_version 159363 (0.0026) [2024-06-28 05:37:42,806][06909] Updated weights for policy 0, policy_version 159373 (0.0027) [2024-06-28 05:37:43,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 44042.7). Total num frames: 2611200000. Throughput: 0: 44214.9. Samples: 2514077080. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 05:37:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:37:46,251][06909] Updated weights for policy 0, policy_version 159383 (0.0045) [2024-06-28 05:37:48,850][06674] Fps is (10 sec: 42601.5, 60 sec: 43968.2, 300 sec: 44042.4). Total num frames: 2611412992. Throughput: 0: 44242.6. Samples: 2514343260. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 05:37:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:37:48,882][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000159389_2611429376.pth... [2024-06-28 05:37:48,937][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000158744_2600861696.pth [2024-06-28 05:37:50,348][06909] Updated weights for policy 0, policy_version 159393 (0.0037) [2024-06-28 05:37:53,850][06674] Fps is (10 sec: 44235.4, 60 sec: 44236.6, 300 sec: 44153.9). Total num frames: 2611642368. Throughput: 0: 43872.6. Samples: 2514598460. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 05:37:53,856][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:37:54,034][06909] Updated weights for policy 0, policy_version 159403 (0.0039) [2024-06-28 05:37:57,547][06909] Updated weights for policy 0, policy_version 159413 (0.0029) [2024-06-28 05:37:58,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 2611855360. Throughput: 0: 44114.2. Samples: 2514735780. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 05:37:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:38:01,089][06909] Updated weights for policy 0, policy_version 159423 (0.0039) [2024-06-28 05:38:03,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.4, 300 sec: 44097.9). Total num frames: 2612051968. Throughput: 0: 44081.0. Samples: 2515001320. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 05:38:03,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:38:04,845][06909] Updated weights for policy 0, policy_version 159433 (0.0024) [2024-06-28 05:38:08,561][06909] Updated weights for policy 0, policy_version 159443 (0.0024) [2024-06-28 05:38:08,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2612314112. Throughput: 0: 44118.2. Samples: 2515266360. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 05:38:08,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:38:12,755][06909] Updated weights for policy 0, policy_version 159453 (0.0037) [2024-06-28 05:38:13,850][06674] Fps is (10 sec: 47514.7, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2612527104. Throughput: 0: 44212.5. Samples: 2515404660. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 05:38:13,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:38:15,845][06909] Updated weights for policy 0, policy_version 159463 (0.0039) [2024-06-28 05:38:17,746][06887] Signal inference workers to stop experience collection... (35750 times) [2024-06-28 05:38:17,747][06887] Signal inference workers to resume experience collection... (35750 times) [2024-06-28 05:38:17,787][06909] InferenceWorker_p0-w0: stopping experience collection (35750 times) [2024-06-28 05:38:17,787][06909] InferenceWorker_p0-w0: resuming experience collection (35750 times) [2024-06-28 05:38:18,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2612723712. Throughput: 0: 44109.8. Samples: 2515664300. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 05:38:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:38:20,014][06909] Updated weights for policy 0, policy_version 159473 (0.0036) [2024-06-28 05:38:23,516][06909] Updated weights for policy 0, policy_version 159483 (0.0035) [2024-06-28 05:38:23,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2612969472. Throughput: 0: 43918.1. Samples: 2515920100. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 05:38:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 05:38:27,417][06909] Updated weights for policy 0, policy_version 159493 (0.0042) [2024-06-28 05:38:28,853][06674] Fps is (10 sec: 44220.5, 60 sec: 43688.1, 300 sec: 43986.4). Total num frames: 2613166080. Throughput: 0: 43968.4. Samples: 2516055820. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 05:38:28,854][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:38:30,930][06909] Updated weights for policy 0, policy_version 159503 (0.0025) [2024-06-28 05:38:33,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2613395456. Throughput: 0: 43927.6. Samples: 2516320000. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 05:38:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 05:38:34,906][06909] Updated weights for policy 0, policy_version 159513 (0.0035) [2024-06-28 05:38:38,539][06909] Updated weights for policy 0, policy_version 159523 (0.0030) [2024-06-28 05:38:38,852][06674] Fps is (10 sec: 45880.4, 60 sec: 43962.5, 300 sec: 44153.1). Total num frames: 2613624832. Throughput: 0: 43947.6. Samples: 2516576200. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 05:38:38,853][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:38:42,347][06909] Updated weights for policy 0, policy_version 159533 (0.0051) [2024-06-28 05:38:43,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 2613821440. Throughput: 0: 44016.9. Samples: 2516716540. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 05:38:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:38:45,889][06909] Updated weights for policy 0, policy_version 159543 (0.0049) [2024-06-28 05:38:48,850][06674] Fps is (10 sec: 40970.0, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 2614034432. Throughput: 0: 43851.8. Samples: 2516974640. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 05:38:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:38:49,951][06909] Updated weights for policy 0, policy_version 159553 (0.0021) [2024-06-28 05:38:53,183][06909] Updated weights for policy 0, policy_version 159563 (0.0037) [2024-06-28 05:38:53,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.9, 300 sec: 44042.4). Total num frames: 2614280192. Throughput: 0: 43741.8. Samples: 2517234740. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 05:38:53,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 05:38:57,510][06909] Updated weights for policy 0, policy_version 159573 (0.0039) [2024-06-28 05:38:58,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2614476800. Throughput: 0: 43777.0. Samples: 2517374620. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 05:38:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:39:00,767][06909] Updated weights for policy 0, policy_version 159583 (0.0036) [2024-06-28 05:39:03,850][06674] Fps is (10 sec: 42597.8, 60 sec: 44236.9, 300 sec: 43987.2). Total num frames: 2614706176. Throughput: 0: 43784.7. Samples: 2517634620. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 05:39:03,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:39:04,795][06909] Updated weights for policy 0, policy_version 159593 (0.0034) [2024-06-28 05:39:08,230][06909] Updated weights for policy 0, policy_version 159603 (0.0027) [2024-06-28 05:39:08,852][06674] Fps is (10 sec: 45865.8, 60 sec: 43689.2, 300 sec: 44042.1). Total num frames: 2614935552. Throughput: 0: 43784.7. Samples: 2517890500. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 05:39:08,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:39:12,475][06909] Updated weights for policy 0, policy_version 159613 (0.0031) [2024-06-28 05:39:13,852][06674] Fps is (10 sec: 44228.2, 60 sec: 43689.2, 300 sec: 43931.0). Total num frames: 2615148544. Throughput: 0: 43950.4. Samples: 2518033520. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 05:39:13,853][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:39:15,589][06909] Updated weights for policy 0, policy_version 159623 (0.0034) [2024-06-28 05:39:18,850][06674] Fps is (10 sec: 42607.2, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2615361536. Throughput: 0: 44028.4. Samples: 2518301280. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 05:39:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:39:19,818][06909] Updated weights for policy 0, policy_version 159633 (0.0024) [2024-06-28 05:39:22,953][06909] Updated weights for policy 0, policy_version 159643 (0.0040) [2024-06-28 05:39:23,850][06674] Fps is (10 sec: 47523.3, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2615623680. Throughput: 0: 44021.1. Samples: 2518557040. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 05:39:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:39:27,141][06909] Updated weights for policy 0, policy_version 159653 (0.0035) [2024-06-28 05:39:28,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44239.5, 300 sec: 43986.9). Total num frames: 2615820288. Throughput: 0: 44065.4. Samples: 2518699480. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 05:39:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:39:30,265][06909] Updated weights for policy 0, policy_version 159663 (0.0032) [2024-06-28 05:39:33,850][06674] Fps is (10 sec: 39321.8, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 2616016896. Throughput: 0: 44205.4. Samples: 2518963880. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 05:39:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:39:34,601][06909] Updated weights for policy 0, policy_version 159673 (0.0038) [2024-06-28 05:39:35,179][06887] Signal inference workers to stop experience collection... (35800 times) [2024-06-28 05:39:35,231][06887] Signal inference workers to resume experience collection... (35800 times) [2024-06-28 05:39:35,232][06909] InferenceWorker_p0-w0: stopping experience collection (35800 times) [2024-06-28 05:39:35,246][06909] InferenceWorker_p0-w0: resuming experience collection (35800 times) [2024-06-28 05:39:38,053][06909] Updated weights for policy 0, policy_version 159683 (0.0031) [2024-06-28 05:39:38,852][06674] Fps is (10 sec: 44227.7, 60 sec: 43964.1, 300 sec: 44097.6). Total num frames: 2616262656. Throughput: 0: 44149.1. Samples: 2519221540. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 05:39:38,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:39:42,142][06909] Updated weights for policy 0, policy_version 159693 (0.0038) [2024-06-28 05:39:43,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 43931.4). Total num frames: 2616459264. Throughput: 0: 44083.1. Samples: 2519358360. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 05:39:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:39:45,651][06909] Updated weights for policy 0, policy_version 159703 (0.0022) [2024-06-28 05:39:48,850][06674] Fps is (10 sec: 42606.8, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2616688640. Throughput: 0: 44209.0. Samples: 2519624020. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 05:39:48,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 05:39:48,868][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000159711_2616705024.pth... [2024-06-28 05:39:48,929][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000159067_2606153728.pth [2024-06-28 05:39:49,548][06909] Updated weights for policy 0, policy_version 159713 (0.0033) [2024-06-28 05:39:52,870][06909] Updated weights for policy 0, policy_version 159723 (0.0028) [2024-06-28 05:39:53,852][06674] Fps is (10 sec: 47503.7, 60 sec: 44235.3, 300 sec: 44097.6). Total num frames: 2616934400. Throughput: 0: 44332.0. Samples: 2519885440. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 05:39:53,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:39:57,033][06909] Updated weights for policy 0, policy_version 159733 (0.0033) [2024-06-28 05:39:58,850][06674] Fps is (10 sec: 44237.3, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2617131008. Throughput: 0: 44107.0. Samples: 2520018240. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 05:39:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:40:00,248][06909] Updated weights for policy 0, policy_version 159743 (0.0035) [2024-06-28 05:40:03,850][06674] Fps is (10 sec: 42606.6, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2617360384. Throughput: 0: 44127.9. Samples: 2520287040. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 05:40:03,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:40:04,252][06909] Updated weights for policy 0, policy_version 159753 (0.0036) [2024-06-28 05:40:08,173][06909] Updated weights for policy 0, policy_version 159763 (0.0026) [2024-06-28 05:40:08,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44238.2, 300 sec: 44042.7). Total num frames: 2617589760. Throughput: 0: 44312.4. Samples: 2520551100. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 05:40:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:40:11,503][06909] Updated weights for policy 0, policy_version 159773 (0.0027) [2024-06-28 05:40:13,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44238.3, 300 sec: 44042.4). Total num frames: 2617802752. Throughput: 0: 44079.4. Samples: 2520683060. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 05:40:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:40:15,455][06909] Updated weights for policy 0, policy_version 159783 (0.0034) [2024-06-28 05:40:18,850][06674] Fps is (10 sec: 44237.5, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 2618032128. Throughput: 0: 44106.7. Samples: 2520948680. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 05:40:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:40:19,170][06909] Updated weights for policy 0, policy_version 159793 (0.0030) [2024-06-28 05:40:22,724][06909] Updated weights for policy 0, policy_version 159803 (0.0041) [2024-06-28 05:40:23,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2618245120. Throughput: 0: 44277.1. Samples: 2521213920. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 05:40:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:40:26,415][06909] Updated weights for policy 0, policy_version 159813 (0.0034) [2024-06-28 05:40:28,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2618474496. Throughput: 0: 44269.3. Samples: 2521350480. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 05:40:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:40:29,900][06909] Updated weights for policy 0, policy_version 159823 (0.0030) [2024-06-28 05:40:33,850][06674] Fps is (10 sec: 44237.3, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 2618687488. Throughput: 0: 44241.5. Samples: 2521614880. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 05:40:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:40:33,951][06909] Updated weights for policy 0, policy_version 159833 (0.0044) [2024-06-28 05:40:37,413][06909] Updated weights for policy 0, policy_version 159843 (0.0033) [2024-06-28 05:40:38,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43965.3, 300 sec: 43986.9). Total num frames: 2618900480. Throughput: 0: 44200.7. Samples: 2521874380. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 05:40:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:40:41,210][06909] Updated weights for policy 0, policy_version 159853 (0.0023) [2024-06-28 05:40:43,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44782.9, 300 sec: 44042.4). Total num frames: 2619146240. Throughput: 0: 44107.1. Samples: 2522003060. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 05:40:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:40:45,451][06909] Updated weights for policy 0, policy_version 159863 (0.0031) [2024-06-28 05:40:48,658][06909] Updated weights for policy 0, policy_version 159873 (0.0032) [2024-06-28 05:40:48,850][06674] Fps is (10 sec: 45874.5, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 2619359232. Throughput: 0: 44090.6. Samples: 2522271120. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 05:40:48,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:40:52,630][06909] Updated weights for policy 0, policy_version 159883 (0.0029) [2024-06-28 05:40:53,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43692.2, 300 sec: 44042.4). Total num frames: 2619555840. Throughput: 0: 44148.6. Samples: 2522537780. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 05:40:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:40:55,267][06887] Signal inference workers to stop experience collection... (35850 times) [2024-06-28 05:40:55,287][06909] InferenceWorker_p0-w0: stopping experience collection (35850 times) [2024-06-28 05:40:55,382][06887] Signal inference workers to resume experience collection... (35850 times) [2024-06-28 05:40:55,382][06909] InferenceWorker_p0-w0: resuming experience collection (35850 times) [2024-06-28 05:40:56,213][06909] Updated weights for policy 0, policy_version 159893 (0.0047) [2024-06-28 05:40:58,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44509.7, 300 sec: 44097.9). Total num frames: 2619801600. Throughput: 0: 44049.3. Samples: 2522665280. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 05:40:58,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 05:40:59,774][06909] Updated weights for policy 0, policy_version 159903 (0.0023) [2024-06-28 05:41:03,462][06909] Updated weights for policy 0, policy_version 159913 (0.0030) [2024-06-28 05:41:03,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2620014592. Throughput: 0: 44095.5. Samples: 2522932980. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 05:41:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:41:07,122][06909] Updated weights for policy 0, policy_version 159923 (0.0037) [2024-06-28 05:41:08,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2620227584. Throughput: 0: 44115.1. Samples: 2523199100. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 05:41:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:41:11,366][06909] Updated weights for policy 0, policy_version 159933 (0.0033) [2024-06-28 05:41:13,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2620456960. Throughput: 0: 43822.6. Samples: 2523322500. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 05:41:13,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 05:41:15,153][06909] Updated weights for policy 0, policy_version 159943 (0.0023) [2024-06-28 05:41:18,656][06909] Updated weights for policy 0, policy_version 159953 (0.0044) [2024-06-28 05:41:18,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.6, 300 sec: 44097.9). Total num frames: 2620669952. Throughput: 0: 43862.1. Samples: 2523588680. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 05:41:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:41:22,793][06909] Updated weights for policy 0, policy_version 159963 (0.0021) [2024-06-28 05:41:23,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2620882944. Throughput: 0: 43958.7. Samples: 2523852520. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 05:41:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 05:41:25,840][06909] Updated weights for policy 0, policy_version 159973 (0.0032) [2024-06-28 05:41:28,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2621112320. Throughput: 0: 43978.7. Samples: 2523982100. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 05:41:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:41:29,905][06909] Updated weights for policy 0, policy_version 159983 (0.0034) [2024-06-28 05:41:33,476][06909] Updated weights for policy 0, policy_version 159993 (0.0028) [2024-06-28 05:41:33,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2621325312. Throughput: 0: 44033.5. Samples: 2524252620. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 05:41:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:41:37,286][06909] Updated weights for policy 0, policy_version 160003 (0.0033) [2024-06-28 05:41:38,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2621554688. Throughput: 0: 43967.4. Samples: 2524516320. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 05:41:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:41:40,848][06909] Updated weights for policy 0, policy_version 160013 (0.0029) [2024-06-28 05:41:43,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.7, 300 sec: 44098.9). Total num frames: 2621784064. Throughput: 0: 44142.3. Samples: 2524651680. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 05:41:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:41:45,088][06909] Updated weights for policy 0, policy_version 160023 (0.0028) [2024-06-28 05:41:48,403][06909] Updated weights for policy 0, policy_version 160033 (0.0025) [2024-06-28 05:41:48,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2621997056. Throughput: 0: 43943.0. Samples: 2524910420. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 05:41:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:41:48,873][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000160034_2621997056.pth... [2024-06-28 05:41:48,923][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000159389_2611429376.pth [2024-06-28 05:41:52,529][06909] Updated weights for policy 0, policy_version 160043 (0.0023) [2024-06-28 05:41:53,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44236.7, 300 sec: 44098.0). Total num frames: 2622210048. Throughput: 0: 44084.0. Samples: 2525182880. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 05:41:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:41:55,665][06909] Updated weights for policy 0, policy_version 160053 (0.0038) [2024-06-28 05:41:58,850][06674] Fps is (10 sec: 45875.8, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 2622455808. Throughput: 0: 44270.3. Samples: 2525314660. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 05:41:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:41:59,808][06909] Updated weights for policy 0, policy_version 160063 (0.0033) [2024-06-28 05:42:03,619][06909] Updated weights for policy 0, policy_version 160073 (0.0025) [2024-06-28 05:42:03,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2622636032. Throughput: 0: 44022.3. Samples: 2525569680. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 05:42:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:42:07,067][06909] Updated weights for policy 0, policy_version 160083 (0.0035) [2024-06-28 05:42:08,850][06674] Fps is (10 sec: 40959.2, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 2622865408. Throughput: 0: 44137.6. Samples: 2525838720. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 05:42:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:42:11,058][06909] Updated weights for policy 0, policy_version 160093 (0.0034) [2024-06-28 05:42:13,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 2623094784. Throughput: 0: 44157.3. Samples: 2525969180. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 05:42:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:42:14,305][06909] Updated weights for policy 0, policy_version 160103 (0.0032) [2024-06-28 05:42:15,819][06887] Signal inference workers to stop experience collection... (35900 times) [2024-06-28 05:42:15,871][06909] InferenceWorker_p0-w0: stopping experience collection (35900 times) [2024-06-28 05:42:15,879][06887] Signal inference workers to resume experience collection... (35900 times) [2024-06-28 05:42:15,889][06909] InferenceWorker_p0-w0: resuming experience collection (35900 times) [2024-06-28 05:42:18,318][06909] Updated weights for policy 0, policy_version 160113 (0.0030) [2024-06-28 05:42:18,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2623307776. Throughput: 0: 43959.9. Samples: 2526230820. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 05:42:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:42:22,563][06909] Updated weights for policy 0, policy_version 160123 (0.0037) [2024-06-28 05:42:23,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2623537152. Throughput: 0: 43960.0. Samples: 2526494520. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 05:42:23,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:42:25,726][06909] Updated weights for policy 0, policy_version 160133 (0.0030) [2024-06-28 05:42:28,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2623766528. Throughput: 0: 43896.9. Samples: 2526627040. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 05:42:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:42:29,954][06909] Updated weights for policy 0, policy_version 160143 (0.0035) [2024-06-28 05:42:33,497][06909] Updated weights for policy 0, policy_version 160153 (0.0034) [2024-06-28 05:42:33,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.7, 300 sec: 43987.0). Total num frames: 2623963136. Throughput: 0: 43928.6. Samples: 2526887200. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 05:42:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:42:37,185][06909] Updated weights for policy 0, policy_version 160163 (0.0036) [2024-06-28 05:42:38,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2624176128. Throughput: 0: 43833.8. Samples: 2527155400. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 05:42:38,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:42:40,971][06909] Updated weights for policy 0, policy_version 160173 (0.0028) [2024-06-28 05:42:43,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2624405504. Throughput: 0: 43773.7. Samples: 2527284480. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 05:42:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:42:44,437][06909] Updated weights for policy 0, policy_version 160183 (0.0033) [2024-06-28 05:42:48,285][06909] Updated weights for policy 0, policy_version 160193 (0.0034) [2024-06-28 05:42:48,856][06674] Fps is (10 sec: 44210.0, 60 sec: 43686.3, 300 sec: 43986.0). Total num frames: 2624618496. Throughput: 0: 43735.8. Samples: 2527538060. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 05:42:48,856][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:42:51,748][06909] Updated weights for policy 0, policy_version 160203 (0.0033) [2024-06-28 05:42:53,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43690.8, 300 sec: 43986.9). Total num frames: 2624831488. Throughput: 0: 43989.6. Samples: 2527818240. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 05:42:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 05:42:55,616][06909] Updated weights for policy 0, policy_version 160213 (0.0032) [2024-06-28 05:42:58,850][06674] Fps is (10 sec: 45902.7, 60 sec: 43690.6, 300 sec: 44153.5). Total num frames: 2625077248. Throughput: 0: 43861.6. Samples: 2527942960. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 05:42:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:42:59,853][06909] Updated weights for policy 0, policy_version 160223 (0.0025) [2024-06-28 05:43:02,971][06909] Updated weights for policy 0, policy_version 160233 (0.0033) [2024-06-28 05:43:03,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 2625257472. Throughput: 0: 43831.7. Samples: 2528203240. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 05:43:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:43:07,170][06909] Updated weights for policy 0, policy_version 160243 (0.0038) [2024-06-28 05:43:08,850][06674] Fps is (10 sec: 44237.4, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 2625519616. Throughput: 0: 43932.5. Samples: 2528471480. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 05:43:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:43:10,603][06909] Updated weights for policy 0, policy_version 160253 (0.0025) [2024-06-28 05:43:13,850][06674] Fps is (10 sec: 47513.0, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2625732608. Throughput: 0: 43902.6. Samples: 2528602660. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 05:43:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:43:14,413][06909] Updated weights for policy 0, policy_version 160263 (0.0035) [2024-06-28 05:43:18,309][06909] Updated weights for policy 0, policy_version 160273 (0.0029) [2024-06-28 05:43:18,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2625945600. Throughput: 0: 43871.5. Samples: 2528861420. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 05:43:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:43:21,630][06909] Updated weights for policy 0, policy_version 160283 (0.0036) [2024-06-28 05:43:23,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43690.7, 300 sec: 44043.0). Total num frames: 2626158592. Throughput: 0: 43830.3. Samples: 2529127760. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 05:43:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:43:25,581][06909] Updated weights for policy 0, policy_version 160293 (0.0040) [2024-06-28 05:43:28,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2626387968. Throughput: 0: 43833.8. Samples: 2529257000. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 05:43:28,850][06674] Avg episode reward: [(0, '0.514')] [2024-06-28 05:43:28,968][06887] Saving new best policy, reward=0.514! [2024-06-28 05:43:28,973][06909] Updated weights for policy 0, policy_version 160303 (0.0032) [2024-06-28 05:43:31,448][06887] Signal inference workers to stop experience collection... (35950 times) [2024-06-28 05:43:31,507][06887] Signal inference workers to resume experience collection... (35950 times) [2024-06-28 05:43:31,507][06909] InferenceWorker_p0-w0: stopping experience collection (35950 times) [2024-06-28 05:43:31,523][06909] InferenceWorker_p0-w0: resuming experience collection (35950 times) [2024-06-28 05:43:32,859][06909] Updated weights for policy 0, policy_version 160313 (0.0034) [2024-06-28 05:43:33,852][06674] Fps is (10 sec: 44227.4, 60 sec: 43962.2, 300 sec: 43986.9). Total num frames: 2626600960. Throughput: 0: 44052.0. Samples: 2529520220. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 05:43:33,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:43:36,965][06909] Updated weights for policy 0, policy_version 160323 (0.0033) [2024-06-28 05:43:38,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2626813952. Throughput: 0: 43707.9. Samples: 2529785100. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 05:43:38,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:43:40,242][06909] Updated weights for policy 0, policy_version 160333 (0.0031) [2024-06-28 05:43:43,850][06674] Fps is (10 sec: 44245.9, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2627043328. Throughput: 0: 43882.4. Samples: 2529917660. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 05:43:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:43:44,224][06909] Updated weights for policy 0, policy_version 160343 (0.0042) [2024-06-28 05:43:47,777][06909] Updated weights for policy 0, policy_version 160353 (0.0041) [2024-06-28 05:43:48,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44241.2, 300 sec: 44042.4). Total num frames: 2627272704. Throughput: 0: 43926.9. Samples: 2530179960. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 05:43:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:43:48,856][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000160356_2627272704.pth... [2024-06-28 05:43:48,905][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000159711_2616705024.pth [2024-06-28 05:43:51,542][06909] Updated weights for policy 0, policy_version 160363 (0.0041) [2024-06-28 05:43:53,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44236.6, 300 sec: 44097.9). Total num frames: 2627485696. Throughput: 0: 43912.8. Samples: 2530447560. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 05:43:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:43:55,270][06909] Updated weights for policy 0, policy_version 160373 (0.0023) [2024-06-28 05:43:58,685][06909] Updated weights for policy 0, policy_version 160383 (0.0026) [2024-06-28 05:43:58,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2627715072. Throughput: 0: 43947.6. Samples: 2530580300. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 05:43:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:44:02,605][06909] Updated weights for policy 0, policy_version 160393 (0.0030) [2024-06-28 05:44:03,850][06674] Fps is (10 sec: 42598.7, 60 sec: 44236.7, 300 sec: 43987.2). Total num frames: 2627911680. Throughput: 0: 44100.0. Samples: 2530845920. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 05:44:03,850][06674] Avg episode reward: [(0, '0.403')] [2024-06-28 05:44:06,069][06909] Updated weights for policy 0, policy_version 160403 (0.0037) [2024-06-28 05:44:08,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 44098.3). Total num frames: 2628157440. Throughput: 0: 44152.8. Samples: 2531114640. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 05:44:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:44:09,781][06909] Updated weights for policy 0, policy_version 160413 (0.0031) [2024-06-28 05:44:13,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43690.8, 300 sec: 44042.4). Total num frames: 2628354048. Throughput: 0: 44135.6. Samples: 2531243100. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 05:44:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:44:14,125][06909] Updated weights for policy 0, policy_version 160423 (0.0024) [2024-06-28 05:44:17,434][06909] Updated weights for policy 0, policy_version 160433 (0.0031) [2024-06-28 05:44:18,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2628599808. Throughput: 0: 44144.6. Samples: 2531506640. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 05:44:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:44:21,432][06909] Updated weights for policy 0, policy_version 160443 (0.0022) [2024-06-28 05:44:23,850][06674] Fps is (10 sec: 45874.4, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2628812800. Throughput: 0: 44044.4. Samples: 2531767100. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 05:44:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:44:24,580][06909] Updated weights for policy 0, policy_version 160453 (0.0042) [2024-06-28 05:44:28,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2629009408. Throughput: 0: 44133.3. Samples: 2531903660. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 05:44:28,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:44:28,916][06909] Updated weights for policy 0, policy_version 160463 (0.0027) [2024-06-28 05:44:32,128][06909] Updated weights for policy 0, policy_version 160473 (0.0036) [2024-06-28 05:44:33,850][06674] Fps is (10 sec: 44237.5, 60 sec: 44238.3, 300 sec: 44042.7). Total num frames: 2629255168. Throughput: 0: 44307.3. Samples: 2532173780. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 05:44:33,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 05:44:36,330][06909] Updated weights for policy 0, policy_version 160483 (0.0032) [2024-06-28 05:44:38,850][06674] Fps is (10 sec: 47513.9, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 2629484544. Throughput: 0: 44193.9. Samples: 2532436280. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 05:44:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:44:39,437][06909] Updated weights for policy 0, policy_version 160493 (0.0030) [2024-06-28 05:44:43,731][06909] Updated weights for policy 0, policy_version 160503 (0.0035) [2024-06-28 05:44:43,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2629681152. Throughput: 0: 44233.8. Samples: 2532570820. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 05:44:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:44:46,819][06909] Updated weights for policy 0, policy_version 160513 (0.0035) [2024-06-28 05:44:48,850][06674] Fps is (10 sec: 44236.0, 60 sec: 44236.8, 300 sec: 44042.7). Total num frames: 2629926912. Throughput: 0: 44231.4. Samples: 2532836340. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 05:44:48,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:44:51,089][06909] Updated weights for policy 0, policy_version 160523 (0.0031) [2024-06-28 05:44:53,852][06674] Fps is (10 sec: 45865.6, 60 sec: 44235.4, 300 sec: 44097.6). Total num frames: 2630139904. Throughput: 0: 44089.5. Samples: 2533098760. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 05:44:53,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:44:54,288][06909] Updated weights for policy 0, policy_version 160533 (0.0035) [2024-06-28 05:44:58,814][06909] Updated weights for policy 0, policy_version 160543 (0.0035) [2024-06-28 05:44:58,850][06674] Fps is (10 sec: 40961.0, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2630336512. Throughput: 0: 44195.6. Samples: 2533231900. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 05:44:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 05:44:58,958][06887] Signal inference workers to stop experience collection... (36000 times) [2024-06-28 05:44:58,959][06887] Signal inference workers to resume experience collection... (36000 times) [2024-06-28 05:44:58,999][06909] InferenceWorker_p0-w0: stopping experience collection (36000 times) [2024-06-28 05:44:58,999][06909] InferenceWorker_p0-w0: resuming experience collection (36000 times) [2024-06-28 05:45:02,042][06909] Updated weights for policy 0, policy_version 160553 (0.0040) [2024-06-28 05:45:03,850][06674] Fps is (10 sec: 44245.4, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 2630582272. Throughput: 0: 44085.8. Samples: 2533490500. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 05:45:03,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:45:06,127][06909] Updated weights for policy 0, policy_version 160563 (0.0034) [2024-06-28 05:45:08,850][06674] Fps is (10 sec: 47513.2, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2630811648. Throughput: 0: 44234.8. Samples: 2533757660. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 05:45:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:45:09,319][06909] Updated weights for policy 0, policy_version 160573 (0.0032) [2024-06-28 05:45:13,635][06909] Updated weights for policy 0, policy_version 160583 (0.0031) [2024-06-28 05:45:13,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43963.6, 300 sec: 43931.3). Total num frames: 2630991872. Throughput: 0: 44296.4. Samples: 2533897000. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 05:45:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:45:16,562][06909] Updated weights for policy 0, policy_version 160593 (0.0027) [2024-06-28 05:45:18,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2631237632. Throughput: 0: 44180.9. Samples: 2534161920. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 05:45:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:45:21,114][06909] Updated weights for policy 0, policy_version 160603 (0.0033) [2024-06-28 05:45:23,815][06909] Updated weights for policy 0, policy_version 160613 (0.0023) [2024-06-28 05:45:23,850][06674] Fps is (10 sec: 49152.7, 60 sec: 44510.0, 300 sec: 44098.0). Total num frames: 2631483392. Throughput: 0: 43979.6. Samples: 2534415360. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 05:45:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:45:28,495][06909] Updated weights for policy 0, policy_version 160623 (0.0030) [2024-06-28 05:45:28,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 2631680000. Throughput: 0: 44012.3. Samples: 2534551380. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 05:45:28,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:45:31,435][06909] Updated weights for policy 0, policy_version 160633 (0.0028) [2024-06-28 05:45:33,852][06674] Fps is (10 sec: 40951.4, 60 sec: 43962.2, 300 sec: 44042.1). Total num frames: 2631892992. Throughput: 0: 44010.6. Samples: 2534816900. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 05:45:33,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:45:35,895][06909] Updated weights for policy 0, policy_version 160643 (0.0037) [2024-06-28 05:45:38,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2632122368. Throughput: 0: 43985.6. Samples: 2535078020. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2024-06-28 05:45:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:45:38,887][06909] Updated weights for policy 0, policy_version 160653 (0.0038) [2024-06-28 05:45:43,204][06909] Updated weights for policy 0, policy_version 160663 (0.0039) [2024-06-28 05:45:43,852][06674] Fps is (10 sec: 42598.4, 60 sec: 43962.2, 300 sec: 43931.1). Total num frames: 2632318976. Throughput: 0: 44229.5. Samples: 2535222320. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2024-06-28 05:45:43,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:45:46,431][06909] Updated weights for policy 0, policy_version 160673 (0.0030) [2024-06-28 05:45:48,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.8, 300 sec: 44042.4). Total num frames: 2632548352. Throughput: 0: 44339.2. Samples: 2535485760. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2024-06-28 05:45:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:45:48,862][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000160679_2632564736.pth... [2024-06-28 05:45:48,908][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000160034_2621997056.pth [2024-06-28 05:45:50,419][06909] Updated weights for policy 0, policy_version 160683 (0.0022) [2024-06-28 05:45:53,649][06909] Updated weights for policy 0, policy_version 160693 (0.0044) [2024-06-28 05:45:53,850][06674] Fps is (10 sec: 47523.2, 60 sec: 44238.3, 300 sec: 44042.4). Total num frames: 2632794112. Throughput: 0: 44269.8. Samples: 2535749800. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2024-06-28 05:45:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:45:58,068][06909] Updated weights for policy 0, policy_version 160703 (0.0035) [2024-06-28 05:45:58,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 2633007104. Throughput: 0: 44213.0. Samples: 2535886580. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2024-06-28 05:45:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:46:00,802][06909] Updated weights for policy 0, policy_version 160713 (0.0032) [2024-06-28 05:46:03,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2633203712. Throughput: 0: 44079.1. Samples: 2536145480. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2024-06-28 05:46:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:46:04,716][06887] Signal inference workers to stop experience collection... (36050 times) [2024-06-28 05:46:04,716][06887] Signal inference workers to resume experience collection... (36050 times) [2024-06-28 05:46:04,740][06909] InferenceWorker_p0-w0: stopping experience collection (36050 times) [2024-06-28 05:46:04,741][06909] InferenceWorker_p0-w0: resuming experience collection (36050 times) [2024-06-28 05:46:05,263][06909] Updated weights for policy 0, policy_version 160723 (0.0031) [2024-06-28 05:46:08,566][06909] Updated weights for policy 0, policy_version 160733 (0.0035) [2024-06-28 05:46:08,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2633449472. Throughput: 0: 44290.6. Samples: 2536408440. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2024-06-28 05:46:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:46:12,722][06909] Updated weights for policy 0, policy_version 160743 (0.0027) [2024-06-28 05:46:13,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 2633662464. Throughput: 0: 44354.3. Samples: 2536547320. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2024-06-28 05:46:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:46:16,173][06909] Updated weights for policy 0, policy_version 160753 (0.0039) [2024-06-28 05:46:18,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 2633875456. Throughput: 0: 44181.9. Samples: 2536805000. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2024-06-28 05:46:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:46:20,372][06909] Updated weights for policy 0, policy_version 160763 (0.0033) [2024-06-28 05:46:23,341][06909] Updated weights for policy 0, policy_version 160773 (0.0025) [2024-06-28 05:46:23,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2634121216. Throughput: 0: 44383.6. Samples: 2537075280. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2024-06-28 05:46:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:46:27,655][06909] Updated weights for policy 0, policy_version 160783 (0.0034) [2024-06-28 05:46:28,850][06674] Fps is (10 sec: 47513.9, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 2634350592. Throughput: 0: 44215.3. Samples: 2537211920. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2024-06-28 05:46:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:46:30,633][06909] Updated weights for policy 0, policy_version 160793 (0.0028) [2024-06-28 05:46:33,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43965.2, 300 sec: 43986.9). Total num frames: 2634530816. Throughput: 0: 44128.4. Samples: 2537471540. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2024-06-28 05:46:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:46:35,117][06909] Updated weights for policy 0, policy_version 160803 (0.0022) [2024-06-28 05:46:38,178][06909] Updated weights for policy 0, policy_version 160813 (0.0035) [2024-06-28 05:46:38,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44509.8, 300 sec: 44098.0). Total num frames: 2634792960. Throughput: 0: 44029.8. Samples: 2537731140. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2024-06-28 05:46:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:46:42,553][06909] Updated weights for policy 0, policy_version 160823 (0.0032) [2024-06-28 05:46:43,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44511.4, 300 sec: 44042.4). Total num frames: 2634989568. Throughput: 0: 44196.0. Samples: 2537875400. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 05:46:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:46:45,826][06909] Updated weights for policy 0, policy_version 160833 (0.0032) [2024-06-28 05:46:48,850][06674] Fps is (10 sec: 37683.3, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 2635169792. Throughput: 0: 43986.7. Samples: 2538124880. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 05:46:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:46:50,002][06909] Updated weights for policy 0, policy_version 160843 (0.0039) [2024-06-28 05:46:53,437][06909] Updated weights for policy 0, policy_version 160853 (0.0033) [2024-06-28 05:46:53,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2635448320. Throughput: 0: 44022.7. Samples: 2538389460. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 05:46:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:46:57,375][06909] Updated weights for policy 0, policy_version 160863 (0.0037) [2024-06-28 05:46:58,850][06674] Fps is (10 sec: 47513.5, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2635644928. Throughput: 0: 43985.8. Samples: 2538526680. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 05:46:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:47:00,822][06909] Updated weights for policy 0, policy_version 160873 (0.0029) [2024-06-28 05:47:03,853][06674] Fps is (10 sec: 40948.4, 60 sec: 44234.7, 300 sec: 44042.0). Total num frames: 2635857920. Throughput: 0: 43993.3. Samples: 2538784820. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 05:47:03,853][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 05:47:04,927][06909] Updated weights for policy 0, policy_version 160883 (0.0035) [2024-06-28 05:47:08,121][06909] Updated weights for policy 0, policy_version 160893 (0.0039) [2024-06-28 05:47:08,851][06674] Fps is (10 sec: 42593.0, 60 sec: 43689.8, 300 sec: 43986.7). Total num frames: 2636070912. Throughput: 0: 43779.1. Samples: 2539045400. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 05:47:08,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:47:12,208][06887] Signal inference workers to stop experience collection... (36100 times) [2024-06-28 05:47:12,208][06887] Signal inference workers to resume experience collection... (36100 times) [2024-06-28 05:47:12,237][06909] InferenceWorker_p0-w0: stopping experience collection (36100 times) [2024-06-28 05:47:12,238][06909] InferenceWorker_p0-w0: resuming experience collection (36100 times) [2024-06-28 05:47:12,341][06909] Updated weights for policy 0, policy_version 160903 (0.0039) [2024-06-28 05:47:13,850][06674] Fps is (10 sec: 42610.6, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2636283904. Throughput: 0: 43784.9. Samples: 2539182240. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 05:47:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:47:15,433][06909] Updated weights for policy 0, policy_version 160913 (0.0036) [2024-06-28 05:47:18,850][06674] Fps is (10 sec: 40965.1, 60 sec: 43417.7, 300 sec: 43875.8). Total num frames: 2636480512. Throughput: 0: 43682.6. Samples: 2539437260. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 05:47:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 05:47:19,911][06909] Updated weights for policy 0, policy_version 160923 (0.0037) [2024-06-28 05:47:23,265][06909] Updated weights for policy 0, policy_version 160933 (0.0035) [2024-06-28 05:47:23,850][06674] Fps is (10 sec: 47513.8, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2636759040. Throughput: 0: 43758.3. Samples: 2539700260. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 05:47:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:47:27,261][06909] Updated weights for policy 0, policy_version 160943 (0.0034) [2024-06-28 05:47:28,850][06674] Fps is (10 sec: 47513.4, 60 sec: 43417.6, 300 sec: 44042.4). Total num frames: 2636955648. Throughput: 0: 43624.3. Samples: 2539838500. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 05:47:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:47:30,666][06909] Updated weights for policy 0, policy_version 160953 (0.0024) [2024-06-28 05:47:33,852][06674] Fps is (10 sec: 40949.4, 60 sec: 43961.9, 300 sec: 44042.0). Total num frames: 2637168640. Throughput: 0: 43908.6. Samples: 2540100880. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 05:47:33,853][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:47:34,810][06909] Updated weights for policy 0, policy_version 160963 (0.0028) [2024-06-28 05:47:37,989][06909] Updated weights for policy 0, policy_version 160973 (0.0038) [2024-06-28 05:47:38,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43690.6, 300 sec: 44097.9). Total num frames: 2637414400. Throughput: 0: 43975.5. Samples: 2540368360. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 05:47:38,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:47:42,232][06909] Updated weights for policy 0, policy_version 160983 (0.0049) [2024-06-28 05:47:43,850][06674] Fps is (10 sec: 45886.9, 60 sec: 43963.7, 300 sec: 44098.9). Total num frames: 2637627392. Throughput: 0: 43985.4. Samples: 2540506020. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 05:47:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:47:45,402][06909] Updated weights for policy 0, policy_version 160993 (0.0026) [2024-06-28 05:47:48,850][06674] Fps is (10 sec: 40959.9, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2637824000. Throughput: 0: 43891.1. Samples: 2540759800. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 05:47:48,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:47:48,867][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000161000_2637824000.pth... [2024-06-28 05:47:48,917][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000160356_2627272704.pth [2024-06-28 05:47:49,496][06909] Updated weights for policy 0, policy_version 161003 (0.0029) [2024-06-28 05:47:52,808][06909] Updated weights for policy 0, policy_version 161013 (0.0043) [2024-06-28 05:47:53,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2638069760. Throughput: 0: 44013.3. Samples: 2541025940. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 05:47:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:47:57,078][06909] Updated weights for policy 0, policy_version 161023 (0.0028) [2024-06-28 05:47:58,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2638282752. Throughput: 0: 44039.1. Samples: 2541164000. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 05:47:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:48:00,298][06909] Updated weights for policy 0, policy_version 161033 (0.0028) [2024-06-28 05:48:03,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43965.8, 300 sec: 43986.9). Total num frames: 2638495744. Throughput: 0: 44100.0. Samples: 2541421760. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 05:48:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:48:04,425][06909] Updated weights for policy 0, policy_version 161043 (0.0024) [2024-06-28 05:48:07,635][06909] Updated weights for policy 0, policy_version 161053 (0.0030) [2024-06-28 05:48:08,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44237.7, 300 sec: 44042.4). Total num frames: 2638725120. Throughput: 0: 44256.7. Samples: 2541691820. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 05:48:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:48:11,837][06909] Updated weights for policy 0, policy_version 161063 (0.0035) [2024-06-28 05:48:13,850][06674] Fps is (10 sec: 45875.7, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 2638954496. Throughput: 0: 44301.4. Samples: 2541832060. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 05:48:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:48:15,142][06909] Updated weights for policy 0, policy_version 161073 (0.0025) [2024-06-28 05:48:18,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 2639151104. Throughput: 0: 44318.9. Samples: 2542095120. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 05:48:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:48:19,069][06909] Updated weights for policy 0, policy_version 161083 (0.0031) [2024-06-28 05:48:22,322][06909] Updated weights for policy 0, policy_version 161093 (0.0031) [2024-06-28 05:48:23,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2639380480. Throughput: 0: 44176.6. Samples: 2542356300. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 05:48:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:48:26,357][06909] Updated weights for policy 0, policy_version 161103 (0.0042) [2024-06-28 05:48:28,850][06674] Fps is (10 sec: 47513.9, 60 sec: 44509.9, 300 sec: 44153.8). Total num frames: 2639626240. Throughput: 0: 44154.6. Samples: 2542492980. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 05:48:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:48:29,604][06909] Updated weights for policy 0, policy_version 161113 (0.0038) [2024-06-28 05:48:33,850][06674] Fps is (10 sec: 44236.2, 60 sec: 44238.6, 300 sec: 44098.0). Total num frames: 2639822848. Throughput: 0: 44361.8. Samples: 2542756080. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 05:48:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:48:34,140][06909] Updated weights for policy 0, policy_version 161123 (0.0027) [2024-06-28 05:48:37,309][06909] Updated weights for policy 0, policy_version 161133 (0.0036) [2024-06-28 05:48:38,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2640068608. Throughput: 0: 44277.2. Samples: 2543018420. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 05:48:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:48:40,188][06887] Signal inference workers to stop experience collection... (36150 times) [2024-06-28 05:48:40,202][06909] InferenceWorker_p0-w0: stopping experience collection (36150 times) [2024-06-28 05:48:40,249][06887] Signal inference workers to resume experience collection... (36150 times) [2024-06-28 05:48:40,249][06909] InferenceWorker_p0-w0: resuming experience collection (36150 times) [2024-06-28 05:48:41,308][06909] Updated weights for policy 0, policy_version 161143 (0.0030) [2024-06-28 05:48:43,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.7, 300 sec: 44098.0). Total num frames: 2640281600. Throughput: 0: 44337.3. Samples: 2543159180. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 05:48:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:48:44,575][06909] Updated weights for policy 0, policy_version 161153 (0.0027) [2024-06-28 05:48:48,526][06909] Updated weights for policy 0, policy_version 161163 (0.0026) [2024-06-28 05:48:48,850][06674] Fps is (10 sec: 42598.9, 60 sec: 44510.0, 300 sec: 44098.0). Total num frames: 2640494592. Throughput: 0: 44605.9. Samples: 2543429020. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 05:48:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:48:52,094][06909] Updated weights for policy 0, policy_version 161173 (0.0046) [2024-06-28 05:48:53,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2640723968. Throughput: 0: 44372.5. Samples: 2543688580. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 05:48:53,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:48:55,983][06909] Updated weights for policy 0, policy_version 161183 (0.0036) [2024-06-28 05:48:58,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44509.9, 300 sec: 44209.0). Total num frames: 2640953344. Throughput: 0: 44127.1. Samples: 2543817780. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 05:48:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:48:59,393][06909] Updated weights for policy 0, policy_version 161193 (0.0032) [2024-06-28 05:49:03,720][06909] Updated weights for policy 0, policy_version 161203 (0.0034) [2024-06-28 05:49:03,852][06674] Fps is (10 sec: 42590.0, 60 sec: 44235.3, 300 sec: 44042.1). Total num frames: 2641149952. Throughput: 0: 44404.7. Samples: 2544093420. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 05:49:03,853][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:49:06,732][06909] Updated weights for policy 0, policy_version 161213 (0.0035) [2024-06-28 05:49:08,850][06674] Fps is (10 sec: 42598.2, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 2641379328. Throughput: 0: 44228.8. Samples: 2544346600. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 05:49:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:49:11,023][06909] Updated weights for policy 0, policy_version 161223 (0.0032) [2024-06-28 05:49:13,850][06674] Fps is (10 sec: 45884.8, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2641608704. Throughput: 0: 44298.7. Samples: 2544486420. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 05:49:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:49:14,172][06909] Updated weights for policy 0, policy_version 161233 (0.0031) [2024-06-28 05:49:18,363][06909] Updated weights for policy 0, policy_version 161243 (0.0024) [2024-06-28 05:49:18,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44509.8, 300 sec: 44098.0). Total num frames: 2641821696. Throughput: 0: 44276.8. Samples: 2544748540. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 05:49:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:49:21,817][06909] Updated weights for policy 0, policy_version 161253 (0.0026) [2024-06-28 05:49:23,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2642034688. Throughput: 0: 44234.4. Samples: 2545008960. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 05:49:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:49:25,814][06909] Updated weights for policy 0, policy_version 161263 (0.0032) [2024-06-28 05:49:28,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2642264064. Throughput: 0: 44072.0. Samples: 2545142420. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 05:49:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:49:29,214][06909] Updated weights for policy 0, policy_version 161273 (0.0042) [2024-06-28 05:49:33,165][06909] Updated weights for policy 0, policy_version 161283 (0.0025) [2024-06-28 05:49:33,850][06674] Fps is (10 sec: 45874.5, 60 sec: 44509.8, 300 sec: 44097.9). Total num frames: 2642493440. Throughput: 0: 44016.3. Samples: 2545409760. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 05:49:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 05:49:36,589][06909] Updated weights for policy 0, policy_version 161293 (0.0030) [2024-06-28 05:49:38,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.7, 300 sec: 44097.9). Total num frames: 2642690048. Throughput: 0: 44051.6. Samples: 2545670900. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 05:49:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:49:41,090][06909] Updated weights for policy 0, policy_version 161303 (0.0029) [2024-06-28 05:49:43,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2642935808. Throughput: 0: 44152.3. Samples: 2545804640. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 05:49:43,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:49:44,193][06909] Updated weights for policy 0, policy_version 161313 (0.0039) [2024-06-28 05:49:48,258][06909] Updated weights for policy 0, policy_version 161323 (0.0030) [2024-06-28 05:49:48,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.7, 300 sec: 44098.2). Total num frames: 2643148800. Throughput: 0: 44063.7. Samples: 2546076200. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 05:49:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:49:48,859][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000161325_2643148800.pth... [2024-06-28 05:49:48,921][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000160679_2632564736.pth [2024-06-28 05:49:51,459][06909] Updated weights for policy 0, policy_version 161333 (0.0023) [2024-06-28 05:49:53,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.6, 300 sec: 44097.9). Total num frames: 2643345408. Throughput: 0: 44184.8. Samples: 2546334920. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 05:49:53,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:49:55,597][06909] Updated weights for policy 0, policy_version 161343 (0.0032) [2024-06-28 05:49:56,031][06887] Signal inference workers to stop experience collection... (36200 times) [2024-06-28 05:49:56,032][06887] Signal inference workers to resume experience collection... (36200 times) [2024-06-28 05:49:56,053][06909] InferenceWorker_p0-w0: stopping experience collection (36200 times) [2024-06-28 05:49:56,053][06909] InferenceWorker_p0-w0: resuming experience collection (36200 times) [2024-06-28 05:49:58,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2643591168. Throughput: 0: 43925.8. Samples: 2546463080. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2024-06-28 05:49:58,857][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:49:59,209][06909] Updated weights for policy 0, policy_version 161353 (0.0032) [2024-06-28 05:50:02,867][06909] Updated weights for policy 0, policy_version 161363 (0.0039) [2024-06-28 05:50:03,850][06674] Fps is (10 sec: 49153.0, 60 sec: 44784.5, 300 sec: 44153.5). Total num frames: 2643836928. Throughput: 0: 44195.7. Samples: 2546737340. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2024-06-28 05:50:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:50:06,757][06909] Updated weights for policy 0, policy_version 161373 (0.0025) [2024-06-28 05:50:08,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 2644000768. Throughput: 0: 44160.4. Samples: 2546996180. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2024-06-28 05:50:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:50:10,406][06909] Updated weights for policy 0, policy_version 161383 (0.0032) [2024-06-28 05:50:13,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2644246528. Throughput: 0: 44052.0. Samples: 2547124760. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2024-06-28 05:50:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:50:13,996][06909] Updated weights for policy 0, policy_version 161393 (0.0026) [2024-06-28 05:50:17,698][06909] Updated weights for policy 0, policy_version 161403 (0.0024) [2024-06-28 05:50:18,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 2644475904. Throughput: 0: 44177.4. Samples: 2547397740. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2024-06-28 05:50:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:50:21,231][06909] Updated weights for policy 0, policy_version 161413 (0.0024) [2024-06-28 05:50:23,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2644656128. Throughput: 0: 44151.2. Samples: 2547657700. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2024-06-28 05:50:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:50:25,356][06909] Updated weights for policy 0, policy_version 161423 (0.0036) [2024-06-28 05:50:28,562][06909] Updated weights for policy 0, policy_version 161433 (0.0029) [2024-06-28 05:50:28,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.9, 300 sec: 44153.8). Total num frames: 2644918272. Throughput: 0: 43972.5. Samples: 2547783400. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2024-06-28 05:50:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:50:32,621][06909] Updated weights for policy 0, policy_version 161443 (0.0028) [2024-06-28 05:50:33,850][06674] Fps is (10 sec: 49151.9, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 2645147648. Throughput: 0: 43999.6. Samples: 2548056180. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2024-06-28 05:50:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:50:36,252][06909] Updated weights for policy 0, policy_version 161453 (0.0032) [2024-06-28 05:50:38,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43963.7, 300 sec: 44098.2). Total num frames: 2645327872. Throughput: 0: 44211.6. Samples: 2548324440. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2024-06-28 05:50:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:50:40,094][06909] Updated weights for policy 0, policy_version 161463 (0.0021) [2024-06-28 05:50:43,796][06909] Updated weights for policy 0, policy_version 161473 (0.0026) [2024-06-28 05:50:43,852][06674] Fps is (10 sec: 42587.7, 60 sec: 43962.0, 300 sec: 44153.1). Total num frames: 2645573632. Throughput: 0: 44200.2. Samples: 2548452200. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2024-06-28 05:50:43,853][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 05:50:47,545][06909] Updated weights for policy 0, policy_version 161483 (0.0035) [2024-06-28 05:50:48,850][06674] Fps is (10 sec: 47513.9, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2645803008. Throughput: 0: 44052.4. Samples: 2548719700. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2024-06-28 05:50:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:50:51,163][06909] Updated weights for policy 0, policy_version 161493 (0.0032) [2024-06-28 05:50:53,850][06674] Fps is (10 sec: 40969.5, 60 sec: 43963.7, 300 sec: 43986.8). Total num frames: 2645983232. Throughput: 0: 44022.1. Samples: 2548977180. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2024-06-28 05:50:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 05:50:55,062][06909] Updated weights for policy 0, policy_version 161503 (0.0031) [2024-06-28 05:50:58,497][06909] Updated weights for policy 0, policy_version 161513 (0.0030) [2024-06-28 05:50:58,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2646228992. Throughput: 0: 43925.9. Samples: 2549101420. Policy #0 lag: (min: 0.0, avg: 8.9, max: 23.0) [2024-06-28 05:50:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:51:02,706][06909] Updated weights for policy 0, policy_version 161523 (0.0035) [2024-06-28 05:51:03,850][06674] Fps is (10 sec: 47514.7, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 2646458368. Throughput: 0: 43820.1. Samples: 2549369640. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 05:51:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 05:51:04,416][06887] Signal inference workers to stop experience collection... (36250 times) [2024-06-28 05:51:04,417][06887] Signal inference workers to resume experience collection... (36250 times) [2024-06-28 05:51:04,455][06909] InferenceWorker_p0-w0: stopping experience collection (36250 times) [2024-06-28 05:51:04,455][06909] InferenceWorker_p0-w0: resuming experience collection (36250 times) [2024-06-28 05:51:06,063][06909] Updated weights for policy 0, policy_version 161533 (0.0039) [2024-06-28 05:51:08,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2646638592. Throughput: 0: 44091.1. Samples: 2549641800. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 05:51:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 05:51:10,075][06909] Updated weights for policy 0, policy_version 161543 (0.0033) [2024-06-28 05:51:13,565][06909] Updated weights for policy 0, policy_version 161553 (0.0031) [2024-06-28 05:51:13,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2646884352. Throughput: 0: 43910.7. Samples: 2549759380. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 05:51:13,857][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:51:17,681][06909] Updated weights for policy 0, policy_version 161563 (0.0036) [2024-06-28 05:51:18,850][06674] Fps is (10 sec: 47513.7, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2647113728. Throughput: 0: 43968.5. Samples: 2550034760. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 05:51:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:51:20,887][06909] Updated weights for policy 0, policy_version 161573 (0.0042) [2024-06-28 05:51:23,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 2647293952. Throughput: 0: 43830.8. Samples: 2550296820. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 05:51:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 05:51:24,938][06909] Updated weights for policy 0, policy_version 161583 (0.0032) [2024-06-28 05:51:28,323][06909] Updated weights for policy 0, policy_version 161593 (0.0042) [2024-06-28 05:51:28,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2647556096. Throughput: 0: 43637.1. Samples: 2550415760. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 05:51:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:51:32,517][06909] Updated weights for policy 0, policy_version 161603 (0.0029) [2024-06-28 05:51:33,850][06674] Fps is (10 sec: 47513.6, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2647769088. Throughput: 0: 43797.4. Samples: 2550690580. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 05:51:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 05:51:35,698][06909] Updated weights for policy 0, policy_version 161613 (0.0022) [2024-06-28 05:51:38,850][06674] Fps is (10 sec: 39321.6, 60 sec: 43690.8, 300 sec: 43931.3). Total num frames: 2647949312. Throughput: 0: 43888.2. Samples: 2550952140. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 05:51:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:51:39,986][06909] Updated weights for policy 0, policy_version 161623 (0.0034) [2024-06-28 05:51:43,077][06909] Updated weights for policy 0, policy_version 161633 (0.0031) [2024-06-28 05:51:43,850][06674] Fps is (10 sec: 45874.6, 60 sec: 44238.6, 300 sec: 44264.6). Total num frames: 2648227840. Throughput: 0: 43941.7. Samples: 2551078800. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 05:51:43,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:51:47,464][06909] Updated weights for policy 0, policy_version 161643 (0.0039) [2024-06-28 05:51:48,850][06674] Fps is (10 sec: 49152.0, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2648440832. Throughput: 0: 44145.7. Samples: 2551356200. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 05:51:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:51:48,929][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000161649_2648457216.pth... [2024-06-28 05:51:48,998][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000161000_2637824000.pth [2024-06-28 05:51:50,366][06909] Updated weights for policy 0, policy_version 161653 (0.0030) [2024-06-28 05:51:53,856][06674] Fps is (10 sec: 39298.3, 60 sec: 43959.4, 300 sec: 43986.0). Total num frames: 2648621056. Throughput: 0: 44030.5. Samples: 2551623440. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 05:51:53,856][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:51:54,718][06909] Updated weights for policy 0, policy_version 161663 (0.0042) [2024-06-28 05:51:57,917][06909] Updated weights for policy 0, policy_version 161673 (0.0038) [2024-06-28 05:51:58,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.7, 300 sec: 44153.9). Total num frames: 2648883200. Throughput: 0: 44153.2. Samples: 2551746280. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 05:51:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:52:02,112][06909] Updated weights for policy 0, policy_version 161683 (0.0038) [2024-06-28 05:52:03,852][06674] Fps is (10 sec: 47532.3, 60 sec: 43962.2, 300 sec: 44153.4). Total num frames: 2649096192. Throughput: 0: 44063.7. Samples: 2552017720. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 05:52:03,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:52:05,322][06909] Updated weights for policy 0, policy_version 161693 (0.0028) [2024-06-28 05:52:08,850][06674] Fps is (10 sec: 37683.5, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2649260032. Throughput: 0: 44117.3. Samples: 2552282100. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 05:52:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:52:09,788][06909] Updated weights for policy 0, policy_version 161703 (0.0038) [2024-06-28 05:52:12,625][06909] Updated weights for policy 0, policy_version 161713 (0.0039) [2024-06-28 05:52:13,850][06674] Fps is (10 sec: 44246.0, 60 sec: 44236.8, 300 sec: 44264.6). Total num frames: 2649538560. Throughput: 0: 44227.5. Samples: 2552406000. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 05:52:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:52:17,298][06909] Updated weights for policy 0, policy_version 161723 (0.0034) [2024-06-28 05:52:17,385][06887] Signal inference workers to stop experience collection... (36300 times) [2024-06-28 05:52:17,420][06909] InferenceWorker_p0-w0: stopping experience collection (36300 times) [2024-06-28 05:52:17,443][06887] Signal inference workers to resume experience collection... (36300 times) [2024-06-28 05:52:17,444][06909] InferenceWorker_p0-w0: resuming experience collection (36300 times) [2024-06-28 05:52:18,850][06674] Fps is (10 sec: 49152.0, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2649751552. Throughput: 0: 44009.8. Samples: 2552671020. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 05:52:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:52:20,237][06909] Updated weights for policy 0, policy_version 161733 (0.0034) [2024-06-28 05:52:23,850][06674] Fps is (10 sec: 40959.5, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2649948160. Throughput: 0: 44300.3. Samples: 2552945660. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 05:52:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:52:24,461][06909] Updated weights for policy 0, policy_version 161743 (0.0035) [2024-06-28 05:52:27,528][06909] Updated weights for policy 0, policy_version 161753 (0.0034) [2024-06-28 05:52:28,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.8, 300 sec: 44209.4). Total num frames: 2650210304. Throughput: 0: 44288.6. Samples: 2553071780. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 05:52:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:52:31,778][06909] Updated weights for policy 0, policy_version 161763 (0.0031) [2024-06-28 05:52:33,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 2650406912. Throughput: 0: 44043.9. Samples: 2553338180. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 05:52:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:52:34,812][06909] Updated weights for policy 0, policy_version 161773 (0.0027) [2024-06-28 05:52:38,850][06674] Fps is (10 sec: 40959.6, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 2650619904. Throughput: 0: 44204.1. Samples: 2553612360. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 05:52:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:52:39,194][06909] Updated weights for policy 0, policy_version 161783 (0.0035) [2024-06-28 05:52:42,260][06909] Updated weights for policy 0, policy_version 161793 (0.0033) [2024-06-28 05:52:43,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.8, 300 sec: 44209.0). Total num frames: 2650865664. Throughput: 0: 44106.7. Samples: 2553731080. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 05:52:43,853][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:52:46,806][06909] Updated weights for policy 0, policy_version 161803 (0.0036) [2024-06-28 05:52:48,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 2651095040. Throughput: 0: 43969.5. Samples: 2553996260. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 05:52:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:52:49,608][06909] Updated weights for policy 0, policy_version 161813 (0.0036) [2024-06-28 05:52:53,850][06674] Fps is (10 sec: 39321.4, 60 sec: 43968.1, 300 sec: 43986.9). Total num frames: 2651258880. Throughput: 0: 44128.3. Samples: 2554267880. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 05:52:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:52:54,248][06909] Updated weights for policy 0, policy_version 161823 (0.0046) [2024-06-28 05:52:57,220][06909] Updated weights for policy 0, policy_version 161833 (0.0027) [2024-06-28 05:52:58,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2651521024. Throughput: 0: 44087.9. Samples: 2554389960. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 05:52:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 05:53:01,756][06909] Updated weights for policy 0, policy_version 161843 (0.0030) [2024-06-28 05:53:03,850][06674] Fps is (10 sec: 49152.3, 60 sec: 44238.3, 300 sec: 44153.5). Total num frames: 2651750400. Throughput: 0: 44065.7. Samples: 2554653980. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 05:53:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:53:04,869][06909] Updated weights for policy 0, policy_version 161853 (0.0027) [2024-06-28 05:53:08,850][06674] Fps is (10 sec: 39322.0, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 2651914240. Throughput: 0: 44004.6. Samples: 2554925860. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 05:53:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:53:09,185][06909] Updated weights for policy 0, policy_version 161863 (0.0032) [2024-06-28 05:53:12,385][06909] Updated weights for policy 0, policy_version 161873 (0.0033) [2024-06-28 05:53:13,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.6, 300 sec: 44098.0). Total num frames: 2652160000. Throughput: 0: 43800.4. Samples: 2555042800. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 05:53:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:53:16,597][06909] Updated weights for policy 0, policy_version 161883 (0.0028) [2024-06-28 05:53:18,850][06674] Fps is (10 sec: 50790.0, 60 sec: 44509.8, 300 sec: 44209.0). Total num frames: 2652422144. Throughput: 0: 43809.4. Samples: 2555309600. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 05:53:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:53:19,905][06909] Updated weights for policy 0, policy_version 161893 (0.0046) [2024-06-28 05:53:23,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 2652585984. Throughput: 0: 43581.8. Samples: 2555573540. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 05:53:23,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:53:24,217][06909] Updated weights for policy 0, policy_version 161903 (0.0033) [2024-06-28 05:53:27,100][06909] Updated weights for policy 0, policy_version 161913 (0.0036) [2024-06-28 05:53:28,850][06674] Fps is (10 sec: 39321.5, 60 sec: 43417.5, 300 sec: 44042.4). Total num frames: 2652815360. Throughput: 0: 43657.8. Samples: 2555695680. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 05:53:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:53:31,613][06909] Updated weights for policy 0, policy_version 161923 (0.0035) [2024-06-28 05:53:33,850][06674] Fps is (10 sec: 47513.4, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2653061120. Throughput: 0: 43709.3. Samples: 2555963180. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 05:53:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:53:34,099][06887] Signal inference workers to stop experience collection... (36350 times) [2024-06-28 05:53:34,148][06909] InferenceWorker_p0-w0: stopping experience collection (36350 times) [2024-06-28 05:53:34,153][06887] Signal inference workers to resume experience collection... (36350 times) [2024-06-28 05:53:34,159][06909] InferenceWorker_p0-w0: resuming experience collection (36350 times) [2024-06-28 05:53:34,703][06909] Updated weights for policy 0, policy_version 161933 (0.0032) [2024-06-28 05:53:38,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2653257728. Throughput: 0: 43833.9. Samples: 2556240400. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 05:53:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:53:39,027][06909] Updated weights for policy 0, policy_version 161943 (0.0026) [2024-06-28 05:53:42,266][06909] Updated weights for policy 0, policy_version 161953 (0.0035) [2024-06-28 05:53:43,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43417.7, 300 sec: 43986.9). Total num frames: 2653470720. Throughput: 0: 43760.1. Samples: 2556359160. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 05:53:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:53:46,302][06909] Updated weights for policy 0, policy_version 161963 (0.0049) [2024-06-28 05:53:48,850][06674] Fps is (10 sec: 47513.1, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2653732864. Throughput: 0: 43895.9. Samples: 2556629300. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 05:53:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:53:48,865][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000161971_2653732864.pth... [2024-06-28 05:53:48,947][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000161325_2643148800.pth [2024-06-28 05:53:49,513][06909] Updated weights for policy 0, policy_version 161973 (0.0037) [2024-06-28 05:53:53,702][06909] Updated weights for policy 0, policy_version 161983 (0.0030) [2024-06-28 05:53:53,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44510.0, 300 sec: 43986.9). Total num frames: 2653929472. Throughput: 0: 43775.5. Samples: 2556895760. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 05:53:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:53:56,765][06909] Updated weights for policy 0, policy_version 161993 (0.0038) [2024-06-28 05:53:58,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.6, 300 sec: 44042.7). Total num frames: 2654142464. Throughput: 0: 43939.5. Samples: 2557020080. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 05:53:58,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:54:01,439][06909] Updated weights for policy 0, policy_version 162003 (0.0026) [2024-06-28 05:54:03,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2654388224. Throughput: 0: 43896.9. Samples: 2557284960. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 05:54:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:54:04,298][06909] Updated weights for policy 0, policy_version 162013 (0.0027) [2024-06-28 05:54:08,745][06909] Updated weights for policy 0, policy_version 162023 (0.0027) [2024-06-28 05:54:08,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44509.7, 300 sec: 43986.8). Total num frames: 2654584832. Throughput: 0: 43991.9. Samples: 2557553180. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 05:54:08,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:54:11,777][06909] Updated weights for policy 0, policy_version 162033 (0.0040) [2024-06-28 05:54:13,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2654797824. Throughput: 0: 44072.1. Samples: 2557678920. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 05:54:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:54:16,206][06909] Updated weights for policy 0, policy_version 162043 (0.0046) [2024-06-28 05:54:18,850][06674] Fps is (10 sec: 45876.0, 60 sec: 43690.7, 300 sec: 44097.9). Total num frames: 2655043584. Throughput: 0: 43977.0. Samples: 2557942140. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 05:54:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:54:19,257][06909] Updated weights for policy 0, policy_version 162053 (0.0030) [2024-06-28 05:54:23,675][06909] Updated weights for policy 0, policy_version 162063 (0.0044) [2024-06-28 05:54:23,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44510.0, 300 sec: 44042.4). Total num frames: 2655256576. Throughput: 0: 43825.4. Samples: 2558212540. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 05:54:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:54:26,850][06909] Updated weights for policy 0, policy_version 162073 (0.0031) [2024-06-28 05:54:28,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 2655453184. Throughput: 0: 43954.2. Samples: 2558337100. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 05:54:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:54:31,072][06909] Updated weights for policy 0, policy_version 162083 (0.0026) [2024-06-28 05:54:33,850][06674] Fps is (10 sec: 45874.2, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2655715328. Throughput: 0: 43972.4. Samples: 2558608060. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 05:54:33,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:54:34,093][06909] Updated weights for policy 0, policy_version 162093 (0.0024) [2024-06-28 05:54:38,543][06909] Updated weights for policy 0, policy_version 162103 (0.0028) [2024-06-28 05:54:38,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2655911936. Throughput: 0: 43961.7. Samples: 2558874040. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 05:54:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:54:41,773][06909] Updated weights for policy 0, policy_version 162113 (0.0026) [2024-06-28 05:54:43,850][06674] Fps is (10 sec: 39322.3, 60 sec: 43963.7, 300 sec: 43931.4). Total num frames: 2656108544. Throughput: 0: 43963.3. Samples: 2558998420. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 05:54:43,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 05:54:45,894][06909] Updated weights for policy 0, policy_version 162123 (0.0029) [2024-06-28 05:54:48,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2656370688. Throughput: 0: 44026.3. Samples: 2559266140. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 05:54:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:54:48,992][06909] Updated weights for policy 0, policy_version 162133 (0.0036) [2024-06-28 05:54:53,566][06909] Updated weights for policy 0, policy_version 162143 (0.0031) [2024-06-28 05:54:53,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2656567296. Throughput: 0: 43950.8. Samples: 2559530960. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 05:54:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:54:56,644][06909] Updated weights for policy 0, policy_version 162153 (0.0031) [2024-06-28 05:54:58,850][06674] Fps is (10 sec: 39321.7, 60 sec: 43690.8, 300 sec: 43820.3). Total num frames: 2656763904. Throughput: 0: 44041.3. Samples: 2559660780. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 05:54:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:55:00,875][06909] Updated weights for policy 0, policy_version 162163 (0.0031) [2024-06-28 05:55:03,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2657026048. Throughput: 0: 44035.6. Samples: 2559923740. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 05:55:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:55:04,053][06909] Updated weights for policy 0, policy_version 162173 (0.0029) [2024-06-28 05:55:08,319][06909] Updated weights for policy 0, policy_version 162183 (0.0026) [2024-06-28 05:55:08,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2657222656. Throughput: 0: 43923.4. Samples: 2560189100. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 05:55:08,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 05:55:10,883][06887] Signal inference workers to stop experience collection... (36400 times) [2024-06-28 05:55:10,922][06909] InferenceWorker_p0-w0: stopping experience collection (36400 times) [2024-06-28 05:55:10,931][06887] Signal inference workers to resume experience collection... (36400 times) [2024-06-28 05:55:10,941][06909] InferenceWorker_p0-w0: resuming experience collection (36400 times) [2024-06-28 05:55:11,225][06909] Updated weights for policy 0, policy_version 162193 (0.0034) [2024-06-28 05:55:13,850][06674] Fps is (10 sec: 39321.7, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 2657419264. Throughput: 0: 43904.9. Samples: 2560312820. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 05:55:13,850][06674] Avg episode reward: [(0, '0.429')] [2024-06-28 05:55:15,958][06909] Updated weights for policy 0, policy_version 162203 (0.0032) [2024-06-28 05:55:18,760][06909] Updated weights for policy 0, policy_version 162213 (0.0024) [2024-06-28 05:55:18,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44236.7, 300 sec: 44209.0). Total num frames: 2657697792. Throughput: 0: 43715.6. Samples: 2560575260. Policy #0 lag: (min: 0.0, avg: 12.3, max: 22.0) [2024-06-28 05:55:18,860][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:55:23,162][06909] Updated weights for policy 0, policy_version 162223 (0.0034) [2024-06-28 05:55:23,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 2657878016. Throughput: 0: 43874.7. Samples: 2560848400. Policy #0 lag: (min: 0.0, avg: 12.3, max: 22.0) [2024-06-28 05:55:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:55:26,272][06909] Updated weights for policy 0, policy_version 162233 (0.0035) [2024-06-28 05:55:28,856][06674] Fps is (10 sec: 39297.7, 60 sec: 43959.3, 300 sec: 43874.9). Total num frames: 2658091008. Throughput: 0: 43758.4. Samples: 2560967820. Policy #0 lag: (min: 0.0, avg: 12.3, max: 22.0) [2024-06-28 05:55:28,857][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:55:30,923][06909] Updated weights for policy 0, policy_version 162243 (0.0042) [2024-06-28 05:55:33,844][06909] Updated weights for policy 0, policy_version 162253 (0.0043) [2024-06-28 05:55:33,850][06674] Fps is (10 sec: 47513.6, 60 sec: 43963.9, 300 sec: 44153.5). Total num frames: 2658353152. Throughput: 0: 43820.5. Samples: 2561238060. Policy #0 lag: (min: 0.0, avg: 12.3, max: 22.0) [2024-06-28 05:55:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:55:38,255][06909] Updated weights for policy 0, policy_version 162263 (0.0029) [2024-06-28 05:55:38,850][06674] Fps is (10 sec: 47542.6, 60 sec: 44236.8, 300 sec: 44042.8). Total num frames: 2658566144. Throughput: 0: 43799.5. Samples: 2561501940. Policy #0 lag: (min: 0.0, avg: 12.3, max: 22.0) [2024-06-28 05:55:38,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:55:41,598][06909] Updated weights for policy 0, policy_version 162273 (0.0039) [2024-06-28 05:55:43,850][06674] Fps is (10 sec: 39321.3, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 2658746368. Throughput: 0: 43811.9. Samples: 2561632320. Policy #0 lag: (min: 0.0, avg: 12.3, max: 22.0) [2024-06-28 05:55:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:55:45,525][06909] Updated weights for policy 0, policy_version 162283 (0.0031) [2024-06-28 05:55:48,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43690.5, 300 sec: 44098.0). Total num frames: 2658992128. Throughput: 0: 43842.0. Samples: 2561896640. Policy #0 lag: (min: 0.0, avg: 12.3, max: 22.0) [2024-06-28 05:55:48,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:55:48,856][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000162292_2658992128.pth... [2024-06-28 05:55:48,942][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000161649_2648457216.pth [2024-06-28 05:55:49,085][06909] Updated weights for policy 0, policy_version 162293 (0.0031) [2024-06-28 05:55:53,180][06909] Updated weights for policy 0, policy_version 162303 (0.0043) [2024-06-28 05:55:53,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2659205120. Throughput: 0: 43930.2. Samples: 2562165960. Policy #0 lag: (min: 0.0, avg: 12.3, max: 22.0) [2024-06-28 05:55:53,850][06674] Avg episode reward: [(0, '0.466')] [2024-06-28 05:55:56,241][06909] Updated weights for policy 0, policy_version 162313 (0.0034) [2024-06-28 05:55:58,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 2659401728. Throughput: 0: 43987.5. Samples: 2562292260. Policy #0 lag: (min: 0.0, avg: 12.3, max: 22.0) [2024-06-28 05:55:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:56:00,371][06909] Updated weights for policy 0, policy_version 162323 (0.0036) [2024-06-28 05:56:03,717][06909] Updated weights for policy 0, policy_version 162333 (0.0029) [2024-06-28 05:56:03,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2659663872. Throughput: 0: 43951.5. Samples: 2562553080. Policy #0 lag: (min: 0.0, avg: 12.3, max: 22.0) [2024-06-28 05:56:03,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:56:08,065][06909] Updated weights for policy 0, policy_version 162343 (0.0033) [2024-06-28 05:56:08,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2659860480. Throughput: 0: 43931.4. Samples: 2562825320. Policy #0 lag: (min: 0.0, avg: 12.3, max: 22.0) [2024-06-28 05:56:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:56:11,710][06909] Updated weights for policy 0, policy_version 162353 (0.0035) [2024-06-28 05:56:13,850][06674] Fps is (10 sec: 39322.2, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 2660057088. Throughput: 0: 44165.7. Samples: 2562955000. Policy #0 lag: (min: 0.0, avg: 12.3, max: 22.0) [2024-06-28 05:56:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:56:15,245][06909] Updated weights for policy 0, policy_version 162363 (0.0038) [2024-06-28 05:56:15,769][06887] Signal inference workers to stop experience collection... (36450 times) [2024-06-28 05:56:15,773][06887] Signal inference workers to resume experience collection... (36450 times) [2024-06-28 05:56:15,787][06909] InferenceWorker_p0-w0: stopping experience collection (36450 times) [2024-06-28 05:56:15,792][06909] InferenceWorker_p0-w0: resuming experience collection (36450 times) [2024-06-28 05:56:18,852][06674] Fps is (10 sec: 44225.7, 60 sec: 43415.8, 300 sec: 44097.6). Total num frames: 2660302848. Throughput: 0: 43992.1. Samples: 2563217820. Policy #0 lag: (min: 0.0, avg: 12.3, max: 22.0) [2024-06-28 05:56:18,853][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:56:19,001][06909] Updated weights for policy 0, policy_version 162373 (0.0033) [2024-06-28 05:56:23,066][06909] Updated weights for policy 0, policy_version 162383 (0.0025) [2024-06-28 05:56:23,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 2660515840. Throughput: 0: 44000.5. Samples: 2563481960. Policy #0 lag: (min: 0.0, avg: 7.8, max: 23.0) [2024-06-28 05:56:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:56:26,410][06909] Updated weights for policy 0, policy_version 162393 (0.0041) [2024-06-28 05:56:28,850][06674] Fps is (10 sec: 42608.9, 60 sec: 43968.1, 300 sec: 43931.3). Total num frames: 2660728832. Throughput: 0: 43989.2. Samples: 2563611840. Policy #0 lag: (min: 0.0, avg: 7.8, max: 23.0) [2024-06-28 05:56:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:56:30,310][06909] Updated weights for policy 0, policy_version 162403 (0.0030) [2024-06-28 05:56:33,714][06909] Updated weights for policy 0, policy_version 162413 (0.0023) [2024-06-28 05:56:33,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 2660974592. Throughput: 0: 43935.3. Samples: 2563873720. Policy #0 lag: (min: 0.0, avg: 7.8, max: 23.0) [2024-06-28 05:56:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:56:38,104][06909] Updated weights for policy 0, policy_version 162423 (0.0026) [2024-06-28 05:56:38,850][06674] Fps is (10 sec: 45875.9, 60 sec: 43690.7, 300 sec: 43931.4). Total num frames: 2661187584. Throughput: 0: 43897.9. Samples: 2564141360. Policy #0 lag: (min: 0.0, avg: 7.8, max: 23.0) [2024-06-28 05:56:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:56:41,338][06909] Updated weights for policy 0, policy_version 162433 (0.0045) [2024-06-28 05:56:43,850][06674] Fps is (10 sec: 39321.4, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 2661367808. Throughput: 0: 43884.9. Samples: 2564267080. Policy #0 lag: (min: 0.0, avg: 7.8, max: 23.0) [2024-06-28 05:56:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:56:45,429][06909] Updated weights for policy 0, policy_version 162443 (0.0027) [2024-06-28 05:56:48,829][06909] Updated weights for policy 0, policy_version 162453 (0.0024) [2024-06-28 05:56:48,853][06674] Fps is (10 sec: 44221.5, 60 sec: 43961.3, 300 sec: 44098.3). Total num frames: 2661629952. Throughput: 0: 44006.0. Samples: 2564533500. Policy #0 lag: (min: 0.0, avg: 7.8, max: 23.0) [2024-06-28 05:56:48,854][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 05:56:52,855][06909] Updated weights for policy 0, policy_version 162463 (0.0027) [2024-06-28 05:56:53,850][06674] Fps is (10 sec: 47513.9, 60 sec: 43963.8, 300 sec: 43931.4). Total num frames: 2661842944. Throughput: 0: 43861.9. Samples: 2564799100. Policy #0 lag: (min: 0.0, avg: 7.8, max: 23.0) [2024-06-28 05:56:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:56:56,271][06909] Updated weights for policy 0, policy_version 162473 (0.0029) [2024-06-28 05:56:58,850][06674] Fps is (10 sec: 39335.4, 60 sec: 43690.7, 300 sec: 43820.6). Total num frames: 2662023168. Throughput: 0: 43901.8. Samples: 2564930580. Policy #0 lag: (min: 0.0, avg: 7.8, max: 23.0) [2024-06-28 05:56:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:57:00,371][06909] Updated weights for policy 0, policy_version 162483 (0.0029) [2024-06-28 05:57:03,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43417.7, 300 sec: 44098.0). Total num frames: 2662268928. Throughput: 0: 43904.8. Samples: 2565193420. Policy #0 lag: (min: 0.0, avg: 7.8, max: 23.0) [2024-06-28 05:57:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:57:03,894][06909] Updated weights for policy 0, policy_version 162493 (0.0049) [2024-06-28 05:57:07,462][06909] Updated weights for policy 0, policy_version 162503 (0.0025) [2024-06-28 05:57:08,850][06674] Fps is (10 sec: 49151.4, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2662514688. Throughput: 0: 44131.1. Samples: 2565467860. Policy #0 lag: (min: 0.0, avg: 7.8, max: 23.0) [2024-06-28 05:57:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:57:11,139][06909] Updated weights for policy 0, policy_version 162513 (0.0042) [2024-06-28 05:57:13,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 2662694912. Throughput: 0: 44075.3. Samples: 2565595220. Policy #0 lag: (min: 0.0, avg: 7.8, max: 23.0) [2024-06-28 05:57:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:57:15,206][06909] Updated weights for policy 0, policy_version 162523 (0.0031) [2024-06-28 05:57:18,396][06909] Updated weights for policy 0, policy_version 162533 (0.0040) [2024-06-28 05:57:18,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43965.7, 300 sec: 44042.4). Total num frames: 2662940672. Throughput: 0: 44100.9. Samples: 2565858260. Policy #0 lag: (min: 0.0, avg: 7.8, max: 23.0) [2024-06-28 05:57:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:57:22,363][06909] Updated weights for policy 0, policy_version 162543 (0.0026) [2024-06-28 05:57:23,850][06674] Fps is (10 sec: 47513.2, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 2663170048. Throughput: 0: 44090.2. Samples: 2566125420. Policy #0 lag: (min: 0.0, avg: 7.8, max: 23.0) [2024-06-28 05:57:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:57:25,935][06909] Updated weights for policy 0, policy_version 162553 (0.0028) [2024-06-28 05:57:28,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.8, 300 sec: 43931.4). Total num frames: 2663366656. Throughput: 0: 44287.2. Samples: 2566260000. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 05:57:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:57:29,000][06887] Signal inference workers to stop experience collection... (36500 times) [2024-06-28 05:57:29,000][06887] Signal inference workers to resume experience collection... (36500 times) [2024-06-28 05:57:29,020][06909] InferenceWorker_p0-w0: stopping experience collection (36500 times) [2024-06-28 05:57:29,020][06909] InferenceWorker_p0-w0: resuming experience collection (36500 times) [2024-06-28 05:57:29,924][06909] Updated weights for policy 0, policy_version 162563 (0.0030) [2024-06-28 05:57:33,497][06909] Updated weights for policy 0, policy_version 162573 (0.0035) [2024-06-28 05:57:33,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2663596032. Throughput: 0: 44203.0. Samples: 2566522480. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 05:57:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:57:37,262][06909] Updated weights for policy 0, policy_version 162583 (0.0029) [2024-06-28 05:57:38,850][06674] Fps is (10 sec: 47513.2, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 2663841792. Throughput: 0: 44149.2. Samples: 2566785820. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 05:57:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:57:40,982][06909] Updated weights for policy 0, policy_version 162593 (0.0026) [2024-06-28 05:57:43,850][06674] Fps is (10 sec: 44236.1, 60 sec: 44509.8, 300 sec: 43875.8). Total num frames: 2664038400. Throughput: 0: 44266.1. Samples: 2566922560. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 05:57:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:57:44,709][06909] Updated weights for policy 0, policy_version 162603 (0.0035) [2024-06-28 05:57:48,230][06909] Updated weights for policy 0, policy_version 162613 (0.0020) [2024-06-28 05:57:48,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43693.2, 300 sec: 44042.4). Total num frames: 2664251392. Throughput: 0: 44160.9. Samples: 2567180660. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 05:57:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:57:48,940][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000162614_2664267776.pth... [2024-06-28 05:57:49,000][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000161971_2653732864.pth [2024-06-28 05:57:52,163][06909] Updated weights for policy 0, policy_version 162623 (0.0042) [2024-06-28 05:57:53,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2664497152. Throughput: 0: 43876.5. Samples: 2567442300. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 05:57:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:57:55,686][06909] Updated weights for policy 0, policy_version 162633 (0.0028) [2024-06-28 05:57:58,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44509.8, 300 sec: 43875.8). Total num frames: 2664693760. Throughput: 0: 44113.7. Samples: 2567580340. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 05:57:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:57:59,606][06909] Updated weights for policy 0, policy_version 162643 (0.0040) [2024-06-28 05:58:03,293][06909] Updated weights for policy 0, policy_version 162653 (0.0028) [2024-06-28 05:58:03,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2664923136. Throughput: 0: 44206.6. Samples: 2567847560. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 05:58:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:58:06,943][06909] Updated weights for policy 0, policy_version 162663 (0.0027) [2024-06-28 05:58:08,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2665168896. Throughput: 0: 44140.0. Samples: 2568111720. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 05:58:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:58:10,952][06909] Updated weights for policy 0, policy_version 162673 (0.0031) [2024-06-28 05:58:13,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44509.9, 300 sec: 43875.8). Total num frames: 2665365504. Throughput: 0: 44213.4. Samples: 2568249600. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 05:58:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:58:14,227][06909] Updated weights for policy 0, policy_version 162683 (0.0039) [2024-06-28 05:58:18,501][06909] Updated weights for policy 0, policy_version 162693 (0.0037) [2024-06-28 05:58:18,850][06674] Fps is (10 sec: 39321.5, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 2665562112. Throughput: 0: 44165.2. Samples: 2568509920. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 05:58:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:58:21,833][06909] Updated weights for policy 0, policy_version 162703 (0.0039) [2024-06-28 05:58:23,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2665824256. Throughput: 0: 44009.9. Samples: 2568766260. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 05:58:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:58:25,665][06909] Updated weights for policy 0, policy_version 162713 (0.0039) [2024-06-28 05:58:28,850][06674] Fps is (10 sec: 47513.9, 60 sec: 44509.9, 300 sec: 43986.9). Total num frames: 2666037248. Throughput: 0: 44186.3. Samples: 2568910940. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 05:58:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 05:58:28,965][06909] Updated weights for policy 0, policy_version 162723 (0.0043) [2024-06-28 05:58:33,011][06909] Updated weights for policy 0, policy_version 162733 (0.0042) [2024-06-28 05:58:33,850][06674] Fps is (10 sec: 42597.8, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2666250240. Throughput: 0: 44279.4. Samples: 2569173240. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-28 05:58:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:58:36,551][06909] Updated weights for policy 0, policy_version 162743 (0.0042) [2024-06-28 05:58:38,852][06674] Fps is (10 sec: 44227.6, 60 sec: 43962.3, 300 sec: 44097.6). Total num frames: 2666479616. Throughput: 0: 44250.4. Samples: 2569433660. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-28 05:58:38,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 05:58:40,511][06909] Updated weights for policy 0, policy_version 162753 (0.0039) [2024-06-28 05:58:43,718][06909] Updated weights for policy 0, policy_version 162763 (0.0026) [2024-06-28 05:58:43,850][06674] Fps is (10 sec: 45875.7, 60 sec: 44509.9, 300 sec: 43986.9). Total num frames: 2666708992. Throughput: 0: 44300.9. Samples: 2569573880. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-28 05:58:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:58:48,247][06909] Updated weights for policy 0, policy_version 162773 (0.0033) [2024-06-28 05:58:48,850][06674] Fps is (10 sec: 40967.9, 60 sec: 43963.6, 300 sec: 43931.3). Total num frames: 2666889216. Throughput: 0: 44347.0. Samples: 2569843180. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-28 05:58:48,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 05:58:51,073][06909] Updated weights for policy 0, policy_version 162783 (0.0023) [2024-06-28 05:58:53,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44236.7, 300 sec: 44098.0). Total num frames: 2667151360. Throughput: 0: 44105.7. Samples: 2570096480. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-28 05:58:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:58:55,688][06909] Updated weights for policy 0, policy_version 162793 (0.0046) [2024-06-28 05:58:58,610][06909] Updated weights for policy 0, policy_version 162803 (0.0036) [2024-06-28 05:58:58,717][06887] Signal inference workers to stop experience collection... (36550 times) [2024-06-28 05:58:58,761][06909] InferenceWorker_p0-w0: stopping experience collection (36550 times) [2024-06-28 05:58:58,831][06887] Signal inference workers to resume experience collection... (36550 times) [2024-06-28 05:58:58,831][06909] InferenceWorker_p0-w0: resuming experience collection (36550 times) [2024-06-28 05:58:58,850][06674] Fps is (10 sec: 49152.7, 60 sec: 44783.0, 300 sec: 44042.4). Total num frames: 2667380736. Throughput: 0: 44271.5. Samples: 2570241820. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-28 05:58:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 05:59:03,113][06909] Updated weights for policy 0, policy_version 162813 (0.0028) [2024-06-28 05:59:03,850][06674] Fps is (10 sec: 40960.6, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2667560960. Throughput: 0: 44351.6. Samples: 2570505740. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-28 05:59:03,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 05:59:05,765][06909] Updated weights for policy 0, policy_version 162823 (0.0037) [2024-06-28 05:59:08,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2667806720. Throughput: 0: 44510.6. Samples: 2570769240. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-28 05:59:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 05:59:10,316][06909] Updated weights for policy 0, policy_version 162833 (0.0037) [2024-06-28 05:59:13,457][06909] Updated weights for policy 0, policy_version 162843 (0.0035) [2024-06-28 05:59:13,850][06674] Fps is (10 sec: 47512.7, 60 sec: 44509.7, 300 sec: 44042.4). Total num frames: 2668036096. Throughput: 0: 44230.5. Samples: 2570901320. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-28 05:59:13,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 05:59:17,388][06909] Updated weights for policy 0, policy_version 162853 (0.0036) [2024-06-28 05:59:18,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44509.9, 300 sec: 43986.9). Total num frames: 2668232704. Throughput: 0: 44316.9. Samples: 2571167500. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-28 05:59:18,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:59:20,632][06909] Updated weights for policy 0, policy_version 162863 (0.0036) [2024-06-28 05:59:23,850][06674] Fps is (10 sec: 44237.4, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2668478464. Throughput: 0: 44417.6. Samples: 2571432360. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-28 05:59:23,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 05:59:25,194][06909] Updated weights for policy 0, policy_version 162873 (0.0038) [2024-06-28 05:59:27,905][06909] Updated weights for policy 0, policy_version 162883 (0.0027) [2024-06-28 05:59:28,850][06674] Fps is (10 sec: 47513.7, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 2668707840. Throughput: 0: 44370.2. Samples: 2571570540. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-28 05:59:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:59:33,102][06909] Updated weights for policy 0, policy_version 162893 (0.0025) [2024-06-28 05:59:33,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2668888064. Throughput: 0: 44205.5. Samples: 2571832420. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-28 05:59:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 05:59:35,606][06909] Updated weights for policy 0, policy_version 162903 (0.0034) [2024-06-28 05:59:38,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43965.3, 300 sec: 44098.0). Total num frames: 2669117440. Throughput: 0: 44224.1. Samples: 2572086560. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 05:59:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:59:40,305][06909] Updated weights for policy 0, policy_version 162913 (0.0031) [2024-06-28 05:59:42,957][06909] Updated weights for policy 0, policy_version 162923 (0.0025) [2024-06-28 05:59:43,850][06674] Fps is (10 sec: 47512.7, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2669363200. Throughput: 0: 44059.0. Samples: 2572224480. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 05:59:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 05:59:47,533][06909] Updated weights for policy 0, policy_version 162933 (0.0053) [2024-06-28 05:59:48,850][06674] Fps is (10 sec: 42598.1, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 2669543424. Throughput: 0: 44159.0. Samples: 2572492900. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 05:59:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 05:59:48,996][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000162937_2669559808.pth... [2024-06-28 05:59:49,042][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000162292_2658992128.pth [2024-06-28 05:59:50,648][06909] Updated weights for policy 0, policy_version 162943 (0.0044) [2024-06-28 05:59:53,850][06674] Fps is (10 sec: 40960.7, 60 sec: 43690.8, 300 sec: 44098.0). Total num frames: 2669772800. Throughput: 0: 43873.9. Samples: 2572743560. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 05:59:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 05:59:54,912][06909] Updated weights for policy 0, policy_version 162953 (0.0038) [2024-06-28 05:59:57,913][06909] Updated weights for policy 0, policy_version 162963 (0.0037) [2024-06-28 05:59:58,850][06674] Fps is (10 sec: 47513.9, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2670018560. Throughput: 0: 44095.3. Samples: 2572885600. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 05:59:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:00:02,844][06909] Updated weights for policy 0, policy_version 162973 (0.0024) [2024-06-28 06:00:03,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2670215168. Throughput: 0: 44103.6. Samples: 2573152160. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 06:00:03,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:00:05,190][06909] Updated weights for policy 0, policy_version 162983 (0.0026) [2024-06-28 06:00:08,853][06674] Fps is (10 sec: 42584.1, 60 sec: 43961.3, 300 sec: 44153.0). Total num frames: 2670444544. Throughput: 0: 43875.0. Samples: 2573406880. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 06:00:08,854][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:00:10,027][06909] Updated weights for policy 0, policy_version 162993 (0.0042) [2024-06-28 06:00:12,882][06909] Updated weights for policy 0, policy_version 163003 (0.0034) [2024-06-28 06:00:13,850][06674] Fps is (10 sec: 47513.7, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 2670690304. Throughput: 0: 43884.9. Samples: 2573545360. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 06:00:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:00:17,171][06909] Updated weights for policy 0, policy_version 163013 (0.0028) [2024-06-28 06:00:18,850][06674] Fps is (10 sec: 42612.7, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2670870528. Throughput: 0: 43996.4. Samples: 2573812260. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 06:00:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:00:19,507][06887] Signal inference workers to stop experience collection... (36600 times) [2024-06-28 06:00:19,508][06887] Signal inference workers to resume experience collection... (36600 times) [2024-06-28 06:00:19,522][06909] InferenceWorker_p0-w0: stopping experience collection (36600 times) [2024-06-28 06:00:19,533][06909] InferenceWorker_p0-w0: resuming experience collection (36600 times) [2024-06-28 06:00:20,021][06909] Updated weights for policy 0, policy_version 163023 (0.0040) [2024-06-28 06:00:23,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.7, 300 sec: 44154.4). Total num frames: 2671116288. Throughput: 0: 44171.1. Samples: 2574074260. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 06:00:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 06:00:24,383][06909] Updated weights for policy 0, policy_version 163033 (0.0031) [2024-06-28 06:00:27,673][06909] Updated weights for policy 0, policy_version 163043 (0.0045) [2024-06-28 06:00:28,850][06674] Fps is (10 sec: 47513.9, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2671345664. Throughput: 0: 44180.6. Samples: 2574212600. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 06:00:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:00:32,101][06909] Updated weights for policy 0, policy_version 163053 (0.0026) [2024-06-28 06:00:33,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2671542272. Throughput: 0: 44188.5. Samples: 2574481380. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 06:00:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:00:35,316][06909] Updated weights for policy 0, policy_version 163063 (0.0038) [2024-06-28 06:00:38,850][06674] Fps is (10 sec: 42598.0, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2671771648. Throughput: 0: 44336.4. Samples: 2574738700. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 06:00:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:00:39,642][06909] Updated weights for policy 0, policy_version 163073 (0.0042) [2024-06-28 06:00:42,783][06909] Updated weights for policy 0, policy_version 163083 (0.0043) [2024-06-28 06:00:43,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.9, 300 sec: 44098.0). Total num frames: 2672001024. Throughput: 0: 44109.8. Samples: 2574870540. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-28 06:00:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:00:47,067][06909] Updated weights for policy 0, policy_version 163093 (0.0037) [2024-06-28 06:00:48,856][06674] Fps is (10 sec: 42573.0, 60 sec: 44232.4, 300 sec: 44041.5). Total num frames: 2672197632. Throughput: 0: 44157.7. Samples: 2575139520. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-28 06:00:48,856][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:00:50,030][06909] Updated weights for policy 0, policy_version 163103 (0.0041) [2024-06-28 06:00:53,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2672427008. Throughput: 0: 44066.4. Samples: 2575389720. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-28 06:00:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:00:54,386][06909] Updated weights for policy 0, policy_version 163113 (0.0031) [2024-06-28 06:00:57,471][06909] Updated weights for policy 0, policy_version 163123 (0.0038) [2024-06-28 06:00:58,850][06674] Fps is (10 sec: 47542.1, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2672672768. Throughput: 0: 44088.5. Samples: 2575529340. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-28 06:00:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 06:01:01,545][06909] Updated weights for policy 0, policy_version 163133 (0.0030) [2024-06-28 06:01:03,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2672852992. Throughput: 0: 44076.0. Samples: 2575795680. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-28 06:01:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:01:04,864][06909] Updated weights for policy 0, policy_version 163143 (0.0029) [2024-06-28 06:01:08,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43966.1, 300 sec: 44153.5). Total num frames: 2673082368. Throughput: 0: 44038.6. Samples: 2576056000. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-28 06:01:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:01:09,115][06909] Updated weights for policy 0, policy_version 163153 (0.0041) [2024-06-28 06:01:12,552][06909] Updated weights for policy 0, policy_version 163163 (0.0026) [2024-06-28 06:01:13,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43690.7, 300 sec: 44098.3). Total num frames: 2673311744. Throughput: 0: 43895.5. Samples: 2576187900. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-28 06:01:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:01:16,693][06909] Updated weights for policy 0, policy_version 163173 (0.0049) [2024-06-28 06:01:18,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2673524736. Throughput: 0: 43807.9. Samples: 2576452740. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-28 06:01:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:01:19,947][06909] Updated weights for policy 0, policy_version 163183 (0.0041) [2024-06-28 06:01:23,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 2673737728. Throughput: 0: 43797.4. Samples: 2576709580. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-28 06:01:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:01:23,983][06909] Updated weights for policy 0, policy_version 163193 (0.0031) [2024-06-28 06:01:27,351][06909] Updated weights for policy 0, policy_version 163203 (0.0031) [2024-06-28 06:01:28,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2673983488. Throughput: 0: 43997.2. Samples: 2576850420. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-28 06:01:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 06:01:31,295][06909] Updated weights for policy 0, policy_version 163213 (0.0032) [2024-06-28 06:01:33,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2674180096. Throughput: 0: 43853.3. Samples: 2577112660. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-28 06:01:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:01:34,870][06909] Updated weights for policy 0, policy_version 163223 (0.0029) [2024-06-28 06:01:38,538][06909] Updated weights for policy 0, policy_version 163233 (0.0033) [2024-06-28 06:01:38,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 2674409472. Throughput: 0: 44094.6. Samples: 2577373980. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-28 06:01:38,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:01:42,258][06909] Updated weights for policy 0, policy_version 163243 (0.0041) [2024-06-28 06:01:43,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.5, 300 sec: 44042.9). Total num frames: 2674622464. Throughput: 0: 43936.3. Samples: 2577506480. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-28 06:01:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 06:01:45,816][06909] Updated weights for policy 0, policy_version 163253 (0.0035) [2024-06-28 06:01:46,805][06887] Signal inference workers to stop experience collection... (36650 times) [2024-06-28 06:01:46,849][06909] InferenceWorker_p0-w0: stopping experience collection (36650 times) [2024-06-28 06:01:46,865][06887] Signal inference workers to resume experience collection... (36650 times) [2024-06-28 06:01:46,866][06909] InferenceWorker_p0-w0: resuming experience collection (36650 times) [2024-06-28 06:01:48,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43968.1, 300 sec: 44042.4). Total num frames: 2674835456. Throughput: 0: 43870.2. Samples: 2577769840. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 06:01:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:01:48,861][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000163260_2674851840.pth... [2024-06-28 06:01:48,934][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000162614_2664267776.pth [2024-06-28 06:01:49,563][06909] Updated weights for policy 0, policy_version 163263 (0.0027) [2024-06-28 06:01:53,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 2675048448. Throughput: 0: 43811.2. Samples: 2578027500. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 06:01:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:01:53,991][06909] Updated weights for policy 0, policy_version 163273 (0.0031) [2024-06-28 06:01:57,300][06909] Updated weights for policy 0, policy_version 163283 (0.0026) [2024-06-28 06:01:58,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43690.6, 300 sec: 44153.5). Total num frames: 2675294208. Throughput: 0: 43979.9. Samples: 2578167000. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 06:01:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:02:01,193][06909] Updated weights for policy 0, policy_version 163293 (0.0034) [2024-06-28 06:02:03,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2675490816. Throughput: 0: 43871.2. Samples: 2578426940. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 06:02:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:02:04,704][06909] Updated weights for policy 0, policy_version 163303 (0.0029) [2024-06-28 06:02:08,411][06909] Updated weights for policy 0, policy_version 163313 (0.0027) [2024-06-28 06:02:08,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2675720192. Throughput: 0: 43987.6. Samples: 2578689020. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 06:02:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:02:12,057][06909] Updated weights for policy 0, policy_version 163323 (0.0031) [2024-06-28 06:02:13,854][06674] Fps is (10 sec: 44219.2, 60 sec: 43687.7, 300 sec: 44041.8). Total num frames: 2675933184. Throughput: 0: 43833.5. Samples: 2578823100. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 06:02:13,854][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:02:15,706][06909] Updated weights for policy 0, policy_version 163333 (0.0029) [2024-06-28 06:02:18,850][06674] Fps is (10 sec: 40959.2, 60 sec: 43417.6, 300 sec: 43931.3). Total num frames: 2676129792. Throughput: 0: 43763.9. Samples: 2579082040. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 06:02:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:02:19,658][06909] Updated weights for policy 0, policy_version 163343 (0.0034) [2024-06-28 06:02:23,850][06674] Fps is (10 sec: 42615.6, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2676359168. Throughput: 0: 43740.5. Samples: 2579342300. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 06:02:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:02:23,854][06909] Updated weights for policy 0, policy_version 163353 (0.0031) [2024-06-28 06:02:27,049][06909] Updated weights for policy 0, policy_version 163363 (0.0033) [2024-06-28 06:02:28,850][06674] Fps is (10 sec: 47514.6, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 2676604928. Throughput: 0: 43842.0. Samples: 2579479360. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 06:02:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:02:31,050][06909] Updated weights for policy 0, policy_version 163373 (0.0025) [2024-06-28 06:02:33,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 2676801536. Throughput: 0: 43844.4. Samples: 2579742840. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 06:02:33,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:02:34,354][06909] Updated weights for policy 0, policy_version 163383 (0.0034) [2024-06-28 06:02:38,188][06909] Updated weights for policy 0, policy_version 163393 (0.0040) [2024-06-28 06:02:38,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2677047296. Throughput: 0: 44055.9. Samples: 2580010020. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 06:02:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:02:41,770][06909] Updated weights for policy 0, policy_version 163403 (0.0038) [2024-06-28 06:02:43,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 2677260288. Throughput: 0: 43879.6. Samples: 2580141580. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 06:02:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:02:45,684][06909] Updated weights for policy 0, policy_version 163413 (0.0029) [2024-06-28 06:02:48,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2677473280. Throughput: 0: 44028.5. Samples: 2580408220. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 06:02:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:02:49,332][06909] Updated weights for policy 0, policy_version 163423 (0.0033) [2024-06-28 06:02:53,475][06909] Updated weights for policy 0, policy_version 163433 (0.0029) [2024-06-28 06:02:53,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2677686272. Throughput: 0: 44107.1. Samples: 2580673840. Policy #0 lag: (min: 1.0, avg: 9.3, max: 22.0) [2024-06-28 06:02:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:02:56,800][06909] Updated weights for policy 0, policy_version 163443 (0.0020) [2024-06-28 06:02:58,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43690.8, 300 sec: 44042.4). Total num frames: 2677915648. Throughput: 0: 43944.9. Samples: 2580800440. Policy #0 lag: (min: 1.0, avg: 9.3, max: 22.0) [2024-06-28 06:02:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:03:00,991][06909] Updated weights for policy 0, policy_version 163453 (0.0035) [2024-06-28 06:03:01,099][06887] Signal inference workers to stop experience collection... (36700 times) [2024-06-28 06:03:01,134][06909] InferenceWorker_p0-w0: stopping experience collection (36700 times) [2024-06-28 06:03:01,158][06887] Signal inference workers to resume experience collection... (36700 times) [2024-06-28 06:03:01,158][06909] InferenceWorker_p0-w0: resuming experience collection (36700 times) [2024-06-28 06:03:03,850][06674] Fps is (10 sec: 45874.1, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 2678145024. Throughput: 0: 44120.9. Samples: 2581067480. Policy #0 lag: (min: 1.0, avg: 9.3, max: 22.0) [2024-06-28 06:03:03,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:03:04,078][06909] Updated weights for policy 0, policy_version 163463 (0.0026) [2024-06-28 06:03:08,323][06909] Updated weights for policy 0, policy_version 163473 (0.0035) [2024-06-28 06:03:08,850][06674] Fps is (10 sec: 45873.8, 60 sec: 44236.6, 300 sec: 44097.9). Total num frames: 2678374400. Throughput: 0: 44217.5. Samples: 2581332100. Policy #0 lag: (min: 1.0, avg: 9.3, max: 22.0) [2024-06-28 06:03:08,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:03:11,662][06909] Updated weights for policy 0, policy_version 163483 (0.0036) [2024-06-28 06:03:13,850][06674] Fps is (10 sec: 44237.6, 60 sec: 44239.8, 300 sec: 44153.5). Total num frames: 2678587392. Throughput: 0: 44004.8. Samples: 2581459580. Policy #0 lag: (min: 1.0, avg: 9.3, max: 22.0) [2024-06-28 06:03:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:03:15,869][06909] Updated weights for policy 0, policy_version 163493 (0.0027) [2024-06-28 06:03:18,850][06674] Fps is (10 sec: 44238.0, 60 sec: 44783.1, 300 sec: 44042.4). Total num frames: 2678816768. Throughput: 0: 44062.3. Samples: 2581725640. Policy #0 lag: (min: 1.0, avg: 9.3, max: 22.0) [2024-06-28 06:03:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:03:18,928][06909] Updated weights for policy 0, policy_version 163503 (0.0042) [2024-06-28 06:03:23,512][06909] Updated weights for policy 0, policy_version 163513 (0.0035) [2024-06-28 06:03:23,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2679013376. Throughput: 0: 44085.5. Samples: 2581993860. Policy #0 lag: (min: 1.0, avg: 9.3, max: 22.0) [2024-06-28 06:03:23,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 06:03:26,723][06909] Updated weights for policy 0, policy_version 163523 (0.0023) [2024-06-28 06:03:28,850][06674] Fps is (10 sec: 44235.2, 60 sec: 44236.5, 300 sec: 44097.9). Total num frames: 2679259136. Throughput: 0: 44009.1. Samples: 2582122000. Policy #0 lag: (min: 1.0, avg: 9.3, max: 22.0) [2024-06-28 06:03:28,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 06:03:30,731][06909] Updated weights for policy 0, policy_version 163533 (0.0040) [2024-06-28 06:03:33,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44509.9, 300 sec: 44042.7). Total num frames: 2679472128. Throughput: 0: 43975.1. Samples: 2582387100. Policy #0 lag: (min: 1.0, avg: 9.3, max: 22.0) [2024-06-28 06:03:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:03:33,874][06909] Updated weights for policy 0, policy_version 163543 (0.0021) [2024-06-28 06:03:37,953][06909] Updated weights for policy 0, policy_version 163553 (0.0034) [2024-06-28 06:03:38,850][06674] Fps is (10 sec: 44238.5, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 2679701504. Throughput: 0: 44055.1. Samples: 2582656320. Policy #0 lag: (min: 1.0, avg: 9.3, max: 22.0) [2024-06-28 06:03:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:03:41,359][06909] Updated weights for policy 0, policy_version 163563 (0.0024) [2024-06-28 06:03:43,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 2679914496. Throughput: 0: 44244.4. Samples: 2582791440. Policy #0 lag: (min: 1.0, avg: 9.3, max: 22.0) [2024-06-28 06:03:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:03:45,100][06909] Updated weights for policy 0, policy_version 163573 (0.0026) [2024-06-28 06:03:48,517][06909] Updated weights for policy 0, policy_version 163583 (0.0022) [2024-06-28 06:03:48,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 2680143872. Throughput: 0: 44335.3. Samples: 2583062560. Policy #0 lag: (min: 1.0, avg: 9.3, max: 22.0) [2024-06-28 06:03:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:03:48,976][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000163584_2680160256.pth... [2024-06-28 06:03:49,048][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000162937_2669559808.pth [2024-06-28 06:03:52,692][06909] Updated weights for policy 0, policy_version 163593 (0.0037) [2024-06-28 06:03:53,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44509.8, 300 sec: 43986.9). Total num frames: 2680356864. Throughput: 0: 44221.5. Samples: 2583322060. Policy #0 lag: (min: 1.0, avg: 10.8, max: 22.0) [2024-06-28 06:03:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:03:55,912][06909] Updated weights for policy 0, policy_version 163603 (0.0039) [2024-06-28 06:03:58,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2680569856. Throughput: 0: 44351.6. Samples: 2583455400. Policy #0 lag: (min: 1.0, avg: 10.8, max: 22.0) [2024-06-28 06:03:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:04:00,444][06909] Updated weights for policy 0, policy_version 163613 (0.0029) [2024-06-28 06:04:03,463][06909] Updated weights for policy 0, policy_version 163623 (0.0024) [2024-06-28 06:04:03,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44237.0, 300 sec: 44042.4). Total num frames: 2680799232. Throughput: 0: 44271.6. Samples: 2583717860. Policy #0 lag: (min: 1.0, avg: 10.8, max: 22.0) [2024-06-28 06:04:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:04:07,772][06909] Updated weights for policy 0, policy_version 163633 (0.0024) [2024-06-28 06:04:08,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44237.0, 300 sec: 44042.4). Total num frames: 2681028608. Throughput: 0: 44079.5. Samples: 2583977440. Policy #0 lag: (min: 1.0, avg: 10.8, max: 22.0) [2024-06-28 06:04:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:04:11,023][06909] Updated weights for policy 0, policy_version 163643 (0.0031) [2024-06-28 06:04:12,735][06887] Signal inference workers to stop experience collection... (36750 times) [2024-06-28 06:04:12,774][06909] InferenceWorker_p0-w0: stopping experience collection (36750 times) [2024-06-28 06:04:12,851][06887] Signal inference workers to resume experience collection... (36750 times) [2024-06-28 06:04:12,852][06909] InferenceWorker_p0-w0: resuming experience collection (36750 times) [2024-06-28 06:04:13,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2681241600. Throughput: 0: 44232.7. Samples: 2584112460. Policy #0 lag: (min: 1.0, avg: 10.8, max: 22.0) [2024-06-28 06:04:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 06:04:14,991][06909] Updated weights for policy 0, policy_version 163653 (0.0027) [2024-06-28 06:04:18,767][06909] Updated weights for policy 0, policy_version 163663 (0.0037) [2024-06-28 06:04:18,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2681454592. Throughput: 0: 44265.8. Samples: 2584379060. Policy #0 lag: (min: 1.0, avg: 10.8, max: 22.0) [2024-06-28 06:04:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:04:22,334][06909] Updated weights for policy 0, policy_version 163673 (0.0032) [2024-06-28 06:04:23,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44509.8, 300 sec: 43986.9). Total num frames: 2681683968. Throughput: 0: 44104.4. Samples: 2584641020. Policy #0 lag: (min: 1.0, avg: 10.8, max: 22.0) [2024-06-28 06:04:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:04:25,976][06909] Updated weights for policy 0, policy_version 163683 (0.0035) [2024-06-28 06:04:28,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43964.0, 300 sec: 44097.9). Total num frames: 2681896960. Throughput: 0: 43973.3. Samples: 2584770240. Policy #0 lag: (min: 1.0, avg: 10.8, max: 22.0) [2024-06-28 06:04:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:04:29,967][06909] Updated weights for policy 0, policy_version 163693 (0.0038) [2024-06-28 06:04:33,211][06909] Updated weights for policy 0, policy_version 163703 (0.0042) [2024-06-28 06:04:33,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2682109952. Throughput: 0: 43872.9. Samples: 2585036840. Policy #0 lag: (min: 1.0, avg: 10.8, max: 22.0) [2024-06-28 06:04:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:04:37,283][06909] Updated weights for policy 0, policy_version 163713 (0.0030) [2024-06-28 06:04:38,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2682339328. Throughput: 0: 43793.8. Samples: 2585292780. Policy #0 lag: (min: 1.0, avg: 10.8, max: 22.0) [2024-06-28 06:04:38,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 06:04:40,682][06909] Updated weights for policy 0, policy_version 163723 (0.0030) [2024-06-28 06:04:43,852][06674] Fps is (10 sec: 44227.5, 60 sec: 43962.2, 300 sec: 44097.7). Total num frames: 2682552320. Throughput: 0: 43878.0. Samples: 2585430000. Policy #0 lag: (min: 1.0, avg: 10.8, max: 22.0) [2024-06-28 06:04:43,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:04:44,734][06909] Updated weights for policy 0, policy_version 163733 (0.0028) [2024-06-28 06:04:48,004][06909] Updated weights for policy 0, policy_version 163743 (0.0028) [2024-06-28 06:04:48,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2682765312. Throughput: 0: 44054.2. Samples: 2585700300. Policy #0 lag: (min: 1.0, avg: 10.8, max: 22.0) [2024-06-28 06:04:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:04:52,171][06909] Updated weights for policy 0, policy_version 163753 (0.0035) [2024-06-28 06:04:53,850][06674] Fps is (10 sec: 44245.6, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2682994688. Throughput: 0: 44193.3. Samples: 2585966140. Policy #0 lag: (min: 1.0, avg: 10.8, max: 22.0) [2024-06-28 06:04:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:04:55,647][06909] Updated weights for policy 0, policy_version 163763 (0.0026) [2024-06-28 06:04:58,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2683207680. Throughput: 0: 44057.4. Samples: 2586095040. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-28 06:04:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:04:59,283][06909] Updated weights for policy 0, policy_version 163773 (0.0038) [2024-06-28 06:05:02,996][06909] Updated weights for policy 0, policy_version 163783 (0.0040) [2024-06-28 06:05:03,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.6, 300 sec: 44042.9). Total num frames: 2683437056. Throughput: 0: 44093.7. Samples: 2586363280. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-28 06:05:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:05:07,078][06909] Updated weights for policy 0, policy_version 163793 (0.0034) [2024-06-28 06:05:08,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 2683650048. Throughput: 0: 44043.0. Samples: 2586622960. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-28 06:05:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:05:10,264][06909] Updated weights for policy 0, policy_version 163803 (0.0042) [2024-06-28 06:05:13,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 2683879424. Throughput: 0: 44136.0. Samples: 2586756360. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-28 06:05:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:05:14,394][06909] Updated weights for policy 0, policy_version 163813 (0.0031) [2024-06-28 06:05:18,031][06909] Updated weights for policy 0, policy_version 163823 (0.0028) [2024-06-28 06:05:18,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2684108800. Throughput: 0: 44087.5. Samples: 2587020780. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-28 06:05:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:05:21,649][06909] Updated weights for policy 0, policy_version 163833 (0.0037) [2024-06-28 06:05:23,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 2684305408. Throughput: 0: 44348.8. Samples: 2587288480. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-28 06:05:23,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:05:24,113][06887] Signal inference workers to stop experience collection... (36800 times) [2024-06-28 06:05:24,154][06909] InferenceWorker_p0-w0: stopping experience collection (36800 times) [2024-06-28 06:05:24,162][06887] Signal inference workers to resume experience collection... (36800 times) [2024-06-28 06:05:24,167][06909] InferenceWorker_p0-w0: resuming experience collection (36800 times) [2024-06-28 06:05:25,358][06909] Updated weights for policy 0, policy_version 163843 (0.0028) [2024-06-28 06:05:28,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2684551168. Throughput: 0: 44254.4. Samples: 2587421360. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-28 06:05:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:05:28,981][06909] Updated weights for policy 0, policy_version 163853 (0.0035) [2024-06-28 06:05:32,954][06909] Updated weights for policy 0, policy_version 163863 (0.0045) [2024-06-28 06:05:33,852][06674] Fps is (10 sec: 45866.2, 60 sec: 44235.2, 300 sec: 44042.1). Total num frames: 2684764160. Throughput: 0: 44163.7. Samples: 2587687760. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-28 06:05:33,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:05:36,692][06909] Updated weights for policy 0, policy_version 163873 (0.0031) [2024-06-28 06:05:38,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2684977152. Throughput: 0: 44166.2. Samples: 2587953620. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-28 06:05:38,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:05:40,260][06909] Updated weights for policy 0, policy_version 163883 (0.0025) [2024-06-28 06:05:43,850][06674] Fps is (10 sec: 44245.7, 60 sec: 44238.3, 300 sec: 44098.8). Total num frames: 2685206528. Throughput: 0: 44060.4. Samples: 2588077760. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-28 06:05:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:05:44,039][06909] Updated weights for policy 0, policy_version 163893 (0.0035) [2024-06-28 06:05:47,575][06909] Updated weights for policy 0, policy_version 163903 (0.0031) [2024-06-28 06:05:48,852][06674] Fps is (10 sec: 44228.1, 60 sec: 44235.3, 300 sec: 44042.1). Total num frames: 2685419520. Throughput: 0: 43833.7. Samples: 2588335880. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-28 06:05:48,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:05:48,872][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000163905_2685419520.pth... [2024-06-28 06:05:48,942][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000163260_2674851840.pth [2024-06-28 06:05:51,600][06909] Updated weights for policy 0, policy_version 163913 (0.0052) [2024-06-28 06:05:53,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 2685616128. Throughput: 0: 44077.9. Samples: 2588606460. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-28 06:05:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:05:55,072][06909] Updated weights for policy 0, policy_version 163923 (0.0035) [2024-06-28 06:05:58,850][06674] Fps is (10 sec: 44246.2, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2685861888. Throughput: 0: 44085.0. Samples: 2588740180. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-28 06:05:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:05:58,876][06909] Updated weights for policy 0, policy_version 163933 (0.0033) [2024-06-28 06:06:02,824][06909] Updated weights for policy 0, policy_version 163943 (0.0036) [2024-06-28 06:06:03,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2686074880. Throughput: 0: 44087.1. Samples: 2589004700. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-28 06:06:03,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:06:06,267][06909] Updated weights for policy 0, policy_version 163953 (0.0033) [2024-06-28 06:06:08,853][06674] Fps is (10 sec: 42585.2, 60 sec: 43961.6, 300 sec: 43986.4). Total num frames: 2686287872. Throughput: 0: 43974.9. Samples: 2589267480. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-28 06:06:08,853][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:06:10,310][06909] Updated weights for policy 0, policy_version 163963 (0.0040) [2024-06-28 06:06:13,694][06909] Updated weights for policy 0, policy_version 163973 (0.0040) [2024-06-28 06:06:13,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2686533632. Throughput: 0: 43941.3. Samples: 2589398720. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-28 06:06:13,854][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:06:17,941][06909] Updated weights for policy 0, policy_version 163983 (0.0023) [2024-06-28 06:06:18,850][06674] Fps is (10 sec: 44250.3, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2686730240. Throughput: 0: 43826.9. Samples: 2589659880. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-28 06:06:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:06:21,176][06909] Updated weights for policy 0, policy_version 163993 (0.0037) [2024-06-28 06:06:23,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43963.9, 300 sec: 43931.4). Total num frames: 2686943232. Throughput: 0: 43822.8. Samples: 2589925640. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-28 06:06:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:06:25,260][06909] Updated weights for policy 0, policy_version 164003 (0.0040) [2024-06-28 06:06:28,637][06909] Updated weights for policy 0, policy_version 164013 (0.0036) [2024-06-28 06:06:28,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2687188992. Throughput: 0: 43974.7. Samples: 2590056620. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-28 06:06:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:06:32,739][06909] Updated weights for policy 0, policy_version 164023 (0.0032) [2024-06-28 06:06:33,852][06674] Fps is (10 sec: 44227.4, 60 sec: 43690.7, 300 sec: 43986.6). Total num frames: 2687385600. Throughput: 0: 44098.2. Samples: 2590320300. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-28 06:06:33,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:06:36,103][06909] Updated weights for policy 0, policy_version 164033 (0.0031) [2024-06-28 06:06:38,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2687614976. Throughput: 0: 44053.3. Samples: 2590588860. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-28 06:06:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:06:40,258][06909] Updated weights for policy 0, policy_version 164043 (0.0029) [2024-06-28 06:06:43,457][06909] Updated weights for policy 0, policy_version 164053 (0.0035) [2024-06-28 06:06:43,850][06674] Fps is (10 sec: 47523.1, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2687860736. Throughput: 0: 43977.7. Samples: 2590719180. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-28 06:06:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:06:48,166][06909] Updated weights for policy 0, policy_version 164063 (0.0044) [2024-06-28 06:06:48,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43692.1, 300 sec: 44042.4). Total num frames: 2688040960. Throughput: 0: 43848.9. Samples: 2590977900. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-28 06:06:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:06:50,904][06909] Updated weights for policy 0, policy_version 164073 (0.0036) [2024-06-28 06:06:53,850][06674] Fps is (10 sec: 39321.9, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 2688253952. Throughput: 0: 43891.9. Samples: 2591242480. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-28 06:06:53,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 06:06:54,525][06887] Signal inference workers to stop experience collection... (36850 times) [2024-06-28 06:06:54,526][06887] Signal inference workers to resume experience collection... (36850 times) [2024-06-28 06:06:54,566][06909] InferenceWorker_p0-w0: stopping experience collection (36850 times) [2024-06-28 06:06:54,572][06909] InferenceWorker_p0-w0: resuming experience collection (36850 times) [2024-06-28 06:06:55,445][06909] Updated weights for policy 0, policy_version 164083 (0.0038) [2024-06-28 06:06:58,271][06909] Updated weights for policy 0, policy_version 164093 (0.0026) [2024-06-28 06:06:58,850][06674] Fps is (10 sec: 47513.9, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2688516096. Throughput: 0: 43918.7. Samples: 2591375060. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-28 06:06:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:07:02,669][06909] Updated weights for policy 0, policy_version 164103 (0.0035) [2024-06-28 06:07:03,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2688696320. Throughput: 0: 43902.2. Samples: 2591635480. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-28 06:07:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:07:05,844][06909] Updated weights for policy 0, policy_version 164113 (0.0035) [2024-06-28 06:07:08,850][06674] Fps is (10 sec: 40959.1, 60 sec: 43965.8, 300 sec: 44043.0). Total num frames: 2688925696. Throughput: 0: 43918.4. Samples: 2591901980. Policy #0 lag: (min: 0.0, avg: 12.0, max: 22.0) [2024-06-28 06:07:08,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:07:10,130][06909] Updated weights for policy 0, policy_version 164123 (0.0035) [2024-06-28 06:07:13,569][06909] Updated weights for policy 0, policy_version 164133 (0.0029) [2024-06-28 06:07:13,850][06674] Fps is (10 sec: 47513.8, 60 sec: 43963.8, 300 sec: 44209.1). Total num frames: 2689171456. Throughput: 0: 43930.7. Samples: 2592033500. Policy #0 lag: (min: 0.0, avg: 12.0, max: 22.0) [2024-06-28 06:07:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:07:17,604][06909] Updated weights for policy 0, policy_version 164143 (0.0025) [2024-06-28 06:07:18,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2689368064. Throughput: 0: 43975.3. Samples: 2592299100. Policy #0 lag: (min: 0.0, avg: 12.0, max: 22.0) [2024-06-28 06:07:18,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:07:20,705][06909] Updated weights for policy 0, policy_version 164153 (0.0026) [2024-06-28 06:07:23,850][06674] Fps is (10 sec: 42598.0, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2689597440. Throughput: 0: 43702.6. Samples: 2592555480. Policy #0 lag: (min: 0.0, avg: 12.0, max: 22.0) [2024-06-28 06:07:23,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:07:25,022][06909] Updated weights for policy 0, policy_version 164163 (0.0035) [2024-06-28 06:07:27,986][06909] Updated weights for policy 0, policy_version 164173 (0.0040) [2024-06-28 06:07:28,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2689826816. Throughput: 0: 44015.6. Samples: 2592699880. Policy #0 lag: (min: 0.0, avg: 12.0, max: 22.0) [2024-06-28 06:07:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:07:32,374][06909] Updated weights for policy 0, policy_version 164183 (0.0036) [2024-06-28 06:07:33,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43965.2, 300 sec: 43986.9). Total num frames: 2690023424. Throughput: 0: 43992.9. Samples: 2592957580. Policy #0 lag: (min: 0.0, avg: 12.0, max: 22.0) [2024-06-28 06:07:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:07:35,504][06909] Updated weights for policy 0, policy_version 164193 (0.0031) [2024-06-28 06:07:38,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2690252800. Throughput: 0: 44041.4. Samples: 2593224340. Policy #0 lag: (min: 0.0, avg: 12.0, max: 22.0) [2024-06-28 06:07:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:07:39,535][06909] Updated weights for policy 0, policy_version 164203 (0.0035) [2024-06-28 06:07:42,878][06909] Updated weights for policy 0, policy_version 164213 (0.0033) [2024-06-28 06:07:43,850][06674] Fps is (10 sec: 47513.4, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2690498560. Throughput: 0: 44085.2. Samples: 2593358900. Policy #0 lag: (min: 0.0, avg: 12.0, max: 22.0) [2024-06-28 06:07:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:07:47,148][06909] Updated weights for policy 0, policy_version 164223 (0.0032) [2024-06-28 06:07:48,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2690695168. Throughput: 0: 44094.6. Samples: 2593619740. Policy #0 lag: (min: 0.0, avg: 12.0, max: 22.0) [2024-06-28 06:07:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:07:48,857][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000164227_2690695168.pth... [2024-06-28 06:07:48,918][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000163584_2680160256.pth [2024-06-28 06:07:50,241][06909] Updated weights for policy 0, policy_version 164233 (0.0037) [2024-06-28 06:07:53,850][06674] Fps is (10 sec: 40960.4, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2690908160. Throughput: 0: 44018.9. Samples: 2593882820. Policy #0 lag: (min: 0.0, avg: 12.0, max: 22.0) [2024-06-28 06:07:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:07:54,712][06909] Updated weights for policy 0, policy_version 164243 (0.0038) [2024-06-28 06:07:58,119][06909] Updated weights for policy 0, policy_version 164253 (0.0037) [2024-06-28 06:07:58,850][06674] Fps is (10 sec: 45875.9, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2691153920. Throughput: 0: 44108.5. Samples: 2594018380. Policy #0 lag: (min: 0.0, avg: 12.0, max: 22.0) [2024-06-28 06:07:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:08:01,941][06909] Updated weights for policy 0, policy_version 164263 (0.0037) [2024-06-28 06:08:03,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2691350528. Throughput: 0: 43980.5. Samples: 2594278220. Policy #0 lag: (min: 0.0, avg: 12.0, max: 22.0) [2024-06-28 06:08:03,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 06:08:05,376][06909] Updated weights for policy 0, policy_version 164273 (0.0034) [2024-06-28 06:08:08,850][06674] Fps is (10 sec: 42598.2, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 2691579904. Throughput: 0: 44221.4. Samples: 2594545440. Policy #0 lag: (min: 0.0, avg: 12.0, max: 22.0) [2024-06-28 06:08:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:08:09,330][06909] Updated weights for policy 0, policy_version 164283 (0.0031) [2024-06-28 06:08:12,920][06909] Updated weights for policy 0, policy_version 164293 (0.0030) [2024-06-28 06:08:13,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2691809280. Throughput: 0: 44076.0. Samples: 2594683300. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 06:08:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:08:16,935][06909] Updated weights for policy 0, policy_version 164303 (0.0038) [2024-06-28 06:08:18,850][06674] Fps is (10 sec: 42597.6, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 2692005888. Throughput: 0: 44131.9. Samples: 2594943520. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 06:08:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 06:08:18,961][06887] Signal inference workers to stop experience collection... (36900 times) [2024-06-28 06:08:19,010][06887] Signal inference workers to resume experience collection... (36900 times) [2024-06-28 06:08:19,010][06909] InferenceWorker_p0-w0: stopping experience collection (36900 times) [2024-06-28 06:08:19,037][06909] InferenceWorker_p0-w0: resuming experience collection (36900 times) [2024-06-28 06:08:20,151][06909] Updated weights for policy 0, policy_version 164313 (0.0029) [2024-06-28 06:08:23,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2692235264. Throughput: 0: 44080.4. Samples: 2595207960. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 06:08:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:08:24,246][06909] Updated weights for policy 0, policy_version 164323 (0.0024) [2024-06-28 06:08:27,340][06909] Updated weights for policy 0, policy_version 164333 (0.0029) [2024-06-28 06:08:28,850][06674] Fps is (10 sec: 45876.2, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2692464640. Throughput: 0: 44094.3. Samples: 2595343140. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 06:08:28,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:08:31,785][06909] Updated weights for policy 0, policy_version 164343 (0.0052) [2024-06-28 06:08:33,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2692677632. Throughput: 0: 44210.3. Samples: 2595609200. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 06:08:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:08:34,674][06909] Updated weights for policy 0, policy_version 164353 (0.0033) [2024-06-28 06:08:38,850][06674] Fps is (10 sec: 44236.2, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2692907008. Throughput: 0: 44140.8. Samples: 2595869160. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 06:08:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:08:39,115][06909] Updated weights for policy 0, policy_version 164363 (0.0025) [2024-06-28 06:08:42,265][06909] Updated weights for policy 0, policy_version 164373 (0.0028) [2024-06-28 06:08:43,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2693136384. Throughput: 0: 44192.0. Samples: 2596007020. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 06:08:43,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 06:08:46,288][06909] Updated weights for policy 0, policy_version 164383 (0.0026) [2024-06-28 06:08:48,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2693332992. Throughput: 0: 44173.7. Samples: 2596266040. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 06:08:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:08:50,287][06909] Updated weights for policy 0, policy_version 164393 (0.0026) [2024-06-28 06:08:53,850][06674] Fps is (10 sec: 42598.1, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2693562368. Throughput: 0: 44159.1. Samples: 2596532600. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 06:08:53,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:08:53,872][06909] Updated weights for policy 0, policy_version 164403 (0.0031) [2024-06-28 06:08:57,417][06909] Updated weights for policy 0, policy_version 164413 (0.0022) [2024-06-28 06:08:58,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 2693791744. Throughput: 0: 43987.0. Samples: 2596662720. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 06:08:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:09:01,435][06909] Updated weights for policy 0, policy_version 164423 (0.0035) [2024-06-28 06:09:03,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 2694021120. Throughput: 0: 44341.5. Samples: 2596938880. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 06:09:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:09:04,581][06909] Updated weights for policy 0, policy_version 164433 (0.0031) [2024-06-28 06:09:08,585][06909] Updated weights for policy 0, policy_version 164443 (0.0039) [2024-06-28 06:09:08,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2694234112. Throughput: 0: 44354.6. Samples: 2597203920. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 06:09:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:09:12,069][06909] Updated weights for policy 0, policy_version 164453 (0.0035) [2024-06-28 06:09:13,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2694463488. Throughput: 0: 44095.0. Samples: 2597327420. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 06:09:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:09:16,308][06909] Updated weights for policy 0, policy_version 164463 (0.0039) [2024-06-28 06:09:18,850][06674] Fps is (10 sec: 45875.7, 60 sec: 44783.0, 300 sec: 44097.9). Total num frames: 2694692864. Throughput: 0: 44206.2. Samples: 2597598480. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 06:09:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:09:19,505][06909] Updated weights for policy 0, policy_version 164473 (0.0036) [2024-06-28 06:09:23,528][06909] Updated weights for policy 0, policy_version 164483 (0.0036) [2024-06-28 06:09:23,850][06674] Fps is (10 sec: 42598.7, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2694889472. Throughput: 0: 44325.0. Samples: 2597863780. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 06:09:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:09:26,979][06909] Updated weights for policy 0, policy_version 164493 (0.0028) [2024-06-28 06:09:28,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2695118848. Throughput: 0: 44139.1. Samples: 2597993280. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 06:09:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:09:30,794][06909] Updated weights for policy 0, policy_version 164503 (0.0041) [2024-06-28 06:09:31,207][06887] Signal inference workers to stop experience collection... (36950 times) [2024-06-28 06:09:31,243][06909] InferenceWorker_p0-w0: stopping experience collection (36950 times) [2024-06-28 06:09:31,257][06887] Signal inference workers to resume experience collection... (36950 times) [2024-06-28 06:09:31,263][06909] InferenceWorker_p0-w0: resuming experience collection (36950 times) [2024-06-28 06:09:33,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44509.8, 300 sec: 44098.0). Total num frames: 2695348224. Throughput: 0: 44364.9. Samples: 2598262460. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 06:09:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:09:34,685][06909] Updated weights for policy 0, policy_version 164513 (0.0030) [2024-06-28 06:09:38,274][06909] Updated weights for policy 0, policy_version 164523 (0.0030) [2024-06-28 06:09:38,850][06674] Fps is (10 sec: 44236.1, 60 sec: 44236.7, 300 sec: 44098.2). Total num frames: 2695561216. Throughput: 0: 44390.5. Samples: 2598530180. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 06:09:38,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:09:41,848][06909] Updated weights for policy 0, policy_version 164533 (0.0041) [2024-06-28 06:09:43,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2695774208. Throughput: 0: 44363.1. Samples: 2598659060. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 06:09:43,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:09:45,645][06909] Updated weights for policy 0, policy_version 164543 (0.0034) [2024-06-28 06:09:48,850][06674] Fps is (10 sec: 45876.1, 60 sec: 44783.0, 300 sec: 44153.5). Total num frames: 2696019968. Throughput: 0: 44155.1. Samples: 2598925860. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 06:09:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:09:48,860][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000164552_2696019968.pth... [2024-06-28 06:09:48,918][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000163905_2685419520.pth [2024-06-28 06:09:49,084][06909] Updated weights for policy 0, policy_version 164553 (0.0035) [2024-06-28 06:09:53,134][06909] Updated weights for policy 0, policy_version 164563 (0.0045) [2024-06-28 06:09:53,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2696216576. Throughput: 0: 44145.5. Samples: 2599190460. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 06:09:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:09:56,747][06909] Updated weights for policy 0, policy_version 164573 (0.0032) [2024-06-28 06:09:58,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2696429568. Throughput: 0: 44270.2. Samples: 2599319580. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 06:09:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:10:00,586][06909] Updated weights for policy 0, policy_version 164583 (0.0036) [2024-06-28 06:10:03,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2696675328. Throughput: 0: 44196.9. Samples: 2599587340. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 06:10:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:10:04,038][06909] Updated weights for policy 0, policy_version 164593 (0.0042) [2024-06-28 06:10:08,111][06909] Updated weights for policy 0, policy_version 164603 (0.0037) [2024-06-28 06:10:08,850][06674] Fps is (10 sec: 45875.9, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2696888320. Throughput: 0: 44127.1. Samples: 2599849500. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 06:10:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:10:11,723][06909] Updated weights for policy 0, policy_version 164613 (0.0033) [2024-06-28 06:10:13,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 2697084928. Throughput: 0: 44184.4. Samples: 2599981580. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 06:10:13,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:10:15,512][06909] Updated weights for policy 0, policy_version 164623 (0.0033) [2024-06-28 06:10:18,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2697330688. Throughput: 0: 44046.7. Samples: 2600244560. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 06:10:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:10:18,980][06909] Updated weights for policy 0, policy_version 164633 (0.0036) [2024-06-28 06:10:23,067][06909] Updated weights for policy 0, policy_version 164643 (0.0044) [2024-06-28 06:10:23,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2697527296. Throughput: 0: 44073.5. Samples: 2600513480. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 06:10:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:10:26,394][06909] Updated weights for policy 0, policy_version 164653 (0.0030) [2024-06-28 06:10:28,850][06674] Fps is (10 sec: 42597.6, 60 sec: 43963.6, 300 sec: 44042.7). Total num frames: 2697756672. Throughput: 0: 44037.7. Samples: 2600640760. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 06:10:28,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:10:30,487][06909] Updated weights for policy 0, policy_version 164663 (0.0027) [2024-06-28 06:10:33,720][06909] Updated weights for policy 0, policy_version 164673 (0.0026) [2024-06-28 06:10:33,851][06674] Fps is (10 sec: 47510.3, 60 sec: 44236.3, 300 sec: 44153.4). Total num frames: 2698002432. Throughput: 0: 43965.0. Samples: 2600904320. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 06:10:33,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:10:37,939][06909] Updated weights for policy 0, policy_version 164683 (0.0031) [2024-06-28 06:10:38,850][06674] Fps is (10 sec: 45875.7, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2698215424. Throughput: 0: 43998.1. Samples: 2601170380. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 06:10:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:10:41,301][06909] Updated weights for policy 0, policy_version 164693 (0.0046) [2024-06-28 06:10:43,850][06674] Fps is (10 sec: 40963.0, 60 sec: 43963.8, 300 sec: 44042.7). Total num frames: 2698412032. Throughput: 0: 43985.0. Samples: 2601298900. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 06:10:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:10:45,314][06909] Updated weights for policy 0, policy_version 164703 (0.0032) [2024-06-28 06:10:48,695][06909] Updated weights for policy 0, policy_version 164713 (0.0042) [2024-06-28 06:10:48,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 2698657792. Throughput: 0: 43896.3. Samples: 2601562680. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 06:10:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:10:49,186][06887] Signal inference workers to stop experience collection... (37000 times) [2024-06-28 06:10:49,187][06887] Signal inference workers to resume experience collection... (37000 times) [2024-06-28 06:10:49,222][06909] InferenceWorker_p0-w0: stopping experience collection (37000 times) [2024-06-28 06:10:49,222][06909] InferenceWorker_p0-w0: resuming experience collection (37000 times) [2024-06-28 06:10:52,783][06909] Updated weights for policy 0, policy_version 164723 (0.0049) [2024-06-28 06:10:53,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2698870784. Throughput: 0: 43948.2. Samples: 2601827180. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 06:10:53,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:10:55,894][06909] Updated weights for policy 0, policy_version 164733 (0.0024) [2024-06-28 06:10:58,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2699067392. Throughput: 0: 43955.6. Samples: 2601959580. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 06:10:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:11:00,188][06909] Updated weights for policy 0, policy_version 164743 (0.0030) [2024-06-28 06:11:03,368][06909] Updated weights for policy 0, policy_version 164753 (0.0031) [2024-06-28 06:11:03,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.7, 300 sec: 44153.9). Total num frames: 2699313152. Throughput: 0: 43908.8. Samples: 2602220460. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 06:11:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:11:08,037][06909] Updated weights for policy 0, policy_version 164763 (0.0030) [2024-06-28 06:11:08,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2699526144. Throughput: 0: 43984.5. Samples: 2602492780. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 06:11:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:11:10,845][06909] Updated weights for policy 0, policy_version 164773 (0.0047) [2024-06-28 06:11:13,850][06674] Fps is (10 sec: 42598.0, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2699739136. Throughput: 0: 43956.9. Samples: 2602618820. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 06:11:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:11:15,230][06909] Updated weights for policy 0, policy_version 164783 (0.0036) [2024-06-28 06:11:18,370][06909] Updated weights for policy 0, policy_version 164793 (0.0027) [2024-06-28 06:11:18,852][06674] Fps is (10 sec: 44228.7, 60 sec: 43962.4, 300 sec: 44153.2). Total num frames: 2699968512. Throughput: 0: 43969.7. Samples: 2602883000. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 06:11:18,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:11:22,831][06909] Updated weights for policy 0, policy_version 164803 (0.0042) [2024-06-28 06:11:23,850][06674] Fps is (10 sec: 44237.5, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2700181504. Throughput: 0: 44153.0. Samples: 2603157260. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 06:11:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:11:25,990][06909] Updated weights for policy 0, policy_version 164813 (0.0034) [2024-06-28 06:11:28,850][06674] Fps is (10 sec: 40966.7, 60 sec: 43690.7, 300 sec: 44042.7). Total num frames: 2700378112. Throughput: 0: 43900.3. Samples: 2603274420. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-28 06:11:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:11:30,419][06909] Updated weights for policy 0, policy_version 164823 (0.0031) [2024-06-28 06:11:33,167][06909] Updated weights for policy 0, policy_version 164833 (0.0025) [2024-06-28 06:11:33,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43964.3, 300 sec: 44153.5). Total num frames: 2700640256. Throughput: 0: 43924.9. Samples: 2603539300. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-28 06:11:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:11:37,870][06909] Updated weights for policy 0, policy_version 164843 (0.0021) [2024-06-28 06:11:38,850][06674] Fps is (10 sec: 45875.9, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2700836864. Throughput: 0: 44124.6. Samples: 2603812780. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-28 06:11:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:11:40,650][06909] Updated weights for policy 0, policy_version 164853 (0.0042) [2024-06-28 06:11:43,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2701049856. Throughput: 0: 43832.4. Samples: 2603932040. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-28 06:11:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:11:45,414][06909] Updated weights for policy 0, policy_version 164863 (0.0027) [2024-06-28 06:11:48,241][06909] Updated weights for policy 0, policy_version 164873 (0.0032) [2024-06-28 06:11:48,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 2701279232. Throughput: 0: 44041.7. Samples: 2604202340. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-28 06:11:48,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 06:11:48,949][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000164874_2701295616.pth... [2024-06-28 06:11:49,003][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000164227_2690695168.pth [2024-06-28 06:11:52,797][06909] Updated weights for policy 0, policy_version 164883 (0.0031) [2024-06-28 06:11:53,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2701508608. Throughput: 0: 43935.0. Samples: 2604469860. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-28 06:11:53,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:11:55,539][06909] Updated weights for policy 0, policy_version 164893 (0.0027) [2024-06-28 06:11:58,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2701705216. Throughput: 0: 43941.5. Samples: 2604596180. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-28 06:11:58,850][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 06:12:00,196][06909] Updated weights for policy 0, policy_version 164903 (0.0035) [2024-06-28 06:12:03,330][06909] Updated weights for policy 0, policy_version 164913 (0.0028) [2024-06-28 06:12:03,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44236.8, 300 sec: 44209.1). Total num frames: 2701967360. Throughput: 0: 43900.4. Samples: 2604858440. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-28 06:12:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:12:07,990][06909] Updated weights for policy 0, policy_version 164923 (0.0025) [2024-06-28 06:12:08,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2702163968. Throughput: 0: 43785.4. Samples: 2605127600. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-28 06:12:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:12:08,940][06887] Signal inference workers to stop experience collection... (37050 times) [2024-06-28 06:12:08,940][06887] Signal inference workers to resume experience collection... (37050 times) [2024-06-28 06:12:08,996][06909] InferenceWorker_p0-w0: stopping experience collection (37050 times) [2024-06-28 06:12:08,996][06909] InferenceWorker_p0-w0: resuming experience collection (37050 times) [2024-06-28 06:12:10,519][06909] Updated weights for policy 0, policy_version 164933 (0.0037) [2024-06-28 06:12:13,850][06674] Fps is (10 sec: 37682.8, 60 sec: 43417.6, 300 sec: 43986.9). Total num frames: 2702344192. Throughput: 0: 43968.0. Samples: 2605252980. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-28 06:12:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:12:15,298][06909] Updated weights for policy 0, policy_version 164943 (0.0024) [2024-06-28 06:12:17,929][06909] Updated weights for policy 0, policy_version 164953 (0.0028) [2024-06-28 06:12:18,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44238.2, 300 sec: 44153.5). Total num frames: 2702622720. Throughput: 0: 43910.8. Samples: 2605515280. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-28 06:12:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:12:22,544][06909] Updated weights for policy 0, policy_version 164963 (0.0034) [2024-06-28 06:12:23,850][06674] Fps is (10 sec: 49151.4, 60 sec: 44236.6, 300 sec: 44097.9). Total num frames: 2702835712. Throughput: 0: 43769.5. Samples: 2605782420. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-28 06:12:23,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:12:25,534][06909] Updated weights for policy 0, policy_version 164973 (0.0027) [2024-06-28 06:12:28,850][06674] Fps is (10 sec: 40959.4, 60 sec: 44236.9, 300 sec: 44097.9). Total num frames: 2703032320. Throughput: 0: 43997.7. Samples: 2605911940. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-28 06:12:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:12:29,891][06909] Updated weights for policy 0, policy_version 164983 (0.0033) [2024-06-28 06:12:32,773][06909] Updated weights for policy 0, policy_version 164993 (0.0034) [2024-06-28 06:12:33,852][06674] Fps is (10 sec: 44228.9, 60 sec: 43962.3, 300 sec: 44153.2). Total num frames: 2703278080. Throughput: 0: 43922.1. Samples: 2606178920. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-28 06:12:33,852][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 06:12:37,755][06909] Updated weights for policy 0, policy_version 165003 (0.0040) [2024-06-28 06:12:38,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2703474688. Throughput: 0: 43870.2. Samples: 2606444020. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-28 06:12:38,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:12:40,316][06909] Updated weights for policy 0, policy_version 165013 (0.0044) [2024-06-28 06:12:43,850][06674] Fps is (10 sec: 40968.1, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2703687680. Throughput: 0: 43799.0. Samples: 2606567140. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-28 06:12:43,853][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:12:45,359][06909] Updated weights for policy 0, policy_version 165023 (0.0029) [2024-06-28 06:12:47,978][06909] Updated weights for policy 0, policy_version 165033 (0.0038) [2024-06-28 06:12:48,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43963.9, 300 sec: 44098.0). Total num frames: 2703917056. Throughput: 0: 43608.5. Samples: 2606820820. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-28 06:12:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:12:52,785][06909] Updated weights for policy 0, policy_version 165043 (0.0027) [2024-06-28 06:12:53,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2704146432. Throughput: 0: 43689.7. Samples: 2607093640. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-28 06:12:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:12:55,505][06909] Updated weights for policy 0, policy_version 165053 (0.0040) [2024-06-28 06:12:58,850][06674] Fps is (10 sec: 42597.1, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 2704343040. Throughput: 0: 43659.0. Samples: 2607217640. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-28 06:12:58,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:13:00,006][06909] Updated weights for policy 0, policy_version 165063 (0.0031) [2024-06-28 06:13:03,215][06909] Updated weights for policy 0, policy_version 165073 (0.0025) [2024-06-28 06:13:03,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 2704588800. Throughput: 0: 43820.8. Samples: 2607487220. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-28 06:13:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:13:07,404][06909] Updated weights for policy 0, policy_version 165083 (0.0039) [2024-06-28 06:13:08,852][06674] Fps is (10 sec: 45866.7, 60 sec: 43962.2, 300 sec: 44042.1). Total num frames: 2704801792. Throughput: 0: 44014.7. Samples: 2607763160. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-28 06:13:08,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:13:10,374][06909] Updated weights for policy 0, policy_version 165093 (0.0030) [2024-06-28 06:13:13,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44510.0, 300 sec: 44098.0). Total num frames: 2705014784. Throughput: 0: 43779.2. Samples: 2607882000. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-28 06:13:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:13:15,338][06909] Updated weights for policy 0, policy_version 165103 (0.0033) [2024-06-28 06:13:17,863][06909] Updated weights for policy 0, policy_version 165113 (0.0035) [2024-06-28 06:13:18,850][06674] Fps is (10 sec: 45884.8, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2705260544. Throughput: 0: 43791.3. Samples: 2608149440. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-28 06:13:18,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 06:13:22,625][06909] Updated weights for policy 0, policy_version 165123 (0.0037) [2024-06-28 06:13:23,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43690.8, 300 sec: 44042.4). Total num frames: 2705457152. Throughput: 0: 43635.6. Samples: 2608407620. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-28 06:13:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:13:25,237][06909] Updated weights for policy 0, policy_version 165133 (0.0028) [2024-06-28 06:13:28,850][06674] Fps is (10 sec: 39321.3, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2705653760. Throughput: 0: 43718.2. Samples: 2608534460. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-28 06:13:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:13:29,887][06909] Updated weights for policy 0, policy_version 165143 (0.0039) [2024-06-28 06:13:30,450][06887] Signal inference workers to stop experience collection... (37100 times) [2024-06-28 06:13:30,450][06887] Signal inference workers to resume experience collection... (37100 times) [2024-06-28 06:13:30,470][06909] InferenceWorker_p0-w0: stopping experience collection (37100 times) [2024-06-28 06:13:30,470][06909] InferenceWorker_p0-w0: resuming experience collection (37100 times) [2024-06-28 06:13:32,829][06909] Updated weights for policy 0, policy_version 165153 (0.0034) [2024-06-28 06:13:33,852][06674] Fps is (10 sec: 45866.0, 60 sec: 43963.7, 300 sec: 44097.7). Total num frames: 2705915904. Throughput: 0: 44021.4. Samples: 2608801880. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-28 06:13:33,853][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 06:13:37,175][06909] Updated weights for policy 0, policy_version 165163 (0.0036) [2024-06-28 06:13:38,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2706112512. Throughput: 0: 43974.6. Samples: 2609072500. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 06:13:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:13:40,498][06909] Updated weights for policy 0, policy_version 165173 (0.0031) [2024-06-28 06:13:43,850][06674] Fps is (10 sec: 39329.9, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2706309120. Throughput: 0: 43967.8. Samples: 2609196180. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 06:13:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:13:44,814][06909] Updated weights for policy 0, policy_version 165183 (0.0033) [2024-06-28 06:13:47,868][06909] Updated weights for policy 0, policy_version 165193 (0.0029) [2024-06-28 06:13:48,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2706571264. Throughput: 0: 43945.2. Samples: 2609464760. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 06:13:48,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 06:13:48,872][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000165196_2706571264.pth... [2024-06-28 06:13:48,918][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000164552_2696019968.pth [2024-06-28 06:13:52,261][06909] Updated weights for policy 0, policy_version 165203 (0.0031) [2024-06-28 06:13:53,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2706767872. Throughput: 0: 43702.0. Samples: 2609729660. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 06:13:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:13:55,098][06909] Updated weights for policy 0, policy_version 165213 (0.0025) [2024-06-28 06:13:58,850][06674] Fps is (10 sec: 39321.2, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 2706964480. Throughput: 0: 43892.7. Samples: 2609857180. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 06:13:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:13:59,865][06909] Updated weights for policy 0, policy_version 165223 (0.0038) [2024-06-28 06:14:02,809][06909] Updated weights for policy 0, policy_version 165233 (0.0033) [2024-06-28 06:14:03,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 2707210240. Throughput: 0: 43779.1. Samples: 2610119500. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 06:14:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:14:07,093][06909] Updated weights for policy 0, policy_version 165243 (0.0052) [2024-06-28 06:14:08,850][06674] Fps is (10 sec: 47514.6, 60 sec: 43965.3, 300 sec: 43986.9). Total num frames: 2707439616. Throughput: 0: 43988.5. Samples: 2610387100. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 06:14:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:14:10,049][06909] Updated weights for policy 0, policy_version 165253 (0.0030) [2024-06-28 06:14:13,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 2707636224. Throughput: 0: 44245.3. Samples: 2610525500. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 06:14:13,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:14:14,309][06909] Updated weights for policy 0, policy_version 165263 (0.0034) [2024-06-28 06:14:17,634][06909] Updated weights for policy 0, policy_version 165273 (0.0032) [2024-06-28 06:14:18,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43417.6, 300 sec: 43986.9). Total num frames: 2707865600. Throughput: 0: 44236.7. Samples: 2610792440. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 06:14:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:14:22,190][06909] Updated weights for policy 0, policy_version 165283 (0.0020) [2024-06-28 06:14:23,850][06674] Fps is (10 sec: 47513.7, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2708111360. Throughput: 0: 43989.4. Samples: 2611052020. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 06:14:23,851][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 06:14:24,927][06909] Updated weights for policy 0, policy_version 165293 (0.0032) [2024-06-28 06:14:28,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43690.8, 300 sec: 43820.3). Total num frames: 2708275200. Throughput: 0: 44185.8. Samples: 2611184540. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 06:14:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:14:29,480][06909] Updated weights for policy 0, policy_version 165303 (0.0034) [2024-06-28 06:14:32,431][06909] Updated weights for policy 0, policy_version 165313 (0.0032) [2024-06-28 06:14:33,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43692.1, 300 sec: 43986.9). Total num frames: 2708537344. Throughput: 0: 44027.9. Samples: 2611446020. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 06:14:33,851][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 06:14:36,878][06909] Updated weights for policy 0, policy_version 165323 (0.0035) [2024-06-28 06:14:38,852][06674] Fps is (10 sec: 49141.6, 60 sec: 44235.4, 300 sec: 44042.1). Total num frames: 2708766720. Throughput: 0: 43938.9. Samples: 2611707000. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 06:14:38,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:14:39,917][06909] Updated weights for policy 0, policy_version 165333 (0.0027) [2024-06-28 06:14:43,850][06674] Fps is (10 sec: 42598.7, 60 sec: 44236.7, 300 sec: 43875.8). Total num frames: 2708963328. Throughput: 0: 44099.2. Samples: 2611841640. Policy #0 lag: (min: 0.0, avg: 11.6, max: 21.0) [2024-06-28 06:14:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:14:44,231][06909] Updated weights for policy 0, policy_version 165343 (0.0024) [2024-06-28 06:14:47,577][06909] Updated weights for policy 0, policy_version 165353 (0.0034) [2024-06-28 06:14:48,850][06674] Fps is (10 sec: 40968.5, 60 sec: 43417.7, 300 sec: 43931.3). Total num frames: 2709176320. Throughput: 0: 44196.5. Samples: 2612108340. Policy #0 lag: (min: 0.0, avg: 11.6, max: 21.0) [2024-06-28 06:14:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:14:51,450][06909] Updated weights for policy 0, policy_version 165363 (0.0035) [2024-06-28 06:14:53,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2709422080. Throughput: 0: 44141.2. Samples: 2612373460. Policy #0 lag: (min: 0.0, avg: 11.6, max: 21.0) [2024-06-28 06:14:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:14:54,986][06909] Updated weights for policy 0, policy_version 165373 (0.0032) [2024-06-28 06:14:58,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.9, 300 sec: 43875.8). Total num frames: 2709618688. Throughput: 0: 44153.8. Samples: 2612512420. Policy #0 lag: (min: 0.0, avg: 11.6, max: 21.0) [2024-06-28 06:14:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:14:59,050][06909] Updated weights for policy 0, policy_version 165383 (0.0031) [2024-06-28 06:15:02,218][06909] Updated weights for policy 0, policy_version 165393 (0.0034) [2024-06-28 06:15:03,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 2709848064. Throughput: 0: 43958.7. Samples: 2612770580. Policy #0 lag: (min: 0.0, avg: 11.6, max: 21.0) [2024-06-28 06:15:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:15:06,104][06887] Signal inference workers to stop experience collection... (37150 times) [2024-06-28 06:15:06,105][06887] Signal inference workers to resume experience collection... (37150 times) [2024-06-28 06:15:06,119][06909] InferenceWorker_p0-w0: stopping experience collection (37150 times) [2024-06-28 06:15:06,119][06909] InferenceWorker_p0-w0: resuming experience collection (37150 times) [2024-06-28 06:15:06,466][06909] Updated weights for policy 0, policy_version 165403 (0.0041) [2024-06-28 06:15:08,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2710077440. Throughput: 0: 44028.5. Samples: 2613033300. Policy #0 lag: (min: 0.0, avg: 11.6, max: 21.0) [2024-06-28 06:15:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:15:09,635][06909] Updated weights for policy 0, policy_version 165413 (0.0033) [2024-06-28 06:15:13,835][06909] Updated weights for policy 0, policy_version 165423 (0.0034) [2024-06-28 06:15:13,852][06674] Fps is (10 sec: 44227.5, 60 sec: 44235.4, 300 sec: 43931.0). Total num frames: 2710290432. Throughput: 0: 44212.1. Samples: 2613174180. Policy #0 lag: (min: 0.0, avg: 11.6, max: 21.0) [2024-06-28 06:15:13,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:15:17,041][06909] Updated weights for policy 0, policy_version 165433 (0.0039) [2024-06-28 06:15:18,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 2710487040. Throughput: 0: 44163.2. Samples: 2613433360. Policy #0 lag: (min: 0.0, avg: 11.6, max: 21.0) [2024-06-28 06:15:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:15:21,074][06909] Updated weights for policy 0, policy_version 165443 (0.0040) [2024-06-28 06:15:23,850][06674] Fps is (10 sec: 45884.3, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2710749184. Throughput: 0: 44300.6. Samples: 2613700440. Policy #0 lag: (min: 0.0, avg: 11.6, max: 21.0) [2024-06-28 06:15:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:15:24,273][06909] Updated weights for policy 0, policy_version 165453 (0.0028) [2024-06-28 06:15:28,454][06909] Updated weights for policy 0, policy_version 165463 (0.0035) [2024-06-28 06:15:28,850][06674] Fps is (10 sec: 47514.2, 60 sec: 44782.9, 300 sec: 43931.5). Total num frames: 2710962176. Throughput: 0: 44439.2. Samples: 2613841400. Policy #0 lag: (min: 0.0, avg: 11.6, max: 21.0) [2024-06-28 06:15:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:15:31,938][06909] Updated weights for policy 0, policy_version 165473 (0.0037) [2024-06-28 06:15:33,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.9, 300 sec: 43931.3). Total num frames: 2711175168. Throughput: 0: 44416.0. Samples: 2614107060. Policy #0 lag: (min: 0.0, avg: 11.6, max: 21.0) [2024-06-28 06:15:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:15:35,722][06909] Updated weights for policy 0, policy_version 165483 (0.0032) [2024-06-28 06:15:38,852][06674] Fps is (10 sec: 45865.6, 60 sec: 44236.8, 300 sec: 44097.6). Total num frames: 2711420928. Throughput: 0: 44145.2. Samples: 2614360080. Policy #0 lag: (min: 0.0, avg: 11.6, max: 21.0) [2024-06-28 06:15:38,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:15:39,213][06909] Updated weights for policy 0, policy_version 165493 (0.0049) [2024-06-28 06:15:43,514][06909] Updated weights for policy 0, policy_version 165503 (0.0037) [2024-06-28 06:15:43,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 2711601152. Throughput: 0: 44132.9. Samples: 2614498400. Policy #0 lag: (min: 0.0, avg: 11.6, max: 21.0) [2024-06-28 06:15:43,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:15:46,878][06909] Updated weights for policy 0, policy_version 165513 (0.0023) [2024-06-28 06:15:48,853][06674] Fps is (10 sec: 39315.6, 60 sec: 43961.1, 300 sec: 43875.3). Total num frames: 2711814144. Throughput: 0: 44044.9. Samples: 2614752760. Policy #0 lag: (min: 0.0, avg: 10.9, max: 19.0) [2024-06-28 06:15:48,854][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 06:15:48,865][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000165516_2711814144.pth... [2024-06-28 06:15:48,925][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000164874_2701295616.pth [2024-06-28 06:15:50,883][06909] Updated weights for policy 0, policy_version 165523 (0.0032) [2024-06-28 06:15:53,850][06674] Fps is (10 sec: 47513.3, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2712076288. Throughput: 0: 44119.0. Samples: 2615018660. Policy #0 lag: (min: 0.0, avg: 10.9, max: 19.0) [2024-06-28 06:15:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:15:54,049][06909] Updated weights for policy 0, policy_version 165533 (0.0033) [2024-06-28 06:15:58,336][06909] Updated weights for policy 0, policy_version 165543 (0.0033) [2024-06-28 06:15:58,850][06674] Fps is (10 sec: 47530.8, 60 sec: 44509.9, 300 sec: 43986.9). Total num frames: 2712289280. Throughput: 0: 44217.6. Samples: 2615163880. Policy #0 lag: (min: 0.0, avg: 10.9, max: 19.0) [2024-06-28 06:15:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:16:01,462][06909] Updated weights for policy 0, policy_version 165553 (0.0027) [2024-06-28 06:16:03,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43963.6, 300 sec: 43931.3). Total num frames: 2712485888. Throughput: 0: 44394.7. Samples: 2615431120. Policy #0 lag: (min: 0.0, avg: 10.9, max: 19.0) [2024-06-28 06:16:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:16:05,464][06909] Updated weights for policy 0, policy_version 165563 (0.0029) [2024-06-28 06:16:08,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 2712731648. Throughput: 0: 44293.0. Samples: 2615693620. Policy #0 lag: (min: 0.0, avg: 10.9, max: 19.0) [2024-06-28 06:16:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:16:08,931][06909] Updated weights for policy 0, policy_version 165573 (0.0026) [2024-06-28 06:16:13,098][06909] Updated weights for policy 0, policy_version 165583 (0.0036) [2024-06-28 06:16:13,850][06674] Fps is (10 sec: 49151.1, 60 sec: 44784.3, 300 sec: 44098.2). Total num frames: 2712977408. Throughput: 0: 44277.5. Samples: 2615833900. Policy #0 lag: (min: 0.0, avg: 10.9, max: 19.0) [2024-06-28 06:16:13,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:16:16,164][06909] Updated weights for policy 0, policy_version 165593 (0.0033) [2024-06-28 06:16:18,850][06674] Fps is (10 sec: 40960.0, 60 sec: 44236.9, 300 sec: 43931.3). Total num frames: 2713141248. Throughput: 0: 44099.1. Samples: 2616091520. Policy #0 lag: (min: 0.0, avg: 10.9, max: 19.0) [2024-06-28 06:16:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:16:20,385][06909] Updated weights for policy 0, policy_version 165603 (0.0029) [2024-06-28 06:16:23,788][06909] Updated weights for policy 0, policy_version 165613 (0.0029) [2024-06-28 06:16:23,850][06674] Fps is (10 sec: 42599.4, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2713403392. Throughput: 0: 44111.8. Samples: 2616345020. Policy #0 lag: (min: 0.0, avg: 10.9, max: 19.0) [2024-06-28 06:16:23,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:16:27,710][06909] Updated weights for policy 0, policy_version 165623 (0.0021) [2024-06-28 06:16:28,850][06674] Fps is (10 sec: 47513.1, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 2713616384. Throughput: 0: 44347.5. Samples: 2616494040. Policy #0 lag: (min: 0.0, avg: 10.9, max: 19.0) [2024-06-28 06:16:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:16:30,907][06909] Updated weights for policy 0, policy_version 165633 (0.0031) [2024-06-28 06:16:33,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2713812992. Throughput: 0: 44551.2. Samples: 2616757400. Policy #0 lag: (min: 0.0, avg: 10.9, max: 19.0) [2024-06-28 06:16:33,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 06:16:35,264][06909] Updated weights for policy 0, policy_version 165643 (0.0036) [2024-06-28 06:16:36,829][06887] Signal inference workers to stop experience collection... (37200 times) [2024-06-28 06:16:36,880][06909] InferenceWorker_p0-w0: stopping experience collection (37200 times) [2024-06-28 06:16:36,888][06887] Signal inference workers to resume experience collection... (37200 times) [2024-06-28 06:16:36,897][06909] InferenceWorker_p0-w0: resuming experience collection (37200 times) [2024-06-28 06:16:38,384][06909] Updated weights for policy 0, policy_version 165653 (0.0036) [2024-06-28 06:16:38,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43965.3, 300 sec: 44098.0). Total num frames: 2714058752. Throughput: 0: 44353.5. Samples: 2617014560. Policy #0 lag: (min: 0.0, avg: 10.9, max: 19.0) [2024-06-28 06:16:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:16:42,474][06909] Updated weights for policy 0, policy_version 165663 (0.0038) [2024-06-28 06:16:43,850][06674] Fps is (10 sec: 49151.4, 60 sec: 45056.0, 300 sec: 44153.5). Total num frames: 2714304512. Throughput: 0: 44415.9. Samples: 2617162600. Policy #0 lag: (min: 0.0, avg: 10.9, max: 19.0) [2024-06-28 06:16:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:16:45,852][06909] Updated weights for policy 0, policy_version 165673 (0.0021) [2024-06-28 06:16:48,850][06674] Fps is (10 sec: 40959.6, 60 sec: 44239.4, 300 sec: 43931.3). Total num frames: 2714468352. Throughput: 0: 44356.9. Samples: 2617427180. Policy #0 lag: (min: 0.0, avg: 10.9, max: 19.0) [2024-06-28 06:16:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:16:49,820][06909] Updated weights for policy 0, policy_version 165683 (0.0027) [2024-06-28 06:16:53,420][06909] Updated weights for policy 0, policy_version 165693 (0.0040) [2024-06-28 06:16:53,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 2714714112. Throughput: 0: 44211.9. Samples: 2617683160. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2024-06-28 06:16:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:16:57,350][06909] Updated weights for policy 0, policy_version 165703 (0.0031) [2024-06-28 06:16:58,850][06674] Fps is (10 sec: 50789.9, 60 sec: 44782.8, 300 sec: 44097.9). Total num frames: 2714976256. Throughput: 0: 44169.0. Samples: 2617821500. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2024-06-28 06:16:58,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:17:00,847][06909] Updated weights for policy 0, policy_version 165713 (0.0039) [2024-06-28 06:17:03,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 2715123712. Throughput: 0: 44266.5. Samples: 2618083520. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2024-06-28 06:17:03,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:17:04,909][06909] Updated weights for policy 0, policy_version 165723 (0.0029) [2024-06-28 06:17:08,099][06909] Updated weights for policy 0, policy_version 165733 (0.0031) [2024-06-28 06:17:08,850][06674] Fps is (10 sec: 39322.2, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2715369472. Throughput: 0: 44203.1. Samples: 2618334160. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2024-06-28 06:17:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:17:12,381][06909] Updated weights for policy 0, policy_version 165743 (0.0026) [2024-06-28 06:17:13,850][06674] Fps is (10 sec: 50790.7, 60 sec: 44237.0, 300 sec: 44097.9). Total num frames: 2715631616. Throughput: 0: 44061.4. Samples: 2618476800. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2024-06-28 06:17:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:17:15,734][06909] Updated weights for policy 0, policy_version 165753 (0.0027) [2024-06-28 06:17:18,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44509.8, 300 sec: 43986.9). Total num frames: 2715811840. Throughput: 0: 44199.5. Samples: 2618746380. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2024-06-28 06:17:18,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:17:19,615][06909] Updated weights for policy 0, policy_version 165763 (0.0034) [2024-06-28 06:17:22,917][06909] Updated weights for policy 0, policy_version 165773 (0.0026) [2024-06-28 06:17:23,850][06674] Fps is (10 sec: 39322.2, 60 sec: 43690.8, 300 sec: 44042.4). Total num frames: 2716024832. Throughput: 0: 44265.0. Samples: 2619006480. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2024-06-28 06:17:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:17:27,312][06909] Updated weights for policy 0, policy_version 165783 (0.0045) [2024-06-28 06:17:28,850][06674] Fps is (10 sec: 47513.8, 60 sec: 44509.9, 300 sec: 44098.3). Total num frames: 2716286976. Throughput: 0: 43957.9. Samples: 2619140700. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2024-06-28 06:17:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 06:17:30,475][06909] Updated weights for policy 0, policy_version 165793 (0.0029) [2024-06-28 06:17:33,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2716467200. Throughput: 0: 44022.8. Samples: 2619408200. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2024-06-28 06:17:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:17:34,492][06909] Updated weights for policy 0, policy_version 165803 (0.0034) [2024-06-28 06:17:37,985][06909] Updated weights for policy 0, policy_version 165813 (0.0045) [2024-06-28 06:17:38,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2716696576. Throughput: 0: 44032.6. Samples: 2619664620. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2024-06-28 06:17:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:17:42,209][06909] Updated weights for policy 0, policy_version 165823 (0.0037) [2024-06-28 06:17:43,856][06674] Fps is (10 sec: 49122.4, 60 sec: 44232.4, 300 sec: 44208.1). Total num frames: 2716958720. Throughput: 0: 43942.7. Samples: 2619799180. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2024-06-28 06:17:43,856][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:17:45,155][06909] Updated weights for policy 0, policy_version 165833 (0.0031) [2024-06-28 06:17:47,479][06887] Signal inference workers to stop experience collection... (37250 times) [2024-06-28 06:17:47,515][06909] InferenceWorker_p0-w0: stopping experience collection (37250 times) [2024-06-28 06:17:47,537][06887] Signal inference workers to resume experience collection... (37250 times) [2024-06-28 06:17:47,538][06909] InferenceWorker_p0-w0: resuming experience collection (37250 times) [2024-06-28 06:17:48,850][06674] Fps is (10 sec: 44235.9, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 2717138944. Throughput: 0: 44257.7. Samples: 2620075120. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2024-06-28 06:17:48,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:17:48,907][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000165842_2717155328.pth... [2024-06-28 06:17:48,962][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000165196_2706571264.pth [2024-06-28 06:17:49,334][06909] Updated weights for policy 0, policy_version 165843 (0.0034) [2024-06-28 06:17:52,773][06909] Updated weights for policy 0, policy_version 165853 (0.0029) [2024-06-28 06:17:53,850][06674] Fps is (10 sec: 39344.9, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2717351936. Throughput: 0: 44417.3. Samples: 2620332940. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2024-06-28 06:17:53,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:17:56,670][06909] Updated weights for policy 0, policy_version 165863 (0.0026) [2024-06-28 06:17:58,851][06674] Fps is (10 sec: 47508.6, 60 sec: 43963.0, 300 sec: 44153.3). Total num frames: 2717614080. Throughput: 0: 44271.3. Samples: 2620469060. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 06:17:58,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:18:00,005][06909] Updated weights for policy 0, policy_version 165873 (0.0030) [2024-06-28 06:18:03,850][06674] Fps is (10 sec: 45875.8, 60 sec: 44783.0, 300 sec: 44098.3). Total num frames: 2717810688. Throughput: 0: 44197.4. Samples: 2620735260. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 06:18:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:18:04,297][06909] Updated weights for policy 0, policy_version 165883 (0.0036) [2024-06-28 06:18:08,124][06909] Updated weights for policy 0, policy_version 165893 (0.0031) [2024-06-28 06:18:08,850][06674] Fps is (10 sec: 39325.8, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2718007296. Throughput: 0: 44016.7. Samples: 2620987240. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 06:18:08,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:18:11,574][06909] Updated weights for policy 0, policy_version 165903 (0.0035) [2024-06-28 06:18:13,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2718269440. Throughput: 0: 43915.6. Samples: 2621116900. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 06:18:13,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 06:18:15,305][06909] Updated weights for policy 0, policy_version 165913 (0.0040) [2024-06-28 06:18:18,852][06674] Fps is (10 sec: 45866.2, 60 sec: 44235.3, 300 sec: 44097.7). Total num frames: 2718466048. Throughput: 0: 43974.4. Samples: 2621387140. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 06:18:18,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:18:19,432][06909] Updated weights for policy 0, policy_version 165923 (0.0022) [2024-06-28 06:18:22,969][06909] Updated weights for policy 0, policy_version 165933 (0.0032) [2024-06-28 06:18:23,850][06674] Fps is (10 sec: 40959.6, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 2718679040. Throughput: 0: 44115.4. Samples: 2621649820. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 06:18:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:18:26,709][06909] Updated weights for policy 0, policy_version 165943 (0.0043) [2024-06-28 06:18:28,850][06674] Fps is (10 sec: 45884.5, 60 sec: 43963.7, 300 sec: 44098.3). Total num frames: 2718924800. Throughput: 0: 43960.0. Samples: 2621777120. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 06:18:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:18:30,086][06909] Updated weights for policy 0, policy_version 165953 (0.0029) [2024-06-28 06:18:33,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2719121408. Throughput: 0: 44003.1. Samples: 2622055260. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 06:18:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:18:34,146][06909] Updated weights for policy 0, policy_version 165963 (0.0026) [2024-06-28 06:18:37,696][06909] Updated weights for policy 0, policy_version 165973 (0.0032) [2024-06-28 06:18:38,850][06674] Fps is (10 sec: 42598.7, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 2719350784. Throughput: 0: 44050.3. Samples: 2622315200. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 06:18:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:18:41,501][06909] Updated weights for policy 0, policy_version 165983 (0.0026) [2024-06-28 06:18:43,850][06674] Fps is (10 sec: 47513.8, 60 sec: 43968.1, 300 sec: 44153.5). Total num frames: 2719596544. Throughput: 0: 44074.8. Samples: 2622452380. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 06:18:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:18:45,161][06909] Updated weights for policy 0, policy_version 165993 (0.0044) [2024-06-28 06:18:48,661][06909] Updated weights for policy 0, policy_version 166003 (0.0034) [2024-06-28 06:18:48,852][06674] Fps is (10 sec: 44227.8, 60 sec: 44235.4, 300 sec: 44153.2). Total num frames: 2719793152. Throughput: 0: 44118.0. Samples: 2622720660. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 06:18:48,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:18:52,555][06909] Updated weights for policy 0, policy_version 166013 (0.0046) [2024-06-28 06:18:53,850][06674] Fps is (10 sec: 39322.2, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2719989760. Throughput: 0: 44172.6. Samples: 2622975000. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 06:18:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:18:56,383][06909] Updated weights for policy 0, policy_version 166023 (0.0033) [2024-06-28 06:18:56,593][06887] Signal inference workers to stop experience collection... (37300 times) [2024-06-28 06:18:56,600][06887] Signal inference workers to resume experience collection... (37300 times) [2024-06-28 06:18:56,609][06909] InferenceWorker_p0-w0: stopping experience collection (37300 times) [2024-06-28 06:18:56,640][06909] InferenceWorker_p0-w0: resuming experience collection (37300 times) [2024-06-28 06:18:58,850][06674] Fps is (10 sec: 44245.8, 60 sec: 43691.5, 300 sec: 44153.5). Total num frames: 2720235520. Throughput: 0: 44290.2. Samples: 2623109960. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 06:18:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:18:59,959][06909] Updated weights for policy 0, policy_version 166033 (0.0041) [2024-06-28 06:19:03,736][06909] Updated weights for policy 0, policy_version 166043 (0.0036) [2024-06-28 06:19:03,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2720448512. Throughput: 0: 44250.9. Samples: 2623378340. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-28 06:19:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:19:07,644][06909] Updated weights for policy 0, policy_version 166053 (0.0031) [2024-06-28 06:19:08,850][06674] Fps is (10 sec: 42597.9, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2720661504. Throughput: 0: 44239.1. Samples: 2623640580. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-28 06:19:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:19:11,056][06909] Updated weights for policy 0, policy_version 166063 (0.0040) [2024-06-28 06:19:13,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 2720907264. Throughput: 0: 44401.9. Samples: 2623775200. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-28 06:19:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:19:15,136][06909] Updated weights for policy 0, policy_version 166073 (0.0032) [2024-06-28 06:19:18,406][06909] Updated weights for policy 0, policy_version 166083 (0.0035) [2024-06-28 06:19:18,850][06674] Fps is (10 sec: 47513.8, 60 sec: 44511.4, 300 sec: 44153.5). Total num frames: 2721136640. Throughput: 0: 44178.8. Samples: 2624043300. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-28 06:19:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:19:22,320][06909] Updated weights for policy 0, policy_version 166093 (0.0031) [2024-06-28 06:19:23,856][06674] Fps is (10 sec: 42572.5, 60 sec: 44232.4, 300 sec: 44263.6). Total num frames: 2721333248. Throughput: 0: 44235.4. Samples: 2624306060. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-28 06:19:23,856][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:19:25,718][06909] Updated weights for policy 0, policy_version 166103 (0.0024) [2024-06-28 06:19:28,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2721562624. Throughput: 0: 43978.2. Samples: 2624431400. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-28 06:19:28,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:19:29,890][06909] Updated weights for policy 0, policy_version 166113 (0.0033) [2024-06-28 06:19:33,352][06909] Updated weights for policy 0, policy_version 166123 (0.0026) [2024-06-28 06:19:33,850][06674] Fps is (10 sec: 45903.3, 60 sec: 44510.0, 300 sec: 44153.8). Total num frames: 2721792000. Throughput: 0: 44017.1. Samples: 2624701340. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-28 06:19:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:19:37,517][06909] Updated weights for policy 0, policy_version 166133 (0.0023) [2024-06-28 06:19:38,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2721988608. Throughput: 0: 44173.7. Samples: 2624962820. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-28 06:19:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:19:40,714][06909] Updated weights for policy 0, policy_version 166143 (0.0027) [2024-06-28 06:19:43,850][06674] Fps is (10 sec: 42597.6, 60 sec: 43690.7, 300 sec: 44209.0). Total num frames: 2722217984. Throughput: 0: 43966.5. Samples: 2625088460. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-28 06:19:43,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:19:44,949][06909] Updated weights for policy 0, policy_version 166153 (0.0035) [2024-06-28 06:19:48,265][06909] Updated weights for policy 0, policy_version 166163 (0.0034) [2024-06-28 06:19:48,850][06674] Fps is (10 sec: 47513.2, 60 sec: 44511.3, 300 sec: 44209.0). Total num frames: 2722463744. Throughput: 0: 44171.5. Samples: 2625366060. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-28 06:19:48,854][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 06:19:48,864][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000166167_2722480128.pth... [2024-06-28 06:19:48,910][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000165516_2711814144.pth [2024-06-28 06:19:52,284][06909] Updated weights for policy 0, policy_version 166173 (0.0031) [2024-06-28 06:19:53,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 2722643968. Throughput: 0: 44098.7. Samples: 2625625020. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-28 06:19:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:19:55,640][06909] Updated weights for policy 0, policy_version 166183 (0.0030) [2024-06-28 06:19:58,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2722873344. Throughput: 0: 43907.1. Samples: 2625751020. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-28 06:19:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:19:59,863][06909] Updated weights for policy 0, policy_version 166193 (0.0028) [2024-06-28 06:20:03,003][06909] Updated weights for policy 0, policy_version 166203 (0.0039) [2024-06-28 06:20:03,850][06674] Fps is (10 sec: 49152.6, 60 sec: 44783.0, 300 sec: 44264.6). Total num frames: 2723135488. Throughput: 0: 44029.9. Samples: 2626024640. Policy #0 lag: (min: 1.0, avg: 12.5, max: 25.0) [2024-06-28 06:20:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:20:06,956][06909] Updated weights for policy 0, policy_version 166213 (0.0034) [2024-06-28 06:20:08,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.8, 300 sec: 44153.8). Total num frames: 2723315712. Throughput: 0: 44220.6. Samples: 2626295720. Policy #0 lag: (min: 1.0, avg: 12.5, max: 25.0) [2024-06-28 06:20:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:20:10,301][06909] Updated weights for policy 0, policy_version 166223 (0.0038) [2024-06-28 06:20:13,856][06674] Fps is (10 sec: 40935.2, 60 sec: 43959.3, 300 sec: 44263.7). Total num frames: 2723545088. Throughput: 0: 44042.2. Samples: 2626413560. Policy #0 lag: (min: 1.0, avg: 12.5, max: 25.0) [2024-06-28 06:20:13,856][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 06:20:14,644][06909] Updated weights for policy 0, policy_version 166233 (0.0037) [2024-06-28 06:20:17,677][06909] Updated weights for policy 0, policy_version 166243 (0.0025) [2024-06-28 06:20:18,850][06674] Fps is (10 sec: 47513.8, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 2723790848. Throughput: 0: 44070.1. Samples: 2626684500. Policy #0 lag: (min: 1.0, avg: 12.5, max: 25.0) [2024-06-28 06:20:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:20:22,143][06909] Updated weights for policy 0, policy_version 166253 (0.0033) [2024-06-28 06:20:23,850][06674] Fps is (10 sec: 40984.8, 60 sec: 43695.1, 300 sec: 44042.4). Total num frames: 2723954688. Throughput: 0: 44149.8. Samples: 2626949560. Policy #0 lag: (min: 1.0, avg: 12.5, max: 25.0) [2024-06-28 06:20:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:20:24,654][06887] Signal inference workers to stop experience collection... (37350 times) [2024-06-28 06:20:24,703][06909] InferenceWorker_p0-w0: stopping experience collection (37350 times) [2024-06-28 06:20:24,711][06887] Signal inference workers to resume experience collection... (37350 times) [2024-06-28 06:20:24,716][06909] InferenceWorker_p0-w0: resuming experience collection (37350 times) [2024-06-28 06:20:25,026][06909] Updated weights for policy 0, policy_version 166263 (0.0027) [2024-06-28 06:20:28,850][06674] Fps is (10 sec: 39321.8, 60 sec: 43690.8, 300 sec: 44098.0). Total num frames: 2724184064. Throughput: 0: 44070.4. Samples: 2627071620. Policy #0 lag: (min: 1.0, avg: 12.5, max: 25.0) [2024-06-28 06:20:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 06:20:29,617][06909] Updated weights for policy 0, policy_version 166273 (0.0045) [2024-06-28 06:20:32,579][06909] Updated weights for policy 0, policy_version 166283 (0.0026) [2024-06-28 06:20:33,850][06674] Fps is (10 sec: 49151.8, 60 sec: 44236.8, 300 sec: 44153.8). Total num frames: 2724446208. Throughput: 0: 43931.2. Samples: 2627342960. Policy #0 lag: (min: 1.0, avg: 12.5, max: 25.0) [2024-06-28 06:20:33,856][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:20:36,782][06909] Updated weights for policy 0, policy_version 166293 (0.0028) [2024-06-28 06:20:38,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 2724610048. Throughput: 0: 44241.4. Samples: 2627615880. Policy #0 lag: (min: 1.0, avg: 12.5, max: 25.0) [2024-06-28 06:20:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:20:40,043][06909] Updated weights for policy 0, policy_version 166303 (0.0035) [2024-06-28 06:20:43,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43963.8, 300 sec: 44209.6). Total num frames: 2724855808. Throughput: 0: 44049.8. Samples: 2627733260. Policy #0 lag: (min: 1.0, avg: 12.5, max: 25.0) [2024-06-28 06:20:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:20:44,408][06909] Updated weights for policy 0, policy_version 166313 (0.0039) [2024-06-28 06:20:47,523][06909] Updated weights for policy 0, policy_version 166323 (0.0025) [2024-06-28 06:20:48,850][06674] Fps is (10 sec: 50789.9, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 2725117952. Throughput: 0: 43974.1. Samples: 2628003480. Policy #0 lag: (min: 1.0, avg: 12.5, max: 25.0) [2024-06-28 06:20:48,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 06:20:51,735][06909] Updated weights for policy 0, policy_version 166333 (0.0033) [2024-06-28 06:20:53,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2725281792. Throughput: 0: 43968.1. Samples: 2628274280. Policy #0 lag: (min: 1.0, avg: 12.5, max: 25.0) [2024-06-28 06:20:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:20:54,901][06909] Updated weights for policy 0, policy_version 166343 (0.0037) [2024-06-28 06:20:58,850][06674] Fps is (10 sec: 39322.0, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2725511168. Throughput: 0: 44050.8. Samples: 2628395580. Policy #0 lag: (min: 1.0, avg: 12.5, max: 25.0) [2024-06-28 06:20:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:20:58,969][06909] Updated weights for policy 0, policy_version 166353 (0.0030) [2024-06-28 06:21:02,293][06909] Updated weights for policy 0, policy_version 166363 (0.0029) [2024-06-28 06:21:03,850][06674] Fps is (10 sec: 47513.5, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 2725756928. Throughput: 0: 43952.1. Samples: 2628662340. Policy #0 lag: (min: 1.0, avg: 12.5, max: 25.0) [2024-06-28 06:21:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:21:06,545][06909] Updated weights for policy 0, policy_version 166373 (0.0026) [2024-06-28 06:21:08,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.8, 300 sec: 43931.4). Total num frames: 2725937152. Throughput: 0: 44072.9. Samples: 2628932840. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 06:21:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:21:09,940][06909] Updated weights for policy 0, policy_version 166383 (0.0040) [2024-06-28 06:21:13,765][06909] Updated weights for policy 0, policy_version 166393 (0.0032) [2024-06-28 06:21:13,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43968.2, 300 sec: 44209.0). Total num frames: 2726182912. Throughput: 0: 44120.0. Samples: 2629057020. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 06:21:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:21:17,391][06909] Updated weights for policy 0, policy_version 166403 (0.0046) [2024-06-28 06:21:18,850][06674] Fps is (10 sec: 49151.8, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2726428672. Throughput: 0: 44042.7. Samples: 2629324880. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 06:21:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:21:21,553][06909] Updated weights for policy 0, policy_version 166413 (0.0031) [2024-06-28 06:21:23,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2726608896. Throughput: 0: 43917.3. Samples: 2629592160. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 06:21:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:21:24,708][06909] Updated weights for policy 0, policy_version 166423 (0.0034) [2024-06-28 06:21:28,773][06909] Updated weights for policy 0, policy_version 166433 (0.0029) [2024-06-28 06:21:28,852][06674] Fps is (10 sec: 40951.5, 60 sec: 44235.3, 300 sec: 44153.2). Total num frames: 2726838272. Throughput: 0: 44068.3. Samples: 2629716420. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 06:21:28,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:21:32,171][06909] Updated weights for policy 0, policy_version 166443 (0.0035) [2024-06-28 06:21:32,803][06887] Signal inference workers to stop experience collection... (37400 times) [2024-06-28 06:21:32,803][06887] Signal inference workers to resume experience collection... (37400 times) [2024-06-28 06:21:32,854][06909] InferenceWorker_p0-w0: stopping experience collection (37400 times) [2024-06-28 06:21:32,854][06909] InferenceWorker_p0-w0: resuming experience collection (37400 times) [2024-06-28 06:21:33,850][06674] Fps is (10 sec: 47513.7, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2727084032. Throughput: 0: 43862.8. Samples: 2629977300. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 06:21:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:21:36,272][06909] Updated weights for policy 0, policy_version 166453 (0.0034) [2024-06-28 06:21:38,850][06674] Fps is (10 sec: 42607.2, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 2727264256. Throughput: 0: 43872.4. Samples: 2630248540. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 06:21:38,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 06:21:39,741][06909] Updated weights for policy 0, policy_version 166463 (0.0026) [2024-06-28 06:21:43,850][06674] Fps is (10 sec: 39321.5, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 2727477248. Throughput: 0: 43984.9. Samples: 2630374900. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 06:21:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:21:43,894][06909] Updated weights for policy 0, policy_version 166473 (0.0029) [2024-06-28 06:21:47,153][06909] Updated weights for policy 0, policy_version 166483 (0.0046) [2024-06-28 06:21:48,850][06674] Fps is (10 sec: 47513.1, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 2727739392. Throughput: 0: 43926.6. Samples: 2630639040. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 06:21:48,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:21:48,995][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000166489_2727755776.pth... [2024-06-28 06:21:49,047][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000165842_2717155328.pth [2024-06-28 06:21:51,182][06909] Updated weights for policy 0, policy_version 166493 (0.0035) [2024-06-28 06:21:53,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43963.6, 300 sec: 43875.8). Total num frames: 2727919616. Throughput: 0: 43838.0. Samples: 2630905560. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 06:21:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:21:54,967][06909] Updated weights for policy 0, policy_version 166503 (0.0025) [2024-06-28 06:21:58,850][06674] Fps is (10 sec: 39322.0, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 2728132608. Throughput: 0: 43853.3. Samples: 2631030420. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 06:21:58,850][06674] Avg episode reward: [(0, '0.444')] [2024-06-28 06:21:58,869][06909] Updated weights for policy 0, policy_version 166513 (0.0032) [2024-06-28 06:22:02,212][06909] Updated weights for policy 0, policy_version 166523 (0.0039) [2024-06-28 06:22:03,850][06674] Fps is (10 sec: 47514.0, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2728394752. Throughput: 0: 43799.0. Samples: 2631295840. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 06:22:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:22:06,132][06909] Updated weights for policy 0, policy_version 166533 (0.0029) [2024-06-28 06:22:08,850][06674] Fps is (10 sec: 47513.3, 60 sec: 44509.8, 300 sec: 43986.9). Total num frames: 2728607744. Throughput: 0: 43925.2. Samples: 2631568800. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 06:22:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:22:09,315][06909] Updated weights for policy 0, policy_version 166543 (0.0035) [2024-06-28 06:22:13,729][06909] Updated weights for policy 0, policy_version 166553 (0.0034) [2024-06-28 06:22:13,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2728804352. Throughput: 0: 44022.8. Samples: 2631697360. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 06:22:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:22:16,913][06909] Updated weights for policy 0, policy_version 166563 (0.0034) [2024-06-28 06:22:18,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 2729066496. Throughput: 0: 44033.7. Samples: 2631958820. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 06:22:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:22:21,060][06909] Updated weights for policy 0, policy_version 166573 (0.0029) [2024-06-28 06:22:23,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2729263104. Throughput: 0: 44128.9. Samples: 2632234340. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 06:22:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:22:24,341][06909] Updated weights for policy 0, policy_version 166583 (0.0028) [2024-06-28 06:22:28,211][06909] Updated weights for policy 0, policy_version 166593 (0.0030) [2024-06-28 06:22:28,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43965.2, 300 sec: 44097.9). Total num frames: 2729476096. Throughput: 0: 44014.1. Samples: 2632355540. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 06:22:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:22:31,822][06909] Updated weights for policy 0, policy_version 166603 (0.0028) [2024-06-28 06:22:33,850][06674] Fps is (10 sec: 47513.1, 60 sec: 44236.7, 300 sec: 44209.0). Total num frames: 2729738240. Throughput: 0: 44215.5. Samples: 2632628740. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 06:22:33,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:22:35,899][06909] Updated weights for policy 0, policy_version 166613 (0.0038) [2024-06-28 06:22:38,850][06674] Fps is (10 sec: 45875.7, 60 sec: 44509.9, 300 sec: 43987.8). Total num frames: 2729934848. Throughput: 0: 44164.6. Samples: 2632892960. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 06:22:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:22:39,177][06909] Updated weights for policy 0, policy_version 166623 (0.0028) [2024-06-28 06:22:43,197][06909] Updated weights for policy 0, policy_version 166633 (0.0036) [2024-06-28 06:22:43,850][06674] Fps is (10 sec: 37683.7, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2730115072. Throughput: 0: 44212.5. Samples: 2633019980. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 06:22:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:22:46,502][06909] Updated weights for policy 0, policy_version 166643 (0.0021) [2024-06-28 06:22:48,852][06674] Fps is (10 sec: 44227.3, 60 sec: 43962.3, 300 sec: 44153.2). Total num frames: 2730377216. Throughput: 0: 44300.2. Samples: 2633289440. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 06:22:48,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:22:50,653][06909] Updated weights for policy 0, policy_version 166653 (0.0027) [2024-06-28 06:22:53,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44510.0, 300 sec: 43987.1). Total num frames: 2730590208. Throughput: 0: 44115.2. Samples: 2633553980. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 06:22:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:22:53,853][06909] Updated weights for policy 0, policy_version 166663 (0.0028) [2024-06-28 06:22:57,963][06909] Updated weights for policy 0, policy_version 166673 (0.0042) [2024-06-28 06:22:58,850][06674] Fps is (10 sec: 42606.9, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 2730803200. Throughput: 0: 44132.8. Samples: 2633683340. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 06:22:58,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:23:01,713][06909] Updated weights for policy 0, policy_version 166683 (0.0031) [2024-06-28 06:23:03,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2731032576. Throughput: 0: 44061.4. Samples: 2633941580. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 06:23:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:23:05,343][06909] Updated weights for policy 0, policy_version 166693 (0.0021) [2024-06-28 06:23:08,840][06909] Updated weights for policy 0, policy_version 166703 (0.0030) [2024-06-28 06:23:08,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2731261952. Throughput: 0: 43963.0. Samples: 2634212680. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 06:23:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:23:13,079][06909] Updated weights for policy 0, policy_version 166713 (0.0028) [2024-06-28 06:23:13,850][06674] Fps is (10 sec: 42598.2, 60 sec: 44236.8, 300 sec: 44042.7). Total num frames: 2731458560. Throughput: 0: 44444.0. Samples: 2634355520. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 06:23:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:23:14,834][06887] Signal inference workers to stop experience collection... (37450 times) [2024-06-28 06:23:14,835][06887] Signal inference workers to resume experience collection... (37450 times) [2024-06-28 06:23:14,857][06909] InferenceWorker_p0-w0: stopping experience collection (37450 times) [2024-06-28 06:23:14,888][06909] InferenceWorker_p0-w0: resuming experience collection (37450 times) [2024-06-28 06:23:16,280][06909] Updated weights for policy 0, policy_version 166723 (0.0032) [2024-06-28 06:23:18,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43417.6, 300 sec: 44042.4). Total num frames: 2731671552. Throughput: 0: 43923.2. Samples: 2634605280. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-28 06:23:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:23:20,352][06909] Updated weights for policy 0, policy_version 166733 (0.0033) [2024-06-28 06:23:23,482][06909] Updated weights for policy 0, policy_version 166743 (0.0036) [2024-06-28 06:23:23,850][06674] Fps is (10 sec: 47514.1, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 2731933696. Throughput: 0: 44077.4. Samples: 2634876440. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-28 06:23:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:23:28,233][06909] Updated weights for policy 0, policy_version 166753 (0.0026) [2024-06-28 06:23:28,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2732113920. Throughput: 0: 44326.6. Samples: 2635014680. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-28 06:23:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:23:31,046][06909] Updated weights for policy 0, policy_version 166763 (0.0037) [2024-06-28 06:23:33,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43417.6, 300 sec: 44042.4). Total num frames: 2732343296. Throughput: 0: 43900.7. Samples: 2635264880. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-28 06:23:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:23:35,463][06909] Updated weights for policy 0, policy_version 166773 (0.0033) [2024-06-28 06:23:38,731][06909] Updated weights for policy 0, policy_version 166783 (0.0027) [2024-06-28 06:23:38,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 2732572672. Throughput: 0: 43955.4. Samples: 2635531980. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-28 06:23:38,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:23:42,606][06909] Updated weights for policy 0, policy_version 166793 (0.0034) [2024-06-28 06:23:43,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.7, 300 sec: 43987.2). Total num frames: 2732769280. Throughput: 0: 44093.0. Samples: 2635667520. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-28 06:23:43,850][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 06:23:45,939][06909] Updated weights for policy 0, policy_version 166803 (0.0028) [2024-06-28 06:23:48,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43965.3, 300 sec: 44153.5). Total num frames: 2733015040. Throughput: 0: 44140.4. Samples: 2635927900. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-28 06:23:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:23:48,858][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000166810_2733015040.pth... [2024-06-28 06:23:48,911][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000166167_2722480128.pth [2024-06-28 06:23:50,259][06909] Updated weights for policy 0, policy_version 166813 (0.0043) [2024-06-28 06:23:53,410][06909] Updated weights for policy 0, policy_version 166823 (0.0051) [2024-06-28 06:23:53,850][06674] Fps is (10 sec: 49152.2, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 2733260800. Throughput: 0: 44133.8. Samples: 2636198700. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-28 06:23:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:23:57,883][06909] Updated weights for policy 0, policy_version 166833 (0.0047) [2024-06-28 06:23:58,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2733457408. Throughput: 0: 44039.2. Samples: 2636337280. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-28 06:23:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:24:00,668][06909] Updated weights for policy 0, policy_version 166843 (0.0035) [2024-06-28 06:24:03,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2733670400. Throughput: 0: 44223.1. Samples: 2636595320. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-28 06:24:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:24:05,260][06909] Updated weights for policy 0, policy_version 166853 (0.0024) [2024-06-28 06:24:08,049][06909] Updated weights for policy 0, policy_version 166863 (0.0030) [2024-06-28 06:24:08,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2733916160. Throughput: 0: 44020.9. Samples: 2636857380. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-28 06:24:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 06:24:12,502][06909] Updated weights for policy 0, policy_version 166873 (0.0028) [2024-06-28 06:24:13,852][06674] Fps is (10 sec: 42589.8, 60 sec: 43962.3, 300 sec: 43931.0). Total num frames: 2734096384. Throughput: 0: 43989.1. Samples: 2636994280. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-28 06:24:13,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:24:15,754][06909] Updated weights for policy 0, policy_version 166883 (0.0034) [2024-06-28 06:24:18,850][06674] Fps is (10 sec: 40959.5, 60 sec: 44236.8, 300 sec: 44043.3). Total num frames: 2734325760. Throughput: 0: 44232.0. Samples: 2637255320. Policy #0 lag: (min: 0.0, avg: 11.9, max: 23.0) [2024-06-28 06:24:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:24:19,855][06909] Updated weights for policy 0, policy_version 166893 (0.0025) [2024-06-28 06:24:23,201][06909] Updated weights for policy 0, policy_version 166903 (0.0032) [2024-06-28 06:24:23,850][06674] Fps is (10 sec: 49161.8, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 2734587904. Throughput: 0: 44322.3. Samples: 2637526480. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 06:24:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:24:27,464][06909] Updated weights for policy 0, policy_version 166913 (0.0036) [2024-06-28 06:24:28,733][06887] Signal inference workers to stop experience collection... (37500 times) [2024-06-28 06:24:28,735][06887] Signal inference workers to resume experience collection... (37500 times) [2024-06-28 06:24:28,746][06909] InferenceWorker_p0-w0: stopping experience collection (37500 times) [2024-06-28 06:24:28,746][06909] InferenceWorker_p0-w0: resuming experience collection (37500 times) [2024-06-28 06:24:28,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 2734784512. Throughput: 0: 44348.0. Samples: 2637663180. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 06:24:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:24:30,417][06909] Updated weights for policy 0, policy_version 166923 (0.0027) [2024-06-28 06:24:33,852][06674] Fps is (10 sec: 39313.6, 60 sec: 43962.2, 300 sec: 44042.1). Total num frames: 2734981120. Throughput: 0: 44258.4. Samples: 2637919620. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 06:24:33,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:24:34,640][06909] Updated weights for policy 0, policy_version 166933 (0.0037) [2024-06-28 06:24:37,861][06909] Updated weights for policy 0, policy_version 166943 (0.0036) [2024-06-28 06:24:38,850][06674] Fps is (10 sec: 47513.0, 60 sec: 44782.9, 300 sec: 44209.0). Total num frames: 2735259648. Throughput: 0: 44056.3. Samples: 2638181240. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 06:24:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:24:42,542][06909] Updated weights for policy 0, policy_version 166953 (0.0036) [2024-06-28 06:24:43,850][06674] Fps is (10 sec: 45884.9, 60 sec: 44509.9, 300 sec: 43986.9). Total num frames: 2735439872. Throughput: 0: 44131.1. Samples: 2638323180. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 06:24:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:24:45,064][06909] Updated weights for policy 0, policy_version 166963 (0.0032) [2024-06-28 06:24:48,850][06674] Fps is (10 sec: 37683.7, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2735636480. Throughput: 0: 44039.6. Samples: 2638577100. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 06:24:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:24:49,757][06909] Updated weights for policy 0, policy_version 166973 (0.0053) [2024-06-28 06:24:52,871][06909] Updated weights for policy 0, policy_version 166983 (0.0034) [2024-06-28 06:24:53,852][06674] Fps is (10 sec: 47503.6, 60 sec: 44235.3, 300 sec: 44208.7). Total num frames: 2735915008. Throughput: 0: 43972.6. Samples: 2638836240. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 06:24:53,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:24:57,011][06909] Updated weights for policy 0, policy_version 166993 (0.0026) [2024-06-28 06:24:58,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 2736095232. Throughput: 0: 44167.3. Samples: 2638981720. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 06:24:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:25:00,309][06909] Updated weights for policy 0, policy_version 167003 (0.0044) [2024-06-28 06:25:03,850][06674] Fps is (10 sec: 37690.7, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 2736291840. Throughput: 0: 44068.9. Samples: 2639238420. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 06:25:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:25:04,809][06909] Updated weights for policy 0, policy_version 167013 (0.0031) [2024-06-28 06:25:07,709][06909] Updated weights for policy 0, policy_version 167023 (0.0042) [2024-06-28 06:25:08,850][06674] Fps is (10 sec: 47513.0, 60 sec: 44236.7, 300 sec: 44154.4). Total num frames: 2736570368. Throughput: 0: 43762.6. Samples: 2639495800. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 06:25:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:25:12,022][06909] Updated weights for policy 0, policy_version 167033 (0.0027) [2024-06-28 06:25:13,850][06674] Fps is (10 sec: 47513.9, 60 sec: 44511.4, 300 sec: 43986.9). Total num frames: 2736766976. Throughput: 0: 44023.5. Samples: 2639644240. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 06:25:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:25:15,131][06909] Updated weights for policy 0, policy_version 167043 (0.0031) [2024-06-28 06:25:18,850][06674] Fps is (10 sec: 39321.7, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2736963584. Throughput: 0: 44036.2. Samples: 2639901160. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 06:25:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:25:19,377][06909] Updated weights for policy 0, policy_version 167053 (0.0038) [2024-06-28 06:25:22,362][06909] Updated weights for policy 0, policy_version 167063 (0.0032) [2024-06-28 06:25:23,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 2737225728. Throughput: 0: 43907.6. Samples: 2640157080. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 06:25:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:25:26,850][06909] Updated weights for policy 0, policy_version 167073 (0.0025) [2024-06-28 06:25:28,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2737422336. Throughput: 0: 43956.4. Samples: 2640301220. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2024-06-28 06:25:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:25:30,040][06909] Updated weights for policy 0, policy_version 167083 (0.0033) [2024-06-28 06:25:33,850][06674] Fps is (10 sec: 39322.0, 60 sec: 43965.2, 300 sec: 44097.9). Total num frames: 2737618944. Throughput: 0: 44020.0. Samples: 2640558000. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2024-06-28 06:25:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:25:34,627][06909] Updated weights for policy 0, policy_version 167093 (0.0036) [2024-06-28 06:25:37,426][06909] Updated weights for policy 0, policy_version 167103 (0.0032) [2024-06-28 06:25:38,850][06674] Fps is (10 sec: 47513.2, 60 sec: 43963.8, 300 sec: 44209.0). Total num frames: 2737897472. Throughput: 0: 44054.8. Samples: 2640818620. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2024-06-28 06:25:38,850][06674] Avg episode reward: [(0, '0.486')] [2024-06-28 06:25:41,809][06909] Updated weights for policy 0, policy_version 167113 (0.0035) [2024-06-28 06:25:43,423][06887] Signal inference workers to stop experience collection... (37550 times) [2024-06-28 06:25:43,460][06909] InferenceWorker_p0-w0: stopping experience collection (37550 times) [2024-06-28 06:25:43,481][06887] Signal inference workers to resume experience collection... (37550 times) [2024-06-28 06:25:43,482][06909] InferenceWorker_p0-w0: resuming experience collection (37550 times) [2024-06-28 06:25:43,854][06674] Fps is (10 sec: 47494.1, 60 sec: 44233.7, 300 sec: 43986.3). Total num frames: 2738094080. Throughput: 0: 44119.1. Samples: 2640967260. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2024-06-28 06:25:43,854][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:25:44,768][06909] Updated weights for policy 0, policy_version 167123 (0.0038) [2024-06-28 06:25:48,850][06674] Fps is (10 sec: 39321.9, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2738290688. Throughput: 0: 44239.6. Samples: 2641229200. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2024-06-28 06:25:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:25:48,870][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000167132_2738290688.pth... [2024-06-28 06:25:48,947][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000166489_2727755776.pth [2024-06-28 06:25:49,110][06909] Updated weights for policy 0, policy_version 167133 (0.0020) [2024-06-28 06:25:52,190][06909] Updated weights for policy 0, policy_version 167143 (0.0026) [2024-06-28 06:25:53,850][06674] Fps is (10 sec: 45893.6, 60 sec: 43965.2, 300 sec: 44209.0). Total num frames: 2738552832. Throughput: 0: 44193.4. Samples: 2641484500. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2024-06-28 06:25:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:25:56,785][06909] Updated weights for policy 0, policy_version 167153 (0.0043) [2024-06-28 06:25:58,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44509.8, 300 sec: 44097.9). Total num frames: 2738765824. Throughput: 0: 44106.6. Samples: 2641629040. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2024-06-28 06:25:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:25:59,557][06909] Updated weights for policy 0, policy_version 167163 (0.0025) [2024-06-28 06:26:03,850][06674] Fps is (10 sec: 39322.2, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2738946048. Throughput: 0: 44282.8. Samples: 2641893880. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2024-06-28 06:26:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:26:03,985][06909] Updated weights for policy 0, policy_version 167173 (0.0038) [2024-06-28 06:26:07,044][06909] Updated weights for policy 0, policy_version 167183 (0.0030) [2024-06-28 06:26:08,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2739208192. Throughput: 0: 44189.8. Samples: 2642145620. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2024-06-28 06:26:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:26:11,407][06909] Updated weights for policy 0, policy_version 167193 (0.0035) [2024-06-28 06:26:13,850][06674] Fps is (10 sec: 47512.7, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2739421184. Throughput: 0: 44291.0. Samples: 2642294320. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2024-06-28 06:26:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:26:14,521][06909] Updated weights for policy 0, policy_version 167203 (0.0049) [2024-06-28 06:26:18,661][06909] Updated weights for policy 0, policy_version 167213 (0.0020) [2024-06-28 06:26:18,856][06674] Fps is (10 sec: 40935.1, 60 sec: 44232.3, 300 sec: 44097.0). Total num frames: 2739617792. Throughput: 0: 44354.0. Samples: 2642554200. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2024-06-28 06:26:18,856][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:26:22,072][06909] Updated weights for policy 0, policy_version 167223 (0.0039) [2024-06-28 06:26:23,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43963.8, 300 sec: 44153.8). Total num frames: 2739863552. Throughput: 0: 44202.8. Samples: 2642807740. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2024-06-28 06:26:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:26:25,994][06909] Updated weights for policy 0, policy_version 167233 (0.0031) [2024-06-28 06:26:28,852][06674] Fps is (10 sec: 44254.1, 60 sec: 43962.1, 300 sec: 43986.5). Total num frames: 2740060160. Throughput: 0: 44066.3. Samples: 2642950160. Policy #0 lag: (min: 0.0, avg: 8.3, max: 21.0) [2024-06-28 06:26:28,853][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:26:29,464][06909] Updated weights for policy 0, policy_version 167243 (0.0036) [2024-06-28 06:26:33,720][06909] Updated weights for policy 0, policy_version 167253 (0.0031) [2024-06-28 06:26:33,850][06674] Fps is (10 sec: 40960.1, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2740273152. Throughput: 0: 43912.1. Samples: 2643205240. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 06:26:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:26:36,938][06909] Updated weights for policy 0, policy_version 167263 (0.0029) [2024-06-28 06:26:38,852][06674] Fps is (10 sec: 47516.0, 60 sec: 43962.6, 300 sec: 44264.3). Total num frames: 2740535296. Throughput: 0: 43846.0. Samples: 2643457640. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 06:26:38,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:26:41,018][06909] Updated weights for policy 0, policy_version 167273 (0.0033) [2024-06-28 06:26:43,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43693.6, 300 sec: 43986.9). Total num frames: 2740715520. Throughput: 0: 43880.0. Samples: 2643603640. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 06:26:43,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:26:44,545][06909] Updated weights for policy 0, policy_version 167283 (0.0035) [2024-06-28 06:26:48,786][06909] Updated weights for policy 0, policy_version 167293 (0.0036) [2024-06-28 06:26:48,850][06674] Fps is (10 sec: 39328.0, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2740928512. Throughput: 0: 43831.4. Samples: 2643866300. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 06:26:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:26:52,117][06909] Updated weights for policy 0, policy_version 167303 (0.0029) [2024-06-28 06:26:53,850][06674] Fps is (10 sec: 47514.1, 60 sec: 43963.8, 300 sec: 44264.6). Total num frames: 2741190656. Throughput: 0: 43726.7. Samples: 2644113320. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 06:26:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:26:55,995][06909] Updated weights for policy 0, policy_version 167313 (0.0041) [2024-06-28 06:26:58,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43417.7, 300 sec: 43986.9). Total num frames: 2741370880. Throughput: 0: 43695.3. Samples: 2644260600. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 06:26:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:26:59,497][06909] Updated weights for policy 0, policy_version 167323 (0.0050) [2024-06-28 06:26:59,548][06887] Signal inference workers to stop experience collection... (37600 times) [2024-06-28 06:26:59,573][06909] InferenceWorker_p0-w0: stopping experience collection (37600 times) [2024-06-28 06:26:59,658][06887] Signal inference workers to resume experience collection... (37600 times) [2024-06-28 06:26:59,659][06909] InferenceWorker_p0-w0: resuming experience collection (37600 times) [2024-06-28 06:27:03,278][06909] Updated weights for policy 0, policy_version 167333 (0.0031) [2024-06-28 06:27:03,850][06674] Fps is (10 sec: 39321.1, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 2741583872. Throughput: 0: 43761.9. Samples: 2644523220. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 06:27:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:27:06,979][06909] Updated weights for policy 0, policy_version 167343 (0.0038) [2024-06-28 06:27:08,852][06674] Fps is (10 sec: 47503.5, 60 sec: 43962.2, 300 sec: 44208.7). Total num frames: 2741846016. Throughput: 0: 43750.0. Samples: 2644776580. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 06:27:08,861][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:27:10,942][06909] Updated weights for policy 0, policy_version 167353 (0.0033) [2024-06-28 06:27:13,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43417.7, 300 sec: 43931.3). Total num frames: 2742026240. Throughput: 0: 43661.7. Samples: 2644914840. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 06:27:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:27:14,587][06909] Updated weights for policy 0, policy_version 167363 (0.0034) [2024-06-28 06:27:18,663][06909] Updated weights for policy 0, policy_version 167373 (0.0039) [2024-06-28 06:27:18,850][06674] Fps is (10 sec: 39329.6, 60 sec: 43695.1, 300 sec: 43986.9). Total num frames: 2742239232. Throughput: 0: 43793.7. Samples: 2645175960. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 06:27:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:27:22,066][06909] Updated weights for policy 0, policy_version 167383 (0.0034) [2024-06-28 06:27:23,850][06674] Fps is (10 sec: 47512.9, 60 sec: 43963.6, 300 sec: 44153.5). Total num frames: 2742501376. Throughput: 0: 43946.8. Samples: 2645435180. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 06:27:23,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:27:25,903][06909] Updated weights for policy 0, policy_version 167393 (0.0029) [2024-06-28 06:27:28,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43692.3, 300 sec: 43875.8). Total num frames: 2742681600. Throughput: 0: 43840.1. Samples: 2645576440. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 06:27:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:27:29,542][06909] Updated weights for policy 0, policy_version 167403 (0.0030) [2024-06-28 06:27:33,232][06909] Updated weights for policy 0, policy_version 167413 (0.0031) [2024-06-28 06:27:33,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 2742910976. Throughput: 0: 43811.1. Samples: 2645837800. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 06:27:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:27:36,864][06909] Updated weights for policy 0, policy_version 167423 (0.0042) [2024-06-28 06:27:38,850][06674] Fps is (10 sec: 47512.9, 60 sec: 43691.9, 300 sec: 44209.0). Total num frames: 2743156736. Throughput: 0: 44096.8. Samples: 2646097680. Policy #0 lag: (min: 1.0, avg: 12.5, max: 24.0) [2024-06-28 06:27:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:27:40,438][06909] Updated weights for policy 0, policy_version 167433 (0.0025) [2024-06-28 06:27:43,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.7, 300 sec: 43931.6). Total num frames: 2743336960. Throughput: 0: 43947.9. Samples: 2646238260. Policy #0 lag: (min: 1.0, avg: 12.5, max: 24.0) [2024-06-28 06:27:43,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:27:44,397][06909] Updated weights for policy 0, policy_version 167443 (0.0039) [2024-06-28 06:27:48,009][06909] Updated weights for policy 0, policy_version 167453 (0.0029) [2024-06-28 06:27:48,850][06674] Fps is (10 sec: 39322.0, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 2743549952. Throughput: 0: 43854.4. Samples: 2646496660. Policy #0 lag: (min: 1.0, avg: 12.5, max: 24.0) [2024-06-28 06:27:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:27:48,905][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000167454_2743566336.pth... [2024-06-28 06:27:48,954][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000166810_2733015040.pth [2024-06-28 06:27:51,722][06909] Updated weights for policy 0, policy_version 167463 (0.0030) [2024-06-28 06:27:53,855][06674] Fps is (10 sec: 47491.2, 60 sec: 43687.2, 300 sec: 44097.3). Total num frames: 2743812096. Throughput: 0: 44008.9. Samples: 2646757100. Policy #0 lag: (min: 1.0, avg: 12.5, max: 24.0) [2024-06-28 06:27:53,855][06674] Avg episode reward: [(0, '0.463')] [2024-06-28 06:27:55,353][06909] Updated weights for policy 0, policy_version 167473 (0.0026) [2024-06-28 06:27:58,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2744008704. Throughput: 0: 44218.3. Samples: 2646904660. Policy #0 lag: (min: 1.0, avg: 12.5, max: 24.0) [2024-06-28 06:27:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:27:59,139][06909] Updated weights for policy 0, policy_version 167483 (0.0032) [2024-06-28 06:28:02,884][06909] Updated weights for policy 0, policy_version 167493 (0.0026) [2024-06-28 06:28:03,850][06674] Fps is (10 sec: 40979.4, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 2744221696. Throughput: 0: 44136.5. Samples: 2647162100. Policy #0 lag: (min: 1.0, avg: 12.5, max: 24.0) [2024-06-28 06:28:03,850][06674] Avg episode reward: [(0, '0.443')] [2024-06-28 06:28:06,421][06909] Updated weights for policy 0, policy_version 167503 (0.0034) [2024-06-28 06:28:08,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43692.1, 300 sec: 44097.9). Total num frames: 2744467456. Throughput: 0: 44231.2. Samples: 2647425580. Policy #0 lag: (min: 1.0, avg: 12.5, max: 24.0) [2024-06-28 06:28:08,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:28:10,416][06909] Updated weights for policy 0, policy_version 167513 (0.0037) [2024-06-28 06:28:13,850][06674] Fps is (10 sec: 45874.4, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2744680448. Throughput: 0: 44132.2. Samples: 2647562400. Policy #0 lag: (min: 1.0, avg: 12.5, max: 24.0) [2024-06-28 06:28:13,855][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:28:14,014][06909] Updated weights for policy 0, policy_version 167523 (0.0027) [2024-06-28 06:28:17,637][06909] Updated weights for policy 0, policy_version 167533 (0.0029) [2024-06-28 06:28:18,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 2744893440. Throughput: 0: 44156.4. Samples: 2647824840. Policy #0 lag: (min: 1.0, avg: 12.5, max: 24.0) [2024-06-28 06:28:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:28:21,366][06909] Updated weights for policy 0, policy_version 167543 (0.0036) [2024-06-28 06:28:22,352][06887] Signal inference workers to stop experience collection... (37650 times) [2024-06-28 06:28:22,357][06887] Signal inference workers to resume experience collection... (37650 times) [2024-06-28 06:28:22,371][06909] InferenceWorker_p0-w0: stopping experience collection (37650 times) [2024-06-28 06:28:22,376][06909] InferenceWorker_p0-w0: resuming experience collection (37650 times) [2024-06-28 06:28:23,850][06674] Fps is (10 sec: 42599.4, 60 sec: 43417.8, 300 sec: 44042.4). Total num frames: 2745106432. Throughput: 0: 44228.6. Samples: 2648087960. Policy #0 lag: (min: 1.0, avg: 12.5, max: 24.0) [2024-06-28 06:28:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:28:25,005][06909] Updated weights for policy 0, policy_version 167553 (0.0021) [2024-06-28 06:28:28,770][06909] Updated weights for policy 0, policy_version 167563 (0.0031) [2024-06-28 06:28:28,850][06674] Fps is (10 sec: 45875.8, 60 sec: 44509.8, 300 sec: 44098.0). Total num frames: 2745352192. Throughput: 0: 44052.5. Samples: 2648220620. Policy #0 lag: (min: 1.0, avg: 12.5, max: 24.0) [2024-06-28 06:28:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:28:32,327][06909] Updated weights for policy 0, policy_version 167573 (0.0021) [2024-06-28 06:28:33,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2745548800. Throughput: 0: 44089.2. Samples: 2648480680. Policy #0 lag: (min: 1.0, avg: 12.5, max: 24.0) [2024-06-28 06:28:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:28:36,448][06909] Updated weights for policy 0, policy_version 167583 (0.0031) [2024-06-28 06:28:38,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 2745778176. Throughput: 0: 44172.7. Samples: 2648744660. Policy #0 lag: (min: 1.0, avg: 12.5, max: 24.0) [2024-06-28 06:28:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:28:40,134][06909] Updated weights for policy 0, policy_version 167593 (0.0044) [2024-06-28 06:28:43,707][06909] Updated weights for policy 0, policy_version 167603 (0.0027) [2024-06-28 06:28:43,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44782.9, 300 sec: 44097.9). Total num frames: 2746023936. Throughput: 0: 44007.4. Samples: 2648885000. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 06:28:43,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:28:47,701][06909] Updated weights for policy 0, policy_version 167613 (0.0032) [2024-06-28 06:28:48,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44509.9, 300 sec: 43931.3). Total num frames: 2746220544. Throughput: 0: 44037.4. Samples: 2649143780. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 06:28:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:28:51,311][06909] Updated weights for policy 0, policy_version 167623 (0.0032) [2024-06-28 06:28:53,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43694.1, 300 sec: 43986.9). Total num frames: 2746433536. Throughput: 0: 44119.5. Samples: 2649410960. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 06:28:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:28:54,920][06909] Updated weights for policy 0, policy_version 167633 (0.0029) [2024-06-28 06:28:58,492][06909] Updated weights for policy 0, policy_version 167643 (0.0030) [2024-06-28 06:28:58,850][06674] Fps is (10 sec: 45874.5, 60 sec: 44509.8, 300 sec: 44097.9). Total num frames: 2746679296. Throughput: 0: 43930.8. Samples: 2649539280. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 06:28:58,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 06:29:02,297][06909] Updated weights for policy 0, policy_version 167653 (0.0026) [2024-06-28 06:29:03,850][06674] Fps is (10 sec: 44237.3, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 2746875904. Throughput: 0: 43991.2. Samples: 2649804440. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 06:29:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:29:05,678][06909] Updated weights for policy 0, policy_version 167663 (0.0035) [2024-06-28 06:29:08,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 44098.2). Total num frames: 2747105280. Throughput: 0: 44059.0. Samples: 2650070620. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 06:29:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:29:09,990][06909] Updated weights for policy 0, policy_version 167673 (0.0027) [2024-06-28 06:29:13,421][06909] Updated weights for policy 0, policy_version 167683 (0.0034) [2024-06-28 06:29:13,850][06674] Fps is (10 sec: 47512.8, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 2747351040. Throughput: 0: 44022.9. Samples: 2650201660. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 06:29:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:29:17,153][06909] Updated weights for policy 0, policy_version 167693 (0.0030) [2024-06-28 06:29:18,850][06674] Fps is (10 sec: 44237.4, 60 sec: 44236.9, 300 sec: 43931.3). Total num frames: 2747547648. Throughput: 0: 44168.2. Samples: 2650468240. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 06:29:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:29:20,691][06909] Updated weights for policy 0, policy_version 167703 (0.0030) [2024-06-28 06:29:23,850][06674] Fps is (10 sec: 39322.2, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 2747744256. Throughput: 0: 44368.9. Samples: 2650741260. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 06:29:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:29:24,534][06909] Updated weights for policy 0, policy_version 167713 (0.0023) [2024-06-28 06:29:28,132][06909] Updated weights for policy 0, policy_version 167723 (0.0031) [2024-06-28 06:29:28,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44236.7, 300 sec: 44153.8). Total num frames: 2748006400. Throughput: 0: 44044.1. Samples: 2650866980. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 06:29:28,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 06:29:32,093][06909] Updated weights for policy 0, policy_version 167733 (0.0034) [2024-06-28 06:29:33,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.9, 300 sec: 43875.8). Total num frames: 2748203008. Throughput: 0: 44237.7. Samples: 2651134480. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 06:29:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:29:35,214][06909] Updated weights for policy 0, policy_version 167743 (0.0029) [2024-06-28 06:29:38,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43963.6, 300 sec: 43986.8). Total num frames: 2748416000. Throughput: 0: 44280.8. Samples: 2651403600. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 06:29:38,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:29:39,645][06909] Updated weights for policy 0, policy_version 167753 (0.0028) [2024-06-28 06:29:42,690][06909] Updated weights for policy 0, policy_version 167763 (0.0044) [2024-06-28 06:29:43,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2748661760. Throughput: 0: 44238.6. Samples: 2651530020. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 06:29:43,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 06:29:46,712][06887] Signal inference workers to stop experience collection... (37700 times) [2024-06-28 06:29:46,712][06887] Signal inference workers to resume experience collection... (37700 times) [2024-06-28 06:29:46,744][06909] InferenceWorker_p0-w0: stopping experience collection (37700 times) [2024-06-28 06:29:46,745][06909] InferenceWorker_p0-w0: resuming experience collection (37700 times) [2024-06-28 06:29:46,847][06909] Updated weights for policy 0, policy_version 167773 (0.0032) [2024-06-28 06:29:48,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44236.7, 300 sec: 43931.6). Total num frames: 2748874752. Throughput: 0: 44240.4. Samples: 2651795260. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 06:29:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:29:48,865][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000167778_2748874752.pth... [2024-06-28 06:29:48,919][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000167132_2738290688.pth [2024-06-28 06:29:50,249][06909] Updated weights for policy 0, policy_version 167783 (0.0031) [2024-06-28 06:29:53,850][06674] Fps is (10 sec: 42598.7, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 2749087744. Throughput: 0: 44342.7. Samples: 2652066040. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 06:29:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:29:54,179][06909] Updated weights for policy 0, policy_version 167793 (0.0024) [2024-06-28 06:29:57,450][06909] Updated weights for policy 0, policy_version 167803 (0.0024) [2024-06-28 06:29:58,850][06674] Fps is (10 sec: 47513.9, 60 sec: 44509.9, 300 sec: 44264.6). Total num frames: 2749349888. Throughput: 0: 44349.5. Samples: 2652197380. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 06:29:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:30:01,806][06909] Updated weights for policy 0, policy_version 167813 (0.0043) [2024-06-28 06:30:03,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 2749530112. Throughput: 0: 44204.4. Samples: 2652457440. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 06:30:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:30:05,005][06909] Updated weights for policy 0, policy_version 167823 (0.0035) [2024-06-28 06:30:08,850][06674] Fps is (10 sec: 39321.8, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2749743104. Throughput: 0: 44123.6. Samples: 2652726820. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 06:30:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:30:09,215][06909] Updated weights for policy 0, policy_version 167833 (0.0035) [2024-06-28 06:30:12,412][06909] Updated weights for policy 0, policy_version 167843 (0.0032) [2024-06-28 06:30:13,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.8, 300 sec: 44098.0). Total num frames: 2749972480. Throughput: 0: 44280.5. Samples: 2652859600. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 06:30:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:30:16,806][06909] Updated weights for policy 0, policy_version 167853 (0.0032) [2024-06-28 06:30:18,850][06674] Fps is (10 sec: 45874.5, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 2750201856. Throughput: 0: 44084.3. Samples: 2653118280. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 06:30:18,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:30:19,939][06909] Updated weights for policy 0, policy_version 167863 (0.0045) [2024-06-28 06:30:23,850][06674] Fps is (10 sec: 42598.0, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 2750398464. Throughput: 0: 44034.2. Samples: 2653385140. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 06:30:23,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:30:24,328][06909] Updated weights for policy 0, policy_version 167873 (0.0029) [2024-06-28 06:30:27,464][06909] Updated weights for policy 0, policy_version 167883 (0.0028) [2024-06-28 06:30:28,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2750644224. Throughput: 0: 44145.0. Samples: 2653516540. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 06:30:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:30:31,735][06909] Updated weights for policy 0, policy_version 167893 (0.0038) [2024-06-28 06:30:33,850][06674] Fps is (10 sec: 45875.9, 60 sec: 44236.8, 300 sec: 43931.4). Total num frames: 2750857216. Throughput: 0: 44073.0. Samples: 2653778540. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 06:30:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:30:34,734][06909] Updated weights for policy 0, policy_version 167903 (0.0019) [2024-06-28 06:30:38,850][06674] Fps is (10 sec: 42597.8, 60 sec: 44236.8, 300 sec: 43987.5). Total num frames: 2751070208. Throughput: 0: 44060.8. Samples: 2654048780. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 06:30:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:30:39,042][06909] Updated weights for policy 0, policy_version 167913 (0.0026) [2024-06-28 06:30:42,127][06909] Updated weights for policy 0, policy_version 167923 (0.0028) [2024-06-28 06:30:43,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2751299584. Throughput: 0: 44009.4. Samples: 2654177800. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 06:30:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:30:46,272][06909] Updated weights for policy 0, policy_version 167933 (0.0044) [2024-06-28 06:30:48,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2751528960. Throughput: 0: 44165.7. Samples: 2654444900. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 06:30:48,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:30:49,566][06909] Updated weights for policy 0, policy_version 167943 (0.0031) [2024-06-28 06:30:53,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 2751725568. Throughput: 0: 44014.7. Samples: 2654707480. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2024-06-28 06:30:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:30:54,085][06909] Updated weights for policy 0, policy_version 167953 (0.0027) [2024-06-28 06:30:57,219][06909] Updated weights for policy 0, policy_version 167963 (0.0044) [2024-06-28 06:30:58,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43417.5, 300 sec: 44097.9). Total num frames: 2751954944. Throughput: 0: 43923.5. Samples: 2654836160. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2024-06-28 06:30:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:31:01,402][06909] Updated weights for policy 0, policy_version 167973 (0.0041) [2024-06-28 06:31:03,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2752184320. Throughput: 0: 44178.8. Samples: 2655106320. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2024-06-28 06:31:03,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 06:31:04,487][06909] Updated weights for policy 0, policy_version 167983 (0.0025) [2024-06-28 06:31:07,871][06887] Signal inference workers to stop experience collection... (37750 times) [2024-06-28 06:31:07,876][06887] Signal inference workers to resume experience collection... (37750 times) [2024-06-28 06:31:07,920][06909] InferenceWorker_p0-w0: stopping experience collection (37750 times) [2024-06-28 06:31:07,920][06909] InferenceWorker_p0-w0: resuming experience collection (37750 times) [2024-06-28 06:31:08,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43963.7, 300 sec: 43931.4). Total num frames: 2752380928. Throughput: 0: 44101.1. Samples: 2655369680. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2024-06-28 06:31:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:31:08,912][06909] Updated weights for policy 0, policy_version 167993 (0.0031) [2024-06-28 06:31:11,768][06909] Updated weights for policy 0, policy_version 168003 (0.0026) [2024-06-28 06:31:13,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 44098.9). Total num frames: 2752626688. Throughput: 0: 44127.5. Samples: 2655502280. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2024-06-28 06:31:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:31:16,214][06909] Updated weights for policy 0, policy_version 168013 (0.0036) [2024-06-28 06:31:18,850][06674] Fps is (10 sec: 47513.2, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 2752856064. Throughput: 0: 44307.5. Samples: 2655772380. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2024-06-28 06:31:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:31:19,359][06909] Updated weights for policy 0, policy_version 168023 (0.0022) [2024-06-28 06:31:23,394][06909] Updated weights for policy 0, policy_version 168033 (0.0030) [2024-06-28 06:31:23,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44236.8, 300 sec: 44042.7). Total num frames: 2753052672. Throughput: 0: 44049.8. Samples: 2656031020. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2024-06-28 06:31:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:31:26,690][06909] Updated weights for policy 0, policy_version 168043 (0.0043) [2024-06-28 06:31:28,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2753265664. Throughput: 0: 43972.9. Samples: 2656156580. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2024-06-28 06:31:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:31:30,822][06909] Updated weights for policy 0, policy_version 168053 (0.0037) [2024-06-28 06:31:33,850][06674] Fps is (10 sec: 47514.1, 60 sec: 44509.9, 300 sec: 44042.7). Total num frames: 2753527808. Throughput: 0: 44042.8. Samples: 2656426820. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2024-06-28 06:31:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:31:34,230][06909] Updated weights for policy 0, policy_version 168063 (0.0030) [2024-06-28 06:31:38,508][06909] Updated weights for policy 0, policy_version 168073 (0.0028) [2024-06-28 06:31:38,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2753708032. Throughput: 0: 44228.8. Samples: 2656697780. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2024-06-28 06:31:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:31:41,601][06909] Updated weights for policy 0, policy_version 168083 (0.0032) [2024-06-28 06:31:43,852][06674] Fps is (10 sec: 40951.4, 60 sec: 43962.2, 300 sec: 44097.7). Total num frames: 2753937408. Throughput: 0: 44133.6. Samples: 2656822260. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2024-06-28 06:31:43,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:31:45,707][06909] Updated weights for policy 0, policy_version 168093 (0.0028) [2024-06-28 06:31:48,825][06909] Updated weights for policy 0, policy_version 168103 (0.0039) [2024-06-28 06:31:48,850][06674] Fps is (10 sec: 49152.7, 60 sec: 44510.0, 300 sec: 44098.0). Total num frames: 2754199552. Throughput: 0: 44209.4. Samples: 2657095740. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2024-06-28 06:31:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:31:48,861][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000168103_2754199552.pth... [2024-06-28 06:31:48,918][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000167454_2743566336.pth [2024-06-28 06:31:52,938][06909] Updated weights for policy 0, policy_version 168113 (0.0038) [2024-06-28 06:31:53,850][06674] Fps is (10 sec: 44246.1, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2754379776. Throughput: 0: 44324.4. Samples: 2657364280. Policy #0 lag: (min: 0.0, avg: 9.2, max: 22.0) [2024-06-28 06:31:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:31:56,224][06909] Updated weights for policy 0, policy_version 168123 (0.0026) [2024-06-28 06:31:58,852][06674] Fps is (10 sec: 40951.3, 60 sec: 44235.3, 300 sec: 44153.2). Total num frames: 2754609152. Throughput: 0: 44108.7. Samples: 2657487260. Policy #0 lag: (min: 0.0, avg: 12.7, max: 23.0) [2024-06-28 06:31:58,852][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 06:32:00,569][06909] Updated weights for policy 0, policy_version 168133 (0.0036) [2024-06-28 06:32:03,777][06909] Updated weights for policy 0, policy_version 168143 (0.0029) [2024-06-28 06:32:03,852][06674] Fps is (10 sec: 47503.9, 60 sec: 44508.4, 300 sec: 44098.0). Total num frames: 2754854912. Throughput: 0: 44091.0. Samples: 2657756560. Policy #0 lag: (min: 0.0, avg: 12.7, max: 23.0) [2024-06-28 06:32:03,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:32:07,848][06909] Updated weights for policy 0, policy_version 168153 (0.0045) [2024-06-28 06:32:08,850][06674] Fps is (10 sec: 42606.5, 60 sec: 44236.6, 300 sec: 44097.9). Total num frames: 2755035136. Throughput: 0: 44106.1. Samples: 2658015800. Policy #0 lag: (min: 0.0, avg: 12.7, max: 23.0) [2024-06-28 06:32:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:32:11,474][06909] Updated weights for policy 0, policy_version 168163 (0.0040) [2024-06-28 06:32:13,850][06674] Fps is (10 sec: 39329.5, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 2755248128. Throughput: 0: 44164.0. Samples: 2658143960. Policy #0 lag: (min: 0.0, avg: 12.7, max: 23.0) [2024-06-28 06:32:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 06:32:15,538][06909] Updated weights for policy 0, policy_version 168173 (0.0036) [2024-06-28 06:32:18,641][06909] Updated weights for policy 0, policy_version 168183 (0.0031) [2024-06-28 06:32:18,850][06674] Fps is (10 sec: 47514.9, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2755510272. Throughput: 0: 44118.7. Samples: 2658412160. Policy #0 lag: (min: 0.0, avg: 12.7, max: 23.0) [2024-06-28 06:32:18,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 06:32:23,070][06909] Updated weights for policy 0, policy_version 168193 (0.0042) [2024-06-28 06:32:23,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2755706880. Throughput: 0: 44109.4. Samples: 2658682700. Policy #0 lag: (min: 0.0, avg: 12.7, max: 23.0) [2024-06-28 06:32:23,851][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 06:32:25,995][06909] Updated weights for policy 0, policy_version 168203 (0.0027) [2024-06-28 06:32:28,850][06674] Fps is (10 sec: 40959.3, 60 sec: 44236.7, 300 sec: 44098.0). Total num frames: 2755919872. Throughput: 0: 44222.8. Samples: 2658812200. Policy #0 lag: (min: 0.0, avg: 12.7, max: 23.0) [2024-06-28 06:32:28,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:32:30,444][06909] Updated weights for policy 0, policy_version 168213 (0.0031) [2024-06-28 06:32:33,222][06909] Updated weights for policy 0, policy_version 168223 (0.0027) [2024-06-28 06:32:33,852][06674] Fps is (10 sec: 45866.1, 60 sec: 43962.2, 300 sec: 44097.7). Total num frames: 2756165632. Throughput: 0: 44172.2. Samples: 2659083580. Policy #0 lag: (min: 0.0, avg: 12.7, max: 23.0) [2024-06-28 06:32:33,852][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 06:32:37,595][06909] Updated weights for policy 0, policy_version 168233 (0.0035) [2024-06-28 06:32:38,850][06674] Fps is (10 sec: 47514.2, 60 sec: 44783.0, 300 sec: 44264.6). Total num frames: 2756395008. Throughput: 0: 44169.8. Samples: 2659351920. Policy #0 lag: (min: 0.0, avg: 12.7, max: 23.0) [2024-06-28 06:32:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:32:40,726][06909] Updated weights for policy 0, policy_version 168243 (0.0025) [2024-06-28 06:32:43,850][06674] Fps is (10 sec: 42606.8, 60 sec: 44238.2, 300 sec: 44209.0). Total num frames: 2756591616. Throughput: 0: 44384.2. Samples: 2659484460. Policy #0 lag: (min: 0.0, avg: 12.7, max: 23.0) [2024-06-28 06:32:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:32:45,177][06909] Updated weights for policy 0, policy_version 168253 (0.0032) [2024-06-28 06:32:48,242][06909] Updated weights for policy 0, policy_version 168263 (0.0036) [2024-06-28 06:32:48,856][06674] Fps is (10 sec: 44209.9, 60 sec: 43959.3, 300 sec: 44153.3). Total num frames: 2756837376. Throughput: 0: 44168.0. Samples: 2659744300. Policy #0 lag: (min: 0.0, avg: 12.7, max: 23.0) [2024-06-28 06:32:48,856][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:32:52,603][06909] Updated weights for policy 0, policy_version 168273 (0.0032) [2024-06-28 06:32:53,465][06887] Signal inference workers to stop experience collection... (37800 times) [2024-06-28 06:32:53,466][06887] Signal inference workers to resume experience collection... (37800 times) [2024-06-28 06:32:53,492][06909] InferenceWorker_p0-w0: stopping experience collection (37800 times) [2024-06-28 06:32:53,493][06909] InferenceWorker_p0-w0: resuming experience collection (37800 times) [2024-06-28 06:32:53,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44509.8, 300 sec: 44209.0). Total num frames: 2757050368. Throughput: 0: 44391.7. Samples: 2660013420. Policy #0 lag: (min: 0.0, avg: 12.7, max: 23.0) [2024-06-28 06:32:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:32:55,508][06909] Updated weights for policy 0, policy_version 168283 (0.0023) [2024-06-28 06:32:58,850][06674] Fps is (10 sec: 42624.2, 60 sec: 44238.3, 300 sec: 44209.0). Total num frames: 2757263360. Throughput: 0: 44486.2. Samples: 2660145840. Policy #0 lag: (min: 0.0, avg: 12.7, max: 23.0) [2024-06-28 06:32:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:32:59,866][06909] Updated weights for policy 0, policy_version 168293 (0.0044) [2024-06-28 06:33:02,988][06909] Updated weights for policy 0, policy_version 168303 (0.0039) [2024-06-28 06:33:03,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44238.3, 300 sec: 44209.0). Total num frames: 2757509120. Throughput: 0: 44295.9. Samples: 2660405480. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 06:33:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:33:07,394][06909] Updated weights for policy 0, policy_version 168313 (0.0034) [2024-06-28 06:33:08,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 2757705728. Throughput: 0: 44313.3. Samples: 2660676800. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 06:33:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:33:10,294][06909] Updated weights for policy 0, policy_version 168323 (0.0032) [2024-06-28 06:33:13,850][06674] Fps is (10 sec: 40960.0, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 2757918720. Throughput: 0: 44339.2. Samples: 2660807460. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 06:33:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:33:14,641][06909] Updated weights for policy 0, policy_version 168333 (0.0042) [2024-06-28 06:33:17,775][06909] Updated weights for policy 0, policy_version 168343 (0.0022) [2024-06-28 06:33:18,850][06674] Fps is (10 sec: 45875.7, 60 sec: 44236.7, 300 sec: 44264.6). Total num frames: 2758164480. Throughput: 0: 44166.9. Samples: 2661071000. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 06:33:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:33:22,289][06909] Updated weights for policy 0, policy_version 168353 (0.0023) [2024-06-28 06:33:23,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2758361088. Throughput: 0: 44089.8. Samples: 2661335960. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 06:33:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:33:25,240][06909] Updated weights for policy 0, policy_version 168363 (0.0037) [2024-06-28 06:33:28,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44509.9, 300 sec: 44209.0). Total num frames: 2758590464. Throughput: 0: 44109.0. Samples: 2661469360. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 06:33:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:33:29,429][06909] Updated weights for policy 0, policy_version 168373 (0.0033) [2024-06-28 06:33:32,454][06909] Updated weights for policy 0, policy_version 168383 (0.0034) [2024-06-28 06:33:33,850][06674] Fps is (10 sec: 47513.4, 60 sec: 44511.4, 300 sec: 44264.6). Total num frames: 2758836224. Throughput: 0: 44233.5. Samples: 2661734540. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 06:33:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:33:36,905][06909] Updated weights for policy 0, policy_version 168393 (0.0027) [2024-06-28 06:33:38,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2759032832. Throughput: 0: 44252.5. Samples: 2662004780. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 06:33:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:33:39,870][06909] Updated weights for policy 0, policy_version 168403 (0.0041) [2024-06-28 06:33:43,850][06674] Fps is (10 sec: 39321.6, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 2759229440. Throughput: 0: 44135.5. Samples: 2662131940. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 06:33:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:33:44,303][06909] Updated weights for policy 0, policy_version 168413 (0.0023) [2024-06-28 06:33:47,968][06909] Updated weights for policy 0, policy_version 168423 (0.0028) [2024-06-28 06:33:48,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43968.1, 300 sec: 44209.0). Total num frames: 2759475200. Throughput: 0: 44249.3. Samples: 2662396700. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 06:33:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 06:33:48,865][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000168425_2759475200.pth... [2024-06-28 06:33:48,924][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000167778_2748874752.pth [2024-06-28 06:33:51,569][06909] Updated weights for policy 0, policy_version 168433 (0.0033) [2024-06-28 06:33:53,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2759704576. Throughput: 0: 44125.9. Samples: 2662662460. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 06:33:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:33:55,255][06909] Updated weights for policy 0, policy_version 168443 (0.0028) [2024-06-28 06:33:58,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2759901184. Throughput: 0: 44148.5. Samples: 2662794140. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 06:33:58,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 06:33:59,402][06909] Updated weights for policy 0, policy_version 168453 (0.0028) [2024-06-28 06:34:02,658][06909] Updated weights for policy 0, policy_version 168463 (0.0036) [2024-06-28 06:34:03,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 2760130560. Throughput: 0: 44133.3. Samples: 2663057000. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 06:34:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:34:06,718][06909] Updated weights for policy 0, policy_version 168473 (0.0033) [2024-06-28 06:34:08,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2760359936. Throughput: 0: 44175.0. Samples: 2663323840. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 06:34:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:34:10,131][06909] Updated weights for policy 0, policy_version 168483 (0.0029) [2024-06-28 06:34:13,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2760572928. Throughput: 0: 44169.3. Samples: 2663456980. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 06:34:13,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:34:14,149][06909] Updated weights for policy 0, policy_version 168493 (0.0032) [2024-06-28 06:34:17,883][06909] Updated weights for policy 0, policy_version 168503 (0.0026) [2024-06-28 06:34:18,409][06887] Signal inference workers to stop experience collection... (37850 times) [2024-06-28 06:34:18,409][06887] Signal inference workers to resume experience collection... (37850 times) [2024-06-28 06:34:18,425][06909] InferenceWorker_p0-w0: stopping experience collection (37850 times) [2024-06-28 06:34:18,425][06909] InferenceWorker_p0-w0: resuming experience collection (37850 times) [2024-06-28 06:34:18,853][06674] Fps is (10 sec: 44224.6, 60 sec: 43961.7, 300 sec: 44264.1). Total num frames: 2760802304. Throughput: 0: 43951.5. Samples: 2663712480. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 06:34:18,853][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:34:21,509][06909] Updated weights for policy 0, policy_version 168513 (0.0031) [2024-06-28 06:34:23,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 2761031680. Throughput: 0: 43824.0. Samples: 2663976860. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 06:34:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:34:25,518][06909] Updated weights for policy 0, policy_version 168523 (0.0035) [2024-06-28 06:34:28,689][06909] Updated weights for policy 0, policy_version 168533 (0.0032) [2024-06-28 06:34:28,852][06674] Fps is (10 sec: 44240.1, 60 sec: 44235.3, 300 sec: 44208.7). Total num frames: 2761244672. Throughput: 0: 43999.3. Samples: 2664112000. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 06:34:28,852][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 06:34:32,685][06909] Updated weights for policy 0, policy_version 168543 (0.0032) [2024-06-28 06:34:33,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.6, 300 sec: 44209.0). Total num frames: 2761457664. Throughput: 0: 44082.7. Samples: 2664380420. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 06:34:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:34:36,357][06909] Updated weights for policy 0, policy_version 168553 (0.0030) [2024-06-28 06:34:38,850][06674] Fps is (10 sec: 44245.7, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2761687040. Throughput: 0: 43979.1. Samples: 2664641520. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 06:34:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:34:39,829][06909] Updated weights for policy 0, policy_version 168563 (0.0027) [2024-06-28 06:34:43,724][06909] Updated weights for policy 0, policy_version 168573 (0.0044) [2024-06-28 06:34:43,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 2761900032. Throughput: 0: 44299.0. Samples: 2664787600. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 06:34:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:34:47,480][06909] Updated weights for policy 0, policy_version 168583 (0.0039) [2024-06-28 06:34:48,850][06674] Fps is (10 sec: 44237.4, 60 sec: 44236.9, 300 sec: 44209.0). Total num frames: 2762129408. Throughput: 0: 44333.8. Samples: 2665052020. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 06:34:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:34:50,941][06909] Updated weights for policy 0, policy_version 168593 (0.0036) [2024-06-28 06:34:53,850][06674] Fps is (10 sec: 45875.9, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2762358784. Throughput: 0: 44097.9. Samples: 2665308240. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 06:34:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:34:54,717][06909] Updated weights for policy 0, policy_version 168603 (0.0037) [2024-06-28 06:34:58,551][06909] Updated weights for policy 0, policy_version 168613 (0.0029) [2024-06-28 06:34:58,850][06674] Fps is (10 sec: 42598.1, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2762555392. Throughput: 0: 44146.3. Samples: 2665443560. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 06:34:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:35:02,350][06909] Updated weights for policy 0, policy_version 168623 (0.0035) [2024-06-28 06:35:03,850][06674] Fps is (10 sec: 40959.3, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2762768384. Throughput: 0: 44391.1. Samples: 2665709960. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 06:35:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:35:06,079][06909] Updated weights for policy 0, policy_version 168633 (0.0034) [2024-06-28 06:35:08,850][06674] Fps is (10 sec: 47513.2, 60 sec: 44509.8, 300 sec: 44264.6). Total num frames: 2763030528. Throughput: 0: 44370.2. Samples: 2665973520. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 06:35:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:35:09,479][06909] Updated weights for policy 0, policy_version 168643 (0.0028) [2024-06-28 06:35:13,248][06909] Updated weights for policy 0, policy_version 168653 (0.0035) [2024-06-28 06:35:13,852][06674] Fps is (10 sec: 44228.0, 60 sec: 43962.2, 300 sec: 44097.7). Total num frames: 2763210752. Throughput: 0: 44412.9. Samples: 2666110580. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 06:35:13,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:35:16,700][06909] Updated weights for policy 0, policy_version 168663 (0.0025) [2024-06-28 06:35:18,850][06674] Fps is (10 sec: 42598.9, 60 sec: 44238.9, 300 sec: 44264.6). Total num frames: 2763456512. Throughput: 0: 44399.2. Samples: 2666378380. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 06:35:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:35:20,792][06909] Updated weights for policy 0, policy_version 168673 (0.0039) [2024-06-28 06:35:23,850][06674] Fps is (10 sec: 47522.8, 60 sec: 44236.7, 300 sec: 44209.0). Total num frames: 2763685888. Throughput: 0: 44195.0. Samples: 2666630300. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 06:35:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:35:24,451][06909] Updated weights for policy 0, policy_version 168683 (0.0041) [2024-06-28 06:35:28,175][06909] Updated weights for policy 0, policy_version 168693 (0.0035) [2024-06-28 06:35:28,850][06674] Fps is (10 sec: 42597.2, 60 sec: 43965.1, 300 sec: 44153.5). Total num frames: 2763882496. Throughput: 0: 44080.3. Samples: 2666771220. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 06:35:28,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:35:31,696][06909] Updated weights for policy 0, policy_version 168703 (0.0047) [2024-06-28 06:35:33,850][06674] Fps is (10 sec: 42599.0, 60 sec: 44236.9, 300 sec: 44209.1). Total num frames: 2764111872. Throughput: 0: 44005.7. Samples: 2667032280. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 06:35:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 06:35:35,090][06887] Signal inference workers to stop experience collection... (37900 times) [2024-06-28 06:35:35,090][06887] Signal inference workers to resume experience collection... (37900 times) [2024-06-28 06:35:35,108][06909] InferenceWorker_p0-w0: stopping experience collection (37900 times) [2024-06-28 06:35:35,108][06909] InferenceWorker_p0-w0: resuming experience collection (37900 times) [2024-06-28 06:35:35,926][06909] Updated weights for policy 0, policy_version 168713 (0.0032) [2024-06-28 06:35:38,850][06674] Fps is (10 sec: 45876.1, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 2764341248. Throughput: 0: 44137.2. Samples: 2667294420. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 06:35:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:35:39,422][06909] Updated weights for policy 0, policy_version 168723 (0.0041) [2024-06-28 06:35:43,153][06909] Updated weights for policy 0, policy_version 168733 (0.0039) [2024-06-28 06:35:43,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 2764554240. Throughput: 0: 44170.7. Samples: 2667431240. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 06:35:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:35:46,662][06909] Updated weights for policy 0, policy_version 168743 (0.0033) [2024-06-28 06:35:48,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 2764767232. Throughput: 0: 44161.8. Samples: 2667697240. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 06:35:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:35:48,861][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000168748_2764767232.pth... [2024-06-28 06:35:48,922][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000168103_2754199552.pth [2024-06-28 06:35:50,977][06909] Updated weights for policy 0, policy_version 168753 (0.0027) [2024-06-28 06:35:53,856][06674] Fps is (10 sec: 44209.9, 60 sec: 43959.2, 300 sec: 44208.1). Total num frames: 2764996608. Throughput: 0: 43870.2. Samples: 2667947940. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 06:35:53,856][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:35:54,499][06909] Updated weights for policy 0, policy_version 168763 (0.0025) [2024-06-28 06:35:58,391][06909] Updated weights for policy 0, policy_version 168773 (0.0033) [2024-06-28 06:35:58,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 2765209600. Throughput: 0: 43966.4. Samples: 2668088980. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 06:35:58,853][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:36:01,658][06909] Updated weights for policy 0, policy_version 168783 (0.0035) [2024-06-28 06:36:03,850][06674] Fps is (10 sec: 42624.2, 60 sec: 44236.9, 300 sec: 44209.0). Total num frames: 2765422592. Throughput: 0: 43843.1. Samples: 2668351320. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 06:36:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:36:05,639][06909] Updated weights for policy 0, policy_version 168793 (0.0025) [2024-06-28 06:36:08,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 2765651968. Throughput: 0: 43988.5. Samples: 2668609780. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 06:36:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:36:09,369][06909] Updated weights for policy 0, policy_version 168803 (0.0041) [2024-06-28 06:36:13,095][06909] Updated weights for policy 0, policy_version 168813 (0.0022) [2024-06-28 06:36:13,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44238.3, 300 sec: 44098.0). Total num frames: 2765864960. Throughput: 0: 44011.4. Samples: 2668751720. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 06:36:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:36:16,592][06909] Updated weights for policy 0, policy_version 168823 (0.0034) [2024-06-28 06:36:18,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.6, 300 sec: 44153.5). Total num frames: 2766077952. Throughput: 0: 43900.3. Samples: 2669007800. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 06:36:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:36:20,771][06909] Updated weights for policy 0, policy_version 168833 (0.0041) [2024-06-28 06:36:23,808][06909] Updated weights for policy 0, policy_version 168843 (0.0034) [2024-06-28 06:36:23,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43963.8, 300 sec: 44264.6). Total num frames: 2766323712. Throughput: 0: 43904.4. Samples: 2669270120. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 06:36:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:36:28,109][06909] Updated weights for policy 0, policy_version 168853 (0.0038) [2024-06-28 06:36:28,850][06674] Fps is (10 sec: 45876.0, 60 sec: 44237.0, 300 sec: 44098.0). Total num frames: 2766536704. Throughput: 0: 44010.3. Samples: 2669411700. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 06:36:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:36:31,479][06909] Updated weights for policy 0, policy_version 168863 (0.0027) [2024-06-28 06:36:33,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.6, 300 sec: 44153.5). Total num frames: 2766733312. Throughput: 0: 44008.8. Samples: 2669677640. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 06:36:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 06:36:35,227][06909] Updated weights for policy 0, policy_version 168873 (0.0022) [2024-06-28 06:36:38,500][06909] Updated weights for policy 0, policy_version 168883 (0.0029) [2024-06-28 06:36:38,850][06674] Fps is (10 sec: 45874.5, 60 sec: 44236.8, 300 sec: 44264.9). Total num frames: 2766995456. Throughput: 0: 44379.7. Samples: 2669944760. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 06:36:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:36:42,440][06909] Updated weights for policy 0, policy_version 168893 (0.0041) [2024-06-28 06:36:43,850][06674] Fps is (10 sec: 47514.1, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2767208448. Throughput: 0: 44159.2. Samples: 2670076140. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 06:36:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:36:46,212][06909] Updated weights for policy 0, policy_version 168903 (0.0036) [2024-06-28 06:36:48,850][06674] Fps is (10 sec: 39321.6, 60 sec: 43690.6, 300 sec: 44097.9). Total num frames: 2767388672. Throughput: 0: 44265.2. Samples: 2670343260. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 06:36:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:36:49,959][06909] Updated weights for policy 0, policy_version 168913 (0.0039) [2024-06-28 06:36:53,430][06909] Updated weights for policy 0, policy_version 168923 (0.0027) [2024-06-28 06:36:53,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44241.3, 300 sec: 44209.3). Total num frames: 2767650816. Throughput: 0: 44313.0. Samples: 2670603860. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 06:36:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:36:57,446][06909] Updated weights for policy 0, policy_version 168933 (0.0044) [2024-06-28 06:36:58,850][06674] Fps is (10 sec: 47514.2, 60 sec: 44236.9, 300 sec: 44098.3). Total num frames: 2767863808. Throughput: 0: 44217.8. Samples: 2670741520. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 06:36:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:37:00,904][06909] Updated weights for policy 0, policy_version 168943 (0.0030) [2024-06-28 06:37:03,788][06887] Signal inference workers to stop experience collection... (37950 times) [2024-06-28 06:37:03,849][06887] Signal inference workers to resume experience collection... (37950 times) [2024-06-28 06:37:03,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2768060416. Throughput: 0: 44332.5. Samples: 2671002760. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 06:37:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:37:03,851][06909] InferenceWorker_p0-w0: stopping experience collection (37950 times) [2024-06-28 06:37:03,867][06909] InferenceWorker_p0-w0: resuming experience collection (37950 times) [2024-06-28 06:37:04,710][06909] Updated weights for policy 0, policy_version 168953 (0.0036) [2024-06-28 06:37:08,550][06909] Updated weights for policy 0, policy_version 168963 (0.0021) [2024-06-28 06:37:08,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44236.8, 300 sec: 44264.6). Total num frames: 2768306176. Throughput: 0: 44452.0. Samples: 2671270460. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 06:37:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:37:12,233][06909] Updated weights for policy 0, policy_version 168973 (0.0033) [2024-06-28 06:37:13,850][06674] Fps is (10 sec: 45875.8, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2768519168. Throughput: 0: 44281.3. Samples: 2671404360. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 06:37:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:37:15,736][06909] Updated weights for policy 0, policy_version 168983 (0.0027) [2024-06-28 06:37:18,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 2768732160. Throughput: 0: 44165.8. Samples: 2671665100. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 06:37:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:37:19,462][06909] Updated weights for policy 0, policy_version 168993 (0.0025) [2024-06-28 06:37:23,280][06909] Updated weights for policy 0, policy_version 169003 (0.0028) [2024-06-28 06:37:23,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44236.9, 300 sec: 44264.6). Total num frames: 2768977920. Throughput: 0: 44029.4. Samples: 2671926080. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 06:37:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 06:37:27,066][06909] Updated weights for policy 0, policy_version 169013 (0.0031) [2024-06-28 06:37:28,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.7, 300 sec: 44098.3). Total num frames: 2769174528. Throughput: 0: 44120.5. Samples: 2672061560. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 06:37:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 06:37:30,825][06909] Updated weights for policy 0, policy_version 169023 (0.0029) [2024-06-28 06:37:33,850][06674] Fps is (10 sec: 40959.3, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2769387520. Throughput: 0: 43829.3. Samples: 2672315580. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 06:37:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:37:34,514][06909] Updated weights for policy 0, policy_version 169033 (0.0032) [2024-06-28 06:37:38,044][06909] Updated weights for policy 0, policy_version 169043 (0.0033) [2024-06-28 06:37:38,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43963.9, 300 sec: 44209.1). Total num frames: 2769633280. Throughput: 0: 43977.4. Samples: 2672582840. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 06:37:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:37:42,127][06909] Updated weights for policy 0, policy_version 169053 (0.0042) [2024-06-28 06:37:43,850][06674] Fps is (10 sec: 45876.0, 60 sec: 43963.7, 300 sec: 44098.9). Total num frames: 2769846272. Throughput: 0: 43941.8. Samples: 2672718900. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 06:37:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:37:45,600][06909] Updated weights for policy 0, policy_version 169063 (0.0036) [2024-06-28 06:37:48,850][06674] Fps is (10 sec: 40959.7, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 2770042880. Throughput: 0: 43992.1. Samples: 2672982400. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 06:37:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:37:48,907][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000169071_2770059264.pth... [2024-06-28 06:37:48,978][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000168425_2759475200.pth [2024-06-28 06:37:49,351][06909] Updated weights for policy 0, policy_version 169073 (0.0027) [2024-06-28 06:37:53,025][06909] Updated weights for policy 0, policy_version 169083 (0.0031) [2024-06-28 06:37:53,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2770288640. Throughput: 0: 43828.0. Samples: 2673242720. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 06:37:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:37:56,720][06909] Updated weights for policy 0, policy_version 169093 (0.0042) [2024-06-28 06:37:58,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2770501632. Throughput: 0: 43720.9. Samples: 2673371800. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 06:37:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:38:00,558][06909] Updated weights for policy 0, policy_version 169103 (0.0045) [2024-06-28 06:38:03,850][06674] Fps is (10 sec: 42598.8, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2770714624. Throughput: 0: 43943.2. Samples: 2673642540. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 06:38:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:38:04,319][06909] Updated weights for policy 0, policy_version 169113 (0.0033) [2024-06-28 06:38:08,009][06909] Updated weights for policy 0, policy_version 169123 (0.0035) [2024-06-28 06:38:08,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 2770960384. Throughput: 0: 43854.6. Samples: 2673899540. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 06:38:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:38:11,557][06909] Updated weights for policy 0, policy_version 169133 (0.0039) [2024-06-28 06:38:13,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2771156992. Throughput: 0: 43968.0. Samples: 2674040120. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 06:38:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:38:15,353][06909] Updated weights for policy 0, policy_version 169143 (0.0021) [2024-06-28 06:38:18,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2771369984. Throughput: 0: 44274.7. Samples: 2674307940. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 06:38:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 06:38:19,114][06909] Updated weights for policy 0, policy_version 169153 (0.0038) [2024-06-28 06:38:22,851][06909] Updated weights for policy 0, policy_version 169163 (0.0033) [2024-06-28 06:38:23,850][06674] Fps is (10 sec: 47513.4, 60 sec: 44236.7, 300 sec: 44209.0). Total num frames: 2771632128. Throughput: 0: 44090.5. Samples: 2674566920. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 06:38:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:38:26,297][06909] Updated weights for policy 0, policy_version 169173 (0.0027) [2024-06-28 06:38:28,850][06674] Fps is (10 sec: 47513.2, 60 sec: 44509.7, 300 sec: 44097.9). Total num frames: 2771845120. Throughput: 0: 44166.9. Samples: 2674706420. Policy #0 lag: (min: 1.0, avg: 10.1, max: 21.0) [2024-06-28 06:38:28,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:38:30,347][06909] Updated weights for policy 0, policy_version 169183 (0.0038) [2024-06-28 06:38:33,778][06909] Updated weights for policy 0, policy_version 169193 (0.0032) [2024-06-28 06:38:33,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 2772058112. Throughput: 0: 44073.3. Samples: 2674965700. Policy #0 lag: (min: 1.0, avg: 10.1, max: 21.0) [2024-06-28 06:38:33,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:38:36,749][06887] Signal inference workers to stop experience collection... (38000 times) [2024-06-28 06:38:36,793][06909] InferenceWorker_p0-w0: stopping experience collection (38000 times) [2024-06-28 06:38:36,801][06887] Signal inference workers to resume experience collection... (38000 times) [2024-06-28 06:38:36,808][06909] InferenceWorker_p0-w0: resuming experience collection (38000 times) [2024-06-28 06:38:37,570][06909] Updated weights for policy 0, policy_version 169203 (0.0048) [2024-06-28 06:38:38,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43963.6, 300 sec: 44209.0). Total num frames: 2772271104. Throughput: 0: 43943.1. Samples: 2675220160. Policy #0 lag: (min: 1.0, avg: 10.1, max: 21.0) [2024-06-28 06:38:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:38:41,405][06909] Updated weights for policy 0, policy_version 169213 (0.0047) [2024-06-28 06:38:43,850][06674] Fps is (10 sec: 40959.4, 60 sec: 43690.5, 300 sec: 44042.4). Total num frames: 2772467712. Throughput: 0: 44065.1. Samples: 2675354740. Policy #0 lag: (min: 1.0, avg: 10.1, max: 21.0) [2024-06-28 06:38:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 06:38:45,305][06909] Updated weights for policy 0, policy_version 169223 (0.0038) [2024-06-28 06:38:48,796][06909] Updated weights for policy 0, policy_version 169233 (0.0032) [2024-06-28 06:38:48,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44509.8, 300 sec: 44098.0). Total num frames: 2772713472. Throughput: 0: 44044.9. Samples: 2675624560. Policy #0 lag: (min: 1.0, avg: 10.1, max: 21.0) [2024-06-28 06:38:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:38:52,919][06909] Updated weights for policy 0, policy_version 169243 (0.0038) [2024-06-28 06:38:53,850][06674] Fps is (10 sec: 47514.2, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 2772942848. Throughput: 0: 44099.1. Samples: 2675884000. Policy #0 lag: (min: 1.0, avg: 10.1, max: 21.0) [2024-06-28 06:38:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:38:56,320][06909] Updated weights for policy 0, policy_version 169253 (0.0027) [2024-06-28 06:38:58,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2773123072. Throughput: 0: 43954.6. Samples: 2676018080. Policy #0 lag: (min: 1.0, avg: 10.1, max: 21.0) [2024-06-28 06:38:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:39:00,118][06909] Updated weights for policy 0, policy_version 169263 (0.0035) [2024-06-28 06:39:03,503][06909] Updated weights for policy 0, policy_version 169273 (0.0034) [2024-06-28 06:39:03,850][06674] Fps is (10 sec: 42598.7, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2773368832. Throughput: 0: 44054.3. Samples: 2676290380. Policy #0 lag: (min: 1.0, avg: 10.1, max: 21.0) [2024-06-28 06:39:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:39:07,596][06909] Updated weights for policy 0, policy_version 169283 (0.0040) [2024-06-28 06:39:08,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 2773581824. Throughput: 0: 43901.4. Samples: 2676542480. Policy #0 lag: (min: 1.0, avg: 10.1, max: 21.0) [2024-06-28 06:39:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:39:11,127][06909] Updated weights for policy 0, policy_version 169293 (0.0028) [2024-06-28 06:39:13,853][06674] Fps is (10 sec: 40959.5, 60 sec: 43690.6, 300 sec: 43987.3). Total num frames: 2773778432. Throughput: 0: 43636.1. Samples: 2676670040. Policy #0 lag: (min: 1.0, avg: 10.1, max: 21.0) [2024-06-28 06:39:13,854][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:39:15,035][06909] Updated weights for policy 0, policy_version 169303 (0.0043) [2024-06-28 06:39:18,542][06909] Updated weights for policy 0, policy_version 169313 (0.0035) [2024-06-28 06:39:18,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2774024192. Throughput: 0: 43831.1. Samples: 2676938100. Policy #0 lag: (min: 1.0, avg: 10.1, max: 21.0) [2024-06-28 06:39:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:39:22,468][06909] Updated weights for policy 0, policy_version 169323 (0.0027) [2024-06-28 06:39:23,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43417.6, 300 sec: 44042.7). Total num frames: 2774237184. Throughput: 0: 44044.5. Samples: 2677202160. Policy #0 lag: (min: 1.0, avg: 10.1, max: 21.0) [2024-06-28 06:39:23,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:39:25,817][06909] Updated weights for policy 0, policy_version 169333 (0.0031) [2024-06-28 06:39:28,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43144.7, 300 sec: 43986.9). Total num frames: 2774433792. Throughput: 0: 44074.9. Samples: 2677338100. Policy #0 lag: (min: 1.0, avg: 10.1, max: 21.0) [2024-06-28 06:39:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:39:30,045][06909] Updated weights for policy 0, policy_version 169343 (0.0037) [2024-06-28 06:39:33,249][06909] Updated weights for policy 0, policy_version 169353 (0.0031) [2024-06-28 06:39:33,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2774695936. Throughput: 0: 44123.9. Samples: 2677610140. Policy #0 lag: (min: 1.0, avg: 11.0, max: 23.0) [2024-06-28 06:39:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:39:37,221][06909] Updated weights for policy 0, policy_version 169363 (0.0031) [2024-06-28 06:39:38,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2774892544. Throughput: 0: 44105.0. Samples: 2677868720. Policy #0 lag: (min: 1.0, avg: 11.0, max: 23.0) [2024-06-28 06:39:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:39:40,531][06909] Updated weights for policy 0, policy_version 169373 (0.0026) [2024-06-28 06:39:43,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2775105536. Throughput: 0: 44167.2. Samples: 2678005600. Policy #0 lag: (min: 1.0, avg: 11.0, max: 23.0) [2024-06-28 06:39:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:39:44,537][06909] Updated weights for policy 0, policy_version 169383 (0.0023) [2024-06-28 06:39:48,365][06909] Updated weights for policy 0, policy_version 169393 (0.0043) [2024-06-28 06:39:48,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 2775334912. Throughput: 0: 43921.3. Samples: 2678266840. Policy #0 lag: (min: 1.0, avg: 11.0, max: 23.0) [2024-06-28 06:39:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:39:48,865][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000169393_2775334912.pth... [2024-06-28 06:39:48,918][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000168748_2764767232.pth [2024-06-28 06:39:52,400][06909] Updated weights for policy 0, policy_version 169403 (0.0030) [2024-06-28 06:39:53,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43690.7, 300 sec: 44097.9). Total num frames: 2775564288. Throughput: 0: 44151.0. Samples: 2678529280. Policy #0 lag: (min: 1.0, avg: 11.0, max: 23.0) [2024-06-28 06:39:53,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 06:39:55,540][06909] Updated weights for policy 0, policy_version 169413 (0.0033) [2024-06-28 06:39:56,374][06887] Signal inference workers to stop experience collection... (38050 times) [2024-06-28 06:39:56,422][06887] Signal inference workers to resume experience collection... (38050 times) [2024-06-28 06:39:56,423][06909] InferenceWorker_p0-w0: stopping experience collection (38050 times) [2024-06-28 06:39:56,438][06909] InferenceWorker_p0-w0: resuming experience collection (38050 times) [2024-06-28 06:39:58,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2775777280. Throughput: 0: 44343.2. Samples: 2678665480. Policy #0 lag: (min: 1.0, avg: 11.0, max: 23.0) [2024-06-28 06:39:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:39:59,574][06909] Updated weights for policy 0, policy_version 169423 (0.0022) [2024-06-28 06:40:02,780][06909] Updated weights for policy 0, policy_version 169433 (0.0024) [2024-06-28 06:40:03,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2776023040. Throughput: 0: 44168.8. Samples: 2678925700. Policy #0 lag: (min: 1.0, avg: 11.0, max: 23.0) [2024-06-28 06:40:03,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:40:07,207][06909] Updated weights for policy 0, policy_version 169443 (0.0040) [2024-06-28 06:40:08,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 44098.2). Total num frames: 2776219648. Throughput: 0: 44322.6. Samples: 2679196680. Policy #0 lag: (min: 1.0, avg: 11.0, max: 23.0) [2024-06-28 06:40:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:40:10,429][06909] Updated weights for policy 0, policy_version 169453 (0.0036) [2024-06-28 06:40:13,850][06674] Fps is (10 sec: 44237.3, 60 sec: 44783.0, 300 sec: 44097.9). Total num frames: 2776465408. Throughput: 0: 44291.1. Samples: 2679331200. Policy #0 lag: (min: 1.0, avg: 11.0, max: 23.0) [2024-06-28 06:40:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 06:40:14,298][06909] Updated weights for policy 0, policy_version 169463 (0.0027) [2024-06-28 06:40:17,700][06909] Updated weights for policy 0, policy_version 169473 (0.0035) [2024-06-28 06:40:18,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2776678400. Throughput: 0: 44066.7. Samples: 2679593140. Policy #0 lag: (min: 1.0, avg: 11.0, max: 23.0) [2024-06-28 06:40:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:40:22,124][06909] Updated weights for policy 0, policy_version 169483 (0.0035) [2024-06-28 06:40:23,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43963.8, 300 sec: 44042.5). Total num frames: 2776875008. Throughput: 0: 44276.4. Samples: 2679861160. Policy #0 lag: (min: 1.0, avg: 11.0, max: 23.0) [2024-06-28 06:40:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:40:25,407][06909] Updated weights for policy 0, policy_version 169493 (0.0030) [2024-06-28 06:40:28,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44782.8, 300 sec: 44097.9). Total num frames: 2777120768. Throughput: 0: 43978.6. Samples: 2679984640. Policy #0 lag: (min: 1.0, avg: 11.0, max: 23.0) [2024-06-28 06:40:28,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:40:29,427][06909] Updated weights for policy 0, policy_version 169503 (0.0035) [2024-06-28 06:40:33,082][06909] Updated weights for policy 0, policy_version 169513 (0.0034) [2024-06-28 06:40:33,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2777333760. Throughput: 0: 43842.2. Samples: 2680239740. Policy #0 lag: (min: 1.0, avg: 11.0, max: 23.0) [2024-06-28 06:40:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:40:36,884][06909] Updated weights for policy 0, policy_version 169523 (0.0033) [2024-06-28 06:40:38,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2777530368. Throughput: 0: 44055.6. Samples: 2680511780. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 06:40:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:40:40,465][06909] Updated weights for policy 0, policy_version 169533 (0.0037) [2024-06-28 06:40:43,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44509.8, 300 sec: 44097.9). Total num frames: 2777776128. Throughput: 0: 43836.4. Samples: 2680638120. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 06:40:43,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:40:44,452][06909] Updated weights for policy 0, policy_version 169543 (0.0029) [2024-06-28 06:40:48,021][06909] Updated weights for policy 0, policy_version 169553 (0.0035) [2024-06-28 06:40:48,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44236.8, 300 sec: 44043.3). Total num frames: 2777989120. Throughput: 0: 43985.5. Samples: 2680905040. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 06:40:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:40:51,846][06909] Updated weights for policy 0, policy_version 169563 (0.0024) [2024-06-28 06:40:53,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2778185728. Throughput: 0: 43760.1. Samples: 2681165880. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 06:40:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:40:55,625][06909] Updated weights for policy 0, policy_version 169573 (0.0030) [2024-06-28 06:40:58,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2778415104. Throughput: 0: 43558.6. Samples: 2681291340. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 06:40:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 06:40:59,186][06909] Updated weights for policy 0, policy_version 169583 (0.0031) [2024-06-28 06:41:03,106][06909] Updated weights for policy 0, policy_version 169593 (0.0038) [2024-06-28 06:41:03,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2778644480. Throughput: 0: 43599.1. Samples: 2681555100. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 06:41:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:41:06,947][06909] Updated weights for policy 0, policy_version 169603 (0.0033) [2024-06-28 06:41:08,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2778857472. Throughput: 0: 43537.6. Samples: 2681820360. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 06:41:08,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:41:10,850][06909] Updated weights for policy 0, policy_version 169613 (0.0033) [2024-06-28 06:41:13,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43417.6, 300 sec: 44042.4). Total num frames: 2779070464. Throughput: 0: 43667.2. Samples: 2681949660. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 06:41:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:41:14,097][06909] Updated weights for policy 0, policy_version 169623 (0.0033) [2024-06-28 06:41:18,102][06909] Updated weights for policy 0, policy_version 169633 (0.0036) [2024-06-28 06:41:18,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 2779299840. Throughput: 0: 43839.1. Samples: 2682212500. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 06:41:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:41:20,485][06887] Signal inference workers to stop experience collection... (38100 times) [2024-06-28 06:41:20,529][06909] InferenceWorker_p0-w0: stopping experience collection (38100 times) [2024-06-28 06:41:20,537][06887] Signal inference workers to resume experience collection... (38100 times) [2024-06-28 06:41:20,544][06909] InferenceWorker_p0-w0: resuming experience collection (38100 times) [2024-06-28 06:41:21,725][06909] Updated weights for policy 0, policy_version 169643 (0.0029) [2024-06-28 06:41:23,852][06674] Fps is (10 sec: 44227.7, 60 sec: 43962.2, 300 sec: 43986.6). Total num frames: 2779512832. Throughput: 0: 43802.1. Samples: 2682482960. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 06:41:23,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:41:25,385][06909] Updated weights for policy 0, policy_version 169653 (0.0036) [2024-06-28 06:41:28,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43690.7, 300 sec: 44097.9). Total num frames: 2779742208. Throughput: 0: 44001.7. Samples: 2682618200. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 06:41:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:41:29,061][06909] Updated weights for policy 0, policy_version 169663 (0.0038) [2024-06-28 06:41:32,843][06909] Updated weights for policy 0, policy_version 169673 (0.0026) [2024-06-28 06:41:33,850][06674] Fps is (10 sec: 44246.0, 60 sec: 43690.7, 300 sec: 43931.4). Total num frames: 2779955200. Throughput: 0: 43892.9. Samples: 2682880220. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 06:41:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:41:36,266][06909] Updated weights for policy 0, policy_version 169683 (0.0026) [2024-06-28 06:41:38,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2780184576. Throughput: 0: 44029.7. Samples: 2683147220. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 06:41:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:41:39,986][06909] Updated weights for policy 0, policy_version 169693 (0.0023) [2024-06-28 06:41:43,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 2780397568. Throughput: 0: 44167.2. Samples: 2683278860. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 06:41:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:41:44,006][06909] Updated weights for policy 0, policy_version 169703 (0.0024) [2024-06-28 06:41:47,826][06909] Updated weights for policy 0, policy_version 169713 (0.0035) [2024-06-28 06:41:48,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 2780610560. Throughput: 0: 44115.1. Samples: 2683540280. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 06:41:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:41:48,864][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000169715_2780610560.pth... [2024-06-28 06:41:48,922][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000169071_2770059264.pth [2024-06-28 06:41:51,377][06909] Updated weights for policy 0, policy_version 169723 (0.0029) [2024-06-28 06:41:53,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 2780856320. Throughput: 0: 44023.6. Samples: 2683801420. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 06:41:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:41:55,031][06909] Updated weights for policy 0, policy_version 169733 (0.0036) [2024-06-28 06:41:58,811][06909] Updated weights for policy 0, policy_version 169743 (0.0035) [2024-06-28 06:41:58,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2781069312. Throughput: 0: 44257.8. Samples: 2683941260. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 06:41:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:42:02,386][06909] Updated weights for policy 0, policy_version 169753 (0.0036) [2024-06-28 06:42:03,852][06674] Fps is (10 sec: 42589.9, 60 sec: 43962.2, 300 sec: 43986.6). Total num frames: 2781282304. Throughput: 0: 44193.6. Samples: 2684201300. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 06:42:03,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:42:06,238][06909] Updated weights for policy 0, policy_version 169763 (0.0024) [2024-06-28 06:42:08,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 2781511680. Throughput: 0: 43994.9. Samples: 2684462640. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 06:42:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:42:10,275][06909] Updated weights for policy 0, policy_version 169773 (0.0029) [2024-06-28 06:42:13,669][06909] Updated weights for policy 0, policy_version 169783 (0.0049) [2024-06-28 06:42:13,850][06674] Fps is (10 sec: 44245.7, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2781724672. Throughput: 0: 44071.6. Samples: 2684601420. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 06:42:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:42:17,964][06909] Updated weights for policy 0, policy_version 169793 (0.0022) [2024-06-28 06:42:18,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 2781921280. Throughput: 0: 43851.4. Samples: 2684853540. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 06:42:18,856][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:42:21,344][06909] Updated weights for policy 0, policy_version 169803 (0.0039) [2024-06-28 06:42:23,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44238.3, 300 sec: 44042.4). Total num frames: 2782167040. Throughput: 0: 43801.9. Samples: 2685118300. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 06:42:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:42:25,181][06909] Updated weights for policy 0, policy_version 169813 (0.0035) [2024-06-28 06:42:28,478][06909] Updated weights for policy 0, policy_version 169823 (0.0033) [2024-06-28 06:42:28,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2782380032. Throughput: 0: 43963.9. Samples: 2685257240. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 06:42:28,853][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:42:32,653][06909] Updated weights for policy 0, policy_version 169833 (0.0030) [2024-06-28 06:42:33,850][06674] Fps is (10 sec: 40959.3, 60 sec: 43690.5, 300 sec: 43875.8). Total num frames: 2782576640. Throughput: 0: 43948.3. Samples: 2685517960. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 06:42:33,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:42:36,109][06909] Updated weights for policy 0, policy_version 169843 (0.0037) [2024-06-28 06:42:38,852][06674] Fps is (10 sec: 47503.2, 60 sec: 44508.2, 300 sec: 44097.6). Total num frames: 2782855168. Throughput: 0: 44109.8. Samples: 2685786460. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 06:42:38,853][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:42:39,785][06909] Updated weights for policy 0, policy_version 169853 (0.0031) [2024-06-28 06:42:43,586][06909] Updated weights for policy 0, policy_version 169863 (0.0039) [2024-06-28 06:42:43,850][06674] Fps is (10 sec: 47514.3, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2783051776. Throughput: 0: 44101.8. Samples: 2685925840. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 06:42:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:42:47,424][06909] Updated weights for policy 0, policy_version 169873 (0.0022) [2024-06-28 06:42:47,592][06887] Signal inference workers to stop experience collection... (38150 times) [2024-06-28 06:42:47,592][06887] Signal inference workers to resume experience collection... (38150 times) [2024-06-28 06:42:47,630][06909] InferenceWorker_p0-w0: stopping experience collection (38150 times) [2024-06-28 06:42:47,630][06909] InferenceWorker_p0-w0: resuming experience collection (38150 times) [2024-06-28 06:42:48,850][06674] Fps is (10 sec: 40968.9, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 2783264768. Throughput: 0: 44170.3. Samples: 2686188880. Policy #0 lag: (min: 0.0, avg: 11.7, max: 22.0) [2024-06-28 06:42:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:42:50,918][06909] Updated weights for policy 0, policy_version 169883 (0.0039) [2024-06-28 06:42:53,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2783510528. Throughput: 0: 44043.6. Samples: 2686444600. Policy #0 lag: (min: 0.0, avg: 11.7, max: 22.0) [2024-06-28 06:42:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:42:54,727][06909] Updated weights for policy 0, policy_version 169893 (0.0032) [2024-06-28 06:42:58,425][06909] Updated weights for policy 0, policy_version 169903 (0.0026) [2024-06-28 06:42:58,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2783707136. Throughput: 0: 44009.0. Samples: 2686581820. Policy #0 lag: (min: 0.0, avg: 11.7, max: 22.0) [2024-06-28 06:42:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:43:01,879][06909] Updated weights for policy 0, policy_version 169913 (0.0038) [2024-06-28 06:43:03,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43965.2, 300 sec: 43931.3). Total num frames: 2783920128. Throughput: 0: 44360.1. Samples: 2686849740. Policy #0 lag: (min: 0.0, avg: 11.7, max: 22.0) [2024-06-28 06:43:03,853][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:43:05,648][06909] Updated weights for policy 0, policy_version 169923 (0.0031) [2024-06-28 06:43:08,850][06674] Fps is (10 sec: 45874.5, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2784165888. Throughput: 0: 44199.0. Samples: 2687107260. Policy #0 lag: (min: 0.0, avg: 11.7, max: 22.0) [2024-06-28 06:43:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:43:09,629][06909] Updated weights for policy 0, policy_version 169933 (0.0038) [2024-06-28 06:43:13,166][06909] Updated weights for policy 0, policy_version 169943 (0.0026) [2024-06-28 06:43:13,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2784362496. Throughput: 0: 44205.0. Samples: 2687246460. Policy #0 lag: (min: 0.0, avg: 11.7, max: 22.0) [2024-06-28 06:43:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:43:16,908][06909] Updated weights for policy 0, policy_version 169953 (0.0040) [2024-06-28 06:43:18,850][06674] Fps is (10 sec: 40960.7, 60 sec: 44236.9, 300 sec: 43875.8). Total num frames: 2784575488. Throughput: 0: 44197.5. Samples: 2687506840. Policy #0 lag: (min: 0.0, avg: 11.7, max: 22.0) [2024-06-28 06:43:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:43:20,595][06909] Updated weights for policy 0, policy_version 169963 (0.0027) [2024-06-28 06:43:23,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2784821248. Throughput: 0: 44024.9. Samples: 2687767480. Policy #0 lag: (min: 0.0, avg: 11.7, max: 22.0) [2024-06-28 06:43:23,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 06:43:24,649][06909] Updated weights for policy 0, policy_version 169973 (0.0035) [2024-06-28 06:43:28,155][06909] Updated weights for policy 0, policy_version 169983 (0.0020) [2024-06-28 06:43:28,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2785034240. Throughput: 0: 44024.0. Samples: 2687906920. Policy #0 lag: (min: 0.0, avg: 11.7, max: 22.0) [2024-06-28 06:43:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:43:32,071][06909] Updated weights for policy 0, policy_version 169993 (0.0026) [2024-06-28 06:43:33,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44510.0, 300 sec: 43986.9). Total num frames: 2785247232. Throughput: 0: 43966.8. Samples: 2688167380. Policy #0 lag: (min: 0.0, avg: 11.7, max: 22.0) [2024-06-28 06:43:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:43:35,620][06909] Updated weights for policy 0, policy_version 170003 (0.0046) [2024-06-28 06:43:38,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43692.3, 300 sec: 44098.0). Total num frames: 2785476608. Throughput: 0: 44158.2. Samples: 2688431720. Policy #0 lag: (min: 0.0, avg: 11.7, max: 22.0) [2024-06-28 06:43:38,854][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:43:39,433][06909] Updated weights for policy 0, policy_version 170013 (0.0026) [2024-06-28 06:43:42,879][06909] Updated weights for policy 0, policy_version 170023 (0.0023) [2024-06-28 06:43:43,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2785689600. Throughput: 0: 44174.6. Samples: 2688569680. Policy #0 lag: (min: 0.0, avg: 11.7, max: 22.0) [2024-06-28 06:43:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:43:46,714][06909] Updated weights for policy 0, policy_version 170033 (0.0022) [2024-06-28 06:43:48,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 2785886208. Throughput: 0: 43905.7. Samples: 2688825500. Policy #0 lag: (min: 0.0, avg: 11.7, max: 22.0) [2024-06-28 06:43:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:43:49,001][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000170038_2785902592.pth... [2024-06-28 06:43:49,051][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000169393_2775334912.pth [2024-06-28 06:43:50,673][06909] Updated weights for policy 0, policy_version 170043 (0.0050) [2024-06-28 06:43:53,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.6, 300 sec: 44098.0). Total num frames: 2786131968. Throughput: 0: 43990.7. Samples: 2689086840. Policy #0 lag: (min: 0.0, avg: 11.4, max: 24.0) [2024-06-28 06:43:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:43:54,011][06909] Updated weights for policy 0, policy_version 170053 (0.0034) [2024-06-28 06:43:58,349][06909] Updated weights for policy 0, policy_version 170063 (0.0035) [2024-06-28 06:43:58,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 2786344960. Throughput: 0: 44047.0. Samples: 2689228580. Policy #0 lag: (min: 0.0, avg: 11.4, max: 24.0) [2024-06-28 06:43:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:44:01,489][06909] Updated weights for policy 0, policy_version 170073 (0.0037) [2024-06-28 06:44:03,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2786557952. Throughput: 0: 44044.4. Samples: 2689488840. Policy #0 lag: (min: 0.0, avg: 11.4, max: 24.0) [2024-06-28 06:44:03,853][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:44:05,534][06909] Updated weights for policy 0, policy_version 170083 (0.0027) [2024-06-28 06:44:08,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 2786787328. Throughput: 0: 44053.8. Samples: 2689749900. Policy #0 lag: (min: 0.0, avg: 11.4, max: 24.0) [2024-06-28 06:44:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:44:09,188][06909] Updated weights for policy 0, policy_version 170093 (0.0035) [2024-06-28 06:44:12,718][06909] Updated weights for policy 0, policy_version 170103 (0.0038) [2024-06-28 06:44:13,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2787000320. Throughput: 0: 43970.3. Samples: 2689885580. Policy #0 lag: (min: 0.0, avg: 11.4, max: 24.0) [2024-06-28 06:44:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:44:16,577][06909] Updated weights for policy 0, policy_version 170113 (0.0027) [2024-06-28 06:44:18,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2787213312. Throughput: 0: 43988.5. Samples: 2690146860. Policy #0 lag: (min: 0.0, avg: 11.4, max: 24.0) [2024-06-28 06:44:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:44:19,673][06887] Signal inference workers to stop experience collection... (38200 times) [2024-06-28 06:44:19,696][06909] InferenceWorker_p0-w0: stopping experience collection (38200 times) [2024-06-28 06:44:19,733][06887] Signal inference workers to resume experience collection... (38200 times) [2024-06-28 06:44:19,733][06909] InferenceWorker_p0-w0: resuming experience collection (38200 times) [2024-06-28 06:44:20,304][06909] Updated weights for policy 0, policy_version 170123 (0.0026) [2024-06-28 06:44:23,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 2787442688. Throughput: 0: 43939.2. Samples: 2690408980. Policy #0 lag: (min: 0.0, avg: 11.4, max: 24.0) [2024-06-28 06:44:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:44:23,902][06909] Updated weights for policy 0, policy_version 170133 (0.0026) [2024-06-28 06:44:28,165][06909] Updated weights for policy 0, policy_version 170143 (0.0035) [2024-06-28 06:44:28,850][06674] Fps is (10 sec: 44236.0, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 2787655680. Throughput: 0: 43947.0. Samples: 2690547300. Policy #0 lag: (min: 0.0, avg: 11.4, max: 24.0) [2024-06-28 06:44:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:44:31,240][06909] Updated weights for policy 0, policy_version 170153 (0.0032) [2024-06-28 06:44:33,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2787885056. Throughput: 0: 44075.7. Samples: 2690808900. Policy #0 lag: (min: 0.0, avg: 11.4, max: 24.0) [2024-06-28 06:44:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:44:35,279][06909] Updated weights for policy 0, policy_version 170163 (0.0037) [2024-06-28 06:44:38,499][06909] Updated weights for policy 0, policy_version 170173 (0.0031) [2024-06-28 06:44:38,852][06674] Fps is (10 sec: 45866.5, 60 sec: 43962.3, 300 sec: 44097.7). Total num frames: 2788114432. Throughput: 0: 44240.7. Samples: 2691077760. Policy #0 lag: (min: 0.0, avg: 11.4, max: 24.0) [2024-06-28 06:44:38,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:44:42,484][06909] Updated weights for policy 0, policy_version 170183 (0.0032) [2024-06-28 06:44:43,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2788311040. Throughput: 0: 44029.0. Samples: 2691209880. Policy #0 lag: (min: 0.0, avg: 11.4, max: 24.0) [2024-06-28 06:44:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:44:46,121][06909] Updated weights for policy 0, policy_version 170193 (0.0038) [2024-06-28 06:44:48,850][06674] Fps is (10 sec: 44245.3, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 2788556800. Throughput: 0: 44146.6. Samples: 2691475440. Policy #0 lag: (min: 0.0, avg: 11.4, max: 24.0) [2024-06-28 06:44:48,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:44:49,648][06909] Updated weights for policy 0, policy_version 170203 (0.0021) [2024-06-28 06:44:53,671][06909] Updated weights for policy 0, policy_version 170213 (0.0033) [2024-06-28 06:44:53,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2788769792. Throughput: 0: 44355.1. Samples: 2691745880. Policy #0 lag: (min: 0.0, avg: 11.4, max: 24.0) [2024-06-28 06:44:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:44:56,914][06909] Updated weights for policy 0, policy_version 170223 (0.0037) [2024-06-28 06:44:58,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2788999168. Throughput: 0: 44267.8. Samples: 2691877640. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 06:44:58,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:45:00,812][06909] Updated weights for policy 0, policy_version 170233 (0.0031) [2024-06-28 06:45:03,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2789195776. Throughput: 0: 44194.1. Samples: 2692135600. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 06:45:03,856][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:45:05,002][06909] Updated weights for policy 0, policy_version 170243 (0.0035) [2024-06-28 06:45:08,253][06909] Updated weights for policy 0, policy_version 170253 (0.0036) [2024-06-28 06:45:08,850][06674] Fps is (10 sec: 44237.3, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2789441536. Throughput: 0: 44330.6. Samples: 2692403860. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 06:45:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:45:12,186][06909] Updated weights for policy 0, policy_version 170263 (0.0034) [2024-06-28 06:45:13,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2789654528. Throughput: 0: 44230.9. Samples: 2692537680. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 06:45:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:45:15,438][06909] Updated weights for policy 0, policy_version 170273 (0.0027) [2024-06-28 06:45:18,850][06674] Fps is (10 sec: 42598.7, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2789867520. Throughput: 0: 44229.8. Samples: 2692799240. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 06:45:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:45:19,769][06909] Updated weights for policy 0, policy_version 170283 (0.0037) [2024-06-28 06:45:23,600][06909] Updated weights for policy 0, policy_version 170293 (0.0036) [2024-06-28 06:45:23,850][06674] Fps is (10 sec: 44236.2, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 2790096896. Throughput: 0: 44191.7. Samples: 2693066300. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 06:45:23,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 06:45:26,876][06909] Updated weights for policy 0, policy_version 170303 (0.0031) [2024-06-28 06:45:28,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 2790309888. Throughput: 0: 44224.9. Samples: 2693200000. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 06:45:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:45:30,690][06909] Updated weights for policy 0, policy_version 170313 (0.0030) [2024-06-28 06:45:33,852][06674] Fps is (10 sec: 44228.2, 60 sec: 44235.2, 300 sec: 44097.7). Total num frames: 2790539264. Throughput: 0: 44221.7. Samples: 2693465500. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 06:45:33,852][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 06:45:34,556][06909] Updated weights for policy 0, policy_version 170323 (0.0034) [2024-06-28 06:45:37,844][06909] Updated weights for policy 0, policy_version 170333 (0.0027) [2024-06-28 06:45:38,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44238.3, 300 sec: 44042.4). Total num frames: 2790768640. Throughput: 0: 44032.9. Samples: 2693727360. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 06:45:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:45:41,911][06909] Updated weights for policy 0, policy_version 170343 (0.0032) [2024-06-28 06:45:43,850][06674] Fps is (10 sec: 45884.7, 60 sec: 44782.9, 300 sec: 44098.0). Total num frames: 2790998016. Throughput: 0: 44217.0. Samples: 2693867400. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 06:45:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:45:45,222][06909] Updated weights for policy 0, policy_version 170353 (0.0022) [2024-06-28 06:45:48,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2791211008. Throughput: 0: 44420.0. Samples: 2694134500. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 06:45:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:45:48,876][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000170362_2791211008.pth... [2024-06-28 06:45:48,931][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000169715_2780610560.pth [2024-06-28 06:45:49,097][06909] Updated weights for policy 0, policy_version 170363 (0.0037) [2024-06-28 06:45:52,698][06909] Updated weights for policy 0, policy_version 170373 (0.0036) [2024-06-28 06:45:53,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2791424000. Throughput: 0: 44219.2. Samples: 2694393720. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 06:45:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:45:56,544][06909] Updated weights for policy 0, policy_version 170383 (0.0041) [2024-06-28 06:45:58,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2791653376. Throughput: 0: 44079.9. Samples: 2694521280. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 06:45:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:46:00,453][06909] Updated weights for policy 0, policy_version 170393 (0.0029) [2024-06-28 06:46:03,850][06674] Fps is (10 sec: 44236.2, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 2791866368. Throughput: 0: 44166.5. Samples: 2694786740. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 06:46:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:46:03,942][06909] Updated weights for policy 0, policy_version 170403 (0.0044) [2024-06-28 06:46:05,299][06887] Signal inference workers to stop experience collection... (38250 times) [2024-06-28 06:46:05,300][06887] Signal inference workers to resume experience collection... (38250 times) [2024-06-28 06:46:05,312][06909] InferenceWorker_p0-w0: stopping experience collection (38250 times) [2024-06-28 06:46:05,312][06909] InferenceWorker_p0-w0: resuming experience collection (38250 times) [2024-06-28 06:46:07,683][06909] Updated weights for policy 0, policy_version 170413 (0.0040) [2024-06-28 06:46:08,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2792079360. Throughput: 0: 44097.4. Samples: 2695050680. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 06:46:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:46:11,651][06909] Updated weights for policy 0, policy_version 170423 (0.0022) [2024-06-28 06:46:13,850][06674] Fps is (10 sec: 45875.7, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 2792325120. Throughput: 0: 44074.2. Samples: 2695183340. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 06:46:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:46:15,153][06909] Updated weights for policy 0, policy_version 170433 (0.0036) [2024-06-28 06:46:18,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 44098.3). Total num frames: 2792521728. Throughput: 0: 44093.6. Samples: 2695449620. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 06:46:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:46:18,942][06909] Updated weights for policy 0, policy_version 170443 (0.0037) [2024-06-28 06:46:22,509][06909] Updated weights for policy 0, policy_version 170453 (0.0047) [2024-06-28 06:46:23,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2792751104. Throughput: 0: 44176.5. Samples: 2695715300. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 06:46:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:46:26,441][06909] Updated weights for policy 0, policy_version 170463 (0.0024) [2024-06-28 06:46:28,852][06674] Fps is (10 sec: 44227.6, 60 sec: 44235.3, 300 sec: 44097.6). Total num frames: 2792964096. Throughput: 0: 43973.5. Samples: 2695846300. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 06:46:28,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:46:29,935][06909] Updated weights for policy 0, policy_version 170473 (0.0044) [2024-06-28 06:46:33,805][06909] Updated weights for policy 0, policy_version 170483 (0.0032) [2024-06-28 06:46:33,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44238.3, 300 sec: 44098.0). Total num frames: 2793193472. Throughput: 0: 43811.6. Samples: 2696106020. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 06:46:33,853][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:46:37,738][06909] Updated weights for policy 0, policy_version 170493 (0.0046) [2024-06-28 06:46:38,850][06674] Fps is (10 sec: 42607.0, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2793390080. Throughput: 0: 43926.6. Samples: 2696370420. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 06:46:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:46:41,137][06909] Updated weights for policy 0, policy_version 170503 (0.0032) [2024-06-28 06:46:43,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2793635840. Throughput: 0: 43943.5. Samples: 2696498740. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 06:46:43,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:46:44,938][06909] Updated weights for policy 0, policy_version 170513 (0.0037) [2024-06-28 06:46:48,517][06909] Updated weights for policy 0, policy_version 170523 (0.0034) [2024-06-28 06:46:48,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2793848832. Throughput: 0: 44005.9. Samples: 2696767000. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 06:46:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:46:52,153][06909] Updated weights for policy 0, policy_version 170533 (0.0037) [2024-06-28 06:46:53,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2794061824. Throughput: 0: 44063.1. Samples: 2697033520. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 06:46:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:46:56,159][06909] Updated weights for policy 0, policy_version 170543 (0.0028) [2024-06-28 06:46:58,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.8, 300 sec: 44098.3). Total num frames: 2794291200. Throughput: 0: 44087.6. Samples: 2697167280. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 06:46:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 06:46:59,352][06909] Updated weights for policy 0, policy_version 170553 (0.0035) [2024-06-28 06:47:03,383][06909] Updated weights for policy 0, policy_version 170563 (0.0032) [2024-06-28 06:47:03,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2794504192. Throughput: 0: 44003.6. Samples: 2697429780. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 06:47:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:47:06,867][06909] Updated weights for policy 0, policy_version 170573 (0.0026) [2024-06-28 06:47:08,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2794733568. Throughput: 0: 44077.3. Samples: 2697698780. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 06:47:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:47:10,774][06909] Updated weights for policy 0, policy_version 170583 (0.0035) [2024-06-28 06:47:13,852][06674] Fps is (10 sec: 44227.4, 60 sec: 43689.2, 300 sec: 44153.2). Total num frames: 2794946560. Throughput: 0: 43987.1. Samples: 2697825720. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 06:47:13,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:47:14,846][06909] Updated weights for policy 0, policy_version 170593 (0.0042) [2024-06-28 06:47:18,003][06909] Updated weights for policy 0, policy_version 170603 (0.0034) [2024-06-28 06:47:18,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2795175936. Throughput: 0: 44125.3. Samples: 2698091660. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 06:47:18,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:47:22,152][06909] Updated weights for policy 0, policy_version 170613 (0.0032) [2024-06-28 06:47:23,850][06674] Fps is (10 sec: 44245.8, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2795388928. Throughput: 0: 44023.1. Samples: 2698351460. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 06:47:23,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 06:47:26,105][06909] Updated weights for policy 0, policy_version 170623 (0.0041) [2024-06-28 06:47:28,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43965.3, 300 sec: 44153.5). Total num frames: 2795601920. Throughput: 0: 44057.5. Samples: 2698481320. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 06:47:28,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:47:29,873][06909] Updated weights for policy 0, policy_version 170633 (0.0034) [2024-06-28 06:47:33,393][06909] Updated weights for policy 0, policy_version 170643 (0.0042) [2024-06-28 06:47:33,850][06674] Fps is (10 sec: 44234.8, 60 sec: 43963.4, 300 sec: 43987.1). Total num frames: 2795831296. Throughput: 0: 43994.2. Samples: 2698746760. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 06:47:33,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:47:34,754][06887] Signal inference workers to stop experience collection... (38300 times) [2024-06-28 06:47:34,788][06909] InferenceWorker_p0-w0: stopping experience collection (38300 times) [2024-06-28 06:47:34,813][06887] Signal inference workers to resume experience collection... (38300 times) [2024-06-28 06:47:34,813][06909] InferenceWorker_p0-w0: resuming experience collection (38300 times) [2024-06-28 06:47:37,081][06909] Updated weights for policy 0, policy_version 170653 (0.0037) [2024-06-28 06:47:38,850][06674] Fps is (10 sec: 44235.7, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2796044288. Throughput: 0: 43985.2. Samples: 2699012860. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 06:47:38,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:47:40,848][06909] Updated weights for policy 0, policy_version 170663 (0.0040) [2024-06-28 06:47:43,852][06674] Fps is (10 sec: 42591.7, 60 sec: 43689.2, 300 sec: 44042.1). Total num frames: 2796257280. Throughput: 0: 43806.9. Samples: 2699138680. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 06:47:43,852][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 06:47:44,770][06909] Updated weights for policy 0, policy_version 170673 (0.0040) [2024-06-28 06:47:48,131][06909] Updated weights for policy 0, policy_version 170683 (0.0031) [2024-06-28 06:47:48,851][06674] Fps is (10 sec: 44232.0, 60 sec: 43962.8, 300 sec: 43986.7). Total num frames: 2796486656. Throughput: 0: 43768.0. Samples: 2699399400. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 06:47:48,852][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 06:47:48,903][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000170685_2796503040.pth... [2024-06-28 06:47:48,942][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000170038_2785902592.pth [2024-06-28 06:47:52,046][06909] Updated weights for policy 0, policy_version 170693 (0.0041) [2024-06-28 06:47:53,850][06674] Fps is (10 sec: 44245.4, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2796699648. Throughput: 0: 43740.8. Samples: 2699667120. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 06:47:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:47:55,739][06909] Updated weights for policy 0, policy_version 170703 (0.0031) [2024-06-28 06:47:58,850][06674] Fps is (10 sec: 44242.0, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2796929024. Throughput: 0: 43910.8. Samples: 2699801620. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 06:47:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:47:59,750][06909] Updated weights for policy 0, policy_version 170713 (0.0040) [2024-06-28 06:48:03,204][06909] Updated weights for policy 0, policy_version 170723 (0.0034) [2024-06-28 06:48:03,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2797158400. Throughput: 0: 43899.2. Samples: 2700067120. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 06:48:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:48:07,219][06909] Updated weights for policy 0, policy_version 170733 (0.0027) [2024-06-28 06:48:08,852][06674] Fps is (10 sec: 44228.1, 60 sec: 43962.2, 300 sec: 44097.6). Total num frames: 2797371392. Throughput: 0: 44060.7. Samples: 2700334280. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 06:48:08,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:48:10,554][06909] Updated weights for policy 0, policy_version 170743 (0.0036) [2024-06-28 06:48:13,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43965.3, 300 sec: 44098.0). Total num frames: 2797584384. Throughput: 0: 44156.5. Samples: 2700468360. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 06:48:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:48:14,600][06909] Updated weights for policy 0, policy_version 170753 (0.0026) [2024-06-28 06:48:17,993][06909] Updated weights for policy 0, policy_version 170763 (0.0018) [2024-06-28 06:48:18,850][06674] Fps is (10 sec: 44246.0, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2797813760. Throughput: 0: 44295.6. Samples: 2700740040. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 06:48:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:48:22,003][06909] Updated weights for policy 0, policy_version 170773 (0.0027) [2024-06-28 06:48:23,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2798026752. Throughput: 0: 44067.8. Samples: 2700995900. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 06:48:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:48:25,204][06909] Updated weights for policy 0, policy_version 170783 (0.0031) [2024-06-28 06:48:28,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2798239744. Throughput: 0: 44159.8. Samples: 2701125780. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 06:48:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:48:29,554][06909] Updated weights for policy 0, policy_version 170793 (0.0034) [2024-06-28 06:48:32,895][06909] Updated weights for policy 0, policy_version 170803 (0.0034) [2024-06-28 06:48:33,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43964.1, 300 sec: 44042.4). Total num frames: 2798469120. Throughput: 0: 44235.9. Samples: 2701389960. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 06:48:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:48:36,863][06909] Updated weights for policy 0, policy_version 170813 (0.0034) [2024-06-28 06:48:38,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2798682112. Throughput: 0: 44143.1. Samples: 2701653560. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 06:48:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:48:40,326][06909] Updated weights for policy 0, policy_version 170823 (0.0030) [2024-06-28 06:48:43,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44238.3, 300 sec: 44153.5). Total num frames: 2798911488. Throughput: 0: 44159.6. Samples: 2701788800. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 06:48:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:48:44,366][06909] Updated weights for policy 0, policy_version 170833 (0.0024) [2024-06-28 06:48:47,621][06909] Updated weights for policy 0, policy_version 170843 (0.0031) [2024-06-28 06:48:48,850][06674] Fps is (10 sec: 45876.2, 60 sec: 44237.8, 300 sec: 44098.0). Total num frames: 2799140864. Throughput: 0: 44101.0. Samples: 2702051660. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 06:48:48,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 06:48:51,857][06909] Updated weights for policy 0, policy_version 170853 (0.0032) [2024-06-28 06:48:53,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2799337472. Throughput: 0: 44097.1. Samples: 2702318560. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 06:48:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:48:55,303][06909] Updated weights for policy 0, policy_version 170863 (0.0036) [2024-06-28 06:48:56,311][06887] Signal inference workers to stop experience collection... (38350 times) [2024-06-28 06:48:56,314][06887] Signal inference workers to resume experience collection... (38350 times) [2024-06-28 06:48:56,352][06909] InferenceWorker_p0-w0: stopping experience collection (38350 times) [2024-06-28 06:48:56,352][06909] InferenceWorker_p0-w0: resuming experience collection (38350 times) [2024-06-28 06:48:58,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2799566848. Throughput: 0: 43912.4. Samples: 2702444420. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 06:48:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:48:59,114][06909] Updated weights for policy 0, policy_version 170873 (0.0039) [2024-06-28 06:49:02,378][06909] Updated weights for policy 0, policy_version 170883 (0.0024) [2024-06-28 06:49:03,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2799796224. Throughput: 0: 43760.8. Samples: 2702709280. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 06:49:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:49:06,562][06909] Updated weights for policy 0, policy_version 170893 (0.0040) [2024-06-28 06:49:08,850][06674] Fps is (10 sec: 45874.5, 60 sec: 44238.2, 300 sec: 44153.5). Total num frames: 2800025600. Throughput: 0: 44168.7. Samples: 2702983500. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 06:49:08,856][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:49:09,806][06909] Updated weights for policy 0, policy_version 170903 (0.0027) [2024-06-28 06:49:13,777][06909] Updated weights for policy 0, policy_version 170913 (0.0023) [2024-06-28 06:49:13,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 2800238592. Throughput: 0: 44243.1. Samples: 2703116720. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 06:49:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:49:17,073][06909] Updated weights for policy 0, policy_version 170923 (0.0038) [2024-06-28 06:49:18,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2800467968. Throughput: 0: 44144.0. Samples: 2703376440. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 06:49:18,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:49:21,378][06909] Updated weights for policy 0, policy_version 170933 (0.0040) [2024-06-28 06:49:23,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2800680960. Throughput: 0: 44399.3. Samples: 2703651520. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 06:49:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:49:24,376][06909] Updated weights for policy 0, policy_version 170943 (0.0030) [2024-06-28 06:49:28,645][06909] Updated weights for policy 0, policy_version 170953 (0.0032) [2024-06-28 06:49:28,850][06674] Fps is (10 sec: 42599.0, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2800893952. Throughput: 0: 44149.5. Samples: 2703775520. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 06:49:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:49:31,885][06909] Updated weights for policy 0, policy_version 170963 (0.0028) [2024-06-28 06:49:33,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.8, 300 sec: 44098.3). Total num frames: 2801123328. Throughput: 0: 44214.6. Samples: 2704041320. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 06:49:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:49:36,450][06909] Updated weights for policy 0, policy_version 170973 (0.0046) [2024-06-28 06:49:38,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44237.0, 300 sec: 44153.5). Total num frames: 2801336320. Throughput: 0: 44141.0. Samples: 2704304900. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 06:49:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:49:39,207][06909] Updated weights for policy 0, policy_version 170983 (0.0027) [2024-06-28 06:49:43,786][06909] Updated weights for policy 0, policy_version 170993 (0.0026) [2024-06-28 06:49:43,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2801549312. Throughput: 0: 44155.1. Samples: 2704431400. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 06:49:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:49:46,971][06909] Updated weights for policy 0, policy_version 171003 (0.0020) [2024-06-28 06:49:48,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2801762304. Throughput: 0: 44057.0. Samples: 2704691840. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 06:49:48,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 06:49:48,880][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000171007_2801778688.pth... [2024-06-28 06:49:48,947][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000170362_2791211008.pth [2024-06-28 06:49:51,113][06909] Updated weights for policy 0, policy_version 171013 (0.0040) [2024-06-28 06:49:53,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2801975296. Throughput: 0: 43908.1. Samples: 2704959360. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 06:49:53,856][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 06:49:54,407][06909] Updated weights for policy 0, policy_version 171023 (0.0027) [2024-06-28 06:49:58,579][06909] Updated weights for policy 0, policy_version 171033 (0.0025) [2024-06-28 06:49:58,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2802204672. Throughput: 0: 43784.9. Samples: 2705087040. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 06:49:58,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 06:50:01,057][06887] Signal inference workers to stop experience collection... (38400 times) [2024-06-28 06:50:01,110][06909] InferenceWorker_p0-w0: stopping experience collection (38400 times) [2024-06-28 06:50:01,112][06887] Signal inference workers to resume experience collection... (38400 times) [2024-06-28 06:50:01,123][06909] InferenceWorker_p0-w0: resuming experience collection (38400 times) [2024-06-28 06:50:02,028][06909] Updated weights for policy 0, policy_version 171043 (0.0040) [2024-06-28 06:50:03,850][06674] Fps is (10 sec: 47513.1, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2802450432. Throughput: 0: 43965.7. Samples: 2705354900. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 06:50:03,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:50:05,817][06909] Updated weights for policy 0, policy_version 171053 (0.0023) [2024-06-28 06:50:08,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 2802663424. Throughput: 0: 43711.9. Samples: 2705618560. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 06:50:08,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:50:09,251][06909] Updated weights for policy 0, policy_version 171063 (0.0039) [2024-06-28 06:50:13,593][06909] Updated weights for policy 0, policy_version 171073 (0.0036) [2024-06-28 06:50:13,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2802860032. Throughput: 0: 43929.2. Samples: 2705752340. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 06:50:13,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:50:16,739][06909] Updated weights for policy 0, policy_version 171083 (0.0024) [2024-06-28 06:50:18,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2803089408. Throughput: 0: 43794.1. Samples: 2706012060. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 06:50:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:50:20,707][06909] Updated weights for policy 0, policy_version 171093 (0.0022) [2024-06-28 06:50:23,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2803302400. Throughput: 0: 43878.6. Samples: 2706279440. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 06:50:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:50:24,294][06909] Updated weights for policy 0, policy_version 171103 (0.0030) [2024-06-28 06:50:28,412][06909] Updated weights for policy 0, policy_version 171113 (0.0026) [2024-06-28 06:50:28,850][06674] Fps is (10 sec: 45875.7, 60 sec: 44236.7, 300 sec: 44098.3). Total num frames: 2803548160. Throughput: 0: 43972.0. Samples: 2706410140. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 06:50:28,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:50:31,435][06909] Updated weights for policy 0, policy_version 171123 (0.0025) [2024-06-28 06:50:33,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2803761152. Throughput: 0: 44190.2. Samples: 2706680400. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 06:50:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:50:35,572][06909] Updated weights for policy 0, policy_version 171133 (0.0025) [2024-06-28 06:50:38,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2803990528. Throughput: 0: 44151.2. Samples: 2706946160. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 06:50:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:50:38,866][06909] Updated weights for policy 0, policy_version 171143 (0.0033) [2024-06-28 06:50:43,341][06909] Updated weights for policy 0, policy_version 171153 (0.0036) [2024-06-28 06:50:43,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 2804203520. Throughput: 0: 44101.8. Samples: 2707071620. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 06:50:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:50:46,778][06909] Updated weights for policy 0, policy_version 171163 (0.0042) [2024-06-28 06:50:48,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 2804400128. Throughput: 0: 43876.0. Samples: 2707329320. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 06:50:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:50:50,646][06909] Updated weights for policy 0, policy_version 171173 (0.0033) [2024-06-28 06:50:53,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43963.8, 300 sec: 43931.4). Total num frames: 2804613120. Throughput: 0: 43815.7. Samples: 2707590260. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 06:50:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:50:54,174][06909] Updated weights for policy 0, policy_version 171183 (0.0032) [2024-06-28 06:50:58,201][06909] Updated weights for policy 0, policy_version 171193 (0.0041) [2024-06-28 06:50:58,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2804858880. Throughput: 0: 43767.1. Samples: 2707721860. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 06:50:58,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:51:01,676][06909] Updated weights for policy 0, policy_version 171203 (0.0054) [2024-06-28 06:51:03,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2805071872. Throughput: 0: 43924.5. Samples: 2707988660. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 06:51:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:51:05,613][06909] Updated weights for policy 0, policy_version 171213 (0.0039) [2024-06-28 06:51:08,148][06887] Signal inference workers to stop experience collection... (38450 times) [2024-06-28 06:51:08,152][06887] Signal inference workers to resume experience collection... (38450 times) [2024-06-28 06:51:08,191][06909] InferenceWorker_p0-w0: stopping experience collection (38450 times) [2024-06-28 06:51:08,191][06909] InferenceWorker_p0-w0: resuming experience collection (38450 times) [2024-06-28 06:51:08,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2805301248. Throughput: 0: 43926.3. Samples: 2708256120. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 06:51:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:51:08,950][06909] Updated weights for policy 0, policy_version 171223 (0.0032) [2024-06-28 06:51:12,826][06909] Updated weights for policy 0, policy_version 171233 (0.0040) [2024-06-28 06:51:13,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2805514240. Throughput: 0: 43982.6. Samples: 2708389360. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 06:51:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:51:16,614][06909] Updated weights for policy 0, policy_version 171243 (0.0041) [2024-06-28 06:51:18,856][06674] Fps is (10 sec: 42572.4, 60 sec: 43959.4, 300 sec: 43986.0). Total num frames: 2805727232. Throughput: 0: 43758.1. Samples: 2708649780. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 06:51:18,856][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:51:20,295][06909] Updated weights for policy 0, policy_version 171253 (0.0020) [2024-06-28 06:51:23,830][06909] Updated weights for policy 0, policy_version 171263 (0.0025) [2024-06-28 06:51:23,850][06674] Fps is (10 sec: 45875.8, 60 sec: 44509.9, 300 sec: 44098.3). Total num frames: 2805972992. Throughput: 0: 43810.2. Samples: 2708917620. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 06:51:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:51:28,022][06909] Updated weights for policy 0, policy_version 171273 (0.0037) [2024-06-28 06:51:28,850][06674] Fps is (10 sec: 44263.7, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2806169600. Throughput: 0: 44064.4. Samples: 2709054520. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-28 06:51:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:51:31,243][06909] Updated weights for policy 0, policy_version 171283 (0.0026) [2024-06-28 06:51:33,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2806398976. Throughput: 0: 44213.4. Samples: 2709318920. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-28 06:51:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:51:35,274][06909] Updated weights for policy 0, policy_version 171293 (0.0034) [2024-06-28 06:51:38,526][06909] Updated weights for policy 0, policy_version 171303 (0.0036) [2024-06-28 06:51:38,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2806628352. Throughput: 0: 44204.8. Samples: 2709579480. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-28 06:51:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:51:42,458][06909] Updated weights for policy 0, policy_version 171313 (0.0024) [2024-06-28 06:51:43,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2806841344. Throughput: 0: 44468.5. Samples: 2709722940. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-28 06:51:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:51:45,985][06909] Updated weights for policy 0, policy_version 171323 (0.0031) [2024-06-28 06:51:48,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 2807070720. Throughput: 0: 44439.6. Samples: 2709988440. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-28 06:51:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:51:48,857][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000171330_2807070720.pth... [2024-06-28 06:51:48,921][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000170685_2796503040.pth [2024-06-28 06:51:49,611][06909] Updated weights for policy 0, policy_version 171333 (0.0035) [2024-06-28 06:51:53,620][06909] Updated weights for policy 0, policy_version 171343 (0.0044) [2024-06-28 06:51:53,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 2807283712. Throughput: 0: 44221.8. Samples: 2710246100. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-28 06:51:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:51:57,716][06909] Updated weights for policy 0, policy_version 171353 (0.0033) [2024-06-28 06:51:58,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2807480320. Throughput: 0: 44150.7. Samples: 2710376140. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-28 06:51:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:52:01,270][06909] Updated weights for policy 0, policy_version 171363 (0.0039) [2024-06-28 06:52:03,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2807709696. Throughput: 0: 44130.0. Samples: 2710635360. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-28 06:52:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:52:05,079][06909] Updated weights for policy 0, policy_version 171373 (0.0039) [2024-06-28 06:52:08,711][06909] Updated weights for policy 0, policy_version 171383 (0.0030) [2024-06-28 06:52:08,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.7, 300 sec: 44042.7). Total num frames: 2807939072. Throughput: 0: 44039.5. Samples: 2710899400. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-28 06:52:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:52:12,437][06909] Updated weights for policy 0, policy_version 171393 (0.0037) [2024-06-28 06:52:13,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2808152064. Throughput: 0: 43984.0. Samples: 2711033800. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-28 06:52:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:52:16,262][06909] Updated weights for policy 0, policy_version 171403 (0.0038) [2024-06-28 06:52:18,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43968.2, 300 sec: 43986.9). Total num frames: 2808365056. Throughput: 0: 43918.7. Samples: 2711295260. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-28 06:52:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:52:19,755][06909] Updated weights for policy 0, policy_version 171413 (0.0024) [2024-06-28 06:52:23,584][06909] Updated weights for policy 0, policy_version 171423 (0.0022) [2024-06-28 06:52:23,852][06674] Fps is (10 sec: 44227.8, 60 sec: 43689.1, 300 sec: 44042.1). Total num frames: 2808594432. Throughput: 0: 44104.6. Samples: 2711564280. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-28 06:52:23,853][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:52:27,331][06909] Updated weights for policy 0, policy_version 171433 (0.0038) [2024-06-28 06:52:28,391][06887] Signal inference workers to stop experience collection... (38500 times) [2024-06-28 06:52:28,411][06909] InferenceWorker_p0-w0: stopping experience collection (38500 times) [2024-06-28 06:52:28,506][06887] Signal inference workers to resume experience collection... (38500 times) [2024-06-28 06:52:28,506][06909] InferenceWorker_p0-w0: resuming experience collection (38500 times) [2024-06-28 06:52:28,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.9, 300 sec: 44042.5). Total num frames: 2808823808. Throughput: 0: 43817.0. Samples: 2711694700. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-28 06:52:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:52:31,051][06909] Updated weights for policy 0, policy_version 171443 (0.0031) [2024-06-28 06:52:33,850][06674] Fps is (10 sec: 44245.3, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 2809036800. Throughput: 0: 43851.4. Samples: 2711961760. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 06:52:33,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:52:34,977][06909] Updated weights for policy 0, policy_version 171453 (0.0023) [2024-06-28 06:52:38,315][06909] Updated weights for policy 0, policy_version 171463 (0.0033) [2024-06-28 06:52:38,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43690.6, 300 sec: 44042.7). Total num frames: 2809249792. Throughput: 0: 43811.5. Samples: 2712217620. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 06:52:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:52:42,191][06909] Updated weights for policy 0, policy_version 171473 (0.0027) [2024-06-28 06:52:43,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43963.8, 300 sec: 44042.6). Total num frames: 2809479168. Throughput: 0: 43963.2. Samples: 2712354480. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 06:52:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:52:45,742][06909] Updated weights for policy 0, policy_version 171483 (0.0028) [2024-06-28 06:52:48,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2809692160. Throughput: 0: 44157.6. Samples: 2712622460. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 06:52:48,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:52:49,527][06909] Updated weights for policy 0, policy_version 171493 (0.0036) [2024-06-28 06:52:53,285][06909] Updated weights for policy 0, policy_version 171503 (0.0029) [2024-06-28 06:52:53,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2809921536. Throughput: 0: 44062.2. Samples: 2712882200. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 06:52:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:52:56,988][06909] Updated weights for policy 0, policy_version 171513 (0.0028) [2024-06-28 06:52:58,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 2810134528. Throughput: 0: 44080.4. Samples: 2713017420. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 06:52:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:53:00,941][06909] Updated weights for policy 0, policy_version 171523 (0.0043) [2024-06-28 06:53:03,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 44042.7). Total num frames: 2810363904. Throughput: 0: 44277.3. Samples: 2713287740. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 06:53:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:53:04,516][06909] Updated weights for policy 0, policy_version 171533 (0.0028) [2024-06-28 06:53:08,247][06909] Updated weights for policy 0, policy_version 171543 (0.0023) [2024-06-28 06:53:08,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2810576896. Throughput: 0: 43888.2. Samples: 2713539160. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 06:53:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:53:11,913][06909] Updated weights for policy 0, policy_version 171553 (0.0027) [2024-06-28 06:53:13,852][06674] Fps is (10 sec: 44227.5, 60 sec: 44235.3, 300 sec: 44042.1). Total num frames: 2810806272. Throughput: 0: 43912.1. Samples: 2713670840. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 06:53:13,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:53:15,569][06909] Updated weights for policy 0, policy_version 171563 (0.0025) [2024-06-28 06:53:18,853][06674] Fps is (10 sec: 45860.7, 60 sec: 44507.4, 300 sec: 44097.5). Total num frames: 2811035648. Throughput: 0: 44053.0. Samples: 2713944280. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 06:53:18,853][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:53:19,468][06909] Updated weights for policy 0, policy_version 171573 (0.0026) [2024-06-28 06:53:22,850][06909] Updated weights for policy 0, policy_version 171583 (0.0029) [2024-06-28 06:53:23,850][06674] Fps is (10 sec: 44246.1, 60 sec: 44238.4, 300 sec: 44098.0). Total num frames: 2811248640. Throughput: 0: 44188.1. Samples: 2714206080. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 06:53:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:53:26,856][06909] Updated weights for policy 0, policy_version 171593 (0.0034) [2024-06-28 06:53:28,850][06674] Fps is (10 sec: 44251.0, 60 sec: 44236.7, 300 sec: 44098.0). Total num frames: 2811478016. Throughput: 0: 44156.0. Samples: 2714341500. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 06:53:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:53:30,326][06909] Updated weights for policy 0, policy_version 171603 (0.0026) [2024-06-28 06:53:33,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2811691008. Throughput: 0: 44139.7. Samples: 2714608740. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 06:53:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:53:34,104][06909] Updated weights for policy 0, policy_version 171613 (0.0028) [2024-06-28 06:53:37,420][06909] Updated weights for policy 0, policy_version 171623 (0.0030) [2024-06-28 06:53:38,850][06674] Fps is (10 sec: 42597.9, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2811904000. Throughput: 0: 44222.5. Samples: 2714872220. Policy #0 lag: (min: 0.0, avg: 10.9, max: 20.0) [2024-06-28 06:53:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:53:41,653][06909] Updated weights for policy 0, policy_version 171633 (0.0038) [2024-06-28 06:53:43,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2812133376. Throughput: 0: 44237.8. Samples: 2715008120. Policy #0 lag: (min: 0.0, avg: 10.9, max: 20.0) [2024-06-28 06:53:43,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:53:45,386][06909] Updated weights for policy 0, policy_version 171643 (0.0033) [2024-06-28 06:53:48,850][06674] Fps is (10 sec: 44237.3, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2812346368. Throughput: 0: 44096.4. Samples: 2715272080. Policy #0 lag: (min: 0.0, avg: 10.9, max: 20.0) [2024-06-28 06:53:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:53:48,862][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000171652_2812346368.pth... [2024-06-28 06:53:48,927][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000171007_2801778688.pth [2024-06-28 06:53:49,138][06909] Updated weights for policy 0, policy_version 171653 (0.0024) [2024-06-28 06:53:52,502][06909] Updated weights for policy 0, policy_version 171663 (0.0036) [2024-06-28 06:53:53,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2812559360. Throughput: 0: 44338.7. Samples: 2715534400. Policy #0 lag: (min: 0.0, avg: 10.9, max: 20.0) [2024-06-28 06:53:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:53:54,409][06887] Signal inference workers to stop experience collection... (38550 times) [2024-06-28 06:53:54,412][06887] Signal inference workers to resume experience collection... (38550 times) [2024-06-28 06:53:54,428][06909] InferenceWorker_p0-w0: stopping experience collection (38550 times) [2024-06-28 06:53:54,428][06909] InferenceWorker_p0-w0: resuming experience collection (38550 times) [2024-06-28 06:53:56,476][06909] Updated weights for policy 0, policy_version 171673 (0.0041) [2024-06-28 06:53:58,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 2812805120. Throughput: 0: 44384.7. Samples: 2715668060. Policy #0 lag: (min: 0.0, avg: 10.9, max: 20.0) [2024-06-28 06:53:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:53:59,801][06909] Updated weights for policy 0, policy_version 171683 (0.0024) [2024-06-28 06:54:03,668][06909] Updated weights for policy 0, policy_version 171693 (0.0033) [2024-06-28 06:54:03,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2813018112. Throughput: 0: 44268.4. Samples: 2715936220. Policy #0 lag: (min: 0.0, avg: 10.9, max: 20.0) [2024-06-28 06:54:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:54:07,663][06909] Updated weights for policy 0, policy_version 171703 (0.0027) [2024-06-28 06:54:08,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2813214720. Throughput: 0: 44241.3. Samples: 2716196940. Policy #0 lag: (min: 0.0, avg: 10.9, max: 20.0) [2024-06-28 06:54:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:54:10,970][06909] Updated weights for policy 0, policy_version 171713 (0.0031) [2024-06-28 06:54:13,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44238.3, 300 sec: 44042.4). Total num frames: 2813460480. Throughput: 0: 44177.3. Samples: 2716329480. Policy #0 lag: (min: 0.0, avg: 10.9, max: 20.0) [2024-06-28 06:54:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:54:14,999][06909] Updated weights for policy 0, policy_version 171723 (0.0029) [2024-06-28 06:54:18,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43693.0, 300 sec: 43986.9). Total num frames: 2813657088. Throughput: 0: 44015.2. Samples: 2716589420. Policy #0 lag: (min: 0.0, avg: 10.9, max: 20.0) [2024-06-28 06:54:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:54:18,927][06909] Updated weights for policy 0, policy_version 171733 (0.0030) [2024-06-28 06:54:22,652][06909] Updated weights for policy 0, policy_version 171743 (0.0034) [2024-06-28 06:54:23,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.6, 300 sec: 43986.8). Total num frames: 2813870080. Throughput: 0: 43883.1. Samples: 2716846960. Policy #0 lag: (min: 0.0, avg: 10.9, max: 20.0) [2024-06-28 06:54:23,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:54:26,221][06909] Updated weights for policy 0, policy_version 171753 (0.0037) [2024-06-28 06:54:28,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2814115840. Throughput: 0: 43795.2. Samples: 2716978900. Policy #0 lag: (min: 0.0, avg: 10.9, max: 20.0) [2024-06-28 06:54:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:54:30,218][06909] Updated weights for policy 0, policy_version 171763 (0.0027) [2024-06-28 06:54:33,796][06909] Updated weights for policy 0, policy_version 171773 (0.0042) [2024-06-28 06:54:33,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2814328832. Throughput: 0: 43902.2. Samples: 2717247680. Policy #0 lag: (min: 0.0, avg: 10.9, max: 20.0) [2024-06-28 06:54:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:54:37,342][06909] Updated weights for policy 0, policy_version 171783 (0.0033) [2024-06-28 06:54:38,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.9, 300 sec: 44042.4). Total num frames: 2814541824. Throughput: 0: 44205.0. Samples: 2717523620. Policy #0 lag: (min: 0.0, avg: 10.9, max: 20.0) [2024-06-28 06:54:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:54:41,019][06909] Updated weights for policy 0, policy_version 171793 (0.0037) [2024-06-28 06:54:43,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 2814787584. Throughput: 0: 44161.8. Samples: 2717655340. Policy #0 lag: (min: 1.0, avg: 10.8, max: 23.0) [2024-06-28 06:54:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:54:44,654][06909] Updated weights for policy 0, policy_version 171803 (0.0036) [2024-06-28 06:54:48,169][06909] Updated weights for policy 0, policy_version 171813 (0.0028) [2024-06-28 06:54:48,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2815000576. Throughput: 0: 44088.5. Samples: 2717920200. Policy #0 lag: (min: 1.0, avg: 10.8, max: 23.0) [2024-06-28 06:54:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:54:52,143][06909] Updated weights for policy 0, policy_version 171823 (0.0029) [2024-06-28 06:54:53,850][06674] Fps is (10 sec: 42598.0, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2815213568. Throughput: 0: 44327.0. Samples: 2718191660. Policy #0 lag: (min: 1.0, avg: 10.8, max: 23.0) [2024-06-28 06:54:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 06:54:55,815][06909] Updated weights for policy 0, policy_version 171833 (0.0045) [2024-06-28 06:54:58,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2815442944. Throughput: 0: 44202.7. Samples: 2718318600. Policy #0 lag: (min: 1.0, avg: 10.8, max: 23.0) [2024-06-28 06:54:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:54:59,486][06909] Updated weights for policy 0, policy_version 171843 (0.0037) [2024-06-28 06:55:03,124][06909] Updated weights for policy 0, policy_version 171853 (0.0036) [2024-06-28 06:55:03,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2815672320. Throughput: 0: 44221.8. Samples: 2718579400. Policy #0 lag: (min: 1.0, avg: 10.8, max: 23.0) [2024-06-28 06:55:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:55:07,062][06909] Updated weights for policy 0, policy_version 171863 (0.0041) [2024-06-28 06:55:08,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 2815885312. Throughput: 0: 44495.7. Samples: 2718849260. Policy #0 lag: (min: 1.0, avg: 10.8, max: 23.0) [2024-06-28 06:55:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:55:10,586][06909] Updated weights for policy 0, policy_version 171873 (0.0023) [2024-06-28 06:55:13,852][06674] Fps is (10 sec: 44227.3, 60 sec: 44235.3, 300 sec: 44153.2). Total num frames: 2816114688. Throughput: 0: 44457.4. Samples: 2718979580. Policy #0 lag: (min: 1.0, avg: 10.8, max: 23.0) [2024-06-28 06:55:13,853][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:55:14,330][06909] Updated weights for policy 0, policy_version 171883 (0.0031) [2024-06-28 06:55:17,936][06909] Updated weights for policy 0, policy_version 171893 (0.0038) [2024-06-28 06:55:18,591][06887] Signal inference workers to stop experience collection... (38600 times) [2024-06-28 06:55:18,592][06887] Signal inference workers to resume experience collection... (38600 times) [2024-06-28 06:55:18,608][06909] InferenceWorker_p0-w0: stopping experience collection (38600 times) [2024-06-28 06:55:18,640][06909] InferenceWorker_p0-w0: resuming experience collection (38600 times) [2024-06-28 06:55:18,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 2816327680. Throughput: 0: 44340.4. Samples: 2719243000. Policy #0 lag: (min: 1.0, avg: 10.8, max: 23.0) [2024-06-28 06:55:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:55:21,842][06909] Updated weights for policy 0, policy_version 171903 (0.0043) [2024-06-28 06:55:23,850][06674] Fps is (10 sec: 42607.6, 60 sec: 44510.0, 300 sec: 44042.4). Total num frames: 2816540672. Throughput: 0: 44100.9. Samples: 2719508160. Policy #0 lag: (min: 1.0, avg: 10.8, max: 23.0) [2024-06-28 06:55:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:55:25,471][06909] Updated weights for policy 0, policy_version 171913 (0.0035) [2024-06-28 06:55:28,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2816770048. Throughput: 0: 44143.0. Samples: 2719641780. Policy #0 lag: (min: 1.0, avg: 10.8, max: 23.0) [2024-06-28 06:55:28,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:55:29,467][06909] Updated weights for policy 0, policy_version 171923 (0.0023) [2024-06-28 06:55:33,023][06909] Updated weights for policy 0, policy_version 171933 (0.0036) [2024-06-28 06:55:33,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2816983040. Throughput: 0: 43886.3. Samples: 2719895080. Policy #0 lag: (min: 1.0, avg: 10.8, max: 23.0) [2024-06-28 06:55:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:55:36,667][06909] Updated weights for policy 0, policy_version 171943 (0.0019) [2024-06-28 06:55:38,850][06674] Fps is (10 sec: 42597.9, 60 sec: 44236.6, 300 sec: 44042.4). Total num frames: 2817196032. Throughput: 0: 43961.6. Samples: 2720169940. Policy #0 lag: (min: 1.0, avg: 10.8, max: 23.0) [2024-06-28 06:55:38,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:55:40,170][06909] Updated weights for policy 0, policy_version 171953 (0.0045) [2024-06-28 06:55:43,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2817425408. Throughput: 0: 43899.5. Samples: 2720294080. Policy #0 lag: (min: 1.0, avg: 10.8, max: 23.0) [2024-06-28 06:55:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 06:55:44,239][06909] Updated weights for policy 0, policy_version 171963 (0.0026) [2024-06-28 06:55:47,673][06909] Updated weights for policy 0, policy_version 171973 (0.0025) [2024-06-28 06:55:48,853][06674] Fps is (10 sec: 45863.0, 60 sec: 44234.7, 300 sec: 44208.6). Total num frames: 2817654784. Throughput: 0: 44130.9. Samples: 2720565420. Policy #0 lag: (min: 0.0, avg: 10.1, max: 19.0) [2024-06-28 06:55:48,853][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:55:48,866][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000171976_2817654784.pth... [2024-06-28 06:55:48,923][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000171330_2807070720.pth [2024-06-28 06:55:51,500][06909] Updated weights for policy 0, policy_version 171983 (0.0022) [2024-06-28 06:55:53,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 2817884160. Throughput: 0: 44173.3. Samples: 2720837060. Policy #0 lag: (min: 0.0, avg: 10.1, max: 19.0) [2024-06-28 06:55:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:55:55,289][06909] Updated weights for policy 0, policy_version 171993 (0.0038) [2024-06-28 06:55:58,850][06674] Fps is (10 sec: 42610.7, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2818080768. Throughput: 0: 44033.2. Samples: 2720960980. Policy #0 lag: (min: 0.0, avg: 10.1, max: 19.0) [2024-06-28 06:55:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:55:58,866][06909] Updated weights for policy 0, policy_version 172003 (0.0040) [2024-06-28 06:56:02,521][06909] Updated weights for policy 0, policy_version 172013 (0.0036) [2024-06-28 06:56:03,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2818326528. Throughput: 0: 44090.7. Samples: 2721227080. Policy #0 lag: (min: 0.0, avg: 10.1, max: 19.0) [2024-06-28 06:56:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:56:06,275][06909] Updated weights for policy 0, policy_version 172023 (0.0039) [2024-06-28 06:56:08,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 2818539520. Throughput: 0: 44143.9. Samples: 2721494640. Policy #0 lag: (min: 0.0, avg: 10.1, max: 19.0) [2024-06-28 06:56:08,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:56:09,929][06909] Updated weights for policy 0, policy_version 172033 (0.0028) [2024-06-28 06:56:13,700][06909] Updated weights for policy 0, policy_version 172043 (0.0025) [2024-06-28 06:56:13,852][06674] Fps is (10 sec: 42589.5, 60 sec: 43963.8, 300 sec: 44154.1). Total num frames: 2818752512. Throughput: 0: 44098.1. Samples: 2721626280. Policy #0 lag: (min: 0.0, avg: 10.1, max: 19.0) [2024-06-28 06:56:13,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:56:17,296][06909] Updated weights for policy 0, policy_version 172053 (0.0030) [2024-06-28 06:56:18,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2818965504. Throughput: 0: 44351.6. Samples: 2721890900. Policy #0 lag: (min: 0.0, avg: 10.1, max: 19.0) [2024-06-28 06:56:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:56:21,076][06909] Updated weights for policy 0, policy_version 172063 (0.0025) [2024-06-28 06:56:23,850][06674] Fps is (10 sec: 44245.7, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 2819194880. Throughput: 0: 44041.1. Samples: 2722151780. Policy #0 lag: (min: 0.0, avg: 10.1, max: 19.0) [2024-06-28 06:56:23,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:56:24,724][06909] Updated weights for policy 0, policy_version 172073 (0.0042) [2024-06-28 06:56:28,695][06909] Updated weights for policy 0, policy_version 172083 (0.0044) [2024-06-28 06:56:28,853][06674] Fps is (10 sec: 44224.0, 60 sec: 43961.7, 300 sec: 44097.5). Total num frames: 2819407872. Throughput: 0: 44249.7. Samples: 2722285440. Policy #0 lag: (min: 0.0, avg: 10.1, max: 19.0) [2024-06-28 06:56:28,853][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:56:32,201][06909] Updated weights for policy 0, policy_version 172093 (0.0033) [2024-06-28 06:56:33,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2819637248. Throughput: 0: 44275.7. Samples: 2722557700. Policy #0 lag: (min: 0.0, avg: 10.1, max: 19.0) [2024-06-28 06:56:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 06:56:36,176][06909] Updated weights for policy 0, policy_version 172103 (0.0030) [2024-06-28 06:56:38,850][06674] Fps is (10 sec: 45888.4, 60 sec: 44510.0, 300 sec: 44153.5). Total num frames: 2819866624. Throughput: 0: 43999.1. Samples: 2722817020. Policy #0 lag: (min: 0.0, avg: 10.1, max: 19.0) [2024-06-28 06:56:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:56:39,462][06909] Updated weights for policy 0, policy_version 172113 (0.0029) [2024-06-28 06:56:43,548][06909] Updated weights for policy 0, policy_version 172123 (0.0034) [2024-06-28 06:56:43,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2820063232. Throughput: 0: 44232.8. Samples: 2722951460. Policy #0 lag: (min: 0.0, avg: 10.1, max: 19.0) [2024-06-28 06:56:43,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:56:44,616][06887] Signal inference workers to stop experience collection... (38650 times) [2024-06-28 06:56:44,616][06887] Signal inference workers to resume experience collection... (38650 times) [2024-06-28 06:56:44,627][06909] InferenceWorker_p0-w0: stopping experience collection (38650 times) [2024-06-28 06:56:44,644][06909] InferenceWorker_p0-w0: resuming experience collection (38650 times) [2024-06-28 06:56:47,110][06909] Updated weights for policy 0, policy_version 172133 (0.0022) [2024-06-28 06:56:48,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43965.8, 300 sec: 44097.9). Total num frames: 2820292608. Throughput: 0: 44216.3. Samples: 2723216820. Policy #0 lag: (min: 0.0, avg: 10.1, max: 19.0) [2024-06-28 06:56:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:56:50,933][06909] Updated weights for policy 0, policy_version 172143 (0.0034) [2024-06-28 06:56:53,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 2820521984. Throughput: 0: 43986.2. Samples: 2723474020. Policy #0 lag: (min: 0.0, avg: 11.6, max: 24.0) [2024-06-28 06:56:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:56:54,482][06909] Updated weights for policy 0, policy_version 172153 (0.0025) [2024-06-28 06:56:58,702][06909] Updated weights for policy 0, policy_version 172163 (0.0041) [2024-06-28 06:56:58,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2820718592. Throughput: 0: 44153.1. Samples: 2723613080. Policy #0 lag: (min: 0.0, avg: 11.6, max: 24.0) [2024-06-28 06:56:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:57:02,159][06909] Updated weights for policy 0, policy_version 172173 (0.0035) [2024-06-28 06:57:03,856][06674] Fps is (10 sec: 44210.1, 60 sec: 43959.3, 300 sec: 44152.6). Total num frames: 2820964352. Throughput: 0: 44073.6. Samples: 2723874480. Policy #0 lag: (min: 0.0, avg: 11.6, max: 24.0) [2024-06-28 06:57:03,856][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:57:06,240][06909] Updated weights for policy 0, policy_version 172183 (0.0029) [2024-06-28 06:57:08,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 2821160960. Throughput: 0: 44212.9. Samples: 2724141360. Policy #0 lag: (min: 0.0, avg: 11.6, max: 24.0) [2024-06-28 06:57:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:57:09,406][06909] Updated weights for policy 0, policy_version 172193 (0.0034) [2024-06-28 06:57:13,482][06909] Updated weights for policy 0, policy_version 172203 (0.0028) [2024-06-28 06:57:13,850][06674] Fps is (10 sec: 40985.3, 60 sec: 43692.2, 300 sec: 44098.0). Total num frames: 2821373952. Throughput: 0: 44157.1. Samples: 2724272380. Policy #0 lag: (min: 0.0, avg: 11.6, max: 24.0) [2024-06-28 06:57:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:57:16,742][06909] Updated weights for policy 0, policy_version 172213 (0.0034) [2024-06-28 06:57:18,856][06674] Fps is (10 sec: 45847.4, 60 sec: 44232.3, 300 sec: 44152.9). Total num frames: 2821619712. Throughput: 0: 43943.4. Samples: 2724535420. Policy #0 lag: (min: 0.0, avg: 11.6, max: 24.0) [2024-06-28 06:57:18,856][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:57:20,679][06909] Updated weights for policy 0, policy_version 172223 (0.0034) [2024-06-28 06:57:23,850][06674] Fps is (10 sec: 47513.3, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 2821849088. Throughput: 0: 44122.7. Samples: 2724802540. Policy #0 lag: (min: 0.0, avg: 11.6, max: 24.0) [2024-06-28 06:57:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:57:24,369][06909] Updated weights for policy 0, policy_version 172233 (0.0032) [2024-06-28 06:57:27,819][06909] Updated weights for policy 0, policy_version 172243 (0.0046) [2024-06-28 06:57:28,850][06674] Fps is (10 sec: 44263.0, 60 sec: 44238.8, 300 sec: 44153.5). Total num frames: 2822062080. Throughput: 0: 44191.0. Samples: 2724940060. Policy #0 lag: (min: 0.0, avg: 11.6, max: 24.0) [2024-06-28 06:57:28,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:57:31,512][06909] Updated weights for policy 0, policy_version 172253 (0.0034) [2024-06-28 06:57:33,855][06674] Fps is (10 sec: 44214.6, 60 sec: 44233.1, 300 sec: 44208.3). Total num frames: 2822291456. Throughput: 0: 44105.0. Samples: 2725201760. Policy #0 lag: (min: 0.0, avg: 11.6, max: 24.0) [2024-06-28 06:57:33,855][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:57:35,614][06909] Updated weights for policy 0, policy_version 172263 (0.0027) [2024-06-28 06:57:38,852][06674] Fps is (10 sec: 44228.4, 60 sec: 43962.2, 300 sec: 44153.2). Total num frames: 2822504448. Throughput: 0: 44272.7. Samples: 2725466380. Policy #0 lag: (min: 0.0, avg: 11.6, max: 24.0) [2024-06-28 06:57:38,861][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:57:39,214][06909] Updated weights for policy 0, policy_version 172273 (0.0037) [2024-06-28 06:57:42,883][06909] Updated weights for policy 0, policy_version 172283 (0.0024) [2024-06-28 06:57:43,850][06674] Fps is (10 sec: 42619.0, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 2822717440. Throughput: 0: 44077.2. Samples: 2725596560. Policy #0 lag: (min: 0.0, avg: 11.6, max: 24.0) [2024-06-28 06:57:43,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:57:46,690][06909] Updated weights for policy 0, policy_version 172293 (0.0031) [2024-06-28 06:57:48,850][06674] Fps is (10 sec: 44245.9, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2822946816. Throughput: 0: 44156.2. Samples: 2725861240. Policy #0 lag: (min: 0.0, avg: 11.6, max: 24.0) [2024-06-28 06:57:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:57:48,870][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000172299_2822946816.pth... [2024-06-28 06:57:48,923][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000171652_2812346368.pth [2024-06-28 06:57:50,502][06909] Updated weights for policy 0, policy_version 172303 (0.0033) [2024-06-28 06:57:53,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2823159808. Throughput: 0: 44143.5. Samples: 2726127820. Policy #0 lag: (min: 0.0, avg: 11.6, max: 24.0) [2024-06-28 06:57:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:57:54,023][06909] Updated weights for policy 0, policy_version 172313 (0.0033) [2024-06-28 06:57:57,904][06909] Updated weights for policy 0, policy_version 172323 (0.0035) [2024-06-28 06:57:58,850][06674] Fps is (10 sec: 42597.8, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2823372800. Throughput: 0: 44065.5. Samples: 2726255340. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 06:57:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:58:01,429][06909] Updated weights for policy 0, policy_version 172333 (0.0034) [2024-06-28 06:58:03,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44241.2, 300 sec: 44209.0). Total num frames: 2823618560. Throughput: 0: 44162.4. Samples: 2726522460. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 06:58:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:58:05,229][06909] Updated weights for policy 0, policy_version 172343 (0.0037) [2024-06-28 06:58:08,730][06909] Updated weights for policy 0, policy_version 172353 (0.0041) [2024-06-28 06:58:08,850][06674] Fps is (10 sec: 45875.9, 60 sec: 44509.9, 300 sec: 44153.8). Total num frames: 2823831552. Throughput: 0: 44165.7. Samples: 2726790000. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 06:58:08,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 06:58:12,820][06909] Updated weights for policy 0, policy_version 172363 (0.0027) [2024-06-28 06:58:13,761][06887] Signal inference workers to stop experience collection... (38700 times) [2024-06-28 06:58:13,763][06887] Signal inference workers to resume experience collection... (38700 times) [2024-06-28 06:58:13,798][06909] InferenceWorker_p0-w0: stopping experience collection (38700 times) [2024-06-28 06:58:13,798][06909] InferenceWorker_p0-w0: resuming experience collection (38700 times) [2024-06-28 06:58:13,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44509.7, 300 sec: 44098.4). Total num frames: 2824044544. Throughput: 0: 44097.4. Samples: 2726924440. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 06:58:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:58:16,046][06909] Updated weights for policy 0, policy_version 172373 (0.0040) [2024-06-28 06:58:18,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43968.2, 300 sec: 44098.0). Total num frames: 2824257536. Throughput: 0: 44153.4. Samples: 2727188440. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 06:58:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:58:19,976][06909] Updated weights for policy 0, policy_version 172383 (0.0035) [2024-06-28 06:58:23,706][06909] Updated weights for policy 0, policy_version 172393 (0.0020) [2024-06-28 06:58:23,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2824486912. Throughput: 0: 44118.0. Samples: 2727451600. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 06:58:23,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 06:58:27,486][06909] Updated weights for policy 0, policy_version 172403 (0.0032) [2024-06-28 06:58:28,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 2824716288. Throughput: 0: 44222.8. Samples: 2727586580. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 06:58:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:58:30,909][06909] Updated weights for policy 0, policy_version 172413 (0.0039) [2024-06-28 06:58:33,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43967.4, 300 sec: 44153.5). Total num frames: 2824929280. Throughput: 0: 44266.7. Samples: 2727853240. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 06:58:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 06:58:34,836][06909] Updated weights for policy 0, policy_version 172423 (0.0030) [2024-06-28 06:58:38,148][06909] Updated weights for policy 0, policy_version 172433 (0.0021) [2024-06-28 06:58:38,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43965.3, 300 sec: 44098.0). Total num frames: 2825142272. Throughput: 0: 44214.7. Samples: 2728117480. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 06:58:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:58:42,057][06909] Updated weights for policy 0, policy_version 172443 (0.0040) [2024-06-28 06:58:43,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44510.0, 300 sec: 44209.0). Total num frames: 2825388032. Throughput: 0: 44418.4. Samples: 2728254160. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 06:58:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:58:46,061][06909] Updated weights for policy 0, policy_version 172453 (0.0051) [2024-06-28 06:58:48,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2825584640. Throughput: 0: 44285.3. Samples: 2728515300. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 06:58:48,854][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:58:49,503][06909] Updated weights for policy 0, policy_version 172463 (0.0036) [2024-06-28 06:58:53,236][06909] Updated weights for policy 0, policy_version 172473 (0.0040) [2024-06-28 06:58:53,850][06674] Fps is (10 sec: 42598.2, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2825814016. Throughput: 0: 44236.0. Samples: 2728780620. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 06:58:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:58:56,891][06909] Updated weights for policy 0, policy_version 172483 (0.0028) [2024-06-28 06:58:58,850][06674] Fps is (10 sec: 45875.8, 60 sec: 44510.0, 300 sec: 44153.5). Total num frames: 2826043392. Throughput: 0: 44217.4. Samples: 2728914220. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 06:58:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:59:00,814][06909] Updated weights for policy 0, policy_version 172493 (0.0028) [2024-06-28 06:59:03,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 2826256384. Throughput: 0: 44310.6. Samples: 2729182420. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 06:59:03,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 06:59:04,199][06909] Updated weights for policy 0, policy_version 172503 (0.0028) [2024-06-28 06:59:07,942][06909] Updated weights for policy 0, policy_version 172513 (0.0023) [2024-06-28 06:59:08,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2826485760. Throughput: 0: 44288.4. Samples: 2729444580. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 06:59:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:59:11,811][06909] Updated weights for policy 0, policy_version 172523 (0.0022) [2024-06-28 06:59:13,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44236.9, 300 sec: 44209.0). Total num frames: 2826698752. Throughput: 0: 44315.6. Samples: 2729580780. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 06:59:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:59:15,207][06909] Updated weights for policy 0, policy_version 172533 (0.0032) [2024-06-28 06:59:18,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44509.8, 300 sec: 44264.6). Total num frames: 2826928128. Throughput: 0: 44349.6. Samples: 2729848980. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 06:59:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:59:19,108][06909] Updated weights for policy 0, policy_version 172543 (0.0035) [2024-06-28 06:59:22,757][06909] Updated weights for policy 0, policy_version 172553 (0.0038) [2024-06-28 06:59:23,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2827141120. Throughput: 0: 44299.6. Samples: 2730110960. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 06:59:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:59:26,448][06909] Updated weights for policy 0, policy_version 172563 (0.0038) [2024-06-28 06:59:28,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 2827370496. Throughput: 0: 44226.1. Samples: 2730244340. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 06:59:28,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:59:30,199][06909] Updated weights for policy 0, policy_version 172573 (0.0028) [2024-06-28 06:59:33,812][06909] Updated weights for policy 0, policy_version 172583 (0.0046) [2024-06-28 06:59:33,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44509.8, 300 sec: 44264.6). Total num frames: 2827599872. Throughput: 0: 44385.9. Samples: 2730512660. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 06:59:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 06:59:37,591][06909] Updated weights for policy 0, policy_version 172593 (0.0035) [2024-06-28 06:59:38,852][06674] Fps is (10 sec: 44227.7, 60 sec: 44508.3, 300 sec: 44153.2). Total num frames: 2827812864. Throughput: 0: 44218.4. Samples: 2730770540. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 06:59:38,853][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 06:59:41,216][06909] Updated weights for policy 0, policy_version 172603 (0.0040) [2024-06-28 06:59:42,326][06887] Signal inference workers to stop experience collection... (38750 times) [2024-06-28 06:59:42,375][06887] Signal inference workers to resume experience collection... (38750 times) [2024-06-28 06:59:42,377][06909] InferenceWorker_p0-w0: stopping experience collection (38750 times) [2024-06-28 06:59:42,388][06909] InferenceWorker_p0-w0: resuming experience collection (38750 times) [2024-06-28 06:59:43,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.6, 300 sec: 44098.0). Total num frames: 2828009472. Throughput: 0: 44151.1. Samples: 2730901020. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 06:59:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 06:59:45,031][06909] Updated weights for policy 0, policy_version 172613 (0.0023) [2024-06-28 06:59:48,850][06674] Fps is (10 sec: 42607.4, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 2828238848. Throughput: 0: 44151.6. Samples: 2731169240. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 06:59:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 06:59:48,982][06909] Updated weights for policy 0, policy_version 172623 (0.0029) [2024-06-28 06:59:48,984][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000172623_2828255232.pth... [2024-06-28 06:59:49,029][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000171976_2817654784.pth [2024-06-28 06:59:52,446][06909] Updated weights for policy 0, policy_version 172633 (0.0031) [2024-06-28 06:59:53,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2828451840. Throughput: 0: 44122.7. Samples: 2731430100. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 06:59:53,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 06:59:56,274][06909] Updated weights for policy 0, policy_version 172643 (0.0020) [2024-06-28 06:59:58,852][06674] Fps is (10 sec: 45865.9, 60 sec: 44235.3, 300 sec: 44153.2). Total num frames: 2828697600. Throughput: 0: 44064.7. Samples: 2731563780. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 06:59:58,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:00:00,242][06909] Updated weights for policy 0, policy_version 172653 (0.0032) [2024-06-28 07:00:03,728][06909] Updated weights for policy 0, policy_version 172663 (0.0027) [2024-06-28 07:00:03,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 2828910592. Throughput: 0: 44175.2. Samples: 2731836860. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 07:00:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:00:07,439][06909] Updated weights for policy 0, policy_version 172673 (0.0023) [2024-06-28 07:00:08,850][06674] Fps is (10 sec: 42607.1, 60 sec: 43963.8, 300 sec: 44098.3). Total num frames: 2829123584. Throughput: 0: 44128.4. Samples: 2732096740. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 07:00:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:00:11,062][06909] Updated weights for policy 0, policy_version 172683 (0.0030) [2024-06-28 07:00:13,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2829336576. Throughput: 0: 44022.3. Samples: 2732225340. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 07:00:13,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 07:00:14,796][06909] Updated weights for policy 0, policy_version 172693 (0.0041) [2024-06-28 07:00:18,345][06909] Updated weights for policy 0, policy_version 172703 (0.0023) [2024-06-28 07:00:18,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2829565952. Throughput: 0: 43982.2. Samples: 2732491860. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 07:00:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:00:22,342][06909] Updated weights for policy 0, policy_version 172713 (0.0031) [2024-06-28 07:00:23,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2829795328. Throughput: 0: 44200.8. Samples: 2732759480. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 07:00:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:00:26,071][06909] Updated weights for policy 0, policy_version 172723 (0.0025) [2024-06-28 07:00:28,850][06674] Fps is (10 sec: 45875.7, 60 sec: 44236.9, 300 sec: 44209.0). Total num frames: 2830024704. Throughput: 0: 44140.5. Samples: 2732887340. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 07:00:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:00:29,662][06909] Updated weights for policy 0, policy_version 172733 (0.0036) [2024-06-28 07:00:33,445][06909] Updated weights for policy 0, policy_version 172743 (0.0035) [2024-06-28 07:00:33,852][06674] Fps is (10 sec: 42589.5, 60 sec: 43689.2, 300 sec: 44153.2). Total num frames: 2830221312. Throughput: 0: 44223.3. Samples: 2733159380. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 07:00:33,853][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:00:37,058][06909] Updated weights for policy 0, policy_version 172753 (0.0041) [2024-06-28 07:00:38,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43965.4, 300 sec: 44153.5). Total num frames: 2830450688. Throughput: 0: 44098.3. Samples: 2733414520. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 07:00:38,850][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 07:00:40,897][06909] Updated weights for policy 0, policy_version 172763 (0.0048) [2024-06-28 07:00:43,850][06674] Fps is (10 sec: 44246.2, 60 sec: 44236.9, 300 sec: 44098.4). Total num frames: 2830663680. Throughput: 0: 44204.7. Samples: 2733552900. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 07:00:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:00:44,357][06909] Updated weights for policy 0, policy_version 172773 (0.0034) [2024-06-28 07:00:48,142][06909] Updated weights for policy 0, policy_version 172783 (0.0045) [2024-06-28 07:00:48,850][06674] Fps is (10 sec: 44236.2, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2830893056. Throughput: 0: 44125.3. Samples: 2733822500. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 07:00:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:00:52,330][06909] Updated weights for policy 0, policy_version 172793 (0.0030) [2024-06-28 07:00:53,856][06674] Fps is (10 sec: 44209.7, 60 sec: 44232.3, 300 sec: 44152.6). Total num frames: 2831106048. Throughput: 0: 43951.8. Samples: 2734074840. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 07:00:53,857][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:00:55,734][06909] Updated weights for policy 0, policy_version 172803 (0.0037) [2024-06-28 07:00:58,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44238.3, 300 sec: 44153.5). Total num frames: 2831351808. Throughput: 0: 44101.3. Samples: 2734209900. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 07:00:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:00:59,579][06909] Updated weights for policy 0, policy_version 172813 (0.0038) [2024-06-28 07:01:03,134][06909] Updated weights for policy 0, policy_version 172823 (0.0035) [2024-06-28 07:01:03,851][06674] Fps is (10 sec: 44260.4, 60 sec: 43963.2, 300 sec: 44097.8). Total num frames: 2831548416. Throughput: 0: 44146.0. Samples: 2734478460. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 07:01:03,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:01:06,888][06909] Updated weights for policy 0, policy_version 172833 (0.0030) [2024-06-28 07:01:08,850][06674] Fps is (10 sec: 40959.4, 60 sec: 43963.6, 300 sec: 44098.2). Total num frames: 2831761408. Throughput: 0: 43903.8. Samples: 2734735160. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 07:01:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:01:10,790][06909] Updated weights for policy 0, policy_version 172843 (0.0022) [2024-06-28 07:01:11,267][06887] Signal inference workers to stop experience collection... (38800 times) [2024-06-28 07:01:11,267][06887] Signal inference workers to resume experience collection... (38800 times) [2024-06-28 07:01:11,278][06909] InferenceWorker_p0-w0: stopping experience collection (38800 times) [2024-06-28 07:01:11,279][06909] InferenceWorker_p0-w0: resuming experience collection (38800 times) [2024-06-28 07:01:13,850][06674] Fps is (10 sec: 45878.5, 60 sec: 44509.8, 300 sec: 44209.0). Total num frames: 2832007168. Throughput: 0: 44034.2. Samples: 2734868880. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 07:01:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:01:14,494][06909] Updated weights for policy 0, policy_version 172853 (0.0031) [2024-06-28 07:01:18,225][06909] Updated weights for policy 0, policy_version 172863 (0.0036) [2024-06-28 07:01:18,850][06674] Fps is (10 sec: 45875.7, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2832220160. Throughput: 0: 43973.1. Samples: 2735138080. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 07:01:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:01:21,581][06909] Updated weights for policy 0, policy_version 172873 (0.0036) [2024-06-28 07:01:23,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43690.6, 300 sec: 44098.4). Total num frames: 2832416768. Throughput: 0: 44222.0. Samples: 2735404520. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 07:01:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:01:25,514][06909] Updated weights for policy 0, policy_version 172883 (0.0021) [2024-06-28 07:01:28,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.6, 300 sec: 44153.5). Total num frames: 2832662528. Throughput: 0: 43967.4. Samples: 2735531440. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 07:01:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:01:29,199][06909] Updated weights for policy 0, policy_version 172893 (0.0026) [2024-06-28 07:01:32,741][06909] Updated weights for policy 0, policy_version 172903 (0.0030) [2024-06-28 07:01:33,850][06674] Fps is (10 sec: 47514.5, 60 sec: 44511.5, 300 sec: 44153.5). Total num frames: 2832891904. Throughput: 0: 44041.9. Samples: 2735804380. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 07:01:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 07:01:36,381][06909] Updated weights for policy 0, policy_version 172913 (0.0039) [2024-06-28 07:01:38,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2833088512. Throughput: 0: 44412.7. Samples: 2736073140. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 07:01:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:01:40,044][06909] Updated weights for policy 0, policy_version 172923 (0.0034) [2024-06-28 07:01:43,716][06909] Updated weights for policy 0, policy_version 172933 (0.0042) [2024-06-28 07:01:43,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44509.8, 300 sec: 44209.0). Total num frames: 2833334272. Throughput: 0: 44196.5. Samples: 2736198740. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 07:01:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:01:47,774][06909] Updated weights for policy 0, policy_version 172943 (0.0034) [2024-06-28 07:01:48,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 2833547264. Throughput: 0: 44097.9. Samples: 2736462840. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 07:01:48,851][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 07:01:48,990][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000172947_2833563648.pth... [2024-06-28 07:01:49,038][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000172299_2822946816.pth [2024-06-28 07:01:51,309][06909] Updated weights for policy 0, policy_version 172953 (0.0038) [2024-06-28 07:01:53,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43968.2, 300 sec: 44153.5). Total num frames: 2833743872. Throughput: 0: 44300.7. Samples: 2736728680. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 07:01:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:01:55,017][06909] Updated weights for policy 0, policy_version 172963 (0.0044) [2024-06-28 07:01:58,529][06909] Updated weights for policy 0, policy_version 172973 (0.0042) [2024-06-28 07:01:58,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43963.8, 300 sec: 44154.4). Total num frames: 2833989632. Throughput: 0: 44268.0. Samples: 2736860940. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 07:01:58,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 07:02:02,207][06909] Updated weights for policy 0, policy_version 172983 (0.0026) [2024-06-28 07:02:03,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43964.3, 300 sec: 44153.5). Total num frames: 2834186240. Throughput: 0: 44114.7. Samples: 2737123240. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 07:02:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:02:06,299][06909] Updated weights for policy 0, policy_version 172993 (0.0031) [2024-06-28 07:02:08,850][06674] Fps is (10 sec: 42598.0, 60 sec: 44236.9, 300 sec: 44209.0). Total num frames: 2834415616. Throughput: 0: 44270.3. Samples: 2737396680. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 07:02:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:02:09,885][06909] Updated weights for policy 0, policy_version 173003 (0.0033) [2024-06-28 07:02:13,608][06909] Updated weights for policy 0, policy_version 173013 (0.0026) [2024-06-28 07:02:13,850][06674] Fps is (10 sec: 47513.4, 60 sec: 44236.8, 300 sec: 44209.9). Total num frames: 2834661376. Throughput: 0: 44240.5. Samples: 2737522260. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 07:02:13,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:02:17,495][06909] Updated weights for policy 0, policy_version 173023 (0.0033) [2024-06-28 07:02:18,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2834874368. Throughput: 0: 44133.3. Samples: 2737790380. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 07:02:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:02:20,960][06909] Updated weights for policy 0, policy_version 173033 (0.0033) [2024-06-28 07:02:22,452][06887] Signal inference workers to stop experience collection... (38850 times) [2024-06-28 07:02:22,474][06909] InferenceWorker_p0-w0: stopping experience collection (38850 times) [2024-06-28 07:02:22,513][06887] Signal inference workers to resume experience collection... (38850 times) [2024-06-28 07:02:22,513][06909] InferenceWorker_p0-w0: resuming experience collection (38850 times) [2024-06-28 07:02:23,850][06674] Fps is (10 sec: 40960.4, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2835070976. Throughput: 0: 43967.6. Samples: 2738051680. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 07:02:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:02:24,760][06909] Updated weights for policy 0, policy_version 173043 (0.0035) [2024-06-28 07:02:28,419][06909] Updated weights for policy 0, policy_version 173053 (0.0038) [2024-06-28 07:02:28,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.9, 300 sec: 44154.2). Total num frames: 2835316736. Throughput: 0: 44052.0. Samples: 2738181080. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 07:02:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:02:31,988][06909] Updated weights for policy 0, policy_version 173063 (0.0041) [2024-06-28 07:02:33,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.7, 300 sec: 44153.8). Total num frames: 2835529728. Throughput: 0: 44123.6. Samples: 2738448400. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 07:02:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:02:35,746][06909] Updated weights for policy 0, policy_version 173073 (0.0041) [2024-06-28 07:02:38,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44509.9, 300 sec: 44209.1). Total num frames: 2835759104. Throughput: 0: 44203.9. Samples: 2738717860. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 07:02:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:02:39,256][06909] Updated weights for policy 0, policy_version 173083 (0.0026) [2024-06-28 07:02:43,203][06909] Updated weights for policy 0, policy_version 173093 (0.0038) [2024-06-28 07:02:43,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2835972096. Throughput: 0: 44092.9. Samples: 2738845120. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 07:02:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:02:46,972][06909] Updated weights for policy 0, policy_version 173103 (0.0033) [2024-06-28 07:02:48,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.9, 300 sec: 44209.0). Total num frames: 2836201472. Throughput: 0: 44191.6. Samples: 2739111860. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 07:02:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:02:50,683][06909] Updated weights for policy 0, policy_version 173113 (0.0026) [2024-06-28 07:02:53,852][06674] Fps is (10 sec: 42591.1, 60 sec: 44235.5, 300 sec: 44153.3). Total num frames: 2836398080. Throughput: 0: 44050.0. Samples: 2739379000. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 07:02:53,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:02:54,418][06909] Updated weights for policy 0, policy_version 173123 (0.0027) [2024-06-28 07:02:58,081][06909] Updated weights for policy 0, policy_version 173133 (0.0022) [2024-06-28 07:02:58,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2836611072. Throughput: 0: 44159.1. Samples: 2739509420. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 07:02:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:03:01,682][06909] Updated weights for policy 0, policy_version 173143 (0.0036) [2024-06-28 07:03:03,850][06674] Fps is (10 sec: 45882.7, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 2836856832. Throughput: 0: 43951.5. Samples: 2739768200. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 07:03:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:03:05,427][06909] Updated weights for policy 0, policy_version 173153 (0.0021) [2024-06-28 07:03:08,850][06674] Fps is (10 sec: 47513.7, 60 sec: 44509.9, 300 sec: 44209.0). Total num frames: 2837086208. Throughput: 0: 44140.4. Samples: 2740038000. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 07:03:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:03:09,263][06909] Updated weights for policy 0, policy_version 173163 (0.0023) [2024-06-28 07:03:13,307][06909] Updated weights for policy 0, policy_version 173173 (0.0024) [2024-06-28 07:03:13,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 2837282816. Throughput: 0: 44215.6. Samples: 2740170780. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 07:03:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:03:16,748][06909] Updated weights for policy 0, policy_version 173183 (0.0027) [2024-06-28 07:03:18,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2837512192. Throughput: 0: 44087.6. Samples: 2740432340. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 07:03:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:03:20,561][06909] Updated weights for policy 0, policy_version 173193 (0.0037) [2024-06-28 07:03:23,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 2837741568. Throughput: 0: 44064.9. Samples: 2740700780. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 07:03:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:03:23,937][06909] Updated weights for policy 0, policy_version 173203 (0.0034) [2024-06-28 07:03:27,935][06909] Updated weights for policy 0, policy_version 173213 (0.0045) [2024-06-28 07:03:28,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2837954560. Throughput: 0: 44187.0. Samples: 2740833540. Policy #0 lag: (min: 0.0, avg: 11.7, max: 22.0) [2024-06-28 07:03:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:03:31,894][06909] Updated weights for policy 0, policy_version 173223 (0.0035) [2024-06-28 07:03:33,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2838167552. Throughput: 0: 44007.6. Samples: 2741092200. Policy #0 lag: (min: 0.0, avg: 11.7, max: 22.0) [2024-06-28 07:03:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:03:34,787][06887] Signal inference workers to stop experience collection... (38900 times) [2024-06-28 07:03:34,787][06887] Signal inference workers to resume experience collection... (38900 times) [2024-06-28 07:03:34,803][06909] InferenceWorker_p0-w0: stopping experience collection (38900 times) [2024-06-28 07:03:34,803][06909] InferenceWorker_p0-w0: resuming experience collection (38900 times) [2024-06-28 07:03:35,506][06909] Updated weights for policy 0, policy_version 173233 (0.0040) [2024-06-28 07:03:38,852][06674] Fps is (10 sec: 44227.9, 60 sec: 43962.2, 300 sec: 44097.6). Total num frames: 2838396928. Throughput: 0: 43990.8. Samples: 2741358600. Policy #0 lag: (min: 0.0, avg: 11.7, max: 22.0) [2024-06-28 07:03:38,853][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:03:39,167][06909] Updated weights for policy 0, policy_version 173243 (0.0036) [2024-06-28 07:03:43,003][06909] Updated weights for policy 0, policy_version 173253 (0.0031) [2024-06-28 07:03:43,856][06674] Fps is (10 sec: 44209.9, 60 sec: 43959.3, 300 sec: 44152.6). Total num frames: 2838609920. Throughput: 0: 44160.3. Samples: 2741496900. Policy #0 lag: (min: 0.0, avg: 11.7, max: 22.0) [2024-06-28 07:03:43,857][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:03:46,210][06909] Updated weights for policy 0, policy_version 173263 (0.0030) [2024-06-28 07:03:48,850][06674] Fps is (10 sec: 44246.0, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2838839296. Throughput: 0: 44199.2. Samples: 2741757160. Policy #0 lag: (min: 0.0, avg: 11.7, max: 22.0) [2024-06-28 07:03:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:03:48,862][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000173269_2838839296.pth... [2024-06-28 07:03:48,922][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000172623_2828255232.pth [2024-06-28 07:03:50,653][06909] Updated weights for policy 0, policy_version 173273 (0.0036) [2024-06-28 07:03:53,810][06909] Updated weights for policy 0, policy_version 173283 (0.0031) [2024-06-28 07:03:53,850][06674] Fps is (10 sec: 45902.4, 60 sec: 44511.0, 300 sec: 44153.5). Total num frames: 2839068672. Throughput: 0: 44144.8. Samples: 2742024520. Policy #0 lag: (min: 0.0, avg: 11.7, max: 22.0) [2024-06-28 07:03:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:03:57,947][06909] Updated weights for policy 0, policy_version 173293 (0.0021) [2024-06-28 07:03:58,850][06674] Fps is (10 sec: 42598.7, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2839265280. Throughput: 0: 44221.8. Samples: 2742160760. Policy #0 lag: (min: 0.0, avg: 11.7, max: 22.0) [2024-06-28 07:03:58,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 07:04:01,278][06909] Updated weights for policy 0, policy_version 173303 (0.0048) [2024-06-28 07:04:03,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2839494656. Throughput: 0: 44153.3. Samples: 2742419240. Policy #0 lag: (min: 0.0, avg: 11.7, max: 22.0) [2024-06-28 07:04:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:04:05,493][06909] Updated weights for policy 0, policy_version 173313 (0.0028) [2024-06-28 07:04:08,791][06909] Updated weights for policy 0, policy_version 173323 (0.0033) [2024-06-28 07:04:08,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2839724032. Throughput: 0: 44153.8. Samples: 2742687700. Policy #0 lag: (min: 0.0, avg: 11.7, max: 22.0) [2024-06-28 07:04:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:04:12,754][06909] Updated weights for policy 0, policy_version 173333 (0.0022) [2024-06-28 07:04:13,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 2839953408. Throughput: 0: 44150.3. Samples: 2742820300. Policy #0 lag: (min: 0.0, avg: 11.7, max: 22.0) [2024-06-28 07:04:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:04:16,128][06909] Updated weights for policy 0, policy_version 173343 (0.0031) [2024-06-28 07:04:18,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2840166400. Throughput: 0: 44237.4. Samples: 2743082880. Policy #0 lag: (min: 0.0, avg: 11.7, max: 22.0) [2024-06-28 07:04:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:04:20,437][06909] Updated weights for policy 0, policy_version 173353 (0.0026) [2024-06-28 07:04:23,275][06909] Updated weights for policy 0, policy_version 173363 (0.0031) [2024-06-28 07:04:23,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44509.7, 300 sec: 44209.0). Total num frames: 2840412160. Throughput: 0: 44330.4. Samples: 2743353380. Policy #0 lag: (min: 0.0, avg: 11.7, max: 22.0) [2024-06-28 07:04:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:04:27,686][06909] Updated weights for policy 0, policy_version 173373 (0.0029) [2024-06-28 07:04:28,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2840592384. Throughput: 0: 44267.7. Samples: 2743488680. Policy #0 lag: (min: 0.0, avg: 11.7, max: 22.0) [2024-06-28 07:04:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:04:30,816][06909] Updated weights for policy 0, policy_version 173383 (0.0024) [2024-06-28 07:04:33,852][06674] Fps is (10 sec: 40952.3, 60 sec: 44235.3, 300 sec: 44098.0). Total num frames: 2840821760. Throughput: 0: 44238.0. Samples: 2743747960. Policy #0 lag: (min: 2.0, avg: 9.3, max: 24.0) [2024-06-28 07:04:33,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:04:34,904][06909] Updated weights for policy 0, policy_version 173393 (0.0042) [2024-06-28 07:04:38,361][06909] Updated weights for policy 0, policy_version 173403 (0.0038) [2024-06-28 07:04:38,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44238.3, 300 sec: 44209.0). Total num frames: 2841051136. Throughput: 0: 44174.8. Samples: 2744012380. Policy #0 lag: (min: 2.0, avg: 9.3, max: 24.0) [2024-06-28 07:04:38,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 07:04:39,187][06887] Signal inference workers to stop experience collection... (38950 times) [2024-06-28 07:04:39,216][06909] InferenceWorker_p0-w0: stopping experience collection (38950 times) [2024-06-28 07:04:39,234][06887] Signal inference workers to resume experience collection... (38950 times) [2024-06-28 07:04:39,235][06909] InferenceWorker_p0-w0: resuming experience collection (38950 times) [2024-06-28 07:04:42,431][06909] Updated weights for policy 0, policy_version 173413 (0.0041) [2024-06-28 07:04:43,850][06674] Fps is (10 sec: 44245.8, 60 sec: 44241.3, 300 sec: 44153.5). Total num frames: 2841264128. Throughput: 0: 44111.0. Samples: 2744145760. Policy #0 lag: (min: 2.0, avg: 9.3, max: 24.0) [2024-06-28 07:04:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:04:45,576][06909] Updated weights for policy 0, policy_version 173423 (0.0037) [2024-06-28 07:04:48,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 2841493504. Throughput: 0: 44118.6. Samples: 2744404580. Policy #0 lag: (min: 2.0, avg: 9.3, max: 24.0) [2024-06-28 07:04:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:04:49,708][06909] Updated weights for policy 0, policy_version 173433 (0.0030) [2024-06-28 07:04:53,301][06909] Updated weights for policy 0, policy_version 173443 (0.0031) [2024-06-28 07:04:53,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.9, 300 sec: 44153.8). Total num frames: 2841722880. Throughput: 0: 44068.9. Samples: 2744670800. Policy #0 lag: (min: 2.0, avg: 9.3, max: 24.0) [2024-06-28 07:04:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:04:57,468][06909] Updated weights for policy 0, policy_version 173453 (0.0040) [2024-06-28 07:04:58,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2841919488. Throughput: 0: 44095.5. Samples: 2744804600. Policy #0 lag: (min: 2.0, avg: 9.3, max: 24.0) [2024-06-28 07:04:58,853][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:05:00,478][06909] Updated weights for policy 0, policy_version 173463 (0.0038) [2024-06-28 07:05:03,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2842132480. Throughput: 0: 44196.9. Samples: 2745071740. Policy #0 lag: (min: 2.0, avg: 9.3, max: 24.0) [2024-06-28 07:05:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 07:05:04,749][06909] Updated weights for policy 0, policy_version 173473 (0.0027) [2024-06-28 07:05:07,844][06909] Updated weights for policy 0, policy_version 173483 (0.0028) [2024-06-28 07:05:08,850][06674] Fps is (10 sec: 45875.8, 60 sec: 44236.9, 300 sec: 44209.0). Total num frames: 2842378240. Throughput: 0: 44084.2. Samples: 2745337160. Policy #0 lag: (min: 2.0, avg: 9.3, max: 24.0) [2024-06-28 07:05:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:05:12,021][06909] Updated weights for policy 0, policy_version 173493 (0.0035) [2024-06-28 07:05:13,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2842591232. Throughput: 0: 44008.4. Samples: 2745469060. Policy #0 lag: (min: 2.0, avg: 9.3, max: 24.0) [2024-06-28 07:05:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:05:15,478][06909] Updated weights for policy 0, policy_version 173503 (0.0043) [2024-06-28 07:05:18,850][06674] Fps is (10 sec: 42597.1, 60 sec: 43963.6, 300 sec: 44097.9). Total num frames: 2842804224. Throughput: 0: 44132.4. Samples: 2745733840. Policy #0 lag: (min: 2.0, avg: 9.3, max: 24.0) [2024-06-28 07:05:18,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:05:19,647][06909] Updated weights for policy 0, policy_version 173513 (0.0041) [2024-06-28 07:05:22,719][06909] Updated weights for policy 0, policy_version 173523 (0.0027) [2024-06-28 07:05:23,852][06674] Fps is (10 sec: 47504.0, 60 sec: 44235.4, 300 sec: 44208.7). Total num frames: 2843066368. Throughput: 0: 44147.3. Samples: 2745999100. Policy #0 lag: (min: 2.0, avg: 9.3, max: 24.0) [2024-06-28 07:05:23,853][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:05:26,870][06909] Updated weights for policy 0, policy_version 173533 (0.0028) [2024-06-28 07:05:28,850][06674] Fps is (10 sec: 45876.4, 60 sec: 44509.9, 300 sec: 44209.3). Total num frames: 2843262976. Throughput: 0: 44279.1. Samples: 2746138320. Policy #0 lag: (min: 2.0, avg: 9.3, max: 24.0) [2024-06-28 07:05:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:05:30,326][06909] Updated weights for policy 0, policy_version 173543 (0.0038) [2024-06-28 07:05:33,850][06674] Fps is (10 sec: 40968.5, 60 sec: 44238.3, 300 sec: 44153.5). Total num frames: 2843475968. Throughput: 0: 44280.9. Samples: 2746397220. Policy #0 lag: (min: 2.0, avg: 9.3, max: 24.0) [2024-06-28 07:05:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 07:05:34,497][06909] Updated weights for policy 0, policy_version 173553 (0.0040) [2024-06-28 07:05:37,628][06909] Updated weights for policy 0, policy_version 173563 (0.0039) [2024-06-28 07:05:38,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 2843705344. Throughput: 0: 44207.0. Samples: 2746660120. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 07:05:38,850][06674] Avg episode reward: [(0, '0.460')] [2024-06-28 07:05:41,733][06909] Updated weights for policy 0, policy_version 173573 (0.0040) [2024-06-28 07:05:43,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2843918336. Throughput: 0: 44247.6. Samples: 2746795740. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 07:05:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 07:05:44,877][06909] Updated weights for policy 0, policy_version 173583 (0.0033) [2024-06-28 07:05:48,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43963.8, 300 sec: 44154.4). Total num frames: 2844131328. Throughput: 0: 44244.5. Samples: 2747062740. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 07:05:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:05:48,862][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000173592_2844131328.pth... [2024-06-28 07:05:48,923][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000172947_2833563648.pth [2024-06-28 07:05:49,363][06909] Updated weights for policy 0, policy_version 173593 (0.0041) [2024-06-28 07:05:52,628][06909] Updated weights for policy 0, policy_version 173603 (0.0033) [2024-06-28 07:05:53,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2844360704. Throughput: 0: 44051.0. Samples: 2747319460. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 07:05:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:05:56,531][06909] Updated weights for policy 0, policy_version 173613 (0.0031) [2024-06-28 07:05:58,852][06674] Fps is (10 sec: 44227.4, 60 sec: 44235.3, 300 sec: 44153.3). Total num frames: 2844573696. Throughput: 0: 44157.6. Samples: 2747456240. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 07:05:58,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:06:00,145][06909] Updated weights for policy 0, policy_version 173623 (0.0026) [2024-06-28 07:06:03,850][06674] Fps is (10 sec: 42598.2, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 2844786688. Throughput: 0: 44105.5. Samples: 2747718580. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 07:06:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:06:03,903][06909] Updated weights for policy 0, policy_version 173633 (0.0035) [2024-06-28 07:06:07,620][06909] Updated weights for policy 0, policy_version 173643 (0.0039) [2024-06-28 07:06:08,850][06674] Fps is (10 sec: 42607.2, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2844999680. Throughput: 0: 43911.4. Samples: 2747975020. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 07:06:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:06:11,650][06909] Updated weights for policy 0, policy_version 173653 (0.0032) [2024-06-28 07:06:13,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2845229056. Throughput: 0: 43781.7. Samples: 2748108500. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 07:06:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:06:14,955][06909] Updated weights for policy 0, policy_version 173663 (0.0036) [2024-06-28 07:06:18,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.9, 300 sec: 44153.5). Total num frames: 2845442048. Throughput: 0: 43874.3. Samples: 2748371560. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 07:06:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:06:19,293][06909] Updated weights for policy 0, policy_version 173673 (0.0035) [2024-06-28 07:06:22,336][06909] Updated weights for policy 0, policy_version 173683 (0.0027) [2024-06-28 07:06:23,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43419.0, 300 sec: 44098.0). Total num frames: 2845671424. Throughput: 0: 43816.0. Samples: 2748631840. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 07:06:23,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:06:26,620][06909] Updated weights for policy 0, policy_version 173693 (0.0034) [2024-06-28 07:06:26,906][06887] Signal inference workers to stop experience collection... (39000 times) [2024-06-28 07:06:26,906][06887] Signal inference workers to resume experience collection... (39000 times) [2024-06-28 07:06:26,929][06909] InferenceWorker_p0-w0: stopping experience collection (39000 times) [2024-06-28 07:06:26,929][06909] InferenceWorker_p0-w0: resuming experience collection (39000 times) [2024-06-28 07:06:28,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2845884416. Throughput: 0: 43852.0. Samples: 2748769080. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 07:06:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:06:29,862][06909] Updated weights for policy 0, policy_version 173703 (0.0043) [2024-06-28 07:06:33,729][06909] Updated weights for policy 0, policy_version 173713 (0.0030) [2024-06-28 07:06:33,850][06674] Fps is (10 sec: 44237.6, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2846113792. Throughput: 0: 43794.2. Samples: 2749033480. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 07:06:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:06:37,240][06909] Updated weights for policy 0, policy_version 173723 (0.0028) [2024-06-28 07:06:38,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2846326784. Throughput: 0: 44060.9. Samples: 2749302200. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 07:06:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:06:41,008][06909] Updated weights for policy 0, policy_version 173733 (0.0033) [2024-06-28 07:06:43,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43417.6, 300 sec: 43986.9). Total num frames: 2846523392. Throughput: 0: 43906.1. Samples: 2749431920. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 07:06:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:06:44,486][06909] Updated weights for policy 0, policy_version 173743 (0.0048) [2024-06-28 07:06:48,414][06909] Updated weights for policy 0, policy_version 173753 (0.0036) [2024-06-28 07:06:48,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44236.7, 300 sec: 44209.0). Total num frames: 2846785536. Throughput: 0: 44046.6. Samples: 2749700680. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 07:06:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:06:51,678][06909] Updated weights for policy 0, policy_version 173763 (0.0032) [2024-06-28 07:06:53,850][06674] Fps is (10 sec: 47513.3, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2846998528. Throughput: 0: 44166.2. Samples: 2749962500. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 07:06:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:06:56,242][06909] Updated weights for policy 0, policy_version 173773 (0.0027) [2024-06-28 07:06:58,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44238.3, 300 sec: 44209.0). Total num frames: 2847227904. Throughput: 0: 44061.8. Samples: 2750091280. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 07:06:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:06:59,461][06909] Updated weights for policy 0, policy_version 173783 (0.0028) [2024-06-28 07:07:03,623][06909] Updated weights for policy 0, policy_version 173793 (0.0019) [2024-06-28 07:07:03,850][06674] Fps is (10 sec: 42596.9, 60 sec: 43963.5, 300 sec: 44097.9). Total num frames: 2847424512. Throughput: 0: 44147.2. Samples: 2750358200. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 07:07:03,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 07:07:07,118][06909] Updated weights for policy 0, policy_version 173803 (0.0026) [2024-06-28 07:07:08,856][06674] Fps is (10 sec: 42572.8, 60 sec: 44232.3, 300 sec: 44041.5). Total num frames: 2847653888. Throughput: 0: 44136.8. Samples: 2750618260. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 07:07:08,856][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:07:11,174][06909] Updated weights for policy 0, policy_version 173813 (0.0037) [2024-06-28 07:07:13,850][06674] Fps is (10 sec: 44238.4, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2847866880. Throughput: 0: 43958.2. Samples: 2750747200. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 07:07:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:07:14,592][06909] Updated weights for policy 0, policy_version 173823 (0.0032) [2024-06-28 07:07:18,379][06909] Updated weights for policy 0, policy_version 173833 (0.0031) [2024-06-28 07:07:18,850][06674] Fps is (10 sec: 45902.8, 60 sec: 44509.8, 300 sec: 44209.0). Total num frames: 2848112640. Throughput: 0: 44244.8. Samples: 2751024500. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 07:07:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:07:22,096][06909] Updated weights for policy 0, policy_version 173843 (0.0024) [2024-06-28 07:07:23,856][06674] Fps is (10 sec: 44209.6, 60 sec: 43959.3, 300 sec: 44041.5). Total num frames: 2848309248. Throughput: 0: 43917.6. Samples: 2751278760. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 07:07:23,857][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 07:07:25,892][06909] Updated weights for policy 0, policy_version 173853 (0.0041) [2024-06-28 07:07:28,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 2848555008. Throughput: 0: 43942.5. Samples: 2751409340. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 07:07:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:07:29,202][06909] Updated weights for policy 0, policy_version 173863 (0.0026) [2024-06-28 07:07:33,525][06909] Updated weights for policy 0, policy_version 173873 (0.0035) [2024-06-28 07:07:33,850][06674] Fps is (10 sec: 44263.7, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 2848751616. Throughput: 0: 44025.8. Samples: 2751681840. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 07:07:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:07:36,870][06909] Updated weights for policy 0, policy_version 173883 (0.0038) [2024-06-28 07:07:38,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2848964608. Throughput: 0: 43854.7. Samples: 2751935960. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 07:07:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:07:40,930][06909] Updated weights for policy 0, policy_version 173893 (0.0033) [2024-06-28 07:07:43,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 2849193984. Throughput: 0: 43931.2. Samples: 2752068180. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 07:07:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:07:44,662][06909] Updated weights for policy 0, policy_version 173903 (0.0031) [2024-06-28 07:07:48,433][06909] Updated weights for policy 0, policy_version 173913 (0.0032) [2024-06-28 07:07:48,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.8, 300 sec: 44098.2). Total num frames: 2849406976. Throughput: 0: 43956.3. Samples: 2752336220. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 07:07:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:07:48,892][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000173915_2849423360.pth... [2024-06-28 07:07:48,942][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000173269_2838839296.pth [2024-06-28 07:07:49,118][06887] Signal inference workers to stop experience collection... (39050 times) [2024-06-28 07:07:49,118][06887] Signal inference workers to resume experience collection... (39050 times) [2024-06-28 07:07:49,135][06909] InferenceWorker_p0-w0: stopping experience collection (39050 times) [2024-06-28 07:07:49,135][06909] InferenceWorker_p0-w0: resuming experience collection (39050 times) [2024-06-28 07:07:52,120][06909] Updated weights for policy 0, policy_version 173923 (0.0036) [2024-06-28 07:07:53,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 2849619968. Throughput: 0: 43974.0. Samples: 2752596820. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 07:07:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:07:55,925][06909] Updated weights for policy 0, policy_version 173933 (0.0035) [2024-06-28 07:07:58,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2849865728. Throughput: 0: 43975.9. Samples: 2752726120. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 07:07:58,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:07:59,323][06909] Updated weights for policy 0, policy_version 173943 (0.0026) [2024-06-28 07:08:03,544][06909] Updated weights for policy 0, policy_version 173953 (0.0048) [2024-06-28 07:08:03,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43964.0, 300 sec: 43986.9). Total num frames: 2850062336. Throughput: 0: 43749.3. Samples: 2752993220. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 07:08:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:08:06,602][06909] Updated weights for policy 0, policy_version 173963 (0.0036) [2024-06-28 07:08:08,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43695.0, 300 sec: 44042.4). Total num frames: 2850275328. Throughput: 0: 43873.5. Samples: 2753252800. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 07:08:08,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 07:08:10,827][06909] Updated weights for policy 0, policy_version 173973 (0.0046) [2024-06-28 07:08:13,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2850521088. Throughput: 0: 43844.0. Samples: 2753382320. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 07:08:13,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:08:14,305][06909] Updated weights for policy 0, policy_version 173983 (0.0037) [2024-06-28 07:08:18,190][06909] Updated weights for policy 0, policy_version 173993 (0.0037) [2024-06-28 07:08:18,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2850734080. Throughput: 0: 43845.4. Samples: 2753654880. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 07:08:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:08:21,808][06909] Updated weights for policy 0, policy_version 174003 (0.0021) [2024-06-28 07:08:23,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43968.2, 300 sec: 44042.4). Total num frames: 2850947072. Throughput: 0: 43984.4. Samples: 2753915260. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 07:08:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:08:25,550][06909] Updated weights for policy 0, policy_version 174013 (0.0021) [2024-06-28 07:08:28,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.7, 300 sec: 44097.9). Total num frames: 2851176448. Throughput: 0: 44000.8. Samples: 2754048220. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 07:08:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:08:28,982][06909] Updated weights for policy 0, policy_version 174023 (0.0030) [2024-06-28 07:08:32,781][06909] Updated weights for policy 0, policy_version 174033 (0.0030) [2024-06-28 07:08:33,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44509.9, 300 sec: 44153.8). Total num frames: 2851422208. Throughput: 0: 44107.5. Samples: 2754321060. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 07:08:33,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:08:36,131][06909] Updated weights for policy 0, policy_version 174043 (0.0035) [2024-06-28 07:08:38,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 44043.3). Total num frames: 2851602432. Throughput: 0: 44266.1. Samples: 2754588800. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 07:08:38,850][06674] Avg episode reward: [(0, '0.445')] [2024-06-28 07:08:40,248][06909] Updated weights for policy 0, policy_version 174053 (0.0047) [2024-06-28 07:08:43,448][06909] Updated weights for policy 0, policy_version 174063 (0.0035) [2024-06-28 07:08:43,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2851848192. Throughput: 0: 44155.6. Samples: 2754713120. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 07:08:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:08:47,938][06909] Updated weights for policy 0, policy_version 174073 (0.0028) [2024-06-28 07:08:48,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2852044800. Throughput: 0: 44052.9. Samples: 2754975600. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 07:08:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:08:51,660][06909] Updated weights for policy 0, policy_version 174083 (0.0032) [2024-06-28 07:08:53,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2852257792. Throughput: 0: 44026.3. Samples: 2755233980. Policy #0 lag: (min: 2.0, avg: 11.3, max: 21.0) [2024-06-28 07:08:53,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:08:55,408][06909] Updated weights for policy 0, policy_version 174093 (0.0034) [2024-06-28 07:08:58,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2852487168. Throughput: 0: 44126.7. Samples: 2755368020. Policy #0 lag: (min: 2.0, avg: 11.3, max: 21.0) [2024-06-28 07:08:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:08:59,072][06909] Updated weights for policy 0, policy_version 174103 (0.0028) [2024-06-28 07:09:02,778][06909] Updated weights for policy 0, policy_version 174113 (0.0031) [2024-06-28 07:09:03,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2852716544. Throughput: 0: 43954.2. Samples: 2755632820. Policy #0 lag: (min: 2.0, avg: 11.3, max: 21.0) [2024-06-28 07:09:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:09:06,486][06909] Updated weights for policy 0, policy_version 174123 (0.0031) [2024-06-28 07:09:08,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 2852913152. Throughput: 0: 43976.5. Samples: 2755894200. Policy #0 lag: (min: 2.0, avg: 11.3, max: 21.0) [2024-06-28 07:09:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:09:10,261][06909] Updated weights for policy 0, policy_version 174133 (0.0025) [2024-06-28 07:09:13,679][06909] Updated weights for policy 0, policy_version 174143 (0.0040) [2024-06-28 07:09:13,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2853158912. Throughput: 0: 43923.1. Samples: 2756024760. Policy #0 lag: (min: 2.0, avg: 11.3, max: 21.0) [2024-06-28 07:09:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 07:09:17,622][06909] Updated weights for policy 0, policy_version 174153 (0.0026) [2024-06-28 07:09:18,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 2853371904. Throughput: 0: 43755.1. Samples: 2756290040. Policy #0 lag: (min: 2.0, avg: 11.3, max: 21.0) [2024-06-28 07:09:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:09:20,390][06887] Signal inference workers to stop experience collection... (39100 times) [2024-06-28 07:09:20,422][06909] InferenceWorker_p0-w0: stopping experience collection (39100 times) [2024-06-28 07:09:20,447][06887] Signal inference workers to resume experience collection... (39100 times) [2024-06-28 07:09:20,452][06909] InferenceWorker_p0-w0: resuming experience collection (39100 times) [2024-06-28 07:09:20,983][06909] Updated weights for policy 0, policy_version 174163 (0.0024) [2024-06-28 07:09:23,851][06674] Fps is (10 sec: 40954.1, 60 sec: 43689.6, 300 sec: 43986.7). Total num frames: 2853568512. Throughput: 0: 43628.4. Samples: 2756552140. Policy #0 lag: (min: 2.0, avg: 11.3, max: 21.0) [2024-06-28 07:09:23,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:09:25,252][06909] Updated weights for policy 0, policy_version 174173 (0.0041) [2024-06-28 07:09:28,852][06674] Fps is (10 sec: 42589.9, 60 sec: 43689.2, 300 sec: 43986.9). Total num frames: 2853797888. Throughput: 0: 43701.6. Samples: 2756679780. Policy #0 lag: (min: 2.0, avg: 11.3, max: 21.0) [2024-06-28 07:09:28,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:09:28,996][06909] Updated weights for policy 0, policy_version 174183 (0.0030) [2024-06-28 07:09:32,725][06909] Updated weights for policy 0, policy_version 174193 (0.0051) [2024-06-28 07:09:33,850][06674] Fps is (10 sec: 45881.1, 60 sec: 43417.5, 300 sec: 43986.8). Total num frames: 2854027264. Throughput: 0: 43808.2. Samples: 2756946980. Policy #0 lag: (min: 2.0, avg: 11.3, max: 21.0) [2024-06-28 07:09:33,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:09:36,444][06909] Updated weights for policy 0, policy_version 174203 (0.0044) [2024-06-28 07:09:38,850][06674] Fps is (10 sec: 42606.8, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 2854223872. Throughput: 0: 43907.5. Samples: 2757209820. Policy #0 lag: (min: 2.0, avg: 11.3, max: 21.0) [2024-06-28 07:09:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:09:39,897][06909] Updated weights for policy 0, policy_version 174213 (0.0031) [2024-06-28 07:09:43,705][06909] Updated weights for policy 0, policy_version 174223 (0.0027) [2024-06-28 07:09:43,850][06674] Fps is (10 sec: 44237.8, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2854469632. Throughput: 0: 43844.5. Samples: 2757341020. Policy #0 lag: (min: 2.0, avg: 11.3, max: 21.0) [2024-06-28 07:09:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:09:47,386][06909] Updated weights for policy 0, policy_version 174233 (0.0023) [2024-06-28 07:09:48,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 2854682624. Throughput: 0: 43854.3. Samples: 2757606260. Policy #0 lag: (min: 2.0, avg: 11.3, max: 21.0) [2024-06-28 07:09:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:09:48,958][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000174237_2854699008.pth... [2024-06-28 07:09:49,003][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000173592_2844131328.pth [2024-06-28 07:09:51,137][06909] Updated weights for policy 0, policy_version 174243 (0.0030) [2024-06-28 07:09:53,852][06674] Fps is (10 sec: 40951.5, 60 sec: 43689.2, 300 sec: 43931.0). Total num frames: 2854879232. Throughput: 0: 43923.3. Samples: 2757870840. Policy #0 lag: (min: 2.0, avg: 11.3, max: 21.0) [2024-06-28 07:09:53,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:09:55,306][06909] Updated weights for policy 0, policy_version 174253 (0.0031) [2024-06-28 07:09:58,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2855108608. Throughput: 0: 43668.9. Samples: 2757989860. Policy #0 lag: (min: 0.0, avg: 11.3, max: 24.0) [2024-06-28 07:09:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:09:59,051][06909] Updated weights for policy 0, policy_version 174263 (0.0033) [2024-06-28 07:10:02,482][06909] Updated weights for policy 0, policy_version 174273 (0.0031) [2024-06-28 07:10:03,850][06674] Fps is (10 sec: 47523.6, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2855354368. Throughput: 0: 43847.2. Samples: 2758263160. Policy #0 lag: (min: 0.0, avg: 11.3, max: 24.0) [2024-06-28 07:10:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:10:06,332][06909] Updated weights for policy 0, policy_version 174283 (0.0045) [2024-06-28 07:10:08,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43690.5, 300 sec: 43875.8). Total num frames: 2855534592. Throughput: 0: 43801.8. Samples: 2758523160. Policy #0 lag: (min: 0.0, avg: 11.3, max: 24.0) [2024-06-28 07:10:08,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:10:09,879][06909] Updated weights for policy 0, policy_version 174293 (0.0026) [2024-06-28 07:10:13,856][06674] Fps is (10 sec: 40934.8, 60 sec: 43413.2, 300 sec: 43930.5). Total num frames: 2855763968. Throughput: 0: 43830.7. Samples: 2758652340. Policy #0 lag: (min: 0.0, avg: 11.3, max: 24.0) [2024-06-28 07:10:13,856][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:10:13,928][06909] Updated weights for policy 0, policy_version 174303 (0.0032) [2024-06-28 07:10:17,321][06909] Updated weights for policy 0, policy_version 174313 (0.0032) [2024-06-28 07:10:18,850][06674] Fps is (10 sec: 45876.0, 60 sec: 43690.7, 300 sec: 43820.6). Total num frames: 2855993344. Throughput: 0: 43807.3. Samples: 2758918300. Policy #0 lag: (min: 0.0, avg: 11.3, max: 24.0) [2024-06-28 07:10:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:10:21,106][06909] Updated weights for policy 0, policy_version 174323 (0.0037) [2024-06-28 07:10:23,850][06674] Fps is (10 sec: 44264.0, 60 sec: 43964.9, 300 sec: 43875.8). Total num frames: 2856206336. Throughput: 0: 43972.6. Samples: 2759188580. Policy #0 lag: (min: 0.0, avg: 11.3, max: 24.0) [2024-06-28 07:10:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:10:24,817][06909] Updated weights for policy 0, policy_version 174333 (0.0031) [2024-06-28 07:10:28,616][06909] Updated weights for policy 0, policy_version 174343 (0.0039) [2024-06-28 07:10:28,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43965.2, 300 sec: 43931.3). Total num frames: 2856435712. Throughput: 0: 43818.1. Samples: 2759312840. Policy #0 lag: (min: 0.0, avg: 11.3, max: 24.0) [2024-06-28 07:10:28,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:10:31,627][06887] Signal inference workers to stop experience collection... (39150 times) [2024-06-28 07:10:31,679][06887] Signal inference workers to resume experience collection... (39150 times) [2024-06-28 07:10:31,680][06909] InferenceWorker_p0-w0: stopping experience collection (39150 times) [2024-06-28 07:10:31,698][06909] InferenceWorker_p0-w0: resuming experience collection (39150 times) [2024-06-28 07:10:32,447][06909] Updated weights for policy 0, policy_version 174353 (0.0032) [2024-06-28 07:10:33,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 2856665088. Throughput: 0: 43930.2. Samples: 2759583120. Policy #0 lag: (min: 0.0, avg: 11.3, max: 24.0) [2024-06-28 07:10:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:10:36,278][06909] Updated weights for policy 0, policy_version 174363 (0.0032) [2024-06-28 07:10:38,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 2856861696. Throughput: 0: 43898.5. Samples: 2759846180. Policy #0 lag: (min: 0.0, avg: 11.3, max: 24.0) [2024-06-28 07:10:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:10:39,768][06909] Updated weights for policy 0, policy_version 174373 (0.0028) [2024-06-28 07:10:43,693][06909] Updated weights for policy 0, policy_version 174383 (0.0026) [2024-06-28 07:10:43,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 2857091072. Throughput: 0: 44068.4. Samples: 2759972940. Policy #0 lag: (min: 0.0, avg: 11.3, max: 24.0) [2024-06-28 07:10:43,854][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:10:47,033][06909] Updated weights for policy 0, policy_version 174393 (0.0030) [2024-06-28 07:10:48,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2857336832. Throughput: 0: 43900.0. Samples: 2760238660. Policy #0 lag: (min: 0.0, avg: 11.3, max: 24.0) [2024-06-28 07:10:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:10:51,093][06909] Updated weights for policy 0, policy_version 174403 (0.0025) [2024-06-28 07:10:53,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44238.3, 300 sec: 43931.6). Total num frames: 2857533440. Throughput: 0: 44123.3. Samples: 2760508700. Policy #0 lag: (min: 0.0, avg: 11.3, max: 24.0) [2024-06-28 07:10:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:10:54,653][06909] Updated weights for policy 0, policy_version 174413 (0.0036) [2024-06-28 07:10:58,518][06909] Updated weights for policy 0, policy_version 174423 (0.0028) [2024-06-28 07:10:58,850][06674] Fps is (10 sec: 40959.3, 60 sec: 43963.6, 300 sec: 43931.3). Total num frames: 2857746432. Throughput: 0: 43969.8. Samples: 2760630720. Policy #0 lag: (min: 0.0, avg: 11.3, max: 24.0) [2024-06-28 07:10:58,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:11:02,243][06909] Updated weights for policy 0, policy_version 174433 (0.0030) [2024-06-28 07:11:03,852][06674] Fps is (10 sec: 45863.3, 60 sec: 43961.8, 300 sec: 44042.0). Total num frames: 2857992192. Throughput: 0: 44039.7. Samples: 2760900200. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 07:11:03,853][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:11:06,597][06909] Updated weights for policy 0, policy_version 174443 (0.0038) [2024-06-28 07:11:08,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44236.9, 300 sec: 43931.3). Total num frames: 2858188800. Throughput: 0: 43933.2. Samples: 2761165580. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 07:11:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 07:11:09,642][06909] Updated weights for policy 0, policy_version 174453 (0.0026) [2024-06-28 07:11:13,850][06674] Fps is (10 sec: 39332.1, 60 sec: 43695.2, 300 sec: 43875.8). Total num frames: 2858385408. Throughput: 0: 43894.0. Samples: 2761288060. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 07:11:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:11:13,859][06909] Updated weights for policy 0, policy_version 174463 (0.0020) [2024-06-28 07:11:17,028][06909] Updated weights for policy 0, policy_version 174473 (0.0030) [2024-06-28 07:11:18,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2858647552. Throughput: 0: 43723.2. Samples: 2761550660. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 07:11:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:11:21,067][06909] Updated weights for policy 0, policy_version 174483 (0.0034) [2024-06-28 07:11:23,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 2858827776. Throughput: 0: 43965.3. Samples: 2761824620. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 07:11:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:11:24,429][06909] Updated weights for policy 0, policy_version 174493 (0.0033) [2024-06-28 07:11:28,580][06909] Updated weights for policy 0, policy_version 174503 (0.0042) [2024-06-28 07:11:28,853][06674] Fps is (10 sec: 40946.9, 60 sec: 43688.4, 300 sec: 43875.3). Total num frames: 2859057152. Throughput: 0: 43866.3. Samples: 2761947060. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 07:11:28,854][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:11:31,818][06909] Updated weights for policy 0, policy_version 174513 (0.0033) [2024-06-28 07:11:33,850][06674] Fps is (10 sec: 47514.0, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2859302912. Throughput: 0: 43832.5. Samples: 2762211120. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 07:11:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:11:35,723][06909] Updated weights for policy 0, policy_version 174523 (0.0033) [2024-06-28 07:11:38,850][06674] Fps is (10 sec: 45889.8, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2859515904. Throughput: 0: 44007.1. Samples: 2762489020. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 07:11:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:11:39,489][06909] Updated weights for policy 0, policy_version 174533 (0.0031) [2024-06-28 07:11:39,701][06887] Signal inference workers to stop experience collection... (39200 times) [2024-06-28 07:11:39,752][06909] InferenceWorker_p0-w0: stopping experience collection (39200 times) [2024-06-28 07:11:39,752][06887] Signal inference workers to resume experience collection... (39200 times) [2024-06-28 07:11:39,773][06909] InferenceWorker_p0-w0: resuming experience collection (39200 times) [2024-06-28 07:11:43,586][06909] Updated weights for policy 0, policy_version 174543 (0.0031) [2024-06-28 07:11:43,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 2859712512. Throughput: 0: 43862.8. Samples: 2762604540. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 07:11:43,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:11:46,838][06909] Updated weights for policy 0, policy_version 174553 (0.0033) [2024-06-28 07:11:48,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 2859958272. Throughput: 0: 43760.2. Samples: 2762869300. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 07:11:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:11:48,903][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000174559_2859974656.pth... [2024-06-28 07:11:48,949][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000173915_2849423360.pth [2024-06-28 07:11:50,887][06909] Updated weights for policy 0, policy_version 174563 (0.0028) [2024-06-28 07:11:53,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 2860171264. Throughput: 0: 43919.2. Samples: 2763141940. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 07:11:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:11:54,168][06909] Updated weights for policy 0, policy_version 174573 (0.0023) [2024-06-28 07:11:58,601][06909] Updated weights for policy 0, policy_version 174583 (0.0031) [2024-06-28 07:11:58,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43690.8, 300 sec: 43875.8). Total num frames: 2860367872. Throughput: 0: 44007.5. Samples: 2763268400. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 07:11:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:12:01,669][06909] Updated weights for policy 0, policy_version 174593 (0.0024) [2024-06-28 07:12:03,850][06674] Fps is (10 sec: 47513.2, 60 sec: 44238.6, 300 sec: 44043.3). Total num frames: 2860646400. Throughput: 0: 44036.8. Samples: 2763532320. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 07:12:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:12:05,766][06909] Updated weights for policy 0, policy_version 174603 (0.0042) [2024-06-28 07:12:08,850][06674] Fps is (10 sec: 49152.3, 60 sec: 44510.0, 300 sec: 44042.4). Total num frames: 2860859392. Throughput: 0: 43940.6. Samples: 2763801940. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 07:12:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:12:08,854][06909] Updated weights for policy 0, policy_version 174613 (0.0038) [2024-06-28 07:12:12,988][06909] Updated weights for policy 0, policy_version 174623 (0.0034) [2024-06-28 07:12:13,852][06674] Fps is (10 sec: 39313.8, 60 sec: 44235.2, 300 sec: 43820.0). Total num frames: 2861039616. Throughput: 0: 43930.9. Samples: 2763923900. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 07:12:13,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:12:16,722][06909] Updated weights for policy 0, policy_version 174633 (0.0027) [2024-06-28 07:12:18,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.8, 300 sec: 44043.3). Total num frames: 2861301760. Throughput: 0: 44195.5. Samples: 2764199920. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 07:12:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:12:20,949][06909] Updated weights for policy 0, policy_version 174643 (0.0025) [2024-06-28 07:12:23,850][06674] Fps is (10 sec: 45884.8, 60 sec: 44509.9, 300 sec: 43875.8). Total num frames: 2861498368. Throughput: 0: 43877.4. Samples: 2764463500. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 07:12:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:12:23,972][06909] Updated weights for policy 0, policy_version 174653 (0.0025) [2024-06-28 07:12:28,161][06909] Updated weights for policy 0, policy_version 174663 (0.0027) [2024-06-28 07:12:28,850][06674] Fps is (10 sec: 40959.9, 60 sec: 44239.1, 300 sec: 43931.3). Total num frames: 2861711360. Throughput: 0: 44133.3. Samples: 2764590540. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 07:12:28,856][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:12:31,252][06909] Updated weights for policy 0, policy_version 174673 (0.0026) [2024-06-28 07:12:33,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2861957120. Throughput: 0: 44164.0. Samples: 2764856680. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 07:12:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:12:35,515][06909] Updated weights for policy 0, policy_version 174683 (0.0028) [2024-06-28 07:12:38,665][06909] Updated weights for policy 0, policy_version 174693 (0.0035) [2024-06-28 07:12:38,850][06674] Fps is (10 sec: 47513.7, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 2862186496. Throughput: 0: 44226.3. Samples: 2765132120. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 07:12:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:12:42,922][06909] Updated weights for policy 0, policy_version 174703 (0.0035) [2024-06-28 07:12:43,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44509.8, 300 sec: 43986.9). Total num frames: 2862383104. Throughput: 0: 44293.7. Samples: 2765261620. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 07:12:43,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:12:45,842][06909] Updated weights for policy 0, policy_version 174713 (0.0031) [2024-06-28 07:12:48,852][06674] Fps is (10 sec: 42589.5, 60 sec: 44235.3, 300 sec: 44042.1). Total num frames: 2862612480. Throughput: 0: 44430.9. Samples: 2765531800. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 07:12:48,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:12:50,071][06909] Updated weights for policy 0, policy_version 174723 (0.0029) [2024-06-28 07:12:53,289][06909] Updated weights for policy 0, policy_version 174733 (0.0031) [2024-06-28 07:12:53,850][06674] Fps is (10 sec: 47513.9, 60 sec: 44782.9, 300 sec: 44042.4). Total num frames: 2862858240. Throughput: 0: 44435.9. Samples: 2765801560. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 07:12:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 07:12:57,396][06909] Updated weights for policy 0, policy_version 174743 (0.0031) [2024-06-28 07:12:58,850][06674] Fps is (10 sec: 42607.1, 60 sec: 44509.8, 300 sec: 43986.9). Total num frames: 2863038464. Throughput: 0: 44754.9. Samples: 2765937780. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 07:12:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:13:00,586][06887] Signal inference workers to stop experience collection... (39250 times) [2024-06-28 07:13:00,586][06887] Signal inference workers to resume experience collection... (39250 times) [2024-06-28 07:13:00,634][06909] InferenceWorker_p0-w0: stopping experience collection (39250 times) [2024-06-28 07:13:00,634][06909] InferenceWorker_p0-w0: resuming experience collection (39250 times) [2024-06-28 07:13:00,751][06909] Updated weights for policy 0, policy_version 174753 (0.0045) [2024-06-28 07:13:03,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2863284224. Throughput: 0: 44280.3. Samples: 2766192540. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 07:13:03,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:13:04,976][06909] Updated weights for policy 0, policy_version 174763 (0.0026) [2024-06-28 07:13:08,060][06909] Updated weights for policy 0, policy_version 174773 (0.0037) [2024-06-28 07:13:08,850][06674] Fps is (10 sec: 47514.0, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2863513600. Throughput: 0: 44423.6. Samples: 2766462560. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 07:13:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:13:12,443][06909] Updated weights for policy 0, policy_version 174783 (0.0029) [2024-06-28 07:13:13,850][06674] Fps is (10 sec: 42599.0, 60 sec: 44511.4, 300 sec: 43986.9). Total num frames: 2863710208. Throughput: 0: 44646.2. Samples: 2766599620. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 07:13:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:13:15,683][06909] Updated weights for policy 0, policy_version 174793 (0.0030) [2024-06-28 07:13:18,852][06674] Fps is (10 sec: 42589.2, 60 sec: 43962.2, 300 sec: 44042.1). Total num frames: 2863939584. Throughput: 0: 44279.3. Samples: 2766849340. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 07:13:18,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:13:20,030][06909] Updated weights for policy 0, policy_version 174803 (0.0030) [2024-06-28 07:13:23,221][06909] Updated weights for policy 0, policy_version 174813 (0.0033) [2024-06-28 07:13:23,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 2864152576. Throughput: 0: 44150.6. Samples: 2767118900. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 07:13:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:13:27,583][06909] Updated weights for policy 0, policy_version 174823 (0.0027) [2024-06-28 07:13:28,857][06674] Fps is (10 sec: 42575.1, 60 sec: 44231.2, 300 sec: 43874.7). Total num frames: 2864365568. Throughput: 0: 44211.8. Samples: 2767251480. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 07:13:28,858][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:13:30,593][06909] Updated weights for policy 0, policy_version 174833 (0.0035) [2024-06-28 07:13:33,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2864594944. Throughput: 0: 43911.4. Samples: 2767507720. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 07:13:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:13:35,048][06909] Updated weights for policy 0, policy_version 174843 (0.0029) [2024-06-28 07:13:38,057][06909] Updated weights for policy 0, policy_version 174853 (0.0035) [2024-06-28 07:13:38,850][06674] Fps is (10 sec: 47549.5, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2864840704. Throughput: 0: 43863.1. Samples: 2767775400. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 07:13:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:13:42,266][06909] Updated weights for policy 0, policy_version 174863 (0.0031) [2024-06-28 07:13:43,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2865020928. Throughput: 0: 43806.2. Samples: 2767909060. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 07:13:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:13:45,241][06909] Updated weights for policy 0, policy_version 174873 (0.0035) [2024-06-28 07:13:48,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43965.2, 300 sec: 44042.4). Total num frames: 2865250304. Throughput: 0: 43941.9. Samples: 2768169920. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 07:13:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:13:48,868][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000174881_2865250304.pth... [2024-06-28 07:13:48,914][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000174237_2854699008.pth [2024-06-28 07:13:49,833][06909] Updated weights for policy 0, policy_version 174883 (0.0041) [2024-06-28 07:13:52,842][06909] Updated weights for policy 0, policy_version 174893 (0.0032) [2024-06-28 07:13:53,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2865479680. Throughput: 0: 43816.3. Samples: 2768434300. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 07:13:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:13:57,145][06909] Updated weights for policy 0, policy_version 174903 (0.0037) [2024-06-28 07:13:58,851][06674] Fps is (10 sec: 42592.8, 60 sec: 43962.8, 300 sec: 43931.1). Total num frames: 2865676288. Throughput: 0: 43735.1. Samples: 2768567760. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 07:13:58,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:14:00,507][06909] Updated weights for policy 0, policy_version 174913 (0.0032) [2024-06-28 07:14:00,962][06887] Signal inference workers to stop experience collection... (39300 times) [2024-06-28 07:14:00,969][06887] Signal inference workers to resume experience collection... (39300 times) [2024-06-28 07:14:01,020][06909] InferenceWorker_p0-w0: stopping experience collection (39300 times) [2024-06-28 07:14:01,020][06909] InferenceWorker_p0-w0: resuming experience collection (39300 times) [2024-06-28 07:14:03,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2865905664. Throughput: 0: 43883.7. Samples: 2768824020. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 07:14:03,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:14:04,734][06909] Updated weights for policy 0, policy_version 174923 (0.0032) [2024-06-28 07:14:07,785][06909] Updated weights for policy 0, policy_version 174933 (0.0027) [2024-06-28 07:14:08,850][06674] Fps is (10 sec: 45881.3, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 2866135040. Throughput: 0: 43797.8. Samples: 2769089800. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 07:14:08,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:14:12,018][06909] Updated weights for policy 0, policy_version 174943 (0.0029) [2024-06-28 07:14:13,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 2866331648. Throughput: 0: 43920.7. Samples: 2769227580. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 07:14:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:14:15,449][06909] Updated weights for policy 0, policy_version 174953 (0.0040) [2024-06-28 07:14:18,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43692.2, 300 sec: 44042.6). Total num frames: 2866561024. Throughput: 0: 44035.6. Samples: 2769489320. Policy #0 lag: (min: 0.0, avg: 11.4, max: 20.0) [2024-06-28 07:14:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:14:19,631][06909] Updated weights for policy 0, policy_version 174963 (0.0031) [2024-06-28 07:14:22,523][06909] Updated weights for policy 0, policy_version 174973 (0.0026) [2024-06-28 07:14:23,852][06674] Fps is (10 sec: 49142.0, 60 sec: 44508.4, 300 sec: 44153.5). Total num frames: 2866823168. Throughput: 0: 44008.2. Samples: 2769755860. Policy #0 lag: (min: 0.0, avg: 11.4, max: 20.0) [2024-06-28 07:14:23,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:14:26,921][06909] Updated weights for policy 0, policy_version 174983 (0.0036) [2024-06-28 07:14:28,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43969.3, 300 sec: 43986.9). Total num frames: 2867003392. Throughput: 0: 44175.6. Samples: 2769896960. Policy #0 lag: (min: 0.0, avg: 11.4, max: 20.0) [2024-06-28 07:14:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:14:30,285][06909] Updated weights for policy 0, policy_version 174993 (0.0039) [2024-06-28 07:14:33,850][06674] Fps is (10 sec: 40968.2, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2867232768. Throughput: 0: 44168.9. Samples: 2770157520. Policy #0 lag: (min: 0.0, avg: 11.4, max: 20.0) [2024-06-28 07:14:33,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:14:34,263][06909] Updated weights for policy 0, policy_version 175003 (0.0037) [2024-06-28 07:14:37,546][06909] Updated weights for policy 0, policy_version 175013 (0.0024) [2024-06-28 07:14:38,850][06674] Fps is (10 sec: 47513.6, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2867478528. Throughput: 0: 44048.5. Samples: 2770416480. Policy #0 lag: (min: 0.0, avg: 11.4, max: 20.0) [2024-06-28 07:14:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:14:41,810][06909] Updated weights for policy 0, policy_version 175023 (0.0025) [2024-06-28 07:14:43,852][06674] Fps is (10 sec: 44227.7, 60 sec: 44235.3, 300 sec: 44042.1). Total num frames: 2867675136. Throughput: 0: 44198.4. Samples: 2770556720. Policy #0 lag: (min: 0.0, avg: 11.4, max: 20.0) [2024-06-28 07:14:43,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:14:44,737][06909] Updated weights for policy 0, policy_version 175033 (0.0034) [2024-06-28 07:14:48,851][06674] Fps is (10 sec: 40955.2, 60 sec: 43962.9, 300 sec: 44098.1). Total num frames: 2867888128. Throughput: 0: 44364.7. Samples: 2770820480. Policy #0 lag: (min: 0.0, avg: 11.4, max: 20.0) [2024-06-28 07:14:48,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:14:48,947][06909] Updated weights for policy 0, policy_version 175043 (0.0034) [2024-06-28 07:14:52,280][06909] Updated weights for policy 0, policy_version 175053 (0.0031) [2024-06-28 07:14:53,850][06674] Fps is (10 sec: 47523.9, 60 sec: 44509.9, 300 sec: 44209.0). Total num frames: 2868150272. Throughput: 0: 44178.7. Samples: 2771077840. Policy #0 lag: (min: 0.0, avg: 11.4, max: 20.0) [2024-06-28 07:14:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:14:56,622][06909] Updated weights for policy 0, policy_version 175063 (0.0020) [2024-06-28 07:14:58,850][06674] Fps is (10 sec: 45880.6, 60 sec: 44510.9, 300 sec: 44042.4). Total num frames: 2868346880. Throughput: 0: 44390.7. Samples: 2771225160. Policy #0 lag: (min: 0.0, avg: 11.4, max: 20.0) [2024-06-28 07:14:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:14:59,697][06909] Updated weights for policy 0, policy_version 175073 (0.0031) [2024-06-28 07:15:03,764][06909] Updated weights for policy 0, policy_version 175083 (0.0049) [2024-06-28 07:15:03,850][06674] Fps is (10 sec: 40959.5, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2868559872. Throughput: 0: 44430.9. Samples: 2771488720. Policy #0 lag: (min: 0.0, avg: 11.4, max: 20.0) [2024-06-28 07:15:03,854][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 07:15:07,319][06909] Updated weights for policy 0, policy_version 175093 (0.0026) [2024-06-28 07:15:08,850][06674] Fps is (10 sec: 45874.4, 60 sec: 44509.8, 300 sec: 44209.9). Total num frames: 2868805632. Throughput: 0: 44230.3. Samples: 2771746140. Policy #0 lag: (min: 0.0, avg: 11.4, max: 20.0) [2024-06-28 07:15:08,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:15:11,371][06909] Updated weights for policy 0, policy_version 175103 (0.0030) [2024-06-28 07:15:13,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44782.9, 300 sec: 44153.5). Total num frames: 2869018624. Throughput: 0: 44238.1. Samples: 2771887680. Policy #0 lag: (min: 0.0, avg: 11.4, max: 20.0) [2024-06-28 07:15:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:15:14,490][06909] Updated weights for policy 0, policy_version 175113 (0.0027) [2024-06-28 07:15:18,635][06909] Updated weights for policy 0, policy_version 175123 (0.0023) [2024-06-28 07:15:18,852][06674] Fps is (10 sec: 40952.0, 60 sec: 44235.2, 300 sec: 44097.6). Total num frames: 2869215232. Throughput: 0: 44394.9. Samples: 2772155380. Policy #0 lag: (min: 0.0, avg: 11.4, max: 20.0) [2024-06-28 07:15:18,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:15:20,483][06887] Signal inference workers to stop experience collection... (39350 times) [2024-06-28 07:15:20,530][06909] InferenceWorker_p0-w0: stopping experience collection (39350 times) [2024-06-28 07:15:20,540][06887] Signal inference workers to resume experience collection... (39350 times) [2024-06-28 07:15:20,550][06909] InferenceWorker_p0-w0: resuming experience collection (39350 times) [2024-06-28 07:15:21,645][06909] Updated weights for policy 0, policy_version 175133 (0.0038) [2024-06-28 07:15:23,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43965.3, 300 sec: 44153.5). Total num frames: 2869460992. Throughput: 0: 44375.6. Samples: 2772413380. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 07:15:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:15:25,920][06909] Updated weights for policy 0, policy_version 175143 (0.0024) [2024-06-28 07:15:28,850][06674] Fps is (10 sec: 47523.7, 60 sec: 44782.9, 300 sec: 44153.5). Total num frames: 2869690368. Throughput: 0: 44259.9. Samples: 2772548320. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 07:15:28,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:15:29,267][06909] Updated weights for policy 0, policy_version 175153 (0.0039) [2024-06-28 07:15:33,440][06909] Updated weights for policy 0, policy_version 175163 (0.0026) [2024-06-28 07:15:33,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2869870592. Throughput: 0: 44446.9. Samples: 2772820540. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 07:15:33,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:15:36,923][06909] Updated weights for policy 0, policy_version 175173 (0.0031) [2024-06-28 07:15:38,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2870116352. Throughput: 0: 44328.4. Samples: 2773072620. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 07:15:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:15:41,055][06909] Updated weights for policy 0, policy_version 175183 (0.0033) [2024-06-28 07:15:43,852][06674] Fps is (10 sec: 47504.2, 60 sec: 44509.9, 300 sec: 44097.6). Total num frames: 2870345728. Throughput: 0: 44194.0. Samples: 2773213980. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 07:15:43,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:15:44,177][06909] Updated weights for policy 0, policy_version 175193 (0.0038) [2024-06-28 07:15:48,479][06909] Updated weights for policy 0, policy_version 175203 (0.0034) [2024-06-28 07:15:48,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44237.6, 300 sec: 44097.9). Total num frames: 2870542336. Throughput: 0: 44216.5. Samples: 2773478460. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 07:15:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:15:48,951][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000175205_2870558720.pth... [2024-06-28 07:15:49,010][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000174559_2859974656.pth [2024-06-28 07:15:51,364][06909] Updated weights for policy 0, policy_version 175213 (0.0027) [2024-06-28 07:15:53,850][06674] Fps is (10 sec: 42607.1, 60 sec: 43690.6, 300 sec: 44153.5). Total num frames: 2870771712. Throughput: 0: 44132.2. Samples: 2773732080. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 07:15:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:15:56,063][06909] Updated weights for policy 0, policy_version 175223 (0.0044) [2024-06-28 07:15:58,721][06909] Updated weights for policy 0, policy_version 175233 (0.0026) [2024-06-28 07:15:58,850][06674] Fps is (10 sec: 47513.8, 60 sec: 44509.9, 300 sec: 44153.9). Total num frames: 2871017472. Throughput: 0: 44191.7. Samples: 2773876300. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 07:15:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:16:03,345][06909] Updated weights for policy 0, policy_version 175243 (0.0032) [2024-06-28 07:16:03,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2871181312. Throughput: 0: 44145.6. Samples: 2774141840. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 07:16:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:16:06,171][06909] Updated weights for policy 0, policy_version 175253 (0.0040) [2024-06-28 07:16:08,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43690.7, 300 sec: 44209.0). Total num frames: 2871427072. Throughput: 0: 43951.9. Samples: 2774391220. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 07:16:08,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:16:11,125][06909] Updated weights for policy 0, policy_version 175263 (0.0026) [2024-06-28 07:16:13,783][06909] Updated weights for policy 0, policy_version 175273 (0.0045) [2024-06-28 07:16:13,850][06674] Fps is (10 sec: 49152.0, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2871672832. Throughput: 0: 44038.2. Samples: 2774530040. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 07:16:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:16:18,589][06909] Updated weights for policy 0, policy_version 175283 (0.0030) [2024-06-28 07:16:18,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43692.2, 300 sec: 44098.0). Total num frames: 2871836672. Throughput: 0: 43868.1. Samples: 2774794600. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 07:16:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:16:21,491][06909] Updated weights for policy 0, policy_version 175293 (0.0023) [2024-06-28 07:16:23,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.6, 300 sec: 44154.0). Total num frames: 2872082432. Throughput: 0: 43871.5. Samples: 2775046840. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 07:16:23,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 07:16:26,020][06909] Updated weights for policy 0, policy_version 175303 (0.0031) [2024-06-28 07:16:28,764][06909] Updated weights for policy 0, policy_version 175313 (0.0033) [2024-06-28 07:16:28,850][06674] Fps is (10 sec: 49151.5, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2872328192. Throughput: 0: 43869.0. Samples: 2775188000. Policy #0 lag: (min: 0.0, avg: 10.8, max: 26.0) [2024-06-28 07:16:28,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:16:33,624][06909] Updated weights for policy 0, policy_version 175323 (0.0034) [2024-06-28 07:16:33,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 2872492032. Throughput: 0: 43877.2. Samples: 2775452940. Policy #0 lag: (min: 0.0, avg: 10.8, max: 26.0) [2024-06-28 07:16:33,855][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:16:34,437][06887] Signal inference workers to stop experience collection... (39400 times) [2024-06-28 07:16:34,439][06887] Signal inference workers to resume experience collection... (39400 times) [2024-06-28 07:16:34,474][06909] InferenceWorker_p0-w0: stopping experience collection (39400 times) [2024-06-28 07:16:34,474][06909] InferenceWorker_p0-w0: resuming experience collection (39400 times) [2024-06-28 07:16:36,157][06909] Updated weights for policy 0, policy_version 175333 (0.0044) [2024-06-28 07:16:38,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 2872754176. Throughput: 0: 43865.8. Samples: 2775706040. Policy #0 lag: (min: 0.0, avg: 10.8, max: 26.0) [2024-06-28 07:16:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:16:41,008][06909] Updated weights for policy 0, policy_version 175343 (0.0032) [2024-06-28 07:16:43,495][06909] Updated weights for policy 0, policy_version 175353 (0.0022) [2024-06-28 07:16:43,850][06674] Fps is (10 sec: 49152.4, 60 sec: 43965.2, 300 sec: 44153.5). Total num frames: 2872983552. Throughput: 0: 43910.6. Samples: 2775852280. Policy #0 lag: (min: 0.0, avg: 10.8, max: 26.0) [2024-06-28 07:16:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:16:48,229][06909] Updated weights for policy 0, policy_version 175363 (0.0032) [2024-06-28 07:16:48,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2873163776. Throughput: 0: 43864.9. Samples: 2776115760. Policy #0 lag: (min: 0.0, avg: 10.8, max: 26.0) [2024-06-28 07:16:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:16:51,026][06909] Updated weights for policy 0, policy_version 175373 (0.0034) [2024-06-28 07:16:53,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 2873409536. Throughput: 0: 43968.6. Samples: 2776369800. Policy #0 lag: (min: 0.0, avg: 10.8, max: 26.0) [2024-06-28 07:16:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:16:55,778][06909] Updated weights for policy 0, policy_version 175383 (0.0034) [2024-06-28 07:16:58,781][06909] Updated weights for policy 0, policy_version 175393 (0.0027) [2024-06-28 07:16:58,850][06674] Fps is (10 sec: 47513.8, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2873638912. Throughput: 0: 43961.4. Samples: 2776508300. Policy #0 lag: (min: 0.0, avg: 10.8, max: 26.0) [2024-06-28 07:16:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:17:03,202][06909] Updated weights for policy 0, policy_version 175403 (0.0037) [2024-06-28 07:17:03,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 2873819136. Throughput: 0: 43952.4. Samples: 2776772460. Policy #0 lag: (min: 0.0, avg: 10.8, max: 26.0) [2024-06-28 07:17:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:17:05,991][06909] Updated weights for policy 0, policy_version 175413 (0.0031) [2024-06-28 07:17:08,850][06674] Fps is (10 sec: 42597.7, 60 sec: 43963.7, 300 sec: 44153.8). Total num frames: 2874064896. Throughput: 0: 44064.8. Samples: 2777029760. Policy #0 lag: (min: 0.0, avg: 10.8, max: 26.0) [2024-06-28 07:17:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:17:10,814][06909] Updated weights for policy 0, policy_version 175423 (0.0033) [2024-06-28 07:17:13,309][06909] Updated weights for policy 0, policy_version 175433 (0.0028) [2024-06-28 07:17:13,850][06674] Fps is (10 sec: 49152.1, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2874310656. Throughput: 0: 44116.1. Samples: 2777173220. Policy #0 lag: (min: 0.0, avg: 10.8, max: 26.0) [2024-06-28 07:17:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:17:18,038][06909] Updated weights for policy 0, policy_version 175443 (0.0041) [2024-06-28 07:17:18,850][06674] Fps is (10 sec: 42599.3, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2874490880. Throughput: 0: 44092.2. Samples: 2777437080. Policy #0 lag: (min: 0.0, avg: 10.8, max: 26.0) [2024-06-28 07:17:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:17:20,877][06909] Updated weights for policy 0, policy_version 175453 (0.0039) [2024-06-28 07:17:23,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2874720256. Throughput: 0: 44243.6. Samples: 2777697000. Policy #0 lag: (min: 0.0, avg: 10.8, max: 26.0) [2024-06-28 07:17:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:17:25,302][06909] Updated weights for policy 0, policy_version 175463 (0.0030) [2024-06-28 07:17:28,540][06909] Updated weights for policy 0, policy_version 175473 (0.0043) [2024-06-28 07:17:28,850][06674] Fps is (10 sec: 47512.7, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2874966016. Throughput: 0: 43974.1. Samples: 2777831120. Policy #0 lag: (min: 0.0, avg: 10.8, max: 26.0) [2024-06-28 07:17:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:17:32,877][06909] Updated weights for policy 0, policy_version 175483 (0.0030) [2024-06-28 07:17:33,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44236.9, 300 sec: 43931.3). Total num frames: 2875146240. Throughput: 0: 43935.6. Samples: 2778092860. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 07:17:33,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:17:35,774][06909] Updated weights for policy 0, policy_version 175493 (0.0022) [2024-06-28 07:17:38,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2875375616. Throughput: 0: 44072.0. Samples: 2778353040. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 07:17:38,853][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:17:40,183][06909] Updated weights for policy 0, policy_version 175503 (0.0034) [2024-06-28 07:17:43,127][06909] Updated weights for policy 0, policy_version 175513 (0.0029) [2024-06-28 07:17:43,850][06674] Fps is (10 sec: 49152.0, 60 sec: 44236.8, 300 sec: 44153.8). Total num frames: 2875637760. Throughput: 0: 44076.4. Samples: 2778491740. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 07:17:43,857][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:17:47,743][06909] Updated weights for policy 0, policy_version 175523 (0.0044) [2024-06-28 07:17:48,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 2875801600. Throughput: 0: 43980.0. Samples: 2778751560. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 07:17:48,858][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:17:48,872][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000175525_2875801600.pth... [2024-06-28 07:17:48,938][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000174881_2865250304.pth [2024-06-28 07:17:49,926][06887] Signal inference workers to stop experience collection... (39450 times) [2024-06-28 07:17:49,926][06887] Signal inference workers to resume experience collection... (39450 times) [2024-06-28 07:17:49,938][06909] InferenceWorker_p0-w0: stopping experience collection (39450 times) [2024-06-28 07:17:49,938][06909] InferenceWorker_p0-w0: resuming experience collection (39450 times) [2024-06-28 07:17:50,449][06909] Updated weights for policy 0, policy_version 175533 (0.0028) [2024-06-28 07:17:53,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2876047360. Throughput: 0: 44105.4. Samples: 2779014500. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 07:17:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:17:55,397][06909] Updated weights for policy 0, policy_version 175543 (0.0035) [2024-06-28 07:17:58,163][06909] Updated weights for policy 0, policy_version 175553 (0.0032) [2024-06-28 07:17:58,850][06674] Fps is (10 sec: 49151.7, 60 sec: 44236.7, 300 sec: 44098.0). Total num frames: 2876293120. Throughput: 0: 43942.6. Samples: 2779150640. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 07:17:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:18:02,672][06909] Updated weights for policy 0, policy_version 175563 (0.0031) [2024-06-28 07:18:03,851][06674] Fps is (10 sec: 44230.3, 60 sec: 44508.8, 300 sec: 43986.6). Total num frames: 2876489728. Throughput: 0: 43995.8. Samples: 2779416960. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 07:18:03,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:18:05,814][06909] Updated weights for policy 0, policy_version 175573 (0.0043) [2024-06-28 07:18:08,850][06674] Fps is (10 sec: 39321.7, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2876686336. Throughput: 0: 43949.3. Samples: 2779674720. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 07:18:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:18:09,957][06909] Updated weights for policy 0, policy_version 175583 (0.0038) [2024-06-28 07:18:13,323][06909] Updated weights for policy 0, policy_version 175593 (0.0028) [2024-06-28 07:18:13,850][06674] Fps is (10 sec: 45881.6, 60 sec: 43963.6, 300 sec: 44098.3). Total num frames: 2876948480. Throughput: 0: 43903.1. Samples: 2779806760. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 07:18:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:18:17,278][06909] Updated weights for policy 0, policy_version 175603 (0.0031) [2024-06-28 07:18:18,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2877145088. Throughput: 0: 43974.2. Samples: 2780071700. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 07:18:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:18:20,581][06909] Updated weights for policy 0, policy_version 175613 (0.0029) [2024-06-28 07:18:23,850][06674] Fps is (10 sec: 39321.8, 60 sec: 43690.6, 300 sec: 43988.0). Total num frames: 2877341696. Throughput: 0: 44174.6. Samples: 2780340900. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 07:18:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:18:24,689][06909] Updated weights for policy 0, policy_version 175623 (0.0036) [2024-06-28 07:18:27,789][06909] Updated weights for policy 0, policy_version 175633 (0.0040) [2024-06-28 07:18:28,850][06674] Fps is (10 sec: 47512.8, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2877620224. Throughput: 0: 44091.0. Samples: 2780475840. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 07:18:28,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 07:18:32,323][06909] Updated weights for policy 0, policy_version 175643 (0.0043) [2024-06-28 07:18:33,850][06674] Fps is (10 sec: 45875.7, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 2877800448. Throughput: 0: 44244.5. Samples: 2780742560. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 07:18:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:18:35,366][06909] Updated weights for policy 0, policy_version 175653 (0.0033) [2024-06-28 07:18:38,850][06674] Fps is (10 sec: 40960.9, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2878029824. Throughput: 0: 44254.3. Samples: 2781005940. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 07:18:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:18:39,486][06909] Updated weights for policy 0, policy_version 175663 (0.0028) [2024-06-28 07:18:42,948][06909] Updated weights for policy 0, policy_version 175673 (0.0032) [2024-06-28 07:18:43,850][06674] Fps is (10 sec: 47513.2, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2878275584. Throughput: 0: 44097.8. Samples: 2781135040. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 07:18:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:18:46,987][06909] Updated weights for policy 0, policy_version 175683 (0.0034) [2024-06-28 07:18:48,857][06674] Fps is (10 sec: 44205.3, 60 sec: 44504.6, 300 sec: 44041.4). Total num frames: 2878472192. Throughput: 0: 44099.0. Samples: 2781401660. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 07:18:48,857][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:18:50,323][06909] Updated weights for policy 0, policy_version 175693 (0.0041) [2024-06-28 07:18:53,850][06674] Fps is (10 sec: 39321.8, 60 sec: 43690.7, 300 sec: 44042.6). Total num frames: 2878668800. Throughput: 0: 44192.5. Samples: 2781663380. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 07:18:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:18:54,364][06909] Updated weights for policy 0, policy_version 175703 (0.0031) [2024-06-28 07:18:57,540][06909] Updated weights for policy 0, policy_version 175713 (0.0030) [2024-06-28 07:18:58,850][06674] Fps is (10 sec: 45907.3, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2878930944. Throughput: 0: 44238.7. Samples: 2781797500. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 07:18:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:19:01,905][06909] Updated weights for policy 0, policy_version 175723 (0.0040) [2024-06-28 07:19:03,852][06674] Fps is (10 sec: 45865.9, 60 sec: 43963.3, 300 sec: 44042.1). Total num frames: 2879127552. Throughput: 0: 44114.4. Samples: 2782056940. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 07:19:03,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:19:04,679][06909] Updated weights for policy 0, policy_version 175733 (0.0031) [2024-06-28 07:19:08,850][06674] Fps is (10 sec: 40960.3, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2879340544. Throughput: 0: 44072.5. Samples: 2782324160. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 07:19:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:19:09,233][06909] Updated weights for policy 0, policy_version 175743 (0.0022) [2024-06-28 07:19:12,779][06909] Updated weights for policy 0, policy_version 175753 (0.0033) [2024-06-28 07:19:13,850][06674] Fps is (10 sec: 47523.3, 60 sec: 44236.9, 300 sec: 44209.0). Total num frames: 2879602688. Throughput: 0: 44004.6. Samples: 2782456040. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 07:19:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:19:16,611][06909] Updated weights for policy 0, policy_version 175763 (0.0023) [2024-06-28 07:19:18,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.8, 300 sec: 43987.2). Total num frames: 2879799296. Throughput: 0: 43938.7. Samples: 2782719800. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 07:19:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:19:20,011][06909] Updated weights for policy 0, policy_version 175773 (0.0032) [2024-06-28 07:19:22,932][06887] Signal inference workers to stop experience collection... (39500 times) [2024-06-28 07:19:22,933][06887] Signal inference workers to resume experience collection... (39500 times) [2024-06-28 07:19:22,986][06909] InferenceWorker_p0-w0: stopping experience collection (39500 times) [2024-06-28 07:19:22,986][06909] InferenceWorker_p0-w0: resuming experience collection (39500 times) [2024-06-28 07:19:23,850][06674] Fps is (10 sec: 40959.7, 60 sec: 44509.9, 300 sec: 44097.9). Total num frames: 2880012288. Throughput: 0: 44064.3. Samples: 2782988840. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 07:19:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:19:24,117][06909] Updated weights for policy 0, policy_version 175783 (0.0029) [2024-06-28 07:19:27,339][06909] Updated weights for policy 0, policy_version 175793 (0.0043) [2024-06-28 07:19:28,850][06674] Fps is (10 sec: 45874.5, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2880258048. Throughput: 0: 44036.4. Samples: 2783116680. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 07:19:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:19:31,290][06909] Updated weights for policy 0, policy_version 175803 (0.0030) [2024-06-28 07:19:33,856][06674] Fps is (10 sec: 44211.7, 60 sec: 44232.5, 300 sec: 43986.0). Total num frames: 2880454656. Throughput: 0: 44010.2. Samples: 2783382060. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 07:19:33,856][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:19:34,662][06909] Updated weights for policy 0, policy_version 175813 (0.0030) [2024-06-28 07:19:38,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43963.7, 300 sec: 44042.7). Total num frames: 2880667648. Throughput: 0: 44162.2. Samples: 2783650680. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 07:19:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:19:38,904][06909] Updated weights for policy 0, policy_version 175823 (0.0036) [2024-06-28 07:19:41,796][06909] Updated weights for policy 0, policy_version 175833 (0.0042) [2024-06-28 07:19:43,850][06674] Fps is (10 sec: 45901.2, 60 sec: 43963.7, 300 sec: 44153.7). Total num frames: 2880913408. Throughput: 0: 44100.5. Samples: 2783782020. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 07:19:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:19:46,176][06909] Updated weights for policy 0, policy_version 175843 (0.0043) [2024-06-28 07:19:48,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44241.9, 300 sec: 43986.9). Total num frames: 2881126400. Throughput: 0: 44324.6. Samples: 2784051460. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-28 07:19:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:19:48,858][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000175850_2881126400.pth... [2024-06-28 07:19:48,933][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000175205_2870558720.pth [2024-06-28 07:19:49,873][06909] Updated weights for policy 0, policy_version 175853 (0.0039) [2024-06-28 07:19:53,518][06909] Updated weights for policy 0, policy_version 175863 (0.0025) [2024-06-28 07:19:53,852][06674] Fps is (10 sec: 42589.9, 60 sec: 44508.3, 300 sec: 44042.1). Total num frames: 2881339392. Throughput: 0: 44051.8. Samples: 2784306580. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-28 07:19:53,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:19:57,409][06909] Updated weights for policy 0, policy_version 175873 (0.0036) [2024-06-28 07:19:58,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2881568768. Throughput: 0: 43986.1. Samples: 2784435420. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-28 07:19:58,854][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:20:01,096][06909] Updated weights for policy 0, policy_version 175883 (0.0024) [2024-06-28 07:20:03,850][06674] Fps is (10 sec: 44246.1, 60 sec: 44238.3, 300 sec: 43986.9). Total num frames: 2881781760. Throughput: 0: 44082.2. Samples: 2784703500. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-28 07:20:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:20:04,718][06909] Updated weights for policy 0, policy_version 175893 (0.0033) [2024-06-28 07:20:08,374][06909] Updated weights for policy 0, policy_version 175903 (0.0025) [2024-06-28 07:20:08,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 2882011136. Throughput: 0: 44072.9. Samples: 2784972120. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-28 07:20:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:20:11,904][06909] Updated weights for policy 0, policy_version 175913 (0.0019) [2024-06-28 07:20:13,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.7, 300 sec: 44098.3). Total num frames: 2882224128. Throughput: 0: 44115.7. Samples: 2785101880. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-28 07:20:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:20:15,729][06909] Updated weights for policy 0, policy_version 175923 (0.0031) [2024-06-28 07:20:18,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2882453504. Throughput: 0: 44292.3. Samples: 2785374960. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-28 07:20:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:20:19,105][06909] Updated weights for policy 0, policy_version 175933 (0.0031) [2024-06-28 07:20:23,346][06909] Updated weights for policy 0, policy_version 175943 (0.0036) [2024-06-28 07:20:23,850][06674] Fps is (10 sec: 45874.6, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 2882682880. Throughput: 0: 44014.6. Samples: 2785631340. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-28 07:20:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:20:26,889][06909] Updated weights for policy 0, policy_version 175953 (0.0037) [2024-06-28 07:20:28,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.8, 300 sec: 44098.0). Total num frames: 2882879488. Throughput: 0: 43945.0. Samples: 2785759540. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-28 07:20:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:20:29,903][06887] Signal inference workers to stop experience collection... (39550 times) [2024-06-28 07:20:29,942][06909] InferenceWorker_p0-w0: stopping experience collection (39550 times) [2024-06-28 07:20:29,949][06887] Signal inference workers to resume experience collection... (39550 times) [2024-06-28 07:20:29,955][06909] InferenceWorker_p0-w0: resuming experience collection (39550 times) [2024-06-28 07:20:30,933][06909] Updated weights for policy 0, policy_version 175963 (0.0029) [2024-06-28 07:20:33,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43967.9, 300 sec: 43986.9). Total num frames: 2883092480. Throughput: 0: 43834.3. Samples: 2786024000. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-28 07:20:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:20:34,336][06909] Updated weights for policy 0, policy_version 175973 (0.0024) [2024-06-28 07:20:38,342][06909] Updated weights for policy 0, policy_version 175983 (0.0038) [2024-06-28 07:20:38,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44509.8, 300 sec: 44042.7). Total num frames: 2883338240. Throughput: 0: 44097.5. Samples: 2786290880. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-28 07:20:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:20:41,663][06909] Updated weights for policy 0, policy_version 175993 (0.0028) [2024-06-28 07:20:43,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2883534848. Throughput: 0: 44129.0. Samples: 2786421220. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-28 07:20:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:20:45,725][06909] Updated weights for policy 0, policy_version 176003 (0.0029) [2024-06-28 07:20:48,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2883780608. Throughput: 0: 44083.8. Samples: 2786687280. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-28 07:20:48,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:20:48,873][06909] Updated weights for policy 0, policy_version 176013 (0.0030) [2024-06-28 07:20:53,135][06909] Updated weights for policy 0, policy_version 176023 (0.0023) [2024-06-28 07:20:53,850][06674] Fps is (10 sec: 45874.4, 60 sec: 44238.2, 300 sec: 43986.8). Total num frames: 2883993600. Throughput: 0: 44018.6. Samples: 2786952960. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-28 07:20:53,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:20:56,493][06909] Updated weights for policy 0, policy_version 176033 (0.0043) [2024-06-28 07:20:58,850][06674] Fps is (10 sec: 40960.7, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 2884190208. Throughput: 0: 43964.4. Samples: 2787080280. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-28 07:20:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:21:00,539][06909] Updated weights for policy 0, policy_version 176043 (0.0037) [2024-06-28 07:21:03,852][06674] Fps is (10 sec: 44228.3, 60 sec: 44235.2, 300 sec: 44097.7). Total num frames: 2884435968. Throughput: 0: 43762.9. Samples: 2787344380. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-28 07:21:03,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:21:04,293][06909] Updated weights for policy 0, policy_version 176053 (0.0030) [2024-06-28 07:21:08,196][06909] Updated weights for policy 0, policy_version 176063 (0.0040) [2024-06-28 07:21:08,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 2884632576. Throughput: 0: 43995.2. Samples: 2787611120. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-28 07:21:08,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 07:21:11,477][06909] Updated weights for policy 0, policy_version 176073 (0.0023) [2024-06-28 07:21:13,850][06674] Fps is (10 sec: 40967.9, 60 sec: 43690.5, 300 sec: 44097.9). Total num frames: 2884845568. Throughput: 0: 44040.2. Samples: 2787741360. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-28 07:21:13,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:21:15,614][06909] Updated weights for policy 0, policy_version 176083 (0.0030) [2024-06-28 07:21:18,669][06909] Updated weights for policy 0, policy_version 176093 (0.0028) [2024-06-28 07:21:18,850][06674] Fps is (10 sec: 47513.1, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 2885107712. Throughput: 0: 44107.9. Samples: 2788008860. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-28 07:21:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:21:22,937][06909] Updated weights for policy 0, policy_version 176103 (0.0023) [2024-06-28 07:21:23,850][06674] Fps is (10 sec: 45876.0, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2885304320. Throughput: 0: 44087.2. Samples: 2788274800. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-28 07:21:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:21:25,867][06909] Updated weights for policy 0, policy_version 176113 (0.0029) [2024-06-28 07:21:28,856][06674] Fps is (10 sec: 39297.8, 60 sec: 43686.1, 300 sec: 44097.1). Total num frames: 2885500928. Throughput: 0: 43970.0. Samples: 2788400140. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-28 07:21:28,856][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 07:21:30,303][06909] Updated weights for policy 0, policy_version 176123 (0.0030) [2024-06-28 07:21:33,650][06909] Updated weights for policy 0, policy_version 176133 (0.0030) [2024-06-28 07:21:33,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44509.8, 300 sec: 44097.9). Total num frames: 2885763072. Throughput: 0: 44036.5. Samples: 2788668920. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-28 07:21:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:21:36,828][06887] Signal inference workers to stop experience collection... (39600 times) [2024-06-28 07:21:36,879][06909] InferenceWorker_p0-w0: stopping experience collection (39600 times) [2024-06-28 07:21:36,885][06887] Signal inference workers to resume experience collection... (39600 times) [2024-06-28 07:21:36,899][06909] InferenceWorker_p0-w0: resuming experience collection (39600 times) [2024-06-28 07:21:38,029][06909] Updated weights for policy 0, policy_version 176143 (0.0035) [2024-06-28 07:21:38,850][06674] Fps is (10 sec: 45903.5, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2885959680. Throughput: 0: 44053.9. Samples: 2788935380. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-28 07:21:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:21:41,124][06909] Updated weights for policy 0, policy_version 176153 (0.0029) [2024-06-28 07:21:43,850][06674] Fps is (10 sec: 39322.1, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2886156288. Throughput: 0: 44009.4. Samples: 2789060700. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-28 07:21:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:21:45,210][06909] Updated weights for policy 0, policy_version 176163 (0.0043) [2024-06-28 07:21:48,569][06909] Updated weights for policy 0, policy_version 176173 (0.0028) [2024-06-28 07:21:48,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 2886434816. Throughput: 0: 44055.8. Samples: 2789326800. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-28 07:21:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:21:48,938][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000176175_2886451200.pth... [2024-06-28 07:21:48,995][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000175525_2875801600.pth [2024-06-28 07:21:52,599][06909] Updated weights for policy 0, policy_version 176183 (0.0036) [2024-06-28 07:21:53,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43690.8, 300 sec: 43986.9). Total num frames: 2886615040. Throughput: 0: 44196.4. Samples: 2789599960. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-28 07:21:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:21:55,684][06909] Updated weights for policy 0, policy_version 176193 (0.0028) [2024-06-28 07:21:58,850][06674] Fps is (10 sec: 37682.9, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2886811648. Throughput: 0: 43977.0. Samples: 2789720320. Policy #0 lag: (min: 1.0, avg: 11.9, max: 23.0) [2024-06-28 07:21:58,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:21:59,974][06909] Updated weights for policy 0, policy_version 176203 (0.0043) [2024-06-28 07:22:02,936][06909] Updated weights for policy 0, policy_version 176213 (0.0033) [2024-06-28 07:22:03,850][06674] Fps is (10 sec: 47513.7, 60 sec: 44238.3, 300 sec: 44153.5). Total num frames: 2887090176. Throughput: 0: 44006.3. Samples: 2789989140. Policy #0 lag: (min: 1.0, avg: 11.9, max: 23.0) [2024-06-28 07:22:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:22:07,306][06909] Updated weights for policy 0, policy_version 176223 (0.0035) [2024-06-28 07:22:08,850][06674] Fps is (10 sec: 45874.5, 60 sec: 43963.6, 300 sec: 43931.3). Total num frames: 2887270400. Throughput: 0: 44140.2. Samples: 2790261120. Policy #0 lag: (min: 1.0, avg: 11.9, max: 23.0) [2024-06-28 07:22:08,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:22:10,553][06909] Updated weights for policy 0, policy_version 176233 (0.0030) [2024-06-28 07:22:13,850][06674] Fps is (10 sec: 39321.3, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2887483392. Throughput: 0: 44214.8. Samples: 2790389540. Policy #0 lag: (min: 1.0, avg: 11.9, max: 23.0) [2024-06-28 07:22:13,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:22:14,676][06909] Updated weights for policy 0, policy_version 176243 (0.0034) [2024-06-28 07:22:18,074][06909] Updated weights for policy 0, policy_version 176253 (0.0025) [2024-06-28 07:22:18,850][06674] Fps is (10 sec: 49152.9, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 2887761920. Throughput: 0: 44110.2. Samples: 2790653880. Policy #0 lag: (min: 1.0, avg: 11.9, max: 23.0) [2024-06-28 07:22:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:22:22,091][06909] Updated weights for policy 0, policy_version 176263 (0.0040) [2024-06-28 07:22:23,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2887942144. Throughput: 0: 44246.2. Samples: 2790926460. Policy #0 lag: (min: 1.0, avg: 11.9, max: 23.0) [2024-06-28 07:22:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:22:25,405][06909] Updated weights for policy 0, policy_version 176273 (0.0038) [2024-06-28 07:22:28,850][06674] Fps is (10 sec: 40960.0, 60 sec: 44514.4, 300 sec: 44153.5). Total num frames: 2888171520. Throughput: 0: 44279.4. Samples: 2791053280. Policy #0 lag: (min: 1.0, avg: 11.9, max: 23.0) [2024-06-28 07:22:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:22:29,635][06909] Updated weights for policy 0, policy_version 176283 (0.0025) [2024-06-28 07:22:31,889][06887] Signal inference workers to stop experience collection... (39650 times) [2024-06-28 07:22:31,912][06909] InferenceWorker_p0-w0: stopping experience collection (39650 times) [2024-06-28 07:22:31,948][06887] Signal inference workers to resume experience collection... (39650 times) [2024-06-28 07:22:31,948][06909] InferenceWorker_p0-w0: resuming experience collection (39650 times) [2024-06-28 07:22:32,681][06909] Updated weights for policy 0, policy_version 176293 (0.0025) [2024-06-28 07:22:33,850][06674] Fps is (10 sec: 49152.0, 60 sec: 44509.9, 300 sec: 44264.6). Total num frames: 2888433664. Throughput: 0: 44272.8. Samples: 2791319080. Policy #0 lag: (min: 1.0, avg: 11.9, max: 23.0) [2024-06-28 07:22:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:22:36,896][06909] Updated weights for policy 0, policy_version 176303 (0.0037) [2024-06-28 07:22:38,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 2888597504. Throughput: 0: 44235.2. Samples: 2791590540. Policy #0 lag: (min: 1.0, avg: 11.9, max: 23.0) [2024-06-28 07:22:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:22:40,230][06909] Updated weights for policy 0, policy_version 176313 (0.0034) [2024-06-28 07:22:43,850][06674] Fps is (10 sec: 40959.8, 60 sec: 44782.8, 300 sec: 44209.0). Total num frames: 2888843264. Throughput: 0: 44175.5. Samples: 2791708220. Policy #0 lag: (min: 1.0, avg: 11.9, max: 23.0) [2024-06-28 07:22:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:22:44,170][06909] Updated weights for policy 0, policy_version 176323 (0.0041) [2024-06-28 07:22:47,574][06909] Updated weights for policy 0, policy_version 176333 (0.0030) [2024-06-28 07:22:48,850][06674] Fps is (10 sec: 49152.0, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 2889089024. Throughput: 0: 44216.0. Samples: 2791978860. Policy #0 lag: (min: 1.0, avg: 11.9, max: 23.0) [2024-06-28 07:22:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:22:51,774][06909] Updated weights for policy 0, policy_version 176343 (0.0033) [2024-06-28 07:22:53,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 2889269248. Throughput: 0: 44226.8. Samples: 2792251320. Policy #0 lag: (min: 1.0, avg: 11.9, max: 23.0) [2024-06-28 07:22:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:22:55,211][06909] Updated weights for policy 0, policy_version 176353 (0.0036) [2024-06-28 07:22:58,850][06674] Fps is (10 sec: 40960.0, 60 sec: 44783.0, 300 sec: 44098.2). Total num frames: 2889498624. Throughput: 0: 44115.7. Samples: 2792374740. Policy #0 lag: (min: 1.0, avg: 11.9, max: 23.0) [2024-06-28 07:22:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:22:59,298][06909] Updated weights for policy 0, policy_version 176363 (0.0026) [2024-06-28 07:23:02,555][06909] Updated weights for policy 0, policy_version 176373 (0.0032) [2024-06-28 07:23:03,850][06674] Fps is (10 sec: 47514.3, 60 sec: 44236.8, 300 sec: 44264.6). Total num frames: 2889744384. Throughput: 0: 44205.9. Samples: 2792643140. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 07:23:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:23:06,670][06909] Updated weights for policy 0, policy_version 176383 (0.0027) [2024-06-28 07:23:08,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44237.0, 300 sec: 43986.9). Total num frames: 2889924608. Throughput: 0: 44142.7. Samples: 2792912880. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 07:23:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:23:10,107][06909] Updated weights for policy 0, policy_version 176393 (0.0041) [2024-06-28 07:23:13,850][06674] Fps is (10 sec: 40959.7, 60 sec: 44509.9, 300 sec: 44097.9). Total num frames: 2890153984. Throughput: 0: 43951.5. Samples: 2793031100. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 07:23:13,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:23:14,067][06909] Updated weights for policy 0, policy_version 176403 (0.0029) [2024-06-28 07:23:17,861][06909] Updated weights for policy 0, policy_version 176413 (0.0027) [2024-06-28 07:23:18,850][06674] Fps is (10 sec: 47513.7, 60 sec: 43963.8, 300 sec: 44264.6). Total num frames: 2890399744. Throughput: 0: 43844.5. Samples: 2793292080. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 07:23:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:23:21,761][06909] Updated weights for policy 0, policy_version 176423 (0.0044) [2024-06-28 07:23:23,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 2890579968. Throughput: 0: 43793.2. Samples: 2793561240. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 07:23:23,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 07:23:25,251][06909] Updated weights for policy 0, policy_version 176433 (0.0032) [2024-06-28 07:23:28,850][06674] Fps is (10 sec: 42597.9, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2890825728. Throughput: 0: 43880.0. Samples: 2793682820. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 07:23:28,853][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 07:23:29,048][06909] Updated weights for policy 0, policy_version 176443 (0.0036) [2024-06-28 07:23:32,654][06909] Updated weights for policy 0, policy_version 176453 (0.0031) [2024-06-28 07:23:33,850][06674] Fps is (10 sec: 47513.8, 60 sec: 43690.6, 300 sec: 44153.5). Total num frames: 2891055104. Throughput: 0: 43795.5. Samples: 2793949660. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 07:23:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:23:36,709][06909] Updated weights for policy 0, policy_version 176463 (0.0033) [2024-06-28 07:23:38,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 2891235328. Throughput: 0: 43922.3. Samples: 2794227820. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 07:23:38,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:23:39,417][06887] Signal inference workers to stop experience collection... (39700 times) [2024-06-28 07:23:39,423][06887] Signal inference workers to resume experience collection... (39700 times) [2024-06-28 07:23:39,445][06909] InferenceWorker_p0-w0: stopping experience collection (39700 times) [2024-06-28 07:23:39,445][06909] InferenceWorker_p0-w0: resuming experience collection (39700 times) [2024-06-28 07:23:40,155][06909] Updated weights for policy 0, policy_version 176473 (0.0024) [2024-06-28 07:23:43,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43963.8, 300 sec: 44099.0). Total num frames: 2891481088. Throughput: 0: 43783.1. Samples: 2794344980. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 07:23:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:23:43,890][06909] Updated weights for policy 0, policy_version 176483 (0.0037) [2024-06-28 07:23:47,318][06909] Updated weights for policy 0, policy_version 176493 (0.0029) [2024-06-28 07:23:48,850][06674] Fps is (10 sec: 47513.1, 60 sec: 43690.6, 300 sec: 44209.0). Total num frames: 2891710464. Throughput: 0: 43756.7. Samples: 2794612200. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 07:23:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 07:23:48,867][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000176496_2891710464.pth... [2024-06-28 07:23:48,921][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000175850_2881126400.pth [2024-06-28 07:23:51,243][06909] Updated weights for policy 0, policy_version 176503 (0.0035) [2024-06-28 07:23:53,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 2891890688. Throughput: 0: 43576.8. Samples: 2794873840. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 07:23:53,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:23:55,472][06909] Updated weights for policy 0, policy_version 176513 (0.0032) [2024-06-28 07:23:58,851][06674] Fps is (10 sec: 42595.0, 60 sec: 43963.0, 300 sec: 44098.1). Total num frames: 2892136448. Throughput: 0: 43825.4. Samples: 2795003280. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 07:23:58,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:23:58,996][06909] Updated weights for policy 0, policy_version 176523 (0.0027) [2024-06-28 07:24:02,777][06909] Updated weights for policy 0, policy_version 176533 (0.0037) [2024-06-28 07:24:03,850][06674] Fps is (10 sec: 47513.4, 60 sec: 43690.6, 300 sec: 44153.5). Total num frames: 2892365824. Throughput: 0: 43889.2. Samples: 2795267100. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 07:24:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:24:06,290][06909] Updated weights for policy 0, policy_version 176543 (0.0046) [2024-06-28 07:24:08,850][06674] Fps is (10 sec: 42602.4, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 2892562432. Throughput: 0: 43926.3. Samples: 2795537920. Policy #0 lag: (min: 1.0, avg: 11.2, max: 23.0) [2024-06-28 07:24:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:24:10,027][06909] Updated weights for policy 0, policy_version 176553 (0.0032) [2024-06-28 07:24:13,834][06909] Updated weights for policy 0, policy_version 176563 (0.0024) [2024-06-28 07:24:13,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2892808192. Throughput: 0: 44100.0. Samples: 2795667320. Policy #0 lag: (min: 1.0, avg: 11.2, max: 23.0) [2024-06-28 07:24:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:24:17,406][06909] Updated weights for policy 0, policy_version 176573 (0.0026) [2024-06-28 07:24:18,851][06674] Fps is (10 sec: 45869.6, 60 sec: 43689.7, 300 sec: 44097.8). Total num frames: 2893021184. Throughput: 0: 43998.4. Samples: 2795929640. Policy #0 lag: (min: 1.0, avg: 11.2, max: 23.0) [2024-06-28 07:24:18,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:24:21,010][06909] Updated weights for policy 0, policy_version 176583 (0.0034) [2024-06-28 07:24:23,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 2893217792. Throughput: 0: 43778.2. Samples: 2796197840. Policy #0 lag: (min: 1.0, avg: 11.2, max: 23.0) [2024-06-28 07:24:23,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:24:25,385][06909] Updated weights for policy 0, policy_version 176593 (0.0037) [2024-06-28 07:24:28,655][06909] Updated weights for policy 0, policy_version 176603 (0.0035) [2024-06-28 07:24:28,850][06674] Fps is (10 sec: 44242.3, 60 sec: 43963.8, 300 sec: 44098.8). Total num frames: 2893463552. Throughput: 0: 43844.4. Samples: 2796317980. Policy #0 lag: (min: 1.0, avg: 11.2, max: 23.0) [2024-06-28 07:24:28,850][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 07:24:32,890][06909] Updated weights for policy 0, policy_version 176613 (0.0032) [2024-06-28 07:24:33,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43690.7, 300 sec: 44097.9). Total num frames: 2893676544. Throughput: 0: 44021.0. Samples: 2796593140. Policy #0 lag: (min: 1.0, avg: 11.2, max: 23.0) [2024-06-28 07:24:33,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:24:36,320][06909] Updated weights for policy 0, policy_version 176623 (0.0029) [2024-06-28 07:24:38,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 2893873152. Throughput: 0: 44000.5. Samples: 2796853860. Policy #0 lag: (min: 1.0, avg: 11.2, max: 23.0) [2024-06-28 07:24:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:24:40,226][06909] Updated weights for policy 0, policy_version 176633 (0.0024) [2024-06-28 07:24:43,489][06909] Updated weights for policy 0, policy_version 176643 (0.0031) [2024-06-28 07:24:43,852][06674] Fps is (10 sec: 44227.9, 60 sec: 43962.2, 300 sec: 44042.1). Total num frames: 2894118912. Throughput: 0: 44103.8. Samples: 2796988000. Policy #0 lag: (min: 1.0, avg: 11.2, max: 23.0) [2024-06-28 07:24:43,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:24:47,372][06909] Updated weights for policy 0, policy_version 176653 (0.0031) [2024-06-28 07:24:48,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43690.8, 300 sec: 44042.7). Total num frames: 2894331904. Throughput: 0: 44175.2. Samples: 2797254980. Policy #0 lag: (min: 1.0, avg: 11.2, max: 23.0) [2024-06-28 07:24:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:24:50,676][06909] Updated weights for policy 0, policy_version 176663 (0.0035) [2024-06-28 07:24:53,850][06674] Fps is (10 sec: 44245.7, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 2894561280. Throughput: 0: 44039.9. Samples: 2797519720. Policy #0 lag: (min: 1.0, avg: 11.2, max: 23.0) [2024-06-28 07:24:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:24:54,721][06909] Updated weights for policy 0, policy_version 176673 (0.0036) [2024-06-28 07:24:58,389][06909] Updated weights for policy 0, policy_version 176683 (0.0031) [2024-06-28 07:24:58,852][06674] Fps is (10 sec: 44227.7, 60 sec: 43962.9, 300 sec: 44042.1). Total num frames: 2894774272. Throughput: 0: 44156.7. Samples: 2797654460. Policy #0 lag: (min: 1.0, avg: 11.2, max: 23.0) [2024-06-28 07:24:58,853][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 07:25:02,250][06887] Signal inference workers to stop experience collection... (39750 times) [2024-06-28 07:25:02,250][06887] Signal inference workers to resume experience collection... (39750 times) [2024-06-28 07:25:02,304][06909] InferenceWorker_p0-w0: stopping experience collection (39750 times) [2024-06-28 07:25:02,304][06909] InferenceWorker_p0-w0: resuming experience collection (39750 times) [2024-06-28 07:25:02,385][06909] Updated weights for policy 0, policy_version 176693 (0.0029) [2024-06-28 07:25:03,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43690.8, 300 sec: 43986.9). Total num frames: 2894987264. Throughput: 0: 43998.6. Samples: 2797909520. Policy #0 lag: (min: 1.0, avg: 11.2, max: 23.0) [2024-06-28 07:25:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:25:05,609][06909] Updated weights for policy 0, policy_version 176703 (0.0032) [2024-06-28 07:25:08,856][06674] Fps is (10 sec: 44220.4, 60 sec: 44232.5, 300 sec: 44041.5). Total num frames: 2895216640. Throughput: 0: 43951.8. Samples: 2798175920. Policy #0 lag: (min: 1.0, avg: 11.2, max: 23.0) [2024-06-28 07:25:08,856][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:25:10,108][06909] Updated weights for policy 0, policy_version 176713 (0.0040) [2024-06-28 07:25:13,301][06909] Updated weights for policy 0, policy_version 176723 (0.0046) [2024-06-28 07:25:13,850][06674] Fps is (10 sec: 44235.8, 60 sec: 43690.6, 300 sec: 43986.8). Total num frames: 2895429632. Throughput: 0: 44203.8. Samples: 2798307160. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 07:25:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:25:17,311][06909] Updated weights for policy 0, policy_version 176733 (0.0039) [2024-06-28 07:25:18,850][06674] Fps is (10 sec: 42623.2, 60 sec: 43691.6, 300 sec: 43931.3). Total num frames: 2895642624. Throughput: 0: 43941.4. Samples: 2798570500. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 07:25:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:25:20,717][06909] Updated weights for policy 0, policy_version 176743 (0.0033) [2024-06-28 07:25:23,850][06674] Fps is (10 sec: 45876.0, 60 sec: 44509.9, 300 sec: 44097.9). Total num frames: 2895888384. Throughput: 0: 44082.2. Samples: 2798837560. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 07:25:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:25:24,521][06909] Updated weights for policy 0, policy_version 176753 (0.0031) [2024-06-28 07:25:28,532][06909] Updated weights for policy 0, policy_version 176763 (0.0021) [2024-06-28 07:25:28,850][06674] Fps is (10 sec: 44235.8, 60 sec: 43690.5, 300 sec: 44042.4). Total num frames: 2896084992. Throughput: 0: 44181.8. Samples: 2798976100. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 07:25:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:25:32,207][06909] Updated weights for policy 0, policy_version 176773 (0.0032) [2024-06-28 07:25:33,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 2896297984. Throughput: 0: 43938.2. Samples: 2799232200. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 07:25:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:25:35,794][06909] Updated weights for policy 0, policy_version 176783 (0.0039) [2024-06-28 07:25:38,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44509.8, 300 sec: 44097.9). Total num frames: 2896543744. Throughput: 0: 43891.1. Samples: 2799494820. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 07:25:38,853][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:25:39,922][06909] Updated weights for policy 0, policy_version 176793 (0.0030) [2024-06-28 07:25:43,122][06909] Updated weights for policy 0, policy_version 176803 (0.0031) [2024-06-28 07:25:43,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43692.0, 300 sec: 43931.3). Total num frames: 2896740352. Throughput: 0: 43836.5. Samples: 2799627020. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 07:25:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:25:47,201][06909] Updated weights for policy 0, policy_version 176813 (0.0035) [2024-06-28 07:25:48,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 2896953344. Throughput: 0: 43921.2. Samples: 2799885980. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 07:25:48,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:25:48,859][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000176816_2896953344.pth... [2024-06-28 07:25:48,914][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000176175_2886451200.pth [2024-06-28 07:25:50,890][06909] Updated weights for policy 0, policy_version 176823 (0.0041) [2024-06-28 07:25:53,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2897182720. Throughput: 0: 43923.4. Samples: 2800152220. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 07:25:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:25:54,465][06909] Updated weights for policy 0, policy_version 176833 (0.0025) [2024-06-28 07:25:57,962][06909] Updated weights for policy 0, policy_version 176843 (0.0040) [2024-06-28 07:25:58,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43692.2, 300 sec: 43931.6). Total num frames: 2897395712. Throughput: 0: 44059.3. Samples: 2800289820. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 07:25:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:26:01,684][06909] Updated weights for policy 0, policy_version 176853 (0.0036) [2024-06-28 07:26:03,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2897625088. Throughput: 0: 44031.5. Samples: 2800551920. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 07:26:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:26:05,482][06909] Updated weights for policy 0, policy_version 176863 (0.0039) [2024-06-28 07:26:08,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43968.0, 300 sec: 44098.0). Total num frames: 2897854464. Throughput: 0: 43901.4. Samples: 2800813120. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 07:26:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:26:09,374][06909] Updated weights for policy 0, policy_version 176873 (0.0038) [2024-06-28 07:26:13,145][06909] Updated weights for policy 0, policy_version 176883 (0.0036) [2024-06-28 07:26:13,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 2898051072. Throughput: 0: 43775.2. Samples: 2800945980. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 07:26:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:26:17,178][06909] Updated weights for policy 0, policy_version 176893 (0.0037) [2024-06-28 07:26:18,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 2898280448. Throughput: 0: 43881.8. Samples: 2801206880. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 07:26:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:26:19,060][06887] Signal inference workers to stop experience collection... (39800 times) [2024-06-28 07:26:19,060][06887] Signal inference workers to resume experience collection... (39800 times) [2024-06-28 07:26:19,082][06909] InferenceWorker_p0-w0: stopping experience collection (39800 times) [2024-06-28 07:26:19,082][06909] InferenceWorker_p0-w0: resuming experience collection (39800 times) [2024-06-28 07:26:20,673][06909] Updated weights for policy 0, policy_version 176903 (0.0025) [2024-06-28 07:26:23,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43417.5, 300 sec: 44043.3). Total num frames: 2898493440. Throughput: 0: 43933.8. Samples: 2801471840. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 07:26:23,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:26:24,423][06909] Updated weights for policy 0, policy_version 176913 (0.0033) [2024-06-28 07:26:28,199][06909] Updated weights for policy 0, policy_version 176923 (0.0041) [2024-06-28 07:26:28,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.8, 300 sec: 43875.8). Total num frames: 2898706432. Throughput: 0: 44033.0. Samples: 2801608500. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 07:26:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:26:31,639][06909] Updated weights for policy 0, policy_version 176933 (0.0034) [2024-06-28 07:26:33,850][06674] Fps is (10 sec: 45873.9, 60 sec: 44236.6, 300 sec: 44042.4). Total num frames: 2898952192. Throughput: 0: 44122.0. Samples: 2801871480. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 07:26:33,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:26:35,586][06909] Updated weights for policy 0, policy_version 176943 (0.0032) [2024-06-28 07:26:38,850][06674] Fps is (10 sec: 47513.3, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2899181568. Throughput: 0: 44009.7. Samples: 2802132660. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 07:26:38,853][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:26:39,248][06909] Updated weights for policy 0, policy_version 176953 (0.0046) [2024-06-28 07:26:43,099][06909] Updated weights for policy 0, policy_version 176963 (0.0036) [2024-06-28 07:26:43,850][06674] Fps is (10 sec: 42599.9, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 2899378176. Throughput: 0: 43891.5. Samples: 2802264940. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 07:26:43,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 07:26:46,591][06909] Updated weights for policy 0, policy_version 176973 (0.0028) [2024-06-28 07:26:48,850][06674] Fps is (10 sec: 42598.9, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 2899607552. Throughput: 0: 43895.6. Samples: 2802527220. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 07:26:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:26:50,516][06909] Updated weights for policy 0, policy_version 176983 (0.0032) [2024-06-28 07:26:53,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2899820544. Throughput: 0: 43907.9. Samples: 2802788980. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 07:26:53,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:26:54,217][06909] Updated weights for policy 0, policy_version 176993 (0.0045) [2024-06-28 07:26:58,787][06909] Updated weights for policy 0, policy_version 177003 (0.0039) [2024-06-28 07:26:58,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43690.6, 300 sec: 43820.2). Total num frames: 2900017152. Throughput: 0: 43838.6. Samples: 2802918720. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 07:26:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:27:01,595][06909] Updated weights for policy 0, policy_version 177013 (0.0031) [2024-06-28 07:27:03,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.7, 300 sec: 44098.0). Total num frames: 2900279296. Throughput: 0: 43967.1. Samples: 2803185400. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 07:27:03,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-28 07:27:06,066][06909] Updated weights for policy 0, policy_version 177023 (0.0042) [2024-06-28 07:27:08,746][06909] Updated weights for policy 0, policy_version 177033 (0.0038) [2024-06-28 07:27:08,850][06674] Fps is (10 sec: 49151.3, 60 sec: 44236.6, 300 sec: 44153.5). Total num frames: 2900508672. Throughput: 0: 44134.1. Samples: 2803457880. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 07:27:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:27:13,237][06909] Updated weights for policy 0, policy_version 177043 (0.0032) [2024-06-28 07:27:13,850][06674] Fps is (10 sec: 42598.8, 60 sec: 44236.9, 300 sec: 43875.8). Total num frames: 2900705280. Throughput: 0: 44062.2. Samples: 2803591300. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 07:27:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:27:16,065][06909] Updated weights for policy 0, policy_version 177053 (0.0037) [2024-06-28 07:27:18,851][06674] Fps is (10 sec: 44232.8, 60 sec: 44509.1, 300 sec: 44097.8). Total num frames: 2900951040. Throughput: 0: 44220.1. Samples: 2803861420. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 07:27:18,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:27:20,687][06909] Updated weights for policy 0, policy_version 177063 (0.0021) [2024-06-28 07:27:23,595][06909] Updated weights for policy 0, policy_version 177073 (0.0030) [2024-06-28 07:27:23,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 2901164032. Throughput: 0: 44214.2. Samples: 2804122300. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 07:27:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:27:25,653][06887] Signal inference workers to stop experience collection... (39850 times) [2024-06-28 07:27:25,702][06909] InferenceWorker_p0-w0: stopping experience collection (39850 times) [2024-06-28 07:27:25,702][06887] Signal inference workers to resume experience collection... (39850 times) [2024-06-28 07:27:25,714][06909] InferenceWorker_p0-w0: resuming experience collection (39850 times) [2024-06-28 07:27:27,863][06909] Updated weights for policy 0, policy_version 177083 (0.0025) [2024-06-28 07:27:28,850][06674] Fps is (10 sec: 42603.0, 60 sec: 44509.8, 300 sec: 43875.8). Total num frames: 2901377024. Throughput: 0: 44155.5. Samples: 2804251940. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 07:27:28,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:27:30,893][06909] Updated weights for policy 0, policy_version 177093 (0.0022) [2024-06-28 07:27:33,850][06674] Fps is (10 sec: 44237.6, 60 sec: 44237.1, 300 sec: 44098.0). Total num frames: 2901606400. Throughput: 0: 44215.6. Samples: 2804516920. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 07:27:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:27:35,286][06909] Updated weights for policy 0, policy_version 177103 (0.0034) [2024-06-28 07:27:38,534][06909] Updated weights for policy 0, policy_version 177113 (0.0038) [2024-06-28 07:27:38,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2901819392. Throughput: 0: 44330.3. Samples: 2804783840. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 07:27:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:27:42,745][06909] Updated weights for policy 0, policy_version 177123 (0.0030) [2024-06-28 07:27:43,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44509.9, 300 sec: 43931.3). Total num frames: 2902048768. Throughput: 0: 44251.6. Samples: 2804910040. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 07:27:43,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 07:27:45,819][06909] Updated weights for policy 0, policy_version 177133 (0.0028) [2024-06-28 07:27:48,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 2902278144. Throughput: 0: 44580.1. Samples: 2805191500. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 07:27:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:27:48,981][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000177142_2902294528.pth... [2024-06-28 07:27:49,031][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000176496_2891710464.pth [2024-06-28 07:27:49,939][06909] Updated weights for policy 0, policy_version 177143 (0.0037) [2024-06-28 07:27:53,079][06909] Updated weights for policy 0, policy_version 177153 (0.0028) [2024-06-28 07:27:53,850][06674] Fps is (10 sec: 42597.9, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2902474752. Throughput: 0: 44312.1. Samples: 2805451920. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 07:27:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:27:57,426][06909] Updated weights for policy 0, policy_version 177163 (0.0030) [2024-06-28 07:27:58,850][06674] Fps is (10 sec: 44236.7, 60 sec: 45056.1, 300 sec: 43986.9). Total num frames: 2902720512. Throughput: 0: 44151.5. Samples: 2805578120. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 07:27:58,857][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 07:28:00,855][06909] Updated weights for policy 0, policy_version 177173 (0.0031) [2024-06-28 07:28:03,856][06674] Fps is (10 sec: 47485.3, 60 sec: 44505.4, 300 sec: 44152.6). Total num frames: 2902949888. Throughput: 0: 44130.7. Samples: 2805847520. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 07:28:03,856][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:28:04,674][06909] Updated weights for policy 0, policy_version 177183 (0.0034) [2024-06-28 07:28:08,222][06909] Updated weights for policy 0, policy_version 177193 (0.0051) [2024-06-28 07:28:08,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.8, 300 sec: 43986.9). Total num frames: 2903130112. Throughput: 0: 44019.6. Samples: 2806103180. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 07:28:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:28:12,330][06909] Updated weights for policy 0, policy_version 177203 (0.0033) [2024-06-28 07:28:13,850][06674] Fps is (10 sec: 40985.3, 60 sec: 44236.9, 300 sec: 43931.3). Total num frames: 2903359488. Throughput: 0: 44029.5. Samples: 2806233260. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 07:28:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:28:15,745][06909] Updated weights for policy 0, policy_version 177213 (0.0038) [2024-06-28 07:28:18,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43964.6, 300 sec: 44098.0). Total num frames: 2903588864. Throughput: 0: 44200.4. Samples: 2806505940. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 07:28:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:28:19,821][06909] Updated weights for policy 0, policy_version 177223 (0.0033) [2024-06-28 07:28:22,976][06909] Updated weights for policy 0, policy_version 177233 (0.0030) [2024-06-28 07:28:23,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.8, 300 sec: 43931.3). Total num frames: 2903785472. Throughput: 0: 44177.4. Samples: 2806771820. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 07:28:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:28:27,023][06909] Updated weights for policy 0, policy_version 177243 (0.0026) [2024-06-28 07:28:28,850][06674] Fps is (10 sec: 44235.7, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 2904031232. Throughput: 0: 44343.8. Samples: 2806905520. Policy #0 lag: (min: 1.0, avg: 11.3, max: 22.0) [2024-06-28 07:28:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:28:30,290][06909] Updated weights for policy 0, policy_version 177253 (0.0025) [2024-06-28 07:28:33,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2904244224. Throughput: 0: 43980.0. Samples: 2807170600. Policy #0 lag: (min: 1.0, avg: 11.3, max: 22.0) [2024-06-28 07:28:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:28:34,196][06909] Updated weights for policy 0, policy_version 177263 (0.0038) [2024-06-28 07:28:38,096][06909] Updated weights for policy 0, policy_version 177273 (0.0032) [2024-06-28 07:28:38,856][06674] Fps is (10 sec: 44210.8, 60 sec: 44232.3, 300 sec: 44041.5). Total num frames: 2904473600. Throughput: 0: 44204.3. Samples: 2807441380. Policy #0 lag: (min: 1.0, avg: 11.3, max: 22.0) [2024-06-28 07:28:38,857][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 07:28:41,595][06909] Updated weights for policy 0, policy_version 177283 (0.0035) [2024-06-28 07:28:43,850][06674] Fps is (10 sec: 42596.6, 60 sec: 43690.4, 300 sec: 43931.3). Total num frames: 2904670208. Throughput: 0: 44270.7. Samples: 2807570320. Policy #0 lag: (min: 1.0, avg: 11.3, max: 22.0) [2024-06-28 07:28:43,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:28:45,314][06909] Updated weights for policy 0, policy_version 177293 (0.0031) [2024-06-28 07:28:48,850][06674] Fps is (10 sec: 44263.7, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2904915968. Throughput: 0: 44145.5. Samples: 2807833800. Policy #0 lag: (min: 1.0, avg: 11.3, max: 22.0) [2024-06-28 07:28:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:28:49,074][06909] Updated weights for policy 0, policy_version 177303 (0.0042) [2024-06-28 07:28:51,923][06887] Signal inference workers to stop experience collection... (39900 times) [2024-06-28 07:28:51,923][06887] Signal inference workers to resume experience collection... (39900 times) [2024-06-28 07:28:51,959][06909] InferenceWorker_p0-w0: stopping experience collection (39900 times) [2024-06-28 07:28:51,960][06909] InferenceWorker_p0-w0: resuming experience collection (39900 times) [2024-06-28 07:28:53,289][06909] Updated weights for policy 0, policy_version 177313 (0.0025) [2024-06-28 07:28:53,850][06674] Fps is (10 sec: 44238.6, 60 sec: 43963.8, 300 sec: 43987.0). Total num frames: 2905112576. Throughput: 0: 44300.0. Samples: 2808096680. Policy #0 lag: (min: 1.0, avg: 11.3, max: 22.0) [2024-06-28 07:28:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:28:56,746][06909] Updated weights for policy 0, policy_version 177323 (0.0026) [2024-06-28 07:28:58,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 2905341952. Throughput: 0: 44285.2. Samples: 2808226100. Policy #0 lag: (min: 1.0, avg: 11.3, max: 22.0) [2024-06-28 07:28:58,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:29:00,715][06909] Updated weights for policy 0, policy_version 177333 (0.0031) [2024-06-28 07:29:03,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43422.1, 300 sec: 44042.4). Total num frames: 2905554944. Throughput: 0: 43960.5. Samples: 2808484160. Policy #0 lag: (min: 1.0, avg: 11.3, max: 22.0) [2024-06-28 07:29:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:29:04,133][06909] Updated weights for policy 0, policy_version 177343 (0.0027) [2024-06-28 07:29:08,178][06909] Updated weights for policy 0, policy_version 177353 (0.0034) [2024-06-28 07:29:08,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2905784320. Throughput: 0: 44131.0. Samples: 2808757720. Policy #0 lag: (min: 1.0, avg: 11.3, max: 22.0) [2024-06-28 07:29:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:29:11,380][06909] Updated weights for policy 0, policy_version 177363 (0.0037) [2024-06-28 07:29:13,852][06674] Fps is (10 sec: 44227.4, 60 sec: 43962.2, 300 sec: 43986.8). Total num frames: 2905997312. Throughput: 0: 43958.7. Samples: 2808883740. Policy #0 lag: (min: 1.0, avg: 11.3, max: 22.0) [2024-06-28 07:29:13,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:29:15,534][06909] Updated weights for policy 0, policy_version 177373 (0.0033) [2024-06-28 07:29:18,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2906226688. Throughput: 0: 44008.4. Samples: 2809150980. Policy #0 lag: (min: 1.0, avg: 11.3, max: 22.0) [2024-06-28 07:29:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:29:19,079][06909] Updated weights for policy 0, policy_version 177383 (0.0035) [2024-06-28 07:29:22,910][06909] Updated weights for policy 0, policy_version 177393 (0.0043) [2024-06-28 07:29:23,850][06674] Fps is (10 sec: 44245.5, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 2906439680. Throughput: 0: 43832.6. Samples: 2809413580. Policy #0 lag: (min: 1.0, avg: 11.3, max: 22.0) [2024-06-28 07:29:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:29:26,627][06909] Updated weights for policy 0, policy_version 177403 (0.0032) [2024-06-28 07:29:28,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.8, 300 sec: 43986.9). Total num frames: 2906652672. Throughput: 0: 43818.5. Samples: 2809542140. Policy #0 lag: (min: 1.0, avg: 11.3, max: 22.0) [2024-06-28 07:29:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 07:29:30,795][06909] Updated weights for policy 0, policy_version 177413 (0.0029) [2024-06-28 07:29:33,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2906882048. Throughput: 0: 43798.3. Samples: 2809804720. Policy #0 lag: (min: 1.0, avg: 11.3, max: 22.0) [2024-06-28 07:29:33,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 07:29:33,890][06909] Updated weights for policy 0, policy_version 177423 (0.0030) [2024-06-28 07:29:37,855][06909] Updated weights for policy 0, policy_version 177433 (0.0038) [2024-06-28 07:29:38,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43968.1, 300 sec: 44042.7). Total num frames: 2907111424. Throughput: 0: 44061.6. Samples: 2810079460. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 07:29:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 07:29:41,476][06909] Updated weights for policy 0, policy_version 177443 (0.0046) [2024-06-28 07:29:43,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44510.2, 300 sec: 44098.0). Total num frames: 2907340800. Throughput: 0: 44149.4. Samples: 2810212820. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 07:29:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:29:45,326][06909] Updated weights for policy 0, policy_version 177453 (0.0044) [2024-06-28 07:29:48,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 2907537408. Throughput: 0: 44198.1. Samples: 2810473080. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 07:29:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 07:29:48,857][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000177463_2907553792.pth... [2024-06-28 07:29:48,863][06909] Updated weights for policy 0, policy_version 177463 (0.0025) [2024-06-28 07:29:48,902][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000176816_2896953344.pth [2024-06-28 07:29:52,586][06909] Updated weights for policy 0, policy_version 177473 (0.0041) [2024-06-28 07:29:53,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44509.9, 300 sec: 44098.3). Total num frames: 2907783168. Throughput: 0: 44020.1. Samples: 2810738620. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 07:29:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:29:56,356][06909] Updated weights for policy 0, policy_version 177483 (0.0042) [2024-06-28 07:29:58,850][06674] Fps is (10 sec: 45875.9, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2907996160. Throughput: 0: 44200.7. Samples: 2810872680. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 07:29:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:29:59,827][06909] Updated weights for policy 0, policy_version 177493 (0.0037) [2024-06-28 07:30:03,647][06909] Updated weights for policy 0, policy_version 177503 (0.0035) [2024-06-28 07:30:03,850][06674] Fps is (10 sec: 42598.0, 60 sec: 44236.7, 300 sec: 44043.3). Total num frames: 2908209152. Throughput: 0: 44047.0. Samples: 2811133100. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 07:30:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:30:07,840][06909] Updated weights for policy 0, policy_version 177513 (0.0038) [2024-06-28 07:30:08,850][06674] Fps is (10 sec: 44235.9, 60 sec: 44236.7, 300 sec: 44098.0). Total num frames: 2908438528. Throughput: 0: 44132.4. Samples: 2811399540. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 07:30:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:30:10,977][06909] Updated weights for policy 0, policy_version 177523 (0.0040) [2024-06-28 07:30:13,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44238.3, 300 sec: 44098.0). Total num frames: 2908651520. Throughput: 0: 44248.1. Samples: 2811533300. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 07:30:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:30:15,325][06909] Updated weights for policy 0, policy_version 177533 (0.0037) [2024-06-28 07:30:16,140][06887] Signal inference workers to stop experience collection... (39950 times) [2024-06-28 07:30:16,141][06887] Signal inference workers to resume experience collection... (39950 times) [2024-06-28 07:30:16,165][06909] InferenceWorker_p0-w0: stopping experience collection (39950 times) [2024-06-28 07:30:16,165][06909] InferenceWorker_p0-w0: resuming experience collection (39950 times) [2024-06-28 07:30:18,751][06909] Updated weights for policy 0, policy_version 177543 (0.0023) [2024-06-28 07:30:18,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43963.5, 300 sec: 43986.8). Total num frames: 2908864512. Throughput: 0: 44096.6. Samples: 2811789080. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 07:30:18,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:30:22,489][06909] Updated weights for policy 0, policy_version 177553 (0.0038) [2024-06-28 07:30:23,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 2909110272. Throughput: 0: 43813.5. Samples: 2812051060. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 07:30:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 07:30:26,412][06909] Updated weights for policy 0, policy_version 177563 (0.0041) [2024-06-28 07:30:28,850][06674] Fps is (10 sec: 44238.1, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2909306880. Throughput: 0: 44013.8. Samples: 2812193440. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 07:30:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 07:30:29,747][06909] Updated weights for policy 0, policy_version 177573 (0.0047) [2024-06-28 07:30:33,762][06909] Updated weights for policy 0, policy_version 177583 (0.0026) [2024-06-28 07:30:33,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2909519872. Throughput: 0: 43873.4. Samples: 2812447380. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 07:30:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:30:37,479][06909] Updated weights for policy 0, policy_version 177593 (0.0024) [2024-06-28 07:30:38,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 2909765632. Throughput: 0: 43810.2. Samples: 2812710080. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 07:30:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:30:41,187][06909] Updated weights for policy 0, policy_version 177603 (0.0030) [2024-06-28 07:30:43,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2909978624. Throughput: 0: 43853.7. Samples: 2812846100. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 07:30:43,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:30:44,819][06909] Updated weights for policy 0, policy_version 177613 (0.0037) [2024-06-28 07:30:48,659][06909] Updated weights for policy 0, policy_version 177623 (0.0027) [2024-06-28 07:30:48,852][06674] Fps is (10 sec: 40951.2, 60 sec: 43962.2, 300 sec: 44042.1). Total num frames: 2910175232. Throughput: 0: 43844.7. Samples: 2813106200. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 07:30:48,853][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:30:52,266][06909] Updated weights for policy 0, policy_version 177633 (0.0038) [2024-06-28 07:30:53,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.6, 300 sec: 44153.5). Total num frames: 2910420992. Throughput: 0: 43810.2. Samples: 2813371000. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 07:30:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 07:30:55,948][06909] Updated weights for policy 0, policy_version 177643 (0.0026) [2024-06-28 07:30:58,850][06674] Fps is (10 sec: 44245.9, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2910617600. Throughput: 0: 43925.7. Samples: 2813509960. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 07:30:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:30:59,735][06909] Updated weights for policy 0, policy_version 177653 (0.0029) [2024-06-28 07:31:03,619][06909] Updated weights for policy 0, policy_version 177663 (0.0034) [2024-06-28 07:31:03,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 2910830592. Throughput: 0: 44045.0. Samples: 2813771100. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 07:31:03,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:31:06,959][06909] Updated weights for policy 0, policy_version 177673 (0.0035) [2024-06-28 07:31:08,852][06674] Fps is (10 sec: 44227.9, 60 sec: 43689.3, 300 sec: 44097.7). Total num frames: 2911059968. Throughput: 0: 43999.7. Samples: 2814031140. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 07:31:08,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:31:11,027][06909] Updated weights for policy 0, policy_version 177683 (0.0037) [2024-06-28 07:31:13,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 2911305728. Throughput: 0: 43881.6. Samples: 2814168120. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 07:31:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:31:14,210][06909] Updated weights for policy 0, policy_version 177693 (0.0031) [2024-06-28 07:31:18,518][06909] Updated weights for policy 0, policy_version 177703 (0.0032) [2024-06-28 07:31:18,850][06674] Fps is (10 sec: 44246.1, 60 sec: 43963.9, 300 sec: 44098.0). Total num frames: 2911502336. Throughput: 0: 44210.2. Samples: 2814436840. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 07:31:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:31:22,079][06909] Updated weights for policy 0, policy_version 177713 (0.0038) [2024-06-28 07:31:23,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.6, 300 sec: 44153.5). Total num frames: 2911731712. Throughput: 0: 43920.3. Samples: 2814686500. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 07:31:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:31:25,896][06909] Updated weights for policy 0, policy_version 177723 (0.0051) [2024-06-28 07:31:27,464][06887] Signal inference workers to stop experience collection... (40000 times) [2024-06-28 07:31:27,472][06887] Signal inference workers to resume experience collection... (40000 times) [2024-06-28 07:31:27,481][06909] InferenceWorker_p0-w0: stopping experience collection (40000 times) [2024-06-28 07:31:27,496][06909] InferenceWorker_p0-w0: resuming experience collection (40000 times) [2024-06-28 07:31:28,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2911961088. Throughput: 0: 44017.0. Samples: 2814826860. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 07:31:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:31:29,302][06909] Updated weights for policy 0, policy_version 177733 (0.0038) [2024-06-28 07:31:33,281][06909] Updated weights for policy 0, policy_version 177743 (0.0031) [2024-06-28 07:31:33,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2912157696. Throughput: 0: 44165.7. Samples: 2815093560. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 07:31:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:31:36,686][06909] Updated weights for policy 0, policy_version 177753 (0.0029) [2024-06-28 07:31:38,850][06674] Fps is (10 sec: 42597.7, 60 sec: 43690.6, 300 sec: 44097.9). Total num frames: 2912387072. Throughput: 0: 44082.2. Samples: 2815354700. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 07:31:38,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:31:40,777][06909] Updated weights for policy 0, policy_version 177763 (0.0020) [2024-06-28 07:31:43,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 2912616448. Throughput: 0: 44080.1. Samples: 2815493560. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 07:31:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:31:44,082][06909] Updated weights for policy 0, policy_version 177773 (0.0035) [2024-06-28 07:31:48,317][06909] Updated weights for policy 0, policy_version 177783 (0.0030) [2024-06-28 07:31:48,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44238.3, 300 sec: 44098.0). Total num frames: 2912829440. Throughput: 0: 44101.3. Samples: 2815755660. Policy #0 lag: (min: 1.0, avg: 12.8, max: 23.0) [2024-06-28 07:31:48,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:31:48,856][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000177785_2912829440.pth... [2024-06-28 07:31:48,904][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000177142_2902294528.pth [2024-06-28 07:31:51,315][06909] Updated weights for policy 0, policy_version 177793 (0.0027) [2024-06-28 07:31:53,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.8, 300 sec: 44209.0). Total num frames: 2913058816. Throughput: 0: 44098.0. Samples: 2816015460. Policy #0 lag: (min: 1.0, avg: 12.8, max: 23.0) [2024-06-28 07:31:53,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 07:31:55,678][06909] Updated weights for policy 0, policy_version 177803 (0.0027) [2024-06-28 07:31:58,850][06674] Fps is (10 sec: 45876.0, 60 sec: 44510.0, 300 sec: 44098.0). Total num frames: 2913288192. Throughput: 0: 44111.3. Samples: 2816153120. Policy #0 lag: (min: 1.0, avg: 12.8, max: 23.0) [2024-06-28 07:31:58,853][06909] Updated weights for policy 0, policy_version 177813 (0.0032) [2024-06-28 07:31:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:32:02,997][06909] Updated weights for policy 0, policy_version 177823 (0.0039) [2024-06-28 07:32:03,850][06674] Fps is (10 sec: 42598.7, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 2913484800. Throughput: 0: 44067.5. Samples: 2816419880. Policy #0 lag: (min: 1.0, avg: 12.8, max: 23.0) [2024-06-28 07:32:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:32:06,611][06909] Updated weights for policy 0, policy_version 177833 (0.0029) [2024-06-28 07:32:08,852][06674] Fps is (10 sec: 42589.4, 60 sec: 44236.8, 300 sec: 44097.6). Total num frames: 2913714176. Throughput: 0: 44135.0. Samples: 2816672660. Policy #0 lag: (min: 1.0, avg: 12.8, max: 23.0) [2024-06-28 07:32:08,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:32:10,690][06909] Updated weights for policy 0, policy_version 177843 (0.0026) [2024-06-28 07:32:13,762][06909] Updated weights for policy 0, policy_version 177853 (0.0032) [2024-06-28 07:32:13,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.8, 300 sec: 44042.6). Total num frames: 2913943552. Throughput: 0: 44079.1. Samples: 2816810420. Policy #0 lag: (min: 1.0, avg: 12.8, max: 23.0) [2024-06-28 07:32:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:32:17,952][06909] Updated weights for policy 0, policy_version 177863 (0.0038) [2024-06-28 07:32:18,850][06674] Fps is (10 sec: 42606.9, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2914140160. Throughput: 0: 44061.7. Samples: 2817076340. Policy #0 lag: (min: 1.0, avg: 12.8, max: 23.0) [2024-06-28 07:32:18,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:32:20,918][06909] Updated weights for policy 0, policy_version 177873 (0.0036) [2024-06-28 07:32:23,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2914369536. Throughput: 0: 44048.1. Samples: 2817336860. Policy #0 lag: (min: 1.0, avg: 12.8, max: 23.0) [2024-06-28 07:32:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:32:25,340][06909] Updated weights for policy 0, policy_version 177883 (0.0032) [2024-06-28 07:32:28,659][06909] Updated weights for policy 0, policy_version 177893 (0.0039) [2024-06-28 07:32:28,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2914615296. Throughput: 0: 44015.5. Samples: 2817474260. Policy #0 lag: (min: 1.0, avg: 12.8, max: 23.0) [2024-06-28 07:32:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:32:32,649][06909] Updated weights for policy 0, policy_version 177903 (0.0034) [2024-06-28 07:32:33,852][06674] Fps is (10 sec: 42589.8, 60 sec: 43962.2, 300 sec: 43986.6). Total num frames: 2914795520. Throughput: 0: 44040.3. Samples: 2817737560. Policy #0 lag: (min: 1.0, avg: 12.8, max: 23.0) [2024-06-28 07:32:33,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:32:35,700][06887] Signal inference workers to stop experience collection... (40050 times) [2024-06-28 07:32:35,700][06887] Signal inference workers to resume experience collection... (40050 times) [2024-06-28 07:32:35,729][06909] InferenceWorker_p0-w0: stopping experience collection (40050 times) [2024-06-28 07:32:35,729][06909] InferenceWorker_p0-w0: resuming experience collection (40050 times) [2024-06-28 07:32:36,063][06909] Updated weights for policy 0, policy_version 177913 (0.0033) [2024-06-28 07:32:38,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2915024896. Throughput: 0: 44142.1. Samples: 2818001860. Policy #0 lag: (min: 1.0, avg: 12.8, max: 23.0) [2024-06-28 07:32:38,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:32:39,846][06909] Updated weights for policy 0, policy_version 177923 (0.0033) [2024-06-28 07:32:43,411][06909] Updated weights for policy 0, policy_version 177933 (0.0026) [2024-06-28 07:32:43,850][06674] Fps is (10 sec: 49161.9, 60 sec: 44509.8, 300 sec: 44097.9). Total num frames: 2915287040. Throughput: 0: 44177.7. Samples: 2818141120. Policy #0 lag: (min: 1.0, avg: 12.8, max: 23.0) [2024-06-28 07:32:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:32:47,419][06909] Updated weights for policy 0, policy_version 177943 (0.0032) [2024-06-28 07:32:48,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2915450880. Throughput: 0: 44039.9. Samples: 2818401680. Policy #0 lag: (min: 1.0, avg: 12.8, max: 23.0) [2024-06-28 07:32:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:32:50,915][06909] Updated weights for policy 0, policy_version 177953 (0.0032) [2024-06-28 07:32:53,850][06674] Fps is (10 sec: 39321.5, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 2915680256. Throughput: 0: 44107.3. Samples: 2818657400. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 07:32:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:32:54,826][06909] Updated weights for policy 0, policy_version 177963 (0.0042) [2024-06-28 07:32:58,174][06909] Updated weights for policy 0, policy_version 177973 (0.0037) [2024-06-28 07:32:58,850][06674] Fps is (10 sec: 49152.6, 60 sec: 44236.8, 300 sec: 44043.3). Total num frames: 2915942400. Throughput: 0: 44206.7. Samples: 2818799720. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 07:32:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:33:02,639][06909] Updated weights for policy 0, policy_version 177983 (0.0028) [2024-06-28 07:33:03,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 2916106240. Throughput: 0: 44076.9. Samples: 2819059800. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 07:33:03,860][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:33:05,780][06909] Updated weights for policy 0, policy_version 177993 (0.0028) [2024-06-28 07:33:08,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43965.3, 300 sec: 44042.4). Total num frames: 2916352000. Throughput: 0: 44097.9. Samples: 2819321260. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 07:33:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:33:09,959][06909] Updated weights for policy 0, policy_version 178003 (0.0042) [2024-06-28 07:33:13,290][06909] Updated weights for policy 0, policy_version 178013 (0.0023) [2024-06-28 07:33:13,850][06674] Fps is (10 sec: 49152.6, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2916597760. Throughput: 0: 44169.9. Samples: 2819461900. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 07:33:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:33:17,326][06909] Updated weights for policy 0, policy_version 178023 (0.0020) [2024-06-28 07:33:18,850][06674] Fps is (10 sec: 42597.6, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2916777984. Throughput: 0: 43857.9. Samples: 2819711080. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 07:33:18,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:33:20,805][06909] Updated weights for policy 0, policy_version 178033 (0.0040) [2024-06-28 07:33:23,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2917007360. Throughput: 0: 43784.6. Samples: 2819972160. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 07:33:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:33:24,766][06909] Updated weights for policy 0, policy_version 178043 (0.0038) [2024-06-28 07:33:28,110][06909] Updated weights for policy 0, policy_version 178053 (0.0026) [2024-06-28 07:33:28,856][06674] Fps is (10 sec: 49123.1, 60 sec: 44232.4, 300 sec: 44152.6). Total num frames: 2917269504. Throughput: 0: 43838.6. Samples: 2820114120. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 07:33:28,856][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:33:32,335][06909] Updated weights for policy 0, policy_version 178063 (0.0033) [2024-06-28 07:33:33,852][06674] Fps is (10 sec: 40951.6, 60 sec: 43690.7, 300 sec: 43876.4). Total num frames: 2917416960. Throughput: 0: 43763.5. Samples: 2820371120. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 07:33:33,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:33:35,422][06909] Updated weights for policy 0, policy_version 178073 (0.0033) [2024-06-28 07:33:38,850][06674] Fps is (10 sec: 39345.2, 60 sec: 43963.8, 300 sec: 44042.5). Total num frames: 2917662720. Throughput: 0: 43924.0. Samples: 2820633980. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 07:33:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:33:39,986][06909] Updated weights for policy 0, policy_version 178083 (0.0026) [2024-06-28 07:33:42,847][06909] Updated weights for policy 0, policy_version 178093 (0.0033) [2024-06-28 07:33:43,427][06887] Signal inference workers to stop experience collection... (40100 times) [2024-06-28 07:33:43,427][06887] Signal inference workers to resume experience collection... (40100 times) [2024-06-28 07:33:43,436][06909] InferenceWorker_p0-w0: stopping experience collection (40100 times) [2024-06-28 07:33:43,448][06909] InferenceWorker_p0-w0: resuming experience collection (40100 times) [2024-06-28 07:33:43,850][06674] Fps is (10 sec: 49162.0, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2917908480. Throughput: 0: 43914.7. Samples: 2820775880. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 07:33:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:33:47,231][06909] Updated weights for policy 0, policy_version 178103 (0.0037) [2024-06-28 07:33:48,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2918105088. Throughput: 0: 43928.9. Samples: 2821036600. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 07:33:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:33:48,981][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000178108_2918121472.pth... [2024-06-28 07:33:49,023][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000177463_2907553792.pth [2024-06-28 07:33:50,700][06909] Updated weights for policy 0, policy_version 178113 (0.0033) [2024-06-28 07:33:53,850][06674] Fps is (10 sec: 42597.9, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2918334464. Throughput: 0: 43739.0. Samples: 2821289520. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 07:33:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:33:54,594][06909] Updated weights for policy 0, policy_version 178123 (0.0030) [2024-06-28 07:33:58,215][06909] Updated weights for policy 0, policy_version 178133 (0.0026) [2024-06-28 07:33:58,850][06674] Fps is (10 sec: 47514.1, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2918580224. Throughput: 0: 43703.1. Samples: 2821428540. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 07:33:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:34:02,161][06909] Updated weights for policy 0, policy_version 178143 (0.0034) [2024-06-28 07:34:03,850][06674] Fps is (10 sec: 40960.6, 60 sec: 43963.8, 300 sec: 43931.4). Total num frames: 2918744064. Throughput: 0: 43970.9. Samples: 2821689760. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 07:34:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:34:05,721][06909] Updated weights for policy 0, policy_version 178153 (0.0029) [2024-06-28 07:34:08,850][06674] Fps is (10 sec: 39321.6, 60 sec: 43690.7, 300 sec: 43987.2). Total num frames: 2918973440. Throughput: 0: 43971.1. Samples: 2821950860. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 07:34:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:34:09,765][06909] Updated weights for policy 0, policy_version 178163 (0.0038) [2024-06-28 07:34:12,970][06909] Updated weights for policy 0, policy_version 178173 (0.0032) [2024-06-28 07:34:13,850][06674] Fps is (10 sec: 47513.4, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2919219200. Throughput: 0: 43773.0. Samples: 2822083640. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 07:34:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:34:17,350][06909] Updated weights for policy 0, policy_version 178183 (0.0039) [2024-06-28 07:34:18,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.9, 300 sec: 43986.9). Total num frames: 2919415808. Throughput: 0: 43912.2. Samples: 2822347080. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 07:34:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:34:20,589][06909] Updated weights for policy 0, policy_version 178193 (0.0026) [2024-06-28 07:34:23,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.7, 300 sec: 44098.0). Total num frames: 2919661568. Throughput: 0: 43972.9. Samples: 2822612760. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 07:34:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:34:24,565][06909] Updated weights for policy 0, policy_version 178203 (0.0032) [2024-06-28 07:34:27,730][06909] Updated weights for policy 0, policy_version 178213 (0.0036) [2024-06-28 07:34:28,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43422.0, 300 sec: 44042.4). Total num frames: 2919874560. Throughput: 0: 43829.8. Samples: 2822748220. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 07:34:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:34:31,718][06909] Updated weights for policy 0, policy_version 178223 (0.0045) [2024-06-28 07:34:33,850][06674] Fps is (10 sec: 40960.7, 60 sec: 44238.4, 300 sec: 43931.4). Total num frames: 2920071168. Throughput: 0: 44029.0. Samples: 2823017900. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 07:34:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:34:35,319][06909] Updated weights for policy 0, policy_version 178233 (0.0043) [2024-06-28 07:34:38,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 2920300544. Throughput: 0: 44206.0. Samples: 2823278780. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 07:34:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:34:39,267][06909] Updated weights for policy 0, policy_version 178243 (0.0038) [2024-06-28 07:34:42,580][06909] Updated weights for policy 0, policy_version 178253 (0.0034) [2024-06-28 07:34:43,850][06674] Fps is (10 sec: 49151.2, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 2920562688. Throughput: 0: 44019.9. Samples: 2823409440. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 07:34:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:34:46,776][06909] Updated weights for policy 0, policy_version 178263 (0.0035) [2024-06-28 07:34:48,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 2920759296. Throughput: 0: 44302.2. Samples: 2823683360. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 07:34:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:34:50,097][06909] Updated weights for policy 0, policy_version 178273 (0.0028) [2024-06-28 07:34:53,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2920972288. Throughput: 0: 44125.8. Samples: 2823936520. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 07:34:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:34:54,182][06909] Updated weights for policy 0, policy_version 178283 (0.0031) [2024-06-28 07:34:57,522][06909] Updated weights for policy 0, policy_version 178293 (0.0030) [2024-06-28 07:34:58,853][06674] Fps is (10 sec: 45858.7, 60 sec: 43961.1, 300 sec: 44097.4). Total num frames: 2921218048. Throughput: 0: 44228.9. Samples: 2824074100. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 07:34:58,854][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:35:01,355][06909] Updated weights for policy 0, policy_version 178303 (0.0026) [2024-06-28 07:35:03,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44509.8, 300 sec: 43986.9). Total num frames: 2921414656. Throughput: 0: 44315.1. Samples: 2824341260. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 07:35:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:35:05,166][06909] Updated weights for policy 0, policy_version 178313 (0.0033) [2024-06-28 07:35:08,790][06909] Updated weights for policy 0, policy_version 178323 (0.0027) [2024-06-28 07:35:08,850][06674] Fps is (10 sec: 42613.3, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 2921644032. Throughput: 0: 44277.8. Samples: 2824605260. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 07:35:08,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 07:35:12,380][06909] Updated weights for policy 0, policy_version 178333 (0.0032) [2024-06-28 07:35:13,852][06674] Fps is (10 sec: 45865.9, 60 sec: 44235.3, 300 sec: 44097.7). Total num frames: 2921873408. Throughput: 0: 44193.9. Samples: 2824737040. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 07:35:13,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:35:16,200][06909] Updated weights for policy 0, policy_version 178343 (0.0031) [2024-06-28 07:35:18,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44509.9, 300 sec: 43986.9). Total num frames: 2922086400. Throughput: 0: 44243.9. Samples: 2825008880. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 07:35:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:35:19,674][06909] Updated weights for policy 0, policy_version 178353 (0.0027) [2024-06-28 07:35:23,718][06909] Updated weights for policy 0, policy_version 178363 (0.0031) [2024-06-28 07:35:23,850][06674] Fps is (10 sec: 42606.9, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2922299392. Throughput: 0: 44351.4. Samples: 2825274600. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 07:35:23,854][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:35:26,958][06909] Updated weights for policy 0, policy_version 178373 (0.0042) [2024-06-28 07:35:28,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2922528768. Throughput: 0: 44291.1. Samples: 2825402540. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 07:35:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:35:29,796][06887] Signal inference workers to stop experience collection... (40150 times) [2024-06-28 07:35:29,796][06887] Signal inference workers to resume experience collection... (40150 times) [2024-06-28 07:35:29,814][06909] InferenceWorker_p0-w0: stopping experience collection (40150 times) [2024-06-28 07:35:29,815][06909] InferenceWorker_p0-w0: resuming experience collection (40150 times) [2024-06-28 07:35:31,379][06909] Updated weights for policy 0, policy_version 178383 (0.0033) [2024-06-28 07:35:33,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44509.8, 300 sec: 43986.9). Total num frames: 2922741760. Throughput: 0: 44189.3. Samples: 2825671880. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 07:35:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:35:34,406][06909] Updated weights for policy 0, policy_version 178393 (0.0022) [2024-06-28 07:35:38,534][06909] Updated weights for policy 0, policy_version 178403 (0.0028) [2024-06-28 07:35:38,850][06674] Fps is (10 sec: 42598.7, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2922954752. Throughput: 0: 44508.0. Samples: 2825939380. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 07:35:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:35:42,098][06909] Updated weights for policy 0, policy_version 178413 (0.0032) [2024-06-28 07:35:43,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.8, 300 sec: 44153.8). Total num frames: 2923200512. Throughput: 0: 44280.4. Samples: 2826066560. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 07:35:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:35:45,992][06909] Updated weights for policy 0, policy_version 178423 (0.0038) [2024-06-28 07:35:48,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2923413504. Throughput: 0: 44180.4. Samples: 2826329380. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 07:35:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:35:48,874][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000178431_2923413504.pth... [2024-06-28 07:35:48,917][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000177785_2912829440.pth [2024-06-28 07:35:49,654][06909] Updated weights for policy 0, policy_version 178433 (0.0042) [2024-06-28 07:35:53,512][06909] Updated weights for policy 0, policy_version 178443 (0.0038) [2024-06-28 07:35:53,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2923626496. Throughput: 0: 44221.4. Samples: 2826595220. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 07:35:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:35:57,043][06909] Updated weights for policy 0, policy_version 178453 (0.0021) [2024-06-28 07:35:58,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43966.4, 300 sec: 44153.5). Total num frames: 2923855872. Throughput: 0: 44110.1. Samples: 2826721900. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 07:35:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:36:01,136][06909] Updated weights for policy 0, policy_version 178463 (0.0034) [2024-06-28 07:36:03,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44509.9, 300 sec: 44153.8). Total num frames: 2924085248. Throughput: 0: 43960.8. Samples: 2826987120. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 07:36:03,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:36:04,542][06909] Updated weights for policy 0, policy_version 178473 (0.0038) [2024-06-28 07:36:08,519][06909] Updated weights for policy 0, policy_version 178483 (0.0043) [2024-06-28 07:36:08,850][06674] Fps is (10 sec: 40958.9, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 2924265472. Throughput: 0: 43959.0. Samples: 2827252760. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 07:36:08,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:36:12,223][06909] Updated weights for policy 0, policy_version 178493 (0.0021) [2024-06-28 07:36:13,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43965.2, 300 sec: 44097.9). Total num frames: 2924511232. Throughput: 0: 43960.9. Samples: 2827380780. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 07:36:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:36:16,224][06909] Updated weights for policy 0, policy_version 178503 (0.0027) [2024-06-28 07:36:18,850][06674] Fps is (10 sec: 47514.7, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2924740608. Throughput: 0: 43766.2. Samples: 2827641360. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 07:36:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:36:19,454][06909] Updated weights for policy 0, policy_version 178513 (0.0031) [2024-06-28 07:36:23,387][06909] Updated weights for policy 0, policy_version 178523 (0.0033) [2024-06-28 07:36:23,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2924937216. Throughput: 0: 43738.2. Samples: 2827907600. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 07:36:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 07:36:26,851][06909] Updated weights for policy 0, policy_version 178533 (0.0034) [2024-06-28 07:36:28,854][06674] Fps is (10 sec: 42582.4, 60 sec: 43961.0, 300 sec: 44097.4). Total num frames: 2925166592. Throughput: 0: 43802.2. Samples: 2828037820. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 07:36:28,854][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:36:30,729][06909] Updated weights for policy 0, policy_version 178543 (0.0043) [2024-06-28 07:36:33,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.7, 300 sec: 44098.0). Total num frames: 2925395968. Throughput: 0: 43934.2. Samples: 2828306420. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 07:36:33,853][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:36:34,291][06909] Updated weights for policy 0, policy_version 178553 (0.0038) [2024-06-28 07:36:38,384][06909] Updated weights for policy 0, policy_version 178563 (0.0033) [2024-06-28 07:36:38,850][06674] Fps is (10 sec: 40975.3, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 2925576192. Throughput: 0: 43863.6. Samples: 2828569080. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 07:36:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:36:41,974][06909] Updated weights for policy 0, policy_version 178573 (0.0020) [2024-06-28 07:36:43,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2925821952. Throughput: 0: 43938.1. Samples: 2828699120. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 07:36:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:36:45,632][06909] Updated weights for policy 0, policy_version 178583 (0.0033) [2024-06-28 07:36:48,852][06674] Fps is (10 sec: 47503.9, 60 sec: 43962.3, 300 sec: 44042.1). Total num frames: 2926051328. Throughput: 0: 44027.4. Samples: 2828968440. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 07:36:48,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:36:49,258][06887] Signal inference workers to stop experience collection... (40200 times) [2024-06-28 07:36:49,291][06909] InferenceWorker_p0-w0: stopping experience collection (40200 times) [2024-06-28 07:36:49,317][06887] Signal inference workers to resume experience collection... (40200 times) [2024-06-28 07:36:49,318][06909] InferenceWorker_p0-w0: resuming experience collection (40200 times) [2024-06-28 07:36:49,322][06909] Updated weights for policy 0, policy_version 178593 (0.0032) [2024-06-28 07:36:53,297][06909] Updated weights for policy 0, policy_version 178603 (0.0031) [2024-06-28 07:36:53,852][06674] Fps is (10 sec: 44227.9, 60 sec: 43962.2, 300 sec: 43986.6). Total num frames: 2926264320. Throughput: 0: 44059.1. Samples: 2829235500. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 07:36:53,853][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:36:56,733][06909] Updated weights for policy 0, policy_version 178613 (0.0030) [2024-06-28 07:36:58,850][06674] Fps is (10 sec: 42607.0, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2926477312. Throughput: 0: 44041.4. Samples: 2829362640. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 07:36:58,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:37:00,535][06909] Updated weights for policy 0, policy_version 178623 (0.0027) [2024-06-28 07:37:03,850][06674] Fps is (10 sec: 44245.9, 60 sec: 43690.7, 300 sec: 44042.7). Total num frames: 2926706688. Throughput: 0: 44125.8. Samples: 2829627020. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 07:37:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:37:04,069][06909] Updated weights for policy 0, policy_version 178633 (0.0029) [2024-06-28 07:37:07,994][06909] Updated weights for policy 0, policy_version 178643 (0.0026) [2024-06-28 07:37:08,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 2926919680. Throughput: 0: 44253.7. Samples: 2829899020. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 07:37:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:37:11,342][06909] Updated weights for policy 0, policy_version 178653 (0.0028) [2024-06-28 07:37:13,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2927132672. Throughput: 0: 44175.2. Samples: 2830025540. Policy #0 lag: (min: 0.0, avg: 11.5, max: 23.0) [2024-06-28 07:37:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:37:15,341][06909] Updated weights for policy 0, policy_version 178663 (0.0029) [2024-06-28 07:37:18,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2927362048. Throughput: 0: 44003.2. Samples: 2830286560. Policy #0 lag: (min: 0.0, avg: 11.5, max: 23.0) [2024-06-28 07:37:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:37:19,053][06909] Updated weights for policy 0, policy_version 178673 (0.0037) [2024-06-28 07:37:22,794][06909] Updated weights for policy 0, policy_version 178683 (0.0026) [2024-06-28 07:37:23,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 2927575040. Throughput: 0: 44123.5. Samples: 2830554640. Policy #0 lag: (min: 0.0, avg: 11.5, max: 23.0) [2024-06-28 07:37:23,851][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 07:37:26,482][06909] Updated weights for policy 0, policy_version 178693 (0.0030) [2024-06-28 07:37:28,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43966.4, 300 sec: 44098.3). Total num frames: 2927804416. Throughput: 0: 44059.5. Samples: 2830681800. Policy #0 lag: (min: 0.0, avg: 11.5, max: 23.0) [2024-06-28 07:37:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:37:30,284][06909] Updated weights for policy 0, policy_version 178703 (0.0034) [2024-06-28 07:37:33,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2928017408. Throughput: 0: 43969.0. Samples: 2830946960. Policy #0 lag: (min: 0.0, avg: 11.5, max: 23.0) [2024-06-28 07:37:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:37:34,207][06909] Updated weights for policy 0, policy_version 178713 (0.0032) [2024-06-28 07:37:37,601][06909] Updated weights for policy 0, policy_version 178723 (0.0031) [2024-06-28 07:37:38,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.8, 300 sec: 43875.8). Total num frames: 2928230400. Throughput: 0: 43859.3. Samples: 2831209080. Policy #0 lag: (min: 0.0, avg: 11.5, max: 23.0) [2024-06-28 07:37:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:37:41,547][06909] Updated weights for policy 0, policy_version 178733 (0.0043) [2024-06-28 07:37:43,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2928443392. Throughput: 0: 43867.5. Samples: 2831336680. Policy #0 lag: (min: 0.0, avg: 11.5, max: 23.0) [2024-06-28 07:37:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:37:45,057][06909] Updated weights for policy 0, policy_version 178743 (0.0044) [2024-06-28 07:37:48,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43692.2, 300 sec: 44042.4). Total num frames: 2928672768. Throughput: 0: 43906.7. Samples: 2831602820. Policy #0 lag: (min: 0.0, avg: 11.5, max: 23.0) [2024-06-28 07:37:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:37:48,877][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000178752_2928672768.pth... [2024-06-28 07:37:48,949][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000178108_2918121472.pth [2024-06-28 07:37:49,130][06909] Updated weights for policy 0, policy_version 178753 (0.0024) [2024-06-28 07:37:52,743][06909] Updated weights for policy 0, policy_version 178763 (0.0026) [2024-06-28 07:37:53,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43965.2, 300 sec: 43931.3). Total num frames: 2928902144. Throughput: 0: 43768.9. Samples: 2831868620. Policy #0 lag: (min: 0.0, avg: 11.5, max: 23.0) [2024-06-28 07:37:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:37:56,454][06909] Updated weights for policy 0, policy_version 178773 (0.0035) [2024-06-28 07:37:58,850][06674] Fps is (10 sec: 44234.4, 60 sec: 43963.3, 300 sec: 44097.9). Total num frames: 2929115136. Throughput: 0: 43875.5. Samples: 2831999960. Policy #0 lag: (min: 0.0, avg: 11.5, max: 23.0) [2024-06-28 07:37:58,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:38:00,073][06909] Updated weights for policy 0, policy_version 178783 (0.0041) [2024-06-28 07:38:03,793][06909] Updated weights for policy 0, policy_version 178793 (0.0030) [2024-06-28 07:38:03,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 2929344512. Throughput: 0: 43974.6. Samples: 2832265420. Policy #0 lag: (min: 0.0, avg: 11.5, max: 23.0) [2024-06-28 07:38:03,851][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 07:38:07,306][06909] Updated weights for policy 0, policy_version 178803 (0.0032) [2024-06-28 07:38:08,850][06674] Fps is (10 sec: 45876.9, 60 sec: 44236.7, 300 sec: 43986.8). Total num frames: 2929573888. Throughput: 0: 43920.4. Samples: 2832531060. Policy #0 lag: (min: 0.0, avg: 11.5, max: 23.0) [2024-06-28 07:38:08,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:38:10,951][06909] Updated weights for policy 0, policy_version 178813 (0.0032) [2024-06-28 07:38:13,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2929770496. Throughput: 0: 44070.1. Samples: 2832664960. Policy #0 lag: (min: 0.0, avg: 11.5, max: 23.0) [2024-06-28 07:38:13,856][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:38:15,086][06909] Updated weights for policy 0, policy_version 178823 (0.0031) [2024-06-28 07:38:18,635][06909] Updated weights for policy 0, policy_version 178833 (0.0029) [2024-06-28 07:38:18,850][06674] Fps is (10 sec: 42599.4, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2929999872. Throughput: 0: 44004.2. Samples: 2832927140. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 07:38:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:38:19,065][06887] Signal inference workers to stop experience collection... (40250 times) [2024-06-28 07:38:19,066][06887] Signal inference workers to resume experience collection... (40250 times) [2024-06-28 07:38:19,106][06909] InferenceWorker_p0-w0: stopping experience collection (40250 times) [2024-06-28 07:38:19,106][06909] InferenceWorker_p0-w0: resuming experience collection (40250 times) [2024-06-28 07:38:22,490][06909] Updated weights for policy 0, policy_version 178843 (0.0027) [2024-06-28 07:38:23,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44236.8, 300 sec: 43932.2). Total num frames: 2930229248. Throughput: 0: 43935.1. Samples: 2833186160. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 07:38:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:38:26,048][06909] Updated weights for policy 0, policy_version 178853 (0.0034) [2024-06-28 07:38:28,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43963.7, 300 sec: 44153.8). Total num frames: 2930442240. Throughput: 0: 44051.1. Samples: 2833318980. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 07:38:28,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:38:29,815][06909] Updated weights for policy 0, policy_version 178863 (0.0035) [2024-06-28 07:38:33,469][06909] Updated weights for policy 0, policy_version 178873 (0.0027) [2024-06-28 07:38:33,851][06674] Fps is (10 sec: 44230.1, 60 sec: 44235.8, 300 sec: 44097.7). Total num frames: 2930671616. Throughput: 0: 44067.4. Samples: 2833585920. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 07:38:33,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:38:37,382][06909] Updated weights for policy 0, policy_version 178883 (0.0029) [2024-06-28 07:38:38,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2930884608. Throughput: 0: 44151.6. Samples: 2833855440. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 07:38:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:38:41,054][06909] Updated weights for policy 0, policy_version 178893 (0.0038) [2024-06-28 07:38:43,850][06674] Fps is (10 sec: 42605.2, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 2931097600. Throughput: 0: 44181.5. Samples: 2833988100. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 07:38:43,850][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 07:38:44,698][06909] Updated weights for policy 0, policy_version 178903 (0.0038) [2024-06-28 07:38:48,369][06909] Updated weights for policy 0, policy_version 178913 (0.0030) [2024-06-28 07:38:48,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2931326976. Throughput: 0: 44161.5. Samples: 2834252680. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 07:38:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:38:52,253][06909] Updated weights for policy 0, policy_version 178923 (0.0045) [2024-06-28 07:38:53,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 2931556352. Throughput: 0: 44020.2. Samples: 2834511960. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 07:38:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:38:55,977][06909] Updated weights for policy 0, policy_version 178933 (0.0034) [2024-06-28 07:38:58,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44237.2, 300 sec: 44153.5). Total num frames: 2931769344. Throughput: 0: 43877.0. Samples: 2834639420. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 07:38:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:38:59,546][06909] Updated weights for policy 0, policy_version 178943 (0.0030) [2024-06-28 07:39:03,191][06909] Updated weights for policy 0, policy_version 178953 (0.0029) [2024-06-28 07:39:03,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44509.9, 300 sec: 44209.0). Total num frames: 2932015104. Throughput: 0: 44051.5. Samples: 2834909460. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 07:39:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:39:07,070][06909] Updated weights for policy 0, policy_version 178963 (0.0031) [2024-06-28 07:39:08,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2932211712. Throughput: 0: 44038.6. Samples: 2835167900. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 07:39:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:39:10,675][06909] Updated weights for policy 0, policy_version 178973 (0.0034) [2024-06-28 07:39:13,850][06674] Fps is (10 sec: 39322.0, 60 sec: 43963.9, 300 sec: 44042.4). Total num frames: 2932408320. Throughput: 0: 44093.9. Samples: 2835303200. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 07:39:13,858][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 07:39:14,401][06909] Updated weights for policy 0, policy_version 178983 (0.0026) [2024-06-28 07:39:18,227][06909] Updated weights for policy 0, policy_version 178993 (0.0033) [2024-06-28 07:39:18,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44509.8, 300 sec: 44098.0). Total num frames: 2932670464. Throughput: 0: 44113.0. Samples: 2835570940. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 07:39:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:39:21,605][06909] Updated weights for policy 0, policy_version 179003 (0.0037) [2024-06-28 07:39:23,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2932867072. Throughput: 0: 43998.2. Samples: 2835835360. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 07:39:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:39:25,537][06909] Updated weights for policy 0, policy_version 179013 (0.0028) [2024-06-28 07:39:28,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2933096448. Throughput: 0: 44012.4. Samples: 2835968660. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 07:39:28,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 07:39:29,164][06909] Updated weights for policy 0, policy_version 179023 (0.0030) [2024-06-28 07:39:33,040][06909] Updated weights for policy 0, policy_version 179033 (0.0039) [2024-06-28 07:39:33,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43964.8, 300 sec: 44097.9). Total num frames: 2933309440. Throughput: 0: 44048.3. Samples: 2836234860. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 07:39:33,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:39:36,942][06909] Updated weights for policy 0, policy_version 179043 (0.0029) [2024-06-28 07:39:38,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 2933522432. Throughput: 0: 43951.1. Samples: 2836489760. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 07:39:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:39:40,539][06909] Updated weights for policy 0, policy_version 179053 (0.0024) [2024-06-28 07:39:40,656][06887] Signal inference workers to stop experience collection... (40300 times) [2024-06-28 07:39:40,657][06887] Signal inference workers to resume experience collection... (40300 times) [2024-06-28 07:39:40,683][06909] InferenceWorker_p0-w0: stopping experience collection (40300 times) [2024-06-28 07:39:40,688][06909] InferenceWorker_p0-w0: resuming experience collection (40300 times) [2024-06-28 07:39:43,850][06674] Fps is (10 sec: 44237.4, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2933751808. Throughput: 0: 44022.6. Samples: 2836620440. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 07:39:43,850][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 07:39:44,258][06909] Updated weights for policy 0, policy_version 179063 (0.0031) [2024-06-28 07:39:47,902][06909] Updated weights for policy 0, policy_version 179073 (0.0030) [2024-06-28 07:39:48,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2933981184. Throughput: 0: 44030.6. Samples: 2836890840. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 07:39:48,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 07:39:48,866][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000179076_2933981184.pth... [2024-06-28 07:39:48,921][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000178431_2923413504.pth [2024-06-28 07:39:51,504][06909] Updated weights for policy 0, policy_version 179083 (0.0031) [2024-06-28 07:39:53,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.6, 300 sec: 43987.4). Total num frames: 2934194176. Throughput: 0: 44031.1. Samples: 2837149300. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 07:39:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:39:55,371][06909] Updated weights for policy 0, policy_version 179093 (0.0044) [2024-06-28 07:39:58,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2934407168. Throughput: 0: 43936.9. Samples: 2837280360. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 07:39:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:39:58,951][06909] Updated weights for policy 0, policy_version 179103 (0.0041) [2024-06-28 07:40:03,060][06909] Updated weights for policy 0, policy_version 179113 (0.0036) [2024-06-28 07:40:03,853][06674] Fps is (10 sec: 44222.5, 60 sec: 43688.3, 300 sec: 44041.9). Total num frames: 2934636544. Throughput: 0: 44062.5. Samples: 2837553900. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 07:40:03,854][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:40:06,158][06909] Updated weights for policy 0, policy_version 179123 (0.0030) [2024-06-28 07:40:08,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.7, 300 sec: 43931.6). Total num frames: 2934833152. Throughput: 0: 43914.3. Samples: 2837811500. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 07:40:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:40:10,294][06909] Updated weights for policy 0, policy_version 179133 (0.0032) [2024-06-28 07:40:13,850][06674] Fps is (10 sec: 42612.7, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2935062528. Throughput: 0: 43844.9. Samples: 2837941680. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 07:40:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:40:14,061][06909] Updated weights for policy 0, policy_version 179143 (0.0032) [2024-06-28 07:40:17,499][06909] Updated weights for policy 0, policy_version 179153 (0.0031) [2024-06-28 07:40:18,850][06674] Fps is (10 sec: 47512.8, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 2935308288. Throughput: 0: 43901.3. Samples: 2838210420. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 07:40:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:40:21,314][06909] Updated weights for policy 0, policy_version 179163 (0.0040) [2024-06-28 07:40:23,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2935504896. Throughput: 0: 44142.7. Samples: 2838476180. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 07:40:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:40:24,902][06909] Updated weights for policy 0, policy_version 179173 (0.0037) [2024-06-28 07:40:28,553][06909] Updated weights for policy 0, policy_version 179183 (0.0031) [2024-06-28 07:40:28,852][06674] Fps is (10 sec: 42590.1, 60 sec: 43962.2, 300 sec: 44042.1). Total num frames: 2935734272. Throughput: 0: 44005.5. Samples: 2838600780. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 07:40:28,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:40:32,551][06909] Updated weights for policy 0, policy_version 179193 (0.0036) [2024-06-28 07:40:33,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2935963648. Throughput: 0: 43844.1. Samples: 2838863820. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 07:40:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:40:36,167][06909] Updated weights for policy 0, policy_version 179203 (0.0045) [2024-06-28 07:40:38,850][06674] Fps is (10 sec: 42606.9, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 2936160256. Throughput: 0: 44110.2. Samples: 2839134260. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 07:40:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:40:40,104][06909] Updated weights for policy 0, policy_version 179213 (0.0042) [2024-06-28 07:40:43,552][06909] Updated weights for policy 0, policy_version 179223 (0.0029) [2024-06-28 07:40:43,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2936389632. Throughput: 0: 43958.2. Samples: 2839258480. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 07:40:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:40:47,298][06909] Updated weights for policy 0, policy_version 179233 (0.0026) [2024-06-28 07:40:48,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2936619008. Throughput: 0: 43831.3. Samples: 2839526160. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 07:40:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:40:51,132][06909] Updated weights for policy 0, policy_version 179243 (0.0027) [2024-06-28 07:40:52,158][06887] Signal inference workers to stop experience collection... (40350 times) [2024-06-28 07:40:52,201][06909] InferenceWorker_p0-w0: stopping experience collection (40350 times) [2024-06-28 07:40:52,213][06887] Signal inference workers to resume experience collection... (40350 times) [2024-06-28 07:40:52,223][06909] InferenceWorker_p0-w0: resuming experience collection (40350 times) [2024-06-28 07:40:53,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2936832000. Throughput: 0: 43981.3. Samples: 2839790660. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 07:40:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:40:54,790][06909] Updated weights for policy 0, policy_version 179253 (0.0023) [2024-06-28 07:40:58,638][06909] Updated weights for policy 0, policy_version 179263 (0.0029) [2024-06-28 07:40:58,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 2937044992. Throughput: 0: 44036.4. Samples: 2839923320. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 07:40:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:41:01,921][06909] Updated weights for policy 0, policy_version 179273 (0.0026) [2024-06-28 07:41:03,852][06674] Fps is (10 sec: 44227.5, 60 sec: 43964.7, 300 sec: 44097.7). Total num frames: 2937274368. Throughput: 0: 43888.7. Samples: 2840185500. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 07:41:03,853][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:41:06,389][06909] Updated weights for policy 0, policy_version 179283 (0.0032) [2024-06-28 07:41:08,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2937487360. Throughput: 0: 44001.4. Samples: 2840456240. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 07:41:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:41:09,817][06909] Updated weights for policy 0, policy_version 179293 (0.0031) [2024-06-28 07:41:13,728][06909] Updated weights for policy 0, policy_version 179303 (0.0030) [2024-06-28 07:41:13,850][06674] Fps is (10 sec: 42607.3, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 2937700352. Throughput: 0: 44106.9. Samples: 2840585500. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 07:41:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:41:17,199][06909] Updated weights for policy 0, policy_version 179313 (0.0030) [2024-06-28 07:41:18,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.8, 300 sec: 44042.4). Total num frames: 2937929728. Throughput: 0: 44167.2. Samples: 2840851340. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 07:41:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:41:20,922][06909] Updated weights for policy 0, policy_version 179323 (0.0032) [2024-06-28 07:41:23,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.8, 300 sec: 44043.0). Total num frames: 2938159104. Throughput: 0: 44085.4. Samples: 2841118100. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 07:41:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:41:24,325][06909] Updated weights for policy 0, policy_version 179333 (0.0027) [2024-06-28 07:41:28,508][06909] Updated weights for policy 0, policy_version 179343 (0.0028) [2024-06-28 07:41:28,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43965.2, 300 sec: 43986.9). Total num frames: 2938372096. Throughput: 0: 44202.7. Samples: 2841247600. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 07:41:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:41:31,874][06909] Updated weights for policy 0, policy_version 179353 (0.0031) [2024-06-28 07:41:33,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 2938585088. Throughput: 0: 44011.1. Samples: 2841506660. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 07:41:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:41:35,825][06909] Updated weights for policy 0, policy_version 179363 (0.0029) [2024-06-28 07:41:38,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2938798080. Throughput: 0: 44152.5. Samples: 2841777520. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 07:41:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:41:39,296][06909] Updated weights for policy 0, policy_version 179373 (0.0040) [2024-06-28 07:41:43,420][06909] Updated weights for policy 0, policy_version 179383 (0.0030) [2024-06-28 07:41:43,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.7, 300 sec: 43931.6). Total num frames: 2939011072. Throughput: 0: 44034.7. Samples: 2841904880. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 07:41:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:41:46,713][06909] Updated weights for policy 0, policy_version 179393 (0.0032) [2024-06-28 07:41:48,852][06674] Fps is (10 sec: 45865.1, 60 sec: 43962.1, 300 sec: 44042.4). Total num frames: 2939256832. Throughput: 0: 44104.4. Samples: 2842170200. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 07:41:48,853][06674] Avg episode reward: [(0, '0.417')] [2024-06-28 07:41:48,860][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000179398_2939256832.pth... [2024-06-28 07:41:48,914][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000178752_2928672768.pth [2024-06-28 07:41:50,553][06909] Updated weights for policy 0, policy_version 179403 (0.0026) [2024-06-28 07:41:53,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2939453440. Throughput: 0: 44143.1. Samples: 2842442680. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 07:41:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:41:54,358][06909] Updated weights for policy 0, policy_version 179413 (0.0044) [2024-06-28 07:41:58,123][06909] Updated weights for policy 0, policy_version 179423 (0.0038) [2024-06-28 07:41:58,850][06674] Fps is (10 sec: 44245.9, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2939699200. Throughput: 0: 44028.0. Samples: 2842566760. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 07:41:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 07:42:01,506][06909] Updated weights for policy 0, policy_version 179433 (0.0034) [2024-06-28 07:42:02,032][06887] Signal inference workers to stop experience collection... (40400 times) [2024-06-28 07:42:02,032][06887] Signal inference workers to resume experience collection... (40400 times) [2024-06-28 07:42:02,051][06909] InferenceWorker_p0-w0: stopping experience collection (40400 times) [2024-06-28 07:42:02,051][06909] InferenceWorker_p0-w0: resuming experience collection (40400 times) [2024-06-28 07:42:03,850][06674] Fps is (10 sec: 45873.0, 60 sec: 43964.9, 300 sec: 44042.4). Total num frames: 2939912192. Throughput: 0: 43971.9. Samples: 2842830100. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 07:42:03,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:42:05,369][06909] Updated weights for policy 0, policy_version 179443 (0.0031) [2024-06-28 07:42:08,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2940141568. Throughput: 0: 44053.8. Samples: 2843100520. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 07:42:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:42:09,201][06909] Updated weights for policy 0, policy_version 179453 (0.0030) [2024-06-28 07:42:13,105][06909] Updated weights for policy 0, policy_version 179463 (0.0038) [2024-06-28 07:42:13,850][06674] Fps is (10 sec: 44238.8, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2940354560. Throughput: 0: 44150.2. Samples: 2843234360. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 07:42:13,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:42:16,542][06909] Updated weights for policy 0, policy_version 179473 (0.0032) [2024-06-28 07:42:18,850][06674] Fps is (10 sec: 42597.6, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 2940567552. Throughput: 0: 44101.1. Samples: 2843491220. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 07:42:18,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:42:20,170][06909] Updated weights for policy 0, policy_version 179483 (0.0024) [2024-06-28 07:42:23,856][06674] Fps is (10 sec: 44210.3, 60 sec: 43959.3, 300 sec: 44041.5). Total num frames: 2940796928. Throughput: 0: 44121.6. Samples: 2843763260. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 07:42:23,856][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 07:42:24,081][06909] Updated weights for policy 0, policy_version 179493 (0.0033) [2024-06-28 07:42:27,342][06909] Updated weights for policy 0, policy_version 179503 (0.0027) [2024-06-28 07:42:28,850][06674] Fps is (10 sec: 44237.6, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2941009920. Throughput: 0: 44266.2. Samples: 2843896860. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 07:42:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:42:31,513][06909] Updated weights for policy 0, policy_version 179513 (0.0038) [2024-06-28 07:42:33,850][06674] Fps is (10 sec: 44263.2, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2941239296. Throughput: 0: 44062.9. Samples: 2844152940. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 07:42:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:42:35,100][06909] Updated weights for policy 0, policy_version 179523 (0.0026) [2024-06-28 07:42:38,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2941452288. Throughput: 0: 43975.6. Samples: 2844421580. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 07:42:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:42:38,961][06909] Updated weights for policy 0, policy_version 179533 (0.0028) [2024-06-28 07:42:42,495][06909] Updated weights for policy 0, policy_version 179543 (0.0026) [2024-06-28 07:42:43,852][06674] Fps is (10 sec: 42589.9, 60 sec: 44235.3, 300 sec: 44042.1). Total num frames: 2941665280. Throughput: 0: 44126.1. Samples: 2844552520. Policy #0 lag: (min: 0.0, avg: 12.3, max: 24.0) [2024-06-28 07:42:43,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:42:46,293][06909] Updated weights for policy 0, policy_version 179553 (0.0041) [2024-06-28 07:42:48,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43692.2, 300 sec: 43986.9). Total num frames: 2941878272. Throughput: 0: 44053.8. Samples: 2844812500. Policy #0 lag: (min: 0.0, avg: 12.3, max: 24.0) [2024-06-28 07:42:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:42:49,903][06909] Updated weights for policy 0, policy_version 179563 (0.0037) [2024-06-28 07:42:53,850][06674] Fps is (10 sec: 44245.7, 60 sec: 44236.8, 300 sec: 44042.5). Total num frames: 2942107648. Throughput: 0: 43971.5. Samples: 2845079240. Policy #0 lag: (min: 0.0, avg: 12.3, max: 24.0) [2024-06-28 07:42:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:42:53,967][06909] Updated weights for policy 0, policy_version 179573 (0.0030) [2024-06-28 07:42:57,293][06909] Updated weights for policy 0, policy_version 179583 (0.0044) [2024-06-28 07:42:58,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2942320640. Throughput: 0: 43916.9. Samples: 2845210620. Policy #0 lag: (min: 0.0, avg: 12.3, max: 24.0) [2024-06-28 07:42:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:43:01,365][06909] Updated weights for policy 0, policy_version 179593 (0.0033) [2024-06-28 07:43:03,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43964.1, 300 sec: 43986.9). Total num frames: 2942550016. Throughput: 0: 43940.2. Samples: 2845468520. Policy #0 lag: (min: 0.0, avg: 12.3, max: 24.0) [2024-06-28 07:43:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:43:05,056][06909] Updated weights for policy 0, policy_version 179603 (0.0037) [2024-06-28 07:43:08,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2942763008. Throughput: 0: 43961.3. Samples: 2845741260. Policy #0 lag: (min: 0.0, avg: 12.3, max: 24.0) [2024-06-28 07:43:08,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:43:09,133][06909] Updated weights for policy 0, policy_version 179613 (0.0040) [2024-06-28 07:43:12,147][06909] Updated weights for policy 0, policy_version 179623 (0.0035) [2024-06-28 07:43:13,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2942976000. Throughput: 0: 43887.1. Samples: 2845871780. Policy #0 lag: (min: 0.0, avg: 12.3, max: 24.0) [2024-06-28 07:43:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:43:14,103][06887] Signal inference workers to stop experience collection... (40450 times) [2024-06-28 07:43:14,103][06887] Signal inference workers to resume experience collection... (40450 times) [2024-06-28 07:43:14,136][06909] InferenceWorker_p0-w0: stopping experience collection (40450 times) [2024-06-28 07:43:14,136][06909] InferenceWorker_p0-w0: resuming experience collection (40450 times) [2024-06-28 07:43:16,386][06909] Updated weights for policy 0, policy_version 179633 (0.0039) [2024-06-28 07:43:18,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 2943221760. Throughput: 0: 43976.0. Samples: 2846131860. Policy #0 lag: (min: 0.0, avg: 12.3, max: 24.0) [2024-06-28 07:43:18,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:43:19,345][06909] Updated weights for policy 0, policy_version 179643 (0.0031) [2024-06-28 07:43:23,777][06909] Updated weights for policy 0, policy_version 179653 (0.0030) [2024-06-28 07:43:23,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43968.2, 300 sec: 44042.4). Total num frames: 2943434752. Throughput: 0: 44148.0. Samples: 2846408240. Policy #0 lag: (min: 0.0, avg: 12.3, max: 24.0) [2024-06-28 07:43:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:43:27,251][06909] Updated weights for policy 0, policy_version 179663 (0.0036) [2024-06-28 07:43:28,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.7, 300 sec: 43987.1). Total num frames: 2943647744. Throughput: 0: 44070.9. Samples: 2846535620. Policy #0 lag: (min: 0.0, avg: 12.3, max: 24.0) [2024-06-28 07:43:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:43:31,149][06909] Updated weights for policy 0, policy_version 179673 (0.0039) [2024-06-28 07:43:33,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2943877120. Throughput: 0: 44014.7. Samples: 2846793160. Policy #0 lag: (min: 0.0, avg: 12.3, max: 24.0) [2024-06-28 07:43:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:43:34,354][06909] Updated weights for policy 0, policy_version 179683 (0.0027) [2024-06-28 07:43:38,557][06909] Updated weights for policy 0, policy_version 179693 (0.0030) [2024-06-28 07:43:38,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2944090112. Throughput: 0: 44289.4. Samples: 2847072260. Policy #0 lag: (min: 0.0, avg: 12.3, max: 24.0) [2024-06-28 07:43:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:43:41,587][06909] Updated weights for policy 0, policy_version 179703 (0.0036) [2024-06-28 07:43:43,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44238.3, 300 sec: 44042.4). Total num frames: 2944319488. Throughput: 0: 44119.2. Samples: 2847195980. Policy #0 lag: (min: 0.0, avg: 12.3, max: 24.0) [2024-06-28 07:43:43,855][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:43:46,165][06909] Updated weights for policy 0, policy_version 179713 (0.0035) [2024-06-28 07:43:48,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 2944548864. Throughput: 0: 44274.6. Samples: 2847460880. Policy #0 lag: (min: 2.0, avg: 12.1, max: 23.0) [2024-06-28 07:43:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:43:48,976][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000179722_2944565248.pth... [2024-06-28 07:43:49,023][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000179076_2933981184.pth [2024-06-28 07:43:49,167][06909] Updated weights for policy 0, policy_version 179723 (0.0033) [2024-06-28 07:43:53,582][06909] Updated weights for policy 0, policy_version 179733 (0.0032) [2024-06-28 07:43:53,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 2944761856. Throughput: 0: 44164.2. Samples: 2847728640. Policy #0 lag: (min: 2.0, avg: 12.1, max: 23.0) [2024-06-28 07:43:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:43:56,428][06909] Updated weights for policy 0, policy_version 179743 (0.0031) [2024-06-28 07:43:58,850][06674] Fps is (10 sec: 42598.0, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 2944974848. Throughput: 0: 44041.7. Samples: 2847853660. Policy #0 lag: (min: 2.0, avg: 12.1, max: 23.0) [2024-06-28 07:43:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:44:00,879][06909] Updated weights for policy 0, policy_version 179753 (0.0031) [2024-06-28 07:44:03,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44509.8, 300 sec: 44098.0). Total num frames: 2945220608. Throughput: 0: 44198.2. Samples: 2848120780. Policy #0 lag: (min: 2.0, avg: 12.1, max: 23.0) [2024-06-28 07:44:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:44:04,187][06909] Updated weights for policy 0, policy_version 179763 (0.0034) [2024-06-28 07:44:08,150][06909] Updated weights for policy 0, policy_version 179773 (0.0042) [2024-06-28 07:44:08,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 2945433600. Throughput: 0: 43943.1. Samples: 2848385680. Policy #0 lag: (min: 2.0, avg: 12.1, max: 23.0) [2024-06-28 07:44:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 07:44:11,693][06909] Updated weights for policy 0, policy_version 179783 (0.0030) [2024-06-28 07:44:13,850][06674] Fps is (10 sec: 40960.3, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 2945630208. Throughput: 0: 43987.6. Samples: 2848515060. Policy #0 lag: (min: 2.0, avg: 12.1, max: 23.0) [2024-06-28 07:44:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:44:16,158][06909] Updated weights for policy 0, policy_version 179793 (0.0035) [2024-06-28 07:44:18,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2945875968. Throughput: 0: 43945.7. Samples: 2848770720. Policy #0 lag: (min: 2.0, avg: 12.1, max: 23.0) [2024-06-28 07:44:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:44:19,050][06909] Updated weights for policy 0, policy_version 179803 (0.0031) [2024-06-28 07:44:23,524][06909] Updated weights for policy 0, policy_version 179813 (0.0043) [2024-06-28 07:44:23,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 2946056192. Throughput: 0: 43703.5. Samples: 2849038920. Policy #0 lag: (min: 2.0, avg: 12.1, max: 23.0) [2024-06-28 07:44:23,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 07:44:26,576][06909] Updated weights for policy 0, policy_version 179823 (0.0029) [2024-06-28 07:44:28,718][06887] Signal inference workers to stop experience collection... (40500 times) [2024-06-28 07:44:28,771][06887] Signal inference workers to resume experience collection... (40500 times) [2024-06-28 07:44:28,775][06909] InferenceWorker_p0-w0: stopping experience collection (40500 times) [2024-06-28 07:44:28,796][06909] InferenceWorker_p0-w0: resuming experience collection (40500 times) [2024-06-28 07:44:28,850][06674] Fps is (10 sec: 39321.6, 60 sec: 43690.7, 300 sec: 43931.4). Total num frames: 2946269184. Throughput: 0: 43668.0. Samples: 2849161040. Policy #0 lag: (min: 2.0, avg: 12.1, max: 23.0) [2024-06-28 07:44:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:44:31,023][06909] Updated weights for policy 0, policy_version 179833 (0.0032) [2024-06-28 07:44:33,852][06674] Fps is (10 sec: 47503.4, 60 sec: 44235.2, 300 sec: 44097.6). Total num frames: 2946531328. Throughput: 0: 43661.0. Samples: 2849425720. Policy #0 lag: (min: 2.0, avg: 12.1, max: 23.0) [2024-06-28 07:44:33,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:44:34,015][06909] Updated weights for policy 0, policy_version 179843 (0.0031) [2024-06-28 07:44:38,215][06909] Updated weights for policy 0, policy_version 179853 (0.0034) [2024-06-28 07:44:38,852][06674] Fps is (10 sec: 45865.8, 60 sec: 43962.2, 300 sec: 43986.6). Total num frames: 2946727936. Throughput: 0: 43725.0. Samples: 2849696360. Policy #0 lag: (min: 2.0, avg: 12.1, max: 23.0) [2024-06-28 07:44:38,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:44:41,771][06909] Updated weights for policy 0, policy_version 179863 (0.0026) [2024-06-28 07:44:43,850][06674] Fps is (10 sec: 40968.5, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 2946940928. Throughput: 0: 43881.4. Samples: 2849828320. Policy #0 lag: (min: 2.0, avg: 12.1, max: 23.0) [2024-06-28 07:44:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 07:44:45,715][06909] Updated weights for policy 0, policy_version 179873 (0.0029) [2024-06-28 07:44:48,852][06674] Fps is (10 sec: 45874.7, 60 sec: 43962.1, 300 sec: 44042.1). Total num frames: 2947186688. Throughput: 0: 43827.2. Samples: 2850093100. Policy #0 lag: (min: 2.0, avg: 12.1, max: 23.0) [2024-06-28 07:44:48,853][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:44:49,015][06909] Updated weights for policy 0, policy_version 179883 (0.0045) [2024-06-28 07:44:53,400][06909] Updated weights for policy 0, policy_version 179893 (0.0049) [2024-06-28 07:44:53,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 2947383296. Throughput: 0: 43999.0. Samples: 2850365640. Policy #0 lag: (min: 1.0, avg: 8.2, max: 20.0) [2024-06-28 07:44:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:44:56,321][06909] Updated weights for policy 0, policy_version 179903 (0.0034) [2024-06-28 07:44:58,856][06674] Fps is (10 sec: 40944.0, 60 sec: 43686.3, 300 sec: 43930.9). Total num frames: 2947596288. Throughput: 0: 43709.2. Samples: 2850482240. Policy #0 lag: (min: 1.0, avg: 8.2, max: 20.0) [2024-06-28 07:44:58,856][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:45:00,730][06909] Updated weights for policy 0, policy_version 179913 (0.0031) [2024-06-28 07:45:03,841][06909] Updated weights for policy 0, policy_version 179923 (0.0036) [2024-06-28 07:45:03,850][06674] Fps is (10 sec: 47514.0, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 2947858432. Throughput: 0: 44034.2. Samples: 2850752260. Policy #0 lag: (min: 1.0, avg: 8.2, max: 20.0) [2024-06-28 07:45:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:45:08,006][06909] Updated weights for policy 0, policy_version 179933 (0.0030) [2024-06-28 07:45:08,853][06674] Fps is (10 sec: 45886.9, 60 sec: 43688.1, 300 sec: 44041.9). Total num frames: 2948055040. Throughput: 0: 44045.8. Samples: 2851021140. Policy #0 lag: (min: 1.0, avg: 8.2, max: 20.0) [2024-06-28 07:45:08,854][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:45:11,109][06909] Updated weights for policy 0, policy_version 179943 (0.0048) [2024-06-28 07:45:13,850][06674] Fps is (10 sec: 39321.7, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 2948251648. Throughput: 0: 44141.4. Samples: 2851147400. Policy #0 lag: (min: 1.0, avg: 8.2, max: 20.0) [2024-06-28 07:45:13,850][06674] Avg episode reward: [(0, '0.411')] [2024-06-28 07:45:15,246][06909] Updated weights for policy 0, policy_version 179953 (0.0030) [2024-06-28 07:45:18,806][06909] Updated weights for policy 0, policy_version 179963 (0.0029) [2024-06-28 07:45:18,853][06674] Fps is (10 sec: 45876.7, 60 sec: 43961.4, 300 sec: 44097.5). Total num frames: 2948513792. Throughput: 0: 44306.9. Samples: 2851419580. Policy #0 lag: (min: 1.0, avg: 8.2, max: 20.0) [2024-06-28 07:45:18,853][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:45:22,680][06909] Updated weights for policy 0, policy_version 179973 (0.0024) [2024-06-28 07:45:23,850][06674] Fps is (10 sec: 45874.2, 60 sec: 44236.6, 300 sec: 43987.2). Total num frames: 2948710400. Throughput: 0: 44067.2. Samples: 2851679300. Policy #0 lag: (min: 1.0, avg: 8.2, max: 20.0) [2024-06-28 07:45:23,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:45:26,083][06909] Updated weights for policy 0, policy_version 179983 (0.0036) [2024-06-28 07:45:28,850][06674] Fps is (10 sec: 40972.9, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 2948923392. Throughput: 0: 43937.8. Samples: 2851805520. Policy #0 lag: (min: 1.0, avg: 8.2, max: 20.0) [2024-06-28 07:45:28,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:45:30,346][06909] Updated weights for policy 0, policy_version 179993 (0.0029) [2024-06-28 07:45:33,487][06909] Updated weights for policy 0, policy_version 180003 (0.0035) [2024-06-28 07:45:33,850][06674] Fps is (10 sec: 47514.3, 60 sec: 44238.3, 300 sec: 44153.5). Total num frames: 2949185536. Throughput: 0: 44135.0. Samples: 2852079080. Policy #0 lag: (min: 1.0, avg: 8.2, max: 20.0) [2024-06-28 07:45:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:45:37,895][06909] Updated weights for policy 0, policy_version 180013 (0.0029) [2024-06-28 07:45:38,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44238.4, 300 sec: 44042.4). Total num frames: 2949382144. Throughput: 0: 43906.8. Samples: 2852341440. Policy #0 lag: (min: 1.0, avg: 8.2, max: 20.0) [2024-06-28 07:45:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:45:40,808][06909] Updated weights for policy 0, policy_version 180023 (0.0026) [2024-06-28 07:45:43,850][06674] Fps is (10 sec: 39321.4, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 2949578752. Throughput: 0: 44141.9. Samples: 2852468360. Policy #0 lag: (min: 1.0, avg: 8.2, max: 20.0) [2024-06-28 07:45:43,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:45:45,187][06909] Updated weights for policy 0, policy_version 180033 (0.0031) [2024-06-28 07:45:47,520][06887] Signal inference workers to stop experience collection... (40550 times) [2024-06-28 07:45:47,575][06909] InferenceWorker_p0-w0: stopping experience collection (40550 times) [2024-06-28 07:45:47,581][06887] Signal inference workers to resume experience collection... (40550 times) [2024-06-28 07:45:47,596][06909] InferenceWorker_p0-w0: resuming experience collection (40550 times) [2024-06-28 07:45:48,680][06909] Updated weights for policy 0, policy_version 180043 (0.0033) [2024-06-28 07:45:48,852][06674] Fps is (10 sec: 44227.4, 60 sec: 43963.8, 300 sec: 44042.1). Total num frames: 2949824512. Throughput: 0: 44193.1. Samples: 2852741040. Policy #0 lag: (min: 1.0, avg: 8.2, max: 20.0) [2024-06-28 07:45:48,853][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:45:48,940][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000180044_2949840896.pth... [2024-06-28 07:45:49,007][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000179398_2939256832.pth [2024-06-28 07:45:52,387][06909] Updated weights for policy 0, policy_version 180053 (0.0034) [2024-06-28 07:45:53,850][06674] Fps is (10 sec: 45875.7, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 2950037504. Throughput: 0: 43963.9. Samples: 2852999360. Policy #0 lag: (min: 1.0, avg: 8.2, max: 20.0) [2024-06-28 07:45:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:45:56,042][06909] Updated weights for policy 0, policy_version 180063 (0.0034) [2024-06-28 07:45:58,850][06674] Fps is (10 sec: 40968.2, 60 sec: 43968.1, 300 sec: 43931.6). Total num frames: 2950234112. Throughput: 0: 44117.7. Samples: 2853132700. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 07:45:58,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:45:59,778][06909] Updated weights for policy 0, policy_version 180073 (0.0044) [2024-06-28 07:46:03,445][06909] Updated weights for policy 0, policy_version 180083 (0.0024) [2024-06-28 07:46:03,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2950496256. Throughput: 0: 44003.7. Samples: 2853399600. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 07:46:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:46:07,444][06909] Updated weights for policy 0, policy_version 180093 (0.0033) [2024-06-28 07:46:08,850][06674] Fps is (10 sec: 45875.8, 60 sec: 43966.3, 300 sec: 44042.4). Total num frames: 2950692864. Throughput: 0: 44059.8. Samples: 2853661980. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 07:46:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:46:10,805][06909] Updated weights for policy 0, policy_version 180103 (0.0033) [2024-06-28 07:46:13,850][06674] Fps is (10 sec: 39321.5, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 2950889472. Throughput: 0: 44189.5. Samples: 2853794040. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 07:46:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:46:15,049][06909] Updated weights for policy 0, policy_version 180113 (0.0031) [2024-06-28 07:46:18,296][06909] Updated weights for policy 0, policy_version 180123 (0.0045) [2024-06-28 07:46:18,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43966.0, 300 sec: 44042.4). Total num frames: 2951151616. Throughput: 0: 43979.1. Samples: 2854058140. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 07:46:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:46:22,373][06909] Updated weights for policy 0, policy_version 180133 (0.0042) [2024-06-28 07:46:23,850][06674] Fps is (10 sec: 47512.6, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2951364608. Throughput: 0: 43906.9. Samples: 2854317260. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 07:46:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:46:26,017][06909] Updated weights for policy 0, policy_version 180143 (0.0038) [2024-06-28 07:46:28,850][06674] Fps is (10 sec: 42598.7, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2951577600. Throughput: 0: 44053.4. Samples: 2854450760. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 07:46:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:46:29,598][06909] Updated weights for policy 0, policy_version 180153 (0.0033) [2024-06-28 07:46:33,356][06909] Updated weights for policy 0, policy_version 180163 (0.0039) [2024-06-28 07:46:33,852][06674] Fps is (10 sec: 44228.2, 60 sec: 43689.2, 300 sec: 44097.6). Total num frames: 2951806976. Throughput: 0: 44067.6. Samples: 2854724080. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 07:46:33,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:46:37,017][06909] Updated weights for policy 0, policy_version 180173 (0.0035) [2024-06-28 07:46:38,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.6, 300 sec: 44097.9). Total num frames: 2952019968. Throughput: 0: 44019.9. Samples: 2854980260. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 07:46:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:46:40,708][06909] Updated weights for policy 0, policy_version 180183 (0.0030) [2024-06-28 07:46:43,850][06674] Fps is (10 sec: 42607.1, 60 sec: 44236.8, 300 sec: 43987.2). Total num frames: 2952232960. Throughput: 0: 43986.7. Samples: 2855112100. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 07:46:43,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:46:44,870][06909] Updated weights for policy 0, policy_version 180193 (0.0030) [2024-06-28 07:46:47,927][06909] Updated weights for policy 0, policy_version 180203 (0.0031) [2024-06-28 07:46:48,850][06674] Fps is (10 sec: 45875.8, 60 sec: 44238.4, 300 sec: 44153.5). Total num frames: 2952478720. Throughput: 0: 44133.7. Samples: 2855385620. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 07:46:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:46:52,226][06909] Updated weights for policy 0, policy_version 180213 (0.0035) [2024-06-28 07:46:53,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2952675328. Throughput: 0: 43970.1. Samples: 2855640640. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 07:46:53,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 07:46:55,866][06909] Updated weights for policy 0, policy_version 180223 (0.0021) [2024-06-28 07:46:56,035][06887] Signal inference workers to stop experience collection... (40600 times) [2024-06-28 07:46:56,081][06909] InferenceWorker_p0-w0: stopping experience collection (40600 times) [2024-06-28 07:46:56,091][06887] Signal inference workers to resume experience collection... (40600 times) [2024-06-28 07:46:56,092][06909] InferenceWorker_p0-w0: resuming experience collection (40600 times) [2024-06-28 07:46:58,850][06674] Fps is (10 sec: 40959.7, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2952888320. Throughput: 0: 43920.8. Samples: 2855770480. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 07:46:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 07:46:59,407][06909] Updated weights for policy 0, policy_version 180233 (0.0024) [2024-06-28 07:47:03,342][06909] Updated weights for policy 0, policy_version 180243 (0.0040) [2024-06-28 07:47:03,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 2953117696. Throughput: 0: 44068.1. Samples: 2856041200. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 07:47:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 07:47:06,661][06909] Updated weights for policy 0, policy_version 180253 (0.0029) [2024-06-28 07:47:08,852][06674] Fps is (10 sec: 45866.1, 60 sec: 44235.3, 300 sec: 44042.1). Total num frames: 2953347072. Throughput: 0: 44246.6. Samples: 2856308440. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 07:47:08,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:47:10,764][06909] Updated weights for policy 0, policy_version 180263 (0.0036) [2024-06-28 07:47:13,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 2953560064. Throughput: 0: 44161.3. Samples: 2856438020. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 07:47:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:47:14,226][06909] Updated weights for policy 0, policy_version 180273 (0.0037) [2024-06-28 07:47:17,894][06909] Updated weights for policy 0, policy_version 180283 (0.0030) [2024-06-28 07:47:18,850][06674] Fps is (10 sec: 42606.9, 60 sec: 43690.7, 300 sec: 43987.8). Total num frames: 2953773056. Throughput: 0: 44026.0. Samples: 2856705160. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 07:47:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:47:22,135][06909] Updated weights for policy 0, policy_version 180293 (0.0042) [2024-06-28 07:47:23,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2954002432. Throughput: 0: 44049.8. Samples: 2856962500. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 07:47:23,859][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:47:25,498][06909] Updated weights for policy 0, policy_version 180303 (0.0027) [2024-06-28 07:47:28,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43690.7, 300 sec: 43931.4). Total num frames: 2954199040. Throughput: 0: 44057.5. Samples: 2857094680. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 07:47:28,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:47:29,307][06909] Updated weights for policy 0, policy_version 180313 (0.0033) [2024-06-28 07:47:33,278][06909] Updated weights for policy 0, policy_version 180323 (0.0029) [2024-06-28 07:47:33,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43692.2, 300 sec: 43986.9). Total num frames: 2954428416. Throughput: 0: 43679.5. Samples: 2857351200. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 07:47:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:47:36,630][06909] Updated weights for policy 0, policy_version 180333 (0.0030) [2024-06-28 07:47:38,852][06674] Fps is (10 sec: 45865.5, 60 sec: 43962.3, 300 sec: 44042.4). Total num frames: 2954657792. Throughput: 0: 43823.8. Samples: 2857612800. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 07:47:38,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:47:40,881][06909] Updated weights for policy 0, policy_version 180343 (0.0033) [2024-06-28 07:47:43,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2954887168. Throughput: 0: 44003.6. Samples: 2857750640. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 07:47:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 07:47:43,908][06909] Updated weights for policy 0, policy_version 180353 (0.0028) [2024-06-28 07:47:48,275][06909] Updated weights for policy 0, policy_version 180363 (0.0027) [2024-06-28 07:47:48,850][06674] Fps is (10 sec: 42607.2, 60 sec: 43417.6, 300 sec: 43986.9). Total num frames: 2955083776. Throughput: 0: 43851.6. Samples: 2858014520. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 07:47:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:47:48,877][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000180365_2955100160.pth... [2024-06-28 07:47:48,923][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000179722_2944565248.pth [2024-06-28 07:47:51,709][06909] Updated weights for policy 0, policy_version 180373 (0.0027) [2024-06-28 07:47:53,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2955329536. Throughput: 0: 43769.1. Samples: 2858277960. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 07:47:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:47:55,456][06909] Updated weights for policy 0, policy_version 180383 (0.0021) [2024-06-28 07:47:58,850][06674] Fps is (10 sec: 45874.5, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2955542528. Throughput: 0: 44021.2. Samples: 2858418980. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 07:47:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:47:59,224][06909] Updated weights for policy 0, policy_version 180393 (0.0027) [2024-06-28 07:48:02,637][06909] Updated weights for policy 0, policy_version 180403 (0.0027) [2024-06-28 07:48:03,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2955739136. Throughput: 0: 43791.2. Samples: 2858675760. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 07:48:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:48:06,790][06909] Updated weights for policy 0, policy_version 180413 (0.0045) [2024-06-28 07:48:08,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43965.3, 300 sec: 44098.0). Total num frames: 2955984896. Throughput: 0: 43894.4. Samples: 2858937740. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 07:48:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:48:10,759][06909] Updated weights for policy 0, policy_version 180423 (0.0033) [2024-06-28 07:48:13,852][06674] Fps is (10 sec: 44227.4, 60 sec: 43689.2, 300 sec: 43931.0). Total num frames: 2956181504. Throughput: 0: 43874.8. Samples: 2859069140. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-28 07:48:13,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:48:14,127][06909] Updated weights for policy 0, policy_version 180433 (0.0023) [2024-06-28 07:48:14,633][06887] Signal inference workers to stop experience collection... (40650 times) [2024-06-28 07:48:14,635][06887] Signal inference workers to resume experience collection... (40650 times) [2024-06-28 07:48:14,648][06909] InferenceWorker_p0-w0: stopping experience collection (40650 times) [2024-06-28 07:48:14,674][06909] InferenceWorker_p0-w0: resuming experience collection (40650 times) [2024-06-28 07:48:18,115][06909] Updated weights for policy 0, policy_version 180443 (0.0031) [2024-06-28 07:48:18,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2956410880. Throughput: 0: 44095.6. Samples: 2859335500. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-28 07:48:18,850][06674] Avg episode reward: [(0, '0.462')] [2024-06-28 07:48:21,362][06909] Updated weights for policy 0, policy_version 180453 (0.0027) [2024-06-28 07:48:23,850][06674] Fps is (10 sec: 47523.2, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2956656640. Throughput: 0: 44020.2. Samples: 2859593620. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-28 07:48:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:48:25,313][06909] Updated weights for policy 0, policy_version 180463 (0.0039) [2024-06-28 07:48:28,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 2956853248. Throughput: 0: 44016.4. Samples: 2859731380. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-28 07:48:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:48:28,945][06909] Updated weights for policy 0, policy_version 180473 (0.0020) [2024-06-28 07:48:32,575][06909] Updated weights for policy 0, policy_version 180483 (0.0038) [2024-06-28 07:48:33,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2957066240. Throughput: 0: 44085.3. Samples: 2859998360. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-28 07:48:33,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:48:36,231][06909] Updated weights for policy 0, policy_version 180493 (0.0038) [2024-06-28 07:48:38,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44238.3, 300 sec: 44042.4). Total num frames: 2957312000. Throughput: 0: 44118.1. Samples: 2860263280. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-28 07:48:38,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:48:40,077][06909] Updated weights for policy 0, policy_version 180503 (0.0039) [2024-06-28 07:48:43,582][06909] Updated weights for policy 0, policy_version 180513 (0.0029) [2024-06-28 07:48:43,855][06674] Fps is (10 sec: 45851.7, 60 sec: 43960.0, 300 sec: 43986.1). Total num frames: 2957524992. Throughput: 0: 43980.0. Samples: 2860398300. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-28 07:48:43,855][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:48:47,601][06909] Updated weights for policy 0, policy_version 180523 (0.0030) [2024-06-28 07:48:48,850][06674] Fps is (10 sec: 42598.7, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2957737984. Throughput: 0: 44240.4. Samples: 2860666580. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-28 07:48:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:48:51,087][06909] Updated weights for policy 0, policy_version 180533 (0.0043) [2024-06-28 07:48:53,850][06674] Fps is (10 sec: 44259.3, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2957967360. Throughput: 0: 44247.9. Samples: 2860928900. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-28 07:48:53,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 07:48:55,085][06909] Updated weights for policy 0, policy_version 180543 (0.0035) [2024-06-28 07:48:58,253][06909] Updated weights for policy 0, policy_version 180553 (0.0029) [2024-06-28 07:48:58,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 2958196736. Throughput: 0: 44419.4. Samples: 2861067920. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-28 07:48:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:49:02,222][06909] Updated weights for policy 0, policy_version 180563 (0.0029) [2024-06-28 07:49:03,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 2958376960. Throughput: 0: 44282.7. Samples: 2861328220. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-28 07:49:03,850][06674] Avg episode reward: [(0, '0.449')] [2024-06-28 07:49:05,981][06909] Updated weights for policy 0, policy_version 180573 (0.0031) [2024-06-28 07:49:08,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2958639104. Throughput: 0: 44397.9. Samples: 2861591520. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-28 07:49:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:49:09,485][06909] Updated weights for policy 0, policy_version 180583 (0.0030) [2024-06-28 07:49:13,182][06909] Updated weights for policy 0, policy_version 180593 (0.0033) [2024-06-28 07:49:13,850][06674] Fps is (10 sec: 47514.0, 60 sec: 44511.4, 300 sec: 43986.9). Total num frames: 2958852096. Throughput: 0: 44382.3. Samples: 2861728580. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-28 07:49:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:49:17,173][06909] Updated weights for policy 0, policy_version 180603 (0.0031) [2024-06-28 07:49:18,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2959048704. Throughput: 0: 44194.8. Samples: 2861987120. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 07:49:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:49:20,779][06909] Updated weights for policy 0, policy_version 180613 (0.0046) [2024-06-28 07:49:23,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43690.7, 300 sec: 44097.9). Total num frames: 2959278080. Throughput: 0: 44171.1. Samples: 2862250980. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 07:49:23,850][06674] Avg episode reward: [(0, '0.460')] [2024-06-28 07:49:24,643][06909] Updated weights for policy 0, policy_version 180623 (0.0031) [2024-06-28 07:49:28,105][06909] Updated weights for policy 0, policy_version 180633 (0.0029) [2024-06-28 07:49:28,850][06674] Fps is (10 sec: 49151.5, 60 sec: 44782.9, 300 sec: 44098.3). Total num frames: 2959540224. Throughput: 0: 44333.5. Samples: 2862393080. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 07:49:28,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 07:49:32,109][06909] Updated weights for policy 0, policy_version 180643 (0.0047) [2024-06-28 07:49:33,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.8, 300 sec: 43987.2). Total num frames: 2959704064. Throughput: 0: 44072.9. Samples: 2862649860. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 07:49:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:49:35,524][06909] Updated weights for policy 0, policy_version 180653 (0.0029) [2024-06-28 07:49:38,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2959949824. Throughput: 0: 44159.6. Samples: 2862916080. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 07:49:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:49:39,316][06909] Updated weights for policy 0, policy_version 180663 (0.0032) [2024-06-28 07:49:43,125][06909] Updated weights for policy 0, policy_version 180673 (0.0025) [2024-06-28 07:49:43,851][06674] Fps is (10 sec: 49146.3, 60 sec: 44512.8, 300 sec: 44098.1). Total num frames: 2960195584. Throughput: 0: 44102.9. Samples: 2863052600. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 07:49:43,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:49:46,567][06909] Updated weights for policy 0, policy_version 180683 (0.0032) [2024-06-28 07:49:48,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2960375808. Throughput: 0: 43965.8. Samples: 2863306680. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 07:49:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:49:48,975][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000180688_2960392192.pth... [2024-06-28 07:49:49,022][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000180044_2949840896.pth [2024-06-28 07:49:49,673][06887] Signal inference workers to stop experience collection... (40700 times) [2024-06-28 07:49:49,700][06909] InferenceWorker_p0-w0: stopping experience collection (40700 times) [2024-06-28 07:49:49,736][06887] Signal inference workers to resume experience collection... (40700 times) [2024-06-28 07:49:49,736][06909] InferenceWorker_p0-w0: resuming experience collection (40700 times) [2024-06-28 07:49:50,635][06909] Updated weights for policy 0, policy_version 180693 (0.0032) [2024-06-28 07:49:53,850][06674] Fps is (10 sec: 39325.9, 60 sec: 43690.7, 300 sec: 44043.3). Total num frames: 2960588800. Throughput: 0: 43906.1. Samples: 2863567300. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 07:49:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:49:54,453][06909] Updated weights for policy 0, policy_version 180703 (0.0027) [2024-06-28 07:49:58,133][06909] Updated weights for policy 0, policy_version 180713 (0.0039) [2024-06-28 07:49:58,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2960834560. Throughput: 0: 43811.1. Samples: 2863700080. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 07:49:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:50:01,786][06909] Updated weights for policy 0, policy_version 180723 (0.0021) [2024-06-28 07:50:03,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.8, 300 sec: 43931.9). Total num frames: 2961014784. Throughput: 0: 43871.5. Samples: 2863961340. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 07:50:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:50:05,639][06909] Updated weights for policy 0, policy_version 180733 (0.0028) [2024-06-28 07:50:08,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.6, 300 sec: 44097.9). Total num frames: 2961260544. Throughput: 0: 43973.8. Samples: 2864229800. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 07:50:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:50:09,173][06909] Updated weights for policy 0, policy_version 180743 (0.0030) [2024-06-28 07:50:12,839][06909] Updated weights for policy 0, policy_version 180753 (0.0026) [2024-06-28 07:50:13,850][06674] Fps is (10 sec: 47513.5, 60 sec: 43963.7, 300 sec: 43987.4). Total num frames: 2961489920. Throughput: 0: 43886.3. Samples: 2864367960. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 07:50:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 07:50:16,560][06909] Updated weights for policy 0, policy_version 180763 (0.0027) [2024-06-28 07:50:18,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43690.6, 300 sec: 43931.4). Total num frames: 2961670144. Throughput: 0: 43786.6. Samples: 2864620260. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 07:50:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:50:20,722][06909] Updated weights for policy 0, policy_version 180773 (0.0030) [2024-06-28 07:50:23,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 2961932288. Throughput: 0: 43819.1. Samples: 2864887940. Policy #0 lag: (min: 1.0, avg: 10.9, max: 23.0) [2024-06-28 07:50:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:50:23,877][06909] Updated weights for policy 0, policy_version 180783 (0.0040) [2024-06-28 07:50:28,264][06909] Updated weights for policy 0, policy_version 180793 (0.0035) [2024-06-28 07:50:28,850][06674] Fps is (10 sec: 47512.8, 60 sec: 43417.5, 300 sec: 43931.3). Total num frames: 2962145280. Throughput: 0: 43804.9. Samples: 2865023780. Policy #0 lag: (min: 1.0, avg: 10.9, max: 23.0) [2024-06-28 07:50:28,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:50:31,615][06909] Updated weights for policy 0, policy_version 180803 (0.0030) [2024-06-28 07:50:33,850][06674] Fps is (10 sec: 42597.9, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 2962358272. Throughput: 0: 43806.6. Samples: 2865277980. Policy #0 lag: (min: 1.0, avg: 10.9, max: 23.0) [2024-06-28 07:50:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:50:35,509][06909] Updated weights for policy 0, policy_version 180813 (0.0033) [2024-06-28 07:50:38,856][06674] Fps is (10 sec: 44210.4, 60 sec: 43959.3, 300 sec: 44097.0). Total num frames: 2962587648. Throughput: 0: 44006.0. Samples: 2865547840. Policy #0 lag: (min: 1.0, avg: 10.9, max: 23.0) [2024-06-28 07:50:38,857][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:50:38,995][06909] Updated weights for policy 0, policy_version 180823 (0.0019) [2024-06-28 07:50:42,840][06909] Updated weights for policy 0, policy_version 180833 (0.0034) [2024-06-28 07:50:43,851][06674] Fps is (10 sec: 45869.5, 60 sec: 43690.5, 300 sec: 44042.5). Total num frames: 2962817024. Throughput: 0: 43958.7. Samples: 2865678280. Policy #0 lag: (min: 1.0, avg: 10.9, max: 23.0) [2024-06-28 07:50:43,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:50:46,518][06909] Updated weights for policy 0, policy_version 180843 (0.0032) [2024-06-28 07:50:48,850][06674] Fps is (10 sec: 42624.7, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2963013632. Throughput: 0: 44020.9. Samples: 2865942280. Policy #0 lag: (min: 1.0, avg: 10.9, max: 23.0) [2024-06-28 07:50:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:50:50,065][06909] Updated weights for policy 0, policy_version 180853 (0.0036) [2024-06-28 07:50:53,753][06909] Updated weights for policy 0, policy_version 180863 (0.0030) [2024-06-28 07:50:53,850][06674] Fps is (10 sec: 44242.6, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 2963259392. Throughput: 0: 43896.9. Samples: 2866205160. Policy #0 lag: (min: 1.0, avg: 10.9, max: 23.0) [2024-06-28 07:50:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:50:57,725][06909] Updated weights for policy 0, policy_version 180873 (0.0025) [2024-06-28 07:50:58,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2963472384. Throughput: 0: 43930.6. Samples: 2866344840. Policy #0 lag: (min: 1.0, avg: 10.9, max: 23.0) [2024-06-28 07:50:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:50:59,717][06887] Signal inference workers to stop experience collection... (40750 times) [2024-06-28 07:50:59,718][06887] Signal inference workers to resume experience collection... (40750 times) [2024-06-28 07:50:59,730][06909] InferenceWorker_p0-w0: stopping experience collection (40750 times) [2024-06-28 07:50:59,730][06909] InferenceWorker_p0-w0: resuming experience collection (40750 times) [2024-06-28 07:51:01,321][06909] Updated weights for policy 0, policy_version 180883 (0.0029) [2024-06-28 07:51:03,850][06674] Fps is (10 sec: 40960.3, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2963668992. Throughput: 0: 44136.9. Samples: 2866606420. Policy #0 lag: (min: 1.0, avg: 10.9, max: 23.0) [2024-06-28 07:51:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:51:05,162][06909] Updated weights for policy 0, policy_version 180893 (0.0031) [2024-06-28 07:51:08,651][06909] Updated weights for policy 0, policy_version 180903 (0.0026) [2024-06-28 07:51:08,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2963914752. Throughput: 0: 44138.6. Samples: 2866874180. Policy #0 lag: (min: 1.0, avg: 10.9, max: 23.0) [2024-06-28 07:51:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:51:12,486][06909] Updated weights for policy 0, policy_version 180913 (0.0035) [2024-06-28 07:51:13,853][06674] Fps is (10 sec: 47498.7, 60 sec: 44234.5, 300 sec: 44042.0). Total num frames: 2964144128. Throughput: 0: 44104.7. Samples: 2867008620. Policy #0 lag: (min: 1.0, avg: 10.9, max: 23.0) [2024-06-28 07:51:13,854][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:51:15,923][06909] Updated weights for policy 0, policy_version 180923 (0.0036) [2024-06-28 07:51:18,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44509.9, 300 sec: 43986.9). Total num frames: 2964340736. Throughput: 0: 44466.3. Samples: 2867278960. Policy #0 lag: (min: 1.0, avg: 10.9, max: 23.0) [2024-06-28 07:51:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:51:19,681][06909] Updated weights for policy 0, policy_version 180933 (0.0030) [2024-06-28 07:51:23,310][06909] Updated weights for policy 0, policy_version 180943 (0.0025) [2024-06-28 07:51:23,850][06674] Fps is (10 sec: 44250.3, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2964586496. Throughput: 0: 44298.4. Samples: 2867541000. Policy #0 lag: (min: 1.0, avg: 10.9, max: 23.0) [2024-06-28 07:51:23,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:51:27,283][06909] Updated weights for policy 0, policy_version 180953 (0.0047) [2024-06-28 07:51:28,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44236.9, 300 sec: 44042.7). Total num frames: 2964799488. Throughput: 0: 44301.2. Samples: 2867671780. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 07:51:28,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 07:51:30,723][06909] Updated weights for policy 0, policy_version 180963 (0.0025) [2024-06-28 07:51:33,850][06674] Fps is (10 sec: 42597.9, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2965012480. Throughput: 0: 44351.8. Samples: 2867938120. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 07:51:33,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:51:34,686][06909] Updated weights for policy 0, policy_version 180973 (0.0032) [2024-06-28 07:51:38,113][06909] Updated weights for policy 0, policy_version 180983 (0.0032) [2024-06-28 07:51:38,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44241.3, 300 sec: 44097.9). Total num frames: 2965241856. Throughput: 0: 44219.5. Samples: 2868195040. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 07:51:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:51:42,171][06909] Updated weights for policy 0, policy_version 180993 (0.0025) [2024-06-28 07:51:43,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43964.6, 300 sec: 43986.9). Total num frames: 2965454848. Throughput: 0: 44120.4. Samples: 2868330260. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 07:51:43,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:51:45,927][06909] Updated weights for policy 0, policy_version 181003 (0.0029) [2024-06-28 07:51:48,850][06674] Fps is (10 sec: 42599.0, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2965667840. Throughput: 0: 44251.1. Samples: 2868597720. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 07:51:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:51:48,863][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000181010_2965667840.pth... [2024-06-28 07:51:48,915][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000180365_2955100160.pth [2024-06-28 07:51:49,637][06909] Updated weights for policy 0, policy_version 181013 (0.0032) [2024-06-28 07:51:53,193][06909] Updated weights for policy 0, policy_version 181023 (0.0040) [2024-06-28 07:51:53,854][06674] Fps is (10 sec: 44219.9, 60 sec: 43960.9, 300 sec: 44097.4). Total num frames: 2965897216. Throughput: 0: 44034.4. Samples: 2868855900. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 07:51:53,854][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:51:56,820][06909] Updated weights for policy 0, policy_version 181033 (0.0026) [2024-06-28 07:51:58,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2966110208. Throughput: 0: 44114.1. Samples: 2868993620. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 07:51:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:52:00,439][06909] Updated weights for policy 0, policy_version 181043 (0.0024) [2024-06-28 07:52:03,850][06674] Fps is (10 sec: 44254.2, 60 sec: 44509.8, 300 sec: 44042.7). Total num frames: 2966339584. Throughput: 0: 44004.9. Samples: 2869259180. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 07:52:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:52:04,453][06909] Updated weights for policy 0, policy_version 181053 (0.0026) [2024-06-28 07:52:07,668][06909] Updated weights for policy 0, policy_version 181063 (0.0038) [2024-06-28 07:52:08,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2966552576. Throughput: 0: 43915.5. Samples: 2869517200. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 07:52:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:52:11,837][06909] Updated weights for policy 0, policy_version 181073 (0.0037) [2024-06-28 07:52:13,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43692.9, 300 sec: 44042.4). Total num frames: 2966765568. Throughput: 0: 43899.2. Samples: 2869647240. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 07:52:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:52:15,423][06909] Updated weights for policy 0, policy_version 181083 (0.0034) [2024-06-28 07:52:18,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44509.8, 300 sec: 44097.9). Total num frames: 2967011328. Throughput: 0: 44021.4. Samples: 2869919080. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 07:52:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:52:19,203][06909] Updated weights for policy 0, policy_version 181093 (0.0026) [2024-06-28 07:52:22,829][06909] Updated weights for policy 0, policy_version 181103 (0.0022) [2024-06-28 07:52:23,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 2967224320. Throughput: 0: 44058.3. Samples: 2870177660. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 07:52:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:52:26,977][06909] Updated weights for policy 0, policy_version 181113 (0.0035) [2024-06-28 07:52:28,850][06674] Fps is (10 sec: 40960.6, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 2967420928. Throughput: 0: 43987.2. Samples: 2870309680. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 07:52:28,859][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:52:30,306][06909] Updated weights for policy 0, policy_version 181123 (0.0043) [2024-06-28 07:52:33,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.9, 300 sec: 44098.3). Total num frames: 2967666688. Throughput: 0: 43884.8. Samples: 2870572540. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-28 07:52:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:52:34,406][06909] Updated weights for policy 0, policy_version 181133 (0.0027) [2024-06-28 07:52:37,918][06909] Updated weights for policy 0, policy_version 181143 (0.0025) [2024-06-28 07:52:38,850][06674] Fps is (10 sec: 44235.5, 60 sec: 43690.5, 300 sec: 43986.8). Total num frames: 2967863296. Throughput: 0: 43969.4. Samples: 2870834360. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-28 07:52:38,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:52:41,797][06909] Updated weights for policy 0, policy_version 181153 (0.0037) [2024-06-28 07:52:43,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 2968092672. Throughput: 0: 43860.5. Samples: 2870967340. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-28 07:52:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:52:44,656][06887] Signal inference workers to stop experience collection... (40800 times) [2024-06-28 07:52:44,656][06887] Signal inference workers to resume experience collection... (40800 times) [2024-06-28 07:52:44,700][06909] InferenceWorker_p0-w0: stopping experience collection (40800 times) [2024-06-28 07:52:44,700][06909] InferenceWorker_p0-w0: resuming experience collection (40800 times) [2024-06-28 07:52:45,169][06909] Updated weights for policy 0, policy_version 181163 (0.0040) [2024-06-28 07:52:48,850][06674] Fps is (10 sec: 45876.5, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2968322048. Throughput: 0: 43944.9. Samples: 2871236700. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-28 07:52:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:52:49,437][06909] Updated weights for policy 0, policy_version 181173 (0.0032) [2024-06-28 07:52:52,869][06909] Updated weights for policy 0, policy_version 181183 (0.0037) [2024-06-28 07:52:53,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43966.6, 300 sec: 44042.4). Total num frames: 2968535040. Throughput: 0: 43922.3. Samples: 2871493700. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-28 07:52:53,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:52:56,602][06909] Updated weights for policy 0, policy_version 181193 (0.0038) [2024-06-28 07:52:58,855][06674] Fps is (10 sec: 42576.6, 60 sec: 43960.1, 300 sec: 44097.2). Total num frames: 2968748032. Throughput: 0: 44097.6. Samples: 2871631860. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-28 07:52:58,856][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:53:00,051][06909] Updated weights for policy 0, policy_version 181203 (0.0029) [2024-06-28 07:53:03,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2968977408. Throughput: 0: 43910.8. Samples: 2871895060. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-28 07:53:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 07:53:04,014][06909] Updated weights for policy 0, policy_version 181213 (0.0026) [2024-06-28 07:53:07,298][06909] Updated weights for policy 0, policy_version 181223 (0.0029) [2024-06-28 07:53:08,850][06674] Fps is (10 sec: 44259.2, 60 sec: 43963.8, 300 sec: 44098.3). Total num frames: 2969190400. Throughput: 0: 44149.3. Samples: 2872164380. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-28 07:53:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:53:11,479][06909] Updated weights for policy 0, policy_version 181233 (0.0022) [2024-06-28 07:53:13,852][06674] Fps is (10 sec: 44227.8, 60 sec: 44235.3, 300 sec: 44097.6). Total num frames: 2969419776. Throughput: 0: 44138.8. Samples: 2872296020. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-28 07:53:13,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:53:14,952][06909] Updated weights for policy 0, policy_version 181243 (0.0034) [2024-06-28 07:53:18,746][06909] Updated weights for policy 0, policy_version 181253 (0.0026) [2024-06-28 07:53:18,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2969649152. Throughput: 0: 44155.9. Samples: 2872559560. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-28 07:53:18,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:53:22,399][06909] Updated weights for policy 0, policy_version 181263 (0.0046) [2024-06-28 07:53:23,850][06674] Fps is (10 sec: 42606.6, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2969845760. Throughput: 0: 44187.2. Samples: 2872822780. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-28 07:53:23,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:53:26,429][06909] Updated weights for policy 0, policy_version 181273 (0.0040) [2024-06-28 07:53:28,852][06674] Fps is (10 sec: 42590.3, 60 sec: 44235.3, 300 sec: 44097.6). Total num frames: 2970075136. Throughput: 0: 44060.7. Samples: 2872950160. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-28 07:53:28,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:53:29,862][06909] Updated weights for policy 0, policy_version 181283 (0.0027) [2024-06-28 07:53:33,830][06909] Updated weights for policy 0, policy_version 181293 (0.0028) [2024-06-28 07:53:33,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2970304512. Throughput: 0: 44032.8. Samples: 2873218180. Policy #0 lag: (min: 0.0, avg: 9.1, max: 20.0) [2024-06-28 07:53:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:53:37,179][06909] Updated weights for policy 0, policy_version 181303 (0.0033) [2024-06-28 07:53:38,851][06674] Fps is (10 sec: 42602.4, 60 sec: 43963.1, 300 sec: 43987.5). Total num frames: 2970501120. Throughput: 0: 44215.9. Samples: 2873483460. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 07:53:38,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:53:41,301][06909] Updated weights for policy 0, policy_version 181313 (0.0041) [2024-06-28 07:53:43,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2970730496. Throughput: 0: 44010.4. Samples: 2873612100. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 07:53:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:53:44,343][06909] Updated weights for policy 0, policy_version 181323 (0.0024) [2024-06-28 07:53:48,613][06909] Updated weights for policy 0, policy_version 181333 (0.0033) [2024-06-28 07:53:48,850][06674] Fps is (10 sec: 45880.8, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2970959872. Throughput: 0: 44077.9. Samples: 2873878560. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 07:53:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:53:48,934][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000181334_2970976256.pth... [2024-06-28 07:53:48,977][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000180688_2960392192.pth [2024-06-28 07:53:52,217][06909] Updated weights for policy 0, policy_version 181343 (0.0037) [2024-06-28 07:53:53,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2971172864. Throughput: 0: 43961.9. Samples: 2874142660. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 07:53:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:53:56,039][06909] Updated weights for policy 0, policy_version 181353 (0.0033) [2024-06-28 07:53:58,850][06674] Fps is (10 sec: 45874.1, 60 sec: 44513.6, 300 sec: 44209.0). Total num frames: 2971418624. Throughput: 0: 44055.2. Samples: 2874278420. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 07:53:58,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:53:59,830][06909] Updated weights for policy 0, policy_version 181363 (0.0037) [2024-06-28 07:54:03,373][06909] Updated weights for policy 0, policy_version 181373 (0.0035) [2024-06-28 07:54:03,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2971615232. Throughput: 0: 43958.9. Samples: 2874537700. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 07:54:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:54:07,213][06909] Updated weights for policy 0, policy_version 181383 (0.0028) [2024-06-28 07:54:08,850][06674] Fps is (10 sec: 42598.8, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2971844608. Throughput: 0: 44019.7. Samples: 2874803660. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 07:54:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:54:11,132][06909] Updated weights for policy 0, policy_version 181393 (0.0035) [2024-06-28 07:54:13,561][06887] Signal inference workers to stop experience collection... (40850 times) [2024-06-28 07:54:13,562][06887] Signal inference workers to resume experience collection... (40850 times) [2024-06-28 07:54:13,597][06909] InferenceWorker_p0-w0: stopping experience collection (40850 times) [2024-06-28 07:54:13,597][06909] InferenceWorker_p0-w0: resuming experience collection (40850 times) [2024-06-28 07:54:13,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43965.2, 300 sec: 44097.9). Total num frames: 2972057600. Throughput: 0: 43996.2. Samples: 2874929900. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 07:54:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:54:14,734][06909] Updated weights for policy 0, policy_version 181403 (0.0024) [2024-06-28 07:54:18,408][06909] Updated weights for policy 0, policy_version 181413 (0.0040) [2024-06-28 07:54:18,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.8, 300 sec: 44042.4). Total num frames: 2972270592. Throughput: 0: 44096.1. Samples: 2875202500. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 07:54:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:54:22,182][06909] Updated weights for policy 0, policy_version 181423 (0.0027) [2024-06-28 07:54:23,850][06674] Fps is (10 sec: 44237.3, 60 sec: 44236.9, 300 sec: 43931.3). Total num frames: 2972499968. Throughput: 0: 44081.6. Samples: 2875467080. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 07:54:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:54:25,651][06909] Updated weights for policy 0, policy_version 181433 (0.0030) [2024-06-28 07:54:28,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44238.3, 300 sec: 44153.5). Total num frames: 2972729344. Throughput: 0: 44050.6. Samples: 2875594380. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 07:54:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:54:29,351][06909] Updated weights for policy 0, policy_version 181443 (0.0029) [2024-06-28 07:54:33,239][06909] Updated weights for policy 0, policy_version 181453 (0.0033) [2024-06-28 07:54:33,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2972942336. Throughput: 0: 44091.4. Samples: 2875862680. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 07:54:33,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:54:36,993][06909] Updated weights for policy 0, policy_version 181463 (0.0037) [2024-06-28 07:54:38,856][06674] Fps is (10 sec: 42572.9, 60 sec: 44233.2, 300 sec: 43930.6). Total num frames: 2973155328. Throughput: 0: 44045.1. Samples: 2876124960. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 07:54:38,856][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:54:40,540][06909] Updated weights for policy 0, policy_version 181473 (0.0030) [2024-06-28 07:54:43,852][06674] Fps is (10 sec: 44227.8, 60 sec: 44235.2, 300 sec: 44097.7). Total num frames: 2973384704. Throughput: 0: 44064.7. Samples: 2876261420. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 07:54:43,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:54:44,223][06909] Updated weights for policy 0, policy_version 181483 (0.0035) [2024-06-28 07:54:48,175][06909] Updated weights for policy 0, policy_version 181493 (0.0042) [2024-06-28 07:54:48,852][06674] Fps is (10 sec: 44255.5, 60 sec: 43962.3, 300 sec: 44097.7). Total num frames: 2973597696. Throughput: 0: 44162.1. Samples: 2876525080. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 07:54:48,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:54:52,065][06909] Updated weights for policy 0, policy_version 181503 (0.0037) [2024-06-28 07:54:53,850][06674] Fps is (10 sec: 44245.8, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2973827072. Throughput: 0: 44006.2. Samples: 2876783940. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 07:54:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:54:55,590][06909] Updated weights for policy 0, policy_version 181513 (0.0037) [2024-06-28 07:54:58,852][06674] Fps is (10 sec: 45874.0, 60 sec: 43962.3, 300 sec: 44208.7). Total num frames: 2974056448. Throughput: 0: 44211.4. Samples: 2876919500. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 07:54:58,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:54:59,210][06909] Updated weights for policy 0, policy_version 181523 (0.0029) [2024-06-28 07:55:02,891][06909] Updated weights for policy 0, policy_version 181533 (0.0035) [2024-06-28 07:55:03,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.7, 300 sec: 44098.0). Total num frames: 2974269440. Throughput: 0: 44193.7. Samples: 2877191220. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 07:55:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:55:06,282][06909] Updated weights for policy 0, policy_version 181543 (0.0034) [2024-06-28 07:55:08,850][06674] Fps is (10 sec: 42607.4, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2974482432. Throughput: 0: 44227.1. Samples: 2877457300. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 07:55:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:55:10,232][06909] Updated weights for policy 0, policy_version 181553 (0.0035) [2024-06-28 07:55:13,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44236.9, 300 sec: 44209.0). Total num frames: 2974711808. Throughput: 0: 44281.0. Samples: 2877587020. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 07:55:13,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 07:55:14,206][06909] Updated weights for policy 0, policy_version 181563 (0.0036) [2024-06-28 07:55:17,743][06909] Updated weights for policy 0, policy_version 181573 (0.0031) [2024-06-28 07:55:18,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2974924800. Throughput: 0: 44228.5. Samples: 2877852960. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 07:55:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:55:21,382][06909] Updated weights for policy 0, policy_version 181583 (0.0019) [2024-06-28 07:55:23,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 2975170560. Throughput: 0: 44268.7. Samples: 2878116780. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 07:55:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:55:25,458][06909] Updated weights for policy 0, policy_version 181593 (0.0039) [2024-06-28 07:55:28,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 2975367168. Throughput: 0: 44168.8. Samples: 2878248920. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 07:55:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:55:28,897][06909] Updated weights for policy 0, policy_version 181603 (0.0034) [2024-06-28 07:55:32,758][06909] Updated weights for policy 0, policy_version 181613 (0.0023) [2024-06-28 07:55:33,850][06674] Fps is (10 sec: 40959.4, 60 sec: 43963.7, 300 sec: 44043.3). Total num frames: 2975580160. Throughput: 0: 44192.4. Samples: 2878513660. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 07:55:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:55:36,119][06909] Updated weights for policy 0, policy_version 181623 (0.0040) [2024-06-28 07:55:37,111][06887] Signal inference workers to stop experience collection... (40900 times) [2024-06-28 07:55:37,111][06887] Signal inference workers to resume experience collection... (40900 times) [2024-06-28 07:55:37,128][06909] InferenceWorker_p0-w0: stopping experience collection (40900 times) [2024-06-28 07:55:37,128][06909] InferenceWorker_p0-w0: resuming experience collection (40900 times) [2024-06-28 07:55:38,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44241.3, 300 sec: 44042.6). Total num frames: 2975809536. Throughput: 0: 44258.7. Samples: 2878775580. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 07:55:38,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 07:55:40,014][06909] Updated weights for policy 0, policy_version 181633 (0.0038) [2024-06-28 07:55:43,480][06909] Updated weights for policy 0, policy_version 181643 (0.0026) [2024-06-28 07:55:43,856][06674] Fps is (10 sec: 45849.1, 60 sec: 44234.1, 300 sec: 44152.6). Total num frames: 2976038912. Throughput: 0: 44228.4. Samples: 2878909940. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 07:55:43,856][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 07:55:47,242][06909] Updated weights for policy 0, policy_version 181653 (0.0034) [2024-06-28 07:55:48,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44238.1, 300 sec: 44042.4). Total num frames: 2976251904. Throughput: 0: 43948.0. Samples: 2879168880. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 07:55:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:55:48,988][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000181657_2976268288.pth... [2024-06-28 07:55:49,068][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000181010_2965667840.pth [2024-06-28 07:55:51,248][06909] Updated weights for policy 0, policy_version 181663 (0.0037) [2024-06-28 07:55:53,852][06674] Fps is (10 sec: 44253.4, 60 sec: 44235.3, 300 sec: 44097.7). Total num frames: 2976481280. Throughput: 0: 43841.1. Samples: 2879430240. Policy #0 lag: (min: 0.0, avg: 10.6, max: 24.0) [2024-06-28 07:55:53,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:55:55,066][06909] Updated weights for policy 0, policy_version 181673 (0.0034) [2024-06-28 07:55:58,567][06909] Updated weights for policy 0, policy_version 181683 (0.0022) [2024-06-28 07:55:58,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43965.2, 300 sec: 44153.5). Total num frames: 2976694272. Throughput: 0: 44008.8. Samples: 2879567420. Policy #0 lag: (min: 0.0, avg: 10.6, max: 24.0) [2024-06-28 07:55:58,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:56:02,688][06909] Updated weights for policy 0, policy_version 181693 (0.0035) [2024-06-28 07:56:03,850][06674] Fps is (10 sec: 42607.2, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2976907264. Throughput: 0: 43940.4. Samples: 2879830280. Policy #0 lag: (min: 0.0, avg: 10.6, max: 24.0) [2024-06-28 07:56:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:56:06,207][06909] Updated weights for policy 0, policy_version 181703 (0.0034) [2024-06-28 07:56:08,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.7, 300 sec: 44042.9). Total num frames: 2977136640. Throughput: 0: 43919.9. Samples: 2880093180. Policy #0 lag: (min: 0.0, avg: 10.6, max: 24.0) [2024-06-28 07:56:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:56:10,042][06909] Updated weights for policy 0, policy_version 181713 (0.0032) [2024-06-28 07:56:13,575][06909] Updated weights for policy 0, policy_version 181723 (0.0032) [2024-06-28 07:56:13,856][06674] Fps is (10 sec: 44209.6, 60 sec: 43959.2, 300 sec: 44097.0). Total num frames: 2977349632. Throughput: 0: 44025.5. Samples: 2880230340. Policy #0 lag: (min: 0.0, avg: 10.6, max: 24.0) [2024-06-28 07:56:13,857][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:56:17,482][06909] Updated weights for policy 0, policy_version 181733 (0.0022) [2024-06-28 07:56:18,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2977562624. Throughput: 0: 43928.1. Samples: 2880490420. Policy #0 lag: (min: 0.0, avg: 10.6, max: 24.0) [2024-06-28 07:56:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:56:21,406][06909] Updated weights for policy 0, policy_version 181743 (0.0038) [2024-06-28 07:56:23,850][06674] Fps is (10 sec: 44263.6, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 2977792000. Throughput: 0: 43941.7. Samples: 2880752960. Policy #0 lag: (min: 0.0, avg: 10.6, max: 24.0) [2024-06-28 07:56:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:56:25,000][06909] Updated weights for policy 0, policy_version 181753 (0.0037) [2024-06-28 07:56:28,687][06909] Updated weights for policy 0, policy_version 181763 (0.0028) [2024-06-28 07:56:28,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2978004992. Throughput: 0: 43844.3. Samples: 2880882680. Policy #0 lag: (min: 0.0, avg: 10.6, max: 24.0) [2024-06-28 07:56:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:56:32,180][06909] Updated weights for policy 0, policy_version 181773 (0.0030) [2024-06-28 07:56:33,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2978217984. Throughput: 0: 44071.2. Samples: 2881152080. Policy #0 lag: (min: 0.0, avg: 10.6, max: 24.0) [2024-06-28 07:56:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:56:35,848][06909] Updated weights for policy 0, policy_version 181783 (0.0032) [2024-06-28 07:56:38,852][06674] Fps is (10 sec: 44227.5, 60 sec: 43962.2, 300 sec: 44042.1). Total num frames: 2978447360. Throughput: 0: 44228.4. Samples: 2881420520. Policy #0 lag: (min: 0.0, avg: 10.6, max: 24.0) [2024-06-28 07:56:38,853][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:56:39,335][06909] Updated weights for policy 0, policy_version 181793 (0.0025) [2024-06-28 07:56:43,398][06909] Updated weights for policy 0, policy_version 181803 (0.0035) [2024-06-28 07:56:43,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43694.8, 300 sec: 44042.4). Total num frames: 2978660352. Throughput: 0: 44124.9. Samples: 2881553040. Policy #0 lag: (min: 0.0, avg: 10.6, max: 24.0) [2024-06-28 07:56:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:56:46,912][06909] Updated weights for policy 0, policy_version 181813 (0.0026) [2024-06-28 07:56:48,850][06674] Fps is (10 sec: 44246.5, 60 sec: 43963.9, 300 sec: 44043.0). Total num frames: 2978889728. Throughput: 0: 44227.6. Samples: 2881820520. Policy #0 lag: (min: 0.0, avg: 10.6, max: 24.0) [2024-06-28 07:56:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:56:51,145][06909] Updated weights for policy 0, policy_version 181823 (0.0027) [2024-06-28 07:56:53,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43965.2, 300 sec: 44098.0). Total num frames: 2979119104. Throughput: 0: 44067.6. Samples: 2882076220. Policy #0 lag: (min: 0.0, avg: 10.6, max: 24.0) [2024-06-28 07:56:53,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 07:56:54,297][06909] Updated weights for policy 0, policy_version 181833 (0.0023) [2024-06-28 07:56:58,570][06909] Updated weights for policy 0, policy_version 181843 (0.0031) [2024-06-28 07:56:58,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2979332096. Throughput: 0: 44034.4. Samples: 2882211620. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-28 07:56:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:57:01,724][06909] Updated weights for policy 0, policy_version 181853 (0.0037) [2024-06-28 07:57:03,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2979561472. Throughput: 0: 44212.9. Samples: 2882480000. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-28 07:57:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:57:05,732][06909] Updated weights for policy 0, policy_version 181863 (0.0037) [2024-06-28 07:57:08,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 2979774464. Throughput: 0: 44205.4. Samples: 2882742200. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-28 07:57:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:57:08,939][06887] Signal inference workers to stop experience collection... (40950 times) [2024-06-28 07:57:08,940][06887] Signal inference workers to resume experience collection... (40950 times) [2024-06-28 07:57:08,956][06909] InferenceWorker_p0-w0: stopping experience collection (40950 times) [2024-06-28 07:57:08,956][06909] InferenceWorker_p0-w0: resuming experience collection (40950 times) [2024-06-28 07:57:09,093][06909] Updated weights for policy 0, policy_version 181873 (0.0036) [2024-06-28 07:57:13,096][06909] Updated weights for policy 0, policy_version 181883 (0.0026) [2024-06-28 07:57:13,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44241.3, 300 sec: 44042.4). Total num frames: 2980003840. Throughput: 0: 44191.5. Samples: 2882871300. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-28 07:57:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:57:16,489][06909] Updated weights for policy 0, policy_version 181893 (0.0023) [2024-06-28 07:57:18,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2980216832. Throughput: 0: 44168.8. Samples: 2883139680. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-28 07:57:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:57:20,657][06909] Updated weights for policy 0, policy_version 181903 (0.0031) [2024-06-28 07:57:23,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 2980446208. Throughput: 0: 44029.1. Samples: 2883401740. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-28 07:57:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:57:24,072][06909] Updated weights for policy 0, policy_version 181913 (0.0034) [2024-06-28 07:57:28,427][06909] Updated weights for policy 0, policy_version 181923 (0.0028) [2024-06-28 07:57:28,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2980642816. Throughput: 0: 43918.8. Samples: 2883529380. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-28 07:57:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:57:31,721][06909] Updated weights for policy 0, policy_version 181933 (0.0043) [2024-06-28 07:57:33,856][06674] Fps is (10 sec: 42573.3, 60 sec: 44232.4, 300 sec: 44097.1). Total num frames: 2980872192. Throughput: 0: 43808.3. Samples: 2883792160. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-28 07:57:33,856][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:57:35,737][06909] Updated weights for policy 0, policy_version 181943 (0.0026) [2024-06-28 07:57:38,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43965.2, 300 sec: 44042.4). Total num frames: 2981085184. Throughput: 0: 43974.6. Samples: 2884055080. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-28 07:57:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:57:39,238][06909] Updated weights for policy 0, policy_version 181953 (0.0024) [2024-06-28 07:57:42,963][06909] Updated weights for policy 0, policy_version 181963 (0.0023) [2024-06-28 07:57:43,850][06674] Fps is (10 sec: 44263.4, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 2981314560. Throughput: 0: 43893.0. Samples: 2884186800. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-28 07:57:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:57:46,537][06909] Updated weights for policy 0, policy_version 181973 (0.0031) [2024-06-28 07:57:48,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2981527552. Throughput: 0: 43676.9. Samples: 2884445460. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-28 07:57:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:57:48,936][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000181979_2981543936.pth... [2024-06-28 07:57:48,983][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000181334_2970976256.pth [2024-06-28 07:57:50,611][06909] Updated weights for policy 0, policy_version 181983 (0.0035) [2024-06-28 07:57:53,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.7, 300 sec: 44098.7). Total num frames: 2981756928. Throughput: 0: 43932.4. Samples: 2884719160. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-28 07:57:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:57:54,105][06909] Updated weights for policy 0, policy_version 181993 (0.0027) [2024-06-28 07:57:57,982][06909] Updated weights for policy 0, policy_version 182003 (0.0029) [2024-06-28 07:57:58,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2981969920. Throughput: 0: 43951.6. Samples: 2884849120. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-28 07:57:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:58:01,463][06909] Updated weights for policy 0, policy_version 182013 (0.0031) [2024-06-28 07:58:03,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2982199296. Throughput: 0: 43828.0. Samples: 2885111940. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-28 07:58:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:58:05,644][06909] Updated weights for policy 0, policy_version 182023 (0.0038) [2024-06-28 07:58:08,856][06674] Fps is (10 sec: 44209.8, 60 sec: 43959.3, 300 sec: 44041.8). Total num frames: 2982412288. Throughput: 0: 43904.8. Samples: 2885377720. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-28 07:58:08,857][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:58:09,167][06909] Updated weights for policy 0, policy_version 182033 (0.0031) [2024-06-28 07:58:13,083][06909] Updated weights for policy 0, policy_version 182043 (0.0024) [2024-06-28 07:58:13,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 2982625280. Throughput: 0: 43813.2. Samples: 2885500980. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-28 07:58:13,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:58:16,640][06909] Updated weights for policy 0, policy_version 182053 (0.0031) [2024-06-28 07:58:18,850][06674] Fps is (10 sec: 44263.3, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 2982854656. Throughput: 0: 43833.8. Samples: 2885764420. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-28 07:58:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:58:20,567][06909] Updated weights for policy 0, policy_version 182063 (0.0035) [2024-06-28 07:58:23,831][06909] Updated weights for policy 0, policy_version 182073 (0.0030) [2024-06-28 07:58:23,850][06674] Fps is (10 sec: 45875.9, 60 sec: 43963.8, 300 sec: 44098.3). Total num frames: 2983084032. Throughput: 0: 43963.7. Samples: 2886033440. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-28 07:58:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:58:27,964][06909] Updated weights for policy 0, policy_version 182083 (0.0040) [2024-06-28 07:58:28,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 2983297024. Throughput: 0: 43947.9. Samples: 2886164460. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-28 07:58:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:58:31,474][06909] Updated weights for policy 0, policy_version 182093 (0.0027) [2024-06-28 07:58:33,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44241.2, 300 sec: 44153.7). Total num frames: 2983526400. Throughput: 0: 44169.8. Samples: 2886433100. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-28 07:58:33,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 07:58:35,297][06909] Updated weights for policy 0, policy_version 182103 (0.0032) [2024-06-28 07:58:38,838][06909] Updated weights for policy 0, policy_version 182113 (0.0039) [2024-06-28 07:58:38,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2983739392. Throughput: 0: 43915.6. Samples: 2886695360. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-28 07:58:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:58:42,674][06909] Updated weights for policy 0, policy_version 182123 (0.0026) [2024-06-28 07:58:43,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2983952384. Throughput: 0: 43912.4. Samples: 2886825180. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-28 07:58:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:58:46,511][06909] Updated weights for policy 0, policy_version 182133 (0.0033) [2024-06-28 07:58:48,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2984181760. Throughput: 0: 43966.2. Samples: 2887090420. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-28 07:58:48,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:58:50,055][06909] Updated weights for policy 0, policy_version 182143 (0.0033) [2024-06-28 07:58:53,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.8, 300 sec: 43931.4). Total num frames: 2984378368. Throughput: 0: 44043.3. Samples: 2887359400. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-28 07:58:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:58:53,922][06909] Updated weights for policy 0, policy_version 182153 (0.0026) [2024-06-28 07:58:54,649][06887] Signal inference workers to stop experience collection... (41000 times) [2024-06-28 07:58:54,653][06887] Signal inference workers to resume experience collection... (41000 times) [2024-06-28 07:58:54,695][06909] InferenceWorker_p0-w0: stopping experience collection (41000 times) [2024-06-28 07:58:54,695][06909] InferenceWorker_p0-w0: resuming experience collection (41000 times) [2024-06-28 07:58:57,347][06909] Updated weights for policy 0, policy_version 182163 (0.0040) [2024-06-28 07:58:58,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 2984624128. Throughput: 0: 44262.3. Samples: 2887492780. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-28 07:58:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:59:01,098][06909] Updated weights for policy 0, policy_version 182173 (0.0025) [2024-06-28 07:59:03,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2984837120. Throughput: 0: 44261.9. Samples: 2887756200. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-28 07:59:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:59:04,922][06909] Updated weights for policy 0, policy_version 182183 (0.0031) [2024-06-28 07:59:08,778][06909] Updated weights for policy 0, policy_version 182193 (0.0033) [2024-06-28 07:59:08,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43968.2, 300 sec: 44042.4). Total num frames: 2985050112. Throughput: 0: 44260.8. Samples: 2888025180. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 07:59:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:59:12,213][06909] Updated weights for policy 0, policy_version 182203 (0.0029) [2024-06-28 07:59:13,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.9, 300 sec: 44097.9). Total num frames: 2985279488. Throughput: 0: 44217.4. Samples: 2888154240. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 07:59:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:59:16,113][06909] Updated weights for policy 0, policy_version 182213 (0.0030) [2024-06-28 07:59:18,852][06674] Fps is (10 sec: 44227.5, 60 sec: 43962.2, 300 sec: 44042.1). Total num frames: 2985492480. Throughput: 0: 44010.8. Samples: 2888413680. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 07:59:18,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 07:59:19,872][06909] Updated weights for policy 0, policy_version 182223 (0.0037) [2024-06-28 07:59:23,762][06909] Updated weights for policy 0, policy_version 182233 (0.0028) [2024-06-28 07:59:23,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 2985705472. Throughput: 0: 44071.1. Samples: 2888678560. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 07:59:23,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:59:27,395][06909] Updated weights for policy 0, policy_version 182243 (0.0029) [2024-06-28 07:59:28,850][06674] Fps is (10 sec: 45884.9, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 2985951232. Throughput: 0: 44166.6. Samples: 2888812680. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 07:59:28,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-28 07:59:30,989][06909] Updated weights for policy 0, policy_version 182253 (0.0032) [2024-06-28 07:59:33,850][06674] Fps is (10 sec: 47514.1, 60 sec: 44236.8, 300 sec: 44154.4). Total num frames: 2986180608. Throughput: 0: 44090.7. Samples: 2889074500. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 07:59:33,856][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 07:59:34,678][06909] Updated weights for policy 0, policy_version 182263 (0.0035) [2024-06-28 07:59:38,666][06909] Updated weights for policy 0, policy_version 182273 (0.0026) [2024-06-28 07:59:38,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43690.6, 300 sec: 43987.2). Total num frames: 2986360832. Throughput: 0: 44018.5. Samples: 2889340240. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 07:59:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 07:59:42,049][06909] Updated weights for policy 0, policy_version 182283 (0.0038) [2024-06-28 07:59:43,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43963.8, 300 sec: 44042.7). Total num frames: 2986590208. Throughput: 0: 43921.8. Samples: 2889469260. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 07:59:43,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 07:59:45,923][06909] Updated weights for policy 0, policy_version 182293 (0.0030) [2024-06-28 07:59:48,850][06674] Fps is (10 sec: 45876.0, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2986819584. Throughput: 0: 43976.9. Samples: 2889735160. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 07:59:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 07:59:48,862][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000182301_2986819584.pth... [2024-06-28 07:59:48,928][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000181657_2976268288.pth [2024-06-28 07:59:49,854][06909] Updated weights for policy 0, policy_version 182303 (0.0030) [2024-06-28 07:59:53,245][06909] Updated weights for policy 0, policy_version 182313 (0.0031) [2024-06-28 07:59:53,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44236.7, 300 sec: 43987.2). Total num frames: 2987032576. Throughput: 0: 43864.0. Samples: 2889999060. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 07:59:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 07:59:57,344][06909] Updated weights for policy 0, policy_version 182323 (0.0037) [2024-06-28 07:59:58,850][06674] Fps is (10 sec: 44235.8, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 2987261952. Throughput: 0: 43965.2. Samples: 2890132680. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 07:59:58,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:00:01,048][06909] Updated weights for policy 0, policy_version 182333 (0.0027) [2024-06-28 08:00:03,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 2987491328. Throughput: 0: 43970.4. Samples: 2890392260. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 08:00:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:00:04,604][06909] Updated weights for policy 0, policy_version 182343 (0.0039) [2024-06-28 08:00:08,263][06909] Updated weights for policy 0, policy_version 182353 (0.0026) [2024-06-28 08:00:08,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2987687936. Throughput: 0: 44090.8. Samples: 2890662640. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 08:00:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:00:11,826][06909] Updated weights for policy 0, policy_version 182363 (0.0042) [2024-06-28 08:00:13,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2987900928. Throughput: 0: 43891.1. Samples: 2890787780. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 08:00:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:00:16,015][06909] Updated weights for policy 0, policy_version 182373 (0.0037) [2024-06-28 08:00:18,776][06887] Signal inference workers to stop experience collection... (41050 times) [2024-06-28 08:00:18,776][06887] Signal inference workers to resume experience collection... (41050 times) [2024-06-28 08:00:18,820][06909] InferenceWorker_p0-w0: stopping experience collection (41050 times) [2024-06-28 08:00:18,820][06909] InferenceWorker_p0-w0: resuming experience collection (41050 times) [2024-06-28 08:00:18,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43965.2, 300 sec: 43931.3). Total num frames: 2988130304. Throughput: 0: 43925.2. Samples: 2891051140. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 08:00:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:00:19,342][06909] Updated weights for policy 0, policy_version 182383 (0.0043) [2024-06-28 08:00:23,397][06909] Updated weights for policy 0, policy_version 182393 (0.0049) [2024-06-28 08:00:23,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2988359680. Throughput: 0: 43907.2. Samples: 2891316060. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 08:00:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:00:26,930][06909] Updated weights for policy 0, policy_version 182403 (0.0034) [2024-06-28 08:00:28,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43417.6, 300 sec: 43986.9). Total num frames: 2988556288. Throughput: 0: 43882.6. Samples: 2891443980. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 08:00:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 08:00:30,629][06909] Updated weights for policy 0, policy_version 182413 (0.0027) [2024-06-28 08:00:33,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43417.5, 300 sec: 43986.9). Total num frames: 2988785664. Throughput: 0: 43891.3. Samples: 2891710280. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 08:00:33,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:00:34,824][06909] Updated weights for policy 0, policy_version 182423 (0.0035) [2024-06-28 08:00:38,362][06909] Updated weights for policy 0, policy_version 182433 (0.0023) [2024-06-28 08:00:38,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44510.0, 300 sec: 44043.3). Total num frames: 2989031424. Throughput: 0: 43885.4. Samples: 2891973900. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 08:00:38,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 08:00:42,007][06909] Updated weights for policy 0, policy_version 182443 (0.0034) [2024-06-28 08:00:43,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.5, 300 sec: 43931.3). Total num frames: 2989211648. Throughput: 0: 43846.2. Samples: 2892105760. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 08:00:43,859][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:00:45,722][06909] Updated weights for policy 0, policy_version 182453 (0.0039) [2024-06-28 08:00:48,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43690.6, 300 sec: 43931.6). Total num frames: 2989441024. Throughput: 0: 43928.5. Samples: 2892369040. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 08:00:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:00:49,218][06909] Updated weights for policy 0, policy_version 182463 (0.0031) [2024-06-28 08:00:53,083][06909] Updated weights for policy 0, policy_version 182473 (0.0038) [2024-06-28 08:00:53,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2989670400. Throughput: 0: 43783.9. Samples: 2892632920. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 08:00:53,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 08:00:56,503][06909] Updated weights for policy 0, policy_version 182483 (0.0024) [2024-06-28 08:00:58,850][06674] Fps is (10 sec: 44235.9, 60 sec: 43690.6, 300 sec: 43986.8). Total num frames: 2989883392. Throughput: 0: 43835.4. Samples: 2892760380. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 08:00:58,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 08:01:00,450][06909] Updated weights for policy 0, policy_version 182493 (0.0032) [2024-06-28 08:01:03,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2990112768. Throughput: 0: 43944.6. Samples: 2893028640. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 08:01:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:01:04,337][06909] Updated weights for policy 0, policy_version 182503 (0.0034) [2024-06-28 08:01:07,885][06909] Updated weights for policy 0, policy_version 182513 (0.0031) [2024-06-28 08:01:08,850][06674] Fps is (10 sec: 44237.7, 60 sec: 43963.7, 300 sec: 43987.8). Total num frames: 2990325760. Throughput: 0: 43855.6. Samples: 2893289560. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 08:01:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:01:11,780][06909] Updated weights for policy 0, policy_version 182523 (0.0019) [2024-06-28 08:01:13,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2990538752. Throughput: 0: 44055.1. Samples: 2893426460. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 08:01:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:01:15,337][06909] Updated weights for policy 0, policy_version 182533 (0.0029) [2024-06-28 08:01:18,852][06674] Fps is (10 sec: 44227.7, 60 sec: 43962.3, 300 sec: 43986.6). Total num frames: 2990768128. Throughput: 0: 44160.8. Samples: 2893697600. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 08:01:18,853][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:01:19,045][06909] Updated weights for policy 0, policy_version 182543 (0.0031) [2024-06-28 08:01:22,842][06909] Updated weights for policy 0, policy_version 182553 (0.0037) [2024-06-28 08:01:23,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 2990997504. Throughput: 0: 43981.8. Samples: 2893953080. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 08:01:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:01:26,237][06909] Updated weights for policy 0, policy_version 182563 (0.0022) [2024-06-28 08:01:28,850][06674] Fps is (10 sec: 42607.3, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2991194112. Throughput: 0: 44047.7. Samples: 2894087900. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 08:01:28,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 08:01:30,544][06909] Updated weights for policy 0, policy_version 182573 (0.0028) [2024-06-28 08:01:33,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.9, 300 sec: 43987.2). Total num frames: 2991423488. Throughput: 0: 44093.8. Samples: 2894353260. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 08:01:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:01:33,921][06909] Updated weights for policy 0, policy_version 182583 (0.0031) [2024-06-28 08:01:36,100][06887] Signal inference workers to stop experience collection... (41100 times) [2024-06-28 08:01:36,100][06887] Signal inference workers to resume experience collection... (41100 times) [2024-06-28 08:01:36,127][06909] InferenceWorker_p0-w0: stopping experience collection (41100 times) [2024-06-28 08:01:36,127][06909] InferenceWorker_p0-w0: resuming experience collection (41100 times) [2024-06-28 08:01:37,982][06909] Updated weights for policy 0, policy_version 182593 (0.0049) [2024-06-28 08:01:38,850][06674] Fps is (10 sec: 44235.7, 60 sec: 43417.4, 300 sec: 43986.9). Total num frames: 2991636480. Throughput: 0: 43942.5. Samples: 2894610340. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 08:01:38,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:01:41,706][06909] Updated weights for policy 0, policy_version 182603 (0.0026) [2024-06-28 08:01:43,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 2991865856. Throughput: 0: 44066.8. Samples: 2894743380. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 08:01:43,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 08:01:45,136][06909] Updated weights for policy 0, policy_version 182613 (0.0035) [2024-06-28 08:01:48,850][06674] Fps is (10 sec: 44237.6, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 2992078848. Throughput: 0: 43943.5. Samples: 2895006100. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 08:01:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:01:48,857][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000182622_2992078848.pth... [2024-06-28 08:01:48,911][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000181979_2981543936.pth [2024-06-28 08:01:49,101][06909] Updated weights for policy 0, policy_version 182623 (0.0023) [2024-06-28 08:01:52,563][06909] Updated weights for policy 0, policy_version 182633 (0.0033) [2024-06-28 08:01:53,850][06674] Fps is (10 sec: 45875.8, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 2992324608. Throughput: 0: 43968.1. Samples: 2895268120. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 08:01:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:01:56,592][06909] Updated weights for policy 0, policy_version 182643 (0.0024) [2024-06-28 08:01:58,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43690.8, 300 sec: 43875.8). Total num frames: 2992504832. Throughput: 0: 43963.5. Samples: 2895404820. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 08:01:58,850][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 08:02:00,105][06909] Updated weights for policy 0, policy_version 182653 (0.0030) [2024-06-28 08:02:03,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 2992734208. Throughput: 0: 43674.9. Samples: 2895662880. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 08:02:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:02:03,926][06909] Updated weights for policy 0, policy_version 182663 (0.0031) [2024-06-28 08:02:07,467][06909] Updated weights for policy 0, policy_version 182673 (0.0035) [2024-06-28 08:02:08,850][06674] Fps is (10 sec: 45874.3, 60 sec: 43963.6, 300 sec: 43931.3). Total num frames: 2992963584. Throughput: 0: 43917.1. Samples: 2895929360. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 08:02:08,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:02:11,358][06909] Updated weights for policy 0, policy_version 182683 (0.0031) [2024-06-28 08:02:13,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2993192960. Throughput: 0: 43847.6. Samples: 2896061040. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 08:02:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:02:15,094][06909] Updated weights for policy 0, policy_version 182693 (0.0038) [2024-06-28 08:02:18,747][06909] Updated weights for policy 0, policy_version 182703 (0.0036) [2024-06-28 08:02:18,850][06674] Fps is (10 sec: 44237.8, 60 sec: 43965.3, 300 sec: 43931.4). Total num frames: 2993405952. Throughput: 0: 43847.6. Samples: 2896326400. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 08:02:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 08:02:22,593][06909] Updated weights for policy 0, policy_version 182713 (0.0028) [2024-06-28 08:02:23,852][06674] Fps is (10 sec: 44224.9, 60 sec: 43961.8, 300 sec: 44042.0). Total num frames: 2993635328. Throughput: 0: 43873.7. Samples: 2896584760. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 08:02:23,853][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:02:26,242][06909] Updated weights for policy 0, policy_version 182723 (0.0028) [2024-06-28 08:02:28,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.7, 300 sec: 43932.2). Total num frames: 2993831936. Throughput: 0: 43942.3. Samples: 2896720780. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 08:02:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:02:29,782][06909] Updated weights for policy 0, policy_version 182733 (0.0034) [2024-06-28 08:02:33,613][06909] Updated weights for policy 0, policy_version 182743 (0.0037) [2024-06-28 08:02:33,850][06674] Fps is (10 sec: 42609.8, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2994061312. Throughput: 0: 44013.9. Samples: 2896986720. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 08:02:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:02:37,253][06909] Updated weights for policy 0, policy_version 182753 (0.0040) [2024-06-28 08:02:38,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 2994290688. Throughput: 0: 44053.6. Samples: 2897250540. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 08:02:38,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:02:40,951][06909] Updated weights for policy 0, policy_version 182763 (0.0036) [2024-06-28 08:02:43,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2994503680. Throughput: 0: 43897.3. Samples: 2897380200. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 08:02:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:02:44,866][06909] Updated weights for policy 0, policy_version 182773 (0.0026) [2024-06-28 08:02:48,560][06909] Updated weights for policy 0, policy_version 182783 (0.0029) [2024-06-28 08:02:48,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 2994733056. Throughput: 0: 44123.1. Samples: 2897648420. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 08:02:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:02:52,078][06909] Updated weights for policy 0, policy_version 182793 (0.0028) [2024-06-28 08:02:53,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2994962432. Throughput: 0: 44050.9. Samples: 2897911640. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 08:02:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:02:55,916][06909] Updated weights for policy 0, policy_version 182803 (0.0038) [2024-06-28 08:02:58,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44509.9, 300 sec: 43986.9). Total num frames: 2995175424. Throughput: 0: 44235.5. Samples: 2898051640. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 08:02:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:02:59,515][06909] Updated weights for policy 0, policy_version 182813 (0.0038) [2024-06-28 08:03:03,309][06909] Updated weights for policy 0, policy_version 182823 (0.0022) [2024-06-28 08:03:03,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.8, 300 sec: 43987.8). Total num frames: 2995388416. Throughput: 0: 44140.0. Samples: 2898312700. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 08:03:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:03:06,492][06887] Signal inference workers to stop experience collection... (41150 times) [2024-06-28 08:03:06,493][06887] Signal inference workers to resume experience collection... (41150 times) [2024-06-28 08:03:06,517][06909] InferenceWorker_p0-w0: stopping experience collection (41150 times) [2024-06-28 08:03:06,518][06909] InferenceWorker_p0-w0: resuming experience collection (41150 times) [2024-06-28 08:03:07,005][06909] Updated weights for policy 0, policy_version 182833 (0.0032) [2024-06-28 08:03:08,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 2995617792. Throughput: 0: 44104.8. Samples: 2898569360. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 08:03:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:03:10,628][06909] Updated weights for policy 0, policy_version 182843 (0.0041) [2024-06-28 08:03:13,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 2995830784. Throughput: 0: 44088.0. Samples: 2898704740. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 08:03:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:03:14,538][06909] Updated weights for policy 0, policy_version 182853 (0.0023) [2024-06-28 08:03:17,918][06909] Updated weights for policy 0, policy_version 182863 (0.0033) [2024-06-28 08:03:18,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 2996060160. Throughput: 0: 44200.4. Samples: 2898975740. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 08:03:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:03:21,837][06909] Updated weights for policy 0, policy_version 182873 (0.0039) [2024-06-28 08:03:23,852][06674] Fps is (10 sec: 44227.4, 60 sec: 43964.1, 300 sec: 43986.6). Total num frames: 2996273152. Throughput: 0: 44117.6. Samples: 2899235920. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 08:03:23,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:03:25,554][06909] Updated weights for policy 0, policy_version 182883 (0.0026) [2024-06-28 08:03:28,850][06674] Fps is (10 sec: 42598.9, 60 sec: 44236.9, 300 sec: 43931.3). Total num frames: 2996486144. Throughput: 0: 44202.7. Samples: 2899369320. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 08:03:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:03:29,308][06909] Updated weights for policy 0, policy_version 182893 (0.0029) [2024-06-28 08:03:33,046][06909] Updated weights for policy 0, policy_version 182903 (0.0025) [2024-06-28 08:03:33,850][06674] Fps is (10 sec: 44246.0, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 2996715520. Throughput: 0: 44130.2. Samples: 2899634280. Policy #0 lag: (min: 0.0, avg: 9.5, max: 24.0) [2024-06-28 08:03:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:03:36,557][06909] Updated weights for policy 0, policy_version 182913 (0.0032) [2024-06-28 08:03:38,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 2996928512. Throughput: 0: 44040.4. Samples: 2899893460. Policy #0 lag: (min: 0.0, avg: 9.5, max: 24.0) [2024-06-28 08:03:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:03:40,354][06909] Updated weights for policy 0, policy_version 182923 (0.0027) [2024-06-28 08:03:43,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 2997141504. Throughput: 0: 43892.9. Samples: 2900026820. Policy #0 lag: (min: 0.0, avg: 9.5, max: 24.0) [2024-06-28 08:03:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:03:44,277][06909] Updated weights for policy 0, policy_version 182933 (0.0030) [2024-06-28 08:03:47,539][06909] Updated weights for policy 0, policy_version 182943 (0.0023) [2024-06-28 08:03:48,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 2997370880. Throughput: 0: 44040.8. Samples: 2900294540. Policy #0 lag: (min: 0.0, avg: 9.5, max: 24.0) [2024-06-28 08:03:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:03:48,866][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000182945_2997370880.pth... [2024-06-28 08:03:48,915][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000182301_2986819584.pth [2024-06-28 08:03:51,745][06909] Updated weights for policy 0, policy_version 182953 (0.0024) [2024-06-28 08:03:53,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 2997583872. Throughput: 0: 44023.9. Samples: 2900550440. Policy #0 lag: (min: 0.0, avg: 9.5, max: 24.0) [2024-06-28 08:03:53,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:03:55,230][06909] Updated weights for policy 0, policy_version 182963 (0.0043) [2024-06-28 08:03:58,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 2997796864. Throughput: 0: 44107.6. Samples: 2900689580. Policy #0 lag: (min: 0.0, avg: 9.5, max: 24.0) [2024-06-28 08:03:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:03:59,418][06909] Updated weights for policy 0, policy_version 182973 (0.0039) [2024-06-28 08:04:02,797][06909] Updated weights for policy 0, policy_version 182983 (0.0026) [2024-06-28 08:04:03,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 2998026240. Throughput: 0: 43785.3. Samples: 2900946080. Policy #0 lag: (min: 0.0, avg: 9.5, max: 24.0) [2024-06-28 08:04:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:04:06,655][06909] Updated weights for policy 0, policy_version 182993 (0.0036) [2024-06-28 08:04:08,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 2998239232. Throughput: 0: 43827.7. Samples: 2901208080. Policy #0 lag: (min: 0.0, avg: 9.5, max: 24.0) [2024-06-28 08:04:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:04:10,513][06909] Updated weights for policy 0, policy_version 183003 (0.0023) [2024-06-28 08:04:13,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.6, 300 sec: 43987.2). Total num frames: 2998468608. Throughput: 0: 43885.6. Samples: 2901344180. Policy #0 lag: (min: 0.0, avg: 9.5, max: 24.0) [2024-06-28 08:04:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:04:14,137][06909] Updated weights for policy 0, policy_version 183013 (0.0025) [2024-06-28 08:04:17,655][06909] Updated weights for policy 0, policy_version 183023 (0.0022) [2024-06-28 08:04:18,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 2998681600. Throughput: 0: 43955.5. Samples: 2901612280. Policy #0 lag: (min: 0.0, avg: 9.5, max: 24.0) [2024-06-28 08:04:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:04:21,586][06909] Updated weights for policy 0, policy_version 183033 (0.0029) [2024-06-28 08:04:23,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43965.2, 300 sec: 43931.3). Total num frames: 2998910976. Throughput: 0: 44051.9. Samples: 2901875800. Policy #0 lag: (min: 0.0, avg: 9.5, max: 24.0) [2024-06-28 08:04:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:04:24,849][06909] Updated weights for policy 0, policy_version 183043 (0.0031) [2024-06-28 08:04:28,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.6, 300 sec: 43875.8). Total num frames: 2999123968. Throughput: 0: 44060.8. Samples: 2902009560. Policy #0 lag: (min: 0.0, avg: 9.5, max: 24.0) [2024-06-28 08:04:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 08:04:28,922][06909] Updated weights for policy 0, policy_version 183053 (0.0028) [2024-06-28 08:04:30,216][06887] Signal inference workers to stop experience collection... (41200 times) [2024-06-28 08:04:30,217][06887] Signal inference workers to resume experience collection... (41200 times) [2024-06-28 08:04:30,265][06909] InferenceWorker_p0-w0: stopping experience collection (41200 times) [2024-06-28 08:04:30,265][06909] InferenceWorker_p0-w0: resuming experience collection (41200 times) [2024-06-28 08:04:32,203][06909] Updated weights for policy 0, policy_version 183063 (0.0025) [2024-06-28 08:04:33,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 2999353344. Throughput: 0: 43911.0. Samples: 2902270540. Policy #0 lag: (min: 0.0, avg: 9.5, max: 24.0) [2024-06-28 08:04:33,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:04:36,427][06909] Updated weights for policy 0, policy_version 183073 (0.0023) [2024-06-28 08:04:38,850][06674] Fps is (10 sec: 45875.8, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 2999582720. Throughput: 0: 44244.1. Samples: 2902541420. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 08:04:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:04:39,854][06909] Updated weights for policy 0, policy_version 183083 (0.0022) [2024-06-28 08:04:43,703][06909] Updated weights for policy 0, policy_version 183093 (0.0032) [2024-06-28 08:04:43,853][06674] Fps is (10 sec: 44223.6, 60 sec: 44234.5, 300 sec: 43986.4). Total num frames: 2999795712. Throughput: 0: 44070.7. Samples: 2902672900. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 08:04:43,853][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:04:47,398][06909] Updated weights for policy 0, policy_version 183103 (0.0031) [2024-06-28 08:04:48,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3000008704. Throughput: 0: 44252.5. Samples: 2902937440. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 08:04:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:04:51,419][06909] Updated weights for policy 0, policy_version 183113 (0.0026) [2024-06-28 08:04:53,850][06674] Fps is (10 sec: 44250.4, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3000238080. Throughput: 0: 44322.7. Samples: 2903202600. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 08:04:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:04:54,692][06909] Updated weights for policy 0, policy_version 183123 (0.0027) [2024-06-28 08:04:58,690][06909] Updated weights for policy 0, policy_version 183133 (0.0034) [2024-06-28 08:04:58,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44509.8, 300 sec: 43986.9). Total num frames: 3000467456. Throughput: 0: 44314.2. Samples: 2903338320. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 08:04:58,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 08:05:02,037][06909] Updated weights for policy 0, policy_version 183143 (0.0028) [2024-06-28 08:05:03,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3000664064. Throughput: 0: 43955.6. Samples: 2903590280. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 08:05:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:05:06,259][06909] Updated weights for policy 0, policy_version 183153 (0.0036) [2024-06-28 08:05:08,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44509.9, 300 sec: 44097.9). Total num frames: 3000909824. Throughput: 0: 43973.4. Samples: 2903854600. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 08:05:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:05:09,312][06909] Updated weights for policy 0, policy_version 183163 (0.0042) [2024-06-28 08:05:13,699][06909] Updated weights for policy 0, policy_version 183173 (0.0032) [2024-06-28 08:05:13,850][06674] Fps is (10 sec: 44236.0, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3001106432. Throughput: 0: 43938.6. Samples: 2903986800. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 08:05:13,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:05:17,368][06909] Updated weights for policy 0, policy_version 183183 (0.0027) [2024-06-28 08:05:18,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 3001319424. Throughput: 0: 43903.3. Samples: 2904246180. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 08:05:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:05:21,291][06909] Updated weights for policy 0, policy_version 183193 (0.0032) [2024-06-28 08:05:23,852][06674] Fps is (10 sec: 44228.3, 60 sec: 43962.3, 300 sec: 44042.1). Total num frames: 3001548800. Throughput: 0: 43769.0. Samples: 2904511120. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 08:05:23,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 08:05:24,664][06909] Updated weights for policy 0, policy_version 183203 (0.0045) [2024-06-28 08:05:28,738][06909] Updated weights for policy 0, policy_version 183213 (0.0031) [2024-06-28 08:05:28,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3001761792. Throughput: 0: 43899.9. Samples: 2904648260. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 08:05:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:05:31,889][06909] Updated weights for policy 0, policy_version 183223 (0.0038) [2024-06-28 08:05:33,855][06674] Fps is (10 sec: 42583.3, 60 sec: 43686.7, 300 sec: 43875.0). Total num frames: 3001974784. Throughput: 0: 43782.6. Samples: 2904907900. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 08:05:33,856][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:05:36,424][06909] Updated weights for policy 0, policy_version 183233 (0.0032) [2024-06-28 08:05:38,244][06887] Signal inference workers to stop experience collection... (41250 times) [2024-06-28 08:05:38,244][06887] Signal inference workers to resume experience collection... (41250 times) [2024-06-28 08:05:38,267][06909] InferenceWorker_p0-w0: stopping experience collection (41250 times) [2024-06-28 08:05:38,267][06909] InferenceWorker_p0-w0: resuming experience collection (41250 times) [2024-06-28 08:05:38,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 3002220544. Throughput: 0: 43670.7. Samples: 2905167780. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 08:05:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:05:39,343][06909] Updated weights for policy 0, policy_version 183243 (0.0027) [2024-06-28 08:05:43,850][06674] Fps is (10 sec: 42622.1, 60 sec: 43419.8, 300 sec: 43931.3). Total num frames: 3002400768. Throughput: 0: 43546.3. Samples: 2905297900. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 08:05:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 08:05:43,911][06909] Updated weights for policy 0, policy_version 183253 (0.0029) [2024-06-28 08:05:46,638][06909] Updated weights for policy 0, policy_version 183263 (0.0021) [2024-06-28 08:05:48,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3002646528. Throughput: 0: 43864.4. Samples: 2905564180. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 08:05:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:05:48,857][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000183267_3002646528.pth... [2024-06-28 08:05:48,900][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000182622_2992078848.pth [2024-06-28 08:05:51,037][06909] Updated weights for policy 0, policy_version 183273 (0.0045) [2024-06-28 08:05:53,850][06674] Fps is (10 sec: 47513.7, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3002875904. Throughput: 0: 43932.0. Samples: 2905831540. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 08:05:53,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:05:54,462][06909] Updated weights for policy 0, policy_version 183283 (0.0037) [2024-06-28 08:05:58,346][06909] Updated weights for policy 0, policy_version 183293 (0.0041) [2024-06-28 08:05:58,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3003088896. Throughput: 0: 44006.8. Samples: 2905967100. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 08:05:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:06:01,828][06909] Updated weights for policy 0, policy_version 183303 (0.0039) [2024-06-28 08:06:03,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3003301888. Throughput: 0: 43977.8. Samples: 2906225180. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 08:06:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:06:06,005][06909] Updated weights for policy 0, policy_version 183313 (0.0025) [2024-06-28 08:06:08,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 3003547648. Throughput: 0: 43961.6. Samples: 2906489300. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 08:06:08,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 08:06:09,180][06909] Updated weights for policy 0, policy_version 183323 (0.0044) [2024-06-28 08:06:13,289][06909] Updated weights for policy 0, policy_version 183333 (0.0035) [2024-06-28 08:06:13,850][06674] Fps is (10 sec: 45874.6, 60 sec: 44236.8, 300 sec: 44042.7). Total num frames: 3003760640. Throughput: 0: 43952.8. Samples: 2906626140. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 08:06:13,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 08:06:16,426][06909] Updated weights for policy 0, policy_version 183343 (0.0032) [2024-06-28 08:06:18,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 3003957248. Throughput: 0: 44022.4. Samples: 2906888660. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 08:06:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 08:06:20,906][06909] Updated weights for policy 0, policy_version 183353 (0.0033) [2024-06-28 08:06:23,852][06674] Fps is (10 sec: 44228.1, 60 sec: 44236.8, 300 sec: 44097.6). Total num frames: 3004203008. Throughput: 0: 44057.5. Samples: 2907150460. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 08:06:23,853][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:06:24,091][06909] Updated weights for policy 0, policy_version 183363 (0.0029) [2024-06-28 08:06:28,098][06909] Updated weights for policy 0, policy_version 183373 (0.0026) [2024-06-28 08:06:28,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3004416000. Throughput: 0: 44318.2. Samples: 2907292220. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 08:06:28,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:06:31,464][06909] Updated weights for policy 0, policy_version 183383 (0.0021) [2024-06-28 08:06:33,850][06674] Fps is (10 sec: 40968.7, 60 sec: 43967.9, 300 sec: 43986.9). Total num frames: 3004612608. Throughput: 0: 44262.3. Samples: 2907555980. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 08:06:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:06:35,354][06909] Updated weights for policy 0, policy_version 183393 (0.0037) [2024-06-28 08:06:38,853][06674] Fps is (10 sec: 44224.7, 60 sec: 43961.7, 300 sec: 44042.0). Total num frames: 3004858368. Throughput: 0: 44180.4. Samples: 2907819780. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 08:06:38,853][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:06:38,920][06909] Updated weights for policy 0, policy_version 183403 (0.0038) [2024-06-28 08:06:42,824][06909] Updated weights for policy 0, policy_version 183413 (0.0032) [2024-06-28 08:06:43,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 3005071360. Throughput: 0: 44210.7. Samples: 2907956580. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 08:06:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:06:46,190][06909] Updated weights for policy 0, policy_version 183423 (0.0038) [2024-06-28 08:06:48,850][06674] Fps is (10 sec: 42610.1, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 3005284352. Throughput: 0: 44195.0. Samples: 2908213960. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2024-06-28 08:06:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:06:50,721][06909] Updated weights for policy 0, policy_version 183433 (0.0035) [2024-06-28 08:06:53,562][06909] Updated weights for policy 0, policy_version 183443 (0.0033) [2024-06-28 08:06:53,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 3005530112. Throughput: 0: 44090.2. Samples: 2908473360. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2024-06-28 08:06:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:06:58,060][06909] Updated weights for policy 0, policy_version 183453 (0.0037) [2024-06-28 08:06:58,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3005726720. Throughput: 0: 44205.0. Samples: 2908615360. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2024-06-28 08:06:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:07:01,191][06909] Updated weights for policy 0, policy_version 183463 (0.0026) [2024-06-28 08:07:03,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3005939712. Throughput: 0: 44131.9. Samples: 2908874600. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2024-06-28 08:07:03,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:07:05,363][06909] Updated weights for policy 0, policy_version 183473 (0.0028) [2024-06-28 08:07:08,532][06909] Updated weights for policy 0, policy_version 183483 (0.0034) [2024-06-28 08:07:08,850][06674] Fps is (10 sec: 45872.5, 60 sec: 43963.3, 300 sec: 44042.3). Total num frames: 3006185472. Throughput: 0: 44247.7. Samples: 2909141540. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2024-06-28 08:07:08,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 08:07:12,650][06909] Updated weights for policy 0, policy_version 183493 (0.0031) [2024-06-28 08:07:13,850][06674] Fps is (10 sec: 45874.5, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3006398464. Throughput: 0: 44136.8. Samples: 2909278380. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2024-06-28 08:07:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:07:16,017][06909] Updated weights for policy 0, policy_version 183503 (0.0044) [2024-06-28 08:07:18,850][06674] Fps is (10 sec: 42601.0, 60 sec: 44236.8, 300 sec: 43987.3). Total num frames: 3006611456. Throughput: 0: 43982.2. Samples: 2909535180. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2024-06-28 08:07:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:07:19,712][06887] Signal inference workers to stop experience collection... (41300 times) [2024-06-28 08:07:19,713][06887] Signal inference workers to resume experience collection... (41300 times) [2024-06-28 08:07:19,727][06909] InferenceWorker_p0-w0: stopping experience collection (41300 times) [2024-06-28 08:07:19,727][06909] InferenceWorker_p0-w0: resuming experience collection (41300 times) [2024-06-28 08:07:20,098][06909] Updated weights for policy 0, policy_version 183513 (0.0031) [2024-06-28 08:07:23,673][06909] Updated weights for policy 0, policy_version 183523 (0.0032) [2024-06-28 08:07:23,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43965.2, 300 sec: 44097.9). Total num frames: 3006840832. Throughput: 0: 43851.6. Samples: 2909792980. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2024-06-28 08:07:23,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:07:28,091][06909] Updated weights for policy 0, policy_version 183533 (0.0033) [2024-06-28 08:07:28,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3007037440. Throughput: 0: 43718.6. Samples: 2909923920. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2024-06-28 08:07:28,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:07:31,221][06909] Updated weights for policy 0, policy_version 183543 (0.0025) [2024-06-28 08:07:33,850][06674] Fps is (10 sec: 42598.9, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3007266816. Throughput: 0: 43860.6. Samples: 2910187680. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2024-06-28 08:07:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:07:35,257][06909] Updated weights for policy 0, policy_version 183553 (0.0027) [2024-06-28 08:07:38,622][06909] Updated weights for policy 0, policy_version 183563 (0.0024) [2024-06-28 08:07:38,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43965.8, 300 sec: 44042.4). Total num frames: 3007496192. Throughput: 0: 44105.3. Samples: 2910458100. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2024-06-28 08:07:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:07:42,462][06909] Updated weights for policy 0, policy_version 183573 (0.0028) [2024-06-28 08:07:43,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3007709184. Throughput: 0: 43920.5. Samples: 2910591780. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2024-06-28 08:07:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:07:46,029][06909] Updated weights for policy 0, policy_version 183583 (0.0031) [2024-06-28 08:07:48,852][06674] Fps is (10 sec: 44227.7, 60 sec: 44235.3, 300 sec: 43986.6). Total num frames: 3007938560. Throughput: 0: 43951.8. Samples: 2910852520. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2024-06-28 08:07:48,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:07:48,868][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000183590_3007938560.pth... [2024-06-28 08:07:48,920][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000182945_2997370880.pth [2024-06-28 08:07:49,733][06909] Updated weights for policy 0, policy_version 183593 (0.0026) [2024-06-28 08:07:53,256][06909] Updated weights for policy 0, policy_version 183603 (0.0025) [2024-06-28 08:07:53,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 3008151552. Throughput: 0: 43941.9. Samples: 2911118900. Policy #0 lag: (min: 0.0, avg: 8.7, max: 19.0) [2024-06-28 08:07:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:07:57,748][06909] Updated weights for policy 0, policy_version 183613 (0.0032) [2024-06-28 08:07:58,850][06674] Fps is (10 sec: 44246.1, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3008380928. Throughput: 0: 43918.9. Samples: 2911254720. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 08:07:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:08:00,823][06909] Updated weights for policy 0, policy_version 183623 (0.0036) [2024-06-28 08:08:03,852][06674] Fps is (10 sec: 44227.9, 60 sec: 44235.3, 300 sec: 43986.6). Total num frames: 3008593920. Throughput: 0: 43946.0. Samples: 2911512840. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 08:08:03,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:08:05,209][06909] Updated weights for policy 0, policy_version 183633 (0.0033) [2024-06-28 08:08:08,188][06909] Updated weights for policy 0, policy_version 183643 (0.0030) [2024-06-28 08:08:08,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43691.1, 300 sec: 43986.9). Total num frames: 3008806912. Throughput: 0: 43966.3. Samples: 2911771460. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 08:08:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:08:12,431][06909] Updated weights for policy 0, policy_version 183653 (0.0024) [2024-06-28 08:08:13,850][06674] Fps is (10 sec: 42606.9, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 3009019904. Throughput: 0: 44060.0. Samples: 2911906620. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 08:08:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:08:16,061][06909] Updated weights for policy 0, policy_version 183663 (0.0032) [2024-06-28 08:08:18,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.7, 300 sec: 43987.2). Total num frames: 3009249280. Throughput: 0: 44040.5. Samples: 2912169500. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 08:08:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:08:19,645][06909] Updated weights for policy 0, policy_version 183673 (0.0024) [2024-06-28 08:08:23,169][06909] Updated weights for policy 0, policy_version 183683 (0.0028) [2024-06-28 08:08:23,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3009462272. Throughput: 0: 43996.4. Samples: 2912437940. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 08:08:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:08:26,814][06909] Updated weights for policy 0, policy_version 183693 (0.0032) [2024-06-28 08:08:28,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3009691648. Throughput: 0: 43950.1. Samples: 2912569540. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 08:08:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:08:30,648][06909] Updated weights for policy 0, policy_version 183703 (0.0028) [2024-06-28 08:08:33,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44509.8, 300 sec: 44097.9). Total num frames: 3009937408. Throughput: 0: 44171.8. Samples: 2912840160. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 08:08:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:08:34,770][06909] Updated weights for policy 0, policy_version 183713 (0.0033) [2024-06-28 08:08:37,889][06909] Updated weights for policy 0, policy_version 183723 (0.0030) [2024-06-28 08:08:38,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3010134016. Throughput: 0: 44148.6. Samples: 2913105580. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 08:08:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:08:41,956][06909] Updated weights for policy 0, policy_version 183733 (0.0027) [2024-06-28 08:08:43,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3010347008. Throughput: 0: 44073.8. Samples: 2913238040. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 08:08:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:08:45,275][06909] Updated weights for policy 0, policy_version 183743 (0.0033) [2024-06-28 08:08:48,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43965.3, 300 sec: 44042.4). Total num frames: 3010576384. Throughput: 0: 44132.7. Samples: 2913498720. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 08:08:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:08:49,258][06909] Updated weights for policy 0, policy_version 183753 (0.0025) [2024-06-28 08:08:52,934][06909] Updated weights for policy 0, policy_version 183763 (0.0034) [2024-06-28 08:08:53,855][06674] Fps is (10 sec: 44212.6, 60 sec: 43959.8, 300 sec: 44041.6). Total num frames: 3010789376. Throughput: 0: 44232.0. Samples: 2913762140. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 08:08:53,864][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 08:08:56,647][06909] Updated weights for policy 0, policy_version 183773 (0.0033) [2024-06-28 08:08:58,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3011002368. Throughput: 0: 44064.5. Samples: 2913889520. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 08:08:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:09:00,803][06909] Updated weights for policy 0, policy_version 183783 (0.0028) [2024-06-28 08:09:03,850][06674] Fps is (10 sec: 45900.3, 60 sec: 44238.3, 300 sec: 44098.0). Total num frames: 3011248128. Throughput: 0: 44213.7. Samples: 2914159120. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 08:09:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:09:03,895][06909] Updated weights for policy 0, policy_version 183793 (0.0032) [2024-06-28 08:09:06,739][06887] Signal inference workers to stop experience collection... (41350 times) [2024-06-28 08:09:06,741][06887] Signal inference workers to resume experience collection... (41350 times) [2024-06-28 08:09:06,786][06909] InferenceWorker_p0-w0: stopping experience collection (41350 times) [2024-06-28 08:09:06,786][06909] InferenceWorker_p0-w0: resuming experience collection (41350 times) [2024-06-28 08:09:07,970][06909] Updated weights for policy 0, policy_version 183803 (0.0044) [2024-06-28 08:09:08,852][06674] Fps is (10 sec: 45865.7, 60 sec: 44235.3, 300 sec: 44042.1). Total num frames: 3011461120. Throughput: 0: 44068.7. Samples: 2914421120. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 08:09:08,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:09:12,163][06909] Updated weights for policy 0, policy_version 183813 (0.0028) [2024-06-28 08:09:13,852][06674] Fps is (10 sec: 42589.4, 60 sec: 44235.3, 300 sec: 44042.1). Total num frames: 3011674112. Throughput: 0: 44184.7. Samples: 2914557940. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 08:09:13,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:09:15,418][06909] Updated weights for policy 0, policy_version 183823 (0.0032) [2024-06-28 08:09:18,850][06674] Fps is (10 sec: 44245.4, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 3011903488. Throughput: 0: 43899.5. Samples: 2914815640. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 08:09:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:09:19,367][06909] Updated weights for policy 0, policy_version 183833 (0.0031) [2024-06-28 08:09:22,834][06909] Updated weights for policy 0, policy_version 183843 (0.0042) [2024-06-28 08:09:23,850][06674] Fps is (10 sec: 44246.2, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 3012116480. Throughput: 0: 43963.5. Samples: 2915083940. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 08:09:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:09:26,588][06909] Updated weights for policy 0, policy_version 183853 (0.0031) [2024-06-28 08:09:28,850][06674] Fps is (10 sec: 44237.6, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 3012345856. Throughput: 0: 43890.7. Samples: 2915213120. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 08:09:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:09:30,484][06909] Updated weights for policy 0, policy_version 183863 (0.0032) [2024-06-28 08:09:33,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3012558848. Throughput: 0: 43936.9. Samples: 2915475880. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 08:09:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 08:09:34,139][06909] Updated weights for policy 0, policy_version 183873 (0.0029) [2024-06-28 08:09:37,690][06909] Updated weights for policy 0, policy_version 183883 (0.0027) [2024-06-28 08:09:38,850][06674] Fps is (10 sec: 44236.1, 60 sec: 44236.7, 300 sec: 44042.9). Total num frames: 3012788224. Throughput: 0: 43977.7. Samples: 2915740900. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 08:09:38,851][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 08:09:41,910][06909] Updated weights for policy 0, policy_version 183893 (0.0037) [2024-06-28 08:09:43,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3013001216. Throughput: 0: 44147.6. Samples: 2915876160. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 08:09:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:09:45,143][06909] Updated weights for policy 0, policy_version 183903 (0.0028) [2024-06-28 08:09:48,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3013214208. Throughput: 0: 43955.6. Samples: 2916137120. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 08:09:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:09:48,864][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000183912_3013214208.pth... [2024-06-28 08:09:48,920][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000183267_3002646528.pth [2024-06-28 08:09:49,307][06909] Updated weights for policy 0, policy_version 183913 (0.0019) [2024-06-28 08:09:52,927][06909] Updated weights for policy 0, policy_version 183923 (0.0033) [2024-06-28 08:09:53,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44240.8, 300 sec: 43986.9). Total num frames: 3013443584. Throughput: 0: 44028.3. Samples: 2916402300. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 08:09:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:09:56,525][06909] Updated weights for policy 0, policy_version 183933 (0.0025) [2024-06-28 08:09:58,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3013656576. Throughput: 0: 43980.3. Samples: 2916536960. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 08:09:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:10:00,271][06909] Updated weights for policy 0, policy_version 183943 (0.0040) [2024-06-28 08:10:03,661][06909] Updated weights for policy 0, policy_version 183953 (0.0031) [2024-06-28 08:10:03,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3013885952. Throughput: 0: 44097.0. Samples: 2916800000. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 08:10:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:10:07,576][06909] Updated weights for policy 0, policy_version 183963 (0.0036) [2024-06-28 08:10:08,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43965.2, 300 sec: 44042.4). Total num frames: 3014098944. Throughput: 0: 44093.7. Samples: 2917068160. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 08:10:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:10:11,271][06909] Updated weights for policy 0, policy_version 183973 (0.0028) [2024-06-28 08:10:13,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44238.4, 300 sec: 44098.0). Total num frames: 3014328320. Throughput: 0: 44240.4. Samples: 2917203940. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 08:10:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:10:14,986][06909] Updated weights for policy 0, policy_version 183983 (0.0024) [2024-06-28 08:10:18,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43690.8, 300 sec: 43987.2). Total num frames: 3014524928. Throughput: 0: 44146.7. Samples: 2917462480. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 08:10:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:10:18,878][06909] Updated weights for policy 0, policy_version 183993 (0.0027) [2024-06-28 08:10:22,190][06909] Updated weights for policy 0, policy_version 184003 (0.0020) [2024-06-28 08:10:23,850][06674] Fps is (10 sec: 44236.2, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 3014770688. Throughput: 0: 44098.7. Samples: 2917725340. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 08:10:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:10:26,400][06909] Updated weights for policy 0, policy_version 184013 (0.0033) [2024-06-28 08:10:28,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44236.8, 300 sec: 44154.3). Total num frames: 3015000064. Throughput: 0: 44084.4. Samples: 2917859960. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 08:10:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:10:29,986][06909] Updated weights for policy 0, policy_version 184023 (0.0032) [2024-06-28 08:10:33,651][06909] Updated weights for policy 0, policy_version 184033 (0.0025) [2024-06-28 08:10:33,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3015213056. Throughput: 0: 44097.3. Samples: 2918121500. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 08:10:33,850][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 08:10:37,419][06909] Updated weights for policy 0, policy_version 184043 (0.0021) [2024-06-28 08:10:38,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43690.8, 300 sec: 44098.0). Total num frames: 3015409664. Throughput: 0: 44205.3. Samples: 2918391540. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 08:10:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:10:40,849][06909] Updated weights for policy 0, policy_version 184053 (0.0039) [2024-06-28 08:10:43,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3015639040. Throughput: 0: 43919.1. Samples: 2918513320. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 08:10:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:10:44,709][06909] Updated weights for policy 0, policy_version 184063 (0.0033) [2024-06-28 08:10:48,238][06909] Updated weights for policy 0, policy_version 184073 (0.0032) [2024-06-28 08:10:48,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3015852032. Throughput: 0: 44098.7. Samples: 2918784440. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 08:10:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:10:51,875][06909] Updated weights for policy 0, policy_version 184083 (0.0035) [2024-06-28 08:10:53,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3016081408. Throughput: 0: 44044.5. Samples: 2919050160. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 08:10:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:10:56,017][06909] Updated weights for policy 0, policy_version 184093 (0.0034) [2024-06-28 08:10:56,705][06887] Signal inference workers to stop experience collection... (41400 times) [2024-06-28 08:10:56,705][06887] Signal inference workers to resume experience collection... (41400 times) [2024-06-28 08:10:56,731][06909] InferenceWorker_p0-w0: stopping experience collection (41400 times) [2024-06-28 08:10:56,731][06909] InferenceWorker_p0-w0: resuming experience collection (41400 times) [2024-06-28 08:10:58,852][06674] Fps is (10 sec: 45865.6, 60 sec: 44235.2, 300 sec: 44097.6). Total num frames: 3016310784. Throughput: 0: 43858.8. Samples: 2919177680. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 08:10:58,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:10:59,501][06909] Updated weights for policy 0, policy_version 184103 (0.0026) [2024-06-28 08:11:03,207][06909] Updated weights for policy 0, policy_version 184113 (0.0032) [2024-06-28 08:11:03,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 3016507392. Throughput: 0: 43987.4. Samples: 2919441920. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 08:11:03,851][06674] Avg episode reward: [(0, '0.417')] [2024-06-28 08:11:07,090][06909] Updated weights for policy 0, policy_version 184123 (0.0028) [2024-06-28 08:11:08,850][06674] Fps is (10 sec: 40968.2, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 3016720384. Throughput: 0: 44075.1. Samples: 2919708720. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 08:11:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:11:10,515][06909] Updated weights for policy 0, policy_version 184133 (0.0025) [2024-06-28 08:11:13,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 3016966144. Throughput: 0: 43921.3. Samples: 2919836420. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-28 08:11:13,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:11:14,562][06909] Updated weights for policy 0, policy_version 184143 (0.0040) [2024-06-28 08:11:17,879][06909] Updated weights for policy 0, policy_version 184153 (0.0032) [2024-06-28 08:11:18,850][06674] Fps is (10 sec: 47514.2, 60 sec: 44509.9, 300 sec: 44042.7). Total num frames: 3017195520. Throughput: 0: 44035.6. Samples: 2920103100. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-28 08:11:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:11:21,754][06909] Updated weights for policy 0, policy_version 184163 (0.0034) [2024-06-28 08:11:23,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43690.8, 300 sec: 43986.9). Total num frames: 3017392128. Throughput: 0: 44099.6. Samples: 2920376020. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-28 08:11:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:11:25,550][06909] Updated weights for policy 0, policy_version 184173 (0.0034) [2024-06-28 08:11:28,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.6, 300 sec: 44097.9). Total num frames: 3017621504. Throughput: 0: 44049.2. Samples: 2920495540. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-28 08:11:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:11:29,175][06909] Updated weights for policy 0, policy_version 184183 (0.0044) [2024-06-28 08:11:33,211][06909] Updated weights for policy 0, policy_version 184193 (0.0039) [2024-06-28 08:11:33,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43690.7, 300 sec: 43987.3). Total num frames: 3017834496. Throughput: 0: 43941.3. Samples: 2920761800. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-28 08:11:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:11:36,911][06909] Updated weights for policy 0, policy_version 184203 (0.0035) [2024-06-28 08:11:38,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3018047488. Throughput: 0: 43944.9. Samples: 2921027680. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-28 08:11:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:11:40,404][06909] Updated weights for policy 0, policy_version 184213 (0.0027) [2024-06-28 08:11:43,850][06674] Fps is (10 sec: 44236.0, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 3018276864. Throughput: 0: 43970.8. Samples: 2921156280. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-28 08:11:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:11:44,432][06909] Updated weights for policy 0, policy_version 184223 (0.0036) [2024-06-28 08:11:47,604][06909] Updated weights for policy 0, policy_version 184233 (0.0038) [2024-06-28 08:11:48,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3018506240. Throughput: 0: 43952.1. Samples: 2921419760. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-28 08:11:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:11:48,861][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000184235_3018506240.pth... [2024-06-28 08:11:48,913][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000183590_3007938560.pth [2024-06-28 08:11:51,759][06909] Updated weights for policy 0, policy_version 184243 (0.0025) [2024-06-28 08:11:53,850][06674] Fps is (10 sec: 42599.5, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3018702848. Throughput: 0: 43981.5. Samples: 2921687880. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-28 08:11:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:11:55,273][06909] Updated weights for policy 0, policy_version 184253 (0.0031) [2024-06-28 08:11:58,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43965.2, 300 sec: 44097.9). Total num frames: 3018948608. Throughput: 0: 44112.4. Samples: 2921821480. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-28 08:11:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:11:59,094][06909] Updated weights for policy 0, policy_version 184263 (0.0028) [2024-06-28 08:12:02,969][06909] Updated weights for policy 0, policy_version 184273 (0.0031) [2024-06-28 08:12:03,852][06674] Fps is (10 sec: 45865.4, 60 sec: 44235.4, 300 sec: 43986.7). Total num frames: 3019161600. Throughput: 0: 44052.6. Samples: 2922085560. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-28 08:12:03,852][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 08:12:06,737][06909] Updated weights for policy 0, policy_version 184283 (0.0047) [2024-06-28 08:12:08,850][06674] Fps is (10 sec: 42598.8, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 3019374592. Throughput: 0: 43670.2. Samples: 2922341180. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-28 08:12:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:12:10,613][06909] Updated weights for policy 0, policy_version 184293 (0.0033) [2024-06-28 08:12:13,850][06674] Fps is (10 sec: 44245.2, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3019603968. Throughput: 0: 43884.3. Samples: 2922470340. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-28 08:12:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:12:14,095][06909] Updated weights for policy 0, policy_version 184303 (0.0036) [2024-06-28 08:12:18,108][06909] Updated weights for policy 0, policy_version 184313 (0.0037) [2024-06-28 08:12:18,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3019833344. Throughput: 0: 43849.8. Samples: 2922735040. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 08:12:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:12:21,810][06909] Updated weights for policy 0, policy_version 184323 (0.0036) [2024-06-28 08:12:23,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3020029952. Throughput: 0: 43773.3. Samples: 2922997480. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 08:12:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:12:25,590][06909] Updated weights for policy 0, policy_version 184333 (0.0024) [2024-06-28 08:12:28,852][06674] Fps is (10 sec: 42589.2, 60 sec: 43962.2, 300 sec: 44042.1). Total num frames: 3020259328. Throughput: 0: 43811.4. Samples: 2923127880. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 08:12:28,853][06674] Avg episode reward: [(0, '0.490')] [2024-06-28 08:12:28,973][06909] Updated weights for policy 0, policy_version 184343 (0.0030) [2024-06-28 08:12:30,216][06887] Signal inference workers to stop experience collection... (41450 times) [2024-06-28 08:12:30,216][06887] Signal inference workers to resume experience collection... (41450 times) [2024-06-28 08:12:30,253][06909] InferenceWorker_p0-w0: stopping experience collection (41450 times) [2024-06-28 08:12:30,253][06909] InferenceWorker_p0-w0: resuming experience collection (41450 times) [2024-06-28 08:12:32,887][06909] Updated weights for policy 0, policy_version 184353 (0.0022) [2024-06-28 08:12:33,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3020488704. Throughput: 0: 43988.9. Samples: 2923399260. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 08:12:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:12:36,412][06909] Updated weights for policy 0, policy_version 184363 (0.0030) [2024-06-28 08:12:38,850][06674] Fps is (10 sec: 44246.1, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3020701696. Throughput: 0: 43890.1. Samples: 2923662940. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 08:12:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:12:40,556][06909] Updated weights for policy 0, policy_version 184373 (0.0043) [2024-06-28 08:12:43,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.8, 300 sec: 43987.2). Total num frames: 3020914688. Throughput: 0: 43902.2. Samples: 2923797080. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 08:12:43,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:12:44,044][06909] Updated weights for policy 0, policy_version 184383 (0.0037) [2024-06-28 08:12:48,013][06909] Updated weights for policy 0, policy_version 184393 (0.0031) [2024-06-28 08:12:48,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3021144064. Throughput: 0: 44016.3. Samples: 2924066200. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 08:12:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:12:51,489][06909] Updated weights for policy 0, policy_version 184403 (0.0034) [2024-06-28 08:12:53,850][06674] Fps is (10 sec: 44237.3, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3021357056. Throughput: 0: 44081.3. Samples: 2924324840. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 08:12:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:12:55,309][06909] Updated weights for policy 0, policy_version 184413 (0.0040) [2024-06-28 08:12:58,814][06909] Updated weights for policy 0, policy_version 184423 (0.0029) [2024-06-28 08:12:58,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.7, 300 sec: 44042.7). Total num frames: 3021586432. Throughput: 0: 44094.3. Samples: 2924454580. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 08:12:58,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 08:13:02,834][06909] Updated weights for policy 0, policy_version 184433 (0.0028) [2024-06-28 08:13:03,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43965.2, 300 sec: 44042.4). Total num frames: 3021799424. Throughput: 0: 44030.6. Samples: 2924716420. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 08:13:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:13:06,504][06909] Updated weights for policy 0, policy_version 184443 (0.0032) [2024-06-28 08:13:08,851][06674] Fps is (10 sec: 44234.2, 60 sec: 44236.3, 300 sec: 44097.9). Total num frames: 3022028800. Throughput: 0: 44017.1. Samples: 2924978280. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 08:13:08,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:13:10,369][06909] Updated weights for policy 0, policy_version 184453 (0.0032) [2024-06-28 08:13:13,752][06909] Updated weights for policy 0, policy_version 184463 (0.0034) [2024-06-28 08:13:13,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3022241792. Throughput: 0: 44020.7. Samples: 2925108720. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 08:13:13,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 08:13:17,916][06909] Updated weights for policy 0, policy_version 184473 (0.0024) [2024-06-28 08:13:18,850][06674] Fps is (10 sec: 40962.7, 60 sec: 43417.5, 300 sec: 43986.9). Total num frames: 3022438400. Throughput: 0: 43980.4. Samples: 2925378380. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 08:13:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:13:21,583][06909] Updated weights for policy 0, policy_version 184483 (0.0033) [2024-06-28 08:13:23,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3022667776. Throughput: 0: 43860.5. Samples: 2925636660. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 08:13:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:13:25,277][06909] Updated weights for policy 0, policy_version 184493 (0.0031) [2024-06-28 08:13:28,848][06909] Updated weights for policy 0, policy_version 184503 (0.0035) [2024-06-28 08:13:28,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43965.2, 300 sec: 43931.3). Total num frames: 3022897152. Throughput: 0: 43708.0. Samples: 2925763940. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 08:13:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:13:32,873][06909] Updated weights for policy 0, policy_version 184513 (0.0034) [2024-06-28 08:13:33,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43690.6, 300 sec: 43986.8). Total num frames: 3023110144. Throughput: 0: 43657.2. Samples: 2926030780. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 08:13:33,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 08:13:36,022][06909] Updated weights for policy 0, policy_version 184523 (0.0025) [2024-06-28 08:13:38,850][06674] Fps is (10 sec: 42599.2, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3023323136. Throughput: 0: 43864.0. Samples: 2926298720. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 08:13:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 08:13:40,185][06909] Updated weights for policy 0, policy_version 184533 (0.0034) [2024-06-28 08:13:43,080][06909] Updated weights for policy 0, policy_version 184543 (0.0035) [2024-06-28 08:13:43,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3023552512. Throughput: 0: 43913.0. Samples: 2926430660. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 08:13:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:13:47,784][06909] Updated weights for policy 0, policy_version 184553 (0.0040) [2024-06-28 08:13:48,850][06674] Fps is (10 sec: 44236.0, 60 sec: 43690.6, 300 sec: 43987.7). Total num frames: 3023765504. Throughput: 0: 44014.6. Samples: 2926697080. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 08:13:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:13:48,900][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000184557_3023781888.pth... [2024-06-28 08:13:48,957][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000183912_3013214208.pth [2024-06-28 08:13:51,055][06909] Updated weights for policy 0, policy_version 184563 (0.0037) [2024-06-28 08:13:53,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 3023978496. Throughput: 0: 43910.5. Samples: 2926954220. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 08:13:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:13:55,135][06909] Updated weights for policy 0, policy_version 184573 (0.0022) [2024-06-28 08:13:57,123][06887] Signal inference workers to stop experience collection... (41500 times) [2024-06-28 08:13:57,174][06909] InferenceWorker_p0-w0: stopping experience collection (41500 times) [2024-06-28 08:13:57,180][06887] Signal inference workers to resume experience collection... (41500 times) [2024-06-28 08:13:57,185][06909] InferenceWorker_p0-w0: resuming experience collection (41500 times) [2024-06-28 08:13:58,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43417.6, 300 sec: 43875.8). Total num frames: 3024191488. Throughput: 0: 43942.2. Samples: 2927086120. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 08:13:58,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 08:13:58,873][06909] Updated weights for policy 0, policy_version 184583 (0.0036) [2024-06-28 08:14:02,417][06909] Updated weights for policy 0, policy_version 184593 (0.0037) [2024-06-28 08:14:03,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43417.7, 300 sec: 43876.1). Total num frames: 3024404480. Throughput: 0: 43677.0. Samples: 2927343840. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 08:14:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:14:06,193][06909] Updated weights for policy 0, policy_version 184603 (0.0037) [2024-06-28 08:14:08,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43418.1, 300 sec: 43931.6). Total num frames: 3024633856. Throughput: 0: 43792.8. Samples: 2927607340. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 08:14:08,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 08:14:09,920][06909] Updated weights for policy 0, policy_version 184613 (0.0043) [2024-06-28 08:14:13,392][06909] Updated weights for policy 0, policy_version 184623 (0.0035) [2024-06-28 08:14:13,850][06674] Fps is (10 sec: 45874.3, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 3024863232. Throughput: 0: 43814.6. Samples: 2927735600. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 08:14:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:14:17,543][06909] Updated weights for policy 0, policy_version 184633 (0.0039) [2024-06-28 08:14:18,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 3025092608. Throughput: 0: 43953.5. Samples: 2928008680. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 08:14:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:14:21,086][06909] Updated weights for policy 0, policy_version 184643 (0.0033) [2024-06-28 08:14:23,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 3025305600. Throughput: 0: 43888.3. Samples: 2928273700. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 08:14:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:14:25,088][06909] Updated weights for policy 0, policy_version 184653 (0.0037) [2024-06-28 08:14:28,432][06909] Updated weights for policy 0, policy_version 184663 (0.0034) [2024-06-28 08:14:28,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3025534976. Throughput: 0: 43864.0. Samples: 2928404540. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 08:14:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:14:32,309][06909] Updated weights for policy 0, policy_version 184673 (0.0031) [2024-06-28 08:14:33,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 3025731584. Throughput: 0: 43818.2. Samples: 2928668900. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 08:14:33,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:14:35,762][06909] Updated weights for policy 0, policy_version 184683 (0.0022) [2024-06-28 08:14:38,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 3025944576. Throughput: 0: 43909.8. Samples: 2928930160. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 08:14:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:14:39,665][06909] Updated weights for policy 0, policy_version 184693 (0.0037) [2024-06-28 08:14:43,491][06909] Updated weights for policy 0, policy_version 184703 (0.0032) [2024-06-28 08:14:43,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3026190336. Throughput: 0: 43841.2. Samples: 2929058980. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 08:14:43,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:14:47,327][06909] Updated weights for policy 0, policy_version 184713 (0.0038) [2024-06-28 08:14:48,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 3026403328. Throughput: 0: 44002.7. Samples: 2929323960. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 08:14:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:14:50,841][06909] Updated weights for policy 0, policy_version 184723 (0.0040) [2024-06-28 08:14:53,850][06674] Fps is (10 sec: 44237.6, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3026632704. Throughput: 0: 44102.8. Samples: 2929591960. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 08:14:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:14:54,926][06909] Updated weights for policy 0, policy_version 184733 (0.0025) [2024-06-28 08:14:58,354][06909] Updated weights for policy 0, policy_version 184743 (0.0037) [2024-06-28 08:14:58,852][06674] Fps is (10 sec: 44227.7, 60 sec: 44235.3, 300 sec: 43931.0). Total num frames: 3026845696. Throughput: 0: 44098.2. Samples: 2929720100. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 08:14:58,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:15:02,341][06909] Updated weights for policy 0, policy_version 184753 (0.0041) [2024-06-28 08:15:03,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 3027058688. Throughput: 0: 43872.4. Samples: 2929982940. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 08:15:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:15:05,751][06909] Updated weights for policy 0, policy_version 184763 (0.0032) [2024-06-28 08:15:08,850][06674] Fps is (10 sec: 42606.7, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 3027271680. Throughput: 0: 43890.6. Samples: 2930248780. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 08:15:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:15:09,566][06909] Updated weights for policy 0, policy_version 184773 (0.0034) [2024-06-28 08:15:12,993][06909] Updated weights for policy 0, policy_version 184783 (0.0024) [2024-06-28 08:15:13,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.9, 300 sec: 43986.9). Total num frames: 3027501056. Throughput: 0: 43810.0. Samples: 2930375980. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 08:15:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:15:17,116][06909] Updated weights for policy 0, policy_version 184793 (0.0041) [2024-06-28 08:15:18,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 3027714048. Throughput: 0: 43727.3. Samples: 2930636620. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 08:15:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:15:19,563][06887] Signal inference workers to stop experience collection... (41550 times) [2024-06-28 08:15:19,587][06909] InferenceWorker_p0-w0: stopping experience collection (41550 times) [2024-06-28 08:15:19,626][06887] Signal inference workers to resume experience collection... (41550 times) [2024-06-28 08:15:19,626][06909] InferenceWorker_p0-w0: resuming experience collection (41550 times) [2024-06-28 08:15:20,658][06909] Updated weights for policy 0, policy_version 184803 (0.0028) [2024-06-28 08:15:23,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 3027927040. Throughput: 0: 43876.0. Samples: 2930904580. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 08:15:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:15:24,447][06909] Updated weights for policy 0, policy_version 184813 (0.0031) [2024-06-28 08:15:28,015][06909] Updated weights for policy 0, policy_version 184823 (0.0034) [2024-06-28 08:15:28,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43417.7, 300 sec: 43820.3). Total num frames: 3028140032. Throughput: 0: 43790.4. Samples: 2931029540. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 08:15:28,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:15:32,321][06909] Updated weights for policy 0, policy_version 184833 (0.0031) [2024-06-28 08:15:33,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 3028369408. Throughput: 0: 43860.4. Samples: 2931297680. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 08:15:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 08:15:35,446][06909] Updated weights for policy 0, policy_version 184843 (0.0026) [2024-06-28 08:15:38,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 3028582400. Throughput: 0: 43872.9. Samples: 2931566240. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 08:15:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:15:39,606][06909] Updated weights for policy 0, policy_version 184853 (0.0028) [2024-06-28 08:15:43,045][06909] Updated weights for policy 0, policy_version 184863 (0.0031) [2024-06-28 08:15:43,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.8, 300 sec: 43931.3). Total num frames: 3028811776. Throughput: 0: 43772.7. Samples: 2931689780. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 08:15:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:15:46,957][06909] Updated weights for policy 0, policy_version 184873 (0.0032) [2024-06-28 08:15:48,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 3029024768. Throughput: 0: 43800.5. Samples: 2931953960. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 08:15:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:15:48,978][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000184878_3029041152.pth... [2024-06-28 08:15:49,020][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000184235_3018506240.pth [2024-06-28 08:15:50,450][06909] Updated weights for policy 0, policy_version 184883 (0.0034) [2024-06-28 08:15:53,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43417.6, 300 sec: 43820.6). Total num frames: 3029237760. Throughput: 0: 43629.9. Samples: 2932212120. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 08:15:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:15:54,738][06909] Updated weights for policy 0, policy_version 184893 (0.0042) [2024-06-28 08:15:57,991][06909] Updated weights for policy 0, policy_version 184903 (0.0027) [2024-06-28 08:15:58,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43692.1, 300 sec: 43931.4). Total num frames: 3029467136. Throughput: 0: 43615.0. Samples: 2932338660. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 08:15:58,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 08:16:01,857][06909] Updated weights for policy 0, policy_version 184913 (0.0037) [2024-06-28 08:16:03,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3029696512. Throughput: 0: 43977.7. Samples: 2932615620. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 08:16:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:16:05,450][06909] Updated weights for policy 0, policy_version 184923 (0.0027) [2024-06-28 08:16:08,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.9, 300 sec: 43931.3). Total num frames: 3029925888. Throughput: 0: 43886.6. Samples: 2932879480. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 08:16:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:16:09,204][06909] Updated weights for policy 0, policy_version 184933 (0.0029) [2024-06-28 08:16:12,865][06909] Updated weights for policy 0, policy_version 184943 (0.0032) [2024-06-28 08:16:13,850][06674] Fps is (10 sec: 44236.0, 60 sec: 43963.5, 300 sec: 43875.8). Total num frames: 3030138880. Throughput: 0: 44175.8. Samples: 2933017460. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 08:16:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:16:16,777][06909] Updated weights for policy 0, policy_version 184953 (0.0033) [2024-06-28 08:16:18,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 3030351872. Throughput: 0: 44005.7. Samples: 2933277940. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 08:16:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:16:20,338][06909] Updated weights for policy 0, policy_version 184963 (0.0042) [2024-06-28 08:16:23,850][06674] Fps is (10 sec: 44237.7, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 3030581248. Throughput: 0: 43911.1. Samples: 2933542240. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 08:16:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:16:23,952][06909] Updated weights for policy 0, policy_version 184973 (0.0030) [2024-06-28 08:16:27,723][06909] Updated weights for policy 0, policy_version 184983 (0.0035) [2024-06-28 08:16:28,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44509.8, 300 sec: 43986.9). Total num frames: 3030810624. Throughput: 0: 44088.3. Samples: 2933673760. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 08:16:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:16:31,615][06909] Updated weights for policy 0, policy_version 184993 (0.0020) [2024-06-28 08:16:33,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 3031007232. Throughput: 0: 44079.4. Samples: 2933937540. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 08:16:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:16:35,076][06909] Updated weights for policy 0, policy_version 185003 (0.0043) [2024-06-28 08:16:38,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.7, 300 sec: 43931.4). Total num frames: 3031236608. Throughput: 0: 44162.1. Samples: 2934199420. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 08:16:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:16:38,869][06909] Updated weights for policy 0, policy_version 185013 (0.0026) [2024-06-28 08:16:42,505][06909] Updated weights for policy 0, policy_version 185023 (0.0027) [2024-06-28 08:16:43,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.7, 300 sec: 43931.3). Total num frames: 3031465984. Throughput: 0: 44350.7. Samples: 2934334440. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 08:16:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 08:16:46,210][06909] Updated weights for policy 0, policy_version 185033 (0.0027) [2024-06-28 08:16:48,856][06674] Fps is (10 sec: 45847.8, 60 sec: 44505.3, 300 sec: 44041.5). Total num frames: 3031695360. Throughput: 0: 44137.2. Samples: 2934602060. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 08:16:48,856][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:16:50,094][06909] Updated weights for policy 0, policy_version 185043 (0.0048) [2024-06-28 08:16:50,980][06887] Signal inference workers to stop experience collection... (41600 times) [2024-06-28 08:16:50,982][06887] Signal inference workers to resume experience collection... (41600 times) [2024-06-28 08:16:51,012][06909] InferenceWorker_p0-w0: stopping experience collection (41600 times) [2024-06-28 08:16:51,012][06909] InferenceWorker_p0-w0: resuming experience collection (41600 times) [2024-06-28 08:16:53,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44236.8, 300 sec: 43875.8). Total num frames: 3031891968. Throughput: 0: 43950.2. Samples: 2934857240. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 08:16:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:16:54,083][06909] Updated weights for policy 0, policy_version 185053 (0.0041) [2024-06-28 08:16:57,443][06909] Updated weights for policy 0, policy_version 185063 (0.0026) [2024-06-28 08:16:58,850][06674] Fps is (10 sec: 42624.3, 60 sec: 44236.8, 300 sec: 43931.6). Total num frames: 3032121344. Throughput: 0: 43998.9. Samples: 2934997400. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 08:16:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:17:01,499][06909] Updated weights for policy 0, policy_version 185073 (0.0034) [2024-06-28 08:17:03,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 3032334336. Throughput: 0: 43989.4. Samples: 2935257460. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 08:17:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:17:05,185][06909] Updated weights for policy 0, policy_version 185083 (0.0041) [2024-06-28 08:17:08,780][06909] Updated weights for policy 0, policy_version 185093 (0.0036) [2024-06-28 08:17:08,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 3032563712. Throughput: 0: 43974.1. Samples: 2935521080. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 08:17:08,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:17:12,805][06909] Updated weights for policy 0, policy_version 185103 (0.0029) [2024-06-28 08:17:13,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44236.9, 300 sec: 43931.3). Total num frames: 3032793088. Throughput: 0: 44076.9. Samples: 2935657220. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 08:17:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:17:16,032][06909] Updated weights for policy 0, policy_version 185113 (0.0027) [2024-06-28 08:17:18,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 3032989696. Throughput: 0: 44214.2. Samples: 2935927180. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 08:17:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:17:20,103][06909] Updated weights for policy 0, policy_version 185123 (0.0026) [2024-06-28 08:17:23,758][06909] Updated weights for policy 0, policy_version 185133 (0.0022) [2024-06-28 08:17:23,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.7, 300 sec: 43931.6). Total num frames: 3033219072. Throughput: 0: 44209.8. Samples: 2936188860. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 08:17:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:17:27,306][06909] Updated weights for policy 0, policy_version 185143 (0.0025) [2024-06-28 08:17:28,850][06674] Fps is (10 sec: 45876.0, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 3033448448. Throughput: 0: 44298.8. Samples: 2936327880. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 08:17:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:17:31,270][06909] Updated weights for policy 0, policy_version 185153 (0.0026) [2024-06-28 08:17:33,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 3033645056. Throughput: 0: 43944.6. Samples: 2936579300. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 08:17:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:17:34,584][06909] Updated weights for policy 0, policy_version 185163 (0.0024) [2024-06-28 08:17:38,653][06909] Updated weights for policy 0, policy_version 185173 (0.0031) [2024-06-28 08:17:38,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 3033874432. Throughput: 0: 44123.6. Samples: 2936842800. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 08:17:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:17:42,311][06909] Updated weights for policy 0, policy_version 185183 (0.0028) [2024-06-28 08:17:43,852][06674] Fps is (10 sec: 45865.5, 60 sec: 43962.2, 300 sec: 43931.0). Total num frames: 3034103808. Throughput: 0: 43967.2. Samples: 2936976020. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 08:17:43,853][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:17:45,957][06909] Updated weights for policy 0, policy_version 185193 (0.0042) [2024-06-28 08:17:48,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43422.0, 300 sec: 43875.8). Total num frames: 3034300416. Throughput: 0: 44039.1. Samples: 2937239220. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 08:17:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:17:48,859][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000185200_3034316800.pth... [2024-06-28 08:17:48,911][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000184557_3023781888.pth [2024-06-28 08:17:49,890][06909] Updated weights for policy 0, policy_version 185203 (0.0029) [2024-06-28 08:17:53,114][06909] Updated weights for policy 0, policy_version 185213 (0.0029) [2024-06-28 08:17:53,856][06674] Fps is (10 sec: 42581.3, 60 sec: 43959.2, 300 sec: 43874.9). Total num frames: 3034529792. Throughput: 0: 44075.9. Samples: 2937504760. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 08:17:53,857][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:17:57,133][06909] Updated weights for policy 0, policy_version 185223 (0.0032) [2024-06-28 08:17:58,850][06674] Fps is (10 sec: 45874.3, 60 sec: 43963.6, 300 sec: 43931.3). Total num frames: 3034759168. Throughput: 0: 44100.8. Samples: 2937641760. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 08:17:58,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 08:18:00,692][06909] Updated weights for policy 0, policy_version 185233 (0.0030) [2024-06-28 08:18:03,850][06674] Fps is (10 sec: 44263.9, 60 sec: 43963.7, 300 sec: 43875.9). Total num frames: 3034972160. Throughput: 0: 43893.0. Samples: 2937902360. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 08:18:03,850][06674] Avg episode reward: [(0, '0.499')] [2024-06-28 08:18:04,748][06909] Updated weights for policy 0, policy_version 185243 (0.0041) [2024-06-28 08:18:08,246][06909] Updated weights for policy 0, policy_version 185253 (0.0027) [2024-06-28 08:18:08,850][06674] Fps is (10 sec: 44237.6, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 3035201536. Throughput: 0: 43961.4. Samples: 2938167120. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 08:18:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:18:12,153][06909] Updated weights for policy 0, policy_version 185263 (0.0040) [2024-06-28 08:18:13,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3035414528. Throughput: 0: 43772.8. Samples: 2938297660. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 08:18:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:18:15,527][06909] Updated weights for policy 0, policy_version 185273 (0.0041) [2024-06-28 08:18:18,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3035643904. Throughput: 0: 44039.1. Samples: 2938561060. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 08:18:18,864][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:18:19,702][06909] Updated weights for policy 0, policy_version 185283 (0.0043) [2024-06-28 08:18:22,858][06909] Updated weights for policy 0, policy_version 185293 (0.0036) [2024-06-28 08:18:23,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 3035856896. Throughput: 0: 44069.3. Samples: 2938825920. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 08:18:23,859][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:18:26,944][06909] Updated weights for policy 0, policy_version 185303 (0.0042) [2024-06-28 08:18:28,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 3036102656. Throughput: 0: 44148.7. Samples: 2938962620. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 08:18:28,859][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 08:18:30,274][06909] Updated weights for policy 0, policy_version 185313 (0.0029) [2024-06-28 08:18:33,850][06674] Fps is (10 sec: 45873.7, 60 sec: 44509.6, 300 sec: 44042.3). Total num frames: 3036315648. Throughput: 0: 44166.3. Samples: 2939226720. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 08:18:33,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:18:34,316][06909] Updated weights for policy 0, policy_version 185323 (0.0048) [2024-06-28 08:18:36,471][06887] Signal inference workers to stop experience collection... (41650 times) [2024-06-28 08:18:36,471][06887] Signal inference workers to resume experience collection... (41650 times) [2024-06-28 08:18:36,485][06909] InferenceWorker_p0-w0: stopping experience collection (41650 times) [2024-06-28 08:18:36,485][06909] InferenceWorker_p0-w0: resuming experience collection (41650 times) [2024-06-28 08:18:37,610][06909] Updated weights for policy 0, policy_version 185333 (0.0032) [2024-06-28 08:18:38,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 3036512256. Throughput: 0: 44051.8. Samples: 2939486820. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 08:18:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:18:41,618][06909] Updated weights for policy 0, policy_version 185343 (0.0034) [2024-06-28 08:18:43,850][06674] Fps is (10 sec: 44238.4, 60 sec: 44238.3, 300 sec: 44042.4). Total num frames: 3036758016. Throughput: 0: 44006.8. Samples: 2939622060. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 08:18:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:18:45,285][06909] Updated weights for policy 0, policy_version 185353 (0.0030) [2024-06-28 08:18:48,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 3036971008. Throughput: 0: 44091.1. Samples: 2939886460. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 08:18:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:18:48,928][06909] Updated weights for policy 0, policy_version 185363 (0.0020) [2024-06-28 08:18:52,519][06909] Updated weights for policy 0, policy_version 185373 (0.0025) [2024-06-28 08:18:53,850][06674] Fps is (10 sec: 42598.7, 60 sec: 44241.3, 300 sec: 44042.4). Total num frames: 3037184000. Throughput: 0: 43988.5. Samples: 2940146600. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 08:18:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:18:56,946][06909] Updated weights for policy 0, policy_version 185383 (0.0027) [2024-06-28 08:18:58,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.9, 300 sec: 44042.4). Total num frames: 3037396992. Throughput: 0: 44072.9. Samples: 2940280940. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 08:18:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:19:00,121][06909] Updated weights for policy 0, policy_version 185393 (0.0032) [2024-06-28 08:19:03,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3037626368. Throughput: 0: 44160.9. Samples: 2940548300. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 08:19:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:19:04,190][06909] Updated weights for policy 0, policy_version 185403 (0.0032) [2024-06-28 08:19:07,434][06909] Updated weights for policy 0, policy_version 185413 (0.0024) [2024-06-28 08:19:08,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3037855744. Throughput: 0: 44079.6. Samples: 2940809500. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 08:19:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:19:11,546][06909] Updated weights for policy 0, policy_version 185423 (0.0040) [2024-06-28 08:19:13,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43963.6, 300 sec: 43931.3). Total num frames: 3038052352. Throughput: 0: 44033.3. Samples: 2940944120. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 08:19:13,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:19:14,701][06909] Updated weights for policy 0, policy_version 185433 (0.0027) [2024-06-28 08:19:18,841][06909] Updated weights for policy 0, policy_version 185443 (0.0038) [2024-06-28 08:19:18,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3038298112. Throughput: 0: 43995.0. Samples: 2941206480. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 08:19:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:19:22,626][06909] Updated weights for policy 0, policy_version 185453 (0.0035) [2024-06-28 08:19:23,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 3038494720. Throughput: 0: 43903.5. Samples: 2941462480. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 08:19:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:19:26,322][06909] Updated weights for policy 0, policy_version 185463 (0.0034) [2024-06-28 08:19:28,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 3038724096. Throughput: 0: 43829.4. Samples: 2941594380. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 08:19:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 08:19:29,893][06909] Updated weights for policy 0, policy_version 185473 (0.0035) [2024-06-28 08:19:33,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43691.0, 300 sec: 44042.4). Total num frames: 3038937088. Throughput: 0: 43937.4. Samples: 2941863640. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 08:19:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:19:33,973][06909] Updated weights for policy 0, policy_version 185483 (0.0032) [2024-06-28 08:19:37,234][06909] Updated weights for policy 0, policy_version 185493 (0.0027) [2024-06-28 08:19:38,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3039166464. Throughput: 0: 44043.0. Samples: 2942128540. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 08:19:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:19:41,290][06909] Updated weights for policy 0, policy_version 185503 (0.0024) [2024-06-28 08:19:43,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3039379456. Throughput: 0: 43972.4. Samples: 2942259700. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 08:19:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:19:44,508][06909] Updated weights for policy 0, policy_version 185513 (0.0038) [2024-06-28 08:19:48,852][06674] Fps is (10 sec: 42589.7, 60 sec: 43689.2, 300 sec: 43931.0). Total num frames: 3039592448. Throughput: 0: 43985.5. Samples: 2942527740. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 08:19:48,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:19:48,981][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000185523_3039608832.pth... [2024-06-28 08:19:48,987][06909] Updated weights for policy 0, policy_version 185523 (0.0024) [2024-06-28 08:19:49,026][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000184878_3029041152.pth [2024-06-28 08:19:52,134][06909] Updated weights for policy 0, policy_version 185533 (0.0028) [2024-06-28 08:19:53,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.7, 300 sec: 44042.7). Total num frames: 3039838208. Throughput: 0: 44067.1. Samples: 2942792520. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 08:19:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:19:56,243][06909] Updated weights for policy 0, policy_version 185543 (0.0040) [2024-06-28 08:19:57,497][06887] Signal inference workers to stop experience collection... (41700 times) [2024-06-28 08:19:57,497][06887] Signal inference workers to resume experience collection... (41700 times) [2024-06-28 08:19:57,529][06909] InferenceWorker_p0-w0: stopping experience collection (41700 times) [2024-06-28 08:19:57,529][06909] InferenceWorker_p0-w0: resuming experience collection (41700 times) [2024-06-28 08:19:58,850][06674] Fps is (10 sec: 44245.6, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3040034816. Throughput: 0: 44013.8. Samples: 2942924740. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 08:19:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:19:59,717][06909] Updated weights for policy 0, policy_version 185553 (0.0029) [2024-06-28 08:20:03,726][06909] Updated weights for policy 0, policy_version 185563 (0.0040) [2024-06-28 08:20:03,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3040264192. Throughput: 0: 43961.2. Samples: 2943184740. Policy #0 lag: (min: 0.0, avg: 12.0, max: 24.0) [2024-06-28 08:20:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 08:20:07,241][06909] Updated weights for policy 0, policy_version 185573 (0.0037) [2024-06-28 08:20:08,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 3040493568. Throughput: 0: 44090.6. Samples: 2943446560. Policy #0 lag: (min: 0.0, avg: 12.0, max: 24.0) [2024-06-28 08:20:08,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:20:11,194][06909] Updated weights for policy 0, policy_version 185583 (0.0033) [2024-06-28 08:20:13,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3040706560. Throughput: 0: 44219.0. Samples: 2943584240. Policy #0 lag: (min: 0.0, avg: 12.0, max: 24.0) [2024-06-28 08:20:13,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 08:20:14,500][06909] Updated weights for policy 0, policy_version 185593 (0.0029) [2024-06-28 08:20:18,619][06909] Updated weights for policy 0, policy_version 185603 (0.0025) [2024-06-28 08:20:18,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 3040919552. Throughput: 0: 43958.1. Samples: 2943841760. Policy #0 lag: (min: 0.0, avg: 12.0, max: 24.0) [2024-06-28 08:20:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:20:22,104][06909] Updated weights for policy 0, policy_version 185613 (0.0029) [2024-06-28 08:20:23,850][06674] Fps is (10 sec: 44237.3, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 3041148928. Throughput: 0: 44027.6. Samples: 2944109780. Policy #0 lag: (min: 0.0, avg: 12.0, max: 24.0) [2024-06-28 08:20:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 08:20:25,837][06909] Updated weights for policy 0, policy_version 185623 (0.0038) [2024-06-28 08:20:28,856][06674] Fps is (10 sec: 45847.5, 60 sec: 44232.3, 300 sec: 44097.0). Total num frames: 3041378304. Throughput: 0: 44115.8. Samples: 2944245180. Policy #0 lag: (min: 0.0, avg: 12.0, max: 24.0) [2024-06-28 08:20:28,857][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 08:20:29,466][06909] Updated weights for policy 0, policy_version 185633 (0.0037) [2024-06-28 08:20:33,379][06909] Updated weights for policy 0, policy_version 185643 (0.0037) [2024-06-28 08:20:33,851][06674] Fps is (10 sec: 42591.8, 60 sec: 43962.6, 300 sec: 44042.2). Total num frames: 3041574912. Throughput: 0: 43837.0. Samples: 2944500380. Policy #0 lag: (min: 0.0, avg: 12.0, max: 24.0) [2024-06-28 08:20:33,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:20:37,374][06909] Updated weights for policy 0, policy_version 185653 (0.0036) [2024-06-28 08:20:38,850][06674] Fps is (10 sec: 44263.6, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 3041820672. Throughput: 0: 43704.4. Samples: 2944759220. Policy #0 lag: (min: 0.0, avg: 12.0, max: 24.0) [2024-06-28 08:20:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 08:20:40,933][06909] Updated weights for policy 0, policy_version 185663 (0.0048) [2024-06-28 08:20:43,850][06674] Fps is (10 sec: 42604.8, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3042000896. Throughput: 0: 43797.8. Samples: 2944895640. Policy #0 lag: (min: 0.0, avg: 12.0, max: 24.0) [2024-06-28 08:20:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:20:44,828][06909] Updated weights for policy 0, policy_version 185673 (0.0038) [2024-06-28 08:20:48,474][06909] Updated weights for policy 0, policy_version 185683 (0.0044) [2024-06-28 08:20:48,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43965.3, 300 sec: 44042.4). Total num frames: 3042230272. Throughput: 0: 43785.9. Samples: 2945155100. Policy #0 lag: (min: 0.0, avg: 12.0, max: 24.0) [2024-06-28 08:20:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:20:52,186][06909] Updated weights for policy 0, policy_version 185693 (0.0035) [2024-06-28 08:20:53,850][06674] Fps is (10 sec: 47513.8, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 3042476032. Throughput: 0: 43898.3. Samples: 2945421980. Policy #0 lag: (min: 0.0, avg: 12.0, max: 24.0) [2024-06-28 08:20:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:20:55,691][06909] Updated weights for policy 0, policy_version 185703 (0.0027) [2024-06-28 08:20:58,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3042672640. Throughput: 0: 43885.4. Samples: 2945559080. Policy #0 lag: (min: 0.0, avg: 12.0, max: 24.0) [2024-06-28 08:20:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:20:59,744][06909] Updated weights for policy 0, policy_version 185713 (0.0037) [2024-06-28 08:21:03,060][06909] Updated weights for policy 0, policy_version 185723 (0.0031) [2024-06-28 08:21:03,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3042902016. Throughput: 0: 44029.0. Samples: 2945823060. Policy #0 lag: (min: 0.0, avg: 12.0, max: 24.0) [2024-06-28 08:21:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:21:07,073][06909] Updated weights for policy 0, policy_version 185733 (0.0040) [2024-06-28 08:21:08,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 3043147776. Throughput: 0: 43800.3. Samples: 2946080800. Policy #0 lag: (min: 1.0, avg: 10.3, max: 22.0) [2024-06-28 08:21:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:21:10,863][06909] Updated weights for policy 0, policy_version 185743 (0.0034) [2024-06-28 08:21:13,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3043328000. Throughput: 0: 43876.2. Samples: 2946219340. Policy #0 lag: (min: 1.0, avg: 10.3, max: 22.0) [2024-06-28 08:21:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:21:14,634][06909] Updated weights for policy 0, policy_version 185753 (0.0031) [2024-06-28 08:21:18,512][06909] Updated weights for policy 0, policy_version 185763 (0.0028) [2024-06-28 08:21:18,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3043557376. Throughput: 0: 44053.0. Samples: 2946482700. Policy #0 lag: (min: 1.0, avg: 10.3, max: 22.0) [2024-06-28 08:21:18,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:21:21,888][06909] Updated weights for policy 0, policy_version 185773 (0.0046) [2024-06-28 08:21:23,850][06674] Fps is (10 sec: 47513.4, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 3043803136. Throughput: 0: 44076.4. Samples: 2946742660. Policy #0 lag: (min: 1.0, avg: 10.3, max: 22.0) [2024-06-28 08:21:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:21:25,770][06909] Updated weights for policy 0, policy_version 185783 (0.0029) [2024-06-28 08:21:28,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43422.0, 300 sec: 43986.9). Total num frames: 3043983360. Throughput: 0: 44199.1. Samples: 2946884600. Policy #0 lag: (min: 1.0, avg: 10.3, max: 22.0) [2024-06-28 08:21:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:21:29,298][06909] Updated weights for policy 0, policy_version 185793 (0.0042) [2024-06-28 08:21:33,007][06909] Updated weights for policy 0, policy_version 185803 (0.0031) [2024-06-28 08:21:33,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43964.9, 300 sec: 43986.9). Total num frames: 3044212736. Throughput: 0: 44234.2. Samples: 2947145640. Policy #0 lag: (min: 1.0, avg: 10.3, max: 22.0) [2024-06-28 08:21:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:21:36,943][06909] Updated weights for policy 0, policy_version 185813 (0.0026) [2024-06-28 08:21:38,850][06674] Fps is (10 sec: 49152.2, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 3044474880. Throughput: 0: 44212.9. Samples: 2947411560. Policy #0 lag: (min: 1.0, avg: 10.3, max: 22.0) [2024-06-28 08:21:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:21:40,211][06909] Updated weights for policy 0, policy_version 185823 (0.0034) [2024-06-28 08:21:41,822][06887] Signal inference workers to stop experience collection... (41750 times) [2024-06-28 08:21:41,823][06887] Signal inference workers to resume experience collection... (41750 times) [2024-06-28 08:21:41,864][06909] InferenceWorker_p0-w0: stopping experience collection (41750 times) [2024-06-28 08:21:41,864][06909] InferenceWorker_p0-w0: resuming experience collection (41750 times) [2024-06-28 08:21:43,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.8, 300 sec: 43932.2). Total num frames: 3044655104. Throughput: 0: 44070.8. Samples: 2947542260. Policy #0 lag: (min: 1.0, avg: 10.3, max: 22.0) [2024-06-28 08:21:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:21:44,296][06909] Updated weights for policy 0, policy_version 185833 (0.0032) [2024-06-28 08:21:47,959][06909] Updated weights for policy 0, policy_version 185843 (0.0031) [2024-06-28 08:21:48,850][06674] Fps is (10 sec: 40960.1, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3044884480. Throughput: 0: 43984.0. Samples: 2947802340. Policy #0 lag: (min: 1.0, avg: 10.3, max: 22.0) [2024-06-28 08:21:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:21:48,987][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000185846_3044900864.pth... [2024-06-28 08:21:49,034][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000185200_3034316800.pth [2024-06-28 08:21:51,737][06909] Updated weights for policy 0, policy_version 185853 (0.0023) [2024-06-28 08:21:53,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 3045113856. Throughput: 0: 44198.7. Samples: 2948069740. Policy #0 lag: (min: 1.0, avg: 10.3, max: 22.0) [2024-06-28 08:21:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:21:55,379][06909] Updated weights for policy 0, policy_version 185863 (0.0025) [2024-06-28 08:21:58,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 3045326848. Throughput: 0: 44259.1. Samples: 2948211000. Policy #0 lag: (min: 1.0, avg: 10.3, max: 22.0) [2024-06-28 08:21:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:21:58,963][06909] Updated weights for policy 0, policy_version 185873 (0.0034) [2024-06-28 08:22:02,677][06909] Updated weights for policy 0, policy_version 185883 (0.0031) [2024-06-28 08:22:03,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3045539840. Throughput: 0: 44125.8. Samples: 2948468360. Policy #0 lag: (min: 1.0, avg: 10.3, max: 22.0) [2024-06-28 08:22:03,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:22:06,343][06909] Updated weights for policy 0, policy_version 185893 (0.0029) [2024-06-28 08:22:08,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3045785600. Throughput: 0: 44281.4. Samples: 2948735320. Policy #0 lag: (min: 1.0, avg: 10.3, max: 22.0) [2024-06-28 08:22:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:22:10,161][06909] Updated weights for policy 0, policy_version 185903 (0.0034) [2024-06-28 08:22:13,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3045982208. Throughput: 0: 44224.5. Samples: 2948874700. Policy #0 lag: (min: 1.0, avg: 10.1, max: 22.0) [2024-06-28 08:22:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:22:14,119][06909] Updated weights for policy 0, policy_version 185913 (0.0032) [2024-06-28 08:22:17,343][06909] Updated weights for policy 0, policy_version 185923 (0.0033) [2024-06-28 08:22:18,850][06674] Fps is (10 sec: 42597.9, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3046211584. Throughput: 0: 44143.4. Samples: 2949132100. Policy #0 lag: (min: 1.0, avg: 10.1, max: 22.0) [2024-06-28 08:22:18,854][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 08:22:21,363][06909] Updated weights for policy 0, policy_version 185933 (0.0040) [2024-06-28 08:22:23,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3046440960. Throughput: 0: 44023.1. Samples: 2949392600. Policy #0 lag: (min: 1.0, avg: 10.1, max: 22.0) [2024-06-28 08:22:23,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 08:22:24,813][06909] Updated weights for policy 0, policy_version 185943 (0.0030) [2024-06-28 08:22:28,850][06674] Fps is (10 sec: 42599.0, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3046637568. Throughput: 0: 44177.8. Samples: 2949530260. Policy #0 lag: (min: 1.0, avg: 10.1, max: 22.0) [2024-06-28 08:22:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:22:28,891][06909] Updated weights for policy 0, policy_version 185953 (0.0038) [2024-06-28 08:22:32,381][06909] Updated weights for policy 0, policy_version 185963 (0.0032) [2024-06-28 08:22:33,850][06674] Fps is (10 sec: 42598.2, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3046866944. Throughput: 0: 44233.3. Samples: 2949792840. Policy #0 lag: (min: 1.0, avg: 10.1, max: 22.0) [2024-06-28 08:22:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:22:36,298][06909] Updated weights for policy 0, policy_version 185973 (0.0028) [2024-06-28 08:22:38,850][06674] Fps is (10 sec: 47511.7, 60 sec: 43963.5, 300 sec: 44098.2). Total num frames: 3047112704. Throughput: 0: 44123.3. Samples: 2950055300. Policy #0 lag: (min: 1.0, avg: 10.1, max: 22.0) [2024-06-28 08:22:38,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:22:39,589][06909] Updated weights for policy 0, policy_version 185983 (0.0040) [2024-06-28 08:22:43,454][06909] Updated weights for policy 0, policy_version 185993 (0.0040) [2024-06-28 08:22:43,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 3047309312. Throughput: 0: 44146.2. Samples: 2950197580. Policy #0 lag: (min: 1.0, avg: 10.1, max: 22.0) [2024-06-28 08:22:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:22:47,217][06909] Updated weights for policy 0, policy_version 186003 (0.0031) [2024-06-28 08:22:48,850][06674] Fps is (10 sec: 42599.6, 60 sec: 44236.7, 300 sec: 44098.9). Total num frames: 3047538688. Throughput: 0: 44199.5. Samples: 2950457340. Policy #0 lag: (min: 1.0, avg: 10.1, max: 22.0) [2024-06-28 08:22:48,853][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:22:50,987][06909] Updated weights for policy 0, policy_version 186013 (0.0025) [2024-06-28 08:22:53,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44510.0, 300 sec: 44153.5). Total num frames: 3047784448. Throughput: 0: 44132.9. Samples: 2950721300. Policy #0 lag: (min: 1.0, avg: 10.1, max: 22.0) [2024-06-28 08:22:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:22:54,757][06909] Updated weights for policy 0, policy_version 186023 (0.0025) [2024-06-28 08:22:58,576][06909] Updated weights for policy 0, policy_version 186033 (0.0026) [2024-06-28 08:22:58,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3047964672. Throughput: 0: 44010.6. Samples: 2950855180. Policy #0 lag: (min: 1.0, avg: 10.1, max: 22.0) [2024-06-28 08:22:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:23:02,006][06909] Updated weights for policy 0, policy_version 186043 (0.0024) [2024-06-28 08:23:03,850][06674] Fps is (10 sec: 39321.5, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3048177664. Throughput: 0: 44121.0. Samples: 2951117540. Policy #0 lag: (min: 1.0, avg: 10.1, max: 22.0) [2024-06-28 08:23:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:23:06,012][06909] Updated weights for policy 0, policy_version 186053 (0.0045) [2024-06-28 08:23:06,544][06887] Signal inference workers to stop experience collection... (41800 times) [2024-06-28 08:23:06,545][06887] Signal inference workers to resume experience collection... (41800 times) [2024-06-28 08:23:06,584][06909] InferenceWorker_p0-w0: stopping experience collection (41800 times) [2024-06-28 08:23:06,584][06909] InferenceWorker_p0-w0: resuming experience collection (41800 times) [2024-06-28 08:23:08,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 3048423424. Throughput: 0: 44053.2. Samples: 2951375000. Policy #0 lag: (min: 1.0, avg: 10.1, max: 22.0) [2024-06-28 08:23:08,859][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:23:10,120][06909] Updated weights for policy 0, policy_version 186063 (0.0036) [2024-06-28 08:23:13,434][06909] Updated weights for policy 0, policy_version 186073 (0.0030) [2024-06-28 08:23:13,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3048620032. Throughput: 0: 44077.8. Samples: 2951513760. Policy #0 lag: (min: 1.0, avg: 10.1, max: 22.0) [2024-06-28 08:23:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:23:17,391][06909] Updated weights for policy 0, policy_version 186083 (0.0038) [2024-06-28 08:23:18,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 3048865792. Throughput: 0: 44211.6. Samples: 2951782360. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 08:23:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:23:20,918][06909] Updated weights for policy 0, policy_version 186093 (0.0027) [2024-06-28 08:23:23,850][06674] Fps is (10 sec: 47513.3, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3049095168. Throughput: 0: 44120.3. Samples: 2952040700. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 08:23:23,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 08:23:24,625][06909] Updated weights for policy 0, policy_version 186103 (0.0030) [2024-06-28 08:23:28,096][06909] Updated weights for policy 0, policy_version 186113 (0.0026) [2024-06-28 08:23:28,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3049291776. Throughput: 0: 43996.8. Samples: 2952177440. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 08:23:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:23:31,859][06909] Updated weights for policy 0, policy_version 186123 (0.0028) [2024-06-28 08:23:33,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3049504768. Throughput: 0: 44174.3. Samples: 2952445180. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 08:23:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 08:23:35,637][06909] Updated weights for policy 0, policy_version 186133 (0.0031) [2024-06-28 08:23:38,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43964.1, 300 sec: 44042.4). Total num frames: 3049750528. Throughput: 0: 43980.9. Samples: 2952700440. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 08:23:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:23:39,305][06909] Updated weights for policy 0, policy_version 186143 (0.0030) [2024-06-28 08:23:43,093][06909] Updated weights for policy 0, policy_version 186153 (0.0043) [2024-06-28 08:23:43,852][06674] Fps is (10 sec: 45866.1, 60 sec: 44235.3, 300 sec: 44042.1). Total num frames: 3049963520. Throughput: 0: 44006.0. Samples: 2952835540. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 08:23:43,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:23:47,037][06909] Updated weights for policy 0, policy_version 186163 (0.0035) [2024-06-28 08:23:48,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3050160128. Throughput: 0: 44178.2. Samples: 2953105560. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 08:23:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:23:48,873][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000186168_3050176512.pth... [2024-06-28 08:23:48,930][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000185523_3039608832.pth [2024-06-28 08:23:50,746][06909] Updated weights for policy 0, policy_version 186173 (0.0025) [2024-06-28 08:23:53,850][06674] Fps is (10 sec: 44245.9, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 3050405888. Throughput: 0: 44095.6. Samples: 2953359300. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 08:23:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 08:23:54,298][06909] Updated weights for policy 0, policy_version 186183 (0.0037) [2024-06-28 08:23:58,252][06909] Updated weights for policy 0, policy_version 186193 (0.0034) [2024-06-28 08:23:58,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3050602496. Throughput: 0: 43992.8. Samples: 2953493440. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 08:23:58,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:24:01,560][06909] Updated weights for policy 0, policy_version 186203 (0.0024) [2024-06-28 08:24:03,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 3050848256. Throughput: 0: 43979.5. Samples: 2953761440. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 08:24:03,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:24:05,584][06909] Updated weights for policy 0, policy_version 186213 (0.0032) [2024-06-28 08:24:08,852][06674] Fps is (10 sec: 45865.9, 60 sec: 43962.3, 300 sec: 44097.7). Total num frames: 3051061248. Throughput: 0: 44012.7. Samples: 2954021360. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 08:24:08,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:24:09,103][06909] Updated weights for policy 0, policy_version 186223 (0.0021) [2024-06-28 08:24:13,115][06909] Updated weights for policy 0, policy_version 186233 (0.0030) [2024-06-28 08:24:13,852][06674] Fps is (10 sec: 42591.1, 60 sec: 44235.5, 300 sec: 43986.6). Total num frames: 3051274240. Throughput: 0: 43949.8. Samples: 2954155260. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 08:24:13,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:24:16,522][06909] Updated weights for policy 0, policy_version 186243 (0.0030) [2024-06-28 08:24:18,850][06674] Fps is (10 sec: 42607.3, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 3051487232. Throughput: 0: 43899.6. Samples: 2954420660. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 08:24:18,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 08:24:19,493][06887] Signal inference workers to stop experience collection... (41850 times) [2024-06-28 08:24:19,493][06887] Signal inference workers to resume experience collection... (41850 times) [2024-06-28 08:24:19,538][06909] InferenceWorker_p0-w0: stopping experience collection (41850 times) [2024-06-28 08:24:19,539][06909] InferenceWorker_p0-w0: resuming experience collection (41850 times) [2024-06-28 08:24:20,278][06909] Updated weights for policy 0, policy_version 186253 (0.0029) [2024-06-28 08:24:23,850][06674] Fps is (10 sec: 42605.6, 60 sec: 43417.6, 300 sec: 43986.9). Total num frames: 3051700224. Throughput: 0: 43879.9. Samples: 2954675040. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 08:24:23,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:24:24,081][06909] Updated weights for policy 0, policy_version 186263 (0.0047) [2024-06-28 08:24:27,614][06909] Updated weights for policy 0, policy_version 186273 (0.0028) [2024-06-28 08:24:28,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3051929600. Throughput: 0: 43949.1. Samples: 2954813160. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-28 08:24:28,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:24:31,432][06909] Updated weights for policy 0, policy_version 186283 (0.0035) [2024-06-28 08:24:33,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3052158976. Throughput: 0: 43873.3. Samples: 2955079860. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-28 08:24:33,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:24:35,118][06909] Updated weights for policy 0, policy_version 186293 (0.0031) [2024-06-28 08:24:38,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 3052371968. Throughput: 0: 44085.2. Samples: 2955343140. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-28 08:24:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:24:39,021][06909] Updated weights for policy 0, policy_version 186303 (0.0027) [2024-06-28 08:24:42,402][06909] Updated weights for policy 0, policy_version 186313 (0.0045) [2024-06-28 08:24:43,850][06674] Fps is (10 sec: 45875.8, 60 sec: 44238.3, 300 sec: 44153.8). Total num frames: 3052617728. Throughput: 0: 44039.6. Samples: 2955475220. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-28 08:24:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:24:46,777][06909] Updated weights for policy 0, policy_version 186323 (0.0028) [2024-06-28 08:24:48,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 3052797952. Throughput: 0: 43916.5. Samples: 2955737680. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-28 08:24:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:24:50,186][06909] Updated weights for policy 0, policy_version 186333 (0.0026) [2024-06-28 08:24:53,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 3053027328. Throughput: 0: 43956.6. Samples: 2955999320. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-28 08:24:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:24:54,201][06909] Updated weights for policy 0, policy_version 186343 (0.0033) [2024-06-28 08:24:57,474][06909] Updated weights for policy 0, policy_version 186353 (0.0023) [2024-06-28 08:24:58,850][06674] Fps is (10 sec: 47513.4, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 3053273088. Throughput: 0: 44156.4. Samples: 2956142220. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-28 08:24:58,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:25:01,318][06909] Updated weights for policy 0, policy_version 186363 (0.0034) [2024-06-28 08:25:03,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3053486080. Throughput: 0: 44063.5. Samples: 2956403520. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-28 08:25:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:25:04,733][06909] Updated weights for policy 0, policy_version 186373 (0.0032) [2024-06-28 08:25:08,754][06909] Updated weights for policy 0, policy_version 186383 (0.0033) [2024-06-28 08:25:08,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43965.2, 300 sec: 44042.4). Total num frames: 3053699072. Throughput: 0: 44384.9. Samples: 2956672360. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-28 08:25:08,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:25:11,956][06909] Updated weights for policy 0, policy_version 186393 (0.0030) [2024-06-28 08:25:13,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44511.2, 300 sec: 44153.5). Total num frames: 3053944832. Throughput: 0: 44318.2. Samples: 2956807480. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-28 08:25:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:25:16,044][06909] Updated weights for policy 0, policy_version 186403 (0.0031) [2024-06-28 08:25:18,851][06674] Fps is (10 sec: 44231.2, 60 sec: 44235.8, 300 sec: 44042.2). Total num frames: 3054141440. Throughput: 0: 44285.0. Samples: 2957072740. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-28 08:25:18,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:25:19,424][06909] Updated weights for policy 0, policy_version 186413 (0.0037) [2024-06-28 08:25:23,805][06909] Updated weights for policy 0, policy_version 186423 (0.0037) [2024-06-28 08:25:23,850][06674] Fps is (10 sec: 40960.1, 60 sec: 44236.9, 300 sec: 43987.8). Total num frames: 3054354432. Throughput: 0: 44208.1. Samples: 2957332500. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-28 08:25:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:25:27,022][06909] Updated weights for policy 0, policy_version 186433 (0.0027) [2024-06-28 08:25:28,850][06674] Fps is (10 sec: 47520.0, 60 sec: 44782.9, 300 sec: 44209.3). Total num frames: 3054616576. Throughput: 0: 44285.3. Samples: 2957468060. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-28 08:25:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:25:31,070][06909] Updated weights for policy 0, policy_version 186443 (0.0023) [2024-06-28 08:25:33,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.9, 300 sec: 43986.9). Total num frames: 3054796800. Throughput: 0: 44264.9. Samples: 2957729600. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 08:25:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:25:34,774][06909] Updated weights for policy 0, policy_version 186453 (0.0023) [2024-06-28 08:25:38,291][06909] Updated weights for policy 0, policy_version 186463 (0.0044) [2024-06-28 08:25:38,852][06674] Fps is (10 sec: 39313.3, 60 sec: 43962.3, 300 sec: 44097.6). Total num frames: 3055009792. Throughput: 0: 44231.3. Samples: 2957989820. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 08:25:38,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:25:41,918][06909] Updated weights for policy 0, policy_version 186473 (0.0037) [2024-06-28 08:25:43,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 3055255552. Throughput: 0: 44152.4. Samples: 2958129080. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 08:25:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:25:45,921][06909] Updated weights for policy 0, policy_version 186483 (0.0024) [2024-06-28 08:25:48,850][06674] Fps is (10 sec: 45885.0, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 3055468544. Throughput: 0: 44193.4. Samples: 2958392220. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 08:25:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:25:48,878][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000186491_3055468544.pth... [2024-06-28 08:25:48,952][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000185846_3044900864.pth [2024-06-28 08:25:49,431][06909] Updated weights for policy 0, policy_version 186493 (0.0031) [2024-06-28 08:25:53,566][06909] Updated weights for policy 0, policy_version 186503 (0.0024) [2024-06-28 08:25:53,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3055665152. Throughput: 0: 44232.4. Samples: 2958662820. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 08:25:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 08:25:55,408][06887] Signal inference workers to stop experience collection... (41900 times) [2024-06-28 08:25:55,420][06909] InferenceWorker_p0-w0: stopping experience collection (41900 times) [2024-06-28 08:25:55,472][06887] Signal inference workers to resume experience collection... (41900 times) [2024-06-28 08:25:55,472][06909] InferenceWorker_p0-w0: resuming experience collection (41900 times) [2024-06-28 08:25:56,815][06909] Updated weights for policy 0, policy_version 186513 (0.0035) [2024-06-28 08:25:58,853][06674] Fps is (10 sec: 44221.8, 60 sec: 43961.3, 300 sec: 44097.4). Total num frames: 3055910912. Throughput: 0: 43939.8. Samples: 2958784920. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 08:25:58,854][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:26:00,948][06909] Updated weights for policy 0, policy_version 186523 (0.0042) [2024-06-28 08:26:03,850][06674] Fps is (10 sec: 45875.9, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3056123904. Throughput: 0: 43960.0. Samples: 2959050880. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 08:26:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:26:04,235][06909] Updated weights for policy 0, policy_version 186533 (0.0028) [2024-06-28 08:26:08,458][06909] Updated weights for policy 0, policy_version 186543 (0.0026) [2024-06-28 08:26:08,850][06674] Fps is (10 sec: 40973.8, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 3056320512. Throughput: 0: 44098.2. Samples: 2959316920. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 08:26:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:26:11,540][06909] Updated weights for policy 0, policy_version 186553 (0.0032) [2024-06-28 08:26:13,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 3056582656. Throughput: 0: 43959.5. Samples: 2959446240. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 08:26:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:26:15,902][06909] Updated weights for policy 0, policy_version 186563 (0.0037) [2024-06-28 08:26:18,850][06674] Fps is (10 sec: 47513.8, 60 sec: 44237.8, 300 sec: 44042.4). Total num frames: 3056795648. Throughput: 0: 44091.5. Samples: 2959713720. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 08:26:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:26:18,884][06909] Updated weights for policy 0, policy_version 186573 (0.0025) [2024-06-28 08:26:23,445][06909] Updated weights for policy 0, policy_version 186583 (0.0034) [2024-06-28 08:26:23,850][06674] Fps is (10 sec: 42597.9, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 3057008640. Throughput: 0: 44363.7. Samples: 2959986100. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 08:26:23,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:26:26,525][06909] Updated weights for policy 0, policy_version 186593 (0.0028) [2024-06-28 08:26:28,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 3057238016. Throughput: 0: 44044.1. Samples: 2960111060. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 08:26:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:26:30,950][06909] Updated weights for policy 0, policy_version 186603 (0.0024) [2024-06-28 08:26:33,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 3057451008. Throughput: 0: 44002.5. Samples: 2960372340. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 08:26:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:26:34,151][06909] Updated weights for policy 0, policy_version 186613 (0.0025) [2024-06-28 08:26:38,412][06909] Updated weights for policy 0, policy_version 186623 (0.0024) [2024-06-28 08:26:38,850][06674] Fps is (10 sec: 40959.3, 60 sec: 43965.2, 300 sec: 44042.4). Total num frames: 3057647616. Throughput: 0: 43897.3. Samples: 2960638200. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 08:26:38,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:26:41,543][06909] Updated weights for policy 0, policy_version 186633 (0.0029) [2024-06-28 08:26:43,852][06674] Fps is (10 sec: 45866.1, 60 sec: 44235.3, 300 sec: 44153.2). Total num frames: 3057909760. Throughput: 0: 43980.4. Samples: 2960763980. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 08:26:43,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:26:45,893][06909] Updated weights for policy 0, policy_version 186643 (0.0032) [2024-06-28 08:26:48,704][06909] Updated weights for policy 0, policy_version 186653 (0.0036) [2024-06-28 08:26:48,850][06674] Fps is (10 sec: 47513.3, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 3058122752. Throughput: 0: 43935.4. Samples: 2961027980. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 08:26:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 08:26:53,240][06909] Updated weights for policy 0, policy_version 186663 (0.0026) [2024-06-28 08:26:53,850][06674] Fps is (10 sec: 42607.3, 60 sec: 44510.0, 300 sec: 44098.0). Total num frames: 3058335744. Throughput: 0: 44097.3. Samples: 2961301300. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 08:26:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:26:56,218][06909] Updated weights for policy 0, policy_version 186673 (0.0040) [2024-06-28 08:26:58,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44239.2, 300 sec: 44153.5). Total num frames: 3058565120. Throughput: 0: 43959.5. Samples: 2961424420. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 08:26:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:27:00,849][06909] Updated weights for policy 0, policy_version 186683 (0.0029) [2024-06-28 08:27:03,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3058761728. Throughput: 0: 43897.3. Samples: 2961689100. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 08:27:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:27:03,858][06909] Updated weights for policy 0, policy_version 186693 (0.0031) [2024-06-28 08:27:04,545][06887] Signal inference workers to stop experience collection... (41950 times) [2024-06-28 08:27:04,546][06887] Signal inference workers to resume experience collection... (41950 times) [2024-06-28 08:27:04,587][06909] InferenceWorker_p0-w0: stopping experience collection (41950 times) [2024-06-28 08:27:04,587][06909] InferenceWorker_p0-w0: resuming experience collection (41950 times) [2024-06-28 08:27:08,479][06909] Updated weights for policy 0, policy_version 186703 (0.0029) [2024-06-28 08:27:08,850][06674] Fps is (10 sec: 40959.8, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 3058974720. Throughput: 0: 43767.5. Samples: 2961955640. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 08:27:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:27:11,391][06909] Updated weights for policy 0, policy_version 186713 (0.0026) [2024-06-28 08:27:13,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 3059220480. Throughput: 0: 43713.3. Samples: 2962078160. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 08:27:13,850][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 08:27:15,601][06909] Updated weights for policy 0, policy_version 186723 (0.0035) [2024-06-28 08:27:18,709][06909] Updated weights for policy 0, policy_version 186733 (0.0027) [2024-06-28 08:27:18,850][06674] Fps is (10 sec: 45876.0, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3059433472. Throughput: 0: 43891.7. Samples: 2962347460. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 08:27:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:27:22,869][06909] Updated weights for policy 0, policy_version 186743 (0.0028) [2024-06-28 08:27:23,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 3059630080. Throughput: 0: 43980.5. Samples: 2962617320. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 08:27:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:27:26,034][06909] Updated weights for policy 0, policy_version 186753 (0.0025) [2024-06-28 08:27:28,850][06674] Fps is (10 sec: 44236.0, 60 sec: 43963.6, 300 sec: 44097.9). Total num frames: 3059875840. Throughput: 0: 43932.6. Samples: 2962740860. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 08:27:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:27:30,639][06909] Updated weights for policy 0, policy_version 186763 (0.0034) [2024-06-28 08:27:33,515][06909] Updated weights for policy 0, policy_version 186773 (0.0027) [2024-06-28 08:27:33,850][06674] Fps is (10 sec: 49152.4, 60 sec: 44510.0, 300 sec: 44098.0). Total num frames: 3060121600. Throughput: 0: 44109.1. Samples: 2963012880. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 08:27:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:27:37,938][06909] Updated weights for policy 0, policy_version 186783 (0.0032) [2024-06-28 08:27:38,850][06674] Fps is (10 sec: 42598.9, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3060301824. Throughput: 0: 44006.2. Samples: 2963281580. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 08:27:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:27:40,808][06909] Updated weights for policy 0, policy_version 186793 (0.0029) [2024-06-28 08:27:43,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43692.2, 300 sec: 44042.4). Total num frames: 3060531200. Throughput: 0: 43881.0. Samples: 2963399060. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 08:27:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:27:45,533][06909] Updated weights for policy 0, policy_version 186803 (0.0028) [2024-06-28 08:27:48,142][06909] Updated weights for policy 0, policy_version 186813 (0.0036) [2024-06-28 08:27:48,850][06674] Fps is (10 sec: 49151.6, 60 sec: 44509.9, 300 sec: 44097.9). Total num frames: 3060793344. Throughput: 0: 44168.7. Samples: 2963676700. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 08:27:48,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:27:48,900][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000186817_3060809728.pth... [2024-06-28 08:27:48,955][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000186168_3050176512.pth [2024-06-28 08:27:52,723][06909] Updated weights for policy 0, policy_version 186823 (0.0029) [2024-06-28 08:27:53,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43417.5, 300 sec: 43986.9). Total num frames: 3060940800. Throughput: 0: 44155.1. Samples: 2963942620. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 08:27:53,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:27:55,487][06909] Updated weights for policy 0, policy_version 186833 (0.0034) [2024-06-28 08:27:58,856][06674] Fps is (10 sec: 39298.3, 60 sec: 43686.3, 300 sec: 44097.0). Total num frames: 3061186560. Throughput: 0: 44023.8. Samples: 2964059500. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 08:27:58,856][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:27:59,994][06909] Updated weights for policy 0, policy_version 186843 (0.0039) [2024-06-28 08:28:03,058][06909] Updated weights for policy 0, policy_version 186853 (0.0028) [2024-06-28 08:28:03,629][06887] Signal inference workers to stop experience collection... (42000 times) [2024-06-28 08:28:03,629][06887] Signal inference workers to resume experience collection... (42000 times) [2024-06-28 08:28:03,640][06909] InferenceWorker_p0-w0: stopping experience collection (42000 times) [2024-06-28 08:28:03,641][06909] InferenceWorker_p0-w0: resuming experience collection (42000 times) [2024-06-28 08:28:03,850][06674] Fps is (10 sec: 50791.3, 60 sec: 44782.9, 300 sec: 44153.5). Total num frames: 3061448704. Throughput: 0: 44114.7. Samples: 2964332620. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 08:28:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:28:07,691][06909] Updated weights for policy 0, policy_version 186863 (0.0031) [2024-06-28 08:28:08,850][06674] Fps is (10 sec: 42624.1, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3061612544. Throughput: 0: 44158.2. Samples: 2964604440. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 08:28:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 08:28:10,423][06909] Updated weights for policy 0, policy_version 186873 (0.0031) [2024-06-28 08:28:13,850][06674] Fps is (10 sec: 39321.7, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3061841920. Throughput: 0: 44041.1. Samples: 2964722700. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 08:28:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:28:15,008][06909] Updated weights for policy 0, policy_version 186883 (0.0029) [2024-06-28 08:28:17,828][06909] Updated weights for policy 0, policy_version 186893 (0.0025) [2024-06-28 08:28:18,850][06674] Fps is (10 sec: 50790.1, 60 sec: 44782.9, 300 sec: 44153.5). Total num frames: 3062120448. Throughput: 0: 44166.1. Samples: 2965000360. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 08:28:18,859][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 08:28:22,515][06909] Updated weights for policy 0, policy_version 186903 (0.0036) [2024-06-28 08:28:23,852][06674] Fps is (10 sec: 42589.5, 60 sec: 43962.3, 300 sec: 43986.6). Total num frames: 3062267904. Throughput: 0: 44186.5. Samples: 2965270060. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 08:28:23,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:28:25,316][06909] Updated weights for policy 0, policy_version 186913 (0.0028) [2024-06-28 08:28:28,850][06674] Fps is (10 sec: 37683.5, 60 sec: 43690.8, 300 sec: 44042.4). Total num frames: 3062497280. Throughput: 0: 44173.8. Samples: 2965386880. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 08:28:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:28:29,879][06909] Updated weights for policy 0, policy_version 186923 (0.0046) [2024-06-28 08:28:32,682][06909] Updated weights for policy 0, policy_version 186933 (0.0034) [2024-06-28 08:28:33,850][06674] Fps is (10 sec: 50800.6, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 3062775808. Throughput: 0: 43953.9. Samples: 2965654620. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 08:28:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:28:37,086][06909] Updated weights for policy 0, policy_version 186943 (0.0026) [2024-06-28 08:28:38,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 43987.2). Total num frames: 3062939648. Throughput: 0: 44315.2. Samples: 2965936800. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 08:28:38,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:28:39,955][06909] Updated weights for policy 0, policy_version 186953 (0.0029) [2024-06-28 08:28:43,850][06674] Fps is (10 sec: 37683.0, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 3063152640. Throughput: 0: 44297.4. Samples: 2966052620. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 08:28:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:28:44,786][06909] Updated weights for policy 0, policy_version 186963 (0.0033) [2024-06-28 08:28:47,411][06909] Updated weights for policy 0, policy_version 186973 (0.0031) [2024-06-28 08:28:48,852][06674] Fps is (10 sec: 50780.2, 60 sec: 44235.4, 300 sec: 44208.7). Total num frames: 3063447552. Throughput: 0: 44193.0. Samples: 2966321400. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 08:28:48,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:28:52,167][06909] Updated weights for policy 0, policy_version 186983 (0.0028) [2024-06-28 08:28:53,850][06674] Fps is (10 sec: 44237.4, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 3063595008. Throughput: 0: 44269.8. Samples: 2966596580. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-28 08:28:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:28:54,931][06909] Updated weights for policy 0, policy_version 186993 (0.0025) [2024-06-28 08:28:58,850][06674] Fps is (10 sec: 36051.6, 60 sec: 43694.9, 300 sec: 43931.3). Total num frames: 3063808000. Throughput: 0: 44342.0. Samples: 2966718100. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-28 08:28:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:28:59,506][06909] Updated weights for policy 0, policy_version 187003 (0.0036) [2024-06-28 08:29:01,818][06887] Signal inference workers to stop experience collection... (42050 times) [2024-06-28 08:29:01,867][06909] InferenceWorker_p0-w0: stopping experience collection (42050 times) [2024-06-28 08:29:01,928][06887] Signal inference workers to resume experience collection... (42050 times) [2024-06-28 08:29:01,929][06909] InferenceWorker_p0-w0: resuming experience collection (42050 times) [2024-06-28 08:29:02,252][06909] Updated weights for policy 0, policy_version 187013 (0.0041) [2024-06-28 08:29:03,850][06674] Fps is (10 sec: 50789.6, 60 sec: 44236.7, 300 sec: 44209.3). Total num frames: 3064102912. Throughput: 0: 43979.5. Samples: 2966979440. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-28 08:29:03,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:29:07,089][06909] Updated weights for policy 0, policy_version 187023 (0.0035) [2024-06-28 08:29:08,850][06674] Fps is (10 sec: 47514.5, 60 sec: 44509.9, 300 sec: 44098.2). Total num frames: 3064283136. Throughput: 0: 44069.1. Samples: 2967253080. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-28 08:29:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:29:09,971][06909] Updated weights for policy 0, policy_version 187033 (0.0039) [2024-06-28 08:29:13,850][06674] Fps is (10 sec: 37683.3, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 3064479744. Throughput: 0: 44247.0. Samples: 2967378000. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-28 08:29:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:29:14,404][06909] Updated weights for policy 0, policy_version 187043 (0.0036) [2024-06-28 08:29:17,104][06909] Updated weights for policy 0, policy_version 187053 (0.0025) [2024-06-28 08:29:18,850][06674] Fps is (10 sec: 49151.8, 60 sec: 44236.8, 300 sec: 44320.1). Total num frames: 3064774656. Throughput: 0: 44317.3. Samples: 2967648900. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-28 08:29:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 08:29:21,733][06909] Updated weights for policy 0, policy_version 187063 (0.0019) [2024-06-28 08:29:23,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44511.3, 300 sec: 44097.9). Total num frames: 3064938496. Throughput: 0: 44085.3. Samples: 2967920640. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-28 08:29:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:29:24,533][06909] Updated weights for policy 0, policy_version 187073 (0.0036) [2024-06-28 08:29:28,850][06674] Fps is (10 sec: 34406.7, 60 sec: 43690.7, 300 sec: 43931.4). Total num frames: 3065118720. Throughput: 0: 44095.7. Samples: 2968036920. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-28 08:29:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:29:29,239][06909] Updated weights for policy 0, policy_version 187083 (0.0033) [2024-06-28 08:29:31,832][06909] Updated weights for policy 0, policy_version 187093 (0.0032) [2024-06-28 08:29:33,850][06674] Fps is (10 sec: 47514.3, 60 sec: 43963.8, 300 sec: 44209.1). Total num frames: 3065413632. Throughput: 0: 44227.9. Samples: 2968311560. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-28 08:29:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:29:36,431][06909] Updated weights for policy 0, policy_version 187103 (0.0026) [2024-06-28 08:29:38,850][06674] Fps is (10 sec: 49151.5, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 3065610240. Throughput: 0: 44083.0. Samples: 2968580320. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-28 08:29:38,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:29:39,345][06909] Updated weights for policy 0, policy_version 187113 (0.0032) [2024-06-28 08:29:43,852][06674] Fps is (10 sec: 39312.1, 60 sec: 44235.1, 300 sec: 44097.6). Total num frames: 3065806848. Throughput: 0: 44213.5. Samples: 2968707800. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-28 08:29:43,853][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:29:43,932][06909] Updated weights for policy 0, policy_version 187123 (0.0043) [2024-06-28 08:29:46,934][06909] Updated weights for policy 0, policy_version 187133 (0.0027) [2024-06-28 08:29:48,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43692.2, 300 sec: 44209.0). Total num frames: 3066068992. Throughput: 0: 44287.2. Samples: 2968972360. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-28 08:29:48,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-28 08:29:48,947][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000187139_3066085376.pth... [2024-06-28 08:29:48,991][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000186491_3055468544.pth [2024-06-28 08:29:51,321][06909] Updated weights for policy 0, policy_version 187143 (0.0031) [2024-06-28 08:29:53,850][06674] Fps is (10 sec: 49163.6, 60 sec: 45056.0, 300 sec: 44153.5). Total num frames: 3066298368. Throughput: 0: 44319.6. Samples: 2969247460. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-28 08:29:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:29:54,171][06909] Updated weights for policy 0, policy_version 187153 (0.0035) [2024-06-28 08:29:54,667][06887] Signal inference workers to stop experience collection... (42100 times) [2024-06-28 08:29:54,722][06887] Signal inference workers to resume experience collection... (42100 times) [2024-06-28 08:29:54,728][06909] InferenceWorker_p0-w0: stopping experience collection (42100 times) [2024-06-28 08:29:54,758][06909] InferenceWorker_p0-w0: resuming experience collection (42100 times) [2024-06-28 08:29:58,764][06909] Updated weights for policy 0, policy_version 187163 (0.0030) [2024-06-28 08:29:58,852][06674] Fps is (10 sec: 40951.5, 60 sec: 44508.5, 300 sec: 44042.1). Total num frames: 3066478592. Throughput: 0: 44359.4. Samples: 2969374260. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 08:29:58,852][06674] Avg episode reward: [(0, '0.413')] [2024-06-28 08:30:01,574][06909] Updated weights for policy 0, policy_version 187173 (0.0043) [2024-06-28 08:30:03,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.8, 300 sec: 44153.5). Total num frames: 3066724352. Throughput: 0: 44213.4. Samples: 2969638500. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 08:30:03,850][06674] Avg episode reward: [(0, '0.452')] [2024-06-28 08:30:06,350][06909] Updated weights for policy 0, policy_version 187183 (0.0038) [2024-06-28 08:30:08,850][06674] Fps is (10 sec: 47523.0, 60 sec: 44509.8, 300 sec: 44097.9). Total num frames: 3066953728. Throughput: 0: 44172.0. Samples: 2969908380. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 08:30:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:30:08,898][06909] Updated weights for policy 0, policy_version 187193 (0.0021) [2024-06-28 08:30:13,641][06909] Updated weights for policy 0, policy_version 187203 (0.0031) [2024-06-28 08:30:13,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44510.0, 300 sec: 44098.2). Total num frames: 3067150336. Throughput: 0: 44400.5. Samples: 2970034940. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 08:30:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:30:16,735][06909] Updated weights for policy 0, policy_version 187213 (0.0038) [2024-06-28 08:30:18,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.7, 300 sec: 44209.0). Total num frames: 3067396096. Throughput: 0: 43931.4. Samples: 2970288480. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 08:30:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:30:20,911][06909] Updated weights for policy 0, policy_version 187223 (0.0027) [2024-06-28 08:30:23,850][06674] Fps is (10 sec: 45874.0, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 3067609088. Throughput: 0: 44032.8. Samples: 2970561800. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 08:30:23,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 08:30:24,050][06909] Updated weights for policy 0, policy_version 187233 (0.0028) [2024-06-28 08:30:28,347][06909] Updated weights for policy 0, policy_version 187243 (0.0039) [2024-06-28 08:30:28,850][06674] Fps is (10 sec: 40960.2, 60 sec: 44782.9, 300 sec: 44097.9). Total num frames: 3067805696. Throughput: 0: 44137.0. Samples: 2970693860. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 08:30:28,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:30:31,430][06909] Updated weights for policy 0, policy_version 187253 (0.0031) [2024-06-28 08:30:33,850][06674] Fps is (10 sec: 42599.2, 60 sec: 43690.6, 300 sec: 44153.8). Total num frames: 3068035072. Throughput: 0: 44002.7. Samples: 2970952480. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 08:30:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:30:35,842][06909] Updated weights for policy 0, policy_version 187263 (0.0030) [2024-06-28 08:30:38,771][06909] Updated weights for policy 0, policy_version 187273 (0.0027) [2024-06-28 08:30:38,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 3068280832. Throughput: 0: 44058.2. Samples: 2971230080. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 08:30:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:30:43,305][06909] Updated weights for policy 0, policy_version 187283 (0.0029) [2024-06-28 08:30:43,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44511.6, 300 sec: 44097.9). Total num frames: 3068477440. Throughput: 0: 44167.4. Samples: 2971361700. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 08:30:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:30:46,225][06909] Updated weights for policy 0, policy_version 187293 (0.0038) [2024-06-28 08:30:48,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 3068706816. Throughput: 0: 44030.2. Samples: 2971619860. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 08:30:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:30:50,523][06909] Updated weights for policy 0, policy_version 187303 (0.0047) [2024-06-28 08:30:53,687][06909] Updated weights for policy 0, policy_version 187313 (0.0036) [2024-06-28 08:30:53,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.7, 300 sec: 44154.0). Total num frames: 3068936192. Throughput: 0: 44044.5. Samples: 2971890380. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 08:30:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:30:57,995][06909] Updated weights for policy 0, policy_version 187323 (0.0033) [2024-06-28 08:30:58,850][06674] Fps is (10 sec: 42597.8, 60 sec: 44238.2, 300 sec: 44097.9). Total num frames: 3069132800. Throughput: 0: 44065.1. Samples: 2972017880. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 08:30:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:31:00,983][06909] Updated weights for policy 0, policy_version 187333 (0.0026) [2024-06-28 08:31:03,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43963.6, 300 sec: 44209.0). Total num frames: 3069362176. Throughput: 0: 44227.0. Samples: 2972278700. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 08:31:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:31:05,587][06909] Updated weights for policy 0, policy_version 187343 (0.0032) [2024-06-28 08:31:08,625][06909] Updated weights for policy 0, policy_version 187353 (0.0030) [2024-06-28 08:31:08,850][06674] Fps is (10 sec: 47514.4, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 3069607936. Throughput: 0: 44261.5. Samples: 2972553560. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 08:31:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:31:12,803][06909] Updated weights for policy 0, policy_version 187363 (0.0031) [2024-06-28 08:31:13,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 3069804544. Throughput: 0: 44282.2. Samples: 2972686560. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 08:31:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:31:15,854][06909] Updated weights for policy 0, policy_version 187373 (0.0026) [2024-06-28 08:31:17,479][06887] Signal inference workers to stop experience collection... (42150 times) [2024-06-28 08:31:17,480][06887] Signal inference workers to resume experience collection... (42150 times) [2024-06-28 08:31:17,500][06909] InferenceWorker_p0-w0: stopping experience collection (42150 times) [2024-06-28 08:31:17,500][06909] InferenceWorker_p0-w0: resuming experience collection (42150 times) [2024-06-28 08:31:18,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43690.6, 300 sec: 44098.0). Total num frames: 3070017536. Throughput: 0: 44324.8. Samples: 2972947100. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 08:31:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:31:20,415][06909] Updated weights for policy 0, policy_version 187383 (0.0029) [2024-06-28 08:31:23,489][06909] Updated weights for policy 0, policy_version 187393 (0.0037) [2024-06-28 08:31:23,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44509.9, 300 sec: 44209.0). Total num frames: 3070279680. Throughput: 0: 43964.8. Samples: 2973208500. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 08:31:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 08:31:27,771][06909] Updated weights for policy 0, policy_version 187403 (0.0026) [2024-06-28 08:31:28,850][06674] Fps is (10 sec: 44237.3, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 3070459904. Throughput: 0: 44034.2. Samples: 2973343240. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 08:31:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:31:31,118][06909] Updated weights for policy 0, policy_version 187413 (0.0030) [2024-06-28 08:31:33,850][06674] Fps is (10 sec: 39321.7, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 3070672896. Throughput: 0: 44122.6. Samples: 2973605380. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 08:31:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:31:35,156][06909] Updated weights for policy 0, policy_version 187423 (0.0030) [2024-06-28 08:31:38,390][06909] Updated weights for policy 0, policy_version 187433 (0.0032) [2024-06-28 08:31:38,850][06674] Fps is (10 sec: 45874.5, 60 sec: 43963.6, 300 sec: 44098.2). Total num frames: 3070918656. Throughput: 0: 44109.2. Samples: 2973875300. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 08:31:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:31:42,774][06909] Updated weights for policy 0, policy_version 187443 (0.0041) [2024-06-28 08:31:43,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3071115264. Throughput: 0: 44251.8. Samples: 2974009200. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 08:31:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:31:45,769][06909] Updated weights for policy 0, policy_version 187453 (0.0033) [2024-06-28 08:31:48,850][06674] Fps is (10 sec: 40960.7, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 3071328256. Throughput: 0: 44193.9. Samples: 2974267420. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 08:31:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:31:48,958][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000187460_3071344640.pth... [2024-06-28 08:31:49,018][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000186817_3060809728.pth [2024-06-28 08:31:50,123][06909] Updated weights for policy 0, policy_version 187463 (0.0030) [2024-06-28 08:31:53,281][06909] Updated weights for policy 0, policy_version 187473 (0.0038) [2024-06-28 08:31:53,850][06674] Fps is (10 sec: 47512.3, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 3071590400. Throughput: 0: 43871.4. Samples: 2974527780. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 08:31:53,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:31:57,819][06909] Updated weights for policy 0, policy_version 187483 (0.0032) [2024-06-28 08:31:58,850][06674] Fps is (10 sec: 45874.6, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 3071787008. Throughput: 0: 43911.1. Samples: 2974662560. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 08:31:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:32:00,713][06909] Updated weights for policy 0, policy_version 187493 (0.0028) [2024-06-28 08:32:03,850][06674] Fps is (10 sec: 39322.0, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 3071983616. Throughput: 0: 43872.0. Samples: 2974921340. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 08:32:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 08:32:05,004][06909] Updated weights for policy 0, policy_version 187503 (0.0032) [2024-06-28 08:32:08,062][06909] Updated weights for policy 0, policy_version 187513 (0.0025) [2024-06-28 08:32:08,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 3072245760. Throughput: 0: 43943.1. Samples: 2975185940. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 08:32:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:32:12,496][06909] Updated weights for policy 0, policy_version 187523 (0.0040) [2024-06-28 08:32:13,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 3072442368. Throughput: 0: 44097.7. Samples: 2975327640. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 08:32:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 08:32:15,491][06909] Updated weights for policy 0, policy_version 187533 (0.0042) [2024-06-28 08:32:18,850][06674] Fps is (10 sec: 39321.8, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 3072638976. Throughput: 0: 43972.5. Samples: 2975584140. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 08:32:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:32:19,887][06909] Updated weights for policy 0, policy_version 187543 (0.0043) [2024-06-28 08:32:22,915][06909] Updated weights for policy 0, policy_version 187553 (0.0041) [2024-06-28 08:32:23,852][06674] Fps is (10 sec: 45865.8, 60 sec: 43689.2, 300 sec: 44153.2). Total num frames: 3072901120. Throughput: 0: 43690.1. Samples: 2975841440. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 08:32:23,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:32:27,669][06909] Updated weights for policy 0, policy_version 187563 (0.0038) [2024-06-28 08:32:28,850][06674] Fps is (10 sec: 47513.2, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 3073114112. Throughput: 0: 43974.9. Samples: 2975988080. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 08:32:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 08:32:30,303][06909] Updated weights for policy 0, policy_version 187573 (0.0033) [2024-06-28 08:32:30,584][06887] Signal inference workers to stop experience collection... (42200 times) [2024-06-28 08:32:30,616][06909] InferenceWorker_p0-w0: stopping experience collection (42200 times) [2024-06-28 08:32:30,636][06887] Signal inference workers to resume experience collection... (42200 times) [2024-06-28 08:32:30,637][06909] InferenceWorker_p0-w0: resuming experience collection (42200 times) [2024-06-28 08:32:33,850][06674] Fps is (10 sec: 40968.7, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 3073310720. Throughput: 0: 43918.7. Samples: 2976243760. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 08:32:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 08:32:35,034][06909] Updated weights for policy 0, policy_version 187583 (0.0035) [2024-06-28 08:32:37,870][06909] Updated weights for policy 0, policy_version 187593 (0.0040) [2024-06-28 08:32:38,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 3073556480. Throughput: 0: 43777.5. Samples: 2976497760. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 08:32:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:32:42,381][06909] Updated weights for policy 0, policy_version 187603 (0.0050) [2024-06-28 08:32:43,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 43931.4). Total num frames: 3073753088. Throughput: 0: 43971.2. Samples: 2976641260. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 08:32:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 08:32:45,226][06909] Updated weights for policy 0, policy_version 187613 (0.0035) [2024-06-28 08:32:48,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 3073966080. Throughput: 0: 43909.4. Samples: 2976897260. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 08:32:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:32:49,843][06909] Updated weights for policy 0, policy_version 187623 (0.0031) [2024-06-28 08:32:52,656][06909] Updated weights for policy 0, policy_version 187633 (0.0037) [2024-06-28 08:32:53,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43690.8, 300 sec: 44154.4). Total num frames: 3074211840. Throughput: 0: 43693.9. Samples: 2977152160. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 08:32:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:32:57,496][06909] Updated weights for policy 0, policy_version 187643 (0.0027) [2024-06-28 08:32:58,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3074424832. Throughput: 0: 43887.6. Samples: 2977302580. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 08:32:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:33:00,070][06909] Updated weights for policy 0, policy_version 187653 (0.0030) [2024-06-28 08:33:03,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 3074621440. Throughput: 0: 43964.4. Samples: 2977562540. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 08:33:03,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 08:33:04,855][06909] Updated weights for policy 0, policy_version 187663 (0.0027) [2024-06-28 08:33:07,442][06909] Updated weights for policy 0, policy_version 187673 (0.0028) [2024-06-28 08:33:08,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.8, 300 sec: 44209.0). Total num frames: 3074883584. Throughput: 0: 43935.8. Samples: 2977818460. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 08:33:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 08:33:12,089][06909] Updated weights for policy 0, policy_version 187683 (0.0026) [2024-06-28 08:33:13,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 3075080192. Throughput: 0: 43997.8. Samples: 2977967980. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 08:33:13,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 08:33:14,917][06909] Updated weights for policy 0, policy_version 187693 (0.0032) [2024-06-28 08:33:18,850][06674] Fps is (10 sec: 40960.3, 60 sec: 44236.8, 300 sec: 44153.8). Total num frames: 3075293184. Throughput: 0: 44145.8. Samples: 2978230320. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2024-06-28 08:33:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 08:33:19,262][06909] Updated weights for policy 0, policy_version 187703 (0.0032) [2024-06-28 08:33:22,226][06909] Updated weights for policy 0, policy_version 187713 (0.0031) [2024-06-28 08:33:23,850][06674] Fps is (10 sec: 47513.8, 60 sec: 44238.3, 300 sec: 44264.6). Total num frames: 3075555328. Throughput: 0: 44149.3. Samples: 2978484480. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2024-06-28 08:33:23,856][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:33:26,783][06909] Updated weights for policy 0, policy_version 187723 (0.0044) [2024-06-28 08:33:28,850][06674] Fps is (10 sec: 47513.1, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3075768320. Throughput: 0: 44257.7. Samples: 2978632860. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2024-06-28 08:33:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 08:33:29,457][06909] Updated weights for policy 0, policy_version 187733 (0.0032) [2024-06-28 08:33:33,850][06674] Fps is (10 sec: 37683.5, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 3075932160. Throughput: 0: 44284.5. Samples: 2978890060. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2024-06-28 08:33:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:33:34,626][06909] Updated weights for policy 0, policy_version 187743 (0.0031) [2024-06-28 08:33:36,563][06887] Signal inference workers to stop experience collection... (42250 times) [2024-06-28 08:33:36,564][06887] Signal inference workers to resume experience collection... (42250 times) [2024-06-28 08:33:36,597][06909] InferenceWorker_p0-w0: stopping experience collection (42250 times) [2024-06-28 08:33:36,598][06909] InferenceWorker_p0-w0: resuming experience collection (42250 times) [2024-06-28 08:33:36,946][06909] Updated weights for policy 0, policy_version 187753 (0.0030) [2024-06-28 08:33:38,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43963.8, 300 sec: 44209.1). Total num frames: 3076194304. Throughput: 0: 44356.9. Samples: 2979148220. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2024-06-28 08:33:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:33:41,811][06909] Updated weights for policy 0, policy_version 187763 (0.0032) [2024-06-28 08:33:43,850][06674] Fps is (10 sec: 49151.8, 60 sec: 44509.9, 300 sec: 43987.2). Total num frames: 3076423680. Throughput: 0: 44162.2. Samples: 2979289880. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2024-06-28 08:33:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:33:44,439][06909] Updated weights for policy 0, policy_version 187773 (0.0026) [2024-06-28 08:33:48,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 3076603904. Throughput: 0: 44327.2. Samples: 2979557260. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2024-06-28 08:33:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 08:33:48,891][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000187782_3076620288.pth... [2024-06-28 08:33:48,943][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000187139_3066085376.pth [2024-06-28 08:33:49,146][06909] Updated weights for policy 0, policy_version 187783 (0.0031) [2024-06-28 08:33:51,831][06909] Updated weights for policy 0, policy_version 187793 (0.0028) [2024-06-28 08:33:53,850][06674] Fps is (10 sec: 44236.2, 60 sec: 44236.7, 300 sec: 44264.6). Total num frames: 3076866048. Throughput: 0: 44238.6. Samples: 2979809200. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2024-06-28 08:33:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:33:56,462][06909] Updated weights for policy 0, policy_version 187803 (0.0031) [2024-06-28 08:33:58,850][06674] Fps is (10 sec: 49152.1, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 3077095424. Throughput: 0: 44169.9. Samples: 2979955620. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2024-06-28 08:33:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:33:59,149][06909] Updated weights for policy 0, policy_version 187813 (0.0036) [2024-06-28 08:34:03,850][06674] Fps is (10 sec: 40960.6, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 3077275648. Throughput: 0: 44349.3. Samples: 2980226040. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2024-06-28 08:34:03,850][06674] Avg episode reward: [(0, '0.414')] [2024-06-28 08:34:03,883][06909] Updated weights for policy 0, policy_version 187823 (0.0030) [2024-06-28 08:34:06,567][06909] Updated weights for policy 0, policy_version 187833 (0.0031) [2024-06-28 08:34:08,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 3077521408. Throughput: 0: 44136.4. Samples: 2980470620. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2024-06-28 08:34:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:34:11,525][06909] Updated weights for policy 0, policy_version 187843 (0.0039) [2024-06-28 08:34:13,850][06674] Fps is (10 sec: 47513.2, 60 sec: 44509.9, 300 sec: 43986.9). Total num frames: 3077750784. Throughput: 0: 43893.8. Samples: 2980608080. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2024-06-28 08:34:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:34:14,207][06909] Updated weights for policy 0, policy_version 187853 (0.0026) [2024-06-28 08:34:18,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3077931008. Throughput: 0: 44129.7. Samples: 2980875900. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2024-06-28 08:34:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:34:18,995][06909] Updated weights for policy 0, policy_version 187863 (0.0033) [2024-06-28 08:34:21,872][06909] Updated weights for policy 0, policy_version 187873 (0.0038) [2024-06-28 08:34:23,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.7, 300 sec: 44264.6). Total num frames: 3078176768. Throughput: 0: 43910.2. Samples: 2981124180. Policy #0 lag: (min: 0.0, avg: 8.7, max: 20.0) [2024-06-28 08:34:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:34:26,439][06909] Updated weights for policy 0, policy_version 187883 (0.0028) [2024-06-28 08:34:28,850][06674] Fps is (10 sec: 47513.3, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3078406144. Throughput: 0: 43907.9. Samples: 2981265740. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 08:34:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:34:29,201][06909] Updated weights for policy 0, policy_version 187893 (0.0043) [2024-06-28 08:34:33,823][06909] Updated weights for policy 0, policy_version 187903 (0.0043) [2024-06-28 08:34:33,853][06674] Fps is (10 sec: 42586.7, 60 sec: 44507.8, 300 sec: 44042.0). Total num frames: 3078602752. Throughput: 0: 44016.0. Samples: 2981538100. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 08:34:33,853][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:34:36,590][06909] Updated weights for policy 0, policy_version 187913 (0.0041) [2024-06-28 08:34:38,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.6, 300 sec: 44153.8). Total num frames: 3078832128. Throughput: 0: 43857.8. Samples: 2981782800. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 08:34:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:34:41,467][06909] Updated weights for policy 0, policy_version 187923 (0.0031) [2024-06-28 08:34:43,856][06674] Fps is (10 sec: 47497.6, 60 sec: 44232.3, 300 sec: 44097.0). Total num frames: 3079077888. Throughput: 0: 43868.7. Samples: 2981929980. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 08:34:43,857][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 08:34:44,154][06909] Updated weights for policy 0, policy_version 187933 (0.0039) [2024-06-28 08:34:48,112][06887] Signal inference workers to stop experience collection... (42300 times) [2024-06-28 08:34:48,113][06887] Signal inference workers to resume experience collection... (42300 times) [2024-06-28 08:34:48,142][06909] InferenceWorker_p0-w0: stopping experience collection (42300 times) [2024-06-28 08:34:48,142][06909] InferenceWorker_p0-w0: resuming experience collection (42300 times) [2024-06-28 08:34:48,850][06674] Fps is (10 sec: 40960.7, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 3079241728. Throughput: 0: 43760.5. Samples: 2982195260. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 08:34:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 08:34:48,920][06909] Updated weights for policy 0, policy_version 187943 (0.0034) [2024-06-28 08:34:51,468][06909] Updated weights for policy 0, policy_version 187953 (0.0039) [2024-06-28 08:34:53,850][06674] Fps is (10 sec: 40984.8, 60 sec: 43690.7, 300 sec: 44098.3). Total num frames: 3079487488. Throughput: 0: 43864.0. Samples: 2982444500. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 08:34:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:34:56,443][06909] Updated weights for policy 0, policy_version 187963 (0.0028) [2024-06-28 08:34:58,850][06674] Fps is (10 sec: 49151.9, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 3079733248. Throughput: 0: 43882.3. Samples: 2982582780. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 08:34:58,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-28 08:34:58,870][06909] Updated weights for policy 0, policy_version 187973 (0.0027) [2024-06-28 08:35:03,703][06909] Updated weights for policy 0, policy_version 187983 (0.0021) [2024-06-28 08:35:03,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.7, 300 sec: 43931.4). Total num frames: 3079913472. Throughput: 0: 44017.8. Samples: 2982856700. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 08:35:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 08:35:06,424][06909] Updated weights for policy 0, policy_version 187993 (0.0030) [2024-06-28 08:35:08,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 3080142848. Throughput: 0: 43940.0. Samples: 2983101480. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 08:35:08,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 08:35:11,315][06909] Updated weights for policy 0, policy_version 188003 (0.0027) [2024-06-28 08:35:13,755][06909] Updated weights for policy 0, policy_version 188013 (0.0026) [2024-06-28 08:35:13,850][06674] Fps is (10 sec: 49151.9, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 3080404992. Throughput: 0: 43952.0. Samples: 2983243580. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 08:35:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 08:35:18,850][06674] Fps is (10 sec: 40959.3, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 3080552448. Throughput: 0: 43915.9. Samples: 2983514200. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 08:35:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:35:18,896][06909] Updated weights for policy 0, policy_version 188023 (0.0045) [2024-06-28 08:35:21,187][06909] Updated weights for policy 0, policy_version 188033 (0.0034) [2024-06-28 08:35:23,850][06674] Fps is (10 sec: 39321.4, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 3080798208. Throughput: 0: 44122.7. Samples: 2983768320. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 08:35:23,853][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 08:35:26,225][06909] Updated weights for policy 0, policy_version 188043 (0.0034) [2024-06-28 08:35:28,606][06909] Updated weights for policy 0, policy_version 188053 (0.0039) [2024-06-28 08:35:28,852][06674] Fps is (10 sec: 50780.5, 60 sec: 44235.3, 300 sec: 44153.2). Total num frames: 3081060352. Throughput: 0: 43850.1. Samples: 2983903060. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 08:35:28,853][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:35:33,413][06909] Updated weights for policy 0, policy_version 188063 (0.0036) [2024-06-28 08:35:33,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43692.6, 300 sec: 43875.8). Total num frames: 3081224192. Throughput: 0: 43944.4. Samples: 2984172760. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 08:35:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:35:36,338][06909] Updated weights for policy 0, policy_version 188073 (0.0037) [2024-06-28 08:35:38,851][06674] Fps is (10 sec: 39323.4, 60 sec: 43689.5, 300 sec: 43986.6). Total num frames: 3081453568. Throughput: 0: 44089.6. Samples: 2984428600. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 08:35:38,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:35:40,976][06909] Updated weights for policy 0, policy_version 188083 (0.0035) [2024-06-28 08:35:43,503][06887] Signal inference workers to stop experience collection... (42350 times) [2024-06-28 08:35:43,503][06887] Signal inference workers to resume experience collection... (42350 times) [2024-06-28 08:35:43,545][06909] InferenceWorker_p0-w0: stopping experience collection (42350 times) [2024-06-28 08:35:43,545][06909] InferenceWorker_p0-w0: resuming experience collection (42350 times) [2024-06-28 08:35:43,651][06909] Updated weights for policy 0, policy_version 188093 (0.0032) [2024-06-28 08:35:43,850][06674] Fps is (10 sec: 49151.7, 60 sec: 43968.1, 300 sec: 44097.9). Total num frames: 3081715712. Throughput: 0: 44128.8. Samples: 2984568580. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 08:35:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 08:35:48,276][06909] Updated weights for policy 0, policy_version 188103 (0.0035) [2024-06-28 08:35:48,850][06674] Fps is (10 sec: 44244.5, 60 sec: 44236.8, 300 sec: 43931.4). Total num frames: 3081895936. Throughput: 0: 43959.2. Samples: 2984834860. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 08:35:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:35:48,872][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000188104_3081895936.pth... [2024-06-28 08:35:48,923][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000187460_3071344640.pth [2024-06-28 08:35:50,994][06909] Updated weights for policy 0, policy_version 188113 (0.0029) [2024-06-28 08:35:53,852][06674] Fps is (10 sec: 39313.8, 60 sec: 43689.2, 300 sec: 43986.6). Total num frames: 3082108928. Throughput: 0: 44277.5. Samples: 2985094060. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 08:35:53,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 08:35:55,757][06909] Updated weights for policy 0, policy_version 188123 (0.0033) [2024-06-28 08:35:58,170][06909] Updated weights for policy 0, policy_version 188133 (0.0034) [2024-06-28 08:35:58,850][06674] Fps is (10 sec: 49151.9, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 3082387456. Throughput: 0: 44242.7. Samples: 2985234500. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 08:35:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:36:03,101][06909] Updated weights for policy 0, policy_version 188143 (0.0032) [2024-06-28 08:36:03,850][06674] Fps is (10 sec: 44246.2, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 3082551296. Throughput: 0: 44039.3. Samples: 2985495960. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 08:36:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:36:05,727][06909] Updated weights for policy 0, policy_version 188153 (0.0025) [2024-06-28 08:36:08,850][06674] Fps is (10 sec: 39321.7, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3082780672. Throughput: 0: 44325.9. Samples: 2985762980. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 08:36:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 08:36:10,154][06909] Updated weights for policy 0, policy_version 188163 (0.0022) [2024-06-28 08:36:13,168][06909] Updated weights for policy 0, policy_version 188173 (0.0042) [2024-06-28 08:36:13,850][06674] Fps is (10 sec: 49151.8, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 3083042816. Throughput: 0: 44347.0. Samples: 2985898580. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 08:36:13,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 08:36:17,955][06909] Updated weights for policy 0, policy_version 188183 (0.0040) [2024-06-28 08:36:18,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44510.0, 300 sec: 43875.8). Total num frames: 3083223040. Throughput: 0: 44147.1. Samples: 2986159380. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 08:36:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 08:36:20,601][06909] Updated weights for policy 0, policy_version 188193 (0.0027) [2024-06-28 08:36:23,850][06674] Fps is (10 sec: 39321.3, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3083436032. Throughput: 0: 44406.9. Samples: 2986426840. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 08:36:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:36:25,371][06909] Updated weights for policy 0, policy_version 188203 (0.0036) [2024-06-28 08:36:28,258][06909] Updated weights for policy 0, policy_version 188213 (0.0025) [2024-06-28 08:36:28,856][06674] Fps is (10 sec: 47484.9, 60 sec: 43960.8, 300 sec: 44152.6). Total num frames: 3083698176. Throughput: 0: 44221.7. Samples: 2986558820. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 08:36:28,856][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:36:32,860][06909] Updated weights for policy 0, policy_version 188223 (0.0022) [2024-06-28 08:36:33,851][06674] Fps is (10 sec: 45869.8, 60 sec: 44509.0, 300 sec: 43986.7). Total num frames: 3083894784. Throughput: 0: 44104.9. Samples: 2986819640. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 08:36:33,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:36:35,498][06909] Updated weights for policy 0, policy_version 188233 (0.0035) [2024-06-28 08:36:38,850][06674] Fps is (10 sec: 39345.6, 60 sec: 43965.0, 300 sec: 43986.9). Total num frames: 3084091392. Throughput: 0: 44227.0. Samples: 2987084180. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2024-06-28 08:36:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:36:40,172][06909] Updated weights for policy 0, policy_version 188243 (0.0028) [2024-06-28 08:36:42,982][06909] Updated weights for policy 0, policy_version 188253 (0.0035) [2024-06-28 08:36:43,850][06674] Fps is (10 sec: 47519.2, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 3084369920. Throughput: 0: 44026.5. Samples: 2987215700. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2024-06-28 08:36:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 08:36:47,603][06909] Updated weights for policy 0, policy_version 188263 (0.0021) [2024-06-28 08:36:48,728][06887] Signal inference workers to stop experience collection... (42400 times) [2024-06-28 08:36:48,728][06887] Signal inference workers to resume experience collection... (42400 times) [2024-06-28 08:36:48,759][06909] InferenceWorker_p0-w0: stopping experience collection (42400 times) [2024-06-28 08:36:48,759][06909] InferenceWorker_p0-w0: resuming experience collection (42400 times) [2024-06-28 08:36:48,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.8, 300 sec: 43931.4). Total num frames: 3084550144. Throughput: 0: 44054.7. Samples: 2987478420. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2024-06-28 08:36:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:36:50,603][06909] Updated weights for policy 0, policy_version 188273 (0.0035) [2024-06-28 08:36:53,852][06674] Fps is (10 sec: 37675.5, 60 sec: 43963.7, 300 sec: 43931.0). Total num frames: 3084746752. Throughput: 0: 43988.5. Samples: 2987742560. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2024-06-28 08:36:53,853][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:36:55,048][06909] Updated weights for policy 0, policy_version 188283 (0.0037) [2024-06-28 08:36:58,211][06909] Updated weights for policy 0, policy_version 188293 (0.0037) [2024-06-28 08:36:58,850][06674] Fps is (10 sec: 47513.4, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 3085025280. Throughput: 0: 43872.4. Samples: 2987872840. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2024-06-28 08:36:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:37:02,381][06909] Updated weights for policy 0, policy_version 188303 (0.0031) [2024-06-28 08:37:03,850][06674] Fps is (10 sec: 47523.9, 60 sec: 44509.9, 300 sec: 43986.9). Total num frames: 3085221888. Throughput: 0: 44001.4. Samples: 2988139440. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2024-06-28 08:37:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:37:05,580][06909] Updated weights for policy 0, policy_version 188313 (0.0043) [2024-06-28 08:37:08,850][06674] Fps is (10 sec: 37683.6, 60 sec: 43690.7, 300 sec: 43931.4). Total num frames: 3085402112. Throughput: 0: 44038.4. Samples: 2988408560. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2024-06-28 08:37:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:37:10,130][06909] Updated weights for policy 0, policy_version 188323 (0.0030) [2024-06-28 08:37:12,772][06909] Updated weights for policy 0, policy_version 188333 (0.0028) [2024-06-28 08:37:13,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.8, 300 sec: 44209.0). Total num frames: 3085680640. Throughput: 0: 43912.7. Samples: 2988534620. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2024-06-28 08:37:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:37:17,308][06909] Updated weights for policy 0, policy_version 188343 (0.0036) [2024-06-28 08:37:18,850][06674] Fps is (10 sec: 47512.6, 60 sec: 44236.8, 300 sec: 43987.2). Total num frames: 3085877248. Throughput: 0: 44077.6. Samples: 2988803080. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2024-06-28 08:37:18,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:37:20,225][06909] Updated weights for policy 0, policy_version 188353 (0.0033) [2024-06-28 08:37:23,850][06674] Fps is (10 sec: 40960.2, 60 sec: 44237.0, 300 sec: 43986.9). Total num frames: 3086090240. Throughput: 0: 44162.3. Samples: 2989071480. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2024-06-28 08:37:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:37:24,601][06909] Updated weights for policy 0, policy_version 188363 (0.0035) [2024-06-28 08:37:27,789][06909] Updated weights for policy 0, policy_version 188373 (0.0034) [2024-06-28 08:37:28,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43968.2, 300 sec: 44153.5). Total num frames: 3086336000. Throughput: 0: 43946.7. Samples: 2989193300. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2024-06-28 08:37:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:37:32,677][06909] Updated weights for policy 0, policy_version 188383 (0.0027) [2024-06-28 08:37:33,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43964.7, 300 sec: 43986.9). Total num frames: 3086532608. Throughput: 0: 44066.3. Samples: 2989461400. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2024-06-28 08:37:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:37:35,335][06909] Updated weights for policy 0, policy_version 188393 (0.0026) [2024-06-28 08:37:38,850][06674] Fps is (10 sec: 40960.0, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3086745600. Throughput: 0: 44165.2. Samples: 2989729900. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2024-06-28 08:37:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 08:37:39,874][06909] Updated weights for policy 0, policy_version 188403 (0.0030) [2024-06-28 08:37:42,981][06909] Updated weights for policy 0, policy_version 188413 (0.0037) [2024-06-28 08:37:43,854][06674] Fps is (10 sec: 45856.5, 60 sec: 43687.8, 300 sec: 44152.9). Total num frames: 3086991360. Throughput: 0: 43954.3. Samples: 2989850960. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2024-06-28 08:37:43,854][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:37:47,218][06909] Updated weights for policy 0, policy_version 188423 (0.0026) [2024-06-28 08:37:48,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 3087204352. Throughput: 0: 44084.3. Samples: 2990123240. Policy #0 lag: (min: 0.0, avg: 11.6, max: 21.0) [2024-06-28 08:37:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:37:49,012][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000188429_3087220736.pth... [2024-06-28 08:37:49,065][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000187782_3076620288.pth [2024-06-28 08:37:50,122][06909] Updated weights for policy 0, policy_version 188433 (0.0037) [2024-06-28 08:37:53,850][06674] Fps is (10 sec: 40976.6, 60 sec: 44238.4, 300 sec: 43986.9). Total num frames: 3087400960. Throughput: 0: 44020.0. Samples: 2990389460. Policy #0 lag: (min: 0.0, avg: 11.6, max: 21.0) [2024-06-28 08:37:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:37:54,468][06909] Updated weights for policy 0, policy_version 188443 (0.0035) [2024-06-28 08:37:57,955][06909] Updated weights for policy 0, policy_version 188453 (0.0026) [2024-06-28 08:37:58,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 3087646720. Throughput: 0: 43963.5. Samples: 2990512980. Policy #0 lag: (min: 0.0, avg: 11.6, max: 21.0) [2024-06-28 08:37:58,853][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:38:02,189][06909] Updated weights for policy 0, policy_version 188463 (0.0026) [2024-06-28 08:38:03,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3087859712. Throughput: 0: 43963.2. Samples: 2990781420. Policy #0 lag: (min: 0.0, avg: 11.6, max: 21.0) [2024-06-28 08:38:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 08:38:05,261][06909] Updated weights for policy 0, policy_version 188473 (0.0038) [2024-06-28 08:38:08,854][06674] Fps is (10 sec: 40943.1, 60 sec: 44233.7, 300 sec: 43986.3). Total num frames: 3088056320. Throughput: 0: 43722.5. Samples: 2991039180. Policy #0 lag: (min: 0.0, avg: 11.6, max: 21.0) [2024-06-28 08:38:08,854][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:38:09,719][06909] Updated weights for policy 0, policy_version 188483 (0.0028) [2024-06-28 08:38:12,803][06909] Updated weights for policy 0, policy_version 188493 (0.0028) [2024-06-28 08:38:13,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 3088302080. Throughput: 0: 43957.4. Samples: 2991171380. Policy #0 lag: (min: 0.0, avg: 11.6, max: 21.0) [2024-06-28 08:38:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:38:16,938][06909] Updated weights for policy 0, policy_version 188503 (0.0028) [2024-06-28 08:38:18,850][06674] Fps is (10 sec: 47533.5, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 3088531456. Throughput: 0: 43923.9. Samples: 2991437980. Policy #0 lag: (min: 0.0, avg: 11.6, max: 21.0) [2024-06-28 08:38:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 08:38:19,990][06909] Updated weights for policy 0, policy_version 188513 (0.0039) [2024-06-28 08:38:23,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.7, 300 sec: 43931.4). Total num frames: 3088728064. Throughput: 0: 43915.6. Samples: 2991706100. Policy #0 lag: (min: 0.0, avg: 11.6, max: 21.0) [2024-06-28 08:38:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:38:24,158][06909] Updated weights for policy 0, policy_version 188523 (0.0031) [2024-06-28 08:38:27,515][06909] Updated weights for policy 0, policy_version 188533 (0.0048) [2024-06-28 08:38:28,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 3088957440. Throughput: 0: 44055.0. Samples: 2991833260. Policy #0 lag: (min: 0.0, avg: 11.6, max: 21.0) [2024-06-28 08:38:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 08:38:31,437][06909] Updated weights for policy 0, policy_version 188543 (0.0040) [2024-06-28 08:38:33,850][06674] Fps is (10 sec: 47513.9, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 3089203200. Throughput: 0: 44206.0. Samples: 2992112500. Policy #0 lag: (min: 0.0, avg: 11.6, max: 21.0) [2024-06-28 08:38:33,856][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 08:38:34,155][06887] Signal inference workers to stop experience collection... (42450 times) [2024-06-28 08:38:34,155][06887] Signal inference workers to resume experience collection... (42450 times) [2024-06-28 08:38:34,191][06909] InferenceWorker_p0-w0: stopping experience collection (42450 times) [2024-06-28 08:38:34,191][06909] InferenceWorker_p0-w0: resuming experience collection (42450 times) [2024-06-28 08:38:34,647][06909] Updated weights for policy 0, policy_version 188553 (0.0028) [2024-06-28 08:38:38,853][06674] Fps is (10 sec: 44223.5, 60 sec: 44234.6, 300 sec: 43986.4). Total num frames: 3089399808. Throughput: 0: 44067.7. Samples: 2992372640. Policy #0 lag: (min: 0.0, avg: 11.6, max: 21.0) [2024-06-28 08:38:38,853][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 08:38:39,123][06909] Updated weights for policy 0, policy_version 188563 (0.0028) [2024-06-28 08:38:42,322][06909] Updated weights for policy 0, policy_version 188573 (0.0030) [2024-06-28 08:38:43,852][06674] Fps is (10 sec: 40951.0, 60 sec: 43692.1, 300 sec: 44097.6). Total num frames: 3089612800. Throughput: 0: 44088.6. Samples: 2992497060. Policy #0 lag: (min: 0.0, avg: 11.6, max: 21.0) [2024-06-28 08:38:43,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:38:46,806][06909] Updated weights for policy 0, policy_version 188583 (0.0031) [2024-06-28 08:38:48,850][06674] Fps is (10 sec: 45889.5, 60 sec: 44237.0, 300 sec: 44042.4). Total num frames: 3089858560. Throughput: 0: 44032.1. Samples: 2992762860. Policy #0 lag: (min: 0.0, avg: 11.6, max: 21.0) [2024-06-28 08:38:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:38:50,015][06909] Updated weights for policy 0, policy_version 188593 (0.0029) [2024-06-28 08:38:53,850][06674] Fps is (10 sec: 44246.2, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 3090055168. Throughput: 0: 44182.3. Samples: 2993027200. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 08:38:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 08:38:54,060][06909] Updated weights for policy 0, policy_version 188603 (0.0024) [2024-06-28 08:38:57,277][06909] Updated weights for policy 0, policy_version 188613 (0.0022) [2024-06-28 08:38:58,855][06674] Fps is (10 sec: 42574.3, 60 sec: 43959.7, 300 sec: 44097.1). Total num frames: 3090284544. Throughput: 0: 44097.2. Samples: 2993156000. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 08:38:58,856][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:39:01,190][06909] Updated weights for policy 0, policy_version 188623 (0.0035) [2024-06-28 08:39:03,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3090513920. Throughput: 0: 44196.9. Samples: 2993426840. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 08:39:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:39:04,689][06909] Updated weights for policy 0, policy_version 188633 (0.0036) [2024-06-28 08:39:08,560][06909] Updated weights for policy 0, policy_version 188643 (0.0026) [2024-06-28 08:39:08,850][06674] Fps is (10 sec: 44261.4, 60 sec: 44512.9, 300 sec: 43986.9). Total num frames: 3090726912. Throughput: 0: 44163.5. Samples: 2993693460. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 08:39:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:39:11,729][06909] Updated weights for policy 0, policy_version 188653 (0.0023) [2024-06-28 08:39:13,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 3090956288. Throughput: 0: 44197.8. Samples: 2993822160. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 08:39:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:39:16,615][06909] Updated weights for policy 0, policy_version 188663 (0.0032) [2024-06-28 08:39:18,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 3091185664. Throughput: 0: 43866.1. Samples: 2994086480. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 08:39:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:39:19,375][06909] Updated weights for policy 0, policy_version 188673 (0.0025) [2024-06-28 08:39:23,755][06909] Updated weights for policy 0, policy_version 188683 (0.0024) [2024-06-28 08:39:23,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3091382272. Throughput: 0: 43918.5. Samples: 2994348840. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 08:39:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:39:27,210][06909] Updated weights for policy 0, policy_version 188693 (0.0027) [2024-06-28 08:39:28,850][06674] Fps is (10 sec: 40960.6, 60 sec: 43963.8, 300 sec: 44042.8). Total num frames: 3091595264. Throughput: 0: 44125.3. Samples: 2994482600. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 08:39:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:39:31,001][06909] Updated weights for policy 0, policy_version 188703 (0.0036) [2024-06-28 08:39:33,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 3091824640. Throughput: 0: 44113.3. Samples: 2994747960. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 08:39:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:39:34,479][06909] Updated weights for policy 0, policy_version 188713 (0.0028) [2024-06-28 08:39:38,344][06909] Updated weights for policy 0, policy_version 188723 (0.0033) [2024-06-28 08:39:38,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44239.1, 300 sec: 43987.8). Total num frames: 3092054016. Throughput: 0: 44105.8. Samples: 2995011960. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 08:39:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 08:39:42,012][06909] Updated weights for policy 0, policy_version 188733 (0.0034) [2024-06-28 08:39:43,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44238.4, 300 sec: 44153.5). Total num frames: 3092267008. Throughput: 0: 44129.1. Samples: 2995141560. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 08:39:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:39:46,141][06909] Updated weights for policy 0, policy_version 188743 (0.0034) [2024-06-28 08:39:48,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 3092512768. Throughput: 0: 44016.9. Samples: 2995407600. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 08:39:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:39:48,871][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000188752_3092512768.pth... [2024-06-28 08:39:48,934][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000188104_3081895936.pth [2024-06-28 08:39:49,222][06909] Updated weights for policy 0, policy_version 188753 (0.0034) [2024-06-28 08:39:53,689][06909] Updated weights for policy 0, policy_version 188763 (0.0028) [2024-06-28 08:39:53,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 3092692992. Throughput: 0: 43981.4. Samples: 2995672620. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 08:39:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:39:57,145][06909] Updated weights for policy 0, policy_version 188773 (0.0040) [2024-06-28 08:39:58,850][06674] Fps is (10 sec: 42598.1, 60 sec: 44240.9, 300 sec: 44153.5). Total num frames: 3092938752. Throughput: 0: 43996.0. Samples: 2995801980. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-28 08:39:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:40:01,057][06909] Updated weights for policy 0, policy_version 188783 (0.0027) [2024-06-28 08:40:03,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 3093151744. Throughput: 0: 43906.3. Samples: 2996062260. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-28 08:40:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:40:04,901][06909] Updated weights for policy 0, policy_version 188793 (0.0026) [2024-06-28 08:40:06,615][06887] Signal inference workers to stop experience collection... (42500 times) [2024-06-28 08:40:06,618][06887] Signal inference workers to resume experience collection... (42500 times) [2024-06-28 08:40:06,647][06909] InferenceWorker_p0-w0: stopping experience collection (42500 times) [2024-06-28 08:40:06,647][06909] InferenceWorker_p0-w0: resuming experience collection (42500 times) [2024-06-28 08:40:08,474][06909] Updated weights for policy 0, policy_version 188803 (0.0025) [2024-06-28 08:40:08,852][06674] Fps is (10 sec: 44227.6, 60 sec: 44235.3, 300 sec: 43986.6). Total num frames: 3093381120. Throughput: 0: 44124.2. Samples: 2996334520. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-28 08:40:08,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:40:12,266][06909] Updated weights for policy 0, policy_version 188813 (0.0024) [2024-06-28 08:40:13,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 3093577728. Throughput: 0: 44024.9. Samples: 2996463720. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-28 08:40:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:40:15,598][06909] Updated weights for policy 0, policy_version 188823 (0.0024) [2024-06-28 08:40:18,850][06674] Fps is (10 sec: 44246.0, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 3093823488. Throughput: 0: 44017.7. Samples: 2996728760. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-28 08:40:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:40:19,349][06909] Updated weights for policy 0, policy_version 188833 (0.0025) [2024-06-28 08:40:23,031][06909] Updated weights for policy 0, policy_version 188843 (0.0033) [2024-06-28 08:40:23,852][06674] Fps is (10 sec: 44227.1, 60 sec: 43962.2, 300 sec: 43931.3). Total num frames: 3094020096. Throughput: 0: 44062.7. Samples: 2996994880. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-28 08:40:23,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 08:40:26,569][06909] Updated weights for policy 0, policy_version 188853 (0.0040) [2024-06-28 08:40:28,850][06674] Fps is (10 sec: 42598.8, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 3094249472. Throughput: 0: 44021.8. Samples: 2997122540. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-28 08:40:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:40:30,627][06909] Updated weights for policy 0, policy_version 188863 (0.0028) [2024-06-28 08:40:33,850][06674] Fps is (10 sec: 45884.9, 60 sec: 44236.7, 300 sec: 44153.7). Total num frames: 3094478848. Throughput: 0: 44029.7. Samples: 2997388940. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-28 08:40:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:40:34,289][06909] Updated weights for policy 0, policy_version 188873 (0.0039) [2024-06-28 08:40:38,140][06909] Updated weights for policy 0, policy_version 188883 (0.0035) [2024-06-28 08:40:38,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3094691840. Throughput: 0: 44127.2. Samples: 2997658340. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-28 08:40:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:40:41,923][06909] Updated weights for policy 0, policy_version 188893 (0.0034) [2024-06-28 08:40:43,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 3094921216. Throughput: 0: 44162.2. Samples: 2997789280. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-28 08:40:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 08:40:45,314][06909] Updated weights for policy 0, policy_version 188903 (0.0032) [2024-06-28 08:40:48,852][06674] Fps is (10 sec: 44227.1, 60 sec: 43689.1, 300 sec: 44153.5). Total num frames: 3095134208. Throughput: 0: 44401.0. Samples: 2998060400. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-28 08:40:48,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:40:49,112][06909] Updated weights for policy 0, policy_version 188913 (0.0028) [2024-06-28 08:40:52,578][06909] Updated weights for policy 0, policy_version 188923 (0.0023) [2024-06-28 08:40:53,850][06674] Fps is (10 sec: 42598.7, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 3095347200. Throughput: 0: 44182.1. Samples: 2998322620. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-28 08:40:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:40:56,263][06909] Updated weights for policy 0, policy_version 188933 (0.0034) [2024-06-28 08:40:58,850][06674] Fps is (10 sec: 44246.3, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 3095576576. Throughput: 0: 44125.3. Samples: 2998449360. Policy #0 lag: (min: 0.0, avg: 10.1, max: 20.0) [2024-06-28 08:40:58,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 08:41:00,124][06909] Updated weights for policy 0, policy_version 188943 (0.0031) [2024-06-28 08:41:03,607][06909] Updated weights for policy 0, policy_version 188953 (0.0040) [2024-06-28 08:41:03,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 3095805952. Throughput: 0: 44083.2. Samples: 2998712500. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 08:41:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:41:08,004][06909] Updated weights for policy 0, policy_version 188963 (0.0029) [2024-06-28 08:41:08,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43692.2, 300 sec: 43931.3). Total num frames: 3096002560. Throughput: 0: 44118.5. Samples: 2998980120. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 08:41:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:41:11,423][06909] Updated weights for policy 0, policy_version 188973 (0.0033) [2024-06-28 08:41:13,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 3096248320. Throughput: 0: 44173.7. Samples: 2999110360. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 08:41:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:41:15,172][06909] Updated weights for policy 0, policy_version 188983 (0.0028) [2024-06-28 08:41:18,808][06909] Updated weights for policy 0, policy_version 188993 (0.0029) [2024-06-28 08:41:18,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 3096461312. Throughput: 0: 44115.4. Samples: 2999374140. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 08:41:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:41:22,442][06909] Updated weights for policy 0, policy_version 189003 (0.0036) [2024-06-28 08:41:23,852][06674] Fps is (10 sec: 40951.5, 60 sec: 43963.8, 300 sec: 43931.9). Total num frames: 3096657920. Throughput: 0: 44064.1. Samples: 2999641320. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 08:41:23,861][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:41:26,372][06909] Updated weights for policy 0, policy_version 189013 (0.0031) [2024-06-28 08:41:28,854][06674] Fps is (10 sec: 42579.2, 60 sec: 43960.3, 300 sec: 44041.9). Total num frames: 3096887296. Throughput: 0: 44026.6. Samples: 2999770680. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 08:41:28,855][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:41:30,073][06909] Updated weights for policy 0, policy_version 189023 (0.0030) [2024-06-28 08:41:32,834][06887] Signal inference workers to stop experience collection... (42550 times) [2024-06-28 08:41:32,858][06909] InferenceWorker_p0-w0: stopping experience collection (42550 times) [2024-06-28 08:41:32,886][06887] Signal inference workers to resume experience collection... (42550 times) [2024-06-28 08:41:32,892][06909] InferenceWorker_p0-w0: resuming experience collection (42550 times) [2024-06-28 08:41:33,546][06909] Updated weights for policy 0, policy_version 189033 (0.0027) [2024-06-28 08:41:33,852][06674] Fps is (10 sec: 45875.2, 60 sec: 43962.2, 300 sec: 44153.2). Total num frames: 3097116672. Throughput: 0: 43935.6. Samples: 3000037500. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 08:41:33,863][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:41:37,441][06909] Updated weights for policy 0, policy_version 189043 (0.0044) [2024-06-28 08:41:38,850][06674] Fps is (10 sec: 42617.9, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 3097313280. Throughput: 0: 43916.4. Samples: 3000298860. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 08:41:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:41:41,287][06909] Updated weights for policy 0, policy_version 189053 (0.0034) [2024-06-28 08:41:43,850][06674] Fps is (10 sec: 44246.0, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 3097559040. Throughput: 0: 43932.4. Samples: 3000426320. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 08:41:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:41:45,208][06909] Updated weights for policy 0, policy_version 189063 (0.0027) [2024-06-28 08:41:48,576][06909] Updated weights for policy 0, policy_version 189073 (0.0032) [2024-06-28 08:41:48,850][06674] Fps is (10 sec: 47513.7, 60 sec: 44238.4, 300 sec: 44209.4). Total num frames: 3097788416. Throughput: 0: 44040.8. Samples: 3000694340. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 08:41:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:41:48,866][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000189074_3097788416.pth... [2024-06-28 08:41:48,923][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000188429_3087220736.pth [2024-06-28 08:41:52,347][06909] Updated weights for policy 0, policy_version 189083 (0.0025) [2024-06-28 08:41:53,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 3098001408. Throughput: 0: 43933.3. Samples: 3000957120. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 08:41:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:41:56,128][06909] Updated weights for policy 0, policy_version 189093 (0.0021) [2024-06-28 08:41:58,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3098214400. Throughput: 0: 43951.5. Samples: 3001088180. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 08:41:58,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:41:59,615][06909] Updated weights for policy 0, policy_version 189103 (0.0033) [2024-06-28 08:42:03,332][06909] Updated weights for policy 0, policy_version 189113 (0.0034) [2024-06-28 08:42:03,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 3098443776. Throughput: 0: 44014.4. Samples: 3001354780. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 08:42:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:42:07,474][06909] Updated weights for policy 0, policy_version 189123 (0.0044) [2024-06-28 08:42:08,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 3098640384. Throughput: 0: 43812.2. Samples: 3001612780. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 08:42:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:42:10,910][06909] Updated weights for policy 0, policy_version 189133 (0.0023) [2024-06-28 08:42:13,852][06674] Fps is (10 sec: 42589.1, 60 sec: 43689.1, 300 sec: 44042.1). Total num frames: 3098869760. Throughput: 0: 43919.3. Samples: 3001746940. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 08:42:13,853][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:42:14,846][06909] Updated weights for policy 0, policy_version 189143 (0.0032) [2024-06-28 08:42:18,520][06909] Updated weights for policy 0, policy_version 189153 (0.0031) [2024-06-28 08:42:18,850][06674] Fps is (10 sec: 47513.9, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 3099115520. Throughput: 0: 43856.3. Samples: 3002010940. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 08:42:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:42:22,314][06909] Updated weights for policy 0, policy_version 189163 (0.0027) [2024-06-28 08:42:23,850][06674] Fps is (10 sec: 44246.3, 60 sec: 44238.3, 300 sec: 43986.9). Total num frames: 3099312128. Throughput: 0: 44061.8. Samples: 3002281640. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 08:42:23,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 08:42:26,199][06909] Updated weights for policy 0, policy_version 189173 (0.0029) [2024-06-28 08:42:28,856][06674] Fps is (10 sec: 42572.4, 60 sec: 44235.7, 300 sec: 44097.0). Total num frames: 3099541504. Throughput: 0: 44062.1. Samples: 3002409380. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 08:42:28,856][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:42:29,593][06909] Updated weights for policy 0, policy_version 189183 (0.0033) [2024-06-28 08:42:33,357][06909] Updated weights for policy 0, policy_version 189193 (0.0024) [2024-06-28 08:42:33,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44238.4, 300 sec: 44153.5). Total num frames: 3099770880. Throughput: 0: 44003.6. Samples: 3002674500. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 08:42:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:42:37,272][06909] Updated weights for policy 0, policy_version 189203 (0.0031) [2024-06-28 08:42:38,850][06674] Fps is (10 sec: 42624.5, 60 sec: 44236.9, 300 sec: 43987.5). Total num frames: 3099967488. Throughput: 0: 44168.5. Samples: 3002944700. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 08:42:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:42:40,556][06909] Updated weights for policy 0, policy_version 189213 (0.0032) [2024-06-28 08:42:43,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3100196864. Throughput: 0: 44088.0. Samples: 3003072140. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 08:42:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 08:42:44,422][06909] Updated weights for policy 0, policy_version 189223 (0.0029) [2024-06-28 08:42:48,395][06909] Updated weights for policy 0, policy_version 189233 (0.0040) [2024-06-28 08:42:48,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 3100426240. Throughput: 0: 43958.2. Samples: 3003332900. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 08:42:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:42:52,342][06909] Updated weights for policy 0, policy_version 189243 (0.0040) [2024-06-28 08:42:53,853][06674] Fps is (10 sec: 44224.8, 60 sec: 43961.7, 300 sec: 44042.0). Total num frames: 3100639232. Throughput: 0: 44192.4. Samples: 3003601560. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 08:42:53,853][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:42:55,540][06909] Updated weights for policy 0, policy_version 189253 (0.0032) [2024-06-28 08:42:56,880][06887] Signal inference workers to stop experience collection... (42600 times) [2024-06-28 08:42:56,881][06887] Signal inference workers to resume experience collection... (42600 times) [2024-06-28 08:42:56,928][06909] InferenceWorker_p0-w0: stopping experience collection (42600 times) [2024-06-28 08:42:56,928][06909] InferenceWorker_p0-w0: resuming experience collection (42600 times) [2024-06-28 08:42:58,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 3100868608. Throughput: 0: 44187.0. Samples: 3003735260. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 08:42:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:42:59,586][06909] Updated weights for policy 0, policy_version 189263 (0.0039) [2024-06-28 08:43:03,141][06909] Updated weights for policy 0, policy_version 189273 (0.0035) [2024-06-28 08:43:03,850][06674] Fps is (10 sec: 44249.3, 60 sec: 43963.7, 300 sec: 44154.1). Total num frames: 3101081600. Throughput: 0: 44138.6. Samples: 3003997180. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 08:43:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:43:06,777][06909] Updated weights for policy 0, policy_version 189283 (0.0035) [2024-06-28 08:43:08,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44509.9, 300 sec: 44097.9). Total num frames: 3101310976. Throughput: 0: 44059.1. Samples: 3004264300. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 08:43:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:43:10,360][06909] Updated weights for policy 0, policy_version 189293 (0.0037) [2024-06-28 08:43:13,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44238.4, 300 sec: 44042.4). Total num frames: 3101523968. Throughput: 0: 44170.4. Samples: 3004396780. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 08:43:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:43:14,355][06909] Updated weights for policy 0, policy_version 189303 (0.0031) [2024-06-28 08:43:17,512][06909] Updated weights for policy 0, policy_version 189313 (0.0028) [2024-06-28 08:43:18,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 3101736960. Throughput: 0: 44040.9. Samples: 3004656340. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 08:43:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:43:21,909][06909] Updated weights for policy 0, policy_version 189323 (0.0032) [2024-06-28 08:43:23,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 3101966336. Throughput: 0: 43980.4. Samples: 3004923820. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 08:43:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:43:25,413][06909] Updated weights for policy 0, policy_version 189333 (0.0034) [2024-06-28 08:43:28,850][06674] Fps is (10 sec: 42597.7, 60 sec: 43695.0, 300 sec: 43931.3). Total num frames: 3102162944. Throughput: 0: 44116.4. Samples: 3005057380. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 08:43:28,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:43:29,404][06909] Updated weights for policy 0, policy_version 189343 (0.0032) [2024-06-28 08:43:32,644][06909] Updated weights for policy 0, policy_version 189353 (0.0035) [2024-06-28 08:43:33,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.7, 300 sec: 44042.9). Total num frames: 3102392320. Throughput: 0: 44111.2. Samples: 3005317900. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 08:43:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:43:36,522][06909] Updated weights for policy 0, policy_version 189363 (0.0031) [2024-06-28 08:43:38,850][06674] Fps is (10 sec: 47513.8, 60 sec: 44509.8, 300 sec: 44153.8). Total num frames: 3102638080. Throughput: 0: 44101.8. Samples: 3005586020. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 08:43:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 08:43:40,359][06909] Updated weights for policy 0, policy_version 189373 (0.0039) [2024-06-28 08:43:43,802][06909] Updated weights for policy 0, policy_version 189383 (0.0032) [2024-06-28 08:43:43,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 3102851072. Throughput: 0: 44127.6. Samples: 3005721000. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 08:43:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:43:47,451][06909] Updated weights for policy 0, policy_version 189393 (0.0035) [2024-06-28 08:43:48,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 3103064064. Throughput: 0: 44300.9. Samples: 3005990720. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 08:43:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:43:48,863][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000189396_3103064064.pth... [2024-06-28 08:43:48,923][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000188752_3092512768.pth [2024-06-28 08:43:51,190][06909] Updated weights for policy 0, policy_version 189403 (0.0038) [2024-06-28 08:43:53,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44238.9, 300 sec: 44098.8). Total num frames: 3103293440. Throughput: 0: 44097.0. Samples: 3006248660. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 08:43:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:43:54,657][06909] Updated weights for policy 0, policy_version 189413 (0.0030) [2024-06-28 08:43:58,851][06674] Fps is (10 sec: 42594.5, 60 sec: 43690.0, 300 sec: 43986.7). Total num frames: 3103490048. Throughput: 0: 44038.2. Samples: 3006378540. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 08:43:58,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:43:59,040][06909] Updated weights for policy 0, policy_version 189423 (0.0035) [2024-06-28 08:44:02,389][06909] Updated weights for policy 0, policy_version 189433 (0.0037) [2024-06-28 08:44:03,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3103719424. Throughput: 0: 44116.8. Samples: 3006641600. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 08:44:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:44:06,707][06909] Updated weights for policy 0, policy_version 189443 (0.0025) [2024-06-28 08:44:08,850][06674] Fps is (10 sec: 45879.0, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3103948800. Throughput: 0: 43912.3. Samples: 3006899880. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 08:44:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 08:44:10,105][06909] Updated weights for policy 0, policy_version 189453 (0.0036) [2024-06-28 08:44:13,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 3104145408. Throughput: 0: 43987.6. Samples: 3007036820. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 08:44:13,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:44:14,120][06909] Updated weights for policy 0, policy_version 189463 (0.0029) [2024-06-28 08:44:17,407][06909] Updated weights for policy 0, policy_version 189473 (0.0032) [2024-06-28 08:44:18,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 3104391168. Throughput: 0: 44089.2. Samples: 3007301920. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 08:44:18,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:44:21,470][06909] Updated weights for policy 0, policy_version 189483 (0.0020) [2024-06-28 08:44:23,853][06674] Fps is (10 sec: 47498.9, 60 sec: 44234.5, 300 sec: 44153.0). Total num frames: 3104620544. Throughput: 0: 43839.7. Samples: 3007558940. Policy #0 lag: (min: 0.0, avg: 11.5, max: 26.0) [2024-06-28 08:44:23,853][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:44:25,126][06909] Updated weights for policy 0, policy_version 189493 (0.0024) [2024-06-28 08:44:28,698][06909] Updated weights for policy 0, policy_version 189503 (0.0033) [2024-06-28 08:44:28,850][06674] Fps is (10 sec: 42598.9, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 3104817152. Throughput: 0: 43764.8. Samples: 3007690420. Policy #0 lag: (min: 0.0, avg: 11.5, max: 26.0) [2024-06-28 08:44:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 08:44:32,339][06909] Updated weights for policy 0, policy_version 189513 (0.0032) [2024-06-28 08:44:33,428][06887] Signal inference workers to stop experience collection... (42650 times) [2024-06-28 08:44:33,432][06887] Signal inference workers to resume experience collection... (42650 times) [2024-06-28 08:44:33,442][06909] InferenceWorker_p0-w0: stopping experience collection (42650 times) [2024-06-28 08:44:33,468][06909] InferenceWorker_p0-w0: resuming experience collection (42650 times) [2024-06-28 08:44:33,852][06674] Fps is (10 sec: 42602.7, 60 sec: 44235.2, 300 sec: 44042.1). Total num frames: 3105046528. Throughput: 0: 43692.6. Samples: 3007956980. Policy #0 lag: (min: 0.0, avg: 11.5, max: 26.0) [2024-06-28 08:44:33,853][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:44:36,094][06909] Updated weights for policy 0, policy_version 189523 (0.0032) [2024-06-28 08:44:38,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 3105275904. Throughput: 0: 43809.7. Samples: 3008220100. Policy #0 lag: (min: 0.0, avg: 11.5, max: 26.0) [2024-06-28 08:44:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:44:39,927][06909] Updated weights for policy 0, policy_version 189533 (0.0038) [2024-06-28 08:44:43,850][06674] Fps is (10 sec: 40968.8, 60 sec: 43417.6, 300 sec: 43875.8). Total num frames: 3105456128. Throughput: 0: 43950.7. Samples: 3008356280. Policy #0 lag: (min: 0.0, avg: 11.5, max: 26.0) [2024-06-28 08:44:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:44:43,943][06909] Updated weights for policy 0, policy_version 189543 (0.0023) [2024-06-28 08:44:47,435][06909] Updated weights for policy 0, policy_version 189553 (0.0025) [2024-06-28 08:44:48,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 3105701888. Throughput: 0: 44067.6. Samples: 3008624640. Policy #0 lag: (min: 0.0, avg: 11.5, max: 26.0) [2024-06-28 08:44:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:44:51,084][06909] Updated weights for policy 0, policy_version 189563 (0.0032) [2024-06-28 08:44:53,850][06674] Fps is (10 sec: 47512.8, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 3105931264. Throughput: 0: 44102.6. Samples: 3008884500. Policy #0 lag: (min: 0.0, avg: 11.5, max: 26.0) [2024-06-28 08:44:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:44:54,659][06909] Updated weights for policy 0, policy_version 189573 (0.0027) [2024-06-28 08:44:58,757][06909] Updated weights for policy 0, policy_version 189583 (0.0040) [2024-06-28 08:44:58,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43964.4, 300 sec: 43986.9). Total num frames: 3106127872. Throughput: 0: 44026.3. Samples: 3009018000. Policy #0 lag: (min: 0.0, avg: 11.5, max: 26.0) [2024-06-28 08:44:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:45:02,030][06909] Updated weights for policy 0, policy_version 189593 (0.0035) [2024-06-28 08:45:03,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43963.8, 300 sec: 43987.2). Total num frames: 3106357248. Throughput: 0: 43973.0. Samples: 3009280700. Policy #0 lag: (min: 0.0, avg: 11.5, max: 26.0) [2024-06-28 08:45:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:45:06,200][06909] Updated weights for policy 0, policy_version 189603 (0.0030) [2024-06-28 08:45:08,852][06674] Fps is (10 sec: 45865.0, 60 sec: 43962.2, 300 sec: 44097.6). Total num frames: 3106586624. Throughput: 0: 44169.4. Samples: 3009546520. Policy #0 lag: (min: 0.0, avg: 11.5, max: 26.0) [2024-06-28 08:45:08,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:45:09,312][06909] Updated weights for policy 0, policy_version 189613 (0.0027) [2024-06-28 08:45:13,777][06909] Updated weights for policy 0, policy_version 189623 (0.0030) [2024-06-28 08:45:13,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 3106783232. Throughput: 0: 44193.9. Samples: 3009679140. Policy #0 lag: (min: 0.0, avg: 11.5, max: 26.0) [2024-06-28 08:45:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:45:16,554][06909] Updated weights for policy 0, policy_version 189633 (0.0026) [2024-06-28 08:45:18,850][06674] Fps is (10 sec: 44246.6, 60 sec: 43963.9, 300 sec: 44098.3). Total num frames: 3107028992. Throughput: 0: 44258.1. Samples: 3009948500. Policy #0 lag: (min: 0.0, avg: 11.5, max: 26.0) [2024-06-28 08:45:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:45:21,130][06909] Updated weights for policy 0, policy_version 189643 (0.0034) [2024-06-28 08:45:23,850][06674] Fps is (10 sec: 47513.2, 60 sec: 43966.0, 300 sec: 44097.9). Total num frames: 3107258368. Throughput: 0: 44120.0. Samples: 3010205500. Policy #0 lag: (min: 0.0, avg: 11.5, max: 26.0) [2024-06-28 08:45:23,850][06674] Avg episode reward: [(0, '0.443')] [2024-06-28 08:45:24,284][06909] Updated weights for policy 0, policy_version 189653 (0.0022) [2024-06-28 08:45:28,202][06909] Updated weights for policy 0, policy_version 189663 (0.0025) [2024-06-28 08:45:28,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3107454976. Throughput: 0: 44307.6. Samples: 3010350120. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 08:45:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:45:31,423][06909] Updated weights for policy 0, policy_version 189673 (0.0022) [2024-06-28 08:45:33,850][06674] Fps is (10 sec: 42595.5, 60 sec: 43964.8, 300 sec: 44042.3). Total num frames: 3107684352. Throughput: 0: 44213.1. Samples: 3010614260. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 08:45:33,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:45:35,907][06909] Updated weights for policy 0, policy_version 189683 (0.0033) [2024-06-28 08:45:38,853][06674] Fps is (10 sec: 45860.7, 60 sec: 43961.5, 300 sec: 44042.0). Total num frames: 3107913728. Throughput: 0: 44134.8. Samples: 3010870700. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 08:45:38,853][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 08:45:39,054][06909] Updated weights for policy 0, policy_version 189693 (0.0026) [2024-06-28 08:45:43,105][06909] Updated weights for policy 0, policy_version 189703 (0.0032) [2024-06-28 08:45:43,850][06674] Fps is (10 sec: 42601.6, 60 sec: 44236.8, 300 sec: 43987.2). Total num frames: 3108110336. Throughput: 0: 44206.7. Samples: 3011007300. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 08:45:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:45:46,220][06909] Updated weights for policy 0, policy_version 189713 (0.0027) [2024-06-28 08:45:48,850][06674] Fps is (10 sec: 42611.8, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3108339712. Throughput: 0: 44244.9. Samples: 3011271720. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 08:45:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:45:48,918][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000189719_3108356096.pth... [2024-06-28 08:45:48,985][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000189074_3097788416.pth [2024-06-28 08:45:50,820][06909] Updated weights for policy 0, policy_version 189723 (0.0025) [2024-06-28 08:45:53,617][06909] Updated weights for policy 0, policy_version 189733 (0.0045) [2024-06-28 08:45:53,850][06674] Fps is (10 sec: 47512.7, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 3108585472. Throughput: 0: 44070.0. Samples: 3011529580. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 08:45:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:45:54,553][06887] Signal inference workers to stop experience collection... (42700 times) [2024-06-28 08:45:54,554][06887] Signal inference workers to resume experience collection... (42700 times) [2024-06-28 08:45:54,575][06909] InferenceWorker_p0-w0: stopping experience collection (42700 times) [2024-06-28 08:45:54,607][06909] InferenceWorker_p0-w0: resuming experience collection (42700 times) [2024-06-28 08:45:58,439][06909] Updated weights for policy 0, policy_version 189743 (0.0027) [2024-06-28 08:45:58,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3108782080. Throughput: 0: 44219.9. Samples: 3011669040. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 08:45:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:46:01,471][06909] Updated weights for policy 0, policy_version 189753 (0.0024) [2024-06-28 08:46:03,851][06674] Fps is (10 sec: 40955.5, 60 sec: 43962.8, 300 sec: 44042.2). Total num frames: 3108995072. Throughput: 0: 43905.0. Samples: 3011924280. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 08:46:03,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 08:46:05,717][06909] Updated weights for policy 0, policy_version 189763 (0.0035) [2024-06-28 08:46:08,665][06909] Updated weights for policy 0, policy_version 189773 (0.0035) [2024-06-28 08:46:08,851][06674] Fps is (10 sec: 45868.4, 60 sec: 44237.3, 300 sec: 44042.2). Total num frames: 3109240832. Throughput: 0: 43901.3. Samples: 3012181120. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 08:46:08,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:46:13,186][06909] Updated weights for policy 0, policy_version 189783 (0.0030) [2024-06-28 08:46:13,850][06674] Fps is (10 sec: 44242.4, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3109437440. Throughput: 0: 43748.4. Samples: 3012318800. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 08:46:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:46:16,169][06909] Updated weights for policy 0, policy_version 189793 (0.0030) [2024-06-28 08:46:18,853][06674] Fps is (10 sec: 42592.8, 60 sec: 43961.7, 300 sec: 44097.8). Total num frames: 3109666816. Throughput: 0: 43778.4. Samples: 3012584380. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 08:46:18,853][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:46:20,371][06909] Updated weights for policy 0, policy_version 189803 (0.0031) [2024-06-28 08:46:23,400][06909] Updated weights for policy 0, policy_version 189813 (0.0037) [2024-06-28 08:46:23,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.7, 300 sec: 44098.6). Total num frames: 3109896192. Throughput: 0: 43873.6. Samples: 3012844880. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 08:46:23,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:46:28,117][06909] Updated weights for policy 0, policy_version 189823 (0.0032) [2024-06-28 08:46:28,850][06674] Fps is (10 sec: 42609.9, 60 sec: 43963.6, 300 sec: 43987.2). Total num frames: 3110092800. Throughput: 0: 43769.2. Samples: 3012976920. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 08:46:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:46:30,707][06909] Updated weights for policy 0, policy_version 189833 (0.0024) [2024-06-28 08:46:33,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43691.2, 300 sec: 44042.4). Total num frames: 3110305792. Throughput: 0: 43815.1. Samples: 3013243400. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 08:46:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:46:35,506][06909] Updated weights for policy 0, policy_version 189843 (0.0039) [2024-06-28 08:46:38,403][06909] Updated weights for policy 0, policy_version 189853 (0.0037) [2024-06-28 08:46:38,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43966.0, 300 sec: 44042.4). Total num frames: 3110551552. Throughput: 0: 43825.4. Samples: 3013501720. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 08:46:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:46:42,722][06909] Updated weights for policy 0, policy_version 189863 (0.0031) [2024-06-28 08:46:43,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 3110748160. Throughput: 0: 43805.8. Samples: 3013640300. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 08:46:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:46:45,626][06909] Updated weights for policy 0, policy_version 189873 (0.0031) [2024-06-28 08:46:48,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3110977536. Throughput: 0: 43986.5. Samples: 3013903620. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 08:46:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:46:50,379][06909] Updated weights for policy 0, policy_version 189883 (0.0024) [2024-06-28 08:46:53,278][06909] Updated weights for policy 0, policy_version 189893 (0.0030) [2024-06-28 08:46:53,853][06674] Fps is (10 sec: 45861.6, 60 sec: 43688.6, 300 sec: 44042.0). Total num frames: 3111206912. Throughput: 0: 43995.5. Samples: 3014160980. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 08:46:53,853][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:46:57,491][06909] Updated weights for policy 0, policy_version 189903 (0.0020) [2024-06-28 08:46:58,850][06674] Fps is (10 sec: 44236.0, 60 sec: 43963.6, 300 sec: 43986.8). Total num frames: 3111419904. Throughput: 0: 44083.3. Samples: 3014302560. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 08:46:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:47:00,680][06909] Updated weights for policy 0, policy_version 189913 (0.0047) [2024-06-28 08:47:03,850][06674] Fps is (10 sec: 42611.0, 60 sec: 43964.7, 300 sec: 44042.4). Total num frames: 3111632896. Throughput: 0: 44081.9. Samples: 3014567940. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 08:47:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:47:05,145][06909] Updated weights for policy 0, policy_version 189923 (0.0040) [2024-06-28 08:47:07,924][06909] Updated weights for policy 0, policy_version 189933 (0.0041) [2024-06-28 08:47:08,851][06674] Fps is (10 sec: 44230.1, 60 sec: 43690.5, 300 sec: 44042.5). Total num frames: 3111862272. Throughput: 0: 44065.5. Samples: 3014827900. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 08:47:08,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:47:12,554][06909] Updated weights for policy 0, policy_version 189943 (0.0024) [2024-06-28 08:47:13,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3112091648. Throughput: 0: 44170.7. Samples: 3014964600. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 08:47:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:47:15,773][06909] Updated weights for policy 0, policy_version 189953 (0.0034) [2024-06-28 08:47:18,850][06674] Fps is (10 sec: 44244.1, 60 sec: 43965.7, 300 sec: 44042.4). Total num frames: 3112304640. Throughput: 0: 44094.2. Samples: 3015227640. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 08:47:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:47:19,756][06909] Updated weights for policy 0, policy_version 189963 (0.0033) [2024-06-28 08:47:22,243][06887] Signal inference workers to stop experience collection... (42750 times) [2024-06-28 08:47:22,244][06887] Signal inference workers to resume experience collection... (42750 times) [2024-06-28 08:47:22,264][06909] InferenceWorker_p0-w0: stopping experience collection (42750 times) [2024-06-28 08:47:22,264][06909] InferenceWorker_p0-w0: resuming experience collection (42750 times) [2024-06-28 08:47:23,291][06909] Updated weights for policy 0, policy_version 189973 (0.0026) [2024-06-28 08:47:23,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.8, 300 sec: 44043.3). Total num frames: 3112534016. Throughput: 0: 44162.7. Samples: 3015489040. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 08:47:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:47:27,148][06909] Updated weights for policy 0, policy_version 189983 (0.0027) [2024-06-28 08:47:28,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3112747008. Throughput: 0: 44172.7. Samples: 3015628080. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 08:47:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:47:30,600][06909] Updated weights for policy 0, policy_version 189993 (0.0034) [2024-06-28 08:47:33,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3112960000. Throughput: 0: 44244.0. Samples: 3015894600. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 08:47:33,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-28 08:47:34,594][06909] Updated weights for policy 0, policy_version 190003 (0.0031) [2024-06-28 08:47:38,278][06909] Updated weights for policy 0, policy_version 190013 (0.0031) [2024-06-28 08:47:38,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3113189376. Throughput: 0: 44371.7. Samples: 3016157580. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 08:47:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:47:42,098][06909] Updated weights for policy 0, policy_version 190023 (0.0034) [2024-06-28 08:47:43,852][06674] Fps is (10 sec: 45862.9, 60 sec: 44507.8, 300 sec: 44042.0). Total num frames: 3113418752. Throughput: 0: 44214.9. Samples: 3016292340. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 08:47:43,853][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:47:45,611][06909] Updated weights for policy 0, policy_version 190033 (0.0035) [2024-06-28 08:47:48,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.6, 300 sec: 43987.3). Total num frames: 3113615360. Throughput: 0: 44228.3. Samples: 3016558220. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 08:47:48,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 08:47:48,990][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000190041_3113631744.pth... [2024-06-28 08:47:49,051][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000189396_3103064064.pth [2024-06-28 08:47:49,640][06909] Updated weights for policy 0, policy_version 190043 (0.0032) [2024-06-28 08:47:53,014][06909] Updated weights for policy 0, policy_version 190053 (0.0037) [2024-06-28 08:47:53,853][06674] Fps is (10 sec: 44236.2, 60 sec: 44236.9, 300 sec: 44042.0). Total num frames: 3113861120. Throughput: 0: 44170.9. Samples: 3016815640. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 08:47:53,853][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:47:56,777][06909] Updated weights for policy 0, policy_version 190063 (0.0039) [2024-06-28 08:47:58,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 3114074112. Throughput: 0: 44246.6. Samples: 3016955700. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 08:47:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:48:00,253][06909] Updated weights for policy 0, policy_version 190073 (0.0021) [2024-06-28 08:48:03,851][06674] Fps is (10 sec: 44245.2, 60 sec: 44509.1, 300 sec: 44042.3). Total num frames: 3114303488. Throughput: 0: 44152.0. Samples: 3017214520. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 08:48:03,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:48:04,403][06909] Updated weights for policy 0, policy_version 190083 (0.0024) [2024-06-28 08:48:07,821][06909] Updated weights for policy 0, policy_version 190093 (0.0042) [2024-06-28 08:48:08,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44238.1, 300 sec: 44042.4). Total num frames: 3114516480. Throughput: 0: 44453.8. Samples: 3017489460. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 08:48:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:48:11,709][06909] Updated weights for policy 0, policy_version 190103 (0.0042) [2024-06-28 08:48:13,850][06674] Fps is (10 sec: 44240.4, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 3114745856. Throughput: 0: 44249.3. Samples: 3017619300. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 08:48:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 08:48:15,486][06909] Updated weights for policy 0, policy_version 190113 (0.0026) [2024-06-28 08:48:18,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3114958848. Throughput: 0: 44040.9. Samples: 3017876440. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 08:48:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:48:18,943][06909] Updated weights for policy 0, policy_version 190123 (0.0032) [2024-06-28 08:48:22,696][06909] Updated weights for policy 0, policy_version 190133 (0.0034) [2024-06-28 08:48:23,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 3115171840. Throughput: 0: 44245.9. Samples: 3018148640. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 08:48:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:48:26,669][06909] Updated weights for policy 0, policy_version 190143 (0.0034) [2024-06-28 08:48:28,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 3115417600. Throughput: 0: 44162.6. Samples: 3018279540. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 08:48:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:48:29,986][06909] Updated weights for policy 0, policy_version 190153 (0.0036) [2024-06-28 08:48:33,853][06674] Fps is (10 sec: 44222.3, 60 sec: 44234.4, 300 sec: 43986.4). Total num frames: 3115614208. Throughput: 0: 44065.8. Samples: 3018541320. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 08:48:33,853][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:48:34,039][06909] Updated weights for policy 0, policy_version 190163 (0.0024) [2024-06-28 08:48:37,504][06909] Updated weights for policy 0, policy_version 190173 (0.0028) [2024-06-28 08:48:38,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 3115843584. Throughput: 0: 44454.8. Samples: 3018815980. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 08:48:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:48:41,473][06909] Updated weights for policy 0, policy_version 190183 (0.0026) [2024-06-28 08:48:43,850][06674] Fps is (10 sec: 45890.0, 60 sec: 44238.8, 300 sec: 44097.9). Total num frames: 3116072960. Throughput: 0: 44220.5. Samples: 3018945620. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 08:48:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:48:45,259][06909] Updated weights for policy 0, policy_version 190193 (0.0033) [2024-06-28 08:48:48,850][06674] Fps is (10 sec: 42598.1, 60 sec: 44236.8, 300 sec: 43986.8). Total num frames: 3116269568. Throughput: 0: 44224.8. Samples: 3019204600. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 08:48:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:48:48,949][06909] Updated weights for policy 0, policy_version 190203 (0.0032) [2024-06-28 08:48:52,589][06909] Updated weights for policy 0, policy_version 190213 (0.0026) [2024-06-28 08:48:53,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43965.8, 300 sec: 44098.1). Total num frames: 3116498944. Throughput: 0: 43806.2. Samples: 3019460740. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 08:48:53,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 08:48:56,225][06909] Updated weights for policy 0, policy_version 190223 (0.0039) [2024-06-28 08:48:58,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 3116728320. Throughput: 0: 43936.1. Samples: 3019596420. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 08:48:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 08:48:59,803][06909] Updated weights for policy 0, policy_version 190233 (0.0022) [2024-06-28 08:49:01,767][06887] Signal inference workers to stop experience collection... (42800 times) [2024-06-28 08:49:01,768][06887] Signal inference workers to resume experience collection... (42800 times) [2024-06-28 08:49:01,801][06909] InferenceWorker_p0-w0: stopping experience collection (42800 times) [2024-06-28 08:49:01,801][06909] InferenceWorker_p0-w0: resuming experience collection (42800 times) [2024-06-28 08:49:03,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43964.4, 300 sec: 44042.4). Total num frames: 3116941312. Throughput: 0: 44120.5. Samples: 3019861860. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 08:49:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:49:03,853][06909] Updated weights for policy 0, policy_version 190243 (0.0029) [2024-06-28 08:49:07,436][06909] Updated weights for policy 0, policy_version 190253 (0.0043) [2024-06-28 08:49:08,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 3117137920. Throughput: 0: 43864.7. Samples: 3020122560. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 08:49:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:49:11,365][06909] Updated weights for policy 0, policy_version 190263 (0.0031) [2024-06-28 08:49:13,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3117383680. Throughput: 0: 44026.2. Samples: 3020260720. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 08:49:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:49:14,884][06909] Updated weights for policy 0, policy_version 190273 (0.0028) [2024-06-28 08:49:18,787][06909] Updated weights for policy 0, policy_version 190283 (0.0024) [2024-06-28 08:49:18,850][06674] Fps is (10 sec: 45875.8, 60 sec: 43963.7, 300 sec: 43987.3). Total num frames: 3117596672. Throughput: 0: 44086.3. Samples: 3020525060. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 08:49:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:49:22,451][06909] Updated weights for policy 0, policy_version 190293 (0.0029) [2024-06-28 08:49:23,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3117809664. Throughput: 0: 43681.8. Samples: 3020781660. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 08:49:23,859][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 08:49:26,138][06909] Updated weights for policy 0, policy_version 190303 (0.0035) [2024-06-28 08:49:28,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43417.6, 300 sec: 43987.2). Total num frames: 3118022656. Throughput: 0: 43646.2. Samples: 3020909700. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 08:49:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:49:29,832][06909] Updated weights for policy 0, policy_version 190313 (0.0041) [2024-06-28 08:49:33,753][06909] Updated weights for policy 0, policy_version 190323 (0.0032) [2024-06-28 08:49:33,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43966.1, 300 sec: 43986.9). Total num frames: 3118252032. Throughput: 0: 43838.3. Samples: 3021177320. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 08:49:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:49:37,229][06909] Updated weights for policy 0, policy_version 190333 (0.0029) [2024-06-28 08:49:38,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.7, 300 sec: 44097.9). Total num frames: 3118465024. Throughput: 0: 43857.7. Samples: 3021434340. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 08:49:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:49:41,307][06909] Updated weights for policy 0, policy_version 190343 (0.0039) [2024-06-28 08:49:43,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 3118694400. Throughput: 0: 43800.0. Samples: 3021567420. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 08:49:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:49:44,647][06909] Updated weights for policy 0, policy_version 190353 (0.0032) [2024-06-28 08:49:48,663][06909] Updated weights for policy 0, policy_version 190363 (0.0033) [2024-06-28 08:49:48,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3118907392. Throughput: 0: 43818.7. Samples: 3021833700. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 08:49:48,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 08:49:48,945][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000190364_3118923776.pth... [2024-06-28 08:49:48,998][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000189719_3108356096.pth [2024-06-28 08:49:53,005][06909] Updated weights for policy 0, policy_version 190373 (0.0035) [2024-06-28 08:49:53,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 3119120384. Throughput: 0: 43813.0. Samples: 3022094140. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 08:49:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:49:56,275][06909] Updated weights for policy 0, policy_version 190383 (0.0034) [2024-06-28 08:49:58,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 3119349760. Throughput: 0: 43648.0. Samples: 3022224880. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 08:49:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:50:00,252][06909] Updated weights for policy 0, policy_version 190393 (0.0032) [2024-06-28 08:50:03,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43417.6, 300 sec: 43931.7). Total num frames: 3119546368. Throughput: 0: 43550.2. Samples: 3022484820. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 08:50:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:50:03,892][06909] Updated weights for policy 0, policy_version 190403 (0.0028) [2024-06-28 08:50:07,458][06909] Updated weights for policy 0, policy_version 190413 (0.0037) [2024-06-28 08:50:08,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3119775744. Throughput: 0: 43822.2. Samples: 3022753660. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 08:50:08,850][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 08:50:11,275][06909] Updated weights for policy 0, policy_version 190423 (0.0039) [2024-06-28 08:50:13,850][06674] Fps is (10 sec: 47513.5, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3120021504. Throughput: 0: 43931.5. Samples: 3022886620. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 08:50:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:50:14,695][06909] Updated weights for policy 0, policy_version 190433 (0.0030) [2024-06-28 08:50:18,405][06909] Updated weights for policy 0, policy_version 190443 (0.0037) [2024-06-28 08:50:18,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3120234496. Throughput: 0: 43918.7. Samples: 3023153660. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 08:50:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 08:50:22,084][06909] Updated weights for policy 0, policy_version 190453 (0.0031) [2024-06-28 08:50:23,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3120447488. Throughput: 0: 44340.1. Samples: 3023429640. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 08:50:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:50:25,828][06909] Updated weights for policy 0, policy_version 190463 (0.0029) [2024-06-28 08:50:28,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 44042.5). Total num frames: 3120676864. Throughput: 0: 44212.0. Samples: 3023556960. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 08:50:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:50:29,394][06909] Updated weights for policy 0, policy_version 190473 (0.0029) [2024-06-28 08:50:33,321][06887] Signal inference workers to stop experience collection... (42850 times) [2024-06-28 08:50:33,322][06887] Signal inference workers to resume experience collection... (42850 times) [2024-06-28 08:50:33,336][06909] InferenceWorker_p0-w0: stopping experience collection (42850 times) [2024-06-28 08:50:33,336][06909] InferenceWorker_p0-w0: resuming experience collection (42850 times) [2024-06-28 08:50:33,474][06909] Updated weights for policy 0, policy_version 190483 (0.0029) [2024-06-28 08:50:33,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 43931.8). Total num frames: 3120873472. Throughput: 0: 44113.4. Samples: 3023818800. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 08:50:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:50:36,995][06909] Updated weights for policy 0, policy_version 190493 (0.0028) [2024-06-28 08:50:38,853][06674] Fps is (10 sec: 44221.0, 60 sec: 44234.2, 300 sec: 44097.4). Total num frames: 3121119232. Throughput: 0: 44044.1. Samples: 3024076280. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 08:50:38,854][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:50:41,023][06909] Updated weights for policy 0, policy_version 190503 (0.0031) [2024-06-28 08:50:43,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3121332224. Throughput: 0: 44064.8. Samples: 3024207800. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 08:50:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:50:44,489][06909] Updated weights for policy 0, policy_version 190513 (0.0037) [2024-06-28 08:50:48,482][06909] Updated weights for policy 0, policy_version 190523 (0.0028) [2024-06-28 08:50:48,850][06674] Fps is (10 sec: 42613.6, 60 sec: 43963.7, 300 sec: 43931.4). Total num frames: 3121545216. Throughput: 0: 44130.7. Samples: 3024470700. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 08:50:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:50:52,068][06909] Updated weights for policy 0, policy_version 190533 (0.0032) [2024-06-28 08:50:53,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3121774592. Throughput: 0: 44061.4. Samples: 3024736420. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 08:50:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:50:55,762][06909] Updated weights for policy 0, policy_version 190543 (0.0025) [2024-06-28 08:50:58,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.8, 300 sec: 44098.1). Total num frames: 3122003968. Throughput: 0: 44060.5. Samples: 3024869340. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 08:50:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:50:59,598][06909] Updated weights for policy 0, policy_version 190553 (0.0032) [2024-06-28 08:51:03,288][06909] Updated weights for policy 0, policy_version 190563 (0.0029) [2024-06-28 08:51:03,852][06674] Fps is (10 sec: 44227.5, 60 sec: 44508.3, 300 sec: 43986.8). Total num frames: 3122216960. Throughput: 0: 43990.8. Samples: 3025133340. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-28 08:51:03,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:51:06,813][06909] Updated weights for policy 0, policy_version 190573 (0.0041) [2024-06-28 08:51:08,852][06674] Fps is (10 sec: 42586.7, 60 sec: 44234.8, 300 sec: 44042.0). Total num frames: 3122429952. Throughput: 0: 43805.7. Samples: 3025401020. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-28 08:51:08,853][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:51:10,771][06909] Updated weights for policy 0, policy_version 190583 (0.0039) [2024-06-28 08:51:13,850][06674] Fps is (10 sec: 44246.1, 60 sec: 43963.7, 300 sec: 44042.8). Total num frames: 3122659328. Throughput: 0: 43871.1. Samples: 3025531160. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-28 08:51:13,860][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:51:14,315][06909] Updated weights for policy 0, policy_version 190593 (0.0038) [2024-06-28 08:51:18,146][06909] Updated weights for policy 0, policy_version 190603 (0.0037) [2024-06-28 08:51:18,852][06674] Fps is (10 sec: 44239.5, 60 sec: 43962.2, 300 sec: 43986.6). Total num frames: 3122872320. Throughput: 0: 43885.0. Samples: 3025793720. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-28 08:51:18,861][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 08:51:21,877][06909] Updated weights for policy 0, policy_version 190613 (0.0035) [2024-06-28 08:51:23,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3123085312. Throughput: 0: 43898.6. Samples: 3026051560. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-28 08:51:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:51:25,742][06909] Updated weights for policy 0, policy_version 190623 (0.0034) [2024-06-28 08:51:28,850][06674] Fps is (10 sec: 44245.7, 60 sec: 43963.6, 300 sec: 44097.9). Total num frames: 3123314688. Throughput: 0: 43991.1. Samples: 3026187400. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-28 08:51:28,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 08:51:29,442][06909] Updated weights for policy 0, policy_version 190633 (0.0035) [2024-06-28 08:51:32,923][06909] Updated weights for policy 0, policy_version 190643 (0.0027) [2024-06-28 08:51:33,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 3123544064. Throughput: 0: 44024.8. Samples: 3026451820. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-28 08:51:33,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:51:36,738][06909] Updated weights for policy 0, policy_version 190653 (0.0034) [2024-06-28 08:51:38,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43966.4, 300 sec: 44097.9). Total num frames: 3123757056. Throughput: 0: 44085.4. Samples: 3026720260. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-28 08:51:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:51:40,558][06909] Updated weights for policy 0, policy_version 190663 (0.0025) [2024-06-28 08:51:43,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3123970048. Throughput: 0: 43944.9. Samples: 3026846860. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-28 08:51:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:51:44,162][06909] Updated weights for policy 0, policy_version 190673 (0.0033) [2024-06-28 08:51:47,812][06887] Signal inference workers to stop experience collection... (42900 times) [2024-06-28 08:51:47,850][06909] InferenceWorker_p0-w0: stopping experience collection (42900 times) [2024-06-28 08:51:47,869][06887] Signal inference workers to resume experience collection... (42900 times) [2024-06-28 08:51:47,870][06909] InferenceWorker_p0-w0: resuming experience collection (42900 times) [2024-06-28 08:51:47,876][06909] Updated weights for policy 0, policy_version 190683 (0.0032) [2024-06-28 08:51:48,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.8, 300 sec: 44042.8). Total num frames: 3124199424. Throughput: 0: 43889.2. Samples: 3027108260. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-28 08:51:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:51:48,863][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000190686_3124199424.pth... [2024-06-28 08:51:48,912][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000190041_3113631744.pth [2024-06-28 08:51:51,628][06909] Updated weights for policy 0, policy_version 190693 (0.0030) [2024-06-28 08:51:53,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3124396032. Throughput: 0: 43828.0. Samples: 3027373160. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-28 08:51:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:51:55,141][06909] Updated weights for policy 0, policy_version 190703 (0.0038) [2024-06-28 08:51:58,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 3124625408. Throughput: 0: 43839.1. Samples: 3027503920. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-28 08:51:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:51:59,167][06909] Updated weights for policy 0, policy_version 190713 (0.0033) [2024-06-28 08:52:02,839][06909] Updated weights for policy 0, policy_version 190723 (0.0040) [2024-06-28 08:52:03,852][06674] Fps is (10 sec: 45865.6, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3124854784. Throughput: 0: 43825.8. Samples: 3027765880. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-28 08:52:03,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:52:06,501][06909] Updated weights for policy 0, policy_version 190733 (0.0029) [2024-06-28 08:52:08,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43965.7, 300 sec: 43986.9). Total num frames: 3125067776. Throughput: 0: 44039.9. Samples: 3028033360. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 08:52:08,850][06674] Avg episode reward: [(0, '0.429')] [2024-06-28 08:52:10,192][06909] Updated weights for policy 0, policy_version 190743 (0.0033) [2024-06-28 08:52:13,850][06674] Fps is (10 sec: 42607.4, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3125280768. Throughput: 0: 43919.7. Samples: 3028163780. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 08:52:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:52:14,076][06909] Updated weights for policy 0, policy_version 190753 (0.0027) [2024-06-28 08:52:17,762][06909] Updated weights for policy 0, policy_version 190763 (0.0026) [2024-06-28 08:52:18,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43692.1, 300 sec: 43931.3). Total num frames: 3125493760. Throughput: 0: 43909.3. Samples: 3028427740. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 08:52:18,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 08:52:21,399][06909] Updated weights for policy 0, policy_version 190773 (0.0039) [2024-06-28 08:52:23,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3125723136. Throughput: 0: 43844.3. Samples: 3028693260. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 08:52:23,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:52:24,972][06909] Updated weights for policy 0, policy_version 190783 (0.0030) [2024-06-28 08:52:28,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3125936128. Throughput: 0: 43915.5. Samples: 3028823060. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 08:52:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:52:28,877][06909] Updated weights for policy 0, policy_version 190793 (0.0040) [2024-06-28 08:52:32,456][06909] Updated weights for policy 0, policy_version 190803 (0.0034) [2024-06-28 08:52:33,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3126165504. Throughput: 0: 44064.0. Samples: 3029091140. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 08:52:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:52:36,197][06909] Updated weights for policy 0, policy_version 190813 (0.0035) [2024-06-28 08:52:38,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43963.7, 300 sec: 43987.3). Total num frames: 3126394880. Throughput: 0: 44096.0. Samples: 3029357480. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 08:52:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:52:39,671][06909] Updated weights for policy 0, policy_version 190823 (0.0040) [2024-06-28 08:52:43,792][06909] Updated weights for policy 0, policy_version 190833 (0.0032) [2024-06-28 08:52:43,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3126607872. Throughput: 0: 44049.8. Samples: 3029486160. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 08:52:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:52:47,243][06909] Updated weights for policy 0, policy_version 190843 (0.0031) [2024-06-28 08:52:48,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44236.8, 300 sec: 44042.8). Total num frames: 3126853632. Throughput: 0: 44248.7. Samples: 3029756980. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 08:52:48,850][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 08:52:50,948][06909] Updated weights for policy 0, policy_version 190853 (0.0040) [2024-06-28 08:52:53,852][06674] Fps is (10 sec: 44228.1, 60 sec: 44235.3, 300 sec: 43986.6). Total num frames: 3127050240. Throughput: 0: 44308.8. Samples: 3030027340. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 08:52:53,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:52:54,845][06909] Updated weights for policy 0, policy_version 190863 (0.0032) [2024-06-28 08:52:58,140][06909] Updated weights for policy 0, policy_version 190873 (0.0024) [2024-06-28 08:52:58,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43963.7, 300 sec: 43931.5). Total num frames: 3127263232. Throughput: 0: 44261.7. Samples: 3030155560. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 08:52:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:53:02,346][06909] Updated weights for policy 0, policy_version 190883 (0.0037) [2024-06-28 08:53:03,850][06674] Fps is (10 sec: 45884.1, 60 sec: 44238.3, 300 sec: 44042.4). Total num frames: 3127508992. Throughput: 0: 44351.6. Samples: 3030423560. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 08:53:03,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 08:53:05,739][06909] Updated weights for policy 0, policy_version 190893 (0.0031) [2024-06-28 08:53:08,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.8, 300 sec: 43931.4). Total num frames: 3127705600. Throughput: 0: 44132.1. Samples: 3030679200. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 08:53:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:53:09,834][06909] Updated weights for policy 0, policy_version 190903 (0.0032) [2024-06-28 08:53:13,413][06909] Updated weights for policy 0, policy_version 190913 (0.0038) [2024-06-28 08:53:13,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 3127918592. Throughput: 0: 44101.8. Samples: 3030807640. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 08:53:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 08:53:17,167][06909] Updated weights for policy 0, policy_version 190923 (0.0025) [2024-06-28 08:53:18,850][06674] Fps is (10 sec: 47513.3, 60 sec: 44783.0, 300 sec: 44097.9). Total num frames: 3128180736. Throughput: 0: 44187.0. Samples: 3031079560. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-28 08:53:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:53:20,686][06909] Updated weights for policy 0, policy_version 190933 (0.0050) [2024-06-28 08:53:23,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 3128360960. Throughput: 0: 44116.8. Samples: 3031342740. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-28 08:53:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:53:24,837][06909] Updated weights for policy 0, policy_version 190943 (0.0032) [2024-06-28 08:53:27,950][06909] Updated weights for policy 0, policy_version 190953 (0.0023) [2024-06-28 08:53:28,850][06674] Fps is (10 sec: 40960.2, 60 sec: 44236.9, 300 sec: 43987.4). Total num frames: 3128590336. Throughput: 0: 44124.0. Samples: 3031471740. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-28 08:53:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 08:53:32,157][06909] Updated weights for policy 0, policy_version 190963 (0.0029) [2024-06-28 08:53:33,390][06887] Signal inference workers to stop experience collection... (42950 times) [2024-06-28 08:53:33,390][06887] Signal inference workers to resume experience collection... (42950 times) [2024-06-28 08:53:33,435][06909] InferenceWorker_p0-w0: stopping experience collection (42950 times) [2024-06-28 08:53:33,435][06909] InferenceWorker_p0-w0: resuming experience collection (42950 times) [2024-06-28 08:53:33,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 3128819712. Throughput: 0: 44018.2. Samples: 3031737800. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-28 08:53:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:53:35,536][06909] Updated weights for policy 0, policy_version 190973 (0.0022) [2024-06-28 08:53:38,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 3129016320. Throughput: 0: 43879.3. Samples: 3032001820. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-28 08:53:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:53:39,639][06909] Updated weights for policy 0, policy_version 190983 (0.0032) [2024-06-28 08:53:43,214][06909] Updated weights for policy 0, policy_version 190993 (0.0025) [2024-06-28 08:53:43,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3129245696. Throughput: 0: 43784.6. Samples: 3032125860. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-28 08:53:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:53:47,212][06909] Updated weights for policy 0, policy_version 191003 (0.0033) [2024-06-28 08:53:48,850][06674] Fps is (10 sec: 49151.8, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 3129507840. Throughput: 0: 43857.8. Samples: 3032397160. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-28 08:53:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:53:48,856][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000191010_3129507840.pth... [2024-06-28 08:53:48,912][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000190364_3118923776.pth [2024-06-28 08:53:50,669][06909] Updated weights for policy 0, policy_version 191013 (0.0034) [2024-06-28 08:53:53,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43965.1, 300 sec: 43931.3). Total num frames: 3129688064. Throughput: 0: 43998.1. Samples: 3032659120. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-28 08:53:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:53:54,686][06909] Updated weights for policy 0, policy_version 191023 (0.0022) [2024-06-28 08:53:57,982][06909] Updated weights for policy 0, policy_version 191033 (0.0042) [2024-06-28 08:53:58,850][06674] Fps is (10 sec: 39321.5, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 3129901056. Throughput: 0: 43904.9. Samples: 3032783360. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-28 08:53:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:54:02,110][06909] Updated weights for policy 0, policy_version 191043 (0.0028) [2024-06-28 08:54:03,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 3130146816. Throughput: 0: 44004.5. Samples: 3033059760. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-28 08:54:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:54:05,373][06909] Updated weights for policy 0, policy_version 191053 (0.0020) [2024-06-28 08:54:08,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 3130327040. Throughput: 0: 43994.7. Samples: 3033322500. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-28 08:54:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:54:09,578][06909] Updated weights for policy 0, policy_version 191063 (0.0028) [2024-06-28 08:54:12,928][06909] Updated weights for policy 0, policy_version 191073 (0.0024) [2024-06-28 08:54:13,856][06674] Fps is (10 sec: 42572.3, 60 sec: 44232.4, 300 sec: 43986.0). Total num frames: 3130572800. Throughput: 0: 43796.3. Samples: 3033442840. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-28 08:54:13,856][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:54:17,027][06909] Updated weights for policy 0, policy_version 191083 (0.0037) [2024-06-28 08:54:18,850][06674] Fps is (10 sec: 47512.9, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 3130802176. Throughput: 0: 43920.8. Samples: 3033714240. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-28 08:54:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:54:20,522][06909] Updated weights for policy 0, policy_version 191093 (0.0031) [2024-06-28 08:54:23,850][06674] Fps is (10 sec: 42624.6, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3130998784. Throughput: 0: 43868.9. Samples: 3033975920. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 08:54:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:54:24,607][06909] Updated weights for policy 0, policy_version 191103 (0.0041) [2024-06-28 08:54:27,886][06909] Updated weights for policy 0, policy_version 191113 (0.0033) [2024-06-28 08:54:28,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3131228160. Throughput: 0: 43880.9. Samples: 3034100500. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 08:54:28,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:54:31,798][06909] Updated weights for policy 0, policy_version 191123 (0.0028) [2024-06-28 08:54:33,850][06674] Fps is (10 sec: 47513.2, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 3131473920. Throughput: 0: 43995.1. Samples: 3034376940. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 08:54:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:54:35,140][06909] Updated weights for policy 0, policy_version 191133 (0.0044) [2024-06-28 08:54:38,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3131670528. Throughput: 0: 44142.8. Samples: 3034645540. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 08:54:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:54:39,253][06909] Updated weights for policy 0, policy_version 191143 (0.0023) [2024-06-28 08:54:42,284][06909] Updated weights for policy 0, policy_version 191153 (0.0034) [2024-06-28 08:54:43,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44509.8, 300 sec: 44098.0). Total num frames: 3131916288. Throughput: 0: 44299.6. Samples: 3034776840. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 08:54:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:54:46,444][06909] Updated weights for policy 0, policy_version 191163 (0.0036) [2024-06-28 08:54:48,852][06674] Fps is (10 sec: 45865.3, 60 sec: 43689.2, 300 sec: 44097.6). Total num frames: 3132129280. Throughput: 0: 44114.8. Samples: 3035045020. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 08:54:48,853][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:54:49,934][06909] Updated weights for policy 0, policy_version 191173 (0.0031) [2024-06-28 08:54:51,288][06887] Signal inference workers to stop experience collection... (43000 times) [2024-06-28 08:54:51,292][06887] Signal inference workers to resume experience collection... (43000 times) [2024-06-28 08:54:51,344][06909] InferenceWorker_p0-w0: stopping experience collection (43000 times) [2024-06-28 08:54:51,344][06909] InferenceWorker_p0-w0: resuming experience collection (43000 times) [2024-06-28 08:54:53,852][06674] Fps is (10 sec: 40950.7, 60 sec: 43962.1, 300 sec: 43986.5). Total num frames: 3132325888. Throughput: 0: 44197.4. Samples: 3035311480. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 08:54:53,853][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:54:54,307][06909] Updated weights for policy 0, policy_version 191183 (0.0027) [2024-06-28 08:54:57,511][06909] Updated weights for policy 0, policy_version 191193 (0.0032) [2024-06-28 08:54:58,850][06674] Fps is (10 sec: 44246.0, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 3132571648. Throughput: 0: 44318.9. Samples: 3035436920. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 08:54:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:55:01,393][06909] Updated weights for policy 0, policy_version 191203 (0.0036) [2024-06-28 08:55:03,850][06674] Fps is (10 sec: 47524.4, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 3132801024. Throughput: 0: 44229.5. Samples: 3035704560. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 08:55:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:55:04,840][06909] Updated weights for policy 0, policy_version 191213 (0.0026) [2024-06-28 08:55:08,850][06674] Fps is (10 sec: 40960.2, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 3132981248. Throughput: 0: 44416.4. Samples: 3035974660. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 08:55:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:55:08,966][06909] Updated weights for policy 0, policy_version 191223 (0.0034) [2024-06-28 08:55:12,146][06909] Updated weights for policy 0, policy_version 191233 (0.0036) [2024-06-28 08:55:13,850][06674] Fps is (10 sec: 42597.8, 60 sec: 44241.2, 300 sec: 44042.4). Total num frames: 3133227008. Throughput: 0: 44456.3. Samples: 3036101040. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 08:55:13,856][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 08:55:16,267][06909] Updated weights for policy 0, policy_version 191243 (0.0033) [2024-06-28 08:55:18,850][06674] Fps is (10 sec: 47513.7, 60 sec: 44236.9, 300 sec: 44097.9). Total num frames: 3133456384. Throughput: 0: 44203.6. Samples: 3036366100. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 08:55:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:55:19,434][06909] Updated weights for policy 0, policy_version 191253 (0.0036) [2024-06-28 08:55:23,711][06909] Updated weights for policy 0, policy_version 191263 (0.0032) [2024-06-28 08:55:23,850][06674] Fps is (10 sec: 42599.0, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3133652992. Throughput: 0: 44202.2. Samples: 3036634640. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 08:55:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:55:26,931][06909] Updated weights for policy 0, policy_version 191273 (0.0025) [2024-06-28 08:55:28,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 3133898752. Throughput: 0: 44127.0. Samples: 3036762560. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 08:55:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:55:31,478][06909] Updated weights for policy 0, policy_version 191283 (0.0042) [2024-06-28 08:55:33,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.8, 300 sec: 44042.9). Total num frames: 3134111744. Throughput: 0: 43976.3. Samples: 3037023860. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 08:55:33,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-28 08:55:34,688][06909] Updated weights for policy 0, policy_version 191293 (0.0035) [2024-06-28 08:55:38,771][06909] Updated weights for policy 0, policy_version 191303 (0.0032) [2024-06-28 08:55:38,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3134308352. Throughput: 0: 44127.0. Samples: 3037297100. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 08:55:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:55:41,974][06909] Updated weights for policy 0, policy_version 191313 (0.0031) [2024-06-28 08:55:43,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 3134554112. Throughput: 0: 44043.6. Samples: 3037418880. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 08:55:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:55:46,041][06909] Updated weights for policy 0, policy_version 191323 (0.0021) [2024-06-28 08:55:48,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43965.2, 300 sec: 44042.4). Total num frames: 3134767104. Throughput: 0: 44061.7. Samples: 3037687340. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 08:55:48,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 08:55:48,858][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000191331_3134767104.pth... [2024-06-28 08:55:48,933][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000190686_3124199424.pth [2024-06-28 08:55:49,238][06909] Updated weights for policy 0, policy_version 191333 (0.0038) [2024-06-28 08:55:53,682][06909] Updated weights for policy 0, policy_version 191343 (0.0031) [2024-06-28 08:55:53,850][06674] Fps is (10 sec: 42597.6, 60 sec: 44238.3, 300 sec: 43986.8). Total num frames: 3134980096. Throughput: 0: 44052.7. Samples: 3037957040. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 08:55:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:55:56,757][06909] Updated weights for policy 0, policy_version 191353 (0.0035) [2024-06-28 08:55:58,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.6, 300 sec: 44042.7). Total num frames: 3135209472. Throughput: 0: 44070.6. Samples: 3038084220. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 08:55:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:56:01,149][06909] Updated weights for policy 0, policy_version 191363 (0.0047) [2024-06-28 08:56:03,850][06674] Fps is (10 sec: 45876.0, 60 sec: 43963.7, 300 sec: 44098.4). Total num frames: 3135438848. Throughput: 0: 44067.6. Samples: 3038349140. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 08:56:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 08:56:04,000][06909] Updated weights for policy 0, policy_version 191373 (0.0042) [2024-06-28 08:56:08,498][06909] Updated weights for policy 0, policy_version 191383 (0.0026) [2024-06-28 08:56:08,794][06887] Signal inference workers to stop experience collection... (43050 times) [2024-06-28 08:56:08,795][06887] Signal inference workers to resume experience collection... (43050 times) [2024-06-28 08:56:08,839][06909] InferenceWorker_p0-w0: stopping experience collection (43050 times) [2024-06-28 08:56:08,839][06909] InferenceWorker_p0-w0: resuming experience collection (43050 times) [2024-06-28 08:56:08,850][06674] Fps is (10 sec: 42599.7, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 3135635456. Throughput: 0: 44094.3. Samples: 3038618880. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 08:56:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:56:11,762][06909] Updated weights for policy 0, policy_version 191393 (0.0027) [2024-06-28 08:56:13,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.9, 300 sec: 44098.3). Total num frames: 3135881216. Throughput: 0: 43968.5. Samples: 3038741140. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 08:56:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:56:15,916][06909] Updated weights for policy 0, policy_version 191403 (0.0034) [2024-06-28 08:56:18,852][06674] Fps is (10 sec: 45865.8, 60 sec: 43962.3, 300 sec: 44097.7). Total num frames: 3136094208. Throughput: 0: 44068.3. Samples: 3039007020. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 08:56:18,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:56:19,011][06909] Updated weights for policy 0, policy_version 191413 (0.0024) [2024-06-28 08:56:23,394][06909] Updated weights for policy 0, policy_version 191423 (0.0030) [2024-06-28 08:56:23,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3136307200. Throughput: 0: 44060.5. Samples: 3039279820. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 08:56:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:56:26,222][06909] Updated weights for policy 0, policy_version 191433 (0.0021) [2024-06-28 08:56:28,850][06674] Fps is (10 sec: 42606.6, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3136520192. Throughput: 0: 44119.0. Samples: 3039404240. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 08:56:28,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:56:30,849][06909] Updated weights for policy 0, policy_version 191443 (0.0039) [2024-06-28 08:56:33,633][06909] Updated weights for policy 0, policy_version 191453 (0.0032) [2024-06-28 08:56:33,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 3136765952. Throughput: 0: 43962.8. Samples: 3039665660. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 08:56:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:56:38,227][06909] Updated weights for policy 0, policy_version 191463 (0.0031) [2024-06-28 08:56:38,852][06674] Fps is (10 sec: 44227.7, 60 sec: 44235.3, 300 sec: 44042.1). Total num frames: 3136962560. Throughput: 0: 43962.6. Samples: 3039935440. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 08:56:38,852][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 08:56:40,963][06909] Updated weights for policy 0, policy_version 191473 (0.0026) [2024-06-28 08:56:43,850][06674] Fps is (10 sec: 42596.4, 60 sec: 43963.4, 300 sec: 44042.3). Total num frames: 3137191936. Throughput: 0: 43924.2. Samples: 3040060820. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 08:56:43,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 08:56:45,632][06909] Updated weights for policy 0, policy_version 191483 (0.0023) [2024-06-28 08:56:48,333][06909] Updated weights for policy 0, policy_version 191493 (0.0033) [2024-06-28 08:56:48,852][06674] Fps is (10 sec: 45875.0, 60 sec: 44235.3, 300 sec: 44153.2). Total num frames: 3137421312. Throughput: 0: 43957.0. Samples: 3040327300. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 08:56:48,853][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:56:53,249][06909] Updated weights for policy 0, policy_version 191503 (0.0041) [2024-06-28 08:56:53,850][06674] Fps is (10 sec: 42599.9, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3137617920. Throughput: 0: 44108.3. Samples: 3040603760. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 08:56:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:56:56,136][06909] Updated weights for policy 0, policy_version 191513 (0.0034) [2024-06-28 08:56:58,850][06674] Fps is (10 sec: 42607.4, 60 sec: 43963.9, 300 sec: 44042.7). Total num frames: 3137847296. Throughput: 0: 44234.2. Samples: 3040731680. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 08:56:58,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 08:57:00,462][06909] Updated weights for policy 0, policy_version 191523 (0.0043) [2024-06-28 08:57:03,433][06909] Updated weights for policy 0, policy_version 191533 (0.0036) [2024-06-28 08:57:03,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 3138076672. Throughput: 0: 44222.3. Samples: 3040996940. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 08:57:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:57:07,697][06909] Updated weights for policy 0, policy_version 191543 (0.0029) [2024-06-28 08:57:08,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 3138289664. Throughput: 0: 44011.1. Samples: 3041260320. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 08:57:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:57:10,684][06909] Updated weights for policy 0, policy_version 191553 (0.0052) [2024-06-28 08:57:13,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.6, 300 sec: 44098.0). Total num frames: 3138502656. Throughput: 0: 44089.8. Samples: 3041388280. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 08:57:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:57:15,351][06909] Updated weights for policy 0, policy_version 191563 (0.0036) [2024-06-28 08:57:17,937][06887] Signal inference workers to stop experience collection... (43100 times) [2024-06-28 08:57:17,938][06887] Signal inference workers to resume experience collection... (43100 times) [2024-06-28 08:57:17,956][06909] InferenceWorker_p0-w0: stopping experience collection (43100 times) [2024-06-28 08:57:17,956][06909] InferenceWorker_p0-w0: resuming experience collection (43100 times) [2024-06-28 08:57:18,281][06909] Updated weights for policy 0, policy_version 191573 (0.0035) [2024-06-28 08:57:18,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44238.2, 300 sec: 44153.5). Total num frames: 3138748416. Throughput: 0: 44088.0. Samples: 3041649620. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 08:57:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:57:22,810][06909] Updated weights for policy 0, policy_version 191583 (0.0028) [2024-06-28 08:57:23,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 3138961408. Throughput: 0: 44130.9. Samples: 3041921240. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 08:57:23,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 08:57:26,098][06909] Updated weights for policy 0, policy_version 191593 (0.0038) [2024-06-28 08:57:28,850][06674] Fps is (10 sec: 39321.5, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3139141632. Throughput: 0: 44184.0. Samples: 3042049080. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 08:57:28,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:57:30,399][06909] Updated weights for policy 0, policy_version 191603 (0.0030) [2024-06-28 08:57:33,278][06909] Updated weights for policy 0, policy_version 191613 (0.0024) [2024-06-28 08:57:33,850][06674] Fps is (10 sec: 45875.7, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 3139420160. Throughput: 0: 44196.4. Samples: 3042316040. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 08:57:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:57:37,670][06909] Updated weights for policy 0, policy_version 191623 (0.0036) [2024-06-28 08:57:38,850][06674] Fps is (10 sec: 47513.7, 60 sec: 44238.3, 300 sec: 44098.0). Total num frames: 3139616768. Throughput: 0: 43896.5. Samples: 3042579100. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 08:57:38,850][06674] Avg episode reward: [(0, '0.415')] [2024-06-28 08:57:40,706][06909] Updated weights for policy 0, policy_version 191633 (0.0030) [2024-06-28 08:57:43,850][06674] Fps is (10 sec: 39320.8, 60 sec: 43690.9, 300 sec: 43931.3). Total num frames: 3139813376. Throughput: 0: 43974.5. Samples: 3042710540. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 08:57:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:57:44,957][06909] Updated weights for policy 0, policy_version 191643 (0.0021) [2024-06-28 08:57:48,251][06909] Updated weights for policy 0, policy_version 191653 (0.0037) [2024-06-28 08:57:48,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43965.3, 300 sec: 44098.3). Total num frames: 3140059136. Throughput: 0: 43917.4. Samples: 3042973220. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 08:57:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:57:48,864][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000191655_3140075520.pth... [2024-06-28 08:57:48,909][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000191010_3129507840.pth [2024-06-28 08:57:52,573][06909] Updated weights for policy 0, policy_version 191663 (0.0036) [2024-06-28 08:57:53,850][06674] Fps is (10 sec: 45876.1, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 3140272128. Throughput: 0: 43901.4. Samples: 3043235880. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 08:57:53,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-28 08:57:55,606][06909] Updated weights for policy 0, policy_version 191673 (0.0037) [2024-06-28 08:57:58,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 3140468736. Throughput: 0: 43826.2. Samples: 3043360460. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 08:57:58,850][06674] Avg episode reward: [(0, '0.401')] [2024-06-28 08:57:59,887][06909] Updated weights for policy 0, policy_version 191683 (0.0035) [2024-06-28 08:58:03,133][06909] Updated weights for policy 0, policy_version 191693 (0.0038) [2024-06-28 08:58:03,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 3140730880. Throughput: 0: 44169.4. Samples: 3043637240. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 08:58:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:58:07,335][06909] Updated weights for policy 0, policy_version 191703 (0.0045) [2024-06-28 08:58:08,850][06674] Fps is (10 sec: 47513.8, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 3140943872. Throughput: 0: 43971.1. Samples: 3043899940. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 08:58:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:58:10,407][06909] Updated weights for policy 0, policy_version 191713 (0.0040) [2024-06-28 08:58:13,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 3141140480. Throughput: 0: 44028.0. Samples: 3044030340. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 08:58:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:58:14,724][06909] Updated weights for policy 0, policy_version 191723 (0.0030) [2024-06-28 08:58:18,164][06909] Updated weights for policy 0, policy_version 191733 (0.0024) [2024-06-28 08:58:18,852][06674] Fps is (10 sec: 44227.6, 60 sec: 43962.2, 300 sec: 44153.2). Total num frames: 3141386240. Throughput: 0: 44003.7. Samples: 3044296300. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 08:58:18,853][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:58:18,903][06887] Signal inference workers to stop experience collection... (43150 times) [2024-06-28 08:58:18,955][06909] InferenceWorker_p0-w0: stopping experience collection (43150 times) [2024-06-28 08:58:19,014][06887] Signal inference workers to resume experience collection... (43150 times) [2024-06-28 08:58:19,014][06909] InferenceWorker_p0-w0: resuming experience collection (43150 times) [2024-06-28 08:58:22,139][06909] Updated weights for policy 0, policy_version 191743 (0.0032) [2024-06-28 08:58:23,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 3141599232. Throughput: 0: 43832.8. Samples: 3044551580. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 08:58:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:58:25,782][06909] Updated weights for policy 0, policy_version 191753 (0.0040) [2024-06-28 08:58:28,852][06674] Fps is (10 sec: 39321.4, 60 sec: 43962.2, 300 sec: 43931.0). Total num frames: 3141779456. Throughput: 0: 43800.7. Samples: 3044681660. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 08:58:28,853][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:58:29,851][06909] Updated weights for policy 0, policy_version 191763 (0.0043) [2024-06-28 08:58:33,269][06909] Updated weights for policy 0, policy_version 191773 (0.0029) [2024-06-28 08:58:33,850][06674] Fps is (10 sec: 45875.8, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 3142057984. Throughput: 0: 43903.1. Samples: 3044948860. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 08:58:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:58:37,521][06909] Updated weights for policy 0, policy_version 191783 (0.0028) [2024-06-28 08:58:38,850][06674] Fps is (10 sec: 47523.9, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 3142254592. Throughput: 0: 43904.8. Samples: 3045211600. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 08:58:38,856][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:58:40,649][06909] Updated weights for policy 0, policy_version 191793 (0.0032) [2024-06-28 08:58:43,850][06674] Fps is (10 sec: 39321.4, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 3142451200. Throughput: 0: 43968.9. Samples: 3045339060. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 08:58:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:58:44,961][06909] Updated weights for policy 0, policy_version 191803 (0.0019) [2024-06-28 08:58:48,126][06909] Updated weights for policy 0, policy_version 191813 (0.0038) [2024-06-28 08:58:48,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 3142713344. Throughput: 0: 43816.9. Samples: 3045609000. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 08:58:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:58:52,302][06909] Updated weights for policy 0, policy_version 191823 (0.0037) [2024-06-28 08:58:53,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 3142926336. Throughput: 0: 43668.0. Samples: 3045865000. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 08:58:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:58:55,760][06909] Updated weights for policy 0, policy_version 191833 (0.0021) [2024-06-28 08:58:58,852][06674] Fps is (10 sec: 37675.2, 60 sec: 43689.2, 300 sec: 43875.5). Total num frames: 3143090176. Throughput: 0: 43623.3. Samples: 3045993480. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 08:58:58,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:58:59,817][06909] Updated weights for policy 0, policy_version 191843 (0.0028) [2024-06-28 08:59:03,172][06909] Updated weights for policy 0, policy_version 191853 (0.0028) [2024-06-28 08:59:03,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.6, 300 sec: 44153.5). Total num frames: 3143352320. Throughput: 0: 43670.9. Samples: 3046261400. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 08:59:03,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:59:07,256][06909] Updated weights for policy 0, policy_version 191863 (0.0031) [2024-06-28 08:59:08,850][06674] Fps is (10 sec: 47523.7, 60 sec: 43690.7, 300 sec: 44043.3). Total num frames: 3143565312. Throughput: 0: 43922.8. Samples: 3046528100. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 08:59:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:59:10,818][06909] Updated weights for policy 0, policy_version 191873 (0.0038) [2024-06-28 08:59:13,214][06887] Signal inference workers to stop experience collection... (43200 times) [2024-06-28 08:59:13,214][06887] Signal inference workers to resume experience collection... (43200 times) [2024-06-28 08:59:13,231][06909] InferenceWorker_p0-w0: stopping experience collection (43200 times) [2024-06-28 08:59:13,231][06909] InferenceWorker_p0-w0: resuming experience collection (43200 times) [2024-06-28 08:59:13,851][06674] Fps is (10 sec: 40956.9, 60 sec: 43690.1, 300 sec: 43931.2). Total num frames: 3143761920. Throughput: 0: 43803.1. Samples: 3046652740. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 08:59:13,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:59:14,689][06909] Updated weights for policy 0, policy_version 191883 (0.0027) [2024-06-28 08:59:18,326][06909] Updated weights for policy 0, policy_version 191893 (0.0032) [2024-06-28 08:59:18,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43965.3, 300 sec: 44153.5). Total num frames: 3144024064. Throughput: 0: 43857.8. Samples: 3046922460. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 08:59:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:59:22,095][06909] Updated weights for policy 0, policy_version 191903 (0.0030) [2024-06-28 08:59:23,856][06674] Fps is (10 sec: 47490.0, 60 sec: 43959.6, 300 sec: 44097.1). Total num frames: 3144237056. Throughput: 0: 43759.7. Samples: 3047181040. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 08:59:23,856][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:59:25,704][06909] Updated weights for policy 0, policy_version 191913 (0.0035) [2024-06-28 08:59:28,850][06674] Fps is (10 sec: 39321.7, 60 sec: 43965.3, 300 sec: 43875.8). Total num frames: 3144417280. Throughput: 0: 43805.9. Samples: 3047310320. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 08:59:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:59:29,459][06909] Updated weights for policy 0, policy_version 191923 (0.0038) [2024-06-28 08:59:32,936][06909] Updated weights for policy 0, policy_version 191933 (0.0038) [2024-06-28 08:59:33,851][06674] Fps is (10 sec: 44255.4, 60 sec: 43689.5, 300 sec: 44097.7). Total num frames: 3144679424. Throughput: 0: 43827.3. Samples: 3047581300. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 08:59:33,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:59:37,074][06909] Updated weights for policy 0, policy_version 191943 (0.0021) [2024-06-28 08:59:38,850][06674] Fps is (10 sec: 45874.5, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 3144876032. Throughput: 0: 43967.0. Samples: 3047843520. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 08:59:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 08:59:40,454][06909] Updated weights for policy 0, policy_version 191953 (0.0034) [2024-06-28 08:59:43,850][06674] Fps is (10 sec: 40965.8, 60 sec: 43963.7, 300 sec: 43931.6). Total num frames: 3145089024. Throughput: 0: 43933.4. Samples: 3047970400. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 08:59:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 08:59:44,469][06909] Updated weights for policy 0, policy_version 191963 (0.0031) [2024-06-28 08:59:48,018][06909] Updated weights for policy 0, policy_version 191973 (0.0032) [2024-06-28 08:59:48,852][06674] Fps is (10 sec: 45865.8, 60 sec: 43689.1, 300 sec: 44098.0). Total num frames: 3145334784. Throughput: 0: 43860.6. Samples: 3048235220. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 08:59:48,853][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 08:59:48,858][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000191976_3145334784.pth... [2024-06-28 08:59:48,911][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000191331_3134767104.pth [2024-06-28 08:59:52,131][06909] Updated weights for policy 0, policy_version 191983 (0.0036) [2024-06-28 08:59:53,850][06674] Fps is (10 sec: 44237.6, 60 sec: 43417.7, 300 sec: 43931.3). Total num frames: 3145531392. Throughput: 0: 43782.6. Samples: 3048498320. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-28 08:59:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 08:59:55,356][06909] Updated weights for policy 0, policy_version 191993 (0.0040) [2024-06-28 08:59:58,850][06674] Fps is (10 sec: 40969.1, 60 sec: 44238.4, 300 sec: 43875.8). Total num frames: 3145744384. Throughput: 0: 44010.6. Samples: 3048633180. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-28 08:59:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 08:59:59,138][06909] Updated weights for policy 0, policy_version 192003 (0.0036) [2024-06-28 09:00:02,814][06909] Updated weights for policy 0, policy_version 192013 (0.0038) [2024-06-28 09:00:03,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 3145990144. Throughput: 0: 44186.7. Samples: 3048910860. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-28 09:00:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:00:06,520][06909] Updated weights for policy 0, policy_version 192023 (0.0026) [2024-06-28 09:00:08,850][06674] Fps is (10 sec: 47513.2, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 3146219520. Throughput: 0: 44155.8. Samples: 3049167800. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-28 09:00:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:00:09,975][06909] Updated weights for policy 0, policy_version 192033 (0.0029) [2024-06-28 09:00:13,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44237.4, 300 sec: 43931.3). Total num frames: 3146416128. Throughput: 0: 44164.4. Samples: 3049297720. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-28 09:00:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:00:14,118][06909] Updated weights for policy 0, policy_version 192043 (0.0040) [2024-06-28 09:00:17,540][06909] Updated weights for policy 0, policy_version 192053 (0.0034) [2024-06-28 09:00:18,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 3146645504. Throughput: 0: 44058.8. Samples: 3049563880. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-28 09:00:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:00:21,602][06909] Updated weights for policy 0, policy_version 192063 (0.0030) [2024-06-28 09:00:23,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43968.0, 300 sec: 43986.9). Total num frames: 3146874880. Throughput: 0: 44230.4. Samples: 3049833880. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-28 09:00:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:00:24,730][06909] Updated weights for policy 0, policy_version 192073 (0.0028) [2024-06-28 09:00:28,850][06674] Fps is (10 sec: 42598.8, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 3147071488. Throughput: 0: 44256.2. Samples: 3049961920. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-28 09:00:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:00:28,950][06909] Updated weights for policy 0, policy_version 192083 (0.0026) [2024-06-28 09:00:29,556][06887] Signal inference workers to stop experience collection... (43250 times) [2024-06-28 09:00:29,556][06887] Signal inference workers to resume experience collection... (43250 times) [2024-06-28 09:00:29,595][06909] InferenceWorker_p0-w0: stopping experience collection (43250 times) [2024-06-28 09:00:29,595][06909] InferenceWorker_p0-w0: resuming experience collection (43250 times) [2024-06-28 09:00:32,255][06909] Updated weights for policy 0, policy_version 192093 (0.0029) [2024-06-28 09:00:33,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43691.8, 300 sec: 44042.4). Total num frames: 3147300864. Throughput: 0: 44281.3. Samples: 3050227780. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-28 09:00:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:00:36,174][06909] Updated weights for policy 0, policy_version 192103 (0.0034) [2024-06-28 09:00:38,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 3147530240. Throughput: 0: 44387.9. Samples: 3050495780. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-28 09:00:38,850][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 09:00:39,734][06909] Updated weights for policy 0, policy_version 192113 (0.0032) [2024-06-28 09:00:43,842][06909] Updated weights for policy 0, policy_version 192123 (0.0021) [2024-06-28 09:00:43,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 3147743232. Throughput: 0: 44465.7. Samples: 3050634140. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-28 09:00:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 09:00:47,061][06909] Updated weights for policy 0, policy_version 192133 (0.0027) [2024-06-28 09:00:48,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43965.3, 300 sec: 44042.4). Total num frames: 3147972608. Throughput: 0: 44035.9. Samples: 3050892480. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-28 09:00:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:00:51,084][06909] Updated weights for policy 0, policy_version 192143 (0.0043) [2024-06-28 09:00:53,850][06674] Fps is (10 sec: 47513.8, 60 sec: 44782.9, 300 sec: 44098.0). Total num frames: 3148218368. Throughput: 0: 44209.8. Samples: 3051157240. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-28 09:00:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:00:54,330][06909] Updated weights for policy 0, policy_version 192153 (0.0043) [2024-06-28 09:00:58,791][06909] Updated weights for policy 0, policy_version 192163 (0.0033) [2024-06-28 09:00:58,850][06674] Fps is (10 sec: 42598.7, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 3148398592. Throughput: 0: 44396.0. Samples: 3051295540. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-28 09:00:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 09:01:01,690][06909] Updated weights for policy 0, policy_version 192173 (0.0038) [2024-06-28 09:01:03,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3148627968. Throughput: 0: 44105.8. Samples: 3051548640. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-28 09:01:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 09:01:06,002][06909] Updated weights for policy 0, policy_version 192183 (0.0038) [2024-06-28 09:01:08,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 3148840960. Throughput: 0: 44186.7. Samples: 3051822280. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-28 09:01:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:01:09,312][06909] Updated weights for policy 0, policy_version 192193 (0.0027) [2024-06-28 09:01:13,238][06909] Updated weights for policy 0, policy_version 192203 (0.0033) [2024-06-28 09:01:13,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44509.8, 300 sec: 44042.7). Total num frames: 3149086720. Throughput: 0: 44247.5. Samples: 3051953060. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-28 09:01:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:01:16,894][06909] Updated weights for policy 0, policy_version 192213 (0.0039) [2024-06-28 09:01:18,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3149283328. Throughput: 0: 44041.7. Samples: 3052209660. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-28 09:01:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 09:01:21,083][06909] Updated weights for policy 0, policy_version 192223 (0.0029) [2024-06-28 09:01:23,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3149512704. Throughput: 0: 44013.9. Samples: 3052476400. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-28 09:01:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:01:24,192][06909] Updated weights for policy 0, policy_version 192233 (0.0032) [2024-06-28 09:01:28,330][06909] Updated weights for policy 0, policy_version 192243 (0.0036) [2024-06-28 09:01:28,850][06674] Fps is (10 sec: 44237.6, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 3149725696. Throughput: 0: 43921.9. Samples: 3052610620. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-28 09:01:28,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:01:31,422][06909] Updated weights for policy 0, policy_version 192253 (0.0031) [2024-06-28 09:01:33,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.8, 300 sec: 43987.2). Total num frames: 3149938688. Throughput: 0: 43880.5. Samples: 3052867100. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-28 09:01:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:01:35,908][06909] Updated weights for policy 0, policy_version 192263 (0.0032) [2024-06-28 09:01:38,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44236.9, 300 sec: 44042.5). Total num frames: 3150184448. Throughput: 0: 44064.5. Samples: 3053140140. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-28 09:01:38,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 09:01:39,154][06909] Updated weights for policy 0, policy_version 192273 (0.0038) [2024-06-28 09:01:43,017][06909] Updated weights for policy 0, policy_version 192283 (0.0031) [2024-06-28 09:01:43,850][06674] Fps is (10 sec: 47512.0, 60 sec: 44509.7, 300 sec: 44042.7). Total num frames: 3150413824. Throughput: 0: 44171.2. Samples: 3053283260. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-28 09:01:43,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:01:46,383][06909] Updated weights for policy 0, policy_version 192293 (0.0039) [2024-06-28 09:01:48,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3150594048. Throughput: 0: 44185.4. Samples: 3053536980. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-28 09:01:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:01:48,984][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000192298_3150610432.pth... [2024-06-28 09:01:48,989][06887] Signal inference workers to stop experience collection... (43300 times) [2024-06-28 09:01:48,989][06887] Signal inference workers to resume experience collection... (43300 times) [2024-06-28 09:01:49,000][06909] InferenceWorker_p0-w0: stopping experience collection (43300 times) [2024-06-28 09:01:49,001][06909] InferenceWorker_p0-w0: resuming experience collection (43300 times) [2024-06-28 09:01:49,040][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000191655_3140075520.pth [2024-06-28 09:01:50,754][06909] Updated weights for policy 0, policy_version 192303 (0.0028) [2024-06-28 09:01:53,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.5, 300 sec: 44042.4). Total num frames: 3150839808. Throughput: 0: 43934.9. Samples: 3053799360. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-28 09:01:53,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:01:53,988][06909] Updated weights for policy 0, policy_version 192313 (0.0042) [2024-06-28 09:01:58,186][06909] Updated weights for policy 0, policy_version 192323 (0.0033) [2024-06-28 09:01:58,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3151052800. Throughput: 0: 44047.6. Samples: 3053935200. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-28 09:01:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:02:01,283][06909] Updated weights for policy 0, policy_version 192333 (0.0036) [2024-06-28 09:02:03,850][06674] Fps is (10 sec: 40961.0, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 3151249408. Throughput: 0: 43965.9. Samples: 3054188120. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-28 09:02:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:02:05,584][06909] Updated weights for policy 0, policy_version 192343 (0.0036) [2024-06-28 09:02:08,786][06909] Updated weights for policy 0, policy_version 192353 (0.0033) [2024-06-28 09:02:08,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44509.8, 300 sec: 44097.9). Total num frames: 3151511552. Throughput: 0: 43963.8. Samples: 3054454780. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2024-06-28 09:02:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:02:13,253][06909] Updated weights for policy 0, policy_version 192363 (0.0029) [2024-06-28 09:02:13,850][06674] Fps is (10 sec: 47513.0, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3151724544. Throughput: 0: 44023.0. Samples: 3054591660. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2024-06-28 09:02:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:02:16,418][06909] Updated weights for policy 0, policy_version 192373 (0.0037) [2024-06-28 09:02:18,852][06674] Fps is (10 sec: 39313.6, 60 sec: 43689.2, 300 sec: 43875.5). Total num frames: 3151904768. Throughput: 0: 44105.9. Samples: 3054851960. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2024-06-28 09:02:18,853][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:02:20,529][06909] Updated weights for policy 0, policy_version 192383 (0.0029) [2024-06-28 09:02:23,834][06909] Updated weights for policy 0, policy_version 192393 (0.0037) [2024-06-28 09:02:23,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 3152166912. Throughput: 0: 43841.6. Samples: 3055113020. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2024-06-28 09:02:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:02:28,128][06909] Updated weights for policy 0, policy_version 192403 (0.0037) [2024-06-28 09:02:28,850][06674] Fps is (10 sec: 45885.0, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 3152363520. Throughput: 0: 43744.3. Samples: 3055251740. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2024-06-28 09:02:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 09:02:31,551][06909] Updated weights for policy 0, policy_version 192413 (0.0041) [2024-06-28 09:02:33,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 3152576512. Throughput: 0: 43823.5. Samples: 3055509040. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2024-06-28 09:02:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:02:35,492][06909] Updated weights for policy 0, policy_version 192423 (0.0036) [2024-06-28 09:02:38,771][06909] Updated weights for policy 0, policy_version 192433 (0.0035) [2024-06-28 09:02:38,856][06674] Fps is (10 sec: 45847.2, 60 sec: 43959.3, 300 sec: 44097.1). Total num frames: 3152822272. Throughput: 0: 43797.0. Samples: 3055770480. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2024-06-28 09:02:38,857][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:02:43,235][06909] Updated weights for policy 0, policy_version 192443 (0.0033) [2024-06-28 09:02:43,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43417.7, 300 sec: 43931.3). Total num frames: 3153018880. Throughput: 0: 43781.7. Samples: 3055905380. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2024-06-28 09:02:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:02:46,118][06909] Updated weights for policy 0, policy_version 192453 (0.0031) [2024-06-28 09:02:48,850][06674] Fps is (10 sec: 40985.0, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 3153231872. Throughput: 0: 43914.7. Samples: 3056164280. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2024-06-28 09:02:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:02:50,455][06909] Updated weights for policy 0, policy_version 192463 (0.0033) [2024-06-28 09:02:51,187][06887] Signal inference workers to stop experience collection... (43350 times) [2024-06-28 09:02:51,188][06887] Signal inference workers to resume experience collection... (43350 times) [2024-06-28 09:02:51,236][06909] InferenceWorker_p0-w0: stopping experience collection (43350 times) [2024-06-28 09:02:51,236][06909] InferenceWorker_p0-w0: resuming experience collection (43350 times) [2024-06-28 09:02:53,776][06909] Updated weights for policy 0, policy_version 192473 (0.0035) [2024-06-28 09:02:53,852][06674] Fps is (10 sec: 45866.1, 60 sec: 43962.4, 300 sec: 44097.7). Total num frames: 3153477632. Throughput: 0: 44036.3. Samples: 3056436500. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2024-06-28 09:02:53,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:02:57,896][06909] Updated weights for policy 0, policy_version 192483 (0.0021) [2024-06-28 09:02:58,850][06674] Fps is (10 sec: 49152.0, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 3153723392. Throughput: 0: 44102.8. Samples: 3056576280. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2024-06-28 09:02:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:03:01,158][06909] Updated weights for policy 0, policy_version 192493 (0.0033) [2024-06-28 09:03:03,850][06674] Fps is (10 sec: 40968.0, 60 sec: 43963.6, 300 sec: 43875.8). Total num frames: 3153887232. Throughput: 0: 43929.5. Samples: 3056828700. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2024-06-28 09:03:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:03:05,170][06909] Updated weights for policy 0, policy_version 192503 (0.0027) [2024-06-28 09:03:08,586][06909] Updated weights for policy 0, policy_version 192513 (0.0035) [2024-06-28 09:03:08,850][06674] Fps is (10 sec: 40959.4, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 3154132992. Throughput: 0: 44031.6. Samples: 3057094440. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2024-06-28 09:03:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:03:12,715][06909] Updated weights for policy 0, policy_version 192523 (0.0025) [2024-06-28 09:03:13,850][06674] Fps is (10 sec: 47514.7, 60 sec: 43963.9, 300 sec: 43987.2). Total num frames: 3154362368. Throughput: 0: 44014.3. Samples: 3057232380. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2024-06-28 09:03:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:03:15,704][06909] Updated weights for policy 0, policy_version 192533 (0.0026) [2024-06-28 09:03:18,852][06674] Fps is (10 sec: 40951.8, 60 sec: 43963.7, 300 sec: 43875.5). Total num frames: 3154542592. Throughput: 0: 44105.1. Samples: 3057493860. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2024-06-28 09:03:18,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:03:20,167][06909] Updated weights for policy 0, policy_version 192543 (0.0027) [2024-06-28 09:03:23,063][06909] Updated weights for policy 0, policy_version 192553 (0.0041) [2024-06-28 09:03:23,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.9, 300 sec: 44153.8). Total num frames: 3154804736. Throughput: 0: 44218.5. Samples: 3057760040. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2024-06-28 09:03:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:03:27,439][06909] Updated weights for policy 0, policy_version 192563 (0.0034) [2024-06-28 09:03:28,850][06674] Fps is (10 sec: 49162.1, 60 sec: 44509.8, 300 sec: 43986.9). Total num frames: 3155034112. Throughput: 0: 44389.8. Samples: 3057902920. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2024-06-28 09:03:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 09:03:30,813][06909] Updated weights for policy 0, policy_version 192573 (0.0026) [2024-06-28 09:03:33,850][06674] Fps is (10 sec: 40959.3, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 3155214336. Throughput: 0: 44309.6. Samples: 3058158220. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2024-06-28 09:03:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 09:03:35,032][06909] Updated weights for policy 0, policy_version 192583 (0.0026) [2024-06-28 09:03:38,101][06909] Updated weights for policy 0, policy_version 192593 (0.0033) [2024-06-28 09:03:38,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43968.2, 300 sec: 44098.0). Total num frames: 3155460096. Throughput: 0: 44100.3. Samples: 3058420920. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2024-06-28 09:03:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:03:42,201][06909] Updated weights for policy 0, policy_version 192603 (0.0032) [2024-06-28 09:03:43,852][06674] Fps is (10 sec: 47504.4, 60 sec: 44508.4, 300 sec: 43986.6). Total num frames: 3155689472. Throughput: 0: 44152.6. Samples: 3058563240. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2024-06-28 09:03:43,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:03:45,206][06909] Updated weights for policy 0, policy_version 192613 (0.0032) [2024-06-28 09:03:48,850][06674] Fps is (10 sec: 39321.5, 60 sec: 43690.6, 300 sec: 43820.3). Total num frames: 3155853312. Throughput: 0: 44258.3. Samples: 3058820320. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2024-06-28 09:03:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:03:48,999][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000192619_3155869696.pth... [2024-06-28 09:03:49,048][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000191976_3145334784.pth [2024-06-28 09:03:49,898][06909] Updated weights for policy 0, policy_version 192623 (0.0035) [2024-06-28 09:03:52,116][06887] Signal inference workers to stop experience collection... (43400 times) [2024-06-28 09:03:52,117][06887] Signal inference workers to resume experience collection... (43400 times) [2024-06-28 09:03:52,160][06909] InferenceWorker_p0-w0: stopping experience collection (43400 times) [2024-06-28 09:03:52,160][06909] InferenceWorker_p0-w0: resuming experience collection (43400 times) [2024-06-28 09:03:52,977][06909] Updated weights for policy 0, policy_version 192633 (0.0038) [2024-06-28 09:03:53,850][06674] Fps is (10 sec: 45884.5, 60 sec: 44511.4, 300 sec: 44264.9). Total num frames: 3156148224. Throughput: 0: 44107.2. Samples: 3059079260. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2024-06-28 09:03:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:03:57,363][06909] Updated weights for policy 0, policy_version 192643 (0.0031) [2024-06-28 09:03:58,856][06674] Fps is (10 sec: 49122.1, 60 sec: 43686.2, 300 sec: 44041.5). Total num frames: 3156344832. Throughput: 0: 44284.5. Samples: 3059225460. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2024-06-28 09:03:58,857][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:04:00,442][06909] Updated weights for policy 0, policy_version 192653 (0.0041) [2024-06-28 09:04:03,850][06674] Fps is (10 sec: 39321.9, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 3156541440. Throughput: 0: 44251.0. Samples: 3059485060. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2024-06-28 09:04:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:04:04,642][06909] Updated weights for policy 0, policy_version 192663 (0.0039) [2024-06-28 09:04:07,898][06909] Updated weights for policy 0, policy_version 192673 (0.0037) [2024-06-28 09:04:08,850][06674] Fps is (10 sec: 45903.1, 60 sec: 44509.9, 300 sec: 44209.1). Total num frames: 3156803584. Throughput: 0: 44012.4. Samples: 3059740600. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2024-06-28 09:04:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:04:12,178][06909] Updated weights for policy 0, policy_version 192683 (0.0022) [2024-06-28 09:04:13,850][06674] Fps is (10 sec: 47513.7, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3157016576. Throughput: 0: 44081.0. Samples: 3059886560. Policy #0 lag: (min: 0.0, avg: 8.4, max: 21.0) [2024-06-28 09:04:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:04:15,202][06909] Updated weights for policy 0, policy_version 192693 (0.0029) [2024-06-28 09:04:18,850][06674] Fps is (10 sec: 39320.9, 60 sec: 44238.2, 300 sec: 43932.2). Total num frames: 3157196800. Throughput: 0: 43995.4. Samples: 3060138020. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-28 09:04:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:04:19,786][06909] Updated weights for policy 0, policy_version 192703 (0.0032) [2024-06-28 09:04:22,794][06909] Updated weights for policy 0, policy_version 192713 (0.0034) [2024-06-28 09:04:23,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 3157442560. Throughput: 0: 43759.6. Samples: 3060390100. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-28 09:04:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:04:27,259][06909] Updated weights for policy 0, policy_version 192723 (0.0027) [2024-06-28 09:04:28,850][06674] Fps is (10 sec: 47514.9, 60 sec: 43963.8, 300 sec: 44042.7). Total num frames: 3157671936. Throughput: 0: 43892.3. Samples: 3060538300. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-28 09:04:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:04:30,470][06909] Updated weights for policy 0, policy_version 192733 (0.0029) [2024-06-28 09:04:33,852][06674] Fps is (10 sec: 40951.8, 60 sec: 43962.3, 300 sec: 43986.6). Total num frames: 3157852160. Throughput: 0: 43919.4. Samples: 3060796780. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-28 09:04:33,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:04:34,567][06909] Updated weights for policy 0, policy_version 192743 (0.0030) [2024-06-28 09:04:37,641][06909] Updated weights for policy 0, policy_version 192753 (0.0028) [2024-06-28 09:04:38,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 3158114304. Throughput: 0: 44009.4. Samples: 3061059680. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-28 09:04:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:04:42,015][06909] Updated weights for policy 0, policy_version 192763 (0.0035) [2024-06-28 09:04:43,850][06674] Fps is (10 sec: 47523.2, 60 sec: 43965.3, 300 sec: 44042.7). Total num frames: 3158327296. Throughput: 0: 44006.4. Samples: 3061205480. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-28 09:04:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:04:45,113][06909] Updated weights for policy 0, policy_version 192773 (0.0031) [2024-06-28 09:04:48,850][06674] Fps is (10 sec: 39321.4, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3158507520. Throughput: 0: 43945.3. Samples: 3061462600. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-28 09:04:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:04:49,475][06909] Updated weights for policy 0, policy_version 192783 (0.0032) [2024-06-28 09:04:52,674][06909] Updated weights for policy 0, policy_version 192793 (0.0036) [2024-06-28 09:04:53,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43690.6, 300 sec: 44153.5). Total num frames: 3158769664. Throughput: 0: 43896.4. Samples: 3061715940. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-28 09:04:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:04:56,936][06909] Updated weights for policy 0, policy_version 192803 (0.0028) [2024-06-28 09:04:58,850][06674] Fps is (10 sec: 47513.9, 60 sec: 43968.2, 300 sec: 44042.4). Total num frames: 3158982656. Throughput: 0: 43956.9. Samples: 3061864620. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-28 09:04:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:04:59,779][06909] Updated weights for policy 0, policy_version 192813 (0.0033) [2024-06-28 09:05:03,850][06674] Fps is (10 sec: 39321.9, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 3159162880. Throughput: 0: 44142.9. Samples: 3062124440. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-28 09:05:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:05:04,403][06909] Updated weights for policy 0, policy_version 192823 (0.0043) [2024-06-28 09:05:05,241][06887] Signal inference workers to stop experience collection... (43450 times) [2024-06-28 09:05:05,242][06887] Signal inference workers to resume experience collection... (43450 times) [2024-06-28 09:05:05,260][06909] InferenceWorker_p0-w0: stopping experience collection (43450 times) [2024-06-28 09:05:05,261][06909] InferenceWorker_p0-w0: resuming experience collection (43450 times) [2024-06-28 09:05:07,088][06909] Updated weights for policy 0, policy_version 192833 (0.0034) [2024-06-28 09:05:08,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43690.6, 300 sec: 44097.9). Total num frames: 3159425024. Throughput: 0: 44289.2. Samples: 3062383120. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-28 09:05:08,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 09:05:11,893][06909] Updated weights for policy 0, policy_version 192843 (0.0029) [2024-06-28 09:05:13,850][06674] Fps is (10 sec: 49152.2, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 3159654400. Throughput: 0: 44297.8. Samples: 3062531700. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-28 09:05:13,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 09:05:14,525][06909] Updated weights for policy 0, policy_version 192853 (0.0034) [2024-06-28 09:05:18,850][06674] Fps is (10 sec: 42598.9, 60 sec: 44237.0, 300 sec: 43986.9). Total num frames: 3159851008. Throughput: 0: 44454.4. Samples: 3062797140. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-28 09:05:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:05:19,069][06909] Updated weights for policy 0, policy_version 192863 (0.0034) [2024-06-28 09:05:21,996][06909] Updated weights for policy 0, policy_version 192873 (0.0042) [2024-06-28 09:05:23,852][06674] Fps is (10 sec: 44227.2, 60 sec: 44235.2, 300 sec: 44153.2). Total num frames: 3160096768. Throughput: 0: 44305.4. Samples: 3063053520. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-28 09:05:23,853][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:05:26,646][06909] Updated weights for policy 0, policy_version 192883 (0.0029) [2024-06-28 09:05:28,852][06674] Fps is (10 sec: 47503.8, 60 sec: 44235.2, 300 sec: 44153.2). Total num frames: 3160326144. Throughput: 0: 44173.9. Samples: 3063193400. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-28 09:05:28,863][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:05:29,403][06909] Updated weights for policy 0, policy_version 192893 (0.0032) [2024-06-28 09:05:33,850][06674] Fps is (10 sec: 40968.5, 60 sec: 44238.2, 300 sec: 43986.9). Total num frames: 3160506368. Throughput: 0: 44350.6. Samples: 3063458380. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-28 09:05:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:05:33,948][06909] Updated weights for policy 0, policy_version 192903 (0.0036) [2024-06-28 09:05:36,775][06909] Updated weights for policy 0, policy_version 192913 (0.0033) [2024-06-28 09:05:38,850][06674] Fps is (10 sec: 40968.1, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 3160735744. Throughput: 0: 44381.3. Samples: 3063713100. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-28 09:05:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:05:41,449][06909] Updated weights for policy 0, policy_version 192923 (0.0028) [2024-06-28 09:05:43,856][06674] Fps is (10 sec: 47485.1, 60 sec: 44232.3, 300 sec: 44097.1). Total num frames: 3160981504. Throughput: 0: 44066.9. Samples: 3063847900. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-28 09:05:43,857][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:05:44,270][06909] Updated weights for policy 0, policy_version 192933 (0.0041) [2024-06-28 09:05:48,753][06909] Updated weights for policy 0, policy_version 192943 (0.0041) [2024-06-28 09:05:48,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44509.8, 300 sec: 43931.3). Total num frames: 3161178112. Throughput: 0: 44162.5. Samples: 3064111760. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-28 09:05:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:05:48,879][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000192943_3161178112.pth... [2024-06-28 09:05:48,929][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000192298_3150610432.pth [2024-06-28 09:05:51,681][06909] Updated weights for policy 0, policy_version 192953 (0.0028) [2024-06-28 09:05:53,850][06674] Fps is (10 sec: 42624.1, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 3161407488. Throughput: 0: 44277.4. Samples: 3064375600. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-28 09:05:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:05:56,435][06909] Updated weights for policy 0, policy_version 192963 (0.0028) [2024-06-28 09:05:58,850][06674] Fps is (10 sec: 45876.0, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 3161636864. Throughput: 0: 43983.6. Samples: 3064510960. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-28 09:05:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 09:05:59,437][06909] Updated weights for policy 0, policy_version 192973 (0.0032) [2024-06-28 09:06:03,758][06909] Updated weights for policy 0, policy_version 192983 (0.0032) [2024-06-28 09:06:03,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 3161833472. Throughput: 0: 43926.2. Samples: 3064773820. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-28 09:06:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:06:06,844][06909] Updated weights for policy 0, policy_version 192993 (0.0040) [2024-06-28 09:06:08,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3162062848. Throughput: 0: 43903.4. Samples: 3065029080. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-28 09:06:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:06:11,154][06909] Updated weights for policy 0, policy_version 193003 (0.0036) [2024-06-28 09:06:13,850][06674] Fps is (10 sec: 47513.7, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 3162308608. Throughput: 0: 43922.5. Samples: 3065169820. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-28 09:06:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:06:14,170][06909] Updated weights for policy 0, policy_version 193013 (0.0034) [2024-06-28 09:06:18,075][06887] Signal inference workers to stop experience collection... (43500 times) [2024-06-28 09:06:18,132][06909] InferenceWorker_p0-w0: stopping experience collection (43500 times) [2024-06-28 09:06:18,134][06887] Signal inference workers to resume experience collection... (43500 times) [2024-06-28 09:06:18,143][06909] InferenceWorker_p0-w0: resuming experience collection (43500 times) [2024-06-28 09:06:18,438][06909] Updated weights for policy 0, policy_version 193023 (0.0021) [2024-06-28 09:06:18,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3162505216. Throughput: 0: 43897.4. Samples: 3065433760. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-28 09:06:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:06:21,398][06909] Updated weights for policy 0, policy_version 193033 (0.0022) [2024-06-28 09:06:23,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43692.2, 300 sec: 44042.4). Total num frames: 3162718208. Throughput: 0: 44190.4. Samples: 3065701660. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-28 09:06:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:06:25,685][06909] Updated weights for policy 0, policy_version 193043 (0.0038) [2024-06-28 09:06:28,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43965.3, 300 sec: 44153.5). Total num frames: 3162963968. Throughput: 0: 44102.4. Samples: 3065832240. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-28 09:06:28,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:06:28,907][06909] Updated weights for policy 0, policy_version 193053 (0.0036) [2024-06-28 09:06:33,122][06909] Updated weights for policy 0, policy_version 193063 (0.0033) [2024-06-28 09:06:33,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 3163176960. Throughput: 0: 44169.1. Samples: 3066099360. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 09:06:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:06:36,317][06909] Updated weights for policy 0, policy_version 193073 (0.0040) [2024-06-28 09:06:38,851][06674] Fps is (10 sec: 40956.9, 60 sec: 43963.3, 300 sec: 43931.3). Total num frames: 3163373568. Throughput: 0: 44026.9. Samples: 3066356840. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 09:06:38,851][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 09:06:40,879][06909] Updated weights for policy 0, policy_version 193083 (0.0040) [2024-06-28 09:06:43,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43968.2, 300 sec: 44153.5). Total num frames: 3163619328. Throughput: 0: 43867.5. Samples: 3066485000. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 09:06:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:06:44,167][06909] Updated weights for policy 0, policy_version 193093 (0.0021) [2024-06-28 09:06:48,497][06909] Updated weights for policy 0, policy_version 193103 (0.0031) [2024-06-28 09:06:48,850][06674] Fps is (10 sec: 45878.6, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 3163832320. Throughput: 0: 43974.7. Samples: 3066752680. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 09:06:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:06:51,477][06909] Updated weights for policy 0, policy_version 193113 (0.0034) [2024-06-28 09:06:53,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3164028928. Throughput: 0: 44056.0. Samples: 3067011600. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 09:06:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:06:55,872][06909] Updated weights for policy 0, policy_version 193123 (0.0033) [2024-06-28 09:06:58,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 3164274688. Throughput: 0: 43687.1. Samples: 3067135740. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 09:06:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:06:59,150][06909] Updated weights for policy 0, policy_version 193133 (0.0024) [2024-06-28 09:07:03,085][06909] Updated weights for policy 0, policy_version 193143 (0.0042) [2024-06-28 09:07:03,852][06674] Fps is (10 sec: 45865.7, 60 sec: 44235.3, 300 sec: 43986.6). Total num frames: 3164487680. Throughput: 0: 44097.9. Samples: 3067418260. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 09:07:03,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:07:06,381][06909] Updated weights for policy 0, policy_version 193153 (0.0034) [2024-06-28 09:07:08,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3164700672. Throughput: 0: 43925.3. Samples: 3067678300. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 09:07:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:07:10,539][06909] Updated weights for policy 0, policy_version 193163 (0.0037) [2024-06-28 09:07:13,654][06909] Updated weights for policy 0, policy_version 193173 (0.0038) [2024-06-28 09:07:13,850][06674] Fps is (10 sec: 45884.5, 60 sec: 43963.7, 300 sec: 44209.3). Total num frames: 3164946432. Throughput: 0: 43933.7. Samples: 3067809260. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 09:07:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:07:17,688][06909] Updated weights for policy 0, policy_version 193183 (0.0038) [2024-06-28 09:07:18,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44509.8, 300 sec: 44098.0). Total num frames: 3165175808. Throughput: 0: 43947.0. Samples: 3068076980. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 09:07:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:07:21,053][06909] Updated weights for policy 0, policy_version 193193 (0.0031) [2024-06-28 09:07:23,852][06674] Fps is (10 sec: 40951.7, 60 sec: 43962.2, 300 sec: 44042.1). Total num frames: 3165356032. Throughput: 0: 44130.2. Samples: 3068342760. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 09:07:23,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:07:25,537][06909] Updated weights for policy 0, policy_version 193203 (0.0036) [2024-06-28 09:07:28,559][06909] Updated weights for policy 0, policy_version 193213 (0.0028) [2024-06-28 09:07:28,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 3165618176. Throughput: 0: 44054.7. Samples: 3068467460. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 09:07:28,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:07:32,735][06887] Signal inference workers to stop experience collection... (43550 times) [2024-06-28 09:07:32,737][06887] Signal inference workers to resume experience collection... (43550 times) [2024-06-28 09:07:32,745][06909] Updated weights for policy 0, policy_version 193223 (0.0027) [2024-06-28 09:07:32,756][06909] InferenceWorker_p0-w0: stopping experience collection (43550 times) [2024-06-28 09:07:32,756][06909] InferenceWorker_p0-w0: resuming experience collection (43550 times) [2024-06-28 09:07:33,850][06674] Fps is (10 sec: 47523.2, 60 sec: 44236.7, 300 sec: 44098.8). Total num frames: 3165831168. Throughput: 0: 44325.7. Samples: 3068747340. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 09:07:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:07:35,659][06909] Updated weights for policy 0, policy_version 193233 (0.0025) [2024-06-28 09:07:38,850][06674] Fps is (10 sec: 40959.9, 60 sec: 44237.3, 300 sec: 44098.0). Total num frames: 3166027776. Throughput: 0: 44472.0. Samples: 3069012840. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 09:07:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:07:39,942][06909] Updated weights for policy 0, policy_version 193243 (0.0037) [2024-06-28 09:07:42,942][06909] Updated weights for policy 0, policy_version 193253 (0.0027) [2024-06-28 09:07:43,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.7, 300 sec: 44209.0). Total num frames: 3166273536. Throughput: 0: 44494.6. Samples: 3069138000. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 09:07:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:07:47,698][06909] Updated weights for policy 0, policy_version 193263 (0.0036) [2024-06-28 09:07:48,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.8, 300 sec: 44098.3). Total num frames: 3166486528. Throughput: 0: 44142.1. Samples: 3069404560. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 09:07:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:07:48,975][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000193268_3166502912.pth... [2024-06-28 09:07:49,027][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000192619_3155869696.pth [2024-06-28 09:07:50,659][06909] Updated weights for policy 0, policy_version 193273 (0.0026) [2024-06-28 09:07:53,850][06674] Fps is (10 sec: 40960.6, 60 sec: 44236.9, 300 sec: 43931.3). Total num frames: 3166683136. Throughput: 0: 44192.5. Samples: 3069666960. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 09:07:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:07:55,009][06909] Updated weights for policy 0, policy_version 193283 (0.0026) [2024-06-28 09:07:57,874][06909] Updated weights for policy 0, policy_version 193293 (0.0038) [2024-06-28 09:07:58,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 3166928896. Throughput: 0: 44116.9. Samples: 3069794520. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 09:07:58,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 09:08:02,595][06909] Updated weights for policy 0, policy_version 193303 (0.0033) [2024-06-28 09:08:03,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44238.4, 300 sec: 44098.0). Total num frames: 3167141888. Throughput: 0: 44112.6. Samples: 3070062040. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 09:08:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:08:05,391][06909] Updated weights for policy 0, policy_version 193313 (0.0037) [2024-06-28 09:08:08,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3167338496. Throughput: 0: 44079.8. Samples: 3070326260. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 09:08:08,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:08:09,903][06909] Updated weights for policy 0, policy_version 193323 (0.0028) [2024-06-28 09:08:12,682][06909] Updated weights for policy 0, policy_version 193333 (0.0041) [2024-06-28 09:08:13,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43963.7, 300 sec: 44209.3). Total num frames: 3167584256. Throughput: 0: 44092.4. Samples: 3070451620. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 09:08:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:08:17,138][06909] Updated weights for policy 0, policy_version 193343 (0.0024) [2024-06-28 09:08:18,850][06674] Fps is (10 sec: 47513.6, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 3167813632. Throughput: 0: 43980.9. Samples: 3070726480. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 09:08:18,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 09:08:19,905][06909] Updated weights for policy 0, policy_version 193353 (0.0039) [2024-06-28 09:08:23,850][06674] Fps is (10 sec: 42598.7, 60 sec: 44238.4, 300 sec: 43986.9). Total num frames: 3168010240. Throughput: 0: 43903.6. Samples: 3070988500. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 09:08:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:08:24,702][06909] Updated weights for policy 0, policy_version 193363 (0.0021) [2024-06-28 09:08:27,771][06909] Updated weights for policy 0, policy_version 193373 (0.0029) [2024-06-28 09:08:28,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.6, 300 sec: 44153.5). Total num frames: 3168239616. Throughput: 0: 43998.7. Samples: 3071117940. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 09:08:28,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:08:31,834][06887] Signal inference workers to stop experience collection... (43600 times) [2024-06-28 09:08:31,834][06887] Signal inference workers to resume experience collection... (43600 times) [2024-06-28 09:08:31,850][06909] InferenceWorker_p0-w0: stopping experience collection (43600 times) [2024-06-28 09:08:31,850][06909] InferenceWorker_p0-w0: resuming experience collection (43600 times) [2024-06-28 09:08:31,986][06909] Updated weights for policy 0, policy_version 193383 (0.0040) [2024-06-28 09:08:33,852][06674] Fps is (10 sec: 47503.9, 60 sec: 44235.4, 300 sec: 44153.2). Total num frames: 3168485376. Throughput: 0: 44047.3. Samples: 3071386780. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 09:08:33,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:08:35,081][06909] Updated weights for policy 0, policy_version 193393 (0.0031) [2024-06-28 09:08:38,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.8, 300 sec: 43987.2). Total num frames: 3168665600. Throughput: 0: 44232.5. Samples: 3071657420. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 09:08:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:08:39,301][06909] Updated weights for policy 0, policy_version 193403 (0.0039) [2024-06-28 09:08:42,739][06909] Updated weights for policy 0, policy_version 193413 (0.0033) [2024-06-28 09:08:43,850][06674] Fps is (10 sec: 40968.2, 60 sec: 43690.7, 300 sec: 44209.0). Total num frames: 3168894976. Throughput: 0: 44070.6. Samples: 3071777700. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 09:08:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:08:46,769][06909] Updated weights for policy 0, policy_version 193423 (0.0030) [2024-06-28 09:08:48,852][06674] Fps is (10 sec: 47503.3, 60 sec: 44235.2, 300 sec: 44042.1). Total num frames: 3169140736. Throughput: 0: 44097.8. Samples: 3072046540. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 09:08:48,853][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 09:08:49,983][06909] Updated weights for policy 0, policy_version 193433 (0.0027) [2024-06-28 09:08:53,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.7, 300 sec: 44043.3). Total num frames: 3169337344. Throughput: 0: 44287.9. Samples: 3072319220. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 09:08:53,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:08:54,356][06909] Updated weights for policy 0, policy_version 193443 (0.0033) [2024-06-28 09:08:57,058][06909] Updated weights for policy 0, policy_version 193453 (0.0031) [2024-06-28 09:08:58,850][06674] Fps is (10 sec: 40968.8, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 3169550336. Throughput: 0: 44296.1. Samples: 3072444940. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 09:08:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:09:01,502][06909] Updated weights for policy 0, policy_version 193463 (0.0035) [2024-06-28 09:09:03,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44509.7, 300 sec: 44097.9). Total num frames: 3169812480. Throughput: 0: 44095.5. Samples: 3072710780. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 09:09:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:09:04,745][06909] Updated weights for policy 0, policy_version 193473 (0.0053) [2024-06-28 09:09:08,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 3169992704. Throughput: 0: 44241.0. Samples: 3072979340. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 09:09:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:09:09,152][06909] Updated weights for policy 0, policy_version 193483 (0.0031) [2024-06-28 09:09:12,373][06909] Updated weights for policy 0, policy_version 193493 (0.0028) [2024-06-28 09:09:13,850][06674] Fps is (10 sec: 39321.7, 60 sec: 43690.6, 300 sec: 44098.0). Total num frames: 3170205696. Throughput: 0: 44095.9. Samples: 3073102260. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 09:09:13,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 09:09:16,380][06909] Updated weights for policy 0, policy_version 193503 (0.0033) [2024-06-28 09:09:18,850][06674] Fps is (10 sec: 47513.4, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 3170467840. Throughput: 0: 44067.4. Samples: 3073369720. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 09:09:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:09:19,791][06909] Updated weights for policy 0, policy_version 193513 (0.0033) [2024-06-28 09:09:23,850][06674] Fps is (10 sec: 45876.1, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 3170664448. Throughput: 0: 44169.4. Samples: 3073645040. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 09:09:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:09:23,914][06909] Updated weights for policy 0, policy_version 193523 (0.0045) [2024-06-28 09:09:27,093][06909] Updated weights for policy 0, policy_version 193533 (0.0028) [2024-06-28 09:09:28,851][06674] Fps is (10 sec: 40955.1, 60 sec: 43962.9, 300 sec: 44153.6). Total num frames: 3170877440. Throughput: 0: 44182.9. Samples: 3073765980. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 09:09:28,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:09:31,246][06909] Updated weights for policy 0, policy_version 193543 (0.0030) [2024-06-28 09:09:33,850][06674] Fps is (10 sec: 47512.5, 60 sec: 44238.2, 300 sec: 44153.5). Total num frames: 3171139584. Throughput: 0: 44306.8. Samples: 3074040260. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 09:09:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:09:34,359][06909] Updated weights for policy 0, policy_version 193553 (0.0032) [2024-06-28 09:09:38,666][06909] Updated weights for policy 0, policy_version 193563 (0.0053) [2024-06-28 09:09:38,850][06674] Fps is (10 sec: 47519.2, 60 sec: 44782.9, 300 sec: 44153.5). Total num frames: 3171352576. Throughput: 0: 44120.6. Samples: 3074304640. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 09:09:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:09:42,074][06909] Updated weights for policy 0, policy_version 193573 (0.0032) [2024-06-28 09:09:43,850][06674] Fps is (10 sec: 39322.1, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 3171532800. Throughput: 0: 44165.7. Samples: 3074432400. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 09:09:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:09:46,421][06909] Updated weights for policy 0, policy_version 193583 (0.0038) [2024-06-28 09:09:48,850][06674] Fps is (10 sec: 44236.1, 60 sec: 44238.3, 300 sec: 44153.5). Total num frames: 3171794944. Throughput: 0: 43970.7. Samples: 3074689460. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 09:09:48,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:09:48,995][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000193592_3171811328.pth... [2024-06-28 09:09:49,049][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000192943_3161178112.pth [2024-06-28 09:09:49,443][06909] Updated weights for policy 0, policy_version 193593 (0.0029) [2024-06-28 09:09:53,724][06909] Updated weights for policy 0, policy_version 193603 (0.0031) [2024-06-28 09:09:53,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44236.9, 300 sec: 44097.9). Total num frames: 3171991552. Throughput: 0: 44002.5. Samples: 3074959460. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 09:09:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:09:53,923][06887] Signal inference workers to stop experience collection... (43650 times) [2024-06-28 09:09:53,970][06909] InferenceWorker_p0-w0: stopping experience collection (43650 times) [2024-06-28 09:09:54,035][06887] Signal inference workers to resume experience collection... (43650 times) [2024-06-28 09:09:54,036][06909] InferenceWorker_p0-w0: resuming experience collection (43650 times) [2024-06-28 09:09:57,183][06909] Updated weights for policy 0, policy_version 193613 (0.0031) [2024-06-28 09:09:58,850][06674] Fps is (10 sec: 40960.7, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 3172204544. Throughput: 0: 44139.7. Samples: 3075088540. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 09:09:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 09:10:01,215][06909] Updated weights for policy 0, policy_version 193623 (0.0025) [2024-06-28 09:10:03,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 3172450304. Throughput: 0: 44135.9. Samples: 3075355840. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 09:10:03,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 09:10:04,509][06909] Updated weights for policy 0, policy_version 193633 (0.0035) [2024-06-28 09:10:08,413][06909] Updated weights for policy 0, policy_version 193643 (0.0027) [2024-06-28 09:10:08,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44509.8, 300 sec: 44097.9). Total num frames: 3172663296. Throughput: 0: 43969.2. Samples: 3075623660. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 09:10:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 09:10:11,827][06909] Updated weights for policy 0, policy_version 193653 (0.0034) [2024-06-28 09:10:13,850][06674] Fps is (10 sec: 40960.2, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 3172859904. Throughput: 0: 44115.8. Samples: 3075751140. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 09:10:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:10:16,182][06909] Updated weights for policy 0, policy_version 193663 (0.0022) [2024-06-28 09:10:18,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.7, 300 sec: 44098.3). Total num frames: 3173105664. Throughput: 0: 43780.6. Samples: 3076010380. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 09:10:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:10:19,294][06909] Updated weights for policy 0, policy_version 193673 (0.0041) [2024-06-28 09:10:23,620][06909] Updated weights for policy 0, policy_version 193683 (0.0030) [2024-06-28 09:10:23,856][06674] Fps is (10 sec: 44210.4, 60 sec: 43959.3, 300 sec: 43986.3). Total num frames: 3173302272. Throughput: 0: 43802.1. Samples: 3076276000. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 09:10:23,856][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:10:27,113][06909] Updated weights for policy 0, policy_version 193693 (0.0022) [2024-06-28 09:10:28,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43964.6, 300 sec: 44098.0). Total num frames: 3173515264. Throughput: 0: 43729.4. Samples: 3076400220. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 09:10:28,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:10:30,999][06909] Updated weights for policy 0, policy_version 193703 (0.0038) [2024-06-28 09:10:33,850][06674] Fps is (10 sec: 47542.1, 60 sec: 43963.8, 300 sec: 44209.0). Total num frames: 3173777408. Throughput: 0: 44024.1. Samples: 3076670540. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 09:10:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:10:34,432][06909] Updated weights for policy 0, policy_version 193713 (0.0035) [2024-06-28 09:10:38,152][06909] Updated weights for policy 0, policy_version 193723 (0.0026) [2024-06-28 09:10:38,850][06674] Fps is (10 sec: 47513.0, 60 sec: 43963.7, 300 sec: 44098.8). Total num frames: 3173990400. Throughput: 0: 44093.3. Samples: 3076943660. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 09:10:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:10:42,108][06909] Updated weights for policy 0, policy_version 193733 (0.0034) [2024-06-28 09:10:43,850][06674] Fps is (10 sec: 40959.9, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 3174187008. Throughput: 0: 44126.2. Samples: 3077074220. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 09:10:43,850][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 09:10:45,648][06909] Updated weights for policy 0, policy_version 193743 (0.0032) [2024-06-28 09:10:48,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 3174432768. Throughput: 0: 44089.4. Samples: 3077339860. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 09:10:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:10:49,208][06909] Updated weights for policy 0, policy_version 193753 (0.0037) [2024-06-28 09:10:53,189][06909] Updated weights for policy 0, policy_version 193763 (0.0034) [2024-06-28 09:10:53,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 3174645760. Throughput: 0: 44010.6. Samples: 3077604140. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 09:10:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:10:56,978][06909] Updated weights for policy 0, policy_version 193773 (0.0033) [2024-06-28 09:10:58,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 3174858752. Throughput: 0: 43997.8. Samples: 3077731040. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 09:10:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:11:00,630][06909] Updated weights for policy 0, policy_version 193783 (0.0033) [2024-06-28 09:11:03,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 3175088128. Throughput: 0: 44194.7. Samples: 3077999140. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 09:11:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:11:04,294][06909] Updated weights for policy 0, policy_version 193793 (0.0035) [2024-06-28 09:11:08,060][06909] Updated weights for policy 0, policy_version 193803 (0.0037) [2024-06-28 09:11:08,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 3175317504. Throughput: 0: 44288.5. Samples: 3078268720. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 09:11:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:11:11,530][06909] Updated weights for policy 0, policy_version 193813 (0.0040) [2024-06-28 09:11:13,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 3175514112. Throughput: 0: 44481.8. Samples: 3078401900. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 09:11:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:11:15,300][06909] Updated weights for policy 0, policy_version 193823 (0.0026) [2024-06-28 09:11:18,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43963.6, 300 sec: 44153.5). Total num frames: 3175743488. Throughput: 0: 44273.6. Samples: 3078662860. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 09:11:18,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 09:11:19,129][06909] Updated weights for policy 0, policy_version 193833 (0.0039) [2024-06-28 09:11:23,024][06909] Updated weights for policy 0, policy_version 193843 (0.0035) [2024-06-28 09:11:23,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44241.3, 300 sec: 44042.4). Total num frames: 3175956480. Throughput: 0: 44027.3. Samples: 3078924880. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 09:11:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 09:11:26,831][06909] Updated weights for policy 0, policy_version 193853 (0.0037) [2024-06-28 09:11:28,850][06674] Fps is (10 sec: 42598.7, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 3176169472. Throughput: 0: 44021.7. Samples: 3079055200. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 09:11:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:11:30,510][06909] Updated weights for policy 0, policy_version 193863 (0.0039) [2024-06-28 09:11:33,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.7, 300 sec: 44153.6). Total num frames: 3176398848. Throughput: 0: 43873.4. Samples: 3079314160. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 09:11:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:11:34,088][06909] Updated weights for policy 0, policy_version 193873 (0.0030) [2024-06-28 09:11:36,922][06887] Signal inference workers to stop experience collection... (43700 times) [2024-06-28 09:11:36,922][06887] Signal inference workers to resume experience collection... (43700 times) [2024-06-28 09:11:36,932][06909] InferenceWorker_p0-w0: stopping experience collection (43700 times) [2024-06-28 09:11:36,932][06909] InferenceWorker_p0-w0: resuming experience collection (43700 times) [2024-06-28 09:11:37,779][06909] Updated weights for policy 0, policy_version 193883 (0.0027) [2024-06-28 09:11:38,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 3176628224. Throughput: 0: 43911.3. Samples: 3079580140. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 09:11:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:11:41,343][06909] Updated weights for policy 0, policy_version 193893 (0.0034) [2024-06-28 09:11:43,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 3176841216. Throughput: 0: 44129.8. Samples: 3079716880. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 09:11:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:11:45,023][06909] Updated weights for policy 0, policy_version 193903 (0.0026) [2024-06-28 09:11:48,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 3177054208. Throughput: 0: 44120.4. Samples: 3079984560. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 09:11:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:11:49,010][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000193913_3177070592.pth... [2024-06-28 09:11:49,022][06909] Updated weights for policy 0, policy_version 193913 (0.0027) [2024-06-28 09:11:49,054][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000193268_3166502912.pth [2024-06-28 09:11:52,633][06909] Updated weights for policy 0, policy_version 193923 (0.0044) [2024-06-28 09:11:53,851][06674] Fps is (10 sec: 42594.4, 60 sec: 43690.0, 300 sec: 44042.3). Total num frames: 3177267200. Throughput: 0: 43905.3. Samples: 3080244500. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 09:11:53,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:11:56,270][06909] Updated weights for policy 0, policy_version 193933 (0.0042) [2024-06-28 09:11:58,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.8, 300 sec: 44098.3). Total num frames: 3177496576. Throughput: 0: 43841.3. Samples: 3080374760. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 09:11:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:12:00,063][06909] Updated weights for policy 0, policy_version 193943 (0.0030) [2024-06-28 09:12:03,850][06674] Fps is (10 sec: 44240.8, 60 sec: 43690.6, 300 sec: 44097.9). Total num frames: 3177709568. Throughput: 0: 43938.3. Samples: 3080640080. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 09:12:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:12:04,306][06909] Updated weights for policy 0, policy_version 193953 (0.0031) [2024-06-28 09:12:07,384][06909] Updated weights for policy 0, policy_version 193963 (0.0027) [2024-06-28 09:12:08,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 3177955328. Throughput: 0: 43996.7. Samples: 3080904740. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 09:12:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 09:12:11,525][06909] Updated weights for policy 0, policy_version 193973 (0.0022) [2024-06-28 09:12:13,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 3178168320. Throughput: 0: 44147.5. Samples: 3081041840. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 09:12:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:12:14,814][06909] Updated weights for policy 0, policy_version 193983 (0.0029) [2024-06-28 09:12:18,651][06909] Updated weights for policy 0, policy_version 193993 (0.0023) [2024-06-28 09:12:18,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43963.7, 300 sec: 44153.8). Total num frames: 3178381312. Throughput: 0: 44276.7. Samples: 3081306620. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 09:12:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:12:22,131][06909] Updated weights for policy 0, policy_version 194003 (0.0027) [2024-06-28 09:12:23,850][06674] Fps is (10 sec: 44237.3, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 3178610688. Throughput: 0: 44320.4. Samples: 3081574560. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 09:12:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:12:25,941][06909] Updated weights for policy 0, policy_version 194013 (0.0045) [2024-06-28 09:12:28,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3178823680. Throughput: 0: 44327.5. Samples: 3081711620. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 09:12:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:12:29,799][06909] Updated weights for policy 0, policy_version 194023 (0.0028) [2024-06-28 09:12:33,298][06909] Updated weights for policy 0, policy_version 194033 (0.0028) [2024-06-28 09:12:33,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 3179053056. Throughput: 0: 44059.1. Samples: 3081967220. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 09:12:33,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:12:37,032][06909] Updated weights for policy 0, policy_version 194043 (0.0029) [2024-06-28 09:12:38,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3179266048. Throughput: 0: 44232.9. Samples: 3082234940. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 09:12:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:12:41,420][06909] Updated weights for policy 0, policy_version 194053 (0.0045) [2024-06-28 09:12:43,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 3179495424. Throughput: 0: 44380.3. Samples: 3082371880. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 09:12:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:12:44,391][06909] Updated weights for policy 0, policy_version 194063 (0.0031) [2024-06-28 09:12:48,609][06909] Updated weights for policy 0, policy_version 194073 (0.0036) [2024-06-28 09:12:48,852][06674] Fps is (10 sec: 42589.6, 60 sec: 43962.2, 300 sec: 44097.6). Total num frames: 3179692032. Throughput: 0: 44144.7. Samples: 3082626680. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 09:12:48,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:12:51,761][06909] Updated weights for policy 0, policy_version 194083 (0.0024) [2024-06-28 09:12:53,852][06674] Fps is (10 sec: 44227.8, 60 sec: 44509.0, 300 sec: 44097.6). Total num frames: 3179937792. Throughput: 0: 44210.0. Samples: 3082894280. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 09:12:53,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 09:12:55,800][06909] Updated weights for policy 0, policy_version 194093 (0.0020) [2024-06-28 09:12:58,852][06674] Fps is (10 sec: 47513.8, 60 sec: 44508.3, 300 sec: 44153.2). Total num frames: 3180167168. Throughput: 0: 44343.0. Samples: 3083037360. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 09:12:58,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:12:59,054][06909] Updated weights for policy 0, policy_version 194103 (0.0031) [2024-06-28 09:13:01,248][06887] Signal inference workers to stop experience collection... (43750 times) [2024-06-28 09:13:01,249][06887] Signal inference workers to resume experience collection... (43750 times) [2024-06-28 09:13:01,287][06909] InferenceWorker_p0-w0: stopping experience collection (43750 times) [2024-06-28 09:13:01,287][06909] InferenceWorker_p0-w0: resuming experience collection (43750 times) [2024-06-28 09:13:03,607][06909] Updated weights for policy 0, policy_version 194113 (0.0050) [2024-06-28 09:13:03,850][06674] Fps is (10 sec: 40968.4, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 3180347392. Throughput: 0: 44182.8. Samples: 3083294840. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 09:13:03,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:13:06,930][06909] Updated weights for policy 0, policy_version 194123 (0.0029) [2024-06-28 09:13:08,850][06674] Fps is (10 sec: 40968.7, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 3180576768. Throughput: 0: 43991.2. Samples: 3083554160. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 09:13:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:13:10,958][06909] Updated weights for policy 0, policy_version 194133 (0.0032) [2024-06-28 09:13:13,852][06674] Fps is (10 sec: 44227.8, 60 sec: 43689.2, 300 sec: 43986.6). Total num frames: 3180789760. Throughput: 0: 43890.5. Samples: 3083686780. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 09:13:13,853][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:13:14,310][06909] Updated weights for policy 0, policy_version 194143 (0.0035) [2024-06-28 09:13:18,750][06909] Updated weights for policy 0, policy_version 194153 (0.0037) [2024-06-28 09:13:18,852][06674] Fps is (10 sec: 42589.3, 60 sec: 43689.3, 300 sec: 44042.1). Total num frames: 3181002752. Throughput: 0: 44018.0. Samples: 3083948120. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 09:13:18,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:13:21,622][06909] Updated weights for policy 0, policy_version 194163 (0.0035) [2024-06-28 09:13:23,850][06674] Fps is (10 sec: 45885.2, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 3181248512. Throughput: 0: 43778.4. Samples: 3084204960. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 09:13:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:13:26,255][06909] Updated weights for policy 0, policy_version 194173 (0.0041) [2024-06-28 09:13:28,854][06674] Fps is (10 sec: 47505.9, 60 sec: 44234.1, 300 sec: 44042.2). Total num frames: 3181477888. Throughput: 0: 43936.9. Samples: 3084349200. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 09:13:28,854][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 09:13:29,071][06909] Updated weights for policy 0, policy_version 194183 (0.0022) [2024-06-28 09:13:33,791][06909] Updated weights for policy 0, policy_version 194193 (0.0025) [2024-06-28 09:13:33,850][06674] Fps is (10 sec: 40959.4, 60 sec: 43417.6, 300 sec: 44042.4). Total num frames: 3181658112. Throughput: 0: 44123.4. Samples: 3084612140. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 09:13:33,850][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 09:13:36,534][06909] Updated weights for policy 0, policy_version 194203 (0.0038) [2024-06-28 09:13:38,850][06674] Fps is (10 sec: 44253.6, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 3181920256. Throughput: 0: 43815.9. Samples: 3084865900. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 09:13:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:13:40,999][06909] Updated weights for policy 0, policy_version 194213 (0.0029) [2024-06-28 09:13:43,850][06674] Fps is (10 sec: 47513.6, 60 sec: 43963.8, 300 sec: 44042.7). Total num frames: 3182133248. Throughput: 0: 43640.6. Samples: 3085001100. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 09:13:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:13:44,106][06909] Updated weights for policy 0, policy_version 194223 (0.0037) [2024-06-28 09:13:48,398][06909] Updated weights for policy 0, policy_version 194233 (0.0029) [2024-06-28 09:13:48,850][06674] Fps is (10 sec: 40959.2, 60 sec: 43965.2, 300 sec: 44042.4). Total num frames: 3182329856. Throughput: 0: 43856.4. Samples: 3085268380. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 09:13:48,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:13:48,875][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000194235_3182346240.pth... [2024-06-28 09:13:48,951][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000193592_3171811328.pth [2024-06-28 09:13:51,532][06909] Updated weights for policy 0, policy_version 194243 (0.0036) [2024-06-28 09:13:53,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43965.2, 300 sec: 44153.5). Total num frames: 3182575616. Throughput: 0: 43773.2. Samples: 3085523960. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 09:13:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:13:56,025][06909] Updated weights for policy 0, policy_version 194253 (0.0032) [2024-06-28 09:13:58,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43692.2, 300 sec: 43986.9). Total num frames: 3182788608. Throughput: 0: 43845.1. Samples: 3085659720. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 09:13:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:13:59,106][06909] Updated weights for policy 0, policy_version 194263 (0.0034) [2024-06-28 09:14:03,404][06909] Updated weights for policy 0, policy_version 194273 (0.0050) [2024-06-28 09:14:03,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3182985216. Throughput: 0: 44106.5. Samples: 3085932820. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 09:14:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:14:06,348][06909] Updated weights for policy 0, policy_version 194283 (0.0044) [2024-06-28 09:14:08,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44509.8, 300 sec: 44209.0). Total num frames: 3183247360. Throughput: 0: 44014.1. Samples: 3086185600. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 09:14:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:14:10,907][06909] Updated weights for policy 0, policy_version 194293 (0.0033) [2024-06-28 09:14:13,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44238.4, 300 sec: 43986.9). Total num frames: 3183443968. Throughput: 0: 44030.3. Samples: 3086330400. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 09:14:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:14:13,886][06909] Updated weights for policy 0, policy_version 194303 (0.0026) [2024-06-28 09:14:18,134][06909] Updated weights for policy 0, policy_version 194313 (0.0042) [2024-06-28 09:14:18,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44511.4, 300 sec: 44097.9). Total num frames: 3183673344. Throughput: 0: 43895.6. Samples: 3086587440. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 09:14:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:14:21,269][06909] Updated weights for policy 0, policy_version 194323 (0.0039) [2024-06-28 09:14:23,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44236.7, 300 sec: 44153.7). Total num frames: 3183902720. Throughput: 0: 43933.7. Samples: 3086842920. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 09:14:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:14:25,807][06909] Updated weights for policy 0, policy_version 194333 (0.0028) [2024-06-28 09:14:26,543][06887] Signal inference workers to stop experience collection... (43800 times) [2024-06-28 09:14:26,596][06909] InferenceWorker_p0-w0: stopping experience collection (43800 times) [2024-06-28 09:14:26,603][06887] Signal inference workers to resume experience collection... (43800 times) [2024-06-28 09:14:26,605][06909] InferenceWorker_p0-w0: resuming experience collection (43800 times) [2024-06-28 09:14:28,663][06909] Updated weights for policy 0, policy_version 194343 (0.0027) [2024-06-28 09:14:28,856][06674] Fps is (10 sec: 44210.2, 60 sec: 43962.0, 300 sec: 43986.0). Total num frames: 3184115712. Throughput: 0: 43955.9. Samples: 3086979380. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 09:14:28,856][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:14:33,257][06909] Updated weights for policy 0, policy_version 194353 (0.0029) [2024-06-28 09:14:33,850][06674] Fps is (10 sec: 40959.7, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 3184312320. Throughput: 0: 43929.8. Samples: 3087245220. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 09:14:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:14:36,332][06909] Updated weights for policy 0, policy_version 194363 (0.0032) [2024-06-28 09:14:38,850][06674] Fps is (10 sec: 44263.7, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 3184558080. Throughput: 0: 43934.3. Samples: 3087501000. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 09:14:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:14:40,755][06909] Updated weights for policy 0, policy_version 194373 (0.0029) [2024-06-28 09:14:43,530][06909] Updated weights for policy 0, policy_version 194383 (0.0023) [2024-06-28 09:14:43,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3184771072. Throughput: 0: 44041.4. Samples: 3087641580. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 09:14:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:14:48,027][06909] Updated weights for policy 0, policy_version 194393 (0.0034) [2024-06-28 09:14:48,851][06674] Fps is (10 sec: 42593.5, 60 sec: 44236.0, 300 sec: 44042.3). Total num frames: 3184984064. Throughput: 0: 43826.9. Samples: 3087905080. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 09:14:48,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:14:51,277][06909] Updated weights for policy 0, policy_version 194403 (0.0023) [2024-06-28 09:14:53,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 3185213440. Throughput: 0: 43929.0. Samples: 3088162400. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 09:14:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:14:55,255][06909] Updated weights for policy 0, policy_version 194413 (0.0034) [2024-06-28 09:14:58,666][06909] Updated weights for policy 0, policy_version 194423 (0.0026) [2024-06-28 09:14:58,850][06674] Fps is (10 sec: 44241.9, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3185426432. Throughput: 0: 43753.3. Samples: 3088299300. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 09:14:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:15:02,809][06909] Updated weights for policy 0, policy_version 194433 (0.0029) [2024-06-28 09:15:03,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3185639424. Throughput: 0: 43981.4. Samples: 3088566600. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 09:15:03,850][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 09:15:06,126][06909] Updated weights for policy 0, policy_version 194443 (0.0040) [2024-06-28 09:15:08,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 3185868800. Throughput: 0: 44048.9. Samples: 3088825120. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 09:15:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:15:10,342][06909] Updated weights for policy 0, policy_version 194453 (0.0040) [2024-06-28 09:15:13,553][06909] Updated weights for policy 0, policy_version 194463 (0.0039) [2024-06-28 09:15:13,851][06674] Fps is (10 sec: 45869.0, 60 sec: 44235.8, 300 sec: 44042.2). Total num frames: 3186098176. Throughput: 0: 44205.1. Samples: 3088968400. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 09:15:13,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:15:17,738][06909] Updated weights for policy 0, policy_version 194473 (0.0030) [2024-06-28 09:15:18,851][06674] Fps is (10 sec: 44231.7, 60 sec: 43962.9, 300 sec: 44098.7). Total num frames: 3186311168. Throughput: 0: 44200.3. Samples: 3089234280. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 09:15:18,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:15:20,665][06909] Updated weights for policy 0, policy_version 194483 (0.0031) [2024-06-28 09:15:23,852][06674] Fps is (10 sec: 42595.4, 60 sec: 43689.2, 300 sec: 44097.6). Total num frames: 3186524160. Throughput: 0: 44217.5. Samples: 3089490880. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 09:15:23,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:15:24,937][06909] Updated weights for policy 0, policy_version 194493 (0.0024) [2024-06-28 09:15:28,481][06909] Updated weights for policy 0, policy_version 194503 (0.0044) [2024-06-28 09:15:28,850][06674] Fps is (10 sec: 42602.9, 60 sec: 43695.0, 300 sec: 43931.3). Total num frames: 3186737152. Throughput: 0: 44081.7. Samples: 3089625260. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-28 09:15:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:15:32,217][06909] Updated weights for policy 0, policy_version 194513 (0.0038) [2024-06-28 09:15:33,852][06674] Fps is (10 sec: 44236.8, 60 sec: 44235.3, 300 sec: 43986.6). Total num frames: 3186966528. Throughput: 0: 44073.3. Samples: 3089888420. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-28 09:15:33,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:15:35,683][06909] Updated weights for policy 0, policy_version 194523 (0.0039) [2024-06-28 09:15:38,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 3187195904. Throughput: 0: 44181.7. Samples: 3090150580. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-28 09:15:38,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:15:39,914][06909] Updated weights for policy 0, policy_version 194533 (0.0033) [2024-06-28 09:15:43,253][06909] Updated weights for policy 0, policy_version 194543 (0.0041) [2024-06-28 09:15:43,850][06674] Fps is (10 sec: 45884.7, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3187425280. Throughput: 0: 44239.5. Samples: 3090290080. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-28 09:15:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:15:47,217][06909] Updated weights for policy 0, policy_version 194553 (0.0041) [2024-06-28 09:15:48,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43964.5, 300 sec: 43986.9). Total num frames: 3187621888. Throughput: 0: 44268.3. Samples: 3090558680. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-28 09:15:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:15:48,857][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000194557_3187621888.pth... [2024-06-28 09:15:48,923][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000193913_3177070592.pth [2024-06-28 09:15:50,497][06909] Updated weights for policy 0, policy_version 194563 (0.0044) [2024-06-28 09:15:53,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3187834880. Throughput: 0: 44293.0. Samples: 3090818300. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-28 09:15:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:15:54,727][06909] Updated weights for policy 0, policy_version 194573 (0.0021) [2024-06-28 09:15:58,122][06909] Updated weights for policy 0, policy_version 194583 (0.0033) [2024-06-28 09:15:58,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3188064256. Throughput: 0: 44061.3. Samples: 3090951100. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-28 09:15:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:16:02,122][06909] Updated weights for policy 0, policy_version 194593 (0.0037) [2024-06-28 09:16:03,850][06674] Fps is (10 sec: 45874.5, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 3188293632. Throughput: 0: 43982.4. Samples: 3091213440. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-28 09:16:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:16:05,471][06909] Updated weights for policy 0, policy_version 194603 (0.0031) [2024-06-28 09:16:08,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3188490240. Throughput: 0: 44098.9. Samples: 3091475240. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-28 09:16:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:16:09,514][06909] Updated weights for policy 0, policy_version 194613 (0.0033) [2024-06-28 09:16:11,925][06887] Signal inference workers to stop experience collection... (43850 times) [2024-06-28 09:16:11,931][06887] Signal inference workers to resume experience collection... (43850 times) [2024-06-28 09:16:11,971][06909] InferenceWorker_p0-w0: stopping experience collection (43850 times) [2024-06-28 09:16:11,973][06909] InferenceWorker_p0-w0: resuming experience collection (43850 times) [2024-06-28 09:16:12,957][06909] Updated weights for policy 0, policy_version 194623 (0.0037) [2024-06-28 09:16:13,851][06674] Fps is (10 sec: 44232.1, 60 sec: 43963.9, 300 sec: 44042.3). Total num frames: 3188736000. Throughput: 0: 43968.8. Samples: 3091603900. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-28 09:16:13,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:16:17,130][06909] Updated weights for policy 0, policy_version 194633 (0.0030) [2024-06-28 09:16:18,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43964.5, 300 sec: 44042.4). Total num frames: 3188948992. Throughput: 0: 44008.1. Samples: 3091868700. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-28 09:16:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:16:20,587][06909] Updated weights for policy 0, policy_version 194643 (0.0031) [2024-06-28 09:16:23,850][06674] Fps is (10 sec: 40964.6, 60 sec: 43692.1, 300 sec: 43986.9). Total num frames: 3189145600. Throughput: 0: 43898.7. Samples: 3092126020. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-28 09:16:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:16:24,939][06909] Updated weights for policy 0, policy_version 194653 (0.0026) [2024-06-28 09:16:28,041][06909] Updated weights for policy 0, policy_version 194663 (0.0036) [2024-06-28 09:16:28,856][06674] Fps is (10 sec: 44210.3, 60 sec: 44232.4, 300 sec: 44041.5). Total num frames: 3189391360. Throughput: 0: 43733.6. Samples: 3092258360. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-28 09:16:28,856][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:16:32,250][06909] Updated weights for policy 0, policy_version 194673 (0.0035) [2024-06-28 09:16:33,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43965.2, 300 sec: 43986.9). Total num frames: 3189604352. Throughput: 0: 43732.1. Samples: 3092526620. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-28 09:16:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:16:35,603][06909] Updated weights for policy 0, policy_version 194683 (0.0027) [2024-06-28 09:16:38,850][06674] Fps is (10 sec: 40985.0, 60 sec: 43417.6, 300 sec: 43931.3). Total num frames: 3189800960. Throughput: 0: 43779.0. Samples: 3092788360. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-28 09:16:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:16:39,929][06909] Updated weights for policy 0, policy_version 194693 (0.0032) [2024-06-28 09:16:42,869][06909] Updated weights for policy 0, policy_version 194703 (0.0030) [2024-06-28 09:16:43,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 3190046720. Throughput: 0: 43635.6. Samples: 3092914700. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-28 09:16:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:16:47,133][06909] Updated weights for policy 0, policy_version 194713 (0.0036) [2024-06-28 09:16:48,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.8, 300 sec: 44042.6). Total num frames: 3190259712. Throughput: 0: 43793.9. Samples: 3093184160. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-28 09:16:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:16:50,377][06909] Updated weights for policy 0, policy_version 194723 (0.0042) [2024-06-28 09:16:53,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 3190456320. Throughput: 0: 43802.7. Samples: 3093446360. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-28 09:16:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 09:16:54,393][06909] Updated weights for policy 0, policy_version 194733 (0.0040) [2024-06-28 09:16:57,851][06909] Updated weights for policy 0, policy_version 194743 (0.0033) [2024-06-28 09:16:58,850][06674] Fps is (10 sec: 45874.6, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 3190718464. Throughput: 0: 43893.9. Samples: 3093579080. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-28 09:16:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 09:17:01,967][06909] Updated weights for policy 0, policy_version 194753 (0.0029) [2024-06-28 09:17:03,850][06674] Fps is (10 sec: 47512.9, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3190931456. Throughput: 0: 43977.8. Samples: 3093847700. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-28 09:17:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:17:05,332][06909] Updated weights for policy 0, policy_version 194763 (0.0039) [2024-06-28 09:17:08,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43963.7, 300 sec: 43931.4). Total num frames: 3191128064. Throughput: 0: 44111.6. Samples: 3094111040. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-28 09:17:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:17:09,221][06909] Updated weights for policy 0, policy_version 194773 (0.0031) [2024-06-28 09:17:12,659][06909] Updated weights for policy 0, policy_version 194783 (0.0039) [2024-06-28 09:17:13,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43964.6, 300 sec: 44042.5). Total num frames: 3191373824. Throughput: 0: 44104.3. Samples: 3094242780. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-28 09:17:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:17:16,528][06909] Updated weights for policy 0, policy_version 194793 (0.0034) [2024-06-28 09:17:18,850][06674] Fps is (10 sec: 47513.8, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 3191603200. Throughput: 0: 44092.5. Samples: 3094510780. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-28 09:17:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:17:20,092][06909] Updated weights for policy 0, policy_version 194803 (0.0038) [2024-06-28 09:17:23,755][06909] Updated weights for policy 0, policy_version 194813 (0.0034) [2024-06-28 09:17:23,850][06674] Fps is (10 sec: 44236.1, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 3191816192. Throughput: 0: 44163.9. Samples: 3094775740. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-28 09:17:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:17:27,462][06909] Updated weights for policy 0, policy_version 194823 (0.0040) [2024-06-28 09:17:28,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44241.2, 300 sec: 44042.4). Total num frames: 3192045568. Throughput: 0: 44278.6. Samples: 3094907240. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-28 09:17:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:17:31,447][06909] Updated weights for policy 0, policy_version 194833 (0.0025) [2024-06-28 09:17:33,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44509.8, 300 sec: 44097.9). Total num frames: 3192274944. Throughput: 0: 44299.9. Samples: 3095177660. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-28 09:17:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 09:17:34,687][06909] Updated weights for policy 0, policy_version 194843 (0.0038) [2024-06-28 09:17:35,050][06887] Signal inference workers to stop experience collection... (43900 times) [2024-06-28 09:17:35,078][06909] InferenceWorker_p0-w0: stopping experience collection (43900 times) [2024-06-28 09:17:35,100][06887] Signal inference workers to resume experience collection... (43900 times) [2024-06-28 09:17:35,101][06909] InferenceWorker_p0-w0: resuming experience collection (43900 times) [2024-06-28 09:17:38,825][06909] Updated weights for policy 0, policy_version 194853 (0.0037) [2024-06-28 09:17:38,856][06674] Fps is (10 sec: 42573.0, 60 sec: 44505.4, 300 sec: 43986.0). Total num frames: 3192471552. Throughput: 0: 44541.9. Samples: 3095451020. Policy #0 lag: (min: 1.0, avg: 8.3, max: 20.0) [2024-06-28 09:17:38,856][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:17:42,311][06909] Updated weights for policy 0, policy_version 194863 (0.0034) [2024-06-28 09:17:43,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44236.8, 300 sec: 44098.3). Total num frames: 3192700928. Throughput: 0: 44395.2. Samples: 3095576860. Policy #0 lag: (min: 1.0, avg: 8.3, max: 20.0) [2024-06-28 09:17:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:17:46,270][06909] Updated weights for policy 0, policy_version 194873 (0.0022) [2024-06-28 09:17:48,850][06674] Fps is (10 sec: 44263.6, 60 sec: 44236.8, 300 sec: 43987.2). Total num frames: 3192913920. Throughput: 0: 44188.5. Samples: 3095836180. Policy #0 lag: (min: 1.0, avg: 8.3, max: 20.0) [2024-06-28 09:17:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:17:48,871][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000194880_3192913920.pth... [2024-06-28 09:17:48,916][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000194235_3182346240.pth [2024-06-28 09:17:49,710][06909] Updated weights for policy 0, policy_version 194883 (0.0019) [2024-06-28 09:17:53,587][06909] Updated weights for policy 0, policy_version 194893 (0.0028) [2024-06-28 09:17:53,850][06674] Fps is (10 sec: 42598.2, 60 sec: 44509.7, 300 sec: 43931.6). Total num frames: 3193126912. Throughput: 0: 44155.5. Samples: 3096098040. Policy #0 lag: (min: 1.0, avg: 8.3, max: 20.0) [2024-06-28 09:17:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:17:57,100][06909] Updated weights for policy 0, policy_version 194903 (0.0027) [2024-06-28 09:17:58,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 3193356288. Throughput: 0: 44216.7. Samples: 3096232540. Policy #0 lag: (min: 1.0, avg: 8.3, max: 20.0) [2024-06-28 09:17:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:18:01,113][06909] Updated weights for policy 0, policy_version 194913 (0.0036) [2024-06-28 09:18:03,852][06674] Fps is (10 sec: 45865.9, 60 sec: 44235.3, 300 sec: 44097.6). Total num frames: 3193585664. Throughput: 0: 44048.1. Samples: 3096493040. Policy #0 lag: (min: 1.0, avg: 8.3, max: 20.0) [2024-06-28 09:18:03,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:18:04,544][06909] Updated weights for policy 0, policy_version 194923 (0.0032) [2024-06-28 09:18:08,531][06909] Updated weights for policy 0, policy_version 194933 (0.0026) [2024-06-28 09:18:08,850][06674] Fps is (10 sec: 42598.8, 60 sec: 44236.8, 300 sec: 44042.7). Total num frames: 3193782272. Throughput: 0: 44186.3. Samples: 3096764120. Policy #0 lag: (min: 1.0, avg: 8.3, max: 20.0) [2024-06-28 09:18:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:18:11,753][06909] Updated weights for policy 0, policy_version 194943 (0.0035) [2024-06-28 09:18:13,850][06674] Fps is (10 sec: 44245.9, 60 sec: 44236.7, 300 sec: 44153.8). Total num frames: 3194028032. Throughput: 0: 44119.2. Samples: 3096892600. Policy #0 lag: (min: 1.0, avg: 8.3, max: 20.0) [2024-06-28 09:18:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:18:16,037][06909] Updated weights for policy 0, policy_version 194953 (0.0030) [2024-06-28 09:18:18,850][06674] Fps is (10 sec: 45873.1, 60 sec: 43963.4, 300 sec: 44042.3). Total num frames: 3194241024. Throughput: 0: 43964.5. Samples: 3097156080. Policy #0 lag: (min: 1.0, avg: 8.3, max: 20.0) [2024-06-28 09:18:18,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:18:19,373][06909] Updated weights for policy 0, policy_version 194963 (0.0032) [2024-06-28 09:18:23,421][06909] Updated weights for policy 0, policy_version 194973 (0.0028) [2024-06-28 09:18:23,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.7, 300 sec: 43987.4). Total num frames: 3194454016. Throughput: 0: 43897.4. Samples: 3097426140. Policy #0 lag: (min: 1.0, avg: 8.3, max: 20.0) [2024-06-28 09:18:23,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:18:26,638][06909] Updated weights for policy 0, policy_version 194983 (0.0046) [2024-06-28 09:18:28,850][06674] Fps is (10 sec: 44237.9, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 3194683392. Throughput: 0: 43985.2. Samples: 3097556200. Policy #0 lag: (min: 1.0, avg: 8.3, max: 20.0) [2024-06-28 09:18:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:18:30,864][06909] Updated weights for policy 0, policy_version 194993 (0.0035) [2024-06-28 09:18:33,852][06674] Fps is (10 sec: 45866.2, 60 sec: 43962.3, 300 sec: 44042.1). Total num frames: 3194912768. Throughput: 0: 44162.0. Samples: 3097823560. Policy #0 lag: (min: 1.0, avg: 8.3, max: 20.0) [2024-06-28 09:18:33,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:18:34,150][06909] Updated weights for policy 0, policy_version 195003 (0.0040) [2024-06-28 09:18:38,121][06909] Updated weights for policy 0, policy_version 195013 (0.0035) [2024-06-28 09:18:38,850][06674] Fps is (10 sec: 44237.8, 60 sec: 44241.3, 300 sec: 44042.4). Total num frames: 3195125760. Throughput: 0: 44253.9. Samples: 3098089460. Policy #0 lag: (min: 1.0, avg: 8.3, max: 20.0) [2024-06-28 09:18:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:18:41,329][06909] Updated weights for policy 0, policy_version 195023 (0.0039) [2024-06-28 09:18:43,852][06674] Fps is (10 sec: 42598.3, 60 sec: 43962.2, 300 sec: 44097.7). Total num frames: 3195338752. Throughput: 0: 44114.5. Samples: 3098217780. Policy #0 lag: (min: 1.0, avg: 8.3, max: 20.0) [2024-06-28 09:18:43,852][06674] Avg episode reward: [(0, '0.415')] [2024-06-28 09:18:45,743][06909] Updated weights for policy 0, policy_version 195033 (0.0034) [2024-06-28 09:18:48,798][06909] Updated weights for policy 0, policy_version 195043 (0.0033) [2024-06-28 09:18:48,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 3195584512. Throughput: 0: 44297.2. Samples: 3098486320. Policy #0 lag: (min: 0.0, avg: 8.3, max: 22.0) [2024-06-28 09:18:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:18:53,219][06909] Updated weights for policy 0, policy_version 195053 (0.0033) [2024-06-28 09:18:53,850][06674] Fps is (10 sec: 45884.3, 60 sec: 44509.8, 300 sec: 44097.9). Total num frames: 3195797504. Throughput: 0: 43999.4. Samples: 3098744100. Policy #0 lag: (min: 0.0, avg: 8.3, max: 22.0) [2024-06-28 09:18:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:18:55,875][06887] Signal inference workers to stop experience collection... (43950 times) [2024-06-28 09:18:55,875][06887] Signal inference workers to resume experience collection... (43950 times) [2024-06-28 09:18:55,886][06909] InferenceWorker_p0-w0: stopping experience collection (43950 times) [2024-06-28 09:18:55,886][06909] InferenceWorker_p0-w0: resuming experience collection (43950 times) [2024-06-28 09:18:56,497][06909] Updated weights for policy 0, policy_version 195063 (0.0034) [2024-06-28 09:18:58,850][06674] Fps is (10 sec: 39321.1, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 3195977728. Throughput: 0: 44110.6. Samples: 3098877580. Policy #0 lag: (min: 0.0, avg: 8.3, max: 22.0) [2024-06-28 09:18:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:19:00,429][06909] Updated weights for policy 0, policy_version 195073 (0.0024) [2024-06-28 09:19:03,788][06909] Updated weights for policy 0, policy_version 195083 (0.0029) [2024-06-28 09:19:03,850][06674] Fps is (10 sec: 44237.4, 60 sec: 44238.4, 300 sec: 44042.4). Total num frames: 3196239872. Throughput: 0: 44280.9. Samples: 3099148700. Policy #0 lag: (min: 0.0, avg: 8.3, max: 22.0) [2024-06-28 09:19:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 09:19:08,060][06909] Updated weights for policy 0, policy_version 195093 (0.0031) [2024-06-28 09:19:08,850][06674] Fps is (10 sec: 49152.4, 60 sec: 44782.9, 300 sec: 44153.5). Total num frames: 3196469248. Throughput: 0: 44083.1. Samples: 3099409880. Policy #0 lag: (min: 0.0, avg: 8.3, max: 22.0) [2024-06-28 09:19:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:19:11,376][06909] Updated weights for policy 0, policy_version 195103 (0.0036) [2024-06-28 09:19:13,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 3196649472. Throughput: 0: 44085.5. Samples: 3099540040. Policy #0 lag: (min: 0.0, avg: 8.3, max: 22.0) [2024-06-28 09:19:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:19:15,369][06909] Updated weights for policy 0, policy_version 195113 (0.0031) [2024-06-28 09:19:18,552][06909] Updated weights for policy 0, policy_version 195123 (0.0027) [2024-06-28 09:19:18,850][06674] Fps is (10 sec: 42598.7, 60 sec: 44237.1, 300 sec: 44042.4). Total num frames: 3196895232. Throughput: 0: 44197.2. Samples: 3099812340. Policy #0 lag: (min: 0.0, avg: 8.3, max: 22.0) [2024-06-28 09:19:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:19:22,852][06909] Updated weights for policy 0, policy_version 195133 (0.0027) [2024-06-28 09:19:23,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.8, 300 sec: 43987.8). Total num frames: 3197091840. Throughput: 0: 44024.0. Samples: 3100070540. Policy #0 lag: (min: 0.0, avg: 8.3, max: 22.0) [2024-06-28 09:19:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 09:19:26,219][06909] Updated weights for policy 0, policy_version 195143 (0.0026) [2024-06-28 09:19:28,850][06674] Fps is (10 sec: 44235.6, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 3197337600. Throughput: 0: 44047.6. Samples: 3100199840. Policy #0 lag: (min: 0.0, avg: 8.3, max: 22.0) [2024-06-28 09:19:28,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-28 09:19:30,395][06909] Updated weights for policy 0, policy_version 195153 (0.0029) [2024-06-28 09:19:33,480][06909] Updated weights for policy 0, policy_version 195163 (0.0033) [2024-06-28 09:19:33,852][06674] Fps is (10 sec: 45865.4, 60 sec: 43963.7, 300 sec: 44042.1). Total num frames: 3197550592. Throughput: 0: 44068.6. Samples: 3100469500. Policy #0 lag: (min: 0.0, avg: 8.3, max: 22.0) [2024-06-28 09:19:33,853][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:19:37,571][06909] Updated weights for policy 0, policy_version 195173 (0.0035) [2024-06-28 09:19:38,850][06674] Fps is (10 sec: 44237.4, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 3197779968. Throughput: 0: 44208.9. Samples: 3100733500. Policy #0 lag: (min: 0.0, avg: 8.3, max: 22.0) [2024-06-28 09:19:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:19:40,708][06909] Updated weights for policy 0, policy_version 195183 (0.0031) [2024-06-28 09:19:43,852][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.8, 300 sec: 44097.8). Total num frames: 3197992960. Throughput: 0: 44225.7. Samples: 3100867820. Policy #0 lag: (min: 0.0, avg: 8.3, max: 22.0) [2024-06-28 09:19:43,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:19:44,942][06909] Updated weights for policy 0, policy_version 195193 (0.0032) [2024-06-28 09:19:48,205][06909] Updated weights for policy 0, policy_version 195203 (0.0028) [2024-06-28 09:19:48,852][06674] Fps is (10 sec: 44228.1, 60 sec: 43962.2, 300 sec: 44097.6). Total num frames: 3198222336. Throughput: 0: 44160.6. Samples: 3101136020. Policy #0 lag: (min: 0.0, avg: 8.3, max: 22.0) [2024-06-28 09:19:48,852][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 09:19:48,861][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000195204_3198222336.pth... [2024-06-28 09:19:48,931][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000194557_3187621888.pth [2024-06-28 09:19:52,290][06909] Updated weights for policy 0, policy_version 195213 (0.0033) [2024-06-28 09:19:53,850][06674] Fps is (10 sec: 44243.0, 60 sec: 43963.3, 300 sec: 44097.9). Total num frames: 3198435328. Throughput: 0: 44367.9. Samples: 3101406460. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 09:19:53,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:19:55,441][06909] Updated weights for policy 0, policy_version 195223 (0.0031) [2024-06-28 09:19:58,850][06674] Fps is (10 sec: 42607.4, 60 sec: 44510.0, 300 sec: 44098.0). Total num frames: 3198648320. Throughput: 0: 44228.5. Samples: 3101530320. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 09:19:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:19:59,661][06909] Updated weights for policy 0, policy_version 195233 (0.0031) [2024-06-28 09:20:03,062][06909] Updated weights for policy 0, policy_version 195243 (0.0040) [2024-06-28 09:20:03,850][06674] Fps is (10 sec: 44239.7, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 3198877696. Throughput: 0: 44052.4. Samples: 3101794700. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 09:20:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:20:07,308][06909] Updated weights for policy 0, policy_version 195253 (0.0036) [2024-06-28 09:20:08,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.7, 300 sec: 44098.1). Total num frames: 3199107072. Throughput: 0: 44299.5. Samples: 3102064020. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 09:20:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:20:10,239][06909] Updated weights for policy 0, policy_version 195263 (0.0026) [2024-06-28 09:20:13,852][06674] Fps is (10 sec: 42589.2, 60 sec: 44235.2, 300 sec: 44042.3). Total num frames: 3199303680. Throughput: 0: 44343.4. Samples: 3102195380. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 09:20:13,853][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:20:14,542][06909] Updated weights for policy 0, policy_version 195273 (0.0021) [2024-06-28 09:20:16,985][06887] Signal inference workers to stop experience collection... (44000 times) [2024-06-28 09:20:17,007][06909] InferenceWorker_p0-w0: stopping experience collection (44000 times) [2024-06-28 09:20:17,045][06887] Signal inference workers to resume experience collection... (44000 times) [2024-06-28 09:20:17,046][06909] InferenceWorker_p0-w0: resuming experience collection (44000 times) [2024-06-28 09:20:17,866][06909] Updated weights for policy 0, policy_version 195283 (0.0034) [2024-06-28 09:20:18,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.7, 300 sec: 44153.8). Total num frames: 3199549440. Throughput: 0: 44139.3. Samples: 3102455680. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 09:20:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:20:21,882][06909] Updated weights for policy 0, policy_version 195293 (0.0032) [2024-06-28 09:20:23,850][06674] Fps is (10 sec: 44246.5, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 3199746048. Throughput: 0: 44338.4. Samples: 3102728720. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 09:20:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:20:25,157][06909] Updated weights for policy 0, policy_version 195303 (0.0026) [2024-06-28 09:20:28,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.8, 300 sec: 44098.2). Total num frames: 3199975424. Throughput: 0: 44077.0. Samples: 3102851200. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 09:20:28,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:20:29,289][06909] Updated weights for policy 0, policy_version 195313 (0.0031) [2024-06-28 09:20:32,443][06909] Updated weights for policy 0, policy_version 195323 (0.0028) [2024-06-28 09:20:33,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44238.3, 300 sec: 44098.0). Total num frames: 3200204800. Throughput: 0: 44044.7. Samples: 3103117940. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 09:20:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:20:36,706][06909] Updated weights for policy 0, policy_version 195333 (0.0022) [2024-06-28 09:20:38,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43690.8, 300 sec: 43986.9). Total num frames: 3200401408. Throughput: 0: 44002.9. Samples: 3103386560. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 09:20:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:20:39,958][06909] Updated weights for policy 0, policy_version 195343 (0.0035) [2024-06-28 09:20:43,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43965.2, 300 sec: 44098.0). Total num frames: 3200630784. Throughput: 0: 44234.2. Samples: 3103520860. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 09:20:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:20:44,190][06909] Updated weights for policy 0, policy_version 195353 (0.0034) [2024-06-28 09:20:47,220][06909] Updated weights for policy 0, policy_version 195363 (0.0033) [2024-06-28 09:20:48,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43965.3, 300 sec: 44153.5). Total num frames: 3200860160. Throughput: 0: 44005.8. Samples: 3103774960. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 09:20:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:20:51,706][06909] Updated weights for policy 0, policy_version 195373 (0.0038) [2024-06-28 09:20:53,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43964.2, 300 sec: 44098.0). Total num frames: 3201073152. Throughput: 0: 44053.4. Samples: 3104046420. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 09:20:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:20:54,905][06909] Updated weights for policy 0, policy_version 195383 (0.0024) [2024-06-28 09:20:58,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3201286144. Throughput: 0: 43894.1. Samples: 3104170520. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 09:20:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:20:59,141][06909] Updated weights for policy 0, policy_version 195393 (0.0031) [2024-06-28 09:21:02,273][06909] Updated weights for policy 0, policy_version 195403 (0.0026) [2024-06-28 09:21:03,852][06674] Fps is (10 sec: 44227.3, 60 sec: 43962.2, 300 sec: 44153.2). Total num frames: 3201515520. Throughput: 0: 43966.8. Samples: 3104434280. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 09:21:03,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:21:06,677][06909] Updated weights for policy 0, policy_version 195413 (0.0027) [2024-06-28 09:21:08,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43690.7, 300 sec: 44042.6). Total num frames: 3201728512. Throughput: 0: 43984.8. Samples: 3104708040. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 09:21:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 09:21:09,992][06909] Updated weights for policy 0, policy_version 195423 (0.0041) [2024-06-28 09:21:13,850][06674] Fps is (10 sec: 44246.2, 60 sec: 44238.4, 300 sec: 44098.0). Total num frames: 3201957888. Throughput: 0: 44069.4. Samples: 3104834320. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 09:21:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:21:14,139][06909] Updated weights for policy 0, policy_version 195433 (0.0028) [2024-06-28 09:21:17,474][06909] Updated weights for policy 0, policy_version 195443 (0.0024) [2024-06-28 09:21:18,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43690.8, 300 sec: 44153.5). Total num frames: 3202170880. Throughput: 0: 43893.9. Samples: 3105093160. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 09:21:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:21:21,531][06909] Updated weights for policy 0, policy_version 195453 (0.0028) [2024-06-28 09:21:23,852][06674] Fps is (10 sec: 42589.8, 60 sec: 43962.2, 300 sec: 44043.0). Total num frames: 3202383872. Throughput: 0: 43920.2. Samples: 3105363060. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 09:21:23,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:21:24,983][06909] Updated weights for policy 0, policy_version 195463 (0.0039) [2024-06-28 09:21:28,850][06674] Fps is (10 sec: 44235.9, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 3202613248. Throughput: 0: 43829.7. Samples: 3105493200. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 09:21:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:21:29,038][06887] Signal inference workers to stop experience collection... (44050 times) [2024-06-28 09:21:29,076][06909] InferenceWorker_p0-w0: stopping experience collection (44050 times) [2024-06-28 09:21:29,096][06887] Signal inference workers to resume experience collection... (44050 times) [2024-06-28 09:21:29,100][06909] InferenceWorker_p0-w0: resuming experience collection (44050 times) [2024-06-28 09:21:29,102][06909] Updated weights for policy 0, policy_version 195473 (0.0034) [2024-06-28 09:21:32,344][06909] Updated weights for policy 0, policy_version 195483 (0.0038) [2024-06-28 09:21:33,850][06674] Fps is (10 sec: 44245.7, 60 sec: 43690.6, 300 sec: 44153.5). Total num frames: 3202826240. Throughput: 0: 43942.2. Samples: 3105752360. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 09:21:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:21:36,353][06909] Updated weights for policy 0, policy_version 195493 (0.0019) [2024-06-28 09:21:38,850][06674] Fps is (10 sec: 44237.6, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 3203055616. Throughput: 0: 43867.2. Samples: 3106020440. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 09:21:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:21:39,804][06909] Updated weights for policy 0, policy_version 195503 (0.0027) [2024-06-28 09:21:43,691][06909] Updated weights for policy 0, policy_version 195513 (0.0038) [2024-06-28 09:21:43,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 3203284992. Throughput: 0: 44184.4. Samples: 3106158820. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 09:21:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:21:47,169][06909] Updated weights for policy 0, policy_version 195523 (0.0031) [2024-06-28 09:21:48,856][06674] Fps is (10 sec: 42572.0, 60 sec: 43686.2, 300 sec: 44152.6). Total num frames: 3203481600. Throughput: 0: 44073.0. Samples: 3106417740. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 09:21:48,856][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:21:48,911][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000195526_3203497984.pth... [2024-06-28 09:21:48,955][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000194880_3192913920.pth [2024-06-28 09:21:51,083][06909] Updated weights for policy 0, policy_version 195533 (0.0036) [2024-06-28 09:21:53,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 3203727360. Throughput: 0: 43954.3. Samples: 3106685980. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 09:21:53,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-28 09:21:54,812][06909] Updated weights for policy 0, policy_version 195543 (0.0030) [2024-06-28 09:21:58,448][06909] Updated weights for policy 0, policy_version 195553 (0.0026) [2024-06-28 09:21:58,850][06674] Fps is (10 sec: 45903.0, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 3203940352. Throughput: 0: 44058.6. Samples: 3106816960. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 09:21:58,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:22:02,001][06909] Updated weights for policy 0, policy_version 195563 (0.0034) [2024-06-28 09:22:03,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43965.3, 300 sec: 44153.5). Total num frames: 3204153344. Throughput: 0: 44243.0. Samples: 3107084100. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 09:22:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:22:06,004][06909] Updated weights for policy 0, policy_version 195573 (0.0028) [2024-06-28 09:22:08,850][06674] Fps is (10 sec: 44237.4, 60 sec: 44236.9, 300 sec: 44097.9). Total num frames: 3204382720. Throughput: 0: 44097.6. Samples: 3107347360. Policy #0 lag: (min: 1.0, avg: 8.7, max: 20.0) [2024-06-28 09:22:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:22:09,771][06909] Updated weights for policy 0, policy_version 195583 (0.0032) [2024-06-28 09:22:13,246][06909] Updated weights for policy 0, policy_version 195593 (0.0028) [2024-06-28 09:22:13,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 3204612096. Throughput: 0: 44160.1. Samples: 3107480400. Policy #0 lag: (min: 1.0, avg: 8.7, max: 20.0) [2024-06-28 09:22:13,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 09:22:17,269][06909] Updated weights for policy 0, policy_version 195603 (0.0029) [2024-06-28 09:22:18,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 3204808704. Throughput: 0: 44251.5. Samples: 3107743680. Policy #0 lag: (min: 1.0, avg: 8.7, max: 20.0) [2024-06-28 09:22:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:22:20,725][06909] Updated weights for policy 0, policy_version 195613 (0.0028) [2024-06-28 09:22:23,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44238.3, 300 sec: 44042.4). Total num frames: 3205038080. Throughput: 0: 43988.0. Samples: 3107999900. Policy #0 lag: (min: 1.0, avg: 8.7, max: 20.0) [2024-06-28 09:22:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:22:24,518][06909] Updated weights for policy 0, policy_version 195623 (0.0030) [2024-06-28 09:22:28,096][06909] Updated weights for policy 0, policy_version 195633 (0.0027) [2024-06-28 09:22:28,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3205251072. Throughput: 0: 44033.3. Samples: 3108140320. Policy #0 lag: (min: 1.0, avg: 8.7, max: 20.0) [2024-06-28 09:22:28,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:22:32,022][06909] Updated weights for policy 0, policy_version 195643 (0.0025) [2024-06-28 09:22:33,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.8, 300 sec: 44098.9). Total num frames: 3205480448. Throughput: 0: 44128.3. Samples: 3108403240. Policy #0 lag: (min: 1.0, avg: 8.7, max: 20.0) [2024-06-28 09:22:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 09:22:35,553][06909] Updated weights for policy 0, policy_version 195653 (0.0035) [2024-06-28 09:22:38,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 3205709824. Throughput: 0: 43972.8. Samples: 3108664760. Policy #0 lag: (min: 1.0, avg: 8.7, max: 20.0) [2024-06-28 09:22:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:22:39,349][06909] Updated weights for policy 0, policy_version 195663 (0.0029) [2024-06-28 09:22:43,060][06909] Updated weights for policy 0, policy_version 195673 (0.0038) [2024-06-28 09:22:43,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 3205922816. Throughput: 0: 44107.6. Samples: 3108801800. Policy #0 lag: (min: 1.0, avg: 8.7, max: 20.0) [2024-06-28 09:22:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:22:47,147][06909] Updated weights for policy 0, policy_version 195683 (0.0029) [2024-06-28 09:22:48,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44514.4, 300 sec: 44153.5). Total num frames: 3206152192. Throughput: 0: 43915.1. Samples: 3109060280. Policy #0 lag: (min: 1.0, avg: 8.7, max: 20.0) [2024-06-28 09:22:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:22:50,625][06909] Updated weights for policy 0, policy_version 195693 (0.0029) [2024-06-28 09:22:52,150][06887] Signal inference workers to stop experience collection... (44100 times) [2024-06-28 09:22:52,171][06909] InferenceWorker_p0-w0: stopping experience collection (44100 times) [2024-06-28 09:22:52,258][06887] Signal inference workers to resume experience collection... (44100 times) [2024-06-28 09:22:52,258][06909] InferenceWorker_p0-w0: resuming experience collection (44100 times) [2024-06-28 09:22:53,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 3206365184. Throughput: 0: 44024.4. Samples: 3109328460. Policy #0 lag: (min: 1.0, avg: 8.7, max: 20.0) [2024-06-28 09:22:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:22:54,295][06909] Updated weights for policy 0, policy_version 195703 (0.0025) [2024-06-28 09:22:57,992][06909] Updated weights for policy 0, policy_version 195713 (0.0035) [2024-06-28 09:22:58,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.8, 300 sec: 44042.7). Total num frames: 3206578176. Throughput: 0: 44028.0. Samples: 3109461660. Policy #0 lag: (min: 1.0, avg: 8.7, max: 20.0) [2024-06-28 09:22:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:23:01,646][06909] Updated weights for policy 0, policy_version 195723 (0.0037) [2024-06-28 09:23:03,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 3206807552. Throughput: 0: 44194.2. Samples: 3109732420. Policy #0 lag: (min: 1.0, avg: 8.7, max: 20.0) [2024-06-28 09:23:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 09:23:05,226][06909] Updated weights for policy 0, policy_version 195733 (0.0029) [2024-06-28 09:23:08,850][06674] Fps is (10 sec: 45874.5, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 3207036928. Throughput: 0: 44223.4. Samples: 3109989960. Policy #0 lag: (min: 1.0, avg: 8.7, max: 20.0) [2024-06-28 09:23:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:23:09,252][06909] Updated weights for policy 0, policy_version 195743 (0.0034) [2024-06-28 09:23:12,557][06909] Updated weights for policy 0, policy_version 195753 (0.0023) [2024-06-28 09:23:13,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.6, 300 sec: 44042.5). Total num frames: 3207233536. Throughput: 0: 44223.0. Samples: 3110130360. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 09:23:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:23:16,482][06909] Updated weights for policy 0, policy_version 195763 (0.0035) [2024-06-28 09:23:18,850][06674] Fps is (10 sec: 42598.7, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 3207462912. Throughput: 0: 44175.5. Samples: 3110391140. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 09:23:18,856][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:23:20,309][06909] Updated weights for policy 0, policy_version 195773 (0.0041) [2024-06-28 09:23:23,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3207675904. Throughput: 0: 44122.3. Samples: 3110650260. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 09:23:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 09:23:24,223][06909] Updated weights for policy 0, policy_version 195783 (0.0034) [2024-06-28 09:23:27,758][06909] Updated weights for policy 0, policy_version 195793 (0.0038) [2024-06-28 09:23:28,851][06674] Fps is (10 sec: 44232.8, 60 sec: 44236.1, 300 sec: 44042.6). Total num frames: 3207905280. Throughput: 0: 43966.6. Samples: 3110780340. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 09:23:28,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:23:31,515][06909] Updated weights for policy 0, policy_version 195803 (0.0032) [2024-06-28 09:23:33,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3208118272. Throughput: 0: 44084.5. Samples: 3111044080. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 09:23:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:23:35,235][06909] Updated weights for policy 0, policy_version 195813 (0.0027) [2024-06-28 09:23:38,850][06674] Fps is (10 sec: 44241.4, 60 sec: 43963.8, 300 sec: 44098.3). Total num frames: 3208347648. Throughput: 0: 43926.7. Samples: 3111305160. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 09:23:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:23:38,926][06909] Updated weights for policy 0, policy_version 195823 (0.0027) [2024-06-28 09:23:42,698][06909] Updated weights for policy 0, policy_version 195833 (0.0034) [2024-06-28 09:23:43,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3208560640. Throughput: 0: 44058.7. Samples: 3111444300. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 09:23:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:23:46,912][06909] Updated weights for policy 0, policy_version 195843 (0.0046) [2024-06-28 09:23:48,852][06674] Fps is (10 sec: 42589.0, 60 sec: 43689.1, 300 sec: 43986.6). Total num frames: 3208773632. Throughput: 0: 43811.8. Samples: 3111704040. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 09:23:48,853][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:23:48,873][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000195848_3208773632.pth... [2024-06-28 09:23:48,922][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000195204_3198222336.pth [2024-06-28 09:23:50,292][06909] Updated weights for policy 0, policy_version 195853 (0.0037) [2024-06-28 09:23:53,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 3209003008. Throughput: 0: 43861.0. Samples: 3111963700. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 09:23:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:23:54,120][06909] Updated weights for policy 0, policy_version 195863 (0.0021) [2024-06-28 09:23:57,827][06909] Updated weights for policy 0, policy_version 195873 (0.0029) [2024-06-28 09:23:58,850][06674] Fps is (10 sec: 44246.3, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3209216000. Throughput: 0: 43714.8. Samples: 3112097520. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 09:23:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 09:24:01,592][06909] Updated weights for policy 0, policy_version 195883 (0.0035) [2024-06-28 09:24:03,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 3209428992. Throughput: 0: 43804.4. Samples: 3112362340. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 09:24:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:24:05,087][06909] Updated weights for policy 0, policy_version 195893 (0.0035) [2024-06-28 09:24:08,774][06909] Updated weights for policy 0, policy_version 195903 (0.0051) [2024-06-28 09:24:08,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 3209674752. Throughput: 0: 43811.9. Samples: 3112621800. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 09:24:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:24:12,614][06909] Updated weights for policy 0, policy_version 195913 (0.0027) [2024-06-28 09:24:13,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 3209887744. Throughput: 0: 43920.9. Samples: 3112756740. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 09:24:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:24:16,293][06909] Updated weights for policy 0, policy_version 195923 (0.0030) [2024-06-28 09:24:18,853][06674] Fps is (10 sec: 40948.3, 60 sec: 43688.5, 300 sec: 44042.0). Total num frames: 3210084352. Throughput: 0: 43885.1. Samples: 3113019040. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 09:24:18,853][06674] Avg episode reward: [(0, '0.493')] [2024-06-28 09:24:19,458][06887] Signal inference workers to stop experience collection... (44150 times) [2024-06-28 09:24:19,458][06887] Signal inference workers to resume experience collection... (44150 times) [2024-06-28 09:24:19,492][06909] InferenceWorker_p0-w0: stopping experience collection (44150 times) [2024-06-28 09:24:19,492][06909] InferenceWorker_p0-w0: resuming experience collection (44150 times) [2024-06-28 09:24:20,282][06909] Updated weights for policy 0, policy_version 195933 (0.0021) [2024-06-28 09:24:23,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3210313728. Throughput: 0: 43809.2. Samples: 3113276580. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 09:24:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:24:24,179][06909] Updated weights for policy 0, policy_version 195943 (0.0027) [2024-06-28 09:24:27,498][06909] Updated weights for policy 0, policy_version 195953 (0.0035) [2024-06-28 09:24:28,850][06674] Fps is (10 sec: 44249.9, 60 sec: 43691.4, 300 sec: 43987.2). Total num frames: 3210526720. Throughput: 0: 43802.2. Samples: 3113415400. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 09:24:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:24:31,394][06909] Updated weights for policy 0, policy_version 195963 (0.0033) [2024-06-28 09:24:33,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3210756096. Throughput: 0: 43956.2. Samples: 3113681980. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 09:24:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:24:35,148][06909] Updated weights for policy 0, policy_version 195973 (0.0028) [2024-06-28 09:24:38,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.6, 300 sec: 43987.2). Total num frames: 3210969088. Throughput: 0: 43981.8. Samples: 3113942880. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 09:24:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:24:38,893][06909] Updated weights for policy 0, policy_version 195983 (0.0036) [2024-06-28 09:24:42,171][06909] Updated weights for policy 0, policy_version 195993 (0.0023) [2024-06-28 09:24:43,852][06674] Fps is (10 sec: 45865.9, 60 sec: 44235.2, 300 sec: 44042.4). Total num frames: 3211214848. Throughput: 0: 44052.6. Samples: 3114079980. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 09:24:43,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 09:24:46,027][06909] Updated weights for policy 0, policy_version 196003 (0.0036) [2024-06-28 09:24:48,850][06674] Fps is (10 sec: 45874.2, 60 sec: 44238.2, 300 sec: 44042.5). Total num frames: 3211427840. Throughput: 0: 44138.1. Samples: 3114348560. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 09:24:48,856][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:24:49,717][06909] Updated weights for policy 0, policy_version 196013 (0.0034) [2024-06-28 09:24:53,502][06909] Updated weights for policy 0, policy_version 196023 (0.0026) [2024-06-28 09:24:53,850][06674] Fps is (10 sec: 42607.1, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3211640832. Throughput: 0: 44077.8. Samples: 3114605300. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 09:24:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:24:57,546][06909] Updated weights for policy 0, policy_version 196033 (0.0026) [2024-06-28 09:24:58,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3211853824. Throughput: 0: 44035.1. Samples: 3114738320. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 09:24:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 09:25:01,223][06909] Updated weights for policy 0, policy_version 196043 (0.0037) [2024-06-28 09:25:03,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 3212066816. Throughput: 0: 44106.0. Samples: 3115003680. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 09:25:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:25:04,999][06909] Updated weights for policy 0, policy_version 196053 (0.0026) [2024-06-28 09:25:08,420][06909] Updated weights for policy 0, policy_version 196063 (0.0040) [2024-06-28 09:25:08,852][06674] Fps is (10 sec: 44227.8, 60 sec: 43689.2, 300 sec: 44042.4). Total num frames: 3212296192. Throughput: 0: 44129.1. Samples: 3115262480. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 09:25:08,852][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 09:25:12,181][06909] Updated weights for policy 0, policy_version 196073 (0.0032) [2024-06-28 09:25:13,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 3212509184. Throughput: 0: 43974.7. Samples: 3115394260. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 09:25:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:25:16,046][06909] Updated weights for policy 0, policy_version 196083 (0.0033) [2024-06-28 09:25:18,852][06674] Fps is (10 sec: 44236.7, 60 sec: 44237.4, 300 sec: 44042.1). Total num frames: 3212738560. Throughput: 0: 43822.0. Samples: 3115654060. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 09:25:18,853][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:25:19,586][06909] Updated weights for policy 0, policy_version 196093 (0.0025) [2024-06-28 09:25:23,407][06909] Updated weights for policy 0, policy_version 196103 (0.0031) [2024-06-28 09:25:23,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3212951552. Throughput: 0: 43778.7. Samples: 3115912920. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 09:25:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:25:27,167][06909] Updated weights for policy 0, policy_version 196113 (0.0035) [2024-06-28 09:25:28,850][06674] Fps is (10 sec: 44245.9, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3213180928. Throughput: 0: 43759.3. Samples: 3116049060. Policy #0 lag: (min: 1.0, avg: 9.1, max: 20.0) [2024-06-28 09:25:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:25:31,171][06909] Updated weights for policy 0, policy_version 196123 (0.0032) [2024-06-28 09:25:33,850][06674] Fps is (10 sec: 44235.9, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3213393920. Throughput: 0: 43721.8. Samples: 3116316040. Policy #0 lag: (min: 1.0, avg: 9.1, max: 20.0) [2024-06-28 09:25:33,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:25:34,840][06909] Updated weights for policy 0, policy_version 196133 (0.0037) [2024-06-28 09:25:38,327][06909] Updated weights for policy 0, policy_version 196143 (0.0032) [2024-06-28 09:25:38,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 3213606912. Throughput: 0: 43815.0. Samples: 3116576980. Policy #0 lag: (min: 1.0, avg: 9.1, max: 20.0) [2024-06-28 09:25:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:25:42,215][06909] Updated weights for policy 0, policy_version 196153 (0.0038) [2024-06-28 09:25:43,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43965.2, 300 sec: 44042.4). Total num frames: 3213852672. Throughput: 0: 43923.6. Samples: 3116714880. Policy #0 lag: (min: 1.0, avg: 9.1, max: 20.0) [2024-06-28 09:25:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:25:45,790][06909] Updated weights for policy 0, policy_version 196163 (0.0042) [2024-06-28 09:25:48,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43690.8, 300 sec: 43986.9). Total num frames: 3214049280. Throughput: 0: 43904.8. Samples: 3116979400. Policy #0 lag: (min: 1.0, avg: 9.1, max: 20.0) [2024-06-28 09:25:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:25:48,904][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000196171_3214065664.pth... [2024-06-28 09:25:48,944][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000195526_3203497984.pth [2024-06-28 09:25:49,334][06887] Signal inference workers to stop experience collection... (44200 times) [2024-06-28 09:25:49,343][06887] Signal inference workers to resume experience collection... (44200 times) [2024-06-28 09:25:49,360][06909] InferenceWorker_p0-w0: stopping experience collection (44200 times) [2024-06-28 09:25:49,360][06909] InferenceWorker_p0-w0: resuming experience collection (44200 times) [2024-06-28 09:25:49,509][06909] Updated weights for policy 0, policy_version 196173 (0.0028) [2024-06-28 09:25:53,230][06909] Updated weights for policy 0, policy_version 196183 (0.0026) [2024-06-28 09:25:53,852][06674] Fps is (10 sec: 40951.6, 60 sec: 43689.2, 300 sec: 43986.6). Total num frames: 3214262272. Throughput: 0: 43823.1. Samples: 3117234520. Policy #0 lag: (min: 1.0, avg: 9.1, max: 20.0) [2024-06-28 09:25:53,853][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:25:57,050][06909] Updated weights for policy 0, policy_version 196193 (0.0029) [2024-06-28 09:25:58,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 43987.2). Total num frames: 3214491648. Throughput: 0: 43941.2. Samples: 3117371620. Policy #0 lag: (min: 1.0, avg: 9.1, max: 20.0) [2024-06-28 09:25:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:26:00,772][06909] Updated weights for policy 0, policy_version 196203 (0.0030) [2024-06-28 09:26:03,852][06674] Fps is (10 sec: 44236.8, 60 sec: 43962.2, 300 sec: 43986.6). Total num frames: 3214704640. Throughput: 0: 44090.2. Samples: 3117638120. Policy #0 lag: (min: 1.0, avg: 9.1, max: 20.0) [2024-06-28 09:26:03,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 09:26:04,254][06909] Updated weights for policy 0, policy_version 196213 (0.0047) [2024-06-28 09:26:08,555][06909] Updated weights for policy 0, policy_version 196223 (0.0028) [2024-06-28 09:26:08,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43965.2, 300 sec: 43986.9). Total num frames: 3214934016. Throughput: 0: 44226.5. Samples: 3117903120. Policy #0 lag: (min: 1.0, avg: 9.1, max: 20.0) [2024-06-28 09:26:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:26:11,958][06909] Updated weights for policy 0, policy_version 196233 (0.0036) [2024-06-28 09:26:13,850][06674] Fps is (10 sec: 45884.7, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 3215163392. Throughput: 0: 44098.7. Samples: 3118033500. Policy #0 lag: (min: 1.0, avg: 9.1, max: 20.0) [2024-06-28 09:26:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:26:15,826][06909] Updated weights for policy 0, policy_version 196243 (0.0029) [2024-06-28 09:26:18,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43965.1, 300 sec: 44042.7). Total num frames: 3215376384. Throughput: 0: 44074.2. Samples: 3118299380. Policy #0 lag: (min: 1.0, avg: 9.1, max: 20.0) [2024-06-28 09:26:18,856][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:26:19,397][06909] Updated weights for policy 0, policy_version 196253 (0.0032) [2024-06-28 09:26:23,303][06909] Updated weights for policy 0, policy_version 196263 (0.0028) [2024-06-28 09:26:23,850][06674] Fps is (10 sec: 40959.4, 60 sec: 43690.5, 300 sec: 43931.3). Total num frames: 3215572992. Throughput: 0: 43980.8. Samples: 3118556120. Policy #0 lag: (min: 1.0, avg: 9.1, max: 20.0) [2024-06-28 09:26:23,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:26:26,953][06909] Updated weights for policy 0, policy_version 196273 (0.0024) [2024-06-28 09:26:28,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3215818752. Throughput: 0: 43803.5. Samples: 3118686040. Policy #0 lag: (min: 1.0, avg: 9.1, max: 20.0) [2024-06-28 09:26:28,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 09:26:30,434][06909] Updated weights for policy 0, policy_version 196283 (0.0042) [2024-06-28 09:26:33,850][06674] Fps is (10 sec: 44237.7, 60 sec: 43690.8, 300 sec: 43931.3). Total num frames: 3216015360. Throughput: 0: 43956.5. Samples: 3118957440. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 09:26:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 09:26:34,355][06909] Updated weights for policy 0, policy_version 196293 (0.0024) [2024-06-28 09:26:38,373][06909] Updated weights for policy 0, policy_version 196303 (0.0022) [2024-06-28 09:26:38,850][06674] Fps is (10 sec: 44237.3, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 3216261120. Throughput: 0: 44234.9. Samples: 3119225000. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 09:26:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:26:41,712][06909] Updated weights for policy 0, policy_version 196313 (0.0032) [2024-06-28 09:26:43,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43690.7, 300 sec: 44043.3). Total num frames: 3216474112. Throughput: 0: 44130.7. Samples: 3119357500. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 09:26:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:26:45,469][06909] Updated weights for policy 0, policy_version 196323 (0.0031) [2024-06-28 09:26:48,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3216703488. Throughput: 0: 44192.3. Samples: 3119626680. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 09:26:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:26:49,030][06909] Updated weights for policy 0, policy_version 196333 (0.0027) [2024-06-28 09:26:52,637][06909] Updated weights for policy 0, policy_version 196343 (0.0033) [2024-06-28 09:26:53,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43965.3, 300 sec: 43931.4). Total num frames: 3216900096. Throughput: 0: 44088.2. Samples: 3119887080. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 09:26:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:26:56,659][06909] Updated weights for policy 0, policy_version 196353 (0.0023) [2024-06-28 09:26:58,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3217145856. Throughput: 0: 44164.0. Samples: 3120020880. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 09:26:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:27:00,164][06909] Updated weights for policy 0, policy_version 196363 (0.0036) [2024-06-28 09:27:03,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44238.4, 300 sec: 43986.9). Total num frames: 3217358848. Throughput: 0: 44042.4. Samples: 3120281280. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 09:27:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 09:27:04,062][06909] Updated weights for policy 0, policy_version 196373 (0.0031) [2024-06-28 09:27:07,601][06909] Updated weights for policy 0, policy_version 196383 (0.0028) [2024-06-28 09:27:08,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 3217555456. Throughput: 0: 44136.5. Samples: 3120542260. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 09:27:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:27:11,360][06909] Updated weights for policy 0, policy_version 196393 (0.0030) [2024-06-28 09:27:13,852][06674] Fps is (10 sec: 45865.4, 60 sec: 44235.3, 300 sec: 44097.6). Total num frames: 3217817600. Throughput: 0: 44160.7. Samples: 3120673360. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 09:27:13,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:27:15,332][06909] Updated weights for policy 0, policy_version 196403 (0.0024) [2024-06-28 09:27:18,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.8, 300 sec: 43986.8). Total num frames: 3218014208. Throughput: 0: 44016.3. Samples: 3120938180. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 09:27:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:27:19,018][06909] Updated weights for policy 0, policy_version 196413 (0.0031) [2024-06-28 09:27:22,583][06909] Updated weights for policy 0, policy_version 196423 (0.0032) [2024-06-28 09:27:23,850][06674] Fps is (10 sec: 39330.1, 60 sec: 43963.9, 300 sec: 43931.3). Total num frames: 3218210816. Throughput: 0: 43948.5. Samples: 3121202680. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 09:27:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:27:26,600][06909] Updated weights for policy 0, policy_version 196433 (0.0028) [2024-06-28 09:27:28,851][06674] Fps is (10 sec: 45869.7, 60 sec: 44235.9, 300 sec: 44042.2). Total num frames: 3218472960. Throughput: 0: 43974.2. Samples: 3121336400. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 09:27:28,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:27:29,721][06909] Updated weights for policy 0, policy_version 196443 (0.0036) [2024-06-28 09:27:33,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 3218669568. Throughput: 0: 43853.8. Samples: 3121600100. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 09:27:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:27:33,949][06909] Updated weights for policy 0, policy_version 196453 (0.0040) [2024-06-28 09:27:37,740][06909] Updated weights for policy 0, policy_version 196463 (0.0034) [2024-06-28 09:27:37,742][06887] Signal inference workers to stop experience collection... (44250 times) [2024-06-28 09:27:37,742][06887] Signal inference workers to resume experience collection... (44250 times) [2024-06-28 09:27:37,768][06909] InferenceWorker_p0-w0: stopping experience collection (44250 times) [2024-06-28 09:27:37,768][06909] InferenceWorker_p0-w0: resuming experience collection (44250 times) [2024-06-28 09:27:38,850][06674] Fps is (10 sec: 40965.7, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 3218882560. Throughput: 0: 43987.1. Samples: 3121866500. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 09:27:38,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 09:27:41,417][06909] Updated weights for policy 0, policy_version 196473 (0.0026) [2024-06-28 09:27:43,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3219128320. Throughput: 0: 43907.2. Samples: 3121996700. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 09:27:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:27:44,957][06909] Updated weights for policy 0, policy_version 196483 (0.0036) [2024-06-28 09:27:48,781][06909] Updated weights for policy 0, policy_version 196493 (0.0022) [2024-06-28 09:27:48,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3219341312. Throughput: 0: 43887.9. Samples: 3122256240. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 09:27:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:27:48,870][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000196493_3219341312.pth... [2024-06-28 09:27:48,930][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000195848_3208773632.pth [2024-06-28 09:27:52,159][06909] Updated weights for policy 0, policy_version 196503 (0.0036) [2024-06-28 09:27:53,850][06674] Fps is (10 sec: 42597.6, 60 sec: 44236.6, 300 sec: 43986.8). Total num frames: 3219554304. Throughput: 0: 44020.4. Samples: 3122523180. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 09:27:53,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 09:27:56,099][06909] Updated weights for policy 0, policy_version 196513 (0.0037) [2024-06-28 09:27:58,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3219800064. Throughput: 0: 44163.3. Samples: 3122660620. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 09:27:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:27:59,576][06909] Updated weights for policy 0, policy_version 196523 (0.0032) [2024-06-28 09:28:03,613][06909] Updated weights for policy 0, policy_version 196533 (0.0037) [2024-06-28 09:28:03,850][06674] Fps is (10 sec: 44237.6, 60 sec: 43963.7, 300 sec: 43931.4). Total num frames: 3219996672. Throughput: 0: 44060.2. Samples: 3122920880. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 09:28:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:28:06,801][06909] Updated weights for policy 0, policy_version 196543 (0.0030) [2024-06-28 09:28:08,852][06674] Fps is (10 sec: 40952.0, 60 sec: 44235.3, 300 sec: 43986.6). Total num frames: 3220209664. Throughput: 0: 44218.3. Samples: 3123192600. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 09:28:08,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:28:10,803][06909] Updated weights for policy 0, policy_version 196553 (0.0035) [2024-06-28 09:28:13,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43692.2, 300 sec: 43986.9). Total num frames: 3220439040. Throughput: 0: 44026.2. Samples: 3123317520. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 09:28:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:28:14,592][06909] Updated weights for policy 0, policy_version 196563 (0.0034) [2024-06-28 09:28:18,478][06909] Updated weights for policy 0, policy_version 196573 (0.0027) [2024-06-28 09:28:18,850][06674] Fps is (10 sec: 45884.3, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3220668416. Throughput: 0: 43937.2. Samples: 3123577280. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 09:28:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:28:22,071][06909] Updated weights for policy 0, policy_version 196583 (0.0027) [2024-06-28 09:28:23,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44236.8, 300 sec: 43931.5). Total num frames: 3220865024. Throughput: 0: 43991.5. Samples: 3123846120. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 09:28:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:28:25,707][06909] Updated weights for policy 0, policy_version 196593 (0.0041) [2024-06-28 09:28:28,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43691.6, 300 sec: 43986.9). Total num frames: 3221094400. Throughput: 0: 43903.9. Samples: 3123972380. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 09:28:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:28:29,754][06909] Updated weights for policy 0, policy_version 196603 (0.0030) [2024-06-28 09:28:33,294][06909] Updated weights for policy 0, policy_version 196613 (0.0026) [2024-06-28 09:28:33,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 3221340160. Throughput: 0: 44010.7. Samples: 3124236720. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 09:28:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:28:37,099][06909] Updated weights for policy 0, policy_version 196623 (0.0030) [2024-06-28 09:28:38,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.7, 300 sec: 43986.8). Total num frames: 3221536768. Throughput: 0: 44103.6. Samples: 3124507840. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 09:28:38,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:28:40,793][06909] Updated weights for policy 0, policy_version 196633 (0.0045) [2024-06-28 09:28:43,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.8, 300 sec: 44098.3). Total num frames: 3221782528. Throughput: 0: 43928.1. Samples: 3124637380. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 09:28:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:28:44,279][06909] Updated weights for policy 0, policy_version 196643 (0.0026) [2024-06-28 09:28:48,381][06909] Updated weights for policy 0, policy_version 196653 (0.0023) [2024-06-28 09:28:48,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3221995520. Throughput: 0: 43953.2. Samples: 3124898780. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 09:28:48,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 09:28:51,553][06909] Updated weights for policy 0, policy_version 196663 (0.0044) [2024-06-28 09:28:53,850][06674] Fps is (10 sec: 40958.8, 60 sec: 43963.7, 300 sec: 43986.8). Total num frames: 3222192128. Throughput: 0: 43930.2. Samples: 3125169380. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 09:28:53,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 09:28:55,894][06909] Updated weights for policy 0, policy_version 196673 (0.0029) [2024-06-28 09:28:58,632][06887] Signal inference workers to stop experience collection... (44300 times) [2024-06-28 09:28:58,632][06887] Signal inference workers to resume experience collection... (44300 times) [2024-06-28 09:28:58,645][06909] InferenceWorker_p0-w0: stopping experience collection (44300 times) [2024-06-28 09:28:58,646][06909] InferenceWorker_p0-w0: resuming experience collection (44300 times) [2024-06-28 09:28:58,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 3222437888. Throughput: 0: 43917.4. Samples: 3125293800. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 09:28:58,850][06674] Avg episode reward: [(0, '0.417')] [2024-06-28 09:28:59,922][06909] Updated weights for policy 0, policy_version 196683 (0.0026) [2024-06-28 09:29:03,082][06909] Updated weights for policy 0, policy_version 196693 (0.0043) [2024-06-28 09:29:03,850][06674] Fps is (10 sec: 45876.1, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 3222650880. Throughput: 0: 44098.7. Samples: 3125561720. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 09:29:03,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:29:07,070][06909] Updated weights for policy 0, policy_version 196703 (0.0033) [2024-06-28 09:29:08,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44238.3, 300 sec: 43986.9). Total num frames: 3222863872. Throughput: 0: 43976.9. Samples: 3125825080. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 09:29:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:29:10,312][06909] Updated weights for policy 0, policy_version 196713 (0.0037) [2024-06-28 09:29:13,850][06674] Fps is (10 sec: 44237.3, 60 sec: 44236.9, 300 sec: 44098.4). Total num frames: 3223093248. Throughput: 0: 44148.6. Samples: 3125959060. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 09:29:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:29:14,271][06909] Updated weights for policy 0, policy_version 196723 (0.0039) [2024-06-28 09:29:17,809][06909] Updated weights for policy 0, policy_version 196733 (0.0030) [2024-06-28 09:29:18,852][06674] Fps is (10 sec: 44227.6, 60 sec: 43962.3, 300 sec: 44042.1). Total num frames: 3223306240. Throughput: 0: 44043.8. Samples: 3126218780. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 09:29:18,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:29:21,447][06909] Updated weights for policy 0, policy_version 196743 (0.0043) [2024-06-28 09:29:23,850][06674] Fps is (10 sec: 44236.2, 60 sec: 44509.8, 300 sec: 44097.9). Total num frames: 3223535616. Throughput: 0: 43905.0. Samples: 3126483560. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 09:29:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:29:25,464][06909] Updated weights for policy 0, policy_version 196753 (0.0039) [2024-06-28 09:29:28,701][06909] Updated weights for policy 0, policy_version 196763 (0.0039) [2024-06-28 09:29:28,850][06674] Fps is (10 sec: 45884.5, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 3223764992. Throughput: 0: 43949.7. Samples: 3126615120. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 09:29:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:29:32,847][06909] Updated weights for policy 0, policy_version 196773 (0.0035) [2024-06-28 09:29:33,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 3223961600. Throughput: 0: 44032.6. Samples: 3126880240. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 09:29:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:29:36,506][06909] Updated weights for policy 0, policy_version 196783 (0.0053) [2024-06-28 09:29:38,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43963.9, 300 sec: 43931.7). Total num frames: 3224174592. Throughput: 0: 43928.3. Samples: 3127146140. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 09:29:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:29:40,066][06909] Updated weights for policy 0, policy_version 196793 (0.0036) [2024-06-28 09:29:43,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 3224403968. Throughput: 0: 44186.6. Samples: 3127282200. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 09:29:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:29:44,211][06909] Updated weights for policy 0, policy_version 196803 (0.0036) [2024-06-28 09:29:47,790][06909] Updated weights for policy 0, policy_version 196813 (0.0023) [2024-06-28 09:29:48,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3224616960. Throughput: 0: 44084.0. Samples: 3127545500. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 09:29:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:29:48,964][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000196816_3224633344.pth... [2024-06-28 09:29:49,035][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000196171_3214065664.pth [2024-06-28 09:29:51,640][06909] Updated weights for policy 0, policy_version 196823 (0.0041) [2024-06-28 09:29:53,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.9, 300 sec: 43986.9). Total num frames: 3224829952. Throughput: 0: 44008.5. Samples: 3127805460. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 09:29:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:29:55,297][06909] Updated weights for policy 0, policy_version 196833 (0.0035) [2024-06-28 09:29:58,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 3225059328. Throughput: 0: 44048.9. Samples: 3127941260. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 09:29:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:29:58,951][06909] Updated weights for policy 0, policy_version 196843 (0.0035) [2024-06-28 09:30:02,486][06909] Updated weights for policy 0, policy_version 196853 (0.0037) [2024-06-28 09:30:03,852][06674] Fps is (10 sec: 44227.7, 60 sec: 43689.2, 300 sec: 43986.9). Total num frames: 3225272320. Throughput: 0: 44147.6. Samples: 3128205420. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 09:30:03,853][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 09:30:06,590][06909] Updated weights for policy 0, policy_version 196863 (0.0025) [2024-06-28 09:30:08,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3225501696. Throughput: 0: 44236.0. Samples: 3128474180. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 09:30:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:30:10,097][06909] Updated weights for policy 0, policy_version 196873 (0.0036) [2024-06-28 09:30:13,850][06674] Fps is (10 sec: 44245.9, 60 sec: 43690.6, 300 sec: 43987.2). Total num frames: 3225714688. Throughput: 0: 44218.7. Samples: 3128604960. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 09:30:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:30:14,005][06909] Updated weights for policy 0, policy_version 196883 (0.0035) [2024-06-28 09:30:17,180][06909] Updated weights for policy 0, policy_version 196893 (0.0042) [2024-06-28 09:30:18,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44238.3, 300 sec: 44097.9). Total num frames: 3225960448. Throughput: 0: 44291.5. Samples: 3128873360. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 09:30:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:30:21,233][06909] Updated weights for policy 0, policy_version 196903 (0.0039) [2024-06-28 09:30:23,850][06674] Fps is (10 sec: 44235.9, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 3226157056. Throughput: 0: 44319.3. Samples: 3129140520. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 09:30:23,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:30:24,886][06909] Updated weights for policy 0, policy_version 196913 (0.0031) [2024-06-28 09:30:28,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 3226386432. Throughput: 0: 44077.3. Samples: 3129265680. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 09:30:28,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:30:28,853][06909] Updated weights for policy 0, policy_version 196923 (0.0031) [2024-06-28 09:30:31,074][06887] Signal inference workers to stop experience collection... (44350 times) [2024-06-28 09:30:31,108][06909] InferenceWorker_p0-w0: stopping experience collection (44350 times) [2024-06-28 09:30:31,139][06887] Signal inference workers to resume experience collection... (44350 times) [2024-06-28 09:30:31,142][06909] InferenceWorker_p0-w0: resuming experience collection (44350 times) [2024-06-28 09:30:32,200][06909] Updated weights for policy 0, policy_version 196933 (0.0020) [2024-06-28 09:30:33,850][06674] Fps is (10 sec: 45876.0, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 3226615808. Throughput: 0: 44172.5. Samples: 3129533260. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 09:30:33,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 09:30:36,059][06909] Updated weights for policy 0, policy_version 196943 (0.0029) [2024-06-28 09:30:38,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44782.8, 300 sec: 44097.9). Total num frames: 3226861568. Throughput: 0: 44323.5. Samples: 3129800020. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 09:30:38,859][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 09:30:39,650][06909] Updated weights for policy 0, policy_version 196953 (0.0029) [2024-06-28 09:30:43,291][06909] Updated weights for policy 0, policy_version 196963 (0.0035) [2024-06-28 09:30:43,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3227041792. Throughput: 0: 44331.1. Samples: 3129936160. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 09:30:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:30:47,151][06909] Updated weights for policy 0, policy_version 196973 (0.0040) [2024-06-28 09:30:48,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44783.0, 300 sec: 44209.3). Total num frames: 3227303936. Throughput: 0: 44344.2. Samples: 3130200820. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 09:30:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:30:50,755][06909] Updated weights for policy 0, policy_version 196983 (0.0031) [2024-06-28 09:30:53,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44509.8, 300 sec: 44098.0). Total num frames: 3227500544. Throughput: 0: 44359.1. Samples: 3130470340. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 09:30:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:30:54,474][06909] Updated weights for policy 0, policy_version 196993 (0.0036) [2024-06-28 09:30:58,044][06909] Updated weights for policy 0, policy_version 197003 (0.0037) [2024-06-28 09:30:58,858][06674] Fps is (10 sec: 42562.6, 60 sec: 44503.6, 300 sec: 44152.5). Total num frames: 3227729920. Throughput: 0: 44250.4. Samples: 3130596600. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 09:30:58,859][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:31:02,058][06909] Updated weights for policy 0, policy_version 197013 (0.0029) [2024-06-28 09:31:03,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44511.4, 300 sec: 44098.0). Total num frames: 3227942912. Throughput: 0: 44305.3. Samples: 3130867100. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 09:31:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:31:05,264][06909] Updated weights for policy 0, policy_version 197023 (0.0039) [2024-06-28 09:31:08,852][06674] Fps is (10 sec: 44264.8, 60 sec: 44508.4, 300 sec: 44097.7). Total num frames: 3228172288. Throughput: 0: 44325.7. Samples: 3131135260. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 09:31:08,852][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 09:31:09,231][06909] Updated weights for policy 0, policy_version 197033 (0.0024) [2024-06-28 09:31:12,990][06909] Updated weights for policy 0, policy_version 197043 (0.0034) [2024-06-28 09:31:13,850][06674] Fps is (10 sec: 44236.2, 60 sec: 44509.7, 300 sec: 44098.0). Total num frames: 3228385280. Throughput: 0: 44312.3. Samples: 3131259740. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 09:31:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 09:31:16,528][06909] Updated weights for policy 0, policy_version 197053 (0.0036) [2024-06-28 09:31:18,850][06674] Fps is (10 sec: 42607.3, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 3228598272. Throughput: 0: 44490.7. Samples: 3131535340. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 09:31:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:31:20,181][06909] Updated weights for policy 0, policy_version 197063 (0.0043) [2024-06-28 09:31:23,850][06674] Fps is (10 sec: 42599.2, 60 sec: 44237.0, 300 sec: 44042.4). Total num frames: 3228811264. Throughput: 0: 44381.0. Samples: 3131797160. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 09:31:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:31:24,266][06909] Updated weights for policy 0, policy_version 197073 (0.0032) [2024-06-28 09:31:27,705][06909] Updated weights for policy 0, policy_version 197083 (0.0032) [2024-06-28 09:31:28,850][06674] Fps is (10 sec: 45874.2, 60 sec: 44509.8, 300 sec: 44209.0). Total num frames: 3229057024. Throughput: 0: 44149.1. Samples: 3131922880. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 09:31:28,859][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 09:31:31,661][06909] Updated weights for policy 0, policy_version 197093 (0.0021) [2024-06-28 09:31:33,850][06674] Fps is (10 sec: 47513.2, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 3229286400. Throughput: 0: 44411.5. Samples: 3132199340. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 09:31:33,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 09:31:34,970][06909] Updated weights for policy 0, policy_version 197103 (0.0031) [2024-06-28 09:31:38,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43690.7, 300 sec: 44097.9). Total num frames: 3229483008. Throughput: 0: 44237.3. Samples: 3132461020. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 09:31:38,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-28 09:31:38,942][06909] Updated weights for policy 0, policy_version 197113 (0.0037) [2024-06-28 09:31:42,230][06909] Updated weights for policy 0, policy_version 197123 (0.0024) [2024-06-28 09:31:43,850][06674] Fps is (10 sec: 42598.2, 60 sec: 44509.8, 300 sec: 44097.9). Total num frames: 3229712384. Throughput: 0: 44213.9. Samples: 3132585860. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 09:31:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:31:46,070][06909] Updated weights for policy 0, policy_version 197133 (0.0028) [2024-06-28 09:31:48,850][06674] Fps is (10 sec: 47513.9, 60 sec: 44236.8, 300 sec: 44264.6). Total num frames: 3229958144. Throughput: 0: 44267.6. Samples: 3132859140. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 09:31:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 09:31:48,916][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000197142_3229974528.pth... [2024-06-28 09:31:48,972][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000196493_3219341312.pth [2024-06-28 09:31:49,839][06909] Updated weights for policy 0, policy_version 197143 (0.0025) [2024-06-28 09:31:53,852][06674] Fps is (10 sec: 42590.1, 60 sec: 43962.3, 300 sec: 44042.1). Total num frames: 3230138368. Throughput: 0: 43918.2. Samples: 3133111580. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 09:31:53,853][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:31:54,241][06909] Updated weights for policy 0, policy_version 197153 (0.0039) [2024-06-28 09:31:55,143][06887] Signal inference workers to stop experience collection... (44400 times) [2024-06-28 09:31:55,195][06887] Signal inference workers to resume experience collection... (44400 times) [2024-06-28 09:31:55,195][06909] InferenceWorker_p0-w0: stopping experience collection (44400 times) [2024-06-28 09:31:55,212][06909] InferenceWorker_p0-w0: resuming experience collection (44400 times) [2024-06-28 09:31:57,062][06909] Updated weights for policy 0, policy_version 197163 (0.0044) [2024-06-28 09:31:58,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44243.0, 300 sec: 44153.5). Total num frames: 3230384128. Throughput: 0: 44109.0. Samples: 3133244640. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 09:31:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:32:01,447][06909] Updated weights for policy 0, policy_version 197173 (0.0022) [2024-06-28 09:32:03,850][06674] Fps is (10 sec: 47523.6, 60 sec: 44509.9, 300 sec: 44264.6). Total num frames: 3230613504. Throughput: 0: 44039.1. Samples: 3133517100. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 09:32:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:32:04,694][06909] Updated weights for policy 0, policy_version 197183 (0.0036) [2024-06-28 09:32:08,830][06909] Updated weights for policy 0, policy_version 197193 (0.0027) [2024-06-28 09:32:08,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43965.2, 300 sec: 44042.7). Total num frames: 3230810112. Throughput: 0: 44128.4. Samples: 3133782940. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 09:32:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:32:11,931][06909] Updated weights for policy 0, policy_version 197203 (0.0027) [2024-06-28 09:32:13,850][06674] Fps is (10 sec: 42597.8, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 3231039488. Throughput: 0: 44261.4. Samples: 3133914640. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 09:32:13,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:32:16,116][06909] Updated weights for policy 0, policy_version 197213 (0.0030) [2024-06-28 09:32:18,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 3231252480. Throughput: 0: 43955.2. Samples: 3134177320. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 09:32:18,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 09:32:19,425][06909] Updated weights for policy 0, policy_version 197223 (0.0042) [2024-06-28 09:32:23,688][06909] Updated weights for policy 0, policy_version 197233 (0.0036) [2024-06-28 09:32:23,850][06674] Fps is (10 sec: 42598.9, 60 sec: 44236.8, 300 sec: 44042.6). Total num frames: 3231465472. Throughput: 0: 43983.6. Samples: 3134440280. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 09:32:23,850][06674] Avg episode reward: [(0, '0.455')] [2024-06-28 09:32:27,055][06909] Updated weights for policy 0, policy_version 197243 (0.0027) [2024-06-28 09:32:28,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 3231694848. Throughput: 0: 44070.7. Samples: 3134569040. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 09:32:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:32:31,209][06909] Updated weights for policy 0, policy_version 197253 (0.0038) [2024-06-28 09:32:33,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 3231907840. Throughput: 0: 43812.9. Samples: 3134830720. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 09:32:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:32:34,688][06909] Updated weights for policy 0, policy_version 197263 (0.0036) [2024-06-28 09:32:38,681][06909] Updated weights for policy 0, policy_version 197273 (0.0038) [2024-06-28 09:32:38,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3232120832. Throughput: 0: 44126.0. Samples: 3135097160. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 09:32:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:32:42,121][06909] Updated weights for policy 0, policy_version 197283 (0.0030) [2024-06-28 09:32:43,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 3232350208. Throughput: 0: 44009.7. Samples: 3135225080. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 09:32:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:32:46,054][06909] Updated weights for policy 0, policy_version 197293 (0.0037) [2024-06-28 09:32:48,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43417.6, 300 sec: 44098.0). Total num frames: 3232563200. Throughput: 0: 43889.3. Samples: 3135492120. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 09:32:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:32:49,577][06909] Updated weights for policy 0, policy_version 197303 (0.0039) [2024-06-28 09:32:53,358][06909] Updated weights for policy 0, policy_version 197313 (0.0039) [2024-06-28 09:32:53,856][06674] Fps is (10 sec: 45847.3, 60 sec: 44506.8, 300 sec: 44097.1). Total num frames: 3232808960. Throughput: 0: 43909.6. Samples: 3135759140. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 09:32:53,857][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:32:56,845][06909] Updated weights for policy 0, policy_version 197323 (0.0034) [2024-06-28 09:32:58,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43417.6, 300 sec: 44042.4). Total num frames: 3232989184. Throughput: 0: 43777.9. Samples: 3135884640. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 09:32:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:33:00,565][06909] Updated weights for policy 0, policy_version 197333 (0.0035) [2024-06-28 09:33:03,850][06674] Fps is (10 sec: 40985.1, 60 sec: 43417.5, 300 sec: 44098.3). Total num frames: 3233218560. Throughput: 0: 43831.1. Samples: 3136149720. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 09:33:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:33:04,561][06909] Updated weights for policy 0, policy_version 197343 (0.0026) [2024-06-28 09:33:08,418][06909] Updated weights for policy 0, policy_version 197353 (0.0034) [2024-06-28 09:33:08,856][06674] Fps is (10 sec: 45846.8, 60 sec: 43959.2, 300 sec: 44097.0). Total num frames: 3233447936. Throughput: 0: 43915.7. Samples: 3136416760. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 09:33:08,857][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:33:11,909][06909] Updated weights for policy 0, policy_version 197363 (0.0024) [2024-06-28 09:33:13,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 3233660928. Throughput: 0: 43957.4. Samples: 3136547120. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 09:33:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:33:15,874][06909] Updated weights for policy 0, policy_version 197373 (0.0034) [2024-06-28 09:33:18,850][06674] Fps is (10 sec: 42624.5, 60 sec: 43690.6, 300 sec: 44097.9). Total num frames: 3233873920. Throughput: 0: 43971.9. Samples: 3136809460. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 09:33:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:33:19,086][06887] Signal inference workers to stop experience collection... (44450 times) [2024-06-28 09:33:19,086][06887] Signal inference workers to resume experience collection... (44450 times) [2024-06-28 09:33:19,145][06909] InferenceWorker_p0-w0: stopping experience collection (44450 times) [2024-06-28 09:33:19,145][06909] InferenceWorker_p0-w0: resuming experience collection (44450 times) [2024-06-28 09:33:19,218][06909] Updated weights for policy 0, policy_version 197383 (0.0027) [2024-06-28 09:33:23,383][06909] Updated weights for policy 0, policy_version 197393 (0.0033) [2024-06-28 09:33:23,850][06674] Fps is (10 sec: 45874.3, 60 sec: 44236.6, 300 sec: 44153.5). Total num frames: 3234119680. Throughput: 0: 43932.8. Samples: 3137074140. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 09:33:23,851][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 09:33:26,904][06909] Updated weights for policy 0, policy_version 197403 (0.0026) [2024-06-28 09:33:28,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 3234316288. Throughput: 0: 43977.3. Samples: 3137204060. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 09:33:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:33:30,678][06909] Updated weights for policy 0, policy_version 197413 (0.0027) [2024-06-28 09:33:33,850][06674] Fps is (10 sec: 42599.5, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 3234545664. Throughput: 0: 43989.8. Samples: 3137471660. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 09:33:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:33:34,146][06909] Updated weights for policy 0, policy_version 197423 (0.0033) [2024-06-28 09:33:38,125][06909] Updated weights for policy 0, policy_version 197433 (0.0036) [2024-06-28 09:33:38,850][06674] Fps is (10 sec: 45875.7, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 3234775040. Throughput: 0: 43987.8. Samples: 3137738320. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 09:33:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 09:33:41,873][06909] Updated weights for policy 0, policy_version 197443 (0.0031) [2024-06-28 09:33:43,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3234971648. Throughput: 0: 44104.9. Samples: 3137869360. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 09:33:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:33:45,484][06909] Updated weights for policy 0, policy_version 197453 (0.0029) [2024-06-28 09:33:48,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.7, 300 sec: 44042.5). Total num frames: 3235184640. Throughput: 0: 43988.5. Samples: 3138129200. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 09:33:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:33:48,935][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000197461_3235201024.pth... [2024-06-28 09:33:49,005][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000196816_3224633344.pth [2024-06-28 09:33:49,258][06909] Updated weights for policy 0, policy_version 197463 (0.0029) [2024-06-28 09:33:52,764][06909] Updated weights for policy 0, policy_version 197473 (0.0023) [2024-06-28 09:33:53,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43695.2, 300 sec: 44042.4). Total num frames: 3235430400. Throughput: 0: 43912.7. Samples: 3138392560. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 09:33:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:33:56,710][06909] Updated weights for policy 0, policy_version 197483 (0.0027) [2024-06-28 09:33:58,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3235627008. Throughput: 0: 44031.5. Samples: 3138528540. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 09:33:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:34:00,523][06909] Updated weights for policy 0, policy_version 197493 (0.0031) [2024-06-28 09:34:03,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 3235872768. Throughput: 0: 43903.1. Samples: 3138785100. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 09:34:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:34:04,132][06909] Updated weights for policy 0, policy_version 197503 (0.0035) [2024-06-28 09:34:07,867][06909] Updated weights for policy 0, policy_version 197513 (0.0030) [2024-06-28 09:34:08,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43968.2, 300 sec: 44042.4). Total num frames: 3236085760. Throughput: 0: 43829.0. Samples: 3139046440. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 09:34:08,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 09:34:11,800][06909] Updated weights for policy 0, policy_version 197523 (0.0040) [2024-06-28 09:34:13,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.8, 300 sec: 44042.7). Total num frames: 3236298752. Throughput: 0: 43950.3. Samples: 3139181820. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 09:34:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:34:15,443][06909] Updated weights for policy 0, policy_version 197533 (0.0029) [2024-06-28 09:34:18,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 3236495360. Throughput: 0: 43813.6. Samples: 3139443280. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 09:34:18,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 09:34:19,168][06909] Updated weights for policy 0, policy_version 197543 (0.0027) [2024-06-28 09:34:22,930][06909] Updated weights for policy 0, policy_version 197553 (0.0026) [2024-06-28 09:34:23,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43690.8, 300 sec: 43986.9). Total num frames: 3236741120. Throughput: 0: 43667.9. Samples: 3139703380. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 09:34:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:34:26,849][06909] Updated weights for policy 0, policy_version 197563 (0.0035) [2024-06-28 09:34:28,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43690.8, 300 sec: 43986.9). Total num frames: 3236937728. Throughput: 0: 43767.1. Samples: 3139838880. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 09:34:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:34:30,446][06909] Updated weights for policy 0, policy_version 197573 (0.0029) [2024-06-28 09:34:33,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 3237183488. Throughput: 0: 43836.0. Samples: 3140101820. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 09:34:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:34:34,123][06909] Updated weights for policy 0, policy_version 197583 (0.0028) [2024-06-28 09:34:37,841][06909] Updated weights for policy 0, policy_version 197593 (0.0031) [2024-06-28 09:34:37,926][06887] Signal inference workers to stop experience collection... (44500 times) [2024-06-28 09:34:37,962][06909] InferenceWorker_p0-w0: stopping experience collection (44500 times) [2024-06-28 09:34:37,985][06887] Signal inference workers to resume experience collection... (44500 times) [2024-06-28 09:34:37,992][06909] InferenceWorker_p0-w0: resuming experience collection (44500 times) [2024-06-28 09:34:38,850][06674] Fps is (10 sec: 49151.4, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 3237429248. Throughput: 0: 43807.4. Samples: 3140363900. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 09:34:38,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 09:34:41,409][06909] Updated weights for policy 0, policy_version 197603 (0.0021) [2024-06-28 09:34:43,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3237609472. Throughput: 0: 43821.8. Samples: 3140500520. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 09:34:43,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 09:34:44,972][06909] Updated weights for policy 0, policy_version 197613 (0.0031) [2024-06-28 09:34:48,850][06674] Fps is (10 sec: 40960.1, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 3237838848. Throughput: 0: 44026.2. Samples: 3140766280. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 09:34:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:34:48,926][06909] Updated weights for policy 0, policy_version 197623 (0.0025) [2024-06-28 09:34:52,400][06909] Updated weights for policy 0, policy_version 197633 (0.0030) [2024-06-28 09:34:53,852][06674] Fps is (10 sec: 45865.6, 60 sec: 43962.2, 300 sec: 44097.6). Total num frames: 3238068224. Throughput: 0: 44077.6. Samples: 3141030020. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 09:34:53,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:34:56,483][06909] Updated weights for policy 0, policy_version 197643 (0.0026) [2024-06-28 09:34:58,854][06674] Fps is (10 sec: 45858.1, 60 sec: 44507.1, 300 sec: 44153.2). Total num frames: 3238297600. Throughput: 0: 44099.4. Samples: 3141166460. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 09:34:58,854][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:35:00,085][06909] Updated weights for policy 0, policy_version 197653 (0.0031) [2024-06-28 09:35:03,850][06674] Fps is (10 sec: 42607.4, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 3238494208. Throughput: 0: 43974.4. Samples: 3141422120. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 09:35:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:35:03,908][06909] Updated weights for policy 0, policy_version 197663 (0.0028) [2024-06-28 09:35:07,683][06909] Updated weights for policy 0, policy_version 197673 (0.0036) [2024-06-28 09:35:08,854][06674] Fps is (10 sec: 44236.2, 60 sec: 44234.0, 300 sec: 44152.9). Total num frames: 3238739968. Throughput: 0: 44081.5. Samples: 3141687220. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 09:35:08,854][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:35:11,243][06909] Updated weights for policy 0, policy_version 197683 (0.0029) [2024-06-28 09:35:13,850][06674] Fps is (10 sec: 44235.5, 60 sec: 43963.5, 300 sec: 43986.8). Total num frames: 3238936576. Throughput: 0: 43974.8. Samples: 3141817760. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 09:35:13,855][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:35:14,990][06909] Updated weights for policy 0, policy_version 197693 (0.0031) [2024-06-28 09:35:18,850][06674] Fps is (10 sec: 40976.3, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 3239149568. Throughput: 0: 43908.0. Samples: 3142077680. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 09:35:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:35:18,953][06909] Updated weights for policy 0, policy_version 197703 (0.0028) [2024-06-28 09:35:22,368][06909] Updated weights for policy 0, policy_version 197713 (0.0029) [2024-06-28 09:35:23,850][06674] Fps is (10 sec: 45875.9, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 3239395328. Throughput: 0: 44028.9. Samples: 3142345200. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 09:35:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:35:26,202][06909] Updated weights for policy 0, policy_version 197723 (0.0035) [2024-06-28 09:35:28,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 3239608320. Throughput: 0: 43919.1. Samples: 3142476880. Policy #0 lag: (min: 1.0, avg: 8.7, max: 21.0) [2024-06-28 09:35:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:35:29,870][06909] Updated weights for policy 0, policy_version 197733 (0.0044) [2024-06-28 09:35:33,584][06909] Updated weights for policy 0, policy_version 197743 (0.0036) [2024-06-28 09:35:33,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 3239821312. Throughput: 0: 43806.2. Samples: 3142737560. Policy #0 lag: (min: 1.0, avg: 8.7, max: 21.0) [2024-06-28 09:35:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 09:35:37,884][06909] Updated weights for policy 0, policy_version 197753 (0.0034) [2024-06-28 09:35:38,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.7, 300 sec: 44097.9). Total num frames: 3240050688. Throughput: 0: 43768.2. Samples: 3142999500. Policy #0 lag: (min: 1.0, avg: 8.7, max: 21.0) [2024-06-28 09:35:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:35:41,050][06909] Updated weights for policy 0, policy_version 197763 (0.0042) [2024-06-28 09:35:43,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.7, 300 sec: 43931.3). Total num frames: 3240263680. Throughput: 0: 43710.2. Samples: 3143133260. Policy #0 lag: (min: 1.0, avg: 8.7, max: 21.0) [2024-06-28 09:35:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:35:45,147][06909] Updated weights for policy 0, policy_version 197773 (0.0039) [2024-06-28 09:35:48,298][06909] Updated weights for policy 0, policy_version 197783 (0.0034) [2024-06-28 09:35:48,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3240476672. Throughput: 0: 43823.0. Samples: 3143394160. Policy #0 lag: (min: 1.0, avg: 8.7, max: 21.0) [2024-06-28 09:35:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:35:48,868][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000197783_3240476672.pth... [2024-06-28 09:35:48,921][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000197142_3229974528.pth [2024-06-28 09:35:52,420][06909] Updated weights for policy 0, policy_version 197793 (0.0036) [2024-06-28 09:35:52,765][06887] Signal inference workers to stop experience collection... (44550 times) [2024-06-28 09:35:52,765][06887] Signal inference workers to resume experience collection... (44550 times) [2024-06-28 09:35:52,791][06909] InferenceWorker_p0-w0: stopping experience collection (44550 times) [2024-06-28 09:35:52,796][06909] InferenceWorker_p0-w0: resuming experience collection (44550 times) [2024-06-28 09:35:53,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43692.1, 300 sec: 43932.6). Total num frames: 3240689664. Throughput: 0: 43889.1. Samples: 3143662060. Policy #0 lag: (min: 1.0, avg: 8.7, max: 21.0) [2024-06-28 09:35:53,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:35:55,990][06909] Updated weights for policy 0, policy_version 197803 (0.0027) [2024-06-28 09:35:58,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43420.3, 300 sec: 43931.3). Total num frames: 3240902656. Throughput: 0: 43960.2. Samples: 3143795960. Policy #0 lag: (min: 1.0, avg: 8.7, max: 21.0) [2024-06-28 09:35:58,850][06674] Avg episode reward: [(0, '0.403')] [2024-06-28 09:35:59,587][06909] Updated weights for policy 0, policy_version 197813 (0.0046) [2024-06-28 09:36:03,152][06909] Updated weights for policy 0, policy_version 197823 (0.0034) [2024-06-28 09:36:03,852][06674] Fps is (10 sec: 44227.7, 60 sec: 43962.2, 300 sec: 43931.3). Total num frames: 3241132032. Throughput: 0: 43946.8. Samples: 3144055380. Policy #0 lag: (min: 1.0, avg: 8.7, max: 21.0) [2024-06-28 09:36:03,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:36:07,155][06909] Updated weights for policy 0, policy_version 197833 (0.0034) [2024-06-28 09:36:08,850][06674] Fps is (10 sec: 47513.1, 60 sec: 43966.5, 300 sec: 44042.4). Total num frames: 3241377792. Throughput: 0: 44044.0. Samples: 3144327180. Policy #0 lag: (min: 1.0, avg: 8.7, max: 21.0) [2024-06-28 09:36:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:36:10,741][06909] Updated weights for policy 0, policy_version 197843 (0.0038) [2024-06-28 09:36:13,850][06674] Fps is (10 sec: 44246.2, 60 sec: 43963.9, 300 sec: 43986.9). Total num frames: 3241574400. Throughput: 0: 44071.5. Samples: 3144460100. Policy #0 lag: (min: 1.0, avg: 8.7, max: 21.0) [2024-06-28 09:36:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:36:14,627][06909] Updated weights for policy 0, policy_version 197853 (0.0033) [2024-06-28 09:36:18,062][06909] Updated weights for policy 0, policy_version 197863 (0.0035) [2024-06-28 09:36:18,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3241787392. Throughput: 0: 43965.4. Samples: 3144716000. Policy #0 lag: (min: 1.0, avg: 8.7, max: 21.0) [2024-06-28 09:36:18,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 09:36:22,120][06909] Updated weights for policy 0, policy_version 197873 (0.0025) [2024-06-28 09:36:23,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3242033152. Throughput: 0: 44260.8. Samples: 3144991240. Policy #0 lag: (min: 1.0, avg: 8.7, max: 21.0) [2024-06-28 09:36:23,859][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:36:25,405][06909] Updated weights for policy 0, policy_version 197883 (0.0043) [2024-06-28 09:36:28,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43690.5, 300 sec: 43875.8). Total num frames: 3242229760. Throughput: 0: 44355.1. Samples: 3145129240. Policy #0 lag: (min: 1.0, avg: 8.7, max: 21.0) [2024-06-28 09:36:28,859][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 09:36:29,393][06909] Updated weights for policy 0, policy_version 197893 (0.0034) [2024-06-28 09:36:32,967][06909] Updated weights for policy 0, policy_version 197903 (0.0026) [2024-06-28 09:36:33,850][06674] Fps is (10 sec: 40960.7, 60 sec: 43690.7, 300 sec: 43931.4). Total num frames: 3242442752. Throughput: 0: 44099.6. Samples: 3145378640. Policy #0 lag: (min: 1.0, avg: 8.7, max: 21.0) [2024-06-28 09:36:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:36:36,741][06909] Updated weights for policy 0, policy_version 197913 (0.0034) [2024-06-28 09:36:38,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 3242704896. Throughput: 0: 44137.2. Samples: 3145648240. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 09:36:38,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:36:40,143][06909] Updated weights for policy 0, policy_version 197923 (0.0034) [2024-06-28 09:36:43,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 3242901504. Throughput: 0: 44153.7. Samples: 3145782880. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 09:36:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:36:44,490][06909] Updated weights for policy 0, policy_version 197933 (0.0032) [2024-06-28 09:36:47,944][06909] Updated weights for policy 0, policy_version 197943 (0.0041) [2024-06-28 09:36:48,850][06674] Fps is (10 sec: 40960.9, 60 sec: 43963.8, 300 sec: 43987.2). Total num frames: 3243114496. Throughput: 0: 44100.4. Samples: 3146039800. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 09:36:48,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 09:36:51,694][06909] Updated weights for policy 0, policy_version 197953 (0.0033) [2024-06-28 09:36:53,852][06674] Fps is (10 sec: 45866.8, 60 sec: 44508.5, 300 sec: 43986.6). Total num frames: 3243360256. Throughput: 0: 43992.0. Samples: 3146306900. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 09:36:53,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:36:55,183][06909] Updated weights for policy 0, policy_version 197963 (0.0036) [2024-06-28 09:36:58,850][06674] Fps is (10 sec: 45874.6, 60 sec: 44509.8, 300 sec: 43931.3). Total num frames: 3243573248. Throughput: 0: 44165.7. Samples: 3146447560. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 09:36:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:36:59,276][06909] Updated weights for policy 0, policy_version 197973 (0.0029) [2024-06-28 09:37:02,963][06909] Updated weights for policy 0, policy_version 197983 (0.0029) [2024-06-28 09:37:03,850][06674] Fps is (10 sec: 42606.8, 60 sec: 44238.4, 300 sec: 43986.9). Total num frames: 3243786240. Throughput: 0: 44169.0. Samples: 3146703600. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 09:37:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:37:06,702][06909] Updated weights for policy 0, policy_version 197993 (0.0039) [2024-06-28 09:37:08,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3244015616. Throughput: 0: 43809.4. Samples: 3146962660. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 09:37:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:37:10,205][06909] Updated weights for policy 0, policy_version 198003 (0.0025) [2024-06-28 09:37:13,850][06674] Fps is (10 sec: 44236.0, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 3244228608. Throughput: 0: 43937.8. Samples: 3147106440. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 09:37:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 09:37:14,170][06909] Updated weights for policy 0, policy_version 198013 (0.0033) [2024-06-28 09:37:17,502][06909] Updated weights for policy 0, policy_version 198023 (0.0027) [2024-06-28 09:37:18,852][06674] Fps is (10 sec: 42589.7, 60 sec: 44235.3, 300 sec: 43986.6). Total num frames: 3244441600. Throughput: 0: 44131.3. Samples: 3147364640. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 09:37:18,853][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:37:21,459][06909] Updated weights for policy 0, policy_version 198033 (0.0028) [2024-06-28 09:37:22,583][06887] Signal inference workers to stop experience collection... (44600 times) [2024-06-28 09:37:22,583][06887] Signal inference workers to resume experience collection... (44600 times) [2024-06-28 09:37:22,631][06909] InferenceWorker_p0-w0: stopping experience collection (44600 times) [2024-06-28 09:37:22,631][06909] InferenceWorker_p0-w0: resuming experience collection (44600 times) [2024-06-28 09:37:23,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3244670976. Throughput: 0: 43981.0. Samples: 3147627380. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 09:37:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:37:25,117][06909] Updated weights for policy 0, policy_version 198043 (0.0032) [2024-06-28 09:37:28,850][06674] Fps is (10 sec: 44244.7, 60 sec: 44236.7, 300 sec: 43986.8). Total num frames: 3244883968. Throughput: 0: 44007.3. Samples: 3147763220. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 09:37:28,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:37:28,998][06909] Updated weights for policy 0, policy_version 198053 (0.0029) [2024-06-28 09:37:32,950][06909] Updated weights for policy 0, policy_version 198063 (0.0031) [2024-06-28 09:37:33,850][06674] Fps is (10 sec: 42597.0, 60 sec: 44236.5, 300 sec: 43986.8). Total num frames: 3245096960. Throughput: 0: 44082.2. Samples: 3148023520. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 09:37:33,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:37:36,526][06909] Updated weights for policy 0, policy_version 198073 (0.0026) [2024-06-28 09:37:38,850][06674] Fps is (10 sec: 45876.4, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3245342720. Throughput: 0: 44041.8. Samples: 3148288700. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 09:37:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:37:40,277][06909] Updated weights for policy 0, policy_version 198083 (0.0033) [2024-06-28 09:37:43,850][06674] Fps is (10 sec: 44238.7, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3245539328. Throughput: 0: 43897.0. Samples: 3148422920. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 09:37:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 09:37:43,877][06909] Updated weights for policy 0, policy_version 198093 (0.0030) [2024-06-28 09:37:47,585][06909] Updated weights for policy 0, policy_version 198103 (0.0039) [2024-06-28 09:37:48,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43963.6, 300 sec: 43876.7). Total num frames: 3245752320. Throughput: 0: 44056.8. Samples: 3148686160. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 09:37:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:37:48,871][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000198105_3245752320.pth... [2024-06-28 09:37:48,939][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000197461_3235201024.pth [2024-06-28 09:37:51,478][06909] Updated weights for policy 0, policy_version 198113 (0.0034) [2024-06-28 09:37:53,852][06674] Fps is (10 sec: 45865.6, 60 sec: 43963.6, 300 sec: 44097.6). Total num frames: 3245998080. Throughput: 0: 44142.0. Samples: 3148949140. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 09:37:53,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:37:54,982][06909] Updated weights for policy 0, policy_version 198123 (0.0036) [2024-06-28 09:37:58,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3246194688. Throughput: 0: 44057.9. Samples: 3149089040. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 09:37:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:37:58,949][06909] Updated weights for policy 0, policy_version 198133 (0.0035) [2024-06-28 09:38:02,563][06909] Updated weights for policy 0, policy_version 198143 (0.0031) [2024-06-28 09:38:03,850][06674] Fps is (10 sec: 40967.5, 60 sec: 43690.5, 300 sec: 43932.2). Total num frames: 3246407680. Throughput: 0: 44101.8. Samples: 3149349140. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 09:38:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:38:06,305][06909] Updated weights for policy 0, policy_version 198153 (0.0038) [2024-06-28 09:38:08,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3246653440. Throughput: 0: 43979.7. Samples: 3149606460. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 09:38:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:38:10,028][06909] Updated weights for policy 0, policy_version 198163 (0.0024) [2024-06-28 09:38:13,665][06909] Updated weights for policy 0, policy_version 198173 (0.0032) [2024-06-28 09:38:13,850][06674] Fps is (10 sec: 45875.9, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3246866432. Throughput: 0: 44083.8. Samples: 3149746980. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 09:38:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:38:17,160][06909] Updated weights for policy 0, policy_version 198183 (0.0048) [2024-06-28 09:38:18,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43692.2, 300 sec: 43875.8). Total num frames: 3247063040. Throughput: 0: 44091.6. Samples: 3150007620. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 09:38:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:38:20,981][06909] Updated weights for policy 0, policy_version 198193 (0.0037) [2024-06-28 09:38:23,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3247308800. Throughput: 0: 43996.1. Samples: 3150268520. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 09:38:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:38:24,953][06909] Updated weights for policy 0, policy_version 198203 (0.0039) [2024-06-28 09:38:28,810][06909] Updated weights for policy 0, policy_version 198213 (0.0027) [2024-06-28 09:38:28,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43963.9, 300 sec: 43986.9). Total num frames: 3247521792. Throughput: 0: 44127.4. Samples: 3150408660. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 09:38:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:38:32,180][06909] Updated weights for policy 0, policy_version 198223 (0.0040) [2024-06-28 09:38:33,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43964.0, 300 sec: 43931.3). Total num frames: 3247734784. Throughput: 0: 44085.8. Samples: 3150670020. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 09:38:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 09:38:35,974][06909] Updated weights for policy 0, policy_version 198233 (0.0055) [2024-06-28 09:38:38,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 3247980544. Throughput: 0: 44038.4. Samples: 3150930780. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 09:38:38,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 09:38:39,567][06909] Updated weights for policy 0, policy_version 198243 (0.0028) [2024-06-28 09:38:40,974][06887] Signal inference workers to stop experience collection... (44650 times) [2024-06-28 09:38:40,975][06887] Signal inference workers to resume experience collection... (44650 times) [2024-06-28 09:38:41,008][06909] InferenceWorker_p0-w0: stopping experience collection (44650 times) [2024-06-28 09:38:41,008][06909] InferenceWorker_p0-w0: resuming experience collection (44650 times) [2024-06-28 09:38:43,182][06909] Updated weights for policy 0, policy_version 198253 (0.0028) [2024-06-28 09:38:43,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3248177152. Throughput: 0: 44052.0. Samples: 3151071380. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 09:38:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:38:47,033][06909] Updated weights for policy 0, policy_version 198263 (0.0035) [2024-06-28 09:38:48,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3248406528. Throughput: 0: 44122.4. Samples: 3151334640. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 09:38:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 09:38:50,925][06909] Updated weights for policy 0, policy_version 198273 (0.0035) [2024-06-28 09:38:53,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43965.2, 300 sec: 44098.0). Total num frames: 3248635904. Throughput: 0: 44113.3. Samples: 3151591560. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 09:38:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 09:38:54,881][06909] Updated weights for policy 0, policy_version 198283 (0.0029) [2024-06-28 09:38:58,282][06909] Updated weights for policy 0, policy_version 198293 (0.0036) [2024-06-28 09:38:58,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3248848896. Throughput: 0: 44017.8. Samples: 3151727780. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 09:38:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 09:39:02,109][06909] Updated weights for policy 0, policy_version 198303 (0.0032) [2024-06-28 09:39:03,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44237.0, 300 sec: 43986.9). Total num frames: 3249061888. Throughput: 0: 44184.4. Samples: 3151995920. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 09:39:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 09:39:05,835][06909] Updated weights for policy 0, policy_version 198313 (0.0027) [2024-06-28 09:39:08,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 3249307648. Throughput: 0: 44011.5. Samples: 3152249040. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 09:39:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:39:09,241][06909] Updated weights for policy 0, policy_version 198323 (0.0036) [2024-06-28 09:39:13,065][06909] Updated weights for policy 0, policy_version 198333 (0.0033) [2024-06-28 09:39:13,850][06674] Fps is (10 sec: 45874.5, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 3249520640. Throughput: 0: 44128.8. Samples: 3152394460. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 09:39:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 09:39:16,970][06909] Updated weights for policy 0, policy_version 198343 (0.0039) [2024-06-28 09:39:18,851][06674] Fps is (10 sec: 40954.7, 60 sec: 44235.8, 300 sec: 43986.7). Total num frames: 3249717248. Throughput: 0: 44277.4. Samples: 3152662560. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 09:39:18,852][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 09:39:20,441][06909] Updated weights for policy 0, policy_version 198353 (0.0031) [2024-06-28 09:39:23,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 3249963008. Throughput: 0: 44113.8. Samples: 3152915900. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 09:39:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:39:24,322][06909] Updated weights for policy 0, policy_version 198363 (0.0032) [2024-06-28 09:39:28,038][06909] Updated weights for policy 0, policy_version 198373 (0.0030) [2024-06-28 09:39:28,850][06674] Fps is (10 sec: 45881.4, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 3250176000. Throughput: 0: 44049.4. Samples: 3153053600. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 09:39:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:39:31,981][06909] Updated weights for policy 0, policy_version 198383 (0.0033) [2024-06-28 09:39:33,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 3250372608. Throughput: 0: 44008.4. Samples: 3153315020. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 09:39:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:39:35,449][06909] Updated weights for policy 0, policy_version 198393 (0.0038) [2024-06-28 09:39:38,852][06674] Fps is (10 sec: 44227.6, 60 sec: 43962.3, 300 sec: 44097.6). Total num frames: 3250618368. Throughput: 0: 44055.7. Samples: 3153574160. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 09:39:38,853][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:39:39,463][06909] Updated weights for policy 0, policy_version 198403 (0.0021) [2024-06-28 09:39:42,911][06909] Updated weights for policy 0, policy_version 198413 (0.0046) [2024-06-28 09:39:43,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 3250831360. Throughput: 0: 44177.2. Samples: 3153715760. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 09:39:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:39:46,906][06909] Updated weights for policy 0, policy_version 198423 (0.0032) [2024-06-28 09:39:48,850][06674] Fps is (10 sec: 42605.9, 60 sec: 43963.6, 300 sec: 43987.1). Total num frames: 3251044352. Throughput: 0: 44009.9. Samples: 3153976380. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 09:39:48,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:39:48,865][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000198428_3251044352.pth... [2024-06-28 09:39:48,933][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000197783_3240476672.pth [2024-06-28 09:39:50,231][06909] Updated weights for policy 0, policy_version 198433 (0.0032) [2024-06-28 09:39:53,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43963.7, 300 sec: 43987.4). Total num frames: 3251273728. Throughput: 0: 44221.8. Samples: 3154239020. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 09:39:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:39:54,021][06909] Updated weights for policy 0, policy_version 198443 (0.0036) [2024-06-28 09:39:57,571][06909] Updated weights for policy 0, policy_version 198453 (0.0028) [2024-06-28 09:39:58,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 3251486720. Throughput: 0: 44091.0. Samples: 3154378560. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 09:39:58,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:40:01,804][06909] Updated weights for policy 0, policy_version 198463 (0.0031) [2024-06-28 09:40:03,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.8, 300 sec: 43931.9). Total num frames: 3251699712. Throughput: 0: 44026.2. Samples: 3154643680. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 09:40:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:40:04,937][06909] Updated weights for policy 0, policy_version 198473 (0.0028) [2024-06-28 09:40:08,852][06674] Fps is (10 sec: 44228.4, 60 sec: 43689.1, 300 sec: 44042.1). Total num frames: 3251929088. Throughput: 0: 44030.4. Samples: 3154897360. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 09:40:08,853][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:40:09,006][06909] Updated weights for policy 0, policy_version 198483 (0.0031) [2024-06-28 09:40:12,222][06909] Updated weights for policy 0, policy_version 198493 (0.0031) [2024-06-28 09:40:13,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.9, 300 sec: 44098.0). Total num frames: 3252158464. Throughput: 0: 44125.4. Samples: 3155039240. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 09:40:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:40:16,678][06909] Updated weights for policy 0, policy_version 198503 (0.0035) [2024-06-28 09:40:18,850][06674] Fps is (10 sec: 42607.6, 60 sec: 43964.7, 300 sec: 43931.3). Total num frames: 3252355072. Throughput: 0: 44133.0. Samples: 3155301000. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 09:40:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:40:18,919][06887] Signal inference workers to stop experience collection... (44700 times) [2024-06-28 09:40:18,963][06909] InferenceWorker_p0-w0: stopping experience collection (44700 times) [2024-06-28 09:40:19,034][06887] Signal inference workers to resume experience collection... (44700 times) [2024-06-28 09:40:19,034][06909] InferenceWorker_p0-w0: resuming experience collection (44700 times) [2024-06-28 09:40:20,101][06909] Updated weights for policy 0, policy_version 198513 (0.0021) [2024-06-28 09:40:23,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3252584448. Throughput: 0: 43928.2. Samples: 3155550840. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 09:40:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:40:24,029][06909] Updated weights for policy 0, policy_version 198523 (0.0035) [2024-06-28 09:40:27,463][06909] Updated weights for policy 0, policy_version 198533 (0.0033) [2024-06-28 09:40:28,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3252813824. Throughput: 0: 43853.0. Samples: 3155689140. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 09:40:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:40:31,488][06909] Updated weights for policy 0, policy_version 198543 (0.0031) [2024-06-28 09:40:33,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 3253010432. Throughput: 0: 43971.9. Samples: 3155955100. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 09:40:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:40:35,132][06909] Updated weights for policy 0, policy_version 198553 (0.0022) [2024-06-28 09:40:38,850][06674] Fps is (10 sec: 42597.4, 60 sec: 43692.0, 300 sec: 43986.9). Total num frames: 3253239808. Throughput: 0: 43897.5. Samples: 3156214420. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 09:40:38,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:40:39,034][06909] Updated weights for policy 0, policy_version 198563 (0.0043) [2024-06-28 09:40:42,549][06909] Updated weights for policy 0, policy_version 198573 (0.0034) [2024-06-28 09:40:43,850][06674] Fps is (10 sec: 47512.7, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 3253485568. Throughput: 0: 43850.3. Samples: 3156351820. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 09:40:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 09:40:46,154][06909] Updated weights for policy 0, policy_version 198583 (0.0027) [2024-06-28 09:40:48,850][06674] Fps is (10 sec: 45876.0, 60 sec: 44236.9, 300 sec: 44097.9). Total num frames: 3253698560. Throughput: 0: 43914.5. Samples: 3156619840. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 09:40:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:40:49,828][06909] Updated weights for policy 0, policy_version 198593 (0.0040) [2024-06-28 09:40:53,805][06909] Updated weights for policy 0, policy_version 198603 (0.0043) [2024-06-28 09:40:53,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 3253911552. Throughput: 0: 44035.4. Samples: 3156878860. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 09:40:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:40:57,232][06909] Updated weights for policy 0, policy_version 198613 (0.0033) [2024-06-28 09:40:58,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.9, 300 sec: 44098.3). Total num frames: 3254140928. Throughput: 0: 43896.8. Samples: 3157014600. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 09:40:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:41:00,984][06909] Updated weights for policy 0, policy_version 198623 (0.0027) [2024-06-28 09:41:03,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3254353920. Throughput: 0: 44025.3. Samples: 3157282140. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 09:41:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:41:04,569][06909] Updated weights for policy 0, policy_version 198633 (0.0038) [2024-06-28 09:41:08,440][06909] Updated weights for policy 0, policy_version 198643 (0.0025) [2024-06-28 09:41:08,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43965.3, 300 sec: 44042.4). Total num frames: 3254566912. Throughput: 0: 44394.7. Samples: 3157548600. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 09:41:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:41:11,941][06909] Updated weights for policy 0, policy_version 198653 (0.0034) [2024-06-28 09:41:13,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 3254812672. Throughput: 0: 44216.8. Samples: 3157678900. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 09:41:13,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:41:15,783][06909] Updated weights for policy 0, policy_version 198663 (0.0036) [2024-06-28 09:41:18,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3255009280. Throughput: 0: 44207.0. Samples: 3157944420. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 09:41:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:41:19,363][06909] Updated weights for policy 0, policy_version 198673 (0.0031) [2024-06-28 09:41:23,457][06909] Updated weights for policy 0, policy_version 198683 (0.0030) [2024-06-28 09:41:23,852][06674] Fps is (10 sec: 40952.0, 60 sec: 43962.3, 300 sec: 44042.1). Total num frames: 3255222272. Throughput: 0: 44268.5. Samples: 3158206580. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 09:41:23,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:41:27,181][06909] Updated weights for policy 0, policy_version 198693 (0.0027) [2024-06-28 09:41:28,853][06674] Fps is (10 sec: 45859.5, 60 sec: 44234.3, 300 sec: 44153.0). Total num frames: 3255468032. Throughput: 0: 44151.9. Samples: 3158338800. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 09:41:28,854][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:41:30,952][06909] Updated weights for policy 0, policy_version 198703 (0.0028) [2024-06-28 09:41:33,850][06674] Fps is (10 sec: 44245.3, 60 sec: 44236.7, 300 sec: 43931.3). Total num frames: 3255664640. Throughput: 0: 43990.2. Samples: 3158599400. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 09:41:33,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:41:34,474][06909] Updated weights for policy 0, policy_version 198713 (0.0037) [2024-06-28 09:41:38,179][06909] Updated weights for policy 0, policy_version 198723 (0.0030) [2024-06-28 09:41:38,850][06674] Fps is (10 sec: 42611.8, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3255894016. Throughput: 0: 44240.2. Samples: 3158869680. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 09:41:38,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:41:41,722][06909] Updated weights for policy 0, policy_version 198733 (0.0037) [2024-06-28 09:41:43,850][06674] Fps is (10 sec: 47513.9, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 3256139776. Throughput: 0: 44172.0. Samples: 3159002340. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 09:41:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:41:45,572][06909] Updated weights for policy 0, policy_version 198743 (0.0043) [2024-06-28 09:41:48,850][06674] Fps is (10 sec: 45876.2, 60 sec: 44236.8, 300 sec: 44042.7). Total num frames: 3256352768. Throughput: 0: 44188.4. Samples: 3159270620. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 09:41:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:41:48,885][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000198752_3256352768.pth... [2024-06-28 09:41:48,930][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000198105_3245752320.pth [2024-06-28 09:41:49,268][06909] Updated weights for policy 0, policy_version 198753 (0.0022) [2024-06-28 09:41:53,037][06909] Updated weights for policy 0, policy_version 198763 (0.0033) [2024-06-28 09:41:53,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3256549376. Throughput: 0: 44109.8. Samples: 3159533540. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 09:41:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:41:56,810][06909] Updated weights for policy 0, policy_version 198773 (0.0027) [2024-06-28 09:41:58,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 3256795136. Throughput: 0: 44091.2. Samples: 3159663000. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 09:41:58,850][06674] Avg episode reward: [(0, '0.473')] [2024-06-28 09:42:00,838][06909] Updated weights for policy 0, policy_version 198783 (0.0035) [2024-06-28 09:42:02,990][06887] Signal inference workers to stop experience collection... (44750 times) [2024-06-28 09:42:02,991][06887] Signal inference workers to resume experience collection... (44750 times) [2024-06-28 09:42:03,006][06909] InferenceWorker_p0-w0: stopping experience collection (44750 times) [2024-06-28 09:42:03,006][06909] InferenceWorker_p0-w0: resuming experience collection (44750 times) [2024-06-28 09:42:03,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3256991744. Throughput: 0: 43961.8. Samples: 3159922700. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 09:42:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:42:04,304][06909] Updated weights for policy 0, policy_version 198793 (0.0036) [2024-06-28 09:42:08,191][06909] Updated weights for policy 0, policy_version 198803 (0.0032) [2024-06-28 09:42:08,852][06674] Fps is (10 sec: 40951.4, 60 sec: 43962.2, 300 sec: 43986.6). Total num frames: 3257204736. Throughput: 0: 43992.8. Samples: 3160186260. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 09:42:08,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:42:11,645][06909] Updated weights for policy 0, policy_version 198813 (0.0048) [2024-06-28 09:42:13,850][06674] Fps is (10 sec: 47512.6, 60 sec: 44236.7, 300 sec: 44153.8). Total num frames: 3257466880. Throughput: 0: 43997.4. Samples: 3160318540. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 09:42:13,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:42:15,428][06909] Updated weights for policy 0, policy_version 198823 (0.0047) [2024-06-28 09:42:18,850][06674] Fps is (10 sec: 45884.8, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3257663488. Throughput: 0: 44011.2. Samples: 3160579900. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 09:42:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:42:19,422][06909] Updated weights for policy 0, policy_version 198833 (0.0041) [2024-06-28 09:42:23,032][06909] Updated weights for policy 0, policy_version 198843 (0.0027) [2024-06-28 09:42:23,850][06674] Fps is (10 sec: 40960.6, 60 sec: 44238.3, 300 sec: 44042.5). Total num frames: 3257876480. Throughput: 0: 43894.1. Samples: 3160844900. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 09:42:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:42:26,771][06909] Updated weights for policy 0, policy_version 198853 (0.0026) [2024-06-28 09:42:28,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43966.3, 300 sec: 44098.0). Total num frames: 3258105856. Throughput: 0: 43761.4. Samples: 3160971600. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 09:42:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:42:30,354][06909] Updated weights for policy 0, policy_version 198863 (0.0033) [2024-06-28 09:42:33,852][06674] Fps is (10 sec: 44227.7, 60 sec: 44235.4, 300 sec: 43986.6). Total num frames: 3258318848. Throughput: 0: 43678.1. Samples: 3161236220. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 09:42:33,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:42:34,345][06909] Updated weights for policy 0, policy_version 198873 (0.0041) [2024-06-28 09:42:38,040][06909] Updated weights for policy 0, policy_version 198883 (0.0042) [2024-06-28 09:42:38,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43963.9, 300 sec: 44042.4). Total num frames: 3258531840. Throughput: 0: 43797.2. Samples: 3161504420. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 09:42:38,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:42:41,762][06909] Updated weights for policy 0, policy_version 198893 (0.0030) [2024-06-28 09:42:43,850][06674] Fps is (10 sec: 45884.8, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 3258777600. Throughput: 0: 43868.0. Samples: 3161637060. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 09:42:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:42:45,128][06909] Updated weights for policy 0, policy_version 198903 (0.0034) [2024-06-28 09:42:48,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.7, 300 sec: 43987.2). Total num frames: 3258974208. Throughput: 0: 43932.3. Samples: 3161899660. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 09:42:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:42:48,881][06909] Updated weights for policy 0, policy_version 198913 (0.0039) [2024-06-28 09:42:52,947][06909] Updated weights for policy 0, policy_version 198923 (0.0030) [2024-06-28 09:42:53,850][06674] Fps is (10 sec: 42598.0, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 3259203584. Throughput: 0: 44047.8. Samples: 3162168320. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 09:42:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:42:56,656][06909] Updated weights for policy 0, policy_version 198933 (0.0030) [2024-06-28 09:42:58,850][06674] Fps is (10 sec: 45875.9, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 3259432960. Throughput: 0: 44067.8. Samples: 3162301580. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 09:42:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:43:00,224][06909] Updated weights for policy 0, policy_version 198943 (0.0046) [2024-06-28 09:43:03,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3259629568. Throughput: 0: 44039.6. Samples: 3162561680. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 09:43:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:43:03,881][06909] Updated weights for policy 0, policy_version 198953 (0.0031) [2024-06-28 09:43:07,778][06909] Updated weights for policy 0, policy_version 198963 (0.0037) [2024-06-28 09:43:08,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43965.3, 300 sec: 43986.9). Total num frames: 3259842560. Throughput: 0: 44125.4. Samples: 3162830540. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 09:43:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:43:11,155][06909] Updated weights for policy 0, policy_version 198973 (0.0025) [2024-06-28 09:43:13,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43690.8, 300 sec: 44153.5). Total num frames: 3260088320. Throughput: 0: 44163.1. Samples: 3162958940. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 09:43:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:43:15,029][06909] Updated weights for policy 0, policy_version 198983 (0.0026) [2024-06-28 09:43:18,799][06909] Updated weights for policy 0, policy_version 198993 (0.0032) [2024-06-28 09:43:18,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3260301312. Throughput: 0: 44142.4. Samples: 3163222540. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 09:43:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:43:22,633][06909] Updated weights for policy 0, policy_version 199003 (0.0026) [2024-06-28 09:43:23,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3260514304. Throughput: 0: 44211.2. Samples: 3163493920. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 09:43:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:43:25,984][06909] Updated weights for policy 0, policy_version 199013 (0.0034) [2024-06-28 09:43:28,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 3260743680. Throughput: 0: 44191.6. Samples: 3163625680. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 09:43:28,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:43:29,838][06909] Updated weights for policy 0, policy_version 199023 (0.0031) [2024-06-28 09:43:32,121][06887] Signal inference workers to stop experience collection... (44800 times) [2024-06-28 09:43:32,121][06887] Signal inference workers to resume experience collection... (44800 times) [2024-06-28 09:43:32,137][06909] InferenceWorker_p0-w0: stopping experience collection (44800 times) [2024-06-28 09:43:32,137][06909] InferenceWorker_p0-w0: resuming experience collection (44800 times) [2024-06-28 09:43:33,612][06909] Updated weights for policy 0, policy_version 199033 (0.0030) [2024-06-28 09:43:33,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44238.3, 300 sec: 44042.4). Total num frames: 3260973056. Throughput: 0: 43990.2. Samples: 3163879220. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 09:43:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:43:37,190][06909] Updated weights for policy 0, policy_version 199043 (0.0023) [2024-06-28 09:43:38,850][06674] Fps is (10 sec: 44236.0, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 3261186048. Throughput: 0: 44132.4. Samples: 3164154280. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 09:43:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:43:40,957][06909] Updated weights for policy 0, policy_version 199053 (0.0034) [2024-06-28 09:43:43,856][06674] Fps is (10 sec: 42573.1, 60 sec: 43686.3, 300 sec: 44041.5). Total num frames: 3261399040. Throughput: 0: 43968.7. Samples: 3164280440. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 09:43:43,856][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:43:44,893][06909] Updated weights for policy 0, policy_version 199063 (0.0030) [2024-06-28 09:43:48,478][06909] Updated weights for policy 0, policy_version 199073 (0.0030) [2024-06-28 09:43:48,850][06674] Fps is (10 sec: 45876.0, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 3261644800. Throughput: 0: 43968.8. Samples: 3164540280. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 09:43:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:43:48,860][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000199075_3261644800.pth... [2024-06-28 09:43:48,909][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000198428_3251044352.pth [2024-06-28 09:43:52,434][06909] Updated weights for policy 0, policy_version 199083 (0.0026) [2024-06-28 09:43:53,850][06674] Fps is (10 sec: 45900.2, 60 sec: 44236.5, 300 sec: 44097.9). Total num frames: 3261857792. Throughput: 0: 43877.2. Samples: 3164805040. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 09:43:53,851][06674] Avg episode reward: [(0, '0.431')] [2024-06-28 09:43:55,820][06909] Updated weights for policy 0, policy_version 199093 (0.0038) [2024-06-28 09:43:58,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43690.5, 300 sec: 44042.4). Total num frames: 3262054400. Throughput: 0: 44023.5. Samples: 3164940000. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 09:43:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:43:59,691][06909] Updated weights for policy 0, policy_version 199103 (0.0031) [2024-06-28 09:44:03,382][06909] Updated weights for policy 0, policy_version 199113 (0.0036) [2024-06-28 09:44:03,850][06674] Fps is (10 sec: 42600.8, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3262283776. Throughput: 0: 43997.4. Samples: 3165202420. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 09:44:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:44:07,169][06909] Updated weights for policy 0, policy_version 199123 (0.0031) [2024-06-28 09:44:08,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 3262513152. Throughput: 0: 43923.1. Samples: 3165470460. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 09:44:08,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 09:44:10,901][06909] Updated weights for policy 0, policy_version 199133 (0.0037) [2024-06-28 09:44:13,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.7, 300 sec: 44098.1). Total num frames: 3262726144. Throughput: 0: 43880.8. Samples: 3165600320. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 09:44:13,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:44:14,810][06909] Updated weights for policy 0, policy_version 199143 (0.0030) [2024-06-28 09:44:18,210][06909] Updated weights for policy 0, policy_version 199153 (0.0029) [2024-06-28 09:44:18,850][06674] Fps is (10 sec: 44236.2, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 3262955520. Throughput: 0: 44192.4. Samples: 3165867880. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 09:44:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:44:22,116][06909] Updated weights for policy 0, policy_version 199163 (0.0036) [2024-06-28 09:44:23,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 3263184896. Throughput: 0: 43980.6. Samples: 3166133400. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 09:44:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:44:25,432][06909] Updated weights for policy 0, policy_version 199173 (0.0024) [2024-06-28 09:44:28,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 3263381504. Throughput: 0: 44176.5. Samples: 3166268120. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 09:44:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:44:29,978][06909] Updated weights for policy 0, policy_version 199183 (0.0025) [2024-06-28 09:44:32,582][06909] Updated weights for policy 0, policy_version 199193 (0.0033) [2024-06-28 09:44:33,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44509.9, 300 sec: 44153.8). Total num frames: 3263643648. Throughput: 0: 44312.4. Samples: 3166534340. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 09:44:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:44:37,055][06909] Updated weights for policy 0, policy_version 199203 (0.0036) [2024-06-28 09:44:38,850][06674] Fps is (10 sec: 47514.1, 60 sec: 44510.0, 300 sec: 44153.5). Total num frames: 3263856640. Throughput: 0: 44369.0. Samples: 3166801620. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 09:44:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:44:40,353][06909] Updated weights for policy 0, policy_version 199213 (0.0041) [2024-06-28 09:44:43,852][06674] Fps is (10 sec: 40951.6, 60 sec: 44239.7, 300 sec: 44097.7). Total num frames: 3264053248. Throughput: 0: 44331.8. Samples: 3166935020. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 09:44:43,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:44:44,480][06909] Updated weights for policy 0, policy_version 199223 (0.0033) [2024-06-28 09:44:47,513][06909] Updated weights for policy 0, policy_version 199233 (0.0034) [2024-06-28 09:44:48,479][06887] Signal inference workers to stop experience collection... (44850 times) [2024-06-28 09:44:48,479][06887] Signal inference workers to resume experience collection... (44850 times) [2024-06-28 09:44:48,539][06909] InferenceWorker_p0-w0: stopping experience collection (44850 times) [2024-06-28 09:44:48,539][06909] InferenceWorker_p0-w0: resuming experience collection (44850 times) [2024-06-28 09:44:48,850][06674] Fps is (10 sec: 44235.8, 60 sec: 44236.6, 300 sec: 44153.5). Total num frames: 3264299008. Throughput: 0: 44499.8. Samples: 3167204920. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 09:44:48,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:44:51,857][06909] Updated weights for policy 0, policy_version 199243 (0.0026) [2024-06-28 09:44:53,850][06674] Fps is (10 sec: 47523.2, 60 sec: 44510.3, 300 sec: 44209.1). Total num frames: 3264528384. Throughput: 0: 44387.5. Samples: 3167467900. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 09:44:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:44:54,805][06909] Updated weights for policy 0, policy_version 199253 (0.0031) [2024-06-28 09:44:58,850][06674] Fps is (10 sec: 40960.6, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 3264708608. Throughput: 0: 44528.5. Samples: 3167604100. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 09:44:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:44:59,123][06909] Updated weights for policy 0, policy_version 199263 (0.0027) [2024-06-28 09:45:02,356][06909] Updated weights for policy 0, policy_version 199273 (0.0024) [2024-06-28 09:45:03,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44509.9, 300 sec: 44153.8). Total num frames: 3264954368. Throughput: 0: 44373.0. Samples: 3167864660. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 09:45:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:45:06,933][06909] Updated weights for policy 0, policy_version 199283 (0.0030) [2024-06-28 09:45:08,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 3265167360. Throughput: 0: 44115.0. Samples: 3168118580. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 09:45:08,851][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 09:45:09,809][06909] Updated weights for policy 0, policy_version 199293 (0.0034) [2024-06-28 09:45:13,853][06674] Fps is (10 sec: 40948.1, 60 sec: 43961.6, 300 sec: 44097.5). Total num frames: 3265363968. Throughput: 0: 44098.5. Samples: 3168252680. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 09:45:13,854][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 09:45:14,274][06909] Updated weights for policy 0, policy_version 199303 (0.0030) [2024-06-28 09:45:17,304][06909] Updated weights for policy 0, policy_version 199313 (0.0040) [2024-06-28 09:45:18,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 3265609728. Throughput: 0: 44080.0. Samples: 3168517940. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 09:45:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:45:21,498][06909] Updated weights for policy 0, policy_version 199323 (0.0032) [2024-06-28 09:45:23,850][06674] Fps is (10 sec: 45888.8, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 3265822720. Throughput: 0: 44145.8. Samples: 3168788180. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 09:45:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:45:24,569][06909] Updated weights for policy 0, policy_version 199333 (0.0028) [2024-06-28 09:45:28,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 3266019328. Throughput: 0: 43957.2. Samples: 3168913000. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 09:45:28,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 09:45:28,885][06909] Updated weights for policy 0, policy_version 199343 (0.0034) [2024-06-28 09:45:32,157][06909] Updated weights for policy 0, policy_version 199353 (0.0031) [2024-06-28 09:45:33,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43963.7, 300 sec: 44209.1). Total num frames: 3266281472. Throughput: 0: 43991.2. Samples: 3169184520. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 09:45:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:45:36,292][06909] Updated weights for policy 0, policy_version 199363 (0.0030) [2024-06-28 09:45:38,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 3266478080. Throughput: 0: 44092.5. Samples: 3169452060. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 09:45:38,850][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 09:45:39,648][06909] Updated weights for policy 0, policy_version 199373 (0.0041) [2024-06-28 09:45:43,850][06674] Fps is (10 sec: 39321.6, 60 sec: 43692.1, 300 sec: 43986.9). Total num frames: 3266674688. Throughput: 0: 43824.9. Samples: 3169576220. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 09:45:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:45:43,873][06909] Updated weights for policy 0, policy_version 199383 (0.0034) [2024-06-28 09:45:46,895][06909] Updated weights for policy 0, policy_version 199393 (0.0029) [2024-06-28 09:45:48,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.9, 300 sec: 44153.5). Total num frames: 3266936832. Throughput: 0: 43948.0. Samples: 3169842320. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 09:45:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:45:48,872][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000199398_3266936832.pth... [2024-06-28 09:45:48,930][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000198752_3256352768.pth [2024-06-28 09:45:51,161][06909] Updated weights for policy 0, policy_version 199403 (0.0038) [2024-06-28 09:45:53,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43417.6, 300 sec: 44042.4). Total num frames: 3267133440. Throughput: 0: 44276.9. Samples: 3170111040. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 09:45:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:45:54,607][06909] Updated weights for policy 0, policy_version 199413 (0.0038) [2024-06-28 09:45:58,467][06909] Updated weights for policy 0, policy_version 199423 (0.0027) [2024-06-28 09:45:58,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3267346432. Throughput: 0: 44109.9. Samples: 3170237500. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 09:45:58,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 09:46:01,859][06909] Updated weights for policy 0, policy_version 199433 (0.0031) [2024-06-28 09:46:03,852][06674] Fps is (10 sec: 45865.9, 60 sec: 43962.2, 300 sec: 44153.2). Total num frames: 3267592192. Throughput: 0: 44122.9. Samples: 3170503560. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 09:46:03,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:46:05,975][06909] Updated weights for policy 0, policy_version 199443 (0.0035) [2024-06-28 09:46:08,532][06887] Signal inference workers to stop experience collection... (44900 times) [2024-06-28 09:46:08,536][06887] Signal inference workers to resume experience collection... (44900 times) [2024-06-28 09:46:08,574][06909] InferenceWorker_p0-w0: stopping experience collection (44900 times) [2024-06-28 09:46:08,574][06909] InferenceWorker_p0-w0: resuming experience collection (44900 times) [2024-06-28 09:46:08,852][06674] Fps is (10 sec: 44227.8, 60 sec: 43689.2, 300 sec: 43986.6). Total num frames: 3267788800. Throughput: 0: 44222.4. Samples: 3170778280. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 09:46:08,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:46:09,316][06909] Updated weights for policy 0, policy_version 199453 (0.0026) [2024-06-28 09:46:13,327][06909] Updated weights for policy 0, policy_version 199463 (0.0030) [2024-06-28 09:46:13,850][06674] Fps is (10 sec: 42607.2, 60 sec: 44238.9, 300 sec: 44098.0). Total num frames: 3268018176. Throughput: 0: 44123.1. Samples: 3170898540. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 09:46:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:46:16,772][06909] Updated weights for policy 0, policy_version 199473 (0.0037) [2024-06-28 09:46:18,856][06674] Fps is (10 sec: 45856.8, 60 sec: 43959.3, 300 sec: 44152.9). Total num frames: 3268247552. Throughput: 0: 43828.8. Samples: 3171157080. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 09:46:18,856][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:46:20,807][06909] Updated weights for policy 0, policy_version 199483 (0.0028) [2024-06-28 09:46:23,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.8, 300 sec: 44042.9). Total num frames: 3268460544. Throughput: 0: 44064.9. Samples: 3171434980. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 09:46:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:46:24,341][06909] Updated weights for policy 0, policy_version 199493 (0.0040) [2024-06-28 09:46:28,077][06909] Updated weights for policy 0, policy_version 199503 (0.0035) [2024-06-28 09:46:28,850][06674] Fps is (10 sec: 42624.3, 60 sec: 44236.7, 300 sec: 44098.0). Total num frames: 3268673536. Throughput: 0: 44173.4. Samples: 3171564020. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 09:46:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:46:31,835][06909] Updated weights for policy 0, policy_version 199513 (0.0040) [2024-06-28 09:46:33,850][06674] Fps is (10 sec: 47513.2, 60 sec: 44236.8, 300 sec: 44209.1). Total num frames: 3268935680. Throughput: 0: 44130.2. Samples: 3171828180. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 09:46:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:46:35,289][06909] Updated weights for policy 0, policy_version 199523 (0.0027) [2024-06-28 09:46:38,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 3269099520. Throughput: 0: 44280.1. Samples: 3172103640. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 09:46:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:46:39,318][06909] Updated weights for policy 0, policy_version 199533 (0.0029) [2024-06-28 09:46:42,938][06909] Updated weights for policy 0, policy_version 199543 (0.0021) [2024-06-28 09:46:43,850][06674] Fps is (10 sec: 42598.0, 60 sec: 44782.9, 300 sec: 44097.9). Total num frames: 3269361664. Throughput: 0: 44128.8. Samples: 3172223300. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 09:46:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:46:46,901][06909] Updated weights for policy 0, policy_version 199553 (0.0033) [2024-06-28 09:46:48,850][06674] Fps is (10 sec: 47513.3, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 3269574656. Throughput: 0: 43957.1. Samples: 3172481540. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 09:46:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:46:50,411][06909] Updated weights for policy 0, policy_version 199563 (0.0027) [2024-06-28 09:46:53,850][06674] Fps is (10 sec: 42599.0, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 3269787648. Throughput: 0: 43998.9. Samples: 3172758140. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 09:46:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:46:54,182][06909] Updated weights for policy 0, policy_version 199573 (0.0028) [2024-06-28 09:46:57,771][06909] Updated weights for policy 0, policy_version 199583 (0.0031) [2024-06-28 09:46:58,850][06674] Fps is (10 sec: 42598.9, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 3270000640. Throughput: 0: 44142.8. Samples: 3172884960. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 09:46:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:47:01,643][06909] Updated weights for policy 0, policy_version 199593 (0.0034) [2024-06-28 09:47:03,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44238.3, 300 sec: 44209.3). Total num frames: 3270246400. Throughput: 0: 44272.2. Samples: 3173149060. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 09:47:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:47:05,296][06909] Updated weights for policy 0, policy_version 199603 (0.0026) [2024-06-28 09:47:08,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43965.2, 300 sec: 43931.4). Total num frames: 3270426624. Throughput: 0: 44112.8. Samples: 3173420060. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 09:47:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 09:47:09,134][06909] Updated weights for policy 0, policy_version 199613 (0.0034) [2024-06-28 09:47:12,553][06909] Updated weights for policy 0, policy_version 199623 (0.0031) [2024-06-28 09:47:13,850][06674] Fps is (10 sec: 42598.7, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 3270672384. Throughput: 0: 44084.5. Samples: 3173547820. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 09:47:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:47:16,695][06909] Updated weights for policy 0, policy_version 199633 (0.0026) [2024-06-28 09:47:18,850][06674] Fps is (10 sec: 47513.8, 60 sec: 44241.3, 300 sec: 44153.5). Total num frames: 3270901760. Throughput: 0: 44019.6. Samples: 3173809060. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 09:47:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:47:20,027][06909] Updated weights for policy 0, policy_version 199643 (0.0033) [2024-06-28 09:47:23,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3271081984. Throughput: 0: 43804.5. Samples: 3174074840. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 09:47:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:47:24,201][06909] Updated weights for policy 0, policy_version 199653 (0.0034) [2024-06-28 09:47:27,571][06887] Signal inference workers to stop experience collection... (44950 times) [2024-06-28 09:47:27,572][06887] Signal inference workers to resume experience collection... (44950 times) [2024-06-28 09:47:27,582][06909] InferenceWorker_p0-w0: stopping experience collection (44950 times) [2024-06-28 09:47:27,589][06909] Updated weights for policy 0, policy_version 199663 (0.0030) [2024-06-28 09:47:27,616][06909] InferenceWorker_p0-w0: resuming experience collection (44950 times) [2024-06-28 09:47:28,850][06674] Fps is (10 sec: 42597.5, 60 sec: 44236.7, 300 sec: 44098.2). Total num frames: 3271327744. Throughput: 0: 43821.3. Samples: 3174195260. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 09:47:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:47:31,356][06909] Updated weights for policy 0, policy_version 199673 (0.0025) [2024-06-28 09:47:33,850][06674] Fps is (10 sec: 47512.8, 60 sec: 43690.6, 300 sec: 44153.5). Total num frames: 3271557120. Throughput: 0: 44100.4. Samples: 3174466060. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 09:47:33,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:47:34,898][06909] Updated weights for policy 0, policy_version 199683 (0.0024) [2024-06-28 09:47:38,721][06909] Updated weights for policy 0, policy_version 199693 (0.0034) [2024-06-28 09:47:38,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44509.7, 300 sec: 44042.4). Total num frames: 3271770112. Throughput: 0: 44087.8. Samples: 3174742100. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 09:47:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:47:42,220][06909] Updated weights for policy 0, policy_version 199703 (0.0025) [2024-06-28 09:47:43,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 3271999488. Throughput: 0: 44094.5. Samples: 3174869220. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 09:47:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:47:46,302][06909] Updated weights for policy 0, policy_version 199713 (0.0037) [2024-06-28 09:47:48,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 3272228864. Throughput: 0: 44149.3. Samples: 3175135780. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 09:47:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:47:48,865][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000199721_3272228864.pth... [2024-06-28 09:47:48,920][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000199075_3261644800.pth [2024-06-28 09:47:49,667][06909] Updated weights for policy 0, policy_version 199723 (0.0043) [2024-06-28 09:47:53,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3272409088. Throughput: 0: 44143.6. Samples: 3175406520. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 09:47:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:47:53,878][06909] Updated weights for policy 0, policy_version 199733 (0.0033) [2024-06-28 09:47:57,252][06909] Updated weights for policy 0, policy_version 199743 (0.0035) [2024-06-28 09:47:58,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 3272638464. Throughput: 0: 43966.7. Samples: 3175526320. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 09:47:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:48:01,134][06909] Updated weights for policy 0, policy_version 199753 (0.0029) [2024-06-28 09:48:03,850][06674] Fps is (10 sec: 45874.2, 60 sec: 43690.6, 300 sec: 44153.5). Total num frames: 3272867840. Throughput: 0: 44085.1. Samples: 3175792900. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 09:48:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:48:04,437][06909] Updated weights for policy 0, policy_version 199763 (0.0024) [2024-06-28 09:48:08,490][06909] Updated weights for policy 0, policy_version 199773 (0.0034) [2024-06-28 09:48:08,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 3273097216. Throughput: 0: 44148.4. Samples: 3176061520. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 09:48:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:48:12,052][06909] Updated weights for policy 0, policy_version 199783 (0.0042) [2024-06-28 09:48:13,850][06674] Fps is (10 sec: 45876.1, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 3273326592. Throughput: 0: 44343.3. Samples: 3176190700. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 09:48:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:48:15,916][06909] Updated weights for policy 0, policy_version 199793 (0.0047) [2024-06-28 09:48:18,850][06674] Fps is (10 sec: 44235.8, 60 sec: 43963.6, 300 sec: 44153.5). Total num frames: 3273539584. Throughput: 0: 44250.6. Samples: 3176457340. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 09:48:18,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 09:48:19,289][06909] Updated weights for policy 0, policy_version 199803 (0.0036) [2024-06-28 09:48:23,467][06909] Updated weights for policy 0, policy_version 199813 (0.0040) [2024-06-28 09:48:23,850][06674] Fps is (10 sec: 42597.9, 60 sec: 44509.8, 300 sec: 44097.9). Total num frames: 3273752576. Throughput: 0: 44062.3. Samples: 3176724900. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 09:48:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:48:26,604][06909] Updated weights for policy 0, policy_version 199823 (0.0030) [2024-06-28 09:48:28,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 3273981952. Throughput: 0: 44068.8. Samples: 3176852320. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 09:48:28,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:48:30,851][06909] Updated weights for policy 0, policy_version 199833 (0.0028) [2024-06-28 09:48:33,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 3274194944. Throughput: 0: 44129.3. Samples: 3177121600. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 09:48:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:48:34,156][06909] Updated weights for policy 0, policy_version 199843 (0.0035) [2024-06-28 09:48:38,301][06909] Updated weights for policy 0, policy_version 199853 (0.0027) [2024-06-28 09:48:38,850][06674] Fps is (10 sec: 44237.5, 60 sec: 44236.9, 300 sec: 44154.4). Total num frames: 3274424320. Throughput: 0: 43828.0. Samples: 3177378780. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 09:48:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:48:41,586][06909] Updated weights for policy 0, policy_version 199863 (0.0025) [2024-06-28 09:48:43,852][06674] Fps is (10 sec: 44228.0, 60 sec: 43962.3, 300 sec: 44042.1). Total num frames: 3274637312. Throughput: 0: 44113.5. Samples: 3177511520. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 09:48:43,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:48:45,610][06909] Updated weights for policy 0, policy_version 199873 (0.0036) [2024-06-28 09:48:48,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 3274866688. Throughput: 0: 44205.1. Samples: 3177782120. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 09:48:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:48:48,991][06909] Updated weights for policy 0, policy_version 199883 (0.0033) [2024-06-28 09:48:52,177][06887] Signal inference workers to stop experience collection... (45000 times) [2024-06-28 09:48:52,179][06887] Signal inference workers to resume experience collection... (45000 times) [2024-06-28 09:48:52,196][06909] InferenceWorker_p0-w0: stopping experience collection (45000 times) [2024-06-28 09:48:52,196][06909] InferenceWorker_p0-w0: resuming experience collection (45000 times) [2024-06-28 09:48:52,924][06909] Updated weights for policy 0, policy_version 199893 (0.0035) [2024-06-28 09:48:53,850][06674] Fps is (10 sec: 42607.6, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 3275063296. Throughput: 0: 44112.1. Samples: 3178046560. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 09:48:53,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 09:48:56,382][06909] Updated weights for policy 0, policy_version 199903 (0.0027) [2024-06-28 09:48:58,850][06674] Fps is (10 sec: 42598.1, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 3275292672. Throughput: 0: 44117.7. Samples: 3178176000. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 09:48:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:49:00,439][06909] Updated weights for policy 0, policy_version 199913 (0.0027) [2024-06-28 09:49:03,850][06674] Fps is (10 sec: 45874.6, 60 sec: 44236.9, 300 sec: 44097.9). Total num frames: 3275522048. Throughput: 0: 44124.2. Samples: 3178442920. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 09:49:03,850][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 09:49:03,985][06909] Updated weights for policy 0, policy_version 199923 (0.0030) [2024-06-28 09:49:07,906][06909] Updated weights for policy 0, policy_version 199933 (0.0034) [2024-06-28 09:49:08,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 3275735040. Throughput: 0: 44009.9. Samples: 3178705340. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 09:49:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:49:11,312][06909] Updated weights for policy 0, policy_version 199943 (0.0029) [2024-06-28 09:49:13,856][06674] Fps is (10 sec: 44210.0, 60 sec: 43959.3, 300 sec: 44097.1). Total num frames: 3275964416. Throughput: 0: 44088.8. Samples: 3178836580. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 09:49:13,857][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:49:15,144][06909] Updated weights for policy 0, policy_version 199953 (0.0032) [2024-06-28 09:49:18,727][06909] Updated weights for policy 0, policy_version 199963 (0.0039) [2024-06-28 09:49:18,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44237.0, 300 sec: 44098.0). Total num frames: 3276193792. Throughput: 0: 44191.7. Samples: 3179110220. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 09:49:18,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 09:49:22,694][06909] Updated weights for policy 0, policy_version 199973 (0.0028) [2024-06-28 09:49:23,850][06674] Fps is (10 sec: 44263.6, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 3276406784. Throughput: 0: 44290.6. Samples: 3179371860. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 09:49:23,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 09:49:26,034][06909] Updated weights for policy 0, policy_version 199983 (0.0032) [2024-06-28 09:49:28,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 3276636160. Throughput: 0: 44227.8. Samples: 3179501680. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 09:49:28,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:49:29,988][06909] Updated weights for policy 0, policy_version 199993 (0.0032) [2024-06-28 09:49:33,296][06909] Updated weights for policy 0, policy_version 200003 (0.0048) [2024-06-28 09:49:33,850][06674] Fps is (10 sec: 47513.8, 60 sec: 44783.0, 300 sec: 44153.5). Total num frames: 3276881920. Throughput: 0: 44342.6. Samples: 3179777540. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 09:49:33,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 09:49:37,657][06909] Updated weights for policy 0, policy_version 200013 (0.0045) [2024-06-28 09:49:38,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43690.7, 300 sec: 44042.7). Total num frames: 3277045760. Throughput: 0: 44228.4. Samples: 3180036840. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 09:49:38,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 09:49:40,862][06909] Updated weights for policy 0, policy_version 200023 (0.0033) [2024-06-28 09:49:43,850][06674] Fps is (10 sec: 39321.1, 60 sec: 43965.1, 300 sec: 43986.9). Total num frames: 3277275136. Throughput: 0: 44179.0. Samples: 3180164060. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 09:49:43,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:49:44,798][06909] Updated weights for policy 0, policy_version 200033 (0.0036) [2024-06-28 09:49:48,271][06909] Updated weights for policy 0, policy_version 200043 (0.0030) [2024-06-28 09:49:48,850][06674] Fps is (10 sec: 49151.9, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 3277537280. Throughput: 0: 44156.0. Samples: 3180429940. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 09:49:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:49:48,856][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000200045_3277537280.pth... [2024-06-28 09:49:48,910][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000199398_3266936832.pth [2024-06-28 09:49:52,386][06909] Updated weights for policy 0, policy_version 200053 (0.0022) [2024-06-28 09:49:53,850][06674] Fps is (10 sec: 44237.5, 60 sec: 44236.7, 300 sec: 44098.0). Total num frames: 3277717504. Throughput: 0: 44326.2. Samples: 3180700020. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 09:49:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:49:55,730][06909] Updated weights for policy 0, policy_version 200063 (0.0036) [2024-06-28 09:49:58,850][06674] Fps is (10 sec: 40960.2, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 3277946880. Throughput: 0: 44167.8. Samples: 3180823860. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 09:49:58,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 09:49:59,711][06909] Updated weights for policy 0, policy_version 200073 (0.0041) [2024-06-28 09:50:03,097][06909] Updated weights for policy 0, policy_version 200083 (0.0027) [2024-06-28 09:50:03,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 3278192640. Throughput: 0: 44144.4. Samples: 3181096720. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 09:50:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:50:07,289][06909] Updated weights for policy 0, policy_version 200093 (0.0023) [2024-06-28 09:50:08,850][06674] Fps is (10 sec: 44233.8, 60 sec: 44236.3, 300 sec: 44153.8). Total num frames: 3278389248. Throughput: 0: 44415.9. Samples: 3181370600. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 09:50:08,851][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 09:50:09,993][06887] Signal inference workers to stop experience collection... (45050 times) [2024-06-28 09:50:09,993][06887] Signal inference workers to resume experience collection... (45050 times) [2024-06-28 09:50:10,033][06909] InferenceWorker_p0-w0: stopping experience collection (45050 times) [2024-06-28 09:50:10,033][06909] InferenceWorker_p0-w0: resuming experience collection (45050 times) [2024-06-28 09:50:10,522][06909] Updated weights for policy 0, policy_version 200103 (0.0030) [2024-06-28 09:50:13,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44241.3, 300 sec: 44098.0). Total num frames: 3278618624. Throughput: 0: 44207.7. Samples: 3181491020. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 09:50:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:50:14,762][06909] Updated weights for policy 0, policy_version 200113 (0.0031) [2024-06-28 09:50:17,887][06909] Updated weights for policy 0, policy_version 200123 (0.0031) [2024-06-28 09:50:18,850][06674] Fps is (10 sec: 44239.7, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 3278831616. Throughput: 0: 44041.8. Samples: 3181759420. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 09:50:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:50:22,292][06909] Updated weights for policy 0, policy_version 200133 (0.0045) [2024-06-28 09:50:23,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 3279044608. Throughput: 0: 44125.2. Samples: 3182022480. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 09:50:23,856][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:50:25,167][06909] Updated weights for policy 0, policy_version 200143 (0.0033) [2024-06-28 09:50:28,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3279257600. Throughput: 0: 44038.4. Samples: 3182145780. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 09:50:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:50:29,568][06909] Updated weights for policy 0, policy_version 200153 (0.0033) [2024-06-28 09:50:32,770][06909] Updated weights for policy 0, policy_version 200163 (0.0031) [2024-06-28 09:50:33,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43690.6, 300 sec: 44153.5). Total num frames: 3279503360. Throughput: 0: 44008.4. Samples: 3182410320. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 09:50:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:50:36,760][06909] Updated weights for policy 0, policy_version 200173 (0.0040) [2024-06-28 09:50:38,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44509.8, 300 sec: 44209.0). Total num frames: 3279716352. Throughput: 0: 44270.1. Samples: 3182692180. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 09:50:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:50:40,183][06909] Updated weights for policy 0, policy_version 200183 (0.0035) [2024-06-28 09:50:43,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44509.9, 300 sec: 44097.9). Total num frames: 3279945728. Throughput: 0: 44299.4. Samples: 3182817340. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 09:50:43,864][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 09:50:44,453][06909] Updated weights for policy 0, policy_version 200193 (0.0034) [2024-06-28 09:50:47,564][06909] Updated weights for policy 0, policy_version 200203 (0.0027) [2024-06-28 09:50:48,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.6, 300 sec: 44209.0). Total num frames: 3280175104. Throughput: 0: 44143.4. Samples: 3183083180. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 09:50:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:50:51,762][06909] Updated weights for policy 0, policy_version 200213 (0.0041) [2024-06-28 09:50:53,850][06674] Fps is (10 sec: 42598.9, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 3280371712. Throughput: 0: 43872.2. Samples: 3183344820. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 09:50:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:50:55,007][06909] Updated weights for policy 0, policy_version 200223 (0.0027) [2024-06-28 09:50:58,856][06674] Fps is (10 sec: 42573.0, 60 sec: 44232.3, 300 sec: 44097.4). Total num frames: 3280601088. Throughput: 0: 44138.0. Samples: 3183477500. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 09:50:58,856][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 09:50:58,948][06909] Updated weights for policy 0, policy_version 200233 (0.0022) [2024-06-28 09:51:02,436][06909] Updated weights for policy 0, policy_version 200243 (0.0035) [2024-06-28 09:51:03,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43690.6, 300 sec: 44153.8). Total num frames: 3280814080. Throughput: 0: 44009.3. Samples: 3183739840. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 09:51:03,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 09:51:06,469][06909] Updated weights for policy 0, policy_version 200253 (0.0030) [2024-06-28 09:51:08,850][06674] Fps is (10 sec: 42624.4, 60 sec: 43964.2, 300 sec: 44098.0). Total num frames: 3281027072. Throughput: 0: 44164.1. Samples: 3184009860. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 09:51:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:51:09,869][06909] Updated weights for policy 0, policy_version 200263 (0.0032) [2024-06-28 09:51:13,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.6, 300 sec: 44098.9). Total num frames: 3281256448. Throughput: 0: 44371.0. Samples: 3184142480. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 09:51:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:51:14,130][06909] Updated weights for policy 0, policy_version 200273 (0.0035) [2024-06-28 09:51:17,103][06909] Updated weights for policy 0, policy_version 200283 (0.0038) [2024-06-28 09:51:18,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 3281485824. Throughput: 0: 44293.9. Samples: 3184403540. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 09:51:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:51:21,579][06909] Updated weights for policy 0, policy_version 200293 (0.0038) [2024-06-28 09:51:23,850][06674] Fps is (10 sec: 44237.6, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 3281698816. Throughput: 0: 43914.8. Samples: 3184668340. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 09:51:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 09:51:24,638][06909] Updated weights for policy 0, policy_version 200303 (0.0038) [2024-06-28 09:51:28,689][06909] Updated weights for policy 0, policy_version 200313 (0.0022) [2024-06-28 09:51:28,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 3281928192. Throughput: 0: 44129.3. Samples: 3184803160. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 09:51:28,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:51:32,278][06909] Updated weights for policy 0, policy_version 200323 (0.0037) [2024-06-28 09:51:33,852][06674] Fps is (10 sec: 44227.4, 60 sec: 43962.3, 300 sec: 44208.7). Total num frames: 3282141184. Throughput: 0: 44003.4. Samples: 3185063420. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 09:51:33,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:51:35,853][06909] Updated weights for policy 0, policy_version 200333 (0.0039) [2024-06-28 09:51:38,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 3282370560. Throughput: 0: 44232.4. Samples: 3185335280. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 09:51:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:51:39,608][06909] Updated weights for policy 0, policy_version 200343 (0.0025) [2024-06-28 09:51:42,087][06887] Signal inference workers to stop experience collection... (45100 times) [2024-06-28 09:51:42,134][06909] InferenceWorker_p0-w0: stopping experience collection (45100 times) [2024-06-28 09:51:42,193][06887] Signal inference workers to resume experience collection... (45100 times) [2024-06-28 09:51:42,193][06909] InferenceWorker_p0-w0: resuming experience collection (45100 times) [2024-06-28 09:51:43,632][06909] Updated weights for policy 0, policy_version 200353 (0.0026) [2024-06-28 09:51:43,850][06674] Fps is (10 sec: 45884.1, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 3282599936. Throughput: 0: 44207.6. Samples: 3185466580. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 09:51:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:51:46,770][06909] Updated weights for policy 0, policy_version 200363 (0.0021) [2024-06-28 09:51:48,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 3282812928. Throughput: 0: 44364.1. Samples: 3185736220. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 09:51:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:51:48,901][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000200368_3282829312.pth... [2024-06-28 09:51:48,952][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000199721_3272228864.pth [2024-06-28 09:51:51,236][06909] Updated weights for policy 0, policy_version 200373 (0.0033) [2024-06-28 09:51:53,850][06674] Fps is (10 sec: 44237.5, 60 sec: 44509.9, 300 sec: 44209.0). Total num frames: 3283042304. Throughput: 0: 44279.6. Samples: 3186002440. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 09:51:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:51:54,232][06909] Updated weights for policy 0, policy_version 200383 (0.0033) [2024-06-28 09:51:58,515][06909] Updated weights for policy 0, policy_version 200393 (0.0027) [2024-06-28 09:51:58,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43968.2, 300 sec: 44042.4). Total num frames: 3283238912. Throughput: 0: 44273.4. Samples: 3186134780. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 09:51:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:52:01,615][06909] Updated weights for policy 0, policy_version 200403 (0.0040) [2024-06-28 09:52:03,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.9, 300 sec: 44209.0). Total num frames: 3283468288. Throughput: 0: 44202.7. Samples: 3186392660. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 09:52:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:52:05,840][06909] Updated weights for policy 0, policy_version 200413 (0.0035) [2024-06-28 09:52:08,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 3283697664. Throughput: 0: 44152.7. Samples: 3186655220. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 09:52:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 09:52:09,418][06909] Updated weights for policy 0, policy_version 200423 (0.0032) [2024-06-28 09:52:13,376][06909] Updated weights for policy 0, policy_version 200433 (0.0039) [2024-06-28 09:52:13,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3283894272. Throughput: 0: 44066.8. Samples: 3186786160. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 09:52:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 09:52:16,677][06909] Updated weights for policy 0, policy_version 200443 (0.0031) [2024-06-28 09:52:18,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.6, 300 sec: 44209.0). Total num frames: 3284123648. Throughput: 0: 44189.0. Samples: 3187051840. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 09:52:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:52:21,220][06909] Updated weights for policy 0, policy_version 200453 (0.0033) [2024-06-28 09:52:23,850][06674] Fps is (10 sec: 45874.5, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 3284353024. Throughput: 0: 43993.2. Samples: 3187314980. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 09:52:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 09:52:24,265][06909] Updated weights for policy 0, policy_version 200463 (0.0050) [2024-06-28 09:52:28,494][06909] Updated weights for policy 0, policy_version 200473 (0.0029) [2024-06-28 09:52:28,850][06674] Fps is (10 sec: 42599.2, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 3284549632. Throughput: 0: 44014.8. Samples: 3187447240. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 09:52:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:52:31,945][06909] Updated weights for policy 0, policy_version 200483 (0.0040) [2024-06-28 09:52:33,856][06674] Fps is (10 sec: 42573.2, 60 sec: 43960.8, 300 sec: 44097.1). Total num frames: 3284779008. Throughput: 0: 43848.3. Samples: 3187709660. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 09:52:33,856][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 09:52:35,990][06909] Updated weights for policy 0, policy_version 200493 (0.0026) [2024-06-28 09:52:38,850][06674] Fps is (10 sec: 47513.3, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 3285024768. Throughput: 0: 43979.0. Samples: 3187981500. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 09:52:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:52:39,191][06909] Updated weights for policy 0, policy_version 200503 (0.0038) [2024-06-28 09:52:43,584][06909] Updated weights for policy 0, policy_version 200513 (0.0037) [2024-06-28 09:52:43,850][06674] Fps is (10 sec: 44263.6, 60 sec: 43690.8, 300 sec: 44042.4). Total num frames: 3285221376. Throughput: 0: 43778.7. Samples: 3188104820. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 09:52:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:52:46,644][06909] Updated weights for policy 0, policy_version 200523 (0.0039) [2024-06-28 09:52:48,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 3285450752. Throughput: 0: 43899.5. Samples: 3188368140. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 09:52:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:52:50,941][06909] Updated weights for policy 0, policy_version 200533 (0.0034) [2024-06-28 09:52:53,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43963.6, 300 sec: 44209.0). Total num frames: 3285680128. Throughput: 0: 43942.6. Samples: 3188632640. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 09:52:53,854][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:52:54,116][06909] Updated weights for policy 0, policy_version 200543 (0.0037) [2024-06-28 09:52:58,249][06909] Updated weights for policy 0, policy_version 200553 (0.0035) [2024-06-28 09:52:58,850][06674] Fps is (10 sec: 40959.1, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 3285860352. Throughput: 0: 44127.8. Samples: 3188771920. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 09:52:58,851][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 09:53:01,350][06909] Updated weights for policy 0, policy_version 200563 (0.0032) [2024-06-28 09:53:03,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.6, 300 sec: 44097.9). Total num frames: 3286106112. Throughput: 0: 43994.7. Samples: 3189031600. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 09:53:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:53:04,091][06887] Signal inference workers to stop experience collection... (45150 times) [2024-06-28 09:53:04,135][06909] InferenceWorker_p0-w0: stopping experience collection (45150 times) [2024-06-28 09:53:04,144][06887] Signal inference workers to resume experience collection... (45150 times) [2024-06-28 09:53:04,149][06909] InferenceWorker_p0-w0: resuming experience collection (45150 times) [2024-06-28 09:53:05,809][06909] Updated weights for policy 0, policy_version 200573 (0.0039) [2024-06-28 09:53:08,840][06909] Updated weights for policy 0, policy_version 200583 (0.0036) [2024-06-28 09:53:08,850][06674] Fps is (10 sec: 49152.6, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 3286351872. Throughput: 0: 43916.0. Samples: 3189291200. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 09:53:08,854][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:53:13,159][06909] Updated weights for policy 0, policy_version 200593 (0.0031) [2024-06-28 09:53:13,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3286532096. Throughput: 0: 44057.2. Samples: 3189429820. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 09:53:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:53:16,525][06909] Updated weights for policy 0, policy_version 200603 (0.0034) [2024-06-28 09:53:18,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43963.9, 300 sec: 44098.0). Total num frames: 3286761472. Throughput: 0: 43973.5. Samples: 3189688200. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 09:53:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:53:20,691][06909] Updated weights for policy 0, policy_version 200613 (0.0033) [2024-06-28 09:53:23,856][06674] Fps is (10 sec: 45847.4, 60 sec: 43959.3, 300 sec: 44097.1). Total num frames: 3286990848. Throughput: 0: 43675.9. Samples: 3189947180. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 09:53:23,857][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:53:24,085][06909] Updated weights for policy 0, policy_version 200623 (0.0027) [2024-06-28 09:53:28,147][06909] Updated weights for policy 0, policy_version 200633 (0.0034) [2024-06-28 09:53:28,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3287187456. Throughput: 0: 43796.9. Samples: 3190075680. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 09:53:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:53:31,366][06909] Updated weights for policy 0, policy_version 200643 (0.0021) [2024-06-28 09:53:33,850][06674] Fps is (10 sec: 44263.5, 60 sec: 44241.2, 300 sec: 44097.9). Total num frames: 3287433216. Throughput: 0: 43940.3. Samples: 3190345460. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 09:53:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 09:53:35,816][06909] Updated weights for policy 0, policy_version 200653 (0.0034) [2024-06-28 09:53:38,740][06909] Updated weights for policy 0, policy_version 200663 (0.0026) [2024-06-28 09:53:38,850][06674] Fps is (10 sec: 47513.0, 60 sec: 43963.7, 300 sec: 44153.8). Total num frames: 3287662592. Throughput: 0: 43853.3. Samples: 3190606040. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 09:53:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:53:43,039][06909] Updated weights for policy 0, policy_version 200673 (0.0038) [2024-06-28 09:53:43,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3287859200. Throughput: 0: 43900.7. Samples: 3190747440. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 09:53:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:53:46,190][06909] Updated weights for policy 0, policy_version 200683 (0.0028) [2024-06-28 09:53:48,852][06674] Fps is (10 sec: 42589.3, 60 sec: 43962.1, 300 sec: 44153.1). Total num frames: 3288088576. Throughput: 0: 43908.6. Samples: 3191007580. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 09:53:48,853][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:53:48,877][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000200689_3288088576.pth... [2024-06-28 09:53:48,937][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000200045_3277537280.pth [2024-06-28 09:53:50,423][06909] Updated weights for policy 0, policy_version 200693 (0.0027) [2024-06-28 09:53:53,817][06909] Updated weights for policy 0, policy_version 200703 (0.0028) [2024-06-28 09:53:53,850][06674] Fps is (10 sec: 45874.3, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 3288317952. Throughput: 0: 43834.1. Samples: 3191263740. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 09:53:53,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:53:58,181][06909] Updated weights for policy 0, policy_version 200713 (0.0030) [2024-06-28 09:53:58,852][06674] Fps is (10 sec: 42599.1, 60 sec: 44235.4, 300 sec: 44042.1). Total num frames: 3288514560. Throughput: 0: 43807.0. Samples: 3191401220. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 09:53:58,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:54:01,006][06909] Updated weights for policy 0, policy_version 200723 (0.0032) [2024-06-28 09:54:03,850][06674] Fps is (10 sec: 44237.5, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 3288760320. Throughput: 0: 43976.4. Samples: 3191667140. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 09:54:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:54:05,376][06909] Updated weights for policy 0, policy_version 200733 (0.0027) [2024-06-28 09:54:08,522][06909] Updated weights for policy 0, policy_version 200743 (0.0027) [2024-06-28 09:54:08,850][06674] Fps is (10 sec: 45884.1, 60 sec: 43690.6, 300 sec: 44098.9). Total num frames: 3288973312. Throughput: 0: 44137.0. Samples: 3191933080. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 09:54:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:54:12,909][06909] Updated weights for policy 0, policy_version 200753 (0.0030) [2024-06-28 09:54:13,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3289186304. Throughput: 0: 44269.3. Samples: 3192067800. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 09:54:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:54:15,779][06909] Updated weights for policy 0, policy_version 200763 (0.0035) [2024-06-28 09:54:18,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 3289415680. Throughput: 0: 44105.8. Samples: 3192330220. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 09:54:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:54:20,119][06909] Updated weights for policy 0, policy_version 200773 (0.0022) [2024-06-28 09:54:23,199][06909] Updated weights for policy 0, policy_version 200783 (0.0028) [2024-06-28 09:54:23,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43968.2, 300 sec: 44042.4). Total num frames: 3289628672. Throughput: 0: 44038.8. Samples: 3192587780. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 09:54:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 09:54:27,605][06909] Updated weights for policy 0, policy_version 200793 (0.0030) [2024-06-28 09:54:28,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 3289825280. Throughput: 0: 43900.3. Samples: 3192722960. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 09:54:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:54:30,928][06909] Updated weights for policy 0, policy_version 200803 (0.0030) [2024-06-28 09:54:33,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 3290071040. Throughput: 0: 44112.4. Samples: 3192992540. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 09:54:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:54:34,937][06909] Updated weights for policy 0, policy_version 200813 (0.0034) [2024-06-28 09:54:38,165][06887] Signal inference workers to stop experience collection... (45200 times) [2024-06-28 09:54:38,165][06887] Signal inference workers to resume experience collection... (45200 times) [2024-06-28 09:54:38,185][06909] InferenceWorker_p0-w0: stopping experience collection (45200 times) [2024-06-28 09:54:38,185][06909] InferenceWorker_p0-w0: resuming experience collection (45200 times) [2024-06-28 09:54:38,307][06909] Updated weights for policy 0, policy_version 200823 (0.0029) [2024-06-28 09:54:38,852][06674] Fps is (10 sec: 47504.1, 60 sec: 43962.3, 300 sec: 44153.2). Total num frames: 3290300416. Throughput: 0: 44181.7. Samples: 3193252000. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 09:54:38,853][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:54:42,604][06909] Updated weights for policy 0, policy_version 200833 (0.0040) [2024-06-28 09:54:43,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3290513408. Throughput: 0: 44127.3. Samples: 3193386860. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 09:54:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:54:45,746][06909] Updated weights for policy 0, policy_version 200843 (0.0041) [2024-06-28 09:54:48,850][06674] Fps is (10 sec: 42607.3, 60 sec: 43965.4, 300 sec: 44098.0). Total num frames: 3290726400. Throughput: 0: 43946.7. Samples: 3193644740. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 09:54:48,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 09:54:50,221][06909] Updated weights for policy 0, policy_version 200853 (0.0027) [2024-06-28 09:54:53,276][06909] Updated weights for policy 0, policy_version 200863 (0.0031) [2024-06-28 09:54:53,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 3290972160. Throughput: 0: 43851.2. Samples: 3193906380. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 09:54:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:54:57,509][06909] Updated weights for policy 0, policy_version 200873 (0.0027) [2024-06-28 09:54:58,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44238.3, 300 sec: 43986.9). Total num frames: 3291168768. Throughput: 0: 43940.0. Samples: 3194045100. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 09:54:58,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 09:55:00,812][06909] Updated weights for policy 0, policy_version 200883 (0.0024) [2024-06-28 09:55:03,854][06674] Fps is (10 sec: 42580.6, 60 sec: 43960.7, 300 sec: 44097.4). Total num frames: 3291398144. Throughput: 0: 44010.1. Samples: 3194310860. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 09:55:03,854][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 09:55:04,794][06909] Updated weights for policy 0, policy_version 200893 (0.0033) [2024-06-28 09:55:08,256][06909] Updated weights for policy 0, policy_version 200903 (0.0039) [2024-06-28 09:55:08,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 3291627520. Throughput: 0: 44079.5. Samples: 3194571360. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 09:55:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 09:55:12,239][06909] Updated weights for policy 0, policy_version 200913 (0.0037) [2024-06-28 09:55:13,850][06674] Fps is (10 sec: 42616.6, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3291824128. Throughput: 0: 43916.6. Samples: 3194699200. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 09:55:13,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 09:55:15,537][06909] Updated weights for policy 0, policy_version 200923 (0.0032) [2024-06-28 09:55:18,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 3292053504. Throughput: 0: 44025.2. Samples: 3194973680. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 09:55:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:55:19,813][06909] Updated weights for policy 0, policy_version 200933 (0.0030) [2024-06-28 09:55:22,684][06909] Updated weights for policy 0, policy_version 200943 (0.0028) [2024-06-28 09:55:23,851][06674] Fps is (10 sec: 45870.0, 60 sec: 44236.0, 300 sec: 44153.3). Total num frames: 3292282880. Throughput: 0: 44023.7. Samples: 3195233020. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 09:55:23,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:55:27,001][06909] Updated weights for policy 0, policy_version 200953 (0.0035) [2024-06-28 09:55:28,850][06674] Fps is (10 sec: 44236.0, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 3292495872. Throughput: 0: 44130.5. Samples: 3195372740. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 09:55:28,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:55:29,981][06909] Updated weights for policy 0, policy_version 200963 (0.0040) [2024-06-28 09:55:33,850][06674] Fps is (10 sec: 42602.5, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3292708864. Throughput: 0: 44356.4. Samples: 3195640780. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 09:55:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:55:34,632][06909] Updated weights for policy 0, policy_version 200973 (0.0033) [2024-06-28 09:55:37,721][06909] Updated weights for policy 0, policy_version 200983 (0.0032) [2024-06-28 09:55:38,850][06674] Fps is (10 sec: 44237.9, 60 sec: 43965.3, 300 sec: 44042.4). Total num frames: 3292938240. Throughput: 0: 44291.6. Samples: 3195899500. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 09:55:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 09:55:41,877][06909] Updated weights for policy 0, policy_version 200993 (0.0040) [2024-06-28 09:55:43,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3293151232. Throughput: 0: 44266.7. Samples: 3196037100. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 09:55:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:55:44,991][06909] Updated weights for policy 0, policy_version 201003 (0.0027) [2024-06-28 09:55:48,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3293364224. Throughput: 0: 44209.0. Samples: 3196300080. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 09:55:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:55:48,890][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000201012_3293380608.pth... [2024-06-28 09:55:48,934][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000200368_3282829312.pth [2024-06-28 09:55:49,458][06909] Updated weights for policy 0, policy_version 201013 (0.0029) [2024-06-28 09:55:52,380][06909] Updated weights for policy 0, policy_version 201023 (0.0037) [2024-06-28 09:55:53,850][06674] Fps is (10 sec: 44235.4, 60 sec: 43690.5, 300 sec: 44043.3). Total num frames: 3293593600. Throughput: 0: 44320.7. Samples: 3196565800. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 09:55:53,851][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 09:55:57,052][06909] Updated weights for policy 0, policy_version 201033 (0.0037) [2024-06-28 09:55:58,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3293806592. Throughput: 0: 44405.6. Samples: 3196697460. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 09:55:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:55:59,626][06909] Updated weights for policy 0, policy_version 201043 (0.0031) [2024-06-28 09:56:03,850][06674] Fps is (10 sec: 40961.3, 60 sec: 43420.7, 300 sec: 43986.9). Total num frames: 3294003200. Throughput: 0: 44091.2. Samples: 3196957780. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 09:56:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:56:04,355][06909] Updated weights for policy 0, policy_version 201053 (0.0037) [2024-06-28 09:56:07,190][06909] Updated weights for policy 0, policy_version 201063 (0.0024) [2024-06-28 09:56:08,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 3294248960. Throughput: 0: 44214.2. Samples: 3197222620. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 09:56:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:56:11,631][06909] Updated weights for policy 0, policy_version 201073 (0.0032) [2024-06-28 09:56:11,947][06887] Signal inference workers to stop experience collection... (45250 times) [2024-06-28 09:56:11,998][06909] InferenceWorker_p0-w0: stopping experience collection (45250 times) [2024-06-28 09:56:12,007][06887] Signal inference workers to resume experience collection... (45250 times) [2024-06-28 09:56:12,009][06909] InferenceWorker_p0-w0: resuming experience collection (45250 times) [2024-06-28 09:56:13,850][06674] Fps is (10 sec: 49152.1, 60 sec: 44509.8, 300 sec: 44098.0). Total num frames: 3294494720. Throughput: 0: 44202.9. Samples: 3197361860. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 09:56:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:56:14,658][06909] Updated weights for policy 0, policy_version 201083 (0.0033) [2024-06-28 09:56:18,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3294691328. Throughput: 0: 43982.6. Samples: 3197620000. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 09:56:18,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 09:56:19,124][06909] Updated weights for policy 0, policy_version 201093 (0.0022) [2024-06-28 09:56:22,282][06909] Updated weights for policy 0, policy_version 201103 (0.0048) [2024-06-28 09:56:23,850][06674] Fps is (10 sec: 40959.4, 60 sec: 43691.4, 300 sec: 43986.9). Total num frames: 3294904320. Throughput: 0: 44199.9. Samples: 3197888500. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 09:56:23,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:56:26,620][06909] Updated weights for policy 0, policy_version 201113 (0.0021) [2024-06-28 09:56:28,852][06674] Fps is (10 sec: 45865.9, 60 sec: 44235.4, 300 sec: 44097.9). Total num frames: 3295150080. Throughput: 0: 44092.5. Samples: 3198021360. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 09:56:28,853][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:56:29,495][06909] Updated weights for policy 0, policy_version 201123 (0.0028) [2024-06-28 09:56:33,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43963.6, 300 sec: 43986.8). Total num frames: 3295346688. Throughput: 0: 44110.8. Samples: 3198285080. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 09:56:33,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:56:34,007][06909] Updated weights for policy 0, policy_version 201133 (0.0035) [2024-06-28 09:56:36,759][06909] Updated weights for policy 0, policy_version 201143 (0.0046) [2024-06-28 09:56:38,850][06674] Fps is (10 sec: 42606.8, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 3295576064. Throughput: 0: 44045.5. Samples: 3198547840. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 09:56:38,853][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:56:41,419][06909] Updated weights for policy 0, policy_version 201153 (0.0030) [2024-06-28 09:56:43,850][06674] Fps is (10 sec: 47513.8, 60 sec: 44509.7, 300 sec: 44097.9). Total num frames: 3295821824. Throughput: 0: 44137.2. Samples: 3198683640. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 09:56:43,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:56:44,187][06909] Updated weights for policy 0, policy_version 201163 (0.0035) [2024-06-28 09:56:48,850][06674] Fps is (10 sec: 42599.3, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 3296002048. Throughput: 0: 44149.3. Samples: 3198944500. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 09:56:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:56:48,981][06909] Updated weights for policy 0, policy_version 201173 (0.0036) [2024-06-28 09:56:52,095][06909] Updated weights for policy 0, policy_version 201183 (0.0040) [2024-06-28 09:56:53,850][06674] Fps is (10 sec: 39322.4, 60 sec: 43690.9, 300 sec: 43986.9). Total num frames: 3296215040. Throughput: 0: 44056.6. Samples: 3199205160. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 09:56:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:56:56,266][06909] Updated weights for policy 0, policy_version 201193 (0.0032) [2024-06-28 09:56:58,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3296460800. Throughput: 0: 43775.0. Samples: 3199331740. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 09:56:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:56:59,552][06909] Updated weights for policy 0, policy_version 201203 (0.0030) [2024-06-28 09:57:03,827][06909] Updated weights for policy 0, policy_version 201213 (0.0034) [2024-06-28 09:57:03,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44509.7, 300 sec: 43986.9). Total num frames: 3296673792. Throughput: 0: 44090.6. Samples: 3199604080. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 09:57:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:57:06,819][06909] Updated weights for policy 0, policy_version 201223 (0.0031) [2024-06-28 09:57:08,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3296870400. Throughput: 0: 43946.2. Samples: 3199866080. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 09:57:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:57:11,189][06909] Updated weights for policy 0, policy_version 201233 (0.0036) [2024-06-28 09:57:13,850][06674] Fps is (10 sec: 45872.5, 60 sec: 43963.2, 300 sec: 44097.9). Total num frames: 3297132544. Throughput: 0: 43879.1. Samples: 3199995860. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 09:57:13,851][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 09:57:14,143][06909] Updated weights for policy 0, policy_version 201243 (0.0034) [2024-06-28 09:57:18,842][06909] Updated weights for policy 0, policy_version 201253 (0.0021) [2024-06-28 09:57:18,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3297329152. Throughput: 0: 43834.5. Samples: 3200257620. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 09:57:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:57:21,659][06909] Updated weights for policy 0, policy_version 201263 (0.0031) [2024-06-28 09:57:23,850][06674] Fps is (10 sec: 39324.1, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3297525760. Throughput: 0: 43939.6. Samples: 3200525120. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 09:57:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 09:57:26,263][06909] Updated weights for policy 0, policy_version 201273 (0.0038) [2024-06-28 09:57:28,850][06674] Fps is (10 sec: 47513.4, 60 sec: 44238.3, 300 sec: 44154.4). Total num frames: 3297804288. Throughput: 0: 43822.8. Samples: 3200655660. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 09:57:28,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:57:29,209][06909] Updated weights for policy 0, policy_version 201283 (0.0035) [2024-06-28 09:57:33,451][06909] Updated weights for policy 0, policy_version 201293 (0.0030) [2024-06-28 09:57:33,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43963.9, 300 sec: 43931.3). Total num frames: 3297984512. Throughput: 0: 43816.0. Samples: 3200916220. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 09:57:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:57:35,897][06887] Signal inference workers to stop experience collection... (45300 times) [2024-06-28 09:57:35,927][06909] InferenceWorker_p0-w0: stopping experience collection (45300 times) [2024-06-28 09:57:36,009][06887] Signal inference workers to resume experience collection... (45300 times) [2024-06-28 09:57:36,009][06909] InferenceWorker_p0-w0: resuming experience collection (45300 times) [2024-06-28 09:57:36,513][06909] Updated weights for policy 0, policy_version 201303 (0.0040) [2024-06-28 09:57:38,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3298213888. Throughput: 0: 44068.4. Samples: 3201188240. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 09:57:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:57:41,110][06909] Updated weights for policy 0, policy_version 201313 (0.0047) [2024-06-28 09:57:43,721][06909] Updated weights for policy 0, policy_version 201323 (0.0042) [2024-06-28 09:57:43,850][06674] Fps is (10 sec: 49152.0, 60 sec: 44237.0, 300 sec: 44153.5). Total num frames: 3298476032. Throughput: 0: 44181.8. Samples: 3201319920. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 09:57:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:57:48,562][06909] Updated weights for policy 0, policy_version 201333 (0.0032) [2024-06-28 09:57:48,850][06674] Fps is (10 sec: 42597.7, 60 sec: 43963.6, 300 sec: 43931.3). Total num frames: 3298639872. Throughput: 0: 44036.4. Samples: 3201585720. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 09:57:48,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:57:48,858][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000201333_3298639872.pth... [2024-06-28 09:57:48,906][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000200689_3288088576.pth [2024-06-28 09:57:51,214][06909] Updated weights for policy 0, policy_version 201343 (0.0027) [2024-06-28 09:57:53,850][06674] Fps is (10 sec: 37683.3, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3298852864. Throughput: 0: 44139.3. Samples: 3201852340. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 09:57:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 09:57:55,833][06909] Updated weights for policy 0, policy_version 201353 (0.0039) [2024-06-28 09:57:58,638][06909] Updated weights for policy 0, policy_version 201363 (0.0031) [2024-06-28 09:57:58,850][06674] Fps is (10 sec: 49153.0, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 3299131392. Throughput: 0: 44267.8. Samples: 3201987880. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 09:57:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:58:03,199][06909] Updated weights for policy 0, policy_version 201373 (0.0029) [2024-06-28 09:58:03,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 3299311616. Throughput: 0: 44239.1. Samples: 3202248380. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 09:58:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:58:06,117][06909] Updated weights for policy 0, policy_version 201383 (0.0033) [2024-06-28 09:58:08,850][06674] Fps is (10 sec: 39321.4, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3299524608. Throughput: 0: 44204.9. Samples: 3202514340. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 09:58:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:58:10,871][06909] Updated weights for policy 0, policy_version 201393 (0.0032) [2024-06-28 09:58:13,686][06909] Updated weights for policy 0, policy_version 201403 (0.0033) [2024-06-28 09:58:13,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44237.3, 300 sec: 44153.5). Total num frames: 3299786752. Throughput: 0: 44133.3. Samples: 3202641660. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 09:58:13,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 09:58:18,362][06909] Updated weights for policy 0, policy_version 201413 (0.0023) [2024-06-28 09:58:18,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.8, 300 sec: 44043.3). Total num frames: 3299983360. Throughput: 0: 44400.4. Samples: 3202914240. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 09:58:18,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 09:58:21,261][06909] Updated weights for policy 0, policy_version 201423 (0.0037) [2024-06-28 09:58:23,850][06674] Fps is (10 sec: 39321.9, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 3300179968. Throughput: 0: 44039.2. Samples: 3203170000. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 09:58:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:58:25,774][06909] Updated weights for policy 0, policy_version 201433 (0.0028) [2024-06-28 09:58:28,510][06909] Updated weights for policy 0, policy_version 201443 (0.0026) [2024-06-28 09:58:28,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 3300442112. Throughput: 0: 44026.2. Samples: 3203301100. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 09:58:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:58:32,955][06909] Updated weights for policy 0, policy_version 201453 (0.0036) [2024-06-28 09:58:33,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 3300638720. Throughput: 0: 44151.7. Samples: 3203572540. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 09:58:33,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 09:58:35,683][06909] Updated weights for policy 0, policy_version 201463 (0.0031) [2024-06-28 09:58:38,850][06674] Fps is (10 sec: 40959.4, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 3300851712. Throughput: 0: 44070.5. Samples: 3203835520. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 09:58:38,851][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 09:58:40,526][06909] Updated weights for policy 0, policy_version 201473 (0.0039) [2024-06-28 09:58:43,438][06909] Updated weights for policy 0, policy_version 201483 (0.0033) [2024-06-28 09:58:43,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43690.6, 300 sec: 44098.3). Total num frames: 3301097472. Throughput: 0: 43935.5. Samples: 3203964980. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 09:58:43,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:58:48,191][06909] Updated weights for policy 0, policy_version 201493 (0.0040) [2024-06-28 09:58:48,850][06674] Fps is (10 sec: 45875.8, 60 sec: 44510.0, 300 sec: 44042.4). Total num frames: 3301310464. Throughput: 0: 44016.9. Samples: 3204229140. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 09:58:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:58:51,035][06909] Updated weights for policy 0, policy_version 201503 (0.0042) [2024-06-28 09:58:53,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44509.8, 300 sec: 44098.2). Total num frames: 3301523456. Throughput: 0: 43951.1. Samples: 3204492140. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 09:58:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:58:55,733][06909] Updated weights for policy 0, policy_version 201513 (0.0033) [2024-06-28 09:58:58,368][06909] Updated weights for policy 0, policy_version 201523 (0.0036) [2024-06-28 09:58:58,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 3301752832. Throughput: 0: 43986.8. Samples: 3204621060. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-28 09:58:58,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 09:59:02,861][06909] Updated weights for policy 0, policy_version 201533 (0.0042) [2024-06-28 09:59:03,612][06887] Signal inference workers to stop experience collection... (45350 times) [2024-06-28 09:59:03,619][06887] Signal inference workers to resume experience collection... (45350 times) [2024-06-28 09:59:03,655][06909] InferenceWorker_p0-w0: stopping experience collection (45350 times) [2024-06-28 09:59:03,655][06909] InferenceWorker_p0-w0: resuming experience collection (45350 times) [2024-06-28 09:59:03,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 3301965824. Throughput: 0: 43946.3. Samples: 3204891820. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-28 09:59:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:59:05,675][06909] Updated weights for policy 0, policy_version 201543 (0.0028) [2024-06-28 09:59:08,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 3302195200. Throughput: 0: 44130.6. Samples: 3205155880. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-28 09:59:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:59:10,566][06909] Updated weights for policy 0, policy_version 201553 (0.0031) [2024-06-28 09:59:13,217][06909] Updated weights for policy 0, policy_version 201563 (0.0030) [2024-06-28 09:59:13,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 3302424576. Throughput: 0: 44142.6. Samples: 3205287520. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-28 09:59:13,850][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 09:59:18,037][06909] Updated weights for policy 0, policy_version 201573 (0.0031) [2024-06-28 09:59:18,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3302621184. Throughput: 0: 44063.6. Samples: 3205555400. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-28 09:59:18,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 09:59:20,709][06909] Updated weights for policy 0, policy_version 201583 (0.0039) [2024-06-28 09:59:23,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 3302850560. Throughput: 0: 44013.0. Samples: 3205816100. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-28 09:59:23,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:59:25,322][06909] Updated weights for policy 0, policy_version 201593 (0.0038) [2024-06-28 09:59:27,966][06909] Updated weights for policy 0, policy_version 201603 (0.0037) [2024-06-28 09:59:28,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 3303079936. Throughput: 0: 44110.3. Samples: 3205949940. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-28 09:59:28,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 09:59:32,632][06909] Updated weights for policy 0, policy_version 201613 (0.0045) [2024-06-28 09:59:33,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.8, 300 sec: 43987.2). Total num frames: 3303276544. Throughput: 0: 44042.7. Samples: 3206211060. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-28 09:59:33,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 09:59:35,417][06909] Updated weights for policy 0, policy_version 201623 (0.0037) [2024-06-28 09:59:38,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 3303505920. Throughput: 0: 44129.0. Samples: 3206477940. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-28 09:59:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:59:39,913][06909] Updated weights for policy 0, policy_version 201633 (0.0034) [2024-06-28 09:59:42,703][06909] Updated weights for policy 0, policy_version 201643 (0.0040) [2024-06-28 09:59:43,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 3303735296. Throughput: 0: 44193.8. Samples: 3206609780. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-28 09:59:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 09:59:47,666][06909] Updated weights for policy 0, policy_version 201653 (0.0037) [2024-06-28 09:59:48,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 3303931904. Throughput: 0: 43981.3. Samples: 3206870980. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-28 09:59:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 09:59:48,981][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000201657_3303948288.pth... [2024-06-28 09:59:49,030][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000201012_3293380608.pth [2024-06-28 09:59:50,602][06909] Updated weights for policy 0, policy_version 201663 (0.0033) [2024-06-28 09:59:53,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3304161280. Throughput: 0: 43772.4. Samples: 3207125640. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-28 09:59:53,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 09:59:54,981][06909] Updated weights for policy 0, policy_version 201673 (0.0035) [2024-06-28 09:59:58,290][06909] Updated weights for policy 0, policy_version 201683 (0.0037) [2024-06-28 09:59:58,850][06674] Fps is (10 sec: 47513.8, 60 sec: 44236.8, 300 sec: 44098.6). Total num frames: 3304407040. Throughput: 0: 43854.3. Samples: 3207260960. Policy #0 lag: (min: 0.0, avg: 8.7, max: 21.0) [2024-06-28 09:59:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:00:02,419][06909] Updated weights for policy 0, policy_version 201693 (0.0027) [2024-06-28 10:00:03,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43963.6, 300 sec: 43986.8). Total num frames: 3304603648. Throughput: 0: 43857.6. Samples: 3207529000. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 10:00:03,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:00:05,677][06909] Updated weights for policy 0, policy_version 201703 (0.0038) [2024-06-28 10:00:08,856][06674] Fps is (10 sec: 40935.2, 60 sec: 43686.3, 300 sec: 44041.5). Total num frames: 3304816640. Throughput: 0: 43819.5. Samples: 3207788240. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 10:00:08,856][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:00:09,566][06909] Updated weights for policy 0, policy_version 201713 (0.0033) [2024-06-28 10:00:12,827][06909] Updated weights for policy 0, policy_version 201723 (0.0034) [2024-06-28 10:00:13,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 3305062400. Throughput: 0: 43837.2. Samples: 3207922620. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 10:00:13,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:00:17,192][06909] Updated weights for policy 0, policy_version 201733 (0.0037) [2024-06-28 10:00:18,823][06887] Signal inference workers to stop experience collection... (45400 times) [2024-06-28 10:00:18,850][06674] Fps is (10 sec: 44263.2, 60 sec: 43963.7, 300 sec: 43987.0). Total num frames: 3305259008. Throughput: 0: 43995.9. Samples: 3208190880. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 10:00:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:00:18,874][06909] InferenceWorker_p0-w0: stopping experience collection (45400 times) [2024-06-28 10:00:18,880][06887] Signal inference workers to resume experience collection... (45400 times) [2024-06-28 10:00:18,891][06909] InferenceWorker_p0-w0: resuming experience collection (45400 times) [2024-06-28 10:00:20,458][06909] Updated weights for policy 0, policy_version 201743 (0.0046) [2024-06-28 10:00:23,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3305488384. Throughput: 0: 43841.2. Samples: 3208450800. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 10:00:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:00:24,820][06909] Updated weights for policy 0, policy_version 201753 (0.0035) [2024-06-28 10:00:27,886][06909] Updated weights for policy 0, policy_version 201763 (0.0030) [2024-06-28 10:00:28,852][06674] Fps is (10 sec: 44227.6, 60 sec: 43689.1, 300 sec: 44042.1). Total num frames: 3305701376. Throughput: 0: 43872.1. Samples: 3208584120. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 10:00:28,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:00:32,408][06909] Updated weights for policy 0, policy_version 201773 (0.0031) [2024-06-28 10:00:33,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3305914368. Throughput: 0: 43955.1. Samples: 3208848960. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 10:00:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:00:35,755][06909] Updated weights for policy 0, policy_version 201783 (0.0027) [2024-06-28 10:00:38,850][06674] Fps is (10 sec: 45883.6, 60 sec: 44236.6, 300 sec: 44097.9). Total num frames: 3306160128. Throughput: 0: 43988.7. Samples: 3209105140. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 10:00:38,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:00:39,660][06909] Updated weights for policy 0, policy_version 201793 (0.0053) [2024-06-28 10:00:42,961][06909] Updated weights for policy 0, policy_version 201803 (0.0032) [2024-06-28 10:00:43,852][06674] Fps is (10 sec: 45865.6, 60 sec: 43962.2, 300 sec: 44097.6). Total num frames: 3306373120. Throughput: 0: 43965.9. Samples: 3209239520. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 10:00:43,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:00:46,697][06909] Updated weights for policy 0, policy_version 201813 (0.0033) [2024-06-28 10:00:48,850][06674] Fps is (10 sec: 42599.8, 60 sec: 44236.8, 300 sec: 44042.5). Total num frames: 3306586112. Throughput: 0: 43980.2. Samples: 3209508100. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 10:00:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:00:50,163][06909] Updated weights for policy 0, policy_version 201823 (0.0032) [2024-06-28 10:00:53,850][06674] Fps is (10 sec: 44245.2, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 3306815488. Throughput: 0: 44110.2. Samples: 3209772940. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 10:00:53,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:00:54,180][06909] Updated weights for policy 0, policy_version 201833 (0.0033) [2024-06-28 10:00:57,905][06909] Updated weights for policy 0, policy_version 201843 (0.0029) [2024-06-28 10:00:58,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43690.6, 300 sec: 44153.5). Total num frames: 3307028480. Throughput: 0: 44198.3. Samples: 3209911540. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 10:00:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:01:01,554][06909] Updated weights for policy 0, policy_version 201853 (0.0025) [2024-06-28 10:01:03,850][06674] Fps is (10 sec: 44237.6, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 3307257856. Throughput: 0: 44162.3. Samples: 3210178180. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 10:01:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:01:05,073][06909] Updated weights for policy 0, policy_version 201863 (0.0020) [2024-06-28 10:01:08,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44241.2, 300 sec: 43986.9). Total num frames: 3307470848. Throughput: 0: 43944.5. Samples: 3210428300. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 10:01:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:01:09,417][06909] Updated weights for policy 0, policy_version 201873 (0.0035) [2024-06-28 10:01:12,720][06909] Updated weights for policy 0, policy_version 201883 (0.0030) [2024-06-28 10:01:13,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43417.7, 300 sec: 43986.9). Total num frames: 3307667456. Throughput: 0: 44072.3. Samples: 3210567280. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 10:01:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:01:16,610][06909] Updated weights for policy 0, policy_version 201893 (0.0035) [2024-06-28 10:01:18,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 3307913216. Throughput: 0: 44037.7. Samples: 3210830660. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 10:01:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:01:19,977][06909] Updated weights for policy 0, policy_version 201903 (0.0036) [2024-06-28 10:01:23,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.8, 300 sec: 43987.2). Total num frames: 3308126208. Throughput: 0: 44287.4. Samples: 3211098060. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 10:01:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:01:24,049][06909] Updated weights for policy 0, policy_version 201913 (0.0032) [2024-06-28 10:01:27,403][06909] Updated weights for policy 0, policy_version 201923 (0.0023) [2024-06-28 10:01:28,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43965.3, 300 sec: 44042.4). Total num frames: 3308339200. Throughput: 0: 44202.0. Samples: 3211228520. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 10:01:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:01:31,250][06909] Updated weights for policy 0, policy_version 201933 (0.0034) [2024-06-28 10:01:33,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44509.8, 300 sec: 44098.0). Total num frames: 3308584960. Throughput: 0: 44323.4. Samples: 3211502660. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 10:01:33,853][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:01:34,792][06909] Updated weights for policy 0, policy_version 201943 (0.0030) [2024-06-28 10:01:38,721][06909] Updated weights for policy 0, policy_version 201953 (0.0044) [2024-06-28 10:01:38,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.9, 300 sec: 43986.9). Total num frames: 3308797952. Throughput: 0: 44265.9. Samples: 3211764900. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 10:01:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:01:42,337][06909] Updated weights for policy 0, policy_version 201963 (0.0035) [2024-06-28 10:01:43,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43692.2, 300 sec: 44042.4). Total num frames: 3308994560. Throughput: 0: 44014.7. Samples: 3211892200. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 10:01:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 10:01:46,279][06909] Updated weights for policy 0, policy_version 201973 (0.0033) [2024-06-28 10:01:46,667][06887] Signal inference workers to stop experience collection... (45450 times) [2024-06-28 10:01:46,700][06909] InferenceWorker_p0-w0: stopping experience collection (45450 times) [2024-06-28 10:01:46,724][06887] Signal inference workers to resume experience collection... (45450 times) [2024-06-28 10:01:46,728][06909] InferenceWorker_p0-w0: resuming experience collection (45450 times) [2024-06-28 10:01:48,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 3309240320. Throughput: 0: 43970.5. Samples: 3212156860. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 10:01:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:01:48,857][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000201980_3309240320.pth... [2024-06-28 10:01:48,929][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000201333_3298639872.pth [2024-06-28 10:01:49,827][06909] Updated weights for policy 0, policy_version 201983 (0.0046) [2024-06-28 10:01:53,401][06909] Updated weights for policy 0, policy_version 201993 (0.0038) [2024-06-28 10:01:53,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43963.9, 300 sec: 44042.4). Total num frames: 3309453312. Throughput: 0: 44340.5. Samples: 3212423620. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 10:01:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:01:57,082][06909] Updated weights for policy 0, policy_version 202003 (0.0032) [2024-06-28 10:01:58,850][06674] Fps is (10 sec: 40960.7, 60 sec: 43690.8, 300 sec: 43986.9). Total num frames: 3309649920. Throughput: 0: 44124.0. Samples: 3212552860. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 10:01:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:02:00,980][06909] Updated weights for policy 0, policy_version 202013 (0.0028) [2024-06-28 10:02:03,852][06674] Fps is (10 sec: 44227.5, 60 sec: 43962.2, 300 sec: 44153.2). Total num frames: 3309895680. Throughput: 0: 44215.8. Samples: 3212820460. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 10:02:03,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:02:04,723][06909] Updated weights for policy 0, policy_version 202023 (0.0028) [2024-06-28 10:02:08,178][06909] Updated weights for policy 0, policy_version 202033 (0.0038) [2024-06-28 10:02:08,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44236.9, 300 sec: 44042.5). Total num frames: 3310125056. Throughput: 0: 44212.9. Samples: 3213087640. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 10:02:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:02:12,164][06909] Updated weights for policy 0, policy_version 202043 (0.0033) [2024-06-28 10:02:13,850][06674] Fps is (10 sec: 42607.4, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3310321664. Throughput: 0: 44201.4. Samples: 3213217580. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 10:02:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:02:15,963][06909] Updated weights for policy 0, policy_version 202053 (0.0023) [2024-06-28 10:02:18,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.9, 300 sec: 44209.0). Total num frames: 3310567424. Throughput: 0: 43895.2. Samples: 3213477940. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 10:02:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:02:20,034][06909] Updated weights for policy 0, policy_version 202063 (0.0030) [2024-06-28 10:02:23,354][06909] Updated weights for policy 0, policy_version 202073 (0.0030) [2024-06-28 10:02:23,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3310780416. Throughput: 0: 44038.7. Samples: 3213746640. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 10:02:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:02:27,204][06909] Updated weights for policy 0, policy_version 202083 (0.0035) [2024-06-28 10:02:28,850][06674] Fps is (10 sec: 42598.2, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 3310993408. Throughput: 0: 44087.1. Samples: 3213876120. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 10:02:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:02:30,578][06909] Updated weights for policy 0, policy_version 202093 (0.0033) [2024-06-28 10:02:33,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43963.9, 300 sec: 44098.0). Total num frames: 3311222784. Throughput: 0: 44134.9. Samples: 3214142920. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 10:02:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:02:34,399][06909] Updated weights for policy 0, policy_version 202103 (0.0034) [2024-06-28 10:02:38,097][06909] Updated weights for policy 0, policy_version 202113 (0.0025) [2024-06-28 10:02:38,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3311452160. Throughput: 0: 44118.6. Samples: 3214408960. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 10:02:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:02:41,854][06909] Updated weights for policy 0, policy_version 202123 (0.0036) [2024-06-28 10:02:43,852][06674] Fps is (10 sec: 44227.3, 60 sec: 44508.3, 300 sec: 44153.2). Total num frames: 3311665152. Throughput: 0: 44195.7. Samples: 3214541760. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 10:02:43,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:02:45,236][06909] Updated weights for policy 0, policy_version 202133 (0.0031) [2024-06-28 10:02:48,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.9, 300 sec: 44209.0). Total num frames: 3311894528. Throughput: 0: 44127.8. Samples: 3214806120. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 10:02:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:02:49,273][06909] Updated weights for policy 0, policy_version 202143 (0.0026) [2024-06-28 10:02:52,942][06909] Updated weights for policy 0, policy_version 202153 (0.0027) [2024-06-28 10:02:53,850][06674] Fps is (10 sec: 45884.5, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 3312123904. Throughput: 0: 44132.8. Samples: 3215073620. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 10:02:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:02:57,051][06909] Updated weights for policy 0, policy_version 202163 (0.0036) [2024-06-28 10:02:58,804][06887] Signal inference workers to stop experience collection... (45500 times) [2024-06-28 10:02:58,804][06887] Signal inference workers to resume experience collection... (45500 times) [2024-06-28 10:02:58,826][06909] InferenceWorker_p0-w0: stopping experience collection (45500 times) [2024-06-28 10:02:58,826][06909] InferenceWorker_p0-w0: resuming experience collection (45500 times) [2024-06-28 10:02:58,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44782.9, 300 sec: 44153.5). Total num frames: 3312336896. Throughput: 0: 44272.5. Samples: 3215209840. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 10:02:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:03:00,194][06909] Updated weights for policy 0, policy_version 202173 (0.0029) [2024-06-28 10:03:03,852][06674] Fps is (10 sec: 42589.7, 60 sec: 44236.8, 300 sec: 44153.2). Total num frames: 3312549888. Throughput: 0: 44378.8. Samples: 3215475080. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 10:03:03,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:03:04,202][06909] Updated weights for policy 0, policy_version 202183 (0.0028) [2024-06-28 10:03:07,633][06909] Updated weights for policy 0, policy_version 202193 (0.0032) [2024-06-28 10:03:08,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3312779264. Throughput: 0: 44143.6. Samples: 3215733100. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 10:03:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:03:11,441][06909] Updated weights for policy 0, policy_version 202203 (0.0037) [2024-06-28 10:03:13,852][06674] Fps is (10 sec: 44236.7, 60 sec: 44508.3, 300 sec: 44097.6). Total num frames: 3312992256. Throughput: 0: 44323.7. Samples: 3215870780. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 10:03:13,853][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:03:15,047][06909] Updated weights for policy 0, policy_version 202213 (0.0032) [2024-06-28 10:03:18,569][06909] Updated weights for policy 0, policy_version 202223 (0.0023) [2024-06-28 10:03:18,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 3313221632. Throughput: 0: 44224.4. Samples: 3216133020. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 10:03:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:03:22,302][06909] Updated weights for policy 0, policy_version 202233 (0.0038) [2024-06-28 10:03:23,850][06674] Fps is (10 sec: 45885.1, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 3313451008. Throughput: 0: 44336.5. Samples: 3216404100. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 10:03:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:03:26,303][06909] Updated weights for policy 0, policy_version 202243 (0.0029) [2024-06-28 10:03:28,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 3313664000. Throughput: 0: 44300.2. Samples: 3216535180. Policy #0 lag: (min: 1.0, avg: 12.2, max: 21.0) [2024-06-28 10:03:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:03:29,600][06909] Updated weights for policy 0, policy_version 202253 (0.0024) [2024-06-28 10:03:33,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 3313860608. Throughput: 0: 44303.6. Samples: 3216799780. Policy #0 lag: (min: 1.0, avg: 12.2, max: 21.0) [2024-06-28 10:03:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:03:33,947][06909] Updated weights for policy 0, policy_version 202263 (0.0031) [2024-06-28 10:03:37,265][06909] Updated weights for policy 0, policy_version 202273 (0.0028) [2024-06-28 10:03:38,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 3314106368. Throughput: 0: 44143.6. Samples: 3217060080. Policy #0 lag: (min: 1.0, avg: 12.2, max: 21.0) [2024-06-28 10:03:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:03:41,097][06909] Updated weights for policy 0, policy_version 202283 (0.0033) [2024-06-28 10:03:43,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44238.4, 300 sec: 44098.0). Total num frames: 3314319360. Throughput: 0: 44229.3. Samples: 3217200160. Policy #0 lag: (min: 1.0, avg: 12.2, max: 21.0) [2024-06-28 10:03:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:03:44,875][06909] Updated weights for policy 0, policy_version 202293 (0.0029) [2024-06-28 10:03:48,299][06909] Updated weights for policy 0, policy_version 202303 (0.0036) [2024-06-28 10:03:48,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 3314532352. Throughput: 0: 44161.1. Samples: 3217462240. Policy #0 lag: (min: 1.0, avg: 12.2, max: 21.0) [2024-06-28 10:03:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:03:48,862][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000202303_3314532352.pth... [2024-06-28 10:03:48,917][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000201657_3303948288.pth [2024-06-28 10:03:52,079][06909] Updated weights for policy 0, policy_version 202313 (0.0041) [2024-06-28 10:03:53,850][06674] Fps is (10 sec: 45874.3, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 3314778112. Throughput: 0: 44302.5. Samples: 3217726720. Policy #0 lag: (min: 1.0, avg: 12.2, max: 21.0) [2024-06-28 10:03:53,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:03:56,029][06909] Updated weights for policy 0, policy_version 202323 (0.0030) [2024-06-28 10:03:58,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 3314991104. Throughput: 0: 44418.9. Samples: 3217869540. Policy #0 lag: (min: 1.0, avg: 12.2, max: 21.0) [2024-06-28 10:03:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:03:59,672][06909] Updated weights for policy 0, policy_version 202333 (0.0031) [2024-06-28 10:03:59,762][06887] Signal inference workers to stop experience collection... (45550 times) [2024-06-28 10:03:59,805][06909] InferenceWorker_p0-w0: stopping experience collection (45550 times) [2024-06-28 10:03:59,816][06887] Signal inference workers to resume experience collection... (45550 times) [2024-06-28 10:03:59,826][06909] InferenceWorker_p0-w0: resuming experience collection (45550 times) [2024-06-28 10:04:03,164][06909] Updated weights for policy 0, policy_version 202343 (0.0042) [2024-06-28 10:04:03,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43965.2, 300 sec: 44042.4). Total num frames: 3315187712. Throughput: 0: 44394.1. Samples: 3218130760. Policy #0 lag: (min: 1.0, avg: 12.2, max: 21.0) [2024-06-28 10:04:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:04:07,029][06909] Updated weights for policy 0, policy_version 202353 (0.0035) [2024-06-28 10:04:08,850][06674] Fps is (10 sec: 44237.3, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 3315433472. Throughput: 0: 44030.2. Samples: 3218385460. Policy #0 lag: (min: 1.0, avg: 12.2, max: 21.0) [2024-06-28 10:04:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:04:10,863][06909] Updated weights for policy 0, policy_version 202363 (0.0032) [2024-06-28 10:04:13,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43965.2, 300 sec: 44097.9). Total num frames: 3315630080. Throughput: 0: 44419.9. Samples: 3218534080. Policy #0 lag: (min: 1.0, avg: 12.2, max: 21.0) [2024-06-28 10:04:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 10:04:14,430][06909] Updated weights for policy 0, policy_version 202373 (0.0036) [2024-06-28 10:04:18,033][06909] Updated weights for policy 0, policy_version 202383 (0.0032) [2024-06-28 10:04:18,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 3315843072. Throughput: 0: 44211.1. Samples: 3218789280. Policy #0 lag: (min: 1.0, avg: 12.2, max: 21.0) [2024-06-28 10:04:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:04:21,753][06909] Updated weights for policy 0, policy_version 202393 (0.0044) [2024-06-28 10:04:23,850][06674] Fps is (10 sec: 47513.1, 60 sec: 44236.6, 300 sec: 44153.5). Total num frames: 3316105216. Throughput: 0: 44167.8. Samples: 3219047640. Policy #0 lag: (min: 1.0, avg: 12.2, max: 21.0) [2024-06-28 10:04:23,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:04:25,251][06909] Updated weights for policy 0, policy_version 202403 (0.0033) [2024-06-28 10:04:28,852][06674] Fps is (10 sec: 47503.6, 60 sec: 44235.3, 300 sec: 44208.7). Total num frames: 3316318208. Throughput: 0: 44382.8. Samples: 3219197480. Policy #0 lag: (min: 1.0, avg: 12.2, max: 21.0) [2024-06-28 10:04:28,853][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:04:29,069][06909] Updated weights for policy 0, policy_version 202413 (0.0026) [2024-06-28 10:04:32,589][06909] Updated weights for policy 0, policy_version 202423 (0.0047) [2024-06-28 10:04:33,850][06674] Fps is (10 sec: 39322.5, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3316498432. Throughput: 0: 44245.5. Samples: 3219453280. Policy #0 lag: (min: 0.0, avg: 12.5, max: 22.0) [2024-06-28 10:04:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:04:36,339][06909] Updated weights for policy 0, policy_version 202433 (0.0027) [2024-06-28 10:04:38,850][06674] Fps is (10 sec: 45884.5, 60 sec: 44509.8, 300 sec: 44209.0). Total num frames: 3316776960. Throughput: 0: 44169.4. Samples: 3219714340. Policy #0 lag: (min: 0.0, avg: 12.5, max: 22.0) [2024-06-28 10:04:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:04:40,314][06909] Updated weights for policy 0, policy_version 202443 (0.0028) [2024-06-28 10:04:43,653][06909] Updated weights for policy 0, policy_version 202453 (0.0037) [2024-06-28 10:04:43,852][06674] Fps is (10 sec: 49141.9, 60 sec: 44508.3, 300 sec: 44264.3). Total num frames: 3316989952. Throughput: 0: 44207.0. Samples: 3219858940. Policy #0 lag: (min: 0.0, avg: 12.5, max: 22.0) [2024-06-28 10:04:43,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:04:47,534][06909] Updated weights for policy 0, policy_version 202463 (0.0037) [2024-06-28 10:04:48,850][06674] Fps is (10 sec: 40960.2, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 3317186560. Throughput: 0: 44076.1. Samples: 3220114180. Policy #0 lag: (min: 0.0, avg: 12.5, max: 22.0) [2024-06-28 10:04:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:04:51,426][06909] Updated weights for policy 0, policy_version 202473 (0.0028) [2024-06-28 10:04:53,850][06674] Fps is (10 sec: 42606.6, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 3317415936. Throughput: 0: 44246.1. Samples: 3220376540. Policy #0 lag: (min: 0.0, avg: 12.5, max: 22.0) [2024-06-28 10:04:53,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:04:55,169][06909] Updated weights for policy 0, policy_version 202483 (0.0031) [2024-06-28 10:04:58,803][06909] Updated weights for policy 0, policy_version 202493 (0.0029) [2024-06-28 10:04:58,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.9, 300 sec: 44209.1). Total num frames: 3317645312. Throughput: 0: 43932.5. Samples: 3220511040. Policy #0 lag: (min: 0.0, avg: 12.5, max: 22.0) [2024-06-28 10:04:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:05:02,375][06909] Updated weights for policy 0, policy_version 202503 (0.0046) [2024-06-28 10:05:03,264][06887] Signal inference workers to stop experience collection... (45600 times) [2024-06-28 10:05:03,265][06887] Signal inference workers to resume experience collection... (45600 times) [2024-06-28 10:05:03,307][06909] InferenceWorker_p0-w0: stopping experience collection (45600 times) [2024-06-28 10:05:03,308][06909] InferenceWorker_p0-w0: resuming experience collection (45600 times) [2024-06-28 10:05:03,850][06674] Fps is (10 sec: 42599.0, 60 sec: 44236.9, 300 sec: 44154.4). Total num frames: 3317841920. Throughput: 0: 44127.6. Samples: 3220775020. Policy #0 lag: (min: 0.0, avg: 12.5, max: 22.0) [2024-06-28 10:05:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:05:06,119][06909] Updated weights for policy 0, policy_version 202513 (0.0041) [2024-06-28 10:05:08,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 3318087680. Throughput: 0: 44228.1. Samples: 3221037900. Policy #0 lag: (min: 0.0, avg: 12.5, max: 22.0) [2024-06-28 10:05:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:05:10,098][06909] Updated weights for policy 0, policy_version 202523 (0.0030) [2024-06-28 10:05:13,501][06909] Updated weights for policy 0, policy_version 202533 (0.0031) [2024-06-28 10:05:13,856][06674] Fps is (10 sec: 45847.9, 60 sec: 44505.5, 300 sec: 44208.1). Total num frames: 3318300672. Throughput: 0: 44096.2. Samples: 3221181980. Policy #0 lag: (min: 0.0, avg: 12.5, max: 22.0) [2024-06-28 10:05:13,856][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:05:17,347][06909] Updated weights for policy 0, policy_version 202543 (0.0026) [2024-06-28 10:05:18,850][06674] Fps is (10 sec: 40960.0, 60 sec: 44236.7, 300 sec: 44098.0). Total num frames: 3318497280. Throughput: 0: 44187.9. Samples: 3221441740. Policy #0 lag: (min: 0.0, avg: 12.5, max: 22.0) [2024-06-28 10:05:18,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 10:05:20,810][06909] Updated weights for policy 0, policy_version 202553 (0.0025) [2024-06-28 10:05:23,850][06674] Fps is (10 sec: 44262.9, 60 sec: 43963.8, 300 sec: 44209.3). Total num frames: 3318743040. Throughput: 0: 44174.2. Samples: 3221702180. Policy #0 lag: (min: 0.0, avg: 12.5, max: 22.0) [2024-06-28 10:05:23,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 10:05:24,955][06909] Updated weights for policy 0, policy_version 202563 (0.0034) [2024-06-28 10:05:28,499][06909] Updated weights for policy 0, policy_version 202573 (0.0030) [2024-06-28 10:05:28,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43965.3, 300 sec: 44209.0). Total num frames: 3318956032. Throughput: 0: 43933.6. Samples: 3221835860. Policy #0 lag: (min: 0.0, avg: 12.5, max: 22.0) [2024-06-28 10:05:28,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:05:32,369][06909] Updated weights for policy 0, policy_version 202583 (0.0041) [2024-06-28 10:05:33,850][06674] Fps is (10 sec: 42598.2, 60 sec: 44509.8, 300 sec: 44098.0). Total num frames: 3319169024. Throughput: 0: 44079.5. Samples: 3222097760. Policy #0 lag: (min: 0.0, avg: 12.5, max: 22.0) [2024-06-28 10:05:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:05:35,834][06909] Updated weights for policy 0, policy_version 202593 (0.0032) [2024-06-28 10:05:38,850][06674] Fps is (10 sec: 44233.8, 60 sec: 43690.2, 300 sec: 44153.7). Total num frames: 3319398400. Throughput: 0: 44165.7. Samples: 3222364020. Policy #0 lag: (min: 0.0, avg: 12.5, max: 22.0) [2024-06-28 10:05:38,851][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 10:05:39,781][06909] Updated weights for policy 0, policy_version 202603 (0.0024) [2024-06-28 10:05:43,038][06909] Updated weights for policy 0, policy_version 202613 (0.0037) [2024-06-28 10:05:43,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43692.1, 300 sec: 44153.5). Total num frames: 3319611392. Throughput: 0: 44128.8. Samples: 3222496840. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 10:05:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:05:47,142][06909] Updated weights for policy 0, policy_version 202623 (0.0038) [2024-06-28 10:05:48,850][06674] Fps is (10 sec: 44239.3, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 3319840768. Throughput: 0: 44116.3. Samples: 3222760260. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 10:05:48,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:05:48,865][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000202627_3319840768.pth... [2024-06-28 10:05:48,922][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000201980_3309240320.pth [2024-06-28 10:05:50,696][06909] Updated weights for policy 0, policy_version 202633 (0.0027) [2024-06-28 10:05:53,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 3320053760. Throughput: 0: 44181.4. Samples: 3223026060. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 10:05:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:05:54,433][06909] Updated weights for policy 0, policy_version 202643 (0.0039) [2024-06-28 10:05:58,538][06909] Updated weights for policy 0, policy_version 202653 (0.0032) [2024-06-28 10:05:58,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 3320266752. Throughput: 0: 43961.0. Samples: 3223159960. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 10:05:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:06:01,718][06909] Updated weights for policy 0, policy_version 202663 (0.0035) [2024-06-28 10:06:03,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44509.8, 300 sec: 44209.0). Total num frames: 3320512512. Throughput: 0: 44097.8. Samples: 3223426140. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 10:06:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:06:05,711][06909] Updated weights for policy 0, policy_version 202673 (0.0037) [2024-06-28 10:06:08,850][06674] Fps is (10 sec: 45874.4, 60 sec: 43963.7, 300 sec: 44264.6). Total num frames: 3320725504. Throughput: 0: 44080.4. Samples: 3223685800. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 10:06:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:06:09,316][06909] Updated weights for policy 0, policy_version 202683 (0.0022) [2024-06-28 10:06:12,841][06909] Updated weights for policy 0, policy_version 202693 (0.0031) [2024-06-28 10:06:13,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44241.2, 300 sec: 44209.0). Total num frames: 3320954880. Throughput: 0: 44096.9. Samples: 3223820220. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 10:06:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:06:16,855][06909] Updated weights for policy 0, policy_version 202703 (0.0021) [2024-06-28 10:06:18,201][06887] Signal inference workers to stop experience collection... (45650 times) [2024-06-28 10:06:18,201][06887] Signal inference workers to resume experience collection... (45650 times) [2024-06-28 10:06:18,210][06909] InferenceWorker_p0-w0: stopping experience collection (45650 times) [2024-06-28 10:06:18,210][06909] InferenceWorker_p0-w0: resuming experience collection (45650 times) [2024-06-28 10:06:18,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44509.9, 300 sec: 44209.0). Total num frames: 3321167872. Throughput: 0: 44338.2. Samples: 3224092980. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 10:06:18,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:06:20,513][06909] Updated weights for policy 0, policy_version 202713 (0.0031) [2024-06-28 10:06:23,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.8, 300 sec: 44209.0). Total num frames: 3321380864. Throughput: 0: 44194.8. Samples: 3224352760. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 10:06:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:06:24,171][06909] Updated weights for policy 0, policy_version 202723 (0.0034) [2024-06-28 10:06:27,679][06909] Updated weights for policy 0, policy_version 202733 (0.0046) [2024-06-28 10:06:28,850][06674] Fps is (10 sec: 45875.7, 60 sec: 44509.8, 300 sec: 44209.0). Total num frames: 3321626624. Throughput: 0: 44322.3. Samples: 3224491340. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 10:06:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:06:31,368][06909] Updated weights for policy 0, policy_version 202743 (0.0034) [2024-06-28 10:06:33,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44509.9, 300 sec: 44209.0). Total num frames: 3321839616. Throughput: 0: 44452.9. Samples: 3224760640. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 10:06:33,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 10:06:35,498][06909] Updated weights for policy 0, policy_version 202753 (0.0024) [2024-06-28 10:06:38,581][06909] Updated weights for policy 0, policy_version 202763 (0.0038) [2024-06-28 10:06:38,850][06674] Fps is (10 sec: 44236.1, 60 sec: 44510.2, 300 sec: 44320.1). Total num frames: 3322068992. Throughput: 0: 44272.3. Samples: 3225018320. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 10:06:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:06:42,718][06909] Updated weights for policy 0, policy_version 202773 (0.0029) [2024-06-28 10:06:43,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44509.9, 300 sec: 44209.1). Total num frames: 3322281984. Throughput: 0: 44180.8. Samples: 3225148100. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 10:06:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 10:06:46,300][06909] Updated weights for policy 0, policy_version 202783 (0.0033) [2024-06-28 10:06:48,856][06674] Fps is (10 sec: 42573.3, 60 sec: 44232.4, 300 sec: 44208.1). Total num frames: 3322494976. Throughput: 0: 44179.5. Samples: 3225414480. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-28 10:06:48,856][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:06:49,932][06909] Updated weights for policy 0, policy_version 202793 (0.0031) [2024-06-28 10:06:53,814][06909] Updated weights for policy 0, policy_version 202803 (0.0036) [2024-06-28 10:06:53,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44509.9, 300 sec: 44320.1). Total num frames: 3322724352. Throughput: 0: 44237.0. Samples: 3225676460. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-28 10:06:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:06:57,601][06909] Updated weights for policy 0, policy_version 202813 (0.0037) [2024-06-28 10:06:58,856][06674] Fps is (10 sec: 44236.4, 60 sec: 44505.3, 300 sec: 44208.4). Total num frames: 3322937344. Throughput: 0: 44265.5. Samples: 3225812440. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-28 10:06:58,857][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:07:01,199][06909] Updated weights for policy 0, policy_version 202823 (0.0018) [2024-06-28 10:07:03,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 3323150336. Throughput: 0: 44089.4. Samples: 3226077000. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-28 10:07:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:07:04,789][06909] Updated weights for policy 0, policy_version 202833 (0.0037) [2024-06-28 10:07:08,526][06909] Updated weights for policy 0, policy_version 202843 (0.0027) [2024-06-28 10:07:08,850][06674] Fps is (10 sec: 44264.1, 60 sec: 44236.9, 300 sec: 44264.6). Total num frames: 3323379712. Throughput: 0: 44256.5. Samples: 3226344300. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-28 10:07:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:07:12,526][06909] Updated weights for policy 0, policy_version 202853 (0.0037) [2024-06-28 10:07:13,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44236.7, 300 sec: 44209.0). Total num frames: 3323609088. Throughput: 0: 44161.7. Samples: 3226478620. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-28 10:07:13,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:07:16,508][06909] Updated weights for policy 0, policy_version 202863 (0.0033) [2024-06-28 10:07:18,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 3323805696. Throughput: 0: 43668.0. Samples: 3226725700. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-28 10:07:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:07:19,941][06909] Updated weights for policy 0, policy_version 202873 (0.0033) [2024-06-28 10:07:23,850][06674] Fps is (10 sec: 40960.6, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 3324018688. Throughput: 0: 43895.8. Samples: 3226993620. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-28 10:07:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:07:23,871][06909] Updated weights for policy 0, policy_version 202883 (0.0029) [2024-06-28 10:07:27,384][06909] Updated weights for policy 0, policy_version 202893 (0.0035) [2024-06-28 10:07:28,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 3324264448. Throughput: 0: 43876.0. Samples: 3227122520. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-28 10:07:28,856][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 10:07:31,359][06909] Updated weights for policy 0, policy_version 202903 (0.0020) [2024-06-28 10:07:32,369][06887] Signal inference workers to stop experience collection... (45700 times) [2024-06-28 10:07:32,370][06887] Signal inference workers to resume experience collection... (45700 times) [2024-06-28 10:07:32,407][06909] InferenceWorker_p0-w0: stopping experience collection (45700 times) [2024-06-28 10:07:32,407][06909] InferenceWorker_p0-w0: resuming experience collection (45700 times) [2024-06-28 10:07:33,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43690.7, 300 sec: 44097.9). Total num frames: 3324461056. Throughput: 0: 43848.5. Samples: 3227387400. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-28 10:07:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:07:34,721][06909] Updated weights for policy 0, policy_version 202913 (0.0019) [2024-06-28 10:07:38,526][06909] Updated weights for policy 0, policy_version 202923 (0.0036) [2024-06-28 10:07:38,852][06674] Fps is (10 sec: 42589.5, 60 sec: 43689.2, 300 sec: 44153.5). Total num frames: 3324690432. Throughput: 0: 44011.7. Samples: 3227657080. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-28 10:07:38,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:07:41,858][06909] Updated weights for policy 0, policy_version 202933 (0.0032) [2024-06-28 10:07:43,852][06674] Fps is (10 sec: 45865.9, 60 sec: 43962.2, 300 sec: 44153.2). Total num frames: 3324919808. Throughput: 0: 43958.2. Samples: 3227790380. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-28 10:07:43,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:07:45,811][06909] Updated weights for policy 0, policy_version 202943 (0.0030) [2024-06-28 10:07:48,850][06674] Fps is (10 sec: 44245.6, 60 sec: 43968.1, 300 sec: 44097.9). Total num frames: 3325132800. Throughput: 0: 44004.4. Samples: 3228057200. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-28 10:07:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:07:48,862][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000202950_3325132800.pth... [2024-06-28 10:07:48,914][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000202303_3314532352.pth [2024-06-28 10:07:49,691][06909] Updated weights for policy 0, policy_version 202953 (0.0029) [2024-06-28 10:07:53,769][06909] Updated weights for policy 0, policy_version 202963 (0.0029) [2024-06-28 10:07:53,850][06674] Fps is (10 sec: 42607.2, 60 sec: 43690.7, 300 sec: 44097.9). Total num frames: 3325345792. Throughput: 0: 43928.4. Samples: 3228321080. Policy #0 lag: (min: 1.0, avg: 10.4, max: 22.0) [2024-06-28 10:07:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:07:57,054][06909] Updated weights for policy 0, policy_version 202973 (0.0030) [2024-06-28 10:07:58,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43968.3, 300 sec: 44153.8). Total num frames: 3325575168. Throughput: 0: 43740.6. Samples: 3228446940. Policy #0 lag: (min: 1.0, avg: 10.4, max: 22.0) [2024-06-28 10:07:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:08:01,271][06909] Updated weights for policy 0, policy_version 202983 (0.0031) [2024-06-28 10:08:03,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 3325771776. Throughput: 0: 44093.9. Samples: 3228709920. Policy #0 lag: (min: 1.0, avg: 10.4, max: 22.0) [2024-06-28 10:08:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:08:04,443][06909] Updated weights for policy 0, policy_version 202993 (0.0023) [2024-06-28 10:08:08,703][06909] Updated weights for policy 0, policy_version 203003 (0.0027) [2024-06-28 10:08:08,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.6, 300 sec: 44098.3). Total num frames: 3326001152. Throughput: 0: 44159.9. Samples: 3228980820. Policy #0 lag: (min: 1.0, avg: 10.4, max: 22.0) [2024-06-28 10:08:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:08:11,985][06909] Updated weights for policy 0, policy_version 203013 (0.0031) [2024-06-28 10:08:13,850][06674] Fps is (10 sec: 47513.2, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 3326246912. Throughput: 0: 44117.3. Samples: 3229107800. Policy #0 lag: (min: 1.0, avg: 10.4, max: 22.0) [2024-06-28 10:08:13,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:08:15,983][06909] Updated weights for policy 0, policy_version 203023 (0.0034) [2024-06-28 10:08:18,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 3326459904. Throughput: 0: 44177.8. Samples: 3229375400. Policy #0 lag: (min: 1.0, avg: 10.4, max: 22.0) [2024-06-28 10:08:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:08:19,208][06909] Updated weights for policy 0, policy_version 203033 (0.0027) [2024-06-28 10:08:23,561][06909] Updated weights for policy 0, policy_version 203043 (0.0025) [2024-06-28 10:08:23,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3326656512. Throughput: 0: 44219.0. Samples: 3229646840. Policy #0 lag: (min: 1.0, avg: 10.4, max: 22.0) [2024-06-28 10:08:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:08:26,594][06909] Updated weights for policy 0, policy_version 203053 (0.0025) [2024-06-28 10:08:28,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 44209.0). Total num frames: 3326902272. Throughput: 0: 44172.7. Samples: 3229778060. Policy #0 lag: (min: 1.0, avg: 10.4, max: 22.0) [2024-06-28 10:08:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:08:30,924][06909] Updated weights for policy 0, policy_version 203063 (0.0046) [2024-06-28 10:08:33,850][06674] Fps is (10 sec: 47513.4, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 3327131648. Throughput: 0: 43997.9. Samples: 3230037100. Policy #0 lag: (min: 1.0, avg: 10.4, max: 22.0) [2024-06-28 10:08:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 10:08:34,123][06909] Updated weights for policy 0, policy_version 203073 (0.0022) [2024-06-28 10:08:38,099][06909] Updated weights for policy 0, policy_version 203083 (0.0040) [2024-06-28 10:08:38,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43692.2, 300 sec: 44042.4). Total num frames: 3327311872. Throughput: 0: 44020.9. Samples: 3230302020. Policy #0 lag: (min: 1.0, avg: 10.4, max: 22.0) [2024-06-28 10:08:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:08:41,836][06909] Updated weights for policy 0, policy_version 203093 (0.0039) [2024-06-28 10:08:43,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43965.1, 300 sec: 44153.5). Total num frames: 3327557632. Throughput: 0: 44101.6. Samples: 3230431520. Policy #0 lag: (min: 1.0, avg: 10.4, max: 22.0) [2024-06-28 10:08:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:08:45,645][06909] Updated weights for policy 0, policy_version 203103 (0.0033) [2024-06-28 10:08:48,850][06674] Fps is (10 sec: 47513.8, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 3327787008. Throughput: 0: 44092.5. Samples: 3230694080. Policy #0 lag: (min: 1.0, avg: 10.4, max: 22.0) [2024-06-28 10:08:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:08:49,097][06909] Updated weights for policy 0, policy_version 203113 (0.0037) [2024-06-28 10:08:53,137][06909] Updated weights for policy 0, policy_version 203123 (0.0039) [2024-06-28 10:08:53,850][06674] Fps is (10 sec: 44237.8, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 3328000000. Throughput: 0: 44111.6. Samples: 3230965840. Policy #0 lag: (min: 1.0, avg: 10.4, max: 22.0) [2024-06-28 10:08:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:08:55,134][06887] Signal inference workers to stop experience collection... (45750 times) [2024-06-28 10:08:55,140][06887] Signal inference workers to resume experience collection... (45750 times) [2024-06-28 10:08:55,172][06909] InferenceWorker_p0-w0: stopping experience collection (45750 times) [2024-06-28 10:08:55,172][06909] InferenceWorker_p0-w0: resuming experience collection (45750 times) [2024-06-28 10:08:56,392][06909] Updated weights for policy 0, policy_version 203133 (0.0028) [2024-06-28 10:08:58,850][06674] Fps is (10 sec: 44235.7, 60 sec: 44236.6, 300 sec: 44209.0). Total num frames: 3328229376. Throughput: 0: 44097.6. Samples: 3231092200. Policy #0 lag: (min: 1.0, avg: 10.4, max: 22.0) [2024-06-28 10:08:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:09:00,535][06909] Updated weights for policy 0, policy_version 203143 (0.0031) [2024-06-28 10:09:03,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 3328442368. Throughput: 0: 44156.0. Samples: 3231362420. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 10:09:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:09:04,132][06909] Updated weights for policy 0, policy_version 203153 (0.0031) [2024-06-28 10:09:07,959][06909] Updated weights for policy 0, policy_version 203163 (0.0035) [2024-06-28 10:09:08,852][06674] Fps is (10 sec: 42590.6, 60 sec: 44235.3, 300 sec: 44153.2). Total num frames: 3328655360. Throughput: 0: 43897.1. Samples: 3231622300. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 10:09:08,852][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 10:09:11,512][06909] Updated weights for policy 0, policy_version 203173 (0.0032) [2024-06-28 10:09:13,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 3328868352. Throughput: 0: 43825.8. Samples: 3231750220. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 10:09:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:09:15,403][06909] Updated weights for policy 0, policy_version 203183 (0.0022) [2024-06-28 10:09:18,850][06674] Fps is (10 sec: 44245.6, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3329097728. Throughput: 0: 43933.8. Samples: 3232014120. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 10:09:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:09:19,198][06909] Updated weights for policy 0, policy_version 203193 (0.0031) [2024-06-28 10:09:22,944][06909] Updated weights for policy 0, policy_version 203203 (0.0044) [2024-06-28 10:09:23,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 44042.7). Total num frames: 3329310720. Throughput: 0: 43884.9. Samples: 3232276840. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 10:09:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:09:26,483][06909] Updated weights for policy 0, policy_version 203213 (0.0038) [2024-06-28 10:09:28,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 3329523712. Throughput: 0: 43907.2. Samples: 3232407340. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 10:09:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:09:30,730][06909] Updated weights for policy 0, policy_version 203223 (0.0037) [2024-06-28 10:09:33,731][06909] Updated weights for policy 0, policy_version 203233 (0.0041) [2024-06-28 10:09:33,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3329769472. Throughput: 0: 44042.1. Samples: 3232675980. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 10:09:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:09:37,939][06909] Updated weights for policy 0, policy_version 203243 (0.0026) [2024-06-28 10:09:38,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.8, 300 sec: 43987.2). Total num frames: 3329966080. Throughput: 0: 43711.5. Samples: 3232932860. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 10:09:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:09:41,299][06909] Updated weights for policy 0, policy_version 203253 (0.0027) [2024-06-28 10:09:43,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 3330179072. Throughput: 0: 43904.1. Samples: 3233067880. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 10:09:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:09:45,091][06909] Updated weights for policy 0, policy_version 203263 (0.0041) [2024-06-28 10:09:48,725][06909] Updated weights for policy 0, policy_version 203273 (0.0034) [2024-06-28 10:09:48,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 3330424832. Throughput: 0: 43744.9. Samples: 3233330940. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 10:09:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:09:48,863][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000203273_3330424832.pth... [2024-06-28 10:09:48,917][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000202627_3319840768.pth [2024-06-28 10:09:52,925][06909] Updated weights for policy 0, policy_version 203283 (0.0029) [2024-06-28 10:09:53,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3330637824. Throughput: 0: 43959.3. Samples: 3233600380. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 10:09:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:09:56,414][06909] Updated weights for policy 0, policy_version 203293 (0.0043) [2024-06-28 10:09:58,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.8, 300 sec: 44097.9). Total num frames: 3330850816. Throughput: 0: 43975.6. Samples: 3233729120. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 10:09:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:10:00,148][06909] Updated weights for policy 0, policy_version 203303 (0.0042) [2024-06-28 10:10:03,640][06909] Updated weights for policy 0, policy_version 203313 (0.0031) [2024-06-28 10:10:03,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3331080192. Throughput: 0: 43960.9. Samples: 3233992360. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 10:10:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 10:10:07,342][06909] Updated weights for policy 0, policy_version 203323 (0.0035) [2024-06-28 10:10:08,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44238.2, 300 sec: 44098.8). Total num frames: 3331309568. Throughput: 0: 44091.1. Samples: 3234260940. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 10:10:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:10:10,293][06887] Signal inference workers to stop experience collection... (45800 times) [2024-06-28 10:10:10,334][06909] InferenceWorker_p0-w0: stopping experience collection (45800 times) [2024-06-28 10:10:10,349][06887] Signal inference workers to resume experience collection... (45800 times) [2024-06-28 10:10:10,357][06909] InferenceWorker_p0-w0: resuming experience collection (45800 times) [2024-06-28 10:10:11,073][06909] Updated weights for policy 0, policy_version 203333 (0.0027) [2024-06-28 10:10:13,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 3331522560. Throughput: 0: 44244.4. Samples: 3234398340. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 10:10:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:10:14,847][06909] Updated weights for policy 0, policy_version 203343 (0.0026) [2024-06-28 10:10:18,191][06909] Updated weights for policy 0, policy_version 203353 (0.0033) [2024-06-28 10:10:18,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3331735552. Throughput: 0: 44148.0. Samples: 3234662640. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 10:10:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:10:22,602][06909] Updated weights for policy 0, policy_version 203363 (0.0038) [2024-06-28 10:10:23,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 3331981312. Throughput: 0: 44127.9. Samples: 3234918620. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 10:10:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:10:25,496][06909] Updated weights for policy 0, policy_version 203373 (0.0031) [2024-06-28 10:10:28,852][06674] Fps is (10 sec: 44228.1, 60 sec: 44235.3, 300 sec: 44097.7). Total num frames: 3332177920. Throughput: 0: 44223.0. Samples: 3235058000. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 10:10:28,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:10:29,841][06909] Updated weights for policy 0, policy_version 203383 (0.0038) [2024-06-28 10:10:33,149][06909] Updated weights for policy 0, policy_version 203393 (0.0038) [2024-06-28 10:10:33,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43690.7, 300 sec: 44042.5). Total num frames: 3332390912. Throughput: 0: 44172.9. Samples: 3235318720. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 10:10:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:10:37,113][06909] Updated weights for policy 0, policy_version 203403 (0.0042) [2024-06-28 10:10:38,850][06674] Fps is (10 sec: 44245.6, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 3332620288. Throughput: 0: 43946.6. Samples: 3235577980. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 10:10:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:10:40,926][06909] Updated weights for policy 0, policy_version 203413 (0.0027) [2024-06-28 10:10:43,854][06674] Fps is (10 sec: 44219.1, 60 sec: 44233.9, 300 sec: 44041.8). Total num frames: 3332833280. Throughput: 0: 44189.4. Samples: 3235717820. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 10:10:43,854][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:10:44,548][06909] Updated weights for policy 0, policy_version 203423 (0.0029) [2024-06-28 10:10:48,162][06909] Updated weights for policy 0, policy_version 203433 (0.0027) [2024-06-28 10:10:48,856][06674] Fps is (10 sec: 44210.4, 60 sec: 43959.3, 300 sec: 44097.0). Total num frames: 3333062656. Throughput: 0: 44138.6. Samples: 3235978860. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 10:10:48,856][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:10:52,204][06909] Updated weights for policy 0, policy_version 203443 (0.0033) [2024-06-28 10:10:53,850][06674] Fps is (10 sec: 44254.9, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 3333275648. Throughput: 0: 44026.4. Samples: 3236242120. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 10:10:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:10:55,329][06909] Updated weights for policy 0, policy_version 203453 (0.0037) [2024-06-28 10:10:58,850][06674] Fps is (10 sec: 42622.4, 60 sec: 43963.4, 300 sec: 43986.8). Total num frames: 3333488640. Throughput: 0: 44019.2. Samples: 3236379220. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 10:10:58,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:10:59,695][06909] Updated weights for policy 0, policy_version 203463 (0.0030) [2024-06-28 10:11:03,012][06909] Updated weights for policy 0, policy_version 203473 (0.0044) [2024-06-28 10:11:03,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.8, 300 sec: 43986.9). Total num frames: 3333701632. Throughput: 0: 43921.5. Samples: 3236639100. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 10:11:03,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 10:11:07,047][06909] Updated weights for policy 0, policy_version 203483 (0.0027) [2024-06-28 10:11:08,850][06674] Fps is (10 sec: 45877.1, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3333947392. Throughput: 0: 44145.9. Samples: 3236905180. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 10:11:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:11:10,131][06909] Updated weights for policy 0, policy_version 203493 (0.0036) [2024-06-28 10:11:13,850][06674] Fps is (10 sec: 45874.2, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 3334160384. Throughput: 0: 44166.3. Samples: 3237045400. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 10:11:13,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:11:14,329][06909] Updated weights for policy 0, policy_version 203503 (0.0027) [2024-06-28 10:11:17,650][06909] Updated weights for policy 0, policy_version 203513 (0.0029) [2024-06-28 10:11:18,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43690.8, 300 sec: 43986.9). Total num frames: 3334356992. Throughput: 0: 44105.4. Samples: 3237303460. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 10:11:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:11:21,770][06909] Updated weights for policy 0, policy_version 203523 (0.0032) [2024-06-28 10:11:23,850][06674] Fps is (10 sec: 44238.0, 60 sec: 43690.8, 300 sec: 43986.9). Total num frames: 3334602752. Throughput: 0: 44236.1. Samples: 3237568600. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 10:11:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:11:25,042][06909] Updated weights for policy 0, policy_version 203533 (0.0031) [2024-06-28 10:11:28,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43965.2, 300 sec: 43986.9). Total num frames: 3334815744. Throughput: 0: 44219.0. Samples: 3237707500. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 10:11:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:11:29,342][06909] Updated weights for policy 0, policy_version 203543 (0.0032) [2024-06-28 10:11:31,465][06887] Signal inference workers to stop experience collection... (45850 times) [2024-06-28 10:11:31,493][06909] InferenceWorker_p0-w0: stopping experience collection (45850 times) [2024-06-28 10:11:31,516][06887] Signal inference workers to resume experience collection... (45850 times) [2024-06-28 10:11:31,516][06909] InferenceWorker_p0-w0: resuming experience collection (45850 times) [2024-06-28 10:11:32,281][06909] Updated weights for policy 0, policy_version 203553 (0.0034) [2024-06-28 10:11:33,850][06674] Fps is (10 sec: 44236.0, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 3335045120. Throughput: 0: 44037.4. Samples: 3237960280. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 10:11:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:11:36,533][06909] Updated weights for policy 0, policy_version 203563 (0.0033) [2024-06-28 10:11:38,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3335258112. Throughput: 0: 44188.3. Samples: 3238230600. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 10:11:38,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 10:11:39,994][06909] Updated weights for policy 0, policy_version 203573 (0.0031) [2024-06-28 10:11:43,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44239.7, 300 sec: 44043.3). Total num frames: 3335487488. Throughput: 0: 44199.0. Samples: 3238368160. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 10:11:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:11:44,033][06909] Updated weights for policy 0, policy_version 203583 (0.0031) [2024-06-28 10:11:47,248][06909] Updated weights for policy 0, policy_version 203593 (0.0043) [2024-06-28 10:11:48,852][06674] Fps is (10 sec: 44227.9, 60 sec: 43966.6, 300 sec: 43986.6). Total num frames: 3335700480. Throughput: 0: 44099.3. Samples: 3238623660. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 10:11:48,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:11:48,868][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000203595_3335700480.pth... [2024-06-28 10:11:48,925][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000202950_3325132800.pth [2024-06-28 10:11:51,222][06909] Updated weights for policy 0, policy_version 203603 (0.0023) [2024-06-28 10:11:53,856][06674] Fps is (10 sec: 45847.6, 60 sec: 44505.3, 300 sec: 44098.0). Total num frames: 3335946240. Throughput: 0: 44265.6. Samples: 3238897400. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 10:11:53,857][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:11:54,576][06909] Updated weights for policy 0, policy_version 203613 (0.0030) [2024-06-28 10:11:58,850][06674] Fps is (10 sec: 45884.9, 60 sec: 44510.2, 300 sec: 44098.0). Total num frames: 3336159232. Throughput: 0: 44315.8. Samples: 3239039600. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 10:11:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:11:58,860][06909] Updated weights for policy 0, policy_version 203623 (0.0024) [2024-06-28 10:12:01,943][06909] Updated weights for policy 0, policy_version 203633 (0.0022) [2024-06-28 10:12:03,850][06674] Fps is (10 sec: 40984.8, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3336355840. Throughput: 0: 44247.0. Samples: 3239294580. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 10:12:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:12:06,239][06909] Updated weights for policy 0, policy_version 203643 (0.0043) [2024-06-28 10:12:08,850][06674] Fps is (10 sec: 47513.1, 60 sec: 44782.9, 300 sec: 44153.5). Total num frames: 3336634368. Throughput: 0: 44227.0. Samples: 3239558820. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 10:12:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:12:09,660][06909] Updated weights for policy 0, policy_version 203653 (0.0044) [2024-06-28 10:12:13,703][06909] Updated weights for policy 0, policy_version 203663 (0.0039) [2024-06-28 10:12:13,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 3336814592. Throughput: 0: 44220.9. Samples: 3239697440. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 10:12:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:12:17,563][06909] Updated weights for policy 0, policy_version 203673 (0.0036) [2024-06-28 10:12:18,850][06674] Fps is (10 sec: 39321.4, 60 sec: 44509.7, 300 sec: 44097.9). Total num frames: 3337027584. Throughput: 0: 44355.1. Samples: 3239956260. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 10:12:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:12:21,097][06909] Updated weights for policy 0, policy_version 203683 (0.0033) [2024-06-28 10:12:23,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 3337273344. Throughput: 0: 44188.6. Samples: 3240219080. Policy #0 lag: (min: 0.0, avg: 12.1, max: 25.0) [2024-06-28 10:12:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:12:24,926][06909] Updated weights for policy 0, policy_version 203693 (0.0039) [2024-06-28 10:12:28,541][06909] Updated weights for policy 0, policy_version 203703 (0.0045) [2024-06-28 10:12:28,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 3337486336. Throughput: 0: 44109.7. Samples: 3240353100. Policy #0 lag: (min: 0.0, avg: 12.1, max: 25.0) [2024-06-28 10:12:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:12:32,228][06909] Updated weights for policy 0, policy_version 203713 (0.0024) [2024-06-28 10:12:33,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43963.8, 300 sec: 44042.7). Total num frames: 3337682944. Throughput: 0: 44193.6. Samples: 3240612280. Policy #0 lag: (min: 0.0, avg: 12.1, max: 25.0) [2024-06-28 10:12:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:12:36,208][06909] Updated weights for policy 0, policy_version 203723 (0.0026) [2024-06-28 10:12:38,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44509.8, 300 sec: 44098.2). Total num frames: 3337928704. Throughput: 0: 43873.4. Samples: 3240871440. Policy #0 lag: (min: 0.0, avg: 12.1, max: 25.0) [2024-06-28 10:12:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:12:39,465][06909] Updated weights for policy 0, policy_version 203733 (0.0031) [2024-06-28 10:12:43,443][06909] Updated weights for policy 0, policy_version 203743 (0.0024) [2024-06-28 10:12:43,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3338125312. Throughput: 0: 43757.7. Samples: 3241008700. Policy #0 lag: (min: 0.0, avg: 12.1, max: 25.0) [2024-06-28 10:12:43,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:12:46,687][06909] Updated weights for policy 0, policy_version 203753 (0.0026) [2024-06-28 10:12:48,850][06674] Fps is (10 sec: 42599.2, 60 sec: 44238.4, 300 sec: 44098.0). Total num frames: 3338354688. Throughput: 0: 44026.8. Samples: 3241275780. Policy #0 lag: (min: 0.0, avg: 12.1, max: 25.0) [2024-06-28 10:12:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:12:51,023][06909] Updated weights for policy 0, policy_version 203763 (0.0032) [2024-06-28 10:12:53,850][06674] Fps is (10 sec: 47513.9, 60 sec: 44241.3, 300 sec: 44153.5). Total num frames: 3338600448. Throughput: 0: 43933.8. Samples: 3241535840. Policy #0 lag: (min: 0.0, avg: 12.1, max: 25.0) [2024-06-28 10:12:53,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:12:54,695][06909] Updated weights for policy 0, policy_version 203773 (0.0037) [2024-06-28 10:12:58,308][06909] Updated weights for policy 0, policy_version 203783 (0.0041) [2024-06-28 10:12:58,850][06674] Fps is (10 sec: 44235.8, 60 sec: 43963.6, 300 sec: 44153.5). Total num frames: 3338797056. Throughput: 0: 43848.7. Samples: 3241670640. Policy #0 lag: (min: 0.0, avg: 12.1, max: 25.0) [2024-06-28 10:12:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:13:01,905][06909] Updated weights for policy 0, policy_version 203793 (0.0029) [2024-06-28 10:13:02,668][06887] Signal inference workers to stop experience collection... (45900 times) [2024-06-28 10:13:02,714][06909] InferenceWorker_p0-w0: stopping experience collection (45900 times) [2024-06-28 10:13:02,724][06887] Signal inference workers to resume experience collection... (45900 times) [2024-06-28 10:13:02,733][06909] InferenceWorker_p0-w0: resuming experience collection (45900 times) [2024-06-28 10:13:03,850][06674] Fps is (10 sec: 40960.0, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 3339010048. Throughput: 0: 44031.2. Samples: 3241937660. Policy #0 lag: (min: 0.0, avg: 12.1, max: 25.0) [2024-06-28 10:13:03,850][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 10:13:05,667][06909] Updated weights for policy 0, policy_version 203803 (0.0041) [2024-06-28 10:13:08,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 3339255808. Throughput: 0: 43975.0. Samples: 3242197960. Policy #0 lag: (min: 0.0, avg: 12.1, max: 25.0) [2024-06-28 10:13:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:13:09,064][06909] Updated weights for policy 0, policy_version 203813 (0.0024) [2024-06-28 10:13:13,214][06909] Updated weights for policy 0, policy_version 203823 (0.0025) [2024-06-28 10:13:13,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 3339468800. Throughput: 0: 44083.6. Samples: 3242336860. Policy #0 lag: (min: 0.0, avg: 12.1, max: 25.0) [2024-06-28 10:13:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:13:16,300][06909] Updated weights for policy 0, policy_version 203833 (0.0024) [2024-06-28 10:13:18,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 3339665408. Throughput: 0: 44016.0. Samples: 3242593000. Policy #0 lag: (min: 0.0, avg: 12.1, max: 25.0) [2024-06-28 10:13:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:13:20,578][06909] Updated weights for policy 0, policy_version 203843 (0.0041) [2024-06-28 10:13:23,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 3339894784. Throughput: 0: 44201.9. Samples: 3242860520. Policy #0 lag: (min: 0.0, avg: 12.1, max: 25.0) [2024-06-28 10:13:23,850][06674] Avg episode reward: [(0, '0.429')] [2024-06-28 10:13:24,148][06909] Updated weights for policy 0, policy_version 203853 (0.0031) [2024-06-28 10:13:27,954][06909] Updated weights for policy 0, policy_version 203863 (0.0029) [2024-06-28 10:13:28,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3340124160. Throughput: 0: 44101.8. Samples: 3242993280. Policy #0 lag: (min: 0.0, avg: 12.1, max: 25.0) [2024-06-28 10:13:28,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:13:31,710][06909] Updated weights for policy 0, policy_version 203873 (0.0037) [2024-06-28 10:13:33,856][06674] Fps is (10 sec: 44211.3, 60 sec: 44232.6, 300 sec: 44152.6). Total num frames: 3340337152. Throughput: 0: 43951.6. Samples: 3243253860. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 10:13:33,856][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:13:35,347][06909] Updated weights for policy 0, policy_version 203883 (0.0044) [2024-06-28 10:13:38,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 3340566528. Throughput: 0: 44107.6. Samples: 3243520680. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 10:13:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:13:39,331][06909] Updated weights for policy 0, policy_version 203893 (0.0028) [2024-06-28 10:13:42,683][06909] Updated weights for policy 0, policy_version 203903 (0.0026) [2024-06-28 10:13:43,850][06674] Fps is (10 sec: 44262.5, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 3340779520. Throughput: 0: 44108.6. Samples: 3243655520. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 10:13:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:13:46,825][06909] Updated weights for policy 0, policy_version 203913 (0.0027) [2024-06-28 10:13:48,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43690.5, 300 sec: 43986.8). Total num frames: 3340976128. Throughput: 0: 44040.8. Samples: 3243919500. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 10:13:48,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:13:48,873][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000203917_3340976128.pth... [2024-06-28 10:13:48,921][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000203273_3330424832.pth [2024-06-28 10:13:50,201][06909] Updated weights for policy 0, policy_version 203923 (0.0044) [2024-06-28 10:13:53,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 3341221888. Throughput: 0: 44031.5. Samples: 3244179380. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 10:13:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:13:54,300][06909] Updated weights for policy 0, policy_version 203933 (0.0029) [2024-06-28 10:13:57,623][06909] Updated weights for policy 0, policy_version 203943 (0.0030) [2024-06-28 10:13:58,850][06674] Fps is (10 sec: 47513.7, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 3341451264. Throughput: 0: 44047.9. Samples: 3244319020. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 10:13:58,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 10:14:01,611][06909] Updated weights for policy 0, policy_version 203953 (0.0046) [2024-06-28 10:14:03,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.7, 300 sec: 44042.7). Total num frames: 3341647872. Throughput: 0: 44121.7. Samples: 3244578480. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 10:14:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:14:05,168][06909] Updated weights for policy 0, policy_version 203963 (0.0043) [2024-06-28 10:14:08,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.7, 300 sec: 44097.9). Total num frames: 3341877248. Throughput: 0: 43947.5. Samples: 3244838160. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 10:14:08,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 10:14:09,214][06909] Updated weights for policy 0, policy_version 203973 (0.0027) [2024-06-28 10:14:12,503][06909] Updated weights for policy 0, policy_version 203983 (0.0029) [2024-06-28 10:14:13,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.6, 300 sec: 44097.9). Total num frames: 3342106624. Throughput: 0: 44131.5. Samples: 3244979200. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 10:14:13,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 10:14:16,433][06909] Updated weights for policy 0, policy_version 203993 (0.0030) [2024-06-28 10:14:18,852][06674] Fps is (10 sec: 42589.7, 60 sec: 43962.2, 300 sec: 44042.1). Total num frames: 3342303232. Throughput: 0: 44162.7. Samples: 3245241020. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 10:14:18,852][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 10:14:19,899][06909] Updated weights for policy 0, policy_version 204003 (0.0039) [2024-06-28 10:14:23,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 3342532608. Throughput: 0: 44082.6. Samples: 3245504400. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 10:14:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:14:24,154][06909] Updated weights for policy 0, policy_version 204013 (0.0030) [2024-06-28 10:14:27,143][06909] Updated weights for policy 0, policy_version 204023 (0.0027) [2024-06-28 10:14:28,850][06674] Fps is (10 sec: 45884.7, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3342761984. Throughput: 0: 44176.8. Samples: 3245643480. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 10:14:28,851][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 10:14:31,291][06909] Updated weights for policy 0, policy_version 204033 (0.0030) [2024-06-28 10:14:32,571][06887] Signal inference workers to stop experience collection... (45950 times) [2024-06-28 10:14:32,571][06887] Signal inference workers to resume experience collection... (45950 times) [2024-06-28 10:14:32,605][06909] InferenceWorker_p0-w0: stopping experience collection (45950 times) [2024-06-28 10:14:32,605][06909] InferenceWorker_p0-w0: resuming experience collection (45950 times) [2024-06-28 10:14:33,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43968.0, 300 sec: 44098.0). Total num frames: 3342974976. Throughput: 0: 44132.2. Samples: 3245905440. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 10:14:33,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 10:14:34,783][06909] Updated weights for policy 0, policy_version 204043 (0.0044) [2024-06-28 10:14:38,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 3343187968. Throughput: 0: 44132.1. Samples: 3246165320. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 10:14:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:14:38,885][06909] Updated weights for policy 0, policy_version 204053 (0.0038) [2024-06-28 10:14:42,341][06909] Updated weights for policy 0, policy_version 204063 (0.0040) [2024-06-28 10:14:43,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3343417344. Throughput: 0: 43918.8. Samples: 3246295360. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 10:14:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:14:46,593][06909] Updated weights for policy 0, policy_version 204073 (0.0033) [2024-06-28 10:14:48,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44509.9, 300 sec: 44097.9). Total num frames: 3343646720. Throughput: 0: 44044.4. Samples: 3246560480. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 10:14:48,850][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 10:14:49,732][06909] Updated weights for policy 0, policy_version 204083 (0.0030) [2024-06-28 10:14:53,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 3343843328. Throughput: 0: 44233.3. Samples: 3246828660. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 10:14:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:14:53,968][06909] Updated weights for policy 0, policy_version 204093 (0.0026) [2024-06-28 10:14:57,061][06909] Updated weights for policy 0, policy_version 204103 (0.0033) [2024-06-28 10:14:58,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 3344089088. Throughput: 0: 43993.8. Samples: 3246958920. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 10:14:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:15:01,429][06909] Updated weights for policy 0, policy_version 204113 (0.0038) [2024-06-28 10:15:03,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3344302080. Throughput: 0: 43957.0. Samples: 3247219000. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 10:15:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:15:04,582][06909] Updated weights for policy 0, policy_version 204123 (0.0029) [2024-06-28 10:15:08,710][06909] Updated weights for policy 0, policy_version 204133 (0.0036) [2024-06-28 10:15:08,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3344515072. Throughput: 0: 43944.0. Samples: 3247481880. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 10:15:08,854][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:15:11,978][06909] Updated weights for policy 0, policy_version 204143 (0.0019) [2024-06-28 10:15:13,850][06674] Fps is (10 sec: 44237.6, 60 sec: 43963.9, 300 sec: 44098.0). Total num frames: 3344744448. Throughput: 0: 43799.2. Samples: 3247614440. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 10:15:13,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 10:15:16,537][06909] Updated weights for policy 0, policy_version 204153 (0.0025) [2024-06-28 10:15:18,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44511.4, 300 sec: 44042.4). Total num frames: 3344973824. Throughput: 0: 43880.7. Samples: 3247880080. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 10:15:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:15:19,445][06909] Updated weights for policy 0, policy_version 204163 (0.0023) [2024-06-28 10:15:23,719][06909] Updated weights for policy 0, policy_version 204173 (0.0036) [2024-06-28 10:15:23,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43963.8, 300 sec: 44042.7). Total num frames: 3345170432. Throughput: 0: 44055.1. Samples: 3248147800. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 10:15:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:15:26,828][06909] Updated weights for policy 0, policy_version 204183 (0.0025) [2024-06-28 10:15:28,856][06674] Fps is (10 sec: 42573.0, 60 sec: 43959.3, 300 sec: 44097.0). Total num frames: 3345399808. Throughput: 0: 44032.3. Samples: 3248277080. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 10:15:28,856][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 10:15:31,345][06909] Updated weights for policy 0, policy_version 204193 (0.0045) [2024-06-28 10:15:33,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 3345645568. Throughput: 0: 44068.9. Samples: 3248543580. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 10:15:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:15:34,058][06909] Updated weights for policy 0, policy_version 204203 (0.0027) [2024-06-28 10:15:38,603][06909] Updated weights for policy 0, policy_version 204213 (0.0032) [2024-06-28 10:15:38,850][06674] Fps is (10 sec: 42623.7, 60 sec: 43963.6, 300 sec: 44043.0). Total num frames: 3345825792. Throughput: 0: 43849.7. Samples: 3248801900. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 10:15:38,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:15:41,793][06909] Updated weights for policy 0, policy_version 204223 (0.0031) [2024-06-28 10:15:43,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43963.7, 300 sec: 44043.3). Total num frames: 3346055168. Throughput: 0: 43886.3. Samples: 3248933800. Policy #0 lag: (min: 0.0, avg: 10.4, max: 25.0) [2024-06-28 10:15:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:15:46,026][06909] Updated weights for policy 0, policy_version 204233 (0.0030) [2024-06-28 10:15:48,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 3346284544. Throughput: 0: 43955.6. Samples: 3249197000. Policy #0 lag: (min: 0.0, avg: 10.4, max: 25.0) [2024-06-28 10:15:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:15:48,855][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000204241_3346284544.pth... [2024-06-28 10:15:48,907][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000203595_3335700480.pth [2024-06-28 10:15:49,373][06909] Updated weights for policy 0, policy_version 204243 (0.0040) [2024-06-28 10:15:53,471][06909] Updated weights for policy 0, policy_version 204253 (0.0030) [2024-06-28 10:15:53,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43963.7, 300 sec: 44042.5). Total num frames: 3346481152. Throughput: 0: 43959.1. Samples: 3249460040. Policy #0 lag: (min: 0.0, avg: 10.4, max: 25.0) [2024-06-28 10:15:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:15:56,815][06909] Updated weights for policy 0, policy_version 204263 (0.0032) [2024-06-28 10:15:58,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 3346726912. Throughput: 0: 43814.2. Samples: 3249586080. Policy #0 lag: (min: 0.0, avg: 10.4, max: 25.0) [2024-06-28 10:15:58,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 10:16:01,182][06909] Updated weights for policy 0, policy_version 204273 (0.0035) [2024-06-28 10:16:03,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3346939904. Throughput: 0: 43938.6. Samples: 3249857320. Policy #0 lag: (min: 0.0, avg: 10.4, max: 25.0) [2024-06-28 10:16:03,850][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 10:16:04,239][06909] Updated weights for policy 0, policy_version 204283 (0.0027) [2024-06-28 10:16:08,471][06909] Updated weights for policy 0, policy_version 204293 (0.0039) [2024-06-28 10:16:08,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43690.8, 300 sec: 43986.9). Total num frames: 3347136512. Throughput: 0: 43927.6. Samples: 3250124540. Policy #0 lag: (min: 0.0, avg: 10.4, max: 25.0) [2024-06-28 10:16:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:16:11,685][06909] Updated weights for policy 0, policy_version 204303 (0.0044) [2024-06-28 10:16:11,964][06887] Signal inference workers to stop experience collection... (46000 times) [2024-06-28 10:16:11,965][06887] Signal inference workers to resume experience collection... (46000 times) [2024-06-28 10:16:11,977][06909] InferenceWorker_p0-w0: stopping experience collection (46000 times) [2024-06-28 10:16:11,977][06909] InferenceWorker_p0-w0: resuming experience collection (46000 times) [2024-06-28 10:16:13,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.6, 300 sec: 44097.9). Total num frames: 3347365888. Throughput: 0: 43847.6. Samples: 3250249960. Policy #0 lag: (min: 0.0, avg: 10.4, max: 25.0) [2024-06-28 10:16:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:16:15,645][06909] Updated weights for policy 0, policy_version 204313 (0.0037) [2024-06-28 10:16:18,850][06674] Fps is (10 sec: 47513.1, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 3347611648. Throughput: 0: 43920.0. Samples: 3250519980. Policy #0 lag: (min: 0.0, avg: 10.4, max: 25.0) [2024-06-28 10:16:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:16:19,045][06909] Updated weights for policy 0, policy_version 204323 (0.0027) [2024-06-28 10:16:22,864][06909] Updated weights for policy 0, policy_version 204333 (0.0038) [2024-06-28 10:16:23,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3347808256. Throughput: 0: 44048.1. Samples: 3250784060. Policy #0 lag: (min: 0.0, avg: 10.4, max: 25.0) [2024-06-28 10:16:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:16:26,623][06909] Updated weights for policy 0, policy_version 204343 (0.0036) [2024-06-28 10:16:28,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43968.1, 300 sec: 44042.4). Total num frames: 3348037632. Throughput: 0: 43935.0. Samples: 3250910880. Policy #0 lag: (min: 0.0, avg: 10.4, max: 25.0) [2024-06-28 10:16:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 10:16:30,765][06909] Updated weights for policy 0, policy_version 204353 (0.0028) [2024-06-28 10:16:33,749][06909] Updated weights for policy 0, policy_version 204363 (0.0027) [2024-06-28 10:16:33,850][06674] Fps is (10 sec: 47513.5, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 3348283392. Throughput: 0: 44143.1. Samples: 3251183440. Policy #0 lag: (min: 0.0, avg: 10.4, max: 25.0) [2024-06-28 10:16:33,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:16:37,967][06909] Updated weights for policy 0, policy_version 204373 (0.0029) [2024-06-28 10:16:38,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3348480000. Throughput: 0: 44276.8. Samples: 3251452500. Policy #0 lag: (min: 0.0, avg: 10.4, max: 25.0) [2024-06-28 10:16:38,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:16:41,050][06909] Updated weights for policy 0, policy_version 204383 (0.0035) [2024-06-28 10:16:43,850][06674] Fps is (10 sec: 42598.8, 60 sec: 44236.8, 300 sec: 44098.3). Total num frames: 3348709376. Throughput: 0: 44326.7. Samples: 3251580780. Policy #0 lag: (min: 0.0, avg: 10.4, max: 25.0) [2024-06-28 10:16:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:16:45,085][06909] Updated weights for policy 0, policy_version 204393 (0.0039) [2024-06-28 10:16:48,589][06909] Updated weights for policy 0, policy_version 204403 (0.0032) [2024-06-28 10:16:48,850][06674] Fps is (10 sec: 45875.8, 60 sec: 44236.8, 300 sec: 44043.3). Total num frames: 3348938752. Throughput: 0: 44396.1. Samples: 3251855140. Policy #0 lag: (min: 0.0, avg: 10.4, max: 25.0) [2024-06-28 10:16:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:16:52,543][06909] Updated weights for policy 0, policy_version 204413 (0.0040) [2024-06-28 10:16:53,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 3349135360. Throughput: 0: 44099.5. Samples: 3252109020. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 10:16:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:16:56,174][06909] Updated weights for policy 0, policy_version 204423 (0.0022) [2024-06-28 10:16:58,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 3349364736. Throughput: 0: 44237.8. Samples: 3252240660. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 10:16:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 10:16:59,714][06909] Updated weights for policy 0, policy_version 204433 (0.0036) [2024-06-28 10:17:03,727][06909] Updated weights for policy 0, policy_version 204443 (0.0038) [2024-06-28 10:17:03,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.9, 300 sec: 43931.3). Total num frames: 3349594112. Throughput: 0: 44133.8. Samples: 3252506000. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 10:17:03,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 10:17:07,968][06909] Updated weights for policy 0, policy_version 204453 (0.0040) [2024-06-28 10:17:08,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 3349774336. Throughput: 0: 44107.6. Samples: 3252768900. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 10:17:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 10:17:11,149][06909] Updated weights for policy 0, policy_version 204463 (0.0027) [2024-06-28 10:17:13,850][06674] Fps is (10 sec: 42598.0, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3350020096. Throughput: 0: 44032.0. Samples: 3252892320. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 10:17:13,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 10:17:15,127][06909] Updated weights for policy 0, policy_version 204473 (0.0031) [2024-06-28 10:17:18,657][06909] Updated weights for policy 0, policy_version 204483 (0.0026) [2024-06-28 10:17:18,850][06674] Fps is (10 sec: 49151.3, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3350265856. Throughput: 0: 44140.4. Samples: 3253169760. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 10:17:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 10:17:22,479][06909] Updated weights for policy 0, policy_version 204493 (0.0027) [2024-06-28 10:17:23,850][06674] Fps is (10 sec: 42597.7, 60 sec: 43963.6, 300 sec: 43931.3). Total num frames: 3350446080. Throughput: 0: 43912.4. Samples: 3253428560. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 10:17:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:17:26,358][06909] Updated weights for policy 0, policy_version 204503 (0.0038) [2024-06-28 10:17:28,852][06674] Fps is (10 sec: 42590.0, 60 sec: 44235.3, 300 sec: 44097.6). Total num frames: 3350691840. Throughput: 0: 43905.1. Samples: 3253556600. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 10:17:28,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:17:29,674][06909] Updated weights for policy 0, policy_version 204513 (0.0028) [2024-06-28 10:17:33,755][06909] Updated weights for policy 0, policy_version 204523 (0.0044) [2024-06-28 10:17:33,850][06674] Fps is (10 sec: 45876.0, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3350904832. Throughput: 0: 43872.0. Samples: 3253829380. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 10:17:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:17:33,855][06887] Signal inference workers to stop experience collection... (46050 times) [2024-06-28 10:17:33,909][06909] InferenceWorker_p0-w0: stopping experience collection (46050 times) [2024-06-28 10:17:33,917][06887] Signal inference workers to resume experience collection... (46050 times) [2024-06-28 10:17:33,927][06909] InferenceWorker_p0-w0: resuming experience collection (46050 times) [2024-06-28 10:17:37,724][06909] Updated weights for policy 0, policy_version 204533 (0.0031) [2024-06-28 10:17:38,850][06674] Fps is (10 sec: 40968.8, 60 sec: 43690.8, 300 sec: 43986.9). Total num frames: 3351101440. Throughput: 0: 43960.5. Samples: 3254087240. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 10:17:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:17:41,126][06909] Updated weights for policy 0, policy_version 204543 (0.0021) [2024-06-28 10:17:43,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3351347200. Throughput: 0: 43673.7. Samples: 3254205980. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 10:17:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:17:45,289][06909] Updated weights for policy 0, policy_version 204553 (0.0037) [2024-06-28 10:17:48,344][06909] Updated weights for policy 0, policy_version 204563 (0.0045) [2024-06-28 10:17:48,850][06674] Fps is (10 sec: 47512.7, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3351576576. Throughput: 0: 44030.6. Samples: 3254487380. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 10:17:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 10:17:48,865][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000204565_3351592960.pth... [2024-06-28 10:17:48,919][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000203917_3340976128.pth [2024-06-28 10:17:52,483][06909] Updated weights for policy 0, policy_version 204573 (0.0031) [2024-06-28 10:17:53,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3351773184. Throughput: 0: 43902.3. Samples: 3254744500. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 10:17:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:17:56,026][06909] Updated weights for policy 0, policy_version 204583 (0.0025) [2024-06-28 10:17:58,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3352002560. Throughput: 0: 43924.9. Samples: 3254868940. Policy #0 lag: (min: 1.0, avg: 11.1, max: 22.0) [2024-06-28 10:17:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:17:59,673][06909] Updated weights for policy 0, policy_version 204593 (0.0030) [2024-06-28 10:18:03,796][06909] Updated weights for policy 0, policy_version 204603 (0.0026) [2024-06-28 10:18:03,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.7, 300 sec: 43931.4). Total num frames: 3352215552. Throughput: 0: 43760.6. Samples: 3255138980. Policy #0 lag: (min: 1.0, avg: 11.1, max: 22.0) [2024-06-28 10:18:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:18:07,129][06909] Updated weights for policy 0, policy_version 204613 (0.0034) [2024-06-28 10:18:08,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44509.7, 300 sec: 43986.9). Total num frames: 3352444928. Throughput: 0: 43853.4. Samples: 3255401960. Policy #0 lag: (min: 1.0, avg: 11.1, max: 22.0) [2024-06-28 10:18:08,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:18:11,199][06909] Updated weights for policy 0, policy_version 204623 (0.0040) [2024-06-28 10:18:13,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3352657920. Throughput: 0: 43753.5. Samples: 3255525420. Policy #0 lag: (min: 1.0, avg: 11.1, max: 22.0) [2024-06-28 10:18:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 10:18:15,053][06909] Updated weights for policy 0, policy_version 204633 (0.0042) [2024-06-28 10:18:18,372][06909] Updated weights for policy 0, policy_version 204643 (0.0036) [2024-06-28 10:18:18,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 3352887296. Throughput: 0: 43808.5. Samples: 3255800760. Policy #0 lag: (min: 1.0, avg: 11.1, max: 22.0) [2024-06-28 10:18:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:18:22,361][06909] Updated weights for policy 0, policy_version 204653 (0.0031) [2024-06-28 10:18:23,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 3353083904. Throughput: 0: 43788.7. Samples: 3256057740. Policy #0 lag: (min: 1.0, avg: 11.1, max: 22.0) [2024-06-28 10:18:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:18:26,026][06909] Updated weights for policy 0, policy_version 204663 (0.0031) [2024-06-28 10:18:28,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43692.1, 300 sec: 43987.7). Total num frames: 3353313280. Throughput: 0: 43987.5. Samples: 3256185420. Policy #0 lag: (min: 1.0, avg: 11.1, max: 22.0) [2024-06-28 10:18:28,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:18:29,472][06909] Updated weights for policy 0, policy_version 204673 (0.0037) [2024-06-28 10:18:33,230][06909] Updated weights for policy 0, policy_version 204683 (0.0031) [2024-06-28 10:18:33,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3353542656. Throughput: 0: 43840.9. Samples: 3256460220. Policy #0 lag: (min: 1.0, avg: 11.1, max: 22.0) [2024-06-28 10:18:33,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:18:36,728][06909] Updated weights for policy 0, policy_version 204693 (0.0038) [2024-06-28 10:18:38,850][06674] Fps is (10 sec: 44236.2, 60 sec: 44236.6, 300 sec: 43986.8). Total num frames: 3353755648. Throughput: 0: 43832.6. Samples: 3256716980. Policy #0 lag: (min: 1.0, avg: 11.1, max: 22.0) [2024-06-28 10:18:38,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:18:41,014][06909] Updated weights for policy 0, policy_version 204703 (0.0027) [2024-06-28 10:18:43,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 3353985024. Throughput: 0: 43987.1. Samples: 3256848360. Policy #0 lag: (min: 1.0, avg: 11.1, max: 22.0) [2024-06-28 10:18:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:18:44,228][06909] Updated weights for policy 0, policy_version 204713 (0.0030) [2024-06-28 10:18:48,148][06887] Signal inference workers to stop experience collection... (46100 times) [2024-06-28 10:18:48,181][06909] InferenceWorker_p0-w0: stopping experience collection (46100 times) [2024-06-28 10:18:48,205][06887] Signal inference workers to resume experience collection... (46100 times) [2024-06-28 10:18:48,205][06909] InferenceWorker_p0-w0: resuming experience collection (46100 times) [2024-06-28 10:18:48,212][06909] Updated weights for policy 0, policy_version 204723 (0.0038) [2024-06-28 10:18:48,850][06674] Fps is (10 sec: 45875.8, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3354214400. Throughput: 0: 44104.3. Samples: 3257123680. Policy #0 lag: (min: 1.0, avg: 11.1, max: 22.0) [2024-06-28 10:18:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:18:52,181][06909] Updated weights for policy 0, policy_version 204733 (0.0036) [2024-06-28 10:18:53,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.7, 300 sec: 43931.4). Total num frames: 3354411008. Throughput: 0: 44118.8. Samples: 3257387300. Policy #0 lag: (min: 1.0, avg: 11.1, max: 22.0) [2024-06-28 10:18:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:18:55,498][06909] Updated weights for policy 0, policy_version 204743 (0.0026) [2024-06-28 10:18:58,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 3354624000. Throughput: 0: 44152.0. Samples: 3257512260. Policy #0 lag: (min: 1.0, avg: 11.1, max: 22.0) [2024-06-28 10:18:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:18:59,246][06909] Updated weights for policy 0, policy_version 204753 (0.0023) [2024-06-28 10:19:02,961][06909] Updated weights for policy 0, policy_version 204763 (0.0040) [2024-06-28 10:19:03,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 3354869760. Throughput: 0: 44049.8. Samples: 3257783000. Policy #0 lag: (min: 1.0, avg: 11.1, max: 22.0) [2024-06-28 10:19:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:19:06,546][06909] Updated weights for policy 0, policy_version 204773 (0.0037) [2024-06-28 10:19:08,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 3355066368. Throughput: 0: 44215.1. Samples: 3258047420. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 10:19:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:19:10,236][06909] Updated weights for policy 0, policy_version 204783 (0.0030) [2024-06-28 10:19:13,856][06674] Fps is (10 sec: 44210.3, 60 sec: 44232.4, 300 sec: 44097.4). Total num frames: 3355312128. Throughput: 0: 44315.0. Samples: 3258179860. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 10:19:13,856][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:19:14,095][06909] Updated weights for policy 0, policy_version 204793 (0.0027) [2024-06-28 10:19:17,823][06909] Updated weights for policy 0, policy_version 204803 (0.0034) [2024-06-28 10:19:18,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3355525120. Throughput: 0: 44210.3. Samples: 3258449680. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 10:19:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:19:21,722][06909] Updated weights for policy 0, policy_version 204813 (0.0030) [2024-06-28 10:19:23,850][06674] Fps is (10 sec: 42624.3, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 3355738112. Throughput: 0: 44267.4. Samples: 3258709000. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 10:19:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:19:25,171][06909] Updated weights for policy 0, policy_version 204823 (0.0036) [2024-06-28 10:19:28,850][06674] Fps is (10 sec: 42597.7, 60 sec: 43963.7, 300 sec: 43986.8). Total num frames: 3355951104. Throughput: 0: 44207.0. Samples: 3258837680. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 10:19:28,851][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 10:19:29,212][06909] Updated weights for policy 0, policy_version 204833 (0.0035) [2024-06-28 10:19:32,959][06909] Updated weights for policy 0, policy_version 204843 (0.0027) [2024-06-28 10:19:33,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 3356196864. Throughput: 0: 43905.0. Samples: 3259099400. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 10:19:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:19:36,500][06909] Updated weights for policy 0, policy_version 204853 (0.0027) [2024-06-28 10:19:38,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 3356409856. Throughput: 0: 44104.3. Samples: 3259372000. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 10:19:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:19:40,107][06909] Updated weights for policy 0, policy_version 204863 (0.0025) [2024-06-28 10:19:43,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3356622848. Throughput: 0: 44224.5. Samples: 3259502360. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 10:19:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:19:44,009][06909] Updated weights for policy 0, policy_version 204873 (0.0033) [2024-06-28 10:19:47,713][06909] Updated weights for policy 0, policy_version 204883 (0.0033) [2024-06-28 10:19:48,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 3356835840. Throughput: 0: 44040.4. Samples: 3259764820. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 10:19:48,851][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 10:19:48,985][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000204886_3356852224.pth... [2024-06-28 10:19:49,037][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000204241_3346284544.pth [2024-06-28 10:19:51,450][06909] Updated weights for policy 0, policy_version 204893 (0.0033) [2024-06-28 10:19:53,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43963.8, 300 sec: 43931.4). Total num frames: 3357048832. Throughput: 0: 43992.6. Samples: 3260027080. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 10:19:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:19:55,102][06909] Updated weights for policy 0, policy_version 204903 (0.0028) [2024-06-28 10:19:58,856][06674] Fps is (10 sec: 44210.5, 60 sec: 44232.4, 300 sec: 43986.0). Total num frames: 3357278208. Throughput: 0: 44004.0. Samples: 3260160040. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 10:19:58,856][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:19:58,996][06909] Updated weights for policy 0, policy_version 204913 (0.0034) [2024-06-28 10:20:02,117][06887] Signal inference workers to stop experience collection... (46150 times) [2024-06-28 10:20:02,118][06887] Signal inference workers to resume experience collection... (46150 times) [2024-06-28 10:20:02,145][06909] InferenceWorker_p0-w0: stopping experience collection (46150 times) [2024-06-28 10:20:02,145][06909] InferenceWorker_p0-w0: resuming experience collection (46150 times) [2024-06-28 10:20:02,251][06909] Updated weights for policy 0, policy_version 204923 (0.0037) [2024-06-28 10:20:03,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3357507584. Throughput: 0: 43901.3. Samples: 3260425240. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 10:20:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:20:06,253][06909] Updated weights for policy 0, policy_version 204933 (0.0038) [2024-06-28 10:20:08,850][06674] Fps is (10 sec: 45903.0, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 3357736960. Throughput: 0: 44140.8. Samples: 3260695340. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 10:20:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:20:09,914][06909] Updated weights for policy 0, policy_version 204943 (0.0037) [2024-06-28 10:20:13,738][06909] Updated weights for policy 0, policy_version 204953 (0.0030) [2024-06-28 10:20:13,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43968.1, 300 sec: 43986.9). Total num frames: 3357949952. Throughput: 0: 44246.4. Samples: 3260828760. Policy #0 lag: (min: 1.0, avg: 10.2, max: 22.0) [2024-06-28 10:20:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:20:16,962][06909] Updated weights for policy 0, policy_version 204963 (0.0020) [2024-06-28 10:20:18,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3358162944. Throughput: 0: 44405.0. Samples: 3261097620. Policy #0 lag: (min: 1.0, avg: 10.2, max: 22.0) [2024-06-28 10:20:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:20:20,982][06909] Updated weights for policy 0, policy_version 204973 (0.0030) [2024-06-28 10:20:23,856][06674] Fps is (10 sec: 44210.3, 60 sec: 44232.3, 300 sec: 44042.4). Total num frames: 3358392320. Throughput: 0: 44054.7. Samples: 3261354720. Policy #0 lag: (min: 1.0, avg: 10.2, max: 22.0) [2024-06-28 10:20:23,856][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:20:24,688][06909] Updated weights for policy 0, policy_version 204983 (0.0028) [2024-06-28 10:20:28,723][06909] Updated weights for policy 0, policy_version 204993 (0.0024) [2024-06-28 10:20:28,850][06674] Fps is (10 sec: 44235.8, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 3358605312. Throughput: 0: 44219.9. Samples: 3261492260. Policy #0 lag: (min: 1.0, avg: 10.2, max: 22.0) [2024-06-28 10:20:28,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:20:31,964][06909] Updated weights for policy 0, policy_version 205003 (0.0045) [2024-06-28 10:20:33,850][06674] Fps is (10 sec: 44263.6, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 3358834688. Throughput: 0: 44206.4. Samples: 3261754100. Policy #0 lag: (min: 1.0, avg: 10.2, max: 22.0) [2024-06-28 10:20:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:20:36,205][06909] Updated weights for policy 0, policy_version 205013 (0.0037) [2024-06-28 10:20:38,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 3359064064. Throughput: 0: 44239.4. Samples: 3262017860. Policy #0 lag: (min: 1.0, avg: 10.2, max: 22.0) [2024-06-28 10:20:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:20:39,274][06909] Updated weights for policy 0, policy_version 205023 (0.0038) [2024-06-28 10:20:43,363][06909] Updated weights for policy 0, policy_version 205033 (0.0031) [2024-06-28 10:20:43,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3359277056. Throughput: 0: 44276.1. Samples: 3262152200. Policy #0 lag: (min: 1.0, avg: 10.2, max: 22.0) [2024-06-28 10:20:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:20:46,723][06909] Updated weights for policy 0, policy_version 205043 (0.0028) [2024-06-28 10:20:48,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44510.0, 300 sec: 44153.5). Total num frames: 3359506432. Throughput: 0: 44446.2. Samples: 3262425320. Policy #0 lag: (min: 1.0, avg: 10.2, max: 22.0) [2024-06-28 10:20:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 10:20:50,905][06909] Updated weights for policy 0, policy_version 205053 (0.0030) [2024-06-28 10:20:53,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44782.8, 300 sec: 44097.9). Total num frames: 3359735808. Throughput: 0: 44093.7. Samples: 3262679560. Policy #0 lag: (min: 1.0, avg: 10.2, max: 22.0) [2024-06-28 10:20:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:20:54,225][06909] Updated weights for policy 0, policy_version 205063 (0.0036) [2024-06-28 10:20:58,226][06909] Updated weights for policy 0, policy_version 205073 (0.0032) [2024-06-28 10:20:58,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44241.3, 300 sec: 44042.4). Total num frames: 3359932416. Throughput: 0: 44168.5. Samples: 3262816340. Policy #0 lag: (min: 1.0, avg: 10.2, max: 22.0) [2024-06-28 10:20:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:21:01,709][06909] Updated weights for policy 0, policy_version 205083 (0.0038) [2024-06-28 10:21:03,850][06674] Fps is (10 sec: 42599.1, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 3360161792. Throughput: 0: 44125.7. Samples: 3263083280. Policy #0 lag: (min: 1.0, avg: 10.2, max: 22.0) [2024-06-28 10:21:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:21:05,614][06909] Updated weights for policy 0, policy_version 205093 (0.0033) [2024-06-28 10:21:08,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 3360391168. Throughput: 0: 44152.6. Samples: 3263341320. Policy #0 lag: (min: 1.0, avg: 10.2, max: 22.0) [2024-06-28 10:21:08,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 10:21:09,091][06909] Updated weights for policy 0, policy_version 205103 (0.0036) [2024-06-28 10:21:13,053][06909] Updated weights for policy 0, policy_version 205113 (0.0046) [2024-06-28 10:21:13,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3360587776. Throughput: 0: 44035.2. Samples: 3263473840. Policy #0 lag: (min: 1.0, avg: 10.2, max: 22.0) [2024-06-28 10:21:13,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 10:21:16,520][06909] Updated weights for policy 0, policy_version 205123 (0.0036) [2024-06-28 10:21:18,850][06674] Fps is (10 sec: 42598.1, 60 sec: 44236.7, 300 sec: 44098.0). Total num frames: 3360817152. Throughput: 0: 44063.9. Samples: 3263736980. Policy #0 lag: (min: 1.0, avg: 10.2, max: 22.0) [2024-06-28 10:21:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:21:20,570][06909] Updated weights for policy 0, policy_version 205133 (0.0033) [2024-06-28 10:21:23,804][06909] Updated weights for policy 0, policy_version 205143 (0.0030) [2024-06-28 10:21:23,850][06674] Fps is (10 sec: 47513.0, 60 sec: 44514.2, 300 sec: 44153.5). Total num frames: 3361062912. Throughput: 0: 44082.2. Samples: 3264001560. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 10:21:23,856][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:21:27,728][06909] Updated weights for policy 0, policy_version 205153 (0.0031) [2024-06-28 10:21:28,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3361259520. Throughput: 0: 44180.8. Samples: 3264140340. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 10:21:28,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:21:31,384][06909] Updated weights for policy 0, policy_version 205163 (0.0035) [2024-06-28 10:21:33,850][06674] Fps is (10 sec: 40957.9, 60 sec: 43963.2, 300 sec: 44042.3). Total num frames: 3361472512. Throughput: 0: 44040.3. Samples: 3264407160. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 10:21:33,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:21:34,853][06909] Updated weights for policy 0, policy_version 205173 (0.0023) [2024-06-28 10:21:38,602][06909] Updated weights for policy 0, policy_version 205183 (0.0030) [2024-06-28 10:21:38,852][06674] Fps is (10 sec: 45866.0, 60 sec: 44235.3, 300 sec: 44097.6). Total num frames: 3361718272. Throughput: 0: 44282.1. Samples: 3264672340. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 10:21:38,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:21:42,649][06909] Updated weights for policy 0, policy_version 205193 (0.0034) [2024-06-28 10:21:43,850][06674] Fps is (10 sec: 45878.3, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 3361931264. Throughput: 0: 44293.3. Samples: 3264809540. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 10:21:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:21:46,042][06909] Updated weights for policy 0, policy_version 205203 (0.0027) [2024-06-28 10:21:48,850][06674] Fps is (10 sec: 40968.2, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 3362127872. Throughput: 0: 44130.5. Samples: 3265069160. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 10:21:48,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:21:48,860][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000205208_3362127872.pth... [2024-06-28 10:21:48,909][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000204565_3351592960.pth [2024-06-28 10:21:49,865][06909] Updated weights for policy 0, policy_version 205213 (0.0023) [2024-06-28 10:21:52,494][06887] Signal inference workers to stop experience collection... (46200 times) [2024-06-28 10:21:52,496][06887] Signal inference workers to resume experience collection... (46200 times) [2024-06-28 10:21:52,507][06909] InferenceWorker_p0-w0: stopping experience collection (46200 times) [2024-06-28 10:21:52,522][06909] InferenceWorker_p0-w0: resuming experience collection (46200 times) [2024-06-28 10:21:53,318][06909] Updated weights for policy 0, policy_version 205223 (0.0049) [2024-06-28 10:21:53,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.9, 300 sec: 44098.0). Total num frames: 3362373632. Throughput: 0: 44177.8. Samples: 3265329320. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 10:21:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:21:57,232][06909] Updated weights for policy 0, policy_version 205233 (0.0024) [2024-06-28 10:21:58,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 3362586624. Throughput: 0: 44443.9. Samples: 3265473820. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 10:21:58,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 10:22:00,771][06909] Updated weights for policy 0, policy_version 205243 (0.0030) [2024-06-28 10:22:03,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 3362799616. Throughput: 0: 44385.7. Samples: 3265734340. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 10:22:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 10:22:05,030][06909] Updated weights for policy 0, policy_version 205253 (0.0030) [2024-06-28 10:22:08,504][06909] Updated weights for policy 0, policy_version 205263 (0.0034) [2024-06-28 10:22:08,852][06674] Fps is (10 sec: 45866.0, 60 sec: 44235.2, 300 sec: 44153.2). Total num frames: 3363045376. Throughput: 0: 44361.2. Samples: 3265997900. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 10:22:08,852][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 10:22:12,315][06909] Updated weights for policy 0, policy_version 205273 (0.0029) [2024-06-28 10:22:13,850][06674] Fps is (10 sec: 45875.8, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 3363258368. Throughput: 0: 44305.5. Samples: 3266134080. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 10:22:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 10:22:15,769][06909] Updated weights for policy 0, policy_version 205283 (0.0037) [2024-06-28 10:22:18,850][06674] Fps is (10 sec: 42606.6, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 3363471360. Throughput: 0: 44106.2. Samples: 3266391920. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 10:22:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:22:19,575][06909] Updated weights for policy 0, policy_version 205293 (0.0043) [2024-06-28 10:22:23,657][06909] Updated weights for policy 0, policy_version 205303 (0.0032) [2024-06-28 10:22:23,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.8, 300 sec: 44098.3). Total num frames: 3363700736. Throughput: 0: 44068.3. Samples: 3266655320. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 10:22:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:22:27,043][06909] Updated weights for policy 0, policy_version 205313 (0.0034) [2024-06-28 10:22:28,856][06674] Fps is (10 sec: 44210.5, 60 sec: 44232.4, 300 sec: 44097.1). Total num frames: 3363913728. Throughput: 0: 44028.2. Samples: 3266791080. Policy #0 lag: (min: 0.0, avg: 11.8, max: 24.0) [2024-06-28 10:22:28,857][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:22:30,946][06909] Updated weights for policy 0, policy_version 205323 (0.0034) [2024-06-28 10:22:33,850][06674] Fps is (10 sec: 42598.7, 60 sec: 44237.3, 300 sec: 44153.5). Total num frames: 3364126720. Throughput: 0: 44062.8. Samples: 3267051980. Policy #0 lag: (min: 0.0, avg: 11.8, max: 24.0) [2024-06-28 10:22:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:22:34,546][06909] Updated weights for policy 0, policy_version 205333 (0.0030) [2024-06-28 10:22:38,443][06909] Updated weights for policy 0, policy_version 205343 (0.0041) [2024-06-28 10:22:38,850][06674] Fps is (10 sec: 45902.9, 60 sec: 44238.3, 300 sec: 44153.5). Total num frames: 3364372480. Throughput: 0: 44313.7. Samples: 3267323440. Policy #0 lag: (min: 0.0, avg: 11.8, max: 24.0) [2024-06-28 10:22:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:22:42,127][06909] Updated weights for policy 0, policy_version 205353 (0.0040) [2024-06-28 10:22:43,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3364569088. Throughput: 0: 44075.7. Samples: 3267457220. Policy #0 lag: (min: 0.0, avg: 11.8, max: 24.0) [2024-06-28 10:22:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:22:46,257][06909] Updated weights for policy 0, policy_version 205363 (0.0048) [2024-06-28 10:22:48,850][06674] Fps is (10 sec: 40960.0, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 3364782080. Throughput: 0: 43919.1. Samples: 3267710700. Policy #0 lag: (min: 0.0, avg: 11.8, max: 24.0) [2024-06-28 10:22:48,852][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 10:22:49,493][06909] Updated weights for policy 0, policy_version 205373 (0.0034) [2024-06-28 10:22:53,497][06909] Updated weights for policy 0, policy_version 205383 (0.0035) [2024-06-28 10:22:53,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 3365011456. Throughput: 0: 44152.8. Samples: 3267984680. Policy #0 lag: (min: 0.0, avg: 11.8, max: 24.0) [2024-06-28 10:22:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:22:56,784][06909] Updated weights for policy 0, policy_version 205393 (0.0045) [2024-06-28 10:22:58,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 3365240832. Throughput: 0: 44013.1. Samples: 3268114680. Policy #0 lag: (min: 0.0, avg: 11.8, max: 24.0) [2024-06-28 10:22:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:22:59,692][06887] Signal inference workers to stop experience collection... (46250 times) [2024-06-28 10:22:59,693][06887] Signal inference workers to resume experience collection... (46250 times) [2024-06-28 10:22:59,710][06909] InferenceWorker_p0-w0: stopping experience collection (46250 times) [2024-06-28 10:22:59,710][06909] InferenceWorker_p0-w0: resuming experience collection (46250 times) [2024-06-28 10:23:00,747][06909] Updated weights for policy 0, policy_version 205403 (0.0040) [2024-06-28 10:23:03,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 3365453824. Throughput: 0: 44029.9. Samples: 3268373260. Policy #0 lag: (min: 0.0, avg: 11.8, max: 24.0) [2024-06-28 10:23:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:23:04,428][06909] Updated weights for policy 0, policy_version 205413 (0.0029) [2024-06-28 10:23:08,355][06909] Updated weights for policy 0, policy_version 205423 (0.0030) [2024-06-28 10:23:08,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43965.1, 300 sec: 44153.5). Total num frames: 3365683200. Throughput: 0: 44213.2. Samples: 3268644920. Policy #0 lag: (min: 0.0, avg: 11.8, max: 24.0) [2024-06-28 10:23:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:23:11,833][06909] Updated weights for policy 0, policy_version 205433 (0.0042) [2024-06-28 10:23:13,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 3365912576. Throughput: 0: 43995.7. Samples: 3268770620. Policy #0 lag: (min: 0.0, avg: 11.8, max: 24.0) [2024-06-28 10:23:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:23:15,834][06909] Updated weights for policy 0, policy_version 205443 (0.0036) [2024-06-28 10:23:18,850][06674] Fps is (10 sec: 42599.3, 60 sec: 43963.9, 300 sec: 44153.5). Total num frames: 3366109184. Throughput: 0: 44134.7. Samples: 3269038040. Policy #0 lag: (min: 0.0, avg: 11.8, max: 24.0) [2024-06-28 10:23:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:23:19,301][06909] Updated weights for policy 0, policy_version 205453 (0.0041) [2024-06-28 10:23:23,296][06909] Updated weights for policy 0, policy_version 205463 (0.0041) [2024-06-28 10:23:23,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 3366322176. Throughput: 0: 43895.6. Samples: 3269298740. Policy #0 lag: (min: 0.0, avg: 11.8, max: 24.0) [2024-06-28 10:23:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:23:26,789][06909] Updated weights for policy 0, policy_version 205473 (0.0032) [2024-06-28 10:23:28,852][06674] Fps is (10 sec: 44227.4, 60 sec: 43966.7, 300 sec: 44097.7). Total num frames: 3366551552. Throughput: 0: 43824.2. Samples: 3269429400. Policy #0 lag: (min: 0.0, avg: 11.8, max: 24.0) [2024-06-28 10:23:28,861][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:23:30,596][06909] Updated weights for policy 0, policy_version 205483 (0.0025) [2024-06-28 10:23:33,852][06674] Fps is (10 sec: 45865.9, 60 sec: 44235.2, 300 sec: 44153.2). Total num frames: 3366780928. Throughput: 0: 44136.3. Samples: 3269696920. Policy #0 lag: (min: 0.0, avg: 11.8, max: 24.0) [2024-06-28 10:23:33,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:23:34,202][06909] Updated weights for policy 0, policy_version 205493 (0.0036) [2024-06-28 10:23:37,831][06909] Updated weights for policy 0, policy_version 205503 (0.0031) [2024-06-28 10:23:38,850][06674] Fps is (10 sec: 45884.5, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 3367010304. Throughput: 0: 43859.0. Samples: 3269958340. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 10:23:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:23:41,882][06909] Updated weights for policy 0, policy_version 205513 (0.0035) [2024-06-28 10:23:43,850][06674] Fps is (10 sec: 44245.9, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 3367223296. Throughput: 0: 43881.5. Samples: 3270089340. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 10:23:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:23:45,335][06909] Updated weights for policy 0, policy_version 205523 (0.0025) [2024-06-28 10:23:48,850][06674] Fps is (10 sec: 40959.4, 60 sec: 43963.6, 300 sec: 44097.9). Total num frames: 3367419904. Throughput: 0: 44059.4. Samples: 3270355940. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 10:23:48,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:23:48,875][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000205532_3367436288.pth... [2024-06-28 10:23:48,922][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000204886_3356852224.pth [2024-06-28 10:23:49,155][06909] Updated weights for policy 0, policy_version 205533 (0.0028) [2024-06-28 10:23:52,960][06909] Updated weights for policy 0, policy_version 205543 (0.0032) [2024-06-28 10:23:53,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 3367649280. Throughput: 0: 43778.9. Samples: 3270614960. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 10:23:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:23:56,542][06909] Updated weights for policy 0, policy_version 205553 (0.0027) [2024-06-28 10:23:58,850][06674] Fps is (10 sec: 45876.2, 60 sec: 43963.9, 300 sec: 44098.0). Total num frames: 3367878656. Throughput: 0: 44001.0. Samples: 3270750660. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 10:23:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:24:00,139][06909] Updated weights for policy 0, policy_version 205563 (0.0029) [2024-06-28 10:24:03,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 3368091648. Throughput: 0: 43981.3. Samples: 3271017200. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 10:24:03,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 10:24:04,135][06909] Updated weights for policy 0, policy_version 205573 (0.0027) [2024-06-28 10:24:07,520][06909] Updated weights for policy 0, policy_version 205583 (0.0026) [2024-06-28 10:24:08,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.8, 300 sec: 44098.9). Total num frames: 3368321024. Throughput: 0: 43998.2. Samples: 3271278660. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 10:24:08,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 10:24:11,344][06909] Updated weights for policy 0, policy_version 205593 (0.0033) [2024-06-28 10:24:13,686][06887] Signal inference workers to stop experience collection... (46300 times) [2024-06-28 10:24:13,687][06887] Signal inference workers to resume experience collection... (46300 times) [2024-06-28 10:24:13,706][06909] InferenceWorker_p0-w0: stopping experience collection (46300 times) [2024-06-28 10:24:13,706][06909] InferenceWorker_p0-w0: resuming experience collection (46300 times) [2024-06-28 10:24:13,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 3368550400. Throughput: 0: 44001.1. Samples: 3271409360. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 10:24:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 10:24:14,769][06909] Updated weights for policy 0, policy_version 205603 (0.0032) [2024-06-28 10:24:18,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 3368747008. Throughput: 0: 44093.9. Samples: 3271681060. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 10:24:18,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:24:19,080][06909] Updated weights for policy 0, policy_version 205613 (0.0045) [2024-06-28 10:24:22,043][06909] Updated weights for policy 0, policy_version 205623 (0.0030) [2024-06-28 10:24:23,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 3368976384. Throughput: 0: 44017.4. Samples: 3271939120. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 10:24:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:24:26,437][06909] Updated weights for policy 0, policy_version 205633 (0.0049) [2024-06-28 10:24:28,850][06674] Fps is (10 sec: 45874.5, 60 sec: 44238.2, 300 sec: 44097.9). Total num frames: 3369205760. Throughput: 0: 44142.9. Samples: 3272075780. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 10:24:28,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 10:24:29,890][06909] Updated weights for policy 0, policy_version 205643 (0.0031) [2024-06-28 10:24:33,812][06909] Updated weights for policy 0, policy_version 205653 (0.0035) [2024-06-28 10:24:33,852][06674] Fps is (10 sec: 44227.5, 60 sec: 43963.7, 300 sec: 44097.7). Total num frames: 3369418752. Throughput: 0: 44049.7. Samples: 3272338260. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 10:24:33,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:24:37,326][06909] Updated weights for policy 0, policy_version 205663 (0.0031) [2024-06-28 10:24:38,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43690.6, 300 sec: 44097.9). Total num frames: 3369631744. Throughput: 0: 43971.4. Samples: 3272593680. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 10:24:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:24:41,310][06909] Updated weights for policy 0, policy_version 205673 (0.0042) [2024-06-28 10:24:43,850][06674] Fps is (10 sec: 42607.1, 60 sec: 43690.6, 300 sec: 44098.0). Total num frames: 3369844736. Throughput: 0: 43941.7. Samples: 3272728040. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 10:24:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:24:44,524][06909] Updated weights for policy 0, policy_version 205683 (0.0027) [2024-06-28 10:24:48,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43963.9, 300 sec: 44098.0). Total num frames: 3370057728. Throughput: 0: 43961.8. Samples: 3272995480. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 10:24:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 10:24:48,887][06909] Updated weights for policy 0, policy_version 205693 (0.0040) [2024-06-28 10:24:52,129][06909] Updated weights for policy 0, policy_version 205703 (0.0031) [2024-06-28 10:24:53,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.7, 300 sec: 44098.9). Total num frames: 3370287104. Throughput: 0: 43816.4. Samples: 3273250400. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 10:24:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:24:56,413][06909] Updated weights for policy 0, policy_version 205713 (0.0031) [2024-06-28 10:24:58,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 3370532864. Throughput: 0: 44005.8. Samples: 3273389620. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 10:24:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:24:59,713][06909] Updated weights for policy 0, policy_version 205723 (0.0029) [2024-06-28 10:25:03,717][06909] Updated weights for policy 0, policy_version 205733 (0.0029) [2024-06-28 10:25:03,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3370729472. Throughput: 0: 44008.5. Samples: 3273661440. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 10:25:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:25:07,042][06909] Updated weights for policy 0, policy_version 205743 (0.0023) [2024-06-28 10:25:08,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 3370942464. Throughput: 0: 43943.0. Samples: 3273916560. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 10:25:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:25:11,283][06909] Updated weights for policy 0, policy_version 205753 (0.0038) [2024-06-28 10:25:13,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.7, 300 sec: 44097.9). Total num frames: 3371171840. Throughput: 0: 43881.5. Samples: 3274050440. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 10:25:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:25:14,325][06909] Updated weights for policy 0, policy_version 205763 (0.0029) [2024-06-28 10:25:18,677][06909] Updated weights for policy 0, policy_version 205773 (0.0027) [2024-06-28 10:25:18,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 44043.3). Total num frames: 3371384832. Throughput: 0: 44033.9. Samples: 3274319700. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 10:25:18,856][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:25:21,542][06909] Updated weights for policy 0, policy_version 205783 (0.0036) [2024-06-28 10:25:23,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.6, 300 sec: 44098.0). Total num frames: 3371614208. Throughput: 0: 43980.4. Samples: 3274572800. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 10:25:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:25:26,255][06909] Updated weights for policy 0, policy_version 205793 (0.0041) [2024-06-28 10:25:28,856][06674] Fps is (10 sec: 47485.1, 60 sec: 44232.5, 300 sec: 44152.6). Total num frames: 3371859968. Throughput: 0: 44000.3. Samples: 3274708320. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 10:25:28,856][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:25:29,310][06909] Updated weights for policy 0, policy_version 205803 (0.0027) [2024-06-28 10:25:33,756][06909] Updated weights for policy 0, policy_version 205813 (0.0043) [2024-06-28 10:25:33,758][06887] Signal inference workers to stop experience collection... (46350 times) [2024-06-28 10:25:33,758][06887] Signal inference workers to resume experience collection... (46350 times) [2024-06-28 10:25:33,779][06909] InferenceWorker_p0-w0: stopping experience collection (46350 times) [2024-06-28 10:25:33,779][06909] InferenceWorker_p0-w0: resuming experience collection (46350 times) [2024-06-28 10:25:33,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43692.2, 300 sec: 43986.9). Total num frames: 3372040192. Throughput: 0: 44072.4. Samples: 3274978740. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 10:25:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:25:36,692][06909] Updated weights for policy 0, policy_version 205823 (0.0026) [2024-06-28 10:25:38,850][06674] Fps is (10 sec: 40984.8, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3372269568. Throughput: 0: 44050.6. Samples: 3275232680. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 10:25:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:25:41,301][06909] Updated weights for policy 0, policy_version 205833 (0.0023) [2024-06-28 10:25:43,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 3372498944. Throughput: 0: 43948.0. Samples: 3275367280. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 10:25:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:25:44,648][06909] Updated weights for policy 0, policy_version 205843 (0.0036) [2024-06-28 10:25:48,468][06909] Updated weights for policy 0, policy_version 205853 (0.0035) [2024-06-28 10:25:48,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43963.7, 300 sec: 43931.4). Total num frames: 3372695552. Throughput: 0: 43839.6. Samples: 3275634220. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 10:25:48,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 10:25:48,948][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000205854_3372711936.pth... [2024-06-28 10:25:49,019][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000205208_3362127872.pth [2024-06-28 10:25:51,888][06909] Updated weights for policy 0, policy_version 205863 (0.0027) [2024-06-28 10:25:53,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3372924928. Throughput: 0: 43957.4. Samples: 3275894640. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 10:25:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 10:25:56,195][06909] Updated weights for policy 0, policy_version 205873 (0.0035) [2024-06-28 10:25:58,850][06674] Fps is (10 sec: 45874.2, 60 sec: 43690.5, 300 sec: 44042.4). Total num frames: 3373154304. Throughput: 0: 43787.9. Samples: 3276020900. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 10:25:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 10:25:59,413][06909] Updated weights for policy 0, policy_version 205883 (0.0032) [2024-06-28 10:26:03,850][06909] Updated weights for policy 0, policy_version 205893 (0.0036) [2024-06-28 10:26:03,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 3373350912. Throughput: 0: 43767.2. Samples: 3276289220. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 10:26:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:26:06,712][06909] Updated weights for policy 0, policy_version 205903 (0.0024) [2024-06-28 10:26:08,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3373580288. Throughput: 0: 44126.3. Samples: 3276558480. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 10:26:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:26:11,046][06909] Updated weights for policy 0, policy_version 205913 (0.0028) [2024-06-28 10:26:13,850][06674] Fps is (10 sec: 47513.3, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 3373826048. Throughput: 0: 44101.1. Samples: 3276692600. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 10:26:13,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:26:14,239][06909] Updated weights for policy 0, policy_version 205923 (0.0046) [2024-06-28 10:26:18,323][06909] Updated weights for policy 0, policy_version 205933 (0.0030) [2024-06-28 10:26:18,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3374039040. Throughput: 0: 44049.7. Samples: 3276960980. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 10:26:18,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:26:21,761][06909] Updated weights for policy 0, policy_version 205943 (0.0020) [2024-06-28 10:26:23,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43963.9, 300 sec: 44042.4). Total num frames: 3374252032. Throughput: 0: 44238.4. Samples: 3277223400. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 10:26:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:26:25,721][06909] Updated weights for policy 0, policy_version 205953 (0.0036) [2024-06-28 10:26:28,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43695.1, 300 sec: 44098.0). Total num frames: 3374481408. Throughput: 0: 44118.2. Samples: 3277352600. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 10:26:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:26:29,023][06909] Updated weights for policy 0, policy_version 205963 (0.0028) [2024-06-28 10:26:32,999][06909] Updated weights for policy 0, policy_version 205973 (0.0037) [2024-06-28 10:26:33,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44236.8, 300 sec: 43987.2). Total num frames: 3374694400. Throughput: 0: 44129.7. Samples: 3277620060. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 10:26:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 10:26:36,498][06909] Updated weights for policy 0, policy_version 205983 (0.0045) [2024-06-28 10:26:38,856][06674] Fps is (10 sec: 42572.6, 60 sec: 43959.3, 300 sec: 43986.0). Total num frames: 3374907392. Throughput: 0: 44096.7. Samples: 3277879260. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 10:26:38,856][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:26:40,635][06909] Updated weights for policy 0, policy_version 205993 (0.0028) [2024-06-28 10:26:43,727][06909] Updated weights for policy 0, policy_version 206003 (0.0032) [2024-06-28 10:26:43,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 3375153152. Throughput: 0: 44170.3. Samples: 3278008560. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 10:26:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:26:48,085][06909] Updated weights for policy 0, policy_version 206013 (0.0044) [2024-06-28 10:26:48,850][06674] Fps is (10 sec: 45902.3, 60 sec: 44509.7, 300 sec: 44042.4). Total num frames: 3375366144. Throughput: 0: 44348.7. Samples: 3278284920. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 10:26:48,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:26:51,177][06909] Updated weights for policy 0, policy_version 206023 (0.0028) [2024-06-28 10:26:53,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3375579136. Throughput: 0: 44133.7. Samples: 3278544500. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 10:26:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:26:55,295][06909] Updated weights for policy 0, policy_version 206033 (0.0032) [2024-06-28 10:26:58,698][06909] Updated weights for policy 0, policy_version 206043 (0.0031) [2024-06-28 10:26:58,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 3375808512. Throughput: 0: 44120.3. Samples: 3278678020. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 10:26:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:27:02,537][06909] Updated weights for policy 0, policy_version 206053 (0.0029) [2024-06-28 10:27:03,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44509.7, 300 sec: 43987.2). Total num frames: 3376021504. Throughput: 0: 44008.4. Samples: 3278941360. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 10:27:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:27:06,044][06909] Updated weights for policy 0, policy_version 206063 (0.0021) [2024-06-28 10:27:08,850][06674] Fps is (10 sec: 40960.8, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 3376218112. Throughput: 0: 43924.4. Samples: 3279200000. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 10:27:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:27:10,224][06909] Updated weights for policy 0, policy_version 206073 (0.0034) [2024-06-28 10:27:13,447][06909] Updated weights for policy 0, policy_version 206083 (0.0032) [2024-06-28 10:27:13,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3376463872. Throughput: 0: 43931.9. Samples: 3279329540. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 10:27:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:27:17,802][06909] Updated weights for policy 0, policy_version 206093 (0.0029) [2024-06-28 10:27:18,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3376676864. Throughput: 0: 44039.2. Samples: 3279601820. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 10:27:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:27:20,331][06887] Signal inference workers to stop experience collection... (46400 times) [2024-06-28 10:27:20,337][06887] Signal inference workers to resume experience collection... (46400 times) [2024-06-28 10:27:20,358][06909] InferenceWorker_p0-w0: stopping experience collection (46400 times) [2024-06-28 10:27:20,358][06909] InferenceWorker_p0-w0: resuming experience collection (46400 times) [2024-06-28 10:27:20,629][06909] Updated weights for policy 0, policy_version 206103 (0.0024) [2024-06-28 10:27:23,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.7, 300 sec: 44043.3). Total num frames: 3376906240. Throughput: 0: 44112.5. Samples: 3279864060. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 10:27:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:27:25,026][06909] Updated weights for policy 0, policy_version 206113 (0.0042) [2024-06-28 10:27:28,262][06909] Updated weights for policy 0, policy_version 206123 (0.0035) [2024-06-28 10:27:28,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 3377135616. Throughput: 0: 44214.3. Samples: 3279998200. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 10:27:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:27:32,536][06909] Updated weights for policy 0, policy_version 206133 (0.0035) [2024-06-28 10:27:33,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3377348608. Throughput: 0: 43980.1. Samples: 3280264020. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 10:27:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:27:36,177][06909] Updated weights for policy 0, policy_version 206143 (0.0024) [2024-06-28 10:27:38,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44241.3, 300 sec: 44042.4). Total num frames: 3377561600. Throughput: 0: 43908.1. Samples: 3280520360. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 10:27:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:27:39,781][06909] Updated weights for policy 0, policy_version 206153 (0.0035) [2024-06-28 10:27:43,476][06909] Updated weights for policy 0, policy_version 206163 (0.0038) [2024-06-28 10:27:43,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 3377774592. Throughput: 0: 43902.3. Samples: 3280653620. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 10:27:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:27:47,101][06909] Updated weights for policy 0, policy_version 206173 (0.0035) [2024-06-28 10:27:48,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3378003968. Throughput: 0: 43861.4. Samples: 3280915120. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 10:27:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:27:48,863][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000206177_3378003968.pth... [2024-06-28 10:27:48,921][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000205532_3367436288.pth [2024-06-28 10:27:50,782][06909] Updated weights for policy 0, policy_version 206183 (0.0036) [2024-06-28 10:27:53,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.7, 300 sec: 43931.4). Total num frames: 3378200576. Throughput: 0: 43974.6. Samples: 3281178860. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 10:27:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:27:54,783][06909] Updated weights for policy 0, policy_version 206193 (0.0023) [2024-06-28 10:27:58,269][06909] Updated weights for policy 0, policy_version 206203 (0.0025) [2024-06-28 10:27:58,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3378429952. Throughput: 0: 44064.9. Samples: 3281312460. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 10:27:58,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 10:28:02,322][06909] Updated weights for policy 0, policy_version 206213 (0.0026) [2024-06-28 10:28:03,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 3378675712. Throughput: 0: 44030.7. Samples: 3281583200. Policy #0 lag: (min: 0.0, avg: 11.0, max: 22.0) [2024-06-28 10:28:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:28:05,941][06909] Updated weights for policy 0, policy_version 206223 (0.0037) [2024-06-28 10:28:08,852][06674] Fps is (10 sec: 44227.9, 60 sec: 44235.2, 300 sec: 43931.0). Total num frames: 3378872320. Throughput: 0: 43998.5. Samples: 3281844080. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 10:28:08,853][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:28:09,639][06909] Updated weights for policy 0, policy_version 206233 (0.0027) [2024-06-28 10:28:13,497][06909] Updated weights for policy 0, policy_version 206243 (0.0033) [2024-06-28 10:28:13,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3379085312. Throughput: 0: 43950.2. Samples: 3281975960. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 10:28:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:28:16,969][06909] Updated weights for policy 0, policy_version 206253 (0.0029) [2024-06-28 10:28:18,850][06674] Fps is (10 sec: 45884.6, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 3379331072. Throughput: 0: 43861.4. Samples: 3282237780. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 10:28:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:28:20,918][06909] Updated weights for policy 0, policy_version 206263 (0.0028) [2024-06-28 10:28:23,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43690.8, 300 sec: 43987.2). Total num frames: 3379527680. Throughput: 0: 44004.5. Samples: 3282500560. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 10:28:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:28:24,247][06909] Updated weights for policy 0, policy_version 206273 (0.0031) [2024-06-28 10:28:28,304][06909] Updated weights for policy 0, policy_version 206283 (0.0034) [2024-06-28 10:28:28,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43690.6, 300 sec: 43987.2). Total num frames: 3379757056. Throughput: 0: 44040.8. Samples: 3282635460. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 10:28:28,859][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:28:32,258][06909] Updated weights for policy 0, policy_version 206293 (0.0038) [2024-06-28 10:28:33,850][06674] Fps is (10 sec: 45874.2, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3379986432. Throughput: 0: 44191.9. Samples: 3282903760. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 10:28:33,850][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 10:28:36,335][06909] Updated weights for policy 0, policy_version 206303 (0.0028) [2024-06-28 10:28:38,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3380199424. Throughput: 0: 44155.5. Samples: 3283165860. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 10:28:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:28:39,565][06909] Updated weights for policy 0, policy_version 206313 (0.0028) [2024-06-28 10:28:39,595][06887] Signal inference workers to stop experience collection... (46450 times) [2024-06-28 10:28:39,600][06887] Signal inference workers to resume experience collection... (46450 times) [2024-06-28 10:28:39,644][06909] InferenceWorker_p0-w0: stopping experience collection (46450 times) [2024-06-28 10:28:39,644][06909] InferenceWorker_p0-w0: resuming experience collection (46450 times) [2024-06-28 10:28:43,623][06909] Updated weights for policy 0, policy_version 206323 (0.0023) [2024-06-28 10:28:43,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 3380396032. Throughput: 0: 44056.4. Samples: 3283295000. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 10:28:43,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:28:46,795][06909] Updated weights for policy 0, policy_version 206333 (0.0035) [2024-06-28 10:28:48,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3380641792. Throughput: 0: 43708.0. Samples: 3283550060. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 10:28:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:28:51,068][06909] Updated weights for policy 0, policy_version 206343 (0.0031) [2024-06-28 10:28:53,850][06674] Fps is (10 sec: 45875.7, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3380854784. Throughput: 0: 43924.6. Samples: 3283820600. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 10:28:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:28:54,108][06909] Updated weights for policy 0, policy_version 206353 (0.0031) [2024-06-28 10:28:58,850][06674] Fps is (10 sec: 39320.8, 60 sec: 43417.5, 300 sec: 43875.8). Total num frames: 3381035008. Throughput: 0: 43930.5. Samples: 3283952840. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 10:28:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:28:58,881][06909] Updated weights for policy 0, policy_version 206363 (0.0029) [2024-06-28 10:29:01,625][06909] Updated weights for policy 0, policy_version 206373 (0.0036) [2024-06-28 10:29:03,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 3381297152. Throughput: 0: 43891.4. Samples: 3284212900. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 10:29:03,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 10:29:06,067][06909] Updated weights for policy 0, policy_version 206383 (0.0028) [2024-06-28 10:29:08,850][06674] Fps is (10 sec: 49152.8, 60 sec: 44238.3, 300 sec: 43986.9). Total num frames: 3381526528. Throughput: 0: 44051.9. Samples: 3284482900. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 10:29:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:29:09,327][06909] Updated weights for policy 0, policy_version 206393 (0.0029) [2024-06-28 10:29:13,679][06909] Updated weights for policy 0, policy_version 206403 (0.0035) [2024-06-28 10:29:13,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 3381706752. Throughput: 0: 43865.9. Samples: 3284609420. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 10:29:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:29:16,706][06909] Updated weights for policy 0, policy_version 206413 (0.0029) [2024-06-28 10:29:18,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 3381952512. Throughput: 0: 43725.0. Samples: 3284871380. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 10:29:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:29:20,931][06909] Updated weights for policy 0, policy_version 206423 (0.0031) [2024-06-28 10:29:23,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 3382181888. Throughput: 0: 43825.3. Samples: 3285138000. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 10:29:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:29:24,154][06909] Updated weights for policy 0, policy_version 206433 (0.0034) [2024-06-28 10:29:28,152][06909] Updated weights for policy 0, policy_version 206443 (0.0033) [2024-06-28 10:29:28,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.8, 300 sec: 43987.2). Total num frames: 3382394880. Throughput: 0: 43999.6. Samples: 3285274980. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 10:29:28,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 10:29:31,390][06909] Updated weights for policy 0, policy_version 206453 (0.0027) [2024-06-28 10:29:33,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3382624256. Throughput: 0: 44135.0. Samples: 3285536140. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 10:29:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:29:35,477][06909] Updated weights for policy 0, policy_version 206463 (0.0031) [2024-06-28 10:29:38,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3382837248. Throughput: 0: 44093.7. Samples: 3285804820. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 10:29:38,853][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:29:38,930][06909] Updated weights for policy 0, policy_version 206473 (0.0035) [2024-06-28 10:29:42,806][06909] Updated weights for policy 0, policy_version 206483 (0.0042) [2024-06-28 10:29:43,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44509.9, 300 sec: 44097.9). Total num frames: 3383066624. Throughput: 0: 44069.9. Samples: 3285935980. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 10:29:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:29:46,443][06909] Updated weights for policy 0, policy_version 206493 (0.0021) [2024-06-28 10:29:48,850][06674] Fps is (10 sec: 47513.9, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 3383312384. Throughput: 0: 44181.9. Samples: 3286201080. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 10:29:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:29:48,872][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000206501_3383312384.pth... [2024-06-28 10:29:48,920][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000205854_3372711936.pth [2024-06-28 10:29:49,943][06909] Updated weights for policy 0, policy_version 206503 (0.0031) [2024-06-28 10:29:53,802][06909] Updated weights for policy 0, policy_version 206513 (0.0033) [2024-06-28 10:29:53,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3383508992. Throughput: 0: 44269.3. Samples: 3286475020. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 10:29:53,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 10:29:57,425][06909] Updated weights for policy 0, policy_version 206523 (0.0026) [2024-06-28 10:29:58,850][06674] Fps is (10 sec: 40960.0, 60 sec: 44783.1, 300 sec: 44042.4). Total num frames: 3383721984. Throughput: 0: 44341.8. Samples: 3286604800. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 10:29:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:30:01,127][06909] Updated weights for policy 0, policy_version 206533 (0.0030) [2024-06-28 10:30:03,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 3383951360. Throughput: 0: 44278.6. Samples: 3286863920. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 10:30:03,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:30:04,975][06909] Updated weights for policy 0, policy_version 206543 (0.0035) [2024-06-28 10:30:08,460][06909] Updated weights for policy 0, policy_version 206553 (0.0036) [2024-06-28 10:30:08,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3384164352. Throughput: 0: 44283.1. Samples: 3287130740. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 10:30:08,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 10:30:12,182][06909] Updated weights for policy 0, policy_version 206563 (0.0029) [2024-06-28 10:30:13,682][06887] Signal inference workers to stop experience collection... (46500 times) [2024-06-28 10:30:13,689][06887] Signal inference workers to resume experience collection... (46500 times) [2024-06-28 10:30:13,702][06909] InferenceWorker_p0-w0: stopping experience collection (46500 times) [2024-06-28 10:30:13,739][06909] InferenceWorker_p0-w0: resuming experience collection (46500 times) [2024-06-28 10:30:13,852][06674] Fps is (10 sec: 44228.4, 60 sec: 44781.4, 300 sec: 44097.7). Total num frames: 3384393728. Throughput: 0: 44142.9. Samples: 3287261500. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 10:30:13,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:30:15,968][06909] Updated weights for policy 0, policy_version 206573 (0.0021) [2024-06-28 10:30:18,856][06674] Fps is (10 sec: 44210.1, 60 sec: 44232.4, 300 sec: 44041.5). Total num frames: 3384606720. Throughput: 0: 44086.2. Samples: 3287520280. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 10:30:18,857][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:30:19,578][06909] Updated weights for policy 0, policy_version 206583 (0.0032) [2024-06-28 10:30:23,676][06909] Updated weights for policy 0, policy_version 206593 (0.0028) [2024-06-28 10:30:23,850][06674] Fps is (10 sec: 42607.4, 60 sec: 43963.8, 300 sec: 43932.3). Total num frames: 3384819712. Throughput: 0: 44100.1. Samples: 3287789320. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 10:30:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:30:26,816][06909] Updated weights for policy 0, policy_version 206603 (0.0024) [2024-06-28 10:30:28,850][06674] Fps is (10 sec: 40985.1, 60 sec: 43690.8, 300 sec: 43986.9). Total num frames: 3385016320. Throughput: 0: 43977.4. Samples: 3287914960. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 10:30:28,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:30:31,122][06909] Updated weights for policy 0, policy_version 206613 (0.0028) [2024-06-28 10:30:33,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 3385278464. Throughput: 0: 43968.5. Samples: 3288179660. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 10:30:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:30:34,242][06909] Updated weights for policy 0, policy_version 206623 (0.0032) [2024-06-28 10:30:38,493][06909] Updated weights for policy 0, policy_version 206633 (0.0038) [2024-06-28 10:30:38,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3385475072. Throughput: 0: 43801.0. Samples: 3288446060. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 10:30:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:30:41,911][06909] Updated weights for policy 0, policy_version 206643 (0.0031) [2024-06-28 10:30:43,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 3385688064. Throughput: 0: 43809.8. Samples: 3288576240. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 10:30:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:30:46,194][06909] Updated weights for policy 0, policy_version 206653 (0.0036) [2024-06-28 10:30:48,850][06674] Fps is (10 sec: 47513.9, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 3385950208. Throughput: 0: 43876.2. Samples: 3288838340. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 10:30:48,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 10:30:49,151][06909] Updated weights for policy 0, policy_version 206663 (0.0024) [2024-06-28 10:30:53,425][06909] Updated weights for policy 0, policy_version 206673 (0.0027) [2024-06-28 10:30:53,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3386146816. Throughput: 0: 43972.0. Samples: 3289109480. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 10:30:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:30:56,834][06909] Updated weights for policy 0, policy_version 206683 (0.0031) [2024-06-28 10:30:58,850][06674] Fps is (10 sec: 37682.9, 60 sec: 43417.6, 300 sec: 43986.9). Total num frames: 3386327040. Throughput: 0: 43838.4. Samples: 3289234140. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 10:30:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:31:00,873][06909] Updated weights for policy 0, policy_version 206693 (0.0027) [2024-06-28 10:31:03,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 3386605568. Throughput: 0: 44082.8. Samples: 3289503740. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 10:31:03,850][06674] Avg episode reward: [(0, '0.440')] [2024-06-28 10:31:04,065][06909] Updated weights for policy 0, policy_version 206703 (0.0031) [2024-06-28 10:31:08,318][06909] Updated weights for policy 0, policy_version 206713 (0.0029) [2024-06-28 10:31:08,850][06674] Fps is (10 sec: 49151.9, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3386818560. Throughput: 0: 43941.3. Samples: 3289766680. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 10:31:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:31:11,490][06909] Updated weights for policy 0, policy_version 206723 (0.0027) [2024-06-28 10:31:13,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43692.1, 300 sec: 43986.9). Total num frames: 3387015168. Throughput: 0: 43913.7. Samples: 3289891080. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 10:31:13,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 10:31:15,924][06909] Updated weights for policy 0, policy_version 206733 (0.0041) [2024-06-28 10:31:18,850][06674] Fps is (10 sec: 44236.0, 60 sec: 44241.1, 300 sec: 44097.9). Total num frames: 3387260928. Throughput: 0: 44030.0. Samples: 3290161020. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 10:31:18,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 10:31:19,243][06909] Updated weights for policy 0, policy_version 206743 (0.0027) [2024-06-28 10:31:23,306][06909] Updated weights for policy 0, policy_version 206753 (0.0033) [2024-06-28 10:31:23,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3387457536. Throughput: 0: 43975.1. Samples: 3290424940. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 10:31:23,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 10:31:26,792][06909] Updated weights for policy 0, policy_version 206763 (0.0033) [2024-06-28 10:31:28,266][06887] Signal inference workers to stop experience collection... (46550 times) [2024-06-28 10:31:28,266][06887] Signal inference workers to resume experience collection... (46550 times) [2024-06-28 10:31:28,294][06909] InferenceWorker_p0-w0: stopping experience collection (46550 times) [2024-06-28 10:31:28,295][06909] InferenceWorker_p0-w0: resuming experience collection (46550 times) [2024-06-28 10:31:28,850][06674] Fps is (10 sec: 40960.4, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 3387670528. Throughput: 0: 43983.0. Samples: 3290555480. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 10:31:28,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:31:30,675][06909] Updated weights for policy 0, policy_version 206773 (0.0032) [2024-06-28 10:31:33,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.7, 300 sec: 44098.9). Total num frames: 3387916288. Throughput: 0: 44164.0. Samples: 3290825720. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 10:31:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:31:33,954][06909] Updated weights for policy 0, policy_version 206783 (0.0033) [2024-06-28 10:31:38,193][06909] Updated weights for policy 0, policy_version 206793 (0.0032) [2024-06-28 10:31:38,850][06674] Fps is (10 sec: 47513.8, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 3388145664. Throughput: 0: 43962.6. Samples: 3291087800. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 10:31:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:31:41,210][06909] Updated weights for policy 0, policy_version 206803 (0.0038) [2024-06-28 10:31:43,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43963.8, 300 sec: 43931.4). Total num frames: 3388325888. Throughput: 0: 44014.7. Samples: 3291214800. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 10:31:43,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 10:31:45,654][06909] Updated weights for policy 0, policy_version 206813 (0.0037) [2024-06-28 10:31:48,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 3388571648. Throughput: 0: 43808.5. Samples: 3291475120. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 10:31:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:31:48,861][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000206822_3388571648.pth... [2024-06-28 10:31:48,909][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000206177_3378003968.pth [2024-06-28 10:31:49,152][06909] Updated weights for policy 0, policy_version 206823 (0.0027) [2024-06-28 10:31:53,197][06909] Updated weights for policy 0, policy_version 206833 (0.0038) [2024-06-28 10:31:53,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3388784640. Throughput: 0: 43973.3. Samples: 3291745480. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 10:31:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:31:56,404][06909] Updated weights for policy 0, policy_version 206843 (0.0022) [2024-06-28 10:31:58,850][06674] Fps is (10 sec: 40960.0, 60 sec: 44236.9, 300 sec: 43931.4). Total num frames: 3388981248. Throughput: 0: 44009.9. Samples: 3291871520. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 10:31:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:32:00,448][06909] Updated weights for policy 0, policy_version 206853 (0.0039) [2024-06-28 10:32:03,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43690.7, 300 sec: 44097.9). Total num frames: 3389227008. Throughput: 0: 44008.7. Samples: 3292141400. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 10:32:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:32:03,853][06909] Updated weights for policy 0, policy_version 206863 (0.0024) [2024-06-28 10:32:07,843][06909] Updated weights for policy 0, policy_version 206873 (0.0033) [2024-06-28 10:32:08,850][06674] Fps is (10 sec: 47513.2, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3389456384. Throughput: 0: 44017.7. Samples: 3292405740. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 10:32:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:32:11,080][06909] Updated weights for policy 0, policy_version 206883 (0.0028) [2024-06-28 10:32:13,852][06674] Fps is (10 sec: 40951.3, 60 sec: 43689.2, 300 sec: 43931.0). Total num frames: 3389636608. Throughput: 0: 44042.1. Samples: 3292537460. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 10:32:13,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:32:15,421][06909] Updated weights for policy 0, policy_version 206893 (0.0031) [2024-06-28 10:32:18,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.9, 300 sec: 43986.9). Total num frames: 3389882368. Throughput: 0: 43914.2. Samples: 3292801860. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 10:32:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:32:18,874][06909] Updated weights for policy 0, policy_version 206903 (0.0029) [2024-06-28 10:32:22,897][06909] Updated weights for policy 0, policy_version 206913 (0.0038) [2024-06-28 10:32:23,850][06674] Fps is (10 sec: 47523.0, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 3390111744. Throughput: 0: 43880.9. Samples: 3293062440. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 10:32:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:32:26,106][06909] Updated weights for policy 0, policy_version 206923 (0.0023) [2024-06-28 10:32:28,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 3390308352. Throughput: 0: 44012.4. Samples: 3293195360. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 10:32:28,850][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 10:32:30,091][06909] Updated weights for policy 0, policy_version 206933 (0.0033) [2024-06-28 10:32:33,634][06909] Updated weights for policy 0, policy_version 206943 (0.0041) [2024-06-28 10:32:33,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 3390554112. Throughput: 0: 44212.8. Samples: 3293464700. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 10:32:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:32:37,323][06909] Updated weights for policy 0, policy_version 206953 (0.0032) [2024-06-28 10:32:38,852][06674] Fps is (10 sec: 45865.9, 60 sec: 43689.2, 300 sec: 44042.1). Total num frames: 3390767104. Throughput: 0: 44075.8. Samples: 3293728980. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-28 10:32:38,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:32:41,045][06909] Updated weights for policy 0, policy_version 206963 (0.0028) [2024-06-28 10:32:43,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43963.6, 300 sec: 43931.3). Total num frames: 3390963712. Throughput: 0: 44243.4. Samples: 3293862480. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-28 10:32:43,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 10:32:44,956][06909] Updated weights for policy 0, policy_version 206973 (0.0028) [2024-06-28 10:32:48,322][06909] Updated weights for policy 0, policy_version 206983 (0.0036) [2024-06-28 10:32:48,850][06674] Fps is (10 sec: 45884.5, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 3391225856. Throughput: 0: 43961.7. Samples: 3294119680. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-28 10:32:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:32:52,292][06887] Signal inference workers to stop experience collection... (46600 times) [2024-06-28 10:32:52,342][06887] Signal inference workers to resume experience collection... (46600 times) [2024-06-28 10:32:52,343][06909] InferenceWorker_p0-w0: stopping experience collection (46600 times) [2024-06-28 10:32:52,377][06909] InferenceWorker_p0-w0: resuming experience collection (46600 times) [2024-06-28 10:32:52,481][06909] Updated weights for policy 0, policy_version 206993 (0.0032) [2024-06-28 10:32:53,850][06674] Fps is (10 sec: 47513.8, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 3391438848. Throughput: 0: 43871.0. Samples: 3294379940. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-28 10:32:53,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 10:32:56,224][06909] Updated weights for policy 0, policy_version 207003 (0.0032) [2024-06-28 10:32:58,850][06674] Fps is (10 sec: 39321.8, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 3391619072. Throughput: 0: 43919.4. Samples: 3294513740. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-28 10:32:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:32:59,908][06909] Updated weights for policy 0, policy_version 207013 (0.0026) [2024-06-28 10:33:03,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43690.5, 300 sec: 43987.1). Total num frames: 3391848448. Throughput: 0: 43926.4. Samples: 3294778560. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-28 10:33:03,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:33:03,861][06909] Updated weights for policy 0, policy_version 207023 (0.0041) [2024-06-28 10:33:07,446][06909] Updated weights for policy 0, policy_version 207033 (0.0037) [2024-06-28 10:33:08,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 3392077824. Throughput: 0: 44008.0. Samples: 3295042800. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-28 10:33:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:33:11,281][06909] Updated weights for policy 0, policy_version 207043 (0.0032) [2024-06-28 10:33:13,850][06674] Fps is (10 sec: 42599.3, 60 sec: 43965.3, 300 sec: 43875.8). Total num frames: 3392274432. Throughput: 0: 44050.3. Samples: 3295177620. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-28 10:33:13,850][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 10:33:14,842][06909] Updated weights for policy 0, policy_version 207053 (0.0022) [2024-06-28 10:33:18,567][06909] Updated weights for policy 0, policy_version 207063 (0.0028) [2024-06-28 10:33:18,850][06674] Fps is (10 sec: 45875.7, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 3392536576. Throughput: 0: 43872.5. Samples: 3295438960. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-28 10:33:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:33:22,340][06909] Updated weights for policy 0, policy_version 207073 (0.0029) [2024-06-28 10:33:23,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43690.8, 300 sec: 43986.9). Total num frames: 3392733184. Throughput: 0: 43828.3. Samples: 3295701160. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-28 10:33:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:33:25,983][06909] Updated weights for policy 0, policy_version 207083 (0.0033) [2024-06-28 10:33:28,850][06674] Fps is (10 sec: 37682.9, 60 sec: 43417.6, 300 sec: 43820.3). Total num frames: 3392913408. Throughput: 0: 43765.8. Samples: 3295831940. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-28 10:33:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:33:29,908][06909] Updated weights for policy 0, policy_version 207093 (0.0038) [2024-06-28 10:33:33,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43417.6, 300 sec: 43931.3). Total num frames: 3393159168. Throughput: 0: 43819.1. Samples: 3296091540. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-28 10:33:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:33:33,890][06909] Updated weights for policy 0, policy_version 207103 (0.0031) [2024-06-28 10:33:37,439][06909] Updated weights for policy 0, policy_version 207113 (0.0033) [2024-06-28 10:33:38,850][06674] Fps is (10 sec: 47513.3, 60 sec: 43692.1, 300 sec: 44042.4). Total num frames: 3393388544. Throughput: 0: 43751.1. Samples: 3296348740. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-28 10:33:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:33:41,268][06909] Updated weights for policy 0, policy_version 207123 (0.0026) [2024-06-28 10:33:43,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 3393585152. Throughput: 0: 43795.5. Samples: 3296484540. Policy #0 lag: (min: 0.0, avg: 8.1, max: 21.0) [2024-06-28 10:33:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:33:44,902][06909] Updated weights for policy 0, policy_version 207133 (0.0032) [2024-06-28 10:33:48,681][06909] Updated weights for policy 0, policy_version 207143 (0.0039) [2024-06-28 10:33:48,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43417.6, 300 sec: 43986.9). Total num frames: 3393830912. Throughput: 0: 43756.2. Samples: 3296747580. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 10:33:48,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 10:33:48,865][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000207143_3393830912.pth... [2024-06-28 10:33:48,915][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000206501_3383312384.pth [2024-06-28 10:33:52,400][06909] Updated weights for policy 0, policy_version 207153 (0.0032) [2024-06-28 10:33:53,856][06674] Fps is (10 sec: 47484.8, 60 sec: 43686.3, 300 sec: 44152.6). Total num frames: 3394060288. Throughput: 0: 43632.4. Samples: 3297006520. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 10:33:53,857][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:33:55,944][06909] Updated weights for policy 0, policy_version 207163 (0.0029) [2024-06-28 10:33:58,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 3394240512. Throughput: 0: 43675.1. Samples: 3297143000. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 10:33:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:33:59,767][06909] Updated weights for policy 0, policy_version 207173 (0.0030) [2024-06-28 10:34:03,850][06674] Fps is (10 sec: 40984.7, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 3394469888. Throughput: 0: 43599.4. Samples: 3297400940. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 10:34:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:34:03,917][06909] Updated weights for policy 0, policy_version 207183 (0.0038) [2024-06-28 10:34:07,229][06909] Updated weights for policy 0, policy_version 207193 (0.0040) [2024-06-28 10:34:08,850][06674] Fps is (10 sec: 47513.8, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 3394715648. Throughput: 0: 43506.6. Samples: 3297658960. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 10:34:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:34:11,320][06909] Updated weights for policy 0, policy_version 207203 (0.0037) [2024-06-28 10:34:13,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 3394912256. Throughput: 0: 43743.6. Samples: 3297800400. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 10:34:13,850][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 10:34:14,719][06909] Updated weights for policy 0, policy_version 207213 (0.0026) [2024-06-28 10:34:18,709][06909] Updated weights for policy 0, policy_version 207223 (0.0040) [2024-06-28 10:34:18,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43417.6, 300 sec: 43931.3). Total num frames: 3395141632. Throughput: 0: 43811.6. Samples: 3298063060. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 10:34:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:34:22,226][06909] Updated weights for policy 0, policy_version 207233 (0.0023) [2024-06-28 10:34:22,434][06887] Signal inference workers to stop experience collection... (46650 times) [2024-06-28 10:34:22,481][06909] InferenceWorker_p0-w0: stopping experience collection (46650 times) [2024-06-28 10:34:22,544][06887] Signal inference workers to resume experience collection... (46650 times) [2024-06-28 10:34:22,544][06909] InferenceWorker_p0-w0: resuming experience collection (46650 times) [2024-06-28 10:34:23,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 3395371008. Throughput: 0: 43783.6. Samples: 3298319000. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 10:34:23,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:34:26,273][06909] Updated weights for policy 0, policy_version 207243 (0.0032) [2024-06-28 10:34:28,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44236.9, 300 sec: 43875.8). Total num frames: 3395567616. Throughput: 0: 43789.9. Samples: 3298455080. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 10:34:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:34:29,580][06909] Updated weights for policy 0, policy_version 207253 (0.0038) [2024-06-28 10:34:33,726][06909] Updated weights for policy 0, policy_version 207263 (0.0023) [2024-06-28 10:34:33,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 3395796992. Throughput: 0: 43745.8. Samples: 3298716140. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 10:34:33,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 10:34:37,152][06909] Updated weights for policy 0, policy_version 207273 (0.0028) [2024-06-28 10:34:38,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.8, 300 sec: 43875.8). Total num frames: 3396009984. Throughput: 0: 43808.2. Samples: 3298977620. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 10:34:38,850][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 10:34:41,517][06909] Updated weights for policy 0, policy_version 207283 (0.0026) [2024-06-28 10:34:43,852][06674] Fps is (10 sec: 44227.6, 60 sec: 44235.3, 300 sec: 43820.0). Total num frames: 3396239360. Throughput: 0: 43847.4. Samples: 3299116220. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 10:34:43,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:34:44,802][06909] Updated weights for policy 0, policy_version 207293 (0.0035) [2024-06-28 10:34:48,742][06909] Updated weights for policy 0, policy_version 207303 (0.0029) [2024-06-28 10:34:48,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 3396452352. Throughput: 0: 43936.9. Samples: 3299378100. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 10:34:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:34:52,145][06909] Updated weights for policy 0, policy_version 207313 (0.0032) [2024-06-28 10:34:53,850][06674] Fps is (10 sec: 44245.3, 60 sec: 43695.0, 300 sec: 43931.3). Total num frames: 3396681728. Throughput: 0: 43939.0. Samples: 3299636220. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 10:34:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:34:55,959][06909] Updated weights for policy 0, policy_version 207323 (0.0036) [2024-06-28 10:34:58,850][06674] Fps is (10 sec: 44237.3, 60 sec: 44236.8, 300 sec: 43875.8). Total num frames: 3396894720. Throughput: 0: 43873.8. Samples: 3299774720. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 10:34:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 10:34:59,398][06909] Updated weights for policy 0, policy_version 207333 (0.0029) [2024-06-28 10:35:03,322][06909] Updated weights for policy 0, policy_version 207343 (0.0025) [2024-06-28 10:35:03,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 3397107712. Throughput: 0: 43841.2. Samples: 3300035920. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 10:35:03,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 10:35:06,734][06909] Updated weights for policy 0, policy_version 207353 (0.0035) [2024-06-28 10:35:08,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43690.6, 300 sec: 43876.1). Total num frames: 3397337088. Throughput: 0: 44098.3. Samples: 3300303420. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 10:35:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:35:11,530][06909] Updated weights for policy 0, policy_version 207363 (0.0031) [2024-06-28 10:35:13,850][06674] Fps is (10 sec: 45876.1, 60 sec: 44236.9, 300 sec: 43932.3). Total num frames: 3397566464. Throughput: 0: 44145.9. Samples: 3300441640. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 10:35:13,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 10:35:14,281][06909] Updated weights for policy 0, policy_version 207373 (0.0034) [2024-06-28 10:35:18,753][06909] Updated weights for policy 0, policy_version 207383 (0.0032) [2024-06-28 10:35:18,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 3397763072. Throughput: 0: 44037.6. Samples: 3300697840. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 10:35:18,851][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 10:35:21,722][06909] Updated weights for policy 0, policy_version 207393 (0.0023) [2024-06-28 10:35:23,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3398008832. Throughput: 0: 44050.7. Samples: 3300959900. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 10:35:23,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 10:35:26,020][06909] Updated weights for policy 0, policy_version 207403 (0.0033) [2024-06-28 10:35:28,852][06674] Fps is (10 sec: 44228.3, 60 sec: 43962.2, 300 sec: 43819.9). Total num frames: 3398205440. Throughput: 0: 44071.1. Samples: 3301099420. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 10:35:28,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:35:29,352][06909] Updated weights for policy 0, policy_version 207413 (0.0039) [2024-06-28 10:35:33,315][06909] Updated weights for policy 0, policy_version 207423 (0.0039) [2024-06-28 10:35:33,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 3398418432. Throughput: 0: 43884.5. Samples: 3301352900. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 10:35:33,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 10:35:36,567][06909] Updated weights for policy 0, policy_version 207433 (0.0032) [2024-06-28 10:35:38,850][06674] Fps is (10 sec: 47523.3, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 3398680576. Throughput: 0: 44132.1. Samples: 3301622160. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 10:35:38,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 10:35:40,558][06909] Updated weights for policy 0, policy_version 207443 (0.0026) [2024-06-28 10:35:43,430][06887] Signal inference workers to stop experience collection... (46700 times) [2024-06-28 10:35:43,436][06887] Signal inference workers to resume experience collection... (46700 times) [2024-06-28 10:35:43,454][06909] InferenceWorker_p0-w0: stopping experience collection (46700 times) [2024-06-28 10:35:43,455][06909] InferenceWorker_p0-w0: resuming experience collection (46700 times) [2024-06-28 10:35:43,850][06674] Fps is (10 sec: 47514.0, 60 sec: 44238.3, 300 sec: 43875.8). Total num frames: 3398893568. Throughput: 0: 44087.1. Samples: 3301758640. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 10:35:43,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 10:35:44,115][06909] Updated weights for policy 0, policy_version 207453 (0.0036) [2024-06-28 10:35:48,647][06909] Updated weights for policy 0, policy_version 207463 (0.0058) [2024-06-28 10:35:48,850][06674] Fps is (10 sec: 39321.3, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 3399073792. Throughput: 0: 43956.5. Samples: 3302013960. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 10:35:48,854][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 10:35:48,867][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000207463_3399073792.pth... [2024-06-28 10:35:48,921][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000206822_3388571648.pth [2024-06-28 10:35:51,451][06909] Updated weights for policy 0, policy_version 207473 (0.0027) [2024-06-28 10:35:53,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3399319552. Throughput: 0: 43955.6. Samples: 3302281420. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 10:35:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:35:55,892][06909] Updated weights for policy 0, policy_version 207483 (0.0024) [2024-06-28 10:35:58,850][06674] Fps is (10 sec: 47513.8, 60 sec: 44236.8, 300 sec: 43875.8). Total num frames: 3399548928. Throughput: 0: 44037.6. Samples: 3302423340. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 10:35:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:35:58,934][06909] Updated weights for policy 0, policy_version 207493 (0.0023) [2024-06-28 10:36:03,073][06909] Updated weights for policy 0, policy_version 207503 (0.0027) [2024-06-28 10:36:03,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 3399729152. Throughput: 0: 43996.5. Samples: 3302677680. Policy #0 lag: (min: 0.0, avg: 12.0, max: 24.0) [2024-06-28 10:36:03,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 10:36:06,275][06909] Updated weights for policy 0, policy_version 207513 (0.0033) [2024-06-28 10:36:08,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 3400007680. Throughput: 0: 44095.1. Samples: 3302944180. Policy #0 lag: (min: 0.0, avg: 12.0, max: 24.0) [2024-06-28 10:36:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:36:10,571][06909] Updated weights for policy 0, policy_version 207523 (0.0027) [2024-06-28 10:36:13,501][06909] Updated weights for policy 0, policy_version 207533 (0.0031) [2024-06-28 10:36:13,850][06674] Fps is (10 sec: 49152.0, 60 sec: 44236.7, 300 sec: 43931.4). Total num frames: 3400220672. Throughput: 0: 44190.4. Samples: 3303087900. Policy #0 lag: (min: 0.0, avg: 12.0, max: 24.0) [2024-06-28 10:36:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:36:17,708][06909] Updated weights for policy 0, policy_version 207543 (0.0041) [2024-06-28 10:36:18,850][06674] Fps is (10 sec: 39321.3, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 3400400896. Throughput: 0: 44296.0. Samples: 3303346220. Policy #0 lag: (min: 0.0, avg: 12.0, max: 24.0) [2024-06-28 10:36:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:36:20,984][06909] Updated weights for policy 0, policy_version 207553 (0.0025) [2024-06-28 10:36:23,856][06674] Fps is (10 sec: 44210.2, 60 sec: 44232.3, 300 sec: 44041.5). Total num frames: 3400663040. Throughput: 0: 44124.3. Samples: 3303608020. Policy #0 lag: (min: 0.0, avg: 12.0, max: 24.0) [2024-06-28 10:36:23,856][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:36:25,642][06909] Updated weights for policy 0, policy_version 207563 (0.0034) [2024-06-28 10:36:28,682][06909] Updated weights for policy 0, policy_version 207573 (0.0032) [2024-06-28 10:36:28,850][06674] Fps is (10 sec: 47513.7, 60 sec: 44511.4, 300 sec: 43931.3). Total num frames: 3400876032. Throughput: 0: 44190.6. Samples: 3303747220. Policy #0 lag: (min: 0.0, avg: 12.0, max: 24.0) [2024-06-28 10:36:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:36:32,944][06909] Updated weights for policy 0, policy_version 207583 (0.0027) [2024-06-28 10:36:33,850][06674] Fps is (10 sec: 39345.7, 60 sec: 43963.8, 300 sec: 43764.7). Total num frames: 3401056256. Throughput: 0: 44265.9. Samples: 3304005920. Policy #0 lag: (min: 0.0, avg: 12.0, max: 24.0) [2024-06-28 10:36:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:36:36,102][06909] Updated weights for policy 0, policy_version 207593 (0.0034) [2024-06-28 10:36:38,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 3401334784. Throughput: 0: 44104.4. Samples: 3304266120. Policy #0 lag: (min: 0.0, avg: 12.0, max: 24.0) [2024-06-28 10:36:38,850][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 10:36:40,427][06909] Updated weights for policy 0, policy_version 207603 (0.0036) [2024-06-28 10:36:41,980][06887] Signal inference workers to stop experience collection... (46750 times) [2024-06-28 10:36:41,980][06887] Signal inference workers to resume experience collection... (46750 times) [2024-06-28 10:36:42,030][06909] InferenceWorker_p0-w0: stopping experience collection (46750 times) [2024-06-28 10:36:42,030][06909] InferenceWorker_p0-w0: resuming experience collection (46750 times) [2024-06-28 10:36:43,608][06909] Updated weights for policy 0, policy_version 207613 (0.0034) [2024-06-28 10:36:43,852][06674] Fps is (10 sec: 49141.2, 60 sec: 44235.2, 300 sec: 43986.6). Total num frames: 3401547776. Throughput: 0: 44053.1. Samples: 3304405820. Policy #0 lag: (min: 0.0, avg: 12.0, max: 24.0) [2024-06-28 10:36:43,853][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:36:47,945][06909] Updated weights for policy 0, policy_version 207623 (0.0025) [2024-06-28 10:36:48,850][06674] Fps is (10 sec: 36044.9, 60 sec: 43690.8, 300 sec: 43764.7). Total num frames: 3401695232. Throughput: 0: 44011.6. Samples: 3304658200. Policy #0 lag: (min: 0.0, avg: 12.0, max: 24.0) [2024-06-28 10:36:48,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 10:36:51,013][06909] Updated weights for policy 0, policy_version 207633 (0.0036) [2024-06-28 10:36:53,850][06674] Fps is (10 sec: 44246.1, 60 sec: 44509.8, 300 sec: 44097.9). Total num frames: 3401990144. Throughput: 0: 43917.3. Samples: 3304920460. Policy #0 lag: (min: 0.0, avg: 12.0, max: 24.0) [2024-06-28 10:36:53,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 10:36:55,351][06909] Updated weights for policy 0, policy_version 207643 (0.0037) [2024-06-28 10:36:58,397][06909] Updated weights for policy 0, policy_version 207653 (0.0038) [2024-06-28 10:36:58,850][06674] Fps is (10 sec: 50790.2, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3402203136. Throughput: 0: 43863.1. Samples: 3305061740. Policy #0 lag: (min: 0.0, avg: 12.0, max: 24.0) [2024-06-28 10:36:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:37:02,994][06909] Updated weights for policy 0, policy_version 207663 (0.0029) [2024-06-28 10:37:03,852][06674] Fps is (10 sec: 37675.6, 60 sec: 43962.3, 300 sec: 43764.4). Total num frames: 3402366976. Throughput: 0: 43793.6. Samples: 3305317020. Policy #0 lag: (min: 0.0, avg: 12.0, max: 24.0) [2024-06-28 10:37:03,852][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 10:37:06,171][06909] Updated weights for policy 0, policy_version 207673 (0.0025) [2024-06-28 10:37:08,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.7, 300 sec: 44098.3). Total num frames: 3402645504. Throughput: 0: 43771.6. Samples: 3305577480. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 10:37:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:37:10,203][06909] Updated weights for policy 0, policy_version 207683 (0.0038) [2024-06-28 10:37:13,712][06909] Updated weights for policy 0, policy_version 207693 (0.0036) [2024-06-28 10:37:13,850][06674] Fps is (10 sec: 47523.5, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 3402842112. Throughput: 0: 43737.9. Samples: 3305715420. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 10:37:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:37:17,610][06909] Updated weights for policy 0, policy_version 207703 (0.0034) [2024-06-28 10:37:18,850][06674] Fps is (10 sec: 39321.5, 60 sec: 43963.7, 300 sec: 43820.3). Total num frames: 3403038720. Throughput: 0: 43749.6. Samples: 3305974660. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 10:37:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:37:21,238][06909] Updated weights for policy 0, policy_version 207713 (0.0033) [2024-06-28 10:37:23,851][06674] Fps is (10 sec: 44230.3, 60 sec: 43694.0, 300 sec: 43986.7). Total num frames: 3403284480. Throughput: 0: 43590.6. Samples: 3306227760. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 10:37:23,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:37:25,330][06909] Updated weights for policy 0, policy_version 207723 (0.0028) [2024-06-28 10:37:28,541][06909] Updated weights for policy 0, policy_version 207733 (0.0032) [2024-06-28 10:37:28,850][06674] Fps is (10 sec: 47514.1, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 3403513856. Throughput: 0: 43733.2. Samples: 3306373720. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 10:37:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:37:32,517][06909] Updated weights for policy 0, policy_version 207743 (0.0041) [2024-06-28 10:37:33,850][06674] Fps is (10 sec: 40965.2, 60 sec: 43963.6, 300 sec: 43820.5). Total num frames: 3403694080. Throughput: 0: 43994.9. Samples: 3306637980. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 10:37:33,851][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 10:37:36,278][06909] Updated weights for policy 0, policy_version 207753 (0.0024) [2024-06-28 10:37:38,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43417.6, 300 sec: 43986.9). Total num frames: 3403939840. Throughput: 0: 43846.2. Samples: 3306893540. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 10:37:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:37:40,056][06909] Updated weights for policy 0, policy_version 207763 (0.0034) [2024-06-28 10:37:43,517][06909] Updated weights for policy 0, policy_version 207773 (0.0031) [2024-06-28 10:37:43,850][06674] Fps is (10 sec: 45875.8, 60 sec: 43419.1, 300 sec: 43820.3). Total num frames: 3404152832. Throughput: 0: 43747.5. Samples: 3307030380. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 10:37:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 10:37:47,487][06909] Updated weights for policy 0, policy_version 207783 (0.0032) [2024-06-28 10:37:48,850][06674] Fps is (10 sec: 40959.9, 60 sec: 44236.7, 300 sec: 43764.7). Total num frames: 3404349440. Throughput: 0: 43807.7. Samples: 3307288280. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 10:37:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:37:48,899][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000207786_3404365824.pth... [2024-06-28 10:37:48,960][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000207143_3393830912.pth [2024-06-28 10:37:51,334][06909] Updated weights for policy 0, policy_version 207793 (0.0053) [2024-06-28 10:37:53,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43417.6, 300 sec: 43986.9). Total num frames: 3404595200. Throughput: 0: 43645.8. Samples: 3307541540. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 10:37:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 10:37:55,182][06909] Updated weights for policy 0, policy_version 207803 (0.0041) [2024-06-28 10:37:58,683][06909] Updated weights for policy 0, policy_version 207813 (0.0034) [2024-06-28 10:37:58,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43417.6, 300 sec: 43931.4). Total num frames: 3404808192. Throughput: 0: 43683.9. Samples: 3307681200. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 10:37:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:38:02,390][06909] Updated weights for policy 0, policy_version 207823 (0.0027) [2024-06-28 10:38:03,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43965.2, 300 sec: 43820.3). Total num frames: 3405004800. Throughput: 0: 43703.1. Samples: 3307941300. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 10:38:03,854][06674] Avg episode reward: [(0, '0.415')] [2024-06-28 10:38:05,833][06887] Signal inference workers to stop experience collection... (46800 times) [2024-06-28 10:38:05,838][06887] Signal inference workers to resume experience collection... (46800 times) [2024-06-28 10:38:05,854][06909] InferenceWorker_p0-w0: stopping experience collection (46800 times) [2024-06-28 10:38:05,854][06909] InferenceWorker_p0-w0: resuming experience collection (46800 times) [2024-06-28 10:38:05,986][06909] Updated weights for policy 0, policy_version 207833 (0.0028) [2024-06-28 10:38:08,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43417.7, 300 sec: 43986.9). Total num frames: 3405250560. Throughput: 0: 43869.4. Samples: 3308201820. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 10:38:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:38:10,045][06909] Updated weights for policy 0, policy_version 207843 (0.0039) [2024-06-28 10:38:13,698][06909] Updated weights for policy 0, policy_version 207853 (0.0033) [2024-06-28 10:38:13,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43690.6, 300 sec: 43820.2). Total num frames: 3405463552. Throughput: 0: 43711.5. Samples: 3308340740. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 10:38:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:38:17,471][06909] Updated weights for policy 0, policy_version 207863 (0.0027) [2024-06-28 10:38:18,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 3405676544. Throughput: 0: 43731.2. Samples: 3308605880. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 10:38:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:38:20,883][06909] Updated weights for policy 0, policy_version 207873 (0.0045) [2024-06-28 10:38:23,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43691.6, 300 sec: 44042.4). Total num frames: 3405905920. Throughput: 0: 43851.9. Samples: 3308866880. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 10:38:23,851][06674] Avg episode reward: [(0, '0.418')] [2024-06-28 10:38:24,802][06909] Updated weights for policy 0, policy_version 207883 (0.0027) [2024-06-28 10:38:28,645][06909] Updated weights for policy 0, policy_version 207893 (0.0029) [2024-06-28 10:38:28,852][06674] Fps is (10 sec: 44227.8, 60 sec: 43416.1, 300 sec: 43931.0). Total num frames: 3406118912. Throughput: 0: 43792.2. Samples: 3309001120. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 10:38:28,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:38:32,341][06909] Updated weights for policy 0, policy_version 207903 (0.0037) [2024-06-28 10:38:33,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.9, 300 sec: 43875.8). Total num frames: 3406331904. Throughput: 0: 43972.1. Samples: 3309267020. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 10:38:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:38:35,781][06909] Updated weights for policy 0, policy_version 207913 (0.0038) [2024-06-28 10:38:38,850][06674] Fps is (10 sec: 45884.8, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3406577664. Throughput: 0: 44292.1. Samples: 3309534680. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 10:38:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:38:39,799][06909] Updated weights for policy 0, policy_version 207923 (0.0022) [2024-06-28 10:38:42,982][06909] Updated weights for policy 0, policy_version 207933 (0.0029) [2024-06-28 10:38:43,850][06674] Fps is (10 sec: 47513.2, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3406807040. Throughput: 0: 44289.3. Samples: 3309674220. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 10:38:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:38:47,159][06909] Updated weights for policy 0, policy_version 207943 (0.0029) [2024-06-28 10:38:48,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44509.8, 300 sec: 43932.2). Total num frames: 3407020032. Throughput: 0: 44389.8. Samples: 3309938840. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 10:38:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:38:50,723][06909] Updated weights for policy 0, policy_version 207953 (0.0036) [2024-06-28 10:38:53,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 3407216640. Throughput: 0: 44275.5. Samples: 3310194220. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 10:38:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:38:54,575][06909] Updated weights for policy 0, policy_version 207963 (0.0029) [2024-06-28 10:38:57,938][06909] Updated weights for policy 0, policy_version 207973 (0.0028) [2024-06-28 10:38:58,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43690.7, 300 sec: 43931.4). Total num frames: 3407429632. Throughput: 0: 44132.5. Samples: 3310326700. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 10:38:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:39:02,032][06909] Updated weights for policy 0, policy_version 207983 (0.0022) [2024-06-28 10:39:03,850][06674] Fps is (10 sec: 45875.7, 60 sec: 44509.9, 300 sec: 43931.3). Total num frames: 3407675392. Throughput: 0: 44157.9. Samples: 3310592980. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 10:39:03,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 10:39:05,618][06909] Updated weights for policy 0, policy_version 207993 (0.0026) [2024-06-28 10:39:08,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 3407888384. Throughput: 0: 44208.0. Samples: 3310856240. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 10:39:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:39:09,599][06909] Updated weights for policy 0, policy_version 208003 (0.0033) [2024-06-28 10:39:12,875][06909] Updated weights for policy 0, policy_version 208013 (0.0027) [2024-06-28 10:39:13,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 3408117760. Throughput: 0: 44155.4. Samples: 3310988020. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 10:39:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:39:17,036][06909] Updated weights for policy 0, policy_version 208023 (0.0028) [2024-06-28 10:39:18,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44509.8, 300 sec: 43986.9). Total num frames: 3408347136. Throughput: 0: 44183.4. Samples: 3311255280. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 10:39:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:39:20,127][06909] Updated weights for policy 0, policy_version 208033 (0.0028) [2024-06-28 10:39:23,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 3408527360. Throughput: 0: 44081.7. Samples: 3311518360. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-28 10:39:23,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:39:24,687][06909] Updated weights for policy 0, policy_version 208043 (0.0022) [2024-06-28 10:39:27,871][06909] Updated weights for policy 0, policy_version 208053 (0.0034) [2024-06-28 10:39:28,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44238.3, 300 sec: 43986.9). Total num frames: 3408773120. Throughput: 0: 43883.1. Samples: 3311648960. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-28 10:39:28,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:39:32,047][06909] Updated weights for policy 0, policy_version 208063 (0.0040) [2024-06-28 10:39:33,852][06674] Fps is (10 sec: 45866.2, 60 sec: 44235.3, 300 sec: 43986.6). Total num frames: 3408986112. Throughput: 0: 43901.6. Samples: 3311914500. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-28 10:39:33,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:39:35,387][06909] Updated weights for policy 0, policy_version 208073 (0.0040) [2024-06-28 10:39:38,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43690.5, 300 sec: 43931.6). Total num frames: 3409199104. Throughput: 0: 44039.0. Samples: 3312175980. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-28 10:39:38,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 10:39:39,784][06909] Updated weights for policy 0, policy_version 208083 (0.0029) [2024-06-28 10:39:42,175][06887] Signal inference workers to stop experience collection... (46850 times) [2024-06-28 10:39:42,176][06887] Signal inference workers to resume experience collection... (46850 times) [2024-06-28 10:39:42,196][06909] InferenceWorker_p0-w0: stopping experience collection (46850 times) [2024-06-28 10:39:42,228][06909] InferenceWorker_p0-w0: resuming experience collection (46850 times) [2024-06-28 10:39:42,850][06909] Updated weights for policy 0, policy_version 208093 (0.0038) [2024-06-28 10:39:43,850][06674] Fps is (10 sec: 44245.3, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 3409428480. Throughput: 0: 43911.0. Samples: 3312302700. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-28 10:39:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:39:47,238][06909] Updated weights for policy 0, policy_version 208103 (0.0033) [2024-06-28 10:39:48,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3409657856. Throughput: 0: 44024.7. Samples: 3312574100. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-28 10:39:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:39:48,857][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000208109_3409657856.pth... [2024-06-28 10:39:48,908][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000207463_3399073792.pth [2024-06-28 10:39:50,087][06909] Updated weights for policy 0, policy_version 208113 (0.0027) [2024-06-28 10:39:53,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 3409838080. Throughput: 0: 43991.0. Samples: 3312835840. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-28 10:39:53,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 10:39:54,617][06909] Updated weights for policy 0, policy_version 208123 (0.0043) [2024-06-28 10:39:57,780][06909] Updated weights for policy 0, policy_version 208133 (0.0028) [2024-06-28 10:39:58,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 3410100224. Throughput: 0: 43791.0. Samples: 3312958620. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-28 10:39:58,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 10:40:01,764][06909] Updated weights for policy 0, policy_version 208143 (0.0031) [2024-06-28 10:40:03,850][06674] Fps is (10 sec: 47514.4, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3410313216. Throughput: 0: 43983.6. Samples: 3313234540. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-28 10:40:03,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 10:40:05,108][06909] Updated weights for policy 0, policy_version 208153 (0.0021) [2024-06-28 10:40:08,850][06674] Fps is (10 sec: 40960.7, 60 sec: 43690.8, 300 sec: 43875.8). Total num frames: 3410509824. Throughput: 0: 43897.5. Samples: 3313493740. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-28 10:40:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:40:09,581][06909] Updated weights for policy 0, policy_version 208163 (0.0038) [2024-06-28 10:40:12,490][06909] Updated weights for policy 0, policy_version 208173 (0.0041) [2024-06-28 10:40:13,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 3410771968. Throughput: 0: 43847.2. Samples: 3313622080. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-28 10:40:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:40:17,147][06909] Updated weights for policy 0, policy_version 208183 (0.0029) [2024-06-28 10:40:18,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 3410968576. Throughput: 0: 43922.8. Samples: 3313890940. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-28 10:40:18,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:40:20,098][06909] Updated weights for policy 0, policy_version 208193 (0.0035) [2024-06-28 10:40:23,850][06674] Fps is (10 sec: 40959.9, 60 sec: 44236.9, 300 sec: 43987.2). Total num frames: 3411181568. Throughput: 0: 43957.5. Samples: 3314154060. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-28 10:40:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:40:24,694][06909] Updated weights for policy 0, policy_version 208203 (0.0041) [2024-06-28 10:40:27,269][06909] Updated weights for policy 0, policy_version 208213 (0.0037) [2024-06-28 10:40:28,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 3411427328. Throughput: 0: 44030.7. Samples: 3314284080. Policy #0 lag: (min: 0.0, avg: 10.8, max: 22.0) [2024-06-28 10:40:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:40:31,797][06909] Updated weights for policy 0, policy_version 208223 (0.0033) [2024-06-28 10:40:33,852][06674] Fps is (10 sec: 45865.9, 60 sec: 44236.8, 300 sec: 43931.0). Total num frames: 3411640320. Throughput: 0: 44021.7. Samples: 3314555160. Policy #0 lag: (min: 0.0, avg: 11.7, max: 20.0) [2024-06-28 10:40:33,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:40:35,049][06909] Updated weights for policy 0, policy_version 208233 (0.0034) [2024-06-28 10:40:38,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 3411836928. Throughput: 0: 43980.6. Samples: 3314814960. Policy #0 lag: (min: 0.0, avg: 11.7, max: 20.0) [2024-06-28 10:40:38,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:40:39,040][06909] Updated weights for policy 0, policy_version 208243 (0.0041) [2024-06-28 10:40:42,306][06909] Updated weights for policy 0, policy_version 208253 (0.0046) [2024-06-28 10:40:43,850][06674] Fps is (10 sec: 44245.4, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 3412082688. Throughput: 0: 44103.6. Samples: 3314943280. Policy #0 lag: (min: 0.0, avg: 11.7, max: 20.0) [2024-06-28 10:40:43,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 10:40:46,464][06909] Updated weights for policy 0, policy_version 208263 (0.0043) [2024-06-28 10:40:48,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43417.6, 300 sec: 43875.8). Total num frames: 3412262912. Throughput: 0: 43917.7. Samples: 3315210840. Policy #0 lag: (min: 0.0, avg: 11.7, max: 20.0) [2024-06-28 10:40:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:40:49,832][06909] Updated weights for policy 0, policy_version 208273 (0.0031) [2024-06-28 10:40:53,850][06674] Fps is (10 sec: 40960.5, 60 sec: 44236.9, 300 sec: 43875.8). Total num frames: 3412492288. Throughput: 0: 44035.1. Samples: 3315475320. Policy #0 lag: (min: 0.0, avg: 11.7, max: 20.0) [2024-06-28 10:40:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:40:54,418][06909] Updated weights for policy 0, policy_version 208283 (0.0035) [2024-06-28 10:40:57,075][06909] Updated weights for policy 0, policy_version 208293 (0.0028) [2024-06-28 10:40:58,850][06674] Fps is (10 sec: 49151.4, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 3412754432. Throughput: 0: 44040.7. Samples: 3315603920. Policy #0 lag: (min: 0.0, avg: 11.7, max: 20.0) [2024-06-28 10:40:58,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:41:01,596][06909] Updated weights for policy 0, policy_version 208303 (0.0059) [2024-06-28 10:41:03,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43690.6, 300 sec: 43820.2). Total num frames: 3412934656. Throughput: 0: 43952.9. Samples: 3315868820. Policy #0 lag: (min: 0.0, avg: 11.7, max: 20.0) [2024-06-28 10:41:03,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:41:04,741][06909] Updated weights for policy 0, policy_version 208313 (0.0030) [2024-06-28 10:41:08,813][06909] Updated weights for policy 0, policy_version 208323 (0.0031) [2024-06-28 10:41:08,850][06674] Fps is (10 sec: 40960.7, 60 sec: 44236.8, 300 sec: 43875.8). Total num frames: 3413164032. Throughput: 0: 43989.4. Samples: 3316133580. Policy #0 lag: (min: 0.0, avg: 11.7, max: 20.0) [2024-06-28 10:41:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:41:12,391][06909] Updated weights for policy 0, policy_version 208333 (0.0030) [2024-06-28 10:41:13,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 3413393408. Throughput: 0: 43913.0. Samples: 3316260160. Policy #0 lag: (min: 0.0, avg: 11.7, max: 20.0) [2024-06-28 10:41:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:41:16,083][06909] Updated weights for policy 0, policy_version 208343 (0.0027) [2024-06-28 10:41:17,314][06887] Signal inference workers to stop experience collection... (46900 times) [2024-06-28 10:41:17,346][06909] InferenceWorker_p0-w0: stopping experience collection (46900 times) [2024-06-28 10:41:17,366][06887] Signal inference workers to resume experience collection... (46900 times) [2024-06-28 10:41:17,369][06909] InferenceWorker_p0-w0: resuming experience collection (46900 times) [2024-06-28 10:41:18,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.7, 300 sec: 43821.1). Total num frames: 3413590016. Throughput: 0: 43944.6. Samples: 3316532580. Policy #0 lag: (min: 0.0, avg: 11.7, max: 20.0) [2024-06-28 10:41:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:41:19,585][06909] Updated weights for policy 0, policy_version 208353 (0.0040) [2024-06-28 10:41:23,810][06909] Updated weights for policy 0, policy_version 208363 (0.0029) [2024-06-28 10:41:23,852][06674] Fps is (10 sec: 42589.3, 60 sec: 43962.2, 300 sec: 43875.5). Total num frames: 3413819392. Throughput: 0: 43955.3. Samples: 3316793040. Policy #0 lag: (min: 0.0, avg: 11.7, max: 20.0) [2024-06-28 10:41:23,853][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:41:27,108][06909] Updated weights for policy 0, policy_version 208373 (0.0021) [2024-06-28 10:41:28,850][06674] Fps is (10 sec: 47513.7, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 3414065152. Throughput: 0: 44061.4. Samples: 3316926040. Policy #0 lag: (min: 0.0, avg: 11.7, max: 20.0) [2024-06-28 10:41:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:41:31,505][06909] Updated weights for policy 0, policy_version 208383 (0.0038) [2024-06-28 10:41:33,850][06674] Fps is (10 sec: 44246.2, 60 sec: 43692.2, 300 sec: 43820.3). Total num frames: 3414261760. Throughput: 0: 43883.2. Samples: 3317185580. Policy #0 lag: (min: 0.0, avg: 11.7, max: 20.0) [2024-06-28 10:41:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:41:34,365][06909] Updated weights for policy 0, policy_version 208393 (0.0029) [2024-06-28 10:41:38,850][06674] Fps is (10 sec: 39322.2, 60 sec: 43690.8, 300 sec: 43765.0). Total num frames: 3414458368. Throughput: 0: 44010.3. Samples: 3317455780. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-28 10:41:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:41:38,906][06909] Updated weights for policy 0, policy_version 208403 (0.0032) [2024-06-28 10:41:42,077][06909] Updated weights for policy 0, policy_version 208413 (0.0042) [2024-06-28 10:41:43,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 3414720512. Throughput: 0: 43974.8. Samples: 3317582780. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-28 10:41:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:41:46,017][06909] Updated weights for policy 0, policy_version 208423 (0.0028) [2024-06-28 10:41:48,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44236.9, 300 sec: 43820.3). Total num frames: 3414917120. Throughput: 0: 43962.3. Samples: 3317847120. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-28 10:41:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:41:48,918][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000208431_3414933504.pth... [2024-06-28 10:41:48,966][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000207786_3404365824.pth [2024-06-28 10:41:49,569][06909] Updated weights for policy 0, policy_version 208433 (0.0037) [2024-06-28 10:41:53,334][06909] Updated weights for policy 0, policy_version 208443 (0.0029) [2024-06-28 10:41:53,850][06674] Fps is (10 sec: 42597.7, 60 sec: 44236.7, 300 sec: 43875.8). Total num frames: 3415146496. Throughput: 0: 43982.5. Samples: 3318112800. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-28 10:41:53,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:41:56,801][06909] Updated weights for policy 0, policy_version 208453 (0.0042) [2024-06-28 10:41:58,850][06674] Fps is (10 sec: 47513.0, 60 sec: 43963.8, 300 sec: 44153.8). Total num frames: 3415392256. Throughput: 0: 44123.0. Samples: 3318245700. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-28 10:41:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:42:00,879][06909] Updated weights for policy 0, policy_version 208463 (0.0027) [2024-06-28 10:42:03,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44509.8, 300 sec: 43931.3). Total num frames: 3415605248. Throughput: 0: 44046.6. Samples: 3318514680. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-28 10:42:03,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:42:04,144][06909] Updated weights for policy 0, policy_version 208473 (0.0032) [2024-06-28 10:42:08,606][06909] Updated weights for policy 0, policy_version 208483 (0.0045) [2024-06-28 10:42:08,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 3415801856. Throughput: 0: 44211.4. Samples: 3318782460. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-28 10:42:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:42:11,672][06909] Updated weights for policy 0, policy_version 208493 (0.0041) [2024-06-28 10:42:13,852][06674] Fps is (10 sec: 44228.3, 60 sec: 44235.3, 300 sec: 44097.7). Total num frames: 3416047616. Throughput: 0: 44036.3. Samples: 3318907760. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-28 10:42:13,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:42:15,834][06909] Updated weights for policy 0, policy_version 208503 (0.0034) [2024-06-28 10:42:18,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.8, 300 sec: 43931.5). Total num frames: 3416244224. Throughput: 0: 44249.3. Samples: 3319176800. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-28 10:42:18,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:42:19,134][06909] Updated weights for policy 0, policy_version 208513 (0.0029) [2024-06-28 10:42:23,075][06909] Updated weights for policy 0, policy_version 208523 (0.0043) [2024-06-28 10:42:23,850][06674] Fps is (10 sec: 42607.5, 60 sec: 44238.4, 300 sec: 43931.4). Total num frames: 3416473600. Throughput: 0: 44083.6. Samples: 3319439540. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-28 10:42:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:42:26,518][06909] Updated weights for policy 0, policy_version 208533 (0.0041) [2024-06-28 10:42:28,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 3416702976. Throughput: 0: 44223.6. Samples: 3319572840. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-28 10:42:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:42:30,479][06909] Updated weights for policy 0, policy_version 208543 (0.0036) [2024-06-28 10:42:33,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3416915968. Throughput: 0: 44304.5. Samples: 3319840820. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-28 10:42:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:42:33,875][06909] Updated weights for policy 0, policy_version 208553 (0.0028) [2024-06-28 10:42:37,759][06887] Signal inference workers to stop experience collection... (46950 times) [2024-06-28 10:42:37,782][06909] InferenceWorker_p0-w0: stopping experience collection (46950 times) [2024-06-28 10:42:37,821][06887] Signal inference workers to resume experience collection... (46950 times) [2024-06-28 10:42:37,821][06909] InferenceWorker_p0-w0: resuming experience collection (46950 times) [2024-06-28 10:42:38,000][06909] Updated weights for policy 0, policy_version 208563 (0.0034) [2024-06-28 10:42:38,852][06674] Fps is (10 sec: 45865.2, 60 sec: 45054.3, 300 sec: 44097.6). Total num frames: 3417161728. Throughput: 0: 44118.0. Samples: 3320098200. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-28 10:42:38,853][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:42:41,335][06909] Updated weights for policy 0, policy_version 208573 (0.0030) [2024-06-28 10:42:43,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 3417358336. Throughput: 0: 44100.9. Samples: 3320230240. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-28 10:42:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:42:45,545][06909] Updated weights for policy 0, policy_version 208583 (0.0041) [2024-06-28 10:42:48,850][06674] Fps is (10 sec: 40968.9, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3417571328. Throughput: 0: 44072.1. Samples: 3320497920. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 10:42:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:42:48,937][06909] Updated weights for policy 0, policy_version 208593 (0.0034) [2024-06-28 10:42:53,039][06909] Updated weights for policy 0, policy_version 208603 (0.0030) [2024-06-28 10:42:53,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.9, 300 sec: 43986.9). Total num frames: 3417784320. Throughput: 0: 43801.8. Samples: 3320753540. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 10:42:53,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 10:42:56,349][06909] Updated weights for policy 0, policy_version 208613 (0.0031) [2024-06-28 10:42:58,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 3418013696. Throughput: 0: 43969.1. Samples: 3320886280. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 10:42:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:43:00,332][06909] Updated weights for policy 0, policy_version 208623 (0.0031) [2024-06-28 10:43:03,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.8, 300 sec: 43986.9). Total num frames: 3418226688. Throughput: 0: 43894.3. Samples: 3321152040. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 10:43:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:43:03,907][06909] Updated weights for policy 0, policy_version 208633 (0.0034) [2024-06-28 10:43:07,638][06909] Updated weights for policy 0, policy_version 208643 (0.0029) [2024-06-28 10:43:08,854][06674] Fps is (10 sec: 45857.8, 60 sec: 44507.1, 300 sec: 44097.4). Total num frames: 3418472448. Throughput: 0: 43936.2. Samples: 3321416840. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 10:43:08,854][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:43:11,329][06909] Updated weights for policy 0, policy_version 208653 (0.0031) [2024-06-28 10:43:13,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43419.1, 300 sec: 43986.9). Total num frames: 3418652672. Throughput: 0: 43907.1. Samples: 3321548660. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 10:43:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:43:15,058][06909] Updated weights for policy 0, policy_version 208663 (0.0028) [2024-06-28 10:43:18,623][06909] Updated weights for policy 0, policy_version 208673 (0.0030) [2024-06-28 10:43:18,850][06674] Fps is (10 sec: 42614.6, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 3418898432. Throughput: 0: 43926.2. Samples: 3321817500. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 10:43:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:43:22,775][06909] Updated weights for policy 0, policy_version 208683 (0.0029) [2024-06-28 10:43:23,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.7, 300 sec: 44042.7). Total num frames: 3419111424. Throughput: 0: 43900.8. Samples: 3322073640. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 10:43:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 10:43:26,269][06909] Updated weights for policy 0, policy_version 208693 (0.0027) [2024-06-28 10:43:28,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 3419324416. Throughput: 0: 43908.1. Samples: 3322206100. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 10:43:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:43:30,212][06909] Updated weights for policy 0, policy_version 208703 (0.0037) [2024-06-28 10:43:33,690][06909] Updated weights for policy 0, policy_version 208713 (0.0028) [2024-06-28 10:43:33,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3419553792. Throughput: 0: 43853.8. Samples: 3322471340. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 10:43:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:43:37,521][06909] Updated weights for policy 0, policy_version 208723 (0.0032) [2024-06-28 10:43:38,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43419.2, 300 sec: 43931.4). Total num frames: 3419766784. Throughput: 0: 43982.2. Samples: 3322732740. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 10:43:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:43:41,158][06909] Updated weights for policy 0, policy_version 208733 (0.0045) [2024-06-28 10:43:43,850][06674] Fps is (10 sec: 42597.7, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 3419979776. Throughput: 0: 43981.2. Samples: 3322865440. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 10:43:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:43:44,980][06909] Updated weights for policy 0, policy_version 208743 (0.0038) [2024-06-28 10:43:48,311][06909] Updated weights for policy 0, policy_version 208753 (0.0031) [2024-06-28 10:43:48,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3420209152. Throughput: 0: 44223.6. Samples: 3323142100. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 10:43:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:43:48,944][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000208754_3420225536.pth... [2024-06-28 10:43:48,991][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000208109_3409657856.pth [2024-06-28 10:43:52,258][06909] Updated weights for policy 0, policy_version 208763 (0.0031) [2024-06-28 10:43:53,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 3420438528. Throughput: 0: 44041.4. Samples: 3323398540. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 10:43:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:43:55,679][06909] Updated weights for policy 0, policy_version 208773 (0.0041) [2024-06-28 10:43:58,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3420651520. Throughput: 0: 43878.2. Samples: 3323523180. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 10:43:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:43:59,933][06909] Updated weights for policy 0, policy_version 208783 (0.0021) [2024-06-28 10:44:00,978][06887] Signal inference workers to stop experience collection... (47000 times) [2024-06-28 10:44:01,011][06909] InferenceWorker_p0-w0: stopping experience collection (47000 times) [2024-06-28 10:44:01,091][06887] Signal inference workers to resume experience collection... (47000 times) [2024-06-28 10:44:01,092][06909] InferenceWorker_p0-w0: resuming experience collection (47000 times) [2024-06-28 10:44:03,270][06909] Updated weights for policy 0, policy_version 208793 (0.0046) [2024-06-28 10:44:03,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 3420880896. Throughput: 0: 43884.8. Samples: 3323792320. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 10:44:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:44:07,444][06909] Updated weights for policy 0, policy_version 208803 (0.0032) [2024-06-28 10:44:08,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43147.3, 300 sec: 43875.8). Total num frames: 3421061120. Throughput: 0: 44006.6. Samples: 3324053940. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 10:44:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:44:10,626][06909] Updated weights for policy 0, policy_version 208813 (0.0052) [2024-06-28 10:44:13,850][06674] Fps is (10 sec: 40960.6, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 3421290496. Throughput: 0: 43892.1. Samples: 3324181240. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 10:44:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:44:14,738][06909] Updated weights for policy 0, policy_version 208823 (0.0045) [2024-06-28 10:44:18,182][06909] Updated weights for policy 0, policy_version 208833 (0.0030) [2024-06-28 10:44:18,850][06674] Fps is (10 sec: 47512.9, 60 sec: 43963.6, 300 sec: 44097.9). Total num frames: 3421536256. Throughput: 0: 43955.4. Samples: 3324449340. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 10:44:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:44:22,186][06909] Updated weights for policy 0, policy_version 208843 (0.0033) [2024-06-28 10:44:23,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3421749248. Throughput: 0: 44145.4. Samples: 3324719280. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 10:44:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:44:25,437][06909] Updated weights for policy 0, policy_version 208853 (0.0041) [2024-06-28 10:44:28,850][06674] Fps is (10 sec: 42596.1, 60 sec: 43963.2, 300 sec: 43987.1). Total num frames: 3421962240. Throughput: 0: 44061.2. Samples: 3324848220. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 10:44:28,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:44:29,605][06909] Updated weights for policy 0, policy_version 208863 (0.0032) [2024-06-28 10:44:32,844][06909] Updated weights for policy 0, policy_version 208873 (0.0043) [2024-06-28 10:44:33,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44236.7, 300 sec: 44098.0). Total num frames: 3422208000. Throughput: 0: 43786.6. Samples: 3325112500. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 10:44:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:44:37,197][06909] Updated weights for policy 0, policy_version 208883 (0.0051) [2024-06-28 10:44:38,850][06674] Fps is (10 sec: 44239.9, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3422404608. Throughput: 0: 44002.3. Samples: 3325378640. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 10:44:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 10:44:40,411][06909] Updated weights for policy 0, policy_version 208893 (0.0022) [2024-06-28 10:44:43,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 3422617600. Throughput: 0: 44022.1. Samples: 3325504180. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 10:44:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:44:44,676][06909] Updated weights for policy 0, policy_version 208903 (0.0033) [2024-06-28 10:44:48,110][06909] Updated weights for policy 0, policy_version 208913 (0.0043) [2024-06-28 10:44:48,852][06674] Fps is (10 sec: 45865.4, 60 sec: 44235.2, 300 sec: 44153.2). Total num frames: 3422863360. Throughput: 0: 43892.7. Samples: 3325767580. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 10:44:48,853][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 10:44:52,266][06909] Updated weights for policy 0, policy_version 208923 (0.0034) [2024-06-28 10:44:53,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 3423059968. Throughput: 0: 43951.0. Samples: 3326031740. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 10:44:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:44:55,424][06909] Updated weights for policy 0, policy_version 208933 (0.0038) [2024-06-28 10:44:58,850][06674] Fps is (10 sec: 40968.3, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 3423272960. Throughput: 0: 43919.9. Samples: 3326157640. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 10:44:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:44:59,846][06909] Updated weights for policy 0, policy_version 208943 (0.0037) [2024-06-28 10:45:02,643][06909] Updated weights for policy 0, policy_version 208953 (0.0034) [2024-06-28 10:45:03,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 3423518720. Throughput: 0: 43892.6. Samples: 3326424500. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 10:45:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:45:07,254][06909] Updated weights for policy 0, policy_version 208963 (0.0028) [2024-06-28 10:45:08,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 43820.2). Total num frames: 3423698944. Throughput: 0: 43854.5. Samples: 3326692740. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 10:45:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:45:09,327][06887] Signal inference workers to stop experience collection... (47050 times) [2024-06-28 10:45:09,327][06887] Signal inference workers to resume experience collection... (47050 times) [2024-06-28 10:45:09,342][06909] InferenceWorker_p0-w0: stopping experience collection (47050 times) [2024-06-28 10:45:09,342][06909] InferenceWorker_p0-w0: resuming experience collection (47050 times) [2024-06-28 10:45:10,293][06909] Updated weights for policy 0, policy_version 208973 (0.0034) [2024-06-28 10:45:13,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 3423928320. Throughput: 0: 43729.5. Samples: 3326816020. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 10:45:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:45:14,543][06909] Updated weights for policy 0, policy_version 208983 (0.0041) [2024-06-28 10:45:17,912][06909] Updated weights for policy 0, policy_version 208993 (0.0044) [2024-06-28 10:45:18,850][06674] Fps is (10 sec: 49152.8, 60 sec: 44237.0, 300 sec: 44098.0). Total num frames: 3424190464. Throughput: 0: 43790.3. Samples: 3327083060. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 10:45:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:45:22,161][06909] Updated weights for policy 0, policy_version 209003 (0.0035) [2024-06-28 10:45:23,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 3424370688. Throughput: 0: 43869.4. Samples: 3327352760. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 10:45:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:45:25,472][06909] Updated weights for policy 0, policy_version 209013 (0.0028) [2024-06-28 10:45:28,850][06674] Fps is (10 sec: 39321.3, 60 sec: 43691.2, 300 sec: 43876.1). Total num frames: 3424583680. Throughput: 0: 43710.7. Samples: 3327471160. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 10:45:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:45:29,472][06909] Updated weights for policy 0, policy_version 209023 (0.0022) [2024-06-28 10:45:32,683][06909] Updated weights for policy 0, policy_version 209033 (0.0030) [2024-06-28 10:45:33,850][06674] Fps is (10 sec: 47513.4, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 3424845824. Throughput: 0: 43855.4. Samples: 3327740980. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 10:45:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:45:36,886][06909] Updated weights for policy 0, policy_version 209043 (0.0036) [2024-06-28 10:45:38,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 3425026048. Throughput: 0: 44139.5. Samples: 3328018020. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 10:45:38,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:45:39,969][06909] Updated weights for policy 0, policy_version 209053 (0.0034) [2024-06-28 10:45:43,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3425255424. Throughput: 0: 44013.0. Samples: 3328138220. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 10:45:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:45:44,338][06909] Updated weights for policy 0, policy_version 209063 (0.0042) [2024-06-28 10:45:47,988][06909] Updated weights for policy 0, policy_version 209073 (0.0032) [2024-06-28 10:45:48,850][06674] Fps is (10 sec: 47513.9, 60 sec: 43965.2, 300 sec: 44097.9). Total num frames: 3425501184. Throughput: 0: 43855.1. Samples: 3328397980. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 10:45:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:45:48,921][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000209077_3425517568.pth... [2024-06-28 10:45:48,978][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000208431_3414933504.pth [2024-06-28 10:45:52,073][06909] Updated weights for policy 0, policy_version 209083 (0.0031) [2024-06-28 10:45:53,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 3425681408. Throughput: 0: 43842.3. Samples: 3328665640. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 10:45:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:45:55,323][06909] Updated weights for policy 0, policy_version 209093 (0.0022) [2024-06-28 10:45:58,850][06674] Fps is (10 sec: 40959.4, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3425910784. Throughput: 0: 43733.2. Samples: 3328784020. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 10:45:58,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:45:59,371][06909] Updated weights for policy 0, policy_version 209103 (0.0035) [2024-06-28 10:46:02,690][06909] Updated weights for policy 0, policy_version 209113 (0.0034) [2024-06-28 10:46:03,850][06674] Fps is (10 sec: 47513.8, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3426156544. Throughput: 0: 43742.2. Samples: 3329051460. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 10:46:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:46:06,623][06909] Updated weights for policy 0, policy_version 209123 (0.0026) [2024-06-28 10:46:08,850][06674] Fps is (10 sec: 44237.4, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 3426353152. Throughput: 0: 43810.6. Samples: 3329324240. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 10:46:08,864][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 10:46:10,112][06909] Updated weights for policy 0, policy_version 209133 (0.0038) [2024-06-28 10:46:13,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 3426582528. Throughput: 0: 43982.7. Samples: 3329450380. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 10:46:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:46:14,214][06909] Updated weights for policy 0, policy_version 209143 (0.0027) [2024-06-28 10:46:16,421][06887] Signal inference workers to stop experience collection... (47100 times) [2024-06-28 10:46:16,474][06909] InferenceWorker_p0-w0: stopping experience collection (47100 times) [2024-06-28 10:46:16,482][06887] Signal inference workers to resume experience collection... (47100 times) [2024-06-28 10:46:16,492][06909] InferenceWorker_p0-w0: resuming experience collection (47100 times) [2024-06-28 10:46:17,406][06909] Updated weights for policy 0, policy_version 209153 (0.0024) [2024-06-28 10:46:18,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43690.6, 300 sec: 44042.7). Total num frames: 3426811904. Throughput: 0: 44028.5. Samples: 3329722260. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 10:46:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:46:21,350][06909] Updated weights for policy 0, policy_version 209163 (0.0024) [2024-06-28 10:46:23,852][06674] Fps is (10 sec: 44227.8, 60 sec: 44235.3, 300 sec: 43931.0). Total num frames: 3427024896. Throughput: 0: 43915.9. Samples: 3329994320. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 10:46:23,861][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:46:24,996][06909] Updated weights for policy 0, policy_version 209173 (0.0035) [2024-06-28 10:46:28,850][06674] Fps is (10 sec: 42597.6, 60 sec: 44236.7, 300 sec: 43986.8). Total num frames: 3427237888. Throughput: 0: 43872.8. Samples: 3330112500. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 10:46:28,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:46:29,108][06909] Updated weights for policy 0, policy_version 209183 (0.0021) [2024-06-28 10:46:32,450][06909] Updated weights for policy 0, policy_version 209193 (0.0040) [2024-06-28 10:46:33,850][06674] Fps is (10 sec: 45884.1, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 3427483648. Throughput: 0: 44071.5. Samples: 3330381200. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 10:46:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:46:36,467][06909] Updated weights for policy 0, policy_version 209203 (0.0037) [2024-06-28 10:46:38,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 3427663872. Throughput: 0: 44055.6. Samples: 3330648140. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 10:46:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:46:39,924][06909] Updated weights for policy 0, policy_version 209213 (0.0028) [2024-06-28 10:46:43,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3427893248. Throughput: 0: 44109.6. Samples: 3330768940. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 10:46:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:46:43,872][06909] Updated weights for policy 0, policy_version 209223 (0.0029) [2024-06-28 10:46:47,365][06909] Updated weights for policy 0, policy_version 209233 (0.0032) [2024-06-28 10:46:48,850][06674] Fps is (10 sec: 47513.0, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3428139008. Throughput: 0: 44055.0. Samples: 3331033940. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 10:46:48,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:46:51,414][06909] Updated weights for policy 0, policy_version 209243 (0.0035) [2024-06-28 10:46:53,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.8, 300 sec: 43820.3). Total num frames: 3428319232. Throughput: 0: 44147.6. Samples: 3331310880. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 10:46:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:46:54,871][06909] Updated weights for policy 0, policy_version 209253 (0.0035) [2024-06-28 10:46:58,703][06909] Updated weights for policy 0, policy_version 209263 (0.0032) [2024-06-28 10:46:58,850][06674] Fps is (10 sec: 42598.2, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 3428564992. Throughput: 0: 44030.0. Samples: 3331431740. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 10:46:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:47:02,283][06909] Updated weights for policy 0, policy_version 209273 (0.0039) [2024-06-28 10:47:03,850][06674] Fps is (10 sec: 47512.5, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 3428794368. Throughput: 0: 43829.1. Samples: 3331694580. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 10:47:03,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:47:06,191][06909] Updated weights for policy 0, policy_version 209283 (0.0023) [2024-06-28 10:47:08,850][06674] Fps is (10 sec: 40960.9, 60 sec: 43690.7, 300 sec: 43820.6). Total num frames: 3428974592. Throughput: 0: 43732.2. Samples: 3331962180. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 10:47:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:47:10,046][06909] Updated weights for policy 0, policy_version 209293 (0.0026) [2024-06-28 10:47:13,751][06909] Updated weights for policy 0, policy_version 209303 (0.0031) [2024-06-28 10:47:13,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 3429220352. Throughput: 0: 43857.4. Samples: 3332086080. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 10:47:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 10:47:17,380][06909] Updated weights for policy 0, policy_version 209313 (0.0035) [2024-06-28 10:47:18,850][06674] Fps is (10 sec: 47513.6, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3429449728. Throughput: 0: 43777.4. Samples: 3332351180. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 10:47:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 10:47:20,766][06887] Signal inference workers to stop experience collection... (47150 times) [2024-06-28 10:47:20,778][06909] InferenceWorker_p0-w0: stopping experience collection (47150 times) [2024-06-28 10:47:20,825][06887] Signal inference workers to resume experience collection... (47150 times) [2024-06-28 10:47:20,825][06909] InferenceWorker_p0-w0: resuming experience collection (47150 times) [2024-06-28 10:47:20,956][06909] Updated weights for policy 0, policy_version 209323 (0.0031) [2024-06-28 10:47:23,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43692.2, 300 sec: 43875.8). Total num frames: 3429646336. Throughput: 0: 43851.2. Samples: 3332621440. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 10:47:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:47:24,617][06909] Updated weights for policy 0, policy_version 209333 (0.0031) [2024-06-28 10:47:28,706][06909] Updated weights for policy 0, policy_version 209343 (0.0027) [2024-06-28 10:47:28,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 3429875712. Throughput: 0: 44127.9. Samples: 3332754700. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 10:47:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:47:32,120][06909] Updated weights for policy 0, policy_version 209353 (0.0030) [2024-06-28 10:47:33,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43690.7, 300 sec: 43876.1). Total num frames: 3430105088. Throughput: 0: 43981.5. Samples: 3333013100. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 10:47:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:47:36,034][06909] Updated weights for policy 0, policy_version 209363 (0.0040) [2024-06-28 10:47:38,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 3430301696. Throughput: 0: 43882.2. Samples: 3333285580. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 10:47:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 10:47:39,558][06909] Updated weights for policy 0, policy_version 209373 (0.0037) [2024-06-28 10:47:43,589][06909] Updated weights for policy 0, policy_version 209383 (0.0041) [2024-06-28 10:47:43,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 3430547456. Throughput: 0: 43905.9. Samples: 3333407500. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 10:47:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:47:47,000][06909] Updated weights for policy 0, policy_version 209393 (0.0034) [2024-06-28 10:47:48,850][06674] Fps is (10 sec: 47513.1, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3430776832. Throughput: 0: 44028.1. Samples: 3333675840. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 10:47:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:47:48,856][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000209398_3430776832.pth... [2024-06-28 10:47:48,902][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000208754_3420225536.pth [2024-06-28 10:47:50,715][06909] Updated weights for policy 0, policy_version 209403 (0.0039) [2024-06-28 10:47:53,850][06674] Fps is (10 sec: 42598.8, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 3430973440. Throughput: 0: 44106.7. Samples: 3333946980. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 10:47:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:47:54,305][06909] Updated weights for policy 0, policy_version 209413 (0.0025) [2024-06-28 10:47:58,192][06909] Updated weights for policy 0, policy_version 209423 (0.0036) [2024-06-28 10:47:58,852][06674] Fps is (10 sec: 44228.3, 60 sec: 44235.4, 300 sec: 44042.1). Total num frames: 3431219200. Throughput: 0: 44282.1. Samples: 3334078860. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 10:47:58,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:48:01,535][06909] Updated weights for policy 0, policy_version 209433 (0.0037) [2024-06-28 10:48:03,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43690.8, 300 sec: 43876.4). Total num frames: 3431415808. Throughput: 0: 44257.7. Samples: 3334342780. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 10:48:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:48:05,527][06909] Updated weights for policy 0, policy_version 209443 (0.0042) [2024-06-28 10:48:08,850][06674] Fps is (10 sec: 42607.2, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 3431645184. Throughput: 0: 44127.5. Samples: 3334607180. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 10:48:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:48:09,176][06909] Updated weights for policy 0, policy_version 209453 (0.0029) [2024-06-28 10:48:13,192][06909] Updated weights for policy 0, policy_version 209463 (0.0033) [2024-06-28 10:48:13,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 3431874560. Throughput: 0: 44132.5. Samples: 3334740660. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 10:48:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:48:16,683][06909] Updated weights for policy 0, policy_version 209473 (0.0026) [2024-06-28 10:48:18,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 3432087552. Throughput: 0: 44129.7. Samples: 3334998940. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 10:48:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:48:20,734][06909] Updated weights for policy 0, policy_version 209483 (0.0034) [2024-06-28 10:48:23,127][06887] Signal inference workers to stop experience collection... (47200 times) [2024-06-28 10:48:23,127][06887] Signal inference workers to resume experience collection... (47200 times) [2024-06-28 10:48:23,177][06909] InferenceWorker_p0-w0: stopping experience collection (47200 times) [2024-06-28 10:48:23,177][06909] InferenceWorker_p0-w0: resuming experience collection (47200 times) [2024-06-28 10:48:23,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3432300544. Throughput: 0: 43900.1. Samples: 3335261080. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 10:48:23,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 10:48:24,027][06909] Updated weights for policy 0, policy_version 209493 (0.0040) [2024-06-28 10:48:28,080][06909] Updated weights for policy 0, policy_version 209503 (0.0030) [2024-06-28 10:48:28,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 3432513536. Throughput: 0: 44249.2. Samples: 3335398720. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 10:48:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:48:31,462][06909] Updated weights for policy 0, policy_version 209513 (0.0026) [2024-06-28 10:48:33,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3432742912. Throughput: 0: 44117.0. Samples: 3335661100. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 10:48:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:48:35,665][06909] Updated weights for policy 0, policy_version 209523 (0.0039) [2024-06-28 10:48:38,600][06909] Updated weights for policy 0, policy_version 209533 (0.0029) [2024-06-28 10:48:38,850][06674] Fps is (10 sec: 47513.4, 60 sec: 44782.8, 300 sec: 44097.9). Total num frames: 3432988672. Throughput: 0: 43979.8. Samples: 3335926080. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 10:48:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 10:48:42,992][06909] Updated weights for policy 0, policy_version 209543 (0.0032) [2024-06-28 10:48:43,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3433185280. Throughput: 0: 44220.2. Samples: 3336068680. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 10:48:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:48:46,081][06909] Updated weights for policy 0, policy_version 209553 (0.0036) [2024-06-28 10:48:48,850][06674] Fps is (10 sec: 40960.8, 60 sec: 43690.8, 300 sec: 43931.3). Total num frames: 3433398272. Throughput: 0: 44104.5. Samples: 3336327480. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 10:48:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:48:50,548][06909] Updated weights for policy 0, policy_version 209563 (0.0036) [2024-06-28 10:48:53,702][06909] Updated weights for policy 0, policy_version 209573 (0.0036) [2024-06-28 10:48:53,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 3433644032. Throughput: 0: 44015.0. Samples: 3336587860. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 10:48:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:48:57,905][06909] Updated weights for policy 0, policy_version 209583 (0.0037) [2024-06-28 10:48:58,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43692.1, 300 sec: 43931.3). Total num frames: 3433840640. Throughput: 0: 44046.1. Samples: 3336722740. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 10:48:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:49:00,914][06909] Updated weights for policy 0, policy_version 209593 (0.0029) [2024-06-28 10:49:03,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3434053632. Throughput: 0: 44195.1. Samples: 3336987720. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 10:49:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:49:05,376][06909] Updated weights for policy 0, policy_version 209603 (0.0042) [2024-06-28 10:49:08,351][06909] Updated weights for policy 0, policy_version 209613 (0.0040) [2024-06-28 10:49:08,850][06674] Fps is (10 sec: 47514.1, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 3434315776. Throughput: 0: 44198.2. Samples: 3337250000. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 10:49:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:49:12,797][06909] Updated weights for policy 0, policy_version 209623 (0.0033) [2024-06-28 10:49:13,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 3434496000. Throughput: 0: 44302.3. Samples: 3337392320. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 10:49:13,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:49:15,583][06909] Updated weights for policy 0, policy_version 209633 (0.0020) [2024-06-28 10:49:18,850][06674] Fps is (10 sec: 39321.0, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 3434708992. Throughput: 0: 44382.0. Samples: 3337658300. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 10:49:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:49:20,018][06909] Updated weights for policy 0, policy_version 209643 (0.0027) [2024-06-28 10:49:22,969][06909] Updated weights for policy 0, policy_version 209653 (0.0031) [2024-06-28 10:49:23,850][06674] Fps is (10 sec: 47513.8, 60 sec: 44509.8, 300 sec: 44098.0). Total num frames: 3434971136. Throughput: 0: 44032.1. Samples: 3337907520. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 10:49:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:49:27,743][06909] Updated weights for policy 0, policy_version 209663 (0.0040) [2024-06-28 10:49:28,850][06674] Fps is (10 sec: 45876.1, 60 sec: 44236.9, 300 sec: 43931.3). Total num frames: 3435167744. Throughput: 0: 44076.0. Samples: 3338052100. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 10:49:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:49:30,938][06909] Updated weights for policy 0, policy_version 209673 (0.0033) [2024-06-28 10:49:33,850][06674] Fps is (10 sec: 39321.9, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 3435364352. Throughput: 0: 44026.2. Samples: 3338308660. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-28 10:49:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:49:35,120][06909] Updated weights for policy 0, policy_version 209683 (0.0041) [2024-06-28 10:49:37,564][06887] Signal inference workers to stop experience collection... (47250 times) [2024-06-28 10:49:37,564][06887] Signal inference workers to resume experience collection... (47250 times) [2024-06-28 10:49:37,603][06909] InferenceWorker_p0-w0: stopping experience collection (47250 times) [2024-06-28 10:49:37,604][06909] InferenceWorker_p0-w0: resuming experience collection (47250 times) [2024-06-28 10:49:38,206][06909] Updated weights for policy 0, policy_version 209693 (0.0037) [2024-06-28 10:49:38,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 3435626496. Throughput: 0: 43905.8. Samples: 3338563620. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-28 10:49:38,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:49:42,748][06909] Updated weights for policy 0, policy_version 209703 (0.0024) [2024-06-28 10:49:43,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43690.6, 300 sec: 43876.1). Total num frames: 3435806720. Throughput: 0: 44034.2. Samples: 3338704280. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-28 10:49:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:49:45,428][06909] Updated weights for policy 0, policy_version 209713 (0.0025) [2024-06-28 10:49:48,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3436036096. Throughput: 0: 44013.9. Samples: 3338968340. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-28 10:49:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:49:48,916][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000209720_3436052480.pth... [2024-06-28 10:49:48,968][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000209077_3425517568.pth [2024-06-28 10:49:50,094][06909] Updated weights for policy 0, policy_version 209723 (0.0029) [2024-06-28 10:49:53,153][06909] Updated weights for policy 0, policy_version 209733 (0.0029) [2024-06-28 10:49:53,850][06674] Fps is (10 sec: 47513.2, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 3436281856. Throughput: 0: 43884.7. Samples: 3339224820. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-28 10:49:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:49:57,692][06909] Updated weights for policy 0, policy_version 209743 (0.0036) [2024-06-28 10:49:58,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 3436478464. Throughput: 0: 43939.6. Samples: 3339369600. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-28 10:49:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:50:00,505][06909] Updated weights for policy 0, policy_version 209753 (0.0039) [2024-06-28 10:50:03,850][06674] Fps is (10 sec: 39322.2, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3436675072. Throughput: 0: 43757.0. Samples: 3339627360. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-28 10:50:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:50:05,102][06909] Updated weights for policy 0, policy_version 209763 (0.0029) [2024-06-28 10:50:08,222][06909] Updated weights for policy 0, policy_version 209773 (0.0024) [2024-06-28 10:50:08,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43690.6, 300 sec: 44097.9). Total num frames: 3436937216. Throughput: 0: 43812.0. Samples: 3339879060. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-28 10:50:08,859][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:50:12,846][06909] Updated weights for policy 0, policy_version 209783 (0.0028) [2024-06-28 10:50:13,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 3437150208. Throughput: 0: 43797.7. Samples: 3340023000. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-28 10:50:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:50:15,453][06909] Updated weights for policy 0, policy_version 209793 (0.0036) [2024-06-28 10:50:18,852][06674] Fps is (10 sec: 40951.6, 60 sec: 43962.3, 300 sec: 43986.6). Total num frames: 3437346816. Throughput: 0: 43837.5. Samples: 3340281440. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-28 10:50:18,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:50:20,080][06909] Updated weights for policy 0, policy_version 209803 (0.0038) [2024-06-28 10:50:22,835][06909] Updated weights for policy 0, policy_version 209813 (0.0032) [2024-06-28 10:50:23,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43690.8, 300 sec: 44098.0). Total num frames: 3437592576. Throughput: 0: 43953.9. Samples: 3340541540. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-28 10:50:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:50:27,372][06909] Updated weights for policy 0, policy_version 209823 (0.0041) [2024-06-28 10:50:28,850][06674] Fps is (10 sec: 45885.0, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 3437805568. Throughput: 0: 44052.1. Samples: 3340686620. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-28 10:50:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:50:30,268][06909] Updated weights for policy 0, policy_version 209833 (0.0035) [2024-06-28 10:50:33,850][06674] Fps is (10 sec: 39321.4, 60 sec: 43690.7, 300 sec: 43931.4). Total num frames: 3437985792. Throughput: 0: 44089.3. Samples: 3340952360. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-28 10:50:33,850][06674] Avg episode reward: [(0, '0.491')] [2024-06-28 10:50:34,816][06909] Updated weights for policy 0, policy_version 209843 (0.0038) [2024-06-28 10:50:37,858][06909] Updated weights for policy 0, policy_version 209853 (0.0024) [2024-06-28 10:50:38,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 3438264320. Throughput: 0: 43907.7. Samples: 3341200660. Policy #0 lag: (min: 0.0, avg: 12.8, max: 21.0) [2024-06-28 10:50:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:50:42,176][06909] Updated weights for policy 0, policy_version 209863 (0.0029) [2024-06-28 10:50:43,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44236.9, 300 sec: 43931.3). Total num frames: 3438460928. Throughput: 0: 44036.5. Samples: 3341351240. Policy #0 lag: (min: 0.0, avg: 12.8, max: 21.0) [2024-06-28 10:50:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:50:45,337][06909] Updated weights for policy 0, policy_version 209873 (0.0020) [2024-06-28 10:50:48,850][06674] Fps is (10 sec: 39321.5, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 3438657536. Throughput: 0: 43985.8. Samples: 3341606720. Policy #0 lag: (min: 0.0, avg: 12.8, max: 21.0) [2024-06-28 10:50:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:50:49,857][06909] Updated weights for policy 0, policy_version 209883 (0.0032) [2024-06-28 10:50:52,613][06909] Updated weights for policy 0, policy_version 209893 (0.0028) [2024-06-28 10:50:53,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 3438919680. Throughput: 0: 44100.1. Samples: 3341863560. Policy #0 lag: (min: 0.0, avg: 12.8, max: 21.0) [2024-06-28 10:50:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:50:56,823][06887] Signal inference workers to stop experience collection... (47300 times) [2024-06-28 10:50:56,882][06909] InferenceWorker_p0-w0: stopping experience collection (47300 times) [2024-06-28 10:50:56,888][06887] Signal inference workers to resume experience collection... (47300 times) [2024-06-28 10:50:56,897][06909] InferenceWorker_p0-w0: resuming experience collection (47300 times) [2024-06-28 10:50:57,217][06909] Updated weights for policy 0, policy_version 209903 (0.0032) [2024-06-28 10:50:58,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 3439116288. Throughput: 0: 44042.2. Samples: 3342004900. Policy #0 lag: (min: 0.0, avg: 12.8, max: 21.0) [2024-06-28 10:50:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:51:00,093][06909] Updated weights for policy 0, policy_version 209913 (0.0034) [2024-06-28 10:51:03,850][06674] Fps is (10 sec: 40959.7, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3439329280. Throughput: 0: 44217.1. Samples: 3342271120. Policy #0 lag: (min: 0.0, avg: 12.8, max: 21.0) [2024-06-28 10:51:03,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:51:04,581][06909] Updated weights for policy 0, policy_version 209923 (0.0030) [2024-06-28 10:51:07,515][06909] Updated weights for policy 0, policy_version 209933 (0.0033) [2024-06-28 10:51:08,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3439575040. Throughput: 0: 44162.9. Samples: 3342528880. Policy #0 lag: (min: 0.0, avg: 12.8, max: 21.0) [2024-06-28 10:51:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:51:12,010][06909] Updated weights for policy 0, policy_version 209943 (0.0030) [2024-06-28 10:51:13,850][06674] Fps is (10 sec: 47514.0, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3439804416. Throughput: 0: 44107.5. Samples: 3342671460. Policy #0 lag: (min: 0.0, avg: 12.8, max: 21.0) [2024-06-28 10:51:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:51:15,118][06909] Updated weights for policy 0, policy_version 209953 (0.0027) [2024-06-28 10:51:18,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43965.2, 300 sec: 43931.6). Total num frames: 3439984640. Throughput: 0: 44004.8. Samples: 3342932580. Policy #0 lag: (min: 0.0, avg: 12.8, max: 21.0) [2024-06-28 10:51:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:51:19,445][06909] Updated weights for policy 0, policy_version 209963 (0.0037) [2024-06-28 10:51:22,463][06909] Updated weights for policy 0, policy_version 209973 (0.0033) [2024-06-28 10:51:23,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 3440230400. Throughput: 0: 44153.3. Samples: 3343187560. Policy #0 lag: (min: 0.0, avg: 12.8, max: 21.0) [2024-06-28 10:51:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:51:27,199][06909] Updated weights for policy 0, policy_version 209983 (0.0033) [2024-06-28 10:51:28,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 3440459776. Throughput: 0: 43964.4. Samples: 3343329640. Policy #0 lag: (min: 0.0, avg: 12.8, max: 21.0) [2024-06-28 10:51:28,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:51:29,758][06909] Updated weights for policy 0, policy_version 209993 (0.0025) [2024-06-28 10:51:33,850][06674] Fps is (10 sec: 40960.0, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 3440640000. Throughput: 0: 44035.9. Samples: 3343588340. Policy #0 lag: (min: 0.0, avg: 12.8, max: 21.0) [2024-06-28 10:51:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:51:34,582][06909] Updated weights for policy 0, policy_version 210003 (0.0039) [2024-06-28 10:51:37,397][06909] Updated weights for policy 0, policy_version 210013 (0.0034) [2024-06-28 10:51:38,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 3440885760. Throughput: 0: 44166.7. Samples: 3343851060. Policy #0 lag: (min: 0.0, avg: 12.8, max: 21.0) [2024-06-28 10:51:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:51:41,888][06909] Updated weights for policy 0, policy_version 210023 (0.0029) [2024-06-28 10:51:43,850][06674] Fps is (10 sec: 49152.5, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 3441131520. Throughput: 0: 44199.6. Samples: 3343993880. Policy #0 lag: (min: 0.0, avg: 12.8, max: 21.0) [2024-06-28 10:51:43,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 10:51:44,678][06909] Updated weights for policy 0, policy_version 210033 (0.0030) [2024-06-28 10:51:48,856][06674] Fps is (10 sec: 42572.5, 60 sec: 44232.3, 300 sec: 44041.5). Total num frames: 3441311744. Throughput: 0: 44107.4. Samples: 3344256220. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 10:51:48,856][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:51:48,912][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000210042_3441328128.pth... [2024-06-28 10:51:48,966][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000209398_3430776832.pth [2024-06-28 10:51:49,330][06909] Updated weights for policy 0, policy_version 210043 (0.0030) [2024-06-28 10:51:52,331][06909] Updated weights for policy 0, policy_version 210053 (0.0033) [2024-06-28 10:51:53,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3441557504. Throughput: 0: 44016.1. Samples: 3344509600. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 10:51:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:51:56,757][06909] Updated weights for policy 0, policy_version 210063 (0.0034) [2024-06-28 10:51:58,850][06674] Fps is (10 sec: 47542.1, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 3441786880. Throughput: 0: 43987.0. Samples: 3344650880. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 10:51:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 10:52:00,169][06909] Updated weights for policy 0, policy_version 210073 (0.0028) [2024-06-28 10:52:03,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3441967104. Throughput: 0: 43894.2. Samples: 3344907820. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 10:52:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:52:04,191][06909] Updated weights for policy 0, policy_version 210083 (0.0037) [2024-06-28 10:52:07,527][06909] Updated weights for policy 0, policy_version 210093 (0.0023) [2024-06-28 10:52:08,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3442196480. Throughput: 0: 44100.9. Samples: 3345172100. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 10:52:08,853][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:52:11,626][06909] Updated weights for policy 0, policy_version 210103 (0.0032) [2024-06-28 10:52:13,850][06674] Fps is (10 sec: 49152.4, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 3442458624. Throughput: 0: 44066.7. Samples: 3345312640. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 10:52:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:52:14,598][06909] Updated weights for policy 0, policy_version 210113 (0.0030) [2024-06-28 10:52:18,849][06909] Updated weights for policy 0, policy_version 210123 (0.0033) [2024-06-28 10:52:18,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44509.8, 300 sec: 44097.9). Total num frames: 3442655232. Throughput: 0: 44273.8. Samples: 3345580660. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 10:52:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:52:21,858][06909] Updated weights for policy 0, policy_version 210133 (0.0039) [2024-06-28 10:52:23,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3442868224. Throughput: 0: 44227.1. Samples: 3345841280. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 10:52:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:52:26,222][06909] Updated weights for policy 0, policy_version 210143 (0.0029) [2024-06-28 10:52:26,876][06887] Signal inference workers to stop experience collection... (47350 times) [2024-06-28 10:52:26,876][06887] Signal inference workers to resume experience collection... (47350 times) [2024-06-28 10:52:26,918][06909] InferenceWorker_p0-w0: stopping experience collection (47350 times) [2024-06-28 10:52:26,918][06909] InferenceWorker_p0-w0: resuming experience collection (47350 times) [2024-06-28 10:52:28,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3443097600. Throughput: 0: 43914.6. Samples: 3345970040. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 10:52:28,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:52:29,587][06909] Updated weights for policy 0, policy_version 210153 (0.0036) [2024-06-28 10:52:33,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3443294208. Throughput: 0: 44013.9. Samples: 3346236580. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 10:52:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:52:33,999][06909] Updated weights for policy 0, policy_version 210163 (0.0028) [2024-06-28 10:52:36,766][06909] Updated weights for policy 0, policy_version 210173 (0.0042) [2024-06-28 10:52:38,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3443523584. Throughput: 0: 44166.6. Samples: 3346497100. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 10:52:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:52:41,209][06909] Updated weights for policy 0, policy_version 210183 (0.0030) [2024-06-28 10:52:43,850][06674] Fps is (10 sec: 47514.0, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3443769344. Throughput: 0: 44026.8. Samples: 3346632080. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 10:52:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:52:44,638][06909] Updated weights for policy 0, policy_version 210193 (0.0029) [2024-06-28 10:52:48,616][06909] Updated weights for policy 0, policy_version 210203 (0.0026) [2024-06-28 10:52:48,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44241.3, 300 sec: 44042.4). Total num frames: 3443965952. Throughput: 0: 44134.3. Samples: 3346893860. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 10:52:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:52:51,873][06909] Updated weights for policy 0, policy_version 210213 (0.0028) [2024-06-28 10:52:53,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.7, 300 sec: 43931.6). Total num frames: 3444178944. Throughput: 0: 44187.3. Samples: 3347160520. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 10:52:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 10:52:56,042][06909] Updated weights for policy 0, policy_version 210223 (0.0027) [2024-06-28 10:52:58,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 3444424704. Throughput: 0: 44048.0. Samples: 3347294800. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 10:52:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:52:59,624][06909] Updated weights for policy 0, policy_version 210233 (0.0032) [2024-06-28 10:53:03,367][06909] Updated weights for policy 0, policy_version 210243 (0.0034) [2024-06-28 10:53:03,850][06674] Fps is (10 sec: 44236.0, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 3444621312. Throughput: 0: 43858.6. Samples: 3347554300. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 10:53:03,859][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:53:06,830][06909] Updated weights for policy 0, policy_version 210253 (0.0027) [2024-06-28 10:53:08,850][06674] Fps is (10 sec: 42597.7, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3444850688. Throughput: 0: 43940.8. Samples: 3347818620. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 10:53:08,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 10:53:11,118][06909] Updated weights for policy 0, policy_version 210263 (0.0040) [2024-06-28 10:53:13,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 3445080064. Throughput: 0: 43992.1. Samples: 3347949680. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 10:53:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:53:14,161][06909] Updated weights for policy 0, policy_version 210273 (0.0036) [2024-06-28 10:53:18,519][06909] Updated weights for policy 0, policy_version 210283 (0.0036) [2024-06-28 10:53:18,850][06674] Fps is (10 sec: 42599.2, 60 sec: 43690.8, 300 sec: 43986.9). Total num frames: 3445276672. Throughput: 0: 43853.8. Samples: 3348210000. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 10:53:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:53:21,827][06909] Updated weights for policy 0, policy_version 210293 (0.0026) [2024-06-28 10:53:23,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3445506048. Throughput: 0: 44024.8. Samples: 3348478220. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 10:53:23,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:53:25,707][06909] Updated weights for policy 0, policy_version 210303 (0.0033) [2024-06-28 10:53:28,850][06674] Fps is (10 sec: 47513.2, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 3445751808. Throughput: 0: 43905.7. Samples: 3348607840. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 10:53:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:53:29,301][06909] Updated weights for policy 0, policy_version 210313 (0.0033) [2024-06-28 10:53:33,086][06909] Updated weights for policy 0, policy_version 210323 (0.0025) [2024-06-28 10:53:33,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 3445948416. Throughput: 0: 43901.7. Samples: 3348869440. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 10:53:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:53:36,877][06909] Updated weights for policy 0, policy_version 210333 (0.0029) [2024-06-28 10:53:38,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3446161408. Throughput: 0: 43920.3. Samples: 3349136940. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 10:53:38,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:53:40,708][06909] Updated weights for policy 0, policy_version 210343 (0.0030) [2024-06-28 10:53:43,850][06674] Fps is (10 sec: 45875.8, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 3446407168. Throughput: 0: 43764.0. Samples: 3349264180. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 10:53:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:53:44,077][06909] Updated weights for policy 0, policy_version 210353 (0.0038) [2024-06-28 10:53:48,355][06909] Updated weights for policy 0, policy_version 210363 (0.0039) [2024-06-28 10:53:48,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 3446603776. Throughput: 0: 43896.2. Samples: 3349529620. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 10:53:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:53:48,894][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000210365_3446620160.pth... [2024-06-28 10:53:48,949][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000209720_3436052480.pth [2024-06-28 10:53:51,395][06909] Updated weights for policy 0, policy_version 210373 (0.0029) [2024-06-28 10:53:53,851][06674] Fps is (10 sec: 42594.5, 60 sec: 44236.1, 300 sec: 44042.3). Total num frames: 3446833152. Throughput: 0: 43965.1. Samples: 3349797080. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 10:53:53,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:53:55,578][06909] Updated weights for policy 0, policy_version 210383 (0.0034) [2024-06-28 10:53:58,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 3447062528. Throughput: 0: 43909.4. Samples: 3349925600. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 10:53:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:53:59,005][06909] Updated weights for policy 0, policy_version 210393 (0.0033) [2024-06-28 10:54:02,942][06909] Updated weights for policy 0, policy_version 210403 (0.0047) [2024-06-28 10:54:03,850][06674] Fps is (10 sec: 42601.5, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 3447259136. Throughput: 0: 44110.9. Samples: 3350195000. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 10:54:03,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:54:06,537][06909] Updated weights for policy 0, policy_version 210413 (0.0031) [2024-06-28 10:54:08,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3447488512. Throughput: 0: 44057.8. Samples: 3350460820. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 10:54:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:54:10,602][06909] Updated weights for policy 0, policy_version 210423 (0.0038) [2024-06-28 10:54:13,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 3447717888. Throughput: 0: 44029.8. Samples: 3350589180. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 10:54:13,851][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 10:54:14,018][06909] Updated weights for policy 0, policy_version 210433 (0.0024) [2024-06-28 10:54:18,050][06909] Updated weights for policy 0, policy_version 210443 (0.0032) [2024-06-28 10:54:18,851][06674] Fps is (10 sec: 44230.0, 60 sec: 44235.6, 300 sec: 43931.1). Total num frames: 3447930880. Throughput: 0: 44040.4. Samples: 3350851320. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 10:54:18,852][06674] Avg episode reward: [(0, '0.418')] [2024-06-28 10:54:20,782][06887] Signal inference workers to stop experience collection... (47400 times) [2024-06-28 10:54:20,788][06887] Signal inference workers to resume experience collection... (47400 times) [2024-06-28 10:54:20,831][06909] InferenceWorker_p0-w0: stopping experience collection (47400 times) [2024-06-28 10:54:20,831][06909] InferenceWorker_p0-w0: resuming experience collection (47400 times) [2024-06-28 10:54:21,322][06909] Updated weights for policy 0, policy_version 210453 (0.0039) [2024-06-28 10:54:23,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3448143872. Throughput: 0: 43947.3. Samples: 3351114560. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 10:54:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:54:25,374][06909] Updated weights for policy 0, policy_version 210463 (0.0032) [2024-06-28 10:54:28,703][06909] Updated weights for policy 0, policy_version 210473 (0.0038) [2024-06-28 10:54:28,850][06674] Fps is (10 sec: 45881.7, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 3448389632. Throughput: 0: 43980.3. Samples: 3351243300. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 10:54:28,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:54:32,950][06909] Updated weights for policy 0, policy_version 210483 (0.0037) [2024-06-28 10:54:33,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 3448586240. Throughput: 0: 43993.8. Samples: 3351509340. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 10:54:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:54:36,157][06909] Updated weights for policy 0, policy_version 210493 (0.0036) [2024-06-28 10:54:38,850][06674] Fps is (10 sec: 42598.2, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 3448815616. Throughput: 0: 43969.6. Samples: 3351775680. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 10:54:38,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:54:40,158][06909] Updated weights for policy 0, policy_version 210503 (0.0024) [2024-06-28 10:54:43,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 3449028608. Throughput: 0: 43940.8. Samples: 3351902940. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 10:54:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:54:43,873][06909] Updated weights for policy 0, policy_version 210513 (0.0020) [2024-06-28 10:54:47,689][06909] Updated weights for policy 0, policy_version 210523 (0.0028) [2024-06-28 10:54:48,850][06674] Fps is (10 sec: 44237.4, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 3449257984. Throughput: 0: 43881.9. Samples: 3352169680. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 10:54:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:54:51,618][06909] Updated weights for policy 0, policy_version 210533 (0.0030) [2024-06-28 10:54:53,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43964.4, 300 sec: 44042.4). Total num frames: 3449470976. Throughput: 0: 43877.8. Samples: 3352435320. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 10:54:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:54:55,306][06909] Updated weights for policy 0, policy_version 210543 (0.0026) [2024-06-28 10:54:58,838][06909] Updated weights for policy 0, policy_version 210553 (0.0023) [2024-06-28 10:54:58,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.6, 300 sec: 44153.5). Total num frames: 3449700352. Throughput: 0: 43958.2. Samples: 3352567300. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 10:54:58,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:55:02,514][06909] Updated weights for policy 0, policy_version 210563 (0.0034) [2024-06-28 10:55:03,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44237.0, 300 sec: 43986.9). Total num frames: 3449913344. Throughput: 0: 44019.8. Samples: 3352832140. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 10:55:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:55:06,097][06909] Updated weights for policy 0, policy_version 210573 (0.0033) [2024-06-28 10:55:08,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3450126336. Throughput: 0: 44011.6. Samples: 3353095080. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 10:55:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:55:10,010][06909] Updated weights for policy 0, policy_version 210583 (0.0032) [2024-06-28 10:55:13,728][06909] Updated weights for policy 0, policy_version 210593 (0.0032) [2024-06-28 10:55:13,850][06674] Fps is (10 sec: 44236.0, 60 sec: 43963.7, 300 sec: 44098.3). Total num frames: 3450355712. Throughput: 0: 44112.0. Samples: 3353228340. Policy #0 lag: (min: 1.0, avg: 10.2, max: 20.0) [2024-06-28 10:55:13,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:55:17,382][06909] Updated weights for policy 0, policy_version 210603 (0.0029) [2024-06-28 10:55:18,850][06674] Fps is (10 sec: 45874.3, 60 sec: 44237.9, 300 sec: 44042.4). Total num frames: 3450585088. Throughput: 0: 44091.9. Samples: 3353493480. Policy #0 lag: (min: 1.0, avg: 10.2, max: 20.0) [2024-06-28 10:55:18,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:55:20,905][06909] Updated weights for policy 0, policy_version 210613 (0.0027) [2024-06-28 10:55:23,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 3450798080. Throughput: 0: 44128.1. Samples: 3353761440. Policy #0 lag: (min: 1.0, avg: 10.2, max: 20.0) [2024-06-28 10:55:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:55:24,711][06909] Updated weights for policy 0, policy_version 210623 (0.0032) [2024-06-28 10:55:28,529][06909] Updated weights for policy 0, policy_version 210633 (0.0037) [2024-06-28 10:55:28,856][06674] Fps is (10 sec: 42573.1, 60 sec: 43686.3, 300 sec: 44152.6). Total num frames: 3451011072. Throughput: 0: 44218.5. Samples: 3353893040. Policy #0 lag: (min: 1.0, avg: 10.2, max: 20.0) [2024-06-28 10:55:28,857][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:55:32,206][06909] Updated weights for policy 0, policy_version 210643 (0.0026) [2024-06-28 10:55:33,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3451240448. Throughput: 0: 44006.2. Samples: 3354149960. Policy #0 lag: (min: 1.0, avg: 10.2, max: 20.0) [2024-06-28 10:55:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:55:35,869][06909] Updated weights for policy 0, policy_version 210653 (0.0033) [2024-06-28 10:55:38,850][06674] Fps is (10 sec: 44263.4, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3451453440. Throughput: 0: 44153.7. Samples: 3354422240. Policy #0 lag: (min: 1.0, avg: 10.2, max: 20.0) [2024-06-28 10:55:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:55:39,780][06909] Updated weights for policy 0, policy_version 210663 (0.0031) [2024-06-28 10:55:43,163][06909] Updated weights for policy 0, policy_version 210673 (0.0022) [2024-06-28 10:55:43,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 3451666432. Throughput: 0: 44067.7. Samples: 3354550340. Policy #0 lag: (min: 1.0, avg: 10.2, max: 20.0) [2024-06-28 10:55:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:55:47,125][06909] Updated weights for policy 0, policy_version 210683 (0.0036) [2024-06-28 10:55:48,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3451895808. Throughput: 0: 43956.3. Samples: 3354810180. Policy #0 lag: (min: 1.0, avg: 10.2, max: 20.0) [2024-06-28 10:55:48,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 10:55:48,869][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000210687_3451895808.pth... [2024-06-28 10:55:48,932][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000210042_3441328128.pth [2024-06-28 10:55:50,881][06909] Updated weights for policy 0, policy_version 210693 (0.0029) [2024-06-28 10:55:52,054][06887] Signal inference workers to stop experience collection... (47450 times) [2024-06-28 10:55:52,109][06887] Signal inference workers to resume experience collection... (47450 times) [2024-06-28 10:55:52,110][06909] InferenceWorker_p0-w0: stopping experience collection (47450 times) [2024-06-28 10:55:52,124][06909] InferenceWorker_p0-w0: resuming experience collection (47450 times) [2024-06-28 10:55:53,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3452108800. Throughput: 0: 44028.3. Samples: 3355076360. Policy #0 lag: (min: 1.0, avg: 10.2, max: 20.0) [2024-06-28 10:55:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:55:54,717][06909] Updated weights for policy 0, policy_version 210703 (0.0027) [2024-06-28 10:55:58,426][06909] Updated weights for policy 0, policy_version 210713 (0.0038) [2024-06-28 10:55:58,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 3452321792. Throughput: 0: 43921.3. Samples: 3355204800. Policy #0 lag: (min: 1.0, avg: 10.2, max: 20.0) [2024-06-28 10:55:58,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 10:56:02,202][06909] Updated weights for policy 0, policy_version 210723 (0.0027) [2024-06-28 10:56:03,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3452551168. Throughput: 0: 43806.8. Samples: 3355464780. Policy #0 lag: (min: 1.0, avg: 10.2, max: 20.0) [2024-06-28 10:56:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:56:05,759][06909] Updated weights for policy 0, policy_version 210733 (0.0034) [2024-06-28 10:56:08,850][06674] Fps is (10 sec: 44237.6, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 3452764160. Throughput: 0: 43829.4. Samples: 3355733760. Policy #0 lag: (min: 1.0, avg: 10.2, max: 20.0) [2024-06-28 10:56:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:56:09,662][06909] Updated weights for policy 0, policy_version 210743 (0.0038) [2024-06-28 10:56:13,029][06909] Updated weights for policy 0, policy_version 210753 (0.0040) [2024-06-28 10:56:13,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 3452977152. Throughput: 0: 43821.9. Samples: 3355864760. Policy #0 lag: (min: 1.0, avg: 10.2, max: 20.0) [2024-06-28 10:56:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:56:16,854][06909] Updated weights for policy 0, policy_version 210763 (0.0029) [2024-06-28 10:56:18,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.8, 300 sec: 43986.9). Total num frames: 3453206528. Throughput: 0: 44147.6. Samples: 3356136600. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 10:56:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:56:20,444][06909] Updated weights for policy 0, policy_version 210773 (0.0040) [2024-06-28 10:56:23,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43690.8, 300 sec: 43931.4). Total num frames: 3453419520. Throughput: 0: 43966.3. Samples: 3356400720. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 10:56:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:56:24,631][06909] Updated weights for policy 0, policy_version 210783 (0.0036) [2024-06-28 10:56:27,684][06909] Updated weights for policy 0, policy_version 210793 (0.0035) [2024-06-28 10:56:28,856][06674] Fps is (10 sec: 42572.6, 60 sec: 43690.7, 300 sec: 44041.5). Total num frames: 3453632512. Throughput: 0: 44070.5. Samples: 3356533780. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 10:56:28,857][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:56:31,836][06909] Updated weights for policy 0, policy_version 210803 (0.0038) [2024-06-28 10:56:33,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3453861888. Throughput: 0: 44063.6. Samples: 3356793040. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 10:56:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:56:35,115][06909] Updated weights for policy 0, policy_version 210813 (0.0029) [2024-06-28 10:56:38,850][06674] Fps is (10 sec: 45902.7, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 3454091264. Throughput: 0: 44093.3. Samples: 3357060560. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 10:56:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:56:39,294][06909] Updated weights for policy 0, policy_version 210823 (0.0042) [2024-06-28 10:56:42,663][06909] Updated weights for policy 0, policy_version 210833 (0.0031) [2024-06-28 10:56:43,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.7, 300 sec: 44043.3). Total num frames: 3454304256. Throughput: 0: 44152.1. Samples: 3357191640. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 10:56:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:56:46,915][06909] Updated weights for policy 0, policy_version 210843 (0.0049) [2024-06-28 10:56:48,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 3454517248. Throughput: 0: 44225.7. Samples: 3357454940. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 10:56:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:56:50,063][06909] Updated weights for policy 0, policy_version 210853 (0.0036) [2024-06-28 10:56:53,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43963.6, 300 sec: 43931.3). Total num frames: 3454746624. Throughput: 0: 44180.7. Samples: 3357721900. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 10:56:53,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:56:54,426][06909] Updated weights for policy 0, policy_version 210863 (0.0030) [2024-06-28 10:56:57,501][06909] Updated weights for policy 0, policy_version 210873 (0.0035) [2024-06-28 10:56:58,850][06674] Fps is (10 sec: 44237.6, 60 sec: 43963.9, 300 sec: 44042.4). Total num frames: 3454959616. Throughput: 0: 44078.3. Samples: 3357848280. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 10:56:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 10:57:01,933][06909] Updated weights for policy 0, policy_version 210883 (0.0028) [2024-06-28 10:57:03,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3455188992. Throughput: 0: 43914.5. Samples: 3358112760. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 10:57:03,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:57:05,189][06909] Updated weights for policy 0, policy_version 210893 (0.0033) [2024-06-28 10:57:08,173][06887] Signal inference workers to stop experience collection... (47500 times) [2024-06-28 10:57:08,224][06909] InferenceWorker_p0-w0: stopping experience collection (47500 times) [2024-06-28 10:57:08,227][06887] Signal inference workers to resume experience collection... (47500 times) [2024-06-28 10:57:08,239][06909] InferenceWorker_p0-w0: resuming experience collection (47500 times) [2024-06-28 10:57:08,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 3455418368. Throughput: 0: 43981.3. Samples: 3358379880. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 10:57:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:57:09,199][06909] Updated weights for policy 0, policy_version 210903 (0.0036) [2024-06-28 10:57:12,681][06909] Updated weights for policy 0, policy_version 210913 (0.0028) [2024-06-28 10:57:13,852][06674] Fps is (10 sec: 44228.1, 60 sec: 44235.3, 300 sec: 43986.6). Total num frames: 3455631360. Throughput: 0: 44031.4. Samples: 3358515020. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 10:57:13,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:57:16,903][06909] Updated weights for policy 0, policy_version 210923 (0.0042) [2024-06-28 10:57:18,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3455844352. Throughput: 0: 44031.9. Samples: 3358774480. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 10:57:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:57:20,386][06909] Updated weights for policy 0, policy_version 210933 (0.0028) [2024-06-28 10:57:23,850][06674] Fps is (10 sec: 44245.6, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 3456073728. Throughput: 0: 43886.2. Samples: 3359035440. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 10:57:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:57:24,374][06909] Updated weights for policy 0, policy_version 210943 (0.0042) [2024-06-28 10:57:27,835][06909] Updated weights for policy 0, policy_version 210953 (0.0037) [2024-06-28 10:57:28,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44241.2, 300 sec: 44042.4). Total num frames: 3456286720. Throughput: 0: 43901.7. Samples: 3359167220. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 10:57:28,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:57:32,050][06909] Updated weights for policy 0, policy_version 210963 (0.0028) [2024-06-28 10:57:33,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3456516096. Throughput: 0: 43906.8. Samples: 3359430740. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 10:57:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:57:35,190][06909] Updated weights for policy 0, policy_version 210973 (0.0036) [2024-06-28 10:57:38,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 3456729088. Throughput: 0: 43702.8. Samples: 3359688520. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 10:57:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:57:39,368][06909] Updated weights for policy 0, policy_version 210983 (0.0036) [2024-06-28 10:57:42,862][06909] Updated weights for policy 0, policy_version 210993 (0.0037) [2024-06-28 10:57:43,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3456958464. Throughput: 0: 43840.3. Samples: 3359821100. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 10:57:43,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:57:46,684][06909] Updated weights for policy 0, policy_version 211003 (0.0036) [2024-06-28 10:57:48,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3457171456. Throughput: 0: 43871.5. Samples: 3360086980. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 10:57:48,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:57:48,865][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000211009_3457171456.pth... [2024-06-28 10:57:48,915][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000210365_3446620160.pth [2024-06-28 10:57:50,016][06909] Updated weights for policy 0, policy_version 211013 (0.0033) [2024-06-28 10:57:53,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 3457384448. Throughput: 0: 43698.1. Samples: 3360346300. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 10:57:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:57:54,485][06909] Updated weights for policy 0, policy_version 211023 (0.0033) [2024-06-28 10:57:57,790][06909] Updated weights for policy 0, policy_version 211033 (0.0037) [2024-06-28 10:57:58,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 3457597440. Throughput: 0: 43706.8. Samples: 3360481740. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 10:57:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:58:01,990][06909] Updated weights for policy 0, policy_version 211043 (0.0030) [2024-06-28 10:58:03,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3457843200. Throughput: 0: 43901.4. Samples: 3360750040. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 10:58:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:58:05,018][06909] Updated weights for policy 0, policy_version 211053 (0.0032) [2024-06-28 10:58:08,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 3458039808. Throughput: 0: 43852.0. Samples: 3361008780. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 10:58:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 10:58:09,283][06909] Updated weights for policy 0, policy_version 211063 (0.0033) [2024-06-28 10:58:12,357][06909] Updated weights for policy 0, policy_version 211073 (0.0037) [2024-06-28 10:58:13,850][06674] Fps is (10 sec: 42597.6, 60 sec: 43965.1, 300 sec: 44042.4). Total num frames: 3458269184. Throughput: 0: 43870.1. Samples: 3361141380. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 10:58:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:58:16,594][06909] Updated weights for policy 0, policy_version 211083 (0.0037) [2024-06-28 10:58:18,850][06674] Fps is (10 sec: 44237.6, 60 sec: 43963.9, 300 sec: 43986.9). Total num frames: 3458482176. Throughput: 0: 43811.2. Samples: 3361402240. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 10:58:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:58:20,258][06909] Updated weights for policy 0, policy_version 211093 (0.0043) [2024-06-28 10:58:23,850][06674] Fps is (10 sec: 42599.5, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 3458695168. Throughput: 0: 44055.2. Samples: 3361671000. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 10:58:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:58:24,104][06909] Updated weights for policy 0, policy_version 211103 (0.0039) [2024-06-28 10:58:27,429][06909] Updated weights for policy 0, policy_version 211113 (0.0030) [2024-06-28 10:58:28,478][06887] Signal inference workers to stop experience collection... (47550 times) [2024-06-28 10:58:28,482][06887] Signal inference workers to resume experience collection... (47550 times) [2024-06-28 10:58:28,489][06909] InferenceWorker_p0-w0: stopping experience collection (47550 times) [2024-06-28 10:58:28,519][06909] InferenceWorker_p0-w0: resuming experience collection (47550 times) [2024-06-28 10:58:28,850][06674] Fps is (10 sec: 44235.8, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3458924544. Throughput: 0: 43944.9. Samples: 3361798620. Policy #0 lag: (min: 0.0, avg: 10.3, max: 23.0) [2024-06-28 10:58:28,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 10:58:31,307][06909] Updated weights for policy 0, policy_version 211123 (0.0029) [2024-06-28 10:58:33,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3459153920. Throughput: 0: 43981.6. Samples: 3362066140. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 10:58:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 10:58:35,157][06909] Updated weights for policy 0, policy_version 211133 (0.0028) [2024-06-28 10:58:38,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 3459350528. Throughput: 0: 44083.2. Samples: 3362330040. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 10:58:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:58:38,946][06909] Updated weights for policy 0, policy_version 211143 (0.0029) [2024-06-28 10:58:42,354][06909] Updated weights for policy 0, policy_version 211153 (0.0025) [2024-06-28 10:58:43,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.8, 300 sec: 43986.9). Total num frames: 3459579904. Throughput: 0: 43912.6. Samples: 3362457800. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 10:58:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:58:46,167][06909] Updated weights for policy 0, policy_version 211163 (0.0028) [2024-06-28 10:58:48,850][06674] Fps is (10 sec: 47513.3, 60 sec: 44236.9, 300 sec: 44042.5). Total num frames: 3459825664. Throughput: 0: 44045.8. Samples: 3362732100. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 10:58:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:58:49,582][06909] Updated weights for policy 0, policy_version 211173 (0.0031) [2024-06-28 10:58:53,648][06909] Updated weights for policy 0, policy_version 211183 (0.0040) [2024-06-28 10:58:53,850][06674] Fps is (10 sec: 44234.2, 60 sec: 43963.5, 300 sec: 43931.3). Total num frames: 3460022272. Throughput: 0: 44033.8. Samples: 3362990320. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 10:58:53,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 10:58:57,296][06909] Updated weights for policy 0, policy_version 211193 (0.0039) [2024-06-28 10:58:58,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3460235264. Throughput: 0: 43926.4. Samples: 3363118060. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 10:58:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:59:01,282][06909] Updated weights for policy 0, policy_version 211203 (0.0026) [2024-06-28 10:59:03,850][06674] Fps is (10 sec: 47515.6, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 3460497408. Throughput: 0: 44084.3. Samples: 3363386040. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 10:59:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:59:04,555][06909] Updated weights for policy 0, policy_version 211213 (0.0031) [2024-06-28 10:59:08,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 3460677632. Throughput: 0: 43876.8. Samples: 3363645460. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 10:59:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 10:59:08,857][06909] Updated weights for policy 0, policy_version 211223 (0.0026) [2024-06-28 10:59:11,897][06909] Updated weights for policy 0, policy_version 211233 (0.0028) [2024-06-28 10:59:13,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43963.9, 300 sec: 43987.1). Total num frames: 3460907008. Throughput: 0: 43863.2. Samples: 3363772460. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 10:59:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 10:59:16,214][06909] Updated weights for policy 0, policy_version 211243 (0.0027) [2024-06-28 10:59:18,852][06674] Fps is (10 sec: 47504.4, 60 sec: 44508.3, 300 sec: 44097.6). Total num frames: 3461152768. Throughput: 0: 43853.9. Samples: 3364039660. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 10:59:18,852][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 10:59:19,687][06909] Updated weights for policy 0, policy_version 211253 (0.0034) [2024-06-28 10:59:23,572][06909] Updated weights for policy 0, policy_version 211263 (0.0024) [2024-06-28 10:59:23,850][06674] Fps is (10 sec: 42597.6, 60 sec: 43963.6, 300 sec: 43875.8). Total num frames: 3461332992. Throughput: 0: 44056.3. Samples: 3364312580. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 10:59:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:59:27,134][06909] Updated weights for policy 0, policy_version 211273 (0.0025) [2024-06-28 10:59:28,850][06674] Fps is (10 sec: 40968.2, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3461562368. Throughput: 0: 43945.2. Samples: 3364435340. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 10:59:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:59:31,338][06909] Updated weights for policy 0, policy_version 211283 (0.0029) [2024-06-28 10:59:33,850][06674] Fps is (10 sec: 47514.5, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3461808128. Throughput: 0: 43913.0. Samples: 3364708180. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 10:59:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:59:34,526][06909] Updated weights for policy 0, policy_version 211293 (0.0029) [2024-06-28 10:59:38,852][06674] Fps is (10 sec: 40951.7, 60 sec: 43689.2, 300 sec: 43875.5). Total num frames: 3461971968. Throughput: 0: 43929.6. Samples: 3364967220. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 10:59:38,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 10:59:39,051][06909] Updated weights for policy 0, policy_version 211303 (0.0040) [2024-06-28 10:59:41,996][06909] Updated weights for policy 0, policy_version 211313 (0.0030) [2024-06-28 10:59:43,850][06674] Fps is (10 sec: 42597.9, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 3462234112. Throughput: 0: 43882.2. Samples: 3365092760. Policy #0 lag: (min: 1.0, avg: 10.9, max: 22.0) [2024-06-28 10:59:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:59:46,334][06909] Updated weights for policy 0, policy_version 211323 (0.0031) [2024-06-28 10:59:47,926][06887] Signal inference workers to stop experience collection... (47600 times) [2024-06-28 10:59:47,928][06887] Signal inference workers to resume experience collection... (47600 times) [2024-06-28 10:59:47,951][06909] InferenceWorker_p0-w0: stopping experience collection (47600 times) [2024-06-28 10:59:47,981][06909] InferenceWorker_p0-w0: resuming experience collection (47600 times) [2024-06-28 10:59:48,850][06674] Fps is (10 sec: 49162.2, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3462463488. Throughput: 0: 44005.8. Samples: 3365366300. Policy #0 lag: (min: 1.0, avg: 10.9, max: 22.0) [2024-06-28 10:59:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 10:59:48,858][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000211332_3462463488.pth... [2024-06-28 10:59:48,908][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000210687_3451895808.pth [2024-06-28 10:59:49,554][06909] Updated weights for policy 0, policy_version 211333 (0.0032) [2024-06-28 10:59:53,850][06674] Fps is (10 sec: 39321.7, 60 sec: 43417.9, 300 sec: 43820.3). Total num frames: 3462627328. Throughput: 0: 44147.2. Samples: 3365632080. Policy #0 lag: (min: 1.0, avg: 10.9, max: 22.0) [2024-06-28 10:59:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 10:59:53,903][06909] Updated weights for policy 0, policy_version 211343 (0.0039) [2024-06-28 10:59:56,774][06909] Updated weights for policy 0, policy_version 211353 (0.0036) [2024-06-28 10:59:58,856][06674] Fps is (10 sec: 44210.0, 60 sec: 44505.4, 300 sec: 44041.5). Total num frames: 3462905856. Throughput: 0: 44162.5. Samples: 3365760040. Policy #0 lag: (min: 1.0, avg: 10.9, max: 22.0) [2024-06-28 10:59:58,856][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:00:01,139][06909] Updated weights for policy 0, policy_version 211363 (0.0036) [2024-06-28 11:00:03,850][06674] Fps is (10 sec: 49152.4, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 3463118848. Throughput: 0: 44080.7. Samples: 3366023200. Policy #0 lag: (min: 1.0, avg: 10.9, max: 22.0) [2024-06-28 11:00:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:00:04,597][06909] Updated weights for policy 0, policy_version 211373 (0.0035) [2024-06-28 11:00:08,447][06909] Updated weights for policy 0, policy_version 211383 (0.0030) [2024-06-28 11:00:08,850][06674] Fps is (10 sec: 39345.0, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 3463299072. Throughput: 0: 43924.1. Samples: 3366289160. Policy #0 lag: (min: 1.0, avg: 10.9, max: 22.0) [2024-06-28 11:00:08,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 11:00:11,796][06909] Updated weights for policy 0, policy_version 211393 (0.0034) [2024-06-28 11:00:13,854][06674] Fps is (10 sec: 42581.4, 60 sec: 43960.8, 300 sec: 43930.8). Total num frames: 3463544832. Throughput: 0: 44076.6. Samples: 3366418960. Policy #0 lag: (min: 1.0, avg: 10.9, max: 22.0) [2024-06-28 11:00:13,854][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:00:16,196][06909] Updated weights for policy 0, policy_version 211403 (0.0034) [2024-06-28 11:00:18,850][06674] Fps is (10 sec: 47513.6, 60 sec: 43692.1, 300 sec: 43986.9). Total num frames: 3463774208. Throughput: 0: 43850.5. Samples: 3366681460. Policy #0 lag: (min: 1.0, avg: 10.9, max: 22.0) [2024-06-28 11:00:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 11:00:19,026][06909] Updated weights for policy 0, policy_version 211413 (0.0035) [2024-06-28 11:00:23,384][06909] Updated weights for policy 0, policy_version 211423 (0.0036) [2024-06-28 11:00:23,850][06674] Fps is (10 sec: 40976.0, 60 sec: 43690.7, 300 sec: 43876.7). Total num frames: 3463954432. Throughput: 0: 44081.5. Samples: 3366950800. Policy #0 lag: (min: 1.0, avg: 10.9, max: 22.0) [2024-06-28 11:00:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:00:26,824][06909] Updated weights for policy 0, policy_version 211433 (0.0027) [2024-06-28 11:00:28,850][06674] Fps is (10 sec: 42597.7, 60 sec: 43963.6, 300 sec: 43931.3). Total num frames: 3464200192. Throughput: 0: 44101.2. Samples: 3367077320. Policy #0 lag: (min: 1.0, avg: 10.9, max: 22.0) [2024-06-28 11:00:28,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:00:30,957][06909] Updated weights for policy 0, policy_version 211443 (0.0034) [2024-06-28 11:00:33,850][06674] Fps is (10 sec: 47513.6, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 3464429568. Throughput: 0: 44092.8. Samples: 3367350480. Policy #0 lag: (min: 1.0, avg: 10.9, max: 22.0) [2024-06-28 11:00:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:00:33,985][06909] Updated weights for policy 0, policy_version 211453 (0.0020) [2024-06-28 11:00:38,166][06909] Updated weights for policy 0, policy_version 211463 (0.0027) [2024-06-28 11:00:38,850][06674] Fps is (10 sec: 44237.7, 60 sec: 44511.4, 300 sec: 43986.9). Total num frames: 3464642560. Throughput: 0: 44172.5. Samples: 3367619840. Policy #0 lag: (min: 1.0, avg: 10.9, max: 22.0) [2024-06-28 11:00:38,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:00:41,221][06909] Updated weights for policy 0, policy_version 211473 (0.0033) [2024-06-28 11:00:43,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3464871936. Throughput: 0: 44080.6. Samples: 3367743400. Policy #0 lag: (min: 1.0, avg: 10.9, max: 22.0) [2024-06-28 11:00:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:00:45,605][06909] Updated weights for policy 0, policy_version 211483 (0.0026) [2024-06-28 11:00:48,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 3465084928. Throughput: 0: 44137.3. Samples: 3368009380. Policy #0 lag: (min: 0.0, avg: 11.2, max: 23.0) [2024-06-28 11:00:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:00:48,977][06909] Updated weights for policy 0, policy_version 211493 (0.0031) [2024-06-28 11:00:53,080][06909] Updated weights for policy 0, policy_version 211503 (0.0040) [2024-06-28 11:00:53,850][06674] Fps is (10 sec: 42598.8, 60 sec: 44510.0, 300 sec: 43986.9). Total num frames: 3465297920. Throughput: 0: 44359.3. Samples: 3368285320. Policy #0 lag: (min: 0.0, avg: 11.2, max: 23.0) [2024-06-28 11:00:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:00:56,194][06909] Updated weights for policy 0, policy_version 211513 (0.0030) [2024-06-28 11:00:57,339][06887] Signal inference workers to stop experience collection... (47650 times) [2024-06-28 11:00:57,346][06887] Signal inference workers to resume experience collection... (47650 times) [2024-06-28 11:00:57,387][06909] InferenceWorker_p0-w0: stopping experience collection (47650 times) [2024-06-28 11:00:57,388][06909] InferenceWorker_p0-w0: resuming experience collection (47650 times) [2024-06-28 11:00:58,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43968.2, 300 sec: 44042.4). Total num frames: 3465543680. Throughput: 0: 44228.3. Samples: 3368409060. Policy #0 lag: (min: 0.0, avg: 11.2, max: 23.0) [2024-06-28 11:00:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:01:00,501][06909] Updated weights for policy 0, policy_version 211523 (0.0028) [2024-06-28 11:01:03,816][06909] Updated weights for policy 0, policy_version 211533 (0.0028) [2024-06-28 11:01:03,852][06674] Fps is (10 sec: 45865.2, 60 sec: 43962.2, 300 sec: 44042.1). Total num frames: 3465756672. Throughput: 0: 44280.3. Samples: 3368674160. Policy #0 lag: (min: 0.0, avg: 11.2, max: 23.0) [2024-06-28 11:01:03,853][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 11:01:08,035][06909] Updated weights for policy 0, policy_version 211543 (0.0037) [2024-06-28 11:01:08,850][06674] Fps is (10 sec: 42597.6, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 3465969664. Throughput: 0: 44168.8. Samples: 3368938400. Policy #0 lag: (min: 0.0, avg: 11.2, max: 23.0) [2024-06-28 11:01:08,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:01:11,039][06909] Updated weights for policy 0, policy_version 211553 (0.0032) [2024-06-28 11:01:13,850][06674] Fps is (10 sec: 42607.1, 60 sec: 43966.6, 300 sec: 43986.9). Total num frames: 3466182656. Throughput: 0: 44236.2. Samples: 3369067940. Policy #0 lag: (min: 0.0, avg: 11.2, max: 23.0) [2024-06-28 11:01:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:01:15,321][06909] Updated weights for policy 0, policy_version 211563 (0.0044) [2024-06-28 11:01:18,349][06909] Updated weights for policy 0, policy_version 211573 (0.0027) [2024-06-28 11:01:18,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3466412032. Throughput: 0: 44026.6. Samples: 3369331680. Policy #0 lag: (min: 0.0, avg: 11.2, max: 23.0) [2024-06-28 11:01:18,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:01:22,793][06909] Updated weights for policy 0, policy_version 211583 (0.0035) [2024-06-28 11:01:23,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44509.8, 300 sec: 44043.3). Total num frames: 3466625024. Throughput: 0: 43927.5. Samples: 3369596580. Policy #0 lag: (min: 0.0, avg: 11.2, max: 23.0) [2024-06-28 11:01:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:01:26,075][06909] Updated weights for policy 0, policy_version 211593 (0.0027) [2024-06-28 11:01:28,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 3466854400. Throughput: 0: 44124.4. Samples: 3369729000. Policy #0 lag: (min: 0.0, avg: 11.2, max: 23.0) [2024-06-28 11:01:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:01:29,992][06909] Updated weights for policy 0, policy_version 211603 (0.0037) [2024-06-28 11:01:33,316][06909] Updated weights for policy 0, policy_version 211613 (0.0027) [2024-06-28 11:01:33,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3467067392. Throughput: 0: 44152.1. Samples: 3369996220. Policy #0 lag: (min: 0.0, avg: 11.2, max: 23.0) [2024-06-28 11:01:33,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:01:37,902][06909] Updated weights for policy 0, policy_version 211623 (0.0036) [2024-06-28 11:01:38,850][06674] Fps is (10 sec: 44237.4, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 3467296768. Throughput: 0: 43850.2. Samples: 3370258580. Policy #0 lag: (min: 0.0, avg: 11.2, max: 23.0) [2024-06-28 11:01:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:01:41,176][06909] Updated weights for policy 0, policy_version 211633 (0.0022) [2024-06-28 11:01:43,850][06674] Fps is (10 sec: 44235.8, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 3467509760. Throughput: 0: 44114.9. Samples: 3370394240. Policy #0 lag: (min: 0.0, avg: 11.2, max: 23.0) [2024-06-28 11:01:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:01:45,076][06909] Updated weights for policy 0, policy_version 211643 (0.0028) [2024-06-28 11:01:48,424][06909] Updated weights for policy 0, policy_version 211653 (0.0027) [2024-06-28 11:01:48,850][06674] Fps is (10 sec: 45873.9, 60 sec: 44509.7, 300 sec: 44097.9). Total num frames: 3467755520. Throughput: 0: 43984.9. Samples: 3370653400. Policy #0 lag: (min: 0.0, avg: 11.2, max: 23.0) [2024-06-28 11:01:48,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:01:48,867][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000211655_3467755520.pth... [2024-06-28 11:01:48,931][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000211009_3457171456.pth [2024-06-28 11:01:52,442][06909] Updated weights for policy 0, policy_version 211663 (0.0029) [2024-06-28 11:01:53,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44236.6, 300 sec: 44042.4). Total num frames: 3467952128. Throughput: 0: 44024.1. Samples: 3370919480. Policy #0 lag: (min: 0.0, avg: 11.2, max: 23.0) [2024-06-28 11:01:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:01:55,706][06909] Updated weights for policy 0, policy_version 211673 (0.0025) [2024-06-28 11:01:58,850][06674] Fps is (10 sec: 40961.0, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3468165120. Throughput: 0: 44109.4. Samples: 3371052860. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 11:01:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:01:59,935][06909] Updated weights for policy 0, policy_version 211683 (0.0040) [2024-06-28 11:02:03,403][06909] Updated weights for policy 0, policy_version 211693 (0.0034) [2024-06-28 11:02:03,850][06674] Fps is (10 sec: 45875.7, 60 sec: 44238.3, 300 sec: 44042.4). Total num frames: 3468410880. Throughput: 0: 44067.7. Samples: 3371314720. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 11:02:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:02:07,188][06909] Updated weights for policy 0, policy_version 211703 (0.0044) [2024-06-28 11:02:08,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.9, 300 sec: 44042.7). Total num frames: 3468623872. Throughput: 0: 44093.9. Samples: 3371580800. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 11:02:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:02:10,770][06909] Updated weights for policy 0, policy_version 211713 (0.0035) [2024-06-28 11:02:13,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3468820480. Throughput: 0: 43994.7. Samples: 3371708760. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 11:02:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:02:14,901][06909] Updated weights for policy 0, policy_version 211723 (0.0027) [2024-06-28 11:02:18,073][06909] Updated weights for policy 0, policy_version 211733 (0.0038) [2024-06-28 11:02:18,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44510.0, 300 sec: 44098.0). Total num frames: 3469082624. Throughput: 0: 44037.3. Samples: 3371977900. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 11:02:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:02:22,209][06909] Updated weights for policy 0, policy_version 211743 (0.0030) [2024-06-28 11:02:23,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3469279232. Throughput: 0: 44058.6. Samples: 3372241220. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 11:02:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:02:25,039][06887] Signal inference workers to stop experience collection... (47700 times) [2024-06-28 11:02:25,044][06887] Signal inference workers to resume experience collection... (47700 times) [2024-06-28 11:02:25,083][06909] InferenceWorker_p0-w0: stopping experience collection (47700 times) [2024-06-28 11:02:25,083][06909] InferenceWorker_p0-w0: resuming experience collection (47700 times) [2024-06-28 11:02:25,510][06909] Updated weights for policy 0, policy_version 211753 (0.0035) [2024-06-28 11:02:28,850][06674] Fps is (10 sec: 39321.7, 60 sec: 43690.8, 300 sec: 43931.3). Total num frames: 3469475840. Throughput: 0: 43850.0. Samples: 3372367480. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 11:02:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 11:02:29,538][06909] Updated weights for policy 0, policy_version 211763 (0.0039) [2024-06-28 11:02:33,099][06909] Updated weights for policy 0, policy_version 211773 (0.0034) [2024-06-28 11:02:33,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3469721600. Throughput: 0: 43941.5. Samples: 3372630760. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 11:02:33,859][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:02:37,256][06909] Updated weights for policy 0, policy_version 211783 (0.0026) [2024-06-28 11:02:38,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3469934592. Throughput: 0: 43942.8. Samples: 3372896900. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 11:02:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:02:40,578][06909] Updated weights for policy 0, policy_version 211793 (0.0025) [2024-06-28 11:02:43,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 3470131200. Throughput: 0: 43973.6. Samples: 3373031680. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 11:02:43,851][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:02:44,656][06909] Updated weights for policy 0, policy_version 211803 (0.0038) [2024-06-28 11:02:48,002][06909] Updated weights for policy 0, policy_version 211813 (0.0033) [2024-06-28 11:02:48,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.9, 300 sec: 44098.0). Total num frames: 3470393344. Throughput: 0: 44019.6. Samples: 3373295600. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 11:02:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:02:52,209][06909] Updated weights for policy 0, policy_version 211823 (0.0038) [2024-06-28 11:02:53,850][06674] Fps is (10 sec: 45876.0, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3470589952. Throughput: 0: 43892.5. Samples: 3373555960. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 11:02:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:02:55,274][06909] Updated weights for policy 0, policy_version 211833 (0.0026) [2024-06-28 11:02:58,850][06674] Fps is (10 sec: 40959.0, 60 sec: 43963.6, 300 sec: 43931.3). Total num frames: 3470802944. Throughput: 0: 43846.0. Samples: 3373681840. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 11:02:58,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:02:59,286][06909] Updated weights for policy 0, policy_version 211843 (0.0026) [2024-06-28 11:03:03,085][06909] Updated weights for policy 0, policy_version 211853 (0.0029) [2024-06-28 11:03:03,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 3471048704. Throughput: 0: 43752.4. Samples: 3373946760. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 11:03:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:03:06,809][06909] Updated weights for policy 0, policy_version 211863 (0.0030) [2024-06-28 11:03:08,850][06674] Fps is (10 sec: 44237.9, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3471245312. Throughput: 0: 43915.2. Samples: 3374217400. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-28 11:03:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:03:10,394][06909] Updated weights for policy 0, policy_version 211873 (0.0022) [2024-06-28 11:03:13,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3471458304. Throughput: 0: 43899.6. Samples: 3374342960. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-28 11:03:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:03:14,256][06909] Updated weights for policy 0, policy_version 211883 (0.0033) [2024-06-28 11:03:17,729][06909] Updated weights for policy 0, policy_version 211893 (0.0038) [2024-06-28 11:03:18,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43690.6, 300 sec: 44097.9). Total num frames: 3471704064. Throughput: 0: 44058.7. Samples: 3374613400. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-28 11:03:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:03:21,847][06909] Updated weights for policy 0, policy_version 211903 (0.0039) [2024-06-28 11:03:23,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3471900672. Throughput: 0: 43883.5. Samples: 3374871660. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-28 11:03:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:03:25,338][06909] Updated weights for policy 0, policy_version 211913 (0.0033) [2024-06-28 11:03:28,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 3472113664. Throughput: 0: 43719.2. Samples: 3374999040. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-28 11:03:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:03:29,283][06909] Updated weights for policy 0, policy_version 211923 (0.0039) [2024-06-28 11:03:32,864][06909] Updated weights for policy 0, policy_version 211933 (0.0044) [2024-06-28 11:03:33,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 3472359424. Throughput: 0: 43710.2. Samples: 3375262560. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-28 11:03:33,854][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:03:36,513][06909] Updated weights for policy 0, policy_version 211943 (0.0035) [2024-06-28 11:03:38,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.6, 300 sec: 43986.8). Total num frames: 3472556032. Throughput: 0: 43836.8. Samples: 3375528620. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-28 11:03:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:03:40,201][06909] Updated weights for policy 0, policy_version 211953 (0.0026) [2024-06-28 11:03:43,850][06674] Fps is (10 sec: 42598.1, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 3472785408. Throughput: 0: 43913.0. Samples: 3375657920. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-28 11:03:43,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:03:44,192][06909] Updated weights for policy 0, policy_version 211963 (0.0043) [2024-06-28 11:03:47,595][06909] Updated weights for policy 0, policy_version 211973 (0.0028) [2024-06-28 11:03:48,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43417.6, 300 sec: 43987.0). Total num frames: 3472998400. Throughput: 0: 44017.0. Samples: 3375927520. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-28 11:03:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:03:48,992][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000211976_3473014784.pth... [2024-06-28 11:03:49,041][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000211332_3462463488.pth [2024-06-28 11:03:51,504][06909] Updated weights for policy 0, policy_version 211983 (0.0041) [2024-06-28 11:03:53,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3473227776. Throughput: 0: 43831.5. Samples: 3376189820. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-28 11:03:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:03:55,051][06909] Updated weights for policy 0, policy_version 211993 (0.0026) [2024-06-28 11:03:58,850][06674] Fps is (10 sec: 44235.9, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 3473440768. Throughput: 0: 44072.7. Samples: 3376326240. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-28 11:03:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:03:59,079][06909] Updated weights for policy 0, policy_version 212003 (0.0029) [2024-06-28 11:04:01,016][06887] Signal inference workers to stop experience collection... (47750 times) [2024-06-28 11:04:01,016][06887] Signal inference workers to resume experience collection... (47750 times) [2024-06-28 11:04:01,036][06909] InferenceWorker_p0-w0: stopping experience collection (47750 times) [2024-06-28 11:04:01,040][06909] InferenceWorker_p0-w0: resuming experience collection (47750 times) [2024-06-28 11:04:02,683][06909] Updated weights for policy 0, policy_version 212013 (0.0036) [2024-06-28 11:04:03,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 3473670144. Throughput: 0: 43973.4. Samples: 3376592200. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-28 11:04:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:04:06,281][06909] Updated weights for policy 0, policy_version 212023 (0.0030) [2024-06-28 11:04:08,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 3473899520. Throughput: 0: 44015.4. Samples: 3376852360. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-28 11:04:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:04:09,979][06909] Updated weights for policy 0, policy_version 212033 (0.0030) [2024-06-28 11:04:13,491][06909] Updated weights for policy 0, policy_version 212043 (0.0034) [2024-06-28 11:04:13,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.8, 300 sec: 43931.6). Total num frames: 3474112512. Throughput: 0: 44193.8. Samples: 3376987760. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 11:04:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:04:17,342][06909] Updated weights for policy 0, policy_version 212053 (0.0029) [2024-06-28 11:04:18,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 3474341888. Throughput: 0: 44132.9. Samples: 3377248540. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 11:04:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:04:21,206][06909] Updated weights for policy 0, policy_version 212063 (0.0039) [2024-06-28 11:04:23,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44509.8, 300 sec: 44098.0). Total num frames: 3474571264. Throughput: 0: 44098.2. Samples: 3377513040. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 11:04:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:04:24,624][06909] Updated weights for policy 0, policy_version 212073 (0.0039) [2024-06-28 11:04:28,494][06909] Updated weights for policy 0, policy_version 212083 (0.0033) [2024-06-28 11:04:28,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 3474767872. Throughput: 0: 44265.4. Samples: 3377649860. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 11:04:28,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:04:32,644][06909] Updated weights for policy 0, policy_version 212093 (0.0023) [2024-06-28 11:04:33,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43963.6, 300 sec: 44153.8). Total num frames: 3474997248. Throughput: 0: 44137.1. Samples: 3377913700. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 11:04:33,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:04:36,048][06909] Updated weights for policy 0, policy_version 212103 (0.0037) [2024-06-28 11:04:38,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3475210240. Throughput: 0: 44074.6. Samples: 3378173180. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 11:04:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:04:39,850][06909] Updated weights for policy 0, policy_version 212113 (0.0026) [2024-06-28 11:04:43,507][06909] Updated weights for policy 0, policy_version 212123 (0.0026) [2024-06-28 11:04:43,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 3475423232. Throughput: 0: 44015.2. Samples: 3378306920. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 11:04:43,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:04:47,050][06909] Updated weights for policy 0, policy_version 212133 (0.0033) [2024-06-28 11:04:48,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 3475652608. Throughput: 0: 44036.8. Samples: 3378573860. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 11:04:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:04:50,932][06909] Updated weights for policy 0, policy_version 212143 (0.0026) [2024-06-28 11:04:53,850][06674] Fps is (10 sec: 47514.0, 60 sec: 44509.9, 300 sec: 44043.3). Total num frames: 3475898368. Throughput: 0: 43948.1. Samples: 3378830020. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 11:04:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:04:54,380][06909] Updated weights for policy 0, policy_version 212153 (0.0040) [2024-06-28 11:04:58,466][06909] Updated weights for policy 0, policy_version 212163 (0.0024) [2024-06-28 11:04:58,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3476094976. Throughput: 0: 44009.2. Samples: 3378968180. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 11:04:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:05:02,165][06909] Updated weights for policy 0, policy_version 212173 (0.0034) [2024-06-28 11:05:03,850][06674] Fps is (10 sec: 40959.3, 60 sec: 43963.6, 300 sec: 44097.9). Total num frames: 3476307968. Throughput: 0: 44256.4. Samples: 3379240080. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 11:05:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:05:05,746][06909] Updated weights for policy 0, policy_version 212183 (0.0029) [2024-06-28 11:05:08,850][06674] Fps is (10 sec: 45875.8, 60 sec: 44236.9, 300 sec: 44098.5). Total num frames: 3476553728. Throughput: 0: 44141.8. Samples: 3379499420. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 11:05:08,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:05:09,421][06909] Updated weights for policy 0, policy_version 212193 (0.0029) [2024-06-28 11:05:13,385][06909] Updated weights for policy 0, policy_version 212203 (0.0024) [2024-06-28 11:05:13,850][06674] Fps is (10 sec: 45876.1, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3476766720. Throughput: 0: 44209.9. Samples: 3379639300. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 11:05:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:05:16,706][06909] Updated weights for policy 0, policy_version 212213 (0.0036) [2024-06-28 11:05:18,856][06674] Fps is (10 sec: 40935.2, 60 sec: 43686.3, 300 sec: 44097.1). Total num frames: 3476963328. Throughput: 0: 44016.0. Samples: 3379894680. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 11:05:18,856][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 11:05:20,833][06909] Updated weights for policy 0, policy_version 212223 (0.0031) [2024-06-28 11:05:23,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 3477209088. Throughput: 0: 43942.8. Samples: 3380150600. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 11:05:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:05:24,346][06909] Updated weights for policy 0, policy_version 212233 (0.0043) [2024-06-28 11:05:28,249][06909] Updated weights for policy 0, policy_version 212243 (0.0026) [2024-06-28 11:05:28,850][06674] Fps is (10 sec: 42624.6, 60 sec: 43690.8, 300 sec: 43931.4). Total num frames: 3477389312. Throughput: 0: 44120.6. Samples: 3380292340. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 11:05:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:05:30,361][06887] Signal inference workers to stop experience collection... (47800 times) [2024-06-28 11:05:30,361][06887] Signal inference workers to resume experience collection... (47800 times) [2024-06-28 11:05:30,408][06909] InferenceWorker_p0-w0: stopping experience collection (47800 times) [2024-06-28 11:05:30,408][06909] InferenceWorker_p0-w0: resuming experience collection (47800 times) [2024-06-28 11:05:31,727][06909] Updated weights for policy 0, policy_version 212253 (0.0038) [2024-06-28 11:05:33,850][06674] Fps is (10 sec: 39321.0, 60 sec: 43417.6, 300 sec: 43931.3). Total num frames: 3477602304. Throughput: 0: 43868.8. Samples: 3380547960. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 11:05:33,851][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 11:05:35,756][06909] Updated weights for policy 0, policy_version 212263 (0.0031) [2024-06-28 11:05:38,850][06674] Fps is (10 sec: 47512.9, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 3477864448. Throughput: 0: 43903.5. Samples: 3380805680. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 11:05:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:05:38,975][06909] Updated weights for policy 0, policy_version 212273 (0.0042) [2024-06-28 11:05:43,255][06909] Updated weights for policy 0, policy_version 212283 (0.0027) [2024-06-28 11:05:43,850][06674] Fps is (10 sec: 45875.8, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3478061056. Throughput: 0: 44047.2. Samples: 3380950300. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 11:05:43,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:05:46,677][06909] Updated weights for policy 0, policy_version 212293 (0.0031) [2024-06-28 11:05:48,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3478274048. Throughput: 0: 43781.5. Samples: 3381210240. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 11:05:48,850][06674] Avg episode reward: [(0, '0.429')] [2024-06-28 11:05:48,872][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000212297_3478274048.pth... [2024-06-28 11:05:48,928][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000211655_3467755520.pth [2024-06-28 11:05:50,533][06909] Updated weights for policy 0, policy_version 212303 (0.0032) [2024-06-28 11:05:53,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3478519808. Throughput: 0: 43827.6. Samples: 3381471660. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 11:05:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:05:54,211][06909] Updated weights for policy 0, policy_version 212313 (0.0033) [2024-06-28 11:05:58,079][06909] Updated weights for policy 0, policy_version 212323 (0.0032) [2024-06-28 11:05:58,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.8, 300 sec: 43931.6). Total num frames: 3478716416. Throughput: 0: 43778.7. Samples: 3381609340. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 11:05:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:06:01,607][06909] Updated weights for policy 0, policy_version 212333 (0.0028) [2024-06-28 11:06:03,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.8, 300 sec: 43931.4). Total num frames: 3478929408. Throughput: 0: 43859.2. Samples: 3381868080. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 11:06:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:06:05,341][06909] Updated weights for policy 0, policy_version 212343 (0.0035) [2024-06-28 11:06:08,849][06909] Updated weights for policy 0, policy_version 212353 (0.0024) [2024-06-28 11:06:08,850][06674] Fps is (10 sec: 47513.3, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 3479191552. Throughput: 0: 44043.1. Samples: 3382132540. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 11:06:08,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:06:12,640][06909] Updated weights for policy 0, policy_version 212363 (0.0038) [2024-06-28 11:06:13,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 3479388160. Throughput: 0: 44198.1. Samples: 3382281260. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 11:06:13,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:06:16,211][06909] Updated weights for policy 0, policy_version 212373 (0.0032) [2024-06-28 11:06:18,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43968.1, 300 sec: 43986.9). Total num frames: 3479601152. Throughput: 0: 44190.3. Samples: 3382536520. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 11:06:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 11:06:20,124][06909] Updated weights for policy 0, policy_version 212383 (0.0036) [2024-06-28 11:06:23,554][06909] Updated weights for policy 0, policy_version 212393 (0.0029) [2024-06-28 11:06:23,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3479846912. Throughput: 0: 44323.7. Samples: 3382800240. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 11:06:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:06:27,810][06909] Updated weights for policy 0, policy_version 212403 (0.0033) [2024-06-28 11:06:28,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44509.7, 300 sec: 44042.4). Total num frames: 3480059904. Throughput: 0: 44194.7. Samples: 3382939060. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 11:06:28,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:06:31,198][06909] Updated weights for policy 0, policy_version 212413 (0.0026) [2024-06-28 11:06:33,850][06674] Fps is (10 sec: 40959.6, 60 sec: 44236.9, 300 sec: 43931.3). Total num frames: 3480256512. Throughput: 0: 44088.0. Samples: 3383194200. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 11:06:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:06:35,131][06909] Updated weights for policy 0, policy_version 212423 (0.0026) [2024-06-28 11:06:38,658][06909] Updated weights for policy 0, policy_version 212433 (0.0044) [2024-06-28 11:06:38,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3480502272. Throughput: 0: 43973.4. Samples: 3383450460. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 11:06:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:06:43,002][06909] Updated weights for policy 0, policy_version 212443 (0.0026) [2024-06-28 11:06:43,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 3480698880. Throughput: 0: 44005.3. Samples: 3383589580. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 11:06:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:06:45,917][06909] Updated weights for policy 0, policy_version 212453 (0.0032) [2024-06-28 11:06:48,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 3480911872. Throughput: 0: 44197.3. Samples: 3383856960. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 11:06:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:06:49,451][06887] Signal inference workers to stop experience collection... (47850 times) [2024-06-28 11:06:49,484][06909] InferenceWorker_p0-w0: stopping experience collection (47850 times) [2024-06-28 11:06:49,562][06887] Signal inference workers to resume experience collection... (47850 times) [2024-06-28 11:06:49,562][06909] InferenceWorker_p0-w0: resuming experience collection (47850 times) [2024-06-28 11:06:50,370][06909] Updated weights for policy 0, policy_version 212463 (0.0022) [2024-06-28 11:06:53,081][06909] Updated weights for policy 0, policy_version 212473 (0.0031) [2024-06-28 11:06:53,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 3481174016. Throughput: 0: 44111.2. Samples: 3384117540. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 11:06:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:06:57,535][06909] Updated weights for policy 0, policy_version 212483 (0.0027) [2024-06-28 11:06:58,850][06674] Fps is (10 sec: 47514.1, 60 sec: 44509.9, 300 sec: 43986.9). Total num frames: 3481387008. Throughput: 0: 43940.1. Samples: 3384258560. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 11:06:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:07:00,699][06909] Updated weights for policy 0, policy_version 212493 (0.0037) [2024-06-28 11:07:03,856][06674] Fps is (10 sec: 40935.0, 60 sec: 44232.3, 300 sec: 43930.4). Total num frames: 3481583616. Throughput: 0: 44124.3. Samples: 3384522380. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 11:07:03,856][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 11:07:05,621][06909] Updated weights for policy 0, policy_version 212503 (0.0040) [2024-06-28 11:07:08,343][06909] Updated weights for policy 0, policy_version 212513 (0.0029) [2024-06-28 11:07:08,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 3481829376. Throughput: 0: 43896.3. Samples: 3384775580. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 11:07:08,850][06674] Avg episode reward: [(0, '0.429')] [2024-06-28 11:07:12,959][06909] Updated weights for policy 0, policy_version 212523 (0.0035) [2024-06-28 11:07:13,850][06674] Fps is (10 sec: 44264.1, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 3482025984. Throughput: 0: 43995.7. Samples: 3384918860. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 11:07:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:07:15,831][06909] Updated weights for policy 0, policy_version 212533 (0.0022) [2024-06-28 11:07:18,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 3482238976. Throughput: 0: 44086.2. Samples: 3385178080. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 11:07:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:07:20,347][06909] Updated weights for policy 0, policy_version 212543 (0.0030) [2024-06-28 11:07:23,189][06909] Updated weights for policy 0, policy_version 212553 (0.0023) [2024-06-28 11:07:23,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 3482484736. Throughput: 0: 43995.0. Samples: 3385430240. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 11:07:23,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:07:27,542][06909] Updated weights for policy 0, policy_version 212563 (0.0036) [2024-06-28 11:07:28,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3482697728. Throughput: 0: 44172.9. Samples: 3385577360. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 11:07:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 11:07:30,377][06909] Updated weights for policy 0, policy_version 212573 (0.0035) [2024-06-28 11:07:33,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 3482894336. Throughput: 0: 44106.2. Samples: 3385841740. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 11:07:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:07:35,091][06909] Updated weights for policy 0, policy_version 212583 (0.0035) [2024-06-28 11:07:38,390][06909] Updated weights for policy 0, policy_version 212593 (0.0026) [2024-06-28 11:07:38,850][06674] Fps is (10 sec: 45874.4, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 3483156480. Throughput: 0: 43975.9. Samples: 3386096460. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 11:07:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:07:42,446][06909] Updated weights for policy 0, policy_version 212603 (0.0039) [2024-06-28 11:07:43,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 3483353088. Throughput: 0: 43795.5. Samples: 3386229360. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 11:07:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:07:45,690][06909] Updated weights for policy 0, policy_version 212613 (0.0036) [2024-06-28 11:07:48,850][06674] Fps is (10 sec: 40960.1, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 3483566080. Throughput: 0: 43809.4. Samples: 3386493540. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 11:07:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:07:48,856][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000212620_3483566080.pth... [2024-06-28 11:07:48,902][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000211976_3473014784.pth [2024-06-28 11:07:50,346][06909] Updated weights for policy 0, policy_version 212623 (0.0040) [2024-06-28 11:07:53,047][06909] Updated weights for policy 0, policy_version 212633 (0.0022) [2024-06-28 11:07:53,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.7, 300 sec: 44042.5). Total num frames: 3483795456. Throughput: 0: 43854.3. Samples: 3386749020. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 11:07:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:07:57,925][06909] Updated weights for policy 0, policy_version 212643 (0.0040) [2024-06-28 11:07:58,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 3484008448. Throughput: 0: 43807.9. Samples: 3386890220. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 11:07:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:07:59,064][06887] Signal inference workers to stop experience collection... (47900 times) [2024-06-28 11:07:59,071][06887] Signal inference workers to resume experience collection... (47900 times) [2024-06-28 11:07:59,116][06909] InferenceWorker_p0-w0: stopping experience collection (47900 times) [2024-06-28 11:07:59,116][06909] InferenceWorker_p0-w0: resuming experience collection (47900 times) [2024-06-28 11:08:00,323][06909] Updated weights for policy 0, policy_version 212653 (0.0025) [2024-06-28 11:08:03,852][06674] Fps is (10 sec: 42589.4, 60 sec: 43966.7, 300 sec: 43986.6). Total num frames: 3484221440. Throughput: 0: 43858.9. Samples: 3387151820. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 11:08:03,852][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:08:05,178][06909] Updated weights for policy 0, policy_version 212663 (0.0029) [2024-06-28 11:08:07,566][06909] Updated weights for policy 0, policy_version 212673 (0.0023) [2024-06-28 11:08:08,853][06674] Fps is (10 sec: 44221.5, 60 sec: 43688.2, 300 sec: 44041.9). Total num frames: 3484450816. Throughput: 0: 44101.5. Samples: 3387414960. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 11:08:08,854][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:08:12,736][06909] Updated weights for policy 0, policy_version 212683 (0.0032) [2024-06-28 11:08:13,850][06674] Fps is (10 sec: 45885.0, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3484680192. Throughput: 0: 43920.9. Samples: 3387553800. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 11:08:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:08:15,682][06909] Updated weights for policy 0, policy_version 212693 (0.0030) [2024-06-28 11:08:18,852][06674] Fps is (10 sec: 42604.5, 60 sec: 43962.3, 300 sec: 43986.6). Total num frames: 3484876800. Throughput: 0: 43760.7. Samples: 3387811060. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 11:08:18,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:08:20,054][06909] Updated weights for policy 0, policy_version 212703 (0.0034) [2024-06-28 11:08:22,863][06909] Updated weights for policy 0, policy_version 212713 (0.0029) [2024-06-28 11:08:23,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 3485122560. Throughput: 0: 44042.8. Samples: 3388078380. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 11:08:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:08:27,360][06909] Updated weights for policy 0, policy_version 212723 (0.0036) [2024-06-28 11:08:28,850][06674] Fps is (10 sec: 49161.1, 60 sec: 44509.7, 300 sec: 44097.9). Total num frames: 3485368320. Throughput: 0: 44086.0. Samples: 3388213240. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 11:08:28,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:08:30,373][06909] Updated weights for policy 0, policy_version 212733 (0.0029) [2024-06-28 11:08:33,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3485532160. Throughput: 0: 44029.9. Samples: 3388474880. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 11:08:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:08:34,986][06909] Updated weights for policy 0, policy_version 212743 (0.0033) [2024-06-28 11:08:37,561][06909] Updated weights for policy 0, policy_version 212753 (0.0031) [2024-06-28 11:08:38,850][06674] Fps is (10 sec: 40960.6, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 3485777920. Throughput: 0: 44258.6. Samples: 3388740660. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 11:08:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:08:42,182][06909] Updated weights for policy 0, policy_version 212763 (0.0023) [2024-06-28 11:08:43,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 3486007296. Throughput: 0: 44096.0. Samples: 3388874540. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 11:08:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:08:44,807][06909] Updated weights for policy 0, policy_version 212773 (0.0023) [2024-06-28 11:08:48,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3486203904. Throughput: 0: 44080.7. Samples: 3389135360. Policy #0 lag: (min: 0.0, avg: 13.3, max: 27.0) [2024-06-28 11:08:48,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:08:49,724][06909] Updated weights for policy 0, policy_version 212783 (0.0038) [2024-06-28 11:08:52,716][06909] Updated weights for policy 0, policy_version 212793 (0.0051) [2024-06-28 11:08:53,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3486433280. Throughput: 0: 44053.6. Samples: 3389397220. Policy #0 lag: (min: 0.0, avg: 13.3, max: 27.0) [2024-06-28 11:08:53,850][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 11:08:56,904][06909] Updated weights for policy 0, policy_version 212803 (0.0029) [2024-06-28 11:08:58,172][06887] Signal inference workers to stop experience collection... (47950 times) [2024-06-28 11:08:58,173][06887] Signal inference workers to resume experience collection... (47950 times) [2024-06-28 11:08:58,207][06909] InferenceWorker_p0-w0: stopping experience collection (47950 times) [2024-06-28 11:08:58,207][06909] InferenceWorker_p0-w0: resuming experience collection (47950 times) [2024-06-28 11:08:58,850][06674] Fps is (10 sec: 47512.9, 60 sec: 44509.8, 300 sec: 44097.9). Total num frames: 3486679040. Throughput: 0: 43930.5. Samples: 3389530680. Policy #0 lag: (min: 0.0, avg: 13.3, max: 27.0) [2024-06-28 11:08:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:09:00,287][06909] Updated weights for policy 0, policy_version 212813 (0.0029) [2024-06-28 11:09:03,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43965.2, 300 sec: 43931.3). Total num frames: 3486859264. Throughput: 0: 44084.6. Samples: 3389794780. Policy #0 lag: (min: 0.0, avg: 13.3, max: 27.0) [2024-06-28 11:09:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:09:04,438][06909] Updated weights for policy 0, policy_version 212823 (0.0042) [2024-06-28 11:09:07,418][06909] Updated weights for policy 0, policy_version 212833 (0.0025) [2024-06-28 11:09:08,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43966.2, 300 sec: 43986.9). Total num frames: 3487088640. Throughput: 0: 43929.3. Samples: 3390055200. Policy #0 lag: (min: 0.0, avg: 13.3, max: 27.0) [2024-06-28 11:09:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:09:11,964][06909] Updated weights for policy 0, policy_version 212843 (0.0039) [2024-06-28 11:09:13,850][06674] Fps is (10 sec: 47513.3, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 3487334400. Throughput: 0: 44065.0. Samples: 3390196160. Policy #0 lag: (min: 0.0, avg: 13.3, max: 27.0) [2024-06-28 11:09:13,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 11:09:15,315][06909] Updated weights for policy 0, policy_version 212853 (0.0040) [2024-06-28 11:09:18,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44238.2, 300 sec: 43931.3). Total num frames: 3487531008. Throughput: 0: 44323.0. Samples: 3390469420. Policy #0 lag: (min: 0.0, avg: 13.3, max: 27.0) [2024-06-28 11:09:18,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:09:19,073][06909] Updated weights for policy 0, policy_version 212863 (0.0032) [2024-06-28 11:09:22,458][06909] Updated weights for policy 0, policy_version 212873 (0.0030) [2024-06-28 11:09:23,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3487744000. Throughput: 0: 44208.1. Samples: 3390730020. Policy #0 lag: (min: 0.0, avg: 13.3, max: 27.0) [2024-06-28 11:09:23,850][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 11:09:26,559][06909] Updated weights for policy 0, policy_version 212883 (0.0027) [2024-06-28 11:09:28,852][06674] Fps is (10 sec: 47504.2, 60 sec: 43962.4, 300 sec: 44097.7). Total num frames: 3488006144. Throughput: 0: 44096.2. Samples: 3390858960. Policy #0 lag: (min: 0.0, avg: 13.3, max: 27.0) [2024-06-28 11:09:28,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:09:29,903][06909] Updated weights for policy 0, policy_version 212893 (0.0026) [2024-06-28 11:09:33,756][06909] Updated weights for policy 0, policy_version 212903 (0.0034) [2024-06-28 11:09:33,850][06674] Fps is (10 sec: 45874.6, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 3488202752. Throughput: 0: 44262.1. Samples: 3391127160. Policy #0 lag: (min: 0.0, avg: 13.3, max: 27.0) [2024-06-28 11:09:33,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:09:37,042][06909] Updated weights for policy 0, policy_version 212913 (0.0027) [2024-06-28 11:09:38,852][06674] Fps is (10 sec: 40960.2, 60 sec: 43962.3, 300 sec: 44042.1). Total num frames: 3488415744. Throughput: 0: 44286.9. Samples: 3391390220. Policy #0 lag: (min: 0.0, avg: 13.3, max: 27.0) [2024-06-28 11:09:38,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:09:41,267][06909] Updated weights for policy 0, policy_version 212923 (0.0033) [2024-06-28 11:09:43,850][06674] Fps is (10 sec: 45875.9, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 3488661504. Throughput: 0: 44259.7. Samples: 3391522360. Policy #0 lag: (min: 0.0, avg: 13.3, max: 27.0) [2024-06-28 11:09:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:09:44,589][06909] Updated weights for policy 0, policy_version 212933 (0.0025) [2024-06-28 11:09:48,643][06909] Updated weights for policy 0, policy_version 212943 (0.0036) [2024-06-28 11:09:48,850][06674] Fps is (10 sec: 44246.0, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 3488858112. Throughput: 0: 44345.4. Samples: 3391790320. Policy #0 lag: (min: 0.0, avg: 13.3, max: 27.0) [2024-06-28 11:09:48,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:09:48,944][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000212944_3488874496.pth... [2024-06-28 11:09:49,014][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000212297_3478274048.pth [2024-06-28 11:09:52,336][06909] Updated weights for policy 0, policy_version 212953 (0.0041) [2024-06-28 11:09:53,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3489071104. Throughput: 0: 44521.0. Samples: 3392058640. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 11:09:53,850][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 11:09:56,181][06909] Updated weights for policy 0, policy_version 212963 (0.0041) [2024-06-28 11:09:58,850][06674] Fps is (10 sec: 47512.9, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 3489333248. Throughput: 0: 44195.1. Samples: 3392184940. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 11:09:58,853][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:09:59,595][06909] Updated weights for policy 0, policy_version 212973 (0.0035) [2024-06-28 11:10:03,559][06909] Updated weights for policy 0, policy_version 212983 (0.0041) [2024-06-28 11:10:03,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 3489513472. Throughput: 0: 43919.6. Samples: 3392445800. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 11:10:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:10:07,036][06909] Updated weights for policy 0, policy_version 212993 (0.0034) [2024-06-28 11:10:08,850][06674] Fps is (10 sec: 39322.0, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 3489726464. Throughput: 0: 44139.1. Samples: 3392716280. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 11:10:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:10:10,791][06909] Updated weights for policy 0, policy_version 213003 (0.0045) [2024-06-28 11:10:13,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43963.8, 300 sec: 44098.9). Total num frames: 3489972224. Throughput: 0: 44070.0. Samples: 3392842020. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 11:10:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:10:14,261][06909] Updated weights for policy 0, policy_version 213013 (0.0035) [2024-06-28 11:10:18,448][06909] Updated weights for policy 0, policy_version 213023 (0.0043) [2024-06-28 11:10:18,852][06674] Fps is (10 sec: 44227.5, 60 sec: 43962.3, 300 sec: 43931.0). Total num frames: 3490168832. Throughput: 0: 44121.2. Samples: 3393112700. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 11:10:18,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:10:19,316][06887] Signal inference workers to stop experience collection... (48000 times) [2024-06-28 11:10:19,316][06887] Signal inference workers to resume experience collection... (48000 times) [2024-06-28 11:10:19,362][06909] InferenceWorker_p0-w0: stopping experience collection (48000 times) [2024-06-28 11:10:19,363][06909] InferenceWorker_p0-w0: resuming experience collection (48000 times) [2024-06-28 11:10:22,175][06909] Updated weights for policy 0, policy_version 213033 (0.0029) [2024-06-28 11:10:23,850][06674] Fps is (10 sec: 40959.4, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 3490381824. Throughput: 0: 44041.0. Samples: 3393371980. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 11:10:23,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:10:25,802][06909] Updated weights for policy 0, policy_version 213043 (0.0035) [2024-06-28 11:10:28,850][06674] Fps is (10 sec: 47523.3, 60 sec: 43965.2, 300 sec: 44209.1). Total num frames: 3490643968. Throughput: 0: 43935.9. Samples: 3393499480. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 11:10:28,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:10:29,774][06909] Updated weights for policy 0, policy_version 213053 (0.0032) [2024-06-28 11:10:33,253][06909] Updated weights for policy 0, policy_version 213063 (0.0037) [2024-06-28 11:10:33,850][06674] Fps is (10 sec: 47513.9, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3490856960. Throughput: 0: 44083.4. Samples: 3393774080. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 11:10:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:10:37,052][06909] Updated weights for policy 0, policy_version 213073 (0.0027) [2024-06-28 11:10:38,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43965.2, 300 sec: 44042.4). Total num frames: 3491053568. Throughput: 0: 43921.7. Samples: 3394035120. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 11:10:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:10:40,514][06909] Updated weights for policy 0, policy_version 213083 (0.0036) [2024-06-28 11:10:43,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 3491282944. Throughput: 0: 43973.0. Samples: 3394163720. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 11:10:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:10:44,262][06909] Updated weights for policy 0, policy_version 213093 (0.0033) [2024-06-28 11:10:47,993][06909] Updated weights for policy 0, policy_version 213103 (0.0027) [2024-06-28 11:10:48,850][06674] Fps is (10 sec: 45875.9, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3491512320. Throughput: 0: 44188.1. Samples: 3394434260. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 11:10:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:10:51,841][06909] Updated weights for policy 0, policy_version 213113 (0.0034) [2024-06-28 11:10:53,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3491708928. Throughput: 0: 44083.1. Samples: 3394700020. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 11:10:53,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:10:55,431][06909] Updated weights for policy 0, policy_version 213123 (0.0024) [2024-06-28 11:10:58,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 3491954688. Throughput: 0: 43987.0. Samples: 3394821440. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 11:10:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:10:59,171][06909] Updated weights for policy 0, policy_version 213133 (0.0034) [2024-06-28 11:11:02,775][06909] Updated weights for policy 0, policy_version 213143 (0.0033) [2024-06-28 11:11:03,850][06674] Fps is (10 sec: 49152.1, 60 sec: 44783.0, 300 sec: 44098.0). Total num frames: 3492200448. Throughput: 0: 44156.7. Samples: 3395099660. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 11:11:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:11:06,715][06909] Updated weights for policy 0, policy_version 213153 (0.0045) [2024-06-28 11:11:08,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3492364288. Throughput: 0: 44156.5. Samples: 3395359020. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 11:11:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:11:10,212][06909] Updated weights for policy 0, policy_version 213163 (0.0033) [2024-06-28 11:11:13,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 3492610048. Throughput: 0: 44150.7. Samples: 3395486260. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 11:11:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:11:14,262][06909] Updated weights for policy 0, policy_version 213173 (0.0027) [2024-06-28 11:11:17,680][06909] Updated weights for policy 0, policy_version 213183 (0.0039) [2024-06-28 11:11:18,850][06674] Fps is (10 sec: 47513.8, 60 sec: 44511.4, 300 sec: 44042.4). Total num frames: 3492839424. Throughput: 0: 44007.6. Samples: 3395754420. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 11:11:18,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:11:22,160][06909] Updated weights for policy 0, policy_version 213193 (0.0043) [2024-06-28 11:11:23,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 3493019648. Throughput: 0: 43971.1. Samples: 3396013820. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 11:11:23,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:11:25,271][06909] Updated weights for policy 0, policy_version 213203 (0.0030) [2024-06-28 11:11:28,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.6, 300 sec: 44097.9). Total num frames: 3493265408. Throughput: 0: 43874.1. Samples: 3396138060. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 11:11:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:11:29,439][06909] Updated weights for policy 0, policy_version 213213 (0.0027) [2024-06-28 11:11:32,836][06909] Updated weights for policy 0, policy_version 213223 (0.0023) [2024-06-28 11:11:33,850][06674] Fps is (10 sec: 50790.4, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 3493527552. Throughput: 0: 43899.4. Samples: 3396409740. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 11:11:33,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:11:36,981][06909] Updated weights for policy 0, policy_version 213233 (0.0020) [2024-06-28 11:11:38,853][06674] Fps is (10 sec: 42585.5, 60 sec: 43961.5, 300 sec: 44041.9). Total num frames: 3493691392. Throughput: 0: 44063.6. Samples: 3396683020. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 11:11:38,853][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:11:40,244][06887] Signal inference workers to stop experience collection... (48050 times) [2024-06-28 11:11:40,245][06887] Signal inference workers to resume experience collection... (48050 times) [2024-06-28 11:11:40,269][06909] Updated weights for policy 0, policy_version 213243 (0.0038) [2024-06-28 11:11:40,296][06909] InferenceWorker_p0-w0: stopping experience collection (48050 times) [2024-06-28 11:11:40,296][06909] InferenceWorker_p0-w0: resuming experience collection (48050 times) [2024-06-28 11:11:43,850][06674] Fps is (10 sec: 39322.1, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 3493920768. Throughput: 0: 44090.3. Samples: 3396805500. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 11:11:43,850][06674] Avg episode reward: [(0, '0.475')] [2024-06-28 11:11:44,353][06909] Updated weights for policy 0, policy_version 213253 (0.0031) [2024-06-28 11:11:47,845][06909] Updated weights for policy 0, policy_version 213263 (0.0033) [2024-06-28 11:11:48,851][06674] Fps is (10 sec: 47523.2, 60 sec: 44236.0, 300 sec: 44042.3). Total num frames: 3494166528. Throughput: 0: 43829.6. Samples: 3397072040. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 11:11:48,851][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:11:48,990][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000213268_3494182912.pth... [2024-06-28 11:11:49,056][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000212620_3483566080.pth [2024-06-28 11:11:51,574][06909] Updated weights for policy 0, policy_version 213273 (0.0038) [2024-06-28 11:11:53,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 3494346752. Throughput: 0: 43883.3. Samples: 3397333760. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 11:11:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:11:55,189][06909] Updated weights for policy 0, policy_version 213283 (0.0037) [2024-06-28 11:11:58,850][06674] Fps is (10 sec: 40964.7, 60 sec: 43690.8, 300 sec: 44043.3). Total num frames: 3494576128. Throughput: 0: 43826.7. Samples: 3397458460. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 11:11:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:11:58,953][06909] Updated weights for policy 0, policy_version 213293 (0.0035) [2024-06-28 11:12:02,443][06909] Updated weights for policy 0, policy_version 213303 (0.0043) [2024-06-28 11:12:03,852][06674] Fps is (10 sec: 49141.6, 60 sec: 43962.2, 300 sec: 44097.7). Total num frames: 3494838272. Throughput: 0: 43882.1. Samples: 3397729200. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 11:12:03,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:12:06,689][06909] Updated weights for policy 0, policy_version 213313 (0.0031) [2024-06-28 11:12:08,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 3495018496. Throughput: 0: 44167.7. Samples: 3398001360. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2024-06-28 11:12:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:12:10,055][06909] Updated weights for policy 0, policy_version 213323 (0.0034) [2024-06-28 11:12:13,850][06674] Fps is (10 sec: 39329.4, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 3495231488. Throughput: 0: 44056.0. Samples: 3398120580. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2024-06-28 11:12:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:12:14,186][06909] Updated weights for policy 0, policy_version 213333 (0.0049) [2024-06-28 11:12:17,477][06909] Updated weights for policy 0, policy_version 213343 (0.0038) [2024-06-28 11:12:18,850][06674] Fps is (10 sec: 47513.0, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 3495493632. Throughput: 0: 43994.7. Samples: 3398389500. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2024-06-28 11:12:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:12:21,354][06909] Updated weights for policy 0, policy_version 213353 (0.0029) [2024-06-28 11:12:23,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 3495673856. Throughput: 0: 43836.4. Samples: 3398655520. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2024-06-28 11:12:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:12:25,136][06909] Updated weights for policy 0, policy_version 213363 (0.0044) [2024-06-28 11:12:28,687][06909] Updated weights for policy 0, policy_version 213373 (0.0027) [2024-06-28 11:12:28,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 3495903232. Throughput: 0: 43768.0. Samples: 3398775060. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2024-06-28 11:12:28,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:12:32,514][06909] Updated weights for policy 0, policy_version 213383 (0.0037) [2024-06-28 11:12:33,850][06674] Fps is (10 sec: 49151.7, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 3496165376. Throughput: 0: 43894.4. Samples: 3399047240. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2024-06-28 11:12:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:12:36,018][06909] Updated weights for policy 0, policy_version 213393 (0.0035) [2024-06-28 11:12:38,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44239.0, 300 sec: 44042.4). Total num frames: 3496345600. Throughput: 0: 44111.4. Samples: 3399318780. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2024-06-28 11:12:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:12:39,682][06909] Updated weights for policy 0, policy_version 213403 (0.0027) [2024-06-28 11:12:43,754][06909] Updated weights for policy 0, policy_version 213413 (0.0036) [2024-06-28 11:12:43,850][06674] Fps is (10 sec: 39321.7, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3496558592. Throughput: 0: 43972.4. Samples: 3399437220. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2024-06-28 11:12:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:12:46,246][06887] Signal inference workers to stop experience collection... (48100 times) [2024-06-28 11:12:46,246][06887] Signal inference workers to resume experience collection... (48100 times) [2024-06-28 11:12:46,295][06909] InferenceWorker_p0-w0: stopping experience collection (48100 times) [2024-06-28 11:12:46,295][06909] InferenceWorker_p0-w0: resuming experience collection (48100 times) [2024-06-28 11:12:47,333][06909] Updated weights for policy 0, policy_version 213423 (0.0042) [2024-06-28 11:12:48,856][06674] Fps is (10 sec: 45849.1, 60 sec: 43960.3, 300 sec: 44097.1). Total num frames: 3496804352. Throughput: 0: 44051.9. Samples: 3399711700. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2024-06-28 11:12:48,856][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 11:12:51,116][06909] Updated weights for policy 0, policy_version 213433 (0.0033) [2024-06-28 11:12:53,852][06674] Fps is (10 sec: 44227.8, 60 sec: 44235.2, 300 sec: 44042.1). Total num frames: 3497000960. Throughput: 0: 43952.6. Samples: 3399979320. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2024-06-28 11:12:53,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:12:54,509][06909] Updated weights for policy 0, policy_version 213443 (0.0038) [2024-06-28 11:12:58,317][06909] Updated weights for policy 0, policy_version 213453 (0.0031) [2024-06-28 11:12:58,850][06674] Fps is (10 sec: 40983.7, 60 sec: 43963.7, 300 sec: 44042.7). Total num frames: 3497213952. Throughput: 0: 44160.1. Samples: 3400107780. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2024-06-28 11:12:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:13:02,107][06909] Updated weights for policy 0, policy_version 213463 (0.0030) [2024-06-28 11:13:03,850][06674] Fps is (10 sec: 47523.1, 60 sec: 43965.2, 300 sec: 44154.0). Total num frames: 3497476096. Throughput: 0: 44079.6. Samples: 3400373080. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2024-06-28 11:13:03,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 11:13:05,895][06909] Updated weights for policy 0, policy_version 213473 (0.0038) [2024-06-28 11:13:08,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 3497672704. Throughput: 0: 44239.5. Samples: 3400646300. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2024-06-28 11:13:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:13:09,332][06909] Updated weights for policy 0, policy_version 213483 (0.0024) [2024-06-28 11:13:13,519][06909] Updated weights for policy 0, policy_version 213493 (0.0029) [2024-06-28 11:13:13,850][06674] Fps is (10 sec: 40959.8, 60 sec: 44236.8, 300 sec: 44098.2). Total num frames: 3497885696. Throughput: 0: 44196.8. Samples: 3400763920. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2024-06-28 11:13:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:13:16,941][06909] Updated weights for policy 0, policy_version 213503 (0.0046) [2024-06-28 11:13:18,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 3498131456. Throughput: 0: 44129.3. Samples: 3401033060. Policy #0 lag: (min: 1.0, avg: 9.0, max: 21.0) [2024-06-28 11:13:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:13:20,998][06909] Updated weights for policy 0, policy_version 213513 (0.0045) [2024-06-28 11:13:23,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.7, 300 sec: 43931.3). Total num frames: 3498328064. Throughput: 0: 44004.4. Samples: 3401298980. Policy #0 lag: (min: 1.0, avg: 9.0, max: 21.0) [2024-06-28 11:13:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:13:24,319][06909] Updated weights for policy 0, policy_version 213523 (0.0033) [2024-06-28 11:13:28,354][06909] Updated weights for policy 0, policy_version 213533 (0.0033) [2024-06-28 11:13:28,852][06674] Fps is (10 sec: 42590.1, 60 sec: 44235.3, 300 sec: 44153.2). Total num frames: 3498557440. Throughput: 0: 44357.6. Samples: 3401433400. Policy #0 lag: (min: 1.0, avg: 9.0, max: 21.0) [2024-06-28 11:13:28,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:13:31,565][06909] Updated weights for policy 0, policy_version 213543 (0.0023) [2024-06-28 11:13:33,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43417.5, 300 sec: 44042.4). Total num frames: 3498770432. Throughput: 0: 44054.4. Samples: 3401693900. Policy #0 lag: (min: 1.0, avg: 9.0, max: 21.0) [2024-06-28 11:13:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:13:35,699][06909] Updated weights for policy 0, policy_version 213553 (0.0037) [2024-06-28 11:13:38,850][06674] Fps is (10 sec: 40967.8, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 3498967040. Throughput: 0: 43953.0. Samples: 3401957120. Policy #0 lag: (min: 1.0, avg: 9.0, max: 21.0) [2024-06-28 11:13:38,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:13:39,318][06909] Updated weights for policy 0, policy_version 213563 (0.0038) [2024-06-28 11:13:42,966][06909] Updated weights for policy 0, policy_version 213573 (0.0022) [2024-06-28 11:13:43,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 3499212800. Throughput: 0: 43911.0. Samples: 3402083780. Policy #0 lag: (min: 1.0, avg: 9.0, max: 21.0) [2024-06-28 11:13:43,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:13:46,626][06909] Updated weights for policy 0, policy_version 213583 (0.0026) [2024-06-28 11:13:48,850][06674] Fps is (10 sec: 47513.9, 60 sec: 43967.9, 300 sec: 44097.9). Total num frames: 3499442176. Throughput: 0: 43834.2. Samples: 3402345620. Policy #0 lag: (min: 1.0, avg: 9.0, max: 21.0) [2024-06-28 11:13:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:13:48,867][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000213589_3499442176.pth... [2024-06-28 11:13:48,921][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000212944_3488874496.pth [2024-06-28 11:13:50,597][06909] Updated weights for policy 0, policy_version 213593 (0.0041) [2024-06-28 11:13:53,850][06674] Fps is (10 sec: 44237.3, 60 sec: 44238.3, 300 sec: 43986.9). Total num frames: 3499655168. Throughput: 0: 43895.7. Samples: 3402621600. Policy #0 lag: (min: 1.0, avg: 9.0, max: 21.0) [2024-06-28 11:13:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:13:54,154][06909] Updated weights for policy 0, policy_version 213603 (0.0020) [2024-06-28 11:13:58,049][06909] Updated weights for policy 0, policy_version 213613 (0.0028) [2024-06-28 11:13:58,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 3499884544. Throughput: 0: 44233.8. Samples: 3402754440. Policy #0 lag: (min: 1.0, avg: 9.0, max: 21.0) [2024-06-28 11:13:58,859][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:14:01,373][06909] Updated weights for policy 0, policy_version 213623 (0.0028) [2024-06-28 11:14:03,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 3500097536. Throughput: 0: 43949.5. Samples: 3403010780. Policy #0 lag: (min: 1.0, avg: 9.0, max: 21.0) [2024-06-28 11:14:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:14:05,398][06909] Updated weights for policy 0, policy_version 213633 (0.0030) [2024-06-28 11:14:05,669][06887] Signal inference workers to stop experience collection... (48150 times) [2024-06-28 11:14:05,674][06887] Signal inference workers to resume experience collection... (48150 times) [2024-06-28 11:14:05,690][06909] InferenceWorker_p0-w0: stopping experience collection (48150 times) [2024-06-28 11:14:05,690][06909] InferenceWorker_p0-w0: resuming experience collection (48150 times) [2024-06-28 11:14:08,667][06909] Updated weights for policy 0, policy_version 213643 (0.0023) [2024-06-28 11:14:08,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 3500343296. Throughput: 0: 44165.0. Samples: 3403286400. Policy #0 lag: (min: 1.0, avg: 9.0, max: 21.0) [2024-06-28 11:14:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:14:12,566][06909] Updated weights for policy 0, policy_version 213653 (0.0026) [2024-06-28 11:14:13,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3500523520. Throughput: 0: 44034.0. Samples: 3403414840. Policy #0 lag: (min: 1.0, avg: 9.0, max: 21.0) [2024-06-28 11:14:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:14:16,426][06909] Updated weights for policy 0, policy_version 213663 (0.0026) [2024-06-28 11:14:18,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43690.7, 300 sec: 44097.9). Total num frames: 3500752896. Throughput: 0: 43993.8. Samples: 3403673620. Policy #0 lag: (min: 1.0, avg: 9.0, max: 21.0) [2024-06-28 11:14:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:14:20,046][06909] Updated weights for policy 0, policy_version 213673 (0.0028) [2024-06-28 11:14:23,786][06909] Updated weights for policy 0, policy_version 213683 (0.0041) [2024-06-28 11:14:23,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44236.8, 300 sec: 43987.2). Total num frames: 3500982272. Throughput: 0: 44189.0. Samples: 3403945620. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 11:14:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:14:27,502][06909] Updated weights for policy 0, policy_version 213693 (0.0026) [2024-06-28 11:14:28,856][06674] Fps is (10 sec: 44210.3, 60 sec: 43960.8, 300 sec: 44041.5). Total num frames: 3501195264. Throughput: 0: 44253.7. Samples: 3404075460. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 11:14:28,856][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:14:31,041][06909] Updated weights for policy 0, policy_version 213703 (0.0032) [2024-06-28 11:14:33,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44236.8, 300 sec: 44098.2). Total num frames: 3501424640. Throughput: 0: 44123.5. Samples: 3404331180. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 11:14:33,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:14:35,362][06909] Updated weights for policy 0, policy_version 213713 (0.0028) [2024-06-28 11:14:38,662][06909] Updated weights for policy 0, policy_version 213723 (0.0043) [2024-06-28 11:14:38,850][06674] Fps is (10 sec: 45903.2, 60 sec: 44783.1, 300 sec: 44042.4). Total num frames: 3501654016. Throughput: 0: 44172.4. Samples: 3404609360. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 11:14:38,850][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 11:14:42,805][06909] Updated weights for policy 0, policy_version 213733 (0.0032) [2024-06-28 11:14:43,850][06674] Fps is (10 sec: 42599.4, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3501850624. Throughput: 0: 44047.2. Samples: 3404736560. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 11:14:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:14:46,101][06909] Updated weights for policy 0, policy_version 213743 (0.0030) [2024-06-28 11:14:48,850][06674] Fps is (10 sec: 42597.5, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 3502080000. Throughput: 0: 44101.6. Samples: 3404995360. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 11:14:48,851][06674] Avg episode reward: [(0, '0.458')] [2024-06-28 11:14:49,978][06909] Updated weights for policy 0, policy_version 213753 (0.0031) [2024-06-28 11:14:53,393][06909] Updated weights for policy 0, policy_version 213763 (0.0031) [2024-06-28 11:14:53,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 3502309376. Throughput: 0: 43980.0. Samples: 3405265500. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 11:14:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:14:57,354][06909] Updated weights for policy 0, policy_version 213773 (0.0045) [2024-06-28 11:14:58,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 3502522368. Throughput: 0: 44109.7. Samples: 3405399780. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 11:14:58,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:15:00,974][06909] Updated weights for policy 0, policy_version 213783 (0.0029) [2024-06-28 11:15:03,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.6, 300 sec: 44097.9). Total num frames: 3502735360. Throughput: 0: 44100.4. Samples: 3405658140. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 11:15:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:15:05,014][06909] Updated weights for policy 0, policy_version 213793 (0.0043) [2024-06-28 11:15:08,429][06909] Updated weights for policy 0, policy_version 213803 (0.0041) [2024-06-28 11:15:08,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 3502964736. Throughput: 0: 43994.7. Samples: 3405925380. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 11:15:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:15:12,433][06909] Updated weights for policy 0, policy_version 213813 (0.0034) [2024-06-28 11:15:13,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.8, 300 sec: 44098.3). Total num frames: 3503177728. Throughput: 0: 44126.8. Samples: 3406060900. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 11:15:13,850][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 11:15:15,952][06909] Updated weights for policy 0, policy_version 213823 (0.0026) [2024-06-28 11:15:18,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 3503407104. Throughput: 0: 44069.4. Samples: 3406314300. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 11:15:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:15:20,127][06909] Updated weights for policy 0, policy_version 213833 (0.0023) [2024-06-28 11:15:23,324][06909] Updated weights for policy 0, policy_version 213843 (0.0030) [2024-06-28 11:15:23,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 3503636480. Throughput: 0: 43851.5. Samples: 3406582680. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 11:15:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:15:27,248][06909] Updated weights for policy 0, policy_version 213853 (0.0033) [2024-06-28 11:15:28,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43968.2, 300 sec: 43986.9). Total num frames: 3503833088. Throughput: 0: 44066.2. Samples: 3406719540. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 11:15:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:15:30,659][06909] Updated weights for policy 0, policy_version 213863 (0.0034) [2024-06-28 11:15:33,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43690.8, 300 sec: 44042.4). Total num frames: 3504046080. Throughput: 0: 44103.3. Samples: 3406980000. Policy #0 lag: (min: 0.0, avg: 12.1, max: 25.0) [2024-06-28 11:15:33,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 11:15:34,522][06909] Updated weights for policy 0, policy_version 213873 (0.0027) [2024-06-28 11:15:37,971][06887] Signal inference workers to stop experience collection... (48200 times) [2024-06-28 11:15:37,972][06887] Signal inference workers to resume experience collection... (48200 times) [2024-06-28 11:15:37,974][06909] Updated weights for policy 0, policy_version 213883 (0.0026) [2024-06-28 11:15:38,024][06909] InferenceWorker_p0-w0: stopping experience collection (48200 times) [2024-06-28 11:15:38,024][06909] InferenceWorker_p0-w0: resuming experience collection (48200 times) [2024-06-28 11:15:38,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 3504291840. Throughput: 0: 43970.2. Samples: 3407244160. Policy #0 lag: (min: 0.0, avg: 12.1, max: 25.0) [2024-06-28 11:15:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:15:42,158][06909] Updated weights for policy 0, policy_version 213893 (0.0035) [2024-06-28 11:15:43,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3504488448. Throughput: 0: 44041.0. Samples: 3407381620. Policy #0 lag: (min: 0.0, avg: 12.1, max: 25.0) [2024-06-28 11:15:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:15:45,398][06909] Updated weights for policy 0, policy_version 213903 (0.0029) [2024-06-28 11:15:48,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.9, 300 sec: 44098.0). Total num frames: 3504717824. Throughput: 0: 44053.0. Samples: 3407640520. Policy #0 lag: (min: 0.0, avg: 12.1, max: 25.0) [2024-06-28 11:15:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 11:15:48,981][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000213912_3504734208.pth... [2024-06-28 11:15:49,022][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000213268_3494182912.pth [2024-06-28 11:15:49,746][06909] Updated weights for policy 0, policy_version 213913 (0.0036) [2024-06-28 11:15:52,963][06909] Updated weights for policy 0, policy_version 213923 (0.0031) [2024-06-28 11:15:53,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3504947200. Throughput: 0: 43812.9. Samples: 3407896960. Policy #0 lag: (min: 0.0, avg: 12.1, max: 25.0) [2024-06-28 11:15:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:15:57,136][06909] Updated weights for policy 0, policy_version 213933 (0.0020) [2024-06-28 11:15:58,850][06674] Fps is (10 sec: 42597.7, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 3505143808. Throughput: 0: 43810.6. Samples: 3408032380. Policy #0 lag: (min: 0.0, avg: 12.1, max: 25.0) [2024-06-28 11:15:58,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:16:00,424][06909] Updated weights for policy 0, policy_version 213943 (0.0032) [2024-06-28 11:16:03,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 3505373184. Throughput: 0: 44144.6. Samples: 3408300800. Policy #0 lag: (min: 0.0, avg: 12.1, max: 25.0) [2024-06-28 11:16:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:16:04,322][06909] Updated weights for policy 0, policy_version 213953 (0.0036) [2024-06-28 11:16:07,784][06909] Updated weights for policy 0, policy_version 213963 (0.0033) [2024-06-28 11:16:08,850][06674] Fps is (10 sec: 47514.5, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 3505618944. Throughput: 0: 44072.5. Samples: 3408565940. Policy #0 lag: (min: 0.0, avg: 12.1, max: 25.0) [2024-06-28 11:16:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:16:11,685][06909] Updated weights for policy 0, policy_version 213973 (0.0043) [2024-06-28 11:16:13,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 3505799168. Throughput: 0: 44075.6. Samples: 3408702940. Policy #0 lag: (min: 0.0, avg: 12.1, max: 25.0) [2024-06-28 11:16:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:16:14,988][06909] Updated weights for policy 0, policy_version 213983 (0.0030) [2024-06-28 11:16:18,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 3506044928. Throughput: 0: 44094.2. Samples: 3408964240. Policy #0 lag: (min: 0.0, avg: 12.1, max: 25.0) [2024-06-28 11:16:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:16:19,178][06909] Updated weights for policy 0, policy_version 213993 (0.0029) [2024-06-28 11:16:22,460][06909] Updated weights for policy 0, policy_version 214003 (0.0040) [2024-06-28 11:16:23,850][06674] Fps is (10 sec: 49151.7, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 3506290688. Throughput: 0: 44029.3. Samples: 3409225480. Policy #0 lag: (min: 0.0, avg: 12.1, max: 25.0) [2024-06-28 11:16:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:16:26,845][06909] Updated weights for policy 0, policy_version 214013 (0.0036) [2024-06-28 11:16:28,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 3506454528. Throughput: 0: 44097.8. Samples: 3409366020. Policy #0 lag: (min: 0.0, avg: 12.1, max: 25.0) [2024-06-28 11:16:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:16:29,895][06909] Updated weights for policy 0, policy_version 214023 (0.0032) [2024-06-28 11:16:33,850][06674] Fps is (10 sec: 40959.8, 60 sec: 44236.7, 300 sec: 44098.4). Total num frames: 3506700288. Throughput: 0: 44146.1. Samples: 3409627100. Policy #0 lag: (min: 0.0, avg: 12.1, max: 25.0) [2024-06-28 11:16:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:16:34,215][06909] Updated weights for policy 0, policy_version 214033 (0.0032) [2024-06-28 11:16:37,345][06909] Updated weights for policy 0, policy_version 214043 (0.0029) [2024-06-28 11:16:38,850][06674] Fps is (10 sec: 49152.1, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 3506946048. Throughput: 0: 44302.7. Samples: 3409890580. Policy #0 lag: (min: 0.0, avg: 12.1, max: 25.0) [2024-06-28 11:16:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:16:41,396][06909] Updated weights for policy 0, policy_version 214053 (0.0034) [2024-06-28 11:16:43,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43963.8, 300 sec: 43931.5). Total num frames: 3507126272. Throughput: 0: 44387.3. Samples: 3410029800. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 11:16:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:16:44,901][06909] Updated weights for policy 0, policy_version 214063 (0.0028) [2024-06-28 11:16:48,690][06909] Updated weights for policy 0, policy_version 214073 (0.0026) [2024-06-28 11:16:48,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 3507372032. Throughput: 0: 44150.2. Samples: 3410287560. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 11:16:48,850][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 11:16:52,254][06909] Updated weights for policy 0, policy_version 214083 (0.0036) [2024-06-28 11:16:53,850][06674] Fps is (10 sec: 47513.3, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 3507601408. Throughput: 0: 44075.0. Samples: 3410549320. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 11:16:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:16:56,354][06909] Updated weights for policy 0, policy_version 214093 (0.0035) [2024-06-28 11:16:58,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43963.8, 300 sec: 43876.1). Total num frames: 3507781632. Throughput: 0: 44172.4. Samples: 3410690700. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 11:16:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:16:59,073][06887] Signal inference workers to stop experience collection... (48250 times) [2024-06-28 11:16:59,074][06887] Signal inference workers to resume experience collection... (48250 times) [2024-06-28 11:16:59,118][06909] InferenceWorker_p0-w0: stopping experience collection (48250 times) [2024-06-28 11:16:59,118][06909] InferenceWorker_p0-w0: resuming experience collection (48250 times) [2024-06-28 11:16:59,522][06909] Updated weights for policy 0, policy_version 214103 (0.0042) [2024-06-28 11:17:03,618][06909] Updated weights for policy 0, policy_version 214113 (0.0050) [2024-06-28 11:17:03,850][06674] Fps is (10 sec: 42598.1, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 3508027392. Throughput: 0: 44038.1. Samples: 3410945960. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 11:17:03,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:17:07,020][06909] Updated weights for policy 0, policy_version 214123 (0.0032) [2024-06-28 11:17:08,850][06674] Fps is (10 sec: 47513.7, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 3508256768. Throughput: 0: 44115.6. Samples: 3411210680. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 11:17:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:17:11,195][06909] Updated weights for policy 0, policy_version 214133 (0.0036) [2024-06-28 11:17:13,850][06674] Fps is (10 sec: 42598.9, 60 sec: 44236.8, 300 sec: 43931.4). Total num frames: 3508453376. Throughput: 0: 44019.1. Samples: 3411346880. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 11:17:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:17:14,404][06909] Updated weights for policy 0, policy_version 214143 (0.0029) [2024-06-28 11:17:18,390][06909] Updated weights for policy 0, policy_version 214153 (0.0030) [2024-06-28 11:17:18,850][06674] Fps is (10 sec: 44237.3, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 3508699136. Throughput: 0: 44154.0. Samples: 3411614020. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 11:17:18,856][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:17:21,845][06909] Updated weights for policy 0, policy_version 214163 (0.0040) [2024-06-28 11:17:23,852][06674] Fps is (10 sec: 45865.6, 60 sec: 43689.2, 300 sec: 44097.6). Total num frames: 3508912128. Throughput: 0: 44202.8. Samples: 3411879800. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 11:17:23,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:17:25,837][06909] Updated weights for policy 0, policy_version 214173 (0.0029) [2024-06-28 11:17:28,856][06674] Fps is (10 sec: 44209.4, 60 sec: 44778.4, 300 sec: 43986.0). Total num frames: 3509141504. Throughput: 0: 44075.3. Samples: 3412013460. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 11:17:28,865][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:17:29,177][06909] Updated weights for policy 0, policy_version 214183 (0.0025) [2024-06-28 11:17:33,105][06909] Updated weights for policy 0, policy_version 214193 (0.0037) [2024-06-28 11:17:33,850][06674] Fps is (10 sec: 45884.4, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 3509370880. Throughput: 0: 44319.5. Samples: 3412281940. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 11:17:33,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:17:36,771][06909] Updated weights for policy 0, policy_version 214203 (0.0046) [2024-06-28 11:17:38,850][06674] Fps is (10 sec: 44263.7, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 3509583872. Throughput: 0: 44257.8. Samples: 3412540920. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 11:17:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:17:40,753][06909] Updated weights for policy 0, policy_version 214213 (0.0027) [2024-06-28 11:17:43,850][06674] Fps is (10 sec: 42598.7, 60 sec: 44509.8, 300 sec: 44043.3). Total num frames: 3509796864. Throughput: 0: 44125.8. Samples: 3412676360. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 11:17:43,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:17:44,381][06909] Updated weights for policy 0, policy_version 214223 (0.0023) [2024-06-28 11:17:48,092][06909] Updated weights for policy 0, policy_version 214233 (0.0032) [2024-06-28 11:17:48,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.8, 300 sec: 44153.8). Total num frames: 3510026240. Throughput: 0: 44248.9. Samples: 3412937160. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 11:17:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:17:48,869][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000214235_3510026240.pth... [2024-06-28 11:17:48,918][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000213589_3499442176.pth [2024-06-28 11:17:51,830][06909] Updated weights for policy 0, policy_version 214243 (0.0032) [2024-06-28 11:17:53,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 3510239232. Throughput: 0: 44335.1. Samples: 3413205760. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 11:17:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:17:55,671][06909] Updated weights for policy 0, policy_version 214253 (0.0036) [2024-06-28 11:17:58,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44782.9, 300 sec: 44042.4). Total num frames: 3510468608. Throughput: 0: 44165.2. Samples: 3413334320. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 11:17:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:17:59,125][06909] Updated weights for policy 0, policy_version 214263 (0.0034) [2024-06-28 11:18:02,912][06909] Updated weights for policy 0, policy_version 214273 (0.0033) [2024-06-28 11:18:03,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 3510681600. Throughput: 0: 44164.4. Samples: 3413601420. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 11:18:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:18:05,879][06887] Signal inference workers to stop experience collection... (48300 times) [2024-06-28 11:18:05,916][06909] InferenceWorker_p0-w0: stopping experience collection (48300 times) [2024-06-28 11:18:05,948][06887] Signal inference workers to resume experience collection... (48300 times) [2024-06-28 11:18:05,949][06909] InferenceWorker_p0-w0: resuming experience collection (48300 times) [2024-06-28 11:18:06,750][06909] Updated weights for policy 0, policy_version 214283 (0.0034) [2024-06-28 11:18:08,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 3510894592. Throughput: 0: 44134.1. Samples: 3413865740. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 11:18:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:18:10,634][06909] Updated weights for policy 0, policy_version 214293 (0.0032) [2024-06-28 11:18:13,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 3511123968. Throughput: 0: 44102.0. Samples: 3413997780. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 11:18:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:18:13,935][06909] Updated weights for policy 0, policy_version 214303 (0.0025) [2024-06-28 11:18:17,955][06909] Updated weights for policy 0, policy_version 214313 (0.0025) [2024-06-28 11:18:18,850][06674] Fps is (10 sec: 45874.5, 60 sec: 44236.6, 300 sec: 44153.5). Total num frames: 3511353344. Throughput: 0: 44020.8. Samples: 3414262880. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 11:18:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:18:21,562][06909] Updated weights for policy 0, policy_version 214323 (0.0040) [2024-06-28 11:18:23,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44238.4, 300 sec: 44098.3). Total num frames: 3511566336. Throughput: 0: 44035.2. Samples: 3414522500. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 11:18:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:18:25,335][06909] Updated weights for policy 0, policy_version 214333 (0.0023) [2024-06-28 11:18:28,792][06909] Updated weights for policy 0, policy_version 214343 (0.0024) [2024-06-28 11:18:28,852][06674] Fps is (10 sec: 44228.1, 60 sec: 44239.8, 300 sec: 44153.2). Total num frames: 3511795712. Throughput: 0: 44119.7. Samples: 3414661840. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 11:18:28,853][06674] Avg episode reward: [(0, '0.429')] [2024-06-28 11:18:32,828][06909] Updated weights for policy 0, policy_version 214353 (0.0028) [2024-06-28 11:18:33,853][06674] Fps is (10 sec: 42585.8, 60 sec: 43688.6, 300 sec: 44153.1). Total num frames: 3511992320. Throughput: 0: 44235.4. Samples: 3414927880. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 11:18:33,853][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:18:36,285][06909] Updated weights for policy 0, policy_version 214363 (0.0028) [2024-06-28 11:18:38,850][06674] Fps is (10 sec: 42607.3, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 3512221696. Throughput: 0: 44098.3. Samples: 3415190180. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 11:18:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:18:40,158][06909] Updated weights for policy 0, policy_version 214373 (0.0042) [2024-06-28 11:18:43,750][06909] Updated weights for policy 0, policy_version 214383 (0.0026) [2024-06-28 11:18:43,850][06674] Fps is (10 sec: 45887.9, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 3512451072. Throughput: 0: 44200.8. Samples: 3415323360. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 11:18:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:18:47,596][06909] Updated weights for policy 0, policy_version 214393 (0.0029) [2024-06-28 11:18:48,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 3512664064. Throughput: 0: 44105.7. Samples: 3415586180. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 11:18:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:18:51,254][06909] Updated weights for policy 0, policy_version 214403 (0.0035) [2024-06-28 11:18:53,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 3512893440. Throughput: 0: 43921.3. Samples: 3415842200. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 11:18:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:18:55,112][06909] Updated weights for policy 0, policy_version 214413 (0.0024) [2024-06-28 11:18:58,715][06909] Updated weights for policy 0, policy_version 214423 (0.0035) [2024-06-28 11:18:58,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 3513106432. Throughput: 0: 44090.1. Samples: 3415981840. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 11:18:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:19:02,632][06909] Updated weights for policy 0, policy_version 214433 (0.0027) [2024-06-28 11:19:03,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3513319424. Throughput: 0: 44016.6. Samples: 3416243620. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 11:19:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:19:06,031][06909] Updated weights for policy 0, policy_version 214443 (0.0024) [2024-06-28 11:19:08,852][06674] Fps is (10 sec: 42590.1, 60 sec: 43962.2, 300 sec: 44097.6). Total num frames: 3513532416. Throughput: 0: 44161.0. Samples: 3416509840. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 11:19:08,853][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:19:10,094][06909] Updated weights for policy 0, policy_version 214453 (0.0043) [2024-06-28 11:19:13,533][06909] Updated weights for policy 0, policy_version 214463 (0.0029) [2024-06-28 11:19:13,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 3513778176. Throughput: 0: 44027.4. Samples: 3416642980. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 11:19:13,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:19:17,394][06909] Updated weights for policy 0, policy_version 214473 (0.0033) [2024-06-28 11:19:18,850][06674] Fps is (10 sec: 44245.5, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 3513974784. Throughput: 0: 43977.0. Samples: 3416906720. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 11:19:18,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:19:20,694][06909] Updated weights for policy 0, policy_version 214483 (0.0031) [2024-06-28 11:19:23,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 44098.9). Total num frames: 3514204160. Throughput: 0: 44049.8. Samples: 3417172420. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 11:19:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:19:24,837][06909] Updated weights for policy 0, policy_version 214493 (0.0039) [2024-06-28 11:19:27,077][06887] Signal inference workers to stop experience collection... (48350 times) [2024-06-28 11:19:27,095][06909] InferenceWorker_p0-w0: stopping experience collection (48350 times) [2024-06-28 11:19:27,136][06887] Signal inference workers to resume experience collection... (48350 times) [2024-06-28 11:19:27,137][06909] InferenceWorker_p0-w0: resuming experience collection (48350 times) [2024-06-28 11:19:27,947][06909] Updated weights for policy 0, policy_version 214503 (0.0026) [2024-06-28 11:19:28,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43965.2, 300 sec: 44098.0). Total num frames: 3514433536. Throughput: 0: 44147.2. Samples: 3417309980. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 11:19:28,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:19:32,247][06909] Updated weights for policy 0, policy_version 214513 (0.0034) [2024-06-28 11:19:33,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43965.8, 300 sec: 43986.9). Total num frames: 3514630144. Throughput: 0: 44037.3. Samples: 3417567860. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 11:19:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:19:35,463][06909] Updated weights for policy 0, policy_version 214523 (0.0041) [2024-06-28 11:19:38,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 3514859520. Throughput: 0: 44272.0. Samples: 3417834440. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 11:19:38,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:19:39,922][06909] Updated weights for policy 0, policy_version 214533 (0.0025) [2024-06-28 11:19:43,060][06909] Updated weights for policy 0, policy_version 214543 (0.0039) [2024-06-28 11:19:43,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 3515088896. Throughput: 0: 44028.0. Samples: 3417963100. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 11:19:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:19:47,243][06909] Updated weights for policy 0, policy_version 214553 (0.0029) [2024-06-28 11:19:48,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3515301888. Throughput: 0: 44077.7. Samples: 3418227120. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 11:19:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 11:19:48,873][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000214557_3515301888.pth... [2024-06-28 11:19:48,933][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000213912_3504734208.pth [2024-06-28 11:19:50,798][06909] Updated weights for policy 0, policy_version 214563 (0.0043) [2024-06-28 11:19:53,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 3515514880. Throughput: 0: 43872.7. Samples: 3418484020. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 11:19:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:19:55,059][06909] Updated weights for policy 0, policy_version 214573 (0.0037) [2024-06-28 11:19:58,005][06909] Updated weights for policy 0, policy_version 214583 (0.0037) [2024-06-28 11:19:58,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 3515744256. Throughput: 0: 43959.5. Samples: 3418621160. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 11:19:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:20:02,308][06909] Updated weights for policy 0, policy_version 214593 (0.0028) [2024-06-28 11:20:03,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 3515957248. Throughput: 0: 43939.1. Samples: 3418883980. Policy #0 lag: (min: 1.0, avg: 9.7, max: 21.0) [2024-06-28 11:20:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:20:05,483][06909] Updated weights for policy 0, policy_version 214603 (0.0040) [2024-06-28 11:20:08,850][06674] Fps is (10 sec: 42597.7, 60 sec: 43965.1, 300 sec: 44042.4). Total num frames: 3516170240. Throughput: 0: 43973.1. Samples: 3419151220. Policy #0 lag: (min: 1.0, avg: 9.7, max: 21.0) [2024-06-28 11:20:08,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:20:09,534][06909] Updated weights for policy 0, policy_version 214613 (0.0030) [2024-06-28 11:20:12,706][06909] Updated weights for policy 0, policy_version 214623 (0.0027) [2024-06-28 11:20:13,850][06674] Fps is (10 sec: 45875.8, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 3516416000. Throughput: 0: 43738.7. Samples: 3419278220. Policy #0 lag: (min: 1.0, avg: 9.7, max: 21.0) [2024-06-28 11:20:13,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:20:17,181][06909] Updated weights for policy 0, policy_version 214633 (0.0042) [2024-06-28 11:20:18,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3516612608. Throughput: 0: 43854.7. Samples: 3419541320. Policy #0 lag: (min: 1.0, avg: 9.7, max: 21.0) [2024-06-28 11:20:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:20:20,174][06909] Updated weights for policy 0, policy_version 214643 (0.0026) [2024-06-28 11:20:23,852][06674] Fps is (10 sec: 40951.7, 60 sec: 43689.2, 300 sec: 44042.1). Total num frames: 3516825600. Throughput: 0: 43846.9. Samples: 3419807640. Policy #0 lag: (min: 1.0, avg: 9.7, max: 21.0) [2024-06-28 11:20:23,852][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:20:24,496][06909] Updated weights for policy 0, policy_version 214653 (0.0026) [2024-06-28 11:20:27,823][06909] Updated weights for policy 0, policy_version 214663 (0.0025) [2024-06-28 11:20:28,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.7, 300 sec: 44097.9). Total num frames: 3517054976. Throughput: 0: 43822.7. Samples: 3419935120. Policy #0 lag: (min: 1.0, avg: 9.7, max: 21.0) [2024-06-28 11:20:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:20:32,163][06909] Updated weights for policy 0, policy_version 214673 (0.0038) [2024-06-28 11:20:33,850][06674] Fps is (10 sec: 45884.1, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3517284352. Throughput: 0: 43888.9. Samples: 3420202120. Policy #0 lag: (min: 1.0, avg: 9.7, max: 21.0) [2024-06-28 11:20:33,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:20:35,444][06909] Updated weights for policy 0, policy_version 214683 (0.0037) [2024-06-28 11:20:38,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 3517480960. Throughput: 0: 44039.0. Samples: 3420465780. Policy #0 lag: (min: 1.0, avg: 9.7, max: 21.0) [2024-06-28 11:20:38,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:20:39,421][06909] Updated weights for policy 0, policy_version 214693 (0.0026) [2024-06-28 11:20:42,570][06909] Updated weights for policy 0, policy_version 214703 (0.0036) [2024-06-28 11:20:43,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 3517726720. Throughput: 0: 43934.2. Samples: 3420598200. Policy #0 lag: (min: 1.0, avg: 9.7, max: 21.0) [2024-06-28 11:20:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:20:46,997][06909] Updated weights for policy 0, policy_version 214713 (0.0024) [2024-06-28 11:20:48,710][06887] Signal inference workers to stop experience collection... (48400 times) [2024-06-28 11:20:48,710][06887] Signal inference workers to resume experience collection... (48400 times) [2024-06-28 11:20:48,727][06909] InferenceWorker_p0-w0: stopping experience collection (48400 times) [2024-06-28 11:20:48,727][06909] InferenceWorker_p0-w0: resuming experience collection (48400 times) [2024-06-28 11:20:48,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3517923328. Throughput: 0: 43914.3. Samples: 3420860120. Policy #0 lag: (min: 1.0, avg: 9.7, max: 21.0) [2024-06-28 11:20:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:20:50,168][06909] Updated weights for policy 0, policy_version 214723 (0.0034) [2024-06-28 11:20:53,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 3518136320. Throughput: 0: 43827.7. Samples: 3421123460. Policy #0 lag: (min: 1.0, avg: 9.7, max: 21.0) [2024-06-28 11:20:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:20:54,415][06909] Updated weights for policy 0, policy_version 214733 (0.0028) [2024-06-28 11:20:57,438][06909] Updated weights for policy 0, policy_version 214743 (0.0037) [2024-06-28 11:20:58,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 3518382080. Throughput: 0: 43989.8. Samples: 3421257760. Policy #0 lag: (min: 1.0, avg: 9.7, max: 21.0) [2024-06-28 11:20:58,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:21:01,982][06909] Updated weights for policy 0, policy_version 214753 (0.0040) [2024-06-28 11:21:03,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3518595072. Throughput: 0: 43936.1. Samples: 3421518440. Policy #0 lag: (min: 1.0, avg: 9.7, max: 21.0) [2024-06-28 11:21:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:21:04,695][06909] Updated weights for policy 0, policy_version 214763 (0.0039) [2024-06-28 11:21:08,850][06674] Fps is (10 sec: 42597.7, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 3518808064. Throughput: 0: 43991.2. Samples: 3421787160. Policy #0 lag: (min: 1.0, avg: 9.7, max: 21.0) [2024-06-28 11:21:08,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:21:09,509][06909] Updated weights for policy 0, policy_version 214773 (0.0033) [2024-06-28 11:21:12,337][06909] Updated weights for policy 0, policy_version 214783 (0.0027) [2024-06-28 11:21:13,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 3519037440. Throughput: 0: 44068.9. Samples: 3421918220. Policy #0 lag: (min: 1.0, avg: 9.0, max: 22.0) [2024-06-28 11:21:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:21:16,818][06909] Updated weights for policy 0, policy_version 214793 (0.0028) [2024-06-28 11:21:18,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3519266816. Throughput: 0: 44043.1. Samples: 3422184060. Policy #0 lag: (min: 1.0, avg: 9.0, max: 22.0) [2024-06-28 11:21:18,851][06674] Avg episode reward: [(0, '0.418')] [2024-06-28 11:21:19,767][06909] Updated weights for policy 0, policy_version 214803 (0.0042) [2024-06-28 11:21:23,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43692.1, 300 sec: 44042.4). Total num frames: 3519447040. Throughput: 0: 44001.8. Samples: 3422445860. Policy #0 lag: (min: 1.0, avg: 9.0, max: 22.0) [2024-06-28 11:21:23,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:21:24,537][06909] Updated weights for policy 0, policy_version 214813 (0.0038) [2024-06-28 11:21:27,471][06909] Updated weights for policy 0, policy_version 214823 (0.0034) [2024-06-28 11:21:28,850][06674] Fps is (10 sec: 45876.0, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 3519725568. Throughput: 0: 43906.8. Samples: 3422574000. Policy #0 lag: (min: 1.0, avg: 9.0, max: 22.0) [2024-06-28 11:21:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:21:31,902][06909] Updated weights for policy 0, policy_version 214833 (0.0028) [2024-06-28 11:21:33,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43690.8, 300 sec: 43931.3). Total num frames: 3519905792. Throughput: 0: 43933.0. Samples: 3422837100. Policy #0 lag: (min: 1.0, avg: 9.0, max: 22.0) [2024-06-28 11:21:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:21:34,805][06909] Updated weights for policy 0, policy_version 214843 (0.0037) [2024-06-28 11:21:38,850][06674] Fps is (10 sec: 37683.3, 60 sec: 43690.8, 300 sec: 43986.9). Total num frames: 3520102400. Throughput: 0: 44051.7. Samples: 3423105780. Policy #0 lag: (min: 1.0, avg: 9.0, max: 22.0) [2024-06-28 11:21:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 11:21:39,189][06909] Updated weights for policy 0, policy_version 214853 (0.0037) [2024-06-28 11:21:42,274][06909] Updated weights for policy 0, policy_version 214863 (0.0035) [2024-06-28 11:21:43,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 3520380928. Throughput: 0: 43931.5. Samples: 3423234680. Policy #0 lag: (min: 1.0, avg: 9.0, max: 22.0) [2024-06-28 11:21:43,864][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 11:21:46,657][06909] Updated weights for policy 0, policy_version 214873 (0.0027) [2024-06-28 11:21:48,850][06674] Fps is (10 sec: 49151.2, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 3520593920. Throughput: 0: 44213.7. Samples: 3423508060. Policy #0 lag: (min: 1.0, avg: 9.0, max: 22.0) [2024-06-28 11:21:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:21:48,857][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000214880_3520593920.pth... [2024-06-28 11:21:48,916][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000214235_3510026240.pth [2024-06-28 11:21:49,517][06909] Updated weights for policy 0, policy_version 214883 (0.0031) [2024-06-28 11:21:53,850][06674] Fps is (10 sec: 40960.0, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 3520790528. Throughput: 0: 44072.1. Samples: 3423770400. Policy #0 lag: (min: 1.0, avg: 9.0, max: 22.0) [2024-06-28 11:21:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:21:54,080][06909] Updated weights for policy 0, policy_version 214893 (0.0026) [2024-06-28 11:21:57,227][06909] Updated weights for policy 0, policy_version 214903 (0.0021) [2024-06-28 11:21:58,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 3521036288. Throughput: 0: 43960.3. Samples: 3423896440. Policy #0 lag: (min: 1.0, avg: 9.0, max: 22.0) [2024-06-28 11:21:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:22:01,712][06909] Updated weights for policy 0, policy_version 214913 (0.0027) [2024-06-28 11:22:02,847][06887] Signal inference workers to stop experience collection... (48450 times) [2024-06-28 11:22:02,896][06909] InferenceWorker_p0-w0: stopping experience collection (48450 times) [2024-06-28 11:22:02,958][06887] Signal inference workers to resume experience collection... (48450 times) [2024-06-28 11:22:02,959][06909] InferenceWorker_p0-w0: resuming experience collection (48450 times) [2024-06-28 11:22:03,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3521249280. Throughput: 0: 44005.9. Samples: 3424164320. Policy #0 lag: (min: 1.0, avg: 9.0, max: 22.0) [2024-06-28 11:22:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:22:04,559][06909] Updated weights for policy 0, policy_version 214923 (0.0025) [2024-06-28 11:22:08,850][06674] Fps is (10 sec: 40960.6, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3521445888. Throughput: 0: 43933.8. Samples: 3424422880. Policy #0 lag: (min: 1.0, avg: 9.0, max: 22.0) [2024-06-28 11:22:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:22:09,147][06909] Updated weights for policy 0, policy_version 214933 (0.0026) [2024-06-28 11:22:12,390][06909] Updated weights for policy 0, policy_version 214943 (0.0026) [2024-06-28 11:22:13,852][06674] Fps is (10 sec: 44227.9, 60 sec: 44235.2, 300 sec: 44042.1). Total num frames: 3521691648. Throughput: 0: 44021.5. Samples: 3424555060. Policy #0 lag: (min: 1.0, avg: 9.0, max: 22.0) [2024-06-28 11:22:13,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:22:16,338][06909] Updated weights for policy 0, policy_version 214953 (0.0033) [2024-06-28 11:22:18,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.8, 300 sec: 43987.2). Total num frames: 3521888256. Throughput: 0: 44166.7. Samples: 3424824600. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 11:22:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:22:19,707][06909] Updated weights for policy 0, policy_version 214963 (0.0043) [2024-06-28 11:22:23,850][06674] Fps is (10 sec: 42607.3, 60 sec: 44509.9, 300 sec: 43987.8). Total num frames: 3522117632. Throughput: 0: 44005.3. Samples: 3425086020. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 11:22:23,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:22:23,852][06909] Updated weights for policy 0, policy_version 214973 (0.0036) [2024-06-28 11:22:27,079][06909] Updated weights for policy 0, policy_version 214983 (0.0035) [2024-06-28 11:22:28,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3522347008. Throughput: 0: 43963.6. Samples: 3425213040. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 11:22:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:22:31,593][06909] Updated weights for policy 0, policy_version 214993 (0.0038) [2024-06-28 11:22:33,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3522560000. Throughput: 0: 43796.6. Samples: 3425478900. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 11:22:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:22:34,418][06909] Updated weights for policy 0, policy_version 215003 (0.0025) [2024-06-28 11:22:38,850][06674] Fps is (10 sec: 40959.8, 60 sec: 44236.7, 300 sec: 43931.3). Total num frames: 3522756608. Throughput: 0: 44025.8. Samples: 3425751560. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 11:22:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:22:38,971][06909] Updated weights for policy 0, policy_version 215013 (0.0024) [2024-06-28 11:22:42,049][06909] Updated weights for policy 0, policy_version 215023 (0.0031) [2024-06-28 11:22:43,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3523018752. Throughput: 0: 43883.7. Samples: 3425871200. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 11:22:43,856][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:22:46,182][06909] Updated weights for policy 0, policy_version 215033 (0.0032) [2024-06-28 11:22:48,850][06674] Fps is (10 sec: 47513.0, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3523231744. Throughput: 0: 44076.4. Samples: 3426147760. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 11:22:48,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:22:49,328][06909] Updated weights for policy 0, policy_version 215043 (0.0033) [2024-06-28 11:22:53,503][06909] Updated weights for policy 0, policy_version 215053 (0.0025) [2024-06-28 11:22:53,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3523444736. Throughput: 0: 44105.3. Samples: 3426407620. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 11:22:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:22:56,611][06909] Updated weights for policy 0, policy_version 215063 (0.0031) [2024-06-28 11:22:58,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3523674112. Throughput: 0: 44051.2. Samples: 3426537280. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 11:22:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:23:01,014][06909] Updated weights for policy 0, policy_version 215073 (0.0028) [2024-06-28 11:23:03,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 3523903488. Throughput: 0: 44065.2. Samples: 3426807540. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 11:23:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:23:04,061][06909] Updated weights for policy 0, policy_version 215083 (0.0036) [2024-06-28 11:23:08,563][06909] Updated weights for policy 0, policy_version 215093 (0.0036) [2024-06-28 11:23:08,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43963.6, 300 sec: 43931.3). Total num frames: 3524083712. Throughput: 0: 44055.8. Samples: 3427068540. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 11:23:08,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:23:11,717][06909] Updated weights for policy 0, policy_version 215103 (0.0035) [2024-06-28 11:23:13,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43965.2, 300 sec: 43986.9). Total num frames: 3524329472. Throughput: 0: 43965.2. Samples: 3427191480. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 11:23:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:23:15,798][06909] Updated weights for policy 0, policy_version 215113 (0.0034) [2024-06-28 11:23:18,850][06674] Fps is (10 sec: 47514.2, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 3524558848. Throughput: 0: 44117.3. Samples: 3427464180. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 11:23:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:23:19,228][06909] Updated weights for policy 0, policy_version 215123 (0.0028) [2024-06-28 11:23:23,218][06909] Updated weights for policy 0, policy_version 215133 (0.0027) [2024-06-28 11:23:23,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43690.7, 300 sec: 43876.1). Total num frames: 3524739072. Throughput: 0: 43877.4. Samples: 3427726040. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 11:23:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:23:26,608][06909] Updated weights for policy 0, policy_version 215143 (0.0033) [2024-06-28 11:23:27,615][06887] Signal inference workers to stop experience collection... (48500 times) [2024-06-28 11:23:27,615][06887] Signal inference workers to resume experience collection... (48500 times) [2024-06-28 11:23:27,656][06909] InferenceWorker_p0-w0: stopping experience collection (48500 times) [2024-06-28 11:23:27,656][06909] InferenceWorker_p0-w0: resuming experience collection (48500 times) [2024-06-28 11:23:28,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.7, 300 sec: 44042.8). Total num frames: 3524984832. Throughput: 0: 43995.1. Samples: 3427850980. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 11:23:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:23:30,904][06909] Updated weights for policy 0, policy_version 215153 (0.0030) [2024-06-28 11:23:33,850][06674] Fps is (10 sec: 47512.7, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 3525214208. Throughput: 0: 43899.1. Samples: 3428123220. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 11:23:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:23:34,173][06909] Updated weights for policy 0, policy_version 215163 (0.0022) [2024-06-28 11:23:38,280][06909] Updated weights for policy 0, policy_version 215173 (0.0032) [2024-06-28 11:23:38,850][06674] Fps is (10 sec: 42598.2, 60 sec: 44236.7, 300 sec: 43931.3). Total num frames: 3525410816. Throughput: 0: 43797.7. Samples: 3428378520. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 11:23:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:23:41,553][06909] Updated weights for policy 0, policy_version 215183 (0.0030) [2024-06-28 11:23:43,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3525640192. Throughput: 0: 43752.1. Samples: 3428506120. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 11:23:43,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:23:45,810][06909] Updated weights for policy 0, policy_version 215193 (0.0027) [2024-06-28 11:23:48,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3525869568. Throughput: 0: 43763.6. Samples: 3428776900. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 11:23:48,851][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:23:48,860][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000215202_3525869568.pth... [2024-06-28 11:23:48,913][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000214557_3515301888.pth [2024-06-28 11:23:49,287][06909] Updated weights for policy 0, policy_version 215203 (0.0038) [2024-06-28 11:23:53,150][06909] Updated weights for policy 0, policy_version 215213 (0.0022) [2024-06-28 11:23:53,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 3526066176. Throughput: 0: 43825.4. Samples: 3429040680. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 11:23:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:23:56,706][06909] Updated weights for policy 0, policy_version 215223 (0.0038) [2024-06-28 11:23:58,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3526295552. Throughput: 0: 43901.8. Samples: 3429167060. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 11:23:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:24:00,861][06909] Updated weights for policy 0, policy_version 215233 (0.0028) [2024-06-28 11:24:03,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43690.7, 300 sec: 44042.7). Total num frames: 3526524928. Throughput: 0: 43821.7. Samples: 3429436160. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 11:24:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:24:04,043][06909] Updated weights for policy 0, policy_version 215243 (0.0029) [2024-06-28 11:24:07,964][06909] Updated weights for policy 0, policy_version 215253 (0.0046) [2024-06-28 11:24:08,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 3526737920. Throughput: 0: 43930.1. Samples: 3429702900. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 11:24:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:24:11,579][06909] Updated weights for policy 0, policy_version 215263 (0.0025) [2024-06-28 11:24:13,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3526967296. Throughput: 0: 44115.1. Samples: 3429836160. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 11:24:13,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:24:15,639][06909] Updated weights for policy 0, policy_version 215273 (0.0028) [2024-06-28 11:24:18,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3527180288. Throughput: 0: 43984.1. Samples: 3430102500. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 11:24:18,856][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:24:19,244][06909] Updated weights for policy 0, policy_version 215283 (0.0032) [2024-06-28 11:24:22,915][06909] Updated weights for policy 0, policy_version 215293 (0.0040) [2024-06-28 11:24:23,850][06674] Fps is (10 sec: 42599.0, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 3527393280. Throughput: 0: 44167.3. Samples: 3430366040. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 11:24:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:24:26,465][06909] Updated weights for policy 0, policy_version 215303 (0.0042) [2024-06-28 11:24:28,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3527622656. Throughput: 0: 44311.0. Samples: 3430500120. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 11:24:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:24:30,279][06909] Updated weights for policy 0, policy_version 215313 (0.0032) [2024-06-28 11:24:33,758][06909] Updated weights for policy 0, policy_version 215323 (0.0024) [2024-06-28 11:24:33,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3527852032. Throughput: 0: 44038.2. Samples: 3430758620. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 11:24:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:24:37,794][06909] Updated weights for policy 0, policy_version 215333 (0.0030) [2024-06-28 11:24:38,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 3528065024. Throughput: 0: 44166.7. Samples: 3431028180. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 11:24:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:24:41,393][06909] Updated weights for policy 0, policy_version 215343 (0.0031) [2024-06-28 11:24:43,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3528278016. Throughput: 0: 44381.0. Samples: 3431164200. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 11:24:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:24:44,971][06909] Updated weights for policy 0, policy_version 215353 (0.0037) [2024-06-28 11:24:48,601][06909] Updated weights for policy 0, policy_version 215363 (0.0031) [2024-06-28 11:24:48,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3528507392. Throughput: 0: 44125.3. Samples: 3431421800. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 11:24:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:24:52,830][06909] Updated weights for policy 0, policy_version 215373 (0.0036) [2024-06-28 11:24:53,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3528720384. Throughput: 0: 43980.9. Samples: 3431682040. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 11:24:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:24:56,218][06909] Updated weights for policy 0, policy_version 215383 (0.0034) [2024-06-28 11:24:58,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 3528933376. Throughput: 0: 44016.8. Samples: 3431816920. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 11:24:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:25:00,143][06909] Updated weights for policy 0, policy_version 215393 (0.0033) [2024-06-28 11:25:03,536][06909] Updated weights for policy 0, policy_version 215403 (0.0039) [2024-06-28 11:25:03,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.8, 300 sec: 44042.5). Total num frames: 3529162752. Throughput: 0: 43933.8. Samples: 3432079520. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 11:25:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:25:07,326][06909] Updated weights for policy 0, policy_version 215413 (0.0026) [2024-06-28 11:25:08,852][06674] Fps is (10 sec: 45866.4, 60 sec: 44235.3, 300 sec: 43986.6). Total num frames: 3529392128. Throughput: 0: 44151.3. Samples: 3432352940. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 11:25:08,852][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:25:10,747][06909] Updated weights for policy 0, policy_version 215423 (0.0040) [2024-06-28 11:25:13,745][06887] Signal inference workers to stop experience collection... (48550 times) [2024-06-28 11:25:13,748][06887] Signal inference workers to resume experience collection... (48550 times) [2024-06-28 11:25:13,761][06909] InferenceWorker_p0-w0: stopping experience collection (48550 times) [2024-06-28 11:25:13,794][06909] InferenceWorker_p0-w0: resuming experience collection (48550 times) [2024-06-28 11:25:13,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.8, 300 sec: 43986.9). Total num frames: 3529588736. Throughput: 0: 44037.0. Samples: 3432481780. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 11:25:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:25:14,684][06909] Updated weights for policy 0, policy_version 215433 (0.0030) [2024-06-28 11:25:18,244][06909] Updated weights for policy 0, policy_version 215443 (0.0038) [2024-06-28 11:25:18,850][06674] Fps is (10 sec: 44246.0, 60 sec: 44236.8, 300 sec: 44098.3). Total num frames: 3529834496. Throughput: 0: 44124.9. Samples: 3432744240. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 11:25:18,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:25:22,308][06909] Updated weights for policy 0, policy_version 215453 (0.0053) [2024-06-28 11:25:23,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 3530031104. Throughput: 0: 43902.5. Samples: 3433003800. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 11:25:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:25:26,001][06909] Updated weights for policy 0, policy_version 215463 (0.0029) [2024-06-28 11:25:28,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 3530244096. Throughput: 0: 43754.0. Samples: 3433133140. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 11:25:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:25:30,069][06909] Updated weights for policy 0, policy_version 215473 (0.0039) [2024-06-28 11:25:33,283][06909] Updated weights for policy 0, policy_version 215483 (0.0026) [2024-06-28 11:25:33,852][06674] Fps is (10 sec: 45866.1, 60 sec: 43962.3, 300 sec: 44097.7). Total num frames: 3530489856. Throughput: 0: 43871.8. Samples: 3433396120. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 11:25:33,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:25:37,321][06909] Updated weights for policy 0, policy_version 215493 (0.0033) [2024-06-28 11:25:38,850][06674] Fps is (10 sec: 45875.8, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3530702848. Throughput: 0: 43995.6. Samples: 3433661840. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 11:25:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:25:40,558][06909] Updated weights for policy 0, policy_version 215503 (0.0030) [2024-06-28 11:25:43,850][06674] Fps is (10 sec: 42606.4, 60 sec: 43963.5, 300 sec: 44042.4). Total num frames: 3530915840. Throughput: 0: 43894.6. Samples: 3433792180. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 11:25:43,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:25:45,051][06909] Updated weights for policy 0, policy_version 215513 (0.0032) [2024-06-28 11:25:48,067][06909] Updated weights for policy 0, policy_version 215523 (0.0029) [2024-06-28 11:25:48,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 3531145216. Throughput: 0: 43901.3. Samples: 3434055080. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 11:25:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 11:25:48,861][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000215524_3531145216.pth... [2024-06-28 11:25:48,910][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000214880_3520593920.pth [2024-06-28 11:25:52,248][06909] Updated weights for policy 0, policy_version 215533 (0.0027) [2024-06-28 11:25:53,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3531358208. Throughput: 0: 43821.1. Samples: 3434324800. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 11:25:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:25:55,783][06909] Updated weights for policy 0, policy_version 215543 (0.0031) [2024-06-28 11:25:58,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 3531554816. Throughput: 0: 43855.5. Samples: 3434455280. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 11:25:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:25:59,467][06909] Updated weights for policy 0, policy_version 215553 (0.0043) [2024-06-28 11:26:03,106][06909] Updated weights for policy 0, policy_version 215563 (0.0036) [2024-06-28 11:26:03,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3531800576. Throughput: 0: 43702.2. Samples: 3434710840. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 11:26:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 11:26:07,075][06909] Updated weights for policy 0, policy_version 215573 (0.0030) [2024-06-28 11:26:08,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43692.1, 300 sec: 43986.9). Total num frames: 3532013568. Throughput: 0: 43968.9. Samples: 3434982400. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 11:26:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:26:10,337][06909] Updated weights for policy 0, policy_version 215583 (0.0039) [2024-06-28 11:26:13,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 3532210176. Throughput: 0: 44081.9. Samples: 3435116820. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 11:26:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:26:14,521][06909] Updated weights for policy 0, policy_version 215593 (0.0030) [2024-06-28 11:26:17,546][06909] Updated weights for policy 0, policy_version 215603 (0.0032) [2024-06-28 11:26:18,856][06674] Fps is (10 sec: 45847.5, 60 sec: 43959.3, 300 sec: 44152.6). Total num frames: 3532472320. Throughput: 0: 44046.3. Samples: 3435378380. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 11:26:18,856][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:26:21,829][06909] Updated weights for policy 0, policy_version 215613 (0.0035) [2024-06-28 11:26:23,852][06674] Fps is (10 sec: 49142.0, 60 sec: 44508.4, 300 sec: 43986.6). Total num frames: 3532701696. Throughput: 0: 44241.1. Samples: 3435652780. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 11:26:23,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:26:25,102][06909] Updated weights for policy 0, policy_version 215623 (0.0036) [2024-06-28 11:26:28,850][06674] Fps is (10 sec: 42623.9, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3532898304. Throughput: 0: 44339.2. Samples: 3435787440. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 11:26:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:26:29,061][06909] Updated weights for policy 0, policy_version 215633 (0.0034) [2024-06-28 11:26:32,687][06909] Updated weights for policy 0, policy_version 215643 (0.0029) [2024-06-28 11:26:33,850][06674] Fps is (10 sec: 42607.4, 60 sec: 43965.3, 300 sec: 44153.5). Total num frames: 3533127680. Throughput: 0: 44231.6. Samples: 3436045500. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 11:26:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:26:37,026][06909] Updated weights for policy 0, policy_version 215653 (0.0026) [2024-06-28 11:26:38,238][06887] Signal inference workers to stop experience collection... (48600 times) [2024-06-28 11:26:38,239][06887] Signal inference workers to resume experience collection... (48600 times) [2024-06-28 11:26:38,251][06909] InferenceWorker_p0-w0: stopping experience collection (48600 times) [2024-06-28 11:26:38,251][06909] InferenceWorker_p0-w0: resuming experience collection (48600 times) [2024-06-28 11:26:38,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 3533340672. Throughput: 0: 43952.9. Samples: 3436302680. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 11:26:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:26:39,960][06909] Updated weights for policy 0, policy_version 215663 (0.0041) [2024-06-28 11:26:43,850][06674] Fps is (10 sec: 40959.5, 60 sec: 43690.8, 300 sec: 43875.8). Total num frames: 3533537280. Throughput: 0: 44052.5. Samples: 3436437640. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 11:26:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:26:44,494][06909] Updated weights for policy 0, policy_version 215673 (0.0039) [2024-06-28 11:26:47,307][06909] Updated weights for policy 0, policy_version 215683 (0.0032) [2024-06-28 11:26:48,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3533783040. Throughput: 0: 44080.9. Samples: 3436694480. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 11:26:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:26:51,830][06909] Updated weights for policy 0, policy_version 215693 (0.0030) [2024-06-28 11:26:53,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3534012416. Throughput: 0: 44012.9. Samples: 3436962980. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2024-06-28 11:26:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 11:26:54,669][06909] Updated weights for policy 0, policy_version 215703 (0.0030) [2024-06-28 11:26:58,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 3534192640. Throughput: 0: 43959.1. Samples: 3437094980. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2024-06-28 11:26:58,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:26:59,272][06909] Updated weights for policy 0, policy_version 215713 (0.0038) [2024-06-28 11:27:02,368][06909] Updated weights for policy 0, policy_version 215723 (0.0034) [2024-06-28 11:27:03,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3534438400. Throughput: 0: 43874.8. Samples: 3437352480. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2024-06-28 11:27:03,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:27:06,875][06909] Updated weights for policy 0, policy_version 215733 (0.0030) [2024-06-28 11:27:08,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43963.7, 300 sec: 43931.6). Total num frames: 3534651392. Throughput: 0: 43693.9. Samples: 3437618920. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2024-06-28 11:27:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:27:09,655][06909] Updated weights for policy 0, policy_version 215743 (0.0024) [2024-06-28 11:27:13,856][06674] Fps is (10 sec: 42572.8, 60 sec: 44232.4, 300 sec: 43986.0). Total num frames: 3534864384. Throughput: 0: 43567.2. Samples: 3437748220. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2024-06-28 11:27:13,856][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:27:14,822][06909] Updated weights for policy 0, policy_version 215753 (0.0032) [2024-06-28 11:27:17,358][06909] Updated weights for policy 0, policy_version 215763 (0.0036) [2024-06-28 11:27:18,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43695.1, 300 sec: 43986.9). Total num frames: 3535093760. Throughput: 0: 43581.3. Samples: 3438006660. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2024-06-28 11:27:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:27:22,196][06909] Updated weights for policy 0, policy_version 215773 (0.0041) [2024-06-28 11:27:23,850][06674] Fps is (10 sec: 47542.2, 60 sec: 43965.2, 300 sec: 44042.4). Total num frames: 3535339520. Throughput: 0: 43937.3. Samples: 3438279860. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2024-06-28 11:27:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:27:24,615][06909] Updated weights for policy 0, policy_version 215783 (0.0038) [2024-06-28 11:27:28,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3535536128. Throughput: 0: 43878.2. Samples: 3438412160. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2024-06-28 11:27:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:27:29,577][06909] Updated weights for policy 0, policy_version 215793 (0.0033) [2024-06-28 11:27:31,932][06909] Updated weights for policy 0, policy_version 215803 (0.0024) [2024-06-28 11:27:33,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 3535749120. Throughput: 0: 43841.8. Samples: 3438667360. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2024-06-28 11:27:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:27:36,893][06909] Updated weights for policy 0, policy_version 215813 (0.0028) [2024-06-28 11:27:37,691][06887] Signal inference workers to stop experience collection... (48650 times) [2024-06-28 11:27:37,697][06887] Signal inference workers to resume experience collection... (48650 times) [2024-06-28 11:27:37,706][06909] InferenceWorker_p0-w0: stopping experience collection (48650 times) [2024-06-28 11:27:37,732][06909] InferenceWorker_p0-w0: resuming experience collection (48650 times) [2024-06-28 11:27:38,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 3535978496. Throughput: 0: 43871.2. Samples: 3438937180. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2024-06-28 11:27:38,850][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 11:27:39,733][06909] Updated weights for policy 0, policy_version 215823 (0.0030) [2024-06-28 11:27:43,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.9, 300 sec: 43931.4). Total num frames: 3536191488. Throughput: 0: 43820.0. Samples: 3439066880. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2024-06-28 11:27:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:27:44,230][06909] Updated weights for policy 0, policy_version 215833 (0.0025) [2024-06-28 11:27:47,351][06909] Updated weights for policy 0, policy_version 215843 (0.0029) [2024-06-28 11:27:48,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 3536404480. Throughput: 0: 43888.0. Samples: 3439327440. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2024-06-28 11:27:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:27:48,865][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000215846_3536420864.pth... [2024-06-28 11:27:48,931][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000215202_3525869568.pth [2024-06-28 11:27:52,380][06909] Updated weights for policy 0, policy_version 215853 (0.0037) [2024-06-28 11:27:53,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3536650240. Throughput: 0: 43873.5. Samples: 3439593220. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2024-06-28 11:27:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:27:54,757][06909] Updated weights for policy 0, policy_version 215863 (0.0035) [2024-06-28 11:27:58,850][06674] Fps is (10 sec: 44235.9, 60 sec: 44236.7, 300 sec: 43875.8). Total num frames: 3536846848. Throughput: 0: 43979.5. Samples: 3439727040. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2024-06-28 11:27:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:27:59,791][06909] Updated weights for policy 0, policy_version 215873 (0.0031) [2024-06-28 11:28:02,116][06909] Updated weights for policy 0, policy_version 215883 (0.0031) [2024-06-28 11:28:03,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3537059840. Throughput: 0: 43885.4. Samples: 3439981500. Policy #0 lag: (min: 0.0, avg: 7.0, max: 20.0) [2024-06-28 11:28:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:28:07,049][06909] Updated weights for policy 0, policy_version 215893 (0.0028) [2024-06-28 11:28:08,850][06674] Fps is (10 sec: 45876.4, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 3537305600. Throughput: 0: 43733.4. Samples: 3440247860. Policy #0 lag: (min: 0.0, avg: 7.0, max: 20.0) [2024-06-28 11:28:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:28:09,542][06909] Updated weights for policy 0, policy_version 215903 (0.0043) [2024-06-28 11:28:13,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43968.1, 300 sec: 43875.8). Total num frames: 3537502208. Throughput: 0: 43740.4. Samples: 3440380480. Policy #0 lag: (min: 0.0, avg: 7.0, max: 20.0) [2024-06-28 11:28:13,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:28:14,727][06909] Updated weights for policy 0, policy_version 215913 (0.0038) [2024-06-28 11:28:17,294][06909] Updated weights for policy 0, policy_version 215923 (0.0040) [2024-06-28 11:28:18,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3537731584. Throughput: 0: 43745.8. Samples: 3440635920. Policy #0 lag: (min: 0.0, avg: 7.0, max: 20.0) [2024-06-28 11:28:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:28:22,203][06909] Updated weights for policy 0, policy_version 215933 (0.0043) [2024-06-28 11:28:23,850][06674] Fps is (10 sec: 47514.0, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3537977344. Throughput: 0: 43738.2. Samples: 3440905400. Policy #0 lag: (min: 0.0, avg: 7.0, max: 20.0) [2024-06-28 11:28:23,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:28:24,839][06909] Updated weights for policy 0, policy_version 215943 (0.0032) [2024-06-28 11:28:28,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 3538157568. Throughput: 0: 43901.8. Samples: 3441042460. Policy #0 lag: (min: 0.0, avg: 7.0, max: 20.0) [2024-06-28 11:28:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:28:29,465][06909] Updated weights for policy 0, policy_version 215953 (0.0030) [2024-06-28 11:28:32,042][06909] Updated weights for policy 0, policy_version 215963 (0.0039) [2024-06-28 11:28:33,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3538386944. Throughput: 0: 43934.5. Samples: 3441304500. Policy #0 lag: (min: 0.0, avg: 7.0, max: 20.0) [2024-06-28 11:28:33,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:28:36,816][06909] Updated weights for policy 0, policy_version 215973 (0.0034) [2024-06-28 11:28:38,401][06887] Signal inference workers to stop experience collection... (48700 times) [2024-06-28 11:28:38,449][06909] InferenceWorker_p0-w0: stopping experience collection (48700 times) [2024-06-28 11:28:38,458][06887] Signal inference workers to resume experience collection... (48700 times) [2024-06-28 11:28:38,472][06909] InferenceWorker_p0-w0: resuming experience collection (48700 times) [2024-06-28 11:28:38,850][06674] Fps is (10 sec: 49150.9, 60 sec: 44509.7, 300 sec: 44097.9). Total num frames: 3538649088. Throughput: 0: 43949.1. Samples: 3441570940. Policy #0 lag: (min: 0.0, avg: 7.0, max: 20.0) [2024-06-28 11:28:38,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:28:39,275][06909] Updated weights for policy 0, policy_version 215983 (0.0028) [2024-06-28 11:28:43,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 3538812928. Throughput: 0: 43972.6. Samples: 3441705800. Policy #0 lag: (min: 0.0, avg: 7.0, max: 20.0) [2024-06-28 11:28:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:28:44,545][06909] Updated weights for policy 0, policy_version 215993 (0.0048) [2024-06-28 11:28:46,838][06909] Updated weights for policy 0, policy_version 216003 (0.0032) [2024-06-28 11:28:48,850][06674] Fps is (10 sec: 39322.2, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3539042304. Throughput: 0: 44085.7. Samples: 3441965360. Policy #0 lag: (min: 0.0, avg: 7.0, max: 20.0) [2024-06-28 11:28:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:28:51,642][06909] Updated weights for policy 0, policy_version 216013 (0.0036) [2024-06-28 11:28:53,850][06674] Fps is (10 sec: 47513.7, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3539288064. Throughput: 0: 44003.9. Samples: 3442228040. Policy #0 lag: (min: 0.0, avg: 7.0, max: 20.0) [2024-06-28 11:28:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:28:54,631][06909] Updated weights for policy 0, policy_version 216023 (0.0032) [2024-06-28 11:28:58,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.8, 300 sec: 43875.8). Total num frames: 3539468288. Throughput: 0: 44123.6. Samples: 3442366040. Policy #0 lag: (min: 0.0, avg: 7.0, max: 20.0) [2024-06-28 11:28:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:28:59,302][06909] Updated weights for policy 0, policy_version 216033 (0.0033) [2024-06-28 11:29:01,841][06909] Updated weights for policy 0, policy_version 216043 (0.0032) [2024-06-28 11:29:03,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3539714048. Throughput: 0: 44176.0. Samples: 3442623840. Policy #0 lag: (min: 0.0, avg: 7.0, max: 20.0) [2024-06-28 11:29:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:29:06,640][06909] Updated weights for policy 0, policy_version 216053 (0.0035) [2024-06-28 11:29:08,856][06674] Fps is (10 sec: 49122.4, 60 sec: 44232.3, 300 sec: 44041.5). Total num frames: 3539959808. Throughput: 0: 44028.3. Samples: 3442886940. Policy #0 lag: (min: 1.0, avg: 9.8, max: 24.0) [2024-06-28 11:29:08,856][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:29:09,373][06909] Updated weights for policy 0, policy_version 216063 (0.0034) [2024-06-28 11:29:13,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 3540123648. Throughput: 0: 44147.9. Samples: 3443029120. Policy #0 lag: (min: 1.0, avg: 9.8, max: 24.0) [2024-06-28 11:29:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:29:14,069][06909] Updated weights for policy 0, policy_version 216073 (0.0041) [2024-06-28 11:29:16,660][06909] Updated weights for policy 0, policy_version 216083 (0.0031) [2024-06-28 11:29:18,850][06674] Fps is (10 sec: 40984.7, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3540369408. Throughput: 0: 44120.5. Samples: 3443289920. Policy #0 lag: (min: 1.0, avg: 9.8, max: 24.0) [2024-06-28 11:29:18,850][06674] Avg episode reward: [(0, '0.403')] [2024-06-28 11:29:21,376][06909] Updated weights for policy 0, policy_version 216093 (0.0032) [2024-06-28 11:29:23,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43417.6, 300 sec: 43931.3). Total num frames: 3540582400. Throughput: 0: 43831.3. Samples: 3443543340. Policy #0 lag: (min: 1.0, avg: 9.8, max: 24.0) [2024-06-28 11:29:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:29:24,526][06909] Updated weights for policy 0, policy_version 216103 (0.0032) [2024-06-28 11:29:28,607][06909] Updated weights for policy 0, policy_version 216113 (0.0027) [2024-06-28 11:29:28,852][06674] Fps is (10 sec: 42589.8, 60 sec: 43962.2, 300 sec: 43875.5). Total num frames: 3540795392. Throughput: 0: 43859.8. Samples: 3443679580. Policy #0 lag: (min: 1.0, avg: 9.8, max: 24.0) [2024-06-28 11:29:28,853][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:29:31,981][06909] Updated weights for policy 0, policy_version 216123 (0.0035) [2024-06-28 11:29:33,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 3541008384. Throughput: 0: 43902.3. Samples: 3443940960. Policy #0 lag: (min: 1.0, avg: 9.8, max: 24.0) [2024-06-28 11:29:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:29:36,517][06909] Updated weights for policy 0, policy_version 216133 (0.0036) [2024-06-28 11:29:38,850][06674] Fps is (10 sec: 45884.5, 60 sec: 43417.7, 300 sec: 43986.9). Total num frames: 3541254144. Throughput: 0: 43987.5. Samples: 3444207480. Policy #0 lag: (min: 1.0, avg: 9.8, max: 24.0) [2024-06-28 11:29:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:29:39,248][06909] Updated weights for policy 0, policy_version 216143 (0.0035) [2024-06-28 11:29:43,770][06909] Updated weights for policy 0, policy_version 216153 (0.0036) [2024-06-28 11:29:43,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 3541450752. Throughput: 0: 43990.7. Samples: 3444345620. Policy #0 lag: (min: 1.0, avg: 9.8, max: 24.0) [2024-06-28 11:29:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:29:46,735][06909] Updated weights for policy 0, policy_version 216163 (0.0033) [2024-06-28 11:29:48,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 3541680128. Throughput: 0: 44088.5. Samples: 3444607820. Policy #0 lag: (min: 1.0, avg: 9.8, max: 24.0) [2024-06-28 11:29:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:29:48,866][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000216167_3541680128.pth... [2024-06-28 11:29:48,947][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000215524_3531145216.pth [2024-06-28 11:29:51,255][06909] Updated weights for policy 0, policy_version 216173 (0.0033) [2024-06-28 11:29:53,850][06674] Fps is (10 sec: 47514.1, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3541925888. Throughput: 0: 43983.3. Samples: 3444865920. Policy #0 lag: (min: 1.0, avg: 9.8, max: 24.0) [2024-06-28 11:29:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:29:54,005][06909] Updated weights for policy 0, policy_version 216183 (0.0043) [2024-06-28 11:29:58,391][06909] Updated weights for policy 0, policy_version 216193 (0.0023) [2024-06-28 11:29:58,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.9, 300 sec: 43931.3). Total num frames: 3542122496. Throughput: 0: 43914.7. Samples: 3445005280. Policy #0 lag: (min: 1.0, avg: 9.8, max: 24.0) [2024-06-28 11:29:58,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 11:29:59,830][06887] Signal inference workers to stop experience collection... (48750 times) [2024-06-28 11:29:59,830][06887] Signal inference workers to resume experience collection... (48750 times) [2024-06-28 11:29:59,860][06909] InferenceWorker_p0-w0: stopping experience collection (48750 times) [2024-06-28 11:29:59,860][06909] InferenceWorker_p0-w0: resuming experience collection (48750 times) [2024-06-28 11:30:01,911][06909] Updated weights for policy 0, policy_version 216203 (0.0039) [2024-06-28 11:30:03,850][06674] Fps is (10 sec: 42597.7, 60 sec: 43963.7, 300 sec: 43931.6). Total num frames: 3542351872. Throughput: 0: 43903.1. Samples: 3445265560. Policy #0 lag: (min: 1.0, avg: 9.8, max: 24.0) [2024-06-28 11:30:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:30:06,014][06909] Updated weights for policy 0, policy_version 216213 (0.0033) [2024-06-28 11:30:08,856][06674] Fps is (10 sec: 45847.4, 60 sec: 43690.7, 300 sec: 44041.5). Total num frames: 3542581248. Throughput: 0: 44216.7. Samples: 3445533360. Policy #0 lag: (min: 1.0, avg: 9.8, max: 24.0) [2024-06-28 11:30:08,856][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:30:09,293][06909] Updated weights for policy 0, policy_version 216223 (0.0034) [2024-06-28 11:30:13,528][06909] Updated weights for policy 0, policy_version 216233 (0.0033) [2024-06-28 11:30:13,850][06674] Fps is (10 sec: 40960.7, 60 sec: 43963.8, 300 sec: 43820.3). Total num frames: 3542761472. Throughput: 0: 44199.5. Samples: 3445668460. Policy #0 lag: (min: 1.0, avg: 9.8, max: 24.0) [2024-06-28 11:30:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:30:16,610][06909] Updated weights for policy 0, policy_version 216243 (0.0023) [2024-06-28 11:30:18,850][06674] Fps is (10 sec: 42624.1, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3543007232. Throughput: 0: 44164.0. Samples: 3445928340. Policy #0 lag: (min: 0.0, avg: 11.3, max: 25.0) [2024-06-28 11:30:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:30:21,053][06909] Updated weights for policy 0, policy_version 216253 (0.0034) [2024-06-28 11:30:23,805][06909] Updated weights for policy 0, policy_version 216263 (0.0039) [2024-06-28 11:30:23,850][06674] Fps is (10 sec: 49151.4, 60 sec: 44509.8, 300 sec: 44098.0). Total num frames: 3543252992. Throughput: 0: 44056.0. Samples: 3446190000. Policy #0 lag: (min: 0.0, avg: 11.3, max: 25.0) [2024-06-28 11:30:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:30:28,377][06909] Updated weights for policy 0, policy_version 216273 (0.0030) [2024-06-28 11:30:28,850][06674] Fps is (10 sec: 42597.4, 60 sec: 43965.1, 300 sec: 43876.1). Total num frames: 3543433216. Throughput: 0: 43926.9. Samples: 3446322340. Policy #0 lag: (min: 0.0, avg: 11.3, max: 25.0) [2024-06-28 11:30:28,851][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:30:31,231][06909] Updated weights for policy 0, policy_version 216283 (0.0036) [2024-06-28 11:30:33,850][06674] Fps is (10 sec: 40960.2, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 3543662592. Throughput: 0: 43960.0. Samples: 3446586020. Policy #0 lag: (min: 0.0, avg: 11.3, max: 25.0) [2024-06-28 11:30:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:30:35,641][06909] Updated weights for policy 0, policy_version 216293 (0.0033) [2024-06-28 11:30:38,850][06674] Fps is (10 sec: 45876.5, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3543891968. Throughput: 0: 44201.7. Samples: 3446855000. Policy #0 lag: (min: 0.0, avg: 11.3, max: 25.0) [2024-06-28 11:30:38,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:30:39,117][06909] Updated weights for policy 0, policy_version 216303 (0.0025) [2024-06-28 11:30:43,085][06909] Updated weights for policy 0, policy_version 216313 (0.0035) [2024-06-28 11:30:43,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44509.8, 300 sec: 43986.9). Total num frames: 3544121344. Throughput: 0: 44091.0. Samples: 3446989380. Policy #0 lag: (min: 0.0, avg: 11.3, max: 25.0) [2024-06-28 11:30:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:30:46,343][06909] Updated weights for policy 0, policy_version 216323 (0.0030) [2024-06-28 11:30:48,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 3544334336. Throughput: 0: 44336.6. Samples: 3447260700. Policy #0 lag: (min: 0.0, avg: 11.3, max: 25.0) [2024-06-28 11:30:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:30:50,709][06909] Updated weights for policy 0, policy_version 216333 (0.0036) [2024-06-28 11:30:53,622][06909] Updated weights for policy 0, policy_version 216343 (0.0045) [2024-06-28 11:30:53,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 3544563712. Throughput: 0: 44144.7. Samples: 3447519600. Policy #0 lag: (min: 0.0, avg: 11.3, max: 25.0) [2024-06-28 11:30:53,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:30:58,090][06909] Updated weights for policy 0, policy_version 216353 (0.0036) [2024-06-28 11:30:58,654][06887] Signal inference workers to stop experience collection... (48800 times) [2024-06-28 11:30:58,706][06887] Signal inference workers to resume experience collection... (48800 times) [2024-06-28 11:30:58,708][06909] InferenceWorker_p0-w0: stopping experience collection (48800 times) [2024-06-28 11:30:58,732][06909] InferenceWorker_p0-w0: resuming experience collection (48800 times) [2024-06-28 11:30:58,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3544776704. Throughput: 0: 44046.6. Samples: 3447650560. Policy #0 lag: (min: 0.0, avg: 11.3, max: 25.0) [2024-06-28 11:30:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:31:00,978][06909] Updated weights for policy 0, policy_version 216363 (0.0035) [2024-06-28 11:31:03,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 3545006080. Throughput: 0: 44154.2. Samples: 3447915280. Policy #0 lag: (min: 0.0, avg: 11.3, max: 25.0) [2024-06-28 11:31:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:31:05,378][06909] Updated weights for policy 0, policy_version 216373 (0.0022) [2024-06-28 11:31:08,746][06909] Updated weights for policy 0, policy_version 216383 (0.0036) [2024-06-28 11:31:08,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43968.1, 300 sec: 44098.0). Total num frames: 3545219072. Throughput: 0: 44123.1. Samples: 3448175540. Policy #0 lag: (min: 0.0, avg: 11.3, max: 25.0) [2024-06-28 11:31:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:31:12,785][06909] Updated weights for policy 0, policy_version 216393 (0.0022) [2024-06-28 11:31:13,850][06674] Fps is (10 sec: 44236.1, 60 sec: 44782.7, 300 sec: 43987.8). Total num frames: 3545448448. Throughput: 0: 44089.9. Samples: 3448306380. Policy #0 lag: (min: 0.0, avg: 11.3, max: 25.0) [2024-06-28 11:31:13,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:31:16,183][06909] Updated weights for policy 0, policy_version 216403 (0.0034) [2024-06-28 11:31:18,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.8, 300 sec: 43931.6). Total num frames: 3545661440. Throughput: 0: 44266.6. Samples: 3448578020. Policy #0 lag: (min: 0.0, avg: 11.3, max: 25.0) [2024-06-28 11:31:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 11:31:20,243][06909] Updated weights for policy 0, policy_version 216413 (0.0033) [2024-06-28 11:31:23,722][06909] Updated weights for policy 0, policy_version 216423 (0.0026) [2024-06-28 11:31:23,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 3545874432. Throughput: 0: 44193.2. Samples: 3448843700. Policy #0 lag: (min: 0.0, avg: 11.3, max: 25.0) [2024-06-28 11:31:23,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:31:27,745][06909] Updated weights for policy 0, policy_version 216433 (0.0042) [2024-06-28 11:31:28,850][06674] Fps is (10 sec: 42598.9, 60 sec: 44237.0, 300 sec: 43931.3). Total num frames: 3546087424. Throughput: 0: 43934.8. Samples: 3448966440. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-28 11:31:28,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:31:31,224][06909] Updated weights for policy 0, policy_version 216443 (0.0040) [2024-06-28 11:31:33,850][06674] Fps is (10 sec: 45875.7, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 3546333184. Throughput: 0: 43837.7. Samples: 3449233400. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-28 11:31:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:31:35,096][06909] Updated weights for policy 0, policy_version 216453 (0.0021) [2024-06-28 11:31:38,593][06909] Updated weights for policy 0, policy_version 216463 (0.0033) [2024-06-28 11:31:38,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3546529792. Throughput: 0: 43924.0. Samples: 3449496180. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-28 11:31:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:31:42,747][06909] Updated weights for policy 0, policy_version 216473 (0.0031) [2024-06-28 11:31:43,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3546759168. Throughput: 0: 43808.9. Samples: 3449621960. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-28 11:31:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:31:46,214][06909] Updated weights for policy 0, policy_version 216483 (0.0022) [2024-06-28 11:31:48,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 3546988544. Throughput: 0: 44020.4. Samples: 3449896200. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-28 11:31:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:31:48,870][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000216491_3546988544.pth... [2024-06-28 11:31:48,931][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000215846_3536420864.pth [2024-06-28 11:31:50,236][06909] Updated weights for policy 0, policy_version 216493 (0.0038) [2024-06-28 11:31:53,441][06909] Updated weights for policy 0, policy_version 216503 (0.0032) [2024-06-28 11:31:53,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43963.6, 300 sec: 44097.9). Total num frames: 3547201536. Throughput: 0: 44143.9. Samples: 3450162020. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-28 11:31:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:31:57,400][06909] Updated weights for policy 0, policy_version 216513 (0.0021) [2024-06-28 11:31:58,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44509.8, 300 sec: 44097.9). Total num frames: 3547447296. Throughput: 0: 44112.9. Samples: 3450291460. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-28 11:31:58,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:32:00,840][06909] Updated weights for policy 0, policy_version 216523 (0.0029) [2024-06-28 11:32:03,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 3547627520. Throughput: 0: 44020.4. Samples: 3450558940. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-28 11:32:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 11:32:04,819][06909] Updated weights for policy 0, policy_version 216533 (0.0036) [2024-06-28 11:32:08,384][06909] Updated weights for policy 0, policy_version 216543 (0.0032) [2024-06-28 11:32:08,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43963.7, 300 sec: 44043.3). Total num frames: 3547856896. Throughput: 0: 43865.8. Samples: 3450817660. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-28 11:32:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:32:10,523][06887] Signal inference workers to stop experience collection... (48850 times) [2024-06-28 11:32:10,559][06909] InferenceWorker_p0-w0: stopping experience collection (48850 times) [2024-06-28 11:32:10,576][06887] Signal inference workers to resume experience collection... (48850 times) [2024-06-28 11:32:10,578][06909] InferenceWorker_p0-w0: resuming experience collection (48850 times) [2024-06-28 11:32:12,163][06909] Updated weights for policy 0, policy_version 216553 (0.0038) [2024-06-28 11:32:13,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43963.9, 300 sec: 44042.4). Total num frames: 3548086272. Throughput: 0: 44031.0. Samples: 3450947840. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-28 11:32:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:32:16,029][06909] Updated weights for policy 0, policy_version 216563 (0.0026) [2024-06-28 11:32:18,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 3548282880. Throughput: 0: 43828.0. Samples: 3451205660. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-28 11:32:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:32:20,331][06909] Updated weights for policy 0, policy_version 216573 (0.0032) [2024-06-28 11:32:23,356][06909] Updated weights for policy 0, policy_version 216583 (0.0036) [2024-06-28 11:32:23,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3548512256. Throughput: 0: 43964.7. Samples: 3451474600. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-28 11:32:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:32:27,479][06909] Updated weights for policy 0, policy_version 216593 (0.0038) [2024-06-28 11:32:28,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3548741632. Throughput: 0: 44134.7. Samples: 3451608020. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-28 11:32:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:32:30,470][06909] Updated weights for policy 0, policy_version 216603 (0.0029) [2024-06-28 11:32:33,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 3548954624. Throughput: 0: 44049.3. Samples: 3451878420. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 11:32:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:32:34,746][06909] Updated weights for policy 0, policy_version 216613 (0.0041) [2024-06-28 11:32:38,140][06909] Updated weights for policy 0, policy_version 216623 (0.0038) [2024-06-28 11:32:38,850][06674] Fps is (10 sec: 42597.6, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 3549167616. Throughput: 0: 43995.6. Samples: 3452141820. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 11:32:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:32:41,951][06909] Updated weights for policy 0, policy_version 216633 (0.0035) [2024-06-28 11:32:43,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 3549413376. Throughput: 0: 44132.5. Samples: 3452277420. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 11:32:43,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:32:45,743][06909] Updated weights for policy 0, policy_version 216643 (0.0029) [2024-06-28 11:32:48,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 3549609984. Throughput: 0: 43886.1. Samples: 3452533820. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 11:32:48,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:32:49,575][06909] Updated weights for policy 0, policy_version 216653 (0.0031) [2024-06-28 11:32:53,139][06909] Updated weights for policy 0, policy_version 216663 (0.0037) [2024-06-28 11:32:53,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3549822976. Throughput: 0: 43930.2. Samples: 3452794520. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 11:32:53,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:32:57,229][06909] Updated weights for policy 0, policy_version 216673 (0.0034) [2024-06-28 11:32:58,850][06674] Fps is (10 sec: 44237.6, 60 sec: 43417.7, 300 sec: 44042.4). Total num frames: 3550052352. Throughput: 0: 43991.6. Samples: 3452927460. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 11:32:58,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:33:00,321][06909] Updated weights for policy 0, policy_version 216683 (0.0042) [2024-06-28 11:33:03,856][06674] Fps is (10 sec: 45847.9, 60 sec: 44232.4, 300 sec: 43986.0). Total num frames: 3550281728. Throughput: 0: 44211.4. Samples: 3453195440. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 11:33:03,857][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:33:04,465][06909] Updated weights for policy 0, policy_version 216693 (0.0030) [2024-06-28 11:33:07,995][06909] Updated weights for policy 0, policy_version 216703 (0.0040) [2024-06-28 11:33:08,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3550494720. Throughput: 0: 43929.9. Samples: 3453451440. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 11:33:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:33:11,992][06909] Updated weights for policy 0, policy_version 216713 (0.0031) [2024-06-28 11:33:13,850][06674] Fps is (10 sec: 42624.1, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3550707712. Throughput: 0: 44037.7. Samples: 3453589720. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 11:33:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:33:15,356][06909] Updated weights for policy 0, policy_version 216723 (0.0033) [2024-06-28 11:33:18,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 3550920704. Throughput: 0: 43741.3. Samples: 3453846780. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 11:33:18,854][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:33:19,393][06909] Updated weights for policy 0, policy_version 216733 (0.0031) [2024-06-28 11:33:23,126][06909] Updated weights for policy 0, policy_version 216743 (0.0042) [2024-06-28 11:33:23,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3551133696. Throughput: 0: 43800.0. Samples: 3454112820. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 11:33:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:33:27,220][06909] Updated weights for policy 0, policy_version 216753 (0.0033) [2024-06-28 11:33:28,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 3551363072. Throughput: 0: 43617.3. Samples: 3454240200. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 11:33:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:33:30,284][06909] Updated weights for policy 0, policy_version 216763 (0.0025) [2024-06-28 11:33:32,041][06887] Signal inference workers to stop experience collection... (48900 times) [2024-06-28 11:33:32,041][06887] Signal inference workers to resume experience collection... (48900 times) [2024-06-28 11:33:32,054][06909] InferenceWorker_p0-w0: stopping experience collection (48900 times) [2024-06-28 11:33:32,054][06909] InferenceWorker_p0-w0: resuming experience collection (48900 times) [2024-06-28 11:33:33,850][06674] Fps is (10 sec: 45875.8, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 3551592448. Throughput: 0: 43857.1. Samples: 3454507380. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 11:33:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:33:34,346][06909] Updated weights for policy 0, policy_version 216773 (0.0027) [2024-06-28 11:33:38,194][06909] Updated weights for policy 0, policy_version 216783 (0.0030) [2024-06-28 11:33:38,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3551805440. Throughput: 0: 44036.0. Samples: 3454776140. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 11:33:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:33:41,646][06909] Updated weights for policy 0, policy_version 216793 (0.0024) [2024-06-28 11:33:43,852][06674] Fps is (10 sec: 44227.4, 60 sec: 43689.2, 300 sec: 44042.1). Total num frames: 3552034816. Throughput: 0: 43910.9. Samples: 3454903540. Policy #0 lag: (min: 1.0, avg: 10.7, max: 24.0) [2024-06-28 11:33:43,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:33:45,398][06909] Updated weights for policy 0, policy_version 216803 (0.0032) [2024-06-28 11:33:48,852][06674] Fps is (10 sec: 44227.8, 60 sec: 43962.3, 300 sec: 43931.0). Total num frames: 3552247808. Throughput: 0: 43773.2. Samples: 3455165060. Policy #0 lag: (min: 1.0, avg: 10.7, max: 24.0) [2024-06-28 11:33:48,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:33:48,868][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000216812_3552247808.pth... [2024-06-28 11:33:48,922][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000216167_3541680128.pth [2024-06-28 11:33:49,185][06909] Updated weights for policy 0, policy_version 216813 (0.0038) [2024-06-28 11:33:53,092][06909] Updated weights for policy 0, policy_version 216823 (0.0037) [2024-06-28 11:33:53,850][06674] Fps is (10 sec: 42607.2, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3552460800. Throughput: 0: 44110.7. Samples: 3455436420. Policy #0 lag: (min: 1.0, avg: 10.7, max: 24.0) [2024-06-28 11:33:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:33:56,647][06909] Updated weights for policy 0, policy_version 216833 (0.0043) [2024-06-28 11:33:58,850][06674] Fps is (10 sec: 42607.2, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 3552673792. Throughput: 0: 43996.0. Samples: 3455569540. Policy #0 lag: (min: 1.0, avg: 10.7, max: 24.0) [2024-06-28 11:33:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:34:00,197][06909] Updated weights for policy 0, policy_version 216843 (0.0026) [2024-06-28 11:34:03,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43695.0, 300 sec: 43876.7). Total num frames: 3552903168. Throughput: 0: 44031.1. Samples: 3455828180. Policy #0 lag: (min: 1.0, avg: 10.7, max: 24.0) [2024-06-28 11:34:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 11:34:04,042][06909] Updated weights for policy 0, policy_version 216853 (0.0041) [2024-06-28 11:34:07,435][06909] Updated weights for policy 0, policy_version 216863 (0.0028) [2024-06-28 11:34:08,852][06674] Fps is (10 sec: 44227.8, 60 sec: 43689.2, 300 sec: 44042.1). Total num frames: 3553116160. Throughput: 0: 44096.3. Samples: 3456097240. Policy #0 lag: (min: 1.0, avg: 10.7, max: 24.0) [2024-06-28 11:34:08,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:34:11,177][06909] Updated weights for policy 0, policy_version 216873 (0.0032) [2024-06-28 11:34:13,850][06674] Fps is (10 sec: 45875.9, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3553361920. Throughput: 0: 44172.5. Samples: 3456227960. Policy #0 lag: (min: 1.0, avg: 10.7, max: 24.0) [2024-06-28 11:34:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:34:15,464][06909] Updated weights for policy 0, policy_version 216883 (0.0026) [2024-06-28 11:34:18,752][06909] Updated weights for policy 0, policy_version 216893 (0.0034) [2024-06-28 11:34:18,850][06674] Fps is (10 sec: 45884.7, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 3553574912. Throughput: 0: 44128.9. Samples: 3456493180. Policy #0 lag: (min: 1.0, avg: 10.7, max: 24.0) [2024-06-28 11:34:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:34:22,969][06909] Updated weights for policy 0, policy_version 216903 (0.0034) [2024-06-28 11:34:23,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44236.9, 300 sec: 44042.7). Total num frames: 3553787904. Throughput: 0: 44221.9. Samples: 3456766120. Policy #0 lag: (min: 1.0, avg: 10.7, max: 24.0) [2024-06-28 11:34:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:34:25,923][06909] Updated weights for policy 0, policy_version 216913 (0.0033) [2024-06-28 11:34:28,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 3554017280. Throughput: 0: 44261.6. Samples: 3456895220. Policy #0 lag: (min: 1.0, avg: 10.7, max: 24.0) [2024-06-28 11:34:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:34:30,124][06909] Updated weights for policy 0, policy_version 216923 (0.0034) [2024-06-28 11:34:33,669][06909] Updated weights for policy 0, policy_version 216933 (0.0029) [2024-06-28 11:34:33,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3554230272. Throughput: 0: 44275.9. Samples: 3457157380. Policy #0 lag: (min: 1.0, avg: 10.7, max: 24.0) [2024-06-28 11:34:33,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:34:37,469][06909] Updated weights for policy 0, policy_version 216943 (0.0046) [2024-06-28 11:34:38,850][06674] Fps is (10 sec: 42597.7, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3554443264. Throughput: 0: 44202.5. Samples: 3457425540. Policy #0 lag: (min: 1.0, avg: 10.7, max: 24.0) [2024-06-28 11:34:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:34:41,076][06909] Updated weights for policy 0, policy_version 216953 (0.0030) [2024-06-28 11:34:42,778][06887] Signal inference workers to stop experience collection... (48950 times) [2024-06-28 11:34:42,818][06909] InferenceWorker_p0-w0: stopping experience collection (48950 times) [2024-06-28 11:34:42,841][06887] Signal inference workers to resume experience collection... (48950 times) [2024-06-28 11:34:42,841][06909] InferenceWorker_p0-w0: resuming experience collection (48950 times) [2024-06-28 11:34:43,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43965.1, 300 sec: 44042.4). Total num frames: 3554672640. Throughput: 0: 44073.2. Samples: 3457552840. Policy #0 lag: (min: 1.0, avg: 10.7, max: 24.0) [2024-06-28 11:34:43,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:34:45,266][06909] Updated weights for policy 0, policy_version 216963 (0.0040) [2024-06-28 11:34:48,261][06909] Updated weights for policy 0, policy_version 216973 (0.0025) [2024-06-28 11:34:48,850][06674] Fps is (10 sec: 45875.9, 60 sec: 44238.3, 300 sec: 43986.9). Total num frames: 3554902016. Throughput: 0: 44097.0. Samples: 3457812540. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 11:34:48,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:34:52,434][06909] Updated weights for policy 0, policy_version 216983 (0.0026) [2024-06-28 11:34:53,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 3555115008. Throughput: 0: 44315.7. Samples: 3458091360. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 11:34:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:34:55,902][06909] Updated weights for policy 0, policy_version 216993 (0.0033) [2024-06-28 11:34:58,850][06674] Fps is (10 sec: 44237.3, 60 sec: 44510.0, 300 sec: 44042.4). Total num frames: 3555344384. Throughput: 0: 44207.2. Samples: 3458217280. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 11:34:58,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 11:34:59,884][06909] Updated weights for policy 0, policy_version 217003 (0.0038) [2024-06-28 11:35:03,004][06909] Updated weights for policy 0, policy_version 217013 (0.0028) [2024-06-28 11:35:03,850][06674] Fps is (10 sec: 47513.3, 60 sec: 44782.9, 300 sec: 44098.8). Total num frames: 3555590144. Throughput: 0: 44363.9. Samples: 3458489560. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 11:35:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:35:07,022][06909] Updated weights for policy 0, policy_version 217023 (0.0038) [2024-06-28 11:35:08,850][06674] Fps is (10 sec: 44236.0, 60 sec: 44511.3, 300 sec: 44153.5). Total num frames: 3555786752. Throughput: 0: 44241.7. Samples: 3458757000. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 11:35:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:35:10,342][06909] Updated weights for policy 0, policy_version 217033 (0.0027) [2024-06-28 11:35:13,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3555999744. Throughput: 0: 44325.3. Samples: 3458889860. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 11:35:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:35:14,195][06909] Updated weights for policy 0, policy_version 217043 (0.0035) [2024-06-28 11:35:17,879][06909] Updated weights for policy 0, policy_version 217053 (0.0044) [2024-06-28 11:35:18,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3556229120. Throughput: 0: 44173.3. Samples: 3459145180. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 11:35:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:35:22,267][06909] Updated weights for policy 0, policy_version 217063 (0.0028) [2024-06-28 11:35:23,852][06674] Fps is (10 sec: 45866.9, 60 sec: 44508.5, 300 sec: 44153.3). Total num frames: 3556458496. Throughput: 0: 44161.9. Samples: 3459412900. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 11:35:23,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:35:25,437][06909] Updated weights for policy 0, policy_version 217073 (0.0037) [2024-06-28 11:35:28,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 3556671488. Throughput: 0: 44257.9. Samples: 3459544440. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 11:35:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:35:29,474][06909] Updated weights for policy 0, policy_version 217083 (0.0029) [2024-06-28 11:35:32,719][06909] Updated weights for policy 0, policy_version 217093 (0.0027) [2024-06-28 11:35:33,850][06674] Fps is (10 sec: 44244.5, 60 sec: 44509.8, 300 sec: 44097.9). Total num frames: 3556900864. Throughput: 0: 44433.7. Samples: 3459812060. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 11:35:33,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:35:37,139][06909] Updated weights for policy 0, policy_version 217103 (0.0038) [2024-06-28 11:35:38,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 3557097472. Throughput: 0: 44055.6. Samples: 3460073860. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 11:35:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:35:40,256][06909] Updated weights for policy 0, policy_version 217113 (0.0027) [2024-06-28 11:35:43,850][06674] Fps is (10 sec: 40960.7, 60 sec: 43963.9, 300 sec: 43986.9). Total num frames: 3557310464. Throughput: 0: 44151.1. Samples: 3460204080. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 11:35:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:35:44,317][06909] Updated weights for policy 0, policy_version 217123 (0.0025) [2024-06-28 11:35:47,535][06909] Updated weights for policy 0, policy_version 217133 (0.0041) [2024-06-28 11:35:48,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3557556224. Throughput: 0: 43966.3. Samples: 3460468040. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 11:35:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:35:48,871][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000217136_3557556224.pth... [2024-06-28 11:35:48,927][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000216491_3546988544.pth [2024-06-28 11:35:51,825][06909] Updated weights for policy 0, policy_version 217143 (0.0036) [2024-06-28 11:35:53,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3557752832. Throughput: 0: 43778.7. Samples: 3460727040. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 11:35:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:35:55,331][06909] Updated weights for policy 0, policy_version 217153 (0.0026) [2024-06-28 11:35:58,850][06674] Fps is (10 sec: 40960.6, 60 sec: 43690.6, 300 sec: 43931.4). Total num frames: 3557965824. Throughput: 0: 43677.0. Samples: 3460855320. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-28 11:35:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:35:59,611][06909] Updated weights for policy 0, policy_version 217163 (0.0041) [2024-06-28 11:36:02,738][06909] Updated weights for policy 0, policy_version 217173 (0.0032) [2024-06-28 11:36:03,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43690.8, 300 sec: 44042.4). Total num frames: 3558211584. Throughput: 0: 43926.7. Samples: 3461121880. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-28 11:36:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:36:06,829][06909] Updated weights for policy 0, policy_version 217183 (0.0029) [2024-06-28 11:36:08,850][06674] Fps is (10 sec: 44236.0, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 3558408192. Throughput: 0: 43933.3. Samples: 3461389820. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-28 11:36:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:36:09,771][06887] Signal inference workers to stop experience collection... (49000 times) [2024-06-28 11:36:09,774][06887] Signal inference workers to resume experience collection... (49000 times) [2024-06-28 11:36:09,799][06909] InferenceWorker_p0-w0: stopping experience collection (49000 times) [2024-06-28 11:36:09,799][06909] InferenceWorker_p0-w0: resuming experience collection (49000 times) [2024-06-28 11:36:10,257][06909] Updated weights for policy 0, policy_version 217193 (0.0036) [2024-06-28 11:36:13,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3558637568. Throughput: 0: 43849.4. Samples: 3461517660. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-28 11:36:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:36:14,078][06909] Updated weights for policy 0, policy_version 217203 (0.0042) [2024-06-28 11:36:17,676][06909] Updated weights for policy 0, policy_version 217213 (0.0039) [2024-06-28 11:36:18,850][06674] Fps is (10 sec: 47513.9, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 3558883328. Throughput: 0: 43917.4. Samples: 3461788340. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-28 11:36:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:36:21,318][06909] Updated weights for policy 0, policy_version 217223 (0.0025) [2024-06-28 11:36:23,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43418.9, 300 sec: 43986.9). Total num frames: 3559063552. Throughput: 0: 44060.5. Samples: 3462056580. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-28 11:36:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:36:25,033][06909] Updated weights for policy 0, policy_version 217233 (0.0032) [2024-06-28 11:36:28,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 3559292928. Throughput: 0: 43793.2. Samples: 3462174780. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-28 11:36:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:36:28,912][06909] Updated weights for policy 0, policy_version 217243 (0.0030) [2024-06-28 11:36:32,516][06909] Updated weights for policy 0, policy_version 217253 (0.0020) [2024-06-28 11:36:33,850][06674] Fps is (10 sec: 47513.5, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 3559538688. Throughput: 0: 44014.7. Samples: 3462448700. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-28 11:36:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:36:36,578][06909] Updated weights for policy 0, policy_version 217263 (0.0036) [2024-06-28 11:36:38,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3559735296. Throughput: 0: 44213.8. Samples: 3462716660. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-28 11:36:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:36:39,831][06909] Updated weights for policy 0, policy_version 217273 (0.0029) [2024-06-28 11:36:43,835][06909] Updated weights for policy 0, policy_version 217283 (0.0033) [2024-06-28 11:36:43,850][06674] Fps is (10 sec: 42598.8, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3559964672. Throughput: 0: 44191.5. Samples: 3462843940. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-28 11:36:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:36:47,180][06909] Updated weights for policy 0, policy_version 217293 (0.0039) [2024-06-28 11:36:48,850][06674] Fps is (10 sec: 49151.7, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 3560226816. Throughput: 0: 44231.5. Samples: 3463112300. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-28 11:36:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:36:51,218][06909] Updated weights for policy 0, policy_version 217303 (0.0033) [2024-06-28 11:36:53,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43690.6, 300 sec: 43820.3). Total num frames: 3560374272. Throughput: 0: 44153.4. Samples: 3463376720. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-28 11:36:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:36:54,958][06909] Updated weights for policy 0, policy_version 217313 (0.0041) [2024-06-28 11:36:58,850][06674] Fps is (10 sec: 37682.9, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 3560603648. Throughput: 0: 44021.2. Samples: 3463498620. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-28 11:36:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:36:58,886][06909] Updated weights for policy 0, policy_version 217323 (0.0031) [2024-06-28 11:37:02,482][06909] Updated weights for policy 0, policy_version 217333 (0.0046) [2024-06-28 11:37:03,850][06674] Fps is (10 sec: 49151.8, 60 sec: 44236.7, 300 sec: 44098.0). Total num frames: 3560865792. Throughput: 0: 43903.6. Samples: 3463764000. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-28 11:37:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:37:06,606][06909] Updated weights for policy 0, policy_version 217343 (0.0032) [2024-06-28 11:37:08,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 3561046016. Throughput: 0: 43707.5. Samples: 3464023420. Policy #0 lag: (min: 0.0, avg: 13.0, max: 23.0) [2024-06-28 11:37:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:37:09,979][06909] Updated weights for policy 0, policy_version 217353 (0.0025) [2024-06-28 11:37:13,760][06909] Updated weights for policy 0, policy_version 217363 (0.0029) [2024-06-28 11:37:13,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3561275392. Throughput: 0: 43994.3. Samples: 3464154520. Policy #0 lag: (min: 0.0, avg: 13.0, max: 23.0) [2024-06-28 11:37:13,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:37:17,289][06909] Updated weights for policy 0, policy_version 217373 (0.0032) [2024-06-28 11:37:18,850][06674] Fps is (10 sec: 47514.1, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 3561521152. Throughput: 0: 43943.2. Samples: 3464426140. Policy #0 lag: (min: 0.0, avg: 13.0, max: 23.0) [2024-06-28 11:37:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:37:21,335][06909] Updated weights for policy 0, policy_version 217383 (0.0031) [2024-06-28 11:37:23,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 3561684992. Throughput: 0: 43785.3. Samples: 3464687000. Policy #0 lag: (min: 0.0, avg: 13.0, max: 23.0) [2024-06-28 11:37:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:37:24,891][06909] Updated weights for policy 0, policy_version 217393 (0.0035) [2024-06-28 11:37:28,654][06909] Updated weights for policy 0, policy_version 217403 (0.0033) [2024-06-28 11:37:28,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3561930752. Throughput: 0: 43644.8. Samples: 3464807960. Policy #0 lag: (min: 0.0, avg: 13.0, max: 23.0) [2024-06-28 11:37:28,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:37:32,735][06909] Updated weights for policy 0, policy_version 217413 (0.0030) [2024-06-28 11:37:33,850][06674] Fps is (10 sec: 49151.6, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 3562176512. Throughput: 0: 43710.6. Samples: 3465079280. Policy #0 lag: (min: 0.0, avg: 13.0, max: 23.0) [2024-06-28 11:37:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:37:36,082][06909] Updated weights for policy 0, policy_version 217423 (0.0026) [2024-06-28 11:37:38,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43963.6, 300 sec: 43931.3). Total num frames: 3562373120. Throughput: 0: 43809.2. Samples: 3465348140. Policy #0 lag: (min: 0.0, avg: 13.0, max: 23.0) [2024-06-28 11:37:38,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:37:40,126][06909] Updated weights for policy 0, policy_version 217433 (0.0038) [2024-06-28 11:37:43,850][06674] Fps is (10 sec: 39321.9, 60 sec: 43417.6, 300 sec: 43931.4). Total num frames: 3562569728. Throughput: 0: 43679.7. Samples: 3465464200. Policy #0 lag: (min: 0.0, avg: 13.0, max: 23.0) [2024-06-28 11:37:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:37:43,946][06887] Signal inference workers to stop experience collection... (49050 times) [2024-06-28 11:37:43,947][06887] Signal inference workers to resume experience collection... (49050 times) [2024-06-28 11:37:43,961][06909] Updated weights for policy 0, policy_version 217443 (0.0033) [2024-06-28 11:37:43,980][06909] InferenceWorker_p0-w0: stopping experience collection (49050 times) [2024-06-28 11:37:43,980][06909] InferenceWorker_p0-w0: resuming experience collection (49050 times) [2024-06-28 11:37:47,612][06909] Updated weights for policy 0, policy_version 217453 (0.0028) [2024-06-28 11:37:48,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43417.5, 300 sec: 44098.0). Total num frames: 3562831872. Throughput: 0: 43837.7. Samples: 3465736700. Policy #0 lag: (min: 0.0, avg: 13.0, max: 23.0) [2024-06-28 11:37:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:37:48,965][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000217459_3562848256.pth... [2024-06-28 11:37:49,030][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000216812_3552247808.pth [2024-06-28 11:37:51,214][06909] Updated weights for policy 0, policy_version 217463 (0.0038) [2024-06-28 11:37:53,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3563028480. Throughput: 0: 43833.9. Samples: 3465995940. Policy #0 lag: (min: 0.0, avg: 13.0, max: 23.0) [2024-06-28 11:37:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:37:55,209][06909] Updated weights for policy 0, policy_version 217473 (0.0037) [2024-06-28 11:37:58,694][06909] Updated weights for policy 0, policy_version 217483 (0.0029) [2024-06-28 11:37:58,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43963.8, 300 sec: 43932.2). Total num frames: 3563241472. Throughput: 0: 43651.1. Samples: 3466118820. Policy #0 lag: (min: 0.0, avg: 13.0, max: 23.0) [2024-06-28 11:37:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:38:02,456][06909] Updated weights for policy 0, policy_version 217493 (0.0030) [2024-06-28 11:38:03,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 3563487232. Throughput: 0: 43731.1. Samples: 3466394040. Policy #0 lag: (min: 0.0, avg: 13.0, max: 23.0) [2024-06-28 11:38:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:38:06,182][06909] Updated weights for policy 0, policy_version 217503 (0.0025) [2024-06-28 11:38:08,856][06674] Fps is (10 sec: 44209.9, 60 sec: 43959.3, 300 sec: 43986.0). Total num frames: 3563683840. Throughput: 0: 43778.1. Samples: 3466657280. Policy #0 lag: (min: 0.0, avg: 13.0, max: 23.0) [2024-06-28 11:38:08,856][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:38:10,025][06909] Updated weights for policy 0, policy_version 217513 (0.0031) [2024-06-28 11:38:13,319][06909] Updated weights for policy 0, policy_version 217523 (0.0024) [2024-06-28 11:38:13,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3563896832. Throughput: 0: 43854.3. Samples: 3466781400. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-28 11:38:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:38:17,269][06909] Updated weights for policy 0, policy_version 217533 (0.0031) [2024-06-28 11:38:18,850][06674] Fps is (10 sec: 45902.9, 60 sec: 43690.6, 300 sec: 44098.0). Total num frames: 3564142592. Throughput: 0: 43900.0. Samples: 3467054780. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-28 11:38:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:38:20,995][06909] Updated weights for policy 0, policy_version 217543 (0.0026) [2024-06-28 11:38:23,850][06674] Fps is (10 sec: 44236.2, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 3564339200. Throughput: 0: 43692.5. Samples: 3467314300. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-28 11:38:23,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:38:24,973][06909] Updated weights for policy 0, policy_version 217553 (0.0054) [2024-06-28 11:38:28,293][06909] Updated weights for policy 0, policy_version 217563 (0.0050) [2024-06-28 11:38:28,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3564584960. Throughput: 0: 44057.8. Samples: 3467446800. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-28 11:38:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:38:32,240][06909] Updated weights for policy 0, policy_version 217573 (0.0033) [2024-06-28 11:38:33,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 3564797952. Throughput: 0: 43853.8. Samples: 3467710120. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-28 11:38:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:38:35,599][06909] Updated weights for policy 0, policy_version 217583 (0.0033) [2024-06-28 11:38:38,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43690.8, 300 sec: 43931.7). Total num frames: 3564994560. Throughput: 0: 44152.1. Samples: 3467982780. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-28 11:38:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:38:39,530][06909] Updated weights for policy 0, policy_version 217593 (0.0031) [2024-06-28 11:38:43,319][06909] Updated weights for policy 0, policy_version 217603 (0.0035) [2024-06-28 11:38:43,856][06674] Fps is (10 sec: 44211.4, 60 sec: 44505.6, 300 sec: 44041.9). Total num frames: 3565240320. Throughput: 0: 44113.9. Samples: 3468104200. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-28 11:38:43,856][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:38:47,286][06909] Updated weights for policy 0, policy_version 217613 (0.0036) [2024-06-28 11:38:47,596][06887] Signal inference workers to stop experience collection... (49100 times) [2024-06-28 11:38:47,596][06887] Signal inference workers to resume experience collection... (49100 times) [2024-06-28 11:38:47,609][06909] InferenceWorker_p0-w0: stopping experience collection (49100 times) [2024-06-28 11:38:47,610][06909] InferenceWorker_p0-w0: resuming experience collection (49100 times) [2024-06-28 11:38:48,852][06674] Fps is (10 sec: 45865.2, 60 sec: 43689.2, 300 sec: 44042.1). Total num frames: 3565453312. Throughput: 0: 43816.1. Samples: 3468365860. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-28 11:38:48,853][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:38:51,064][06909] Updated weights for policy 0, policy_version 217623 (0.0032) [2024-06-28 11:38:53,850][06674] Fps is (10 sec: 40983.6, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3565649920. Throughput: 0: 43736.1. Samples: 3468625140. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-28 11:38:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:38:54,871][06909] Updated weights for policy 0, policy_version 217633 (0.0027) [2024-06-28 11:38:58,357][06909] Updated weights for policy 0, policy_version 217643 (0.0029) [2024-06-28 11:38:58,850][06674] Fps is (10 sec: 44246.1, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3565895680. Throughput: 0: 43907.9. Samples: 3468757260. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-28 11:38:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:39:02,200][06909] Updated weights for policy 0, policy_version 217653 (0.0036) [2024-06-28 11:39:03,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43690.7, 300 sec: 44042.7). Total num frames: 3566108672. Throughput: 0: 43811.2. Samples: 3469026280. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-28 11:39:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 11:39:05,678][06909] Updated weights for policy 0, policy_version 217663 (0.0035) [2024-06-28 11:39:08,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43968.2, 300 sec: 43931.3). Total num frames: 3566321664. Throughput: 0: 43998.8. Samples: 3469294240. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-28 11:39:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:39:09,366][06909] Updated weights for policy 0, policy_version 217673 (0.0028) [2024-06-28 11:39:12,821][06909] Updated weights for policy 0, policy_version 217683 (0.0022) [2024-06-28 11:39:13,852][06674] Fps is (10 sec: 45865.3, 60 sec: 44508.3, 300 sec: 44042.1). Total num frames: 3566567424. Throughput: 0: 43992.2. Samples: 3469426540. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-28 11:39:13,853][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:39:16,627][06909] Updated weights for policy 0, policy_version 217693 (0.0024) [2024-06-28 11:39:18,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3566764032. Throughput: 0: 43990.7. Samples: 3469689700. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-28 11:39:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:39:20,515][06909] Updated weights for policy 0, policy_version 217703 (0.0022) [2024-06-28 11:39:23,850][06674] Fps is (10 sec: 42607.4, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 3566993408. Throughput: 0: 43940.4. Samples: 3469960100. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2024-06-28 11:39:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:39:24,037][06909] Updated weights for policy 0, policy_version 217713 (0.0029) [2024-06-28 11:39:28,030][06909] Updated weights for policy 0, policy_version 217723 (0.0024) [2024-06-28 11:39:28,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 3567222784. Throughput: 0: 44136.7. Samples: 3470090100. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2024-06-28 11:39:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:39:31,718][06909] Updated weights for policy 0, policy_version 217733 (0.0040) [2024-06-28 11:39:33,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 3567419392. Throughput: 0: 44208.2. Samples: 3470355140. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2024-06-28 11:39:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:39:35,227][06909] Updated weights for policy 0, policy_version 217743 (0.0034) [2024-06-28 11:39:38,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43963.7, 300 sec: 43931.4). Total num frames: 3567632384. Throughput: 0: 44458.7. Samples: 3470625780. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2024-06-28 11:39:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:39:39,024][06909] Updated weights for policy 0, policy_version 217753 (0.0036) [2024-06-28 11:39:42,554][06909] Updated weights for policy 0, policy_version 217763 (0.0034) [2024-06-28 11:39:43,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44241.0, 300 sec: 44042.4). Total num frames: 3567894528. Throughput: 0: 44423.9. Samples: 3470756340. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2024-06-28 11:39:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:39:46,362][06909] Updated weights for policy 0, policy_version 217773 (0.0028) [2024-06-28 11:39:48,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43692.1, 300 sec: 43931.3). Total num frames: 3568074752. Throughput: 0: 44138.1. Samples: 3471012500. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2024-06-28 11:39:48,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:39:48,932][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000217779_3568091136.pth... [2024-06-28 11:39:48,990][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000217136_3557556224.pth [2024-06-28 11:39:50,210][06909] Updated weights for policy 0, policy_version 217783 (0.0032) [2024-06-28 11:39:53,719][06909] Updated weights for policy 0, policy_version 217793 (0.0028) [2024-06-28 11:39:53,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44509.8, 300 sec: 43986.8). Total num frames: 3568320512. Throughput: 0: 44076.8. Samples: 3471277700. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2024-06-28 11:39:53,856][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:39:57,547][06909] Updated weights for policy 0, policy_version 217803 (0.0036) [2024-06-28 11:39:58,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 3568517120. Throughput: 0: 44113.6. Samples: 3471411560. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2024-06-28 11:39:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:40:01,391][06909] Updated weights for policy 0, policy_version 217813 (0.0044) [2024-06-28 11:40:03,856][06674] Fps is (10 sec: 40935.3, 60 sec: 43686.2, 300 sec: 43874.9). Total num frames: 3568730112. Throughput: 0: 43935.4. Samples: 3471667060. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2024-06-28 11:40:03,856][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:40:04,889][06909] Updated weights for policy 0, policy_version 217823 (0.0025) [2024-06-28 11:40:08,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 3568959488. Throughput: 0: 43901.3. Samples: 3471935660. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2024-06-28 11:40:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:40:09,160][06909] Updated weights for policy 0, policy_version 217833 (0.0038) [2024-06-28 11:40:11,456][06887] Signal inference workers to stop experience collection... (49150 times) [2024-06-28 11:40:11,506][06909] InferenceWorker_p0-w0: stopping experience collection (49150 times) [2024-06-28 11:40:11,572][06887] Signal inference workers to resume experience collection... (49150 times) [2024-06-28 11:40:11,573][06909] InferenceWorker_p0-w0: resuming experience collection (49150 times) [2024-06-28 11:40:12,319][06909] Updated weights for policy 0, policy_version 217843 (0.0026) [2024-06-28 11:40:13,852][06674] Fps is (10 sec: 45893.7, 60 sec: 43690.7, 300 sec: 43931.0). Total num frames: 3569188864. Throughput: 0: 44128.3. Samples: 3472075960. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2024-06-28 11:40:13,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:40:16,301][06909] Updated weights for policy 0, policy_version 217853 (0.0038) [2024-06-28 11:40:18,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.7, 300 sec: 43820.5). Total num frames: 3569385472. Throughput: 0: 43891.2. Samples: 3472330240. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2024-06-28 11:40:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:40:19,834][06909] Updated weights for policy 0, policy_version 217863 (0.0040) [2024-06-28 11:40:23,495][06909] Updated weights for policy 0, policy_version 217873 (0.0033) [2024-06-28 11:40:23,850][06674] Fps is (10 sec: 45884.9, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3569647616. Throughput: 0: 43800.5. Samples: 3472596800. Policy #0 lag: (min: 0.0, avg: 9.3, max: 19.0) [2024-06-28 11:40:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:40:27,527][06909] Updated weights for policy 0, policy_version 217883 (0.0029) [2024-06-28 11:40:28,850][06674] Fps is (10 sec: 47513.3, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 3569860608. Throughput: 0: 43980.1. Samples: 3472735440. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 11:40:28,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:40:30,798][06909] Updated weights for policy 0, policy_version 217893 (0.0028) [2024-06-28 11:40:33,850][06674] Fps is (10 sec: 42597.5, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3570073600. Throughput: 0: 44015.5. Samples: 3472993200. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 11:40:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:40:34,891][06909] Updated weights for policy 0, policy_version 217903 (0.0027) [2024-06-28 11:40:38,431][06909] Updated weights for policy 0, policy_version 217913 (0.0031) [2024-06-28 11:40:38,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 3570286592. Throughput: 0: 44070.7. Samples: 3473260880. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 11:40:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:40:42,236][06909] Updated weights for policy 0, policy_version 217923 (0.0032) [2024-06-28 11:40:43,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 3570515968. Throughput: 0: 44087.0. Samples: 3473395480. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 11:40:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:40:46,100][06909] Updated weights for policy 0, policy_version 217933 (0.0030) [2024-06-28 11:40:48,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3570728960. Throughput: 0: 44173.4. Samples: 3473654600. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 11:40:48,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:40:49,575][06909] Updated weights for policy 0, policy_version 217943 (0.0021) [2024-06-28 11:40:53,432][06909] Updated weights for policy 0, policy_version 217953 (0.0026) [2024-06-28 11:40:53,850][06674] Fps is (10 sec: 45876.0, 60 sec: 44236.9, 300 sec: 44097.9). Total num frames: 3570974720. Throughput: 0: 44238.8. Samples: 3473926400. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 11:40:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:40:57,303][06909] Updated weights for policy 0, policy_version 217963 (0.0029) [2024-06-28 11:40:58,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 3571171328. Throughput: 0: 44119.3. Samples: 3474061240. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 11:40:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:41:00,625][06909] Updated weights for policy 0, policy_version 217973 (0.0027) [2024-06-28 11:41:03,850][06674] Fps is (10 sec: 40959.0, 60 sec: 44241.2, 300 sec: 43986.9). Total num frames: 3571384320. Throughput: 0: 44125.6. Samples: 3474315900. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 11:41:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 11:41:04,659][06909] Updated weights for policy 0, policy_version 217983 (0.0032) [2024-06-28 11:41:07,901][06909] Updated weights for policy 0, policy_version 217993 (0.0038) [2024-06-28 11:41:08,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 3571630080. Throughput: 0: 44061.2. Samples: 3474579560. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 11:41:08,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:41:12,214][06909] Updated weights for policy 0, policy_version 218003 (0.0030) [2024-06-28 11:41:13,850][06674] Fps is (10 sec: 45875.8, 60 sec: 44238.3, 300 sec: 43931.3). Total num frames: 3571843072. Throughput: 0: 44004.9. Samples: 3474715660. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 11:41:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:41:15,602][06909] Updated weights for policy 0, policy_version 218013 (0.0039) [2024-06-28 11:41:18,850][06674] Fps is (10 sec: 42598.8, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 3572056064. Throughput: 0: 44147.3. Samples: 3474979820. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 11:41:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:41:19,532][06909] Updated weights for policy 0, policy_version 218023 (0.0030) [2024-06-28 11:41:23,066][06909] Updated weights for policy 0, policy_version 218033 (0.0027) [2024-06-28 11:41:23,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3572285440. Throughput: 0: 44032.1. Samples: 3475242320. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 11:41:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 11:41:27,095][06909] Updated weights for policy 0, policy_version 218043 (0.0030) [2024-06-28 11:41:28,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 3572498432. Throughput: 0: 44031.6. Samples: 3475376900. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 11:41:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:41:30,454][06909] Updated weights for policy 0, policy_version 218053 (0.0037) [2024-06-28 11:41:33,853][06674] Fps is (10 sec: 40946.5, 60 sec: 43688.4, 300 sec: 43930.8). Total num frames: 3572695040. Throughput: 0: 43895.1. Samples: 3475630020. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 11:41:33,854][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:41:34,658][06909] Updated weights for policy 0, policy_version 218063 (0.0025) [2024-06-28 11:41:38,168][06909] Updated weights for policy 0, policy_version 218073 (0.0031) [2024-06-28 11:41:38,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 3572957184. Throughput: 0: 43690.6. Samples: 3475892480. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 11:41:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 11:41:40,691][06887] Signal inference workers to stop experience collection... (49200 times) [2024-06-28 11:41:40,692][06887] Signal inference workers to resume experience collection... (49200 times) [2024-06-28 11:41:40,739][06909] InferenceWorker_p0-w0: stopping experience collection (49200 times) [2024-06-28 11:41:40,739][06909] InferenceWorker_p0-w0: resuming experience collection (49200 times) [2024-06-28 11:41:42,507][06909] Updated weights for policy 0, policy_version 218083 (0.0036) [2024-06-28 11:41:43,850][06674] Fps is (10 sec: 45890.3, 60 sec: 43963.8, 300 sec: 43820.3). Total num frames: 3573153792. Throughput: 0: 43669.3. Samples: 3476026360. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 11:41:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:41:45,613][06909] Updated weights for policy 0, policy_version 218093 (0.0047) [2024-06-28 11:41:48,850][06674] Fps is (10 sec: 39321.1, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 3573350400. Throughput: 0: 43816.5. Samples: 3476287640. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 11:41:48,851][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:41:48,858][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000218100_3573350400.pth... [2024-06-28 11:41:48,910][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000217459_3562848256.pth [2024-06-28 11:41:49,722][06909] Updated weights for policy 0, policy_version 218103 (0.0029) [2024-06-28 11:41:53,137][06909] Updated weights for policy 0, policy_version 218113 (0.0035) [2024-06-28 11:41:53,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 3573596160. Throughput: 0: 43700.6. Samples: 3476546080. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 11:41:53,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:41:57,209][06909] Updated weights for policy 0, policy_version 218123 (0.0035) [2024-06-28 11:41:58,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 3573809152. Throughput: 0: 43723.1. Samples: 3476683200. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 11:41:58,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:42:00,480][06909] Updated weights for policy 0, policy_version 218133 (0.0036) [2024-06-28 11:42:03,850][06674] Fps is (10 sec: 40959.2, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 3574005760. Throughput: 0: 43663.4. Samples: 3476944680. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 11:42:03,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:42:04,634][06909] Updated weights for policy 0, policy_version 218143 (0.0029) [2024-06-28 11:42:08,031][06909] Updated weights for policy 0, policy_version 218153 (0.0031) [2024-06-28 11:42:08,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3574267904. Throughput: 0: 43610.5. Samples: 3477204800. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 11:42:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:42:11,954][06909] Updated weights for policy 0, policy_version 218163 (0.0027) [2024-06-28 11:42:13,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 3574464512. Throughput: 0: 43784.0. Samples: 3477347180. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 11:42:13,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:42:15,549][06909] Updated weights for policy 0, policy_version 218173 (0.0027) [2024-06-28 11:42:18,850][06674] Fps is (10 sec: 39321.6, 60 sec: 43417.5, 300 sec: 43986.9). Total num frames: 3574661120. Throughput: 0: 43868.9. Samples: 3477603980. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 11:42:18,850][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 11:42:19,731][06909] Updated weights for policy 0, policy_version 218183 (0.0025) [2024-06-28 11:42:22,809][06909] Updated weights for policy 0, policy_version 218193 (0.0029) [2024-06-28 11:42:23,850][06674] Fps is (10 sec: 45875.8, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3574923264. Throughput: 0: 43808.6. Samples: 3477863860. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 11:42:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:42:27,116][06909] Updated weights for policy 0, policy_version 218203 (0.0023) [2024-06-28 11:42:28,850][06674] Fps is (10 sec: 45875.9, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 3575119872. Throughput: 0: 43907.1. Samples: 3478002180. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 11:42:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:42:30,478][06909] Updated weights for policy 0, policy_version 218213 (0.0033) [2024-06-28 11:42:33,850][06674] Fps is (10 sec: 39320.9, 60 sec: 43693.0, 300 sec: 43875.8). Total num frames: 3575316480. Throughput: 0: 43828.0. Samples: 3478259900. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 11:42:33,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:42:34,437][06909] Updated weights for policy 0, policy_version 218223 (0.0037) [2024-06-28 11:42:37,735][06909] Updated weights for policy 0, policy_version 218233 (0.0028) [2024-06-28 11:42:38,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43690.6, 300 sec: 44097.9). Total num frames: 3575578624. Throughput: 0: 43998.5. Samples: 3478526020. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 11:42:38,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:42:42,031][06909] Updated weights for policy 0, policy_version 218243 (0.0029) [2024-06-28 11:42:43,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 3575775232. Throughput: 0: 44059.1. Samples: 3478665860. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 11:42:43,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 11:42:45,246][06909] Updated weights for policy 0, policy_version 218253 (0.0034) [2024-06-28 11:42:48,856][06674] Fps is (10 sec: 42573.0, 60 sec: 44232.5, 300 sec: 43986.0). Total num frames: 3576004608. Throughput: 0: 44037.3. Samples: 3478926620. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 11:42:48,856][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:42:49,196][06909] Updated weights for policy 0, policy_version 218263 (0.0036) [2024-06-28 11:42:52,516][06909] Updated weights for policy 0, policy_version 218273 (0.0030) [2024-06-28 11:42:53,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3576233984. Throughput: 0: 44040.6. Samples: 3479186620. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 11:42:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:42:57,030][06909] Updated weights for policy 0, policy_version 218283 (0.0030) [2024-06-28 11:42:57,284][06887] Signal inference workers to stop experience collection... (49250 times) [2024-06-28 11:42:57,331][06909] InferenceWorker_p0-w0: stopping experience collection (49250 times) [2024-06-28 11:42:57,336][06887] Signal inference workers to resume experience collection... (49250 times) [2024-06-28 11:42:57,348][06909] InferenceWorker_p0-w0: resuming experience collection (49250 times) [2024-06-28 11:42:58,850][06674] Fps is (10 sec: 44263.3, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 3576446976. Throughput: 0: 43900.0. Samples: 3479322680. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 11:42:58,853][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:43:00,064][06909] Updated weights for policy 0, policy_version 218293 (0.0032) [2024-06-28 11:43:03,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43963.8, 300 sec: 43932.2). Total num frames: 3576643584. Throughput: 0: 43838.7. Samples: 3479576720. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 11:43:03,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:43:04,454][06909] Updated weights for policy 0, policy_version 218303 (0.0043) [2024-06-28 11:43:07,741][06909] Updated weights for policy 0, policy_version 218313 (0.0022) [2024-06-28 11:43:08,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 3576889344. Throughput: 0: 43903.0. Samples: 3479839500. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 11:43:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:43:11,609][06909] Updated weights for policy 0, policy_version 218323 (0.0039) [2024-06-28 11:43:13,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 3577102336. Throughput: 0: 43856.3. Samples: 3479975720. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 11:43:13,861][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:43:15,174][06909] Updated weights for policy 0, policy_version 218333 (0.0022) [2024-06-28 11:43:18,850][06674] Fps is (10 sec: 42598.1, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3577315328. Throughput: 0: 44046.2. Samples: 3480241980. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 11:43:18,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:43:19,168][06909] Updated weights for policy 0, policy_version 218343 (0.0038) [2024-06-28 11:43:22,834][06909] Updated weights for policy 0, policy_version 218353 (0.0036) [2024-06-28 11:43:23,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.5, 300 sec: 43931.3). Total num frames: 3577544704. Throughput: 0: 43843.5. Samples: 3480498980. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 11:43:23,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:43:26,409][06909] Updated weights for policy 0, policy_version 218363 (0.0044) [2024-06-28 11:43:28,852][06674] Fps is (10 sec: 44226.9, 60 sec: 43962.0, 300 sec: 43931.0). Total num frames: 3577757696. Throughput: 0: 43761.7. Samples: 3480635240. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 11:43:28,853][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:43:30,006][06909] Updated weights for policy 0, policy_version 218373 (0.0029) [2024-06-28 11:43:33,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3577970688. Throughput: 0: 43879.2. Samples: 3480900920. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 11:43:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:43:34,235][06909] Updated weights for policy 0, policy_version 218383 (0.0035) [2024-06-28 11:43:37,586][06909] Updated weights for policy 0, policy_version 218393 (0.0025) [2024-06-28 11:43:38,850][06674] Fps is (10 sec: 45885.8, 60 sec: 43963.8, 300 sec: 43987.7). Total num frames: 3578216448. Throughput: 0: 43912.4. Samples: 3481162680. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 11:43:38,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:43:41,534][06909] Updated weights for policy 0, policy_version 218403 (0.0028) [2024-06-28 11:43:43,852][06674] Fps is (10 sec: 45866.0, 60 sec: 44235.3, 300 sec: 43986.9). Total num frames: 3578429440. Throughput: 0: 43941.6. Samples: 3481300140. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 11:43:43,853][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:43:44,881][06909] Updated weights for policy 0, policy_version 218413 (0.0036) [2024-06-28 11:43:48,647][06909] Updated weights for policy 0, policy_version 218423 (0.0024) [2024-06-28 11:43:48,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43968.1, 300 sec: 44042.4). Total num frames: 3578642432. Throughput: 0: 44239.6. Samples: 3481567500. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 11:43:48,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-28 11:43:48,959][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000218424_3578658816.pth... [2024-06-28 11:43:49,019][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000217779_3568091136.pth [2024-06-28 11:43:52,150][06909] Updated weights for policy 0, policy_version 218433 (0.0033) [2024-06-28 11:43:53,850][06674] Fps is (10 sec: 44245.7, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3578871808. Throughput: 0: 44208.9. Samples: 3481828900. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 11:43:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:43:56,105][06909] Updated weights for policy 0, policy_version 218443 (0.0030) [2024-06-28 11:43:58,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3579084800. Throughput: 0: 44280.1. Samples: 3481968320. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 11:43:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:43:59,595][06909] Updated weights for policy 0, policy_version 218453 (0.0037) [2024-06-28 11:44:03,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 3579281408. Throughput: 0: 44082.7. Samples: 3482225700. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 11:44:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:44:03,898][06909] Updated weights for policy 0, policy_version 218463 (0.0027) [2024-06-28 11:44:07,061][06909] Updated weights for policy 0, policy_version 218473 (0.0031) [2024-06-28 11:44:08,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.7, 300 sec: 43931.6). Total num frames: 3579527168. Throughput: 0: 44199.6. Samples: 3482487960. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 11:44:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:44:11,359][06909] Updated weights for policy 0, policy_version 218483 (0.0029) [2024-06-28 11:44:13,850][06674] Fps is (10 sec: 47514.1, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 3579756544. Throughput: 0: 44216.1. Samples: 3482624860. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 11:44:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:44:14,327][06909] Updated weights for policy 0, policy_version 218493 (0.0028) [2024-06-28 11:44:18,521][06909] Updated weights for policy 0, policy_version 218503 (0.0035) [2024-06-28 11:44:18,850][06674] Fps is (10 sec: 44237.5, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 3579969536. Throughput: 0: 44327.2. Samples: 3482895640. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 11:44:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:44:21,880][06909] Updated weights for policy 0, policy_version 218513 (0.0034) [2024-06-28 11:44:23,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43690.8, 300 sec: 43875.8). Total num frames: 3580166144. Throughput: 0: 44189.4. Samples: 3483151200. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 11:44:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:44:25,752][06909] Updated weights for policy 0, policy_version 218523 (0.0034) [2024-06-28 11:44:28,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44511.6, 300 sec: 44098.0). Total num frames: 3580428288. Throughput: 0: 44144.3. Samples: 3483286540. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 11:44:28,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:44:29,130][06909] Updated weights for policy 0, policy_version 218533 (0.0044) [2024-06-28 11:44:33,516][06909] Updated weights for policy 0, policy_version 218543 (0.0041) [2024-06-28 11:44:33,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3580624896. Throughput: 0: 44114.7. Samples: 3483552660. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 11:44:33,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:44:36,326][06909] Updated weights for policy 0, policy_version 218553 (0.0034) [2024-06-28 11:44:38,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 3580837888. Throughput: 0: 44053.8. Samples: 3483811320. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 11:44:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:44:40,910][06909] Updated weights for policy 0, policy_version 218563 (0.0028) [2024-06-28 11:44:43,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44238.3, 300 sec: 44098.0). Total num frames: 3581083648. Throughput: 0: 43929.4. Samples: 3483945140. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 11:44:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:44:44,129][06909] Updated weights for policy 0, policy_version 218573 (0.0043) [2024-06-28 11:44:48,402][06909] Updated weights for policy 0, policy_version 218583 (0.0028) [2024-06-28 11:44:48,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 3581280256. Throughput: 0: 44161.0. Samples: 3484212940. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 11:44:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:44:50,060][06887] Signal inference workers to stop experience collection... (49300 times) [2024-06-28 11:44:50,060][06887] Signal inference workers to resume experience collection... (49300 times) [2024-06-28 11:44:50,096][06909] InferenceWorker_p0-w0: stopping experience collection (49300 times) [2024-06-28 11:44:50,096][06909] InferenceWorker_p0-w0: resuming experience collection (49300 times) [2024-06-28 11:44:51,417][06909] Updated weights for policy 0, policy_version 218593 (0.0034) [2024-06-28 11:44:53,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3581493248. Throughput: 0: 44061.0. Samples: 3484470700. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 11:44:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:44:55,969][06909] Updated weights for policy 0, policy_version 218603 (0.0041) [2024-06-28 11:44:58,850][06674] Fps is (10 sec: 45874.4, 60 sec: 44236.7, 300 sec: 44098.8). Total num frames: 3581739008. Throughput: 0: 43893.6. Samples: 3484600080. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 11:44:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:44:58,936][06909] Updated weights for policy 0, policy_version 218613 (0.0029) [2024-06-28 11:45:03,199][06909] Updated weights for policy 0, policy_version 218623 (0.0046) [2024-06-28 11:45:03,850][06674] Fps is (10 sec: 45874.3, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 3581952000. Throughput: 0: 43944.7. Samples: 3484873160. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 11:45:03,851][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:45:06,514][06909] Updated weights for policy 0, policy_version 218633 (0.0026) [2024-06-28 11:45:08,850][06674] Fps is (10 sec: 40960.7, 60 sec: 43690.7, 300 sec: 43931.6). Total num frames: 3582148608. Throughput: 0: 43904.4. Samples: 3485126900. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 11:45:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:45:10,926][06909] Updated weights for policy 0, policy_version 218643 (0.0025) [2024-06-28 11:45:13,850][06674] Fps is (10 sec: 44237.6, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 3582394368. Throughput: 0: 43857.3. Samples: 3485260120. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 11:45:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 11:45:13,927][06909] Updated weights for policy 0, policy_version 218653 (0.0052) [2024-06-28 11:45:18,281][06909] Updated weights for policy 0, policy_version 218663 (0.0031) [2024-06-28 11:45:18,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43417.6, 300 sec: 43820.3). Total num frames: 3582574592. Throughput: 0: 43702.8. Samples: 3485519280. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 11:45:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:45:21,498][06909] Updated weights for policy 0, policy_version 218673 (0.0036) [2024-06-28 11:45:23,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 3582803968. Throughput: 0: 43699.7. Samples: 3485777800. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 11:45:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:45:25,842][06909] Updated weights for policy 0, policy_version 218683 (0.0038) [2024-06-28 11:45:28,817][06909] Updated weights for policy 0, policy_version 218693 (0.0034) [2024-06-28 11:45:28,850][06674] Fps is (10 sec: 49150.9, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 3583066112. Throughput: 0: 43849.1. Samples: 3485918360. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 11:45:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:45:33,100][06909] Updated weights for policy 0, policy_version 218703 (0.0025) [2024-06-28 11:45:33,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3583262720. Throughput: 0: 43894.7. Samples: 3486188200. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 11:45:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 11:45:36,273][06909] Updated weights for policy 0, policy_version 218713 (0.0030) [2024-06-28 11:45:38,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 3583475712. Throughput: 0: 43904.8. Samples: 3486446420. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 11:45:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:45:40,499][06909] Updated weights for policy 0, policy_version 218723 (0.0023) [2024-06-28 11:45:43,706][06909] Updated weights for policy 0, policy_version 218733 (0.0027) [2024-06-28 11:45:43,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3583721472. Throughput: 0: 43894.8. Samples: 3486575340. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 11:45:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:45:48,280][06909] Updated weights for policy 0, policy_version 218743 (0.0034) [2024-06-28 11:45:48,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 3583901696. Throughput: 0: 43756.2. Samples: 3486842180. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 11:45:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 11:45:48,967][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000218745_3583918080.pth... [2024-06-28 11:45:49,011][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000218100_3573350400.pth [2024-06-28 11:45:51,546][06909] Updated weights for policy 0, policy_version 218753 (0.0031) [2024-06-28 11:45:53,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 3584131072. Throughput: 0: 43760.3. Samples: 3487096120. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 11:45:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:45:55,680][06909] Updated weights for policy 0, policy_version 218763 (0.0026) [2024-06-28 11:45:58,760][06909] Updated weights for policy 0, policy_version 218773 (0.0030) [2024-06-28 11:45:58,850][06674] Fps is (10 sec: 47512.8, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3584376832. Throughput: 0: 43867.0. Samples: 3487234140. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 11:45:58,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:46:03,162][06909] Updated weights for policy 0, policy_version 218783 (0.0044) [2024-06-28 11:46:03,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43417.8, 300 sec: 43820.3). Total num frames: 3584557056. Throughput: 0: 44049.3. Samples: 3487501500. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 11:46:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:46:06,272][06909] Updated weights for policy 0, policy_version 218793 (0.0025) [2024-06-28 11:46:08,850][06674] Fps is (10 sec: 40960.7, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 3584786432. Throughput: 0: 44173.3. Samples: 3487765600. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 11:46:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:46:10,378][06909] Updated weights for policy 0, policy_version 218803 (0.0051) [2024-06-28 11:46:13,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 3585015808. Throughput: 0: 43911.8. Samples: 3487894380. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 11:46:13,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 11:46:13,949][06909] Updated weights for policy 0, policy_version 218813 (0.0037) [2024-06-28 11:46:18,030][06909] Updated weights for policy 0, policy_version 218823 (0.0040) [2024-06-28 11:46:18,850][06674] Fps is (10 sec: 44236.3, 60 sec: 44236.7, 300 sec: 43875.8). Total num frames: 3585228800. Throughput: 0: 43802.5. Samples: 3488159320. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 11:46:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:46:19,525][06887] Signal inference workers to stop experience collection... (49350 times) [2024-06-28 11:46:19,525][06887] Signal inference workers to resume experience collection... (49350 times) [2024-06-28 11:46:19,542][06909] InferenceWorker_p0-w0: stopping experience collection (49350 times) [2024-06-28 11:46:19,542][06909] InferenceWorker_p0-w0: resuming experience collection (49350 times) [2024-06-28 11:46:21,343][06909] Updated weights for policy 0, policy_version 218833 (0.0041) [2024-06-28 11:46:23,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 3585441792. Throughput: 0: 43868.5. Samples: 3488420500. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 11:46:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:46:25,434][06909] Updated weights for policy 0, policy_version 218843 (0.0027) [2024-06-28 11:46:28,669][06909] Updated weights for policy 0, policy_version 218853 (0.0037) [2024-06-28 11:46:28,850][06674] Fps is (10 sec: 45875.1, 60 sec: 43690.7, 300 sec: 44042.9). Total num frames: 3585687552. Throughput: 0: 43946.2. Samples: 3488552920. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 11:46:28,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:46:33,095][06909] Updated weights for policy 0, policy_version 218863 (0.0041) [2024-06-28 11:46:33,850][06674] Fps is (10 sec: 44236.0, 60 sec: 43690.5, 300 sec: 43820.2). Total num frames: 3585884160. Throughput: 0: 43874.5. Samples: 3488816540. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 11:46:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:46:35,952][06909] Updated weights for policy 0, policy_version 218873 (0.0045) [2024-06-28 11:46:38,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 3586097152. Throughput: 0: 43921.8. Samples: 3489072600. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 11:46:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:46:40,386][06909] Updated weights for policy 0, policy_version 218883 (0.0024) [2024-06-28 11:46:43,704][06909] Updated weights for policy 0, policy_version 218893 (0.0035) [2024-06-28 11:46:43,850][06674] Fps is (10 sec: 45875.8, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 3586342912. Throughput: 0: 43816.1. Samples: 3489205860. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 11:46:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:46:47,547][06909] Updated weights for policy 0, policy_version 218903 (0.0027) [2024-06-28 11:46:48,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 3586539520. Throughput: 0: 43839.9. Samples: 3489474300. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 11:46:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 11:46:50,909][06909] Updated weights for policy 0, policy_version 218913 (0.0035) [2024-06-28 11:46:53,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 3586752512. Throughput: 0: 43860.9. Samples: 3489739340. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 11:46:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:46:55,171][06909] Updated weights for policy 0, policy_version 218923 (0.0038) [2024-06-28 11:46:58,405][06909] Updated weights for policy 0, policy_version 218933 (0.0027) [2024-06-28 11:46:58,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43690.8, 300 sec: 44042.4). Total num frames: 3586998272. Throughput: 0: 43991.5. Samples: 3489874000. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 11:46:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:47:02,738][06909] Updated weights for policy 0, policy_version 218943 (0.0031) [2024-06-28 11:47:03,850][06674] Fps is (10 sec: 47513.0, 60 sec: 44509.7, 300 sec: 43931.3). Total num frames: 3587227648. Throughput: 0: 43960.9. Samples: 3490137560. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 11:47:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 11:47:05,941][06909] Updated weights for policy 0, policy_version 218953 (0.0036) [2024-06-28 11:47:08,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.8, 300 sec: 43931.4). Total num frames: 3587424256. Throughput: 0: 43948.1. Samples: 3490398160. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 11:47:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:47:10,198][06909] Updated weights for policy 0, policy_version 218963 (0.0033) [2024-06-28 11:47:13,278][06909] Updated weights for policy 0, policy_version 218973 (0.0037) [2024-06-28 11:47:13,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.7, 300 sec: 44098.0). Total num frames: 3587670016. Throughput: 0: 44029.8. Samples: 3490534260. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 11:47:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:47:17,446][06909] Updated weights for policy 0, policy_version 218983 (0.0031) [2024-06-28 11:47:18,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 3587866624. Throughput: 0: 44109.0. Samples: 3490801440. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 11:47:18,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 11:47:20,873][06909] Updated weights for policy 0, policy_version 218993 (0.0029) [2024-06-28 11:47:23,852][06674] Fps is (10 sec: 40949.3, 60 sec: 43961.8, 300 sec: 43930.9). Total num frames: 3588079616. Throughput: 0: 44441.9. Samples: 3491072600. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 11:47:23,853][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:47:24,619][06909] Updated weights for policy 0, policy_version 219003 (0.0032) [2024-06-28 11:47:28,021][06909] Updated weights for policy 0, policy_version 219013 (0.0025) [2024-06-28 11:47:28,850][06674] Fps is (10 sec: 47513.9, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 3588341760. Throughput: 0: 44340.5. Samples: 3491201180. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 11:47:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:47:29,466][06887] Signal inference workers to stop experience collection... (49400 times) [2024-06-28 11:47:29,466][06887] Signal inference workers to resume experience collection... (49400 times) [2024-06-28 11:47:29,506][06909] InferenceWorker_p0-w0: stopping experience collection (49400 times) [2024-06-28 11:47:29,506][06909] InferenceWorker_p0-w0: resuming experience collection (49400 times) [2024-06-28 11:47:32,002][06909] Updated weights for policy 0, policy_version 219023 (0.0036) [2024-06-28 11:47:33,850][06674] Fps is (10 sec: 47526.4, 60 sec: 44510.0, 300 sec: 43986.9). Total num frames: 3588554752. Throughput: 0: 44281.9. Samples: 3491466980. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 11:47:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:47:35,579][06909] Updated weights for policy 0, policy_version 219033 (0.0031) [2024-06-28 11:47:38,850][06674] Fps is (10 sec: 40959.9, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3588751360. Throughput: 0: 44298.6. Samples: 3491732780. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 11:47:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 11:47:39,566][06909] Updated weights for policy 0, policy_version 219043 (0.0042) [2024-06-28 11:47:43,003][06909] Updated weights for policy 0, policy_version 219053 (0.0032) [2024-06-28 11:47:43,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.7, 300 sec: 43932.2). Total num frames: 3588964352. Throughput: 0: 43977.8. Samples: 3491853000. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 11:47:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:47:47,356][06909] Updated weights for policy 0, policy_version 219063 (0.0035) [2024-06-28 11:47:48,852][06674] Fps is (10 sec: 45865.7, 60 sec: 44508.3, 300 sec: 43986.6). Total num frames: 3589210112. Throughput: 0: 44105.2. Samples: 3492122380. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 11:47:48,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:47:48,860][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000219068_3589210112.pth... [2024-06-28 11:47:48,916][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000218424_3578658816.pth [2024-06-28 11:47:50,522][06909] Updated weights for policy 0, policy_version 219073 (0.0031) [2024-06-28 11:47:53,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 3589406720. Throughput: 0: 44139.5. Samples: 3492384440. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 11:47:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:47:54,610][06909] Updated weights for policy 0, policy_version 219083 (0.0027) [2024-06-28 11:47:58,079][06909] Updated weights for policy 0, policy_version 219093 (0.0026) [2024-06-28 11:47:58,850][06674] Fps is (10 sec: 42606.9, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 3589636096. Throughput: 0: 43948.4. Samples: 3492511940. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 11:47:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:48:01,887][06909] Updated weights for policy 0, policy_version 219103 (0.0029) [2024-06-28 11:48:03,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3589865472. Throughput: 0: 43908.1. Samples: 3492777300. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 11:48:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:48:05,587][06909] Updated weights for policy 0, policy_version 219113 (0.0040) [2024-06-28 11:48:08,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 3590078464. Throughput: 0: 43842.0. Samples: 3493045380. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 11:48:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:48:09,554][06909] Updated weights for policy 0, policy_version 219123 (0.0031) [2024-06-28 11:48:13,378][06909] Updated weights for policy 0, policy_version 219133 (0.0031) [2024-06-28 11:48:13,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.8, 300 sec: 43986.9). Total num frames: 3590291456. Throughput: 0: 43840.5. Samples: 3493174000. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 11:48:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:48:16,986][06909] Updated weights for policy 0, policy_version 219143 (0.0038) [2024-06-28 11:48:18,850][06674] Fps is (10 sec: 44237.4, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3590520832. Throughput: 0: 43733.3. Samples: 3493434980. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 11:48:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:48:20,943][06909] Updated weights for policy 0, policy_version 219153 (0.0033) [2024-06-28 11:48:23,852][06674] Fps is (10 sec: 42589.1, 60 sec: 43964.1, 300 sec: 43931.4). Total num frames: 3590717440. Throughput: 0: 43626.9. Samples: 3493696080. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 11:48:23,853][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:48:24,846][06909] Updated weights for policy 0, policy_version 219163 (0.0032) [2024-06-28 11:48:28,435][06909] Updated weights for policy 0, policy_version 219173 (0.0034) [2024-06-28 11:48:28,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43417.6, 300 sec: 43986.9). Total num frames: 3590946816. Throughput: 0: 43851.9. Samples: 3493826340. Policy #0 lag: (min: 1.0, avg: 11.8, max: 22.0) [2024-06-28 11:48:28,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 11:48:31,931][06909] Updated weights for policy 0, policy_version 219183 (0.0021) [2024-06-28 11:48:33,850][06674] Fps is (10 sec: 45883.9, 60 sec: 43690.5, 300 sec: 43931.3). Total num frames: 3591176192. Throughput: 0: 43656.5. Samples: 3494086840. Policy #0 lag: (min: 1.0, avg: 11.8, max: 22.0) [2024-06-28 11:48:33,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:48:35,545][06909] Updated weights for policy 0, policy_version 219193 (0.0033) [2024-06-28 11:48:38,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 43931.6). Total num frames: 3591389184. Throughput: 0: 43884.4. Samples: 3494359240. Policy #0 lag: (min: 1.0, avg: 11.8, max: 22.0) [2024-06-28 11:48:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:48:39,145][06909] Updated weights for policy 0, policy_version 219203 (0.0041) [2024-06-28 11:48:42,885][06909] Updated weights for policy 0, policy_version 219213 (0.0026) [2024-06-28 11:48:43,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 3591602176. Throughput: 0: 43975.6. Samples: 3494490840. Policy #0 lag: (min: 1.0, avg: 11.8, max: 22.0) [2024-06-28 11:48:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:48:46,625][06909] Updated weights for policy 0, policy_version 219223 (0.0028) [2024-06-28 11:48:48,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43965.3, 300 sec: 43986.9). Total num frames: 3591847936. Throughput: 0: 44035.6. Samples: 3494758900. Policy #0 lag: (min: 1.0, avg: 11.8, max: 22.0) [2024-06-28 11:48:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:48:51,191][06909] Updated weights for policy 0, policy_version 219233 (0.0036) [2024-06-28 11:48:53,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 3592028160. Throughput: 0: 43740.9. Samples: 3495013720. Policy #0 lag: (min: 1.0, avg: 11.8, max: 22.0) [2024-06-28 11:48:53,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:48:54,019][06887] Signal inference workers to stop experience collection... (49450 times) [2024-06-28 11:48:54,026][06887] Signal inference workers to resume experience collection... (49450 times) [2024-06-28 11:48:54,042][06909] InferenceWorker_p0-w0: stopping experience collection (49450 times) [2024-06-28 11:48:54,042][06909] InferenceWorker_p0-w0: resuming experience collection (49450 times) [2024-06-28 11:48:54,450][06909] Updated weights for policy 0, policy_version 219243 (0.0037) [2024-06-28 11:48:58,395][06909] Updated weights for policy 0, policy_version 219253 (0.0035) [2024-06-28 11:48:58,850][06674] Fps is (10 sec: 40959.1, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 3592257536. Throughput: 0: 43744.2. Samples: 3495142500. Policy #0 lag: (min: 1.0, avg: 11.8, max: 22.0) [2024-06-28 11:48:58,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:49:01,710][06909] Updated weights for policy 0, policy_version 219263 (0.0032) [2024-06-28 11:49:03,850][06674] Fps is (10 sec: 47514.1, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3592503296. Throughput: 0: 43981.8. Samples: 3495414160. Policy #0 lag: (min: 1.0, avg: 11.8, max: 22.0) [2024-06-28 11:49:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:49:05,576][06909] Updated weights for policy 0, policy_version 219273 (0.0031) [2024-06-28 11:49:08,850][06674] Fps is (10 sec: 44237.7, 60 sec: 43690.8, 300 sec: 43875.8). Total num frames: 3592699904. Throughput: 0: 44047.4. Samples: 3495678120. Policy #0 lag: (min: 1.0, avg: 11.8, max: 22.0) [2024-06-28 11:49:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:49:09,174][06909] Updated weights for policy 0, policy_version 219283 (0.0044) [2024-06-28 11:49:12,744][06909] Updated weights for policy 0, policy_version 219293 (0.0033) [2024-06-28 11:49:13,850][06674] Fps is (10 sec: 40959.3, 60 sec: 43690.5, 300 sec: 43875.8). Total num frames: 3592912896. Throughput: 0: 43880.3. Samples: 3495800960. Policy #0 lag: (min: 1.0, avg: 11.8, max: 22.0) [2024-06-28 11:49:13,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:49:16,300][06909] Updated weights for policy 0, policy_version 219303 (0.0042) [2024-06-28 11:49:18,850][06674] Fps is (10 sec: 45874.7, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3593158656. Throughput: 0: 44151.2. Samples: 3496073640. Policy #0 lag: (min: 1.0, avg: 11.8, max: 22.0) [2024-06-28 11:49:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:49:19,989][06909] Updated weights for policy 0, policy_version 219313 (0.0038) [2024-06-28 11:49:23,850][06674] Fps is (10 sec: 47514.3, 60 sec: 44511.4, 300 sec: 43931.3). Total num frames: 3593388032. Throughput: 0: 43960.5. Samples: 3496337460. Policy #0 lag: (min: 1.0, avg: 11.8, max: 22.0) [2024-06-28 11:49:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:49:23,852][06909] Updated weights for policy 0, policy_version 219323 (0.0039) [2024-06-28 11:49:28,442][06909] Updated weights for policy 0, policy_version 219333 (0.0028) [2024-06-28 11:49:28,850][06674] Fps is (10 sec: 40960.7, 60 sec: 43690.8, 300 sec: 43875.8). Total num frames: 3593568256. Throughput: 0: 43863.7. Samples: 3496464700. Policy #0 lag: (min: 1.0, avg: 11.8, max: 22.0) [2024-06-28 11:49:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 11:49:31,542][06909] Updated weights for policy 0, policy_version 219343 (0.0034) [2024-06-28 11:49:33,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43963.9, 300 sec: 43986.9). Total num frames: 3593814016. Throughput: 0: 43688.9. Samples: 3496724900. Policy #0 lag: (min: 1.0, avg: 11.8, max: 22.0) [2024-06-28 11:49:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:49:35,740][06909] Updated weights for policy 0, policy_version 219353 (0.0053) [2024-06-28 11:49:38,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 3594027008. Throughput: 0: 44023.2. Samples: 3496994760. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 11:49:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:49:38,934][06909] Updated weights for policy 0, policy_version 219363 (0.0031) [2024-06-28 11:49:42,906][06909] Updated weights for policy 0, policy_version 219373 (0.0028) [2024-06-28 11:49:43,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 3594223616. Throughput: 0: 43921.5. Samples: 3497118960. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 11:49:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:49:46,358][06909] Updated weights for policy 0, policy_version 219383 (0.0032) [2024-06-28 11:49:48,850][06674] Fps is (10 sec: 42599.0, 60 sec: 43417.7, 300 sec: 43931.4). Total num frames: 3594452992. Throughput: 0: 43789.5. Samples: 3497384680. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 11:49:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:49:48,890][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000219389_3594469376.pth... [2024-06-28 11:49:48,967][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000218745_3583918080.pth [2024-06-28 11:49:50,109][06909] Updated weights for policy 0, policy_version 219393 (0.0021) [2024-06-28 11:49:53,799][06909] Updated weights for policy 0, policy_version 219403 (0.0035) [2024-06-28 11:49:53,850][06674] Fps is (10 sec: 47513.9, 60 sec: 44510.0, 300 sec: 43931.4). Total num frames: 3594698752. Throughput: 0: 43881.8. Samples: 3497652800. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 11:49:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:49:57,499][06909] Updated weights for policy 0, policy_version 219413 (0.0039) [2024-06-28 11:49:58,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.9, 300 sec: 43875.8). Total num frames: 3594895360. Throughput: 0: 43982.8. Samples: 3497780180. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 11:49:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:50:01,178][06909] Updated weights for policy 0, policy_version 219423 (0.0029) [2024-06-28 11:50:03,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3595141120. Throughput: 0: 43892.1. Samples: 3498048780. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 11:50:03,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:50:05,697][06909] Updated weights for policy 0, policy_version 219433 (0.0037) [2024-06-28 11:50:08,726][06909] Updated weights for policy 0, policy_version 219443 (0.0033) [2024-06-28 11:50:08,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 3595354112. Throughput: 0: 43920.5. Samples: 3498313880. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 11:50:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:50:11,221][06887] Signal inference workers to stop experience collection... (49500 times) [2024-06-28 11:50:11,221][06887] Signal inference workers to resume experience collection... (49500 times) [2024-06-28 11:50:11,251][06909] InferenceWorker_p0-w0: stopping experience collection (49500 times) [2024-06-28 11:50:11,251][06909] InferenceWorker_p0-w0: resuming experience collection (49500 times) [2024-06-28 11:50:12,861][06909] Updated weights for policy 0, policy_version 219453 (0.0020) [2024-06-28 11:50:13,850][06674] Fps is (10 sec: 42598.1, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 3595567104. Throughput: 0: 43895.9. Samples: 3498440020. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 11:50:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:50:16,222][06909] Updated weights for policy 0, policy_version 219463 (0.0022) [2024-06-28 11:50:18,850][06674] Fps is (10 sec: 45874.5, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 3595812864. Throughput: 0: 44011.4. Samples: 3498705420. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 11:50:18,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:50:20,010][06909] Updated weights for policy 0, policy_version 219473 (0.0034) [2024-06-28 11:50:23,612][06909] Updated weights for policy 0, policy_version 219483 (0.0027) [2024-06-28 11:50:23,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 3596009472. Throughput: 0: 43891.6. Samples: 3498969880. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 11:50:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:50:27,209][06909] Updated weights for policy 0, policy_version 219493 (0.0046) [2024-06-28 11:50:28,850][06674] Fps is (10 sec: 39322.1, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 3596206080. Throughput: 0: 44033.0. Samples: 3499100440. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 11:50:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:50:31,168][06909] Updated weights for policy 0, policy_version 219503 (0.0030) [2024-06-28 11:50:33,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3596468224. Throughput: 0: 44026.1. Samples: 3499365860. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 11:50:33,850][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 11:50:34,875][06909] Updated weights for policy 0, policy_version 219513 (0.0033) [2024-06-28 11:50:38,710][06909] Updated weights for policy 0, policy_version 219523 (0.0029) [2024-06-28 11:50:38,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 3596664832. Throughput: 0: 43959.0. Samples: 3499630960. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 11:50:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:50:42,624][06909] Updated weights for policy 0, policy_version 219533 (0.0039) [2024-06-28 11:50:43,850][06674] Fps is (10 sec: 40960.2, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 3596877824. Throughput: 0: 43831.1. Samples: 3499752580. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 11:50:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:50:46,160][06909] Updated weights for policy 0, policy_version 219543 (0.0023) [2024-06-28 11:50:48,850][06674] Fps is (10 sec: 47513.4, 60 sec: 44782.8, 300 sec: 44098.0). Total num frames: 3597139968. Throughput: 0: 43824.4. Samples: 3500020880. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 11:50:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:50:50,150][06909] Updated weights for policy 0, policy_version 219553 (0.0031) [2024-06-28 11:50:53,585][06909] Updated weights for policy 0, policy_version 219563 (0.0027) [2024-06-28 11:50:53,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 3597320192. Throughput: 0: 43979.9. Samples: 3500292980. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 11:50:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:50:57,344][06909] Updated weights for policy 0, policy_version 219573 (0.0035) [2024-06-28 11:50:58,850][06674] Fps is (10 sec: 39322.0, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3597533184. Throughput: 0: 43848.1. Samples: 3500413180. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 11:50:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:51:01,011][06909] Updated weights for policy 0, policy_version 219583 (0.0032) [2024-06-28 11:51:03,850][06674] Fps is (10 sec: 47513.8, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 3597795328. Throughput: 0: 44033.9. Samples: 3500686940. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 11:51:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:51:04,439][06909] Updated weights for policy 0, policy_version 219593 (0.0027) [2024-06-28 11:51:08,579][06909] Updated weights for policy 0, policy_version 219603 (0.0036) [2024-06-28 11:51:08,852][06674] Fps is (10 sec: 44227.3, 60 sec: 43689.1, 300 sec: 43931.0). Total num frames: 3597975552. Throughput: 0: 44105.0. Samples: 3500954700. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 11:51:08,858][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 11:51:11,779][06909] Updated weights for policy 0, policy_version 219613 (0.0042) [2024-06-28 11:51:13,850][06674] Fps is (10 sec: 40959.3, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3598204928. Throughput: 0: 43908.2. Samples: 3501076320. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 11:51:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:51:15,888][06909] Updated weights for policy 0, policy_version 219623 (0.0040) [2024-06-28 11:51:18,850][06674] Fps is (10 sec: 47523.0, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 3598450688. Throughput: 0: 43852.7. Samples: 3501339240. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 11:51:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:51:19,882][06909] Updated weights for policy 0, policy_version 219633 (0.0037) [2024-06-28 11:51:23,605][06909] Updated weights for policy 0, policy_version 219643 (0.0032) [2024-06-28 11:51:23,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.5, 300 sec: 43875.8). Total num frames: 3598630912. Throughput: 0: 43957.2. Samples: 3501609040. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 11:51:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:51:25,044][06887] Signal inference workers to stop experience collection... (49550 times) [2024-06-28 11:51:25,077][06909] InferenceWorker_p0-w0: stopping experience collection (49550 times) [2024-06-28 11:51:25,101][06887] Signal inference workers to resume experience collection... (49550 times) [2024-06-28 11:51:25,102][06909] InferenceWorker_p0-w0: resuming experience collection (49550 times) [2024-06-28 11:51:27,225][06909] Updated weights for policy 0, policy_version 219653 (0.0038) [2024-06-28 11:51:28,850][06674] Fps is (10 sec: 40960.7, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3598860288. Throughput: 0: 44003.1. Samples: 3501732720. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 11:51:28,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 11:51:30,875][06909] Updated weights for policy 0, policy_version 219663 (0.0023) [2024-06-28 11:51:33,850][06674] Fps is (10 sec: 47514.5, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 3599106048. Throughput: 0: 44012.5. Samples: 3502001440. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 11:51:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:51:34,451][06909] Updated weights for policy 0, policy_version 219673 (0.0036) [2024-06-28 11:51:38,581][06909] Updated weights for policy 0, policy_version 219683 (0.0039) [2024-06-28 11:51:38,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 3599286272. Throughput: 0: 44050.2. Samples: 3502275240. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 11:51:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 11:51:41,544][06909] Updated weights for policy 0, policy_version 219693 (0.0030) [2024-06-28 11:51:43,850][06674] Fps is (10 sec: 40959.4, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 3599515648. Throughput: 0: 44148.7. Samples: 3502399880. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 11:51:43,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:51:45,689][06909] Updated weights for policy 0, policy_version 219703 (0.0035) [2024-06-28 11:51:48,850][06674] Fps is (10 sec: 47513.0, 60 sec: 43690.6, 300 sec: 44097.9). Total num frames: 3599761408. Throughput: 0: 43963.4. Samples: 3502665300. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 11:51:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:51:48,858][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000219712_3599761408.pth... [2024-06-28 11:51:48,930][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000219068_3589210112.pth [2024-06-28 11:51:49,081][06909] Updated weights for policy 0, policy_version 219713 (0.0035) [2024-06-28 11:51:52,989][06909] Updated weights for policy 0, policy_version 219723 (0.0035) [2024-06-28 11:51:53,850][06674] Fps is (10 sec: 44237.7, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 3599958016. Throughput: 0: 43974.6. Samples: 3502933460. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 11:51:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:51:56,562][06909] Updated weights for policy 0, policy_version 219733 (0.0030) [2024-06-28 11:51:58,852][06674] Fps is (10 sec: 40951.6, 60 sec: 43962.1, 300 sec: 43875.5). Total num frames: 3600171008. Throughput: 0: 44122.5. Samples: 3503061920. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 11:51:58,853][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:52:00,548][06909] Updated weights for policy 0, policy_version 219743 (0.0035) [2024-06-28 11:52:03,852][06674] Fps is (10 sec: 45865.5, 60 sec: 43689.2, 300 sec: 44042.1). Total num frames: 3600416768. Throughput: 0: 44140.4. Samples: 3503325640. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 11:52:03,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:52:04,210][06909] Updated weights for policy 0, policy_version 219753 (0.0035) [2024-06-28 11:52:07,745][06909] Updated weights for policy 0, policy_version 219763 (0.0025) [2024-06-28 11:52:08,850][06674] Fps is (10 sec: 44246.5, 60 sec: 43965.3, 300 sec: 43875.8). Total num frames: 3600613376. Throughput: 0: 44238.9. Samples: 3503599780. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 11:52:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 11:52:11,310][06909] Updated weights for policy 0, policy_version 219773 (0.0022) [2024-06-28 11:52:13,850][06674] Fps is (10 sec: 40968.4, 60 sec: 43690.8, 300 sec: 43931.3). Total num frames: 3600826368. Throughput: 0: 44271.1. Samples: 3503724920. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 11:52:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:52:15,479][06909] Updated weights for policy 0, policy_version 219783 (0.0020) [2024-06-28 11:52:18,757][06909] Updated weights for policy 0, policy_version 219793 (0.0029) [2024-06-28 11:52:18,850][06674] Fps is (10 sec: 47513.2, 60 sec: 43963.8, 300 sec: 44098.3). Total num frames: 3601088512. Throughput: 0: 44216.4. Samples: 3503991180. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 11:52:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:52:22,653][06909] Updated weights for policy 0, policy_version 219803 (0.0024) [2024-06-28 11:52:23,850][06674] Fps is (10 sec: 47513.7, 60 sec: 44510.0, 300 sec: 43931.3). Total num frames: 3601301504. Throughput: 0: 44230.2. Samples: 3504265600. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 11:52:23,850][06674] Avg episode reward: [(0, '0.418')] [2024-06-28 11:52:25,980][06909] Updated weights for policy 0, policy_version 219813 (0.0040) [2024-06-28 11:52:28,850][06674] Fps is (10 sec: 40959.8, 60 sec: 43963.6, 300 sec: 43875.8). Total num frames: 3601498112. Throughput: 0: 44164.9. Samples: 3504387300. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 11:52:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:52:29,451][06887] Signal inference workers to stop experience collection... (49600 times) [2024-06-28 11:52:29,451][06887] Signal inference workers to resume experience collection... (49600 times) [2024-06-28 11:52:29,468][06909] InferenceWorker_p0-w0: stopping experience collection (49600 times) [2024-06-28 11:52:29,468][06909] InferenceWorker_p0-w0: resuming experience collection (49600 times) [2024-06-28 11:52:30,187][06909] Updated weights for policy 0, policy_version 219823 (0.0023) [2024-06-28 11:52:33,147][06909] Updated weights for policy 0, policy_version 219833 (0.0036) [2024-06-28 11:52:33,852][06674] Fps is (10 sec: 44227.6, 60 sec: 43962.2, 300 sec: 44042.1). Total num frames: 3601743872. Throughput: 0: 44188.3. Samples: 3504653860. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 11:52:33,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 11:52:37,538][06909] Updated weights for policy 0, policy_version 219843 (0.0041) [2024-06-28 11:52:38,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 3601956864. Throughput: 0: 44239.8. Samples: 3504924260. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 11:52:38,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:52:41,105][06909] Updated weights for policy 0, policy_version 219853 (0.0032) [2024-06-28 11:52:43,850][06674] Fps is (10 sec: 40968.8, 60 sec: 43963.9, 300 sec: 43876.1). Total num frames: 3602153472. Throughput: 0: 44280.0. Samples: 3505054420. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 11:52:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:52:45,302][06909] Updated weights for policy 0, policy_version 219863 (0.0024) [2024-06-28 11:52:48,339][06909] Updated weights for policy 0, policy_version 219873 (0.0024) [2024-06-28 11:52:48,850][06674] Fps is (10 sec: 44237.6, 60 sec: 43963.9, 300 sec: 44042.4). Total num frames: 3602399232. Throughput: 0: 44159.8. Samples: 3505312740. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 11:52:48,850][06674] Avg episode reward: [(0, '0.483')] [2024-06-28 11:52:52,573][06909] Updated weights for policy 0, policy_version 219883 (0.0035) [2024-06-28 11:52:53,850][06674] Fps is (10 sec: 45874.5, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 3602612224. Throughput: 0: 44037.7. Samples: 3505581480. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 11:52:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:52:55,799][06909] Updated weights for policy 0, policy_version 219893 (0.0041) [2024-06-28 11:52:58,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43965.3, 300 sec: 43875.8). Total num frames: 3602808832. Throughput: 0: 44178.2. Samples: 3505712940. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 11:52:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:52:59,750][06909] Updated weights for policy 0, policy_version 219903 (0.0032) [2024-06-28 11:53:03,260][06909] Updated weights for policy 0, policy_version 219913 (0.0035) [2024-06-28 11:53:03,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44238.3, 300 sec: 44042.4). Total num frames: 3603070976. Throughput: 0: 44040.9. Samples: 3505973020. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 11:53:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:53:07,240][06909] Updated weights for policy 0, policy_version 219923 (0.0039) [2024-06-28 11:53:08,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3603267584. Throughput: 0: 44015.1. Samples: 3506246280. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 11:53:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 11:53:11,131][06909] Updated weights for policy 0, policy_version 219933 (0.0035) [2024-06-28 11:53:13,850][06674] Fps is (10 sec: 40960.2, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 3603480576. Throughput: 0: 44047.7. Samples: 3506369440. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 11:53:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:53:14,575][06909] Updated weights for policy 0, policy_version 219943 (0.0040) [2024-06-28 11:53:18,435][06909] Updated weights for policy 0, policy_version 219953 (0.0026) [2024-06-28 11:53:18,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.8, 300 sec: 44098.3). Total num frames: 3603726336. Throughput: 0: 44014.4. Samples: 3506634420. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 11:53:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:53:22,294][06909] Updated weights for policy 0, policy_version 219963 (0.0029) [2024-06-28 11:53:23,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3603939328. Throughput: 0: 44130.8. Samples: 3506910140. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 11:53:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:53:25,672][06909] Updated weights for policy 0, policy_version 219973 (0.0050) [2024-06-28 11:53:28,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44510.0, 300 sec: 44042.4). Total num frames: 3604168704. Throughput: 0: 44168.4. Samples: 3507042000. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 11:53:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:53:29,626][06909] Updated weights for policy 0, policy_version 219983 (0.0027) [2024-06-28 11:53:33,139][06909] Updated weights for policy 0, policy_version 219993 (0.0035) [2024-06-28 11:53:33,850][06674] Fps is (10 sec: 45874.4, 60 sec: 44238.2, 300 sec: 44097.9). Total num frames: 3604398080. Throughput: 0: 44329.6. Samples: 3507307580. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 11:53:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:53:36,776][06909] Updated weights for policy 0, policy_version 220003 (0.0039) [2024-06-28 11:53:38,852][06674] Fps is (10 sec: 44227.4, 60 sec: 44235.3, 300 sec: 44097.6). Total num frames: 3604611072. Throughput: 0: 44202.5. Samples: 3507570680. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 11:53:38,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:53:40,350][06909] Updated weights for policy 0, policy_version 220013 (0.0029) [2024-06-28 11:53:43,853][06674] Fps is (10 sec: 42586.3, 60 sec: 44507.6, 300 sec: 43986.4). Total num frames: 3604824064. Throughput: 0: 44191.2. Samples: 3507701680. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 11:53:43,853][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:53:44,428][06909] Updated weights for policy 0, policy_version 220023 (0.0031) [2024-06-28 11:53:48,273][06909] Updated weights for policy 0, policy_version 220033 (0.0035) [2024-06-28 11:53:48,850][06674] Fps is (10 sec: 42607.5, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 3605037056. Throughput: 0: 44234.3. Samples: 3507963560. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 11:53:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:53:48,878][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000220035_3605053440.pth... [2024-06-28 11:53:48,937][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000219389_3594469376.pth [2024-06-28 11:53:51,764][06909] Updated weights for policy 0, policy_version 220043 (0.0033) [2024-06-28 11:53:53,850][06674] Fps is (10 sec: 44249.7, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 3605266432. Throughput: 0: 44000.8. Samples: 3508226320. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 11:53:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:53:55,598][06909] Updated weights for policy 0, policy_version 220053 (0.0028) [2024-06-28 11:53:58,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44782.9, 300 sec: 44042.4). Total num frames: 3605495808. Throughput: 0: 44266.6. Samples: 3508361440. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 11:53:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:53:59,403][06909] Updated weights for policy 0, policy_version 220063 (0.0028) [2024-06-28 11:54:02,904][06909] Updated weights for policy 0, policy_version 220073 (0.0032) [2024-06-28 11:54:03,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 3605692416. Throughput: 0: 44142.6. Samples: 3508620840. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 11:54:03,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:54:06,765][06909] Updated weights for policy 0, policy_version 220083 (0.0035) [2024-06-28 11:54:08,850][06674] Fps is (10 sec: 42598.6, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 3605921792. Throughput: 0: 43856.9. Samples: 3508883700. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 11:54:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:54:10,284][06909] Updated weights for policy 0, policy_version 220093 (0.0025) [2024-06-28 11:54:11,137][06887] Signal inference workers to stop experience collection... (49650 times) [2024-06-28 11:54:11,172][06909] InferenceWorker_p0-w0: stopping experience collection (49650 times) [2024-06-28 11:54:11,203][06887] Signal inference workers to resume experience collection... (49650 times) [2024-06-28 11:54:11,204][06909] InferenceWorker_p0-w0: resuming experience collection (49650 times) [2024-06-28 11:54:13,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 3606151168. Throughput: 0: 43921.7. Samples: 3509018480. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-28 11:54:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:54:14,064][06909] Updated weights for policy 0, policy_version 220103 (0.0036) [2024-06-28 11:54:18,217][06909] Updated weights for policy 0, policy_version 220113 (0.0046) [2024-06-28 11:54:18,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3606364160. Throughput: 0: 43824.1. Samples: 3509279660. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-28 11:54:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:54:21,542][06909] Updated weights for policy 0, policy_version 220123 (0.0030) [2024-06-28 11:54:23,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 3606593536. Throughput: 0: 43805.6. Samples: 3509541840. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-28 11:54:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:54:25,430][06909] Updated weights for policy 0, policy_version 220133 (0.0030) [2024-06-28 11:54:28,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 3606806528. Throughput: 0: 43839.6. Samples: 3509674340. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-28 11:54:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:54:29,085][06909] Updated weights for policy 0, policy_version 220143 (0.0035) [2024-06-28 11:54:32,770][06909] Updated weights for policy 0, policy_version 220153 (0.0039) [2024-06-28 11:54:33,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 3607019520. Throughput: 0: 43945.7. Samples: 3509941120. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-28 11:54:33,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:54:36,679][06909] Updated weights for policy 0, policy_version 220163 (0.0027) [2024-06-28 11:54:38,850][06674] Fps is (10 sec: 42599.4, 60 sec: 43692.2, 300 sec: 44098.0). Total num frames: 3607232512. Throughput: 0: 43861.0. Samples: 3510200060. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-28 11:54:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:54:40,094][06909] Updated weights for policy 0, policy_version 220173 (0.0030) [2024-06-28 11:54:43,843][06909] Updated weights for policy 0, policy_version 220183 (0.0026) [2024-06-28 11:54:43,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44239.0, 300 sec: 44153.5). Total num frames: 3607478272. Throughput: 0: 43994.7. Samples: 3510341200. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-28 11:54:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:54:47,830][06909] Updated weights for policy 0, policy_version 220193 (0.0043) [2024-06-28 11:54:48,850][06674] Fps is (10 sec: 44236.3, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3607674880. Throughput: 0: 44029.8. Samples: 3510602180. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-28 11:54:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:54:51,169][06909] Updated weights for policy 0, policy_version 220203 (0.0025) [2024-06-28 11:54:53,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 3607904256. Throughput: 0: 43956.9. Samples: 3510861760. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-28 11:54:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:54:55,202][06909] Updated weights for policy 0, policy_version 220213 (0.0034) [2024-06-28 11:54:58,658][06909] Updated weights for policy 0, policy_version 220223 (0.0038) [2024-06-28 11:54:58,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3608133632. Throughput: 0: 44040.0. Samples: 3511000280. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-28 11:54:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:55:02,573][06909] Updated weights for policy 0, policy_version 220233 (0.0028) [2024-06-28 11:55:03,850][06674] Fps is (10 sec: 42597.7, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3608330240. Throughput: 0: 44020.4. Samples: 3511260580. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-28 11:55:03,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:55:06,293][06909] Updated weights for policy 0, policy_version 220243 (0.0035) [2024-06-28 11:55:08,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3608559616. Throughput: 0: 44014.6. Samples: 3511522500. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-28 11:55:08,851][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:55:10,095][06909] Updated weights for policy 0, policy_version 220253 (0.0030) [2024-06-28 11:55:13,404][06909] Updated weights for policy 0, policy_version 220263 (0.0036) [2024-06-28 11:55:13,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3608788992. Throughput: 0: 44239.2. Samples: 3511665100. Policy #0 lag: (min: 0.0, avg: 11.5, max: 22.0) [2024-06-28 11:55:13,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:55:17,378][06909] Updated weights for policy 0, policy_version 220273 (0.0025) [2024-06-28 11:55:18,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3608985600. Throughput: 0: 44086.2. Samples: 3511925000. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-28 11:55:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:55:20,814][06909] Updated weights for policy 0, policy_version 220283 (0.0039) [2024-06-28 11:55:23,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 3609231360. Throughput: 0: 44103.0. Samples: 3512184700. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-28 11:55:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:55:24,936][06909] Updated weights for policy 0, policy_version 220293 (0.0037) [2024-06-28 11:55:28,378][06909] Updated weights for policy 0, policy_version 220303 (0.0021) [2024-06-28 11:55:28,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 3609460736. Throughput: 0: 44031.5. Samples: 3512322620. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-28 11:55:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:55:31,663][06887] Signal inference workers to stop experience collection... (49700 times) [2024-06-28 11:55:31,664][06887] Signal inference workers to resume experience collection... (49700 times) [2024-06-28 11:55:31,704][06909] InferenceWorker_p0-w0: stopping experience collection (49700 times) [2024-06-28 11:55:31,704][06909] InferenceWorker_p0-w0: resuming experience collection (49700 times) [2024-06-28 11:55:32,271][06909] Updated weights for policy 0, policy_version 220313 (0.0024) [2024-06-28 11:55:33,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3609640960. Throughput: 0: 43989.4. Samples: 3512581700. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-28 11:55:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:55:35,959][06909] Updated weights for policy 0, policy_version 220323 (0.0041) [2024-06-28 11:55:38,852][06674] Fps is (10 sec: 42589.9, 60 sec: 44235.2, 300 sec: 44097.6). Total num frames: 3609886720. Throughput: 0: 44122.8. Samples: 3512847380. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-28 11:55:38,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:55:39,757][06909] Updated weights for policy 0, policy_version 220333 (0.0022) [2024-06-28 11:55:43,319][06909] Updated weights for policy 0, policy_version 220343 (0.0028) [2024-06-28 11:55:43,850][06674] Fps is (10 sec: 49151.4, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 3610132480. Throughput: 0: 44128.4. Samples: 3512986060. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-28 11:55:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:55:47,605][06909] Updated weights for policy 0, policy_version 220353 (0.0029) [2024-06-28 11:55:48,850][06674] Fps is (10 sec: 40968.1, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 3610296320. Throughput: 0: 44017.8. Samples: 3513241380. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-28 11:55:48,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 11:55:48,858][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000220355_3610296320.pth... [2024-06-28 11:55:48,906][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000219712_3599761408.pth [2024-06-28 11:55:50,839][06909] Updated weights for policy 0, policy_version 220363 (0.0029) [2024-06-28 11:55:53,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43963.6, 300 sec: 44097.9). Total num frames: 3610542080. Throughput: 0: 44009.8. Samples: 3513502940. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-28 11:55:53,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 11:55:54,790][06909] Updated weights for policy 0, policy_version 220373 (0.0038) [2024-06-28 11:55:58,309][06909] Updated weights for policy 0, policy_version 220383 (0.0035) [2024-06-28 11:55:58,850][06674] Fps is (10 sec: 49152.2, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3610787840. Throughput: 0: 43912.5. Samples: 3513641160. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-28 11:55:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:56:02,497][06909] Updated weights for policy 0, policy_version 220393 (0.0044) [2024-06-28 11:56:03,850][06674] Fps is (10 sec: 40959.4, 60 sec: 43690.6, 300 sec: 43987.2). Total num frames: 3610951680. Throughput: 0: 43879.4. Samples: 3513899580. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-28 11:56:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:56:05,871][06909] Updated weights for policy 0, policy_version 220403 (0.0042) [2024-06-28 11:56:08,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3611197440. Throughput: 0: 43923.2. Samples: 3514161240. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-28 11:56:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:56:09,770][06909] Updated weights for policy 0, policy_version 220413 (0.0031) [2024-06-28 11:56:13,396][06909] Updated weights for policy 0, policy_version 220423 (0.0045) [2024-06-28 11:56:13,850][06674] Fps is (10 sec: 47514.6, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3611426816. Throughput: 0: 43787.6. Samples: 3514293060. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-28 11:56:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:56:17,627][06909] Updated weights for policy 0, policy_version 220433 (0.0031) [2024-06-28 11:56:18,852][06674] Fps is (10 sec: 42589.3, 60 sec: 43962.3, 300 sec: 44042.1). Total num frames: 3611623424. Throughput: 0: 43989.0. Samples: 3514561300. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-28 11:56:18,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:56:20,731][06909] Updated weights for policy 0, policy_version 220443 (0.0040) [2024-06-28 11:56:23,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 3611869184. Throughput: 0: 43928.7. Samples: 3514824080. Policy #0 lag: (min: 0.0, avg: 10.7, max: 23.0) [2024-06-28 11:56:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:56:24,757][06909] Updated weights for policy 0, policy_version 220453 (0.0038) [2024-06-28 11:56:28,121][06909] Updated weights for policy 0, policy_version 220463 (0.0034) [2024-06-28 11:56:28,850][06674] Fps is (10 sec: 47523.3, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3612098560. Throughput: 0: 43972.9. Samples: 3514964840. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 11:56:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:56:32,013][06909] Updated weights for policy 0, policy_version 220473 (0.0025) [2024-06-28 11:56:33,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3612278784. Throughput: 0: 44010.3. Samples: 3515221840. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 11:56:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:56:35,518][06909] Updated weights for policy 0, policy_version 220483 (0.0035) [2024-06-28 11:56:38,852][06674] Fps is (10 sec: 44227.8, 60 sec: 44236.8, 300 sec: 44153.2). Total num frames: 3612540928. Throughput: 0: 43987.4. Samples: 3515482460. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 11:56:38,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:56:39,321][06909] Updated weights for policy 0, policy_version 220493 (0.0028) [2024-06-28 11:56:43,026][06909] Updated weights for policy 0, policy_version 220503 (0.0029) [2024-06-28 11:56:43,850][06674] Fps is (10 sec: 47513.1, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 3612753920. Throughput: 0: 44001.3. Samples: 3515621220. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 11:56:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:56:46,759][06909] Updated weights for policy 0, policy_version 220513 (0.0028) [2024-06-28 11:56:48,850][06674] Fps is (10 sec: 40968.4, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3612950528. Throughput: 0: 43993.1. Samples: 3515879260. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 11:56:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:56:50,342][06887] Signal inference workers to stop experience collection... (49750 times) [2024-06-28 11:56:50,342][06887] Signal inference workers to resume experience collection... (49750 times) [2024-06-28 11:56:50,361][06909] InferenceWorker_p0-w0: stopping experience collection (49750 times) [2024-06-28 11:56:50,361][06909] InferenceWorker_p0-w0: resuming experience collection (49750 times) [2024-06-28 11:56:50,499][06909] Updated weights for policy 0, policy_version 220523 (0.0024) [2024-06-28 11:56:53,850][06674] Fps is (10 sec: 42599.3, 60 sec: 43963.9, 300 sec: 44098.3). Total num frames: 3613179904. Throughput: 0: 44048.1. Samples: 3516143400. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 11:56:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:56:54,740][06909] Updated weights for policy 0, policy_version 220533 (0.0031) [2024-06-28 11:56:58,250][06909] Updated weights for policy 0, policy_version 220543 (0.0025) [2024-06-28 11:56:58,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43690.6, 300 sec: 44042.7). Total num frames: 3613409280. Throughput: 0: 43976.8. Samples: 3516272020. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 11:56:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:57:01,966][06909] Updated weights for policy 0, policy_version 220553 (0.0042) [2024-06-28 11:57:03,850][06674] Fps is (10 sec: 39320.6, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 3613573120. Throughput: 0: 43912.5. Samples: 3516537280. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 11:57:03,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:57:05,546][06909] Updated weights for policy 0, policy_version 220563 (0.0034) [2024-06-28 11:57:08,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 3613851648. Throughput: 0: 43795.5. Samples: 3516794880. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 11:57:08,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:57:09,239][06909] Updated weights for policy 0, policy_version 220573 (0.0030) [2024-06-28 11:57:12,872][06909] Updated weights for policy 0, policy_version 220583 (0.0037) [2024-06-28 11:57:13,850][06674] Fps is (10 sec: 49152.5, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3614064640. Throughput: 0: 43751.5. Samples: 3516933660. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 11:57:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:57:16,946][06909] Updated weights for policy 0, policy_version 220593 (0.0028) [2024-06-28 11:57:18,852][06674] Fps is (10 sec: 39313.3, 60 sec: 43690.6, 300 sec: 43875.5). Total num frames: 3614244864. Throughput: 0: 43905.1. Samples: 3517197660. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 11:57:18,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 11:57:20,395][06909] Updated weights for policy 0, policy_version 220603 (0.0030) [2024-06-28 11:57:23,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 3614507008. Throughput: 0: 43697.6. Samples: 3517448760. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 11:57:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:57:24,308][06909] Updated weights for policy 0, policy_version 220613 (0.0036) [2024-06-28 11:57:28,008][06909] Updated weights for policy 0, policy_version 220623 (0.0027) [2024-06-28 11:57:28,850][06674] Fps is (10 sec: 47523.5, 60 sec: 43690.7, 300 sec: 43987.2). Total num frames: 3614720000. Throughput: 0: 43798.7. Samples: 3517592160. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 11:57:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:57:31,894][06909] Updated weights for policy 0, policy_version 220633 (0.0039) [2024-06-28 11:57:33,850][06674] Fps is (10 sec: 42598.2, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3614932992. Throughput: 0: 43890.2. Samples: 3517854320. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 11:57:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:57:35,510][06909] Updated weights for policy 0, policy_version 220643 (0.0029) [2024-06-28 11:57:38,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43692.1, 300 sec: 44097.9). Total num frames: 3615162368. Throughput: 0: 43802.5. Samples: 3518114520. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-28 11:57:38,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:57:38,967][06909] Updated weights for policy 0, policy_version 220653 (0.0027) [2024-06-28 11:57:42,614][06909] Updated weights for policy 0, policy_version 220663 (0.0027) [2024-06-28 11:57:43,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3615391744. Throughput: 0: 44021.8. Samples: 3518253000. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-28 11:57:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:57:46,167][06909] Updated weights for policy 0, policy_version 220673 (0.0030) [2024-06-28 11:57:48,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 3615571968. Throughput: 0: 44146.3. Samples: 3518523860. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-28 11:57:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:57:49,014][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000220678_3615588352.pth... [2024-06-28 11:57:49,087][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000220035_3605053440.pth [2024-06-28 11:57:49,930][06909] Updated weights for policy 0, policy_version 220683 (0.0036) [2024-06-28 11:57:53,439][06909] Updated weights for policy 0, policy_version 220693 (0.0040) [2024-06-28 11:57:53,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 3615834112. Throughput: 0: 44031.1. Samples: 3518776280. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-28 11:57:53,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:57:57,272][06909] Updated weights for policy 0, policy_version 220703 (0.0022) [2024-06-28 11:57:58,850][06674] Fps is (10 sec: 47513.6, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3616047104. Throughput: 0: 44136.0. Samples: 3518919780. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-28 11:57:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:58:00,209][06887] Signal inference workers to stop experience collection... (49800 times) [2024-06-28 11:58:00,209][06887] Signal inference workers to resume experience collection... (49800 times) [2024-06-28 11:58:00,253][06909] InferenceWorker_p0-w0: stopping experience collection (49800 times) [2024-06-28 11:58:00,253][06909] InferenceWorker_p0-w0: resuming experience collection (49800 times) [2024-06-28 11:58:02,009][06909] Updated weights for policy 0, policy_version 220713 (0.0026) [2024-06-28 11:58:03,850][06674] Fps is (10 sec: 40959.9, 60 sec: 44509.9, 300 sec: 43986.9). Total num frames: 3616243712. Throughput: 0: 44093.6. Samples: 3519181780. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-28 11:58:03,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:58:04,809][06909] Updated weights for policy 0, policy_version 220723 (0.0034) [2024-06-28 11:58:08,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 3616473088. Throughput: 0: 44119.1. Samples: 3519434120. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-28 11:58:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:58:09,150][06909] Updated weights for policy 0, policy_version 220733 (0.0026) [2024-06-28 11:58:12,275][06909] Updated weights for policy 0, policy_version 220743 (0.0027) [2024-06-28 11:58:13,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3616702464. Throughput: 0: 44131.2. Samples: 3519578060. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-28 11:58:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:58:16,545][06909] Updated weights for policy 0, policy_version 220753 (0.0035) [2024-06-28 11:58:18,850][06674] Fps is (10 sec: 42598.7, 60 sec: 44238.4, 300 sec: 43931.3). Total num frames: 3616899072. Throughput: 0: 44069.8. Samples: 3519837460. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-28 11:58:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:58:19,990][06909] Updated weights for policy 0, policy_version 220763 (0.0039) [2024-06-28 11:58:23,756][06909] Updated weights for policy 0, policy_version 220773 (0.0046) [2024-06-28 11:58:23,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3617144832. Throughput: 0: 43887.2. Samples: 3520089440. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-28 11:58:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:58:27,214][06909] Updated weights for policy 0, policy_version 220783 (0.0030) [2024-06-28 11:58:28,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.7, 300 sec: 43931.4). Total num frames: 3617357824. Throughput: 0: 44047.1. Samples: 3520235120. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-28 11:58:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:58:31,302][06909] Updated weights for policy 0, policy_version 220793 (0.0038) [2024-06-28 11:58:33,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.8, 300 sec: 43931.6). Total num frames: 3617570816. Throughput: 0: 44025.8. Samples: 3520505020. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-28 11:58:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:58:34,634][06909] Updated weights for policy 0, policy_version 220803 (0.0030) [2024-06-28 11:58:38,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.7, 300 sec: 43931.8). Total num frames: 3617783808. Throughput: 0: 44213.3. Samples: 3520765880. Policy #0 lag: (min: 0.0, avg: 9.9, max: 23.0) [2024-06-28 11:58:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 11:58:38,928][06909] Updated weights for policy 0, policy_version 220813 (0.0026) [2024-06-28 11:58:41,939][06909] Updated weights for policy 0, policy_version 220823 (0.0032) [2024-06-28 11:58:43,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3618029568. Throughput: 0: 43900.0. Samples: 3520895280. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-28 11:58:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:58:46,021][06909] Updated weights for policy 0, policy_version 220833 (0.0021) [2024-06-28 11:58:48,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.7, 300 sec: 43931.3). Total num frames: 3618226176. Throughput: 0: 44104.4. Samples: 3521166480. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-28 11:58:48,851][06674] Avg episode reward: [(0, '0.419')] [2024-06-28 11:58:49,744][06909] Updated weights for policy 0, policy_version 220843 (0.0036) [2024-06-28 11:58:53,538][06909] Updated weights for policy 0, policy_version 220853 (0.0029) [2024-06-28 11:58:53,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 3618455552. Throughput: 0: 44197.3. Samples: 3521423000. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-28 11:58:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 11:58:57,156][06909] Updated weights for policy 0, policy_version 220863 (0.0026) [2024-06-28 11:58:58,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3618684928. Throughput: 0: 43935.1. Samples: 3521555140. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-28 11:58:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:59:00,815][06909] Updated weights for policy 0, policy_version 220873 (0.0025) [2024-06-28 11:59:03,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 3618914304. Throughput: 0: 44250.1. Samples: 3521828720. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-28 11:59:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:59:04,569][06909] Updated weights for policy 0, policy_version 220883 (0.0026) [2024-06-28 11:59:08,262][06909] Updated weights for policy 0, policy_version 220893 (0.0029) [2024-06-28 11:59:08,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 3619110912. Throughput: 0: 44401.0. Samples: 3522087480. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-28 11:59:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:59:12,003][06909] Updated weights for policy 0, policy_version 220903 (0.0032) [2024-06-28 11:59:13,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 3619356672. Throughput: 0: 44183.5. Samples: 3522223380. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-28 11:59:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 11:59:16,069][06909] Updated weights for policy 0, policy_version 220913 (0.0030) [2024-06-28 11:59:18,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44509.8, 300 sec: 43986.9). Total num frames: 3619569664. Throughput: 0: 44112.0. Samples: 3522490060. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-28 11:59:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:59:19,231][06909] Updated weights for policy 0, policy_version 220923 (0.0026) [2024-06-28 11:59:20,280][06887] Signal inference workers to stop experience collection... (49850 times) [2024-06-28 11:59:20,281][06887] Signal inference workers to resume experience collection... (49850 times) [2024-06-28 11:59:20,326][06909] InferenceWorker_p0-w0: stopping experience collection (49850 times) [2024-06-28 11:59:20,326][06909] InferenceWorker_p0-w0: resuming experience collection (49850 times) [2024-06-28 11:59:23,302][06909] Updated weights for policy 0, policy_version 220933 (0.0032) [2024-06-28 11:59:23,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43690.7, 300 sec: 43931.4). Total num frames: 3619766272. Throughput: 0: 43983.6. Samples: 3522745140. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-28 11:59:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 11:59:26,822][06909] Updated weights for policy 0, policy_version 220943 (0.0040) [2024-06-28 11:59:28,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44509.9, 300 sec: 44098.0). Total num frames: 3620028416. Throughput: 0: 44016.1. Samples: 3522876000. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-28 11:59:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:59:31,014][06909] Updated weights for policy 0, policy_version 220953 (0.0041) [2024-06-28 11:59:33,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3620225024. Throughput: 0: 44032.1. Samples: 3523147920. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-28 11:59:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:59:34,282][06909] Updated weights for policy 0, policy_version 220963 (0.0031) [2024-06-28 11:59:38,499][06909] Updated weights for policy 0, policy_version 220973 (0.0029) [2024-06-28 11:59:38,850][06674] Fps is (10 sec: 40959.3, 60 sec: 44236.7, 300 sec: 43931.3). Total num frames: 3620438016. Throughput: 0: 44195.0. Samples: 3523411780. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-28 11:59:38,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 11:59:41,594][06909] Updated weights for policy 0, policy_version 220983 (0.0037) [2024-06-28 11:59:43,850][06674] Fps is (10 sec: 44235.9, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3620667392. Throughput: 0: 44102.1. Samples: 3523539740. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-28 11:59:43,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 11:59:45,870][06909] Updated weights for policy 0, policy_version 220993 (0.0035) [2024-06-28 11:59:48,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3620880384. Throughput: 0: 44017.8. Samples: 3523809520. Policy #0 lag: (min: 0.0, avg: 11.8, max: 22.0) [2024-06-28 11:59:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 11:59:48,880][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000221002_3620896768.pth... [2024-06-28 11:59:48,927][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000220355_3610296320.pth [2024-06-28 11:59:49,271][06909] Updated weights for policy 0, policy_version 221003 (0.0029) [2024-06-28 11:59:53,296][06909] Updated weights for policy 0, policy_version 221013 (0.0031) [2024-06-28 11:59:53,850][06674] Fps is (10 sec: 42599.3, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 3621093376. Throughput: 0: 44190.2. Samples: 3524076040. Policy #0 lag: (min: 0.0, avg: 13.1, max: 21.0) [2024-06-28 11:59:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 11:59:56,510][06909] Updated weights for policy 0, policy_version 221023 (0.0035) [2024-06-28 11:59:58,850][06674] Fps is (10 sec: 47514.0, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 3621355520. Throughput: 0: 43983.2. Samples: 3524202620. Policy #0 lag: (min: 0.0, avg: 13.1, max: 21.0) [2024-06-28 11:59:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:00:01,115][06909] Updated weights for policy 0, policy_version 221033 (0.0030) [2024-06-28 12:00:03,850][06674] Fps is (10 sec: 45874.6, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3621552128. Throughput: 0: 43904.8. Samples: 3524465780. Policy #0 lag: (min: 0.0, avg: 13.1, max: 21.0) [2024-06-28 12:00:03,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 12:00:03,982][06909] Updated weights for policy 0, policy_version 221043 (0.0037) [2024-06-28 12:00:08,485][06909] Updated weights for policy 0, policy_version 221053 (0.0032) [2024-06-28 12:00:08,850][06674] Fps is (10 sec: 37683.2, 60 sec: 43690.7, 300 sec: 43875.8). Total num frames: 3621732352. Throughput: 0: 44202.3. Samples: 3524734240. Policy #0 lag: (min: 0.0, avg: 13.1, max: 21.0) [2024-06-28 12:00:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:00:11,300][06909] Updated weights for policy 0, policy_version 221063 (0.0034) [2024-06-28 12:00:13,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 3621994496. Throughput: 0: 44017.8. Samples: 3524856800. Policy #0 lag: (min: 0.0, avg: 13.1, max: 21.0) [2024-06-28 12:00:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:00:15,740][06909] Updated weights for policy 0, policy_version 221073 (0.0029) [2024-06-28 12:00:18,715][06909] Updated weights for policy 0, policy_version 221083 (0.0040) [2024-06-28 12:00:18,852][06674] Fps is (10 sec: 49141.5, 60 sec: 44235.3, 300 sec: 44042.1). Total num frames: 3622223872. Throughput: 0: 44081.5. Samples: 3525131680. Policy #0 lag: (min: 0.0, avg: 13.1, max: 21.0) [2024-06-28 12:00:18,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:00:23,260][06909] Updated weights for policy 0, policy_version 221093 (0.0028) [2024-06-28 12:00:23,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 3622404096. Throughput: 0: 44101.9. Samples: 3525396360. Policy #0 lag: (min: 0.0, avg: 13.1, max: 21.0) [2024-06-28 12:00:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:00:26,277][06909] Updated weights for policy 0, policy_version 221103 (0.0022) [2024-06-28 12:00:28,850][06674] Fps is (10 sec: 42607.6, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 3622649856. Throughput: 0: 44063.3. Samples: 3525522580. Policy #0 lag: (min: 0.0, avg: 13.1, max: 21.0) [2024-06-28 12:00:28,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:00:30,546][06909] Updated weights for policy 0, policy_version 221113 (0.0030) [2024-06-28 12:00:33,683][06909] Updated weights for policy 0, policy_version 221123 (0.0034) [2024-06-28 12:00:33,850][06674] Fps is (10 sec: 47513.8, 60 sec: 44236.8, 300 sec: 44042.7). Total num frames: 3622879232. Throughput: 0: 43926.8. Samples: 3525786220. Policy #0 lag: (min: 0.0, avg: 13.1, max: 21.0) [2024-06-28 12:00:33,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:00:38,222][06909] Updated weights for policy 0, policy_version 221133 (0.0034) [2024-06-28 12:00:38,850][06674] Fps is (10 sec: 39321.6, 60 sec: 43417.7, 300 sec: 43764.7). Total num frames: 3623043072. Throughput: 0: 43952.0. Samples: 3526053880. Policy #0 lag: (min: 0.0, avg: 13.1, max: 21.0) [2024-06-28 12:00:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:00:41,028][06909] Updated weights for policy 0, policy_version 221143 (0.0036) [2024-06-28 12:00:43,852][06674] Fps is (10 sec: 42589.4, 60 sec: 43962.3, 300 sec: 44097.7). Total num frames: 3623305216. Throughput: 0: 43878.0. Samples: 3526177220. Policy #0 lag: (min: 0.0, avg: 13.1, max: 21.0) [2024-06-28 12:00:43,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:00:45,545][06909] Updated weights for policy 0, policy_version 221153 (0.0026) [2024-06-28 12:00:48,701][06909] Updated weights for policy 0, policy_version 221163 (0.0036) [2024-06-28 12:00:48,850][06674] Fps is (10 sec: 49151.9, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 3623534592. Throughput: 0: 43988.1. Samples: 3526445240. Policy #0 lag: (min: 0.0, avg: 13.1, max: 21.0) [2024-06-28 12:00:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:00:49,139][06887] Signal inference workers to stop experience collection... (49900 times) [2024-06-28 12:00:49,139][06887] Signal inference workers to resume experience collection... (49900 times) [2024-06-28 12:00:49,180][06909] InferenceWorker_p0-w0: stopping experience collection (49900 times) [2024-06-28 12:00:49,180][06909] InferenceWorker_p0-w0: resuming experience collection (49900 times) [2024-06-28 12:00:52,768][06909] Updated weights for policy 0, policy_version 221173 (0.0035) [2024-06-28 12:00:53,850][06674] Fps is (10 sec: 40968.5, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 3623714816. Throughput: 0: 44135.6. Samples: 3526720340. Policy #0 lag: (min: 0.0, avg: 13.1, max: 21.0) [2024-06-28 12:00:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:00:56,012][06909] Updated weights for policy 0, policy_version 221183 (0.0036) [2024-06-28 12:00:58,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43417.6, 300 sec: 44098.0). Total num frames: 3623960576. Throughput: 0: 44160.4. Samples: 3526844020. Policy #0 lag: (min: 0.0, avg: 13.1, max: 21.0) [2024-06-28 12:00:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:01:00,470][06909] Updated weights for policy 0, policy_version 221193 (0.0038) [2024-06-28 12:01:03,684][06909] Updated weights for policy 0, policy_version 221203 (0.0037) [2024-06-28 12:01:03,850][06674] Fps is (10 sec: 49151.4, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 3624206336. Throughput: 0: 43902.4. Samples: 3527107200. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 12:01:03,850][06674] Avg episode reward: [(0, '0.474')] [2024-06-28 12:01:08,011][06909] Updated weights for policy 0, policy_version 221213 (0.0025) [2024-06-28 12:01:08,850][06674] Fps is (10 sec: 42598.2, 60 sec: 44236.7, 300 sec: 43931.3). Total num frames: 3624386560. Throughput: 0: 43978.5. Samples: 3527375400. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 12:01:08,853][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 12:01:11,031][06909] Updated weights for policy 0, policy_version 221223 (0.0040) [2024-06-28 12:01:13,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 44098.2). Total num frames: 3624632320. Throughput: 0: 43757.6. Samples: 3527491680. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 12:01:13,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:01:15,322][06909] Updated weights for policy 0, policy_version 221233 (0.0028) [2024-06-28 12:01:18,476][06909] Updated weights for policy 0, policy_version 221243 (0.0046) [2024-06-28 12:01:18,850][06674] Fps is (10 sec: 47513.6, 60 sec: 43965.2, 300 sec: 44042.4). Total num frames: 3624861696. Throughput: 0: 43820.7. Samples: 3527758160. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 12:01:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:01:22,718][06909] Updated weights for policy 0, policy_version 221253 (0.0028) [2024-06-28 12:01:23,852][06674] Fps is (10 sec: 37675.6, 60 sec: 43416.1, 300 sec: 43764.4). Total num frames: 3625009152. Throughput: 0: 43769.4. Samples: 3528023600. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 12:01:23,853][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:01:26,030][06909] Updated weights for policy 0, policy_version 221263 (0.0029) [2024-06-28 12:01:28,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 3625271296. Throughput: 0: 43686.0. Samples: 3528143000. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 12:01:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:01:30,426][06909] Updated weights for policy 0, policy_version 221273 (0.0028) [2024-06-28 12:01:33,605][06909] Updated weights for policy 0, policy_version 221283 (0.0045) [2024-06-28 12:01:33,850][06674] Fps is (10 sec: 49162.0, 60 sec: 43690.6, 300 sec: 43931.6). Total num frames: 3625500672. Throughput: 0: 43719.0. Samples: 3528412600. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 12:01:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:01:37,894][06909] Updated weights for policy 0, policy_version 221293 (0.0040) [2024-06-28 12:01:38,850][06674] Fps is (10 sec: 42598.2, 60 sec: 44236.7, 300 sec: 43875.8). Total num frames: 3625697280. Throughput: 0: 43549.7. Samples: 3528680080. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 12:01:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:01:41,172][06909] Updated weights for policy 0, policy_version 221303 (0.0033) [2024-06-28 12:01:43,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43692.2, 300 sec: 43986.9). Total num frames: 3625926656. Throughput: 0: 43396.1. Samples: 3528796840. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 12:01:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:01:45,302][06909] Updated weights for policy 0, policy_version 221313 (0.0033) [2024-06-28 12:01:48,431][06909] Updated weights for policy 0, policy_version 221323 (0.0025) [2024-06-28 12:01:48,850][06674] Fps is (10 sec: 49151.9, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 3626188800. Throughput: 0: 43737.4. Samples: 3529075380. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 12:01:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 12:01:48,863][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000221325_3626188800.pth... [2024-06-28 12:01:48,914][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000220678_3615588352.pth [2024-06-28 12:01:52,629][06909] Updated weights for policy 0, policy_version 221333 (0.0025) [2024-06-28 12:01:53,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 3626336256. Throughput: 0: 43685.5. Samples: 3529341240. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 12:01:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:01:55,743][06909] Updated weights for policy 0, policy_version 221343 (0.0032) [2024-06-28 12:01:58,850][06674] Fps is (10 sec: 39321.7, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 3626582016. Throughput: 0: 43858.7. Samples: 3529465320. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 12:01:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:02:00,011][06909] Updated weights for policy 0, policy_version 221353 (0.0022) [2024-06-28 12:02:03,460][06909] Updated weights for policy 0, policy_version 221363 (0.0031) [2024-06-28 12:02:03,850][06674] Fps is (10 sec: 50790.3, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3626844160. Throughput: 0: 43941.9. Samples: 3529735540. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 12:02:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:02:06,343][06887] Signal inference workers to stop experience collection... (49950 times) [2024-06-28 12:02:06,345][06887] Signal inference workers to resume experience collection... (49950 times) [2024-06-28 12:02:06,388][06909] InferenceWorker_p0-w0: stopping experience collection (49950 times) [2024-06-28 12:02:06,388][06909] InferenceWorker_p0-w0: resuming experience collection (49950 times) [2024-06-28 12:02:07,601][06909] Updated weights for policy 0, policy_version 221373 (0.0036) [2024-06-28 12:02:08,852][06674] Fps is (10 sec: 42589.7, 60 sec: 43689.2, 300 sec: 43875.5). Total num frames: 3627008000. Throughput: 0: 43944.5. Samples: 3530001100. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 12:02:08,852][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 12:02:10,729][06909] Updated weights for policy 0, policy_version 221383 (0.0028) [2024-06-28 12:02:13,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.8, 300 sec: 44098.3). Total num frames: 3627253760. Throughput: 0: 43933.8. Samples: 3530120020. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 12:02:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:02:14,946][06909] Updated weights for policy 0, policy_version 221393 (0.0025) [2024-06-28 12:02:18,267][06909] Updated weights for policy 0, policy_version 221403 (0.0033) [2024-06-28 12:02:18,850][06674] Fps is (10 sec: 47522.9, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3627483136. Throughput: 0: 44048.9. Samples: 3530394800. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 12:02:18,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:02:22,483][06909] Updated weights for policy 0, policy_version 221413 (0.0032) [2024-06-28 12:02:23,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44511.4, 300 sec: 43931.3). Total num frames: 3627679744. Throughput: 0: 43902.7. Samples: 3530655700. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 12:02:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 12:02:25,536][06909] Updated weights for policy 0, policy_version 221423 (0.0033) [2024-06-28 12:02:28,856][06674] Fps is (10 sec: 42572.9, 60 sec: 43959.3, 300 sec: 43986.0). Total num frames: 3627909120. Throughput: 0: 44198.4. Samples: 3530786040. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 12:02:28,856][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:02:29,781][06909] Updated weights for policy 0, policy_version 221433 (0.0026) [2024-06-28 12:02:32,994][06909] Updated weights for policy 0, policy_version 221443 (0.0041) [2024-06-28 12:02:33,850][06674] Fps is (10 sec: 47512.5, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 3628154880. Throughput: 0: 43898.9. Samples: 3531050840. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 12:02:33,851][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 12:02:37,081][06909] Updated weights for policy 0, policy_version 221453 (0.0026) [2024-06-28 12:02:38,850][06674] Fps is (10 sec: 42624.4, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 3628335104. Throughput: 0: 44074.2. Samples: 3531324580. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 12:02:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:02:40,423][06909] Updated weights for policy 0, policy_version 221463 (0.0028) [2024-06-28 12:02:43,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 3628564480. Throughput: 0: 43983.0. Samples: 3531444560. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 12:02:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:02:44,742][06909] Updated weights for policy 0, policy_version 221473 (0.0023) [2024-06-28 12:02:48,022][06909] Updated weights for policy 0, policy_version 221483 (0.0027) [2024-06-28 12:02:48,852][06674] Fps is (10 sec: 47503.5, 60 sec: 43689.2, 300 sec: 43986.6). Total num frames: 3628810240. Throughput: 0: 43989.5. Samples: 3531715160. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 12:02:48,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:02:52,299][06909] Updated weights for policy 0, policy_version 221493 (0.0027) [2024-06-28 12:02:53,850][06674] Fps is (10 sec: 42598.8, 60 sec: 44236.8, 300 sec: 43875.8). Total num frames: 3628990464. Throughput: 0: 43890.9. Samples: 3531976100. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 12:02:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:02:55,278][06909] Updated weights for policy 0, policy_version 221503 (0.0050) [2024-06-28 12:02:58,850][06674] Fps is (10 sec: 39329.5, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 3629203456. Throughput: 0: 44086.6. Samples: 3532103920. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 12:02:58,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 12:02:59,739][06909] Updated weights for policy 0, policy_version 221513 (0.0028) [2024-06-28 12:03:02,799][06909] Updated weights for policy 0, policy_version 221523 (0.0024) [2024-06-28 12:03:03,854][06674] Fps is (10 sec: 47494.8, 60 sec: 43687.8, 300 sec: 44041.8). Total num frames: 3629465600. Throughput: 0: 43679.4. Samples: 3532360540. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 12:03:03,854][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:03:06,945][06909] Updated weights for policy 0, policy_version 221533 (0.0033) [2024-06-28 12:03:08,852][06674] Fps is (10 sec: 45865.4, 60 sec: 44236.7, 300 sec: 43931.0). Total num frames: 3629662208. Throughput: 0: 44134.7. Samples: 3532641860. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 12:03:08,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:03:10,069][06909] Updated weights for policy 0, policy_version 221543 (0.0038) [2024-06-28 12:03:13,852][06674] Fps is (10 sec: 42606.4, 60 sec: 43962.2, 300 sec: 44042.1). Total num frames: 3629891584. Throughput: 0: 44025.2. Samples: 3532767000. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 12:03:13,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:03:14,438][06909] Updated weights for policy 0, policy_version 221553 (0.0027) [2024-06-28 12:03:17,654][06909] Updated weights for policy 0, policy_version 221563 (0.0042) [2024-06-28 12:03:18,850][06674] Fps is (10 sec: 47523.8, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3630137344. Throughput: 0: 44025.5. Samples: 3533031980. Policy #0 lag: (min: 0.0, avg: 8.3, max: 22.0) [2024-06-28 12:03:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:03:22,060][06909] Updated weights for policy 0, policy_version 221573 (0.0034) [2024-06-28 12:03:23,852][06674] Fps is (10 sec: 40960.0, 60 sec: 43689.1, 300 sec: 43875.5). Total num frames: 3630301184. Throughput: 0: 43784.2. Samples: 3533294960. Policy #0 lag: (min: 0.0, avg: 8.3, max: 22.0) [2024-06-28 12:03:23,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:03:24,758][06887] Signal inference workers to stop experience collection... (50000 times) [2024-06-28 12:03:24,813][06887] Signal inference workers to resume experience collection... (50000 times) [2024-06-28 12:03:24,813][06909] InferenceWorker_p0-w0: stopping experience collection (50000 times) [2024-06-28 12:03:24,841][06909] InferenceWorker_p0-w0: resuming experience collection (50000 times) [2024-06-28 12:03:25,316][06909] Updated weights for policy 0, policy_version 221583 (0.0032) [2024-06-28 12:03:28,850][06674] Fps is (10 sec: 39322.0, 60 sec: 43695.1, 300 sec: 43931.3). Total num frames: 3630530560. Throughput: 0: 43957.9. Samples: 3533422660. Policy #0 lag: (min: 0.0, avg: 8.3, max: 22.0) [2024-06-28 12:03:28,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 12:03:29,397][06909] Updated weights for policy 0, policy_version 221593 (0.0030) [2024-06-28 12:03:32,581][06909] Updated weights for policy 0, policy_version 221603 (0.0028) [2024-06-28 12:03:33,850][06674] Fps is (10 sec: 47523.2, 60 sec: 43690.8, 300 sec: 44042.4). Total num frames: 3630776320. Throughput: 0: 43752.2. Samples: 3533683920. Policy #0 lag: (min: 0.0, avg: 8.3, max: 22.0) [2024-06-28 12:03:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:03:37,185][06909] Updated weights for policy 0, policy_version 221613 (0.0038) [2024-06-28 12:03:38,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.7, 300 sec: 43875.8). Total num frames: 3630972928. Throughput: 0: 43946.7. Samples: 3533953700. Policy #0 lag: (min: 0.0, avg: 8.3, max: 22.0) [2024-06-28 12:03:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:03:39,981][06909] Updated weights for policy 0, policy_version 221623 (0.0031) [2024-06-28 12:03:43,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 3631185920. Throughput: 0: 43866.7. Samples: 3534077920. Policy #0 lag: (min: 0.0, avg: 8.3, max: 22.0) [2024-06-28 12:03:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 12:03:44,658][06909] Updated weights for policy 0, policy_version 221633 (0.0032) [2024-06-28 12:03:47,422][06909] Updated weights for policy 0, policy_version 221643 (0.0033) [2024-06-28 12:03:48,850][06674] Fps is (10 sec: 47513.0, 60 sec: 43965.2, 300 sec: 44042.4). Total num frames: 3631448064. Throughput: 0: 43975.3. Samples: 3534339260. Policy #0 lag: (min: 0.0, avg: 8.3, max: 22.0) [2024-06-28 12:03:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:03:48,996][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000221647_3631464448.pth... [2024-06-28 12:03:49,061][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000221002_3620896768.pth [2024-06-28 12:03:52,352][06909] Updated weights for policy 0, policy_version 221653 (0.0026) [2024-06-28 12:03:53,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.6, 300 sec: 43820.3). Total num frames: 3631611904. Throughput: 0: 43599.9. Samples: 3534603760. Policy #0 lag: (min: 0.0, avg: 8.3, max: 22.0) [2024-06-28 12:03:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:03:55,341][06909] Updated weights for policy 0, policy_version 221663 (0.0043) [2024-06-28 12:03:58,850][06674] Fps is (10 sec: 37684.0, 60 sec: 43690.8, 300 sec: 43764.7). Total num frames: 3631824896. Throughput: 0: 43455.4. Samples: 3534722400. Policy #0 lag: (min: 0.0, avg: 8.3, max: 22.0) [2024-06-28 12:03:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:03:59,729][06909] Updated weights for policy 0, policy_version 221673 (0.0036) [2024-06-28 12:04:02,800][06909] Updated weights for policy 0, policy_version 221683 (0.0025) [2024-06-28 12:04:03,850][06674] Fps is (10 sec: 49152.3, 60 sec: 43966.7, 300 sec: 44042.4). Total num frames: 3632103424. Throughput: 0: 43681.0. Samples: 3534997620. Policy #0 lag: (min: 0.0, avg: 8.3, max: 22.0) [2024-06-28 12:04:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 12:04:07,295][06909] Updated weights for policy 0, policy_version 221693 (0.0035) [2024-06-28 12:04:08,850][06674] Fps is (10 sec: 47512.9, 60 sec: 43965.3, 300 sec: 43875.8). Total num frames: 3632300032. Throughput: 0: 43747.7. Samples: 3535263520. Policy #0 lag: (min: 0.0, avg: 8.3, max: 22.0) [2024-06-28 12:04:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:04:10,011][06909] Updated weights for policy 0, policy_version 221703 (0.0031) [2024-06-28 12:04:13,850][06674] Fps is (10 sec: 39321.2, 60 sec: 43419.1, 300 sec: 43820.3). Total num frames: 3632496640. Throughput: 0: 43746.1. Samples: 3535391240. Policy #0 lag: (min: 0.0, avg: 8.3, max: 22.0) [2024-06-28 12:04:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:04:14,632][06909] Updated weights for policy 0, policy_version 221713 (0.0043) [2024-06-28 12:04:17,232][06909] Updated weights for policy 0, policy_version 221723 (0.0030) [2024-06-28 12:04:18,852][06674] Fps is (10 sec: 44227.9, 60 sec: 43416.1, 300 sec: 43986.6). Total num frames: 3632742400. Throughput: 0: 43774.5. Samples: 3535653860. Policy #0 lag: (min: 0.0, avg: 8.3, max: 22.0) [2024-06-28 12:04:18,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:04:21,881][06909] Updated weights for policy 0, policy_version 221733 (0.0038) [2024-06-28 12:04:23,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44238.3, 300 sec: 43820.3). Total num frames: 3632955392. Throughput: 0: 43823.1. Samples: 3535925740. Policy #0 lag: (min: 0.0, avg: 8.3, max: 22.0) [2024-06-28 12:04:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:04:24,798][06909] Updated weights for policy 0, policy_version 221743 (0.0025) [2024-06-28 12:04:28,850][06674] Fps is (10 sec: 42606.9, 60 sec: 43963.6, 300 sec: 43875.8). Total num frames: 3633168384. Throughput: 0: 43790.6. Samples: 3536048500. Policy #0 lag: (min: 0.0, avg: 12.6, max: 22.0) [2024-06-28 12:04:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:04:29,367][06909] Updated weights for policy 0, policy_version 221753 (0.0036) [2024-06-28 12:04:32,459][06909] Updated weights for policy 0, policy_version 221763 (0.0025) [2024-06-28 12:04:33,850][06674] Fps is (10 sec: 47512.8, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 3633430528. Throughput: 0: 43897.7. Samples: 3536314660. Policy #0 lag: (min: 0.0, avg: 12.6, max: 22.0) [2024-06-28 12:04:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:04:37,001][06909] Updated weights for policy 0, policy_version 221773 (0.0025) [2024-06-28 12:04:38,778][06887] Signal inference workers to stop experience collection... (50050 times) [2024-06-28 12:04:38,827][06909] InferenceWorker_p0-w0: stopping experience collection (50050 times) [2024-06-28 12:04:38,828][06887] Signal inference workers to resume experience collection... (50050 times) [2024-06-28 12:04:38,844][06909] InferenceWorker_p0-w0: resuming experience collection (50050 times) [2024-06-28 12:04:38,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.8, 300 sec: 43931.4). Total num frames: 3633627136. Throughput: 0: 44039.6. Samples: 3536585540. Policy #0 lag: (min: 0.0, avg: 12.6, max: 22.0) [2024-06-28 12:04:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:04:39,579][06909] Updated weights for policy 0, policy_version 221783 (0.0046) [2024-06-28 12:04:43,850][06674] Fps is (10 sec: 37683.7, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 3633807360. Throughput: 0: 44325.2. Samples: 3536717040. Policy #0 lag: (min: 0.0, avg: 12.6, max: 22.0) [2024-06-28 12:04:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:04:44,288][06909] Updated weights for policy 0, policy_version 221793 (0.0032) [2024-06-28 12:04:46,845][06909] Updated weights for policy 0, policy_version 221803 (0.0024) [2024-06-28 12:04:48,852][06674] Fps is (10 sec: 44226.5, 60 sec: 43689.0, 300 sec: 43986.5). Total num frames: 3634069504. Throughput: 0: 43954.5. Samples: 3536975680. Policy #0 lag: (min: 0.0, avg: 12.6, max: 22.0) [2024-06-28 12:04:48,860][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:04:51,811][06909] Updated weights for policy 0, policy_version 221813 (0.0028) [2024-06-28 12:04:53,850][06674] Fps is (10 sec: 50790.1, 60 sec: 45056.0, 300 sec: 43931.3). Total num frames: 3634315264. Throughput: 0: 44111.5. Samples: 3537248540. Policy #0 lag: (min: 0.0, avg: 12.6, max: 22.0) [2024-06-28 12:04:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:04:54,161][06909] Updated weights for policy 0, policy_version 221823 (0.0029) [2024-06-28 12:04:58,850][06674] Fps is (10 sec: 42608.1, 60 sec: 44509.7, 300 sec: 43875.8). Total num frames: 3634495488. Throughput: 0: 44090.7. Samples: 3537375320. Policy #0 lag: (min: 0.0, avg: 12.6, max: 22.0) [2024-06-28 12:04:58,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:04:59,088][06909] Updated weights for policy 0, policy_version 221833 (0.0026) [2024-06-28 12:05:02,032][06909] Updated weights for policy 0, policy_version 221843 (0.0034) [2024-06-28 12:05:03,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 3634741248. Throughput: 0: 44031.4. Samples: 3537635180. Policy #0 lag: (min: 0.0, avg: 12.6, max: 22.0) [2024-06-28 12:05:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:05:06,792][06909] Updated weights for policy 0, policy_version 221853 (0.0035) [2024-06-28 12:05:08,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43963.8, 300 sec: 43875.8). Total num frames: 3634937856. Throughput: 0: 44063.6. Samples: 3537908600. Policy #0 lag: (min: 0.0, avg: 12.6, max: 22.0) [2024-06-28 12:05:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:05:09,440][06909] Updated weights for policy 0, policy_version 221863 (0.0041) [2024-06-28 12:05:13,850][06674] Fps is (10 sec: 39322.0, 60 sec: 43963.9, 300 sec: 43765.0). Total num frames: 3635134464. Throughput: 0: 44102.4. Samples: 3538033100. Policy #0 lag: (min: 0.0, avg: 12.6, max: 22.0) [2024-06-28 12:05:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:05:14,162][06909] Updated weights for policy 0, policy_version 221873 (0.0040) [2024-06-28 12:05:16,699][06909] Updated weights for policy 0, policy_version 221883 (0.0033) [2024-06-28 12:05:18,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44238.3, 300 sec: 44042.4). Total num frames: 3635396608. Throughput: 0: 43939.2. Samples: 3538291920. Policy #0 lag: (min: 0.0, avg: 12.6, max: 22.0) [2024-06-28 12:05:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:05:21,538][06909] Updated weights for policy 0, policy_version 221893 (0.0030) [2024-06-28 12:05:23,850][06674] Fps is (10 sec: 49151.6, 60 sec: 44509.9, 300 sec: 43986.9). Total num frames: 3635625984. Throughput: 0: 44021.8. Samples: 3538566520. Policy #0 lag: (min: 0.0, avg: 12.6, max: 22.0) [2024-06-28 12:05:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:05:24,031][06909] Updated weights for policy 0, policy_version 221903 (0.0030) [2024-06-28 12:05:28,815][06909] Updated weights for policy 0, policy_version 221913 (0.0030) [2024-06-28 12:05:28,850][06674] Fps is (10 sec: 42598.0, 60 sec: 44236.8, 300 sec: 43875.8). Total num frames: 3635822592. Throughput: 0: 44033.7. Samples: 3538698560. Policy #0 lag: (min: 0.0, avg: 12.6, max: 22.0) [2024-06-28 12:05:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:05:31,729][06909] Updated weights for policy 0, policy_version 221923 (0.0035) [2024-06-28 12:05:33,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43690.8, 300 sec: 44097.9). Total num frames: 3636051968. Throughput: 0: 44065.9. Samples: 3538958540. Policy #0 lag: (min: 0.0, avg: 12.6, max: 22.0) [2024-06-28 12:05:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:05:35,911][06909] Updated weights for policy 0, policy_version 221933 (0.0037) [2024-06-28 12:05:36,812][06887] Signal inference workers to stop experience collection... (50100 times) [2024-06-28 12:05:36,812][06887] Signal inference workers to resume experience collection... (50100 times) [2024-06-28 12:05:36,856][06909] InferenceWorker_p0-w0: stopping experience collection (50100 times) [2024-06-28 12:05:36,856][06909] InferenceWorker_p0-w0: resuming experience collection (50100 times) [2024-06-28 12:05:38,850][06674] Fps is (10 sec: 45875.6, 60 sec: 44236.8, 300 sec: 43987.2). Total num frames: 3636281344. Throughput: 0: 44206.7. Samples: 3539237840. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 12:05:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:05:38,995][06909] Updated weights for policy 0, policy_version 221943 (0.0027) [2024-06-28 12:05:43,481][06909] Updated weights for policy 0, policy_version 221953 (0.0029) [2024-06-28 12:05:43,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44509.9, 300 sec: 43875.8). Total num frames: 3636477952. Throughput: 0: 44308.5. Samples: 3539369200. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 12:05:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:05:46,574][06909] Updated weights for policy 0, policy_version 221963 (0.0037) [2024-06-28 12:05:48,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43965.5, 300 sec: 44042.4). Total num frames: 3636707328. Throughput: 0: 44190.2. Samples: 3539623740. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 12:05:48,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 12:05:48,858][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000221967_3636707328.pth... [2024-06-28 12:05:48,918][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000221325_3626188800.pth [2024-06-28 12:05:51,132][06909] Updated weights for policy 0, policy_version 221973 (0.0041) [2024-06-28 12:05:53,763][06909] Updated weights for policy 0, policy_version 221983 (0.0021) [2024-06-28 12:05:53,850][06674] Fps is (10 sec: 49151.8, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 3636969472. Throughput: 0: 44024.8. Samples: 3539889720. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 12:05:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:05:58,379][06909] Updated weights for policy 0, policy_version 221993 (0.0021) [2024-06-28 12:05:58,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.8, 300 sec: 43820.3). Total num frames: 3637133312. Throughput: 0: 44371.8. Samples: 3540029840. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 12:05:58,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:06:01,213][06909] Updated weights for policy 0, policy_version 222003 (0.0043) [2024-06-28 12:06:03,850][06674] Fps is (10 sec: 39321.9, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3637362688. Throughput: 0: 44380.0. Samples: 3540289020. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 12:06:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:06:05,828][06909] Updated weights for policy 0, policy_version 222013 (0.0032) [2024-06-28 12:06:08,850][06674] Fps is (10 sec: 47514.0, 60 sec: 44509.8, 300 sec: 43986.9). Total num frames: 3637608448. Throughput: 0: 44132.9. Samples: 3540552500. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 12:06:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:06:08,955][06909] Updated weights for policy 0, policy_version 222023 (0.0033) [2024-06-28 12:06:13,383][06909] Updated weights for policy 0, policy_version 222033 (0.0026) [2024-06-28 12:06:13,850][06674] Fps is (10 sec: 42598.1, 60 sec: 44236.7, 300 sec: 43820.3). Total num frames: 3637788672. Throughput: 0: 44236.5. Samples: 3540689200. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 12:06:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:06:16,370][06909] Updated weights for policy 0, policy_version 222043 (0.0044) [2024-06-28 12:06:18,850][06674] Fps is (10 sec: 40958.6, 60 sec: 43690.4, 300 sec: 44098.2). Total num frames: 3638018048. Throughput: 0: 44100.1. Samples: 3540943060. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 12:06:18,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:06:21,146][06909] Updated weights for policy 0, policy_version 222053 (0.0034) [2024-06-28 12:06:23,749][06909] Updated weights for policy 0, policy_version 222063 (0.0023) [2024-06-28 12:06:23,850][06674] Fps is (10 sec: 49151.9, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 3638280192. Throughput: 0: 43814.2. Samples: 3541209480. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 12:06:23,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 12:06:28,386][06909] Updated weights for policy 0, policy_version 222073 (0.0035) [2024-06-28 12:06:28,851][06674] Fps is (10 sec: 42594.0, 60 sec: 43689.8, 300 sec: 43875.6). Total num frames: 3638444032. Throughput: 0: 43986.2. Samples: 3541348640. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 12:06:28,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:06:30,218][06887] Signal inference workers to stop experience collection... (50150 times) [2024-06-28 12:06:30,219][06887] Signal inference workers to resume experience collection... (50150 times) [2024-06-28 12:06:30,260][06909] InferenceWorker_p0-w0: stopping experience collection (50150 times) [2024-06-28 12:06:30,260][06909] InferenceWorker_p0-w0: resuming experience collection (50150 times) [2024-06-28 12:06:31,186][06909] Updated weights for policy 0, policy_version 222083 (0.0034) [2024-06-28 12:06:33,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3638689792. Throughput: 0: 44176.4. Samples: 3541611680. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 12:06:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:06:35,945][06909] Updated weights for policy 0, policy_version 222093 (0.0026) [2024-06-28 12:06:38,643][06909] Updated weights for policy 0, policy_version 222103 (0.0029) [2024-06-28 12:06:38,850][06674] Fps is (10 sec: 49158.7, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 3638935552. Throughput: 0: 43972.5. Samples: 3541868480. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 12:06:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:06:43,225][06909] Updated weights for policy 0, policy_version 222113 (0.0029) [2024-06-28 12:06:43,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43690.7, 300 sec: 43764.7). Total num frames: 3639099392. Throughput: 0: 43844.5. Samples: 3542002840. Policy #0 lag: (min: 1.0, avg: 10.5, max: 21.0) [2024-06-28 12:06:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:06:46,190][06909] Updated weights for policy 0, policy_version 222123 (0.0039) [2024-06-28 12:06:48,850][06674] Fps is (10 sec: 39321.3, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 3639328768. Throughput: 0: 43832.8. Samples: 3542261500. Policy #0 lag: (min: 1.0, avg: 10.5, max: 21.0) [2024-06-28 12:06:48,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:06:50,785][06909] Updated weights for policy 0, policy_version 222133 (0.0032) [2024-06-28 12:06:53,798][06909] Updated weights for policy 0, policy_version 222143 (0.0032) [2024-06-28 12:06:53,850][06674] Fps is (10 sec: 49151.9, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 3639590912. Throughput: 0: 43848.4. Samples: 3542525680. Policy #0 lag: (min: 1.0, avg: 10.5, max: 21.0) [2024-06-28 12:06:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:06:58,126][06909] Updated weights for policy 0, policy_version 222153 (0.0032) [2024-06-28 12:06:58,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.7, 300 sec: 43820.2). Total num frames: 3639771136. Throughput: 0: 43825.3. Samples: 3542661340. Policy #0 lag: (min: 1.0, avg: 10.5, max: 21.0) [2024-06-28 12:06:58,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:07:01,305][06909] Updated weights for policy 0, policy_version 222163 (0.0034) [2024-06-28 12:07:03,850][06674] Fps is (10 sec: 40959.6, 60 sec: 43963.7, 300 sec: 44042.7). Total num frames: 3640000512. Throughput: 0: 43999.4. Samples: 3542923020. Policy #0 lag: (min: 1.0, avg: 10.5, max: 21.0) [2024-06-28 12:07:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:07:06,066][06909] Updated weights for policy 0, policy_version 222173 (0.0022) [2024-06-28 12:07:08,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 3640229888. Throughput: 0: 43900.5. Samples: 3543185000. Policy #0 lag: (min: 1.0, avg: 10.5, max: 21.0) [2024-06-28 12:07:08,854][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 12:07:08,873][06909] Updated weights for policy 0, policy_version 222183 (0.0035) [2024-06-28 12:07:13,350][06909] Updated weights for policy 0, policy_version 222193 (0.0033) [2024-06-28 12:07:13,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 3640410112. Throughput: 0: 43938.1. Samples: 3543325800. Policy #0 lag: (min: 1.0, avg: 10.5, max: 21.0) [2024-06-28 12:07:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:07:16,331][06909] Updated weights for policy 0, policy_version 222203 (0.0035) [2024-06-28 12:07:18,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44237.0, 300 sec: 44042.4). Total num frames: 3640672256. Throughput: 0: 43704.5. Samples: 3543578380. Policy #0 lag: (min: 1.0, avg: 10.5, max: 21.0) [2024-06-28 12:07:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:07:20,527][06909] Updated weights for policy 0, policy_version 222213 (0.0041) [2024-06-28 12:07:23,826][06909] Updated weights for policy 0, policy_version 222223 (0.0036) [2024-06-28 12:07:23,853][06674] Fps is (10 sec: 49135.9, 60 sec: 43688.3, 300 sec: 44042.8). Total num frames: 3640901632. Throughput: 0: 43848.7. Samples: 3543841820. Policy #0 lag: (min: 1.0, avg: 10.5, max: 21.0) [2024-06-28 12:07:23,854][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:07:28,092][06909] Updated weights for policy 0, policy_version 222233 (0.0020) [2024-06-28 12:07:28,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44237.8, 300 sec: 43875.8). Total num frames: 3641098240. Throughput: 0: 43870.2. Samples: 3543977000. Policy #0 lag: (min: 1.0, avg: 10.5, max: 21.0) [2024-06-28 12:07:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:07:31,133][06909] Updated weights for policy 0, policy_version 222243 (0.0034) [2024-06-28 12:07:33,850][06674] Fps is (10 sec: 42612.9, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3641327616. Throughput: 0: 43946.4. Samples: 3544239080. Policy #0 lag: (min: 1.0, avg: 10.5, max: 21.0) [2024-06-28 12:07:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:07:35,487][06909] Updated weights for policy 0, policy_version 222253 (0.0027) [2024-06-28 12:07:38,650][06909] Updated weights for policy 0, policy_version 222263 (0.0026) [2024-06-28 12:07:38,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 3641556992. Throughput: 0: 43974.7. Samples: 3544504540. Policy #0 lag: (min: 1.0, avg: 10.5, max: 21.0) [2024-06-28 12:07:38,850][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 12:07:42,756][06909] Updated weights for policy 0, policy_version 222273 (0.0041) [2024-06-28 12:07:43,850][06674] Fps is (10 sec: 42598.2, 60 sec: 44236.8, 300 sec: 43876.1). Total num frames: 3641753600. Throughput: 0: 44003.2. Samples: 3544641480. Policy #0 lag: (min: 1.0, avg: 10.5, max: 21.0) [2024-06-28 12:07:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:07:45,818][06909] Updated weights for policy 0, policy_version 222283 (0.0038) [2024-06-28 12:07:48,854][06674] Fps is (10 sec: 42581.9, 60 sec: 44234.0, 300 sec: 44041.8). Total num frames: 3641982976. Throughput: 0: 44093.2. Samples: 3544907380. Policy #0 lag: (min: 1.0, avg: 10.5, max: 21.0) [2024-06-28 12:07:48,854][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:07:48,883][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000222289_3641982976.pth... [2024-06-28 12:07:48,929][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000221647_3631464448.pth [2024-06-28 12:07:50,473][06909] Updated weights for policy 0, policy_version 222293 (0.0031) [2024-06-28 12:07:53,853][06674] Fps is (10 sec: 44221.5, 60 sec: 43415.1, 300 sec: 44041.9). Total num frames: 3642195968. Throughput: 0: 44103.8. Samples: 3545169820. Policy #0 lag: (min: 1.0, avg: 8.8, max: 20.0) [2024-06-28 12:07:53,854][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:07:54,042][06909] Updated weights for policy 0, policy_version 222303 (0.0032) [2024-06-28 12:07:57,635][06909] Updated weights for policy 0, policy_version 222313 (0.0035) [2024-06-28 12:07:58,850][06674] Fps is (10 sec: 42614.3, 60 sec: 43963.7, 300 sec: 43876.4). Total num frames: 3642408960. Throughput: 0: 43864.4. Samples: 3545299700. Policy #0 lag: (min: 1.0, avg: 8.8, max: 20.0) [2024-06-28 12:07:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:08:01,417][06909] Updated weights for policy 0, policy_version 222323 (0.0036) [2024-06-28 12:08:01,625][06887] Signal inference workers to stop experience collection... (50200 times) [2024-06-28 12:08:01,685][06909] InferenceWorker_p0-w0: stopping experience collection (50200 times) [2024-06-28 12:08:01,687][06887] Signal inference workers to resume experience collection... (50200 times) [2024-06-28 12:08:01,701][06909] InferenceWorker_p0-w0: resuming experience collection (50200 times) [2024-06-28 12:08:03,850][06674] Fps is (10 sec: 45891.1, 60 sec: 44236.9, 300 sec: 44042.7). Total num frames: 3642654720. Throughput: 0: 44031.1. Samples: 3545559780. Policy #0 lag: (min: 1.0, avg: 8.8, max: 20.0) [2024-06-28 12:08:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:08:05,374][06909] Updated weights for policy 0, policy_version 222333 (0.0023) [2024-06-28 12:08:08,792][06909] Updated weights for policy 0, policy_version 222343 (0.0053) [2024-06-28 12:08:08,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.7, 300 sec: 43987.2). Total num frames: 3642867712. Throughput: 0: 44012.9. Samples: 3545822260. Policy #0 lag: (min: 1.0, avg: 8.8, max: 20.0) [2024-06-28 12:08:08,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:08:12,791][06909] Updated weights for policy 0, policy_version 222353 (0.0030) [2024-06-28 12:08:13,850][06674] Fps is (10 sec: 42597.9, 60 sec: 44509.8, 300 sec: 43875.8). Total num frames: 3643080704. Throughput: 0: 44001.6. Samples: 3545957080. Policy #0 lag: (min: 1.0, avg: 8.8, max: 20.0) [2024-06-28 12:08:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:08:16,268][06909] Updated weights for policy 0, policy_version 222363 (0.0029) [2024-06-28 12:08:18,852][06674] Fps is (10 sec: 44228.2, 60 sec: 43962.2, 300 sec: 44098.0). Total num frames: 3643310080. Throughput: 0: 44119.7. Samples: 3546224560. Policy #0 lag: (min: 1.0, avg: 8.8, max: 20.0) [2024-06-28 12:08:18,852][06674] Avg episode reward: [(0, '0.413')] [2024-06-28 12:08:20,054][06909] Updated weights for policy 0, policy_version 222373 (0.0031) [2024-06-28 12:08:23,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43420.1, 300 sec: 43986.9). Total num frames: 3643506688. Throughput: 0: 44000.9. Samples: 3546484580. Policy #0 lag: (min: 1.0, avg: 8.8, max: 20.0) [2024-06-28 12:08:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:08:23,859][06909] Updated weights for policy 0, policy_version 222383 (0.0032) [2024-06-28 12:08:27,558][06909] Updated weights for policy 0, policy_version 222393 (0.0046) [2024-06-28 12:08:28,850][06674] Fps is (10 sec: 40968.4, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 3643719680. Throughput: 0: 43974.2. Samples: 3546620320. Policy #0 lag: (min: 1.0, avg: 8.8, max: 20.0) [2024-06-28 12:08:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:08:31,242][06909] Updated weights for policy 0, policy_version 222403 (0.0026) [2024-06-28 12:08:33,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3643965440. Throughput: 0: 43825.1. Samples: 3546879340. Policy #0 lag: (min: 1.0, avg: 8.8, max: 20.0) [2024-06-28 12:08:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:08:34,810][06909] Updated weights for policy 0, policy_version 222413 (0.0037) [2024-06-28 12:08:38,783][06909] Updated weights for policy 0, policy_version 222423 (0.0031) [2024-06-28 12:08:38,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 3644178432. Throughput: 0: 43871.8. Samples: 3547143900. Policy #0 lag: (min: 1.0, avg: 8.8, max: 20.0) [2024-06-28 12:08:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:08:42,563][06909] Updated weights for policy 0, policy_version 222433 (0.0041) [2024-06-28 12:08:43,850][06674] Fps is (10 sec: 42597.7, 60 sec: 43963.6, 300 sec: 43875.8). Total num frames: 3644391424. Throughput: 0: 43868.8. Samples: 3547273800. Policy #0 lag: (min: 1.0, avg: 8.8, max: 20.0) [2024-06-28 12:08:43,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:08:46,114][06909] Updated weights for policy 0, policy_version 222443 (0.0035) [2024-06-28 12:08:48,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43966.6, 300 sec: 44098.0). Total num frames: 3644620800. Throughput: 0: 44141.9. Samples: 3547546160. Policy #0 lag: (min: 1.0, avg: 8.8, max: 20.0) [2024-06-28 12:08:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:08:49,800][06909] Updated weights for policy 0, policy_version 222453 (0.0038) [2024-06-28 12:08:53,621][06909] Updated weights for policy 0, policy_version 222463 (0.0032) [2024-06-28 12:08:53,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43966.3, 300 sec: 44097.9). Total num frames: 3644833792. Throughput: 0: 44257.5. Samples: 3547813840. Policy #0 lag: (min: 1.0, avg: 8.8, max: 20.0) [2024-06-28 12:08:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 12:08:57,509][06909] Updated weights for policy 0, policy_version 222473 (0.0026) [2024-06-28 12:08:58,850][06674] Fps is (10 sec: 44236.2, 60 sec: 44236.9, 300 sec: 43931.3). Total num frames: 3645063168. Throughput: 0: 44143.6. Samples: 3547943540. Policy #0 lag: (min: 1.0, avg: 8.8, max: 20.0) [2024-06-28 12:08:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:09:01,345][06909] Updated weights for policy 0, policy_version 222483 (0.0029) [2024-06-28 12:09:03,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3645276160. Throughput: 0: 43976.8. Samples: 3548203420. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-28 12:09:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:09:04,999][06909] Updated weights for policy 0, policy_version 222493 (0.0026) [2024-06-28 12:09:08,810][06909] Updated weights for policy 0, policy_version 222503 (0.0029) [2024-06-28 12:09:08,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.8, 300 sec: 44042.4). Total num frames: 3645489152. Throughput: 0: 44053.3. Samples: 3548466980. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-28 12:09:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:09:12,262][06909] Updated weights for policy 0, policy_version 222513 (0.0036) [2024-06-28 12:09:13,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.8, 300 sec: 43931.6). Total num frames: 3645702144. Throughput: 0: 43977.4. Samples: 3548599300. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-28 12:09:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:09:16,125][06909] Updated weights for policy 0, policy_version 222523 (0.0037) [2024-06-28 12:09:18,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43692.2, 300 sec: 43986.9). Total num frames: 3645931520. Throughput: 0: 44005.8. Samples: 3548859600. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-28 12:09:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:09:19,756][06909] Updated weights for policy 0, policy_version 222533 (0.0025) [2024-06-28 12:09:23,829][06909] Updated weights for policy 0, policy_version 222543 (0.0040) [2024-06-28 12:09:23,850][06674] Fps is (10 sec: 44236.2, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 3646144512. Throughput: 0: 44006.1. Samples: 3549124180. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-28 12:09:23,851][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 12:09:24,678][06887] Signal inference workers to stop experience collection... (50250 times) [2024-06-28 12:09:24,704][06909] InferenceWorker_p0-w0: stopping experience collection (50250 times) [2024-06-28 12:09:24,738][06887] Signal inference workers to resume experience collection... (50250 times) [2024-06-28 12:09:24,753][06909] InferenceWorker_p0-w0: resuming experience collection (50250 times) [2024-06-28 12:09:27,458][06909] Updated weights for policy 0, policy_version 222553 (0.0032) [2024-06-28 12:09:28,850][06674] Fps is (10 sec: 45874.5, 60 sec: 44509.8, 300 sec: 43931.3). Total num frames: 3646390272. Throughput: 0: 44072.0. Samples: 3549257040. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-28 12:09:28,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:09:31,086][06909] Updated weights for policy 0, policy_version 222563 (0.0031) [2024-06-28 12:09:33,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 3646586880. Throughput: 0: 43754.9. Samples: 3549515140. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-28 12:09:33,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 12:09:34,786][06909] Updated weights for policy 0, policy_version 222573 (0.0023) [2024-06-28 12:09:38,634][06909] Updated weights for policy 0, policy_version 222583 (0.0032) [2024-06-28 12:09:38,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 3646799872. Throughput: 0: 43796.3. Samples: 3549784680. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-28 12:09:38,851][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 12:09:42,476][06909] Updated weights for policy 0, policy_version 222593 (0.0036) [2024-06-28 12:09:43,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.8, 300 sec: 43931.7). Total num frames: 3647029248. Throughput: 0: 43786.7. Samples: 3549913940. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-28 12:09:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:09:46,220][06909] Updated weights for policy 0, policy_version 222603 (0.0034) [2024-06-28 12:09:48,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43690.5, 300 sec: 43820.2). Total num frames: 3647242240. Throughput: 0: 43663.3. Samples: 3550168280. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-28 12:09:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:09:48,866][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000222610_3647242240.pth... [2024-06-28 12:09:48,954][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000221967_3636707328.pth [2024-06-28 12:09:49,758][06909] Updated weights for policy 0, policy_version 222613 (0.0026) [2024-06-28 12:09:53,482][06909] Updated weights for policy 0, policy_version 222623 (0.0028) [2024-06-28 12:09:53,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.7, 300 sec: 43931.4). Total num frames: 3647455232. Throughput: 0: 43776.0. Samples: 3550436900. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-28 12:09:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:09:57,447][06909] Updated weights for policy 0, policy_version 222633 (0.0042) [2024-06-28 12:09:58,850][06674] Fps is (10 sec: 45876.0, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 3647700992. Throughput: 0: 43805.3. Samples: 3550570540. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-28 12:09:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:10:00,651][06909] Updated weights for policy 0, policy_version 222643 (0.0033) [2024-06-28 12:10:03,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 3647897600. Throughput: 0: 43767.5. Samples: 3550829140. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-28 12:10:03,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 12:10:04,787][06909] Updated weights for policy 0, policy_version 222653 (0.0023) [2024-06-28 12:10:08,408][06909] Updated weights for policy 0, policy_version 222663 (0.0029) [2024-06-28 12:10:08,852][06674] Fps is (10 sec: 44227.6, 60 sec: 44235.3, 300 sec: 44097.6). Total num frames: 3648143360. Throughput: 0: 43987.4. Samples: 3551103700. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 12:10:08,852][06674] Avg episode reward: [(0, '0.490')] [2024-06-28 12:10:12,329][06909] Updated weights for policy 0, policy_version 222673 (0.0042) [2024-06-28 12:10:13,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44509.8, 300 sec: 43986.9). Total num frames: 3648372736. Throughput: 0: 43992.1. Samples: 3551236680. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 12:10:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:10:15,833][06909] Updated weights for policy 0, policy_version 222683 (0.0031) [2024-06-28 12:10:18,850][06674] Fps is (10 sec: 40968.2, 60 sec: 43690.6, 300 sec: 43820.2). Total num frames: 3648552960. Throughput: 0: 43780.9. Samples: 3551485280. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 12:10:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:10:19,665][06909] Updated weights for policy 0, policy_version 222693 (0.0041) [2024-06-28 12:10:23,487][06909] Updated weights for policy 0, policy_version 222703 (0.0041) [2024-06-28 12:10:23,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 3648782336. Throughput: 0: 43789.0. Samples: 3551755180. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 12:10:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:10:27,255][06909] Updated weights for policy 0, policy_version 222713 (0.0031) [2024-06-28 12:10:28,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 3649011712. Throughput: 0: 43831.1. Samples: 3551886340. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 12:10:28,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:10:30,725][06909] Updated weights for policy 0, policy_version 222723 (0.0043) [2024-06-28 12:10:33,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43690.7, 300 sec: 43820.3). Total num frames: 3649208320. Throughput: 0: 43914.8. Samples: 3552144440. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 12:10:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 12:10:34,691][06909] Updated weights for policy 0, policy_version 222733 (0.0031) [2024-06-28 12:10:35,256][06887] Signal inference workers to stop experience collection... (50300 times) [2024-06-28 12:10:35,257][06887] Signal inference workers to resume experience collection... (50300 times) [2024-06-28 12:10:35,297][06909] InferenceWorker_p0-w0: stopping experience collection (50300 times) [2024-06-28 12:10:35,297][06909] InferenceWorker_p0-w0: resuming experience collection (50300 times) [2024-06-28 12:10:38,132][06909] Updated weights for policy 0, policy_version 222743 (0.0028) [2024-06-28 12:10:38,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 3649470464. Throughput: 0: 44023.4. Samples: 3552417960. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 12:10:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:10:42,172][06909] Updated weights for policy 0, policy_version 222753 (0.0039) [2024-06-28 12:10:43,850][06674] Fps is (10 sec: 47513.5, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3649683456. Throughput: 0: 44032.9. Samples: 3552552020. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 12:10:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:10:45,626][06909] Updated weights for policy 0, policy_version 222763 (0.0031) [2024-06-28 12:10:48,850][06674] Fps is (10 sec: 39321.3, 60 sec: 43690.6, 300 sec: 43709.2). Total num frames: 3649863680. Throughput: 0: 44046.1. Samples: 3552811220. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 12:10:48,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:10:49,311][06909] Updated weights for policy 0, policy_version 222773 (0.0030) [2024-06-28 12:10:53,042][06909] Updated weights for policy 0, policy_version 222783 (0.0024) [2024-06-28 12:10:53,850][06674] Fps is (10 sec: 42598.1, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 3650109440. Throughput: 0: 43662.4. Samples: 3553068420. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 12:10:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:10:56,779][06909] Updated weights for policy 0, policy_version 222793 (0.0029) [2024-06-28 12:10:58,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 3650322432. Throughput: 0: 43745.3. Samples: 3553205220. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 12:10:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:11:00,509][06909] Updated weights for policy 0, policy_version 222803 (0.0028) [2024-06-28 12:11:03,853][06674] Fps is (10 sec: 44225.0, 60 sec: 44234.8, 300 sec: 43875.4). Total num frames: 3650551808. Throughput: 0: 44050.7. Samples: 3553467680. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 12:11:03,853][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 12:11:04,165][06909] Updated weights for policy 0, policy_version 222813 (0.0021) [2024-06-28 12:11:07,954][06909] Updated weights for policy 0, policy_version 222823 (0.0029) [2024-06-28 12:11:08,850][06674] Fps is (10 sec: 45875.8, 60 sec: 43965.3, 300 sec: 44042.4). Total num frames: 3650781184. Throughput: 0: 43884.6. Samples: 3553729980. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 12:11:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:11:11,997][06909] Updated weights for policy 0, policy_version 222833 (0.0036) [2024-06-28 12:11:13,852][06674] Fps is (10 sec: 44239.7, 60 sec: 43689.2, 300 sec: 43986.6). Total num frames: 3650994176. Throughput: 0: 44111.4. Samples: 3553871440. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 12:11:13,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:11:15,328][06909] Updated weights for policy 0, policy_version 222843 (0.0036) [2024-06-28 12:11:18,850][06674] Fps is (10 sec: 42597.8, 60 sec: 44236.8, 300 sec: 43820.3). Total num frames: 3651207168. Throughput: 0: 44118.1. Samples: 3554129760. Policy #0 lag: (min: 0.0, avg: 11.7, max: 21.0) [2024-06-28 12:11:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:11:19,313][06909] Updated weights for policy 0, policy_version 222853 (0.0036) [2024-06-28 12:11:22,838][06909] Updated weights for policy 0, policy_version 222863 (0.0032) [2024-06-28 12:11:23,850][06674] Fps is (10 sec: 44246.1, 60 sec: 44236.9, 300 sec: 44042.6). Total num frames: 3651436544. Throughput: 0: 43749.0. Samples: 3554386660. Policy #0 lag: (min: 0.0, avg: 11.7, max: 21.0) [2024-06-28 12:11:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:11:26,786][06909] Updated weights for policy 0, policy_version 222873 (0.0049) [2024-06-28 12:11:28,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3651665920. Throughput: 0: 43835.0. Samples: 3554524600. Policy #0 lag: (min: 0.0, avg: 11.7, max: 21.0) [2024-06-28 12:11:28,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:11:30,402][06909] Updated weights for policy 0, policy_version 222883 (0.0027) [2024-06-28 12:11:33,850][06674] Fps is (10 sec: 40959.9, 60 sec: 43963.7, 300 sec: 43764.7). Total num frames: 3651846144. Throughput: 0: 43760.2. Samples: 3554780420. Policy #0 lag: (min: 0.0, avg: 11.7, max: 21.0) [2024-06-28 12:11:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:11:34,306][06909] Updated weights for policy 0, policy_version 222893 (0.0026) [2024-06-28 12:11:37,862][06909] Updated weights for policy 0, policy_version 222903 (0.0029) [2024-06-28 12:11:38,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.8, 300 sec: 44097.9). Total num frames: 3652108288. Throughput: 0: 43892.9. Samples: 3555043600. Policy #0 lag: (min: 0.0, avg: 11.7, max: 21.0) [2024-06-28 12:11:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:11:41,596][06909] Updated weights for policy 0, policy_version 222913 (0.0036) [2024-06-28 12:11:43,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3652304896. Throughput: 0: 43880.5. Samples: 3555179840. Policy #0 lag: (min: 0.0, avg: 11.7, max: 21.0) [2024-06-28 12:11:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:11:45,162][06909] Updated weights for policy 0, policy_version 222923 (0.0022) [2024-06-28 12:11:48,850][06674] Fps is (10 sec: 40960.0, 60 sec: 44236.9, 300 sec: 43820.2). Total num frames: 3652517888. Throughput: 0: 43960.0. Samples: 3555445760. Policy #0 lag: (min: 0.0, avg: 11.7, max: 21.0) [2024-06-28 12:11:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:11:48,869][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000222932_3652517888.pth... [2024-06-28 12:11:48,917][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000222289_3641982976.pth [2024-06-28 12:11:49,098][06909] Updated weights for policy 0, policy_version 222933 (0.0031) [2024-06-28 12:11:52,816][06909] Updated weights for policy 0, policy_version 222943 (0.0036) [2024-06-28 12:11:53,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43963.9, 300 sec: 43986.9). Total num frames: 3652747264. Throughput: 0: 43948.9. Samples: 3555707680. Policy #0 lag: (min: 0.0, avg: 11.7, max: 21.0) [2024-06-28 12:11:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:11:56,381][06909] Updated weights for policy 0, policy_version 222953 (0.0031) [2024-06-28 12:11:58,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3652976640. Throughput: 0: 43750.4. Samples: 3555840120. Policy #0 lag: (min: 0.0, avg: 11.7, max: 21.0) [2024-06-28 12:11:58,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 12:11:59,959][06909] Updated weights for policy 0, policy_version 222963 (0.0037) [2024-06-28 12:12:02,792][06887] Signal inference workers to stop experience collection... (50350 times) [2024-06-28 12:12:02,839][06909] InferenceWorker_p0-w0: stopping experience collection (50350 times) [2024-06-28 12:12:02,850][06887] Signal inference workers to resume experience collection... (50350 times) [2024-06-28 12:12:02,851][06909] InferenceWorker_p0-w0: resuming experience collection (50350 times) [2024-06-28 12:12:03,665][06909] Updated weights for policy 0, policy_version 222973 (0.0044) [2024-06-28 12:12:03,850][06674] Fps is (10 sec: 44236.0, 60 sec: 43965.7, 300 sec: 43931.3). Total num frames: 3653189632. Throughput: 0: 44078.7. Samples: 3556113300. Policy #0 lag: (min: 0.0, avg: 11.7, max: 21.0) [2024-06-28 12:12:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:12:07,568][06909] Updated weights for policy 0, policy_version 222983 (0.0039) [2024-06-28 12:12:08,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 3653419008. Throughput: 0: 44070.2. Samples: 3556369820. Policy #0 lag: (min: 0.0, avg: 11.7, max: 21.0) [2024-06-28 12:12:08,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 12:12:11,206][06909] Updated weights for policy 0, policy_version 222993 (0.0023) [2024-06-28 12:12:13,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43692.2, 300 sec: 43875.8). Total num frames: 3653615616. Throughput: 0: 44011.6. Samples: 3556505120. Policy #0 lag: (min: 0.0, avg: 11.7, max: 21.0) [2024-06-28 12:12:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:12:15,052][06909] Updated weights for policy 0, policy_version 223003 (0.0028) [2024-06-28 12:12:18,704][06909] Updated weights for policy 0, policy_version 223013 (0.0051) [2024-06-28 12:12:18,854][06674] Fps is (10 sec: 42578.8, 60 sec: 43960.4, 300 sec: 43875.6). Total num frames: 3653844992. Throughput: 0: 44329.2. Samples: 3556775440. Policy #0 lag: (min: 0.0, avg: 11.7, max: 21.0) [2024-06-28 12:12:18,855][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:12:22,233][06909] Updated weights for policy 0, policy_version 223023 (0.0046) [2024-06-28 12:12:23,850][06674] Fps is (10 sec: 47513.7, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3654090752. Throughput: 0: 44252.1. Samples: 3557034940. Policy #0 lag: (min: 0.0, avg: 11.7, max: 21.0) [2024-06-28 12:12:23,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 12:12:25,979][06909] Updated weights for policy 0, policy_version 223033 (0.0032) [2024-06-28 12:12:28,850][06674] Fps is (10 sec: 44256.6, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 3654287360. Throughput: 0: 44130.6. Samples: 3557165720. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 12:12:28,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:12:29,713][06909] Updated weights for policy 0, policy_version 223043 (0.0030) [2024-06-28 12:12:33,210][06909] Updated weights for policy 0, policy_version 223053 (0.0042) [2024-06-28 12:12:33,850][06674] Fps is (10 sec: 42597.9, 60 sec: 44509.8, 300 sec: 43931.3). Total num frames: 3654516736. Throughput: 0: 44343.9. Samples: 3557441240. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 12:12:33,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:12:37,148][06909] Updated weights for policy 0, policy_version 223063 (0.0037) [2024-06-28 12:12:38,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3654746112. Throughput: 0: 44226.9. Samples: 3557697900. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 12:12:38,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:12:40,993][06909] Updated weights for policy 0, policy_version 223073 (0.0031) [2024-06-28 12:12:43,850][06674] Fps is (10 sec: 42599.1, 60 sec: 43963.8, 300 sec: 43931.9). Total num frames: 3654942720. Throughput: 0: 44115.7. Samples: 3557825320. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 12:12:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:12:44,542][06909] Updated weights for policy 0, policy_version 223083 (0.0031) [2024-06-28 12:12:48,208][06909] Updated weights for policy 0, policy_version 223093 (0.0036) [2024-06-28 12:12:48,850][06674] Fps is (10 sec: 42598.8, 60 sec: 44236.8, 300 sec: 43987.4). Total num frames: 3655172096. Throughput: 0: 44164.1. Samples: 3558100680. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 12:12:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:12:52,167][06909] Updated weights for policy 0, policy_version 223103 (0.0039) [2024-06-28 12:12:53,850][06674] Fps is (10 sec: 45874.2, 60 sec: 44236.6, 300 sec: 44042.4). Total num frames: 3655401472. Throughput: 0: 44170.5. Samples: 3558357500. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 12:12:53,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:12:55,698][06909] Updated weights for policy 0, policy_version 223113 (0.0040) [2024-06-28 12:12:58,850][06674] Fps is (10 sec: 45874.8, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3655630848. Throughput: 0: 44256.8. Samples: 3558496680. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 12:12:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:12:59,549][06909] Updated weights for policy 0, policy_version 223123 (0.0041) [2024-06-28 12:13:03,251][06909] Updated weights for policy 0, policy_version 223133 (0.0036) [2024-06-28 12:13:03,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3655843840. Throughput: 0: 44258.6. Samples: 3558766880. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 12:13:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:13:06,736][06909] Updated weights for policy 0, policy_version 223143 (0.0033) [2024-06-28 12:13:08,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 3656056832. Throughput: 0: 44234.5. Samples: 3559025500. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 12:13:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:13:10,685][06909] Updated weights for policy 0, policy_version 223153 (0.0031) [2024-06-28 12:13:13,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44509.8, 300 sec: 43987.2). Total num frames: 3656286208. Throughput: 0: 44247.1. Samples: 3559156840. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 12:13:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:13:14,301][06909] Updated weights for policy 0, policy_version 223163 (0.0023) [2024-06-28 12:13:18,174][06909] Updated weights for policy 0, policy_version 223173 (0.0042) [2024-06-28 12:13:18,850][06674] Fps is (10 sec: 44237.8, 60 sec: 44240.2, 300 sec: 44042.4). Total num frames: 3656499200. Throughput: 0: 44064.2. Samples: 3559424120. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 12:13:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:13:21,706][06909] Updated weights for policy 0, policy_version 223183 (0.0032) [2024-06-28 12:13:23,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 3656712192. Throughput: 0: 44059.1. Samples: 3559680560. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 12:13:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:13:25,471][06909] Updated weights for policy 0, policy_version 223193 (0.0030) [2024-06-28 12:13:27,568][06887] Signal inference workers to stop experience collection... (50400 times) [2024-06-28 12:13:27,568][06887] Signal inference workers to resume experience collection... (50400 times) [2024-06-28 12:13:27,605][06909] InferenceWorker_p0-w0: stopping experience collection (50400 times) [2024-06-28 12:13:27,612][06909] InferenceWorker_p0-w0: resuming experience collection (50400 times) [2024-06-28 12:13:28,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 3656941568. Throughput: 0: 44207.5. Samples: 3559814660. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 12:13:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:13:29,170][06909] Updated weights for policy 0, policy_version 223203 (0.0025) [2024-06-28 12:13:33,029][06909] Updated weights for policy 0, policy_version 223213 (0.0031) [2024-06-28 12:13:33,852][06674] Fps is (10 sec: 45866.1, 60 sec: 44235.4, 300 sec: 44042.1). Total num frames: 3657170944. Throughput: 0: 44089.5. Samples: 3560084800. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 12:13:33,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:13:36,546][06909] Updated weights for policy 0, policy_version 223223 (0.0034) [2024-06-28 12:13:38,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3657383936. Throughput: 0: 44201.5. Samples: 3560346560. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 12:13:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:13:40,260][06909] Updated weights for policy 0, policy_version 223233 (0.0036) [2024-06-28 12:13:43,815][06909] Updated weights for policy 0, policy_version 223243 (0.0029) [2024-06-28 12:13:43,850][06674] Fps is (10 sec: 44245.6, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 3657613312. Throughput: 0: 43971.6. Samples: 3560475400. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 12:13:43,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 12:13:47,918][06909] Updated weights for policy 0, policy_version 223253 (0.0025) [2024-06-28 12:13:48,850][06674] Fps is (10 sec: 44237.0, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3657826304. Throughput: 0: 44051.6. Samples: 3560749200. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 12:13:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:13:48,914][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000223257_3657842688.pth... [2024-06-28 12:13:48,958][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000222610_3647242240.pth [2024-06-28 12:13:51,734][06909] Updated weights for policy 0, policy_version 223263 (0.0035) [2024-06-28 12:13:53,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43690.8, 300 sec: 43931.4). Total num frames: 3658022912. Throughput: 0: 44062.9. Samples: 3561008320. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 12:13:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:13:55,177][06909] Updated weights for policy 0, policy_version 223273 (0.0031) [2024-06-28 12:13:58,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3658252288. Throughput: 0: 43955.7. Samples: 3561134840. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 12:13:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:13:58,861][06909] Updated weights for policy 0, policy_version 223283 (0.0040) [2024-06-28 12:14:02,605][06909] Updated weights for policy 0, policy_version 223293 (0.0028) [2024-06-28 12:14:03,850][06674] Fps is (10 sec: 47513.4, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 3658498048. Throughput: 0: 44104.0. Samples: 3561408800. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 12:14:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:14:06,164][06909] Updated weights for policy 0, policy_version 223303 (0.0035) [2024-06-28 12:14:08,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.9, 300 sec: 44097.9). Total num frames: 3658711040. Throughput: 0: 44316.1. Samples: 3561674780. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 12:14:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:14:09,811][06909] Updated weights for policy 0, policy_version 223313 (0.0031) [2024-06-28 12:14:13,471][06909] Updated weights for policy 0, policy_version 223323 (0.0027) [2024-06-28 12:14:13,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.9, 300 sec: 44042.4). Total num frames: 3658924032. Throughput: 0: 44224.1. Samples: 3561804740. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 12:14:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:14:17,642][06909] Updated weights for policy 0, policy_version 223333 (0.0026) [2024-06-28 12:14:18,855][06674] Fps is (10 sec: 42576.1, 60 sec: 43959.9, 300 sec: 44041.6). Total num frames: 3659137024. Throughput: 0: 44152.4. Samples: 3562071800. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 12:14:18,855][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:14:20,866][06909] Updated weights for policy 0, policy_version 223343 (0.0029) [2024-06-28 12:14:23,850][06674] Fps is (10 sec: 44236.0, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3659366400. Throughput: 0: 44146.2. Samples: 3562333140. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 12:14:23,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 12:14:25,094][06909] Updated weights for policy 0, policy_version 223353 (0.0034) [2024-06-28 12:14:28,790][06909] Updated weights for policy 0, policy_version 223363 (0.0043) [2024-06-28 12:14:28,850][06674] Fps is (10 sec: 44259.4, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3659579392. Throughput: 0: 44157.7. Samples: 3562462500. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 12:14:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:14:32,250][06909] Updated weights for policy 0, policy_version 223373 (0.0030) [2024-06-28 12:14:33,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43965.2, 300 sec: 44098.0). Total num frames: 3659808768. Throughput: 0: 43956.0. Samples: 3562727220. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 12:14:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:14:36,022][06909] Updated weights for policy 0, policy_version 223383 (0.0027) [2024-06-28 12:14:38,850][06674] Fps is (10 sec: 44237.4, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3660021760. Throughput: 0: 44140.0. Samples: 3562994620. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 12:14:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:14:39,953][06909] Updated weights for policy 0, policy_version 223393 (0.0032) [2024-06-28 12:14:43,168][06909] Updated weights for policy 0, policy_version 223403 (0.0028) [2024-06-28 12:14:43,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 3660251136. Throughput: 0: 44179.5. Samples: 3563122920. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 12:14:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:14:47,118][06909] Updated weights for policy 0, policy_version 223413 (0.0032) [2024-06-28 12:14:48,854][06674] Fps is (10 sec: 44216.6, 60 sec: 43960.4, 300 sec: 44097.3). Total num frames: 3660464128. Throughput: 0: 44048.4. Samples: 3563391180. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 12:14:48,855][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:14:50,426][06909] Updated weights for policy 0, policy_version 223423 (0.0030) [2024-06-28 12:14:53,851][06674] Fps is (10 sec: 42594.7, 60 sec: 44236.1, 300 sec: 43986.7). Total num frames: 3660677120. Throughput: 0: 44104.9. Samples: 3563659540. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 12:14:53,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:14:54,768][06909] Updated weights for policy 0, policy_version 223433 (0.0025) [2024-06-28 12:14:57,622][06909] Updated weights for policy 0, policy_version 223443 (0.0034) [2024-06-28 12:14:58,850][06674] Fps is (10 sec: 45895.5, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 3660922880. Throughput: 0: 44083.8. Samples: 3563788520. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 12:14:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:15:02,306][06909] Updated weights for policy 0, policy_version 223453 (0.0036) [2024-06-28 12:15:03,275][06887] Signal inference workers to stop experience collection... (50450 times) [2024-06-28 12:15:03,275][06887] Signal inference workers to resume experience collection... (50450 times) [2024-06-28 12:15:03,320][06909] InferenceWorker_p0-w0: stopping experience collection (50450 times) [2024-06-28 12:15:03,320][06909] InferenceWorker_p0-w0: resuming experience collection (50450 times) [2024-06-28 12:15:03,850][06674] Fps is (10 sec: 44241.1, 60 sec: 43690.7, 300 sec: 43987.2). Total num frames: 3661119488. Throughput: 0: 44025.6. Samples: 3564052720. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 12:15:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:15:05,678][06909] Updated weights for policy 0, policy_version 223463 (0.0027) [2024-06-28 12:15:08,850][06674] Fps is (10 sec: 40960.6, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 3661332480. Throughput: 0: 44143.7. Samples: 3564319600. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 12:15:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:15:09,379][06909] Updated weights for policy 0, policy_version 223473 (0.0035) [2024-06-28 12:15:12,886][06909] Updated weights for policy 0, policy_version 223483 (0.0026) [2024-06-28 12:15:13,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 3661578240. Throughput: 0: 44163.2. Samples: 3564449840. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 12:15:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:15:16,857][06909] Updated weights for policy 0, policy_version 223493 (0.0031) [2024-06-28 12:15:18,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44240.6, 300 sec: 44098.0). Total num frames: 3661791232. Throughput: 0: 44204.5. Samples: 3564716420. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 12:15:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:15:20,215][06909] Updated weights for policy 0, policy_version 223503 (0.0027) [2024-06-28 12:15:23,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3662004224. Throughput: 0: 44212.8. Samples: 3564984200. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 12:15:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:15:24,169][06909] Updated weights for policy 0, policy_version 223513 (0.0034) [2024-06-28 12:15:27,478][06909] Updated weights for policy 0, policy_version 223523 (0.0036) [2024-06-28 12:15:28,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.9, 300 sec: 44153.5). Total num frames: 3662233600. Throughput: 0: 44144.9. Samples: 3565109440. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 12:15:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:15:31,711][06909] Updated weights for policy 0, policy_version 223533 (0.0028) [2024-06-28 12:15:33,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3662446592. Throughput: 0: 44117.4. Samples: 3565376260. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 12:15:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 12:15:35,262][06909] Updated weights for policy 0, policy_version 223543 (0.0030) [2024-06-28 12:15:38,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3662659584. Throughput: 0: 43955.0. Samples: 3565637480. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 12:15:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:15:39,145][06909] Updated weights for policy 0, policy_version 223553 (0.0040) [2024-06-28 12:15:43,006][06909] Updated weights for policy 0, policy_version 223563 (0.0026) [2024-06-28 12:15:43,850][06674] Fps is (10 sec: 47513.3, 60 sec: 44509.9, 300 sec: 44264.6). Total num frames: 3662921728. Throughput: 0: 43993.0. Samples: 3565768200. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 12:15:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:15:46,838][06909] Updated weights for policy 0, policy_version 223573 (0.0032) [2024-06-28 12:15:48,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44240.1, 300 sec: 44098.0). Total num frames: 3663118336. Throughput: 0: 43962.1. Samples: 3566031020. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 12:15:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:15:48,993][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000223580_3663134720.pth... [2024-06-28 12:15:49,044][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000222932_3652517888.pth [2024-06-28 12:15:50,197][06909] Updated weights for policy 0, policy_version 223583 (0.0040) [2024-06-28 12:15:53,850][06674] Fps is (10 sec: 39321.7, 60 sec: 43964.4, 300 sec: 44042.4). Total num frames: 3663314944. Throughput: 0: 43945.3. Samples: 3566297140. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 12:15:53,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 12:15:54,304][06909] Updated weights for policy 0, policy_version 223593 (0.0036) [2024-06-28 12:15:57,582][06909] Updated weights for policy 0, policy_version 223603 (0.0032) [2024-06-28 12:15:58,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.7, 300 sec: 44042.8). Total num frames: 3663544320. Throughput: 0: 43819.9. Samples: 3566421740. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 12:15:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:16:01,647][06909] Updated weights for policy 0, policy_version 223613 (0.0035) [2024-06-28 12:16:03,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3663773696. Throughput: 0: 43780.0. Samples: 3566686520. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 12:16:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:16:05,082][06909] Updated weights for policy 0, policy_version 223623 (0.0037) [2024-06-28 12:16:08,850][06674] Fps is (10 sec: 40960.8, 60 sec: 43690.7, 300 sec: 43931.7). Total num frames: 3663953920. Throughput: 0: 43805.0. Samples: 3566955420. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 12:16:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:16:09,293][06909] Updated weights for policy 0, policy_version 223633 (0.0036) [2024-06-28 12:16:13,004][06909] Updated weights for policy 0, policy_version 223643 (0.0027) [2024-06-28 12:16:13,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 3664232448. Throughput: 0: 43884.4. Samples: 3567084240. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 12:16:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:16:16,851][06909] Updated weights for policy 0, policy_version 223653 (0.0029) [2024-06-28 12:16:18,850][06674] Fps is (10 sec: 47512.9, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3664429056. Throughput: 0: 43835.9. Samples: 3567348880. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 12:16:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:16:20,195][06909] Updated weights for policy 0, policy_version 223663 (0.0031) [2024-06-28 12:16:23,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3664642048. Throughput: 0: 43884.1. Samples: 3567612260. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 12:16:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:16:24,227][06909] Updated weights for policy 0, policy_version 223673 (0.0024) [2024-06-28 12:16:27,547][06909] Updated weights for policy 0, policy_version 223683 (0.0039) [2024-06-28 12:16:28,850][06674] Fps is (10 sec: 45875.5, 60 sec: 44236.8, 300 sec: 44209.0). Total num frames: 3664887808. Throughput: 0: 44004.9. Samples: 3567748420. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 12:16:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:16:31,422][06909] Updated weights for policy 0, policy_version 223693 (0.0040) [2024-06-28 12:16:33,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3665084416. Throughput: 0: 43901.0. Samples: 3568006560. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 12:16:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:16:34,844][06909] Updated weights for policy 0, policy_version 223703 (0.0029) [2024-06-28 12:16:38,823][06909] Updated weights for policy 0, policy_version 223713 (0.0046) [2024-06-28 12:16:38,850][06674] Fps is (10 sec: 42598.1, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 3665313792. Throughput: 0: 43984.0. Samples: 3568276420. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 12:16:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:16:42,670][06909] Updated weights for policy 0, policy_version 223723 (0.0035) [2024-06-28 12:16:42,872][06887] Signal inference workers to stop experience collection... (50500 times) [2024-06-28 12:16:42,927][06909] InferenceWorker_p0-w0: stopping experience collection (50500 times) [2024-06-28 12:16:42,986][06887] Signal inference workers to resume experience collection... (50500 times) [2024-06-28 12:16:42,986][06909] InferenceWorker_p0-w0: resuming experience collection (50500 times) [2024-06-28 12:16:43,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43690.7, 300 sec: 44153.5). Total num frames: 3665543168. Throughput: 0: 44077.0. Samples: 3568405200. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 12:16:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:16:46,322][06909] Updated weights for policy 0, policy_version 223733 (0.0030) [2024-06-28 12:16:48,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 3665756160. Throughput: 0: 44054.5. Samples: 3568668980. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 12:16:48,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:16:49,977][06909] Updated weights for policy 0, policy_version 223743 (0.0028) [2024-06-28 12:16:53,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3665952768. Throughput: 0: 44035.1. Samples: 3568937000. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 12:16:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:16:53,876][06909] Updated weights for policy 0, policy_version 223753 (0.0035) [2024-06-28 12:16:57,344][06909] Updated weights for policy 0, policy_version 223763 (0.0031) [2024-06-28 12:16:58,850][06674] Fps is (10 sec: 44237.4, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 3666198528. Throughput: 0: 44071.2. Samples: 3569067440. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 12:16:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:17:01,274][06909] Updated weights for policy 0, policy_version 223773 (0.0028) [2024-06-28 12:17:03,850][06674] Fps is (10 sec: 44236.0, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 3666395136. Throughput: 0: 44080.4. Samples: 3569332500. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 12:17:03,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 12:17:04,721][06909] Updated weights for policy 0, policy_version 223783 (0.0027) [2024-06-28 12:17:08,474][06909] Updated weights for policy 0, policy_version 223793 (0.0028) [2024-06-28 12:17:08,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44782.9, 300 sec: 44153.5). Total num frames: 3666640896. Throughput: 0: 43985.7. Samples: 3569591620. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 12:17:08,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 12:17:12,038][06909] Updated weights for policy 0, policy_version 223803 (0.0028) [2024-06-28 12:17:13,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43690.7, 300 sec: 44098.6). Total num frames: 3666853888. Throughput: 0: 43906.2. Samples: 3569724200. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 12:17:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:17:16,011][06909] Updated weights for policy 0, policy_version 223813 (0.0032) [2024-06-28 12:17:18,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3667066880. Throughput: 0: 43916.2. Samples: 3569982800. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 12:17:18,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:17:19,538][06909] Updated weights for policy 0, policy_version 223823 (0.0036) [2024-06-28 12:17:23,333][06909] Updated weights for policy 0, policy_version 223833 (0.0029) [2024-06-28 12:17:23,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 3667296256. Throughput: 0: 43793.0. Samples: 3570247100. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 12:17:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:17:27,323][06909] Updated weights for policy 0, policy_version 223843 (0.0037) [2024-06-28 12:17:28,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 3667509248. Throughput: 0: 43991.9. Samples: 3570384840. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 12:17:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:17:31,069][06909] Updated weights for policy 0, policy_version 223853 (0.0049) [2024-06-28 12:17:33,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3667738624. Throughput: 0: 43928.6. Samples: 3570645760. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 12:17:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:17:34,704][06909] Updated weights for policy 0, policy_version 223863 (0.0031) [2024-06-28 12:17:38,459][06909] Updated weights for policy 0, policy_version 223873 (0.0036) [2024-06-28 12:17:38,852][06674] Fps is (10 sec: 44228.1, 60 sec: 43962.3, 300 sec: 44097.6). Total num frames: 3667951616. Throughput: 0: 43853.9. Samples: 3570910520. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 12:17:38,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:17:42,178][06909] Updated weights for policy 0, policy_version 223883 (0.0036) [2024-06-28 12:17:43,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 3668164608. Throughput: 0: 43859.0. Samples: 3571041100. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 12:17:43,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 12:17:45,635][06909] Updated weights for policy 0, policy_version 223893 (0.0026) [2024-06-28 12:17:48,850][06674] Fps is (10 sec: 44245.3, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3668393984. Throughput: 0: 43928.9. Samples: 3571309300. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 12:17:48,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 12:17:48,867][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000223901_3668393984.pth... [2024-06-28 12:17:48,917][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000223257_3657842688.pth [2024-06-28 12:17:49,381][06909] Updated weights for policy 0, policy_version 223903 (0.0045) [2024-06-28 12:17:53,326][06909] Updated weights for policy 0, policy_version 223913 (0.0022) [2024-06-28 12:17:53,850][06674] Fps is (10 sec: 45875.7, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 3668623360. Throughput: 0: 44009.8. Samples: 3571572060. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 12:17:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:17:57,206][06909] Updated weights for policy 0, policy_version 223923 (0.0030) [2024-06-28 12:17:58,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 3668819968. Throughput: 0: 43988.4. Samples: 3571703680. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 12:17:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:18:00,579][06909] Updated weights for policy 0, policy_version 223933 (0.0027) [2024-06-28 12:18:03,852][06674] Fps is (10 sec: 42589.7, 60 sec: 44235.4, 300 sec: 44042.1). Total num frames: 3669049344. Throughput: 0: 44144.4. Samples: 3571969380. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 12:18:03,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:18:04,540][06909] Updated weights for policy 0, policy_version 223943 (0.0034) [2024-06-28 12:18:08,365][06909] Updated weights for policy 0, policy_version 223953 (0.0041) [2024-06-28 12:18:08,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3669278720. Throughput: 0: 44142.2. Samples: 3572233500. Policy #0 lag: (min: 1.0, avg: 9.8, max: 20.0) [2024-06-28 12:18:08,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:18:11,961][06909] Updated weights for policy 0, policy_version 223963 (0.0032) [2024-06-28 12:18:13,850][06674] Fps is (10 sec: 44245.3, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3669491712. Throughput: 0: 43972.9. Samples: 3572363620. Policy #0 lag: (min: 1.0, avg: 9.8, max: 20.0) [2024-06-28 12:18:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:18:15,619][06909] Updated weights for policy 0, policy_version 223973 (0.0028) [2024-06-28 12:18:18,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3669704704. Throughput: 0: 43953.7. Samples: 3572623680. Policy #0 lag: (min: 1.0, avg: 9.8, max: 20.0) [2024-06-28 12:18:18,859][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:18:19,267][06909] Updated weights for policy 0, policy_version 223983 (0.0032) [2024-06-28 12:18:22,940][06909] Updated weights for policy 0, policy_version 223993 (0.0048) [2024-06-28 12:18:23,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.6, 300 sec: 44042.4). Total num frames: 3669934080. Throughput: 0: 43932.6. Samples: 3572887400. Policy #0 lag: (min: 1.0, avg: 9.8, max: 20.0) [2024-06-28 12:18:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:18:26,965][06909] Updated weights for policy 0, policy_version 224003 (0.0029) [2024-06-28 12:18:28,716][06887] Signal inference workers to stop experience collection... (50550 times) [2024-06-28 12:18:28,773][06909] InferenceWorker_p0-w0: stopping experience collection (50550 times) [2024-06-28 12:18:28,780][06887] Signal inference workers to resume experience collection... (50550 times) [2024-06-28 12:18:28,795][06909] InferenceWorker_p0-w0: resuming experience collection (50550 times) [2024-06-28 12:18:28,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.8, 300 sec: 43931.6). Total num frames: 3670130688. Throughput: 0: 43989.4. Samples: 3573020620. Policy #0 lag: (min: 1.0, avg: 9.8, max: 20.0) [2024-06-28 12:18:28,859][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:18:30,258][06909] Updated weights for policy 0, policy_version 224013 (0.0027) [2024-06-28 12:18:33,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3670360064. Throughput: 0: 43889.9. Samples: 3573284340. Policy #0 lag: (min: 1.0, avg: 9.8, max: 20.0) [2024-06-28 12:18:33,853][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:18:34,327][06909] Updated weights for policy 0, policy_version 224023 (0.0040) [2024-06-28 12:18:37,736][06909] Updated weights for policy 0, policy_version 224033 (0.0026) [2024-06-28 12:18:38,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44238.4, 300 sec: 44042.4). Total num frames: 3670605824. Throughput: 0: 43912.9. Samples: 3573548140. Policy #0 lag: (min: 1.0, avg: 9.8, max: 20.0) [2024-06-28 12:18:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:18:41,669][06909] Updated weights for policy 0, policy_version 224043 (0.0026) [2024-06-28 12:18:43,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3670802432. Throughput: 0: 44139.7. Samples: 3573689960. Policy #0 lag: (min: 1.0, avg: 9.8, max: 20.0) [2024-06-28 12:18:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:18:44,806][06909] Updated weights for policy 0, policy_version 224053 (0.0044) [2024-06-28 12:18:48,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.9, 300 sec: 44097.9). Total num frames: 3671031808. Throughput: 0: 43986.9. Samples: 3573948700. Policy #0 lag: (min: 1.0, avg: 9.8, max: 20.0) [2024-06-28 12:18:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:18:49,087][06909] Updated weights for policy 0, policy_version 224063 (0.0029) [2024-06-28 12:18:52,663][06909] Updated weights for policy 0, policy_version 224073 (0.0031) [2024-06-28 12:18:53,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 3671261184. Throughput: 0: 43740.9. Samples: 3574201840. Policy #0 lag: (min: 1.0, avg: 9.8, max: 20.0) [2024-06-28 12:18:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 12:18:56,878][06909] Updated weights for policy 0, policy_version 224083 (0.0030) [2024-06-28 12:18:58,856][06674] Fps is (10 sec: 42572.5, 60 sec: 43959.4, 300 sec: 43930.4). Total num frames: 3671457792. Throughput: 0: 43850.6. Samples: 3574337160. Policy #0 lag: (min: 1.0, avg: 9.8, max: 20.0) [2024-06-28 12:18:58,856][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 12:19:00,066][06909] Updated weights for policy 0, policy_version 224093 (0.0035) [2024-06-28 12:19:03,856][06674] Fps is (10 sec: 42572.2, 60 sec: 43960.7, 300 sec: 43986.0). Total num frames: 3671687168. Throughput: 0: 43954.9. Samples: 3574601920. Policy #0 lag: (min: 1.0, avg: 9.8, max: 20.0) [2024-06-28 12:19:03,857][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:19:04,208][06909] Updated weights for policy 0, policy_version 224103 (0.0033) [2024-06-28 12:19:07,873][06909] Updated weights for policy 0, policy_version 224113 (0.0035) [2024-06-28 12:19:08,850][06674] Fps is (10 sec: 45903.0, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3671916544. Throughput: 0: 43911.2. Samples: 3574863400. Policy #0 lag: (min: 1.0, avg: 9.8, max: 20.0) [2024-06-28 12:19:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:19:11,566][06909] Updated weights for policy 0, policy_version 224123 (0.0032) [2024-06-28 12:19:13,850][06674] Fps is (10 sec: 44264.0, 60 sec: 43963.8, 300 sec: 44043.2). Total num frames: 3672129536. Throughput: 0: 44064.0. Samples: 3575003500. Policy #0 lag: (min: 1.0, avg: 9.8, max: 20.0) [2024-06-28 12:19:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:19:15,094][06909] Updated weights for policy 0, policy_version 224133 (0.0028) [2024-06-28 12:19:18,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3672342528. Throughput: 0: 43998.6. Samples: 3575264280. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 12:19:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:19:19,089][06909] Updated weights for policy 0, policy_version 224143 (0.0035) [2024-06-28 12:19:22,739][06909] Updated weights for policy 0, policy_version 224153 (0.0032) [2024-06-28 12:19:23,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3672571904. Throughput: 0: 43862.2. Samples: 3575521940. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 12:19:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:19:26,760][06909] Updated weights for policy 0, policy_version 224163 (0.0037) [2024-06-28 12:19:28,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 3672768512. Throughput: 0: 43682.6. Samples: 3575655680. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 12:19:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:19:30,150][06909] Updated weights for policy 0, policy_version 224173 (0.0039) [2024-06-28 12:19:33,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3672997888. Throughput: 0: 43819.1. Samples: 3575920560. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 12:19:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:19:34,041][06909] Updated weights for policy 0, policy_version 224183 (0.0042) [2024-06-28 12:19:37,675][06909] Updated weights for policy 0, policy_version 224193 (0.0028) [2024-06-28 12:19:38,850][06674] Fps is (10 sec: 47514.1, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3673243648. Throughput: 0: 43991.6. Samples: 3576181460. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 12:19:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:19:41,400][06909] Updated weights for policy 0, policy_version 224203 (0.0031) [2024-06-28 12:19:43,850][06674] Fps is (10 sec: 44236.0, 60 sec: 43963.6, 300 sec: 43987.5). Total num frames: 3673440256. Throughput: 0: 44060.0. Samples: 3576319600. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 12:19:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:19:44,827][06909] Updated weights for policy 0, policy_version 224213 (0.0033) [2024-06-28 12:19:48,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43690.7, 300 sec: 43987.0). Total num frames: 3673653248. Throughput: 0: 44109.2. Samples: 3576586560. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 12:19:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:19:48,942][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000224223_3673669632.pth... [2024-06-28 12:19:48,944][06909] Updated weights for policy 0, policy_version 224223 (0.0032) [2024-06-28 12:19:48,999][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000223580_3663134720.pth [2024-06-28 12:19:52,162][06909] Updated weights for policy 0, policy_version 224233 (0.0031) [2024-06-28 12:19:53,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3673899008. Throughput: 0: 44073.3. Samples: 3576846700. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 12:19:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:19:56,223][06909] Updated weights for policy 0, policy_version 224243 (0.0030) [2024-06-28 12:19:58,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44241.3, 300 sec: 44042.4). Total num frames: 3674112000. Throughput: 0: 43986.6. Samples: 3576982900. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 12:19:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:19:59,607][06887] Signal inference workers to stop experience collection... (50600 times) [2024-06-28 12:19:59,608][06887] Signal inference workers to resume experience collection... (50600 times) [2024-06-28 12:19:59,627][06909] InferenceWorker_p0-w0: stopping experience collection (50600 times) [2024-06-28 12:19:59,628][06909] InferenceWorker_p0-w0: resuming experience collection (50600 times) [2024-06-28 12:19:59,751][06909] Updated weights for policy 0, policy_version 224253 (0.0030) [2024-06-28 12:20:03,698][06909] Updated weights for policy 0, policy_version 224263 (0.0034) [2024-06-28 12:20:03,852][06674] Fps is (10 sec: 42588.7, 60 sec: 43966.5, 300 sec: 44042.1). Total num frames: 3674324992. Throughput: 0: 43986.7. Samples: 3577243780. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 12:20:03,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:20:07,240][06909] Updated weights for policy 0, policy_version 224273 (0.0038) [2024-06-28 12:20:08,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3674554368. Throughput: 0: 44089.2. Samples: 3577505960. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 12:20:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:20:11,038][06909] Updated weights for policy 0, policy_version 224283 (0.0033) [2024-06-28 12:20:13,850][06674] Fps is (10 sec: 44246.3, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 3674767360. Throughput: 0: 44144.8. Samples: 3577642200. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 12:20:13,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:20:14,821][06909] Updated weights for policy 0, policy_version 224293 (0.0026) [2024-06-28 12:20:18,536][06909] Updated weights for policy 0, policy_version 224303 (0.0024) [2024-06-28 12:20:18,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3674980352. Throughput: 0: 44122.6. Samples: 3577906080. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 12:20:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:20:22,375][06909] Updated weights for policy 0, policy_version 224313 (0.0043) [2024-06-28 12:20:23,850][06674] Fps is (10 sec: 45875.3, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 3675226112. Throughput: 0: 44027.8. Samples: 3578162720. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 12:20:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:20:26,007][06909] Updated weights for policy 0, policy_version 224323 (0.0024) [2024-06-28 12:20:28,856][06674] Fps is (10 sec: 44209.9, 60 sec: 44232.3, 300 sec: 43986.0). Total num frames: 3675422720. Throughput: 0: 44087.5. Samples: 3578303800. Policy #0 lag: (min: 1.0, avg: 10.7, max: 21.0) [2024-06-28 12:20:28,856][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:20:29,567][06909] Updated weights for policy 0, policy_version 224333 (0.0025) [2024-06-28 12:20:33,294][06909] Updated weights for policy 0, policy_version 224343 (0.0027) [2024-06-28 12:20:33,850][06674] Fps is (10 sec: 40960.6, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3675635712. Throughput: 0: 44119.0. Samples: 3578571920. Policy #0 lag: (min: 1.0, avg: 10.7, max: 21.0) [2024-06-28 12:20:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:20:37,382][06909] Updated weights for policy 0, policy_version 224353 (0.0028) [2024-06-28 12:20:38,850][06674] Fps is (10 sec: 44263.7, 60 sec: 43690.6, 300 sec: 43875.8). Total num frames: 3675865088. Throughput: 0: 44028.9. Samples: 3578828000. Policy #0 lag: (min: 1.0, avg: 10.7, max: 21.0) [2024-06-28 12:20:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:20:40,969][06909] Updated weights for policy 0, policy_version 224363 (0.0025) [2024-06-28 12:20:43,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 3676094464. Throughput: 0: 43996.9. Samples: 3578962760. Policy #0 lag: (min: 1.0, avg: 10.7, max: 21.0) [2024-06-28 12:20:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 12:20:44,579][06909] Updated weights for policy 0, policy_version 224373 (0.0033) [2024-06-28 12:20:48,273][06909] Updated weights for policy 0, policy_version 224383 (0.0032) [2024-06-28 12:20:48,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3676291072. Throughput: 0: 43943.1. Samples: 3579221120. Policy #0 lag: (min: 1.0, avg: 10.7, max: 21.0) [2024-06-28 12:20:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:20:52,237][06909] Updated weights for policy 0, policy_version 224393 (0.0041) [2024-06-28 12:20:53,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3676536832. Throughput: 0: 43988.1. Samples: 3579485420. Policy #0 lag: (min: 1.0, avg: 10.7, max: 21.0) [2024-06-28 12:20:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 12:20:55,756][06909] Updated weights for policy 0, policy_version 224403 (0.0035) [2024-06-28 12:20:58,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3676749824. Throughput: 0: 44125.0. Samples: 3579627820. Policy #0 lag: (min: 1.0, avg: 10.7, max: 21.0) [2024-06-28 12:20:58,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:20:59,465][06909] Updated weights for policy 0, policy_version 224413 (0.0028) [2024-06-28 12:21:03,356][06909] Updated weights for policy 0, policy_version 224423 (0.0029) [2024-06-28 12:21:03,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43692.4, 300 sec: 44042.4). Total num frames: 3676946432. Throughput: 0: 44092.5. Samples: 3579890240. Policy #0 lag: (min: 1.0, avg: 10.7, max: 21.0) [2024-06-28 12:21:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:21:06,916][06909] Updated weights for policy 0, policy_version 224433 (0.0031) [2024-06-28 12:21:08,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.8, 300 sec: 43931.4). Total num frames: 3677192192. Throughput: 0: 44238.4. Samples: 3580153440. Policy #0 lag: (min: 1.0, avg: 10.7, max: 21.0) [2024-06-28 12:21:08,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 12:21:10,529][06909] Updated weights for policy 0, policy_version 224443 (0.0031) [2024-06-28 12:21:13,850][06674] Fps is (10 sec: 44236.5, 60 sec: 43690.8, 300 sec: 43931.3). Total num frames: 3677388800. Throughput: 0: 44048.6. Samples: 3580285720. Policy #0 lag: (min: 1.0, avg: 10.7, max: 21.0) [2024-06-28 12:21:13,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 12:21:14,382][06909] Updated weights for policy 0, policy_version 224453 (0.0033) [2024-06-28 12:21:18,106][06909] Updated weights for policy 0, policy_version 224463 (0.0040) [2024-06-28 12:21:18,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3677618176. Throughput: 0: 43865.7. Samples: 3580545880. Policy #0 lag: (min: 1.0, avg: 10.7, max: 21.0) [2024-06-28 12:21:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:21:22,110][06909] Updated weights for policy 0, policy_version 224473 (0.0035) [2024-06-28 12:21:23,850][06674] Fps is (10 sec: 47514.1, 60 sec: 43963.9, 300 sec: 43986.9). Total num frames: 3677863936. Throughput: 0: 44119.6. Samples: 3580813380. Policy #0 lag: (min: 1.0, avg: 10.7, max: 21.0) [2024-06-28 12:21:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:21:25,348][06909] Updated weights for policy 0, policy_version 224483 (0.0041) [2024-06-28 12:21:28,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44241.3, 300 sec: 44042.4). Total num frames: 3678076928. Throughput: 0: 44117.8. Samples: 3580948060. Policy #0 lag: (min: 1.0, avg: 10.7, max: 21.0) [2024-06-28 12:21:28,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:21:29,195][06909] Updated weights for policy 0, policy_version 224493 (0.0039) [2024-06-28 12:21:32,974][06909] Updated weights for policy 0, policy_version 224503 (0.0031) [2024-06-28 12:21:33,850][06674] Fps is (10 sec: 42597.8, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 3678289920. Throughput: 0: 44133.3. Samples: 3581207120. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 12:21:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:21:36,526][06909] Updated weights for policy 0, policy_version 224513 (0.0028) [2024-06-28 12:21:38,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43963.7, 300 sec: 43931.3). Total num frames: 3678502912. Throughput: 0: 44244.4. Samples: 3581476420. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 12:21:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:21:40,357][06909] Updated weights for policy 0, policy_version 224523 (0.0046) [2024-06-28 12:21:43,850][06674] Fps is (10 sec: 44237.2, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3678732288. Throughput: 0: 43973.4. Samples: 3581606620. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 12:21:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:21:44,035][06909] Updated weights for policy 0, policy_version 224533 (0.0029) [2024-06-28 12:21:47,658][06909] Updated weights for policy 0, policy_version 224543 (0.0031) [2024-06-28 12:21:48,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3678945280. Throughput: 0: 43918.6. Samples: 3581866580. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 12:21:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:21:48,891][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000224546_3678961664.pth... [2024-06-28 12:21:48,941][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000223901_3668393984.pth [2024-06-28 12:21:51,130][06887] Signal inference workers to stop experience collection... (50650 times) [2024-06-28 12:21:51,162][06909] InferenceWorker_p0-w0: stopping experience collection (50650 times) [2024-06-28 12:21:51,184][06887] Signal inference workers to resume experience collection... (50650 times) [2024-06-28 12:21:51,187][06909] InferenceWorker_p0-w0: resuming experience collection (50650 times) [2024-06-28 12:21:51,322][06909] Updated weights for policy 0, policy_version 224553 (0.0030) [2024-06-28 12:21:53,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 3679158272. Throughput: 0: 43975.1. Samples: 3582132320. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 12:21:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:21:55,193][06909] Updated weights for policy 0, policy_version 224563 (0.0035) [2024-06-28 12:21:58,852][06674] Fps is (10 sec: 44227.8, 60 sec: 43962.2, 300 sec: 44042.1). Total num frames: 3679387648. Throughput: 0: 43978.9. Samples: 3582264860. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 12:21:58,852][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:21:59,029][06909] Updated weights for policy 0, policy_version 224573 (0.0049) [2024-06-28 12:22:02,443][06909] Updated weights for policy 0, policy_version 224583 (0.0025) [2024-06-28 12:22:03,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 43931.3). Total num frames: 3679600640. Throughput: 0: 43944.5. Samples: 3582523380. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 12:22:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:22:06,524][06909] Updated weights for policy 0, policy_version 224593 (0.0034) [2024-06-28 12:22:08,850][06674] Fps is (10 sec: 42607.3, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 3679813632. Throughput: 0: 43956.8. Samples: 3582791440. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 12:22:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:22:09,984][06909] Updated weights for policy 0, policy_version 224603 (0.0027) [2024-06-28 12:22:13,682][06909] Updated weights for policy 0, policy_version 224613 (0.0032) [2024-06-28 12:22:13,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 3680059392. Throughput: 0: 43929.3. Samples: 3582924880. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 12:22:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:22:17,540][06909] Updated weights for policy 0, policy_version 224623 (0.0038) [2024-06-28 12:22:18,852][06674] Fps is (10 sec: 45865.6, 60 sec: 44235.3, 300 sec: 43986.6). Total num frames: 3680272384. Throughput: 0: 43937.6. Samples: 3583184400. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 12:22:18,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:22:21,374][06909] Updated weights for policy 0, policy_version 224633 (0.0020) [2024-06-28 12:22:23,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43417.5, 300 sec: 43931.3). Total num frames: 3680468992. Throughput: 0: 43942.2. Samples: 3583453820. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 12:22:23,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:22:25,202][06909] Updated weights for policy 0, policy_version 224643 (0.0027) [2024-06-28 12:22:28,652][06909] Updated weights for policy 0, policy_version 224653 (0.0031) [2024-06-28 12:22:28,850][06674] Fps is (10 sec: 44245.3, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 3680714752. Throughput: 0: 43932.7. Samples: 3583583600. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 12:22:28,851][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 12:22:32,394][06909] Updated weights for policy 0, policy_version 224663 (0.0027) [2024-06-28 12:22:33,856][06674] Fps is (10 sec: 44210.1, 60 sec: 43686.3, 300 sec: 43930.7). Total num frames: 3680911360. Throughput: 0: 43907.4. Samples: 3583842680. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 12:22:33,856][06674] Avg episode reward: [(0, '0.444')] [2024-06-28 12:22:36,198][06909] Updated weights for policy 0, policy_version 224673 (0.0029) [2024-06-28 12:22:38,850][06674] Fps is (10 sec: 40960.5, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 3681124352. Throughput: 0: 44015.9. Samples: 3584113040. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 12:22:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:22:39,687][06909] Updated weights for policy 0, policy_version 224683 (0.0032) [2024-06-28 12:22:43,386][06909] Updated weights for policy 0, policy_version 224693 (0.0027) [2024-06-28 12:22:43,850][06674] Fps is (10 sec: 45900.4, 60 sec: 43963.3, 300 sec: 43986.8). Total num frames: 3681370112. Throughput: 0: 44123.2. Samples: 3584250340. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 12:22:43,859][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:22:47,351][06909] Updated weights for policy 0, policy_version 224703 (0.0033) [2024-06-28 12:22:48,850][06674] Fps is (10 sec: 47513.4, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3681599488. Throughput: 0: 44205.7. Samples: 3584512640. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 12:22:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:22:50,911][06909] Updated weights for policy 0, policy_version 224713 (0.0035) [2024-06-28 12:22:53,850][06674] Fps is (10 sec: 42601.3, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3681796096. Throughput: 0: 44184.5. Samples: 3584779740. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 12:22:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:22:54,628][06909] Updated weights for policy 0, policy_version 224723 (0.0026) [2024-06-28 12:22:58,111][06909] Updated weights for policy 0, policy_version 224733 (0.0028) [2024-06-28 12:22:58,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44238.3, 300 sec: 44042.7). Total num frames: 3682041856. Throughput: 0: 44209.8. Samples: 3584914320. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 12:22:58,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:23:02,179][06909] Updated weights for policy 0, policy_version 224743 (0.0039) [2024-06-28 12:23:03,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3682254848. Throughput: 0: 44276.3. Samples: 3585176740. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 12:23:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:23:05,362][06909] Updated weights for policy 0, policy_version 224753 (0.0027) [2024-06-28 12:23:08,850][06674] Fps is (10 sec: 42597.8, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 3682467840. Throughput: 0: 44185.7. Samples: 3585442180. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 12:23:08,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:23:09,838][06909] Updated weights for policy 0, policy_version 224763 (0.0036) [2024-06-28 12:23:12,022][06887] Signal inference workers to stop experience collection... (50700 times) [2024-06-28 12:23:12,044][06909] InferenceWorker_p0-w0: stopping experience collection (50700 times) [2024-06-28 12:23:12,080][06887] Signal inference workers to resume experience collection... (50700 times) [2024-06-28 12:23:12,081][06909] InferenceWorker_p0-w0: resuming experience collection (50700 times) [2024-06-28 12:23:13,370][06909] Updated weights for policy 0, policy_version 224773 (0.0027) [2024-06-28 12:23:13,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3682697216. Throughput: 0: 44210.3. Samples: 3585573060. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 12:23:13,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:23:17,209][06909] Updated weights for policy 0, policy_version 224783 (0.0026) [2024-06-28 12:23:18,850][06674] Fps is (10 sec: 44237.5, 60 sec: 43965.3, 300 sec: 43986.9). Total num frames: 3682910208. Throughput: 0: 44282.0. Samples: 3585835100. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 12:23:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:23:20,524][06909] Updated weights for policy 0, policy_version 224793 (0.0040) [2024-06-28 12:23:23,850][06674] Fps is (10 sec: 42598.9, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 3683123200. Throughput: 0: 44079.6. Samples: 3586096620. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 12:23:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:23:25,013][06909] Updated weights for policy 0, policy_version 224803 (0.0029) [2024-06-28 12:23:27,930][06909] Updated weights for policy 0, policy_version 224813 (0.0031) [2024-06-28 12:23:28,852][06674] Fps is (10 sec: 44227.5, 60 sec: 43962.3, 300 sec: 44042.1). Total num frames: 3683352576. Throughput: 0: 43953.7. Samples: 3586228320. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 12:23:28,852][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:23:32,254][06909] Updated weights for policy 0, policy_version 224823 (0.0022) [2024-06-28 12:23:33,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44514.3, 300 sec: 43986.9). Total num frames: 3683581952. Throughput: 0: 44219.1. Samples: 3586502500. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 12:23:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:23:35,070][06909] Updated weights for policy 0, policy_version 224833 (0.0025) [2024-06-28 12:23:38,850][06674] Fps is (10 sec: 42606.6, 60 sec: 44236.7, 300 sec: 43986.8). Total num frames: 3683778560. Throughput: 0: 44018.5. Samples: 3586760580. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 12:23:38,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:23:39,611][06909] Updated weights for policy 0, policy_version 224843 (0.0033) [2024-06-28 12:23:42,472][06909] Updated weights for policy 0, policy_version 224853 (0.0024) [2024-06-28 12:23:43,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43964.2, 300 sec: 43986.9). Total num frames: 3684007936. Throughput: 0: 44013.3. Samples: 3586894920. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 12:23:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:23:47,077][06909] Updated weights for policy 0, policy_version 224863 (0.0027) [2024-06-28 12:23:48,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43690.7, 300 sec: 43931.3). Total num frames: 3684220928. Throughput: 0: 44151.0. Samples: 3587163540. Policy #0 lag: (min: 0.0, avg: 9.6, max: 22.0) [2024-06-28 12:23:48,854][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:23:48,861][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000224868_3684237312.pth... [2024-06-28 12:23:48,910][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000224223_3673669632.pth [2024-06-28 12:23:50,235][06909] Updated weights for policy 0, policy_version 224873 (0.0031) [2024-06-28 12:23:53,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.7, 300 sec: 44043.3). Total num frames: 3684450304. Throughput: 0: 44120.5. Samples: 3587427600. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 12:23:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 12:23:54,763][06909] Updated weights for policy 0, policy_version 224883 (0.0032) [2024-06-28 12:23:57,368][06909] Updated weights for policy 0, policy_version 224893 (0.0036) [2024-06-28 12:23:58,850][06674] Fps is (10 sec: 45875.6, 60 sec: 43963.7, 300 sec: 44043.3). Total num frames: 3684679680. Throughput: 0: 44109.4. Samples: 3587557980. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 12:23:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:24:02,144][06909] Updated weights for policy 0, policy_version 224903 (0.0040) [2024-06-28 12:24:03,850][06674] Fps is (10 sec: 45875.8, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3684909056. Throughput: 0: 44135.1. Samples: 3587821180. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 12:24:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:24:04,963][06909] Updated weights for policy 0, policy_version 224913 (0.0036) [2024-06-28 12:24:08,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3685105664. Throughput: 0: 44302.2. Samples: 3588090220. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 12:24:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:24:09,394][06909] Updated weights for policy 0, policy_version 224923 (0.0032) [2024-06-28 12:24:12,430][06909] Updated weights for policy 0, policy_version 224933 (0.0026) [2024-06-28 12:24:13,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3685318656. Throughput: 0: 44162.0. Samples: 3588215520. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 12:24:13,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:24:16,945][06909] Updated weights for policy 0, policy_version 224943 (0.0020) [2024-06-28 12:24:18,850][06674] Fps is (10 sec: 47513.3, 60 sec: 44509.8, 300 sec: 44097.9). Total num frames: 3685580800. Throughput: 0: 44036.9. Samples: 3588484160. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 12:24:18,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 12:24:19,617][06909] Updated weights for policy 0, policy_version 224953 (0.0030) [2024-06-28 12:24:23,850][06674] Fps is (10 sec: 45874.6, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 3685777408. Throughput: 0: 44260.5. Samples: 3588752300. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 12:24:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:24:24,191][06909] Updated weights for policy 0, policy_version 224963 (0.0037) [2024-06-28 12:24:24,597][06887] Signal inference workers to stop experience collection... (50750 times) [2024-06-28 12:24:24,635][06909] InferenceWorker_p0-w0: stopping experience collection (50750 times) [2024-06-28 12:24:24,643][06887] Signal inference workers to resume experience collection... (50750 times) [2024-06-28 12:24:24,653][06909] InferenceWorker_p0-w0: resuming experience collection (50750 times) [2024-06-28 12:24:27,168][06909] Updated weights for policy 0, policy_version 224973 (0.0029) [2024-06-28 12:24:28,850][06674] Fps is (10 sec: 42598.9, 60 sec: 44238.4, 300 sec: 44098.0). Total num frames: 3686006784. Throughput: 0: 44158.3. Samples: 3588882040. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 12:24:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:24:31,691][06909] Updated weights for policy 0, policy_version 224983 (0.0038) [2024-06-28 12:24:33,850][06674] Fps is (10 sec: 45876.0, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 3686236160. Throughput: 0: 44193.9. Samples: 3589152260. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 12:24:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:24:34,304][06909] Updated weights for policy 0, policy_version 224993 (0.0046) [2024-06-28 12:24:38,850][06674] Fps is (10 sec: 42598.3, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 3686432768. Throughput: 0: 44255.7. Samples: 3589419100. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 12:24:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:24:38,904][06909] Updated weights for policy 0, policy_version 225003 (0.0030) [2024-06-28 12:24:42,043][06909] Updated weights for policy 0, policy_version 225013 (0.0038) [2024-06-28 12:24:43,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 3686678528. Throughput: 0: 44193.8. Samples: 3589546700. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 12:24:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:24:46,462][06909] Updated weights for policy 0, policy_version 225023 (0.0032) [2024-06-28 12:24:48,850][06674] Fps is (10 sec: 47513.6, 60 sec: 44783.0, 300 sec: 44098.0). Total num frames: 3686907904. Throughput: 0: 44323.1. Samples: 3589815720. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 12:24:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:24:49,197][06909] Updated weights for policy 0, policy_version 225033 (0.0029) [2024-06-28 12:24:53,765][06909] Updated weights for policy 0, policy_version 225043 (0.0039) [2024-06-28 12:24:53,850][06674] Fps is (10 sec: 42598.4, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3687104512. Throughput: 0: 44141.3. Samples: 3590076580. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 12:24:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:24:57,089][06909] Updated weights for policy 0, policy_version 225053 (0.0035) [2024-06-28 12:24:58,850][06674] Fps is (10 sec: 42597.9, 60 sec: 44236.8, 300 sec: 44098.3). Total num frames: 3687333888. Throughput: 0: 44168.4. Samples: 3590203100. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 12:24:58,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 12:25:01,154][06909] Updated weights for policy 0, policy_version 225063 (0.0031) [2024-06-28 12:25:03,850][06674] Fps is (10 sec: 45875.4, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 3687563264. Throughput: 0: 44311.2. Samples: 3590478160. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 12:25:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:25:04,273][06909] Updated weights for policy 0, policy_version 225073 (0.0030) [2024-06-28 12:25:08,780][06909] Updated weights for policy 0, policy_version 225083 (0.0036) [2024-06-28 12:25:08,853][06674] Fps is (10 sec: 42586.0, 60 sec: 44234.6, 300 sec: 44042.0). Total num frames: 3687759872. Throughput: 0: 44041.2. Samples: 3590734280. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 12:25:08,853][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:25:12,051][06909] Updated weights for policy 0, policy_version 225093 (0.0027) [2024-06-28 12:25:13,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44782.9, 300 sec: 44153.5). Total num frames: 3688005632. Throughput: 0: 43965.2. Samples: 3590860480. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 12:25:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:25:16,066][06909] Updated weights for policy 0, policy_version 225103 (0.0037) [2024-06-28 12:25:18,850][06674] Fps is (10 sec: 47527.2, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 3688235008. Throughput: 0: 43968.8. Samples: 3591130860. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 12:25:18,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 12:25:19,316][06909] Updated weights for policy 0, policy_version 225113 (0.0021) [2024-06-28 12:25:23,649][06909] Updated weights for policy 0, policy_version 225123 (0.0033) [2024-06-28 12:25:23,850][06674] Fps is (10 sec: 42598.2, 60 sec: 44236.8, 300 sec: 44098.8). Total num frames: 3688431616. Throughput: 0: 43848.8. Samples: 3591392300. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 12:25:23,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:25:26,745][06909] Updated weights for policy 0, policy_version 225133 (0.0025) [2024-06-28 12:25:28,850][06674] Fps is (10 sec: 40960.4, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 3688644608. Throughput: 0: 43773.4. Samples: 3591516500. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 12:25:28,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:25:30,975][06909] Updated weights for policy 0, policy_version 225143 (0.0029) [2024-06-28 12:25:33,850][06674] Fps is (10 sec: 44237.3, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 3688873984. Throughput: 0: 43803.1. Samples: 3591786860. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 12:25:33,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 12:25:34,418][06909] Updated weights for policy 0, policy_version 225153 (0.0028) [2024-06-28 12:25:38,602][06909] Updated weights for policy 0, policy_version 225163 (0.0027) [2024-06-28 12:25:38,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3689070592. Throughput: 0: 43894.7. Samples: 3592051840. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 12:25:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:25:41,783][06909] Updated weights for policy 0, policy_version 225173 (0.0021) [2024-06-28 12:25:42,619][06887] Signal inference workers to stop experience collection... (50800 times) [2024-06-28 12:25:42,626][06887] Signal inference workers to resume experience collection... (50800 times) [2024-06-28 12:25:42,663][06909] InferenceWorker_p0-w0: stopping experience collection (50800 times) [2024-06-28 12:25:42,663][06909] InferenceWorker_p0-w0: resuming experience collection (50800 times) [2024-06-28 12:25:43,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 3689316352. Throughput: 0: 43829.3. Samples: 3592175420. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 12:25:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:25:45,803][06909] Updated weights for policy 0, policy_version 225183 (0.0031) [2024-06-28 12:25:48,850][06674] Fps is (10 sec: 47513.8, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 3689545728. Throughput: 0: 43799.1. Samples: 3592449120. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 12:25:48,850][06674] Avg episode reward: [(0, '0.489')] [2024-06-28 12:25:48,866][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000225192_3689545728.pth... [2024-06-28 12:25:48,910][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000224546_3678961664.pth [2024-06-28 12:25:49,372][06909] Updated weights for policy 0, policy_version 225193 (0.0029) [2024-06-28 12:25:53,546][06909] Updated weights for policy 0, policy_version 225203 (0.0040) [2024-06-28 12:25:53,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43690.7, 300 sec: 43986.9). Total num frames: 3689725952. Throughput: 0: 43829.6. Samples: 3592706480. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 12:25:53,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:25:56,705][06909] Updated weights for policy 0, policy_version 225213 (0.0030) [2024-06-28 12:25:58,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.7, 300 sec: 44097.9). Total num frames: 3689955328. Throughput: 0: 43848.0. Samples: 3592833640. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 12:25:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:26:00,832][06909] Updated weights for policy 0, policy_version 225223 (0.0037) [2024-06-28 12:26:03,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 3690184704. Throughput: 0: 43871.2. Samples: 3593105060. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 12:26:03,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 12:26:04,303][06909] Updated weights for policy 0, policy_version 225233 (0.0043) [2024-06-28 12:26:08,273][06909] Updated weights for policy 0, policy_version 225243 (0.0042) [2024-06-28 12:26:08,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43692.7, 300 sec: 44042.4). Total num frames: 3690381312. Throughput: 0: 43664.4. Samples: 3593357200. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 12:26:08,859][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:26:11,787][06909] Updated weights for policy 0, policy_version 225253 (0.0032) [2024-06-28 12:26:13,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 3690627072. Throughput: 0: 43864.4. Samples: 3593490400. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 12:26:13,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 12:26:15,978][06909] Updated weights for policy 0, policy_version 225263 (0.0034) [2024-06-28 12:26:18,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43417.6, 300 sec: 43986.9). Total num frames: 3690840064. Throughput: 0: 43825.3. Samples: 3593759000. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 12:26:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:26:19,343][06909] Updated weights for policy 0, policy_version 225273 (0.0030) [2024-06-28 12:26:23,454][06909] Updated weights for policy 0, policy_version 225283 (0.0031) [2024-06-28 12:26:23,856][06674] Fps is (10 sec: 40935.2, 60 sec: 43413.3, 300 sec: 43930.4). Total num frames: 3691036672. Throughput: 0: 43792.3. Samples: 3594022760. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 12:26:23,857][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:26:26,834][06909] Updated weights for policy 0, policy_version 225293 (0.0040) [2024-06-28 12:26:28,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3691282432. Throughput: 0: 43737.9. Samples: 3594143620. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 12:26:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:26:30,825][06909] Updated weights for policy 0, policy_version 225303 (0.0037) [2024-06-28 12:26:33,850][06674] Fps is (10 sec: 47542.8, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 3691511808. Throughput: 0: 43795.2. Samples: 3594419900. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 12:26:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:26:34,085][06909] Updated weights for policy 0, policy_version 225313 (0.0030) [2024-06-28 12:26:38,319][06909] Updated weights for policy 0, policy_version 225323 (0.0040) [2024-06-28 12:26:38,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3691708416. Throughput: 0: 43889.7. Samples: 3594681520. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 12:26:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:26:41,587][06909] Updated weights for policy 0, policy_version 225333 (0.0027) [2024-06-28 12:26:43,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 3691937792. Throughput: 0: 43905.3. Samples: 3594809380. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 12:26:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:26:45,922][06909] Updated weights for policy 0, policy_version 225343 (0.0033) [2024-06-28 12:26:48,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43690.6, 300 sec: 44097.9). Total num frames: 3692167168. Throughput: 0: 43759.5. Samples: 3595074240. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 12:26:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:26:48,983][06909] Updated weights for policy 0, policy_version 225353 (0.0025) [2024-06-28 12:26:53,558][06909] Updated weights for policy 0, policy_version 225363 (0.0033) [2024-06-28 12:26:53,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43690.7, 300 sec: 43931.6). Total num frames: 3692347392. Throughput: 0: 44230.4. Samples: 3595347560. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 12:26:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:26:56,299][06909] Updated weights for policy 0, policy_version 225373 (0.0025) [2024-06-28 12:26:58,850][06674] Fps is (10 sec: 42598.0, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3692593152. Throughput: 0: 43972.4. Samples: 3595469160. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 12:26:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:27:00,789][06909] Updated weights for policy 0, policy_version 225383 (0.0023) [2024-06-28 12:27:03,653][06909] Updated weights for policy 0, policy_version 225393 (0.0042) [2024-06-28 12:27:03,850][06674] Fps is (10 sec: 49151.6, 60 sec: 44236.7, 300 sec: 44153.5). Total num frames: 3692838912. Throughput: 0: 43906.2. Samples: 3595734780. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 12:27:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:27:05,360][06887] Signal inference workers to stop experience collection... (50850 times) [2024-06-28 12:27:05,408][06909] InferenceWorker_p0-w0: stopping experience collection (50850 times) [2024-06-28 12:27:05,416][06887] Signal inference workers to resume experience collection... (50850 times) [2024-06-28 12:27:05,431][06909] InferenceWorker_p0-w0: resuming experience collection (50850 times) [2024-06-28 12:27:08,008][06909] Updated weights for policy 0, policy_version 225403 (0.0028) [2024-06-28 12:27:08,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 3693019136. Throughput: 0: 44060.1. Samples: 3596005200. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 12:27:08,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:27:11,027][06909] Updated weights for policy 0, policy_version 225413 (0.0035) [2024-06-28 12:27:13,856][06674] Fps is (10 sec: 40935.1, 60 sec: 43686.2, 300 sec: 43986.3). Total num frames: 3693248512. Throughput: 0: 44183.3. Samples: 3596132140. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 12:27:13,857][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:27:15,587][06909] Updated weights for policy 0, policy_version 225423 (0.0034) [2024-06-28 12:27:18,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 3693477888. Throughput: 0: 43978.1. Samples: 3596398920. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-28 12:27:18,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:27:19,086][06909] Updated weights for policy 0, policy_version 225433 (0.0036) [2024-06-28 12:27:23,417][06909] Updated weights for policy 0, policy_version 225443 (0.0037) [2024-06-28 12:27:23,850][06674] Fps is (10 sec: 42624.0, 60 sec: 43968.1, 300 sec: 43931.3). Total num frames: 3693674496. Throughput: 0: 44147.1. Samples: 3596668140. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-28 12:27:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:27:26,337][06909] Updated weights for policy 0, policy_version 225453 (0.0028) [2024-06-28 12:27:28,850][06674] Fps is (10 sec: 44236.9, 60 sec: 43963.7, 300 sec: 44098.9). Total num frames: 3693920256. Throughput: 0: 44022.6. Samples: 3596790400. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-28 12:27:28,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:27:30,662][06909] Updated weights for policy 0, policy_version 225463 (0.0034) [2024-06-28 12:27:33,561][06909] Updated weights for policy 0, policy_version 225473 (0.0036) [2024-06-28 12:27:33,850][06674] Fps is (10 sec: 47513.8, 60 sec: 43963.6, 300 sec: 44153.5). Total num frames: 3694149632. Throughput: 0: 44001.7. Samples: 3597054320. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-28 12:27:33,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:27:37,873][06909] Updated weights for policy 0, policy_version 225483 (0.0032) [2024-06-28 12:27:38,850][06674] Fps is (10 sec: 40960.3, 60 sec: 43690.7, 300 sec: 43931.4). Total num frames: 3694329856. Throughput: 0: 43945.3. Samples: 3597325100. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-28 12:27:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:27:41,058][06909] Updated weights for policy 0, policy_version 225493 (0.0036) [2024-06-28 12:27:43,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3694575616. Throughput: 0: 44009.4. Samples: 3597449580. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-28 12:27:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:27:45,483][06909] Updated weights for policy 0, policy_version 225503 (0.0027) [2024-06-28 12:27:48,169][06909] Updated weights for policy 0, policy_version 225513 (0.0029) [2024-06-28 12:27:48,850][06674] Fps is (10 sec: 47514.0, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 3694804992. Throughput: 0: 44040.1. Samples: 3597716580. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-28 12:27:48,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:27:48,875][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000225514_3694821376.pth... [2024-06-28 12:27:48,926][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000224868_3684237312.pth [2024-06-28 12:27:52,635][06909] Updated weights for policy 0, policy_version 225523 (0.0029) [2024-06-28 12:27:53,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44509.9, 300 sec: 43986.9). Total num frames: 3695017984. Throughput: 0: 44224.5. Samples: 3597995300. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-28 12:27:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:27:55,939][06909] Updated weights for policy 0, policy_version 225533 (0.0026) [2024-06-28 12:27:58,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 3695247360. Throughput: 0: 44267.0. Samples: 3598123880. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-28 12:27:58,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 12:28:00,336][06909] Updated weights for policy 0, policy_version 225543 (0.0035) [2024-06-28 12:28:03,275][06909] Updated weights for policy 0, policy_version 225553 (0.0026) [2024-06-28 12:28:03,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 3695460352. Throughput: 0: 44058.0. Samples: 3598381520. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-28 12:28:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:28:07,879][06909] Updated weights for policy 0, policy_version 225563 (0.0040) [2024-06-28 12:28:08,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 3695689728. Throughput: 0: 44181.0. Samples: 3598656280. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-28 12:28:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:28:10,854][06909] Updated weights for policy 0, policy_version 225573 (0.0022) [2024-06-28 12:28:13,850][06674] Fps is (10 sec: 44236.2, 60 sec: 44241.3, 300 sec: 44042.4). Total num frames: 3695902720. Throughput: 0: 44203.1. Samples: 3598779540. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-28 12:28:13,862][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:28:15,089][06909] Updated weights for policy 0, policy_version 225583 (0.0021) [2024-06-28 12:28:15,777][06887] Signal inference workers to stop experience collection... (50900 times) [2024-06-28 12:28:15,832][06887] Signal inference workers to resume experience collection... (50900 times) [2024-06-28 12:28:15,833][06909] InferenceWorker_p0-w0: stopping experience collection (50900 times) [2024-06-28 12:28:15,858][06909] InferenceWorker_p0-w0: resuming experience collection (50900 times) [2024-06-28 12:28:18,135][06909] Updated weights for policy 0, policy_version 225593 (0.0038) [2024-06-28 12:28:18,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 3696132096. Throughput: 0: 44229.9. Samples: 3599044660. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-28 12:28:18,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:28:22,767][06909] Updated weights for policy 0, policy_version 225603 (0.0033) [2024-06-28 12:28:23,850][06674] Fps is (10 sec: 44237.3, 60 sec: 44510.0, 300 sec: 44042.7). Total num frames: 3696345088. Throughput: 0: 44230.3. Samples: 3599315460. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-28 12:28:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:28:25,967][06909] Updated weights for policy 0, policy_version 225613 (0.0032) [2024-06-28 12:28:28,850][06674] Fps is (10 sec: 42598.3, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3696558080. Throughput: 0: 44288.9. Samples: 3599442580. Policy #0 lag: (min: 0.0, avg: 12.0, max: 21.0) [2024-06-28 12:28:28,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:28:30,031][06909] Updated weights for policy 0, policy_version 225623 (0.0031) [2024-06-28 12:28:33,071][06909] Updated weights for policy 0, policy_version 225633 (0.0027) [2024-06-28 12:28:33,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 3696787456. Throughput: 0: 44199.9. Samples: 3599705580. Policy #0 lag: (min: 0.0, avg: 12.0, max: 21.0) [2024-06-28 12:28:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:28:37,686][06909] Updated weights for policy 0, policy_version 225643 (0.0034) [2024-06-28 12:28:38,850][06674] Fps is (10 sec: 45874.9, 60 sec: 44782.9, 300 sec: 44097.9). Total num frames: 3697016832. Throughput: 0: 44026.1. Samples: 3599976480. Policy #0 lag: (min: 0.0, avg: 12.0, max: 21.0) [2024-06-28 12:28:38,851][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:28:40,694][06909] Updated weights for policy 0, policy_version 225653 (0.0030) [2024-06-28 12:28:43,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3697213440. Throughput: 0: 44076.4. Samples: 3600107320. Policy #0 lag: (min: 0.0, avg: 12.0, max: 21.0) [2024-06-28 12:28:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:28:44,857][06909] Updated weights for policy 0, policy_version 225663 (0.0037) [2024-06-28 12:28:47,908][06909] Updated weights for policy 0, policy_version 225673 (0.0038) [2024-06-28 12:28:48,850][06674] Fps is (10 sec: 44237.2, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 3697459200. Throughput: 0: 44199.5. Samples: 3600370500. Policy #0 lag: (min: 0.0, avg: 12.0, max: 21.0) [2024-06-28 12:28:48,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:28:52,103][06909] Updated weights for policy 0, policy_version 225683 (0.0034) [2024-06-28 12:28:53,850][06674] Fps is (10 sec: 45875.2, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3697672192. Throughput: 0: 44053.4. Samples: 3600638680. Policy #0 lag: (min: 0.0, avg: 12.0, max: 21.0) [2024-06-28 12:28:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:28:55,056][06909] Updated weights for policy 0, policy_version 225693 (0.0033) [2024-06-28 12:28:58,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.7, 300 sec: 44042.4). Total num frames: 3697901568. Throughput: 0: 44298.7. Samples: 3600772980. Policy #0 lag: (min: 0.0, avg: 12.0, max: 21.0) [2024-06-28 12:28:58,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:28:59,431][06909] Updated weights for policy 0, policy_version 225703 (0.0038) [2024-06-28 12:29:02,437][06909] Updated weights for policy 0, policy_version 225713 (0.0020) [2024-06-28 12:29:03,850][06674] Fps is (10 sec: 45874.7, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 3698130944. Throughput: 0: 44456.3. Samples: 3601045200. Policy #0 lag: (min: 0.0, avg: 12.0, max: 21.0) [2024-06-28 12:29:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:29:06,609][06909] Updated weights for policy 0, policy_version 225723 (0.0025) [2024-06-28 12:29:08,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44509.8, 300 sec: 44209.0). Total num frames: 3698360320. Throughput: 0: 44330.6. Samples: 3601310340. Policy #0 lag: (min: 0.0, avg: 12.0, max: 21.0) [2024-06-28 12:29:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:29:09,947][06909] Updated weights for policy 0, policy_version 225733 (0.0035) [2024-06-28 12:29:13,850][06674] Fps is (10 sec: 42598.1, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 3698556928. Throughput: 0: 44418.5. Samples: 3601441420. Policy #0 lag: (min: 0.0, avg: 12.0, max: 21.0) [2024-06-28 12:29:13,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:29:14,120][06909] Updated weights for policy 0, policy_version 225743 (0.0037) [2024-06-28 12:29:17,142][06909] Updated weights for policy 0, policy_version 225753 (0.0027) [2024-06-28 12:29:18,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 3698802688. Throughput: 0: 44541.3. Samples: 3601709940. Policy #0 lag: (min: 0.0, avg: 12.0, max: 21.0) [2024-06-28 12:29:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:29:21,671][06909] Updated weights for policy 0, policy_version 225763 (0.0026) [2024-06-28 12:29:23,850][06674] Fps is (10 sec: 45875.8, 60 sec: 44509.8, 300 sec: 44097.9). Total num frames: 3699015680. Throughput: 0: 44440.1. Samples: 3601976280. Policy #0 lag: (min: 0.0, avg: 12.0, max: 21.0) [2024-06-28 12:29:23,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:29:24,746][06909] Updated weights for policy 0, policy_version 225773 (0.0031) [2024-06-28 12:29:28,850][06674] Fps is (10 sec: 40960.1, 60 sec: 44236.8, 300 sec: 43986.9). Total num frames: 3699212288. Throughput: 0: 44505.7. Samples: 3602110080. Policy #0 lag: (min: 0.0, avg: 12.0, max: 21.0) [2024-06-28 12:29:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:29:28,933][06909] Updated weights for policy 0, policy_version 225783 (0.0031) [2024-06-28 12:29:32,180][06909] Updated weights for policy 0, policy_version 225793 (0.0031) [2024-06-28 12:29:33,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 3699458048. Throughput: 0: 44457.8. Samples: 3602371100. Policy #0 lag: (min: 0.0, avg: 12.0, max: 21.0) [2024-06-28 12:29:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:29:36,151][06909] Updated weights for policy 0, policy_version 225803 (0.0036) [2024-06-28 12:29:38,852][06674] Fps is (10 sec: 45865.6, 60 sec: 44235.3, 300 sec: 44042.1). Total num frames: 3699671040. Throughput: 0: 44560.1. Samples: 3602643980. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 12:29:38,853][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:29:39,649][06909] Updated weights for policy 0, policy_version 225813 (0.0036) [2024-06-28 12:29:40,147][06887] Signal inference workers to stop experience collection... (50950 times) [2024-06-28 12:29:40,171][06909] InferenceWorker_p0-w0: stopping experience collection (50950 times) [2024-06-28 12:29:40,203][06887] Signal inference workers to resume experience collection... (50950 times) [2024-06-28 12:29:40,204][06909] InferenceWorker_p0-w0: resuming experience collection (50950 times) [2024-06-28 12:29:43,850][06674] Fps is (10 sec: 40959.2, 60 sec: 44236.7, 300 sec: 43931.3). Total num frames: 3699867648. Throughput: 0: 44218.1. Samples: 3602762800. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 12:29:43,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:29:43,982][06909] Updated weights for policy 0, policy_version 225823 (0.0032) [2024-06-28 12:29:47,294][06909] Updated weights for policy 0, policy_version 225833 (0.0037) [2024-06-28 12:29:48,850][06674] Fps is (10 sec: 44245.8, 60 sec: 44236.7, 300 sec: 44097.9). Total num frames: 3700113408. Throughput: 0: 44062.6. Samples: 3603028020. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 12:29:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:29:48,903][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000225838_3700129792.pth... [2024-06-28 12:29:48,944][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000225192_3689545728.pth [2024-06-28 12:29:51,417][06909] Updated weights for policy 0, policy_version 225843 (0.0023) [2024-06-28 12:29:53,850][06674] Fps is (10 sec: 45875.8, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3700326400. Throughput: 0: 44033.0. Samples: 3603291820. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 12:29:53,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 12:29:54,659][06909] Updated weights for policy 0, policy_version 225853 (0.0032) [2024-06-28 12:29:58,850][06674] Fps is (10 sec: 40959.7, 60 sec: 43690.6, 300 sec: 43931.3). Total num frames: 3700523008. Throughput: 0: 43998.6. Samples: 3603421360. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 12:29:58,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:29:59,081][06909] Updated weights for policy 0, policy_version 225863 (0.0045) [2024-06-28 12:30:02,066][06909] Updated weights for policy 0, policy_version 225873 (0.0032) [2024-06-28 12:30:03,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.8, 300 sec: 44098.4). Total num frames: 3700768768. Throughput: 0: 43833.8. Samples: 3603682460. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 12:30:03,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:30:06,381][06909] Updated weights for policy 0, policy_version 225883 (0.0047) [2024-06-28 12:30:08,850][06674] Fps is (10 sec: 45875.4, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 3700981760. Throughput: 0: 43881.7. Samples: 3603950960. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 12:30:08,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:30:09,574][06909] Updated weights for policy 0, policy_version 225893 (0.0035) [2024-06-28 12:30:13,586][06909] Updated weights for policy 0, policy_version 225903 (0.0038) [2024-06-28 12:30:13,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 3701194752. Throughput: 0: 43660.0. Samples: 3604074780. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 12:30:13,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 12:30:17,149][06909] Updated weights for policy 0, policy_version 225913 (0.0028) [2024-06-28 12:30:18,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 3701440512. Throughput: 0: 43809.6. Samples: 3604342540. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 12:30:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:30:21,133][06909] Updated weights for policy 0, policy_version 225923 (0.0031) [2024-06-28 12:30:23,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43690.6, 300 sec: 44042.4). Total num frames: 3701637120. Throughput: 0: 43599.8. Samples: 3604605880. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 12:30:23,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:30:24,654][06909] Updated weights for policy 0, policy_version 225933 (0.0029) [2024-06-28 12:30:28,837][06909] Updated weights for policy 0, policy_version 225943 (0.0034) [2024-06-28 12:30:28,850][06674] Fps is (10 sec: 40960.2, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3701850112. Throughput: 0: 43879.2. Samples: 3604737360. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 12:30:28,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:30:32,133][06909] Updated weights for policy 0, policy_version 225953 (0.0038) [2024-06-28 12:30:33,850][06674] Fps is (10 sec: 45875.5, 60 sec: 43963.7, 300 sec: 44153.5). Total num frames: 3702095872. Throughput: 0: 44033.0. Samples: 3605009500. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 12:30:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:30:36,075][06909] Updated weights for policy 0, policy_version 225963 (0.0041) [2024-06-28 12:30:38,850][06674] Fps is (10 sec: 44236.1, 60 sec: 43692.0, 300 sec: 43986.8). Total num frames: 3702292480. Throughput: 0: 43953.6. Samples: 3605269740. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 12:30:38,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:30:39,568][06909] Updated weights for policy 0, policy_version 225973 (0.0031) [2024-06-28 12:30:43,317][06909] Updated weights for policy 0, policy_version 225983 (0.0031) [2024-06-28 12:30:43,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44509.9, 300 sec: 44042.4). Total num frames: 3702538240. Throughput: 0: 43969.4. Samples: 3605399980. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 12:30:43,853][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:30:46,893][06909] Updated weights for policy 0, policy_version 225993 (0.0040) [2024-06-28 12:30:48,850][06674] Fps is (10 sec: 45876.2, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 3702751232. Throughput: 0: 44100.5. Samples: 3605666980. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 12:30:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:30:50,829][06909] Updated weights for policy 0, policy_version 226003 (0.0021) [2024-06-28 12:30:53,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 3702964224. Throughput: 0: 43991.2. Samples: 3605930560. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 12:30:53,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:30:54,379][06909] Updated weights for policy 0, policy_version 226013 (0.0042) [2024-06-28 12:30:58,281][06909] Updated weights for policy 0, policy_version 226023 (0.0029) [2024-06-28 12:30:58,852][06674] Fps is (10 sec: 44227.6, 60 sec: 44508.4, 300 sec: 44097.6). Total num frames: 3703193600. Throughput: 0: 44108.2. Samples: 3606059740. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 12:30:58,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:31:01,980][06909] Updated weights for policy 0, policy_version 226033 (0.0037) [2024-06-28 12:31:02,995][06887] Signal inference workers to stop experience collection... (51000 times) [2024-06-28 12:31:02,995][06887] Signal inference workers to resume experience collection... (51000 times) [2024-06-28 12:31:03,040][06909] InferenceWorker_p0-w0: stopping experience collection (51000 times) [2024-06-28 12:31:03,040][06909] InferenceWorker_p0-w0: resuming experience collection (51000 times) [2024-06-28 12:31:03,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.8, 300 sec: 44153.5). Total num frames: 3703406592. Throughput: 0: 44220.1. Samples: 3606332440. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 12:31:03,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:31:05,826][06909] Updated weights for policy 0, policy_version 226043 (0.0042) [2024-06-28 12:31:08,850][06674] Fps is (10 sec: 42607.3, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3703619584. Throughput: 0: 44101.8. Samples: 3606590460. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 12:31:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:31:09,288][06909] Updated weights for policy 0, policy_version 226053 (0.0035) [2024-06-28 12:31:13,091][06909] Updated weights for policy 0, policy_version 226063 (0.0037) [2024-06-28 12:31:13,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 3703848960. Throughput: 0: 44048.6. Samples: 3606719540. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 12:31:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:31:16,511][06909] Updated weights for policy 0, policy_version 226073 (0.0035) [2024-06-28 12:31:18,850][06674] Fps is (10 sec: 45875.2, 60 sec: 43963.8, 300 sec: 44209.9). Total num frames: 3704078336. Throughput: 0: 43992.0. Samples: 3606989140. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 12:31:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:31:20,542][06909] Updated weights for policy 0, policy_version 226083 (0.0040) [2024-06-28 12:31:23,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 3704291328. Throughput: 0: 44007.7. Samples: 3607250080. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 12:31:23,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 12:31:23,966][06909] Updated weights for policy 0, policy_version 226093 (0.0030) [2024-06-28 12:31:28,019][06909] Updated weights for policy 0, policy_version 226103 (0.0026) [2024-06-28 12:31:28,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44236.9, 300 sec: 44042.4). Total num frames: 3704504320. Throughput: 0: 43960.5. Samples: 3607378200. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 12:31:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:31:31,760][06909] Updated weights for policy 0, policy_version 226113 (0.0043) [2024-06-28 12:31:33,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 3704717312. Throughput: 0: 44056.1. Samples: 3607649500. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 12:31:33,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:31:35,316][06909] Updated weights for policy 0, policy_version 226123 (0.0038) [2024-06-28 12:31:38,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44237.0, 300 sec: 44098.0). Total num frames: 3704946688. Throughput: 0: 43941.0. Samples: 3607907900. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 12:31:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:31:38,965][06909] Updated weights for policy 0, policy_version 226133 (0.0032) [2024-06-28 12:31:43,025][06909] Updated weights for policy 0, policy_version 226143 (0.0036) [2024-06-28 12:31:43,850][06674] Fps is (10 sec: 45874.8, 60 sec: 43963.7, 300 sec: 44097.9). Total num frames: 3705176064. Throughput: 0: 44017.1. Samples: 3608040420. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 12:31:43,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:31:46,830][06909] Updated weights for policy 0, policy_version 226153 (0.0040) [2024-06-28 12:31:48,850][06674] Fps is (10 sec: 42597.9, 60 sec: 43690.6, 300 sec: 44153.5). Total num frames: 3705372672. Throughput: 0: 43759.5. Samples: 3608301620. Policy #0 lag: (min: 0.0, avg: 9.7, max: 21.0) [2024-06-28 12:31:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:31:48,862][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000226158_3705372672.pth... [2024-06-28 12:31:48,919][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000225514_3694821376.pth [2024-06-28 12:31:50,275][06909] Updated weights for policy 0, policy_version 226163 (0.0024) [2024-06-28 12:31:53,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 3705602048. Throughput: 0: 43822.6. Samples: 3608562480. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 12:31:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:31:54,232][06909] Updated weights for policy 0, policy_version 226173 (0.0027) [2024-06-28 12:31:58,043][06909] Updated weights for policy 0, policy_version 226183 (0.0038) [2024-06-28 12:31:58,850][06674] Fps is (10 sec: 45875.7, 60 sec: 43965.3, 300 sec: 44042.4). Total num frames: 3705831424. Throughput: 0: 43942.2. Samples: 3608696940. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 12:31:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:32:01,442][06909] Updated weights for policy 0, policy_version 226193 (0.0038) [2024-06-28 12:32:03,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43690.7, 300 sec: 44098.0). Total num frames: 3706028032. Throughput: 0: 43711.2. Samples: 3608956140. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 12:32:03,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:32:05,386][06909] Updated weights for policy 0, policy_version 226203 (0.0039) [2024-06-28 12:32:08,850][06674] Fps is (10 sec: 42598.1, 60 sec: 43963.7, 300 sec: 44098.9). Total num frames: 3706257408. Throughput: 0: 43732.0. Samples: 3609218020. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 12:32:08,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 12:32:09,176][06909] Updated weights for policy 0, policy_version 226213 (0.0033) [2024-06-28 12:32:12,631][06909] Updated weights for policy 0, policy_version 226223 (0.0046) [2024-06-28 12:32:13,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.7, 300 sec: 44098.0). Total num frames: 3706486784. Throughput: 0: 43917.8. Samples: 3609354500. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 12:32:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:32:16,763][06909] Updated weights for policy 0, policy_version 226233 (0.0032) [2024-06-28 12:32:18,850][06674] Fps is (10 sec: 42598.7, 60 sec: 43417.6, 300 sec: 44098.0). Total num frames: 3706683392. Throughput: 0: 43743.5. Samples: 3609617960. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 12:32:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:32:20,192][06909] Updated weights for policy 0, policy_version 226243 (0.0033) [2024-06-28 12:32:23,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 3706912768. Throughput: 0: 43789.7. Samples: 3609878440. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 12:32:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:32:24,349][06909] Updated weights for policy 0, policy_version 226253 (0.0037) [2024-06-28 12:32:27,828][06909] Updated weights for policy 0, policy_version 226263 (0.0033) [2024-06-28 12:32:28,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3707142144. Throughput: 0: 43971.6. Samples: 3610019140. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 12:32:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:32:31,470][06909] Updated weights for policy 0, policy_version 226273 (0.0030) [2024-06-28 12:32:33,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43690.6, 300 sec: 44098.0). Total num frames: 3707338752. Throughput: 0: 43941.0. Samples: 3610278960. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 12:32:33,850][06674] Avg episode reward: [(0, '0.420')] [2024-06-28 12:32:35,199][06909] Updated weights for policy 0, policy_version 226283 (0.0030) [2024-06-28 12:32:38,603][06909] Updated weights for policy 0, policy_version 226293 (0.0031) [2024-06-28 12:32:38,850][06674] Fps is (10 sec: 44236.4, 60 sec: 43963.6, 300 sec: 44097.9). Total num frames: 3707584512. Throughput: 0: 43960.4. Samples: 3610540700. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 12:32:38,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:32:42,584][06909] Updated weights for policy 0, policy_version 226303 (0.0029) [2024-06-28 12:32:43,850][06674] Fps is (10 sec: 45875.0, 60 sec: 43690.7, 300 sec: 44042.4). Total num frames: 3707797504. Throughput: 0: 44039.9. Samples: 3610678740. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 12:32:43,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:32:46,231][06909] Updated weights for policy 0, policy_version 226313 (0.0027) [2024-06-28 12:32:48,850][06674] Fps is (10 sec: 40960.8, 60 sec: 43690.8, 300 sec: 43986.9). Total num frames: 3707994112. Throughput: 0: 44099.1. Samples: 3610940600. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 12:32:48,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 12:32:49,531][06887] Signal inference workers to stop experience collection... (51050 times) [2024-06-28 12:32:49,534][06887] Signal inference workers to resume experience collection... (51050 times) [2024-06-28 12:32:49,546][06909] InferenceWorker_p0-w0: stopping experience collection (51050 times) [2024-06-28 12:32:49,547][06909] InferenceWorker_p0-w0: resuming experience collection (51050 times) [2024-06-28 12:32:49,872][06909] Updated weights for policy 0, policy_version 226323 (0.0027) [2024-06-28 12:32:53,437][06909] Updated weights for policy 0, policy_version 226333 (0.0029) [2024-06-28 12:32:53,850][06674] Fps is (10 sec: 44237.1, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3708239872. Throughput: 0: 44090.7. Samples: 3611202100. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 12:32:53,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 12:32:57,283][06909] Updated weights for policy 0, policy_version 226343 (0.0039) [2024-06-28 12:32:58,851][06674] Fps is (10 sec: 47507.8, 60 sec: 43962.9, 300 sec: 44097.8). Total num frames: 3708469248. Throughput: 0: 44247.7. Samples: 3611345700. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 12:32:58,852][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:33:01,168][06909] Updated weights for policy 0, policy_version 226353 (0.0041) [2024-06-28 12:33:03,850][06674] Fps is (10 sec: 42598.5, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3708665856. Throughput: 0: 44292.9. Samples: 3611611140. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 12:33:03,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 12:33:04,855][06909] Updated weights for policy 0, policy_version 226363 (0.0035) [2024-06-28 12:33:08,291][06909] Updated weights for policy 0, policy_version 226373 (0.0035) [2024-06-28 12:33:08,850][06674] Fps is (10 sec: 42603.2, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3708895232. Throughput: 0: 44153.3. Samples: 3611865340. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 12:33:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:33:12,336][06909] Updated weights for policy 0, policy_version 226383 (0.0028) [2024-06-28 12:33:13,850][06674] Fps is (10 sec: 45874.9, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3709124608. Throughput: 0: 44205.8. Samples: 3612008400. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 12:33:13,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:33:15,970][06909] Updated weights for policy 0, policy_version 226393 (0.0028) [2024-06-28 12:33:18,850][06674] Fps is (10 sec: 44237.1, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3709337600. Throughput: 0: 44166.7. Samples: 3612266460. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 12:33:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:33:19,599][06909] Updated weights for policy 0, policy_version 226403 (0.0034) [2024-06-28 12:33:23,424][06909] Updated weights for policy 0, policy_version 226413 (0.0039) [2024-06-28 12:33:23,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3709550592. Throughput: 0: 44315.2. Samples: 3612534880. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 12:33:23,859][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:33:26,919][06909] Updated weights for policy 0, policy_version 226423 (0.0030) [2024-06-28 12:33:28,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3709779968. Throughput: 0: 44234.8. Samples: 3612669300. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 12:33:28,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:33:30,677][06909] Updated weights for policy 0, policy_version 226433 (0.0032) [2024-06-28 12:33:33,850][06674] Fps is (10 sec: 45875.0, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 3710009344. Throughput: 0: 44283.0. Samples: 3612933340. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 12:33:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:33:34,422][06909] Updated weights for policy 0, policy_version 226443 (0.0039) [2024-06-28 12:33:38,026][06909] Updated weights for policy 0, policy_version 226453 (0.0032) [2024-06-28 12:33:38,850][06674] Fps is (10 sec: 44236.7, 60 sec: 43963.9, 300 sec: 44098.0). Total num frames: 3710222336. Throughput: 0: 44374.7. Samples: 3613198960. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 12:33:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:33:41,948][06909] Updated weights for policy 0, policy_version 226463 (0.0033) [2024-06-28 12:33:43,850][06674] Fps is (10 sec: 45875.9, 60 sec: 44510.0, 300 sec: 44098.0). Total num frames: 3710468096. Throughput: 0: 44138.5. Samples: 3613331880. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 12:33:43,850][06674] Avg episode reward: [(0, '0.421')] [2024-06-28 12:33:45,230][06909] Updated weights for policy 0, policy_version 226473 (0.0043) [2024-06-28 12:33:48,850][06674] Fps is (10 sec: 42597.8, 60 sec: 44236.7, 300 sec: 43986.9). Total num frames: 3710648320. Throughput: 0: 44112.3. Samples: 3613596200. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 12:33:48,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:33:48,946][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000226481_3710664704.pth... [2024-06-28 12:33:49,001][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000225838_3700129792.pth [2024-06-28 12:33:49,339][06909] Updated weights for policy 0, policy_version 226483 (0.0025) [2024-06-28 12:33:53,001][06909] Updated weights for policy 0, policy_version 226493 (0.0033) [2024-06-28 12:33:53,850][06674] Fps is (10 sec: 40960.1, 60 sec: 43963.8, 300 sec: 43986.9). Total num frames: 3710877696. Throughput: 0: 44310.8. Samples: 3613859320. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 12:33:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:33:56,803][06909] Updated weights for policy 0, policy_version 226503 (0.0035) [2024-06-28 12:33:58,020][06887] Signal inference workers to stop experience collection... (51100 times) [2024-06-28 12:33:58,020][06887] Signal inference workers to resume experience collection... (51100 times) [2024-06-28 12:33:58,061][06909] InferenceWorker_p0-w0: stopping experience collection (51100 times) [2024-06-28 12:33:58,062][06909] InferenceWorker_p0-w0: resuming experience collection (51100 times) [2024-06-28 12:33:58,850][06674] Fps is (10 sec: 47514.4, 60 sec: 44237.7, 300 sec: 44042.4). Total num frames: 3711123456. Throughput: 0: 44118.3. Samples: 3613993720. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 12:33:58,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:34:00,514][06909] Updated weights for policy 0, policy_version 226513 (0.0032) [2024-06-28 12:34:03,851][06674] Fps is (10 sec: 44229.1, 60 sec: 44235.6, 300 sec: 43931.1). Total num frames: 3711320064. Throughput: 0: 44113.5. Samples: 3614251640. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 12:34:03,852][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 12:34:04,479][06909] Updated weights for policy 0, policy_version 226523 (0.0040) [2024-06-28 12:34:07,838][06909] Updated weights for policy 0, policy_version 226533 (0.0039) [2024-06-28 12:34:08,850][06674] Fps is (10 sec: 42598.0, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3711549440. Throughput: 0: 43989.3. Samples: 3614514400. Policy #0 lag: (min: 0.0, avg: 10.9, max: 22.0) [2024-06-28 12:34:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:34:11,682][06909] Updated weights for policy 0, policy_version 226543 (0.0026) [2024-06-28 12:34:13,850][06674] Fps is (10 sec: 45883.1, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 3711778816. Throughput: 0: 43927.1. Samples: 3614646020. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 12:34:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:34:15,172][06909] Updated weights for policy 0, policy_version 226553 (0.0030) [2024-06-28 12:34:18,850][06674] Fps is (10 sec: 42598.8, 60 sec: 43963.8, 300 sec: 43931.3). Total num frames: 3711975424. Throughput: 0: 44018.4. Samples: 3614914160. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 12:34:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:34:19,472][06909] Updated weights for policy 0, policy_version 226563 (0.0037) [2024-06-28 12:34:22,595][06909] Updated weights for policy 0, policy_version 226573 (0.0033) [2024-06-28 12:34:23,850][06674] Fps is (10 sec: 44236.9, 60 sec: 44510.0, 300 sec: 44098.0). Total num frames: 3712221184. Throughput: 0: 43987.6. Samples: 3615178400. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 12:34:23,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:34:26,749][06909] Updated weights for policy 0, policy_version 226583 (0.0028) [2024-06-28 12:34:28,850][06674] Fps is (10 sec: 47513.4, 60 sec: 44509.8, 300 sec: 44042.4). Total num frames: 3712450560. Throughput: 0: 43952.4. Samples: 3615309740. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 12:34:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:34:30,121][06909] Updated weights for policy 0, policy_version 226593 (0.0022) [2024-06-28 12:34:33,850][06674] Fps is (10 sec: 40960.0, 60 sec: 43690.8, 300 sec: 43931.7). Total num frames: 3712630784. Throughput: 0: 43913.5. Samples: 3615572300. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 12:34:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:34:34,234][06909] Updated weights for policy 0, policy_version 226603 (0.0030) [2024-06-28 12:34:37,639][06909] Updated weights for policy 0, policy_version 226613 (0.0029) [2024-06-28 12:34:38,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 3712892928. Throughput: 0: 43860.3. Samples: 3615833040. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 12:34:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:34:41,719][06909] Updated weights for policy 0, policy_version 226623 (0.0033) [2024-06-28 12:34:43,850][06674] Fps is (10 sec: 47513.4, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3713105920. Throughput: 0: 43895.1. Samples: 3615969000. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 12:34:43,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:34:45,189][06909] Updated weights for policy 0, policy_version 226633 (0.0037) [2024-06-28 12:34:48,850][06674] Fps is (10 sec: 40960.3, 60 sec: 44236.9, 300 sec: 43986.9). Total num frames: 3713302528. Throughput: 0: 44079.4. Samples: 3616235140. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 12:34:48,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:34:48,952][06909] Updated weights for policy 0, policy_version 226643 (0.0034) [2024-06-28 12:34:52,333][06909] Updated weights for policy 0, policy_version 226653 (0.0039) [2024-06-28 12:34:53,850][06674] Fps is (10 sec: 44236.7, 60 sec: 44509.8, 300 sec: 44153.5). Total num frames: 3713548288. Throughput: 0: 44086.7. Samples: 3616498300. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 12:34:53,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 12:34:56,650][06909] Updated weights for policy 0, policy_version 226663 (0.0031) [2024-06-28 12:34:58,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3713761280. Throughput: 0: 44211.5. Samples: 3616635540. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 12:34:58,850][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 12:34:59,670][06909] Updated weights for policy 0, policy_version 226673 (0.0036) [2024-06-28 12:35:03,850][06674] Fps is (10 sec: 40959.3, 60 sec: 43964.8, 300 sec: 43986.9). Total num frames: 3713957888. Throughput: 0: 44209.1. Samples: 3616903580. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 12:35:03,850][06674] Avg episode reward: [(0, '0.428')] [2024-06-28 12:35:04,317][06909] Updated weights for policy 0, policy_version 226683 (0.0034) [2024-06-28 12:35:07,045][06909] Updated weights for policy 0, policy_version 226693 (0.0039) [2024-06-28 12:35:08,850][06674] Fps is (10 sec: 44236.5, 60 sec: 44236.8, 300 sec: 44098.0). Total num frames: 3714203648. Throughput: 0: 44036.8. Samples: 3617160060. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 12:35:08,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:35:11,485][06909] Updated weights for policy 0, policy_version 226703 (0.0037) [2024-06-28 12:35:13,051][06887] Signal inference workers to stop experience collection... (51150 times) [2024-06-28 12:35:13,052][06887] Signal inference workers to resume experience collection... (51150 times) [2024-06-28 12:35:13,076][06909] InferenceWorker_p0-w0: stopping experience collection (51150 times) [2024-06-28 12:35:13,076][06909] InferenceWorker_p0-w0: resuming experience collection (51150 times) [2024-06-28 12:35:13,850][06674] Fps is (10 sec: 45875.9, 60 sec: 43963.7, 300 sec: 43986.9). Total num frames: 3714416640. Throughput: 0: 44303.6. Samples: 3617303400. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 12:35:13,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 12:35:14,362][06909] Updated weights for policy 0, policy_version 226713 (0.0023) [2024-06-28 12:35:18,823][06909] Updated weights for policy 0, policy_version 226723 (0.0030) [2024-06-28 12:35:18,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3714629632. Throughput: 0: 44268.4. Samples: 3617564380. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 12:35:18,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:35:21,907][06909] Updated weights for policy 0, policy_version 226733 (0.0035) [2024-06-28 12:35:23,856][06674] Fps is (10 sec: 44210.0, 60 sec: 43959.2, 300 sec: 44097.1). Total num frames: 3714859008. Throughput: 0: 44116.8. Samples: 3617818560. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 12:35:23,857][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:35:26,017][06909] Updated weights for policy 0, policy_version 226743 (0.0030) [2024-06-28 12:35:28,850][06674] Fps is (10 sec: 45875.3, 60 sec: 43963.8, 300 sec: 44042.4). Total num frames: 3715088384. Throughput: 0: 44205.3. Samples: 3617958240. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 12:35:28,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:35:29,295][06909] Updated weights for policy 0, policy_version 226753 (0.0032) [2024-06-28 12:35:33,850][06674] Fps is (10 sec: 40984.1, 60 sec: 43963.5, 300 sec: 43986.9). Total num frames: 3715268608. Throughput: 0: 44137.6. Samples: 3618221340. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 12:35:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:35:34,085][06909] Updated weights for policy 0, policy_version 226763 (0.0032) [2024-06-28 12:35:36,899][06909] Updated weights for policy 0, policy_version 226773 (0.0028) [2024-06-28 12:35:38,850][06674] Fps is (10 sec: 42597.8, 60 sec: 43690.6, 300 sec: 43986.9). Total num frames: 3715514368. Throughput: 0: 43973.7. Samples: 3618477120. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 12:35:38,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:35:41,457][06909] Updated weights for policy 0, policy_version 226783 (0.0031) [2024-06-28 12:35:43,850][06674] Fps is (10 sec: 47514.2, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3715743744. Throughput: 0: 44221.2. Samples: 3618625500. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 12:35:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:35:44,469][06909] Updated weights for policy 0, policy_version 226793 (0.0031) [2024-06-28 12:35:48,649][06909] Updated weights for policy 0, policy_version 226803 (0.0024) [2024-06-28 12:35:48,850][06674] Fps is (10 sec: 42598.4, 60 sec: 43963.6, 300 sec: 43986.9). Total num frames: 3715940352. Throughput: 0: 44057.8. Samples: 3618886180. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 12:35:48,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:35:48,857][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000226803_3715940352.pth... [2024-06-28 12:35:48,915][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000226158_3705372672.pth [2024-06-28 12:35:51,648][06909] Updated weights for policy 0, policy_version 226813 (0.0042) [2024-06-28 12:35:53,850][06674] Fps is (10 sec: 44237.0, 60 sec: 43963.7, 300 sec: 44042.7). Total num frames: 3716186112. Throughput: 0: 44114.7. Samples: 3619145220. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 12:35:53,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:35:56,105][06909] Updated weights for policy 0, policy_version 226823 (0.0021) [2024-06-28 12:35:58,855][06674] Fps is (10 sec: 47487.8, 60 sec: 44232.7, 300 sec: 44097.1). Total num frames: 3716415488. Throughput: 0: 43929.7. Samples: 3619280480. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 12:35:58,856][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:35:59,066][06909] Updated weights for policy 0, policy_version 226833 (0.0032) [2024-06-28 12:36:03,285][06909] Updated weights for policy 0, policy_version 226843 (0.0043) [2024-06-28 12:36:03,850][06674] Fps is (10 sec: 42598.0, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3716612096. Throughput: 0: 44088.8. Samples: 3619548380. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 12:36:03,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:36:06,565][06909] Updated weights for policy 0, policy_version 226853 (0.0032) [2024-06-28 12:36:08,850][06674] Fps is (10 sec: 42621.8, 60 sec: 43963.7, 300 sec: 44042.4). Total num frames: 3716841472. Throughput: 0: 44274.4. Samples: 3619810640. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 12:36:08,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:36:10,932][06909] Updated weights for policy 0, policy_version 226863 (0.0034) [2024-06-28 12:36:13,852][06674] Fps is (10 sec: 45866.3, 60 sec: 44235.3, 300 sec: 44042.1). Total num frames: 3717070848. Throughput: 0: 44185.0. Samples: 3619946660. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 12:36:13,852][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:36:14,140][06909] Updated weights for policy 0, policy_version 226873 (0.0045) [2024-06-28 12:36:18,235][06909] Updated weights for policy 0, policy_version 226883 (0.0033) [2024-06-28 12:36:18,850][06674] Fps is (10 sec: 44236.8, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3717283840. Throughput: 0: 44264.6. Samples: 3620213240. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 12:36:18,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:36:21,408][06909] Updated weights for policy 0, policy_version 226893 (0.0044) [2024-06-28 12:36:23,856][06674] Fps is (10 sec: 42581.2, 60 sec: 43963.7, 300 sec: 44041.5). Total num frames: 3717496832. Throughput: 0: 44359.4. Samples: 3620473560. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 12:36:23,857][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:36:25,507][06909] Updated weights for policy 0, policy_version 226903 (0.0040) [2024-06-28 12:36:28,716][06909] Updated weights for policy 0, policy_version 226913 (0.0040) [2024-06-28 12:36:28,850][06674] Fps is (10 sec: 45874.6, 60 sec: 44236.6, 300 sec: 44153.5). Total num frames: 3717742592. Throughput: 0: 44018.1. Samples: 3620606320. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 12:36:28,851][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:36:32,875][06909] Updated weights for policy 0, policy_version 226923 (0.0033) [2024-06-28 12:36:33,850][06674] Fps is (10 sec: 44264.2, 60 sec: 44510.1, 300 sec: 44042.4). Total num frames: 3717939200. Throughput: 0: 44099.8. Samples: 3620870660. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 12:36:33,850][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:36:36,452][06909] Updated weights for policy 0, policy_version 226933 (0.0036) [2024-06-28 12:36:38,850][06674] Fps is (10 sec: 42598.5, 60 sec: 44236.8, 300 sec: 44042.4). Total num frames: 3718168576. Throughput: 0: 44137.2. Samples: 3621131400. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 12:36:38,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:36:40,478][06909] Updated weights for policy 0, policy_version 226943 (0.0035) [2024-06-28 12:36:43,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 3718381568. Throughput: 0: 44314.4. Samples: 3621274380. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 12:36:43,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:36:43,858][06909] Updated weights for policy 0, policy_version 226953 (0.0041) [2024-06-28 12:36:47,735][06909] Updated weights for policy 0, policy_version 226963 (0.0040) [2024-06-28 12:36:48,856][06674] Fps is (10 sec: 44210.5, 60 sec: 44505.4, 300 sec: 44097.0). Total num frames: 3718610944. Throughput: 0: 44131.9. Samples: 3621534580. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 12:36:48,857][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:36:51,131][06909] Updated weights for policy 0, policy_version 226973 (0.0025) [2024-06-28 12:36:52,085][06887] Signal inference workers to stop experience collection... (51200 times) [2024-06-28 12:36:52,085][06887] Signal inference workers to resume experience collection... (51200 times) [2024-06-28 12:36:52,128][06909] InferenceWorker_p0-w0: stopping experience collection (51200 times) [2024-06-28 12:36:52,128][06909] InferenceWorker_p0-w0: resuming experience collection (51200 times) [2024-06-28 12:36:53,850][06674] Fps is (10 sec: 45875.1, 60 sec: 44236.9, 300 sec: 44098.0). Total num frames: 3718840320. Throughput: 0: 44221.9. Samples: 3621800620. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 12:36:53,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:36:55,025][06909] Updated weights for policy 0, policy_version 226983 (0.0025) [2024-06-28 12:36:58,431][06909] Updated weights for policy 0, policy_version 226993 (0.0034) [2024-06-28 12:36:58,850][06674] Fps is (10 sec: 44263.9, 60 sec: 43967.8, 300 sec: 44153.5). Total num frames: 3719053312. Throughput: 0: 44086.0. Samples: 3621930440. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 12:36:58,850][06674] Avg episode reward: [(0, '0.433')] [2024-06-28 12:37:02,715][06909] Updated weights for policy 0, policy_version 227003 (0.0037) [2024-06-28 12:37:03,850][06674] Fps is (10 sec: 44236.4, 60 sec: 44509.9, 300 sec: 44153.5). Total num frames: 3719282688. Throughput: 0: 44113.8. Samples: 3622198360. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 12:37:03,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:37:05,686][06909] Updated weights for policy 0, policy_version 227013 (0.0040) [2024-06-28 12:37:08,850][06674] Fps is (10 sec: 44236.2, 60 sec: 44236.8, 300 sec: 44097.9). Total num frames: 3719495680. Throughput: 0: 44051.2. Samples: 3622455600. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 12:37:08,851][06674] Avg episode reward: [(0, '0.426')] [2024-06-28 12:37:10,015][06909] Updated weights for policy 0, policy_version 227023 (0.0045) [2024-06-28 12:37:13,441][06909] Updated weights for policy 0, policy_version 227033 (0.0030) [2024-06-28 12:37:13,850][06674] Fps is (10 sec: 42598.9, 60 sec: 43965.3, 300 sec: 44153.5). Total num frames: 3719708672. Throughput: 0: 44071.4. Samples: 3622589520. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 12:37:13,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:37:17,598][06909] Updated weights for policy 0, policy_version 227043 (0.0036) [2024-06-28 12:37:18,850][06674] Fps is (10 sec: 44237.3, 60 sec: 44236.8, 300 sec: 44153.5). Total num frames: 3719938048. Throughput: 0: 44228.4. Samples: 3622860940. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 12:37:18,850][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:37:20,813][06909] Updated weights for policy 0, policy_version 227053 (0.0034) [2024-06-28 12:37:23,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44241.3, 300 sec: 44098.0). Total num frames: 3720151040. Throughput: 0: 44227.7. Samples: 3623121640. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 12:37:23,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:37:24,945][06909] Updated weights for policy 0, policy_version 227063 (0.0033) [2024-06-28 12:37:28,146][06909] Updated weights for policy 0, policy_version 227073 (0.0041) [2024-06-28 12:37:28,850][06674] Fps is (10 sec: 44236.6, 60 sec: 43963.8, 300 sec: 44209.0). Total num frames: 3720380416. Throughput: 0: 43883.4. Samples: 3623249140. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 12:37:28,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:37:32,365][06909] Updated weights for policy 0, policy_version 227083 (0.0031) [2024-06-28 12:37:33,850][06674] Fps is (10 sec: 44236.6, 60 sec: 44236.7, 300 sec: 44098.0). Total num frames: 3720593408. Throughput: 0: 43975.3. Samples: 3623513200. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 12:37:33,850][06674] Avg episode reward: [(0, '0.425')] [2024-06-28 12:37:35,835][06909] Updated weights for policy 0, policy_version 227093 (0.0033) [2024-06-28 12:37:38,850][06674] Fps is (10 sec: 42598.6, 60 sec: 43963.8, 300 sec: 44098.0). Total num frames: 3720806400. Throughput: 0: 43933.7. Samples: 3623777640. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 12:37:38,850][06674] Avg episode reward: [(0, '0.423')] [2024-06-28 12:37:39,682][06909] Updated weights for policy 0, policy_version 227103 (0.0035) [2024-06-28 12:37:43,243][06909] Updated weights for policy 0, policy_version 227113 (0.0035) [2024-06-28 12:37:43,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43963.6, 300 sec: 44153.5). Total num frames: 3721019392. Throughput: 0: 43823.5. Samples: 3623902500. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 12:37:43,850][06674] Avg episode reward: [(0, '0.427')] [2024-06-28 12:37:47,593][06909] Updated weights for policy 0, policy_version 227123 (0.0045) [2024-06-28 12:37:48,850][06674] Fps is (10 sec: 42598.2, 60 sec: 43695.1, 300 sec: 44042.4). Total num frames: 3721232384. Throughput: 0: 43724.0. Samples: 3624165940. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 12:37:48,850][06674] Avg episode reward: [(0, '0.422')] [2024-06-28 12:37:48,972][06887] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000227127_3721248768.pth... [2024-06-28 12:37:49,033][06887] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000226481_3710664704.pth [2024-06-28 12:37:50,633][06909] Updated weights for policy 0, policy_version 227133 (0.0038) [2024-06-28 12:37:53,850][06674] Fps is (10 sec: 44236.8, 60 sec: 43690.6, 300 sec: 44042.6). Total num frames: 3721461760. Throughput: 0: 43858.7. Samples: 3624429240. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 12:37:53,851][06674] Avg episode reward: [(0, '0.424')] [2024-06-28 12:37:54,950][06909] Updated weights for policy 0, policy_version 227143 (0.0041) [2024-06-28 12:38:20,242][09190] Saving configuration to ./train_dir/sample_factory/p2.sf/config.json... [2024-06-28 12:38:20,307][09190] Rollout worker 0 uses device cpu [2024-06-28 12:38:20,308][09190] Rollout worker 1 uses device cpu [2024-06-28 12:38:20,309][09190] Rollout worker 2 uses device cpu [2024-06-28 12:38:20,309][09190] Rollout worker 3 uses device cpu [2024-06-28 12:38:20,309][09190] Rollout worker 4 uses device cpu [2024-06-28 12:38:20,310][09190] Rollout worker 5 uses device cpu [2024-06-28 12:38:20,310][09190] Rollout worker 6 uses device cpu [2024-06-28 12:38:20,311][09190] Rollout worker 7 uses device cpu [2024-06-28 12:38:20,311][09190] Rollout worker 8 uses device cpu [2024-06-28 12:38:20,312][09190] Rollout worker 9 uses device cpu [2024-06-28 12:38:20,313][09190] Rollout worker 10 uses device cpu [2024-06-28 12:38:20,313][09190] Rollout worker 11 uses device cpu [2024-06-28 12:38:20,313][09190] Rollout worker 12 uses device cpu [2024-06-28 12:38:20,313][09190] Rollout worker 13 uses device cpu [2024-06-28 12:38:20,313][09190] Rollout worker 14 uses device cpu [2024-06-28 12:38:20,314][09190] Rollout worker 15 uses device cpu [2024-06-28 12:38:20,314][09190] Rollout worker 16 uses device cpu [2024-06-28 12:38:20,314][09190] Rollout worker 17 uses device cpu [2024-06-28 12:38:20,314][09190] Rollout worker 18 uses device cpu [2024-06-28 12:38:20,314][09190] Rollout worker 19 uses device cpu [2024-06-28 12:38:20,315][09190] Rollout worker 20 uses device cpu [2024-06-28 12:38:20,315][09190] Rollout worker 21 uses device cpu [2024-06-28 12:38:20,315][09190] Rollout worker 22 uses device cpu [2024-06-28 12:38:20,315][09190] Rollout worker 23 uses device cpu [2024-06-28 12:38:20,315][09190] Rollout worker 24 uses device cpu [2024-06-28 12:38:20,316][09190] Rollout worker 25 uses device cpu [2024-06-28 12:38:20,316][09190] Rollout worker 26 uses device cpu [2024-06-28 12:38:20,316][09190] Rollout worker 27 uses device cpu [2024-06-28 12:38:20,316][09190] Rollout worker 28 uses device cpu [2024-06-28 12:38:20,316][09190] Rollout worker 29 uses device cpu [2024-06-28 12:38:20,317][09190] Rollout worker 30 uses device cpu [2024-06-28 12:38:20,317][09190] Rollout worker 31 uses device cpu [2024-06-28 12:38:20,903][09190] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2024-06-28 12:38:20,903][09190] InferenceWorker_p0-w0: min num requests: 10 [2024-06-28 12:38:20,962][09190] Starting all processes... [2024-06-28 12:38:20,963][09190] Starting process learner_proc0 [2024-06-28 12:38:21,237][09190] Starting all processes... [2024-06-28 12:38:21,240][09190] Starting process inference_proc0-0 [2024-06-28 12:38:21,240][09190] Starting process rollout_proc0 [2024-06-28 12:38:21,240][09190] Starting process rollout_proc1 [2024-06-28 12:38:21,240][09190] Starting process rollout_proc2 [2024-06-28 12:38:21,240][09190] Starting process rollout_proc3 [2024-06-28 12:38:21,240][09190] Starting process rollout_proc4 [2024-06-28 12:38:21,240][09190] Starting process rollout_proc5 [2024-06-28 12:38:21,240][09190] Starting process rollout_proc6 [2024-06-28 12:38:21,241][09190] Starting process rollout_proc7 [2024-06-28 12:38:21,241][09190] Starting process rollout_proc8 [2024-06-28 12:38:21,241][09190] Starting process rollout_proc9 [2024-06-28 12:38:21,241][09190] Starting process rollout_proc10 [2024-06-28 12:38:21,242][09190] Starting process rollout_proc11 [2024-06-28 12:38:21,243][09190] Starting process rollout_proc12 [2024-06-28 12:38:21,243][09190] Starting process rollout_proc13 [2024-06-28 12:38:21,244][09190] Starting process rollout_proc14 [2024-06-28 12:38:21,245][09190] Starting process rollout_proc15 [2024-06-28 12:38:21,246][09190] Starting process rollout_proc16 [2024-06-28 12:38:21,248][09190] Starting process rollout_proc17 [2024-06-28 12:38:21,248][09190] Starting process rollout_proc18 [2024-06-28 12:38:21,250][09190] Starting process rollout_proc19 [2024-06-28 12:38:21,251][09190] Starting process rollout_proc20 [2024-06-28 12:38:21,253][09190] Starting process rollout_proc21 [2024-06-28 12:38:21,256][09190] Starting process rollout_proc22 [2024-06-28 12:38:21,256][09190] Starting process rollout_proc23 [2024-06-28 12:38:21,257][09190] Starting process rollout_proc24 [2024-06-28 12:38:21,258][09190] Starting process rollout_proc25 [2024-06-28 12:38:21,258][09190] Starting process rollout_proc26 [2024-06-28 12:38:21,263][09190] Starting process rollout_proc27 [2024-06-28 12:38:21,263][09190] Starting process rollout_proc28 [2024-06-28 12:38:21,263][09190] Starting process rollout_proc29 [2024-06-28 12:38:21,266][09190] Starting process rollout_proc30 [2024-06-28 12:38:21,267][09190] Starting process rollout_proc31 [2024-06-28 12:38:23,056][09450] Worker 25 uses CPU cores [25] [2024-06-28 12:38:23,334][09424] Worker 0 uses CPU cores [0] [2024-06-28 12:38:23,351][09431] Worker 9 uses CPU cores [9] [2024-06-28 12:38:23,351][09447] Worker 24 uses CPU cores [24] [2024-06-28 12:38:23,368][09454] Worker 31 uses CPU cores [31] [2024-06-28 12:38:23,404][09433] Worker 8 uses CPU cores [8] [2024-06-28 12:38:23,428][09452] Worker 27 uses CPU cores [27] [2024-06-28 12:38:23,452][09430] Worker 7 uses CPU cores [7] [2024-06-28 12:38:23,488][09427] Worker 3 uses CPU cores [3] [2024-06-28 12:38:23,520][09440] Worker 16 uses CPU cores [16] [2024-06-28 12:38:23,555][09442] Worker 18 uses CPU cores [18] [2024-06-28 12:38:23,563][09451] Worker 29 uses CPU cores [29] [2024-06-28 12:38:23,567][09432] Worker 6 uses CPU cores [6] [2024-06-28 12:38:23,568][09444] Worker 20 uses CPU cores [20] [2024-06-28 12:38:23,600][09445] Worker 21 uses CPU cores [21] [2024-06-28 12:38:23,601][09425] Worker 1 uses CPU cores [1] [2024-06-28 12:38:23,636][09436] Worker 10 uses CPU cores [10] [2024-06-28 12:38:23,650][09441] Worker 17 uses CPU cores [17] [2024-06-28 12:38:23,652][09443] Worker 19 uses CPU cores [19] [2024-06-28 12:38:23,661][09438] Worker 15 uses CPU cores [15] [2024-06-28 12:38:23,676][09437] Worker 13 uses CPU cores [13] [2024-06-28 12:38:23,688][09446] Worker 22 uses CPU cores [22] [2024-06-28 12:38:23,698][09439] Worker 14 uses CPU cores [14] [2024-06-28 12:38:23,711][09403] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2024-06-28 12:38:23,711][09403] Set environment var CUDA_VISIBLE_DEVICES to '0' (GPU indices [0]) for learning process 0 [2024-06-28 12:38:23,719][09403] Num visible devices: 1 [2024-06-28 12:38:23,728][09426] Worker 2 uses CPU cores [2] [2024-06-28 12:38:23,736][09403] Setting fixed seed 0 [2024-06-28 12:38:23,737][09403] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2024-06-28 12:38:23,737][09403] Initializing actor-critic model on device cuda:0 [2024-06-28 12:38:23,763][09435] Worker 12 uses CPU cores [12] [2024-06-28 12:38:23,772][09428] Worker 4 uses CPU cores [4] [2024-06-28 12:38:23,774][09434] Worker 11 uses CPU cores [11] [2024-06-28 12:38:23,775][09448] Worker 23 uses CPU cores [23] [2024-06-28 12:38:23,819][09453] Worker 28 uses CPU cores [28] [2024-06-28 12:38:23,845][09423] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2024-06-28 12:38:23,845][09423] Set environment var CUDA_VISIBLE_DEVICES to '0' (GPU indices [0]) for inference process 0 [2024-06-28 12:38:23,846][09429] Worker 5 uses CPU cores [5] [2024-06-28 12:38:23,852][09423] Num visible devices: 1 [2024-06-28 12:38:23,883][09455] Worker 30 uses CPU cores [30] [2024-06-28 12:38:23,922][09449] Worker 26 uses CPU cores [26] [2024-06-28 12:38:24,468][09403] RunningMeanStd input shape: (11, 11) [2024-06-28 12:38:24,468][09403] RunningMeanStd input shape: (11, 11) [2024-06-28 12:38:24,468][09403] RunningMeanStd input shape: (11, 11) [2024-06-28 12:38:24,468][09403] RunningMeanStd input shape: (11, 11) [2024-06-28 12:38:24,468][09403] RunningMeanStd input shape: (11, 11) [2024-06-28 12:38:24,468][09403] RunningMeanStd input shape: (11, 11) [2024-06-28 12:38:24,469][09403] RunningMeanStd input shape: (11, 11) [2024-06-28 12:38:24,469][09403] RunningMeanStd input shape: (11, 11) [2024-06-28 12:38:24,469][09403] RunningMeanStd input shape: (11, 11) [2024-06-28 12:38:24,469][09403] RunningMeanStd input shape: (11, 11) [2024-06-28 12:38:24,469][09403] RunningMeanStd input shape: (11, 11) [2024-06-28 12:38:24,469][09403] RunningMeanStd input shape: (11, 11) [2024-06-28 12:38:24,469][09403] RunningMeanStd input shape: (11, 11) [2024-06-28 12:38:24,469][09403] RunningMeanStd input shape: (11, 11) [2024-06-28 12:38:24,469][09403] RunningMeanStd input shape: (11, 11) [2024-06-28 12:38:24,469][09403] RunningMeanStd input shape: (11, 11) [2024-06-28 12:38:24,469][09403] RunningMeanStd input shape: (11, 11) [2024-06-28 12:38:24,469][09403] RunningMeanStd input shape: (11, 11) [2024-06-28 12:38:24,469][09403] RunningMeanStd input shape: (11, 11) [2024-06-28 12:38:24,469][09403] RunningMeanStd input shape: (11, 11) [2024-06-28 12:38:24,469][09403] RunningMeanStd input shape: (11, 11) [2024-06-28 12:38:24,469][09403] RunningMeanStd input shape: (11, 11) [2024-06-28 12:38:24,469][09403] RunningMeanStd input shape: (11, 11) [2024-06-28 12:38:24,472][09403] RunningMeanStd input shape: (1,) [2024-06-28 12:38:24,473][09403] RunningMeanStd input shape: (1,) [2024-06-28 12:38:24,473][09403] RunningMeanStd input shape: (1,) [2024-06-28 12:38:24,473][09403] RunningMeanStd input shape: (1,) [2024-06-28 12:38:24,473][09403] RunningMeanStd input shape: (11, 11) [2024-06-28 12:38:24,509][09403] RunningMeanStd input shape: (1,) [2024-06-28 12:38:24,517][09403] Created Actor Critic model with architecture: [2024-06-28 12:38:24,517][09403] SampleFactoryAgentWrapper( (obs_normalizer): ObservationNormalizer() (returns_normalizer): RecursiveScriptModule(original_name=RunningMeanStdInPlace) (agent): MettaAgent( (_encoder): MultiFeatureSetEncoder( (feature_set_encoders): ModuleDict( (grid_obs): FeatureSetEncoder( (_normalizer): FeatureListNormalizer( (_norms_dict): ModuleDict( (agent): RunningMeanStdInPlace() (altar): RunningMeanStdInPlace() (clock): RunningMeanStdInPlace() (converter): RunningMeanStdInPlace() (generator): RunningMeanStdInPlace() (wall): RunningMeanStdInPlace() (agent:dir): RunningMeanStdInPlace() (agent:energy): RunningMeanStdInPlace() (agent:frozen): RunningMeanStdInPlace() (agent:hp): RunningMeanStdInPlace() (agent:id): RunningMeanStdInPlace() (agent:inv_r1): RunningMeanStdInPlace() (agent:inv_r2): RunningMeanStdInPlace() (agent:inv_r3): RunningMeanStdInPlace() (agent:shield): RunningMeanStdInPlace() (altar:hp): RunningMeanStdInPlace() (altar:state): RunningMeanStdInPlace() (converter:hp): RunningMeanStdInPlace() (converter:state): RunningMeanStdInPlace() (generator:amount): RunningMeanStdInPlace() (generator:hp): RunningMeanStdInPlace() (generator:state): RunningMeanStdInPlace() (wall:hp): RunningMeanStdInPlace() ) ) (embedding_net): Sequential( (0): Linear(in_features=125, out_features=512, bias=True) (1): ELU(alpha=1.0) (2): Linear(in_features=512, out_features=512, bias=True) (3): ELU(alpha=1.0) (4): Linear(in_features=512, out_features=512, bias=True) (5): ELU(alpha=1.0) (6): Linear(in_features=512, out_features=512, bias=True) (7): ELU(alpha=1.0) ) ) (global_vars): FeatureSetEncoder( (_normalizer): FeatureListNormalizer( (_norms_dict): ModuleDict( (_steps): RunningMeanStdInPlace() ) ) (embedding_net): Sequential( (0): Linear(in_features=5, out_features=8, bias=True) (1): ELU(alpha=1.0) (2): Linear(in_features=8, out_features=8, bias=True) (3): ELU(alpha=1.0) ) ) (last_action): FeatureSetEncoder( (_normalizer): FeatureListNormalizer( (_norms_dict): ModuleDict( (last_action_id): RunningMeanStdInPlace() (last_action_val): RunningMeanStdInPlace() ) ) (embedding_net): Sequential( (0): Linear(in_features=5, out_features=8, bias=True) (1): ELU(alpha=1.0) (2): Linear(in_features=8, out_features=8, bias=True) (3): ELU(alpha=1.0) ) ) (last_reward): FeatureSetEncoder( (_normalizer): FeatureListNormalizer( (_norms_dict): ModuleDict( (last_reward): RunningMeanStdInPlace() ) ) (embedding_net): Sequential( (0): Linear(in_features=5, out_features=8, bias=True) (1): ELU(alpha=1.0) (2): Linear(in_features=8, out_features=8, bias=True) (3): ELU(alpha=1.0) ) ) (kinship): FeatureSetEncoder( (_normalizer): FeatureListNormalizer( (_norms_dict): ModuleDict( (kinship): RunningMeanStdInPlace() ) ) (embedding_net): Sequential( (0): Linear(in_features=125, out_features=8, bias=True) (1): ELU(alpha=1.0) (2): Linear(in_features=8, out_features=8, bias=True) (3): ELU(alpha=1.0) ) ) ) (merged_encoder): Sequential( (0): Linear(in_features=544, out_features=512, bias=True) (1): ELU(alpha=1.0) (2): Linear(in_features=512, out_features=512, bias=True) (3): ELU(alpha=1.0) (4): Linear(in_features=512, out_features=512, bias=True) (5): ELU(alpha=1.0) ) ) (_decoder): Decoder( (mlp): Identity() ) (_critic_linear): Linear(in_features=512, out_features=1, bias=True) ) (_core): ModelCoreRNN( (core): GRU(512, 512) ) (_action_parameterization): ActionParameterizationDefault( (distribution_linear): Linear(in_features=512, out_features=16, bias=True) ) ) [2024-06-28 12:38:24,589][09403] Using optimizer [2024-06-28 12:38:24,778][09403] Loading state from checkpoint ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000227127_3721248768.pth... [2024-06-28 12:38:24,794][09403] Loading model from checkpoint [2024-06-28 12:38:24,796][09403] Loaded experiment state at self.train_step=227127, self.env_steps=3721248768 [2024-06-28 12:38:24,796][09403] Initialized policy 0 weights for model version 227127 [2024-06-28 12:38:24,797][09403] LearnerWorker_p0 finished initialization! [2024-06-28 12:38:24,797][09403] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2024-06-28 12:38:25,539][09423] RunningMeanStd input shape: (11, 11) [2024-06-28 12:38:25,539][09423] RunningMeanStd input shape: (11, 11) [2024-06-28 12:38:25,539][09423] RunningMeanStd input shape: (11, 11) [2024-06-28 12:38:25,539][09423] RunningMeanStd input shape: (11, 11) [2024-06-28 12:38:25,540][09423] RunningMeanStd input shape: (11, 11) [2024-06-28 12:38:25,540][09423] RunningMeanStd input shape: (11, 11) [2024-06-28 12:38:25,548][09423] RunningMeanStd input shape: (11, 11) [2024-06-28 12:38:25,548][09423] RunningMeanStd input shape: (11, 11) [2024-06-28 12:38:25,548][09423] RunningMeanStd input shape: (11, 11) [2024-06-28 12:38:25,548][09423] RunningMeanStd input shape: (11, 11) [2024-06-28 12:38:25,548][09423] RunningMeanStd input shape: (11, 11) [2024-06-28 12:38:25,548][09423] RunningMeanStd input shape: (11, 11) [2024-06-28 12:38:25,548][09423] RunningMeanStd input shape: (11, 11) [2024-06-28 12:38:25,548][09423] RunningMeanStd input shape: (11, 11) [2024-06-28 12:38:25,548][09423] RunningMeanStd input shape: (11, 11) [2024-06-28 12:38:25,548][09423] RunningMeanStd input shape: (11, 11) [2024-06-28 12:38:25,548][09423] RunningMeanStd input shape: (11, 11) [2024-06-28 12:38:25,548][09423] RunningMeanStd input shape: (11, 11) [2024-06-28 12:38:25,548][09423] RunningMeanStd input shape: (11, 11) [2024-06-28 12:38:25,548][09423] RunningMeanStd input shape: (11, 11) [2024-06-28 12:38:25,548][09423] RunningMeanStd input shape: (11, 11) [2024-06-28 12:38:25,548][09423] RunningMeanStd input shape: (11, 11) [2024-06-28 12:38:25,549][09423] RunningMeanStd input shape: (11, 11) [2024-06-28 12:38:25,552][09423] RunningMeanStd input shape: (1,) [2024-06-28 12:38:25,552][09423] RunningMeanStd input shape: (1,) [2024-06-28 12:38:25,552][09423] RunningMeanStd input shape: (1,) [2024-06-28 12:38:25,552][09423] RunningMeanStd input shape: (1,) [2024-06-28 12:38:25,552][09423] RunningMeanStd input shape: (11, 11) [2024-06-28 12:38:25,588][09423] RunningMeanStd input shape: (1,) [2024-06-28 12:38:25,613][09190] Inference worker 0-0 is ready! [2024-06-28 12:38:25,613][09190] All inference workers are ready! Signal rollout workers to start! [2024-06-28 12:38:27,922][09190] Fps is (10 sec: nan, 60 sec: nan, 300 sec: nan). Total num frames: 3721248768. Throughput: 0: nan. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0) [2024-06-28 12:38:28,468][09446] Decorrelating experience for 0 frames... [2024-06-28 12:38:28,475][09441] Decorrelating experience for 0 frames... [2024-06-28 12:38:28,475][09448] Decorrelating experience for 0 frames... [2024-06-28 12:38:28,489][09444] Decorrelating experience for 0 frames... [2024-06-28 12:38:28,498][09442] Decorrelating experience for 0 frames... [2024-06-28 12:38:28,522][09451] Decorrelating experience for 0 frames... [2024-06-28 12:38:28,525][09449] Decorrelating experience for 0 frames... [2024-06-28 12:38:28,535][09455] Decorrelating experience for 0 frames... [2024-06-28 12:38:28,535][09450] Decorrelating experience for 0 frames... [2024-06-28 12:38:28,537][09440] Decorrelating experience for 0 frames... [2024-06-28 12:38:28,541][09425] Decorrelating experience for 0 frames... [2024-06-28 12:38:28,544][09454] Decorrelating experience for 0 frames... [2024-06-28 12:38:28,545][09431] Decorrelating experience for 0 frames... [2024-06-28 12:38:28,546][09434] Decorrelating experience for 0 frames... [2024-06-28 12:38:28,546][09438] Decorrelating experience for 0 frames... [2024-06-28 12:38:28,547][09435] Decorrelating experience for 0 frames... [2024-06-28 12:38:28,547][09447] Decorrelating experience for 0 frames... [2024-06-28 12:38:28,551][09437] Decorrelating experience for 0 frames... [2024-06-28 12:38:28,551][09424] Decorrelating experience for 0 frames... [2024-06-28 12:38:28,552][09432] Decorrelating experience for 0 frames... [2024-06-28 12:38:28,552][09433] Decorrelating experience for 0 frames... [2024-06-28 12:38:28,553][09453] Decorrelating experience for 0 frames... [2024-06-28 12:38:28,553][09430] Decorrelating experience for 0 frames... [2024-06-28 12:38:28,554][09429] Decorrelating experience for 0 frames... [2024-06-28 12:38:28,556][09426] Decorrelating experience for 0 frames... [2024-06-28 12:38:28,556][09439] Decorrelating experience for 0 frames... [2024-06-28 12:38:28,556][09436] Decorrelating experience for 0 frames... [2024-06-28 12:38:28,557][09427] Decorrelating experience for 0 frames... [2024-06-28 12:38:28,560][09428] Decorrelating experience for 0 frames... [2024-06-28 12:38:28,569][09443] Decorrelating experience for 0 frames... [2024-06-28 12:38:28,569][09445] Decorrelating experience for 0 frames... [2024-06-28 12:38:28,571][09452] Decorrelating experience for 0 frames... [2024-06-28 12:38:29,714][09446] Decorrelating experience for 256 frames... [2024-06-28 12:38:29,733][09448] Decorrelating experience for 256 frames... [2024-06-28 12:38:29,735][09441] Decorrelating experience for 256 frames... [2024-06-28 12:38:29,758][09442] Decorrelating experience for 256 frames... [2024-06-28 12:38:29,758][09444] Decorrelating experience for 256 frames... [2024-06-28 12:38:29,799][09451] Decorrelating experience for 256 frames... [2024-06-28 12:38:29,813][09440] Decorrelating experience for 256 frames... [2024-06-28 12:38:29,813][09449] Decorrelating experience for 256 frames... [2024-06-28 12:38:29,816][09455] Decorrelating experience for 256 frames... [2024-06-28 12:38:29,818][09450] Decorrelating experience for 256 frames... [2024-06-28 12:38:29,826][09454] Decorrelating experience for 256 frames... [2024-06-28 12:38:29,846][09447] Decorrelating experience for 256 frames... [2024-06-28 12:38:29,850][09431] Decorrelating experience for 256 frames... [2024-06-28 12:38:29,850][09425] Decorrelating experience for 256 frames... [2024-06-28 12:38:29,859][09434] Decorrelating experience for 256 frames... [2024-06-28 12:38:29,863][09438] Decorrelating experience for 256 frames... [2024-06-28 12:38:29,869][09435] Decorrelating experience for 256 frames... [2024-06-28 12:38:29,870][09429] Decorrelating experience for 256 frames... [2024-06-28 12:38:29,871][09437] Decorrelating experience for 256 frames... [2024-06-28 12:38:29,871][09433] Decorrelating experience for 256 frames... [2024-06-28 12:38:29,875][09432] Decorrelating experience for 256 frames... [2024-06-28 12:38:29,876][09424] Decorrelating experience for 256 frames... [2024-06-28 12:38:29,878][09453] Decorrelating experience for 256 frames... [2024-06-28 12:38:29,882][09430] Decorrelating experience for 256 frames... [2024-06-28 12:38:29,886][09436] Decorrelating experience for 256 frames... [2024-06-28 12:38:29,888][09439] Decorrelating experience for 256 frames... [2024-06-28 12:38:29,890][09426] Decorrelating experience for 256 frames... [2024-06-28 12:38:29,893][09427] Decorrelating experience for 256 frames... [2024-06-28 12:38:29,895][09428] Decorrelating experience for 256 frames... [2024-06-28 12:38:29,897][09452] Decorrelating experience for 256 frames... [2024-06-28 12:38:29,919][09443] Decorrelating experience for 256 frames... [2024-06-28 12:38:29,919][09445] Decorrelating experience for 256 frames... [2024-06-28 12:38:32,921][09190] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 3721248768. Throughput: 0: 3732.1. Samples: 18660. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0) [2024-06-28 12:38:36,846][09446] Worker 22, sleep for 103.125 sec to decorrelate experience collection [2024-06-28 12:38:36,855][09440] Worker 16, sleep for 75.000 sec to decorrelate experience collection [2024-06-28 12:38:36,867][09450] Worker 25, sleep for 117.188 sec to decorrelate experience collection [2024-06-28 12:38:36,868][09449] Worker 26, sleep for 121.875 sec to decorrelate experience collection [2024-06-28 12:38:36,868][09441] Worker 17, sleep for 79.688 sec to decorrelate experience collection [2024-06-28 12:38:36,878][09444] Worker 20, sleep for 93.750 sec to decorrelate experience collection [2024-06-28 12:38:36,878][09447] Worker 24, sleep for 112.500 sec to decorrelate experience collection [2024-06-28 12:38:36,879][09454] Worker 31, sleep for 145.312 sec to decorrelate experience collection [2024-06-28 12:38:36,887][09433] Worker 8, sleep for 37.500 sec to decorrelate experience collection [2024-06-28 12:38:36,888][09448] Worker 23, sleep for 107.812 sec to decorrelate experience collection [2024-06-28 12:38:36,901][09438] Worker 15, sleep for 70.312 sec to decorrelate experience collection [2024-06-28 12:38:36,901][09425] Worker 1, sleep for 4.688 sec to decorrelate experience collection [2024-06-28 12:38:36,901][09442] Worker 18, sleep for 84.375 sec to decorrelate experience collection [2024-06-28 12:38:36,903][09451] Worker 29, sleep for 135.938 sec to decorrelate experience collection [2024-06-28 12:38:36,907][09439] Worker 14, sleep for 65.625 sec to decorrelate experience collection [2024-06-28 12:38:36,909][09434] Worker 11, sleep for 51.562 sec to decorrelate experience collection [2024-06-28 12:38:36,910][09455] Worker 30, sleep for 140.625 sec to decorrelate experience collection [2024-06-28 12:38:36,915][09435] Worker 12, sleep for 56.250 sec to decorrelate experience collection [2024-06-28 12:38:36,915][09431] Worker 9, sleep for 42.188 sec to decorrelate experience collection [2024-06-28 12:38:36,916][09445] Worker 21, sleep for 98.438 sec to decorrelate experience collection [2024-06-28 12:38:36,929][09437] Worker 13, sleep for 60.938 sec to decorrelate experience collection [2024-06-28 12:38:36,937][09453] Worker 28, sleep for 131.250 sec to decorrelate experience collection [2024-06-28 12:38:36,946][09436] Worker 10, sleep for 46.875 sec to decorrelate experience collection [2024-06-28 12:38:36,947][09427] Worker 3, sleep for 14.062 sec to decorrelate experience collection [2024-06-28 12:38:36,955][09443] Worker 19, sleep for 89.062 sec to decorrelate experience collection [2024-06-28 12:38:36,966][09452] Worker 27, sleep for 126.562 sec to decorrelate experience collection [2024-06-28 12:38:36,967][09426] Worker 2, sleep for 9.375 sec to decorrelate experience collection [2024-06-28 12:38:36,983][09430] Worker 7, sleep for 32.812 sec to decorrelate experience collection [2024-06-28 12:38:37,002][09403] Signal inference workers to stop experience collection... [2024-06-28 12:38:37,038][09423] InferenceWorker_p0-w0: stopping experience collection [2024-06-28 12:38:37,566][09403] Signal inference workers to resume experience collection... [2024-06-28 12:38:37,566][09423] InferenceWorker_p0-w0: resuming experience collection [2024-06-28 12:38:37,631][09429] Worker 5, sleep for 23.438 sec to decorrelate experience collection [2024-06-28 12:38:37,635][09428] Worker 4, sleep for 18.750 sec to decorrelate experience collection [2024-06-28 12:38:37,921][09190] Fps is (10 sec: 4915.2, 60 sec: 4915.2, 300 sec: 4915.2). Total num frames: 3721297920. Throughput: 0: 32798.3. Samples: 327980. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2024-06-28 12:38:37,944][09432] Worker 6, sleep for 28.125 sec to decorrelate experience collection [2024-06-28 12:38:38,743][09423] Updated weights for policy 0, policy_version 227137 (0.0015) [2024-06-28 12:38:40,899][09190] Heartbeat connected on Batcher_0 [2024-06-28 12:38:40,901][09190] Heartbeat connected on LearnerWorker_p0 [2024-06-28 12:38:40,906][09190] Heartbeat connected on RolloutWorker_w0 [2024-06-28 12:38:40,955][09190] Heartbeat connected on InferenceWorker_p0-w0 [2024-06-28 12:38:41,612][09425] Worker 1 awakens! [2024-06-28 12:38:41,619][09190] Heartbeat connected on RolloutWorker_w1 [2024-06-28 12:38:42,921][09190] Fps is (10 sec: 16383.7, 60 sec: 10922.7, 300 sec: 10922.7). Total num frames: 3721412608. Throughput: 0: 22053.3. Samples: 330800. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2024-06-28 12:38:46,389][09426] Worker 2 awakens! [2024-06-28 12:38:46,394][09190] Heartbeat connected on RolloutWorker_w2 [2024-06-28 12:38:47,921][09190] Fps is (10 sec: 13107.2, 60 sec: 9011.2, 300 sec: 9011.2). Total num frames: 3721428992. Throughput: 0: 17223.1. Samples: 344460. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2024-06-28 12:38:51,080][09427] Worker 3 awakens! [2024-06-28 12:38:51,085][09190] Heartbeat connected on RolloutWorker_w3 [2024-06-28 12:38:52,921][09190] Fps is (10 sec: 3276.8, 60 sec: 7864.3, 300 sec: 7864.3). Total num frames: 3721445376. Throughput: 0: 14732.0. Samples: 368300. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2024-06-28 12:38:56,386][09428] Worker 4 awakens! [2024-06-28 12:38:56,392][09190] Heartbeat connected on RolloutWorker_w4 [2024-06-28 12:38:57,921][09190] Fps is (10 sec: 6553.7, 60 sec: 8192.0, 300 sec: 8192.0). Total num frames: 3721494528. Throughput: 0: 12772.7. Samples: 383180. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2024-06-28 12:39:01,168][09429] Worker 5 awakens! [2024-06-28 12:39:01,173][09190] Heartbeat connected on RolloutWorker_w5 [2024-06-28 12:39:02,921][09190] Fps is (10 sec: 11469.0, 60 sec: 8894.2, 300 sec: 8894.2). Total num frames: 3721560064. Throughput: 0: 13234.9. Samples: 463220. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2024-06-28 12:39:03,421][09423] Updated weights for policy 0, policy_version 227147 (0.0015) [2024-06-28 12:39:06,168][09432] Worker 6 awakens! [2024-06-28 12:39:06,173][09190] Heartbeat connected on RolloutWorker_w6 [2024-06-28 12:39:07,921][09190] Fps is (10 sec: 14745.6, 60 sec: 9830.4, 300 sec: 9830.4). Total num frames: 3721641984. Throughput: 0: 13941.6. Samples: 557660. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2024-06-28 12:39:07,922][09190] Avg episode reward: [(0, '0.113')] [2024-06-28 12:39:09,836][09430] Worker 7 awakens! [2024-06-28 12:39:09,843][09190] Heartbeat connected on RolloutWorker_w7 [2024-06-28 12:39:12,470][09423] Updated weights for policy 0, policy_version 227157 (0.0013) [2024-06-28 12:39:12,921][09190] Fps is (10 sec: 18022.2, 60 sec: 10922.7, 300 sec: 10922.7). Total num frames: 3721740288. Throughput: 0: 13699.1. Samples: 616460. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2024-06-28 12:39:12,922][09190] Avg episode reward: [(0, '0.035')] [2024-06-28 12:39:14,488][09433] Worker 8 awakens! [2024-06-28 12:39:14,493][09190] Heartbeat connected on RolloutWorker_w8 [2024-06-28 12:39:17,921][09190] Fps is (10 sec: 19660.8, 60 sec: 11796.5, 300 sec: 11796.5). Total num frames: 3721838592. Throughput: 0: 16183.1. Samples: 746900. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2024-06-28 12:39:17,928][09190] Avg episode reward: [(0, '0.074')] [2024-06-28 12:39:19,200][09431] Worker 9 awakens! [2024-06-28 12:39:19,206][09190] Heartbeat connected on RolloutWorker_w9 [2024-06-28 12:39:19,971][09423] Updated weights for policy 0, policy_version 227167 (0.0012) [2024-06-28 12:39:22,921][09190] Fps is (10 sec: 24575.9, 60 sec: 13405.1, 300 sec: 13405.1). Total num frames: 3721986048. Throughput: 0: 12472.9. Samples: 889260. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2024-06-28 12:39:22,922][09190] Avg episode reward: [(0, '0.074')] [2024-06-28 12:39:23,920][09436] Worker 10 awakens! [2024-06-28 12:39:23,927][09190] Heartbeat connected on RolloutWorker_w10 [2024-06-28 12:39:26,018][09423] Updated weights for policy 0, policy_version 227177 (0.0019) [2024-06-28 12:39:27,921][09190] Fps is (10 sec: 27852.5, 60 sec: 14472.6, 300 sec: 14472.6). Total num frames: 3722117120. Throughput: 0: 14468.0. Samples: 981860. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2024-06-28 12:39:27,922][09190] Avg episode reward: [(0, '0.074')] [2024-06-28 12:39:28,572][09434] Worker 11 awakens! [2024-06-28 12:39:28,580][09190] Heartbeat connected on RolloutWorker_w11 [2024-06-28 12:39:30,515][09423] Updated weights for policy 0, policy_version 227187 (0.0017) [2024-06-28 12:39:32,921][09190] Fps is (10 sec: 29491.2, 60 sec: 17203.2, 300 sec: 15879.9). Total num frames: 3722280960. Throughput: 0: 18396.0. Samples: 1172280. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2024-06-28 12:39:32,929][09190] Avg episode reward: [(0, '0.074')] [2024-06-28 12:39:33,264][09435] Worker 12 awakens! [2024-06-28 12:39:33,270][09190] Heartbeat connected on RolloutWorker_w12 [2024-06-28 12:39:36,154][09423] Updated weights for policy 0, policy_version 227197 (0.0018) [2024-06-28 12:39:37,921][09190] Fps is (10 sec: 31129.7, 60 sec: 18841.6, 300 sec: 16852.1). Total num frames: 3722428416. Throughput: 0: 22151.2. Samples: 1365100. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2024-06-28 12:39:37,922][09190] Avg episode reward: [(0, '0.093')] [2024-06-28 12:39:37,968][09437] Worker 13 awakens! [2024-06-28 12:39:37,974][09190] Heartbeat connected on RolloutWorker_w13 [2024-06-28 12:39:41,132][09423] Updated weights for policy 0, policy_version 227207 (0.0020) [2024-06-28 12:39:42,633][09439] Worker 14 awakens! [2024-06-28 12:39:42,639][09190] Heartbeat connected on RolloutWorker_w14 [2024-06-28 12:39:42,921][09190] Fps is (10 sec: 34406.3, 60 sec: 20206.9, 300 sec: 18350.1). Total num frames: 3722625024. Throughput: 0: 24039.0. Samples: 1464940. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2024-06-28 12:39:42,922][09190] Avg episode reward: [(0, '0.094')] [2024-06-28 12:39:45,982][09423] Updated weights for policy 0, policy_version 227217 (0.0023) [2024-06-28 12:39:47,316][09438] Worker 15 awakens! [2024-06-28 12:39:47,322][09190] Heartbeat connected on RolloutWorker_w15 [2024-06-28 12:39:47,922][09190] Fps is (10 sec: 37682.8, 60 sec: 22937.6, 300 sec: 19456.0). Total num frames: 3722805248. Throughput: 0: 26888.3. Samples: 1673200. Policy #0 lag: (min: 0.0, avg: 3.6, max: 10.0) [2024-06-28 12:39:47,922][09190] Avg episode reward: [(0, '0.113')] [2024-06-28 12:39:50,636][09423] Updated weights for policy 0, policy_version 227227 (0.0034) [2024-06-28 12:39:51,952][09440] Worker 16 awakens! [2024-06-28 12:39:51,964][09190] Heartbeat connected on RolloutWorker_w16 [2024-06-28 12:39:52,921][09190] Fps is (10 sec: 34406.3, 60 sec: 25395.2, 300 sec: 20239.1). Total num frames: 3722969088. Throughput: 0: 29071.0. Samples: 1865860. Policy #0 lag: (min: 0.0, avg: 3.6, max: 10.0) [2024-06-28 12:39:52,922][09190] Avg episode reward: [(0, '0.095')] [2024-06-28 12:39:55,795][09423] Updated weights for policy 0, policy_version 227237 (0.0030) [2024-06-28 12:39:56,652][09441] Worker 17 awakens! [2024-06-28 12:39:56,661][09190] Heartbeat connected on RolloutWorker_w17 [2024-06-28 12:39:57,921][09190] Fps is (10 sec: 32768.2, 60 sec: 27306.6, 300 sec: 20935.1). Total num frames: 3723132928. Throughput: 0: 30098.6. Samples: 1970900. Policy #0 lag: (min: 0.0, avg: 3.6, max: 10.0) [2024-06-28 12:39:57,922][09190] Avg episode reward: [(0, '0.096')] [2024-06-28 12:40:00,768][09423] Updated weights for policy 0, policy_version 227247 (0.0027) [2024-06-28 12:40:01,376][09442] Worker 18 awakens! [2024-06-28 12:40:01,387][09190] Heartbeat connected on RolloutWorker_w18 [2024-06-28 12:40:02,921][09190] Fps is (10 sec: 34406.5, 60 sec: 29218.1, 300 sec: 21730.4). Total num frames: 3723313152. Throughput: 0: 31817.7. Samples: 2178700. Policy #0 lag: (min: 0.0, avg: 3.6, max: 10.0) [2024-06-28 12:40:02,922][09190] Avg episode reward: [(0, '0.076')] [2024-06-28 12:40:05,083][09423] Updated weights for policy 0, policy_version 227257 (0.0029) [2024-06-28 12:40:06,118][09443] Worker 19 awakens! [2024-06-28 12:40:06,127][09190] Heartbeat connected on RolloutWorker_w19 [2024-06-28 12:40:07,921][09190] Fps is (10 sec: 36044.9, 60 sec: 30856.5, 300 sec: 22446.1). Total num frames: 3723493376. Throughput: 0: 33369.8. Samples: 2390900. Policy #0 lag: (min: 0.0, avg: 3.6, max: 10.0) [2024-06-28 12:40:07,922][09190] Avg episode reward: [(0, '0.096')] [2024-06-28 12:40:08,926][09423] Updated weights for policy 0, policy_version 227267 (0.0026) [2024-06-28 12:40:10,636][09444] Worker 20 awakens! [2024-06-28 12:40:10,647][09190] Heartbeat connected on RolloutWorker_w20 [2024-06-28 12:40:12,921][09190] Fps is (10 sec: 34406.7, 60 sec: 31948.8, 300 sec: 22937.6). Total num frames: 3723657216. Throughput: 0: 33920.5. Samples: 2508280. Policy #0 lag: (min: 0.0, avg: 3.6, max: 10.0) [2024-06-28 12:40:12,922][09190] Avg episode reward: [(0, '0.115')] [2024-06-28 12:40:13,778][09423] Updated weights for policy 0, policy_version 227277 (0.0029) [2024-06-28 12:40:15,452][09445] Worker 21 awakens! [2024-06-28 12:40:15,462][09190] Heartbeat connected on RolloutWorker_w21 [2024-06-28 12:40:17,922][09190] Fps is (10 sec: 34405.9, 60 sec: 33314.0, 300 sec: 23533.4). Total num frames: 3723837440. Throughput: 0: 34505.7. Samples: 2725040. Policy #0 lag: (min: 0.0, avg: 3.6, max: 10.0) [2024-06-28 12:40:17,922][09190] Avg episode reward: [(0, '0.114')] [2024-06-28 12:40:17,934][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000227285_3723837440.pth... [2024-06-28 12:40:17,987][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000226803_3715940352.pth [2024-06-28 12:40:18,479][09423] Updated weights for policy 0, policy_version 227287 (0.0024) [2024-06-28 12:40:20,072][09446] Worker 22 awakens! [2024-06-28 12:40:20,084][09190] Heartbeat connected on RolloutWorker_w22 [2024-06-28 12:40:22,729][09423] Updated weights for policy 0, policy_version 227297 (0.0034) [2024-06-28 12:40:22,922][09190] Fps is (10 sec: 37682.6, 60 sec: 34133.3, 300 sec: 24219.8). Total num frames: 3724034048. Throughput: 0: 35180.3. Samples: 2948220. Policy #0 lag: (min: 0.0, avg: 3.6, max: 10.0) [2024-06-28 12:40:22,922][09190] Avg episode reward: [(0, '0.097')] [2024-06-28 12:40:24,801][09448] Worker 23 awakens! [2024-06-28 12:40:24,814][09190] Heartbeat connected on RolloutWorker_w23 [2024-06-28 12:40:26,613][09423] Updated weights for policy 0, policy_version 227307 (0.0030) [2024-06-28 12:40:27,922][09190] Fps is (10 sec: 39321.8, 60 sec: 35225.6, 300 sec: 24849.1). Total num frames: 3724230656. Throughput: 0: 35430.6. Samples: 3059320. Policy #0 lag: (min: 0.0, avg: 3.6, max: 10.0) [2024-06-28 12:40:27,922][09190] Avg episode reward: [(0, '0.096')] [2024-06-28 12:40:29,476][09447] Worker 24 awakens! [2024-06-28 12:40:29,488][09190] Heartbeat connected on RolloutWorker_w24 [2024-06-28 12:40:31,460][09423] Updated weights for policy 0, policy_version 227317 (0.0031) [2024-06-28 12:40:32,921][09190] Fps is (10 sec: 39322.1, 60 sec: 35771.8, 300 sec: 25428.0). Total num frames: 3724427264. Throughput: 0: 35999.2. Samples: 3293160. Policy #0 lag: (min: 0.0, avg: 3.6, max: 10.0) [2024-06-28 12:40:32,922][09190] Avg episode reward: [(0, '0.074')] [2024-06-28 12:40:34,152][09450] Worker 25 awakens! [2024-06-28 12:40:34,164][09190] Heartbeat connected on RolloutWorker_w25 [2024-06-28 12:40:35,368][09423] Updated weights for policy 0, policy_version 227327 (0.0026) [2024-06-28 12:40:37,921][09190] Fps is (10 sec: 37683.3, 60 sec: 36317.8, 300 sec: 25836.3). Total num frames: 3724607488. Throughput: 0: 36890.2. Samples: 3525920. Policy #0 lag: (min: 0.0, avg: 3.6, max: 10.0) [2024-06-28 12:40:37,922][09190] Avg episode reward: [(0, '0.071')] [2024-06-28 12:40:38,812][09449] Worker 26 awakens! [2024-06-28 12:40:38,824][09190] Heartbeat connected on RolloutWorker_w26 [2024-06-28 12:40:39,531][09423] Updated weights for policy 0, policy_version 227337 (0.0031) [2024-06-28 12:40:42,921][09190] Fps is (10 sec: 36044.8, 60 sec: 36044.8, 300 sec: 26214.4). Total num frames: 3724787712. Throughput: 0: 37069.8. Samples: 3639040. Policy #0 lag: (min: 0.0, avg: 3.6, max: 10.0) [2024-06-28 12:40:42,922][09190] Avg episode reward: [(0, '0.114')] [2024-06-28 12:40:43,629][09452] Worker 27 awakens! [2024-06-28 12:40:43,643][09190] Heartbeat connected on RolloutWorker_w27 [2024-06-28 12:40:44,257][09423] Updated weights for policy 0, policy_version 227347 (0.0035) [2024-06-28 12:40:47,524][09423] Updated weights for policy 0, policy_version 227357 (0.0038) [2024-06-28 12:40:47,921][09190] Fps is (10 sec: 42598.9, 60 sec: 37137.2, 300 sec: 27033.6). Total num frames: 3725033472. Throughput: 0: 37728.5. Samples: 3876480. Policy #0 lag: (min: 0.0, avg: 3.6, max: 10.0) [2024-06-28 12:40:47,922][09190] Avg episode reward: [(0, '0.092')] [2024-06-28 12:40:48,288][09453] Worker 28 awakens! [2024-06-28 12:40:48,302][09190] Heartbeat connected on RolloutWorker_w28 [2024-06-28 12:40:52,195][09423] Updated weights for policy 0, policy_version 227367 (0.0021) [2024-06-28 12:40:52,846][09451] Worker 29 awakens! [2024-06-28 12:40:52,859][09190] Heartbeat connected on RolloutWorker_w29 [2024-06-28 12:40:52,922][09190] Fps is (10 sec: 40959.5, 60 sec: 37137.0, 300 sec: 27231.3). Total num frames: 3725197312. Throughput: 0: 38574.6. Samples: 4126760. Policy #0 lag: (min: 0.0, avg: 3.6, max: 10.0) [2024-06-28 12:40:52,922][09190] Avg episode reward: [(0, '0.116')] [2024-06-28 12:40:55,047][09423] Updated weights for policy 0, policy_version 227377 (0.0036) [2024-06-28 12:40:57,610][09455] Worker 30 awakens! [2024-06-28 12:40:57,624][09190] Heartbeat connected on RolloutWorker_w30 [2024-06-28 12:40:57,921][09190] Fps is (10 sec: 36044.8, 60 sec: 37683.2, 300 sec: 27634.4). Total num frames: 3725393920. Throughput: 0: 38590.7. Samples: 4244860. Policy #0 lag: (min: 0.0, avg: 11.1, max: 19.0) [2024-06-28 12:40:57,922][09190] Avg episode reward: [(0, '0.100')] [2024-06-28 12:41:00,071][09423] Updated weights for policy 0, policy_version 227387 (0.0032) [2024-06-28 12:41:02,292][09454] Worker 31 awakens! [2024-06-28 12:41:02,316][09190] Heartbeat connected on RolloutWorker_w31 [2024-06-28 12:41:02,925][09190] Fps is (10 sec: 45861.0, 60 sec: 39046.5, 300 sec: 28433.6). Total num frames: 3725656064. Throughput: 0: 39387.1. Samples: 4497580. Policy #0 lag: (min: 0.0, avg: 11.1, max: 19.0) [2024-06-28 12:41:02,925][09190] Avg episode reward: [(0, '0.100')] [2024-06-28 12:41:03,727][09423] Updated weights for policy 0, policy_version 227397 (0.0045) [2024-06-28 12:41:07,438][09423] Updated weights for policy 0, policy_version 227407 (0.0036) [2024-06-28 12:41:07,922][09190] Fps is (10 sec: 45872.0, 60 sec: 39321.2, 300 sec: 28774.3). Total num frames: 3725852672. Throughput: 0: 40337.7. Samples: 4763440. Policy #0 lag: (min: 0.0, avg: 11.1, max: 19.0) [2024-06-28 12:41:07,923][09190] Avg episode reward: [(0, '0.098')] [2024-06-28 12:41:11,189][09423] Updated weights for policy 0, policy_version 227417 (0.0043) [2024-06-28 12:41:12,921][09190] Fps is (10 sec: 39334.4, 60 sec: 39867.7, 300 sec: 29094.0). Total num frames: 3726049280. Throughput: 0: 40741.0. Samples: 4892660. Policy #0 lag: (min: 0.0, avg: 11.1, max: 19.0) [2024-06-28 12:41:12,928][09190] Avg episode reward: [(0, '0.095')] [2024-06-28 12:41:15,109][09423] Updated weights for policy 0, policy_version 227427 (0.0042) [2024-06-28 12:41:17,921][09190] Fps is (10 sec: 42601.1, 60 sec: 40687.0, 300 sec: 29587.6). Total num frames: 3726278656. Throughput: 0: 41163.1. Samples: 5145500. Policy #0 lag: (min: 0.0, avg: 11.1, max: 19.0) [2024-06-28 12:41:17,922][09190] Avg episode reward: [(0, '0.115')] [2024-06-28 12:41:18,777][09423] Updated weights for policy 0, policy_version 227437 (0.0052) [2024-06-28 12:41:22,905][09403] Signal inference workers to stop experience collection... (50 times) [2024-06-28 12:41:22,910][09403] Signal inference workers to resume experience collection... (50 times) [2024-06-28 12:41:22,921][09190] Fps is (10 sec: 42598.1, 60 sec: 40687.0, 300 sec: 29865.7). Total num frames: 3726475264. Throughput: 0: 41641.8. Samples: 5399800. Policy #0 lag: (min: 0.0, avg: 11.1, max: 19.0) [2024-06-28 12:41:22,922][09190] Avg episode reward: [(0, '0.082')] [2024-06-28 12:41:22,947][09423] InferenceWorker_p0-w0: stopping experience collection (50 times) [2024-06-28 12:41:22,947][09423] InferenceWorker_p0-w0: resuming experience collection (50 times) [2024-06-28 12:41:23,049][09423] Updated weights for policy 0, policy_version 227447 (0.0032) [2024-06-28 12:41:26,628][09423] Updated weights for policy 0, policy_version 227457 (0.0028) [2024-06-28 12:41:27,921][09190] Fps is (10 sec: 40960.3, 60 sec: 40960.1, 300 sec: 30219.4). Total num frames: 3726688256. Throughput: 0: 41865.8. Samples: 5523000. Policy #0 lag: (min: 0.0, avg: 11.1, max: 19.0) [2024-06-28 12:41:27,922][09190] Avg episode reward: [(0, '0.122')] [2024-06-28 12:41:30,723][09423] Updated weights for policy 0, policy_version 227467 (0.0036) [2024-06-28 12:41:32,921][09190] Fps is (10 sec: 44237.1, 60 sec: 41506.1, 300 sec: 30642.5). Total num frames: 3726917632. Throughput: 0: 42173.3. Samples: 5774280. Policy #0 lag: (min: 0.0, avg: 11.1, max: 19.0) [2024-06-28 12:41:32,922][09190] Avg episode reward: [(0, '0.120')] [2024-06-28 12:41:34,225][09423] Updated weights for policy 0, policy_version 227477 (0.0032) [2024-06-28 12:41:37,921][09190] Fps is (10 sec: 39321.2, 60 sec: 41233.1, 300 sec: 30698.5). Total num frames: 3727081472. Throughput: 0: 42514.3. Samples: 6039900. Policy #0 lag: (min: 0.0, avg: 11.1, max: 19.0) [2024-06-28 12:41:37,922][09190] Avg episode reward: [(0, '0.121')] [2024-06-28 12:41:38,573][09423] Updated weights for policy 0, policy_version 227487 (0.0039) [2024-06-28 12:41:41,739][09423] Updated weights for policy 0, policy_version 227497 (0.0026) [2024-06-28 12:41:42,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42325.4, 300 sec: 31171.6). Total num frames: 3727327232. Throughput: 0: 42531.1. Samples: 6158760. Policy #0 lag: (min: 0.0, avg: 11.1, max: 19.0) [2024-06-28 12:41:42,922][09190] Avg episode reward: [(0, '0.105')] [2024-06-28 12:41:46,451][09423] Updated weights for policy 0, policy_version 227507 (0.0031) [2024-06-28 12:41:47,921][09190] Fps is (10 sec: 47514.0, 60 sec: 42052.3, 300 sec: 31539.2). Total num frames: 3727556608. Throughput: 0: 42489.3. Samples: 6409460. Policy #0 lag: (min: 0.0, avg: 11.1, max: 19.0) [2024-06-28 12:41:47,922][09190] Avg episode reward: [(0, '0.126')] [2024-06-28 12:41:49,831][09423] Updated weights for policy 0, policy_version 227517 (0.0031) [2024-06-28 12:41:52,921][09190] Fps is (10 sec: 40959.8, 60 sec: 42325.4, 300 sec: 31649.1). Total num frames: 3727736832. Throughput: 0: 42340.2. Samples: 6668720. Policy #0 lag: (min: 0.0, avg: 11.1, max: 19.0) [2024-06-28 12:41:52,922][09190] Avg episode reward: [(0, '0.126')] [2024-06-28 12:41:54,119][09423] Updated weights for policy 0, policy_version 227527 (0.0035) [2024-06-28 12:41:57,778][09423] Updated weights for policy 0, policy_version 227537 (0.0036) [2024-06-28 12:41:57,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42871.5, 300 sec: 31987.8). Total num frames: 3727966208. Throughput: 0: 42190.7. Samples: 6791240. Policy #0 lag: (min: 0.0, avg: 11.1, max: 19.0) [2024-06-28 12:41:57,922][09190] Avg episode reward: [(0, '0.131')] [2024-06-28 12:42:01,729][09423] Updated weights for policy 0, policy_version 227547 (0.0036) [2024-06-28 12:42:02,921][09190] Fps is (10 sec: 45874.7, 60 sec: 42327.5, 300 sec: 32310.8). Total num frames: 3728195584. Throughput: 0: 42383.5. Samples: 7052760. Policy #0 lag: (min: 0.0, avg: 11.1, max: 19.0) [2024-06-28 12:42:02,922][09190] Avg episode reward: [(0, '0.136')] [2024-06-28 12:42:05,343][09423] Updated weights for policy 0, policy_version 227557 (0.0037) [2024-06-28 12:42:07,921][09190] Fps is (10 sec: 39321.9, 60 sec: 41779.7, 300 sec: 32321.2). Total num frames: 3728359424. Throughput: 0: 42420.6. Samples: 7308720. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 12:42:07,922][09190] Avg episode reward: [(0, '0.128')] [2024-06-28 12:42:09,566][09423] Updated weights for policy 0, policy_version 227567 (0.0029) [2024-06-28 12:42:12,891][09423] Updated weights for policy 0, policy_version 227577 (0.0041) [2024-06-28 12:42:12,921][09190] Fps is (10 sec: 42599.0, 60 sec: 42871.5, 300 sec: 32768.0). Total num frames: 3728621568. Throughput: 0: 42167.1. Samples: 7420520. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 12:42:12,922][09190] Avg episode reward: [(0, '0.130')] [2024-06-28 12:42:17,398][09423] Updated weights for policy 0, policy_version 227587 (0.0036) [2024-06-28 12:42:17,921][09190] Fps is (10 sec: 45875.0, 60 sec: 42325.4, 300 sec: 32910.5). Total num frames: 3728818176. Throughput: 0: 42527.2. Samples: 7688000. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 12:42:17,922][09190] Avg episode reward: [(0, '0.141')] [2024-06-28 12:42:17,965][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000227590_3728834560.pth... [2024-06-28 12:42:18,038][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000227127_3721248768.pth [2024-06-28 12:42:20,306][09423] Updated weights for policy 0, policy_version 227597 (0.0030) [2024-06-28 12:42:22,921][09190] Fps is (10 sec: 37683.0, 60 sec: 42052.3, 300 sec: 32977.2). Total num frames: 3728998400. Throughput: 0: 42204.5. Samples: 7939100. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 12:42:22,922][09190] Avg episode reward: [(0, '0.140')] [2024-06-28 12:42:25,051][09423] Updated weights for policy 0, policy_version 227607 (0.0034) [2024-06-28 12:42:27,922][09190] Fps is (10 sec: 42597.8, 60 sec: 42598.3, 300 sec: 33314.1). Total num frames: 3729244160. Throughput: 0: 42170.5. Samples: 8056440. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 12:42:27,922][09190] Avg episode reward: [(0, '0.143')] [2024-06-28 12:42:28,188][09403] Signal inference workers to stop experience collection... (100 times) [2024-06-28 12:42:28,189][09403] Signal inference workers to resume experience collection... (100 times) [2024-06-28 12:42:28,232][09423] InferenceWorker_p0-w0: stopping experience collection (100 times) [2024-06-28 12:42:28,232][09423] InferenceWorker_p0-w0: resuming experience collection (100 times) [2024-06-28 12:42:28,326][09423] Updated weights for policy 0, policy_version 227617 (0.0051) [2024-06-28 12:42:32,872][09423] Updated weights for policy 0, policy_version 227627 (0.0040) [2024-06-28 12:42:32,921][09190] Fps is (10 sec: 44236.7, 60 sec: 42052.2, 300 sec: 33436.7). Total num frames: 3729440768. Throughput: 0: 42440.8. Samples: 8319300. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 12:42:32,922][09190] Avg episode reward: [(0, '0.146')] [2024-06-28 12:42:36,279][09423] Updated weights for policy 0, policy_version 227637 (0.0032) [2024-06-28 12:42:37,921][09190] Fps is (10 sec: 37683.8, 60 sec: 42325.4, 300 sec: 33488.9). Total num frames: 3729620992. Throughput: 0: 42416.9. Samples: 8577480. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 12:42:37,922][09190] Avg episode reward: [(0, '0.144')] [2024-06-28 12:42:40,445][09423] Updated weights for policy 0, policy_version 227647 (0.0024) [2024-06-28 12:42:42,921][09190] Fps is (10 sec: 45875.2, 60 sec: 42871.4, 300 sec: 33924.5). Total num frames: 3729899520. Throughput: 0: 42396.4. Samples: 8699080. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 12:42:42,922][09190] Avg episode reward: [(0, '0.150')] [2024-06-28 12:42:43,730][09423] Updated weights for policy 0, policy_version 227657 (0.0043) [2024-06-28 12:42:47,921][09190] Fps is (10 sec: 44236.4, 60 sec: 41779.1, 300 sec: 33902.3). Total num frames: 3730063360. Throughput: 0: 42265.8. Samples: 8954720. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 12:42:47,925][09190] Avg episode reward: [(0, '0.154')] [2024-06-28 12:42:48,272][09423] Updated weights for policy 0, policy_version 227667 (0.0030) [2024-06-28 12:42:51,481][09423] Updated weights for policy 0, policy_version 227677 (0.0032) [2024-06-28 12:42:52,921][09190] Fps is (10 sec: 37683.8, 60 sec: 42325.4, 300 sec: 34066.4). Total num frames: 3730276352. Throughput: 0: 42013.8. Samples: 9199340. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 12:42:52,922][09190] Avg episode reward: [(0, '0.155')] [2024-06-28 12:42:56,072][09423] Updated weights for policy 0, policy_version 227687 (0.0039) [2024-06-28 12:42:57,921][09190] Fps is (10 sec: 45875.4, 60 sec: 42598.4, 300 sec: 34345.7). Total num frames: 3730522112. Throughput: 0: 42429.7. Samples: 9329860. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 12:42:57,922][09190] Avg episode reward: [(0, '0.157')] [2024-06-28 12:42:59,345][09423] Updated weights for policy 0, policy_version 227697 (0.0034) [2024-06-28 12:43:02,921][09190] Fps is (10 sec: 39320.9, 60 sec: 41233.1, 300 sec: 34257.5). Total num frames: 3730669568. Throughput: 0: 42357.7. Samples: 9594100. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 12:43:02,922][09190] Avg episode reward: [(0, '0.159')] [2024-06-28 12:43:03,790][09423] Updated weights for policy 0, policy_version 227707 (0.0037) [2024-06-28 12:43:07,385][09423] Updated weights for policy 0, policy_version 227717 (0.0033) [2024-06-28 12:43:07,921][09190] Fps is (10 sec: 39321.5, 60 sec: 42598.3, 300 sec: 34523.4). Total num frames: 3730915328. Throughput: 0: 42110.6. Samples: 9834080. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 12:43:07,922][09190] Avg episode reward: [(0, '0.154')] [2024-06-28 12:43:11,373][09423] Updated weights for policy 0, policy_version 227727 (0.0032) [2024-06-28 12:43:12,924][09190] Fps is (10 sec: 49140.3, 60 sec: 42323.6, 300 sec: 34779.8). Total num frames: 3731161088. Throughput: 0: 42498.6. Samples: 9968980. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 12:43:12,924][09190] Avg episode reward: [(0, '0.163')] [2024-06-28 12:43:14,811][09423] Updated weights for policy 0, policy_version 227737 (0.0038) [2024-06-28 12:43:17,921][09190] Fps is (10 sec: 39321.9, 60 sec: 41506.1, 300 sec: 34688.9). Total num frames: 3731308544. Throughput: 0: 42180.5. Samples: 10217420. Policy #0 lag: (min: 0.0, avg: 14.0, max: 25.0) [2024-06-28 12:43:17,922][09190] Avg episode reward: [(0, '0.160')] [2024-06-28 12:43:19,734][09423] Updated weights for policy 0, policy_version 227747 (0.0030) [2024-06-28 12:43:22,754][09423] Updated weights for policy 0, policy_version 227757 (0.0035) [2024-06-28 12:43:22,921][09190] Fps is (10 sec: 40970.3, 60 sec: 42871.5, 300 sec: 34989.6). Total num frames: 3731570688. Throughput: 0: 41868.4. Samples: 10461560. Policy #0 lag: (min: 0.0, avg: 14.0, max: 25.0) [2024-06-28 12:43:22,922][09190] Avg episode reward: [(0, '0.162')] [2024-06-28 12:43:27,349][09423] Updated weights for policy 0, policy_version 227767 (0.0037) [2024-06-28 12:43:27,921][09190] Fps is (10 sec: 45874.7, 60 sec: 42052.3, 300 sec: 35656.0). Total num frames: 3731767296. Throughput: 0: 42220.4. Samples: 10599000. Policy #0 lag: (min: 0.0, avg: 14.0, max: 25.0) [2024-06-28 12:43:27,922][09190] Avg episode reward: [(0, '0.163')] [2024-06-28 12:43:30,270][09423] Updated weights for policy 0, policy_version 227777 (0.0034) [2024-06-28 12:43:32,921][09190] Fps is (10 sec: 36044.8, 60 sec: 41506.2, 300 sec: 36044.8). Total num frames: 3731931136. Throughput: 0: 42075.6. Samples: 10848120. Policy #0 lag: (min: 0.0, avg: 14.0, max: 25.0) [2024-06-28 12:43:32,922][09190] Avg episode reward: [(0, '0.163')] [2024-06-28 12:43:34,840][09403] Signal inference workers to stop experience collection... (150 times) [2024-06-28 12:43:34,873][09423] InferenceWorker_p0-w0: stopping experience collection (150 times) [2024-06-28 12:43:34,886][09403] Signal inference workers to resume experience collection... (150 times) [2024-06-28 12:43:34,896][09423] InferenceWorker_p0-w0: resuming experience collection (150 times) [2024-06-28 12:43:35,033][09423] Updated weights for policy 0, policy_version 227787 (0.0027) [2024-06-28 12:43:37,921][09190] Fps is (10 sec: 42598.6, 60 sec: 42871.4, 300 sec: 36544.7). Total num frames: 3732193280. Throughput: 0: 42079.0. Samples: 11092900. Policy #0 lag: (min: 0.0, avg: 14.0, max: 25.0) [2024-06-28 12:43:37,922][09190] Avg episode reward: [(0, '0.159')] [2024-06-28 12:43:38,405][09423] Updated weights for policy 0, policy_version 227797 (0.0037) [2024-06-28 12:43:42,848][09423] Updated weights for policy 0, policy_version 227807 (0.0033) [2024-06-28 12:43:42,922][09190] Fps is (10 sec: 45874.3, 60 sec: 41506.0, 300 sec: 37155.6). Total num frames: 3732389888. Throughput: 0: 42170.1. Samples: 11227520. Policy #0 lag: (min: 0.0, avg: 14.0, max: 25.0) [2024-06-28 12:43:42,922][09190] Avg episode reward: [(0, '0.168')] [2024-06-28 12:43:45,828][09423] Updated weights for policy 0, policy_version 227817 (0.0033) [2024-06-28 12:43:47,922][09190] Fps is (10 sec: 40959.5, 60 sec: 42325.3, 300 sec: 37822.0). Total num frames: 3732602880. Throughput: 0: 41800.4. Samples: 11475120. Policy #0 lag: (min: 0.0, avg: 14.0, max: 25.0) [2024-06-28 12:43:47,922][09190] Avg episode reward: [(0, '0.161')] [2024-06-28 12:43:50,669][09423] Updated weights for policy 0, policy_version 227827 (0.0041) [2024-06-28 12:43:52,921][09190] Fps is (10 sec: 44237.7, 60 sec: 42598.4, 300 sec: 38433.0). Total num frames: 3732832256. Throughput: 0: 41861.0. Samples: 11717820. Policy #0 lag: (min: 0.0, avg: 14.0, max: 25.0) [2024-06-28 12:43:52,922][09190] Avg episode reward: [(0, '0.167')] [2024-06-28 12:43:53,871][09423] Updated weights for policy 0, policy_version 227837 (0.0036) [2024-06-28 12:43:57,921][09190] Fps is (10 sec: 39322.0, 60 sec: 41233.0, 300 sec: 38766.2). Total num frames: 3732996096. Throughput: 0: 41973.4. Samples: 11857680. Policy #0 lag: (min: 0.0, avg: 14.0, max: 25.0) [2024-06-28 12:43:57,922][09190] Avg episode reward: [(0, '0.167')] [2024-06-28 12:43:58,535][09423] Updated weights for policy 0, policy_version 227847 (0.0023) [2024-06-28 12:44:01,957][09423] Updated weights for policy 0, policy_version 227857 (0.0031) [2024-06-28 12:44:02,921][09190] Fps is (10 sec: 37683.0, 60 sec: 42325.4, 300 sec: 39210.5). Total num frames: 3733209088. Throughput: 0: 41754.6. Samples: 12096380. Policy #0 lag: (min: 0.0, avg: 14.0, max: 25.0) [2024-06-28 12:44:02,924][09190] Avg episode reward: [(0, '0.168')] [2024-06-28 12:44:06,315][09423] Updated weights for policy 0, policy_version 227867 (0.0041) [2024-06-28 12:44:07,924][09190] Fps is (10 sec: 47501.8, 60 sec: 42596.6, 300 sec: 39765.6). Total num frames: 3733471232. Throughput: 0: 41863.0. Samples: 12345500. Policy #0 lag: (min: 0.0, avg: 14.0, max: 25.0) [2024-06-28 12:44:07,924][09190] Avg episode reward: [(0, '0.168')] [2024-06-28 12:44:09,495][09423] Updated weights for policy 0, policy_version 227877 (0.0038) [2024-06-28 12:44:12,921][09190] Fps is (10 sec: 42598.7, 60 sec: 41234.8, 300 sec: 39988.1). Total num frames: 3733635072. Throughput: 0: 41794.3. Samples: 12479740. Policy #0 lag: (min: 0.0, avg: 14.0, max: 25.0) [2024-06-28 12:44:12,922][09190] Avg episode reward: [(0, '0.168')] [2024-06-28 12:44:14,017][09423] Updated weights for policy 0, policy_version 227887 (0.0041) [2024-06-28 12:44:17,846][09423] Updated weights for policy 0, policy_version 227897 (0.0039) [2024-06-28 12:44:17,921][09190] Fps is (10 sec: 39331.3, 60 sec: 42598.3, 300 sec: 40265.8). Total num frames: 3733864448. Throughput: 0: 41697.7. Samples: 12724520. Policy #0 lag: (min: 0.0, avg: 14.0, max: 25.0) [2024-06-28 12:44:17,922][09190] Avg episode reward: [(0, '0.168')] [2024-06-28 12:44:17,949][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000227897_3733864448.pth... [2024-06-28 12:44:17,997][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000227285_3723837440.pth [2024-06-28 12:44:22,083][09423] Updated weights for policy 0, policy_version 227907 (0.0034) [2024-06-28 12:44:22,921][09190] Fps is (10 sec: 42598.0, 60 sec: 41506.1, 300 sec: 40487.9). Total num frames: 3734061056. Throughput: 0: 41912.4. Samples: 12978960. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 12:44:22,922][09190] Avg episode reward: [(0, '0.169')] [2024-06-28 12:44:25,718][09423] Updated weights for policy 0, policy_version 227917 (0.0045) [2024-06-28 12:44:27,921][09190] Fps is (10 sec: 37683.6, 60 sec: 41233.1, 300 sec: 40543.5). Total num frames: 3734241280. Throughput: 0: 41651.8. Samples: 13101840. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 12:44:27,922][09190] Avg episode reward: [(0, '0.162')] [2024-06-28 12:44:29,630][09423] Updated weights for policy 0, policy_version 227927 (0.0033) [2024-06-28 12:44:32,922][09190] Fps is (10 sec: 42597.9, 60 sec: 42598.3, 300 sec: 40876.7). Total num frames: 3734487040. Throughput: 0: 41581.3. Samples: 13346280. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 12:44:32,922][09190] Avg episode reward: [(0, '0.167')] [2024-06-28 12:44:33,633][09423] Updated weights for policy 0, policy_version 227937 (0.0033) [2024-06-28 12:44:37,610][09423] Updated weights for policy 0, policy_version 227947 (0.0044) [2024-06-28 12:44:37,921][09190] Fps is (10 sec: 44236.3, 60 sec: 41506.1, 300 sec: 40876.7). Total num frames: 3734683648. Throughput: 0: 41970.1. Samples: 13606480. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 12:44:37,922][09190] Avg episode reward: [(0, '0.170')] [2024-06-28 12:44:41,095][09423] Updated weights for policy 0, policy_version 227957 (0.0026) [2024-06-28 12:44:42,921][09190] Fps is (10 sec: 39322.6, 60 sec: 41506.3, 300 sec: 40932.3). Total num frames: 3734880256. Throughput: 0: 41664.6. Samples: 13732580. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 12:44:42,922][09190] Avg episode reward: [(0, '0.171')] [2024-06-28 12:44:45,167][09423] Updated weights for policy 0, policy_version 227967 (0.0035) [2024-06-28 12:44:47,921][09190] Fps is (10 sec: 44237.1, 60 sec: 42052.4, 300 sec: 41209.9). Total num frames: 3735126016. Throughput: 0: 41765.8. Samples: 13975840. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 12:44:47,922][09190] Avg episode reward: [(0, '0.169')] [2024-06-28 12:44:49,320][09423] Updated weights for policy 0, policy_version 227977 (0.0038) [2024-06-28 12:44:52,921][09190] Fps is (10 sec: 42598.2, 60 sec: 41233.1, 300 sec: 41265.5). Total num frames: 3735306240. Throughput: 0: 41909.0. Samples: 14231300. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 12:44:52,922][09190] Avg episode reward: [(0, '0.168')] [2024-06-28 12:44:53,178][09423] Updated weights for policy 0, policy_version 227987 (0.0041) [2024-06-28 12:44:57,317][09423] Updated weights for policy 0, policy_version 227997 (0.0040) [2024-06-28 12:44:57,921][09190] Fps is (10 sec: 39321.7, 60 sec: 42052.3, 300 sec: 41376.6). Total num frames: 3735519232. Throughput: 0: 41675.1. Samples: 14355120. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 12:44:57,922][09190] Avg episode reward: [(0, '0.168')] [2024-06-28 12:45:01,271][09423] Updated weights for policy 0, policy_version 228007 (0.0032) [2024-06-28 12:45:02,924][09190] Fps is (10 sec: 42587.7, 60 sec: 42050.5, 300 sec: 41487.3). Total num frames: 3735732224. Throughput: 0: 41791.1. Samples: 14605220. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 12:45:02,924][09190] Avg episode reward: [(0, '0.172')] [2024-06-28 12:45:04,939][09423] Updated weights for policy 0, policy_version 228017 (0.0045) [2024-06-28 12:45:07,921][09190] Fps is (10 sec: 40959.4, 60 sec: 40961.7, 300 sec: 41598.7). Total num frames: 3735928832. Throughput: 0: 41658.6. Samples: 14853600. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 12:45:07,922][09190] Avg episode reward: [(0, '0.171')] [2024-06-28 12:45:09,418][09423] Updated weights for policy 0, policy_version 228027 (0.0021) [2024-06-28 12:45:12,921][09190] Fps is (10 sec: 40970.2, 60 sec: 41779.2, 300 sec: 41709.8). Total num frames: 3736141824. Throughput: 0: 41599.5. Samples: 14973820. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 12:45:12,922][09190] Avg episode reward: [(0, '0.169')] [2024-06-28 12:45:13,158][09423] Updated weights for policy 0, policy_version 228037 (0.0041) [2024-06-28 12:45:17,587][09423] Updated weights for policy 0, policy_version 228047 (0.0030) [2024-06-28 12:45:17,921][09190] Fps is (10 sec: 42599.2, 60 sec: 41506.2, 300 sec: 41765.3). Total num frames: 3736354816. Throughput: 0: 41790.0. Samples: 15226820. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 12:45:17,922][09190] Avg episode reward: [(0, '0.168')] [2024-06-28 12:45:21,197][09423] Updated weights for policy 0, policy_version 228057 (0.0031) [2024-06-28 12:45:22,310][09403] Signal inference workers to stop experience collection... (200 times) [2024-06-28 12:45:22,344][09423] InferenceWorker_p0-w0: stopping experience collection (200 times) [2024-06-28 12:45:22,373][09403] Signal inference workers to resume experience collection... (200 times) [2024-06-28 12:45:22,380][09423] InferenceWorker_p0-w0: resuming experience collection (200 times) [2024-06-28 12:45:22,921][09190] Fps is (10 sec: 40960.3, 60 sec: 41506.2, 300 sec: 41765.3). Total num frames: 3736551424. Throughput: 0: 41367.2. Samples: 15468000. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 12:45:22,922][09190] Avg episode reward: [(0, '0.171')] [2024-06-28 12:45:25,333][09423] Updated weights for policy 0, policy_version 228067 (0.0040) [2024-06-28 12:45:27,921][09190] Fps is (10 sec: 42597.9, 60 sec: 42325.3, 300 sec: 41876.4). Total num frames: 3736780800. Throughput: 0: 41286.1. Samples: 15590460. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 12:45:27,922][09190] Avg episode reward: [(0, '0.172')] [2024-06-28 12:45:29,463][09423] Updated weights for policy 0, policy_version 228077 (0.0039) [2024-06-28 12:45:32,893][09423] Updated weights for policy 0, policy_version 228087 (0.0042) [2024-06-28 12:45:32,921][09190] Fps is (10 sec: 42598.1, 60 sec: 41506.3, 300 sec: 41931.9). Total num frames: 3736977408. Throughput: 0: 41488.4. Samples: 15842820. Policy #0 lag: (min: 0.0, avg: 9.7, max: 25.0) [2024-06-28 12:45:32,922][09190] Avg episode reward: [(0, '0.172')] [2024-06-28 12:45:36,884][09423] Updated weights for policy 0, policy_version 228097 (0.0036) [2024-06-28 12:45:37,922][09190] Fps is (10 sec: 37682.8, 60 sec: 41233.0, 300 sec: 41931.9). Total num frames: 3737157632. Throughput: 0: 41434.1. Samples: 16095840. Policy #0 lag: (min: 0.0, avg: 9.7, max: 25.0) [2024-06-28 12:45:37,922][09190] Avg episode reward: [(0, '0.170')] [2024-06-28 12:45:41,261][09423] Updated weights for policy 0, policy_version 228107 (0.0044) [2024-06-28 12:45:42,922][09190] Fps is (10 sec: 40959.3, 60 sec: 41779.0, 300 sec: 41876.4). Total num frames: 3737387008. Throughput: 0: 41434.0. Samples: 16219660. Policy #0 lag: (min: 0.0, avg: 9.7, max: 25.0) [2024-06-28 12:45:42,922][09190] Avg episode reward: [(0, '0.171')] [2024-06-28 12:45:44,986][09423] Updated weights for policy 0, policy_version 228117 (0.0032) [2024-06-28 12:45:47,921][09190] Fps is (10 sec: 40960.8, 60 sec: 40687.0, 300 sec: 41932.0). Total num frames: 3737567232. Throughput: 0: 41288.6. Samples: 16463100. Policy #0 lag: (min: 0.0, avg: 9.7, max: 25.0) [2024-06-28 12:45:47,922][09190] Avg episode reward: [(0, '0.171')] [2024-06-28 12:45:49,109][09423] Updated weights for policy 0, policy_version 228127 (0.0045) [2024-06-28 12:45:52,922][09190] Fps is (10 sec: 39321.8, 60 sec: 41233.0, 300 sec: 41987.5). Total num frames: 3737780224. Throughput: 0: 41245.8. Samples: 16709660. Policy #0 lag: (min: 0.0, avg: 9.7, max: 25.0) [2024-06-28 12:45:52,922][09190] Avg episode reward: [(0, '0.171')] [2024-06-28 12:45:53,059][09423] Updated weights for policy 0, policy_version 228137 (0.0027) [2024-06-28 12:45:56,915][09423] Updated weights for policy 0, policy_version 228147 (0.0036) [2024-06-28 12:45:57,921][09190] Fps is (10 sec: 40959.6, 60 sec: 40959.9, 300 sec: 41765.8). Total num frames: 3737976832. Throughput: 0: 41358.2. Samples: 16834940. Policy #0 lag: (min: 0.0, avg: 9.7, max: 25.0) [2024-06-28 12:45:57,922][09190] Avg episode reward: [(0, '0.168')] [2024-06-28 12:46:00,834][09423] Updated weights for policy 0, policy_version 228157 (0.0044) [2024-06-28 12:46:02,921][09190] Fps is (10 sec: 40960.6, 60 sec: 40961.7, 300 sec: 41821.0). Total num frames: 3738189824. Throughput: 0: 41246.6. Samples: 17082920. Policy #0 lag: (min: 0.0, avg: 9.7, max: 25.0) [2024-06-28 12:46:02,922][09190] Avg episode reward: [(0, '0.170')] [2024-06-28 12:46:05,281][09423] Updated weights for policy 0, policy_version 228167 (0.0033) [2024-06-28 12:46:07,921][09190] Fps is (10 sec: 44237.1, 60 sec: 41506.2, 300 sec: 41931.9). Total num frames: 3738419200. Throughput: 0: 41293.7. Samples: 17326220. Policy #0 lag: (min: 0.0, avg: 9.7, max: 25.0) [2024-06-28 12:46:07,922][09190] Avg episode reward: [(0, '0.171')] [2024-06-28 12:46:08,842][09423] Updated weights for policy 0, policy_version 228177 (0.0039) [2024-06-28 12:46:12,733][09423] Updated weights for policy 0, policy_version 228187 (0.0050) [2024-06-28 12:46:12,921][09190] Fps is (10 sec: 42598.3, 60 sec: 41233.1, 300 sec: 41820.9). Total num frames: 3738615808. Throughput: 0: 41347.1. Samples: 17451080. Policy #0 lag: (min: 0.0, avg: 9.7, max: 25.0) [2024-06-28 12:46:12,922][09190] Avg episode reward: [(0, '0.171')] [2024-06-28 12:46:16,862][09423] Updated weights for policy 0, policy_version 228197 (0.0032) [2024-06-28 12:46:17,928][09190] Fps is (10 sec: 40933.3, 60 sec: 41228.5, 300 sec: 41875.5). Total num frames: 3738828800. Throughput: 0: 41266.5. Samples: 17700080. Policy #0 lag: (min: 0.0, avg: 9.7, max: 25.0) [2024-06-28 12:46:17,928][09190] Avg episode reward: [(0, '0.167')] [2024-06-28 12:46:17,942][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000228200_3738828800.pth... [2024-06-28 12:46:18,002][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000227590_3728834560.pth [2024-06-28 12:46:20,536][09423] Updated weights for policy 0, policy_version 228207 (0.0047) [2024-06-28 12:46:22,921][09190] Fps is (10 sec: 44236.9, 60 sec: 41779.2, 300 sec: 41931.9). Total num frames: 3739058176. Throughput: 0: 41119.7. Samples: 17946220. Policy #0 lag: (min: 0.0, avg: 9.7, max: 25.0) [2024-06-28 12:46:22,922][09190] Avg episode reward: [(0, '0.170')] [2024-06-28 12:46:24,572][09423] Updated weights for policy 0, policy_version 228217 (0.0033) [2024-06-28 12:46:27,921][09190] Fps is (10 sec: 40986.9, 60 sec: 40960.1, 300 sec: 41765.3). Total num frames: 3739238400. Throughput: 0: 41198.9. Samples: 18073600. Policy #0 lag: (min: 0.0, avg: 9.7, max: 25.0) [2024-06-28 12:46:27,922][09190] Avg episode reward: [(0, '0.170')] [2024-06-28 12:46:28,586][09423] Updated weights for policy 0, policy_version 228227 (0.0039) [2024-06-28 12:46:32,629][09423] Updated weights for policy 0, policy_version 228237 (0.0039) [2024-06-28 12:46:32,921][09190] Fps is (10 sec: 39321.3, 60 sec: 41233.0, 300 sec: 41931.9). Total num frames: 3739451392. Throughput: 0: 41454.1. Samples: 18328540. Policy #0 lag: (min: 0.0, avg: 9.7, max: 25.0) [2024-06-28 12:46:32,922][09190] Avg episode reward: [(0, '0.169')] [2024-06-28 12:46:36,402][09423] Updated weights for policy 0, policy_version 228247 (0.0033) [2024-06-28 12:46:37,921][09190] Fps is (10 sec: 44236.3, 60 sec: 42052.3, 300 sec: 41876.4). Total num frames: 3739680768. Throughput: 0: 41277.8. Samples: 18567160. Policy #0 lag: (min: 0.0, avg: 9.7, max: 25.0) [2024-06-28 12:46:37,923][09190] Avg episode reward: [(0, '0.170')] [2024-06-28 12:46:40,160][09423] Updated weights for policy 0, policy_version 228257 (0.0040) [2024-06-28 12:46:42,921][09190] Fps is (10 sec: 40960.3, 60 sec: 41233.2, 300 sec: 41709.8). Total num frames: 3739860992. Throughput: 0: 41496.9. Samples: 18702300. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2024-06-28 12:46:42,922][09190] Avg episode reward: [(0, '0.170')] [2024-06-28 12:46:44,499][09423] Updated weights for policy 0, policy_version 228267 (0.0030) [2024-06-28 12:46:47,921][09190] Fps is (10 sec: 37683.3, 60 sec: 41506.1, 300 sec: 41765.3). Total num frames: 3740057600. Throughput: 0: 41247.1. Samples: 18939040. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2024-06-28 12:46:47,922][09190] Avg episode reward: [(0, '0.170')] [2024-06-28 12:46:48,530][09423] Updated weights for policy 0, policy_version 228277 (0.0038) [2024-06-28 12:46:52,173][09423] Updated weights for policy 0, policy_version 228287 (0.0033) [2024-06-28 12:46:52,922][09190] Fps is (10 sec: 40959.4, 60 sec: 41506.1, 300 sec: 41709.8). Total num frames: 3740270592. Throughput: 0: 41432.3. Samples: 19190680. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2024-06-28 12:46:52,928][09190] Avg episode reward: [(0, '0.171')] [2024-06-28 12:46:56,311][09423] Updated weights for policy 0, policy_version 228297 (0.0041) [2024-06-28 12:46:57,921][09190] Fps is (10 sec: 42598.2, 60 sec: 41779.2, 300 sec: 41654.2). Total num frames: 3740483584. Throughput: 0: 41466.2. Samples: 19317060. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2024-06-28 12:46:57,922][09190] Avg episode reward: [(0, '0.169')] [2024-06-28 12:47:00,051][09423] Updated weights for policy 0, policy_version 228307 (0.0030) [2024-06-28 12:47:02,921][09190] Fps is (10 sec: 42598.9, 60 sec: 41779.2, 300 sec: 41820.8). Total num frames: 3740696576. Throughput: 0: 41490.4. Samples: 19566880. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2024-06-28 12:47:02,922][09190] Avg episode reward: [(0, '0.170')] [2024-06-28 12:47:04,230][09423] Updated weights for policy 0, policy_version 228317 (0.0049) [2024-06-28 12:47:07,921][09190] Fps is (10 sec: 40960.5, 60 sec: 41233.1, 300 sec: 41598.7). Total num frames: 3740893184. Throughput: 0: 41485.4. Samples: 19813060. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2024-06-28 12:47:07,922][09190] Avg episode reward: [(0, '0.171')] [2024-06-28 12:47:08,033][09423] Updated weights for policy 0, policy_version 228327 (0.0036) [2024-06-28 12:47:12,486][09423] Updated weights for policy 0, policy_version 228337 (0.0036) [2024-06-28 12:47:12,921][09190] Fps is (10 sec: 39321.8, 60 sec: 41233.1, 300 sec: 41598.7). Total num frames: 3741089792. Throughput: 0: 41228.9. Samples: 19928900. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2024-06-28 12:47:12,922][09190] Avg episode reward: [(0, '0.168')] [2024-06-28 12:47:16,191][09423] Updated weights for policy 0, policy_version 228347 (0.0048) [2024-06-28 12:47:17,922][09190] Fps is (10 sec: 40959.2, 60 sec: 41237.4, 300 sec: 41709.8). Total num frames: 3741302784. Throughput: 0: 41225.3. Samples: 20183680. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2024-06-28 12:47:17,922][09190] Avg episode reward: [(0, '0.170')] [2024-06-28 12:47:19,999][09423] Updated weights for policy 0, policy_version 228357 (0.0037) [2024-06-28 12:47:20,897][09403] Signal inference workers to stop experience collection... (250 times) [2024-06-28 12:47:20,947][09423] InferenceWorker_p0-w0: stopping experience collection (250 times) [2024-06-28 12:47:20,953][09403] Signal inference workers to resume experience collection... (250 times) [2024-06-28 12:47:20,958][09423] InferenceWorker_p0-w0: resuming experience collection (250 times) [2024-06-28 12:47:22,921][09190] Fps is (10 sec: 42598.7, 60 sec: 40960.1, 300 sec: 41598.7). Total num frames: 3741515776. Throughput: 0: 41378.8. Samples: 20429200. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2024-06-28 12:47:22,922][09190] Avg episode reward: [(0, '0.171')] [2024-06-28 12:47:23,833][09423] Updated weights for policy 0, policy_version 228367 (0.0045) [2024-06-28 12:47:27,921][09190] Fps is (10 sec: 39322.1, 60 sec: 40959.9, 300 sec: 41543.2). Total num frames: 3741696000. Throughput: 0: 41114.6. Samples: 20552460. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2024-06-28 12:47:27,922][09190] Avg episode reward: [(0, '0.166')] [2024-06-28 12:47:28,262][09423] Updated weights for policy 0, policy_version 228377 (0.0036) [2024-06-28 12:47:31,851][09423] Updated weights for policy 0, policy_version 228387 (0.0042) [2024-06-28 12:47:32,921][09190] Fps is (10 sec: 39321.1, 60 sec: 40960.0, 300 sec: 41654.2). Total num frames: 3741908992. Throughput: 0: 41226.2. Samples: 20794220. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2024-06-28 12:47:32,922][09190] Avg episode reward: [(0, '0.171')] [2024-06-28 12:47:36,295][09423] Updated weights for policy 0, policy_version 228397 (0.0045) [2024-06-28 12:47:37,921][09190] Fps is (10 sec: 44236.9, 60 sec: 40960.0, 300 sec: 41487.6). Total num frames: 3742138368. Throughput: 0: 41044.6. Samples: 21037680. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2024-06-28 12:47:37,922][09190] Avg episode reward: [(0, '0.170')] [2024-06-28 12:47:39,878][09423] Updated weights for policy 0, policy_version 228407 (0.0036) [2024-06-28 12:47:42,921][09190] Fps is (10 sec: 42598.4, 60 sec: 41233.0, 300 sec: 41598.7). Total num frames: 3742334976. Throughput: 0: 41212.5. Samples: 21171620. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2024-06-28 12:47:42,922][09190] Avg episode reward: [(0, '0.171')] [2024-06-28 12:47:43,966][09423] Updated weights for policy 0, policy_version 228417 (0.0023) [2024-06-28 12:47:47,921][09190] Fps is (10 sec: 39321.6, 60 sec: 41233.1, 300 sec: 41543.1). Total num frames: 3742531584. Throughput: 0: 41114.2. Samples: 21417020. Policy #0 lag: (min: 0.0, avg: 8.3, max: 19.0) [2024-06-28 12:47:47,922][09190] Avg episode reward: [(0, '0.168')] [2024-06-28 12:47:47,956][09423] Updated weights for policy 0, policy_version 228427 (0.0029) [2024-06-28 12:47:51,734][09423] Updated weights for policy 0, policy_version 228437 (0.0042) [2024-06-28 12:47:52,921][09190] Fps is (10 sec: 40960.4, 60 sec: 41233.2, 300 sec: 41432.1). Total num frames: 3742744576. Throughput: 0: 41212.5. Samples: 21667620. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2024-06-28 12:47:52,922][09190] Avg episode reward: [(0, '0.170')] [2024-06-28 12:47:55,649][09423] Updated weights for policy 0, policy_version 228447 (0.0041) [2024-06-28 12:47:57,922][09190] Fps is (10 sec: 42598.0, 60 sec: 41233.0, 300 sec: 41654.2). Total num frames: 3742957568. Throughput: 0: 41260.3. Samples: 21785620. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2024-06-28 12:47:57,922][09190] Avg episode reward: [(0, '0.170')] [2024-06-28 12:47:59,918][09423] Updated weights for policy 0, policy_version 228457 (0.0032) [2024-06-28 12:48:02,921][09190] Fps is (10 sec: 40960.1, 60 sec: 40960.1, 300 sec: 41487.6). Total num frames: 3743154176. Throughput: 0: 41069.6. Samples: 22031800. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2024-06-28 12:48:02,922][09190] Avg episode reward: [(0, '0.170')] [2024-06-28 12:48:04,093][09423] Updated weights for policy 0, policy_version 228467 (0.0035) [2024-06-28 12:48:07,700][09423] Updated weights for policy 0, policy_version 228477 (0.0035) [2024-06-28 12:48:07,921][09190] Fps is (10 sec: 40960.8, 60 sec: 41233.1, 300 sec: 41376.9). Total num frames: 3743367168. Throughput: 0: 41223.1. Samples: 22284240. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2024-06-28 12:48:07,922][09190] Avg episode reward: [(0, '0.172')] [2024-06-28 12:48:11,796][09423] Updated weights for policy 0, policy_version 228487 (0.0039) [2024-06-28 12:48:12,921][09190] Fps is (10 sec: 42598.4, 60 sec: 41506.2, 300 sec: 41598.7). Total num frames: 3743580160. Throughput: 0: 41221.4. Samples: 22407420. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2024-06-28 12:48:12,922][09190] Avg episode reward: [(0, '0.174')] [2024-06-28 12:48:15,694][09423] Updated weights for policy 0, policy_version 228497 (0.0041) [2024-06-28 12:48:17,921][09190] Fps is (10 sec: 42598.3, 60 sec: 41506.3, 300 sec: 41432.1). Total num frames: 3743793152. Throughput: 0: 41339.6. Samples: 22654500. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2024-06-28 12:48:17,922][09190] Avg episode reward: [(0, '0.172')] [2024-06-28 12:48:17,941][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000228503_3743793152.pth... [2024-06-28 12:48:17,989][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000227897_3733864448.pth [2024-06-28 12:48:19,152][09423] Updated weights for policy 0, policy_version 228507 (0.0027) [2024-06-28 12:48:22,924][09190] Fps is (10 sec: 40949.5, 60 sec: 41231.3, 300 sec: 41431.7). Total num frames: 3743989760. Throughput: 0: 41541.3. Samples: 22907140. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2024-06-28 12:48:22,924][09190] Avg episode reward: [(0, '0.172')] [2024-06-28 12:48:23,606][09423] Updated weights for policy 0, policy_version 228517 (0.0045) [2024-06-28 12:48:27,662][09423] Updated weights for policy 0, policy_version 228527 (0.0040) [2024-06-28 12:48:27,921][09190] Fps is (10 sec: 40959.6, 60 sec: 41779.2, 300 sec: 41598.7). Total num frames: 3744202752. Throughput: 0: 41379.1. Samples: 23033680. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2024-06-28 12:48:27,922][09190] Avg episode reward: [(0, '0.168')] [2024-06-28 12:48:31,296][09423] Updated weights for policy 0, policy_version 228537 (0.0041) [2024-06-28 12:48:32,924][09190] Fps is (10 sec: 42598.4, 60 sec: 41777.5, 300 sec: 41431.7). Total num frames: 3744415744. Throughput: 0: 41388.0. Samples: 23279580. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2024-06-28 12:48:32,924][09190] Avg episode reward: [(0, '0.171')] [2024-06-28 12:48:35,131][09423] Updated weights for policy 0, policy_version 228547 (0.0036) [2024-06-28 12:48:37,924][09190] Fps is (10 sec: 40949.9, 60 sec: 41231.3, 300 sec: 41431.8). Total num frames: 3744612352. Throughput: 0: 41468.3. Samples: 23533800. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2024-06-28 12:48:37,924][09190] Avg episode reward: [(0, '0.173')] [2024-06-28 12:48:39,171][09423] Updated weights for policy 0, policy_version 228557 (0.0029) [2024-06-28 12:48:42,739][09423] Updated weights for policy 0, policy_version 228567 (0.0038) [2024-06-28 12:48:42,921][09190] Fps is (10 sec: 42608.8, 60 sec: 41779.2, 300 sec: 41487.6). Total num frames: 3744841728. Throughput: 0: 41649.0. Samples: 23659820. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2024-06-28 12:48:42,923][09190] Avg episode reward: [(0, '0.171')] [2024-06-28 12:48:46,806][09423] Updated weights for policy 0, policy_version 228577 (0.0036) [2024-06-28 12:48:47,921][09190] Fps is (10 sec: 44248.1, 60 sec: 42052.3, 300 sec: 41432.1). Total num frames: 3745054720. Throughput: 0: 41739.1. Samples: 23910060. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2024-06-28 12:48:47,922][09190] Avg episode reward: [(0, '0.172')] [2024-06-28 12:48:50,979][09423] Updated weights for policy 0, policy_version 228587 (0.0036) [2024-06-28 12:48:52,921][09190] Fps is (10 sec: 39321.9, 60 sec: 41506.1, 300 sec: 41487.6). Total num frames: 3745234944. Throughput: 0: 41829.3. Samples: 24166560. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2024-06-28 12:48:52,922][09190] Avg episode reward: [(0, '0.172')] [2024-06-28 12:48:54,765][09423] Updated weights for policy 0, policy_version 228597 (0.0036) [2024-06-28 12:48:57,923][09190] Fps is (10 sec: 39316.7, 60 sec: 41505.4, 300 sec: 41487.5). Total num frames: 3745447936. Throughput: 0: 41625.0. Samples: 24280600. Policy #0 lag: (min: 0.0, avg: 9.4, max: 23.0) [2024-06-28 12:48:57,923][09190] Avg episode reward: [(0, '0.174')] [2024-06-28 12:48:59,071][09423] Updated weights for policy 0, policy_version 228607 (0.0034) [2024-06-28 12:49:02,614][09423] Updated weights for policy 0, policy_version 228617 (0.0038) [2024-06-28 12:49:02,921][09190] Fps is (10 sec: 42597.8, 60 sec: 41779.1, 300 sec: 41321.3). Total num frames: 3745660928. Throughput: 0: 41768.8. Samples: 24534100. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 12:49:02,922][09190] Avg episode reward: [(0, '0.174')] [2024-06-28 12:49:06,862][09423] Updated weights for policy 0, policy_version 228627 (0.0038) [2024-06-28 12:49:07,921][09190] Fps is (10 sec: 39326.7, 60 sec: 41233.1, 300 sec: 41376.5). Total num frames: 3745841152. Throughput: 0: 41652.1. Samples: 24781380. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 12:49:07,922][09190] Avg episode reward: [(0, '0.177')] [2024-06-28 12:49:09,860][09403] Signal inference workers to stop experience collection... (300 times) [2024-06-28 12:49:09,861][09403] Signal inference workers to resume experience collection... (300 times) [2024-06-28 12:49:09,876][09423] InferenceWorker_p0-w0: stopping experience collection (300 times) [2024-06-28 12:49:09,905][09423] InferenceWorker_p0-w0: resuming experience collection (300 times) [2024-06-28 12:49:10,440][09423] Updated weights for policy 0, policy_version 228637 (0.0038) [2024-06-28 12:49:12,921][09190] Fps is (10 sec: 40960.2, 60 sec: 41506.0, 300 sec: 41376.5). Total num frames: 3746070528. Throughput: 0: 41505.8. Samples: 24901440. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 12:49:12,922][09190] Avg episode reward: [(0, '0.174')] [2024-06-28 12:49:14,653][09423] Updated weights for policy 0, policy_version 228647 (0.0036) [2024-06-28 12:49:17,921][09190] Fps is (10 sec: 40959.8, 60 sec: 40960.0, 300 sec: 41321.0). Total num frames: 3746250752. Throughput: 0: 41660.5. Samples: 25154200. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 12:49:17,922][09190] Avg episode reward: [(0, '0.175')] [2024-06-28 12:49:18,666][09423] Updated weights for policy 0, policy_version 228657 (0.0044) [2024-06-28 12:49:22,489][09423] Updated weights for policy 0, policy_version 228667 (0.0042) [2024-06-28 12:49:22,921][09190] Fps is (10 sec: 42598.4, 60 sec: 41780.9, 300 sec: 41543.1). Total num frames: 3746496512. Throughput: 0: 41567.2. Samples: 25404220. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 12:49:22,922][09190] Avg episode reward: [(0, '0.179')] [2024-06-28 12:49:26,221][09423] Updated weights for policy 0, policy_version 228677 (0.0039) [2024-06-28 12:49:27,921][09190] Fps is (10 sec: 47513.1, 60 sec: 42052.3, 300 sec: 41487.6). Total num frames: 3746725888. Throughput: 0: 41590.6. Samples: 25531400. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 12:49:27,922][09190] Avg episode reward: [(0, '0.179')] [2024-06-28 12:49:30,607][09423] Updated weights for policy 0, policy_version 228687 (0.0046) [2024-06-28 12:49:32,921][09190] Fps is (10 sec: 40959.9, 60 sec: 41507.8, 300 sec: 41432.1). Total num frames: 3746906112. Throughput: 0: 41633.2. Samples: 25783560. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 12:49:32,930][09190] Avg episode reward: [(0, '0.181')] [2024-06-28 12:49:34,395][09423] Updated weights for policy 0, policy_version 228697 (0.0035) [2024-06-28 12:49:37,921][09190] Fps is (10 sec: 36045.1, 60 sec: 41234.8, 300 sec: 41376.5). Total num frames: 3747086336. Throughput: 0: 41447.1. Samples: 26031680. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 12:49:37,922][09190] Avg episode reward: [(0, '0.178')] [2024-06-28 12:49:38,659][09423] Updated weights for policy 0, policy_version 228707 (0.0051) [2024-06-28 12:49:42,266][09423] Updated weights for policy 0, policy_version 228717 (0.0028) [2024-06-28 12:49:42,921][09190] Fps is (10 sec: 42598.5, 60 sec: 41506.1, 300 sec: 41376.5). Total num frames: 3747332096. Throughput: 0: 41638.0. Samples: 26154260. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 12:49:42,922][09190] Avg episode reward: [(0, '0.179')] [2024-06-28 12:49:46,391][09423] Updated weights for policy 0, policy_version 228727 (0.0040) [2024-06-28 12:49:47,921][09190] Fps is (10 sec: 42598.1, 60 sec: 40959.9, 300 sec: 41376.5). Total num frames: 3747512320. Throughput: 0: 41696.9. Samples: 26410460. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 12:49:47,926][09190] Avg episode reward: [(0, '0.182')] [2024-06-28 12:49:49,721][09423] Updated weights for policy 0, policy_version 228737 (0.0037) [2024-06-28 12:49:52,924][09190] Fps is (10 sec: 40949.9, 60 sec: 41777.4, 300 sec: 41431.7). Total num frames: 3747741696. Throughput: 0: 41646.5. Samples: 26655580. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 12:49:52,924][09190] Avg episode reward: [(0, '0.184')] [2024-06-28 12:49:54,399][09423] Updated weights for policy 0, policy_version 228747 (0.0033) [2024-06-28 12:49:57,921][09190] Fps is (10 sec: 42599.0, 60 sec: 41507.0, 300 sec: 41376.9). Total num frames: 3747938304. Throughput: 0: 41859.7. Samples: 26785120. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 12:49:57,922][09190] Avg episode reward: [(0, '0.186')] [2024-06-28 12:49:58,045][09423] Updated weights for policy 0, policy_version 228757 (0.0028) [2024-06-28 12:50:02,210][09423] Updated weights for policy 0, policy_version 228767 (0.0029) [2024-06-28 12:50:02,921][09190] Fps is (10 sec: 40970.1, 60 sec: 41506.2, 300 sec: 41432.1). Total num frames: 3748151296. Throughput: 0: 41759.5. Samples: 27033380. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 12:50:02,922][09190] Avg episode reward: [(0, '0.188')] [2024-06-28 12:50:05,494][09423] Updated weights for policy 0, policy_version 228777 (0.0040) [2024-06-28 12:50:07,921][09190] Fps is (10 sec: 44236.3, 60 sec: 42325.3, 300 sec: 41487.6). Total num frames: 3748380672. Throughput: 0: 41609.8. Samples: 27276660. Policy #0 lag: (min: 0.0, avg: 12.4, max: 23.0) [2024-06-28 12:50:07,922][09190] Avg episode reward: [(0, '0.198')] [2024-06-28 12:50:09,721][09423] Updated weights for policy 0, policy_version 228787 (0.0039) [2024-06-28 12:50:12,924][09190] Fps is (10 sec: 42587.9, 60 sec: 41777.5, 300 sec: 41431.7). Total num frames: 3748577280. Throughput: 0: 41675.5. Samples: 27406900. Policy #0 lag: (min: 0.0, avg: 12.4, max: 23.0) [2024-06-28 12:50:12,924][09190] Avg episode reward: [(0, '0.195')] [2024-06-28 12:50:13,221][09423] Updated weights for policy 0, policy_version 228797 (0.0038) [2024-06-28 12:50:17,921][09190] Fps is (10 sec: 36045.1, 60 sec: 41506.2, 300 sec: 41321.0). Total num frames: 3748741120. Throughput: 0: 41413.0. Samples: 27647140. Policy #0 lag: (min: 0.0, avg: 12.4, max: 23.0) [2024-06-28 12:50:17,922][09190] Avg episode reward: [(0, '0.189')] [2024-06-28 12:50:17,936][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000228806_3748757504.pth... [2024-06-28 12:50:18,004][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000228200_3738828800.pth [2024-06-28 12:50:18,512][09423] Updated weights for policy 0, policy_version 228807 (0.0038) [2024-06-28 12:50:21,170][09423] Updated weights for policy 0, policy_version 228817 (0.0031) [2024-06-28 12:50:22,921][09190] Fps is (10 sec: 42608.6, 60 sec: 41779.2, 300 sec: 41432.1). Total num frames: 3749003264. Throughput: 0: 41299.9. Samples: 27890180. Policy #0 lag: (min: 0.0, avg: 12.4, max: 23.0) [2024-06-28 12:50:22,922][09190] Avg episode reward: [(0, '0.201')] [2024-06-28 12:50:26,022][09423] Updated weights for policy 0, policy_version 228827 (0.0038) [2024-06-28 12:50:27,921][09190] Fps is (10 sec: 42598.4, 60 sec: 40687.0, 300 sec: 41321.0). Total num frames: 3749167104. Throughput: 0: 41668.5. Samples: 28029340. Policy #0 lag: (min: 0.0, avg: 12.4, max: 23.0) [2024-06-28 12:50:27,922][09190] Avg episode reward: [(0, '0.198')] [2024-06-28 12:50:28,143][09403] Signal inference workers to stop experience collection... (350 times) [2024-06-28 12:50:28,143][09403] Signal inference workers to resume experience collection... (350 times) [2024-06-28 12:50:28,180][09423] InferenceWorker_p0-w0: stopping experience collection (350 times) [2024-06-28 12:50:28,180][09423] InferenceWorker_p0-w0: resuming experience collection (350 times) [2024-06-28 12:50:29,277][09423] Updated weights for policy 0, policy_version 228837 (0.0040) [2024-06-28 12:50:32,921][09190] Fps is (10 sec: 37683.8, 60 sec: 41233.2, 300 sec: 41432.1). Total num frames: 3749380096. Throughput: 0: 41372.6. Samples: 28272220. Policy #0 lag: (min: 0.0, avg: 12.4, max: 23.0) [2024-06-28 12:50:32,922][09190] Avg episode reward: [(0, '0.202')] [2024-06-28 12:50:33,511][09423] Updated weights for policy 0, policy_version 228847 (0.0035) [2024-06-28 12:50:36,960][09423] Updated weights for policy 0, policy_version 228857 (0.0045) [2024-06-28 12:50:37,921][09190] Fps is (10 sec: 45875.2, 60 sec: 42325.4, 300 sec: 41487.7). Total num frames: 3749625856. Throughput: 0: 41349.0. Samples: 28516180. Policy #0 lag: (min: 0.0, avg: 12.4, max: 23.0) [2024-06-28 12:50:37,922][09190] Avg episode reward: [(0, '0.205')] [2024-06-28 12:50:41,584][09423] Updated weights for policy 0, policy_version 228867 (0.0051) [2024-06-28 12:50:42,921][09190] Fps is (10 sec: 42598.3, 60 sec: 41233.1, 300 sec: 41487.6). Total num frames: 3749806080. Throughput: 0: 41259.1. Samples: 28641780. Policy #0 lag: (min: 0.0, avg: 12.4, max: 23.0) [2024-06-28 12:50:42,922][09190] Avg episode reward: [(0, '0.196')] [2024-06-28 12:50:44,862][09423] Updated weights for policy 0, policy_version 228877 (0.0045) [2024-06-28 12:50:47,921][09190] Fps is (10 sec: 39321.4, 60 sec: 41779.2, 300 sec: 41487.6). Total num frames: 3750019072. Throughput: 0: 41163.6. Samples: 28885740. Policy #0 lag: (min: 0.0, avg: 12.4, max: 23.0) [2024-06-28 12:50:47,922][09190] Avg episode reward: [(0, '0.197')] [2024-06-28 12:50:49,656][09423] Updated weights for policy 0, policy_version 228887 (0.0028) [2024-06-28 12:50:52,921][09190] Fps is (10 sec: 42598.3, 60 sec: 41507.9, 300 sec: 41543.2). Total num frames: 3750232064. Throughput: 0: 41461.4. Samples: 29142420. Policy #0 lag: (min: 0.0, avg: 12.4, max: 23.0) [2024-06-28 12:50:52,922][09190] Avg episode reward: [(0, '0.197')] [2024-06-28 12:50:53,056][09423] Updated weights for policy 0, policy_version 228897 (0.0023) [2024-06-28 12:50:57,436][09423] Updated weights for policy 0, policy_version 228907 (0.0038) [2024-06-28 12:50:57,921][09190] Fps is (10 sec: 39321.4, 60 sec: 41233.0, 300 sec: 41432.1). Total num frames: 3750412288. Throughput: 0: 41295.1. Samples: 29265080. Policy #0 lag: (min: 0.0, avg: 12.4, max: 23.0) [2024-06-28 12:50:57,922][09190] Avg episode reward: [(0, '0.202')] [2024-06-28 12:51:00,675][09423] Updated weights for policy 0, policy_version 228917 (0.0037) [2024-06-28 12:51:02,922][09190] Fps is (10 sec: 42597.8, 60 sec: 41779.1, 300 sec: 41487.6). Total num frames: 3750658048. Throughput: 0: 41532.7. Samples: 29516120. Policy #0 lag: (min: 0.0, avg: 12.4, max: 23.0) [2024-06-28 12:51:02,922][09190] Avg episode reward: [(0, '0.206')] [2024-06-28 12:51:05,131][09423] Updated weights for policy 0, policy_version 228927 (0.0039) [2024-06-28 12:51:07,922][09190] Fps is (10 sec: 45874.8, 60 sec: 41506.1, 300 sec: 41543.1). Total num frames: 3750871040. Throughput: 0: 41793.3. Samples: 29770880. Policy #0 lag: (min: 0.0, avg: 12.4, max: 23.0) [2024-06-28 12:51:07,922][09190] Avg episode reward: [(0, '0.211')] [2024-06-28 12:51:08,445][09423] Updated weights for policy 0, policy_version 228937 (0.0037) [2024-06-28 12:51:12,921][09190] Fps is (10 sec: 39321.8, 60 sec: 41234.7, 300 sec: 41433.0). Total num frames: 3751051264. Throughput: 0: 41320.8. Samples: 29888780. Policy #0 lag: (min: 0.0, avg: 12.4, max: 23.0) [2024-06-28 12:51:12,922][09190] Avg episode reward: [(0, '0.209')] [2024-06-28 12:51:13,031][09423] Updated weights for policy 0, policy_version 228947 (0.0042) [2024-06-28 12:51:17,048][09423] Updated weights for policy 0, policy_version 228957 (0.0033) [2024-06-28 12:51:17,921][09190] Fps is (10 sec: 40960.7, 60 sec: 42325.3, 300 sec: 41432.1). Total num frames: 3751280640. Throughput: 0: 41518.2. Samples: 30140540. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-28 12:51:17,922][09190] Avg episode reward: [(0, '0.204')] [2024-06-28 12:51:20,775][09423] Updated weights for policy 0, policy_version 228967 (0.0041) [2024-06-28 12:51:22,921][09190] Fps is (10 sec: 42598.4, 60 sec: 41233.1, 300 sec: 41487.6). Total num frames: 3751477248. Throughput: 0: 41662.1. Samples: 30390980. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-28 12:51:22,922][09190] Avg episode reward: [(0, '0.208')] [2024-06-28 12:51:24,471][09423] Updated weights for policy 0, policy_version 228977 (0.0026) [2024-06-28 12:51:27,921][09190] Fps is (10 sec: 39321.6, 60 sec: 41779.2, 300 sec: 41432.1). Total num frames: 3751673856. Throughput: 0: 41672.4. Samples: 30517040. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-28 12:51:27,922][09190] Avg episode reward: [(0, '0.211')] [2024-06-28 12:51:29,072][09423] Updated weights for policy 0, policy_version 228987 (0.0035) [2024-06-28 12:51:32,585][09423] Updated weights for policy 0, policy_version 228997 (0.0034) [2024-06-28 12:51:32,921][09190] Fps is (10 sec: 40960.7, 60 sec: 41779.2, 300 sec: 41376.6). Total num frames: 3751886848. Throughput: 0: 41629.4. Samples: 30759060. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-28 12:51:32,922][09190] Avg episode reward: [(0, '0.214')] [2024-06-28 12:51:37,100][09423] Updated weights for policy 0, policy_version 229007 (0.0041) [2024-06-28 12:51:37,921][09190] Fps is (10 sec: 39321.6, 60 sec: 40686.9, 300 sec: 41376.5). Total num frames: 3752067072. Throughput: 0: 41444.0. Samples: 31007400. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-28 12:51:37,922][09190] Avg episode reward: [(0, '0.216')] [2024-06-28 12:51:40,765][09423] Updated weights for policy 0, policy_version 229017 (0.0041) [2024-06-28 12:51:42,921][09190] Fps is (10 sec: 42597.8, 60 sec: 41779.1, 300 sec: 41543.2). Total num frames: 3752312832. Throughput: 0: 41439.6. Samples: 31129860. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-28 12:51:42,930][09190] Avg episode reward: [(0, '0.215')] [2024-06-28 12:51:44,894][09423] Updated weights for policy 0, policy_version 229027 (0.0043) [2024-06-28 12:51:47,921][09190] Fps is (10 sec: 42598.3, 60 sec: 41233.1, 300 sec: 41432.1). Total num frames: 3752493056. Throughput: 0: 41420.6. Samples: 31380040. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-28 12:51:47,922][09190] Avg episode reward: [(0, '0.217')] [2024-06-28 12:51:48,368][09423] Updated weights for policy 0, policy_version 229037 (0.0044) [2024-06-28 12:51:49,051][09403] Signal inference workers to stop experience collection... (400 times) [2024-06-28 12:51:49,087][09423] InferenceWorker_p0-w0: stopping experience collection (400 times) [2024-06-28 12:51:49,169][09403] Signal inference workers to resume experience collection... (400 times) [2024-06-28 12:51:49,169][09423] InferenceWorker_p0-w0: resuming experience collection (400 times) [2024-06-28 12:51:52,924][09190] Fps is (10 sec: 37673.9, 60 sec: 40958.3, 300 sec: 41376.2). Total num frames: 3752689664. Throughput: 0: 41338.7. Samples: 31631220. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-28 12:51:52,924][09190] Avg episode reward: [(0, '0.217')] [2024-06-28 12:51:53,045][09423] Updated weights for policy 0, policy_version 229047 (0.0041) [2024-06-28 12:51:56,623][09423] Updated weights for policy 0, policy_version 229057 (0.0026) [2024-06-28 12:51:57,921][09190] Fps is (10 sec: 44236.5, 60 sec: 42052.3, 300 sec: 41487.6). Total num frames: 3752935424. Throughput: 0: 41337.4. Samples: 31748960. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-28 12:51:57,922][09190] Avg episode reward: [(0, '0.202')] [2024-06-28 12:52:01,143][09423] Updated weights for policy 0, policy_version 229067 (0.0034) [2024-06-28 12:52:02,921][09190] Fps is (10 sec: 42609.2, 60 sec: 40960.1, 300 sec: 41432.1). Total num frames: 3753115648. Throughput: 0: 41281.3. Samples: 31998200. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-28 12:52:02,922][09190] Avg episode reward: [(0, '0.216')] [2024-06-28 12:52:04,250][09423] Updated weights for policy 0, policy_version 229077 (0.0031) [2024-06-28 12:52:07,921][09190] Fps is (10 sec: 39321.6, 60 sec: 40960.1, 300 sec: 41487.6). Total num frames: 3753328640. Throughput: 0: 41240.0. Samples: 32246780. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-28 12:52:07,922][09190] Avg episode reward: [(0, '0.218')] [2024-06-28 12:52:08,683][09423] Updated weights for policy 0, policy_version 229087 (0.0034) [2024-06-28 12:52:12,570][09423] Updated weights for policy 0, policy_version 229097 (0.0036) [2024-06-28 12:52:12,921][09190] Fps is (10 sec: 42597.9, 60 sec: 41506.1, 300 sec: 41487.6). Total num frames: 3753541632. Throughput: 0: 41126.1. Samples: 32367720. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-28 12:52:12,922][09190] Avg episode reward: [(0, '0.210')] [2024-06-28 12:52:16,673][09423] Updated weights for policy 0, policy_version 229107 (0.0043) [2024-06-28 12:52:17,924][09190] Fps is (10 sec: 42587.9, 60 sec: 41231.3, 300 sec: 41487.3). Total num frames: 3753754624. Throughput: 0: 41209.6. Samples: 32613600. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-28 12:52:17,924][09190] Avg episode reward: [(0, '0.212')] [2024-06-28 12:52:17,932][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000229111_3753754624.pth... [2024-06-28 12:52:17,974][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000228503_3743793152.pth [2024-06-28 12:52:20,866][09423] Updated weights for policy 0, policy_version 229117 (0.0040) [2024-06-28 12:52:22,921][09190] Fps is (10 sec: 39321.7, 60 sec: 40960.0, 300 sec: 41487.6). Total num frames: 3753934848. Throughput: 0: 41125.3. Samples: 32858040. Policy #0 lag: (min: 0.0, avg: 10.5, max: 22.0) [2024-06-28 12:52:22,922][09190] Avg episode reward: [(0, '0.211')] [2024-06-28 12:52:24,579][09423] Updated weights for policy 0, policy_version 229127 (0.0029) [2024-06-28 12:52:27,921][09190] Fps is (10 sec: 39331.5, 60 sec: 41233.0, 300 sec: 41487.6). Total num frames: 3754147840. Throughput: 0: 41115.6. Samples: 32980060. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-28 12:52:27,922][09190] Avg episode reward: [(0, '0.217')] [2024-06-28 12:52:28,811][09423] Updated weights for policy 0, policy_version 229137 (0.0033) [2024-06-28 12:52:32,247][09423] Updated weights for policy 0, policy_version 229147 (0.0039) [2024-06-28 12:52:32,921][09190] Fps is (10 sec: 42598.6, 60 sec: 41233.0, 300 sec: 41432.1). Total num frames: 3754360832. Throughput: 0: 41272.0. Samples: 33237280. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-28 12:52:32,922][09190] Avg episode reward: [(0, '0.216')] [2024-06-28 12:52:36,851][09423] Updated weights for policy 0, policy_version 229157 (0.0036) [2024-06-28 12:52:37,921][09190] Fps is (10 sec: 40960.1, 60 sec: 41506.1, 300 sec: 41432.1). Total num frames: 3754557440. Throughput: 0: 41205.0. Samples: 33485340. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-28 12:52:37,922][09190] Avg episode reward: [(0, '0.215')] [2024-06-28 12:52:40,211][09423] Updated weights for policy 0, policy_version 229167 (0.0026) [2024-06-28 12:52:42,921][09190] Fps is (10 sec: 40959.6, 60 sec: 40960.0, 300 sec: 41487.6). Total num frames: 3754770432. Throughput: 0: 41197.7. Samples: 33602860. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-28 12:52:42,922][09190] Avg episode reward: [(0, '0.222')] [2024-06-28 12:52:44,914][09423] Updated weights for policy 0, policy_version 229177 (0.0032) [2024-06-28 12:52:47,921][09190] Fps is (10 sec: 42598.0, 60 sec: 41506.1, 300 sec: 41487.6). Total num frames: 3754983424. Throughput: 0: 41231.9. Samples: 33853640. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-28 12:52:47,922][09190] Avg episode reward: [(0, '0.217')] [2024-06-28 12:52:48,134][09423] Updated weights for policy 0, policy_version 229187 (0.0030) [2024-06-28 12:52:52,654][09423] Updated weights for policy 0, policy_version 229197 (0.0034) [2024-06-28 12:52:52,921][09190] Fps is (10 sec: 40960.6, 60 sec: 41507.9, 300 sec: 41432.1). Total num frames: 3755180032. Throughput: 0: 41235.2. Samples: 34102360. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-28 12:52:52,922][09190] Avg episode reward: [(0, '0.224')] [2024-06-28 12:52:56,020][09423] Updated weights for policy 0, policy_version 229207 (0.0043) [2024-06-28 12:52:57,921][09190] Fps is (10 sec: 44237.1, 60 sec: 41506.2, 300 sec: 41598.7). Total num frames: 3755425792. Throughput: 0: 41214.3. Samples: 34222360. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-28 12:52:57,922][09190] Avg episode reward: [(0, '0.220')] [2024-06-28 12:53:00,731][09423] Updated weights for policy 0, policy_version 229217 (0.0031) [2024-06-28 12:53:02,921][09190] Fps is (10 sec: 40960.0, 60 sec: 41233.1, 300 sec: 41432.1). Total num frames: 3755589632. Throughput: 0: 41457.9. Samples: 34479100. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-28 12:53:02,922][09190] Avg episode reward: [(0, '0.226')] [2024-06-28 12:53:03,886][09423] Updated weights for policy 0, policy_version 229227 (0.0039) [2024-06-28 12:53:07,921][09190] Fps is (10 sec: 36045.0, 60 sec: 40960.1, 300 sec: 41376.5). Total num frames: 3755786240. Throughput: 0: 41580.1. Samples: 34729140. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-28 12:53:07,922][09190] Avg episode reward: [(0, '0.223')] [2024-06-28 12:53:08,716][09423] Updated weights for policy 0, policy_version 229237 (0.0040) [2024-06-28 12:53:11,766][09423] Updated weights for policy 0, policy_version 229247 (0.0035) [2024-06-28 12:53:12,668][09403] Signal inference workers to stop experience collection... (450 times) [2024-06-28 12:53:12,669][09403] Signal inference workers to resume experience collection... (450 times) [2024-06-28 12:53:12,707][09423] InferenceWorker_p0-w0: stopping experience collection (450 times) [2024-06-28 12:53:12,707][09423] InferenceWorker_p0-w0: resuming experience collection (450 times) [2024-06-28 12:53:12,921][09190] Fps is (10 sec: 45875.2, 60 sec: 41779.3, 300 sec: 41543.2). Total num frames: 3756048384. Throughput: 0: 41593.8. Samples: 34851780. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-28 12:53:12,922][09190] Avg episode reward: [(0, '0.221')] [2024-06-28 12:53:16,270][09423] Updated weights for policy 0, policy_version 229257 (0.0050) [2024-06-28 12:53:17,921][09190] Fps is (10 sec: 44236.2, 60 sec: 41234.7, 300 sec: 41488.0). Total num frames: 3756228608. Throughput: 0: 41553.7. Samples: 35107200. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-28 12:53:17,922][09190] Avg episode reward: [(0, '0.229')] [2024-06-28 12:53:19,535][09423] Updated weights for policy 0, policy_version 229267 (0.0040) [2024-06-28 12:53:22,921][09190] Fps is (10 sec: 37683.2, 60 sec: 41506.2, 300 sec: 41432.1). Total num frames: 3756425216. Throughput: 0: 41523.6. Samples: 35353900. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-28 12:53:22,922][09190] Avg episode reward: [(0, '0.233')] [2024-06-28 12:53:24,553][09423] Updated weights for policy 0, policy_version 229277 (0.0040) [2024-06-28 12:53:27,407][09423] Updated weights for policy 0, policy_version 229287 (0.0031) [2024-06-28 12:53:27,921][09190] Fps is (10 sec: 42599.0, 60 sec: 41779.2, 300 sec: 41488.0). Total num frames: 3756654592. Throughput: 0: 41637.1. Samples: 35476520. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-28 12:53:27,922][09190] Avg episode reward: [(0, '0.251')] [2024-06-28 12:53:32,157][09423] Updated weights for policy 0, policy_version 229297 (0.0045) [2024-06-28 12:53:32,921][09190] Fps is (10 sec: 40960.0, 60 sec: 41233.1, 300 sec: 41432.4). Total num frames: 3756834816. Throughput: 0: 41737.4. Samples: 35731820. Policy #0 lag: (min: 0.0, avg: 11.6, max: 23.0) [2024-06-28 12:53:32,922][09190] Avg episode reward: [(0, '0.242')] [2024-06-28 12:53:35,104][09423] Updated weights for policy 0, policy_version 229307 (0.0040) [2024-06-28 12:53:37,921][09190] Fps is (10 sec: 40959.9, 60 sec: 41779.2, 300 sec: 41432.1). Total num frames: 3757064192. Throughput: 0: 41698.2. Samples: 35978780. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-28 12:53:37,922][09190] Avg episode reward: [(0, '0.254')] [2024-06-28 12:53:39,960][09423] Updated weights for policy 0, policy_version 229317 (0.0055) [2024-06-28 12:53:42,832][09423] Updated weights for policy 0, policy_version 229327 (0.0048) [2024-06-28 12:53:42,924][09190] Fps is (10 sec: 45863.6, 60 sec: 42050.6, 300 sec: 41487.3). Total num frames: 3757293568. Throughput: 0: 41927.0. Samples: 36109180. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-28 12:53:42,924][09190] Avg episode reward: [(0, '0.241')] [2024-06-28 12:53:47,799][09423] Updated weights for policy 0, policy_version 229337 (0.0034) [2024-06-28 12:53:47,921][09190] Fps is (10 sec: 39321.8, 60 sec: 41233.2, 300 sec: 41432.1). Total num frames: 3757457408. Throughput: 0: 41821.8. Samples: 36361080. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-28 12:53:47,922][09190] Avg episode reward: [(0, '0.286')] [2024-06-28 12:53:50,505][09423] Updated weights for policy 0, policy_version 229347 (0.0024) [2024-06-28 12:53:52,921][09190] Fps is (10 sec: 42608.8, 60 sec: 42325.3, 300 sec: 41598.9). Total num frames: 3757719552. Throughput: 0: 41712.8. Samples: 36606220. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-28 12:53:52,922][09190] Avg episode reward: [(0, '0.277')] [2024-06-28 12:53:55,636][09423] Updated weights for policy 0, policy_version 229357 (0.0030) [2024-06-28 12:53:57,921][09190] Fps is (10 sec: 45874.4, 60 sec: 41506.1, 300 sec: 41543.2). Total num frames: 3757916160. Throughput: 0: 42100.8. Samples: 36746320. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-28 12:53:57,922][09190] Avg episode reward: [(0, '0.270')] [2024-06-28 12:53:58,559][09423] Updated weights for policy 0, policy_version 229367 (0.0038) [2024-06-28 12:54:02,921][09190] Fps is (10 sec: 37683.6, 60 sec: 41779.2, 300 sec: 41543.2). Total num frames: 3758096384. Throughput: 0: 41921.0. Samples: 36993640. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-28 12:54:02,922][09190] Avg episode reward: [(0, '0.266')] [2024-06-28 12:54:03,470][09423] Updated weights for policy 0, policy_version 229377 (0.0041) [2024-06-28 12:54:06,421][09423] Updated weights for policy 0, policy_version 229387 (0.0047) [2024-06-28 12:54:07,921][09190] Fps is (10 sec: 44236.9, 60 sec: 42871.4, 300 sec: 41654.2). Total num frames: 3758358528. Throughput: 0: 41600.3. Samples: 37225920. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-28 12:54:07,922][09190] Avg episode reward: [(0, '0.289')] [2024-06-28 12:54:11,489][09423] Updated weights for policy 0, policy_version 229397 (0.0034) [2024-06-28 12:54:12,921][09190] Fps is (10 sec: 40960.0, 60 sec: 40960.0, 300 sec: 41543.2). Total num frames: 3758505984. Throughput: 0: 42048.9. Samples: 37368720. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-28 12:54:12,922][09190] Avg episode reward: [(0, '0.292')] [2024-06-28 12:54:14,133][09423] Updated weights for policy 0, policy_version 229407 (0.0038) [2024-06-28 12:54:17,924][09190] Fps is (10 sec: 36036.0, 60 sec: 41504.4, 300 sec: 41431.7). Total num frames: 3758718976. Throughput: 0: 41719.0. Samples: 37609280. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-28 12:54:17,925][09190] Avg episode reward: [(0, '0.306')] [2024-06-28 12:54:17,948][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000229414_3758718976.pth... [2024-06-28 12:54:18,005][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000228806_3748757504.pth [2024-06-28 12:54:18,995][09423] Updated weights for policy 0, policy_version 229417 (0.0036) [2024-06-28 12:54:21,821][09423] Updated weights for policy 0, policy_version 229427 (0.0030) [2024-06-28 12:54:22,921][09190] Fps is (10 sec: 45875.1, 60 sec: 42325.3, 300 sec: 41487.6). Total num frames: 3758964736. Throughput: 0: 41970.2. Samples: 37867440. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-28 12:54:22,922][09190] Avg episode reward: [(0, '0.317')] [2024-06-28 12:54:22,952][09403] Signal inference workers to stop experience collection... (500 times) [2024-06-28 12:54:22,954][09403] Signal inference workers to resume experience collection... (500 times) [2024-06-28 12:54:22,970][09423] InferenceWorker_p0-w0: stopping experience collection (500 times) [2024-06-28 12:54:22,971][09423] InferenceWorker_p0-w0: resuming experience collection (500 times) [2024-06-28 12:54:26,487][09423] Updated weights for policy 0, policy_version 229437 (0.0031) [2024-06-28 12:54:27,921][09190] Fps is (10 sec: 44247.7, 60 sec: 41779.1, 300 sec: 41543.2). Total num frames: 3759161344. Throughput: 0: 42251.2. Samples: 38010380. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-28 12:54:27,922][09190] Avg episode reward: [(0, '0.291')] [2024-06-28 12:54:29,686][09423] Updated weights for policy 0, policy_version 229447 (0.0047) [2024-06-28 12:54:32,921][09190] Fps is (10 sec: 40959.6, 60 sec: 42325.3, 300 sec: 41654.2). Total num frames: 3759374336. Throughput: 0: 41979.4. Samples: 38250160. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-28 12:54:32,922][09190] Avg episode reward: [(0, '0.321')] [2024-06-28 12:54:34,157][09423] Updated weights for policy 0, policy_version 229457 (0.0051) [2024-06-28 12:54:37,615][09423] Updated weights for policy 0, policy_version 229467 (0.0031) [2024-06-28 12:54:37,921][09190] Fps is (10 sec: 44237.3, 60 sec: 42325.3, 300 sec: 41598.7). Total num frames: 3759603712. Throughput: 0: 42421.9. Samples: 38515200. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-28 12:54:37,922][09190] Avg episode reward: [(0, '0.336')] [2024-06-28 12:54:41,775][09423] Updated weights for policy 0, policy_version 229477 (0.0030) [2024-06-28 12:54:42,921][09190] Fps is (10 sec: 42598.9, 60 sec: 41781.0, 300 sec: 41654.3). Total num frames: 3759800320. Throughput: 0: 42228.1. Samples: 38646580. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-28 12:54:42,922][09190] Avg episode reward: [(0, '0.330')] [2024-06-28 12:54:45,247][09423] Updated weights for policy 0, policy_version 229487 (0.0039) [2024-06-28 12:54:47,922][09190] Fps is (10 sec: 42597.6, 60 sec: 42871.3, 300 sec: 41654.6). Total num frames: 3760029696. Throughput: 0: 42093.2. Samples: 38887840. Policy #0 lag: (min: 0.0, avg: 9.9, max: 26.0) [2024-06-28 12:54:47,922][09190] Avg episode reward: [(0, '0.371')] [2024-06-28 12:54:49,389][09423] Updated weights for policy 0, policy_version 229497 (0.0029) [2024-06-28 12:54:52,749][09423] Updated weights for policy 0, policy_version 229507 (0.0032) [2024-06-28 12:54:52,921][09190] Fps is (10 sec: 44236.3, 60 sec: 42052.3, 300 sec: 41709.8). Total num frames: 3760242688. Throughput: 0: 42845.4. Samples: 39153960. Policy #0 lag: (min: 0.0, avg: 9.9, max: 26.0) [2024-06-28 12:54:52,922][09190] Avg episode reward: [(0, '0.434')] [2024-06-28 12:54:56,898][09423] Updated weights for policy 0, policy_version 229517 (0.0046) [2024-06-28 12:54:57,921][09190] Fps is (10 sec: 39322.2, 60 sec: 41779.3, 300 sec: 41598.7). Total num frames: 3760422912. Throughput: 0: 42488.4. Samples: 39280700. Policy #0 lag: (min: 0.0, avg: 9.9, max: 26.0) [2024-06-28 12:54:57,922][09190] Avg episode reward: [(0, '0.455')] [2024-06-28 12:55:00,403][09423] Updated weights for policy 0, policy_version 229527 (0.0038) [2024-06-28 12:55:02,922][09190] Fps is (10 sec: 42598.2, 60 sec: 42871.4, 300 sec: 41654.2). Total num frames: 3760668672. Throughput: 0: 42711.2. Samples: 39531180. Policy #0 lag: (min: 0.0, avg: 9.9, max: 26.0) [2024-06-28 12:55:02,925][09190] Avg episode reward: [(0, '0.452')] [2024-06-28 12:55:05,361][09423] Updated weights for policy 0, policy_version 229537 (0.0043) [2024-06-28 12:55:07,921][09190] Fps is (10 sec: 45874.6, 60 sec: 42052.2, 300 sec: 41710.1). Total num frames: 3760881664. Throughput: 0: 42699.4. Samples: 39788920. Policy #0 lag: (min: 0.0, avg: 9.9, max: 26.0) [2024-06-28 12:55:07,922][09190] Avg episode reward: [(0, '0.470')] [2024-06-28 12:55:08,040][09423] Updated weights for policy 0, policy_version 229547 (0.0046) [2024-06-28 12:55:12,889][09423] Updated weights for policy 0, policy_version 229557 (0.0047) [2024-06-28 12:55:12,921][09190] Fps is (10 sec: 39321.9, 60 sec: 42598.3, 300 sec: 41765.3). Total num frames: 3761061888. Throughput: 0: 42268.5. Samples: 39912460. Policy #0 lag: (min: 0.0, avg: 9.9, max: 26.0) [2024-06-28 12:55:12,922][09190] Avg episode reward: [(0, '0.514')] [2024-06-28 12:55:15,703][09423] Updated weights for policy 0, policy_version 229567 (0.0035) [2024-06-28 12:55:17,922][09190] Fps is (10 sec: 44236.4, 60 sec: 43419.3, 300 sec: 41765.3). Total num frames: 3761324032. Throughput: 0: 42483.4. Samples: 40161920. Policy #0 lag: (min: 0.0, avg: 9.9, max: 26.0) [2024-06-28 12:55:17,922][09190] Avg episode reward: [(0, '0.522')] [2024-06-28 12:55:17,945][09403] Saving new best policy, reward=0.522! [2024-06-28 12:55:20,462][09423] Updated weights for policy 0, policy_version 229577 (0.0028) [2024-06-28 12:55:22,928][09190] Fps is (10 sec: 45845.6, 60 sec: 42593.8, 300 sec: 41875.5). Total num frames: 3761520640. Throughput: 0: 42428.0. Samples: 40424740. Policy #0 lag: (min: 0.0, avg: 9.9, max: 26.0) [2024-06-28 12:55:22,929][09190] Avg episode reward: [(0, '0.521')] [2024-06-28 12:55:23,226][09423] Updated weights for policy 0, policy_version 229587 (0.0038) [2024-06-28 12:55:27,924][09190] Fps is (10 sec: 37674.5, 60 sec: 42323.6, 300 sec: 41765.0). Total num frames: 3761700864. Throughput: 0: 42254.5. Samples: 40548140. Policy #0 lag: (min: 0.0, avg: 9.9, max: 26.0) [2024-06-28 12:55:27,924][09190] Avg episode reward: [(0, '0.494')] [2024-06-28 12:55:28,113][09423] Updated weights for policy 0, policy_version 229597 (0.0046) [2024-06-28 12:55:31,229][09423] Updated weights for policy 0, policy_version 229607 (0.0029) [2024-06-28 12:55:32,924][09190] Fps is (10 sec: 44254.5, 60 sec: 43142.8, 300 sec: 41820.5). Total num frames: 3761963008. Throughput: 0: 42664.9. Samples: 40807860. Policy #0 lag: (min: 0.0, avg: 9.9, max: 26.0) [2024-06-28 12:55:32,924][09190] Avg episode reward: [(0, '0.488')] [2024-06-28 12:55:35,691][09423] Updated weights for policy 0, policy_version 229617 (0.0038) [2024-06-28 12:55:37,786][09403] Signal inference workers to stop experience collection... (550 times) [2024-06-28 12:55:37,838][09403] Signal inference workers to resume experience collection... (550 times) [2024-06-28 12:55:37,839][09423] InferenceWorker_p0-w0: stopping experience collection (550 times) [2024-06-28 12:55:37,851][09423] InferenceWorker_p0-w0: resuming experience collection (550 times) [2024-06-28 12:55:37,921][09190] Fps is (10 sec: 45887.0, 60 sec: 42598.4, 300 sec: 41876.4). Total num frames: 3762159616. Throughput: 0: 42481.4. Samples: 41065620. Policy #0 lag: (min: 0.0, avg: 9.9, max: 26.0) [2024-06-28 12:55:37,922][09190] Avg episode reward: [(0, '0.526')] [2024-06-28 12:55:37,987][09403] Saving new best policy, reward=0.526! [2024-06-28 12:55:38,807][09423] Updated weights for policy 0, policy_version 229627 (0.0036) [2024-06-28 12:55:42,921][09190] Fps is (10 sec: 37692.4, 60 sec: 42325.2, 300 sec: 41765.3). Total num frames: 3762339840. Throughput: 0: 42371.5. Samples: 41187420. Policy #0 lag: (min: 0.0, avg: 9.9, max: 26.0) [2024-06-28 12:55:42,922][09190] Avg episode reward: [(0, '0.525')] [2024-06-28 12:55:43,397][09423] Updated weights for policy 0, policy_version 229637 (0.0040) [2024-06-28 12:55:46,416][09423] Updated weights for policy 0, policy_version 229647 (0.0038) [2024-06-28 12:55:47,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42598.5, 300 sec: 41876.4). Total num frames: 3762585600. Throughput: 0: 42539.3. Samples: 41445440. Policy #0 lag: (min: 0.0, avg: 9.9, max: 26.0) [2024-06-28 12:55:47,922][09190] Avg episode reward: [(0, '0.541')] [2024-06-28 12:55:48,029][09403] Saving new best policy, reward=0.541! [2024-06-28 12:55:51,096][09423] Updated weights for policy 0, policy_version 229657 (0.0042) [2024-06-28 12:55:52,921][09190] Fps is (10 sec: 44236.8, 60 sec: 42325.3, 300 sec: 41931.9). Total num frames: 3762782208. Throughput: 0: 42546.7. Samples: 41703520. Policy #0 lag: (min: 0.0, avg: 10.0, max: 25.0) [2024-06-28 12:55:52,922][09190] Avg episode reward: [(0, '0.519')] [2024-06-28 12:55:54,349][09423] Updated weights for policy 0, policy_version 229667 (0.0033) [2024-06-28 12:55:57,921][09190] Fps is (10 sec: 37683.2, 60 sec: 42325.4, 300 sec: 41709.8). Total num frames: 3762962432. Throughput: 0: 42394.3. Samples: 41820200. Policy #0 lag: (min: 0.0, avg: 10.0, max: 25.0) [2024-06-28 12:55:57,922][09190] Avg episode reward: [(0, '0.574')] [2024-06-28 12:55:57,997][09403] Saving new best policy, reward=0.574! [2024-06-28 12:55:58,782][09423] Updated weights for policy 0, policy_version 229677 (0.0035) [2024-06-28 12:56:01,779][09423] Updated weights for policy 0, policy_version 229687 (0.0030) [2024-06-28 12:56:02,922][09190] Fps is (10 sec: 45874.4, 60 sec: 42871.4, 300 sec: 41931.9). Total num frames: 3763240960. Throughput: 0: 42699.1. Samples: 42083380. Policy #0 lag: (min: 0.0, avg: 10.0, max: 25.0) [2024-06-28 12:56:02,922][09190] Avg episode reward: [(0, '0.567')] [2024-06-28 12:56:06,747][09423] Updated weights for policy 0, policy_version 229697 (0.0033) [2024-06-28 12:56:07,921][09190] Fps is (10 sec: 45875.0, 60 sec: 42325.4, 300 sec: 41931.9). Total num frames: 3763421184. Throughput: 0: 42699.5. Samples: 42345940. Policy #0 lag: (min: 0.0, avg: 10.0, max: 25.0) [2024-06-28 12:56:07,922][09190] Avg episode reward: [(0, '0.561')] [2024-06-28 12:56:09,283][09423] Updated weights for policy 0, policy_version 229707 (0.0035) [2024-06-28 12:56:12,921][09190] Fps is (10 sec: 37683.9, 60 sec: 42598.4, 300 sec: 41820.8). Total num frames: 3763617792. Throughput: 0: 42532.1. Samples: 42461980. Policy #0 lag: (min: 0.0, avg: 10.0, max: 25.0) [2024-06-28 12:56:12,922][09190] Avg episode reward: [(0, '0.570')] [2024-06-28 12:56:14,842][09423] Updated weights for policy 0, policy_version 229717 (0.0035) [2024-06-28 12:56:16,961][09423] Updated weights for policy 0, policy_version 229727 (0.0056) [2024-06-28 12:56:17,921][09190] Fps is (10 sec: 45875.1, 60 sec: 42598.5, 300 sec: 42043.0). Total num frames: 3763879936. Throughput: 0: 42557.0. Samples: 42722820. Policy #0 lag: (min: 0.0, avg: 10.0, max: 25.0) [2024-06-28 12:56:17,922][09190] Avg episode reward: [(0, '0.554')] [2024-06-28 12:56:17,959][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000229730_3763896320.pth... [2024-06-28 12:56:18,015][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000229111_3753754624.pth [2024-06-28 12:56:22,327][09423] Updated weights for policy 0, policy_version 229737 (0.0044) [2024-06-28 12:56:22,922][09190] Fps is (10 sec: 44236.1, 60 sec: 42329.8, 300 sec: 41987.4). Total num frames: 3764060160. Throughput: 0: 42776.6. Samples: 42990580. Policy #0 lag: (min: 0.0, avg: 10.0, max: 25.0) [2024-06-28 12:56:22,922][09190] Avg episode reward: [(0, '0.565')] [2024-06-28 12:56:24,452][09423] Updated weights for policy 0, policy_version 229747 (0.0034) [2024-06-28 12:56:27,921][09190] Fps is (10 sec: 39321.2, 60 sec: 42873.2, 300 sec: 41987.4). Total num frames: 3764273152. Throughput: 0: 42587.5. Samples: 43103860. Policy #0 lag: (min: 0.0, avg: 10.0, max: 25.0) [2024-06-28 12:56:27,922][09190] Avg episode reward: [(0, '0.571')] [2024-06-28 12:56:29,821][09423] Updated weights for policy 0, policy_version 229757 (0.0025) [2024-06-28 12:56:31,654][09403] Signal inference workers to stop experience collection... (600 times) [2024-06-28 12:56:31,689][09423] InferenceWorker_p0-w0: stopping experience collection (600 times) [2024-06-28 12:56:31,712][09403] Signal inference workers to resume experience collection... (600 times) [2024-06-28 12:56:31,720][09423] InferenceWorker_p0-w0: resuming experience collection (600 times) [2024-06-28 12:56:32,491][09423] Updated weights for policy 0, policy_version 229767 (0.0032) [2024-06-28 12:56:32,921][09190] Fps is (10 sec: 45876.0, 60 sec: 42600.2, 300 sec: 42209.6). Total num frames: 3764518912. Throughput: 0: 42735.5. Samples: 43368540. Policy #0 lag: (min: 0.0, avg: 10.0, max: 25.0) [2024-06-28 12:56:32,922][09190] Avg episode reward: [(0, '0.534')] [2024-06-28 12:56:37,293][09423] Updated weights for policy 0, policy_version 229777 (0.0026) [2024-06-28 12:56:37,921][09190] Fps is (10 sec: 42598.9, 60 sec: 42325.3, 300 sec: 41987.5). Total num frames: 3764699136. Throughput: 0: 42793.4. Samples: 43629220. Policy #0 lag: (min: 0.0, avg: 10.0, max: 25.0) [2024-06-28 12:56:37,922][09190] Avg episode reward: [(0, '0.576')] [2024-06-28 12:56:37,990][09403] Saving new best policy, reward=0.576! [2024-06-28 12:56:39,969][09423] Updated weights for policy 0, policy_version 229787 (0.0034) [2024-06-28 12:56:42,924][09190] Fps is (10 sec: 39312.6, 60 sec: 42869.9, 300 sec: 42098.2). Total num frames: 3764912128. Throughput: 0: 42929.7. Samples: 43752140. Policy #0 lag: (min: 0.0, avg: 10.0, max: 25.0) [2024-06-28 12:56:42,924][09190] Avg episode reward: [(0, '0.559')] [2024-06-28 12:56:45,255][09423] Updated weights for policy 0, policy_version 229797 (0.0029) [2024-06-28 12:56:47,775][09423] Updated weights for policy 0, policy_version 229807 (0.0034) [2024-06-28 12:56:47,921][09190] Fps is (10 sec: 45874.7, 60 sec: 42871.4, 300 sec: 42265.5). Total num frames: 3765157888. Throughput: 0: 42837.9. Samples: 44011080. Policy #0 lag: (min: 0.0, avg: 10.0, max: 25.0) [2024-06-28 12:56:47,925][09190] Avg episode reward: [(0, '0.537')] [2024-06-28 12:56:52,921][09190] Fps is (10 sec: 39330.9, 60 sec: 42052.4, 300 sec: 41932.0). Total num frames: 3765305344. Throughput: 0: 42793.4. Samples: 44271640. Policy #0 lag: (min: 0.0, avg: 10.0, max: 25.0) [2024-06-28 12:56:52,922][09190] Avg episode reward: [(0, '0.562')] [2024-06-28 12:56:52,953][09423] Updated weights for policy 0, policy_version 229817 (0.0043) [2024-06-28 12:56:55,531][09423] Updated weights for policy 0, policy_version 229827 (0.0038) [2024-06-28 12:56:57,928][09190] Fps is (10 sec: 40933.5, 60 sec: 43412.8, 300 sec: 42208.7). Total num frames: 3765567488. Throughput: 0: 42677.4. Samples: 44382740. Policy #0 lag: (min: 0.0, avg: 10.0, max: 25.0) [2024-06-28 12:56:57,928][09190] Avg episode reward: [(0, '0.562')] [2024-06-28 12:57:00,679][09423] Updated weights for policy 0, policy_version 229837 (0.0053) [2024-06-28 12:57:02,921][09190] Fps is (10 sec: 49151.9, 60 sec: 42598.6, 300 sec: 42265.2). Total num frames: 3765796864. Throughput: 0: 42823.1. Samples: 44649860. Policy #0 lag: (min: 2.0, avg: 9.7, max: 23.0) [2024-06-28 12:57:02,922][09190] Avg episode reward: [(0, '0.565')] [2024-06-28 12:57:03,071][09423] Updated weights for policy 0, policy_version 229847 (0.0024) [2024-06-28 12:57:07,921][09190] Fps is (10 sec: 37707.7, 60 sec: 42052.2, 300 sec: 42043.0). Total num frames: 3765944320. Throughput: 0: 42695.7. Samples: 44911880. Policy #0 lag: (min: 2.0, avg: 9.7, max: 23.0) [2024-06-28 12:57:07,922][09190] Avg episode reward: [(0, '0.585')] [2024-06-28 12:57:07,944][09403] Saving new best policy, reward=0.585! [2024-06-28 12:57:08,450][09423] Updated weights for policy 0, policy_version 229857 (0.0042) [2024-06-28 12:57:10,703][09423] Updated weights for policy 0, policy_version 229867 (0.0040) [2024-06-28 12:57:12,928][09190] Fps is (10 sec: 42570.1, 60 sec: 43412.9, 300 sec: 42264.6). Total num frames: 3766222848. Throughput: 0: 42675.6. Samples: 45024540. Policy #0 lag: (min: 2.0, avg: 9.7, max: 23.0) [2024-06-28 12:57:12,929][09190] Avg episode reward: [(0, '0.553')] [2024-06-28 12:57:15,874][09423] Updated weights for policy 0, policy_version 229877 (0.0040) [2024-06-28 12:57:17,921][09190] Fps is (10 sec: 47514.0, 60 sec: 42325.4, 300 sec: 42320.7). Total num frames: 3766419456. Throughput: 0: 42804.5. Samples: 45294740. Policy #0 lag: (min: 2.0, avg: 9.7, max: 23.0) [2024-06-28 12:57:17,922][09190] Avg episode reward: [(0, '0.558')] [2024-06-28 12:57:18,698][09423] Updated weights for policy 0, policy_version 229887 (0.0038) [2024-06-28 12:57:22,921][09190] Fps is (10 sec: 37708.3, 60 sec: 42325.5, 300 sec: 42209.6). Total num frames: 3766599680. Throughput: 0: 42562.7. Samples: 45544540. Policy #0 lag: (min: 2.0, avg: 9.7, max: 23.0) [2024-06-28 12:57:22,922][09190] Avg episode reward: [(0, '0.583')] [2024-06-28 12:57:23,605][09423] Updated weights for policy 0, policy_version 229897 (0.0036) [2024-06-28 12:57:26,326][09423] Updated weights for policy 0, policy_version 229907 (0.0024) [2024-06-28 12:57:27,921][09190] Fps is (10 sec: 44236.3, 60 sec: 43144.5, 300 sec: 42376.2). Total num frames: 3766861824. Throughput: 0: 42372.3. Samples: 45658800. Policy #0 lag: (min: 2.0, avg: 9.7, max: 23.0) [2024-06-28 12:57:27,922][09190] Avg episode reward: [(0, '0.572')] [2024-06-28 12:57:31,531][09423] Updated weights for policy 0, policy_version 229917 (0.0039) [2024-06-28 12:57:32,921][09190] Fps is (10 sec: 44236.4, 60 sec: 42052.3, 300 sec: 42320.7). Total num frames: 3767042048. Throughput: 0: 42592.9. Samples: 45927760. Policy #0 lag: (min: 2.0, avg: 9.7, max: 23.0) [2024-06-28 12:57:32,922][09190] Avg episode reward: [(0, '0.561')] [2024-06-28 12:57:33,523][09403] Signal inference workers to stop experience collection... (650 times) [2024-06-28 12:57:33,524][09403] Signal inference workers to resume experience collection... (650 times) [2024-06-28 12:57:33,549][09423] InferenceWorker_p0-w0: stopping experience collection (650 times) [2024-06-28 12:57:33,549][09423] InferenceWorker_p0-w0: resuming experience collection (650 times) [2024-06-28 12:57:33,947][09423] Updated weights for policy 0, policy_version 229927 (0.0041) [2024-06-28 12:57:37,921][09190] Fps is (10 sec: 36045.0, 60 sec: 42052.2, 300 sec: 42209.6). Total num frames: 3767222272. Throughput: 0: 42387.9. Samples: 46179100. Policy #0 lag: (min: 2.0, avg: 9.7, max: 23.0) [2024-06-28 12:57:37,922][09190] Avg episode reward: [(0, '0.581')] [2024-06-28 12:57:39,342][09423] Updated weights for policy 0, policy_version 229937 (0.0042) [2024-06-28 12:57:41,804][09423] Updated weights for policy 0, policy_version 229947 (0.0033) [2024-06-28 12:57:42,921][09190] Fps is (10 sec: 47513.9, 60 sec: 43419.3, 300 sec: 42487.3). Total num frames: 3767517184. Throughput: 0: 42657.8. Samples: 46302060. Policy #0 lag: (min: 2.0, avg: 9.7, max: 23.0) [2024-06-28 12:57:42,922][09190] Avg episode reward: [(0, '0.576')] [2024-06-28 12:57:46,810][09423] Updated weights for policy 0, policy_version 229957 (0.0033) [2024-06-28 12:57:47,921][09190] Fps is (10 sec: 44236.8, 60 sec: 41779.2, 300 sec: 42320.7). Total num frames: 3767664640. Throughput: 0: 42605.7. Samples: 46567120. Policy #0 lag: (min: 2.0, avg: 9.7, max: 23.0) [2024-06-28 12:57:47,922][09190] Avg episode reward: [(0, '0.575')] [2024-06-28 12:57:49,494][09423] Updated weights for policy 0, policy_version 229967 (0.0038) [2024-06-28 12:57:52,921][09190] Fps is (10 sec: 34406.2, 60 sec: 42598.3, 300 sec: 42154.1). Total num frames: 3767861248. Throughput: 0: 42320.0. Samples: 46816280. Policy #0 lag: (min: 2.0, avg: 9.7, max: 23.0) [2024-06-28 12:57:52,922][09190] Avg episode reward: [(0, '0.574')] [2024-06-28 12:57:54,831][09423] Updated weights for policy 0, policy_version 229977 (0.0047) [2024-06-28 12:57:57,258][09423] Updated weights for policy 0, policy_version 229987 (0.0029) [2024-06-28 12:57:57,921][09190] Fps is (10 sec: 47513.7, 60 sec: 42876.1, 300 sec: 42542.9). Total num frames: 3768139776. Throughput: 0: 42640.9. Samples: 46943100. Policy #0 lag: (min: 2.0, avg: 9.7, max: 23.0) [2024-06-28 12:57:57,922][09190] Avg episode reward: [(0, '0.598')] [2024-06-28 12:57:58,014][09403] Saving new best policy, reward=0.598! [2024-06-28 12:58:02,313][09423] Updated weights for policy 0, policy_version 229997 (0.0042) [2024-06-28 12:58:02,921][09190] Fps is (10 sec: 44236.5, 60 sec: 41779.1, 300 sec: 42431.8). Total num frames: 3768303616. Throughput: 0: 42343.4. Samples: 47200200. Policy #0 lag: (min: 2.0, avg: 9.7, max: 23.0) [2024-06-28 12:58:02,922][09190] Avg episode reward: [(0, '0.584')] [2024-06-28 12:58:05,163][09423] Updated weights for policy 0, policy_version 230007 (0.0039) [2024-06-28 12:58:07,921][09190] Fps is (10 sec: 37683.2, 60 sec: 42871.5, 300 sec: 42265.2). Total num frames: 3768516608. Throughput: 0: 42420.4. Samples: 47453460. Policy #0 lag: (min: 2.0, avg: 9.7, max: 23.0) [2024-06-28 12:58:07,922][09190] Avg episode reward: [(0, '0.592')] [2024-06-28 12:58:09,767][09423] Updated weights for policy 0, policy_version 230017 (0.0038) [2024-06-28 12:58:12,709][09423] Updated weights for policy 0, policy_version 230027 (0.0037) [2024-06-28 12:58:12,921][09190] Fps is (10 sec: 47513.8, 60 sec: 42603.1, 300 sec: 42542.9). Total num frames: 3768778752. Throughput: 0: 42752.5. Samples: 47582660. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 12:58:12,922][09190] Avg episode reward: [(0, '0.563')] [2024-06-28 12:58:17,603][09423] Updated weights for policy 0, policy_version 230037 (0.0027) [2024-06-28 12:58:17,921][09190] Fps is (10 sec: 40960.2, 60 sec: 41779.2, 300 sec: 42376.2). Total num frames: 3768926208. Throughput: 0: 42469.4. Samples: 47838880. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 12:58:17,922][09190] Avg episode reward: [(0, '0.577')] [2024-06-28 12:58:18,063][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000230038_3768942592.pth... [2024-06-28 12:58:18,110][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000229414_3758718976.pth [2024-06-28 12:58:20,198][09423] Updated weights for policy 0, policy_version 230047 (0.0044) [2024-06-28 12:58:22,922][09190] Fps is (10 sec: 39321.0, 60 sec: 42871.3, 300 sec: 42431.7). Total num frames: 3769171968. Throughput: 0: 42359.0. Samples: 48085260. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 12:58:22,922][09190] Avg episode reward: [(0, '0.566')] [2024-06-28 12:58:25,465][09423] Updated weights for policy 0, policy_version 230057 (0.0029) [2024-06-28 12:58:27,799][09423] Updated weights for policy 0, policy_version 230067 (0.0026) [2024-06-28 12:58:27,924][09190] Fps is (10 sec: 49139.6, 60 sec: 42596.7, 300 sec: 42653.6). Total num frames: 3769417728. Throughput: 0: 42707.4. Samples: 48224000. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 12:58:27,925][09190] Avg episode reward: [(0, '0.580')] [2024-06-28 12:58:32,921][09190] Fps is (10 sec: 39322.5, 60 sec: 42052.3, 300 sec: 42376.2). Total num frames: 3769565184. Throughput: 0: 42474.7. Samples: 48478480. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 12:58:32,922][09190] Avg episode reward: [(0, '0.594')] [2024-06-28 12:58:32,988][09423] Updated weights for policy 0, policy_version 230077 (0.0037) [2024-06-28 12:58:33,836][09403] Signal inference workers to stop experience collection... (700 times) [2024-06-28 12:58:33,882][09423] InferenceWorker_p0-w0: stopping experience collection (700 times) [2024-06-28 12:58:33,954][09403] Signal inference workers to resume experience collection... (700 times) [2024-06-28 12:58:33,954][09423] InferenceWorker_p0-w0: resuming experience collection (700 times) [2024-06-28 12:58:35,717][09423] Updated weights for policy 0, policy_version 230087 (0.0031) [2024-06-28 12:58:37,921][09190] Fps is (10 sec: 40970.4, 60 sec: 43417.6, 300 sec: 42487.7). Total num frames: 3769827328. Throughput: 0: 42307.2. Samples: 48720100. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 12:58:37,922][09190] Avg episode reward: [(0, '0.564')] [2024-06-28 12:58:41,211][09423] Updated weights for policy 0, policy_version 230097 (0.0032) [2024-06-28 12:58:42,921][09190] Fps is (10 sec: 45874.8, 60 sec: 41779.2, 300 sec: 42598.4). Total num frames: 3770023936. Throughput: 0: 42676.0. Samples: 48863520. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 12:58:42,922][09190] Avg episode reward: [(0, '0.589')] [2024-06-28 12:58:43,809][09423] Updated weights for policy 0, policy_version 230107 (0.0037) [2024-06-28 12:58:47,921][09190] Fps is (10 sec: 37682.9, 60 sec: 42325.3, 300 sec: 42320.7). Total num frames: 3770204160. Throughput: 0: 42324.0. Samples: 49104780. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 12:58:47,922][09190] Avg episode reward: [(0, '0.598')] [2024-06-28 12:58:48,719][09423] Updated weights for policy 0, policy_version 230117 (0.0035) [2024-06-28 12:58:51,414][09423] Updated weights for policy 0, policy_version 230127 (0.0038) [2024-06-28 12:58:52,922][09190] Fps is (10 sec: 42597.9, 60 sec: 43144.4, 300 sec: 42487.3). Total num frames: 3770449920. Throughput: 0: 42223.9. Samples: 49353540. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 12:58:52,922][09190] Avg episode reward: [(0, '0.602')] [2024-06-28 12:58:53,037][09403] Saving new best policy, reward=0.602! [2024-06-28 12:58:56,203][09423] Updated weights for policy 0, policy_version 230137 (0.0029) [2024-06-28 12:58:57,921][09190] Fps is (10 sec: 45875.2, 60 sec: 42052.2, 300 sec: 42598.4). Total num frames: 3770662912. Throughput: 0: 42388.9. Samples: 49490160. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 12:58:57,922][09190] Avg episode reward: [(0, '0.562')] [2024-06-28 12:58:58,947][09423] Updated weights for policy 0, policy_version 230147 (0.0031) [2024-06-28 12:59:02,921][09190] Fps is (10 sec: 39322.1, 60 sec: 42325.4, 300 sec: 42320.7). Total num frames: 3770843136. Throughput: 0: 42196.4. Samples: 49737720. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 12:59:02,922][09190] Avg episode reward: [(0, '0.594')] [2024-06-28 12:59:04,053][09423] Updated weights for policy 0, policy_version 230157 (0.0048) [2024-06-28 12:59:06,796][09423] Updated weights for policy 0, policy_version 230167 (0.0039) [2024-06-28 12:59:07,921][09190] Fps is (10 sec: 44236.6, 60 sec: 43144.5, 300 sec: 42709.5). Total num frames: 3771105280. Throughput: 0: 42251.6. Samples: 49986580. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 12:59:07,922][09190] Avg episode reward: [(0, '0.565')] [2024-06-28 12:59:11,814][09423] Updated weights for policy 0, policy_version 230177 (0.0025) [2024-06-28 12:59:12,921][09190] Fps is (10 sec: 42598.4, 60 sec: 41506.1, 300 sec: 42543.2). Total num frames: 3771269120. Throughput: 0: 42230.3. Samples: 50124260. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 12:59:12,922][09190] Avg episode reward: [(0, '0.565')] [2024-06-28 12:59:14,412][09423] Updated weights for policy 0, policy_version 230187 (0.0035) [2024-06-28 12:59:17,924][09190] Fps is (10 sec: 37674.1, 60 sec: 42596.6, 300 sec: 42431.4). Total num frames: 3771482112. Throughput: 0: 42160.3. Samples: 50375800. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 12:59:17,924][09190] Avg episode reward: [(0, '0.574')] [2024-06-28 12:59:19,381][09423] Updated weights for policy 0, policy_version 230197 (0.0036) [2024-06-28 12:59:21,930][09423] Updated weights for policy 0, policy_version 230207 (0.0027) [2024-06-28 12:59:22,921][09190] Fps is (10 sec: 47513.3, 60 sec: 42871.5, 300 sec: 42653.9). Total num frames: 3771744256. Throughput: 0: 42376.8. Samples: 50627060. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2024-06-28 12:59:22,922][09190] Avg episode reward: [(0, '0.580')] [2024-06-28 12:59:27,407][09423] Updated weights for policy 0, policy_version 230217 (0.0033) [2024-06-28 12:59:27,921][09190] Fps is (10 sec: 40970.1, 60 sec: 41234.7, 300 sec: 42431.8). Total num frames: 3771891712. Throughput: 0: 42188.4. Samples: 50762000. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2024-06-28 12:59:27,922][09190] Avg episode reward: [(0, '0.580')] [2024-06-28 12:59:29,848][09423] Updated weights for policy 0, policy_version 230227 (0.0040) [2024-06-28 12:59:32,921][09190] Fps is (10 sec: 39322.0, 60 sec: 42871.4, 300 sec: 42487.3). Total num frames: 3772137472. Throughput: 0: 42354.7. Samples: 51010740. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2024-06-28 12:59:32,922][09190] Avg episode reward: [(0, '0.580')] [2024-06-28 12:59:34,968][09423] Updated weights for policy 0, policy_version 230237 (0.0040) [2024-06-28 12:59:37,206][09403] Signal inference workers to stop experience collection... (750 times) [2024-06-28 12:59:37,235][09423] InferenceWorker_p0-w0: stopping experience collection (750 times) [2024-06-28 12:59:37,259][09403] Signal inference workers to resume experience collection... (750 times) [2024-06-28 12:59:37,264][09423] InferenceWorker_p0-w0: resuming experience collection (750 times) [2024-06-28 12:59:37,407][09423] Updated weights for policy 0, policy_version 230247 (0.0031) [2024-06-28 12:59:37,924][09190] Fps is (10 sec: 49139.9, 60 sec: 42596.6, 300 sec: 42653.6). Total num frames: 3772383232. Throughput: 0: 42260.9. Samples: 51255380. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2024-06-28 12:59:37,924][09190] Avg episode reward: [(0, '0.550')] [2024-06-28 12:59:42,633][09423] Updated weights for policy 0, policy_version 230257 (0.0047) [2024-06-28 12:59:42,921][09190] Fps is (10 sec: 40960.1, 60 sec: 42052.3, 300 sec: 42431.8). Total num frames: 3772547072. Throughput: 0: 42392.5. Samples: 51397820. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2024-06-28 12:59:42,922][09190] Avg episode reward: [(0, '0.593')] [2024-06-28 12:59:45,055][09423] Updated weights for policy 0, policy_version 230267 (0.0033) [2024-06-28 12:59:47,921][09190] Fps is (10 sec: 36053.8, 60 sec: 42325.4, 300 sec: 42376.2). Total num frames: 3772743680. Throughput: 0: 42362.7. Samples: 51644040. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2024-06-28 12:59:47,922][09190] Avg episode reward: [(0, '0.568')] [2024-06-28 12:59:50,172][09423] Updated weights for policy 0, policy_version 230277 (0.0034) [2024-06-28 12:59:52,765][09423] Updated weights for policy 0, policy_version 230287 (0.0037) [2024-06-28 12:59:52,921][09190] Fps is (10 sec: 47513.5, 60 sec: 42871.6, 300 sec: 42709.5). Total num frames: 3773022208. Throughput: 0: 42572.1. Samples: 51902320. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2024-06-28 12:59:52,922][09190] Avg episode reward: [(0, '0.570')] [2024-06-28 12:59:57,621][09423] Updated weights for policy 0, policy_version 230297 (0.0027) [2024-06-28 12:59:57,921][09190] Fps is (10 sec: 44237.0, 60 sec: 42052.3, 300 sec: 42431.8). Total num frames: 3773186048. Throughput: 0: 42688.9. Samples: 52045260. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2024-06-28 12:59:57,922][09190] Avg episode reward: [(0, '0.565')] [2024-06-28 13:00:00,564][09423] Updated weights for policy 0, policy_version 230307 (0.0037) [2024-06-28 13:00:02,921][09190] Fps is (10 sec: 37682.9, 60 sec: 42598.4, 300 sec: 42431.8). Total num frames: 3773399040. Throughput: 0: 42321.9. Samples: 52280180. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2024-06-28 13:00:02,925][09190] Avg episode reward: [(0, '0.566')] [2024-06-28 13:00:05,539][09423] Updated weights for policy 0, policy_version 230317 (0.0026) [2024-06-28 13:00:07,921][09190] Fps is (10 sec: 47513.1, 60 sec: 42598.4, 300 sec: 42709.5). Total num frames: 3773661184. Throughput: 0: 42423.1. Samples: 52536100. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2024-06-28 13:00:07,922][09190] Avg episode reward: [(0, '0.580')] [2024-06-28 13:00:08,093][09423] Updated weights for policy 0, policy_version 230327 (0.0039) [2024-06-28 13:00:12,921][09190] Fps is (10 sec: 39322.3, 60 sec: 42052.4, 300 sec: 42265.2). Total num frames: 3773792256. Throughput: 0: 42413.0. Samples: 52670580. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2024-06-28 13:00:12,922][09190] Avg episode reward: [(0, '0.595')] [2024-06-28 13:00:13,548][09423] Updated weights for policy 0, policy_version 230337 (0.0029) [2024-06-28 13:00:16,031][09423] Updated weights for policy 0, policy_version 230347 (0.0035) [2024-06-28 13:00:17,921][09190] Fps is (10 sec: 39321.5, 60 sec: 42873.2, 300 sec: 42488.2). Total num frames: 3774054400. Throughput: 0: 42181.2. Samples: 52908900. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2024-06-28 13:00:17,922][09190] Avg episode reward: [(0, '0.592')] [2024-06-28 13:00:17,942][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000230350_3774054400.pth... [2024-06-28 13:00:17,995][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000229730_3763896320.pth [2024-06-28 13:00:21,384][09423] Updated weights for policy 0, policy_version 230357 (0.0038) [2024-06-28 13:00:22,921][09190] Fps is (10 sec: 47512.8, 60 sec: 42052.3, 300 sec: 42598.8). Total num frames: 3774267392. Throughput: 0: 42421.0. Samples: 53164220. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2024-06-28 13:00:22,922][09190] Avg episode reward: [(0, '0.566')] [2024-06-28 13:00:23,910][09423] Updated weights for policy 0, policy_version 230367 (0.0025) [2024-06-28 13:00:27,924][09190] Fps is (10 sec: 37674.1, 60 sec: 42323.6, 300 sec: 42265.2). Total num frames: 3774431232. Throughput: 0: 42165.6. Samples: 53295380. Policy #0 lag: (min: 0.0, avg: 7.9, max: 20.0) [2024-06-28 13:00:27,924][09190] Avg episode reward: [(0, '0.565')] [2024-06-28 13:00:28,823][09423] Updated weights for policy 0, policy_version 230377 (0.0032) [2024-06-28 13:00:31,466][09423] Updated weights for policy 0, policy_version 230387 (0.0039) [2024-06-28 13:00:32,928][09190] Fps is (10 sec: 44208.1, 60 sec: 42866.8, 300 sec: 42541.9). Total num frames: 3774709760. Throughput: 0: 42169.5. Samples: 53541940. Policy #0 lag: (min: 1.0, avg: 8.6, max: 21.0) [2024-06-28 13:00:32,928][09190] Avg episode reward: [(0, '0.580')] [2024-06-28 13:00:36,366][09423] Updated weights for policy 0, policy_version 230397 (0.0042) [2024-06-28 13:00:37,921][09190] Fps is (10 sec: 45887.0, 60 sec: 41781.0, 300 sec: 42542.9). Total num frames: 3774889984. Throughput: 0: 42474.2. Samples: 53813660. Policy #0 lag: (min: 1.0, avg: 8.6, max: 21.0) [2024-06-28 13:00:37,922][09190] Avg episode reward: [(0, '0.597')] [2024-06-28 13:00:38,035][09403] Signal inference workers to stop experience collection... (800 times) [2024-06-28 13:00:38,035][09403] Signal inference workers to resume experience collection... (800 times) [2024-06-28 13:00:38,072][09423] InferenceWorker_p0-w0: stopping experience collection (800 times) [2024-06-28 13:00:38,072][09423] InferenceWorker_p0-w0: resuming experience collection (800 times) [2024-06-28 13:00:39,358][09423] Updated weights for policy 0, policy_version 230407 (0.0034) [2024-06-28 13:00:42,922][09190] Fps is (10 sec: 37707.3, 60 sec: 42325.2, 300 sec: 42376.2). Total num frames: 3775086592. Throughput: 0: 41926.5. Samples: 53931960. Policy #0 lag: (min: 1.0, avg: 8.6, max: 21.0) [2024-06-28 13:00:42,922][09190] Avg episode reward: [(0, '0.599')] [2024-06-28 13:00:44,296][09423] Updated weights for policy 0, policy_version 230417 (0.0042) [2024-06-28 13:00:46,889][09423] Updated weights for policy 0, policy_version 230427 (0.0030) [2024-06-28 13:00:47,924][09190] Fps is (10 sec: 47501.2, 60 sec: 43688.8, 300 sec: 42653.6). Total num frames: 3775365120. Throughput: 0: 42361.7. Samples: 54186560. Policy #0 lag: (min: 1.0, avg: 8.6, max: 21.0) [2024-06-28 13:00:47,925][09190] Avg episode reward: [(0, '0.603')] [2024-06-28 13:00:47,932][09403] Saving new best policy, reward=0.603! [2024-06-28 13:00:52,123][09423] Updated weights for policy 0, policy_version 230437 (0.0030) [2024-06-28 13:00:52,928][09190] Fps is (10 sec: 42571.4, 60 sec: 41501.6, 300 sec: 42541.9). Total num frames: 3775512576. Throughput: 0: 42469.5. Samples: 54447500. Policy #0 lag: (min: 1.0, avg: 8.6, max: 21.0) [2024-06-28 13:00:52,928][09190] Avg episode reward: [(0, '0.593')] [2024-06-28 13:00:54,856][09423] Updated weights for policy 0, policy_version 230447 (0.0036) [2024-06-28 13:00:57,921][09190] Fps is (10 sec: 36053.7, 60 sec: 42325.3, 300 sec: 42320.7). Total num frames: 3775725568. Throughput: 0: 42033.6. Samples: 54562100. Policy #0 lag: (min: 1.0, avg: 8.6, max: 21.0) [2024-06-28 13:00:57,922][09190] Avg episode reward: [(0, '0.583')] [2024-06-28 13:00:59,727][09423] Updated weights for policy 0, policy_version 230457 (0.0040) [2024-06-28 13:01:02,472][09423] Updated weights for policy 0, policy_version 230467 (0.0028) [2024-06-28 13:01:02,922][09190] Fps is (10 sec: 47543.8, 60 sec: 43144.5, 300 sec: 42598.4). Total num frames: 3775987712. Throughput: 0: 42495.1. Samples: 54821180. Policy #0 lag: (min: 1.0, avg: 8.6, max: 21.0) [2024-06-28 13:01:02,922][09190] Avg episode reward: [(0, '0.592')] [2024-06-28 13:01:07,819][09423] Updated weights for policy 0, policy_version 230477 (0.0032) [2024-06-28 13:01:07,921][09190] Fps is (10 sec: 40959.8, 60 sec: 41233.0, 300 sec: 42431.8). Total num frames: 3776135168. Throughput: 0: 42609.3. Samples: 55081640. Policy #0 lag: (min: 1.0, avg: 8.6, max: 21.0) [2024-06-28 13:01:07,922][09190] Avg episode reward: [(0, '0.599')] [2024-06-28 13:01:10,247][09423] Updated weights for policy 0, policy_version 230487 (0.0024) [2024-06-28 13:01:12,921][09190] Fps is (10 sec: 36045.5, 60 sec: 42598.4, 300 sec: 42265.2). Total num frames: 3776348160. Throughput: 0: 42181.6. Samples: 55193440. Policy #0 lag: (min: 1.0, avg: 8.6, max: 21.0) [2024-06-28 13:01:12,922][09190] Avg episode reward: [(0, '0.591')] [2024-06-28 13:01:15,627][09423] Updated weights for policy 0, policy_version 230497 (0.0028) [2024-06-28 13:01:17,759][09423] Updated weights for policy 0, policy_version 230507 (0.0029) [2024-06-28 13:01:17,921][09190] Fps is (10 sec: 49152.1, 60 sec: 42871.5, 300 sec: 42598.4). Total num frames: 3776626688. Throughput: 0: 42611.9. Samples: 55459200. Policy #0 lag: (min: 1.0, avg: 8.6, max: 21.0) [2024-06-28 13:01:17,922][09190] Avg episode reward: [(0, '0.592')] [2024-06-28 13:01:22,921][09190] Fps is (10 sec: 42598.1, 60 sec: 41779.2, 300 sec: 42376.3). Total num frames: 3776774144. Throughput: 0: 42336.9. Samples: 55718820. Policy #0 lag: (min: 1.0, avg: 8.6, max: 21.0) [2024-06-28 13:01:22,922][09190] Avg episode reward: [(0, '0.598')] [2024-06-28 13:01:23,267][09423] Updated weights for policy 0, policy_version 230517 (0.0034) [2024-06-28 13:01:25,468][09423] Updated weights for policy 0, policy_version 230527 (0.0032) [2024-06-28 13:01:27,921][09190] Fps is (10 sec: 37683.8, 60 sec: 42873.3, 300 sec: 42320.7). Total num frames: 3777003520. Throughput: 0: 42296.2. Samples: 55835280. Policy #0 lag: (min: 1.0, avg: 8.6, max: 21.0) [2024-06-28 13:01:27,922][09190] Avg episode reward: [(0, '0.594')] [2024-06-28 13:01:30,729][09423] Updated weights for policy 0, policy_version 230537 (0.0045) [2024-06-28 13:01:32,921][09190] Fps is (10 sec: 47513.7, 60 sec: 42330.0, 300 sec: 42542.9). Total num frames: 3777249280. Throughput: 0: 42594.4. Samples: 56103200. Policy #0 lag: (min: 1.0, avg: 8.6, max: 21.0) [2024-06-28 13:01:32,922][09190] Avg episode reward: [(0, '0.588')] [2024-06-28 13:01:33,466][09423] Updated weights for policy 0, policy_version 230547 (0.0038) [2024-06-28 13:01:37,928][09190] Fps is (10 sec: 40933.0, 60 sec: 42047.7, 300 sec: 42375.6). Total num frames: 3777413120. Throughput: 0: 42667.1. Samples: 56367520. Policy #0 lag: (min: 0.0, avg: 8.1, max: 22.0) [2024-06-28 13:01:37,928][09190] Avg episode reward: [(0, '0.579')] [2024-06-28 13:01:38,325][09423] Updated weights for policy 0, policy_version 230557 (0.0028) [2024-06-28 13:01:41,170][09423] Updated weights for policy 0, policy_version 230567 (0.0029) [2024-06-28 13:01:42,922][09190] Fps is (10 sec: 40959.3, 60 sec: 42871.5, 300 sec: 42376.2). Total num frames: 3777658880. Throughput: 0: 42637.3. Samples: 56480780. Policy #0 lag: (min: 0.0, avg: 8.1, max: 22.0) [2024-06-28 13:01:42,922][09190] Avg episode reward: [(0, '0.581')] [2024-06-28 13:01:45,783][09423] Updated weights for policy 0, policy_version 230577 (0.0036) [2024-06-28 13:01:47,921][09190] Fps is (10 sec: 47544.7, 60 sec: 42054.1, 300 sec: 42653.9). Total num frames: 3777888256. Throughput: 0: 42700.6. Samples: 56742700. Policy #0 lag: (min: 0.0, avg: 8.1, max: 22.0) [2024-06-28 13:01:47,922][09190] Avg episode reward: [(0, '0.595')] [2024-06-28 13:01:48,743][09423] Updated weights for policy 0, policy_version 230587 (0.0031) [2024-06-28 13:01:52,922][09190] Fps is (10 sec: 37683.2, 60 sec: 42056.7, 300 sec: 42266.1). Total num frames: 3778035712. Throughput: 0: 42561.3. Samples: 56996900. Policy #0 lag: (min: 0.0, avg: 8.1, max: 22.0) [2024-06-28 13:01:52,922][09190] Avg episode reward: [(0, '0.582')] [2024-06-28 13:01:53,656][09423] Updated weights for policy 0, policy_version 230597 (0.0043) [2024-06-28 13:01:56,638][09423] Updated weights for policy 0, policy_version 230607 (0.0028) [2024-06-28 13:01:57,921][09190] Fps is (10 sec: 42598.3, 60 sec: 43144.6, 300 sec: 42431.8). Total num frames: 3778314240. Throughput: 0: 42540.8. Samples: 57107780. Policy #0 lag: (min: 0.0, avg: 8.1, max: 22.0) [2024-06-28 13:01:57,922][09190] Avg episode reward: [(0, '0.584')] [2024-06-28 13:01:59,888][09403] Signal inference workers to stop experience collection... (850 times) [2024-06-28 13:01:59,890][09403] Signal inference workers to resume experience collection... (850 times) [2024-06-28 13:01:59,934][09423] InferenceWorker_p0-w0: stopping experience collection (850 times) [2024-06-28 13:01:59,934][09423] InferenceWorker_p0-w0: resuming experience collection (850 times) [2024-06-28 13:02:01,471][09423] Updated weights for policy 0, policy_version 230617 (0.0032) [2024-06-28 13:02:02,921][09190] Fps is (10 sec: 44237.7, 60 sec: 41506.3, 300 sec: 42487.3). Total num frames: 3778478080. Throughput: 0: 42379.3. Samples: 57366260. Policy #0 lag: (min: 0.0, avg: 8.1, max: 22.0) [2024-06-28 13:02:02,922][09190] Avg episode reward: [(0, '0.598')] [2024-06-28 13:02:04,341][09423] Updated weights for policy 0, policy_version 230627 (0.0037) [2024-06-28 13:02:07,921][09190] Fps is (10 sec: 36044.6, 60 sec: 42325.4, 300 sec: 42210.6). Total num frames: 3778674688. Throughput: 0: 42402.6. Samples: 57626940. Policy #0 lag: (min: 0.0, avg: 8.1, max: 22.0) [2024-06-28 13:02:07,922][09190] Avg episode reward: [(0, '0.593')] [2024-06-28 13:02:09,011][09423] Updated weights for policy 0, policy_version 230637 (0.0036) [2024-06-28 13:02:11,893][09423] Updated weights for policy 0, policy_version 230647 (0.0037) [2024-06-28 13:02:12,922][09190] Fps is (10 sec: 45874.0, 60 sec: 43144.3, 300 sec: 42431.8). Total num frames: 3778936832. Throughput: 0: 42565.6. Samples: 57750740. Policy #0 lag: (min: 0.0, avg: 8.1, max: 22.0) [2024-06-28 13:02:12,922][09190] Avg episode reward: [(0, '0.579')] [2024-06-28 13:02:16,642][09423] Updated weights for policy 0, policy_version 230657 (0.0033) [2024-06-28 13:02:17,921][09190] Fps is (10 sec: 44236.7, 60 sec: 41506.1, 300 sec: 42431.8). Total num frames: 3779117056. Throughput: 0: 42312.8. Samples: 58007280. Policy #0 lag: (min: 0.0, avg: 8.1, max: 22.0) [2024-06-28 13:02:17,922][09190] Avg episode reward: [(0, '0.588')] [2024-06-28 13:02:17,928][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000230659_3779117056.pth... [2024-06-28 13:02:17,971][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000230038_3768942592.pth [2024-06-28 13:02:19,635][09423] Updated weights for policy 0, policy_version 230667 (0.0044) [2024-06-28 13:02:22,921][09190] Fps is (10 sec: 37683.7, 60 sec: 42325.3, 300 sec: 42209.6). Total num frames: 3779313664. Throughput: 0: 42311.9. Samples: 58271280. Policy #0 lag: (min: 0.0, avg: 8.1, max: 22.0) [2024-06-28 13:02:22,924][09190] Avg episode reward: [(0, '0.602')] [2024-06-28 13:02:24,600][09423] Updated weights for policy 0, policy_version 230677 (0.0045) [2024-06-28 13:02:27,413][09423] Updated weights for policy 0, policy_version 230687 (0.0038) [2024-06-28 13:02:27,921][09190] Fps is (10 sec: 47514.1, 60 sec: 43144.5, 300 sec: 42542.9). Total num frames: 3779592192. Throughput: 0: 42456.1. Samples: 58391300. Policy #0 lag: (min: 0.0, avg: 8.1, max: 22.0) [2024-06-28 13:02:27,922][09190] Avg episode reward: [(0, '0.597')] [2024-06-28 13:02:32,233][09423] Updated weights for policy 0, policy_version 230697 (0.0035) [2024-06-28 13:02:32,921][09190] Fps is (10 sec: 44236.9, 60 sec: 41779.1, 300 sec: 42487.3). Total num frames: 3779756032. Throughput: 0: 42311.5. Samples: 58646720. Policy #0 lag: (min: 0.0, avg: 8.1, max: 22.0) [2024-06-28 13:02:32,922][09190] Avg episode reward: [(0, '0.601')] [2024-06-28 13:02:35,332][09423] Updated weights for policy 0, policy_version 230707 (0.0043) [2024-06-28 13:02:37,924][09190] Fps is (10 sec: 36035.6, 60 sec: 42328.1, 300 sec: 42153.7). Total num frames: 3779952640. Throughput: 0: 42096.8. Samples: 58891360. Policy #0 lag: (min: 0.0, avg: 8.1, max: 22.0) [2024-06-28 13:02:37,925][09190] Avg episode reward: [(0, '0.598')] [2024-06-28 13:02:39,916][09423] Updated weights for policy 0, policy_version 230717 (0.0029) [2024-06-28 13:02:42,921][09190] Fps is (10 sec: 44237.3, 60 sec: 42325.5, 300 sec: 42487.3). Total num frames: 3780198400. Throughput: 0: 42407.6. Samples: 59016120. Policy #0 lag: (min: 0.0, avg: 8.1, max: 22.0) [2024-06-28 13:02:42,922][09190] Avg episode reward: [(0, '0.592')] [2024-06-28 13:02:43,115][09423] Updated weights for policy 0, policy_version 230727 (0.0030) [2024-06-28 13:02:47,669][09423] Updated weights for policy 0, policy_version 230737 (0.0031) [2024-06-28 13:02:47,921][09190] Fps is (10 sec: 45886.7, 60 sec: 42052.2, 300 sec: 42542.9). Total num frames: 3780411392. Throughput: 0: 42326.1. Samples: 59270940. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2024-06-28 13:02:47,922][09190] Avg episode reward: [(0, '0.601')] [2024-06-28 13:02:51,262][09423] Updated weights for policy 0, policy_version 230747 (0.0034) [2024-06-28 13:02:52,922][09190] Fps is (10 sec: 39320.7, 60 sec: 42598.4, 300 sec: 42209.6). Total num frames: 3780591616. Throughput: 0: 41999.0. Samples: 59516900. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2024-06-28 13:02:52,922][09190] Avg episode reward: [(0, '0.600')] [2024-06-28 13:02:54,723][09403] Signal inference workers to stop experience collection... (900 times) [2024-06-28 13:02:54,750][09423] InferenceWorker_p0-w0: stopping experience collection (900 times) [2024-06-28 13:02:54,782][09403] Signal inference workers to resume experience collection... (900 times) [2024-06-28 13:02:54,784][09423] InferenceWorker_p0-w0: resuming experience collection (900 times) [2024-06-28 13:02:55,647][09423] Updated weights for policy 0, policy_version 230757 (0.0037) [2024-06-28 13:02:57,921][09190] Fps is (10 sec: 40960.3, 60 sec: 41779.2, 300 sec: 42431.8). Total num frames: 3780820992. Throughput: 0: 42057.5. Samples: 59643320. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2024-06-28 13:02:57,922][09190] Avg episode reward: [(0, '0.594')] [2024-06-28 13:02:58,694][09423] Updated weights for policy 0, policy_version 230767 (0.0030) [2024-06-28 13:03:02,921][09190] Fps is (10 sec: 44237.4, 60 sec: 42598.3, 300 sec: 42431.8). Total num frames: 3781033984. Throughput: 0: 42300.5. Samples: 59910800. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2024-06-28 13:03:02,922][09190] Avg episode reward: [(0, '0.593')] [2024-06-28 13:03:03,095][09423] Updated weights for policy 0, policy_version 230777 (0.0035) [2024-06-28 13:03:06,258][09423] Updated weights for policy 0, policy_version 230787 (0.0030) [2024-06-28 13:03:07,922][09190] Fps is (10 sec: 42597.7, 60 sec: 42871.4, 300 sec: 42265.1). Total num frames: 3781246976. Throughput: 0: 41864.8. Samples: 60155200. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2024-06-28 13:03:07,922][09190] Avg episode reward: [(0, '0.575')] [2024-06-28 13:03:10,785][09423] Updated weights for policy 0, policy_version 230797 (0.0035) [2024-06-28 13:03:12,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42052.3, 300 sec: 42487.3). Total num frames: 3781459968. Throughput: 0: 42181.7. Samples: 60289480. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2024-06-28 13:03:12,922][09190] Avg episode reward: [(0, '0.589')] [2024-06-28 13:03:14,035][09423] Updated weights for policy 0, policy_version 230807 (0.0033) [2024-06-28 13:03:17,922][09190] Fps is (10 sec: 39321.8, 60 sec: 42052.3, 300 sec: 42265.2). Total num frames: 3781640192. Throughput: 0: 42187.5. Samples: 60545160. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2024-06-28 13:03:17,922][09190] Avg episode reward: [(0, '0.598')] [2024-06-28 13:03:18,785][09423] Updated weights for policy 0, policy_version 230817 (0.0040) [2024-06-28 13:03:21,928][09423] Updated weights for policy 0, policy_version 230827 (0.0033) [2024-06-28 13:03:22,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42871.5, 300 sec: 42265.5). Total num frames: 3781885952. Throughput: 0: 42137.9. Samples: 60787460. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2024-06-28 13:03:22,922][09190] Avg episode reward: [(0, '0.603')] [2024-06-28 13:03:26,326][09423] Updated weights for policy 0, policy_version 230837 (0.0025) [2024-06-28 13:03:27,921][09190] Fps is (10 sec: 44236.9, 60 sec: 41506.1, 300 sec: 42431.8). Total num frames: 3782082560. Throughput: 0: 42423.9. Samples: 60925200. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2024-06-28 13:03:27,922][09190] Avg episode reward: [(0, '0.598')] [2024-06-28 13:03:29,377][09423] Updated weights for policy 0, policy_version 230847 (0.0045) [2024-06-28 13:03:32,921][09190] Fps is (10 sec: 39321.3, 60 sec: 42052.2, 300 sec: 42209.6). Total num frames: 3782279168. Throughput: 0: 42544.4. Samples: 61185440. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2024-06-28 13:03:32,922][09190] Avg episode reward: [(0, '0.601')] [2024-06-28 13:03:33,950][09423] Updated weights for policy 0, policy_version 230857 (0.0049) [2024-06-28 13:03:37,567][09423] Updated weights for policy 0, policy_version 230867 (0.0028) [2024-06-28 13:03:37,921][09190] Fps is (10 sec: 45875.2, 60 sec: 43146.3, 300 sec: 42431.8). Total num frames: 3782541312. Throughput: 0: 42547.6. Samples: 61431540. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2024-06-28 13:03:37,922][09190] Avg episode reward: [(0, '0.603')] [2024-06-28 13:03:41,823][09423] Updated weights for policy 0, policy_version 230877 (0.0032) [2024-06-28 13:03:42,922][09190] Fps is (10 sec: 45874.9, 60 sec: 42325.2, 300 sec: 42487.3). Total num frames: 3782737920. Throughput: 0: 42755.4. Samples: 61567320. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2024-06-28 13:03:42,922][09190] Avg episode reward: [(0, '0.601')] [2024-06-28 13:03:45,413][09423] Updated weights for policy 0, policy_version 230887 (0.0037) [2024-06-28 13:03:47,921][09190] Fps is (10 sec: 36045.2, 60 sec: 41506.2, 300 sec: 42209.6). Total num frames: 3782901760. Throughput: 0: 42260.5. Samples: 61812520. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2024-06-28 13:03:47,922][09190] Avg episode reward: [(0, '0.599')] [2024-06-28 13:03:49,479][09423] Updated weights for policy 0, policy_version 230897 (0.0030) [2024-06-28 13:03:52,921][09190] Fps is (10 sec: 42598.7, 60 sec: 42871.5, 300 sec: 42376.2). Total num frames: 3783163904. Throughput: 0: 42379.2. Samples: 62062260. Policy #0 lag: (min: 0.0, avg: 8.0, max: 21.0) [2024-06-28 13:03:52,922][09190] Avg episode reward: [(0, '0.602')] [2024-06-28 13:03:53,066][09423] Updated weights for policy 0, policy_version 230907 (0.0039) [2024-06-28 13:03:57,500][09423] Updated weights for policy 0, policy_version 230917 (0.0033) [2024-06-28 13:03:57,921][09190] Fps is (10 sec: 45874.9, 60 sec: 42325.3, 300 sec: 42431.8). Total num frames: 3783360512. Throughput: 0: 42451.1. Samples: 62199780. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2024-06-28 13:03:57,922][09190] Avg episode reward: [(0, '0.599')] [2024-06-28 13:04:00,793][09423] Updated weights for policy 0, policy_version 230927 (0.0032) [2024-06-28 13:04:02,921][09190] Fps is (10 sec: 39321.9, 60 sec: 42052.3, 300 sec: 42209.6). Total num frames: 3783557120. Throughput: 0: 42209.8. Samples: 62444600. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2024-06-28 13:04:02,922][09190] Avg episode reward: [(0, '0.592')] [2024-06-28 13:04:05,350][09423] Updated weights for policy 0, policy_version 230937 (0.0043) [2024-06-28 13:04:07,922][09190] Fps is (10 sec: 44236.3, 60 sec: 42598.4, 300 sec: 42487.3). Total num frames: 3783802880. Throughput: 0: 42300.8. Samples: 62691000. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2024-06-28 13:04:07,922][09190] Avg episode reward: [(0, '0.597')] [2024-06-28 13:04:08,438][09423] Updated weights for policy 0, policy_version 230947 (0.0034) [2024-06-28 13:04:10,695][09403] Signal inference workers to stop experience collection... (950 times) [2024-06-28 13:04:10,734][09423] InferenceWorker_p0-w0: stopping experience collection (950 times) [2024-06-28 13:04:10,745][09403] Signal inference workers to resume experience collection... (950 times) [2024-06-28 13:04:10,750][09423] InferenceWorker_p0-w0: resuming experience collection (950 times) [2024-06-28 13:04:12,921][09190] Fps is (10 sec: 42598.7, 60 sec: 42052.3, 300 sec: 42376.6). Total num frames: 3783983104. Throughput: 0: 42272.1. Samples: 62827440. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2024-06-28 13:04:12,922][09190] Avg episode reward: [(0, '0.596')] [2024-06-28 13:04:13,070][09423] Updated weights for policy 0, policy_version 230957 (0.0032) [2024-06-28 13:04:16,251][09423] Updated weights for policy 0, policy_version 230967 (0.0041) [2024-06-28 13:04:17,921][09190] Fps is (10 sec: 37683.9, 60 sec: 42325.4, 300 sec: 42154.1). Total num frames: 3784179712. Throughput: 0: 41831.2. Samples: 63067840. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2024-06-28 13:04:17,922][09190] Avg episode reward: [(0, '0.602')] [2024-06-28 13:04:17,941][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000230968_3784179712.pth... [2024-06-28 13:04:17,993][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000230350_3774054400.pth [2024-06-28 13:04:20,751][09423] Updated weights for policy 0, policy_version 230977 (0.0023) [2024-06-28 13:04:22,921][09190] Fps is (10 sec: 44236.6, 60 sec: 42325.4, 300 sec: 42487.3). Total num frames: 3784425472. Throughput: 0: 41949.8. Samples: 63319280. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2024-06-28 13:04:22,922][09190] Avg episode reward: [(0, '0.584')] [2024-06-28 13:04:24,578][09423] Updated weights for policy 0, policy_version 230987 (0.0037) [2024-06-28 13:04:27,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42052.3, 300 sec: 42265.2). Total num frames: 3784605696. Throughput: 0: 41793.1. Samples: 63448000. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2024-06-28 13:04:27,922][09190] Avg episode reward: [(0, '0.593')] [2024-06-28 13:04:28,899][09423] Updated weights for policy 0, policy_version 230997 (0.0047) [2024-06-28 13:04:32,306][09423] Updated weights for policy 0, policy_version 231007 (0.0031) [2024-06-28 13:04:32,921][09190] Fps is (10 sec: 39321.8, 60 sec: 42325.4, 300 sec: 42154.5). Total num frames: 3784818688. Throughput: 0: 41786.7. Samples: 63692920. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2024-06-28 13:04:32,922][09190] Avg episode reward: [(0, '0.591')] [2024-06-28 13:04:36,502][09423] Updated weights for policy 0, policy_version 231017 (0.0036) [2024-06-28 13:04:37,921][09190] Fps is (10 sec: 42598.1, 60 sec: 41506.1, 300 sec: 42320.7). Total num frames: 3785031680. Throughput: 0: 42017.4. Samples: 63953040. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2024-06-28 13:04:37,922][09190] Avg episode reward: [(0, '0.605')] [2024-06-28 13:04:38,073][09403] Saving new best policy, reward=0.605! [2024-06-28 13:04:39,845][09423] Updated weights for policy 0, policy_version 231027 (0.0036) [2024-06-28 13:04:42,921][09190] Fps is (10 sec: 39321.7, 60 sec: 41233.2, 300 sec: 42265.2). Total num frames: 3785211904. Throughput: 0: 41725.4. Samples: 64077420. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2024-06-28 13:04:42,922][09190] Avg episode reward: [(0, '0.591')] [2024-06-28 13:04:44,461][09423] Updated weights for policy 0, policy_version 231037 (0.0037) [2024-06-28 13:04:47,675][09423] Updated weights for policy 0, policy_version 231047 (0.0042) [2024-06-28 13:04:47,921][09190] Fps is (10 sec: 44236.7, 60 sec: 42871.4, 300 sec: 42209.6). Total num frames: 3785474048. Throughput: 0: 41706.6. Samples: 64321400. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2024-06-28 13:04:47,922][09190] Avg episode reward: [(0, '0.594')] [2024-06-28 13:04:52,058][09423] Updated weights for policy 0, policy_version 231057 (0.0035) [2024-06-28 13:04:52,921][09190] Fps is (10 sec: 44236.4, 60 sec: 41506.2, 300 sec: 42265.2). Total num frames: 3785654272. Throughput: 0: 41986.8. Samples: 64580400. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2024-06-28 13:04:52,922][09190] Avg episode reward: [(0, '0.570')] [2024-06-28 13:04:55,739][09423] Updated weights for policy 0, policy_version 231067 (0.0041) [2024-06-28 13:04:57,921][09190] Fps is (10 sec: 36045.2, 60 sec: 41233.1, 300 sec: 42154.1). Total num frames: 3785834496. Throughput: 0: 41780.4. Samples: 64707560. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2024-06-28 13:04:57,922][09190] Avg episode reward: [(0, '0.595')] [2024-06-28 13:04:59,737][09423] Updated weights for policy 0, policy_version 231077 (0.0037) [2024-06-28 13:05:02,921][09190] Fps is (10 sec: 45874.8, 60 sec: 42598.3, 300 sec: 42209.6). Total num frames: 3786113024. Throughput: 0: 42068.8. Samples: 64960940. Policy #0 lag: (min: 0.0, avg: 7.7, max: 21.0) [2024-06-28 13:05:02,922][09190] Avg episode reward: [(0, '0.591')] [2024-06-28 13:05:03,365][09423] Updated weights for policy 0, policy_version 231087 (0.0046) [2024-06-28 13:05:07,542][09423] Updated weights for policy 0, policy_version 231097 (0.0029) [2024-06-28 13:05:07,921][09190] Fps is (10 sec: 45875.2, 60 sec: 41506.3, 300 sec: 42376.2). Total num frames: 3786293248. Throughput: 0: 42046.7. Samples: 65211380. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 13:05:07,922][09190] Avg episode reward: [(0, '0.595')] [2024-06-28 13:05:10,938][09423] Updated weights for policy 0, policy_version 231107 (0.0044) [2024-06-28 13:05:12,921][09190] Fps is (10 sec: 36045.0, 60 sec: 41506.1, 300 sec: 42098.6). Total num frames: 3786473472. Throughput: 0: 41941.7. Samples: 65335380. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 13:05:12,922][09190] Avg episode reward: [(0, '0.595')] [2024-06-28 13:05:15,312][09423] Updated weights for policy 0, policy_version 231117 (0.0046) [2024-06-28 13:05:17,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42325.3, 300 sec: 42209.6). Total num frames: 3786719232. Throughput: 0: 42014.2. Samples: 65583560. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 13:05:17,922][09190] Avg episode reward: [(0, '0.598')] [2024-06-28 13:05:19,189][09423] Updated weights for policy 0, policy_version 231127 (0.0031) [2024-06-28 13:05:21,675][09403] Signal inference workers to stop experience collection... (1000 times) [2024-06-28 13:05:21,720][09403] Signal inference workers to resume experience collection... (1000 times) [2024-06-28 13:05:21,722][09423] InferenceWorker_p0-w0: stopping experience collection (1000 times) [2024-06-28 13:05:21,740][09423] InferenceWorker_p0-w0: resuming experience collection (1000 times) [2024-06-28 13:05:22,921][09190] Fps is (10 sec: 44236.8, 60 sec: 41506.1, 300 sec: 42321.1). Total num frames: 3786915840. Throughput: 0: 41914.3. Samples: 65839180. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 13:05:22,922][09190] Avg episode reward: [(0, '0.600')] [2024-06-28 13:05:23,259][09423] Updated weights for policy 0, policy_version 231137 (0.0038) [2024-06-28 13:05:26,879][09423] Updated weights for policy 0, policy_version 231147 (0.0038) [2024-06-28 13:05:27,921][09190] Fps is (10 sec: 40959.8, 60 sec: 42052.2, 300 sec: 42099.5). Total num frames: 3787128832. Throughput: 0: 41915.0. Samples: 65963600. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 13:05:27,922][09190] Avg episode reward: [(0, '0.584')] [2024-06-28 13:05:30,935][09423] Updated weights for policy 0, policy_version 231157 (0.0035) [2024-06-28 13:05:32,921][09190] Fps is (10 sec: 45875.1, 60 sec: 42598.3, 300 sec: 42320.7). Total num frames: 3787374592. Throughput: 0: 42308.9. Samples: 66225300. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 13:05:32,922][09190] Avg episode reward: [(0, '0.600')] [2024-06-28 13:05:34,850][09423] Updated weights for policy 0, policy_version 231167 (0.0034) [2024-06-28 13:05:37,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42052.3, 300 sec: 42265.2). Total num frames: 3787554816. Throughput: 0: 42236.9. Samples: 66481060. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 13:05:37,922][09190] Avg episode reward: [(0, '0.604')] [2024-06-28 13:05:38,860][09423] Updated weights for policy 0, policy_version 231177 (0.0026) [2024-06-28 13:05:42,399][09423] Updated weights for policy 0, policy_version 231187 (0.0030) [2024-06-28 13:05:42,922][09190] Fps is (10 sec: 40959.5, 60 sec: 42871.3, 300 sec: 42098.9). Total num frames: 3787784192. Throughput: 0: 42175.4. Samples: 66605460. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 13:05:42,922][09190] Avg episode reward: [(0, '0.602')] [2024-06-28 13:05:46,623][09423] Updated weights for policy 0, policy_version 231197 (0.0035) [2024-06-28 13:05:47,924][09190] Fps is (10 sec: 42587.8, 60 sec: 41777.5, 300 sec: 42265.7). Total num frames: 3787980800. Throughput: 0: 42165.3. Samples: 66858480. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 13:05:47,925][09190] Avg episode reward: [(0, '0.587')] [2024-06-28 13:05:50,127][09423] Updated weights for policy 0, policy_version 231207 (0.0039) [2024-06-28 13:05:52,921][09190] Fps is (10 sec: 40960.4, 60 sec: 42325.3, 300 sec: 42265.2). Total num frames: 3788193792. Throughput: 0: 42303.5. Samples: 67115040. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 13:05:52,922][09190] Avg episode reward: [(0, '0.595')] [2024-06-28 13:05:54,170][09423] Updated weights for policy 0, policy_version 231217 (0.0031) [2024-06-28 13:05:57,685][09423] Updated weights for policy 0, policy_version 231227 (0.0040) [2024-06-28 13:05:57,921][09190] Fps is (10 sec: 44248.0, 60 sec: 43144.5, 300 sec: 42154.1). Total num frames: 3788423168. Throughput: 0: 42381.4. Samples: 67242540. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 13:05:57,922][09190] Avg episode reward: [(0, '0.599')] [2024-06-28 13:06:02,112][09423] Updated weights for policy 0, policy_version 231237 (0.0034) [2024-06-28 13:06:02,921][09190] Fps is (10 sec: 42598.3, 60 sec: 41779.2, 300 sec: 42320.7). Total num frames: 3788619776. Throughput: 0: 42482.1. Samples: 67495260. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 13:06:02,922][09190] Avg episode reward: [(0, '0.605')] [2024-06-28 13:06:05,701][09423] Updated weights for policy 0, policy_version 231247 (0.0045) [2024-06-28 13:06:07,921][09190] Fps is (10 sec: 39321.8, 60 sec: 42052.3, 300 sec: 42265.2). Total num frames: 3788816384. Throughput: 0: 42361.0. Samples: 67745420. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 13:06:07,922][09190] Avg episode reward: [(0, '0.598')] [2024-06-28 13:06:09,947][09423] Updated weights for policy 0, policy_version 231257 (0.0039) [2024-06-28 13:06:12,924][09190] Fps is (10 sec: 42588.2, 60 sec: 42869.7, 300 sec: 42098.2). Total num frames: 3789045760. Throughput: 0: 42282.1. Samples: 67866400. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 13:06:12,924][09190] Avg episode reward: [(0, '0.596')] [2024-06-28 13:06:13,628][09423] Updated weights for policy 0, policy_version 231267 (0.0042) [2024-06-28 13:06:17,504][09423] Updated weights for policy 0, policy_version 231277 (0.0037) [2024-06-28 13:06:17,924][09190] Fps is (10 sec: 42587.4, 60 sec: 42050.5, 300 sec: 42264.8). Total num frames: 3789242368. Throughput: 0: 42152.3. Samples: 68122260. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 13:06:17,932][09190] Avg episode reward: [(0, '0.602')] [2024-06-28 13:06:17,948][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000231278_3789258752.pth... [2024-06-28 13:06:17,991][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000230659_3779117056.pth [2024-06-28 13:06:21,884][09423] Updated weights for policy 0, policy_version 231287 (0.0052) [2024-06-28 13:06:22,921][09190] Fps is (10 sec: 39331.6, 60 sec: 42052.3, 300 sec: 42154.1). Total num frames: 3789438976. Throughput: 0: 42238.3. Samples: 68381780. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 13:06:22,922][09190] Avg episode reward: [(0, '0.603')] [2024-06-28 13:06:25,170][09423] Updated weights for policy 0, policy_version 231297 (0.0034) [2024-06-28 13:06:27,921][09190] Fps is (10 sec: 42609.4, 60 sec: 42325.4, 300 sec: 42098.5). Total num frames: 3789668352. Throughput: 0: 42120.2. Samples: 68500860. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 13:06:27,922][09190] Avg episode reward: [(0, '0.591')] [2024-06-28 13:06:29,383][09423] Updated weights for policy 0, policy_version 231307 (0.0031) [2024-06-28 13:06:32,921][09190] Fps is (10 sec: 42598.5, 60 sec: 41506.2, 300 sec: 42210.6). Total num frames: 3789864960. Throughput: 0: 42186.4. Samples: 68756760. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 13:06:32,922][09190] Avg episode reward: [(0, '0.603')] [2024-06-28 13:06:33,170][09423] Updated weights for policy 0, policy_version 231317 (0.0027) [2024-06-28 13:06:36,907][09423] Updated weights for policy 0, policy_version 231327 (0.0030) [2024-06-28 13:06:37,928][09190] Fps is (10 sec: 40933.1, 60 sec: 42047.7, 300 sec: 42097.6). Total num frames: 3790077952. Throughput: 0: 42157.5. Samples: 69012400. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 13:06:37,929][09190] Avg episode reward: [(0, '0.602')] [2024-06-28 13:06:40,679][09423] Updated weights for policy 0, policy_version 231337 (0.0030) [2024-06-28 13:06:42,921][09190] Fps is (10 sec: 44236.6, 60 sec: 42052.4, 300 sec: 42098.6). Total num frames: 3790307328. Throughput: 0: 42169.8. Samples: 69140180. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 13:06:42,922][09190] Avg episode reward: [(0, '0.605')] [2024-06-28 13:06:44,489][09423] Updated weights for policy 0, policy_version 231347 (0.0023) [2024-06-28 13:06:47,921][09190] Fps is (10 sec: 44265.4, 60 sec: 42327.1, 300 sec: 42320.7). Total num frames: 3790520320. Throughput: 0: 42191.1. Samples: 69393860. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 13:06:47,922][09190] Avg episode reward: [(0, '0.590')] [2024-06-28 13:06:48,259][09423] Updated weights for policy 0, policy_version 231357 (0.0040) [2024-06-28 13:06:52,677][09423] Updated weights for policy 0, policy_version 231367 (0.0033) [2024-06-28 13:06:52,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42052.3, 300 sec: 42043.0). Total num frames: 3790716928. Throughput: 0: 42216.0. Samples: 69645140. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 13:06:52,922][09190] Avg episode reward: [(0, '0.582')] [2024-06-28 13:06:55,794][09423] Updated weights for policy 0, policy_version 231377 (0.0038) [2024-06-28 13:06:57,924][09190] Fps is (10 sec: 40950.1, 60 sec: 41777.5, 300 sec: 42209.3). Total num frames: 3790929920. Throughput: 0: 42220.0. Samples: 69766300. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 13:06:57,924][09190] Avg episode reward: [(0, '0.588')] [2024-06-28 13:07:00,292][09423] Updated weights for policy 0, policy_version 231387 (0.0042) [2024-06-28 13:07:02,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42052.3, 300 sec: 42265.2). Total num frames: 3791142912. Throughput: 0: 42230.8. Samples: 70022540. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 13:07:02,922][09190] Avg episode reward: [(0, '0.596')] [2024-06-28 13:07:04,075][09423] Updated weights for policy 0, policy_version 231397 (0.0034) [2024-06-28 13:07:05,238][09403] Signal inference workers to stop experience collection... (1050 times) [2024-06-28 13:07:05,289][09423] InferenceWorker_p0-w0: stopping experience collection (1050 times) [2024-06-28 13:07:05,294][09403] Signal inference workers to resume experience collection... (1050 times) [2024-06-28 13:07:05,305][09423] InferenceWorker_p0-w0: resuming experience collection (1050 times) [2024-06-28 13:07:07,914][09423] Updated weights for policy 0, policy_version 231407 (0.0035) [2024-06-28 13:07:07,921][09190] Fps is (10 sec: 44248.1, 60 sec: 42598.4, 300 sec: 42154.1). Total num frames: 3791372288. Throughput: 0: 42209.4. Samples: 70281200. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 13:07:07,922][09190] Avg episode reward: [(0, '0.607')] [2024-06-28 13:07:07,930][09403] Saving new best policy, reward=0.607! [2024-06-28 13:07:12,127][09423] Updated weights for policy 0, policy_version 231417 (0.0049) [2024-06-28 13:07:12,924][09190] Fps is (10 sec: 44225.8, 60 sec: 42325.3, 300 sec: 42264.8). Total num frames: 3791585280. Throughput: 0: 42259.4. Samples: 70402640. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 13:07:12,925][09190] Avg episode reward: [(0, '0.599')] [2024-06-28 13:07:16,166][09423] Updated weights for policy 0, policy_version 231427 (0.0042) [2024-06-28 13:07:17,923][09190] Fps is (10 sec: 40954.2, 60 sec: 42326.2, 300 sec: 42265.0). Total num frames: 3791781888. Throughput: 0: 42248.9. Samples: 70658020. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 13:07:17,923][09190] Avg episode reward: [(0, '0.602')] [2024-06-28 13:07:19,621][09423] Updated weights for policy 0, policy_version 231437 (0.0035) [2024-06-28 13:07:22,921][09190] Fps is (10 sec: 40970.0, 60 sec: 42598.4, 300 sec: 42043.0). Total num frames: 3791994880. Throughput: 0: 42165.7. Samples: 70909580. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 13:07:22,922][09190] Avg episode reward: [(0, '0.592')] [2024-06-28 13:07:23,735][09423] Updated weights for policy 0, policy_version 231447 (0.0029) [2024-06-28 13:07:27,132][09423] Updated weights for policy 0, policy_version 231457 (0.0042) [2024-06-28 13:07:27,921][09190] Fps is (10 sec: 44242.5, 60 sec: 42598.3, 300 sec: 42265.2). Total num frames: 3792224256. Throughput: 0: 42215.0. Samples: 71039860. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 13:07:27,922][09190] Avg episode reward: [(0, '0.593')] [2024-06-28 13:07:31,206][09423] Updated weights for policy 0, policy_version 231467 (0.0036) [2024-06-28 13:07:32,924][09190] Fps is (10 sec: 44225.9, 60 sec: 42869.6, 300 sec: 42320.7). Total num frames: 3792437248. Throughput: 0: 42330.2. Samples: 71298820. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 13:07:32,924][09190] Avg episode reward: [(0, '0.604')] [2024-06-28 13:07:34,756][09423] Updated weights for policy 0, policy_version 231477 (0.0036) [2024-06-28 13:07:37,922][09190] Fps is (10 sec: 40959.6, 60 sec: 42602.9, 300 sec: 42154.1). Total num frames: 3792633856. Throughput: 0: 42313.2. Samples: 71549240. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 13:07:37,922][09190] Avg episode reward: [(0, '0.603')] [2024-06-28 13:07:39,459][09423] Updated weights for policy 0, policy_version 231487 (0.0036) [2024-06-28 13:07:42,489][09423] Updated weights for policy 0, policy_version 231497 (0.0038) [2024-06-28 13:07:42,921][09190] Fps is (10 sec: 42609.0, 60 sec: 42598.4, 300 sec: 42209.6). Total num frames: 3792863232. Throughput: 0: 42512.6. Samples: 71679260. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 13:07:42,922][09190] Avg episode reward: [(0, '0.599')] [2024-06-28 13:07:47,134][09423] Updated weights for policy 0, policy_version 231507 (0.0042) [2024-06-28 13:07:47,921][09190] Fps is (10 sec: 40960.7, 60 sec: 42052.3, 300 sec: 42209.7). Total num frames: 3793043456. Throughput: 0: 42440.9. Samples: 71932380. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 13:07:47,922][09190] Avg episode reward: [(0, '0.573')] [2024-06-28 13:07:50,250][09423] Updated weights for policy 0, policy_version 231517 (0.0036) [2024-06-28 13:07:52,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42598.3, 300 sec: 42209.6). Total num frames: 3793272832. Throughput: 0: 42067.9. Samples: 72174260. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 13:07:52,922][09190] Avg episode reward: [(0, '0.594')] [2024-06-28 13:07:55,116][09423] Updated weights for policy 0, policy_version 231527 (0.0027) [2024-06-28 13:07:57,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42327.1, 300 sec: 42154.1). Total num frames: 3793469440. Throughput: 0: 42258.4. Samples: 72304160. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 13:07:57,922][09190] Avg episode reward: [(0, '0.595')] [2024-06-28 13:07:58,191][09423] Updated weights for policy 0, policy_version 231537 (0.0034) [2024-06-28 13:08:02,755][09423] Updated weights for policy 0, policy_version 231547 (0.0037) [2024-06-28 13:08:02,921][09190] Fps is (10 sec: 39321.7, 60 sec: 42052.3, 300 sec: 42098.6). Total num frames: 3793666048. Throughput: 0: 42151.9. Samples: 72554800. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 13:08:02,923][09190] Avg episode reward: [(0, '0.603')] [2024-06-28 13:08:06,204][09423] Updated weights for policy 0, policy_version 231557 (0.0032) [2024-06-28 13:08:07,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42052.2, 300 sec: 42154.1). Total num frames: 3793895424. Throughput: 0: 42161.8. Samples: 72806860. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 13:08:07,922][09190] Avg episode reward: [(0, '0.600')] [2024-06-28 13:08:10,391][09423] Updated weights for policy 0, policy_version 231567 (0.0031) [2024-06-28 13:08:12,921][09190] Fps is (10 sec: 42598.7, 60 sec: 41781.0, 300 sec: 42209.6). Total num frames: 3794092032. Throughput: 0: 42124.5. Samples: 72935460. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 13:08:12,922][09190] Avg episode reward: [(0, '0.598')] [2024-06-28 13:08:14,038][09423] Updated weights for policy 0, policy_version 231577 (0.0028) [2024-06-28 13:08:17,922][09190] Fps is (10 sec: 39320.7, 60 sec: 41780.0, 300 sec: 42043.0). Total num frames: 3794288640. Throughput: 0: 41916.3. Samples: 73184960. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 13:08:17,922][09190] Avg episode reward: [(0, '0.594')] [2024-06-28 13:08:17,985][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000231586_3794305024.pth... [2024-06-28 13:08:18,049][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000230968_3784179712.pth [2024-06-28 13:08:18,227][09423] Updated weights for policy 0, policy_version 231587 (0.0026) [2024-06-28 13:08:21,726][09423] Updated weights for policy 0, policy_version 231597 (0.0027) [2024-06-28 13:08:22,921][09190] Fps is (10 sec: 44236.7, 60 sec: 42325.4, 300 sec: 42209.6). Total num frames: 3794534400. Throughput: 0: 41925.1. Samples: 73435860. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 13:08:22,922][09190] Avg episode reward: [(0, '0.600')] [2024-06-28 13:08:25,980][09423] Updated weights for policy 0, policy_version 231607 (0.0036) [2024-06-28 13:08:27,922][09190] Fps is (10 sec: 45875.1, 60 sec: 42052.1, 300 sec: 42265.1). Total num frames: 3794747392. Throughput: 0: 42008.2. Samples: 73569640. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 13:08:27,922][09190] Avg episode reward: [(0, '0.601')] [2024-06-28 13:08:29,536][09423] Updated weights for policy 0, policy_version 231617 (0.0033) [2024-06-28 13:08:32,921][09190] Fps is (10 sec: 40960.0, 60 sec: 41781.0, 300 sec: 42043.0). Total num frames: 3794944000. Throughput: 0: 41977.8. Samples: 73821380. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 13:08:32,922][09190] Avg episode reward: [(0, '0.605')] [2024-06-28 13:08:33,751][09423] Updated weights for policy 0, policy_version 231627 (0.0033) [2024-06-28 13:08:37,251][09423] Updated weights for policy 0, policy_version 231637 (0.0044) [2024-06-28 13:08:37,921][09190] Fps is (10 sec: 42599.3, 60 sec: 42325.4, 300 sec: 42154.1). Total num frames: 3795173376. Throughput: 0: 42305.3. Samples: 74078000. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 13:08:37,922][09190] Avg episode reward: [(0, '0.604')] [2024-06-28 13:08:41,525][09423] Updated weights for policy 0, policy_version 231647 (0.0029) [2024-06-28 13:08:42,921][09190] Fps is (10 sec: 42598.1, 60 sec: 41779.2, 300 sec: 42265.2). Total num frames: 3795369984. Throughput: 0: 42251.5. Samples: 74205480. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 13:08:42,922][09190] Avg episode reward: [(0, '0.597')] [2024-06-28 13:08:45,204][09423] Updated weights for policy 0, policy_version 231657 (0.0035) [2024-06-28 13:08:47,922][09190] Fps is (10 sec: 40959.6, 60 sec: 42325.2, 300 sec: 42098.5). Total num frames: 3795582976. Throughput: 0: 42112.3. Samples: 74449860. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 13:08:47,922][09190] Avg episode reward: [(0, '0.603')] [2024-06-28 13:08:49,104][09423] Updated weights for policy 0, policy_version 231667 (0.0038) [2024-06-28 13:08:52,781][09423] Updated weights for policy 0, policy_version 231677 (0.0036) [2024-06-28 13:08:52,921][09190] Fps is (10 sec: 42599.0, 60 sec: 42052.4, 300 sec: 42154.1). Total num frames: 3795795968. Throughput: 0: 42275.6. Samples: 74709260. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 13:08:52,922][09190] Avg episode reward: [(0, '0.595')] [2024-06-28 13:08:56,823][09423] Updated weights for policy 0, policy_version 231687 (0.0029) [2024-06-28 13:08:57,921][09190] Fps is (10 sec: 42599.1, 60 sec: 42325.3, 300 sec: 42209.6). Total num frames: 3796008960. Throughput: 0: 42043.5. Samples: 74827420. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 13:08:57,922][09190] Avg episode reward: [(0, '0.601')] [2024-06-28 13:08:59,647][09403] Signal inference workers to stop experience collection... (1100 times) [2024-06-28 13:08:59,688][09423] InferenceWorker_p0-w0: stopping experience collection (1100 times) [2024-06-28 13:08:59,697][09403] Signal inference workers to resume experience collection... (1100 times) [2024-06-28 13:08:59,705][09423] InferenceWorker_p0-w0: resuming experience collection (1100 times) [2024-06-28 13:09:00,815][09423] Updated weights for policy 0, policy_version 231697 (0.0035) [2024-06-28 13:09:02,921][09190] Fps is (10 sec: 44236.2, 60 sec: 42871.4, 300 sec: 42154.1). Total num frames: 3796238336. Throughput: 0: 42359.3. Samples: 75091120. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 13:09:02,922][09190] Avg episode reward: [(0, '0.603')] [2024-06-28 13:09:04,226][09423] Updated weights for policy 0, policy_version 231707 (0.0039) [2024-06-28 13:09:07,922][09190] Fps is (10 sec: 40959.3, 60 sec: 42052.1, 300 sec: 42154.1). Total num frames: 3796418560. Throughput: 0: 42432.3. Samples: 75345320. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 13:09:07,924][09190] Avg episode reward: [(0, '0.572')] [2024-06-28 13:09:08,418][09423] Updated weights for policy 0, policy_version 231717 (0.0044) [2024-06-28 13:09:12,108][09423] Updated weights for policy 0, policy_version 231727 (0.0039) [2024-06-28 13:09:12,921][09190] Fps is (10 sec: 39321.4, 60 sec: 42325.2, 300 sec: 42209.6). Total num frames: 3796631552. Throughput: 0: 42157.0. Samples: 75466700. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 13:09:12,922][09190] Avg episode reward: [(0, '0.595')] [2024-06-28 13:09:15,958][09423] Updated weights for policy 0, policy_version 231737 (0.0046) [2024-06-28 13:09:17,921][09190] Fps is (10 sec: 45876.0, 60 sec: 43144.7, 300 sec: 42209.6). Total num frames: 3796877312. Throughput: 0: 42314.7. Samples: 75725540. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 13:09:17,922][09190] Avg episode reward: [(0, '0.598')] [2024-06-28 13:09:19,683][09423] Updated weights for policy 0, policy_version 231747 (0.0041) [2024-06-28 13:09:22,921][09190] Fps is (10 sec: 40960.6, 60 sec: 41779.2, 300 sec: 42154.1). Total num frames: 3797041152. Throughput: 0: 42292.1. Samples: 75981140. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 13:09:22,922][09190] Avg episode reward: [(0, '0.582')] [2024-06-28 13:09:23,787][09423] Updated weights for policy 0, policy_version 231757 (0.0027) [2024-06-28 13:09:27,647][09423] Updated weights for policy 0, policy_version 231767 (0.0028) [2024-06-28 13:09:27,921][09190] Fps is (10 sec: 39321.3, 60 sec: 42052.4, 300 sec: 42209.6). Total num frames: 3797270528. Throughput: 0: 42158.7. Samples: 76102620. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 13:09:27,922][09190] Avg episode reward: [(0, '0.587')] [2024-06-28 13:09:31,625][09423] Updated weights for policy 0, policy_version 231777 (0.0040) [2024-06-28 13:09:32,921][09190] Fps is (10 sec: 44236.4, 60 sec: 42325.3, 300 sec: 42209.6). Total num frames: 3797483520. Throughput: 0: 42266.3. Samples: 76351840. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 13:09:32,922][09190] Avg episode reward: [(0, '0.589')] [2024-06-28 13:09:35,282][09423] Updated weights for policy 0, policy_version 231787 (0.0035) [2024-06-28 13:09:37,921][09190] Fps is (10 sec: 40960.2, 60 sec: 41779.2, 300 sec: 42265.2). Total num frames: 3797680128. Throughput: 0: 42182.2. Samples: 76607460. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 13:09:37,922][09190] Avg episode reward: [(0, '0.595')] [2024-06-28 13:09:39,543][09423] Updated weights for policy 0, policy_version 231797 (0.0041) [2024-06-28 13:09:42,815][09423] Updated weights for policy 0, policy_version 231807 (0.0027) [2024-06-28 13:09:42,921][09190] Fps is (10 sec: 44237.0, 60 sec: 42598.4, 300 sec: 42209.6). Total num frames: 3797925888. Throughput: 0: 42363.1. Samples: 76733760. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 13:09:42,922][09190] Avg episode reward: [(0, '0.595')] [2024-06-28 13:09:47,081][09423] Updated weights for policy 0, policy_version 231817 (0.0033) [2024-06-28 13:09:47,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42052.4, 300 sec: 42209.6). Total num frames: 3798106112. Throughput: 0: 42307.6. Samples: 76994960. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 13:09:47,922][09190] Avg episode reward: [(0, '0.590')] [2024-06-28 13:09:50,770][09423] Updated weights for policy 0, policy_version 231827 (0.0036) [2024-06-28 13:09:52,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42325.2, 300 sec: 42376.2). Total num frames: 3798335488. Throughput: 0: 42167.7. Samples: 77242860. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 13:09:52,922][09190] Avg episode reward: [(0, '0.604')] [2024-06-28 13:09:54,839][09423] Updated weights for policy 0, policy_version 231837 (0.0039) [2024-06-28 13:09:57,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42052.3, 300 sec: 42098.6). Total num frames: 3798532096. Throughput: 0: 42443.2. Samples: 77376640. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 13:09:57,922][09190] Avg episode reward: [(0, '0.601')] [2024-06-28 13:09:58,567][09423] Updated weights for policy 0, policy_version 231847 (0.0034) [2024-06-28 13:10:02,551][09423] Updated weights for policy 0, policy_version 231857 (0.0038) [2024-06-28 13:10:02,921][09190] Fps is (10 sec: 40960.0, 60 sec: 41779.2, 300 sec: 42209.6). Total num frames: 3798745088. Throughput: 0: 42410.1. Samples: 77634000. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 13:10:02,922][09190] Avg episode reward: [(0, '0.599')] [2024-06-28 13:10:06,454][09423] Updated weights for policy 0, policy_version 231867 (0.0031) [2024-06-28 13:10:07,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42325.4, 300 sec: 42320.7). Total num frames: 3798958080. Throughput: 0: 42214.2. Samples: 77880780. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 13:10:07,922][09190] Avg episode reward: [(0, '0.599')] [2024-06-28 13:10:10,363][09423] Updated weights for policy 0, policy_version 231877 (0.0034) [2024-06-28 13:10:12,924][09190] Fps is (10 sec: 44225.8, 60 sec: 42596.7, 300 sec: 42264.8). Total num frames: 3799187456. Throughput: 0: 42271.0. Samples: 78004920. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 13:10:12,925][09190] Avg episode reward: [(0, '0.601')] [2024-06-28 13:10:14,107][09423] Updated weights for policy 0, policy_version 231887 (0.0035) [2024-06-28 13:10:17,765][09423] Updated weights for policy 0, policy_version 231897 (0.0025) [2024-06-28 13:10:17,921][09190] Fps is (10 sec: 44236.4, 60 sec: 42052.2, 300 sec: 42320.7). Total num frames: 3799400448. Throughput: 0: 42553.8. Samples: 78266760. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 13:10:17,922][09190] Avg episode reward: [(0, '0.602')] [2024-06-28 13:10:17,945][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000231897_3799400448.pth... [2024-06-28 13:10:18,017][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000231278_3789258752.pth [2024-06-28 13:10:21,561][09423] Updated weights for policy 0, policy_version 231907 (0.0034) [2024-06-28 13:10:22,921][09190] Fps is (10 sec: 40970.1, 60 sec: 42598.3, 300 sec: 42265.2). Total num frames: 3799597056. Throughput: 0: 42596.4. Samples: 78524300. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 13:10:22,922][09190] Avg episode reward: [(0, '0.601')] [2024-06-28 13:10:25,682][09423] Updated weights for policy 0, policy_version 231917 (0.0046) [2024-06-28 13:10:27,921][09190] Fps is (10 sec: 42598.8, 60 sec: 42598.4, 300 sec: 42209.6). Total num frames: 3799826432. Throughput: 0: 42577.8. Samples: 78649760. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 13:10:27,922][09190] Avg episode reward: [(0, '0.595')] [2024-06-28 13:10:29,268][09423] Updated weights for policy 0, policy_version 231927 (0.0030) [2024-06-28 13:10:32,921][09190] Fps is (10 sec: 42598.6, 60 sec: 42325.4, 300 sec: 42265.2). Total num frames: 3800023040. Throughput: 0: 42393.3. Samples: 78902660. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 13:10:32,922][09190] Avg episode reward: [(0, '0.598')] [2024-06-28 13:10:33,224][09423] Updated weights for policy 0, policy_version 231937 (0.0032) [2024-06-28 13:10:36,794][09403] Signal inference workers to stop experience collection... (1150 times) [2024-06-28 13:10:36,847][09423] InferenceWorker_p0-w0: stopping experience collection (1150 times) [2024-06-28 13:10:36,855][09403] Signal inference workers to resume experience collection... (1150 times) [2024-06-28 13:10:36,863][09423] InferenceWorker_p0-w0: resuming experience collection (1150 times) [2024-06-28 13:10:36,991][09423] Updated weights for policy 0, policy_version 231947 (0.0038) [2024-06-28 13:10:37,921][09190] Fps is (10 sec: 40960.3, 60 sec: 42598.4, 300 sec: 42209.7). Total num frames: 3800236032. Throughput: 0: 42671.2. Samples: 79163060. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 13:10:37,922][09190] Avg episode reward: [(0, '0.589')] [2024-06-28 13:10:40,930][09423] Updated weights for policy 0, policy_version 231957 (0.0028) [2024-06-28 13:10:42,921][09190] Fps is (10 sec: 44237.1, 60 sec: 42325.4, 300 sec: 42321.1). Total num frames: 3800465408. Throughput: 0: 42534.7. Samples: 79290700. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 13:10:42,922][09190] Avg episode reward: [(0, '0.602')] [2024-06-28 13:10:44,798][09423] Updated weights for policy 0, policy_version 231967 (0.0036) [2024-06-28 13:10:47,922][09190] Fps is (10 sec: 40959.2, 60 sec: 42325.2, 300 sec: 42209.6). Total num frames: 3800645632. Throughput: 0: 42299.0. Samples: 79537460. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 13:10:47,922][09190] Avg episode reward: [(0, '0.604')] [2024-06-28 13:10:48,887][09423] Updated weights for policy 0, policy_version 231977 (0.0031) [2024-06-28 13:10:52,532][09423] Updated weights for policy 0, policy_version 231987 (0.0048) [2024-06-28 13:10:52,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42325.4, 300 sec: 42209.6). Total num frames: 3800875008. Throughput: 0: 42340.9. Samples: 79786120. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 13:10:52,922][09190] Avg episode reward: [(0, '0.598')] [2024-06-28 13:10:56,444][09423] Updated weights for policy 0, policy_version 231997 (0.0038) [2024-06-28 13:10:57,921][09190] Fps is (10 sec: 42599.5, 60 sec: 42325.4, 300 sec: 42209.7). Total num frames: 3801071616. Throughput: 0: 42478.5. Samples: 79916340. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 13:10:57,922][09190] Avg episode reward: [(0, '0.607')] [2024-06-28 13:11:00,107][09423] Updated weights for policy 0, policy_version 232007 (0.0031) [2024-06-28 13:11:02,926][09190] Fps is (10 sec: 42580.0, 60 sec: 42595.4, 300 sec: 42320.1). Total num frames: 3801300992. Throughput: 0: 42446.3. Samples: 80177020. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 13:11:02,926][09190] Avg episode reward: [(0, '0.599')] [2024-06-28 13:11:04,079][09423] Updated weights for policy 0, policy_version 232017 (0.0039) [2024-06-28 13:11:07,800][09423] Updated weights for policy 0, policy_version 232027 (0.0030) [2024-06-28 13:11:07,922][09190] Fps is (10 sec: 45873.9, 60 sec: 42871.3, 300 sec: 42321.0). Total num frames: 3801530368. Throughput: 0: 42333.2. Samples: 80429300. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 13:11:07,922][09190] Avg episode reward: [(0, '0.599')] [2024-06-28 13:11:11,643][09423] Updated weights for policy 0, policy_version 232037 (0.0036) [2024-06-28 13:11:12,928][09190] Fps is (10 sec: 40950.6, 60 sec: 42049.4, 300 sec: 42264.6). Total num frames: 3801710592. Throughput: 0: 42477.8. Samples: 80561540. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 13:11:12,929][09190] Avg episode reward: [(0, '0.601')] [2024-06-28 13:11:15,702][09423] Updated weights for policy 0, policy_version 232047 (0.0051) [2024-06-28 13:11:17,921][09190] Fps is (10 sec: 40960.4, 60 sec: 42325.3, 300 sec: 42376.2). Total num frames: 3801939968. Throughput: 0: 42412.8. Samples: 80811240. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 13:11:17,931][09190] Avg episode reward: [(0, '0.634')] [2024-06-28 13:11:17,947][09403] Saving new best policy, reward=0.634! [2024-06-28 13:11:19,210][09423] Updated weights for policy 0, policy_version 232057 (0.0023) [2024-06-28 13:11:22,921][09190] Fps is (10 sec: 42626.4, 60 sec: 42325.4, 300 sec: 42265.2). Total num frames: 3802136576. Throughput: 0: 42319.0. Samples: 81067420. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 13:11:22,922][09190] Avg episode reward: [(0, '0.586')] [2024-06-28 13:11:23,777][09423] Updated weights for policy 0, policy_version 232067 (0.0040) [2024-06-28 13:11:27,475][09423] Updated weights for policy 0, policy_version 232077 (0.0039) [2024-06-28 13:11:27,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42052.2, 300 sec: 42320.7). Total num frames: 3802349568. Throughput: 0: 42137.7. Samples: 81186900. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 13:11:27,922][09190] Avg episode reward: [(0, '0.595')] [2024-06-28 13:11:31,285][09423] Updated weights for policy 0, policy_version 232087 (0.0034) [2024-06-28 13:11:32,921][09190] Fps is (10 sec: 44237.2, 60 sec: 42598.5, 300 sec: 42377.2). Total num frames: 3802578944. Throughput: 0: 42274.0. Samples: 81439780. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 13:11:32,922][09190] Avg episode reward: [(0, '0.600')] [2024-06-28 13:11:35,710][09423] Updated weights for policy 0, policy_version 232097 (0.0040) [2024-06-28 13:11:37,922][09190] Fps is (10 sec: 40959.7, 60 sec: 42052.1, 300 sec: 42209.6). Total num frames: 3802759168. Throughput: 0: 42355.8. Samples: 81692140. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 13:11:37,922][09190] Avg episode reward: [(0, '0.602')] [2024-06-28 13:11:39,374][09423] Updated weights for policy 0, policy_version 232107 (0.0022) [2024-06-28 13:11:42,921][09190] Fps is (10 sec: 40959.6, 60 sec: 42052.2, 300 sec: 42265.2). Total num frames: 3802988544. Throughput: 0: 42215.4. Samples: 81816040. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 13:11:42,922][09190] Avg episode reward: [(0, '0.601')] [2024-06-28 13:11:43,363][09423] Updated weights for policy 0, policy_version 232117 (0.0025) [2024-06-28 13:11:47,088][09423] Updated weights for policy 0, policy_version 232127 (0.0043) [2024-06-28 13:11:47,921][09190] Fps is (10 sec: 42599.3, 60 sec: 42325.5, 300 sec: 42265.2). Total num frames: 3803185152. Throughput: 0: 42103.6. Samples: 82071500. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 13:11:47,922][09190] Avg episode reward: [(0, '0.587')] [2024-06-28 13:11:50,943][09423] Updated weights for policy 0, policy_version 232137 (0.0033) [2024-06-28 13:11:52,921][09190] Fps is (10 sec: 42598.7, 60 sec: 42325.4, 300 sec: 42321.1). Total num frames: 3803414528. Throughput: 0: 42170.9. Samples: 82326980. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 13:11:52,922][09190] Avg episode reward: [(0, '0.602')] [2024-06-28 13:11:54,573][09423] Updated weights for policy 0, policy_version 232147 (0.0051) [2024-06-28 13:11:57,921][09190] Fps is (10 sec: 44236.0, 60 sec: 42598.2, 300 sec: 42320.7). Total num frames: 3803627520. Throughput: 0: 42110.1. Samples: 82456220. Policy #0 lag: (min: 0.0, avg: 10.6, max: 21.0) [2024-06-28 13:11:57,922][09190] Avg episode reward: [(0, '0.592')] [2024-06-28 13:11:58,642][09423] Updated weights for policy 0, policy_version 232157 (0.0030) [2024-06-28 13:12:02,497][09423] Updated weights for policy 0, policy_version 232167 (0.0035) [2024-06-28 13:12:02,926][09190] Fps is (10 sec: 40941.2, 60 sec: 42052.1, 300 sec: 42209.0). Total num frames: 3803824128. Throughput: 0: 42142.9. Samples: 82707860. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 13:12:02,926][09190] Avg episode reward: [(0, '0.599')] [2024-06-28 13:12:06,462][09423] Updated weights for policy 0, policy_version 232177 (0.0037) [2024-06-28 13:12:07,921][09190] Fps is (10 sec: 42598.8, 60 sec: 42052.4, 300 sec: 42265.5). Total num frames: 3804053504. Throughput: 0: 42020.0. Samples: 82958320. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 13:12:07,922][09190] Avg episode reward: [(0, '0.597')] [2024-06-28 13:12:10,485][09423] Updated weights for policy 0, policy_version 232187 (0.0035) [2024-06-28 13:12:12,924][09190] Fps is (10 sec: 44245.8, 60 sec: 42601.3, 300 sec: 42320.5). Total num frames: 3804266496. Throughput: 0: 42234.6. Samples: 83087560. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 13:12:12,924][09190] Avg episode reward: [(0, '0.576')] [2024-06-28 13:12:14,527][09423] Updated weights for policy 0, policy_version 232197 (0.0043) [2024-06-28 13:12:15,235][09403] Signal inference workers to stop experience collection... (1200 times) [2024-06-28 13:12:15,240][09403] Signal inference workers to resume experience collection... (1200 times) [2024-06-28 13:12:15,254][09423] InferenceWorker_p0-w0: stopping experience collection (1200 times) [2024-06-28 13:12:15,289][09423] InferenceWorker_p0-w0: resuming experience collection (1200 times) [2024-06-28 13:12:17,892][09423] Updated weights for policy 0, policy_version 232207 (0.0030) [2024-06-28 13:12:17,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42325.4, 300 sec: 42320.7). Total num frames: 3804479488. Throughput: 0: 42290.1. Samples: 83342840. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 13:12:17,922][09190] Avg episode reward: [(0, '0.599')] [2024-06-28 13:12:17,941][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000232207_3804479488.pth... [2024-06-28 13:12:17,993][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000231586_3794305024.pth [2024-06-28 13:12:22,177][09423] Updated weights for policy 0, policy_version 232217 (0.0050) [2024-06-28 13:12:22,921][09190] Fps is (10 sec: 42609.0, 60 sec: 42598.4, 300 sec: 42265.2). Total num frames: 3804692480. Throughput: 0: 42332.1. Samples: 83597080. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 13:12:22,922][09190] Avg episode reward: [(0, '0.607')] [2024-06-28 13:12:25,457][09423] Updated weights for policy 0, policy_version 232227 (0.0039) [2024-06-28 13:12:27,924][09190] Fps is (10 sec: 40949.8, 60 sec: 42323.6, 300 sec: 42209.6). Total num frames: 3804889088. Throughput: 0: 42308.8. Samples: 83720040. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 13:12:27,925][09190] Avg episode reward: [(0, '0.605')] [2024-06-28 13:12:29,829][09423] Updated weights for policy 0, policy_version 232237 (0.0031) [2024-06-28 13:12:32,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42052.2, 300 sec: 42265.2). Total num frames: 3805102080. Throughput: 0: 42378.1. Samples: 83978520. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 13:12:32,922][09190] Avg episode reward: [(0, '0.603')] [2024-06-28 13:12:33,469][09423] Updated weights for policy 0, policy_version 232247 (0.0042) [2024-06-28 13:12:37,331][09423] Updated weights for policy 0, policy_version 232257 (0.0029) [2024-06-28 13:12:37,921][09190] Fps is (10 sec: 42609.1, 60 sec: 42598.5, 300 sec: 42209.6). Total num frames: 3805315072. Throughput: 0: 42484.8. Samples: 84238800. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 13:12:37,922][09190] Avg episode reward: [(0, '0.607')] [2024-06-28 13:12:40,959][09423] Updated weights for policy 0, policy_version 232267 (0.0024) [2024-06-28 13:12:42,924][09190] Fps is (10 sec: 44225.9, 60 sec: 42596.6, 300 sec: 42375.9). Total num frames: 3805544448. Throughput: 0: 42428.8. Samples: 84365620. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 13:12:42,924][09190] Avg episode reward: [(0, '0.603')] [2024-06-28 13:12:44,935][09423] Updated weights for policy 0, policy_version 232277 (0.0031) [2024-06-28 13:12:47,921][09190] Fps is (10 sec: 40960.3, 60 sec: 42325.3, 300 sec: 42209.6). Total num frames: 3805724672. Throughput: 0: 42413.2. Samples: 84616260. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 13:12:47,922][09190] Avg episode reward: [(0, '0.597')] [2024-06-28 13:12:49,184][09423] Updated weights for policy 0, policy_version 232287 (0.0031) [2024-06-28 13:12:52,833][09423] Updated weights for policy 0, policy_version 232297 (0.0046) [2024-06-28 13:12:52,921][09190] Fps is (10 sec: 40969.9, 60 sec: 42325.2, 300 sec: 42320.7). Total num frames: 3805954048. Throughput: 0: 42544.8. Samples: 84872840. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 13:12:52,922][09190] Avg episode reward: [(0, '0.600')] [2024-06-28 13:12:56,788][09423] Updated weights for policy 0, policy_version 232307 (0.0045) [2024-06-28 13:12:57,921][09190] Fps is (10 sec: 44236.5, 60 sec: 42325.4, 300 sec: 42376.2). Total num frames: 3806167040. Throughput: 0: 42643.7. Samples: 85006420. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 13:12:57,922][09190] Avg episode reward: [(0, '0.607')] [2024-06-28 13:13:00,305][09423] Updated weights for policy 0, policy_version 232317 (0.0040) [2024-06-28 13:13:02,926][09190] Fps is (10 sec: 42581.2, 60 sec: 42598.7, 300 sec: 42320.1). Total num frames: 3806380032. Throughput: 0: 42463.7. Samples: 85253880. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 13:13:02,926][09190] Avg episode reward: [(0, '0.601')] [2024-06-28 13:13:04,315][09423] Updated weights for policy 0, policy_version 232327 (0.0034) [2024-06-28 13:13:07,924][09190] Fps is (10 sec: 42587.6, 60 sec: 42323.5, 300 sec: 42375.9). Total num frames: 3806593024. Throughput: 0: 42450.5. Samples: 85507460. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 13:13:07,925][09190] Avg episode reward: [(0, '0.594')] [2024-06-28 13:13:08,150][09423] Updated weights for policy 0, policy_version 232337 (0.0031) [2024-06-28 13:13:12,263][09423] Updated weights for policy 0, policy_version 232347 (0.0041) [2024-06-28 13:13:12,921][09190] Fps is (10 sec: 42616.2, 60 sec: 42327.1, 300 sec: 42431.8). Total num frames: 3806806016. Throughput: 0: 42565.1. Samples: 85635360. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-28 13:13:12,922][09190] Avg episode reward: [(0, '0.600')] [2024-06-28 13:13:16,306][09423] Updated weights for policy 0, policy_version 232357 (0.0035) [2024-06-28 13:13:17,921][09190] Fps is (10 sec: 40970.5, 60 sec: 42052.3, 300 sec: 42265.2). Total num frames: 3807002624. Throughput: 0: 42392.9. Samples: 85886200. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-28 13:13:17,922][09190] Avg episode reward: [(0, '0.599')] [2024-06-28 13:13:20,133][09423] Updated weights for policy 0, policy_version 232367 (0.0048) [2024-06-28 13:13:22,921][09190] Fps is (10 sec: 42597.8, 60 sec: 42325.3, 300 sec: 42320.7). Total num frames: 3807232000. Throughput: 0: 42115.5. Samples: 86134000. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-28 13:13:22,922][09190] Avg episode reward: [(0, '0.599')] [2024-06-28 13:13:23,910][09423] Updated weights for policy 0, policy_version 232377 (0.0028) [2024-06-28 13:13:27,733][09423] Updated weights for policy 0, policy_version 232387 (0.0031) [2024-06-28 13:13:27,921][09190] Fps is (10 sec: 44236.2, 60 sec: 42600.1, 300 sec: 42376.2). Total num frames: 3807444992. Throughput: 0: 42290.7. Samples: 86268600. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-28 13:13:27,922][09190] Avg episode reward: [(0, '0.597')] [2024-06-28 13:13:31,585][09423] Updated weights for policy 0, policy_version 232397 (0.0044) [2024-06-28 13:13:32,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42325.3, 300 sec: 42265.2). Total num frames: 3807641600. Throughput: 0: 42455.0. Samples: 86526740. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-28 13:13:32,922][09190] Avg episode reward: [(0, '0.599')] [2024-06-28 13:13:35,496][09423] Updated weights for policy 0, policy_version 232407 (0.0034) [2024-06-28 13:13:36,107][09403] Signal inference workers to stop experience collection... (1250 times) [2024-06-28 13:13:36,145][09423] InferenceWorker_p0-w0: stopping experience collection (1250 times) [2024-06-28 13:13:36,163][09403] Signal inference workers to resume experience collection... (1250 times) [2024-06-28 13:13:36,166][09423] InferenceWorker_p0-w0: resuming experience collection (1250 times) [2024-06-28 13:13:37,921][09190] Fps is (10 sec: 42598.6, 60 sec: 42598.4, 300 sec: 42376.2). Total num frames: 3807870976. Throughput: 0: 42214.7. Samples: 86772500. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-28 13:13:37,922][09190] Avg episode reward: [(0, '0.601')] [2024-06-28 13:13:39,406][09423] Updated weights for policy 0, policy_version 232417 (0.0038) [2024-06-28 13:13:42,921][09190] Fps is (10 sec: 40960.1, 60 sec: 41780.9, 300 sec: 42265.2). Total num frames: 3808051200. Throughput: 0: 42182.7. Samples: 86904640. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-28 13:13:42,922][09190] Avg episode reward: [(0, '0.604')] [2024-06-28 13:13:43,447][09423] Updated weights for policy 0, policy_version 232427 (0.0044) [2024-06-28 13:13:46,973][09423] Updated weights for policy 0, policy_version 232437 (0.0036) [2024-06-28 13:13:47,921][09190] Fps is (10 sec: 39322.0, 60 sec: 42325.3, 300 sec: 42265.2). Total num frames: 3808264192. Throughput: 0: 42236.4. Samples: 87154340. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-28 13:13:47,922][09190] Avg episode reward: [(0, '0.603')] [2024-06-28 13:13:51,235][09423] Updated weights for policy 0, policy_version 232447 (0.0029) [2024-06-28 13:13:52,921][09190] Fps is (10 sec: 44236.7, 60 sec: 42325.4, 300 sec: 42320.7). Total num frames: 3808493568. Throughput: 0: 42200.6. Samples: 87406380. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-28 13:13:52,922][09190] Avg episode reward: [(0, '0.603')] [2024-06-28 13:13:54,628][09423] Updated weights for policy 0, policy_version 232457 (0.0028) [2024-06-28 13:13:57,924][09190] Fps is (10 sec: 44225.5, 60 sec: 42323.6, 300 sec: 42264.8). Total num frames: 3808706560. Throughput: 0: 42324.3. Samples: 87540060. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-28 13:13:57,924][09190] Avg episode reward: [(0, '0.603')] [2024-06-28 13:13:58,724][09423] Updated weights for policy 0, policy_version 232467 (0.0028) [2024-06-28 13:14:02,576][09423] Updated weights for policy 0, policy_version 232477 (0.0046) [2024-06-28 13:14:02,921][09190] Fps is (10 sec: 40959.7, 60 sec: 42055.1, 300 sec: 42320.7). Total num frames: 3808903168. Throughput: 0: 42372.4. Samples: 87792960. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-28 13:14:02,922][09190] Avg episode reward: [(0, '0.600')] [2024-06-28 13:14:06,440][09423] Updated weights for policy 0, policy_version 232487 (0.0032) [2024-06-28 13:14:07,924][09190] Fps is (10 sec: 44236.8, 60 sec: 42598.4, 300 sec: 42431.4). Total num frames: 3809148928. Throughput: 0: 42374.6. Samples: 88040960. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-28 13:14:07,924][09190] Avg episode reward: [(0, '0.604')] [2024-06-28 13:14:10,164][09423] Updated weights for policy 0, policy_version 232497 (0.0028) [2024-06-28 13:14:12,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42052.2, 300 sec: 42209.6). Total num frames: 3809329152. Throughput: 0: 42348.9. Samples: 88174300. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-28 13:14:12,922][09190] Avg episode reward: [(0, '0.608')] [2024-06-28 13:14:14,034][09423] Updated weights for policy 0, policy_version 232507 (0.0030) [2024-06-28 13:14:17,921][09190] Fps is (10 sec: 39331.6, 60 sec: 42325.3, 300 sec: 42376.2). Total num frames: 3809542144. Throughput: 0: 42281.8. Samples: 88429420. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-28 13:14:17,922][09190] Avg episode reward: [(0, '0.604')] [2024-06-28 13:14:18,023][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000232517_3809558528.pth... [2024-06-28 13:14:18,026][09423] Updated weights for policy 0, policy_version 232517 (0.0033) [2024-06-28 13:14:18,081][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000231897_3799400448.pth [2024-06-28 13:14:22,032][09423] Updated weights for policy 0, policy_version 232527 (0.0028) [2024-06-28 13:14:22,921][09190] Fps is (10 sec: 44236.7, 60 sec: 42325.3, 300 sec: 42376.2). Total num frames: 3809771520. Throughput: 0: 42289.7. Samples: 88675540. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 13:14:22,922][09190] Avg episode reward: [(0, '0.603')] [2024-06-28 13:14:25,583][09423] Updated weights for policy 0, policy_version 232537 (0.0026) [2024-06-28 13:14:27,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42052.4, 300 sec: 42320.7). Total num frames: 3809968128. Throughput: 0: 42292.9. Samples: 88807820. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 13:14:27,922][09190] Avg episode reward: [(0, '0.599')] [2024-06-28 13:14:29,838][09423] Updated weights for policy 0, policy_version 232547 (0.0039) [2024-06-28 13:14:32,921][09190] Fps is (10 sec: 40960.3, 60 sec: 42325.3, 300 sec: 42376.2). Total num frames: 3810181120. Throughput: 0: 42241.7. Samples: 89055220. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 13:14:32,922][09190] Avg episode reward: [(0, '0.601')] [2024-06-28 13:14:33,179][09423] Updated weights for policy 0, policy_version 232557 (0.0024) [2024-06-28 13:14:37,468][09423] Updated weights for policy 0, policy_version 232567 (0.0050) [2024-06-28 13:14:37,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42052.4, 300 sec: 42265.2). Total num frames: 3810394112. Throughput: 0: 42378.3. Samples: 89313400. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 13:14:37,922][09190] Avg episode reward: [(0, '0.604')] [2024-06-28 13:14:40,986][09423] Updated weights for policy 0, policy_version 232577 (0.0036) [2024-06-28 13:14:42,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42325.3, 300 sec: 42320.7). Total num frames: 3810590720. Throughput: 0: 42221.5. Samples: 89439920. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 13:14:42,922][09190] Avg episode reward: [(0, '0.602')] [2024-06-28 13:14:45,277][09423] Updated weights for policy 0, policy_version 232587 (0.0045) [2024-06-28 13:14:47,922][09190] Fps is (10 sec: 40959.2, 60 sec: 42325.2, 300 sec: 42265.2). Total num frames: 3810803712. Throughput: 0: 42277.7. Samples: 89695460. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 13:14:47,922][09190] Avg episode reward: [(0, '0.597')] [2024-06-28 13:14:49,085][09423] Updated weights for policy 0, policy_version 232597 (0.0035) [2024-06-28 13:14:52,897][09423] Updated weights for policy 0, policy_version 232607 (0.0034) [2024-06-28 13:14:52,921][09190] Fps is (10 sec: 44236.6, 60 sec: 42325.3, 300 sec: 42376.2). Total num frames: 3811033088. Throughput: 0: 42448.5. Samples: 89951040. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 13:14:52,922][09190] Avg episode reward: [(0, '0.597')] [2024-06-28 13:14:56,564][09423] Updated weights for policy 0, policy_version 232617 (0.0033) [2024-06-28 13:14:57,922][09190] Fps is (10 sec: 44236.7, 60 sec: 42327.0, 300 sec: 42376.2). Total num frames: 3811246080. Throughput: 0: 42179.0. Samples: 90072360. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 13:14:57,922][09190] Avg episode reward: [(0, '0.603')] [2024-06-28 13:14:58,394][09403] Signal inference workers to stop experience collection... (1300 times) [2024-06-28 13:14:58,426][09423] InferenceWorker_p0-w0: stopping experience collection (1300 times) [2024-06-28 13:14:58,448][09403] Signal inference workers to resume experience collection... (1300 times) [2024-06-28 13:14:58,474][09423] InferenceWorker_p0-w0: resuming experience collection (1300 times) [2024-06-28 13:15:00,663][09423] Updated weights for policy 0, policy_version 232627 (0.0033) [2024-06-28 13:15:02,921][09190] Fps is (10 sec: 42598.8, 60 sec: 42598.5, 300 sec: 42376.2). Total num frames: 3811459072. Throughput: 0: 42097.3. Samples: 90323800. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 13:15:02,922][09190] Avg episode reward: [(0, '0.604')] [2024-06-28 13:15:04,351][09423] Updated weights for policy 0, policy_version 232637 (0.0034) [2024-06-28 13:15:07,921][09190] Fps is (10 sec: 40960.9, 60 sec: 41781.0, 300 sec: 42265.5). Total num frames: 3811655680. Throughput: 0: 42439.3. Samples: 90585300. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 13:15:07,922][09190] Avg episode reward: [(0, '0.590')] [2024-06-28 13:15:08,624][09423] Updated weights for policy 0, policy_version 232647 (0.0038) [2024-06-28 13:15:11,998][09423] Updated weights for policy 0, policy_version 232657 (0.0028) [2024-06-28 13:15:12,921][09190] Fps is (10 sec: 44236.6, 60 sec: 42871.5, 300 sec: 42376.3). Total num frames: 3811901440. Throughput: 0: 42292.8. Samples: 90711000. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 13:15:12,922][09190] Avg episode reward: [(0, '0.601')] [2024-06-28 13:15:16,174][09423] Updated weights for policy 0, policy_version 232667 (0.0052) [2024-06-28 13:15:17,921][09190] Fps is (10 sec: 42598.1, 60 sec: 42325.3, 300 sec: 42320.7). Total num frames: 3812081664. Throughput: 0: 42275.1. Samples: 90957600. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 13:15:17,922][09190] Avg episode reward: [(0, '0.582')] [2024-06-28 13:15:19,923][09423] Updated weights for policy 0, policy_version 232677 (0.0032) [2024-06-28 13:15:22,921][09190] Fps is (10 sec: 37683.2, 60 sec: 41779.3, 300 sec: 42209.6). Total num frames: 3812278272. Throughput: 0: 42215.0. Samples: 91213080. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 13:15:22,922][09190] Avg episode reward: [(0, '0.605')] [2024-06-28 13:15:23,964][09423] Updated weights for policy 0, policy_version 232687 (0.0036) [2024-06-28 13:15:27,656][09423] Updated weights for policy 0, policy_version 232697 (0.0031) [2024-06-28 13:15:27,921][09190] Fps is (10 sec: 44236.5, 60 sec: 42598.3, 300 sec: 42376.2). Total num frames: 3812524032. Throughput: 0: 42277.7. Samples: 91342420. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 13:15:27,922][09190] Avg episode reward: [(0, '0.602')] [2024-06-28 13:15:31,494][09423] Updated weights for policy 0, policy_version 232707 (0.0037) [2024-06-28 13:15:32,924][09190] Fps is (10 sec: 44225.8, 60 sec: 42323.6, 300 sec: 42320.3). Total num frames: 3812720640. Throughput: 0: 42119.1. Samples: 91590920. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 13:15:32,925][09190] Avg episode reward: [(0, '0.602')] [2024-06-28 13:15:35,818][09423] Updated weights for policy 0, policy_version 232717 (0.0038) [2024-06-28 13:15:37,921][09190] Fps is (10 sec: 40960.4, 60 sec: 42325.3, 300 sec: 42265.2). Total num frames: 3812933632. Throughput: 0: 42187.6. Samples: 91849480. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 13:15:37,922][09190] Avg episode reward: [(0, '0.599')] [2024-06-28 13:15:39,078][09423] Updated weights for policy 0, policy_version 232727 (0.0032) [2024-06-28 13:15:42,921][09190] Fps is (10 sec: 40970.1, 60 sec: 42325.3, 300 sec: 42320.7). Total num frames: 3813130240. Throughput: 0: 42393.0. Samples: 91980040. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 13:15:42,922][09190] Avg episode reward: [(0, '0.598')] [2024-06-28 13:15:43,382][09423] Updated weights for policy 0, policy_version 232737 (0.0031) [2024-06-28 13:15:47,159][09423] Updated weights for policy 0, policy_version 232747 (0.0037) [2024-06-28 13:15:47,928][09190] Fps is (10 sec: 44207.8, 60 sec: 42866.9, 300 sec: 42375.3). Total num frames: 3813376000. Throughput: 0: 42467.6. Samples: 92235120. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 13:15:47,928][09190] Avg episode reward: [(0, '0.601')] [2024-06-28 13:15:50,834][09423] Updated weights for policy 0, policy_version 232757 (0.0023) [2024-06-28 13:15:52,921][09190] Fps is (10 sec: 44236.7, 60 sec: 42325.3, 300 sec: 42376.2). Total num frames: 3813572608. Throughput: 0: 42255.0. Samples: 92486780. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 13:15:52,922][09190] Avg episode reward: [(0, '0.589')] [2024-06-28 13:15:54,784][09423] Updated weights for policy 0, policy_version 232767 (0.0040) [2024-06-28 13:15:57,921][09190] Fps is (10 sec: 40986.5, 60 sec: 42325.4, 300 sec: 42321.3). Total num frames: 3813785600. Throughput: 0: 42259.1. Samples: 92612660. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 13:15:57,922][09190] Avg episode reward: [(0, '0.589')] [2024-06-28 13:15:58,338][09423] Updated weights for policy 0, policy_version 232777 (0.0037) [2024-06-28 13:16:02,370][09423] Updated weights for policy 0, policy_version 232787 (0.0029) [2024-06-28 13:16:02,921][09190] Fps is (10 sec: 42598.2, 60 sec: 42325.2, 300 sec: 42265.2). Total num frames: 3813998592. Throughput: 0: 42548.3. Samples: 92872280. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 13:16:02,922][09190] Avg episode reward: [(0, '0.606')] [2024-06-28 13:16:06,333][09423] Updated weights for policy 0, policy_version 232797 (0.0037) [2024-06-28 13:16:07,921][09190] Fps is (10 sec: 42598.8, 60 sec: 42598.4, 300 sec: 42377.2). Total num frames: 3814211584. Throughput: 0: 42499.6. Samples: 93125560. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 13:16:07,922][09190] Avg episode reward: [(0, '0.606')] [2024-06-28 13:16:09,953][09423] Updated weights for policy 0, policy_version 232807 (0.0047) [2024-06-28 13:16:12,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42052.2, 300 sec: 42320.7). Total num frames: 3814424576. Throughput: 0: 42501.8. Samples: 93255000. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 13:16:12,922][09190] Avg episode reward: [(0, '0.596')] [2024-06-28 13:16:13,918][09423] Updated weights for policy 0, policy_version 232817 (0.0030) [2024-06-28 13:16:17,656][09423] Updated weights for policy 0, policy_version 232827 (0.0027) [2024-06-28 13:16:17,921][09190] Fps is (10 sec: 42598.7, 60 sec: 42598.5, 300 sec: 42376.3). Total num frames: 3814637568. Throughput: 0: 42665.6. Samples: 93510760. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 13:16:17,922][09190] Avg episode reward: [(0, '0.602')] [2024-06-28 13:16:17,959][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000232828_3814653952.pth... [2024-06-28 13:16:18,007][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000232207_3804479488.pth [2024-06-28 13:16:21,568][09423] Updated weights for policy 0, policy_version 232837 (0.0040) [2024-06-28 13:16:22,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42598.3, 300 sec: 42320.7). Total num frames: 3814834176. Throughput: 0: 42396.8. Samples: 93757340. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 13:16:22,922][09190] Avg episode reward: [(0, '0.606')] [2024-06-28 13:16:25,619][09423] Updated weights for policy 0, policy_version 232847 (0.0035) [2024-06-28 13:16:27,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42052.4, 300 sec: 42265.2). Total num frames: 3815047168. Throughput: 0: 42208.5. Samples: 93879420. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 13:16:27,922][09190] Avg episode reward: [(0, '0.597')] [2024-06-28 13:16:29,535][09423] Updated weights for policy 0, policy_version 232857 (0.0033) [2024-06-28 13:16:32,922][09190] Fps is (10 sec: 42598.4, 60 sec: 42327.0, 300 sec: 42376.2). Total num frames: 3815260160. Throughput: 0: 42378.9. Samples: 94141900. Policy #0 lag: (min: 0.0, avg: 11.1, max: 21.0) [2024-06-28 13:16:32,922][09190] Avg episode reward: [(0, '0.597')] [2024-06-28 13:16:33,263][09423] Updated weights for policy 0, policy_version 232867 (0.0027) [2024-06-28 13:16:37,073][09423] Updated weights for policy 0, policy_version 232877 (0.0033) [2024-06-28 13:16:37,921][09190] Fps is (10 sec: 42597.9, 60 sec: 42325.3, 300 sec: 42320.7). Total num frames: 3815473152. Throughput: 0: 42454.2. Samples: 94397220. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 13:16:37,922][09190] Avg episode reward: [(0, '0.597')] [2024-06-28 13:16:40,985][09423] Updated weights for policy 0, policy_version 232887 (0.0031) [2024-06-28 13:16:42,921][09190] Fps is (10 sec: 40960.3, 60 sec: 42325.3, 300 sec: 42320.7). Total num frames: 3815669760. Throughput: 0: 42345.8. Samples: 94518220. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 13:16:42,922][09190] Avg episode reward: [(0, '0.597')] [2024-06-28 13:16:44,641][09423] Updated weights for policy 0, policy_version 232897 (0.0043) [2024-06-28 13:16:47,921][09190] Fps is (10 sec: 40960.2, 60 sec: 41783.7, 300 sec: 42265.2). Total num frames: 3815882752. Throughput: 0: 42285.9. Samples: 94775140. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 13:16:47,922][09190] Avg episode reward: [(0, '0.606')] [2024-06-28 13:16:48,537][09423] Updated weights for policy 0, policy_version 232907 (0.0030) [2024-06-28 13:16:51,611][09403] Signal inference workers to stop experience collection... (1350 times) [2024-06-28 13:16:51,612][09403] Signal inference workers to resume experience collection... (1350 times) [2024-06-28 13:16:51,652][09423] InferenceWorker_p0-w0: stopping experience collection (1350 times) [2024-06-28 13:16:51,653][09423] InferenceWorker_p0-w0: resuming experience collection (1350 times) [2024-06-28 13:16:52,698][09423] Updated weights for policy 0, policy_version 232917 (0.0038) [2024-06-28 13:16:52,921][09190] Fps is (10 sec: 44237.1, 60 sec: 42325.4, 300 sec: 42320.7). Total num frames: 3816112128. Throughput: 0: 42258.2. Samples: 95027180. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 13:16:52,922][09190] Avg episode reward: [(0, '0.607')] [2024-06-28 13:16:56,225][09423] Updated weights for policy 0, policy_version 232927 (0.0037) [2024-06-28 13:16:57,921][09190] Fps is (10 sec: 44237.1, 60 sec: 42325.4, 300 sec: 42376.9). Total num frames: 3816325120. Throughput: 0: 42245.9. Samples: 95156060. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 13:16:57,922][09190] Avg episode reward: [(0, '0.602')] [2024-06-28 13:17:00,470][09423] Updated weights for policy 0, policy_version 232937 (0.0031) [2024-06-28 13:17:02,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42052.3, 300 sec: 42265.2). Total num frames: 3816521728. Throughput: 0: 42252.4. Samples: 95412120. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 13:17:02,922][09190] Avg episode reward: [(0, '0.581')] [2024-06-28 13:17:03,994][09423] Updated weights for policy 0, policy_version 232947 (0.0031) [2024-06-28 13:17:07,922][09190] Fps is (10 sec: 42597.3, 60 sec: 42325.2, 300 sec: 42321.0). Total num frames: 3816751104. Throughput: 0: 42306.1. Samples: 95661120. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 13:17:07,922][09190] Avg episode reward: [(0, '0.579')] [2024-06-28 13:17:08,071][09423] Updated weights for policy 0, policy_version 232957 (0.0038) [2024-06-28 13:17:11,723][09423] Updated weights for policy 0, policy_version 232967 (0.0043) [2024-06-28 13:17:12,921][09190] Fps is (10 sec: 44236.5, 60 sec: 42325.4, 300 sec: 42320.7). Total num frames: 3816964096. Throughput: 0: 42503.4. Samples: 95792080. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 13:17:12,922][09190] Avg episode reward: [(0, '0.599')] [2024-06-28 13:17:15,825][09423] Updated weights for policy 0, policy_version 232977 (0.0037) [2024-06-28 13:17:17,921][09190] Fps is (10 sec: 39322.7, 60 sec: 41779.2, 300 sec: 42209.6). Total num frames: 3817144320. Throughput: 0: 42350.4. Samples: 96047660. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 13:17:17,922][09190] Avg episode reward: [(0, '0.605')] [2024-06-28 13:17:19,800][09423] Updated weights for policy 0, policy_version 232987 (0.0043) [2024-06-28 13:17:22,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42598.4, 300 sec: 42376.6). Total num frames: 3817390080. Throughput: 0: 42145.4. Samples: 96293760. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 13:17:22,922][09190] Avg episode reward: [(0, '0.607')] [2024-06-28 13:17:23,579][09423] Updated weights for policy 0, policy_version 232997 (0.0038) [2024-06-28 13:17:27,744][09423] Updated weights for policy 0, policy_version 233007 (0.0041) [2024-06-28 13:17:27,921][09190] Fps is (10 sec: 44237.1, 60 sec: 42325.4, 300 sec: 42320.7). Total num frames: 3817586688. Throughput: 0: 42346.8. Samples: 96423820. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 13:17:27,922][09190] Avg episode reward: [(0, '0.596')] [2024-06-28 13:17:31,210][09423] Updated weights for policy 0, policy_version 233017 (0.0037) [2024-06-28 13:17:32,921][09190] Fps is (10 sec: 39321.9, 60 sec: 42052.4, 300 sec: 42265.2). Total num frames: 3817783296. Throughput: 0: 41985.8. Samples: 96664500. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 13:17:32,922][09190] Avg episode reward: [(0, '0.599')] [2024-06-28 13:17:35,281][09423] Updated weights for policy 0, policy_version 233027 (0.0048) [2024-06-28 13:17:37,921][09190] Fps is (10 sec: 42597.5, 60 sec: 42325.3, 300 sec: 42265.5). Total num frames: 3818012672. Throughput: 0: 42129.2. Samples: 96923000. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 13:17:37,922][09190] Avg episode reward: [(0, '0.609')] [2024-06-28 13:17:38,778][09423] Updated weights for policy 0, policy_version 233037 (0.0039) [2024-06-28 13:17:42,898][09423] Updated weights for policy 0, policy_version 233047 (0.0031) [2024-06-28 13:17:42,924][09190] Fps is (10 sec: 45863.4, 60 sec: 42869.7, 300 sec: 42431.4). Total num frames: 3818242048. Throughput: 0: 42184.7. Samples: 97054480. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 13:17:42,925][09190] Avg episode reward: [(0, '0.601')] [2024-06-28 13:17:46,754][09423] Updated weights for policy 0, policy_version 233057 (0.0030) [2024-06-28 13:17:47,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42325.3, 300 sec: 42265.2). Total num frames: 3818422272. Throughput: 0: 42041.7. Samples: 97304000. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-28 13:17:47,922][09190] Avg episode reward: [(0, '0.584')] [2024-06-28 13:17:50,677][09423] Updated weights for policy 0, policy_version 233067 (0.0037) [2024-06-28 13:17:52,922][09190] Fps is (10 sec: 40969.9, 60 sec: 42325.2, 300 sec: 42320.7). Total num frames: 3818651648. Throughput: 0: 42067.7. Samples: 97554160. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-28 13:17:52,922][09190] Avg episode reward: [(0, '0.600')] [2024-06-28 13:17:54,714][09423] Updated weights for policy 0, policy_version 233077 (0.0042) [2024-06-28 13:17:57,921][09190] Fps is (10 sec: 40960.1, 60 sec: 41779.1, 300 sec: 42210.2). Total num frames: 3818831872. Throughput: 0: 42124.4. Samples: 97687680. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-28 13:17:57,922][09190] Avg episode reward: [(0, '0.597')] [2024-06-28 13:17:58,596][09423] Updated weights for policy 0, policy_version 233087 (0.0033) [2024-06-28 13:18:02,468][09423] Updated weights for policy 0, policy_version 233097 (0.0030) [2024-06-28 13:18:02,921][09190] Fps is (10 sec: 40961.1, 60 sec: 42325.4, 300 sec: 42265.6). Total num frames: 3819061248. Throughput: 0: 41915.2. Samples: 97933840. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-28 13:18:02,922][09190] Avg episode reward: [(0, '0.606')] [2024-06-28 13:18:06,514][09423] Updated weights for policy 0, policy_version 233107 (0.0028) [2024-06-28 13:18:07,921][09190] Fps is (10 sec: 44237.2, 60 sec: 42052.4, 300 sec: 42265.2). Total num frames: 3819274240. Throughput: 0: 42232.1. Samples: 98194200. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-28 13:18:07,922][09190] Avg episode reward: [(0, '0.600')] [2024-06-28 13:18:09,964][09423] Updated weights for policy 0, policy_version 233117 (0.0041) [2024-06-28 13:18:12,921][09190] Fps is (10 sec: 40959.3, 60 sec: 41779.2, 300 sec: 42265.2). Total num frames: 3819470848. Throughput: 0: 42102.5. Samples: 98318440. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-28 13:18:12,928][09190] Avg episode reward: [(0, '0.594')] [2024-06-28 13:18:14,179][09423] Updated weights for policy 0, policy_version 233127 (0.0032) [2024-06-28 13:18:17,671][09423] Updated weights for policy 0, policy_version 233137 (0.0032) [2024-06-28 13:18:17,921][09190] Fps is (10 sec: 44236.9, 60 sec: 42871.5, 300 sec: 42320.7). Total num frames: 3819716608. Throughput: 0: 42332.0. Samples: 98569440. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-28 13:18:17,922][09190] Avg episode reward: [(0, '0.604')] [2024-06-28 13:18:17,963][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000233138_3819732992.pth... [2024-06-28 13:18:18,018][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000232517_3809558528.pth [2024-06-28 13:18:21,932][09423] Updated weights for policy 0, policy_version 233147 (0.0039) [2024-06-28 13:18:22,921][09190] Fps is (10 sec: 42598.8, 60 sec: 41779.3, 300 sec: 42209.7). Total num frames: 3819896832. Throughput: 0: 42436.6. Samples: 98832640. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-28 13:18:22,922][09190] Avg episode reward: [(0, '0.606')] [2024-06-28 13:18:25,148][09423] Updated weights for policy 0, policy_version 233157 (0.0032) [2024-06-28 13:18:27,921][09190] Fps is (10 sec: 39321.2, 60 sec: 42052.2, 300 sec: 42265.2). Total num frames: 3820109824. Throughput: 0: 42104.5. Samples: 98949080. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-28 13:18:27,922][09190] Avg episode reward: [(0, '0.607')] [2024-06-28 13:18:29,487][09423] Updated weights for policy 0, policy_version 233167 (0.0032) [2024-06-28 13:18:32,921][09190] Fps is (10 sec: 45875.1, 60 sec: 42871.5, 300 sec: 42320.7). Total num frames: 3820355584. Throughput: 0: 42247.2. Samples: 99205120. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-28 13:18:32,922][09190] Avg episode reward: [(0, '0.605')] [2024-06-28 13:18:32,983][09423] Updated weights for policy 0, policy_version 233177 (0.0024) [2024-06-28 13:18:37,348][09423] Updated weights for policy 0, policy_version 233187 (0.0032) [2024-06-28 13:18:37,921][09190] Fps is (10 sec: 42598.8, 60 sec: 42052.4, 300 sec: 42320.7). Total num frames: 3820535808. Throughput: 0: 42337.9. Samples: 99459360. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-28 13:18:37,922][09190] Avg episode reward: [(0, '0.601')] [2024-06-28 13:18:41,019][09423] Updated weights for policy 0, policy_version 233197 (0.0031) [2024-06-28 13:18:42,921][09190] Fps is (10 sec: 39321.4, 60 sec: 41781.0, 300 sec: 42320.7). Total num frames: 3820748800. Throughput: 0: 42213.8. Samples: 99587300. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-28 13:18:42,922][09190] Avg episode reward: [(0, '0.589')] [2024-06-28 13:18:44,448][09403] Signal inference workers to stop experience collection... (1400 times) [2024-06-28 13:18:44,454][09403] Signal inference workers to resume experience collection... (1400 times) [2024-06-28 13:18:44,472][09423] InferenceWorker_p0-w0: stopping experience collection (1400 times) [2024-06-28 13:18:44,472][09423] InferenceWorker_p0-w0: resuming experience collection (1400 times) [2024-06-28 13:18:45,181][09423] Updated weights for policy 0, policy_version 233207 (0.0042) [2024-06-28 13:18:47,921][09190] Fps is (10 sec: 45874.5, 60 sec: 42871.5, 300 sec: 42376.2). Total num frames: 3820994560. Throughput: 0: 42462.0. Samples: 99844640. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-28 13:18:47,922][09190] Avg episode reward: [(0, '0.603')] [2024-06-28 13:18:48,489][09423] Updated weights for policy 0, policy_version 233217 (0.0047) [2024-06-28 13:18:52,867][09423] Updated weights for policy 0, policy_version 233227 (0.0035) [2024-06-28 13:18:52,921][09190] Fps is (10 sec: 44237.0, 60 sec: 42325.5, 300 sec: 42321.1). Total num frames: 3821191168. Throughput: 0: 42394.2. Samples: 100101940. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-28 13:18:52,922][09190] Avg episode reward: [(0, '0.598')] [2024-06-28 13:18:56,678][09423] Updated weights for policy 0, policy_version 233237 (0.0033) [2024-06-28 13:18:57,921][09190] Fps is (10 sec: 40960.4, 60 sec: 42871.5, 300 sec: 42376.3). Total num frames: 3821404160. Throughput: 0: 42403.6. Samples: 100226600. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 13:18:57,922][09190] Avg episode reward: [(0, '0.594')] [2024-06-28 13:19:00,677][09423] Updated weights for policy 0, policy_version 233247 (0.0037) [2024-06-28 13:19:02,921][09190] Fps is (10 sec: 42597.9, 60 sec: 42598.3, 300 sec: 42265.5). Total num frames: 3821617152. Throughput: 0: 42590.6. Samples: 100486020. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 13:19:02,922][09190] Avg episode reward: [(0, '0.603')] [2024-06-28 13:19:04,099][09423] Updated weights for policy 0, policy_version 233257 (0.0032) [2024-06-28 13:19:07,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42598.4, 300 sec: 42376.3). Total num frames: 3821830144. Throughput: 0: 42495.0. Samples: 100744920. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 13:19:07,922][09190] Avg episode reward: [(0, '0.596')] [2024-06-28 13:19:08,130][09423] Updated weights for policy 0, policy_version 233267 (0.0040) [2024-06-28 13:19:11,773][09423] Updated weights for policy 0, policy_version 233277 (0.0032) [2024-06-28 13:19:12,921][09190] Fps is (10 sec: 40960.3, 60 sec: 42598.4, 300 sec: 42320.7). Total num frames: 3822026752. Throughput: 0: 42664.9. Samples: 100869000. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 13:19:12,922][09190] Avg episode reward: [(0, '0.605')] [2024-06-28 13:19:15,685][09423] Updated weights for policy 0, policy_version 233287 (0.0037) [2024-06-28 13:19:17,921][09190] Fps is (10 sec: 40960.3, 60 sec: 42052.3, 300 sec: 42265.2). Total num frames: 3822239744. Throughput: 0: 42674.7. Samples: 101125480. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 13:19:17,922][09190] Avg episode reward: [(0, '0.599')] [2024-06-28 13:19:19,446][09423] Updated weights for policy 0, policy_version 233297 (0.0023) [2024-06-28 13:19:22,921][09190] Fps is (10 sec: 40959.7, 60 sec: 42325.2, 300 sec: 42265.1). Total num frames: 3822436352. Throughput: 0: 42678.5. Samples: 101379900. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 13:19:22,922][09190] Avg episode reward: [(0, '0.601')] [2024-06-28 13:19:23,814][09423] Updated weights for policy 0, policy_version 233307 (0.0038) [2024-06-28 13:19:27,310][09423] Updated weights for policy 0, policy_version 233317 (0.0038) [2024-06-28 13:19:27,921][09190] Fps is (10 sec: 44236.2, 60 sec: 42871.5, 300 sec: 42376.2). Total num frames: 3822682112. Throughput: 0: 42628.4. Samples: 101505580. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 13:19:27,922][09190] Avg episode reward: [(0, '0.602')] [2024-06-28 13:19:31,561][09423] Updated weights for policy 0, policy_version 233327 (0.0038) [2024-06-28 13:19:32,922][09190] Fps is (10 sec: 44236.6, 60 sec: 42052.2, 300 sec: 42320.7). Total num frames: 3822878720. Throughput: 0: 42559.5. Samples: 101759820. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 13:19:32,922][09190] Avg episode reward: [(0, '0.588')] [2024-06-28 13:19:35,020][09423] Updated weights for policy 0, policy_version 233337 (0.0044) [2024-06-28 13:19:37,921][09190] Fps is (10 sec: 39321.9, 60 sec: 42325.3, 300 sec: 42320.7). Total num frames: 3823075328. Throughput: 0: 42415.5. Samples: 102010640. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 13:19:37,922][09190] Avg episode reward: [(0, '0.608')] [2024-06-28 13:19:39,138][09423] Updated weights for policy 0, policy_version 233347 (0.0032) [2024-06-28 13:19:42,921][09190] Fps is (10 sec: 42598.8, 60 sec: 42598.4, 300 sec: 42376.3). Total num frames: 3823304704. Throughput: 0: 42432.9. Samples: 102136080. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 13:19:42,922][09190] Avg episode reward: [(0, '0.589')] [2024-06-28 13:19:43,066][09423] Updated weights for policy 0, policy_version 233357 (0.0023) [2024-06-28 13:19:46,994][09423] Updated weights for policy 0, policy_version 233367 (0.0039) [2024-06-28 13:19:47,921][09190] Fps is (10 sec: 42598.2, 60 sec: 41779.3, 300 sec: 42265.2). Total num frames: 3823501312. Throughput: 0: 42219.1. Samples: 102385880. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 13:19:47,922][09190] Avg episode reward: [(0, '0.608')] [2024-06-28 13:19:50,635][09423] Updated weights for policy 0, policy_version 233377 (0.0035) [2024-06-28 13:19:52,921][09190] Fps is (10 sec: 39321.6, 60 sec: 41779.1, 300 sec: 42209.6). Total num frames: 3823697920. Throughput: 0: 42215.1. Samples: 102644600. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 13:19:52,922][09190] Avg episode reward: [(0, '0.592')] [2024-06-28 13:19:54,849][09423] Updated weights for policy 0, policy_version 233387 (0.0051) [2024-06-28 13:19:57,921][09190] Fps is (10 sec: 44237.3, 60 sec: 42325.4, 300 sec: 42320.7). Total num frames: 3823943680. Throughput: 0: 42207.6. Samples: 102768340. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 13:19:57,922][09190] Avg episode reward: [(0, '0.596')] [2024-06-28 13:19:58,175][09423] Updated weights for policy 0, policy_version 233397 (0.0026) [2024-06-28 13:20:02,444][09423] Updated weights for policy 0, policy_version 233407 (0.0038) [2024-06-28 13:20:02,921][09190] Fps is (10 sec: 45874.9, 60 sec: 42325.3, 300 sec: 42376.2). Total num frames: 3824156672. Throughput: 0: 42236.7. Samples: 103026140. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 13:20:02,922][09190] Avg episode reward: [(0, '0.613')] [2024-06-28 13:20:05,738][09423] Updated weights for policy 0, policy_version 233417 (0.0039) [2024-06-28 13:20:07,921][09190] Fps is (10 sec: 40959.4, 60 sec: 42052.2, 300 sec: 42209.6). Total num frames: 3824353280. Throughput: 0: 42255.1. Samples: 103281380. Policy #0 lag: (min: 1.0, avg: 12.1, max: 21.0) [2024-06-28 13:20:07,922][09190] Avg episode reward: [(0, '0.597')] [2024-06-28 13:20:10,091][09423] Updated weights for policy 0, policy_version 233427 (0.0034) [2024-06-28 13:20:12,921][09190] Fps is (10 sec: 40960.4, 60 sec: 42325.3, 300 sec: 42320.7). Total num frames: 3824566272. Throughput: 0: 42157.8. Samples: 103402680. Policy #0 lag: (min: 1.0, avg: 12.1, max: 21.0) [2024-06-28 13:20:12,922][09190] Avg episode reward: [(0, '0.595')] [2024-06-28 13:20:13,758][09423] Updated weights for policy 0, policy_version 233437 (0.0041) [2024-06-28 13:20:17,923][09423] Updated weights for policy 0, policy_version 233447 (0.0035) [2024-06-28 13:20:17,924][09190] Fps is (10 sec: 44225.8, 60 sec: 42596.5, 300 sec: 42431.4). Total num frames: 3824795648. Throughput: 0: 42274.2. Samples: 103662260. Policy #0 lag: (min: 1.0, avg: 12.1, max: 21.0) [2024-06-28 13:20:17,924][09190] Avg episode reward: [(0, '0.596')] [2024-06-28 13:20:17,933][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000233447_3824795648.pth... [2024-06-28 13:20:17,994][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000232828_3814653952.pth [2024-06-28 13:20:21,520][09423] Updated weights for policy 0, policy_version 233457 (0.0022) [2024-06-28 13:20:22,846][09403] Signal inference workers to stop experience collection... (1450 times) [2024-06-28 13:20:22,848][09403] Signal inference workers to resume experience collection... (1450 times) [2024-06-28 13:20:22,864][09423] InferenceWorker_p0-w0: stopping experience collection (1450 times) [2024-06-28 13:20:22,864][09423] InferenceWorker_p0-w0: resuming experience collection (1450 times) [2024-06-28 13:20:22,921][09190] Fps is (10 sec: 42598.2, 60 sec: 42598.4, 300 sec: 42265.2). Total num frames: 3824992256. Throughput: 0: 42356.8. Samples: 103916700. Policy #0 lag: (min: 1.0, avg: 12.1, max: 21.0) [2024-06-28 13:20:22,922][09190] Avg episode reward: [(0, '0.605')] [2024-06-28 13:20:25,521][09423] Updated weights for policy 0, policy_version 233467 (0.0035) [2024-06-28 13:20:27,921][09190] Fps is (10 sec: 40970.6, 60 sec: 42052.3, 300 sec: 42321.1). Total num frames: 3825205248. Throughput: 0: 42299.6. Samples: 104039560. Policy #0 lag: (min: 1.0, avg: 12.1, max: 21.0) [2024-06-28 13:20:27,922][09190] Avg episode reward: [(0, '0.594')] [2024-06-28 13:20:29,387][09423] Updated weights for policy 0, policy_version 233477 (0.0030) [2024-06-28 13:20:32,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42325.4, 300 sec: 42320.7). Total num frames: 3825418240. Throughput: 0: 42466.2. Samples: 104296860. Policy #0 lag: (min: 1.0, avg: 12.1, max: 21.0) [2024-06-28 13:20:32,922][09190] Avg episode reward: [(0, '0.606')] [2024-06-28 13:20:33,544][09423] Updated weights for policy 0, policy_version 233487 (0.0043) [2024-06-28 13:20:37,103][09423] Updated weights for policy 0, policy_version 233497 (0.0027) [2024-06-28 13:20:37,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42598.4, 300 sec: 42376.3). Total num frames: 3825631232. Throughput: 0: 42245.8. Samples: 104545660. Policy #0 lag: (min: 1.0, avg: 12.1, max: 21.0) [2024-06-28 13:20:37,922][09190] Avg episode reward: [(0, '0.601')] [2024-06-28 13:20:41,098][09423] Updated weights for policy 0, policy_version 233507 (0.0033) [2024-06-28 13:20:42,921][09190] Fps is (10 sec: 42598.9, 60 sec: 42325.4, 300 sec: 42266.1). Total num frames: 3825844224. Throughput: 0: 42458.7. Samples: 104678980. Policy #0 lag: (min: 1.0, avg: 12.1, max: 21.0) [2024-06-28 13:20:42,922][09190] Avg episode reward: [(0, '0.576')] [2024-06-28 13:20:44,918][09423] Updated weights for policy 0, policy_version 233517 (0.0027) [2024-06-28 13:20:47,921][09190] Fps is (10 sec: 42598.1, 60 sec: 42598.4, 300 sec: 42320.7). Total num frames: 3826057216. Throughput: 0: 42445.8. Samples: 104936200. Policy #0 lag: (min: 1.0, avg: 12.1, max: 21.0) [2024-06-28 13:20:47,922][09190] Avg episode reward: [(0, '0.601')] [2024-06-28 13:20:48,575][09423] Updated weights for policy 0, policy_version 233527 (0.0032) [2024-06-28 13:20:52,510][09423] Updated weights for policy 0, policy_version 233537 (0.0049) [2024-06-28 13:20:52,921][09190] Fps is (10 sec: 42598.1, 60 sec: 42871.5, 300 sec: 42320.7). Total num frames: 3826270208. Throughput: 0: 42334.3. Samples: 105186420. Policy #0 lag: (min: 1.0, avg: 12.1, max: 21.0) [2024-06-28 13:20:52,922][09190] Avg episode reward: [(0, '0.600')] [2024-06-28 13:20:56,256][09423] Updated weights for policy 0, policy_version 233547 (0.0042) [2024-06-28 13:20:57,921][09190] Fps is (10 sec: 42598.6, 60 sec: 42325.3, 300 sec: 42320.7). Total num frames: 3826483200. Throughput: 0: 42609.8. Samples: 105320120. Policy #0 lag: (min: 1.0, avg: 12.1, max: 21.0) [2024-06-28 13:20:57,922][09190] Avg episode reward: [(0, '0.607')] [2024-06-28 13:21:00,360][09423] Updated weights for policy 0, policy_version 233557 (0.0029) [2024-06-28 13:21:02,921][09190] Fps is (10 sec: 42598.8, 60 sec: 42325.5, 300 sec: 42320.7). Total num frames: 3826696192. Throughput: 0: 42486.5. Samples: 105574040. Policy #0 lag: (min: 1.0, avg: 12.1, max: 21.0) [2024-06-28 13:21:02,922][09190] Avg episode reward: [(0, '0.607')] [2024-06-28 13:21:03,882][09423] Updated weights for policy 0, policy_version 233567 (0.0039) [2024-06-28 13:21:07,921][09190] Fps is (10 sec: 42598.6, 60 sec: 42598.5, 300 sec: 42320.7). Total num frames: 3826909184. Throughput: 0: 42532.5. Samples: 105830660. Policy #0 lag: (min: 1.0, avg: 12.1, max: 21.0) [2024-06-28 13:21:07,922][09190] Avg episode reward: [(0, '0.601')] [2024-06-28 13:21:08,034][09423] Updated weights for policy 0, policy_version 233577 (0.0032) [2024-06-28 13:21:11,780][09423] Updated weights for policy 0, policy_version 233587 (0.0035) [2024-06-28 13:21:12,921][09190] Fps is (10 sec: 42597.8, 60 sec: 42598.4, 300 sec: 42320.7). Total num frames: 3827122176. Throughput: 0: 42595.5. Samples: 105956360. Policy #0 lag: (min: 1.0, avg: 12.1, max: 21.0) [2024-06-28 13:21:12,922][09190] Avg episode reward: [(0, '0.595')] [2024-06-28 13:21:15,795][09423] Updated weights for policy 0, policy_version 233597 (0.0033) [2024-06-28 13:21:17,921][09190] Fps is (10 sec: 40959.6, 60 sec: 42054.0, 300 sec: 42320.7). Total num frames: 3827318784. Throughput: 0: 42391.1. Samples: 106204460. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 13:21:17,922][09190] Avg episode reward: [(0, '0.604')] [2024-06-28 13:21:19,566][09423] Updated weights for policy 0, policy_version 233607 (0.0033) [2024-06-28 13:21:22,921][09190] Fps is (10 sec: 40960.1, 60 sec: 42325.3, 300 sec: 42320.7). Total num frames: 3827531776. Throughput: 0: 42573.3. Samples: 106461460. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 13:21:22,922][09190] Avg episode reward: [(0, '0.598')] [2024-06-28 13:21:23,557][09423] Updated weights for policy 0, policy_version 233617 (0.0040) [2024-06-28 13:21:27,409][09423] Updated weights for policy 0, policy_version 233627 (0.0049) [2024-06-28 13:21:27,921][09190] Fps is (10 sec: 44236.7, 60 sec: 42598.3, 300 sec: 42376.3). Total num frames: 3827761152. Throughput: 0: 42304.8. Samples: 106582700. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 13:21:27,922][09190] Avg episode reward: [(0, '0.597')] [2024-06-28 13:21:31,311][09423] Updated weights for policy 0, policy_version 233637 (0.0030) [2024-06-28 13:21:32,921][09190] Fps is (10 sec: 42598.7, 60 sec: 42325.4, 300 sec: 42320.7). Total num frames: 3827957760. Throughput: 0: 42218.7. Samples: 106836040. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 13:21:32,922][09190] Avg episode reward: [(0, '0.588')] [2024-06-28 13:21:35,296][09423] Updated weights for policy 0, policy_version 233647 (0.0049) [2024-06-28 13:21:37,921][09190] Fps is (10 sec: 42598.8, 60 sec: 42598.4, 300 sec: 42431.8). Total num frames: 3828187136. Throughput: 0: 42313.8. Samples: 107090540. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 13:21:37,922][09190] Avg episode reward: [(0, '0.584')] [2024-06-28 13:21:39,652][09423] Updated weights for policy 0, policy_version 233657 (0.0035) [2024-06-28 13:21:42,863][09423] Updated weights for policy 0, policy_version 233667 (0.0032) [2024-06-28 13:21:42,921][09190] Fps is (10 sec: 44236.9, 60 sec: 42598.4, 300 sec: 42431.8). Total num frames: 3828400128. Throughput: 0: 42136.0. Samples: 107216240. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 13:21:42,922][09190] Avg episode reward: [(0, '0.610')] [2024-06-28 13:21:47,191][09423] Updated weights for policy 0, policy_version 233677 (0.0038) [2024-06-28 13:21:47,921][09190] Fps is (10 sec: 40959.5, 60 sec: 42325.3, 300 sec: 42320.7). Total num frames: 3828596736. Throughput: 0: 42219.4. Samples: 107473920. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 13:21:47,924][09190] Avg episode reward: [(0, '0.603')] [2024-06-28 13:21:50,369][09423] Updated weights for policy 0, policy_version 233687 (0.0046) [2024-06-28 13:21:52,921][09190] Fps is (10 sec: 40959.5, 60 sec: 42325.3, 300 sec: 42320.7). Total num frames: 3828809728. Throughput: 0: 41982.6. Samples: 107719880. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 13:21:52,922][09190] Avg episode reward: [(0, '0.596')] [2024-06-28 13:21:54,794][09423] Updated weights for policy 0, policy_version 233697 (0.0024) [2024-06-28 13:21:57,921][09190] Fps is (10 sec: 44237.3, 60 sec: 42598.4, 300 sec: 42431.8). Total num frames: 3829039104. Throughput: 0: 42142.8. Samples: 107852780. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 13:21:57,922][09190] Avg episode reward: [(0, '0.597')] [2024-06-28 13:21:58,434][09423] Updated weights for policy 0, policy_version 233707 (0.0035) [2024-06-28 13:22:02,202][09423] Updated weights for policy 0, policy_version 233717 (0.0047) [2024-06-28 13:22:02,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42325.2, 300 sec: 42320.7). Total num frames: 3829235712. Throughput: 0: 42310.7. Samples: 108108440. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 13:22:02,922][09190] Avg episode reward: [(0, '0.597')] [2024-06-28 13:22:06,280][09423] Updated weights for policy 0, policy_version 233727 (0.0046) [2024-06-28 13:22:07,921][09190] Fps is (10 sec: 42597.8, 60 sec: 42598.3, 300 sec: 42376.2). Total num frames: 3829465088. Throughput: 0: 42232.4. Samples: 108361920. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 13:22:07,922][09190] Avg episode reward: [(0, '0.606')] [2024-06-28 13:22:08,626][09403] Signal inference workers to stop experience collection... (1500 times) [2024-06-28 13:22:08,665][09423] InferenceWorker_p0-w0: stopping experience collection (1500 times) [2024-06-28 13:22:08,675][09403] Signal inference workers to resume experience collection... (1500 times) [2024-06-28 13:22:08,682][09423] InferenceWorker_p0-w0: resuming experience collection (1500 times) [2024-06-28 13:22:10,054][09423] Updated weights for policy 0, policy_version 233737 (0.0033) [2024-06-28 13:22:12,921][09190] Fps is (10 sec: 44236.7, 60 sec: 42598.4, 300 sec: 42487.3). Total num frames: 3829678080. Throughput: 0: 42370.7. Samples: 108489380. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 13:22:12,922][09190] Avg episode reward: [(0, '0.593')] [2024-06-28 13:22:13,754][09423] Updated weights for policy 0, policy_version 233747 (0.0033) [2024-06-28 13:22:17,864][09423] Updated weights for policy 0, policy_version 233757 (0.0040) [2024-06-28 13:22:17,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42598.4, 300 sec: 42320.7). Total num frames: 3829874688. Throughput: 0: 42344.4. Samples: 108741540. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 13:22:17,922][09190] Avg episode reward: [(0, '0.596')] [2024-06-28 13:22:17,928][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000233757_3829874688.pth... [2024-06-28 13:22:17,981][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000233138_3819732992.pth [2024-06-28 13:22:21,323][09423] Updated weights for policy 0, policy_version 233767 (0.0036) [2024-06-28 13:22:22,921][09190] Fps is (10 sec: 39321.9, 60 sec: 42325.4, 300 sec: 42320.7). Total num frames: 3830071296. Throughput: 0: 42532.0. Samples: 109004480. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 13:22:22,922][09190] Avg episode reward: [(0, '0.608')] [2024-06-28 13:22:25,711][09423] Updated weights for policy 0, policy_version 233777 (0.0043) [2024-06-28 13:22:27,921][09190] Fps is (10 sec: 40960.3, 60 sec: 42052.3, 300 sec: 42376.2). Total num frames: 3830284288. Throughput: 0: 42462.6. Samples: 109127060. Policy #0 lag: (min: 0.0, avg: 9.9, max: 19.0) [2024-06-28 13:22:27,922][09190] Avg episode reward: [(0, '0.602')] [2024-06-28 13:22:29,261][09423] Updated weights for policy 0, policy_version 233787 (0.0032) [2024-06-28 13:22:32,921][09190] Fps is (10 sec: 42598.6, 60 sec: 42325.3, 300 sec: 42320.7). Total num frames: 3830497280. Throughput: 0: 42342.3. Samples: 109379320. Policy #0 lag: (min: 0.0, avg: 9.9, max: 19.0) [2024-06-28 13:22:32,922][09190] Avg episode reward: [(0, '0.604')] [2024-06-28 13:22:33,167][09423] Updated weights for policy 0, policy_version 233797 (0.0039) [2024-06-28 13:22:37,058][09423] Updated weights for policy 0, policy_version 233807 (0.0032) [2024-06-28 13:22:37,921][09190] Fps is (10 sec: 42598.0, 60 sec: 42052.2, 300 sec: 42265.5). Total num frames: 3830710272. Throughput: 0: 42496.4. Samples: 109632220. Policy #0 lag: (min: 0.0, avg: 9.9, max: 19.0) [2024-06-28 13:22:37,922][09190] Avg episode reward: [(0, '0.606')] [2024-06-28 13:22:40,867][09423] Updated weights for policy 0, policy_version 233817 (0.0029) [2024-06-28 13:22:42,924][09190] Fps is (10 sec: 40949.6, 60 sec: 41777.4, 300 sec: 42320.4). Total num frames: 3830906880. Throughput: 0: 42296.7. Samples: 109756240. Policy #0 lag: (min: 0.0, avg: 9.9, max: 19.0) [2024-06-28 13:22:42,924][09190] Avg episode reward: [(0, '0.604')] [2024-06-28 13:22:44,629][09423] Updated weights for policy 0, policy_version 233827 (0.0043) [2024-06-28 13:22:47,921][09190] Fps is (10 sec: 42599.1, 60 sec: 42325.5, 300 sec: 42320.7). Total num frames: 3831136256. Throughput: 0: 42233.5. Samples: 110008940. Policy #0 lag: (min: 0.0, avg: 9.9, max: 19.0) [2024-06-28 13:22:47,922][09190] Avg episode reward: [(0, '0.605')] [2024-06-28 13:22:48,766][09423] Updated weights for policy 0, policy_version 233837 (0.0038) [2024-06-28 13:22:52,625][09423] Updated weights for policy 0, policy_version 233847 (0.0033) [2024-06-28 13:22:52,926][09190] Fps is (10 sec: 44228.6, 60 sec: 42322.3, 300 sec: 42431.2). Total num frames: 3831349248. Throughput: 0: 42149.8. Samples: 110258840. Policy #0 lag: (min: 0.0, avg: 9.9, max: 19.0) [2024-06-28 13:22:52,926][09190] Avg episode reward: [(0, '0.601')] [2024-06-28 13:22:56,386][09423] Updated weights for policy 0, policy_version 233857 (0.0025) [2024-06-28 13:22:57,921][09190] Fps is (10 sec: 40959.5, 60 sec: 41779.2, 300 sec: 42320.7). Total num frames: 3831545856. Throughput: 0: 42267.6. Samples: 110391420. Policy #0 lag: (min: 0.0, avg: 9.9, max: 19.0) [2024-06-28 13:22:57,924][09190] Avg episode reward: [(0, '0.605')] [2024-06-28 13:23:00,363][09423] Updated weights for policy 0, policy_version 233867 (0.0036) [2024-06-28 13:23:02,921][09190] Fps is (10 sec: 42616.7, 60 sec: 42325.3, 300 sec: 42376.2). Total num frames: 3831775232. Throughput: 0: 42264.4. Samples: 110643440. Policy #0 lag: (min: 0.0, avg: 9.9, max: 19.0) [2024-06-28 13:23:02,922][09190] Avg episode reward: [(0, '0.605')] [2024-06-28 13:23:04,052][09423] Updated weights for policy 0, policy_version 233877 (0.0044) [2024-06-28 13:23:07,923][09190] Fps is (10 sec: 44227.7, 60 sec: 42050.9, 300 sec: 42431.5). Total num frames: 3831988224. Throughput: 0: 41909.6. Samples: 110890500. Policy #0 lag: (min: 0.0, avg: 9.9, max: 19.0) [2024-06-28 13:23:07,924][09190] Avg episode reward: [(0, '0.608')] [2024-06-28 13:23:08,164][09423] Updated weights for policy 0, policy_version 233887 (0.0036) [2024-06-28 13:23:12,039][09423] Updated weights for policy 0, policy_version 233897 (0.0039) [2024-06-28 13:23:12,921][09190] Fps is (10 sec: 40960.3, 60 sec: 41779.3, 300 sec: 42265.2). Total num frames: 3832184832. Throughput: 0: 42110.2. Samples: 111022020. Policy #0 lag: (min: 0.0, avg: 9.9, max: 19.0) [2024-06-28 13:23:12,922][09190] Avg episode reward: [(0, '0.598')] [2024-06-28 13:23:16,115][09423] Updated weights for policy 0, policy_version 233907 (0.0033) [2024-06-28 13:23:17,921][09190] Fps is (10 sec: 42606.9, 60 sec: 42325.3, 300 sec: 42431.8). Total num frames: 3832414208. Throughput: 0: 42178.1. Samples: 111277340. Policy #0 lag: (min: 0.0, avg: 9.9, max: 19.0) [2024-06-28 13:23:17,922][09190] Avg episode reward: [(0, '0.594')] [2024-06-28 13:23:19,658][09423] Updated weights for policy 0, policy_version 233917 (0.0030) [2024-06-28 13:23:22,926][09190] Fps is (10 sec: 44217.1, 60 sec: 42595.3, 300 sec: 42431.2). Total num frames: 3832627200. Throughput: 0: 42154.1. Samples: 111529340. Policy #0 lag: (min: 0.0, avg: 9.9, max: 19.0) [2024-06-28 13:23:22,926][09190] Avg episode reward: [(0, '0.605')] [2024-06-28 13:23:23,666][09423] Updated weights for policy 0, policy_version 233927 (0.0036) [2024-06-28 13:23:27,356][09423] Updated weights for policy 0, policy_version 233937 (0.0028) [2024-06-28 13:23:27,921][09190] Fps is (10 sec: 40960.3, 60 sec: 42325.3, 300 sec: 42265.2). Total num frames: 3832823808. Throughput: 0: 42248.5. Samples: 111657320. Policy #0 lag: (min: 0.0, avg: 9.9, max: 19.0) [2024-06-28 13:23:27,922][09190] Avg episode reward: [(0, '0.595')] [2024-06-28 13:23:28,934][09403] Signal inference workers to stop experience collection... (1550 times) [2024-06-28 13:23:28,934][09403] Signal inference workers to resume experience collection... (1550 times) [2024-06-28 13:23:28,978][09423] InferenceWorker_p0-w0: stopping experience collection (1550 times) [2024-06-28 13:23:28,978][09423] InferenceWorker_p0-w0: resuming experience collection (1550 times) [2024-06-28 13:23:31,762][09423] Updated weights for policy 0, policy_version 233947 (0.0035) [2024-06-28 13:23:32,921][09190] Fps is (10 sec: 39339.3, 60 sec: 42052.3, 300 sec: 42320.7). Total num frames: 3833020416. Throughput: 0: 42416.4. Samples: 111917680. Policy #0 lag: (min: 0.0, avg: 9.9, max: 19.0) [2024-06-28 13:23:32,922][09190] Avg episode reward: [(0, '0.596')] [2024-06-28 13:23:35,283][09423] Updated weights for policy 0, policy_version 233957 (0.0032) [2024-06-28 13:23:37,921][09190] Fps is (10 sec: 42598.6, 60 sec: 42325.4, 300 sec: 42376.2). Total num frames: 3833249792. Throughput: 0: 42339.2. Samples: 112163920. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2024-06-28 13:23:37,922][09190] Avg episode reward: [(0, '0.594')] [2024-06-28 13:23:39,286][09423] Updated weights for policy 0, policy_version 233967 (0.0027) [2024-06-28 13:23:42,761][09423] Updated weights for policy 0, policy_version 233977 (0.0031) [2024-06-28 13:23:42,921][09190] Fps is (10 sec: 45874.7, 60 sec: 42873.2, 300 sec: 42320.7). Total num frames: 3833479168. Throughput: 0: 42365.3. Samples: 112297860. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2024-06-28 13:23:42,922][09190] Avg episode reward: [(0, '0.598')] [2024-06-28 13:23:46,845][09423] Updated weights for policy 0, policy_version 233987 (0.0034) [2024-06-28 13:23:47,921][09190] Fps is (10 sec: 40959.7, 60 sec: 42052.2, 300 sec: 42265.2). Total num frames: 3833659392. Throughput: 0: 42297.3. Samples: 112546820. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2024-06-28 13:23:47,922][09190] Avg episode reward: [(0, '0.606')] [2024-06-28 13:23:50,769][09423] Updated weights for policy 0, policy_version 233997 (0.0039) [2024-06-28 13:23:52,921][09190] Fps is (10 sec: 40960.5, 60 sec: 42328.5, 300 sec: 42320.7). Total num frames: 3833888768. Throughput: 0: 42504.7. Samples: 112803120. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2024-06-28 13:23:52,922][09190] Avg episode reward: [(0, '0.599')] [2024-06-28 13:23:55,013][09423] Updated weights for policy 0, policy_version 234007 (0.0034) [2024-06-28 13:23:57,923][09190] Fps is (10 sec: 45866.7, 60 sec: 42870.1, 300 sec: 42376.0). Total num frames: 3834118144. Throughput: 0: 42509.3. Samples: 112935020. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2024-06-28 13:23:57,924][09190] Avg episode reward: [(0, '0.602')] [2024-06-28 13:23:58,295][09423] Updated weights for policy 0, policy_version 234017 (0.0032) [2024-06-28 13:24:02,537][09423] Updated weights for policy 0, policy_version 234027 (0.0032) [2024-06-28 13:24:02,922][09190] Fps is (10 sec: 42597.5, 60 sec: 42325.3, 300 sec: 42320.7). Total num frames: 3834314752. Throughput: 0: 42336.4. Samples: 113182480. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2024-06-28 13:24:02,922][09190] Avg episode reward: [(0, '0.615')] [2024-06-28 13:24:06,240][09423] Updated weights for policy 0, policy_version 234037 (0.0038) [2024-06-28 13:24:07,921][09190] Fps is (10 sec: 40967.4, 60 sec: 42326.7, 300 sec: 42376.2). Total num frames: 3834527744. Throughput: 0: 42464.1. Samples: 113440040. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2024-06-28 13:24:07,922][09190] Avg episode reward: [(0, '0.605')] [2024-06-28 13:24:10,147][09423] Updated weights for policy 0, policy_version 234047 (0.0029) [2024-06-28 13:24:12,922][09190] Fps is (10 sec: 42598.3, 60 sec: 42598.3, 300 sec: 42376.2). Total num frames: 3834740736. Throughput: 0: 42465.7. Samples: 113568280. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2024-06-28 13:24:12,922][09190] Avg episode reward: [(0, '0.587')] [2024-06-28 13:24:14,031][09423] Updated weights for policy 0, policy_version 234057 (0.0040) [2024-06-28 13:24:17,921][09190] Fps is (10 sec: 39322.2, 60 sec: 41779.3, 300 sec: 42320.7). Total num frames: 3834920960. Throughput: 0: 42167.1. Samples: 113815200. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2024-06-28 13:24:17,922][09190] Avg episode reward: [(0, '0.599')] [2024-06-28 13:24:18,070][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000234066_3834937344.pth... [2024-06-28 13:24:18,142][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000233447_3824795648.pth [2024-06-28 13:24:18,284][09423] Updated weights for policy 0, policy_version 234067 (0.0034) [2024-06-28 13:24:21,531][09423] Updated weights for policy 0, policy_version 234077 (0.0028) [2024-06-28 13:24:22,921][09190] Fps is (10 sec: 39321.9, 60 sec: 41782.2, 300 sec: 42209.6). Total num frames: 3835133952. Throughput: 0: 42699.5. Samples: 114085400. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2024-06-28 13:24:22,924][09190] Avg episode reward: [(0, '0.608')] [2024-06-28 13:24:25,745][09423] Updated weights for policy 0, policy_version 234087 (0.0038) [2024-06-28 13:24:27,922][09190] Fps is (10 sec: 45873.9, 60 sec: 42598.2, 300 sec: 42376.2). Total num frames: 3835379712. Throughput: 0: 42446.9. Samples: 114207980. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2024-06-28 13:24:27,922][09190] Avg episode reward: [(0, '0.610')] [2024-06-28 13:24:28,991][09423] Updated weights for policy 0, policy_version 234097 (0.0035) [2024-06-28 13:24:32,921][09190] Fps is (10 sec: 44237.0, 60 sec: 42598.3, 300 sec: 42376.2). Total num frames: 3835576320. Throughput: 0: 42459.1. Samples: 114457480. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2024-06-28 13:24:32,922][09190] Avg episode reward: [(0, '0.594')] [2024-06-28 13:24:33,274][09423] Updated weights for policy 0, policy_version 234107 (0.0043) [2024-06-28 13:24:37,038][09423] Updated weights for policy 0, policy_version 234117 (0.0028) [2024-06-28 13:24:37,921][09190] Fps is (10 sec: 40960.8, 60 sec: 42325.3, 300 sec: 42320.7). Total num frames: 3835789312. Throughput: 0: 42595.9. Samples: 114719940. Policy #0 lag: (min: 1.0, avg: 10.0, max: 23.0) [2024-06-28 13:24:37,922][09190] Avg episode reward: [(0, '0.597')] [2024-06-28 13:24:40,807][09423] Updated weights for policy 0, policy_version 234127 (0.0043) [2024-06-28 13:24:42,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42052.3, 300 sec: 42376.2). Total num frames: 3836002304. Throughput: 0: 42453.8. Samples: 114845360. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 13:24:42,922][09190] Avg episode reward: [(0, '0.595')] [2024-06-28 13:24:44,629][09423] Updated weights for policy 0, policy_version 234137 (0.0038) [2024-06-28 13:24:47,921][09190] Fps is (10 sec: 44237.2, 60 sec: 42871.5, 300 sec: 42487.3). Total num frames: 3836231680. Throughput: 0: 42519.7. Samples: 115095860. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 13:24:47,922][09190] Avg episode reward: [(0, '0.599')] [2024-06-28 13:24:48,482][09423] Updated weights for policy 0, policy_version 234147 (0.0029) [2024-06-28 13:24:52,346][09423] Updated weights for policy 0, policy_version 234157 (0.0043) [2024-06-28 13:24:52,921][09190] Fps is (10 sec: 42598.6, 60 sec: 42325.3, 300 sec: 42320.7). Total num frames: 3836428288. Throughput: 0: 42456.1. Samples: 115350560. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 13:24:52,922][09190] Avg episode reward: [(0, '0.606')] [2024-06-28 13:24:56,970][09423] Updated weights for policy 0, policy_version 234167 (0.0035) [2024-06-28 13:24:57,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42053.6, 300 sec: 42320.7). Total num frames: 3836641280. Throughput: 0: 42425.5. Samples: 115477420. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 13:24:57,922][09190] Avg episode reward: [(0, '0.597')] [2024-06-28 13:25:00,131][09423] Updated weights for policy 0, policy_version 234177 (0.0040) [2024-06-28 13:25:02,921][09190] Fps is (10 sec: 44236.3, 60 sec: 42598.4, 300 sec: 42431.8). Total num frames: 3836870656. Throughput: 0: 42671.0. Samples: 115735400. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 13:25:02,922][09190] Avg episode reward: [(0, '0.595')] [2024-06-28 13:25:04,664][09423] Updated weights for policy 0, policy_version 234187 (0.0029) [2024-06-28 13:25:07,890][09423] Updated weights for policy 0, policy_version 234197 (0.0035) [2024-06-28 13:25:07,924][09190] Fps is (10 sec: 44225.4, 60 sec: 42596.7, 300 sec: 42431.4). Total num frames: 3837083648. Throughput: 0: 42220.8. Samples: 115985440. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 13:25:07,924][09190] Avg episode reward: [(0, '0.590')] [2024-06-28 13:25:12,270][09423] Updated weights for policy 0, policy_version 234207 (0.0040) [2024-06-28 13:25:12,921][09190] Fps is (10 sec: 40960.6, 60 sec: 42325.5, 300 sec: 42321.1). Total num frames: 3837280256. Throughput: 0: 42371.0. Samples: 116114660. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 13:25:12,922][09190] Avg episode reward: [(0, '0.586')] [2024-06-28 13:25:15,636][09423] Updated weights for policy 0, policy_version 234217 (0.0035) [2024-06-28 13:25:16,659][09403] Signal inference workers to stop experience collection... (1600 times) [2024-06-28 13:25:16,689][09423] InferenceWorker_p0-w0: stopping experience collection (1600 times) [2024-06-28 13:25:16,771][09403] Signal inference workers to resume experience collection... (1600 times) [2024-06-28 13:25:16,771][09423] InferenceWorker_p0-w0: resuming experience collection (1600 times) [2024-06-28 13:25:17,921][09190] Fps is (10 sec: 42609.0, 60 sec: 43144.5, 300 sec: 42431.8). Total num frames: 3837509632. Throughput: 0: 42559.1. Samples: 116372640. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 13:25:17,922][09190] Avg episode reward: [(0, '0.593')] [2024-06-28 13:25:19,745][09423] Updated weights for policy 0, policy_version 234227 (0.0033) [2024-06-28 13:25:22,921][09190] Fps is (10 sec: 42597.9, 60 sec: 42871.5, 300 sec: 42376.2). Total num frames: 3837706240. Throughput: 0: 42348.0. Samples: 116625600. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 13:25:22,922][09190] Avg episode reward: [(0, '0.601')] [2024-06-28 13:25:23,425][09423] Updated weights for policy 0, policy_version 234237 (0.0026) [2024-06-28 13:25:27,387][09423] Updated weights for policy 0, policy_version 234247 (0.0031) [2024-06-28 13:25:27,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42325.5, 300 sec: 42376.3). Total num frames: 3837919232. Throughput: 0: 42310.7. Samples: 116749340. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 13:25:27,922][09190] Avg episode reward: [(0, '0.600')] [2024-06-28 13:25:31,112][09423] Updated weights for policy 0, policy_version 234257 (0.0039) [2024-06-28 13:25:32,922][09190] Fps is (10 sec: 44236.5, 60 sec: 42871.4, 300 sec: 42431.8). Total num frames: 3838148608. Throughput: 0: 42515.8. Samples: 117009080. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 13:25:32,922][09190] Avg episode reward: [(0, '0.571')] [2024-06-28 13:25:35,604][09423] Updated weights for policy 0, policy_version 234267 (0.0033) [2024-06-28 13:25:37,921][09190] Fps is (10 sec: 44236.5, 60 sec: 42871.5, 300 sec: 42431.8). Total num frames: 3838361600. Throughput: 0: 42415.5. Samples: 117259260. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 13:25:37,922][09190] Avg episode reward: [(0, '0.597')] [2024-06-28 13:25:38,950][09423] Updated weights for policy 0, policy_version 234277 (0.0050) [2024-06-28 13:25:42,921][09190] Fps is (10 sec: 37683.4, 60 sec: 42052.2, 300 sec: 42265.2). Total num frames: 3838525440. Throughput: 0: 42375.9. Samples: 117384340. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 13:25:42,922][09190] Avg episode reward: [(0, '0.595')] [2024-06-28 13:25:43,209][09423] Updated weights for policy 0, policy_version 234287 (0.0032) [2024-06-28 13:25:46,549][09423] Updated weights for policy 0, policy_version 234297 (0.0023) [2024-06-28 13:25:47,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42598.3, 300 sec: 42431.8). Total num frames: 3838787584. Throughput: 0: 42363.6. Samples: 117641760. Policy #0 lag: (min: 0.0, avg: 10.4, max: 22.0) [2024-06-28 13:25:47,922][09190] Avg episode reward: [(0, '0.599')] [2024-06-28 13:25:50,989][09423] Updated weights for policy 0, policy_version 234307 (0.0047) [2024-06-28 13:25:52,922][09190] Fps is (10 sec: 44232.9, 60 sec: 42324.6, 300 sec: 42320.6). Total num frames: 3838967808. Throughput: 0: 42431.2. Samples: 117894780. Policy #0 lag: (min: 1.0, avg: 9.6, max: 23.0) [2024-06-28 13:25:52,923][09190] Avg episode reward: [(0, '0.601')] [2024-06-28 13:25:54,200][09423] Updated weights for policy 0, policy_version 234317 (0.0039) [2024-06-28 13:25:57,924][09190] Fps is (10 sec: 39312.1, 60 sec: 42323.5, 300 sec: 42320.3). Total num frames: 3839180800. Throughput: 0: 42289.2. Samples: 118017780. Policy #0 lag: (min: 1.0, avg: 9.6, max: 23.0) [2024-06-28 13:25:57,924][09190] Avg episode reward: [(0, '0.595')] [2024-06-28 13:25:58,847][09423] Updated weights for policy 0, policy_version 234327 (0.0031) [2024-06-28 13:26:01,939][09423] Updated weights for policy 0, policy_version 234337 (0.0023) [2024-06-28 13:26:02,921][09190] Fps is (10 sec: 44240.8, 60 sec: 42325.4, 300 sec: 42376.2). Total num frames: 3839410176. Throughput: 0: 42267.1. Samples: 118274660. Policy #0 lag: (min: 1.0, avg: 9.6, max: 23.0) [2024-06-28 13:26:02,922][09190] Avg episode reward: [(0, '0.590')] [2024-06-28 13:26:06,283][09423] Updated weights for policy 0, policy_version 234347 (0.0026) [2024-06-28 13:26:07,921][09190] Fps is (10 sec: 42609.2, 60 sec: 42054.1, 300 sec: 42320.7). Total num frames: 3839606784. Throughput: 0: 42487.6. Samples: 118537540. Policy #0 lag: (min: 1.0, avg: 9.6, max: 23.0) [2024-06-28 13:26:07,922][09190] Avg episode reward: [(0, '0.602')] [2024-06-28 13:26:09,505][09423] Updated weights for policy 0, policy_version 234357 (0.0034) [2024-06-28 13:26:12,921][09190] Fps is (10 sec: 42598.6, 60 sec: 42598.4, 300 sec: 42431.8). Total num frames: 3839836160. Throughput: 0: 42537.8. Samples: 118663540. Policy #0 lag: (min: 1.0, avg: 9.6, max: 23.0) [2024-06-28 13:26:12,922][09190] Avg episode reward: [(0, '0.608')] [2024-06-28 13:26:13,721][09423] Updated weights for policy 0, policy_version 234367 (0.0034) [2024-06-28 13:26:17,384][09423] Updated weights for policy 0, policy_version 234377 (0.0033) [2024-06-28 13:26:17,921][09190] Fps is (10 sec: 45874.7, 60 sec: 42598.4, 300 sec: 42487.3). Total num frames: 3840065536. Throughput: 0: 42407.6. Samples: 118917420. Policy #0 lag: (min: 1.0, avg: 9.6, max: 23.0) [2024-06-28 13:26:17,922][09190] Avg episode reward: [(0, '0.598')] [2024-06-28 13:26:17,939][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000234379_3840065536.pth... [2024-06-28 13:26:17,998][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000233757_3829874688.pth [2024-06-28 13:26:21,356][09423] Updated weights for policy 0, policy_version 234387 (0.0037) [2024-06-28 13:26:22,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42325.4, 300 sec: 42320.7). Total num frames: 3840245760. Throughput: 0: 42477.4. Samples: 119170740. Policy #0 lag: (min: 1.0, avg: 9.6, max: 23.0) [2024-06-28 13:26:22,926][09190] Avg episode reward: [(0, '0.606')] [2024-06-28 13:26:25,310][09423] Updated weights for policy 0, policy_version 234397 (0.0029) [2024-06-28 13:26:25,502][09403] Signal inference workers to stop experience collection... (1650 times) [2024-06-28 13:26:25,547][09423] InferenceWorker_p0-w0: stopping experience collection (1650 times) [2024-06-28 13:26:25,554][09403] Signal inference workers to resume experience collection... (1650 times) [2024-06-28 13:26:25,561][09423] InferenceWorker_p0-w0: resuming experience collection (1650 times) [2024-06-28 13:26:27,921][09190] Fps is (10 sec: 39321.8, 60 sec: 42325.3, 300 sec: 42376.2). Total num frames: 3840458752. Throughput: 0: 42504.5. Samples: 119297040. Policy #0 lag: (min: 1.0, avg: 9.6, max: 23.0) [2024-06-28 13:26:27,922][09190] Avg episode reward: [(0, '0.567')] [2024-06-28 13:26:29,546][09423] Updated weights for policy 0, policy_version 234407 (0.0037) [2024-06-28 13:26:32,826][09423] Updated weights for policy 0, policy_version 234417 (0.0028) [2024-06-28 13:26:32,922][09190] Fps is (10 sec: 44236.5, 60 sec: 42325.4, 300 sec: 42376.2). Total num frames: 3840688128. Throughput: 0: 42545.3. Samples: 119556300. Policy #0 lag: (min: 1.0, avg: 9.6, max: 23.0) [2024-06-28 13:26:32,922][09190] Avg episode reward: [(0, '0.600')] [2024-06-28 13:26:37,099][09423] Updated weights for policy 0, policy_version 234427 (0.0061) [2024-06-28 13:26:37,921][09190] Fps is (10 sec: 40960.3, 60 sec: 41779.3, 300 sec: 42265.2). Total num frames: 3840868352. Throughput: 0: 42472.1. Samples: 119805980. Policy #0 lag: (min: 1.0, avg: 9.6, max: 23.0) [2024-06-28 13:26:37,922][09190] Avg episode reward: [(0, '0.605')] [2024-06-28 13:26:40,571][09423] Updated weights for policy 0, policy_version 234437 (0.0037) [2024-06-28 13:26:42,924][09190] Fps is (10 sec: 40950.1, 60 sec: 42869.7, 300 sec: 42375.9). Total num frames: 3841097728. Throughput: 0: 42406.7. Samples: 119926080. Policy #0 lag: (min: 1.0, avg: 9.6, max: 23.0) [2024-06-28 13:26:42,924][09190] Avg episode reward: [(0, '0.603')] [2024-06-28 13:26:44,965][09423] Updated weights for policy 0, policy_version 234447 (0.0046) [2024-06-28 13:26:47,921][09190] Fps is (10 sec: 44236.4, 60 sec: 42052.3, 300 sec: 42376.2). Total num frames: 3841310720. Throughput: 0: 42429.8. Samples: 120184000. Policy #0 lag: (min: 1.0, avg: 9.6, max: 23.0) [2024-06-28 13:26:47,922][09190] Avg episode reward: [(0, '0.603')] [2024-06-28 13:26:48,340][09423] Updated weights for policy 0, policy_version 234457 (0.0044) [2024-06-28 13:26:52,684][09423] Updated weights for policy 0, policy_version 234467 (0.0037) [2024-06-28 13:26:52,921][09190] Fps is (10 sec: 40970.5, 60 sec: 42326.1, 300 sec: 42265.2). Total num frames: 3841507328. Throughput: 0: 42128.5. Samples: 120433320. Policy #0 lag: (min: 1.0, avg: 9.6, max: 23.0) [2024-06-28 13:26:52,922][09190] Avg episode reward: [(0, '0.606')] [2024-06-28 13:26:56,080][09423] Updated weights for policy 0, policy_version 234477 (0.0036) [2024-06-28 13:26:57,921][09190] Fps is (10 sec: 44237.0, 60 sec: 42873.2, 300 sec: 42431.8). Total num frames: 3841753088. Throughput: 0: 42217.3. Samples: 120563320. Policy #0 lag: (min: 1.0, avg: 9.6, max: 23.0) [2024-06-28 13:26:57,922][09190] Avg episode reward: [(0, '0.602')] [2024-06-28 13:27:00,399][09423] Updated weights for policy 0, policy_version 234487 (0.0029) [2024-06-28 13:27:02,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42052.3, 300 sec: 42265.2). Total num frames: 3841933312. Throughput: 0: 42177.9. Samples: 120815420. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 13:27:02,922][09190] Avg episode reward: [(0, '0.600')] [2024-06-28 13:27:04,118][09423] Updated weights for policy 0, policy_version 234497 (0.0034) [2024-06-28 13:27:07,924][09190] Fps is (10 sec: 37673.8, 60 sec: 42050.5, 300 sec: 42209.3). Total num frames: 3842129920. Throughput: 0: 42207.9. Samples: 121070200. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 13:27:07,924][09190] Avg episode reward: [(0, '0.606')] [2024-06-28 13:27:08,100][09423] Updated weights for policy 0, policy_version 234507 (0.0041) [2024-06-28 13:27:11,805][09423] Updated weights for policy 0, policy_version 234517 (0.0054) [2024-06-28 13:27:12,921][09190] Fps is (10 sec: 45874.7, 60 sec: 42598.4, 300 sec: 42431.8). Total num frames: 3842392064. Throughput: 0: 42270.6. Samples: 121199220. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 13:27:12,922][09190] Avg episode reward: [(0, '0.582')] [2024-06-28 13:27:16,082][09423] Updated weights for policy 0, policy_version 234527 (0.0035) [2024-06-28 13:27:17,921][09190] Fps is (10 sec: 44248.0, 60 sec: 41779.3, 300 sec: 42376.2). Total num frames: 3842572288. Throughput: 0: 41980.1. Samples: 121445400. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 13:27:17,922][09190] Avg episode reward: [(0, '0.598')] [2024-06-28 13:27:19,557][09423] Updated weights for policy 0, policy_version 234537 (0.0052) [2024-06-28 13:27:22,921][09190] Fps is (10 sec: 37683.2, 60 sec: 42052.2, 300 sec: 42320.7). Total num frames: 3842768896. Throughput: 0: 42037.3. Samples: 121697660. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 13:27:22,922][09190] Avg episode reward: [(0, '0.605')] [2024-06-28 13:27:23,948][09423] Updated weights for policy 0, policy_version 234547 (0.0038) [2024-06-28 13:27:27,271][09423] Updated weights for policy 0, policy_version 234557 (0.0037) [2024-06-28 13:27:27,921][09190] Fps is (10 sec: 42598.2, 60 sec: 42325.3, 300 sec: 42376.2). Total num frames: 3842998272. Throughput: 0: 42122.3. Samples: 121821480. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 13:27:27,922][09190] Avg episode reward: [(0, '0.598')] [2024-06-28 13:27:31,623][09423] Updated weights for policy 0, policy_version 234567 (0.0050) [2024-06-28 13:27:32,921][09190] Fps is (10 sec: 44236.8, 60 sec: 42052.3, 300 sec: 42376.2). Total num frames: 3843211264. Throughput: 0: 42123.1. Samples: 122079540. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 13:27:32,922][09190] Avg episode reward: [(0, '0.597')] [2024-06-28 13:27:35,040][09423] Updated weights for policy 0, policy_version 234577 (0.0030) [2024-06-28 13:27:36,549][09403] Signal inference workers to stop experience collection... (1700 times) [2024-06-28 13:27:36,549][09403] Signal inference workers to resume experience collection... (1700 times) [2024-06-28 13:27:36,579][09423] InferenceWorker_p0-w0: stopping experience collection (1700 times) [2024-06-28 13:27:36,579][09423] InferenceWorker_p0-w0: resuming experience collection (1700 times) [2024-06-28 13:27:37,921][09190] Fps is (10 sec: 42598.6, 60 sec: 42598.4, 300 sec: 42432.1). Total num frames: 3843424256. Throughput: 0: 42136.9. Samples: 122329480. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 13:27:37,922][09190] Avg episode reward: [(0, '0.606')] [2024-06-28 13:27:39,202][09423] Updated weights for policy 0, policy_version 234587 (0.0040) [2024-06-28 13:27:42,921][09190] Fps is (10 sec: 40960.5, 60 sec: 42054.1, 300 sec: 42320.7). Total num frames: 3843620864. Throughput: 0: 42109.4. Samples: 122458240. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 13:27:42,922][09190] Avg episode reward: [(0, '0.610')] [2024-06-28 13:27:43,031][09423] Updated weights for policy 0, policy_version 234597 (0.0029) [2024-06-28 13:27:46,755][09423] Updated weights for policy 0, policy_version 234607 (0.0040) [2024-06-28 13:27:47,921][09190] Fps is (10 sec: 39321.8, 60 sec: 41779.3, 300 sec: 42265.8). Total num frames: 3843817472. Throughput: 0: 42078.7. Samples: 122708960. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 13:27:47,922][09190] Avg episode reward: [(0, '0.594')] [2024-06-28 13:27:50,848][09423] Updated weights for policy 0, policy_version 234617 (0.0047) [2024-06-28 13:27:52,922][09190] Fps is (10 sec: 44235.9, 60 sec: 42598.3, 300 sec: 42431.8). Total num frames: 3844063232. Throughput: 0: 42032.9. Samples: 122961580. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 13:27:52,922][09190] Avg episode reward: [(0, '0.589')] [2024-06-28 13:27:54,658][09423] Updated weights for policy 0, policy_version 234627 (0.0050) [2024-06-28 13:27:57,922][09190] Fps is (10 sec: 44235.9, 60 sec: 41779.1, 300 sec: 42320.7). Total num frames: 3844259840. Throughput: 0: 42106.2. Samples: 123094000. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 13:27:57,922][09190] Avg episode reward: [(0, '0.604')] [2024-06-28 13:27:58,515][09423] Updated weights for policy 0, policy_version 234637 (0.0054) [2024-06-28 13:28:02,890][09423] Updated weights for policy 0, policy_version 234647 (0.0029) [2024-06-28 13:28:02,921][09190] Fps is (10 sec: 39321.8, 60 sec: 42052.2, 300 sec: 42265.5). Total num frames: 3844456448. Throughput: 0: 42293.2. Samples: 123348600. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 13:28:02,922][09190] Avg episode reward: [(0, '0.599')] [2024-06-28 13:28:06,239][09423] Updated weights for policy 0, policy_version 234657 (0.0029) [2024-06-28 13:28:07,921][09190] Fps is (10 sec: 44237.3, 60 sec: 42873.3, 300 sec: 42431.8). Total num frames: 3844702208. Throughput: 0: 42148.0. Samples: 123594320. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 13:28:07,922][09190] Avg episode reward: [(0, '0.599')] [2024-06-28 13:28:10,453][09423] Updated weights for policy 0, policy_version 234667 (0.0036) [2024-06-28 13:28:12,921][09190] Fps is (10 sec: 42598.9, 60 sec: 41506.2, 300 sec: 42265.2). Total num frames: 3844882432. Throughput: 0: 42437.0. Samples: 123731140. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 13:28:12,922][09190] Avg episode reward: [(0, '0.611')] [2024-06-28 13:28:14,259][09423] Updated weights for policy 0, policy_version 234677 (0.0039) [2024-06-28 13:28:17,921][09190] Fps is (10 sec: 39321.5, 60 sec: 42052.2, 300 sec: 42265.8). Total num frames: 3845095424. Throughput: 0: 42172.5. Samples: 123977300. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 13:28:17,922][09190] Avg episode reward: [(0, '0.610')] [2024-06-28 13:28:18,018][09423] Updated weights for policy 0, policy_version 234687 (0.0033) [2024-06-28 13:28:18,021][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000234687_3845111808.pth... [2024-06-28 13:28:18,073][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000234066_3834937344.pth [2024-06-28 13:28:21,977][09423] Updated weights for policy 0, policy_version 234697 (0.0036) [2024-06-28 13:28:22,924][09190] Fps is (10 sec: 42587.6, 60 sec: 42323.6, 300 sec: 42320.4). Total num frames: 3845308416. Throughput: 0: 42185.2. Samples: 124227920. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 13:28:22,924][09190] Avg episode reward: [(0, '0.612')] [2024-06-28 13:28:25,719][09423] Updated weights for policy 0, policy_version 234707 (0.0039) [2024-06-28 13:28:27,921][09190] Fps is (10 sec: 40960.3, 60 sec: 41779.2, 300 sec: 42320.7). Total num frames: 3845505024. Throughput: 0: 42086.6. Samples: 124352140. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 13:28:27,922][09190] Avg episode reward: [(0, '0.594')] [2024-06-28 13:28:29,600][09423] Updated weights for policy 0, policy_version 234717 (0.0027) [2024-06-28 13:28:32,922][09190] Fps is (10 sec: 42608.4, 60 sec: 42052.2, 300 sec: 42320.7). Total num frames: 3845734400. Throughput: 0: 42054.5. Samples: 124601420. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 13:28:32,922][09190] Avg episode reward: [(0, '0.595')] [2024-06-28 13:28:33,674][09423] Updated weights for policy 0, policy_version 234727 (0.0028) [2024-06-28 13:28:37,555][09423] Updated weights for policy 0, policy_version 234737 (0.0032) [2024-06-28 13:28:37,921][09190] Fps is (10 sec: 44237.1, 60 sec: 42052.3, 300 sec: 42265.2). Total num frames: 3845947392. Throughput: 0: 42351.3. Samples: 124867380. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 13:28:37,922][09190] Avg episode reward: [(0, '0.593')] [2024-06-28 13:28:41,100][09423] Updated weights for policy 0, policy_version 234747 (0.0047) [2024-06-28 13:28:42,922][09190] Fps is (10 sec: 40960.2, 60 sec: 42052.1, 300 sec: 42320.7). Total num frames: 3846144000. Throughput: 0: 42195.1. Samples: 124992780. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 13:28:42,922][09190] Avg episode reward: [(0, '0.607')] [2024-06-28 13:28:45,167][09423] Updated weights for policy 0, policy_version 234757 (0.0026) [2024-06-28 13:28:47,921][09190] Fps is (10 sec: 42597.6, 60 sec: 42598.3, 300 sec: 42320.7). Total num frames: 3846373376. Throughput: 0: 42120.9. Samples: 125244040. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 13:28:47,922][09190] Avg episode reward: [(0, '0.610')] [2024-06-28 13:28:48,903][09423] Updated weights for policy 0, policy_version 234767 (0.0050) [2024-06-28 13:28:52,774][09423] Updated weights for policy 0, policy_version 234777 (0.0037) [2024-06-28 13:28:52,921][09190] Fps is (10 sec: 44237.2, 60 sec: 42052.3, 300 sec: 42265.4). Total num frames: 3846586368. Throughput: 0: 42511.5. Samples: 125507340. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 13:28:52,923][09190] Avg episode reward: [(0, '0.590')] [2024-06-28 13:28:56,583][09423] Updated weights for policy 0, policy_version 234787 (0.0034) [2024-06-28 13:28:57,921][09190] Fps is (10 sec: 40960.4, 60 sec: 42052.4, 300 sec: 42265.2). Total num frames: 3846782976. Throughput: 0: 42188.9. Samples: 125629640. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 13:28:57,922][09190] Avg episode reward: [(0, '0.605')] [2024-06-28 13:28:59,521][09403] Signal inference workers to stop experience collection... (1750 times) [2024-06-28 13:28:59,556][09423] InferenceWorker_p0-w0: stopping experience collection (1750 times) [2024-06-28 13:28:59,580][09403] Signal inference workers to resume experience collection... (1750 times) [2024-06-28 13:28:59,581][09423] InferenceWorker_p0-w0: resuming experience collection (1750 times) [2024-06-28 13:29:00,688][09423] Updated weights for policy 0, policy_version 234797 (0.0038) [2024-06-28 13:29:02,922][09190] Fps is (10 sec: 42597.9, 60 sec: 42598.4, 300 sec: 42320.7). Total num frames: 3847012352. Throughput: 0: 42282.1. Samples: 125880000. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 13:29:02,922][09190] Avg episode reward: [(0, '0.610')] [2024-06-28 13:29:04,380][09423] Updated weights for policy 0, policy_version 234807 (0.0049) [2024-06-28 13:29:07,921][09190] Fps is (10 sec: 42598.2, 60 sec: 41779.2, 300 sec: 42265.2). Total num frames: 3847208960. Throughput: 0: 42255.2. Samples: 126129300. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 13:29:07,922][09190] Avg episode reward: [(0, '0.609')] [2024-06-28 13:29:08,435][09423] Updated weights for policy 0, policy_version 234817 (0.0024) [2024-06-28 13:29:12,027][09423] Updated weights for policy 0, policy_version 234827 (0.0036) [2024-06-28 13:29:12,921][09190] Fps is (10 sec: 42599.2, 60 sec: 42598.4, 300 sec: 42431.8). Total num frames: 3847438336. Throughput: 0: 42301.8. Samples: 126255720. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 13:29:12,922][09190] Avg episode reward: [(0, '0.610')] [2024-06-28 13:29:16,421][09423] Updated weights for policy 0, policy_version 234837 (0.0039) [2024-06-28 13:29:17,921][09190] Fps is (10 sec: 44236.9, 60 sec: 42598.4, 300 sec: 42431.8). Total num frames: 3847651328. Throughput: 0: 42407.2. Samples: 126509740. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 13:29:17,922][09190] Avg episode reward: [(0, '0.607')] [2024-06-28 13:29:20,026][09423] Updated weights for policy 0, policy_version 234847 (0.0046) [2024-06-28 13:29:22,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42327.1, 300 sec: 42265.2). Total num frames: 3847847936. Throughput: 0: 42059.5. Samples: 126760060. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 13:29:22,922][09190] Avg episode reward: [(0, '0.602')] [2024-06-28 13:29:24,405][09423] Updated weights for policy 0, policy_version 234857 (0.0037) [2024-06-28 13:29:27,757][09423] Updated weights for policy 0, policy_version 234867 (0.0031) [2024-06-28 13:29:27,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42598.3, 300 sec: 42320.7). Total num frames: 3848060928. Throughput: 0: 41944.5. Samples: 126880280. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 13:29:27,922][09190] Avg episode reward: [(0, '0.608')] [2024-06-28 13:29:31,981][09423] Updated weights for policy 0, policy_version 234877 (0.0042) [2024-06-28 13:29:32,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42325.4, 300 sec: 42320.7). Total num frames: 3848273920. Throughput: 0: 42169.9. Samples: 127141680. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 13:29:32,922][09190] Avg episode reward: [(0, '0.605')] [2024-06-28 13:29:35,920][09423] Updated weights for policy 0, policy_version 234887 (0.0042) [2024-06-28 13:29:37,921][09190] Fps is (10 sec: 39322.1, 60 sec: 41779.2, 300 sec: 42209.6). Total num frames: 3848454144. Throughput: 0: 41922.3. Samples: 127393840. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 13:29:37,922][09190] Avg episode reward: [(0, '0.612')] [2024-06-28 13:29:39,850][09423] Updated weights for policy 0, policy_version 234897 (0.0035) [2024-06-28 13:29:42,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42325.4, 300 sec: 42209.6). Total num frames: 3848683520. Throughput: 0: 41875.6. Samples: 127514040. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 13:29:42,922][09190] Avg episode reward: [(0, '0.612')] [2024-06-28 13:29:43,471][09423] Updated weights for policy 0, policy_version 234907 (0.0032) [2024-06-28 13:29:47,437][09423] Updated weights for policy 0, policy_version 234917 (0.0037) [2024-06-28 13:29:47,921][09190] Fps is (10 sec: 44236.3, 60 sec: 42052.3, 300 sec: 42265.2). Total num frames: 3848896512. Throughput: 0: 41933.0. Samples: 127766980. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 13:29:47,922][09190] Avg episode reward: [(0, '0.619')] [2024-06-28 13:29:51,055][09423] Updated weights for policy 0, policy_version 234927 (0.0034) [2024-06-28 13:29:52,921][09190] Fps is (10 sec: 40960.1, 60 sec: 41779.3, 300 sec: 42209.6). Total num frames: 3849093120. Throughput: 0: 42105.4. Samples: 128024040. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 13:29:52,922][09190] Avg episode reward: [(0, '0.609')] [2024-06-28 13:29:55,702][09423] Updated weights for policy 0, policy_version 234937 (0.0040) [2024-06-28 13:29:57,921][09190] Fps is (10 sec: 40960.4, 60 sec: 42052.3, 300 sec: 42154.1). Total num frames: 3849306112. Throughput: 0: 42087.1. Samples: 128149640. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 13:29:57,922][09190] Avg episode reward: [(0, '0.611')] [2024-06-28 13:29:58,984][09423] Updated weights for policy 0, policy_version 234947 (0.0040) [2024-06-28 13:30:02,921][09190] Fps is (10 sec: 42597.7, 60 sec: 41779.2, 300 sec: 42154.4). Total num frames: 3849519104. Throughput: 0: 41911.1. Samples: 128395740. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 13:30:02,922][09190] Avg episode reward: [(0, '0.607')] [2024-06-28 13:30:03,290][09423] Updated weights for policy 0, policy_version 234957 (0.0027) [2024-06-28 13:30:06,730][09423] Updated weights for policy 0, policy_version 234967 (0.0036) [2024-06-28 13:30:07,922][09190] Fps is (10 sec: 40958.5, 60 sec: 41779.0, 300 sec: 42154.0). Total num frames: 3849715712. Throughput: 0: 42135.2. Samples: 128656160. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 13:30:07,922][09190] Avg episode reward: [(0, '0.614')] [2024-06-28 13:30:10,896][09423] Updated weights for policy 0, policy_version 234977 (0.0036) [2024-06-28 13:30:12,921][09190] Fps is (10 sec: 42598.7, 60 sec: 41779.2, 300 sec: 42154.1). Total num frames: 3849945088. Throughput: 0: 42253.4. Samples: 128781680. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 13:30:12,922][09190] Avg episode reward: [(0, '0.617')] [2024-06-28 13:30:14,281][09423] Updated weights for policy 0, policy_version 234987 (0.0035) [2024-06-28 13:30:17,921][09190] Fps is (10 sec: 44237.7, 60 sec: 41779.1, 300 sec: 42209.6). Total num frames: 3850158080. Throughput: 0: 42050.1. Samples: 129033940. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 13:30:17,922][09190] Avg episode reward: [(0, '0.621')] [2024-06-28 13:30:17,936][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000234995_3850158080.pth... [2024-06-28 13:30:17,995][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000234379_3840065536.pth [2024-06-28 13:30:18,593][09423] Updated weights for policy 0, policy_version 234997 (0.0043) [2024-06-28 13:30:22,371][09423] Updated weights for policy 0, policy_version 235007 (0.0043) [2024-06-28 13:30:22,921][09190] Fps is (10 sec: 42598.6, 60 sec: 42052.3, 300 sec: 42209.6). Total num frames: 3850371072. Throughput: 0: 42009.3. Samples: 129284260. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 13:30:22,922][09190] Avg episode reward: [(0, '0.617')] [2024-06-28 13:30:26,441][09423] Updated weights for policy 0, policy_version 235017 (0.0042) [2024-06-28 13:30:27,357][09403] Signal inference workers to stop experience collection... (1800 times) [2024-06-28 13:30:27,358][09403] Signal inference workers to resume experience collection... (1800 times) [2024-06-28 13:30:27,393][09423] InferenceWorker_p0-w0: stopping experience collection (1800 times) [2024-06-28 13:30:27,393][09423] InferenceWorker_p0-w0: resuming experience collection (1800 times) [2024-06-28 13:30:27,921][09190] Fps is (10 sec: 44237.5, 60 sec: 42325.4, 300 sec: 42209.7). Total num frames: 3850600448. Throughput: 0: 42107.1. Samples: 129408860. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 13:30:27,922][09190] Avg episode reward: [(0, '0.621')] [2024-06-28 13:30:29,940][09423] Updated weights for policy 0, policy_version 235027 (0.0052) [2024-06-28 13:30:32,921][09190] Fps is (10 sec: 42598.0, 60 sec: 42052.2, 300 sec: 42154.1). Total num frames: 3850797056. Throughput: 0: 42195.5. Samples: 129665780. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 13:30:32,922][09190] Avg episode reward: [(0, '0.628')] [2024-06-28 13:30:34,259][09423] Updated weights for policy 0, policy_version 235037 (0.0027) [2024-06-28 13:30:37,922][09190] Fps is (10 sec: 39319.4, 60 sec: 42324.9, 300 sec: 42265.1). Total num frames: 3850993664. Throughput: 0: 42017.2. Samples: 129914840. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 13:30:37,922][09190] Avg episode reward: [(0, '0.623')] [2024-06-28 13:30:38,300][09423] Updated weights for policy 0, policy_version 235047 (0.0042) [2024-06-28 13:30:42,298][09423] Updated weights for policy 0, policy_version 235057 (0.0045) [2024-06-28 13:30:42,921][09190] Fps is (10 sec: 42598.9, 60 sec: 42325.3, 300 sec: 42154.1). Total num frames: 3851223040. Throughput: 0: 42039.6. Samples: 130041420. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 13:30:42,922][09190] Avg episode reward: [(0, '0.626')] [2024-06-28 13:30:45,936][09423] Updated weights for policy 0, policy_version 235067 (0.0042) [2024-06-28 13:30:47,921][09190] Fps is (10 sec: 42601.0, 60 sec: 42052.4, 300 sec: 42209.8). Total num frames: 3851419648. Throughput: 0: 42440.6. Samples: 130305560. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 13:30:47,922][09190] Avg episode reward: [(0, '0.626')] [2024-06-28 13:30:49,717][09423] Updated weights for policy 0, policy_version 235077 (0.0026) [2024-06-28 13:30:52,921][09190] Fps is (10 sec: 40959.6, 60 sec: 42325.2, 300 sec: 42210.0). Total num frames: 3851632640. Throughput: 0: 42272.3. Samples: 130558400. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 13:30:52,922][09190] Avg episode reward: [(0, '0.619')] [2024-06-28 13:30:53,833][09423] Updated weights for policy 0, policy_version 235087 (0.0029) [2024-06-28 13:30:57,223][09423] Updated weights for policy 0, policy_version 235097 (0.0033) [2024-06-28 13:30:57,921][09190] Fps is (10 sec: 42597.8, 60 sec: 42325.3, 300 sec: 42154.1). Total num frames: 3851845632. Throughput: 0: 42278.2. Samples: 130684200. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 13:30:57,922][09190] Avg episode reward: [(0, '0.632')] [2024-06-28 13:31:01,423][09423] Updated weights for policy 0, policy_version 235107 (0.0029) [2024-06-28 13:31:02,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42325.3, 300 sec: 42209.6). Total num frames: 3852058624. Throughput: 0: 42584.5. Samples: 130950240. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 13:31:02,922][09190] Avg episode reward: [(0, '0.630')] [2024-06-28 13:31:04,777][09423] Updated weights for policy 0, policy_version 235117 (0.0026) [2024-06-28 13:31:07,921][09190] Fps is (10 sec: 42598.1, 60 sec: 42598.6, 300 sec: 42154.1). Total num frames: 3852271616. Throughput: 0: 42355.9. Samples: 131190280. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 13:31:07,928][09190] Avg episode reward: [(0, '0.640')] [2024-06-28 13:31:07,952][09403] Saving new best policy, reward=0.640! [2024-06-28 13:31:08,980][09423] Updated weights for policy 0, policy_version 235127 (0.0039) [2024-06-28 13:31:12,776][09423] Updated weights for policy 0, policy_version 235137 (0.0039) [2024-06-28 13:31:12,921][09190] Fps is (10 sec: 42598.9, 60 sec: 42325.4, 300 sec: 42098.6). Total num frames: 3852484608. Throughput: 0: 42461.3. Samples: 131319620. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 13:31:12,922][09190] Avg episode reward: [(0, '0.639')] [2024-06-28 13:31:16,735][09423] Updated weights for policy 0, policy_version 235147 (0.0037) [2024-06-28 13:31:17,922][09190] Fps is (10 sec: 40959.8, 60 sec: 42052.2, 300 sec: 42154.1). Total num frames: 3852681216. Throughput: 0: 42443.9. Samples: 131575760. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 13:31:17,922][09190] Avg episode reward: [(0, '0.637')] [2024-06-28 13:31:20,349][09423] Updated weights for policy 0, policy_version 235157 (0.0026) [2024-06-28 13:31:22,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42325.3, 300 sec: 42209.6). Total num frames: 3852910592. Throughput: 0: 42369.9. Samples: 131821460. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 13:31:22,922][09190] Avg episode reward: [(0, '0.642')] [2024-06-28 13:31:22,972][09403] Saving new best policy, reward=0.642! [2024-06-28 13:31:24,646][09423] Updated weights for policy 0, policy_version 235167 (0.0038) [2024-06-28 13:31:27,921][09190] Fps is (10 sec: 42599.3, 60 sec: 41779.2, 300 sec: 42098.6). Total num frames: 3853107200. Throughput: 0: 42646.7. Samples: 131960520. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 13:31:27,922][09190] Avg episode reward: [(0, '0.646')] [2024-06-28 13:31:27,944][09403] Saving new best policy, reward=0.646! [2024-06-28 13:31:28,333][09423] Updated weights for policy 0, policy_version 235177 (0.0047) [2024-06-28 13:31:32,192][09423] Updated weights for policy 0, policy_version 235187 (0.0038) [2024-06-28 13:31:32,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42052.3, 300 sec: 42209.6). Total num frames: 3853320192. Throughput: 0: 42217.7. Samples: 132205360. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 13:31:32,922][09190] Avg episode reward: [(0, '0.641')] [2024-06-28 13:31:34,128][09403] Signal inference workers to stop experience collection... (1850 times) [2024-06-28 13:31:34,161][09423] InferenceWorker_p0-w0: stopping experience collection (1850 times) [2024-06-28 13:31:34,174][09403] Signal inference workers to resume experience collection... (1850 times) [2024-06-28 13:31:34,183][09423] InferenceWorker_p0-w0: resuming experience collection (1850 times) [2024-06-28 13:31:36,051][09423] Updated weights for policy 0, policy_version 235197 (0.0030) [2024-06-28 13:31:37,922][09190] Fps is (10 sec: 44236.1, 60 sec: 42598.7, 300 sec: 42210.0). Total num frames: 3853549568. Throughput: 0: 42208.4. Samples: 132457780. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 13:31:37,922][09190] Avg episode reward: [(0, '0.642')] [2024-06-28 13:31:39,836][09423] Updated weights for policy 0, policy_version 235207 (0.0035) [2024-06-28 13:31:42,921][09190] Fps is (10 sec: 44236.4, 60 sec: 42325.3, 300 sec: 42209.6). Total num frames: 3853762560. Throughput: 0: 42420.9. Samples: 132593140. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-28 13:31:42,922][09190] Avg episode reward: [(0, '0.651')] [2024-06-28 13:31:42,923][09403] Saving new best policy, reward=0.651! [2024-06-28 13:31:43,505][09423] Updated weights for policy 0, policy_version 235217 (0.0056) [2024-06-28 13:31:47,376][09423] Updated weights for policy 0, policy_version 235227 (0.0040) [2024-06-28 13:31:47,924][09190] Fps is (10 sec: 40950.3, 60 sec: 42323.5, 300 sec: 42209.3). Total num frames: 3853959168. Throughput: 0: 42142.6. Samples: 132846760. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-28 13:31:47,924][09190] Avg episode reward: [(0, '0.656')] [2024-06-28 13:31:47,932][09403] Saving new best policy, reward=0.656! [2024-06-28 13:31:51,116][09423] Updated weights for policy 0, policy_version 235237 (0.0034) [2024-06-28 13:31:52,924][09190] Fps is (10 sec: 44225.8, 60 sec: 42869.7, 300 sec: 42209.3). Total num frames: 3854204928. Throughput: 0: 42290.2. Samples: 133093440. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-28 13:31:52,924][09190] Avg episode reward: [(0, '0.659')] [2024-06-28 13:31:52,925][09403] Saving new best policy, reward=0.659! [2024-06-28 13:31:55,285][09423] Updated weights for policy 0, policy_version 235247 (0.0039) [2024-06-28 13:31:57,922][09190] Fps is (10 sec: 42608.4, 60 sec: 42325.3, 300 sec: 42209.6). Total num frames: 3854385152. Throughput: 0: 42384.3. Samples: 133226920. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-28 13:31:57,922][09190] Avg episode reward: [(0, '0.655')] [2024-06-28 13:31:59,110][09423] Updated weights for policy 0, policy_version 235257 (0.0042) [2024-06-28 13:32:02,921][09190] Fps is (10 sec: 39331.3, 60 sec: 42325.4, 300 sec: 42265.5). Total num frames: 3854598144. Throughput: 0: 42268.1. Samples: 133477820. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-28 13:32:02,922][09190] Avg episode reward: [(0, '0.660')] [2024-06-28 13:32:02,971][09403] Saving new best policy, reward=0.660! [2024-06-28 13:32:02,984][09423] Updated weights for policy 0, policy_version 235267 (0.0030) [2024-06-28 13:32:06,958][09423] Updated weights for policy 0, policy_version 235277 (0.0023) [2024-06-28 13:32:07,924][09190] Fps is (10 sec: 45864.2, 60 sec: 42869.7, 300 sec: 42209.3). Total num frames: 3854843904. Throughput: 0: 42474.9. Samples: 133732940. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-28 13:32:07,925][09190] Avg episode reward: [(0, '0.663')] [2024-06-28 13:32:07,934][09403] Saving new best policy, reward=0.663! [2024-06-28 13:32:10,717][09423] Updated weights for policy 0, policy_version 235287 (0.0031) [2024-06-28 13:32:12,921][09190] Fps is (10 sec: 42598.7, 60 sec: 42325.3, 300 sec: 42209.6). Total num frames: 3855024128. Throughput: 0: 42216.9. Samples: 133860280. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-28 13:32:12,922][09190] Avg episode reward: [(0, '0.659')] [2024-06-28 13:32:14,770][09423] Updated weights for policy 0, policy_version 235297 (0.0036) [2024-06-28 13:32:17,921][09190] Fps is (10 sec: 39331.3, 60 sec: 42598.5, 300 sec: 42265.2). Total num frames: 3855237120. Throughput: 0: 42242.5. Samples: 134106280. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-28 13:32:17,922][09190] Avg episode reward: [(0, '0.677')] [2024-06-28 13:32:17,938][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000235305_3855237120.pth... [2024-06-28 13:32:17,995][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000234687_3845111808.pth [2024-06-28 13:32:18,008][09403] Saving new best policy, reward=0.677! [2024-06-28 13:32:18,414][09423] Updated weights for policy 0, policy_version 235307 (0.0032) [2024-06-28 13:32:22,514][09423] Updated weights for policy 0, policy_version 235317 (0.0035) [2024-06-28 13:32:22,921][09190] Fps is (10 sec: 44236.6, 60 sec: 42598.4, 300 sec: 42265.2). Total num frames: 3855466496. Throughput: 0: 42287.6. Samples: 134360720. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-28 13:32:22,922][09190] Avg episode reward: [(0, '0.666')] [2024-06-28 13:32:26,367][09423] Updated weights for policy 0, policy_version 235327 (0.0051) [2024-06-28 13:32:27,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42325.3, 300 sec: 42154.1). Total num frames: 3855646720. Throughput: 0: 42143.1. Samples: 134489580. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-28 13:32:27,922][09190] Avg episode reward: [(0, '0.669')] [2024-06-28 13:32:30,143][09423] Updated weights for policy 0, policy_version 235337 (0.0044) [2024-06-28 13:32:32,922][09190] Fps is (10 sec: 40959.5, 60 sec: 42598.3, 300 sec: 42209.6). Total num frames: 3855876096. Throughput: 0: 42136.4. Samples: 134742800. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-28 13:32:32,922][09190] Avg episode reward: [(0, '0.668')] [2024-06-28 13:32:34,075][09423] Updated weights for policy 0, policy_version 235347 (0.0024) [2024-06-28 13:32:37,713][09423] Updated weights for policy 0, policy_version 235357 (0.0037) [2024-06-28 13:32:37,921][09190] Fps is (10 sec: 44237.1, 60 sec: 42325.4, 300 sec: 42265.2). Total num frames: 3856089088. Throughput: 0: 42433.5. Samples: 135002840. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-28 13:32:37,922][09190] Avg episode reward: [(0, '0.679')] [2024-06-28 13:32:37,935][09403] Saving new best policy, reward=0.679! [2024-06-28 13:32:41,755][09423] Updated weights for policy 0, policy_version 235367 (0.0028) [2024-06-28 13:32:42,921][09190] Fps is (10 sec: 39322.5, 60 sec: 41779.3, 300 sec: 42209.6). Total num frames: 3856269312. Throughput: 0: 42218.0. Samples: 135126720. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-28 13:32:42,922][09190] Avg episode reward: [(0, '0.683')] [2024-06-28 13:32:42,967][09403] Saving new best policy, reward=0.683! [2024-06-28 13:32:45,713][09423] Updated weights for policy 0, policy_version 235377 (0.0052) [2024-06-28 13:32:47,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42600.2, 300 sec: 42209.7). Total num frames: 3856515072. Throughput: 0: 42222.3. Samples: 135377820. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-28 13:32:47,922][09190] Avg episode reward: [(0, '0.682')] [2024-06-28 13:32:49,754][09423] Updated weights for policy 0, policy_version 235387 (0.0025) [2024-06-28 13:32:52,921][09190] Fps is (10 sec: 44236.3, 60 sec: 41780.9, 300 sec: 42209.6). Total num frames: 3856711680. Throughput: 0: 42286.8. Samples: 135635740. Policy #0 lag: (min: 0.0, avg: 12.1, max: 24.0) [2024-06-28 13:32:52,923][09190] Avg episode reward: [(0, '0.691')] [2024-06-28 13:32:52,923][09403] Saving new best policy, reward=0.691! [2024-06-28 13:32:53,393][09423] Updated weights for policy 0, policy_version 235397 (0.0037) [2024-06-28 13:32:57,331][09423] Updated weights for policy 0, policy_version 235407 (0.0030) [2024-06-28 13:32:57,921][09190] Fps is (10 sec: 39321.5, 60 sec: 42052.4, 300 sec: 42209.6). Total num frames: 3856908288. Throughput: 0: 42242.7. Samples: 135761200. Policy #0 lag: (min: 0.0, avg: 12.1, max: 24.0) [2024-06-28 13:32:57,922][09190] Avg episode reward: [(0, '0.688')] [2024-06-28 13:33:01,048][09423] Updated weights for policy 0, policy_version 235417 (0.0044) [2024-06-28 13:33:02,921][09190] Fps is (10 sec: 45875.5, 60 sec: 42871.5, 300 sec: 42265.2). Total num frames: 3857170432. Throughput: 0: 42383.2. Samples: 136013520. Policy #0 lag: (min: 0.0, avg: 12.1, max: 24.0) [2024-06-28 13:33:02,922][09190] Avg episode reward: [(0, '0.695')] [2024-06-28 13:33:02,922][09403] Saving new best policy, reward=0.695! [2024-06-28 13:33:04,802][09423] Updated weights for policy 0, policy_version 235427 (0.0041) [2024-06-28 13:33:07,921][09190] Fps is (10 sec: 42598.4, 60 sec: 41507.9, 300 sec: 42209.6). Total num frames: 3857334272. Throughput: 0: 42469.8. Samples: 136271860. Policy #0 lag: (min: 0.0, avg: 12.1, max: 24.0) [2024-06-28 13:33:07,922][09190] Avg episode reward: [(0, '0.687')] [2024-06-28 13:33:08,801][09423] Updated weights for policy 0, policy_version 235437 (0.0029) [2024-06-28 13:33:10,020][09403] Signal inference workers to stop experience collection... (1900 times) [2024-06-28 13:33:10,053][09423] InferenceWorker_p0-w0: stopping experience collection (1900 times) [2024-06-28 13:33:10,070][09403] Signal inference workers to resume experience collection... (1900 times) [2024-06-28 13:33:10,076][09423] InferenceWorker_p0-w0: resuming experience collection (1900 times) [2024-06-28 13:33:12,922][09190] Fps is (10 sec: 37682.6, 60 sec: 42052.2, 300 sec: 42209.6). Total num frames: 3857547264. Throughput: 0: 42227.0. Samples: 136389800. Policy #0 lag: (min: 0.0, avg: 12.1, max: 24.0) [2024-06-28 13:33:12,922][09190] Avg episode reward: [(0, '0.698')] [2024-06-28 13:33:12,922][09403] Saving new best policy, reward=0.698! [2024-06-28 13:33:13,143][09423] Updated weights for policy 0, policy_version 235447 (0.0027) [2024-06-28 13:33:16,274][09423] Updated weights for policy 0, policy_version 235457 (0.0035) [2024-06-28 13:33:17,921][09190] Fps is (10 sec: 47513.6, 60 sec: 42871.5, 300 sec: 42376.6). Total num frames: 3857809408. Throughput: 0: 42392.6. Samples: 136650460. Policy #0 lag: (min: 0.0, avg: 12.1, max: 24.0) [2024-06-28 13:33:17,922][09190] Avg episode reward: [(0, '0.683')] [2024-06-28 13:33:20,659][09423] Updated weights for policy 0, policy_version 235467 (0.0041) [2024-06-28 13:33:22,921][09190] Fps is (10 sec: 42599.2, 60 sec: 41779.3, 300 sec: 42265.2). Total num frames: 3857973248. Throughput: 0: 42294.7. Samples: 136906100. Policy #0 lag: (min: 0.0, avg: 12.1, max: 24.0) [2024-06-28 13:33:22,922][09190] Avg episode reward: [(0, '0.700')] [2024-06-28 13:33:23,033][09403] Saving new best policy, reward=0.700! [2024-06-28 13:33:24,305][09423] Updated weights for policy 0, policy_version 235477 (0.0033) [2024-06-28 13:33:27,921][09190] Fps is (10 sec: 37683.0, 60 sec: 42325.3, 300 sec: 42209.6). Total num frames: 3858186240. Throughput: 0: 42135.5. Samples: 137022820. Policy #0 lag: (min: 0.0, avg: 12.1, max: 24.0) [2024-06-28 13:33:27,922][09190] Avg episode reward: [(0, '0.710')] [2024-06-28 13:33:28,085][09403] Saving new best policy, reward=0.710! [2024-06-28 13:33:28,290][09423] Updated weights for policy 0, policy_version 235487 (0.0037) [2024-06-28 13:33:32,282][09423] Updated weights for policy 0, policy_version 235497 (0.0037) [2024-06-28 13:33:32,923][09190] Fps is (10 sec: 45868.7, 60 sec: 42597.6, 300 sec: 42320.5). Total num frames: 3858432000. Throughput: 0: 42512.0. Samples: 137290920. Policy #0 lag: (min: 0.0, avg: 12.1, max: 24.0) [2024-06-28 13:33:32,923][09190] Avg episode reward: [(0, '0.699')] [2024-06-28 13:33:35,783][09423] Updated weights for policy 0, policy_version 235507 (0.0032) [2024-06-28 13:33:37,921][09190] Fps is (10 sec: 40960.4, 60 sec: 41779.2, 300 sec: 42209.7). Total num frames: 3858595840. Throughput: 0: 42384.5. Samples: 137543040. Policy #0 lag: (min: 0.0, avg: 12.1, max: 24.0) [2024-06-28 13:33:37,922][09190] Avg episode reward: [(0, '0.704')] [2024-06-28 13:33:39,966][09423] Updated weights for policy 0, policy_version 235517 (0.0031) [2024-06-28 13:33:42,921][09190] Fps is (10 sec: 40965.3, 60 sec: 42871.4, 300 sec: 42265.2). Total num frames: 3858841600. Throughput: 0: 42184.8. Samples: 137659520. Policy #0 lag: (min: 0.0, avg: 12.1, max: 24.0) [2024-06-28 13:33:42,922][09190] Avg episode reward: [(0, '0.705')] [2024-06-28 13:33:43,719][09423] Updated weights for policy 0, policy_version 235527 (0.0035) [2024-06-28 13:33:47,669][09423] Updated weights for policy 0, policy_version 235537 (0.0036) [2024-06-28 13:33:47,921][09190] Fps is (10 sec: 45874.6, 60 sec: 42325.3, 300 sec: 42265.2). Total num frames: 3859054592. Throughput: 0: 42439.0. Samples: 137923280. Policy #0 lag: (min: 0.0, avg: 12.1, max: 24.0) [2024-06-28 13:33:47,922][09190] Avg episode reward: [(0, '0.708')] [2024-06-28 13:33:51,461][09423] Updated weights for policy 0, policy_version 235547 (0.0042) [2024-06-28 13:33:52,921][09190] Fps is (10 sec: 39321.9, 60 sec: 42052.3, 300 sec: 42209.6). Total num frames: 3859234816. Throughput: 0: 42347.1. Samples: 138177480. Policy #0 lag: (min: 0.0, avg: 12.1, max: 24.0) [2024-06-28 13:33:52,922][09190] Avg episode reward: [(0, '0.717')] [2024-06-28 13:33:52,938][09403] Saving new best policy, reward=0.717! [2024-06-28 13:33:55,270][09423] Updated weights for policy 0, policy_version 235557 (0.0033) [2024-06-28 13:33:57,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42598.3, 300 sec: 42209.6). Total num frames: 3859464192. Throughput: 0: 42360.0. Samples: 138296000. Policy #0 lag: (min: 0.0, avg: 12.1, max: 24.0) [2024-06-28 13:33:57,922][09190] Avg episode reward: [(0, '0.712')] [2024-06-28 13:33:59,350][09423] Updated weights for policy 0, policy_version 235567 (0.0030) [2024-06-28 13:34:02,823][09423] Updated weights for policy 0, policy_version 235577 (0.0028) [2024-06-28 13:34:02,922][09190] Fps is (10 sec: 45874.4, 60 sec: 42052.2, 300 sec: 42320.7). Total num frames: 3859693568. Throughput: 0: 42350.5. Samples: 138556240. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 13:34:02,922][09190] Avg episode reward: [(0, '0.714')] [2024-06-28 13:34:06,924][09423] Updated weights for policy 0, policy_version 235587 (0.0032) [2024-06-28 13:34:07,922][09190] Fps is (10 sec: 40959.9, 60 sec: 42325.2, 300 sec: 42154.1). Total num frames: 3859873792. Throughput: 0: 42285.2. Samples: 138808940. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 13:34:07,922][09190] Avg episode reward: [(0, '0.722')] [2024-06-28 13:34:07,934][09403] Saving new best policy, reward=0.722! [2024-06-28 13:34:10,827][09423] Updated weights for policy 0, policy_version 235597 (0.0028) [2024-06-28 13:34:12,921][09190] Fps is (10 sec: 42599.1, 60 sec: 42871.6, 300 sec: 42265.2). Total num frames: 3860119552. Throughput: 0: 42452.9. Samples: 138933200. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 13:34:12,922][09190] Avg episode reward: [(0, '0.715')] [2024-06-28 13:34:14,951][09423] Updated weights for policy 0, policy_version 235607 (0.0034) [2024-06-28 13:34:17,921][09190] Fps is (10 sec: 44237.1, 60 sec: 41779.2, 300 sec: 42265.2). Total num frames: 3860316160. Throughput: 0: 42220.8. Samples: 139190800. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 13:34:17,922][09190] Avg episode reward: [(0, '0.718')] [2024-06-28 13:34:17,946][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000235615_3860316160.pth... [2024-06-28 13:34:17,997][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000234995_3850158080.pth [2024-06-28 13:34:18,709][09423] Updated weights for policy 0, policy_version 235617 (0.0042) [2024-06-28 13:34:22,466][09423] Updated weights for policy 0, policy_version 235627 (0.0035) [2024-06-28 13:34:22,926][09190] Fps is (10 sec: 39304.4, 60 sec: 42322.2, 300 sec: 42209.0). Total num frames: 3860512768. Throughput: 0: 42052.8. Samples: 139435600. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 13:34:22,926][09190] Avg episode reward: [(0, '0.725')] [2024-06-28 13:34:22,927][09403] Saving new best policy, reward=0.725! [2024-06-28 13:34:26,551][09423] Updated weights for policy 0, policy_version 235637 (0.0045) [2024-06-28 13:34:27,921][09190] Fps is (10 sec: 44236.8, 60 sec: 42871.5, 300 sec: 42320.7). Total num frames: 3860758528. Throughput: 0: 42346.7. Samples: 139565120. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 13:34:27,922][09190] Avg episode reward: [(0, '0.732')] [2024-06-28 13:34:27,928][09403] Saving new best policy, reward=0.732! [2024-06-28 13:34:30,266][09423] Updated weights for policy 0, policy_version 235647 (0.0024) [2024-06-28 13:34:32,921][09190] Fps is (10 sec: 40978.1, 60 sec: 41507.1, 300 sec: 42265.2). Total num frames: 3860922368. Throughput: 0: 42253.9. Samples: 139824700. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 13:34:32,922][09190] Avg episode reward: [(0, '0.725')] [2024-06-28 13:34:34,078][09423] Updated weights for policy 0, policy_version 235657 (0.0033) [2024-06-28 13:34:37,921][09190] Fps is (10 sec: 37683.1, 60 sec: 42325.2, 300 sec: 42209.6). Total num frames: 3861135360. Throughput: 0: 42130.6. Samples: 140073360. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 13:34:37,922][09190] Avg episode reward: [(0, '0.719')] [2024-06-28 13:34:38,487][09423] Updated weights for policy 0, policy_version 235667 (0.0028) [2024-06-28 13:34:41,722][09423] Updated weights for policy 0, policy_version 235677 (0.0032) [2024-06-28 13:34:42,921][09190] Fps is (10 sec: 47513.9, 60 sec: 42598.5, 300 sec: 42376.3). Total num frames: 3861397504. Throughput: 0: 42326.4. Samples: 140200680. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 13:34:42,922][09190] Avg episode reward: [(0, '0.716')] [2024-06-28 13:34:45,111][09403] Signal inference workers to stop experience collection... (1950 times) [2024-06-28 13:34:45,111][09403] Signal inference workers to resume experience collection... (1950 times) [2024-06-28 13:34:45,151][09423] InferenceWorker_p0-w0: stopping experience collection (1950 times) [2024-06-28 13:34:45,151][09423] InferenceWorker_p0-w0: resuming experience collection (1950 times) [2024-06-28 13:34:46,049][09423] Updated weights for policy 0, policy_version 235687 (0.0040) [2024-06-28 13:34:47,921][09190] Fps is (10 sec: 42598.6, 60 sec: 41779.2, 300 sec: 42265.2). Total num frames: 3861561344. Throughput: 0: 42114.3. Samples: 140451380. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 13:34:47,922][09190] Avg episode reward: [(0, '0.731')] [2024-06-28 13:34:49,523][09423] Updated weights for policy 0, policy_version 235697 (0.0041) [2024-06-28 13:34:52,921][09190] Fps is (10 sec: 37683.0, 60 sec: 42325.4, 300 sec: 42265.2). Total num frames: 3861774336. Throughput: 0: 42294.4. Samples: 140712180. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 13:34:52,922][09190] Avg episode reward: [(0, '0.732')] [2024-06-28 13:34:53,786][09423] Updated weights for policy 0, policy_version 235707 (0.0038) [2024-06-28 13:34:57,359][09423] Updated weights for policy 0, policy_version 235717 (0.0036) [2024-06-28 13:34:57,921][09190] Fps is (10 sec: 47513.5, 60 sec: 42871.5, 300 sec: 42431.8). Total num frames: 3862036480. Throughput: 0: 42305.7. Samples: 140836960. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 13:34:57,922][09190] Avg episode reward: [(0, '0.728')] [2024-06-28 13:35:01,230][09423] Updated weights for policy 0, policy_version 235727 (0.0040) [2024-06-28 13:35:02,921][09190] Fps is (10 sec: 42598.2, 60 sec: 41779.3, 300 sec: 42320.8). Total num frames: 3862200320. Throughput: 0: 42241.0. Samples: 141091640. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 13:35:02,922][09190] Avg episode reward: [(0, '0.733')] [2024-06-28 13:35:04,912][09423] Updated weights for policy 0, policy_version 235737 (0.0030) [2024-06-28 13:35:07,926][09190] Fps is (10 sec: 37667.3, 60 sec: 42322.4, 300 sec: 42264.6). Total num frames: 3862413312. Throughput: 0: 42437.9. Samples: 141345300. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 13:35:07,926][09190] Avg episode reward: [(0, '0.728')] [2024-06-28 13:35:08,882][09423] Updated weights for policy 0, policy_version 235747 (0.0031) [2024-06-28 13:35:12,530][09423] Updated weights for policy 0, policy_version 235757 (0.0052) [2024-06-28 13:35:12,921][09190] Fps is (10 sec: 45875.1, 60 sec: 42325.3, 300 sec: 42376.3). Total num frames: 3862659072. Throughput: 0: 42485.8. Samples: 141476980. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 13:35:12,922][09190] Avg episode reward: [(0, '0.732')] [2024-06-28 13:35:16,467][09423] Updated weights for policy 0, policy_version 235767 (0.0034) [2024-06-28 13:35:17,921][09190] Fps is (10 sec: 40977.6, 60 sec: 41779.3, 300 sec: 42209.6). Total num frames: 3862822912. Throughput: 0: 42265.3. Samples: 141726640. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 13:35:17,922][09190] Avg episode reward: [(0, '0.726')] [2024-06-28 13:35:20,601][09423] Updated weights for policy 0, policy_version 235777 (0.0048) [2024-06-28 13:35:22,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42601.5, 300 sec: 42265.2). Total num frames: 3863068672. Throughput: 0: 42323.6. Samples: 141977920. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 13:35:22,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 13:35:22,922][09403] Saving new best policy, reward=0.739! [2024-06-28 13:35:24,654][09423] Updated weights for policy 0, policy_version 235787 (0.0047) [2024-06-28 13:35:27,921][09190] Fps is (10 sec: 45874.6, 60 sec: 42052.2, 300 sec: 42320.7). Total num frames: 3863281664. Throughput: 0: 42438.4. Samples: 142110420. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 13:35:27,928][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 13:35:28,138][09423] Updated weights for policy 0, policy_version 235797 (0.0035) [2024-06-28 13:35:32,256][09423] Updated weights for policy 0, policy_version 235807 (0.0049) [2024-06-28 13:35:32,921][09190] Fps is (10 sec: 39321.7, 60 sec: 42325.3, 300 sec: 42265.2). Total num frames: 3863461888. Throughput: 0: 42517.4. Samples: 142364660. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 13:35:32,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 13:35:32,941][09403] Saving new best policy, reward=0.740! [2024-06-28 13:35:35,652][09423] Updated weights for policy 0, policy_version 235817 (0.0022) [2024-06-28 13:35:37,921][09190] Fps is (10 sec: 42598.6, 60 sec: 42871.5, 300 sec: 42320.7). Total num frames: 3863707648. Throughput: 0: 42274.5. Samples: 142614540. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 13:35:37,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 13:35:37,931][09403] Saving new best policy, reward=0.747! [2024-06-28 13:35:40,210][09423] Updated weights for policy 0, policy_version 235827 (0.0027) [2024-06-28 13:35:42,921][09190] Fps is (10 sec: 45875.1, 60 sec: 42052.2, 300 sec: 42376.2). Total num frames: 3863920640. Throughput: 0: 42534.7. Samples: 142751020. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 13:35:42,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 13:35:43,473][09423] Updated weights for policy 0, policy_version 235837 (0.0031) [2024-06-28 13:35:47,607][09423] Updated weights for policy 0, policy_version 235847 (0.0052) [2024-06-28 13:35:47,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42871.4, 300 sec: 42376.2). Total num frames: 3864133632. Throughput: 0: 42618.6. Samples: 143009480. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 13:35:47,922][09190] Avg episode reward: [(0, '0.726')] [2024-06-28 13:35:51,047][09423] Updated weights for policy 0, policy_version 235857 (0.0040) [2024-06-28 13:35:52,922][09190] Fps is (10 sec: 42597.8, 60 sec: 42871.3, 300 sec: 42376.2). Total num frames: 3864346624. Throughput: 0: 42427.0. Samples: 143254340. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 13:35:52,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 13:35:52,923][09403] Saving new best policy, reward=0.752! [2024-06-28 13:35:55,326][09423] Updated weights for policy 0, policy_version 235867 (0.0045) [2024-06-28 13:35:57,924][09190] Fps is (10 sec: 40949.5, 60 sec: 41777.4, 300 sec: 42320.3). Total num frames: 3864543232. Throughput: 0: 42468.6. Samples: 143388180. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 13:35:57,925][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 13:35:58,274][09403] Signal inference workers to stop experience collection... (2000 times) [2024-06-28 13:35:58,275][09403] Signal inference workers to resume experience collection... (2000 times) [2024-06-28 13:35:58,313][09423] InferenceWorker_p0-w0: stopping experience collection (2000 times) [2024-06-28 13:35:58,313][09423] InferenceWorker_p0-w0: resuming experience collection (2000 times) [2024-06-28 13:35:58,578][09423] Updated weights for policy 0, policy_version 235877 (0.0027) [2024-06-28 13:36:02,887][09423] Updated weights for policy 0, policy_version 235887 (0.0029) [2024-06-28 13:36:02,924][09190] Fps is (10 sec: 42588.3, 60 sec: 42869.7, 300 sec: 42375.9). Total num frames: 3864772608. Throughput: 0: 42526.5. Samples: 143640440. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 13:36:02,924][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 13:36:06,517][09423] Updated weights for policy 0, policy_version 235897 (0.0033) [2024-06-28 13:36:07,921][09190] Fps is (10 sec: 44248.6, 60 sec: 42874.5, 300 sec: 42376.2). Total num frames: 3864985600. Throughput: 0: 42468.5. Samples: 143889000. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 13:36:07,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 13:36:10,845][09423] Updated weights for policy 0, policy_version 235907 (0.0033) [2024-06-28 13:36:12,921][09190] Fps is (10 sec: 40970.4, 60 sec: 42052.3, 300 sec: 42376.3). Total num frames: 3865182208. Throughput: 0: 42470.3. Samples: 144021580. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 13:36:12,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 13:36:14,149][09423] Updated weights for policy 0, policy_version 235917 (0.0031) [2024-06-28 13:36:17,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42871.5, 300 sec: 42320.7). Total num frames: 3865395200. Throughput: 0: 42532.5. Samples: 144278620. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 13:36:17,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 13:36:17,937][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000235925_3865395200.pth... [2024-06-28 13:36:18,013][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000235305_3855237120.pth [2024-06-28 13:36:18,426][09423] Updated weights for policy 0, policy_version 235927 (0.0039) [2024-06-28 13:36:21,851][09423] Updated weights for policy 0, policy_version 235937 (0.0038) [2024-06-28 13:36:22,921][09190] Fps is (10 sec: 44237.1, 60 sec: 42598.5, 300 sec: 42431.8). Total num frames: 3865624576. Throughput: 0: 42386.8. Samples: 144521940. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 13:36:22,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 13:36:26,385][09423] Updated weights for policy 0, policy_version 235947 (0.0037) [2024-06-28 13:36:27,922][09190] Fps is (10 sec: 40958.7, 60 sec: 42052.2, 300 sec: 42320.7). Total num frames: 3865804800. Throughput: 0: 42392.6. Samples: 144658700. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 13:36:27,928][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 13:36:29,756][09423] Updated weights for policy 0, policy_version 235957 (0.0028) [2024-06-28 13:36:32,921][09190] Fps is (10 sec: 39321.4, 60 sec: 42598.4, 300 sec: 42265.2). Total num frames: 3866017792. Throughput: 0: 42104.1. Samples: 144904160. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 13:36:32,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 13:36:34,133][09423] Updated weights for policy 0, policy_version 235967 (0.0053) [2024-06-28 13:36:37,552][09423] Updated weights for policy 0, policy_version 235977 (0.0029) [2024-06-28 13:36:37,921][09190] Fps is (10 sec: 45876.5, 60 sec: 42598.5, 300 sec: 42376.3). Total num frames: 3866263552. Throughput: 0: 42126.8. Samples: 145150040. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 13:36:37,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 13:36:42,091][09423] Updated weights for policy 0, policy_version 235987 (0.0033) [2024-06-28 13:36:42,921][09190] Fps is (10 sec: 40960.2, 60 sec: 41779.2, 300 sec: 42265.5). Total num frames: 3866427392. Throughput: 0: 42128.3. Samples: 145283840. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 13:36:42,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 13:36:45,384][09423] Updated weights for policy 0, policy_version 235997 (0.0035) [2024-06-28 13:36:47,921][09190] Fps is (10 sec: 39321.6, 60 sec: 42052.3, 300 sec: 42210.0). Total num frames: 3866656768. Throughput: 0: 41954.8. Samples: 145528300. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 13:36:47,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 13:36:49,769][09423] Updated weights for policy 0, policy_version 236007 (0.0032) [2024-06-28 13:36:52,921][09190] Fps is (10 sec: 44236.4, 60 sec: 42052.4, 300 sec: 42320.7). Total num frames: 3866869760. Throughput: 0: 42190.2. Samples: 145787560. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 13:36:52,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 13:36:53,123][09423] Updated weights for policy 0, policy_version 236017 (0.0028) [2024-06-28 13:36:57,313][09423] Updated weights for policy 0, policy_version 236027 (0.0030) [2024-06-28 13:36:57,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42054.1, 300 sec: 42265.2). Total num frames: 3867066368. Throughput: 0: 42158.2. Samples: 145918700. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 13:36:57,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 13:37:00,732][09423] Updated weights for policy 0, policy_version 236037 (0.0037) [2024-06-28 13:37:02,924][09190] Fps is (10 sec: 42587.7, 60 sec: 42052.3, 300 sec: 42209.6). Total num frames: 3867295744. Throughput: 0: 41853.6. Samples: 146162140. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 13:37:02,924][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 13:37:05,204][09423] Updated weights for policy 0, policy_version 236047 (0.0034) [2024-06-28 13:37:07,921][09190] Fps is (10 sec: 44236.6, 60 sec: 42052.2, 300 sec: 42320.7). Total num frames: 3867508736. Throughput: 0: 42305.7. Samples: 146425700. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 13:37:07,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 13:37:08,412][09423] Updated weights for policy 0, policy_version 236057 (0.0034) [2024-06-28 13:37:12,921][09190] Fps is (10 sec: 40970.6, 60 sec: 42052.3, 300 sec: 42265.2). Total num frames: 3867705344. Throughput: 0: 42049.6. Samples: 146550920. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 13:37:12,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 13:37:13,134][09423] Updated weights for policy 0, policy_version 236067 (0.0031) [2024-06-28 13:37:16,162][09423] Updated weights for policy 0, policy_version 236077 (0.0037) [2024-06-28 13:37:17,921][09190] Fps is (10 sec: 40960.3, 60 sec: 42052.3, 300 sec: 42209.6). Total num frames: 3867918336. Throughput: 0: 42027.1. Samples: 146795380. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 13:37:17,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 13:37:20,620][09423] Updated weights for policy 0, policy_version 236087 (0.0033) [2024-06-28 13:37:22,921][09190] Fps is (10 sec: 42598.1, 60 sec: 41779.1, 300 sec: 42320.7). Total num frames: 3868131328. Throughput: 0: 42357.8. Samples: 147056140. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 13:37:22,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 13:37:23,964][09423] Updated weights for policy 0, policy_version 236097 (0.0034) [2024-06-28 13:37:27,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42325.5, 300 sec: 42265.2). Total num frames: 3868344320. Throughput: 0: 42244.8. Samples: 147184860. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 13:37:27,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 13:37:28,247][09423] Updated weights for policy 0, policy_version 236107 (0.0032) [2024-06-28 13:37:31,586][09423] Updated weights for policy 0, policy_version 236117 (0.0037) [2024-06-28 13:37:32,921][09190] Fps is (10 sec: 44236.8, 60 sec: 42598.4, 300 sec: 42320.7). Total num frames: 3868573696. Throughput: 0: 42292.9. Samples: 147431480. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 13:37:32,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 13:37:35,993][09403] Signal inference workers to stop experience collection... (2050 times) [2024-06-28 13:37:35,993][09403] Signal inference workers to resume experience collection... (2050 times) [2024-06-28 13:37:36,021][09423] InferenceWorker_p0-w0: stopping experience collection (2050 times) [2024-06-28 13:37:36,021][09423] InferenceWorker_p0-w0: resuming experience collection (2050 times) [2024-06-28 13:37:36,137][09423] Updated weights for policy 0, policy_version 236127 (0.0050) [2024-06-28 13:37:37,921][09190] Fps is (10 sec: 44236.4, 60 sec: 42052.2, 300 sec: 42431.8). Total num frames: 3868786688. Throughput: 0: 42335.1. Samples: 147692640. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 13:37:37,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 13:37:37,935][09403] Saving new best policy, reward=0.754! [2024-06-28 13:37:39,461][09423] Updated weights for policy 0, policy_version 236137 (0.0039) [2024-06-28 13:37:42,921][09190] Fps is (10 sec: 39321.3, 60 sec: 42325.2, 300 sec: 42209.6). Total num frames: 3868966912. Throughput: 0: 42195.5. Samples: 147817500. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 13:37:42,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 13:37:43,934][09423] Updated weights for policy 0, policy_version 236147 (0.0035) [2024-06-28 13:37:47,018][09423] Updated weights for policy 0, policy_version 236157 (0.0054) [2024-06-28 13:37:47,924][09190] Fps is (10 sec: 44225.8, 60 sec: 42869.6, 300 sec: 42431.4). Total num frames: 3869229056. Throughput: 0: 42438.2. Samples: 148071860. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 13:37:47,925][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 13:37:51,797][09423] Updated weights for policy 0, policy_version 236167 (0.0033) [2024-06-28 13:37:52,921][09190] Fps is (10 sec: 44237.3, 60 sec: 42325.4, 300 sec: 42376.2). Total num frames: 3869409280. Throughput: 0: 42411.2. Samples: 148334200. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 13:37:52,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 13:37:54,767][09423] Updated weights for policy 0, policy_version 236177 (0.0024) [2024-06-28 13:37:57,921][09190] Fps is (10 sec: 37692.9, 60 sec: 42325.3, 300 sec: 42154.1). Total num frames: 3869605888. Throughput: 0: 42120.4. Samples: 148446340. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 13:37:57,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 13:37:59,378][09423] Updated weights for policy 0, policy_version 236187 (0.0029) [2024-06-28 13:38:02,547][09423] Updated weights for policy 0, policy_version 236197 (0.0046) [2024-06-28 13:38:02,921][09190] Fps is (10 sec: 44236.7, 60 sec: 42600.2, 300 sec: 42431.8). Total num frames: 3869851648. Throughput: 0: 42416.9. Samples: 148704140. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 13:38:02,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 13:38:07,089][09423] Updated weights for policy 0, policy_version 236207 (0.0039) [2024-06-28 13:38:07,921][09190] Fps is (10 sec: 42598.1, 60 sec: 42052.2, 300 sec: 42320.7). Total num frames: 3870031872. Throughput: 0: 42224.4. Samples: 148956240. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 13:38:07,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 13:38:10,366][09423] Updated weights for policy 0, policy_version 236217 (0.0040) [2024-06-28 13:38:12,921][09190] Fps is (10 sec: 37683.1, 60 sec: 42052.2, 300 sec: 42098.5). Total num frames: 3870228480. Throughput: 0: 42104.9. Samples: 149079580. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 13:38:12,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 13:38:14,880][09423] Updated weights for policy 0, policy_version 236227 (0.0039) [2024-06-28 13:38:17,921][09190] Fps is (10 sec: 45875.6, 60 sec: 42871.5, 300 sec: 42431.8). Total num frames: 3870490624. Throughput: 0: 42518.3. Samples: 149344800. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 13:38:17,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 13:38:17,937][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000236236_3870490624.pth... [2024-06-28 13:38:17,993][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000235615_3860316160.pth [2024-06-28 13:38:18,215][09423] Updated weights for policy 0, policy_version 236237 (0.0033) [2024-06-28 13:38:22,495][09423] Updated weights for policy 0, policy_version 236247 (0.0035) [2024-06-28 13:38:22,921][09190] Fps is (10 sec: 44236.7, 60 sec: 42325.3, 300 sec: 42320.7). Total num frames: 3870670848. Throughput: 0: 42348.0. Samples: 149598300. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 13:38:22,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 13:38:25,942][09423] Updated weights for policy 0, policy_version 236257 (0.0034) [2024-06-28 13:38:27,921][09190] Fps is (10 sec: 39321.6, 60 sec: 42325.3, 300 sec: 42209.8). Total num frames: 3870883840. Throughput: 0: 42302.3. Samples: 149721100. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 13:38:27,928][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 13:38:30,409][09423] Updated weights for policy 0, policy_version 236267 (0.0044) [2024-06-28 13:38:32,921][09190] Fps is (10 sec: 45875.4, 60 sec: 42598.4, 300 sec: 42487.3). Total num frames: 3871129600. Throughput: 0: 42393.5. Samples: 149979460. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 13:38:32,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 13:38:33,486][09423] Updated weights for policy 0, policy_version 236277 (0.0032) [2024-06-28 13:38:37,921][09190] Fps is (10 sec: 42597.9, 60 sec: 42052.2, 300 sec: 42265.2). Total num frames: 3871309824. Throughput: 0: 42351.0. Samples: 150240000. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 13:38:37,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 13:38:38,202][09423] Updated weights for policy 0, policy_version 236287 (0.0035) [2024-06-28 13:38:41,098][09423] Updated weights for policy 0, policy_version 236297 (0.0037) [2024-06-28 13:38:42,925][09190] Fps is (10 sec: 39306.8, 60 sec: 42595.8, 300 sec: 42264.6). Total num frames: 3871522816. Throughput: 0: 42525.8. Samples: 150360160. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 13:38:42,926][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 13:38:42,926][09403] Saving new best policy, reward=0.757! [2024-06-28 13:38:45,833][09423] Updated weights for policy 0, policy_version 236307 (0.0038) [2024-06-28 13:38:47,921][09190] Fps is (10 sec: 45875.5, 60 sec: 42327.1, 300 sec: 42487.3). Total num frames: 3871768576. Throughput: 0: 42528.8. Samples: 150617940. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 13:38:47,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 13:38:49,010][09423] Updated weights for policy 0, policy_version 236317 (0.0032) [2024-06-28 13:38:52,921][09190] Fps is (10 sec: 40975.2, 60 sec: 42052.2, 300 sec: 42265.2). Total num frames: 3871932416. Throughput: 0: 42597.3. Samples: 150873120. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 13:38:52,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 13:38:53,761][09423] Updated weights for policy 0, policy_version 236327 (0.0028) [2024-06-28 13:38:56,905][09423] Updated weights for policy 0, policy_version 236337 (0.0038) [2024-06-28 13:38:57,921][09190] Fps is (10 sec: 39321.4, 60 sec: 42598.3, 300 sec: 42265.2). Total num frames: 3872161792. Throughput: 0: 42542.6. Samples: 150994000. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 13:38:57,922][09190] Avg episode reward: [(0, '0.759')] [2024-06-28 13:38:57,936][09403] Saving new best policy, reward=0.759! [2024-06-28 13:39:01,545][09423] Updated weights for policy 0, policy_version 236347 (0.0034) [2024-06-28 13:39:02,921][09190] Fps is (10 sec: 45875.2, 60 sec: 42325.3, 300 sec: 42431.8). Total num frames: 3872391168. Throughput: 0: 42437.3. Samples: 151254480. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 13:39:02,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 13:39:04,347][09423] Updated weights for policy 0, policy_version 236357 (0.0020) [2024-06-28 13:39:07,921][09190] Fps is (10 sec: 40960.6, 60 sec: 42325.4, 300 sec: 42209.6). Total num frames: 3872571392. Throughput: 0: 42468.1. Samples: 151509360. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 13:39:07,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 13:39:09,168][09423] Updated weights for policy 0, policy_version 236367 (0.0023) [2024-06-28 13:39:12,694][09423] Updated weights for policy 0, policy_version 236377 (0.0032) [2024-06-28 13:39:12,921][09190] Fps is (10 sec: 40960.3, 60 sec: 42871.5, 300 sec: 42320.7). Total num frames: 3872800768. Throughput: 0: 42375.6. Samples: 151628000. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 13:39:12,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 13:39:16,983][09423] Updated weights for policy 0, policy_version 236387 (0.0047) [2024-06-28 13:39:17,698][09403] Signal inference workers to stop experience collection... (2100 times) [2024-06-28 13:39:17,753][09423] InferenceWorker_p0-w0: stopping experience collection (2100 times) [2024-06-28 13:39:17,761][09403] Signal inference workers to resume experience collection... (2100 times) [2024-06-28 13:39:17,772][09423] InferenceWorker_p0-w0: resuming experience collection (2100 times) [2024-06-28 13:39:17,922][09190] Fps is (10 sec: 44235.8, 60 sec: 42052.1, 300 sec: 42376.8). Total num frames: 3873013760. Throughput: 0: 42423.4. Samples: 151888520. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 13:39:17,922][09190] Avg episode reward: [(0, '0.729')] [2024-06-28 13:39:20,193][09423] Updated weights for policy 0, policy_version 236397 (0.0042) [2024-06-28 13:39:22,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42325.4, 300 sec: 42209.6). Total num frames: 3873210368. Throughput: 0: 42280.5. Samples: 152142620. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 13:39:22,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 13:39:24,888][09423] Updated weights for policy 0, policy_version 236407 (0.0038) [2024-06-28 13:39:27,763][09423] Updated weights for policy 0, policy_version 236417 (0.0031) [2024-06-28 13:39:27,921][09190] Fps is (10 sec: 44237.4, 60 sec: 42871.4, 300 sec: 42487.3). Total num frames: 3873456128. Throughput: 0: 42348.4. Samples: 152265680. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 13:39:27,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 13:39:32,388][09423] Updated weights for policy 0, policy_version 236427 (0.0027) [2024-06-28 13:39:32,921][09190] Fps is (10 sec: 42598.2, 60 sec: 41779.2, 300 sec: 42376.2). Total num frames: 3873636352. Throughput: 0: 42356.0. Samples: 152523960. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 13:39:32,924][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 13:39:35,301][09423] Updated weights for policy 0, policy_version 236437 (0.0040) [2024-06-28 13:39:37,921][09190] Fps is (10 sec: 39321.9, 60 sec: 42325.4, 300 sec: 42209.6). Total num frames: 3873849344. Throughput: 0: 42207.2. Samples: 152772440. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 13:39:37,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 13:39:40,355][09423] Updated weights for policy 0, policy_version 236447 (0.0038) [2024-06-28 13:39:42,921][09190] Fps is (10 sec: 44236.8, 60 sec: 42601.0, 300 sec: 42431.8). Total num frames: 3874078720. Throughput: 0: 42338.7. Samples: 152899240. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 13:39:42,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 13:39:43,313][09423] Updated weights for policy 0, policy_version 236457 (0.0035) [2024-06-28 13:39:47,921][09190] Fps is (10 sec: 39321.5, 60 sec: 41233.1, 300 sec: 42265.2). Total num frames: 3874242560. Throughput: 0: 42176.5. Samples: 153152420. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 13:39:47,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 13:39:48,188][09423] Updated weights for policy 0, policy_version 236467 (0.0035) [2024-06-28 13:39:51,050][09423] Updated weights for policy 0, policy_version 236477 (0.0036) [2024-06-28 13:39:52,921][09190] Fps is (10 sec: 37683.1, 60 sec: 42052.2, 300 sec: 42098.5). Total num frames: 3874455552. Throughput: 0: 42212.3. Samples: 153408920. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 13:39:52,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 13:39:55,756][09423] Updated weights for policy 0, policy_version 236487 (0.0039) [2024-06-28 13:39:57,921][09190] Fps is (10 sec: 49151.4, 60 sec: 42871.5, 300 sec: 42487.3). Total num frames: 3874734080. Throughput: 0: 42350.6. Samples: 153533780. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 13:39:57,924][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 13:39:58,961][09423] Updated weights for policy 0, policy_version 236497 (0.0028) [2024-06-28 13:40:02,924][09190] Fps is (10 sec: 44226.1, 60 sec: 41777.5, 300 sec: 42321.0). Total num frames: 3874897920. Throughput: 0: 42163.1. Samples: 153785960. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 13:40:02,924][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 13:40:03,581][09423] Updated weights for policy 0, policy_version 236507 (0.0033) [2024-06-28 13:40:06,498][09423] Updated weights for policy 0, policy_version 236517 (0.0037) [2024-06-28 13:40:07,921][09190] Fps is (10 sec: 37683.6, 60 sec: 42325.3, 300 sec: 42209.6). Total num frames: 3875110912. Throughput: 0: 42117.8. Samples: 154037920. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 13:40:07,922][09190] Avg episode reward: [(0, '0.762')] [2024-06-28 13:40:07,941][09403] Saving new best policy, reward=0.762! [2024-06-28 13:40:11,368][09423] Updated weights for policy 0, policy_version 236527 (0.0047) [2024-06-28 13:40:12,921][09190] Fps is (10 sec: 45887.1, 60 sec: 42598.5, 300 sec: 42487.3). Total num frames: 3875356672. Throughput: 0: 42285.0. Samples: 154168500. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 13:40:12,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 13:40:14,275][09423] Updated weights for policy 0, policy_version 236537 (0.0037) [2024-06-28 13:40:17,921][09190] Fps is (10 sec: 42597.9, 60 sec: 42052.3, 300 sec: 42265.2). Total num frames: 3875536896. Throughput: 0: 42062.2. Samples: 154416760. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 13:40:17,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 13:40:17,940][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000236544_3875536896.pth... [2024-06-28 13:40:17,981][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000235925_3865395200.pth [2024-06-28 13:40:19,127][09423] Updated weights for policy 0, policy_version 236547 (0.0042) [2024-06-28 13:40:22,373][09423] Updated weights for policy 0, policy_version 236557 (0.0044) [2024-06-28 13:40:22,921][09190] Fps is (10 sec: 39321.3, 60 sec: 42325.3, 300 sec: 42265.2). Total num frames: 3875749888. Throughput: 0: 42070.6. Samples: 154665620. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 13:40:22,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 13:40:26,917][09423] Updated weights for policy 0, policy_version 236567 (0.0039) [2024-06-28 13:40:27,921][09190] Fps is (10 sec: 44237.5, 60 sec: 42052.3, 300 sec: 42431.8). Total num frames: 3875979264. Throughput: 0: 42051.2. Samples: 154791540. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 13:40:27,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 13:40:30,170][09423] Updated weights for policy 0, policy_version 236577 (0.0032) [2024-06-28 13:40:32,367][09403] Signal inference workers to stop experience collection... (2150 times) [2024-06-28 13:40:32,367][09403] Signal inference workers to resume experience collection... (2150 times) [2024-06-28 13:40:32,412][09423] InferenceWorker_p0-w0: stopping experience collection (2150 times) [2024-06-28 13:40:32,412][09423] InferenceWorker_p0-w0: resuming experience collection (2150 times) [2024-06-28 13:40:32,921][09190] Fps is (10 sec: 40960.1, 60 sec: 42052.3, 300 sec: 42209.6). Total num frames: 3876159488. Throughput: 0: 42172.0. Samples: 155050160. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 13:40:32,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 13:40:34,478][09423] Updated weights for policy 0, policy_version 236587 (0.0031) [2024-06-28 13:40:37,703][09423] Updated weights for policy 0, policy_version 236597 (0.0024) [2024-06-28 13:40:37,922][09190] Fps is (10 sec: 42597.3, 60 sec: 42598.2, 300 sec: 42320.7). Total num frames: 3876405248. Throughput: 0: 41933.7. Samples: 155295940. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 13:40:37,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 13:40:42,039][09423] Updated weights for policy 0, policy_version 236607 (0.0032) [2024-06-28 13:40:42,921][09190] Fps is (10 sec: 44236.5, 60 sec: 42052.3, 300 sec: 42265.2). Total num frames: 3876601856. Throughput: 0: 42145.4. Samples: 155430320. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 13:40:42,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 13:40:45,837][09423] Updated weights for policy 0, policy_version 236617 (0.0037) [2024-06-28 13:40:47,921][09190] Fps is (10 sec: 39322.1, 60 sec: 42598.3, 300 sec: 42209.6). Total num frames: 3876798464. Throughput: 0: 42216.5. Samples: 155685600. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 13:40:47,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 13:40:49,643][09423] Updated weights for policy 0, policy_version 236627 (0.0041) [2024-06-28 13:40:52,921][09190] Fps is (10 sec: 44236.5, 60 sec: 43144.5, 300 sec: 42376.6). Total num frames: 3877044224. Throughput: 0: 42073.2. Samples: 155931220. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 13:40:52,922][09190] Avg episode reward: [(0, '0.760')] [2024-06-28 13:40:53,333][09423] Updated weights for policy 0, policy_version 236637 (0.0038) [2024-06-28 13:40:57,508][09423] Updated weights for policy 0, policy_version 236647 (0.0035) [2024-06-28 13:40:57,921][09190] Fps is (10 sec: 44237.0, 60 sec: 41779.3, 300 sec: 42265.5). Total num frames: 3877240832. Throughput: 0: 42290.1. Samples: 156071560. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-28 13:40:57,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 13:41:00,826][09423] Updated weights for policy 0, policy_version 236657 (0.0031) [2024-06-28 13:41:02,924][09190] Fps is (10 sec: 39312.0, 60 sec: 42325.3, 300 sec: 42209.3). Total num frames: 3877437440. Throughput: 0: 42236.8. Samples: 156317520. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-28 13:41:02,924][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 13:41:05,441][09423] Updated weights for policy 0, policy_version 236667 (0.0033) [2024-06-28 13:41:07,921][09190] Fps is (10 sec: 44236.5, 60 sec: 42871.4, 300 sec: 42376.2). Total num frames: 3877683200. Throughput: 0: 42112.8. Samples: 156560700. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-28 13:41:07,922][09190] Avg episode reward: [(0, '0.756')] [2024-06-28 13:41:09,012][09423] Updated weights for policy 0, policy_version 236677 (0.0030) [2024-06-28 13:41:12,924][09190] Fps is (10 sec: 42598.3, 60 sec: 41777.4, 300 sec: 42264.8). Total num frames: 3877863424. Throughput: 0: 42450.9. Samples: 156701940. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-28 13:41:12,924][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 13:41:13,033][09423] Updated weights for policy 0, policy_version 236687 (0.0021) [2024-06-28 13:41:16,519][09423] Updated weights for policy 0, policy_version 236697 (0.0038) [2024-06-28 13:41:17,921][09190] Fps is (10 sec: 37683.2, 60 sec: 42052.3, 300 sec: 42154.1). Total num frames: 3878060032. Throughput: 0: 42020.4. Samples: 156941080. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-28 13:41:17,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 13:41:20,887][09423] Updated weights for policy 0, policy_version 236707 (0.0035) [2024-06-28 13:41:22,921][09190] Fps is (10 sec: 42608.8, 60 sec: 42325.3, 300 sec: 42320.7). Total num frames: 3878289408. Throughput: 0: 42325.0. Samples: 157200560. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-28 13:41:22,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 13:41:24,018][09423] Updated weights for policy 0, policy_version 236717 (0.0043) [2024-06-28 13:41:27,921][09190] Fps is (10 sec: 42598.8, 60 sec: 41779.2, 300 sec: 42265.2). Total num frames: 3878486016. Throughput: 0: 42314.7. Samples: 157334480. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-28 13:41:27,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 13:41:28,511][09423] Updated weights for policy 0, policy_version 236727 (0.0034) [2024-06-28 13:41:32,215][09423] Updated weights for policy 0, policy_version 236737 (0.0028) [2024-06-28 13:41:32,921][09190] Fps is (10 sec: 40960.1, 60 sec: 42325.3, 300 sec: 42154.1). Total num frames: 3878699008. Throughput: 0: 42000.9. Samples: 157575640. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-28 13:41:32,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 13:41:36,098][09423] Updated weights for policy 0, policy_version 236747 (0.0039) [2024-06-28 13:41:37,921][09190] Fps is (10 sec: 44236.7, 60 sec: 42052.4, 300 sec: 42376.2). Total num frames: 3878928384. Throughput: 0: 42429.0. Samples: 157840520. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-28 13:41:37,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 13:41:39,754][09423] Updated weights for policy 0, policy_version 236757 (0.0037) [2024-06-28 13:41:42,921][09190] Fps is (10 sec: 42598.6, 60 sec: 42052.3, 300 sec: 42265.2). Total num frames: 3879124992. Throughput: 0: 42113.8. Samples: 157966680. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-28 13:41:42,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 13:41:44,116][09423] Updated weights for policy 0, policy_version 236767 (0.0045) [2024-06-28 13:41:47,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42325.4, 300 sec: 42265.2). Total num frames: 3879337984. Throughput: 0: 42034.4. Samples: 158208960. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-28 13:41:47,922][09190] Avg episode reward: [(0, '0.756')] [2024-06-28 13:41:47,997][09423] Updated weights for policy 0, policy_version 236777 (0.0029) [2024-06-28 13:41:51,926][09423] Updated weights for policy 0, policy_version 236787 (0.0034) [2024-06-28 13:41:52,921][09190] Fps is (10 sec: 42598.3, 60 sec: 41779.2, 300 sec: 42320.7). Total num frames: 3879550976. Throughput: 0: 42237.4. Samples: 158461380. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-28 13:41:52,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 13:41:55,543][09423] Updated weights for policy 0, policy_version 236797 (0.0036) [2024-06-28 13:41:57,921][09190] Fps is (10 sec: 39321.6, 60 sec: 41506.2, 300 sec: 42154.5). Total num frames: 3879731200. Throughput: 0: 41961.5. Samples: 158590100. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-28 13:41:57,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 13:41:59,722][09403] Signal inference workers to stop experience collection... (2200 times) [2024-06-28 13:41:59,750][09423] InferenceWorker_p0-w0: stopping experience collection (2200 times) [2024-06-28 13:41:59,777][09403] Signal inference workers to resume experience collection... (2200 times) [2024-06-28 13:41:59,777][09423] InferenceWorker_p0-w0: resuming experience collection (2200 times) [2024-06-28 13:41:59,918][09423] Updated weights for policy 0, policy_version 236807 (0.0033) [2024-06-28 13:42:02,921][09190] Fps is (10 sec: 44236.9, 60 sec: 42600.2, 300 sec: 42320.7). Total num frames: 3879993344. Throughput: 0: 42215.2. Samples: 158840760. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-28 13:42:02,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 13:42:03,085][09423] Updated weights for policy 0, policy_version 236817 (0.0040) [2024-06-28 13:42:07,542][09423] Updated weights for policy 0, policy_version 236827 (0.0039) [2024-06-28 13:42:07,921][09190] Fps is (10 sec: 44236.7, 60 sec: 41506.2, 300 sec: 42265.2). Total num frames: 3880173568. Throughput: 0: 42131.7. Samples: 159096480. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 13:42:07,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 13:42:10,705][09423] Updated weights for policy 0, policy_version 236837 (0.0038) [2024-06-28 13:42:12,921][09190] Fps is (10 sec: 37683.4, 60 sec: 41781.0, 300 sec: 42209.6). Total num frames: 3880370176. Throughput: 0: 41892.0. Samples: 159219620. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 13:42:12,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 13:42:15,139][09423] Updated weights for policy 0, policy_version 236847 (0.0030) [2024-06-28 13:42:17,921][09190] Fps is (10 sec: 45874.7, 60 sec: 42871.5, 300 sec: 42376.2). Total num frames: 3880632320. Throughput: 0: 42296.9. Samples: 159479000. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 13:42:17,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 13:42:17,992][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000236856_3880648704.pth... [2024-06-28 13:42:18,041][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000236236_3870490624.pth [2024-06-28 13:42:18,308][09423] Updated weights for policy 0, policy_version 236857 (0.0038) [2024-06-28 13:42:22,921][09190] Fps is (10 sec: 44236.2, 60 sec: 42052.3, 300 sec: 42265.2). Total num frames: 3880812544. Throughput: 0: 42151.0. Samples: 159737320. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 13:42:22,922][09190] Avg episode reward: [(0, '0.770')] [2024-06-28 13:42:22,923][09403] Saving new best policy, reward=0.770! [2024-06-28 13:42:23,286][09423] Updated weights for policy 0, policy_version 236867 (0.0035) [2024-06-28 13:42:26,111][09423] Updated weights for policy 0, policy_version 236877 (0.0032) [2024-06-28 13:42:27,921][09190] Fps is (10 sec: 39321.9, 60 sec: 42325.3, 300 sec: 42209.6). Total num frames: 3881025536. Throughput: 0: 42094.3. Samples: 159860920. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 13:42:27,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 13:42:30,855][09423] Updated weights for policy 0, policy_version 236887 (0.0032) [2024-06-28 13:42:32,921][09190] Fps is (10 sec: 42598.8, 60 sec: 42325.4, 300 sec: 42209.6). Total num frames: 3881238528. Throughput: 0: 42429.3. Samples: 160118280. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 13:42:32,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 13:42:34,354][09423] Updated weights for policy 0, policy_version 236897 (0.0032) [2024-06-28 13:42:37,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42052.3, 300 sec: 42320.7). Total num frames: 3881451520. Throughput: 0: 42569.4. Samples: 160377000. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 13:42:37,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 13:42:38,494][09423] Updated weights for policy 0, policy_version 236907 (0.0031) [2024-06-28 13:42:41,835][09423] Updated weights for policy 0, policy_version 236917 (0.0047) [2024-06-28 13:42:42,921][09190] Fps is (10 sec: 42598.1, 60 sec: 42325.3, 300 sec: 42154.4). Total num frames: 3881664512. Throughput: 0: 42492.3. Samples: 160502260. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 13:42:42,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 13:42:46,062][09423] Updated weights for policy 0, policy_version 236927 (0.0033) [2024-06-28 13:42:47,921][09190] Fps is (10 sec: 45875.2, 60 sec: 42871.4, 300 sec: 42376.2). Total num frames: 3881910272. Throughput: 0: 42640.0. Samples: 160759560. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 13:42:47,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 13:42:49,662][09423] Updated weights for policy 0, policy_version 236937 (0.0033) [2024-06-28 13:42:52,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42325.3, 300 sec: 42320.7). Total num frames: 3882090496. Throughput: 0: 42544.8. Samples: 161011000. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 13:42:52,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 13:42:53,725][09423] Updated weights for policy 0, policy_version 236947 (0.0027) [2024-06-28 13:42:57,350][09423] Updated weights for policy 0, policy_version 236957 (0.0038) [2024-06-28 13:42:57,921][09190] Fps is (10 sec: 39321.6, 60 sec: 42871.4, 300 sec: 42209.6). Total num frames: 3882303488. Throughput: 0: 42591.5. Samples: 161136240. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 13:42:57,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 13:43:01,851][09423] Updated weights for policy 0, policy_version 236967 (0.0042) [2024-06-28 13:43:02,921][09190] Fps is (10 sec: 42598.2, 60 sec: 42052.2, 300 sec: 42320.7). Total num frames: 3882516480. Throughput: 0: 42567.1. Samples: 161394520. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 13:43:02,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 13:43:05,105][09423] Updated weights for policy 0, policy_version 236977 (0.0033) [2024-06-28 13:43:07,921][09190] Fps is (10 sec: 40959.8, 60 sec: 42325.3, 300 sec: 42320.7). Total num frames: 3882713088. Throughput: 0: 42402.3. Samples: 161645420. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 13:43:07,922][09190] Avg episode reward: [(0, '0.756')] [2024-06-28 13:43:09,483][09423] Updated weights for policy 0, policy_version 236987 (0.0038) [2024-06-28 13:43:12,710][09423] Updated weights for policy 0, policy_version 236997 (0.0032) [2024-06-28 13:43:12,921][09190] Fps is (10 sec: 44237.4, 60 sec: 43144.5, 300 sec: 42265.2). Total num frames: 3882958848. Throughput: 0: 42498.7. Samples: 161773360. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 13:43:12,922][09190] Avg episode reward: [(0, '0.761')] [2024-06-28 13:43:17,054][09423] Updated weights for policy 0, policy_version 237007 (0.0028) [2024-06-28 13:43:17,924][09190] Fps is (10 sec: 42587.6, 60 sec: 41777.5, 300 sec: 42264.8). Total num frames: 3883139072. Throughput: 0: 42378.5. Samples: 162025420. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-28 13:43:17,924][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 13:43:20,733][09423] Updated weights for policy 0, policy_version 237017 (0.0038) [2024-06-28 13:43:22,921][09190] Fps is (10 sec: 37683.1, 60 sec: 42052.3, 300 sec: 42209.6). Total num frames: 3883335680. Throughput: 0: 42345.3. Samples: 162282540. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-28 13:43:22,922][09190] Avg episode reward: [(0, '0.756')] [2024-06-28 13:43:24,983][09423] Updated weights for policy 0, policy_version 237027 (0.0034) [2024-06-28 13:43:27,921][09190] Fps is (10 sec: 44247.8, 60 sec: 42598.3, 300 sec: 42209.6). Total num frames: 3883581440. Throughput: 0: 42339.6. Samples: 162407540. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-28 13:43:27,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 13:43:28,557][09423] Updated weights for policy 0, policy_version 237037 (0.0032) [2024-06-28 13:43:32,469][09423] Updated weights for policy 0, policy_version 237047 (0.0035) [2024-06-28 13:43:32,921][09190] Fps is (10 sec: 44236.4, 60 sec: 42325.3, 300 sec: 42265.2). Total num frames: 3883778048. Throughput: 0: 42203.0. Samples: 162658700. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-28 13:43:32,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 13:43:34,114][09403] Signal inference workers to stop experience collection... (2250 times) [2024-06-28 13:43:34,142][09423] InferenceWorker_p0-w0: stopping experience collection (2250 times) [2024-06-28 13:43:34,166][09403] Signal inference workers to resume experience collection... (2250 times) [2024-06-28 13:43:34,166][09423] InferenceWorker_p0-w0: resuming experience collection (2250 times) [2024-06-28 13:43:36,552][09423] Updated weights for policy 0, policy_version 237057 (0.0033) [2024-06-28 13:43:37,922][09190] Fps is (10 sec: 40959.7, 60 sec: 42325.2, 300 sec: 42265.7). Total num frames: 3883991040. Throughput: 0: 42358.6. Samples: 162917140. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-28 13:43:37,922][09190] Avg episode reward: [(0, '0.759')] [2024-06-28 13:43:40,010][09423] Updated weights for policy 0, policy_version 237067 (0.0029) [2024-06-28 13:43:42,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42325.3, 300 sec: 42154.1). Total num frames: 3884204032. Throughput: 0: 42374.1. Samples: 163043080. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-28 13:43:42,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 13:43:44,065][09423] Updated weights for policy 0, policy_version 237077 (0.0040) [2024-06-28 13:43:47,921][09190] Fps is (10 sec: 42599.2, 60 sec: 41779.2, 300 sec: 42320.7). Total num frames: 3884417024. Throughput: 0: 42285.0. Samples: 163297340. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-28 13:43:47,922][09190] Avg episode reward: [(0, '0.756')] [2024-06-28 13:43:47,970][09423] Updated weights for policy 0, policy_version 237087 (0.0043) [2024-06-28 13:43:51,748][09423] Updated weights for policy 0, policy_version 237097 (0.0047) [2024-06-28 13:43:52,923][09190] Fps is (10 sec: 44230.3, 60 sec: 42597.3, 300 sec: 42320.5). Total num frames: 3884646400. Throughput: 0: 42236.8. Samples: 163546140. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-28 13:43:52,923][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 13:43:55,803][09423] Updated weights for policy 0, policy_version 237107 (0.0030) [2024-06-28 13:43:57,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42325.4, 300 sec: 42209.6). Total num frames: 3884843008. Throughput: 0: 42216.5. Samples: 163673100. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-28 13:43:57,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 13:43:59,608][09423] Updated weights for policy 0, policy_version 237117 (0.0033) [2024-06-28 13:44:02,921][09190] Fps is (10 sec: 42604.9, 60 sec: 42598.5, 300 sec: 42376.2). Total num frames: 3885072384. Throughput: 0: 42313.9. Samples: 163929440. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-28 13:44:02,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 13:44:03,326][09423] Updated weights for policy 0, policy_version 237127 (0.0035) [2024-06-28 13:44:07,169][09423] Updated weights for policy 0, policy_version 237137 (0.0037) [2024-06-28 13:44:07,921][09190] Fps is (10 sec: 42597.6, 60 sec: 42598.3, 300 sec: 42265.1). Total num frames: 3885268992. Throughput: 0: 42312.7. Samples: 164186620. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-28 13:44:07,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 13:44:11,114][09423] Updated weights for policy 0, policy_version 237147 (0.0035) [2024-06-28 13:44:12,921][09190] Fps is (10 sec: 40959.7, 60 sec: 42052.2, 300 sec: 42265.2). Total num frames: 3885481984. Throughput: 0: 42286.7. Samples: 164310440. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-28 13:44:12,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 13:44:15,205][09423] Updated weights for policy 0, policy_version 237157 (0.0038) [2024-06-28 13:44:17,921][09190] Fps is (10 sec: 42598.7, 60 sec: 42600.2, 300 sec: 42320.7). Total num frames: 3885694976. Throughput: 0: 42412.9. Samples: 164567280. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-28 13:44:17,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 13:44:17,929][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000237164_3885694976.pth... [2024-06-28 13:44:18,005][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000236544_3875536896.pth [2024-06-28 13:44:18,628][09423] Updated weights for policy 0, policy_version 237167 (0.0047) [2024-06-28 13:44:22,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42598.4, 300 sec: 42154.1). Total num frames: 3885891584. Throughput: 0: 42293.9. Samples: 164820360. Policy #0 lag: (min: 1.0, avg: 9.8, max: 21.0) [2024-06-28 13:44:22,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 13:44:23,181][09423] Updated weights for policy 0, policy_version 237177 (0.0055) [2024-06-28 13:44:26,466][09423] Updated weights for policy 0, policy_version 237187 (0.0027) [2024-06-28 13:44:27,921][09190] Fps is (10 sec: 39321.4, 60 sec: 41779.2, 300 sec: 42209.6). Total num frames: 3886088192. Throughput: 0: 42254.6. Samples: 164944540. Policy #0 lag: (min: 1.0, avg: 11.0, max: 22.0) [2024-06-28 13:44:27,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 13:44:30,670][09423] Updated weights for policy 0, policy_version 237197 (0.0028) [2024-06-28 13:44:32,924][09190] Fps is (10 sec: 42587.7, 60 sec: 42323.6, 300 sec: 42264.8). Total num frames: 3886317568. Throughput: 0: 42148.2. Samples: 165194120. Policy #0 lag: (min: 1.0, avg: 11.0, max: 22.0) [2024-06-28 13:44:32,925][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 13:44:34,239][09423] Updated weights for policy 0, policy_version 237207 (0.0046) [2024-06-28 13:44:37,924][09190] Fps is (10 sec: 44226.1, 60 sec: 42323.6, 300 sec: 42209.3). Total num frames: 3886530560. Throughput: 0: 42423.9. Samples: 165455260. Policy #0 lag: (min: 1.0, avg: 11.0, max: 22.0) [2024-06-28 13:44:37,924][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 13:44:38,391][09423] Updated weights for policy 0, policy_version 237217 (0.0029) [2024-06-28 13:44:42,018][09423] Updated weights for policy 0, policy_version 237227 (0.0037) [2024-06-28 13:44:42,921][09190] Fps is (10 sec: 40970.6, 60 sec: 42052.3, 300 sec: 42320.7). Total num frames: 3886727168. Throughput: 0: 42265.8. Samples: 165575060. Policy #0 lag: (min: 1.0, avg: 11.0, max: 22.0) [2024-06-28 13:44:42,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 13:44:46,216][09423] Updated weights for policy 0, policy_version 237237 (0.0041) [2024-06-28 13:44:47,921][09190] Fps is (10 sec: 42609.5, 60 sec: 42325.4, 300 sec: 42376.3). Total num frames: 3886956544. Throughput: 0: 42284.5. Samples: 165832240. Policy #0 lag: (min: 1.0, avg: 11.0, max: 22.0) [2024-06-28 13:44:47,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 13:44:49,639][09423] Updated weights for policy 0, policy_version 237247 (0.0048) [2024-06-28 13:44:52,924][09190] Fps is (10 sec: 42587.6, 60 sec: 41778.5, 300 sec: 42098.2). Total num frames: 3887153152. Throughput: 0: 42192.5. Samples: 166085380. Policy #0 lag: (min: 1.0, avg: 11.0, max: 22.0) [2024-06-28 13:44:52,924][09190] Avg episode reward: [(0, '0.760')] [2024-06-28 13:44:54,055][09423] Updated weights for policy 0, policy_version 237257 (0.0023) [2024-06-28 13:44:57,535][09423] Updated weights for policy 0, policy_version 237267 (0.0026) [2024-06-28 13:44:57,921][09190] Fps is (10 sec: 42597.8, 60 sec: 42325.2, 300 sec: 42321.1). Total num frames: 3887382528. Throughput: 0: 42128.0. Samples: 166206200. Policy #0 lag: (min: 1.0, avg: 11.0, max: 22.0) [2024-06-28 13:44:57,923][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 13:45:01,809][09423] Updated weights for policy 0, policy_version 237277 (0.0028) [2024-06-28 13:45:02,921][09190] Fps is (10 sec: 44248.0, 60 sec: 42052.3, 300 sec: 42320.7). Total num frames: 3887595520. Throughput: 0: 42261.4. Samples: 166469040. Policy #0 lag: (min: 1.0, avg: 11.0, max: 22.0) [2024-06-28 13:45:02,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 13:45:05,637][09423] Updated weights for policy 0, policy_version 237287 (0.0036) [2024-06-28 13:45:06,192][09403] Signal inference workers to stop experience collection... (2300 times) [2024-06-28 13:45:06,192][09403] Signal inference workers to resume experience collection... (2300 times) [2024-06-28 13:45:06,231][09423] InferenceWorker_p0-w0: stopping experience collection (2300 times) [2024-06-28 13:45:06,231][09423] InferenceWorker_p0-w0: resuming experience collection (2300 times) [2024-06-28 13:45:07,921][09190] Fps is (10 sec: 40960.4, 60 sec: 42052.4, 300 sec: 42154.1). Total num frames: 3887792128. Throughput: 0: 42150.7. Samples: 166717140. Policy #0 lag: (min: 1.0, avg: 11.0, max: 22.0) [2024-06-28 13:45:07,922][09190] Avg episode reward: [(0, '0.759')] [2024-06-28 13:45:09,413][09423] Updated weights for policy 0, policy_version 237297 (0.0037) [2024-06-28 13:45:12,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42325.4, 300 sec: 42320.7). Total num frames: 3888021504. Throughput: 0: 42198.4. Samples: 166843460. Policy #0 lag: (min: 1.0, avg: 11.0, max: 22.0) [2024-06-28 13:45:12,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 13:45:13,186][09423] Updated weights for policy 0, policy_version 237307 (0.0030) [2024-06-28 13:45:17,416][09423] Updated weights for policy 0, policy_version 237317 (0.0040) [2024-06-28 13:45:17,921][09190] Fps is (10 sec: 44236.4, 60 sec: 42325.3, 300 sec: 42320.7). Total num frames: 3888234496. Throughput: 0: 42304.6. Samples: 167097720. Policy #0 lag: (min: 1.0, avg: 11.0, max: 22.0) [2024-06-28 13:45:17,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 13:45:21,253][09423] Updated weights for policy 0, policy_version 237327 (0.0036) [2024-06-28 13:45:22,921][09190] Fps is (10 sec: 42597.7, 60 sec: 42598.4, 300 sec: 42265.1). Total num frames: 3888447488. Throughput: 0: 42071.2. Samples: 167348360. Policy #0 lag: (min: 1.0, avg: 11.0, max: 22.0) [2024-06-28 13:45:22,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 13:45:25,107][09423] Updated weights for policy 0, policy_version 237337 (0.0033) [2024-06-28 13:45:27,921][09190] Fps is (10 sec: 40960.4, 60 sec: 42598.5, 300 sec: 42320.7). Total num frames: 3888644096. Throughput: 0: 42190.2. Samples: 167473620. Policy #0 lag: (min: 1.0, avg: 11.0, max: 22.0) [2024-06-28 13:45:27,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 13:45:28,750][09423] Updated weights for policy 0, policy_version 237347 (0.0034) [2024-06-28 13:45:32,709][09423] Updated weights for policy 0, policy_version 237357 (0.0034) [2024-06-28 13:45:32,921][09190] Fps is (10 sec: 40960.3, 60 sec: 42327.1, 300 sec: 42209.7). Total num frames: 3888857088. Throughput: 0: 42171.9. Samples: 167729980. Policy #0 lag: (min: 1.0, avg: 11.0, max: 22.0) [2024-06-28 13:45:32,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 13:45:36,561][09423] Updated weights for policy 0, policy_version 237367 (0.0031) [2024-06-28 13:45:37,921][09190] Fps is (10 sec: 40959.8, 60 sec: 42054.0, 300 sec: 42209.6). Total num frames: 3889053696. Throughput: 0: 42150.3. Samples: 167982040. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 13:45:37,922][09190] Avg episode reward: [(0, '0.762')] [2024-06-28 13:45:40,698][09423] Updated weights for policy 0, policy_version 237377 (0.0029) [2024-06-28 13:45:42,922][09190] Fps is (10 sec: 40959.4, 60 sec: 42325.2, 300 sec: 42265.1). Total num frames: 3889266688. Throughput: 0: 42292.8. Samples: 168109380. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 13:45:42,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 13:45:44,163][09423] Updated weights for policy 0, policy_version 237387 (0.0035) [2024-06-28 13:45:47,921][09190] Fps is (10 sec: 42598.0, 60 sec: 42052.2, 300 sec: 42154.1). Total num frames: 3889479680. Throughput: 0: 42018.5. Samples: 168359880. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 13:45:47,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 13:45:48,274][09423] Updated weights for policy 0, policy_version 237397 (0.0035) [2024-06-28 13:45:52,116][09423] Updated weights for policy 0, policy_version 237407 (0.0049) [2024-06-28 13:45:52,921][09190] Fps is (10 sec: 42599.0, 60 sec: 42327.1, 300 sec: 42209.6). Total num frames: 3889692672. Throughput: 0: 42157.7. Samples: 168614240. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 13:45:52,922][09190] Avg episode reward: [(0, '0.725')] [2024-06-28 13:45:56,059][09423] Updated weights for policy 0, policy_version 237417 (0.0034) [2024-06-28 13:45:57,921][09190] Fps is (10 sec: 44236.9, 60 sec: 42325.3, 300 sec: 42321.1). Total num frames: 3889922048. Throughput: 0: 42219.0. Samples: 168743320. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 13:45:57,922][09190] Avg episode reward: [(0, '0.729')] [2024-06-28 13:45:59,973][09423] Updated weights for policy 0, policy_version 237427 (0.0033) [2024-06-28 13:46:02,922][09190] Fps is (10 sec: 42597.9, 60 sec: 42052.1, 300 sec: 42154.1). Total num frames: 3890118656. Throughput: 0: 42237.3. Samples: 168998400. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 13:46:02,923][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 13:46:03,845][09423] Updated weights for policy 0, policy_version 237437 (0.0044) [2024-06-28 13:46:07,612][09423] Updated weights for policy 0, policy_version 237447 (0.0034) [2024-06-28 13:46:07,924][09190] Fps is (10 sec: 40950.1, 60 sec: 42323.5, 300 sec: 42265.2). Total num frames: 3890331648. Throughput: 0: 42281.3. Samples: 169251120. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 13:46:07,924][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 13:46:11,450][09423] Updated weights for policy 0, policy_version 237457 (0.0037) [2024-06-28 13:46:12,922][09190] Fps is (10 sec: 42598.5, 60 sec: 42052.1, 300 sec: 42320.7). Total num frames: 3890544640. Throughput: 0: 42316.7. Samples: 169377880. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 13:46:12,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 13:46:15,156][09423] Updated weights for policy 0, policy_version 237467 (0.0038) [2024-06-28 13:46:17,921][09190] Fps is (10 sec: 44247.5, 60 sec: 42325.3, 300 sec: 42320.7). Total num frames: 3890774016. Throughput: 0: 42376.8. Samples: 169636940. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 13:46:17,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 13:46:17,936][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000237474_3890774016.pth... [2024-06-28 13:46:17,999][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000236856_3880648704.pth [2024-06-28 13:46:19,218][09423] Updated weights for policy 0, policy_version 237477 (0.0028) [2024-06-28 13:46:22,736][09423] Updated weights for policy 0, policy_version 237487 (0.0034) [2024-06-28 13:46:22,921][09190] Fps is (10 sec: 44237.2, 60 sec: 42325.4, 300 sec: 42376.2). Total num frames: 3890987008. Throughput: 0: 42311.1. Samples: 169886040. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 13:46:22,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 13:46:26,963][09423] Updated weights for policy 0, policy_version 237497 (0.0044) [2024-06-28 13:46:27,921][09190] Fps is (10 sec: 42598.6, 60 sec: 42598.3, 300 sec: 42376.2). Total num frames: 3891200000. Throughput: 0: 42316.1. Samples: 170013600. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 13:46:27,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 13:46:30,378][09423] Updated weights for policy 0, policy_version 237507 (0.0028) [2024-06-28 13:46:32,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42598.4, 300 sec: 42320.7). Total num frames: 3891412992. Throughput: 0: 42594.7. Samples: 170276640. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 13:46:32,923][09190] Avg episode reward: [(0, '0.756')] [2024-06-28 13:46:34,710][09423] Updated weights for policy 0, policy_version 237517 (0.0041) [2024-06-28 13:46:37,921][09190] Fps is (10 sec: 40959.7, 60 sec: 42598.3, 300 sec: 42320.7). Total num frames: 3891609600. Throughput: 0: 42387.0. Samples: 170521660. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 13:46:37,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 13:46:38,544][09423] Updated weights for policy 0, policy_version 237527 (0.0038) [2024-06-28 13:46:42,390][09423] Updated weights for policy 0, policy_version 237537 (0.0043) [2024-06-28 13:46:42,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42598.5, 300 sec: 42320.7). Total num frames: 3891822592. Throughput: 0: 42304.9. Samples: 170647040. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 13:46:42,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 13:46:46,117][09423] Updated weights for policy 0, policy_version 237547 (0.0027) [2024-06-28 13:46:47,921][09190] Fps is (10 sec: 44237.3, 60 sec: 42871.5, 300 sec: 42376.3). Total num frames: 3892051968. Throughput: 0: 42405.5. Samples: 170906640. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-28 13:46:47,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 13:46:49,826][09403] Signal inference workers to stop experience collection... (2350 times) [2024-06-28 13:46:49,826][09403] Signal inference workers to resume experience collection... (2350 times) [2024-06-28 13:46:49,842][09423] InferenceWorker_p0-w0: stopping experience collection (2350 times) [2024-06-28 13:46:49,842][09423] InferenceWorker_p0-w0: resuming experience collection (2350 times) [2024-06-28 13:46:49,963][09423] Updated weights for policy 0, policy_version 237557 (0.0032) [2024-06-28 13:46:52,921][09190] Fps is (10 sec: 39322.0, 60 sec: 42052.3, 300 sec: 42320.7). Total num frames: 3892215808. Throughput: 0: 42393.1. Samples: 171158700. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-28 13:46:52,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 13:46:54,056][09423] Updated weights for policy 0, policy_version 237567 (0.0035) [2024-06-28 13:46:57,546][09423] Updated weights for policy 0, policy_version 237577 (0.0035) [2024-06-28 13:46:57,921][09190] Fps is (10 sec: 40960.1, 60 sec: 42325.4, 300 sec: 42265.2). Total num frames: 3892461568. Throughput: 0: 42368.6. Samples: 171284460. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-28 13:46:57,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 13:47:02,189][09423] Updated weights for policy 0, policy_version 237587 (0.0031) [2024-06-28 13:47:02,921][09190] Fps is (10 sec: 44236.9, 60 sec: 42325.5, 300 sec: 42320.7). Total num frames: 3892658176. Throughput: 0: 42191.2. Samples: 171535540. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-28 13:47:02,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 13:47:05,553][09423] Updated weights for policy 0, policy_version 237597 (0.0037) [2024-06-28 13:47:07,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42327.1, 300 sec: 42376.2). Total num frames: 3892871168. Throughput: 0: 42303.2. Samples: 171789680. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-28 13:47:07,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 13:47:09,670][09423] Updated weights for policy 0, policy_version 237607 (0.0041) [2024-06-28 13:47:12,921][09190] Fps is (10 sec: 42598.2, 60 sec: 42325.4, 300 sec: 42209.6). Total num frames: 3893084160. Throughput: 0: 42308.1. Samples: 171917460. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-28 13:47:12,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 13:47:13,098][09423] Updated weights for policy 0, policy_version 237617 (0.0047) [2024-06-28 13:47:17,222][09423] Updated weights for policy 0, policy_version 237627 (0.0029) [2024-06-28 13:47:17,924][09190] Fps is (10 sec: 42587.4, 60 sec: 42050.5, 300 sec: 42320.4). Total num frames: 3893297152. Throughput: 0: 42143.0. Samples: 172173180. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-28 13:47:17,924][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 13:47:20,726][09423] Updated weights for policy 0, policy_version 237637 (0.0033) [2024-06-28 13:47:22,921][09190] Fps is (10 sec: 42598.0, 60 sec: 42052.2, 300 sec: 42320.7). Total num frames: 3893510144. Throughput: 0: 42386.7. Samples: 172429060. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-28 13:47:22,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 13:47:25,094][09423] Updated weights for policy 0, policy_version 237647 (0.0038) [2024-06-28 13:47:27,921][09190] Fps is (10 sec: 40970.7, 60 sec: 41779.3, 300 sec: 42265.2). Total num frames: 3893706752. Throughput: 0: 42496.5. Samples: 172559380. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-28 13:47:27,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 13:47:28,557][09423] Updated weights for policy 0, policy_version 237657 (0.0038) [2024-06-28 13:47:32,709][09423] Updated weights for policy 0, policy_version 237667 (0.0041) [2024-06-28 13:47:32,921][09190] Fps is (10 sec: 42598.9, 60 sec: 42052.3, 300 sec: 42320.7). Total num frames: 3893936128. Throughput: 0: 42437.4. Samples: 172816320. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-28 13:47:32,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 13:47:35,986][09423] Updated weights for policy 0, policy_version 237677 (0.0029) [2024-06-28 13:47:37,921][09190] Fps is (10 sec: 44236.6, 60 sec: 42325.4, 300 sec: 42320.7). Total num frames: 3894149120. Throughput: 0: 42456.0. Samples: 173069220. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-28 13:47:37,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 13:47:40,571][09423] Updated weights for policy 0, policy_version 237687 (0.0037) [2024-06-28 13:47:42,921][09190] Fps is (10 sec: 42598.1, 60 sec: 42325.4, 300 sec: 42209.6). Total num frames: 3894362112. Throughput: 0: 42451.5. Samples: 173194780. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-28 13:47:42,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 13:47:43,692][09423] Updated weights for policy 0, policy_version 237697 (0.0028) [2024-06-28 13:47:47,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42052.3, 300 sec: 42320.7). Total num frames: 3894575104. Throughput: 0: 42499.5. Samples: 173448020. Policy #0 lag: (min: 0.0, avg: 11.3, max: 23.0) [2024-06-28 13:47:47,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 13:47:48,191][09423] Updated weights for policy 0, policy_version 237707 (0.0029) [2024-06-28 13:47:51,563][09423] Updated weights for policy 0, policy_version 237717 (0.0040) [2024-06-28 13:47:52,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42871.4, 300 sec: 42320.7). Total num frames: 3894788096. Throughput: 0: 42394.2. Samples: 173697420. Policy #0 lag: (min: 0.0, avg: 11.8, max: 21.0) [2024-06-28 13:47:52,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 13:47:56,068][09423] Updated weights for policy 0, policy_version 237727 (0.0035) [2024-06-28 13:47:57,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42325.3, 300 sec: 42320.7). Total num frames: 3895001088. Throughput: 0: 42380.4. Samples: 173824580. Policy #0 lag: (min: 0.0, avg: 11.8, max: 21.0) [2024-06-28 13:47:57,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 13:47:59,521][09423] Updated weights for policy 0, policy_version 237737 (0.0040) [2024-06-28 13:48:02,924][09190] Fps is (10 sec: 40949.7, 60 sec: 42323.5, 300 sec: 42320.3). Total num frames: 3895197696. Throughput: 0: 42372.5. Samples: 174079940. Policy #0 lag: (min: 0.0, avg: 11.8, max: 21.0) [2024-06-28 13:48:02,924][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 13:48:04,311][09423] Updated weights for policy 0, policy_version 237747 (0.0037) [2024-06-28 13:48:04,707][09403] Signal inference workers to stop experience collection... (2400 times) [2024-06-28 13:48:04,707][09403] Signal inference workers to resume experience collection... (2400 times) [2024-06-28 13:48:04,740][09423] InferenceWorker_p0-w0: stopping experience collection (2400 times) [2024-06-28 13:48:04,740][09423] InferenceWorker_p0-w0: resuming experience collection (2400 times) [2024-06-28 13:48:07,137][09423] Updated weights for policy 0, policy_version 237757 (0.0041) [2024-06-28 13:48:07,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42598.4, 300 sec: 42265.2). Total num frames: 3895427072. Throughput: 0: 42273.4. Samples: 174331360. Policy #0 lag: (min: 0.0, avg: 11.8, max: 21.0) [2024-06-28 13:48:07,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 13:48:12,087][09423] Updated weights for policy 0, policy_version 237767 (0.0037) [2024-06-28 13:48:12,921][09190] Fps is (10 sec: 42609.3, 60 sec: 42325.4, 300 sec: 42321.1). Total num frames: 3895623680. Throughput: 0: 42292.9. Samples: 174462560. Policy #0 lag: (min: 0.0, avg: 11.8, max: 21.0) [2024-06-28 13:48:12,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 13:48:14,955][09423] Updated weights for policy 0, policy_version 237777 (0.0036) [2024-06-28 13:48:17,921][09190] Fps is (10 sec: 39321.4, 60 sec: 42054.0, 300 sec: 42320.7). Total num frames: 3895820288. Throughput: 0: 42199.9. Samples: 174715320. Policy #0 lag: (min: 0.0, avg: 11.8, max: 21.0) [2024-06-28 13:48:17,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 13:48:17,942][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000237783_3895836672.pth... [2024-06-28 13:48:17,996][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000237164_3885694976.pth [2024-06-28 13:48:19,563][09423] Updated weights for policy 0, policy_version 237787 (0.0038) [2024-06-28 13:48:22,601][09423] Updated weights for policy 0, policy_version 237797 (0.0038) [2024-06-28 13:48:22,921][09190] Fps is (10 sec: 44236.4, 60 sec: 42598.4, 300 sec: 42320.7). Total num frames: 3896066048. Throughput: 0: 42072.0. Samples: 174962460. Policy #0 lag: (min: 0.0, avg: 11.8, max: 21.0) [2024-06-28 13:48:22,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 13:48:27,039][09423] Updated weights for policy 0, policy_version 237807 (0.0039) [2024-06-28 13:48:27,921][09190] Fps is (10 sec: 42598.7, 60 sec: 42325.3, 300 sec: 42265.2). Total num frames: 3896246272. Throughput: 0: 42295.6. Samples: 175098080. Policy #0 lag: (min: 0.0, avg: 11.8, max: 21.0) [2024-06-28 13:48:27,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 13:48:30,364][09423] Updated weights for policy 0, policy_version 237817 (0.0039) [2024-06-28 13:48:32,921][09190] Fps is (10 sec: 40960.1, 60 sec: 42325.3, 300 sec: 42320.7). Total num frames: 3896475648. Throughput: 0: 42166.6. Samples: 175345520. Policy #0 lag: (min: 0.0, avg: 11.8, max: 21.0) [2024-06-28 13:48:32,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 13:48:34,540][09423] Updated weights for policy 0, policy_version 237827 (0.0035) [2024-06-28 13:48:37,921][09190] Fps is (10 sec: 44236.6, 60 sec: 42325.3, 300 sec: 42320.7). Total num frames: 3896688640. Throughput: 0: 42482.7. Samples: 175609140. Policy #0 lag: (min: 0.0, avg: 11.8, max: 21.0) [2024-06-28 13:48:37,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 13:48:38,130][09423] Updated weights for policy 0, policy_version 237837 (0.0034) [2024-06-28 13:48:42,920][09423] Updated weights for policy 0, policy_version 237847 (0.0037) [2024-06-28 13:48:42,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42052.3, 300 sec: 42265.2). Total num frames: 3896885248. Throughput: 0: 42350.2. Samples: 175730340. Policy #0 lag: (min: 0.0, avg: 11.8, max: 21.0) [2024-06-28 13:48:42,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 13:48:45,830][09423] Updated weights for policy 0, policy_version 237857 (0.0042) [2024-06-28 13:48:47,921][09190] Fps is (10 sec: 40959.6, 60 sec: 42052.2, 300 sec: 42209.8). Total num frames: 3897098240. Throughput: 0: 42323.2. Samples: 175984380. Policy #0 lag: (min: 0.0, avg: 11.8, max: 21.0) [2024-06-28 13:48:47,922][09190] Avg episode reward: [(0, '0.756')] [2024-06-28 13:48:50,445][09423] Updated weights for policy 0, policy_version 237867 (0.0037) [2024-06-28 13:48:52,921][09190] Fps is (10 sec: 45874.6, 60 sec: 42598.3, 300 sec: 42376.2). Total num frames: 3897344000. Throughput: 0: 42520.3. Samples: 176244780. Policy #0 lag: (min: 0.0, avg: 11.8, max: 21.0) [2024-06-28 13:48:52,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 13:48:53,423][09423] Updated weights for policy 0, policy_version 237877 (0.0042) [2024-06-28 13:48:57,921][09190] Fps is (10 sec: 42599.1, 60 sec: 42052.3, 300 sec: 42209.6). Total num frames: 3897524224. Throughput: 0: 42510.7. Samples: 176375540. Policy #0 lag: (min: 0.0, avg: 11.8, max: 21.0) [2024-06-28 13:48:57,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 13:48:57,930][09423] Updated weights for policy 0, policy_version 237887 (0.0033) [2024-06-28 13:49:01,191][09423] Updated weights for policy 0, policy_version 237897 (0.0029) [2024-06-28 13:49:02,921][09190] Fps is (10 sec: 40960.3, 60 sec: 42600.2, 300 sec: 42320.7). Total num frames: 3897753600. Throughput: 0: 42356.4. Samples: 176621360. Policy #0 lag: (min: 0.0, avg: 12.0, max: 22.0) [2024-06-28 13:49:02,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 13:49:05,773][09423] Updated weights for policy 0, policy_version 237907 (0.0035) [2024-06-28 13:49:07,922][09190] Fps is (10 sec: 44235.9, 60 sec: 42325.2, 300 sec: 42320.7). Total num frames: 3897966592. Throughput: 0: 42635.5. Samples: 176881060. Policy #0 lag: (min: 0.0, avg: 12.0, max: 22.0) [2024-06-28 13:49:07,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 13:49:08,945][09423] Updated weights for policy 0, policy_version 237917 (0.0033) [2024-06-28 13:49:12,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42325.3, 300 sec: 42265.2). Total num frames: 3898163200. Throughput: 0: 42411.5. Samples: 177006600. Policy #0 lag: (min: 0.0, avg: 12.0, max: 22.0) [2024-06-28 13:49:12,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 13:49:13,354][09403] Signal inference workers to stop experience collection... (2450 times) [2024-06-28 13:49:13,360][09403] Signal inference workers to resume experience collection... (2450 times) [2024-06-28 13:49:13,368][09423] InferenceWorker_p0-w0: stopping experience collection (2450 times) [2024-06-28 13:49:13,371][09423] Updated weights for policy 0, policy_version 237927 (0.0033) [2024-06-28 13:49:13,380][09423] InferenceWorker_p0-w0: resuming experience collection (2450 times) [2024-06-28 13:49:16,859][09423] Updated weights for policy 0, policy_version 237937 (0.0029) [2024-06-28 13:49:17,924][09190] Fps is (10 sec: 44226.3, 60 sec: 43142.7, 300 sec: 42431.4). Total num frames: 3898408960. Throughput: 0: 42523.0. Samples: 177259160. Policy #0 lag: (min: 0.0, avg: 12.0, max: 22.0) [2024-06-28 13:49:17,924][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 13:49:21,313][09423] Updated weights for policy 0, policy_version 237947 (0.0038) [2024-06-28 13:49:22,924][09190] Fps is (10 sec: 42587.5, 60 sec: 42050.5, 300 sec: 42375.9). Total num frames: 3898589184. Throughput: 0: 42266.1. Samples: 177511220. Policy #0 lag: (min: 0.0, avg: 12.0, max: 22.0) [2024-06-28 13:49:22,924][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 13:49:24,651][09423] Updated weights for policy 0, policy_version 237957 (0.0054) [2024-06-28 13:49:27,924][09190] Fps is (10 sec: 40960.0, 60 sec: 42869.6, 300 sec: 42376.2). Total num frames: 3898818560. Throughput: 0: 42250.5. Samples: 177631720. Policy #0 lag: (min: 0.0, avg: 12.0, max: 22.0) [2024-06-28 13:49:27,924][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 13:49:28,782][09423] Updated weights for policy 0, policy_version 237967 (0.0036) [2024-06-28 13:49:32,533][09423] Updated weights for policy 0, policy_version 237977 (0.0035) [2024-06-28 13:49:32,921][09190] Fps is (10 sec: 42609.5, 60 sec: 42325.4, 300 sec: 42321.1). Total num frames: 3899015168. Throughput: 0: 42363.2. Samples: 177890720. Policy #0 lag: (min: 0.0, avg: 12.0, max: 22.0) [2024-06-28 13:49:32,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 13:49:36,351][09423] Updated weights for policy 0, policy_version 237987 (0.0042) [2024-06-28 13:49:37,921][09190] Fps is (10 sec: 39331.4, 60 sec: 42052.2, 300 sec: 42320.7). Total num frames: 3899211776. Throughput: 0: 42329.4. Samples: 178149600. Policy #0 lag: (min: 0.0, avg: 12.0, max: 22.0) [2024-06-28 13:49:37,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 13:49:39,938][09423] Updated weights for policy 0, policy_version 237997 (0.0026) [2024-06-28 13:49:42,921][09190] Fps is (10 sec: 42597.7, 60 sec: 42598.3, 300 sec: 42320.7). Total num frames: 3899441152. Throughput: 0: 42136.3. Samples: 178271680. Policy #0 lag: (min: 0.0, avg: 12.0, max: 22.0) [2024-06-28 13:49:42,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 13:49:44,572][09423] Updated weights for policy 0, policy_version 238007 (0.0027) [2024-06-28 13:49:47,922][09190] Fps is (10 sec: 44236.3, 60 sec: 42598.4, 300 sec: 42376.6). Total num frames: 3899654144. Throughput: 0: 42270.1. Samples: 178523520. Policy #0 lag: (min: 0.0, avg: 12.0, max: 22.0) [2024-06-28 13:49:47,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 13:49:47,990][09423] Updated weights for policy 0, policy_version 238017 (0.0034) [2024-06-28 13:49:52,044][09423] Updated weights for policy 0, policy_version 238027 (0.0041) [2024-06-28 13:49:52,921][09190] Fps is (10 sec: 40960.8, 60 sec: 41779.3, 300 sec: 42265.2). Total num frames: 3899850752. Throughput: 0: 42174.9. Samples: 178778920. Policy #0 lag: (min: 0.0, avg: 12.0, max: 22.0) [2024-06-28 13:49:52,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 13:49:55,701][09423] Updated weights for policy 0, policy_version 238037 (0.0032) [2024-06-28 13:49:57,921][09190] Fps is (10 sec: 42598.7, 60 sec: 42598.3, 300 sec: 42320.7). Total num frames: 3900080128. Throughput: 0: 42244.8. Samples: 178907620. Policy #0 lag: (min: 0.0, avg: 12.0, max: 22.0) [2024-06-28 13:49:57,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 13:49:59,533][09423] Updated weights for policy 0, policy_version 238047 (0.0041) [2024-06-28 13:50:02,925][09190] Fps is (10 sec: 44218.1, 60 sec: 42322.4, 300 sec: 42375.6). Total num frames: 3900293120. Throughput: 0: 42235.8. Samples: 179159840. Policy #0 lag: (min: 0.0, avg: 12.0, max: 22.0) [2024-06-28 13:50:02,926][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 13:50:03,374][09423] Updated weights for policy 0, policy_version 238057 (0.0042) [2024-06-28 13:50:07,451][09423] Updated weights for policy 0, policy_version 238067 (0.0040) [2024-06-28 13:50:07,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42325.4, 300 sec: 42320.7). Total num frames: 3900506112. Throughput: 0: 42163.7. Samples: 179408480. Policy #0 lag: (min: 0.0, avg: 12.0, max: 22.0) [2024-06-28 13:50:07,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 13:50:11,067][09423] Updated weights for policy 0, policy_version 238077 (0.0032) [2024-06-28 13:50:12,921][09190] Fps is (10 sec: 39337.7, 60 sec: 42052.2, 300 sec: 42209.6). Total num frames: 3900686336. Throughput: 0: 42432.6. Samples: 179541080. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-28 13:50:12,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 13:50:15,059][09423] Updated weights for policy 0, policy_version 238087 (0.0042) [2024-06-28 13:50:17,922][09190] Fps is (10 sec: 40959.3, 60 sec: 41780.8, 300 sec: 42265.1). Total num frames: 3900915712. Throughput: 0: 42337.1. Samples: 179795900. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-28 13:50:17,922][09190] Avg episode reward: [(0, '0.756')] [2024-06-28 13:50:17,932][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000238093_3900915712.pth... [2024-06-28 13:50:17,993][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000237474_3890774016.pth [2024-06-28 13:50:18,698][09423] Updated weights for policy 0, policy_version 238097 (0.0035) [2024-06-28 13:50:22,838][09423] Updated weights for policy 0, policy_version 238107 (0.0035) [2024-06-28 13:50:22,921][09190] Fps is (10 sec: 45875.1, 60 sec: 42600.2, 300 sec: 42376.2). Total num frames: 3901145088. Throughput: 0: 42212.9. Samples: 180049180. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-28 13:50:22,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 13:50:26,146][09423] Updated weights for policy 0, policy_version 238117 (0.0033) [2024-06-28 13:50:27,921][09190] Fps is (10 sec: 40960.5, 60 sec: 41780.9, 300 sec: 42265.2). Total num frames: 3901325312. Throughput: 0: 42337.8. Samples: 180176880. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-28 13:50:27,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 13:50:30,362][09423] Updated weights for policy 0, policy_version 238127 (0.0031) [2024-06-28 13:50:32,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42598.3, 300 sec: 42431.8). Total num frames: 3901571072. Throughput: 0: 42473.4. Samples: 180434820. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-28 13:50:32,924][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 13:50:34,573][09423] Updated weights for policy 0, policy_version 238137 (0.0037) [2024-06-28 13:50:37,629][09403] Signal inference workers to stop experience collection... (2500 times) [2024-06-28 13:50:37,629][09403] Signal inference workers to resume experience collection... (2500 times) [2024-06-28 13:50:37,667][09423] InferenceWorker_p0-w0: stopping experience collection (2500 times) [2024-06-28 13:50:37,667][09423] InferenceWorker_p0-w0: resuming experience collection (2500 times) [2024-06-28 13:50:37,921][09190] Fps is (10 sec: 44237.1, 60 sec: 42598.4, 300 sec: 42376.3). Total num frames: 3901767680. Throughput: 0: 42371.9. Samples: 180685660. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-28 13:50:37,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 13:50:38,331][09423] Updated weights for policy 0, policy_version 238147 (0.0037) [2024-06-28 13:50:42,126][09423] Updated weights for policy 0, policy_version 238157 (0.0044) [2024-06-28 13:50:42,921][09190] Fps is (10 sec: 40960.1, 60 sec: 42325.4, 300 sec: 42376.2). Total num frames: 3901980672. Throughput: 0: 42294.7. Samples: 180810880. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-28 13:50:42,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 13:50:46,028][09423] Updated weights for policy 0, policy_version 238167 (0.0032) [2024-06-28 13:50:47,922][09190] Fps is (10 sec: 40959.5, 60 sec: 42052.3, 300 sec: 42320.7). Total num frames: 3902177280. Throughput: 0: 42319.8. Samples: 181064060. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-28 13:50:47,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 13:50:49,871][09423] Updated weights for policy 0, policy_version 238177 (0.0032) [2024-06-28 13:50:52,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42325.2, 300 sec: 42265.2). Total num frames: 3902390272. Throughput: 0: 42444.9. Samples: 181318500. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-28 13:50:52,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 13:50:53,862][09423] Updated weights for policy 0, policy_version 238187 (0.0039) [2024-06-28 13:50:57,615][09423] Updated weights for policy 0, policy_version 238197 (0.0027) [2024-06-28 13:50:57,921][09190] Fps is (10 sec: 44237.5, 60 sec: 42325.4, 300 sec: 42376.3). Total num frames: 3902619648. Throughput: 0: 42201.0. Samples: 181440120. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-28 13:50:57,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 13:51:01,775][09423] Updated weights for policy 0, policy_version 238207 (0.0031) [2024-06-28 13:51:02,922][09190] Fps is (10 sec: 44236.5, 60 sec: 42328.1, 300 sec: 42376.6). Total num frames: 3902832640. Throughput: 0: 42283.6. Samples: 181698660. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-28 13:51:02,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 13:51:05,065][09423] Updated weights for policy 0, policy_version 238217 (0.0037) [2024-06-28 13:51:07,921][09190] Fps is (10 sec: 40959.5, 60 sec: 42052.2, 300 sec: 42320.7). Total num frames: 3903029248. Throughput: 0: 42511.6. Samples: 181962200. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-28 13:51:07,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 13:51:09,529][09423] Updated weights for policy 0, policy_version 238227 (0.0047) [2024-06-28 13:51:12,846][09423] Updated weights for policy 0, policy_version 238237 (0.0043) [2024-06-28 13:51:12,924][09190] Fps is (10 sec: 44226.5, 60 sec: 43142.8, 300 sec: 42375.9). Total num frames: 3903275008. Throughput: 0: 42430.2. Samples: 182086340. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-28 13:51:12,924][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 13:51:17,337][09423] Updated weights for policy 0, policy_version 238247 (0.0043) [2024-06-28 13:51:17,921][09190] Fps is (10 sec: 42598.6, 60 sec: 42325.5, 300 sec: 42265.2). Total num frames: 3903455232. Throughput: 0: 42285.9. Samples: 182337680. Policy #0 lag: (min: 0.0, avg: 11.1, max: 24.0) [2024-06-28 13:51:17,922][09190] Avg episode reward: [(0, '0.756')] [2024-06-28 13:51:20,929][09423] Updated weights for policy 0, policy_version 238257 (0.0045) [2024-06-28 13:51:22,921][09190] Fps is (10 sec: 37692.3, 60 sec: 41779.2, 300 sec: 42209.6). Total num frames: 3903651840. Throughput: 0: 42300.4. Samples: 182589180. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 13:51:22,924][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 13:51:24,838][09423] Updated weights for policy 0, policy_version 238267 (0.0036) [2024-06-28 13:51:27,922][09190] Fps is (10 sec: 42597.8, 60 sec: 42598.3, 300 sec: 42265.1). Total num frames: 3903881216. Throughput: 0: 42340.4. Samples: 182716200. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 13:51:27,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 13:51:28,573][09423] Updated weights for policy 0, policy_version 238277 (0.0033) [2024-06-28 13:51:32,823][09423] Updated weights for policy 0, policy_version 238287 (0.0023) [2024-06-28 13:51:32,921][09190] Fps is (10 sec: 44237.1, 60 sec: 42052.3, 300 sec: 42320.7). Total num frames: 3904094208. Throughput: 0: 42273.5. Samples: 182966360. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 13:51:32,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 13:51:36,289][09423] Updated weights for policy 0, policy_version 238297 (0.0031) [2024-06-28 13:51:37,921][09190] Fps is (10 sec: 39322.0, 60 sec: 41779.2, 300 sec: 42209.6). Total num frames: 3904274432. Throughput: 0: 42297.8. Samples: 183221900. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 13:51:37,922][09190] Avg episode reward: [(0, '0.756')] [2024-06-28 13:51:40,544][09423] Updated weights for policy 0, policy_version 238307 (0.0033) [2024-06-28 13:51:42,922][09190] Fps is (10 sec: 42597.9, 60 sec: 42325.3, 300 sec: 42265.1). Total num frames: 3904520192. Throughput: 0: 42353.6. Samples: 183346040. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 13:51:42,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 13:51:43,774][09423] Updated weights for policy 0, policy_version 238317 (0.0030) [2024-06-28 13:51:47,921][09190] Fps is (10 sec: 44236.7, 60 sec: 42325.4, 300 sec: 42376.2). Total num frames: 3904716800. Throughput: 0: 42277.4. Samples: 183601140. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 13:51:47,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 13:51:48,304][09423] Updated weights for policy 0, policy_version 238327 (0.0026) [2024-06-28 13:51:51,364][09423] Updated weights for policy 0, policy_version 238337 (0.0031) [2024-06-28 13:51:52,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42325.3, 300 sec: 42265.2). Total num frames: 3904929792. Throughput: 0: 42263.6. Samples: 183864060. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 13:51:52,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 13:51:53,870][09403] Signal inference workers to stop experience collection... (2550 times) [2024-06-28 13:51:53,923][09423] InferenceWorker_p0-w0: stopping experience collection (2550 times) [2024-06-28 13:51:53,930][09403] Signal inference workers to resume experience collection... (2550 times) [2024-06-28 13:51:53,942][09423] InferenceWorker_p0-w0: resuming experience collection (2550 times) [2024-06-28 13:51:55,846][09423] Updated weights for policy 0, policy_version 238347 (0.0030) [2024-06-28 13:51:57,921][09190] Fps is (10 sec: 44236.7, 60 sec: 42325.2, 300 sec: 42376.2). Total num frames: 3905159168. Throughput: 0: 42267.1. Samples: 183988260. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 13:51:57,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 13:51:59,644][09423] Updated weights for policy 0, policy_version 238357 (0.0032) [2024-06-28 13:52:02,921][09190] Fps is (10 sec: 42598.7, 60 sec: 42052.4, 300 sec: 42320.7). Total num frames: 3905355776. Throughput: 0: 42291.6. Samples: 184240800. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 13:52:02,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 13:52:03,629][09423] Updated weights for policy 0, policy_version 238367 (0.0040) [2024-06-28 13:52:07,303][09423] Updated weights for policy 0, policy_version 238377 (0.0034) [2024-06-28 13:52:07,921][09190] Fps is (10 sec: 40960.5, 60 sec: 42325.4, 300 sec: 42320.7). Total num frames: 3905568768. Throughput: 0: 42384.1. Samples: 184496460. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 13:52:07,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 13:52:11,381][09423] Updated weights for policy 0, policy_version 238387 (0.0030) [2024-06-28 13:52:12,921][09190] Fps is (10 sec: 44236.5, 60 sec: 42054.0, 300 sec: 42376.6). Total num frames: 3905798144. Throughput: 0: 42374.3. Samples: 184623040. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 13:52:12,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 13:52:15,188][09423] Updated weights for policy 0, policy_version 238397 (0.0034) [2024-06-28 13:52:17,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42325.4, 300 sec: 42320.7). Total num frames: 3905994752. Throughput: 0: 42593.8. Samples: 184883080. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 13:52:17,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 13:52:18,012][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000238404_3906011136.pth... [2024-06-28 13:52:18,061][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000237783_3895836672.pth [2024-06-28 13:52:18,995][09423] Updated weights for policy 0, policy_version 238407 (0.0038) [2024-06-28 13:52:22,750][09423] Updated weights for policy 0, policy_version 238417 (0.0031) [2024-06-28 13:52:22,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42871.4, 300 sec: 42431.8). Total num frames: 3906224128. Throughput: 0: 42460.4. Samples: 185132620. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 13:52:22,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 13:52:26,786][09423] Updated weights for policy 0, policy_version 238427 (0.0034) [2024-06-28 13:52:27,921][09190] Fps is (10 sec: 44236.6, 60 sec: 42598.5, 300 sec: 42376.2). Total num frames: 3906437120. Throughput: 0: 42617.0. Samples: 185263800. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 13:52:27,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 13:52:30,582][09423] Updated weights for policy 0, policy_version 238437 (0.0034) [2024-06-28 13:52:32,921][09190] Fps is (10 sec: 42598.7, 60 sec: 42598.4, 300 sec: 42376.2). Total num frames: 3906650112. Throughput: 0: 42562.3. Samples: 185516440. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 13:52:32,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 13:52:34,791][09423] Updated weights for policy 0, policy_version 238447 (0.0034) [2024-06-28 13:52:37,921][09190] Fps is (10 sec: 40959.8, 60 sec: 42871.5, 300 sec: 42320.7). Total num frames: 3906846720. Throughput: 0: 42191.1. Samples: 185762660. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 13:52:37,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 13:52:38,700][09423] Updated weights for policy 0, policy_version 238457 (0.0035) [2024-06-28 13:52:42,552][09423] Updated weights for policy 0, policy_version 238467 (0.0048) [2024-06-28 13:52:42,924][09190] Fps is (10 sec: 40949.3, 60 sec: 42323.6, 300 sec: 42320.3). Total num frames: 3907059712. Throughput: 0: 42314.1. Samples: 185892500. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 13:52:42,925][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 13:52:46,683][09423] Updated weights for policy 0, policy_version 238477 (0.0041) [2024-06-28 13:52:47,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42325.3, 300 sec: 42265.2). Total num frames: 3907256320. Throughput: 0: 42285.3. Samples: 186143640. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 13:52:47,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 13:52:50,260][09423] Updated weights for policy 0, policy_version 238487 (0.0031) [2024-06-28 13:52:52,921][09190] Fps is (10 sec: 42609.8, 60 sec: 42598.5, 300 sec: 42320.7). Total num frames: 3907485696. Throughput: 0: 42013.8. Samples: 186387080. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 13:52:52,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 13:52:54,496][09423] Updated weights for policy 0, policy_version 238497 (0.0040) [2024-06-28 13:52:57,870][09423] Updated weights for policy 0, policy_version 238507 (0.0044) [2024-06-28 13:52:57,924][09190] Fps is (10 sec: 44226.0, 60 sec: 42323.6, 300 sec: 42376.2). Total num frames: 3907698688. Throughput: 0: 42144.8. Samples: 186519660. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 13:52:57,924][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 13:53:02,352][09423] Updated weights for policy 0, policy_version 238517 (0.0037) [2024-06-28 13:53:02,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42325.4, 300 sec: 42265.2). Total num frames: 3907895296. Throughput: 0: 42057.0. Samples: 186775640. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 13:53:02,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 13:53:05,674][09423] Updated weights for policy 0, policy_version 238527 (0.0027) [2024-06-28 13:53:07,921][09190] Fps is (10 sec: 44247.9, 60 sec: 42871.4, 300 sec: 42431.8). Total num frames: 3908141056. Throughput: 0: 42133.9. Samples: 187028640. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 13:53:07,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 13:53:09,718][09423] Updated weights for policy 0, policy_version 238537 (0.0031) [2024-06-28 13:53:12,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42052.4, 300 sec: 42376.3). Total num frames: 3908321280. Throughput: 0: 42257.0. Samples: 187165360. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 13:53:12,921][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 13:53:13,106][09423] Updated weights for policy 0, policy_version 238547 (0.0025) [2024-06-28 13:53:17,079][09423] Updated weights for policy 0, policy_version 238557 (0.0044) [2024-06-28 13:53:17,921][09190] Fps is (10 sec: 37682.8, 60 sec: 42052.2, 300 sec: 42209.6). Total num frames: 3908517888. Throughput: 0: 42192.4. Samples: 187415100. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 13:53:17,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 13:53:21,143][09423] Updated weights for policy 0, policy_version 238567 (0.0045) [2024-06-28 13:53:22,921][09190] Fps is (10 sec: 44236.3, 60 sec: 42325.4, 300 sec: 42431.8). Total num frames: 3908763648. Throughput: 0: 42281.4. Samples: 187665320. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 13:53:22,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 13:53:25,525][09423] Updated weights for policy 0, policy_version 238577 (0.0024) [2024-06-28 13:53:27,921][09190] Fps is (10 sec: 44237.0, 60 sec: 42052.2, 300 sec: 42320.7). Total num frames: 3908960256. Throughput: 0: 42489.1. Samples: 187804400. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 13:53:27,922][09190] Avg episode reward: [(0, '0.756')] [2024-06-28 13:53:28,539][09423] Updated weights for policy 0, policy_version 238587 (0.0036) [2024-06-28 13:53:32,921][09190] Fps is (10 sec: 39321.4, 60 sec: 41779.2, 300 sec: 42265.2). Total num frames: 3909156864. Throughput: 0: 42517.3. Samples: 188056920. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 13:53:32,924][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 13:53:33,221][09423] Updated weights for policy 0, policy_version 238597 (0.0041) [2024-06-28 13:53:34,559][09403] Signal inference workers to stop experience collection... (2600 times) [2024-06-28 13:53:34,562][09403] Signal inference workers to resume experience collection... (2600 times) [2024-06-28 13:53:34,577][09423] InferenceWorker_p0-w0: stopping experience collection (2600 times) [2024-06-28 13:53:34,578][09423] InferenceWorker_p0-w0: resuming experience collection (2600 times) [2024-06-28 13:53:36,391][09423] Updated weights for policy 0, policy_version 238607 (0.0042) [2024-06-28 13:53:37,921][09190] Fps is (10 sec: 44237.0, 60 sec: 42598.4, 300 sec: 42431.8). Total num frames: 3909402624. Throughput: 0: 42574.6. Samples: 188302940. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 13:53:37,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 13:53:40,704][09423] Updated weights for policy 0, policy_version 238617 (0.0038) [2024-06-28 13:53:42,921][09190] Fps is (10 sec: 44236.6, 60 sec: 42327.1, 300 sec: 42376.2). Total num frames: 3909599232. Throughput: 0: 42632.5. Samples: 188438020. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 13:53:42,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 13:53:44,026][09423] Updated weights for policy 0, policy_version 238627 (0.0035) [2024-06-28 13:53:47,921][09190] Fps is (10 sec: 40959.8, 60 sec: 42598.4, 300 sec: 42265.2). Total num frames: 3909812224. Throughput: 0: 42447.0. Samples: 188685760. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 13:53:47,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 13:53:48,176][09423] Updated weights for policy 0, policy_version 238637 (0.0039) [2024-06-28 13:53:52,065][09423] Updated weights for policy 0, policy_version 238647 (0.0032) [2024-06-28 13:53:52,921][09190] Fps is (10 sec: 42599.1, 60 sec: 42325.3, 300 sec: 42376.2). Total num frames: 3910025216. Throughput: 0: 42566.7. Samples: 188944140. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 13:53:52,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 13:53:55,734][09423] Updated weights for policy 0, policy_version 238657 (0.0046) [2024-06-28 13:53:57,921][09190] Fps is (10 sec: 40960.4, 60 sec: 42054.1, 300 sec: 42265.2). Total num frames: 3910221824. Throughput: 0: 42286.2. Samples: 189068240. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 13:53:57,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 13:53:59,726][09423] Updated weights for policy 0, policy_version 238667 (0.0028) [2024-06-28 13:54:02,921][09190] Fps is (10 sec: 42597.8, 60 sec: 42598.3, 300 sec: 42320.7). Total num frames: 3910451200. Throughput: 0: 42245.8. Samples: 189316160. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 13:54:02,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 13:54:04,027][09423] Updated weights for policy 0, policy_version 238677 (0.0035) [2024-06-28 13:54:07,339][09423] Updated weights for policy 0, policy_version 238687 (0.0037) [2024-06-28 13:54:07,921][09190] Fps is (10 sec: 44236.3, 60 sec: 42052.2, 300 sec: 42376.2). Total num frames: 3910664192. Throughput: 0: 42345.7. Samples: 189570880. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 13:54:07,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 13:54:11,972][09423] Updated weights for policy 0, policy_version 238697 (0.0042) [2024-06-28 13:54:12,921][09190] Fps is (10 sec: 40960.7, 60 sec: 42325.3, 300 sec: 42210.0). Total num frames: 3910860800. Throughput: 0: 42070.8. Samples: 189697580. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 13:54:12,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 13:54:14,863][09423] Updated weights for policy 0, policy_version 238707 (0.0034) [2024-06-28 13:54:17,921][09190] Fps is (10 sec: 40960.4, 60 sec: 42598.5, 300 sec: 42321.1). Total num frames: 3911073792. Throughput: 0: 42005.9. Samples: 189947180. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 13:54:17,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 13:54:17,936][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000238713_3911073792.pth... [2024-06-28 13:54:17,997][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000238093_3900915712.pth [2024-06-28 13:54:19,518][09423] Updated weights for policy 0, policy_version 238717 (0.0042) [2024-06-28 13:54:22,813][09423] Updated weights for policy 0, policy_version 238727 (0.0039) [2024-06-28 13:54:22,921][09190] Fps is (10 sec: 44236.2, 60 sec: 42325.3, 300 sec: 42321.1). Total num frames: 3911303168. Throughput: 0: 42312.8. Samples: 190207020. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 13:54:22,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 13:54:26,852][09423] Updated weights for policy 0, policy_version 238737 (0.0038) [2024-06-28 13:54:27,922][09190] Fps is (10 sec: 40959.2, 60 sec: 42052.2, 300 sec: 42265.1). Total num frames: 3911483392. Throughput: 0: 42135.1. Samples: 190334100. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 13:54:27,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 13:54:30,847][09423] Updated weights for policy 0, policy_version 238747 (0.0032) [2024-06-28 13:54:32,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42871.5, 300 sec: 42431.8). Total num frames: 3911729152. Throughput: 0: 42302.2. Samples: 190589360. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 13:54:32,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 13:54:34,695][09423] Updated weights for policy 0, policy_version 238757 (0.0024) [2024-06-28 13:54:37,921][09190] Fps is (10 sec: 44237.2, 60 sec: 42052.2, 300 sec: 42320.7). Total num frames: 3911925760. Throughput: 0: 42287.0. Samples: 190847060. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 13:54:37,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 13:54:38,302][09423] Updated weights for policy 0, policy_version 238767 (0.0032) [2024-06-28 13:54:42,766][09423] Updated weights for policy 0, policy_version 238777 (0.0045) [2024-06-28 13:54:42,921][09190] Fps is (10 sec: 39321.6, 60 sec: 42052.3, 300 sec: 42265.2). Total num frames: 3912122368. Throughput: 0: 42219.9. Samples: 190968140. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 13:54:42,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 13:54:46,007][09423] Updated weights for policy 0, policy_version 238787 (0.0027) [2024-06-28 13:54:47,921][09190] Fps is (10 sec: 44237.1, 60 sec: 42598.5, 300 sec: 42431.8). Total num frames: 3912368128. Throughput: 0: 42415.7. Samples: 191224860. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 13:54:47,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 13:54:50,309][09423] Updated weights for policy 0, policy_version 238797 (0.0042) [2024-06-28 13:54:52,921][09190] Fps is (10 sec: 42598.9, 60 sec: 42052.3, 300 sec: 42265.2). Total num frames: 3912548352. Throughput: 0: 42428.1. Samples: 191480140. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 13:54:52,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 13:54:53,867][09423] Updated weights for policy 0, policy_version 238807 (0.0031) [2024-06-28 13:54:57,921][09190] Fps is (10 sec: 39321.4, 60 sec: 42325.3, 300 sec: 42265.8). Total num frames: 3912761344. Throughput: 0: 42250.1. Samples: 191598840. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 13:54:57,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 13:54:58,261][09423] Updated weights for policy 0, policy_version 238817 (0.0030) [2024-06-28 13:55:01,865][09423] Updated weights for policy 0, policy_version 238827 (0.0039) [2024-06-28 13:55:02,921][09190] Fps is (10 sec: 44236.8, 60 sec: 42325.4, 300 sec: 42320.7). Total num frames: 3912990720. Throughput: 0: 42453.8. Samples: 191857600. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 13:55:02,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 13:55:05,910][09423] Updated weights for policy 0, policy_version 238837 (0.0033) [2024-06-28 13:55:07,924][09190] Fps is (10 sec: 40949.8, 60 sec: 41777.5, 300 sec: 42320.3). Total num frames: 3913170944. Throughput: 0: 42421.2. Samples: 192116080. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 13:55:07,925][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 13:55:09,468][09423] Updated weights for policy 0, policy_version 238847 (0.0036) [2024-06-28 13:55:12,921][09190] Fps is (10 sec: 40959.5, 60 sec: 42325.2, 300 sec: 42320.7). Total num frames: 3913400320. Throughput: 0: 42191.2. Samples: 192232700. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 13:55:12,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 13:55:13,775][09423] Updated weights for policy 0, policy_version 238857 (0.0039) [2024-06-28 13:55:17,174][09423] Updated weights for policy 0, policy_version 238867 (0.0028) [2024-06-28 13:55:17,924][09190] Fps is (10 sec: 44236.9, 60 sec: 42323.5, 300 sec: 42264.8). Total num frames: 3913613312. Throughput: 0: 42280.0. Samples: 192492060. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 13:55:17,924][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 13:55:21,762][09423] Updated weights for policy 0, policy_version 238877 (0.0027) [2024-06-28 13:55:22,921][09190] Fps is (10 sec: 39321.9, 60 sec: 41506.2, 300 sec: 42265.2). Total num frames: 3913793536. Throughput: 0: 42170.7. Samples: 192744740. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 13:55:22,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 13:55:24,791][09403] Signal inference workers to stop experience collection... (2650 times) [2024-06-28 13:55:24,791][09403] Signal inference workers to resume experience collection... (2650 times) [2024-06-28 13:55:24,810][09423] InferenceWorker_p0-w0: stopping experience collection (2650 times) [2024-06-28 13:55:24,811][09423] InferenceWorker_p0-w0: resuming experience collection (2650 times) [2024-06-28 13:55:24,938][09423] Updated weights for policy 0, policy_version 238887 (0.0030) [2024-06-28 13:55:27,921][09190] Fps is (10 sec: 44247.4, 60 sec: 42871.5, 300 sec: 42320.7). Total num frames: 3914055680. Throughput: 0: 42175.5. Samples: 192866040. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 13:55:27,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 13:55:29,425][09423] Updated weights for policy 0, policy_version 238897 (0.0029) [2024-06-28 13:55:32,553][09423] Updated weights for policy 0, policy_version 238907 (0.0026) [2024-06-28 13:55:32,921][09190] Fps is (10 sec: 45875.2, 60 sec: 42052.3, 300 sec: 42320.7). Total num frames: 3914252288. Throughput: 0: 42241.7. Samples: 193125740. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 13:55:32,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 13:55:36,850][09423] Updated weights for policy 0, policy_version 238917 (0.0039) [2024-06-28 13:55:37,921][09190] Fps is (10 sec: 37683.4, 60 sec: 41779.2, 300 sec: 42209.6). Total num frames: 3914432512. Throughput: 0: 42366.1. Samples: 193386620. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 13:55:37,926][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 13:55:39,963][09423] Updated weights for policy 0, policy_version 238927 (0.0039) [2024-06-28 13:55:42,921][09190] Fps is (10 sec: 42598.8, 60 sec: 42598.5, 300 sec: 42376.3). Total num frames: 3914678272. Throughput: 0: 42409.4. Samples: 193507260. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 13:55:42,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 13:55:44,416][09423] Updated weights for policy 0, policy_version 238937 (0.0032) [2024-06-28 13:55:47,921][09190] Fps is (10 sec: 45876.0, 60 sec: 42052.3, 300 sec: 42376.3). Total num frames: 3914891264. Throughput: 0: 42403.6. Samples: 193765760. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 13:55:47,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 13:55:47,998][09423] Updated weights for policy 0, policy_version 238947 (0.0038) [2024-06-28 13:55:52,175][09423] Updated weights for policy 0, policy_version 238957 (0.0032) [2024-06-28 13:55:52,921][09190] Fps is (10 sec: 40959.5, 60 sec: 42325.3, 300 sec: 42265.2). Total num frames: 3915087872. Throughput: 0: 42226.8. Samples: 194016180. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 13:55:52,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 13:55:55,564][09423] Updated weights for policy 0, policy_version 238967 (0.0034) [2024-06-28 13:55:57,921][09190] Fps is (10 sec: 44236.5, 60 sec: 42871.5, 300 sec: 42376.3). Total num frames: 3915333632. Throughput: 0: 42303.2. Samples: 194136340. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 13:55:57,922][09190] Avg episode reward: [(0, '0.759')] [2024-06-28 13:55:59,686][09423] Updated weights for policy 0, policy_version 238977 (0.0031) [2024-06-28 13:56:02,921][09190] Fps is (10 sec: 44236.9, 60 sec: 42325.3, 300 sec: 42376.3). Total num frames: 3915530240. Throughput: 0: 42321.0. Samples: 194396400. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 13:56:02,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 13:56:03,488][09423] Updated weights for policy 0, policy_version 238987 (0.0032) [2024-06-28 13:56:07,921][09190] Fps is (10 sec: 37682.8, 60 sec: 42327.1, 300 sec: 42154.4). Total num frames: 3915710464. Throughput: 0: 42234.2. Samples: 194645280. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 13:56:07,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 13:56:08,342][09423] Updated weights for policy 0, policy_version 238997 (0.0032) [2024-06-28 13:56:11,382][09423] Updated weights for policy 0, policy_version 239007 (0.0043) [2024-06-28 13:56:12,921][09190] Fps is (10 sec: 40959.8, 60 sec: 42325.3, 300 sec: 42320.7). Total num frames: 3915939840. Throughput: 0: 42249.8. Samples: 194767280. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 13:56:12,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 13:56:15,883][09423] Updated weights for policy 0, policy_version 239017 (0.0027) [2024-06-28 13:56:17,921][09190] Fps is (10 sec: 44237.1, 60 sec: 42327.1, 300 sec: 42376.3). Total num frames: 3916152832. Throughput: 0: 42281.8. Samples: 195028420. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 13:56:17,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 13:56:17,931][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000239023_3916152832.pth... [2024-06-28 13:56:17,982][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000238404_3906011136.pth [2024-06-28 13:56:18,977][09423] Updated weights for policy 0, policy_version 239027 (0.0035) [2024-06-28 13:56:22,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42871.4, 300 sec: 42320.7). Total num frames: 3916365824. Throughput: 0: 41991.1. Samples: 195276220. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 13:56:22,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 13:56:23,417][09423] Updated weights for policy 0, policy_version 239037 (0.0025) [2024-06-28 13:56:26,519][09423] Updated weights for policy 0, policy_version 239047 (0.0034) [2024-06-28 13:56:27,921][09190] Fps is (10 sec: 42598.7, 60 sec: 42052.4, 300 sec: 42320.7). Total num frames: 3916578816. Throughput: 0: 42306.7. Samples: 195411060. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 13:56:27,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 13:56:30,955][09423] Updated weights for policy 0, policy_version 239057 (0.0044) [2024-06-28 13:56:32,921][09190] Fps is (10 sec: 39321.9, 60 sec: 41779.2, 300 sec: 42320.7). Total num frames: 3916759040. Throughput: 0: 42116.3. Samples: 195661000. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 13:56:32,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 13:56:34,427][09423] Updated weights for policy 0, policy_version 239067 (0.0034) [2024-06-28 13:56:37,924][09190] Fps is (10 sec: 42587.2, 60 sec: 42869.7, 300 sec: 42320.4). Total num frames: 3917004800. Throughput: 0: 42046.1. Samples: 195908360. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 13:56:37,925][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 13:56:38,437][09423] Updated weights for policy 0, policy_version 239077 (0.0034) [2024-06-28 13:56:42,228][09423] Updated weights for policy 0, policy_version 239087 (0.0035) [2024-06-28 13:56:42,921][09190] Fps is (10 sec: 45875.6, 60 sec: 42325.3, 300 sec: 42376.3). Total num frames: 3917217792. Throughput: 0: 42405.8. Samples: 196044600. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 13:56:42,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 13:56:46,707][09423] Updated weights for policy 0, policy_version 239097 (0.0040) [2024-06-28 13:56:47,922][09190] Fps is (10 sec: 37692.5, 60 sec: 41506.0, 300 sec: 42209.6). Total num frames: 3917381632. Throughput: 0: 42015.9. Samples: 196287120. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 13:56:47,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 13:56:50,218][09423] Updated weights for policy 0, policy_version 239107 (0.0032) [2024-06-28 13:56:52,416][09403] Signal inference workers to stop experience collection... (2700 times) [2024-06-28 13:56:52,416][09403] Signal inference workers to resume experience collection... (2700 times) [2024-06-28 13:56:52,446][09423] InferenceWorker_p0-w0: stopping experience collection (2700 times) [2024-06-28 13:56:52,446][09423] InferenceWorker_p0-w0: resuming experience collection (2700 times) [2024-06-28 13:56:52,922][09190] Fps is (10 sec: 40959.3, 60 sec: 42325.3, 300 sec: 42265.2). Total num frames: 3917627392. Throughput: 0: 41990.6. Samples: 196534860. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 13:56:52,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 13:56:54,411][09423] Updated weights for policy 0, policy_version 239117 (0.0038) [2024-06-28 13:56:57,921][09190] Fps is (10 sec: 44237.4, 60 sec: 41506.1, 300 sec: 42265.2). Total num frames: 3917824000. Throughput: 0: 42220.1. Samples: 196667180. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 13:56:57,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 13:56:58,190][09423] Updated weights for policy 0, policy_version 239127 (0.0028) [2024-06-28 13:57:02,304][09423] Updated weights for policy 0, policy_version 239137 (0.0038) [2024-06-28 13:57:02,924][09190] Fps is (10 sec: 40950.3, 60 sec: 41777.5, 300 sec: 42264.8). Total num frames: 3918036992. Throughput: 0: 41951.0. Samples: 196916320. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 13:57:02,924][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 13:57:05,699][09423] Updated weights for policy 0, policy_version 239147 (0.0046) [2024-06-28 13:57:07,921][09190] Fps is (10 sec: 44236.8, 60 sec: 42598.5, 300 sec: 42265.2). Total num frames: 3918266368. Throughput: 0: 42160.1. Samples: 197173420. Policy #0 lag: (min: 0.0, avg: 8.2, max: 21.0) [2024-06-28 13:57:07,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 13:57:09,970][09423] Updated weights for policy 0, policy_version 239157 (0.0027) [2024-06-28 13:57:12,921][09190] Fps is (10 sec: 44247.9, 60 sec: 42325.4, 300 sec: 42320.7). Total num frames: 3918479360. Throughput: 0: 42213.7. Samples: 197310680. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 13:57:12,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 13:57:13,231][09423] Updated weights for policy 0, policy_version 239167 (0.0027) [2024-06-28 13:57:17,523][09423] Updated weights for policy 0, policy_version 239177 (0.0043) [2024-06-28 13:57:17,922][09190] Fps is (10 sec: 40959.3, 60 sec: 42052.2, 300 sec: 42209.6). Total num frames: 3918675968. Throughput: 0: 42075.9. Samples: 197554420. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 13:57:17,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 13:57:21,140][09423] Updated weights for policy 0, policy_version 239187 (0.0044) [2024-06-28 13:57:22,921][09190] Fps is (10 sec: 42598.7, 60 sec: 42325.5, 300 sec: 42265.2). Total num frames: 3918905344. Throughput: 0: 42264.2. Samples: 197810140. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 13:57:22,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 13:57:25,428][09423] Updated weights for policy 0, policy_version 239197 (0.0040) [2024-06-28 13:57:27,921][09190] Fps is (10 sec: 42599.1, 60 sec: 42052.2, 300 sec: 42209.6). Total num frames: 3919101952. Throughput: 0: 42083.1. Samples: 197938340. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 13:57:27,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 13:57:28,931][09423] Updated weights for policy 0, policy_version 239207 (0.0031) [2024-06-28 13:57:32,921][09190] Fps is (10 sec: 39321.4, 60 sec: 42325.4, 300 sec: 42209.6). Total num frames: 3919298560. Throughput: 0: 42306.3. Samples: 198190900. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 13:57:32,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 13:57:33,553][09423] Updated weights for policy 0, policy_version 239217 (0.0042) [2024-06-28 13:57:36,695][09423] Updated weights for policy 0, policy_version 239227 (0.0043) [2024-06-28 13:57:37,921][09190] Fps is (10 sec: 44236.8, 60 sec: 42327.2, 300 sec: 42321.1). Total num frames: 3919544320. Throughput: 0: 42353.5. Samples: 198440760. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 13:57:37,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 13:57:41,317][09423] Updated weights for policy 0, policy_version 239237 (0.0031) [2024-06-28 13:57:42,922][09190] Fps is (10 sec: 44236.0, 60 sec: 42052.1, 300 sec: 42320.7). Total num frames: 3919740928. Throughput: 0: 42206.9. Samples: 198566500. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 13:57:42,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 13:57:44,516][09423] Updated weights for policy 0, policy_version 239247 (0.0047) [2024-06-28 13:57:47,921][09190] Fps is (10 sec: 39321.1, 60 sec: 42598.4, 300 sec: 42209.6). Total num frames: 3919937536. Throughput: 0: 42336.1. Samples: 198821340. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 13:57:47,922][09190] Avg episode reward: [(0, '0.756')] [2024-06-28 13:57:48,821][09423] Updated weights for policy 0, policy_version 239257 (0.0042) [2024-06-28 13:57:52,050][09423] Updated weights for policy 0, policy_version 239267 (0.0038) [2024-06-28 13:57:52,921][09190] Fps is (10 sec: 42599.0, 60 sec: 42325.4, 300 sec: 42265.5). Total num frames: 3920166912. Throughput: 0: 42240.4. Samples: 199074240. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 13:57:52,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 13:57:56,376][09423] Updated weights for policy 0, policy_version 239277 (0.0033) [2024-06-28 13:57:57,921][09190] Fps is (10 sec: 42598.7, 60 sec: 42325.3, 300 sec: 42265.2). Total num frames: 3920363520. Throughput: 0: 42232.9. Samples: 199211160. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 13:57:57,922][09190] Avg episode reward: [(0, '0.756')] [2024-06-28 13:57:59,507][09423] Updated weights for policy 0, policy_version 239287 (0.0040) [2024-06-28 13:58:02,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42327.1, 300 sec: 42154.1). Total num frames: 3920576512. Throughput: 0: 42340.1. Samples: 199459720. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 13:58:02,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 13:58:03,893][09423] Updated weights for policy 0, policy_version 239297 (0.0037) [2024-06-28 13:58:07,297][09423] Updated weights for policy 0, policy_version 239307 (0.0026) [2024-06-28 13:58:07,921][09190] Fps is (10 sec: 44236.7, 60 sec: 42325.3, 300 sec: 42320.7). Total num frames: 3920805888. Throughput: 0: 42362.1. Samples: 199716440. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 13:58:07,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 13:58:11,593][09423] Updated weights for policy 0, policy_version 239317 (0.0023) [2024-06-28 13:58:12,921][09190] Fps is (10 sec: 40960.1, 60 sec: 41779.2, 300 sec: 42265.2). Total num frames: 3920986112. Throughput: 0: 42366.2. Samples: 199844820. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 13:58:12,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 13:58:12,983][09403] Signal inference workers to stop experience collection... (2750 times) [2024-06-28 13:58:12,984][09403] Signal inference workers to resume experience collection... (2750 times) [2024-06-28 13:58:13,024][09423] InferenceWorker_p0-w0: stopping experience collection (2750 times) [2024-06-28 13:58:13,025][09423] InferenceWorker_p0-w0: resuming experience collection (2750 times) [2024-06-28 13:58:15,344][09423] Updated weights for policy 0, policy_version 239327 (0.0046) [2024-06-28 13:58:17,923][09190] Fps is (10 sec: 44228.9, 60 sec: 42870.3, 300 sec: 42320.4). Total num frames: 3921248256. Throughput: 0: 42505.4. Samples: 200103720. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 13:58:17,924][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 13:58:17,943][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000239334_3921248256.pth... [2024-06-28 13:58:18,006][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000238713_3911073792.pth [2024-06-28 13:58:19,823][09423] Updated weights for policy 0, policy_version 239337 (0.0028) [2024-06-28 13:58:22,921][09190] Fps is (10 sec: 45875.1, 60 sec: 42325.3, 300 sec: 42320.7). Total num frames: 3921444864. Throughput: 0: 42520.9. Samples: 200354200. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 13:58:22,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 13:58:22,953][09423] Updated weights for policy 0, policy_version 239347 (0.0042) [2024-06-28 13:58:27,330][09423] Updated weights for policy 0, policy_version 239357 (0.0027) [2024-06-28 13:58:27,921][09190] Fps is (10 sec: 39328.9, 60 sec: 42325.3, 300 sec: 42320.7). Total num frames: 3921641472. Throughput: 0: 42592.2. Samples: 200483140. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 13:58:27,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 13:58:30,631][09423] Updated weights for policy 0, policy_version 239367 (0.0039) [2024-06-28 13:58:32,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42871.5, 300 sec: 42265.2). Total num frames: 3921870848. Throughput: 0: 42612.5. Samples: 200738900. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 13:58:32,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 13:58:34,854][09423] Updated weights for policy 0, policy_version 239377 (0.0036) [2024-06-28 13:58:37,922][09190] Fps is (10 sec: 42597.6, 60 sec: 42052.1, 300 sec: 42265.2). Total num frames: 3922067456. Throughput: 0: 42673.2. Samples: 200994540. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 13:58:37,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 13:58:38,370][09423] Updated weights for policy 0, policy_version 239387 (0.0029) [2024-06-28 13:58:42,819][09423] Updated weights for policy 0, policy_version 239397 (0.0035) [2024-06-28 13:58:42,921][09190] Fps is (10 sec: 40960.1, 60 sec: 42325.5, 300 sec: 42265.2). Total num frames: 3922280448. Throughput: 0: 42370.3. Samples: 201117820. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 13:58:42,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 13:58:46,011][09423] Updated weights for policy 0, policy_version 239407 (0.0028) [2024-06-28 13:58:47,921][09190] Fps is (10 sec: 42599.2, 60 sec: 42598.5, 300 sec: 42265.2). Total num frames: 3922493440. Throughput: 0: 42472.0. Samples: 201370960. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 13:58:47,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 13:58:50,556][09423] Updated weights for policy 0, policy_version 239417 (0.0038) [2024-06-28 13:58:52,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42325.4, 300 sec: 42320.7). Total num frames: 3922706432. Throughput: 0: 42515.2. Samples: 201629620. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 13:58:52,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 13:58:54,037][09423] Updated weights for policy 0, policy_version 239427 (0.0047) [2024-06-28 13:58:57,921][09190] Fps is (10 sec: 40959.8, 60 sec: 42325.3, 300 sec: 42209.6). Total num frames: 3922903040. Throughput: 0: 42307.9. Samples: 201748680. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 13:58:57,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 13:58:58,665][09423] Updated weights for policy 0, policy_version 239437 (0.0038) [2024-06-28 13:59:01,693][09423] Updated weights for policy 0, policy_version 239447 (0.0038) [2024-06-28 13:59:02,921][09190] Fps is (10 sec: 44236.4, 60 sec: 42871.4, 300 sec: 42320.7). Total num frames: 3923148800. Throughput: 0: 42163.9. Samples: 202001020. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 13:59:02,925][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 13:59:06,295][09423] Updated weights for policy 0, policy_version 239457 (0.0039) [2024-06-28 13:59:07,921][09190] Fps is (10 sec: 40960.2, 60 sec: 41779.3, 300 sec: 42209.6). Total num frames: 3923312640. Throughput: 0: 42432.9. Samples: 202263680. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 13:59:07,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 13:59:09,306][09423] Updated weights for policy 0, policy_version 239467 (0.0032) [2024-06-28 13:59:12,921][09190] Fps is (10 sec: 40959.8, 60 sec: 42871.4, 300 sec: 42320.7). Total num frames: 3923558400. Throughput: 0: 42249.2. Samples: 202384360. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 13:59:12,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 13:59:13,750][09423] Updated weights for policy 0, policy_version 239477 (0.0030) [2024-06-28 13:59:17,145][09423] Updated weights for policy 0, policy_version 239487 (0.0039) [2024-06-28 13:59:17,922][09190] Fps is (10 sec: 45874.1, 60 sec: 42053.4, 300 sec: 42265.1). Total num frames: 3923771392. Throughput: 0: 42201.1. Samples: 202637960. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 13:59:17,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 13:59:21,683][09423] Updated weights for policy 0, policy_version 239497 (0.0040) [2024-06-28 13:59:22,922][09190] Fps is (10 sec: 40959.6, 60 sec: 42052.1, 300 sec: 42320.7). Total num frames: 3923968000. Throughput: 0: 42251.5. Samples: 202895860. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 13:59:22,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 13:59:24,965][09423] Updated weights for policy 0, policy_version 239507 (0.0029) [2024-06-28 13:59:27,921][09190] Fps is (10 sec: 42599.4, 60 sec: 42598.4, 300 sec: 42265.2). Total num frames: 3924197376. Throughput: 0: 42266.2. Samples: 203019800. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 13:59:27,922][09190] Avg episode reward: [(0, '0.759')] [2024-06-28 13:59:29,187][09423] Updated weights for policy 0, policy_version 239517 (0.0024) [2024-06-28 13:59:32,541][09423] Updated weights for policy 0, policy_version 239527 (0.0022) [2024-06-28 13:59:32,921][09190] Fps is (10 sec: 44238.0, 60 sec: 42325.4, 300 sec: 42320.7). Total num frames: 3924410368. Throughput: 0: 42440.9. Samples: 203280800. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 13:59:32,922][09190] Avg episode reward: [(0, '0.761')] [2024-06-28 13:59:36,694][09423] Updated weights for policy 0, policy_version 239537 (0.0040) [2024-06-28 13:59:37,921][09190] Fps is (10 sec: 40959.5, 60 sec: 42325.4, 300 sec: 42320.7). Total num frames: 3924606976. Throughput: 0: 42461.2. Samples: 203540380. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 13:59:37,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 13:59:40,132][09423] Updated weights for policy 0, policy_version 239547 (0.0035) [2024-06-28 13:59:42,921][09190] Fps is (10 sec: 42597.9, 60 sec: 42598.3, 300 sec: 42265.2). Total num frames: 3924836352. Throughput: 0: 42464.0. Samples: 203659560. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 13:59:42,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 13:59:44,726][09423] Updated weights for policy 0, policy_version 239557 (0.0030) [2024-06-28 13:59:47,488][09403] Signal inference workers to stop experience collection... (2800 times) [2024-06-28 13:59:47,488][09403] Signal inference workers to resume experience collection... (2800 times) [2024-06-28 13:59:47,510][09423] InferenceWorker_p0-w0: stopping experience collection (2800 times) [2024-06-28 13:59:47,510][09423] InferenceWorker_p0-w0: resuming experience collection (2800 times) [2024-06-28 13:59:47,846][09423] Updated weights for policy 0, policy_version 239567 (0.0032) [2024-06-28 13:59:47,921][09190] Fps is (10 sec: 45875.6, 60 sec: 42871.4, 300 sec: 42431.8). Total num frames: 3925065728. Throughput: 0: 42569.8. Samples: 203916660. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 13:59:47,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 13:59:52,260][09423] Updated weights for policy 0, policy_version 239577 (0.0039) [2024-06-28 13:59:52,921][09190] Fps is (10 sec: 39321.5, 60 sec: 42052.2, 300 sec: 42265.2). Total num frames: 3925229568. Throughput: 0: 42380.4. Samples: 204170800. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 13:59:52,922][09190] Avg episode reward: [(0, '0.756')] [2024-06-28 13:59:55,786][09423] Updated weights for policy 0, policy_version 239587 (0.0025) [2024-06-28 13:59:57,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42871.5, 300 sec: 42320.7). Total num frames: 3925475328. Throughput: 0: 42422.8. Samples: 204293380. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 13:59:57,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 13:59:59,833][09423] Updated weights for policy 0, policy_version 239597 (0.0038) [2024-06-28 14:00:02,921][09190] Fps is (10 sec: 45875.6, 60 sec: 42325.4, 300 sec: 42432.2). Total num frames: 3925688320. Throughput: 0: 42626.0. Samples: 204556120. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 14:00:02,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:00:03,204][09423] Updated weights for policy 0, policy_version 239607 (0.0038) [2024-06-28 14:00:07,725][09423] Updated weights for policy 0, policy_version 239617 (0.0049) [2024-06-28 14:00:07,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42871.5, 300 sec: 42320.7). Total num frames: 3925884928. Throughput: 0: 42468.7. Samples: 204806940. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 14:00:07,930][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:00:11,299][09423] Updated weights for policy 0, policy_version 239627 (0.0038) [2024-06-28 14:00:12,921][09190] Fps is (10 sec: 40959.6, 60 sec: 42325.4, 300 sec: 42321.1). Total num frames: 3926097920. Throughput: 0: 42502.6. Samples: 204932420. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 14:00:12,930][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 14:00:15,254][09423] Updated weights for policy 0, policy_version 239637 (0.0038) [2024-06-28 14:00:17,922][09190] Fps is (10 sec: 42596.1, 60 sec: 42325.1, 300 sec: 42431.7). Total num frames: 3926310912. Throughput: 0: 42567.5. Samples: 205196360. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 14:00:17,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:00:18,020][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000239644_3926327296.pth... [2024-06-28 14:00:18,076][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000239023_3916152832.pth [2024-06-28 14:00:18,864][09423] Updated weights for policy 0, policy_version 239647 (0.0026) [2024-06-28 14:00:22,921][09190] Fps is (10 sec: 42598.8, 60 sec: 42598.6, 300 sec: 42265.2). Total num frames: 3926523904. Throughput: 0: 42386.3. Samples: 205447760. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 14:00:22,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:00:22,972][09423] Updated weights for policy 0, policy_version 239657 (0.0032) [2024-06-28 14:00:26,512][09423] Updated weights for policy 0, policy_version 239667 (0.0032) [2024-06-28 14:00:27,921][09190] Fps is (10 sec: 44238.8, 60 sec: 42598.3, 300 sec: 42376.2). Total num frames: 3926753280. Throughput: 0: 42628.4. Samples: 205577840. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 14:00:27,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:00:30,861][09423] Updated weights for policy 0, policy_version 239677 (0.0046) [2024-06-28 14:00:32,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42325.3, 300 sec: 42431.8). Total num frames: 3926949888. Throughput: 0: 42581.3. Samples: 205832820. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 14:00:32,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:00:34,042][09423] Updated weights for policy 0, policy_version 239687 (0.0034) [2024-06-28 14:00:37,924][09190] Fps is (10 sec: 42587.8, 60 sec: 42869.7, 300 sec: 42375.9). Total num frames: 3927179264. Throughput: 0: 42447.4. Samples: 206081040. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 14:00:37,924][09190] Avg episode reward: [(0, '0.756')] [2024-06-28 14:00:38,287][09423] Updated weights for policy 0, policy_version 239697 (0.0036) [2024-06-28 14:00:41,927][09423] Updated weights for policy 0, policy_version 239707 (0.0024) [2024-06-28 14:00:42,922][09190] Fps is (10 sec: 44236.1, 60 sec: 42598.3, 300 sec: 42376.2). Total num frames: 3927392256. Throughput: 0: 42622.5. Samples: 206211400. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 14:00:42,922][09190] Avg episode reward: [(0, '0.760')] [2024-06-28 14:00:46,192][09423] Updated weights for policy 0, policy_version 239717 (0.0033) [2024-06-28 14:00:47,921][09190] Fps is (10 sec: 37692.7, 60 sec: 41506.1, 300 sec: 42265.2). Total num frames: 3927556096. Throughput: 0: 42396.9. Samples: 206463980. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 14:00:47,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:00:49,538][09423] Updated weights for policy 0, policy_version 239727 (0.0040) [2024-06-28 14:00:52,921][09190] Fps is (10 sec: 40960.3, 60 sec: 42871.5, 300 sec: 42265.2). Total num frames: 3927801856. Throughput: 0: 42490.5. Samples: 206719020. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 14:00:52,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:00:54,085][09423] Updated weights for policy 0, policy_version 239737 (0.0048) [2024-06-28 14:00:57,317][09423] Updated weights for policy 0, policy_version 239747 (0.0033) [2024-06-28 14:00:57,921][09190] Fps is (10 sec: 47513.2, 60 sec: 42598.3, 300 sec: 42376.2). Total num frames: 3928031232. Throughput: 0: 42691.1. Samples: 206853520. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 14:00:57,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:01:01,712][09423] Updated weights for policy 0, policy_version 239757 (0.0038) [2024-06-28 14:01:02,921][09190] Fps is (10 sec: 40960.4, 60 sec: 42052.3, 300 sec: 42376.3). Total num frames: 3928211456. Throughput: 0: 42416.9. Samples: 207105100. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 14:01:02,922][09190] Avg episode reward: [(0, '0.803')] [2024-06-28 14:01:02,998][09403] Saving new best policy, reward=0.803! [2024-06-28 14:01:04,310][09403] Signal inference workers to stop experience collection... (2850 times) [2024-06-28 14:01:04,347][09423] InferenceWorker_p0-w0: stopping experience collection (2850 times) [2024-06-28 14:01:04,373][09403] Signal inference workers to resume experience collection... (2850 times) [2024-06-28 14:01:04,374][09423] InferenceWorker_p0-w0: resuming experience collection (2850 times) [2024-06-28 14:01:04,992][09423] Updated weights for policy 0, policy_version 239767 (0.0040) [2024-06-28 14:01:07,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42871.4, 300 sec: 42431.8). Total num frames: 3928457216. Throughput: 0: 42495.0. Samples: 207360040. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 14:01:07,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 14:01:09,249][09423] Updated weights for policy 0, policy_version 239777 (0.0037) [2024-06-28 14:01:12,670][09423] Updated weights for policy 0, policy_version 239787 (0.0028) [2024-06-28 14:01:12,921][09190] Fps is (10 sec: 45875.3, 60 sec: 42871.5, 300 sec: 42431.8). Total num frames: 3928670208. Throughput: 0: 42661.8. Samples: 207497620. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 14:01:12,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:01:16,926][09423] Updated weights for policy 0, policy_version 239797 (0.0033) [2024-06-28 14:01:17,921][09190] Fps is (10 sec: 39321.7, 60 sec: 42325.6, 300 sec: 42320.7). Total num frames: 3928850432. Throughput: 0: 42579.0. Samples: 207748880. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 14:01:17,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 14:01:20,410][09423] Updated weights for policy 0, policy_version 239807 (0.0032) [2024-06-28 14:01:22,924][09190] Fps is (10 sec: 42587.4, 60 sec: 42869.6, 300 sec: 42431.4). Total num frames: 3929096192. Throughput: 0: 42555.1. Samples: 207996020. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 14:01:22,924][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 14:01:24,938][09423] Updated weights for policy 0, policy_version 239817 (0.0032) [2024-06-28 14:01:27,921][09190] Fps is (10 sec: 44237.2, 60 sec: 42325.4, 300 sec: 42487.3). Total num frames: 3929292800. Throughput: 0: 42670.8. Samples: 208131580. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 14:01:27,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:01:28,133][09423] Updated weights for policy 0, policy_version 239827 (0.0048) [2024-06-28 14:01:32,921][09190] Fps is (10 sec: 37692.8, 60 sec: 42052.3, 300 sec: 42265.5). Total num frames: 3929473024. Throughput: 0: 42673.8. Samples: 208384300. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 14:01:32,922][09190] Avg episode reward: [(0, '0.759')] [2024-06-28 14:01:32,945][09423] Updated weights for policy 0, policy_version 239837 (0.0029) [2024-06-28 14:01:35,793][09423] Updated weights for policy 0, policy_version 239847 (0.0029) [2024-06-28 14:01:37,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42327.1, 300 sec: 42376.2). Total num frames: 3929718784. Throughput: 0: 42587.2. Samples: 208635440. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 14:01:37,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:01:40,561][09423] Updated weights for policy 0, policy_version 239857 (0.0042) [2024-06-28 14:01:42,921][09190] Fps is (10 sec: 44236.5, 60 sec: 42052.3, 300 sec: 42487.3). Total num frames: 3929915392. Throughput: 0: 42476.0. Samples: 208764940. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 14:01:42,922][09190] Avg episode reward: [(0, '0.760')] [2024-06-28 14:01:43,674][09423] Updated weights for policy 0, policy_version 239867 (0.0032) [2024-06-28 14:01:47,921][09190] Fps is (10 sec: 40960.1, 60 sec: 42871.5, 300 sec: 42376.3). Total num frames: 3930128384. Throughput: 0: 42378.7. Samples: 209012140. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 14:01:47,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 14:01:47,995][09423] Updated weights for policy 0, policy_version 239877 (0.0024) [2024-06-28 14:01:51,478][09423] Updated weights for policy 0, policy_version 239887 (0.0040) [2024-06-28 14:01:52,921][09190] Fps is (10 sec: 45875.1, 60 sec: 42871.4, 300 sec: 42542.8). Total num frames: 3930374144. Throughput: 0: 42471.1. Samples: 209271240. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 14:01:52,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 14:01:55,720][09423] Updated weights for policy 0, policy_version 239897 (0.0029) [2024-06-28 14:01:57,921][09190] Fps is (10 sec: 44236.2, 60 sec: 42325.4, 300 sec: 42487.7). Total num frames: 3930570752. Throughput: 0: 42433.7. Samples: 209407140. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 14:01:57,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 14:01:59,100][09423] Updated weights for policy 0, policy_version 239907 (0.0044) [2024-06-28 14:02:02,921][09190] Fps is (10 sec: 39321.8, 60 sec: 42598.3, 300 sec: 42376.2). Total num frames: 3930767360. Throughput: 0: 42291.6. Samples: 209652000. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 14:02:02,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 14:02:03,757][09423] Updated weights for policy 0, policy_version 239917 (0.0044) [2024-06-28 14:02:06,739][09423] Updated weights for policy 0, policy_version 239927 (0.0029) [2024-06-28 14:02:07,922][09190] Fps is (10 sec: 44236.0, 60 sec: 42598.3, 300 sec: 42487.3). Total num frames: 3931013120. Throughput: 0: 42521.2. Samples: 209909380. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 14:02:07,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:02:11,589][09423] Updated weights for policy 0, policy_version 239937 (0.0044) [2024-06-28 14:02:12,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42052.2, 300 sec: 42431.8). Total num frames: 3931193344. Throughput: 0: 42418.6. Samples: 210040420. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 14:02:12,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 14:02:14,315][09423] Updated weights for policy 0, policy_version 239947 (0.0031) [2024-06-28 14:02:17,921][09190] Fps is (10 sec: 40960.6, 60 sec: 42871.4, 300 sec: 42431.8). Total num frames: 3931422720. Throughput: 0: 42343.9. Samples: 210289780. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 14:02:17,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 14:02:17,937][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000239955_3931422720.pth... [2024-06-28 14:02:17,988][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000239334_3921248256.pth [2024-06-28 14:02:19,011][09423] Updated weights for policy 0, policy_version 239957 (0.0038) [2024-06-28 14:02:21,569][09403] Signal inference workers to stop experience collection... (2900 times) [2024-06-28 14:02:21,571][09403] Signal inference workers to resume experience collection... (2900 times) [2024-06-28 14:02:21,591][09423] InferenceWorker_p0-w0: stopping experience collection (2900 times) [2024-06-28 14:02:21,591][09423] InferenceWorker_p0-w0: resuming experience collection (2900 times) [2024-06-28 14:02:21,857][09423] Updated weights for policy 0, policy_version 239967 (0.0041) [2024-06-28 14:02:22,921][09190] Fps is (10 sec: 44237.0, 60 sec: 42327.1, 300 sec: 42487.3). Total num frames: 3931635712. Throughput: 0: 42315.1. Samples: 210539620. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 14:02:22,922][09190] Avg episode reward: [(0, '0.756')] [2024-06-28 14:02:26,855][09423] Updated weights for policy 0, policy_version 239977 (0.0036) [2024-06-28 14:02:27,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42325.3, 300 sec: 42487.3). Total num frames: 3931832320. Throughput: 0: 42379.6. Samples: 210672020. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 14:02:27,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:02:29,773][09423] Updated weights for policy 0, policy_version 239987 (0.0045) [2024-06-28 14:02:32,921][09190] Fps is (10 sec: 44237.0, 60 sec: 43417.6, 300 sec: 42487.3). Total num frames: 3932078080. Throughput: 0: 42588.9. Samples: 210928640. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 14:02:32,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:02:34,486][09423] Updated weights for policy 0, policy_version 239997 (0.0028) [2024-06-28 14:02:37,610][09423] Updated weights for policy 0, policy_version 240007 (0.0030) [2024-06-28 14:02:37,921][09190] Fps is (10 sec: 44236.8, 60 sec: 42598.3, 300 sec: 42487.3). Total num frames: 3932274688. Throughput: 0: 42481.8. Samples: 211182920. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 14:02:37,930][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:02:41,953][09423] Updated weights for policy 0, policy_version 240017 (0.0042) [2024-06-28 14:02:42,921][09190] Fps is (10 sec: 39321.2, 60 sec: 42598.4, 300 sec: 42487.3). Total num frames: 3932471296. Throughput: 0: 42333.8. Samples: 211312160. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 14:02:42,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 14:02:45,183][09423] Updated weights for policy 0, policy_version 240027 (0.0035) [2024-06-28 14:02:47,921][09190] Fps is (10 sec: 44237.4, 60 sec: 43144.5, 300 sec: 42542.9). Total num frames: 3932717056. Throughput: 0: 42618.8. Samples: 211569840. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 14:02:47,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:02:49,944][09423] Updated weights for policy 0, policy_version 240037 (0.0045) [2024-06-28 14:02:52,887][09423] Updated weights for policy 0, policy_version 240047 (0.0037) [2024-06-28 14:02:52,924][09190] Fps is (10 sec: 45863.7, 60 sec: 42596.6, 300 sec: 42598.0). Total num frames: 3932930048. Throughput: 0: 42528.9. Samples: 211823280. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 14:02:52,925][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:02:57,701][09423] Updated weights for policy 0, policy_version 240057 (0.0034) [2024-06-28 14:02:57,921][09190] Fps is (10 sec: 37683.1, 60 sec: 42052.3, 300 sec: 42431.8). Total num frames: 3933093888. Throughput: 0: 42409.4. Samples: 211948840. Policy #0 lag: (min: 0.0, avg: 9.5, max: 22.0) [2024-06-28 14:02:57,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:03:00,803][09423] Updated weights for policy 0, policy_version 240067 (0.0046) [2024-06-28 14:03:02,921][09190] Fps is (10 sec: 40970.3, 60 sec: 42871.4, 300 sec: 42487.3). Total num frames: 3933339648. Throughput: 0: 42403.6. Samples: 212197940. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 14:03:02,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:03:05,347][09423] Updated weights for policy 0, policy_version 240077 (0.0032) [2024-06-28 14:03:07,921][09190] Fps is (10 sec: 44236.3, 60 sec: 42052.4, 300 sec: 42542.8). Total num frames: 3933536256. Throughput: 0: 42537.3. Samples: 212453800. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 14:03:07,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:03:08,748][09423] Updated weights for policy 0, policy_version 240087 (0.0039) [2024-06-28 14:03:12,921][09190] Fps is (10 sec: 39322.1, 60 sec: 42325.4, 300 sec: 42321.0). Total num frames: 3933732864. Throughput: 0: 42292.5. Samples: 212575180. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 14:03:12,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:03:13,113][09423] Updated weights for policy 0, policy_version 240097 (0.0035) [2024-06-28 14:03:16,591][09423] Updated weights for policy 0, policy_version 240107 (0.0033) [2024-06-28 14:03:17,921][09190] Fps is (10 sec: 42598.9, 60 sec: 42325.4, 300 sec: 42431.8). Total num frames: 3933962240. Throughput: 0: 42292.9. Samples: 212831820. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 14:03:17,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 14:03:20,691][09423] Updated weights for policy 0, policy_version 240117 (0.0041) [2024-06-28 14:03:22,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42052.3, 300 sec: 42431.8). Total num frames: 3934158848. Throughput: 0: 42383.7. Samples: 213090180. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 14:03:22,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:03:24,102][09423] Updated weights for policy 0, policy_version 240127 (0.0030) [2024-06-28 14:03:27,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42325.4, 300 sec: 42376.2). Total num frames: 3934371840. Throughput: 0: 42306.8. Samples: 213215960. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 14:03:27,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 14:03:28,271][09423] Updated weights for policy 0, policy_version 240137 (0.0034) [2024-06-28 14:03:31,651][09423] Updated weights for policy 0, policy_version 240147 (0.0027) [2024-06-28 14:03:32,921][09190] Fps is (10 sec: 44236.8, 60 sec: 42052.3, 300 sec: 42487.3). Total num frames: 3934601216. Throughput: 0: 42186.2. Samples: 213468220. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 14:03:32,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 14:03:36,357][09423] Updated weights for policy 0, policy_version 240157 (0.0038) [2024-06-28 14:03:37,921][09190] Fps is (10 sec: 42598.2, 60 sec: 42052.3, 300 sec: 42431.8). Total num frames: 3934797824. Throughput: 0: 42357.5. Samples: 213729260. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 14:03:37,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 14:03:39,351][09423] Updated weights for policy 0, policy_version 240167 (0.0038) [2024-06-28 14:03:42,921][09190] Fps is (10 sec: 40959.6, 60 sec: 42325.3, 300 sec: 42431.8). Total num frames: 3935010816. Throughput: 0: 42327.5. Samples: 213853580. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 14:03:42,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:03:43,877][09423] Updated weights for policy 0, policy_version 240177 (0.0030) [2024-06-28 14:03:47,251][09423] Updated weights for policy 0, policy_version 240187 (0.0037) [2024-06-28 14:03:47,921][09190] Fps is (10 sec: 44236.6, 60 sec: 42052.2, 300 sec: 42487.3). Total num frames: 3935240192. Throughput: 0: 42562.2. Samples: 214113240. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 14:03:47,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 14:03:51,659][09423] Updated weights for policy 0, policy_version 240197 (0.0038) [2024-06-28 14:03:52,921][09190] Fps is (10 sec: 42598.7, 60 sec: 41781.0, 300 sec: 42487.3). Total num frames: 3935436800. Throughput: 0: 42513.4. Samples: 214366900. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 14:03:52,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 14:03:55,228][09423] Updated weights for policy 0, policy_version 240207 (0.0037) [2024-06-28 14:03:56,295][09403] Signal inference workers to stop experience collection... (2950 times) [2024-06-28 14:03:56,320][09423] InferenceWorker_p0-w0: stopping experience collection (2950 times) [2024-06-28 14:03:56,354][09403] Signal inference workers to resume experience collection... (2950 times) [2024-06-28 14:03:56,355][09423] InferenceWorker_p0-w0: resuming experience collection (2950 times) [2024-06-28 14:03:57,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42598.3, 300 sec: 42376.2). Total num frames: 3935649792. Throughput: 0: 42455.0. Samples: 214485660. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 14:03:57,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:03:59,208][09423] Updated weights for policy 0, policy_version 240217 (0.0042) [2024-06-28 14:04:02,719][09423] Updated weights for policy 0, policy_version 240227 (0.0027) [2024-06-28 14:04:02,921][09190] Fps is (10 sec: 44236.8, 60 sec: 42325.4, 300 sec: 42598.4). Total num frames: 3935879168. Throughput: 0: 42511.1. Samples: 214744820. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 14:04:02,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 14:04:07,011][09423] Updated weights for policy 0, policy_version 240237 (0.0028) [2024-06-28 14:04:07,921][09190] Fps is (10 sec: 40960.3, 60 sec: 42052.3, 300 sec: 42376.3). Total num frames: 3936059392. Throughput: 0: 42399.1. Samples: 214998140. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 14:04:07,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 14:04:10,550][09423] Updated weights for policy 0, policy_version 240247 (0.0039) [2024-06-28 14:04:12,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42598.3, 300 sec: 42431.8). Total num frames: 3936288768. Throughput: 0: 42394.2. Samples: 215123700. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 14:04:12,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:04:14,874][09423] Updated weights for policy 0, policy_version 240257 (0.0041) [2024-06-28 14:04:17,921][09190] Fps is (10 sec: 44236.7, 60 sec: 42325.3, 300 sec: 42487.4). Total num frames: 3936501760. Throughput: 0: 42442.2. Samples: 215378120. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 14:04:17,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 14:04:18,095][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000240266_3936518144.pth... [2024-06-28 14:04:18,164][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000239644_3926327296.pth [2024-06-28 14:04:18,297][09423] Updated weights for policy 0, policy_version 240267 (0.0028) [2024-06-28 14:04:22,458][09423] Updated weights for policy 0, policy_version 240277 (0.0031) [2024-06-28 14:04:22,921][09190] Fps is (10 sec: 42598.7, 60 sec: 42598.4, 300 sec: 42431.8). Total num frames: 3936714752. Throughput: 0: 42347.1. Samples: 215634880. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 14:04:22,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:04:25,870][09423] Updated weights for policy 0, policy_version 240287 (0.0040) [2024-06-28 14:04:27,921][09190] Fps is (10 sec: 44236.4, 60 sec: 42871.4, 300 sec: 42487.3). Total num frames: 3936944128. Throughput: 0: 42347.5. Samples: 215759220. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 14:04:27,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 14:04:30,101][09423] Updated weights for policy 0, policy_version 240297 (0.0036) [2024-06-28 14:04:32,922][09190] Fps is (10 sec: 42595.9, 60 sec: 42324.9, 300 sec: 42487.3). Total num frames: 3937140736. Throughput: 0: 42263.5. Samples: 216015120. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 14:04:32,923][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 14:04:33,827][09423] Updated weights for policy 0, policy_version 240307 (0.0028) [2024-06-28 14:04:37,921][09190] Fps is (10 sec: 39322.0, 60 sec: 42325.3, 300 sec: 42376.3). Total num frames: 3937337344. Throughput: 0: 42262.7. Samples: 216268720. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 14:04:37,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 14:04:38,013][09423] Updated weights for policy 0, policy_version 240317 (0.0025) [2024-06-28 14:04:41,365][09423] Updated weights for policy 0, policy_version 240327 (0.0030) [2024-06-28 14:04:42,921][09190] Fps is (10 sec: 44239.0, 60 sec: 42871.5, 300 sec: 42431.8). Total num frames: 3937583104. Throughput: 0: 42453.3. Samples: 216396060. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 14:04:42,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 14:04:45,462][09423] Updated weights for policy 0, policy_version 240337 (0.0028) [2024-06-28 14:04:47,921][09190] Fps is (10 sec: 44236.3, 60 sec: 42325.3, 300 sec: 42542.9). Total num frames: 3937779712. Throughput: 0: 42483.5. Samples: 216656580. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 14:04:47,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:04:48,774][09423] Updated weights for policy 0, policy_version 240347 (0.0027) [2024-06-28 14:04:52,921][09190] Fps is (10 sec: 39321.9, 60 sec: 42325.4, 300 sec: 42376.2). Total num frames: 3937976320. Throughput: 0: 42460.9. Samples: 216908880. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 14:04:52,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:04:53,151][09423] Updated weights for policy 0, policy_version 240357 (0.0037) [2024-06-28 14:04:56,891][09423] Updated weights for policy 0, policy_version 240367 (0.0036) [2024-06-28 14:04:57,921][09190] Fps is (10 sec: 44236.8, 60 sec: 42871.4, 300 sec: 42487.3). Total num frames: 3938222080. Throughput: 0: 42487.5. Samples: 217035640. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 14:04:57,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 14:05:01,085][09423] Updated weights for policy 0, policy_version 240377 (0.0024) [2024-06-28 14:05:02,921][09190] Fps is (10 sec: 44236.5, 60 sec: 42325.3, 300 sec: 42487.3). Total num frames: 3938418688. Throughput: 0: 42475.1. Samples: 217289500. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 14:05:02,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:05:04,313][09423] Updated weights for policy 0, policy_version 240387 (0.0037) [2024-06-28 14:05:07,921][09190] Fps is (10 sec: 39322.1, 60 sec: 42598.4, 300 sec: 42431.8). Total num frames: 3938615296. Throughput: 0: 42604.9. Samples: 217552100. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 14:05:07,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 14:05:08,654][09423] Updated weights for policy 0, policy_version 240397 (0.0035) [2024-06-28 14:05:12,261][09423] Updated weights for policy 0, policy_version 240407 (0.0034) [2024-06-28 14:05:12,921][09190] Fps is (10 sec: 42598.8, 60 sec: 42598.5, 300 sec: 42487.4). Total num frames: 3938844672. Throughput: 0: 42588.6. Samples: 217675700. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 14:05:12,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:05:16,675][09423] Updated weights for policy 0, policy_version 240417 (0.0045) [2024-06-28 14:05:17,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42325.3, 300 sec: 42431.8). Total num frames: 3939041280. Throughput: 0: 42572.6. Samples: 217930860. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 14:05:17,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 14:05:20,122][09423] Updated weights for policy 0, policy_version 240427 (0.0049) [2024-06-28 14:05:20,683][09403] Signal inference workers to stop experience collection... (3000 times) [2024-06-28 14:05:20,685][09403] Signal inference workers to resume experience collection... (3000 times) [2024-06-28 14:05:20,732][09423] InferenceWorker_p0-w0: stopping experience collection (3000 times) [2024-06-28 14:05:20,732][09423] InferenceWorker_p0-w0: resuming experience collection (3000 times) [2024-06-28 14:05:22,921][09190] Fps is (10 sec: 40959.5, 60 sec: 42325.3, 300 sec: 42376.2). Total num frames: 3939254272. Throughput: 0: 42611.9. Samples: 218186260. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2024-06-28 14:05:22,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:05:24,297][09423] Updated weights for policy 0, policy_version 240437 (0.0030) [2024-06-28 14:05:27,679][09423] Updated weights for policy 0, policy_version 240447 (0.0044) [2024-06-28 14:05:27,921][09190] Fps is (10 sec: 44236.7, 60 sec: 42325.4, 300 sec: 42487.3). Total num frames: 3939483648. Throughput: 0: 42439.6. Samples: 218305840. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2024-06-28 14:05:27,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:05:31,804][09423] Updated weights for policy 0, policy_version 240457 (0.0029) [2024-06-28 14:05:32,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42325.7, 300 sec: 42376.6). Total num frames: 3939680256. Throughput: 0: 42460.0. Samples: 218567280. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2024-06-28 14:05:32,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 14:05:35,435][09423] Updated weights for policy 0, policy_version 240467 (0.0035) [2024-06-28 14:05:37,921][09190] Fps is (10 sec: 40959.8, 60 sec: 42598.4, 300 sec: 42376.3). Total num frames: 3939893248. Throughput: 0: 42495.5. Samples: 218821180. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2024-06-28 14:05:37,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 14:05:39,472][09423] Updated weights for policy 0, policy_version 240477 (0.0030) [2024-06-28 14:05:42,915][09423] Updated weights for policy 0, policy_version 240487 (0.0031) [2024-06-28 14:05:42,921][09190] Fps is (10 sec: 45875.8, 60 sec: 42598.5, 300 sec: 42654.0). Total num frames: 3940139008. Throughput: 0: 42546.4. Samples: 218950220. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2024-06-28 14:05:42,922][09190] Avg episode reward: [(0, '0.759')] [2024-06-28 14:05:47,397][09423] Updated weights for policy 0, policy_version 240497 (0.0044) [2024-06-28 14:05:47,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42325.4, 300 sec: 42431.8). Total num frames: 3940319232. Throughput: 0: 42616.0. Samples: 219207220. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2024-06-28 14:05:47,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:05:50,478][09423] Updated weights for policy 0, policy_version 240507 (0.0048) [2024-06-28 14:05:52,921][09190] Fps is (10 sec: 40959.3, 60 sec: 42871.4, 300 sec: 42431.8). Total num frames: 3940548608. Throughput: 0: 42136.3. Samples: 219448240. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2024-06-28 14:05:52,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:05:55,279][09423] Updated weights for policy 0, policy_version 240517 (0.0035) [2024-06-28 14:05:57,921][09190] Fps is (10 sec: 44236.3, 60 sec: 42325.3, 300 sec: 42542.8). Total num frames: 3940761600. Throughput: 0: 42296.7. Samples: 219579060. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2024-06-28 14:05:57,924][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:05:58,738][09423] Updated weights for policy 0, policy_version 240527 (0.0043) [2024-06-28 14:06:02,921][09190] Fps is (10 sec: 39321.9, 60 sec: 42052.3, 300 sec: 42320.7). Total num frames: 3940941824. Throughput: 0: 42362.2. Samples: 219837160. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2024-06-28 14:06:02,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 14:06:02,950][09423] Updated weights for policy 0, policy_version 240537 (0.0029) [2024-06-28 14:06:06,342][09423] Updated weights for policy 0, policy_version 240547 (0.0044) [2024-06-28 14:06:07,921][09190] Fps is (10 sec: 42598.7, 60 sec: 42871.4, 300 sec: 42431.8). Total num frames: 3941187584. Throughput: 0: 42171.6. Samples: 220083980. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2024-06-28 14:06:07,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 14:06:10,453][09423] Updated weights for policy 0, policy_version 240557 (0.0032) [2024-06-28 14:06:12,921][09190] Fps is (10 sec: 44237.1, 60 sec: 42325.3, 300 sec: 42487.3). Total num frames: 3941384192. Throughput: 0: 42488.0. Samples: 220217800. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2024-06-28 14:06:12,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 14:06:13,937][09423] Updated weights for policy 0, policy_version 240567 (0.0043) [2024-06-28 14:06:17,921][09190] Fps is (10 sec: 40959.7, 60 sec: 42598.3, 300 sec: 42376.6). Total num frames: 3941597184. Throughput: 0: 42385.3. Samples: 220474620. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2024-06-28 14:06:17,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 14:06:17,934][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000240576_3941597184.pth... [2024-06-28 14:06:17,984][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000239955_3931422720.pth [2024-06-28 14:06:18,391][09423] Updated weights for policy 0, policy_version 240577 (0.0036) [2024-06-28 14:06:21,695][09423] Updated weights for policy 0, policy_version 240587 (0.0024) [2024-06-28 14:06:22,922][09190] Fps is (10 sec: 44231.9, 60 sec: 42870.8, 300 sec: 42487.2). Total num frames: 3941826560. Throughput: 0: 42309.7. Samples: 220725160. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2024-06-28 14:06:22,923][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:06:25,992][09423] Updated weights for policy 0, policy_version 240597 (0.0036) [2024-06-28 14:06:27,921][09190] Fps is (10 sec: 42599.0, 60 sec: 42325.3, 300 sec: 42542.9). Total num frames: 3942023168. Throughput: 0: 42589.3. Samples: 220866740. Policy #0 lag: (min: 0.0, avg: 9.1, max: 19.0) [2024-06-28 14:06:27,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 14:06:29,007][09423] Updated weights for policy 0, policy_version 240607 (0.0037) [2024-06-28 14:06:32,921][09190] Fps is (10 sec: 39325.7, 60 sec: 42325.4, 300 sec: 42376.2). Total num frames: 3942219776. Throughput: 0: 42456.0. Samples: 221117740. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 14:06:32,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:06:33,769][09423] Updated weights for policy 0, policy_version 240617 (0.0043) [2024-06-28 14:06:36,826][09423] Updated weights for policy 0, policy_version 240627 (0.0037) [2024-06-28 14:06:37,921][09190] Fps is (10 sec: 45874.7, 60 sec: 43144.5, 300 sec: 42598.4). Total num frames: 3942481920. Throughput: 0: 42510.2. Samples: 221361200. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 14:06:37,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 14:06:41,391][09423] Updated weights for policy 0, policy_version 240637 (0.0037) [2024-06-28 14:06:42,921][09190] Fps is (10 sec: 44236.6, 60 sec: 42052.2, 300 sec: 42487.3). Total num frames: 3942662144. Throughput: 0: 42694.7. Samples: 221500320. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 14:06:42,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 14:06:44,931][09423] Updated weights for policy 0, policy_version 240647 (0.0048) [2024-06-28 14:06:47,921][09190] Fps is (10 sec: 37683.4, 60 sec: 42325.3, 300 sec: 42320.7). Total num frames: 3942858752. Throughput: 0: 42406.7. Samples: 221745460. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 14:06:47,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:06:48,847][09423] Updated weights for policy 0, policy_version 240657 (0.0037) [2024-06-28 14:06:51,295][09403] Signal inference workers to stop experience collection... (3050 times) [2024-06-28 14:06:51,295][09403] Signal inference workers to resume experience collection... (3050 times) [2024-06-28 14:06:51,348][09423] InferenceWorker_p0-w0: stopping experience collection (3050 times) [2024-06-28 14:06:51,348][09423] InferenceWorker_p0-w0: resuming experience collection (3050 times) [2024-06-28 14:06:52,511][09423] Updated weights for policy 0, policy_version 240667 (0.0031) [2024-06-28 14:06:52,921][09190] Fps is (10 sec: 45875.1, 60 sec: 42871.5, 300 sec: 42542.9). Total num frames: 3943120896. Throughput: 0: 42607.5. Samples: 222001320. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 14:06:52,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:06:56,588][09423] Updated weights for policy 0, policy_version 240677 (0.0044) [2024-06-28 14:06:57,921][09190] Fps is (10 sec: 42598.7, 60 sec: 42052.4, 300 sec: 42431.8). Total num frames: 3943284736. Throughput: 0: 42688.4. Samples: 222138780. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 14:06:57,922][09190] Avg episode reward: [(0, '0.756')] [2024-06-28 14:07:00,039][09423] Updated weights for policy 0, policy_version 240687 (0.0032) [2024-06-28 14:07:02,922][09190] Fps is (10 sec: 39321.0, 60 sec: 42871.3, 300 sec: 42376.3). Total num frames: 3943514112. Throughput: 0: 42510.1. Samples: 222387580. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 14:07:02,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 14:07:04,700][09423] Updated weights for policy 0, policy_version 240697 (0.0039) [2024-06-28 14:07:07,759][09423] Updated weights for policy 0, policy_version 240707 (0.0038) [2024-06-28 14:07:07,922][09190] Fps is (10 sec: 47512.6, 60 sec: 42871.4, 300 sec: 42598.4). Total num frames: 3943759872. Throughput: 0: 42605.3. Samples: 222642360. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 14:07:07,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 14:07:12,599][09423] Updated weights for policy 0, policy_version 240717 (0.0045) [2024-06-28 14:07:12,921][09190] Fps is (10 sec: 40960.5, 60 sec: 42325.2, 300 sec: 42376.2). Total num frames: 3943923712. Throughput: 0: 42371.0. Samples: 222773440. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 14:07:12,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:07:15,833][09423] Updated weights for policy 0, policy_version 240727 (0.0045) [2024-06-28 14:07:17,924][09190] Fps is (10 sec: 39312.5, 60 sec: 42596.7, 300 sec: 42431.4). Total num frames: 3944153088. Throughput: 0: 42180.3. Samples: 223015960. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 14:07:17,924][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 14:07:20,215][09423] Updated weights for policy 0, policy_version 240737 (0.0026) [2024-06-28 14:07:22,922][09190] Fps is (10 sec: 44236.5, 60 sec: 42326.0, 300 sec: 42487.3). Total num frames: 3944366080. Throughput: 0: 42521.7. Samples: 223274680. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 14:07:22,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:07:23,529][09423] Updated weights for policy 0, policy_version 240747 (0.0031) [2024-06-28 14:07:27,921][09190] Fps is (10 sec: 39331.0, 60 sec: 42052.2, 300 sec: 42265.1). Total num frames: 3944546304. Throughput: 0: 42196.8. Samples: 223399180. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 14:07:27,922][09190] Avg episode reward: [(0, '0.759')] [2024-06-28 14:07:28,143][09423] Updated weights for policy 0, policy_version 240757 (0.0032) [2024-06-28 14:07:31,381][09423] Updated weights for policy 0, policy_version 240767 (0.0037) [2024-06-28 14:07:32,921][09190] Fps is (10 sec: 42599.3, 60 sec: 42871.5, 300 sec: 42431.8). Total num frames: 3944792064. Throughput: 0: 42415.6. Samples: 223654160. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 14:07:32,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:07:35,799][09423] Updated weights for policy 0, policy_version 240777 (0.0033) [2024-06-28 14:07:37,921][09190] Fps is (10 sec: 45875.9, 60 sec: 42052.3, 300 sec: 42487.3). Total num frames: 3945005056. Throughput: 0: 42525.0. Samples: 223914940. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 14:07:37,922][09190] Avg episode reward: [(0, '0.759')] [2024-06-28 14:07:38,848][09423] Updated weights for policy 0, policy_version 240787 (0.0040) [2024-06-28 14:07:42,924][09190] Fps is (10 sec: 40949.5, 60 sec: 42323.6, 300 sec: 42320.3). Total num frames: 3945201664. Throughput: 0: 42223.8. Samples: 224038960. Policy #0 lag: (min: 1.0, avg: 10.9, max: 21.0) [2024-06-28 14:07:42,924][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 14:07:43,377][09423] Updated weights for policy 0, policy_version 240797 (0.0038) [2024-06-28 14:07:46,386][09423] Updated weights for policy 0, policy_version 240807 (0.0030) [2024-06-28 14:07:47,928][09190] Fps is (10 sec: 42570.4, 60 sec: 42866.8, 300 sec: 42375.7). Total num frames: 3945431040. Throughput: 0: 42371.8. Samples: 224294580. Policy #0 lag: (min: 1.0, avg: 10.9, max: 21.0) [2024-06-28 14:07:47,929][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:07:51,395][09423] Updated weights for policy 0, policy_version 240817 (0.0042) [2024-06-28 14:07:52,922][09190] Fps is (10 sec: 44247.3, 60 sec: 42052.2, 300 sec: 42542.8). Total num frames: 3945644032. Throughput: 0: 42413.8. Samples: 224550980. Policy #0 lag: (min: 1.0, avg: 10.9, max: 21.0) [2024-06-28 14:07:52,922][09190] Avg episode reward: [(0, '0.759')] [2024-06-28 14:07:54,153][09423] Updated weights for policy 0, policy_version 240827 (0.0033) [2024-06-28 14:07:57,922][09190] Fps is (10 sec: 37707.2, 60 sec: 42052.1, 300 sec: 42265.2). Total num frames: 3945807872. Throughput: 0: 42253.7. Samples: 224674860. Policy #0 lag: (min: 1.0, avg: 10.9, max: 21.0) [2024-06-28 14:07:57,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:07:58,808][09423] Updated weights for policy 0, policy_version 240837 (0.0042) [2024-06-28 14:08:01,798][09423] Updated weights for policy 0, policy_version 240847 (0.0042) [2024-06-28 14:08:02,921][09190] Fps is (10 sec: 42599.0, 60 sec: 42598.6, 300 sec: 42487.3). Total num frames: 3946070016. Throughput: 0: 42423.7. Samples: 224924920. Policy #0 lag: (min: 1.0, avg: 10.9, max: 21.0) [2024-06-28 14:08:02,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 14:08:06,311][09423] Updated weights for policy 0, policy_version 240857 (0.0038) [2024-06-28 14:08:07,921][09190] Fps is (10 sec: 44237.0, 60 sec: 41506.2, 300 sec: 42431.8). Total num frames: 3946250240. Throughput: 0: 42572.0. Samples: 225190420. Policy #0 lag: (min: 1.0, avg: 10.9, max: 21.0) [2024-06-28 14:08:07,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:08:09,590][09423] Updated weights for policy 0, policy_version 240867 (0.0034) [2024-06-28 14:08:12,921][09190] Fps is (10 sec: 40960.1, 60 sec: 42598.5, 300 sec: 42431.8). Total num frames: 3946479616. Throughput: 0: 42401.9. Samples: 225307260. Policy #0 lag: (min: 1.0, avg: 10.9, max: 21.0) [2024-06-28 14:08:12,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 14:08:14,170][09423] Updated weights for policy 0, policy_version 240877 (0.0035) [2024-06-28 14:08:14,928][09403] Signal inference workers to stop experience collection... (3100 times) [2024-06-28 14:08:14,978][09423] InferenceWorker_p0-w0: stopping experience collection (3100 times) [2024-06-28 14:08:14,985][09403] Signal inference workers to resume experience collection... (3100 times) [2024-06-28 14:08:14,996][09423] InferenceWorker_p0-w0: resuming experience collection (3100 times) [2024-06-28 14:08:17,438][09423] Updated weights for policy 0, policy_version 240887 (0.0034) [2024-06-28 14:08:17,921][09190] Fps is (10 sec: 45875.5, 60 sec: 42600.1, 300 sec: 42542.8). Total num frames: 3946708992. Throughput: 0: 42572.3. Samples: 225569920. Policy #0 lag: (min: 1.0, avg: 10.9, max: 21.0) [2024-06-28 14:08:17,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:08:18,042][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000240889_3946725376.pth... [2024-06-28 14:08:18,086][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000240266_3936518144.pth [2024-06-28 14:08:21,690][09423] Updated weights for policy 0, policy_version 240897 (0.0038) [2024-06-28 14:08:22,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42052.4, 300 sec: 42431.8). Total num frames: 3946889216. Throughput: 0: 42653.3. Samples: 225834340. Policy #0 lag: (min: 1.0, avg: 10.9, max: 21.0) [2024-06-28 14:08:22,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 14:08:24,995][09423] Updated weights for policy 0, policy_version 240907 (0.0029) [2024-06-28 14:08:27,924][09190] Fps is (10 sec: 42588.1, 60 sec: 43142.8, 300 sec: 42487.0). Total num frames: 3947134976. Throughput: 0: 42471.6. Samples: 225950180. Policy #0 lag: (min: 1.0, avg: 10.9, max: 21.0) [2024-06-28 14:08:27,924][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 14:08:29,661][09423] Updated weights for policy 0, policy_version 240917 (0.0032) [2024-06-28 14:08:32,589][09423] Updated weights for policy 0, policy_version 240927 (0.0039) [2024-06-28 14:08:32,921][09190] Fps is (10 sec: 45875.0, 60 sec: 42598.4, 300 sec: 42542.9). Total num frames: 3947347968. Throughput: 0: 42524.4. Samples: 226207900. Policy #0 lag: (min: 1.0, avg: 10.9, max: 21.0) [2024-06-28 14:08:32,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 14:08:37,418][09423] Updated weights for policy 0, policy_version 240937 (0.0048) [2024-06-28 14:08:37,922][09190] Fps is (10 sec: 40969.5, 60 sec: 42325.2, 300 sec: 42487.3). Total num frames: 3947544576. Throughput: 0: 42658.6. Samples: 226470620. Policy #0 lag: (min: 1.0, avg: 10.9, max: 21.0) [2024-06-28 14:08:37,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:08:40,189][09423] Updated weights for policy 0, policy_version 240947 (0.0031) [2024-06-28 14:08:42,921][09190] Fps is (10 sec: 40959.6, 60 sec: 42600.1, 300 sec: 42431.8). Total num frames: 3947757568. Throughput: 0: 42454.3. Samples: 226585300. Policy #0 lag: (min: 1.0, avg: 10.9, max: 21.0) [2024-06-28 14:08:42,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 14:08:45,065][09423] Updated weights for policy 0, policy_version 240957 (0.0029) [2024-06-28 14:08:47,921][09190] Fps is (10 sec: 44237.9, 60 sec: 42603.1, 300 sec: 42542.9). Total num frames: 3947986944. Throughput: 0: 42704.1. Samples: 226846600. Policy #0 lag: (min: 1.0, avg: 10.5, max: 21.0) [2024-06-28 14:08:47,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 14:08:48,041][09423] Updated weights for policy 0, policy_version 240967 (0.0047) [2024-06-28 14:08:52,579][09423] Updated weights for policy 0, policy_version 240977 (0.0035) [2024-06-28 14:08:52,921][09190] Fps is (10 sec: 40960.5, 60 sec: 42052.4, 300 sec: 42431.8). Total num frames: 3948167168. Throughput: 0: 42394.4. Samples: 227098160. Policy #0 lag: (min: 1.0, avg: 10.5, max: 21.0) [2024-06-28 14:08:52,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 14:08:55,906][09423] Updated weights for policy 0, policy_version 240987 (0.0033) [2024-06-28 14:08:57,921][09190] Fps is (10 sec: 42597.7, 60 sec: 43417.7, 300 sec: 42487.3). Total num frames: 3948412928. Throughput: 0: 42615.0. Samples: 227224940. Policy #0 lag: (min: 1.0, avg: 10.5, max: 21.0) [2024-06-28 14:08:57,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 14:09:00,069][09423] Updated weights for policy 0, policy_version 240997 (0.0032) [2024-06-28 14:09:02,921][09190] Fps is (10 sec: 44237.0, 60 sec: 42325.4, 300 sec: 42542.9). Total num frames: 3948609536. Throughput: 0: 42655.2. Samples: 227489400. Policy #0 lag: (min: 1.0, avg: 10.5, max: 21.0) [2024-06-28 14:09:02,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:09:03,688][09423] Updated weights for policy 0, policy_version 241007 (0.0041) [2024-06-28 14:09:07,922][09190] Fps is (10 sec: 39321.1, 60 sec: 42598.3, 300 sec: 42431.8). Total num frames: 3948806144. Throughput: 0: 42640.7. Samples: 227753180. Policy #0 lag: (min: 1.0, avg: 10.5, max: 21.0) [2024-06-28 14:09:07,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 14:09:08,098][09423] Updated weights for policy 0, policy_version 241017 (0.0037) [2024-06-28 14:09:11,104][09423] Updated weights for policy 0, policy_version 241027 (0.0035) [2024-06-28 14:09:12,924][09190] Fps is (10 sec: 44225.5, 60 sec: 42869.6, 300 sec: 42542.5). Total num frames: 3949051904. Throughput: 0: 42652.0. Samples: 227869520. Policy #0 lag: (min: 1.0, avg: 10.5, max: 21.0) [2024-06-28 14:09:12,924][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:09:15,817][09423] Updated weights for policy 0, policy_version 241037 (0.0032) [2024-06-28 14:09:17,921][09190] Fps is (10 sec: 44237.8, 60 sec: 42325.4, 300 sec: 42487.3). Total num frames: 3949248512. Throughput: 0: 42733.4. Samples: 228130900. Policy #0 lag: (min: 1.0, avg: 10.5, max: 21.0) [2024-06-28 14:09:17,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 14:09:18,459][09423] Updated weights for policy 0, policy_version 241047 (0.0028) [2024-06-28 14:09:22,921][09190] Fps is (10 sec: 39331.5, 60 sec: 42598.4, 300 sec: 42376.3). Total num frames: 3949445120. Throughput: 0: 42579.3. Samples: 228386680. Policy #0 lag: (min: 1.0, avg: 10.5, max: 21.0) [2024-06-28 14:09:22,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 14:09:23,259][09423] Updated weights for policy 0, policy_version 241057 (0.0034) [2024-06-28 14:09:26,219][09423] Updated weights for policy 0, policy_version 241067 (0.0035) [2024-06-28 14:09:27,921][09190] Fps is (10 sec: 44236.3, 60 sec: 42600.1, 300 sec: 42542.9). Total num frames: 3949690880. Throughput: 0: 42716.9. Samples: 228507560. Policy #0 lag: (min: 1.0, avg: 10.5, max: 21.0) [2024-06-28 14:09:27,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 14:09:31,079][09423] Updated weights for policy 0, policy_version 241077 (0.0037) [2024-06-28 14:09:32,922][09190] Fps is (10 sec: 45874.6, 60 sec: 42598.3, 300 sec: 42598.4). Total num frames: 3949903872. Throughput: 0: 42657.2. Samples: 228766180. Policy #0 lag: (min: 1.0, avg: 10.5, max: 21.0) [2024-06-28 14:09:32,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 14:09:34,243][09423] Updated weights for policy 0, policy_version 241087 (0.0040) [2024-06-28 14:09:37,924][09190] Fps is (10 sec: 40949.8, 60 sec: 42596.7, 300 sec: 42431.4). Total num frames: 3950100480. Throughput: 0: 42719.3. Samples: 229020640. Policy #0 lag: (min: 1.0, avg: 10.5, max: 21.0) [2024-06-28 14:09:37,925][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 14:09:38,328][09403] Signal inference workers to stop experience collection... (3150 times) [2024-06-28 14:09:38,328][09403] Signal inference workers to resume experience collection... (3150 times) [2024-06-28 14:09:38,351][09423] InferenceWorker_p0-w0: stopping experience collection (3150 times) [2024-06-28 14:09:38,351][09423] InferenceWorker_p0-w0: resuming experience collection (3150 times) [2024-06-28 14:09:38,479][09423] Updated weights for policy 0, policy_version 241097 (0.0037) [2024-06-28 14:09:41,712][09423] Updated weights for policy 0, policy_version 241107 (0.0025) [2024-06-28 14:09:42,921][09190] Fps is (10 sec: 42599.2, 60 sec: 42871.6, 300 sec: 42542.9). Total num frames: 3950329856. Throughput: 0: 42694.8. Samples: 229146200. Policy #0 lag: (min: 1.0, avg: 10.5, max: 21.0) [2024-06-28 14:09:42,922][09190] Avg episode reward: [(0, '0.756')] [2024-06-28 14:09:46,435][09423] Updated weights for policy 0, policy_version 241117 (0.0033) [2024-06-28 14:09:47,921][09190] Fps is (10 sec: 44248.0, 60 sec: 42598.3, 300 sec: 42598.4). Total num frames: 3950542848. Throughput: 0: 42624.8. Samples: 229407520. Policy #0 lag: (min: 1.0, avg: 10.5, max: 21.0) [2024-06-28 14:09:47,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:09:49,570][09423] Updated weights for policy 0, policy_version 241127 (0.0033) [2024-06-28 14:09:52,924][09190] Fps is (10 sec: 39311.4, 60 sec: 42596.6, 300 sec: 42375.9). Total num frames: 3950723072. Throughput: 0: 42473.8. Samples: 229664600. Policy #0 lag: (min: 1.0, avg: 10.5, max: 21.0) [2024-06-28 14:09:52,924][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 14:09:54,026][09423] Updated weights for policy 0, policy_version 241137 (0.0029) [2024-06-28 14:09:57,070][09423] Updated weights for policy 0, policy_version 241147 (0.0035) [2024-06-28 14:09:57,921][09190] Fps is (10 sec: 44236.5, 60 sec: 42871.5, 300 sec: 42598.4). Total num frames: 3950985216. Throughput: 0: 42602.3. Samples: 229786520. Policy #0 lag: (min: 0.0, avg: 12.1, max: 21.0) [2024-06-28 14:09:57,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 14:10:01,947][09423] Updated weights for policy 0, policy_version 241157 (0.0032) [2024-06-28 14:10:02,922][09190] Fps is (10 sec: 44247.3, 60 sec: 42598.3, 300 sec: 42542.8). Total num frames: 3951165440. Throughput: 0: 42523.4. Samples: 230044460. Policy #0 lag: (min: 0.0, avg: 12.1, max: 21.0) [2024-06-28 14:10:02,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:10:04,903][09423] Updated weights for policy 0, policy_version 241167 (0.0048) [2024-06-28 14:10:07,921][09190] Fps is (10 sec: 37683.6, 60 sec: 42598.5, 300 sec: 42431.8). Total num frames: 3951362048. Throughput: 0: 42430.2. Samples: 230296040. Policy #0 lag: (min: 0.0, avg: 12.1, max: 21.0) [2024-06-28 14:10:07,930][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:10:09,758][09423] Updated weights for policy 0, policy_version 241177 (0.0033) [2024-06-28 14:10:12,657][09423] Updated weights for policy 0, policy_version 241187 (0.0035) [2024-06-28 14:10:12,921][09190] Fps is (10 sec: 45875.9, 60 sec: 42873.3, 300 sec: 42653.9). Total num frames: 3951624192. Throughput: 0: 42519.2. Samples: 230420920. Policy #0 lag: (min: 0.0, avg: 12.1, max: 21.0) [2024-06-28 14:10:12,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 14:10:17,294][09423] Updated weights for policy 0, policy_version 241197 (0.0047) [2024-06-28 14:10:17,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42325.3, 300 sec: 42487.3). Total num frames: 3951788032. Throughput: 0: 42564.5. Samples: 230681580. Policy #0 lag: (min: 0.0, avg: 12.1, max: 21.0) [2024-06-28 14:10:17,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 14:10:17,930][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000241198_3951788032.pth... [2024-06-28 14:10:17,990][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000240576_3941597184.pth [2024-06-28 14:10:20,302][09423] Updated weights for policy 0, policy_version 241207 (0.0038) [2024-06-28 14:10:22,922][09190] Fps is (10 sec: 37682.8, 60 sec: 42598.3, 300 sec: 42431.8). Total num frames: 3952001024. Throughput: 0: 42468.6. Samples: 230931620. Policy #0 lag: (min: 0.0, avg: 12.1, max: 21.0) [2024-06-28 14:10:22,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:10:24,879][09423] Updated weights for policy 0, policy_version 241217 (0.0032) [2024-06-28 14:10:27,875][09423] Updated weights for policy 0, policy_version 241227 (0.0035) [2024-06-28 14:10:27,921][09190] Fps is (10 sec: 47513.2, 60 sec: 42871.5, 300 sec: 42653.9). Total num frames: 3952263168. Throughput: 0: 42607.0. Samples: 231063520. Policy #0 lag: (min: 0.0, avg: 12.1, max: 21.0) [2024-06-28 14:10:27,933][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:10:32,921][09190] Fps is (10 sec: 42598.7, 60 sec: 42052.3, 300 sec: 42487.3). Total num frames: 3952427008. Throughput: 0: 42553.8. Samples: 231322440. Policy #0 lag: (min: 0.0, avg: 12.1, max: 21.0) [2024-06-28 14:10:32,922][09423] Updated weights for policy 0, policy_version 241237 (0.0035) [2024-06-28 14:10:32,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:10:35,658][09423] Updated weights for policy 0, policy_version 241247 (0.0031) [2024-06-28 14:10:37,921][09190] Fps is (10 sec: 39322.0, 60 sec: 42600.2, 300 sec: 42431.8). Total num frames: 3952656384. Throughput: 0: 42317.0. Samples: 231568760. Policy #0 lag: (min: 0.0, avg: 12.1, max: 21.0) [2024-06-28 14:10:37,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:10:40,643][09423] Updated weights for policy 0, policy_version 241257 (0.0047) [2024-06-28 14:10:42,921][09190] Fps is (10 sec: 45875.1, 60 sec: 42598.3, 300 sec: 42598.4). Total num frames: 3952885760. Throughput: 0: 42503.6. Samples: 231699180. Policy #0 lag: (min: 0.0, avg: 12.1, max: 21.0) [2024-06-28 14:10:42,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:10:43,560][09423] Updated weights for policy 0, policy_version 241267 (0.0031) [2024-06-28 14:10:47,921][09190] Fps is (10 sec: 39321.6, 60 sec: 41779.2, 300 sec: 42376.3). Total num frames: 3953049600. Throughput: 0: 42344.1. Samples: 231949940. Policy #0 lag: (min: 0.0, avg: 12.1, max: 21.0) [2024-06-28 14:10:47,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 14:10:48,542][09423] Updated weights for policy 0, policy_version 241277 (0.0047) [2024-06-28 14:10:51,481][09423] Updated weights for policy 0, policy_version 241287 (0.0040) [2024-06-28 14:10:52,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42873.2, 300 sec: 42487.3). Total num frames: 3953295360. Throughput: 0: 42314.6. Samples: 232200200. Policy #0 lag: (min: 0.0, avg: 12.1, max: 21.0) [2024-06-28 14:10:52,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:10:56,155][09423] Updated weights for policy 0, policy_version 241297 (0.0034) [2024-06-28 14:10:57,228][09403] Signal inference workers to stop experience collection... (3200 times) [2024-06-28 14:10:57,228][09403] Signal inference workers to resume experience collection... (3200 times) [2024-06-28 14:10:57,284][09423] InferenceWorker_p0-w0: stopping experience collection (3200 times) [2024-06-28 14:10:57,284][09423] InferenceWorker_p0-w0: resuming experience collection (3200 times) [2024-06-28 14:10:57,921][09190] Fps is (10 sec: 45875.3, 60 sec: 42052.4, 300 sec: 42598.4). Total num frames: 3953508352. Throughput: 0: 42569.4. Samples: 232336540. Policy #0 lag: (min: 0.0, avg: 12.1, max: 21.0) [2024-06-28 14:10:57,922][09190] Avg episode reward: [(0, '0.756')] [2024-06-28 14:10:59,329][09423] Updated weights for policy 0, policy_version 241307 (0.0037) [2024-06-28 14:11:02,921][09190] Fps is (10 sec: 39321.4, 60 sec: 42052.3, 300 sec: 42376.2). Total num frames: 3953688576. Throughput: 0: 42243.9. Samples: 232582560. Policy #0 lag: (min: 0.0, avg: 12.1, max: 21.0) [2024-06-28 14:11:02,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 14:11:04,069][09423] Updated weights for policy 0, policy_version 241317 (0.0042) [2024-06-28 14:11:06,738][09423] Updated weights for policy 0, policy_version 241327 (0.0029) [2024-06-28 14:11:07,921][09190] Fps is (10 sec: 44236.5, 60 sec: 43144.5, 300 sec: 42598.4). Total num frames: 3953950720. Throughput: 0: 42415.2. Samples: 232840300. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-28 14:11:07,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:11:11,659][09423] Updated weights for policy 0, policy_version 241337 (0.0036) [2024-06-28 14:11:12,922][09190] Fps is (10 sec: 45875.0, 60 sec: 42052.2, 300 sec: 42542.9). Total num frames: 3954147328. Throughput: 0: 42602.6. Samples: 232980640. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-28 14:11:12,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:11:14,307][09423] Updated weights for policy 0, policy_version 241347 (0.0041) [2024-06-28 14:11:17,921][09190] Fps is (10 sec: 37683.1, 60 sec: 42325.3, 300 sec: 42376.4). Total num frames: 3954327552. Throughput: 0: 42306.2. Samples: 233226220. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-28 14:11:17,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:11:19,393][09423] Updated weights for policy 0, policy_version 241357 (0.0028) [2024-06-28 14:11:22,106][09423] Updated weights for policy 0, policy_version 241367 (0.0034) [2024-06-28 14:11:22,921][09190] Fps is (10 sec: 44237.4, 60 sec: 43144.6, 300 sec: 42598.4). Total num frames: 3954589696. Throughput: 0: 42248.9. Samples: 233469960. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-28 14:11:22,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 14:11:26,858][09423] Updated weights for policy 0, policy_version 241377 (0.0028) [2024-06-28 14:11:27,921][09190] Fps is (10 sec: 45875.6, 60 sec: 42052.4, 300 sec: 42598.4). Total num frames: 3954786304. Throughput: 0: 42496.5. Samples: 233611520. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-28 14:11:27,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 14:11:29,782][09423] Updated weights for policy 0, policy_version 241387 (0.0045) [2024-06-28 14:11:32,921][09190] Fps is (10 sec: 39321.2, 60 sec: 42598.3, 300 sec: 42376.2). Total num frames: 3954982912. Throughput: 0: 42435.9. Samples: 233859560. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-28 14:11:32,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 14:11:34,773][09423] Updated weights for policy 0, policy_version 241397 (0.0038) [2024-06-28 14:11:37,928][09190] Fps is (10 sec: 40933.0, 60 sec: 42320.7, 300 sec: 42486.4). Total num frames: 3955195904. Throughput: 0: 42694.3. Samples: 234121720. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-28 14:11:37,928][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 14:11:37,978][09423] Updated weights for policy 0, policy_version 241407 (0.0031) [2024-06-28 14:11:42,499][09423] Updated weights for policy 0, policy_version 241417 (0.0031) [2024-06-28 14:11:42,922][09190] Fps is (10 sec: 42598.1, 60 sec: 42052.2, 300 sec: 42542.8). Total num frames: 3955408896. Throughput: 0: 42470.0. Samples: 234247700. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-28 14:11:42,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:11:45,569][09423] Updated weights for policy 0, policy_version 241427 (0.0033) [2024-06-28 14:11:47,921][09190] Fps is (10 sec: 44265.9, 60 sec: 43144.5, 300 sec: 42431.8). Total num frames: 3955638272. Throughput: 0: 42494.3. Samples: 234494800. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-28 14:11:47,922][09190] Avg episode reward: [(0, '0.756')] [2024-06-28 14:11:49,905][09423] Updated weights for policy 0, policy_version 241437 (0.0041) [2024-06-28 14:11:52,921][09190] Fps is (10 sec: 44237.8, 60 sec: 42598.5, 300 sec: 42598.4). Total num frames: 3955851264. Throughput: 0: 42583.2. Samples: 234756540. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-28 14:11:52,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:11:53,160][09423] Updated weights for policy 0, policy_version 241447 (0.0040) [2024-06-28 14:11:57,672][09423] Updated weights for policy 0, policy_version 241457 (0.0032) [2024-06-28 14:11:57,928][09190] Fps is (10 sec: 39295.9, 60 sec: 42047.7, 300 sec: 42430.9). Total num frames: 3956031488. Throughput: 0: 42454.4. Samples: 234891360. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-28 14:11:57,929][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:12:00,115][09403] Signal inference workers to stop experience collection... (3250 times) [2024-06-28 14:12:00,116][09403] Signal inference workers to resume experience collection... (3250 times) [2024-06-28 14:12:00,143][09423] InferenceWorker_p0-w0: stopping experience collection (3250 times) [2024-06-28 14:12:00,143][09423] InferenceWorker_p0-w0: resuming experience collection (3250 times) [2024-06-28 14:12:00,554][09423] Updated weights for policy 0, policy_version 241467 (0.0043) [2024-06-28 14:12:02,921][09190] Fps is (10 sec: 40959.7, 60 sec: 42871.5, 300 sec: 42376.3). Total num frames: 3956260864. Throughput: 0: 42467.6. Samples: 235137260. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-28 14:12:02,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 14:12:05,283][09423] Updated weights for policy 0, policy_version 241477 (0.0022) [2024-06-28 14:12:07,921][09190] Fps is (10 sec: 45905.3, 60 sec: 42325.4, 300 sec: 42598.4). Total num frames: 3956490240. Throughput: 0: 42774.2. Samples: 235394800. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-28 14:12:07,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 14:12:08,393][09423] Updated weights for policy 0, policy_version 241487 (0.0033) [2024-06-28 14:12:12,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42052.3, 300 sec: 42432.1). Total num frames: 3956670464. Throughput: 0: 42640.8. Samples: 235530360. Policy #0 lag: (min: 0.0, avg: 10.5, max: 20.0) [2024-06-28 14:12:12,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 14:12:12,954][09423] Updated weights for policy 0, policy_version 241497 (0.0029) [2024-06-28 14:12:16,649][09423] Updated weights for policy 0, policy_version 241507 (0.0039) [2024-06-28 14:12:17,921][09190] Fps is (10 sec: 42598.3, 60 sec: 43144.6, 300 sec: 42542.9). Total num frames: 3956916224. Throughput: 0: 42834.7. Samples: 235787120. Policy #0 lag: (min: 1.0, avg: 11.7, max: 22.0) [2024-06-28 14:12:17,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 14:12:17,933][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000241511_3956916224.pth... [2024-06-28 14:12:17,989][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000240889_3946725376.pth [2024-06-28 14:12:20,631][09423] Updated weights for policy 0, policy_version 241517 (0.0040) [2024-06-28 14:12:22,921][09190] Fps is (10 sec: 45875.5, 60 sec: 42325.3, 300 sec: 42654.0). Total num frames: 3957129216. Throughput: 0: 42533.3. Samples: 236035440. Policy #0 lag: (min: 1.0, avg: 11.7, max: 22.0) [2024-06-28 14:12:22,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:12:24,305][09423] Updated weights for policy 0, policy_version 241527 (0.0041) [2024-06-28 14:12:27,921][09190] Fps is (10 sec: 40959.8, 60 sec: 42325.3, 300 sec: 42487.3). Total num frames: 3957325824. Throughput: 0: 42558.8. Samples: 236162840. Policy #0 lag: (min: 1.0, avg: 11.7, max: 22.0) [2024-06-28 14:12:27,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 14:12:28,328][09423] Updated weights for policy 0, policy_version 241537 (0.0048) [2024-06-28 14:12:31,651][09423] Updated weights for policy 0, policy_version 241547 (0.0044) [2024-06-28 14:12:32,921][09190] Fps is (10 sec: 42597.8, 60 sec: 42871.5, 300 sec: 42542.8). Total num frames: 3957555200. Throughput: 0: 42817.6. Samples: 236421600. Policy #0 lag: (min: 1.0, avg: 11.7, max: 22.0) [2024-06-28 14:12:32,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 14:12:36,030][09423] Updated weights for policy 0, policy_version 241557 (0.0026) [2024-06-28 14:12:37,921][09190] Fps is (10 sec: 45875.4, 60 sec: 43149.2, 300 sec: 42654.3). Total num frames: 3957784576. Throughput: 0: 42591.9. Samples: 236673180. Policy #0 lag: (min: 1.0, avg: 11.7, max: 22.0) [2024-06-28 14:12:37,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:12:39,120][09423] Updated weights for policy 0, policy_version 241567 (0.0030) [2024-06-28 14:12:42,924][09190] Fps is (10 sec: 39312.4, 60 sec: 42323.7, 300 sec: 42432.4). Total num frames: 3957948416. Throughput: 0: 42584.7. Samples: 236807500. Policy #0 lag: (min: 1.0, avg: 11.7, max: 22.0) [2024-06-28 14:12:42,924][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 14:12:43,671][09423] Updated weights for policy 0, policy_version 241577 (0.0024) [2024-06-28 14:12:47,092][09423] Updated weights for policy 0, policy_version 241587 (0.0037) [2024-06-28 14:12:47,921][09190] Fps is (10 sec: 40959.7, 60 sec: 42598.3, 300 sec: 42542.9). Total num frames: 3958194176. Throughput: 0: 42695.5. Samples: 237058560. Policy #0 lag: (min: 1.0, avg: 11.7, max: 22.0) [2024-06-28 14:12:47,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 14:12:51,523][09423] Updated weights for policy 0, policy_version 241597 (0.0033) [2024-06-28 14:12:52,921][09190] Fps is (10 sec: 45886.5, 60 sec: 42598.4, 300 sec: 42709.5). Total num frames: 3958407168. Throughput: 0: 42576.9. Samples: 237310760. Policy #0 lag: (min: 1.0, avg: 11.7, max: 22.0) [2024-06-28 14:12:52,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:12:55,117][09423] Updated weights for policy 0, policy_version 241607 (0.0036) [2024-06-28 14:12:57,924][09190] Fps is (10 sec: 40950.1, 60 sec: 42874.4, 300 sec: 42487.0). Total num frames: 3958603776. Throughput: 0: 42463.5. Samples: 237441320. Policy #0 lag: (min: 1.0, avg: 11.7, max: 22.0) [2024-06-28 14:12:57,924][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:12:59,023][09423] Updated weights for policy 0, policy_version 241617 (0.0036) [2024-06-28 14:13:02,774][09423] Updated weights for policy 0, policy_version 241627 (0.0037) [2024-06-28 14:13:02,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42871.5, 300 sec: 42654.0). Total num frames: 3958833152. Throughput: 0: 42517.8. Samples: 237700420. Policy #0 lag: (min: 1.0, avg: 11.7, max: 22.0) [2024-06-28 14:13:02,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:13:06,376][09423] Updated weights for policy 0, policy_version 241637 (0.0037) [2024-06-28 14:13:07,921][09190] Fps is (10 sec: 45886.6, 60 sec: 42871.5, 300 sec: 42653.9). Total num frames: 3959062528. Throughput: 0: 42757.7. Samples: 237959540. Policy #0 lag: (min: 1.0, avg: 11.7, max: 22.0) [2024-06-28 14:13:07,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 14:13:10,100][09423] Updated weights for policy 0, policy_version 241647 (0.0033) [2024-06-28 14:13:12,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42871.5, 300 sec: 42487.3). Total num frames: 3959242752. Throughput: 0: 42876.0. Samples: 238092260. Policy #0 lag: (min: 1.0, avg: 11.7, max: 22.0) [2024-06-28 14:13:12,924][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 14:13:14,419][09423] Updated weights for policy 0, policy_version 241657 (0.0028) [2024-06-28 14:13:17,543][09423] Updated weights for policy 0, policy_version 241667 (0.0028) [2024-06-28 14:13:17,921][09190] Fps is (10 sec: 40959.7, 60 sec: 42598.4, 300 sec: 42653.9). Total num frames: 3959472128. Throughput: 0: 42740.5. Samples: 238344920. Policy #0 lag: (min: 1.0, avg: 11.7, max: 22.0) [2024-06-28 14:13:17,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 14:13:19,203][09403] Signal inference workers to stop experience collection... (3300 times) [2024-06-28 14:13:19,256][09403] Signal inference workers to resume experience collection... (3300 times) [2024-06-28 14:13:19,257][09423] InferenceWorker_p0-w0: stopping experience collection (3300 times) [2024-06-28 14:13:19,281][09423] InferenceWorker_p0-w0: resuming experience collection (3300 times) [2024-06-28 14:13:22,091][09423] Updated weights for policy 0, policy_version 241677 (0.0024) [2024-06-28 14:13:22,921][09190] Fps is (10 sec: 44236.9, 60 sec: 42598.4, 300 sec: 42543.2). Total num frames: 3959685120. Throughput: 0: 42758.2. Samples: 238597300. Policy #0 lag: (min: 1.0, avg: 11.7, max: 22.0) [2024-06-28 14:13:22,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:13:25,651][09423] Updated weights for policy 0, policy_version 241687 (0.0040) [2024-06-28 14:13:27,921][09190] Fps is (10 sec: 40960.5, 60 sec: 42598.5, 300 sec: 42487.3). Total num frames: 3959881728. Throughput: 0: 42585.9. Samples: 238723760. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 14:13:27,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 14:13:29,747][09423] Updated weights for policy 0, policy_version 241697 (0.0032) [2024-06-28 14:13:32,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42325.4, 300 sec: 42542.9). Total num frames: 3960094720. Throughput: 0: 42581.4. Samples: 238974720. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 14:13:32,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:13:33,467][09423] Updated weights for policy 0, policy_version 241707 (0.0028) [2024-06-28 14:13:37,472][09423] Updated weights for policy 0, policy_version 241717 (0.0034) [2024-06-28 14:13:37,921][09190] Fps is (10 sec: 44236.6, 60 sec: 42325.3, 300 sec: 42598.4). Total num frames: 3960324096. Throughput: 0: 42743.6. Samples: 239234220. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 14:13:37,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:13:41,177][09423] Updated weights for policy 0, policy_version 241727 (0.0041) [2024-06-28 14:13:42,921][09190] Fps is (10 sec: 44237.3, 60 sec: 43146.4, 300 sec: 42542.9). Total num frames: 3960537088. Throughput: 0: 42666.4. Samples: 239361200. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 14:13:42,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:13:45,084][09423] Updated weights for policy 0, policy_version 241737 (0.0031) [2024-06-28 14:13:47,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42598.5, 300 sec: 42653.9). Total num frames: 3960750080. Throughput: 0: 42494.7. Samples: 239612680. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 14:13:47,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:13:48,701][09423] Updated weights for policy 0, policy_version 241747 (0.0033) [2024-06-28 14:13:52,921][09190] Fps is (10 sec: 37682.8, 60 sec: 41779.2, 300 sec: 42376.3). Total num frames: 3960913920. Throughput: 0: 42422.7. Samples: 239868560. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 14:13:52,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:13:53,072][09423] Updated weights for policy 0, policy_version 241757 (0.0028) [2024-06-28 14:13:56,697][09423] Updated weights for policy 0, policy_version 241767 (0.0035) [2024-06-28 14:13:57,921][09190] Fps is (10 sec: 42598.2, 60 sec: 42873.2, 300 sec: 42598.4). Total num frames: 3961176064. Throughput: 0: 42204.4. Samples: 239991460. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 14:13:57,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:14:00,452][09423] Updated weights for policy 0, policy_version 241777 (0.0036) [2024-06-28 14:14:02,921][09190] Fps is (10 sec: 45875.4, 60 sec: 42325.4, 300 sec: 42598.4). Total num frames: 3961372672. Throughput: 0: 42302.8. Samples: 240248540. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 14:14:02,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:14:04,095][09423] Updated weights for policy 0, policy_version 241787 (0.0036) [2024-06-28 14:14:07,912][09423] Updated weights for policy 0, policy_version 241797 (0.0030) [2024-06-28 14:14:07,921][09190] Fps is (10 sec: 42598.0, 60 sec: 42325.2, 300 sec: 42543.2). Total num frames: 3961602048. Throughput: 0: 42592.4. Samples: 240513960. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 14:14:07,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 14:14:11,915][09423] Updated weights for policy 0, policy_version 241807 (0.0033) [2024-06-28 14:14:12,921][09190] Fps is (10 sec: 42598.2, 60 sec: 42598.4, 300 sec: 42542.9). Total num frames: 3961798656. Throughput: 0: 42440.4. Samples: 240633580. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 14:14:12,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:14:15,716][09423] Updated weights for policy 0, policy_version 241817 (0.0032) [2024-06-28 14:14:17,922][09190] Fps is (10 sec: 42598.2, 60 sec: 42598.3, 300 sec: 42653.9). Total num frames: 3962028032. Throughput: 0: 42691.9. Samples: 240895860. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 14:14:17,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 14:14:17,934][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000241823_3962028032.pth... [2024-06-28 14:14:17,983][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000241198_3951788032.pth [2024-06-28 14:14:19,458][09423] Updated weights for policy 0, policy_version 241827 (0.0027) [2024-06-28 14:14:22,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42325.3, 300 sec: 42487.3). Total num frames: 3962224640. Throughput: 0: 42522.7. Samples: 241147740. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 14:14:22,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 14:14:23,293][09423] Updated weights for policy 0, policy_version 241837 (0.0035) [2024-06-28 14:14:27,337][09423] Updated weights for policy 0, policy_version 241847 (0.0046) [2024-06-28 14:14:27,921][09190] Fps is (10 sec: 40960.4, 60 sec: 42598.3, 300 sec: 42487.3). Total num frames: 3962437632. Throughput: 0: 42519.9. Samples: 241274600. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 14:14:27,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:14:31,214][09423] Updated weights for policy 0, policy_version 241857 (0.0027) [2024-06-28 14:14:32,921][09190] Fps is (10 sec: 44237.1, 60 sec: 42871.5, 300 sec: 42598.8). Total num frames: 3962667008. Throughput: 0: 42732.5. Samples: 241535640. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 14:14:32,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 14:14:34,723][09423] Updated weights for policy 0, policy_version 241867 (0.0024) [2024-06-28 14:14:37,921][09190] Fps is (10 sec: 42598.8, 60 sec: 42325.4, 300 sec: 42487.3). Total num frames: 3962863616. Throughput: 0: 42705.4. Samples: 241790300. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 14:14:37,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 14:14:38,736][09423] Updated weights for policy 0, policy_version 241877 (0.0033) [2024-06-28 14:14:42,507][09423] Updated weights for policy 0, policy_version 241887 (0.0034) [2024-06-28 14:14:42,921][09190] Fps is (10 sec: 40959.8, 60 sec: 42325.3, 300 sec: 42487.3). Total num frames: 3963076608. Throughput: 0: 42803.6. Samples: 241917620. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 14:14:42,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 14:14:46,298][09423] Updated weights for policy 0, policy_version 241897 (0.0030) [2024-06-28 14:14:47,922][09190] Fps is (10 sec: 45874.4, 60 sec: 42871.4, 300 sec: 42709.8). Total num frames: 3963322368. Throughput: 0: 42931.0. Samples: 242180440. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 14:14:47,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 14:14:50,233][09423] Updated weights for policy 0, policy_version 241907 (0.0034) [2024-06-28 14:14:52,921][09190] Fps is (10 sec: 42598.4, 60 sec: 43144.6, 300 sec: 42431.8). Total num frames: 3963502592. Throughput: 0: 42605.9. Samples: 242431220. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 14:14:52,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:14:54,148][09423] Updated weights for policy 0, policy_version 241917 (0.0036) [2024-06-28 14:14:57,698][09403] Signal inference workers to stop experience collection... (3350 times) [2024-06-28 14:14:57,704][09403] Signal inference workers to resume experience collection... (3350 times) [2024-06-28 14:14:57,737][09423] InferenceWorker_p0-w0: stopping experience collection (3350 times) [2024-06-28 14:14:57,737][09423] InferenceWorker_p0-w0: resuming experience collection (3350 times) [2024-06-28 14:14:57,863][09423] Updated weights for policy 0, policy_version 241927 (0.0036) [2024-06-28 14:14:57,921][09190] Fps is (10 sec: 40960.1, 60 sec: 42598.4, 300 sec: 42598.4). Total num frames: 3963731968. Throughput: 0: 42589.2. Samples: 242550100. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 14:14:57,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:15:01,839][09423] Updated weights for policy 0, policy_version 241937 (0.0029) [2024-06-28 14:15:02,921][09190] Fps is (10 sec: 44236.4, 60 sec: 42871.4, 300 sec: 42653.9). Total num frames: 3963944960. Throughput: 0: 42502.7. Samples: 242808480. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 14:15:02,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:15:05,422][09423] Updated weights for policy 0, policy_version 241947 (0.0037) [2024-06-28 14:15:07,921][09190] Fps is (10 sec: 42598.6, 60 sec: 42598.4, 300 sec: 42487.3). Total num frames: 3964157952. Throughput: 0: 42643.5. Samples: 243066700. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 14:15:07,930][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 14:15:09,587][09423] Updated weights for policy 0, policy_version 241957 (0.0037) [2024-06-28 14:15:12,921][09190] Fps is (10 sec: 40960.1, 60 sec: 42598.4, 300 sec: 42598.4). Total num frames: 3964354560. Throughput: 0: 42572.4. Samples: 243190360. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 14:15:12,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:15:13,164][09423] Updated weights for policy 0, policy_version 241967 (0.0038) [2024-06-28 14:15:17,174][09423] Updated weights for policy 0, policy_version 241977 (0.0031) [2024-06-28 14:15:17,921][09190] Fps is (10 sec: 40960.5, 60 sec: 42325.5, 300 sec: 42598.4). Total num frames: 3964567552. Throughput: 0: 42684.5. Samples: 243456440. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 14:15:17,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 14:15:21,253][09423] Updated weights for policy 0, policy_version 241987 (0.0036) [2024-06-28 14:15:22,921][09190] Fps is (10 sec: 44236.7, 60 sec: 42871.4, 300 sec: 42487.3). Total num frames: 3964796928. Throughput: 0: 42703.9. Samples: 243711980. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 14:15:22,926][09190] Avg episode reward: [(0, '0.756')] [2024-06-28 14:15:25,044][09423] Updated weights for policy 0, policy_version 241997 (0.0039) [2024-06-28 14:15:27,922][09190] Fps is (10 sec: 44235.9, 60 sec: 42871.4, 300 sec: 42653.9). Total num frames: 3965009920. Throughput: 0: 42768.7. Samples: 243842220. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 14:15:27,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:15:28,755][09423] Updated weights for policy 0, policy_version 242007 (0.0024) [2024-06-28 14:15:32,595][09423] Updated weights for policy 0, policy_version 242017 (0.0027) [2024-06-28 14:15:32,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42598.3, 300 sec: 42598.4). Total num frames: 3965222912. Throughput: 0: 42620.5. Samples: 244098360. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 14:15:32,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:15:36,407][09423] Updated weights for policy 0, policy_version 242027 (0.0032) [2024-06-28 14:15:37,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42598.3, 300 sec: 42487.3). Total num frames: 3965419520. Throughput: 0: 42714.6. Samples: 244353380. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 14:15:37,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 14:15:40,113][09423] Updated weights for policy 0, policy_version 242037 (0.0034) [2024-06-28 14:15:42,924][09190] Fps is (10 sec: 42587.8, 60 sec: 42869.6, 300 sec: 42709.1). Total num frames: 3965648896. Throughput: 0: 42783.4. Samples: 244475460. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 14:15:42,925][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 14:15:43,838][09423] Updated weights for policy 0, policy_version 242047 (0.0030) [2024-06-28 14:15:47,858][09423] Updated weights for policy 0, policy_version 242057 (0.0046) [2024-06-28 14:15:47,921][09190] Fps is (10 sec: 44237.2, 60 sec: 42325.4, 300 sec: 42598.4). Total num frames: 3965861888. Throughput: 0: 42772.9. Samples: 244733260. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 14:15:47,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:15:51,420][09423] Updated weights for policy 0, policy_version 242067 (0.0038) [2024-06-28 14:15:52,921][09190] Fps is (10 sec: 40970.8, 60 sec: 42598.4, 300 sec: 42542.9). Total num frames: 3966058496. Throughput: 0: 42811.7. Samples: 244993220. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 14:15:52,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:15:55,528][09423] Updated weights for policy 0, policy_version 242077 (0.0040) [2024-06-28 14:15:57,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42598.4, 300 sec: 42709.5). Total num frames: 3966287872. Throughput: 0: 42896.4. Samples: 245120700. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 14:15:57,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:15:59,079][09423] Updated weights for policy 0, policy_version 242087 (0.0036) [2024-06-28 14:16:02,922][09190] Fps is (10 sec: 42597.5, 60 sec: 42325.3, 300 sec: 42487.3). Total num frames: 3966484480. Throughput: 0: 42533.6. Samples: 245370460. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 14:16:02,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 14:16:03,314][09423] Updated weights for policy 0, policy_version 242097 (0.0029) [2024-06-28 14:16:07,300][09423] Updated weights for policy 0, policy_version 242107 (0.0033) [2024-06-28 14:16:07,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42325.4, 300 sec: 42542.9). Total num frames: 3966697472. Throughput: 0: 42645.4. Samples: 245631020. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 14:16:07,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:16:10,874][09423] Updated weights for policy 0, policy_version 242117 (0.0025) [2024-06-28 14:16:12,921][09190] Fps is (10 sec: 45875.7, 60 sec: 43144.6, 300 sec: 42765.0). Total num frames: 3966943232. Throughput: 0: 42583.7. Samples: 245758480. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 14:16:12,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 14:16:14,942][09423] Updated weights for policy 0, policy_version 242127 (0.0042) [2024-06-28 14:16:17,921][09190] Fps is (10 sec: 44236.5, 60 sec: 42871.4, 300 sec: 42542.8). Total num frames: 3967139840. Throughput: 0: 42448.9. Samples: 246008560. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 14:16:17,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:16:17,937][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000242135_3967139840.pth... [2024-06-28 14:16:17,988][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000241511_3956916224.pth [2024-06-28 14:16:18,707][09423] Updated weights for policy 0, policy_version 242137 (0.0030) [2024-06-28 14:16:22,531][09423] Updated weights for policy 0, policy_version 242147 (0.0029) [2024-06-28 14:16:22,921][09190] Fps is (10 sec: 39321.6, 60 sec: 42325.4, 300 sec: 42542.9). Total num frames: 3967336448. Throughput: 0: 42461.4. Samples: 246264140. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 14:16:22,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:16:26,386][09423] Updated weights for policy 0, policy_version 242157 (0.0033) [2024-06-28 14:16:27,921][09190] Fps is (10 sec: 44237.0, 60 sec: 42871.5, 300 sec: 42709.5). Total num frames: 3967582208. Throughput: 0: 42620.2. Samples: 246393260. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 14:16:27,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 14:16:30,159][09423] Updated weights for policy 0, policy_version 242167 (0.0031) [2024-06-28 14:16:32,921][09190] Fps is (10 sec: 44236.7, 60 sec: 42598.4, 300 sec: 42654.9). Total num frames: 3967778816. Throughput: 0: 42668.0. Samples: 246653320. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 14:16:32,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:16:34,167][09423] Updated weights for policy 0, policy_version 242177 (0.0038) [2024-06-28 14:16:34,198][09403] Signal inference workers to stop experience collection... (3400 times) [2024-06-28 14:16:34,199][09403] Signal inference workers to resume experience collection... (3400 times) [2024-06-28 14:16:34,215][09423] InferenceWorker_p0-w0: stopping experience collection (3400 times) [2024-06-28 14:16:34,215][09423] InferenceWorker_p0-w0: resuming experience collection (3400 times) [2024-06-28 14:16:37,648][09423] Updated weights for policy 0, policy_version 242187 (0.0044) [2024-06-28 14:16:37,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42871.5, 300 sec: 42654.0). Total num frames: 3967991808. Throughput: 0: 42624.3. Samples: 246911320. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 14:16:37,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 14:16:41,521][09423] Updated weights for policy 0, policy_version 242197 (0.0038) [2024-06-28 14:16:42,921][09190] Fps is (10 sec: 42598.6, 60 sec: 42600.2, 300 sec: 42598.4). Total num frames: 3968204800. Throughput: 0: 42698.7. Samples: 247042140. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 14:16:42,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:16:45,569][09423] Updated weights for policy 0, policy_version 242207 (0.0031) [2024-06-28 14:16:47,922][09190] Fps is (10 sec: 42598.0, 60 sec: 42598.3, 300 sec: 42598.4). Total num frames: 3968417792. Throughput: 0: 42790.2. Samples: 247296020. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 14:16:47,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:16:48,938][09423] Updated weights for policy 0, policy_version 242217 (0.0032) [2024-06-28 14:16:52,921][09190] Fps is (10 sec: 40959.6, 60 sec: 42598.3, 300 sec: 42654.9). Total num frames: 3968614400. Throughput: 0: 42699.1. Samples: 247552480. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 14:16:52,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:16:53,531][09423] Updated weights for policy 0, policy_version 242227 (0.0031) [2024-06-28 14:16:56,675][09423] Updated weights for policy 0, policy_version 242237 (0.0034) [2024-06-28 14:16:57,921][09190] Fps is (10 sec: 44237.0, 60 sec: 42871.4, 300 sec: 42709.5). Total num frames: 3968860160. Throughput: 0: 42703.5. Samples: 247680140. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 14:16:57,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:17:01,194][09423] Updated weights for policy 0, policy_version 242247 (0.0033) [2024-06-28 14:17:02,921][09190] Fps is (10 sec: 45875.6, 60 sec: 43144.6, 300 sec: 42653.9). Total num frames: 3969073152. Throughput: 0: 42845.4. Samples: 247936600. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 14:17:02,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:17:04,215][09423] Updated weights for policy 0, policy_version 242257 (0.0033) [2024-06-28 14:17:07,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42871.4, 300 sec: 42709.5). Total num frames: 3969269760. Throughput: 0: 42781.3. Samples: 248189300. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 14:17:07,924][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 14:17:09,246][09423] Updated weights for policy 0, policy_version 242267 (0.0037) [2024-06-28 14:17:12,251][09423] Updated weights for policy 0, policy_version 242277 (0.0039) [2024-06-28 14:17:12,921][09190] Fps is (10 sec: 40960.1, 60 sec: 42325.4, 300 sec: 42598.4). Total num frames: 3969482752. Throughput: 0: 42661.8. Samples: 248313040. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 14:17:12,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 14:17:16,671][09423] Updated weights for policy 0, policy_version 242287 (0.0032) [2024-06-28 14:17:17,924][09190] Fps is (10 sec: 44226.0, 60 sec: 42869.7, 300 sec: 42653.6). Total num frames: 3969712128. Throughput: 0: 42759.9. Samples: 248577620. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 14:17:17,925][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:17:19,815][09423] Updated weights for policy 0, policy_version 242297 (0.0035) [2024-06-28 14:17:22,921][09190] Fps is (10 sec: 44236.4, 60 sec: 43144.5, 300 sec: 42709.5). Total num frames: 3969925120. Throughput: 0: 42584.9. Samples: 248827640. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 14:17:22,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:17:24,146][09423] Updated weights for policy 0, policy_version 242307 (0.0039) [2024-06-28 14:17:27,601][09423] Updated weights for policy 0, policy_version 242317 (0.0038) [2024-06-28 14:17:27,922][09190] Fps is (10 sec: 42604.5, 60 sec: 42597.6, 300 sec: 42653.8). Total num frames: 3970138112. Throughput: 0: 42506.5. Samples: 248954980. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 14:17:27,923][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:17:32,011][09423] Updated weights for policy 0, policy_version 242327 (0.0026) [2024-06-28 14:17:32,921][09190] Fps is (10 sec: 40960.4, 60 sec: 42598.5, 300 sec: 42542.9). Total num frames: 3970334720. Throughput: 0: 42809.1. Samples: 249222420. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 14:17:32,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 14:17:34,980][09423] Updated weights for policy 0, policy_version 242337 (0.0036) [2024-06-28 14:17:37,921][09190] Fps is (10 sec: 42603.0, 60 sec: 42871.5, 300 sec: 42765.4). Total num frames: 3970564096. Throughput: 0: 42680.9. Samples: 249473120. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 14:17:37,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 14:17:39,698][09423] Updated weights for policy 0, policy_version 242347 (0.0033) [2024-06-28 14:17:42,667][09423] Updated weights for policy 0, policy_version 242357 (0.0034) [2024-06-28 14:17:42,921][09190] Fps is (10 sec: 45875.1, 60 sec: 43144.5, 300 sec: 42709.5). Total num frames: 3970793472. Throughput: 0: 42737.4. Samples: 249603320. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 14:17:42,928][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 14:17:47,147][09423] Updated weights for policy 0, policy_version 242367 (0.0039) [2024-06-28 14:17:47,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42871.6, 300 sec: 42653.9). Total num frames: 3970990080. Throughput: 0: 42880.9. Samples: 249866240. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 14:17:47,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:17:50,137][09423] Updated weights for policy 0, policy_version 242377 (0.0026) [2024-06-28 14:17:52,924][09190] Fps is (10 sec: 39311.5, 60 sec: 42869.7, 300 sec: 42653.9). Total num frames: 3971186688. Throughput: 0: 42915.9. Samples: 250120620. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 14:17:52,925][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:17:54,645][09423] Updated weights for policy 0, policy_version 242387 (0.0040) [2024-06-28 14:17:57,921][09190] Fps is (10 sec: 42598.1, 60 sec: 42598.4, 300 sec: 42653.9). Total num frames: 3971416064. Throughput: 0: 42956.8. Samples: 250246100. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 14:17:57,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:17:57,946][09423] Updated weights for policy 0, policy_version 242397 (0.0036) [2024-06-28 14:17:58,829][09403] Signal inference workers to stop experience collection... (3450 times) [2024-06-28 14:17:58,830][09403] Signal inference workers to resume experience collection... (3450 times) [2024-06-28 14:17:58,840][09423] InferenceWorker_p0-w0: stopping experience collection (3450 times) [2024-06-28 14:17:58,840][09423] InferenceWorker_p0-w0: resuming experience collection (3450 times) [2024-06-28 14:18:02,260][09423] Updated weights for policy 0, policy_version 242407 (0.0031) [2024-06-28 14:18:02,921][09190] Fps is (10 sec: 42609.0, 60 sec: 42325.3, 300 sec: 42542.9). Total num frames: 3971612672. Throughput: 0: 42895.7. Samples: 250507820. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 14:18:02,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:18:05,634][09423] Updated weights for policy 0, policy_version 242417 (0.0028) [2024-06-28 14:18:07,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42871.5, 300 sec: 42709.5). Total num frames: 3971842048. Throughput: 0: 42937.8. Samples: 250759840. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 14:18:07,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:18:09,739][09423] Updated weights for policy 0, policy_version 242427 (0.0044) [2024-06-28 14:18:12,921][09190] Fps is (10 sec: 42598.9, 60 sec: 42598.4, 300 sec: 42598.4). Total num frames: 3972038656. Throughput: 0: 42905.6. Samples: 250885680. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 14:18:12,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:18:13,510][09423] Updated weights for policy 0, policy_version 242437 (0.0034) [2024-06-28 14:18:17,901][09423] Updated weights for policy 0, policy_version 242447 (0.0031) [2024-06-28 14:18:17,921][09190] Fps is (10 sec: 40960.3, 60 sec: 42327.1, 300 sec: 42598.4). Total num frames: 3972251648. Throughput: 0: 42499.1. Samples: 251134880. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 14:18:17,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:18:17,943][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000242447_3972251648.pth... [2024-06-28 14:18:18,010][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000241823_3962028032.pth [2024-06-28 14:18:21,264][09423] Updated weights for policy 0, policy_version 242457 (0.0036) [2024-06-28 14:18:22,921][09190] Fps is (10 sec: 44236.6, 60 sec: 42598.5, 300 sec: 42709.5). Total num frames: 3972481024. Throughput: 0: 42563.2. Samples: 251388460. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 14:18:22,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:18:25,304][09423] Updated weights for policy 0, policy_version 242467 (0.0028) [2024-06-28 14:18:27,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42326.2, 300 sec: 42654.0). Total num frames: 3972677632. Throughput: 0: 42664.0. Samples: 251523200. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 14:18:27,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 14:18:28,747][09423] Updated weights for policy 0, policy_version 242477 (0.0041) [2024-06-28 14:18:32,784][09423] Updated weights for policy 0, policy_version 242487 (0.0027) [2024-06-28 14:18:32,922][09190] Fps is (10 sec: 42597.7, 60 sec: 42871.3, 300 sec: 42653.9). Total num frames: 3972907008. Throughput: 0: 42476.3. Samples: 251777680. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 14:18:32,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 14:18:36,332][09423] Updated weights for policy 0, policy_version 242497 (0.0038) [2024-06-28 14:18:37,921][09190] Fps is (10 sec: 44236.4, 60 sec: 42598.4, 300 sec: 42653.9). Total num frames: 3973120000. Throughput: 0: 42415.2. Samples: 252029200. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 14:18:37,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 14:18:40,545][09423] Updated weights for policy 0, policy_version 242507 (0.0024) [2024-06-28 14:18:42,921][09190] Fps is (10 sec: 42598.7, 60 sec: 42325.3, 300 sec: 42653.9). Total num frames: 3973332992. Throughput: 0: 42640.9. Samples: 252164940. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 14:18:42,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 14:18:44,017][09423] Updated weights for policy 0, policy_version 242517 (0.0033) [2024-06-28 14:18:47,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42598.4, 300 sec: 42820.6). Total num frames: 3973545984. Throughput: 0: 42515.1. Samples: 252421000. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 14:18:47,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 14:18:48,445][09423] Updated weights for policy 0, policy_version 242527 (0.0037) [2024-06-28 14:18:51,880][09423] Updated weights for policy 0, policy_version 242537 (0.0047) [2024-06-28 14:18:52,928][09190] Fps is (10 sec: 42570.9, 60 sec: 42868.6, 300 sec: 42653.0). Total num frames: 3973758976. Throughput: 0: 42477.5. Samples: 252671600. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 14:18:52,928][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:18:56,167][09423] Updated weights for policy 0, policy_version 242547 (0.0031) [2024-06-28 14:18:57,921][09190] Fps is (10 sec: 42598.7, 60 sec: 42598.5, 300 sec: 42709.5). Total num frames: 3973971968. Throughput: 0: 42700.4. Samples: 252807200. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 14:18:57,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 14:18:59,403][09423] Updated weights for policy 0, policy_version 242557 (0.0025) [2024-06-28 14:19:02,921][09190] Fps is (10 sec: 42625.9, 60 sec: 42871.5, 300 sec: 42654.0). Total num frames: 3974184960. Throughput: 0: 42696.4. Samples: 253056220. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 14:19:02,930][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 14:19:03,666][09423] Updated weights for policy 0, policy_version 242567 (0.0045) [2024-06-28 14:19:07,120][09423] Updated weights for policy 0, policy_version 242577 (0.0042) [2024-06-28 14:19:07,922][09190] Fps is (10 sec: 42596.7, 60 sec: 42598.2, 300 sec: 42709.4). Total num frames: 3974397952. Throughput: 0: 42687.2. Samples: 253309400. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 14:19:07,931][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:19:11,401][09423] Updated weights for policy 0, policy_version 242587 (0.0035) [2024-06-28 14:19:12,921][09190] Fps is (10 sec: 40960.1, 60 sec: 42598.3, 300 sec: 42598.4). Total num frames: 3974594560. Throughput: 0: 42692.8. Samples: 253444380. Policy #0 lag: (min: 0.0, avg: 11.1, max: 22.0) [2024-06-28 14:19:12,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:19:14,968][09423] Updated weights for policy 0, policy_version 242597 (0.0029) [2024-06-28 14:19:17,921][09190] Fps is (10 sec: 40961.5, 60 sec: 42598.4, 300 sec: 42653.9). Total num frames: 3974807552. Throughput: 0: 42712.1. Samples: 253699720. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 14:19:17,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 14:19:18,806][09423] Updated weights for policy 0, policy_version 242607 (0.0044) [2024-06-28 14:19:22,599][09423] Updated weights for policy 0, policy_version 242617 (0.0047) [2024-06-28 14:19:22,921][09190] Fps is (10 sec: 44236.6, 60 sec: 42598.3, 300 sec: 42709.5). Total num frames: 3975036928. Throughput: 0: 42736.0. Samples: 253952320. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 14:19:22,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:19:26,548][09423] Updated weights for policy 0, policy_version 242627 (0.0034) [2024-06-28 14:19:27,924][09190] Fps is (10 sec: 42588.2, 60 sec: 42596.7, 300 sec: 42598.0). Total num frames: 3975233536. Throughput: 0: 42625.8. Samples: 254083200. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 14:19:27,924][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:19:28,205][09403] Signal inference workers to stop experience collection... (3500 times) [2024-06-28 14:19:28,260][09423] InferenceWorker_p0-w0: stopping experience collection (3500 times) [2024-06-28 14:19:28,317][09403] Signal inference workers to resume experience collection... (3500 times) [2024-06-28 14:19:28,317][09423] InferenceWorker_p0-w0: resuming experience collection (3500 times) [2024-06-28 14:19:30,341][09423] Updated weights for policy 0, policy_version 242637 (0.0039) [2024-06-28 14:19:32,921][09190] Fps is (10 sec: 42598.7, 60 sec: 42598.5, 300 sec: 42709.5). Total num frames: 3975462912. Throughput: 0: 42638.7. Samples: 254339740. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 14:19:32,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 14:19:34,048][09423] Updated weights for policy 0, policy_version 242647 (0.0025) [2024-06-28 14:19:37,859][09423] Updated weights for policy 0, policy_version 242657 (0.0044) [2024-06-28 14:19:37,922][09190] Fps is (10 sec: 45885.7, 60 sec: 42871.4, 300 sec: 42765.0). Total num frames: 3975692288. Throughput: 0: 42813.2. Samples: 254597920. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 14:19:37,923][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 14:19:41,784][09423] Updated weights for policy 0, policy_version 242667 (0.0048) [2024-06-28 14:19:42,922][09190] Fps is (10 sec: 42597.9, 60 sec: 42598.4, 300 sec: 42598.4). Total num frames: 3975888896. Throughput: 0: 42600.8. Samples: 254724240. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 14:19:42,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:19:45,376][09423] Updated weights for policy 0, policy_version 242677 (0.0039) [2024-06-28 14:19:47,921][09190] Fps is (10 sec: 40960.5, 60 sec: 42598.4, 300 sec: 42709.5). Total num frames: 3976101888. Throughput: 0: 42818.7. Samples: 254983060. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 14:19:47,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 14:19:49,283][09423] Updated weights for policy 0, policy_version 242687 (0.0041) [2024-06-28 14:19:52,921][09190] Fps is (10 sec: 44237.2, 60 sec: 42876.1, 300 sec: 42709.5). Total num frames: 3976331264. Throughput: 0: 42763.4. Samples: 255233740. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 14:19:52,922][09190] Avg episode reward: [(0, '0.756')] [2024-06-28 14:19:53,223][09423] Updated weights for policy 0, policy_version 242697 (0.0034) [2024-06-28 14:19:56,834][09423] Updated weights for policy 0, policy_version 242707 (0.0026) [2024-06-28 14:19:57,921][09190] Fps is (10 sec: 42598.1, 60 sec: 42598.3, 300 sec: 42653.9). Total num frames: 3976527872. Throughput: 0: 42743.1. Samples: 255367820. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 14:19:57,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:20:00,914][09423] Updated weights for policy 0, policy_version 242717 (0.0037) [2024-06-28 14:20:02,924][09190] Fps is (10 sec: 40949.7, 60 sec: 42596.6, 300 sec: 42653.6). Total num frames: 3976740864. Throughput: 0: 42746.9. Samples: 255623440. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 14:20:02,933][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:20:04,661][09423] Updated weights for policy 0, policy_version 242727 (0.0040) [2024-06-28 14:20:07,921][09190] Fps is (10 sec: 44237.0, 60 sec: 42871.7, 300 sec: 42765.0). Total num frames: 3976970240. Throughput: 0: 42781.4. Samples: 255877480. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 14:20:07,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:20:08,593][09423] Updated weights for policy 0, policy_version 242737 (0.0036) [2024-06-28 14:20:12,164][09423] Updated weights for policy 0, policy_version 242747 (0.0027) [2024-06-28 14:20:12,921][09190] Fps is (10 sec: 44247.6, 60 sec: 43144.5, 300 sec: 42765.0). Total num frames: 3977183232. Throughput: 0: 42861.8. Samples: 256011880. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 14:20:12,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 14:20:16,149][09423] Updated weights for policy 0, policy_version 242757 (0.0038) [2024-06-28 14:20:17,922][09190] Fps is (10 sec: 40959.4, 60 sec: 42871.4, 300 sec: 42653.9). Total num frames: 3977379840. Throughput: 0: 42701.7. Samples: 256261320. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 14:20:17,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:20:17,934][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000242760_3977379840.pth... [2024-06-28 14:20:17,987][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000242135_3967139840.pth [2024-06-28 14:20:19,725][09423] Updated weights for policy 0, policy_version 242767 (0.0032) [2024-06-28 14:20:22,922][09190] Fps is (10 sec: 42598.1, 60 sec: 42871.4, 300 sec: 42709.5). Total num frames: 3977609216. Throughput: 0: 42710.2. Samples: 256519880. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 14:20:22,922][09190] Avg episode reward: [(0, '0.756')] [2024-06-28 14:20:23,651][09423] Updated weights for policy 0, policy_version 242777 (0.0033) [2024-06-28 14:20:27,537][09423] Updated weights for policy 0, policy_version 242787 (0.0032) [2024-06-28 14:20:27,921][09190] Fps is (10 sec: 44237.2, 60 sec: 43146.2, 300 sec: 42709.5). Total num frames: 3977822208. Throughput: 0: 42746.7. Samples: 256647840. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 14:20:27,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:20:31,223][09423] Updated weights for policy 0, policy_version 242797 (0.0035) [2024-06-28 14:20:32,925][09190] Fps is (10 sec: 40945.8, 60 sec: 42595.8, 300 sec: 42709.0). Total num frames: 3978018816. Throughput: 0: 42627.2. Samples: 256901440. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 14:20:32,925][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:20:35,203][09423] Updated weights for policy 0, policy_version 242807 (0.0032) [2024-06-28 14:20:37,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42598.5, 300 sec: 42709.8). Total num frames: 3978248192. Throughput: 0: 42744.0. Samples: 257157220. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 14:20:37,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 14:20:39,026][09423] Updated weights for policy 0, policy_version 242817 (0.0030) [2024-06-28 14:20:41,914][09403] Signal inference workers to stop experience collection... (3550 times) [2024-06-28 14:20:41,964][09423] InferenceWorker_p0-w0: stopping experience collection (3550 times) [2024-06-28 14:20:41,970][09403] Signal inference workers to resume experience collection... (3550 times) [2024-06-28 14:20:41,975][09423] InferenceWorker_p0-w0: resuming experience collection (3550 times) [2024-06-28 14:20:42,921][09190] Fps is (10 sec: 44253.1, 60 sec: 42871.6, 300 sec: 42709.5). Total num frames: 3978461184. Throughput: 0: 42746.3. Samples: 257291400. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 14:20:42,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:20:42,946][09423] Updated weights for policy 0, policy_version 242827 (0.0030) [2024-06-28 14:20:46,814][09423] Updated weights for policy 0, policy_version 242837 (0.0039) [2024-06-28 14:20:47,921][09190] Fps is (10 sec: 42598.2, 60 sec: 42871.4, 300 sec: 42765.0). Total num frames: 3978674176. Throughput: 0: 42874.8. Samples: 257552700. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 14:20:47,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 14:20:50,704][09423] Updated weights for policy 0, policy_version 242847 (0.0035) [2024-06-28 14:20:52,921][09190] Fps is (10 sec: 42597.9, 60 sec: 42598.4, 300 sec: 42709.5). Total num frames: 3978887168. Throughput: 0: 42899.5. Samples: 257807960. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 14:20:52,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:20:54,293][09423] Updated weights for policy 0, policy_version 242857 (0.0040) [2024-06-28 14:20:57,923][09190] Fps is (10 sec: 42593.7, 60 sec: 42870.7, 300 sec: 42764.9). Total num frames: 3979100160. Throughput: 0: 42701.7. Samples: 257933500. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 14:20:57,923][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:20:58,259][09423] Updated weights for policy 0, policy_version 242867 (0.0029) [2024-06-28 14:21:01,618][09423] Updated weights for policy 0, policy_version 242877 (0.0029) [2024-06-28 14:21:02,924][09190] Fps is (10 sec: 42587.9, 60 sec: 42871.5, 300 sec: 42764.7). Total num frames: 3979313152. Throughput: 0: 42805.3. Samples: 258187660. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 14:21:02,924][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:21:06,097][09423] Updated weights for policy 0, policy_version 242887 (0.0031) [2024-06-28 14:21:07,921][09190] Fps is (10 sec: 44241.7, 60 sec: 42871.4, 300 sec: 42709.5). Total num frames: 3979542528. Throughput: 0: 42964.1. Samples: 258453260. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 14:21:07,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 14:21:09,319][09423] Updated weights for policy 0, policy_version 242897 (0.0026) [2024-06-28 14:21:12,922][09190] Fps is (10 sec: 42605.8, 60 sec: 42597.9, 300 sec: 42709.4). Total num frames: 3979739136. Throughput: 0: 42989.1. Samples: 258582380. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 14:21:12,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 14:21:13,598][09423] Updated weights for policy 0, policy_version 242907 (0.0028) [2024-06-28 14:21:17,218][09423] Updated weights for policy 0, policy_version 242917 (0.0037) [2024-06-28 14:21:17,921][09190] Fps is (10 sec: 40960.1, 60 sec: 42871.5, 300 sec: 42765.0). Total num frames: 3979952128. Throughput: 0: 42899.0. Samples: 258831740. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 14:21:17,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:21:21,145][09423] Updated weights for policy 0, policy_version 242927 (0.0041) [2024-06-28 14:21:22,921][09190] Fps is (10 sec: 44240.5, 60 sec: 42871.6, 300 sec: 42709.5). Total num frames: 3980181504. Throughput: 0: 42841.0. Samples: 259085060. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 14:21:22,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:21:24,890][09423] Updated weights for policy 0, policy_version 242937 (0.0038) [2024-06-28 14:21:27,922][09190] Fps is (10 sec: 40959.3, 60 sec: 42325.2, 300 sec: 42653.9). Total num frames: 3980361728. Throughput: 0: 42697.5. Samples: 259212800. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 14:21:27,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 14:21:29,267][09423] Updated weights for policy 0, policy_version 242947 (0.0039) [2024-06-28 14:21:32,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42874.1, 300 sec: 42709.5). Total num frames: 3980591104. Throughput: 0: 42484.5. Samples: 259464500. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 14:21:32,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 14:21:33,023][09423] Updated weights for policy 0, policy_version 242957 (0.0038) [2024-06-28 14:21:37,031][09423] Updated weights for policy 0, policy_version 242967 (0.0031) [2024-06-28 14:21:37,924][09190] Fps is (10 sec: 45864.4, 60 sec: 42869.7, 300 sec: 42764.6). Total num frames: 3980820480. Throughput: 0: 42534.5. Samples: 259722120. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 14:21:37,925][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:21:40,407][09423] Updated weights for policy 0, policy_version 242977 (0.0033) [2024-06-28 14:21:42,921][09190] Fps is (10 sec: 42597.9, 60 sec: 42598.3, 300 sec: 42709.5). Total num frames: 3981017088. Throughput: 0: 42465.9. Samples: 259844420. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 14:21:42,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:21:44,846][09423] Updated weights for policy 0, policy_version 242987 (0.0044) [2024-06-28 14:21:47,884][09423] Updated weights for policy 0, policy_version 242997 (0.0026) [2024-06-28 14:21:47,921][09190] Fps is (10 sec: 44248.1, 60 sec: 43144.6, 300 sec: 42876.1). Total num frames: 3981262848. Throughput: 0: 42816.2. Samples: 260114280. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 14:21:47,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 14:21:52,453][09423] Updated weights for policy 0, policy_version 243007 (0.0040) [2024-06-28 14:21:52,921][09190] Fps is (10 sec: 42599.0, 60 sec: 42598.5, 300 sec: 42654.0). Total num frames: 3981443072. Throughput: 0: 42587.6. Samples: 260369700. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 14:21:52,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 14:21:55,884][09423] Updated weights for policy 0, policy_version 243017 (0.0031) [2024-06-28 14:21:57,921][09190] Fps is (10 sec: 39321.2, 60 sec: 42599.2, 300 sec: 42653.9). Total num frames: 3981656064. Throughput: 0: 42430.9. Samples: 260491740. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 14:21:57,926][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:21:59,966][09423] Updated weights for policy 0, policy_version 243027 (0.0024) [2024-06-28 14:22:02,921][09190] Fps is (10 sec: 42598.0, 60 sec: 42600.1, 300 sec: 42709.5). Total num frames: 3981869056. Throughput: 0: 42622.6. Samples: 260749760. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 14:22:02,922][09190] Avg episode reward: [(0, '0.761')] [2024-06-28 14:22:03,374][09423] Updated weights for policy 0, policy_version 243037 (0.0039) [2024-06-28 14:22:06,363][09403] Signal inference workers to stop experience collection... (3600 times) [2024-06-28 14:22:06,364][09403] Signal inference workers to resume experience collection... (3600 times) [2024-06-28 14:22:06,416][09423] InferenceWorker_p0-w0: stopping experience collection (3600 times) [2024-06-28 14:22:06,416][09423] InferenceWorker_p0-w0: resuming experience collection (3600 times) [2024-06-28 14:22:07,924][09190] Fps is (10 sec: 40949.8, 60 sec: 42050.5, 300 sec: 42653.6). Total num frames: 3982065664. Throughput: 0: 42702.4. Samples: 261006780. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 14:22:07,925][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:22:08,076][09423] Updated weights for policy 0, policy_version 243047 (0.0030) [2024-06-28 14:22:11,394][09423] Updated weights for policy 0, policy_version 243057 (0.0038) [2024-06-28 14:22:12,921][09190] Fps is (10 sec: 42598.8, 60 sec: 42599.0, 300 sec: 42654.3). Total num frames: 3982295040. Throughput: 0: 42552.2. Samples: 261127640. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 14:22:12,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:22:15,611][09423] Updated weights for policy 0, policy_version 243067 (0.0028) [2024-06-28 14:22:17,921][09190] Fps is (10 sec: 42609.0, 60 sec: 42325.3, 300 sec: 42598.4). Total num frames: 3982491648. Throughput: 0: 42626.1. Samples: 261382680. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 14:22:17,922][09190] Avg episode reward: [(0, '0.756')] [2024-06-28 14:22:17,930][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000243072_3982491648.pth... [2024-06-28 14:22:18,005][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000242447_3972251648.pth [2024-06-28 14:22:18,832][09423] Updated weights for policy 0, policy_version 243077 (0.0025) [2024-06-28 14:22:22,921][09190] Fps is (10 sec: 42598.1, 60 sec: 42325.3, 300 sec: 42654.1). Total num frames: 3982721024. Throughput: 0: 42678.4. Samples: 261642540. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 14:22:22,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:22:23,273][09423] Updated weights for policy 0, policy_version 243087 (0.0036) [2024-06-28 14:22:26,605][09423] Updated weights for policy 0, policy_version 243097 (0.0036) [2024-06-28 14:22:27,922][09190] Fps is (10 sec: 45874.8, 60 sec: 43144.5, 300 sec: 42765.0). Total num frames: 3982950400. Throughput: 0: 42843.9. Samples: 261772400. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 14:22:27,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:22:30,703][09423] Updated weights for policy 0, policy_version 243107 (0.0035) [2024-06-28 14:22:32,921][09190] Fps is (10 sec: 42598.1, 60 sec: 42598.3, 300 sec: 42653.9). Total num frames: 3983147008. Throughput: 0: 42629.7. Samples: 262032620. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 14:22:32,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:22:34,115][09423] Updated weights for policy 0, policy_version 243117 (0.0036) [2024-06-28 14:22:37,921][09190] Fps is (10 sec: 42598.9, 60 sec: 42600.1, 300 sec: 42653.9). Total num frames: 3983376384. Throughput: 0: 42511.9. Samples: 262282740. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 14:22:37,922][09190] Avg episode reward: [(0, '0.756')] [2024-06-28 14:22:38,112][09423] Updated weights for policy 0, policy_version 243127 (0.0036) [2024-06-28 14:22:41,934][09423] Updated weights for policy 0, policy_version 243137 (0.0038) [2024-06-28 14:22:42,921][09190] Fps is (10 sec: 45876.0, 60 sec: 43144.7, 300 sec: 42765.0). Total num frames: 3983605760. Throughput: 0: 42785.9. Samples: 262417100. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 14:22:42,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 14:22:46,261][09423] Updated weights for policy 0, policy_version 243147 (0.0034) [2024-06-28 14:22:47,924][09190] Fps is (10 sec: 39312.0, 60 sec: 41777.4, 300 sec: 42653.9). Total num frames: 3983769600. Throughput: 0: 42695.4. Samples: 262671160. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 14:22:47,924][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 14:22:49,803][09423] Updated weights for policy 0, policy_version 243157 (0.0031) [2024-06-28 14:22:52,921][09190] Fps is (10 sec: 40959.7, 60 sec: 42871.4, 300 sec: 42709.5). Total num frames: 3984015360. Throughput: 0: 42506.9. Samples: 262919480. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 14:22:52,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 14:22:53,706][09423] Updated weights for policy 0, policy_version 243167 (0.0041) [2024-06-28 14:22:57,306][09423] Updated weights for policy 0, policy_version 243177 (0.0031) [2024-06-28 14:22:57,921][09190] Fps is (10 sec: 47525.4, 60 sec: 43144.6, 300 sec: 42820.6). Total num frames: 3984244736. Throughput: 0: 42885.3. Samples: 263057480. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 14:22:57,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 14:23:01,042][09423] Updated weights for policy 0, policy_version 243187 (0.0024) [2024-06-28 14:23:02,921][09190] Fps is (10 sec: 39321.3, 60 sec: 42325.3, 300 sec: 42598.4). Total num frames: 3984408576. Throughput: 0: 42946.7. Samples: 263315280. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 14:23:02,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 14:23:04,666][09423] Updated weights for policy 0, policy_version 243197 (0.0046) [2024-06-28 14:23:07,921][09190] Fps is (10 sec: 39321.6, 60 sec: 42873.3, 300 sec: 42709.5). Total num frames: 3984637952. Throughput: 0: 42773.8. Samples: 263567360. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 14:23:07,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 14:23:08,996][09423] Updated weights for policy 0, policy_version 243207 (0.0049) [2024-06-28 14:23:12,579][09423] Updated weights for policy 0, policy_version 243217 (0.0030) [2024-06-28 14:23:12,921][09190] Fps is (10 sec: 47513.6, 60 sec: 43144.5, 300 sec: 42820.5). Total num frames: 3984883712. Throughput: 0: 42724.6. Samples: 263695000. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 14:23:12,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 14:23:16,597][09423] Updated weights for policy 0, policy_version 243227 (0.0043) [2024-06-28 14:23:17,921][09190] Fps is (10 sec: 40959.8, 60 sec: 42598.4, 300 sec: 42598.4). Total num frames: 3985047552. Throughput: 0: 42425.3. Samples: 263941760. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 14:23:17,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:23:20,419][09423] Updated weights for policy 0, policy_version 243237 (0.0035) [2024-06-28 14:23:22,921][09190] Fps is (10 sec: 40960.3, 60 sec: 42871.5, 300 sec: 42765.0). Total num frames: 3985293312. Throughput: 0: 42541.0. Samples: 264197080. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 14:23:22,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:23:24,030][09423] Updated weights for policy 0, policy_version 243247 (0.0037) [2024-06-28 14:23:27,921][09190] Fps is (10 sec: 44237.7, 60 sec: 42325.5, 300 sec: 42654.0). Total num frames: 3985489920. Throughput: 0: 42456.5. Samples: 264327640. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 14:23:27,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 14:23:28,282][09423] Updated weights for policy 0, policy_version 243257 (0.0031) [2024-06-28 14:23:31,975][09423] Updated weights for policy 0, policy_version 243267 (0.0032) [2024-06-28 14:23:32,921][09190] Fps is (10 sec: 39321.8, 60 sec: 42325.5, 300 sec: 42598.4). Total num frames: 3985686528. Throughput: 0: 42418.9. Samples: 264579900. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 14:23:32,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:23:35,898][09403] Signal inference workers to stop experience collection... (3650 times) [2024-06-28 14:23:35,900][09403] Signal inference workers to resume experience collection... (3650 times) [2024-06-28 14:23:35,919][09423] InferenceWorker_p0-w0: stopping experience collection (3650 times) [2024-06-28 14:23:35,919][09423] InferenceWorker_p0-w0: resuming experience collection (3650 times) [2024-06-28 14:23:36,040][09423] Updated weights for policy 0, policy_version 243277 (0.0039) [2024-06-28 14:23:37,921][09190] Fps is (10 sec: 44235.9, 60 sec: 42598.4, 300 sec: 42709.5). Total num frames: 3985932288. Throughput: 0: 42421.2. Samples: 264828440. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 14:23:37,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:23:39,560][09423] Updated weights for policy 0, policy_version 243287 (0.0039) [2024-06-28 14:23:42,921][09190] Fps is (10 sec: 42598.3, 60 sec: 41779.2, 300 sec: 42598.4). Total num frames: 3986112512. Throughput: 0: 42344.5. Samples: 264962980. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 14:23:42,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 14:23:43,610][09423] Updated weights for policy 0, policy_version 243297 (0.0023) [2024-06-28 14:23:47,725][09423] Updated weights for policy 0, policy_version 243307 (0.0036) [2024-06-28 14:23:47,922][09190] Fps is (10 sec: 40959.8, 60 sec: 42873.1, 300 sec: 42654.9). Total num frames: 3986341888. Throughput: 0: 42290.1. Samples: 265218340. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 14:23:47,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:23:51,342][09423] Updated weights for policy 0, policy_version 243317 (0.0032) [2024-06-28 14:23:52,921][09190] Fps is (10 sec: 45874.7, 60 sec: 42598.4, 300 sec: 42709.5). Total num frames: 3986571264. Throughput: 0: 42334.6. Samples: 265472420. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 14:23:52,922][09190] Avg episode reward: [(0, '0.759')] [2024-06-28 14:23:55,104][09423] Updated weights for policy 0, policy_version 243327 (0.0035) [2024-06-28 14:23:57,921][09190] Fps is (10 sec: 40961.0, 60 sec: 41779.3, 300 sec: 42598.4). Total num frames: 3986751488. Throughput: 0: 42385.0. Samples: 265602320. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 14:23:57,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:23:58,780][09423] Updated weights for policy 0, policy_version 243337 (0.0050) [2024-06-28 14:24:02,447][09423] Updated weights for policy 0, policy_version 243347 (0.0030) [2024-06-28 14:24:02,921][09190] Fps is (10 sec: 42598.8, 60 sec: 43144.6, 300 sec: 42709.5). Total num frames: 3986997248. Throughput: 0: 42630.4. Samples: 265860120. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 14:24:02,922][09190] Avg episode reward: [(0, '0.761')] [2024-06-28 14:24:06,941][09423] Updated weights for policy 0, policy_version 243357 (0.0041) [2024-06-28 14:24:07,922][09190] Fps is (10 sec: 45874.2, 60 sec: 42871.4, 300 sec: 42765.0). Total num frames: 3987210240. Throughput: 0: 42486.5. Samples: 266108980. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 14:24:07,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:24:10,520][09423] Updated weights for policy 0, policy_version 243367 (0.0038) [2024-06-28 14:24:12,922][09190] Fps is (10 sec: 40959.3, 60 sec: 42052.2, 300 sec: 42709.5). Total num frames: 3987406848. Throughput: 0: 42461.6. Samples: 266238420. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 14:24:12,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:24:14,542][09423] Updated weights for policy 0, policy_version 243377 (0.0022) [2024-06-28 14:24:17,924][09190] Fps is (10 sec: 42588.2, 60 sec: 43142.8, 300 sec: 42709.1). Total num frames: 3987636224. Throughput: 0: 42525.5. Samples: 266493660. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 14:24:17,925][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 14:24:17,937][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000243386_3987636224.pth... [2024-06-28 14:24:17,992][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000242760_3977379840.pth [2024-06-28 14:24:18,332][09423] Updated weights for policy 0, policy_version 243387 (0.0034) [2024-06-28 14:24:22,307][09423] Updated weights for policy 0, policy_version 243397 (0.0038) [2024-06-28 14:24:22,921][09190] Fps is (10 sec: 44237.4, 60 sec: 42598.4, 300 sec: 42765.4). Total num frames: 3987849216. Throughput: 0: 42738.3. Samples: 266751660. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 14:24:22,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 14:24:25,723][09423] Updated weights for policy 0, policy_version 243407 (0.0040) [2024-06-28 14:24:27,921][09190] Fps is (10 sec: 39331.7, 60 sec: 42325.3, 300 sec: 42598.4). Total num frames: 3988029440. Throughput: 0: 42560.9. Samples: 266878220. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 14:24:27,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 14:24:29,763][09423] Updated weights for policy 0, policy_version 243417 (0.0030) [2024-06-28 14:24:32,921][09190] Fps is (10 sec: 42598.8, 60 sec: 43144.6, 300 sec: 42654.0). Total num frames: 3988275200. Throughput: 0: 42630.9. Samples: 267136720. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 14:24:32,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 14:24:33,144][09423] Updated weights for policy 0, policy_version 243427 (0.0031) [2024-06-28 14:24:37,808][09423] Updated weights for policy 0, policy_version 243437 (0.0029) [2024-06-28 14:24:37,921][09190] Fps is (10 sec: 44236.4, 60 sec: 42325.4, 300 sec: 42653.9). Total num frames: 3988471808. Throughput: 0: 42722.7. Samples: 267394940. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 14:24:37,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 14:24:40,713][09423] Updated weights for policy 0, policy_version 243447 (0.0032) [2024-06-28 14:24:42,921][09190] Fps is (10 sec: 37682.7, 60 sec: 42325.3, 300 sec: 42542.9). Total num frames: 3988652032. Throughput: 0: 42483.0. Samples: 267514060. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 14:24:42,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 14:24:45,329][09403] Signal inference workers to stop experience collection... (3700 times) [2024-06-28 14:24:45,371][09423] InferenceWorker_p0-w0: stopping experience collection (3700 times) [2024-06-28 14:24:45,446][09403] Signal inference workers to resume experience collection... (3700 times) [2024-06-28 14:24:45,446][09423] InferenceWorker_p0-w0: resuming experience collection (3700 times) [2024-06-28 14:24:45,597][09423] Updated weights for policy 0, policy_version 243457 (0.0033) [2024-06-28 14:24:47,921][09190] Fps is (10 sec: 44236.9, 60 sec: 42871.6, 300 sec: 42653.9). Total num frames: 3988914176. Throughput: 0: 42308.4. Samples: 267764000. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 14:24:47,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:24:48,797][09423] Updated weights for policy 0, policy_version 243467 (0.0033) [2024-06-28 14:24:52,921][09190] Fps is (10 sec: 44237.5, 60 sec: 42052.4, 300 sec: 42598.4). Total num frames: 3989094400. Throughput: 0: 42671.4. Samples: 268029180. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 14:24:52,921][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 14:24:53,121][09423] Updated weights for policy 0, policy_version 243477 (0.0039) [2024-06-28 14:24:56,671][09423] Updated weights for policy 0, policy_version 243487 (0.0025) [2024-06-28 14:24:57,921][09190] Fps is (10 sec: 39321.8, 60 sec: 42598.4, 300 sec: 42598.8). Total num frames: 3989307392. Throughput: 0: 42590.4. Samples: 268154980. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 14:24:57,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:25:00,583][09423] Updated weights for policy 0, policy_version 243497 (0.0033) [2024-06-28 14:25:02,924][09190] Fps is (10 sec: 45863.1, 60 sec: 42596.6, 300 sec: 42653.6). Total num frames: 3989553152. Throughput: 0: 42622.2. Samples: 268411660. Policy #0 lag: (min: 0.0, avg: 9.0, max: 21.0) [2024-06-28 14:25:02,924][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 14:25:04,350][09423] Updated weights for policy 0, policy_version 243507 (0.0034) [2024-06-28 14:25:07,921][09190] Fps is (10 sec: 44236.3, 60 sec: 42325.4, 300 sec: 42598.4). Total num frames: 3989749760. Throughput: 0: 42587.9. Samples: 268668120. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 14:25:07,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:25:08,587][09423] Updated weights for policy 0, policy_version 243517 (0.0034) [2024-06-28 14:25:11,814][09423] Updated weights for policy 0, policy_version 243527 (0.0030) [2024-06-28 14:25:12,921][09190] Fps is (10 sec: 40970.4, 60 sec: 42598.5, 300 sec: 42654.0). Total num frames: 3989962752. Throughput: 0: 42604.9. Samples: 268795440. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 14:25:12,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:25:16,267][09423] Updated weights for policy 0, policy_version 243537 (0.0027) [2024-06-28 14:25:17,921][09190] Fps is (10 sec: 44237.0, 60 sec: 42600.2, 300 sec: 42654.0). Total num frames: 3990192128. Throughput: 0: 42537.6. Samples: 269050920. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 14:25:17,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:25:19,295][09423] Updated weights for policy 0, policy_version 243547 (0.0042) [2024-06-28 14:25:22,921][09190] Fps is (10 sec: 40959.6, 60 sec: 42052.2, 300 sec: 42542.9). Total num frames: 3990372352. Throughput: 0: 42432.4. Samples: 269304400. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 14:25:22,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:25:24,374][09423] Updated weights for policy 0, policy_version 243557 (0.0036) [2024-06-28 14:25:27,199][09423] Updated weights for policy 0, policy_version 243567 (0.0030) [2024-06-28 14:25:27,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42871.4, 300 sec: 42654.5). Total num frames: 3990601728. Throughput: 0: 42422.6. Samples: 269423080. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 14:25:27,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 14:25:31,898][09423] Updated weights for policy 0, policy_version 243577 (0.0042) [2024-06-28 14:25:32,922][09190] Fps is (10 sec: 44236.3, 60 sec: 42325.1, 300 sec: 42598.4). Total num frames: 3990814720. Throughput: 0: 42661.2. Samples: 269683760. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 14:25:32,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 14:25:34,863][09423] Updated weights for policy 0, policy_version 243587 (0.0038) [2024-06-28 14:25:37,921][09190] Fps is (10 sec: 39321.7, 60 sec: 42052.3, 300 sec: 42487.3). Total num frames: 3990994944. Throughput: 0: 42537.6. Samples: 269943380. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 14:25:37,931][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 14:25:39,441][09423] Updated weights for policy 0, policy_version 243597 (0.0047) [2024-06-28 14:25:42,615][09423] Updated weights for policy 0, policy_version 243607 (0.0033) [2024-06-28 14:25:42,921][09190] Fps is (10 sec: 44237.7, 60 sec: 43417.6, 300 sec: 42654.0). Total num frames: 3991257088. Throughput: 0: 42513.3. Samples: 270068080. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 14:25:42,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:25:47,103][09423] Updated weights for policy 0, policy_version 243617 (0.0035) [2024-06-28 14:25:47,921][09190] Fps is (10 sec: 45875.0, 60 sec: 42325.3, 300 sec: 42598.4). Total num frames: 3991453696. Throughput: 0: 42625.0. Samples: 270329680. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 14:25:47,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:25:50,090][09423] Updated weights for policy 0, policy_version 243627 (0.0038) [2024-06-28 14:25:52,921][09190] Fps is (10 sec: 39321.4, 60 sec: 42598.3, 300 sec: 42543.0). Total num frames: 3991650304. Throughput: 0: 42618.7. Samples: 270585960. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 14:25:52,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 14:25:54,986][09423] Updated weights for policy 0, policy_version 243637 (0.0035) [2024-06-28 14:25:57,711][09423] Updated weights for policy 0, policy_version 243647 (0.0040) [2024-06-28 14:25:57,921][09190] Fps is (10 sec: 45875.3, 60 sec: 43417.5, 300 sec: 42709.8). Total num frames: 3991912448. Throughput: 0: 42526.6. Samples: 270709140. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 14:25:57,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 14:26:02,747][09423] Updated weights for policy 0, policy_version 243657 (0.0028) [2024-06-28 14:26:02,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42054.0, 300 sec: 42487.3). Total num frames: 3992076288. Throughput: 0: 42608.9. Samples: 270968320. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 14:26:02,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 14:26:05,541][09403] Signal inference workers to stop experience collection... (3750 times) [2024-06-28 14:26:05,542][09403] Signal inference workers to resume experience collection... (3750 times) [2024-06-28 14:26:05,586][09423] InferenceWorker_p0-w0: stopping experience collection (3750 times) [2024-06-28 14:26:05,586][09423] InferenceWorker_p0-w0: resuming experience collection (3750 times) [2024-06-28 14:26:05,686][09423] Updated weights for policy 0, policy_version 243667 (0.0030) [2024-06-28 14:26:07,921][09190] Fps is (10 sec: 39321.5, 60 sec: 42598.4, 300 sec: 42598.5). Total num frames: 3992305664. Throughput: 0: 42587.1. Samples: 271220820. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 14:26:07,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 14:26:10,193][09423] Updated weights for policy 0, policy_version 243677 (0.0041) [2024-06-28 14:26:12,921][09190] Fps is (10 sec: 45875.6, 60 sec: 42871.5, 300 sec: 42654.0). Total num frames: 3992535040. Throughput: 0: 42809.9. Samples: 271349520. Policy #0 lag: (min: 0.0, avg: 9.4, max: 22.0) [2024-06-28 14:26:12,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 14:26:13,453][09423] Updated weights for policy 0, policy_version 243687 (0.0029) [2024-06-28 14:26:17,851][09423] Updated weights for policy 0, policy_version 243697 (0.0026) [2024-06-28 14:26:17,921][09190] Fps is (10 sec: 42598.6, 60 sec: 42325.3, 300 sec: 42542.8). Total num frames: 3992731648. Throughput: 0: 42837.4. Samples: 271611440. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 14:26:17,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 14:26:17,937][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000243697_3992731648.pth... [2024-06-28 14:26:17,995][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000243072_3982491648.pth [2024-06-28 14:26:20,951][09423] Updated weights for policy 0, policy_version 243707 (0.0033) [2024-06-28 14:26:22,921][09190] Fps is (10 sec: 40959.4, 60 sec: 42871.5, 300 sec: 42654.0). Total num frames: 3992944640. Throughput: 0: 42706.6. Samples: 271865180. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 14:26:22,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 14:26:25,591][09423] Updated weights for policy 0, policy_version 243717 (0.0030) [2024-06-28 14:26:27,922][09190] Fps is (10 sec: 44236.5, 60 sec: 42871.4, 300 sec: 42653.9). Total num frames: 3993174016. Throughput: 0: 42810.1. Samples: 271994540. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 14:26:27,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 14:26:28,323][09423] Updated weights for policy 0, policy_version 243727 (0.0032) [2024-06-28 14:26:32,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42325.4, 300 sec: 42487.7). Total num frames: 3993354240. Throughput: 0: 42613.8. Samples: 272247300. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 14:26:32,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:26:33,478][09423] Updated weights for policy 0, policy_version 243737 (0.0025) [2024-06-28 14:26:36,629][09423] Updated weights for policy 0, policy_version 243747 (0.0030) [2024-06-28 14:26:37,922][09190] Fps is (10 sec: 40960.0, 60 sec: 43144.5, 300 sec: 42598.4). Total num frames: 3993583616. Throughput: 0: 42452.3. Samples: 272496320. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 14:26:37,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 14:26:41,040][09423] Updated weights for policy 0, policy_version 243757 (0.0027) [2024-06-28 14:26:42,924][09190] Fps is (10 sec: 45864.2, 60 sec: 42596.6, 300 sec: 42542.5). Total num frames: 3993812992. Throughput: 0: 42676.8. Samples: 272629700. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 14:26:42,924][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:26:44,170][09423] Updated weights for policy 0, policy_version 243767 (0.0043) [2024-06-28 14:26:47,921][09190] Fps is (10 sec: 39322.2, 60 sec: 42052.3, 300 sec: 42487.3). Total num frames: 3993976832. Throughput: 0: 42578.7. Samples: 272884360. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 14:26:47,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 14:26:48,554][09423] Updated weights for policy 0, policy_version 243777 (0.0039) [2024-06-28 14:26:51,894][09423] Updated weights for policy 0, policy_version 243787 (0.0036) [2024-06-28 14:26:52,924][09190] Fps is (10 sec: 40960.1, 60 sec: 42869.7, 300 sec: 42598.1). Total num frames: 3994222592. Throughput: 0: 42577.3. Samples: 273136900. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 14:26:52,924][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 14:26:56,144][09423] Updated weights for policy 0, policy_version 243797 (0.0047) [2024-06-28 14:26:57,921][09190] Fps is (10 sec: 45875.3, 60 sec: 42052.3, 300 sec: 42598.4). Total num frames: 3994435584. Throughput: 0: 42700.4. Samples: 273271040. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 14:26:57,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:26:59,336][09423] Updated weights for policy 0, policy_version 243807 (0.0031) [2024-06-28 14:27:02,921][09190] Fps is (10 sec: 40970.3, 60 sec: 42598.4, 300 sec: 42598.8). Total num frames: 3994632192. Throughput: 0: 42541.4. Samples: 273525800. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 14:27:02,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 14:27:04,041][09423] Updated weights for policy 0, policy_version 243817 (0.0034) [2024-06-28 14:27:07,173][09423] Updated weights for policy 0, policy_version 243827 (0.0045) [2024-06-28 14:27:07,921][09190] Fps is (10 sec: 44236.3, 60 sec: 42871.5, 300 sec: 42653.9). Total num frames: 3994877952. Throughput: 0: 42504.9. Samples: 273777900. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 14:27:07,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 14:27:12,085][09423] Updated weights for policy 0, policy_version 243837 (0.0033) [2024-06-28 14:27:12,921][09190] Fps is (10 sec: 44236.3, 60 sec: 42325.2, 300 sec: 42653.9). Total num frames: 3995074560. Throughput: 0: 42574.7. Samples: 273910400. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 14:27:12,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:27:14,989][09423] Updated weights for policy 0, policy_version 243847 (0.0041) [2024-06-28 14:27:17,922][09190] Fps is (10 sec: 39321.3, 60 sec: 42325.3, 300 sec: 42542.8). Total num frames: 3995271168. Throughput: 0: 42485.3. Samples: 274159140. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 14:27:17,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 14:27:19,607][09423] Updated weights for policy 0, policy_version 243857 (0.0034) [2024-06-28 14:27:22,682][09423] Updated weights for policy 0, policy_version 243867 (0.0039) [2024-06-28 14:27:22,922][09190] Fps is (10 sec: 44236.2, 60 sec: 42871.4, 300 sec: 42598.4). Total num frames: 3995516928. Throughput: 0: 42541.7. Samples: 274410700. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 14:27:22,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:27:27,241][09423] Updated weights for policy 0, policy_version 243877 (0.0036) [2024-06-28 14:27:27,921][09190] Fps is (10 sec: 42598.8, 60 sec: 42052.3, 300 sec: 42542.9). Total num frames: 3995697152. Throughput: 0: 42571.2. Samples: 274545300. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 14:27:27,926][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:27:30,374][09423] Updated weights for policy 0, policy_version 243887 (0.0031) [2024-06-28 14:27:32,921][09190] Fps is (10 sec: 40960.6, 60 sec: 42871.5, 300 sec: 42542.9). Total num frames: 3995926528. Throughput: 0: 42643.0. Samples: 274803300. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 14:27:32,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 14:27:34,740][09423] Updated weights for policy 0, policy_version 243897 (0.0028) [2024-06-28 14:27:37,921][09190] Fps is (10 sec: 45875.4, 60 sec: 42871.6, 300 sec: 42542.9). Total num frames: 3996155904. Throughput: 0: 42604.6. Samples: 275054000. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 14:27:37,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 14:27:38,065][09423] Updated weights for policy 0, policy_version 243907 (0.0035) [2024-06-28 14:27:42,493][09423] Updated weights for policy 0, policy_version 243917 (0.0044) [2024-06-28 14:27:42,921][09190] Fps is (10 sec: 42598.9, 60 sec: 42327.1, 300 sec: 42654.3). Total num frames: 3996352512. Throughput: 0: 42741.3. Samples: 275194400. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 14:27:42,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 14:27:43,298][09403] Signal inference workers to stop experience collection... (3800 times) [2024-06-28 14:27:43,343][09423] InferenceWorker_p0-w0: stopping experience collection (3800 times) [2024-06-28 14:27:43,349][09403] Signal inference workers to resume experience collection... (3800 times) [2024-06-28 14:27:43,357][09423] InferenceWorker_p0-w0: resuming experience collection (3800 times) [2024-06-28 14:27:45,991][09423] Updated weights for policy 0, policy_version 243927 (0.0026) [2024-06-28 14:27:47,924][09190] Fps is (10 sec: 42587.6, 60 sec: 43415.8, 300 sec: 42598.0). Total num frames: 3996581888. Throughput: 0: 42663.3. Samples: 275445760. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 14:27:47,925][09190] Avg episode reward: [(0, '0.760')] [2024-06-28 14:27:49,889][09423] Updated weights for policy 0, policy_version 243937 (0.0032) [2024-06-28 14:27:52,921][09190] Fps is (10 sec: 44236.2, 60 sec: 42873.2, 300 sec: 42542.9). Total num frames: 3996794880. Throughput: 0: 42795.1. Samples: 275703680. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 14:27:52,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 14:27:53,766][09423] Updated weights for policy 0, policy_version 243947 (0.0027) [2024-06-28 14:27:57,892][09423] Updated weights for policy 0, policy_version 243957 (0.0034) [2024-06-28 14:27:57,921][09190] Fps is (10 sec: 40970.4, 60 sec: 42598.4, 300 sec: 42654.0). Total num frames: 3996991488. Throughput: 0: 42631.2. Samples: 275828800. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 14:27:57,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:28:01,241][09423] Updated weights for policy 0, policy_version 243967 (0.0050) [2024-06-28 14:28:02,921][09190] Fps is (10 sec: 42598.4, 60 sec: 43144.4, 300 sec: 42653.9). Total num frames: 3997220864. Throughput: 0: 42862.3. Samples: 276087940. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 14:28:02,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:28:05,297][09423] Updated weights for policy 0, policy_version 243977 (0.0032) [2024-06-28 14:28:07,922][09190] Fps is (10 sec: 44236.0, 60 sec: 42598.3, 300 sec: 42542.8). Total num frames: 3997433856. Throughput: 0: 42932.1. Samples: 276342640. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 14:28:07,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:28:09,100][09423] Updated weights for policy 0, policy_version 243987 (0.0034) [2024-06-28 14:28:12,924][09190] Fps is (10 sec: 40950.1, 60 sec: 42596.7, 300 sec: 42653.6). Total num frames: 3997630464. Throughput: 0: 42803.0. Samples: 276471540. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 14:28:12,924][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:28:13,089][09423] Updated weights for policy 0, policy_version 243997 (0.0022) [2024-06-28 14:28:16,540][09423] Updated weights for policy 0, policy_version 244007 (0.0031) [2024-06-28 14:28:17,922][09190] Fps is (10 sec: 44236.8, 60 sec: 43417.6, 300 sec: 42653.9). Total num frames: 3997876224. Throughput: 0: 42803.9. Samples: 276729480. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 14:28:17,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:28:17,933][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000244011_3997876224.pth... [2024-06-28 14:28:17,985][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000243386_3987636224.pth [2024-06-28 14:28:20,580][09423] Updated weights for policy 0, policy_version 244017 (0.0027) [2024-06-28 14:28:22,921][09190] Fps is (10 sec: 44247.6, 60 sec: 42598.5, 300 sec: 42653.9). Total num frames: 3998072832. Throughput: 0: 42932.8. Samples: 276985980. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 14:28:22,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:28:24,099][09423] Updated weights for policy 0, policy_version 244027 (0.0044) [2024-06-28 14:28:27,922][09190] Fps is (10 sec: 39321.3, 60 sec: 42871.3, 300 sec: 42653.9). Total num frames: 3998269440. Throughput: 0: 42591.8. Samples: 277111040. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 14:28:27,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:28:28,147][09423] Updated weights for policy 0, policy_version 244037 (0.0041) [2024-06-28 14:28:31,982][09423] Updated weights for policy 0, policy_version 244047 (0.0049) [2024-06-28 14:28:32,921][09190] Fps is (10 sec: 42599.0, 60 sec: 42871.6, 300 sec: 42598.4). Total num frames: 3998498816. Throughput: 0: 42653.6. Samples: 277365060. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 14:28:32,922][09190] Avg episode reward: [(0, '0.759')] [2024-06-28 14:28:36,012][09423] Updated weights for policy 0, policy_version 244057 (0.0034) [2024-06-28 14:28:37,924][09190] Fps is (10 sec: 45864.7, 60 sec: 42869.7, 300 sec: 42764.6). Total num frames: 3998728192. Throughput: 0: 42612.8. Samples: 277621360. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 14:28:37,924][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:28:39,831][09423] Updated weights for policy 0, policy_version 244067 (0.0053) [2024-06-28 14:28:42,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42598.4, 300 sec: 42598.4). Total num frames: 3998908416. Throughput: 0: 42621.4. Samples: 277746760. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 14:28:42,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 14:28:43,799][09423] Updated weights for policy 0, policy_version 244077 (0.0031) [2024-06-28 14:28:47,594][09423] Updated weights for policy 0, policy_version 244087 (0.0033) [2024-06-28 14:28:47,921][09190] Fps is (10 sec: 40970.4, 60 sec: 42600.2, 300 sec: 42598.4). Total num frames: 3999137792. Throughput: 0: 42537.4. Samples: 278002120. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 14:28:47,922][09190] Avg episode reward: [(0, '0.762')] [2024-06-28 14:28:51,681][09423] Updated weights for policy 0, policy_version 244097 (0.0034) [2024-06-28 14:28:52,921][09190] Fps is (10 sec: 42598.2, 60 sec: 42325.4, 300 sec: 42653.9). Total num frames: 3999334400. Throughput: 0: 42651.7. Samples: 278261960. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 14:28:52,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 14:28:55,063][09423] Updated weights for policy 0, policy_version 244107 (0.0032) [2024-06-28 14:28:57,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42871.5, 300 sec: 42598.4). Total num frames: 3999563776. Throughput: 0: 42646.4. Samples: 278390520. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 14:28:57,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 14:28:58,869][09403] Signal inference workers to stop experience collection... (3850 times) [2024-06-28 14:28:58,870][09403] Signal inference workers to resume experience collection... (3850 times) [2024-06-28 14:28:58,910][09423] InferenceWorker_p0-w0: stopping experience collection (3850 times) [2024-06-28 14:28:58,916][09423] InferenceWorker_p0-w0: resuming experience collection (3850 times) [2024-06-28 14:28:59,017][09423] Updated weights for policy 0, policy_version 244117 (0.0042) [2024-06-28 14:29:02,427][09423] Updated weights for policy 0, policy_version 244127 (0.0034) [2024-06-28 14:29:02,921][09190] Fps is (10 sec: 45874.8, 60 sec: 42871.5, 300 sec: 42653.9). Total num frames: 3999793152. Throughput: 0: 42695.2. Samples: 278650760. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 14:29:02,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 14:29:06,542][09423] Updated weights for policy 0, policy_version 244137 (0.0048) [2024-06-28 14:29:07,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42598.5, 300 sec: 42654.0). Total num frames: 3999989760. Throughput: 0: 42645.4. Samples: 278905020. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 14:29:07,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 14:29:10,148][09423] Updated weights for policy 0, policy_version 244147 (0.0043) [2024-06-28 14:29:12,921][09190] Fps is (10 sec: 39321.7, 60 sec: 42600.1, 300 sec: 42543.2). Total num frames: 4000186368. Throughput: 0: 42761.5. Samples: 279035300. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 14:29:12,922][09190] Avg episode reward: [(0, '0.756')] [2024-06-28 14:29:14,108][09423] Updated weights for policy 0, policy_version 244157 (0.0029) [2024-06-28 14:29:17,803][09423] Updated weights for policy 0, policy_version 244167 (0.0035) [2024-06-28 14:29:17,921][09190] Fps is (10 sec: 44236.9, 60 sec: 42598.5, 300 sec: 42653.9). Total num frames: 4000432128. Throughput: 0: 42731.1. Samples: 279287960. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 14:29:17,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 14:29:22,057][09423] Updated weights for policy 0, policy_version 244177 (0.0037) [2024-06-28 14:29:22,922][09190] Fps is (10 sec: 42598.1, 60 sec: 42325.3, 300 sec: 42653.9). Total num frames: 4000612352. Throughput: 0: 42644.5. Samples: 279540260. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 14:29:22,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:29:25,621][09423] Updated weights for policy 0, policy_version 244187 (0.0037) [2024-06-28 14:29:27,921][09190] Fps is (10 sec: 39321.2, 60 sec: 42598.5, 300 sec: 42542.8). Total num frames: 4000825344. Throughput: 0: 42592.3. Samples: 279663420. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 14:29:27,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 14:29:30,170][09423] Updated weights for policy 0, policy_version 244197 (0.0039) [2024-06-28 14:29:32,921][09190] Fps is (10 sec: 44237.3, 60 sec: 42598.3, 300 sec: 42653.9). Total num frames: 4001054720. Throughput: 0: 42761.3. Samples: 279926380. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 14:29:32,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:29:33,185][09423] Updated weights for policy 0, policy_version 244207 (0.0032) [2024-06-28 14:29:37,557][09423] Updated weights for policy 0, policy_version 244217 (0.0033) [2024-06-28 14:29:37,924][09190] Fps is (10 sec: 44226.0, 60 sec: 42325.3, 300 sec: 42764.7). Total num frames: 4001267712. Throughput: 0: 42759.4. Samples: 280186240. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 14:29:37,924][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:29:40,643][09423] Updated weights for policy 0, policy_version 244227 (0.0037) [2024-06-28 14:29:42,921][09190] Fps is (10 sec: 42598.6, 60 sec: 42871.5, 300 sec: 42598.4). Total num frames: 4001480704. Throughput: 0: 42575.1. Samples: 280306400. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 14:29:42,922][09190] Avg episode reward: [(0, '0.731')] [2024-06-28 14:29:45,145][09423] Updated weights for policy 0, policy_version 244237 (0.0055) [2024-06-28 14:29:47,921][09190] Fps is (10 sec: 44247.7, 60 sec: 42871.4, 300 sec: 42765.0). Total num frames: 4001710080. Throughput: 0: 42537.4. Samples: 280564940. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 14:29:47,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:29:48,469][09423] Updated weights for policy 0, policy_version 244247 (0.0046) [2024-06-28 14:29:52,754][09423] Updated weights for policy 0, policy_version 244257 (0.0031) [2024-06-28 14:29:52,922][09190] Fps is (10 sec: 42597.6, 60 sec: 42871.4, 300 sec: 42709.5). Total num frames: 4001906688. Throughput: 0: 42634.1. Samples: 280823560. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 14:29:52,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 14:29:56,123][09423] Updated weights for policy 0, policy_version 244267 (0.0033) [2024-06-28 14:29:57,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42598.3, 300 sec: 42598.8). Total num frames: 4002119680. Throughput: 0: 42650.7. Samples: 280954580. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 14:29:57,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 14:30:00,319][09423] Updated weights for policy 0, policy_version 244277 (0.0037) [2024-06-28 14:30:02,921][09190] Fps is (10 sec: 40960.9, 60 sec: 42052.4, 300 sec: 42598.4). Total num frames: 4002316288. Throughput: 0: 42605.8. Samples: 281205220. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 14:30:02,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:30:03,851][09423] Updated weights for policy 0, policy_version 244287 (0.0041) [2024-06-28 14:30:07,901][09423] Updated weights for policy 0, policy_version 244297 (0.0042) [2024-06-28 14:30:07,921][09190] Fps is (10 sec: 44237.2, 60 sec: 42871.5, 300 sec: 42709.5). Total num frames: 4002562048. Throughput: 0: 42773.0. Samples: 281465040. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 14:30:07,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:30:11,476][09423] Updated weights for policy 0, policy_version 244307 (0.0040) [2024-06-28 14:30:12,921][09190] Fps is (10 sec: 44236.7, 60 sec: 42871.6, 300 sec: 42598.4). Total num frames: 4002758656. Throughput: 0: 42837.9. Samples: 281591120. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 14:30:12,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 14:30:15,738][09423] Updated weights for policy 0, policy_version 244317 (0.0028) [2024-06-28 14:30:17,922][09190] Fps is (10 sec: 40959.3, 60 sec: 42325.2, 300 sec: 42709.5). Total num frames: 4002971648. Throughput: 0: 42816.8. Samples: 281853140. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 14:30:17,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 14:30:17,937][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000244322_4002971648.pth... [2024-06-28 14:30:17,998][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000243697_3992731648.pth [2024-06-28 14:30:19,054][09423] Updated weights for policy 0, policy_version 244327 (0.0028) [2024-06-28 14:30:20,267][09403] Signal inference workers to stop experience collection... (3900 times) [2024-06-28 14:30:20,315][09423] InferenceWorker_p0-w0: stopping experience collection (3900 times) [2024-06-28 14:30:20,376][09403] Signal inference workers to resume experience collection... (3900 times) [2024-06-28 14:30:20,376][09423] InferenceWorker_p0-w0: resuming experience collection (3900 times) [2024-06-28 14:30:22,921][09190] Fps is (10 sec: 42597.8, 60 sec: 42871.5, 300 sec: 42653.9). Total num frames: 4003184640. Throughput: 0: 42660.9. Samples: 282105880. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 14:30:22,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 14:30:23,550][09423] Updated weights for policy 0, policy_version 244337 (0.0037) [2024-06-28 14:30:26,691][09423] Updated weights for policy 0, policy_version 244347 (0.0027) [2024-06-28 14:30:27,921][09190] Fps is (10 sec: 44237.3, 60 sec: 43144.6, 300 sec: 42709.5). Total num frames: 4003414016. Throughput: 0: 42777.3. Samples: 282231380. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 14:30:27,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 14:30:31,306][09423] Updated weights for policy 0, policy_version 244357 (0.0026) [2024-06-28 14:30:32,921][09190] Fps is (10 sec: 42599.1, 60 sec: 42598.5, 300 sec: 42765.0). Total num frames: 4003610624. Throughput: 0: 42618.8. Samples: 282482780. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 14:30:32,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 14:30:34,546][09423] Updated weights for policy 0, policy_version 244367 (0.0026) [2024-06-28 14:30:37,921][09190] Fps is (10 sec: 39321.8, 60 sec: 42327.1, 300 sec: 42542.9). Total num frames: 4003807232. Throughput: 0: 42742.4. Samples: 282746960. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 14:30:37,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:30:38,763][09423] Updated weights for policy 0, policy_version 244377 (0.0043) [2024-06-28 14:30:42,480][09423] Updated weights for policy 0, policy_version 244387 (0.0036) [2024-06-28 14:30:42,922][09190] Fps is (10 sec: 44235.9, 60 sec: 42871.3, 300 sec: 42709.5). Total num frames: 4004052992. Throughput: 0: 42525.2. Samples: 282868220. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 14:30:42,922][09190] Avg episode reward: [(0, '0.759')] [2024-06-28 14:30:46,684][09423] Updated weights for policy 0, policy_version 244397 (0.0045) [2024-06-28 14:30:47,921][09190] Fps is (10 sec: 44236.8, 60 sec: 42325.4, 300 sec: 42709.5). Total num frames: 4004249600. Throughput: 0: 42512.9. Samples: 283118300. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 14:30:47,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 14:30:49,963][09423] Updated weights for policy 0, policy_version 244407 (0.0039) [2024-06-28 14:30:52,921][09190] Fps is (10 sec: 39322.4, 60 sec: 42325.5, 300 sec: 42487.3). Total num frames: 4004446208. Throughput: 0: 42655.6. Samples: 283384540. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 14:30:52,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:30:54,439][09423] Updated weights for policy 0, policy_version 244417 (0.0026) [2024-06-28 14:30:57,455][09423] Updated weights for policy 0, policy_version 244427 (0.0030) [2024-06-28 14:30:57,921][09190] Fps is (10 sec: 45874.6, 60 sec: 43144.5, 300 sec: 42820.5). Total num frames: 4004708352. Throughput: 0: 42715.4. Samples: 283513320. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 14:30:57,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 14:31:01,914][09423] Updated weights for policy 0, policy_version 244437 (0.0030) [2024-06-28 14:31:02,921][09190] Fps is (10 sec: 44236.7, 60 sec: 42871.4, 300 sec: 42654.0). Total num frames: 4004888576. Throughput: 0: 42710.4. Samples: 283775100. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 14:31:02,922][09190] Avg episode reward: [(0, '0.760')] [2024-06-28 14:31:05,212][09423] Updated weights for policy 0, policy_version 244447 (0.0037) [2024-06-28 14:31:07,924][09190] Fps is (10 sec: 37673.9, 60 sec: 42050.5, 300 sec: 42542.5). Total num frames: 4005085184. Throughput: 0: 42663.9. Samples: 284025860. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 14:31:07,925][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 14:31:09,414][09423] Updated weights for policy 0, policy_version 244457 (0.0032) [2024-06-28 14:31:12,753][09423] Updated weights for policy 0, policy_version 244467 (0.0032) [2024-06-28 14:31:12,923][09190] Fps is (10 sec: 45866.5, 60 sec: 43143.2, 300 sec: 42764.8). Total num frames: 4005347328. Throughput: 0: 42668.9. Samples: 284151560. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 14:31:12,924][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 14:31:17,277][09423] Updated weights for policy 0, policy_version 244477 (0.0037) [2024-06-28 14:31:17,921][09190] Fps is (10 sec: 45887.1, 60 sec: 42871.6, 300 sec: 42709.5). Total num frames: 4005543936. Throughput: 0: 42991.1. Samples: 284417380. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 14:31:17,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:31:20,587][09423] Updated weights for policy 0, policy_version 244487 (0.0027) [2024-06-28 14:31:20,591][09403] Signal inference workers to stop experience collection... (3950 times) [2024-06-28 14:31:20,592][09403] Signal inference workers to resume experience collection... (3950 times) [2024-06-28 14:31:20,635][09423] InferenceWorker_p0-w0: stopping experience collection (3950 times) [2024-06-28 14:31:20,635][09423] InferenceWorker_p0-w0: resuming experience collection (3950 times) [2024-06-28 14:31:22,921][09190] Fps is (10 sec: 40967.5, 60 sec: 42871.5, 300 sec: 42654.0). Total num frames: 4005756928. Throughput: 0: 42732.8. Samples: 284669940. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 14:31:22,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:31:24,891][09423] Updated weights for policy 0, policy_version 244497 (0.0032) [2024-06-28 14:31:27,921][09190] Fps is (10 sec: 44236.6, 60 sec: 42871.5, 300 sec: 42820.6). Total num frames: 4005986304. Throughput: 0: 42811.7. Samples: 284794740. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 14:31:27,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:31:28,062][09423] Updated weights for policy 0, policy_version 244507 (0.0051) [2024-06-28 14:31:32,679][09423] Updated weights for policy 0, policy_version 244517 (0.0046) [2024-06-28 14:31:32,921][09190] Fps is (10 sec: 42598.8, 60 sec: 42871.5, 300 sec: 42709.5). Total num frames: 4006182912. Throughput: 0: 43253.8. Samples: 285064720. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 14:31:32,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:31:35,570][09423] Updated weights for policy 0, policy_version 244527 (0.0034) [2024-06-28 14:31:37,921][09190] Fps is (10 sec: 39321.7, 60 sec: 42871.5, 300 sec: 42598.8). Total num frames: 4006379520. Throughput: 0: 42925.8. Samples: 285316200. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 14:31:37,922][09190] Avg episode reward: [(0, '0.756')] [2024-06-28 14:31:40,204][09423] Updated weights for policy 0, policy_version 244537 (0.0038) [2024-06-28 14:31:42,921][09190] Fps is (10 sec: 44236.6, 60 sec: 42871.6, 300 sec: 42876.1). Total num frames: 4006625280. Throughput: 0: 42783.2. Samples: 285438560. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 14:31:42,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:31:43,473][09423] Updated weights for policy 0, policy_version 244547 (0.0039) [2024-06-28 14:31:47,770][09423] Updated weights for policy 0, policy_version 244557 (0.0028) [2024-06-28 14:31:47,921][09190] Fps is (10 sec: 44236.6, 60 sec: 42871.4, 300 sec: 42709.8). Total num frames: 4006821888. Throughput: 0: 42780.4. Samples: 285700220. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 14:31:47,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 14:31:51,149][09423] Updated weights for policy 0, policy_version 244567 (0.0038) [2024-06-28 14:31:52,921][09190] Fps is (10 sec: 40959.8, 60 sec: 43144.5, 300 sec: 42709.5). Total num frames: 4007034880. Throughput: 0: 42743.7. Samples: 285949220. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 14:31:52,922][09190] Avg episode reward: [(0, '0.756')] [2024-06-28 14:31:55,841][09423] Updated weights for policy 0, policy_version 244577 (0.0050) [2024-06-28 14:31:57,921][09190] Fps is (10 sec: 42598.8, 60 sec: 42325.4, 300 sec: 42765.0). Total num frames: 4007247872. Throughput: 0: 42887.6. Samples: 286081420. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 14:31:57,922][09190] Avg episode reward: [(0, '0.761')] [2024-06-28 14:31:58,791][09423] Updated weights for policy 0, policy_version 244587 (0.0042) [2024-06-28 14:32:02,921][09190] Fps is (10 sec: 39321.6, 60 sec: 42325.3, 300 sec: 42542.9). Total num frames: 4007428096. Throughput: 0: 42568.8. Samples: 286332980. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 14:32:02,924][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:32:03,676][09423] Updated weights for policy 0, policy_version 244597 (0.0041) [2024-06-28 14:32:06,326][09423] Updated weights for policy 0, policy_version 244607 (0.0031) [2024-06-28 14:32:07,921][09190] Fps is (10 sec: 44236.2, 60 sec: 43419.4, 300 sec: 42765.0). Total num frames: 4007690240. Throughput: 0: 42699.1. Samples: 286591400. Policy #0 lag: (min: 0.0, avg: 12.3, max: 24.0) [2024-06-28 14:32:07,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 14:32:10,988][09423] Updated weights for policy 0, policy_version 244617 (0.0026) [2024-06-28 14:32:12,921][09190] Fps is (10 sec: 47513.9, 60 sec: 42599.7, 300 sec: 42820.6). Total num frames: 4007903232. Throughput: 0: 42964.5. Samples: 286728140. Policy #0 lag: (min: 0.0, avg: 12.3, max: 24.0) [2024-06-28 14:32:12,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:32:13,823][09423] Updated weights for policy 0, policy_version 244627 (0.0035) [2024-06-28 14:32:17,921][09190] Fps is (10 sec: 40960.3, 60 sec: 42598.4, 300 sec: 42654.0). Total num frames: 4008099840. Throughput: 0: 42796.0. Samples: 286990540. Policy #0 lag: (min: 0.0, avg: 12.3, max: 24.0) [2024-06-28 14:32:17,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:32:17,928][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000244635_4008099840.pth... [2024-06-28 14:32:17,986][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000244011_3997876224.pth [2024-06-28 14:32:18,495][09423] Updated weights for policy 0, policy_version 244637 (0.0032) [2024-06-28 14:32:21,578][09423] Updated weights for policy 0, policy_version 244647 (0.0052) [2024-06-28 14:32:22,921][09190] Fps is (10 sec: 42598.0, 60 sec: 42871.5, 300 sec: 42820.6). Total num frames: 4008329216. Throughput: 0: 42531.0. Samples: 287230100. Policy #0 lag: (min: 0.0, avg: 12.3, max: 24.0) [2024-06-28 14:32:22,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:32:26,170][09423] Updated weights for policy 0, policy_version 244657 (0.0027) [2024-06-28 14:32:27,921][09190] Fps is (10 sec: 44236.5, 60 sec: 42598.4, 300 sec: 42765.0). Total num frames: 4008542208. Throughput: 0: 42862.2. Samples: 287367360. Policy #0 lag: (min: 0.0, avg: 12.3, max: 24.0) [2024-06-28 14:32:27,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 14:32:29,388][09423] Updated weights for policy 0, policy_version 244667 (0.0045) [2024-06-28 14:32:32,921][09190] Fps is (10 sec: 40960.6, 60 sec: 42598.4, 300 sec: 42653.9). Total num frames: 4008738816. Throughput: 0: 42702.8. Samples: 287621840. Policy #0 lag: (min: 0.0, avg: 12.3, max: 24.0) [2024-06-28 14:32:32,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 14:32:34,201][09423] Updated weights for policy 0, policy_version 244677 (0.0037) [2024-06-28 14:32:37,260][09423] Updated weights for policy 0, policy_version 244687 (0.0039) [2024-06-28 14:32:37,921][09190] Fps is (10 sec: 42598.7, 60 sec: 43144.5, 300 sec: 42765.0). Total num frames: 4008968192. Throughput: 0: 42577.0. Samples: 287865180. Policy #0 lag: (min: 0.0, avg: 12.3, max: 24.0) [2024-06-28 14:32:37,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:32:41,463][09403] Signal inference workers to stop experience collection... (4000 times) [2024-06-28 14:32:41,463][09403] Signal inference workers to resume experience collection... (4000 times) [2024-06-28 14:32:41,490][09423] InferenceWorker_p0-w0: stopping experience collection (4000 times) [2024-06-28 14:32:41,490][09423] InferenceWorker_p0-w0: resuming experience collection (4000 times) [2024-06-28 14:32:41,833][09423] Updated weights for policy 0, policy_version 244697 (0.0036) [2024-06-28 14:32:42,922][09190] Fps is (10 sec: 42595.5, 60 sec: 42324.9, 300 sec: 42654.2). Total num frames: 4009164800. Throughput: 0: 42670.9. Samples: 288001640. Policy #0 lag: (min: 0.0, avg: 12.3, max: 24.0) [2024-06-28 14:32:42,931][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 14:32:44,768][09423] Updated weights for policy 0, policy_version 244707 (0.0027) [2024-06-28 14:32:47,921][09190] Fps is (10 sec: 40959.8, 60 sec: 42598.4, 300 sec: 42653.9). Total num frames: 4009377792. Throughput: 0: 42829.3. Samples: 288260300. Policy #0 lag: (min: 0.0, avg: 12.3, max: 24.0) [2024-06-28 14:32:47,931][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:32:49,267][09423] Updated weights for policy 0, policy_version 244717 (0.0032) [2024-06-28 14:32:52,227][09423] Updated weights for policy 0, policy_version 244727 (0.0036) [2024-06-28 14:32:52,921][09190] Fps is (10 sec: 45877.7, 60 sec: 43144.5, 300 sec: 42820.5). Total num frames: 4009623552. Throughput: 0: 42785.8. Samples: 288516760. Policy #0 lag: (min: 0.0, avg: 12.3, max: 24.0) [2024-06-28 14:32:52,927][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 14:32:57,170][09423] Updated weights for policy 0, policy_version 244737 (0.0031) [2024-06-28 14:32:57,921][09190] Fps is (10 sec: 42598.8, 60 sec: 42598.4, 300 sec: 42654.0). Total num frames: 4009803776. Throughput: 0: 42708.1. Samples: 288650000. Policy #0 lag: (min: 0.0, avg: 12.3, max: 24.0) [2024-06-28 14:32:57,922][09190] Avg episode reward: [(0, '0.759')] [2024-06-28 14:33:00,080][09423] Updated weights for policy 0, policy_version 244747 (0.0033) [2024-06-28 14:33:02,921][09190] Fps is (10 sec: 39321.5, 60 sec: 43144.5, 300 sec: 42653.9). Total num frames: 4010016768. Throughput: 0: 42381.2. Samples: 288897700. Policy #0 lag: (min: 0.0, avg: 12.3, max: 24.0) [2024-06-28 14:33:02,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:33:04,688][09423] Updated weights for policy 0, policy_version 244757 (0.0037) [2024-06-28 14:33:07,712][09423] Updated weights for policy 0, policy_version 244767 (0.0027) [2024-06-28 14:33:07,921][09190] Fps is (10 sec: 45875.2, 60 sec: 42871.6, 300 sec: 42820.9). Total num frames: 4010262528. Throughput: 0: 42614.8. Samples: 289147760. Policy #0 lag: (min: 0.0, avg: 12.3, max: 24.0) [2024-06-28 14:33:07,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:33:12,210][09423] Updated weights for policy 0, policy_version 244777 (0.0038) [2024-06-28 14:33:12,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42052.2, 300 sec: 42542.9). Total num frames: 4010426368. Throughput: 0: 42576.5. Samples: 289283300. Policy #0 lag: (min: 0.0, avg: 12.3, max: 24.0) [2024-06-28 14:33:12,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:33:15,745][09423] Updated weights for policy 0, policy_version 244787 (0.0044) [2024-06-28 14:33:17,921][09190] Fps is (10 sec: 39321.5, 60 sec: 42598.4, 300 sec: 42654.0). Total num frames: 4010655744. Throughput: 0: 42456.9. Samples: 289532400. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 14:33:17,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 14:33:20,322][09423] Updated weights for policy 0, policy_version 244797 (0.0035) [2024-06-28 14:33:22,921][09190] Fps is (10 sec: 45875.4, 60 sec: 42598.5, 300 sec: 42765.0). Total num frames: 4010885120. Throughput: 0: 42727.1. Samples: 289787900. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 14:33:22,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:33:23,225][09423] Updated weights for policy 0, policy_version 244807 (0.0032) [2024-06-28 14:33:27,760][09423] Updated weights for policy 0, policy_version 244817 (0.0032) [2024-06-28 14:33:27,921][09190] Fps is (10 sec: 42597.9, 60 sec: 42325.3, 300 sec: 42653.9). Total num frames: 4011081728. Throughput: 0: 42604.5. Samples: 289918820. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 14:33:27,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 14:33:30,644][09423] Updated weights for policy 0, policy_version 244827 (0.0041) [2024-06-28 14:33:32,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42598.3, 300 sec: 42598.8). Total num frames: 4011294720. Throughput: 0: 42587.6. Samples: 290176740. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 14:33:32,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:33:35,309][09423] Updated weights for policy 0, policy_version 244837 (0.0045) [2024-06-28 14:33:37,921][09190] Fps is (10 sec: 45875.7, 60 sec: 42871.5, 300 sec: 42820.6). Total num frames: 4011540480. Throughput: 0: 42492.1. Samples: 290428900. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 14:33:37,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:33:38,542][09423] Updated weights for policy 0, policy_version 244847 (0.0032) [2024-06-28 14:33:42,921][09190] Fps is (10 sec: 40959.7, 60 sec: 42325.7, 300 sec: 42598.4). Total num frames: 4011704320. Throughput: 0: 42341.6. Samples: 290555380. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 14:33:42,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 14:33:43,381][09423] Updated weights for policy 0, policy_version 244857 (0.0040) [2024-06-28 14:33:44,744][09403] Signal inference workers to stop experience collection... (4050 times) [2024-06-28 14:33:44,744][09403] Signal inference workers to resume experience collection... (4050 times) [2024-06-28 14:33:44,791][09423] InferenceWorker_p0-w0: stopping experience collection (4050 times) [2024-06-28 14:33:44,791][09423] InferenceWorker_p0-w0: resuming experience collection (4050 times) [2024-06-28 14:33:46,438][09423] Updated weights for policy 0, policy_version 244867 (0.0021) [2024-06-28 14:33:47,921][09190] Fps is (10 sec: 39321.6, 60 sec: 42598.5, 300 sec: 42709.5). Total num frames: 4011933696. Throughput: 0: 42399.7. Samples: 290805680. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 14:33:47,922][09190] Avg episode reward: [(0, '0.765')] [2024-06-28 14:33:51,040][09423] Updated weights for policy 0, policy_version 244877 (0.0030) [2024-06-28 14:33:52,921][09190] Fps is (10 sec: 45875.7, 60 sec: 42325.4, 300 sec: 42709.5). Total num frames: 4012163072. Throughput: 0: 42459.1. Samples: 291058420. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 14:33:52,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:33:54,010][09423] Updated weights for policy 0, policy_version 244887 (0.0032) [2024-06-28 14:33:57,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42325.3, 300 sec: 42542.9). Total num frames: 4012343296. Throughput: 0: 42494.3. Samples: 291195540. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 14:33:57,922][09190] Avg episode reward: [(0, '0.756')] [2024-06-28 14:33:58,854][09423] Updated weights for policy 0, policy_version 244897 (0.0032) [2024-06-28 14:34:01,682][09423] Updated weights for policy 0, policy_version 244907 (0.0031) [2024-06-28 14:34:02,924][09190] Fps is (10 sec: 40949.5, 60 sec: 42596.6, 300 sec: 42653.6). Total num frames: 4012572672. Throughput: 0: 42464.2. Samples: 291443400. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 14:34:02,925][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:34:06,522][09423] Updated weights for policy 0, policy_version 244917 (0.0036) [2024-06-28 14:34:07,921][09190] Fps is (10 sec: 45874.5, 60 sec: 42325.2, 300 sec: 42765.0). Total num frames: 4012802048. Throughput: 0: 42599.0. Samples: 291704860. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 14:34:07,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:34:09,191][09423] Updated weights for policy 0, policy_version 244927 (0.0033) [2024-06-28 14:34:12,921][09190] Fps is (10 sec: 42609.3, 60 sec: 42871.5, 300 sec: 42598.4). Total num frames: 4012998656. Throughput: 0: 42783.6. Samples: 291844080. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 14:34:12,922][09190] Avg episode reward: [(0, '0.759')] [2024-06-28 14:34:13,994][09423] Updated weights for policy 0, policy_version 244937 (0.0037) [2024-06-28 14:34:16,548][09423] Updated weights for policy 0, policy_version 244947 (0.0039) [2024-06-28 14:34:17,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42871.4, 300 sec: 42765.0). Total num frames: 4013228032. Throughput: 0: 42485.7. Samples: 292088600. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 14:34:17,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:34:17,936][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000244948_4013228032.pth... [2024-06-28 14:34:17,995][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000244322_4002971648.pth [2024-06-28 14:34:21,820][09423] Updated weights for policy 0, policy_version 244957 (0.0039) [2024-06-28 14:34:22,921][09190] Fps is (10 sec: 45875.5, 60 sec: 42871.5, 300 sec: 42820.6). Total num frames: 4013457408. Throughput: 0: 42659.1. Samples: 292348560. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 14:34:22,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 14:34:24,701][09423] Updated weights for policy 0, policy_version 244967 (0.0043) [2024-06-28 14:34:27,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42598.4, 300 sec: 42653.9). Total num frames: 4013637632. Throughput: 0: 42696.5. Samples: 292476720. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 14:34:27,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:34:29,443][09423] Updated weights for policy 0, policy_version 244977 (0.0033) [2024-06-28 14:34:32,431][09423] Updated weights for policy 0, policy_version 244987 (0.0038) [2024-06-28 14:34:32,921][09190] Fps is (10 sec: 42598.1, 60 sec: 43144.5, 300 sec: 42765.4). Total num frames: 4013883392. Throughput: 0: 42648.4. Samples: 292724860. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 14:34:32,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:34:37,188][09423] Updated weights for policy 0, policy_version 244997 (0.0047) [2024-06-28 14:34:37,922][09190] Fps is (10 sec: 44236.8, 60 sec: 42325.3, 300 sec: 42709.5). Total num frames: 4014080000. Throughput: 0: 42812.9. Samples: 292985000. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 14:34:37,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 14:34:40,150][09423] Updated weights for policy 0, policy_version 245007 (0.0047) [2024-06-28 14:34:42,921][09190] Fps is (10 sec: 39321.5, 60 sec: 42871.5, 300 sec: 42598.4). Total num frames: 4014276608. Throughput: 0: 42557.7. Samples: 293110640. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 14:34:42,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:34:44,659][09423] Updated weights for policy 0, policy_version 245017 (0.0028) [2024-06-28 14:34:47,631][09423] Updated weights for policy 0, policy_version 245027 (0.0025) [2024-06-28 14:34:47,921][09190] Fps is (10 sec: 45875.4, 60 sec: 43417.6, 300 sec: 42820.6). Total num frames: 4014538752. Throughput: 0: 42949.1. Samples: 293376000. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 14:34:47,922][09190] Avg episode reward: [(0, '0.759')] [2024-06-28 14:34:52,488][09423] Updated weights for policy 0, policy_version 245037 (0.0041) [2024-06-28 14:34:52,921][09190] Fps is (10 sec: 44236.7, 60 sec: 42598.3, 300 sec: 42709.5). Total num frames: 4014718976. Throughput: 0: 42898.7. Samples: 293635300. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 14:34:52,922][09190] Avg episode reward: [(0, '0.756')] [2024-06-28 14:34:54,419][09403] Signal inference workers to stop experience collection... (4100 times) [2024-06-28 14:34:54,442][09423] InferenceWorker_p0-w0: stopping experience collection (4100 times) [2024-06-28 14:34:54,476][09403] Signal inference workers to resume experience collection... (4100 times) [2024-06-28 14:34:54,476][09423] InferenceWorker_p0-w0: resuming experience collection (4100 times) [2024-06-28 14:34:55,361][09423] Updated weights for policy 0, policy_version 245047 (0.0050) [2024-06-28 14:34:57,922][09190] Fps is (10 sec: 37682.4, 60 sec: 42871.3, 300 sec: 42709.4). Total num frames: 4014915584. Throughput: 0: 42373.2. Samples: 293750880. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 14:34:57,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:34:59,955][09423] Updated weights for policy 0, policy_version 245057 (0.0033) [2024-06-28 14:35:02,921][09190] Fps is (10 sec: 42598.9, 60 sec: 42873.3, 300 sec: 42653.9). Total num frames: 4015144960. Throughput: 0: 42728.6. Samples: 294011380. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 14:35:02,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 14:35:03,104][09423] Updated weights for policy 0, policy_version 245067 (0.0034) [2024-06-28 14:35:07,922][09190] Fps is (10 sec: 40960.1, 60 sec: 42052.2, 300 sec: 42598.4). Total num frames: 4015325184. Throughput: 0: 42844.7. Samples: 294276580. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 14:35:07,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:35:07,938][09423] Updated weights for policy 0, policy_version 245077 (0.0028) [2024-06-28 14:35:10,564][09423] Updated weights for policy 0, policy_version 245087 (0.0026) [2024-06-28 14:35:12,921][09190] Fps is (10 sec: 40959.7, 60 sec: 42598.4, 300 sec: 42654.0). Total num frames: 4015554560. Throughput: 0: 42527.6. Samples: 294390460. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 14:35:12,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 14:35:15,234][09423] Updated weights for policy 0, policy_version 245097 (0.0035) [2024-06-28 14:35:17,924][09190] Fps is (10 sec: 47503.0, 60 sec: 42869.8, 300 sec: 42764.7). Total num frames: 4015800320. Throughput: 0: 42796.9. Samples: 294650820. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 14:35:17,924][09190] Avg episode reward: [(0, '0.759')] [2024-06-28 14:35:18,375][09423] Updated weights for policy 0, policy_version 245107 (0.0030) [2024-06-28 14:35:22,682][09423] Updated weights for policy 0, policy_version 245117 (0.0048) [2024-06-28 14:35:22,921][09190] Fps is (10 sec: 44236.8, 60 sec: 42325.3, 300 sec: 42653.9). Total num frames: 4015996928. Throughput: 0: 42938.2. Samples: 294917220. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 14:35:22,922][09190] Avg episode reward: [(0, '0.756')] [2024-06-28 14:35:25,801][09423] Updated weights for policy 0, policy_version 245127 (0.0026) [2024-06-28 14:35:27,921][09190] Fps is (10 sec: 40969.5, 60 sec: 42871.4, 300 sec: 42709.5). Total num frames: 4016209920. Throughput: 0: 42981.3. Samples: 295044800. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 14:35:27,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 14:35:30,589][09423] Updated weights for policy 0, policy_version 245137 (0.0031) [2024-06-28 14:35:32,921][09190] Fps is (10 sec: 45875.1, 60 sec: 42871.4, 300 sec: 42876.1). Total num frames: 4016455680. Throughput: 0: 42816.8. Samples: 295302760. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 14:35:32,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:35:33,266][09423] Updated weights for policy 0, policy_version 245147 (0.0042) [2024-06-28 14:35:37,921][09190] Fps is (10 sec: 40960.9, 60 sec: 42325.4, 300 sec: 42598.4). Total num frames: 4016619520. Throughput: 0: 42828.2. Samples: 295562560. Policy #0 lag: (min: 0.0, avg: 8.2, max: 22.0) [2024-06-28 14:35:37,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:35:38,098][09423] Updated weights for policy 0, policy_version 245157 (0.0034) [2024-06-28 14:35:41,727][09423] Updated weights for policy 0, policy_version 245167 (0.0034) [2024-06-28 14:35:42,921][09190] Fps is (10 sec: 39321.6, 60 sec: 42871.5, 300 sec: 42709.5). Total num frames: 4016848896. Throughput: 0: 42894.4. Samples: 295681120. Policy #0 lag: (min: 0.0, avg: 8.2, max: 22.0) [2024-06-28 14:35:42,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 14:35:45,998][09423] Updated weights for policy 0, policy_version 245177 (0.0041) [2024-06-28 14:35:47,921][09190] Fps is (10 sec: 45874.5, 60 sec: 42325.3, 300 sec: 42820.5). Total num frames: 4017078272. Throughput: 0: 42884.4. Samples: 295941180. Policy #0 lag: (min: 0.0, avg: 8.2, max: 22.0) [2024-06-28 14:35:47,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 14:35:49,183][09423] Updated weights for policy 0, policy_version 245187 (0.0026) [2024-06-28 14:35:52,922][09190] Fps is (10 sec: 40959.6, 60 sec: 42325.3, 300 sec: 42542.9). Total num frames: 4017258496. Throughput: 0: 42644.5. Samples: 296195580. Policy #0 lag: (min: 0.0, avg: 8.2, max: 22.0) [2024-06-28 14:35:52,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 14:35:53,640][09423] Updated weights for policy 0, policy_version 245197 (0.0047) [2024-06-28 14:35:57,064][09423] Updated weights for policy 0, policy_version 245207 (0.0040) [2024-06-28 14:35:57,921][09190] Fps is (10 sec: 40959.6, 60 sec: 42871.5, 300 sec: 42709.5). Total num frames: 4017487872. Throughput: 0: 42708.4. Samples: 296312340. Policy #0 lag: (min: 0.0, avg: 8.2, max: 22.0) [2024-06-28 14:35:57,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 14:36:01,214][09423] Updated weights for policy 0, policy_version 245217 (0.0040) [2024-06-28 14:36:02,921][09190] Fps is (10 sec: 45875.6, 60 sec: 42871.4, 300 sec: 42820.9). Total num frames: 4017717248. Throughput: 0: 42783.6. Samples: 296575980. Policy #0 lag: (min: 0.0, avg: 8.2, max: 22.0) [2024-06-28 14:36:02,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 14:36:04,727][09423] Updated weights for policy 0, policy_version 245227 (0.0032) [2024-06-28 14:36:07,921][09190] Fps is (10 sec: 40960.7, 60 sec: 42871.6, 300 sec: 42543.1). Total num frames: 4017897472. Throughput: 0: 42755.2. Samples: 296841200. Policy #0 lag: (min: 0.0, avg: 8.2, max: 22.0) [2024-06-28 14:36:07,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:36:09,029][09423] Updated weights for policy 0, policy_version 245237 (0.0024) [2024-06-28 14:36:12,354][09423] Updated weights for policy 0, policy_version 245247 (0.0042) [2024-06-28 14:36:12,921][09190] Fps is (10 sec: 42598.5, 60 sec: 43144.5, 300 sec: 42709.5). Total num frames: 4018143232. Throughput: 0: 42569.8. Samples: 296960440. Policy #0 lag: (min: 0.0, avg: 8.2, max: 22.0) [2024-06-28 14:36:12,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 14:36:16,592][09423] Updated weights for policy 0, policy_version 245257 (0.0036) [2024-06-28 14:36:17,922][09190] Fps is (10 sec: 44235.7, 60 sec: 42326.9, 300 sec: 42653.9). Total num frames: 4018339840. Throughput: 0: 42599.4. Samples: 297219740. Policy #0 lag: (min: 0.0, avg: 8.2, max: 22.0) [2024-06-28 14:36:17,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 14:36:17,950][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000245261_4018356224.pth... [2024-06-28 14:36:18,008][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000244635_4008099840.pth [2024-06-28 14:36:18,718][09403] Signal inference workers to stop experience collection... (4150 times) [2024-06-28 14:36:18,750][09423] InferenceWorker_p0-w0: stopping experience collection (4150 times) [2024-06-28 14:36:18,777][09403] Signal inference workers to resume experience collection... (4150 times) [2024-06-28 14:36:18,777][09423] InferenceWorker_p0-w0: resuming experience collection (4150 times) [2024-06-28 14:36:19,704][09423] Updated weights for policy 0, policy_version 245267 (0.0039) [2024-06-28 14:36:22,921][09190] Fps is (10 sec: 39321.4, 60 sec: 42325.3, 300 sec: 42542.9). Total num frames: 4018536448. Throughput: 0: 42485.6. Samples: 297474420. Policy #0 lag: (min: 0.0, avg: 8.2, max: 22.0) [2024-06-28 14:36:22,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:36:24,502][09423] Updated weights for policy 0, policy_version 245277 (0.0040) [2024-06-28 14:36:27,707][09423] Updated weights for policy 0, policy_version 245287 (0.0026) [2024-06-28 14:36:27,921][09190] Fps is (10 sec: 44237.4, 60 sec: 42871.5, 300 sec: 42709.5). Total num frames: 4018782208. Throughput: 0: 42665.3. Samples: 297601060. Policy #0 lag: (min: 0.0, avg: 8.2, max: 22.0) [2024-06-28 14:36:27,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:36:32,258][09423] Updated weights for policy 0, policy_version 245297 (0.0044) [2024-06-28 14:36:32,921][09190] Fps is (10 sec: 44237.6, 60 sec: 42052.4, 300 sec: 42709.5). Total num frames: 4018978816. Throughput: 0: 42600.6. Samples: 297858200. Policy #0 lag: (min: 0.0, avg: 8.2, max: 22.0) [2024-06-28 14:36:32,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 14:36:35,113][09423] Updated weights for policy 0, policy_version 245307 (0.0034) [2024-06-28 14:36:37,921][09190] Fps is (10 sec: 40960.3, 60 sec: 42871.4, 300 sec: 42598.4). Total num frames: 4019191808. Throughput: 0: 42660.6. Samples: 298115300. Policy #0 lag: (min: 0.0, avg: 8.2, max: 22.0) [2024-06-28 14:36:37,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:36:39,749][09423] Updated weights for policy 0, policy_version 245317 (0.0031) [2024-06-28 14:36:42,921][09190] Fps is (10 sec: 44236.1, 60 sec: 42871.5, 300 sec: 42709.5). Total num frames: 4019421184. Throughput: 0: 42867.6. Samples: 298241380. Policy #0 lag: (min: 0.0, avg: 8.2, max: 22.0) [2024-06-28 14:36:42,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:36:43,101][09423] Updated weights for policy 0, policy_version 245327 (0.0029) [2024-06-28 14:36:47,211][09423] Updated weights for policy 0, policy_version 245337 (0.0039) [2024-06-28 14:36:47,921][09190] Fps is (10 sec: 42598.0, 60 sec: 42325.3, 300 sec: 42653.9). Total num frames: 4019617792. Throughput: 0: 42900.4. Samples: 298506500. Policy #0 lag: (min: 1.0, avg: 10.2, max: 22.0) [2024-06-28 14:36:47,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 14:36:50,583][09423] Updated weights for policy 0, policy_version 245347 (0.0032) [2024-06-28 14:36:52,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42871.6, 300 sec: 42653.9). Total num frames: 4019830784. Throughput: 0: 42673.7. Samples: 298761520. Policy #0 lag: (min: 1.0, avg: 10.2, max: 22.0) [2024-06-28 14:36:52,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:36:54,730][09423] Updated weights for policy 0, policy_version 245357 (0.0049) [2024-06-28 14:36:57,922][09190] Fps is (10 sec: 44236.5, 60 sec: 42871.4, 300 sec: 42820.5). Total num frames: 4020060160. Throughput: 0: 42826.1. Samples: 298887620. Policy #0 lag: (min: 1.0, avg: 10.2, max: 22.0) [2024-06-28 14:36:57,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:36:58,395][09423] Updated weights for policy 0, policy_version 245367 (0.0028) [2024-06-28 14:37:02,778][09423] Updated weights for policy 0, policy_version 245377 (0.0024) [2024-06-28 14:37:02,921][09190] Fps is (10 sec: 42598.1, 60 sec: 42325.3, 300 sec: 42598.4). Total num frames: 4020256768. Throughput: 0: 42680.6. Samples: 299140360. Policy #0 lag: (min: 1.0, avg: 10.2, max: 22.0) [2024-06-28 14:37:02,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 14:37:06,171][09423] Updated weights for policy 0, policy_version 245387 (0.0034) [2024-06-28 14:37:07,921][09190] Fps is (10 sec: 40960.7, 60 sec: 42871.4, 300 sec: 42598.4). Total num frames: 4020469760. Throughput: 0: 42573.4. Samples: 299390220. Policy #0 lag: (min: 1.0, avg: 10.2, max: 22.0) [2024-06-28 14:37:07,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:37:10,485][09423] Updated weights for policy 0, policy_version 245397 (0.0042) [2024-06-28 14:37:12,922][09190] Fps is (10 sec: 45874.7, 60 sec: 42871.4, 300 sec: 42765.0). Total num frames: 4020715520. Throughput: 0: 42724.3. Samples: 299523660. Policy #0 lag: (min: 1.0, avg: 10.2, max: 22.0) [2024-06-28 14:37:12,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:37:13,600][09423] Updated weights for policy 0, policy_version 245407 (0.0042) [2024-06-28 14:37:17,921][09190] Fps is (10 sec: 42598.1, 60 sec: 42598.5, 300 sec: 42598.4). Total num frames: 4020895744. Throughput: 0: 42768.7. Samples: 299782800. Policy #0 lag: (min: 1.0, avg: 10.2, max: 22.0) [2024-06-28 14:37:17,922][09190] Avg episode reward: [(0, '0.761')] [2024-06-28 14:37:17,981][09423] Updated weights for policy 0, policy_version 245417 (0.0029) [2024-06-28 14:37:21,592][09423] Updated weights for policy 0, policy_version 245427 (0.0038) [2024-06-28 14:37:22,921][09190] Fps is (10 sec: 40960.9, 60 sec: 43144.6, 300 sec: 42654.0). Total num frames: 4021125120. Throughput: 0: 42687.1. Samples: 300036220. Policy #0 lag: (min: 1.0, avg: 10.2, max: 22.0) [2024-06-28 14:37:22,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:37:25,424][09423] Updated weights for policy 0, policy_version 245437 (0.0036) [2024-06-28 14:37:27,921][09190] Fps is (10 sec: 44236.7, 60 sec: 42598.4, 300 sec: 42709.5). Total num frames: 4021338112. Throughput: 0: 42784.9. Samples: 300166700. Policy #0 lag: (min: 1.0, avg: 10.2, max: 22.0) [2024-06-28 14:37:27,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 14:37:29,041][09423] Updated weights for policy 0, policy_version 245447 (0.0041) [2024-06-28 14:37:32,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42871.4, 300 sec: 42653.9). Total num frames: 4021551104. Throughput: 0: 42744.1. Samples: 300429980. Policy #0 lag: (min: 1.0, avg: 10.2, max: 22.0) [2024-06-28 14:37:32,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 14:37:32,929][09423] Updated weights for policy 0, policy_version 245457 (0.0039) [2024-06-28 14:37:36,659][09423] Updated weights for policy 0, policy_version 245467 (0.0029) [2024-06-28 14:37:37,921][09190] Fps is (10 sec: 44236.7, 60 sec: 43144.5, 300 sec: 42765.1). Total num frames: 4021780480. Throughput: 0: 42398.6. Samples: 300669460. Policy #0 lag: (min: 1.0, avg: 10.2, max: 22.0) [2024-06-28 14:37:37,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 14:37:41,131][09423] Updated weights for policy 0, policy_version 245477 (0.0033) [2024-06-28 14:37:42,922][09190] Fps is (10 sec: 42597.6, 60 sec: 42598.3, 300 sec: 42709.5). Total num frames: 4021977088. Throughput: 0: 42623.1. Samples: 300805660. Policy #0 lag: (min: 1.0, avg: 10.2, max: 22.0) [2024-06-28 14:37:42,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:37:44,501][09423] Updated weights for policy 0, policy_version 245487 (0.0027) [2024-06-28 14:37:47,921][09190] Fps is (10 sec: 39322.2, 60 sec: 42598.5, 300 sec: 42542.9). Total num frames: 4022173696. Throughput: 0: 42712.1. Samples: 301062400. Policy #0 lag: (min: 1.0, avg: 10.2, max: 22.0) [2024-06-28 14:37:47,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 14:37:48,972][09423] Updated weights for policy 0, policy_version 245497 (0.0027) [2024-06-28 14:37:51,938][09423] Updated weights for policy 0, policy_version 245507 (0.0036) [2024-06-28 14:37:52,921][09190] Fps is (10 sec: 42598.7, 60 sec: 42871.4, 300 sec: 42709.5). Total num frames: 4022403072. Throughput: 0: 42595.5. Samples: 301307020. Policy #0 lag: (min: 1.0, avg: 10.2, max: 22.0) [2024-06-28 14:37:52,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 14:37:56,515][09423] Updated weights for policy 0, policy_version 245517 (0.0023) [2024-06-28 14:37:57,921][09190] Fps is (10 sec: 44236.6, 60 sec: 42598.5, 300 sec: 42709.5). Total num frames: 4022616064. Throughput: 0: 42602.0. Samples: 301440740. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 14:37:57,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 14:37:59,049][09403] Signal inference workers to stop experience collection... (4200 times) [2024-06-28 14:37:59,050][09403] Signal inference workers to resume experience collection... (4200 times) [2024-06-28 14:37:59,069][09423] InferenceWorker_p0-w0: stopping experience collection (4200 times) [2024-06-28 14:37:59,069][09423] InferenceWorker_p0-w0: resuming experience collection (4200 times) [2024-06-28 14:37:59,906][09423] Updated weights for policy 0, policy_version 245527 (0.0031) [2024-06-28 14:38:02,921][09190] Fps is (10 sec: 40960.4, 60 sec: 42598.5, 300 sec: 42542.9). Total num frames: 4022812672. Throughput: 0: 42685.0. Samples: 301703620. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 14:38:02,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:38:03,903][09423] Updated weights for policy 0, policy_version 245537 (0.0036) [2024-06-28 14:38:07,337][09423] Updated weights for policy 0, policy_version 245547 (0.0024) [2024-06-28 14:38:07,921][09190] Fps is (10 sec: 44236.7, 60 sec: 43144.5, 300 sec: 42820.6). Total num frames: 4023058432. Throughput: 0: 42648.0. Samples: 301955380. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 14:38:07,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 14:38:11,277][09423] Updated weights for policy 0, policy_version 245557 (0.0037) [2024-06-28 14:38:12,921][09190] Fps is (10 sec: 44236.3, 60 sec: 42325.4, 300 sec: 42709.5). Total num frames: 4023255040. Throughput: 0: 42651.1. Samples: 302086000. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 14:38:12,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:38:15,232][09423] Updated weights for policy 0, policy_version 245567 (0.0033) [2024-06-28 14:38:17,921][09190] Fps is (10 sec: 39321.3, 60 sec: 42598.4, 300 sec: 42598.4). Total num frames: 4023451648. Throughput: 0: 42356.3. Samples: 302336020. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 14:38:17,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:38:18,049][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000245573_4023468032.pth... [2024-06-28 14:38:18,093][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000244948_4013228032.pth [2024-06-28 14:38:19,539][09423] Updated weights for policy 0, policy_version 245577 (0.0030) [2024-06-28 14:38:22,780][09423] Updated weights for policy 0, policy_version 245587 (0.0034) [2024-06-28 14:38:22,921][09190] Fps is (10 sec: 44236.8, 60 sec: 42871.4, 300 sec: 42765.0). Total num frames: 4023697408. Throughput: 0: 42560.0. Samples: 302584660. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 14:38:22,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:38:26,944][09423] Updated weights for policy 0, policy_version 245597 (0.0033) [2024-06-28 14:38:27,921][09190] Fps is (10 sec: 42598.8, 60 sec: 42325.4, 300 sec: 42653.9). Total num frames: 4023877632. Throughput: 0: 42584.1. Samples: 302721940. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 14:38:27,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:38:30,738][09423] Updated weights for policy 0, policy_version 245607 (0.0041) [2024-06-28 14:38:32,921][09190] Fps is (10 sec: 39322.2, 60 sec: 42325.4, 300 sec: 42542.9). Total num frames: 4024090624. Throughput: 0: 42487.6. Samples: 302974340. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 14:38:32,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:38:34,514][09423] Updated weights for policy 0, policy_version 245617 (0.0038) [2024-06-28 14:38:37,921][09190] Fps is (10 sec: 44236.5, 60 sec: 42325.4, 300 sec: 42765.0). Total num frames: 4024320000. Throughput: 0: 42741.3. Samples: 303230380. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 14:38:37,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:38:38,327][09423] Updated weights for policy 0, policy_version 245627 (0.0025) [2024-06-28 14:38:42,348][09423] Updated weights for policy 0, policy_version 245637 (0.0036) [2024-06-28 14:38:42,921][09190] Fps is (10 sec: 44236.3, 60 sec: 42598.5, 300 sec: 42709.5). Total num frames: 4024532992. Throughput: 0: 42755.0. Samples: 303364720. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 14:38:42,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:38:45,811][09423] Updated weights for policy 0, policy_version 245647 (0.0026) [2024-06-28 14:38:47,921][09190] Fps is (10 sec: 40960.1, 60 sec: 42598.3, 300 sec: 42598.4). Total num frames: 4024729600. Throughput: 0: 42514.1. Samples: 303616760. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 14:38:47,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 14:38:49,775][09423] Updated weights for policy 0, policy_version 245657 (0.0040) [2024-06-28 14:38:52,921][09190] Fps is (10 sec: 42598.7, 60 sec: 42598.5, 300 sec: 42765.0). Total num frames: 4024958976. Throughput: 0: 42640.9. Samples: 303874220. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 14:38:52,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:38:53,679][09423] Updated weights for policy 0, policy_version 245667 (0.0038) [2024-06-28 14:38:57,755][09423] Updated weights for policy 0, policy_version 245677 (0.0042) [2024-06-28 14:38:57,921][09190] Fps is (10 sec: 44236.9, 60 sec: 42598.4, 300 sec: 42709.8). Total num frames: 4025171968. Throughput: 0: 42473.4. Samples: 303997300. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 14:38:57,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 14:39:01,403][09423] Updated weights for policy 0, policy_version 245687 (0.0027) [2024-06-28 14:39:02,925][09190] Fps is (10 sec: 42583.1, 60 sec: 42868.9, 300 sec: 42653.4). Total num frames: 4025384960. Throughput: 0: 42485.6. Samples: 304248020. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 14:39:02,926][09190] Avg episode reward: [(0, '0.756')] [2024-06-28 14:39:05,446][09423] Updated weights for policy 0, policy_version 245697 (0.0033) [2024-06-28 14:39:07,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42052.2, 300 sec: 42653.9). Total num frames: 4025581568. Throughput: 0: 42663.6. Samples: 304504520. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 14:39:07,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:39:09,419][09423] Updated weights for policy 0, policy_version 245707 (0.0034) [2024-06-28 14:39:12,897][09423] Updated weights for policy 0, policy_version 245717 (0.0044) [2024-06-28 14:39:12,921][09190] Fps is (10 sec: 44252.3, 60 sec: 42871.5, 300 sec: 42709.5). Total num frames: 4025827328. Throughput: 0: 42465.7. Samples: 304632900. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 14:39:12,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:39:16,918][09423] Updated weights for policy 0, policy_version 245727 (0.0029) [2024-06-28 14:39:17,921][09190] Fps is (10 sec: 45875.1, 60 sec: 43144.5, 300 sec: 42653.9). Total num frames: 4026040320. Throughput: 0: 42696.7. Samples: 304895700. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 14:39:17,930][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:39:20,618][09423] Updated weights for policy 0, policy_version 245737 (0.0038) [2024-06-28 14:39:22,921][09190] Fps is (10 sec: 42598.7, 60 sec: 42598.4, 300 sec: 42765.0). Total num frames: 4026253312. Throughput: 0: 42682.3. Samples: 305151080. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 14:39:22,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:39:24,715][09423] Updated weights for policy 0, policy_version 245747 (0.0038) [2024-06-28 14:39:27,921][09190] Fps is (10 sec: 40959.8, 60 sec: 42871.4, 300 sec: 42598.4). Total num frames: 4026449920. Throughput: 0: 42560.8. Samples: 305279960. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 14:39:27,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 14:39:28,283][09423] Updated weights for policy 0, policy_version 245757 (0.0047) [2024-06-28 14:39:32,035][09423] Updated weights for policy 0, policy_version 245767 (0.0040) [2024-06-28 14:39:32,921][09190] Fps is (10 sec: 42598.4, 60 sec: 43144.5, 300 sec: 42709.5). Total num frames: 4026679296. Throughput: 0: 42792.5. Samples: 305542420. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 14:39:32,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 14:39:36,287][09423] Updated weights for policy 0, policy_version 245777 (0.0037) [2024-06-28 14:39:37,921][09190] Fps is (10 sec: 42598.7, 60 sec: 42598.4, 300 sec: 42709.5). Total num frames: 4026875904. Throughput: 0: 42636.8. Samples: 305792880. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 14:39:37,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 14:39:39,656][09423] Updated weights for policy 0, policy_version 245787 (0.0021) [2024-06-28 14:39:42,921][09190] Fps is (10 sec: 39321.8, 60 sec: 42325.4, 300 sec: 42487.3). Total num frames: 4027072512. Throughput: 0: 42805.0. Samples: 305923520. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 14:39:42,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 14:39:43,871][09403] Signal inference workers to stop experience collection... (4250 times) [2024-06-28 14:39:43,872][09403] Signal inference workers to resume experience collection... (4250 times) [2024-06-28 14:39:43,916][09423] InferenceWorker_p0-w0: stopping experience collection (4250 times) [2024-06-28 14:39:43,916][09423] InferenceWorker_p0-w0: resuming experience collection (4250 times) [2024-06-28 14:39:44,006][09423] Updated weights for policy 0, policy_version 245797 (0.0028) [2024-06-28 14:39:47,729][09423] Updated weights for policy 0, policy_version 245807 (0.0028) [2024-06-28 14:39:47,921][09190] Fps is (10 sec: 42599.0, 60 sec: 42871.6, 300 sec: 42654.0). Total num frames: 4027301888. Throughput: 0: 42732.8. Samples: 306170840. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 14:39:47,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 14:39:51,618][09423] Updated weights for policy 0, policy_version 245817 (0.0035) [2024-06-28 14:39:52,921][09190] Fps is (10 sec: 44236.6, 60 sec: 42598.4, 300 sec: 42709.5). Total num frames: 4027514880. Throughput: 0: 42757.8. Samples: 306428620. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 14:39:52,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:39:55,226][09423] Updated weights for policy 0, policy_version 245827 (0.0027) [2024-06-28 14:39:57,922][09190] Fps is (10 sec: 40958.8, 60 sec: 42325.2, 300 sec: 42598.4). Total num frames: 4027711488. Throughput: 0: 42646.6. Samples: 306552000. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 14:39:57,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:39:59,697][09423] Updated weights for policy 0, policy_version 245837 (0.0031) [2024-06-28 14:40:02,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42601.0, 300 sec: 42765.0). Total num frames: 4027940864. Throughput: 0: 42532.5. Samples: 306809660. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 14:40:02,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:40:03,007][09423] Updated weights for policy 0, policy_version 245847 (0.0037) [2024-06-28 14:40:07,289][09423] Updated weights for policy 0, policy_version 245857 (0.0039) [2024-06-28 14:40:07,922][09190] Fps is (10 sec: 42598.7, 60 sec: 42598.4, 300 sec: 42653.9). Total num frames: 4028137472. Throughput: 0: 42645.2. Samples: 307070120. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 14:40:07,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:40:10,518][09423] Updated weights for policy 0, policy_version 245867 (0.0037) [2024-06-28 14:40:12,921][09190] Fps is (10 sec: 42598.2, 60 sec: 42325.4, 300 sec: 42598.7). Total num frames: 4028366848. Throughput: 0: 42582.8. Samples: 307196180. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 14:40:12,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:40:15,169][09423] Updated weights for policy 0, policy_version 245877 (0.0039) [2024-06-28 14:40:17,922][09190] Fps is (10 sec: 44236.5, 60 sec: 42325.3, 300 sec: 42653.9). Total num frames: 4028579840. Throughput: 0: 42323.4. Samples: 307446980. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 14:40:17,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:40:18,034][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000245886_4028596224.pth... [2024-06-28 14:40:18,081][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000245261_4018356224.pth [2024-06-28 14:40:18,315][09423] Updated weights for policy 0, policy_version 245887 (0.0030) [2024-06-28 14:40:22,771][09423] Updated weights for policy 0, policy_version 245897 (0.0032) [2024-06-28 14:40:22,921][09190] Fps is (10 sec: 42598.7, 60 sec: 42325.4, 300 sec: 42654.0). Total num frames: 4028792832. Throughput: 0: 42481.4. Samples: 307704540. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 14:40:22,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:40:25,950][09423] Updated weights for policy 0, policy_version 245907 (0.0033) [2024-06-28 14:40:27,924][09190] Fps is (10 sec: 42588.3, 60 sec: 42596.7, 300 sec: 42542.5). Total num frames: 4029005824. Throughput: 0: 42373.6. Samples: 307830440. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 14:40:27,925][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:40:30,705][09423] Updated weights for policy 0, policy_version 245917 (0.0044) [2024-06-28 14:40:32,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42325.4, 300 sec: 42709.5). Total num frames: 4029218816. Throughput: 0: 42672.8. Samples: 308091120. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 14:40:32,922][09190] Avg episode reward: [(0, '0.762')] [2024-06-28 14:40:33,681][09423] Updated weights for policy 0, policy_version 245927 (0.0039) [2024-06-28 14:40:37,921][09190] Fps is (10 sec: 39331.8, 60 sec: 42052.3, 300 sec: 42542.9). Total num frames: 4029399040. Throughput: 0: 42661.8. Samples: 308348400. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 14:40:37,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 14:40:38,355][09423] Updated weights for policy 0, policy_version 245937 (0.0052) [2024-06-28 14:40:41,185][09423] Updated weights for policy 0, policy_version 245947 (0.0027) [2024-06-28 14:40:42,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42871.5, 300 sec: 42598.4). Total num frames: 4029644800. Throughput: 0: 42666.9. Samples: 308472000. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 14:40:42,922][09190] Avg episode reward: [(0, '0.756')] [2024-06-28 14:40:46,013][09423] Updated weights for policy 0, policy_version 245957 (0.0030) [2024-06-28 14:40:47,921][09190] Fps is (10 sec: 45874.7, 60 sec: 42598.3, 300 sec: 42709.5). Total num frames: 4029857792. Throughput: 0: 42807.0. Samples: 308735980. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 14:40:47,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:40:48,947][09423] Updated weights for policy 0, policy_version 245967 (0.0024) [2024-06-28 14:40:52,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42325.4, 300 sec: 42598.4). Total num frames: 4030054400. Throughput: 0: 42582.0. Samples: 308986300. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 14:40:52,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:40:53,481][09423] Updated weights for policy 0, policy_version 245977 (0.0031) [2024-06-28 14:40:56,531][09423] Updated weights for policy 0, policy_version 245987 (0.0039) [2024-06-28 14:40:57,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42871.6, 300 sec: 42598.4). Total num frames: 4030283776. Throughput: 0: 42529.7. Samples: 309110020. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 14:40:57,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:41:01,457][09423] Updated weights for policy 0, policy_version 245997 (0.0033) [2024-06-28 14:41:02,921][09190] Fps is (10 sec: 44236.5, 60 sec: 42598.4, 300 sec: 42709.5). Total num frames: 4030496768. Throughput: 0: 42676.6. Samples: 309367420. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 14:41:02,922][09190] Avg episode reward: [(0, '0.756')] [2024-06-28 14:41:04,352][09423] Updated weights for policy 0, policy_version 246007 (0.0034) [2024-06-28 14:41:07,921][09190] Fps is (10 sec: 39322.0, 60 sec: 42325.4, 300 sec: 42487.3). Total num frames: 4030676992. Throughput: 0: 42635.5. Samples: 309623140. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 14:41:07,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:41:09,022][09423] Updated weights for policy 0, policy_version 246017 (0.0037) [2024-06-28 14:41:12,101][09423] Updated weights for policy 0, policy_version 246027 (0.0026) [2024-06-28 14:41:12,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42598.4, 300 sec: 42654.0). Total num frames: 4030922752. Throughput: 0: 42689.5. Samples: 309751360. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 14:41:12,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:41:16,649][09423] Updated weights for policy 0, policy_version 246037 (0.0032) [2024-06-28 14:41:17,858][09403] Signal inference workers to stop experience collection... (4300 times) [2024-06-28 14:41:17,859][09403] Signal inference workers to resume experience collection... (4300 times) [2024-06-28 14:41:17,889][09423] InferenceWorker_p0-w0: stopping experience collection (4300 times) [2024-06-28 14:41:17,889][09423] InferenceWorker_p0-w0: resuming experience collection (4300 times) [2024-06-28 14:41:17,921][09190] Fps is (10 sec: 44236.7, 60 sec: 42325.5, 300 sec: 42654.0). Total num frames: 4031119360. Throughput: 0: 42770.2. Samples: 310015780. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 14:41:17,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:41:19,546][09423] Updated weights for policy 0, policy_version 246047 (0.0045) [2024-06-28 14:41:22,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42325.3, 300 sec: 42542.9). Total num frames: 4031332352. Throughput: 0: 42787.5. Samples: 310273840. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 14:41:22,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 14:41:23,983][09423] Updated weights for policy 0, policy_version 246057 (0.0036) [2024-06-28 14:41:27,004][09423] Updated weights for policy 0, policy_version 246067 (0.0031) [2024-06-28 14:41:27,928][09190] Fps is (10 sec: 45845.1, 60 sec: 42868.6, 300 sec: 42708.5). Total num frames: 4031578112. Throughput: 0: 42796.0. Samples: 310398100. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 14:41:27,928][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:41:31,749][09423] Updated weights for policy 0, policy_version 246077 (0.0032) [2024-06-28 14:41:32,921][09190] Fps is (10 sec: 42598.0, 60 sec: 42325.3, 300 sec: 42598.4). Total num frames: 4031758336. Throughput: 0: 42624.0. Samples: 310654060. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 14:41:32,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:41:35,169][09423] Updated weights for policy 0, policy_version 246087 (0.0023) [2024-06-28 14:41:37,921][09190] Fps is (10 sec: 40986.9, 60 sec: 43144.5, 300 sec: 42598.4). Total num frames: 4031987712. Throughput: 0: 42755.9. Samples: 310910320. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 14:41:37,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:41:39,373][09423] Updated weights for policy 0, policy_version 246097 (0.0035) [2024-06-28 14:41:42,721][09423] Updated weights for policy 0, policy_version 246107 (0.0039) [2024-06-28 14:41:42,922][09190] Fps is (10 sec: 45874.7, 60 sec: 42871.3, 300 sec: 42709.5). Total num frames: 4032217088. Throughput: 0: 42809.2. Samples: 311036440. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 14:41:42,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:41:47,299][09423] Updated weights for policy 0, policy_version 246117 (0.0037) [2024-06-28 14:41:47,922][09190] Fps is (10 sec: 40959.0, 60 sec: 42325.2, 300 sec: 42598.4). Total num frames: 4032397312. Throughput: 0: 42754.0. Samples: 311291360. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 14:41:47,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 14:41:50,153][09423] Updated weights for policy 0, policy_version 246127 (0.0042) [2024-06-28 14:41:52,921][09190] Fps is (10 sec: 40960.7, 60 sec: 42871.4, 300 sec: 42598.4). Total num frames: 4032626688. Throughput: 0: 42750.6. Samples: 311546920. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 14:41:52,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 14:41:54,733][09423] Updated weights for policy 0, policy_version 246137 (0.0030) [2024-06-28 14:41:57,796][09423] Updated weights for policy 0, policy_version 246147 (0.0030) [2024-06-28 14:41:57,921][09190] Fps is (10 sec: 47514.7, 60 sec: 43144.6, 300 sec: 42765.0). Total num frames: 4032872448. Throughput: 0: 42923.1. Samples: 311682900. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 14:41:57,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:42:02,582][09423] Updated weights for policy 0, policy_version 246157 (0.0031) [2024-06-28 14:42:02,921][09190] Fps is (10 sec: 40960.1, 60 sec: 42325.3, 300 sec: 42598.4). Total num frames: 4033036288. Throughput: 0: 42702.7. Samples: 311937400. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 14:42:02,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 14:42:05,418][09423] Updated weights for policy 0, policy_version 246167 (0.0032) [2024-06-28 14:42:07,921][09190] Fps is (10 sec: 40959.9, 60 sec: 43417.6, 300 sec: 42598.4). Total num frames: 4033282048. Throughput: 0: 42562.6. Samples: 312189160. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 14:42:07,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:42:10,102][09423] Updated weights for policy 0, policy_version 246177 (0.0034) [2024-06-28 14:42:12,922][09190] Fps is (10 sec: 45874.3, 60 sec: 42871.3, 300 sec: 42709.5). Total num frames: 4033495040. Throughput: 0: 42705.2. Samples: 312319560. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 14:42:12,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:42:13,427][09423] Updated weights for policy 0, policy_version 246187 (0.0032) [2024-06-28 14:42:17,856][09423] Updated weights for policy 0, policy_version 246197 (0.0040) [2024-06-28 14:42:17,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42871.5, 300 sec: 42598.4). Total num frames: 4033691648. Throughput: 0: 42587.6. Samples: 312570500. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 14:42:17,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:42:17,943][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000246197_4033691648.pth... [2024-06-28 14:42:17,992][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000245573_4023468032.pth [2024-06-28 14:42:21,067][09423] Updated weights for policy 0, policy_version 246207 (0.0022) [2024-06-28 14:42:22,921][09190] Fps is (10 sec: 44237.9, 60 sec: 43417.6, 300 sec: 42709.5). Total num frames: 4033937408. Throughput: 0: 42489.4. Samples: 312822340. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 14:42:22,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:42:25,509][09423] Updated weights for policy 0, policy_version 246217 (0.0029) [2024-06-28 14:42:27,927][09190] Fps is (10 sec: 42575.6, 60 sec: 42326.2, 300 sec: 42597.6). Total num frames: 4034117632. Throughput: 0: 42611.5. Samples: 312954180. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 14:42:27,927][09190] Avg episode reward: [(0, '0.756')] [2024-06-28 14:42:28,724][09423] Updated weights for policy 0, policy_version 246227 (0.0028) [2024-06-28 14:42:32,919][09423] Updated weights for policy 0, policy_version 246237 (0.0028) [2024-06-28 14:42:32,922][09190] Fps is (10 sec: 40959.0, 60 sec: 43144.5, 300 sec: 42598.4). Total num frames: 4034347008. Throughput: 0: 42669.8. Samples: 313211500. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 14:42:32,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:42:36,403][09423] Updated weights for policy 0, policy_version 246247 (0.0040) [2024-06-28 14:42:37,921][09190] Fps is (10 sec: 44260.0, 60 sec: 42871.4, 300 sec: 42653.9). Total num frames: 4034560000. Throughput: 0: 42573.7. Samples: 313462740. Policy #0 lag: (min: 0.0, avg: 11.3, max: 22.0) [2024-06-28 14:42:37,922][09190] Avg episode reward: [(0, '0.759')] [2024-06-28 14:42:41,148][09423] Updated weights for policy 0, policy_version 246257 (0.0037) [2024-06-28 14:42:42,921][09190] Fps is (10 sec: 40960.9, 60 sec: 42325.5, 300 sec: 42653.9). Total num frames: 4034756608. Throughput: 0: 42560.0. Samples: 313598100. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 14:42:42,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 14:42:44,074][09423] Updated weights for policy 0, policy_version 246267 (0.0038) [2024-06-28 14:42:45,647][09403] Signal inference workers to stop experience collection... (4350 times) [2024-06-28 14:42:45,691][09423] InferenceWorker_p0-w0: stopping experience collection (4350 times) [2024-06-28 14:42:45,702][09403] Signal inference workers to resume experience collection... (4350 times) [2024-06-28 14:42:45,712][09423] InferenceWorker_p0-w0: resuming experience collection (4350 times) [2024-06-28 14:42:47,921][09190] Fps is (10 sec: 40960.3, 60 sec: 42871.6, 300 sec: 42598.4). Total num frames: 4034969600. Throughput: 0: 42455.1. Samples: 313847880. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 14:42:47,922][09190] Avg episode reward: [(0, '0.759')] [2024-06-28 14:42:48,886][09423] Updated weights for policy 0, policy_version 246277 (0.0040) [2024-06-28 14:42:52,104][09423] Updated weights for policy 0, policy_version 246287 (0.0041) [2024-06-28 14:42:52,921][09190] Fps is (10 sec: 44236.4, 60 sec: 42871.4, 300 sec: 42653.9). Total num frames: 4035198976. Throughput: 0: 42244.0. Samples: 314090140. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 14:42:52,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 14:42:56,492][09423] Updated weights for policy 0, policy_version 246297 (0.0031) [2024-06-28 14:42:57,921][09190] Fps is (10 sec: 40960.0, 60 sec: 41779.2, 300 sec: 42598.4). Total num frames: 4035379200. Throughput: 0: 42420.6. Samples: 314228480. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 14:42:57,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 14:42:59,576][09423] Updated weights for policy 0, policy_version 246307 (0.0042) [2024-06-28 14:43:02,921][09190] Fps is (10 sec: 39321.6, 60 sec: 42598.4, 300 sec: 42487.3). Total num frames: 4035592192. Throughput: 0: 42352.9. Samples: 314476380. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 14:43:02,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:43:04,081][09423] Updated weights for policy 0, policy_version 246317 (0.0042) [2024-06-28 14:43:07,095][09423] Updated weights for policy 0, policy_version 246327 (0.0033) [2024-06-28 14:43:07,921][09190] Fps is (10 sec: 45875.8, 60 sec: 42598.5, 300 sec: 42654.0). Total num frames: 4035837952. Throughput: 0: 42464.9. Samples: 314733260. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 14:43:07,921][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:43:11,791][09423] Updated weights for policy 0, policy_version 246337 (0.0044) [2024-06-28 14:43:12,922][09190] Fps is (10 sec: 40959.5, 60 sec: 41779.2, 300 sec: 42542.9). Total num frames: 4036001792. Throughput: 0: 42608.0. Samples: 314871320. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 14:43:12,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 14:43:15,079][09423] Updated weights for policy 0, policy_version 246347 (0.0028) [2024-06-28 14:43:17,921][09190] Fps is (10 sec: 40959.7, 60 sec: 42598.4, 300 sec: 42542.9). Total num frames: 4036247552. Throughput: 0: 42425.1. Samples: 315120620. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 14:43:17,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 14:43:19,861][09423] Updated weights for policy 0, policy_version 246357 (0.0035) [2024-06-28 14:43:22,615][09423] Updated weights for policy 0, policy_version 246367 (0.0032) [2024-06-28 14:43:22,921][09190] Fps is (10 sec: 47514.1, 60 sec: 42325.2, 300 sec: 42709.5). Total num frames: 4036476928. Throughput: 0: 42431.2. Samples: 315372140. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 14:43:22,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 14:43:27,304][09423] Updated weights for policy 0, policy_version 246377 (0.0031) [2024-06-28 14:43:27,921][09190] Fps is (10 sec: 39321.2, 60 sec: 42056.0, 300 sec: 42542.8). Total num frames: 4036640768. Throughput: 0: 42336.8. Samples: 315503260. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 14:43:27,924][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:43:30,625][09423] Updated weights for policy 0, policy_version 246387 (0.0037) [2024-06-28 14:43:32,922][09190] Fps is (10 sec: 40959.6, 60 sec: 42325.3, 300 sec: 42598.4). Total num frames: 4036886528. Throughput: 0: 42320.3. Samples: 315752300. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 14:43:32,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 14:43:35,208][09423] Updated weights for policy 0, policy_version 246397 (0.0033) [2024-06-28 14:43:37,921][09190] Fps is (10 sec: 47513.9, 60 sec: 42598.5, 300 sec: 42653.9). Total num frames: 4037115904. Throughput: 0: 42695.1. Samples: 316011420. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 14:43:37,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:43:38,072][09423] Updated weights for policy 0, policy_version 246407 (0.0028) [2024-06-28 14:43:42,675][09423] Updated weights for policy 0, policy_version 246417 (0.0045) [2024-06-28 14:43:42,921][09190] Fps is (10 sec: 40960.5, 60 sec: 42325.3, 300 sec: 42598.4). Total num frames: 4037296128. Throughput: 0: 42624.9. Samples: 316146600. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 14:43:42,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:43:45,730][09423] Updated weights for policy 0, policy_version 246427 (0.0036) [2024-06-28 14:43:47,922][09190] Fps is (10 sec: 40959.3, 60 sec: 42598.3, 300 sec: 42598.4). Total num frames: 4037525504. Throughput: 0: 42515.4. Samples: 316389580. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 14:43:47,922][09190] Avg episode reward: [(0, '0.860')] [2024-06-28 14:43:47,943][09403] Saving new best policy, reward=0.860! [2024-06-28 14:43:50,246][09423] Updated weights for policy 0, policy_version 246437 (0.0031) [2024-06-28 14:43:52,921][09190] Fps is (10 sec: 45875.6, 60 sec: 42598.5, 300 sec: 42654.0). Total num frames: 4037754880. Throughput: 0: 42714.2. Samples: 316655400. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 14:43:52,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:43:53,263][09423] Updated weights for policy 0, policy_version 246447 (0.0026) [2024-06-28 14:43:57,747][09423] Updated weights for policy 0, policy_version 246457 (0.0032) [2024-06-28 14:43:57,921][09190] Fps is (10 sec: 42598.8, 60 sec: 42871.4, 300 sec: 42598.9). Total num frames: 4037951488. Throughput: 0: 42592.5. Samples: 316787980. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 14:43:57,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 14:43:59,340][09403] Signal inference workers to stop experience collection... (4400 times) [2024-06-28 14:43:59,346][09403] Signal inference workers to resume experience collection... (4400 times) [2024-06-28 14:43:59,370][09423] InferenceWorker_p0-w0: stopping experience collection (4400 times) [2024-06-28 14:43:59,371][09423] InferenceWorker_p0-w0: resuming experience collection (4400 times) [2024-06-28 14:44:00,891][09423] Updated weights for policy 0, policy_version 246467 (0.0033) [2024-06-28 14:44:02,921][09190] Fps is (10 sec: 42598.3, 60 sec: 43144.6, 300 sec: 42709.5). Total num frames: 4038180864. Throughput: 0: 42594.7. Samples: 317037380. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 14:44:02,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:44:05,766][09423] Updated weights for policy 0, policy_version 246477 (0.0031) [2024-06-28 14:44:07,921][09190] Fps is (10 sec: 42598.8, 60 sec: 42325.3, 300 sec: 42542.9). Total num frames: 4038377472. Throughput: 0: 42710.8. Samples: 317294120. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 14:44:07,922][09190] Avg episode reward: [(0, '0.756')] [2024-06-28 14:44:08,615][09423] Updated weights for policy 0, policy_version 246487 (0.0035) [2024-06-28 14:44:12,921][09190] Fps is (10 sec: 40959.5, 60 sec: 43144.6, 300 sec: 42542.9). Total num frames: 4038590464. Throughput: 0: 42687.5. Samples: 317424200. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 14:44:12,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:44:13,155][09423] Updated weights for policy 0, policy_version 246497 (0.0037) [2024-06-28 14:44:16,677][09423] Updated weights for policy 0, policy_version 246507 (0.0037) [2024-06-28 14:44:17,922][09190] Fps is (10 sec: 44235.7, 60 sec: 42871.3, 300 sec: 42598.4). Total num frames: 4038819840. Throughput: 0: 42745.7. Samples: 317675860. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 14:44:17,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:44:17,940][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000246510_4038819840.pth... [2024-06-28 14:44:17,995][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000245886_4028596224.pth [2024-06-28 14:44:21,356][09423] Updated weights for policy 0, policy_version 246517 (0.0036) [2024-06-28 14:44:22,921][09190] Fps is (10 sec: 42598.9, 60 sec: 42325.4, 300 sec: 42598.4). Total num frames: 4039016448. Throughput: 0: 42683.1. Samples: 317932160. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 14:44:22,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:44:24,243][09423] Updated weights for policy 0, policy_version 246527 (0.0035) [2024-06-28 14:44:27,921][09190] Fps is (10 sec: 39322.6, 60 sec: 42871.5, 300 sec: 42487.3). Total num frames: 4039213056. Throughput: 0: 42342.7. Samples: 318052020. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 14:44:27,922][09190] Avg episode reward: [(0, '0.756')] [2024-06-28 14:44:28,787][09423] Updated weights for policy 0, policy_version 246537 (0.0036) [2024-06-28 14:44:31,727][09423] Updated weights for policy 0, policy_version 246547 (0.0030) [2024-06-28 14:44:32,923][09190] Fps is (10 sec: 45869.7, 60 sec: 43143.8, 300 sec: 42709.3). Total num frames: 4039475200. Throughput: 0: 42725.3. Samples: 318312260. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 14:44:32,923][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 14:44:36,285][09423] Updated weights for policy 0, policy_version 246557 (0.0032) [2024-06-28 14:44:37,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42052.3, 300 sec: 42598.4). Total num frames: 4039639040. Throughput: 0: 42679.5. Samples: 318575980. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 14:44:37,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:44:39,619][09423] Updated weights for policy 0, policy_version 246567 (0.0043) [2024-06-28 14:44:42,921][09190] Fps is (10 sec: 40964.6, 60 sec: 43144.5, 300 sec: 42653.9). Total num frames: 4039884800. Throughput: 0: 42272.9. Samples: 318690260. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 14:44:42,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:44:44,289][09423] Updated weights for policy 0, policy_version 246577 (0.0031) [2024-06-28 14:44:47,647][09423] Updated weights for policy 0, policy_version 246587 (0.0033) [2024-06-28 14:44:47,921][09190] Fps is (10 sec: 45875.4, 60 sec: 42871.6, 300 sec: 42653.9). Total num frames: 4040097792. Throughput: 0: 42416.0. Samples: 318946100. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 14:44:47,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:44:51,729][09423] Updated weights for policy 0, policy_version 246597 (0.0035) [2024-06-28 14:44:52,921][09190] Fps is (10 sec: 39321.9, 60 sec: 42052.2, 300 sec: 42598.4). Total num frames: 4040278016. Throughput: 0: 42686.2. Samples: 319215000. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 14:44:52,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:44:55,127][09423] Updated weights for policy 0, policy_version 246607 (0.0040) [2024-06-28 14:44:57,921][09190] Fps is (10 sec: 39321.5, 60 sec: 42325.4, 300 sec: 42542.9). Total num frames: 4040491008. Throughput: 0: 42323.2. Samples: 319328740. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 14:44:57,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:44:59,751][09423] Updated weights for policy 0, policy_version 246617 (0.0034) [2024-06-28 14:45:02,614][09423] Updated weights for policy 0, policy_version 246627 (0.0030) [2024-06-28 14:45:02,921][09190] Fps is (10 sec: 47513.4, 60 sec: 42871.4, 300 sec: 42765.0). Total num frames: 4040753152. Throughput: 0: 42549.5. Samples: 319590580. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 14:45:02,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:45:07,283][09423] Updated weights for policy 0, policy_version 246637 (0.0027) [2024-06-28 14:45:07,921][09190] Fps is (10 sec: 42598.2, 60 sec: 42325.3, 300 sec: 42542.9). Total num frames: 4040916992. Throughput: 0: 42645.7. Samples: 319851220. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 14:45:07,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:45:08,272][09403] Signal inference workers to stop experience collection... (4450 times) [2024-06-28 14:45:08,303][09423] InferenceWorker_p0-w0: stopping experience collection (4450 times) [2024-06-28 14:45:08,330][09403] Signal inference workers to resume experience collection... (4450 times) [2024-06-28 14:45:08,336][09423] InferenceWorker_p0-w0: resuming experience collection (4450 times) [2024-06-28 14:45:10,163][09423] Updated weights for policy 0, policy_version 246647 (0.0032) [2024-06-28 14:45:12,921][09190] Fps is (10 sec: 39321.3, 60 sec: 42598.4, 300 sec: 42598.4). Total num frames: 4041146368. Throughput: 0: 42702.6. Samples: 319973640. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 14:45:12,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:45:14,796][09423] Updated weights for policy 0, policy_version 246657 (0.0044) [2024-06-28 14:45:17,921][09190] Fps is (10 sec: 45875.5, 60 sec: 42598.6, 300 sec: 42653.9). Total num frames: 4041375744. Throughput: 0: 42694.9. Samples: 320233480. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 14:45:17,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:45:17,946][09423] Updated weights for policy 0, policy_version 246667 (0.0044) [2024-06-28 14:45:22,694][09423] Updated weights for policy 0, policy_version 246677 (0.0022) [2024-06-28 14:45:22,921][09190] Fps is (10 sec: 40960.1, 60 sec: 42325.3, 300 sec: 42543.2). Total num frames: 4041555968. Throughput: 0: 42603.9. Samples: 320493160. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 14:45:22,923][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:45:25,694][09423] Updated weights for policy 0, policy_version 246687 (0.0028) [2024-06-28 14:45:27,924][09190] Fps is (10 sec: 40949.5, 60 sec: 42869.7, 300 sec: 42598.0). Total num frames: 4041785344. Throughput: 0: 42639.9. Samples: 320609160. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 14:45:27,924][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:45:30,362][09423] Updated weights for policy 0, policy_version 246697 (0.0038) [2024-06-28 14:45:32,921][09190] Fps is (10 sec: 45875.4, 60 sec: 42326.1, 300 sec: 42765.0). Total num frames: 4042014720. Throughput: 0: 42855.9. Samples: 320874620. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 14:45:32,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:45:33,127][09423] Updated weights for policy 0, policy_version 246707 (0.0037) [2024-06-28 14:45:37,921][09190] Fps is (10 sec: 40970.4, 60 sec: 42598.4, 300 sec: 42542.9). Total num frames: 4042194944. Throughput: 0: 42806.7. Samples: 321141300. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 14:45:37,922][09190] Avg episode reward: [(0, '0.756')] [2024-06-28 14:45:38,152][09423] Updated weights for policy 0, policy_version 246717 (0.0034) [2024-06-28 14:45:40,771][09423] Updated weights for policy 0, policy_version 246727 (0.0035) [2024-06-28 14:45:42,922][09190] Fps is (10 sec: 40959.3, 60 sec: 42325.2, 300 sec: 42598.4). Total num frames: 4042424320. Throughput: 0: 42876.7. Samples: 321258200. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 14:45:42,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:45:45,815][09423] Updated weights for policy 0, policy_version 246737 (0.0032) [2024-06-28 14:45:47,921][09190] Fps is (10 sec: 47513.5, 60 sec: 42871.4, 300 sec: 42765.0). Total num frames: 4042670080. Throughput: 0: 42795.6. Samples: 321516380. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 14:45:47,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:45:48,371][09423] Updated weights for policy 0, policy_version 246747 (0.0029) [2024-06-28 14:45:52,921][09190] Fps is (10 sec: 40960.4, 60 sec: 42598.3, 300 sec: 42542.9). Total num frames: 4042833920. Throughput: 0: 42751.5. Samples: 321775040. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 14:45:52,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:45:53,473][09423] Updated weights for policy 0, policy_version 246757 (0.0033) [2024-06-28 14:45:56,337][09423] Updated weights for policy 0, policy_version 246767 (0.0032) [2024-06-28 14:45:57,921][09190] Fps is (10 sec: 39321.6, 60 sec: 42871.4, 300 sec: 42598.4). Total num frames: 4043063296. Throughput: 0: 42688.1. Samples: 321894600. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 14:45:57,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 14:46:01,040][09423] Updated weights for policy 0, policy_version 246777 (0.0032) [2024-06-28 14:46:02,923][09190] Fps is (10 sec: 44231.3, 60 sec: 42051.3, 300 sec: 42709.3). Total num frames: 4043276288. Throughput: 0: 42738.7. Samples: 322156780. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 14:46:02,923][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 14:46:03,781][09423] Updated weights for policy 0, policy_version 246787 (0.0037) [2024-06-28 14:46:07,921][09190] Fps is (10 sec: 40959.5, 60 sec: 42598.3, 300 sec: 42542.8). Total num frames: 4043472896. Throughput: 0: 42763.9. Samples: 322417540. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 14:46:07,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 14:46:08,527][09423] Updated weights for policy 0, policy_version 246797 (0.0035) [2024-06-28 14:46:11,221][09423] Updated weights for policy 0, policy_version 246807 (0.0030) [2024-06-28 14:46:12,921][09190] Fps is (10 sec: 44242.6, 60 sec: 42871.5, 300 sec: 42709.5). Total num frames: 4043718656. Throughput: 0: 42910.4. Samples: 322540020. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 14:46:12,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:46:16,133][09423] Updated weights for policy 0, policy_version 246817 (0.0026) [2024-06-28 14:46:17,921][09190] Fps is (10 sec: 45876.1, 60 sec: 42598.4, 300 sec: 42709.5). Total num frames: 4043931648. Throughput: 0: 42866.3. Samples: 322803600. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 14:46:17,922][09190] Avg episode reward: [(0, '0.764')] [2024-06-28 14:46:18,035][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000246823_4043948032.pth... [2024-06-28 14:46:18,099][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000246197_4033691648.pth [2024-06-28 14:46:19,007][09423] Updated weights for policy 0, policy_version 246827 (0.0022) [2024-06-28 14:46:22,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42871.5, 300 sec: 42543.8). Total num frames: 4044128256. Throughput: 0: 42715.1. Samples: 323063480. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 14:46:22,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:46:23,891][09423] Updated weights for policy 0, policy_version 246837 (0.0040) [2024-06-28 14:46:25,480][09403] Signal inference workers to stop experience collection... (4500 times) [2024-06-28 14:46:25,480][09403] Signal inference workers to resume experience collection... (4500 times) [2024-06-28 14:46:25,515][09423] InferenceWorker_p0-w0: stopping experience collection (4500 times) [2024-06-28 14:46:25,515][09423] InferenceWorker_p0-w0: resuming experience collection (4500 times) [2024-06-28 14:46:26,709][09423] Updated weights for policy 0, policy_version 246847 (0.0037) [2024-06-28 14:46:27,921][09190] Fps is (10 sec: 42598.2, 60 sec: 42873.3, 300 sec: 42709.5). Total num frames: 4044357632. Throughput: 0: 42759.3. Samples: 323182360. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 14:46:27,922][09190] Avg episode reward: [(0, '0.759')] [2024-06-28 14:46:31,734][09423] Updated weights for policy 0, policy_version 246857 (0.0037) [2024-06-28 14:46:32,921][09190] Fps is (10 sec: 45874.6, 60 sec: 42871.4, 300 sec: 42709.5). Total num frames: 4044587008. Throughput: 0: 42956.3. Samples: 323449420. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 14:46:32,922][09190] Avg episode reward: [(0, '0.761')] [2024-06-28 14:46:34,621][09423] Updated weights for policy 0, policy_version 246867 (0.0045) [2024-06-28 14:46:37,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42871.5, 300 sec: 42542.9). Total num frames: 4044767232. Throughput: 0: 42809.0. Samples: 323701440. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 14:46:37,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:46:39,195][09423] Updated weights for policy 0, policy_version 246877 (0.0033) [2024-06-28 14:46:42,065][09423] Updated weights for policy 0, policy_version 246887 (0.0024) [2024-06-28 14:46:42,921][09190] Fps is (10 sec: 42598.9, 60 sec: 43144.7, 300 sec: 42765.1). Total num frames: 4045012992. Throughput: 0: 42915.6. Samples: 323825800. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 14:46:42,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 14:46:46,752][09423] Updated weights for policy 0, policy_version 246897 (0.0049) [2024-06-28 14:46:47,921][09190] Fps is (10 sec: 44236.7, 60 sec: 42325.3, 300 sec: 42653.9). Total num frames: 4045209600. Throughput: 0: 42812.4. Samples: 324083280. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 14:46:47,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:46:49,516][09423] Updated weights for policy 0, policy_version 246907 (0.0031) [2024-06-28 14:46:52,921][09190] Fps is (10 sec: 37683.3, 60 sec: 42598.5, 300 sec: 42431.8). Total num frames: 4045389824. Throughput: 0: 42765.1. Samples: 324341960. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 14:46:52,922][09190] Avg episode reward: [(0, '0.761')] [2024-06-28 14:46:54,898][09423] Updated weights for policy 0, policy_version 246917 (0.0030) [2024-06-28 14:46:57,449][09423] Updated weights for policy 0, policy_version 246927 (0.0035) [2024-06-28 14:46:57,921][09190] Fps is (10 sec: 44236.5, 60 sec: 43144.5, 300 sec: 42765.0). Total num frames: 4045651968. Throughput: 0: 42749.7. Samples: 324463760. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 14:46:57,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 14:47:02,460][09423] Updated weights for policy 0, policy_version 246937 (0.0024) [2024-06-28 14:47:02,921][09190] Fps is (10 sec: 44236.5, 60 sec: 42599.3, 300 sec: 42542.9). Total num frames: 4045832192. Throughput: 0: 42667.9. Samples: 324723660. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 14:47:02,922][09190] Avg episode reward: [(0, '0.763')] [2024-06-28 14:47:05,584][09423] Updated weights for policy 0, policy_version 246947 (0.0043) [2024-06-28 14:47:07,921][09190] Fps is (10 sec: 39321.6, 60 sec: 42871.5, 300 sec: 42542.9). Total num frames: 4046045184. Throughput: 0: 42539.5. Samples: 324977760. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 14:47:07,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:47:09,924][09423] Updated weights for policy 0, policy_version 246957 (0.0042) [2024-06-28 14:47:12,921][09190] Fps is (10 sec: 45875.0, 60 sec: 42871.4, 300 sec: 42709.5). Total num frames: 4046290944. Throughput: 0: 42739.1. Samples: 325105620. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 14:47:12,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 14:47:13,053][09423] Updated weights for policy 0, policy_version 246967 (0.0033) [2024-06-28 14:47:17,710][09423] Updated weights for policy 0, policy_version 246977 (0.0038) [2024-06-28 14:47:17,921][09190] Fps is (10 sec: 44236.8, 60 sec: 42598.3, 300 sec: 42542.8). Total num frames: 4046487552. Throughput: 0: 42542.3. Samples: 325363820. Policy #0 lag: (min: 0.0, avg: 10.1, max: 22.0) [2024-06-28 14:47:17,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:47:20,484][09423] Updated weights for policy 0, policy_version 246987 (0.0031) [2024-06-28 14:47:22,924][09190] Fps is (10 sec: 40949.9, 60 sec: 42869.7, 300 sec: 42654.3). Total num frames: 4046700544. Throughput: 0: 42720.7. Samples: 325623980. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-28 14:47:22,924][09190] Avg episode reward: [(0, '0.759')] [2024-06-28 14:47:25,078][09423] Updated weights for policy 0, policy_version 246997 (0.0037) [2024-06-28 14:47:27,902][09423] Updated weights for policy 0, policy_version 247007 (0.0032) [2024-06-28 14:47:27,921][09190] Fps is (10 sec: 47513.7, 60 sec: 43417.6, 300 sec: 42765.0). Total num frames: 4046962688. Throughput: 0: 42831.0. Samples: 325753200. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-28 14:47:27,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 14:47:32,844][09423] Updated weights for policy 0, policy_version 247017 (0.0028) [2024-06-28 14:47:32,921][09190] Fps is (10 sec: 42608.8, 60 sec: 42325.3, 300 sec: 42598.4). Total num frames: 4047126528. Throughput: 0: 42955.9. Samples: 326016300. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-28 14:47:32,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:47:33,604][09403] Signal inference workers to stop experience collection... (4550 times) [2024-06-28 14:47:33,604][09403] Signal inference workers to resume experience collection... (4550 times) [2024-06-28 14:47:33,651][09423] InferenceWorker_p0-w0: stopping experience collection (4550 times) [2024-06-28 14:47:33,652][09423] InferenceWorker_p0-w0: resuming experience collection (4550 times) [2024-06-28 14:47:35,657][09423] Updated weights for policy 0, policy_version 247027 (0.0036) [2024-06-28 14:47:37,921][09190] Fps is (10 sec: 39321.5, 60 sec: 43144.5, 300 sec: 42709.5). Total num frames: 4047355904. Throughput: 0: 42797.7. Samples: 326267860. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-28 14:47:37,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 14:47:40,390][09423] Updated weights for policy 0, policy_version 247037 (0.0032) [2024-06-28 14:47:42,921][09190] Fps is (10 sec: 45875.2, 60 sec: 42871.4, 300 sec: 42765.0). Total num frames: 4047585280. Throughput: 0: 42933.3. Samples: 326395760. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-28 14:47:42,922][09190] Avg episode reward: [(0, '0.761')] [2024-06-28 14:47:43,692][09423] Updated weights for policy 0, policy_version 247047 (0.0038) [2024-06-28 14:47:47,921][09190] Fps is (10 sec: 40960.1, 60 sec: 42598.4, 300 sec: 42598.4). Total num frames: 4047765504. Throughput: 0: 42885.3. Samples: 326653500. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-28 14:47:47,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 14:47:48,061][09423] Updated weights for policy 0, policy_version 247057 (0.0032) [2024-06-28 14:47:51,542][09423] Updated weights for policy 0, policy_version 247067 (0.0033) [2024-06-28 14:47:52,921][09190] Fps is (10 sec: 40960.3, 60 sec: 43417.6, 300 sec: 42765.0). Total num frames: 4047994880. Throughput: 0: 42724.5. Samples: 326900360. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-28 14:47:52,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:47:55,818][09423] Updated weights for policy 0, policy_version 247077 (0.0042) [2024-06-28 14:47:57,921][09190] Fps is (10 sec: 44237.1, 60 sec: 42598.5, 300 sec: 42765.0). Total num frames: 4048207872. Throughput: 0: 42794.7. Samples: 327031380. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-28 14:47:57,922][09190] Avg episode reward: [(0, '0.763')] [2024-06-28 14:47:58,986][09423] Updated weights for policy 0, policy_version 247087 (0.0031) [2024-06-28 14:48:02,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42871.5, 300 sec: 42598.4). Total num frames: 4048404480. Throughput: 0: 42907.2. Samples: 327294640. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-28 14:48:02,928][09190] Avg episode reward: [(0, '0.761')] [2024-06-28 14:48:03,288][09423] Updated weights for policy 0, policy_version 247097 (0.0047) [2024-06-28 14:48:06,334][09423] Updated weights for policy 0, policy_version 247107 (0.0034) [2024-06-28 14:48:07,921][09190] Fps is (10 sec: 44236.3, 60 sec: 43417.6, 300 sec: 42876.1). Total num frames: 4048650240. Throughput: 0: 42654.3. Samples: 327543320. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-28 14:48:07,922][09190] Avg episode reward: [(0, '0.776')] [2024-06-28 14:48:11,343][09423] Updated weights for policy 0, policy_version 247117 (0.0036) [2024-06-28 14:48:12,921][09190] Fps is (10 sec: 45875.2, 60 sec: 42871.5, 300 sec: 42765.0). Total num frames: 4048863232. Throughput: 0: 42757.9. Samples: 327677300. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-28 14:48:12,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 14:48:14,205][09423] Updated weights for policy 0, policy_version 247127 (0.0034) [2024-06-28 14:48:17,921][09190] Fps is (10 sec: 37683.5, 60 sec: 42325.4, 300 sec: 42542.9). Total num frames: 4049027072. Throughput: 0: 42588.5. Samples: 327932780. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-28 14:48:17,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:48:17,944][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000247133_4049027072.pth... [2024-06-28 14:48:18,021][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000246510_4038819840.pth [2024-06-28 14:48:18,687][09423] Updated weights for policy 0, policy_version 247137 (0.0030) [2024-06-28 14:48:22,158][09423] Updated weights for policy 0, policy_version 247147 (0.0038) [2024-06-28 14:48:22,921][09190] Fps is (10 sec: 42597.8, 60 sec: 43146.3, 300 sec: 42876.1). Total num frames: 4049289216. Throughput: 0: 42624.0. Samples: 328185940. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-28 14:48:22,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:48:26,534][09423] Updated weights for policy 0, policy_version 247157 (0.0049) [2024-06-28 14:48:27,921][09190] Fps is (10 sec: 45875.1, 60 sec: 42052.3, 300 sec: 42709.5). Total num frames: 4049485824. Throughput: 0: 42683.6. Samples: 328316520. Policy #0 lag: (min: 0.0, avg: 9.0, max: 22.0) [2024-06-28 14:48:27,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 14:48:29,667][09423] Updated weights for policy 0, policy_version 247167 (0.0034) [2024-06-28 14:48:32,921][09190] Fps is (10 sec: 39322.0, 60 sec: 42598.5, 300 sec: 42598.4). Total num frames: 4049682432. Throughput: 0: 42600.1. Samples: 328570500. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 14:48:32,922][09190] Avg episode reward: [(0, '0.762')] [2024-06-28 14:48:34,016][09423] Updated weights for policy 0, policy_version 247177 (0.0033) [2024-06-28 14:48:37,354][09423] Updated weights for policy 0, policy_version 247187 (0.0033) [2024-06-28 14:48:37,921][09190] Fps is (10 sec: 44236.7, 60 sec: 42871.5, 300 sec: 42820.6). Total num frames: 4049928192. Throughput: 0: 42720.4. Samples: 328822780. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 14:48:37,922][09190] Avg episode reward: [(0, '0.759')] [2024-06-28 14:48:41,700][09423] Updated weights for policy 0, policy_version 247197 (0.0033) [2024-06-28 14:48:42,922][09190] Fps is (10 sec: 44235.9, 60 sec: 42325.3, 300 sec: 42709.5). Total num frames: 4050124800. Throughput: 0: 42938.5. Samples: 328963620. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 14:48:42,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:48:44,812][09423] Updated weights for policy 0, policy_version 247207 (0.0039) [2024-06-28 14:48:47,921][09190] Fps is (10 sec: 39321.2, 60 sec: 42598.3, 300 sec: 42598.4). Total num frames: 4050321408. Throughput: 0: 42600.3. Samples: 329211660. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 14:48:47,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:48:49,477][09423] Updated weights for policy 0, policy_version 247217 (0.0035) [2024-06-28 14:48:51,245][09403] Signal inference workers to stop experience collection... (4600 times) [2024-06-28 14:48:51,246][09403] Signal inference workers to resume experience collection... (4600 times) [2024-06-28 14:48:51,291][09423] InferenceWorker_p0-w0: stopping experience collection (4600 times) [2024-06-28 14:48:51,291][09423] InferenceWorker_p0-w0: resuming experience collection (4600 times) [2024-06-28 14:48:52,492][09423] Updated weights for policy 0, policy_version 247227 (0.0035) [2024-06-28 14:48:52,924][09190] Fps is (10 sec: 44226.5, 60 sec: 42869.7, 300 sec: 42764.7). Total num frames: 4050567168. Throughput: 0: 42601.3. Samples: 329460480. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 14:48:52,924][09190] Avg episode reward: [(0, '0.761')] [2024-06-28 14:48:57,197][09423] Updated weights for policy 0, policy_version 247237 (0.0033) [2024-06-28 14:48:57,921][09190] Fps is (10 sec: 44237.7, 60 sec: 42598.4, 300 sec: 42653.9). Total num frames: 4050763776. Throughput: 0: 42663.6. Samples: 329597160. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 14:48:57,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 14:49:00,524][09423] Updated weights for policy 0, policy_version 247247 (0.0028) [2024-06-28 14:49:02,921][09190] Fps is (10 sec: 39331.5, 60 sec: 42598.3, 300 sec: 42653.9). Total num frames: 4050960384. Throughput: 0: 42453.3. Samples: 329843180. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 14:49:02,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 14:49:05,001][09423] Updated weights for policy 0, policy_version 247257 (0.0033) [2024-06-28 14:49:07,921][09190] Fps is (10 sec: 44236.8, 60 sec: 42598.5, 300 sec: 42765.0). Total num frames: 4051206144. Throughput: 0: 42652.6. Samples: 330105300. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 14:49:07,922][09190] Avg episode reward: [(0, '0.761')] [2024-06-28 14:49:08,024][09423] Updated weights for policy 0, policy_version 247267 (0.0043) [2024-06-28 14:49:12,508][09423] Updated weights for policy 0, policy_version 247277 (0.0041) [2024-06-28 14:49:12,921][09190] Fps is (10 sec: 44237.0, 60 sec: 42325.3, 300 sec: 42654.0). Total num frames: 4051402752. Throughput: 0: 42653.4. Samples: 330235920. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 14:49:12,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 14:49:15,778][09423] Updated weights for policy 0, policy_version 247287 (0.0032) [2024-06-28 14:49:17,921][09190] Fps is (10 sec: 40959.8, 60 sec: 43144.5, 300 sec: 42709.5). Total num frames: 4051615744. Throughput: 0: 42649.3. Samples: 330489720. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 14:49:17,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:49:20,198][09423] Updated weights for policy 0, policy_version 247297 (0.0030) [2024-06-28 14:49:22,921][09190] Fps is (10 sec: 44236.4, 60 sec: 42598.4, 300 sec: 42820.5). Total num frames: 4051845120. Throughput: 0: 42739.5. Samples: 330746060. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 14:49:22,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:49:23,251][09423] Updated weights for policy 0, policy_version 247307 (0.0039) [2024-06-28 14:49:27,630][09423] Updated weights for policy 0, policy_version 247317 (0.0033) [2024-06-28 14:49:27,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42598.4, 300 sec: 42598.6). Total num frames: 4052041728. Throughput: 0: 42559.7. Samples: 330878800. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 14:49:27,922][09190] Avg episode reward: [(0, '0.759')] [2024-06-28 14:49:30,639][09423] Updated weights for policy 0, policy_version 247327 (0.0029) [2024-06-28 14:49:32,923][09190] Fps is (10 sec: 40953.1, 60 sec: 42870.2, 300 sec: 42764.8). Total num frames: 4052254720. Throughput: 0: 42761.6. Samples: 331136000. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 14:49:32,924][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:49:35,451][09423] Updated weights for policy 0, policy_version 247337 (0.0029) [2024-06-28 14:49:37,921][09190] Fps is (10 sec: 44237.1, 60 sec: 42598.5, 300 sec: 42709.5). Total num frames: 4052484096. Throughput: 0: 42952.2. Samples: 331393220. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 14:49:37,922][09190] Avg episode reward: [(0, '0.761')] [2024-06-28 14:49:38,574][09423] Updated weights for policy 0, policy_version 247347 (0.0031) [2024-06-28 14:49:42,921][09190] Fps is (10 sec: 42605.5, 60 sec: 42598.5, 300 sec: 42653.9). Total num frames: 4052680704. Throughput: 0: 42642.5. Samples: 331516080. Policy #0 lag: (min: 0.0, avg: 11.1, max: 25.0) [2024-06-28 14:49:42,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:49:43,251][09423] Updated weights for policy 0, policy_version 247357 (0.0039) [2024-06-28 14:49:46,553][09423] Updated weights for policy 0, policy_version 247367 (0.0037) [2024-06-28 14:49:47,921][09190] Fps is (10 sec: 40959.5, 60 sec: 42871.5, 300 sec: 42765.0). Total num frames: 4052893696. Throughput: 0: 42857.7. Samples: 331771780. Policy #0 lag: (min: 0.0, avg: 11.1, max: 25.0) [2024-06-28 14:49:47,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:49:50,693][09423] Updated weights for policy 0, policy_version 247377 (0.0033) [2024-06-28 14:49:52,921][09190] Fps is (10 sec: 44237.0, 60 sec: 42600.2, 300 sec: 42820.5). Total num frames: 4053123072. Throughput: 0: 42862.1. Samples: 332034100. Policy #0 lag: (min: 0.0, avg: 11.1, max: 25.0) [2024-06-28 14:49:52,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:49:54,150][09423] Updated weights for policy 0, policy_version 247387 (0.0052) [2024-06-28 14:49:57,924][09190] Fps is (10 sec: 42587.6, 60 sec: 42596.5, 300 sec: 42598.0). Total num frames: 4053319680. Throughput: 0: 42712.2. Samples: 332158080. Policy #0 lag: (min: 0.0, avg: 11.1, max: 25.0) [2024-06-28 14:49:57,924][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 14:49:58,445][09423] Updated weights for policy 0, policy_version 247397 (0.0032) [2024-06-28 14:50:01,673][09423] Updated weights for policy 0, policy_version 247407 (0.0033) [2024-06-28 14:50:02,921][09190] Fps is (10 sec: 42598.6, 60 sec: 43144.5, 300 sec: 42820.6). Total num frames: 4053549056. Throughput: 0: 42724.9. Samples: 332412340. Policy #0 lag: (min: 0.0, avg: 11.1, max: 25.0) [2024-06-28 14:50:02,922][09190] Avg episode reward: [(0, '0.765')] [2024-06-28 14:50:06,202][09423] Updated weights for policy 0, policy_version 247417 (0.0037) [2024-06-28 14:50:07,921][09190] Fps is (10 sec: 42609.2, 60 sec: 42325.2, 300 sec: 42709.5). Total num frames: 4053745664. Throughput: 0: 42850.7. Samples: 332674340. Policy #0 lag: (min: 0.0, avg: 11.1, max: 25.0) [2024-06-28 14:50:07,922][09190] Avg episode reward: [(0, '0.762')] [2024-06-28 14:50:09,368][09423] Updated weights for policy 0, policy_version 247427 (0.0028) [2024-06-28 14:50:12,921][09190] Fps is (10 sec: 39321.3, 60 sec: 42325.3, 300 sec: 42598.4). Total num frames: 4053942272. Throughput: 0: 42543.5. Samples: 332793260. Policy #0 lag: (min: 0.0, avg: 11.1, max: 25.0) [2024-06-28 14:50:12,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 14:50:13,904][09423] Updated weights for policy 0, policy_version 247437 (0.0043) [2024-06-28 14:50:17,158][09423] Updated weights for policy 0, policy_version 247447 (0.0032) [2024-06-28 14:50:17,921][09190] Fps is (10 sec: 45875.7, 60 sec: 43144.6, 300 sec: 42876.1). Total num frames: 4054204416. Throughput: 0: 42480.8. Samples: 333047560. Policy #0 lag: (min: 0.0, avg: 11.1, max: 25.0) [2024-06-28 14:50:17,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 14:50:17,946][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000247449_4054204416.pth... [2024-06-28 14:50:18,000][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000246823_4043948032.pth [2024-06-28 14:50:21,640][09423] Updated weights for policy 0, policy_version 247457 (0.0034) [2024-06-28 14:50:22,921][09190] Fps is (10 sec: 44237.3, 60 sec: 42325.4, 300 sec: 42709.8). Total num frames: 4054384640. Throughput: 0: 42616.9. Samples: 333310980. Policy #0 lag: (min: 0.0, avg: 11.1, max: 25.0) [2024-06-28 14:50:22,922][09190] Avg episode reward: [(0, '0.759')] [2024-06-28 14:50:24,948][09423] Updated weights for policy 0, policy_version 247467 (0.0036) [2024-06-28 14:50:27,921][09190] Fps is (10 sec: 39321.1, 60 sec: 42598.3, 300 sec: 42653.9). Total num frames: 4054597632. Throughput: 0: 42576.0. Samples: 333432000. Policy #0 lag: (min: 0.0, avg: 11.1, max: 25.0) [2024-06-28 14:50:27,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 14:50:29,136][09423] Updated weights for policy 0, policy_version 247477 (0.0030) [2024-06-28 14:50:30,772][09403] Signal inference workers to stop experience collection... (4650 times) [2024-06-28 14:50:30,814][09423] InferenceWorker_p0-w0: stopping experience collection (4650 times) [2024-06-28 14:50:30,823][09403] Signal inference workers to resume experience collection... (4650 times) [2024-06-28 14:50:30,833][09423] InferenceWorker_p0-w0: resuming experience collection (4650 times) [2024-06-28 14:50:32,383][09423] Updated weights for policy 0, policy_version 247487 (0.0038) [2024-06-28 14:50:32,921][09190] Fps is (10 sec: 45875.0, 60 sec: 43145.8, 300 sec: 42876.1). Total num frames: 4054843392. Throughput: 0: 42744.9. Samples: 333695300. Policy #0 lag: (min: 0.0, avg: 11.1, max: 25.0) [2024-06-28 14:50:32,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 14:50:36,604][09423] Updated weights for policy 0, policy_version 247497 (0.0030) [2024-06-28 14:50:37,921][09190] Fps is (10 sec: 44236.9, 60 sec: 42598.3, 300 sec: 42765.0). Total num frames: 4055040000. Throughput: 0: 42491.6. Samples: 333946220. Policy #0 lag: (min: 0.0, avg: 11.1, max: 25.0) [2024-06-28 14:50:37,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:50:40,177][09423] Updated weights for policy 0, policy_version 247507 (0.0032) [2024-06-28 14:50:42,921][09190] Fps is (10 sec: 39321.2, 60 sec: 42598.4, 300 sec: 42598.4). Total num frames: 4055236608. Throughput: 0: 42541.5. Samples: 334072340. Policy #0 lag: (min: 0.0, avg: 11.1, max: 25.0) [2024-06-28 14:50:42,922][09190] Avg episode reward: [(0, '0.768')] [2024-06-28 14:50:44,653][09423] Updated weights for policy 0, policy_version 247517 (0.0031) [2024-06-28 14:50:47,755][09423] Updated weights for policy 0, policy_version 247527 (0.0041) [2024-06-28 14:50:47,922][09190] Fps is (10 sec: 44236.3, 60 sec: 43144.5, 300 sec: 42876.1). Total num frames: 4055482368. Throughput: 0: 42706.1. Samples: 334334120. Policy #0 lag: (min: 0.0, avg: 11.1, max: 25.0) [2024-06-28 14:50:47,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:50:52,256][09423] Updated weights for policy 0, policy_version 247537 (0.0039) [2024-06-28 14:50:52,921][09190] Fps is (10 sec: 42599.0, 60 sec: 42325.4, 300 sec: 42709.5). Total num frames: 4055662592. Throughput: 0: 42571.6. Samples: 334590060. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 14:50:52,922][09190] Avg episode reward: [(0, '0.765')] [2024-06-28 14:50:55,378][09423] Updated weights for policy 0, policy_version 247547 (0.0031) [2024-06-28 14:50:57,921][09190] Fps is (10 sec: 40960.3, 60 sec: 42873.3, 300 sec: 42765.2). Total num frames: 4055891968. Throughput: 0: 42729.8. Samples: 334716100. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 14:50:57,922][09190] Avg episode reward: [(0, '0.759')] [2024-06-28 14:50:59,624][09423] Updated weights for policy 0, policy_version 247557 (0.0029) [2024-06-28 14:51:02,922][09190] Fps is (10 sec: 44236.0, 60 sec: 42598.3, 300 sec: 42820.6). Total num frames: 4056104960. Throughput: 0: 43001.1. Samples: 334982620. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 14:51:02,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:51:03,227][09423] Updated weights for policy 0, policy_version 247567 (0.0037) [2024-06-28 14:51:07,549][09423] Updated weights for policy 0, policy_version 247577 (0.0028) [2024-06-28 14:51:07,921][09190] Fps is (10 sec: 44236.7, 60 sec: 43144.5, 300 sec: 42765.0). Total num frames: 4056334336. Throughput: 0: 42736.3. Samples: 335234120. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 14:51:07,922][09190] Avg episode reward: [(0, '0.761')] [2024-06-28 14:51:10,952][09423] Updated weights for policy 0, policy_version 247587 (0.0038) [2024-06-28 14:51:12,921][09190] Fps is (10 sec: 42598.9, 60 sec: 43144.6, 300 sec: 42709.5). Total num frames: 4056530944. Throughput: 0: 42922.3. Samples: 335363500. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 14:51:12,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:51:15,046][09423] Updated weights for policy 0, policy_version 247597 (0.0033) [2024-06-28 14:51:17,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42325.3, 300 sec: 42765.0). Total num frames: 4056743936. Throughput: 0: 42838.6. Samples: 335623040. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 14:51:17,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:51:18,392][09423] Updated weights for policy 0, policy_version 247607 (0.0035) [2024-06-28 14:51:22,648][09423] Updated weights for policy 0, policy_version 247617 (0.0038) [2024-06-28 14:51:22,922][09190] Fps is (10 sec: 44236.3, 60 sec: 43144.4, 300 sec: 42765.0). Total num frames: 4056973312. Throughput: 0: 42851.0. Samples: 335874520. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 14:51:22,924][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:51:25,828][09423] Updated weights for policy 0, policy_version 247627 (0.0034) [2024-06-28 14:51:27,921][09190] Fps is (10 sec: 42598.6, 60 sec: 42871.5, 300 sec: 42654.0). Total num frames: 4057169920. Throughput: 0: 42887.2. Samples: 336002260. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 14:51:27,922][09190] Avg episode reward: [(0, '0.764')] [2024-06-28 14:51:30,365][09423] Updated weights for policy 0, policy_version 247637 (0.0030) [2024-06-28 14:51:32,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42325.3, 300 sec: 42765.0). Total num frames: 4057382912. Throughput: 0: 42891.2. Samples: 336264220. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 14:51:32,922][09190] Avg episode reward: [(0, '0.761')] [2024-06-28 14:51:33,882][09423] Updated weights for policy 0, policy_version 247647 (0.0037) [2024-06-28 14:51:37,921][09190] Fps is (10 sec: 42598.1, 60 sec: 42598.4, 300 sec: 42653.9). Total num frames: 4057595904. Throughput: 0: 42934.6. Samples: 336522120. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 14:51:37,922][09190] Avg episode reward: [(0, '0.759')] [2024-06-28 14:51:37,991][09423] Updated weights for policy 0, policy_version 247657 (0.0033) [2024-06-28 14:51:41,299][09423] Updated weights for policy 0, policy_version 247667 (0.0037) [2024-06-28 14:51:42,921][09190] Fps is (10 sec: 44237.1, 60 sec: 43144.6, 300 sec: 42765.0). Total num frames: 4057825280. Throughput: 0: 42989.4. Samples: 336650620. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 14:51:42,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 14:51:45,238][09403] Signal inference workers to stop experience collection... (4700 times) [2024-06-28 14:51:45,292][09423] InferenceWorker_p0-w0: stopping experience collection (4700 times) [2024-06-28 14:51:45,294][09403] Signal inference workers to resume experience collection... (4700 times) [2024-06-28 14:51:45,302][09423] InferenceWorker_p0-w0: resuming experience collection (4700 times) [2024-06-28 14:51:45,465][09423] Updated weights for policy 0, policy_version 247677 (0.0040) [2024-06-28 14:51:47,921][09190] Fps is (10 sec: 42599.0, 60 sec: 42325.5, 300 sec: 42820.6). Total num frames: 4058021888. Throughput: 0: 42767.8. Samples: 336907160. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 14:51:47,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:51:49,248][09423] Updated weights for policy 0, policy_version 247687 (0.0035) [2024-06-28 14:51:52,922][09190] Fps is (10 sec: 40959.4, 60 sec: 42871.3, 300 sec: 42653.9). Total num frames: 4058234880. Throughput: 0: 42888.8. Samples: 337164120. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 14:51:52,922][09190] Avg episode reward: [(0, '0.759')] [2024-06-28 14:51:53,257][09423] Updated weights for policy 0, policy_version 247697 (0.0028) [2024-06-28 14:51:56,712][09423] Updated weights for policy 0, policy_version 247707 (0.0035) [2024-06-28 14:51:57,921][09190] Fps is (10 sec: 42598.0, 60 sec: 42598.4, 300 sec: 42765.0). Total num frames: 4058447872. Throughput: 0: 42805.8. Samples: 337289760. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 14:51:57,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 14:52:00,941][09423] Updated weights for policy 0, policy_version 247717 (0.0036) [2024-06-28 14:52:02,922][09190] Fps is (10 sec: 42598.4, 60 sec: 42598.4, 300 sec: 42765.0). Total num frames: 4058660864. Throughput: 0: 42762.1. Samples: 337547340. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-28 14:52:02,922][09190] Avg episode reward: [(0, '0.759')] [2024-06-28 14:52:04,203][09423] Updated weights for policy 0, policy_version 247727 (0.0035) [2024-06-28 14:52:07,921][09190] Fps is (10 sec: 44237.2, 60 sec: 42598.5, 300 sec: 42709.5). Total num frames: 4058890240. Throughput: 0: 42942.4. Samples: 337806920. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-28 14:52:07,922][09190] Avg episode reward: [(0, '0.767')] [2024-06-28 14:52:08,415][09423] Updated weights for policy 0, policy_version 247737 (0.0035) [2024-06-28 14:52:11,907][09423] Updated weights for policy 0, policy_version 247747 (0.0028) [2024-06-28 14:52:12,921][09190] Fps is (10 sec: 44237.8, 60 sec: 42871.5, 300 sec: 42765.0). Total num frames: 4059103232. Throughput: 0: 43002.3. Samples: 337937360. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-28 14:52:12,922][09190] Avg episode reward: [(0, '0.765')] [2024-06-28 14:52:16,113][09423] Updated weights for policy 0, policy_version 247757 (0.0037) [2024-06-28 14:52:17,921][09190] Fps is (10 sec: 40959.4, 60 sec: 42598.4, 300 sec: 42709.8). Total num frames: 4059299840. Throughput: 0: 42973.4. Samples: 338198020. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-28 14:52:17,922][09190] Avg episode reward: [(0, '0.761')] [2024-06-28 14:52:17,936][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000247760_4059299840.pth... [2024-06-28 14:52:18,025][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000247133_4049027072.pth [2024-06-28 14:52:19,377][09423] Updated weights for policy 0, policy_version 247767 (0.0034) [2024-06-28 14:52:22,921][09190] Fps is (10 sec: 42597.9, 60 sec: 42598.5, 300 sec: 42598.4). Total num frames: 4059529216. Throughput: 0: 42944.9. Samples: 338454640. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-28 14:52:22,922][09190] Avg episode reward: [(0, '0.760')] [2024-06-28 14:52:23,747][09423] Updated weights for policy 0, policy_version 247777 (0.0035) [2024-06-28 14:52:27,424][09423] Updated weights for policy 0, policy_version 247787 (0.0037) [2024-06-28 14:52:27,921][09190] Fps is (10 sec: 45875.7, 60 sec: 43144.6, 300 sec: 42820.6). Total num frames: 4059758592. Throughput: 0: 42895.6. Samples: 338580920. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-28 14:52:27,928][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 14:52:31,650][09423] Updated weights for policy 0, policy_version 247797 (0.0028) [2024-06-28 14:52:32,921][09190] Fps is (10 sec: 42598.7, 60 sec: 42871.6, 300 sec: 42709.5). Total num frames: 4059955200. Throughput: 0: 42868.8. Samples: 338836260. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-28 14:52:32,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:52:34,837][09423] Updated weights for policy 0, policy_version 247807 (0.0039) [2024-06-28 14:52:37,922][09190] Fps is (10 sec: 39320.7, 60 sec: 42598.3, 300 sec: 42598.4). Total num frames: 4060151808. Throughput: 0: 42750.2. Samples: 339087880. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-28 14:52:37,922][09190] Avg episode reward: [(0, '0.762')] [2024-06-28 14:52:39,185][09423] Updated weights for policy 0, policy_version 247817 (0.0041) [2024-06-28 14:52:42,775][09423] Updated weights for policy 0, policy_version 247827 (0.0023) [2024-06-28 14:52:42,921][09190] Fps is (10 sec: 44236.3, 60 sec: 42871.4, 300 sec: 42820.5). Total num frames: 4060397568. Throughput: 0: 42721.7. Samples: 339212240. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-28 14:52:42,922][09190] Avg episode reward: [(0, '0.761')] [2024-06-28 14:52:47,094][09423] Updated weights for policy 0, policy_version 247837 (0.0026) [2024-06-28 14:52:47,921][09190] Fps is (10 sec: 42599.3, 60 sec: 42598.4, 300 sec: 42653.9). Total num frames: 4060577792. Throughput: 0: 42621.1. Samples: 339465280. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-28 14:52:47,922][09190] Avg episode reward: [(0, '0.761')] [2024-06-28 14:52:50,182][09423] Updated weights for policy 0, policy_version 247847 (0.0032) [2024-06-28 14:52:52,921][09190] Fps is (10 sec: 40960.6, 60 sec: 42871.6, 300 sec: 42709.5). Total num frames: 4060807168. Throughput: 0: 42762.6. Samples: 339731240. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-28 14:52:52,922][09190] Avg episode reward: [(0, '0.760')] [2024-06-28 14:52:54,535][09423] Updated weights for policy 0, policy_version 247857 (0.0031) [2024-06-28 14:52:57,197][09403] Signal inference workers to stop experience collection... (4750 times) [2024-06-28 14:52:57,197][09403] Signal inference workers to resume experience collection... (4750 times) [2024-06-28 14:52:57,210][09423] InferenceWorker_p0-w0: stopping experience collection (4750 times) [2024-06-28 14:52:57,216][09423] InferenceWorker_p0-w0: resuming experience collection (4750 times) [2024-06-28 14:52:57,924][09190] Fps is (10 sec: 45863.4, 60 sec: 43142.7, 300 sec: 42820.2). Total num frames: 4061036544. Throughput: 0: 42713.5. Samples: 339859580. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-28 14:52:57,924][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:52:58,490][09423] Updated weights for policy 0, policy_version 247867 (0.0036) [2024-06-28 14:53:02,136][09423] Updated weights for policy 0, policy_version 247877 (0.0033) [2024-06-28 14:53:02,924][09190] Fps is (10 sec: 42587.4, 60 sec: 42869.8, 300 sec: 42653.6). Total num frames: 4061233152. Throughput: 0: 42472.3. Samples: 340109380. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-28 14:53:02,924][09190] Avg episode reward: [(0, '0.763')] [2024-06-28 14:53:06,017][09423] Updated weights for policy 0, policy_version 247887 (0.0039) [2024-06-28 14:53:07,921][09190] Fps is (10 sec: 39331.8, 60 sec: 42325.3, 300 sec: 42598.4). Total num frames: 4061429760. Throughput: 0: 42545.0. Samples: 340369160. Policy #0 lag: (min: 0.0, avg: 9.3, max: 22.0) [2024-06-28 14:53:07,922][09190] Avg episode reward: [(0, '0.760')] [2024-06-28 14:53:10,062][09423] Updated weights for policy 0, policy_version 247897 (0.0036) [2024-06-28 14:53:12,921][09190] Fps is (10 sec: 42609.4, 60 sec: 42598.4, 300 sec: 42820.6). Total num frames: 4061659136. Throughput: 0: 42528.9. Samples: 340494720. Policy #0 lag: (min: 2.0, avg: 10.4, max: 23.0) [2024-06-28 14:53:12,922][09190] Avg episode reward: [(0, '0.762')] [2024-06-28 14:53:13,516][09423] Updated weights for policy 0, policy_version 247907 (0.0027) [2024-06-28 14:53:17,470][09423] Updated weights for policy 0, policy_version 247917 (0.0026) [2024-06-28 14:53:17,921][09190] Fps is (10 sec: 44236.3, 60 sec: 42871.5, 300 sec: 42653.9). Total num frames: 4061872128. Throughput: 0: 42578.1. Samples: 340752280. Policy #0 lag: (min: 2.0, avg: 10.4, max: 23.0) [2024-06-28 14:53:17,922][09190] Avg episode reward: [(0, '0.762')] [2024-06-28 14:53:21,379][09423] Updated weights for policy 0, policy_version 247927 (0.0048) [2024-06-28 14:53:22,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42598.5, 300 sec: 42709.5). Total num frames: 4062085120. Throughput: 0: 42570.0. Samples: 341003520. Policy #0 lag: (min: 2.0, avg: 10.4, max: 23.0) [2024-06-28 14:53:22,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:53:25,491][09423] Updated weights for policy 0, policy_version 247937 (0.0021) [2024-06-28 14:53:27,921][09190] Fps is (10 sec: 44236.8, 60 sec: 42598.3, 300 sec: 42820.5). Total num frames: 4062314496. Throughput: 0: 42782.3. Samples: 341137440. Policy #0 lag: (min: 2.0, avg: 10.4, max: 23.0) [2024-06-28 14:53:27,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 14:53:28,765][09423] Updated weights for policy 0, policy_version 247947 (0.0023) [2024-06-28 14:53:32,922][09190] Fps is (10 sec: 42597.5, 60 sec: 42598.3, 300 sec: 42653.9). Total num frames: 4062511104. Throughput: 0: 42931.8. Samples: 341397220. Policy #0 lag: (min: 2.0, avg: 10.4, max: 23.0) [2024-06-28 14:53:32,922][09190] Avg episode reward: [(0, '0.759')] [2024-06-28 14:53:32,986][09423] Updated weights for policy 0, policy_version 247957 (0.0042) [2024-06-28 14:53:36,607][09423] Updated weights for policy 0, policy_version 247967 (0.0043) [2024-06-28 14:53:37,921][09190] Fps is (10 sec: 40960.3, 60 sec: 42871.6, 300 sec: 42709.5). Total num frames: 4062724096. Throughput: 0: 42459.1. Samples: 341641900. Policy #0 lag: (min: 2.0, avg: 10.4, max: 23.0) [2024-06-28 14:53:37,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:53:41,009][09423] Updated weights for policy 0, policy_version 247977 (0.0038) [2024-06-28 14:53:42,921][09190] Fps is (10 sec: 44237.4, 60 sec: 42598.4, 300 sec: 42820.6). Total num frames: 4062953472. Throughput: 0: 42540.1. Samples: 341773780. Policy #0 lag: (min: 2.0, avg: 10.4, max: 23.0) [2024-06-28 14:53:42,922][09190] Avg episode reward: [(0, '0.763')] [2024-06-28 14:53:44,119][09423] Updated weights for policy 0, policy_version 247987 (0.0029) [2024-06-28 14:53:47,921][09190] Fps is (10 sec: 40960.1, 60 sec: 42598.4, 300 sec: 42598.8). Total num frames: 4063133696. Throughput: 0: 42597.1. Samples: 342026140. Policy #0 lag: (min: 2.0, avg: 10.4, max: 23.0) [2024-06-28 14:53:47,922][09190] Avg episode reward: [(0, '0.759')] [2024-06-28 14:53:48,572][09423] Updated weights for policy 0, policy_version 247997 (0.0040) [2024-06-28 14:53:52,061][09423] Updated weights for policy 0, policy_version 248007 (0.0034) [2024-06-28 14:53:52,922][09190] Fps is (10 sec: 42598.0, 60 sec: 42871.3, 300 sec: 42765.0). Total num frames: 4063379456. Throughput: 0: 42330.9. Samples: 342274060. Policy #0 lag: (min: 2.0, avg: 10.4, max: 23.0) [2024-06-28 14:53:52,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 14:53:56,238][09423] Updated weights for policy 0, policy_version 248017 (0.0023) [2024-06-28 14:53:57,922][09190] Fps is (10 sec: 44235.9, 60 sec: 42327.0, 300 sec: 42765.0). Total num frames: 4063576064. Throughput: 0: 42576.3. Samples: 342410660. Policy #0 lag: (min: 2.0, avg: 10.4, max: 23.0) [2024-06-28 14:53:57,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:53:59,913][09423] Updated weights for policy 0, policy_version 248027 (0.0039) [2024-06-28 14:54:02,922][09190] Fps is (10 sec: 40960.0, 60 sec: 42600.1, 300 sec: 42653.9). Total num frames: 4063789056. Throughput: 0: 42443.5. Samples: 342662240. Policy #0 lag: (min: 2.0, avg: 10.4, max: 23.0) [2024-06-28 14:54:02,922][09190] Avg episode reward: [(0, '0.761')] [2024-06-28 14:54:04,088][09423] Updated weights for policy 0, policy_version 248037 (0.0028) [2024-06-28 14:54:07,333][09423] Updated weights for policy 0, policy_version 248047 (0.0028) [2024-06-28 14:54:07,924][09190] Fps is (10 sec: 42588.0, 60 sec: 42869.6, 300 sec: 42709.1). Total num frames: 4064002048. Throughput: 0: 42559.7. Samples: 342918820. Policy #0 lag: (min: 2.0, avg: 10.4, max: 23.0) [2024-06-28 14:54:07,925][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:54:11,752][09423] Updated weights for policy 0, policy_version 248057 (0.0032) [2024-06-28 14:54:12,921][09190] Fps is (10 sec: 40960.4, 60 sec: 42325.3, 300 sec: 42653.9). Total num frames: 4064198656. Throughput: 0: 42466.2. Samples: 343048420. Policy #0 lag: (min: 2.0, avg: 10.4, max: 23.0) [2024-06-28 14:54:12,922][09190] Avg episode reward: [(0, '0.759')] [2024-06-28 14:54:15,310][09423] Updated weights for policy 0, policy_version 248067 (0.0034) [2024-06-28 14:54:17,921][09190] Fps is (10 sec: 42609.5, 60 sec: 42598.5, 300 sec: 42653.9). Total num frames: 4064428032. Throughput: 0: 42293.5. Samples: 343300420. Policy #0 lag: (min: 2.0, avg: 10.4, max: 23.0) [2024-06-28 14:54:17,922][09190] Avg episode reward: [(0, '0.763')] [2024-06-28 14:54:17,944][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000248073_4064428032.pth... [2024-06-28 14:54:18,000][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000247449_4054204416.pth [2024-06-28 14:54:19,295][09423] Updated weights for policy 0, policy_version 248077 (0.0032) [2024-06-28 14:54:22,769][09423] Updated weights for policy 0, policy_version 248087 (0.0038) [2024-06-28 14:54:22,921][09190] Fps is (10 sec: 45875.1, 60 sec: 42871.4, 300 sec: 42765.0). Total num frames: 4064657408. Throughput: 0: 42627.5. Samples: 343560140. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 14:54:22,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 14:54:27,029][09423] Updated weights for policy 0, policy_version 248097 (0.0023) [2024-06-28 14:54:27,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42052.3, 300 sec: 42654.2). Total num frames: 4064837632. Throughput: 0: 42585.4. Samples: 343690120. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 14:54:27,922][09190] Avg episode reward: [(0, '0.763')] [2024-06-28 14:54:28,662][09403] Signal inference workers to stop experience collection... (4800 times) [2024-06-28 14:54:28,698][09423] InferenceWorker_p0-w0: stopping experience collection (4800 times) [2024-06-28 14:54:28,719][09403] Signal inference workers to resume experience collection... (4800 times) [2024-06-28 14:54:28,721][09423] InferenceWorker_p0-w0: resuming experience collection (4800 times) [2024-06-28 14:54:30,266][09423] Updated weights for policy 0, policy_version 248107 (0.0026) [2024-06-28 14:54:32,924][09190] Fps is (10 sec: 42588.0, 60 sec: 42869.8, 300 sec: 42709.1). Total num frames: 4065083392. Throughput: 0: 42613.1. Samples: 343943840. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 14:54:32,924][09190] Avg episode reward: [(0, '0.760')] [2024-06-28 14:54:34,610][09423] Updated weights for policy 0, policy_version 248117 (0.0030) [2024-06-28 14:54:37,921][09190] Fps is (10 sec: 44236.9, 60 sec: 42598.4, 300 sec: 42709.5). Total num frames: 4065280000. Throughput: 0: 42861.1. Samples: 344202800. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 14:54:37,922][09190] Avg episode reward: [(0, '0.759')] [2024-06-28 14:54:38,270][09423] Updated weights for policy 0, policy_version 248127 (0.0041) [2024-06-28 14:54:42,357][09423] Updated weights for policy 0, policy_version 248137 (0.0030) [2024-06-28 14:54:42,921][09190] Fps is (10 sec: 40969.9, 60 sec: 42325.3, 300 sec: 42709.5). Total num frames: 4065492992. Throughput: 0: 42649.0. Samples: 344329860. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 14:54:42,922][09190] Avg episode reward: [(0, '0.760')] [2024-06-28 14:54:45,787][09423] Updated weights for policy 0, policy_version 248147 (0.0035) [2024-06-28 14:54:47,921][09190] Fps is (10 sec: 44236.4, 60 sec: 43144.5, 300 sec: 42709.5). Total num frames: 4065722368. Throughput: 0: 42675.2. Samples: 344582620. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 14:54:47,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:54:49,849][09423] Updated weights for policy 0, policy_version 248157 (0.0038) [2024-06-28 14:54:52,921][09190] Fps is (10 sec: 42598.6, 60 sec: 42325.4, 300 sec: 42709.8). Total num frames: 4065918976. Throughput: 0: 42760.7. Samples: 344842940. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 14:54:52,924][09190] Avg episode reward: [(0, '0.760')] [2024-06-28 14:54:53,358][09423] Updated weights for policy 0, policy_version 248167 (0.0032) [2024-06-28 14:54:57,921][09190] Fps is (10 sec: 39321.5, 60 sec: 42325.4, 300 sec: 42598.4). Total num frames: 4066115584. Throughput: 0: 42744.0. Samples: 344971900. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 14:54:57,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 14:54:58,148][09423] Updated weights for policy 0, policy_version 248177 (0.0034) [2024-06-28 14:55:01,255][09423] Updated weights for policy 0, policy_version 248187 (0.0038) [2024-06-28 14:55:02,922][09190] Fps is (10 sec: 44236.3, 60 sec: 42871.5, 300 sec: 42765.0). Total num frames: 4066361344. Throughput: 0: 42723.0. Samples: 345222960. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 14:55:02,922][09190] Avg episode reward: [(0, '0.761')] [2024-06-28 14:55:05,577][09423] Updated weights for policy 0, policy_version 248197 (0.0033) [2024-06-28 14:55:07,921][09190] Fps is (10 sec: 44237.6, 60 sec: 42600.3, 300 sec: 42765.0). Total num frames: 4066557952. Throughput: 0: 42800.2. Samples: 345486140. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 14:55:07,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:55:08,835][09423] Updated weights for policy 0, policy_version 248207 (0.0028) [2024-06-28 14:55:12,921][09190] Fps is (10 sec: 40960.7, 60 sec: 42871.5, 300 sec: 42598.4). Total num frames: 4066770944. Throughput: 0: 42632.0. Samples: 345608560. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 14:55:12,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 14:55:13,054][09423] Updated weights for policy 0, policy_version 248217 (0.0037) [2024-06-28 14:55:16,732][09423] Updated weights for policy 0, policy_version 248227 (0.0051) [2024-06-28 14:55:17,928][09190] Fps is (10 sec: 45844.8, 60 sec: 43139.8, 300 sec: 42819.6). Total num frames: 4067016704. Throughput: 0: 42658.0. Samples: 345863620. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 14:55:17,929][09190] Avg episode reward: [(0, '0.762')] [2024-06-28 14:55:21,014][09423] Updated weights for policy 0, policy_version 248237 (0.0028) [2024-06-28 14:55:22,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42325.4, 300 sec: 42709.5). Total num frames: 4067196928. Throughput: 0: 42704.0. Samples: 346124480. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 14:55:22,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 14:55:24,242][09423] Updated weights for policy 0, policy_version 248247 (0.0030) [2024-06-28 14:55:27,921][09190] Fps is (10 sec: 39347.4, 60 sec: 42871.5, 300 sec: 42598.4). Total num frames: 4067409920. Throughput: 0: 42581.0. Samples: 346246000. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 14:55:27,922][09190] Avg episode reward: [(0, '0.760')] [2024-06-28 14:55:28,491][09423] Updated weights for policy 0, policy_version 248257 (0.0026) [2024-06-28 14:55:31,810][09423] Updated weights for policy 0, policy_version 248267 (0.0027) [2024-06-28 14:55:32,921][09190] Fps is (10 sec: 45874.7, 60 sec: 42873.2, 300 sec: 42765.0). Total num frames: 4067655680. Throughput: 0: 42703.5. Samples: 346504280. Policy #0 lag: (min: 1.0, avg: 9.6, max: 21.0) [2024-06-28 14:55:32,922][09190] Avg episode reward: [(0, '0.759')] [2024-06-28 14:55:36,123][09423] Updated weights for policy 0, policy_version 248277 (0.0032) [2024-06-28 14:55:37,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42598.4, 300 sec: 42709.5). Total num frames: 4067835904. Throughput: 0: 42666.7. Samples: 346762940. Policy #0 lag: (min: 1.0, avg: 9.6, max: 21.0) [2024-06-28 14:55:37,922][09190] Avg episode reward: [(0, '0.759')] [2024-06-28 14:55:39,474][09423] Updated weights for policy 0, policy_version 248287 (0.0037) [2024-06-28 14:55:42,924][09190] Fps is (10 sec: 39312.0, 60 sec: 42596.7, 300 sec: 42598.1). Total num frames: 4068048896. Throughput: 0: 42523.5. Samples: 346885560. Policy #0 lag: (min: 1.0, avg: 9.6, max: 21.0) [2024-06-28 14:55:42,924][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:55:44,180][09423] Updated weights for policy 0, policy_version 248297 (0.0036) [2024-06-28 14:55:47,191][09423] Updated weights for policy 0, policy_version 248307 (0.0029) [2024-06-28 14:55:47,921][09190] Fps is (10 sec: 44236.2, 60 sec: 42598.4, 300 sec: 42765.0). Total num frames: 4068278272. Throughput: 0: 42731.6. Samples: 347145880. Policy #0 lag: (min: 1.0, avg: 9.6, max: 21.0) [2024-06-28 14:55:47,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:55:51,737][09423] Updated weights for policy 0, policy_version 248317 (0.0030) [2024-06-28 14:55:52,921][09190] Fps is (10 sec: 42609.2, 60 sec: 42598.4, 300 sec: 42654.0). Total num frames: 4068474880. Throughput: 0: 42524.8. Samples: 347399760. Policy #0 lag: (min: 1.0, avg: 9.6, max: 21.0) [2024-06-28 14:55:52,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:55:55,149][09423] Updated weights for policy 0, policy_version 248327 (0.0036) [2024-06-28 14:55:55,638][09403] Signal inference workers to stop experience collection... (4850 times) [2024-06-28 14:55:55,638][09403] Signal inference workers to resume experience collection... (4850 times) [2024-06-28 14:55:55,685][09423] InferenceWorker_p0-w0: stopping experience collection (4850 times) [2024-06-28 14:55:55,685][09423] InferenceWorker_p0-w0: resuming experience collection (4850 times) [2024-06-28 14:55:57,921][09190] Fps is (10 sec: 40960.4, 60 sec: 42871.5, 300 sec: 42654.0). Total num frames: 4068687872. Throughput: 0: 42485.3. Samples: 347520400. Policy #0 lag: (min: 1.0, avg: 9.6, max: 21.0) [2024-06-28 14:55:57,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 14:55:59,433][09423] Updated weights for policy 0, policy_version 248337 (0.0038) [2024-06-28 14:56:02,602][09423] Updated weights for policy 0, policy_version 248347 (0.0038) [2024-06-28 14:56:02,921][09190] Fps is (10 sec: 44237.1, 60 sec: 42598.6, 300 sec: 42654.0). Total num frames: 4068917248. Throughput: 0: 42820.1. Samples: 347790240. Policy #0 lag: (min: 1.0, avg: 9.6, max: 21.0) [2024-06-28 14:56:02,922][09190] Avg episode reward: [(0, '0.759')] [2024-06-28 14:56:06,899][09423] Updated weights for policy 0, policy_version 248357 (0.0027) [2024-06-28 14:56:07,921][09190] Fps is (10 sec: 44236.3, 60 sec: 42871.3, 300 sec: 42709.5). Total num frames: 4069130240. Throughput: 0: 42705.2. Samples: 348046220. Policy #0 lag: (min: 1.0, avg: 9.6, max: 21.0) [2024-06-28 14:56:07,922][09190] Avg episode reward: [(0, '0.759')] [2024-06-28 14:56:10,393][09423] Updated weights for policy 0, policy_version 248367 (0.0044) [2024-06-28 14:56:12,921][09190] Fps is (10 sec: 42598.1, 60 sec: 42871.4, 300 sec: 42709.5). Total num frames: 4069343232. Throughput: 0: 42746.2. Samples: 348169580. Policy #0 lag: (min: 1.0, avg: 9.6, max: 21.0) [2024-06-28 14:56:12,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 14:56:14,501][09423] Updated weights for policy 0, policy_version 248377 (0.0030) [2024-06-28 14:56:17,714][09423] Updated weights for policy 0, policy_version 248387 (0.0048) [2024-06-28 14:56:17,924][09190] Fps is (10 sec: 44226.1, 60 sec: 42601.2, 300 sec: 42709.1). Total num frames: 4069572608. Throughput: 0: 42765.6. Samples: 348428840. Policy #0 lag: (min: 1.0, avg: 9.6, max: 21.0) [2024-06-28 14:56:17,924][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:56:17,939][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000248387_4069572608.pth... [2024-06-28 14:56:17,987][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000247760_4059299840.pth [2024-06-28 14:56:22,231][09423] Updated weights for policy 0, policy_version 248397 (0.0027) [2024-06-28 14:56:22,921][09190] Fps is (10 sec: 40960.1, 60 sec: 42598.4, 300 sec: 42653.9). Total num frames: 4069752832. Throughput: 0: 42801.3. Samples: 348689000. Policy #0 lag: (min: 1.0, avg: 9.6, max: 21.0) [2024-06-28 14:56:22,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 14:56:25,773][09423] Updated weights for policy 0, policy_version 248407 (0.0043) [2024-06-28 14:56:27,921][09190] Fps is (10 sec: 40970.6, 60 sec: 42871.5, 300 sec: 42709.5). Total num frames: 4069982208. Throughput: 0: 42782.4. Samples: 348810660. Policy #0 lag: (min: 1.0, avg: 9.6, max: 21.0) [2024-06-28 14:56:27,922][09190] Avg episode reward: [(0, '0.759')] [2024-06-28 14:56:30,017][09423] Updated weights for policy 0, policy_version 248417 (0.0035) [2024-06-28 14:56:32,921][09190] Fps is (10 sec: 44236.5, 60 sec: 42325.4, 300 sec: 42709.5). Total num frames: 4070195200. Throughput: 0: 42805.0. Samples: 349072100. Policy #0 lag: (min: 1.0, avg: 9.6, max: 21.0) [2024-06-28 14:56:32,922][09190] Avg episode reward: [(0, '0.759')] [2024-06-28 14:56:33,230][09423] Updated weights for policy 0, policy_version 248427 (0.0048) [2024-06-28 14:56:37,489][09423] Updated weights for policy 0, policy_version 248437 (0.0041) [2024-06-28 14:56:37,922][09190] Fps is (10 sec: 42597.6, 60 sec: 42871.3, 300 sec: 42653.9). Total num frames: 4070408192. Throughput: 0: 42879.8. Samples: 349329360. Policy #0 lag: (min: 1.0, avg: 9.6, max: 21.0) [2024-06-28 14:56:37,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:56:40,851][09423] Updated weights for policy 0, policy_version 248447 (0.0046) [2024-06-28 14:56:42,921][09190] Fps is (10 sec: 44237.2, 60 sec: 43146.4, 300 sec: 42765.0). Total num frames: 4070637568. Throughput: 0: 42995.6. Samples: 349455200. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 14:56:42,922][09190] Avg episode reward: [(0, '0.762')] [2024-06-28 14:56:45,195][09423] Updated weights for policy 0, policy_version 248457 (0.0034) [2024-06-28 14:56:47,924][09190] Fps is (10 sec: 44226.4, 60 sec: 42869.7, 300 sec: 42764.7). Total num frames: 4070850560. Throughput: 0: 42718.0. Samples: 349712660. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 14:56:47,933][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:56:48,398][09423] Updated weights for policy 0, policy_version 248467 (0.0035) [2024-06-28 14:56:52,921][09190] Fps is (10 sec: 39321.1, 60 sec: 42598.3, 300 sec: 42653.9). Total num frames: 4071030784. Throughput: 0: 42689.4. Samples: 349967240. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 14:56:52,922][09190] Avg episode reward: [(0, '0.761')] [2024-06-28 14:56:53,242][09423] Updated weights for policy 0, policy_version 248477 (0.0037) [2024-06-28 14:56:56,261][09423] Updated weights for policy 0, policy_version 248487 (0.0035) [2024-06-28 14:56:57,921][09190] Fps is (10 sec: 40970.6, 60 sec: 42871.5, 300 sec: 42709.5). Total num frames: 4071260160. Throughput: 0: 42668.5. Samples: 350089660. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 14:56:57,921][09190] Avg episode reward: [(0, '0.756')] [2024-06-28 14:57:00,701][09423] Updated weights for policy 0, policy_version 248497 (0.0028) [2024-06-28 14:57:02,921][09190] Fps is (10 sec: 44237.3, 60 sec: 42598.4, 300 sec: 42653.9). Total num frames: 4071473152. Throughput: 0: 42734.5. Samples: 350351780. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 14:57:02,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 14:57:04,088][09423] Updated weights for policy 0, policy_version 248507 (0.0042) [2024-06-28 14:57:07,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42325.5, 300 sec: 42598.4). Total num frames: 4071669760. Throughput: 0: 42599.1. Samples: 350605960. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 14:57:07,922][09190] Avg episode reward: [(0, '0.763')] [2024-06-28 14:57:08,238][09423] Updated weights for policy 0, policy_version 248517 (0.0024) [2024-06-28 14:57:11,834][09423] Updated weights for policy 0, policy_version 248527 (0.0031) [2024-06-28 14:57:12,921][09190] Fps is (10 sec: 44236.6, 60 sec: 42871.4, 300 sec: 42765.0). Total num frames: 4071915520. Throughput: 0: 42693.3. Samples: 350731860. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 14:57:12,922][09190] Avg episode reward: [(0, '0.763')] [2024-06-28 14:57:15,887][09423] Updated weights for policy 0, policy_version 248537 (0.0039) [2024-06-28 14:57:17,921][09190] Fps is (10 sec: 44236.7, 60 sec: 42327.2, 300 sec: 42654.0). Total num frames: 4072112128. Throughput: 0: 42641.9. Samples: 350990980. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 14:57:17,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 14:57:19,671][09423] Updated weights for policy 0, policy_version 248547 (0.0028) [2024-06-28 14:57:22,921][09190] Fps is (10 sec: 40959.8, 60 sec: 42871.4, 300 sec: 42598.4). Total num frames: 4072325120. Throughput: 0: 42678.7. Samples: 351249900. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 14:57:22,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 14:57:23,543][09423] Updated weights for policy 0, policy_version 248557 (0.0034) [2024-06-28 14:57:27,127][09423] Updated weights for policy 0, policy_version 248567 (0.0028) [2024-06-28 14:57:27,921][09190] Fps is (10 sec: 42598.0, 60 sec: 42598.3, 300 sec: 42653.9). Total num frames: 4072538112. Throughput: 0: 42577.7. Samples: 351371200. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 14:57:27,922][09190] Avg episode reward: [(0, '0.759')] [2024-06-28 14:57:31,664][09423] Updated weights for policy 0, policy_version 248577 (0.0039) [2024-06-28 14:57:32,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42598.4, 300 sec: 42709.5). Total num frames: 4072751104. Throughput: 0: 42660.5. Samples: 351632280. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 14:57:32,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 14:57:34,757][09423] Updated weights for policy 0, policy_version 248587 (0.0026) [2024-06-28 14:57:37,924][09190] Fps is (10 sec: 42587.7, 60 sec: 42596.7, 300 sec: 42598.0). Total num frames: 4072964096. Throughput: 0: 42598.1. Samples: 351884260. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 14:57:37,925][09190] Avg episode reward: [(0, '0.763')] [2024-06-28 14:57:39,081][09423] Updated weights for policy 0, policy_version 248597 (0.0031) [2024-06-28 14:57:42,504][09423] Updated weights for policy 0, policy_version 248607 (0.0025) [2024-06-28 14:57:42,921][09190] Fps is (10 sec: 42598.6, 60 sec: 42325.3, 300 sec: 42709.5). Total num frames: 4073177088. Throughput: 0: 42727.4. Samples: 352012400. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 14:57:42,922][09190] Avg episode reward: [(0, '0.761')] [2024-06-28 14:57:46,571][09423] Updated weights for policy 0, policy_version 248617 (0.0029) [2024-06-28 14:57:47,788][09403] Signal inference workers to stop experience collection... (4900 times) [2024-06-28 14:57:47,836][09423] InferenceWorker_p0-w0: stopping experience collection (4900 times) [2024-06-28 14:57:47,841][09403] Signal inference workers to resume experience collection... (4900 times) [2024-06-28 14:57:47,851][09423] InferenceWorker_p0-w0: resuming experience collection (4900 times) [2024-06-28 14:57:47,921][09190] Fps is (10 sec: 40969.9, 60 sec: 42053.9, 300 sec: 42598.4). Total num frames: 4073373696. Throughput: 0: 42501.2. Samples: 352264340. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 14:57:47,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:57:50,070][09423] Updated weights for policy 0, policy_version 248627 (0.0027) [2024-06-28 14:57:52,925][09190] Fps is (10 sec: 42583.6, 60 sec: 42869.0, 300 sec: 42598.3). Total num frames: 4073603072. Throughput: 0: 42557.9. Samples: 352521220. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-28 14:57:52,925][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:57:54,207][09423] Updated weights for policy 0, policy_version 248637 (0.0031) [2024-06-28 14:57:57,921][09190] Fps is (10 sec: 44237.4, 60 sec: 42598.3, 300 sec: 42654.3). Total num frames: 4073816064. Throughput: 0: 42758.7. Samples: 352656000. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-28 14:57:57,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:57:58,163][09423] Updated weights for policy 0, policy_version 248647 (0.0030) [2024-06-28 14:58:01,625][09423] Updated weights for policy 0, policy_version 248657 (0.0046) [2024-06-28 14:58:02,924][09190] Fps is (10 sec: 40964.0, 60 sec: 42323.5, 300 sec: 42653.6). Total num frames: 4074012672. Throughput: 0: 42686.4. Samples: 352911980. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-28 14:58:02,924][09190] Avg episode reward: [(0, '0.760')] [2024-06-28 14:58:05,472][09423] Updated weights for policy 0, policy_version 248667 (0.0044) [2024-06-28 14:58:07,922][09190] Fps is (10 sec: 44236.0, 60 sec: 43144.3, 300 sec: 42709.4). Total num frames: 4074258432. Throughput: 0: 42652.3. Samples: 353169260. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-28 14:58:07,922][09190] Avg episode reward: [(0, '0.763')] [2024-06-28 14:58:09,728][09423] Updated weights for policy 0, policy_version 248677 (0.0033) [2024-06-28 14:58:12,810][09423] Updated weights for policy 0, policy_version 248687 (0.0042) [2024-06-28 14:58:12,921][09190] Fps is (10 sec: 47525.9, 60 sec: 42871.5, 300 sec: 42765.0). Total num frames: 4074487808. Throughput: 0: 42859.2. Samples: 353299860. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-28 14:58:12,922][09190] Avg episode reward: [(0, '0.760')] [2024-06-28 14:58:17,231][09423] Updated weights for policy 0, policy_version 248697 (0.0043) [2024-06-28 14:58:17,922][09190] Fps is (10 sec: 40960.2, 60 sec: 42598.3, 300 sec: 42653.9). Total num frames: 4074668032. Throughput: 0: 42836.4. Samples: 353559920. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-28 14:58:17,922][09190] Avg episode reward: [(0, '0.760')] [2024-06-28 14:58:17,933][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000248698_4074668032.pth... [2024-06-28 14:58:17,995][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000248073_4064428032.pth [2024-06-28 14:58:20,650][09423] Updated weights for policy 0, policy_version 248707 (0.0037) [2024-06-28 14:58:22,921][09190] Fps is (10 sec: 40959.5, 60 sec: 42871.5, 300 sec: 42653.9). Total num frames: 4074897408. Throughput: 0: 42718.8. Samples: 353806500. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-28 14:58:22,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 14:58:25,043][09423] Updated weights for policy 0, policy_version 248717 (0.0026) [2024-06-28 14:58:27,922][09190] Fps is (10 sec: 44236.7, 60 sec: 42871.4, 300 sec: 42709.5). Total num frames: 4075110400. Throughput: 0: 42846.6. Samples: 353940500. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-28 14:58:27,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:58:28,492][09423] Updated weights for policy 0, policy_version 248727 (0.0034) [2024-06-28 14:58:32,759][09423] Updated weights for policy 0, policy_version 248737 (0.0036) [2024-06-28 14:58:32,921][09190] Fps is (10 sec: 40960.4, 60 sec: 42598.5, 300 sec: 42653.9). Total num frames: 4075307008. Throughput: 0: 43003.7. Samples: 354199500. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-28 14:58:32,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:58:36,227][09423] Updated weights for policy 0, policy_version 248747 (0.0025) [2024-06-28 14:58:37,921][09190] Fps is (10 sec: 42599.3, 60 sec: 42873.3, 300 sec: 42654.0). Total num frames: 4075536384. Throughput: 0: 42996.7. Samples: 354455920. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-28 14:58:37,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:58:40,246][09423] Updated weights for policy 0, policy_version 248757 (0.0048) [2024-06-28 14:58:42,921][09190] Fps is (10 sec: 45874.6, 60 sec: 43144.5, 300 sec: 42820.5). Total num frames: 4075765760. Throughput: 0: 42767.0. Samples: 354580520. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-28 14:58:42,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:58:43,692][09423] Updated weights for policy 0, policy_version 248767 (0.0045) [2024-06-28 14:58:47,922][09190] Fps is (10 sec: 39321.0, 60 sec: 42598.4, 300 sec: 42542.9). Total num frames: 4075929600. Throughput: 0: 42649.9. Samples: 354831120. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-28 14:58:47,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 14:58:48,247][09423] Updated weights for policy 0, policy_version 248777 (0.0032) [2024-06-28 14:58:51,305][09423] Updated weights for policy 0, policy_version 248787 (0.0033) [2024-06-28 14:58:52,921][09190] Fps is (10 sec: 39322.3, 60 sec: 42600.9, 300 sec: 42654.0). Total num frames: 4076158976. Throughput: 0: 42692.7. Samples: 355090420. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-28 14:58:52,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:58:55,796][09423] Updated weights for policy 0, policy_version 248797 (0.0034) [2024-06-28 14:58:57,921][09190] Fps is (10 sec: 45875.8, 60 sec: 42871.5, 300 sec: 42709.5). Total num frames: 4076388352. Throughput: 0: 42784.9. Samples: 355225180. Policy #0 lag: (min: 0.0, avg: 9.8, max: 22.0) [2024-06-28 14:58:57,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:58:59,055][09423] Updated weights for policy 0, policy_version 248807 (0.0037) [2024-06-28 14:59:02,921][09190] Fps is (10 sec: 42598.2, 60 sec: 42873.3, 300 sec: 42654.3). Total num frames: 4076584960. Throughput: 0: 42569.5. Samples: 355475540. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 14:59:02,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 14:59:03,428][09423] Updated weights for policy 0, policy_version 248817 (0.0040) [2024-06-28 14:59:06,629][09423] Updated weights for policy 0, policy_version 248827 (0.0030) [2024-06-28 14:59:07,921][09190] Fps is (10 sec: 44236.5, 60 sec: 42871.6, 300 sec: 42820.6). Total num frames: 4076830720. Throughput: 0: 42680.5. Samples: 355727120. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 14:59:07,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:59:11,307][09423] Updated weights for policy 0, policy_version 248837 (0.0031) [2024-06-28 14:59:11,834][09403] Signal inference workers to stop experience collection... (4950 times) [2024-06-28 14:59:11,886][09423] InferenceWorker_p0-w0: stopping experience collection (4950 times) [2024-06-28 14:59:11,894][09403] Signal inference workers to resume experience collection... (4950 times) [2024-06-28 14:59:11,909][09423] InferenceWorker_p0-w0: resuming experience collection (4950 times) [2024-06-28 14:59:12,921][09190] Fps is (10 sec: 45875.5, 60 sec: 42598.4, 300 sec: 42765.0). Total num frames: 4077043712. Throughput: 0: 42780.7. Samples: 355865620. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 14:59:12,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:59:14,133][09423] Updated weights for policy 0, policy_version 248847 (0.0040) [2024-06-28 14:59:17,921][09190] Fps is (10 sec: 39321.7, 60 sec: 42598.5, 300 sec: 42598.4). Total num frames: 4077223936. Throughput: 0: 42782.2. Samples: 356124700. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 14:59:17,922][09190] Avg episode reward: [(0, '0.760')] [2024-06-28 14:59:18,798][09423] Updated weights for policy 0, policy_version 248857 (0.0034) [2024-06-28 14:59:21,499][09423] Updated weights for policy 0, policy_version 248867 (0.0039) [2024-06-28 14:59:22,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42871.6, 300 sec: 42820.6). Total num frames: 4077469696. Throughput: 0: 42531.6. Samples: 356369840. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 14:59:22,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 14:59:26,617][09423] Updated weights for policy 0, policy_version 248877 (0.0037) [2024-06-28 14:59:27,921][09190] Fps is (10 sec: 44236.9, 60 sec: 42598.5, 300 sec: 42654.3). Total num frames: 4077666304. Throughput: 0: 42596.6. Samples: 356497360. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 14:59:27,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 14:59:29,697][09423] Updated weights for policy 0, policy_version 248887 (0.0035) [2024-06-28 14:59:32,921][09190] Fps is (10 sec: 39321.6, 60 sec: 42598.4, 300 sec: 42653.9). Total num frames: 4077862912. Throughput: 0: 42909.5. Samples: 356762040. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 14:59:32,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 14:59:34,102][09423] Updated weights for policy 0, policy_version 248897 (0.0032) [2024-06-28 14:59:37,138][09423] Updated weights for policy 0, policy_version 248907 (0.0032) [2024-06-28 14:59:37,921][09190] Fps is (10 sec: 44236.4, 60 sec: 42871.4, 300 sec: 42765.0). Total num frames: 4078108672. Throughput: 0: 42667.9. Samples: 357010480. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 14:59:37,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 14:59:41,842][09423] Updated weights for policy 0, policy_version 248917 (0.0032) [2024-06-28 14:59:42,921][09190] Fps is (10 sec: 42598.6, 60 sec: 42052.4, 300 sec: 42598.4). Total num frames: 4078288896. Throughput: 0: 42687.6. Samples: 357146120. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 14:59:42,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 14:59:44,622][09423] Updated weights for policy 0, policy_version 248927 (0.0031) [2024-06-28 14:59:47,921][09190] Fps is (10 sec: 40960.2, 60 sec: 43144.6, 300 sec: 42709.5). Total num frames: 4078518272. Throughput: 0: 42867.1. Samples: 357404560. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 14:59:47,922][09190] Avg episode reward: [(0, '0.760')] [2024-06-28 14:59:49,282][09423] Updated weights for policy 0, policy_version 248937 (0.0032) [2024-06-28 14:59:52,299][09423] Updated weights for policy 0, policy_version 248947 (0.0026) [2024-06-28 14:59:52,921][09190] Fps is (10 sec: 47513.2, 60 sec: 43417.6, 300 sec: 42876.1). Total num frames: 4078764032. Throughput: 0: 42896.5. Samples: 357657460. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 14:59:52,922][09190] Avg episode reward: [(0, '0.756')] [2024-06-28 14:59:56,864][09423] Updated weights for policy 0, policy_version 248957 (0.0023) [2024-06-28 14:59:57,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42325.3, 300 sec: 42598.4). Total num frames: 4078927872. Throughput: 0: 42795.0. Samples: 357791400. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 14:59:57,924][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 15:00:00,104][09423] Updated weights for policy 0, policy_version 248967 (0.0039) [2024-06-28 15:00:02,924][09190] Fps is (10 sec: 37673.6, 60 sec: 42596.6, 300 sec: 42653.6). Total num frames: 4079140864. Throughput: 0: 42519.4. Samples: 358038180. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 15:00:02,924][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 15:00:04,477][09423] Updated weights for policy 0, policy_version 248977 (0.0035) [2024-06-28 15:00:07,921][09190] Fps is (10 sec: 45875.3, 60 sec: 42598.4, 300 sec: 42765.0). Total num frames: 4079386624. Throughput: 0: 42542.2. Samples: 358284240. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 15:00:07,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 15:00:08,030][09423] Updated weights for policy 0, policy_version 248987 (0.0037) [2024-06-28 15:00:12,576][09423] Updated weights for policy 0, policy_version 248997 (0.0034) [2024-06-28 15:00:12,924][09190] Fps is (10 sec: 44236.9, 60 sec: 42323.5, 300 sec: 42599.0). Total num frames: 4079583232. Throughput: 0: 42658.1. Samples: 358417080. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 15:00:12,924][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 15:00:15,477][09423] Updated weights for policy 0, policy_version 249007 (0.0033) [2024-06-28 15:00:17,921][09190] Fps is (10 sec: 40960.4, 60 sec: 42871.5, 300 sec: 42709.5). Total num frames: 4079796224. Throughput: 0: 42417.8. Samples: 358670840. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 15:00:17,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 15:00:18,037][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000249012_4079812608.pth... [2024-06-28 15:00:18,100][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000248387_4069572608.pth [2024-06-28 15:00:20,026][09423] Updated weights for policy 0, policy_version 249017 (0.0031) [2024-06-28 15:00:22,922][09190] Fps is (10 sec: 44246.9, 60 sec: 42598.2, 300 sec: 42765.0). Total num frames: 4080025600. Throughput: 0: 42607.9. Samples: 358927840. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 15:00:22,922][09190] Avg episode reward: [(0, '0.756')] [2024-06-28 15:00:23,106][09423] Updated weights for policy 0, policy_version 249027 (0.0021) [2024-06-28 15:00:27,545][09423] Updated weights for policy 0, policy_version 249037 (0.0039) [2024-06-28 15:00:27,921][09190] Fps is (10 sec: 42598.1, 60 sec: 42598.4, 300 sec: 42598.4). Total num frames: 4080222208. Throughput: 0: 42561.7. Samples: 359061400. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 15:00:27,928][09190] Avg episode reward: [(0, '0.759')] [2024-06-28 15:00:30,956][09423] Updated weights for policy 0, policy_version 249047 (0.0028) [2024-06-28 15:00:32,921][09190] Fps is (10 sec: 40960.7, 60 sec: 42871.4, 300 sec: 42709.5). Total num frames: 4080435200. Throughput: 0: 42526.2. Samples: 359318240. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 15:00:32,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 15:00:34,453][09403] Signal inference workers to stop experience collection... (5000 times) [2024-06-28 15:00:34,458][09403] Signal inference workers to resume experience collection... (5000 times) [2024-06-28 15:00:34,484][09423] InferenceWorker_p0-w0: stopping experience collection (5000 times) [2024-06-28 15:00:34,484][09423] InferenceWorker_p0-w0: resuming experience collection (5000 times) [2024-06-28 15:00:35,090][09423] Updated weights for policy 0, policy_version 249057 (0.0041) [2024-06-28 15:00:37,921][09190] Fps is (10 sec: 42598.2, 60 sec: 42325.4, 300 sec: 42709.8). Total num frames: 4080648192. Throughput: 0: 42662.2. Samples: 359577260. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 15:00:37,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 15:00:38,680][09423] Updated weights for policy 0, policy_version 249067 (0.0041) [2024-06-28 15:00:42,921][09190] Fps is (10 sec: 44237.1, 60 sec: 43144.5, 300 sec: 42709.5). Total num frames: 4080877568. Throughput: 0: 42439.2. Samples: 359701160. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 15:00:42,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 15:00:42,934][09423] Updated weights for policy 0, policy_version 249077 (0.0034) [2024-06-28 15:00:46,335][09423] Updated weights for policy 0, policy_version 249087 (0.0038) [2024-06-28 15:00:47,922][09190] Fps is (10 sec: 44236.3, 60 sec: 42871.4, 300 sec: 42765.0). Total num frames: 4081090560. Throughput: 0: 42508.0. Samples: 359950940. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 15:00:47,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 15:00:50,806][09423] Updated weights for policy 0, policy_version 249097 (0.0030) [2024-06-28 15:00:52,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42325.3, 300 sec: 42765.0). Total num frames: 4081303552. Throughput: 0: 42945.8. Samples: 360216800. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 15:00:52,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 15:00:53,766][09423] Updated weights for policy 0, policy_version 249107 (0.0031) [2024-06-28 15:00:57,921][09190] Fps is (10 sec: 40960.5, 60 sec: 42871.5, 300 sec: 42653.9). Total num frames: 4081500160. Throughput: 0: 42826.8. Samples: 360344180. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 15:00:57,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 15:00:58,259][09423] Updated weights for policy 0, policy_version 249117 (0.0037) [2024-06-28 15:01:01,378][09423] Updated weights for policy 0, policy_version 249127 (0.0027) [2024-06-28 15:01:02,921][09190] Fps is (10 sec: 44236.6, 60 sec: 43419.4, 300 sec: 42765.0). Total num frames: 4081745920. Throughput: 0: 42919.9. Samples: 360602240. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 15:01:02,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 15:01:05,657][09423] Updated weights for policy 0, policy_version 249137 (0.0027) [2024-06-28 15:01:07,921][09190] Fps is (10 sec: 44237.0, 60 sec: 42598.4, 300 sec: 42709.5). Total num frames: 4081942528. Throughput: 0: 43143.4. Samples: 360869280. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 15:01:07,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 15:01:08,860][09423] Updated weights for policy 0, policy_version 249147 (0.0051) [2024-06-28 15:01:12,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42873.2, 300 sec: 42654.3). Total num frames: 4082155520. Throughput: 0: 42927.0. Samples: 360993120. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 15:01:12,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 15:01:13,574][09423] Updated weights for policy 0, policy_version 249157 (0.0027) [2024-06-28 15:01:16,590][09423] Updated weights for policy 0, policy_version 249167 (0.0029) [2024-06-28 15:01:17,921][09190] Fps is (10 sec: 44236.3, 60 sec: 43144.4, 300 sec: 42820.5). Total num frames: 4082384896. Throughput: 0: 42762.2. Samples: 361242540. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 15:01:17,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 15:01:21,424][09423] Updated weights for policy 0, policy_version 249177 (0.0038) [2024-06-28 15:01:22,921][09190] Fps is (10 sec: 44236.7, 60 sec: 42871.6, 300 sec: 42765.0). Total num frames: 4082597888. Throughput: 0: 42781.3. Samples: 361502420. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-28 15:01:22,922][09190] Avg episode reward: [(0, '0.763')] [2024-06-28 15:01:24,274][09423] Updated weights for policy 0, policy_version 249187 (0.0038) [2024-06-28 15:01:27,923][09190] Fps is (10 sec: 40954.5, 60 sec: 42870.4, 300 sec: 42709.3). Total num frames: 4082794496. Throughput: 0: 42816.4. Samples: 361627960. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-28 15:01:27,923][09190] Avg episode reward: [(0, '0.762')] [2024-06-28 15:01:29,011][09423] Updated weights for policy 0, policy_version 249197 (0.0030) [2024-06-28 15:01:32,225][09423] Updated weights for policy 0, policy_version 249207 (0.0031) [2024-06-28 15:01:32,921][09190] Fps is (10 sec: 42598.4, 60 sec: 43144.5, 300 sec: 42765.0). Total num frames: 4083023872. Throughput: 0: 43058.3. Samples: 361888560. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-28 15:01:32,922][09190] Avg episode reward: [(0, '0.759')] [2024-06-28 15:01:36,312][09423] Updated weights for policy 0, policy_version 249217 (0.0029) [2024-06-28 15:01:37,921][09190] Fps is (10 sec: 44243.1, 60 sec: 43144.5, 300 sec: 42709.5). Total num frames: 4083236864. Throughput: 0: 43092.0. Samples: 362155940. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-28 15:01:37,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 15:01:39,604][09423] Updated weights for policy 0, policy_version 249227 (0.0034) [2024-06-28 15:01:42,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42871.4, 300 sec: 42709.8). Total num frames: 4083449856. Throughput: 0: 42976.8. Samples: 362278140. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-28 15:01:42,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 15:01:43,692][09423] Updated weights for policy 0, policy_version 249237 (0.0032) [2024-06-28 15:01:47,027][09423] Updated weights for policy 0, policy_version 249247 (0.0027) [2024-06-28 15:01:47,921][09190] Fps is (10 sec: 44236.7, 60 sec: 43144.6, 300 sec: 42876.1). Total num frames: 4083679232. Throughput: 0: 43007.6. Samples: 362537580. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-28 15:01:47,922][09190] Avg episode reward: [(0, '0.761')] [2024-06-28 15:01:51,722][09423] Updated weights for policy 0, policy_version 249257 (0.0034) [2024-06-28 15:01:52,921][09190] Fps is (10 sec: 42598.8, 60 sec: 42871.5, 300 sec: 42765.0). Total num frames: 4083875840. Throughput: 0: 42871.9. Samples: 362798520. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-28 15:01:52,922][09190] Avg episode reward: [(0, '0.762')] [2024-06-28 15:01:54,900][09423] Updated weights for policy 0, policy_version 249267 (0.0045) [2024-06-28 15:01:56,008][09403] Signal inference workers to stop experience collection... (5050 times) [2024-06-28 15:01:56,047][09423] InferenceWorker_p0-w0: stopping experience collection (5050 times) [2024-06-28 15:01:56,070][09403] Signal inference workers to resume experience collection... (5050 times) [2024-06-28 15:01:56,077][09423] InferenceWorker_p0-w0: resuming experience collection (5050 times) [2024-06-28 15:01:57,921][09190] Fps is (10 sec: 40959.9, 60 sec: 43144.5, 300 sec: 42765.0). Total num frames: 4084088832. Throughput: 0: 42862.2. Samples: 362921920. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-28 15:01:57,922][09190] Avg episode reward: [(0, '0.759')] [2024-06-28 15:01:59,186][09423] Updated weights for policy 0, policy_version 249277 (0.0031) [2024-06-28 15:02:02,921][09190] Fps is (10 sec: 42598.7, 60 sec: 42598.5, 300 sec: 42820.6). Total num frames: 4084301824. Throughput: 0: 42892.1. Samples: 363172680. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-28 15:02:02,922][09190] Avg episode reward: [(0, '0.759')] [2024-06-28 15:02:02,989][09423] Updated weights for policy 0, policy_version 249287 (0.0030) [2024-06-28 15:02:06,633][09423] Updated weights for policy 0, policy_version 249297 (0.0028) [2024-06-28 15:02:07,921][09190] Fps is (10 sec: 42598.8, 60 sec: 42871.5, 300 sec: 42709.5). Total num frames: 4084514816. Throughput: 0: 43175.2. Samples: 363445300. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-28 15:02:07,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 15:02:10,354][09423] Updated weights for policy 0, policy_version 249307 (0.0037) [2024-06-28 15:02:12,923][09190] Fps is (10 sec: 42590.0, 60 sec: 42870.1, 300 sec: 42764.7). Total num frames: 4084727808. Throughput: 0: 43056.0. Samples: 363565500. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-28 15:02:12,924][09190] Avg episode reward: [(0, '0.756')] [2024-06-28 15:02:14,488][09423] Updated weights for policy 0, policy_version 249317 (0.0040) [2024-06-28 15:02:17,832][09423] Updated weights for policy 0, policy_version 249327 (0.0042) [2024-06-28 15:02:17,922][09190] Fps is (10 sec: 45874.2, 60 sec: 43144.5, 300 sec: 42876.1). Total num frames: 4084973568. Throughput: 0: 42932.8. Samples: 363820540. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-28 15:02:17,922][09190] Avg episode reward: [(0, '0.759')] [2024-06-28 15:02:17,937][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000249327_4084973568.pth... [2024-06-28 15:02:17,994][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000248698_4074668032.pth [2024-06-28 15:02:22,352][09423] Updated weights for policy 0, policy_version 249337 (0.0034) [2024-06-28 15:02:22,921][09190] Fps is (10 sec: 40968.0, 60 sec: 42325.4, 300 sec: 42709.5). Total num frames: 4085137408. Throughput: 0: 42900.5. Samples: 364086460. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-28 15:02:22,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 15:02:25,424][09423] Updated weights for policy 0, policy_version 249347 (0.0030) [2024-06-28 15:02:27,921][09190] Fps is (10 sec: 37683.6, 60 sec: 42599.4, 300 sec: 42709.5). Total num frames: 4085350400. Throughput: 0: 42888.5. Samples: 364208120. Policy #0 lag: (min: 0.0, avg: 9.5, max: 23.0) [2024-06-28 15:02:27,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 15:02:30,345][09423] Updated weights for policy 0, policy_version 249357 (0.0028) [2024-06-28 15:02:32,922][09190] Fps is (10 sec: 45874.4, 60 sec: 42871.4, 300 sec: 42820.9). Total num frames: 4085596160. Throughput: 0: 42729.7. Samples: 364460420. Policy #0 lag: (min: 1.0, avg: 11.7, max: 22.0) [2024-06-28 15:02:32,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 15:02:33,289][09423] Updated weights for policy 0, policy_version 249367 (0.0041) [2024-06-28 15:02:37,756][09423] Updated weights for policy 0, policy_version 249377 (0.0035) [2024-06-28 15:02:37,924][09190] Fps is (10 sec: 44225.5, 60 sec: 42596.5, 300 sec: 42764.6). Total num frames: 4085792768. Throughput: 0: 42617.5. Samples: 364716420. Policy #0 lag: (min: 1.0, avg: 11.7, max: 22.0) [2024-06-28 15:02:37,925][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 15:02:41,083][09423] Updated weights for policy 0, policy_version 249387 (0.0038) [2024-06-28 15:02:42,924][09190] Fps is (10 sec: 42588.3, 60 sec: 42869.8, 300 sec: 42875.8). Total num frames: 4086022144. Throughput: 0: 42620.4. Samples: 364839940. Policy #0 lag: (min: 1.0, avg: 11.7, max: 22.0) [2024-06-28 15:02:42,924][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 15:02:45,065][09423] Updated weights for policy 0, policy_version 249397 (0.0038) [2024-06-28 15:02:47,921][09190] Fps is (10 sec: 45887.2, 60 sec: 42871.5, 300 sec: 42876.6). Total num frames: 4086251520. Throughput: 0: 43088.4. Samples: 365111660. Policy #0 lag: (min: 1.0, avg: 11.7, max: 22.0) [2024-06-28 15:02:47,922][09190] Avg episode reward: [(0, '0.760')] [2024-06-28 15:02:48,483][09423] Updated weights for policy 0, policy_version 249407 (0.0023) [2024-06-28 15:02:52,921][09190] Fps is (10 sec: 40970.0, 60 sec: 42598.4, 300 sec: 42765.0). Total num frames: 4086431744. Throughput: 0: 42736.4. Samples: 365368440. Policy #0 lag: (min: 1.0, avg: 11.7, max: 22.0) [2024-06-28 15:02:52,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 15:02:53,060][09423] Updated weights for policy 0, policy_version 249417 (0.0026) [2024-06-28 15:02:56,110][09423] Updated weights for policy 0, policy_version 249427 (0.0039) [2024-06-28 15:02:57,921][09190] Fps is (10 sec: 42598.2, 60 sec: 43144.5, 300 sec: 42932.0). Total num frames: 4086677504. Throughput: 0: 42900.9. Samples: 365495960. Policy #0 lag: (min: 1.0, avg: 11.7, max: 22.0) [2024-06-28 15:02:57,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 15:03:00,400][09403] Signal inference workers to stop experience collection... (5100 times) [2024-06-28 15:03:00,401][09403] Signal inference workers to resume experience collection... (5100 times) [2024-06-28 15:03:00,441][09423] InferenceWorker_p0-w0: stopping experience collection (5100 times) [2024-06-28 15:03:00,441][09423] InferenceWorker_p0-w0: resuming experience collection (5100 times) [2024-06-28 15:03:00,535][09423] Updated weights for policy 0, policy_version 249437 (0.0036) [2024-06-28 15:03:02,922][09190] Fps is (10 sec: 45874.5, 60 sec: 43144.4, 300 sec: 42820.6). Total num frames: 4086890496. Throughput: 0: 43141.3. Samples: 365761900. Policy #0 lag: (min: 1.0, avg: 11.7, max: 22.0) [2024-06-28 15:03:02,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 15:03:03,625][09423] Updated weights for policy 0, policy_version 249447 (0.0037) [2024-06-28 15:03:07,921][09190] Fps is (10 sec: 39321.2, 60 sec: 42598.3, 300 sec: 42653.9). Total num frames: 4087070720. Throughput: 0: 42775.8. Samples: 366011380. Policy #0 lag: (min: 1.0, avg: 11.7, max: 22.0) [2024-06-28 15:03:07,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 15:03:08,614][09423] Updated weights for policy 0, policy_version 249457 (0.0032) [2024-06-28 15:03:11,203][09423] Updated weights for policy 0, policy_version 249467 (0.0035) [2024-06-28 15:03:12,921][09190] Fps is (10 sec: 42599.3, 60 sec: 43145.9, 300 sec: 42876.1). Total num frames: 4087316480. Throughput: 0: 42776.5. Samples: 366133060. Policy #0 lag: (min: 1.0, avg: 11.7, max: 22.0) [2024-06-28 15:03:12,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 15:03:16,058][09423] Updated weights for policy 0, policy_version 249477 (0.0041) [2024-06-28 15:03:17,921][09190] Fps is (10 sec: 44237.5, 60 sec: 42325.5, 300 sec: 42765.0). Total num frames: 4087513088. Throughput: 0: 42913.9. Samples: 366391540. Policy #0 lag: (min: 1.0, avg: 11.7, max: 22.0) [2024-06-28 15:03:17,922][09190] Avg episode reward: [(0, '0.759')] [2024-06-28 15:03:19,019][09423] Updated weights for policy 0, policy_version 249487 (0.0037) [2024-06-28 15:03:22,921][09190] Fps is (10 sec: 40959.6, 60 sec: 43144.4, 300 sec: 42765.0). Total num frames: 4087726080. Throughput: 0: 42963.7. Samples: 366649680. Policy #0 lag: (min: 1.0, avg: 11.7, max: 22.0) [2024-06-28 15:03:22,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 15:03:23,513][09423] Updated weights for policy 0, policy_version 249497 (0.0031) [2024-06-28 15:03:26,577][09423] Updated weights for policy 0, policy_version 249507 (0.0038) [2024-06-28 15:03:27,921][09190] Fps is (10 sec: 44236.5, 60 sec: 43417.6, 300 sec: 42876.1). Total num frames: 4087955456. Throughput: 0: 43120.6. Samples: 366780260. Policy #0 lag: (min: 1.0, avg: 11.7, max: 22.0) [2024-06-28 15:03:27,922][09190] Avg episode reward: [(0, '0.760')] [2024-06-28 15:03:31,160][09423] Updated weights for policy 0, policy_version 249517 (0.0024) [2024-06-28 15:03:32,922][09190] Fps is (10 sec: 42598.1, 60 sec: 42598.4, 300 sec: 42765.0). Total num frames: 4088152064. Throughput: 0: 42841.2. Samples: 367039520. Policy #0 lag: (min: 1.0, avg: 11.7, max: 22.0) [2024-06-28 15:03:32,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 15:03:34,523][09423] Updated weights for policy 0, policy_version 249527 (0.0043) [2024-06-28 15:03:37,921][09190] Fps is (10 sec: 42598.5, 60 sec: 43146.4, 300 sec: 42765.0). Total num frames: 4088381440. Throughput: 0: 42906.2. Samples: 367299220. Policy #0 lag: (min: 1.0, avg: 11.7, max: 22.0) [2024-06-28 15:03:37,922][09190] Avg episode reward: [(0, '0.761')] [2024-06-28 15:03:38,879][09423] Updated weights for policy 0, policy_version 249537 (0.0034) [2024-06-28 15:03:42,194][09423] Updated weights for policy 0, policy_version 249547 (0.0036) [2024-06-28 15:03:42,921][09190] Fps is (10 sec: 45875.8, 60 sec: 43146.3, 300 sec: 42987.2). Total num frames: 4088610816. Throughput: 0: 42891.6. Samples: 367426080. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 15:03:42,922][09190] Avg episode reward: [(0, '0.759')] [2024-06-28 15:03:46,395][09423] Updated weights for policy 0, policy_version 249557 (0.0038) [2024-06-28 15:03:47,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42325.3, 300 sec: 42820.5). Total num frames: 4088791040. Throughput: 0: 42669.1. Samples: 367682000. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 15:03:47,922][09190] Avg episode reward: [(0, '0.761')] [2024-06-28 15:03:49,767][09423] Updated weights for policy 0, policy_version 249567 (0.0045) [2024-06-28 15:03:52,921][09190] Fps is (10 sec: 40960.1, 60 sec: 43144.5, 300 sec: 42820.6). Total num frames: 4089020416. Throughput: 0: 42762.3. Samples: 367935680. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 15:03:52,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 15:03:54,227][09423] Updated weights for policy 0, policy_version 249577 (0.0029) [2024-06-28 15:03:57,425][09423] Updated weights for policy 0, policy_version 249587 (0.0036) [2024-06-28 15:03:57,921][09190] Fps is (10 sec: 45875.5, 60 sec: 42871.5, 300 sec: 42931.6). Total num frames: 4089249792. Throughput: 0: 43002.2. Samples: 368068160. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 15:03:57,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 15:04:01,509][09423] Updated weights for policy 0, policy_version 249597 (0.0026) [2024-06-28 15:04:02,921][09190] Fps is (10 sec: 44236.9, 60 sec: 42871.6, 300 sec: 42820.6). Total num frames: 4089462784. Throughput: 0: 43061.3. Samples: 368329300. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 15:04:02,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 15:04:05,098][09423] Updated weights for policy 0, policy_version 249607 (0.0035) [2024-06-28 15:04:07,921][09190] Fps is (10 sec: 42598.1, 60 sec: 43417.7, 300 sec: 42820.5). Total num frames: 4089675776. Throughput: 0: 43065.8. Samples: 368587640. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 15:04:07,922][09190] Avg episode reward: [(0, '0.759')] [2024-06-28 15:04:09,601][09423] Updated weights for policy 0, policy_version 249617 (0.0031) [2024-06-28 15:04:12,701][09423] Updated weights for policy 0, policy_version 249627 (0.0032) [2024-06-28 15:04:12,921][09190] Fps is (10 sec: 42598.2, 60 sec: 42871.4, 300 sec: 42931.6). Total num frames: 4089888768. Throughput: 0: 43032.4. Samples: 368716720. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 15:04:12,922][09190] Avg episode reward: [(0, '0.759')] [2024-06-28 15:04:17,444][09423] Updated weights for policy 0, policy_version 249637 (0.0036) [2024-06-28 15:04:17,921][09190] Fps is (10 sec: 39321.5, 60 sec: 42598.3, 300 sec: 42709.5). Total num frames: 4090068992. Throughput: 0: 43020.5. Samples: 368975440. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 15:04:17,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 15:04:18,040][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000249639_4090085376.pth... [2024-06-28 15:04:18,087][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000249012_4079812608.pth [2024-06-28 15:04:20,198][09423] Updated weights for policy 0, policy_version 249647 (0.0037) [2024-06-28 15:04:22,921][09190] Fps is (10 sec: 42598.3, 60 sec: 43144.5, 300 sec: 42876.1). Total num frames: 4090314752. Throughput: 0: 42703.5. Samples: 369220880. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 15:04:22,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 15:04:24,967][09423] Updated weights for policy 0, policy_version 249657 (0.0047) [2024-06-28 15:04:27,928][09190] Fps is (10 sec: 45845.1, 60 sec: 42866.8, 300 sec: 42930.7). Total num frames: 4090527744. Throughput: 0: 42742.6. Samples: 369349780. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 15:04:27,929][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 15:04:27,935][09403] Signal inference workers to stop experience collection... (5150 times) [2024-06-28 15:04:27,935][09403] Signal inference workers to resume experience collection... (5150 times) [2024-06-28 15:04:27,947][09423] InferenceWorker_p0-w0: stopping experience collection (5150 times) [2024-06-28 15:04:27,947][09423] InferenceWorker_p0-w0: resuming experience collection (5150 times) [2024-06-28 15:04:28,072][09423] Updated weights for policy 0, policy_version 249667 (0.0029) [2024-06-28 15:04:32,736][09423] Updated weights for policy 0, policy_version 249677 (0.0032) [2024-06-28 15:04:32,921][09190] Fps is (10 sec: 40960.1, 60 sec: 42871.5, 300 sec: 42765.0). Total num frames: 4090724352. Throughput: 0: 42804.9. Samples: 369608220. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 15:04:32,922][09190] Avg episode reward: [(0, '0.765')] [2024-06-28 15:04:35,826][09423] Updated weights for policy 0, policy_version 249687 (0.0032) [2024-06-28 15:04:37,922][09190] Fps is (10 sec: 44265.4, 60 sec: 43144.4, 300 sec: 42987.1). Total num frames: 4090970112. Throughput: 0: 42653.6. Samples: 369855100. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 15:04:37,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 15:04:40,102][09423] Updated weights for policy 0, policy_version 249697 (0.0035) [2024-06-28 15:04:42,921][09190] Fps is (10 sec: 42598.9, 60 sec: 42325.4, 300 sec: 42820.6). Total num frames: 4091150336. Throughput: 0: 42819.6. Samples: 369995040. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 15:04:42,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 15:04:43,332][09423] Updated weights for policy 0, policy_version 249707 (0.0038) [2024-06-28 15:04:47,466][09423] Updated weights for policy 0, policy_version 249717 (0.0042) [2024-06-28 15:04:47,921][09190] Fps is (10 sec: 39322.0, 60 sec: 42871.4, 300 sec: 42709.5). Total num frames: 4091363328. Throughput: 0: 42707.1. Samples: 370251120. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 15:04:47,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 15:04:51,220][09423] Updated weights for policy 0, policy_version 249727 (0.0027) [2024-06-28 15:04:52,923][09190] Fps is (10 sec: 47507.1, 60 sec: 43416.7, 300 sec: 43042.5). Total num frames: 4091625472. Throughput: 0: 42504.1. Samples: 370500380. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 15:04:52,923][09190] Avg episode reward: [(0, '0.760')] [2024-06-28 15:04:55,525][09423] Updated weights for policy 0, policy_version 249737 (0.0038) [2024-06-28 15:04:57,922][09190] Fps is (10 sec: 42594.8, 60 sec: 42324.7, 300 sec: 42876.3). Total num frames: 4091789312. Throughput: 0: 42761.4. Samples: 370641020. Policy #0 lag: (min: 0.0, avg: 11.7, max: 24.0) [2024-06-28 15:04:57,923][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 15:04:58,544][09423] Updated weights for policy 0, policy_version 249747 (0.0038) [2024-06-28 15:05:02,921][09190] Fps is (10 sec: 36049.6, 60 sec: 42052.3, 300 sec: 42709.5). Total num frames: 4091985920. Throughput: 0: 42587.6. Samples: 370891880. Policy #0 lag: (min: 0.0, avg: 11.7, max: 24.0) [2024-06-28 15:05:02,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 15:05:03,313][09423] Updated weights for policy 0, policy_version 249757 (0.0038) [2024-06-28 15:05:06,399][09423] Updated weights for policy 0, policy_version 249767 (0.0041) [2024-06-28 15:05:07,921][09190] Fps is (10 sec: 45879.1, 60 sec: 42871.4, 300 sec: 42932.0). Total num frames: 4092248064. Throughput: 0: 42645.8. Samples: 371139940. Policy #0 lag: (min: 0.0, avg: 11.7, max: 24.0) [2024-06-28 15:05:07,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 15:05:10,577][09423] Updated weights for policy 0, policy_version 249777 (0.0040) [2024-06-28 15:05:12,924][09190] Fps is (10 sec: 45863.4, 60 sec: 42596.7, 300 sec: 42875.7). Total num frames: 4092444672. Throughput: 0: 42819.9. Samples: 371276500. Policy #0 lag: (min: 0.0, avg: 11.7, max: 24.0) [2024-06-28 15:05:12,924][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:05:14,010][09423] Updated weights for policy 0, policy_version 249787 (0.0025) [2024-06-28 15:05:17,922][09190] Fps is (10 sec: 40959.7, 60 sec: 43144.5, 300 sec: 42820.6). Total num frames: 4092657664. Throughput: 0: 42855.9. Samples: 371536740. Policy #0 lag: (min: 0.0, avg: 11.7, max: 24.0) [2024-06-28 15:05:17,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 15:05:18,229][09423] Updated weights for policy 0, policy_version 249797 (0.0038) [2024-06-28 15:05:21,823][09423] Updated weights for policy 0, policy_version 249807 (0.0027) [2024-06-28 15:05:22,921][09190] Fps is (10 sec: 44248.2, 60 sec: 42871.6, 300 sec: 42931.6). Total num frames: 4092887040. Throughput: 0: 43001.1. Samples: 371790140. Policy #0 lag: (min: 0.0, avg: 11.7, max: 24.0) [2024-06-28 15:05:22,922][09190] Avg episode reward: [(0, '0.763')] [2024-06-28 15:05:25,849][09423] Updated weights for policy 0, policy_version 249817 (0.0043) [2024-06-28 15:05:27,921][09190] Fps is (10 sec: 40960.7, 60 sec: 42330.0, 300 sec: 42820.6). Total num frames: 4093067264. Throughput: 0: 42892.0. Samples: 371925180. Policy #0 lag: (min: 0.0, avg: 11.7, max: 24.0) [2024-06-28 15:05:27,922][09190] Avg episode reward: [(0, '0.759')] [2024-06-28 15:05:29,255][09423] Updated weights for policy 0, policy_version 249827 (0.0031) [2024-06-28 15:05:30,482][09403] Signal inference workers to stop experience collection... (5200 times) [2024-06-28 15:05:30,483][09403] Signal inference workers to resume experience collection... (5200 times) [2024-06-28 15:05:30,528][09423] InferenceWorker_p0-w0: stopping experience collection (5200 times) [2024-06-28 15:05:30,528][09423] InferenceWorker_p0-w0: resuming experience collection (5200 times) [2024-06-28 15:05:32,921][09190] Fps is (10 sec: 40959.4, 60 sec: 42871.4, 300 sec: 42876.1). Total num frames: 4093296640. Throughput: 0: 42805.8. Samples: 372177380. Policy #0 lag: (min: 0.0, avg: 11.7, max: 24.0) [2024-06-28 15:05:32,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 15:05:33,292][09423] Updated weights for policy 0, policy_version 249837 (0.0031) [2024-06-28 15:05:36,833][09423] Updated weights for policy 0, policy_version 249847 (0.0038) [2024-06-28 15:05:37,921][09190] Fps is (10 sec: 47513.2, 60 sec: 42871.6, 300 sec: 42931.6). Total num frames: 4093542400. Throughput: 0: 42814.1. Samples: 372426960. Policy #0 lag: (min: 0.0, avg: 11.7, max: 24.0) [2024-06-28 15:05:37,922][09190] Avg episode reward: [(0, '0.756')] [2024-06-28 15:05:41,168][09423] Updated weights for policy 0, policy_version 249857 (0.0029) [2024-06-28 15:05:42,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42598.3, 300 sec: 42765.0). Total num frames: 4093706240. Throughput: 0: 42699.0. Samples: 372562440. Policy #0 lag: (min: 0.0, avg: 11.7, max: 24.0) [2024-06-28 15:05:42,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 15:05:44,948][09423] Updated weights for policy 0, policy_version 249867 (0.0034) [2024-06-28 15:05:47,921][09190] Fps is (10 sec: 39321.9, 60 sec: 42871.5, 300 sec: 42820.6). Total num frames: 4093935616. Throughput: 0: 42623.1. Samples: 372809920. Policy #0 lag: (min: 0.0, avg: 11.7, max: 24.0) [2024-06-28 15:05:47,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:05:48,835][09423] Updated weights for policy 0, policy_version 249877 (0.0039) [2024-06-28 15:05:52,357][09423] Updated weights for policy 0, policy_version 249887 (0.0033) [2024-06-28 15:05:52,921][09190] Fps is (10 sec: 47513.4, 60 sec: 42599.3, 300 sec: 42987.2). Total num frames: 4094181376. Throughput: 0: 42890.6. Samples: 373070020. Policy #0 lag: (min: 0.0, avg: 11.7, max: 24.0) [2024-06-28 15:05:52,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:05:56,089][09423] Updated weights for policy 0, policy_version 249897 (0.0034) [2024-06-28 15:05:57,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42872.2, 300 sec: 42765.0). Total num frames: 4094361600. Throughput: 0: 42836.2. Samples: 373204020. Policy #0 lag: (min: 0.0, avg: 11.7, max: 24.0) [2024-06-28 15:05:57,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 15:05:59,806][09423] Updated weights for policy 0, policy_version 249907 (0.0028) [2024-06-28 15:06:02,921][09190] Fps is (10 sec: 42599.0, 60 sec: 43690.7, 300 sec: 42931.6). Total num frames: 4094607360. Throughput: 0: 42747.7. Samples: 373460380. Policy #0 lag: (min: 0.0, avg: 11.7, max: 24.0) [2024-06-28 15:06:02,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 15:06:03,856][09423] Updated weights for policy 0, policy_version 249917 (0.0032) [2024-06-28 15:06:07,417][09423] Updated weights for policy 0, policy_version 249927 (0.0026) [2024-06-28 15:06:07,921][09190] Fps is (10 sec: 45874.6, 60 sec: 42871.5, 300 sec: 42931.6). Total num frames: 4094820352. Throughput: 0: 42880.8. Samples: 373719780. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 15:06:07,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:06:11,362][09423] Updated weights for policy 0, policy_version 249937 (0.0037) [2024-06-28 15:06:12,921][09190] Fps is (10 sec: 37683.0, 60 sec: 42327.1, 300 sec: 42709.5). Total num frames: 4094984192. Throughput: 0: 42729.7. Samples: 373848020. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 15:06:12,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 15:06:14,988][09423] Updated weights for policy 0, policy_version 249947 (0.0045) [2024-06-28 15:06:17,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42871.5, 300 sec: 42820.6). Total num frames: 4095229952. Throughput: 0: 42826.2. Samples: 374104560. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 15:06:17,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 15:06:17,946][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000249953_4095229952.pth... [2024-06-28 15:06:18,010][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000249327_4084973568.pth [2024-06-28 15:06:19,333][09423] Updated weights for policy 0, policy_version 249957 (0.0036) [2024-06-28 15:06:22,921][09190] Fps is (10 sec: 45875.5, 60 sec: 42598.4, 300 sec: 42876.3). Total num frames: 4095442944. Throughput: 0: 42972.5. Samples: 374360720. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 15:06:22,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:06:23,048][09423] Updated weights for policy 0, policy_version 249967 (0.0035) [2024-06-28 15:06:26,779][09423] Updated weights for policy 0, policy_version 249977 (0.0031) [2024-06-28 15:06:27,921][09190] Fps is (10 sec: 42598.6, 60 sec: 43144.5, 300 sec: 42820.6). Total num frames: 4095655936. Throughput: 0: 42829.8. Samples: 374489780. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 15:06:27,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:06:30,485][09423] Updated weights for policy 0, policy_version 249987 (0.0028) [2024-06-28 15:06:32,921][09190] Fps is (10 sec: 44236.6, 60 sec: 43144.6, 300 sec: 42876.1). Total num frames: 4095885312. Throughput: 0: 43078.6. Samples: 374748460. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 15:06:32,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:06:34,264][09423] Updated weights for policy 0, policy_version 249997 (0.0028) [2024-06-28 15:06:37,924][09190] Fps is (10 sec: 42587.9, 60 sec: 42323.6, 300 sec: 42820.2). Total num frames: 4096081920. Throughput: 0: 43038.2. Samples: 375006840. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 15:06:37,924][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 15:06:38,346][09423] Updated weights for policy 0, policy_version 250007 (0.0039) [2024-06-28 15:06:42,049][09423] Updated weights for policy 0, policy_version 250017 (0.0038) [2024-06-28 15:06:42,921][09190] Fps is (10 sec: 40959.6, 60 sec: 43144.5, 300 sec: 42765.0). Total num frames: 4096294912. Throughput: 0: 42886.5. Samples: 375133920. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 15:06:42,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 15:06:45,978][09423] Updated weights for policy 0, policy_version 250027 (0.0026) [2024-06-28 15:06:47,922][09190] Fps is (10 sec: 44247.3, 60 sec: 43144.4, 300 sec: 42876.1). Total num frames: 4096524288. Throughput: 0: 42893.2. Samples: 375390580. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 15:06:47,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 15:06:49,646][09423] Updated weights for policy 0, policy_version 250037 (0.0038) [2024-06-28 15:06:51,692][09403] Signal inference workers to stop experience collection... (5250 times) [2024-06-28 15:06:51,692][09403] Signal inference workers to resume experience collection... (5250 times) [2024-06-28 15:06:51,707][09423] InferenceWorker_p0-w0: stopping experience collection (5250 times) [2024-06-28 15:06:51,707][09423] InferenceWorker_p0-w0: resuming experience collection (5250 times) [2024-06-28 15:06:52,921][09190] Fps is (10 sec: 42598.6, 60 sec: 42325.4, 300 sec: 42820.6). Total num frames: 4096720896. Throughput: 0: 42988.5. Samples: 375654260. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 15:06:52,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 15:06:53,675][09423] Updated weights for policy 0, policy_version 250047 (0.0022) [2024-06-28 15:06:57,463][09423] Updated weights for policy 0, policy_version 250057 (0.0038) [2024-06-28 15:06:57,921][09190] Fps is (10 sec: 42598.8, 60 sec: 43144.5, 300 sec: 42876.1). Total num frames: 4096950272. Throughput: 0: 42877.3. Samples: 375777500. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 15:06:57,922][09190] Avg episode reward: [(0, '0.761')] [2024-06-28 15:07:01,255][09423] Updated weights for policy 0, policy_version 250067 (0.0031) [2024-06-28 15:07:02,924][09190] Fps is (10 sec: 44225.8, 60 sec: 42596.6, 300 sec: 42875.7). Total num frames: 4097163264. Throughput: 0: 42822.1. Samples: 376031660. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 15:07:02,924][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 15:07:05,198][09423] Updated weights for policy 0, policy_version 250077 (0.0026) [2024-06-28 15:07:07,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42598.5, 300 sec: 42876.4). Total num frames: 4097376256. Throughput: 0: 42964.4. Samples: 376294120. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 15:07:07,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 15:07:08,904][09423] Updated weights for policy 0, policy_version 250087 (0.0032) [2024-06-28 15:07:12,605][09423] Updated weights for policy 0, policy_version 250097 (0.0029) [2024-06-28 15:07:12,922][09190] Fps is (10 sec: 44247.2, 60 sec: 43690.6, 300 sec: 42820.6). Total num frames: 4097605632. Throughput: 0: 42828.7. Samples: 376417080. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 15:07:12,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 15:07:16,486][09423] Updated weights for policy 0, policy_version 250107 (0.0041) [2024-06-28 15:07:17,922][09190] Fps is (10 sec: 45874.6, 60 sec: 43417.6, 300 sec: 43042.7). Total num frames: 4097835008. Throughput: 0: 43054.5. Samples: 376685920. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-28 15:07:17,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 15:07:20,041][09423] Updated weights for policy 0, policy_version 250117 (0.0037) [2024-06-28 15:07:22,921][09190] Fps is (10 sec: 39322.3, 60 sec: 42598.4, 300 sec: 42876.1). Total num frames: 4097998848. Throughput: 0: 43098.8. Samples: 376946180. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-28 15:07:22,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 15:07:24,006][09423] Updated weights for policy 0, policy_version 250127 (0.0049) [2024-06-28 15:07:27,921][09190] Fps is (10 sec: 39322.4, 60 sec: 42871.5, 300 sec: 42820.6). Total num frames: 4098228224. Throughput: 0: 43076.6. Samples: 377072360. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-28 15:07:27,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 15:07:28,034][09423] Updated weights for policy 0, policy_version 250137 (0.0028) [2024-06-28 15:07:31,971][09423] Updated weights for policy 0, policy_version 250147 (0.0048) [2024-06-28 15:07:32,921][09190] Fps is (10 sec: 45875.0, 60 sec: 42871.4, 300 sec: 42932.0). Total num frames: 4098457600. Throughput: 0: 43026.3. Samples: 377326760. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-28 15:07:32,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 15:07:35,498][09423] Updated weights for policy 0, policy_version 250157 (0.0041) [2024-06-28 15:07:37,921][09190] Fps is (10 sec: 40960.1, 60 sec: 42600.2, 300 sec: 42765.4). Total num frames: 4098637824. Throughput: 0: 42824.6. Samples: 377581360. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-28 15:07:37,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 15:07:39,419][09423] Updated weights for policy 0, policy_version 250167 (0.0039) [2024-06-28 15:07:42,924][09190] Fps is (10 sec: 40950.0, 60 sec: 42869.7, 300 sec: 42764.7). Total num frames: 4098867200. Throughput: 0: 42785.7. Samples: 377702960. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-28 15:07:42,924][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 15:07:43,425][09423] Updated weights for policy 0, policy_version 250177 (0.0037) [2024-06-28 15:07:47,323][09423] Updated weights for policy 0, policy_version 250187 (0.0033) [2024-06-28 15:07:47,921][09190] Fps is (10 sec: 44236.6, 60 sec: 42598.5, 300 sec: 42876.1). Total num frames: 4099080192. Throughput: 0: 42956.7. Samples: 377964600. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-28 15:07:47,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 15:07:50,719][09423] Updated weights for policy 0, policy_version 250197 (0.0030) [2024-06-28 15:07:52,922][09190] Fps is (10 sec: 42608.4, 60 sec: 42871.4, 300 sec: 42765.0). Total num frames: 4099293184. Throughput: 0: 43003.9. Samples: 378229300. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-28 15:07:52,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 15:07:54,675][09423] Updated weights for policy 0, policy_version 250207 (0.0028) [2024-06-28 15:07:57,921][09190] Fps is (10 sec: 44236.8, 60 sec: 42871.5, 300 sec: 42820.6). Total num frames: 4099522560. Throughput: 0: 43017.5. Samples: 378352860. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-28 15:07:57,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:07:58,170][09423] Updated weights for policy 0, policy_version 250217 (0.0029) [2024-06-28 15:08:02,380][09423] Updated weights for policy 0, policy_version 250227 (0.0032) [2024-06-28 15:08:02,921][09190] Fps is (10 sec: 45875.6, 60 sec: 43146.3, 300 sec: 42987.2). Total num frames: 4099751936. Throughput: 0: 42849.0. Samples: 378614120. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-28 15:08:02,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 15:08:05,944][09423] Updated weights for policy 0, policy_version 250237 (0.0036) [2024-06-28 15:08:07,921][09190] Fps is (10 sec: 42597.9, 60 sec: 42871.4, 300 sec: 42820.5). Total num frames: 4099948544. Throughput: 0: 42810.6. Samples: 378872660. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-28 15:08:07,922][09190] Avg episode reward: [(0, '0.716')] [2024-06-28 15:08:10,161][09423] Updated weights for policy 0, policy_version 250247 (0.0040) [2024-06-28 15:08:12,921][09190] Fps is (10 sec: 40960.7, 60 sec: 42598.6, 300 sec: 42876.1). Total num frames: 4100161536. Throughput: 0: 42630.7. Samples: 378990740. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-28 15:08:12,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 15:08:13,558][09423] Updated weights for policy 0, policy_version 250257 (0.0029) [2024-06-28 15:08:17,641][09423] Updated weights for policy 0, policy_version 250267 (0.0031) [2024-06-28 15:08:17,921][09190] Fps is (10 sec: 42598.8, 60 sec: 42325.5, 300 sec: 42876.1). Total num frames: 4100374528. Throughput: 0: 42853.0. Samples: 379255140. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-28 15:08:17,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 15:08:17,941][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000250268_4100390912.pth... [2024-06-28 15:08:17,997][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000249639_4090085376.pth [2024-06-28 15:08:21,357][09423] Updated weights for policy 0, policy_version 250277 (0.0034) [2024-06-28 15:08:22,921][09190] Fps is (10 sec: 42598.0, 60 sec: 43144.5, 300 sec: 42820.6). Total num frames: 4100587520. Throughput: 0: 42763.5. Samples: 379505720. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-28 15:08:22,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 15:08:25,395][09423] Updated weights for policy 0, policy_version 250287 (0.0024) [2024-06-28 15:08:26,041][09403] Signal inference workers to stop experience collection... (5300 times) [2024-06-28 15:08:26,059][09423] InferenceWorker_p0-w0: stopping experience collection (5300 times) [2024-06-28 15:08:26,097][09403] Signal inference workers to resume experience collection... (5300 times) [2024-06-28 15:08:26,098][09423] InferenceWorker_p0-w0: resuming experience collection (5300 times) [2024-06-28 15:08:27,923][09190] Fps is (10 sec: 44229.7, 60 sec: 43143.4, 300 sec: 42931.4). Total num frames: 4100816896. Throughput: 0: 43006.2. Samples: 379638200. Policy #0 lag: (min: 1.0, avg: 11.0, max: 24.0) [2024-06-28 15:08:27,923][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 15:08:28,643][09423] Updated weights for policy 0, policy_version 250297 (0.0026) [2024-06-28 15:08:32,921][09190] Fps is (10 sec: 42598.2, 60 sec: 42598.4, 300 sec: 42820.6). Total num frames: 4101013504. Throughput: 0: 42986.1. Samples: 379898980. Policy #0 lag: (min: 1.0, avg: 11.0, max: 24.0) [2024-06-28 15:08:32,923][09190] Avg episode reward: [(0, '0.759')] [2024-06-28 15:08:33,210][09423] Updated weights for policy 0, policy_version 250307 (0.0037) [2024-06-28 15:08:36,529][09423] Updated weights for policy 0, policy_version 250317 (0.0032) [2024-06-28 15:08:37,921][09190] Fps is (10 sec: 42604.8, 60 sec: 43417.5, 300 sec: 42820.6). Total num frames: 4101242880. Throughput: 0: 42648.1. Samples: 380148460. Policy #0 lag: (min: 1.0, avg: 11.0, max: 24.0) [2024-06-28 15:08:37,922][09190] Avg episode reward: [(0, '0.756')] [2024-06-28 15:08:40,670][09423] Updated weights for policy 0, policy_version 250327 (0.0038) [2024-06-28 15:08:42,921][09190] Fps is (10 sec: 44236.9, 60 sec: 43146.3, 300 sec: 42931.6). Total num frames: 4101455872. Throughput: 0: 42877.3. Samples: 380282340. Policy #0 lag: (min: 1.0, avg: 11.0, max: 24.0) [2024-06-28 15:08:42,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:08:44,112][09423] Updated weights for policy 0, policy_version 250337 (0.0043) [2024-06-28 15:08:47,921][09190] Fps is (10 sec: 42598.7, 60 sec: 43144.5, 300 sec: 42876.1). Total num frames: 4101668864. Throughput: 0: 42998.7. Samples: 380549060. Policy #0 lag: (min: 1.0, avg: 11.0, max: 24.0) [2024-06-28 15:08:47,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 15:08:47,931][09423] Updated weights for policy 0, policy_version 250347 (0.0035) [2024-06-28 15:08:51,645][09423] Updated weights for policy 0, policy_version 250357 (0.0041) [2024-06-28 15:08:52,921][09190] Fps is (10 sec: 42598.4, 60 sec: 43144.6, 300 sec: 42820.5). Total num frames: 4101881856. Throughput: 0: 42874.3. Samples: 380802000. Policy #0 lag: (min: 1.0, avg: 11.0, max: 24.0) [2024-06-28 15:08:52,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:08:55,818][09423] Updated weights for policy 0, policy_version 250367 (0.0037) [2024-06-28 15:08:57,921][09190] Fps is (10 sec: 42598.2, 60 sec: 42871.4, 300 sec: 42820.6). Total num frames: 4102094848. Throughput: 0: 43020.8. Samples: 380926680. Policy #0 lag: (min: 1.0, avg: 11.0, max: 24.0) [2024-06-28 15:08:57,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 15:08:59,303][09423] Updated weights for policy 0, policy_version 250377 (0.0041) [2024-06-28 15:09:02,921][09190] Fps is (10 sec: 40960.1, 60 sec: 42325.4, 300 sec: 42765.0). Total num frames: 4102291456. Throughput: 0: 42865.3. Samples: 381184080. Policy #0 lag: (min: 1.0, avg: 11.0, max: 24.0) [2024-06-28 15:09:02,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 15:09:03,252][09423] Updated weights for policy 0, policy_version 250387 (0.0036) [2024-06-28 15:09:06,835][09423] Updated weights for policy 0, policy_version 250397 (0.0037) [2024-06-28 15:09:07,921][09190] Fps is (10 sec: 44237.0, 60 sec: 43144.6, 300 sec: 42876.1). Total num frames: 4102537216. Throughput: 0: 43096.9. Samples: 381445080. Policy #0 lag: (min: 1.0, avg: 11.0, max: 24.0) [2024-06-28 15:09:07,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:09:11,256][09423] Updated weights for policy 0, policy_version 250407 (0.0031) [2024-06-28 15:09:12,921][09190] Fps is (10 sec: 45874.9, 60 sec: 43144.4, 300 sec: 42987.2). Total num frames: 4102750208. Throughput: 0: 43070.8. Samples: 381576320. Policy #0 lag: (min: 1.0, avg: 11.0, max: 24.0) [2024-06-28 15:09:12,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 15:09:14,566][09423] Updated weights for policy 0, policy_version 250417 (0.0028) [2024-06-28 15:09:17,921][09190] Fps is (10 sec: 39321.7, 60 sec: 42598.4, 300 sec: 42765.0). Total num frames: 4102930432. Throughput: 0: 42993.0. Samples: 381833660. Policy #0 lag: (min: 1.0, avg: 11.0, max: 24.0) [2024-06-28 15:09:17,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:09:18,579][09423] Updated weights for policy 0, policy_version 250427 (0.0025) [2024-06-28 15:09:21,969][09423] Updated weights for policy 0, policy_version 250437 (0.0039) [2024-06-28 15:09:22,921][09190] Fps is (10 sec: 42598.5, 60 sec: 43144.5, 300 sec: 42877.1). Total num frames: 4103176192. Throughput: 0: 43164.5. Samples: 382090860. Policy #0 lag: (min: 1.0, avg: 11.0, max: 24.0) [2024-06-28 15:09:22,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 15:09:26,190][09423] Updated weights for policy 0, policy_version 250447 (0.0038) [2024-06-28 15:09:27,922][09190] Fps is (10 sec: 45874.4, 60 sec: 42872.5, 300 sec: 42931.6). Total num frames: 4103389184. Throughput: 0: 43049.7. Samples: 382219580. Policy #0 lag: (min: 1.0, avg: 11.0, max: 24.0) [2024-06-28 15:09:27,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 15:09:29,973][09423] Updated weights for policy 0, policy_version 250457 (0.0033) [2024-06-28 15:09:32,921][09190] Fps is (10 sec: 39321.4, 60 sec: 42598.4, 300 sec: 42709.5). Total num frames: 4103569408. Throughput: 0: 42783.5. Samples: 382474320. Policy #0 lag: (min: 1.0, avg: 11.0, max: 24.0) [2024-06-28 15:09:32,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 15:09:33,923][09423] Updated weights for policy 0, policy_version 250467 (0.0035) [2024-06-28 15:09:37,518][09423] Updated weights for policy 0, policy_version 250477 (0.0032) [2024-06-28 15:09:37,921][09190] Fps is (10 sec: 44237.1, 60 sec: 43144.5, 300 sec: 42987.2). Total num frames: 4103831552. Throughput: 0: 42624.4. Samples: 382720100. Policy #0 lag: (min: 0.0, avg: 14.0, max: 28.0) [2024-06-28 15:09:37,926][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:09:41,637][09423] Updated weights for policy 0, policy_version 250487 (0.0044) [2024-06-28 15:09:42,921][09190] Fps is (10 sec: 47514.3, 60 sec: 43144.6, 300 sec: 42987.2). Total num frames: 4104044544. Throughput: 0: 43017.9. Samples: 382862480. Policy #0 lag: (min: 0.0, avg: 14.0, max: 28.0) [2024-06-28 15:09:42,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:09:44,807][09403] Signal inference workers to stop experience collection... (5350 times) [2024-06-28 15:09:44,809][09403] Signal inference workers to resume experience collection... (5350 times) [2024-06-28 15:09:44,822][09423] InferenceWorker_p0-w0: stopping experience collection (5350 times) [2024-06-28 15:09:44,837][09423] InferenceWorker_p0-w0: resuming experience collection (5350 times) [2024-06-28 15:09:45,135][09423] Updated weights for policy 0, policy_version 250497 (0.0028) [2024-06-28 15:09:47,921][09190] Fps is (10 sec: 37683.7, 60 sec: 42325.4, 300 sec: 42654.1). Total num frames: 4104208384. Throughput: 0: 42882.3. Samples: 383113780. Policy #0 lag: (min: 0.0, avg: 14.0, max: 28.0) [2024-06-28 15:09:47,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 15:09:49,207][09423] Updated weights for policy 0, policy_version 250507 (0.0023) [2024-06-28 15:09:52,630][09423] Updated weights for policy 0, policy_version 250517 (0.0042) [2024-06-28 15:09:52,921][09190] Fps is (10 sec: 42597.7, 60 sec: 43144.5, 300 sec: 42987.3). Total num frames: 4104470528. Throughput: 0: 42687.0. Samples: 383366000. Policy #0 lag: (min: 0.0, avg: 14.0, max: 28.0) [2024-06-28 15:09:52,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 15:09:56,901][09423] Updated weights for policy 0, policy_version 250527 (0.0057) [2024-06-28 15:09:57,921][09190] Fps is (10 sec: 47512.8, 60 sec: 43144.5, 300 sec: 43042.7). Total num frames: 4104683520. Throughput: 0: 42947.1. Samples: 383508940. Policy #0 lag: (min: 0.0, avg: 14.0, max: 28.0) [2024-06-28 15:09:57,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 15:10:00,569][09423] Updated weights for policy 0, policy_version 250537 (0.0030) [2024-06-28 15:10:02,921][09190] Fps is (10 sec: 39321.9, 60 sec: 42871.5, 300 sec: 42765.0). Total num frames: 4104863744. Throughput: 0: 42716.8. Samples: 383755920. Policy #0 lag: (min: 0.0, avg: 14.0, max: 28.0) [2024-06-28 15:10:02,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:10:04,521][09423] Updated weights for policy 0, policy_version 250547 (0.0036) [2024-06-28 15:10:07,921][09190] Fps is (10 sec: 42599.1, 60 sec: 42871.5, 300 sec: 42932.0). Total num frames: 4105109504. Throughput: 0: 42730.3. Samples: 384013720. Policy #0 lag: (min: 0.0, avg: 14.0, max: 28.0) [2024-06-28 15:10:07,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 15:10:08,062][09423] Updated weights for policy 0, policy_version 250557 (0.0036) [2024-06-28 15:10:12,055][09423] Updated weights for policy 0, policy_version 250567 (0.0033) [2024-06-28 15:10:12,921][09190] Fps is (10 sec: 47513.8, 60 sec: 43144.6, 300 sec: 42987.2). Total num frames: 4105338880. Throughput: 0: 42820.2. Samples: 384146480. Policy #0 lag: (min: 0.0, avg: 14.0, max: 28.0) [2024-06-28 15:10:12,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 15:10:15,998][09423] Updated weights for policy 0, policy_version 250577 (0.0030) [2024-06-28 15:10:17,921][09190] Fps is (10 sec: 40959.5, 60 sec: 43144.5, 300 sec: 42820.5). Total num frames: 4105519104. Throughput: 0: 42761.8. Samples: 384398600. Policy #0 lag: (min: 0.0, avg: 14.0, max: 28.0) [2024-06-28 15:10:17,922][09190] Avg episode reward: [(0, '0.735')] [2024-06-28 15:10:17,928][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000250581_4105519104.pth... [2024-06-28 15:10:17,989][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000249953_4095229952.pth [2024-06-28 15:10:19,957][09423] Updated weights for policy 0, policy_version 250587 (0.0038) [2024-06-28 15:10:22,921][09190] Fps is (10 sec: 40959.8, 60 sec: 42871.5, 300 sec: 42987.2). Total num frames: 4105748480. Throughput: 0: 43014.3. Samples: 384655740. Policy #0 lag: (min: 0.0, avg: 14.0, max: 28.0) [2024-06-28 15:10:22,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:10:23,538][09423] Updated weights for policy 0, policy_version 250597 (0.0036) [2024-06-28 15:10:27,554][09423] Updated weights for policy 0, policy_version 250607 (0.0041) [2024-06-28 15:10:27,921][09190] Fps is (10 sec: 44237.5, 60 sec: 42871.6, 300 sec: 42931.7). Total num frames: 4105961472. Throughput: 0: 42879.1. Samples: 384792040. Policy #0 lag: (min: 0.0, avg: 14.0, max: 28.0) [2024-06-28 15:10:27,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:10:31,197][09423] Updated weights for policy 0, policy_version 250617 (0.0036) [2024-06-28 15:10:32,922][09190] Fps is (10 sec: 42597.9, 60 sec: 43417.5, 300 sec: 42820.5). Total num frames: 4106174464. Throughput: 0: 42849.6. Samples: 385042020. Policy #0 lag: (min: 0.0, avg: 14.0, max: 28.0) [2024-06-28 15:10:32,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 15:10:35,102][09423] Updated weights for policy 0, policy_version 250627 (0.0032) [2024-06-28 15:10:37,921][09190] Fps is (10 sec: 42597.7, 60 sec: 42598.4, 300 sec: 42987.2). Total num frames: 4106387456. Throughput: 0: 43007.2. Samples: 385301320. Policy #0 lag: (min: 0.0, avg: 14.0, max: 28.0) [2024-06-28 15:10:37,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:10:38,728][09423] Updated weights for policy 0, policy_version 250637 (0.0035) [2024-06-28 15:10:42,677][09423] Updated weights for policy 0, policy_version 250647 (0.0040) [2024-06-28 15:10:42,921][09190] Fps is (10 sec: 44237.6, 60 sec: 42871.4, 300 sec: 42987.2). Total num frames: 4106616832. Throughput: 0: 42744.1. Samples: 385432420. Policy #0 lag: (min: 0.0, avg: 14.0, max: 28.0) [2024-06-28 15:10:42,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 15:10:46,615][09423] Updated weights for policy 0, policy_version 250657 (0.0037) [2024-06-28 15:10:47,921][09190] Fps is (10 sec: 42598.8, 60 sec: 43417.6, 300 sec: 42820.6). Total num frames: 4106813440. Throughput: 0: 42784.0. Samples: 385681200. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 15:10:47,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 15:10:50,363][09423] Updated weights for policy 0, policy_version 250667 (0.0045) [2024-06-28 15:10:52,922][09190] Fps is (10 sec: 39321.1, 60 sec: 42325.3, 300 sec: 42876.1). Total num frames: 4107010048. Throughput: 0: 42747.4. Samples: 385937360. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 15:10:52,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 15:10:54,283][09423] Updated weights for policy 0, policy_version 250677 (0.0037) [2024-06-28 15:10:57,922][09190] Fps is (10 sec: 40959.0, 60 sec: 42325.3, 300 sec: 42765.0). Total num frames: 4107223040. Throughput: 0: 42597.1. Samples: 386063360. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 15:10:57,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 15:10:58,538][09423] Updated weights for policy 0, policy_version 250687 (0.0034) [2024-06-28 15:11:01,803][09423] Updated weights for policy 0, policy_version 250697 (0.0029) [2024-06-28 15:11:02,921][09190] Fps is (10 sec: 44237.4, 60 sec: 43144.6, 300 sec: 42820.6). Total num frames: 4107452416. Throughput: 0: 42597.4. Samples: 386315480. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 15:11:02,922][09190] Avg episode reward: [(0, '0.734')] [2024-06-28 15:11:05,195][09403] Signal inference workers to stop experience collection... (5400 times) [2024-06-28 15:11:05,240][09403] Signal inference workers to resume experience collection... (5400 times) [2024-06-28 15:11:05,241][09423] InferenceWorker_p0-w0: stopping experience collection (5400 times) [2024-06-28 15:11:05,257][09423] InferenceWorker_p0-w0: resuming experience collection (5400 times) [2024-06-28 15:11:06,213][09423] Updated weights for policy 0, policy_version 250707 (0.0030) [2024-06-28 15:11:07,921][09190] Fps is (10 sec: 44237.5, 60 sec: 42598.3, 300 sec: 42987.2). Total num frames: 4107665408. Throughput: 0: 42626.2. Samples: 386573920. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 15:11:07,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 15:11:09,504][09423] Updated weights for policy 0, policy_version 250717 (0.0033) [2024-06-28 15:11:12,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42052.3, 300 sec: 42820.6). Total num frames: 4107862016. Throughput: 0: 42576.9. Samples: 386708000. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 15:11:12,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:11:13,675][09423] Updated weights for policy 0, policy_version 250727 (0.0033) [2024-06-28 15:11:17,127][09423] Updated weights for policy 0, policy_version 250737 (0.0036) [2024-06-28 15:11:17,921][09190] Fps is (10 sec: 44236.8, 60 sec: 43144.5, 300 sec: 42931.6). Total num frames: 4108107776. Throughput: 0: 42678.7. Samples: 386962560. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 15:11:17,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 15:11:21,241][09423] Updated weights for policy 0, policy_version 250747 (0.0034) [2024-06-28 15:11:22,921][09190] Fps is (10 sec: 44236.0, 60 sec: 42598.4, 300 sec: 42876.1). Total num frames: 4108304384. Throughput: 0: 42642.6. Samples: 387220240. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 15:11:22,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:11:24,827][09423] Updated weights for policy 0, policy_version 250757 (0.0033) [2024-06-28 15:11:27,921][09190] Fps is (10 sec: 37683.6, 60 sec: 42052.2, 300 sec: 42709.5). Total num frames: 4108484608. Throughput: 0: 42533.4. Samples: 387346420. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 15:11:27,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 15:11:29,068][09423] Updated weights for policy 0, policy_version 250767 (0.0033) [2024-06-28 15:11:32,248][09423] Updated weights for policy 0, policy_version 250777 (0.0037) [2024-06-28 15:11:32,921][09190] Fps is (10 sec: 44237.1, 60 sec: 42871.5, 300 sec: 42932.0). Total num frames: 4108746752. Throughput: 0: 42752.4. Samples: 387605060. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 15:11:32,926][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 15:11:36,886][09423] Updated weights for policy 0, policy_version 250787 (0.0043) [2024-06-28 15:11:37,921][09190] Fps is (10 sec: 47513.1, 60 sec: 42871.5, 300 sec: 42931.6). Total num frames: 4108959744. Throughput: 0: 42822.3. Samples: 387864360. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 15:11:37,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:11:39,555][09423] Updated weights for policy 0, policy_version 250797 (0.0029) [2024-06-28 15:11:42,921][09190] Fps is (10 sec: 39321.3, 60 sec: 42052.2, 300 sec: 42765.0). Total num frames: 4109139968. Throughput: 0: 42855.7. Samples: 387991860. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 15:11:42,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 15:11:44,568][09423] Updated weights for policy 0, policy_version 250807 (0.0039) [2024-06-28 15:11:47,385][09423] Updated weights for policy 0, policy_version 250817 (0.0028) [2024-06-28 15:11:47,921][09190] Fps is (10 sec: 42598.9, 60 sec: 42871.5, 300 sec: 42931.6). Total num frames: 4109385728. Throughput: 0: 42904.5. Samples: 388246180. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 15:11:47,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 15:11:51,966][09423] Updated weights for policy 0, policy_version 250827 (0.0030) [2024-06-28 15:11:52,921][09190] Fps is (10 sec: 45875.4, 60 sec: 43144.6, 300 sec: 42876.1). Total num frames: 4109598720. Throughput: 0: 42975.5. Samples: 388507820. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 15:11:52,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 15:11:55,225][09423] Updated weights for policy 0, policy_version 250837 (0.0040) [2024-06-28 15:11:57,923][09190] Fps is (10 sec: 39315.5, 60 sec: 42597.5, 300 sec: 42765.2). Total num frames: 4109778944. Throughput: 0: 42770.0. Samples: 388632720. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 15:11:57,923][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 15:11:59,492][09423] Updated weights for policy 0, policy_version 250847 (0.0044) [2024-06-28 15:12:02,921][09190] Fps is (10 sec: 42598.6, 60 sec: 42871.4, 300 sec: 42876.1). Total num frames: 4110024704. Throughput: 0: 42906.3. Samples: 388893340. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 15:12:02,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:12:02,989][09423] Updated weights for policy 0, policy_version 250857 (0.0033) [2024-06-28 15:12:07,266][09423] Updated weights for policy 0, policy_version 250867 (0.0042) [2024-06-28 15:12:07,921][09190] Fps is (10 sec: 45881.6, 60 sec: 42871.4, 300 sec: 42820.6). Total num frames: 4110237696. Throughput: 0: 42878.7. Samples: 389149780. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 15:12:07,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 15:12:10,477][09423] Updated weights for policy 0, policy_version 250877 (0.0027) [2024-06-28 15:12:12,921][09190] Fps is (10 sec: 39321.9, 60 sec: 42598.4, 300 sec: 42654.0). Total num frames: 4110417920. Throughput: 0: 42820.0. Samples: 389273320. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 15:12:12,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 15:12:14,740][09423] Updated weights for policy 0, policy_version 250887 (0.0039) [2024-06-28 15:12:17,834][09423] Updated weights for policy 0, policy_version 250897 (0.0033) [2024-06-28 15:12:17,922][09190] Fps is (10 sec: 45875.0, 60 sec: 43144.5, 300 sec: 43042.7). Total num frames: 4110696448. Throughput: 0: 42896.3. Samples: 389535400. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 15:12:17,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 15:12:17,949][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000250897_4110696448.pth... [2024-06-28 15:12:18,011][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000250268_4100390912.pth [2024-06-28 15:12:22,722][09423] Updated weights for policy 0, policy_version 250907 (0.0045) [2024-06-28 15:12:22,921][09190] Fps is (10 sec: 44236.6, 60 sec: 42598.5, 300 sec: 42820.5). Total num frames: 4110860288. Throughput: 0: 43012.9. Samples: 389799940. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 15:12:22,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 15:12:25,540][09423] Updated weights for policy 0, policy_version 250917 (0.0042) [2024-06-28 15:12:27,921][09190] Fps is (10 sec: 39321.9, 60 sec: 43417.5, 300 sec: 42820.6). Total num frames: 4111089664. Throughput: 0: 42845.4. Samples: 389919900. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 15:12:27,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:12:30,106][09423] Updated weights for policy 0, policy_version 250927 (0.0032) [2024-06-28 15:12:32,922][09190] Fps is (10 sec: 45874.7, 60 sec: 42871.4, 300 sec: 42987.1). Total num frames: 4111319040. Throughput: 0: 43078.5. Samples: 390184720. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 15:12:32,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 15:12:33,149][09423] Updated weights for policy 0, policy_version 250937 (0.0031) [2024-06-28 15:12:35,536][09403] Signal inference workers to stop experience collection... (5450 times) [2024-06-28 15:12:35,542][09403] Signal inference workers to resume experience collection... (5450 times) [2024-06-28 15:12:35,583][09423] InferenceWorker_p0-w0: stopping experience collection (5450 times) [2024-06-28 15:12:35,583][09423] InferenceWorker_p0-w0: resuming experience collection (5450 times) [2024-06-28 15:12:37,656][09423] Updated weights for policy 0, policy_version 250947 (0.0036) [2024-06-28 15:12:37,921][09190] Fps is (10 sec: 44237.3, 60 sec: 42871.6, 300 sec: 42932.0). Total num frames: 4111532032. Throughput: 0: 43097.9. Samples: 390447220. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 15:12:37,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 15:12:40,844][09423] Updated weights for policy 0, policy_version 250957 (0.0036) [2024-06-28 15:12:42,921][09190] Fps is (10 sec: 40960.4, 60 sec: 43144.6, 300 sec: 42876.1). Total num frames: 4111728640. Throughput: 0: 43197.4. Samples: 390576540. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 15:12:42,922][09190] Avg episode reward: [(0, '0.738')] [2024-06-28 15:12:45,038][09423] Updated weights for policy 0, policy_version 250967 (0.0041) [2024-06-28 15:12:47,921][09190] Fps is (10 sec: 42598.2, 60 sec: 42871.4, 300 sec: 42931.7). Total num frames: 4111958016. Throughput: 0: 43140.9. Samples: 390834680. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 15:12:47,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 15:12:48,300][09423] Updated weights for policy 0, policy_version 250977 (0.0043) [2024-06-28 15:12:52,907][09423] Updated weights for policy 0, policy_version 250987 (0.0029) [2024-06-28 15:12:52,922][09190] Fps is (10 sec: 44236.5, 60 sec: 42871.4, 300 sec: 42876.1). Total num frames: 4112171008. Throughput: 0: 43480.9. Samples: 391106420. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 15:12:52,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 15:12:55,750][09423] Updated weights for policy 0, policy_version 250997 (0.0029) [2024-06-28 15:12:57,921][09190] Fps is (10 sec: 42598.1, 60 sec: 43418.6, 300 sec: 42820.6). Total num frames: 4112384000. Throughput: 0: 43203.9. Samples: 391217500. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 15:12:57,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 15:13:00,627][09423] Updated weights for policy 0, policy_version 251007 (0.0029) [2024-06-28 15:13:02,921][09190] Fps is (10 sec: 44237.3, 60 sec: 43144.6, 300 sec: 42931.6). Total num frames: 4112613376. Throughput: 0: 43221.9. Samples: 391480380. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 15:13:02,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 15:13:03,526][09423] Updated weights for policy 0, policy_version 251017 (0.0036) [2024-06-28 15:13:07,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42871.5, 300 sec: 42876.1). Total num frames: 4112809984. Throughput: 0: 43141.8. Samples: 391741320. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 15:13:07,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 15:13:08,179][09423] Updated weights for policy 0, policy_version 251027 (0.0036) [2024-06-28 15:13:11,473][09423] Updated weights for policy 0, policy_version 251037 (0.0039) [2024-06-28 15:13:12,921][09190] Fps is (10 sec: 42598.5, 60 sec: 43690.7, 300 sec: 42931.6). Total num frames: 4113039360. Throughput: 0: 43209.9. Samples: 391864340. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 15:13:12,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 15:13:15,850][09423] Updated weights for policy 0, policy_version 251047 (0.0034) [2024-06-28 15:13:17,921][09190] Fps is (10 sec: 44236.5, 60 sec: 42598.4, 300 sec: 42931.6). Total num frames: 4113252352. Throughput: 0: 43064.5. Samples: 392122620. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 15:13:17,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 15:13:18,895][09423] Updated weights for policy 0, policy_version 251057 (0.0023) [2024-06-28 15:13:22,921][09190] Fps is (10 sec: 40960.0, 60 sec: 43144.6, 300 sec: 42820.8). Total num frames: 4113448960. Throughput: 0: 43158.2. Samples: 392389340. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 15:13:22,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 15:13:23,321][09423] Updated weights for policy 0, policy_version 251067 (0.0041) [2024-06-28 15:13:26,655][09423] Updated weights for policy 0, policy_version 251077 (0.0051) [2024-06-28 15:13:27,923][09190] Fps is (10 sec: 42591.4, 60 sec: 43143.3, 300 sec: 42931.4). Total num frames: 4113678336. Throughput: 0: 42965.9. Samples: 392510080. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 15:13:27,924][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 15:13:31,138][09423] Updated weights for policy 0, policy_version 251087 (0.0028) [2024-06-28 15:13:32,921][09190] Fps is (10 sec: 45875.3, 60 sec: 43144.7, 300 sec: 42931.7). Total num frames: 4113907712. Throughput: 0: 42930.3. Samples: 392766540. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 15:13:32,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 15:13:33,936][09423] Updated weights for policy 0, policy_version 251097 (0.0035) [2024-06-28 15:13:37,921][09190] Fps is (10 sec: 39328.8, 60 sec: 42325.4, 300 sec: 42765.0). Total num frames: 4114071552. Throughput: 0: 42781.1. Samples: 393031560. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 15:13:37,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 15:13:38,809][09423] Updated weights for policy 0, policy_version 251107 (0.0043) [2024-06-28 15:13:41,831][09403] Signal inference workers to stop experience collection... (5500 times) [2024-06-28 15:13:41,885][09423] InferenceWorker_p0-w0: stopping experience collection (5500 times) [2024-06-28 15:13:41,893][09403] Signal inference workers to resume experience collection... (5500 times) [2024-06-28 15:13:41,903][09423] InferenceWorker_p0-w0: resuming experience collection (5500 times) [2024-06-28 15:13:42,025][09423] Updated weights for policy 0, policy_version 251117 (0.0038) [2024-06-28 15:13:42,921][09190] Fps is (10 sec: 40959.9, 60 sec: 43144.6, 300 sec: 42876.1). Total num frames: 4114317312. Throughput: 0: 42954.8. Samples: 393150460. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 15:13:42,922][09190] Avg episode reward: [(0, '0.731')] [2024-06-28 15:13:46,369][09423] Updated weights for policy 0, policy_version 251127 (0.0034) [2024-06-28 15:13:47,921][09190] Fps is (10 sec: 47513.6, 60 sec: 43144.6, 300 sec: 42931.7). Total num frames: 4114546688. Throughput: 0: 42909.8. Samples: 393411320. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 15:13:47,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:13:49,462][09423] Updated weights for policy 0, policy_version 251137 (0.0035) [2024-06-28 15:13:52,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42598.5, 300 sec: 42820.6). Total num frames: 4114726912. Throughput: 0: 42805.0. Samples: 393667540. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 15:13:52,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 15:13:54,210][09423] Updated weights for policy 0, policy_version 251147 (0.0040) [2024-06-28 15:13:56,854][09423] Updated weights for policy 0, policy_version 251157 (0.0026) [2024-06-28 15:13:57,921][09190] Fps is (10 sec: 42598.2, 60 sec: 43144.6, 300 sec: 42987.2). Total num frames: 4114972672. Throughput: 0: 42853.3. Samples: 393792740. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 15:13:57,922][09190] Avg episode reward: [(0, '0.734')] [2024-06-28 15:14:01,710][09423] Updated weights for policy 0, policy_version 251167 (0.0033) [2024-06-28 15:14:02,921][09190] Fps is (10 sec: 47513.1, 60 sec: 43144.5, 300 sec: 42931.6). Total num frames: 4115202048. Throughput: 0: 43142.3. Samples: 394064020. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 15:14:02,922][09190] Avg episode reward: [(0, '0.734')] [2024-06-28 15:14:05,046][09423] Updated weights for policy 0, policy_version 251177 (0.0034) [2024-06-28 15:14:07,921][09190] Fps is (10 sec: 39321.5, 60 sec: 42598.4, 300 sec: 42765.0). Total num frames: 4115365888. Throughput: 0: 42781.3. Samples: 394314500. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 15:14:07,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 15:14:09,558][09423] Updated weights for policy 0, policy_version 251187 (0.0039) [2024-06-28 15:14:12,469][09423] Updated weights for policy 0, policy_version 251197 (0.0028) [2024-06-28 15:14:12,921][09190] Fps is (10 sec: 42598.4, 60 sec: 43144.5, 300 sec: 43042.7). Total num frames: 4115628032. Throughput: 0: 42702.1. Samples: 394431600. Policy #0 lag: (min: 0.0, avg: 9.1, max: 22.0) [2024-06-28 15:14:12,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 15:14:17,039][09423] Updated weights for policy 0, policy_version 251207 (0.0033) [2024-06-28 15:14:17,921][09190] Fps is (10 sec: 45875.3, 60 sec: 42871.6, 300 sec: 42876.1). Total num frames: 4115824640. Throughput: 0: 42910.6. Samples: 394697520. Policy #0 lag: (min: 1.0, avg: 10.4, max: 24.0) [2024-06-28 15:14:17,922][09190] Avg episode reward: [(0, '0.732')] [2024-06-28 15:14:17,946][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000251210_4115824640.pth... [2024-06-28 15:14:18,021][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000250581_4105519104.pth [2024-06-28 15:14:19,877][09423] Updated weights for policy 0, policy_version 251217 (0.0034) [2024-06-28 15:14:22,921][09190] Fps is (10 sec: 37683.3, 60 sec: 42598.4, 300 sec: 42765.0). Total num frames: 4116004864. Throughput: 0: 42595.9. Samples: 394948380. Policy #0 lag: (min: 1.0, avg: 10.4, max: 24.0) [2024-06-28 15:14:22,925][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 15:14:24,803][09423] Updated weights for policy 0, policy_version 251227 (0.0026) [2024-06-28 15:14:27,921][09190] Fps is (10 sec: 42598.1, 60 sec: 42872.7, 300 sec: 42987.2). Total num frames: 4116250624. Throughput: 0: 42641.3. Samples: 395069320. Policy #0 lag: (min: 1.0, avg: 10.4, max: 24.0) [2024-06-28 15:14:27,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 15:14:27,955][09423] Updated weights for policy 0, policy_version 251237 (0.0028) [2024-06-28 15:14:32,418][09423] Updated weights for policy 0, policy_version 251247 (0.0031) [2024-06-28 15:14:32,921][09190] Fps is (10 sec: 45875.1, 60 sec: 42598.3, 300 sec: 42820.6). Total num frames: 4116463616. Throughput: 0: 42809.7. Samples: 395337760. Policy #0 lag: (min: 1.0, avg: 10.4, max: 24.0) [2024-06-28 15:14:32,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 15:14:35,473][09423] Updated weights for policy 0, policy_version 251257 (0.0043) [2024-06-28 15:14:37,921][09190] Fps is (10 sec: 42598.3, 60 sec: 43417.5, 300 sec: 42820.5). Total num frames: 4116676608. Throughput: 0: 42915.9. Samples: 395598760. Policy #0 lag: (min: 1.0, avg: 10.4, max: 24.0) [2024-06-28 15:14:37,926][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:14:39,766][09423] Updated weights for policy 0, policy_version 251267 (0.0030) [2024-06-28 15:14:42,921][09190] Fps is (10 sec: 42598.8, 60 sec: 42871.5, 300 sec: 42987.2). Total num frames: 4116889600. Throughput: 0: 43012.9. Samples: 395728320. Policy #0 lag: (min: 1.0, avg: 10.4, max: 24.0) [2024-06-28 15:14:42,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 15:14:43,268][09423] Updated weights for policy 0, policy_version 251277 (0.0036) [2024-06-28 15:14:47,565][09423] Updated weights for policy 0, policy_version 251287 (0.0028) [2024-06-28 15:14:47,924][09190] Fps is (10 sec: 42587.9, 60 sec: 42596.5, 300 sec: 42820.2). Total num frames: 4117102592. Throughput: 0: 42785.6. Samples: 395989480. Policy #0 lag: (min: 1.0, avg: 10.4, max: 24.0) [2024-06-28 15:14:47,924][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 15:14:50,816][09423] Updated weights for policy 0, policy_version 251297 (0.0031) [2024-06-28 15:14:52,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42871.4, 300 sec: 42765.0). Total num frames: 4117299200. Throughput: 0: 42778.7. Samples: 396239540. Policy #0 lag: (min: 1.0, avg: 10.4, max: 24.0) [2024-06-28 15:14:52,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 15:14:55,309][09423] Updated weights for policy 0, policy_version 251307 (0.0042) [2024-06-28 15:14:57,921][09190] Fps is (10 sec: 44247.8, 60 sec: 42871.4, 300 sec: 42987.2). Total num frames: 4117544960. Throughput: 0: 42995.5. Samples: 396366400. Policy #0 lag: (min: 1.0, avg: 10.4, max: 24.0) [2024-06-28 15:14:57,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 15:14:58,273][09423] Updated weights for policy 0, policy_version 251317 (0.0040) [2024-06-28 15:15:02,795][09423] Updated weights for policy 0, policy_version 251327 (0.0027) [2024-06-28 15:15:02,924][09190] Fps is (10 sec: 44225.4, 60 sec: 42323.6, 300 sec: 42820.2). Total num frames: 4117741568. Throughput: 0: 42804.7. Samples: 396623840. Policy #0 lag: (min: 1.0, avg: 10.4, max: 24.0) [2024-06-28 15:15:02,925][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 15:15:06,254][09423] Updated weights for policy 0, policy_version 251337 (0.0031) [2024-06-28 15:15:07,921][09190] Fps is (10 sec: 40959.7, 60 sec: 43144.4, 300 sec: 42765.0). Total num frames: 4117954560. Throughput: 0: 42938.1. Samples: 396880600. Policy #0 lag: (min: 1.0, avg: 10.4, max: 24.0) [2024-06-28 15:15:07,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 15:15:10,220][09423] Updated weights for policy 0, policy_version 251347 (0.0039) [2024-06-28 15:15:12,924][09190] Fps is (10 sec: 42598.5, 60 sec: 42323.6, 300 sec: 42875.7). Total num frames: 4118167552. Throughput: 0: 43185.6. Samples: 397012780. Policy #0 lag: (min: 1.0, avg: 10.4, max: 24.0) [2024-06-28 15:15:12,924][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:15:13,087][09403] Signal inference workers to stop experience collection... (5550 times) [2024-06-28 15:15:13,139][09403] Signal inference workers to resume experience collection... (5550 times) [2024-06-28 15:15:13,140][09423] InferenceWorker_p0-w0: stopping experience collection (5550 times) [2024-06-28 15:15:13,154][09423] InferenceWorker_p0-w0: resuming experience collection (5550 times) [2024-06-28 15:15:13,669][09423] Updated weights for policy 0, policy_version 251357 (0.0024) [2024-06-28 15:15:17,823][09423] Updated weights for policy 0, policy_version 251367 (0.0030) [2024-06-28 15:15:17,924][09190] Fps is (10 sec: 44226.3, 60 sec: 42869.7, 300 sec: 42875.7). Total num frames: 4118396928. Throughput: 0: 43151.0. Samples: 397279660. Policy #0 lag: (min: 1.0, avg: 10.4, max: 24.0) [2024-06-28 15:15:17,924][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 15:15:21,123][09423] Updated weights for policy 0, policy_version 251377 (0.0027) [2024-06-28 15:15:22,924][09190] Fps is (10 sec: 44236.8, 60 sec: 43415.8, 300 sec: 42875.7). Total num frames: 4118609920. Throughput: 0: 42928.8. Samples: 397530660. Policy #0 lag: (min: 1.0, avg: 10.4, max: 24.0) [2024-06-28 15:15:22,924][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 15:15:25,207][09423] Updated weights for policy 0, policy_version 251387 (0.0033) [2024-06-28 15:15:27,921][09190] Fps is (10 sec: 44247.5, 60 sec: 43144.5, 300 sec: 42931.6). Total num frames: 4118839296. Throughput: 0: 42909.2. Samples: 397659240. Policy #0 lag: (min: 1.0, avg: 10.4, max: 24.0) [2024-06-28 15:15:27,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 15:15:29,017][09423] Updated weights for policy 0, policy_version 251397 (0.0031) [2024-06-28 15:15:32,921][09190] Fps is (10 sec: 40970.2, 60 sec: 42598.4, 300 sec: 42820.6). Total num frames: 4119019520. Throughput: 0: 42822.4. Samples: 397916380. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 15:15:32,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:15:33,198][09423] Updated weights for policy 0, policy_version 251407 (0.0033) [2024-06-28 15:15:36,849][09423] Updated weights for policy 0, policy_version 251417 (0.0030) [2024-06-28 15:15:37,921][09190] Fps is (10 sec: 40959.8, 60 sec: 42871.4, 300 sec: 42820.5). Total num frames: 4119248896. Throughput: 0: 42817.6. Samples: 398166340. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 15:15:37,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 15:15:40,896][09423] Updated weights for policy 0, policy_version 251427 (0.0034) [2024-06-28 15:15:42,921][09190] Fps is (10 sec: 45875.3, 60 sec: 43144.5, 300 sec: 42931.6). Total num frames: 4119478272. Throughput: 0: 42826.3. Samples: 398293580. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 15:15:42,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 15:15:44,329][09423] Updated weights for policy 0, policy_version 251437 (0.0026) [2024-06-28 15:15:47,921][09190] Fps is (10 sec: 42598.9, 60 sec: 42873.3, 300 sec: 42931.6). Total num frames: 4119674880. Throughput: 0: 42938.8. Samples: 398555980. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 15:15:47,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 15:15:48,430][09423] Updated weights for policy 0, policy_version 251447 (0.0032) [2024-06-28 15:15:51,813][09423] Updated weights for policy 0, policy_version 251457 (0.0022) [2024-06-28 15:15:52,921][09190] Fps is (10 sec: 42598.2, 60 sec: 43417.6, 300 sec: 42987.2). Total num frames: 4119904256. Throughput: 0: 42989.0. Samples: 398815100. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 15:15:52,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 15:15:55,947][09423] Updated weights for policy 0, policy_version 251467 (0.0027) [2024-06-28 15:15:57,921][09190] Fps is (10 sec: 45875.1, 60 sec: 43144.5, 300 sec: 42987.2). Total num frames: 4120133632. Throughput: 0: 42955.7. Samples: 398945680. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 15:15:57,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 15:15:59,477][09423] Updated weights for policy 0, policy_version 251477 (0.0041) [2024-06-28 15:16:02,921][09190] Fps is (10 sec: 40960.4, 60 sec: 42873.3, 300 sec: 42876.1). Total num frames: 4120313856. Throughput: 0: 42714.0. Samples: 399201680. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 15:16:02,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 15:16:03,577][09423] Updated weights for policy 0, policy_version 251487 (0.0032) [2024-06-28 15:16:07,235][09423] Updated weights for policy 0, policy_version 251497 (0.0031) [2024-06-28 15:16:07,921][09190] Fps is (10 sec: 40960.1, 60 sec: 43144.6, 300 sec: 42987.2). Total num frames: 4120543232. Throughput: 0: 42692.6. Samples: 399451720. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 15:16:07,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 15:16:11,434][09423] Updated weights for policy 0, policy_version 251507 (0.0034) [2024-06-28 15:16:12,921][09190] Fps is (10 sec: 44236.8, 60 sec: 43146.4, 300 sec: 42876.1). Total num frames: 4120756224. Throughput: 0: 42782.4. Samples: 399584440. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 15:16:12,930][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:16:14,643][09423] Updated weights for policy 0, policy_version 251517 (0.0032) [2024-06-28 15:16:17,921][09190] Fps is (10 sec: 39322.0, 60 sec: 42327.1, 300 sec: 42820.6). Total num frames: 4120936448. Throughput: 0: 42668.1. Samples: 399836440. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 15:16:17,930][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 15:16:17,981][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000251523_4120952832.pth... [2024-06-28 15:16:18,025][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000250897_4110696448.pth [2024-06-28 15:16:18,945][09423] Updated weights for policy 0, policy_version 251527 (0.0028) [2024-06-28 15:16:22,625][09423] Updated weights for policy 0, policy_version 251537 (0.0029) [2024-06-28 15:16:22,921][09190] Fps is (10 sec: 42598.1, 60 sec: 42873.3, 300 sec: 43042.7). Total num frames: 4121182208. Throughput: 0: 42729.9. Samples: 400089180. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 15:16:22,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:16:26,566][09423] Updated weights for policy 0, policy_version 251547 (0.0039) [2024-06-28 15:16:27,921][09190] Fps is (10 sec: 47513.2, 60 sec: 42871.5, 300 sec: 42931.6). Total num frames: 4121411584. Throughput: 0: 42991.1. Samples: 400228180. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 15:16:27,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:16:30,044][09423] Updated weights for policy 0, policy_version 251557 (0.0030) [2024-06-28 15:16:32,921][09190] Fps is (10 sec: 40960.3, 60 sec: 42871.5, 300 sec: 42820.6). Total num frames: 4121591808. Throughput: 0: 42963.7. Samples: 400489340. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 15:16:32,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 15:16:34,216][09423] Updated weights for policy 0, policy_version 251567 (0.0028) [2024-06-28 15:16:37,415][09423] Updated weights for policy 0, policy_version 251577 (0.0031) [2024-06-28 15:16:37,922][09190] Fps is (10 sec: 42598.0, 60 sec: 43144.5, 300 sec: 43042.7). Total num frames: 4121837568. Throughput: 0: 42830.1. Samples: 400742460. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 15:16:37,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:16:41,797][09423] Updated weights for policy 0, policy_version 251587 (0.0043) [2024-06-28 15:16:42,921][09190] Fps is (10 sec: 45874.4, 60 sec: 42871.4, 300 sec: 42931.6). Total num frames: 4122050560. Throughput: 0: 42964.0. Samples: 400879060. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 15:16:42,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 15:16:43,089][09403] Signal inference workers to stop experience collection... (5600 times) [2024-06-28 15:16:43,141][09423] InferenceWorker_p0-w0: stopping experience collection (5600 times) [2024-06-28 15:16:43,150][09403] Signal inference workers to resume experience collection... (5600 times) [2024-06-28 15:16:43,160][09423] InferenceWorker_p0-w0: resuming experience collection (5600 times) [2024-06-28 15:16:45,175][09423] Updated weights for policy 0, policy_version 251597 (0.0028) [2024-06-28 15:16:47,921][09190] Fps is (10 sec: 39322.1, 60 sec: 42598.4, 300 sec: 42820.6). Total num frames: 4122230784. Throughput: 0: 42794.2. Samples: 401127420. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 15:16:47,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:16:49,587][09423] Updated weights for policy 0, policy_version 251607 (0.0022) [2024-06-28 15:16:52,761][09423] Updated weights for policy 0, policy_version 251617 (0.0037) [2024-06-28 15:16:52,921][09190] Fps is (10 sec: 44237.3, 60 sec: 43144.6, 300 sec: 43098.5). Total num frames: 4122492928. Throughput: 0: 42820.5. Samples: 401378640. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 15:16:52,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 15:16:57,369][09423] Updated weights for policy 0, policy_version 251627 (0.0026) [2024-06-28 15:16:57,923][09190] Fps is (10 sec: 44230.2, 60 sec: 42324.3, 300 sec: 42875.9). Total num frames: 4122673152. Throughput: 0: 42808.3. Samples: 401510880. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 15:16:57,923][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 15:17:00,719][09423] Updated weights for policy 0, policy_version 251637 (0.0035) [2024-06-28 15:17:02,921][09190] Fps is (10 sec: 39321.3, 60 sec: 42871.4, 300 sec: 42876.1). Total num frames: 4122886144. Throughput: 0: 42944.3. Samples: 401768940. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 15:17:02,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 15:17:04,788][09423] Updated weights for policy 0, policy_version 251647 (0.0035) [2024-06-28 15:17:07,924][09190] Fps is (10 sec: 45870.6, 60 sec: 43142.8, 300 sec: 43097.9). Total num frames: 4123131904. Throughput: 0: 43135.4. Samples: 402030380. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 15:17:07,925][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 15:17:08,046][09423] Updated weights for policy 0, policy_version 251657 (0.0029) [2024-06-28 15:17:12,499][09423] Updated weights for policy 0, policy_version 251667 (0.0027) [2024-06-28 15:17:12,921][09190] Fps is (10 sec: 44236.9, 60 sec: 42871.4, 300 sec: 42820.6). Total num frames: 4123328512. Throughput: 0: 43030.7. Samples: 402164560. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 15:17:12,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:17:15,662][09423] Updated weights for policy 0, policy_version 251677 (0.0045) [2024-06-28 15:17:17,921][09190] Fps is (10 sec: 39331.2, 60 sec: 43144.5, 300 sec: 42931.6). Total num frames: 4123525120. Throughput: 0: 42919.4. Samples: 402420720. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 15:17:17,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 15:17:20,127][09423] Updated weights for policy 0, policy_version 251687 (0.0037) [2024-06-28 15:17:22,921][09190] Fps is (10 sec: 45875.3, 60 sec: 43417.6, 300 sec: 43042.7). Total num frames: 4123787264. Throughput: 0: 42801.0. Samples: 402668500. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 15:17:22,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 15:17:23,603][09423] Updated weights for policy 0, policy_version 251697 (0.0034) [2024-06-28 15:17:27,751][09423] Updated weights for policy 0, policy_version 251707 (0.0033) [2024-06-28 15:17:27,921][09190] Fps is (10 sec: 45875.4, 60 sec: 42871.5, 300 sec: 42931.7). Total num frames: 4123983872. Throughput: 0: 42842.7. Samples: 402806980. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 15:17:27,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 15:17:31,651][09423] Updated weights for policy 0, policy_version 251717 (0.0025) [2024-06-28 15:17:32,921][09190] Fps is (10 sec: 37683.2, 60 sec: 42871.4, 300 sec: 42820.5). Total num frames: 4124164096. Throughput: 0: 42871.1. Samples: 403056620. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 15:17:32,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:17:35,579][09423] Updated weights for policy 0, policy_version 251727 (0.0031) [2024-06-28 15:17:37,921][09190] Fps is (10 sec: 44236.7, 60 sec: 43144.6, 300 sec: 43042.7). Total num frames: 4124426240. Throughput: 0: 43012.4. Samples: 403314200. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 15:17:37,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 15:17:38,921][09423] Updated weights for policy 0, policy_version 251737 (0.0028) [2024-06-28 15:17:42,907][09423] Updated weights for policy 0, policy_version 251747 (0.0033) [2024-06-28 15:17:42,921][09190] Fps is (10 sec: 45875.4, 60 sec: 42871.5, 300 sec: 42931.6). Total num frames: 4124622848. Throughput: 0: 43069.0. Samples: 403448920. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 15:17:42,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 15:17:46,328][09423] Updated weights for policy 0, policy_version 251757 (0.0025) [2024-06-28 15:17:47,921][09190] Fps is (10 sec: 42598.8, 60 sec: 43690.7, 300 sec: 42987.2). Total num frames: 4124852224. Throughput: 0: 43111.7. Samples: 403708960. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 15:17:47,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 15:17:50,511][09423] Updated weights for policy 0, policy_version 251767 (0.0028) [2024-06-28 15:17:52,921][09190] Fps is (10 sec: 44236.9, 60 sec: 42871.5, 300 sec: 42987.2). Total num frames: 4125065216. Throughput: 0: 42941.5. Samples: 403962640. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 15:17:52,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:17:53,734][09423] Updated weights for policy 0, policy_version 251777 (0.0035) [2024-06-28 15:17:57,921][09190] Fps is (10 sec: 37683.2, 60 sec: 42599.5, 300 sec: 42765.0). Total num frames: 4125229056. Throughput: 0: 42873.4. Samples: 404093860. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 15:17:57,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 15:17:58,404][09423] Updated weights for policy 0, policy_version 251787 (0.0045) [2024-06-28 15:18:01,732][09423] Updated weights for policy 0, policy_version 251797 (0.0034) [2024-06-28 15:18:02,921][09190] Fps is (10 sec: 40960.1, 60 sec: 43144.6, 300 sec: 42931.6). Total num frames: 4125474816. Throughput: 0: 42838.3. Samples: 404348440. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 15:18:02,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 15:18:05,897][09423] Updated weights for policy 0, policy_version 251807 (0.0033) [2024-06-28 15:18:07,921][09190] Fps is (10 sec: 47513.5, 60 sec: 42873.3, 300 sec: 42931.6). Total num frames: 4125704192. Throughput: 0: 42924.9. Samples: 404600120. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 15:18:07,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 15:18:09,096][09423] Updated weights for policy 0, policy_version 251817 (0.0035) [2024-06-28 15:18:12,921][09190] Fps is (10 sec: 39321.1, 60 sec: 42325.3, 300 sec: 42765.0). Total num frames: 4125868032. Throughput: 0: 42782.6. Samples: 404732200. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 15:18:12,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 15:18:13,770][09423] Updated weights for policy 0, policy_version 251827 (0.0027) [2024-06-28 15:18:14,754][09403] Signal inference workers to stop experience collection... (5650 times) [2024-06-28 15:18:14,802][09423] InferenceWorker_p0-w0: stopping experience collection (5650 times) [2024-06-28 15:18:14,808][09403] Signal inference workers to resume experience collection... (5650 times) [2024-06-28 15:18:14,817][09423] InferenceWorker_p0-w0: resuming experience collection (5650 times) [2024-06-28 15:18:17,218][09423] Updated weights for policy 0, policy_version 251837 (0.0028) [2024-06-28 15:18:17,921][09190] Fps is (10 sec: 39321.7, 60 sec: 42871.5, 300 sec: 42876.1). Total num frames: 4126097408. Throughput: 0: 42898.3. Samples: 404987040. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 15:18:17,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 15:18:18,006][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000251838_4126113792.pth... [2024-06-28 15:18:18,056][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000251210_4115824640.pth [2024-06-28 15:18:21,107][09423] Updated weights for policy 0, policy_version 251847 (0.0025) [2024-06-28 15:18:22,922][09190] Fps is (10 sec: 47513.3, 60 sec: 42598.3, 300 sec: 42931.9). Total num frames: 4126343168. Throughput: 0: 43005.7. Samples: 405249460. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 15:18:22,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 15:18:24,658][09423] Updated weights for policy 0, policy_version 251857 (0.0032) [2024-06-28 15:18:27,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42325.4, 300 sec: 42765.0). Total num frames: 4126523392. Throughput: 0: 43049.3. Samples: 405386140. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 15:18:27,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:18:28,609][09423] Updated weights for policy 0, policy_version 251867 (0.0027) [2024-06-28 15:18:31,978][09423] Updated weights for policy 0, policy_version 251877 (0.0040) [2024-06-28 15:18:32,921][09190] Fps is (10 sec: 40960.3, 60 sec: 43144.5, 300 sec: 42987.2). Total num frames: 4126752768. Throughput: 0: 42854.1. Samples: 405637400. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 15:18:32,928][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 15:18:36,343][09423] Updated weights for policy 0, policy_version 251887 (0.0040) [2024-06-28 15:18:37,921][09190] Fps is (10 sec: 45874.9, 60 sec: 42598.4, 300 sec: 42931.6). Total num frames: 4126982144. Throughput: 0: 43102.1. Samples: 405902240. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 15:18:37,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 15:18:39,946][09423] Updated weights for policy 0, policy_version 251897 (0.0038) [2024-06-28 15:18:42,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42598.4, 300 sec: 42820.5). Total num frames: 4127178752. Throughput: 0: 42916.4. Samples: 406025100. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 15:18:42,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 15:18:44,377][09423] Updated weights for policy 0, policy_version 251907 (0.0052) [2024-06-28 15:18:47,270][09423] Updated weights for policy 0, policy_version 251917 (0.0033) [2024-06-28 15:18:47,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42598.4, 300 sec: 42987.2). Total num frames: 4127408128. Throughput: 0: 42719.5. Samples: 406270820. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 15:18:47,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 15:18:52,038][09423] Updated weights for policy 0, policy_version 251927 (0.0032) [2024-06-28 15:18:52,921][09190] Fps is (10 sec: 44236.9, 60 sec: 42598.4, 300 sec: 42876.1). Total num frames: 4127621120. Throughput: 0: 42988.4. Samples: 406534600. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 15:18:52,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 15:18:55,456][09423] Updated weights for policy 0, policy_version 251937 (0.0028) [2024-06-28 15:18:57,921][09190] Fps is (10 sec: 42598.3, 60 sec: 43417.5, 300 sec: 42820.6). Total num frames: 4127834112. Throughput: 0: 42961.4. Samples: 406665460. Policy #0 lag: (min: 0.0, avg: 8.9, max: 21.0) [2024-06-28 15:18:57,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 15:18:59,625][09423] Updated weights for policy 0, policy_version 251947 (0.0026) [2024-06-28 15:19:02,887][09423] Updated weights for policy 0, policy_version 251957 (0.0026) [2024-06-28 15:19:02,922][09190] Fps is (10 sec: 44236.4, 60 sec: 43144.4, 300 sec: 43042.7). Total num frames: 4128063488. Throughput: 0: 42990.5. Samples: 406921620. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 15:19:02,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 15:19:06,995][09423] Updated weights for policy 0, policy_version 251967 (0.0039) [2024-06-28 15:19:07,921][09190] Fps is (10 sec: 44236.7, 60 sec: 42871.4, 300 sec: 42876.1). Total num frames: 4128276480. Throughput: 0: 43042.3. Samples: 407186360. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 15:19:07,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 15:19:10,226][09423] Updated weights for policy 0, policy_version 251977 (0.0036) [2024-06-28 15:19:12,921][09190] Fps is (10 sec: 39322.0, 60 sec: 43144.6, 300 sec: 42820.6). Total num frames: 4128456704. Throughput: 0: 42804.9. Samples: 407312360. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 15:19:12,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 15:19:14,464][09423] Updated weights for policy 0, policy_version 251987 (0.0026) [2024-06-28 15:19:17,904][09423] Updated weights for policy 0, policy_version 251997 (0.0036) [2024-06-28 15:19:17,921][09190] Fps is (10 sec: 44237.0, 60 sec: 43690.6, 300 sec: 43098.2). Total num frames: 4128718848. Throughput: 0: 42858.3. Samples: 407566020. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 15:19:17,926][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 15:19:22,546][09423] Updated weights for policy 0, policy_version 252007 (0.0028) [2024-06-28 15:19:22,921][09190] Fps is (10 sec: 42598.6, 60 sec: 42325.4, 300 sec: 42820.6). Total num frames: 4128882688. Throughput: 0: 42718.3. Samples: 407824560. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 15:19:22,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 15:19:25,454][09423] Updated weights for policy 0, policy_version 252017 (0.0030) [2024-06-28 15:19:26,946][09403] Signal inference workers to stop experience collection... (5700 times) [2024-06-28 15:19:26,982][09423] InferenceWorker_p0-w0: stopping experience collection (5700 times) [2024-06-28 15:19:27,005][09403] Signal inference workers to resume experience collection... (5700 times) [2024-06-28 15:19:27,006][09423] InferenceWorker_p0-w0: resuming experience collection (5700 times) [2024-06-28 15:19:27,921][09190] Fps is (10 sec: 39322.0, 60 sec: 43144.6, 300 sec: 42876.1). Total num frames: 4129112064. Throughput: 0: 42630.8. Samples: 407943480. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 15:19:27,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:19:30,123][09423] Updated weights for policy 0, policy_version 252027 (0.0039) [2024-06-28 15:19:32,921][09190] Fps is (10 sec: 47513.7, 60 sec: 43417.7, 300 sec: 42987.2). Total num frames: 4129357824. Throughput: 0: 43065.8. Samples: 408208780. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 15:19:32,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 15:19:33,108][09423] Updated weights for policy 0, policy_version 252037 (0.0043) [2024-06-28 15:19:37,446][09423] Updated weights for policy 0, policy_version 252047 (0.0036) [2024-06-28 15:19:37,921][09190] Fps is (10 sec: 44236.2, 60 sec: 42871.5, 300 sec: 42931.6). Total num frames: 4129554432. Throughput: 0: 43022.2. Samples: 408470600. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 15:19:37,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 15:19:41,115][09423] Updated weights for policy 0, policy_version 252057 (0.0024) [2024-06-28 15:19:42,924][09190] Fps is (10 sec: 40949.5, 60 sec: 43142.8, 300 sec: 42931.6). Total num frames: 4129767424. Throughput: 0: 42990.1. Samples: 408600120. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 15:19:42,924][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:19:44,864][09423] Updated weights for policy 0, policy_version 252067 (0.0042) [2024-06-28 15:19:47,921][09190] Fps is (10 sec: 44237.0, 60 sec: 43144.6, 300 sec: 43042.7). Total num frames: 4129996800. Throughput: 0: 43182.8. Samples: 408864840. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 15:19:47,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 15:19:48,455][09423] Updated weights for policy 0, policy_version 252077 (0.0024) [2024-06-28 15:19:52,659][09423] Updated weights for policy 0, policy_version 252087 (0.0037) [2024-06-28 15:19:52,921][09190] Fps is (10 sec: 44248.0, 60 sec: 43144.6, 300 sec: 42931.6). Total num frames: 4130209792. Throughput: 0: 42991.2. Samples: 409120960. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 15:19:52,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 15:19:56,125][09423] Updated weights for policy 0, policy_version 252097 (0.0039) [2024-06-28 15:19:57,921][09190] Fps is (10 sec: 40959.8, 60 sec: 42871.5, 300 sec: 42932.0). Total num frames: 4130406400. Throughput: 0: 42980.4. Samples: 409246480. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 15:19:57,924][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 15:20:00,667][09423] Updated weights for policy 0, policy_version 252107 (0.0035) [2024-06-28 15:20:02,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42871.6, 300 sec: 42987.2). Total num frames: 4130635776. Throughput: 0: 42993.8. Samples: 409500740. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 15:20:02,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 15:20:03,978][09423] Updated weights for policy 0, policy_version 252117 (0.0036) [2024-06-28 15:20:07,921][09190] Fps is (10 sec: 40960.3, 60 sec: 42325.4, 300 sec: 42876.5). Total num frames: 4130816000. Throughput: 0: 43009.8. Samples: 409760000. Policy #0 lag: (min: 0.0, avg: 10.2, max: 23.0) [2024-06-28 15:20:07,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 15:20:08,100][09423] Updated weights for policy 0, policy_version 252127 (0.0033) [2024-06-28 15:20:11,608][09423] Updated weights for policy 0, policy_version 252137 (0.0033) [2024-06-28 15:20:12,921][09190] Fps is (10 sec: 40960.1, 60 sec: 43144.6, 300 sec: 42876.5). Total num frames: 4131045376. Throughput: 0: 43088.0. Samples: 409882440. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 15:20:12,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 15:20:15,573][09423] Updated weights for policy 0, policy_version 252147 (0.0026) [2024-06-28 15:20:17,921][09190] Fps is (10 sec: 44236.2, 60 sec: 42325.3, 300 sec: 42876.4). Total num frames: 4131258368. Throughput: 0: 42985.2. Samples: 410143120. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 15:20:17,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 15:20:17,939][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000252153_4131274752.pth... [2024-06-28 15:20:18,032][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000251523_4120952832.pth [2024-06-28 15:20:19,248][09423] Updated weights for policy 0, policy_version 252157 (0.0030) [2024-06-28 15:20:22,888][09423] Updated weights for policy 0, policy_version 252167 (0.0031) [2024-06-28 15:20:22,921][09190] Fps is (10 sec: 45874.8, 60 sec: 43690.6, 300 sec: 42931.6). Total num frames: 4131504128. Throughput: 0: 43079.1. Samples: 410409160. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 15:20:22,922][09190] Avg episode reward: [(0, '0.779')] [2024-06-28 15:20:26,675][09423] Updated weights for policy 0, policy_version 252177 (0.0036) [2024-06-28 15:20:27,924][09190] Fps is (10 sec: 42588.1, 60 sec: 42869.6, 300 sec: 42931.3). Total num frames: 4131684352. Throughput: 0: 43098.7. Samples: 410539560. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 15:20:27,924][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 15:20:30,591][09423] Updated weights for policy 0, policy_version 252187 (0.0044) [2024-06-28 15:20:32,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42598.4, 300 sec: 42931.7). Total num frames: 4131913728. Throughput: 0: 42860.5. Samples: 410793560. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 15:20:32,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 15:20:34,389][09423] Updated weights for policy 0, policy_version 252197 (0.0047) [2024-06-28 15:20:37,921][09190] Fps is (10 sec: 44248.0, 60 sec: 42871.5, 300 sec: 42876.1). Total num frames: 4132126720. Throughput: 0: 42813.3. Samples: 411047560. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 15:20:37,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 15:20:38,625][09423] Updated weights for policy 0, policy_version 252207 (0.0039) [2024-06-28 15:20:42,005][09423] Updated weights for policy 0, policy_version 252217 (0.0026) [2024-06-28 15:20:42,921][09190] Fps is (10 sec: 42598.1, 60 sec: 42873.2, 300 sec: 42931.6). Total num frames: 4132339712. Throughput: 0: 42917.3. Samples: 411177760. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 15:20:42,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 15:20:46,248][09423] Updated weights for policy 0, policy_version 252227 (0.0042) [2024-06-28 15:20:47,921][09190] Fps is (10 sec: 40959.6, 60 sec: 42325.3, 300 sec: 42820.5). Total num frames: 4132536320. Throughput: 0: 42854.1. Samples: 411429180. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 15:20:47,923][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 15:20:49,592][09423] Updated weights for policy 0, policy_version 252237 (0.0027) [2024-06-28 15:20:52,921][09190] Fps is (10 sec: 44237.1, 60 sec: 42871.5, 300 sec: 42876.1). Total num frames: 4132782080. Throughput: 0: 43028.9. Samples: 411696300. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 15:20:52,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 15:20:53,609][09423] Updated weights for policy 0, policy_version 252247 (0.0032) [2024-06-28 15:20:57,006][09423] Updated weights for policy 0, policy_version 252257 (0.0041) [2024-06-28 15:20:57,924][09190] Fps is (10 sec: 45864.0, 60 sec: 43142.8, 300 sec: 42986.8). Total num frames: 4132995072. Throughput: 0: 43238.9. Samples: 411828300. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 15:20:57,924][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 15:21:01,112][09423] Updated weights for policy 0, policy_version 252267 (0.0035) [2024-06-28 15:21:02,922][09190] Fps is (10 sec: 40959.4, 60 sec: 42598.3, 300 sec: 42876.1). Total num frames: 4133191680. Throughput: 0: 43132.4. Samples: 412084080. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 15:21:02,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 15:21:04,523][09423] Updated weights for policy 0, policy_version 252277 (0.0032) [2024-06-28 15:21:07,921][09190] Fps is (10 sec: 42608.9, 60 sec: 43417.5, 300 sec: 42931.6). Total num frames: 4133421056. Throughput: 0: 43006.2. Samples: 412344440. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 15:21:07,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:21:08,450][09423] Updated weights for policy 0, policy_version 252287 (0.0029) [2024-06-28 15:21:12,195][09423] Updated weights for policy 0, policy_version 252297 (0.0029) [2024-06-28 15:21:12,922][09190] Fps is (10 sec: 45875.1, 60 sec: 43417.5, 300 sec: 43098.2). Total num frames: 4133650432. Throughput: 0: 43032.5. Samples: 412475920. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 15:21:12,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 15:21:16,687][09403] Signal inference workers to stop experience collection... (5750 times) [2024-06-28 15:21:16,687][09403] Signal inference workers to resume experience collection... (5750 times) [2024-06-28 15:21:16,697][09423] InferenceWorker_p0-w0: stopping experience collection (5750 times) [2024-06-28 15:21:16,721][09423] InferenceWorker_p0-w0: resuming experience collection (5750 times) [2024-06-28 15:21:16,825][09423] Updated weights for policy 0, policy_version 252307 (0.0030) [2024-06-28 15:21:17,924][09190] Fps is (10 sec: 40950.0, 60 sec: 42869.8, 300 sec: 42875.7). Total num frames: 4133830656. Throughput: 0: 42801.2. Samples: 412719720. Policy #0 lag: (min: 0.0, avg: 10.7, max: 21.0) [2024-06-28 15:21:17,924][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 15:21:20,090][09423] Updated weights for policy 0, policy_version 252317 (0.0033) [2024-06-28 15:21:22,921][09190] Fps is (10 sec: 40960.8, 60 sec: 42598.5, 300 sec: 42876.1). Total num frames: 4134060032. Throughput: 0: 42886.3. Samples: 412977440. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 15:21:22,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 15:21:24,438][09423] Updated weights for policy 0, policy_version 252327 (0.0040) [2024-06-28 15:21:27,567][09423] Updated weights for policy 0, policy_version 252337 (0.0025) [2024-06-28 15:21:27,921][09190] Fps is (10 sec: 45886.8, 60 sec: 43419.4, 300 sec: 43042.7). Total num frames: 4134289408. Throughput: 0: 42969.4. Samples: 413111380. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 15:21:27,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 15:21:31,842][09423] Updated weights for policy 0, policy_version 252347 (0.0034) [2024-06-28 15:21:32,924][09190] Fps is (10 sec: 42587.6, 60 sec: 42869.7, 300 sec: 42875.7). Total num frames: 4134486016. Throughput: 0: 43178.6. Samples: 413372320. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 15:21:32,924][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 15:21:35,117][09423] Updated weights for policy 0, policy_version 252357 (0.0036) [2024-06-28 15:21:37,921][09190] Fps is (10 sec: 42598.3, 60 sec: 43144.5, 300 sec: 42931.6). Total num frames: 4134715392. Throughput: 0: 42908.0. Samples: 413627160. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 15:21:37,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 15:21:39,237][09423] Updated weights for policy 0, policy_version 252367 (0.0033) [2024-06-28 15:21:42,772][09423] Updated weights for policy 0, policy_version 252377 (0.0042) [2024-06-28 15:21:42,921][09190] Fps is (10 sec: 45886.8, 60 sec: 43417.7, 300 sec: 43098.3). Total num frames: 4134944768. Throughput: 0: 42902.4. Samples: 413758800. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 15:21:42,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 15:21:46,765][09423] Updated weights for policy 0, policy_version 252387 (0.0035) [2024-06-28 15:21:47,921][09190] Fps is (10 sec: 40960.4, 60 sec: 43144.7, 300 sec: 42820.6). Total num frames: 4135124992. Throughput: 0: 42814.9. Samples: 414010740. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 15:21:47,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 15:21:50,374][09423] Updated weights for policy 0, policy_version 252397 (0.0034) [2024-06-28 15:21:52,921][09190] Fps is (10 sec: 40959.6, 60 sec: 42871.4, 300 sec: 42987.4). Total num frames: 4135354368. Throughput: 0: 42706.2. Samples: 414266220. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 15:21:52,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:21:54,950][09423] Updated weights for policy 0, policy_version 252407 (0.0038) [2024-06-28 15:21:57,921][09190] Fps is (10 sec: 44236.5, 60 sec: 42873.3, 300 sec: 42987.2). Total num frames: 4135567360. Throughput: 0: 42654.9. Samples: 414395380. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 15:21:57,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 15:21:58,184][09423] Updated weights for policy 0, policy_version 252417 (0.0031) [2024-06-28 15:22:02,697][09423] Updated weights for policy 0, policy_version 252427 (0.0037) [2024-06-28 15:22:02,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42871.5, 300 sec: 42820.9). Total num frames: 4135763968. Throughput: 0: 42905.4. Samples: 414650360. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 15:22:02,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 15:22:06,132][09423] Updated weights for policy 0, policy_version 252437 (0.0034) [2024-06-28 15:22:07,921][09190] Fps is (10 sec: 42598.2, 60 sec: 42871.5, 300 sec: 42931.6). Total num frames: 4135993344. Throughput: 0: 42896.4. Samples: 414907780. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 15:22:07,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 15:22:10,092][09423] Updated weights for policy 0, policy_version 252447 (0.0032) [2024-06-28 15:22:12,921][09190] Fps is (10 sec: 44237.2, 60 sec: 42598.5, 300 sec: 42987.2). Total num frames: 4136206336. Throughput: 0: 42836.8. Samples: 415039040. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 15:22:12,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 15:22:13,602][09423] Updated weights for policy 0, policy_version 252457 (0.0032) [2024-06-28 15:22:17,347][09403] Signal inference workers to stop experience collection... (5800 times) [2024-06-28 15:22:17,348][09403] Signal inference workers to resume experience collection... (5800 times) [2024-06-28 15:22:17,386][09423] InferenceWorker_p0-w0: stopping experience collection (5800 times) [2024-06-28 15:22:17,386][09423] InferenceWorker_p0-w0: resuming experience collection (5800 times) [2024-06-28 15:22:17,488][09423] Updated weights for policy 0, policy_version 252467 (0.0041) [2024-06-28 15:22:17,921][09190] Fps is (10 sec: 42598.6, 60 sec: 43146.4, 300 sec: 42820.6). Total num frames: 4136419328. Throughput: 0: 42870.0. Samples: 415301360. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 15:22:17,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 15:22:18,022][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000252468_4136435712.pth... [2024-06-28 15:22:18,075][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000251838_4126113792.pth [2024-06-28 15:22:21,077][09423] Updated weights for policy 0, policy_version 252477 (0.0023) [2024-06-28 15:22:22,921][09190] Fps is (10 sec: 44236.7, 60 sec: 43144.5, 300 sec: 42931.6). Total num frames: 4136648704. Throughput: 0: 42855.5. Samples: 415555660. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 15:22:22,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 15:22:25,305][09423] Updated weights for policy 0, policy_version 252487 (0.0030) [2024-06-28 15:22:27,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42598.4, 300 sec: 42987.2). Total num frames: 4136845312. Throughput: 0: 42976.0. Samples: 415692720. Policy #0 lag: (min: 0.0, avg: 11.0, max: 23.0) [2024-06-28 15:22:27,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 15:22:28,502][09423] Updated weights for policy 0, policy_version 252497 (0.0036) [2024-06-28 15:22:32,924][09190] Fps is (10 sec: 40950.0, 60 sec: 42871.5, 300 sec: 42820.2). Total num frames: 4137058304. Throughput: 0: 42899.3. Samples: 415941320. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 15:22:32,924][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 15:22:33,185][09423] Updated weights for policy 0, policy_version 252507 (0.0031) [2024-06-28 15:22:36,370][09423] Updated weights for policy 0, policy_version 252517 (0.0031) [2024-06-28 15:22:37,921][09190] Fps is (10 sec: 44236.8, 60 sec: 42871.5, 300 sec: 42931.6). Total num frames: 4137287680. Throughput: 0: 42877.9. Samples: 416195720. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 15:22:37,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 15:22:40,549][09423] Updated weights for policy 0, policy_version 252527 (0.0031) [2024-06-28 15:22:42,922][09190] Fps is (10 sec: 44247.0, 60 sec: 42598.2, 300 sec: 42876.1). Total num frames: 4137500672. Throughput: 0: 42952.7. Samples: 416328260. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 15:22:42,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 15:22:43,834][09423] Updated weights for policy 0, policy_version 252537 (0.0038) [2024-06-28 15:22:47,921][09190] Fps is (10 sec: 42598.3, 60 sec: 43144.5, 300 sec: 42876.1). Total num frames: 4137713664. Throughput: 0: 43006.8. Samples: 416585660. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 15:22:47,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 15:22:48,540][09423] Updated weights for policy 0, policy_version 252547 (0.0033) [2024-06-28 15:22:51,728][09423] Updated weights for policy 0, policy_version 252557 (0.0033) [2024-06-28 15:22:52,921][09190] Fps is (10 sec: 44237.8, 60 sec: 43144.6, 300 sec: 43098.3). Total num frames: 4137943040. Throughput: 0: 43011.6. Samples: 416843300. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 15:22:52,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 15:22:55,935][09423] Updated weights for policy 0, policy_version 252567 (0.0035) [2024-06-28 15:22:57,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42871.5, 300 sec: 42931.6). Total num frames: 4138139648. Throughput: 0: 43143.2. Samples: 416980480. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 15:22:57,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 15:22:59,357][09423] Updated weights for policy 0, policy_version 252577 (0.0036) [2024-06-28 15:23:02,921][09190] Fps is (10 sec: 40959.9, 60 sec: 43144.6, 300 sec: 42876.1). Total num frames: 4138352640. Throughput: 0: 42913.3. Samples: 417232460. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 15:23:02,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 15:23:03,498][09423] Updated weights for policy 0, policy_version 252587 (0.0033) [2024-06-28 15:23:06,858][09423] Updated weights for policy 0, policy_version 252597 (0.0025) [2024-06-28 15:23:07,921][09190] Fps is (10 sec: 45874.5, 60 sec: 43417.5, 300 sec: 43153.8). Total num frames: 4138598400. Throughput: 0: 42793.3. Samples: 417481360. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 15:23:07,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 15:23:11,835][09423] Updated weights for policy 0, policy_version 252607 (0.0045) [2024-06-28 15:23:12,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42598.4, 300 sec: 42931.6). Total num frames: 4138762240. Throughput: 0: 42713.8. Samples: 417614840. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 15:23:12,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:23:14,719][09423] Updated weights for policy 0, policy_version 252617 (0.0034) [2024-06-28 15:23:17,921][09190] Fps is (10 sec: 39322.1, 60 sec: 42871.5, 300 sec: 42876.1). Total num frames: 4138991616. Throughput: 0: 42755.7. Samples: 417865220. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 15:23:17,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:23:19,410][09423] Updated weights for policy 0, policy_version 252627 (0.0033) [2024-06-28 15:23:22,319][09423] Updated weights for policy 0, policy_version 252637 (0.0037) [2024-06-28 15:23:22,921][09190] Fps is (10 sec: 47513.2, 60 sec: 43144.5, 300 sec: 43098.2). Total num frames: 4139237376. Throughput: 0: 42769.2. Samples: 418120340. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 15:23:22,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 15:23:26,776][09423] Updated weights for policy 0, policy_version 252647 (0.0032) [2024-06-28 15:23:27,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42871.5, 300 sec: 42931.6). Total num frames: 4139417600. Throughput: 0: 42832.2. Samples: 418255700. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 15:23:27,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:23:29,917][09423] Updated weights for policy 0, policy_version 252657 (0.0034) [2024-06-28 15:23:32,922][09190] Fps is (10 sec: 40959.7, 60 sec: 43146.2, 300 sec: 42931.6). Total num frames: 4139646976. Throughput: 0: 42931.9. Samples: 418517600. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 15:23:32,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 15:23:34,450][09423] Updated weights for policy 0, policy_version 252667 (0.0027) [2024-06-28 15:23:37,499][09423] Updated weights for policy 0, policy_version 252677 (0.0034) [2024-06-28 15:23:37,921][09190] Fps is (10 sec: 45875.2, 60 sec: 43144.5, 300 sec: 43042.7). Total num frames: 4139876352. Throughput: 0: 42884.4. Samples: 418773100. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 15:23:37,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 15:23:42,109][09423] Updated weights for policy 0, policy_version 252687 (0.0023) [2024-06-28 15:23:42,921][09190] Fps is (10 sec: 40960.5, 60 sec: 42598.5, 300 sec: 42876.1). Total num frames: 4140056576. Throughput: 0: 42786.2. Samples: 418905860. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-28 15:23:42,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 15:23:45,088][09423] Updated weights for policy 0, policy_version 252697 (0.0037) [2024-06-28 15:23:47,924][09190] Fps is (10 sec: 40949.7, 60 sec: 42869.7, 300 sec: 42931.3). Total num frames: 4140285952. Throughput: 0: 42663.4. Samples: 419152420. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-28 15:23:47,924][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 15:23:50,158][09423] Updated weights for policy 0, policy_version 252707 (0.0022) [2024-06-28 15:23:52,578][09423] Updated weights for policy 0, policy_version 252717 (0.0029) [2024-06-28 15:23:52,922][09190] Fps is (10 sec: 45874.5, 60 sec: 42871.3, 300 sec: 42987.2). Total num frames: 4140515328. Throughput: 0: 42889.3. Samples: 419411380. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-28 15:23:52,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 15:23:57,701][09423] Updated weights for policy 0, policy_version 252727 (0.0028) [2024-06-28 15:23:57,921][09190] Fps is (10 sec: 39331.3, 60 sec: 42325.3, 300 sec: 42765.0). Total num frames: 4140679168. Throughput: 0: 42868.4. Samples: 419543920. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-28 15:23:57,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:23:59,399][09403] Signal inference workers to stop experience collection... (5850 times) [2024-06-28 15:23:59,403][09403] Signal inference workers to resume experience collection... (5850 times) [2024-06-28 15:23:59,447][09423] InferenceWorker_p0-w0: stopping experience collection (5850 times) [2024-06-28 15:23:59,452][09423] InferenceWorker_p0-w0: resuming experience collection (5850 times) [2024-06-28 15:24:00,441][09423] Updated weights for policy 0, policy_version 252737 (0.0037) [2024-06-28 15:24:02,921][09190] Fps is (10 sec: 42598.8, 60 sec: 43144.5, 300 sec: 42931.6). Total num frames: 4140941312. Throughput: 0: 42884.4. Samples: 419795020. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-28 15:24:02,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 15:24:05,161][09423] Updated weights for policy 0, policy_version 252747 (0.0024) [2024-06-28 15:24:07,924][09190] Fps is (10 sec: 47501.6, 60 sec: 42596.7, 300 sec: 43042.3). Total num frames: 4141154304. Throughput: 0: 43191.8. Samples: 420064080. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-28 15:24:07,924][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:24:08,058][09423] Updated weights for policy 0, policy_version 252757 (0.0034) [2024-06-28 15:24:12,439][09423] Updated weights for policy 0, policy_version 252767 (0.0033) [2024-06-28 15:24:12,921][09190] Fps is (10 sec: 40959.9, 60 sec: 43144.5, 300 sec: 42820.5). Total num frames: 4141350912. Throughput: 0: 43023.0. Samples: 420191740. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-28 15:24:12,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 15:24:15,542][09423] Updated weights for policy 0, policy_version 252777 (0.0026) [2024-06-28 15:24:17,921][09190] Fps is (10 sec: 44247.5, 60 sec: 43417.5, 300 sec: 43098.2). Total num frames: 4141596672. Throughput: 0: 42891.6. Samples: 420447720. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-28 15:24:17,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:24:17,940][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000252783_4141596672.pth... [2024-06-28 15:24:18,002][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000252153_4131274752.pth [2024-06-28 15:24:20,674][09423] Updated weights for policy 0, policy_version 252787 (0.0033) [2024-06-28 15:24:22,921][09190] Fps is (10 sec: 45875.6, 60 sec: 42871.5, 300 sec: 43042.7). Total num frames: 4141809664. Throughput: 0: 42874.2. Samples: 420702440. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-28 15:24:22,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 15:24:23,308][09423] Updated weights for policy 0, policy_version 252797 (0.0037) [2024-06-28 15:24:27,924][09190] Fps is (10 sec: 37674.1, 60 sec: 42596.6, 300 sec: 42764.6). Total num frames: 4141973504. Throughput: 0: 42770.9. Samples: 420830660. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-28 15:24:27,924][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 15:24:28,205][09423] Updated weights for policy 0, policy_version 252807 (0.0042) [2024-06-28 15:24:30,845][09423] Updated weights for policy 0, policy_version 252817 (0.0022) [2024-06-28 15:24:32,921][09190] Fps is (10 sec: 42598.0, 60 sec: 43144.6, 300 sec: 42987.2). Total num frames: 4142235648. Throughput: 0: 43032.5. Samples: 421088780. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-28 15:24:32,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 15:24:35,951][09423] Updated weights for policy 0, policy_version 252827 (0.0032) [2024-06-28 15:24:37,921][09190] Fps is (10 sec: 47525.1, 60 sec: 42871.4, 300 sec: 42987.5). Total num frames: 4142448640. Throughput: 0: 43137.8. Samples: 421352580. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-28 15:24:37,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:24:38,414][09423] Updated weights for policy 0, policy_version 252837 (0.0028) [2024-06-28 15:24:42,921][09190] Fps is (10 sec: 39321.5, 60 sec: 42871.4, 300 sec: 42820.5). Total num frames: 4142628864. Throughput: 0: 42912.4. Samples: 421474980. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-28 15:24:42,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:24:43,262][09423] Updated weights for policy 0, policy_version 252847 (0.0032) [2024-06-28 15:24:46,018][09423] Updated weights for policy 0, policy_version 252857 (0.0024) [2024-06-28 15:24:47,921][09190] Fps is (10 sec: 45875.6, 60 sec: 43692.4, 300 sec: 43042.7). Total num frames: 4142907392. Throughput: 0: 43178.2. Samples: 421738040. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-28 15:24:47,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:24:50,596][09423] Updated weights for policy 0, policy_version 252867 (0.0032) [2024-06-28 15:24:52,921][09190] Fps is (10 sec: 44237.4, 60 sec: 42598.5, 300 sec: 42931.6). Total num frames: 4143071232. Throughput: 0: 43142.9. Samples: 422005400. Policy #0 lag: (min: 0.0, avg: 7.9, max: 21.0) [2024-06-28 15:24:52,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:24:53,464][09423] Updated weights for policy 0, policy_version 252877 (0.0033) [2024-06-28 15:24:57,921][09190] Fps is (10 sec: 37683.0, 60 sec: 43417.5, 300 sec: 42876.1). Total num frames: 4143284224. Throughput: 0: 42959.1. Samples: 422124900. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 15:24:57,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 15:24:58,344][09423] Updated weights for policy 0, policy_version 252887 (0.0035) [2024-06-28 15:25:01,504][09423] Updated weights for policy 0, policy_version 252897 (0.0032) [2024-06-28 15:25:02,921][09190] Fps is (10 sec: 47513.4, 60 sec: 43417.6, 300 sec: 43153.8). Total num frames: 4143546368. Throughput: 0: 43040.5. Samples: 422384540. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 15:25:02,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:25:05,659][09423] Updated weights for policy 0, policy_version 252907 (0.0031) [2024-06-28 15:25:07,921][09190] Fps is (10 sec: 42598.9, 60 sec: 42600.2, 300 sec: 42931.6). Total num frames: 4143710208. Throughput: 0: 43114.2. Samples: 422642580. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 15:25:07,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:25:08,864][09423] Updated weights for policy 0, policy_version 252917 (0.0035) [2024-06-28 15:25:09,613][09403] Signal inference workers to stop experience collection... (5900 times) [2024-06-28 15:25:09,614][09403] Signal inference workers to resume experience collection... (5900 times) [2024-06-28 15:25:09,646][09423] InferenceWorker_p0-w0: stopping experience collection (5900 times) [2024-06-28 15:25:09,646][09423] InferenceWorker_p0-w0: resuming experience collection (5900 times) [2024-06-28 15:25:12,921][09190] Fps is (10 sec: 37683.5, 60 sec: 42871.6, 300 sec: 42931.7). Total num frames: 4143923200. Throughput: 0: 42956.7. Samples: 422763600. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 15:25:12,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 15:25:13,614][09423] Updated weights for policy 0, policy_version 252927 (0.0036) [2024-06-28 15:25:16,807][09423] Updated weights for policy 0, policy_version 252937 (0.0025) [2024-06-28 15:25:17,922][09190] Fps is (10 sec: 47512.6, 60 sec: 43144.5, 300 sec: 42987.2). Total num frames: 4144185344. Throughput: 0: 43076.3. Samples: 423027220. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 15:25:17,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:25:20,914][09423] Updated weights for policy 0, policy_version 252947 (0.0033) [2024-06-28 15:25:22,921][09190] Fps is (10 sec: 44236.7, 60 sec: 42598.4, 300 sec: 42987.5). Total num frames: 4144365568. Throughput: 0: 43163.2. Samples: 423294920. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 15:25:22,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 15:25:24,183][09423] Updated weights for policy 0, policy_version 252957 (0.0038) [2024-06-28 15:25:27,922][09190] Fps is (10 sec: 39321.8, 60 sec: 43419.3, 300 sec: 42931.6). Total num frames: 4144578560. Throughput: 0: 43132.8. Samples: 423415960. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 15:25:27,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 15:25:28,279][09423] Updated weights for policy 0, policy_version 252967 (0.0039) [2024-06-28 15:25:31,693][09423] Updated weights for policy 0, policy_version 252977 (0.0039) [2024-06-28 15:25:32,921][09190] Fps is (10 sec: 45874.7, 60 sec: 43144.5, 300 sec: 43042.7). Total num frames: 4144824320. Throughput: 0: 43165.7. Samples: 423680500. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 15:25:32,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 15:25:35,915][09423] Updated weights for policy 0, policy_version 252987 (0.0028) [2024-06-28 15:25:37,922][09190] Fps is (10 sec: 44237.0, 60 sec: 42871.5, 300 sec: 42987.2). Total num frames: 4145020928. Throughput: 0: 43051.9. Samples: 423942740. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 15:25:37,936][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 15:25:39,514][09423] Updated weights for policy 0, policy_version 252997 (0.0028) [2024-06-28 15:25:42,921][09190] Fps is (10 sec: 40960.2, 60 sec: 43417.6, 300 sec: 43042.7). Total num frames: 4145233920. Throughput: 0: 42983.2. Samples: 424059140. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 15:25:42,923][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 15:25:43,855][09423] Updated weights for policy 0, policy_version 253007 (0.0025) [2024-06-28 15:25:47,014][09423] Updated weights for policy 0, policy_version 253017 (0.0035) [2024-06-28 15:25:47,921][09190] Fps is (10 sec: 42599.0, 60 sec: 42325.4, 300 sec: 42931.6). Total num frames: 4145446912. Throughput: 0: 42957.8. Samples: 424317640. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 15:25:47,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:25:51,623][09423] Updated weights for policy 0, policy_version 253027 (0.0026) [2024-06-28 15:25:52,921][09190] Fps is (10 sec: 37683.7, 60 sec: 42325.4, 300 sec: 42765.4). Total num frames: 4145610752. Throughput: 0: 43061.4. Samples: 424580340. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 15:25:52,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 15:25:54,886][09423] Updated weights for policy 0, policy_version 253037 (0.0022) [2024-06-28 15:25:57,922][09190] Fps is (10 sec: 44235.8, 60 sec: 43417.5, 300 sec: 43042.7). Total num frames: 4145889280. Throughput: 0: 42974.9. Samples: 424697480. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 15:25:57,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 15:25:59,006][09423] Updated weights for policy 0, policy_version 253047 (0.0031) [2024-06-28 15:26:02,298][09423] Updated weights for policy 0, policy_version 253057 (0.0036) [2024-06-28 15:26:02,921][09190] Fps is (10 sec: 49151.5, 60 sec: 42598.4, 300 sec: 42987.2). Total num frames: 4146102272. Throughput: 0: 43129.1. Samples: 424968020. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 15:26:02,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 15:26:06,685][09423] Updated weights for policy 0, policy_version 253067 (0.0036) [2024-06-28 15:26:07,922][09190] Fps is (10 sec: 39321.6, 60 sec: 42871.3, 300 sec: 42820.6). Total num frames: 4146282496. Throughput: 0: 43036.2. Samples: 425231560. Policy #0 lag: (min: 1.0, avg: 11.5, max: 21.0) [2024-06-28 15:26:07,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 15:26:09,938][09423] Updated weights for policy 0, policy_version 253077 (0.0028) [2024-06-28 15:26:12,921][09190] Fps is (10 sec: 44236.6, 60 sec: 43690.6, 300 sec: 43098.6). Total num frames: 4146544640. Throughput: 0: 43040.1. Samples: 425352760. Policy #0 lag: (min: 1.0, avg: 11.5, max: 21.0) [2024-06-28 15:26:12,925][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 15:26:14,219][09423] Updated weights for policy 0, policy_version 253087 (0.0038) [2024-06-28 15:26:17,718][09423] Updated weights for policy 0, policy_version 253097 (0.0036) [2024-06-28 15:26:17,921][09190] Fps is (10 sec: 45875.5, 60 sec: 42598.5, 300 sec: 42987.2). Total num frames: 4146741248. Throughput: 0: 42944.9. Samples: 425613020. Policy #0 lag: (min: 1.0, avg: 11.5, max: 21.0) [2024-06-28 15:26:17,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 15:26:17,942][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000253097_4146741248.pth... [2024-06-28 15:26:18,007][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000252468_4136435712.pth [2024-06-28 15:26:21,877][09423] Updated weights for policy 0, policy_version 253107 (0.0034) [2024-06-28 15:26:22,921][09190] Fps is (10 sec: 37683.0, 60 sec: 42598.3, 300 sec: 42820.5). Total num frames: 4146921472. Throughput: 0: 42760.9. Samples: 425866980. Policy #0 lag: (min: 1.0, avg: 11.5, max: 21.0) [2024-06-28 15:26:22,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 15:26:25,325][09423] Updated weights for policy 0, policy_version 253117 (0.0032) [2024-06-28 15:26:27,921][09190] Fps is (10 sec: 44237.5, 60 sec: 43417.7, 300 sec: 43043.1). Total num frames: 4147183616. Throughput: 0: 42802.7. Samples: 425985260. Policy #0 lag: (min: 1.0, avg: 11.5, max: 21.0) [2024-06-28 15:26:27,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 15:26:29,327][09423] Updated weights for policy 0, policy_version 253127 (0.0033) [2024-06-28 15:26:32,921][09190] Fps is (10 sec: 45875.3, 60 sec: 42598.4, 300 sec: 42931.6). Total num frames: 4147380224. Throughput: 0: 43055.9. Samples: 426255160. Policy #0 lag: (min: 1.0, avg: 11.5, max: 21.0) [2024-06-28 15:26:32,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 15:26:32,982][09423] Updated weights for policy 0, policy_version 253137 (0.0028) [2024-06-28 15:26:36,723][09423] Updated weights for policy 0, policy_version 253147 (0.0034) [2024-06-28 15:26:37,921][09190] Fps is (10 sec: 39321.1, 60 sec: 42598.4, 300 sec: 42820.5). Total num frames: 4147576832. Throughput: 0: 43023.0. Samples: 426516380. Policy #0 lag: (min: 1.0, avg: 11.5, max: 21.0) [2024-06-28 15:26:37,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 15:26:40,484][09423] Updated weights for policy 0, policy_version 253157 (0.0030) [2024-06-28 15:26:42,921][09190] Fps is (10 sec: 44237.1, 60 sec: 43144.5, 300 sec: 43042.7). Total num frames: 4147822592. Throughput: 0: 43167.7. Samples: 426640020. Policy #0 lag: (min: 1.0, avg: 11.5, max: 21.0) [2024-06-28 15:26:42,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 15:26:44,022][09423] Updated weights for policy 0, policy_version 253167 (0.0035) [2024-06-28 15:26:47,420][09403] Signal inference workers to stop experience collection... (5950 times) [2024-06-28 15:26:47,420][09403] Signal inference workers to resume experience collection... (5950 times) [2024-06-28 15:26:47,467][09423] InferenceWorker_p0-w0: stopping experience collection (5950 times) [2024-06-28 15:26:47,467][09423] InferenceWorker_p0-w0: resuming experience collection (5950 times) [2024-06-28 15:26:47,924][09190] Fps is (10 sec: 45863.8, 60 sec: 43142.7, 300 sec: 42986.8). Total num frames: 4148035584. Throughput: 0: 43004.7. Samples: 426903340. Policy #0 lag: (min: 1.0, avg: 11.5, max: 21.0) [2024-06-28 15:26:47,925][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 15:26:48,138][09423] Updated weights for policy 0, policy_version 253177 (0.0032) [2024-06-28 15:26:51,783][09423] Updated weights for policy 0, policy_version 253187 (0.0030) [2024-06-28 15:26:52,925][09190] Fps is (10 sec: 39307.9, 60 sec: 43415.0, 300 sec: 42875.6). Total num frames: 4148215808. Throughput: 0: 42879.9. Samples: 427161300. Policy #0 lag: (min: 1.0, avg: 11.5, max: 21.0) [2024-06-28 15:26:52,925][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 15:26:55,553][09423] Updated weights for policy 0, policy_version 253197 (0.0025) [2024-06-28 15:26:57,921][09190] Fps is (10 sec: 44247.5, 60 sec: 43144.6, 300 sec: 43098.2). Total num frames: 4148477952. Throughput: 0: 42862.6. Samples: 427281580. Policy #0 lag: (min: 1.0, avg: 11.5, max: 21.0) [2024-06-28 15:26:57,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 15:26:59,919][09423] Updated weights for policy 0, policy_version 253207 (0.0030) [2024-06-28 15:27:02,921][09190] Fps is (10 sec: 42613.4, 60 sec: 42325.3, 300 sec: 42876.1). Total num frames: 4148641792. Throughput: 0: 42869.9. Samples: 427542160. Policy #0 lag: (min: 1.0, avg: 11.5, max: 21.0) [2024-06-28 15:27:02,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 15:27:03,596][09423] Updated weights for policy 0, policy_version 253217 (0.0027) [2024-06-28 15:27:07,337][09423] Updated weights for policy 0, policy_version 253227 (0.0031) [2024-06-28 15:27:07,921][09190] Fps is (10 sec: 39321.7, 60 sec: 43144.6, 300 sec: 42931.6). Total num frames: 4148871168. Throughput: 0: 42823.6. Samples: 427794040. Policy #0 lag: (min: 1.0, avg: 11.5, max: 21.0) [2024-06-28 15:27:07,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 15:27:10,974][09423] Updated weights for policy 0, policy_version 253237 (0.0042) [2024-06-28 15:27:12,921][09190] Fps is (10 sec: 47513.8, 60 sec: 42871.5, 300 sec: 43042.7). Total num frames: 4149116928. Throughput: 0: 43088.0. Samples: 427924220. Policy #0 lag: (min: 1.0, avg: 11.5, max: 21.0) [2024-06-28 15:27:12,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 15:27:14,721][09423] Updated weights for policy 0, policy_version 253247 (0.0037) [2024-06-28 15:27:17,921][09190] Fps is (10 sec: 44237.5, 60 sec: 42871.6, 300 sec: 42931.6). Total num frames: 4149313536. Throughput: 0: 43053.9. Samples: 428192580. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-28 15:27:17,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 15:27:18,723][09423] Updated weights for policy 0, policy_version 253257 (0.0026) [2024-06-28 15:27:22,192][09423] Updated weights for policy 0, policy_version 253267 (0.0028) [2024-06-28 15:27:22,921][09190] Fps is (10 sec: 40959.4, 60 sec: 43417.6, 300 sec: 42987.2). Total num frames: 4149526528. Throughput: 0: 42859.5. Samples: 428445060. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-28 15:27:22,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 15:27:26,416][09423] Updated weights for policy 0, policy_version 253277 (0.0048) [2024-06-28 15:27:27,921][09190] Fps is (10 sec: 44236.4, 60 sec: 42871.4, 300 sec: 43043.1). Total num frames: 4149755904. Throughput: 0: 43052.4. Samples: 428577380. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-28 15:27:27,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 15:27:29,735][09423] Updated weights for policy 0, policy_version 253287 (0.0037) [2024-06-28 15:27:32,921][09190] Fps is (10 sec: 42598.8, 60 sec: 42871.5, 300 sec: 42931.6). Total num frames: 4149952512. Throughput: 0: 42990.4. Samples: 428837800. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-28 15:27:32,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 15:27:33,999][09423] Updated weights for policy 0, policy_version 253297 (0.0023) [2024-06-28 15:27:37,834][09423] Updated weights for policy 0, policy_version 253307 (0.0044) [2024-06-28 15:27:37,921][09190] Fps is (10 sec: 42598.4, 60 sec: 43417.6, 300 sec: 42987.2). Total num frames: 4150181888. Throughput: 0: 42650.8. Samples: 429080440. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-28 15:27:37,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 15:27:41,826][09423] Updated weights for policy 0, policy_version 253317 (0.0035) [2024-06-28 15:27:42,923][09190] Fps is (10 sec: 44231.0, 60 sec: 42870.5, 300 sec: 42987.0). Total num frames: 4150394880. Throughput: 0: 42894.0. Samples: 429211860. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-28 15:27:42,923][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 15:27:45,502][09423] Updated weights for policy 0, policy_version 253327 (0.0035) [2024-06-28 15:27:47,921][09190] Fps is (10 sec: 39321.5, 60 sec: 42327.1, 300 sec: 42820.5). Total num frames: 4150575104. Throughput: 0: 42867.9. Samples: 429471220. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-28 15:27:47,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 15:27:49,317][09423] Updated weights for policy 0, policy_version 253337 (0.0033) [2024-06-28 15:27:52,921][09190] Fps is (10 sec: 40965.3, 60 sec: 43147.0, 300 sec: 42931.6). Total num frames: 4150804480. Throughput: 0: 42921.4. Samples: 429725500. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-28 15:27:52,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 15:27:53,337][09423] Updated weights for policy 0, policy_version 253347 (0.0037) [2024-06-28 15:27:56,933][09423] Updated weights for policy 0, policy_version 253357 (0.0035) [2024-06-28 15:27:57,922][09190] Fps is (10 sec: 47513.3, 60 sec: 42871.5, 300 sec: 43042.7). Total num frames: 4151050240. Throughput: 0: 42962.5. Samples: 429857540. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-28 15:27:57,922][09190] Avg episode reward: [(0, '0.734')] [2024-06-28 15:28:00,754][09423] Updated weights for policy 0, policy_version 253367 (0.0025) [2024-06-28 15:28:02,921][09190] Fps is (10 sec: 44236.8, 60 sec: 43417.6, 300 sec: 42876.1). Total num frames: 4151246848. Throughput: 0: 42917.2. Samples: 430123860. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-28 15:28:02,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 15:28:04,599][09423] Updated weights for policy 0, policy_version 253377 (0.0026) [2024-06-28 15:28:07,921][09190] Fps is (10 sec: 40961.0, 60 sec: 43144.7, 300 sec: 43042.7). Total num frames: 4151459840. Throughput: 0: 42923.3. Samples: 430376600. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-28 15:28:07,922][09190] Avg episode reward: [(0, '0.735')] [2024-06-28 15:28:08,227][09423] Updated weights for policy 0, policy_version 253387 (0.0027) [2024-06-28 15:28:12,273][09423] Updated weights for policy 0, policy_version 253397 (0.0051) [2024-06-28 15:28:12,921][09190] Fps is (10 sec: 42598.7, 60 sec: 42598.4, 300 sec: 42987.2). Total num frames: 4151672832. Throughput: 0: 42906.7. Samples: 430508180. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-28 15:28:12,922][09190] Avg episode reward: [(0, '0.728')] [2024-06-28 15:28:16,152][09423] Updated weights for policy 0, policy_version 253407 (0.0042) [2024-06-28 15:28:17,171][09403] Signal inference workers to stop experience collection... (6000 times) [2024-06-28 15:28:17,172][09403] Signal inference workers to resume experience collection... (6000 times) [2024-06-28 15:28:17,204][09423] InferenceWorker_p0-w0: stopping experience collection (6000 times) [2024-06-28 15:28:17,204][09423] InferenceWorker_p0-w0: resuming experience collection (6000 times) [2024-06-28 15:28:17,921][09190] Fps is (10 sec: 40959.5, 60 sec: 42598.4, 300 sec: 42820.6). Total num frames: 4151869440. Throughput: 0: 42637.8. Samples: 430756500. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-28 15:28:17,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:28:17,951][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000253411_4151885824.pth... [2024-06-28 15:28:18,005][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000252783_4141596672.pth [2024-06-28 15:28:20,133][09423] Updated weights for policy 0, policy_version 253417 (0.0041) [2024-06-28 15:28:22,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42871.6, 300 sec: 42987.2). Total num frames: 4152098816. Throughput: 0: 42898.8. Samples: 431010880. Policy #0 lag: (min: 0.0, avg: 11.3, max: 21.0) [2024-06-28 15:28:22,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 15:28:23,719][09423] Updated weights for policy 0, policy_version 253427 (0.0032) [2024-06-28 15:28:27,909][09423] Updated weights for policy 0, policy_version 253437 (0.0038) [2024-06-28 15:28:27,921][09190] Fps is (10 sec: 44236.5, 60 sec: 42598.4, 300 sec: 42931.6). Total num frames: 4152311808. Throughput: 0: 42908.3. Samples: 431142680. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 15:28:27,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 15:28:31,804][09423] Updated weights for policy 0, policy_version 253447 (0.0054) [2024-06-28 15:28:32,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42871.5, 300 sec: 42876.1). Total num frames: 4152524800. Throughput: 0: 42800.6. Samples: 431397240. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 15:28:32,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 15:28:35,337][09423] Updated weights for policy 0, policy_version 253457 (0.0031) [2024-06-28 15:28:37,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42598.4, 300 sec: 42987.2). Total num frames: 4152737792. Throughput: 0: 42988.9. Samples: 431660000. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 15:28:37,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 15:28:39,223][09423] Updated weights for policy 0, policy_version 253467 (0.0026) [2024-06-28 15:28:42,817][09423] Updated weights for policy 0, policy_version 253477 (0.0039) [2024-06-28 15:28:42,921][09190] Fps is (10 sec: 44236.1, 60 sec: 42872.3, 300 sec: 42987.5). Total num frames: 4152967168. Throughput: 0: 43043.6. Samples: 431794500. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 15:28:42,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 15:28:46,858][09423] Updated weights for policy 0, policy_version 253487 (0.0039) [2024-06-28 15:28:47,921][09190] Fps is (10 sec: 44236.9, 60 sec: 43417.6, 300 sec: 42931.6). Total num frames: 4153180160. Throughput: 0: 42795.6. Samples: 432049660. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 15:28:47,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:28:50,329][09423] Updated weights for policy 0, policy_version 253497 (0.0039) [2024-06-28 15:28:52,921][09190] Fps is (10 sec: 42599.2, 60 sec: 43144.6, 300 sec: 43098.3). Total num frames: 4153393152. Throughput: 0: 42838.2. Samples: 432304320. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 15:28:52,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 15:28:54,830][09423] Updated weights for policy 0, policy_version 253507 (0.0051) [2024-06-28 15:28:57,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42325.4, 300 sec: 42876.1). Total num frames: 4153589760. Throughput: 0: 42667.0. Samples: 432428200. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 15:28:57,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 15:28:58,075][09423] Updated weights for policy 0, policy_version 253517 (0.0038) [2024-06-28 15:29:02,333][09423] Updated weights for policy 0, policy_version 253527 (0.0033) [2024-06-28 15:29:02,921][09190] Fps is (10 sec: 40959.7, 60 sec: 42598.4, 300 sec: 42876.5). Total num frames: 4153802752. Throughput: 0: 42935.1. Samples: 432688580. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 15:29:02,922][09190] Avg episode reward: [(0, '0.732')] [2024-06-28 15:29:05,467][09423] Updated weights for policy 0, policy_version 253537 (0.0029) [2024-06-28 15:29:07,921][09190] Fps is (10 sec: 44236.8, 60 sec: 42871.4, 300 sec: 42987.2). Total num frames: 4154032128. Throughput: 0: 42979.5. Samples: 432944960. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 15:29:07,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 15:29:09,733][09423] Updated weights for policy 0, policy_version 253547 (0.0032) [2024-06-28 15:29:12,877][09423] Updated weights for policy 0, policy_version 253557 (0.0040) [2024-06-28 15:29:12,921][09190] Fps is (10 sec: 47514.0, 60 sec: 43417.6, 300 sec: 42987.2). Total num frames: 4154277888. Throughput: 0: 42994.4. Samples: 433077420. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 15:29:12,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 15:29:17,273][09423] Updated weights for policy 0, policy_version 253567 (0.0035) [2024-06-28 15:29:17,922][09190] Fps is (10 sec: 44236.4, 60 sec: 43417.5, 300 sec: 42931.6). Total num frames: 4154474496. Throughput: 0: 43329.1. Samples: 433347060. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 15:29:17,922][09190] Avg episode reward: [(0, '0.714')] [2024-06-28 15:29:20,677][09423] Updated weights for policy 0, policy_version 253577 (0.0041) [2024-06-28 15:29:22,921][09190] Fps is (10 sec: 40959.3, 60 sec: 43144.4, 300 sec: 43098.6). Total num frames: 4154687488. Throughput: 0: 43148.4. Samples: 433601680. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 15:29:22,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:29:24,753][09423] Updated weights for policy 0, policy_version 253587 (0.0035) [2024-06-28 15:29:27,921][09190] Fps is (10 sec: 42599.3, 60 sec: 43144.6, 300 sec: 42931.7). Total num frames: 4154900480. Throughput: 0: 43015.7. Samples: 433730200. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 15:29:27,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:29:28,137][09423] Updated weights for policy 0, policy_version 253597 (0.0048) [2024-06-28 15:29:32,354][09423] Updated weights for policy 0, policy_version 253607 (0.0036) [2024-06-28 15:29:32,921][09190] Fps is (10 sec: 42598.5, 60 sec: 43144.4, 300 sec: 42931.6). Total num frames: 4155113472. Throughput: 0: 43046.2. Samples: 433986740. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 15:29:32,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 15:29:36,084][09423] Updated weights for policy 0, policy_version 253617 (0.0032) [2024-06-28 15:29:37,922][09190] Fps is (10 sec: 44235.9, 60 sec: 43417.5, 300 sec: 43098.2). Total num frames: 4155342848. Throughput: 0: 43066.9. Samples: 434242340. Policy #0 lag: (min: 1.0, avg: 9.5, max: 20.0) [2024-06-28 15:29:37,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 15:29:40,058][09423] Updated weights for policy 0, policy_version 253627 (0.0033) [2024-06-28 15:29:42,921][09190] Fps is (10 sec: 42598.6, 60 sec: 42871.5, 300 sec: 42820.6). Total num frames: 4155539456. Throughput: 0: 43165.8. Samples: 434370660. Policy #0 lag: (min: 1.0, avg: 9.5, max: 20.0) [2024-06-28 15:29:42,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 15:29:43,535][09423] Updated weights for policy 0, policy_version 253637 (0.0022) [2024-06-28 15:29:47,909][09423] Updated weights for policy 0, policy_version 253647 (0.0035) [2024-06-28 15:29:47,921][09190] Fps is (10 sec: 40960.4, 60 sec: 42871.5, 300 sec: 42987.2). Total num frames: 4155752448. Throughput: 0: 43191.5. Samples: 434632200. Policy #0 lag: (min: 1.0, avg: 9.5, max: 20.0) [2024-06-28 15:29:47,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:29:51,333][09423] Updated weights for policy 0, policy_version 253657 (0.0025) [2024-06-28 15:29:52,921][09190] Fps is (10 sec: 42598.6, 60 sec: 42871.4, 300 sec: 42987.2). Total num frames: 4155965440. Throughput: 0: 43275.2. Samples: 434892340. Policy #0 lag: (min: 1.0, avg: 9.5, max: 20.0) [2024-06-28 15:29:52,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 15:29:55,277][09423] Updated weights for policy 0, policy_version 253667 (0.0021) [2024-06-28 15:29:57,924][09190] Fps is (10 sec: 42589.4, 60 sec: 43143.0, 300 sec: 42820.2). Total num frames: 4156178432. Throughput: 0: 43282.7. Samples: 435025240. Policy #0 lag: (min: 1.0, avg: 9.5, max: 20.0) [2024-06-28 15:29:57,924][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:29:58,906][09403] Signal inference workers to stop experience collection... (6050 times) [2024-06-28 15:29:58,906][09403] Signal inference workers to resume experience collection... (6050 times) [2024-06-28 15:29:58,922][09423] Updated weights for policy 0, policy_version 253677 (0.0033) [2024-06-28 15:29:58,955][09423] InferenceWorker_p0-w0: stopping experience collection (6050 times) [2024-06-28 15:29:58,955][09423] InferenceWorker_p0-w0: resuming experience collection (6050 times) [2024-06-28 15:30:02,622][09423] Updated weights for policy 0, policy_version 253687 (0.0055) [2024-06-28 15:30:02,921][09190] Fps is (10 sec: 44236.1, 60 sec: 43417.5, 300 sec: 43042.7). Total num frames: 4156407808. Throughput: 0: 43081.4. Samples: 435285720. Policy #0 lag: (min: 1.0, avg: 9.5, max: 20.0) [2024-06-28 15:30:02,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 15:30:06,370][09423] Updated weights for policy 0, policy_version 253697 (0.0030) [2024-06-28 15:30:07,921][09190] Fps is (10 sec: 45884.6, 60 sec: 43417.6, 300 sec: 43098.2). Total num frames: 4156637184. Throughput: 0: 43001.8. Samples: 435536760. Policy #0 lag: (min: 1.0, avg: 9.5, max: 20.0) [2024-06-28 15:30:07,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 15:30:10,517][09423] Updated weights for policy 0, policy_version 253707 (0.0032) [2024-06-28 15:30:12,921][09190] Fps is (10 sec: 42598.7, 60 sec: 42598.3, 300 sec: 42876.1). Total num frames: 4156833792. Throughput: 0: 42985.2. Samples: 435664540. Policy #0 lag: (min: 1.0, avg: 9.5, max: 20.0) [2024-06-28 15:30:12,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 15:30:14,425][09423] Updated weights for policy 0, policy_version 253717 (0.0037) [2024-06-28 15:30:17,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42871.5, 300 sec: 42987.2). Total num frames: 4157046784. Throughput: 0: 43075.6. Samples: 435925140. Policy #0 lag: (min: 1.0, avg: 9.5, max: 20.0) [2024-06-28 15:30:17,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 15:30:18,027][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000253727_4157063168.pth... [2024-06-28 15:30:18,040][09423] Updated weights for policy 0, policy_version 253727 (0.0048) [2024-06-28 15:30:18,076][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000253097_4146741248.pth [2024-06-28 15:30:21,927][09423] Updated weights for policy 0, policy_version 253737 (0.0036) [2024-06-28 15:30:22,921][09190] Fps is (10 sec: 40960.4, 60 sec: 42598.5, 300 sec: 42931.7). Total num frames: 4157243392. Throughput: 0: 42887.8. Samples: 436172280. Policy #0 lag: (min: 1.0, avg: 9.5, max: 20.0) [2024-06-28 15:30:22,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 15:30:25,853][09423] Updated weights for policy 0, policy_version 253747 (0.0041) [2024-06-28 15:30:27,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42871.4, 300 sec: 42876.1). Total num frames: 4157472768. Throughput: 0: 42831.5. Samples: 436298080. Policy #0 lag: (min: 1.0, avg: 9.5, max: 20.0) [2024-06-28 15:30:27,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 15:30:29,402][09423] Updated weights for policy 0, policy_version 253757 (0.0031) [2024-06-28 15:30:32,921][09190] Fps is (10 sec: 44236.0, 60 sec: 42871.4, 300 sec: 42931.6). Total num frames: 4157685760. Throughput: 0: 42900.8. Samples: 436562740. Policy #0 lag: (min: 1.0, avg: 9.5, max: 20.0) [2024-06-28 15:30:32,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:30:33,259][09423] Updated weights for policy 0, policy_version 253767 (0.0027) [2024-06-28 15:30:37,298][09423] Updated weights for policy 0, policy_version 253777 (0.0034) [2024-06-28 15:30:37,921][09190] Fps is (10 sec: 42598.8, 60 sec: 42598.5, 300 sec: 42931.6). Total num frames: 4157898752. Throughput: 0: 42778.7. Samples: 436817380. Policy #0 lag: (min: 1.0, avg: 9.5, max: 20.0) [2024-06-28 15:30:37,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 15:30:40,919][09423] Updated weights for policy 0, policy_version 253787 (0.0031) [2024-06-28 15:30:42,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42871.4, 300 sec: 42931.6). Total num frames: 4158111744. Throughput: 0: 42770.9. Samples: 436949840. Policy #0 lag: (min: 1.0, avg: 9.5, max: 20.0) [2024-06-28 15:30:42,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 15:30:44,679][09423] Updated weights for policy 0, policy_version 253797 (0.0032) [2024-06-28 15:30:47,921][09190] Fps is (10 sec: 42598.2, 60 sec: 42871.5, 300 sec: 43098.2). Total num frames: 4158324736. Throughput: 0: 42699.2. Samples: 437207180. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 15:30:47,930][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 15:30:48,624][09423] Updated weights for policy 0, policy_version 253807 (0.0035) [2024-06-28 15:30:52,453][09423] Updated weights for policy 0, policy_version 253817 (0.0026) [2024-06-28 15:30:52,921][09190] Fps is (10 sec: 44237.1, 60 sec: 43144.5, 300 sec: 42931.7). Total num frames: 4158554112. Throughput: 0: 42839.6. Samples: 437464540. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 15:30:52,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:30:56,204][09423] Updated weights for policy 0, policy_version 253827 (0.0033) [2024-06-28 15:30:57,921][09190] Fps is (10 sec: 42598.2, 60 sec: 42873.0, 300 sec: 42876.1). Total num frames: 4158750720. Throughput: 0: 42913.3. Samples: 437595640. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 15:30:57,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 15:31:00,202][09423] Updated weights for policy 0, policy_version 253837 (0.0030) [2024-06-28 15:31:02,921][09190] Fps is (10 sec: 40960.1, 60 sec: 42598.5, 300 sec: 42987.2). Total num frames: 4158963712. Throughput: 0: 42733.8. Samples: 437848160. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 15:31:02,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:31:03,738][09423] Updated weights for policy 0, policy_version 253847 (0.0038) [2024-06-28 15:31:07,919][09423] Updated weights for policy 0, policy_version 253857 (0.0032) [2024-06-28 15:31:07,921][09190] Fps is (10 sec: 44237.4, 60 sec: 42598.5, 300 sec: 42876.1). Total num frames: 4159193088. Throughput: 0: 42993.3. Samples: 438106980. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 15:31:07,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 15:31:11,192][09423] Updated weights for policy 0, policy_version 253867 (0.0031) [2024-06-28 15:31:12,921][09190] Fps is (10 sec: 44236.4, 60 sec: 42871.4, 300 sec: 42931.6). Total num frames: 4159406080. Throughput: 0: 43148.4. Samples: 438239760. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 15:31:12,924][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:31:15,278][09423] Updated weights for policy 0, policy_version 253877 (0.0034) [2024-06-28 15:31:17,921][09190] Fps is (10 sec: 42597.8, 60 sec: 42871.5, 300 sec: 43042.7). Total num frames: 4159619072. Throughput: 0: 42952.9. Samples: 438495620. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 15:31:17,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 15:31:18,951][09423] Updated weights for policy 0, policy_version 253887 (0.0025) [2024-06-28 15:31:22,619][09423] Updated weights for policy 0, policy_version 253897 (0.0034) [2024-06-28 15:31:22,921][09190] Fps is (10 sec: 44237.5, 60 sec: 43417.6, 300 sec: 42931.6). Total num frames: 4159848448. Throughput: 0: 42901.8. Samples: 438747960. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 15:31:22,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:31:26,373][09423] Updated weights for policy 0, policy_version 253907 (0.0044) [2024-06-28 15:31:27,921][09190] Fps is (10 sec: 44237.2, 60 sec: 43144.6, 300 sec: 42987.2). Total num frames: 4160061440. Throughput: 0: 43038.3. Samples: 438886560. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 15:31:27,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 15:31:30,577][09423] Updated weights for policy 0, policy_version 253917 (0.0030) [2024-06-28 15:31:32,921][09190] Fps is (10 sec: 40959.7, 60 sec: 42871.5, 300 sec: 42987.2). Total num frames: 4160258048. Throughput: 0: 42987.1. Samples: 439141600. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 15:31:32,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 15:31:33,156][09403] Signal inference workers to stop experience collection... (6100 times) [2024-06-28 15:31:33,156][09403] Signal inference workers to resume experience collection... (6100 times) [2024-06-28 15:31:33,183][09423] InferenceWorker_p0-w0: stopping experience collection (6100 times) [2024-06-28 15:31:33,183][09423] InferenceWorker_p0-w0: resuming experience collection (6100 times) [2024-06-28 15:31:34,210][09423] Updated weights for policy 0, policy_version 253927 (0.0038) [2024-06-28 15:31:37,921][09190] Fps is (10 sec: 42598.1, 60 sec: 43144.5, 300 sec: 42931.6). Total num frames: 4160487424. Throughput: 0: 42993.3. Samples: 439399240. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 15:31:37,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 15:31:38,005][09423] Updated weights for policy 0, policy_version 253937 (0.0032) [2024-06-28 15:31:41,787][09423] Updated weights for policy 0, policy_version 253947 (0.0038) [2024-06-28 15:31:42,921][09190] Fps is (10 sec: 44236.7, 60 sec: 43144.6, 300 sec: 42932.0). Total num frames: 4160700416. Throughput: 0: 43088.0. Samples: 439534600. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 15:31:42,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 15:31:45,570][09423] Updated weights for policy 0, policy_version 253957 (0.0033) [2024-06-28 15:31:47,921][09190] Fps is (10 sec: 42598.4, 60 sec: 43144.5, 300 sec: 43043.2). Total num frames: 4160913408. Throughput: 0: 43216.0. Samples: 439792880. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 15:31:47,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:31:49,247][09423] Updated weights for policy 0, policy_version 253967 (0.0027) [2024-06-28 15:31:52,921][09190] Fps is (10 sec: 44237.0, 60 sec: 43144.5, 300 sec: 42931.7). Total num frames: 4161142784. Throughput: 0: 43161.3. Samples: 440049240. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 15:31:52,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 15:31:53,379][09423] Updated weights for policy 0, policy_version 253977 (0.0026) [2024-06-28 15:31:56,828][09423] Updated weights for policy 0, policy_version 253987 (0.0044) [2024-06-28 15:31:57,921][09190] Fps is (10 sec: 45875.3, 60 sec: 43690.7, 300 sec: 43153.8). Total num frames: 4161372160. Throughput: 0: 43117.4. Samples: 440180040. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 15:31:57,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 15:32:00,738][09423] Updated weights for policy 0, policy_version 253997 (0.0036) [2024-06-28 15:32:02,921][09190] Fps is (10 sec: 42598.4, 60 sec: 43417.6, 300 sec: 43042.7). Total num frames: 4161568768. Throughput: 0: 43050.3. Samples: 440432880. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 15:32:02,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 15:32:04,767][09423] Updated weights for policy 0, policy_version 254007 (0.0035) [2024-06-28 15:32:07,921][09190] Fps is (10 sec: 40960.4, 60 sec: 43144.5, 300 sec: 42931.6). Total num frames: 4161781760. Throughput: 0: 43186.2. Samples: 440691340. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 15:32:07,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 15:32:08,483][09423] Updated weights for policy 0, policy_version 254017 (0.0039) [2024-06-28 15:32:12,309][09423] Updated weights for policy 0, policy_version 254027 (0.0028) [2024-06-28 15:32:12,921][09190] Fps is (10 sec: 44236.8, 60 sec: 43417.7, 300 sec: 43042.7). Total num frames: 4162011136. Throughput: 0: 43147.1. Samples: 440828180. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 15:32:12,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 15:32:16,086][09423] Updated weights for policy 0, policy_version 254037 (0.0037) [2024-06-28 15:32:17,921][09190] Fps is (10 sec: 42598.0, 60 sec: 43144.6, 300 sec: 42987.2). Total num frames: 4162207744. Throughput: 0: 43036.4. Samples: 441078240. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 15:32:17,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 15:32:17,934][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000254041_4162207744.pth... [2024-06-28 15:32:17,982][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000253411_4151885824.pth [2024-06-28 15:32:19,991][09423] Updated weights for policy 0, policy_version 254047 (0.0031) [2024-06-28 15:32:22,922][09190] Fps is (10 sec: 39321.0, 60 sec: 42598.2, 300 sec: 42876.1). Total num frames: 4162404352. Throughput: 0: 43130.6. Samples: 441340120. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 15:32:22,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 15:32:23,836][09423] Updated weights for policy 0, policy_version 254057 (0.0041) [2024-06-28 15:32:27,719][09423] Updated weights for policy 0, policy_version 254067 (0.0050) [2024-06-28 15:32:27,921][09190] Fps is (10 sec: 44237.0, 60 sec: 43144.5, 300 sec: 43042.7). Total num frames: 4162650112. Throughput: 0: 42893.4. Samples: 441464800. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 15:32:27,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:32:31,326][09423] Updated weights for policy 0, policy_version 254077 (0.0036) [2024-06-28 15:32:32,921][09190] Fps is (10 sec: 44237.6, 60 sec: 43144.5, 300 sec: 42931.6). Total num frames: 4162846720. Throughput: 0: 42911.2. Samples: 441723880. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 15:32:32,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 15:32:35,149][09423] Updated weights for policy 0, policy_version 254087 (0.0031) [2024-06-28 15:32:37,921][09190] Fps is (10 sec: 40959.7, 60 sec: 42871.5, 300 sec: 42931.8). Total num frames: 4163059712. Throughput: 0: 43044.4. Samples: 441986240. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 15:32:37,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 15:32:38,804][09423] Updated weights for policy 0, policy_version 254097 (0.0029) [2024-06-28 15:32:42,698][09423] Updated weights for policy 0, policy_version 254107 (0.0042) [2024-06-28 15:32:42,925][09190] Fps is (10 sec: 44220.7, 60 sec: 43141.9, 300 sec: 43097.7). Total num frames: 4163289088. Throughput: 0: 43010.3. Samples: 442115660. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 15:32:42,926][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 15:32:46,317][09423] Updated weights for policy 0, policy_version 254117 (0.0021) [2024-06-28 15:32:47,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42871.5, 300 sec: 42987.2). Total num frames: 4163485696. Throughput: 0: 43036.8. Samples: 442369540. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 15:32:47,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 15:32:50,537][09423] Updated weights for policy 0, policy_version 254127 (0.0033) [2024-06-28 15:32:52,921][09190] Fps is (10 sec: 40975.0, 60 sec: 42598.4, 300 sec: 42876.1). Total num frames: 4163698688. Throughput: 0: 43032.4. Samples: 442627800. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 15:32:52,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 15:32:54,092][09423] Updated weights for policy 0, policy_version 254137 (0.0037) [2024-06-28 15:32:57,921][09190] Fps is (10 sec: 44237.2, 60 sec: 42598.4, 300 sec: 42987.2). Total num frames: 4163928064. Throughput: 0: 42868.9. Samples: 442757280. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 15:32:57,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 15:32:58,015][09423] Updated weights for policy 0, policy_version 254147 (0.0031) [2024-06-28 15:33:01,889][09423] Updated weights for policy 0, policy_version 254157 (0.0040) [2024-06-28 15:33:02,921][09190] Fps is (10 sec: 44236.8, 60 sec: 42871.5, 300 sec: 42987.2). Total num frames: 4164141056. Throughput: 0: 43005.8. Samples: 443013500. Policy #0 lag: (min: 0.0, avg: 10.0, max: 22.0) [2024-06-28 15:33:02,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 15:33:05,846][09423] Updated weights for policy 0, policy_version 254167 (0.0031) [2024-06-28 15:33:07,921][09190] Fps is (10 sec: 42598.0, 60 sec: 42871.4, 300 sec: 42987.2). Total num frames: 4164354048. Throughput: 0: 43014.3. Samples: 443275760. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 15:33:07,924][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 15:33:09,400][09423] Updated weights for policy 0, policy_version 254177 (0.0042) [2024-06-28 15:33:12,921][09190] Fps is (10 sec: 42598.2, 60 sec: 42598.4, 300 sec: 43042.7). Total num frames: 4164567040. Throughput: 0: 43166.2. Samples: 443407280. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 15:33:12,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 15:33:13,318][09423] Updated weights for policy 0, policy_version 254187 (0.0031) [2024-06-28 15:33:16,949][09423] Updated weights for policy 0, policy_version 254197 (0.0045) [2024-06-28 15:33:17,921][09190] Fps is (10 sec: 42598.8, 60 sec: 42871.5, 300 sec: 42987.2). Total num frames: 4164780032. Throughput: 0: 43046.7. Samples: 443660980. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 15:33:17,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 15:33:20,927][09423] Updated weights for policy 0, policy_version 254207 (0.0035) [2024-06-28 15:33:22,921][09190] Fps is (10 sec: 44236.9, 60 sec: 43417.7, 300 sec: 43042.7). Total num frames: 4165009408. Throughput: 0: 42857.9. Samples: 443914840. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 15:33:22,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 15:33:24,369][09423] Updated weights for policy 0, policy_version 254217 (0.0033) [2024-06-28 15:33:27,922][09190] Fps is (10 sec: 42597.5, 60 sec: 42598.3, 300 sec: 42987.1). Total num frames: 4165206016. Throughput: 0: 42980.2. Samples: 444049620. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 15:33:27,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 15:33:28,419][09403] Signal inference workers to stop experience collection... (6150 times) [2024-06-28 15:33:28,419][09403] Signal inference workers to resume experience collection... (6150 times) [2024-06-28 15:33:28,431][09423] InferenceWorker_p0-w0: stopping experience collection (6150 times) [2024-06-28 15:33:28,431][09423] InferenceWorker_p0-w0: resuming experience collection (6150 times) [2024-06-28 15:33:28,558][09423] Updated weights for policy 0, policy_version 254227 (0.0031) [2024-06-28 15:33:31,919][09423] Updated weights for policy 0, policy_version 254237 (0.0026) [2024-06-28 15:33:32,921][09190] Fps is (10 sec: 42598.3, 60 sec: 43144.5, 300 sec: 43042.7). Total num frames: 4165435392. Throughput: 0: 42986.7. Samples: 444303940. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 15:33:32,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 15:33:36,218][09423] Updated weights for policy 0, policy_version 254247 (0.0029) [2024-06-28 15:33:37,922][09190] Fps is (10 sec: 45875.5, 60 sec: 43417.5, 300 sec: 43042.7). Total num frames: 4165664768. Throughput: 0: 43021.6. Samples: 444563780. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 15:33:37,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 15:33:40,158][09423] Updated weights for policy 0, policy_version 254257 (0.0033) [2024-06-28 15:33:42,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42874.1, 300 sec: 42987.2). Total num frames: 4165861376. Throughput: 0: 43082.7. Samples: 444696000. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 15:33:42,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 15:33:43,725][09423] Updated weights for policy 0, policy_version 254267 (0.0030) [2024-06-28 15:33:47,571][09423] Updated weights for policy 0, policy_version 254277 (0.0023) [2024-06-28 15:33:47,928][09190] Fps is (10 sec: 40933.8, 60 sec: 43139.9, 300 sec: 42986.2). Total num frames: 4166074368. Throughput: 0: 43177.2. Samples: 444956760. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 15:33:47,929][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 15:33:51,513][09423] Updated weights for policy 0, policy_version 254287 (0.0039) [2024-06-28 15:33:52,921][09190] Fps is (10 sec: 45874.8, 60 sec: 43690.6, 300 sec: 43153.8). Total num frames: 4166320128. Throughput: 0: 42948.4. Samples: 445208440. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 15:33:52,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 15:33:55,014][09423] Updated weights for policy 0, policy_version 254297 (0.0036) [2024-06-28 15:33:57,921][09190] Fps is (10 sec: 42626.2, 60 sec: 42871.4, 300 sec: 43042.7). Total num frames: 4166500352. Throughput: 0: 43000.0. Samples: 445342280. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 15:33:57,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 15:33:59,099][09423] Updated weights for policy 0, policy_version 254307 (0.0022) [2024-06-28 15:34:02,455][09423] Updated weights for policy 0, policy_version 254317 (0.0028) [2024-06-28 15:34:02,921][09190] Fps is (10 sec: 40960.1, 60 sec: 43144.5, 300 sec: 43042.7). Total num frames: 4166729728. Throughput: 0: 43164.8. Samples: 445603400. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 15:34:02,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 15:34:06,652][09423] Updated weights for policy 0, policy_version 254327 (0.0038) [2024-06-28 15:34:07,921][09190] Fps is (10 sec: 44237.1, 60 sec: 43144.6, 300 sec: 42931.6). Total num frames: 4166942720. Throughput: 0: 43232.0. Samples: 445860280. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 15:34:07,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 15:34:09,957][09423] Updated weights for policy 0, policy_version 254337 (0.0034) [2024-06-28 15:34:12,922][09190] Fps is (10 sec: 42597.6, 60 sec: 43144.4, 300 sec: 42987.2). Total num frames: 4167155712. Throughput: 0: 42964.9. Samples: 445983040. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 15:34:12,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 15:34:14,130][09423] Updated weights for policy 0, policy_version 254347 (0.0037) [2024-06-28 15:34:17,921][09190] Fps is (10 sec: 42598.3, 60 sec: 43144.5, 300 sec: 42987.2). Total num frames: 4167368704. Throughput: 0: 43094.2. Samples: 446243180. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 15:34:17,922][09190] Avg episode reward: [(0, '0.726')] [2024-06-28 15:34:17,933][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000254357_4167385088.pth... [2024-06-28 15:34:17,936][09423] Updated weights for policy 0, policy_version 254357 (0.0036) [2024-06-28 15:34:17,980][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000253727_4157063168.pth [2024-06-28 15:34:21,921][09423] Updated weights for policy 0, policy_version 254367 (0.0029) [2024-06-28 15:34:22,921][09190] Fps is (10 sec: 42599.4, 60 sec: 42871.5, 300 sec: 42987.2). Total num frames: 4167581696. Throughput: 0: 43035.7. Samples: 446500380. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 15:34:22,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 15:34:25,670][09423] Updated weights for policy 0, policy_version 254377 (0.0022) [2024-06-28 15:34:27,922][09190] Fps is (10 sec: 44234.0, 60 sec: 43417.3, 300 sec: 43042.6). Total num frames: 4167811072. Throughput: 0: 42871.4. Samples: 446625240. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 15:34:27,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 15:34:29,549][09423] Updated weights for policy 0, policy_version 254387 (0.0041) [2024-06-28 15:34:32,921][09190] Fps is (10 sec: 42598.7, 60 sec: 42871.5, 300 sec: 42931.7). Total num frames: 4168007680. Throughput: 0: 42820.5. Samples: 446883400. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 15:34:32,921][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 15:34:33,140][09423] Updated weights for policy 0, policy_version 254397 (0.0029) [2024-06-28 15:34:37,066][09423] Updated weights for policy 0, policy_version 254407 (0.0026) [2024-06-28 15:34:37,921][09190] Fps is (10 sec: 40962.8, 60 sec: 42598.5, 300 sec: 42987.2). Total num frames: 4168220672. Throughput: 0: 43033.5. Samples: 447144940. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 15:34:37,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 15:34:40,844][09423] Updated weights for policy 0, policy_version 254417 (0.0035) [2024-06-28 15:34:42,921][09190] Fps is (10 sec: 44236.2, 60 sec: 43144.5, 300 sec: 43042.7). Total num frames: 4168450048. Throughput: 0: 42908.4. Samples: 447273160. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 15:34:42,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 15:34:44,953][09423] Updated weights for policy 0, policy_version 254427 (0.0036) [2024-06-28 15:34:47,921][09190] Fps is (10 sec: 44236.5, 60 sec: 43149.2, 300 sec: 43042.7). Total num frames: 4168663040. Throughput: 0: 42756.5. Samples: 447527440. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 15:34:47,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:34:48,281][09423] Updated weights for policy 0, policy_version 254437 (0.0043) [2024-06-28 15:34:52,487][09423] Updated weights for policy 0, policy_version 254447 (0.0036) [2024-06-28 15:34:52,921][09190] Fps is (10 sec: 42598.6, 60 sec: 42598.4, 300 sec: 43043.0). Total num frames: 4168876032. Throughput: 0: 42783.0. Samples: 447785520. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 15:34:52,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 15:34:56,309][09423] Updated weights for policy 0, policy_version 254457 (0.0041) [2024-06-28 15:34:57,921][09190] Fps is (10 sec: 40960.1, 60 sec: 42871.5, 300 sec: 42931.7). Total num frames: 4169072640. Throughput: 0: 42898.0. Samples: 447913440. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 15:34:57,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 15:35:00,120][09423] Updated weights for policy 0, policy_version 254467 (0.0036) [2024-06-28 15:35:02,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42871.5, 300 sec: 42931.6). Total num frames: 4169302016. Throughput: 0: 42775.5. Samples: 448168080. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 15:35:02,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 15:35:03,966][09423] Updated weights for policy 0, policy_version 254477 (0.0025) [2024-06-28 15:35:07,792][09423] Updated weights for policy 0, policy_version 254487 (0.0035) [2024-06-28 15:35:07,924][09190] Fps is (10 sec: 44225.6, 60 sec: 42869.6, 300 sec: 42986.8). Total num frames: 4169515008. Throughput: 0: 42955.4. Samples: 448433480. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 15:35:07,924][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 15:35:08,498][09403] Signal inference workers to stop experience collection... (6200 times) [2024-06-28 15:35:08,528][09423] InferenceWorker_p0-w0: stopping experience collection (6200 times) [2024-06-28 15:35:08,555][09403] Signal inference workers to resume experience collection... (6200 times) [2024-06-28 15:35:08,556][09423] InferenceWorker_p0-w0: resuming experience collection (6200 times) [2024-06-28 15:35:11,796][09423] Updated weights for policy 0, policy_version 254497 (0.0036) [2024-06-28 15:35:12,922][09190] Fps is (10 sec: 40959.6, 60 sec: 42598.5, 300 sec: 42931.6). Total num frames: 4169711616. Throughput: 0: 43036.9. Samples: 448561880. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 15:35:12,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 15:35:15,278][09423] Updated weights for policy 0, policy_version 254507 (0.0029) [2024-06-28 15:35:17,921][09190] Fps is (10 sec: 42609.1, 60 sec: 42871.4, 300 sec: 43042.7). Total num frames: 4169940992. Throughput: 0: 43027.9. Samples: 448819660. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 15:35:17,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 15:35:19,198][09423] Updated weights for policy 0, policy_version 254517 (0.0040) [2024-06-28 15:35:22,857][09423] Updated weights for policy 0, policy_version 254527 (0.0034) [2024-06-28 15:35:22,921][09190] Fps is (10 sec: 45875.6, 60 sec: 43144.5, 300 sec: 43042.7). Total num frames: 4170170368. Throughput: 0: 42938.6. Samples: 449077180. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 15:35:22,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 15:35:26,506][09423] Updated weights for policy 0, policy_version 254537 (0.0041) [2024-06-28 15:35:27,921][09190] Fps is (10 sec: 40959.6, 60 sec: 42325.7, 300 sec: 42931.6). Total num frames: 4170350592. Throughput: 0: 42972.8. Samples: 449206940. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 15:35:27,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:35:30,286][09423] Updated weights for policy 0, policy_version 254547 (0.0024) [2024-06-28 15:35:32,921][09190] Fps is (10 sec: 42598.3, 60 sec: 43144.4, 300 sec: 43042.7). Total num frames: 4170596352. Throughput: 0: 43067.0. Samples: 449465460. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2024-06-28 15:35:32,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 15:35:34,473][09423] Updated weights for policy 0, policy_version 254557 (0.0026) [2024-06-28 15:35:37,715][09423] Updated weights for policy 0, policy_version 254567 (0.0036) [2024-06-28 15:35:37,921][09190] Fps is (10 sec: 47513.9, 60 sec: 43417.5, 300 sec: 43098.3). Total num frames: 4170825728. Throughput: 0: 42998.2. Samples: 449720440. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2024-06-28 15:35:37,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 15:35:41,813][09423] Updated weights for policy 0, policy_version 254577 (0.0038) [2024-06-28 15:35:42,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42598.4, 300 sec: 42987.2). Total num frames: 4171005952. Throughput: 0: 43047.9. Samples: 449850600. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2024-06-28 15:35:42,923][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 15:35:45,561][09423] Updated weights for policy 0, policy_version 254587 (0.0032) [2024-06-28 15:35:47,921][09190] Fps is (10 sec: 42598.4, 60 sec: 43144.5, 300 sec: 43042.7). Total num frames: 4171251712. Throughput: 0: 43277.7. Samples: 450115580. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2024-06-28 15:35:47,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 15:35:49,282][09423] Updated weights for policy 0, policy_version 254597 (0.0029) [2024-06-28 15:35:52,921][09190] Fps is (10 sec: 45875.5, 60 sec: 43144.5, 300 sec: 43098.3). Total num frames: 4171464704. Throughput: 0: 43230.0. Samples: 450378720. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2024-06-28 15:35:52,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 15:35:52,973][09423] Updated weights for policy 0, policy_version 254607 (0.0036) [2024-06-28 15:35:57,171][09423] Updated weights for policy 0, policy_version 254617 (0.0030) [2024-06-28 15:35:57,924][09190] Fps is (10 sec: 40949.9, 60 sec: 43142.7, 300 sec: 43042.3). Total num frames: 4171661312. Throughput: 0: 43219.9. Samples: 450506880. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2024-06-28 15:35:57,924][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 15:36:00,413][09423] Updated weights for policy 0, policy_version 254627 (0.0031) [2024-06-28 15:36:02,921][09190] Fps is (10 sec: 42598.4, 60 sec: 43144.6, 300 sec: 43042.7). Total num frames: 4171890688. Throughput: 0: 43112.0. Samples: 450759700. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2024-06-28 15:36:02,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 15:36:04,480][09423] Updated weights for policy 0, policy_version 254637 (0.0045) [2024-06-28 15:36:07,921][09190] Fps is (10 sec: 45886.3, 60 sec: 43419.3, 300 sec: 43098.2). Total num frames: 4172120064. Throughput: 0: 43111.1. Samples: 451017180. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2024-06-28 15:36:07,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 15:36:08,194][09423] Updated weights for policy 0, policy_version 254647 (0.0035) [2024-06-28 15:36:12,494][09423] Updated weights for policy 0, policy_version 254657 (0.0029) [2024-06-28 15:36:12,922][09190] Fps is (10 sec: 42597.7, 60 sec: 43417.6, 300 sec: 43042.7). Total num frames: 4172316672. Throughput: 0: 43228.4. Samples: 451152220. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2024-06-28 15:36:12,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 15:36:15,812][09423] Updated weights for policy 0, policy_version 254667 (0.0030) [2024-06-28 15:36:17,921][09190] Fps is (10 sec: 42598.7, 60 sec: 43417.6, 300 sec: 43042.7). Total num frames: 4172546048. Throughput: 0: 43212.0. Samples: 451410000. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2024-06-28 15:36:17,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 15:36:17,934][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000254672_4172546048.pth... [2024-06-28 15:36:17,985][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000254041_4162207744.pth [2024-06-28 15:36:20,258][09423] Updated weights for policy 0, policy_version 254677 (0.0026) [2024-06-28 15:36:22,921][09190] Fps is (10 sec: 44237.6, 60 sec: 43144.6, 300 sec: 43042.7). Total num frames: 4172759040. Throughput: 0: 43265.9. Samples: 451667400. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2024-06-28 15:36:22,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 15:36:23,338][09423] Updated weights for policy 0, policy_version 254687 (0.0035) [2024-06-28 15:36:27,662][09423] Updated weights for policy 0, policy_version 254697 (0.0045) [2024-06-28 15:36:27,921][09190] Fps is (10 sec: 40960.3, 60 sec: 43417.7, 300 sec: 43042.7). Total num frames: 4172955648. Throughput: 0: 43224.5. Samples: 451795700. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2024-06-28 15:36:27,922][09190] Avg episode reward: [(0, '0.731')] [2024-06-28 15:36:30,888][09423] Updated weights for policy 0, policy_version 254707 (0.0038) [2024-06-28 15:36:32,924][09190] Fps is (10 sec: 44225.4, 60 sec: 43415.8, 300 sec: 43097.9). Total num frames: 4173201408. Throughput: 0: 43192.3. Samples: 452059340. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2024-06-28 15:36:32,925][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 15:36:35,006][09423] Updated weights for policy 0, policy_version 254717 (0.0031) [2024-06-28 15:36:37,921][09190] Fps is (10 sec: 45874.7, 60 sec: 43144.5, 300 sec: 43098.2). Total num frames: 4173414400. Throughput: 0: 43043.0. Samples: 452315660. Policy #0 lag: (min: 0.0, avg: 8.4, max: 22.0) [2024-06-28 15:36:37,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 15:36:38,429][09423] Updated weights for policy 0, policy_version 254727 (0.0034) [2024-06-28 15:36:42,921][09190] Fps is (10 sec: 39331.7, 60 sec: 43144.6, 300 sec: 42987.2). Total num frames: 4173594624. Throughput: 0: 43070.5. Samples: 452444940. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 15:36:42,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 15:36:43,000][09423] Updated weights for policy 0, policy_version 254737 (0.0031) [2024-06-28 15:36:44,610][09403] Signal inference workers to stop experience collection... (6250 times) [2024-06-28 15:36:44,659][09423] InferenceWorker_p0-w0: stopping experience collection (6250 times) [2024-06-28 15:36:44,666][09403] Signal inference workers to resume experience collection... (6250 times) [2024-06-28 15:36:44,674][09423] InferenceWorker_p0-w0: resuming experience collection (6250 times) [2024-06-28 15:36:45,866][09423] Updated weights for policy 0, policy_version 254747 (0.0026) [2024-06-28 15:36:47,921][09190] Fps is (10 sec: 42599.0, 60 sec: 43144.6, 300 sec: 43042.7). Total num frames: 4173840384. Throughput: 0: 43210.7. Samples: 452704180. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 15:36:47,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 15:36:50,322][09423] Updated weights for policy 0, policy_version 254757 (0.0044) [2024-06-28 15:36:52,921][09190] Fps is (10 sec: 45875.4, 60 sec: 43144.6, 300 sec: 42987.2). Total num frames: 4174053376. Throughput: 0: 43307.3. Samples: 452966000. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 15:36:52,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 15:36:53,443][09423] Updated weights for policy 0, policy_version 254767 (0.0039) [2024-06-28 15:36:57,703][09423] Updated weights for policy 0, policy_version 254777 (0.0032) [2024-06-28 15:36:57,921][09190] Fps is (10 sec: 42598.2, 60 sec: 43419.4, 300 sec: 43042.7). Total num frames: 4174266368. Throughput: 0: 43127.3. Samples: 453092940. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 15:36:57,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 15:37:01,495][09423] Updated weights for policy 0, policy_version 254787 (0.0042) [2024-06-28 15:37:02,921][09190] Fps is (10 sec: 44237.1, 60 sec: 43417.7, 300 sec: 43098.3). Total num frames: 4174495744. Throughput: 0: 43046.4. Samples: 453347080. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 15:37:02,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 15:37:05,572][09423] Updated weights for policy 0, policy_version 254797 (0.0036) [2024-06-28 15:37:07,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42871.5, 300 sec: 42987.2). Total num frames: 4174692352. Throughput: 0: 43192.4. Samples: 453611060. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 15:37:07,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 15:37:09,154][09423] Updated weights for policy 0, policy_version 254807 (0.0035) [2024-06-28 15:37:12,921][09190] Fps is (10 sec: 39321.1, 60 sec: 42871.6, 300 sec: 42987.2). Total num frames: 4174888960. Throughput: 0: 43219.1. Samples: 453740560. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 15:37:12,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 15:37:13,203][09423] Updated weights for policy 0, policy_version 254817 (0.0025) [2024-06-28 15:37:16,718][09423] Updated weights for policy 0, policy_version 254827 (0.0036) [2024-06-28 15:37:17,921][09190] Fps is (10 sec: 42598.6, 60 sec: 42871.5, 300 sec: 43098.3). Total num frames: 4175118336. Throughput: 0: 43057.6. Samples: 453996820. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 15:37:17,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 15:37:20,593][09423] Updated weights for policy 0, policy_version 254837 (0.0038) [2024-06-28 15:37:22,921][09190] Fps is (10 sec: 44236.8, 60 sec: 42871.5, 300 sec: 42987.2). Total num frames: 4175331328. Throughput: 0: 43232.5. Samples: 454261120. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 15:37:22,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:37:24,276][09423] Updated weights for policy 0, policy_version 254847 (0.0026) [2024-06-28 15:37:27,921][09190] Fps is (10 sec: 44236.8, 60 sec: 43417.6, 300 sec: 43098.3). Total num frames: 4175560704. Throughput: 0: 43201.8. Samples: 454389020. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 15:37:27,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 15:37:28,024][09423] Updated weights for policy 0, policy_version 254857 (0.0034) [2024-06-28 15:37:31,877][09423] Updated weights for policy 0, policy_version 254867 (0.0046) [2024-06-28 15:37:32,922][09190] Fps is (10 sec: 44235.9, 60 sec: 42873.2, 300 sec: 43098.2). Total num frames: 4175773696. Throughput: 0: 43068.2. Samples: 454642260. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 15:37:32,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 15:37:35,917][09423] Updated weights for policy 0, policy_version 254877 (0.0043) [2024-06-28 15:37:37,924][09190] Fps is (10 sec: 44225.6, 60 sec: 43142.8, 300 sec: 43098.4). Total num frames: 4176003072. Throughput: 0: 43057.5. Samples: 454903700. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 15:37:37,933][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 15:37:39,542][09423] Updated weights for policy 0, policy_version 254887 (0.0031) [2024-06-28 15:37:42,921][09190] Fps is (10 sec: 42599.3, 60 sec: 43417.6, 300 sec: 43098.3). Total num frames: 4176199680. Throughput: 0: 43084.0. Samples: 455031720. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 15:37:42,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 15:37:43,355][09423] Updated weights for policy 0, policy_version 254897 (0.0027) [2024-06-28 15:37:47,212][09423] Updated weights for policy 0, policy_version 254907 (0.0040) [2024-06-28 15:37:47,921][09190] Fps is (10 sec: 40969.9, 60 sec: 42871.4, 300 sec: 43098.2). Total num frames: 4176412672. Throughput: 0: 43193.6. Samples: 455290800. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 15:37:47,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 15:37:51,155][09423] Updated weights for policy 0, policy_version 254917 (0.0037) [2024-06-28 15:37:52,921][09190] Fps is (10 sec: 44236.7, 60 sec: 43144.5, 300 sec: 43098.2). Total num frames: 4176642048. Throughput: 0: 43017.4. Samples: 455546840. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 15:37:52,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 15:37:54,793][09423] Updated weights for policy 0, policy_version 254927 (0.0040) [2024-06-28 15:37:57,921][09190] Fps is (10 sec: 40960.1, 60 sec: 42598.4, 300 sec: 42987.2). Total num frames: 4176822272. Throughput: 0: 43068.8. Samples: 455678660. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 15:37:57,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 15:37:58,724][09423] Updated weights for policy 0, policy_version 254937 (0.0036) [2024-06-28 15:38:02,381][09423] Updated weights for policy 0, policy_version 254947 (0.0028) [2024-06-28 15:38:02,922][09190] Fps is (10 sec: 42597.1, 60 sec: 42871.1, 300 sec: 43098.2). Total num frames: 4177068032. Throughput: 0: 42951.7. Samples: 455929660. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 15:38:02,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:38:06,280][09423] Updated weights for policy 0, policy_version 254957 (0.0032) [2024-06-28 15:38:07,921][09190] Fps is (10 sec: 44236.9, 60 sec: 42871.5, 300 sec: 43042.7). Total num frames: 4177264640. Throughput: 0: 42859.1. Samples: 456189780. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 15:38:07,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 15:38:10,271][09423] Updated weights for policy 0, policy_version 254967 (0.0044) [2024-06-28 15:38:12,921][09190] Fps is (10 sec: 40960.9, 60 sec: 43144.5, 300 sec: 43042.7). Total num frames: 4177477632. Throughput: 0: 42698.1. Samples: 456310440. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 15:38:12,923][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:38:14,487][09423] Updated weights for policy 0, policy_version 254977 (0.0021) [2024-06-28 15:38:17,660][09423] Updated weights for policy 0, policy_version 254987 (0.0032) [2024-06-28 15:38:17,921][09190] Fps is (10 sec: 45875.5, 60 sec: 43417.6, 300 sec: 43098.3). Total num frames: 4177723392. Throughput: 0: 42834.4. Samples: 456569800. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 15:38:17,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:38:17,929][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000254988_4177723392.pth... [2024-06-28 15:38:17,978][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000254357_4167385088.pth [2024-06-28 15:38:21,915][09403] Signal inference workers to stop experience collection... (6300 times) [2024-06-28 15:38:21,915][09403] Signal inference workers to resume experience collection... (6300 times) [2024-06-28 15:38:21,930][09423] InferenceWorker_p0-w0: stopping experience collection (6300 times) [2024-06-28 15:38:21,930][09423] InferenceWorker_p0-w0: resuming experience collection (6300 times) [2024-06-28 15:38:22,070][09423] Updated weights for policy 0, policy_version 254997 (0.0039) [2024-06-28 15:38:22,921][09190] Fps is (10 sec: 42598.7, 60 sec: 42871.5, 300 sec: 43042.7). Total num frames: 4177903616. Throughput: 0: 42659.7. Samples: 456823280. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 15:38:22,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 15:38:25,252][09423] Updated weights for policy 0, policy_version 255007 (0.0038) [2024-06-28 15:38:27,921][09190] Fps is (10 sec: 39321.5, 60 sec: 42598.4, 300 sec: 42987.2). Total num frames: 4178116608. Throughput: 0: 42728.8. Samples: 456954520. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 15:38:27,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 15:38:29,609][09423] Updated weights for policy 0, policy_version 255017 (0.0029) [2024-06-28 15:38:32,921][09190] Fps is (10 sec: 42598.2, 60 sec: 42598.5, 300 sec: 42931.6). Total num frames: 4178329600. Throughput: 0: 42600.9. Samples: 457207840. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 15:38:32,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 15:38:33,286][09423] Updated weights for policy 0, policy_version 255027 (0.0025) [2024-06-28 15:38:37,143][09423] Updated weights for policy 0, policy_version 255037 (0.0050) [2024-06-28 15:38:37,921][09190] Fps is (10 sec: 45875.1, 60 sec: 42873.2, 300 sec: 43098.2). Total num frames: 4178575360. Throughput: 0: 42668.8. Samples: 457466940. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 15:38:37,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 15:38:40,815][09423] Updated weights for policy 0, policy_version 255047 (0.0037) [2024-06-28 15:38:42,921][09190] Fps is (10 sec: 44237.2, 60 sec: 42871.5, 300 sec: 43043.7). Total num frames: 4178771968. Throughput: 0: 42649.0. Samples: 457597860. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 15:38:42,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 15:38:44,760][09423] Updated weights for policy 0, policy_version 255057 (0.0033) [2024-06-28 15:38:47,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42871.5, 300 sec: 42931.6). Total num frames: 4178984960. Throughput: 0: 42801.1. Samples: 457855700. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 15:38:47,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 15:38:48,487][09423] Updated weights for policy 0, policy_version 255067 (0.0035) [2024-06-28 15:38:52,238][09423] Updated weights for policy 0, policy_version 255077 (0.0038) [2024-06-28 15:38:52,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42325.3, 300 sec: 42987.2). Total num frames: 4179181568. Throughput: 0: 42851.6. Samples: 458118100. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 15:38:52,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 15:38:55,950][09423] Updated weights for policy 0, policy_version 255087 (0.0033) [2024-06-28 15:38:57,921][09190] Fps is (10 sec: 42598.1, 60 sec: 43144.5, 300 sec: 42987.2). Total num frames: 4179410944. Throughput: 0: 42979.1. Samples: 458244500. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 15:38:57,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:39:00,067][09423] Updated weights for policy 0, policy_version 255097 (0.0021) [2024-06-28 15:39:02,921][09190] Fps is (10 sec: 45874.7, 60 sec: 42871.6, 300 sec: 43042.7). Total num frames: 4179640320. Throughput: 0: 42855.5. Samples: 458498300. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 15:39:02,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:39:03,334][09423] Updated weights for policy 0, policy_version 255107 (0.0032) [2024-06-28 15:39:07,824][09423] Updated weights for policy 0, policy_version 255117 (0.0031) [2024-06-28 15:39:07,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42871.4, 300 sec: 42987.2). Total num frames: 4179836928. Throughput: 0: 43157.2. Samples: 458765360. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 15:39:07,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:39:11,320][09423] Updated weights for policy 0, policy_version 255127 (0.0041) [2024-06-28 15:39:12,921][09190] Fps is (10 sec: 40960.5, 60 sec: 42871.5, 300 sec: 42987.2). Total num frames: 4180049920. Throughput: 0: 42968.9. Samples: 458888120. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 15:39:12,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 15:39:15,441][09423] Updated weights for policy 0, policy_version 255137 (0.0029) [2024-06-28 15:39:17,924][09190] Fps is (10 sec: 42588.3, 60 sec: 42323.5, 300 sec: 42986.8). Total num frames: 4180262912. Throughput: 0: 42942.5. Samples: 459140360. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 15:39:17,924][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 15:39:18,906][09423] Updated weights for policy 0, policy_version 255147 (0.0033) [2024-06-28 15:39:22,921][09190] Fps is (10 sec: 44236.7, 60 sec: 43144.5, 300 sec: 42987.3). Total num frames: 4180492288. Throughput: 0: 43048.9. Samples: 459404140. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 15:39:22,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 15:39:22,922][09423] Updated weights for policy 0, policy_version 255157 (0.0036) [2024-06-28 15:39:26,728][09423] Updated weights for policy 0, policy_version 255167 (0.0031) [2024-06-28 15:39:27,921][09190] Fps is (10 sec: 44248.1, 60 sec: 43144.6, 300 sec: 43042.7). Total num frames: 4180705280. Throughput: 0: 42920.4. Samples: 459529280. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 15:39:27,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 15:39:30,558][09423] Updated weights for policy 0, policy_version 255177 (0.0030) [2024-06-28 15:39:32,921][09190] Fps is (10 sec: 42597.9, 60 sec: 43144.5, 300 sec: 43042.7). Total num frames: 4180918272. Throughput: 0: 43005.7. Samples: 459790960. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 15:39:32,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 15:39:34,107][09423] Updated weights for policy 0, policy_version 255187 (0.0030) [2024-06-28 15:39:35,970][09403] Signal inference workers to stop experience collection... (6350 times) [2024-06-28 15:39:36,009][09423] InferenceWorker_p0-w0: stopping experience collection (6350 times) [2024-06-28 15:39:36,019][09403] Signal inference workers to resume experience collection... (6350 times) [2024-06-28 15:39:36,032][09423] InferenceWorker_p0-w0: resuming experience collection (6350 times) [2024-06-28 15:39:37,921][09190] Fps is (10 sec: 40959.8, 60 sec: 42325.4, 300 sec: 42931.6). Total num frames: 4181114880. Throughput: 0: 42953.3. Samples: 460051000. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 15:39:37,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 15:39:38,286][09423] Updated weights for policy 0, policy_version 255197 (0.0036) [2024-06-28 15:39:41,402][09423] Updated weights for policy 0, policy_version 255207 (0.0022) [2024-06-28 15:39:42,921][09190] Fps is (10 sec: 42599.0, 60 sec: 42871.5, 300 sec: 42987.2). Total num frames: 4181344256. Throughput: 0: 42950.3. Samples: 460177260. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 15:39:42,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:39:45,753][09423] Updated weights for policy 0, policy_version 255217 (0.0041) [2024-06-28 15:39:47,921][09190] Fps is (10 sec: 45875.4, 60 sec: 43144.6, 300 sec: 43042.7). Total num frames: 4181573632. Throughput: 0: 43080.5. Samples: 460436920. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 15:39:47,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:39:49,097][09423] Updated weights for policy 0, policy_version 255227 (0.0031) [2024-06-28 15:39:52,921][09190] Fps is (10 sec: 39321.5, 60 sec: 42598.4, 300 sec: 42931.6). Total num frames: 4181737472. Throughput: 0: 43079.7. Samples: 460703940. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 15:39:52,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 15:39:53,516][09423] Updated weights for policy 0, policy_version 255237 (0.0034) [2024-06-28 15:39:57,018][09423] Updated weights for policy 0, policy_version 255247 (0.0027) [2024-06-28 15:39:57,921][09190] Fps is (10 sec: 44236.6, 60 sec: 43417.7, 300 sec: 43098.2). Total num frames: 4182016000. Throughput: 0: 43091.9. Samples: 460827260. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 15:39:57,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 15:40:00,932][09423] Updated weights for policy 0, policy_version 255257 (0.0032) [2024-06-28 15:40:02,921][09190] Fps is (10 sec: 47513.2, 60 sec: 42871.5, 300 sec: 43043.1). Total num frames: 4182212608. Throughput: 0: 43134.8. Samples: 461081320. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 15:40:02,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 15:40:04,564][09423] Updated weights for policy 0, policy_version 255267 (0.0036) [2024-06-28 15:40:07,921][09190] Fps is (10 sec: 39321.6, 60 sec: 42871.5, 300 sec: 43042.7). Total num frames: 4182409216. Throughput: 0: 43196.0. Samples: 461347960. Policy #0 lag: (min: 0.0, avg: 11.1, max: 23.0) [2024-06-28 15:40:07,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:40:08,555][09423] Updated weights for policy 0, policy_version 255277 (0.0033) [2024-06-28 15:40:11,997][09423] Updated weights for policy 0, policy_version 255287 (0.0028) [2024-06-28 15:40:12,921][09190] Fps is (10 sec: 42598.3, 60 sec: 43144.4, 300 sec: 43042.7). Total num frames: 4182638592. Throughput: 0: 43135.0. Samples: 461470360. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 15:40:12,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 15:40:16,352][09423] Updated weights for policy 0, policy_version 255297 (0.0032) [2024-06-28 15:40:17,921][09190] Fps is (10 sec: 45875.0, 60 sec: 43419.4, 300 sec: 43042.7). Total num frames: 4182867968. Throughput: 0: 42964.9. Samples: 461724380. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 15:40:17,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 15:40:17,946][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000255302_4182867968.pth... [2024-06-28 15:40:18,004][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000254672_4172546048.pth [2024-06-28 15:40:19,632][09423] Updated weights for policy 0, policy_version 255307 (0.0036) [2024-06-28 15:40:22,921][09190] Fps is (10 sec: 40960.1, 60 sec: 42598.3, 300 sec: 43042.7). Total num frames: 4183048192. Throughput: 0: 43080.8. Samples: 461989640. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 15:40:22,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 15:40:23,797][09423] Updated weights for policy 0, policy_version 255317 (0.0033) [2024-06-28 15:40:26,947][09423] Updated weights for policy 0, policy_version 255327 (0.0034) [2024-06-28 15:40:27,921][09190] Fps is (10 sec: 42598.4, 60 sec: 43144.5, 300 sec: 43042.7). Total num frames: 4183293952. Throughput: 0: 43126.1. Samples: 462117940. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 15:40:27,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 15:40:31,645][09423] Updated weights for policy 0, policy_version 255337 (0.0028) [2024-06-28 15:40:32,921][09190] Fps is (10 sec: 47514.0, 60 sec: 43417.7, 300 sec: 43042.7). Total num frames: 4183523328. Throughput: 0: 43133.8. Samples: 462377940. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 15:40:32,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:40:35,031][09423] Updated weights for policy 0, policy_version 255347 (0.0023) [2024-06-28 15:40:37,921][09190] Fps is (10 sec: 40959.8, 60 sec: 43144.5, 300 sec: 43042.7). Total num frames: 4183703552. Throughput: 0: 43051.0. Samples: 462641240. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 15:40:37,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 15:40:39,014][09423] Updated weights for policy 0, policy_version 255357 (0.0026) [2024-06-28 15:40:42,387][09423] Updated weights for policy 0, policy_version 255367 (0.0029) [2024-06-28 15:40:42,921][09190] Fps is (10 sec: 42598.6, 60 sec: 43417.6, 300 sec: 43042.7). Total num frames: 4183949312. Throughput: 0: 43026.3. Samples: 462763440. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 15:40:42,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 15:40:46,511][09423] Updated weights for policy 0, policy_version 255377 (0.0032) [2024-06-28 15:40:47,921][09190] Fps is (10 sec: 44237.4, 60 sec: 42871.5, 300 sec: 42987.2). Total num frames: 4184145920. Throughput: 0: 43089.4. Samples: 463020340. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 15:40:47,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 15:40:49,822][09423] Updated weights for policy 0, policy_version 255387 (0.0030) [2024-06-28 15:40:52,921][09190] Fps is (10 sec: 39321.5, 60 sec: 43417.6, 300 sec: 42987.5). Total num frames: 4184342528. Throughput: 0: 42865.8. Samples: 463276920. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 15:40:52,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 15:40:54,478][09423] Updated weights for policy 0, policy_version 255397 (0.0029) [2024-06-28 15:40:57,075][09403] Signal inference workers to stop experience collection... (6400 times) [2024-06-28 15:40:57,076][09403] Signal inference workers to resume experience collection... (6400 times) [2024-06-28 15:40:57,098][09423] InferenceWorker_p0-w0: stopping experience collection (6400 times) [2024-06-28 15:40:57,098][09423] InferenceWorker_p0-w0: resuming experience collection (6400 times) [2024-06-28 15:40:57,525][09423] Updated weights for policy 0, policy_version 255407 (0.0037) [2024-06-28 15:40:57,921][09190] Fps is (10 sec: 44236.4, 60 sec: 42871.4, 300 sec: 43042.7). Total num frames: 4184588288. Throughput: 0: 42920.9. Samples: 463401800. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 15:40:57,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 15:41:01,974][09423] Updated weights for policy 0, policy_version 255417 (0.0032) [2024-06-28 15:41:02,924][09190] Fps is (10 sec: 45863.4, 60 sec: 43142.8, 300 sec: 42986.8). Total num frames: 4184801280. Throughput: 0: 43199.8. Samples: 463668480. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 15:41:02,925][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 15:41:05,370][09423] Updated weights for policy 0, policy_version 255427 (0.0023) [2024-06-28 15:41:07,921][09190] Fps is (10 sec: 40960.4, 60 sec: 43144.6, 300 sec: 42987.2). Total num frames: 4184997888. Throughput: 0: 43045.9. Samples: 463926700. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 15:41:07,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 15:41:09,737][09423] Updated weights for policy 0, policy_version 255437 (0.0035) [2024-06-28 15:41:12,922][09190] Fps is (10 sec: 40969.7, 60 sec: 42871.4, 300 sec: 42931.6). Total num frames: 4185210880. Throughput: 0: 42884.8. Samples: 464047760. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 15:41:12,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 15:41:13,258][09423] Updated weights for policy 0, policy_version 255447 (0.0036) [2024-06-28 15:41:17,266][09423] Updated weights for policy 0, policy_version 255457 (0.0043) [2024-06-28 15:41:17,921][09190] Fps is (10 sec: 44236.8, 60 sec: 42871.5, 300 sec: 42987.2). Total num frames: 4185440256. Throughput: 0: 42938.7. Samples: 464310180. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 15:41:17,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:41:20,685][09423] Updated weights for policy 0, policy_version 255467 (0.0028) [2024-06-28 15:41:22,921][09190] Fps is (10 sec: 42599.1, 60 sec: 43144.6, 300 sec: 42987.2). Total num frames: 4185636864. Throughput: 0: 42894.7. Samples: 464571500. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 15:41:22,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:41:24,764][09423] Updated weights for policy 0, policy_version 255477 (0.0035) [2024-06-28 15:41:27,921][09190] Fps is (10 sec: 42597.9, 60 sec: 42871.5, 300 sec: 42932.0). Total num frames: 4185866240. Throughput: 0: 42896.3. Samples: 464693780. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 15:41:27,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 15:41:28,180][09423] Updated weights for policy 0, policy_version 255487 (0.0045) [2024-06-28 15:41:32,601][09423] Updated weights for policy 0, policy_version 255497 (0.0027) [2024-06-28 15:41:32,921][09190] Fps is (10 sec: 42598.6, 60 sec: 42325.4, 300 sec: 42876.1). Total num frames: 4186062848. Throughput: 0: 42980.9. Samples: 464954480. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 15:41:32,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 15:41:35,541][09423] Updated weights for policy 0, policy_version 255507 (0.0039) [2024-06-28 15:41:37,921][09190] Fps is (10 sec: 42598.3, 60 sec: 43144.5, 300 sec: 43042.7). Total num frames: 4186292224. Throughput: 0: 42892.8. Samples: 465207100. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 15:41:37,922][09190] Avg episode reward: [(0, '0.734')] [2024-06-28 15:41:40,401][09423] Updated weights for policy 0, policy_version 255517 (0.0031) [2024-06-28 15:41:42,924][09190] Fps is (10 sec: 45864.4, 60 sec: 42869.8, 300 sec: 42986.8). Total num frames: 4186521600. Throughput: 0: 43024.1. Samples: 465337980. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 15:41:42,924][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:41:43,606][09423] Updated weights for policy 0, policy_version 255527 (0.0028) [2024-06-28 15:41:47,910][09423] Updated weights for policy 0, policy_version 255537 (0.0042) [2024-06-28 15:41:47,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42871.4, 300 sec: 42931.6). Total num frames: 4186718208. Throughput: 0: 42920.6. Samples: 465599800. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 15:41:47,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 15:41:51,522][09423] Updated weights for policy 0, policy_version 255547 (0.0037) [2024-06-28 15:41:52,921][09190] Fps is (10 sec: 42607.7, 60 sec: 43417.5, 300 sec: 42987.2). Total num frames: 4186947584. Throughput: 0: 42801.2. Samples: 465852760. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 15:41:52,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 15:41:55,304][09423] Updated weights for policy 0, policy_version 255557 (0.0040) [2024-06-28 15:41:57,921][09190] Fps is (10 sec: 44237.0, 60 sec: 42871.5, 300 sec: 42931.6). Total num frames: 4187160576. Throughput: 0: 42977.4. Samples: 465981740. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 15:41:57,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:41:58,927][09423] Updated weights for policy 0, policy_version 255567 (0.0036) [2024-06-28 15:42:02,921][09190] Fps is (10 sec: 40960.6, 60 sec: 42600.2, 300 sec: 42931.6). Total num frames: 4187357184. Throughput: 0: 42984.0. Samples: 466244460. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 15:42:02,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:42:02,960][09423] Updated weights for policy 0, policy_version 255577 (0.0030) [2024-06-28 15:42:06,571][09423] Updated weights for policy 0, policy_version 255587 (0.0034) [2024-06-28 15:42:07,921][09190] Fps is (10 sec: 42598.2, 60 sec: 43144.4, 300 sec: 43042.7). Total num frames: 4187586560. Throughput: 0: 42715.9. Samples: 466493720. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 15:42:07,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 15:42:10,473][09423] Updated weights for policy 0, policy_version 255597 (0.0033) [2024-06-28 15:42:12,921][09190] Fps is (10 sec: 44236.4, 60 sec: 43144.6, 300 sec: 42987.2). Total num frames: 4187799552. Throughput: 0: 42832.9. Samples: 466621260. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 15:42:12,922][09190] Avg episode reward: [(0, '0.735')] [2024-06-28 15:42:14,028][09423] Updated weights for policy 0, policy_version 255607 (0.0025) [2024-06-28 15:42:17,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42598.3, 300 sec: 42931.6). Total num frames: 4187996160. Throughput: 0: 42910.5. Samples: 466885460. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 15:42:17,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 15:42:17,944][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000255615_4187996160.pth... [2024-06-28 15:42:17,998][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000254988_4177723392.pth [2024-06-28 15:42:18,356][09423] Updated weights for policy 0, policy_version 255617 (0.0039) [2024-06-28 15:42:21,856][09423] Updated weights for policy 0, policy_version 255627 (0.0043) [2024-06-28 15:42:22,922][09190] Fps is (10 sec: 42598.2, 60 sec: 43144.5, 300 sec: 42931.6). Total num frames: 4188225536. Throughput: 0: 42957.8. Samples: 467140200. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 15:42:22,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 15:42:25,793][09423] Updated weights for policy 0, policy_version 255637 (0.0037) [2024-06-28 15:42:27,921][09190] Fps is (10 sec: 45875.8, 60 sec: 43144.6, 300 sec: 42987.2). Total num frames: 4188454912. Throughput: 0: 42928.9. Samples: 467269680. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 15:42:27,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 15:42:29,737][09423] Updated weights for policy 0, policy_version 255647 (0.0032) [2024-06-28 15:42:30,509][09403] Signal inference workers to stop experience collection... (6450 times) [2024-06-28 15:42:30,560][09423] InferenceWorker_p0-w0: stopping experience collection (6450 times) [2024-06-28 15:42:30,564][09403] Signal inference workers to resume experience collection... (6450 times) [2024-06-28 15:42:30,576][09423] InferenceWorker_p0-w0: resuming experience collection (6450 times) [2024-06-28 15:42:32,921][09190] Fps is (10 sec: 42599.0, 60 sec: 43144.5, 300 sec: 42876.5). Total num frames: 4188651520. Throughput: 0: 42774.3. Samples: 467524640. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 15:42:32,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 15:42:33,335][09423] Updated weights for policy 0, policy_version 255657 (0.0043) [2024-06-28 15:42:37,074][09423] Updated weights for policy 0, policy_version 255667 (0.0031) [2024-06-28 15:42:37,921][09190] Fps is (10 sec: 40959.7, 60 sec: 42871.5, 300 sec: 42931.6). Total num frames: 4188864512. Throughput: 0: 42948.5. Samples: 467785440. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 15:42:37,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 15:42:40,642][09423] Updated weights for policy 0, policy_version 255677 (0.0036) [2024-06-28 15:42:42,921][09190] Fps is (10 sec: 42597.7, 60 sec: 42599.9, 300 sec: 42931.6). Total num frames: 4189077504. Throughput: 0: 42917.7. Samples: 467913040. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 15:42:42,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 15:42:44,719][09423] Updated weights for policy 0, policy_version 255687 (0.0035) [2024-06-28 15:42:47,922][09190] Fps is (10 sec: 44236.6, 60 sec: 43144.5, 300 sec: 42931.6). Total num frames: 4189306880. Throughput: 0: 42883.4. Samples: 468174220. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 15:42:47,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:42:48,569][09423] Updated weights for policy 0, policy_version 255697 (0.0036) [2024-06-28 15:42:52,106][09423] Updated weights for policy 0, policy_version 255707 (0.0028) [2024-06-28 15:42:52,921][09190] Fps is (10 sec: 44237.6, 60 sec: 42871.6, 300 sec: 43042.7). Total num frames: 4189519872. Throughput: 0: 43176.1. Samples: 468436640. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 15:42:52,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 15:42:56,293][09423] Updated weights for policy 0, policy_version 255717 (0.0033) [2024-06-28 15:42:57,921][09190] Fps is (10 sec: 44237.3, 60 sec: 43144.6, 300 sec: 42987.2). Total num frames: 4189749248. Throughput: 0: 43243.2. Samples: 468567200. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 15:42:57,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 15:42:59,574][09423] Updated weights for policy 0, policy_version 255727 (0.0034) [2024-06-28 15:43:02,921][09190] Fps is (10 sec: 42598.5, 60 sec: 43144.6, 300 sec: 42987.2). Total num frames: 4189945856. Throughput: 0: 43109.9. Samples: 468825400. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 15:43:02,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 15:43:03,790][09423] Updated weights for policy 0, policy_version 255737 (0.0031) [2024-06-28 15:43:07,921][09190] Fps is (10 sec: 40959.5, 60 sec: 42871.5, 300 sec: 42987.2). Total num frames: 4190158848. Throughput: 0: 43300.4. Samples: 469088720. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 15:43:07,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 15:43:07,924][09423] Updated weights for policy 0, policy_version 255747 (0.0024) [2024-06-28 15:43:11,159][09423] Updated weights for policy 0, policy_version 255757 (0.0035) [2024-06-28 15:43:12,921][09190] Fps is (10 sec: 45874.8, 60 sec: 43417.6, 300 sec: 42987.2). Total num frames: 4190404608. Throughput: 0: 43185.7. Samples: 469213040. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 15:43:12,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 15:43:15,360][09423] Updated weights for policy 0, policy_version 255767 (0.0027) [2024-06-28 15:43:17,921][09190] Fps is (10 sec: 42598.7, 60 sec: 43144.6, 300 sec: 42987.2). Total num frames: 4190584832. Throughput: 0: 43338.6. Samples: 469474880. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 15:43:17,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:43:18,782][09423] Updated weights for policy 0, policy_version 255777 (0.0025) [2024-06-28 15:43:22,922][09190] Fps is (10 sec: 39321.0, 60 sec: 42871.4, 300 sec: 42987.1). Total num frames: 4190797824. Throughput: 0: 43179.9. Samples: 469728540. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 15:43:22,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:43:22,931][09423] Updated weights for policy 0, policy_version 255787 (0.0029) [2024-06-28 15:43:26,318][09423] Updated weights for policy 0, policy_version 255797 (0.0044) [2024-06-28 15:43:27,921][09190] Fps is (10 sec: 44237.1, 60 sec: 42871.5, 300 sec: 43042.7). Total num frames: 4191027200. Throughput: 0: 43243.7. Samples: 469859000. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 15:43:27,922][09190] Avg episode reward: [(0, '0.735')] [2024-06-28 15:43:30,289][09423] Updated weights for policy 0, policy_version 255807 (0.0044) [2024-06-28 15:43:32,921][09190] Fps is (10 sec: 44237.9, 60 sec: 43144.6, 300 sec: 42931.6). Total num frames: 4191240192. Throughput: 0: 43205.0. Samples: 470118440. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 15:43:32,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 15:43:34,314][09423] Updated weights for policy 0, policy_version 255817 (0.0039) [2024-06-28 15:43:37,883][09423] Updated weights for policy 0, policy_version 255827 (0.0038) [2024-06-28 15:43:37,921][09190] Fps is (10 sec: 44236.3, 60 sec: 43417.6, 300 sec: 43042.7). Total num frames: 4191469568. Throughput: 0: 42919.4. Samples: 470368020. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 15:43:37,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 15:43:41,710][09423] Updated weights for policy 0, policy_version 255837 (0.0036) [2024-06-28 15:43:42,921][09190] Fps is (10 sec: 44236.1, 60 sec: 43417.6, 300 sec: 43042.7). Total num frames: 4191682560. Throughput: 0: 43077.2. Samples: 470505680. Policy #0 lag: (min: 0.0, avg: 9.8, max: 21.0) [2024-06-28 15:43:42,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 15:43:45,625][09423] Updated weights for policy 0, policy_version 255847 (0.0046) [2024-06-28 15:43:47,921][09190] Fps is (10 sec: 39322.3, 60 sec: 42598.5, 300 sec: 42987.2). Total num frames: 4191862784. Throughput: 0: 42961.8. Samples: 470758680. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 15:43:47,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 15:43:49,448][09423] Updated weights for policy 0, policy_version 255857 (0.0044) [2024-06-28 15:43:52,922][09190] Fps is (10 sec: 40959.8, 60 sec: 42871.3, 300 sec: 42987.2). Total num frames: 4192092160. Throughput: 0: 42858.6. Samples: 471017360. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 15:43:52,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 15:43:53,497][09423] Updated weights for policy 0, policy_version 255867 (0.0031) [2024-06-28 15:43:56,837][09423] Updated weights for policy 0, policy_version 255877 (0.0040) [2024-06-28 15:43:57,921][09190] Fps is (10 sec: 44236.4, 60 sec: 42598.4, 300 sec: 42931.6). Total num frames: 4192305152. Throughput: 0: 43038.2. Samples: 471149760. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 15:43:57,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 15:44:01,091][09423] Updated weights for policy 0, policy_version 255887 (0.0042) [2024-06-28 15:44:02,921][09190] Fps is (10 sec: 42599.3, 60 sec: 42871.5, 300 sec: 42987.2). Total num frames: 4192518144. Throughput: 0: 42768.5. Samples: 471399460. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 15:44:02,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 15:44:04,652][09423] Updated weights for policy 0, policy_version 255897 (0.0029) [2024-06-28 15:44:06,179][09403] Signal inference workers to stop experience collection... (6500 times) [2024-06-28 15:44:06,179][09403] Signal inference workers to resume experience collection... (6500 times) [2024-06-28 15:44:06,217][09423] InferenceWorker_p0-w0: stopping experience collection (6500 times) [2024-06-28 15:44:06,218][09423] InferenceWorker_p0-w0: resuming experience collection (6500 times) [2024-06-28 15:44:07,921][09190] Fps is (10 sec: 44236.8, 60 sec: 43144.6, 300 sec: 43042.7). Total num frames: 4192747520. Throughput: 0: 42944.2. Samples: 471661020. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 15:44:07,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 15:44:08,501][09423] Updated weights for policy 0, policy_version 255907 (0.0056) [2024-06-28 15:44:12,558][09423] Updated weights for policy 0, policy_version 255917 (0.0046) [2024-06-28 15:44:12,921][09190] Fps is (10 sec: 45875.1, 60 sec: 42871.5, 300 sec: 43098.6). Total num frames: 4192976896. Throughput: 0: 42907.6. Samples: 471789840. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 15:44:12,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 15:44:16,086][09423] Updated weights for policy 0, policy_version 255927 (0.0038) [2024-06-28 15:44:17,922][09190] Fps is (10 sec: 42598.1, 60 sec: 43144.5, 300 sec: 42987.2). Total num frames: 4193173504. Throughput: 0: 42928.3. Samples: 472050220. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 15:44:17,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 15:44:17,941][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000255931_4193173504.pth... [2024-06-28 15:44:17,996][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000255302_4182867968.pth [2024-06-28 15:44:20,020][09423] Updated weights for policy 0, policy_version 255937 (0.0040) [2024-06-28 15:44:22,921][09190] Fps is (10 sec: 40960.2, 60 sec: 43144.7, 300 sec: 42987.2). Total num frames: 4193386496. Throughput: 0: 42990.4. Samples: 472302580. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 15:44:22,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 15:44:23,599][09423] Updated weights for policy 0, policy_version 255947 (0.0037) [2024-06-28 15:44:27,856][09423] Updated weights for policy 0, policy_version 255957 (0.0026) [2024-06-28 15:44:27,921][09190] Fps is (10 sec: 42598.6, 60 sec: 42871.4, 300 sec: 42987.2). Total num frames: 4193599488. Throughput: 0: 42882.7. Samples: 472435400. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 15:44:27,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 15:44:31,705][09423] Updated weights for policy 0, policy_version 255967 (0.0043) [2024-06-28 15:44:32,921][09190] Fps is (10 sec: 42598.2, 60 sec: 42871.4, 300 sec: 43042.7). Total num frames: 4193812480. Throughput: 0: 42968.4. Samples: 472692260. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 15:44:32,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:44:35,225][09423] Updated weights for policy 0, policy_version 255977 (0.0031) [2024-06-28 15:44:37,921][09190] Fps is (10 sec: 44236.9, 60 sec: 42871.5, 300 sec: 43042.7). Total num frames: 4194041856. Throughput: 0: 42761.0. Samples: 472941600. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 15:44:37,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 15:44:39,190][09423] Updated weights for policy 0, policy_version 255987 (0.0025) [2024-06-28 15:44:42,864][09423] Updated weights for policy 0, policy_version 255997 (0.0032) [2024-06-28 15:44:42,923][09190] Fps is (10 sec: 44227.7, 60 sec: 42870.1, 300 sec: 42986.9). Total num frames: 4194254848. Throughput: 0: 42895.0. Samples: 473080120. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 15:44:42,924][09190] Avg episode reward: [(0, '0.807')] [2024-06-28 15:44:46,504][09423] Updated weights for policy 0, policy_version 256007 (0.0022) [2024-06-28 15:44:47,924][09190] Fps is (10 sec: 42587.7, 60 sec: 43415.7, 300 sec: 43153.4). Total num frames: 4194467840. Throughput: 0: 43087.7. Samples: 473338520. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 15:44:47,925][09190] Avg episode reward: [(0, '0.804')] [2024-06-28 15:44:50,349][09423] Updated weights for policy 0, policy_version 256017 (0.0035) [2024-06-28 15:44:52,921][09190] Fps is (10 sec: 42607.0, 60 sec: 43144.6, 300 sec: 42931.6). Total num frames: 4194680832. Throughput: 0: 42959.5. Samples: 473594200. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 15:44:52,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:44:54,074][09423] Updated weights for policy 0, policy_version 256027 (0.0036) [2024-06-28 15:44:57,900][09423] Updated weights for policy 0, policy_version 256037 (0.0041) [2024-06-28 15:44:57,922][09190] Fps is (10 sec: 44247.6, 60 sec: 43417.5, 300 sec: 43042.7). Total num frames: 4194910208. Throughput: 0: 43149.6. Samples: 473731580. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 15:44:57,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 15:45:01,917][09423] Updated weights for policy 0, policy_version 256047 (0.0028) [2024-06-28 15:45:02,921][09190] Fps is (10 sec: 42598.5, 60 sec: 43144.5, 300 sec: 43042.7). Total num frames: 4195106816. Throughput: 0: 42973.0. Samples: 473984000. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 15:45:02,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 15:45:05,483][09423] Updated weights for policy 0, policy_version 256057 (0.0042) [2024-06-28 15:45:07,921][09190] Fps is (10 sec: 42599.1, 60 sec: 43144.6, 300 sec: 43042.7). Total num frames: 4195336192. Throughput: 0: 43094.6. Samples: 474241840. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 15:45:07,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 15:45:09,493][09423] Updated weights for policy 0, policy_version 256067 (0.0032) [2024-06-28 15:45:12,921][09190] Fps is (10 sec: 42598.6, 60 sec: 42598.4, 300 sec: 42931.6). Total num frames: 4195532800. Throughput: 0: 42961.9. Samples: 474368680. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 15:45:12,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:45:13,064][09423] Updated weights for policy 0, policy_version 256077 (0.0035) [2024-06-28 15:45:17,521][09423] Updated weights for policy 0, policy_version 256087 (0.0040) [2024-06-28 15:45:17,921][09190] Fps is (10 sec: 40959.5, 60 sec: 42871.5, 300 sec: 43042.7). Total num frames: 4195745792. Throughput: 0: 42943.9. Samples: 474624740. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 15:45:17,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 15:45:20,742][09423] Updated weights for policy 0, policy_version 256097 (0.0038) [2024-06-28 15:45:22,921][09190] Fps is (10 sec: 44236.3, 60 sec: 43144.4, 300 sec: 42987.2). Total num frames: 4195975168. Throughput: 0: 43124.9. Samples: 474882220. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 15:45:22,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 15:45:24,921][09423] Updated weights for policy 0, policy_version 256107 (0.0039) [2024-06-28 15:45:27,921][09190] Fps is (10 sec: 44237.3, 60 sec: 43144.6, 300 sec: 42931.6). Total num frames: 4196188160. Throughput: 0: 43064.7. Samples: 475017940. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 15:45:27,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:45:28,218][09423] Updated weights for policy 0, policy_version 256117 (0.0035) [2024-06-28 15:45:32,207][09423] Updated weights for policy 0, policy_version 256127 (0.0027) [2024-06-28 15:45:32,922][09190] Fps is (10 sec: 42598.1, 60 sec: 43144.4, 300 sec: 43042.7). Total num frames: 4196401152. Throughput: 0: 43191.2. Samples: 475282020. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 15:45:32,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:45:35,612][09423] Updated weights for policy 0, policy_version 256137 (0.0030) [2024-06-28 15:45:37,921][09190] Fps is (10 sec: 44236.5, 60 sec: 43144.5, 300 sec: 42987.2). Total num frames: 4196630528. Throughput: 0: 43142.7. Samples: 475535620. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 15:45:37,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 15:45:39,694][09423] Updated weights for policy 0, policy_version 256147 (0.0031) [2024-06-28 15:45:42,921][09190] Fps is (10 sec: 45875.6, 60 sec: 43419.0, 300 sec: 43098.2). Total num frames: 4196859904. Throughput: 0: 43107.6. Samples: 475671420. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 15:45:42,923][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 15:45:43,428][09423] Updated weights for policy 0, policy_version 256157 (0.0038) [2024-06-28 15:45:47,701][09423] Updated weights for policy 0, policy_version 256167 (0.0035) [2024-06-28 15:45:47,921][09190] Fps is (10 sec: 40959.8, 60 sec: 42873.2, 300 sec: 43042.7). Total num frames: 4197040128. Throughput: 0: 43226.2. Samples: 475929180. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 15:45:47,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:45:50,711][09403] Signal inference workers to stop experience collection... (6550 times) [2024-06-28 15:45:50,711][09403] Signal inference workers to resume experience collection... (6550 times) [2024-06-28 15:45:50,723][09423] InferenceWorker_p0-w0: stopping experience collection (6550 times) [2024-06-28 15:45:50,723][09423] InferenceWorker_p0-w0: resuming experience collection (6550 times) [2024-06-28 15:45:51,054][09423] Updated weights for policy 0, policy_version 256177 (0.0023) [2024-06-28 15:45:52,921][09190] Fps is (10 sec: 39322.0, 60 sec: 42871.5, 300 sec: 42931.6). Total num frames: 4197253120. Throughput: 0: 42885.3. Samples: 476171680. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 15:45:52,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 15:45:55,613][09423] Updated weights for policy 0, policy_version 256187 (0.0033) [2024-06-28 15:45:57,921][09190] Fps is (10 sec: 42598.6, 60 sec: 42598.5, 300 sec: 42932.0). Total num frames: 4197466112. Throughput: 0: 43043.9. Samples: 476305660. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 15:45:57,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:45:58,919][09423] Updated weights for policy 0, policy_version 256197 (0.0029) [2024-06-28 15:46:02,927][09190] Fps is (10 sec: 42572.7, 60 sec: 42867.2, 300 sec: 42986.3). Total num frames: 4197679104. Throughput: 0: 43085.0. Samples: 476563820. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 15:46:02,928][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 15:46:03,231][09423] Updated weights for policy 0, policy_version 256207 (0.0027) [2024-06-28 15:46:06,422][09423] Updated weights for policy 0, policy_version 256217 (0.0028) [2024-06-28 15:46:07,921][09190] Fps is (10 sec: 44236.8, 60 sec: 42871.4, 300 sec: 43042.7). Total num frames: 4197908480. Throughput: 0: 43037.4. Samples: 476818900. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2024-06-28 15:46:07,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 15:46:10,835][09423] Updated weights for policy 0, policy_version 256227 (0.0032) [2024-06-28 15:46:12,921][09190] Fps is (10 sec: 45902.8, 60 sec: 43417.6, 300 sec: 43042.7). Total num frames: 4198137856. Throughput: 0: 43056.9. Samples: 476955500. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2024-06-28 15:46:12,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 15:46:13,766][09423] Updated weights for policy 0, policy_version 256237 (0.0034) [2024-06-28 15:46:17,923][09190] Fps is (10 sec: 40954.6, 60 sec: 42870.6, 300 sec: 42987.0). Total num frames: 4198318080. Throughput: 0: 42919.3. Samples: 477213440. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2024-06-28 15:46:17,923][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 15:46:17,928][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000256245_4198318080.pth... [2024-06-28 15:46:18,000][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000255615_4187996160.pth [2024-06-28 15:46:18,524][09423] Updated weights for policy 0, policy_version 256247 (0.0028) [2024-06-28 15:46:21,502][09423] Updated weights for policy 0, policy_version 256257 (0.0037) [2024-06-28 15:46:22,921][09190] Fps is (10 sec: 40960.1, 60 sec: 42871.5, 300 sec: 42987.2). Total num frames: 4198547456. Throughput: 0: 43039.6. Samples: 477472400. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2024-06-28 15:46:22,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:46:26,296][09423] Updated weights for policy 0, policy_version 256267 (0.0038) [2024-06-28 15:46:27,921][09190] Fps is (10 sec: 45881.5, 60 sec: 43144.5, 300 sec: 43098.2). Total num frames: 4198776832. Throughput: 0: 42933.0. Samples: 477603400. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2024-06-28 15:46:27,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 15:46:29,227][09423] Updated weights for policy 0, policy_version 256277 (0.0037) [2024-06-28 15:46:32,921][09190] Fps is (10 sec: 40959.7, 60 sec: 42598.5, 300 sec: 42931.6). Total num frames: 4198957056. Throughput: 0: 42633.4. Samples: 477847680. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2024-06-28 15:46:32,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:46:34,063][09423] Updated weights for policy 0, policy_version 256287 (0.0031) [2024-06-28 15:46:36,787][09423] Updated weights for policy 0, policy_version 256297 (0.0025) [2024-06-28 15:46:37,921][09190] Fps is (10 sec: 44236.7, 60 sec: 43144.6, 300 sec: 43043.0). Total num frames: 4199219200. Throughput: 0: 43019.5. Samples: 478107560. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2024-06-28 15:46:37,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:46:41,637][09423] Updated weights for policy 0, policy_version 256307 (0.0029) [2024-06-28 15:46:42,921][09190] Fps is (10 sec: 45874.9, 60 sec: 42598.4, 300 sec: 43042.7). Total num frames: 4199415808. Throughput: 0: 43169.7. Samples: 478248300. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2024-06-28 15:46:42,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 15:46:44,273][09423] Updated weights for policy 0, policy_version 256317 (0.0029) [2024-06-28 15:46:47,921][09190] Fps is (10 sec: 39321.6, 60 sec: 42871.5, 300 sec: 42931.6). Total num frames: 4199612416. Throughput: 0: 43097.3. Samples: 478502940. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2024-06-28 15:46:47,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 15:46:48,988][09423] Updated weights for policy 0, policy_version 256327 (0.0027) [2024-06-28 15:46:51,705][09423] Updated weights for policy 0, policy_version 256337 (0.0041) [2024-06-28 15:46:52,921][09190] Fps is (10 sec: 45875.2, 60 sec: 43690.5, 300 sec: 43098.2). Total num frames: 4199874560. Throughput: 0: 43116.4. Samples: 478759140. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2024-06-28 15:46:52,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 15:46:56,688][09423] Updated weights for policy 0, policy_version 256347 (0.0041) [2024-06-28 15:46:57,921][09190] Fps is (10 sec: 45875.4, 60 sec: 43417.6, 300 sec: 43098.3). Total num frames: 4200071168. Throughput: 0: 43044.4. Samples: 478892500. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2024-06-28 15:46:57,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 15:46:59,600][09423] Updated weights for policy 0, policy_version 256357 (0.0028) [2024-06-28 15:47:02,922][09190] Fps is (10 sec: 37683.1, 60 sec: 42875.6, 300 sec: 42931.6). Total num frames: 4200251392. Throughput: 0: 42911.4. Samples: 479144400. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2024-06-28 15:47:02,922][09190] Avg episode reward: [(0, '0.759')] [2024-06-28 15:47:04,444][09423] Updated weights for policy 0, policy_version 256367 (0.0041) [2024-06-28 15:47:07,281][09423] Updated weights for policy 0, policy_version 256377 (0.0045) [2024-06-28 15:47:07,921][09190] Fps is (10 sec: 42598.0, 60 sec: 43144.5, 300 sec: 43042.7). Total num frames: 4200497152. Throughput: 0: 42716.4. Samples: 479394640. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2024-06-28 15:47:07,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:47:12,134][09423] Updated weights for policy 0, policy_version 256387 (0.0038) [2024-06-28 15:47:12,599][09403] Signal inference workers to stop experience collection... (6600 times) [2024-06-28 15:47:12,654][09403] Signal inference workers to resume experience collection... (6600 times) [2024-06-28 15:47:12,654][09423] InferenceWorker_p0-w0: stopping experience collection (6600 times) [2024-06-28 15:47:12,669][09423] InferenceWorker_p0-w0: resuming experience collection (6600 times) [2024-06-28 15:47:12,922][09190] Fps is (10 sec: 44236.2, 60 sec: 42598.2, 300 sec: 43042.7). Total num frames: 4200693760. Throughput: 0: 42803.3. Samples: 479529560. Policy #0 lag: (min: 0.0, avg: 9.8, max: 23.0) [2024-06-28 15:47:12,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 15:47:14,797][09423] Updated weights for policy 0, policy_version 256397 (0.0035) [2024-06-28 15:47:17,922][09190] Fps is (10 sec: 40959.7, 60 sec: 43145.4, 300 sec: 42987.2). Total num frames: 4200906752. Throughput: 0: 43035.5. Samples: 479784280. Policy #0 lag: (min: 0.0, avg: 12.0, max: 24.0) [2024-06-28 15:47:17,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:47:19,717][09423] Updated weights for policy 0, policy_version 256407 (0.0031) [2024-06-28 15:47:22,396][09423] Updated weights for policy 0, policy_version 256417 (0.0033) [2024-06-28 15:47:22,921][09190] Fps is (10 sec: 45876.2, 60 sec: 43417.5, 300 sec: 43042.7). Total num frames: 4201152512. Throughput: 0: 43047.5. Samples: 480044700. Policy #0 lag: (min: 0.0, avg: 12.0, max: 24.0) [2024-06-28 15:47:22,924][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:47:27,158][09423] Updated weights for policy 0, policy_version 256427 (0.0041) [2024-06-28 15:47:27,921][09190] Fps is (10 sec: 45875.6, 60 sec: 43144.5, 300 sec: 43098.2). Total num frames: 4201365504. Throughput: 0: 43021.4. Samples: 480184260. Policy #0 lag: (min: 0.0, avg: 12.0, max: 24.0) [2024-06-28 15:47:27,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 15:47:29,870][09423] Updated weights for policy 0, policy_version 256437 (0.0045) [2024-06-28 15:47:32,921][09190] Fps is (10 sec: 40960.0, 60 sec: 43417.6, 300 sec: 43042.7). Total num frames: 4201562112. Throughput: 0: 43128.0. Samples: 480443700. Policy #0 lag: (min: 0.0, avg: 12.0, max: 24.0) [2024-06-28 15:47:32,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 15:47:34,628][09423] Updated weights for policy 0, policy_version 256447 (0.0029) [2024-06-28 15:47:37,809][09423] Updated weights for policy 0, policy_version 256457 (0.0028) [2024-06-28 15:47:37,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42871.4, 300 sec: 43098.3). Total num frames: 4201791488. Throughput: 0: 42960.9. Samples: 480692380. Policy #0 lag: (min: 0.0, avg: 12.0, max: 24.0) [2024-06-28 15:47:37,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 15:47:42,485][09423] Updated weights for policy 0, policy_version 256467 (0.0036) [2024-06-28 15:47:42,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42871.5, 300 sec: 42987.2). Total num frames: 4201988096. Throughput: 0: 43047.1. Samples: 480829620. Policy #0 lag: (min: 0.0, avg: 12.0, max: 24.0) [2024-06-28 15:47:42,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 15:47:45,316][09423] Updated weights for policy 0, policy_version 256477 (0.0026) [2024-06-28 15:47:47,922][09190] Fps is (10 sec: 40959.7, 60 sec: 43144.4, 300 sec: 42987.1). Total num frames: 4202201088. Throughput: 0: 43004.9. Samples: 481079620. Policy #0 lag: (min: 0.0, avg: 12.0, max: 24.0) [2024-06-28 15:47:47,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 15:47:50,324][09423] Updated weights for policy 0, policy_version 256487 (0.0028) [2024-06-28 15:47:52,763][09423] Updated weights for policy 0, policy_version 256497 (0.0023) [2024-06-28 15:47:52,921][09190] Fps is (10 sec: 45875.3, 60 sec: 42871.5, 300 sec: 43042.7). Total num frames: 4202446848. Throughput: 0: 43042.7. Samples: 481331560. Policy #0 lag: (min: 0.0, avg: 12.0, max: 24.0) [2024-06-28 15:47:52,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 15:47:57,695][09423] Updated weights for policy 0, policy_version 256507 (0.0036) [2024-06-28 15:47:57,921][09190] Fps is (10 sec: 42598.6, 60 sec: 42598.3, 300 sec: 42987.2). Total num frames: 4202627072. Throughput: 0: 43087.3. Samples: 481468480. Policy #0 lag: (min: 0.0, avg: 12.0, max: 24.0) [2024-06-28 15:47:57,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 15:48:00,509][09423] Updated weights for policy 0, policy_version 256517 (0.0035) [2024-06-28 15:48:02,921][09190] Fps is (10 sec: 40960.2, 60 sec: 43417.7, 300 sec: 43042.7). Total num frames: 4202856448. Throughput: 0: 43083.3. Samples: 481723020. Policy #0 lag: (min: 0.0, avg: 12.0, max: 24.0) [2024-06-28 15:48:02,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 15:48:05,080][09423] Updated weights for policy 0, policy_version 256527 (0.0040) [2024-06-28 15:48:07,921][09190] Fps is (10 sec: 45875.9, 60 sec: 43144.6, 300 sec: 42987.2). Total num frames: 4203085824. Throughput: 0: 43160.1. Samples: 481986900. Policy #0 lag: (min: 0.0, avg: 12.0, max: 24.0) [2024-06-28 15:48:07,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 15:48:08,027][09423] Updated weights for policy 0, policy_version 256537 (0.0026) [2024-06-28 15:48:12,605][09423] Updated weights for policy 0, policy_version 256547 (0.0045) [2024-06-28 15:48:12,921][09190] Fps is (10 sec: 42597.9, 60 sec: 43144.7, 300 sec: 43042.7). Total num frames: 4203282432. Throughput: 0: 42875.6. Samples: 482113660. Policy #0 lag: (min: 0.0, avg: 12.0, max: 24.0) [2024-06-28 15:48:12,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 15:48:15,736][09423] Updated weights for policy 0, policy_version 256557 (0.0028) [2024-06-28 15:48:17,922][09190] Fps is (10 sec: 42597.6, 60 sec: 43417.6, 300 sec: 43098.3). Total num frames: 4203511808. Throughput: 0: 42825.7. Samples: 482370860. Policy #0 lag: (min: 0.0, avg: 12.0, max: 24.0) [2024-06-28 15:48:17,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 15:48:18,047][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000256563_4203528192.pth... [2024-06-28 15:48:18,101][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000255931_4193173504.pth [2024-06-28 15:48:20,762][09423] Updated weights for policy 0, policy_version 256567 (0.0030) [2024-06-28 15:48:22,924][09190] Fps is (10 sec: 44226.1, 60 sec: 42869.7, 300 sec: 43042.3). Total num frames: 4203724800. Throughput: 0: 42948.4. Samples: 482625160. Policy #0 lag: (min: 0.0, avg: 12.0, max: 24.0) [2024-06-28 15:48:22,924][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 15:48:23,583][09423] Updated weights for policy 0, policy_version 256577 (0.0031) [2024-06-28 15:48:27,922][09190] Fps is (10 sec: 37683.0, 60 sec: 42052.2, 300 sec: 42876.1). Total num frames: 4203888640. Throughput: 0: 42720.3. Samples: 482752040. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-28 15:48:27,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 15:48:28,490][09423] Updated weights for policy 0, policy_version 256587 (0.0038) [2024-06-28 15:48:30,981][09423] Updated weights for policy 0, policy_version 256597 (0.0027) [2024-06-28 15:48:32,921][09190] Fps is (10 sec: 42609.2, 60 sec: 43144.6, 300 sec: 42987.2). Total num frames: 4204150784. Throughput: 0: 42805.1. Samples: 483005840. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-28 15:48:32,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:48:35,871][09423] Updated weights for policy 0, policy_version 256607 (0.0032) [2024-06-28 15:48:37,924][09190] Fps is (10 sec: 47502.2, 60 sec: 42869.7, 300 sec: 42986.8). Total num frames: 4204363776. Throughput: 0: 43106.4. Samples: 483271460. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-28 15:48:37,925][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 15:48:38,717][09423] Updated weights for policy 0, policy_version 256617 (0.0027) [2024-06-28 15:48:42,921][09190] Fps is (10 sec: 39321.2, 60 sec: 42598.4, 300 sec: 42987.2). Total num frames: 4204544000. Throughput: 0: 42951.2. Samples: 483401280. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-28 15:48:42,922][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 15:48:43,216][09423] Updated weights for policy 0, policy_version 256627 (0.0036) [2024-06-28 15:48:46,345][09423] Updated weights for policy 0, policy_version 256637 (0.0031) [2024-06-28 15:48:47,921][09190] Fps is (10 sec: 44248.4, 60 sec: 43417.7, 300 sec: 43098.3). Total num frames: 4204806144. Throughput: 0: 43089.3. Samples: 483662040. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-28 15:48:47,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 15:48:50,929][09423] Updated weights for policy 0, policy_version 256647 (0.0036) [2024-06-28 15:48:52,208][09403] Signal inference workers to stop experience collection... (6650 times) [2024-06-28 15:48:52,239][09423] InferenceWorker_p0-w0: stopping experience collection (6650 times) [2024-06-28 15:48:52,264][09403] Signal inference workers to resume experience collection... (6650 times) [2024-06-28 15:48:52,264][09423] InferenceWorker_p0-w0: resuming experience collection (6650 times) [2024-06-28 15:48:52,921][09190] Fps is (10 sec: 45875.9, 60 sec: 42598.5, 300 sec: 43042.7). Total num frames: 4205002752. Throughput: 0: 42900.5. Samples: 483917420. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-28 15:48:52,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:48:53,879][09423] Updated weights for policy 0, policy_version 256657 (0.0040) [2024-06-28 15:48:57,921][09190] Fps is (10 sec: 39321.5, 60 sec: 42871.5, 300 sec: 42987.2). Total num frames: 4205199360. Throughput: 0: 42946.7. Samples: 484046260. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-28 15:48:57,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 15:48:58,881][09423] Updated weights for policy 0, policy_version 256667 (0.0032) [2024-06-28 15:49:01,702][09423] Updated weights for policy 0, policy_version 256677 (0.0040) [2024-06-28 15:49:02,922][09190] Fps is (10 sec: 44235.7, 60 sec: 43144.4, 300 sec: 43042.7). Total num frames: 4205445120. Throughput: 0: 42878.2. Samples: 484300380. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-28 15:49:02,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:49:06,550][09423] Updated weights for policy 0, policy_version 256687 (0.0037) [2024-06-28 15:49:07,922][09190] Fps is (10 sec: 45874.5, 60 sec: 42871.3, 300 sec: 42987.1). Total num frames: 4205658112. Throughput: 0: 42914.7. Samples: 484556220. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-28 15:49:07,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 15:49:09,207][09423] Updated weights for policy 0, policy_version 256697 (0.0028) [2024-06-28 15:49:12,922][09190] Fps is (10 sec: 39321.7, 60 sec: 42598.4, 300 sec: 42931.6). Total num frames: 4205838336. Throughput: 0: 42988.1. Samples: 484686500. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-28 15:49:12,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 15:49:13,951][09423] Updated weights for policy 0, policy_version 256707 (0.0030) [2024-06-28 15:49:16,679][09423] Updated weights for policy 0, policy_version 256717 (0.0025) [2024-06-28 15:49:17,922][09190] Fps is (10 sec: 42595.6, 60 sec: 42871.0, 300 sec: 43042.6). Total num frames: 4206084096. Throughput: 0: 43101.9. Samples: 484945460. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-28 15:49:17,923][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:49:21,485][09423] Updated weights for policy 0, policy_version 256727 (0.0047) [2024-06-28 15:49:22,921][09190] Fps is (10 sec: 45876.1, 60 sec: 42873.3, 300 sec: 43042.7). Total num frames: 4206297088. Throughput: 0: 43051.0. Samples: 485208640. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-28 15:49:22,922][09190] Avg episode reward: [(0, '0.738')] [2024-06-28 15:49:24,393][09423] Updated weights for policy 0, policy_version 256737 (0.0025) [2024-06-28 15:49:27,921][09190] Fps is (10 sec: 40962.8, 60 sec: 43417.7, 300 sec: 42987.2). Total num frames: 4206493696. Throughput: 0: 42979.9. Samples: 485335380. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-28 15:49:27,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 15:49:28,780][09423] Updated weights for policy 0, policy_version 256747 (0.0030) [2024-06-28 15:49:31,851][09423] Updated weights for policy 0, policy_version 256757 (0.0029) [2024-06-28 15:49:32,922][09190] Fps is (10 sec: 44236.1, 60 sec: 43144.4, 300 sec: 43042.7). Total num frames: 4206739456. Throughput: 0: 43003.9. Samples: 485597220. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-28 15:49:32,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:49:36,578][09423] Updated weights for policy 0, policy_version 256767 (0.0029) [2024-06-28 15:49:37,921][09190] Fps is (10 sec: 44237.3, 60 sec: 42873.3, 300 sec: 42987.5). Total num frames: 4206936064. Throughput: 0: 43070.6. Samples: 485855600. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 15:49:37,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 15:49:39,802][09423] Updated weights for policy 0, policy_version 256777 (0.0037) [2024-06-28 15:49:42,922][09190] Fps is (10 sec: 40959.6, 60 sec: 43417.5, 300 sec: 42987.5). Total num frames: 4207149056. Throughput: 0: 43066.9. Samples: 485984280. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 15:49:42,928][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:49:44,194][09423] Updated weights for policy 0, policy_version 256787 (0.0034) [2024-06-28 15:49:47,090][09423] Updated weights for policy 0, policy_version 256797 (0.0031) [2024-06-28 15:49:47,921][09190] Fps is (10 sec: 44236.7, 60 sec: 42871.4, 300 sec: 43042.7). Total num frames: 4207378432. Throughput: 0: 43010.8. Samples: 486235860. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 15:49:47,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 15:49:51,606][09423] Updated weights for policy 0, policy_version 256807 (0.0038) [2024-06-28 15:49:52,924][09190] Fps is (10 sec: 44224.1, 60 sec: 43142.3, 300 sec: 42986.7). Total num frames: 4207591424. Throughput: 0: 43168.3. Samples: 486498920. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 15:49:52,925][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 15:49:54,686][09423] Updated weights for policy 0, policy_version 256817 (0.0037) [2024-06-28 15:49:57,921][09190] Fps is (10 sec: 40960.2, 60 sec: 43144.5, 300 sec: 42987.2). Total num frames: 4207788032. Throughput: 0: 43133.0. Samples: 486627480. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 15:49:57,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 15:49:59,282][09423] Updated weights for policy 0, policy_version 256827 (0.0032) [2024-06-28 15:50:02,751][09423] Updated weights for policy 0, policy_version 256837 (0.0024) [2024-06-28 15:50:02,921][09190] Fps is (10 sec: 42611.2, 60 sec: 42871.5, 300 sec: 42987.2). Total num frames: 4208017408. Throughput: 0: 43097.2. Samples: 486884800. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 15:50:02,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:50:06,732][09423] Updated weights for policy 0, policy_version 256847 (0.0036) [2024-06-28 15:50:07,922][09190] Fps is (10 sec: 42597.6, 60 sec: 42598.4, 300 sec: 42987.1). Total num frames: 4208214016. Throughput: 0: 42855.8. Samples: 487137160. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 15:50:07,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:50:10,360][09423] Updated weights for policy 0, policy_version 256857 (0.0032) [2024-06-28 15:50:12,922][09190] Fps is (10 sec: 42598.1, 60 sec: 43417.6, 300 sec: 43042.7). Total num frames: 4208443392. Throughput: 0: 42865.3. Samples: 487264320. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 15:50:12,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 15:50:14,650][09423] Updated weights for policy 0, policy_version 256867 (0.0033) [2024-06-28 15:50:17,921][09190] Fps is (10 sec: 44237.4, 60 sec: 42872.0, 300 sec: 42987.2). Total num frames: 4208656384. Throughput: 0: 42644.1. Samples: 487516200. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 15:50:17,930][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 15:50:17,944][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000256876_4208656384.pth... [2024-06-28 15:50:18,014][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000256245_4198318080.pth [2024-06-28 15:50:18,157][09423] Updated weights for policy 0, policy_version 256877 (0.0042) [2024-06-28 15:50:22,443][09423] Updated weights for policy 0, policy_version 256887 (0.0044) [2024-06-28 15:50:22,921][09190] Fps is (10 sec: 42599.3, 60 sec: 42871.5, 300 sec: 42987.2). Total num frames: 4208869376. Throughput: 0: 42627.6. Samples: 487773840. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 15:50:22,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 15:50:24,725][09403] Signal inference workers to stop experience collection... (6700 times) [2024-06-28 15:50:24,727][09403] Signal inference workers to resume experience collection... (6700 times) [2024-06-28 15:50:24,736][09423] InferenceWorker_p0-w0: stopping experience collection (6700 times) [2024-06-28 15:50:24,769][09423] InferenceWorker_p0-w0: resuming experience collection (6700 times) [2024-06-28 15:50:25,428][09423] Updated weights for policy 0, policy_version 256897 (0.0036) [2024-06-28 15:50:27,921][09190] Fps is (10 sec: 42598.4, 60 sec: 43144.6, 300 sec: 42987.2). Total num frames: 4209082368. Throughput: 0: 42598.4. Samples: 487901200. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 15:50:27,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 15:50:29,832][09423] Updated weights for policy 0, policy_version 256907 (0.0034) [2024-06-28 15:50:32,921][09190] Fps is (10 sec: 44236.2, 60 sec: 42871.5, 300 sec: 42987.2). Total num frames: 4209311744. Throughput: 0: 42915.5. Samples: 488167060. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 15:50:32,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 15:50:33,405][09423] Updated weights for policy 0, policy_version 256917 (0.0036) [2024-06-28 15:50:37,357][09423] Updated weights for policy 0, policy_version 256927 (0.0039) [2024-06-28 15:50:37,921][09190] Fps is (10 sec: 40960.4, 60 sec: 42598.4, 300 sec: 42820.6). Total num frames: 4209491968. Throughput: 0: 42804.3. Samples: 488424980. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 15:50:37,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:50:40,981][09423] Updated weights for policy 0, policy_version 256937 (0.0022) [2024-06-28 15:50:42,921][09190] Fps is (10 sec: 42599.0, 60 sec: 43144.7, 300 sec: 43042.7). Total num frames: 4209737728. Throughput: 0: 42815.6. Samples: 488554180. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 15:50:42,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 15:50:44,683][09423] Updated weights for policy 0, policy_version 256947 (0.0031) [2024-06-28 15:50:47,922][09190] Fps is (10 sec: 44235.9, 60 sec: 42598.3, 300 sec: 42987.1). Total num frames: 4209934336. Throughput: 0: 42815.4. Samples: 488811500. Policy #0 lag: (min: 0.0, avg: 9.1, max: 21.0) [2024-06-28 15:50:47,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 15:50:48,442][09423] Updated weights for policy 0, policy_version 256957 (0.0033) [2024-06-28 15:50:52,610][09423] Updated weights for policy 0, policy_version 256967 (0.0030) [2024-06-28 15:50:52,921][09190] Fps is (10 sec: 42597.6, 60 sec: 42873.6, 300 sec: 43042.7). Total num frames: 4210163712. Throughput: 0: 42824.5. Samples: 489064260. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 15:50:52,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 15:50:56,306][09423] Updated weights for policy 0, policy_version 256977 (0.0030) [2024-06-28 15:50:57,921][09190] Fps is (10 sec: 42598.7, 60 sec: 42871.4, 300 sec: 42988.0). Total num frames: 4210360320. Throughput: 0: 42816.9. Samples: 489191080. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 15:50:57,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 15:51:00,412][09423] Updated weights for policy 0, policy_version 256987 (0.0029) [2024-06-28 15:51:02,921][09190] Fps is (10 sec: 42598.9, 60 sec: 42871.5, 300 sec: 42987.2). Total num frames: 4210589696. Throughput: 0: 43006.3. Samples: 489451480. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 15:51:02,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 15:51:03,687][09423] Updated weights for policy 0, policy_version 256997 (0.0032) [2024-06-28 15:51:07,921][09190] Fps is (10 sec: 42598.9, 60 sec: 42871.6, 300 sec: 42876.1). Total num frames: 4210786304. Throughput: 0: 43175.9. Samples: 489716760. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 15:51:07,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 15:51:07,971][09423] Updated weights for policy 0, policy_version 257007 (0.0033) [2024-06-28 15:51:11,237][09423] Updated weights for policy 0, policy_version 257017 (0.0030) [2024-06-28 15:51:12,921][09190] Fps is (10 sec: 39321.5, 60 sec: 42325.4, 300 sec: 42931.8). Total num frames: 4210982912. Throughput: 0: 43163.2. Samples: 489843540. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 15:51:12,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 15:51:15,391][09423] Updated weights for policy 0, policy_version 257027 (0.0040) [2024-06-28 15:51:17,921][09190] Fps is (10 sec: 44236.8, 60 sec: 42871.5, 300 sec: 42987.2). Total num frames: 4211228672. Throughput: 0: 42999.6. Samples: 490102040. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 15:51:17,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:51:18,823][09423] Updated weights for policy 0, policy_version 257037 (0.0043) [2024-06-28 15:51:22,921][09190] Fps is (10 sec: 45875.4, 60 sec: 42871.5, 300 sec: 42931.6). Total num frames: 4211441664. Throughput: 0: 43009.8. Samples: 490360420. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 15:51:22,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 15:51:22,945][09423] Updated weights for policy 0, policy_version 257047 (0.0042) [2024-06-28 15:51:26,766][09423] Updated weights for policy 0, policy_version 257057 (0.0028) [2024-06-28 15:51:27,922][09190] Fps is (10 sec: 40959.5, 60 sec: 42598.3, 300 sec: 42987.2). Total num frames: 4211638272. Throughput: 0: 43006.0. Samples: 490489460. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 15:51:27,935][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 15:51:30,525][09423] Updated weights for policy 0, policy_version 257067 (0.0025) [2024-06-28 15:51:32,921][09190] Fps is (10 sec: 44236.6, 60 sec: 42871.5, 300 sec: 42931.6). Total num frames: 4211884032. Throughput: 0: 42898.9. Samples: 490741940. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 15:51:32,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 15:51:34,498][09423] Updated weights for policy 0, policy_version 257077 (0.0036) [2024-06-28 15:51:37,921][09190] Fps is (10 sec: 45875.3, 60 sec: 43417.5, 300 sec: 42987.2). Total num frames: 4212097024. Throughput: 0: 43065.3. Samples: 491002200. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 15:51:37,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 15:51:38,111][09423] Updated weights for policy 0, policy_version 257087 (0.0040) [2024-06-28 15:51:41,855][09423] Updated weights for policy 0, policy_version 257097 (0.0032) [2024-06-28 15:51:42,921][09190] Fps is (10 sec: 42598.1, 60 sec: 42871.4, 300 sec: 43042.7). Total num frames: 4212310016. Throughput: 0: 43089.4. Samples: 491130100. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 15:51:42,922][09190] Avg episode reward: [(0, '0.735')] [2024-06-28 15:51:45,731][09423] Updated weights for policy 0, policy_version 257107 (0.0031) [2024-06-28 15:51:47,924][09190] Fps is (10 sec: 44226.0, 60 sec: 43415.9, 300 sec: 42931.3). Total num frames: 4212539392. Throughput: 0: 43214.4. Samples: 491396240. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 15:51:47,924][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 15:51:49,163][09423] Updated weights for policy 0, policy_version 257117 (0.0028) [2024-06-28 15:51:52,921][09190] Fps is (10 sec: 44237.0, 60 sec: 43144.6, 300 sec: 42987.2). Total num frames: 4212752384. Throughput: 0: 43074.7. Samples: 491655120. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 15:51:52,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 15:51:52,976][09423] Updated weights for policy 0, policy_version 257127 (0.0026) [2024-06-28 15:51:57,027][09423] Updated weights for policy 0, policy_version 257137 (0.0041) [2024-06-28 15:51:57,921][09190] Fps is (10 sec: 39331.6, 60 sec: 42871.5, 300 sec: 42987.2). Total num frames: 4212932608. Throughput: 0: 43157.3. Samples: 491785620. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 15:51:57,922][09190] Avg episode reward: [(0, '0.731')] [2024-06-28 15:52:00,383][09423] Updated weights for policy 0, policy_version 257147 (0.0030) [2024-06-28 15:52:02,921][09190] Fps is (10 sec: 40959.8, 60 sec: 42871.4, 300 sec: 42931.6). Total num frames: 4213161984. Throughput: 0: 43145.3. Samples: 492043580. Policy #0 lag: (min: 1.0, avg: 8.4, max: 21.0) [2024-06-28 15:52:02,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 15:52:03,505][09403] Signal inference workers to stop experience collection... (6750 times) [2024-06-28 15:52:03,509][09403] Signal inference workers to resume experience collection... (6750 times) [2024-06-28 15:52:03,545][09423] InferenceWorker_p0-w0: stopping experience collection (6750 times) [2024-06-28 15:52:03,545][09423] InferenceWorker_p0-w0: resuming experience collection (6750 times) [2024-06-28 15:52:04,917][09423] Updated weights for policy 0, policy_version 257157 (0.0039) [2024-06-28 15:52:07,921][09190] Fps is (10 sec: 45874.8, 60 sec: 43417.5, 300 sec: 43042.7). Total num frames: 4213391360. Throughput: 0: 43003.4. Samples: 492295580. Policy #0 lag: (min: 1.0, avg: 8.4, max: 21.0) [2024-06-28 15:52:07,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 15:52:08,319][09423] Updated weights for policy 0, policy_version 257167 (0.0037) [2024-06-28 15:52:12,339][09423] Updated weights for policy 0, policy_version 257177 (0.0032) [2024-06-28 15:52:12,921][09190] Fps is (10 sec: 42598.4, 60 sec: 43417.6, 300 sec: 42987.2). Total num frames: 4213587968. Throughput: 0: 43208.1. Samples: 492433820. Policy #0 lag: (min: 1.0, avg: 8.4, max: 21.0) [2024-06-28 15:52:12,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 15:52:15,874][09423] Updated weights for policy 0, policy_version 257187 (0.0040) [2024-06-28 15:52:17,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42871.4, 300 sec: 42876.1). Total num frames: 4213800960. Throughput: 0: 43046.5. Samples: 492679040. Policy #0 lag: (min: 1.0, avg: 8.4, max: 21.0) [2024-06-28 15:52:17,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 15:52:17,941][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000257190_4213800960.pth... [2024-06-28 15:52:17,996][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000256563_4203528192.pth [2024-06-28 15:52:19,789][09423] Updated weights for policy 0, policy_version 257197 (0.0042) [2024-06-28 15:52:22,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42871.4, 300 sec: 42876.1). Total num frames: 4214013952. Throughput: 0: 43069.8. Samples: 492940340. Policy #0 lag: (min: 1.0, avg: 8.4, max: 21.0) [2024-06-28 15:52:22,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 15:52:23,900][09423] Updated weights for policy 0, policy_version 257207 (0.0038) [2024-06-28 15:52:27,260][09423] Updated weights for policy 0, policy_version 257217 (0.0024) [2024-06-28 15:52:27,921][09190] Fps is (10 sec: 44237.0, 60 sec: 43417.6, 300 sec: 42987.2). Total num frames: 4214243328. Throughput: 0: 43106.7. Samples: 493069900. Policy #0 lag: (min: 1.0, avg: 8.4, max: 21.0) [2024-06-28 15:52:27,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 15:52:31,173][09423] Updated weights for policy 0, policy_version 257227 (0.0029) [2024-06-28 15:52:32,922][09190] Fps is (10 sec: 44236.1, 60 sec: 42871.3, 300 sec: 42931.6). Total num frames: 4214456320. Throughput: 0: 43006.7. Samples: 493331440. Policy #0 lag: (min: 1.0, avg: 8.4, max: 21.0) [2024-06-28 15:52:32,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 15:52:35,452][09423] Updated weights for policy 0, policy_version 257237 (0.0038) [2024-06-28 15:52:37,921][09190] Fps is (10 sec: 44236.7, 60 sec: 43144.5, 300 sec: 43042.7). Total num frames: 4214685696. Throughput: 0: 43130.6. Samples: 493596000. Policy #0 lag: (min: 1.0, avg: 8.4, max: 21.0) [2024-06-28 15:52:37,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:52:38,444][09423] Updated weights for policy 0, policy_version 257247 (0.0033) [2024-06-28 15:52:42,921][09190] Fps is (10 sec: 42599.5, 60 sec: 42871.6, 300 sec: 42987.2). Total num frames: 4214882304. Throughput: 0: 43133.4. Samples: 493726620. Policy #0 lag: (min: 1.0, avg: 8.4, max: 21.0) [2024-06-28 15:52:42,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 15:52:42,952][09423] Updated weights for policy 0, policy_version 257257 (0.0028) [2024-06-28 15:52:46,021][09423] Updated weights for policy 0, policy_version 257267 (0.0028) [2024-06-28 15:52:47,921][09190] Fps is (10 sec: 42598.7, 60 sec: 42873.3, 300 sec: 42931.6). Total num frames: 4215111680. Throughput: 0: 43020.0. Samples: 493979480. Policy #0 lag: (min: 1.0, avg: 8.4, max: 21.0) [2024-06-28 15:52:47,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 15:52:50,675][09423] Updated weights for policy 0, policy_version 257277 (0.0022) [2024-06-28 15:52:52,921][09190] Fps is (10 sec: 44236.4, 60 sec: 42871.5, 300 sec: 43042.7). Total num frames: 4215324672. Throughput: 0: 43202.7. Samples: 494239700. Policy #0 lag: (min: 1.0, avg: 8.4, max: 21.0) [2024-06-28 15:52:52,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 15:52:53,601][09423] Updated weights for policy 0, policy_version 257287 (0.0038) [2024-06-28 15:52:57,921][09190] Fps is (10 sec: 42598.0, 60 sec: 43417.5, 300 sec: 42987.2). Total num frames: 4215537664. Throughput: 0: 42824.0. Samples: 494360900. Policy #0 lag: (min: 1.0, avg: 8.4, max: 21.0) [2024-06-28 15:52:57,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 15:52:58,086][09423] Updated weights for policy 0, policy_version 257297 (0.0038) [2024-06-28 15:53:01,516][09423] Updated weights for policy 0, policy_version 257307 (0.0034) [2024-06-28 15:53:02,924][09190] Fps is (10 sec: 42587.8, 60 sec: 43142.8, 300 sec: 42931.3). Total num frames: 4215750656. Throughput: 0: 42998.6. Samples: 494614080. Policy #0 lag: (min: 1.0, avg: 8.4, max: 21.0) [2024-06-28 15:53:02,924][09190] Avg episode reward: [(0, '0.734')] [2024-06-28 15:53:06,124][09423] Updated weights for policy 0, policy_version 257317 (0.0032) [2024-06-28 15:53:07,921][09190] Fps is (10 sec: 44237.3, 60 sec: 43144.6, 300 sec: 43042.7). Total num frames: 4215980032. Throughput: 0: 43236.1. Samples: 494885960. Policy #0 lag: (min: 1.0, avg: 8.4, max: 21.0) [2024-06-28 15:53:07,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 15:53:08,780][09423] Updated weights for policy 0, policy_version 257327 (0.0042) [2024-06-28 15:53:12,921][09190] Fps is (10 sec: 42608.8, 60 sec: 43144.5, 300 sec: 42931.6). Total num frames: 4216176640. Throughput: 0: 43266.6. Samples: 495016900. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 15:53:12,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:53:13,561][09423] Updated weights for policy 0, policy_version 257337 (0.0041) [2024-06-28 15:53:14,688][09403] Signal inference workers to stop experience collection... (6800 times) [2024-06-28 15:53:14,735][09423] InferenceWorker_p0-w0: stopping experience collection (6800 times) [2024-06-28 15:53:14,740][09403] Signal inference workers to resume experience collection... (6800 times) [2024-06-28 15:53:14,747][09423] InferenceWorker_p0-w0: resuming experience collection (6800 times) [2024-06-28 15:53:16,199][09423] Updated weights for policy 0, policy_version 257347 (0.0031) [2024-06-28 15:53:17,921][09190] Fps is (10 sec: 40959.9, 60 sec: 43144.6, 300 sec: 42932.0). Total num frames: 4216389632. Throughput: 0: 43179.3. Samples: 495274500. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 15:53:17,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:53:20,861][09423] Updated weights for policy 0, policy_version 257357 (0.0033) [2024-06-28 15:53:22,921][09190] Fps is (10 sec: 44237.3, 60 sec: 43417.7, 300 sec: 43153.8). Total num frames: 4216619008. Throughput: 0: 43134.3. Samples: 495537040. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 15:53:22,922][09190] Avg episode reward: [(0, '0.735')] [2024-06-28 15:53:24,113][09423] Updated weights for policy 0, policy_version 257367 (0.0029) [2024-06-28 15:53:27,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42871.5, 300 sec: 42931.6). Total num frames: 4216815616. Throughput: 0: 42918.2. Samples: 495657940. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 15:53:27,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 15:53:28,865][09423] Updated weights for policy 0, policy_version 257377 (0.0037) [2024-06-28 15:53:31,574][09423] Updated weights for policy 0, policy_version 257387 (0.0040) [2024-06-28 15:53:32,921][09190] Fps is (10 sec: 44236.7, 60 sec: 43417.8, 300 sec: 43043.1). Total num frames: 4217061376. Throughput: 0: 42875.6. Samples: 495908880. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 15:53:32,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 15:53:36,311][09423] Updated weights for policy 0, policy_version 257397 (0.0040) [2024-06-28 15:53:37,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42598.5, 300 sec: 43042.7). Total num frames: 4217241600. Throughput: 0: 43078.7. Samples: 496178240. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 15:53:37,922][09190] Avg episode reward: [(0, '0.734')] [2024-06-28 15:53:39,614][09423] Updated weights for policy 0, policy_version 257407 (0.0031) [2024-06-28 15:53:42,921][09190] Fps is (10 sec: 39321.4, 60 sec: 42871.4, 300 sec: 42876.1). Total num frames: 4217454592. Throughput: 0: 43168.5. Samples: 496303480. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 15:53:42,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 15:53:44,049][09423] Updated weights for policy 0, policy_version 257417 (0.0032) [2024-06-28 15:53:46,963][09423] Updated weights for policy 0, policy_version 257427 (0.0042) [2024-06-28 15:53:47,921][09190] Fps is (10 sec: 44236.9, 60 sec: 42871.5, 300 sec: 42987.2). Total num frames: 4217683968. Throughput: 0: 43265.1. Samples: 496560900. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 15:53:47,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 15:53:51,776][09423] Updated weights for policy 0, policy_version 257437 (0.0026) [2024-06-28 15:53:52,924][09190] Fps is (10 sec: 44225.8, 60 sec: 42869.7, 300 sec: 43042.3). Total num frames: 4217896960. Throughput: 0: 43284.7. Samples: 496833880. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 15:53:52,924][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 15:53:54,376][09423] Updated weights for policy 0, policy_version 257447 (0.0052) [2024-06-28 15:53:57,921][09190] Fps is (10 sec: 42598.2, 60 sec: 42871.5, 300 sec: 42931.7). Total num frames: 4218109952. Throughput: 0: 43117.4. Samples: 496957180. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 15:53:57,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 15:53:59,271][09423] Updated weights for policy 0, policy_version 257457 (0.0028) [2024-06-28 15:54:02,341][09423] Updated weights for policy 0, policy_version 257467 (0.0032) [2024-06-28 15:54:02,921][09190] Fps is (10 sec: 44247.4, 60 sec: 43146.3, 300 sec: 42987.2). Total num frames: 4218339328. Throughput: 0: 42993.2. Samples: 497209200. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 15:54:02,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 15:54:06,702][09423] Updated weights for policy 0, policy_version 257477 (0.0023) [2024-06-28 15:54:07,922][09190] Fps is (10 sec: 42597.7, 60 sec: 42598.3, 300 sec: 43042.7). Total num frames: 4218535936. Throughput: 0: 43067.8. Samples: 497475100. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 15:54:07,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 15:54:09,939][09423] Updated weights for policy 0, policy_version 257487 (0.0029) [2024-06-28 15:54:12,921][09190] Fps is (10 sec: 44237.2, 60 sec: 43417.6, 300 sec: 43042.8). Total num frames: 4218781696. Throughput: 0: 43085.3. Samples: 497596780. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 15:54:12,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:54:14,523][09423] Updated weights for policy 0, policy_version 257497 (0.0034) [2024-06-28 15:54:17,355][09423] Updated weights for policy 0, policy_version 257507 (0.0029) [2024-06-28 15:54:17,921][09190] Fps is (10 sec: 45876.1, 60 sec: 43417.6, 300 sec: 43042.7). Total num frames: 4218994688. Throughput: 0: 43385.8. Samples: 497861240. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 15:54:17,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 15:54:17,963][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000257508_4219011072.pth... [2024-06-28 15:54:18,019][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000256876_4208656384.pth [2024-06-28 15:54:21,831][09403] Signal inference workers to stop experience collection... (6850 times) [2024-06-28 15:54:21,862][09423] InferenceWorker_p0-w0: stopping experience collection (6850 times) [2024-06-28 15:54:21,890][09403] Signal inference workers to resume experience collection... (6850 times) [2024-06-28 15:54:21,890][09423] InferenceWorker_p0-w0: resuming experience collection (6850 times) [2024-06-28 15:54:22,055][09423] Updated weights for policy 0, policy_version 257517 (0.0043) [2024-06-28 15:54:22,921][09190] Fps is (10 sec: 42598.3, 60 sec: 43144.5, 300 sec: 43098.3). Total num frames: 4219207680. Throughput: 0: 43096.4. Samples: 498117580. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 15:54:22,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 15:54:24,981][09423] Updated weights for policy 0, policy_version 257527 (0.0033) [2024-06-28 15:54:27,921][09190] Fps is (10 sec: 42598.4, 60 sec: 43417.6, 300 sec: 42987.2). Total num frames: 4219420672. Throughput: 0: 43217.8. Samples: 498248280. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 15:54:27,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 15:54:29,463][09423] Updated weights for policy 0, policy_version 257537 (0.0034) [2024-06-28 15:54:32,368][09423] Updated weights for policy 0, policy_version 257547 (0.0039) [2024-06-28 15:54:32,921][09190] Fps is (10 sec: 44236.6, 60 sec: 43144.4, 300 sec: 43098.2). Total num frames: 4219650048. Throughput: 0: 43260.8. Samples: 498507640. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 15:54:32,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 15:54:37,290][09423] Updated weights for policy 0, policy_version 257557 (0.0040) [2024-06-28 15:54:37,921][09190] Fps is (10 sec: 42597.9, 60 sec: 43417.5, 300 sec: 43042.7). Total num frames: 4219846656. Throughput: 0: 43063.7. Samples: 498771640. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 15:54:37,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 15:54:40,091][09423] Updated weights for policy 0, policy_version 257567 (0.0032) [2024-06-28 15:54:42,921][09190] Fps is (10 sec: 40960.4, 60 sec: 43417.6, 300 sec: 42987.2). Total num frames: 4220059648. Throughput: 0: 43044.0. Samples: 498894160. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 15:54:42,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 15:54:44,643][09423] Updated weights for policy 0, policy_version 257577 (0.0023) [2024-06-28 15:54:47,667][09423] Updated weights for policy 0, policy_version 257587 (0.0034) [2024-06-28 15:54:47,921][09190] Fps is (10 sec: 45875.6, 60 sec: 43690.6, 300 sec: 43098.7). Total num frames: 4220305408. Throughput: 0: 43262.3. Samples: 499156000. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 15:54:47,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 15:54:52,466][09423] Updated weights for policy 0, policy_version 257597 (0.0025) [2024-06-28 15:54:52,921][09190] Fps is (10 sec: 42598.4, 60 sec: 43146.3, 300 sec: 43042.7). Total num frames: 4220485632. Throughput: 0: 43175.3. Samples: 499417980. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 15:54:52,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:54:55,482][09423] Updated weights for policy 0, policy_version 257607 (0.0049) [2024-06-28 15:54:57,921][09190] Fps is (10 sec: 40959.9, 60 sec: 43417.6, 300 sec: 43042.7). Total num frames: 4220715008. Throughput: 0: 43127.5. Samples: 499537520. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 15:54:57,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 15:55:00,169][09423] Updated weights for policy 0, policy_version 257617 (0.0032) [2024-06-28 15:55:02,921][09190] Fps is (10 sec: 45875.4, 60 sec: 43417.7, 300 sec: 43153.8). Total num frames: 4220944384. Throughput: 0: 43012.9. Samples: 499796820. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 15:55:02,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 15:55:03,027][09423] Updated weights for policy 0, policy_version 257627 (0.0024) [2024-06-28 15:55:07,483][09423] Updated weights for policy 0, policy_version 257637 (0.0036) [2024-06-28 15:55:07,921][09190] Fps is (10 sec: 42598.0, 60 sec: 43417.6, 300 sec: 43042.7). Total num frames: 4221140992. Throughput: 0: 43310.2. Samples: 500066540. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 15:55:07,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 15:55:10,459][09423] Updated weights for policy 0, policy_version 257647 (0.0038) [2024-06-28 15:55:12,921][09190] Fps is (10 sec: 40960.1, 60 sec: 42871.5, 300 sec: 43042.7). Total num frames: 4221353984. Throughput: 0: 43095.6. Samples: 500187580. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 15:55:12,922][09190] Avg episode reward: [(0, '0.734')] [2024-06-28 15:55:15,331][09423] Updated weights for policy 0, policy_version 257657 (0.0042) [2024-06-28 15:55:17,773][09423] Updated weights for policy 0, policy_version 257667 (0.0033) [2024-06-28 15:55:17,921][09190] Fps is (10 sec: 47513.8, 60 sec: 43690.6, 300 sec: 43209.3). Total num frames: 4221616128. Throughput: 0: 43216.5. Samples: 500452380. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 15:55:17,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 15:55:22,877][09423] Updated weights for policy 0, policy_version 257677 (0.0034) [2024-06-28 15:55:22,922][09190] Fps is (10 sec: 42597.6, 60 sec: 42871.4, 300 sec: 43042.7). Total num frames: 4221779968. Throughput: 0: 43156.0. Samples: 500713660. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 15:55:22,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 15:55:25,605][09423] Updated weights for policy 0, policy_version 257687 (0.0023) [2024-06-28 15:55:27,921][09190] Fps is (10 sec: 39322.1, 60 sec: 43144.5, 300 sec: 43042.7). Total num frames: 4222009344. Throughput: 0: 43072.9. Samples: 500832440. Policy #0 lag: (min: 0.0, avg: 10.4, max: 21.0) [2024-06-28 15:55:27,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 15:55:30,666][09423] Updated weights for policy 0, policy_version 257697 (0.0026) [2024-06-28 15:55:32,921][09190] Fps is (10 sec: 45875.7, 60 sec: 43144.6, 300 sec: 43209.3). Total num frames: 4222238720. Throughput: 0: 43103.5. Samples: 501095660. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 15:55:32,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:55:33,159][09423] Updated weights for policy 0, policy_version 257707 (0.0038) [2024-06-28 15:55:37,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42871.5, 300 sec: 42987.2). Total num frames: 4222418944. Throughput: 0: 43056.9. Samples: 501355540. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 15:55:37,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 15:55:38,034][09423] Updated weights for policy 0, policy_version 257717 (0.0031) [2024-06-28 15:55:41,193][09423] Updated weights for policy 0, policy_version 257727 (0.0024) [2024-06-28 15:55:42,924][09190] Fps is (10 sec: 40947.6, 60 sec: 43142.3, 300 sec: 43097.8). Total num frames: 4222648320. Throughput: 0: 43061.5. Samples: 501475420. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 15:55:42,925][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 15:55:45,354][09423] Updated weights for policy 0, policy_version 257737 (0.0027) [2024-06-28 15:55:47,928][09190] Fps is (10 sec: 45845.4, 60 sec: 42866.8, 300 sec: 43097.3). Total num frames: 4222877696. Throughput: 0: 43209.3. Samples: 501741520. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 15:55:47,928][09190] Avg episode reward: [(0, '0.738')] [2024-06-28 15:55:48,569][09423] Updated weights for policy 0, policy_version 257747 (0.0033) [2024-06-28 15:55:52,921][09190] Fps is (10 sec: 40972.4, 60 sec: 42871.4, 300 sec: 43042.7). Total num frames: 4223057920. Throughput: 0: 43097.0. Samples: 502005900. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 15:55:52,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 15:55:53,124][09423] Updated weights for policy 0, policy_version 257757 (0.0034) [2024-06-28 15:55:56,039][09423] Updated weights for policy 0, policy_version 257767 (0.0031) [2024-06-28 15:55:57,921][09190] Fps is (10 sec: 42626.2, 60 sec: 43144.6, 300 sec: 43098.3). Total num frames: 4223303680. Throughput: 0: 43127.1. Samples: 502128300. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 15:55:57,922][09190] Avg episode reward: [(0, '0.711')] [2024-06-28 15:56:00,513][09423] Updated weights for policy 0, policy_version 257777 (0.0036) [2024-06-28 15:56:02,921][09190] Fps is (10 sec: 45875.2, 60 sec: 42871.4, 300 sec: 43153.8). Total num frames: 4223516672. Throughput: 0: 43091.1. Samples: 502391480. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 15:56:02,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 15:56:03,816][09423] Updated weights for policy 0, policy_version 257787 (0.0030) [2024-06-28 15:56:07,921][09190] Fps is (10 sec: 40959.8, 60 sec: 42871.5, 300 sec: 43153.8). Total num frames: 4223713280. Throughput: 0: 43109.4. Samples: 502653580. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 15:56:07,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 15:56:08,546][09403] Signal inference workers to stop experience collection... (6900 times) [2024-06-28 15:56:08,547][09403] Signal inference workers to resume experience collection... (6900 times) [2024-06-28 15:56:08,550][09423] Updated weights for policy 0, policy_version 257797 (0.0034) [2024-06-28 15:56:08,564][09423] InferenceWorker_p0-w0: stopping experience collection (6900 times) [2024-06-28 15:56:08,564][09423] InferenceWorker_p0-w0: resuming experience collection (6900 times) [2024-06-28 15:56:11,324][09423] Updated weights for policy 0, policy_version 257807 (0.0040) [2024-06-28 15:56:12,921][09190] Fps is (10 sec: 42599.0, 60 sec: 43144.6, 300 sec: 43098.3). Total num frames: 4223942656. Throughput: 0: 43124.5. Samples: 502773040. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 15:56:12,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 15:56:15,981][09423] Updated weights for policy 0, policy_version 257817 (0.0034) [2024-06-28 15:56:17,922][09190] Fps is (10 sec: 45874.5, 60 sec: 42598.3, 300 sec: 43153.8). Total num frames: 4224172032. Throughput: 0: 43112.3. Samples: 503035720. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 15:56:17,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 15:56:17,929][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000257823_4224172032.pth... [2024-06-28 15:56:17,989][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000257190_4213800960.pth [2024-06-28 15:56:18,938][09423] Updated weights for policy 0, policy_version 257827 (0.0040) [2024-06-28 15:56:22,921][09190] Fps is (10 sec: 42598.0, 60 sec: 43144.6, 300 sec: 43153.8). Total num frames: 4224368640. Throughput: 0: 43076.4. Samples: 503293980. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 15:56:22,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:56:23,893][09423] Updated weights for policy 0, policy_version 257837 (0.0024) [2024-06-28 15:56:26,755][09423] Updated weights for policy 0, policy_version 257847 (0.0032) [2024-06-28 15:56:27,921][09190] Fps is (10 sec: 40960.7, 60 sec: 42871.4, 300 sec: 43042.7). Total num frames: 4224581632. Throughput: 0: 43204.3. Samples: 503419480. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 15:56:27,922][09190] Avg episode reward: [(0, '0.788')] [2024-06-28 15:56:31,286][09423] Updated weights for policy 0, policy_version 257857 (0.0034) [2024-06-28 15:56:32,922][09190] Fps is (10 sec: 45874.4, 60 sec: 43144.4, 300 sec: 43153.8). Total num frames: 4224827392. Throughput: 0: 43114.9. Samples: 503681420. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 15:56:32,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 15:56:34,375][09423] Updated weights for policy 0, policy_version 257867 (0.0022) [2024-06-28 15:56:37,921][09190] Fps is (10 sec: 44237.2, 60 sec: 43417.7, 300 sec: 43098.3). Total num frames: 4225024000. Throughput: 0: 42934.8. Samples: 503937960. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 15:56:37,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 15:56:38,854][09423] Updated weights for policy 0, policy_version 257877 (0.0029) [2024-06-28 15:56:41,622][09423] Updated weights for policy 0, policy_version 257887 (0.0037) [2024-06-28 15:56:42,921][09190] Fps is (10 sec: 40960.6, 60 sec: 43146.7, 300 sec: 43043.1). Total num frames: 4225236992. Throughput: 0: 43007.5. Samples: 504063640. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 15:56:42,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 15:56:46,587][09423] Updated weights for policy 0, policy_version 257897 (0.0030) [2024-06-28 15:56:47,921][09190] Fps is (10 sec: 42598.2, 60 sec: 42876.1, 300 sec: 43042.7). Total num frames: 4225449984. Throughput: 0: 43018.3. Samples: 504327300. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 15:56:47,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 15:56:49,461][09423] Updated weights for policy 0, policy_version 257907 (0.0035) [2024-06-28 15:56:52,924][09190] Fps is (10 sec: 40949.8, 60 sec: 43142.8, 300 sec: 43097.9). Total num frames: 4225646592. Throughput: 0: 42842.5. Samples: 504581600. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 15:56:52,924][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 15:56:53,948][09423] Updated weights for policy 0, policy_version 257917 (0.0037) [2024-06-28 15:56:57,149][09423] Updated weights for policy 0, policy_version 257927 (0.0028) [2024-06-28 15:56:57,922][09190] Fps is (10 sec: 44235.9, 60 sec: 43144.4, 300 sec: 43153.8). Total num frames: 4225892352. Throughput: 0: 43057.5. Samples: 504710640. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 15:56:57,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 15:57:01,593][09423] Updated weights for policy 0, policy_version 257937 (0.0027) [2024-06-28 15:57:02,921][09190] Fps is (10 sec: 44247.7, 60 sec: 42871.5, 300 sec: 43042.7). Total num frames: 4226088960. Throughput: 0: 43044.1. Samples: 504972700. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 15:57:02,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 15:57:04,774][09423] Updated weights for policy 0, policy_version 257947 (0.0035) [2024-06-28 15:57:07,921][09190] Fps is (10 sec: 40960.4, 60 sec: 43144.5, 300 sec: 43098.2). Total num frames: 4226301952. Throughput: 0: 42966.2. Samples: 505227460. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 15:57:07,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:57:09,052][09423] Updated weights for policy 0, policy_version 257957 (0.0036) [2024-06-28 15:57:12,401][09423] Updated weights for policy 0, policy_version 257967 (0.0043) [2024-06-28 15:57:12,921][09190] Fps is (10 sec: 45875.4, 60 sec: 43417.5, 300 sec: 43209.3). Total num frames: 4226547712. Throughput: 0: 42999.6. Samples: 505354460. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 15:57:12,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 15:57:16,905][09423] Updated weights for policy 0, policy_version 257977 (0.0038) [2024-06-28 15:57:17,921][09190] Fps is (10 sec: 42598.8, 60 sec: 42598.6, 300 sec: 43098.3). Total num frames: 4226727936. Throughput: 0: 43164.2. Samples: 505623800. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 15:57:17,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 15:57:20,159][09423] Updated weights for policy 0, policy_version 257987 (0.0038) [2024-06-28 15:57:22,922][09190] Fps is (10 sec: 40957.4, 60 sec: 43144.1, 300 sec: 43098.2). Total num frames: 4226957312. Throughput: 0: 43006.4. Samples: 505873280. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 15:57:22,923][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:57:24,685][09423] Updated weights for policy 0, policy_version 257997 (0.0036) [2024-06-28 15:57:27,709][09423] Updated weights for policy 0, policy_version 258007 (0.0021) [2024-06-28 15:57:27,921][09190] Fps is (10 sec: 45875.2, 60 sec: 43417.6, 300 sec: 43153.8). Total num frames: 4227186688. Throughput: 0: 43053.0. Samples: 506001020. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 15:57:27,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 15:57:32,350][09423] Updated weights for policy 0, policy_version 258017 (0.0032) [2024-06-28 15:57:32,924][09190] Fps is (10 sec: 42591.0, 60 sec: 42596.8, 300 sec: 43042.4). Total num frames: 4227383296. Throughput: 0: 43003.0. Samples: 506262540. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 15:57:32,924][09190] Avg episode reward: [(0, '0.730')] [2024-06-28 15:57:35,441][09423] Updated weights for policy 0, policy_version 258027 (0.0027) [2024-06-28 15:57:37,921][09190] Fps is (10 sec: 40959.4, 60 sec: 42871.3, 300 sec: 43098.2). Total num frames: 4227596288. Throughput: 0: 42975.2. Samples: 506515380. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 15:57:37,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 15:57:39,753][09423] Updated weights for policy 0, policy_version 258037 (0.0020) [2024-06-28 15:57:42,617][09403] Signal inference workers to stop experience collection... (6950 times) [2024-06-28 15:57:42,617][09403] Signal inference workers to resume experience collection... (6950 times) [2024-06-28 15:57:42,645][09423] InferenceWorker_p0-w0: stopping experience collection (6950 times) [2024-06-28 15:57:42,645][09423] InferenceWorker_p0-w0: resuming experience collection (6950 times) [2024-06-28 15:57:42,899][09423] Updated weights for policy 0, policy_version 258047 (0.0036) [2024-06-28 15:57:42,921][09190] Fps is (10 sec: 45885.8, 60 sec: 43417.6, 300 sec: 43153.8). Total num frames: 4227842048. Throughput: 0: 43103.6. Samples: 506650300. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 15:57:42,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:57:47,420][09423] Updated weights for policy 0, policy_version 258057 (0.0032) [2024-06-28 15:57:47,921][09190] Fps is (10 sec: 42598.6, 60 sec: 42871.4, 300 sec: 43042.7). Total num frames: 4228022272. Throughput: 0: 42880.9. Samples: 506902340. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 15:57:47,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 15:57:50,640][09423] Updated weights for policy 0, policy_version 258067 (0.0026) [2024-06-28 15:57:52,921][09190] Fps is (10 sec: 37683.3, 60 sec: 42873.2, 300 sec: 42987.2). Total num frames: 4228218880. Throughput: 0: 43047.1. Samples: 507164580. Policy #0 lag: (min: 0.0, avg: 11.0, max: 20.0) [2024-06-28 15:57:52,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 15:57:55,166][09423] Updated weights for policy 0, policy_version 258077 (0.0032) [2024-06-28 15:57:57,921][09190] Fps is (10 sec: 44236.7, 60 sec: 42871.5, 300 sec: 43098.6). Total num frames: 4228464640. Throughput: 0: 43073.3. Samples: 507292760. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 15:57:57,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 15:57:58,152][09423] Updated weights for policy 0, policy_version 258087 (0.0037) [2024-06-28 15:58:02,765][09423] Updated weights for policy 0, policy_version 258097 (0.0038) [2024-06-28 15:58:02,921][09190] Fps is (10 sec: 44236.5, 60 sec: 42871.4, 300 sec: 42987.2). Total num frames: 4228661248. Throughput: 0: 42768.7. Samples: 507548400. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 15:58:02,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 15:58:05,926][09423] Updated weights for policy 0, policy_version 258107 (0.0036) [2024-06-28 15:58:07,921][09190] Fps is (10 sec: 42598.8, 60 sec: 43144.6, 300 sec: 43098.3). Total num frames: 4228890624. Throughput: 0: 42930.9. Samples: 507805140. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 15:58:07,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 15:58:10,090][09423] Updated weights for policy 0, policy_version 258117 (0.0028) [2024-06-28 15:58:12,921][09190] Fps is (10 sec: 44237.1, 60 sec: 42598.4, 300 sec: 43098.2). Total num frames: 4229103616. Throughput: 0: 43054.1. Samples: 507938460. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 15:58:12,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 15:58:13,675][09423] Updated weights for policy 0, policy_version 258127 (0.0027) [2024-06-28 15:58:17,815][09423] Updated weights for policy 0, policy_version 258137 (0.0039) [2024-06-28 15:58:17,921][09190] Fps is (10 sec: 42598.4, 60 sec: 43144.5, 300 sec: 43042.7). Total num frames: 4229316608. Throughput: 0: 42910.3. Samples: 508193400. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 15:58:17,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 15:58:17,933][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000258137_4229316608.pth... [2024-06-28 15:58:17,983][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000257508_4219011072.pth [2024-06-28 15:58:21,120][09423] Updated weights for policy 0, policy_version 258147 (0.0036) [2024-06-28 15:58:22,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42871.9, 300 sec: 43098.2). Total num frames: 4229529600. Throughput: 0: 43013.4. Samples: 508450980. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 15:58:22,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 15:58:25,241][09423] Updated weights for policy 0, policy_version 258157 (0.0033) [2024-06-28 15:58:27,921][09190] Fps is (10 sec: 42597.8, 60 sec: 42598.3, 300 sec: 42987.2). Total num frames: 4229742592. Throughput: 0: 42863.1. Samples: 508579140. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 15:58:27,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 15:58:28,613][09423] Updated weights for policy 0, policy_version 258167 (0.0044) [2024-06-28 15:58:32,921][09190] Fps is (10 sec: 42598.8, 60 sec: 42873.2, 300 sec: 43098.3). Total num frames: 4229955584. Throughput: 0: 43130.3. Samples: 508843200. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 15:58:32,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:58:33,033][09423] Updated weights for policy 0, policy_version 258177 (0.0025) [2024-06-28 15:58:36,270][09423] Updated weights for policy 0, policy_version 258187 (0.0038) [2024-06-28 15:58:37,921][09190] Fps is (10 sec: 44237.2, 60 sec: 43144.6, 300 sec: 43153.8). Total num frames: 4230184960. Throughput: 0: 42874.7. Samples: 509093940. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 15:58:37,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 15:58:40,512][09423] Updated weights for policy 0, policy_version 258197 (0.0027) [2024-06-28 15:58:42,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42325.4, 300 sec: 43042.7). Total num frames: 4230381568. Throughput: 0: 42865.0. Samples: 509221680. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 15:58:42,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 15:58:44,174][09423] Updated weights for policy 0, policy_version 258207 (0.0037) [2024-06-28 15:58:47,922][09190] Fps is (10 sec: 42597.8, 60 sec: 43144.5, 300 sec: 43098.6). Total num frames: 4230610944. Throughput: 0: 42844.0. Samples: 509476380. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 15:58:47,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 15:58:47,962][09423] Updated weights for policy 0, policy_version 258217 (0.0026) [2024-06-28 15:58:51,891][09423] Updated weights for policy 0, policy_version 258227 (0.0028) [2024-06-28 15:58:52,921][09190] Fps is (10 sec: 45875.2, 60 sec: 43690.7, 300 sec: 43153.8). Total num frames: 4230840320. Throughput: 0: 42985.8. Samples: 509739500. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 15:58:52,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 15:58:55,655][09423] Updated weights for policy 0, policy_version 258237 (0.0031) [2024-06-28 15:58:57,921][09190] Fps is (10 sec: 39322.3, 60 sec: 42325.4, 300 sec: 42931.7). Total num frames: 4231004160. Throughput: 0: 42902.7. Samples: 509869080. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 15:58:57,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 15:58:59,274][09423] Updated weights for policy 0, policy_version 258247 (0.0026) [2024-06-28 15:59:02,921][09190] Fps is (10 sec: 40959.8, 60 sec: 43144.6, 300 sec: 43098.3). Total num frames: 4231249920. Throughput: 0: 42969.7. Samples: 510127040. Policy #0 lag: (min: 0.0, avg: 8.7, max: 22.0) [2024-06-28 15:59:02,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 15:59:03,060][09403] Signal inference workers to stop experience collection... (7000 times) [2024-06-28 15:59:03,104][09423] InferenceWorker_p0-w0: stopping experience collection (7000 times) [2024-06-28 15:59:03,112][09403] Signal inference workers to resume experience collection... (7000 times) [2024-06-28 15:59:03,117][09423] InferenceWorker_p0-w0: resuming experience collection (7000 times) [2024-06-28 15:59:03,252][09423] Updated weights for policy 0, policy_version 258257 (0.0040) [2024-06-28 15:59:06,764][09423] Updated weights for policy 0, policy_version 258267 (0.0035) [2024-06-28 15:59:07,922][09190] Fps is (10 sec: 47512.3, 60 sec: 43144.3, 300 sec: 43042.7). Total num frames: 4231479296. Throughput: 0: 42930.1. Samples: 510382840. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 15:59:07,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 15:59:10,665][09423] Updated weights for policy 0, policy_version 258277 (0.0044) [2024-06-28 15:59:12,921][09190] Fps is (10 sec: 40960.3, 60 sec: 42598.5, 300 sec: 42931.6). Total num frames: 4231659520. Throughput: 0: 42904.6. Samples: 510509840. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 15:59:12,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 15:59:14,416][09423] Updated weights for policy 0, policy_version 258287 (0.0043) [2024-06-28 15:59:17,921][09190] Fps is (10 sec: 42599.0, 60 sec: 43144.4, 300 sec: 43042.7). Total num frames: 4231905280. Throughput: 0: 42889.6. Samples: 510773240. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 15:59:17,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 15:59:18,523][09423] Updated weights for policy 0, policy_version 258297 (0.0027) [2024-06-28 15:59:22,209][09423] Updated weights for policy 0, policy_version 258307 (0.0028) [2024-06-28 15:59:22,921][09190] Fps is (10 sec: 47512.9, 60 sec: 43417.6, 300 sec: 43098.2). Total num frames: 4232134656. Throughput: 0: 43131.9. Samples: 511034880. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 15:59:22,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 15:59:25,818][09423] Updated weights for policy 0, policy_version 258317 (0.0033) [2024-06-28 15:59:27,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42871.5, 300 sec: 42931.6). Total num frames: 4232314880. Throughput: 0: 43059.5. Samples: 511159360. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 15:59:27,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 15:59:29,988][09423] Updated weights for policy 0, policy_version 258327 (0.0026) [2024-06-28 15:59:32,921][09190] Fps is (10 sec: 42598.9, 60 sec: 43417.6, 300 sec: 43098.3). Total num frames: 4232560640. Throughput: 0: 43244.2. Samples: 511422360. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 15:59:32,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 15:59:33,517][09423] Updated weights for policy 0, policy_version 258337 (0.0034) [2024-06-28 15:59:37,368][09423] Updated weights for policy 0, policy_version 258347 (0.0032) [2024-06-28 15:59:37,922][09190] Fps is (10 sec: 45874.8, 60 sec: 43144.4, 300 sec: 43098.2). Total num frames: 4232773632. Throughput: 0: 43215.9. Samples: 511684220. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 15:59:37,931][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 15:59:41,130][09423] Updated weights for policy 0, policy_version 258357 (0.0031) [2024-06-28 15:59:42,921][09190] Fps is (10 sec: 39321.4, 60 sec: 42871.4, 300 sec: 42876.1). Total num frames: 4232953856. Throughput: 0: 43231.9. Samples: 511814520. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 15:59:42,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 15:59:44,791][09423] Updated weights for policy 0, policy_version 258367 (0.0029) [2024-06-28 15:59:47,921][09190] Fps is (10 sec: 40960.9, 60 sec: 42871.6, 300 sec: 43042.7). Total num frames: 4233183232. Throughput: 0: 43090.3. Samples: 512066100. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 15:59:47,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 15:59:48,676][09423] Updated weights for policy 0, policy_version 258377 (0.0038) [2024-06-28 15:59:52,399][09423] Updated weights for policy 0, policy_version 258387 (0.0033) [2024-06-28 15:59:52,921][09190] Fps is (10 sec: 45875.2, 60 sec: 42871.4, 300 sec: 43042.7). Total num frames: 4233412608. Throughput: 0: 43293.1. Samples: 512331020. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 15:59:52,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 15:59:56,380][09423] Updated weights for policy 0, policy_version 258397 (0.0040) [2024-06-28 15:59:57,921][09190] Fps is (10 sec: 42597.7, 60 sec: 43417.5, 300 sec: 42931.6). Total num frames: 4233609216. Throughput: 0: 43349.2. Samples: 512460560. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 15:59:57,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:00:00,282][09423] Updated weights for policy 0, policy_version 258407 (0.0040) [2024-06-28 16:00:02,921][09190] Fps is (10 sec: 42598.8, 60 sec: 43144.6, 300 sec: 43042.7). Total num frames: 4233838592. Throughput: 0: 43003.3. Samples: 512708380. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 16:00:02,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 16:00:04,236][09423] Updated weights for policy 0, policy_version 258417 (0.0036) [2024-06-28 16:00:07,912][09423] Updated weights for policy 0, policy_version 258427 (0.0044) [2024-06-28 16:00:07,921][09190] Fps is (10 sec: 45875.3, 60 sec: 43144.7, 300 sec: 43098.2). Total num frames: 4234067968. Throughput: 0: 42986.3. Samples: 512969260. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 16:00:07,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:00:11,675][09423] Updated weights for policy 0, policy_version 258437 (0.0029) [2024-06-28 16:00:12,921][09190] Fps is (10 sec: 40959.4, 60 sec: 43144.4, 300 sec: 42820.6). Total num frames: 4234248192. Throughput: 0: 42954.6. Samples: 513092320. Policy #0 lag: (min: 0.0, avg: 10.3, max: 21.0) [2024-06-28 16:00:12,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 16:00:15,632][09423] Updated weights for policy 0, policy_version 258447 (0.0033) [2024-06-28 16:00:17,921][09190] Fps is (10 sec: 42598.9, 60 sec: 43144.6, 300 sec: 43098.3). Total num frames: 4234493952. Throughput: 0: 42822.7. Samples: 513349380. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2024-06-28 16:00:17,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:00:17,937][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000258453_4234493952.pth... [2024-06-28 16:00:17,991][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000257823_4224172032.pth [2024-06-28 16:00:19,660][09423] Updated weights for policy 0, policy_version 258457 (0.0025) [2024-06-28 16:00:22,921][09190] Fps is (10 sec: 44237.1, 60 sec: 42598.5, 300 sec: 42987.2). Total num frames: 4234690560. Throughput: 0: 42787.2. Samples: 513609640. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2024-06-28 16:00:22,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:00:23,329][09423] Updated weights for policy 0, policy_version 258467 (0.0037) [2024-06-28 16:00:23,604][09403] Signal inference workers to stop experience collection... (7050 times) [2024-06-28 16:00:23,604][09403] Signal inference workers to resume experience collection... (7050 times) [2024-06-28 16:00:23,644][09423] InferenceWorker_p0-w0: stopping experience collection (7050 times) [2024-06-28 16:00:23,644][09423] InferenceWorker_p0-w0: resuming experience collection (7050 times) [2024-06-28 16:00:27,028][09423] Updated weights for policy 0, policy_version 258477 (0.0024) [2024-06-28 16:00:27,921][09190] Fps is (10 sec: 40959.5, 60 sec: 43144.5, 300 sec: 42931.6). Total num frames: 4234903552. Throughput: 0: 42761.7. Samples: 513738800. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2024-06-28 16:00:27,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 16:00:30,658][09423] Updated weights for policy 0, policy_version 258487 (0.0035) [2024-06-28 16:00:32,921][09190] Fps is (10 sec: 44236.5, 60 sec: 42871.4, 300 sec: 43098.2). Total num frames: 4235132928. Throughput: 0: 42962.5. Samples: 513999420. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2024-06-28 16:00:32,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 16:00:34,912][09423] Updated weights for policy 0, policy_version 258497 (0.0044) [2024-06-28 16:00:37,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42598.5, 300 sec: 42987.6). Total num frames: 4235329536. Throughput: 0: 42896.4. Samples: 514261360. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2024-06-28 16:00:37,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:00:38,531][09423] Updated weights for policy 0, policy_version 258507 (0.0041) [2024-06-28 16:00:42,207][09423] Updated weights for policy 0, policy_version 258517 (0.0032) [2024-06-28 16:00:42,921][09190] Fps is (10 sec: 40960.2, 60 sec: 43144.5, 300 sec: 42932.6). Total num frames: 4235542528. Throughput: 0: 42680.0. Samples: 514381160. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2024-06-28 16:00:42,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:00:46,150][09423] Updated weights for policy 0, policy_version 258527 (0.0035) [2024-06-28 16:00:47,922][09190] Fps is (10 sec: 45875.0, 60 sec: 43417.5, 300 sec: 43153.8). Total num frames: 4235788288. Throughput: 0: 42910.1. Samples: 514639340. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2024-06-28 16:00:47,926][09190] Avg episode reward: [(0, '0.738')] [2024-06-28 16:00:49,944][09423] Updated weights for policy 0, policy_version 258537 (0.0042) [2024-06-28 16:00:52,921][09190] Fps is (10 sec: 40960.3, 60 sec: 42325.4, 300 sec: 42876.1). Total num frames: 4235952128. Throughput: 0: 42955.6. Samples: 514902260. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2024-06-28 16:00:52,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:00:54,014][09423] Updated weights for policy 0, policy_version 258547 (0.0028) [2024-06-28 16:00:57,898][09423] Updated weights for policy 0, policy_version 258557 (0.0042) [2024-06-28 16:00:57,921][09190] Fps is (10 sec: 40960.9, 60 sec: 43144.7, 300 sec: 42987.2). Total num frames: 4236197888. Throughput: 0: 42991.3. Samples: 515026920. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2024-06-28 16:00:57,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 16:01:01,559][09423] Updated weights for policy 0, policy_version 258567 (0.0031) [2024-06-28 16:01:02,921][09190] Fps is (10 sec: 47513.6, 60 sec: 43144.5, 300 sec: 43098.3). Total num frames: 4236427264. Throughput: 0: 43006.7. Samples: 515284680. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2024-06-28 16:01:02,922][09190] Avg episode reward: [(0, '0.738')] [2024-06-28 16:01:05,342][09423] Updated weights for policy 0, policy_version 258577 (0.0035) [2024-06-28 16:01:07,921][09190] Fps is (10 sec: 42597.8, 60 sec: 42598.4, 300 sec: 42987.2). Total num frames: 4236623872. Throughput: 0: 43084.9. Samples: 515548460. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2024-06-28 16:01:07,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:01:08,866][09423] Updated weights for policy 0, policy_version 258587 (0.0027) [2024-06-28 16:01:12,922][09190] Fps is (10 sec: 40959.0, 60 sec: 43144.5, 300 sec: 42931.6). Total num frames: 4236836864. Throughput: 0: 42945.2. Samples: 515671340. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2024-06-28 16:01:12,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 16:01:13,094][09423] Updated weights for policy 0, policy_version 258597 (0.0027) [2024-06-28 16:01:16,487][09423] Updated weights for policy 0, policy_version 258607 (0.0023) [2024-06-28 16:01:17,921][09190] Fps is (10 sec: 45875.5, 60 sec: 43144.5, 300 sec: 43098.3). Total num frames: 4237082624. Throughput: 0: 42929.9. Samples: 515931260. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2024-06-28 16:01:17,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 16:01:20,708][09423] Updated weights for policy 0, policy_version 258617 (0.0029) [2024-06-28 16:01:22,921][09190] Fps is (10 sec: 40960.7, 60 sec: 42598.4, 300 sec: 42931.6). Total num frames: 4237246464. Throughput: 0: 43038.3. Samples: 516198080. Policy #0 lag: (min: 0.0, avg: 10.1, max: 23.0) [2024-06-28 16:01:22,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:01:24,065][09423] Updated weights for policy 0, policy_version 258627 (0.0035) [2024-06-28 16:01:27,921][09190] Fps is (10 sec: 39321.6, 60 sec: 42871.5, 300 sec: 42876.1). Total num frames: 4237475840. Throughput: 0: 43037.0. Samples: 516317820. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 16:01:27,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:01:28,078][09423] Updated weights for policy 0, policy_version 258637 (0.0042) [2024-06-28 16:01:32,003][09423] Updated weights for policy 0, policy_version 258647 (0.0026) [2024-06-28 16:01:32,921][09190] Fps is (10 sec: 47514.1, 60 sec: 43144.7, 300 sec: 43042.7). Total num frames: 4237721600. Throughput: 0: 43130.0. Samples: 516580180. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 16:01:32,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 16:01:35,715][09423] Updated weights for policy 0, policy_version 258657 (0.0037) [2024-06-28 16:01:37,921][09190] Fps is (10 sec: 44236.7, 60 sec: 43144.6, 300 sec: 42987.2). Total num frames: 4237918208. Throughput: 0: 43220.0. Samples: 516847160. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 16:01:37,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 16:01:39,599][09423] Updated weights for policy 0, policy_version 258667 (0.0026) [2024-06-28 16:01:42,921][09190] Fps is (10 sec: 40959.5, 60 sec: 43144.5, 300 sec: 42987.2). Total num frames: 4238131200. Throughput: 0: 43015.9. Samples: 516962640. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 16:01:42,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 16:01:43,562][09423] Updated weights for policy 0, policy_version 258677 (0.0032) [2024-06-28 16:01:46,966][09423] Updated weights for policy 0, policy_version 258687 (0.0034) [2024-06-28 16:01:47,921][09190] Fps is (10 sec: 45875.3, 60 sec: 43144.6, 300 sec: 43154.2). Total num frames: 4238376960. Throughput: 0: 43250.2. Samples: 517230940. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 16:01:47,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:01:51,417][09423] Updated weights for policy 0, policy_version 258697 (0.0050) [2024-06-28 16:01:52,924][09190] Fps is (10 sec: 42588.0, 60 sec: 43415.8, 300 sec: 42931.3). Total num frames: 4238557184. Throughput: 0: 43229.6. Samples: 517493900. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 16:01:52,924][09190] Avg episode reward: [(0, '0.735')] [2024-06-28 16:01:54,515][09423] Updated weights for policy 0, policy_version 258707 (0.0044) [2024-06-28 16:01:55,008][09403] Signal inference workers to stop experience collection... (7100 times) [2024-06-28 16:01:55,008][09403] Signal inference workers to resume experience collection... (7100 times) [2024-06-28 16:01:55,024][09423] InferenceWorker_p0-w0: stopping experience collection (7100 times) [2024-06-28 16:01:55,024][09423] InferenceWorker_p0-w0: resuming experience collection (7100 times) [2024-06-28 16:01:57,922][09190] Fps is (10 sec: 40959.1, 60 sec: 43144.3, 300 sec: 43042.7). Total num frames: 4238786560. Throughput: 0: 42945.8. Samples: 517603900. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 16:01:57,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:01:58,934][09423] Updated weights for policy 0, policy_version 258717 (0.0048) [2024-06-28 16:02:02,230][09423] Updated weights for policy 0, policy_version 258727 (0.0026) [2024-06-28 16:02:02,922][09190] Fps is (10 sec: 47524.2, 60 sec: 43417.4, 300 sec: 43153.8). Total num frames: 4239032320. Throughput: 0: 43245.1. Samples: 517877300. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 16:02:02,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:02:06,438][09423] Updated weights for policy 0, policy_version 258737 (0.0026) [2024-06-28 16:02:07,921][09190] Fps is (10 sec: 40960.3, 60 sec: 42871.4, 300 sec: 42876.1). Total num frames: 4239196160. Throughput: 0: 42977.2. Samples: 518132060. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 16:02:07,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:02:09,709][09423] Updated weights for policy 0, policy_version 258747 (0.0038) [2024-06-28 16:02:12,921][09190] Fps is (10 sec: 39323.0, 60 sec: 43144.7, 300 sec: 43042.7). Total num frames: 4239425536. Throughput: 0: 43031.2. Samples: 518254220. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 16:02:12,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 16:02:13,906][09423] Updated weights for policy 0, policy_version 258757 (0.0025) [2024-06-28 16:02:17,306][09423] Updated weights for policy 0, policy_version 258767 (0.0027) [2024-06-28 16:02:17,921][09190] Fps is (10 sec: 47514.4, 60 sec: 43144.5, 300 sec: 43098.4). Total num frames: 4239671296. Throughput: 0: 43139.5. Samples: 518521460. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 16:02:17,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:02:17,928][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000258769_4239671296.pth... [2024-06-28 16:02:17,991][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000258137_4229316608.pth [2024-06-28 16:02:21,535][09423] Updated weights for policy 0, policy_version 258777 (0.0040) [2024-06-28 16:02:22,921][09190] Fps is (10 sec: 42598.2, 60 sec: 43417.7, 300 sec: 42931.6). Total num frames: 4239851520. Throughput: 0: 43060.9. Samples: 518784900. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 16:02:22,922][09190] Avg episode reward: [(0, '0.738')] [2024-06-28 16:02:24,965][09423] Updated weights for policy 0, policy_version 258787 (0.0042) [2024-06-28 16:02:27,921][09190] Fps is (10 sec: 40959.5, 60 sec: 43417.5, 300 sec: 43043.0). Total num frames: 4240080896. Throughput: 0: 43032.9. Samples: 518899120. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 16:02:27,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 16:02:29,101][09423] Updated weights for policy 0, policy_version 258797 (0.0028) [2024-06-28 16:02:32,352][09423] Updated weights for policy 0, policy_version 258807 (0.0036) [2024-06-28 16:02:32,921][09190] Fps is (10 sec: 47513.6, 60 sec: 43417.6, 300 sec: 43153.8). Total num frames: 4240326656. Throughput: 0: 43065.3. Samples: 519168880. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 16:02:32,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:02:36,979][09423] Updated weights for policy 0, policy_version 258817 (0.0023) [2024-06-28 16:02:37,921][09190] Fps is (10 sec: 42598.6, 60 sec: 43144.5, 300 sec: 42931.6). Total num frames: 4240506880. Throughput: 0: 42931.2. Samples: 519425700. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 16:02:37,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:02:39,989][09423] Updated weights for policy 0, policy_version 258827 (0.0034) [2024-06-28 16:02:42,921][09190] Fps is (10 sec: 39321.3, 60 sec: 43144.5, 300 sec: 43042.7). Total num frames: 4240719872. Throughput: 0: 43146.8. Samples: 519545500. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 16:02:42,923][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 16:02:44,753][09423] Updated weights for policy 0, policy_version 258837 (0.0036) [2024-06-28 16:02:47,536][09423] Updated weights for policy 0, policy_version 258847 (0.0033) [2024-06-28 16:02:47,921][09190] Fps is (10 sec: 44236.6, 60 sec: 42871.4, 300 sec: 43153.8). Total num frames: 4240949248. Throughput: 0: 42927.7. Samples: 519809040. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 16:02:47,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 16:02:52,245][09423] Updated weights for policy 0, policy_version 258857 (0.0042) [2024-06-28 16:02:52,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42873.2, 300 sec: 42931.6). Total num frames: 4241129472. Throughput: 0: 43054.3. Samples: 520069500. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 16:02:52,923][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 16:02:55,238][09423] Updated weights for policy 0, policy_version 258867 (0.0029) [2024-06-28 16:02:57,921][09190] Fps is (10 sec: 42598.5, 60 sec: 43144.6, 300 sec: 43098.3). Total num frames: 4241375232. Throughput: 0: 43000.8. Samples: 520189260. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 16:02:57,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:02:59,624][09423] Updated weights for policy 0, policy_version 258877 (0.0030) [2024-06-28 16:03:02,886][09423] Updated weights for policy 0, policy_version 258887 (0.0037) [2024-06-28 16:03:02,921][09190] Fps is (10 sec: 47513.7, 60 sec: 42871.6, 300 sec: 43098.2). Total num frames: 4241604608. Throughput: 0: 43023.9. Samples: 520457540. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 16:03:02,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 16:03:07,854][09423] Updated weights for policy 0, policy_version 258897 (0.0038) [2024-06-28 16:03:07,921][09190] Fps is (10 sec: 39321.8, 60 sec: 42871.6, 300 sec: 42931.6). Total num frames: 4241768448. Throughput: 0: 42951.1. Samples: 520717700. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 16:03:07,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:03:10,540][09423] Updated weights for policy 0, policy_version 258907 (0.0030) [2024-06-28 16:03:12,925][09190] Fps is (10 sec: 40947.4, 60 sec: 43142.2, 300 sec: 43042.3). Total num frames: 4242014208. Throughput: 0: 42980.6. Samples: 520833380. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 16:03:12,925][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:03:15,295][09423] Updated weights for policy 0, policy_version 258917 (0.0026) [2024-06-28 16:03:17,928][09190] Fps is (10 sec: 45845.2, 60 sec: 42593.7, 300 sec: 43041.8). Total num frames: 4242227200. Throughput: 0: 42924.0. Samples: 521100740. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 16:03:17,928][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 16:03:18,129][09423] Updated weights for policy 0, policy_version 258927 (0.0033) [2024-06-28 16:03:22,670][09423] Updated weights for policy 0, policy_version 258937 (0.0036) [2024-06-28 16:03:22,921][09190] Fps is (10 sec: 40972.7, 60 sec: 42871.4, 300 sec: 42987.2). Total num frames: 4242423808. Throughput: 0: 42864.4. Samples: 521354600. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 16:03:22,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 16:03:23,700][09403] Signal inference workers to stop experience collection... (7150 times) [2024-06-28 16:03:23,709][09403] Signal inference workers to resume experience collection... (7150 times) [2024-06-28 16:03:23,750][09423] InferenceWorker_p0-w0: stopping experience collection (7150 times) [2024-06-28 16:03:23,750][09423] InferenceWorker_p0-w0: resuming experience collection (7150 times) [2024-06-28 16:03:25,685][09423] Updated weights for policy 0, policy_version 258947 (0.0035) [2024-06-28 16:03:27,921][09190] Fps is (10 sec: 42626.4, 60 sec: 42871.6, 300 sec: 43042.7). Total num frames: 4242653184. Throughput: 0: 42907.2. Samples: 521476320. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 16:03:27,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 16:03:30,183][09423] Updated weights for policy 0, policy_version 258957 (0.0027) [2024-06-28 16:03:32,922][09190] Fps is (10 sec: 44236.4, 60 sec: 42325.2, 300 sec: 42987.2). Total num frames: 4242866176. Throughput: 0: 43085.7. Samples: 521747900. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 16:03:32,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 16:03:33,228][09423] Updated weights for policy 0, policy_version 258967 (0.0035) [2024-06-28 16:03:37,477][09423] Updated weights for policy 0, policy_version 258977 (0.0036) [2024-06-28 16:03:37,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42871.5, 300 sec: 43042.7). Total num frames: 4243079168. Throughput: 0: 43061.4. Samples: 522007260. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 16:03:37,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 16:03:40,757][09423] Updated weights for policy 0, policy_version 258987 (0.0028) [2024-06-28 16:03:42,921][09190] Fps is (10 sec: 45875.6, 60 sec: 43417.6, 300 sec: 43098.3). Total num frames: 4243324928. Throughput: 0: 43112.0. Samples: 522129300. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 16:03:42,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:03:45,502][09423] Updated weights for policy 0, policy_version 258997 (0.0037) [2024-06-28 16:03:47,928][09190] Fps is (10 sec: 44208.1, 60 sec: 42866.9, 300 sec: 42986.2). Total num frames: 4243521536. Throughput: 0: 42884.6. Samples: 522387620. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 16:03:47,929][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:03:48,575][09423] Updated weights for policy 0, policy_version 259007 (0.0031) [2024-06-28 16:03:52,874][09423] Updated weights for policy 0, policy_version 259017 (0.0024) [2024-06-28 16:03:52,922][09190] Fps is (10 sec: 40959.7, 60 sec: 43417.6, 300 sec: 43153.8). Total num frames: 4243734528. Throughput: 0: 42947.4. Samples: 522650340. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-28 16:03:52,924][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:03:56,031][09423] Updated weights for policy 0, policy_version 259027 (0.0029) [2024-06-28 16:03:57,921][09190] Fps is (10 sec: 40986.7, 60 sec: 42598.5, 300 sec: 42987.2). Total num frames: 4243931136. Throughput: 0: 43148.4. Samples: 522774920. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-28 16:03:57,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 16:04:00,561][09423] Updated weights for policy 0, policy_version 259037 (0.0029) [2024-06-28 16:04:02,921][09190] Fps is (10 sec: 42599.1, 60 sec: 42598.5, 300 sec: 42987.2). Total num frames: 4244160512. Throughput: 0: 42998.3. Samples: 523035380. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-28 16:04:02,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 16:04:03,628][09423] Updated weights for policy 0, policy_version 259047 (0.0036) [2024-06-28 16:04:07,921][09190] Fps is (10 sec: 42598.2, 60 sec: 43144.5, 300 sec: 43042.7). Total num frames: 4244357120. Throughput: 0: 43225.4. Samples: 523299740. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-28 16:04:07,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:04:08,252][09423] Updated weights for policy 0, policy_version 259057 (0.0032) [2024-06-28 16:04:11,402][09423] Updated weights for policy 0, policy_version 259067 (0.0032) [2024-06-28 16:04:12,921][09190] Fps is (10 sec: 44236.3, 60 sec: 43146.7, 300 sec: 43042.7). Total num frames: 4244602880. Throughput: 0: 43248.4. Samples: 523422500. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-28 16:04:12,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 16:04:15,867][09423] Updated weights for policy 0, policy_version 259077 (0.0035) [2024-06-28 16:04:17,921][09190] Fps is (10 sec: 45875.1, 60 sec: 43149.2, 300 sec: 42987.2). Total num frames: 4244815872. Throughput: 0: 43096.5. Samples: 523687240. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-28 16:04:17,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:04:18,107][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000259085_4244848640.pth... [2024-06-28 16:04:18,159][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000258453_4234493952.pth [2024-06-28 16:04:19,204][09423] Updated weights for policy 0, policy_version 259087 (0.0040) [2024-06-28 16:04:22,921][09190] Fps is (10 sec: 40960.3, 60 sec: 43144.6, 300 sec: 43042.7). Total num frames: 4245012480. Throughput: 0: 43237.8. Samples: 523952960. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-28 16:04:22,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:04:23,353][09423] Updated weights for policy 0, policy_version 259097 (0.0032) [2024-06-28 16:04:26,805][09423] Updated weights for policy 0, policy_version 259107 (0.0039) [2024-06-28 16:04:27,921][09190] Fps is (10 sec: 44236.6, 60 sec: 43417.5, 300 sec: 43042.7). Total num frames: 4245258240. Throughput: 0: 43347.1. Samples: 524079920. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-28 16:04:27,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:04:30,726][09423] Updated weights for policy 0, policy_version 259117 (0.0040) [2024-06-28 16:04:32,921][09190] Fps is (10 sec: 44236.4, 60 sec: 43144.6, 300 sec: 42987.2). Total num frames: 4245454848. Throughput: 0: 43367.5. Samples: 524338880. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-28 16:04:32,922][09190] Avg episode reward: [(0, '0.732')] [2024-06-28 16:04:34,179][09423] Updated weights for policy 0, policy_version 259127 (0.0035) [2024-06-28 16:04:37,921][09190] Fps is (10 sec: 42598.7, 60 sec: 43417.6, 300 sec: 43153.8). Total num frames: 4245684224. Throughput: 0: 43050.3. Samples: 524587600. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-28 16:04:37,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:04:38,718][09423] Updated weights for policy 0, policy_version 259137 (0.0029) [2024-06-28 16:04:41,768][09423] Updated weights for policy 0, policy_version 259147 (0.0027) [2024-06-28 16:04:42,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42598.4, 300 sec: 43042.7). Total num frames: 4245880832. Throughput: 0: 43137.6. Samples: 524716120. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-28 16:04:42,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 16:04:46,068][09423] Updated weights for policy 0, policy_version 259157 (0.0039) [2024-06-28 16:04:47,921][09190] Fps is (10 sec: 44237.0, 60 sec: 43422.3, 300 sec: 43098.3). Total num frames: 4246126592. Throughput: 0: 43336.0. Samples: 524985500. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-28 16:04:47,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 16:04:49,481][09423] Updated weights for policy 0, policy_version 259167 (0.0026) [2024-06-28 16:04:52,921][09190] Fps is (10 sec: 44237.1, 60 sec: 43144.6, 300 sec: 43098.3). Total num frames: 4246323200. Throughput: 0: 43176.9. Samples: 525242700. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-28 16:04:52,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:04:53,601][09423] Updated weights for policy 0, policy_version 259177 (0.0026) [2024-06-28 16:04:57,255][09423] Updated weights for policy 0, policy_version 259187 (0.0034) [2024-06-28 16:04:57,921][09190] Fps is (10 sec: 40959.6, 60 sec: 43417.5, 300 sec: 43042.7). Total num frames: 4246536192. Throughput: 0: 43248.0. Samples: 525368660. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-28 16:04:57,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 16:05:01,130][09423] Updated weights for policy 0, policy_version 259197 (0.0041) [2024-06-28 16:05:02,921][09190] Fps is (10 sec: 44237.0, 60 sec: 43417.6, 300 sec: 43042.7). Total num frames: 4246765568. Throughput: 0: 43190.3. Samples: 525630800. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2024-06-28 16:05:02,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 16:05:04,719][09403] Signal inference workers to stop experience collection... (7200 times) [2024-06-28 16:05:04,725][09403] Signal inference workers to resume experience collection... (7200 times) [2024-06-28 16:05:04,742][09423] InferenceWorker_p0-w0: stopping experience collection (7200 times) [2024-06-28 16:05:04,742][09423] InferenceWorker_p0-w0: resuming experience collection (7200 times) [2024-06-28 16:05:04,876][09423] Updated weights for policy 0, policy_version 259207 (0.0043) [2024-06-28 16:05:07,921][09190] Fps is (10 sec: 42598.5, 60 sec: 43417.6, 300 sec: 43098.3). Total num frames: 4246962176. Throughput: 0: 43189.7. Samples: 525896500. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2024-06-28 16:05:07,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 16:05:08,541][09423] Updated weights for policy 0, policy_version 259217 (0.0030) [2024-06-28 16:05:12,298][09423] Updated weights for policy 0, policy_version 259227 (0.0025) [2024-06-28 16:05:12,921][09190] Fps is (10 sec: 40959.8, 60 sec: 42871.5, 300 sec: 42987.2). Total num frames: 4247175168. Throughput: 0: 43026.7. Samples: 526016120. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2024-06-28 16:05:12,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:05:16,316][09423] Updated weights for policy 0, policy_version 259237 (0.0023) [2024-06-28 16:05:17,924][09190] Fps is (10 sec: 45863.8, 60 sec: 43415.8, 300 sec: 43153.4). Total num frames: 4247420928. Throughput: 0: 43025.7. Samples: 526275140. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2024-06-28 16:05:17,924][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 16:05:20,332][09423] Updated weights for policy 0, policy_version 259247 (0.0033) [2024-06-28 16:05:22,921][09190] Fps is (10 sec: 42598.7, 60 sec: 43144.6, 300 sec: 43042.7). Total num frames: 4247601152. Throughput: 0: 43292.0. Samples: 526535740. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2024-06-28 16:05:22,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 16:05:23,636][09423] Updated weights for policy 0, policy_version 259257 (0.0040) [2024-06-28 16:05:27,921][09190] Fps is (10 sec: 39331.7, 60 sec: 42598.5, 300 sec: 42987.2). Total num frames: 4247814144. Throughput: 0: 43123.2. Samples: 526656660. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2024-06-28 16:05:27,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 16:05:28,286][09423] Updated weights for policy 0, policy_version 259267 (0.0030) [2024-06-28 16:05:31,568][09423] Updated weights for policy 0, policy_version 259277 (0.0024) [2024-06-28 16:05:32,924][09190] Fps is (10 sec: 47501.0, 60 sec: 43688.8, 300 sec: 43209.0). Total num frames: 4248076288. Throughput: 0: 43112.6. Samples: 526925680. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2024-06-28 16:05:32,925][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 16:05:35,616][09423] Updated weights for policy 0, policy_version 259287 (0.0034) [2024-06-28 16:05:37,921][09190] Fps is (10 sec: 45874.9, 60 sec: 43144.5, 300 sec: 43153.8). Total num frames: 4248272896. Throughput: 0: 43128.5. Samples: 527183480. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2024-06-28 16:05:37,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 16:05:38,910][09423] Updated weights for policy 0, policy_version 259297 (0.0031) [2024-06-28 16:05:42,921][09190] Fps is (10 sec: 39331.9, 60 sec: 43144.6, 300 sec: 42987.2). Total num frames: 4248469504. Throughput: 0: 43147.6. Samples: 527310300. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2024-06-28 16:05:42,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:05:43,050][09423] Updated weights for policy 0, policy_version 259307 (0.0036) [2024-06-28 16:05:46,320][09423] Updated weights for policy 0, policy_version 259317 (0.0036) [2024-06-28 16:05:47,921][09190] Fps is (10 sec: 44236.4, 60 sec: 43144.4, 300 sec: 43264.8). Total num frames: 4248715264. Throughput: 0: 43184.8. Samples: 527574120. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2024-06-28 16:05:47,922][09190] Avg episode reward: [(0, '0.709')] [2024-06-28 16:05:50,399][09423] Updated weights for policy 0, policy_version 259327 (0.0043) [2024-06-28 16:05:52,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42871.5, 300 sec: 43042.7). Total num frames: 4248895488. Throughput: 0: 42963.6. Samples: 527829860. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2024-06-28 16:05:52,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 16:05:54,051][09423] Updated weights for policy 0, policy_version 259337 (0.0031) [2024-06-28 16:05:57,822][09423] Updated weights for policy 0, policy_version 259347 (0.0030) [2024-06-28 16:05:57,921][09190] Fps is (10 sec: 42598.6, 60 sec: 43417.6, 300 sec: 43098.2). Total num frames: 4249141248. Throughput: 0: 43093.3. Samples: 527955320. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2024-06-28 16:05:57,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:06:01,568][09423] Updated weights for policy 0, policy_version 259357 (0.0028) [2024-06-28 16:06:02,921][09190] Fps is (10 sec: 45875.4, 60 sec: 43144.6, 300 sec: 43153.8). Total num frames: 4249354240. Throughput: 0: 43067.8. Samples: 528213080. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2024-06-28 16:06:02,922][09190] Avg episode reward: [(0, '0.734')] [2024-06-28 16:06:05,962][09423] Updated weights for policy 0, policy_version 259367 (0.0037) [2024-06-28 16:06:07,921][09190] Fps is (10 sec: 40960.0, 60 sec: 43144.5, 300 sec: 43098.3). Total num frames: 4249550848. Throughput: 0: 43372.8. Samples: 528487520. Policy #0 lag: (min: 0.0, avg: 10.2, max: 24.0) [2024-06-28 16:06:07,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:06:09,084][09423] Updated weights for policy 0, policy_version 259377 (0.0036) [2024-06-28 16:06:12,921][09190] Fps is (10 sec: 40960.0, 60 sec: 43144.6, 300 sec: 42987.2). Total num frames: 4249763840. Throughput: 0: 43359.1. Samples: 528607820. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 16:06:12,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 16:06:13,298][09423] Updated weights for policy 0, policy_version 259387 (0.0036) [2024-06-28 16:06:16,573][09423] Updated weights for policy 0, policy_version 259397 (0.0030) [2024-06-28 16:06:17,921][09190] Fps is (10 sec: 44236.7, 60 sec: 42873.2, 300 sec: 43209.3). Total num frames: 4249993216. Throughput: 0: 43131.7. Samples: 528866500. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 16:06:17,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 16:06:17,984][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000259400_4250009600.pth... [2024-06-28 16:06:18,039][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000258769_4239671296.pth [2024-06-28 16:06:21,225][09423] Updated weights for policy 0, policy_version 259407 (0.0039) [2024-06-28 16:06:22,924][09190] Fps is (10 sec: 42587.6, 60 sec: 43142.7, 300 sec: 43097.9). Total num frames: 4250189824. Throughput: 0: 43319.8. Samples: 529132980. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 16:06:22,925][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 16:06:24,407][09423] Updated weights for policy 0, policy_version 259417 (0.0028) [2024-06-28 16:06:27,925][09190] Fps is (10 sec: 42584.1, 60 sec: 43415.1, 300 sec: 43042.2). Total num frames: 4250419200. Throughput: 0: 43276.2. Samples: 529257880. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 16:06:27,925][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 16:06:28,496][09423] Updated weights for policy 0, policy_version 259427 (0.0035) [2024-06-28 16:06:31,705][09423] Updated weights for policy 0, policy_version 259437 (0.0037) [2024-06-28 16:06:32,922][09190] Fps is (10 sec: 47524.7, 60 sec: 43146.3, 300 sec: 43209.3). Total num frames: 4250664960. Throughput: 0: 43151.5. Samples: 529515940. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 16:06:32,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 16:06:33,365][09403] Signal inference workers to stop experience collection... (7250 times) [2024-06-28 16:06:33,366][09403] Signal inference workers to resume experience collection... (7250 times) [2024-06-28 16:06:33,381][09423] InferenceWorker_p0-w0: stopping experience collection (7250 times) [2024-06-28 16:06:33,409][09423] InferenceWorker_p0-w0: resuming experience collection (7250 times) [2024-06-28 16:06:35,789][09423] Updated weights for policy 0, policy_version 259447 (0.0034) [2024-06-28 16:06:37,921][09190] Fps is (10 sec: 42612.8, 60 sec: 42871.4, 300 sec: 43098.3). Total num frames: 4250845184. Throughput: 0: 43413.3. Samples: 529783460. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 16:06:37,924][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 16:06:39,604][09423] Updated weights for policy 0, policy_version 259457 (0.0036) [2024-06-28 16:06:42,921][09190] Fps is (10 sec: 40960.6, 60 sec: 43417.6, 300 sec: 43042.7). Total num frames: 4251074560. Throughput: 0: 43220.5. Samples: 529900240. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 16:06:42,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:06:43,647][09423] Updated weights for policy 0, policy_version 259467 (0.0029) [2024-06-28 16:06:46,990][09423] Updated weights for policy 0, policy_version 259477 (0.0040) [2024-06-28 16:06:47,924][09190] Fps is (10 sec: 47501.9, 60 sec: 43415.9, 300 sec: 43264.9). Total num frames: 4251320320. Throughput: 0: 43428.6. Samples: 530167480. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 16:06:47,933][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:06:51,261][09423] Updated weights for policy 0, policy_version 259487 (0.0031) [2024-06-28 16:06:52,921][09190] Fps is (10 sec: 40960.2, 60 sec: 43144.6, 300 sec: 43042.7). Total num frames: 4251484160. Throughput: 0: 43172.5. Samples: 530430280. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 16:06:52,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 16:06:54,622][09423] Updated weights for policy 0, policy_version 259497 (0.0039) [2024-06-28 16:06:57,921][09190] Fps is (10 sec: 40970.0, 60 sec: 43144.5, 300 sec: 43042.7). Total num frames: 4251729920. Throughput: 0: 43308.3. Samples: 530556700. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 16:06:57,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 16:06:58,946][09423] Updated weights for policy 0, policy_version 259507 (0.0028) [2024-06-28 16:07:02,066][09423] Updated weights for policy 0, policy_version 259517 (0.0027) [2024-06-28 16:07:02,921][09190] Fps is (10 sec: 45874.8, 60 sec: 43144.5, 300 sec: 43209.3). Total num frames: 4251942912. Throughput: 0: 43373.8. Samples: 530818320. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 16:07:02,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 16:07:06,423][09423] Updated weights for policy 0, policy_version 259527 (0.0031) [2024-06-28 16:07:07,921][09190] Fps is (10 sec: 40960.1, 60 sec: 43144.5, 300 sec: 43098.2). Total num frames: 4252139520. Throughput: 0: 43320.1. Samples: 531082280. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 16:07:07,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 16:07:09,620][09423] Updated weights for policy 0, policy_version 259537 (0.0037) [2024-06-28 16:07:12,921][09190] Fps is (10 sec: 42598.7, 60 sec: 43417.6, 300 sec: 43042.7). Total num frames: 4252368896. Throughput: 0: 43364.2. Samples: 531209120. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 16:07:12,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 16:07:13,666][09423] Updated weights for policy 0, policy_version 259547 (0.0033) [2024-06-28 16:07:17,178][09423] Updated weights for policy 0, policy_version 259557 (0.0031) [2024-06-28 16:07:17,921][09190] Fps is (10 sec: 45875.4, 60 sec: 43417.6, 300 sec: 43209.3). Total num frames: 4252598272. Throughput: 0: 43383.2. Samples: 531468180. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 16:07:17,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:07:21,431][09423] Updated weights for policy 0, policy_version 259567 (0.0031) [2024-06-28 16:07:22,921][09190] Fps is (10 sec: 42598.4, 60 sec: 43419.4, 300 sec: 43098.3). Total num frames: 4252794880. Throughput: 0: 43193.0. Samples: 531727140. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 16:07:22,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:07:24,834][09423] Updated weights for policy 0, policy_version 259577 (0.0032) [2024-06-28 16:07:27,921][09190] Fps is (10 sec: 42598.4, 60 sec: 43420.1, 300 sec: 43042.7). Total num frames: 4253024256. Throughput: 0: 43282.7. Samples: 531847960. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 16:07:27,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:07:29,005][09423] Updated weights for policy 0, policy_version 259587 (0.0052) [2024-06-28 16:07:32,559][09423] Updated weights for policy 0, policy_version 259597 (0.0042) [2024-06-28 16:07:32,921][09190] Fps is (10 sec: 44236.6, 60 sec: 42871.6, 300 sec: 43153.8). Total num frames: 4253237248. Throughput: 0: 43272.2. Samples: 532114620. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 16:07:32,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 16:07:36,596][09423] Updated weights for policy 0, policy_version 259607 (0.0044) [2024-06-28 16:07:37,922][09190] Fps is (10 sec: 42598.0, 60 sec: 43417.6, 300 sec: 43153.8). Total num frames: 4253450240. Throughput: 0: 43221.2. Samples: 532375240. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 16:07:37,928][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 16:07:40,076][09423] Updated weights for policy 0, policy_version 259617 (0.0027) [2024-06-28 16:07:42,924][09190] Fps is (10 sec: 44225.9, 60 sec: 43415.8, 300 sec: 43153.4). Total num frames: 4253679616. Throughput: 0: 43171.5. Samples: 532499520. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 16:07:42,924][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:07:44,360][09423] Updated weights for policy 0, policy_version 259627 (0.0030) [2024-06-28 16:07:47,459][09423] Updated weights for policy 0, policy_version 259637 (0.0037) [2024-06-28 16:07:47,921][09190] Fps is (10 sec: 44237.3, 60 sec: 42873.3, 300 sec: 43264.9). Total num frames: 4253892608. Throughput: 0: 43189.4. Samples: 532761840. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 16:07:47,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:07:51,983][09423] Updated weights for policy 0, policy_version 259647 (0.0032) [2024-06-28 16:07:52,921][09190] Fps is (10 sec: 40970.4, 60 sec: 43417.6, 300 sec: 43098.3). Total num frames: 4254089216. Throughput: 0: 42913.9. Samples: 533013400. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 16:07:52,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 16:07:55,474][09423] Updated weights for policy 0, policy_version 259657 (0.0045) [2024-06-28 16:07:57,921][09190] Fps is (10 sec: 42598.5, 60 sec: 43144.6, 300 sec: 43098.3). Total num frames: 4254318592. Throughput: 0: 42779.1. Samples: 533134180. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 16:07:57,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 16:07:59,493][09423] Updated weights for policy 0, policy_version 259667 (0.0030) [2024-06-28 16:08:02,921][09190] Fps is (10 sec: 42598.2, 60 sec: 42871.5, 300 sec: 43209.3). Total num frames: 4254515200. Throughput: 0: 42861.4. Samples: 533396940. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 16:08:02,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:08:03,077][09423] Updated weights for policy 0, policy_version 259677 (0.0029) [2024-06-28 16:08:06,987][09403] Signal inference workers to stop experience collection... (7300 times) [2024-06-28 16:08:06,988][09403] Signal inference workers to resume experience collection... (7300 times) [2024-06-28 16:08:07,026][09423] InferenceWorker_p0-w0: stopping experience collection (7300 times) [2024-06-28 16:08:07,026][09423] InferenceWorker_p0-w0: resuming experience collection (7300 times) [2024-06-28 16:08:07,152][09423] Updated weights for policy 0, policy_version 259687 (0.0041) [2024-06-28 16:08:07,921][09190] Fps is (10 sec: 40959.7, 60 sec: 43144.5, 300 sec: 43098.7). Total num frames: 4254728192. Throughput: 0: 42879.5. Samples: 533656720. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 16:08:07,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 16:08:10,663][09423] Updated weights for policy 0, policy_version 259697 (0.0028) [2024-06-28 16:08:12,921][09190] Fps is (10 sec: 44237.0, 60 sec: 43144.6, 300 sec: 43154.7). Total num frames: 4254957568. Throughput: 0: 42959.6. Samples: 533781140. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 16:08:12,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 16:08:14,526][09423] Updated weights for policy 0, policy_version 259707 (0.0029) [2024-06-28 16:08:17,921][09190] Fps is (10 sec: 44237.4, 60 sec: 42871.5, 300 sec: 43209.3). Total num frames: 4255170560. Throughput: 0: 42971.2. Samples: 534048320. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 16:08:17,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:08:17,939][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000259716_4255186944.pth... [2024-06-28 16:08:18,007][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000259085_4244848640.pth [2024-06-28 16:08:18,155][09423] Updated weights for policy 0, policy_version 259717 (0.0037) [2024-06-28 16:08:22,286][09423] Updated weights for policy 0, policy_version 259727 (0.0037) [2024-06-28 16:08:22,921][09190] Fps is (10 sec: 42598.0, 60 sec: 43144.5, 300 sec: 43153.8). Total num frames: 4255383552. Throughput: 0: 42966.7. Samples: 534308740. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 16:08:22,922][09190] Avg episode reward: [(0, '0.738')] [2024-06-28 16:08:25,495][09423] Updated weights for policy 0, policy_version 259737 (0.0032) [2024-06-28 16:08:27,921][09190] Fps is (10 sec: 42597.9, 60 sec: 42871.5, 300 sec: 43153.8). Total num frames: 4255596544. Throughput: 0: 42962.3. Samples: 534432720. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 16:08:27,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 16:08:30,070][09423] Updated weights for policy 0, policy_version 259747 (0.0044) [2024-06-28 16:08:32,921][09190] Fps is (10 sec: 44237.1, 60 sec: 43144.6, 300 sec: 43209.3). Total num frames: 4255825920. Throughput: 0: 42865.3. Samples: 534690780. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 16:08:32,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 16:08:33,404][09423] Updated weights for policy 0, policy_version 259757 (0.0031) [2024-06-28 16:08:37,673][09423] Updated weights for policy 0, policy_version 259767 (0.0028) [2024-06-28 16:08:37,921][09190] Fps is (10 sec: 44237.1, 60 sec: 43144.6, 300 sec: 43098.3). Total num frames: 4256038912. Throughput: 0: 43163.5. Samples: 534955760. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 16:08:37,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:08:41,231][09423] Updated weights for policy 0, policy_version 259777 (0.0044) [2024-06-28 16:08:42,921][09190] Fps is (10 sec: 44236.3, 60 sec: 43146.3, 300 sec: 43210.3). Total num frames: 4256268288. Throughput: 0: 43296.8. Samples: 535082540. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 16:08:42,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:08:45,159][09423] Updated weights for policy 0, policy_version 259787 (0.0037) [2024-06-28 16:08:47,924][09190] Fps is (10 sec: 40949.8, 60 sec: 42596.6, 300 sec: 43097.9). Total num frames: 4256448512. Throughput: 0: 43181.2. Samples: 535340200. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 16:08:47,924][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 16:08:48,702][09423] Updated weights for policy 0, policy_version 259797 (0.0027) [2024-06-28 16:08:52,567][09423] Updated weights for policy 0, policy_version 259807 (0.0031) [2024-06-28 16:08:52,921][09190] Fps is (10 sec: 40960.1, 60 sec: 43144.5, 300 sec: 43209.3). Total num frames: 4256677888. Throughput: 0: 43011.1. Samples: 535592220. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 16:08:52,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 16:08:56,250][09423] Updated weights for policy 0, policy_version 259817 (0.0037) [2024-06-28 16:08:57,922][09190] Fps is (10 sec: 44247.2, 60 sec: 42871.4, 300 sec: 43153.8). Total num frames: 4256890880. Throughput: 0: 43336.7. Samples: 535731300. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 16:08:57,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 16:09:00,442][09423] Updated weights for policy 0, policy_version 259827 (0.0040) [2024-06-28 16:09:02,921][09190] Fps is (10 sec: 42598.7, 60 sec: 43144.5, 300 sec: 43209.3). Total num frames: 4257103872. Throughput: 0: 43066.2. Samples: 535986300. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 16:09:02,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:09:03,718][09423] Updated weights for policy 0, policy_version 259837 (0.0036) [2024-06-28 16:09:07,921][09190] Fps is (10 sec: 40960.8, 60 sec: 42871.6, 300 sec: 43042.7). Total num frames: 4257300480. Throughput: 0: 43043.2. Samples: 536245680. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 16:09:07,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 16:09:08,173][09423] Updated weights for policy 0, policy_version 259847 (0.0036) [2024-06-28 16:09:11,645][09423] Updated weights for policy 0, policy_version 259857 (0.0038) [2024-06-28 16:09:12,921][09190] Fps is (10 sec: 44236.4, 60 sec: 43144.5, 300 sec: 43153.8). Total num frames: 4257546240. Throughput: 0: 42978.6. Samples: 536366760. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 16:09:12,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 16:09:15,485][09423] Updated weights for policy 0, policy_version 259867 (0.0040) [2024-06-28 16:09:17,922][09190] Fps is (10 sec: 45874.1, 60 sec: 43144.4, 300 sec: 43209.3). Total num frames: 4257759232. Throughput: 0: 43156.7. Samples: 536632840. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 16:09:17,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:09:19,067][09423] Updated weights for policy 0, policy_version 259877 (0.0025) [2024-06-28 16:09:22,830][09423] Updated weights for policy 0, policy_version 259887 (0.0038) [2024-06-28 16:09:22,925][09190] Fps is (10 sec: 44220.6, 60 sec: 43414.9, 300 sec: 43153.3). Total num frames: 4257988608. Throughput: 0: 42900.4. Samples: 536886440. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 16:09:22,926][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 16:09:26,872][09423] Updated weights for policy 0, policy_version 259897 (0.0026) [2024-06-28 16:09:27,921][09190] Fps is (10 sec: 42598.5, 60 sec: 43144.5, 300 sec: 43153.8). Total num frames: 4258185216. Throughput: 0: 42997.7. Samples: 537017440. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 16:09:27,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:09:30,607][09423] Updated weights for policy 0, policy_version 259907 (0.0028) [2024-06-28 16:09:32,921][09190] Fps is (10 sec: 42613.8, 60 sec: 43144.4, 300 sec: 43153.8). Total num frames: 4258414592. Throughput: 0: 43138.7. Samples: 537281340. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 16:09:32,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:09:33,812][09403] Signal inference workers to stop experience collection... (7350 times) [2024-06-28 16:09:33,835][09423] InferenceWorker_p0-w0: stopping experience collection (7350 times) [2024-06-28 16:09:33,875][09403] Signal inference workers to resume experience collection... (7350 times) [2024-06-28 16:09:33,875][09423] InferenceWorker_p0-w0: resuming experience collection (7350 times) [2024-06-28 16:09:34,326][09423] Updated weights for policy 0, policy_version 259917 (0.0023) [2024-06-28 16:09:37,921][09190] Fps is (10 sec: 42599.1, 60 sec: 42871.5, 300 sec: 43153.8). Total num frames: 4258611200. Throughput: 0: 43257.0. Samples: 537538780. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 16:09:37,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:09:38,238][09423] Updated weights for policy 0, policy_version 259927 (0.0025) [2024-06-28 16:09:42,100][09423] Updated weights for policy 0, policy_version 259937 (0.0036) [2024-06-28 16:09:42,921][09190] Fps is (10 sec: 42598.8, 60 sec: 42871.5, 300 sec: 43098.2). Total num frames: 4258840576. Throughput: 0: 42941.4. Samples: 537663660. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 16:09:42,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 16:09:46,017][09423] Updated weights for policy 0, policy_version 259947 (0.0043) [2024-06-28 16:09:47,921][09190] Fps is (10 sec: 44236.3, 60 sec: 43419.3, 300 sec: 43153.8). Total num frames: 4259053568. Throughput: 0: 43028.4. Samples: 537922580. Policy #0 lag: (min: 0.0, avg: 10.8, max: 26.0) [2024-06-28 16:09:47,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:09:49,562][09423] Updated weights for policy 0, policy_version 259957 (0.0038) [2024-06-28 16:09:52,921][09190] Fps is (10 sec: 42598.3, 60 sec: 43144.6, 300 sec: 43153.8). Total num frames: 4259266560. Throughput: 0: 43140.3. Samples: 538187000. Policy #0 lag: (min: 0.0, avg: 10.8, max: 26.0) [2024-06-28 16:09:52,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:09:53,578][09423] Updated weights for policy 0, policy_version 259967 (0.0034) [2024-06-28 16:09:56,895][09423] Updated weights for policy 0, policy_version 259977 (0.0026) [2024-06-28 16:09:57,921][09190] Fps is (10 sec: 42598.8, 60 sec: 43144.6, 300 sec: 43098.3). Total num frames: 4259479552. Throughput: 0: 43358.3. Samples: 538317880. Policy #0 lag: (min: 0.0, avg: 10.8, max: 26.0) [2024-06-28 16:09:57,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 16:10:00,886][09423] Updated weights for policy 0, policy_version 259987 (0.0038) [2024-06-28 16:10:02,921][09190] Fps is (10 sec: 44236.4, 60 sec: 43417.5, 300 sec: 43209.3). Total num frames: 4259708928. Throughput: 0: 42975.1. Samples: 538566720. Policy #0 lag: (min: 0.0, avg: 10.8, max: 26.0) [2024-06-28 16:10:02,922][09190] Avg episode reward: [(0, '0.738')] [2024-06-28 16:10:05,210][09423] Updated weights for policy 0, policy_version 259997 (0.0027) [2024-06-28 16:10:07,921][09190] Fps is (10 sec: 44236.4, 60 sec: 43690.6, 300 sec: 43209.3). Total num frames: 4259921920. Throughput: 0: 43113.7. Samples: 538826400. Policy #0 lag: (min: 0.0, avg: 10.8, max: 26.0) [2024-06-28 16:10:07,922][09190] Avg episode reward: [(0, '0.730')] [2024-06-28 16:10:08,335][09423] Updated weights for policy 0, policy_version 260007 (0.0047) [2024-06-28 16:10:12,611][09423] Updated weights for policy 0, policy_version 260017 (0.0030) [2024-06-28 16:10:12,921][09190] Fps is (10 sec: 42598.8, 60 sec: 43144.6, 300 sec: 43098.6). Total num frames: 4260134912. Throughput: 0: 43071.2. Samples: 538955640. Policy #0 lag: (min: 0.0, avg: 10.8, max: 26.0) [2024-06-28 16:10:12,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 16:10:15,868][09423] Updated weights for policy 0, policy_version 260027 (0.0024) [2024-06-28 16:10:17,921][09190] Fps is (10 sec: 44237.0, 60 sec: 43417.7, 300 sec: 43264.9). Total num frames: 4260364288. Throughput: 0: 43056.5. Samples: 539218880. Policy #0 lag: (min: 0.0, avg: 10.8, max: 26.0) [2024-06-28 16:10:17,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 16:10:17,950][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000260032_4260364288.pth... [2024-06-28 16:10:17,995][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000259400_4250009600.pth [2024-06-28 16:10:19,974][09423] Updated weights for policy 0, policy_version 260037 (0.0037) [2024-06-28 16:10:22,924][09190] Fps is (10 sec: 42587.4, 60 sec: 42872.3, 300 sec: 43208.9). Total num frames: 4260560896. Throughput: 0: 43044.1. Samples: 539475880. Policy #0 lag: (min: 0.0, avg: 10.8, max: 26.0) [2024-06-28 16:10:22,925][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:10:23,601][09423] Updated weights for policy 0, policy_version 260047 (0.0036) [2024-06-28 16:10:27,706][09423] Updated weights for policy 0, policy_version 260057 (0.0034) [2024-06-28 16:10:27,921][09190] Fps is (10 sec: 42598.7, 60 sec: 43417.7, 300 sec: 43098.6). Total num frames: 4260790272. Throughput: 0: 43086.3. Samples: 539602540. Policy #0 lag: (min: 0.0, avg: 10.8, max: 26.0) [2024-06-28 16:10:27,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:10:31,406][09423] Updated weights for policy 0, policy_version 260067 (0.0029) [2024-06-28 16:10:32,921][09190] Fps is (10 sec: 45887.3, 60 sec: 43417.7, 300 sec: 43209.3). Total num frames: 4261019648. Throughput: 0: 43168.1. Samples: 539865140. Policy #0 lag: (min: 0.0, avg: 10.8, max: 26.0) [2024-06-28 16:10:32,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:10:35,700][09423] Updated weights for policy 0, policy_version 260077 (0.0025) [2024-06-28 16:10:37,924][09190] Fps is (10 sec: 42587.3, 60 sec: 43415.7, 300 sec: 43209.0). Total num frames: 4261216256. Throughput: 0: 43129.1. Samples: 540127920. Policy #0 lag: (min: 0.0, avg: 10.8, max: 26.0) [2024-06-28 16:10:37,924][09190] Avg episode reward: [(0, '0.738')] [2024-06-28 16:10:38,706][09423] Updated weights for policy 0, policy_version 260087 (0.0033) [2024-06-28 16:10:42,921][09190] Fps is (10 sec: 39321.0, 60 sec: 42871.4, 300 sec: 43042.7). Total num frames: 4261412864. Throughput: 0: 43008.8. Samples: 540253280. Policy #0 lag: (min: 0.0, avg: 10.8, max: 26.0) [2024-06-28 16:10:42,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 16:10:43,179][09423] Updated weights for policy 0, policy_version 260097 (0.0034) [2024-06-28 16:10:46,161][09423] Updated weights for policy 0, policy_version 260107 (0.0032) [2024-06-28 16:10:47,921][09190] Fps is (10 sec: 44247.8, 60 sec: 43417.6, 300 sec: 43264.9). Total num frames: 4261658624. Throughput: 0: 43154.7. Samples: 540508680. Policy #0 lag: (min: 0.0, avg: 10.8, max: 26.0) [2024-06-28 16:10:47,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 16:10:49,292][09403] Signal inference workers to stop experience collection... (7400 times) [2024-06-28 16:10:49,294][09403] Signal inference workers to resume experience collection... (7400 times) [2024-06-28 16:10:49,334][09423] InferenceWorker_p0-w0: stopping experience collection (7400 times) [2024-06-28 16:10:49,335][09423] InferenceWorker_p0-w0: resuming experience collection (7400 times) [2024-06-28 16:10:50,702][09423] Updated weights for policy 0, policy_version 260117 (0.0031) [2024-06-28 16:10:52,921][09190] Fps is (10 sec: 44237.7, 60 sec: 43144.6, 300 sec: 43098.3). Total num frames: 4261855232. Throughput: 0: 43333.9. Samples: 540776420. Policy #0 lag: (min: 0.0, avg: 10.8, max: 26.0) [2024-06-28 16:10:52,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 16:10:53,803][09423] Updated weights for policy 0, policy_version 260127 (0.0031) [2024-06-28 16:10:57,921][09190] Fps is (10 sec: 39321.9, 60 sec: 42871.5, 300 sec: 43042.7). Total num frames: 4262051840. Throughput: 0: 43264.5. Samples: 540902540. Policy #0 lag: (min: 0.0, avg: 11.9, max: 20.0) [2024-06-28 16:10:57,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:10:58,187][09423] Updated weights for policy 0, policy_version 260137 (0.0032) [2024-06-28 16:11:01,335][09423] Updated weights for policy 0, policy_version 260147 (0.0024) [2024-06-28 16:11:02,921][09190] Fps is (10 sec: 42597.9, 60 sec: 42871.5, 300 sec: 43153.8). Total num frames: 4262281216. Throughput: 0: 43136.0. Samples: 541160000. Policy #0 lag: (min: 0.0, avg: 11.9, max: 20.0) [2024-06-28 16:11:02,930][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 16:11:06,061][09423] Updated weights for policy 0, policy_version 260157 (0.0042) [2024-06-28 16:11:07,921][09190] Fps is (10 sec: 45875.3, 60 sec: 43144.6, 300 sec: 43209.3). Total num frames: 4262510592. Throughput: 0: 43301.7. Samples: 541424340. Policy #0 lag: (min: 0.0, avg: 11.9, max: 20.0) [2024-06-28 16:11:07,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 16:11:08,970][09423] Updated weights for policy 0, policy_version 260167 (0.0032) [2024-06-28 16:11:12,921][09190] Fps is (10 sec: 42598.9, 60 sec: 42871.5, 300 sec: 43098.3). Total num frames: 4262707200. Throughput: 0: 43243.1. Samples: 541548480. Policy #0 lag: (min: 0.0, avg: 11.9, max: 20.0) [2024-06-28 16:11:12,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 16:11:13,725][09423] Updated weights for policy 0, policy_version 260177 (0.0037) [2024-06-28 16:11:16,632][09423] Updated weights for policy 0, policy_version 260187 (0.0036) [2024-06-28 16:11:17,922][09190] Fps is (10 sec: 45871.0, 60 sec: 43417.0, 300 sec: 43320.6). Total num frames: 4262969344. Throughput: 0: 43175.6. Samples: 541808080. Policy #0 lag: (min: 0.0, avg: 11.9, max: 20.0) [2024-06-28 16:11:17,923][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:11:21,141][09423] Updated weights for policy 0, policy_version 260197 (0.0030) [2024-06-28 16:11:22,922][09190] Fps is (10 sec: 45873.7, 60 sec: 43419.3, 300 sec: 43209.8). Total num frames: 4263165952. Throughput: 0: 43308.4. Samples: 542076700. Policy #0 lag: (min: 0.0, avg: 11.9, max: 20.0) [2024-06-28 16:11:22,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:11:24,079][09423] Updated weights for policy 0, policy_version 260207 (0.0031) [2024-06-28 16:11:27,921][09190] Fps is (10 sec: 39325.1, 60 sec: 42871.4, 300 sec: 43042.7). Total num frames: 4263362560. Throughput: 0: 43292.6. Samples: 542201440. Policy #0 lag: (min: 0.0, avg: 11.9, max: 20.0) [2024-06-28 16:11:27,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 16:11:28,426][09423] Updated weights for policy 0, policy_version 260217 (0.0029) [2024-06-28 16:11:31,394][09423] Updated weights for policy 0, policy_version 260227 (0.0028) [2024-06-28 16:11:32,922][09190] Fps is (10 sec: 44237.2, 60 sec: 43144.4, 300 sec: 43264.9). Total num frames: 4263608320. Throughput: 0: 43440.4. Samples: 542463500. Policy #0 lag: (min: 0.0, avg: 11.9, max: 20.0) [2024-06-28 16:11:32,922][09190] Avg episode reward: [(0, '0.733')] [2024-06-28 16:11:35,804][09423] Updated weights for policy 0, policy_version 260237 (0.0038) [2024-06-28 16:11:37,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42873.3, 300 sec: 43098.3). Total num frames: 4263788544. Throughput: 0: 43355.1. Samples: 542727400. Policy #0 lag: (min: 0.0, avg: 11.9, max: 20.0) [2024-06-28 16:11:37,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 16:11:39,196][09423] Updated weights for policy 0, policy_version 260247 (0.0029) [2024-06-28 16:11:42,921][09190] Fps is (10 sec: 40960.6, 60 sec: 43417.7, 300 sec: 43043.1). Total num frames: 4264017920. Throughput: 0: 43360.4. Samples: 542853760. Policy #0 lag: (min: 0.0, avg: 11.9, max: 20.0) [2024-06-28 16:11:42,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 16:11:43,694][09423] Updated weights for policy 0, policy_version 260257 (0.0035) [2024-06-28 16:11:46,668][09423] Updated weights for policy 0, policy_version 260267 (0.0030) [2024-06-28 16:11:47,921][09190] Fps is (10 sec: 45875.2, 60 sec: 43144.6, 300 sec: 43264.9). Total num frames: 4264247296. Throughput: 0: 43237.9. Samples: 543105700. Policy #0 lag: (min: 0.0, avg: 11.9, max: 20.0) [2024-06-28 16:11:47,922][09190] Avg episode reward: [(0, '0.738')] [2024-06-28 16:11:51,600][09423] Updated weights for policy 0, policy_version 260277 (0.0037) [2024-06-28 16:11:52,921][09190] Fps is (10 sec: 44236.6, 60 sec: 43417.5, 300 sec: 43153.8). Total num frames: 4264460288. Throughput: 0: 43322.1. Samples: 543373840. Policy #0 lag: (min: 0.0, avg: 11.9, max: 20.0) [2024-06-28 16:11:52,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 16:11:54,299][09423] Updated weights for policy 0, policy_version 260287 (0.0032) [2024-06-28 16:11:57,921][09190] Fps is (10 sec: 42598.2, 60 sec: 43690.7, 300 sec: 43153.8). Total num frames: 4264673280. Throughput: 0: 43267.5. Samples: 543495520. Policy #0 lag: (min: 0.0, avg: 11.9, max: 20.0) [2024-06-28 16:11:57,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 16:11:58,923][09423] Updated weights for policy 0, policy_version 260297 (0.0039) [2024-06-28 16:12:01,939][09423] Updated weights for policy 0, policy_version 260307 (0.0022) [2024-06-28 16:12:02,921][09190] Fps is (10 sec: 44236.8, 60 sec: 43690.7, 300 sec: 43264.9). Total num frames: 4264902656. Throughput: 0: 43359.0. Samples: 543759200. Policy #0 lag: (min: 0.0, avg: 11.9, max: 20.0) [2024-06-28 16:12:02,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 16:12:06,614][09423] Updated weights for policy 0, policy_version 260317 (0.0024) [2024-06-28 16:12:07,921][09190] Fps is (10 sec: 42598.7, 60 sec: 43144.6, 300 sec: 43153.8). Total num frames: 4265099264. Throughput: 0: 43212.8. Samples: 544021260. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 16:12:07,922][09190] Avg episode reward: [(0, '0.734')] [2024-06-28 16:12:09,436][09423] Updated weights for policy 0, policy_version 260327 (0.0043) [2024-06-28 16:12:12,921][09190] Fps is (10 sec: 39321.6, 60 sec: 43144.4, 300 sec: 43042.7). Total num frames: 4265295872. Throughput: 0: 43160.8. Samples: 544143680. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 16:12:12,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 16:12:14,140][09423] Updated weights for policy 0, policy_version 260337 (0.0031) [2024-06-28 16:12:17,241][09423] Updated weights for policy 0, policy_version 260347 (0.0032) [2024-06-28 16:12:17,922][09190] Fps is (10 sec: 44235.9, 60 sec: 42872.0, 300 sec: 43209.3). Total num frames: 4265541632. Throughput: 0: 42931.6. Samples: 544395420. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 16:12:17,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 16:12:17,942][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000260348_4265541632.pth... [2024-06-28 16:12:17,988][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000259716_4255186944.pth [2024-06-28 16:12:19,277][09403] Signal inference workers to stop experience collection... (7450 times) [2024-06-28 16:12:19,304][09423] InferenceWorker_p0-w0: stopping experience collection (7450 times) [2024-06-28 16:12:19,332][09403] Signal inference workers to resume experience collection... (7450 times) [2024-06-28 16:12:19,332][09423] InferenceWorker_p0-w0: resuming experience collection (7450 times) [2024-06-28 16:12:22,038][09423] Updated weights for policy 0, policy_version 260357 (0.0033) [2024-06-28 16:12:22,921][09190] Fps is (10 sec: 44237.2, 60 sec: 42871.7, 300 sec: 43098.3). Total num frames: 4265738240. Throughput: 0: 43021.8. Samples: 544663380. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 16:12:22,922][09190] Avg episode reward: [(0, '0.734')] [2024-06-28 16:12:24,947][09423] Updated weights for policy 0, policy_version 260367 (0.0035) [2024-06-28 16:12:27,921][09190] Fps is (10 sec: 39322.3, 60 sec: 42871.5, 300 sec: 43042.7). Total num frames: 4265934848. Throughput: 0: 42944.9. Samples: 544786280. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 16:12:27,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 16:12:29,507][09423] Updated weights for policy 0, policy_version 260377 (0.0041) [2024-06-28 16:12:32,584][09423] Updated weights for policy 0, policy_version 260387 (0.0032) [2024-06-28 16:12:32,921][09190] Fps is (10 sec: 44237.0, 60 sec: 42871.6, 300 sec: 43153.8). Total num frames: 4266180608. Throughput: 0: 43010.7. Samples: 545041180. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 16:12:32,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 16:12:37,157][09423] Updated weights for policy 0, policy_version 260397 (0.0025) [2024-06-28 16:12:37,921][09190] Fps is (10 sec: 44236.8, 60 sec: 43144.5, 300 sec: 43043.1). Total num frames: 4266377216. Throughput: 0: 43099.7. Samples: 545313320. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 16:12:37,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:12:40,132][09423] Updated weights for policy 0, policy_version 260407 (0.0033) [2024-06-28 16:12:42,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42871.5, 300 sec: 43042.7). Total num frames: 4266590208. Throughput: 0: 43116.0. Samples: 545435740. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 16:12:42,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:12:44,805][09423] Updated weights for policy 0, policy_version 260417 (0.0034) [2024-06-28 16:12:47,530][09423] Updated weights for policy 0, policy_version 260427 (0.0025) [2024-06-28 16:12:47,921][09190] Fps is (10 sec: 45874.5, 60 sec: 43144.4, 300 sec: 43209.3). Total num frames: 4266835968. Throughput: 0: 42976.8. Samples: 545693160. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 16:12:47,924][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:12:52,166][09423] Updated weights for policy 0, policy_version 260437 (0.0032) [2024-06-28 16:12:52,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42598.5, 300 sec: 43042.7). Total num frames: 4267016192. Throughput: 0: 42957.7. Samples: 545954360. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 16:12:52,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 16:12:55,467][09423] Updated weights for policy 0, policy_version 260447 (0.0042) [2024-06-28 16:12:57,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42871.4, 300 sec: 43153.8). Total num frames: 4267245568. Throughput: 0: 42960.9. Samples: 546076920. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 16:12:57,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 16:12:59,996][09423] Updated weights for policy 0, policy_version 260457 (0.0033) [2024-06-28 16:13:02,921][09190] Fps is (10 sec: 45875.2, 60 sec: 42871.5, 300 sec: 43209.3). Total num frames: 4267474944. Throughput: 0: 43190.4. Samples: 546338980. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 16:13:02,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:13:02,993][09423] Updated weights for policy 0, policy_version 260467 (0.0037) [2024-06-28 16:13:07,365][09423] Updated weights for policy 0, policy_version 260477 (0.0022) [2024-06-28 16:13:07,921][09190] Fps is (10 sec: 42599.1, 60 sec: 42871.5, 300 sec: 43098.3). Total num frames: 4267671552. Throughput: 0: 43090.7. Samples: 546602460. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 16:13:07,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:13:10,555][09423] Updated weights for policy 0, policy_version 260487 (0.0033) [2024-06-28 16:13:12,924][09190] Fps is (10 sec: 42587.6, 60 sec: 43415.8, 300 sec: 43153.4). Total num frames: 4267900928. Throughput: 0: 43082.4. Samples: 546725100. Policy #0 lag: (min: 0.0, avg: 9.7, max: 22.0) [2024-06-28 16:13:12,924][09190] Avg episode reward: [(0, '0.734')] [2024-06-28 16:13:15,121][09423] Updated weights for policy 0, policy_version 260497 (0.0039) [2024-06-28 16:13:17,921][09190] Fps is (10 sec: 44236.5, 60 sec: 42871.6, 300 sec: 43153.8). Total num frames: 4268113920. Throughput: 0: 43196.4. Samples: 546985020. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 16:13:17,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:13:18,230][09423] Updated weights for policy 0, policy_version 260507 (0.0035) [2024-06-28 16:13:22,841][09423] Updated weights for policy 0, policy_version 260517 (0.0031) [2024-06-28 16:13:22,921][09190] Fps is (10 sec: 40970.3, 60 sec: 42871.5, 300 sec: 43098.3). Total num frames: 4268310528. Throughput: 0: 43167.1. Samples: 547255840. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 16:13:22,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 16:13:25,575][09423] Updated weights for policy 0, policy_version 260527 (0.0032) [2024-06-28 16:13:27,921][09190] Fps is (10 sec: 44236.2, 60 sec: 43690.6, 300 sec: 43153.8). Total num frames: 4268556288. Throughput: 0: 43058.5. Samples: 547373380. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 16:13:27,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:13:30,280][09423] Updated weights for policy 0, policy_version 260537 (0.0034) [2024-06-28 16:13:32,921][09190] Fps is (10 sec: 47513.3, 60 sec: 43417.5, 300 sec: 43209.3). Total num frames: 4268785664. Throughput: 0: 43027.6. Samples: 547629400. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 16:13:32,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 16:13:33,258][09423] Updated weights for policy 0, policy_version 260547 (0.0024) [2024-06-28 16:13:36,912][09403] Signal inference workers to stop experience collection... (7500 times) [2024-06-28 16:13:36,917][09403] Signal inference workers to resume experience collection... (7500 times) [2024-06-28 16:13:36,944][09423] InferenceWorker_p0-w0: stopping experience collection (7500 times) [2024-06-28 16:13:36,944][09423] InferenceWorker_p0-w0: resuming experience collection (7500 times) [2024-06-28 16:13:37,818][09423] Updated weights for policy 0, policy_version 260557 (0.0034) [2024-06-28 16:13:37,921][09190] Fps is (10 sec: 40960.1, 60 sec: 43144.4, 300 sec: 43042.7). Total num frames: 4268965888. Throughput: 0: 43113.7. Samples: 547894480. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 16:13:37,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:13:40,779][09423] Updated weights for policy 0, policy_version 260567 (0.0038) [2024-06-28 16:13:42,921][09190] Fps is (10 sec: 39322.0, 60 sec: 43144.5, 300 sec: 43154.2). Total num frames: 4269178880. Throughput: 0: 43128.6. Samples: 548017700. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 16:13:42,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 16:13:45,357][09423] Updated weights for policy 0, policy_version 260577 (0.0039) [2024-06-28 16:13:47,921][09190] Fps is (10 sec: 45875.8, 60 sec: 43144.6, 300 sec: 43209.3). Total num frames: 4269424640. Throughput: 0: 43185.3. Samples: 548282320. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 16:13:47,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 16:13:48,187][09423] Updated weights for policy 0, policy_version 260587 (0.0032) [2024-06-28 16:13:52,905][09423] Updated weights for policy 0, policy_version 260597 (0.0037) [2024-06-28 16:13:52,921][09190] Fps is (10 sec: 44236.3, 60 sec: 43417.5, 300 sec: 43153.8). Total num frames: 4269621248. Throughput: 0: 43046.5. Samples: 548539560. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 16:13:52,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 16:13:56,155][09423] Updated weights for policy 0, policy_version 260607 (0.0032) [2024-06-28 16:13:57,921][09190] Fps is (10 sec: 40959.5, 60 sec: 43144.5, 300 sec: 43153.8). Total num frames: 4269834240. Throughput: 0: 43095.2. Samples: 548664280. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 16:13:57,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 16:14:00,838][09423] Updated weights for policy 0, policy_version 260617 (0.0036) [2024-06-28 16:14:02,921][09190] Fps is (10 sec: 45875.7, 60 sec: 43417.6, 300 sec: 43320.4). Total num frames: 4270080000. Throughput: 0: 43180.9. Samples: 548928160. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 16:14:02,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 16:14:03,574][09423] Updated weights for policy 0, policy_version 260627 (0.0025) [2024-06-28 16:14:07,924][09190] Fps is (10 sec: 39312.1, 60 sec: 42596.6, 300 sec: 42986.8). Total num frames: 4270227456. Throughput: 0: 42974.0. Samples: 549189780. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 16:14:07,933][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 16:14:08,251][09423] Updated weights for policy 0, policy_version 260637 (0.0022) [2024-06-28 16:14:11,354][09423] Updated weights for policy 0, policy_version 260647 (0.0030) [2024-06-28 16:14:12,924][09190] Fps is (10 sec: 40949.6, 60 sec: 43144.5, 300 sec: 43153.4). Total num frames: 4270489600. Throughput: 0: 43132.8. Samples: 549314460. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 16:14:12,924][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 16:14:15,649][09423] Updated weights for policy 0, policy_version 260657 (0.0038) [2024-06-28 16:14:17,921][09190] Fps is (10 sec: 47525.4, 60 sec: 43144.5, 300 sec: 43098.8). Total num frames: 4270702592. Throughput: 0: 43120.0. Samples: 549569800. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 16:14:17,922][09190] Avg episode reward: [(0, '0.738')] [2024-06-28 16:14:17,934][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000260663_4270702592.pth... [2024-06-28 16:14:17,995][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000260032_4260364288.pth [2024-06-28 16:14:18,904][09423] Updated weights for policy 0, policy_version 260667 (0.0032) [2024-06-28 16:14:22,921][09190] Fps is (10 sec: 40969.9, 60 sec: 43144.5, 300 sec: 43098.3). Total num frames: 4270899200. Throughput: 0: 43039.6. Samples: 549831260. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 16:14:22,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 16:14:23,511][09423] Updated weights for policy 0, policy_version 260677 (0.0038) [2024-06-28 16:14:26,297][09423] Updated weights for policy 0, policy_version 260687 (0.0032) [2024-06-28 16:14:27,921][09190] Fps is (10 sec: 42598.1, 60 sec: 42871.5, 300 sec: 43098.3). Total num frames: 4271128576. Throughput: 0: 43014.1. Samples: 549953340. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 16:14:27,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:14:31,138][09423] Updated weights for policy 0, policy_version 260697 (0.0027) [2024-06-28 16:14:32,921][09190] Fps is (10 sec: 45875.5, 60 sec: 42871.5, 300 sec: 43209.3). Total num frames: 4271357952. Throughput: 0: 42899.1. Samples: 550212780. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 16:14:32,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:14:33,732][09403] Signal inference workers to stop experience collection... (7550 times) [2024-06-28 16:14:33,733][09403] Signal inference workers to resume experience collection... (7550 times) [2024-06-28 16:14:33,750][09423] InferenceWorker_p0-w0: stopping experience collection (7550 times) [2024-06-28 16:14:33,750][09423] InferenceWorker_p0-w0: resuming experience collection (7550 times) [2024-06-28 16:14:33,879][09423] Updated weights for policy 0, policy_version 260707 (0.0041) [2024-06-28 16:14:37,922][09190] Fps is (10 sec: 39321.0, 60 sec: 42598.3, 300 sec: 42987.1). Total num frames: 4271521792. Throughput: 0: 43232.3. Samples: 550485020. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 16:14:37,922][09190] Avg episode reward: [(0, '0.701')] [2024-06-28 16:14:38,891][09423] Updated weights for policy 0, policy_version 260717 (0.0039) [2024-06-28 16:14:41,710][09423] Updated weights for policy 0, policy_version 260727 (0.0027) [2024-06-28 16:14:42,921][09190] Fps is (10 sec: 40960.2, 60 sec: 43144.5, 300 sec: 43098.3). Total num frames: 4271767552. Throughput: 0: 43020.1. Samples: 550600180. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 16:14:42,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 16:14:46,347][09423] Updated weights for policy 0, policy_version 260737 (0.0051) [2024-06-28 16:14:47,922][09190] Fps is (10 sec: 49152.6, 60 sec: 43144.4, 300 sec: 43209.3). Total num frames: 4272013312. Throughput: 0: 42986.5. Samples: 550862560. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 16:14:47,931][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:14:49,395][09423] Updated weights for policy 0, policy_version 260747 (0.0027) [2024-06-28 16:14:52,922][09190] Fps is (10 sec: 40959.3, 60 sec: 42598.4, 300 sec: 43042.7). Total num frames: 4272177152. Throughput: 0: 43006.7. Samples: 551124980. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 16:14:52,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 16:14:53,884][09423] Updated weights for policy 0, policy_version 260757 (0.0031) [2024-06-28 16:14:57,110][09423] Updated weights for policy 0, policy_version 260767 (0.0027) [2024-06-28 16:14:57,921][09190] Fps is (10 sec: 39322.3, 60 sec: 42871.6, 300 sec: 43042.7). Total num frames: 4272406528. Throughput: 0: 42961.5. Samples: 551247620. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 16:14:57,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:15:01,579][09423] Updated weights for policy 0, policy_version 260777 (0.0033) [2024-06-28 16:15:02,921][09190] Fps is (10 sec: 47514.5, 60 sec: 42871.5, 300 sec: 43153.8). Total num frames: 4272652288. Throughput: 0: 43024.1. Samples: 551505880. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 16:15:02,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:15:04,490][09423] Updated weights for policy 0, policy_version 260787 (0.0037) [2024-06-28 16:15:07,921][09190] Fps is (10 sec: 40960.0, 60 sec: 43146.4, 300 sec: 42987.2). Total num frames: 4272816128. Throughput: 0: 43118.3. Samples: 551771580. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 16:15:07,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 16:15:09,080][09423] Updated weights for policy 0, policy_version 260797 (0.0025) [2024-06-28 16:15:11,839][09423] Updated weights for policy 0, policy_version 260807 (0.0040) [2024-06-28 16:15:12,921][09190] Fps is (10 sec: 40959.7, 60 sec: 42873.3, 300 sec: 43042.7). Total num frames: 4273061888. Throughput: 0: 43152.5. Samples: 551895200. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 16:15:12,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 16:15:16,941][09423] Updated weights for policy 0, policy_version 260817 (0.0022) [2024-06-28 16:15:17,921][09190] Fps is (10 sec: 47513.1, 60 sec: 43144.5, 300 sec: 43154.2). Total num frames: 4273291264. Throughput: 0: 43240.4. Samples: 552158600. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 16:15:17,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 16:15:19,675][09423] Updated weights for policy 0, policy_version 260827 (0.0022) [2024-06-28 16:15:22,924][09190] Fps is (10 sec: 42587.8, 60 sec: 43142.8, 300 sec: 43042.3). Total num frames: 4273487872. Throughput: 0: 42978.3. Samples: 552419140. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 16:15:22,924][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 16:15:24,409][09423] Updated weights for policy 0, policy_version 260837 (0.0035) [2024-06-28 16:15:27,218][09423] Updated weights for policy 0, policy_version 260847 (0.0033) [2024-06-28 16:15:27,921][09190] Fps is (10 sec: 42598.6, 60 sec: 43144.6, 300 sec: 43042.7). Total num frames: 4273717248. Throughput: 0: 43180.8. Samples: 552543320. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 16:15:27,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:15:31,778][09423] Updated weights for policy 0, policy_version 260857 (0.0024) [2024-06-28 16:15:32,921][09190] Fps is (10 sec: 44248.0, 60 sec: 42871.5, 300 sec: 43098.6). Total num frames: 4273930240. Throughput: 0: 43341.5. Samples: 552812920. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 16:15:32,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:15:34,690][09423] Updated weights for policy 0, policy_version 260867 (0.0031) [2024-06-28 16:15:37,921][09190] Fps is (10 sec: 40959.6, 60 sec: 43417.7, 300 sec: 43098.3). Total num frames: 4274126848. Throughput: 0: 43149.8. Samples: 553066720. Policy #0 lag: (min: 0.0, avg: 11.0, max: 21.0) [2024-06-28 16:15:37,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:15:39,239][09423] Updated weights for policy 0, policy_version 260877 (0.0023) [2024-06-28 16:15:42,427][09423] Updated weights for policy 0, policy_version 260887 (0.0042) [2024-06-28 16:15:42,921][09190] Fps is (10 sec: 45875.1, 60 sec: 43690.7, 300 sec: 43153.8). Total num frames: 4274388992. Throughput: 0: 43143.5. Samples: 553189080. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-28 16:15:42,923][09190] Avg episode reward: [(0, '0.738')] [2024-06-28 16:15:46,895][09423] Updated weights for policy 0, policy_version 260897 (0.0029) [2024-06-28 16:15:47,921][09190] Fps is (10 sec: 44237.1, 60 sec: 42598.5, 300 sec: 43098.2). Total num frames: 4274569216. Throughput: 0: 43242.1. Samples: 553451780. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-28 16:15:47,923][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 16:15:50,178][09423] Updated weights for policy 0, policy_version 260907 (0.0040) [2024-06-28 16:15:52,921][09190] Fps is (10 sec: 39321.4, 60 sec: 43417.7, 300 sec: 43153.8). Total num frames: 4274782208. Throughput: 0: 42951.0. Samples: 553704380. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-28 16:15:52,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 16:15:54,787][09423] Updated weights for policy 0, policy_version 260917 (0.0044) [2024-06-28 16:15:55,811][09403] Signal inference workers to stop experience collection... (7600 times) [2024-06-28 16:15:55,812][09403] Signal inference workers to resume experience collection... (7600 times) [2024-06-28 16:15:55,837][09423] InferenceWorker_p0-w0: stopping experience collection (7600 times) [2024-06-28 16:15:55,837][09423] InferenceWorker_p0-w0: resuming experience collection (7600 times) [2024-06-28 16:15:57,654][09423] Updated weights for policy 0, policy_version 260927 (0.0038) [2024-06-28 16:15:57,921][09190] Fps is (10 sec: 45875.1, 60 sec: 43690.6, 300 sec: 43209.3). Total num frames: 4275027968. Throughput: 0: 43188.8. Samples: 553838700. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-28 16:15:57,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 16:16:02,170][09423] Updated weights for policy 0, policy_version 260937 (0.0032) [2024-06-28 16:16:02,921][09190] Fps is (10 sec: 42598.8, 60 sec: 42598.4, 300 sec: 43042.7). Total num frames: 4275208192. Throughput: 0: 43195.2. Samples: 554102380. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-28 16:16:02,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:16:05,292][09423] Updated weights for policy 0, policy_version 260947 (0.0031) [2024-06-28 16:16:07,922][09190] Fps is (10 sec: 40958.7, 60 sec: 43690.4, 300 sec: 43153.7). Total num frames: 4275437568. Throughput: 0: 43145.1. Samples: 554360580. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-28 16:16:07,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:16:09,546][09423] Updated weights for policy 0, policy_version 260957 (0.0041) [2024-06-28 16:16:12,750][09423] Updated weights for policy 0, policy_version 260967 (0.0027) [2024-06-28 16:16:12,921][09190] Fps is (10 sec: 47512.8, 60 sec: 43690.6, 300 sec: 43098.4). Total num frames: 4275683328. Throughput: 0: 43257.3. Samples: 554489900. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-28 16:16:12,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:16:17,187][09423] Updated weights for policy 0, policy_version 260977 (0.0035) [2024-06-28 16:16:17,922][09190] Fps is (10 sec: 42599.5, 60 sec: 42871.4, 300 sec: 43042.7). Total num frames: 4275863552. Throughput: 0: 42958.9. Samples: 554746080. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-28 16:16:17,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:16:18,057][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000260979_4275879936.pth... [2024-06-28 16:16:18,125][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000260348_4265541632.pth [2024-06-28 16:16:20,563][09423] Updated weights for policy 0, policy_version 260987 (0.0043) [2024-06-28 16:16:22,921][09190] Fps is (10 sec: 39322.1, 60 sec: 43146.4, 300 sec: 43098.3). Total num frames: 4276076544. Throughput: 0: 43167.7. Samples: 555009260. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-28 16:16:22,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 16:16:24,747][09423] Updated weights for policy 0, policy_version 260997 (0.0026) [2024-06-28 16:16:27,921][09190] Fps is (10 sec: 44237.5, 60 sec: 43144.6, 300 sec: 43042.7). Total num frames: 4276305920. Throughput: 0: 43197.3. Samples: 555132960. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-28 16:16:27,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 16:16:28,262][09423] Updated weights for policy 0, policy_version 261007 (0.0035) [2024-06-28 16:16:32,365][09423] Updated weights for policy 0, policy_version 261017 (0.0025) [2024-06-28 16:16:32,922][09190] Fps is (10 sec: 44235.9, 60 sec: 43144.4, 300 sec: 43153.8). Total num frames: 4276518912. Throughput: 0: 43184.8. Samples: 555395100. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-28 16:16:32,928][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 16:16:35,958][09423] Updated weights for policy 0, policy_version 261027 (0.0043) [2024-06-28 16:16:37,921][09190] Fps is (10 sec: 42598.1, 60 sec: 43417.7, 300 sec: 43098.2). Total num frames: 4276731904. Throughput: 0: 43295.6. Samples: 555652680. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-28 16:16:37,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:16:39,818][09423] Updated weights for policy 0, policy_version 261037 (0.0033) [2024-06-28 16:16:42,921][09190] Fps is (10 sec: 44237.0, 60 sec: 42871.4, 300 sec: 43098.2). Total num frames: 4276961280. Throughput: 0: 43291.1. Samples: 555786800. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-28 16:16:42,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:16:43,426][09423] Updated weights for policy 0, policy_version 261047 (0.0033) [2024-06-28 16:16:47,202][09423] Updated weights for policy 0, policy_version 261057 (0.0033) [2024-06-28 16:16:47,921][09190] Fps is (10 sec: 42598.7, 60 sec: 43144.6, 300 sec: 43042.7). Total num frames: 4277157888. Throughput: 0: 43137.3. Samples: 556043560. Policy #0 lag: (min: 0.0, avg: 8.5, max: 21.0) [2024-06-28 16:16:47,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:16:51,307][09423] Updated weights for policy 0, policy_version 261067 (0.0034) [2024-06-28 16:16:52,921][09190] Fps is (10 sec: 40960.5, 60 sec: 43144.6, 300 sec: 43042.7). Total num frames: 4277370880. Throughput: 0: 43218.6. Samples: 556305400. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 16:16:52,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 16:16:55,156][09423] Updated weights for policy 0, policy_version 261077 (0.0034) [2024-06-28 16:16:57,921][09190] Fps is (10 sec: 42598.0, 60 sec: 42598.4, 300 sec: 42987.2). Total num frames: 4277583872. Throughput: 0: 43101.8. Samples: 556429480. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 16:16:57,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:16:58,869][09423] Updated weights for policy 0, policy_version 261087 (0.0042) [2024-06-28 16:17:02,825][09423] Updated weights for policy 0, policy_version 261097 (0.0040) [2024-06-28 16:17:02,921][09190] Fps is (10 sec: 44236.9, 60 sec: 43417.6, 300 sec: 43098.2). Total num frames: 4277813248. Throughput: 0: 43107.7. Samples: 556685920. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 16:17:02,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:17:06,471][09423] Updated weights for policy 0, policy_version 261107 (0.0033) [2024-06-28 16:17:07,922][09190] Fps is (10 sec: 42597.5, 60 sec: 42871.5, 300 sec: 43098.2). Total num frames: 4278009856. Throughput: 0: 43040.6. Samples: 556946100. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 16:17:07,928][09190] Avg episode reward: [(0, '0.738')] [2024-06-28 16:17:10,135][09423] Updated weights for policy 0, policy_version 261117 (0.0031) [2024-06-28 16:17:12,921][09190] Fps is (10 sec: 44236.5, 60 sec: 42871.5, 300 sec: 43098.3). Total num frames: 4278255616. Throughput: 0: 43064.4. Samples: 557070860. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 16:17:12,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 16:17:13,976][09423] Updated weights for policy 0, policy_version 261127 (0.0044) [2024-06-28 16:17:17,741][09423] Updated weights for policy 0, policy_version 261137 (0.0049) [2024-06-28 16:17:17,921][09190] Fps is (10 sec: 45876.2, 60 sec: 43417.6, 300 sec: 43153.8). Total num frames: 4278468608. Throughput: 0: 43000.1. Samples: 557330100. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 16:17:17,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 16:17:20,229][09403] Signal inference workers to stop experience collection... (7650 times) [2024-06-28 16:17:20,236][09403] Signal inference workers to resume experience collection... (7650 times) [2024-06-28 16:17:20,270][09423] InferenceWorker_p0-w0: stopping experience collection (7650 times) [2024-06-28 16:17:20,302][09423] InferenceWorker_p0-w0: resuming experience collection (7650 times) [2024-06-28 16:17:21,490][09423] Updated weights for policy 0, policy_version 261147 (0.0033) [2024-06-28 16:17:22,921][09190] Fps is (10 sec: 40959.9, 60 sec: 43144.5, 300 sec: 43153.8). Total num frames: 4278665216. Throughput: 0: 43178.2. Samples: 557595700. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 16:17:22,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:17:25,107][09423] Updated weights for policy 0, policy_version 261157 (0.0025) [2024-06-28 16:17:27,921][09190] Fps is (10 sec: 42598.7, 60 sec: 43144.5, 300 sec: 43098.2). Total num frames: 4278894592. Throughput: 0: 42885.4. Samples: 557716640. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 16:17:27,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:17:29,325][09423] Updated weights for policy 0, policy_version 261167 (0.0033) [2024-06-28 16:17:32,921][09190] Fps is (10 sec: 44237.1, 60 sec: 43144.7, 300 sec: 43153.8). Total num frames: 4279107584. Throughput: 0: 43010.2. Samples: 557979020. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 16:17:32,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:17:33,137][09423] Updated weights for policy 0, policy_version 261177 (0.0026) [2024-06-28 16:17:37,099][09423] Updated weights for policy 0, policy_version 261187 (0.0032) [2024-06-28 16:17:37,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42871.5, 300 sec: 43098.3). Total num frames: 4279304192. Throughput: 0: 42790.7. Samples: 558230980. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 16:17:37,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 16:17:40,659][09423] Updated weights for policy 0, policy_version 261197 (0.0040) [2024-06-28 16:17:42,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42871.6, 300 sec: 43042.7). Total num frames: 4279533568. Throughput: 0: 42684.1. Samples: 558350260. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 16:17:42,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:17:44,591][09423] Updated weights for policy 0, policy_version 261207 (0.0036) [2024-06-28 16:17:47,921][09190] Fps is (10 sec: 45875.0, 60 sec: 43417.6, 300 sec: 43209.3). Total num frames: 4279762944. Throughput: 0: 42947.1. Samples: 558618540. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 16:17:47,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 16:17:48,008][09423] Updated weights for policy 0, policy_version 261217 (0.0029) [2024-06-28 16:17:52,113][09423] Updated weights for policy 0, policy_version 261227 (0.0039) [2024-06-28 16:17:52,921][09190] Fps is (10 sec: 42598.2, 60 sec: 43144.5, 300 sec: 43098.3). Total num frames: 4279959552. Throughput: 0: 42760.2. Samples: 558870300. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 16:17:52,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 16:17:55,763][09423] Updated weights for policy 0, policy_version 261237 (0.0031) [2024-06-28 16:17:57,922][09190] Fps is (10 sec: 39321.0, 60 sec: 42871.4, 300 sec: 42987.1). Total num frames: 4280156160. Throughput: 0: 42877.2. Samples: 559000340. Policy #0 lag: (min: 0.0, avg: 10.2, max: 22.0) [2024-06-28 16:17:57,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:17:59,705][09423] Updated weights for policy 0, policy_version 261247 (0.0029) [2024-06-28 16:18:02,921][09190] Fps is (10 sec: 44237.4, 60 sec: 43144.6, 300 sec: 43153.8). Total num frames: 4280401920. Throughput: 0: 43081.9. Samples: 559268780. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 16:18:02,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 16:18:03,146][09423] Updated weights for policy 0, policy_version 261257 (0.0026) [2024-06-28 16:18:07,655][09423] Updated weights for policy 0, policy_version 261267 (0.0055) [2024-06-28 16:18:07,921][09190] Fps is (10 sec: 44237.2, 60 sec: 43144.7, 300 sec: 43043.1). Total num frames: 4280598528. Throughput: 0: 42983.1. Samples: 559529940. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 16:18:07,922][09190] Avg episode reward: [(0, '0.738')] [2024-06-28 16:18:10,935][09423] Updated weights for policy 0, policy_version 261277 (0.0041) [2024-06-28 16:18:12,921][09190] Fps is (10 sec: 42597.8, 60 sec: 42871.5, 300 sec: 43098.2). Total num frames: 4280827904. Throughput: 0: 42976.4. Samples: 559650580. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 16:18:12,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 16:18:15,033][09423] Updated weights for policy 0, policy_version 261287 (0.0037) [2024-06-28 16:18:17,921][09190] Fps is (10 sec: 44237.3, 60 sec: 42871.6, 300 sec: 43153.8). Total num frames: 4281040896. Throughput: 0: 42964.5. Samples: 559912420. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 16:18:17,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 16:18:17,935][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000261295_4281057280.pth... [2024-06-28 16:18:17,982][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000260663_4270702592.pth [2024-06-28 16:18:18,667][09423] Updated weights for policy 0, policy_version 261297 (0.0043) [2024-06-28 16:18:22,921][09190] Fps is (10 sec: 40959.8, 60 sec: 42871.5, 300 sec: 42987.2). Total num frames: 4281237504. Throughput: 0: 43080.3. Samples: 560169600. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 16:18:22,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 16:18:22,934][09423] Updated weights for policy 0, policy_version 261307 (0.0031) [2024-06-28 16:18:26,100][09423] Updated weights for policy 0, policy_version 261317 (0.0041) [2024-06-28 16:18:27,922][09190] Fps is (10 sec: 42597.5, 60 sec: 42871.4, 300 sec: 42987.2). Total num frames: 4281466880. Throughput: 0: 43237.2. Samples: 560295940. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 16:18:27,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:18:30,324][09423] Updated weights for policy 0, policy_version 261327 (0.0036) [2024-06-28 16:18:32,921][09190] Fps is (10 sec: 45875.4, 60 sec: 43144.5, 300 sec: 43153.8). Total num frames: 4281696256. Throughput: 0: 43170.6. Samples: 560561220. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 16:18:32,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:18:33,819][09423] Updated weights for policy 0, policy_version 261337 (0.0031) [2024-06-28 16:18:37,662][09423] Updated weights for policy 0, policy_version 261347 (0.0031) [2024-06-28 16:18:37,922][09190] Fps is (10 sec: 44236.9, 60 sec: 43417.5, 300 sec: 43153.8). Total num frames: 4281909248. Throughput: 0: 43308.8. Samples: 560819200. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 16:18:37,922][09190] Avg episode reward: [(0, '0.724')] [2024-06-28 16:18:41,231][09423] Updated weights for policy 0, policy_version 261357 (0.0031) [2024-06-28 16:18:42,921][09190] Fps is (10 sec: 39321.4, 60 sec: 42598.3, 300 sec: 42931.6). Total num frames: 4282089472. Throughput: 0: 43242.7. Samples: 560946260. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 16:18:42,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:18:45,225][09423] Updated weights for policy 0, policy_version 261367 (0.0035) [2024-06-28 16:18:47,694][09403] Signal inference workers to stop experience collection... (7700 times) [2024-06-28 16:18:47,722][09423] InferenceWorker_p0-w0: stopping experience collection (7700 times) [2024-06-28 16:18:47,749][09403] Signal inference workers to resume experience collection... (7700 times) [2024-06-28 16:18:47,749][09423] InferenceWorker_p0-w0: resuming experience collection (7700 times) [2024-06-28 16:18:47,921][09190] Fps is (10 sec: 45875.9, 60 sec: 43417.6, 300 sec: 43209.3). Total num frames: 4282368000. Throughput: 0: 43221.7. Samples: 561213760. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 16:18:47,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 16:18:48,656][09423] Updated weights for policy 0, policy_version 261377 (0.0033) [2024-06-28 16:18:52,921][09190] Fps is (10 sec: 45875.3, 60 sec: 43144.5, 300 sec: 43098.3). Total num frames: 4282548224. Throughput: 0: 42995.1. Samples: 561464720. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 16:18:52,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 16:18:53,084][09423] Updated weights for policy 0, policy_version 261387 (0.0047) [2024-06-28 16:18:56,485][09423] Updated weights for policy 0, policy_version 261397 (0.0030) [2024-06-28 16:18:57,921][09190] Fps is (10 sec: 39321.2, 60 sec: 43417.7, 300 sec: 42987.2). Total num frames: 4282761216. Throughput: 0: 43182.2. Samples: 561593780. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 16:18:57,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 16:19:01,091][09423] Updated weights for policy 0, policy_version 261407 (0.0031) [2024-06-28 16:19:02,921][09190] Fps is (10 sec: 44236.8, 60 sec: 43144.4, 300 sec: 43265.2). Total num frames: 4282990592. Throughput: 0: 43153.2. Samples: 561854320. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 16:19:02,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 16:19:04,337][09423] Updated weights for policy 0, policy_version 261417 (0.0035) [2024-06-28 16:19:07,921][09190] Fps is (10 sec: 44236.8, 60 sec: 43417.6, 300 sec: 43098.6). Total num frames: 4283203584. Throughput: 0: 43345.3. Samples: 562120140. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 16:19:07,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:19:08,308][09423] Updated weights for policy 0, policy_version 261427 (0.0038) [2024-06-28 16:19:11,899][09423] Updated weights for policy 0, policy_version 261437 (0.0028) [2024-06-28 16:19:12,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42871.4, 300 sec: 43042.7). Total num frames: 4283400192. Throughput: 0: 43260.1. Samples: 562242640. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 16:19:12,924][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 16:19:15,730][09423] Updated weights for policy 0, policy_version 261447 (0.0028) [2024-06-28 16:19:17,922][09190] Fps is (10 sec: 44236.6, 60 sec: 43417.5, 300 sec: 43209.3). Total num frames: 4283645952. Throughput: 0: 43146.2. Samples: 562502800. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 16:19:17,928][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:19:19,519][09423] Updated weights for policy 0, policy_version 261457 (0.0031) [2024-06-28 16:19:22,921][09190] Fps is (10 sec: 44237.0, 60 sec: 43417.6, 300 sec: 43098.3). Total num frames: 4283842560. Throughput: 0: 43256.1. Samples: 562765720. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 16:19:22,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 16:19:23,102][09423] Updated weights for policy 0, policy_version 261467 (0.0033) [2024-06-28 16:19:26,955][09423] Updated weights for policy 0, policy_version 261477 (0.0035) [2024-06-28 16:19:27,921][09190] Fps is (10 sec: 40960.6, 60 sec: 43144.7, 300 sec: 43042.7). Total num frames: 4284055552. Throughput: 0: 43202.4. Samples: 562890360. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 16:19:27,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 16:19:30,391][09423] Updated weights for policy 0, policy_version 261487 (0.0037) [2024-06-28 16:19:32,921][09190] Fps is (10 sec: 44236.5, 60 sec: 43144.5, 300 sec: 43264.9). Total num frames: 4284284928. Throughput: 0: 43238.1. Samples: 563159480. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 16:19:32,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 16:19:34,908][09423] Updated weights for policy 0, policy_version 261497 (0.0041) [2024-06-28 16:19:37,922][09190] Fps is (10 sec: 44235.8, 60 sec: 43144.5, 300 sec: 43153.8). Total num frames: 4284497920. Throughput: 0: 43277.2. Samples: 563412200. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 16:19:37,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 16:19:38,821][09423] Updated weights for policy 0, policy_version 261507 (0.0036) [2024-06-28 16:19:42,454][09423] Updated weights for policy 0, policy_version 261517 (0.0032) [2024-06-28 16:19:42,921][09190] Fps is (10 sec: 42598.9, 60 sec: 43690.8, 300 sec: 43042.7). Total num frames: 4284710912. Throughput: 0: 43232.5. Samples: 563539240. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 16:19:42,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 16:19:46,189][09423] Updated weights for policy 0, policy_version 261527 (0.0024) [2024-06-28 16:19:47,921][09190] Fps is (10 sec: 42599.0, 60 sec: 42598.3, 300 sec: 43209.3). Total num frames: 4284923904. Throughput: 0: 43215.6. Samples: 563799020. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 16:19:47,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 16:19:49,973][09423] Updated weights for policy 0, policy_version 261537 (0.0041) [2024-06-28 16:19:52,921][09190] Fps is (10 sec: 44236.5, 60 sec: 43417.6, 300 sec: 43209.3). Total num frames: 4285153280. Throughput: 0: 43157.4. Samples: 564062220. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 16:19:52,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:19:53,489][09423] Updated weights for policy 0, policy_version 261547 (0.0036) [2024-06-28 16:19:57,573][09423] Updated weights for policy 0, policy_version 261557 (0.0028) [2024-06-28 16:19:57,921][09190] Fps is (10 sec: 42598.6, 60 sec: 43144.6, 300 sec: 43042.7). Total num frames: 4285349888. Throughput: 0: 43173.4. Samples: 564185440. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 16:19:57,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 16:20:00,980][09423] Updated weights for policy 0, policy_version 261567 (0.0038) [2024-06-28 16:20:02,674][09403] Signal inference workers to stop experience collection... (7750 times) [2024-06-28 16:20:02,677][09403] Signal inference workers to resume experience collection... (7750 times) [2024-06-28 16:20:02,720][09423] InferenceWorker_p0-w0: stopping experience collection (7750 times) [2024-06-28 16:20:02,720][09423] InferenceWorker_p0-w0: resuming experience collection (7750 times) [2024-06-28 16:20:02,921][09190] Fps is (10 sec: 44236.7, 60 sec: 43417.6, 300 sec: 43320.4). Total num frames: 4285595648. Throughput: 0: 43198.7. Samples: 564446740. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 16:20:02,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:20:05,129][09423] Updated weights for policy 0, policy_version 261577 (0.0044) [2024-06-28 16:20:07,922][09190] Fps is (10 sec: 42595.9, 60 sec: 42871.1, 300 sec: 43098.2). Total num frames: 4285775872. Throughput: 0: 43289.2. Samples: 564713760. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 16:20:07,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:20:08,585][09423] Updated weights for policy 0, policy_version 261587 (0.0034) [2024-06-28 16:20:12,532][09423] Updated weights for policy 0, policy_version 261597 (0.0024) [2024-06-28 16:20:12,922][09190] Fps is (10 sec: 40959.6, 60 sec: 43417.6, 300 sec: 43098.2). Total num frames: 4286005248. Throughput: 0: 43322.9. Samples: 564839900. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 16:20:12,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 16:20:16,461][09423] Updated weights for policy 0, policy_version 261607 (0.0034) [2024-06-28 16:20:17,921][09190] Fps is (10 sec: 44239.2, 60 sec: 42871.5, 300 sec: 43154.1). Total num frames: 4286218240. Throughput: 0: 42932.5. Samples: 565091440. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 16:20:17,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 16:20:17,951][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000261611_4286234624.pth... [2024-06-28 16:20:18,011][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000260979_4275879936.pth [2024-06-28 16:20:20,461][09423] Updated weights for policy 0, policy_version 261617 (0.0038) [2024-06-28 16:20:22,923][09190] Fps is (10 sec: 40955.1, 60 sec: 42870.5, 300 sec: 43042.5). Total num frames: 4286414848. Throughput: 0: 43238.9. Samples: 565358000. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 16:20:22,932][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 16:20:24,191][09423] Updated weights for policy 0, policy_version 261627 (0.0031) [2024-06-28 16:20:27,890][09423] Updated weights for policy 0, policy_version 261637 (0.0035) [2024-06-28 16:20:27,924][09190] Fps is (10 sec: 44226.0, 60 sec: 43415.8, 300 sec: 43153.4). Total num frames: 4286660608. Throughput: 0: 43152.3. Samples: 565481200. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 16:20:27,933][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 16:20:31,655][09423] Updated weights for policy 0, policy_version 261647 (0.0029) [2024-06-28 16:20:32,921][09190] Fps is (10 sec: 45880.9, 60 sec: 43144.5, 300 sec: 43209.3). Total num frames: 4286873600. Throughput: 0: 43077.7. Samples: 565737520. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 16:20:32,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 16:20:35,849][09423] Updated weights for policy 0, policy_version 261657 (0.0045) [2024-06-28 16:20:37,921][09190] Fps is (10 sec: 40969.8, 60 sec: 42871.5, 300 sec: 42987.2). Total num frames: 4287070208. Throughput: 0: 43077.7. Samples: 566000720. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 16:20:37,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:20:39,223][09423] Updated weights for policy 0, policy_version 261667 (0.0043) [2024-06-28 16:20:42,921][09190] Fps is (10 sec: 40960.3, 60 sec: 42871.4, 300 sec: 43098.3). Total num frames: 4287283200. Throughput: 0: 43021.7. Samples: 566121420. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 16:20:42,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:20:43,273][09423] Updated weights for policy 0, policy_version 261677 (0.0044) [2024-06-28 16:20:46,645][09423] Updated weights for policy 0, policy_version 261687 (0.0038) [2024-06-28 16:20:47,921][09190] Fps is (10 sec: 45875.7, 60 sec: 43417.7, 300 sec: 43209.3). Total num frames: 4287528960. Throughput: 0: 43090.3. Samples: 566385800. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 16:20:47,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 16:20:50,695][09423] Updated weights for policy 0, policy_version 261697 (0.0023) [2024-06-28 16:20:52,922][09190] Fps is (10 sec: 42598.0, 60 sec: 42598.3, 300 sec: 42987.2). Total num frames: 4287709184. Throughput: 0: 43111.1. Samples: 566653740. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 16:20:52,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 16:20:54,372][09423] Updated weights for policy 0, policy_version 261707 (0.0027) [2024-06-28 16:20:57,921][09190] Fps is (10 sec: 40959.7, 60 sec: 43144.5, 300 sec: 43153.8). Total num frames: 4287938560. Throughput: 0: 42988.1. Samples: 566774360. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 16:20:57,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:20:58,177][09423] Updated weights for policy 0, policy_version 261717 (0.0026) [2024-06-28 16:21:02,132][09423] Updated weights for policy 0, policy_version 261727 (0.0044) [2024-06-28 16:21:02,921][09190] Fps is (10 sec: 47514.4, 60 sec: 43144.6, 300 sec: 43209.4). Total num frames: 4288184320. Throughput: 0: 43244.5. Samples: 567037440. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 16:21:02,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:21:05,601][09423] Updated weights for policy 0, policy_version 261737 (0.0030) [2024-06-28 16:21:07,921][09190] Fps is (10 sec: 40959.7, 60 sec: 42871.8, 300 sec: 42931.6). Total num frames: 4288348160. Throughput: 0: 43253.2. Samples: 567304340. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 16:21:07,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 16:21:09,568][09423] Updated weights for policy 0, policy_version 261747 (0.0036) [2024-06-28 16:21:12,921][09190] Fps is (10 sec: 40960.0, 60 sec: 43144.7, 300 sec: 43153.8). Total num frames: 4288593920. Throughput: 0: 43089.1. Samples: 567420100. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 16:21:12,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:21:13,534][09423] Updated weights for policy 0, policy_version 261757 (0.0035) [2024-06-28 16:21:17,286][09403] Signal inference workers to stop experience collection... (7800 times) [2024-06-28 16:21:17,287][09403] Signal inference workers to resume experience collection... (7800 times) [2024-06-28 16:21:17,296][09423] Updated weights for policy 0, policy_version 261767 (0.0026) [2024-06-28 16:21:17,306][09423] InferenceWorker_p0-w0: stopping experience collection (7800 times) [2024-06-28 16:21:17,306][09423] InferenceWorker_p0-w0: resuming experience collection (7800 times) [2024-06-28 16:21:17,924][09190] Fps is (10 sec: 49141.1, 60 sec: 43689.0, 300 sec: 43264.5). Total num frames: 4288839680. Throughput: 0: 43385.0. Samples: 567689940. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 16:21:17,924][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:21:20,990][09423] Updated weights for policy 0, policy_version 261777 (0.0024) [2024-06-28 16:21:22,921][09190] Fps is (10 sec: 39321.5, 60 sec: 42872.4, 300 sec: 42987.2). Total num frames: 4288987136. Throughput: 0: 43533.4. Samples: 567959720. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 16:21:22,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 16:21:24,643][09423] Updated weights for policy 0, policy_version 261787 (0.0041) [2024-06-28 16:21:27,925][09190] Fps is (10 sec: 42590.9, 60 sec: 43416.4, 300 sec: 43208.8). Total num frames: 4289265664. Throughput: 0: 43325.0. Samples: 568071220. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 16:21:27,926][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:21:28,770][09423] Updated weights for policy 0, policy_version 261797 (0.0035) [2024-06-28 16:21:32,216][09423] Updated weights for policy 0, policy_version 261807 (0.0041) [2024-06-28 16:21:32,921][09190] Fps is (10 sec: 50790.7, 60 sec: 43690.8, 300 sec: 43264.9). Total num frames: 4289495040. Throughput: 0: 43420.0. Samples: 568339700. Policy #0 lag: (min: 0.0, avg: 8.6, max: 20.0) [2024-06-28 16:21:32,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:21:36,529][09423] Updated weights for policy 0, policy_version 261817 (0.0042) [2024-06-28 16:21:37,921][09190] Fps is (10 sec: 37698.5, 60 sec: 42871.5, 300 sec: 42987.2). Total num frames: 4289642496. Throughput: 0: 43160.6. Samples: 568595960. Policy #0 lag: (min: 0.0, avg: 11.8, max: 26.0) [2024-06-28 16:21:37,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:21:39,892][09423] Updated weights for policy 0, policy_version 261827 (0.0028) [2024-06-28 16:21:42,921][09190] Fps is (10 sec: 39321.5, 60 sec: 43417.7, 300 sec: 43153.8). Total num frames: 4289888256. Throughput: 0: 43113.0. Samples: 568714440. Policy #0 lag: (min: 0.0, avg: 11.8, max: 26.0) [2024-06-28 16:21:42,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:21:43,911][09423] Updated weights for policy 0, policy_version 261837 (0.0037) [2024-06-28 16:21:47,343][09423] Updated weights for policy 0, policy_version 261847 (0.0032) [2024-06-28 16:21:47,921][09190] Fps is (10 sec: 49152.2, 60 sec: 43417.6, 300 sec: 43264.9). Total num frames: 4290134016. Throughput: 0: 43332.9. Samples: 568987420. Policy #0 lag: (min: 0.0, avg: 11.8, max: 26.0) [2024-06-28 16:21:47,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 16:21:51,804][09423] Updated weights for policy 0, policy_version 261857 (0.0020) [2024-06-28 16:21:52,921][09190] Fps is (10 sec: 40959.7, 60 sec: 43144.6, 300 sec: 43098.3). Total num frames: 4290297856. Throughput: 0: 43165.4. Samples: 569246780. Policy #0 lag: (min: 0.0, avg: 11.8, max: 26.0) [2024-06-28 16:21:52,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:21:54,930][09423] Updated weights for policy 0, policy_version 261867 (0.0040) [2024-06-28 16:21:57,921][09190] Fps is (10 sec: 42598.4, 60 sec: 43690.7, 300 sec: 43209.3). Total num frames: 4290560000. Throughput: 0: 43241.8. Samples: 569365980. Policy #0 lag: (min: 0.0, avg: 11.8, max: 26.0) [2024-06-28 16:21:57,922][09190] Avg episode reward: [(0, '0.735')] [2024-06-28 16:21:59,530][09423] Updated weights for policy 0, policy_version 261877 (0.0042) [2024-06-28 16:22:02,468][09423] Updated weights for policy 0, policy_version 261887 (0.0045) [2024-06-28 16:22:02,921][09190] Fps is (10 sec: 49152.2, 60 sec: 43417.6, 300 sec: 43320.4). Total num frames: 4290789376. Throughput: 0: 43252.9. Samples: 569636220. Policy #0 lag: (min: 0.0, avg: 11.8, max: 26.0) [2024-06-28 16:22:02,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 16:22:06,944][09423] Updated weights for policy 0, policy_version 261897 (0.0041) [2024-06-28 16:22:07,921][09190] Fps is (10 sec: 39321.3, 60 sec: 43417.6, 300 sec: 43042.7). Total num frames: 4290953216. Throughput: 0: 43123.5. Samples: 569900280. Policy #0 lag: (min: 0.0, avg: 11.8, max: 26.0) [2024-06-28 16:22:07,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 16:22:10,198][09423] Updated weights for policy 0, policy_version 261907 (0.0027) [2024-06-28 16:22:12,921][09190] Fps is (10 sec: 42598.6, 60 sec: 43690.7, 300 sec: 43209.3). Total num frames: 4291215360. Throughput: 0: 43312.0. Samples: 570020080. Policy #0 lag: (min: 0.0, avg: 11.8, max: 26.0) [2024-06-28 16:22:12,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 16:22:14,428][09423] Updated weights for policy 0, policy_version 261917 (0.0040) [2024-06-28 16:22:17,921][09190] Fps is (10 sec: 44237.3, 60 sec: 42600.1, 300 sec: 43153.8). Total num frames: 4291395584. Throughput: 0: 43104.4. Samples: 570279400. Policy #0 lag: (min: 0.0, avg: 11.8, max: 26.0) [2024-06-28 16:22:17,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 16:22:17,959][09423] Updated weights for policy 0, policy_version 261927 (0.0034) [2024-06-28 16:22:18,071][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000261928_4291428352.pth... [2024-06-28 16:22:18,120][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000261295_4281057280.pth [2024-06-28 16:22:22,334][09423] Updated weights for policy 0, policy_version 261937 (0.0032) [2024-06-28 16:22:22,922][09190] Fps is (10 sec: 37682.5, 60 sec: 43417.5, 300 sec: 43042.7). Total num frames: 4291592192. Throughput: 0: 43219.4. Samples: 570540840. Policy #0 lag: (min: 0.0, avg: 11.8, max: 26.0) [2024-06-28 16:22:22,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:22:25,313][09423] Updated weights for policy 0, policy_version 261947 (0.0033) [2024-06-28 16:22:27,921][09190] Fps is (10 sec: 44236.1, 60 sec: 42874.3, 300 sec: 43153.8). Total num frames: 4291837952. Throughput: 0: 43270.5. Samples: 570661620. Policy #0 lag: (min: 0.0, avg: 11.8, max: 26.0) [2024-06-28 16:22:27,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:22:29,798][09423] Updated weights for policy 0, policy_version 261957 (0.0037) [2024-06-28 16:22:32,921][09190] Fps is (10 sec: 45875.4, 60 sec: 42598.3, 300 sec: 43209.3). Total num frames: 4292050944. Throughput: 0: 43088.8. Samples: 570926420. Policy #0 lag: (min: 0.0, avg: 11.8, max: 26.0) [2024-06-28 16:22:32,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 16:22:33,015][09423] Updated weights for policy 0, policy_version 261967 (0.0035) [2024-06-28 16:22:37,762][09423] Updated weights for policy 0, policy_version 261977 (0.0023) [2024-06-28 16:22:37,921][09190] Fps is (10 sec: 39321.8, 60 sec: 43144.5, 300 sec: 43042.7). Total num frames: 4292231168. Throughput: 0: 43140.0. Samples: 571188080. Policy #0 lag: (min: 0.0, avg: 11.8, max: 26.0) [2024-06-28 16:22:37,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 16:22:40,498][09423] Updated weights for policy 0, policy_version 261987 (0.0031) [2024-06-28 16:22:40,980][09403] Signal inference workers to stop experience collection... (7850 times) [2024-06-28 16:22:40,981][09403] Signal inference workers to resume experience collection... (7850 times) [2024-06-28 16:22:41,012][09423] InferenceWorker_p0-w0: stopping experience collection (7850 times) [2024-06-28 16:22:41,040][09423] InferenceWorker_p0-w0: resuming experience collection (7850 times) [2024-06-28 16:22:42,921][09190] Fps is (10 sec: 45875.4, 60 sec: 43690.6, 300 sec: 43209.3). Total num frames: 4292509696. Throughput: 0: 43191.9. Samples: 571309620. Policy #0 lag: (min: 0.0, avg: 11.8, max: 26.0) [2024-06-28 16:22:42,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 16:22:45,094][09423] Updated weights for policy 0, policy_version 261997 (0.0036) [2024-06-28 16:22:47,922][09190] Fps is (10 sec: 45874.3, 60 sec: 42598.2, 300 sec: 43153.8). Total num frames: 4292689920. Throughput: 0: 43237.6. Samples: 571581920. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-28 16:22:47,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:22:48,136][09423] Updated weights for policy 0, policy_version 262007 (0.0040) [2024-06-28 16:22:52,793][09423] Updated weights for policy 0, policy_version 262017 (0.0034) [2024-06-28 16:22:52,921][09190] Fps is (10 sec: 37683.3, 60 sec: 43144.5, 300 sec: 43153.8). Total num frames: 4292886528. Throughput: 0: 42895.6. Samples: 571830580. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-28 16:22:52,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 16:22:55,706][09423] Updated weights for policy 0, policy_version 262027 (0.0022) [2024-06-28 16:22:57,922][09190] Fps is (10 sec: 45875.6, 60 sec: 43144.4, 300 sec: 43209.3). Total num frames: 4293148672. Throughput: 0: 42850.5. Samples: 571948360. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-28 16:22:57,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 16:23:00,470][09423] Updated weights for policy 0, policy_version 262037 (0.0041) [2024-06-28 16:23:02,921][09190] Fps is (10 sec: 45875.2, 60 sec: 42598.4, 300 sec: 43209.3). Total num frames: 4293345280. Throughput: 0: 43201.3. Samples: 572223460. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-28 16:23:02,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 16:23:03,120][09423] Updated weights for policy 0, policy_version 262047 (0.0034) [2024-06-28 16:23:07,911][09423] Updated weights for policy 0, policy_version 262057 (0.0034) [2024-06-28 16:23:07,921][09190] Fps is (10 sec: 39322.1, 60 sec: 43144.6, 300 sec: 43098.3). Total num frames: 4293541888. Throughput: 0: 43208.6. Samples: 572485220. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-28 16:23:07,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 16:23:10,653][09423] Updated weights for policy 0, policy_version 262067 (0.0032) [2024-06-28 16:23:12,921][09190] Fps is (10 sec: 45875.3, 60 sec: 43144.5, 300 sec: 43264.9). Total num frames: 4293804032. Throughput: 0: 43088.5. Samples: 572600600. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-28 16:23:12,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:23:15,675][09423] Updated weights for policy 0, policy_version 262077 (0.0032) [2024-06-28 16:23:17,921][09190] Fps is (10 sec: 47513.4, 60 sec: 43690.6, 300 sec: 43320.4). Total num frames: 4294017024. Throughput: 0: 43404.5. Samples: 572879620. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-28 16:23:17,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 16:23:18,128][09423] Updated weights for policy 0, policy_version 262087 (0.0024) [2024-06-28 16:23:22,922][09190] Fps is (10 sec: 37682.7, 60 sec: 43144.5, 300 sec: 43098.3). Total num frames: 4294180864. Throughput: 0: 43381.3. Samples: 573140240. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-28 16:23:22,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 16:23:22,946][09423] Updated weights for policy 0, policy_version 262097 (0.0033) [2024-06-28 16:23:25,777][09423] Updated weights for policy 0, policy_version 262107 (0.0036) [2024-06-28 16:23:27,922][09190] Fps is (10 sec: 44236.4, 60 sec: 43690.6, 300 sec: 43264.9). Total num frames: 4294459392. Throughput: 0: 43291.9. Samples: 573257760. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-28 16:23:27,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:23:30,489][09423] Updated weights for policy 0, policy_version 262117 (0.0043) [2024-06-28 16:23:32,921][09190] Fps is (10 sec: 45875.9, 60 sec: 43144.6, 300 sec: 43153.8). Total num frames: 4294639616. Throughput: 0: 43174.0. Samples: 573524740. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-28 16:23:32,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 16:23:33,437][09423] Updated weights for policy 0, policy_version 262127 (0.0040) [2024-06-28 16:23:37,921][09190] Fps is (10 sec: 37683.8, 60 sec: 43417.6, 300 sec: 43209.3). Total num frames: 4294836224. Throughput: 0: 43320.0. Samples: 573779980. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-28 16:23:37,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 16:23:38,605][09423] Updated weights for policy 0, policy_version 262137 (0.0050) [2024-06-28 16:23:41,086][09423] Updated weights for policy 0, policy_version 262147 (0.0032) [2024-06-28 16:23:42,921][09190] Fps is (10 sec: 45874.7, 60 sec: 43144.5, 300 sec: 43153.8). Total num frames: 4295098368. Throughput: 0: 43393.9. Samples: 573901080. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-28 16:23:42,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 16:23:46,082][09423] Updated weights for policy 0, policy_version 262157 (0.0026) [2024-06-28 16:23:47,921][09190] Fps is (10 sec: 45875.3, 60 sec: 43417.8, 300 sec: 43209.3). Total num frames: 4295294976. Throughput: 0: 43226.7. Samples: 574168660. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-28 16:23:47,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:23:48,758][09423] Updated weights for policy 0, policy_version 262167 (0.0032) [2024-06-28 16:23:52,922][09190] Fps is (10 sec: 39317.7, 60 sec: 43416.9, 300 sec: 43153.6). Total num frames: 4295491584. Throughput: 0: 43172.3. Samples: 574428020. Policy #0 lag: (min: 0.0, avg: 8.6, max: 21.0) [2024-06-28 16:23:52,923][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:23:53,447][09423] Updated weights for policy 0, policy_version 262177 (0.0031) [2024-06-28 16:23:55,070][09403] Signal inference workers to stop experience collection... (7900 times) [2024-06-28 16:23:55,071][09403] Signal inference workers to resume experience collection... (7900 times) [2024-06-28 16:23:55,102][09423] InferenceWorker_p0-w0: stopping experience collection (7900 times) [2024-06-28 16:23:55,102][09423] InferenceWorker_p0-w0: resuming experience collection (7900 times) [2024-06-28 16:23:56,297][09423] Updated weights for policy 0, policy_version 262187 (0.0036) [2024-06-28 16:23:57,924][09190] Fps is (10 sec: 45863.6, 60 sec: 43415.9, 300 sec: 43264.5). Total num frames: 4295753728. Throughput: 0: 43322.5. Samples: 574550220. Policy #0 lag: (min: 0.0, avg: 11.2, max: 25.0) [2024-06-28 16:23:57,924][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:24:00,825][09423] Updated weights for policy 0, policy_version 262197 (0.0032) [2024-06-28 16:24:02,921][09190] Fps is (10 sec: 44241.3, 60 sec: 43144.5, 300 sec: 43153.8). Total num frames: 4295933952. Throughput: 0: 43128.0. Samples: 574820380. Policy #0 lag: (min: 0.0, avg: 11.2, max: 25.0) [2024-06-28 16:24:02,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:24:03,708][09423] Updated weights for policy 0, policy_version 262207 (0.0026) [2024-06-28 16:24:07,921][09190] Fps is (10 sec: 39331.1, 60 sec: 43417.6, 300 sec: 43209.3). Total num frames: 4296146944. Throughput: 0: 43052.0. Samples: 575077580. Policy #0 lag: (min: 0.0, avg: 11.2, max: 25.0) [2024-06-28 16:24:07,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 16:24:08,740][09423] Updated weights for policy 0, policy_version 262217 (0.0029) [2024-06-28 16:24:11,301][09423] Updated weights for policy 0, policy_version 262227 (0.0029) [2024-06-28 16:24:12,921][09190] Fps is (10 sec: 47513.8, 60 sec: 43417.6, 300 sec: 43264.9). Total num frames: 4296409088. Throughput: 0: 43158.8. Samples: 575199900. Policy #0 lag: (min: 0.0, avg: 11.2, max: 25.0) [2024-06-28 16:24:12,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 16:24:16,271][09423] Updated weights for policy 0, policy_version 262237 (0.0028) [2024-06-28 16:24:17,921][09190] Fps is (10 sec: 44237.2, 60 sec: 42871.5, 300 sec: 43209.3). Total num frames: 4296589312. Throughput: 0: 43274.2. Samples: 575472080. Policy #0 lag: (min: 0.0, avg: 11.2, max: 25.0) [2024-06-28 16:24:17,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 16:24:18,067][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000262244_4296605696.pth... [2024-06-28 16:24:18,135][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000261611_4286234624.pth [2024-06-28 16:24:18,976][09423] Updated weights for policy 0, policy_version 262247 (0.0028) [2024-06-28 16:24:22,921][09190] Fps is (10 sec: 39321.6, 60 sec: 43690.8, 300 sec: 43209.3). Total num frames: 4296802304. Throughput: 0: 43062.2. Samples: 575717780. Policy #0 lag: (min: 0.0, avg: 11.2, max: 25.0) [2024-06-28 16:24:22,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:24:23,690][09423] Updated weights for policy 0, policy_version 262257 (0.0036) [2024-06-28 16:24:26,641][09423] Updated weights for policy 0, policy_version 262267 (0.0029) [2024-06-28 16:24:27,921][09190] Fps is (10 sec: 44236.3, 60 sec: 42871.5, 300 sec: 43209.3). Total num frames: 4297031680. Throughput: 0: 43264.9. Samples: 575848000. Policy #0 lag: (min: 0.0, avg: 11.2, max: 25.0) [2024-06-28 16:24:27,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 16:24:31,617][09423] Updated weights for policy 0, policy_version 262277 (0.0030) [2024-06-28 16:24:32,921][09190] Fps is (10 sec: 42598.2, 60 sec: 43144.5, 300 sec: 43153.8). Total num frames: 4297228288. Throughput: 0: 43304.8. Samples: 576117380. Policy #0 lag: (min: 0.0, avg: 11.2, max: 25.0) [2024-06-28 16:24:32,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:24:34,126][09423] Updated weights for policy 0, policy_version 262287 (0.0028) [2024-06-28 16:24:37,921][09190] Fps is (10 sec: 40960.2, 60 sec: 43417.6, 300 sec: 43153.8). Total num frames: 4297441280. Throughput: 0: 43129.4. Samples: 576368800. Policy #0 lag: (min: 0.0, avg: 11.2, max: 25.0) [2024-06-28 16:24:37,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 16:24:38,854][09423] Updated weights for policy 0, policy_version 262297 (0.0033) [2024-06-28 16:24:41,606][09423] Updated weights for policy 0, policy_version 262307 (0.0032) [2024-06-28 16:24:42,923][09190] Fps is (10 sec: 45868.9, 60 sec: 43143.6, 300 sec: 43264.7). Total num frames: 4297687040. Throughput: 0: 43285.9. Samples: 576498040. Policy #0 lag: (min: 0.0, avg: 11.2, max: 25.0) [2024-06-28 16:24:42,923][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:24:46,218][09423] Updated weights for policy 0, policy_version 262317 (0.0031) [2024-06-28 16:24:47,921][09190] Fps is (10 sec: 42598.8, 60 sec: 42871.5, 300 sec: 43098.3). Total num frames: 4297867264. Throughput: 0: 43264.1. Samples: 576767260. Policy #0 lag: (min: 0.0, avg: 11.2, max: 25.0) [2024-06-28 16:24:47,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 16:24:48,746][09403] Signal inference workers to stop experience collection... (7950 times) [2024-06-28 16:24:48,746][09403] Signal inference workers to resume experience collection... (7950 times) [2024-06-28 16:24:48,777][09423] InferenceWorker_p0-w0: stopping experience collection (7950 times) [2024-06-28 16:24:48,778][09423] InferenceWorker_p0-w0: resuming experience collection (7950 times) [2024-06-28 16:24:49,114][09423] Updated weights for policy 0, policy_version 262327 (0.0043) [2024-06-28 16:24:52,921][09190] Fps is (10 sec: 40966.0, 60 sec: 43418.4, 300 sec: 43209.3). Total num frames: 4298096640. Throughput: 0: 43039.2. Samples: 577014340. Policy #0 lag: (min: 0.0, avg: 11.2, max: 25.0) [2024-06-28 16:24:52,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 16:24:54,054][09423] Updated weights for policy 0, policy_version 262337 (0.0036) [2024-06-28 16:24:56,533][09423] Updated weights for policy 0, policy_version 262347 (0.0028) [2024-06-28 16:24:57,922][09190] Fps is (10 sec: 47512.7, 60 sec: 43146.2, 300 sec: 43209.3). Total num frames: 4298342400. Throughput: 0: 43254.5. Samples: 577146360. Policy #0 lag: (min: 0.0, avg: 11.2, max: 25.0) [2024-06-28 16:24:57,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:25:01,429][09423] Updated weights for policy 0, policy_version 262357 (0.0034) [2024-06-28 16:25:02,921][09190] Fps is (10 sec: 42598.6, 60 sec: 43144.6, 300 sec: 43209.4). Total num frames: 4298522624. Throughput: 0: 43150.7. Samples: 577413860. Policy #0 lag: (min: 0.0, avg: 11.2, max: 25.0) [2024-06-28 16:25:02,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:25:04,419][09423] Updated weights for policy 0, policy_version 262367 (0.0035) [2024-06-28 16:25:07,922][09190] Fps is (10 sec: 40960.0, 60 sec: 43417.5, 300 sec: 43209.3). Total num frames: 4298752000. Throughput: 0: 43293.2. Samples: 577665980. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 16:25:07,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:25:09,016][09423] Updated weights for policy 0, policy_version 262377 (0.0037) [2024-06-28 16:25:11,891][09423] Updated weights for policy 0, policy_version 262387 (0.0041) [2024-06-28 16:25:12,921][09190] Fps is (10 sec: 45875.3, 60 sec: 42871.5, 300 sec: 43264.9). Total num frames: 4298981376. Throughput: 0: 43336.6. Samples: 577798140. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 16:25:12,921][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:25:16,345][09423] Updated weights for policy 0, policy_version 262397 (0.0038) [2024-06-28 16:25:17,921][09190] Fps is (10 sec: 42598.9, 60 sec: 43144.5, 300 sec: 43265.1). Total num frames: 4299177984. Throughput: 0: 43267.6. Samples: 578064420. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 16:25:17,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:25:19,379][09423] Updated weights for policy 0, policy_version 262407 (0.0035) [2024-06-28 16:25:22,921][09190] Fps is (10 sec: 40959.6, 60 sec: 43144.5, 300 sec: 43154.2). Total num frames: 4299390976. Throughput: 0: 43374.7. Samples: 578320660. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 16:25:22,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 16:25:23,739][09423] Updated weights for policy 0, policy_version 262417 (0.0024) [2024-06-28 16:25:27,269][09423] Updated weights for policy 0, policy_version 262427 (0.0037) [2024-06-28 16:25:27,921][09190] Fps is (10 sec: 44236.6, 60 sec: 43144.6, 300 sec: 43209.3). Total num frames: 4299620352. Throughput: 0: 43280.0. Samples: 578445580. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 16:25:27,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 16:25:31,624][09423] Updated weights for policy 0, policy_version 262437 (0.0046) [2024-06-28 16:25:32,921][09190] Fps is (10 sec: 42598.3, 60 sec: 43144.5, 300 sec: 43209.3). Total num frames: 4299816960. Throughput: 0: 43040.4. Samples: 578704080. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 16:25:32,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 16:25:34,690][09423] Updated weights for policy 0, policy_version 262447 (0.0036) [2024-06-28 16:25:37,921][09190] Fps is (10 sec: 42598.1, 60 sec: 43417.5, 300 sec: 43264.9). Total num frames: 4300046336. Throughput: 0: 43321.2. Samples: 578963800. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 16:25:37,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:25:39,047][09423] Updated weights for policy 0, policy_version 262457 (0.0039) [2024-06-28 16:25:42,534][09423] Updated weights for policy 0, policy_version 262467 (0.0027) [2024-06-28 16:25:42,921][09190] Fps is (10 sec: 45875.4, 60 sec: 43145.6, 300 sec: 43209.3). Total num frames: 4300275712. Throughput: 0: 43189.9. Samples: 579089900. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 16:25:42,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 16:25:46,743][09423] Updated weights for policy 0, policy_version 262477 (0.0037) [2024-06-28 16:25:47,921][09190] Fps is (10 sec: 40960.9, 60 sec: 43144.6, 300 sec: 43209.4). Total num frames: 4300455936. Throughput: 0: 43144.5. Samples: 579355360. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 16:25:47,928][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:25:49,914][09423] Updated weights for policy 0, policy_version 262487 (0.0027) [2024-06-28 16:25:52,921][09190] Fps is (10 sec: 40959.6, 60 sec: 43144.4, 300 sec: 43209.3). Total num frames: 4300685312. Throughput: 0: 43223.6. Samples: 579611040. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 16:25:52,923][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 16:25:54,111][09423] Updated weights for policy 0, policy_version 262497 (0.0036) [2024-06-28 16:25:57,205][09423] Updated weights for policy 0, policy_version 262507 (0.0039) [2024-06-28 16:25:57,921][09190] Fps is (10 sec: 45874.9, 60 sec: 42871.6, 300 sec: 43153.8). Total num frames: 4300914688. Throughput: 0: 43269.3. Samples: 579745260. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 16:25:57,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 16:26:01,726][09423] Updated weights for policy 0, policy_version 262517 (0.0029) [2024-06-28 16:26:02,921][09190] Fps is (10 sec: 42599.0, 60 sec: 43144.5, 300 sec: 43264.9). Total num frames: 4301111296. Throughput: 0: 43144.5. Samples: 580005920. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 16:26:02,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 16:26:05,438][09423] Updated weights for policy 0, policy_version 262527 (0.0038) [2024-06-28 16:26:07,921][09190] Fps is (10 sec: 42597.9, 60 sec: 43144.6, 300 sec: 43209.3). Total num frames: 4301340672. Throughput: 0: 43071.0. Samples: 580258860. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 16:26:07,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 16:26:09,460][09423] Updated weights for policy 0, policy_version 262537 (0.0032) [2024-06-28 16:26:12,921][09190] Fps is (10 sec: 44236.1, 60 sec: 42871.3, 300 sec: 43098.6). Total num frames: 4301553664. Throughput: 0: 43237.3. Samples: 580391260. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 16:26:12,922][09190] Avg episode reward: [(0, '0.738')] [2024-06-28 16:26:12,967][09423] Updated weights for policy 0, policy_version 262547 (0.0045) [2024-06-28 16:26:13,399][09403] Signal inference workers to stop experience collection... (8000 times) [2024-06-28 16:26:13,400][09403] Signal inference workers to resume experience collection... (8000 times) [2024-06-28 16:26:13,452][09423] InferenceWorker_p0-w0: stopping experience collection (8000 times) [2024-06-28 16:26:13,452][09423] InferenceWorker_p0-w0: resuming experience collection (8000 times) [2024-06-28 16:26:17,022][09423] Updated weights for policy 0, policy_version 262557 (0.0035) [2024-06-28 16:26:17,921][09190] Fps is (10 sec: 42598.7, 60 sec: 43144.5, 300 sec: 43320.4). Total num frames: 4301766656. Throughput: 0: 43402.2. Samples: 580657180. Policy #0 lag: (min: 0.0, avg: 10.2, max: 21.0) [2024-06-28 16:26:17,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:26:17,966][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000262560_4301783040.pth... [2024-06-28 16:26:18,010][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000261928_4291428352.pth [2024-06-28 16:26:20,339][09423] Updated weights for policy 0, policy_version 262567 (0.0026) [2024-06-28 16:26:22,921][09190] Fps is (10 sec: 44237.8, 60 sec: 43417.7, 300 sec: 43154.4). Total num frames: 4301996032. Throughput: 0: 43295.3. Samples: 580912080. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 16:26:22,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:26:24,407][09423] Updated weights for policy 0, policy_version 262577 (0.0026) [2024-06-28 16:26:27,921][09423] Updated weights for policy 0, policy_version 262587 (0.0031) [2024-06-28 16:26:27,921][09190] Fps is (10 sec: 45874.9, 60 sec: 43417.6, 300 sec: 43153.8). Total num frames: 4302225408. Throughput: 0: 43419.0. Samples: 581043760. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 16:26:27,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:26:31,976][09423] Updated weights for policy 0, policy_version 262597 (0.0022) [2024-06-28 16:26:32,921][09190] Fps is (10 sec: 42598.0, 60 sec: 43417.6, 300 sec: 43320.4). Total num frames: 4302422016. Throughput: 0: 43316.8. Samples: 581304620. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 16:26:32,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 16:26:35,319][09423] Updated weights for policy 0, policy_version 262607 (0.0026) [2024-06-28 16:26:37,922][09190] Fps is (10 sec: 40959.8, 60 sec: 43144.5, 300 sec: 43209.3). Total num frames: 4302635008. Throughput: 0: 43312.4. Samples: 581560100. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 16:26:37,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:26:39,756][09423] Updated weights for policy 0, policy_version 262617 (0.0038) [2024-06-28 16:26:42,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42871.5, 300 sec: 43098.2). Total num frames: 4302848000. Throughput: 0: 43189.7. Samples: 581688800. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 16:26:42,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:26:43,175][09423] Updated weights for policy 0, policy_version 262627 (0.0034) [2024-06-28 16:26:47,121][09423] Updated weights for policy 0, policy_version 262637 (0.0038) [2024-06-28 16:26:47,921][09190] Fps is (10 sec: 42599.2, 60 sec: 43417.6, 300 sec: 43264.9). Total num frames: 4303060992. Throughput: 0: 43107.1. Samples: 581945740. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 16:26:47,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:26:50,585][09423] Updated weights for policy 0, policy_version 262647 (0.0041) [2024-06-28 16:26:52,921][09190] Fps is (10 sec: 42598.6, 60 sec: 43144.6, 300 sec: 43098.3). Total num frames: 4303273984. Throughput: 0: 43309.4. Samples: 582207780. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 16:26:52,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:26:54,766][09423] Updated weights for policy 0, policy_version 262657 (0.0027) [2024-06-28 16:26:57,922][09190] Fps is (10 sec: 44235.9, 60 sec: 43144.4, 300 sec: 43098.2). Total num frames: 4303503360. Throughput: 0: 43131.5. Samples: 582332180. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 16:26:57,931][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 16:26:58,488][09423] Updated weights for policy 0, policy_version 262667 (0.0034) [2024-06-28 16:27:02,482][09423] Updated weights for policy 0, policy_version 262677 (0.0027) [2024-06-28 16:27:02,921][09190] Fps is (10 sec: 44236.4, 60 sec: 43417.5, 300 sec: 43264.9). Total num frames: 4303716352. Throughput: 0: 42948.8. Samples: 582589880. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 16:27:02,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 16:27:05,946][09423] Updated weights for policy 0, policy_version 262687 (0.0042) [2024-06-28 16:27:07,922][09190] Fps is (10 sec: 44236.9, 60 sec: 43417.6, 300 sec: 43153.8). Total num frames: 4303945728. Throughput: 0: 43107.8. Samples: 582851940. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 16:27:07,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 16:27:09,881][09423] Updated weights for policy 0, policy_version 262697 (0.0027) [2024-06-28 16:27:12,921][09190] Fps is (10 sec: 44237.2, 60 sec: 43417.7, 300 sec: 43264.9). Total num frames: 4304158720. Throughput: 0: 43113.5. Samples: 582983860. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 16:27:12,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 16:27:13,211][09423] Updated weights for policy 0, policy_version 262707 (0.0028) [2024-06-28 16:27:17,415][09423] Updated weights for policy 0, policy_version 262717 (0.0028) [2024-06-28 16:27:17,922][09190] Fps is (10 sec: 42598.2, 60 sec: 43417.5, 300 sec: 43320.4). Total num frames: 4304371712. Throughput: 0: 43227.4. Samples: 583249860. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 16:27:17,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 16:27:21,497][09423] Updated weights for policy 0, policy_version 262727 (0.0026) [2024-06-28 16:27:22,921][09190] Fps is (10 sec: 42598.2, 60 sec: 43144.5, 300 sec: 43209.3). Total num frames: 4304584704. Throughput: 0: 43109.0. Samples: 583500000. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 16:27:22,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 16:27:25,222][09423] Updated weights for policy 0, policy_version 262737 (0.0049) [2024-06-28 16:27:27,921][09190] Fps is (10 sec: 44237.4, 60 sec: 43144.6, 300 sec: 43264.9). Total num frames: 4304814080. Throughput: 0: 43138.2. Samples: 583630020. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 16:27:27,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 16:27:28,861][09423] Updated weights for policy 0, policy_version 262747 (0.0032) [2024-06-28 16:27:32,654][09423] Updated weights for policy 0, policy_version 262757 (0.0031) [2024-06-28 16:27:32,921][09190] Fps is (10 sec: 42598.1, 60 sec: 43144.5, 300 sec: 43320.4). Total num frames: 4305010688. Throughput: 0: 43183.9. Samples: 583889020. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 16:27:32,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 16:27:36,792][09423] Updated weights for policy 0, policy_version 262767 (0.0036) [2024-06-28 16:27:37,922][09190] Fps is (10 sec: 40959.0, 60 sec: 43144.4, 300 sec: 43098.2). Total num frames: 4305223680. Throughput: 0: 43076.6. Samples: 584146240. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 16:27:37,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:27:40,110][09423] Updated weights for policy 0, policy_version 262777 (0.0038) [2024-06-28 16:27:42,921][09190] Fps is (10 sec: 44237.2, 60 sec: 43417.6, 300 sec: 43264.9). Total num frames: 4305453056. Throughput: 0: 43214.8. Samples: 584276840. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 16:27:42,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:27:44,105][09423] Updated weights for policy 0, policy_version 262787 (0.0031) [2024-06-28 16:27:47,735][09423] Updated weights for policy 0, policy_version 262797 (0.0028) [2024-06-28 16:27:47,922][09190] Fps is (10 sec: 44237.3, 60 sec: 43417.5, 300 sec: 43320.4). Total num frames: 4305666048. Throughput: 0: 43354.6. Samples: 584540840. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 16:27:47,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 16:27:51,466][09423] Updated weights for policy 0, policy_version 262807 (0.0031) [2024-06-28 16:27:52,755][09403] Signal inference workers to stop experience collection... (8050 times) [2024-06-28 16:27:52,755][09403] Signal inference workers to resume experience collection... (8050 times) [2024-06-28 16:27:52,768][09423] InferenceWorker_p0-w0: stopping experience collection (8050 times) [2024-06-28 16:27:52,798][09423] InferenceWorker_p0-w0: resuming experience collection (8050 times) [2024-06-28 16:27:52,921][09190] Fps is (10 sec: 42598.6, 60 sec: 43417.6, 300 sec: 43153.8). Total num frames: 4305879040. Throughput: 0: 43389.5. Samples: 584804460. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 16:27:52,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:27:55,189][09423] Updated weights for policy 0, policy_version 262817 (0.0029) [2024-06-28 16:27:57,921][09190] Fps is (10 sec: 44237.8, 60 sec: 43417.8, 300 sec: 43264.9). Total num frames: 4306108416. Throughput: 0: 43142.7. Samples: 584925280. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 16:27:57,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:27:59,322][09423] Updated weights for policy 0, policy_version 262827 (0.0031) [2024-06-28 16:28:02,921][09190] Fps is (10 sec: 42597.8, 60 sec: 43144.5, 300 sec: 43264.9). Total num frames: 4306305024. Throughput: 0: 43026.7. Samples: 585186060. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 16:28:02,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:28:02,934][09423] Updated weights for policy 0, policy_version 262837 (0.0036) [2024-06-28 16:28:07,047][09423] Updated weights for policy 0, policy_version 262847 (0.0035) [2024-06-28 16:28:07,921][09190] Fps is (10 sec: 39321.5, 60 sec: 42598.5, 300 sec: 43042.7). Total num frames: 4306501632. Throughput: 0: 43195.2. Samples: 585443780. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 16:28:07,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 16:28:10,384][09423] Updated weights for policy 0, policy_version 262857 (0.0029) [2024-06-28 16:28:12,921][09190] Fps is (10 sec: 44237.1, 60 sec: 43144.5, 300 sec: 43153.8). Total num frames: 4306747392. Throughput: 0: 43096.9. Samples: 585569380. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 16:28:12,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 16:28:14,726][09423] Updated weights for policy 0, policy_version 262867 (0.0023) [2024-06-28 16:28:17,924][09190] Fps is (10 sec: 45863.4, 60 sec: 43142.8, 300 sec: 43320.1). Total num frames: 4306960384. Throughput: 0: 43142.5. Samples: 585830540. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 16:28:17,924][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 16:28:17,939][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000262876_4306960384.pth... [2024-06-28 16:28:17,990][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000262244_4296605696.pth [2024-06-28 16:28:18,166][09423] Updated weights for policy 0, policy_version 262877 (0.0031) [2024-06-28 16:28:22,086][09423] Updated weights for policy 0, policy_version 262887 (0.0036) [2024-06-28 16:28:22,921][09190] Fps is (10 sec: 42598.1, 60 sec: 43144.5, 300 sec: 43098.3). Total num frames: 4307173376. Throughput: 0: 43221.5. Samples: 586091200. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 16:28:22,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:28:25,702][09423] Updated weights for policy 0, policy_version 262897 (0.0034) [2024-06-28 16:28:27,921][09190] Fps is (10 sec: 44247.5, 60 sec: 43144.5, 300 sec: 43264.8). Total num frames: 4307402752. Throughput: 0: 43199.0. Samples: 586220800. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 16:28:27,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 16:28:29,520][09423] Updated weights for policy 0, policy_version 262907 (0.0028) [2024-06-28 16:28:32,921][09190] Fps is (10 sec: 44236.9, 60 sec: 43417.6, 300 sec: 43320.4). Total num frames: 4307615744. Throughput: 0: 43196.5. Samples: 586484680. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 16:28:32,922][09190] Avg episode reward: [(0, '0.732')] [2024-06-28 16:28:33,014][09423] Updated weights for policy 0, policy_version 262917 (0.0040) [2024-06-28 16:28:37,402][09423] Updated weights for policy 0, policy_version 262927 (0.0045) [2024-06-28 16:28:37,922][09190] Fps is (10 sec: 40959.7, 60 sec: 43144.6, 300 sec: 43098.2). Total num frames: 4307812352. Throughput: 0: 43001.5. Samples: 586739540. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 16:28:37,931][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 16:28:40,528][09423] Updated weights for policy 0, policy_version 262937 (0.0027) [2024-06-28 16:28:42,921][09190] Fps is (10 sec: 42598.5, 60 sec: 43144.5, 300 sec: 43209.3). Total num frames: 4308041728. Throughput: 0: 43173.7. Samples: 586868100. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 16:28:42,925][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 16:28:44,638][09423] Updated weights for policy 0, policy_version 262947 (0.0038) [2024-06-28 16:28:47,921][09190] Fps is (10 sec: 44237.3, 60 sec: 43144.6, 300 sec: 43265.0). Total num frames: 4308254720. Throughput: 0: 43202.7. Samples: 587130180. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 16:28:47,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 16:28:48,295][09423] Updated weights for policy 0, policy_version 262957 (0.0022) [2024-06-28 16:28:52,603][09423] Updated weights for policy 0, policy_version 262967 (0.0026) [2024-06-28 16:28:52,921][09190] Fps is (10 sec: 42598.3, 60 sec: 43144.4, 300 sec: 43098.6). Total num frames: 4308467712. Throughput: 0: 43309.7. Samples: 587392720. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 16:28:52,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:28:55,694][09423] Updated weights for policy 0, policy_version 262977 (0.0028) [2024-06-28 16:28:57,922][09190] Fps is (10 sec: 44236.5, 60 sec: 43144.4, 300 sec: 43264.8). Total num frames: 4308697088. Throughput: 0: 43334.1. Samples: 587519420. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 16:28:57,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 16:29:00,010][09423] Updated weights for policy 0, policy_version 262987 (0.0028) [2024-06-28 16:29:02,921][09190] Fps is (10 sec: 44237.3, 60 sec: 43417.7, 300 sec: 43264.9). Total num frames: 4308910080. Throughput: 0: 43456.7. Samples: 587785980. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 16:29:02,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:29:03,485][09423] Updated weights for policy 0, policy_version 262997 (0.0033) [2024-06-28 16:29:07,306][09423] Updated weights for policy 0, policy_version 263007 (0.0040) [2024-06-28 16:29:07,921][09190] Fps is (10 sec: 42598.6, 60 sec: 43690.6, 300 sec: 43098.2). Total num frames: 4309123072. Throughput: 0: 43241.3. Samples: 588037060. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 16:29:07,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:29:10,983][09423] Updated weights for policy 0, policy_version 263017 (0.0033) [2024-06-28 16:29:12,922][09190] Fps is (10 sec: 44236.0, 60 sec: 43417.5, 300 sec: 43264.8). Total num frames: 4309352448. Throughput: 0: 43246.2. Samples: 588166880. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 16:29:12,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:29:14,861][09423] Updated weights for policy 0, policy_version 263027 (0.0038) [2024-06-28 16:29:17,921][09190] Fps is (10 sec: 42598.6, 60 sec: 43146.3, 300 sec: 43209.3). Total num frames: 4309549056. Throughput: 0: 43106.7. Samples: 588424480. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 16:29:17,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 16:29:18,468][09423] Updated weights for policy 0, policy_version 263037 (0.0038) [2024-06-28 16:29:22,746][09423] Updated weights for policy 0, policy_version 263047 (0.0030) [2024-06-28 16:29:22,921][09190] Fps is (10 sec: 40960.7, 60 sec: 43144.6, 300 sec: 43153.8). Total num frames: 4309762048. Throughput: 0: 43282.9. Samples: 588687260. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 16:29:22,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 16:29:25,909][09423] Updated weights for policy 0, policy_version 263057 (0.0033) [2024-06-28 16:29:27,921][09190] Fps is (10 sec: 44236.9, 60 sec: 43144.6, 300 sec: 43264.9). Total num frames: 4309991424. Throughput: 0: 43240.4. Samples: 588813920. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 16:29:27,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 16:29:29,611][09403] Signal inference workers to stop experience collection... (8100 times) [2024-06-28 16:29:29,611][09403] Signal inference workers to resume experience collection... (8100 times) [2024-06-28 16:29:29,633][09423] InferenceWorker_p0-w0: stopping experience collection (8100 times) [2024-06-28 16:29:29,633][09423] InferenceWorker_p0-w0: resuming experience collection (8100 times) [2024-06-28 16:29:30,731][09423] Updated weights for policy 0, policy_version 263067 (0.0034) [2024-06-28 16:29:32,927][09190] Fps is (10 sec: 42576.2, 60 sec: 42867.8, 300 sec: 43208.6). Total num frames: 4310188032. Throughput: 0: 43256.0. Samples: 589076920. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 16:29:32,927][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 16:29:33,907][09423] Updated weights for policy 0, policy_version 263077 (0.0032) [2024-06-28 16:29:37,921][09190] Fps is (10 sec: 40960.0, 60 sec: 43144.6, 300 sec: 43098.5). Total num frames: 4310401024. Throughput: 0: 43242.3. Samples: 589338620. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 16:29:37,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 16:29:38,115][09423] Updated weights for policy 0, policy_version 263087 (0.0028) [2024-06-28 16:29:41,411][09423] Updated weights for policy 0, policy_version 263097 (0.0031) [2024-06-28 16:29:42,921][09190] Fps is (10 sec: 45899.0, 60 sec: 43417.7, 300 sec: 43320.4). Total num frames: 4310646784. Throughput: 0: 43154.4. Samples: 589461360. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 16:29:42,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 16:29:45,497][09423] Updated weights for policy 0, policy_version 263107 (0.0034) [2024-06-28 16:29:47,921][09190] Fps is (10 sec: 45875.5, 60 sec: 43417.7, 300 sec: 43264.9). Total num frames: 4310859776. Throughput: 0: 43180.0. Samples: 589729080. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 16:29:47,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:29:48,882][09423] Updated weights for policy 0, policy_version 263117 (0.0040) [2024-06-28 16:29:52,921][09190] Fps is (10 sec: 39321.6, 60 sec: 42871.5, 300 sec: 43042.7). Total num frames: 4311040000. Throughput: 0: 43198.4. Samples: 589980980. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 16:29:52,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:29:53,538][09423] Updated weights for policy 0, policy_version 263127 (0.0036) [2024-06-28 16:29:56,708][09423] Updated weights for policy 0, policy_version 263137 (0.0046) [2024-06-28 16:29:57,922][09190] Fps is (10 sec: 42597.8, 60 sec: 43144.6, 300 sec: 43264.8). Total num frames: 4311285760. Throughput: 0: 43112.9. Samples: 590106960. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 16:29:57,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 16:30:01,104][09423] Updated weights for policy 0, policy_version 263147 (0.0025) [2024-06-28 16:30:02,921][09190] Fps is (10 sec: 45874.7, 60 sec: 43144.5, 300 sec: 43209.3). Total num frames: 4311498752. Throughput: 0: 43351.1. Samples: 590375280. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 16:30:02,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:30:04,068][09423] Updated weights for policy 0, policy_version 263157 (0.0035) [2024-06-28 16:30:07,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42871.5, 300 sec: 43098.2). Total num frames: 4311695360. Throughput: 0: 43219.9. Samples: 590632160. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 16:30:07,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:30:08,597][09423] Updated weights for policy 0, policy_version 263167 (0.0032) [2024-06-28 16:30:11,886][09423] Updated weights for policy 0, policy_version 263177 (0.0043) [2024-06-28 16:30:12,921][09190] Fps is (10 sec: 45875.2, 60 sec: 43417.6, 300 sec: 43320.4). Total num frames: 4311957504. Throughput: 0: 43295.5. Samples: 590762220. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 16:30:12,924][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 16:30:16,145][09423] Updated weights for policy 0, policy_version 263187 (0.0036) [2024-06-28 16:30:17,924][09190] Fps is (10 sec: 44226.1, 60 sec: 43142.8, 300 sec: 43209.0). Total num frames: 4312137728. Throughput: 0: 43182.1. Samples: 591020000. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 16:30:17,924][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 16:30:17,933][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000263192_4312137728.pth... [2024-06-28 16:30:18,005][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000262560_4301783040.pth [2024-06-28 16:30:19,250][09423] Updated weights for policy 0, policy_version 263197 (0.0029) [2024-06-28 16:30:22,921][09190] Fps is (10 sec: 39321.9, 60 sec: 43144.5, 300 sec: 43153.8). Total num frames: 4312350720. Throughput: 0: 43278.7. Samples: 591286160. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 16:30:22,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 16:30:23,417][09423] Updated weights for policy 0, policy_version 263207 (0.0042) [2024-06-28 16:30:26,510][09423] Updated weights for policy 0, policy_version 263217 (0.0040) [2024-06-28 16:30:27,921][09190] Fps is (10 sec: 45887.0, 60 sec: 43417.7, 300 sec: 43320.4). Total num frames: 4312596480. Throughput: 0: 43310.7. Samples: 591410340. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 16:30:27,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:30:31,497][09423] Updated weights for policy 0, policy_version 263227 (0.0024) [2024-06-28 16:30:32,921][09190] Fps is (10 sec: 44237.2, 60 sec: 43421.4, 300 sec: 43209.4). Total num frames: 4312793088. Throughput: 0: 43161.0. Samples: 591671320. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 16:30:32,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:30:34,069][09423] Updated weights for policy 0, policy_version 263237 (0.0036) [2024-06-28 16:30:37,924][09190] Fps is (10 sec: 39311.3, 60 sec: 43142.7, 300 sec: 43097.9). Total num frames: 4312989696. Throughput: 0: 43182.9. Samples: 591924320. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 16:30:37,925][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:30:39,094][09423] Updated weights for policy 0, policy_version 263247 (0.0038) [2024-06-28 16:30:41,987][09423] Updated weights for policy 0, policy_version 263257 (0.0035) [2024-06-28 16:30:42,921][09190] Fps is (10 sec: 45874.9, 60 sec: 43417.6, 300 sec: 43375.9). Total num frames: 4313251840. Throughput: 0: 43225.0. Samples: 592052080. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 16:30:42,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 16:30:46,657][09423] Updated weights for policy 0, policy_version 263267 (0.0040) [2024-06-28 16:30:47,921][09190] Fps is (10 sec: 44247.9, 60 sec: 42871.4, 300 sec: 43209.3). Total num frames: 4313432064. Throughput: 0: 43146.7. Samples: 592316880. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 16:30:47,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 16:30:48,859][09403] Signal inference workers to stop experience collection... (8150 times) [2024-06-28 16:30:48,859][09403] Signal inference workers to resume experience collection... (8150 times) [2024-06-28 16:30:48,894][09423] InferenceWorker_p0-w0: stopping experience collection (8150 times) [2024-06-28 16:30:48,895][09423] InferenceWorker_p0-w0: resuming experience collection (8150 times) [2024-06-28 16:30:49,359][09423] Updated weights for policy 0, policy_version 263277 (0.0041) [2024-06-28 16:30:52,921][09190] Fps is (10 sec: 37683.3, 60 sec: 43144.6, 300 sec: 43098.3). Total num frames: 4313628672. Throughput: 0: 43178.3. Samples: 592575180. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 16:30:52,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 16:30:54,186][09423] Updated weights for policy 0, policy_version 263287 (0.0023) [2024-06-28 16:30:56,842][09423] Updated weights for policy 0, policy_version 263297 (0.0034) [2024-06-28 16:30:57,921][09190] Fps is (10 sec: 45875.2, 60 sec: 43417.7, 300 sec: 43320.4). Total num frames: 4313890816. Throughput: 0: 43147.1. Samples: 592703840. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 16:30:57,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 16:31:01,671][09423] Updated weights for policy 0, policy_version 263307 (0.0023) [2024-06-28 16:31:02,921][09190] Fps is (10 sec: 42597.8, 60 sec: 42598.4, 300 sec: 43098.2). Total num frames: 4314054656. Throughput: 0: 43333.0. Samples: 592969880. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 16:31:02,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:31:04,217][09423] Updated weights for policy 0, policy_version 263317 (0.0027) [2024-06-28 16:31:07,921][09190] Fps is (10 sec: 39321.4, 60 sec: 43144.5, 300 sec: 43153.8). Total num frames: 4314284032. Throughput: 0: 43102.6. Samples: 593225780. Policy #0 lag: (min: 1.0, avg: 9.8, max: 20.0) [2024-06-28 16:31:07,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 16:31:09,137][09423] Updated weights for policy 0, policy_version 263327 (0.0036) [2024-06-28 16:31:11,961][09423] Updated weights for policy 0, policy_version 263337 (0.0040) [2024-06-28 16:31:12,921][09190] Fps is (10 sec: 47513.7, 60 sec: 42871.5, 300 sec: 43264.9). Total num frames: 4314529792. Throughput: 0: 43124.8. Samples: 593350960. Policy #0 lag: (min: 1.0, avg: 9.8, max: 20.0) [2024-06-28 16:31:12,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 16:31:16,879][09423] Updated weights for policy 0, policy_version 263347 (0.0031) [2024-06-28 16:31:17,921][09190] Fps is (10 sec: 44236.8, 60 sec: 43146.3, 300 sec: 43153.8). Total num frames: 4314726400. Throughput: 0: 43257.6. Samples: 593617920. Policy #0 lag: (min: 1.0, avg: 9.8, max: 20.0) [2024-06-28 16:31:17,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 16:31:19,734][09423] Updated weights for policy 0, policy_version 263357 (0.0032) [2024-06-28 16:31:22,922][09190] Fps is (10 sec: 40959.6, 60 sec: 43144.4, 300 sec: 43098.2). Total num frames: 4314939392. Throughput: 0: 43263.2. Samples: 593871060. Policy #0 lag: (min: 1.0, avg: 9.8, max: 20.0) [2024-06-28 16:31:22,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:31:24,697][09423] Updated weights for policy 0, policy_version 263367 (0.0033) [2024-06-28 16:31:27,119][09423] Updated weights for policy 0, policy_version 263377 (0.0039) [2024-06-28 16:31:27,921][09190] Fps is (10 sec: 45875.5, 60 sec: 43144.5, 300 sec: 43264.9). Total num frames: 4315185152. Throughput: 0: 43272.8. Samples: 593999360. Policy #0 lag: (min: 1.0, avg: 9.8, max: 20.0) [2024-06-28 16:31:27,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 16:31:32,197][09423] Updated weights for policy 0, policy_version 263387 (0.0028) [2024-06-28 16:31:32,921][09190] Fps is (10 sec: 40960.7, 60 sec: 42598.3, 300 sec: 43098.3). Total num frames: 4315348992. Throughput: 0: 43122.3. Samples: 594257380. Policy #0 lag: (min: 1.0, avg: 9.8, max: 20.0) [2024-06-28 16:31:32,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 16:31:34,767][09423] Updated weights for policy 0, policy_version 263397 (0.0038) [2024-06-28 16:31:37,921][09190] Fps is (10 sec: 40959.9, 60 sec: 43419.4, 300 sec: 43209.3). Total num frames: 4315594752. Throughput: 0: 43039.0. Samples: 594511940. Policy #0 lag: (min: 1.0, avg: 9.8, max: 20.0) [2024-06-28 16:31:37,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 16:31:39,686][09423] Updated weights for policy 0, policy_version 263407 (0.0032) [2024-06-28 16:31:42,180][09423] Updated weights for policy 0, policy_version 263417 (0.0026) [2024-06-28 16:31:42,921][09190] Fps is (10 sec: 49152.2, 60 sec: 43144.6, 300 sec: 43320.4). Total num frames: 4315840512. Throughput: 0: 43178.3. Samples: 594646860. Policy #0 lag: (min: 1.0, avg: 9.8, max: 20.0) [2024-06-28 16:31:42,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:31:47,186][09423] Updated weights for policy 0, policy_version 263427 (0.0036) [2024-06-28 16:31:47,921][09190] Fps is (10 sec: 40960.3, 60 sec: 42871.5, 300 sec: 43153.8). Total num frames: 4316004352. Throughput: 0: 43061.9. Samples: 594907660. Policy #0 lag: (min: 1.0, avg: 9.8, max: 20.0) [2024-06-28 16:31:47,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:31:50,045][09423] Updated weights for policy 0, policy_version 263437 (0.0031) [2024-06-28 16:31:52,921][09190] Fps is (10 sec: 40959.8, 60 sec: 43690.6, 300 sec: 43209.4). Total num frames: 4316250112. Throughput: 0: 43021.9. Samples: 595161760. Policy #0 lag: (min: 1.0, avg: 9.8, max: 20.0) [2024-06-28 16:31:52,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 16:31:54,939][09423] Updated weights for policy 0, policy_version 263447 (0.0033) [2024-06-28 16:31:57,550][09423] Updated weights for policy 0, policy_version 263457 (0.0036) [2024-06-28 16:31:57,921][09190] Fps is (10 sec: 47513.4, 60 sec: 43144.5, 300 sec: 43264.9). Total num frames: 4316479488. Throughput: 0: 43139.6. Samples: 595292240. Policy #0 lag: (min: 1.0, avg: 9.8, max: 20.0) [2024-06-28 16:31:57,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 16:32:02,748][09423] Updated weights for policy 0, policy_version 263467 (0.0025) [2024-06-28 16:32:02,921][09190] Fps is (10 sec: 39321.8, 60 sec: 43144.6, 300 sec: 43042.7). Total num frames: 4316643328. Throughput: 0: 43050.4. Samples: 595555180. Policy #0 lag: (min: 1.0, avg: 9.8, max: 20.0) [2024-06-28 16:32:02,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 16:32:05,398][09423] Updated weights for policy 0, policy_version 263477 (0.0034) [2024-06-28 16:32:07,921][09190] Fps is (10 sec: 40960.2, 60 sec: 43417.7, 300 sec: 43153.8). Total num frames: 4316889088. Throughput: 0: 43002.0. Samples: 595806140. Policy #0 lag: (min: 1.0, avg: 9.8, max: 20.0) [2024-06-28 16:32:07,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 16:32:10,164][09423] Updated weights for policy 0, policy_version 263487 (0.0024) [2024-06-28 16:32:12,922][09190] Fps is (10 sec: 45874.4, 60 sec: 42871.4, 300 sec: 43153.8). Total num frames: 4317102080. Throughput: 0: 43151.9. Samples: 595941200. Policy #0 lag: (min: 1.0, avg: 9.8, max: 20.0) [2024-06-28 16:32:12,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 16:32:12,991][09403] Signal inference workers to stop experience collection... (8200 times) [2024-06-28 16:32:13,042][09423] InferenceWorker_p0-w0: stopping experience collection (8200 times) [2024-06-28 16:32:13,050][09403] Signal inference workers to resume experience collection... (8200 times) [2024-06-28 16:32:13,060][09423] InferenceWorker_p0-w0: resuming experience collection (8200 times) [2024-06-28 16:32:13,182][09423] Updated weights for policy 0, policy_version 263497 (0.0026) [2024-06-28 16:32:17,594][09423] Updated weights for policy 0, policy_version 263507 (0.0036) [2024-06-28 16:32:17,922][09190] Fps is (10 sec: 40959.2, 60 sec: 42871.4, 300 sec: 43098.2). Total num frames: 4317298688. Throughput: 0: 43062.5. Samples: 596195200. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-28 16:32:17,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 16:32:17,939][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000263507_4317298688.pth... [2024-06-28 16:32:18,004][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000262876_4306960384.pth [2024-06-28 16:32:20,625][09423] Updated weights for policy 0, policy_version 263517 (0.0025) [2024-06-28 16:32:22,924][09190] Fps is (10 sec: 44226.1, 60 sec: 43415.9, 300 sec: 43153.4). Total num frames: 4317544448. Throughput: 0: 43157.6. Samples: 596454140. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-28 16:32:22,933][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:32:25,187][09423] Updated weights for policy 0, policy_version 263527 (0.0032) [2024-06-28 16:32:27,921][09190] Fps is (10 sec: 45875.6, 60 sec: 42871.4, 300 sec: 43209.3). Total num frames: 4317757440. Throughput: 0: 43105.7. Samples: 596586620. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-28 16:32:27,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:32:28,171][09423] Updated weights for policy 0, policy_version 263537 (0.0043) [2024-06-28 16:32:32,595][09423] Updated weights for policy 0, policy_version 263547 (0.0028) [2024-06-28 16:32:32,921][09190] Fps is (10 sec: 40970.1, 60 sec: 43417.5, 300 sec: 43153.8). Total num frames: 4317954048. Throughput: 0: 43216.8. Samples: 596852420. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-28 16:32:32,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 16:32:35,776][09423] Updated weights for policy 0, policy_version 263557 (0.0038) [2024-06-28 16:32:37,923][09190] Fps is (10 sec: 42591.1, 60 sec: 43143.3, 300 sec: 43153.5). Total num frames: 4318183424. Throughput: 0: 43095.6. Samples: 597101140. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-28 16:32:37,924][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 16:32:40,385][09423] Updated weights for policy 0, policy_version 263567 (0.0026) [2024-06-28 16:32:42,921][09190] Fps is (10 sec: 45875.6, 60 sec: 42871.4, 300 sec: 43209.4). Total num frames: 4318412800. Throughput: 0: 43212.9. Samples: 597236820. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-28 16:32:42,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 16:32:43,203][09423] Updated weights for policy 0, policy_version 263577 (0.0031) [2024-06-28 16:32:47,921][09190] Fps is (10 sec: 40967.1, 60 sec: 43144.5, 300 sec: 43098.2). Total num frames: 4318593024. Throughput: 0: 43067.5. Samples: 597493220. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-28 16:32:47,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 16:32:47,952][09423] Updated weights for policy 0, policy_version 263587 (0.0046) [2024-06-28 16:32:51,514][09423] Updated weights for policy 0, policy_version 263597 (0.0030) [2024-06-28 16:32:52,925][09190] Fps is (10 sec: 40943.9, 60 sec: 42868.7, 300 sec: 43097.7). Total num frames: 4318822400. Throughput: 0: 43131.3. Samples: 597747220. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-28 16:32:52,926][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:32:55,400][09423] Updated weights for policy 0, policy_version 263607 (0.0045) [2024-06-28 16:32:57,921][09190] Fps is (10 sec: 47513.6, 60 sec: 43144.5, 300 sec: 43264.9). Total num frames: 4319068160. Throughput: 0: 43093.0. Samples: 597880380. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-28 16:32:57,922][09190] Avg episode reward: [(0, '0.733')] [2024-06-28 16:32:58,780][09423] Updated weights for policy 0, policy_version 263617 (0.0035) [2024-06-28 16:33:02,778][09423] Updated weights for policy 0, policy_version 263627 (0.0029) [2024-06-28 16:33:02,921][09190] Fps is (10 sec: 44253.6, 60 sec: 43690.5, 300 sec: 43264.8). Total num frames: 4319264768. Throughput: 0: 43338.3. Samples: 598145420. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-28 16:33:02,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:33:06,398][09423] Updated weights for policy 0, policy_version 263637 (0.0035) [2024-06-28 16:33:07,922][09190] Fps is (10 sec: 40959.6, 60 sec: 43144.4, 300 sec: 43153.8). Total num frames: 4319477760. Throughput: 0: 43155.2. Samples: 598396020. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-28 16:33:07,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:33:10,510][09423] Updated weights for policy 0, policy_version 263647 (0.0035) [2024-06-28 16:33:12,921][09190] Fps is (10 sec: 42598.8, 60 sec: 43144.6, 300 sec: 43154.2). Total num frames: 4319690752. Throughput: 0: 43166.7. Samples: 598529120. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-28 16:33:12,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 16:33:13,834][09423] Updated weights for policy 0, policy_version 263657 (0.0032) [2024-06-28 16:33:17,922][09190] Fps is (10 sec: 42598.3, 60 sec: 43417.6, 300 sec: 43153.8). Total num frames: 4319903744. Throughput: 0: 43048.4. Samples: 598789600. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-28 16:33:17,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 16:33:18,237][09423] Updated weights for policy 0, policy_version 263667 (0.0028) [2024-06-28 16:33:21,367][09423] Updated weights for policy 0, policy_version 263677 (0.0037) [2024-06-28 16:33:22,921][09190] Fps is (10 sec: 44236.4, 60 sec: 43146.3, 300 sec: 43153.8). Total num frames: 4320133120. Throughput: 0: 42983.4. Samples: 599035320. Policy #0 lag: (min: 0.0, avg: 11.2, max: 21.0) [2024-06-28 16:33:22,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 16:33:25,679][09423] Updated weights for policy 0, policy_version 263687 (0.0041) [2024-06-28 16:33:27,308][09403] Signal inference workers to stop experience collection... (8250 times) [2024-06-28 16:33:27,308][09403] Signal inference workers to resume experience collection... (8250 times) [2024-06-28 16:33:27,354][09423] InferenceWorker_p0-w0: stopping experience collection (8250 times) [2024-06-28 16:33:27,354][09423] InferenceWorker_p0-w0: resuming experience collection (8250 times) [2024-06-28 16:33:27,921][09190] Fps is (10 sec: 45875.4, 60 sec: 43417.6, 300 sec: 43209.3). Total num frames: 4320362496. Throughput: 0: 43026.1. Samples: 599173000. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 16:33:27,925][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:33:28,847][09423] Updated weights for policy 0, policy_version 263697 (0.0031) [2024-06-28 16:33:32,921][09190] Fps is (10 sec: 40960.7, 60 sec: 43144.6, 300 sec: 43153.8). Total num frames: 4320542720. Throughput: 0: 43206.7. Samples: 599437520. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 16:33:32,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 16:33:33,074][09423] Updated weights for policy 0, policy_version 263707 (0.0031) [2024-06-28 16:33:36,767][09423] Updated weights for policy 0, policy_version 263717 (0.0036) [2024-06-28 16:33:37,921][09190] Fps is (10 sec: 40960.3, 60 sec: 43145.8, 300 sec: 43153.8). Total num frames: 4320772096. Throughput: 0: 43180.2. Samples: 599690160. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 16:33:37,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:33:40,742][09423] Updated weights for policy 0, policy_version 263727 (0.0027) [2024-06-28 16:33:42,921][09190] Fps is (10 sec: 45874.7, 60 sec: 43144.5, 300 sec: 43209.3). Total num frames: 4321001472. Throughput: 0: 43145.3. Samples: 599821920. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 16:33:42,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 16:33:44,483][09423] Updated weights for policy 0, policy_version 263737 (0.0032) [2024-06-28 16:33:47,921][09190] Fps is (10 sec: 44236.7, 60 sec: 43690.6, 300 sec: 43209.3). Total num frames: 4321214464. Throughput: 0: 42960.5. Samples: 600078640. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 16:33:47,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:33:48,144][09423] Updated weights for policy 0, policy_version 263747 (0.0045) [2024-06-28 16:33:52,090][09423] Updated weights for policy 0, policy_version 263757 (0.0027) [2024-06-28 16:33:52,921][09190] Fps is (10 sec: 40960.3, 60 sec: 43147.4, 300 sec: 43098.3). Total num frames: 4321411072. Throughput: 0: 43148.1. Samples: 600337680. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 16:33:52,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:33:55,983][09423] Updated weights for policy 0, policy_version 263767 (0.0023) [2024-06-28 16:33:57,921][09190] Fps is (10 sec: 42598.8, 60 sec: 42871.5, 300 sec: 43153.8). Total num frames: 4321640448. Throughput: 0: 43180.5. Samples: 600472240. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 16:33:57,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:33:59,462][09423] Updated weights for policy 0, policy_version 263777 (0.0027) [2024-06-28 16:34:02,921][09190] Fps is (10 sec: 44236.8, 60 sec: 43144.6, 300 sec: 43153.8). Total num frames: 4321853440. Throughput: 0: 43077.0. Samples: 600728060. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 16:34:02,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:34:03,470][09423] Updated weights for policy 0, policy_version 263787 (0.0036) [2024-06-28 16:34:06,779][09423] Updated weights for policy 0, policy_version 263797 (0.0041) [2024-06-28 16:34:07,921][09190] Fps is (10 sec: 42598.0, 60 sec: 43144.6, 300 sec: 43098.3). Total num frames: 4322066432. Throughput: 0: 43404.5. Samples: 600988520. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 16:34:07,926][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:34:11,137][09423] Updated weights for policy 0, policy_version 263807 (0.0036) [2024-06-28 16:34:12,921][09190] Fps is (10 sec: 44236.6, 60 sec: 43417.6, 300 sec: 43209.3). Total num frames: 4322295808. Throughput: 0: 43221.0. Samples: 601117940. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 16:34:12,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:34:14,799][09423] Updated weights for policy 0, policy_version 263817 (0.0029) [2024-06-28 16:34:17,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42871.5, 300 sec: 43098.2). Total num frames: 4322476032. Throughput: 0: 43066.1. Samples: 601375500. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 16:34:17,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 16:34:17,964][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000263824_4322492416.pth... [2024-06-28 16:34:18,029][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000263192_4312137728.pth [2024-06-28 16:34:18,812][09423] Updated weights for policy 0, policy_version 263827 (0.0029) [2024-06-28 16:34:22,266][09423] Updated weights for policy 0, policy_version 263837 (0.0035) [2024-06-28 16:34:22,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42871.6, 300 sec: 43098.3). Total num frames: 4322705408. Throughput: 0: 43128.5. Samples: 601630940. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 16:34:22,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:34:26,290][09423] Updated weights for policy 0, policy_version 263847 (0.0039) [2024-06-28 16:34:27,921][09190] Fps is (10 sec: 47513.7, 60 sec: 43144.6, 300 sec: 43265.6). Total num frames: 4322951168. Throughput: 0: 43008.9. Samples: 601757320. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 16:34:27,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 16:34:30,356][09423] Updated weights for policy 0, policy_version 263857 (0.0032) [2024-06-28 16:34:32,921][09190] Fps is (10 sec: 42598.5, 60 sec: 43144.5, 300 sec: 43153.8). Total num frames: 4323131392. Throughput: 0: 43108.6. Samples: 602018520. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 16:34:32,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:34:33,685][09423] Updated weights for policy 0, policy_version 263867 (0.0028) [2024-06-28 16:34:37,734][09423] Updated weights for policy 0, policy_version 263877 (0.0031) [2024-06-28 16:34:37,921][09190] Fps is (10 sec: 40959.7, 60 sec: 43144.5, 300 sec: 43098.2). Total num frames: 4323360768. Throughput: 0: 43206.6. Samples: 602281980. Policy #0 lag: (min: 0.0, avg: 11.4, max: 22.0) [2024-06-28 16:34:37,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 16:34:41,381][09423] Updated weights for policy 0, policy_version 263887 (0.0028) [2024-06-28 16:34:42,921][09190] Fps is (10 sec: 45875.2, 60 sec: 43144.6, 300 sec: 43153.8). Total num frames: 4323590144. Throughput: 0: 43103.6. Samples: 602411900. Policy #0 lag: (min: 0.0, avg: 11.4, max: 22.0) [2024-06-28 16:34:42,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:34:45,076][09423] Updated weights for policy 0, policy_version 263897 (0.0028) [2024-06-28 16:34:47,921][09190] Fps is (10 sec: 42598.7, 60 sec: 42871.5, 300 sec: 43209.3). Total num frames: 4323786752. Throughput: 0: 43097.3. Samples: 602667440. Policy #0 lag: (min: 0.0, avg: 11.4, max: 22.0) [2024-06-28 16:34:47,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 16:34:49,169][09423] Updated weights for policy 0, policy_version 263907 (0.0046) [2024-06-28 16:34:52,921][09190] Fps is (10 sec: 39321.6, 60 sec: 42871.5, 300 sec: 43042.7). Total num frames: 4323983360. Throughput: 0: 43024.1. Samples: 602924600. Policy #0 lag: (min: 0.0, avg: 11.4, max: 22.0) [2024-06-28 16:34:52,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 16:34:53,102][09423] Updated weights for policy 0, policy_version 263917 (0.0047) [2024-06-28 16:34:57,030][09423] Updated weights for policy 0, policy_version 263927 (0.0029) [2024-06-28 16:34:57,924][09190] Fps is (10 sec: 44225.9, 60 sec: 43142.7, 300 sec: 43153.4). Total num frames: 4324229120. Throughput: 0: 43007.4. Samples: 603053380. Policy #0 lag: (min: 0.0, avg: 11.4, max: 22.0) [2024-06-28 16:34:57,933][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:35:00,618][09423] Updated weights for policy 0, policy_version 263937 (0.0024) [2024-06-28 16:35:02,922][09190] Fps is (10 sec: 44235.5, 60 sec: 42871.3, 300 sec: 43153.8). Total num frames: 4324425728. Throughput: 0: 43040.2. Samples: 603312320. Policy #0 lag: (min: 0.0, avg: 11.4, max: 22.0) [2024-06-28 16:35:02,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:35:04,254][09403] Signal inference workers to stop experience collection... (8300 times) [2024-06-28 16:35:04,254][09403] Signal inference workers to resume experience collection... (8300 times) [2024-06-28 16:35:04,300][09423] InferenceWorker_p0-w0: stopping experience collection (8300 times) [2024-06-28 16:35:04,300][09423] InferenceWorker_p0-w0: resuming experience collection (8300 times) [2024-06-28 16:35:04,387][09423] Updated weights for policy 0, policy_version 263947 (0.0037) [2024-06-28 16:35:07,921][09190] Fps is (10 sec: 40970.3, 60 sec: 42871.5, 300 sec: 42987.2). Total num frames: 4324638720. Throughput: 0: 42988.9. Samples: 603565440. Policy #0 lag: (min: 0.0, avg: 11.4, max: 22.0) [2024-06-28 16:35:07,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 16:35:08,412][09423] Updated weights for policy 0, policy_version 263957 (0.0024) [2024-06-28 16:35:11,960][09423] Updated weights for policy 0, policy_version 263967 (0.0033) [2024-06-28 16:35:12,921][09190] Fps is (10 sec: 45875.9, 60 sec: 43144.5, 300 sec: 43209.7). Total num frames: 4324884480. Throughput: 0: 42951.9. Samples: 603690160. Policy #0 lag: (min: 0.0, avg: 11.4, max: 22.0) [2024-06-28 16:35:12,922][09190] Avg episode reward: [(0, '0.738')] [2024-06-28 16:35:16,121][09423] Updated weights for policy 0, policy_version 263977 (0.0044) [2024-06-28 16:35:17,921][09190] Fps is (10 sec: 42598.5, 60 sec: 43144.6, 300 sec: 43098.3). Total num frames: 4325064704. Throughput: 0: 43031.5. Samples: 603954940. Policy #0 lag: (min: 0.0, avg: 11.4, max: 22.0) [2024-06-28 16:35:17,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:35:19,564][09423] Updated weights for policy 0, policy_version 263987 (0.0030) [2024-06-28 16:35:22,921][09190] Fps is (10 sec: 42599.0, 60 sec: 43417.6, 300 sec: 43098.2). Total num frames: 4325310464. Throughput: 0: 42853.9. Samples: 604210400. Policy #0 lag: (min: 0.0, avg: 11.4, max: 22.0) [2024-06-28 16:35:22,922][09190] Avg episode reward: [(0, '0.738')] [2024-06-28 16:35:23,420][09423] Updated weights for policy 0, policy_version 263997 (0.0037) [2024-06-28 16:35:27,426][09423] Updated weights for policy 0, policy_version 264007 (0.0029) [2024-06-28 16:35:27,924][09190] Fps is (10 sec: 47501.4, 60 sec: 43142.7, 300 sec: 43208.9). Total num frames: 4325539840. Throughput: 0: 42948.2. Samples: 604344680. Policy #0 lag: (min: 0.0, avg: 11.4, max: 22.0) [2024-06-28 16:35:27,925][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 16:35:31,365][09423] Updated weights for policy 0, policy_version 264017 (0.0036) [2024-06-28 16:35:32,921][09190] Fps is (10 sec: 40960.0, 60 sec: 43144.5, 300 sec: 43154.2). Total num frames: 4325720064. Throughput: 0: 42941.4. Samples: 604599800. Policy #0 lag: (min: 0.0, avg: 11.4, max: 22.0) [2024-06-28 16:35:32,922][09190] Avg episode reward: [(0, '0.738')] [2024-06-28 16:35:34,883][09423] Updated weights for policy 0, policy_version 264027 (0.0041) [2024-06-28 16:35:37,921][09190] Fps is (10 sec: 39331.5, 60 sec: 42871.5, 300 sec: 42987.2). Total num frames: 4325933056. Throughput: 0: 42892.8. Samples: 604854780. Policy #0 lag: (min: 0.0, avg: 11.4, max: 22.0) [2024-06-28 16:35:37,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:35:38,911][09423] Updated weights for policy 0, policy_version 264037 (0.0033) [2024-06-28 16:35:42,228][09423] Updated weights for policy 0, policy_version 264047 (0.0034) [2024-06-28 16:35:42,921][09190] Fps is (10 sec: 45874.8, 60 sec: 43144.5, 300 sec: 43209.3). Total num frames: 4326178816. Throughput: 0: 43091.7. Samples: 604992400. Policy #0 lag: (min: 0.0, avg: 11.4, max: 22.0) [2024-06-28 16:35:42,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 16:35:46,687][09423] Updated weights for policy 0, policy_version 264057 (0.0031) [2024-06-28 16:35:47,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42598.4, 300 sec: 43098.2). Total num frames: 4326342656. Throughput: 0: 43044.3. Samples: 605249300. Policy #0 lag: (min: 0.0, avg: 11.4, max: 22.0) [2024-06-28 16:35:47,922][09190] Avg episode reward: [(0, '0.738')] [2024-06-28 16:35:49,988][09423] Updated weights for policy 0, policy_version 264067 (0.0042) [2024-06-28 16:35:52,921][09190] Fps is (10 sec: 42598.4, 60 sec: 43690.6, 300 sec: 43098.2). Total num frames: 4326604800. Throughput: 0: 43093.3. Samples: 605504640. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2024-06-28 16:35:52,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 16:35:54,195][09423] Updated weights for policy 0, policy_version 264077 (0.0036) [2024-06-28 16:35:57,910][09423] Updated weights for policy 0, policy_version 264087 (0.0037) [2024-06-28 16:35:57,921][09190] Fps is (10 sec: 45874.9, 60 sec: 42873.2, 300 sec: 43209.3). Total num frames: 4326801408. Throughput: 0: 43234.7. Samples: 605635720. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2024-06-28 16:35:57,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 16:36:01,450][09423] Updated weights for policy 0, policy_version 264097 (0.0029) [2024-06-28 16:36:02,921][09190] Fps is (10 sec: 40960.3, 60 sec: 43144.7, 300 sec: 43153.8). Total num frames: 4327014400. Throughput: 0: 43136.0. Samples: 605896060. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2024-06-28 16:36:02,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:36:05,302][09423] Updated weights for policy 0, policy_version 264107 (0.0041) [2024-06-28 16:36:07,921][09190] Fps is (10 sec: 42598.8, 60 sec: 43144.5, 300 sec: 43042.7). Total num frames: 4327227392. Throughput: 0: 43339.1. Samples: 606160660. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2024-06-28 16:36:07,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:36:09,327][09423] Updated weights for policy 0, policy_version 264117 (0.0037) [2024-06-28 16:36:11,974][09403] Signal inference workers to stop experience collection... (8350 times) [2024-06-28 16:36:11,974][09403] Signal inference workers to resume experience collection... (8350 times) [2024-06-28 16:36:12,024][09423] InferenceWorker_p0-w0: stopping experience collection (8350 times) [2024-06-28 16:36:12,024][09423] InferenceWorker_p0-w0: resuming experience collection (8350 times) [2024-06-28 16:36:12,679][09423] Updated weights for policy 0, policy_version 264127 (0.0028) [2024-06-28 16:36:12,921][09190] Fps is (10 sec: 45875.3, 60 sec: 43144.6, 300 sec: 43209.3). Total num frames: 4327473152. Throughput: 0: 43254.9. Samples: 606291040. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2024-06-28 16:36:12,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 16:36:16,928][09423] Updated weights for policy 0, policy_version 264137 (0.0032) [2024-06-28 16:36:17,921][09190] Fps is (10 sec: 42598.5, 60 sec: 43144.6, 300 sec: 43098.3). Total num frames: 4327653376. Throughput: 0: 43239.1. Samples: 606545560. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2024-06-28 16:36:17,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:36:18,022][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000264140_4327669760.pth... [2024-06-28 16:36:18,074][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000263507_4317298688.pth [2024-06-28 16:36:20,148][09423] Updated weights for policy 0, policy_version 264147 (0.0040) [2024-06-28 16:36:22,921][09190] Fps is (10 sec: 40959.5, 60 sec: 42871.4, 300 sec: 43042.7). Total num frames: 4327882752. Throughput: 0: 43283.5. Samples: 606802540. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2024-06-28 16:36:22,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 16:36:24,595][09423] Updated weights for policy 0, policy_version 264157 (0.0048) [2024-06-28 16:36:27,807][09423] Updated weights for policy 0, policy_version 264167 (0.0036) [2024-06-28 16:36:27,924][09190] Fps is (10 sec: 45864.0, 60 sec: 42871.6, 300 sec: 43264.5). Total num frames: 4328112128. Throughput: 0: 43077.8. Samples: 606931000. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2024-06-28 16:36:27,924][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:36:32,235][09423] Updated weights for policy 0, policy_version 264177 (0.0035) [2024-06-28 16:36:32,924][09190] Fps is (10 sec: 42588.2, 60 sec: 43142.7, 300 sec: 43097.9). Total num frames: 4328308736. Throughput: 0: 43125.6. Samples: 607190060. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2024-06-28 16:36:32,924][09190] Avg episode reward: [(0, '0.732')] [2024-06-28 16:36:35,714][09423] Updated weights for policy 0, policy_version 264187 (0.0037) [2024-06-28 16:36:37,921][09190] Fps is (10 sec: 42608.6, 60 sec: 43417.6, 300 sec: 43042.7). Total num frames: 4328538112. Throughput: 0: 43154.3. Samples: 607446580. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2024-06-28 16:36:37,922][09190] Avg episode reward: [(0, '0.730')] [2024-06-28 16:36:39,556][09423] Updated weights for policy 0, policy_version 264197 (0.0038) [2024-06-28 16:36:42,921][09190] Fps is (10 sec: 44247.5, 60 sec: 42871.5, 300 sec: 43209.3). Total num frames: 4328751104. Throughput: 0: 43115.5. Samples: 607575920. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2024-06-28 16:36:42,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 16:36:43,100][09423] Updated weights for policy 0, policy_version 264207 (0.0036) [2024-06-28 16:36:47,153][09423] Updated weights for policy 0, policy_version 264217 (0.0031) [2024-06-28 16:36:47,921][09190] Fps is (10 sec: 42598.1, 60 sec: 43690.6, 300 sec: 43098.2). Total num frames: 4328964096. Throughput: 0: 43263.9. Samples: 607842940. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2024-06-28 16:36:47,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 16:36:50,644][09423] Updated weights for policy 0, policy_version 264227 (0.0035) [2024-06-28 16:36:52,921][09190] Fps is (10 sec: 44237.0, 60 sec: 43144.6, 300 sec: 43098.3). Total num frames: 4329193472. Throughput: 0: 42901.3. Samples: 608091220. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2024-06-28 16:36:52,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:36:55,022][09423] Updated weights for policy 0, policy_version 264237 (0.0032) [2024-06-28 16:36:57,921][09190] Fps is (10 sec: 42598.8, 60 sec: 43144.6, 300 sec: 43209.3). Total num frames: 4329390080. Throughput: 0: 43004.9. Samples: 608226260. Policy #0 lag: (min: 1.0, avg: 9.2, max: 20.0) [2024-06-28 16:36:57,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:36:58,168][09423] Updated weights for policy 0, policy_version 264247 (0.0033) [2024-06-28 16:37:02,417][09423] Updated weights for policy 0, policy_version 264257 (0.0029) [2024-06-28 16:37:02,921][09190] Fps is (10 sec: 40960.0, 60 sec: 43144.5, 300 sec: 43098.2). Total num frames: 4329603072. Throughput: 0: 43123.9. Samples: 608486140. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 16:37:02,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 16:37:05,734][09423] Updated weights for policy 0, policy_version 264267 (0.0036) [2024-06-28 16:37:07,921][09190] Fps is (10 sec: 44236.2, 60 sec: 43417.5, 300 sec: 43153.8). Total num frames: 4329832448. Throughput: 0: 42983.5. Samples: 608736800. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 16:37:07,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:37:10,305][09423] Updated weights for policy 0, policy_version 264277 (0.0028) [2024-06-28 16:37:12,921][09190] Fps is (10 sec: 44237.0, 60 sec: 42871.4, 300 sec: 43209.4). Total num frames: 4330045440. Throughput: 0: 43101.4. Samples: 608870460. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 16:37:12,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:37:13,388][09423] Updated weights for policy 0, policy_version 264287 (0.0028) [2024-06-28 16:37:17,807][09423] Updated weights for policy 0, policy_version 264297 (0.0035) [2024-06-28 16:37:17,921][09190] Fps is (10 sec: 40960.7, 60 sec: 43144.5, 300 sec: 43043.1). Total num frames: 4330242048. Throughput: 0: 43102.4. Samples: 609129560. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 16:37:17,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:37:20,838][09423] Updated weights for policy 0, policy_version 264307 (0.0035) [2024-06-28 16:37:22,922][09190] Fps is (10 sec: 44236.0, 60 sec: 43417.5, 300 sec: 43153.8). Total num frames: 4330487808. Throughput: 0: 43090.5. Samples: 609385660. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 16:37:22,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:37:25,193][09423] Updated weights for policy 0, policy_version 264317 (0.0026) [2024-06-28 16:37:27,921][09190] Fps is (10 sec: 45875.1, 60 sec: 43146.3, 300 sec: 43209.3). Total num frames: 4330700800. Throughput: 0: 43210.3. Samples: 609520380. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 16:37:27,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 16:37:28,478][09423] Updated weights for policy 0, policy_version 264327 (0.0025) [2024-06-28 16:37:29,201][09403] Signal inference workers to stop experience collection... (8400 times) [2024-06-28 16:37:29,202][09403] Signal inference workers to resume experience collection... (8400 times) [2024-06-28 16:37:29,246][09423] InferenceWorker_p0-w0: stopping experience collection (8400 times) [2024-06-28 16:37:29,246][09423] InferenceWorker_p0-w0: resuming experience collection (8400 times) [2024-06-28 16:37:32,691][09423] Updated weights for policy 0, policy_version 264337 (0.0031) [2024-06-28 16:37:32,921][09190] Fps is (10 sec: 40960.2, 60 sec: 43146.2, 300 sec: 43098.5). Total num frames: 4330897408. Throughput: 0: 42914.2. Samples: 609774080. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 16:37:32,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 16:37:36,215][09423] Updated weights for policy 0, policy_version 264347 (0.0033) [2024-06-28 16:37:37,923][09190] Fps is (10 sec: 42592.7, 60 sec: 43143.6, 300 sec: 43098.1). Total num frames: 4331126784. Throughput: 0: 43035.2. Samples: 610027860. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 16:37:37,923][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:37:40,520][09423] Updated weights for policy 0, policy_version 264357 (0.0033) [2024-06-28 16:37:42,921][09190] Fps is (10 sec: 42599.0, 60 sec: 42871.5, 300 sec: 43153.8). Total num frames: 4331323392. Throughput: 0: 43012.9. Samples: 610161840. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 16:37:42,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:37:43,622][09423] Updated weights for policy 0, policy_version 264367 (0.0034) [2024-06-28 16:37:47,921][09190] Fps is (10 sec: 40965.2, 60 sec: 42871.5, 300 sec: 43098.8). Total num frames: 4331536384. Throughput: 0: 42914.7. Samples: 610417300. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 16:37:47,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 16:37:48,055][09423] Updated weights for policy 0, policy_version 264377 (0.0027) [2024-06-28 16:37:51,284][09423] Updated weights for policy 0, policy_version 264387 (0.0036) [2024-06-28 16:37:52,921][09190] Fps is (10 sec: 44236.9, 60 sec: 42871.5, 300 sec: 43042.7). Total num frames: 4331765760. Throughput: 0: 43077.5. Samples: 610675280. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 16:37:52,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 16:37:55,809][09423] Updated weights for policy 0, policy_version 264397 (0.0034) [2024-06-28 16:37:57,921][09190] Fps is (10 sec: 44236.5, 60 sec: 43144.4, 300 sec: 43098.3). Total num frames: 4331978752. Throughput: 0: 43106.5. Samples: 610810260. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 16:37:57,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 16:37:58,846][09423] Updated weights for policy 0, policy_version 264407 (0.0034) [2024-06-28 16:38:02,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42871.5, 300 sec: 43042.7). Total num frames: 4332175360. Throughput: 0: 43083.5. Samples: 611068320. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 16:38:02,922][09190] Avg episode reward: [(0, '0.734')] [2024-06-28 16:38:03,134][09423] Updated weights for policy 0, policy_version 264417 (0.0051) [2024-06-28 16:38:06,183][09423] Updated weights for policy 0, policy_version 264427 (0.0030) [2024-06-28 16:38:07,921][09190] Fps is (10 sec: 44237.2, 60 sec: 43144.6, 300 sec: 43153.8). Total num frames: 4332421120. Throughput: 0: 43122.8. Samples: 611326180. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 16:38:07,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 16:38:10,505][09423] Updated weights for policy 0, policy_version 264437 (0.0031) [2024-06-28 16:38:12,921][09190] Fps is (10 sec: 44236.5, 60 sec: 42871.4, 300 sec: 43098.3). Total num frames: 4332617728. Throughput: 0: 43175.9. Samples: 611463300. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 16:38:12,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:38:13,777][09423] Updated weights for policy 0, policy_version 264447 (0.0047) [2024-06-28 16:38:17,921][09190] Fps is (10 sec: 40959.8, 60 sec: 43144.4, 300 sec: 43042.7). Total num frames: 4332830720. Throughput: 0: 43193.4. Samples: 611717780. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 16:38:17,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:38:17,959][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000264456_4332847104.pth... [2024-06-28 16:38:18,003][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000263824_4322492416.pth [2024-06-28 16:38:18,229][09423] Updated weights for policy 0, policy_version 264457 (0.0048) [2024-06-28 16:38:21,366][09423] Updated weights for policy 0, policy_version 264467 (0.0040) [2024-06-28 16:38:22,922][09190] Fps is (10 sec: 45874.1, 60 sec: 43144.5, 300 sec: 43098.2). Total num frames: 4333076480. Throughput: 0: 43293.5. Samples: 611976020. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 16:38:22,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:38:25,895][09423] Updated weights for policy 0, policy_version 264477 (0.0034) [2024-06-28 16:38:27,921][09190] Fps is (10 sec: 44237.0, 60 sec: 42871.4, 300 sec: 43153.8). Total num frames: 4333273088. Throughput: 0: 43241.7. Samples: 612107720. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 16:38:27,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 16:38:28,778][09423] Updated weights for policy 0, policy_version 264487 (0.0035) [2024-06-28 16:38:32,922][09190] Fps is (10 sec: 40960.4, 60 sec: 43144.5, 300 sec: 43098.2). Total num frames: 4333486080. Throughput: 0: 43353.7. Samples: 612368220. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 16:38:32,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 16:38:33,743][09423] Updated weights for policy 0, policy_version 264497 (0.0034) [2024-06-28 16:38:36,550][09423] Updated weights for policy 0, policy_version 264507 (0.0026) [2024-06-28 16:38:37,921][09190] Fps is (10 sec: 44236.7, 60 sec: 43145.4, 300 sec: 43098.3). Total num frames: 4333715456. Throughput: 0: 43458.6. Samples: 612630920. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 16:38:37,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 16:38:41,140][09423] Updated weights for policy 0, policy_version 264517 (0.0030) [2024-06-28 16:38:42,664][09403] Signal inference workers to stop experience collection... (8450 times) [2024-06-28 16:38:42,664][09403] Signal inference workers to resume experience collection... (8450 times) [2024-06-28 16:38:42,704][09423] InferenceWorker_p0-w0: stopping experience collection (8450 times) [2024-06-28 16:38:42,704][09423] InferenceWorker_p0-w0: resuming experience collection (8450 times) [2024-06-28 16:38:42,921][09190] Fps is (10 sec: 44237.1, 60 sec: 43417.5, 300 sec: 43098.2). Total num frames: 4333928448. Throughput: 0: 43307.1. Samples: 612759080. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 16:38:42,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:38:43,995][09423] Updated weights for policy 0, policy_version 264527 (0.0037) [2024-06-28 16:38:47,921][09190] Fps is (10 sec: 42598.4, 60 sec: 43417.6, 300 sec: 43153.8). Total num frames: 4334141440. Throughput: 0: 43346.6. Samples: 613018920. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 16:38:47,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 16:38:48,607][09423] Updated weights for policy 0, policy_version 264537 (0.0030) [2024-06-28 16:38:51,713][09423] Updated weights for policy 0, policy_version 264547 (0.0034) [2024-06-28 16:38:52,921][09190] Fps is (10 sec: 44236.9, 60 sec: 43417.5, 300 sec: 43153.8). Total num frames: 4334370816. Throughput: 0: 43412.9. Samples: 613279760. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 16:38:52,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 16:38:56,153][09423] Updated weights for policy 0, policy_version 264557 (0.0030) [2024-06-28 16:38:57,921][09190] Fps is (10 sec: 40960.4, 60 sec: 42871.6, 300 sec: 43042.7). Total num frames: 4334551040. Throughput: 0: 43182.7. Samples: 613406520. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 16:38:57,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 16:38:59,328][09423] Updated weights for policy 0, policy_version 264567 (0.0037) [2024-06-28 16:39:02,924][09190] Fps is (10 sec: 40949.9, 60 sec: 43415.7, 300 sec: 43097.9). Total num frames: 4334780416. Throughput: 0: 43073.2. Samples: 613656180. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 16:39:02,924][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 16:39:04,050][09423] Updated weights for policy 0, policy_version 264577 (0.0033) [2024-06-28 16:39:06,977][09423] Updated weights for policy 0, policy_version 264587 (0.0045) [2024-06-28 16:39:07,928][09190] Fps is (10 sec: 44208.8, 60 sec: 42867.0, 300 sec: 43041.8). Total num frames: 4334993408. Throughput: 0: 43256.0. Samples: 613922800. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 16:39:07,928][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 16:39:11,365][09423] Updated weights for policy 0, policy_version 264597 (0.0022) [2024-06-28 16:39:12,921][09190] Fps is (10 sec: 42609.3, 60 sec: 43144.6, 300 sec: 43153.8). Total num frames: 4335206400. Throughput: 0: 43152.9. Samples: 614049600. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 16:39:12,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 16:39:14,483][09423] Updated weights for policy 0, policy_version 264607 (0.0032) [2024-06-28 16:39:17,921][09190] Fps is (10 sec: 45903.9, 60 sec: 43690.7, 300 sec: 43209.3). Total num frames: 4335452160. Throughput: 0: 43195.2. Samples: 614312000. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 16:39:17,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:39:18,735][09423] Updated weights for policy 0, policy_version 264617 (0.0027) [2024-06-28 16:39:22,234][09423] Updated weights for policy 0, policy_version 264627 (0.0030) [2024-06-28 16:39:22,921][09190] Fps is (10 sec: 44236.8, 60 sec: 42871.7, 300 sec: 43042.7). Total num frames: 4335648768. Throughput: 0: 42988.5. Samples: 614565400. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 16:39:22,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:39:26,422][09423] Updated weights for policy 0, policy_version 264637 (0.0031) [2024-06-28 16:39:27,921][09190] Fps is (10 sec: 40959.7, 60 sec: 43144.5, 300 sec: 43153.8). Total num frames: 4335861760. Throughput: 0: 43091.1. Samples: 614698180. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-28 16:39:27,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:39:30,088][09423] Updated weights for policy 0, policy_version 264647 (0.0040) [2024-06-28 16:39:32,921][09190] Fps is (10 sec: 44236.6, 60 sec: 43417.7, 300 sec: 43153.8). Total num frames: 4336091136. Throughput: 0: 43108.5. Samples: 614958800. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-28 16:39:32,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 16:39:34,091][09423] Updated weights for policy 0, policy_version 264657 (0.0035) [2024-06-28 16:39:37,502][09423] Updated weights for policy 0, policy_version 264667 (0.0027) [2024-06-28 16:39:37,923][09190] Fps is (10 sec: 44232.3, 60 sec: 43143.8, 300 sec: 43098.1). Total num frames: 4336304128. Throughput: 0: 42943.4. Samples: 615212260. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-28 16:39:37,923][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 16:39:41,545][09423] Updated weights for policy 0, policy_version 264677 (0.0032) [2024-06-28 16:39:42,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42871.5, 300 sec: 43098.3). Total num frames: 4336500736. Throughput: 0: 43185.2. Samples: 615349860. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-28 16:39:42,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 16:39:45,004][09423] Updated weights for policy 0, policy_version 264687 (0.0039) [2024-06-28 16:39:47,921][09190] Fps is (10 sec: 40964.5, 60 sec: 42871.5, 300 sec: 43153.8). Total num frames: 4336713728. Throughput: 0: 43442.9. Samples: 615611000. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-28 16:39:47,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 16:39:48,789][09423] Updated weights for policy 0, policy_version 264697 (0.0027) [2024-06-28 16:39:52,382][09423] Updated weights for policy 0, policy_version 264707 (0.0021) [2024-06-28 16:39:52,921][09190] Fps is (10 sec: 45875.2, 60 sec: 43144.5, 300 sec: 43154.2). Total num frames: 4336959488. Throughput: 0: 42997.5. Samples: 615857420. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-28 16:39:52,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:39:55,183][09403] Signal inference workers to stop experience collection... (8500 times) [2024-06-28 16:39:55,241][09423] InferenceWorker_p0-w0: stopping experience collection (8500 times) [2024-06-28 16:39:55,248][09403] Signal inference workers to resume experience collection... (8500 times) [2024-06-28 16:39:55,260][09423] InferenceWorker_p0-w0: resuming experience collection (8500 times) [2024-06-28 16:39:56,369][09423] Updated weights for policy 0, policy_version 264717 (0.0028) [2024-06-28 16:39:57,921][09190] Fps is (10 sec: 44236.7, 60 sec: 43417.5, 300 sec: 43153.8). Total num frames: 4337156096. Throughput: 0: 43199.9. Samples: 615993600. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-28 16:39:57,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:40:00,199][09423] Updated weights for policy 0, policy_version 264727 (0.0033) [2024-06-28 16:40:02,921][09190] Fps is (10 sec: 40960.0, 60 sec: 43146.3, 300 sec: 43153.8). Total num frames: 4337369088. Throughput: 0: 43157.3. Samples: 616254080. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-28 16:40:02,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:40:04,175][09423] Updated weights for policy 0, policy_version 264737 (0.0036) [2024-06-28 16:40:07,921][09190] Fps is (10 sec: 44236.9, 60 sec: 43422.1, 300 sec: 43098.3). Total num frames: 4337598464. Throughput: 0: 43239.9. Samples: 616511200. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-28 16:40:07,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:40:08,048][09423] Updated weights for policy 0, policy_version 264747 (0.0032) [2024-06-28 16:40:11,851][09423] Updated weights for policy 0, policy_version 264757 (0.0027) [2024-06-28 16:40:12,921][09190] Fps is (10 sec: 42598.4, 60 sec: 43144.5, 300 sec: 43153.8). Total num frames: 4337795072. Throughput: 0: 43195.6. Samples: 616641980. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-28 16:40:12,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:40:15,529][09423] Updated weights for policy 0, policy_version 264767 (0.0028) [2024-06-28 16:40:17,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42871.5, 300 sec: 43098.2). Total num frames: 4338024448. Throughput: 0: 43156.0. Samples: 616900820. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-28 16:40:17,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 16:40:17,941][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000264772_4338024448.pth... [2024-06-28 16:40:17,987][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000264140_4327669760.pth [2024-06-28 16:40:19,196][09423] Updated weights for policy 0, policy_version 264777 (0.0027) [2024-06-28 16:40:22,924][09190] Fps is (10 sec: 45863.8, 60 sec: 43415.7, 300 sec: 43098.3). Total num frames: 4338253824. Throughput: 0: 43246.2. Samples: 617158400. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-28 16:40:22,925][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:40:23,143][09423] Updated weights for policy 0, policy_version 264787 (0.0032) [2024-06-28 16:40:27,106][09423] Updated weights for policy 0, policy_version 264797 (0.0038) [2024-06-28 16:40:27,921][09190] Fps is (10 sec: 42598.2, 60 sec: 43144.5, 300 sec: 43153.8). Total num frames: 4338450432. Throughput: 0: 43083.1. Samples: 617288600. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-28 16:40:27,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:40:30,450][09423] Updated weights for policy 0, policy_version 264807 (0.0033) [2024-06-28 16:40:32,921][09190] Fps is (10 sec: 39331.8, 60 sec: 42598.5, 300 sec: 43098.3). Total num frames: 4338647040. Throughput: 0: 43125.4. Samples: 617551640. Policy #0 lag: (min: 0.0, avg: 12.1, max: 22.0) [2024-06-28 16:40:32,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:40:34,976][09423] Updated weights for policy 0, policy_version 264817 (0.0030) [2024-06-28 16:40:37,921][09190] Fps is (10 sec: 44237.0, 60 sec: 43145.3, 300 sec: 43098.3). Total num frames: 4338892800. Throughput: 0: 43177.3. Samples: 617800400. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 16:40:37,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:40:38,438][09423] Updated weights for policy 0, policy_version 264827 (0.0022) [2024-06-28 16:40:42,249][09423] Updated weights for policy 0, policy_version 264837 (0.0024) [2024-06-28 16:40:42,921][09190] Fps is (10 sec: 45874.8, 60 sec: 43417.6, 300 sec: 43264.9). Total num frames: 4339105792. Throughput: 0: 43235.1. Samples: 617939180. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 16:40:42,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:40:45,936][09423] Updated weights for policy 0, policy_version 264847 (0.0034) [2024-06-28 16:40:47,921][09190] Fps is (10 sec: 42598.6, 60 sec: 43417.6, 300 sec: 43098.3). Total num frames: 4339318784. Throughput: 0: 43080.0. Samples: 618192680. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 16:40:47,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 16:40:49,896][09423] Updated weights for policy 0, policy_version 264857 (0.0031) [2024-06-28 16:40:52,921][09190] Fps is (10 sec: 42598.2, 60 sec: 42871.4, 300 sec: 43153.8). Total num frames: 4339531776. Throughput: 0: 43177.7. Samples: 618454200. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 16:40:52,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 16:40:53,779][09423] Updated weights for policy 0, policy_version 264867 (0.0036) [2024-06-28 16:40:57,293][09423] Updated weights for policy 0, policy_version 264877 (0.0041) [2024-06-28 16:40:57,921][09190] Fps is (10 sec: 44236.8, 60 sec: 43417.6, 300 sec: 43209.3). Total num frames: 4339761152. Throughput: 0: 43122.2. Samples: 618582480. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 16:40:57,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 16:41:01,159][09423] Updated weights for policy 0, policy_version 264887 (0.0038) [2024-06-28 16:41:02,922][09190] Fps is (10 sec: 44236.5, 60 sec: 43417.5, 300 sec: 43209.3). Total num frames: 4339974144. Throughput: 0: 43257.2. Samples: 618847400. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 16:41:02,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 16:41:05,127][09423] Updated weights for policy 0, policy_version 264897 (0.0035) [2024-06-28 16:41:07,921][09190] Fps is (10 sec: 42598.1, 60 sec: 43144.5, 300 sec: 43098.2). Total num frames: 4340187136. Throughput: 0: 43323.7. Samples: 619107860. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 16:41:07,925][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 16:41:08,503][09423] Updated weights for policy 0, policy_version 264907 (0.0030) [2024-06-28 16:41:12,399][09423] Updated weights for policy 0, policy_version 264917 (0.0026) [2024-06-28 16:41:12,924][09190] Fps is (10 sec: 42588.3, 60 sec: 43415.8, 300 sec: 43209.0). Total num frames: 4340400128. Throughput: 0: 43351.9. Samples: 619239540. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 16:41:12,924][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 16:41:16,225][09423] Updated weights for policy 0, policy_version 264927 (0.0032) [2024-06-28 16:41:17,922][09190] Fps is (10 sec: 42598.1, 60 sec: 43144.5, 300 sec: 43153.8). Total num frames: 4340613120. Throughput: 0: 43132.7. Samples: 619492620. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 16:41:17,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:41:20,023][09423] Updated weights for policy 0, policy_version 264937 (0.0026) [2024-06-28 16:41:22,921][09190] Fps is (10 sec: 44247.6, 60 sec: 43146.3, 300 sec: 43154.1). Total num frames: 4340842496. Throughput: 0: 43445.8. Samples: 619755460. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 16:41:22,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:41:23,933][09423] Updated weights for policy 0, policy_version 264947 (0.0031) [2024-06-28 16:41:27,451][09423] Updated weights for policy 0, policy_version 264957 (0.0027) [2024-06-28 16:41:27,921][09190] Fps is (10 sec: 44237.7, 60 sec: 43417.7, 300 sec: 43209.7). Total num frames: 4341055488. Throughput: 0: 43337.0. Samples: 619889340. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 16:41:27,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:41:31,393][09423] Updated weights for policy 0, policy_version 264967 (0.0030) [2024-06-28 16:41:32,921][09190] Fps is (10 sec: 44237.3, 60 sec: 43963.7, 300 sec: 43209.3). Total num frames: 4341284864. Throughput: 0: 43495.1. Samples: 620149960. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 16:41:32,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:41:34,922][09423] Updated weights for policy 0, policy_version 264977 (0.0035) [2024-06-28 16:41:37,921][09190] Fps is (10 sec: 42598.4, 60 sec: 43144.6, 300 sec: 43153.8). Total num frames: 4341481472. Throughput: 0: 43546.3. Samples: 620413780. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 16:41:37,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:41:39,079][09423] Updated weights for policy 0, policy_version 264987 (0.0027) [2024-06-28 16:41:42,778][09423] Updated weights for policy 0, policy_version 264997 (0.0043) [2024-06-28 16:41:42,921][09190] Fps is (10 sec: 42598.0, 60 sec: 43417.6, 300 sec: 43209.3). Total num frames: 4341710848. Throughput: 0: 43490.2. Samples: 620539540. Policy #0 lag: (min: 0.0, avg: 9.6, max: 23.0) [2024-06-28 16:41:42,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:41:46,471][09423] Updated weights for policy 0, policy_version 265007 (0.0033) [2024-06-28 16:41:47,921][09190] Fps is (10 sec: 44236.4, 60 sec: 43417.6, 300 sec: 43153.8). Total num frames: 4341923840. Throughput: 0: 43212.1. Samples: 620791940. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 16:41:47,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:41:50,218][09423] Updated weights for policy 0, policy_version 265017 (0.0031) [2024-06-28 16:41:52,922][09190] Fps is (10 sec: 42598.1, 60 sec: 43417.6, 300 sec: 43209.3). Total num frames: 4342136832. Throughput: 0: 43304.4. Samples: 621056560. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 16:41:52,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 16:41:53,769][09403] Signal inference workers to stop experience collection... (8550 times) [2024-06-28 16:41:53,771][09403] Signal inference workers to resume experience collection... (8550 times) [2024-06-28 16:41:53,801][09423] InferenceWorker_p0-w0: stopping experience collection (8550 times) [2024-06-28 16:41:53,801][09423] InferenceWorker_p0-w0: resuming experience collection (8550 times) [2024-06-28 16:41:53,913][09423] Updated weights for policy 0, policy_version 265027 (0.0038) [2024-06-28 16:41:57,921][09190] Fps is (10 sec: 42598.7, 60 sec: 43144.6, 300 sec: 43209.3). Total num frames: 4342349824. Throughput: 0: 43220.2. Samples: 621184340. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 16:41:57,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 16:41:58,027][09423] Updated weights for policy 0, policy_version 265037 (0.0038) [2024-06-28 16:42:01,809][09423] Updated weights for policy 0, policy_version 265047 (0.0037) [2024-06-28 16:42:02,921][09190] Fps is (10 sec: 42598.7, 60 sec: 43144.6, 300 sec: 43153.8). Total num frames: 4342562816. Throughput: 0: 43459.6. Samples: 621448300. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 16:42:02,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 16:42:05,707][09423] Updated weights for policy 0, policy_version 265057 (0.0040) [2024-06-28 16:42:07,921][09190] Fps is (10 sec: 44236.8, 60 sec: 43417.7, 300 sec: 43209.3). Total num frames: 4342792192. Throughput: 0: 43314.8. Samples: 621704620. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 16:42:07,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 16:42:09,356][09423] Updated weights for policy 0, policy_version 265067 (0.0049) [2024-06-28 16:42:12,924][09190] Fps is (10 sec: 44225.9, 60 sec: 43417.6, 300 sec: 43264.5). Total num frames: 4343005184. Throughput: 0: 43335.7. Samples: 621839560. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 16:42:12,924][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 16:42:13,058][09423] Updated weights for policy 0, policy_version 265077 (0.0034) [2024-06-28 16:42:16,749][09423] Updated weights for policy 0, policy_version 265087 (0.0034) [2024-06-28 16:42:17,921][09190] Fps is (10 sec: 44236.4, 60 sec: 43690.7, 300 sec: 43209.3). Total num frames: 4343234560. Throughput: 0: 43393.2. Samples: 622102660. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 16:42:17,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:42:17,937][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000265090_4343234560.pth... [2024-06-28 16:42:18,000][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000264456_4332847104.pth [2024-06-28 16:42:20,410][09423] Updated weights for policy 0, policy_version 265097 (0.0025) [2024-06-28 16:42:22,922][09190] Fps is (10 sec: 44247.3, 60 sec: 43417.5, 300 sec: 43209.3). Total num frames: 4343447552. Throughput: 0: 43202.5. Samples: 622357900. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 16:42:22,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:42:24,474][09423] Updated weights for policy 0, policy_version 265107 (0.0041) [2024-06-28 16:42:27,921][09190] Fps is (10 sec: 42598.8, 60 sec: 43417.6, 300 sec: 43264.9). Total num frames: 4343660544. Throughput: 0: 43343.2. Samples: 622489980. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 16:42:27,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 16:42:28,002][09423] Updated weights for policy 0, policy_version 265117 (0.0033) [2024-06-28 16:42:31,731][09423] Updated weights for policy 0, policy_version 265127 (0.0032) [2024-06-28 16:42:32,921][09190] Fps is (10 sec: 42599.2, 60 sec: 43144.5, 300 sec: 43209.5). Total num frames: 4343873536. Throughput: 0: 43502.3. Samples: 622749540. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 16:42:32,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 16:42:35,420][09423] Updated weights for policy 0, policy_version 265137 (0.0032) [2024-06-28 16:42:37,921][09190] Fps is (10 sec: 44236.2, 60 sec: 43690.6, 300 sec: 43320.4). Total num frames: 4344102912. Throughput: 0: 43312.0. Samples: 623005600. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 16:42:37,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 16:42:39,662][09423] Updated weights for policy 0, policy_version 265147 (0.0027) [2024-06-28 16:42:42,923][09190] Fps is (10 sec: 42591.5, 60 sec: 43143.4, 300 sec: 43264.6). Total num frames: 4344299520. Throughput: 0: 43510.0. Samples: 623142360. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 16:42:42,924][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 16:42:43,388][09423] Updated weights for policy 0, policy_version 265157 (0.0034) [2024-06-28 16:42:46,958][09423] Updated weights for policy 0, policy_version 265167 (0.0036) [2024-06-28 16:42:47,922][09190] Fps is (10 sec: 40959.9, 60 sec: 43144.5, 300 sec: 43209.3). Total num frames: 4344512512. Throughput: 0: 43140.8. Samples: 623389640. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 16:42:47,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:42:50,899][09423] Updated weights for policy 0, policy_version 265177 (0.0028) [2024-06-28 16:42:52,921][09190] Fps is (10 sec: 44243.7, 60 sec: 43417.7, 300 sec: 43264.9). Total num frames: 4344741888. Throughput: 0: 43270.6. Samples: 623651800. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 16:42:52,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 16:42:54,893][09423] Updated weights for policy 0, policy_version 265187 (0.0031) [2024-06-28 16:42:57,921][09190] Fps is (10 sec: 44236.9, 60 sec: 43417.5, 300 sec: 43320.4). Total num frames: 4344954880. Throughput: 0: 43412.6. Samples: 623793020. Policy #0 lag: (min: 0.0, avg: 10.7, max: 22.0) [2024-06-28 16:42:57,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 16:42:58,453][09423] Updated weights for policy 0, policy_version 265197 (0.0041) [2024-06-28 16:43:02,223][09423] Updated weights for policy 0, policy_version 265207 (0.0029) [2024-06-28 16:43:02,921][09190] Fps is (10 sec: 42598.2, 60 sec: 43417.6, 300 sec: 43209.3). Total num frames: 4345167872. Throughput: 0: 43237.3. Samples: 624048340. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 16:43:02,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:43:06,058][09423] Updated weights for policy 0, policy_version 265217 (0.0022) [2024-06-28 16:43:07,922][09190] Fps is (10 sec: 44235.7, 60 sec: 43417.3, 300 sec: 43320.4). Total num frames: 4345397248. Throughput: 0: 43128.7. Samples: 624298700. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 16:43:07,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:43:09,609][09423] Updated weights for policy 0, policy_version 265227 (0.0024) [2024-06-28 16:43:12,696][09403] Signal inference workers to stop experience collection... (8600 times) [2024-06-28 16:43:12,696][09403] Signal inference workers to resume experience collection... (8600 times) [2024-06-28 16:43:12,738][09423] InferenceWorker_p0-w0: stopping experience collection (8600 times) [2024-06-28 16:43:12,739][09423] InferenceWorker_p0-w0: resuming experience collection (8600 times) [2024-06-28 16:43:12,921][09190] Fps is (10 sec: 44237.0, 60 sec: 43419.4, 300 sec: 43320.4). Total num frames: 4345610240. Throughput: 0: 43251.9. Samples: 624436320. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 16:43:12,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 16:43:13,476][09423] Updated weights for policy 0, policy_version 265237 (0.0041) [2024-06-28 16:43:17,398][09423] Updated weights for policy 0, policy_version 265247 (0.0033) [2024-06-28 16:43:17,921][09190] Fps is (10 sec: 40961.3, 60 sec: 42871.5, 300 sec: 43153.8). Total num frames: 4345806848. Throughput: 0: 43313.7. Samples: 624698660. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 16:43:17,926][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 16:43:21,415][09423] Updated weights for policy 0, policy_version 265257 (0.0032) [2024-06-28 16:43:22,921][09190] Fps is (10 sec: 44236.5, 60 sec: 43417.6, 300 sec: 43320.4). Total num frames: 4346052608. Throughput: 0: 43178.2. Samples: 624948620. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 16:43:22,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:43:25,369][09423] Updated weights for policy 0, policy_version 265267 (0.0029) [2024-06-28 16:43:27,921][09190] Fps is (10 sec: 44237.1, 60 sec: 43144.5, 300 sec: 43264.9). Total num frames: 4346249216. Throughput: 0: 43159.3. Samples: 625084460. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 16:43:27,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:43:28,534][09423] Updated weights for policy 0, policy_version 265277 (0.0030) [2024-06-28 16:43:32,706][09423] Updated weights for policy 0, policy_version 265287 (0.0041) [2024-06-28 16:43:32,922][09190] Fps is (10 sec: 40959.9, 60 sec: 43144.4, 300 sec: 43209.3). Total num frames: 4346462208. Throughput: 0: 43458.2. Samples: 625345260. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 16:43:32,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:43:36,028][09423] Updated weights for policy 0, policy_version 265297 (0.0041) [2024-06-28 16:43:37,921][09190] Fps is (10 sec: 44236.4, 60 sec: 43144.6, 300 sec: 43264.9). Total num frames: 4346691584. Throughput: 0: 43390.6. Samples: 625604380. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 16:43:37,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 16:43:40,159][09423] Updated weights for policy 0, policy_version 265307 (0.0025) [2024-06-28 16:43:42,921][09190] Fps is (10 sec: 44237.2, 60 sec: 43418.7, 300 sec: 43264.9). Total num frames: 4346904576. Throughput: 0: 43280.5. Samples: 625740640. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 16:43:42,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 16:43:43,581][09423] Updated weights for policy 0, policy_version 265317 (0.0036) [2024-06-28 16:43:47,831][09423] Updated weights for policy 0, policy_version 265327 (0.0030) [2024-06-28 16:43:47,921][09190] Fps is (10 sec: 42598.4, 60 sec: 43417.7, 300 sec: 43209.3). Total num frames: 4347117568. Throughput: 0: 43368.5. Samples: 625999920. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 16:43:47,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:43:51,331][09423] Updated weights for policy 0, policy_version 265337 (0.0032) [2024-06-28 16:43:52,921][09190] Fps is (10 sec: 42598.8, 60 sec: 43144.6, 300 sec: 43320.4). Total num frames: 4347330560. Throughput: 0: 43337.7. Samples: 626248880. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 16:43:52,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:43:55,834][09423] Updated weights for policy 0, policy_version 265347 (0.0042) [2024-06-28 16:43:57,921][09190] Fps is (10 sec: 42598.7, 60 sec: 43144.6, 300 sec: 43265.2). Total num frames: 4347543552. Throughput: 0: 43162.7. Samples: 626378640. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 16:43:57,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 16:43:58,978][09423] Updated weights for policy 0, policy_version 265357 (0.0027) [2024-06-28 16:44:02,921][09190] Fps is (10 sec: 40959.6, 60 sec: 42871.5, 300 sec: 43210.2). Total num frames: 4347740160. Throughput: 0: 43048.0. Samples: 626635820. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 16:44:02,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 16:44:03,369][09423] Updated weights for policy 0, policy_version 265367 (0.0044) [2024-06-28 16:44:06,437][09423] Updated weights for policy 0, policy_version 265377 (0.0030) [2024-06-28 16:44:07,921][09190] Fps is (10 sec: 44236.3, 60 sec: 43144.7, 300 sec: 43320.4). Total num frames: 4347985920. Throughput: 0: 43084.9. Samples: 626887440. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 16:44:07,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:44:10,944][09423] Updated weights for policy 0, policy_version 265387 (0.0034) [2024-06-28 16:44:12,921][09190] Fps is (10 sec: 44236.6, 60 sec: 42871.4, 300 sec: 43153.8). Total num frames: 4348182528. Throughput: 0: 43178.1. Samples: 627027480. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 16:44:12,922][09190] Avg episode reward: [(0, '0.705')] [2024-06-28 16:44:14,242][09423] Updated weights for policy 0, policy_version 265397 (0.0027) [2024-06-28 16:44:17,921][09190] Fps is (10 sec: 40960.1, 60 sec: 43144.5, 300 sec: 43209.3). Total num frames: 4348395520. Throughput: 0: 43068.1. Samples: 627283320. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 16:44:17,922][09190] Avg episode reward: [(0, '0.704')] [2024-06-28 16:44:18,059][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000265406_4348411904.pth... [2024-06-28 16:44:18,103][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000264772_4338024448.pth [2024-06-28 16:44:18,386][09423] Updated weights for policy 0, policy_version 265407 (0.0023) [2024-06-28 16:44:21,708][09423] Updated weights for policy 0, policy_version 265417 (0.0036) [2024-06-28 16:44:22,921][09190] Fps is (10 sec: 45875.4, 60 sec: 43144.6, 300 sec: 43320.4). Total num frames: 4348641280. Throughput: 0: 42948.9. Samples: 627537080. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 16:44:22,922][09190] Avg episode reward: [(0, '0.738')] [2024-06-28 16:44:25,963][09423] Updated weights for policy 0, policy_version 265427 (0.0037) [2024-06-28 16:44:26,628][09403] Signal inference workers to stop experience collection... (8650 times) [2024-06-28 16:44:26,677][09423] InferenceWorker_p0-w0: stopping experience collection (8650 times) [2024-06-28 16:44:26,688][09403] Signal inference workers to resume experience collection... (8650 times) [2024-06-28 16:44:26,695][09423] InferenceWorker_p0-w0: resuming experience collection (8650 times) [2024-06-28 16:44:27,921][09190] Fps is (10 sec: 45875.2, 60 sec: 43417.5, 300 sec: 43264.9). Total num frames: 4348854272. Throughput: 0: 43008.9. Samples: 627676040. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 16:44:27,925][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 16:44:29,364][09423] Updated weights for policy 0, policy_version 265437 (0.0041) [2024-06-28 16:44:32,921][09190] Fps is (10 sec: 40960.3, 60 sec: 43144.7, 300 sec: 43209.5). Total num frames: 4349050880. Throughput: 0: 42965.9. Samples: 627933380. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 16:44:32,922][09190] Avg episode reward: [(0, '0.735')] [2024-06-28 16:44:33,737][09423] Updated weights for policy 0, policy_version 265447 (0.0038) [2024-06-28 16:44:36,981][09423] Updated weights for policy 0, policy_version 265457 (0.0025) [2024-06-28 16:44:37,921][09190] Fps is (10 sec: 42598.5, 60 sec: 43144.5, 300 sec: 43320.4). Total num frames: 4349280256. Throughput: 0: 43097.7. Samples: 628188280. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 16:44:37,922][09190] Avg episode reward: [(0, '0.725')] [2024-06-28 16:44:41,176][09423] Updated weights for policy 0, policy_version 265467 (0.0043) [2024-06-28 16:44:42,922][09190] Fps is (10 sec: 44235.9, 60 sec: 43144.4, 300 sec: 43320.4). Total num frames: 4349493248. Throughput: 0: 43158.9. Samples: 628320800. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 16:44:42,922][09190] Avg episode reward: [(0, '0.735')] [2024-06-28 16:44:44,505][09423] Updated weights for policy 0, policy_version 265477 (0.0030) [2024-06-28 16:44:47,921][09190] Fps is (10 sec: 42598.5, 60 sec: 43144.6, 300 sec: 43209.3). Total num frames: 4349706240. Throughput: 0: 43096.9. Samples: 628575180. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 16:44:47,922][09190] Avg episode reward: [(0, '0.728')] [2024-06-28 16:44:48,981][09423] Updated weights for policy 0, policy_version 265487 (0.0036) [2024-06-28 16:44:52,106][09423] Updated weights for policy 0, policy_version 265497 (0.0030) [2024-06-28 16:44:52,921][09190] Fps is (10 sec: 44237.1, 60 sec: 43417.5, 300 sec: 43320.4). Total num frames: 4349935616. Throughput: 0: 43253.8. Samples: 628833860. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 16:44:52,928][09190] Avg episode reward: [(0, '0.731')] [2024-06-28 16:44:56,385][09423] Updated weights for policy 0, policy_version 265507 (0.0042) [2024-06-28 16:44:57,921][09190] Fps is (10 sec: 42598.7, 60 sec: 43144.6, 300 sec: 43264.9). Total num frames: 4350132224. Throughput: 0: 43126.8. Samples: 628968180. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 16:44:57,922][09190] Avg episode reward: [(0, '0.728')] [2024-06-28 16:44:59,785][09423] Updated weights for policy 0, policy_version 265517 (0.0028) [2024-06-28 16:45:02,921][09190] Fps is (10 sec: 39322.0, 60 sec: 43144.6, 300 sec: 43153.8). Total num frames: 4350328832. Throughput: 0: 43150.7. Samples: 629225100. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 16:45:02,922][09190] Avg episode reward: [(0, '0.735')] [2024-06-28 16:45:03,912][09423] Updated weights for policy 0, policy_version 265527 (0.0027) [2024-06-28 16:45:07,238][09423] Updated weights for policy 0, policy_version 265537 (0.0035) [2024-06-28 16:45:07,921][09190] Fps is (10 sec: 44236.5, 60 sec: 43144.6, 300 sec: 43320.4). Total num frames: 4350574592. Throughput: 0: 43348.4. Samples: 629487760. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 16:45:07,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 16:45:11,603][09423] Updated weights for policy 0, policy_version 265547 (0.0038) [2024-06-28 16:45:12,922][09190] Fps is (10 sec: 45874.5, 60 sec: 43417.6, 300 sec: 43264.9). Total num frames: 4350787584. Throughput: 0: 43223.0. Samples: 629621080. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 16:45:12,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 16:45:14,933][09423] Updated weights for policy 0, policy_version 265557 (0.0040) [2024-06-28 16:45:17,921][09190] Fps is (10 sec: 40959.9, 60 sec: 43144.5, 300 sec: 43154.1). Total num frames: 4350984192. Throughput: 0: 43036.3. Samples: 629870020. Policy #0 lag: (min: 0.0, avg: 10.4, max: 23.0) [2024-06-28 16:45:17,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 16:45:18,962][09423] Updated weights for policy 0, policy_version 265567 (0.0032) [2024-06-28 16:45:22,385][09423] Updated weights for policy 0, policy_version 265577 (0.0021) [2024-06-28 16:45:22,921][09190] Fps is (10 sec: 44237.7, 60 sec: 43144.6, 300 sec: 43320.4). Total num frames: 4351229952. Throughput: 0: 43235.2. Samples: 630133860. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 16:45:22,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 16:45:26,422][09423] Updated weights for policy 0, policy_version 265587 (0.0041) [2024-06-28 16:45:27,921][09190] Fps is (10 sec: 42598.8, 60 sec: 42598.5, 300 sec: 43264.9). Total num frames: 4351410176. Throughput: 0: 43138.4. Samples: 630262020. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 16:45:27,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 16:45:30,175][09423] Updated weights for policy 0, policy_version 265597 (0.0040) [2024-06-28 16:45:32,921][09190] Fps is (10 sec: 42598.3, 60 sec: 43417.6, 300 sec: 43264.9). Total num frames: 4351655936. Throughput: 0: 43178.7. Samples: 630518220. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 16:45:32,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:45:33,979][09423] Updated weights for policy 0, policy_version 265607 (0.0042) [2024-06-28 16:45:37,633][09423] Updated weights for policy 0, policy_version 265617 (0.0024) [2024-06-28 16:45:37,921][09190] Fps is (10 sec: 45874.6, 60 sec: 43144.5, 300 sec: 43264.9). Total num frames: 4351868928. Throughput: 0: 43311.6. Samples: 630782880. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 16:45:37,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 16:45:41,534][09423] Updated weights for policy 0, policy_version 265627 (0.0034) [2024-06-28 16:45:42,921][09190] Fps is (10 sec: 42598.1, 60 sec: 43144.6, 300 sec: 43264.9). Total num frames: 4352081920. Throughput: 0: 43223.5. Samples: 630913240. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 16:45:42,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:45:45,342][09423] Updated weights for policy 0, policy_version 265637 (0.0040) [2024-06-28 16:45:47,922][09190] Fps is (10 sec: 44236.5, 60 sec: 43417.5, 300 sec: 43320.4). Total num frames: 4352311296. Throughput: 0: 43278.5. Samples: 631172640. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 16:45:47,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 16:45:49,187][09423] Updated weights for policy 0, policy_version 265647 (0.0030) [2024-06-28 16:45:52,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42598.5, 300 sec: 43153.8). Total num frames: 4352491520. Throughput: 0: 43044.1. Samples: 631424740. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 16:45:52,922][09190] Avg episode reward: [(0, '0.735')] [2024-06-28 16:45:53,212][09423] Updated weights for policy 0, policy_version 265657 (0.0026) [2024-06-28 16:45:56,939][09423] Updated weights for policy 0, policy_version 265667 (0.0040) [2024-06-28 16:45:57,921][09190] Fps is (10 sec: 39321.9, 60 sec: 42871.4, 300 sec: 43153.8). Total num frames: 4352704512. Throughput: 0: 42838.7. Samples: 631548820. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 16:45:57,922][09190] Avg episode reward: [(0, '0.734')] [2024-06-28 16:46:00,546][09423] Updated weights for policy 0, policy_version 265677 (0.0034) [2024-06-28 16:46:02,921][09190] Fps is (10 sec: 47513.3, 60 sec: 43963.7, 300 sec: 43320.4). Total num frames: 4352966656. Throughput: 0: 43128.9. Samples: 631810820. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 16:46:02,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 16:46:04,301][09423] Updated weights for policy 0, policy_version 265687 (0.0043) [2024-06-28 16:46:07,921][09190] Fps is (10 sec: 44237.2, 60 sec: 42871.5, 300 sec: 43209.7). Total num frames: 4353146880. Throughput: 0: 43076.4. Samples: 632072300. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 16:46:07,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 16:46:08,186][09423] Updated weights for policy 0, policy_version 265697 (0.0030) [2024-06-28 16:46:11,989][09423] Updated weights for policy 0, policy_version 265707 (0.0024) [2024-06-28 16:46:12,922][09190] Fps is (10 sec: 37682.6, 60 sec: 42598.4, 300 sec: 43153.8). Total num frames: 4353343488. Throughput: 0: 42993.6. Samples: 632196740. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 16:46:12,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 16:46:16,179][09403] Signal inference workers to stop experience collection... (8700 times) [2024-06-28 16:46:16,184][09403] Signal inference workers to resume experience collection... (8700 times) [2024-06-28 16:46:16,193][09423] Updated weights for policy 0, policy_version 265717 (0.0030) [2024-06-28 16:46:16,225][09423] InferenceWorker_p0-w0: stopping experience collection (8700 times) [2024-06-28 16:46:16,225][09423] InferenceWorker_p0-w0: resuming experience collection (8700 times) [2024-06-28 16:46:17,921][09190] Fps is (10 sec: 44236.3, 60 sec: 43417.6, 300 sec: 43209.3). Total num frames: 4353589248. Throughput: 0: 43038.1. Samples: 632454940. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 16:46:17,922][09190] Avg episode reward: [(0, '0.738')] [2024-06-28 16:46:18,033][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000265723_4353605632.pth... [2024-06-28 16:46:18,093][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000265090_4343234560.pth [2024-06-28 16:46:19,814][09423] Updated weights for policy 0, policy_version 265727 (0.0033) [2024-06-28 16:46:22,921][09190] Fps is (10 sec: 45876.1, 60 sec: 42871.4, 300 sec: 43209.3). Total num frames: 4353802240. Throughput: 0: 42977.0. Samples: 632716840. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 16:46:22,922][09190] Avg episode reward: [(0, '0.735')] [2024-06-28 16:46:23,593][09423] Updated weights for policy 0, policy_version 265737 (0.0033) [2024-06-28 16:46:27,219][09423] Updated weights for policy 0, policy_version 265747 (0.0033) [2024-06-28 16:46:27,922][09190] Fps is (10 sec: 40957.6, 60 sec: 43144.0, 300 sec: 43098.1). Total num frames: 4353998848. Throughput: 0: 42920.3. Samples: 632844680. Policy #0 lag: (min: 0.0, avg: 8.9, max: 22.0) [2024-06-28 16:46:27,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 16:46:31,145][09423] Updated weights for policy 0, policy_version 265757 (0.0027) [2024-06-28 16:46:32,922][09190] Fps is (10 sec: 45871.4, 60 sec: 43417.0, 300 sec: 43320.3). Total num frames: 4354260992. Throughput: 0: 42906.4. Samples: 633103460. Policy #0 lag: (min: 0.0, avg: 11.4, max: 24.0) [2024-06-28 16:46:32,923][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 16:46:34,693][09423] Updated weights for policy 0, policy_version 265767 (0.0039) [2024-06-28 16:46:37,921][09190] Fps is (10 sec: 42601.5, 60 sec: 42598.5, 300 sec: 43098.3). Total num frames: 4354424832. Throughput: 0: 43224.4. Samples: 633369840. Policy #0 lag: (min: 0.0, avg: 11.4, max: 24.0) [2024-06-28 16:46:37,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 16:46:38,769][09423] Updated weights for policy 0, policy_version 265777 (0.0036) [2024-06-28 16:46:42,041][09423] Updated weights for policy 0, policy_version 265787 (0.0032) [2024-06-28 16:46:42,923][09190] Fps is (10 sec: 39319.5, 60 sec: 42870.5, 300 sec: 43153.6). Total num frames: 4354654208. Throughput: 0: 43202.3. Samples: 633492980. Policy #0 lag: (min: 0.0, avg: 11.4, max: 24.0) [2024-06-28 16:46:42,923][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 16:46:46,205][09423] Updated weights for policy 0, policy_version 265797 (0.0032) [2024-06-28 16:46:47,921][09190] Fps is (10 sec: 47513.6, 60 sec: 43144.7, 300 sec: 43264.9). Total num frames: 4354899968. Throughput: 0: 43277.4. Samples: 633758300. Policy #0 lag: (min: 0.0, avg: 11.4, max: 24.0) [2024-06-28 16:46:47,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 16:46:49,773][09423] Updated weights for policy 0, policy_version 265807 (0.0039) [2024-06-28 16:46:52,921][09190] Fps is (10 sec: 44243.0, 60 sec: 43417.6, 300 sec: 43209.3). Total num frames: 4355096576. Throughput: 0: 43373.8. Samples: 634024120. Policy #0 lag: (min: 0.0, avg: 11.4, max: 24.0) [2024-06-28 16:46:52,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 16:46:53,841][09423] Updated weights for policy 0, policy_version 265817 (0.0031) [2024-06-28 16:46:57,205][09423] Updated weights for policy 0, policy_version 265827 (0.0041) [2024-06-28 16:46:57,921][09190] Fps is (10 sec: 40959.6, 60 sec: 43417.6, 300 sec: 43209.3). Total num frames: 4355309568. Throughput: 0: 43274.8. Samples: 634144100. Policy #0 lag: (min: 0.0, avg: 11.4, max: 24.0) [2024-06-28 16:46:57,922][09190] Avg episode reward: [(0, '0.732')] [2024-06-28 16:47:01,374][09423] Updated weights for policy 0, policy_version 265837 (0.0022) [2024-06-28 16:47:02,921][09190] Fps is (10 sec: 45875.0, 60 sec: 43144.5, 300 sec: 43264.9). Total num frames: 4355555328. Throughput: 0: 43209.8. Samples: 634399380. Policy #0 lag: (min: 0.0, avg: 11.4, max: 24.0) [2024-06-28 16:47:02,922][09190] Avg episode reward: [(0, '0.735')] [2024-06-28 16:47:05,191][09423] Updated weights for policy 0, policy_version 265847 (0.0035) [2024-06-28 16:47:07,921][09190] Fps is (10 sec: 42598.3, 60 sec: 43144.5, 300 sec: 43154.1). Total num frames: 4355735552. Throughput: 0: 43298.6. Samples: 634665280. Policy #0 lag: (min: 0.0, avg: 11.4, max: 24.0) [2024-06-28 16:47:07,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 16:47:09,275][09423] Updated weights for policy 0, policy_version 265857 (0.0050) [2024-06-28 16:47:12,534][09423] Updated weights for policy 0, policy_version 265867 (0.0029) [2024-06-28 16:47:12,921][09190] Fps is (10 sec: 40960.2, 60 sec: 43690.8, 300 sec: 43153.8). Total num frames: 4355964928. Throughput: 0: 43146.9. Samples: 634786260. Policy #0 lag: (min: 0.0, avg: 11.4, max: 24.0) [2024-06-28 16:47:12,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 16:47:16,845][09423] Updated weights for policy 0, policy_version 265877 (0.0040) [2024-06-28 16:47:17,921][09190] Fps is (10 sec: 44237.3, 60 sec: 43144.6, 300 sec: 43153.8). Total num frames: 4356177920. Throughput: 0: 43188.4. Samples: 635046900. Policy #0 lag: (min: 0.0, avg: 11.4, max: 24.0) [2024-06-28 16:47:17,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:47:20,011][09423] Updated weights for policy 0, policy_version 265887 (0.0029) [2024-06-28 16:47:22,921][09190] Fps is (10 sec: 39321.7, 60 sec: 42598.4, 300 sec: 43042.7). Total num frames: 4356358144. Throughput: 0: 43182.2. Samples: 635313040. Policy #0 lag: (min: 0.0, avg: 11.4, max: 24.0) [2024-06-28 16:47:22,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:47:24,311][09423] Updated weights for policy 0, policy_version 265897 (0.0025) [2024-06-28 16:47:27,407][09423] Updated weights for policy 0, policy_version 265907 (0.0030) [2024-06-28 16:47:27,924][09190] Fps is (10 sec: 44225.4, 60 sec: 43689.3, 300 sec: 43209.0). Total num frames: 4356620288. Throughput: 0: 43068.2. Samples: 635431100. Policy #0 lag: (min: 0.0, avg: 11.4, max: 24.0) [2024-06-28 16:47:27,924][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 16:47:31,985][09423] Updated weights for policy 0, policy_version 265917 (0.0032) [2024-06-28 16:47:32,921][09190] Fps is (10 sec: 47513.3, 60 sec: 42872.1, 300 sec: 43153.8). Total num frames: 4356833280. Throughput: 0: 43142.6. Samples: 635699720. Policy #0 lag: (min: 0.0, avg: 11.4, max: 24.0) [2024-06-28 16:47:32,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:47:35,078][09423] Updated weights for policy 0, policy_version 265927 (0.0027) [2024-06-28 16:47:37,921][09190] Fps is (10 sec: 40970.2, 60 sec: 43417.5, 300 sec: 43154.0). Total num frames: 4357029888. Throughput: 0: 43111.5. Samples: 635964140. Policy #0 lag: (min: 0.0, avg: 11.4, max: 24.0) [2024-06-28 16:47:37,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 16:47:39,414][09423] Updated weights for policy 0, policy_version 265937 (0.0034) [2024-06-28 16:47:42,921][09190] Fps is (10 sec: 42598.3, 60 sec: 43418.6, 300 sec: 43209.3). Total num frames: 4357259264. Throughput: 0: 43143.1. Samples: 636085540. Policy #0 lag: (min: 0.0, avg: 11.4, max: 24.0) [2024-06-28 16:47:42,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 16:47:43,075][09423] Updated weights for policy 0, policy_version 265947 (0.0034) [2024-06-28 16:47:46,351][09403] Signal inference workers to stop experience collection... (8750 times) [2024-06-28 16:47:46,352][09403] Signal inference workers to resume experience collection... (8750 times) [2024-06-28 16:47:46,393][09423] InferenceWorker_p0-w0: stopping experience collection (8750 times) [2024-06-28 16:47:46,393][09423] InferenceWorker_p0-w0: resuming experience collection (8750 times) [2024-06-28 16:47:47,299][09423] Updated weights for policy 0, policy_version 265957 (0.0043) [2024-06-28 16:47:47,921][09190] Fps is (10 sec: 45875.1, 60 sec: 43144.4, 300 sec: 43209.3). Total num frames: 4357488640. Throughput: 0: 43408.8. Samples: 636352780. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 16:47:47,925][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:47:50,438][09423] Updated weights for policy 0, policy_version 265967 (0.0029) [2024-06-28 16:47:52,926][09190] Fps is (10 sec: 40942.1, 60 sec: 42868.3, 300 sec: 43097.6). Total num frames: 4357668864. Throughput: 0: 43246.5. Samples: 636611560. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 16:47:52,926][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 16:47:54,827][09423] Updated weights for policy 0, policy_version 265977 (0.0032) [2024-06-28 16:47:57,871][09423] Updated weights for policy 0, policy_version 265987 (0.0032) [2024-06-28 16:47:57,921][09190] Fps is (10 sec: 44236.8, 60 sec: 43690.7, 300 sec: 43264.9). Total num frames: 4357931008. Throughput: 0: 43246.6. Samples: 636732360. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 16:47:57,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 16:48:02,175][09423] Updated weights for policy 0, policy_version 265997 (0.0031) [2024-06-28 16:48:02,925][09190] Fps is (10 sec: 47519.0, 60 sec: 43142.2, 300 sec: 43208.9). Total num frames: 4358144000. Throughput: 0: 43458.1. Samples: 637002660. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 16:48:02,925][09190] Avg episode reward: [(0, '0.734')] [2024-06-28 16:48:05,252][09423] Updated weights for policy 0, policy_version 266007 (0.0030) [2024-06-28 16:48:07,921][09190] Fps is (10 sec: 37683.3, 60 sec: 42871.5, 300 sec: 43042.7). Total num frames: 4358307840. Throughput: 0: 43243.0. Samples: 637258980. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 16:48:07,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:48:10,139][09423] Updated weights for policy 0, policy_version 266017 (0.0031) [2024-06-28 16:48:12,921][09190] Fps is (10 sec: 42612.4, 60 sec: 43417.6, 300 sec: 43264.9). Total num frames: 4358569984. Throughput: 0: 43238.0. Samples: 637376700. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 16:48:12,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:48:13,558][09423] Updated weights for policy 0, policy_version 266027 (0.0038) [2024-06-28 16:48:17,577][09423] Updated weights for policy 0, policy_version 266037 (0.0032) [2024-06-28 16:48:17,921][09190] Fps is (10 sec: 47513.7, 60 sec: 43417.5, 300 sec: 43153.8). Total num frames: 4358782976. Throughput: 0: 43293.8. Samples: 637647940. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 16:48:17,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 16:48:18,019][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000266040_4358799360.pth... [2024-06-28 16:48:18,069][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000265406_4348411904.pth [2024-06-28 16:48:20,951][09423] Updated weights for policy 0, policy_version 266047 (0.0021) [2024-06-28 16:48:22,921][09190] Fps is (10 sec: 39321.7, 60 sec: 43417.6, 300 sec: 43098.3). Total num frames: 4358963200. Throughput: 0: 43088.1. Samples: 637903100. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 16:48:22,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 16:48:25,099][09423] Updated weights for policy 0, policy_version 266057 (0.0036) [2024-06-28 16:48:27,921][09190] Fps is (10 sec: 42598.7, 60 sec: 43146.4, 300 sec: 43209.4). Total num frames: 4359208960. Throughput: 0: 43202.3. Samples: 638029640. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 16:48:27,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:48:28,627][09423] Updated weights for policy 0, policy_version 266067 (0.0026) [2024-06-28 16:48:32,717][09423] Updated weights for policy 0, policy_version 266077 (0.0033) [2024-06-28 16:48:32,921][09190] Fps is (10 sec: 44236.8, 60 sec: 42871.5, 300 sec: 43098.3). Total num frames: 4359405568. Throughput: 0: 43089.0. Samples: 638291780. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 16:48:32,922][09190] Avg episode reward: [(0, '0.734')] [2024-06-28 16:48:36,113][09423] Updated weights for policy 0, policy_version 266087 (0.0033) [2024-06-28 16:48:37,921][09190] Fps is (10 sec: 39321.3, 60 sec: 42871.5, 300 sec: 43042.7). Total num frames: 4359602176. Throughput: 0: 43081.5. Samples: 638550040. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 16:48:37,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 16:48:40,361][09423] Updated weights for policy 0, policy_version 266097 (0.0033) [2024-06-28 16:48:42,921][09190] Fps is (10 sec: 45875.0, 60 sec: 43417.6, 300 sec: 43209.3). Total num frames: 4359864320. Throughput: 0: 43169.4. Samples: 638674980. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 16:48:42,922][09190] Avg episode reward: [(0, '0.738')] [2024-06-28 16:48:43,949][09423] Updated weights for policy 0, policy_version 266107 (0.0044) [2024-06-28 16:48:47,921][09190] Fps is (10 sec: 44236.8, 60 sec: 42598.4, 300 sec: 43098.2). Total num frames: 4360044544. Throughput: 0: 42957.8. Samples: 638935620. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 16:48:47,930][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 16:48:48,094][09423] Updated weights for policy 0, policy_version 266117 (0.0046) [2024-06-28 16:48:51,508][09423] Updated weights for policy 0, policy_version 266127 (0.0031) [2024-06-28 16:48:52,921][09190] Fps is (10 sec: 39321.8, 60 sec: 43147.7, 300 sec: 43098.3). Total num frames: 4360257536. Throughput: 0: 42987.2. Samples: 639193400. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 16:48:52,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:48:55,514][09423] Updated weights for policy 0, policy_version 266137 (0.0037) [2024-06-28 16:48:57,921][09190] Fps is (10 sec: 47513.4, 60 sec: 43144.5, 300 sec: 43320.4). Total num frames: 4360519680. Throughput: 0: 43274.1. Samples: 639324040. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 16:48:57,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 16:48:58,881][09423] Updated weights for policy 0, policy_version 266147 (0.0030) [2024-06-28 16:49:02,921][09190] Fps is (10 sec: 42597.9, 60 sec: 42327.6, 300 sec: 43042.7). Total num frames: 4360683520. Throughput: 0: 43102.2. Samples: 639587540. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 16:49:02,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 16:49:03,254][09423] Updated weights for policy 0, policy_version 266157 (0.0042) [2024-06-28 16:49:03,967][09403] Signal inference workers to stop experience collection... (8800 times) [2024-06-28 16:49:03,997][09423] InferenceWorker_p0-w0: stopping experience collection (8800 times) [2024-06-28 16:49:04,021][09403] Signal inference workers to resume experience collection... (8800 times) [2024-06-28 16:49:04,021][09423] InferenceWorker_p0-w0: resuming experience collection (8800 times) [2024-06-28 16:49:06,683][09423] Updated weights for policy 0, policy_version 266167 (0.0027) [2024-06-28 16:49:07,921][09190] Fps is (10 sec: 40960.1, 60 sec: 43690.7, 300 sec: 43209.3). Total num frames: 4360929280. Throughput: 0: 43091.5. Samples: 639842220. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 16:49:07,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 16:49:10,690][09423] Updated weights for policy 0, policy_version 266177 (0.0036) [2024-06-28 16:49:12,921][09190] Fps is (10 sec: 49151.9, 60 sec: 43417.5, 300 sec: 43320.4). Total num frames: 4361175040. Throughput: 0: 43166.1. Samples: 639972120. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 16:49:12,928][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 16:49:13,961][09423] Updated weights for policy 0, policy_version 266187 (0.0039) [2024-06-28 16:49:17,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42871.4, 300 sec: 43098.2). Total num frames: 4361355264. Throughput: 0: 43152.3. Samples: 640233640. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 16:49:17,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 16:49:18,224][09423] Updated weights for policy 0, policy_version 266197 (0.0042) [2024-06-28 16:49:21,449][09423] Updated weights for policy 0, policy_version 266207 (0.0046) [2024-06-28 16:49:22,922][09190] Fps is (10 sec: 37682.9, 60 sec: 43144.4, 300 sec: 43042.7). Total num frames: 4361551872. Throughput: 0: 43000.3. Samples: 640485060. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 16:49:22,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 16:49:25,895][09423] Updated weights for policy 0, policy_version 266217 (0.0035) [2024-06-28 16:49:27,921][09190] Fps is (10 sec: 44237.0, 60 sec: 43144.5, 300 sec: 43209.3). Total num frames: 4361797632. Throughput: 0: 43052.9. Samples: 640612360. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 16:49:27,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:49:29,347][09423] Updated weights for policy 0, policy_version 266227 (0.0038) [2024-06-28 16:49:32,921][09190] Fps is (10 sec: 40961.0, 60 sec: 42598.4, 300 sec: 42987.2). Total num frames: 4361961472. Throughput: 0: 43132.6. Samples: 640876580. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 16:49:32,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 16:49:33,472][09423] Updated weights for policy 0, policy_version 266237 (0.0030) [2024-06-28 16:49:36,830][09423] Updated weights for policy 0, policy_version 266247 (0.0033) [2024-06-28 16:49:37,921][09190] Fps is (10 sec: 42598.3, 60 sec: 43690.7, 300 sec: 43153.8). Total num frames: 4362223616. Throughput: 0: 43044.3. Samples: 641130400. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 16:49:37,924][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 16:49:41,382][09423] Updated weights for policy 0, policy_version 266257 (0.0031) [2024-06-28 16:49:42,921][09190] Fps is (10 sec: 49151.4, 60 sec: 43144.5, 300 sec: 43209.3). Total num frames: 4362452992. Throughput: 0: 43204.9. Samples: 641268260. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 16:49:42,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 16:49:44,287][09423] Updated weights for policy 0, policy_version 266267 (0.0044) [2024-06-28 16:49:47,921][09190] Fps is (10 sec: 40960.0, 60 sec: 43144.5, 300 sec: 43042.7). Total num frames: 4362633216. Throughput: 0: 43133.8. Samples: 641528560. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 16:49:47,924][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 16:49:48,675][09423] Updated weights for policy 0, policy_version 266277 (0.0035) [2024-06-28 16:49:51,870][09423] Updated weights for policy 0, policy_version 266287 (0.0031) [2024-06-28 16:49:52,921][09190] Fps is (10 sec: 40960.4, 60 sec: 43417.6, 300 sec: 43153.8). Total num frames: 4362862592. Throughput: 0: 43034.3. Samples: 641778760. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 16:49:52,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:49:56,496][09423] Updated weights for policy 0, policy_version 266297 (0.0031) [2024-06-28 16:49:57,925][09190] Fps is (10 sec: 45860.2, 60 sec: 42869.2, 300 sec: 43264.4). Total num frames: 4363091968. Throughput: 0: 43095.1. Samples: 641911540. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 16:49:57,925][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:49:59,458][09423] Updated weights for policy 0, policy_version 266307 (0.0036) [2024-06-28 16:50:02,921][09190] Fps is (10 sec: 39321.7, 60 sec: 42871.6, 300 sec: 42987.2). Total num frames: 4363255808. Throughput: 0: 43024.6. Samples: 642169740. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 16:50:02,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:50:03,913][09423] Updated weights for policy 0, policy_version 266317 (0.0023) [2024-06-28 16:50:07,059][09423] Updated weights for policy 0, policy_version 266327 (0.0032) [2024-06-28 16:50:07,921][09190] Fps is (10 sec: 42612.3, 60 sec: 43144.5, 300 sec: 43153.8). Total num frames: 4363517952. Throughput: 0: 42979.2. Samples: 642419120. Policy #0 lag: (min: 0.0, avg: 11.9, max: 22.0) [2024-06-28 16:50:07,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:50:11,349][09423] Updated weights for policy 0, policy_version 266337 (0.0033) [2024-06-28 16:50:12,921][09190] Fps is (10 sec: 49151.5, 60 sec: 42871.5, 300 sec: 43264.9). Total num frames: 4363747328. Throughput: 0: 43344.9. Samples: 642562880. Policy #0 lag: (min: 0.0, avg: 11.9, max: 22.0) [2024-06-28 16:50:12,930][09190] Avg episode reward: [(0, '0.732')] [2024-06-28 16:50:14,402][09423] Updated weights for policy 0, policy_version 266347 (0.0026) [2024-06-28 16:50:17,921][09190] Fps is (10 sec: 40959.8, 60 sec: 42871.4, 300 sec: 43042.7). Total num frames: 4363927552. Throughput: 0: 43257.6. Samples: 642823180. Policy #0 lag: (min: 0.0, avg: 11.9, max: 22.0) [2024-06-28 16:50:17,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:50:17,932][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000266353_4363927552.pth... [2024-06-28 16:50:18,002][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000265723_4353605632.pth [2024-06-28 16:50:18,910][09423] Updated weights for policy 0, policy_version 266357 (0.0027) [2024-06-28 16:50:22,249][09423] Updated weights for policy 0, policy_version 266367 (0.0039) [2024-06-28 16:50:22,921][09190] Fps is (10 sec: 42598.6, 60 sec: 43690.8, 300 sec: 43264.9). Total num frames: 4364173312. Throughput: 0: 43120.1. Samples: 643070800. Policy #0 lag: (min: 0.0, avg: 11.9, max: 22.0) [2024-06-28 16:50:22,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 16:50:26,649][09423] Updated weights for policy 0, policy_version 266377 (0.0033) [2024-06-28 16:50:27,921][09190] Fps is (10 sec: 45875.9, 60 sec: 43144.6, 300 sec: 43153.8). Total num frames: 4364386304. Throughput: 0: 43220.5. Samples: 643213180. Policy #0 lag: (min: 0.0, avg: 11.9, max: 22.0) [2024-06-28 16:50:27,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 16:50:29,896][09423] Updated weights for policy 0, policy_version 266387 (0.0027) [2024-06-28 16:50:32,921][09190] Fps is (10 sec: 39321.3, 60 sec: 43417.5, 300 sec: 43042.7). Total num frames: 4364566528. Throughput: 0: 43125.8. Samples: 643469220. Policy #0 lag: (min: 0.0, avg: 11.9, max: 22.0) [2024-06-28 16:50:32,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 16:50:34,066][09423] Updated weights for policy 0, policy_version 266397 (0.0038) [2024-06-28 16:50:34,893][09403] Signal inference workers to stop experience collection... (8850 times) [2024-06-28 16:50:34,898][09403] Signal inference workers to resume experience collection... (8850 times) [2024-06-28 16:50:34,927][09423] InferenceWorker_p0-w0: stopping experience collection (8850 times) [2024-06-28 16:50:34,927][09423] InferenceWorker_p0-w0: resuming experience collection (8850 times) [2024-06-28 16:50:37,261][09423] Updated weights for policy 0, policy_version 266407 (0.0037) [2024-06-28 16:50:37,921][09190] Fps is (10 sec: 44236.6, 60 sec: 43417.7, 300 sec: 43209.3). Total num frames: 4364828672. Throughput: 0: 43139.9. Samples: 643720060. Policy #0 lag: (min: 0.0, avg: 11.9, max: 22.0) [2024-06-28 16:50:37,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 16:50:41,854][09423] Updated weights for policy 0, policy_version 266417 (0.0039) [2024-06-28 16:50:42,921][09190] Fps is (10 sec: 45875.5, 60 sec: 42871.5, 300 sec: 43098.3). Total num frames: 4365025280. Throughput: 0: 43236.1. Samples: 643857020. Policy #0 lag: (min: 0.0, avg: 11.9, max: 22.0) [2024-06-28 16:50:42,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:50:44,789][09423] Updated weights for policy 0, policy_version 266427 (0.0042) [2024-06-28 16:50:47,921][09190] Fps is (10 sec: 39321.7, 60 sec: 43144.6, 300 sec: 43153.8). Total num frames: 4365221888. Throughput: 0: 43165.7. Samples: 644112200. Policy #0 lag: (min: 0.0, avg: 11.9, max: 22.0) [2024-06-28 16:50:47,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 16:50:49,345][09423] Updated weights for policy 0, policy_version 266437 (0.0039) [2024-06-28 16:50:52,666][09423] Updated weights for policy 0, policy_version 266447 (0.0030) [2024-06-28 16:50:52,921][09190] Fps is (10 sec: 45875.0, 60 sec: 43690.6, 300 sec: 43320.4). Total num frames: 4365484032. Throughput: 0: 43322.3. Samples: 644368620. Policy #0 lag: (min: 0.0, avg: 11.9, max: 22.0) [2024-06-28 16:50:52,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 16:50:56,734][09423] Updated weights for policy 0, policy_version 266457 (0.0036) [2024-06-28 16:50:57,921][09190] Fps is (10 sec: 44236.5, 60 sec: 42873.8, 300 sec: 43042.7). Total num frames: 4365664256. Throughput: 0: 43342.6. Samples: 644513300. Policy #0 lag: (min: 0.0, avg: 11.9, max: 22.0) [2024-06-28 16:50:57,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:50:59,979][09423] Updated weights for policy 0, policy_version 266467 (0.0030) [2024-06-28 16:51:02,924][09190] Fps is (10 sec: 39311.5, 60 sec: 43688.7, 300 sec: 43153.4). Total num frames: 4365877248. Throughput: 0: 43245.6. Samples: 644769340. Policy #0 lag: (min: 0.0, avg: 11.9, max: 22.0) [2024-06-28 16:51:02,925][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 16:51:04,162][09423] Updated weights for policy 0, policy_version 266477 (0.0031) [2024-06-28 16:51:07,441][09423] Updated weights for policy 0, policy_version 266487 (0.0031) [2024-06-28 16:51:07,921][09190] Fps is (10 sec: 47513.5, 60 sec: 43690.7, 300 sec: 43376.0). Total num frames: 4366139392. Throughput: 0: 43367.9. Samples: 645022360. Policy #0 lag: (min: 0.0, avg: 11.9, max: 22.0) [2024-06-28 16:51:07,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:51:11,944][09423] Updated weights for policy 0, policy_version 266497 (0.0036) [2024-06-28 16:51:12,921][09190] Fps is (10 sec: 42609.6, 60 sec: 42598.4, 300 sec: 43098.3). Total num frames: 4366303232. Throughput: 0: 43293.3. Samples: 645161380. Policy #0 lag: (min: 0.0, avg: 11.9, max: 22.0) [2024-06-28 16:51:12,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 16:51:15,309][09423] Updated weights for policy 0, policy_version 266507 (0.0037) [2024-06-28 16:51:17,921][09190] Fps is (10 sec: 37683.5, 60 sec: 43144.6, 300 sec: 43098.3). Total num frames: 4366516224. Throughput: 0: 43126.7. Samples: 645409920. Policy #0 lag: (min: 0.0, avg: 11.9, max: 22.0) [2024-06-28 16:51:17,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 16:51:19,451][09423] Updated weights for policy 0, policy_version 266517 (0.0032) [2024-06-28 16:51:22,585][09423] Updated weights for policy 0, policy_version 266527 (0.0034) [2024-06-28 16:51:22,921][09190] Fps is (10 sec: 47513.8, 60 sec: 43417.6, 300 sec: 43320.5). Total num frames: 4366778368. Throughput: 0: 43353.4. Samples: 645670960. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 16:51:22,922][09190] Avg episode reward: [(0, '0.738')] [2024-06-28 16:51:26,879][09423] Updated weights for policy 0, policy_version 266537 (0.0032) [2024-06-28 16:51:27,921][09190] Fps is (10 sec: 45875.0, 60 sec: 43144.5, 300 sec: 43098.4). Total num frames: 4366974976. Throughput: 0: 43462.6. Samples: 645812840. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 16:51:27,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 16:51:30,212][09423] Updated weights for policy 0, policy_version 266547 (0.0025) [2024-06-28 16:51:32,921][09190] Fps is (10 sec: 39321.4, 60 sec: 43417.7, 300 sec: 43209.3). Total num frames: 4367171584. Throughput: 0: 43330.2. Samples: 646062060. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 16:51:32,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:51:34,542][09423] Updated weights for policy 0, policy_version 266557 (0.0032) [2024-06-28 16:51:37,532][09423] Updated weights for policy 0, policy_version 266567 (0.0040) [2024-06-28 16:51:37,921][09190] Fps is (10 sec: 45875.5, 60 sec: 43417.6, 300 sec: 43320.6). Total num frames: 4367433728. Throughput: 0: 43543.6. Samples: 646328080. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 16:51:37,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 16:51:42,009][09423] Updated weights for policy 0, policy_version 266577 (0.0032) [2024-06-28 16:51:42,921][09190] Fps is (10 sec: 44236.3, 60 sec: 43144.5, 300 sec: 43098.2). Total num frames: 4367613952. Throughput: 0: 43472.9. Samples: 646469580. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 16:51:42,924][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 16:51:45,236][09423] Updated weights for policy 0, policy_version 266587 (0.0043) [2024-06-28 16:51:47,921][09190] Fps is (10 sec: 40959.6, 60 sec: 43690.6, 300 sec: 43209.3). Total num frames: 4367843328. Throughput: 0: 43370.0. Samples: 646720880. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 16:51:47,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 16:51:49,316][09423] Updated weights for policy 0, policy_version 266597 (0.0031) [2024-06-28 16:51:51,701][09403] Signal inference workers to stop experience collection... (8900 times) [2024-06-28 16:51:51,745][09423] InferenceWorker_p0-w0: stopping experience collection (8900 times) [2024-06-28 16:51:51,751][09403] Signal inference workers to resume experience collection... (8900 times) [2024-06-28 16:51:51,761][09423] InferenceWorker_p0-w0: resuming experience collection (8900 times) [2024-06-28 16:51:52,921][09190] Fps is (10 sec: 45875.4, 60 sec: 43144.5, 300 sec: 43264.9). Total num frames: 4368072704. Throughput: 0: 43383.1. Samples: 646974600. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 16:51:52,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 16:51:53,192][09423] Updated weights for policy 0, policy_version 266607 (0.0027) [2024-06-28 16:51:57,297][09423] Updated weights for policy 0, policy_version 266617 (0.0042) [2024-06-28 16:51:57,921][09190] Fps is (10 sec: 42598.6, 60 sec: 43417.6, 300 sec: 43098.3). Total num frames: 4368269312. Throughput: 0: 43209.7. Samples: 647105820. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 16:51:57,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 16:52:00,586][09423] Updated weights for policy 0, policy_version 266627 (0.0038) [2024-06-28 16:52:02,921][09190] Fps is (10 sec: 42598.6, 60 sec: 43692.6, 300 sec: 43264.9). Total num frames: 4368498688. Throughput: 0: 43385.8. Samples: 647362280. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 16:52:02,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:52:04,986][09423] Updated weights for policy 0, policy_version 266637 (0.0031) [2024-06-28 16:52:07,921][09190] Fps is (10 sec: 45875.5, 60 sec: 43144.6, 300 sec: 43264.9). Total num frames: 4368728064. Throughput: 0: 43280.4. Samples: 647618580. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 16:52:07,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 16:52:08,177][09423] Updated weights for policy 0, policy_version 266647 (0.0041) [2024-06-28 16:52:12,530][09423] Updated weights for policy 0, policy_version 266657 (0.0036) [2024-06-28 16:52:12,921][09190] Fps is (10 sec: 40959.5, 60 sec: 43417.5, 300 sec: 43153.8). Total num frames: 4368908288. Throughput: 0: 43166.6. Samples: 647755340. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 16:52:12,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 16:52:15,586][09423] Updated weights for policy 0, policy_version 266667 (0.0030) [2024-06-28 16:52:17,922][09190] Fps is (10 sec: 42597.2, 60 sec: 43963.6, 300 sec: 43375.9). Total num frames: 4369154048. Throughput: 0: 43397.5. Samples: 648014960. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 16:52:17,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:52:17,942][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000266672_4369154048.pth... [2024-06-28 16:52:17,998][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000266040_4358799360.pth [2024-06-28 16:52:20,258][09423] Updated weights for policy 0, policy_version 266677 (0.0043) [2024-06-28 16:52:22,921][09190] Fps is (10 sec: 47514.2, 60 sec: 43417.6, 300 sec: 43265.2). Total num frames: 4369383424. Throughput: 0: 43235.1. Samples: 648273660. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 16:52:22,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:52:23,187][09423] Updated weights for policy 0, policy_version 266687 (0.0040) [2024-06-28 16:52:27,627][09423] Updated weights for policy 0, policy_version 266697 (0.0034) [2024-06-28 16:52:27,921][09190] Fps is (10 sec: 40961.0, 60 sec: 43144.6, 300 sec: 43153.8). Total num frames: 4369563648. Throughput: 0: 43021.0. Samples: 648405520. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 16:52:27,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:52:30,759][09423] Updated weights for policy 0, policy_version 266707 (0.0034) [2024-06-28 16:52:32,921][09190] Fps is (10 sec: 39321.3, 60 sec: 43417.5, 300 sec: 43209.3). Total num frames: 4369776640. Throughput: 0: 43053.8. Samples: 648658300. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 16:52:32,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 16:52:35,181][09423] Updated weights for policy 0, policy_version 266717 (0.0042) [2024-06-28 16:52:37,921][09190] Fps is (10 sec: 44236.9, 60 sec: 42871.5, 300 sec: 43209.3). Total num frames: 4370006016. Throughput: 0: 43184.1. Samples: 648917880. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 16:52:37,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 16:52:38,515][09423] Updated weights for policy 0, policy_version 266727 (0.0035) [2024-06-28 16:52:42,922][09190] Fps is (10 sec: 42598.0, 60 sec: 43144.5, 300 sec: 43098.2). Total num frames: 4370202624. Throughput: 0: 43187.9. Samples: 649049280. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 16:52:42,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 16:52:43,158][09423] Updated weights for policy 0, policy_version 266737 (0.0036) [2024-06-28 16:52:45,781][09423] Updated weights for policy 0, policy_version 266747 (0.0034) [2024-06-28 16:52:47,921][09190] Fps is (10 sec: 42598.0, 60 sec: 43144.5, 300 sec: 43265.5). Total num frames: 4370432000. Throughput: 0: 43280.8. Samples: 649309920. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 16:52:47,922][09190] Avg episode reward: [(0, '0.733')] [2024-06-28 16:52:50,411][09423] Updated weights for policy 0, policy_version 266757 (0.0027) [2024-06-28 16:52:52,921][09190] Fps is (10 sec: 47513.6, 60 sec: 43417.5, 300 sec: 43209.3). Total num frames: 4370677760. Throughput: 0: 43425.2. Samples: 649572720. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 16:52:52,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 16:52:53,244][09423] Updated weights for policy 0, policy_version 266767 (0.0036) [2024-06-28 16:52:57,819][09423] Updated weights for policy 0, policy_version 266777 (0.0040) [2024-06-28 16:52:57,921][09190] Fps is (10 sec: 44237.1, 60 sec: 43417.6, 300 sec: 43154.3). Total num frames: 4370874368. Throughput: 0: 43391.7. Samples: 649707960. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 16:52:57,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 16:53:00,808][09423] Updated weights for policy 0, policy_version 266787 (0.0033) [2024-06-28 16:53:02,921][09190] Fps is (10 sec: 42599.3, 60 sec: 43417.6, 300 sec: 43376.0). Total num frames: 4371103744. Throughput: 0: 43357.2. Samples: 649966020. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 16:53:02,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:53:05,650][09423] Updated weights for policy 0, policy_version 266797 (0.0035) [2024-06-28 16:53:07,924][09190] Fps is (10 sec: 45864.2, 60 sec: 43415.8, 300 sec: 43264.5). Total num frames: 4371333120. Throughput: 0: 43379.9. Samples: 650225860. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 16:53:07,924][09190] Avg episode reward: [(0, '0.738')] [2024-06-28 16:53:08,506][09423] Updated weights for policy 0, policy_version 266807 (0.0029) [2024-06-28 16:53:12,921][09190] Fps is (10 sec: 40959.9, 60 sec: 43417.7, 300 sec: 43153.8). Total num frames: 4371513344. Throughput: 0: 43276.5. Samples: 650352960. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 16:53:12,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:53:12,949][09423] Updated weights for policy 0, policy_version 266817 (0.0026) [2024-06-28 16:53:14,666][09403] Signal inference workers to stop experience collection... (8950 times) [2024-06-28 16:53:14,667][09403] Signal inference workers to resume experience collection... (8950 times) [2024-06-28 16:53:14,679][09423] InferenceWorker_p0-w0: stopping experience collection (8950 times) [2024-06-28 16:53:14,679][09423] InferenceWorker_p0-w0: resuming experience collection (8950 times) [2024-06-28 16:53:16,325][09423] Updated weights for policy 0, policy_version 266827 (0.0024) [2024-06-28 16:53:17,921][09190] Fps is (10 sec: 40969.8, 60 sec: 43144.7, 300 sec: 43320.4). Total num frames: 4371742720. Throughput: 0: 43516.5. Samples: 650616540. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 16:53:17,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 16:53:20,713][09423] Updated weights for policy 0, policy_version 266837 (0.0030) [2024-06-28 16:53:22,921][09190] Fps is (10 sec: 45874.6, 60 sec: 43144.5, 300 sec: 43264.9). Total num frames: 4371972096. Throughput: 0: 43409.7. Samples: 650871320. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 16:53:22,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:53:23,935][09423] Updated weights for policy 0, policy_version 266847 (0.0039) [2024-06-28 16:53:27,921][09190] Fps is (10 sec: 40960.1, 60 sec: 43144.5, 300 sec: 43209.3). Total num frames: 4372152320. Throughput: 0: 43381.5. Samples: 651001440. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 16:53:27,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 16:53:28,291][09423] Updated weights for policy 0, policy_version 266857 (0.0039) [2024-06-28 16:53:31,344][09423] Updated weights for policy 0, policy_version 266867 (0.0028) [2024-06-28 16:53:32,921][09190] Fps is (10 sec: 44237.2, 60 sec: 43963.8, 300 sec: 43431.5). Total num frames: 4372414464. Throughput: 0: 43428.1. Samples: 651264180. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 16:53:32,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 16:53:35,848][09423] Updated weights for policy 0, policy_version 266877 (0.0037) [2024-06-28 16:53:37,921][09190] Fps is (10 sec: 45875.0, 60 sec: 43417.6, 300 sec: 43209.3). Total num frames: 4372611072. Throughput: 0: 43558.3. Samples: 651532840. Policy #0 lag: (min: 0.0, avg: 10.0, max: 20.0) [2024-06-28 16:53:37,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:53:38,891][09423] Updated weights for policy 0, policy_version 266887 (0.0038) [2024-06-28 16:53:42,921][09190] Fps is (10 sec: 39321.4, 60 sec: 43417.7, 300 sec: 43264.9). Total num frames: 4372807680. Throughput: 0: 43291.9. Samples: 651656100. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 16:53:42,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:53:43,116][09423] Updated weights for policy 0, policy_version 266897 (0.0033) [2024-06-28 16:53:46,949][09423] Updated weights for policy 0, policy_version 266907 (0.0037) [2024-06-28 16:53:47,921][09190] Fps is (10 sec: 45875.6, 60 sec: 43963.8, 300 sec: 43431.5). Total num frames: 4373069824. Throughput: 0: 43503.1. Samples: 651923660. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 16:53:47,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 16:53:50,616][09423] Updated weights for policy 0, policy_version 266917 (0.0031) [2024-06-28 16:53:52,921][09190] Fps is (10 sec: 45874.9, 60 sec: 43144.6, 300 sec: 43209.3). Total num frames: 4373266432. Throughput: 0: 43388.4. Samples: 652178240. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 16:53:52,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 16:53:54,390][09423] Updated weights for policy 0, policy_version 266927 (0.0037) [2024-06-28 16:53:57,922][09190] Fps is (10 sec: 39320.8, 60 sec: 43144.4, 300 sec: 43320.4). Total num frames: 4373463040. Throughput: 0: 43456.3. Samples: 652308500. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 16:53:57,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 16:53:58,094][09423] Updated weights for policy 0, policy_version 266937 (0.0031) [2024-06-28 16:54:01,707][09423] Updated weights for policy 0, policy_version 266947 (0.0034) [2024-06-28 16:54:02,921][09190] Fps is (10 sec: 44237.1, 60 sec: 43417.5, 300 sec: 43320.4). Total num frames: 4373708800. Throughput: 0: 43259.1. Samples: 652563200. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 16:54:02,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 16:54:06,273][09423] Updated weights for policy 0, policy_version 266957 (0.0034) [2024-06-28 16:54:07,921][09190] Fps is (10 sec: 44236.9, 60 sec: 42873.1, 300 sec: 43153.8). Total num frames: 4373905408. Throughput: 0: 43485.3. Samples: 652828160. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 16:54:07,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 16:54:09,305][09423] Updated weights for policy 0, policy_version 266967 (0.0028) [2024-06-28 16:54:12,923][09190] Fps is (10 sec: 40954.8, 60 sec: 43416.6, 300 sec: 43264.7). Total num frames: 4374118400. Throughput: 0: 43435.2. Samples: 652956080. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 16:54:12,923][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 16:54:13,537][09423] Updated weights for policy 0, policy_version 266977 (0.0025) [2024-06-28 16:54:16,875][09423] Updated weights for policy 0, policy_version 266987 (0.0033) [2024-06-28 16:54:17,921][09190] Fps is (10 sec: 45875.8, 60 sec: 43690.7, 300 sec: 43431.5). Total num frames: 4374364160. Throughput: 0: 43481.4. Samples: 653220840. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 16:54:17,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 16:54:18,000][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000266991_4374380544.pth... [2024-06-28 16:54:18,061][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000266353_4363927552.pth [2024-06-28 16:54:21,324][09423] Updated weights for policy 0, policy_version 266997 (0.0032) [2024-06-28 16:54:22,925][09190] Fps is (10 sec: 42587.6, 60 sec: 42868.8, 300 sec: 43208.8). Total num frames: 4374544384. Throughput: 0: 43138.1. Samples: 653474220. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 16:54:22,926][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 16:54:24,651][09423] Updated weights for policy 0, policy_version 267007 (0.0049) [2024-06-28 16:54:27,921][09190] Fps is (10 sec: 39321.6, 60 sec: 43417.6, 300 sec: 43375.9). Total num frames: 4374757376. Throughput: 0: 43065.0. Samples: 653594020. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 16:54:27,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:54:28,764][09423] Updated weights for policy 0, policy_version 267017 (0.0033) [2024-06-28 16:54:32,463][09423] Updated weights for policy 0, policy_version 267027 (0.0034) [2024-06-28 16:54:32,921][09190] Fps is (10 sec: 45893.0, 60 sec: 43144.5, 300 sec: 43320.4). Total num frames: 4375003136. Throughput: 0: 43086.6. Samples: 653862560. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 16:54:32,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:54:36,823][09423] Updated weights for policy 0, policy_version 267037 (0.0040) [2024-06-28 16:54:36,827][09403] Signal inference workers to stop experience collection... (9000 times) [2024-06-28 16:54:36,828][09403] Signal inference workers to resume experience collection... (9000 times) [2024-06-28 16:54:36,876][09423] InferenceWorker_p0-w0: stopping experience collection (9000 times) [2024-06-28 16:54:36,877][09423] InferenceWorker_p0-w0: resuming experience collection (9000 times) [2024-06-28 16:54:37,924][09190] Fps is (10 sec: 44225.5, 60 sec: 43142.7, 300 sec: 43209.0). Total num frames: 4375199744. Throughput: 0: 43113.7. Samples: 654118460. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 16:54:37,924][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 16:54:39,956][09423] Updated weights for policy 0, policy_version 267047 (0.0037) [2024-06-28 16:54:42,921][09190] Fps is (10 sec: 40959.9, 60 sec: 43417.6, 300 sec: 43320.4). Total num frames: 4375412736. Throughput: 0: 42762.3. Samples: 654232800. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 16:54:42,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 16:54:44,078][09423] Updated weights for policy 0, policy_version 267057 (0.0035) [2024-06-28 16:54:47,573][09423] Updated weights for policy 0, policy_version 267067 (0.0032) [2024-06-28 16:54:47,924][09190] Fps is (10 sec: 44236.9, 60 sec: 42869.6, 300 sec: 43320.0). Total num frames: 4375642112. Throughput: 0: 43212.7. Samples: 654507880. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 16:54:47,925][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 16:54:51,875][09423] Updated weights for policy 0, policy_version 267077 (0.0032) [2024-06-28 16:54:52,921][09190] Fps is (10 sec: 42598.6, 60 sec: 42871.6, 300 sec: 43209.8). Total num frames: 4375838720. Throughput: 0: 43060.6. Samples: 654765880. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 16:54:52,922][09190] Avg episode reward: [(0, '0.732')] [2024-06-28 16:54:54,886][09423] Updated weights for policy 0, policy_version 267087 (0.0031) [2024-06-28 16:54:57,922][09190] Fps is (10 sec: 40965.9, 60 sec: 43143.9, 300 sec: 43375.8). Total num frames: 4376051712. Throughput: 0: 43026.0. Samples: 654892240. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 16:54:57,923][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:54:59,477][09423] Updated weights for policy 0, policy_version 267097 (0.0032) [2024-06-28 16:55:02,351][09423] Updated weights for policy 0, policy_version 267107 (0.0022) [2024-06-28 16:55:02,921][09190] Fps is (10 sec: 45875.0, 60 sec: 43144.5, 300 sec: 43320.4). Total num frames: 4376297472. Throughput: 0: 42997.7. Samples: 655155740. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 16:55:02,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 16:55:06,740][09423] Updated weights for policy 0, policy_version 267117 (0.0045) [2024-06-28 16:55:07,921][09190] Fps is (10 sec: 44241.1, 60 sec: 43144.5, 300 sec: 43209.3). Total num frames: 4376494080. Throughput: 0: 43168.9. Samples: 655416660. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 16:55:07,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 16:55:10,425][09423] Updated weights for policy 0, policy_version 267127 (0.0021) [2024-06-28 16:55:12,921][09190] Fps is (10 sec: 42597.9, 60 sec: 43418.4, 300 sec: 43375.9). Total num frames: 4376723456. Throughput: 0: 43397.2. Samples: 655546900. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 16:55:12,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 16:55:14,292][09423] Updated weights for policy 0, policy_version 267137 (0.0036) [2024-06-28 16:55:17,706][09423] Updated weights for policy 0, policy_version 267147 (0.0031) [2024-06-28 16:55:17,921][09190] Fps is (10 sec: 44237.5, 60 sec: 42871.5, 300 sec: 43264.9). Total num frames: 4376936448. Throughput: 0: 43175.2. Samples: 655805440. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 16:55:17,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 16:55:21,596][09423] Updated weights for policy 0, policy_version 267157 (0.0031) [2024-06-28 16:55:22,922][09190] Fps is (10 sec: 40959.6, 60 sec: 43147.1, 300 sec: 43209.3). Total num frames: 4377133056. Throughput: 0: 43389.3. Samples: 656070880. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 16:55:22,928][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 16:55:25,124][09423] Updated weights for policy 0, policy_version 267167 (0.0037) [2024-06-28 16:55:27,922][09190] Fps is (10 sec: 44235.8, 60 sec: 43690.5, 300 sec: 43431.5). Total num frames: 4377378816. Throughput: 0: 43642.5. Samples: 656196720. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 16:55:27,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 16:55:29,194][09423] Updated weights for policy 0, policy_version 267177 (0.0035) [2024-06-28 16:55:32,387][09423] Updated weights for policy 0, policy_version 267187 (0.0031) [2024-06-28 16:55:32,921][09190] Fps is (10 sec: 47514.2, 60 sec: 43417.5, 300 sec: 43320.4). Total num frames: 4377608192. Throughput: 0: 43455.7. Samples: 656463280. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 16:55:32,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 16:55:36,691][09423] Updated weights for policy 0, policy_version 267197 (0.0027) [2024-06-28 16:55:37,921][09190] Fps is (10 sec: 40960.5, 60 sec: 43146.3, 300 sec: 43264.9). Total num frames: 4377788416. Throughput: 0: 43567.1. Samples: 656726400. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 16:55:37,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 16:55:40,516][09423] Updated weights for policy 0, policy_version 267207 (0.0026) [2024-06-28 16:55:42,921][09190] Fps is (10 sec: 42598.9, 60 sec: 43690.7, 300 sec: 43431.5). Total num frames: 4378034176. Throughput: 0: 43571.3. Samples: 656852900. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 16:55:42,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:55:44,331][09423] Updated weights for policy 0, policy_version 267217 (0.0040) [2024-06-28 16:55:47,921][09190] Fps is (10 sec: 44236.7, 60 sec: 43146.3, 300 sec: 43209.3). Total num frames: 4378230784. Throughput: 0: 43461.3. Samples: 657111500. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 16:55:47,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 16:55:47,939][09423] Updated weights for policy 0, policy_version 267227 (0.0021) [2024-06-28 16:55:52,085][09423] Updated weights for policy 0, policy_version 267237 (0.0038) [2024-06-28 16:55:52,921][09190] Fps is (10 sec: 39321.6, 60 sec: 43144.5, 300 sec: 43264.9). Total num frames: 4378427392. Throughput: 0: 43413.0. Samples: 657370240. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 16:55:52,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 16:55:55,486][09423] Updated weights for policy 0, policy_version 267247 (0.0045) [2024-06-28 16:55:57,272][09403] Signal inference workers to stop experience collection... (9050 times) [2024-06-28 16:55:57,272][09403] Signal inference workers to resume experience collection... (9050 times) [2024-06-28 16:55:57,317][09423] InferenceWorker_p0-w0: stopping experience collection (9050 times) [2024-06-28 16:55:57,317][09423] InferenceWorker_p0-w0: resuming experience collection (9050 times) [2024-06-28 16:55:57,921][09190] Fps is (10 sec: 45875.4, 60 sec: 43964.5, 300 sec: 43431.9). Total num frames: 4378689536. Throughput: 0: 43233.4. Samples: 657492400. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 16:55:57,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 16:55:59,405][09423] Updated weights for policy 0, policy_version 267257 (0.0040) [2024-06-28 16:56:02,921][09190] Fps is (10 sec: 44236.7, 60 sec: 42871.5, 300 sec: 43153.8). Total num frames: 4378869760. Throughput: 0: 43277.3. Samples: 657752920. Policy #0 lag: (min: 0.0, avg: 10.1, max: 24.0) [2024-06-28 16:56:02,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:56:03,205][09423] Updated weights for policy 0, policy_version 267267 (0.0037) [2024-06-28 16:56:06,727][09423] Updated weights for policy 0, policy_version 267277 (0.0026) [2024-06-28 16:56:07,922][09190] Fps is (10 sec: 37682.6, 60 sec: 42871.4, 300 sec: 43264.8). Total num frames: 4379066368. Throughput: 0: 43334.3. Samples: 658020920. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-28 16:56:07,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 16:56:10,734][09423] Updated weights for policy 0, policy_version 267287 (0.0025) [2024-06-28 16:56:12,921][09190] Fps is (10 sec: 45875.4, 60 sec: 43417.7, 300 sec: 43431.5). Total num frames: 4379328512. Throughput: 0: 43403.7. Samples: 658149880. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-28 16:56:12,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 16:56:14,390][09423] Updated weights for policy 0, policy_version 267297 (0.0034) [2024-06-28 16:56:17,921][09190] Fps is (10 sec: 45876.0, 60 sec: 43144.5, 300 sec: 43209.3). Total num frames: 4379525120. Throughput: 0: 43225.4. Samples: 658408420. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-28 16:56:17,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 16:56:17,960][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000267306_4379541504.pth... [2024-06-28 16:56:18,005][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000266672_4369154048.pth [2024-06-28 16:56:18,282][09423] Updated weights for policy 0, policy_version 267307 (0.0030) [2024-06-28 16:56:21,800][09423] Updated weights for policy 0, policy_version 267317 (0.0031) [2024-06-28 16:56:22,922][09190] Fps is (10 sec: 40959.2, 60 sec: 43417.7, 300 sec: 43264.9). Total num frames: 4379738112. Throughput: 0: 43194.1. Samples: 658670140. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-28 16:56:22,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:56:25,891][09423] Updated weights for policy 0, policy_version 267327 (0.0032) [2024-06-28 16:56:27,921][09190] Fps is (10 sec: 45875.3, 60 sec: 43417.7, 300 sec: 43431.5). Total num frames: 4379983872. Throughput: 0: 43240.9. Samples: 658798740. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-28 16:56:27,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 16:56:29,871][09423] Updated weights for policy 0, policy_version 267337 (0.0042) [2024-06-28 16:56:32,921][09190] Fps is (10 sec: 42599.2, 60 sec: 42598.5, 300 sec: 43153.8). Total num frames: 4380164096. Throughput: 0: 43298.3. Samples: 659059920. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-28 16:56:32,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:56:33,457][09423] Updated weights for policy 0, policy_version 267347 (0.0039) [2024-06-28 16:56:37,157][09423] Updated weights for policy 0, policy_version 267357 (0.0036) [2024-06-28 16:56:37,921][09190] Fps is (10 sec: 39321.3, 60 sec: 43144.5, 300 sec: 43264.9). Total num frames: 4380377088. Throughput: 0: 43243.9. Samples: 659316220. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-28 16:56:37,930][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 16:56:41,085][09423] Updated weights for policy 0, policy_version 267367 (0.0038) [2024-06-28 16:56:42,921][09190] Fps is (10 sec: 47513.4, 60 sec: 43417.6, 300 sec: 43376.0). Total num frames: 4380639232. Throughput: 0: 43418.7. Samples: 659446240. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-28 16:56:42,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 16:56:44,363][09423] Updated weights for policy 0, policy_version 267377 (0.0034) [2024-06-28 16:56:47,921][09190] Fps is (10 sec: 44236.6, 60 sec: 43144.5, 300 sec: 43209.3). Total num frames: 4380819456. Throughput: 0: 43431.5. Samples: 659707340. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-28 16:56:47,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 16:56:48,581][09423] Updated weights for policy 0, policy_version 267387 (0.0041) [2024-06-28 16:56:51,686][09423] Updated weights for policy 0, policy_version 267397 (0.0040) [2024-06-28 16:56:52,921][09190] Fps is (10 sec: 39321.5, 60 sec: 43417.6, 300 sec: 43264.9). Total num frames: 4381032448. Throughput: 0: 43276.1. Samples: 659968340. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-28 16:56:52,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 16:56:55,928][09423] Updated weights for policy 0, policy_version 267407 (0.0035) [2024-06-28 16:56:57,921][09190] Fps is (10 sec: 47513.9, 60 sec: 43417.6, 300 sec: 43375.9). Total num frames: 4381294592. Throughput: 0: 43334.6. Samples: 660099940. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-28 16:56:57,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 16:56:59,585][09423] Updated weights for policy 0, policy_version 267417 (0.0036) [2024-06-28 16:57:02,921][09190] Fps is (10 sec: 42598.5, 60 sec: 43144.5, 300 sec: 43153.8). Total num frames: 4381458432. Throughput: 0: 43223.1. Samples: 660353460. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-28 16:57:02,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 16:57:03,856][09423] Updated weights for policy 0, policy_version 267427 (0.0038) [2024-06-28 16:57:07,338][09423] Updated weights for policy 0, policy_version 267437 (0.0033) [2024-06-28 16:57:07,921][09190] Fps is (10 sec: 39321.9, 60 sec: 43690.8, 300 sec: 43320.4). Total num frames: 4381687808. Throughput: 0: 43068.2. Samples: 660608200. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-28 16:57:07,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 16:57:11,431][09423] Updated weights for policy 0, policy_version 267447 (0.0023) [2024-06-28 16:57:12,921][09190] Fps is (10 sec: 47513.4, 60 sec: 43417.5, 300 sec: 43320.4). Total num frames: 4381933568. Throughput: 0: 43236.4. Samples: 660744380. Policy #0 lag: (min: 0.0, avg: 11.7, max: 23.0) [2024-06-28 16:57:12,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 16:57:14,750][09423] Updated weights for policy 0, policy_version 267457 (0.0043) [2024-06-28 16:57:17,921][09190] Fps is (10 sec: 40959.7, 60 sec: 42871.4, 300 sec: 43098.2). Total num frames: 4382097408. Throughput: 0: 43110.6. Samples: 660999900. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 16:57:17,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 16:57:18,212][09403] Signal inference workers to stop experience collection... (9100 times) [2024-06-28 16:57:18,213][09403] Signal inference workers to resume experience collection... (9100 times) [2024-06-28 16:57:18,229][09423] InferenceWorker_p0-w0: stopping experience collection (9100 times) [2024-06-28 16:57:18,229][09423] InferenceWorker_p0-w0: resuming experience collection (9100 times) [2024-06-28 16:57:18,887][09423] Updated weights for policy 0, policy_version 267467 (0.0022) [2024-06-28 16:57:22,454][09423] Updated weights for policy 0, policy_version 267477 (0.0035) [2024-06-28 16:57:22,921][09190] Fps is (10 sec: 40960.0, 60 sec: 43417.7, 300 sec: 43320.4). Total num frames: 4382343168. Throughput: 0: 42980.4. Samples: 661250340. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 16:57:22,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 16:57:26,649][09423] Updated weights for policy 0, policy_version 267487 (0.0031) [2024-06-28 16:57:27,921][09190] Fps is (10 sec: 47513.8, 60 sec: 43144.5, 300 sec: 43376.0). Total num frames: 4382572544. Throughput: 0: 43124.9. Samples: 661386860. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 16:57:27,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:57:30,031][09423] Updated weights for policy 0, policy_version 267497 (0.0031) [2024-06-28 16:57:32,921][09190] Fps is (10 sec: 40960.4, 60 sec: 43144.5, 300 sec: 43209.3). Total num frames: 4382752768. Throughput: 0: 43192.1. Samples: 661650980. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 16:57:32,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 16:57:34,075][09423] Updated weights for policy 0, policy_version 267507 (0.0033) [2024-06-28 16:57:37,778][09423] Updated weights for policy 0, policy_version 267517 (0.0027) [2024-06-28 16:57:37,922][09190] Fps is (10 sec: 42597.4, 60 sec: 43690.6, 300 sec: 43375.9). Total num frames: 4382998528. Throughput: 0: 43132.7. Samples: 661909320. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 16:57:37,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 16:57:42,114][09423] Updated weights for policy 0, policy_version 267527 (0.0028) [2024-06-28 16:57:42,921][09190] Fps is (10 sec: 45874.9, 60 sec: 42871.5, 300 sec: 43320.4). Total num frames: 4383211520. Throughput: 0: 43132.4. Samples: 662040900. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 16:57:42,922][09190] Avg episode reward: [(0, '0.738')] [2024-06-28 16:57:45,136][09423] Updated weights for policy 0, policy_version 267537 (0.0027) [2024-06-28 16:57:47,921][09190] Fps is (10 sec: 39322.0, 60 sec: 42871.5, 300 sec: 43098.3). Total num frames: 4383391744. Throughput: 0: 43212.8. Samples: 662298040. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 16:57:47,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:57:49,593][09423] Updated weights for policy 0, policy_version 267547 (0.0038) [2024-06-28 16:57:52,705][09423] Updated weights for policy 0, policy_version 267557 (0.0027) [2024-06-28 16:57:52,921][09190] Fps is (10 sec: 44236.6, 60 sec: 43690.7, 300 sec: 43320.4). Total num frames: 4383653888. Throughput: 0: 43059.0. Samples: 662545860. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 16:57:52,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 16:57:56,919][09423] Updated weights for policy 0, policy_version 267567 (0.0037) [2024-06-28 16:57:57,922][09190] Fps is (10 sec: 47513.2, 60 sec: 42871.4, 300 sec: 43264.8). Total num frames: 4383866880. Throughput: 0: 43155.9. Samples: 662686400. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 16:57:57,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 16:58:00,860][09423] Updated weights for policy 0, policy_version 267577 (0.0041) [2024-06-28 16:58:02,921][09190] Fps is (10 sec: 37683.3, 60 sec: 42871.5, 300 sec: 43043.1). Total num frames: 4384030720. Throughput: 0: 43100.4. Samples: 662939420. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 16:58:02,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:58:04,737][09423] Updated weights for policy 0, policy_version 267587 (0.0044) [2024-06-28 16:58:07,921][09190] Fps is (10 sec: 40960.5, 60 sec: 43144.5, 300 sec: 43264.9). Total num frames: 4384276480. Throughput: 0: 43275.1. Samples: 663197720. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 16:58:07,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 16:58:08,315][09423] Updated weights for policy 0, policy_version 267597 (0.0023) [2024-06-28 16:58:12,111][09423] Updated weights for policy 0, policy_version 267607 (0.0031) [2024-06-28 16:58:12,922][09190] Fps is (10 sec: 47512.9, 60 sec: 42871.4, 300 sec: 43264.8). Total num frames: 4384505856. Throughput: 0: 43382.9. Samples: 663339100. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 16:58:12,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 16:58:15,782][09423] Updated weights for policy 0, policy_version 267617 (0.0038) [2024-06-28 16:58:17,921][09190] Fps is (10 sec: 42598.7, 60 sec: 43417.6, 300 sec: 43153.8). Total num frames: 4384702464. Throughput: 0: 43214.7. Samples: 663595640. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 16:58:17,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 16:58:18,023][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000267622_4384718848.pth... [2024-06-28 16:58:18,070][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000266991_4374380544.pth [2024-06-28 16:58:19,734][09423] Updated weights for policy 0, policy_version 267627 (0.0031) [2024-06-28 16:58:22,921][09190] Fps is (10 sec: 44237.2, 60 sec: 43417.6, 300 sec: 43375.9). Total num frames: 4384948224. Throughput: 0: 43065.4. Samples: 663847260. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 16:58:22,923][09190] Avg episode reward: [(0, '0.733')] [2024-06-28 16:58:23,468][09423] Updated weights for policy 0, policy_version 267637 (0.0036) [2024-06-28 16:58:27,248][09423] Updated weights for policy 0, policy_version 267647 (0.0037) [2024-06-28 16:58:27,921][09190] Fps is (10 sec: 44236.8, 60 sec: 42871.5, 300 sec: 43153.8). Total num frames: 4385144832. Throughput: 0: 43208.5. Samples: 663985280. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 16:58:27,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 16:58:30,882][09423] Updated weights for policy 0, policy_version 267657 (0.0038) [2024-06-28 16:58:32,921][09190] Fps is (10 sec: 39322.1, 60 sec: 43144.5, 300 sec: 43153.8). Total num frames: 4385341440. Throughput: 0: 43110.4. Samples: 664238000. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 16:58:32,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:58:34,878][09423] Updated weights for policy 0, policy_version 267667 (0.0038) [2024-06-28 16:58:37,921][09190] Fps is (10 sec: 44236.2, 60 sec: 43144.6, 300 sec: 43320.4). Total num frames: 4385587200. Throughput: 0: 43332.0. Samples: 664495800. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 16:58:37,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:58:38,592][09423] Updated weights for policy 0, policy_version 267677 (0.0038) [2024-06-28 16:58:42,173][09403] Signal inference workers to stop experience collection... (9150 times) [2024-06-28 16:58:42,174][09403] Signal inference workers to resume experience collection... (9150 times) [2024-06-28 16:58:42,214][09423] InferenceWorker_p0-w0: stopping experience collection (9150 times) [2024-06-28 16:58:42,214][09423] InferenceWorker_p0-w0: resuming experience collection (9150 times) [2024-06-28 16:58:42,310][09423] Updated weights for policy 0, policy_version 267687 (0.0035) [2024-06-28 16:58:42,921][09190] Fps is (10 sec: 45874.3, 60 sec: 43144.5, 300 sec: 43153.8). Total num frames: 4385800192. Throughput: 0: 43382.7. Samples: 664638620. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 16:58:42,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 16:58:45,889][09423] Updated weights for policy 0, policy_version 267697 (0.0031) [2024-06-28 16:58:47,922][09190] Fps is (10 sec: 39321.1, 60 sec: 43144.4, 300 sec: 43098.2). Total num frames: 4385980416. Throughput: 0: 43482.9. Samples: 664896160. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 16:58:47,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 16:58:49,699][09423] Updated weights for policy 0, policy_version 267707 (0.0032) [2024-06-28 16:58:52,921][09190] Fps is (10 sec: 44236.9, 60 sec: 43144.5, 300 sec: 43320.4). Total num frames: 4386242560. Throughput: 0: 43493.3. Samples: 665154920. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 16:58:52,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 16:58:53,146][09423] Updated weights for policy 0, policy_version 267717 (0.0031) [2024-06-28 16:58:57,371][09423] Updated weights for policy 0, policy_version 267727 (0.0021) [2024-06-28 16:58:57,921][09190] Fps is (10 sec: 49152.7, 60 sec: 43417.7, 300 sec: 43264.9). Total num frames: 4386471936. Throughput: 0: 43472.1. Samples: 665295340. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 16:58:57,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 16:59:00,766][09423] Updated weights for policy 0, policy_version 267737 (0.0037) [2024-06-28 16:59:02,921][09190] Fps is (10 sec: 40960.3, 60 sec: 43690.7, 300 sec: 43209.3). Total num frames: 4386652160. Throughput: 0: 43322.6. Samples: 665545160. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 16:59:02,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 16:59:04,830][09423] Updated weights for policy 0, policy_version 267747 (0.0024) [2024-06-28 16:59:07,921][09190] Fps is (10 sec: 42598.9, 60 sec: 43690.7, 300 sec: 43320.6). Total num frames: 4386897920. Throughput: 0: 43400.1. Samples: 665800260. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 16:59:07,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 16:59:08,535][09423] Updated weights for policy 0, policy_version 267757 (0.0030) [2024-06-28 16:59:12,512][09423] Updated weights for policy 0, policy_version 267767 (0.0032) [2024-06-28 16:59:12,921][09190] Fps is (10 sec: 45875.8, 60 sec: 43417.8, 300 sec: 43209.3). Total num frames: 4387110912. Throughput: 0: 43452.5. Samples: 665940640. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 16:59:12,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 16:59:15,971][09423] Updated weights for policy 0, policy_version 267777 (0.0039) [2024-06-28 16:59:17,921][09190] Fps is (10 sec: 39321.6, 60 sec: 43144.5, 300 sec: 43209.9). Total num frames: 4387291136. Throughput: 0: 43408.0. Samples: 666191360. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 16:59:17,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:59:20,144][09423] Updated weights for policy 0, policy_version 267787 (0.0038) [2024-06-28 16:59:22,921][09190] Fps is (10 sec: 44236.4, 60 sec: 43417.7, 300 sec: 43375.9). Total num frames: 4387553280. Throughput: 0: 43329.0. Samples: 666445600. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 16:59:22,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:59:23,392][09423] Updated weights for policy 0, policy_version 267797 (0.0039) [2024-06-28 16:59:27,614][09423] Updated weights for policy 0, policy_version 267807 (0.0039) [2024-06-28 16:59:27,921][09190] Fps is (10 sec: 47513.2, 60 sec: 43690.6, 300 sec: 43264.9). Total num frames: 4387766272. Throughput: 0: 43349.4. Samples: 666589340. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 16:59:27,923][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:59:30,618][09423] Updated weights for policy 0, policy_version 267817 (0.0033) [2024-06-28 16:59:32,921][09190] Fps is (10 sec: 39321.5, 60 sec: 43417.5, 300 sec: 43209.7). Total num frames: 4387946496. Throughput: 0: 43262.8. Samples: 666842980. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 16:59:32,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 16:59:35,125][09423] Updated weights for policy 0, policy_version 267827 (0.0022) [2024-06-28 16:59:37,922][09190] Fps is (10 sec: 44236.2, 60 sec: 43690.6, 300 sec: 43375.9). Total num frames: 4388208640. Throughput: 0: 43244.8. Samples: 667100940. Policy #0 lag: (min: 0.0, avg: 10.6, max: 23.0) [2024-06-28 16:59:37,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 16:59:38,335][09423] Updated weights for policy 0, policy_version 267837 (0.0031) [2024-06-28 16:59:42,604][09423] Updated weights for policy 0, policy_version 267847 (0.0033) [2024-06-28 16:59:42,921][09190] Fps is (10 sec: 47513.6, 60 sec: 43690.7, 300 sec: 43320.8). Total num frames: 4388421632. Throughput: 0: 43269.4. Samples: 667242460. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 16:59:42,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 16:59:46,550][09423] Updated weights for policy 0, policy_version 267857 (0.0027) [2024-06-28 16:59:47,921][09190] Fps is (10 sec: 39322.5, 60 sec: 43690.9, 300 sec: 43264.9). Total num frames: 4388601856. Throughput: 0: 43392.5. Samples: 667497820. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 16:59:47,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 16:59:50,229][09423] Updated weights for policy 0, policy_version 267867 (0.0026) [2024-06-28 16:59:52,921][09190] Fps is (10 sec: 40960.3, 60 sec: 43144.6, 300 sec: 43320.6). Total num frames: 4388831232. Throughput: 0: 43422.7. Samples: 667754280. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 16:59:52,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 16:59:54,013][09423] Updated weights for policy 0, policy_version 267877 (0.0027) [2024-06-28 16:59:57,847][09423] Updated weights for policy 0, policy_version 267887 (0.0022) [2024-06-28 16:59:57,921][09190] Fps is (10 sec: 45874.9, 60 sec: 43144.6, 300 sec: 43264.9). Total num frames: 4389060608. Throughput: 0: 43305.7. Samples: 667889400. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 16:59:57,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 17:00:01,519][09423] Updated weights for policy 0, policy_version 267897 (0.0048) [2024-06-28 17:00:02,921][09190] Fps is (10 sec: 42598.7, 60 sec: 43417.7, 300 sec: 43264.9). Total num frames: 4389257216. Throughput: 0: 43309.8. Samples: 668140300. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 17:00:02,922][09190] Avg episode reward: [(0, '0.738')] [2024-06-28 17:00:03,964][09403] Signal inference workers to stop experience collection... (9200 times) [2024-06-28 17:00:03,968][09403] Signal inference workers to resume experience collection... (9200 times) [2024-06-28 17:00:03,986][09423] InferenceWorker_p0-w0: stopping experience collection (9200 times) [2024-06-28 17:00:03,986][09423] InferenceWorker_p0-w0: resuming experience collection (9200 times) [2024-06-28 17:00:05,557][09423] Updated weights for policy 0, policy_version 267907 (0.0037) [2024-06-28 17:00:07,921][09190] Fps is (10 sec: 44236.4, 60 sec: 43417.5, 300 sec: 43320.4). Total num frames: 4389502976. Throughput: 0: 43390.1. Samples: 668398160. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 17:00:07,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 17:00:08,983][09423] Updated weights for policy 0, policy_version 267917 (0.0024) [2024-06-28 17:00:12,921][09190] Fps is (10 sec: 42598.0, 60 sec: 42871.4, 300 sec: 43209.3). Total num frames: 4389683200. Throughput: 0: 43327.6. Samples: 668539080. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 17:00:12,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 17:00:13,096][09423] Updated weights for policy 0, policy_version 267927 (0.0058) [2024-06-28 17:00:16,570][09423] Updated weights for policy 0, policy_version 267937 (0.0039) [2024-06-28 17:00:17,921][09190] Fps is (10 sec: 39322.0, 60 sec: 43417.6, 300 sec: 43264.9). Total num frames: 4389896192. Throughput: 0: 43099.2. Samples: 668782440. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 17:00:17,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 17:00:17,935][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000267938_4389896192.pth... [2024-06-28 17:00:17,992][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000267306_4379541504.pth [2024-06-28 17:00:20,523][09423] Updated weights for policy 0, policy_version 267947 (0.0022) [2024-06-28 17:00:22,921][09190] Fps is (10 sec: 45874.7, 60 sec: 43144.5, 300 sec: 43264.9). Total num frames: 4390141952. Throughput: 0: 43128.1. Samples: 669041700. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 17:00:22,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 17:00:24,488][09423] Updated weights for policy 0, policy_version 267957 (0.0052) [2024-06-28 17:00:27,922][09190] Fps is (10 sec: 44236.0, 60 sec: 42871.4, 300 sec: 43153.8). Total num frames: 4390338560. Throughput: 0: 43008.8. Samples: 669177860. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 17:00:27,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 17:00:28,392][09423] Updated weights for policy 0, policy_version 267967 (0.0029) [2024-06-28 17:00:32,069][09423] Updated weights for policy 0, policy_version 267977 (0.0022) [2024-06-28 17:00:32,921][09190] Fps is (10 sec: 39322.2, 60 sec: 43144.6, 300 sec: 43209.3). Total num frames: 4390535168. Throughput: 0: 43138.2. Samples: 669439040. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 17:00:32,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:00:35,755][09423] Updated weights for policy 0, policy_version 267987 (0.0038) [2024-06-28 17:00:37,928][09190] Fps is (10 sec: 45846.2, 60 sec: 43140.0, 300 sec: 43263.9). Total num frames: 4390797312. Throughput: 0: 43111.1. Samples: 669694560. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 17:00:37,928][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 17:00:39,554][09423] Updated weights for policy 0, policy_version 267997 (0.0029) [2024-06-28 17:00:42,921][09190] Fps is (10 sec: 44236.9, 60 sec: 42598.5, 300 sec: 43209.3). Total num frames: 4390977536. Throughput: 0: 43139.6. Samples: 669830680. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 17:00:42,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:00:43,397][09423] Updated weights for policy 0, policy_version 268007 (0.0035) [2024-06-28 17:00:46,918][09423] Updated weights for policy 0, policy_version 268017 (0.0022) [2024-06-28 17:00:47,921][09190] Fps is (10 sec: 39347.0, 60 sec: 43144.5, 300 sec: 43264.9). Total num frames: 4391190528. Throughput: 0: 43239.0. Samples: 670086060. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 17:00:47,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:00:50,834][09423] Updated weights for policy 0, policy_version 268027 (0.0037) [2024-06-28 17:00:52,921][09190] Fps is (10 sec: 45875.1, 60 sec: 43417.6, 300 sec: 43209.3). Total num frames: 4391436288. Throughput: 0: 43186.8. Samples: 670341560. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-28 17:00:52,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 17:00:54,638][09423] Updated weights for policy 0, policy_version 268037 (0.0035) [2024-06-28 17:00:57,924][09190] Fps is (10 sec: 44225.9, 60 sec: 42869.7, 300 sec: 43264.5). Total num frames: 4391632896. Throughput: 0: 43107.8. Samples: 670479040. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-28 17:00:57,924][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 17:00:58,417][09423] Updated weights for policy 0, policy_version 268047 (0.0033) [2024-06-28 17:01:02,636][09423] Updated weights for policy 0, policy_version 268057 (0.0031) [2024-06-28 17:01:02,921][09190] Fps is (10 sec: 40959.5, 60 sec: 43144.4, 300 sec: 43320.4). Total num frames: 4391845888. Throughput: 0: 43315.0. Samples: 670731620. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-28 17:01:02,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 17:01:06,073][09423] Updated weights for policy 0, policy_version 268067 (0.0041) [2024-06-28 17:01:07,921][09190] Fps is (10 sec: 47525.6, 60 sec: 43417.7, 300 sec: 43320.4). Total num frames: 4392108032. Throughput: 0: 43216.1. Samples: 670986420. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-28 17:01:07,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 17:01:10,334][09423] Updated weights for policy 0, policy_version 268077 (0.0028) [2024-06-28 17:01:12,921][09190] Fps is (10 sec: 44237.1, 60 sec: 43417.6, 300 sec: 43264.9). Total num frames: 4392288256. Throughput: 0: 43357.0. Samples: 671128920. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-28 17:01:12,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:01:13,461][09423] Updated weights for policy 0, policy_version 268087 (0.0035) [2024-06-28 17:01:17,778][09423] Updated weights for policy 0, policy_version 268097 (0.0040) [2024-06-28 17:01:17,921][09190] Fps is (10 sec: 39321.8, 60 sec: 43417.6, 300 sec: 43264.9). Total num frames: 4392501248. Throughput: 0: 43204.5. Samples: 671383240. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-28 17:01:17,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 17:01:21,122][09423] Updated weights for policy 0, policy_version 268107 (0.0037) [2024-06-28 17:01:22,921][09190] Fps is (10 sec: 45875.4, 60 sec: 43417.7, 300 sec: 43264.9). Total num frames: 4392747008. Throughput: 0: 43185.8. Samples: 671637640. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-28 17:01:22,922][09190] Avg episode reward: [(0, '0.738')] [2024-06-28 17:01:25,295][09423] Updated weights for policy 0, policy_version 268117 (0.0041) [2024-06-28 17:01:27,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42871.6, 300 sec: 43209.3). Total num frames: 4392910848. Throughput: 0: 43197.8. Samples: 671774580. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-28 17:01:27,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:01:28,663][09423] Updated weights for policy 0, policy_version 268127 (0.0032) [2024-06-28 17:01:32,701][09423] Updated weights for policy 0, policy_version 268137 (0.0034) [2024-06-28 17:01:32,921][09190] Fps is (10 sec: 40960.0, 60 sec: 43690.7, 300 sec: 43320.4). Total num frames: 4393156608. Throughput: 0: 43221.0. Samples: 672031000. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-28 17:01:32,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:01:36,258][09423] Updated weights for policy 0, policy_version 268147 (0.0037) [2024-06-28 17:01:37,921][09190] Fps is (10 sec: 49151.4, 60 sec: 43422.3, 300 sec: 43264.9). Total num frames: 4393402368. Throughput: 0: 43307.4. Samples: 672290400. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-28 17:01:37,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:01:40,327][09423] Updated weights for policy 0, policy_version 268157 (0.0027) [2024-06-28 17:01:41,233][09403] Signal inference workers to stop experience collection... (9250 times) [2024-06-28 17:01:41,273][09423] InferenceWorker_p0-w0: stopping experience collection (9250 times) [2024-06-28 17:01:41,294][09403] Signal inference workers to resume experience collection... (9250 times) [2024-06-28 17:01:41,295][09423] InferenceWorker_p0-w0: resuming experience collection (9250 times) [2024-06-28 17:01:42,921][09190] Fps is (10 sec: 40959.9, 60 sec: 43144.5, 300 sec: 43209.3). Total num frames: 4393566208. Throughput: 0: 43231.3. Samples: 672424340. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-28 17:01:42,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:01:43,854][09423] Updated weights for policy 0, policy_version 268167 (0.0034) [2024-06-28 17:01:47,921][09190] Fps is (10 sec: 39321.7, 60 sec: 43417.6, 300 sec: 43264.9). Total num frames: 4393795584. Throughput: 0: 43331.1. Samples: 672681520. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-28 17:01:47,932][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 17:01:48,242][09423] Updated weights for policy 0, policy_version 268177 (0.0028) [2024-06-28 17:01:51,402][09423] Updated weights for policy 0, policy_version 268187 (0.0029) [2024-06-28 17:01:52,921][09190] Fps is (10 sec: 49151.8, 60 sec: 43690.6, 300 sec: 43264.9). Total num frames: 4394057728. Throughput: 0: 43281.8. Samples: 672934100. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-28 17:01:52,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:01:55,832][09423] Updated weights for policy 0, policy_version 268197 (0.0029) [2024-06-28 17:01:57,921][09190] Fps is (10 sec: 44237.2, 60 sec: 43419.4, 300 sec: 43320.4). Total num frames: 4394237952. Throughput: 0: 43296.5. Samples: 673077260. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-28 17:01:57,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 17:01:59,148][09423] Updated weights for policy 0, policy_version 268207 (0.0036) [2024-06-28 17:02:02,921][09190] Fps is (10 sec: 37683.1, 60 sec: 43144.6, 300 sec: 43209.3). Total num frames: 4394434560. Throughput: 0: 43083.5. Samples: 673322000. Policy #0 lag: (min: 0.0, avg: 11.5, max: 24.0) [2024-06-28 17:02:02,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 17:02:03,432][09423] Updated weights for policy 0, policy_version 268217 (0.0030) [2024-06-28 17:02:06,690][09423] Updated weights for policy 0, policy_version 268227 (0.0029) [2024-06-28 17:02:07,921][09190] Fps is (10 sec: 45875.2, 60 sec: 43144.6, 300 sec: 43264.9). Total num frames: 4394696704. Throughput: 0: 43335.6. Samples: 673587740. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 17:02:07,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 17:02:10,906][09423] Updated weights for policy 0, policy_version 268237 (0.0034) [2024-06-28 17:02:12,921][09190] Fps is (10 sec: 42598.9, 60 sec: 42871.5, 300 sec: 43264.9). Total num frames: 4394860544. Throughput: 0: 43365.4. Samples: 673726020. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 17:02:12,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:02:14,217][09423] Updated weights for policy 0, policy_version 268247 (0.0025) [2024-06-28 17:02:17,921][09190] Fps is (10 sec: 39321.2, 60 sec: 43144.4, 300 sec: 43209.3). Total num frames: 4395089920. Throughput: 0: 43104.8. Samples: 673970720. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 17:02:17,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 17:02:17,947][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000268255_4395089920.pth... [2024-06-28 17:02:17,995][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000267622_4384718848.pth [2024-06-28 17:02:18,438][09423] Updated weights for policy 0, policy_version 268257 (0.0037) [2024-06-28 17:02:21,826][09423] Updated weights for policy 0, policy_version 268267 (0.0037) [2024-06-28 17:02:22,921][09190] Fps is (10 sec: 47513.3, 60 sec: 43144.5, 300 sec: 43264.9). Total num frames: 4395335680. Throughput: 0: 43124.1. Samples: 674230980. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 17:02:22,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 17:02:25,918][09423] Updated weights for policy 0, policy_version 268277 (0.0031) [2024-06-28 17:02:27,922][09190] Fps is (10 sec: 42595.6, 60 sec: 43417.1, 300 sec: 43264.8). Total num frames: 4395515904. Throughput: 0: 43201.5. Samples: 674368440. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 17:02:27,923][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:02:29,326][09423] Updated weights for policy 0, policy_version 268287 (0.0035) [2024-06-28 17:02:32,921][09190] Fps is (10 sec: 40960.4, 60 sec: 43144.6, 300 sec: 43209.4). Total num frames: 4395745280. Throughput: 0: 43109.5. Samples: 674621440. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 17:02:32,921][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 17:02:33,388][09423] Updated weights for policy 0, policy_version 268297 (0.0046) [2024-06-28 17:02:36,998][09423] Updated weights for policy 0, policy_version 268307 (0.0042) [2024-06-28 17:02:37,921][09190] Fps is (10 sec: 45878.5, 60 sec: 42871.5, 300 sec: 43264.9). Total num frames: 4395974656. Throughput: 0: 43332.5. Samples: 674884060. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 17:02:37,922][09190] Avg episode reward: [(0, '0.726')] [2024-06-28 17:02:40,992][09423] Updated weights for policy 0, policy_version 268317 (0.0033) [2024-06-28 17:02:42,921][09190] Fps is (10 sec: 40960.0, 60 sec: 43144.6, 300 sec: 43264.9). Total num frames: 4396154880. Throughput: 0: 43289.4. Samples: 675025280. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 17:02:42,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 17:02:44,440][09423] Updated weights for policy 0, policy_version 268327 (0.0031) [2024-06-28 17:02:47,921][09190] Fps is (10 sec: 42598.7, 60 sec: 43417.7, 300 sec: 43209.3). Total num frames: 4396400640. Throughput: 0: 43434.8. Samples: 675276560. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 17:02:47,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 17:02:48,208][09423] Updated weights for policy 0, policy_version 268337 (0.0037) [2024-06-28 17:02:52,070][09423] Updated weights for policy 0, policy_version 268347 (0.0033) [2024-06-28 17:02:52,921][09190] Fps is (10 sec: 49151.4, 60 sec: 43144.5, 300 sec: 43320.4). Total num frames: 4396646400. Throughput: 0: 43376.8. Samples: 675539700. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 17:02:52,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:02:55,719][09423] Updated weights for policy 0, policy_version 268357 (0.0025) [2024-06-28 17:02:57,921][09190] Fps is (10 sec: 40960.1, 60 sec: 42871.5, 300 sec: 43320.4). Total num frames: 4396810240. Throughput: 0: 43250.7. Samples: 675672300. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 17:02:57,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:02:59,423][09423] Updated weights for policy 0, policy_version 268367 (0.0040) [2024-06-28 17:03:02,922][09190] Fps is (10 sec: 40959.5, 60 sec: 43690.6, 300 sec: 43320.4). Total num frames: 4397056000. Throughput: 0: 43410.6. Samples: 675924200. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 17:03:02,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 17:03:03,671][09423] Updated weights for policy 0, policy_version 268377 (0.0032) [2024-06-28 17:03:04,884][09403] Signal inference workers to stop experience collection... (9300 times) [2024-06-28 17:03:04,884][09403] Signal inference workers to resume experience collection... (9300 times) [2024-06-28 17:03:04,909][09423] InferenceWorker_p0-w0: stopping experience collection (9300 times) [2024-06-28 17:03:04,909][09423] InferenceWorker_p0-w0: resuming experience collection (9300 times) [2024-06-28 17:03:07,296][09423] Updated weights for policy 0, policy_version 268387 (0.0028) [2024-06-28 17:03:07,921][09190] Fps is (10 sec: 47513.3, 60 sec: 43144.5, 300 sec: 43320.4). Total num frames: 4397285376. Throughput: 0: 43423.6. Samples: 676185040. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 17:03:07,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 17:03:11,297][09423] Updated weights for policy 0, policy_version 268397 (0.0034) [2024-06-28 17:03:12,921][09190] Fps is (10 sec: 40960.8, 60 sec: 43417.6, 300 sec: 43264.9). Total num frames: 4397465600. Throughput: 0: 43318.5. Samples: 676317740. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 17:03:12,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:03:14,669][09423] Updated weights for policy 0, policy_version 268407 (0.0028) [2024-06-28 17:03:17,921][09190] Fps is (10 sec: 42598.3, 60 sec: 43690.7, 300 sec: 43264.9). Total num frames: 4397711360. Throughput: 0: 43375.9. Samples: 676573360. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 17:03:17,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 17:03:18,827][09423] Updated weights for policy 0, policy_version 268417 (0.0036) [2024-06-28 17:03:22,149][09423] Updated weights for policy 0, policy_version 268427 (0.0041) [2024-06-28 17:03:22,921][09190] Fps is (10 sec: 47513.5, 60 sec: 43417.6, 300 sec: 43375.9). Total num frames: 4397940736. Throughput: 0: 43510.2. Samples: 676842020. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 17:03:22,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 17:03:26,156][09423] Updated weights for policy 0, policy_version 268437 (0.0032) [2024-06-28 17:03:27,922][09190] Fps is (10 sec: 40959.4, 60 sec: 43418.0, 300 sec: 43320.4). Total num frames: 4398120960. Throughput: 0: 43369.6. Samples: 676976920. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 17:03:27,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 17:03:29,738][09423] Updated weights for policy 0, policy_version 268447 (0.0026) [2024-06-28 17:03:32,921][09190] Fps is (10 sec: 42598.6, 60 sec: 43690.7, 300 sec: 43320.4). Total num frames: 4398366720. Throughput: 0: 43425.8. Samples: 677230720. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 17:03:32,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:03:33,924][09423] Updated weights for policy 0, policy_version 268457 (0.0031) [2024-06-28 17:03:37,203][09423] Updated weights for policy 0, policy_version 268467 (0.0031) [2024-06-28 17:03:37,921][09190] Fps is (10 sec: 45875.8, 60 sec: 43417.6, 300 sec: 43320.4). Total num frames: 4398579712. Throughput: 0: 43389.4. Samples: 677492220. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 17:03:37,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 17:03:41,477][09423] Updated weights for policy 0, policy_version 268477 (0.0039) [2024-06-28 17:03:42,921][09190] Fps is (10 sec: 40959.8, 60 sec: 43690.6, 300 sec: 43376.0). Total num frames: 4398776320. Throughput: 0: 43288.0. Samples: 677620260. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 17:03:42,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:03:45,054][09423] Updated weights for policy 0, policy_version 268487 (0.0038) [2024-06-28 17:03:47,924][09190] Fps is (10 sec: 44227.0, 60 sec: 43689.0, 300 sec: 43320.1). Total num frames: 4399022080. Throughput: 0: 43361.6. Samples: 677875560. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 17:03:47,924][09190] Avg episode reward: [(0, '0.734')] [2024-06-28 17:03:48,784][09423] Updated weights for policy 0, policy_version 268497 (0.0031) [2024-06-28 17:03:52,315][09423] Updated weights for policy 0, policy_version 268507 (0.0023) [2024-06-28 17:03:52,924][09190] Fps is (10 sec: 45863.6, 60 sec: 43142.8, 300 sec: 43264.5). Total num frames: 4399235072. Throughput: 0: 43523.3. Samples: 678143700. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 17:03:52,924][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 17:03:56,326][09423] Updated weights for policy 0, policy_version 268517 (0.0029) [2024-06-28 17:03:57,922][09190] Fps is (10 sec: 40968.5, 60 sec: 43690.5, 300 sec: 43320.4). Total num frames: 4399431680. Throughput: 0: 43525.6. Samples: 678276400. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 17:03:57,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 17:03:59,841][09423] Updated weights for policy 0, policy_version 268527 (0.0031) [2024-06-28 17:04:02,921][09190] Fps is (10 sec: 44248.0, 60 sec: 43690.8, 300 sec: 43320.4). Total num frames: 4399677440. Throughput: 0: 43688.0. Samples: 678539320. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 17:04:02,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:04:03,584][09423] Updated weights for policy 0, policy_version 268537 (0.0032) [2024-06-28 17:04:07,488][09423] Updated weights for policy 0, policy_version 268547 (0.0029) [2024-06-28 17:04:07,921][09190] Fps is (10 sec: 45876.1, 60 sec: 43417.6, 300 sec: 43320.4). Total num frames: 4399890432. Throughput: 0: 43471.1. Samples: 678798220. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 17:04:07,922][09190] Avg episode reward: [(0, '0.738')] [2024-06-28 17:04:11,534][09423] Updated weights for policy 0, policy_version 268557 (0.0045) [2024-06-28 17:04:12,921][09190] Fps is (10 sec: 40959.6, 60 sec: 43690.6, 300 sec: 43375.9). Total num frames: 4400087040. Throughput: 0: 43340.5. Samples: 678927240. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 17:04:12,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 17:04:15,023][09423] Updated weights for policy 0, policy_version 268567 (0.0039) [2024-06-28 17:04:17,921][09190] Fps is (10 sec: 44236.7, 60 sec: 43690.7, 300 sec: 43320.4). Total num frames: 4400332800. Throughput: 0: 43524.8. Samples: 679189340. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 17:04:17,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 17:04:17,937][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000268575_4400332800.pth... [2024-06-28 17:04:17,990][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000267938_4389896192.pth [2024-06-28 17:04:18,829][09423] Updated weights for policy 0, policy_version 268577 (0.0045) [2024-06-28 17:04:22,426][09423] Updated weights for policy 0, policy_version 268587 (0.0021) [2024-06-28 17:04:22,924][09190] Fps is (10 sec: 45864.0, 60 sec: 43415.8, 300 sec: 43320.0). Total num frames: 4400545792. Throughput: 0: 43577.6. Samples: 679453320. Policy #0 lag: (min: 0.0, avg: 10.8, max: 23.0) [2024-06-28 17:04:22,925][09190] Avg episode reward: [(0, '0.725')] [2024-06-28 17:04:26,524][09423] Updated weights for policy 0, policy_version 268597 (0.0023) [2024-06-28 17:04:27,921][09190] Fps is (10 sec: 40959.6, 60 sec: 43690.7, 300 sec: 43375.9). Total num frames: 4400742400. Throughput: 0: 43644.8. Samples: 679584280. Policy #0 lag: (min: 0.0, avg: 11.6, max: 24.0) [2024-06-28 17:04:27,922][09190] Avg episode reward: [(0, '0.691')] [2024-06-28 17:04:29,794][09423] Updated weights for policy 0, policy_version 268607 (0.0026) [2024-06-28 17:04:32,921][09190] Fps is (10 sec: 42609.3, 60 sec: 43417.6, 300 sec: 43264.9). Total num frames: 4400971776. Throughput: 0: 43766.2. Samples: 679844940. Policy #0 lag: (min: 0.0, avg: 11.6, max: 24.0) [2024-06-28 17:04:32,922][09190] Avg episode reward: [(0, '0.673')] [2024-06-28 17:04:33,805][09423] Updated weights for policy 0, policy_version 268617 (0.0034) [2024-06-28 17:04:37,520][09423] Updated weights for policy 0, policy_version 268627 (0.0024) [2024-06-28 17:04:37,921][09190] Fps is (10 sec: 45875.3, 60 sec: 43690.6, 300 sec: 43320.4). Total num frames: 4401201152. Throughput: 0: 43579.7. Samples: 680104680. Policy #0 lag: (min: 0.0, avg: 11.6, max: 24.0) [2024-06-28 17:04:37,922][09190] Avg episode reward: [(0, '0.690')] [2024-06-28 17:04:38,264][09403] Signal inference workers to stop experience collection... (9350 times) [2024-06-28 17:04:38,269][09403] Signal inference workers to resume experience collection... (9350 times) [2024-06-28 17:04:38,291][09423] InferenceWorker_p0-w0: stopping experience collection (9350 times) [2024-06-28 17:04:38,292][09423] InferenceWorker_p0-w0: resuming experience collection (9350 times) [2024-06-28 17:04:41,309][09423] Updated weights for policy 0, policy_version 268637 (0.0031) [2024-06-28 17:04:42,921][09190] Fps is (10 sec: 42597.8, 60 sec: 43690.6, 300 sec: 43375.9). Total num frames: 4401397760. Throughput: 0: 43465.8. Samples: 680232360. Policy #0 lag: (min: 0.0, avg: 11.6, max: 24.0) [2024-06-28 17:04:42,922][09190] Avg episode reward: [(0, '0.666')] [2024-06-28 17:04:45,454][09423] Updated weights for policy 0, policy_version 268647 (0.0032) [2024-06-28 17:04:47,922][09190] Fps is (10 sec: 42597.8, 60 sec: 43419.1, 300 sec: 43375.9). Total num frames: 4401627136. Throughput: 0: 43386.0. Samples: 680491700. Policy #0 lag: (min: 0.0, avg: 11.6, max: 24.0) [2024-06-28 17:04:47,922][09190] Avg episode reward: [(0, '0.659')] [2024-06-28 17:04:48,992][09423] Updated weights for policy 0, policy_version 268657 (0.0023) [2024-06-28 17:04:52,859][09423] Updated weights for policy 0, policy_version 268667 (0.0030) [2024-06-28 17:04:52,921][09190] Fps is (10 sec: 44237.4, 60 sec: 43419.4, 300 sec: 43320.4). Total num frames: 4401840128. Throughput: 0: 43587.1. Samples: 680759640. Policy #0 lag: (min: 0.0, avg: 11.6, max: 24.0) [2024-06-28 17:04:52,922][09190] Avg episode reward: [(0, '0.688')] [2024-06-28 17:04:56,341][09423] Updated weights for policy 0, policy_version 268677 (0.0034) [2024-06-28 17:04:57,921][09190] Fps is (10 sec: 42599.0, 60 sec: 43690.7, 300 sec: 43375.9). Total num frames: 4402053120. Throughput: 0: 43512.0. Samples: 680885280. Policy #0 lag: (min: 0.0, avg: 11.6, max: 24.0) [2024-06-28 17:04:57,922][09190] Avg episode reward: [(0, '0.726')] [2024-06-28 17:05:00,438][09423] Updated weights for policy 0, policy_version 268687 (0.0024) [2024-06-28 17:05:02,924][09190] Fps is (10 sec: 44225.5, 60 sec: 43415.8, 300 sec: 43320.1). Total num frames: 4402282496. Throughput: 0: 43526.0. Samples: 681148120. Policy #0 lag: (min: 0.0, avg: 11.6, max: 24.0) [2024-06-28 17:05:02,925][09190] Avg episode reward: [(0, '0.688')] [2024-06-28 17:05:03,875][09423] Updated weights for policy 0, policy_version 268697 (0.0037) [2024-06-28 17:05:07,785][09423] Updated weights for policy 0, policy_version 268707 (0.0031) [2024-06-28 17:05:07,921][09190] Fps is (10 sec: 44237.2, 60 sec: 43417.6, 300 sec: 43431.5). Total num frames: 4402495488. Throughput: 0: 43576.3. Samples: 681414140. Policy #0 lag: (min: 0.0, avg: 11.6, max: 24.0) [2024-06-28 17:05:07,922][09190] Avg episode reward: [(0, '0.699')] [2024-06-28 17:05:11,351][09423] Updated weights for policy 0, policy_version 268717 (0.0031) [2024-06-28 17:05:12,921][09190] Fps is (10 sec: 40970.7, 60 sec: 43417.7, 300 sec: 43376.0). Total num frames: 4402692096. Throughput: 0: 43468.2. Samples: 681540340. Policy #0 lag: (min: 0.0, avg: 11.6, max: 24.0) [2024-06-28 17:05:12,922][09190] Avg episode reward: [(0, '0.686')] [2024-06-28 17:05:15,207][09423] Updated weights for policy 0, policy_version 268727 (0.0035) [2024-06-28 17:05:17,924][09190] Fps is (10 sec: 44225.5, 60 sec: 43415.8, 300 sec: 43375.6). Total num frames: 4402937856. Throughput: 0: 43477.1. Samples: 681801520. Policy #0 lag: (min: 0.0, avg: 11.6, max: 24.0) [2024-06-28 17:05:17,924][09190] Avg episode reward: [(0, '0.714')] [2024-06-28 17:05:18,882][09423] Updated weights for policy 0, policy_version 268737 (0.0042) [2024-06-28 17:05:22,493][09423] Updated weights for policy 0, policy_version 268747 (0.0027) [2024-06-28 17:05:22,923][09190] Fps is (10 sec: 47504.9, 60 sec: 43691.2, 300 sec: 43486.8). Total num frames: 4403167232. Throughput: 0: 43464.6. Samples: 682060660. Policy #0 lag: (min: 0.0, avg: 11.6, max: 24.0) [2024-06-28 17:05:22,924][09190] Avg episode reward: [(0, '0.683')] [2024-06-28 17:05:26,524][09423] Updated weights for policy 0, policy_version 268757 (0.0031) [2024-06-28 17:05:27,922][09190] Fps is (10 sec: 42608.4, 60 sec: 43690.6, 300 sec: 43487.0). Total num frames: 4403363840. Throughput: 0: 43619.5. Samples: 682195240. Policy #0 lag: (min: 0.0, avg: 11.6, max: 24.0) [2024-06-28 17:05:27,922][09190] Avg episode reward: [(0, '0.617')] [2024-06-28 17:05:30,269][09423] Updated weights for policy 0, policy_version 268767 (0.0034) [2024-06-28 17:05:32,921][09190] Fps is (10 sec: 40967.0, 60 sec: 43417.6, 300 sec: 43321.4). Total num frames: 4403576832. Throughput: 0: 43724.2. Samples: 682459280. Policy #0 lag: (min: 0.0, avg: 11.6, max: 24.0) [2024-06-28 17:05:32,922][09190] Avg episode reward: [(0, '0.700')] [2024-06-28 17:05:33,741][09423] Updated weights for policy 0, policy_version 268777 (0.0026) [2024-06-28 17:05:37,682][09423] Updated weights for policy 0, policy_version 268787 (0.0037) [2024-06-28 17:05:37,921][09190] Fps is (10 sec: 44237.3, 60 sec: 43417.6, 300 sec: 43487.0). Total num frames: 4403806208. Throughput: 0: 43620.8. Samples: 682722580. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 17:05:37,922][09190] Avg episode reward: [(0, '0.695')] [2024-06-28 17:05:41,022][09423] Updated weights for policy 0, policy_version 268797 (0.0027) [2024-06-28 17:05:42,921][09190] Fps is (10 sec: 44237.0, 60 sec: 43690.8, 300 sec: 43487.0). Total num frames: 4404019200. Throughput: 0: 43742.3. Samples: 682853680. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 17:05:42,922][09190] Avg episode reward: [(0, '0.665')] [2024-06-28 17:05:45,426][09423] Updated weights for policy 0, policy_version 268807 (0.0046) [2024-06-28 17:05:47,921][09190] Fps is (10 sec: 44237.4, 60 sec: 43690.9, 300 sec: 43431.5). Total num frames: 4404248576. Throughput: 0: 43699.9. Samples: 683114500. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 17:05:47,922][09190] Avg episode reward: [(0, '0.712')] [2024-06-28 17:05:48,602][09423] Updated weights for policy 0, policy_version 268817 (0.0034) [2024-06-28 17:05:52,921][09423] Updated weights for policy 0, policy_version 268827 (0.0030) [2024-06-28 17:05:52,921][09190] Fps is (10 sec: 44236.7, 60 sec: 43690.7, 300 sec: 43487.4). Total num frames: 4404461568. Throughput: 0: 43589.3. Samples: 683375660. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 17:05:52,922][09190] Avg episode reward: [(0, '0.669')] [2024-06-28 17:05:56,355][09423] Updated weights for policy 0, policy_version 268837 (0.0027) [2024-06-28 17:05:57,928][09190] Fps is (10 sec: 40933.1, 60 sec: 43412.9, 300 sec: 43430.5). Total num frames: 4404658176. Throughput: 0: 43604.7. Samples: 683502840. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 17:05:57,928][09190] Avg episode reward: [(0, '0.722')] [2024-06-28 17:06:00,297][09423] Updated weights for policy 0, policy_version 268847 (0.0035) [2024-06-28 17:06:02,921][09190] Fps is (10 sec: 42598.1, 60 sec: 43419.4, 300 sec: 43320.4). Total num frames: 4404887552. Throughput: 0: 43734.4. Samples: 683769460. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 17:06:02,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 17:06:03,774][09423] Updated weights for policy 0, policy_version 268857 (0.0032) [2024-06-28 17:06:07,753][09423] Updated weights for policy 0, policy_version 268867 (0.0038) [2024-06-28 17:06:07,921][09190] Fps is (10 sec: 45905.1, 60 sec: 43690.6, 300 sec: 43487.0). Total num frames: 4405116928. Throughput: 0: 43774.6. Samples: 684030440. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 17:06:07,922][09190] Avg episode reward: [(0, '0.699')] [2024-06-28 17:06:11,004][09403] Signal inference workers to stop experience collection... (9400 times) [2024-06-28 17:06:11,005][09403] Signal inference workers to resume experience collection... (9400 times) [2024-06-28 17:06:11,046][09423] InferenceWorker_p0-w0: stopping experience collection (9400 times) [2024-06-28 17:06:11,046][09423] InferenceWorker_p0-w0: resuming experience collection (9400 times) [2024-06-28 17:06:11,535][09423] Updated weights for policy 0, policy_version 268877 (0.0045) [2024-06-28 17:06:12,921][09190] Fps is (10 sec: 44237.2, 60 sec: 43963.7, 300 sec: 43487.0). Total num frames: 4405329920. Throughput: 0: 43781.5. Samples: 684165400. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 17:06:12,922][09190] Avg episode reward: [(0, '0.729')] [2024-06-28 17:06:15,289][09423] Updated weights for policy 0, policy_version 268887 (0.0033) [2024-06-28 17:06:17,921][09190] Fps is (10 sec: 40960.3, 60 sec: 43146.4, 300 sec: 43320.4). Total num frames: 4405526528. Throughput: 0: 43593.0. Samples: 684420960. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 17:06:17,922][09190] Avg episode reward: [(0, '0.706')] [2024-06-28 17:06:18,006][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000268893_4405542912.pth... [2024-06-28 17:06:18,075][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000268255_4395089920.pth [2024-06-28 17:06:19,049][09423] Updated weights for policy 0, policy_version 268897 (0.0030) [2024-06-28 17:06:22,703][09423] Updated weights for policy 0, policy_version 268907 (0.0034) [2024-06-28 17:06:22,921][09190] Fps is (10 sec: 44236.7, 60 sec: 43418.9, 300 sec: 43598.1). Total num frames: 4405772288. Throughput: 0: 43589.0. Samples: 684684080. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 17:06:22,922][09190] Avg episode reward: [(0, '0.730')] [2024-06-28 17:06:26,407][09423] Updated weights for policy 0, policy_version 268917 (0.0038) [2024-06-28 17:06:27,924][09190] Fps is (10 sec: 45863.4, 60 sec: 43689.0, 300 sec: 43486.7). Total num frames: 4405985280. Throughput: 0: 43561.1. Samples: 684814040. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 17:06:27,924][09190] Avg episode reward: [(0, '0.726')] [2024-06-28 17:06:30,395][09423] Updated weights for policy 0, policy_version 268927 (0.0030) [2024-06-28 17:06:32,921][09190] Fps is (10 sec: 42598.1, 60 sec: 43690.6, 300 sec: 43375.9). Total num frames: 4406198272. Throughput: 0: 43581.6. Samples: 685075680. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 17:06:32,922][09190] Avg episode reward: [(0, '0.732')] [2024-06-28 17:06:33,643][09423] Updated weights for policy 0, policy_version 268937 (0.0031) [2024-06-28 17:06:37,901][09423] Updated weights for policy 0, policy_version 268947 (0.0042) [2024-06-28 17:06:37,922][09190] Fps is (10 sec: 44247.1, 60 sec: 43690.6, 300 sec: 43598.1). Total num frames: 4406427648. Throughput: 0: 43729.6. Samples: 685343500. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 17:06:37,922][09190] Avg episode reward: [(0, '0.679')] [2024-06-28 17:06:41,530][09423] Updated weights for policy 0, policy_version 268957 (0.0024) [2024-06-28 17:06:42,921][09190] Fps is (10 sec: 44237.2, 60 sec: 43690.7, 300 sec: 43542.6). Total num frames: 4406640640. Throughput: 0: 43779.7. Samples: 685472640. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 17:06:42,922][09190] Avg episode reward: [(0, '0.732')] [2024-06-28 17:06:45,169][09423] Updated weights for policy 0, policy_version 268967 (0.0032) [2024-06-28 17:06:47,921][09190] Fps is (10 sec: 42598.9, 60 sec: 43417.5, 300 sec: 43375.9). Total num frames: 4406853632. Throughput: 0: 43686.2. Samples: 685735340. Policy #0 lag: (min: 0.0, avg: 9.3, max: 21.0) [2024-06-28 17:06:47,922][09190] Avg episode reward: [(0, '0.721')] [2024-06-28 17:06:48,849][09423] Updated weights for policy 0, policy_version 268977 (0.0024) [2024-06-28 17:06:52,771][09423] Updated weights for policy 0, policy_version 268987 (0.0034) [2024-06-28 17:06:52,922][09190] Fps is (10 sec: 44232.8, 60 sec: 43690.0, 300 sec: 43542.4). Total num frames: 4407083008. Throughput: 0: 43608.0. Samples: 685992840. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 17:06:52,923][09190] Avg episode reward: [(0, '0.761')] [2024-06-28 17:06:56,181][09423] Updated weights for policy 0, policy_version 268997 (0.0034) [2024-06-28 17:06:57,921][09190] Fps is (10 sec: 42598.5, 60 sec: 43695.4, 300 sec: 43542.6). Total num frames: 4407279616. Throughput: 0: 43602.6. Samples: 686127520. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 17:06:57,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 17:07:00,399][09423] Updated weights for policy 0, policy_version 269007 (0.0031) [2024-06-28 17:07:02,921][09190] Fps is (10 sec: 42601.8, 60 sec: 43690.7, 300 sec: 43431.5). Total num frames: 4407508992. Throughput: 0: 43636.3. Samples: 686384600. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 17:07:02,922][09190] Avg episode reward: [(0, '0.756')] [2024-06-28 17:07:04,043][09423] Updated weights for policy 0, policy_version 269017 (0.0024) [2024-06-28 17:07:07,889][09423] Updated weights for policy 0, policy_version 269027 (0.0030) [2024-06-28 17:07:07,921][09190] Fps is (10 sec: 45874.8, 60 sec: 43690.6, 300 sec: 43653.6). Total num frames: 4407738368. Throughput: 0: 43739.9. Samples: 686652380. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 17:07:07,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:07:11,274][09423] Updated weights for policy 0, policy_version 269037 (0.0023) [2024-06-28 17:07:12,924][09190] Fps is (10 sec: 44225.9, 60 sec: 43688.8, 300 sec: 43597.7). Total num frames: 4407951360. Throughput: 0: 43676.8. Samples: 686779500. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 17:07:12,924][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 17:07:15,186][09423] Updated weights for policy 0, policy_version 269047 (0.0032) [2024-06-28 17:07:17,922][09190] Fps is (10 sec: 42598.1, 60 sec: 43963.5, 300 sec: 43487.0). Total num frames: 4408164352. Throughput: 0: 43688.3. Samples: 687041660. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 17:07:17,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 17:07:18,815][09423] Updated weights for policy 0, policy_version 269057 (0.0032) [2024-06-28 17:07:22,887][09423] Updated weights for policy 0, policy_version 269067 (0.0040) [2024-06-28 17:07:22,921][09190] Fps is (10 sec: 44247.8, 60 sec: 43690.6, 300 sec: 43653.7). Total num frames: 4408393728. Throughput: 0: 43636.5. Samples: 687307140. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 17:07:22,924][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 17:07:26,218][09423] Updated weights for policy 0, policy_version 269077 (0.0028) [2024-06-28 17:07:27,921][09190] Fps is (10 sec: 44238.0, 60 sec: 43692.5, 300 sec: 43598.1). Total num frames: 4408606720. Throughput: 0: 43638.7. Samples: 687436380. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 17:07:27,921][09190] Avg episode reward: [(0, '0.734')] [2024-06-28 17:07:30,357][09423] Updated weights for policy 0, policy_version 269087 (0.0043) [2024-06-28 17:07:32,924][09190] Fps is (10 sec: 42588.0, 60 sec: 43688.9, 300 sec: 43542.2). Total num frames: 4408819712. Throughput: 0: 43745.6. Samples: 687704000. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 17:07:32,924][09190] Avg episode reward: [(0, '0.762')] [2024-06-28 17:07:32,940][09403] Signal inference workers to stop experience collection... (9450 times) [2024-06-28 17:07:32,995][09423] InferenceWorker_p0-w0: stopping experience collection (9450 times) [2024-06-28 17:07:33,002][09403] Signal inference workers to resume experience collection... (9450 times) [2024-06-28 17:07:33,004][09423] InferenceWorker_p0-w0: resuming experience collection (9450 times) [2024-06-28 17:07:33,483][09423] Updated weights for policy 0, policy_version 269097 (0.0026) [2024-06-28 17:07:37,791][09423] Updated weights for policy 0, policy_version 269107 (0.0023) [2024-06-28 17:07:37,921][09190] Fps is (10 sec: 44236.4, 60 sec: 43690.8, 300 sec: 43709.2). Total num frames: 4409049088. Throughput: 0: 43718.2. Samples: 687960120. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 17:07:37,922][09190] Avg episode reward: [(0, '0.767')] [2024-06-28 17:07:41,329][09423] Updated weights for policy 0, policy_version 269117 (0.0035) [2024-06-28 17:07:42,921][09190] Fps is (10 sec: 45886.6, 60 sec: 43963.7, 300 sec: 43653.6). Total num frames: 4409278464. Throughput: 0: 43567.5. Samples: 688088060. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 17:07:42,924][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 17:07:45,223][09423] Updated weights for policy 0, policy_version 269127 (0.0023) [2024-06-28 17:07:47,921][09190] Fps is (10 sec: 42598.2, 60 sec: 43690.7, 300 sec: 43487.0). Total num frames: 4409475072. Throughput: 0: 43737.4. Samples: 688352780. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 17:07:47,922][09190] Avg episode reward: [(0, '0.757')] [2024-06-28 17:07:49,009][09423] Updated weights for policy 0, policy_version 269137 (0.0030) [2024-06-28 17:07:52,921][09190] Fps is (10 sec: 40960.3, 60 sec: 43418.3, 300 sec: 43653.6). Total num frames: 4409688064. Throughput: 0: 43529.5. Samples: 688611200. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 17:07:52,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:07:52,973][09423] Updated weights for policy 0, policy_version 269147 (0.0025) [2024-06-28 17:07:56,561][09423] Updated weights for policy 0, policy_version 269157 (0.0031) [2024-06-28 17:07:57,921][09190] Fps is (10 sec: 42598.5, 60 sec: 43690.7, 300 sec: 43542.6). Total num frames: 4409901056. Throughput: 0: 43481.1. Samples: 688736040. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 17:07:57,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 17:08:01,002][09423] Updated weights for policy 0, policy_version 269167 (0.0036) [2024-06-28 17:08:02,921][09190] Fps is (10 sec: 42598.0, 60 sec: 43417.6, 300 sec: 43487.0). Total num frames: 4410114048. Throughput: 0: 43185.0. Samples: 688984980. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 17:08:02,922][09190] Avg episode reward: [(0, '0.762')] [2024-06-28 17:08:04,228][09423] Updated weights for policy 0, policy_version 269177 (0.0042) [2024-06-28 17:08:07,921][09190] Fps is (10 sec: 42598.2, 60 sec: 43144.6, 300 sec: 43598.1). Total num frames: 4410327040. Throughput: 0: 43062.7. Samples: 689244960. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 17:08:07,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 17:08:08,449][09423] Updated weights for policy 0, policy_version 269187 (0.0034) [2024-06-28 17:08:12,137][09423] Updated weights for policy 0, policy_version 269197 (0.0024) [2024-06-28 17:08:12,921][09190] Fps is (10 sec: 42598.6, 60 sec: 43146.4, 300 sec: 43487.0). Total num frames: 4410540032. Throughput: 0: 42886.6. Samples: 689366280. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 17:08:12,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 17:08:16,535][09423] Updated weights for policy 0, policy_version 269207 (0.0029) [2024-06-28 17:08:17,921][09190] Fps is (10 sec: 42598.8, 60 sec: 43144.7, 300 sec: 43431.5). Total num frames: 4410753024. Throughput: 0: 42659.7. Samples: 689623580. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 17:08:17,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 17:08:17,998][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000269212_4410769408.pth... [2024-06-28 17:08:18,039][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000268575_4400332800.pth [2024-06-28 17:08:19,610][09423] Updated weights for policy 0, policy_version 269217 (0.0028) [2024-06-28 17:08:22,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42871.5, 300 sec: 43542.6). Total num frames: 4410966016. Throughput: 0: 42660.0. Samples: 689879820. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 17:08:22,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 17:08:23,940][09423] Updated weights for policy 0, policy_version 269227 (0.0032) [2024-06-28 17:08:27,457][09423] Updated weights for policy 0, policy_version 269237 (0.0035) [2024-06-28 17:08:27,922][09190] Fps is (10 sec: 44235.9, 60 sec: 43144.3, 300 sec: 43487.0). Total num frames: 4411195392. Throughput: 0: 42583.0. Samples: 690004300. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 17:08:27,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:08:31,473][09423] Updated weights for policy 0, policy_version 269247 (0.0029) [2024-06-28 17:08:32,921][09190] Fps is (10 sec: 42598.2, 60 sec: 42873.2, 300 sec: 43431.5). Total num frames: 4411392000. Throughput: 0: 42471.1. Samples: 690263980. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 17:08:32,922][09190] Avg episode reward: [(0, '0.756')] [2024-06-28 17:08:34,982][09423] Updated weights for policy 0, policy_version 269257 (0.0026) [2024-06-28 17:08:37,921][09190] Fps is (10 sec: 40960.8, 60 sec: 42598.4, 300 sec: 43487.0). Total num frames: 4411604992. Throughput: 0: 42536.0. Samples: 690525320. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 17:08:37,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 17:08:39,346][09423] Updated weights for policy 0, policy_version 269267 (0.0042) [2024-06-28 17:08:42,601][09423] Updated weights for policy 0, policy_version 269277 (0.0028) [2024-06-28 17:08:42,921][09190] Fps is (10 sec: 44236.8, 60 sec: 42598.4, 300 sec: 43431.8). Total num frames: 4411834368. Throughput: 0: 42484.4. Samples: 690647840. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 17:08:42,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 17:08:46,966][09423] Updated weights for policy 0, policy_version 269287 (0.0035) [2024-06-28 17:08:47,921][09190] Fps is (10 sec: 44236.4, 60 sec: 42871.4, 300 sec: 43431.8). Total num frames: 4412047360. Throughput: 0: 42726.6. Samples: 690907680. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 17:08:47,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 17:08:50,356][09423] Updated weights for policy 0, policy_version 269297 (0.0026) [2024-06-28 17:08:52,921][09190] Fps is (10 sec: 39321.2, 60 sec: 42325.2, 300 sec: 43375.9). Total num frames: 4412227584. Throughput: 0: 42657.7. Samples: 691164560. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 17:08:52,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:08:54,460][09423] Updated weights for policy 0, policy_version 269307 (0.0035) [2024-06-28 17:08:57,922][09190] Fps is (10 sec: 42598.2, 60 sec: 42871.4, 300 sec: 43375.9). Total num frames: 4412473344. Throughput: 0: 42629.2. Samples: 691284600. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 17:08:57,923][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:08:58,235][09423] Updated weights for policy 0, policy_version 269317 (0.0042) [2024-06-28 17:09:02,153][09423] Updated weights for policy 0, policy_version 269327 (0.0040) [2024-06-28 17:09:02,921][09190] Fps is (10 sec: 44237.4, 60 sec: 42598.4, 300 sec: 43320.4). Total num frames: 4412669952. Throughput: 0: 42658.2. Samples: 691543200. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 17:09:02,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:09:06,128][09423] Updated weights for policy 0, policy_version 269337 (0.0030) [2024-06-28 17:09:07,922][09190] Fps is (10 sec: 40960.0, 60 sec: 42598.4, 300 sec: 43375.9). Total num frames: 4412882944. Throughput: 0: 42695.9. Samples: 691801140. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 17:09:07,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 17:09:09,221][09403] Signal inference workers to stop experience collection... (9500 times) [2024-06-28 17:09:09,222][09403] Signal inference workers to resume experience collection... (9500 times) [2024-06-28 17:09:09,256][09423] InferenceWorker_p0-w0: stopping experience collection (9500 times) [2024-06-28 17:09:09,256][09423] InferenceWorker_p0-w0: resuming experience collection (9500 times) [2024-06-28 17:09:09,725][09423] Updated weights for policy 0, policy_version 269347 (0.0033) [2024-06-28 17:09:12,921][09190] Fps is (10 sec: 42597.8, 60 sec: 42598.3, 300 sec: 43264.8). Total num frames: 4413095936. Throughput: 0: 42706.3. Samples: 691926080. Policy #0 lag: (min: 0.0, avg: 9.4, max: 21.0) [2024-06-28 17:09:12,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 17:09:13,853][09423] Updated weights for policy 0, policy_version 269357 (0.0035) [2024-06-28 17:09:17,457][09423] Updated weights for policy 0, policy_version 269367 (0.0031) [2024-06-28 17:09:17,922][09190] Fps is (10 sec: 44236.7, 60 sec: 42871.4, 300 sec: 43320.8). Total num frames: 4413325312. Throughput: 0: 42564.3. Samples: 692179380. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 17:09:17,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:09:21,870][09423] Updated weights for policy 0, policy_version 269377 (0.0030) [2024-06-28 17:09:22,922][09190] Fps is (10 sec: 39321.5, 60 sec: 42052.1, 300 sec: 43209.3). Total num frames: 4413489152. Throughput: 0: 42592.3. Samples: 692441980. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 17:09:22,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:09:25,281][09423] Updated weights for policy 0, policy_version 269387 (0.0038) [2024-06-28 17:09:27,921][09190] Fps is (10 sec: 40960.6, 60 sec: 42325.5, 300 sec: 43264.9). Total num frames: 4413734912. Throughput: 0: 42546.2. Samples: 692562420. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 17:09:27,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 17:09:29,149][09423] Updated weights for policy 0, policy_version 269397 (0.0033) [2024-06-28 17:09:32,760][09423] Updated weights for policy 0, policy_version 269407 (0.0030) [2024-06-28 17:09:32,921][09190] Fps is (10 sec: 49152.5, 60 sec: 43144.5, 300 sec: 43320.4). Total num frames: 4413980672. Throughput: 0: 42520.9. Samples: 692821120. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 17:09:32,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 17:09:37,137][09423] Updated weights for policy 0, policy_version 269417 (0.0030) [2024-06-28 17:09:37,921][09190] Fps is (10 sec: 40959.7, 60 sec: 42325.3, 300 sec: 43209.3). Total num frames: 4414144512. Throughput: 0: 42668.9. Samples: 693084660. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 17:09:37,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 17:09:40,543][09423] Updated weights for policy 0, policy_version 269427 (0.0034) [2024-06-28 17:09:42,922][09190] Fps is (10 sec: 39321.2, 60 sec: 42325.2, 300 sec: 43209.3). Total num frames: 4414373888. Throughput: 0: 42595.5. Samples: 693201400. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 17:09:42,931][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:09:44,912][09423] Updated weights for policy 0, policy_version 269437 (0.0031) [2024-06-28 17:09:47,922][09190] Fps is (10 sec: 45874.9, 60 sec: 42598.4, 300 sec: 43264.8). Total num frames: 4414603264. Throughput: 0: 42677.2. Samples: 693463680. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 17:09:47,930][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 17:09:48,003][09423] Updated weights for policy 0, policy_version 269447 (0.0024) [2024-06-28 17:09:52,854][09423] Updated weights for policy 0, policy_version 269457 (0.0034) [2024-06-28 17:09:52,921][09190] Fps is (10 sec: 40960.7, 60 sec: 42598.5, 300 sec: 43153.8). Total num frames: 4414783488. Throughput: 0: 42651.2. Samples: 693720440. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 17:09:52,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 17:09:55,376][09423] Updated weights for policy 0, policy_version 269467 (0.0034) [2024-06-28 17:09:57,921][09190] Fps is (10 sec: 42598.6, 60 sec: 42598.4, 300 sec: 43209.7). Total num frames: 4415029248. Throughput: 0: 42464.0. Samples: 693836960. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 17:09:57,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:10:00,629][09423] Updated weights for policy 0, policy_version 269477 (0.0033) [2024-06-28 17:10:02,921][09190] Fps is (10 sec: 47513.5, 60 sec: 43144.5, 300 sec: 43264.9). Total num frames: 4415258624. Throughput: 0: 42632.6. Samples: 694097840. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 17:10:02,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 17:10:03,037][09423] Updated weights for policy 0, policy_version 269487 (0.0037) [2024-06-28 17:10:07,921][09190] Fps is (10 sec: 39321.6, 60 sec: 42325.4, 300 sec: 43153.8). Total num frames: 4415422464. Throughput: 0: 42594.7. Samples: 694358740. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 17:10:07,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:10:08,268][09423] Updated weights for policy 0, policy_version 269497 (0.0037) [2024-06-28 17:10:10,906][09423] Updated weights for policy 0, policy_version 269507 (0.0029) [2024-06-28 17:10:12,921][09190] Fps is (10 sec: 40959.7, 60 sec: 42871.5, 300 sec: 43154.1). Total num frames: 4415668224. Throughput: 0: 42557.7. Samples: 694477520. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 17:10:12,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:10:15,543][09423] Updated weights for policy 0, policy_version 269517 (0.0025) [2024-06-28 17:10:17,921][09190] Fps is (10 sec: 45875.5, 60 sec: 42598.5, 300 sec: 43098.5). Total num frames: 4415881216. Throughput: 0: 42602.3. Samples: 694738220. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 17:10:17,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 17:10:18,039][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000269525_4415897600.pth... [2024-06-28 17:10:18,098][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000268893_4405542912.pth [2024-06-28 17:10:18,492][09423] Updated weights for policy 0, policy_version 269527 (0.0035) [2024-06-28 17:10:22,921][09190] Fps is (10 sec: 40960.5, 60 sec: 43144.7, 300 sec: 43098.3). Total num frames: 4416077824. Throughput: 0: 42403.7. Samples: 694992820. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 17:10:22,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 17:10:23,403][09423] Updated weights for policy 0, policy_version 269537 (0.0025) [2024-06-28 17:10:26,399][09423] Updated weights for policy 0, policy_version 269547 (0.0040) [2024-06-28 17:10:27,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42598.4, 300 sec: 43098.2). Total num frames: 4416290816. Throughput: 0: 42453.0. Samples: 695111780. Policy #0 lag: (min: 0.0, avg: 9.9, max: 24.0) [2024-06-28 17:10:27,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 17:10:30,958][09423] Updated weights for policy 0, policy_version 269557 (0.0029) [2024-06-28 17:10:31,792][09403] Signal inference workers to stop experience collection... (9550 times) [2024-06-28 17:10:31,794][09403] Signal inference workers to resume experience collection... (9550 times) [2024-06-28 17:10:31,818][09423] InferenceWorker_p0-w0: stopping experience collection (9550 times) [2024-06-28 17:10:31,818][09423] InferenceWorker_p0-w0: resuming experience collection (9550 times) [2024-06-28 17:10:32,924][09190] Fps is (10 sec: 45863.0, 60 sec: 42596.6, 300 sec: 43153.4). Total num frames: 4416536576. Throughput: 0: 42584.3. Samples: 695380080. Policy #0 lag: (min: 0.0, avg: 9.9, max: 24.0) [2024-06-28 17:10:32,925][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 17:10:34,017][09423] Updated weights for policy 0, policy_version 269567 (0.0032) [2024-06-28 17:10:37,922][09190] Fps is (10 sec: 39321.2, 60 sec: 42325.3, 300 sec: 42931.6). Total num frames: 4416684032. Throughput: 0: 42659.4. Samples: 695640120. Policy #0 lag: (min: 0.0, avg: 9.9, max: 24.0) [2024-06-28 17:10:37,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:10:38,497][09423] Updated weights for policy 0, policy_version 269577 (0.0027) [2024-06-28 17:10:41,860][09423] Updated weights for policy 0, policy_version 269587 (0.0038) [2024-06-28 17:10:42,921][09190] Fps is (10 sec: 39331.9, 60 sec: 42598.5, 300 sec: 42987.2). Total num frames: 4416929792. Throughput: 0: 42625.0. Samples: 695755080. Policy #0 lag: (min: 0.0, avg: 9.9, max: 24.0) [2024-06-28 17:10:42,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 17:10:46,342][09423] Updated weights for policy 0, policy_version 269597 (0.0040) [2024-06-28 17:10:47,921][09190] Fps is (10 sec: 47514.4, 60 sec: 42598.5, 300 sec: 43042.7). Total num frames: 4417159168. Throughput: 0: 42648.9. Samples: 696017040. Policy #0 lag: (min: 0.0, avg: 9.9, max: 24.0) [2024-06-28 17:10:47,922][09190] Avg episode reward: [(0, '0.733')] [2024-06-28 17:10:49,720][09423] Updated weights for policy 0, policy_version 269607 (0.0037) [2024-06-28 17:10:52,921][09190] Fps is (10 sec: 40959.5, 60 sec: 42598.3, 300 sec: 42988.1). Total num frames: 4417339392. Throughput: 0: 42575.1. Samples: 696274620. Policy #0 lag: (min: 0.0, avg: 9.9, max: 24.0) [2024-06-28 17:10:52,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:10:53,622][09423] Updated weights for policy 0, policy_version 269617 (0.0032) [2024-06-28 17:10:57,517][09423] Updated weights for policy 0, policy_version 269627 (0.0030) [2024-06-28 17:10:57,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42325.4, 300 sec: 42987.2). Total num frames: 4417568768. Throughput: 0: 42565.8. Samples: 696392980. Policy #0 lag: (min: 0.0, avg: 9.9, max: 24.0) [2024-06-28 17:10:57,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:11:01,558][09423] Updated weights for policy 0, policy_version 269637 (0.0026) [2024-06-28 17:11:02,921][09190] Fps is (10 sec: 45875.1, 60 sec: 42325.3, 300 sec: 42987.2). Total num frames: 4417798144. Throughput: 0: 42439.0. Samples: 696647980. Policy #0 lag: (min: 0.0, avg: 9.9, max: 24.0) [2024-06-28 17:11:02,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 17:11:05,276][09423] Updated weights for policy 0, policy_version 269647 (0.0031) [2024-06-28 17:11:07,922][09190] Fps is (10 sec: 40959.2, 60 sec: 42598.3, 300 sec: 42876.1). Total num frames: 4417978368. Throughput: 0: 42629.1. Samples: 696911140. Policy #0 lag: (min: 0.0, avg: 9.9, max: 24.0) [2024-06-28 17:11:07,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:11:09,163][09423] Updated weights for policy 0, policy_version 269657 (0.0031) [2024-06-28 17:11:12,921][09190] Fps is (10 sec: 40960.3, 60 sec: 42325.4, 300 sec: 42987.2). Total num frames: 4418207744. Throughput: 0: 42669.3. Samples: 697031900. Policy #0 lag: (min: 0.0, avg: 9.9, max: 24.0) [2024-06-28 17:11:12,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 17:11:13,048][09423] Updated weights for policy 0, policy_version 269667 (0.0030) [2024-06-28 17:11:16,750][09423] Updated weights for policy 0, policy_version 269677 (0.0024) [2024-06-28 17:11:17,921][09190] Fps is (10 sec: 45875.8, 60 sec: 42598.4, 300 sec: 42931.6). Total num frames: 4418437120. Throughput: 0: 42506.4. Samples: 697292760. Policy #0 lag: (min: 0.0, avg: 9.9, max: 24.0) [2024-06-28 17:11:17,922][09190] Avg episode reward: [(0, '0.811')] [2024-06-28 17:11:17,933][09190] No heartbeat for components: RolloutWorker_w20 (216 seconds) [2024-06-28 17:11:20,862][09423] Updated weights for policy 0, policy_version 269687 (0.0030) [2024-06-28 17:11:22,921][09190] Fps is (10 sec: 40960.1, 60 sec: 42325.3, 300 sec: 42820.9). Total num frames: 4418617344. Throughput: 0: 42498.3. Samples: 697552540. Policy #0 lag: (min: 0.0, avg: 9.9, max: 24.0) [2024-06-28 17:11:22,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:11:24,231][09423] Updated weights for policy 0, policy_version 269697 (0.0041) [2024-06-28 17:11:27,922][09190] Fps is (10 sec: 40959.6, 60 sec: 42598.3, 300 sec: 42876.1). Total num frames: 4418846720. Throughput: 0: 42591.4. Samples: 697671700. Policy #0 lag: (min: 0.0, avg: 9.9, max: 24.0) [2024-06-28 17:11:27,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:11:28,521][09423] Updated weights for policy 0, policy_version 269707 (0.0033) [2024-06-28 17:11:31,997][09423] Updated weights for policy 0, policy_version 269717 (0.0040) [2024-06-28 17:11:32,921][09190] Fps is (10 sec: 45875.3, 60 sec: 42327.2, 300 sec: 42876.1). Total num frames: 4419076096. Throughput: 0: 42578.7. Samples: 697933080. Policy #0 lag: (min: 0.0, avg: 9.9, max: 24.0) [2024-06-28 17:11:32,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:11:36,479][09423] Updated weights for policy 0, policy_version 269727 (0.0026) [2024-06-28 17:11:37,923][09190] Fps is (10 sec: 42591.0, 60 sec: 43143.3, 300 sec: 42820.3). Total num frames: 4419272704. Throughput: 0: 42548.1. Samples: 698189360. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2024-06-28 17:11:37,924][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 17:11:39,817][09423] Updated weights for policy 0, policy_version 269737 (0.0040) [2024-06-28 17:11:42,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42598.4, 300 sec: 42820.6). Total num frames: 4419485696. Throughput: 0: 42591.1. Samples: 698309580. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2024-06-28 17:11:42,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:11:43,866][09423] Updated weights for policy 0, policy_version 269747 (0.0030) [2024-06-28 17:11:47,352][09423] Updated weights for policy 0, policy_version 269757 (0.0023) [2024-06-28 17:11:47,922][09190] Fps is (10 sec: 44244.6, 60 sec: 42598.3, 300 sec: 42820.7). Total num frames: 4419715072. Throughput: 0: 42913.3. Samples: 698579080. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2024-06-28 17:11:47,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:11:51,351][09423] Updated weights for policy 0, policy_version 269767 (0.0034) [2024-06-28 17:11:52,921][09190] Fps is (10 sec: 42598.2, 60 sec: 42871.5, 300 sec: 42820.6). Total num frames: 4419911680. Throughput: 0: 42718.8. Samples: 698833480. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2024-06-28 17:11:52,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:11:54,900][09423] Updated weights for policy 0, policy_version 269777 (0.0034) [2024-06-28 17:11:57,921][09190] Fps is (10 sec: 40960.3, 60 sec: 42598.3, 300 sec: 42765.0). Total num frames: 4420124672. Throughput: 0: 42719.5. Samples: 698954280. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2024-06-28 17:11:57,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 17:11:59,351][09423] Updated weights for policy 0, policy_version 269787 (0.0032) [2024-06-28 17:12:02,402][09423] Updated weights for policy 0, policy_version 269797 (0.0030) [2024-06-28 17:12:02,921][09190] Fps is (10 sec: 44236.7, 60 sec: 42598.5, 300 sec: 42765.0). Total num frames: 4420354048. Throughput: 0: 42722.2. Samples: 699215260. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2024-06-28 17:12:02,928][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 17:12:07,025][09423] Updated weights for policy 0, policy_version 269807 (0.0038) [2024-06-28 17:12:07,921][09190] Fps is (10 sec: 42598.6, 60 sec: 42871.6, 300 sec: 42709.8). Total num frames: 4420550656. Throughput: 0: 42839.5. Samples: 699480320. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2024-06-28 17:12:07,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:12:10,176][09423] Updated weights for policy 0, policy_version 269817 (0.0031) [2024-06-28 17:12:12,921][09190] Fps is (10 sec: 40960.4, 60 sec: 42598.5, 300 sec: 42709.5). Total num frames: 4420763648. Throughput: 0: 42860.7. Samples: 699600420. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2024-06-28 17:12:12,922][09190] Avg episode reward: [(0, '0.735')] [2024-06-28 17:12:14,546][09423] Updated weights for policy 0, policy_version 269827 (0.0050) [2024-06-28 17:12:15,800][09403] Signal inference workers to stop experience collection... (9600 times) [2024-06-28 17:12:15,800][09403] Signal inference workers to resume experience collection... (9600 times) [2024-06-28 17:12:15,814][09423] InferenceWorker_p0-w0: stopping experience collection (9600 times) [2024-06-28 17:12:15,814][09423] InferenceWorker_p0-w0: resuming experience collection (9600 times) [2024-06-28 17:12:17,919][09423] Updated weights for policy 0, policy_version 269837 (0.0034) [2024-06-28 17:12:17,922][09190] Fps is (10 sec: 45870.2, 60 sec: 42870.7, 300 sec: 42764.9). Total num frames: 4421009408. Throughput: 0: 42684.7. Samples: 699853940. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2024-06-28 17:12:17,923][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 17:12:17,929][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000269837_4421009408.pth... [2024-06-28 17:12:17,990][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000269212_4410769408.pth [2024-06-28 17:12:22,194][09423] Updated weights for policy 0, policy_version 269847 (0.0030) [2024-06-28 17:12:22,921][09190] Fps is (10 sec: 40959.4, 60 sec: 42598.3, 300 sec: 42598.4). Total num frames: 4421173248. Throughput: 0: 42781.7. Samples: 700114460. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2024-06-28 17:12:22,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 17:12:25,450][09423] Updated weights for policy 0, policy_version 269857 (0.0036) [2024-06-28 17:12:27,922][09190] Fps is (10 sec: 39325.1, 60 sec: 42598.4, 300 sec: 42654.3). Total num frames: 4421402624. Throughput: 0: 42783.3. Samples: 700234840. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2024-06-28 17:12:27,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 17:12:30,349][09423] Updated weights for policy 0, policy_version 269867 (0.0036) [2024-06-28 17:12:32,921][09190] Fps is (10 sec: 45875.1, 60 sec: 42598.3, 300 sec: 42653.9). Total num frames: 4421632000. Throughput: 0: 42544.0. Samples: 700493560. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2024-06-28 17:12:32,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:12:33,142][09423] Updated weights for policy 0, policy_version 269877 (0.0034) [2024-06-28 17:12:37,871][09423] Updated weights for policy 0, policy_version 269887 (0.0043) [2024-06-28 17:12:37,921][09190] Fps is (10 sec: 42599.3, 60 sec: 42599.7, 300 sec: 42542.9). Total num frames: 4421828608. Throughput: 0: 42737.3. Samples: 700756660. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2024-06-28 17:12:37,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 17:12:40,822][09423] Updated weights for policy 0, policy_version 269897 (0.0030) [2024-06-28 17:12:42,921][09190] Fps is (10 sec: 42598.7, 60 sec: 42871.4, 300 sec: 42653.9). Total num frames: 4422057984. Throughput: 0: 42744.0. Samples: 700877760. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2024-06-28 17:12:42,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 17:12:45,434][09423] Updated weights for policy 0, policy_version 269907 (0.0027) [2024-06-28 17:12:47,922][09190] Fps is (10 sec: 45874.7, 60 sec: 42871.5, 300 sec: 42709.5). Total num frames: 4422287360. Throughput: 0: 42787.5. Samples: 701140700. Policy #0 lag: (min: 0.0, avg: 8.1, max: 20.0) [2024-06-28 17:12:47,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 17:12:48,798][09423] Updated weights for policy 0, policy_version 269917 (0.0028) [2024-06-28 17:12:52,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42871.5, 300 sec: 42653.9). Total num frames: 4422483968. Throughput: 0: 42576.0. Samples: 701396240. Policy #0 lag: (min: 0.0, avg: 10.9, max: 20.0) [2024-06-28 17:12:52,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:12:52,923][09423] Updated weights for policy 0, policy_version 269927 (0.0031) [2024-06-28 17:12:56,240][09423] Updated weights for policy 0, policy_version 269937 (0.0032) [2024-06-28 17:12:57,922][09190] Fps is (10 sec: 39321.6, 60 sec: 42598.4, 300 sec: 42598.4). Total num frames: 4422680576. Throughput: 0: 42624.7. Samples: 701518540. Policy #0 lag: (min: 0.0, avg: 10.9, max: 20.0) [2024-06-28 17:12:57,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 17:13:00,718][09423] Updated weights for policy 0, policy_version 269947 (0.0035) [2024-06-28 17:13:02,921][09190] Fps is (10 sec: 45875.3, 60 sec: 43144.6, 300 sec: 42765.0). Total num frames: 4422942720. Throughput: 0: 42787.3. Samples: 701779320. Policy #0 lag: (min: 0.0, avg: 10.9, max: 20.0) [2024-06-28 17:13:02,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 17:13:03,738][09423] Updated weights for policy 0, policy_version 269957 (0.0034) [2024-06-28 17:13:07,921][09190] Fps is (10 sec: 44237.3, 60 sec: 42871.5, 300 sec: 42653.9). Total num frames: 4423122944. Throughput: 0: 42737.8. Samples: 702037660. Policy #0 lag: (min: 0.0, avg: 10.9, max: 20.0) [2024-06-28 17:13:07,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:13:08,457][09423] Updated weights for policy 0, policy_version 269967 (0.0033) [2024-06-28 17:13:11,948][09423] Updated weights for policy 0, policy_version 269977 (0.0043) [2024-06-28 17:13:12,921][09190] Fps is (10 sec: 40959.9, 60 sec: 43144.5, 300 sec: 42709.5). Total num frames: 4423352320. Throughput: 0: 42711.3. Samples: 702156840. Policy #0 lag: (min: 0.0, avg: 10.9, max: 20.0) [2024-06-28 17:13:12,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 17:13:16,213][09423] Updated weights for policy 0, policy_version 269987 (0.0033) [2024-06-28 17:13:17,922][09190] Fps is (10 sec: 44236.0, 60 sec: 42599.0, 300 sec: 42709.4). Total num frames: 4423565312. Throughput: 0: 42681.2. Samples: 702414220. Policy #0 lag: (min: 0.0, avg: 10.9, max: 20.0) [2024-06-28 17:13:17,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:13:19,497][09423] Updated weights for policy 0, policy_version 269997 (0.0030) [2024-06-28 17:13:22,922][09190] Fps is (10 sec: 39321.1, 60 sec: 42871.4, 300 sec: 42542.9). Total num frames: 4423745536. Throughput: 0: 42658.1. Samples: 702676280. Policy #0 lag: (min: 0.0, avg: 10.9, max: 20.0) [2024-06-28 17:13:22,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:13:23,582][09423] Updated weights for policy 0, policy_version 270007 (0.0029) [2024-06-28 17:13:27,477][09423] Updated weights for policy 0, policy_version 270017 (0.0036) [2024-06-28 17:13:27,921][09190] Fps is (10 sec: 39322.0, 60 sec: 42598.5, 300 sec: 42598.4). Total num frames: 4423958528. Throughput: 0: 42743.5. Samples: 702801220. Policy #0 lag: (min: 0.0, avg: 10.9, max: 20.0) [2024-06-28 17:13:27,922][09190] Avg episode reward: [(0, '0.738')] [2024-06-28 17:13:31,021][09423] Updated weights for policy 0, policy_version 270027 (0.0034) [2024-06-28 17:13:32,921][09190] Fps is (10 sec: 44237.6, 60 sec: 42598.5, 300 sec: 42653.9). Total num frames: 4424187904. Throughput: 0: 42498.0. Samples: 703053100. Policy #0 lag: (min: 0.0, avg: 10.9, max: 20.0) [2024-06-28 17:13:32,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 17:13:35,333][09423] Updated weights for policy 0, policy_version 270037 (0.0029) [2024-06-28 17:13:37,921][09190] Fps is (10 sec: 42598.8, 60 sec: 42598.4, 300 sec: 42542.9). Total num frames: 4424384512. Throughput: 0: 42607.1. Samples: 703313560. Policy #0 lag: (min: 0.0, avg: 10.9, max: 20.0) [2024-06-28 17:13:37,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 17:13:38,922][09423] Updated weights for policy 0, policy_version 270047 (0.0030) [2024-06-28 17:13:42,714][09423] Updated weights for policy 0, policy_version 270057 (0.0037) [2024-06-28 17:13:42,921][09190] Fps is (10 sec: 44236.4, 60 sec: 42871.5, 300 sec: 42653.9). Total num frames: 4424630272. Throughput: 0: 42565.4. Samples: 703433980. Policy #0 lag: (min: 0.0, avg: 10.9, max: 20.0) [2024-06-28 17:13:42,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 17:13:46,687][09423] Updated weights for policy 0, policy_version 270067 (0.0031) [2024-06-28 17:13:47,921][09190] Fps is (10 sec: 45874.7, 60 sec: 42598.4, 300 sec: 42765.0). Total num frames: 4424843264. Throughput: 0: 42507.0. Samples: 703692140. Policy #0 lag: (min: 0.0, avg: 10.9, max: 20.0) [2024-06-28 17:13:47,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 17:13:49,708][09403] Signal inference workers to stop experience collection... (9650 times) [2024-06-28 17:13:49,712][09403] Signal inference workers to resume experience collection... (9650 times) [2024-06-28 17:13:49,744][09423] InferenceWorker_p0-w0: stopping experience collection (9650 times) [2024-06-28 17:13:49,744][09423] InferenceWorker_p0-w0: resuming experience collection (9650 times) [2024-06-28 17:13:50,021][09423] Updated weights for policy 0, policy_version 270077 (0.0025) [2024-06-28 17:13:52,921][09190] Fps is (10 sec: 39321.3, 60 sec: 42325.3, 300 sec: 42542.9). Total num frames: 4425023488. Throughput: 0: 42543.9. Samples: 703952140. Policy #0 lag: (min: 0.0, avg: 10.9, max: 20.0) [2024-06-28 17:13:52,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:13:54,230][09423] Updated weights for policy 0, policy_version 270087 (0.0025) [2024-06-28 17:13:57,921][09190] Fps is (10 sec: 40960.3, 60 sec: 42871.5, 300 sec: 42653.9). Total num frames: 4425252864. Throughput: 0: 42630.2. Samples: 704075200. Policy #0 lag: (min: 0.0, avg: 10.9, max: 20.0) [2024-06-28 17:13:57,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:13:58,135][09423] Updated weights for policy 0, policy_version 270097 (0.0036) [2024-06-28 17:14:01,737][09423] Updated weights for policy 0, policy_version 270107 (0.0027) [2024-06-28 17:14:02,923][09190] Fps is (10 sec: 45869.8, 60 sec: 42324.4, 300 sec: 42709.3). Total num frames: 4425482240. Throughput: 0: 42497.2. Samples: 704326640. Policy #0 lag: (min: 1.0, avg: 8.3, max: 20.0) [2024-06-28 17:14:02,923][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 17:14:05,875][09423] Updated weights for policy 0, policy_version 270117 (0.0035) [2024-06-28 17:14:07,921][09190] Fps is (10 sec: 39321.6, 60 sec: 42052.3, 300 sec: 42542.9). Total num frames: 4425646080. Throughput: 0: 42409.9. Samples: 704584720. Policy #0 lag: (min: 1.0, avg: 8.3, max: 20.0) [2024-06-28 17:14:07,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:14:09,512][09423] Updated weights for policy 0, policy_version 270127 (0.0049) [2024-06-28 17:14:12,921][09190] Fps is (10 sec: 40964.9, 60 sec: 42325.3, 300 sec: 42598.4). Total num frames: 4425891840. Throughput: 0: 42464.9. Samples: 704712140. Policy #0 lag: (min: 1.0, avg: 8.3, max: 20.0) [2024-06-28 17:14:12,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 17:14:14,120][09423] Updated weights for policy 0, policy_version 270137 (0.0033) [2024-06-28 17:14:17,306][09423] Updated weights for policy 0, policy_version 270147 (0.0036) [2024-06-28 17:14:17,921][09190] Fps is (10 sec: 44236.7, 60 sec: 42052.4, 300 sec: 42709.5). Total num frames: 4426088448. Throughput: 0: 42425.2. Samples: 704962240. Policy #0 lag: (min: 1.0, avg: 8.3, max: 20.0) [2024-06-28 17:14:17,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 17:14:17,934][09190] No heartbeat for components: RolloutWorker_w20 (396 seconds) [2024-06-28 17:14:18,091][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000270148_4426104832.pth... [2024-06-28 17:14:18,145][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000269525_4415897600.pth [2024-06-28 17:14:21,884][09423] Updated weights for policy 0, policy_version 270157 (0.0036) [2024-06-28 17:14:22,921][09190] Fps is (10 sec: 40960.3, 60 sec: 42598.5, 300 sec: 42598.4). Total num frames: 4426301440. Throughput: 0: 42382.6. Samples: 705220780. Policy #0 lag: (min: 1.0, avg: 8.3, max: 20.0) [2024-06-28 17:14:22,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 17:14:25,257][09423] Updated weights for policy 0, policy_version 270167 (0.0035) [2024-06-28 17:14:27,921][09190] Fps is (10 sec: 45875.1, 60 sec: 43144.6, 300 sec: 42598.4). Total num frames: 4426547200. Throughput: 0: 42468.9. Samples: 705345080. Policy #0 lag: (min: 1.0, avg: 8.3, max: 20.0) [2024-06-28 17:14:27,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:14:29,301][09423] Updated weights for policy 0, policy_version 270177 (0.0038) [2024-06-28 17:14:32,719][09423] Updated weights for policy 0, policy_version 270187 (0.0035) [2024-06-28 17:14:32,921][09190] Fps is (10 sec: 44236.4, 60 sec: 42598.3, 300 sec: 42709.5). Total num frames: 4426743808. Throughput: 0: 42484.0. Samples: 705603920. Policy #0 lag: (min: 1.0, avg: 8.3, max: 20.0) [2024-06-28 17:14:32,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 17:14:36,964][09423] Updated weights for policy 0, policy_version 270197 (0.0032) [2024-06-28 17:14:37,922][09190] Fps is (10 sec: 39321.2, 60 sec: 42598.3, 300 sec: 42598.4). Total num frames: 4426940416. Throughput: 0: 42518.6. Samples: 705865480. Policy #0 lag: (min: 1.0, avg: 8.3, max: 20.0) [2024-06-28 17:14:37,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 17:14:40,430][09423] Updated weights for policy 0, policy_version 270207 (0.0027) [2024-06-28 17:14:42,922][09190] Fps is (10 sec: 42598.1, 60 sec: 42325.2, 300 sec: 42598.4). Total num frames: 4427169792. Throughput: 0: 42515.0. Samples: 705988380. Policy #0 lag: (min: 1.0, avg: 8.3, max: 20.0) [2024-06-28 17:14:42,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 17:14:44,534][09423] Updated weights for policy 0, policy_version 270217 (0.0024) [2024-06-28 17:14:47,922][09190] Fps is (10 sec: 44236.8, 60 sec: 42325.3, 300 sec: 42709.5). Total num frames: 4427382784. Throughput: 0: 42643.7. Samples: 706245560. Policy #0 lag: (min: 1.0, avg: 8.3, max: 20.0) [2024-06-28 17:14:47,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:14:48,077][09423] Updated weights for policy 0, policy_version 270227 (0.0042) [2024-06-28 17:14:52,495][09423] Updated weights for policy 0, policy_version 270237 (0.0039) [2024-06-28 17:14:52,921][09190] Fps is (10 sec: 40960.8, 60 sec: 42598.5, 300 sec: 42542.9). Total num frames: 4427579392. Throughput: 0: 42663.6. Samples: 706504580. Policy #0 lag: (min: 1.0, avg: 8.3, max: 20.0) [2024-06-28 17:14:52,922][09190] Avg episode reward: [(0, '0.721')] [2024-06-28 17:14:55,714][09423] Updated weights for policy 0, policy_version 270247 (0.0031) [2024-06-28 17:14:57,922][09190] Fps is (10 sec: 42598.5, 60 sec: 42598.3, 300 sec: 42542.8). Total num frames: 4427808768. Throughput: 0: 42519.9. Samples: 706625540. Policy #0 lag: (min: 1.0, avg: 8.3, max: 20.0) [2024-06-28 17:14:57,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 17:14:59,988][09423] Updated weights for policy 0, policy_version 270257 (0.0032) [2024-06-28 17:15:02,921][09190] Fps is (10 sec: 44236.6, 60 sec: 42326.2, 300 sec: 42709.5). Total num frames: 4428021760. Throughput: 0: 42808.0. Samples: 706888600. Policy #0 lag: (min: 1.0, avg: 8.3, max: 20.0) [2024-06-28 17:15:02,922][09190] Avg episode reward: [(0, '0.738')] [2024-06-28 17:15:03,107][09423] Updated weights for policy 0, policy_version 270267 (0.0027) [2024-06-28 17:15:07,871][09423] Updated weights for policy 0, policy_version 270277 (0.0032) [2024-06-28 17:15:07,921][09190] Fps is (10 sec: 40960.8, 60 sec: 42871.5, 300 sec: 42542.9). Total num frames: 4428218368. Throughput: 0: 42703.6. Samples: 707142440. Policy #0 lag: (min: 1.0, avg: 8.3, max: 20.0) [2024-06-28 17:15:07,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 17:15:11,052][09423] Updated weights for policy 0, policy_version 270287 (0.0027) [2024-06-28 17:15:12,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42598.4, 300 sec: 42598.4). Total num frames: 4428447744. Throughput: 0: 42719.6. Samples: 707267460. Policy #0 lag: (min: 0.0, avg: 11.7, max: 24.0) [2024-06-28 17:15:12,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 17:15:15,119][09403] Signal inference workers to stop experience collection... (9700 times) [2024-06-28 17:15:15,119][09403] Signal inference workers to resume experience collection... (9700 times) [2024-06-28 17:15:15,167][09423] InferenceWorker_p0-w0: stopping experience collection (9700 times) [2024-06-28 17:15:15,167][09423] InferenceWorker_p0-w0: resuming experience collection (9700 times) [2024-06-28 17:15:15,255][09423] Updated weights for policy 0, policy_version 270297 (0.0030) [2024-06-28 17:15:17,922][09190] Fps is (10 sec: 42597.6, 60 sec: 42598.3, 300 sec: 42598.4). Total num frames: 4428644352. Throughput: 0: 42594.2. Samples: 707520660. Policy #0 lag: (min: 0.0, avg: 11.7, max: 24.0) [2024-06-28 17:15:17,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 17:15:18,902][09423] Updated weights for policy 0, policy_version 270307 (0.0031) [2024-06-28 17:15:22,672][09423] Updated weights for policy 0, policy_version 270317 (0.0032) [2024-06-28 17:15:22,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42871.5, 300 sec: 42653.9). Total num frames: 4428873728. Throughput: 0: 42444.6. Samples: 707775480. Policy #0 lag: (min: 0.0, avg: 11.7, max: 24.0) [2024-06-28 17:15:22,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 17:15:26,317][09423] Updated weights for policy 0, policy_version 270327 (0.0030) [2024-06-28 17:15:27,922][09190] Fps is (10 sec: 44236.9, 60 sec: 42325.3, 300 sec: 42543.2). Total num frames: 4429086720. Throughput: 0: 42673.4. Samples: 707908680. Policy #0 lag: (min: 0.0, avg: 11.7, max: 24.0) [2024-06-28 17:15:27,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:15:30,640][09423] Updated weights for policy 0, policy_version 270337 (0.0028) [2024-06-28 17:15:32,921][09190] Fps is (10 sec: 42598.2, 60 sec: 42598.4, 300 sec: 42765.0). Total num frames: 4429299712. Throughput: 0: 42623.2. Samples: 708163600. Policy #0 lag: (min: 0.0, avg: 11.7, max: 24.0) [2024-06-28 17:15:32,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 17:15:33,802][09423] Updated weights for policy 0, policy_version 270347 (0.0036) [2024-06-28 17:15:37,921][09190] Fps is (10 sec: 40960.7, 60 sec: 42598.5, 300 sec: 42598.4). Total num frames: 4429496320. Throughput: 0: 42639.6. Samples: 708423360. Policy #0 lag: (min: 0.0, avg: 11.7, max: 24.0) [2024-06-28 17:15:37,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:15:38,211][09423] Updated weights for policy 0, policy_version 270357 (0.0035) [2024-06-28 17:15:41,449][09423] Updated weights for policy 0, policy_version 270367 (0.0029) [2024-06-28 17:15:42,924][09190] Fps is (10 sec: 42587.7, 60 sec: 42596.7, 300 sec: 42598.0). Total num frames: 4429725696. Throughput: 0: 42654.1. Samples: 708545080. Policy #0 lag: (min: 0.0, avg: 11.7, max: 24.0) [2024-06-28 17:15:42,925][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 17:15:46,145][09423] Updated weights for policy 0, policy_version 270377 (0.0032) [2024-06-28 17:15:47,922][09190] Fps is (10 sec: 45874.3, 60 sec: 42871.5, 300 sec: 42765.0). Total num frames: 4429955072. Throughput: 0: 42643.4. Samples: 708807560. Policy #0 lag: (min: 0.0, avg: 11.7, max: 24.0) [2024-06-28 17:15:47,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:15:49,247][09423] Updated weights for policy 0, policy_version 270387 (0.0027) [2024-06-28 17:15:52,921][09190] Fps is (10 sec: 40970.4, 60 sec: 42598.4, 300 sec: 42598.4). Total num frames: 4430135296. Throughput: 0: 42701.3. Samples: 709064000. Policy #0 lag: (min: 0.0, avg: 11.7, max: 24.0) [2024-06-28 17:15:52,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:15:53,882][09423] Updated weights for policy 0, policy_version 270397 (0.0032) [2024-06-28 17:15:57,481][09423] Updated weights for policy 0, policy_version 270407 (0.0036) [2024-06-28 17:15:57,922][09190] Fps is (10 sec: 40960.1, 60 sec: 42598.4, 300 sec: 42598.4). Total num frames: 4430364672. Throughput: 0: 42668.8. Samples: 709187560. Policy #0 lag: (min: 0.0, avg: 11.7, max: 24.0) [2024-06-28 17:15:57,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 17:16:01,492][09423] Updated weights for policy 0, policy_version 270417 (0.0030) [2024-06-28 17:16:02,922][09190] Fps is (10 sec: 45874.7, 60 sec: 42871.4, 300 sec: 42765.0). Total num frames: 4430594048. Throughput: 0: 42704.4. Samples: 709442360. Policy #0 lag: (min: 0.0, avg: 11.7, max: 24.0) [2024-06-28 17:16:02,923][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 17:16:05,062][09423] Updated weights for policy 0, policy_version 270427 (0.0032) [2024-06-28 17:16:07,921][09190] Fps is (10 sec: 40960.3, 60 sec: 42598.3, 300 sec: 42598.4). Total num frames: 4430774272. Throughput: 0: 42752.9. Samples: 709699360. Policy #0 lag: (min: 0.0, avg: 11.7, max: 24.0) [2024-06-28 17:16:07,930][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 17:16:09,086][09423] Updated weights for policy 0, policy_version 270437 (0.0044) [2024-06-28 17:16:12,796][09423] Updated weights for policy 0, policy_version 270447 (0.0042) [2024-06-28 17:16:12,922][09190] Fps is (10 sec: 40960.0, 60 sec: 42598.3, 300 sec: 42598.4). Total num frames: 4431003648. Throughput: 0: 42550.2. Samples: 709823440. Policy #0 lag: (min: 0.0, avg: 11.7, max: 24.0) [2024-06-28 17:16:12,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 17:16:16,793][09423] Updated weights for policy 0, policy_version 270457 (0.0034) [2024-06-28 17:16:17,922][09190] Fps is (10 sec: 44236.0, 60 sec: 42871.4, 300 sec: 42709.4). Total num frames: 4431216640. Throughput: 0: 42591.4. Samples: 710080220. Policy #0 lag: (min: 0.0, avg: 11.7, max: 24.0) [2024-06-28 17:16:17,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 17:16:18,037][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000270461_4431233024.pth... [2024-06-28 17:16:18,079][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000269837_4421009408.pth [2024-06-28 17:16:20,654][09423] Updated weights for policy 0, policy_version 270467 (0.0032) [2024-06-28 17:16:22,921][09190] Fps is (10 sec: 40960.8, 60 sec: 42325.4, 300 sec: 42598.4). Total num frames: 4431413248. Throughput: 0: 42559.1. Samples: 710338520. Policy #0 lag: (min: 0.0, avg: 11.7, max: 24.0) [2024-06-28 17:16:22,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 17:16:24,263][09423] Updated weights for policy 0, policy_version 270477 (0.0034) [2024-06-28 17:16:27,921][09190] Fps is (10 sec: 42599.2, 60 sec: 42598.5, 300 sec: 42598.4). Total num frames: 4431642624. Throughput: 0: 42483.7. Samples: 710456740. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 17:16:27,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:16:28,292][09423] Updated weights for policy 0, policy_version 270487 (0.0038) [2024-06-28 17:16:31,793][09423] Updated weights for policy 0, policy_version 270497 (0.0030) [2024-06-28 17:16:32,921][09190] Fps is (10 sec: 45874.9, 60 sec: 42871.5, 300 sec: 42709.7). Total num frames: 4431872000. Throughput: 0: 42421.9. Samples: 710716540. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 17:16:32,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 17:16:36,004][09423] Updated weights for policy 0, policy_version 270507 (0.0035) [2024-06-28 17:16:37,923][09190] Fps is (10 sec: 40952.5, 60 sec: 42597.1, 300 sec: 42598.1). Total num frames: 4432052224. Throughput: 0: 42420.5. Samples: 710973000. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 17:16:37,924][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 17:16:39,651][09423] Updated weights for policy 0, policy_version 270517 (0.0049) [2024-06-28 17:16:42,921][09190] Fps is (10 sec: 40960.1, 60 sec: 42600.2, 300 sec: 42598.4). Total num frames: 4432281600. Throughput: 0: 42526.4. Samples: 711101240. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 17:16:42,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:16:43,320][09423] Updated weights for policy 0, policy_version 270527 (0.0024) [2024-06-28 17:16:47,038][09403] Signal inference workers to stop experience collection... (9750 times) [2024-06-28 17:16:47,039][09403] Signal inference workers to resume experience collection... (9750 times) [2024-06-28 17:16:47,065][09423] InferenceWorker_p0-w0: stopping experience collection (9750 times) [2024-06-28 17:16:47,065][09423] InferenceWorker_p0-w0: resuming experience collection (9750 times) [2024-06-28 17:16:47,336][09423] Updated weights for policy 0, policy_version 270537 (0.0036) [2024-06-28 17:16:47,921][09190] Fps is (10 sec: 45883.2, 60 sec: 42598.4, 300 sec: 42709.5). Total num frames: 4432510976. Throughput: 0: 42523.1. Samples: 711355900. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 17:16:47,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 17:16:51,447][09423] Updated weights for policy 0, policy_version 270547 (0.0036) [2024-06-28 17:16:52,922][09190] Fps is (10 sec: 40959.4, 60 sec: 42598.3, 300 sec: 42598.4). Total num frames: 4432691200. Throughput: 0: 42414.1. Samples: 711608000. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 17:16:52,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:16:55,004][09423] Updated weights for policy 0, policy_version 270557 (0.0035) [2024-06-28 17:16:57,922][09190] Fps is (10 sec: 39321.4, 60 sec: 42325.3, 300 sec: 42542.8). Total num frames: 4432904192. Throughput: 0: 42532.0. Samples: 711737380. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 17:16:57,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 17:16:59,427][09423] Updated weights for policy 0, policy_version 270567 (0.0045) [2024-06-28 17:17:02,771][09423] Updated weights for policy 0, policy_version 270577 (0.0033) [2024-06-28 17:17:02,921][09190] Fps is (10 sec: 44237.5, 60 sec: 42325.4, 300 sec: 42653.9). Total num frames: 4433133568. Throughput: 0: 42602.4. Samples: 711997320. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 17:17:02,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 17:17:06,888][09423] Updated weights for policy 0, policy_version 270587 (0.0025) [2024-06-28 17:17:07,922][09190] Fps is (10 sec: 44236.8, 60 sec: 42871.4, 300 sec: 42653.9). Total num frames: 4433346560. Throughput: 0: 42589.1. Samples: 712255040. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 17:17:07,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 17:17:10,253][09423] Updated weights for policy 0, policy_version 270597 (0.0028) [2024-06-28 17:17:12,922][09190] Fps is (10 sec: 44236.3, 60 sec: 42871.5, 300 sec: 42598.5). Total num frames: 4433575936. Throughput: 0: 42744.4. Samples: 712380240. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 17:17:12,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:17:14,438][09423] Updated weights for policy 0, policy_version 270607 (0.0036) [2024-06-28 17:17:17,701][09423] Updated weights for policy 0, policy_version 270617 (0.0033) [2024-06-28 17:17:17,921][09190] Fps is (10 sec: 44237.5, 60 sec: 42871.6, 300 sec: 42765.0). Total num frames: 4433788928. Throughput: 0: 42615.6. Samples: 712634240. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 17:17:17,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:17:17,931][09190] No heartbeat for components: RolloutWorker_w20 (576 seconds) [2024-06-28 17:17:22,303][09423] Updated weights for policy 0, policy_version 270627 (0.0030) [2024-06-28 17:17:22,921][09190] Fps is (10 sec: 39321.7, 60 sec: 42598.3, 300 sec: 42598.4). Total num frames: 4433969152. Throughput: 0: 42635.9. Samples: 712891540. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 17:17:22,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 17:17:26,086][09423] Updated weights for policy 0, policy_version 270637 (0.0036) [2024-06-28 17:17:27,921][09190] Fps is (10 sec: 40959.7, 60 sec: 42598.4, 300 sec: 42598.4). Total num frames: 4434198528. Throughput: 0: 42609.3. Samples: 713018660. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 17:17:27,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:17:29,940][09423] Updated weights for policy 0, policy_version 270647 (0.0031) [2024-06-28 17:17:32,924][09190] Fps is (10 sec: 44226.1, 60 sec: 42323.6, 300 sec: 42653.6). Total num frames: 4434411520. Throughput: 0: 42612.4. Samples: 713273560. Policy #0 lag: (min: 0.0, avg: 9.9, max: 21.0) [2024-06-28 17:17:32,924][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 17:17:33,529][09423] Updated weights for policy 0, policy_version 270657 (0.0028) [2024-06-28 17:17:37,635][09423] Updated weights for policy 0, policy_version 270667 (0.0035) [2024-06-28 17:17:37,922][09190] Fps is (10 sec: 42597.6, 60 sec: 42872.6, 300 sec: 42598.4). Total num frames: 4434624512. Throughput: 0: 42743.0. Samples: 713531440. Policy #0 lag: (min: 1.0, avg: 10.6, max: 20.0) [2024-06-28 17:17:37,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 17:17:41,566][09423] Updated weights for policy 0, policy_version 270677 (0.0029) [2024-06-28 17:17:42,921][09190] Fps is (10 sec: 44247.4, 60 sec: 42871.4, 300 sec: 42598.4). Total num frames: 4434853888. Throughput: 0: 42677.8. Samples: 713657880. Policy #0 lag: (min: 1.0, avg: 10.6, max: 20.0) [2024-06-28 17:17:42,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:17:44,933][09423] Updated weights for policy 0, policy_version 270687 (0.0030) [2024-06-28 17:17:47,924][09190] Fps is (10 sec: 42588.5, 60 sec: 42323.6, 300 sec: 42598.0). Total num frames: 4435050496. Throughput: 0: 42682.9. Samples: 713918160. Policy #0 lag: (min: 1.0, avg: 10.6, max: 20.0) [2024-06-28 17:17:47,924][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 17:17:49,159][09423] Updated weights for policy 0, policy_version 270697 (0.0036) [2024-06-28 17:17:52,674][09423] Updated weights for policy 0, policy_version 270707 (0.0031) [2024-06-28 17:17:52,921][09190] Fps is (10 sec: 40960.4, 60 sec: 42871.6, 300 sec: 42654.0). Total num frames: 4435263488. Throughput: 0: 42600.1. Samples: 714172040. Policy #0 lag: (min: 1.0, avg: 10.6, max: 20.0) [2024-06-28 17:17:52,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:17:56,646][09423] Updated weights for policy 0, policy_version 270717 (0.0026) [2024-06-28 17:17:57,921][09190] Fps is (10 sec: 44247.8, 60 sec: 43144.6, 300 sec: 42542.9). Total num frames: 4435492864. Throughput: 0: 42687.6. Samples: 714301180. Policy #0 lag: (min: 1.0, avg: 10.6, max: 20.0) [2024-06-28 17:17:57,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 17:18:00,421][09423] Updated weights for policy 0, policy_version 270727 (0.0037) [2024-06-28 17:18:02,922][09190] Fps is (10 sec: 42597.8, 60 sec: 42598.3, 300 sec: 42598.4). Total num frames: 4435689472. Throughput: 0: 42629.2. Samples: 714552560. Policy #0 lag: (min: 1.0, avg: 10.6, max: 20.0) [2024-06-28 17:18:02,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 17:18:04,268][09423] Updated weights for policy 0, policy_version 270737 (0.0042) [2024-06-28 17:18:07,921][09190] Fps is (10 sec: 40960.4, 60 sec: 42598.5, 300 sec: 42542.9). Total num frames: 4435902464. Throughput: 0: 42650.3. Samples: 714810800. Policy #0 lag: (min: 1.0, avg: 10.6, max: 20.0) [2024-06-28 17:18:07,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 17:18:08,048][09423] Updated weights for policy 0, policy_version 270747 (0.0032) [2024-06-28 17:18:11,771][09423] Updated weights for policy 0, policy_version 270757 (0.0028) [2024-06-28 17:18:12,923][09190] Fps is (10 sec: 44229.8, 60 sec: 42597.3, 300 sec: 42598.2). Total num frames: 4436131840. Throughput: 0: 42744.2. Samples: 714942220. Policy #0 lag: (min: 1.0, avg: 10.6, max: 20.0) [2024-06-28 17:18:12,924][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 17:18:15,430][09423] Updated weights for policy 0, policy_version 270767 (0.0031) [2024-06-28 17:18:17,921][09190] Fps is (10 sec: 42598.1, 60 sec: 42325.3, 300 sec: 42654.0). Total num frames: 4436328448. Throughput: 0: 42834.3. Samples: 715201000. Policy #0 lag: (min: 1.0, avg: 10.6, max: 20.0) [2024-06-28 17:18:17,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 17:18:17,931][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000270772_4436328448.pth... [2024-06-28 17:18:17,991][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000270148_4426104832.pth [2024-06-28 17:18:19,227][09423] Updated weights for policy 0, policy_version 270777 (0.0032) [2024-06-28 17:18:22,329][09403] Signal inference workers to stop experience collection... (9800 times) [2024-06-28 17:18:22,329][09403] Signal inference workers to resume experience collection... (9800 times) [2024-06-28 17:18:22,367][09423] InferenceWorker_p0-w0: stopping experience collection (9800 times) [2024-06-28 17:18:22,367][09423] InferenceWorker_p0-w0: resuming experience collection (9800 times) [2024-06-28 17:18:22,921][09190] Fps is (10 sec: 40967.0, 60 sec: 42871.5, 300 sec: 42653.9). Total num frames: 4436541440. Throughput: 0: 42715.3. Samples: 715453620. Policy #0 lag: (min: 1.0, avg: 10.6, max: 20.0) [2024-06-28 17:18:22,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 17:18:23,894][09423] Updated weights for policy 0, policy_version 270787 (0.0040) [2024-06-28 17:18:26,971][09423] Updated weights for policy 0, policy_version 270797 (0.0031) [2024-06-28 17:18:27,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42598.4, 300 sec: 42598.4). Total num frames: 4436754432. Throughput: 0: 42723.2. Samples: 715580420. Policy #0 lag: (min: 1.0, avg: 10.6, max: 20.0) [2024-06-28 17:18:27,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:18:31,217][09423] Updated weights for policy 0, policy_version 270807 (0.0031) [2024-06-28 17:18:32,921][09190] Fps is (10 sec: 42598.7, 60 sec: 42600.2, 300 sec: 42653.9). Total num frames: 4436967424. Throughput: 0: 42641.2. Samples: 715836900. Policy #0 lag: (min: 1.0, avg: 10.6, max: 20.0) [2024-06-28 17:18:32,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 17:18:34,475][09423] Updated weights for policy 0, policy_version 270817 (0.0033) [2024-06-28 17:18:37,922][09190] Fps is (10 sec: 40959.5, 60 sec: 42325.4, 300 sec: 42487.3). Total num frames: 4437164032. Throughput: 0: 42631.4. Samples: 716090460. Policy #0 lag: (min: 1.0, avg: 10.6, max: 20.0) [2024-06-28 17:18:37,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 17:18:39,109][09423] Updated weights for policy 0, policy_version 270827 (0.0028) [2024-06-28 17:18:42,086][09423] Updated weights for policy 0, policy_version 270837 (0.0038) [2024-06-28 17:18:42,921][09190] Fps is (10 sec: 42598.0, 60 sec: 42325.4, 300 sec: 42542.9). Total num frames: 4437393408. Throughput: 0: 42669.4. Samples: 716221300. Policy #0 lag: (min: 1.0, avg: 10.6, max: 20.0) [2024-06-28 17:18:42,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 17:18:46,671][09423] Updated weights for policy 0, policy_version 270847 (0.0036) [2024-06-28 17:18:47,921][09190] Fps is (10 sec: 42599.1, 60 sec: 42327.2, 300 sec: 42598.4). Total num frames: 4437590016. Throughput: 0: 42701.1. Samples: 716474100. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-28 17:18:47,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 17:18:50,353][09423] Updated weights for policy 0, policy_version 270857 (0.0038) [2024-06-28 17:18:52,921][09190] Fps is (10 sec: 40960.1, 60 sec: 42325.3, 300 sec: 42542.9). Total num frames: 4437803008. Throughput: 0: 42622.6. Samples: 716728820. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-28 17:18:52,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 17:18:54,599][09423] Updated weights for policy 0, policy_version 270867 (0.0031) [2024-06-28 17:18:57,921][09190] Fps is (10 sec: 44236.5, 60 sec: 42325.4, 300 sec: 42543.0). Total num frames: 4438032384. Throughput: 0: 42590.9. Samples: 716858740. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-28 17:18:57,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 17:18:58,032][09423] Updated weights for policy 0, policy_version 270877 (0.0025) [2024-06-28 17:19:02,210][09423] Updated weights for policy 0, policy_version 270887 (0.0034) [2024-06-28 17:19:02,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42325.4, 300 sec: 42653.9). Total num frames: 4438228992. Throughput: 0: 42529.8. Samples: 717114840. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-28 17:19:02,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:19:05,642][09423] Updated weights for policy 0, policy_version 270897 (0.0030) [2024-06-28 17:19:07,922][09190] Fps is (10 sec: 42598.0, 60 sec: 42598.3, 300 sec: 42598.4). Total num frames: 4438458368. Throughput: 0: 42375.9. Samples: 717360540. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-28 17:19:07,931][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 17:19:10,129][09423] Updated weights for policy 0, policy_version 270907 (0.0034) [2024-06-28 17:19:12,921][09190] Fps is (10 sec: 44237.0, 60 sec: 42326.6, 300 sec: 42653.9). Total num frames: 4438671360. Throughput: 0: 42528.9. Samples: 717494220. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-28 17:19:12,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 17:19:13,476][09423] Updated weights for policy 0, policy_version 270917 (0.0027) [2024-06-28 17:19:17,913][09423] Updated weights for policy 0, policy_version 270927 (0.0025) [2024-06-28 17:19:17,922][09190] Fps is (10 sec: 40959.9, 60 sec: 42325.3, 300 sec: 42598.4). Total num frames: 4438867968. Throughput: 0: 42570.9. Samples: 717752600. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-28 17:19:17,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 17:19:21,019][09423] Updated weights for policy 0, policy_version 270937 (0.0030) [2024-06-28 17:19:22,922][09190] Fps is (10 sec: 42597.7, 60 sec: 42598.3, 300 sec: 42542.8). Total num frames: 4439097344. Throughput: 0: 42607.1. Samples: 718007780. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-28 17:19:22,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:19:25,254][09423] Updated weights for policy 0, policy_version 270947 (0.0029) [2024-06-28 17:19:27,921][09190] Fps is (10 sec: 45875.8, 60 sec: 42871.5, 300 sec: 42654.0). Total num frames: 4439326720. Throughput: 0: 42621.8. Samples: 718139280. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-28 17:19:27,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 17:19:28,523][09423] Updated weights for policy 0, policy_version 270957 (0.0029) [2024-06-28 17:19:32,921][09190] Fps is (10 sec: 40960.3, 60 sec: 42325.2, 300 sec: 42598.4). Total num frames: 4439506944. Throughput: 0: 42608.8. Samples: 718391500. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-28 17:19:32,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 17:19:32,986][09423] Updated weights for policy 0, policy_version 270967 (0.0032) [2024-06-28 17:19:36,372][09423] Updated weights for policy 0, policy_version 270977 (0.0035) [2024-06-28 17:19:37,924][09190] Fps is (10 sec: 40949.9, 60 sec: 42869.8, 300 sec: 42598.1). Total num frames: 4439736320. Throughput: 0: 42722.5. Samples: 718651440. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-28 17:19:37,924][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 17:19:40,774][09423] Updated weights for policy 0, policy_version 270987 (0.0034) [2024-06-28 17:19:41,200][09403] Signal inference workers to stop experience collection... (9850 times) [2024-06-28 17:19:41,201][09403] Signal inference workers to resume experience collection... (9850 times) [2024-06-28 17:19:41,234][09423] InferenceWorker_p0-w0: stopping experience collection (9850 times) [2024-06-28 17:19:41,234][09423] InferenceWorker_p0-w0: resuming experience collection (9850 times) [2024-06-28 17:19:42,926][09190] Fps is (10 sec: 45853.8, 60 sec: 42868.1, 300 sec: 42653.3). Total num frames: 4439965696. Throughput: 0: 42591.5. Samples: 718775560. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-28 17:19:42,927][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 17:19:44,475][09423] Updated weights for policy 0, policy_version 270997 (0.0029) [2024-06-28 17:19:47,921][09190] Fps is (10 sec: 40969.7, 60 sec: 42598.3, 300 sec: 42598.4). Total num frames: 4440145920. Throughput: 0: 42574.6. Samples: 719030700. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-28 17:19:47,922][09190] Avg episode reward: [(0, '0.735')] [2024-06-28 17:19:48,286][09423] Updated weights for policy 0, policy_version 271007 (0.0032) [2024-06-28 17:19:52,243][09423] Updated weights for policy 0, policy_version 271017 (0.0035) [2024-06-28 17:19:52,921][09190] Fps is (10 sec: 39340.3, 60 sec: 42598.4, 300 sec: 42542.9). Total num frames: 4440358912. Throughput: 0: 42799.2. Samples: 719286500. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-28 17:19:52,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:19:55,770][09423] Updated weights for policy 0, policy_version 271027 (0.0028) [2024-06-28 17:19:57,923][09190] Fps is (10 sec: 45866.0, 60 sec: 42870.0, 300 sec: 42653.6). Total num frames: 4440604672. Throughput: 0: 42571.3. Samples: 719410020. Policy #0 lag: (min: 0.0, avg: 9.3, max: 20.0) [2024-06-28 17:19:57,924][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:19:59,631][09423] Updated weights for policy 0, policy_version 271037 (0.0036) [2024-06-28 17:20:02,921][09190] Fps is (10 sec: 40960.1, 60 sec: 42325.4, 300 sec: 42542.9). Total num frames: 4440768512. Throughput: 0: 42636.6. Samples: 719671240. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-28 17:20:02,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:20:03,519][09423] Updated weights for policy 0, policy_version 271047 (0.0037) [2024-06-28 17:20:07,746][09423] Updated weights for policy 0, policy_version 271057 (0.0035) [2024-06-28 17:20:07,922][09190] Fps is (10 sec: 39329.4, 60 sec: 42325.3, 300 sec: 42542.8). Total num frames: 4440997888. Throughput: 0: 42549.8. Samples: 719922520. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-28 17:20:07,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 17:20:11,353][09423] Updated weights for policy 0, policy_version 271067 (0.0035) [2024-06-28 17:20:12,921][09190] Fps is (10 sec: 47513.0, 60 sec: 42871.4, 300 sec: 42709.5). Total num frames: 4441243648. Throughput: 0: 42373.7. Samples: 720046100. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-28 17:20:12,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:20:15,667][09423] Updated weights for policy 0, policy_version 271077 (0.0034) [2024-06-28 17:20:17,921][09190] Fps is (10 sec: 44237.6, 60 sec: 42871.6, 300 sec: 42598.4). Total num frames: 4441440256. Throughput: 0: 42636.1. Samples: 720310120. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-28 17:20:17,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 17:20:17,942][09190] No heartbeat for components: RolloutWorker_w20 (756 seconds) [2024-06-28 17:20:17,943][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000271084_4441440256.pth... [2024-06-28 17:20:18,003][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000270461_4431233024.pth [2024-06-28 17:20:18,853][09423] Updated weights for policy 0, policy_version 271087 (0.0023) [2024-06-28 17:20:22,921][09190] Fps is (10 sec: 39322.0, 60 sec: 42325.5, 300 sec: 42542.9). Total num frames: 4441636864. Throughput: 0: 42478.4. Samples: 720562860. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-28 17:20:22,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:20:22,974][09423] Updated weights for policy 0, policy_version 271097 (0.0030) [2024-06-28 17:20:26,379][09423] Updated weights for policy 0, policy_version 271107 (0.0033) [2024-06-28 17:20:27,922][09190] Fps is (10 sec: 44235.6, 60 sec: 42598.3, 300 sec: 42653.9). Total num frames: 4441882624. Throughput: 0: 42542.5. Samples: 720689780. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-28 17:20:27,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 17:20:30,811][09423] Updated weights for policy 0, policy_version 271117 (0.0030) [2024-06-28 17:20:32,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42598.5, 300 sec: 42598.4). Total num frames: 4442062848. Throughput: 0: 42492.6. Samples: 720942860. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-28 17:20:32,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:20:34,225][09423] Updated weights for policy 0, policy_version 271127 (0.0033) [2024-06-28 17:20:37,921][09190] Fps is (10 sec: 39322.6, 60 sec: 42327.1, 300 sec: 42543.2). Total num frames: 4442275840. Throughput: 0: 42476.0. Samples: 721197920. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-28 17:20:37,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 17:20:38,802][09423] Updated weights for policy 0, policy_version 271137 (0.0024) [2024-06-28 17:20:41,724][09423] Updated weights for policy 0, policy_version 271147 (0.0031) [2024-06-28 17:20:42,921][09190] Fps is (10 sec: 45875.3, 60 sec: 42601.8, 300 sec: 42598.4). Total num frames: 4442521600. Throughput: 0: 42488.7. Samples: 721321920. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-28 17:20:42,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 17:20:46,198][09423] Updated weights for policy 0, policy_version 271157 (0.0036) [2024-06-28 17:20:47,077][09403] Signal inference workers to stop experience collection... (9900 times) [2024-06-28 17:20:47,077][09403] Signal inference workers to resume experience collection... (9900 times) [2024-06-28 17:20:47,109][09423] InferenceWorker_p0-w0: stopping experience collection (9900 times) [2024-06-28 17:20:47,109][09423] InferenceWorker_p0-w0: resuming experience collection (9900 times) [2024-06-28 17:20:47,922][09190] Fps is (10 sec: 44236.0, 60 sec: 42871.4, 300 sec: 42653.9). Total num frames: 4442718208. Throughput: 0: 42491.4. Samples: 721583360. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-28 17:20:47,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:20:49,234][09423] Updated weights for policy 0, policy_version 271167 (0.0031) [2024-06-28 17:20:52,921][09190] Fps is (10 sec: 37683.3, 60 sec: 42325.4, 300 sec: 42487.3). Total num frames: 4442898432. Throughput: 0: 42514.4. Samples: 721835660. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-28 17:20:52,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 17:20:54,337][09423] Updated weights for policy 0, policy_version 271177 (0.0032) [2024-06-28 17:20:56,797][09423] Updated weights for policy 0, policy_version 271187 (0.0034) [2024-06-28 17:20:57,921][09190] Fps is (10 sec: 44237.1, 60 sec: 42599.8, 300 sec: 42598.4). Total num frames: 4443160576. Throughput: 0: 42524.0. Samples: 721959680. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-28 17:20:57,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 17:21:01,882][09423] Updated weights for policy 0, policy_version 271197 (0.0024) [2024-06-28 17:21:02,922][09190] Fps is (10 sec: 44236.0, 60 sec: 42871.4, 300 sec: 42598.4). Total num frames: 4443340800. Throughput: 0: 42558.1. Samples: 722225240. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-28 17:21:02,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 17:21:04,632][09423] Updated weights for policy 0, policy_version 271207 (0.0030) [2024-06-28 17:21:07,921][09190] Fps is (10 sec: 37683.5, 60 sec: 42325.4, 300 sec: 42487.3). Total num frames: 4443537408. Throughput: 0: 42555.1. Samples: 722477840. Policy #0 lag: (min: 0.0, avg: 11.5, max: 21.0) [2024-06-28 17:21:07,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:21:09,500][09423] Updated weights for policy 0, policy_version 271217 (0.0026) [2024-06-28 17:21:11,997][09423] Updated weights for policy 0, policy_version 271227 (0.0037) [2024-06-28 17:21:12,921][09190] Fps is (10 sec: 45875.7, 60 sec: 42598.5, 300 sec: 42654.0). Total num frames: 4443799552. Throughput: 0: 42450.4. Samples: 722600040. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2024-06-28 17:21:12,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 17:21:16,819][09423] Updated weights for policy 0, policy_version 271237 (0.0032) [2024-06-28 17:21:17,921][09190] Fps is (10 sec: 44236.4, 60 sec: 42325.2, 300 sec: 42598.4). Total num frames: 4443979776. Throughput: 0: 42650.1. Samples: 722862120. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2024-06-28 17:21:17,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 17:21:20,010][09423] Updated weights for policy 0, policy_version 271247 (0.0030) [2024-06-28 17:21:22,922][09190] Fps is (10 sec: 37682.7, 60 sec: 42325.2, 300 sec: 42487.3). Total num frames: 4444176384. Throughput: 0: 42719.4. Samples: 723120300. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2024-06-28 17:21:22,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:21:24,470][09423] Updated weights for policy 0, policy_version 271257 (0.0042) [2024-06-28 17:21:27,504][09423] Updated weights for policy 0, policy_version 271267 (0.0031) [2024-06-28 17:21:27,922][09190] Fps is (10 sec: 45875.0, 60 sec: 42598.4, 300 sec: 42598.4). Total num frames: 4444438528. Throughput: 0: 42516.3. Samples: 723235160. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2024-06-28 17:21:27,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:21:32,424][09423] Updated weights for policy 0, policy_version 271277 (0.0028) [2024-06-28 17:21:32,922][09190] Fps is (10 sec: 47513.7, 60 sec: 43144.5, 300 sec: 42709.7). Total num frames: 4444651520. Throughput: 0: 42535.1. Samples: 723497440. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2024-06-28 17:21:32,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 17:21:35,809][09423] Updated weights for policy 0, policy_version 271287 (0.0034) [2024-06-28 17:21:37,922][09190] Fps is (10 sec: 37683.2, 60 sec: 42325.2, 300 sec: 42487.3). Total num frames: 4444815360. Throughput: 0: 42666.5. Samples: 723755660. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2024-06-28 17:21:37,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:21:39,744][09423] Updated weights for policy 0, policy_version 271297 (0.0032) [2024-06-28 17:21:42,921][09190] Fps is (10 sec: 40960.6, 60 sec: 42325.3, 300 sec: 42542.9). Total num frames: 4445061120. Throughput: 0: 42561.4. Samples: 723874940. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2024-06-28 17:21:42,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:21:43,681][09423] Updated weights for policy 0, policy_version 271307 (0.0032) [2024-06-28 17:21:47,677][09423] Updated weights for policy 0, policy_version 271317 (0.0031) [2024-06-28 17:21:47,922][09190] Fps is (10 sec: 45875.3, 60 sec: 42598.4, 300 sec: 42653.9). Total num frames: 4445274112. Throughput: 0: 42501.8. Samples: 724137820. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2024-06-28 17:21:47,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:21:51,965][09423] Updated weights for policy 0, policy_version 271327 (0.0027) [2024-06-28 17:21:52,921][09190] Fps is (10 sec: 37683.0, 60 sec: 42325.3, 300 sec: 42487.3). Total num frames: 4445437952. Throughput: 0: 42687.6. Samples: 724398780. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2024-06-28 17:21:52,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:21:55,074][09423] Updated weights for policy 0, policy_version 271337 (0.0026) [2024-06-28 17:21:57,922][09190] Fps is (10 sec: 44236.8, 60 sec: 42598.4, 300 sec: 42653.9). Total num frames: 4445716480. Throughput: 0: 42635.9. Samples: 724518660. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2024-06-28 17:21:57,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 17:21:59,514][09423] Updated weights for policy 0, policy_version 271347 (0.0027) [2024-06-28 17:22:02,778][09423] Updated weights for policy 0, policy_version 271357 (0.0029) [2024-06-28 17:22:02,921][09190] Fps is (10 sec: 49152.4, 60 sec: 43144.7, 300 sec: 42654.0). Total num frames: 4445929472. Throughput: 0: 42542.8. Samples: 724776540. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2024-06-28 17:22:02,922][09190] Avg episode reward: [(0, '0.738')] [2024-06-28 17:22:07,098][09423] Updated weights for policy 0, policy_version 271367 (0.0033) [2024-06-28 17:22:07,921][09190] Fps is (10 sec: 37683.8, 60 sec: 42598.4, 300 sec: 42431.8). Total num frames: 4446093312. Throughput: 0: 42625.5. Samples: 725038440. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2024-06-28 17:22:07,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 17:22:10,244][09423] Updated weights for policy 0, policy_version 271377 (0.0034) [2024-06-28 17:22:12,922][09190] Fps is (10 sec: 40959.1, 60 sec: 42325.2, 300 sec: 42542.8). Total num frames: 4446339072. Throughput: 0: 42705.3. Samples: 725156900. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2024-06-28 17:22:12,922][09190] Avg episode reward: [(0, '0.738')] [2024-06-28 17:22:15,152][09423] Updated weights for policy 0, policy_version 271387 (0.0034) [2024-06-28 17:22:17,886][09403] Signal inference workers to stop experience collection... (9950 times) [2024-06-28 17:22:17,886][09403] Signal inference workers to resume experience collection... (9950 times) [2024-06-28 17:22:17,921][09190] Fps is (10 sec: 45875.2, 60 sec: 42871.6, 300 sec: 42654.0). Total num frames: 4446552064. Throughput: 0: 42570.4. Samples: 725413100. Policy #0 lag: (min: 0.0, avg: 9.7, max: 19.0) [2024-06-28 17:22:17,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 17:22:17,940][09423] InferenceWorker_p0-w0: stopping experience collection (9950 times) [2024-06-28 17:22:17,940][09423] InferenceWorker_p0-w0: resuming experience collection (9950 times) [2024-06-28 17:22:18,019][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000271397_4446568448.pth... [2024-06-28 17:22:18,026][09423] Updated weights for policy 0, policy_version 271397 (0.0035) [2024-06-28 17:22:18,059][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000270772_4436328448.pth [2024-06-28 17:22:22,728][09423] Updated weights for policy 0, policy_version 271407 (0.0035) [2024-06-28 17:22:22,921][09190] Fps is (10 sec: 39322.2, 60 sec: 42598.5, 300 sec: 42487.3). Total num frames: 4446732288. Throughput: 0: 42645.9. Samples: 725674720. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2024-06-28 17:22:22,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 17:22:25,823][09423] Updated weights for policy 0, policy_version 271417 (0.0036) [2024-06-28 17:22:27,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42325.4, 300 sec: 42598.8). Total num frames: 4446978048. Throughput: 0: 42635.1. Samples: 725793520. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2024-06-28 17:22:27,922][09190] Avg episode reward: [(0, '0.734')] [2024-06-28 17:22:30,268][09423] Updated weights for policy 0, policy_version 271427 (0.0033) [2024-06-28 17:22:32,921][09190] Fps is (10 sec: 45875.0, 60 sec: 42325.4, 300 sec: 42598.4). Total num frames: 4447191040. Throughput: 0: 42567.2. Samples: 726053340. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2024-06-28 17:22:32,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:22:33,390][09423] Updated weights for policy 0, policy_version 271437 (0.0032) [2024-06-28 17:22:37,921][09190] Fps is (10 sec: 39321.6, 60 sec: 42598.5, 300 sec: 42431.8). Total num frames: 4447371264. Throughput: 0: 42498.2. Samples: 726311200. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2024-06-28 17:22:37,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 17:22:38,138][09423] Updated weights for policy 0, policy_version 271447 (0.0026) [2024-06-28 17:22:41,075][09423] Updated weights for policy 0, policy_version 271457 (0.0035) [2024-06-28 17:22:42,921][09190] Fps is (10 sec: 42598.2, 60 sec: 42598.3, 300 sec: 42598.8). Total num frames: 4447617024. Throughput: 0: 42470.3. Samples: 726429820. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2024-06-28 17:22:42,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 17:22:45,573][09423] Updated weights for policy 0, policy_version 271467 (0.0032) [2024-06-28 17:22:47,921][09190] Fps is (10 sec: 47513.2, 60 sec: 42871.5, 300 sec: 42653.9). Total num frames: 4447846400. Throughput: 0: 42459.4. Samples: 726687220. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2024-06-28 17:22:47,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:22:48,501][09423] Updated weights for policy 0, policy_version 271477 (0.0039) [2024-06-28 17:22:52,921][09190] Fps is (10 sec: 39321.9, 60 sec: 42871.5, 300 sec: 42431.8). Total num frames: 4448010240. Throughput: 0: 42534.6. Samples: 726952500. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2024-06-28 17:22:52,922][09190] Avg episode reward: [(0, '0.722')] [2024-06-28 17:22:53,398][09423] Updated weights for policy 0, policy_version 271487 (0.0027) [2024-06-28 17:22:56,338][09423] Updated weights for policy 0, policy_version 271497 (0.0028) [2024-06-28 17:22:57,921][09190] Fps is (10 sec: 39321.9, 60 sec: 42052.4, 300 sec: 42542.9). Total num frames: 4448239616. Throughput: 0: 42430.4. Samples: 727066260. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2024-06-28 17:22:57,922][09190] Avg episode reward: [(0, '0.713')] [2024-06-28 17:23:00,884][09423] Updated weights for policy 0, policy_version 271507 (0.0033) [2024-06-28 17:23:02,921][09190] Fps is (10 sec: 47513.3, 60 sec: 42598.3, 300 sec: 42653.9). Total num frames: 4448485376. Throughput: 0: 42548.3. Samples: 727327780. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2024-06-28 17:23:02,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 17:23:04,189][09423] Updated weights for policy 0, policy_version 271517 (0.0026) [2024-06-28 17:23:07,921][09190] Fps is (10 sec: 42597.9, 60 sec: 42871.4, 300 sec: 42487.6). Total num frames: 4448665600. Throughput: 0: 42501.2. Samples: 727587280. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2024-06-28 17:23:07,922][09190] Avg episode reward: [(0, '0.734')] [2024-06-28 17:23:08,233][09423] Updated weights for policy 0, policy_version 271527 (0.0031) [2024-06-28 17:23:11,949][09423] Updated weights for policy 0, policy_version 271537 (0.0033) [2024-06-28 17:23:12,921][09190] Fps is (10 sec: 39322.1, 60 sec: 42325.5, 300 sec: 42542.9). Total num frames: 4448878592. Throughput: 0: 42475.1. Samples: 727704900. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2024-06-28 17:23:12,922][09190] Avg episode reward: [(0, '0.756')] [2024-06-28 17:23:16,085][09423] Updated weights for policy 0, policy_version 271547 (0.0030) [2024-06-28 17:23:17,921][09190] Fps is (10 sec: 45875.5, 60 sec: 42871.4, 300 sec: 42653.9). Total num frames: 4449124352. Throughput: 0: 42556.5. Samples: 727968380. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2024-06-28 17:23:17,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 17:23:17,932][09190] No heartbeat for components: RolloutWorker_w20 (936 seconds) [2024-06-28 17:23:20,048][09423] Updated weights for policy 0, policy_version 271557 (0.0036) [2024-06-28 17:23:22,921][09190] Fps is (10 sec: 40959.6, 60 sec: 42598.4, 300 sec: 42487.3). Total num frames: 4449288192. Throughput: 0: 42587.5. Samples: 728227640. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2024-06-28 17:23:22,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 17:23:23,780][09423] Updated weights for policy 0, policy_version 271567 (0.0034) [2024-06-28 17:23:27,465][09403] Signal inference workers to stop experience collection... (10000 times) [2024-06-28 17:23:27,501][09423] InferenceWorker_p0-w0: stopping experience collection (10000 times) [2024-06-28 17:23:27,520][09403] Signal inference workers to resume experience collection... (10000 times) [2024-06-28 17:23:27,520][09423] InferenceWorker_p0-w0: resuming experience collection (10000 times) [2024-06-28 17:23:27,522][09423] Updated weights for policy 0, policy_version 271577 (0.0028) [2024-06-28 17:23:27,921][09190] Fps is (10 sec: 40959.8, 60 sec: 42598.3, 300 sec: 42598.4). Total num frames: 4449533952. Throughput: 0: 42518.7. Samples: 728343160. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2024-06-28 17:23:27,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:23:31,322][09423] Updated weights for policy 0, policy_version 271587 (0.0031) [2024-06-28 17:23:32,921][09190] Fps is (10 sec: 47514.0, 60 sec: 42871.5, 300 sec: 42709.5). Total num frames: 4449763328. Throughput: 0: 42725.9. Samples: 728609880. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2024-06-28 17:23:32,923][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:23:35,155][09423] Updated weights for policy 0, policy_version 271597 (0.0024) [2024-06-28 17:23:37,922][09190] Fps is (10 sec: 42598.3, 60 sec: 43144.5, 300 sec: 42598.4). Total num frames: 4449959936. Throughput: 0: 42606.6. Samples: 728869800. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 17:23:37,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:23:38,916][09423] Updated weights for policy 0, policy_version 271607 (0.0033) [2024-06-28 17:23:42,646][09423] Updated weights for policy 0, policy_version 271617 (0.0040) [2024-06-28 17:23:42,921][09190] Fps is (10 sec: 40959.6, 60 sec: 42598.4, 300 sec: 42653.9). Total num frames: 4450172928. Throughput: 0: 42728.8. Samples: 728989060. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 17:23:42,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:23:47,033][09423] Updated weights for policy 0, policy_version 271627 (0.0032) [2024-06-28 17:23:47,921][09190] Fps is (10 sec: 42598.8, 60 sec: 42325.4, 300 sec: 42653.9). Total num frames: 4450385920. Throughput: 0: 42700.1. Samples: 729249280. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 17:23:47,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:23:50,294][09423] Updated weights for policy 0, policy_version 271637 (0.0029) [2024-06-28 17:23:52,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42871.4, 300 sec: 42542.8). Total num frames: 4450582528. Throughput: 0: 42470.7. Samples: 729498460. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 17:23:52,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 17:23:54,459][09423] Updated weights for policy 0, policy_version 271647 (0.0033) [2024-06-28 17:23:57,924][09190] Fps is (10 sec: 40949.7, 60 sec: 42596.6, 300 sec: 42598.0). Total num frames: 4450795520. Throughput: 0: 42606.0. Samples: 729622280. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 17:23:57,924][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 17:23:58,780][09423] Updated weights for policy 0, policy_version 271657 (0.0029) [2024-06-28 17:24:01,922][09423] Updated weights for policy 0, policy_version 271667 (0.0037) [2024-06-28 17:24:02,922][09190] Fps is (10 sec: 42598.2, 60 sec: 42052.2, 300 sec: 42542.9). Total num frames: 4451008512. Throughput: 0: 42510.6. Samples: 729881360. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 17:24:02,923][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 17:24:06,067][09423] Updated weights for policy 0, policy_version 271677 (0.0042) [2024-06-28 17:24:07,922][09190] Fps is (10 sec: 42608.5, 60 sec: 42598.4, 300 sec: 42542.8). Total num frames: 4451221504. Throughput: 0: 42644.4. Samples: 730146640. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 17:24:07,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:24:09,595][09423] Updated weights for policy 0, policy_version 271687 (0.0028) [2024-06-28 17:24:12,921][09190] Fps is (10 sec: 44237.6, 60 sec: 42871.5, 300 sec: 42654.0). Total num frames: 4451450880. Throughput: 0: 42708.1. Samples: 730265020. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 17:24:12,922][09190] Avg episode reward: [(0, '0.735')] [2024-06-28 17:24:13,998][09423] Updated weights for policy 0, policy_version 271697 (0.0033) [2024-06-28 17:24:17,356][09423] Updated weights for policy 0, policy_version 271707 (0.0037) [2024-06-28 17:24:17,921][09190] Fps is (10 sec: 45876.0, 60 sec: 42598.4, 300 sec: 42654.0). Total num frames: 4451680256. Throughput: 0: 42572.4. Samples: 730525640. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 17:24:17,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:24:17,935][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000271709_4451680256.pth... [2024-06-28 17:24:17,981][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000271084_4441440256.pth [2024-06-28 17:24:21,691][09423] Updated weights for policy 0, policy_version 271717 (0.0036) [2024-06-28 17:24:22,921][09190] Fps is (10 sec: 42598.4, 60 sec: 43144.6, 300 sec: 42542.9). Total num frames: 4451876864. Throughput: 0: 42448.6. Samples: 730779980. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 17:24:22,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 17:24:24,842][09423] Updated weights for policy 0, policy_version 271727 (0.0034) [2024-06-28 17:24:27,924][09190] Fps is (10 sec: 39311.6, 60 sec: 42323.6, 300 sec: 42598.0). Total num frames: 4452073472. Throughput: 0: 42571.4. Samples: 730904880. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 17:24:27,925][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:24:29,437][09423] Updated weights for policy 0, policy_version 271737 (0.0036) [2024-06-28 17:24:32,877][09423] Updated weights for policy 0, policy_version 271747 (0.0034) [2024-06-28 17:24:32,921][09190] Fps is (10 sec: 42598.0, 60 sec: 42325.3, 300 sec: 42598.8). Total num frames: 4452302848. Throughput: 0: 42449.7. Samples: 731159520. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 17:24:32,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:24:36,865][09423] Updated weights for policy 0, policy_version 271757 (0.0032) [2024-06-28 17:24:37,924][09190] Fps is (10 sec: 42598.1, 60 sec: 42323.6, 300 sec: 42487.6). Total num frames: 4452499456. Throughput: 0: 42604.3. Samples: 731415760. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 17:24:37,924][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:24:40,321][09423] Updated weights for policy 0, policy_version 271767 (0.0029) [2024-06-28 17:24:42,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42325.4, 300 sec: 42598.4). Total num frames: 4452712448. Throughput: 0: 42580.1. Samples: 731538280. Policy #0 lag: (min: 0.0, avg: 11.4, max: 21.0) [2024-06-28 17:24:42,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 17:24:44,711][09423] Updated weights for policy 0, policy_version 271777 (0.0031) [2024-06-28 17:24:47,921][09190] Fps is (10 sec: 44248.5, 60 sec: 42598.4, 300 sec: 42653.9). Total num frames: 4452941824. Throughput: 0: 42599.3. Samples: 731798320. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 17:24:47,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 17:24:48,020][09423] Updated weights for policy 0, policy_version 271787 (0.0037) [2024-06-28 17:24:52,436][09423] Updated weights for policy 0, policy_version 271797 (0.0040) [2024-06-28 17:24:52,921][09190] Fps is (10 sec: 40960.6, 60 sec: 42325.5, 300 sec: 42432.1). Total num frames: 4453122048. Throughput: 0: 42388.3. Samples: 732054100. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 17:24:52,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 17:24:54,307][09403] Signal inference workers to stop experience collection... (10050 times) [2024-06-28 17:24:54,307][09403] Signal inference workers to resume experience collection... (10050 times) [2024-06-28 17:24:54,333][09423] InferenceWorker_p0-w0: stopping experience collection (10050 times) [2024-06-28 17:24:54,334][09423] InferenceWorker_p0-w0: resuming experience collection (10050 times) [2024-06-28 17:24:55,482][09423] Updated weights for policy 0, policy_version 271807 (0.0026) [2024-06-28 17:24:57,921][09190] Fps is (10 sec: 40959.4, 60 sec: 42600.1, 300 sec: 42653.9). Total num frames: 4453351424. Throughput: 0: 42455.0. Samples: 732175500. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 17:24:57,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 17:25:00,099][09423] Updated weights for policy 0, policy_version 271817 (0.0032) [2024-06-28 17:25:02,922][09190] Fps is (10 sec: 45874.1, 60 sec: 42871.5, 300 sec: 42653.9). Total num frames: 4453580800. Throughput: 0: 42483.4. Samples: 732437400. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 17:25:02,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 17:25:03,508][09423] Updated weights for policy 0, policy_version 271827 (0.0034) [2024-06-28 17:25:07,640][09423] Updated weights for policy 0, policy_version 271837 (0.0040) [2024-06-28 17:25:07,921][09190] Fps is (10 sec: 42599.1, 60 sec: 42598.6, 300 sec: 42487.3). Total num frames: 4453777408. Throughput: 0: 42419.6. Samples: 732688860. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 17:25:07,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:25:11,285][09423] Updated weights for policy 0, policy_version 271847 (0.0032) [2024-06-28 17:25:12,921][09190] Fps is (10 sec: 40960.6, 60 sec: 42325.3, 300 sec: 42542.9). Total num frames: 4453990400. Throughput: 0: 42458.9. Samples: 732815420. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 17:25:12,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:25:15,376][09423] Updated weights for policy 0, policy_version 271857 (0.0039) [2024-06-28 17:25:17,924][09190] Fps is (10 sec: 42587.4, 60 sec: 42050.5, 300 sec: 42598.0). Total num frames: 4454203392. Throughput: 0: 42565.6. Samples: 733075080. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 17:25:17,924][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 17:25:18,861][09423] Updated weights for policy 0, policy_version 271867 (0.0027) [2024-06-28 17:25:22,921][09190] Fps is (10 sec: 40959.5, 60 sec: 42052.2, 300 sec: 42431.8). Total num frames: 4454400000. Throughput: 0: 42598.8. Samples: 733332600. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 17:25:22,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:25:23,195][09423] Updated weights for policy 0, policy_version 271877 (0.0028) [2024-06-28 17:25:26,755][09423] Updated weights for policy 0, policy_version 271887 (0.0029) [2024-06-28 17:25:27,921][09190] Fps is (10 sec: 42608.8, 60 sec: 42600.1, 300 sec: 42598.4). Total num frames: 4454629376. Throughput: 0: 42603.5. Samples: 733455440. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 17:25:27,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:25:30,850][09423] Updated weights for policy 0, policy_version 271897 (0.0028) [2024-06-28 17:25:32,921][09190] Fps is (10 sec: 45875.3, 60 sec: 42598.4, 300 sec: 42653.9). Total num frames: 4454858752. Throughput: 0: 42563.0. Samples: 733713660. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 17:25:32,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 17:25:34,059][09423] Updated weights for policy 0, policy_version 271907 (0.0026) [2024-06-28 17:25:37,921][09190] Fps is (10 sec: 42598.7, 60 sec: 42600.2, 300 sec: 42487.3). Total num frames: 4455055360. Throughput: 0: 42602.1. Samples: 733971200. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 17:25:37,936][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 17:25:38,704][09423] Updated weights for policy 0, policy_version 271917 (0.0025) [2024-06-28 17:25:42,827][09423] Updated weights for policy 0, policy_version 271927 (0.0029) [2024-06-28 17:25:42,921][09190] Fps is (10 sec: 39322.1, 60 sec: 42325.4, 300 sec: 42487.3). Total num frames: 4455251968. Throughput: 0: 42584.1. Samples: 734091780. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 17:25:42,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:25:46,483][09423] Updated weights for policy 0, policy_version 271937 (0.0030) [2024-06-28 17:25:47,922][09190] Fps is (10 sec: 42597.8, 60 sec: 42325.2, 300 sec: 42653.9). Total num frames: 4455481344. Throughput: 0: 42496.9. Samples: 734349760. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 17:25:47,922][09190] Avg episode reward: [(0, '0.735')] [2024-06-28 17:25:50,054][09423] Updated weights for policy 0, policy_version 271947 (0.0030) [2024-06-28 17:25:52,921][09190] Fps is (10 sec: 44236.4, 60 sec: 42871.4, 300 sec: 42487.3). Total num frames: 4455694336. Throughput: 0: 42626.6. Samples: 734607060. Policy #0 lag: (min: 0.0, avg: 8.8, max: 22.0) [2024-06-28 17:25:52,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 17:25:54,315][09423] Updated weights for policy 0, policy_version 271957 (0.0028) [2024-06-28 17:25:57,622][09423] Updated weights for policy 0, policy_version 271967 (0.0035) [2024-06-28 17:25:57,921][09190] Fps is (10 sec: 42599.1, 60 sec: 42598.5, 300 sec: 42598.4). Total num frames: 4455907328. Throughput: 0: 42599.5. Samples: 734732400. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 17:25:57,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 17:26:01,709][09423] Updated weights for policy 0, policy_version 271977 (0.0036) [2024-06-28 17:26:02,924][09190] Fps is (10 sec: 42587.8, 60 sec: 42323.6, 300 sec: 42653.6). Total num frames: 4456120320. Throughput: 0: 42633.8. Samples: 734993600. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 17:26:02,925][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 17:26:05,439][09423] Updated weights for policy 0, policy_version 271987 (0.0027) [2024-06-28 17:26:07,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42325.3, 300 sec: 42431.8). Total num frames: 4456316928. Throughput: 0: 42605.0. Samples: 735249820. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 17:26:07,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 17:26:09,229][09423] Updated weights for policy 0, policy_version 271997 (0.0030) [2024-06-28 17:26:12,882][09423] Updated weights for policy 0, policy_version 272007 (0.0037) [2024-06-28 17:26:12,921][09190] Fps is (10 sec: 44247.7, 60 sec: 42871.4, 300 sec: 42653.9). Total num frames: 4456562688. Throughput: 0: 42609.8. Samples: 735372880. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 17:26:12,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 17:26:16,932][09423] Updated weights for policy 0, policy_version 272017 (0.0032) [2024-06-28 17:26:17,921][09190] Fps is (10 sec: 45875.4, 60 sec: 42873.3, 300 sec: 42709.5). Total num frames: 4456775680. Throughput: 0: 42643.2. Samples: 735632600. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 17:26:17,922][09190] Avg episode reward: [(0, '0.738')] [2024-06-28 17:26:17,927][09190] No heartbeat for components: RolloutWorker_w20 (1116 seconds) [2024-06-28 17:26:17,997][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000272021_4456792064.pth... [2024-06-28 17:26:18,046][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000271397_4446568448.pth [2024-06-28 17:26:20,533][09423] Updated weights for policy 0, policy_version 272027 (0.0027) [2024-06-28 17:26:22,921][09190] Fps is (10 sec: 40960.5, 60 sec: 42871.6, 300 sec: 42487.3). Total num frames: 4456972288. Throughput: 0: 42549.4. Samples: 735885920. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 17:26:22,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 17:26:24,900][09423] Updated weights for policy 0, policy_version 272037 (0.0033) [2024-06-28 17:26:27,899][09403] Signal inference workers to stop experience collection... (10100 times) [2024-06-28 17:26:27,899][09403] Signal inference workers to resume experience collection... (10100 times) [2024-06-28 17:26:27,921][09190] Fps is (10 sec: 42598.7, 60 sec: 42871.6, 300 sec: 42542.9). Total num frames: 4457201664. Throughput: 0: 42603.2. Samples: 736008920. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 17:26:27,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 17:26:27,939][09423] InferenceWorker_p0-w0: stopping experience collection (10100 times) [2024-06-28 17:26:27,939][09423] InferenceWorker_p0-w0: resuming experience collection (10100 times) [2024-06-28 17:26:28,039][09423] Updated weights for policy 0, policy_version 272047 (0.0024) [2024-06-28 17:26:32,541][09423] Updated weights for policy 0, policy_version 272057 (0.0032) [2024-06-28 17:26:32,921][09190] Fps is (10 sec: 42597.8, 60 sec: 42325.3, 300 sec: 42653.9). Total num frames: 4457398272. Throughput: 0: 42839.2. Samples: 736277520. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 17:26:32,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 17:26:35,860][09423] Updated weights for policy 0, policy_version 272067 (0.0023) [2024-06-28 17:26:37,922][09190] Fps is (10 sec: 39320.7, 60 sec: 42325.3, 300 sec: 42487.3). Total num frames: 4457594880. Throughput: 0: 42825.7. Samples: 736534220. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 17:26:37,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:26:39,759][09423] Updated weights for policy 0, policy_version 272077 (0.0028) [2024-06-28 17:26:42,921][09190] Fps is (10 sec: 44237.2, 60 sec: 43144.5, 300 sec: 42598.4). Total num frames: 4457840640. Throughput: 0: 42812.9. Samples: 736658980. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 17:26:42,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 17:26:43,218][09423] Updated weights for policy 0, policy_version 272087 (0.0033) [2024-06-28 17:26:47,320][09423] Updated weights for policy 0, policy_version 272097 (0.0029) [2024-06-28 17:26:47,924][09190] Fps is (10 sec: 45863.7, 60 sec: 42869.7, 300 sec: 42764.6). Total num frames: 4458053632. Throughput: 0: 42821.2. Samples: 736920560. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 17:26:47,925][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 17:26:51,301][09423] Updated weights for policy 0, policy_version 272107 (0.0032) [2024-06-28 17:26:52,924][09190] Fps is (10 sec: 40949.7, 60 sec: 42596.6, 300 sec: 42487.0). Total num frames: 4458250240. Throughput: 0: 42700.3. Samples: 737171440. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 17:26:52,924][09190] Avg episode reward: [(0, '0.735')] [2024-06-28 17:26:54,987][09423] Updated weights for policy 0, policy_version 272117 (0.0033) [2024-06-28 17:26:57,921][09190] Fps is (10 sec: 42609.5, 60 sec: 42871.4, 300 sec: 42542.8). Total num frames: 4458479616. Throughput: 0: 42793.4. Samples: 737298580. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 17:26:57,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 17:26:59,078][09423] Updated weights for policy 0, policy_version 272127 (0.0037) [2024-06-28 17:27:02,743][09423] Updated weights for policy 0, policy_version 272137 (0.0030) [2024-06-28 17:27:02,921][09190] Fps is (10 sec: 44247.9, 60 sec: 42873.3, 300 sec: 42709.5). Total num frames: 4458692608. Throughput: 0: 42717.3. Samples: 737554880. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 17:27:02,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 17:27:06,617][09423] Updated weights for policy 0, policy_version 272147 (0.0039) [2024-06-28 17:27:07,921][09190] Fps is (10 sec: 39322.0, 60 sec: 42598.5, 300 sec: 42487.4). Total num frames: 4458872832. Throughput: 0: 42814.7. Samples: 737812580. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 17:27:07,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 17:27:10,765][09423] Updated weights for policy 0, policy_version 272157 (0.0042) [2024-06-28 17:27:12,924][09190] Fps is (10 sec: 42587.7, 60 sec: 42596.7, 300 sec: 42598.0). Total num frames: 4459118592. Throughput: 0: 42888.2. Samples: 737939000. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 17:27:12,925][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:27:14,016][09423] Updated weights for policy 0, policy_version 272167 (0.0025) [2024-06-28 17:27:17,922][09190] Fps is (10 sec: 44235.9, 60 sec: 42325.2, 300 sec: 42653.9). Total num frames: 4459315200. Throughput: 0: 42578.2. Samples: 738193540. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 17:27:17,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 17:27:18,298][09423] Updated weights for policy 0, policy_version 272177 (0.0031) [2024-06-28 17:27:22,406][09423] Updated weights for policy 0, policy_version 272187 (0.0028) [2024-06-28 17:27:22,921][09190] Fps is (10 sec: 39331.2, 60 sec: 42325.2, 300 sec: 42487.3). Total num frames: 4459511808. Throughput: 0: 42702.3. Samples: 738455820. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 17:27:22,922][09190] Avg episode reward: [(0, '0.734')] [2024-06-28 17:27:25,637][09423] Updated weights for policy 0, policy_version 272197 (0.0030) [2024-06-28 17:27:27,922][09190] Fps is (10 sec: 44236.8, 60 sec: 42598.2, 300 sec: 42598.4). Total num frames: 4459757568. Throughput: 0: 42679.0. Samples: 738579540. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 17:27:27,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 17:27:30,111][09423] Updated weights for policy 0, policy_version 272207 (0.0025) [2024-06-28 17:27:32,921][09190] Fps is (10 sec: 45875.3, 60 sec: 42871.5, 300 sec: 42709.5). Total num frames: 4459970560. Throughput: 0: 42592.7. Samples: 738837120. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 17:27:32,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 17:27:33,356][09423] Updated weights for policy 0, policy_version 272217 (0.0033) [2024-06-28 17:27:37,662][09423] Updated weights for policy 0, policy_version 272227 (0.0036) [2024-06-28 17:27:37,921][09190] Fps is (10 sec: 40960.8, 60 sec: 42871.6, 300 sec: 42542.9). Total num frames: 4460167168. Throughput: 0: 42577.6. Samples: 739087320. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 17:27:37,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 17:27:40,955][09423] Updated weights for policy 0, policy_version 272237 (0.0029) [2024-06-28 17:27:42,921][09190] Fps is (10 sec: 40960.5, 60 sec: 42325.4, 300 sec: 42487.3). Total num frames: 4460380160. Throughput: 0: 42508.1. Samples: 739211440. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 17:27:42,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:27:45,473][09423] Updated weights for policy 0, policy_version 272247 (0.0031) [2024-06-28 17:27:47,921][09190] Fps is (10 sec: 42598.1, 60 sec: 42327.2, 300 sec: 42653.9). Total num frames: 4460593152. Throughput: 0: 42620.5. Samples: 739472800. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 17:27:47,922][09190] Avg episode reward: [(0, '0.735')] [2024-06-28 17:27:48,722][09423] Updated weights for policy 0, policy_version 272257 (0.0040) [2024-06-28 17:27:52,921][09190] Fps is (10 sec: 40959.4, 60 sec: 42327.0, 300 sec: 42542.8). Total num frames: 4460789760. Throughput: 0: 42475.8. Samples: 739724000. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 17:27:52,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:27:53,276][09423] Updated weights for policy 0, policy_version 272267 (0.0038) [2024-06-28 17:27:56,285][09423] Updated weights for policy 0, policy_version 272277 (0.0026) [2024-06-28 17:27:57,921][09190] Fps is (10 sec: 44236.8, 60 sec: 42598.4, 300 sec: 42542.9). Total num frames: 4461035520. Throughput: 0: 42553.9. Samples: 739853820. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 17:27:57,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:28:00,664][09423] Updated weights for policy 0, policy_version 272287 (0.0036) [2024-06-28 17:28:02,921][09190] Fps is (10 sec: 45875.9, 60 sec: 42598.5, 300 sec: 42654.0). Total num frames: 4461248512. Throughput: 0: 42732.6. Samples: 740116500. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 17:28:02,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 17:28:03,768][09423] Updated weights for policy 0, policy_version 272297 (0.0032) [2024-06-28 17:28:07,924][09190] Fps is (10 sec: 40949.6, 60 sec: 42869.6, 300 sec: 42598.0). Total num frames: 4461445120. Throughput: 0: 42477.7. Samples: 740367420. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 17:28:07,925][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:28:07,927][09403] Signal inference workers to stop experience collection... (10150 times) [2024-06-28 17:28:07,970][09423] InferenceWorker_p0-w0: stopping experience collection (10150 times) [2024-06-28 17:28:07,977][09403] Signal inference workers to resume experience collection... (10150 times) [2024-06-28 17:28:07,992][09423] InferenceWorker_p0-w0: resuming experience collection (10150 times) [2024-06-28 17:28:08,115][09423] Updated weights for policy 0, policy_version 272307 (0.0027) [2024-06-28 17:28:11,662][09423] Updated weights for policy 0, policy_version 272317 (0.0034) [2024-06-28 17:28:12,921][09190] Fps is (10 sec: 40959.4, 60 sec: 42327.1, 300 sec: 42487.3). Total num frames: 4461658112. Throughput: 0: 42510.7. Samples: 740492520. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 17:28:12,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 17:28:16,555][09423] Updated weights for policy 0, policy_version 272327 (0.0026) [2024-06-28 17:28:17,921][09190] Fps is (10 sec: 44247.8, 60 sec: 42871.5, 300 sec: 42709.5). Total num frames: 4461887488. Throughput: 0: 42586.7. Samples: 740753520. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 17:28:17,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:28:17,935][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000272332_4461887488.pth... [2024-06-28 17:28:18,000][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000271709_4451680256.pth [2024-06-28 17:28:19,346][09423] Updated weights for policy 0, policy_version 272337 (0.0029) [2024-06-28 17:28:22,924][09190] Fps is (10 sec: 44226.0, 60 sec: 43142.8, 300 sec: 42598.0). Total num frames: 4462100480. Throughput: 0: 42633.6. Samples: 741005940. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 17:28:22,924][09190] Avg episode reward: [(0, '0.738')] [2024-06-28 17:28:24,155][09423] Updated weights for policy 0, policy_version 272347 (0.0034) [2024-06-28 17:28:27,110][09423] Updated weights for policy 0, policy_version 272357 (0.0035) [2024-06-28 17:28:27,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42598.4, 300 sec: 42542.8). Total num frames: 4462313472. Throughput: 0: 42585.7. Samples: 741127800. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 17:28:27,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:28:31,935][09423] Updated weights for policy 0, policy_version 272367 (0.0031) [2024-06-28 17:28:32,921][09190] Fps is (10 sec: 42609.2, 60 sec: 42598.5, 300 sec: 42598.4). Total num frames: 4462526464. Throughput: 0: 42700.9. Samples: 741394340. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 17:28:32,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 17:28:34,720][09423] Updated weights for policy 0, policy_version 272377 (0.0026) [2024-06-28 17:28:37,921][09190] Fps is (10 sec: 40960.4, 60 sec: 42598.4, 300 sec: 42542.9). Total num frames: 4462723072. Throughput: 0: 42689.5. Samples: 741645020. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 17:28:37,922][09190] Avg episode reward: [(0, '0.735')] [2024-06-28 17:28:39,565][09423] Updated weights for policy 0, policy_version 272387 (0.0050) [2024-06-28 17:28:42,449][09423] Updated weights for policy 0, policy_version 272397 (0.0034) [2024-06-28 17:28:42,923][09190] Fps is (10 sec: 42589.8, 60 sec: 42870.0, 300 sec: 42598.1). Total num frames: 4462952448. Throughput: 0: 42529.2. Samples: 741767720. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 17:28:42,924][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:28:47,180][09423] Updated weights for policy 0, policy_version 272407 (0.0029) [2024-06-28 17:28:47,922][09190] Fps is (10 sec: 45874.4, 60 sec: 43144.4, 300 sec: 42709.5). Total num frames: 4463181824. Throughput: 0: 42622.9. Samples: 742034540. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 17:28:47,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 17:28:50,425][09423] Updated weights for policy 0, policy_version 272417 (0.0032) [2024-06-28 17:28:52,922][09190] Fps is (10 sec: 40967.6, 60 sec: 42871.4, 300 sec: 42598.7). Total num frames: 4463362048. Throughput: 0: 42504.5. Samples: 742280020. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 17:28:52,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 17:28:54,711][09423] Updated weights for policy 0, policy_version 272427 (0.0028) [2024-06-28 17:28:57,922][09190] Fps is (10 sec: 39321.6, 60 sec: 42325.2, 300 sec: 42598.4). Total num frames: 4463575040. Throughput: 0: 42501.7. Samples: 742405100. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 17:28:57,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 17:28:58,430][09423] Updated weights for policy 0, policy_version 272437 (0.0032) [2024-06-28 17:29:02,362][09423] Updated weights for policy 0, policy_version 272447 (0.0038) [2024-06-28 17:29:02,921][09190] Fps is (10 sec: 44237.4, 60 sec: 42598.4, 300 sec: 42654.0). Total num frames: 4463804416. Throughput: 0: 42670.7. Samples: 742673700. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 17:29:02,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:29:06,152][09423] Updated weights for policy 0, policy_version 272457 (0.0028) [2024-06-28 17:29:07,922][09190] Fps is (10 sec: 44236.8, 60 sec: 42873.2, 300 sec: 42598.4). Total num frames: 4464017408. Throughput: 0: 42501.8. Samples: 742918420. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 17:29:07,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 17:29:09,717][09423] Updated weights for policy 0, policy_version 272467 (0.0038) [2024-06-28 17:29:12,922][09190] Fps is (10 sec: 42597.7, 60 sec: 42871.4, 300 sec: 42542.8). Total num frames: 4464230400. Throughput: 0: 42694.1. Samples: 743049040. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 17:29:12,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 17:29:13,458][09423] Updated weights for policy 0, policy_version 272477 (0.0030) [2024-06-28 17:29:16,336][09403] Signal inference workers to stop experience collection... (10200 times) [2024-06-28 17:29:16,369][09423] InferenceWorker_p0-w0: stopping experience collection (10200 times) [2024-06-28 17:29:16,393][09403] Signal inference workers to resume experience collection... (10200 times) [2024-06-28 17:29:16,394][09423] InferenceWorker_p0-w0: resuming experience collection (10200 times) [2024-06-28 17:29:17,302][09423] Updated weights for policy 0, policy_version 272487 (0.0030) [2024-06-28 17:29:17,924][09190] Fps is (10 sec: 42587.9, 60 sec: 42596.6, 300 sec: 42598.0). Total num frames: 4464443392. Throughput: 0: 42552.6. Samples: 743309320. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 17:29:17,925][09190] Avg episode reward: [(0, '0.734')] [2024-06-28 17:29:17,931][09190] No heartbeat for components: RolloutWorker_w20 (1296 seconds) [2024-06-28 17:29:21,565][09423] Updated weights for policy 0, policy_version 272497 (0.0038) [2024-06-28 17:29:22,921][09190] Fps is (10 sec: 40960.9, 60 sec: 42327.1, 300 sec: 42598.8). Total num frames: 4464640000. Throughput: 0: 42538.2. Samples: 743559240. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 17:29:22,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 17:29:25,159][09423] Updated weights for policy 0, policy_version 272507 (0.0027) [2024-06-28 17:29:27,922][09190] Fps is (10 sec: 42608.7, 60 sec: 42598.3, 300 sec: 42598.4). Total num frames: 4464869376. Throughput: 0: 42507.9. Samples: 743680500. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 17:29:27,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 17:29:29,328][09423] Updated weights for policy 0, policy_version 272517 (0.0044) [2024-06-28 17:29:32,699][09423] Updated weights for policy 0, policy_version 272527 (0.0030) [2024-06-28 17:29:32,921][09190] Fps is (10 sec: 45875.0, 60 sec: 42871.4, 300 sec: 42709.9). Total num frames: 4465098752. Throughput: 0: 42397.9. Samples: 743942440. Policy #0 lag: (min: 1.0, avg: 10.8, max: 20.0) [2024-06-28 17:29:32,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 17:29:37,067][09423] Updated weights for policy 0, policy_version 272537 (0.0041) [2024-06-28 17:29:37,921][09190] Fps is (10 sec: 39322.1, 60 sec: 42325.3, 300 sec: 42542.9). Total num frames: 4465262592. Throughput: 0: 42541.0. Samples: 744194360. Policy #0 lag: (min: 1.0, avg: 10.8, max: 20.0) [2024-06-28 17:29:37,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 17:29:40,596][09423] Updated weights for policy 0, policy_version 272547 (0.0040) [2024-06-28 17:29:42,921][09190] Fps is (10 sec: 39321.3, 60 sec: 42326.7, 300 sec: 42542.8). Total num frames: 4465491968. Throughput: 0: 42471.6. Samples: 744316320. Policy #0 lag: (min: 1.0, avg: 10.8, max: 20.0) [2024-06-28 17:29:42,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:29:44,430][09423] Updated weights for policy 0, policy_version 272557 (0.0027) [2024-06-28 17:29:47,921][09190] Fps is (10 sec: 45875.0, 60 sec: 42325.3, 300 sec: 42709.4). Total num frames: 4465721344. Throughput: 0: 42384.3. Samples: 744581000. Policy #0 lag: (min: 1.0, avg: 10.8, max: 20.0) [2024-06-28 17:29:47,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:29:48,232][09423] Updated weights for policy 0, policy_version 272567 (0.0037) [2024-06-28 17:29:52,650][09423] Updated weights for policy 0, policy_version 272577 (0.0037) [2024-06-28 17:29:52,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42325.4, 300 sec: 42542.9). Total num frames: 4465901568. Throughput: 0: 42545.8. Samples: 744832980. Policy #0 lag: (min: 1.0, avg: 10.8, max: 20.0) [2024-06-28 17:29:52,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 17:29:55,748][09423] Updated weights for policy 0, policy_version 272587 (0.0035) [2024-06-28 17:29:57,921][09190] Fps is (10 sec: 42599.1, 60 sec: 42871.6, 300 sec: 42598.4). Total num frames: 4466147328. Throughput: 0: 42351.8. Samples: 744954860. Policy #0 lag: (min: 1.0, avg: 10.8, max: 20.0) [2024-06-28 17:29:57,922][09190] Avg episode reward: [(0, '0.735')] [2024-06-28 17:30:00,418][09423] Updated weights for policy 0, policy_version 272597 (0.0032) [2024-06-28 17:30:02,921][09190] Fps is (10 sec: 45875.6, 60 sec: 42598.4, 300 sec: 42653.9). Total num frames: 4466360320. Throughput: 0: 42450.4. Samples: 745219480. Policy #0 lag: (min: 1.0, avg: 10.8, max: 20.0) [2024-06-28 17:30:02,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 17:30:03,239][09423] Updated weights for policy 0, policy_version 272607 (0.0028) [2024-06-28 17:30:07,758][09423] Updated weights for policy 0, policy_version 272617 (0.0033) [2024-06-28 17:30:07,921][09190] Fps is (10 sec: 40959.4, 60 sec: 42325.4, 300 sec: 42598.4). Total num frames: 4466556928. Throughput: 0: 42483.0. Samples: 745470980. Policy #0 lag: (min: 1.0, avg: 10.8, max: 20.0) [2024-06-28 17:30:07,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:30:11,258][09423] Updated weights for policy 0, policy_version 272627 (0.0034) [2024-06-28 17:30:12,921][09190] Fps is (10 sec: 40960.4, 60 sec: 42325.5, 300 sec: 42598.8). Total num frames: 4466769920. Throughput: 0: 42457.6. Samples: 745591080. Policy #0 lag: (min: 1.0, avg: 10.8, max: 20.0) [2024-06-28 17:30:12,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:30:15,492][09423] Updated weights for policy 0, policy_version 272637 (0.0028) [2024-06-28 17:30:17,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42327.1, 300 sec: 42653.9). Total num frames: 4466982912. Throughput: 0: 42417.7. Samples: 745851240. Policy #0 lag: (min: 1.0, avg: 10.8, max: 20.0) [2024-06-28 17:30:17,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 17:30:17,933][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000272643_4466982912.pth... [2024-06-28 17:30:17,977][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000272021_4456792064.pth [2024-06-28 17:30:18,643][09423] Updated weights for policy 0, policy_version 272647 (0.0030) [2024-06-28 17:30:22,921][09190] Fps is (10 sec: 40959.6, 60 sec: 42325.3, 300 sec: 42542.9). Total num frames: 4467179520. Throughput: 0: 42606.7. Samples: 746111660. Policy #0 lag: (min: 1.0, avg: 10.8, max: 20.0) [2024-06-28 17:30:22,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 17:30:23,421][09423] Updated weights for policy 0, policy_version 272657 (0.0045) [2024-06-28 17:30:26,478][09423] Updated weights for policy 0, policy_version 272667 (0.0030) [2024-06-28 17:30:27,922][09190] Fps is (10 sec: 44236.2, 60 sec: 42598.4, 300 sec: 42598.4). Total num frames: 4467425280. Throughput: 0: 42612.4. Samples: 746233880. Policy #0 lag: (min: 1.0, avg: 10.8, max: 20.0) [2024-06-28 17:30:27,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:30:30,531][09403] Signal inference workers to stop experience collection... (10250 times) [2024-06-28 17:30:30,531][09403] Signal inference workers to resume experience collection... (10250 times) [2024-06-28 17:30:30,581][09423] InferenceWorker_p0-w0: stopping experience collection (10250 times) [2024-06-28 17:30:30,581][09423] InferenceWorker_p0-w0: resuming experience collection (10250 times) [2024-06-28 17:30:30,672][09423] Updated weights for policy 0, policy_version 272677 (0.0031) [2024-06-28 17:30:32,922][09190] Fps is (10 sec: 45874.7, 60 sec: 42325.3, 300 sec: 42653.9). Total num frames: 4467638272. Throughput: 0: 42619.1. Samples: 746498860. Policy #0 lag: (min: 1.0, avg: 10.8, max: 20.0) [2024-06-28 17:30:32,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 17:30:34,005][09423] Updated weights for policy 0, policy_version 272687 (0.0040) [2024-06-28 17:30:37,921][09190] Fps is (10 sec: 39322.4, 60 sec: 42598.4, 300 sec: 42598.4). Total num frames: 4467818496. Throughput: 0: 42734.8. Samples: 746756040. Policy #0 lag: (min: 1.0, avg: 10.8, max: 20.0) [2024-06-28 17:30:37,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 17:30:38,866][09423] Updated weights for policy 0, policy_version 272697 (0.0034) [2024-06-28 17:30:41,818][09423] Updated weights for policy 0, policy_version 272707 (0.0033) [2024-06-28 17:30:42,921][09190] Fps is (10 sec: 40960.4, 60 sec: 42598.5, 300 sec: 42598.4). Total num frames: 4468047872. Throughput: 0: 42671.1. Samples: 746875060. Policy #0 lag: (min: 1.0, avg: 10.8, max: 20.0) [2024-06-28 17:30:42,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 17:30:46,192][09423] Updated weights for policy 0, policy_version 272717 (0.0037) [2024-06-28 17:30:47,921][09190] Fps is (10 sec: 45875.1, 60 sec: 42598.5, 300 sec: 42653.9). Total num frames: 4468277248. Throughput: 0: 42601.8. Samples: 747136560. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2024-06-28 17:30:47,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 17:30:49,558][09423] Updated weights for policy 0, policy_version 272727 (0.0034) [2024-06-28 17:30:52,921][09190] Fps is (10 sec: 40959.7, 60 sec: 42598.4, 300 sec: 42542.9). Total num frames: 4468457472. Throughput: 0: 42611.1. Samples: 747388480. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2024-06-28 17:30:52,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 17:30:53,939][09423] Updated weights for policy 0, policy_version 272737 (0.0024) [2024-06-28 17:30:57,339][09423] Updated weights for policy 0, policy_version 272747 (0.0036) [2024-06-28 17:30:57,922][09190] Fps is (10 sec: 40959.5, 60 sec: 42325.2, 300 sec: 42598.7). Total num frames: 4468686848. Throughput: 0: 42658.4. Samples: 747510720. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2024-06-28 17:30:57,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:31:01,693][09423] Updated weights for policy 0, policy_version 272757 (0.0035) [2024-06-28 17:31:02,921][09190] Fps is (10 sec: 44236.8, 60 sec: 42325.3, 300 sec: 42653.9). Total num frames: 4468899840. Throughput: 0: 42716.4. Samples: 747773480. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2024-06-28 17:31:02,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:31:05,303][09423] Updated weights for policy 0, policy_version 272767 (0.0023) [2024-06-28 17:31:07,921][09190] Fps is (10 sec: 44237.5, 60 sec: 42871.5, 300 sec: 42598.4). Total num frames: 4469129216. Throughput: 0: 42544.9. Samples: 748026180. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2024-06-28 17:31:07,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:31:09,106][09423] Updated weights for policy 0, policy_version 272777 (0.0034) [2024-06-28 17:31:12,893][09423] Updated weights for policy 0, policy_version 272787 (0.0029) [2024-06-28 17:31:12,921][09190] Fps is (10 sec: 44237.3, 60 sec: 42871.4, 300 sec: 42598.4). Total num frames: 4469342208. Throughput: 0: 42586.5. Samples: 748150260. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2024-06-28 17:31:12,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:31:16,587][09423] Updated weights for policy 0, policy_version 272797 (0.0028) [2024-06-28 17:31:17,921][09190] Fps is (10 sec: 40959.6, 60 sec: 42598.4, 300 sec: 42598.4). Total num frames: 4469538816. Throughput: 0: 42294.3. Samples: 748402100. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2024-06-28 17:31:17,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:31:21,511][09423] Updated weights for policy 0, policy_version 272807 (0.0028) [2024-06-28 17:31:22,921][09190] Fps is (10 sec: 40959.7, 60 sec: 42871.4, 300 sec: 42542.8). Total num frames: 4469751808. Throughput: 0: 42273.7. Samples: 748658360. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2024-06-28 17:31:22,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 17:31:24,750][09423] Updated weights for policy 0, policy_version 272817 (0.0026) [2024-06-28 17:31:27,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42325.4, 300 sec: 42598.4). Total num frames: 4469964800. Throughput: 0: 42460.8. Samples: 748785800. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2024-06-28 17:31:27,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 17:31:28,992][09423] Updated weights for policy 0, policy_version 272827 (0.0043) [2024-06-28 17:31:32,751][09423] Updated weights for policy 0, policy_version 272837 (0.0030) [2024-06-28 17:31:32,922][09190] Fps is (10 sec: 40959.3, 60 sec: 42052.2, 300 sec: 42598.4). Total num frames: 4470161408. Throughput: 0: 42330.1. Samples: 749041420. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2024-06-28 17:31:32,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 17:31:36,444][09423] Updated weights for policy 0, policy_version 272847 (0.0032) [2024-06-28 17:31:37,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42871.4, 300 sec: 42542.9). Total num frames: 4470390784. Throughput: 0: 42366.2. Samples: 749294960. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2024-06-28 17:31:37,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 17:31:40,058][09423] Updated weights for policy 0, policy_version 272857 (0.0036) [2024-06-28 17:31:42,922][09190] Fps is (10 sec: 42598.7, 60 sec: 42325.3, 300 sec: 42487.7). Total num frames: 4470587392. Throughput: 0: 42573.3. Samples: 749426520. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2024-06-28 17:31:42,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 17:31:44,303][09423] Updated weights for policy 0, policy_version 272867 (0.0030) [2024-06-28 17:31:47,535][09423] Updated weights for policy 0, policy_version 272877 (0.0030) [2024-06-28 17:31:47,924][09190] Fps is (10 sec: 42588.0, 60 sec: 42323.6, 300 sec: 42598.4). Total num frames: 4470816768. Throughput: 0: 42462.1. Samples: 749684380. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2024-06-28 17:31:47,925][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 17:31:51,865][09423] Updated weights for policy 0, policy_version 272887 (0.0044) [2024-06-28 17:31:52,922][09190] Fps is (10 sec: 44236.7, 60 sec: 42871.4, 300 sec: 42542.8). Total num frames: 4471029760. Throughput: 0: 42497.6. Samples: 749938580. Policy #0 lag: (min: 0.0, avg: 8.3, max: 20.0) [2024-06-28 17:31:52,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:31:55,436][09423] Updated weights for policy 0, policy_version 272897 (0.0035) [2024-06-28 17:31:57,922][09190] Fps is (10 sec: 42608.5, 60 sec: 42598.4, 300 sec: 42542.8). Total num frames: 4471242752. Throughput: 0: 42519.8. Samples: 750063660. Policy #0 lag: (min: 1.0, avg: 11.4, max: 22.0) [2024-06-28 17:31:57,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 17:31:59,439][09423] Updated weights for policy 0, policy_version 272907 (0.0045) [2024-06-28 17:32:00,407][09403] Signal inference workers to stop experience collection... (10300 times) [2024-06-28 17:32:00,431][09423] InferenceWorker_p0-w0: stopping experience collection (10300 times) [2024-06-28 17:32:00,465][09403] Signal inference workers to resume experience collection... (10300 times) [2024-06-28 17:32:00,466][09423] InferenceWorker_p0-w0: resuming experience collection (10300 times) [2024-06-28 17:32:02,887][09423] Updated weights for policy 0, policy_version 272917 (0.0030) [2024-06-28 17:32:02,924][09190] Fps is (10 sec: 44226.4, 60 sec: 42869.7, 300 sec: 42709.1). Total num frames: 4471472128. Throughput: 0: 42713.7. Samples: 750324320. Policy #0 lag: (min: 1.0, avg: 11.4, max: 22.0) [2024-06-28 17:32:02,924][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 17:32:07,181][09423] Updated weights for policy 0, policy_version 272927 (0.0032) [2024-06-28 17:32:07,921][09190] Fps is (10 sec: 40960.5, 60 sec: 42052.2, 300 sec: 42487.7). Total num frames: 4471652352. Throughput: 0: 42683.6. Samples: 750579120. Policy #0 lag: (min: 1.0, avg: 11.4, max: 22.0) [2024-06-28 17:32:07,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:32:10,678][09423] Updated weights for policy 0, policy_version 272937 (0.0033) [2024-06-28 17:32:12,924][09190] Fps is (10 sec: 40959.9, 60 sec: 42323.5, 300 sec: 42598.1). Total num frames: 4471881728. Throughput: 0: 42571.4. Samples: 750701620. Policy #0 lag: (min: 1.0, avg: 11.4, max: 22.0) [2024-06-28 17:32:12,924][09190] Avg episode reward: [(0, '0.734')] [2024-06-28 17:32:15,023][09423] Updated weights for policy 0, policy_version 272947 (0.0028) [2024-06-28 17:32:17,921][09190] Fps is (10 sec: 44236.6, 60 sec: 42598.4, 300 sec: 42653.9). Total num frames: 4472094720. Throughput: 0: 42554.4. Samples: 750956360. Policy #0 lag: (min: 1.0, avg: 11.4, max: 22.0) [2024-06-28 17:32:17,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 17:32:17,932][09190] No heartbeat for components: RolloutWorker_w20 (1476 seconds) [2024-06-28 17:32:17,934][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000272955_4472094720.pth... [2024-06-28 17:32:17,987][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000272332_4461887488.pth [2024-06-28 17:32:18,408][09423] Updated weights for policy 0, policy_version 272957 (0.0024) [2024-06-28 17:32:22,632][09423] Updated weights for policy 0, policy_version 272967 (0.0026) [2024-06-28 17:32:22,921][09190] Fps is (10 sec: 40970.5, 60 sec: 42325.4, 300 sec: 42487.3). Total num frames: 4472291328. Throughput: 0: 42649.4. Samples: 751214180. Policy #0 lag: (min: 1.0, avg: 11.4, max: 22.0) [2024-06-28 17:32:22,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:32:26,102][09423] Updated weights for policy 0, policy_version 272977 (0.0030) [2024-06-28 17:32:27,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42325.4, 300 sec: 42487.3). Total num frames: 4472504320. Throughput: 0: 42527.7. Samples: 751340260. Policy #0 lag: (min: 1.0, avg: 11.4, max: 22.0) [2024-06-28 17:32:27,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 17:32:30,496][09423] Updated weights for policy 0, policy_version 272987 (0.0028) [2024-06-28 17:32:32,921][09190] Fps is (10 sec: 44236.7, 60 sec: 42871.6, 300 sec: 42598.4). Total num frames: 4472733696. Throughput: 0: 42545.1. Samples: 751598800. Policy #0 lag: (min: 1.0, avg: 11.4, max: 22.0) [2024-06-28 17:32:32,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:32:33,460][09423] Updated weights for policy 0, policy_version 272997 (0.0031) [2024-06-28 17:32:37,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42325.4, 300 sec: 42542.9). Total num frames: 4472930304. Throughput: 0: 42521.9. Samples: 751852060. Policy #0 lag: (min: 1.0, avg: 11.4, max: 22.0) [2024-06-28 17:32:37,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 17:32:38,052][09423] Updated weights for policy 0, policy_version 273007 (0.0033) [2024-06-28 17:32:41,281][09423] Updated weights for policy 0, policy_version 273017 (0.0036) [2024-06-28 17:32:42,921][09190] Fps is (10 sec: 40959.7, 60 sec: 42598.5, 300 sec: 42542.9). Total num frames: 4473143296. Throughput: 0: 42545.9. Samples: 751978220. Policy #0 lag: (min: 1.0, avg: 11.4, max: 22.0) [2024-06-28 17:32:42,925][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 17:32:45,760][09423] Updated weights for policy 0, policy_version 273027 (0.0028) [2024-06-28 17:32:47,921][09190] Fps is (10 sec: 44236.3, 60 sec: 42600.1, 300 sec: 42653.9). Total num frames: 4473372672. Throughput: 0: 42437.4. Samples: 752233900. Policy #0 lag: (min: 1.0, avg: 11.4, max: 22.0) [2024-06-28 17:32:47,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:32:49,319][09423] Updated weights for policy 0, policy_version 273037 (0.0033) [2024-06-28 17:32:52,922][09190] Fps is (10 sec: 42597.9, 60 sec: 42325.3, 300 sec: 42487.3). Total num frames: 4473569280. Throughput: 0: 42440.7. Samples: 752488960. Policy #0 lag: (min: 1.0, avg: 11.4, max: 22.0) [2024-06-28 17:32:52,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 17:32:53,310][09423] Updated weights for policy 0, policy_version 273047 (0.0029) [2024-06-28 17:32:57,027][09423] Updated weights for policy 0, policy_version 273057 (0.0026) [2024-06-28 17:32:57,921][09190] Fps is (10 sec: 39322.2, 60 sec: 42052.4, 300 sec: 42431.8). Total num frames: 4473765888. Throughput: 0: 42418.0. Samples: 752610320. Policy #0 lag: (min: 1.0, avg: 11.4, max: 22.0) [2024-06-28 17:32:57,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 17:33:01,019][09423] Updated weights for policy 0, policy_version 273067 (0.0036) [2024-06-28 17:33:02,921][09190] Fps is (10 sec: 44237.4, 60 sec: 42327.1, 300 sec: 42598.8). Total num frames: 4474011648. Throughput: 0: 42558.3. Samples: 752871480. Policy #0 lag: (min: 1.0, avg: 11.4, max: 22.0) [2024-06-28 17:33:02,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:33:05,091][09423] Updated weights for policy 0, policy_version 273077 (0.0033) [2024-06-28 17:33:07,921][09190] Fps is (10 sec: 44236.4, 60 sec: 42598.4, 300 sec: 42542.9). Total num frames: 4474208256. Throughput: 0: 42666.1. Samples: 753134160. Policy #0 lag: (min: 1.0, avg: 11.4, max: 22.0) [2024-06-28 17:33:07,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 17:33:08,945][09423] Updated weights for policy 0, policy_version 273087 (0.0035) [2024-06-28 17:33:10,592][09403] Signal inference workers to stop experience collection... (10350 times) [2024-06-28 17:33:10,618][09423] InferenceWorker_p0-w0: stopping experience collection (10350 times) [2024-06-28 17:33:10,652][09403] Signal inference workers to resume experience collection... (10350 times) [2024-06-28 17:33:10,653][09423] InferenceWorker_p0-w0: resuming experience collection (10350 times) [2024-06-28 17:33:12,653][09423] Updated weights for policy 0, policy_version 273097 (0.0024) [2024-06-28 17:33:12,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42327.1, 300 sec: 42487.3). Total num frames: 4474421248. Throughput: 0: 42478.2. Samples: 753251780. Policy #0 lag: (min: 1.0, avg: 9.4, max: 22.0) [2024-06-28 17:33:12,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 17:33:16,443][09423] Updated weights for policy 0, policy_version 273107 (0.0038) [2024-06-28 17:33:17,922][09190] Fps is (10 sec: 42597.9, 60 sec: 42325.3, 300 sec: 42487.7). Total num frames: 4474634240. Throughput: 0: 42367.4. Samples: 753505340. Policy #0 lag: (min: 1.0, avg: 9.4, max: 22.0) [2024-06-28 17:33:17,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:33:20,267][09423] Updated weights for policy 0, policy_version 273117 (0.0033) [2024-06-28 17:33:22,922][09190] Fps is (10 sec: 40959.7, 60 sec: 42325.2, 300 sec: 42431.8). Total num frames: 4474830848. Throughput: 0: 42482.1. Samples: 753763760. Policy #0 lag: (min: 1.0, avg: 9.4, max: 22.0) [2024-06-28 17:33:22,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 17:33:24,243][09423] Updated weights for policy 0, policy_version 273127 (0.0034) [2024-06-28 17:33:27,761][09423] Updated weights for policy 0, policy_version 273137 (0.0024) [2024-06-28 17:33:27,921][09190] Fps is (10 sec: 44237.2, 60 sec: 42871.4, 300 sec: 42542.8). Total num frames: 4475076608. Throughput: 0: 42401.3. Samples: 753886280. Policy #0 lag: (min: 1.0, avg: 9.4, max: 22.0) [2024-06-28 17:33:27,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 17:33:31,632][09423] Updated weights for policy 0, policy_version 273147 (0.0033) [2024-06-28 17:33:32,924][09190] Fps is (10 sec: 45864.2, 60 sec: 42596.6, 300 sec: 42598.0). Total num frames: 4475289600. Throughput: 0: 42407.9. Samples: 754142360. Policy #0 lag: (min: 1.0, avg: 9.4, max: 22.0) [2024-06-28 17:33:32,933][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 17:33:35,514][09423] Updated weights for policy 0, policy_version 273157 (0.0034) [2024-06-28 17:33:37,921][09190] Fps is (10 sec: 39321.6, 60 sec: 42325.3, 300 sec: 42432.1). Total num frames: 4475469824. Throughput: 0: 42516.1. Samples: 754402180. Policy #0 lag: (min: 1.0, avg: 9.4, max: 22.0) [2024-06-28 17:33:37,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:33:39,634][09423] Updated weights for policy 0, policy_version 273167 (0.0030) [2024-06-28 17:33:42,928][09190] Fps is (10 sec: 42582.1, 60 sec: 42867.0, 300 sec: 42486.4). Total num frames: 4475715584. Throughput: 0: 42598.0. Samples: 754527500. Policy #0 lag: (min: 1.0, avg: 9.4, max: 22.0) [2024-06-28 17:33:42,928][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:33:43,154][09423] Updated weights for policy 0, policy_version 273177 (0.0029) [2024-06-28 17:33:46,983][09423] Updated weights for policy 0, policy_version 273187 (0.0036) [2024-06-28 17:33:47,921][09190] Fps is (10 sec: 44237.3, 60 sec: 42325.4, 300 sec: 42542.9). Total num frames: 4475912192. Throughput: 0: 42525.4. Samples: 754785120. Policy #0 lag: (min: 1.0, avg: 9.4, max: 22.0) [2024-06-28 17:33:47,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:33:50,778][09423] Updated weights for policy 0, policy_version 273197 (0.0029) [2024-06-28 17:33:52,921][09190] Fps is (10 sec: 39346.8, 60 sec: 42325.5, 300 sec: 42487.3). Total num frames: 4476108800. Throughput: 0: 42348.1. Samples: 755039820. Policy #0 lag: (min: 1.0, avg: 9.4, max: 22.0) [2024-06-28 17:33:52,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 17:33:55,200][09423] Updated weights for policy 0, policy_version 273207 (0.0040) [2024-06-28 17:33:57,921][09190] Fps is (10 sec: 44236.7, 60 sec: 43144.5, 300 sec: 42542.9). Total num frames: 4476354560. Throughput: 0: 42478.3. Samples: 755163300. Policy #0 lag: (min: 1.0, avg: 9.4, max: 22.0) [2024-06-28 17:33:57,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:33:58,764][09423] Updated weights for policy 0, policy_version 273217 (0.0030) [2024-06-28 17:34:02,624][09423] Updated weights for policy 0, policy_version 273227 (0.0027) [2024-06-28 17:34:02,921][09190] Fps is (10 sec: 44236.8, 60 sec: 42325.4, 300 sec: 42487.3). Total num frames: 4476551168. Throughput: 0: 42603.3. Samples: 755422480. Policy #0 lag: (min: 1.0, avg: 9.4, max: 22.0) [2024-06-28 17:34:02,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 17:34:06,685][09423] Updated weights for policy 0, policy_version 273237 (0.0037) [2024-06-28 17:34:07,921][09190] Fps is (10 sec: 39321.8, 60 sec: 42325.4, 300 sec: 42431.8). Total num frames: 4476747776. Throughput: 0: 42568.2. Samples: 755679320. Policy #0 lag: (min: 1.0, avg: 9.4, max: 22.0) [2024-06-28 17:34:07,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:34:10,212][09423] Updated weights for policy 0, policy_version 273247 (0.0033) [2024-06-28 17:34:12,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42598.5, 300 sec: 42487.7). Total num frames: 4476977152. Throughput: 0: 42568.1. Samples: 755801840. Policy #0 lag: (min: 1.0, avg: 9.4, max: 22.0) [2024-06-28 17:34:12,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 17:34:14,527][09423] Updated weights for policy 0, policy_version 273257 (0.0028) [2024-06-28 17:34:17,924][09190] Fps is (10 sec: 44225.4, 60 sec: 42596.7, 300 sec: 42542.5). Total num frames: 4477190144. Throughput: 0: 42498.7. Samples: 756054800. Policy #0 lag: (min: 1.0, avg: 9.4, max: 22.0) [2024-06-28 17:34:17,924][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 17:34:17,933][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000273266_4477190144.pth... [2024-06-28 17:34:18,006][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000272643_4466982912.pth [2024-06-28 17:34:18,149][09423] Updated weights for policy 0, policy_version 273267 (0.0034) [2024-06-28 17:34:22,036][09423] Updated weights for policy 0, policy_version 273277 (0.0039) [2024-06-28 17:34:22,924][09190] Fps is (10 sec: 39311.4, 60 sec: 42323.6, 300 sec: 42375.9). Total num frames: 4477370368. Throughput: 0: 42578.1. Samples: 756318300. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 17:34:22,924][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 17:34:26,043][09423] Updated weights for policy 0, policy_version 273287 (0.0033) [2024-06-28 17:34:27,921][09190] Fps is (10 sec: 42609.1, 60 sec: 42325.4, 300 sec: 42431.8). Total num frames: 4477616128. Throughput: 0: 42629.5. Samples: 756445560. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 17:34:27,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 17:34:29,600][09423] Updated weights for policy 0, policy_version 273297 (0.0037) [2024-06-28 17:34:32,921][09190] Fps is (10 sec: 44247.9, 60 sec: 42054.0, 300 sec: 42542.9). Total num frames: 4477812736. Throughput: 0: 42314.2. Samples: 756689260. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 17:34:32,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:34:34,051][09423] Updated weights for policy 0, policy_version 273307 (0.0037) [2024-06-28 17:34:34,149][09403] Signal inference workers to stop experience collection... (10400 times) [2024-06-28 17:34:34,189][09423] InferenceWorker_p0-w0: stopping experience collection (10400 times) [2024-06-28 17:34:34,199][09403] Signal inference workers to resume experience collection... (10400 times) [2024-06-28 17:34:34,210][09423] InferenceWorker_p0-w0: resuming experience collection (10400 times) [2024-06-28 17:34:37,788][09423] Updated weights for policy 0, policy_version 273317 (0.0036) [2024-06-28 17:34:37,922][09190] Fps is (10 sec: 40959.5, 60 sec: 42598.3, 300 sec: 42487.3). Total num frames: 4478025728. Throughput: 0: 42395.8. Samples: 756947640. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 17:34:37,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:34:41,435][09423] Updated weights for policy 0, policy_version 273327 (0.0031) [2024-06-28 17:34:42,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42056.7, 300 sec: 42431.8). Total num frames: 4478238720. Throughput: 0: 42450.2. Samples: 757073560. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 17:34:42,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:34:45,490][09423] Updated weights for policy 0, policy_version 273337 (0.0042) [2024-06-28 17:34:47,925][09190] Fps is (10 sec: 42582.5, 60 sec: 42322.6, 300 sec: 42542.3). Total num frames: 4478451712. Throughput: 0: 42315.8. Samples: 757326860. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 17:34:47,926][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:34:49,150][09423] Updated weights for policy 0, policy_version 273347 (0.0031) [2024-06-28 17:34:52,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42598.3, 300 sec: 42431.8). Total num frames: 4478664704. Throughput: 0: 42388.8. Samples: 757586820. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 17:34:52,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 17:34:53,148][09423] Updated weights for policy 0, policy_version 273357 (0.0030) [2024-06-28 17:34:56,885][09423] Updated weights for policy 0, policy_version 273367 (0.0032) [2024-06-28 17:34:57,921][09190] Fps is (10 sec: 44253.6, 60 sec: 42325.3, 300 sec: 42487.3). Total num frames: 4478894080. Throughput: 0: 42421.6. Samples: 757710820. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 17:34:57,922][09190] Avg episode reward: [(0, '0.727')] [2024-06-28 17:35:01,268][09423] Updated weights for policy 0, policy_version 273377 (0.0042) [2024-06-28 17:35:02,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42325.3, 300 sec: 42487.3). Total num frames: 4479090688. Throughput: 0: 42252.1. Samples: 757956040. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 17:35:02,923][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:35:04,515][09423] Updated weights for policy 0, policy_version 273387 (0.0025) [2024-06-28 17:35:07,922][09190] Fps is (10 sec: 39321.5, 60 sec: 42325.2, 300 sec: 42431.8). Total num frames: 4479287296. Throughput: 0: 42196.5. Samples: 758217040. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 17:35:07,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 17:35:08,793][09423] Updated weights for policy 0, policy_version 273397 (0.0031) [2024-06-28 17:35:12,076][09423] Updated weights for policy 0, policy_version 273407 (0.0032) [2024-06-28 17:35:12,922][09190] Fps is (10 sec: 42595.2, 60 sec: 42324.7, 300 sec: 42487.2). Total num frames: 4479516672. Throughput: 0: 42186.9. Samples: 758344000. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 17:35:12,923][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:35:16,335][09423] Updated weights for policy 0, policy_version 273417 (0.0036) [2024-06-28 17:35:17,922][09190] Fps is (10 sec: 45872.2, 60 sec: 42599.7, 300 sec: 42598.3). Total num frames: 4479746048. Throughput: 0: 42514.0. Samples: 758602420. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 17:35:17,923][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:35:17,944][09190] No heartbeat for components: RolloutWorker_w20 (1656 seconds) [2024-06-28 17:35:19,834][09423] Updated weights for policy 0, policy_version 273427 (0.0029) [2024-06-28 17:35:22,921][09190] Fps is (10 sec: 40962.9, 60 sec: 42600.2, 300 sec: 42376.3). Total num frames: 4479926272. Throughput: 0: 42470.3. Samples: 758858800. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 17:35:22,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:35:23,813][09423] Updated weights for policy 0, policy_version 273437 (0.0037) [2024-06-28 17:35:27,638][09423] Updated weights for policy 0, policy_version 273447 (0.0039) [2024-06-28 17:35:27,921][09190] Fps is (10 sec: 40963.3, 60 sec: 42325.4, 300 sec: 42431.8). Total num frames: 4480155648. Throughput: 0: 42473.4. Samples: 758984860. Policy #0 lag: (min: 0.0, avg: 10.4, max: 20.0) [2024-06-28 17:35:27,922][09190] Avg episode reward: [(0, '0.721')] [2024-06-28 17:35:31,491][09423] Updated weights for policy 0, policy_version 273457 (0.0031) [2024-06-28 17:35:32,921][09190] Fps is (10 sec: 44237.1, 60 sec: 42598.4, 300 sec: 42542.9). Total num frames: 4480368640. Throughput: 0: 42508.1. Samples: 759239560. Policy #0 lag: (min: 0.0, avg: 10.6, max: 24.0) [2024-06-28 17:35:32,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:35:35,354][09423] Updated weights for policy 0, policy_version 273467 (0.0032) [2024-06-28 17:35:37,922][09190] Fps is (10 sec: 42597.5, 60 sec: 42598.4, 300 sec: 42487.3). Total num frames: 4480581632. Throughput: 0: 42504.7. Samples: 759499540. Policy #0 lag: (min: 0.0, avg: 10.6, max: 24.0) [2024-06-28 17:35:37,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 17:35:39,097][09423] Updated weights for policy 0, policy_version 273477 (0.0029) [2024-06-28 17:35:42,900][09423] Updated weights for policy 0, policy_version 273487 (0.0040) [2024-06-28 17:35:42,922][09190] Fps is (10 sec: 44236.2, 60 sec: 42871.4, 300 sec: 42487.3). Total num frames: 4480811008. Throughput: 0: 42535.9. Samples: 759624940. Policy #0 lag: (min: 0.0, avg: 10.6, max: 24.0) [2024-06-28 17:35:42,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 17:35:46,707][09423] Updated weights for policy 0, policy_version 273497 (0.0032) [2024-06-28 17:35:47,922][09190] Fps is (10 sec: 42598.5, 60 sec: 42601.1, 300 sec: 42542.9). Total num frames: 4481007616. Throughput: 0: 42720.3. Samples: 759878460. Policy #0 lag: (min: 0.0, avg: 10.6, max: 24.0) [2024-06-28 17:35:47,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:35:50,583][09423] Updated weights for policy 0, policy_version 273507 (0.0034) [2024-06-28 17:35:52,922][09190] Fps is (10 sec: 40960.1, 60 sec: 42598.3, 300 sec: 42487.3). Total num frames: 4481220608. Throughput: 0: 42656.0. Samples: 760136560. Policy #0 lag: (min: 0.0, avg: 10.6, max: 24.0) [2024-06-28 17:35:52,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 17:35:54,525][09423] Updated weights for policy 0, policy_version 273517 (0.0031) [2024-06-28 17:35:57,921][09190] Fps is (10 sec: 42599.2, 60 sec: 42325.4, 300 sec: 42487.3). Total num frames: 4481433600. Throughput: 0: 42630.5. Samples: 760262340. Policy #0 lag: (min: 0.0, avg: 10.6, max: 24.0) [2024-06-28 17:35:57,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 17:35:58,173][09423] Updated weights for policy 0, policy_version 273527 (0.0033) [2024-06-28 17:36:02,517][09423] Updated weights for policy 0, policy_version 273537 (0.0038) [2024-06-28 17:36:02,921][09190] Fps is (10 sec: 42599.3, 60 sec: 42598.5, 300 sec: 42431.8). Total num frames: 4481646592. Throughput: 0: 42534.6. Samples: 760516440. Policy #0 lag: (min: 0.0, avg: 10.6, max: 24.0) [2024-06-28 17:36:02,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:36:06,062][09423] Updated weights for policy 0, policy_version 273547 (0.0039) [2024-06-28 17:36:07,923][09190] Fps is (10 sec: 42592.8, 60 sec: 42870.6, 300 sec: 42431.6). Total num frames: 4481859584. Throughput: 0: 42485.1. Samples: 760770680. Policy #0 lag: (min: 0.0, avg: 10.6, max: 24.0) [2024-06-28 17:36:07,923][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 17:36:08,566][09403] Signal inference workers to stop experience collection... (10450 times) [2024-06-28 17:36:08,566][09403] Signal inference workers to resume experience collection... (10450 times) [2024-06-28 17:36:08,593][09423] InferenceWorker_p0-w0: stopping experience collection (10450 times) [2024-06-28 17:36:08,593][09423] InferenceWorker_p0-w0: resuming experience collection (10450 times) [2024-06-28 17:36:09,892][09423] Updated weights for policy 0, policy_version 273557 (0.0032) [2024-06-28 17:36:12,921][09190] Fps is (10 sec: 42598.2, 60 sec: 42599.0, 300 sec: 42487.3). Total num frames: 4482072576. Throughput: 0: 42543.1. Samples: 760899300. Policy #0 lag: (min: 0.0, avg: 10.6, max: 24.0) [2024-06-28 17:36:12,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:36:13,516][09423] Updated weights for policy 0, policy_version 273567 (0.0037) [2024-06-28 17:36:17,626][09423] Updated weights for policy 0, policy_version 273577 (0.0035) [2024-06-28 17:36:17,921][09190] Fps is (10 sec: 42603.6, 60 sec: 42325.8, 300 sec: 42487.3). Total num frames: 4482285568. Throughput: 0: 42575.5. Samples: 761155460. Policy #0 lag: (min: 0.0, avg: 10.6, max: 24.0) [2024-06-28 17:36:17,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 17:36:17,941][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000273577_4482285568.pth... [2024-06-28 17:36:18,005][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000272955_4472094720.pth [2024-06-28 17:36:21,383][09423] Updated weights for policy 0, policy_version 273587 (0.0027) [2024-06-28 17:36:22,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42871.5, 300 sec: 42487.3). Total num frames: 4482498560. Throughput: 0: 42513.5. Samples: 761412640. Policy #0 lag: (min: 0.0, avg: 10.6, max: 24.0) [2024-06-28 17:36:22,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:36:25,535][09423] Updated weights for policy 0, policy_version 273597 (0.0041) [2024-06-28 17:36:27,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42598.3, 300 sec: 42542.9). Total num frames: 4482711552. Throughput: 0: 42576.5. Samples: 761540880. Policy #0 lag: (min: 0.0, avg: 10.6, max: 24.0) [2024-06-28 17:36:27,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 17:36:28,827][09423] Updated weights for policy 0, policy_version 273607 (0.0033) [2024-06-28 17:36:32,903][09423] Updated weights for policy 0, policy_version 273617 (0.0033) [2024-06-28 17:36:32,922][09190] Fps is (10 sec: 44236.2, 60 sec: 42871.4, 300 sec: 42542.9). Total num frames: 4482940928. Throughput: 0: 42649.4. Samples: 761797680. Policy #0 lag: (min: 0.0, avg: 10.6, max: 24.0) [2024-06-28 17:36:32,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 17:36:36,661][09423] Updated weights for policy 0, policy_version 273627 (0.0038) [2024-06-28 17:36:37,921][09190] Fps is (10 sec: 44237.1, 60 sec: 42871.6, 300 sec: 42598.4). Total num frames: 4483153920. Throughput: 0: 42487.7. Samples: 762048500. Policy #0 lag: (min: 0.0, avg: 10.6, max: 24.0) [2024-06-28 17:36:37,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:36:40,603][09423] Updated weights for policy 0, policy_version 273637 (0.0033) [2024-06-28 17:36:42,921][09190] Fps is (10 sec: 40960.5, 60 sec: 42325.4, 300 sec: 42487.7). Total num frames: 4483350528. Throughput: 0: 42462.6. Samples: 762173160. Policy #0 lag: (min: 0.0, avg: 10.6, max: 24.0) [2024-06-28 17:36:42,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 17:36:44,235][09423] Updated weights for policy 0, policy_version 273647 (0.0036) [2024-06-28 17:36:47,921][09190] Fps is (10 sec: 42598.6, 60 sec: 42871.6, 300 sec: 42542.9). Total num frames: 4483579904. Throughput: 0: 42621.3. Samples: 762434400. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-28 17:36:47,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:36:48,911][09423] Updated weights for policy 0, policy_version 273657 (0.0042) [2024-06-28 17:36:52,127][09423] Updated weights for policy 0, policy_version 273667 (0.0022) [2024-06-28 17:36:52,921][09190] Fps is (10 sec: 42598.2, 60 sec: 42598.5, 300 sec: 42487.3). Total num frames: 4483776512. Throughput: 0: 42659.0. Samples: 762690280. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-28 17:36:52,922][09190] Avg episode reward: [(0, '0.829')] [2024-06-28 17:36:56,330][09423] Updated weights for policy 0, policy_version 273677 (0.0028) [2024-06-28 17:36:57,921][09190] Fps is (10 sec: 40960.4, 60 sec: 42598.4, 300 sec: 42432.2). Total num frames: 4483989504. Throughput: 0: 42626.8. Samples: 762817500. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-28 17:36:57,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 17:36:59,570][09423] Updated weights for policy 0, policy_version 273687 (0.0028) [2024-06-28 17:37:02,921][09190] Fps is (10 sec: 40959.8, 60 sec: 42325.2, 300 sec: 42487.3). Total num frames: 4484186112. Throughput: 0: 42726.2. Samples: 763078140. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-28 17:37:02,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:37:03,692][09423] Updated weights for policy 0, policy_version 273697 (0.0030) [2024-06-28 17:37:07,141][09423] Updated weights for policy 0, policy_version 273707 (0.0044) [2024-06-28 17:37:07,921][09190] Fps is (10 sec: 42597.7, 60 sec: 42599.3, 300 sec: 42487.7). Total num frames: 4484415488. Throughput: 0: 42528.8. Samples: 763326440. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-28 17:37:07,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:37:11,424][09423] Updated weights for policy 0, policy_version 273717 (0.0023) [2024-06-28 17:37:12,921][09190] Fps is (10 sec: 44236.7, 60 sec: 42598.3, 300 sec: 42487.3). Total num frames: 4484628480. Throughput: 0: 42574.2. Samples: 763456720. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-28 17:37:12,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 17:37:15,011][09423] Updated weights for policy 0, policy_version 273727 (0.0031) [2024-06-28 17:37:17,921][09190] Fps is (10 sec: 42598.6, 60 sec: 42598.4, 300 sec: 42542.9). Total num frames: 4484841472. Throughput: 0: 42695.6. Samples: 763718980. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-28 17:37:17,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:37:18,812][09423] Updated weights for policy 0, policy_version 273737 (0.0024) [2024-06-28 17:37:22,921][09190] Fps is (10 sec: 40960.3, 60 sec: 42325.3, 300 sec: 42487.3). Total num frames: 4485038080. Throughput: 0: 42704.0. Samples: 763970180. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-28 17:37:22,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:37:23,204][09423] Updated weights for policy 0, policy_version 273747 (0.0033) [2024-06-28 17:37:27,107][09423] Updated weights for policy 0, policy_version 273757 (0.0039) [2024-06-28 17:37:27,924][09190] Fps is (10 sec: 40949.6, 60 sec: 42323.6, 300 sec: 42431.4). Total num frames: 4485251072. Throughput: 0: 42710.5. Samples: 764095240. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-28 17:37:27,925][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:37:28,436][09403] Signal inference workers to stop experience collection... (10500 times) [2024-06-28 17:37:28,447][09423] InferenceWorker_p0-w0: stopping experience collection (10500 times) [2024-06-28 17:37:28,493][09403] Signal inference workers to resume experience collection... (10500 times) [2024-06-28 17:37:28,494][09423] InferenceWorker_p0-w0: resuming experience collection (10500 times) [2024-06-28 17:37:30,648][09423] Updated weights for policy 0, policy_version 273767 (0.0027) [2024-06-28 17:37:32,921][09190] Fps is (10 sec: 44237.0, 60 sec: 42325.4, 300 sec: 42542.9). Total num frames: 4485480448. Throughput: 0: 42610.7. Samples: 764351880. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-28 17:37:32,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:37:34,890][09423] Updated weights for policy 0, policy_version 273777 (0.0029) [2024-06-28 17:37:37,922][09190] Fps is (10 sec: 44247.3, 60 sec: 42325.2, 300 sec: 42542.8). Total num frames: 4485693440. Throughput: 0: 42539.4. Samples: 764604560. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-28 17:37:37,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:37:38,787][09423] Updated weights for policy 0, policy_version 273787 (0.0028) [2024-06-28 17:37:42,410][09423] Updated weights for policy 0, policy_version 273797 (0.0026) [2024-06-28 17:37:42,921][09190] Fps is (10 sec: 44236.9, 60 sec: 42871.5, 300 sec: 42542.9). Total num frames: 4485922816. Throughput: 0: 42596.8. Samples: 764734360. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-28 17:37:42,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:37:46,704][09423] Updated weights for policy 0, policy_version 273807 (0.0032) [2024-06-28 17:37:47,922][09190] Fps is (10 sec: 42598.7, 60 sec: 42325.2, 300 sec: 42542.9). Total num frames: 4486119424. Throughput: 0: 42526.6. Samples: 764991840. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-28 17:37:47,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:37:49,984][09423] Updated weights for policy 0, policy_version 273817 (0.0041) [2024-06-28 17:37:52,921][09190] Fps is (10 sec: 39321.1, 60 sec: 42325.3, 300 sec: 42542.8). Total num frames: 4486316032. Throughput: 0: 42591.1. Samples: 765243040. Policy #0 lag: (min: 0.0, avg: 10.5, max: 23.0) [2024-06-28 17:37:52,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 17:37:54,118][09423] Updated weights for policy 0, policy_version 273827 (0.0025) [2024-06-28 17:37:57,441][09423] Updated weights for policy 0, policy_version 273837 (0.0032) [2024-06-28 17:37:57,921][09190] Fps is (10 sec: 42598.8, 60 sec: 42598.3, 300 sec: 42487.3). Total num frames: 4486545408. Throughput: 0: 42474.7. Samples: 765368080. Policy #0 lag: (min: 0.0, avg: 11.0, max: 25.0) [2024-06-28 17:37:57,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:38:01,769][09423] Updated weights for policy 0, policy_version 273847 (0.0032) [2024-06-28 17:38:02,921][09190] Fps is (10 sec: 44237.0, 60 sec: 42871.5, 300 sec: 42542.9). Total num frames: 4486758400. Throughput: 0: 42371.5. Samples: 765625700. Policy #0 lag: (min: 0.0, avg: 11.0, max: 25.0) [2024-06-28 17:38:02,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:38:05,391][09423] Updated weights for policy 0, policy_version 273857 (0.0035) [2024-06-28 17:38:07,921][09190] Fps is (10 sec: 40959.8, 60 sec: 42325.3, 300 sec: 42487.3). Total num frames: 4486955008. Throughput: 0: 42494.6. Samples: 765882440. Policy #0 lag: (min: 0.0, avg: 11.0, max: 25.0) [2024-06-28 17:38:07,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:38:09,488][09423] Updated weights for policy 0, policy_version 273867 (0.0025) [2024-06-28 17:38:12,813][09423] Updated weights for policy 0, policy_version 273877 (0.0029) [2024-06-28 17:38:12,921][09190] Fps is (10 sec: 44237.0, 60 sec: 42871.5, 300 sec: 42598.4). Total num frames: 4487200768. Throughput: 0: 42538.4. Samples: 766009360. Policy #0 lag: (min: 0.0, avg: 11.0, max: 25.0) [2024-06-28 17:38:12,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 17:38:16,965][09423] Updated weights for policy 0, policy_version 273887 (0.0034) [2024-06-28 17:38:17,924][09190] Fps is (10 sec: 42587.6, 60 sec: 42323.5, 300 sec: 42542.5). Total num frames: 4487380992. Throughput: 0: 42495.8. Samples: 766264300. Policy #0 lag: (min: 0.0, avg: 11.0, max: 25.0) [2024-06-28 17:38:17,925][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:38:17,931][09190] No heartbeat for components: RolloutWorker_w20 (1836 seconds) [2024-06-28 17:38:18,017][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000273889_4487397376.pth... [2024-06-28 17:38:18,069][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000273266_4477190144.pth [2024-06-28 17:38:20,498][09423] Updated weights for policy 0, policy_version 273897 (0.0032) [2024-06-28 17:38:22,921][09190] Fps is (10 sec: 39321.9, 60 sec: 42598.5, 300 sec: 42431.8). Total num frames: 4487593984. Throughput: 0: 42626.4. Samples: 766522740. Policy #0 lag: (min: 0.0, avg: 11.0, max: 25.0) [2024-06-28 17:38:22,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:38:24,867][09423] Updated weights for policy 0, policy_version 273907 (0.0029) [2024-06-28 17:38:27,924][09190] Fps is (10 sec: 45875.7, 60 sec: 43144.6, 300 sec: 42542.9). Total num frames: 4487839744. Throughput: 0: 42446.5. Samples: 766644560. Policy #0 lag: (min: 0.0, avg: 11.0, max: 25.0) [2024-06-28 17:38:27,924][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 17:38:28,186][09423] Updated weights for policy 0, policy_version 273917 (0.0039) [2024-06-28 17:38:32,288][09423] Updated weights for policy 0, policy_version 273927 (0.0031) [2024-06-28 17:38:32,922][09190] Fps is (10 sec: 44236.0, 60 sec: 42598.3, 300 sec: 42598.4). Total num frames: 4488036352. Throughput: 0: 42575.1. Samples: 766907720. Policy #0 lag: (min: 0.0, avg: 11.0, max: 25.0) [2024-06-28 17:38:32,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:38:35,927][09423] Updated weights for policy 0, policy_version 273937 (0.0031) [2024-06-28 17:38:37,921][09190] Fps is (10 sec: 39331.3, 60 sec: 42325.4, 300 sec: 42432.7). Total num frames: 4488232960. Throughput: 0: 42644.0. Samples: 767162020. Policy #0 lag: (min: 0.0, avg: 11.0, max: 25.0) [2024-06-28 17:38:37,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 17:38:40,033][09423] Updated weights for policy 0, policy_version 273947 (0.0035) [2024-06-28 17:38:41,599][09403] Signal inference workers to stop experience collection... (10550 times) [2024-06-28 17:38:41,601][09403] Signal inference workers to resume experience collection... (10550 times) [2024-06-28 17:38:41,624][09423] InferenceWorker_p0-w0: stopping experience collection (10550 times) [2024-06-28 17:38:41,625][09423] InferenceWorker_p0-w0: resuming experience collection (10550 times) [2024-06-28 17:38:42,924][09190] Fps is (10 sec: 44226.3, 60 sec: 42596.6, 300 sec: 42598.0). Total num frames: 4488478720. Throughput: 0: 42672.7. Samples: 767288460. Policy #0 lag: (min: 0.0, avg: 11.0, max: 25.0) [2024-06-28 17:38:42,924][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 17:38:43,356][09423] Updated weights for policy 0, policy_version 273957 (0.0034) [2024-06-28 17:38:47,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42325.4, 300 sec: 42542.8). Total num frames: 4488658944. Throughput: 0: 42604.0. Samples: 767542880. Policy #0 lag: (min: 0.0, avg: 11.0, max: 25.0) [2024-06-28 17:38:47,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:38:48,256][09423] Updated weights for policy 0, policy_version 273967 (0.0036) [2024-06-28 17:38:50,922][09423] Updated weights for policy 0, policy_version 273977 (0.0034) [2024-06-28 17:38:52,921][09190] Fps is (10 sec: 40969.9, 60 sec: 42871.5, 300 sec: 42487.3). Total num frames: 4488888320. Throughput: 0: 42628.4. Samples: 767800720. Policy #0 lag: (min: 0.0, avg: 11.0, max: 25.0) [2024-06-28 17:38:52,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 17:38:55,596][09423] Updated weights for policy 0, policy_version 273987 (0.0045) [2024-06-28 17:38:57,922][09190] Fps is (10 sec: 45873.0, 60 sec: 42871.1, 300 sec: 42598.3). Total num frames: 4489117696. Throughput: 0: 42596.4. Samples: 767926220. Policy #0 lag: (min: 0.0, avg: 11.0, max: 25.0) [2024-06-28 17:38:57,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:38:58,812][09423] Updated weights for policy 0, policy_version 273997 (0.0029) [2024-06-28 17:39:02,921][09190] Fps is (10 sec: 42599.3, 60 sec: 42598.5, 300 sec: 42598.4). Total num frames: 4489314304. Throughput: 0: 42766.2. Samples: 768188660. Policy #0 lag: (min: 0.0, avg: 11.0, max: 25.0) [2024-06-28 17:39:02,921][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:39:02,943][09423] Updated weights for policy 0, policy_version 274007 (0.0029) [2024-06-28 17:39:06,966][09423] Updated weights for policy 0, policy_version 274017 (0.0033) [2024-06-28 17:39:07,922][09190] Fps is (10 sec: 39323.2, 60 sec: 42598.4, 300 sec: 42487.3). Total num frames: 4489510912. Throughput: 0: 42564.3. Samples: 768438140. Policy #0 lag: (min: 0.0, avg: 11.0, max: 25.0) [2024-06-28 17:39:07,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:39:10,769][09423] Updated weights for policy 0, policy_version 274027 (0.0032) [2024-06-28 17:39:12,921][09190] Fps is (10 sec: 44236.2, 60 sec: 42598.4, 300 sec: 42598.8). Total num frames: 4489756672. Throughput: 0: 42577.5. Samples: 768560440. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 17:39:12,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 17:39:14,935][09423] Updated weights for policy 0, policy_version 274037 (0.0033) [2024-06-28 17:39:17,924][09190] Fps is (10 sec: 44226.0, 60 sec: 42871.5, 300 sec: 42653.9). Total num frames: 4489953280. Throughput: 0: 42531.5. Samples: 768821740. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 17:39:17,924][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:39:18,784][09423] Updated weights for policy 0, policy_version 274047 (0.0028) [2024-06-28 17:39:22,652][09423] Updated weights for policy 0, policy_version 274057 (0.0027) [2024-06-28 17:39:22,922][09190] Fps is (10 sec: 40959.3, 60 sec: 42871.3, 300 sec: 42542.8). Total num frames: 4490166272. Throughput: 0: 42398.5. Samples: 769069960. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 17:39:22,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 17:39:26,215][09423] Updated weights for policy 0, policy_version 274067 (0.0030) [2024-06-28 17:39:27,924][09190] Fps is (10 sec: 42598.5, 60 sec: 42325.3, 300 sec: 42598.0). Total num frames: 4490379264. Throughput: 0: 42558.2. Samples: 769203580. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 17:39:27,924][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:39:30,213][09423] Updated weights for policy 0, policy_version 274077 (0.0035) [2024-06-28 17:39:32,921][09190] Fps is (10 sec: 39322.3, 60 sec: 42052.4, 300 sec: 42487.3). Total num frames: 4490559488. Throughput: 0: 42523.1. Samples: 769456420. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 17:39:32,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:39:33,686][09423] Updated weights for policy 0, policy_version 274087 (0.0033) [2024-06-28 17:39:37,474][09423] Updated weights for policy 0, policy_version 274097 (0.0035) [2024-06-28 17:39:37,921][09190] Fps is (10 sec: 42608.8, 60 sec: 42871.5, 300 sec: 42598.4). Total num frames: 4490805248. Throughput: 0: 42395.1. Samples: 769708500. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 17:39:37,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 17:39:41,589][09423] Updated weights for policy 0, policy_version 274107 (0.0032) [2024-06-28 17:39:42,924][09190] Fps is (10 sec: 45863.7, 60 sec: 42325.3, 300 sec: 42598.6). Total num frames: 4491018240. Throughput: 0: 42454.6. Samples: 769836760. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 17:39:42,924][09190] Avg episode reward: [(0, '0.738')] [2024-06-28 17:39:45,798][09423] Updated weights for policy 0, policy_version 274117 (0.0034) [2024-06-28 17:39:47,921][09190] Fps is (10 sec: 42598.6, 60 sec: 42871.5, 300 sec: 42598.4). Total num frames: 4491231232. Throughput: 0: 42440.3. Samples: 770098480. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 17:39:47,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:39:49,150][09423] Updated weights for policy 0, policy_version 274127 (0.0023) [2024-06-28 17:39:52,921][09190] Fps is (10 sec: 40970.6, 60 sec: 42325.4, 300 sec: 42487.3). Total num frames: 4491427840. Throughput: 0: 42509.5. Samples: 770351060. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 17:39:52,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 17:39:53,770][09423] Updated weights for policy 0, policy_version 274137 (0.0038) [2024-06-28 17:39:56,022][09403] Signal inference workers to stop experience collection... (10600 times) [2024-06-28 17:39:56,024][09403] Signal inference workers to resume experience collection... (10600 times) [2024-06-28 17:39:56,042][09423] InferenceWorker_p0-w0: stopping experience collection (10600 times) [2024-06-28 17:39:56,043][09423] InferenceWorker_p0-w0: resuming experience collection (10600 times) [2024-06-28 17:39:56,647][09423] Updated weights for policy 0, policy_version 274147 (0.0030) [2024-06-28 17:39:57,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42325.7, 300 sec: 42598.4). Total num frames: 4491657216. Throughput: 0: 42565.8. Samples: 770475900. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 17:39:57,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:40:01,503][09423] Updated weights for policy 0, policy_version 274157 (0.0033) [2024-06-28 17:40:02,921][09190] Fps is (10 sec: 42598.0, 60 sec: 42325.2, 300 sec: 42598.4). Total num frames: 4491853824. Throughput: 0: 42550.8. Samples: 770736420. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 17:40:02,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 17:40:04,471][09423] Updated weights for policy 0, policy_version 274167 (0.0030) [2024-06-28 17:40:07,922][09190] Fps is (10 sec: 42598.0, 60 sec: 42871.5, 300 sec: 42598.5). Total num frames: 4492083200. Throughput: 0: 42677.8. Samples: 770990460. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 17:40:07,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:40:08,915][09423] Updated weights for policy 0, policy_version 274177 (0.0023) [2024-06-28 17:40:12,324][09423] Updated weights for policy 0, policy_version 274187 (0.0029) [2024-06-28 17:40:12,921][09190] Fps is (10 sec: 44236.5, 60 sec: 42325.3, 300 sec: 42543.0). Total num frames: 4492296192. Throughput: 0: 42493.0. Samples: 771115660. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 17:40:12,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 17:40:16,398][09423] Updated weights for policy 0, policy_version 274197 (0.0028) [2024-06-28 17:40:17,921][09190] Fps is (10 sec: 44237.4, 60 sec: 42873.3, 300 sec: 42709.5). Total num frames: 4492525568. Throughput: 0: 42745.4. Samples: 771379960. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 17:40:17,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 17:40:18,065][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000274203_4492541952.pth... [2024-06-28 17:40:18,118][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000273577_4482285568.pth [2024-06-28 17:40:19,905][09423] Updated weights for policy 0, policy_version 274207 (0.0029) [2024-06-28 17:40:22,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42325.4, 300 sec: 42542.8). Total num frames: 4492705792. Throughput: 0: 42804.9. Samples: 771634720. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 17:40:22,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:40:24,207][09423] Updated weights for policy 0, policy_version 274217 (0.0030) [2024-06-28 17:40:27,657][09423] Updated weights for policy 0, policy_version 274227 (0.0031) [2024-06-28 17:40:27,921][09190] Fps is (10 sec: 40959.8, 60 sec: 42600.2, 300 sec: 42598.4). Total num frames: 4492935168. Throughput: 0: 42637.5. Samples: 771755340. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 17:40:27,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:40:31,866][09423] Updated weights for policy 0, policy_version 274237 (0.0034) [2024-06-28 17:40:32,921][09190] Fps is (10 sec: 45875.5, 60 sec: 43417.6, 300 sec: 42654.0). Total num frames: 4493164544. Throughput: 0: 42767.6. Samples: 772023020. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 17:40:32,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 17:40:35,738][09423] Updated weights for policy 0, policy_version 274247 (0.0033) [2024-06-28 17:40:37,921][09190] Fps is (10 sec: 40959.8, 60 sec: 42325.3, 300 sec: 42487.3). Total num frames: 4493344768. Throughput: 0: 42723.4. Samples: 772273620. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 17:40:37,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:40:39,358][09423] Updated weights for policy 0, policy_version 274257 (0.0028) [2024-06-28 17:40:42,924][09190] Fps is (10 sec: 39311.8, 60 sec: 42325.4, 300 sec: 42542.5). Total num frames: 4493557760. Throughput: 0: 42734.6. Samples: 772399060. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 17:40:42,924][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 17:40:43,208][09423] Updated weights for policy 0, policy_version 274267 (0.0035) [2024-06-28 17:40:46,924][09423] Updated weights for policy 0, policy_version 274277 (0.0036) [2024-06-28 17:40:47,921][09190] Fps is (10 sec: 45875.2, 60 sec: 42871.4, 300 sec: 42653.9). Total num frames: 4493803520. Throughput: 0: 42795.1. Samples: 772662200. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 17:40:47,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 17:40:50,610][09423] Updated weights for policy 0, policy_version 274287 (0.0041) [2024-06-28 17:40:52,921][09190] Fps is (10 sec: 44247.3, 60 sec: 42871.3, 300 sec: 42598.4). Total num frames: 4494000128. Throughput: 0: 42765.8. Samples: 772914920. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 17:40:52,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 17:40:54,392][09423] Updated weights for policy 0, policy_version 274297 (0.0029) [2024-06-28 17:40:57,922][09190] Fps is (10 sec: 40959.8, 60 sec: 42598.3, 300 sec: 42598.4). Total num frames: 4494213120. Throughput: 0: 42781.3. Samples: 773040820. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 17:40:57,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 17:40:58,463][09423] Updated weights for policy 0, policy_version 274307 (0.0029) [2024-06-28 17:41:02,397][09423] Updated weights for policy 0, policy_version 274317 (0.0032) [2024-06-28 17:41:02,921][09190] Fps is (10 sec: 42598.8, 60 sec: 42871.5, 300 sec: 42598.6). Total num frames: 4494426112. Throughput: 0: 42749.7. Samples: 773303700. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 17:41:02,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 17:41:06,023][09423] Updated weights for policy 0, policy_version 274327 (0.0022) [2024-06-28 17:41:07,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42598.4, 300 sec: 42598.4). Total num frames: 4494639104. Throughput: 0: 42503.1. Samples: 773547360. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 17:41:07,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 17:41:10,025][09423] Updated weights for policy 0, policy_version 274337 (0.0032) [2024-06-28 17:41:12,921][09190] Fps is (10 sec: 40959.7, 60 sec: 42325.3, 300 sec: 42542.9). Total num frames: 4494835712. Throughput: 0: 42553.3. Samples: 773670240. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 17:41:12,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:41:13,877][09403] Signal inference workers to stop experience collection... (10650 times) [2024-06-28 17:41:13,906][09423] InferenceWorker_p0-w0: stopping experience collection (10650 times) [2024-06-28 17:41:13,933][09403] Signal inference workers to resume experience collection... (10650 times) [2024-06-28 17:41:13,934][09423] InferenceWorker_p0-w0: resuming experience collection (10650 times) [2024-06-28 17:41:13,937][09423] Updated weights for policy 0, policy_version 274347 (0.0037) [2024-06-28 17:41:17,430][09423] Updated weights for policy 0, policy_version 274357 (0.0029) [2024-06-28 17:41:17,921][09190] Fps is (10 sec: 42598.7, 60 sec: 42325.3, 300 sec: 42598.4). Total num frames: 4495065088. Throughput: 0: 42406.6. Samples: 773931320. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 17:41:17,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 17:41:17,929][09190] No heartbeat for components: RolloutWorker_w20 (2016 seconds) [2024-06-28 17:41:21,653][09423] Updated weights for policy 0, policy_version 274367 (0.0033) [2024-06-28 17:41:22,921][09190] Fps is (10 sec: 44236.8, 60 sec: 42871.4, 300 sec: 42598.4). Total num frames: 4495278080. Throughput: 0: 42497.3. Samples: 774186000. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 17:41:22,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:41:25,394][09423] Updated weights for policy 0, policy_version 274377 (0.0032) [2024-06-28 17:41:27,921][09190] Fps is (10 sec: 40960.1, 60 sec: 42325.4, 300 sec: 42487.3). Total num frames: 4495474688. Throughput: 0: 42495.7. Samples: 774311260. Policy #0 lag: (min: 0.0, avg: 10.3, max: 20.0) [2024-06-28 17:41:27,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 17:41:29,173][09423] Updated weights for policy 0, policy_version 274387 (0.0045) [2024-06-28 17:41:32,924][09190] Fps is (10 sec: 40949.9, 60 sec: 42050.5, 300 sec: 42487.0). Total num frames: 4495687680. Throughput: 0: 42355.9. Samples: 774568320. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 17:41:32,925][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 17:41:33,209][09423] Updated weights for policy 0, policy_version 274397 (0.0043) [2024-06-28 17:41:36,862][09423] Updated weights for policy 0, policy_version 274407 (0.0040) [2024-06-28 17:41:37,922][09190] Fps is (10 sec: 44236.1, 60 sec: 42871.4, 300 sec: 42598.4). Total num frames: 4495917056. Throughput: 0: 42271.5. Samples: 774817140. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 17:41:37,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 17:41:40,769][09423] Updated weights for policy 0, policy_version 274417 (0.0037) [2024-06-28 17:41:42,921][09190] Fps is (10 sec: 42609.2, 60 sec: 42600.1, 300 sec: 42487.3). Total num frames: 4496113664. Throughput: 0: 42331.2. Samples: 774945720. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 17:41:42,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 17:41:44,594][09423] Updated weights for policy 0, policy_version 274427 (0.0035) [2024-06-28 17:41:47,922][09190] Fps is (10 sec: 39321.3, 60 sec: 41779.1, 300 sec: 42487.3). Total num frames: 4496310272. Throughput: 0: 42290.0. Samples: 775206760. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 17:41:47,922][09190] Avg episode reward: [(0, '0.738')] [2024-06-28 17:41:48,601][09423] Updated weights for policy 0, policy_version 274437 (0.0027) [2024-06-28 17:41:52,385][09423] Updated weights for policy 0, policy_version 274447 (0.0027) [2024-06-28 17:41:52,922][09190] Fps is (10 sec: 44236.4, 60 sec: 42598.4, 300 sec: 42598.4). Total num frames: 4496556032. Throughput: 0: 42356.4. Samples: 775453400. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 17:41:52,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:41:56,600][09423] Updated weights for policy 0, policy_version 274457 (0.0039) [2024-06-28 17:41:57,921][09190] Fps is (10 sec: 45876.1, 60 sec: 42598.5, 300 sec: 42653.9). Total num frames: 4496769024. Throughput: 0: 42479.2. Samples: 775581800. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 17:41:57,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 17:42:00,320][09423] Updated weights for policy 0, policy_version 274467 (0.0034) [2024-06-28 17:42:02,921][09190] Fps is (10 sec: 40960.4, 60 sec: 42325.3, 300 sec: 42542.9). Total num frames: 4496965632. Throughput: 0: 42490.2. Samples: 775843380. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 17:42:02,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:42:04,150][09423] Updated weights for policy 0, policy_version 274477 (0.0040) [2024-06-28 17:42:07,706][09423] Updated weights for policy 0, policy_version 274487 (0.0034) [2024-06-28 17:42:07,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42598.4, 300 sec: 42598.4). Total num frames: 4497195008. Throughput: 0: 42456.5. Samples: 776096540. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 17:42:07,922][09190] Avg episode reward: [(0, '0.873')] [2024-06-28 17:42:07,930][09403] Saving new best policy, reward=0.873! [2024-06-28 17:42:11,979][09423] Updated weights for policy 0, policy_version 274497 (0.0035) [2024-06-28 17:42:12,922][09190] Fps is (10 sec: 44236.4, 60 sec: 42871.4, 300 sec: 42598.4). Total num frames: 4497408000. Throughput: 0: 42594.5. Samples: 776228020. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 17:42:12,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 17:42:15,802][09423] Updated weights for policy 0, policy_version 274507 (0.0043) [2024-06-28 17:42:17,921][09190] Fps is (10 sec: 40960.3, 60 sec: 42325.4, 300 sec: 42598.4). Total num frames: 4497604608. Throughput: 0: 42441.5. Samples: 776478080. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 17:42:17,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 17:42:17,931][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000274512_4497604608.pth... [2024-06-28 17:42:17,977][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000273889_4487397376.pth [2024-06-28 17:42:19,801][09423] Updated weights for policy 0, policy_version 274517 (0.0030) [2024-06-28 17:42:22,921][09190] Fps is (10 sec: 40960.7, 60 sec: 42325.4, 300 sec: 42598.8). Total num frames: 4497817600. Throughput: 0: 42504.6. Samples: 776729840. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 17:42:22,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 17:42:23,401][09423] Updated weights for policy 0, policy_version 274527 (0.0028) [2024-06-28 17:42:27,156][09423] Updated weights for policy 0, policy_version 274537 (0.0035) [2024-06-28 17:42:27,921][09190] Fps is (10 sec: 44236.6, 60 sec: 42871.4, 300 sec: 42598.4). Total num frames: 4498046976. Throughput: 0: 42596.4. Samples: 776862560. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 17:42:27,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:42:30,960][09423] Updated weights for policy 0, policy_version 274547 (0.0040) [2024-06-28 17:42:32,924][09190] Fps is (10 sec: 40949.4, 60 sec: 42325.4, 300 sec: 42487.0). Total num frames: 4498227200. Throughput: 0: 42410.3. Samples: 777115320. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 17:42:32,925][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 17:42:34,821][09423] Updated weights for policy 0, policy_version 274557 (0.0030) [2024-06-28 17:42:37,921][09190] Fps is (10 sec: 40960.4, 60 sec: 42325.5, 300 sec: 42487.3). Total num frames: 4498456576. Throughput: 0: 42500.2. Samples: 777365900. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 17:42:37,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 17:42:38,655][09423] Updated weights for policy 0, policy_version 274567 (0.0036) [2024-06-28 17:42:42,103][09403] Signal inference workers to stop experience collection... (10700 times) [2024-06-28 17:42:42,151][09423] InferenceWorker_p0-w0: stopping experience collection (10700 times) [2024-06-28 17:42:42,219][09403] Signal inference workers to resume experience collection... (10700 times) [2024-06-28 17:42:42,219][09423] InferenceWorker_p0-w0: resuming experience collection (10700 times) [2024-06-28 17:42:42,349][09423] Updated weights for policy 0, policy_version 274577 (0.0022) [2024-06-28 17:42:42,921][09190] Fps is (10 sec: 44248.0, 60 sec: 42598.4, 300 sec: 42542.9). Total num frames: 4498669568. Throughput: 0: 42695.1. Samples: 777503080. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 17:42:42,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:42:46,524][09423] Updated weights for policy 0, policy_version 274587 (0.0030) [2024-06-28 17:42:47,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42598.6, 300 sec: 42542.9). Total num frames: 4498866176. Throughput: 0: 42508.9. Samples: 777756280. Policy #0 lag: (min: 1.0, avg: 10.3, max: 20.0) [2024-06-28 17:42:47,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:42:50,282][09423] Updated weights for policy 0, policy_version 274597 (0.0032) [2024-06-28 17:42:52,921][09190] Fps is (10 sec: 44236.9, 60 sec: 42598.5, 300 sec: 42598.4). Total num frames: 4499111936. Throughput: 0: 42386.8. Samples: 778003940. Policy #0 lag: (min: 1.0, avg: 10.3, max: 20.0) [2024-06-28 17:42:52,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 17:42:54,215][09423] Updated weights for policy 0, policy_version 274607 (0.0040) [2024-06-28 17:42:57,845][09423] Updated weights for policy 0, policy_version 274617 (0.0035) [2024-06-28 17:42:57,921][09190] Fps is (10 sec: 45875.2, 60 sec: 42598.4, 300 sec: 42598.4). Total num frames: 4499324928. Throughput: 0: 42529.0. Samples: 778141820. Policy #0 lag: (min: 1.0, avg: 10.3, max: 20.0) [2024-06-28 17:42:57,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:43:02,178][09423] Updated weights for policy 0, policy_version 274627 (0.0033) [2024-06-28 17:43:02,921][09190] Fps is (10 sec: 39321.4, 60 sec: 42325.3, 300 sec: 42542.9). Total num frames: 4499505152. Throughput: 0: 42575.1. Samples: 778393960. Policy #0 lag: (min: 1.0, avg: 10.3, max: 20.0) [2024-06-28 17:43:02,922][09190] Avg episode reward: [(0, '0.821')] [2024-06-28 17:43:05,552][09423] Updated weights for policy 0, policy_version 274637 (0.0034) [2024-06-28 17:43:07,921][09190] Fps is (10 sec: 42598.1, 60 sec: 42598.4, 300 sec: 42542.9). Total num frames: 4499750912. Throughput: 0: 42664.4. Samples: 778649740. Policy #0 lag: (min: 1.0, avg: 10.3, max: 20.0) [2024-06-28 17:43:07,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:43:09,982][09423] Updated weights for policy 0, policy_version 274647 (0.0040) [2024-06-28 17:43:12,921][09190] Fps is (10 sec: 44236.9, 60 sec: 42325.4, 300 sec: 42598.8). Total num frames: 4499947520. Throughput: 0: 42651.2. Samples: 778781860. Policy #0 lag: (min: 1.0, avg: 10.3, max: 20.0) [2024-06-28 17:43:12,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 17:43:13,152][09423] Updated weights for policy 0, policy_version 274657 (0.0033) [2024-06-28 17:43:17,390][09423] Updated weights for policy 0, policy_version 274667 (0.0034) [2024-06-28 17:43:17,921][09190] Fps is (10 sec: 39321.6, 60 sec: 42325.3, 300 sec: 42542.9). Total num frames: 4500144128. Throughput: 0: 42613.5. Samples: 779032820. Policy #0 lag: (min: 1.0, avg: 10.3, max: 20.0) [2024-06-28 17:43:17,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 17:43:20,988][09423] Updated weights for policy 0, policy_version 274677 (0.0031) [2024-06-28 17:43:22,922][09190] Fps is (10 sec: 44236.1, 60 sec: 42871.3, 300 sec: 42543.2). Total num frames: 4500389888. Throughput: 0: 42453.1. Samples: 779276300. Policy #0 lag: (min: 1.0, avg: 10.3, max: 20.0) [2024-06-28 17:43:22,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:43:25,447][09423] Updated weights for policy 0, policy_version 274687 (0.0039) [2024-06-28 17:43:27,921][09190] Fps is (10 sec: 44237.1, 60 sec: 42325.4, 300 sec: 42542.9). Total num frames: 4500586496. Throughput: 0: 42507.2. Samples: 779415900. Policy #0 lag: (min: 1.0, avg: 10.3, max: 20.0) [2024-06-28 17:43:27,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 17:43:28,893][09423] Updated weights for policy 0, policy_version 274697 (0.0041) [2024-06-28 17:43:32,921][09190] Fps is (10 sec: 37683.8, 60 sec: 42327.1, 300 sec: 42487.3). Total num frames: 4500766720. Throughput: 0: 42470.2. Samples: 779667440. Policy #0 lag: (min: 1.0, avg: 10.3, max: 20.0) [2024-06-28 17:43:32,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 17:43:33,260][09423] Updated weights for policy 0, policy_version 274707 (0.0035) [2024-06-28 17:43:36,406][09423] Updated weights for policy 0, policy_version 274717 (0.0030) [2024-06-28 17:43:37,921][09190] Fps is (10 sec: 44236.4, 60 sec: 42871.4, 300 sec: 42543.2). Total num frames: 4501028864. Throughput: 0: 42555.5. Samples: 779918940. Policy #0 lag: (min: 1.0, avg: 10.3, max: 20.0) [2024-06-28 17:43:37,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:43:41,005][09423] Updated weights for policy 0, policy_version 274727 (0.0026) [2024-06-28 17:43:42,921][09190] Fps is (10 sec: 47513.6, 60 sec: 42871.5, 300 sec: 42653.9). Total num frames: 4501241856. Throughput: 0: 42575.5. Samples: 780057720. Policy #0 lag: (min: 1.0, avg: 10.3, max: 20.0) [2024-06-28 17:43:42,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:43:43,803][09423] Updated weights for policy 0, policy_version 274737 (0.0026) [2024-06-28 17:43:47,922][09190] Fps is (10 sec: 37681.2, 60 sec: 42324.9, 300 sec: 42431.7). Total num frames: 4501405696. Throughput: 0: 42582.1. Samples: 780310180. Policy #0 lag: (min: 1.0, avg: 10.3, max: 20.0) [2024-06-28 17:43:47,923][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 17:43:48,529][09423] Updated weights for policy 0, policy_version 274747 (0.0026) [2024-06-28 17:43:51,792][09423] Updated weights for policy 0, policy_version 274757 (0.0032) [2024-06-28 17:43:52,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42598.4, 300 sec: 42542.9). Total num frames: 4501667840. Throughput: 0: 42270.7. Samples: 780551920. Policy #0 lag: (min: 1.0, avg: 10.3, max: 20.0) [2024-06-28 17:43:52,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 17:43:56,271][09423] Updated weights for policy 0, policy_version 274767 (0.0039) [2024-06-28 17:43:57,921][09190] Fps is (10 sec: 47515.8, 60 sec: 42598.3, 300 sec: 42598.4). Total num frames: 4501880832. Throughput: 0: 42372.3. Samples: 780688620. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2024-06-28 17:43:57,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 17:43:59,551][09423] Updated weights for policy 0, policy_version 274777 (0.0034) [2024-06-28 17:44:02,921][09190] Fps is (10 sec: 37683.0, 60 sec: 42325.3, 300 sec: 42487.3). Total num frames: 4502044672. Throughput: 0: 42530.6. Samples: 780946700. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2024-06-28 17:44:02,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 17:44:03,966][09423] Updated weights for policy 0, policy_version 274787 (0.0035) [2024-06-28 17:44:07,035][09423] Updated weights for policy 0, policy_version 274797 (0.0032) [2024-06-28 17:44:07,922][09190] Fps is (10 sec: 40959.7, 60 sec: 42325.2, 300 sec: 42487.3). Total num frames: 4502290432. Throughput: 0: 42624.9. Samples: 781194420. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2024-06-28 17:44:07,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 17:44:11,480][09423] Updated weights for policy 0, policy_version 274807 (0.0034) [2024-06-28 17:44:12,921][09190] Fps is (10 sec: 47513.7, 60 sec: 42871.4, 300 sec: 42598.8). Total num frames: 4502519808. Throughput: 0: 42570.1. Samples: 781331560. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2024-06-28 17:44:12,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 17:44:14,750][09423] Updated weights for policy 0, policy_version 274817 (0.0033) [2024-06-28 17:44:17,922][09190] Fps is (10 sec: 39321.6, 60 sec: 42325.2, 300 sec: 42431.8). Total num frames: 4502683648. Throughput: 0: 42527.4. Samples: 781581180. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2024-06-28 17:44:17,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:44:17,933][09190] No heartbeat for components: RolloutWorker_w20 (2196 seconds) [2024-06-28 17:44:18,054][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000274823_4502700032.pth... [2024-06-28 17:44:18,101][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000274203_4492541952.pth [2024-06-28 17:44:19,062][09403] Signal inference workers to stop experience collection... (10750 times) [2024-06-28 17:44:19,089][09423] InferenceWorker_p0-w0: stopping experience collection (10750 times) [2024-06-28 17:44:19,119][09403] Signal inference workers to resume experience collection... (10750 times) [2024-06-28 17:44:19,124][09423] InferenceWorker_p0-w0: resuming experience collection (10750 times) [2024-06-28 17:44:19,128][09423] Updated weights for policy 0, policy_version 274827 (0.0031) [2024-06-28 17:44:22,337][09423] Updated weights for policy 0, policy_version 274837 (0.0043) [2024-06-28 17:44:22,922][09190] Fps is (10 sec: 40959.7, 60 sec: 42325.4, 300 sec: 42543.2). Total num frames: 4502929408. Throughput: 0: 42499.9. Samples: 781831440. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2024-06-28 17:44:22,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:44:26,586][09423] Updated weights for policy 0, policy_version 274847 (0.0029) [2024-06-28 17:44:27,921][09190] Fps is (10 sec: 47514.0, 60 sec: 42871.4, 300 sec: 42709.5). Total num frames: 4503158784. Throughput: 0: 42453.2. Samples: 781968120. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2024-06-28 17:44:27,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:44:30,482][09423] Updated weights for policy 0, policy_version 274857 (0.0030) [2024-06-28 17:44:32,923][09190] Fps is (10 sec: 39316.1, 60 sec: 42597.3, 300 sec: 42431.6). Total num frames: 4503322624. Throughput: 0: 42280.4. Samples: 782212840. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2024-06-28 17:44:32,923][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:44:34,501][09423] Updated weights for policy 0, policy_version 274867 (0.0029) [2024-06-28 17:44:37,922][09190] Fps is (10 sec: 39321.3, 60 sec: 42052.2, 300 sec: 42487.7). Total num frames: 4503552000. Throughput: 0: 42499.0. Samples: 782464380. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2024-06-28 17:44:37,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:44:38,428][09423] Updated weights for policy 0, policy_version 274877 (0.0033) [2024-06-28 17:44:42,083][09423] Updated weights for policy 0, policy_version 274887 (0.0025) [2024-06-28 17:44:42,921][09190] Fps is (10 sec: 45882.6, 60 sec: 42325.4, 300 sec: 42542.9). Total num frames: 4503781376. Throughput: 0: 42396.6. Samples: 782596460. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2024-06-28 17:44:42,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 17:44:46,257][09423] Updated weights for policy 0, policy_version 274897 (0.0036) [2024-06-28 17:44:47,921][09190] Fps is (10 sec: 39322.2, 60 sec: 42325.7, 300 sec: 42431.8). Total num frames: 4503945216. Throughput: 0: 42402.3. Samples: 782854800. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2024-06-28 17:44:47,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:44:49,472][09423] Updated weights for policy 0, policy_version 274907 (0.0032) [2024-06-28 17:44:52,921][09190] Fps is (10 sec: 42598.0, 60 sec: 42325.3, 300 sec: 42542.9). Total num frames: 4504207360. Throughput: 0: 42521.0. Samples: 783107860. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2024-06-28 17:44:52,925][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 17:44:54,233][09423] Updated weights for policy 0, policy_version 274917 (0.0037) [2024-06-28 17:44:57,147][09423] Updated weights for policy 0, policy_version 274927 (0.0033) [2024-06-28 17:44:57,921][09190] Fps is (10 sec: 47513.6, 60 sec: 42325.4, 300 sec: 42598.4). Total num frames: 4504420352. Throughput: 0: 42462.2. Samples: 783242360. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2024-06-28 17:44:57,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 17:45:01,922][09423] Updated weights for policy 0, policy_version 274937 (0.0035) [2024-06-28 17:45:02,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42871.5, 300 sec: 42487.3). Total num frames: 4504616960. Throughput: 0: 42646.8. Samples: 783500280. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2024-06-28 17:45:02,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:45:04,676][09423] Updated weights for policy 0, policy_version 274947 (0.0026) [2024-06-28 17:45:07,921][09190] Fps is (10 sec: 40959.7, 60 sec: 42325.4, 300 sec: 42487.3). Total num frames: 4504829952. Throughput: 0: 42617.4. Samples: 783749220. Policy #0 lag: (min: 0.0, avg: 8.0, max: 18.0) [2024-06-28 17:45:07,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:45:09,742][09423] Updated weights for policy 0, policy_version 274957 (0.0036) [2024-06-28 17:45:12,690][09423] Updated weights for policy 0, policy_version 274967 (0.0032) [2024-06-28 17:45:12,921][09190] Fps is (10 sec: 44237.2, 60 sec: 42325.4, 300 sec: 42487.3). Total num frames: 4505059328. Throughput: 0: 42464.6. Samples: 783879020. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 17:45:12,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 17:45:16,986][09423] Updated weights for policy 0, policy_version 274977 (0.0026) [2024-06-28 17:45:17,923][09190] Fps is (10 sec: 40951.9, 60 sec: 42597.0, 300 sec: 42487.0). Total num frames: 4505239552. Throughput: 0: 42780.8. Samples: 784138000. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 17:45:17,924][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 17:45:20,057][09423] Updated weights for policy 0, policy_version 274987 (0.0030) [2024-06-28 17:45:22,924][09190] Fps is (10 sec: 40949.6, 60 sec: 42323.7, 300 sec: 42487.0). Total num frames: 4505468928. Throughput: 0: 42631.1. Samples: 784382880. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 17:45:22,924][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:45:24,941][09423] Updated weights for policy 0, policy_version 274997 (0.0038) [2024-06-28 17:45:27,922][09190] Fps is (10 sec: 45884.3, 60 sec: 42325.3, 300 sec: 42487.3). Total num frames: 4505698304. Throughput: 0: 42694.0. Samples: 784517700. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 17:45:27,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:45:27,984][09423] Updated weights for policy 0, policy_version 275007 (0.0028) [2024-06-28 17:45:32,666][09423] Updated weights for policy 0, policy_version 275017 (0.0034) [2024-06-28 17:45:32,921][09190] Fps is (10 sec: 42608.9, 60 sec: 42872.5, 300 sec: 42542.9). Total num frames: 4505894912. Throughput: 0: 42592.0. Samples: 784771440. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 17:45:32,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:45:35,590][09423] Updated weights for policy 0, policy_version 275027 (0.0034) [2024-06-28 17:45:37,921][09190] Fps is (10 sec: 40960.5, 60 sec: 42598.5, 300 sec: 42543.2). Total num frames: 4506107904. Throughput: 0: 42498.2. Samples: 785020280. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 17:45:37,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 17:45:40,158][09423] Updated weights for policy 0, policy_version 275037 (0.0036) [2024-06-28 17:45:42,922][09190] Fps is (10 sec: 44236.4, 60 sec: 42598.2, 300 sec: 42487.3). Total num frames: 4506337280. Throughput: 0: 42372.8. Samples: 785149140. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 17:45:42,931][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:45:43,363][09423] Updated weights for policy 0, policy_version 275047 (0.0039) [2024-06-28 17:45:47,921][09190] Fps is (10 sec: 40959.7, 60 sec: 42871.4, 300 sec: 42431.8). Total num frames: 4506517504. Throughput: 0: 42341.8. Samples: 785405660. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 17:45:47,930][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 17:45:48,110][09423] Updated weights for policy 0, policy_version 275057 (0.0036) [2024-06-28 17:45:51,231][09423] Updated weights for policy 0, policy_version 275067 (0.0031) [2024-06-28 17:45:52,921][09190] Fps is (10 sec: 39322.0, 60 sec: 42052.3, 300 sec: 42431.8). Total num frames: 4506730496. Throughput: 0: 42515.2. Samples: 785662400. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 17:45:52,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 17:45:54,838][09403] Signal inference workers to stop experience collection... (10800 times) [2024-06-28 17:45:54,838][09403] Signal inference workers to resume experience collection... (10800 times) [2024-06-28 17:45:54,881][09423] InferenceWorker_p0-w0: stopping experience collection (10800 times) [2024-06-28 17:45:54,881][09423] InferenceWorker_p0-w0: resuming experience collection (10800 times) [2024-06-28 17:45:55,533][09423] Updated weights for policy 0, policy_version 275077 (0.0042) [2024-06-28 17:45:57,921][09190] Fps is (10 sec: 44237.3, 60 sec: 42325.4, 300 sec: 42487.3). Total num frames: 4506959872. Throughput: 0: 42395.1. Samples: 785786800. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 17:45:57,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:45:58,646][09423] Updated weights for policy 0, policy_version 275087 (0.0028) [2024-06-28 17:46:02,921][09190] Fps is (10 sec: 44237.0, 60 sec: 42598.5, 300 sec: 42487.3). Total num frames: 4507172864. Throughput: 0: 42404.7. Samples: 786046120. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 17:46:02,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:46:03,152][09423] Updated weights for policy 0, policy_version 275097 (0.0039) [2024-06-28 17:46:06,124][09423] Updated weights for policy 0, policy_version 275107 (0.0031) [2024-06-28 17:46:07,923][09190] Fps is (10 sec: 40951.1, 60 sec: 42323.9, 300 sec: 42487.0). Total num frames: 4507369472. Throughput: 0: 42621.7. Samples: 786300840. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 17:46:07,924][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 17:46:10,734][09423] Updated weights for policy 0, policy_version 275117 (0.0028) [2024-06-28 17:46:12,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42325.3, 300 sec: 42487.3). Total num frames: 4507598848. Throughput: 0: 42435.7. Samples: 786427300. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 17:46:12,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 17:46:14,602][09423] Updated weights for policy 0, policy_version 275127 (0.0036) [2024-06-28 17:46:17,922][09190] Fps is (10 sec: 44245.6, 60 sec: 42872.9, 300 sec: 42487.3). Total num frames: 4507811840. Throughput: 0: 42611.5. Samples: 786688960. Policy #0 lag: (min: 0.0, avg: 10.8, max: 21.0) [2024-06-28 17:46:17,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 17:46:18,054][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000275136_4507828224.pth... [2024-06-28 17:46:18,099][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000274512_4497604608.pth [2024-06-28 17:46:18,305][09423] Updated weights for policy 0, policy_version 275137 (0.0036) [2024-06-28 17:46:22,298][09423] Updated weights for policy 0, policy_version 275147 (0.0032) [2024-06-28 17:46:22,921][09190] Fps is (10 sec: 42598.0, 60 sec: 42600.1, 300 sec: 42542.8). Total num frames: 4508024832. Throughput: 0: 42731.5. Samples: 786943200. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 17:46:22,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 17:46:25,829][09423] Updated weights for policy 0, policy_version 275157 (0.0028) [2024-06-28 17:46:27,921][09190] Fps is (10 sec: 45875.7, 60 sec: 42871.5, 300 sec: 42654.3). Total num frames: 4508270592. Throughput: 0: 42673.0. Samples: 787069420. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 17:46:27,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:46:30,022][09423] Updated weights for policy 0, policy_version 275167 (0.0037) [2024-06-28 17:46:32,921][09190] Fps is (10 sec: 44237.3, 60 sec: 42871.5, 300 sec: 42542.9). Total num frames: 4508467200. Throughput: 0: 42841.9. Samples: 787333540. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 17:46:32,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 17:46:33,482][09423] Updated weights for policy 0, policy_version 275177 (0.0028) [2024-06-28 17:46:37,722][09423] Updated weights for policy 0, policy_version 275187 (0.0028) [2024-06-28 17:46:37,921][09190] Fps is (10 sec: 39321.8, 60 sec: 42598.4, 300 sec: 42542.9). Total num frames: 4508663808. Throughput: 0: 42833.8. Samples: 787589920. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 17:46:37,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 17:46:41,156][09423] Updated weights for policy 0, policy_version 275197 (0.0036) [2024-06-28 17:46:42,921][09190] Fps is (10 sec: 44236.6, 60 sec: 42871.6, 300 sec: 42709.5). Total num frames: 4508909568. Throughput: 0: 42780.9. Samples: 787711940. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 17:46:42,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:46:45,517][09423] Updated weights for policy 0, policy_version 275207 (0.0026) [2024-06-28 17:46:47,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42871.5, 300 sec: 42487.3). Total num frames: 4509089792. Throughput: 0: 42718.2. Samples: 787968440. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 17:46:47,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:46:48,817][09423] Updated weights for policy 0, policy_version 275217 (0.0033) [2024-06-28 17:46:52,910][09423] Updated weights for policy 0, policy_version 275227 (0.0030) [2024-06-28 17:46:52,921][09190] Fps is (10 sec: 40959.6, 60 sec: 43144.5, 300 sec: 42542.9). Total num frames: 4509319168. Throughput: 0: 42833.9. Samples: 788228280. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 17:46:52,924][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 17:46:56,679][09423] Updated weights for policy 0, policy_version 275237 (0.0033) [2024-06-28 17:46:57,921][09190] Fps is (10 sec: 45875.2, 60 sec: 43144.5, 300 sec: 42653.9). Total num frames: 4509548544. Throughput: 0: 42713.3. Samples: 788349400. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 17:46:57,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 17:47:00,944][09423] Updated weights for policy 0, policy_version 275247 (0.0027) [2024-06-28 17:47:02,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42598.3, 300 sec: 42487.3). Total num frames: 4509728768. Throughput: 0: 42648.0. Samples: 788608120. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 17:47:02,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 17:47:04,336][09423] Updated weights for policy 0, policy_version 275257 (0.0036) [2024-06-28 17:47:07,921][09190] Fps is (10 sec: 39321.0, 60 sec: 42872.9, 300 sec: 42487.3). Total num frames: 4509941760. Throughput: 0: 42798.6. Samples: 788869140. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 17:47:07,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 17:47:08,441][09423] Updated weights for policy 0, policy_version 275267 (0.0037) [2024-06-28 17:47:12,134][09423] Updated weights for policy 0, policy_version 275277 (0.0037) [2024-06-28 17:47:12,300][09403] Signal inference workers to stop experience collection... (10850 times) [2024-06-28 17:47:12,301][09403] Signal inference workers to resume experience collection... (10850 times) [2024-06-28 17:47:12,313][09423] InferenceWorker_p0-w0: stopping experience collection (10850 times) [2024-06-28 17:47:12,314][09423] InferenceWorker_p0-w0: resuming experience collection (10850 times) [2024-06-28 17:47:12,924][09190] Fps is (10 sec: 45864.2, 60 sec: 43142.7, 300 sec: 42653.6). Total num frames: 4510187520. Throughput: 0: 42783.9. Samples: 788994800. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 17:47:12,924][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:47:15,906][09423] Updated weights for policy 0, policy_version 275287 (0.0035) [2024-06-28 17:47:17,922][09190] Fps is (10 sec: 44236.6, 60 sec: 42871.4, 300 sec: 42598.4). Total num frames: 4510384128. Throughput: 0: 42746.5. Samples: 789257140. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 17:47:17,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:47:17,934][09190] No heartbeat for components: RolloutWorker_w20 (2376 seconds) [2024-06-28 17:47:19,436][09423] Updated weights for policy 0, policy_version 275297 (0.0036) [2024-06-28 17:47:22,921][09190] Fps is (10 sec: 39331.3, 60 sec: 42598.4, 300 sec: 42487.3). Total num frames: 4510580736. Throughput: 0: 42810.2. Samples: 789516380. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 17:47:22,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 17:47:23,630][09423] Updated weights for policy 0, policy_version 275307 (0.0026) [2024-06-28 17:47:27,222][09423] Updated weights for policy 0, policy_version 275317 (0.0037) [2024-06-28 17:47:27,921][09190] Fps is (10 sec: 44237.4, 60 sec: 42598.4, 300 sec: 42709.8). Total num frames: 4510826496. Throughput: 0: 42755.5. Samples: 789635940. Policy #0 lag: (min: 0.0, avg: 10.3, max: 22.0) [2024-06-28 17:47:27,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 17:47:31,445][09423] Updated weights for policy 0, policy_version 275327 (0.0031) [2024-06-28 17:47:32,922][09190] Fps is (10 sec: 47513.2, 60 sec: 43144.4, 300 sec: 42709.5). Total num frames: 4511055872. Throughput: 0: 42824.3. Samples: 789895540. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 17:47:32,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 17:47:34,920][09423] Updated weights for policy 0, policy_version 275337 (0.0030) [2024-06-28 17:47:37,921][09190] Fps is (10 sec: 39321.6, 60 sec: 42598.4, 300 sec: 42542.9). Total num frames: 4511219712. Throughput: 0: 42769.0. Samples: 790152880. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 17:47:37,922][09190] Avg episode reward: [(0, '0.759')] [2024-06-28 17:47:38,813][09423] Updated weights for policy 0, policy_version 275347 (0.0027) [2024-06-28 17:47:42,294][09423] Updated weights for policy 0, policy_version 275357 (0.0029) [2024-06-28 17:47:42,922][09190] Fps is (10 sec: 39321.2, 60 sec: 42325.2, 300 sec: 42653.9). Total num frames: 4511449088. Throughput: 0: 42766.0. Samples: 790273880. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 17:47:42,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 17:47:46,742][09423] Updated weights for policy 0, policy_version 275367 (0.0044) [2024-06-28 17:47:47,921][09190] Fps is (10 sec: 44236.4, 60 sec: 42871.4, 300 sec: 42542.8). Total num frames: 4511662080. Throughput: 0: 42700.5. Samples: 790529640. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 17:47:47,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 17:47:50,489][09423] Updated weights for policy 0, policy_version 275377 (0.0034) [2024-06-28 17:47:52,922][09190] Fps is (10 sec: 40958.4, 60 sec: 42325.0, 300 sec: 42487.2). Total num frames: 4511858688. Throughput: 0: 42564.4. Samples: 790784560. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 17:47:52,923][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 17:47:54,485][09423] Updated weights for policy 0, policy_version 275387 (0.0028) [2024-06-28 17:47:57,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42052.2, 300 sec: 42598.4). Total num frames: 4512071680. Throughput: 0: 42495.2. Samples: 790906980. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 17:47:57,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 17:47:58,271][09423] Updated weights for policy 0, policy_version 275397 (0.0038) [2024-06-28 17:48:02,324][09423] Updated weights for policy 0, policy_version 275407 (0.0034) [2024-06-28 17:48:02,921][09190] Fps is (10 sec: 45877.9, 60 sec: 43144.6, 300 sec: 42598.4). Total num frames: 4512317440. Throughput: 0: 42433.9. Samples: 791166660. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 17:48:02,925][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 17:48:05,691][09423] Updated weights for policy 0, policy_version 275417 (0.0028) [2024-06-28 17:48:07,922][09190] Fps is (10 sec: 42598.1, 60 sec: 42598.4, 300 sec: 42542.8). Total num frames: 4512497664. Throughput: 0: 42393.7. Samples: 791424100. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 17:48:07,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 17:48:09,607][09423] Updated weights for policy 0, policy_version 275427 (0.0032) [2024-06-28 17:48:12,922][09190] Fps is (10 sec: 42598.1, 60 sec: 42600.1, 300 sec: 42709.5). Total num frames: 4512743424. Throughput: 0: 42575.0. Samples: 791551820. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 17:48:12,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:48:13,368][09423] Updated weights for policy 0, policy_version 275437 (0.0031) [2024-06-28 17:48:17,193][09423] Updated weights for policy 0, policy_version 275447 (0.0037) [2024-06-28 17:48:17,921][09190] Fps is (10 sec: 44237.5, 60 sec: 42598.5, 300 sec: 42542.9). Total num frames: 4512940032. Throughput: 0: 42568.6. Samples: 791811120. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 17:48:17,922][09190] Avg episode reward: [(0, '0.731')] [2024-06-28 17:48:17,962][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000275449_4512956416.pth... [2024-06-28 17:48:18,004][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000274823_4502700032.pth [2024-06-28 17:48:21,030][09423] Updated weights for policy 0, policy_version 275457 (0.0029) [2024-06-28 17:48:22,922][09190] Fps is (10 sec: 39321.6, 60 sec: 42598.3, 300 sec: 42542.8). Total num frames: 4513136640. Throughput: 0: 42449.2. Samples: 792063100. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 17:48:22,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:48:25,033][09423] Updated weights for policy 0, policy_version 275467 (0.0040) [2024-06-28 17:48:27,922][09190] Fps is (10 sec: 44236.0, 60 sec: 42598.3, 300 sec: 42765.0). Total num frames: 4513382400. Throughput: 0: 42628.1. Samples: 792192140. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 17:48:27,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:48:28,711][09423] Updated weights for policy 0, policy_version 275477 (0.0026) [2024-06-28 17:48:32,813][09403] Signal inference workers to stop experience collection... (10900 times) [2024-06-28 17:48:32,819][09403] Signal inference workers to resume experience collection... (10900 times) [2024-06-28 17:48:32,828][09423] Updated weights for policy 0, policy_version 275487 (0.0042) [2024-06-28 17:48:32,850][09423] InferenceWorker_p0-w0: stopping experience collection (10900 times) [2024-06-28 17:48:32,850][09423] InferenceWorker_p0-w0: resuming experience collection (10900 times) [2024-06-28 17:48:32,921][09190] Fps is (10 sec: 44237.5, 60 sec: 42052.4, 300 sec: 42542.9). Total num frames: 4513579008. Throughput: 0: 42618.8. Samples: 792447480. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 17:48:32,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 17:48:36,246][09423] Updated weights for policy 0, policy_version 275497 (0.0038) [2024-06-28 17:48:37,921][09190] Fps is (10 sec: 40960.6, 60 sec: 42871.5, 300 sec: 42542.9). Total num frames: 4513792000. Throughput: 0: 42701.5. Samples: 792706100. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 17:48:37,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:48:40,240][09423] Updated weights for policy 0, policy_version 275507 (0.0029) [2024-06-28 17:48:42,921][09190] Fps is (10 sec: 42597.9, 60 sec: 42598.5, 300 sec: 42709.5). Total num frames: 4514004992. Throughput: 0: 42683.5. Samples: 792827740. Policy #0 lag: (min: 0.0, avg: 9.6, max: 21.0) [2024-06-28 17:48:42,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 17:48:44,057][09423] Updated weights for policy 0, policy_version 275517 (0.0034) [2024-06-28 17:48:47,921][09190] Fps is (10 sec: 42598.7, 60 sec: 42598.5, 300 sec: 42542.9). Total num frames: 4514217984. Throughput: 0: 42638.3. Samples: 793085380. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-28 17:48:47,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 17:48:47,941][09423] Updated weights for policy 0, policy_version 275527 (0.0034) [2024-06-28 17:48:51,705][09423] Updated weights for policy 0, policy_version 275537 (0.0031) [2024-06-28 17:48:52,921][09190] Fps is (10 sec: 40960.1, 60 sec: 42598.8, 300 sec: 42487.3). Total num frames: 4514414592. Throughput: 0: 42528.1. Samples: 793337860. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-28 17:48:52,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:48:55,996][09423] Updated weights for policy 0, policy_version 275547 (0.0029) [2024-06-28 17:48:57,922][09190] Fps is (10 sec: 42597.6, 60 sec: 42871.4, 300 sec: 42709.5). Total num frames: 4514643968. Throughput: 0: 42391.1. Samples: 793459420. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-28 17:48:57,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 17:48:59,568][09423] Updated weights for policy 0, policy_version 275557 (0.0040) [2024-06-28 17:49:02,921][09190] Fps is (10 sec: 44236.7, 60 sec: 42325.3, 300 sec: 42598.4). Total num frames: 4514856960. Throughput: 0: 42361.2. Samples: 793717380. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-28 17:49:02,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:49:03,324][09423] Updated weights for policy 0, policy_version 275567 (0.0032) [2024-06-28 17:49:07,336][09423] Updated weights for policy 0, policy_version 275577 (0.0041) [2024-06-28 17:49:07,921][09190] Fps is (10 sec: 40960.5, 60 sec: 42598.5, 300 sec: 42487.3). Total num frames: 4515053568. Throughput: 0: 42485.9. Samples: 793974960. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-28 17:49:07,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:49:11,326][09423] Updated weights for policy 0, policy_version 275587 (0.0037) [2024-06-28 17:49:12,922][09190] Fps is (10 sec: 40959.8, 60 sec: 42052.3, 300 sec: 42653.9). Total num frames: 4515266560. Throughput: 0: 42460.9. Samples: 794102880. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-28 17:49:12,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:49:15,415][09423] Updated weights for policy 0, policy_version 275597 (0.0027) [2024-06-28 17:49:17,921][09190] Fps is (10 sec: 45875.4, 60 sec: 42871.5, 300 sec: 42654.0). Total num frames: 4515512320. Throughput: 0: 42483.6. Samples: 794359240. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-28 17:49:17,922][09190] Avg episode reward: [(0, '0.713')] [2024-06-28 17:49:19,312][09423] Updated weights for policy 0, policy_version 275607 (0.0023) [2024-06-28 17:49:22,809][09423] Updated weights for policy 0, policy_version 275617 (0.0030) [2024-06-28 17:49:22,921][09190] Fps is (10 sec: 44237.4, 60 sec: 42871.6, 300 sec: 42542.9). Total num frames: 4515708928. Throughput: 0: 42463.1. Samples: 794616940. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-28 17:49:22,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 17:49:26,713][09423] Updated weights for policy 0, policy_version 275627 (0.0030) [2024-06-28 17:49:27,924][09190] Fps is (10 sec: 40949.4, 60 sec: 42323.6, 300 sec: 42709.3). Total num frames: 4515921920. Throughput: 0: 42511.0. Samples: 794740840. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-28 17:49:27,933][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:49:30,758][09423] Updated weights for policy 0, policy_version 275637 (0.0034) [2024-06-28 17:49:32,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42325.3, 300 sec: 42598.4). Total num frames: 4516118528. Throughput: 0: 42435.5. Samples: 794994980. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-28 17:49:32,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:49:34,454][09423] Updated weights for policy 0, policy_version 275647 (0.0035) [2024-06-28 17:49:37,921][09190] Fps is (10 sec: 40970.3, 60 sec: 42325.3, 300 sec: 42542.8). Total num frames: 4516331520. Throughput: 0: 42508.5. Samples: 795250740. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-28 17:49:37,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 17:49:38,557][09423] Updated weights for policy 0, policy_version 275657 (0.0030) [2024-06-28 17:49:42,094][09423] Updated weights for policy 0, policy_version 275667 (0.0030) [2024-06-28 17:49:42,922][09190] Fps is (10 sec: 42597.9, 60 sec: 42325.3, 300 sec: 42709.5). Total num frames: 4516544512. Throughput: 0: 42603.6. Samples: 795376580. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-28 17:49:42,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 17:49:46,102][09423] Updated weights for policy 0, policy_version 275677 (0.0028) [2024-06-28 17:49:47,922][09190] Fps is (10 sec: 44236.4, 60 sec: 42598.3, 300 sec: 42598.4). Total num frames: 4516773888. Throughput: 0: 42494.6. Samples: 795629640. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-28 17:49:47,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 17:49:49,895][09423] Updated weights for policy 0, policy_version 275687 (0.0027) [2024-06-28 17:49:52,921][09190] Fps is (10 sec: 40960.4, 60 sec: 42325.4, 300 sec: 42487.3). Total num frames: 4516954112. Throughput: 0: 42524.0. Samples: 795888540. Policy #0 lag: (min: 0.0, avg: 11.4, max: 23.0) [2024-06-28 17:49:52,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:49:53,739][09423] Updated weights for policy 0, policy_version 275697 (0.0036) [2024-06-28 17:49:57,921][09190] Fps is (10 sec: 39322.0, 60 sec: 42052.3, 300 sec: 42542.9). Total num frames: 4517167104. Throughput: 0: 42346.3. Samples: 796008460. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 17:49:57,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 17:49:58,158][09423] Updated weights for policy 0, policy_version 275707 (0.0028) [2024-06-28 17:50:01,695][09423] Updated weights for policy 0, policy_version 275717 (0.0032) [2024-06-28 17:50:02,921][09190] Fps is (10 sec: 44236.9, 60 sec: 42325.4, 300 sec: 42598.4). Total num frames: 4517396480. Throughput: 0: 42358.6. Samples: 796265380. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 17:50:02,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:50:05,324][09403] Signal inference workers to stop experience collection... (10950 times) [2024-06-28 17:50:05,376][09423] InferenceWorker_p0-w0: stopping experience collection (10950 times) [2024-06-28 17:50:05,438][09403] Signal inference workers to resume experience collection... (10950 times) [2024-06-28 17:50:05,438][09423] InferenceWorker_p0-w0: resuming experience collection (10950 times) [2024-06-28 17:50:05,572][09423] Updated weights for policy 0, policy_version 275727 (0.0032) [2024-06-28 17:50:07,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42325.3, 300 sec: 42487.3). Total num frames: 4517593088. Throughput: 0: 42241.7. Samples: 796517820. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 17:50:07,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:50:09,187][09423] Updated weights for policy 0, policy_version 275737 (0.0035) [2024-06-28 17:50:12,921][09190] Fps is (10 sec: 42598.0, 60 sec: 42598.4, 300 sec: 42654.2). Total num frames: 4517822464. Throughput: 0: 42299.6. Samples: 796644220. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 17:50:12,922][09190] Avg episode reward: [(0, '0.735')] [2024-06-28 17:50:13,041][09423] Updated weights for policy 0, policy_version 275747 (0.0036) [2024-06-28 17:50:16,923][09423] Updated weights for policy 0, policy_version 275757 (0.0031) [2024-06-28 17:50:17,921][09190] Fps is (10 sec: 42598.7, 60 sec: 41779.2, 300 sec: 42543.2). Total num frames: 4518019072. Throughput: 0: 42307.6. Samples: 796898820. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 17:50:17,922][09190] Avg episode reward: [(0, '0.725')] [2024-06-28 17:50:17,927][09190] No heartbeat for components: RolloutWorker_w20 (2556 seconds) [2024-06-28 17:50:18,003][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000275759_4518035456.pth... [2024-06-28 17:50:18,065][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000275136_4507828224.pth [2024-06-28 17:50:20,785][09423] Updated weights for policy 0, policy_version 275767 (0.0032) [2024-06-28 17:50:22,921][09190] Fps is (10 sec: 40960.6, 60 sec: 42052.3, 300 sec: 42487.3). Total num frames: 4518232064. Throughput: 0: 42349.0. Samples: 797156440. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 17:50:22,922][09190] Avg episode reward: [(0, '0.719')] [2024-06-28 17:50:24,632][09423] Updated weights for policy 0, policy_version 275777 (0.0025) [2024-06-28 17:50:27,921][09190] Fps is (10 sec: 42598.1, 60 sec: 42054.0, 300 sec: 42542.9). Total num frames: 4518445056. Throughput: 0: 42381.4. Samples: 797283740. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 17:50:27,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 17:50:28,747][09423] Updated weights for policy 0, policy_version 275787 (0.0033) [2024-06-28 17:50:32,467][09423] Updated weights for policy 0, policy_version 275797 (0.0028) [2024-06-28 17:50:32,921][09190] Fps is (10 sec: 44236.5, 60 sec: 42598.4, 300 sec: 42598.4). Total num frames: 4518674432. Throughput: 0: 42380.1. Samples: 797536740. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 17:50:32,922][09190] Avg episode reward: [(0, '0.728')] [2024-06-28 17:50:36,367][09423] Updated weights for policy 0, policy_version 275807 (0.0032) [2024-06-28 17:50:37,922][09190] Fps is (10 sec: 42597.9, 60 sec: 42325.3, 300 sec: 42487.3). Total num frames: 4518871040. Throughput: 0: 42334.1. Samples: 797793580. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 17:50:37,922][09190] Avg episode reward: [(0, '0.707')] [2024-06-28 17:50:40,115][09423] Updated weights for policy 0, policy_version 275817 (0.0035) [2024-06-28 17:50:42,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42325.4, 300 sec: 42598.4). Total num frames: 4519084032. Throughput: 0: 42429.3. Samples: 797917780. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 17:50:42,922][09190] Avg episode reward: [(0, '0.709')] [2024-06-28 17:50:43,798][09423] Updated weights for policy 0, policy_version 275827 (0.0026) [2024-06-28 17:50:47,760][09423] Updated weights for policy 0, policy_version 275837 (0.0027) [2024-06-28 17:50:47,922][09190] Fps is (10 sec: 44236.2, 60 sec: 42325.2, 300 sec: 42653.9). Total num frames: 4519313408. Throughput: 0: 42429.1. Samples: 798174700. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 17:50:47,922][09190] Avg episode reward: [(0, '0.735')] [2024-06-28 17:50:51,894][09423] Updated weights for policy 0, policy_version 275847 (0.0031) [2024-06-28 17:50:52,921][09190] Fps is (10 sec: 44236.6, 60 sec: 42871.4, 300 sec: 42598.4). Total num frames: 4519526400. Throughput: 0: 42313.7. Samples: 798421940. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 17:50:52,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 17:50:55,696][09423] Updated weights for policy 0, policy_version 275857 (0.0031) [2024-06-28 17:50:57,921][09190] Fps is (10 sec: 40961.5, 60 sec: 42598.5, 300 sec: 42542.9). Total num frames: 4519723008. Throughput: 0: 42371.3. Samples: 798550920. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 17:50:57,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 17:50:59,552][09423] Updated weights for policy 0, policy_version 275867 (0.0036) [2024-06-28 17:51:02,922][09190] Fps is (10 sec: 42597.9, 60 sec: 42598.3, 300 sec: 42654.2). Total num frames: 4519952384. Throughput: 0: 42491.4. Samples: 798810940. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 17:51:02,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 17:51:03,132][09423] Updated weights for policy 0, policy_version 275877 (0.0029) [2024-06-28 17:51:07,197][09423] Updated weights for policy 0, policy_version 275887 (0.0031) [2024-06-28 17:51:07,921][09190] Fps is (10 sec: 42598.0, 60 sec: 42598.4, 300 sec: 42542.9). Total num frames: 4520148992. Throughput: 0: 42511.1. Samples: 799069440. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 17:51:07,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 17:51:10,719][09423] Updated weights for policy 0, policy_version 275897 (0.0031) [2024-06-28 17:51:12,922][09190] Fps is (10 sec: 40960.2, 60 sec: 42325.3, 300 sec: 42542.9). Total num frames: 4520361984. Throughput: 0: 42350.6. Samples: 799189520. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 17:51:12,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 17:51:14,803][09423] Updated weights for policy 0, policy_version 275907 (0.0030) [2024-06-28 17:51:17,921][09190] Fps is (10 sec: 44236.6, 60 sec: 42871.4, 300 sec: 42598.4). Total num frames: 4520591360. Throughput: 0: 42436.9. Samples: 799446400. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 17:51:17,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 17:51:18,292][09423] Updated weights for policy 0, policy_version 275917 (0.0034) [2024-06-28 17:51:22,722][09423] Updated weights for policy 0, policy_version 275927 (0.0025) [2024-06-28 17:51:22,921][09190] Fps is (10 sec: 42598.7, 60 sec: 42598.3, 300 sec: 42431.8). Total num frames: 4520787968. Throughput: 0: 42471.6. Samples: 799704800. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 17:51:22,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:51:26,007][09423] Updated weights for policy 0, policy_version 275937 (0.0037) [2024-06-28 17:51:27,469][09403] Signal inference workers to stop experience collection... (11000 times) [2024-06-28 17:51:27,506][09423] InferenceWorker_p0-w0: stopping experience collection (11000 times) [2024-06-28 17:51:27,531][09403] Signal inference workers to resume experience collection... (11000 times) [2024-06-28 17:51:27,532][09423] InferenceWorker_p0-w0: resuming experience collection (11000 times) [2024-06-28 17:51:27,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42871.5, 300 sec: 42542.8). Total num frames: 4521017344. Throughput: 0: 42350.2. Samples: 799823540. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 17:51:27,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:51:30,314][09423] Updated weights for policy 0, policy_version 275947 (0.0037) [2024-06-28 17:51:32,921][09190] Fps is (10 sec: 44236.6, 60 sec: 42598.4, 300 sec: 42598.4). Total num frames: 4521230336. Throughput: 0: 42540.2. Samples: 800089000. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 17:51:32,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 17:51:33,741][09423] Updated weights for policy 0, policy_version 275957 (0.0044) [2024-06-28 17:51:37,921][09190] Fps is (10 sec: 40960.3, 60 sec: 42598.5, 300 sec: 42431.8). Total num frames: 4521426944. Throughput: 0: 42668.5. Samples: 800342020. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 17:51:37,922][09190] Avg episode reward: [(0, '0.734')] [2024-06-28 17:51:37,973][09423] Updated weights for policy 0, policy_version 275967 (0.0043) [2024-06-28 17:51:41,722][09423] Updated weights for policy 0, policy_version 275977 (0.0025) [2024-06-28 17:51:42,922][09190] Fps is (10 sec: 42597.9, 60 sec: 42871.3, 300 sec: 42598.4). Total num frames: 4521656320. Throughput: 0: 42545.9. Samples: 800465500. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 17:51:42,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 17:51:45,942][09423] Updated weights for policy 0, policy_version 275987 (0.0032) [2024-06-28 17:51:47,921][09190] Fps is (10 sec: 44236.6, 60 sec: 42598.6, 300 sec: 42542.9). Total num frames: 4521869312. Throughput: 0: 42562.4. Samples: 800726240. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 17:51:47,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 17:51:49,650][09423] Updated weights for policy 0, policy_version 275997 (0.0026) [2024-06-28 17:51:52,921][09190] Fps is (10 sec: 40960.5, 60 sec: 42325.3, 300 sec: 42431.8). Total num frames: 4522065920. Throughput: 0: 42443.9. Samples: 800979420. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 17:51:52,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:51:53,547][09423] Updated weights for policy 0, policy_version 276007 (0.0034) [2024-06-28 17:51:57,127][09423] Updated weights for policy 0, policy_version 276017 (0.0041) [2024-06-28 17:51:57,921][09190] Fps is (10 sec: 39321.4, 60 sec: 42325.2, 300 sec: 42487.3). Total num frames: 4522262528. Throughput: 0: 42535.6. Samples: 801103620. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 17:51:57,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:52:01,559][09423] Updated weights for policy 0, policy_version 276027 (0.0029) [2024-06-28 17:52:02,921][09190] Fps is (10 sec: 44237.0, 60 sec: 42598.5, 300 sec: 42598.4). Total num frames: 4522508288. Throughput: 0: 42456.5. Samples: 801356940. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 17:52:02,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 17:52:05,263][09423] Updated weights for policy 0, policy_version 276037 (0.0041) [2024-06-28 17:52:07,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42325.3, 300 sec: 42376.6). Total num frames: 4522688512. Throughput: 0: 42373.4. Samples: 801611600. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 17:52:07,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 17:52:09,248][09423] Updated weights for policy 0, policy_version 276047 (0.0031) [2024-06-28 17:52:12,922][09190] Fps is (10 sec: 39321.2, 60 sec: 42325.3, 300 sec: 42431.8). Total num frames: 4522901504. Throughput: 0: 42530.1. Samples: 801737400. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 17:52:12,922][09190] Avg episode reward: [(0, '0.733')] [2024-06-28 17:52:12,971][09423] Updated weights for policy 0, policy_version 276057 (0.0031) [2024-06-28 17:52:16,663][09423] Updated weights for policy 0, policy_version 276067 (0.0030) [2024-06-28 17:52:17,921][09190] Fps is (10 sec: 44236.8, 60 sec: 42325.4, 300 sec: 42542.9). Total num frames: 4523130880. Throughput: 0: 42388.9. Samples: 801996500. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 17:52:17,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 17:52:17,998][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000276071_4523147264.pth... [2024-06-28 17:52:18,045][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000275449_4512956416.pth [2024-06-28 17:52:20,431][09423] Updated weights for policy 0, policy_version 276077 (0.0030) [2024-06-28 17:52:22,922][09190] Fps is (10 sec: 42598.5, 60 sec: 42325.3, 300 sec: 42376.2). Total num frames: 4523327488. Throughput: 0: 42462.1. Samples: 802252820. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 17:52:22,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 17:52:24,490][09423] Updated weights for policy 0, policy_version 276087 (0.0034) [2024-06-28 17:52:27,922][09190] Fps is (10 sec: 40959.5, 60 sec: 42052.2, 300 sec: 42320.7). Total num frames: 4523540480. Throughput: 0: 42426.3. Samples: 802374680. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 17:52:27,922][09190] Avg episode reward: [(0, '0.733')] [2024-06-28 17:52:28,344][09423] Updated weights for policy 0, policy_version 276097 (0.0029) [2024-06-28 17:52:31,895][09423] Updated weights for policy 0, policy_version 276107 (0.0022) [2024-06-28 17:52:32,921][09190] Fps is (10 sec: 45875.8, 60 sec: 42598.5, 300 sec: 42598.4). Total num frames: 4523786240. Throughput: 0: 42445.8. Samples: 802636300. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 17:52:32,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:52:35,872][09423] Updated weights for policy 0, policy_version 276117 (0.0032) [2024-06-28 17:52:37,924][09190] Fps is (10 sec: 42588.3, 60 sec: 42323.5, 300 sec: 42431.5). Total num frames: 4523966464. Throughput: 0: 42481.7. Samples: 802891200. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 17:52:37,924][09190] Avg episode reward: [(0, '0.732')] [2024-06-28 17:52:39,658][09423] Updated weights for policy 0, policy_version 276127 (0.0030) [2024-06-28 17:52:42,921][09190] Fps is (10 sec: 40959.5, 60 sec: 42325.4, 300 sec: 42487.3). Total num frames: 4524195840. Throughput: 0: 42420.8. Samples: 803012560. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 17:52:42,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 17:52:43,744][09423] Updated weights for policy 0, policy_version 276137 (0.0040) [2024-06-28 17:52:47,096][09423] Updated weights for policy 0, policy_version 276147 (0.0029) [2024-06-28 17:52:47,921][09190] Fps is (10 sec: 45886.3, 60 sec: 42598.3, 300 sec: 42598.5). Total num frames: 4524425216. Throughput: 0: 42614.6. Samples: 803274600. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 17:52:47,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 17:52:51,718][09423] Updated weights for policy 0, policy_version 276157 (0.0028) [2024-06-28 17:52:52,922][09190] Fps is (10 sec: 40959.9, 60 sec: 42325.3, 300 sec: 42487.3). Total num frames: 4524605440. Throughput: 0: 42741.2. Samples: 803534960. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 17:52:52,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:52:54,790][09423] Updated weights for policy 0, policy_version 276167 (0.0037) [2024-06-28 17:52:57,921][09190] Fps is (10 sec: 39321.8, 60 sec: 42598.4, 300 sec: 42376.2). Total num frames: 4524818432. Throughput: 0: 42584.5. Samples: 803653700. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 17:52:57,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 17:52:59,170][09423] Updated weights for policy 0, policy_version 276177 (0.0028) [2024-06-28 17:53:01,481][09403] Signal inference workers to stop experience collection... (11050 times) [2024-06-28 17:53:01,526][09423] InferenceWorker_p0-w0: stopping experience collection (11050 times) [2024-06-28 17:53:01,536][09403] Signal inference workers to resume experience collection... (11050 times) [2024-06-28 17:53:01,543][09423] InferenceWorker_p0-w0: resuming experience collection (11050 times) [2024-06-28 17:53:02,596][09423] Updated weights for policy 0, policy_version 276187 (0.0037) [2024-06-28 17:53:02,923][09190] Fps is (10 sec: 45867.3, 60 sec: 42597.1, 300 sec: 42598.2). Total num frames: 4525064192. Throughput: 0: 42538.7. Samples: 803910820. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 17:53:02,924][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:53:06,979][09423] Updated weights for policy 0, policy_version 276197 (0.0032) [2024-06-28 17:53:07,922][09190] Fps is (10 sec: 40959.6, 60 sec: 42325.2, 300 sec: 42320.7). Total num frames: 4525228032. Throughput: 0: 42563.1. Samples: 804168160. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 17:53:07,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:53:10,102][09423] Updated weights for policy 0, policy_version 276207 (0.0025) [2024-06-28 17:53:12,921][09190] Fps is (10 sec: 39328.8, 60 sec: 42598.5, 300 sec: 42431.8). Total num frames: 4525457408. Throughput: 0: 42538.8. Samples: 804288920. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 17:53:12,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 17:53:14,657][09423] Updated weights for policy 0, policy_version 276217 (0.0033) [2024-06-28 17:53:17,815][09423] Updated weights for policy 0, policy_version 276227 (0.0035) [2024-06-28 17:53:17,922][09190] Fps is (10 sec: 47513.6, 60 sec: 42871.4, 300 sec: 42598.4). Total num frames: 4525703168. Throughput: 0: 42450.5. Samples: 804546580. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 17:53:17,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:53:17,931][09190] No heartbeat for components: RolloutWorker_w20 (2736 seconds) [2024-06-28 17:53:22,454][09423] Updated weights for policy 0, policy_version 276237 (0.0027) [2024-06-28 17:53:22,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42325.4, 300 sec: 42320.7). Total num frames: 4525867008. Throughput: 0: 42549.9. Samples: 804805840. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 17:53:22,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:53:25,603][09423] Updated weights for policy 0, policy_version 276247 (0.0039) [2024-06-28 17:53:27,921][09190] Fps is (10 sec: 39321.8, 60 sec: 42598.4, 300 sec: 42431.8). Total num frames: 4526096384. Throughput: 0: 42461.8. Samples: 804923340. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 17:53:27,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 17:53:30,325][09423] Updated weights for policy 0, policy_version 276257 (0.0033) [2024-06-28 17:53:32,921][09190] Fps is (10 sec: 45875.1, 60 sec: 42325.3, 300 sec: 42487.3). Total num frames: 4526325760. Throughput: 0: 42383.6. Samples: 805181860. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-28 17:53:32,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 17:53:33,367][09423] Updated weights for policy 0, policy_version 276267 (0.0030) [2024-06-28 17:53:37,921][09190] Fps is (10 sec: 40960.6, 60 sec: 42327.1, 300 sec: 42376.3). Total num frames: 4526505984. Throughput: 0: 42322.8. Samples: 805439480. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-28 17:53:37,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:53:38,024][09423] Updated weights for policy 0, policy_version 276277 (0.0035) [2024-06-28 17:53:40,815][09423] Updated weights for policy 0, policy_version 276287 (0.0040) [2024-06-28 17:53:42,921][09190] Fps is (10 sec: 39321.9, 60 sec: 42052.4, 300 sec: 42376.2). Total num frames: 4526718976. Throughput: 0: 42228.1. Samples: 805553960. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-28 17:53:42,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 17:53:45,390][09423] Updated weights for policy 0, policy_version 276297 (0.0031) [2024-06-28 17:53:47,921][09190] Fps is (10 sec: 47513.3, 60 sec: 42598.5, 300 sec: 42598.4). Total num frames: 4526981120. Throughput: 0: 42419.1. Samples: 805819600. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-28 17:53:47,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:53:48,877][09423] Updated weights for policy 0, policy_version 276307 (0.0033) [2024-06-28 17:53:52,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42325.5, 300 sec: 42376.3). Total num frames: 4527144960. Throughput: 0: 42370.9. Samples: 806074840. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-28 17:53:52,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 17:53:53,222][09423] Updated weights for policy 0, policy_version 276317 (0.0032) [2024-06-28 17:53:56,342][09423] Updated weights for policy 0, policy_version 276327 (0.0029) [2024-06-28 17:53:57,923][09190] Fps is (10 sec: 37675.3, 60 sec: 42323.9, 300 sec: 42376.0). Total num frames: 4527357952. Throughput: 0: 42334.0. Samples: 806194040. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-28 17:53:57,924][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:54:01,201][09423] Updated weights for policy 0, policy_version 276337 (0.0038) [2024-06-28 17:54:02,921][09190] Fps is (10 sec: 45874.9, 60 sec: 42326.6, 300 sec: 42542.9). Total num frames: 4527603712. Throughput: 0: 42478.4. Samples: 806458100. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-28 17:54:02,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 17:54:04,283][09423] Updated weights for policy 0, policy_version 276347 (0.0026) [2024-06-28 17:54:07,921][09190] Fps is (10 sec: 42607.2, 60 sec: 42598.5, 300 sec: 42431.8). Total num frames: 4527783936. Throughput: 0: 42318.6. Samples: 806710180. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-28 17:54:07,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:54:08,774][09423] Updated weights for policy 0, policy_version 276357 (0.0041) [2024-06-28 17:54:12,595][09423] Updated weights for policy 0, policy_version 276367 (0.0034) [2024-06-28 17:54:12,922][09190] Fps is (10 sec: 40959.3, 60 sec: 42598.3, 300 sec: 42376.2). Total num frames: 4528013312. Throughput: 0: 42371.9. Samples: 806830080. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-28 17:54:12,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:54:16,546][09423] Updated weights for policy 0, policy_version 276377 (0.0026) [2024-06-28 17:54:17,921][09190] Fps is (10 sec: 45875.6, 60 sec: 42325.5, 300 sec: 42487.3). Total num frames: 4528242688. Throughput: 0: 42471.6. Samples: 807093080. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-28 17:54:17,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 17:54:18,032][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000276383_4528259072.pth... [2024-06-28 17:54:18,092][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000275759_4518035456.pth [2024-06-28 17:54:19,931][09423] Updated weights for policy 0, policy_version 276387 (0.0027) [2024-06-28 17:54:22,921][09190] Fps is (10 sec: 40960.8, 60 sec: 42598.4, 300 sec: 42376.6). Total num frames: 4528422912. Throughput: 0: 42511.1. Samples: 807352480. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-28 17:54:22,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:54:24,246][09423] Updated weights for policy 0, policy_version 276397 (0.0025) [2024-06-28 17:54:27,924][09190] Fps is (10 sec: 39311.6, 60 sec: 42323.6, 300 sec: 42431.4). Total num frames: 4528635904. Throughput: 0: 42540.7. Samples: 807468400. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-28 17:54:27,924][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:54:27,988][09423] Updated weights for policy 0, policy_version 276407 (0.0033) [2024-06-28 17:54:31,936][09423] Updated weights for policy 0, policy_version 276417 (0.0029) [2024-06-28 17:54:32,921][09190] Fps is (10 sec: 44236.6, 60 sec: 42325.4, 300 sec: 42487.3). Total num frames: 4528865280. Throughput: 0: 42526.2. Samples: 807733280. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-28 17:54:32,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 17:54:35,796][09423] Updated weights for policy 0, policy_version 276427 (0.0038) [2024-06-28 17:54:37,922][09190] Fps is (10 sec: 44247.3, 60 sec: 42871.3, 300 sec: 42487.3). Total num frames: 4529078272. Throughput: 0: 42533.2. Samples: 807988840. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-28 17:54:37,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 17:54:39,331][09423] Updated weights for policy 0, policy_version 276437 (0.0025) [2024-06-28 17:54:42,922][09190] Fps is (10 sec: 42597.7, 60 sec: 42871.3, 300 sec: 42431.8). Total num frames: 4529291264. Throughput: 0: 42675.2. Samples: 808114340. Policy #0 lag: (min: 0.0, avg: 11.6, max: 22.0) [2024-06-28 17:54:42,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 17:54:43,143][09423] Updated weights for policy 0, policy_version 276447 (0.0033) [2024-06-28 17:54:46,896][09403] Signal inference workers to stop experience collection... (11100 times) [2024-06-28 17:54:46,897][09403] Signal inference workers to resume experience collection... (11100 times) [2024-06-28 17:54:46,914][09423] Updated weights for policy 0, policy_version 276457 (0.0027) [2024-06-28 17:54:46,941][09423] InferenceWorker_p0-w0: stopping experience collection (11100 times) [2024-06-28 17:54:46,941][09423] InferenceWorker_p0-w0: resuming experience collection (11100 times) [2024-06-28 17:54:47,921][09190] Fps is (10 sec: 44237.1, 60 sec: 42325.3, 300 sec: 42598.4). Total num frames: 4529520640. Throughput: 0: 42595.0. Samples: 808374880. Policy #0 lag: (min: 1.0, avg: 9.1, max: 20.0) [2024-06-28 17:54:47,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:54:51,006][09423] Updated weights for policy 0, policy_version 276467 (0.0035) [2024-06-28 17:54:52,921][09190] Fps is (10 sec: 39322.3, 60 sec: 42325.3, 300 sec: 42431.8). Total num frames: 4529684480. Throughput: 0: 42752.9. Samples: 808634060. Policy #0 lag: (min: 1.0, avg: 9.1, max: 20.0) [2024-06-28 17:54:52,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:54:54,453][09423] Updated weights for policy 0, policy_version 276477 (0.0036) [2024-06-28 17:54:57,921][09190] Fps is (10 sec: 39321.8, 60 sec: 42599.9, 300 sec: 42431.8). Total num frames: 4529913856. Throughput: 0: 42619.2. Samples: 808747940. Policy #0 lag: (min: 1.0, avg: 9.1, max: 20.0) [2024-06-28 17:54:57,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:54:58,770][09423] Updated weights for policy 0, policy_version 276487 (0.0030) [2024-06-28 17:55:02,214][09423] Updated weights for policy 0, policy_version 276497 (0.0026) [2024-06-28 17:55:02,921][09190] Fps is (10 sec: 45875.3, 60 sec: 42325.3, 300 sec: 42542.9). Total num frames: 4530143232. Throughput: 0: 42546.6. Samples: 809007680. Policy #0 lag: (min: 1.0, avg: 9.1, max: 20.0) [2024-06-28 17:55:02,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 17:55:06,566][09423] Updated weights for policy 0, policy_version 276507 (0.0027) [2024-06-28 17:55:07,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42598.4, 300 sec: 42431.8). Total num frames: 4530339840. Throughput: 0: 42580.4. Samples: 809268600. Policy #0 lag: (min: 1.0, avg: 9.1, max: 20.0) [2024-06-28 17:55:07,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:55:10,069][09423] Updated weights for policy 0, policy_version 276517 (0.0033) [2024-06-28 17:55:12,921][09190] Fps is (10 sec: 40959.8, 60 sec: 42325.4, 300 sec: 42487.3). Total num frames: 4530552832. Throughput: 0: 42547.2. Samples: 809382920. Policy #0 lag: (min: 1.0, avg: 9.1, max: 20.0) [2024-06-28 17:55:12,922][09190] Avg episode reward: [(0, '0.735')] [2024-06-28 17:55:14,255][09423] Updated weights for policy 0, policy_version 276527 (0.0030) [2024-06-28 17:55:17,449][09423] Updated weights for policy 0, policy_version 276537 (0.0036) [2024-06-28 17:55:17,921][09190] Fps is (10 sec: 44236.8, 60 sec: 42325.3, 300 sec: 42542.8). Total num frames: 4530782208. Throughput: 0: 42537.8. Samples: 809647480. Policy #0 lag: (min: 1.0, avg: 9.1, max: 20.0) [2024-06-28 17:55:17,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 17:55:22,047][09423] Updated weights for policy 0, policy_version 276547 (0.0035) [2024-06-28 17:55:22,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42325.3, 300 sec: 42431.8). Total num frames: 4530962432. Throughput: 0: 42599.2. Samples: 809905800. Policy #0 lag: (min: 1.0, avg: 9.1, max: 20.0) [2024-06-28 17:55:22,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:55:25,148][09423] Updated weights for policy 0, policy_version 276557 (0.0035) [2024-06-28 17:55:27,921][09190] Fps is (10 sec: 40960.3, 60 sec: 42600.2, 300 sec: 42431.8). Total num frames: 4531191808. Throughput: 0: 42559.3. Samples: 810029500. Policy #0 lag: (min: 1.0, avg: 9.1, max: 20.0) [2024-06-28 17:55:27,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 17:55:29,665][09423] Updated weights for policy 0, policy_version 276567 (0.0026) [2024-06-28 17:55:32,838][09423] Updated weights for policy 0, policy_version 276577 (0.0025) [2024-06-28 17:55:32,921][09190] Fps is (10 sec: 47513.5, 60 sec: 42871.4, 300 sec: 42598.4). Total num frames: 4531437568. Throughput: 0: 42374.7. Samples: 810281740. Policy #0 lag: (min: 1.0, avg: 9.1, max: 20.0) [2024-06-28 17:55:32,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 17:55:37,445][09423] Updated weights for policy 0, policy_version 276587 (0.0043) [2024-06-28 17:55:37,921][09190] Fps is (10 sec: 42598.1, 60 sec: 42325.4, 300 sec: 42487.3). Total num frames: 4531617792. Throughput: 0: 42368.0. Samples: 810540620. Policy #0 lag: (min: 1.0, avg: 9.1, max: 20.0) [2024-06-28 17:55:37,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 17:55:40,643][09423] Updated weights for policy 0, policy_version 276597 (0.0030) [2024-06-28 17:55:42,921][09190] Fps is (10 sec: 39322.1, 60 sec: 42325.5, 300 sec: 42431.8). Total num frames: 4531830784. Throughput: 0: 42468.5. Samples: 810659020. Policy #0 lag: (min: 1.0, avg: 9.1, max: 20.0) [2024-06-28 17:55:42,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 17:55:45,185][09423] Updated weights for policy 0, policy_version 276607 (0.0030) [2024-06-28 17:55:47,921][09190] Fps is (10 sec: 44237.0, 60 sec: 42325.4, 300 sec: 42487.3). Total num frames: 4532060160. Throughput: 0: 42438.7. Samples: 810917420. Policy #0 lag: (min: 1.0, avg: 9.1, max: 20.0) [2024-06-28 17:55:47,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:55:48,233][09423] Updated weights for policy 0, policy_version 276617 (0.0035) [2024-06-28 17:55:52,845][09423] Updated weights for policy 0, policy_version 276627 (0.0043) [2024-06-28 17:55:52,922][09190] Fps is (10 sec: 42597.6, 60 sec: 42871.4, 300 sec: 42487.3). Total num frames: 4532256768. Throughput: 0: 42339.0. Samples: 811173860. Policy #0 lag: (min: 1.0, avg: 9.1, max: 20.0) [2024-06-28 17:55:52,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:55:56,472][09423] Updated weights for policy 0, policy_version 276637 (0.0025) [2024-06-28 17:55:57,921][09190] Fps is (10 sec: 40959.8, 60 sec: 42598.4, 300 sec: 42431.8). Total num frames: 4532469760. Throughput: 0: 42606.7. Samples: 811300220. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 17:55:57,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 17:56:00,794][09423] Updated weights for policy 0, policy_version 276647 (0.0038) [2024-06-28 17:56:02,922][09190] Fps is (10 sec: 42598.4, 60 sec: 42325.2, 300 sec: 42487.3). Total num frames: 4532682752. Throughput: 0: 42293.2. Samples: 811550680. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 17:56:02,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 17:56:03,992][09423] Updated weights for policy 0, policy_version 276657 (0.0031) [2024-06-28 17:56:07,921][09190] Fps is (10 sec: 40959.8, 60 sec: 42325.3, 300 sec: 42431.8). Total num frames: 4532879360. Throughput: 0: 42384.9. Samples: 811813120. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 17:56:07,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:56:08,252][09423] Updated weights for policy 0, policy_version 276667 (0.0040) [2024-06-28 17:56:11,539][09423] Updated weights for policy 0, policy_version 276677 (0.0029) [2024-06-28 17:56:12,922][09190] Fps is (10 sec: 44236.8, 60 sec: 42871.4, 300 sec: 42487.3). Total num frames: 4533125120. Throughput: 0: 42310.1. Samples: 811933460. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 17:56:12,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:56:15,989][09423] Updated weights for policy 0, policy_version 276687 (0.0039) [2024-06-28 17:56:17,921][09190] Fps is (10 sec: 45875.1, 60 sec: 42598.4, 300 sec: 42542.9). Total num frames: 4533338112. Throughput: 0: 42428.9. Samples: 812191040. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 17:56:17,922][09190] Avg episode reward: [(0, '0.738')] [2024-06-28 17:56:17,929][09190] No heartbeat for components: RolloutWorker_w20 (2916 seconds) [2024-06-28 17:56:17,930][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000276693_4533338112.pth... [2024-06-28 17:56:17,979][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000276071_4523147264.pth [2024-06-28 17:56:19,447][09423] Updated weights for policy 0, policy_version 276697 (0.0038) [2024-06-28 17:56:22,921][09190] Fps is (10 sec: 40960.6, 60 sec: 42871.5, 300 sec: 42431.8). Total num frames: 4533534720. Throughput: 0: 42397.8. Samples: 812448520. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 17:56:22,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:56:23,551][09423] Updated weights for policy 0, policy_version 276707 (0.0033) [2024-06-28 17:56:26,221][09403] Signal inference workers to stop experience collection... (11150 times) [2024-06-28 17:56:26,221][09403] Signal inference workers to resume experience collection... (11150 times) [2024-06-28 17:56:26,231][09423] InferenceWorker_p0-w0: stopping experience collection (11150 times) [2024-06-28 17:56:26,259][09423] InferenceWorker_p0-w0: resuming experience collection (11150 times) [2024-06-28 17:56:26,834][09423] Updated weights for policy 0, policy_version 276717 (0.0030) [2024-06-28 17:56:27,921][09190] Fps is (10 sec: 40960.5, 60 sec: 42598.4, 300 sec: 42431.8). Total num frames: 4533747712. Throughput: 0: 42564.0. Samples: 812574400. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 17:56:27,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:56:31,016][09423] Updated weights for policy 0, policy_version 276727 (0.0031) [2024-06-28 17:56:32,921][09190] Fps is (10 sec: 44236.6, 60 sec: 42325.4, 300 sec: 42542.9). Total num frames: 4533977088. Throughput: 0: 42579.5. Samples: 812833500. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 17:56:32,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 17:56:34,393][09423] Updated weights for policy 0, policy_version 276737 (0.0029) [2024-06-28 17:56:37,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42325.4, 300 sec: 42376.3). Total num frames: 4534157312. Throughput: 0: 42730.4. Samples: 813096720. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 17:56:37,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:56:38,662][09423] Updated weights for policy 0, policy_version 276747 (0.0032) [2024-06-28 17:56:41,885][09423] Updated weights for policy 0, policy_version 276757 (0.0029) [2024-06-28 17:56:42,922][09190] Fps is (10 sec: 42597.9, 60 sec: 42871.3, 300 sec: 42487.3). Total num frames: 4534403072. Throughput: 0: 42684.8. Samples: 813221040. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 17:56:42,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:56:46,359][09423] Updated weights for policy 0, policy_version 276767 (0.0031) [2024-06-28 17:56:47,921][09190] Fps is (10 sec: 45874.9, 60 sec: 42598.4, 300 sec: 42542.9). Total num frames: 4534616064. Throughput: 0: 42761.0. Samples: 813474920. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 17:56:47,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 17:56:49,469][09423] Updated weights for policy 0, policy_version 276777 (0.0032) [2024-06-28 17:56:52,921][09190] Fps is (10 sec: 40960.7, 60 sec: 42598.5, 300 sec: 42542.9). Total num frames: 4534812672. Throughput: 0: 42704.1. Samples: 813734800. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 17:56:52,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:56:54,097][09423] Updated weights for policy 0, policy_version 276787 (0.0028) [2024-06-28 17:56:57,309][09423] Updated weights for policy 0, policy_version 276797 (0.0033) [2024-06-28 17:56:57,921][09190] Fps is (10 sec: 42598.6, 60 sec: 42871.5, 300 sec: 42487.3). Total num frames: 4535042048. Throughput: 0: 42870.4. Samples: 813862620. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 17:56:57,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:57:01,584][09423] Updated weights for policy 0, policy_version 276807 (0.0039) [2024-06-28 17:57:02,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42598.5, 300 sec: 42542.9). Total num frames: 4535238656. Throughput: 0: 42637.9. Samples: 814109740. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 17:57:02,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 17:57:05,963][09423] Updated weights for policy 0, policy_version 276817 (0.0038) [2024-06-28 17:57:07,921][09190] Fps is (10 sec: 40960.1, 60 sec: 42871.6, 300 sec: 42542.9). Total num frames: 4535451648. Throughput: 0: 42784.9. Samples: 814373840. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 17:57:07,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 17:57:09,084][09423] Updated weights for policy 0, policy_version 276827 (0.0029) [2024-06-28 17:57:12,924][09190] Fps is (10 sec: 42587.4, 60 sec: 42323.6, 300 sec: 42487.0). Total num frames: 4535664640. Throughput: 0: 42752.2. Samples: 814498360. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 17:57:12,924][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 17:57:13,583][09423] Updated weights for policy 0, policy_version 276837 (0.0035) [2024-06-28 17:57:16,868][09423] Updated weights for policy 0, policy_version 276847 (0.0028) [2024-06-28 17:57:17,921][09190] Fps is (10 sec: 42597.9, 60 sec: 42325.3, 300 sec: 42542.9). Total num frames: 4535877632. Throughput: 0: 42656.9. Samples: 814753060. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 17:57:17,923][09190] Avg episode reward: [(0, '0.735')] [2024-06-28 17:57:21,441][09423] Updated weights for policy 0, policy_version 276857 (0.0033) [2024-06-28 17:57:22,921][09190] Fps is (10 sec: 42608.9, 60 sec: 42598.3, 300 sec: 42542.9). Total num frames: 4536090624. Throughput: 0: 42592.3. Samples: 815013380. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 17:57:22,922][09190] Avg episode reward: [(0, '0.735')] [2024-06-28 17:57:24,894][09423] Updated weights for policy 0, policy_version 276867 (0.0031) [2024-06-28 17:57:27,921][09190] Fps is (10 sec: 44237.0, 60 sec: 42871.4, 300 sec: 42487.3). Total num frames: 4536320000. Throughput: 0: 42548.1. Samples: 815135700. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 17:57:27,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 17:57:29,248][09423] Updated weights for policy 0, policy_version 276877 (0.0031) [2024-06-28 17:57:32,503][09423] Updated weights for policy 0, policy_version 276887 (0.0030) [2024-06-28 17:57:32,921][09190] Fps is (10 sec: 42598.9, 60 sec: 42325.4, 300 sec: 42543.2). Total num frames: 4536516608. Throughput: 0: 42509.4. Samples: 815387840. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 17:57:32,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 17:57:37,154][09423] Updated weights for policy 0, policy_version 276897 (0.0031) [2024-06-28 17:57:37,921][09190] Fps is (10 sec: 39321.8, 60 sec: 42598.4, 300 sec: 42431.8). Total num frames: 4536713216. Throughput: 0: 42511.6. Samples: 815647820. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 17:57:37,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:57:40,478][09423] Updated weights for policy 0, policy_version 276907 (0.0032) [2024-06-28 17:57:42,921][09190] Fps is (10 sec: 42597.9, 60 sec: 42325.4, 300 sec: 42431.8). Total num frames: 4536942592. Throughput: 0: 42350.6. Samples: 815768400. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 17:57:42,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:57:44,893][09423] Updated weights for policy 0, policy_version 276917 (0.0030) [2024-06-28 17:57:47,638][09403] Signal inference workers to stop experience collection... (11200 times) [2024-06-28 17:57:47,641][09403] Signal inference workers to resume experience collection... (11200 times) [2024-06-28 17:57:47,678][09423] InferenceWorker_p0-w0: stopping experience collection (11200 times) [2024-06-28 17:57:47,678][09423] InferenceWorker_p0-w0: resuming experience collection (11200 times) [2024-06-28 17:57:47,791][09423] Updated weights for policy 0, policy_version 276927 (0.0034) [2024-06-28 17:57:47,921][09190] Fps is (10 sec: 45874.6, 60 sec: 42598.3, 300 sec: 42598.4). Total num frames: 4537171968. Throughput: 0: 42467.9. Samples: 816020800. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 17:57:47,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:57:52,396][09423] Updated weights for policy 0, policy_version 276937 (0.0036) [2024-06-28 17:57:52,921][09190] Fps is (10 sec: 40960.3, 60 sec: 42325.3, 300 sec: 42487.3). Total num frames: 4537352192. Throughput: 0: 42505.7. Samples: 816286600. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 17:57:52,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 17:57:55,574][09423] Updated weights for policy 0, policy_version 276947 (0.0041) [2024-06-28 17:57:57,921][09190] Fps is (10 sec: 40960.3, 60 sec: 42325.3, 300 sec: 42432.0). Total num frames: 4537581568. Throughput: 0: 42376.1. Samples: 816405180. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 17:57:57,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 17:58:00,054][09423] Updated weights for policy 0, policy_version 276957 (0.0035) [2024-06-28 17:58:02,921][09190] Fps is (10 sec: 45875.2, 60 sec: 42871.4, 300 sec: 42654.0). Total num frames: 4537810944. Throughput: 0: 42378.7. Samples: 816660100. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 17:58:02,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 17:58:03,047][09423] Updated weights for policy 0, policy_version 276967 (0.0027) [2024-06-28 17:58:07,690][09423] Updated weights for policy 0, policy_version 276977 (0.0032) [2024-06-28 17:58:07,921][09190] Fps is (10 sec: 40960.5, 60 sec: 42325.4, 300 sec: 42487.3). Total num frames: 4537991168. Throughput: 0: 42500.6. Samples: 816925900. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 17:58:07,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:58:10,872][09423] Updated weights for policy 0, policy_version 276987 (0.0028) [2024-06-28 17:58:12,921][09190] Fps is (10 sec: 40959.6, 60 sec: 42600.1, 300 sec: 42431.8). Total num frames: 4538220544. Throughput: 0: 42441.7. Samples: 817045580. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 17:58:12,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:58:15,288][09423] Updated weights for policy 0, policy_version 276997 (0.0031) [2024-06-28 17:58:17,922][09190] Fps is (10 sec: 44235.8, 60 sec: 42598.3, 300 sec: 42598.4). Total num frames: 4538433536. Throughput: 0: 42473.1. Samples: 817299140. Policy #0 lag: (min: 0.0, avg: 9.4, max: 20.0) [2024-06-28 17:58:17,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:58:17,991][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000277005_4538449920.pth... [2024-06-28 17:58:18,052][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000276383_4528259072.pth [2024-06-28 17:58:18,418][09423] Updated weights for policy 0, policy_version 277007 (0.0034) [2024-06-28 17:58:22,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42325.4, 300 sec: 42487.3). Total num frames: 4538630144. Throughput: 0: 42475.9. Samples: 817559240. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 17:58:22,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 17:58:23,062][09423] Updated weights for policy 0, policy_version 277017 (0.0027) [2024-06-28 17:58:26,346][09423] Updated weights for policy 0, policy_version 277027 (0.0030) [2024-06-28 17:58:27,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42052.2, 300 sec: 42431.8). Total num frames: 4538843136. Throughput: 0: 42493.3. Samples: 817680600. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 17:58:27,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:58:30,566][09423] Updated weights for policy 0, policy_version 277037 (0.0032) [2024-06-28 17:58:32,921][09190] Fps is (10 sec: 44236.8, 60 sec: 42598.3, 300 sec: 42598.4). Total num frames: 4539072512. Throughput: 0: 42581.8. Samples: 817936980. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 17:58:32,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 17:58:34,013][09423] Updated weights for policy 0, policy_version 277047 (0.0031) [2024-06-28 17:58:37,921][09190] Fps is (10 sec: 44237.0, 60 sec: 42871.4, 300 sec: 42598.4). Total num frames: 4539285504. Throughput: 0: 42493.3. Samples: 818198800. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 17:58:37,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 17:58:38,066][09423] Updated weights for policy 0, policy_version 277057 (0.0044) [2024-06-28 17:58:41,591][09423] Updated weights for policy 0, policy_version 277067 (0.0044) [2024-06-28 17:58:42,921][09190] Fps is (10 sec: 40960.4, 60 sec: 42325.4, 300 sec: 42376.3). Total num frames: 4539482112. Throughput: 0: 42549.4. Samples: 818319900. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 17:58:42,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 17:58:45,862][09423] Updated weights for policy 0, policy_version 277077 (0.0039) [2024-06-28 17:58:47,922][09190] Fps is (10 sec: 44236.4, 60 sec: 42598.4, 300 sec: 42653.9). Total num frames: 4539727872. Throughput: 0: 42621.2. Samples: 818578060. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 17:58:47,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:58:49,716][09423] Updated weights for policy 0, policy_version 277087 (0.0028) [2024-06-28 17:58:52,921][09190] Fps is (10 sec: 44236.3, 60 sec: 42871.4, 300 sec: 42598.7). Total num frames: 4539924480. Throughput: 0: 42448.8. Samples: 818836100. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 17:58:52,925][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:58:53,509][09423] Updated weights for policy 0, policy_version 277097 (0.0044) [2024-06-28 17:58:57,411][09423] Updated weights for policy 0, policy_version 277107 (0.0030) [2024-06-28 17:58:57,921][09190] Fps is (10 sec: 40960.5, 60 sec: 42598.4, 300 sec: 42487.3). Total num frames: 4540137472. Throughput: 0: 42463.6. Samples: 818956440. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 17:58:57,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 17:59:01,076][09423] Updated weights for policy 0, policy_version 277117 (0.0039) [2024-06-28 17:59:02,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42325.3, 300 sec: 42598.4). Total num frames: 4540350464. Throughput: 0: 42456.1. Samples: 819209660. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 17:59:02,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 17:59:04,815][09423] Updated weights for policy 0, policy_version 277127 (0.0030) [2024-06-28 17:59:07,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42598.4, 300 sec: 42487.4). Total num frames: 4540547072. Throughput: 0: 42591.6. Samples: 819475860. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 17:59:07,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:59:08,818][09423] Updated weights for policy 0, policy_version 277137 (0.0031) [2024-06-28 17:59:12,710][09423] Updated weights for policy 0, policy_version 277147 (0.0030) [2024-06-28 17:59:12,921][09190] Fps is (10 sec: 42598.9, 60 sec: 42598.5, 300 sec: 42487.3). Total num frames: 4540776448. Throughput: 0: 42600.1. Samples: 819597600. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 17:59:12,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 17:59:16,561][09423] Updated weights for policy 0, policy_version 277157 (0.0033) [2024-06-28 17:59:17,922][09190] Fps is (10 sec: 45874.4, 60 sec: 42871.5, 300 sec: 42653.9). Total num frames: 4541005824. Throughput: 0: 42531.0. Samples: 819850880. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 17:59:17,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 17:59:17,928][09190] No heartbeat for components: RolloutWorker_w20 (3096 seconds) [2024-06-28 17:59:20,648][09423] Updated weights for policy 0, policy_version 277167 (0.0032) [2024-06-28 17:59:22,921][09190] Fps is (10 sec: 40959.6, 60 sec: 42598.4, 300 sec: 42543.2). Total num frames: 4541186048. Throughput: 0: 42523.5. Samples: 820112360. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 17:59:22,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:59:24,066][09423] Updated weights for policy 0, policy_version 277177 (0.0026) [2024-06-28 17:59:24,501][09403] Signal inference workers to stop experience collection... (11250 times) [2024-06-28 17:59:24,508][09403] Signal inference workers to resume experience collection... (11250 times) [2024-06-28 17:59:24,549][09423] InferenceWorker_p0-w0: stopping experience collection (11250 times) [2024-06-28 17:59:24,549][09423] InferenceWorker_p0-w0: resuming experience collection (11250 times) [2024-06-28 17:59:27,921][09190] Fps is (10 sec: 39322.2, 60 sec: 42598.5, 300 sec: 42487.3). Total num frames: 4541399040. Throughput: 0: 42533.3. Samples: 820233900. Policy #0 lag: (min: 0.0, avg: 10.2, max: 20.0) [2024-06-28 17:59:27,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 17:59:28,215][09423] Updated weights for policy 0, policy_version 277187 (0.0035) [2024-06-28 17:59:31,715][09423] Updated weights for policy 0, policy_version 277197 (0.0032) [2024-06-28 17:59:32,921][09190] Fps is (10 sec: 45875.5, 60 sec: 42871.5, 300 sec: 42598.4). Total num frames: 4541644800. Throughput: 0: 42496.6. Samples: 820490400. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 17:59:32,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 17:59:36,191][09423] Updated weights for policy 0, policy_version 277207 (0.0027) [2024-06-28 17:59:37,921][09190] Fps is (10 sec: 44236.3, 60 sec: 42598.4, 300 sec: 42542.9). Total num frames: 4541841408. Throughput: 0: 42550.7. Samples: 820750880. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 17:59:37,923][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 17:59:39,462][09423] Updated weights for policy 0, policy_version 277217 (0.0037) [2024-06-28 17:59:42,924][09190] Fps is (10 sec: 40949.6, 60 sec: 42869.6, 300 sec: 42487.0). Total num frames: 4542054400. Throughput: 0: 42639.4. Samples: 820875320. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 17:59:42,924][09190] Avg episode reward: [(0, '0.758')] [2024-06-28 17:59:43,976][09423] Updated weights for policy 0, policy_version 277227 (0.0035) [2024-06-28 17:59:46,814][09423] Updated weights for policy 0, policy_version 277237 (0.0031) [2024-06-28 17:59:47,921][09190] Fps is (10 sec: 42598.8, 60 sec: 42325.4, 300 sec: 42653.9). Total num frames: 4542267392. Throughput: 0: 42660.6. Samples: 821129380. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 17:59:47,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 17:59:51,461][09423] Updated weights for policy 0, policy_version 277247 (0.0045) [2024-06-28 17:59:52,921][09190] Fps is (10 sec: 40970.3, 60 sec: 42325.4, 300 sec: 42542.9). Total num frames: 4542464000. Throughput: 0: 42436.4. Samples: 821385500. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 17:59:52,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 17:59:54,875][09423] Updated weights for policy 0, policy_version 277257 (0.0034) [2024-06-28 17:59:57,922][09190] Fps is (10 sec: 40958.9, 60 sec: 42325.2, 300 sec: 42487.3). Total num frames: 4542676992. Throughput: 0: 42554.4. Samples: 821512560. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 17:59:57,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 17:59:58,865][09423] Updated weights for policy 0, policy_version 277267 (0.0032) [2024-06-28 18:00:02,622][09423] Updated weights for policy 0, policy_version 277277 (0.0043) [2024-06-28 18:00:02,922][09190] Fps is (10 sec: 44236.1, 60 sec: 42598.3, 300 sec: 42598.4). Total num frames: 4542906368. Throughput: 0: 42667.5. Samples: 821770920. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 18:00:02,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 18:00:06,565][09423] Updated weights for policy 0, policy_version 277287 (0.0030) [2024-06-28 18:00:07,921][09190] Fps is (10 sec: 42599.5, 60 sec: 42598.4, 300 sec: 42542.9). Total num frames: 4543102976. Throughput: 0: 42699.6. Samples: 822033840. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 18:00:07,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 18:00:10,117][09423] Updated weights for policy 0, policy_version 277297 (0.0030) [2024-06-28 18:00:12,921][09190] Fps is (10 sec: 40961.0, 60 sec: 42325.3, 300 sec: 42487.3). Total num frames: 4543315968. Throughput: 0: 42776.0. Samples: 822158820. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 18:00:12,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 18:00:14,525][09423] Updated weights for policy 0, policy_version 277307 (0.0036) [2024-06-28 18:00:17,837][09423] Updated weights for policy 0, policy_version 277317 (0.0029) [2024-06-28 18:00:17,921][09190] Fps is (10 sec: 45875.2, 60 sec: 42598.5, 300 sec: 42709.5). Total num frames: 4543561728. Throughput: 0: 42688.0. Samples: 822411360. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 18:00:17,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 18:00:17,942][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000277317_4543561728.pth... [2024-06-28 18:00:18,010][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000276693_4533338112.pth [2024-06-28 18:00:22,444][09423] Updated weights for policy 0, policy_version 277327 (0.0036) [2024-06-28 18:00:22,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42598.5, 300 sec: 42542.9). Total num frames: 4543741952. Throughput: 0: 42576.1. Samples: 822666800. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 18:00:22,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 18:00:25,814][09423] Updated weights for policy 0, policy_version 277337 (0.0039) [2024-06-28 18:00:27,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42871.4, 300 sec: 42487.3). Total num frames: 4543971328. Throughput: 0: 42695.3. Samples: 822796500. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 18:00:27,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 18:00:29,856][09423] Updated weights for policy 0, policy_version 277347 (0.0035) [2024-06-28 18:00:32,921][09190] Fps is (10 sec: 45875.0, 60 sec: 42598.4, 300 sec: 42653.9). Total num frames: 4544200704. Throughput: 0: 42601.7. Samples: 823046460. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 18:00:32,923][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 18:00:33,256][09423] Updated weights for policy 0, policy_version 277357 (0.0029) [2024-06-28 18:00:37,501][09423] Updated weights for policy 0, policy_version 277367 (0.0033) [2024-06-28 18:00:37,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42325.4, 300 sec: 42542.8). Total num frames: 4544380928. Throughput: 0: 42809.8. Samples: 823311940. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 18:00:37,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 18:00:40,710][09423] Updated weights for policy 0, policy_version 277377 (0.0034) [2024-06-28 18:00:42,921][09190] Fps is (10 sec: 39321.8, 60 sec: 42327.1, 300 sec: 42487.3). Total num frames: 4544593920. Throughput: 0: 42794.5. Samples: 823438300. Policy #0 lag: (min: 0.0, avg: 10.9, max: 21.0) [2024-06-28 18:00:42,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 18:00:45,427][09423] Updated weights for policy 0, policy_version 277387 (0.0031) [2024-06-28 18:00:47,921][09190] Fps is (10 sec: 45875.5, 60 sec: 42871.5, 300 sec: 42654.0). Total num frames: 4544839680. Throughput: 0: 42714.5. Samples: 823693060. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 18:00:47,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 18:00:48,813][09423] Updated weights for policy 0, policy_version 277397 (0.0031) [2024-06-28 18:00:52,768][09403] Signal inference workers to stop experience collection... (11300 times) [2024-06-28 18:00:52,793][09423] InferenceWorker_p0-w0: stopping experience collection (11300 times) [2024-06-28 18:00:52,820][09403] Signal inference workers to resume experience collection... (11300 times) [2024-06-28 18:00:52,820][09423] InferenceWorker_p0-w0: resuming experience collection (11300 times) [2024-06-28 18:00:52,824][09423] Updated weights for policy 0, policy_version 277407 (0.0036) [2024-06-28 18:00:52,921][09190] Fps is (10 sec: 44236.5, 60 sec: 42871.4, 300 sec: 42598.4). Total num frames: 4545036288. Throughput: 0: 42784.8. Samples: 823959160. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 18:00:52,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 18:00:56,395][09423] Updated weights for policy 0, policy_version 277417 (0.0031) [2024-06-28 18:00:57,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42871.7, 300 sec: 42598.4). Total num frames: 4545249280. Throughput: 0: 42698.7. Samples: 824080260. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 18:00:57,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 18:01:00,581][09423] Updated weights for policy 0, policy_version 277427 (0.0035) [2024-06-28 18:01:02,922][09190] Fps is (10 sec: 42598.1, 60 sec: 42598.4, 300 sec: 42653.9). Total num frames: 4545462272. Throughput: 0: 42651.0. Samples: 824330660. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 18:01:02,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 18:01:04,591][09423] Updated weights for policy 0, policy_version 277437 (0.0028) [2024-06-28 18:01:07,922][09190] Fps is (10 sec: 40959.2, 60 sec: 42598.3, 300 sec: 42487.3). Total num frames: 4545658880. Throughput: 0: 42850.9. Samples: 824595100. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 18:01:07,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 18:01:08,215][09423] Updated weights for policy 0, policy_version 277447 (0.0035) [2024-06-28 18:01:12,265][09423] Updated weights for policy 0, policy_version 277457 (0.0028) [2024-06-28 18:01:12,921][09190] Fps is (10 sec: 42598.9, 60 sec: 42871.4, 300 sec: 42542.9). Total num frames: 4545888256. Throughput: 0: 42689.8. Samples: 824717540. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 18:01:12,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 18:01:15,818][09423] Updated weights for policy 0, policy_version 277467 (0.0041) [2024-06-28 18:01:17,922][09190] Fps is (10 sec: 45875.1, 60 sec: 42598.3, 300 sec: 42653.9). Total num frames: 4546117632. Throughput: 0: 42795.9. Samples: 824972280. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 18:01:17,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 18:01:19,751][09423] Updated weights for policy 0, policy_version 277477 (0.0039) [2024-06-28 18:01:22,922][09190] Fps is (10 sec: 42594.1, 60 sec: 42870.7, 300 sec: 42598.2). Total num frames: 4546314240. Throughput: 0: 42674.2. Samples: 825232320. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 18:01:22,923][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 18:01:23,587][09423] Updated weights for policy 0, policy_version 277487 (0.0026) [2024-06-28 18:01:27,406][09423] Updated weights for policy 0, policy_version 277497 (0.0032) [2024-06-28 18:01:27,921][09190] Fps is (10 sec: 40961.0, 60 sec: 42598.5, 300 sec: 42542.9). Total num frames: 4546527232. Throughput: 0: 42582.7. Samples: 825354520. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 18:01:27,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 18:01:31,277][09423] Updated weights for policy 0, policy_version 277507 (0.0031) [2024-06-28 18:01:32,922][09190] Fps is (10 sec: 44240.8, 60 sec: 42598.3, 300 sec: 42709.5). Total num frames: 4546756608. Throughput: 0: 42624.3. Samples: 825611160. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 18:01:32,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 18:01:35,113][09423] Updated weights for policy 0, policy_version 277517 (0.0033) [2024-06-28 18:01:37,921][09190] Fps is (10 sec: 40959.6, 60 sec: 42598.4, 300 sec: 42487.3). Total num frames: 4546936832. Throughput: 0: 42413.8. Samples: 825867780. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 18:01:37,922][09190] Avg episode reward: [(0, '0.754')] [2024-06-28 18:01:38,716][09423] Updated weights for policy 0, policy_version 277527 (0.0041) [2024-06-28 18:01:42,698][09423] Updated weights for policy 0, policy_version 277537 (0.0028) [2024-06-28 18:01:42,921][09190] Fps is (10 sec: 40960.3, 60 sec: 42871.4, 300 sec: 42542.9). Total num frames: 4547166208. Throughput: 0: 42532.8. Samples: 825994240. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 18:01:42,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:01:46,371][09423] Updated weights for policy 0, policy_version 277547 (0.0040) [2024-06-28 18:01:47,921][09190] Fps is (10 sec: 44236.4, 60 sec: 42325.2, 300 sec: 42598.4). Total num frames: 4547379200. Throughput: 0: 42562.7. Samples: 826245980. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 18:01:47,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 18:01:50,530][09423] Updated weights for policy 0, policy_version 277557 (0.0031) [2024-06-28 18:01:52,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42325.3, 300 sec: 42487.3). Total num frames: 4547575808. Throughput: 0: 42372.6. Samples: 826501860. Policy #0 lag: (min: 0.0, avg: 8.8, max: 20.0) [2024-06-28 18:01:52,922][09190] Avg episode reward: [(0, '0.735')] [2024-06-28 18:01:54,137][09423] Updated weights for policy 0, policy_version 277567 (0.0035) [2024-06-28 18:01:57,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42598.3, 300 sec: 42598.4). Total num frames: 4547805184. Throughput: 0: 42445.7. Samples: 826627600. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 18:01:57,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 18:01:58,092][09423] Updated weights for policy 0, policy_version 277577 (0.0034) [2024-06-28 18:02:01,947][09423] Updated weights for policy 0, policy_version 277587 (0.0031) [2024-06-28 18:02:02,922][09190] Fps is (10 sec: 44236.1, 60 sec: 42598.3, 300 sec: 42598.4). Total num frames: 4548018176. Throughput: 0: 42381.8. Samples: 826879460. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 18:02:02,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 18:02:05,828][09423] Updated weights for policy 0, policy_version 277597 (0.0053) [2024-06-28 18:02:07,921][09190] Fps is (10 sec: 39322.5, 60 sec: 42325.5, 300 sec: 42487.7). Total num frames: 4548198400. Throughput: 0: 42422.8. Samples: 827141300. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 18:02:07,921][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 18:02:09,350][09423] Updated weights for policy 0, policy_version 277607 (0.0033) [2024-06-28 18:02:12,921][09190] Fps is (10 sec: 40960.9, 60 sec: 42325.4, 300 sec: 42542.9). Total num frames: 4548427776. Throughput: 0: 42455.1. Samples: 827265000. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 18:02:12,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 18:02:13,755][09423] Updated weights for policy 0, policy_version 277617 (0.0028) [2024-06-28 18:02:16,959][09423] Updated weights for policy 0, policy_version 277627 (0.0035) [2024-06-28 18:02:17,924][09190] Fps is (10 sec: 47501.2, 60 sec: 42596.7, 300 sec: 42653.6). Total num frames: 4548673536. Throughput: 0: 42427.1. Samples: 827520480. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 18:02:17,924][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 18:02:17,939][09190] No heartbeat for components: RolloutWorker_w20 (3276 seconds) [2024-06-28 18:02:17,939][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000277629_4548673536.pth... [2024-06-28 18:02:17,993][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000277005_4538449920.pth [2024-06-28 18:02:21,381][09423] Updated weights for policy 0, policy_version 277637 (0.0036) [2024-06-28 18:02:22,924][09190] Fps is (10 sec: 42587.5, 60 sec: 42324.3, 300 sec: 42487.0). Total num frames: 4548853760. Throughput: 0: 42484.8. Samples: 827779700. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 18:02:22,924][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 18:02:24,763][09423] Updated weights for policy 0, policy_version 277647 (0.0031) [2024-06-28 18:02:27,921][09190] Fps is (10 sec: 39331.4, 60 sec: 42325.3, 300 sec: 42542.9). Total num frames: 4549066752. Throughput: 0: 42436.0. Samples: 827903860. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 18:02:27,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 18:02:29,167][09423] Updated weights for policy 0, policy_version 277657 (0.0032) [2024-06-28 18:02:32,831][09423] Updated weights for policy 0, policy_version 277667 (0.0037) [2024-06-28 18:02:32,921][09190] Fps is (10 sec: 44247.8, 60 sec: 42325.4, 300 sec: 42653.9). Total num frames: 4549296128. Throughput: 0: 42411.6. Samples: 828154500. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 18:02:32,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 18:02:36,608][09423] Updated weights for policy 0, policy_version 277677 (0.0036) [2024-06-28 18:02:37,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42325.4, 300 sec: 42487.3). Total num frames: 4549476352. Throughput: 0: 42548.1. Samples: 828416520. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 18:02:37,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 18:02:38,853][09403] Signal inference workers to stop experience collection... (11350 times) [2024-06-28 18:02:38,853][09403] Signal inference workers to resume experience collection... (11350 times) [2024-06-28 18:02:38,866][09423] InferenceWorker_p0-w0: stopping experience collection (11350 times) [2024-06-28 18:02:38,866][09423] InferenceWorker_p0-w0: resuming experience collection (11350 times) [2024-06-28 18:02:40,177][09423] Updated weights for policy 0, policy_version 277687 (0.0029) [2024-06-28 18:02:42,926][09190] Fps is (10 sec: 42580.3, 60 sec: 42595.4, 300 sec: 42542.3). Total num frames: 4549722112. Throughput: 0: 42456.9. Samples: 828538340. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 18:02:42,926][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 18:02:44,086][09423] Updated weights for policy 0, policy_version 277697 (0.0023) [2024-06-28 18:02:47,921][09190] Fps is (10 sec: 45875.0, 60 sec: 42598.5, 300 sec: 42653.9). Total num frames: 4549935104. Throughput: 0: 42650.8. Samples: 828798740. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 18:02:47,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 18:02:48,012][09423] Updated weights for policy 0, policy_version 277707 (0.0027) [2024-06-28 18:02:51,894][09423] Updated weights for policy 0, policy_version 277717 (0.0038) [2024-06-28 18:02:52,921][09190] Fps is (10 sec: 40977.4, 60 sec: 42598.4, 300 sec: 42542.9). Total num frames: 4550131712. Throughput: 0: 42723.9. Samples: 829063880. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 18:02:52,924][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 18:02:55,823][09423] Updated weights for policy 0, policy_version 277727 (0.0036) [2024-06-28 18:02:57,921][09190] Fps is (10 sec: 42598.1, 60 sec: 42598.4, 300 sec: 42542.9). Total num frames: 4550361088. Throughput: 0: 42660.8. Samples: 829184740. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 18:02:57,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 18:03:00,160][09423] Updated weights for policy 0, policy_version 277737 (0.0033) [2024-06-28 18:03:02,921][09190] Fps is (10 sec: 44237.0, 60 sec: 42598.5, 300 sec: 42653.9). Total num frames: 4550574080. Throughput: 0: 42475.3. Samples: 829431760. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 18:03:02,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 18:03:03,537][09423] Updated weights for policy 0, policy_version 277747 (0.0032) [2024-06-28 18:03:07,921][09190] Fps is (10 sec: 39322.1, 60 sec: 42598.4, 300 sec: 42487.3). Total num frames: 4550754304. Throughput: 0: 42531.3. Samples: 829693500. Policy #0 lag: (min: 0.0, avg: 9.2, max: 21.0) [2024-06-28 18:03:07,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 18:03:08,016][09423] Updated weights for policy 0, policy_version 277757 (0.0032) [2024-06-28 18:03:10,934][09423] Updated weights for policy 0, policy_version 277767 (0.0031) [2024-06-28 18:03:12,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42598.4, 300 sec: 42542.9). Total num frames: 4550983680. Throughput: 0: 42418.3. Samples: 829812680. Policy #0 lag: (min: 1.0, avg: 11.8, max: 23.0) [2024-06-28 18:03:12,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 18:03:15,841][09423] Updated weights for policy 0, policy_version 277777 (0.0032) [2024-06-28 18:03:17,921][09190] Fps is (10 sec: 45875.0, 60 sec: 42327.1, 300 sec: 42653.9). Total num frames: 4551213056. Throughput: 0: 42581.8. Samples: 830070680. Policy #0 lag: (min: 1.0, avg: 11.8, max: 23.0) [2024-06-28 18:03:17,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 18:03:19,003][09423] Updated weights for policy 0, policy_version 277787 (0.0026) [2024-06-28 18:03:22,922][09190] Fps is (10 sec: 40959.1, 60 sec: 42327.0, 300 sec: 42542.9). Total num frames: 4551393280. Throughput: 0: 42434.5. Samples: 830326080. Policy #0 lag: (min: 1.0, avg: 11.8, max: 23.0) [2024-06-28 18:03:22,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 18:03:23,408][09423] Updated weights for policy 0, policy_version 277797 (0.0035) [2024-06-28 18:03:26,444][09423] Updated weights for policy 0, policy_version 277807 (0.0044) [2024-06-28 18:03:27,924][09190] Fps is (10 sec: 40949.6, 60 sec: 42596.6, 300 sec: 42542.5). Total num frames: 4551622656. Throughput: 0: 42520.8. Samples: 830451700. Policy #0 lag: (min: 1.0, avg: 11.8, max: 23.0) [2024-06-28 18:03:27,924][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 18:03:30,951][09423] Updated weights for policy 0, policy_version 277817 (0.0029) [2024-06-28 18:03:32,921][09190] Fps is (10 sec: 45875.5, 60 sec: 42598.4, 300 sec: 42598.4). Total num frames: 4551852032. Throughput: 0: 42440.8. Samples: 830708580. Policy #0 lag: (min: 1.0, avg: 11.8, max: 23.0) [2024-06-28 18:03:32,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 18:03:34,654][09423] Updated weights for policy 0, policy_version 277827 (0.0040) [2024-06-28 18:03:37,921][09190] Fps is (10 sec: 42609.4, 60 sec: 42871.5, 300 sec: 42598.4). Total num frames: 4552048640. Throughput: 0: 42290.8. Samples: 830966960. Policy #0 lag: (min: 1.0, avg: 11.8, max: 23.0) [2024-06-28 18:03:37,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:03:38,521][09423] Updated weights for policy 0, policy_version 277837 (0.0037) [2024-06-28 18:03:42,633][09423] Updated weights for policy 0, policy_version 277847 (0.0032) [2024-06-28 18:03:42,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42328.3, 300 sec: 42487.3). Total num frames: 4552261632. Throughput: 0: 42259.5. Samples: 831086420. Policy #0 lag: (min: 1.0, avg: 11.8, max: 23.0) [2024-06-28 18:03:42,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 18:03:46,403][09423] Updated weights for policy 0, policy_version 277857 (0.0030) [2024-06-28 18:03:47,922][09190] Fps is (10 sec: 44236.0, 60 sec: 42598.3, 300 sec: 42598.4). Total num frames: 4552491008. Throughput: 0: 42510.5. Samples: 831344740. Policy #0 lag: (min: 1.0, avg: 11.8, max: 23.0) [2024-06-28 18:03:47,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 18:03:50,091][09423] Updated weights for policy 0, policy_version 277867 (0.0029) [2024-06-28 18:03:52,921][09190] Fps is (10 sec: 40960.7, 60 sec: 42325.4, 300 sec: 42487.3). Total num frames: 4552671232. Throughput: 0: 42548.9. Samples: 831608200. Policy #0 lag: (min: 1.0, avg: 11.8, max: 23.0) [2024-06-28 18:03:52,922][09190] Avg episode reward: [(0, '0.734')] [2024-06-28 18:03:53,767][09423] Updated weights for policy 0, policy_version 277877 (0.0042) [2024-06-28 18:03:57,899][09423] Updated weights for policy 0, policy_version 277887 (0.0032) [2024-06-28 18:03:57,921][09190] Fps is (10 sec: 40960.7, 60 sec: 42325.4, 300 sec: 42542.9). Total num frames: 4552900608. Throughput: 0: 42496.0. Samples: 831725000. Policy #0 lag: (min: 1.0, avg: 11.8, max: 23.0) [2024-06-28 18:03:57,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 18:04:01,611][09423] Updated weights for policy 0, policy_version 277897 (0.0034) [2024-06-28 18:04:01,892][09403] Signal inference workers to stop experience collection... (11400 times) [2024-06-28 18:04:01,939][09423] InferenceWorker_p0-w0: stopping experience collection (11400 times) [2024-06-28 18:04:02,010][09403] Signal inference workers to resume experience collection... (11400 times) [2024-06-28 18:04:02,010][09423] InferenceWorker_p0-w0: resuming experience collection (11400 times) [2024-06-28 18:04:02,921][09190] Fps is (10 sec: 45875.2, 60 sec: 42598.4, 300 sec: 42653.9). Total num frames: 4553129984. Throughput: 0: 42486.3. Samples: 831982560. Policy #0 lag: (min: 1.0, avg: 11.8, max: 23.0) [2024-06-28 18:04:02,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 18:04:05,355][09423] Updated weights for policy 0, policy_version 277907 (0.0031) [2024-06-28 18:04:07,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42598.4, 300 sec: 42487.3). Total num frames: 4553310208. Throughput: 0: 42574.5. Samples: 832241920. Policy #0 lag: (min: 1.0, avg: 11.8, max: 23.0) [2024-06-28 18:04:07,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 18:04:09,006][09423] Updated weights for policy 0, policy_version 277917 (0.0038) [2024-06-28 18:04:12,921][09190] Fps is (10 sec: 39321.4, 60 sec: 42325.3, 300 sec: 42431.8). Total num frames: 4553523200. Throughput: 0: 42464.6. Samples: 832362500. Policy #0 lag: (min: 1.0, avg: 11.8, max: 23.0) [2024-06-28 18:04:12,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 18:04:13,157][09423] Updated weights for policy 0, policy_version 277927 (0.0037) [2024-06-28 18:04:16,998][09423] Updated weights for policy 0, policy_version 277937 (0.0029) [2024-06-28 18:04:17,921][09190] Fps is (10 sec: 45874.9, 60 sec: 42598.4, 300 sec: 42654.0). Total num frames: 4553768960. Throughput: 0: 42537.4. Samples: 832622760. Policy #0 lag: (min: 1.0, avg: 11.8, max: 23.0) [2024-06-28 18:04:17,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:04:18,015][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000277941_4553785344.pth... [2024-06-28 18:04:18,058][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000277317_4543561728.pth [2024-06-28 18:04:20,501][09423] Updated weights for policy 0, policy_version 277947 (0.0038) [2024-06-28 18:04:22,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42598.5, 300 sec: 42542.9). Total num frames: 4553949184. Throughput: 0: 42417.7. Samples: 832875760. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 18:04:22,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 18:04:24,510][09423] Updated weights for policy 0, policy_version 277957 (0.0033) [2024-06-28 18:04:27,922][09190] Fps is (10 sec: 40959.3, 60 sec: 42600.1, 300 sec: 42487.3). Total num frames: 4554178560. Throughput: 0: 42424.4. Samples: 832995520. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 18:04:27,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 18:04:28,861][09423] Updated weights for policy 0, policy_version 277967 (0.0031) [2024-06-28 18:04:32,254][09423] Updated weights for policy 0, policy_version 277977 (0.0028) [2024-06-28 18:04:32,922][09190] Fps is (10 sec: 45874.6, 60 sec: 42598.4, 300 sec: 42598.4). Total num frames: 4554407936. Throughput: 0: 42443.6. Samples: 833254700. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 18:04:32,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 18:04:36,577][09423] Updated weights for policy 0, policy_version 277987 (0.0027) [2024-06-28 18:04:37,921][09190] Fps is (10 sec: 40960.5, 60 sec: 42325.3, 300 sec: 42487.7). Total num frames: 4554588160. Throughput: 0: 42303.5. Samples: 833511860. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 18:04:37,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:04:39,731][09423] Updated weights for policy 0, policy_version 277997 (0.0035) [2024-06-28 18:04:42,921][09190] Fps is (10 sec: 39321.9, 60 sec: 42325.4, 300 sec: 42487.3). Total num frames: 4554801152. Throughput: 0: 42478.1. Samples: 833636520. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 18:04:42,922][09190] Avg episode reward: [(0, '0.734')] [2024-06-28 18:04:44,330][09423] Updated weights for policy 0, policy_version 278007 (0.0034) [2024-06-28 18:04:47,349][09423] Updated weights for policy 0, policy_version 278017 (0.0029) [2024-06-28 18:04:47,921][09190] Fps is (10 sec: 45874.9, 60 sec: 42598.4, 300 sec: 42653.9). Total num frames: 4555046912. Throughput: 0: 42503.4. Samples: 833895220. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 18:04:47,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 18:04:51,729][09423] Updated weights for policy 0, policy_version 278027 (0.0034) [2024-06-28 18:04:52,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42598.3, 300 sec: 42542.9). Total num frames: 4555227136. Throughput: 0: 42433.6. Samples: 834151440. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 18:04:52,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:04:55,198][09423] Updated weights for policy 0, policy_version 278037 (0.0027) [2024-06-28 18:04:57,922][09190] Fps is (10 sec: 39321.7, 60 sec: 42325.3, 300 sec: 42487.3). Total num frames: 4555440128. Throughput: 0: 42414.6. Samples: 834271160. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 18:04:57,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 18:04:59,762][09423] Updated weights for policy 0, policy_version 278047 (0.0034) [2024-06-28 18:05:02,865][09423] Updated weights for policy 0, policy_version 278057 (0.0029) [2024-06-28 18:05:02,921][09190] Fps is (10 sec: 45875.2, 60 sec: 42598.3, 300 sec: 42653.9). Total num frames: 4555685888. Throughput: 0: 42289.7. Samples: 834525800. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 18:05:02,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 18:05:07,470][09423] Updated weights for policy 0, policy_version 278067 (0.0027) [2024-06-28 18:05:07,922][09190] Fps is (10 sec: 42598.0, 60 sec: 42598.2, 300 sec: 42542.8). Total num frames: 4555866112. Throughput: 0: 42325.7. Samples: 834780420. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 18:05:07,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 18:05:10,469][09423] Updated weights for policy 0, policy_version 278077 (0.0038) [2024-06-28 18:05:12,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42871.5, 300 sec: 42487.3). Total num frames: 4556095488. Throughput: 0: 42430.4. Samples: 834904880. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 18:05:12,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 18:05:15,367][09423] Updated weights for policy 0, policy_version 278087 (0.0036) [2024-06-28 18:05:17,921][09190] Fps is (10 sec: 44237.3, 60 sec: 42325.3, 300 sec: 42598.4). Total num frames: 4556308480. Throughput: 0: 42472.1. Samples: 835165940. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 18:05:17,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 18:05:17,931][09190] No heartbeat for components: RolloutWorker_w20 (3456 seconds) [2024-06-28 18:05:18,530][09423] Updated weights for policy 0, policy_version 278097 (0.0035) [2024-06-28 18:05:19,395][09403] Signal inference workers to stop experience collection... (11450 times) [2024-06-28 18:05:19,448][09423] InferenceWorker_p0-w0: stopping experience collection (11450 times) [2024-06-28 18:05:19,450][09403] Signal inference workers to resume experience collection... (11450 times) [2024-06-28 18:05:19,462][09423] InferenceWorker_p0-w0: resuming experience collection (11450 times) [2024-06-28 18:05:22,720][09423] Updated weights for policy 0, policy_version 278107 (0.0031) [2024-06-28 18:05:22,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42598.4, 300 sec: 42487.3). Total num frames: 4556505088. Throughput: 0: 42568.1. Samples: 835427420. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 18:05:22,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 18:05:26,184][09423] Updated weights for policy 0, policy_version 278117 (0.0024) [2024-06-28 18:05:27,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42325.4, 300 sec: 42431.8). Total num frames: 4556718080. Throughput: 0: 42519.1. Samples: 835549880. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 18:05:27,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 18:05:30,632][09423] Updated weights for policy 0, policy_version 278127 (0.0049) [2024-06-28 18:05:32,921][09190] Fps is (10 sec: 44236.8, 60 sec: 42325.5, 300 sec: 42598.4). Total num frames: 4556947456. Throughput: 0: 42507.2. Samples: 835808040. Policy #0 lag: (min: 0.0, avg: 8.4, max: 20.0) [2024-06-28 18:05:32,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 18:05:33,721][09423] Updated weights for policy 0, policy_version 278137 (0.0038) [2024-06-28 18:05:37,921][09190] Fps is (10 sec: 44237.4, 60 sec: 42871.6, 300 sec: 42598.4). Total num frames: 4557160448. Throughput: 0: 42534.8. Samples: 836065500. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 18:05:37,922][09190] Avg episode reward: [(0, '0.738')] [2024-06-28 18:05:37,929][09423] Updated weights for policy 0, policy_version 278147 (0.0033) [2024-06-28 18:05:41,627][09423] Updated weights for policy 0, policy_version 278157 (0.0046) [2024-06-28 18:05:42,922][09190] Fps is (10 sec: 40959.3, 60 sec: 42598.4, 300 sec: 42431.8). Total num frames: 4557357056. Throughput: 0: 42585.7. Samples: 836187520. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 18:05:42,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 18:05:45,464][09423] Updated weights for policy 0, policy_version 278167 (0.0027) [2024-06-28 18:05:47,928][09190] Fps is (10 sec: 42570.4, 60 sec: 42320.8, 300 sec: 42541.9). Total num frames: 4557586432. Throughput: 0: 42530.8. Samples: 836439960. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 18:05:47,929][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 18:05:49,476][09423] Updated weights for policy 0, policy_version 278177 (0.0024) [2024-06-28 18:05:52,921][09190] Fps is (10 sec: 44237.5, 60 sec: 42871.5, 300 sec: 42542.9). Total num frames: 4557799424. Throughput: 0: 42641.9. Samples: 836699300. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 18:05:52,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 18:05:53,059][09423] Updated weights for policy 0, policy_version 278187 (0.0033) [2024-06-28 18:05:56,859][09423] Updated weights for policy 0, policy_version 278197 (0.0027) [2024-06-28 18:05:57,921][09190] Fps is (10 sec: 40986.5, 60 sec: 42598.4, 300 sec: 42487.3). Total num frames: 4557996032. Throughput: 0: 42712.0. Samples: 836826920. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 18:05:57,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 18:06:00,679][09423] Updated weights for policy 0, policy_version 278207 (0.0026) [2024-06-28 18:06:02,921][09190] Fps is (10 sec: 42598.1, 60 sec: 42325.4, 300 sec: 42598.4). Total num frames: 4558225408. Throughput: 0: 42616.9. Samples: 837083700. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 18:06:02,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 18:06:04,605][09423] Updated weights for policy 0, policy_version 278217 (0.0027) [2024-06-28 18:06:07,921][09190] Fps is (10 sec: 44236.7, 60 sec: 42871.5, 300 sec: 42542.9). Total num frames: 4558438400. Throughput: 0: 42426.1. Samples: 837336600. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 18:06:07,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:06:08,745][09423] Updated weights for policy 0, policy_version 278227 (0.0028) [2024-06-28 18:06:11,941][09423] Updated weights for policy 0, policy_version 278237 (0.0030) [2024-06-28 18:06:12,922][09190] Fps is (10 sec: 40959.6, 60 sec: 42325.2, 300 sec: 42431.8). Total num frames: 4558635008. Throughput: 0: 42597.3. Samples: 837466760. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 18:06:12,923][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 18:06:16,277][09423] Updated weights for policy 0, policy_version 278247 (0.0030) [2024-06-28 18:06:17,923][09190] Fps is (10 sec: 42590.7, 60 sec: 42597.1, 300 sec: 42542.7). Total num frames: 4558864384. Throughput: 0: 42672.0. Samples: 837728360. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 18:06:17,924][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 18:06:18,067][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000278252_4558880768.pth... [2024-06-28 18:06:18,120][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000277629_4548673536.pth [2024-06-28 18:06:20,315][09423] Updated weights for policy 0, policy_version 278257 (0.0025) [2024-06-28 18:06:22,921][09190] Fps is (10 sec: 42598.9, 60 sec: 42598.4, 300 sec: 42487.3). Total num frames: 4559060992. Throughput: 0: 42434.1. Samples: 837975040. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 18:06:22,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 18:06:23,987][09423] Updated weights for policy 0, policy_version 278267 (0.0034) [2024-06-28 18:06:27,921][09190] Fps is (10 sec: 39329.2, 60 sec: 42325.4, 300 sec: 42376.3). Total num frames: 4559257600. Throughput: 0: 42591.7. Samples: 838104140. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 18:06:27,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:06:28,265][09423] Updated weights for policy 0, policy_version 278277 (0.0028) [2024-06-28 18:06:32,253][09423] Updated weights for policy 0, policy_version 278287 (0.0029) [2024-06-28 18:06:32,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42325.3, 300 sec: 42542.9). Total num frames: 4559486976. Throughput: 0: 42673.3. Samples: 838359980. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 18:06:32,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 18:06:36,025][09423] Updated weights for policy 0, policy_version 278297 (0.0027) [2024-06-28 18:06:37,921][09190] Fps is (10 sec: 45874.9, 60 sec: 42598.3, 300 sec: 42542.9). Total num frames: 4559716352. Throughput: 0: 42507.1. Samples: 838612120. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 18:06:37,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 18:06:39,636][09403] Signal inference workers to stop experience collection... (11500 times) [2024-06-28 18:06:39,637][09403] Signal inference workers to resume experience collection... (11500 times) [2024-06-28 18:06:39,676][09423] InferenceWorker_p0-w0: stopping experience collection (11500 times) [2024-06-28 18:06:39,676][09423] InferenceWorker_p0-w0: resuming experience collection (11500 times) [2024-06-28 18:06:39,775][09423] Updated weights for policy 0, policy_version 278307 (0.0028) [2024-06-28 18:06:42,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42598.5, 300 sec: 42487.3). Total num frames: 4559912960. Throughput: 0: 42505.8. Samples: 838739680. Policy #0 lag: (min: 0.0, avg: 9.8, max: 20.0) [2024-06-28 18:06:42,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 18:06:44,116][09423] Updated weights for policy 0, policy_version 278317 (0.0029) [2024-06-28 18:06:47,361][09423] Updated weights for policy 0, policy_version 278327 (0.0031) [2024-06-28 18:06:47,921][09190] Fps is (10 sec: 40959.6, 60 sec: 42329.9, 300 sec: 42542.9). Total num frames: 4560125952. Throughput: 0: 42547.9. Samples: 838998360. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 18:06:47,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 18:06:51,630][09423] Updated weights for policy 0, policy_version 278337 (0.0026) [2024-06-28 18:06:52,922][09190] Fps is (10 sec: 42597.8, 60 sec: 42325.2, 300 sec: 42487.3). Total num frames: 4560338944. Throughput: 0: 42455.9. Samples: 839247120. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 18:06:52,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 18:06:55,386][09423] Updated weights for policy 0, policy_version 278347 (0.0032) [2024-06-28 18:06:57,923][09190] Fps is (10 sec: 42591.4, 60 sec: 42597.2, 300 sec: 42487.1). Total num frames: 4560551936. Throughput: 0: 42482.9. Samples: 839378560. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 18:06:57,924][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 18:06:59,039][09423] Updated weights for policy 0, policy_version 278357 (0.0028) [2024-06-28 18:07:02,921][09190] Fps is (10 sec: 40961.1, 60 sec: 42052.4, 300 sec: 42542.9). Total num frames: 4560748544. Throughput: 0: 42407.2. Samples: 839636600. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 18:07:02,921][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 18:07:02,928][09423] Updated weights for policy 0, policy_version 278367 (0.0032) [2024-06-28 18:07:06,889][09423] Updated weights for policy 0, policy_version 278377 (0.0043) [2024-06-28 18:07:07,921][09190] Fps is (10 sec: 44244.1, 60 sec: 42598.4, 300 sec: 42598.4). Total num frames: 4560994304. Throughput: 0: 42398.6. Samples: 839882980. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 18:07:07,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 18:07:10,576][09423] Updated weights for policy 0, policy_version 278387 (0.0036) [2024-06-28 18:07:12,921][09190] Fps is (10 sec: 44235.7, 60 sec: 42598.4, 300 sec: 42432.1). Total num frames: 4561190912. Throughput: 0: 42369.6. Samples: 840010780. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 18:07:12,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 18:07:14,547][09423] Updated weights for policy 0, policy_version 278397 (0.0038) [2024-06-28 18:07:17,921][09190] Fps is (10 sec: 40960.6, 60 sec: 42326.7, 300 sec: 42543.2). Total num frames: 4561403904. Throughput: 0: 42416.1. Samples: 840268700. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 18:07:17,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 18:07:18,101][09423] Updated weights for policy 0, policy_version 278407 (0.0036) [2024-06-28 18:07:22,139][09423] Updated weights for policy 0, policy_version 278417 (0.0036) [2024-06-28 18:07:22,921][09190] Fps is (10 sec: 42599.0, 60 sec: 42598.4, 300 sec: 42542.9). Total num frames: 4561616896. Throughput: 0: 42326.7. Samples: 840516820. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 18:07:22,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 18:07:25,748][09423] Updated weights for policy 0, policy_version 278427 (0.0027) [2024-06-28 18:07:27,921][09190] Fps is (10 sec: 42597.8, 60 sec: 42871.4, 300 sec: 42487.3). Total num frames: 4561829888. Throughput: 0: 42292.4. Samples: 840642840. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 18:07:27,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 18:07:29,765][09423] Updated weights for policy 0, policy_version 278437 (0.0032) [2024-06-28 18:07:32,921][09190] Fps is (10 sec: 39321.3, 60 sec: 42052.3, 300 sec: 42487.3). Total num frames: 4562010112. Throughput: 0: 42296.5. Samples: 840901700. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 18:07:32,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 18:07:33,605][09423] Updated weights for policy 0, policy_version 278447 (0.0030) [2024-06-28 18:07:37,507][09423] Updated weights for policy 0, policy_version 278457 (0.0029) [2024-06-28 18:07:37,922][09190] Fps is (10 sec: 42597.9, 60 sec: 42325.2, 300 sec: 42487.9). Total num frames: 4562255872. Throughput: 0: 42303.0. Samples: 841150760. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 18:07:37,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 18:07:41,018][09423] Updated weights for policy 0, policy_version 278467 (0.0035) [2024-06-28 18:07:42,921][09190] Fps is (10 sec: 44237.0, 60 sec: 42325.3, 300 sec: 42431.8). Total num frames: 4562452480. Throughput: 0: 42208.8. Samples: 841277880. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 18:07:42,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 18:07:45,316][09423] Updated weights for policy 0, policy_version 278477 (0.0043) [2024-06-28 18:07:47,922][09190] Fps is (10 sec: 40959.8, 60 sec: 42325.2, 300 sec: 42487.3). Total num frames: 4562665472. Throughput: 0: 42201.4. Samples: 841535680. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 18:07:47,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 18:07:48,108][09403] Signal inference workers to stop experience collection... (11550 times) [2024-06-28 18:07:48,108][09403] Signal inference workers to resume experience collection... (11550 times) [2024-06-28 18:07:48,121][09423] InferenceWorker_p0-w0: stopping experience collection (11550 times) [2024-06-28 18:07:48,122][09423] InferenceWorker_p0-w0: resuming experience collection (11550 times) [2024-06-28 18:07:48,910][09423] Updated weights for policy 0, policy_version 278487 (0.0029) [2024-06-28 18:07:52,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42325.5, 300 sec: 42431.8). Total num frames: 4562878464. Throughput: 0: 42259.7. Samples: 841784660. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 18:07:52,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 18:07:53,023][09423] Updated weights for policy 0, policy_version 278497 (0.0028) [2024-06-28 18:07:56,914][09423] Updated weights for policy 0, policy_version 278507 (0.0031) [2024-06-28 18:07:57,921][09190] Fps is (10 sec: 42599.4, 60 sec: 42326.5, 300 sec: 42431.8). Total num frames: 4563091456. Throughput: 0: 42360.9. Samples: 841917020. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 18:07:57,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 18:08:00,778][09423] Updated weights for policy 0, policy_version 278517 (0.0041) [2024-06-28 18:08:02,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42325.3, 300 sec: 42487.3). Total num frames: 4563288064. Throughput: 0: 42270.2. Samples: 842170860. Policy #0 lag: (min: 1.0, avg: 10.0, max: 21.0) [2024-06-28 18:08:02,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 18:08:04,291][09423] Updated weights for policy 0, policy_version 278527 (0.0034) [2024-06-28 18:08:07,921][09190] Fps is (10 sec: 42598.7, 60 sec: 42052.4, 300 sec: 42487.3). Total num frames: 4563517440. Throughput: 0: 42379.6. Samples: 842423900. Policy #0 lag: (min: 1.0, avg: 10.0, max: 21.0) [2024-06-28 18:08:07,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:08:08,495][09423] Updated weights for policy 0, policy_version 278537 (0.0034) [2024-06-28 18:08:12,031][09423] Updated weights for policy 0, policy_version 278547 (0.0034) [2024-06-28 18:08:12,921][09190] Fps is (10 sec: 45874.8, 60 sec: 42598.4, 300 sec: 42487.3). Total num frames: 4563746816. Throughput: 0: 42459.6. Samples: 842553520. Policy #0 lag: (min: 1.0, avg: 10.0, max: 21.0) [2024-06-28 18:08:12,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 18:08:16,838][09423] Updated weights for policy 0, policy_version 278557 (0.0043) [2024-06-28 18:08:17,922][09190] Fps is (10 sec: 39320.7, 60 sec: 41779.0, 300 sec: 42431.8). Total num frames: 4563910656. Throughput: 0: 42434.5. Samples: 842811260. Policy #0 lag: (min: 1.0, avg: 10.0, max: 21.0) [2024-06-28 18:08:17,922][09190] Avg episode reward: [(0, '0.755')] [2024-06-28 18:08:17,935][09190] No heartbeat for components: RolloutWorker_w20 (3636 seconds) [2024-06-28 18:08:17,934][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000278559_4563910656.pth... [2024-06-28 18:08:18,005][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000277941_4553785344.pth [2024-06-28 18:08:19,832][09423] Updated weights for policy 0, policy_version 278567 (0.0030) [2024-06-28 18:08:22,921][09190] Fps is (10 sec: 39322.1, 60 sec: 42052.3, 300 sec: 42432.2). Total num frames: 4564140032. Throughput: 0: 42431.8. Samples: 843060180. Policy #0 lag: (min: 1.0, avg: 10.0, max: 21.0) [2024-06-28 18:08:22,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 18:08:24,159][09423] Updated weights for policy 0, policy_version 278577 (0.0034) [2024-06-28 18:08:27,258][09423] Updated weights for policy 0, policy_version 278587 (0.0032) [2024-06-28 18:08:27,921][09190] Fps is (10 sec: 45875.6, 60 sec: 42325.3, 300 sec: 42431.8). Total num frames: 4564369408. Throughput: 0: 42536.3. Samples: 843192020. Policy #0 lag: (min: 1.0, avg: 10.0, max: 21.0) [2024-06-28 18:08:27,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:08:31,921][09423] Updated weights for policy 0, policy_version 278597 (0.0036) [2024-06-28 18:08:32,921][09190] Fps is (10 sec: 40959.6, 60 sec: 42325.3, 300 sec: 42376.2). Total num frames: 4564549632. Throughput: 0: 42394.4. Samples: 843443420. Policy #0 lag: (min: 1.0, avg: 10.0, max: 21.0) [2024-06-28 18:08:32,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 18:08:35,251][09423] Updated weights for policy 0, policy_version 278607 (0.0031) [2024-06-28 18:08:37,922][09190] Fps is (10 sec: 42598.2, 60 sec: 42325.4, 300 sec: 42487.3). Total num frames: 4564795392. Throughput: 0: 42547.4. Samples: 843699300. Policy #0 lag: (min: 1.0, avg: 10.0, max: 21.0) [2024-06-28 18:08:37,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 18:08:40,020][09423] Updated weights for policy 0, policy_version 278617 (0.0040) [2024-06-28 18:08:42,921][09190] Fps is (10 sec: 45875.0, 60 sec: 42598.3, 300 sec: 42431.8). Total num frames: 4565008384. Throughput: 0: 42526.2. Samples: 843830700. Policy #0 lag: (min: 1.0, avg: 10.0, max: 21.0) [2024-06-28 18:08:42,925][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 18:08:43,085][09423] Updated weights for policy 0, policy_version 278627 (0.0022) [2024-06-28 18:08:47,541][09423] Updated weights for policy 0, policy_version 278637 (0.0034) [2024-06-28 18:08:47,921][09190] Fps is (10 sec: 39322.3, 60 sec: 42052.5, 300 sec: 42431.8). Total num frames: 4565188608. Throughput: 0: 42522.2. Samples: 844084360. Policy #0 lag: (min: 1.0, avg: 10.0, max: 21.0) [2024-06-28 18:08:47,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 18:08:50,951][09423] Updated weights for policy 0, policy_version 278647 (0.0024) [2024-06-28 18:08:52,921][09190] Fps is (10 sec: 42599.0, 60 sec: 42598.4, 300 sec: 42487.3). Total num frames: 4565434368. Throughput: 0: 42319.6. Samples: 844328280. Policy #0 lag: (min: 1.0, avg: 10.0, max: 21.0) [2024-06-28 18:08:52,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 18:08:55,524][09423] Updated weights for policy 0, policy_version 278657 (0.0027) [2024-06-28 18:08:57,921][09190] Fps is (10 sec: 44236.4, 60 sec: 42325.3, 300 sec: 42376.2). Total num frames: 4565630976. Throughput: 0: 42368.0. Samples: 844460080. Policy #0 lag: (min: 1.0, avg: 10.0, max: 21.0) [2024-06-28 18:08:57,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 18:08:58,618][09423] Updated weights for policy 0, policy_version 278667 (0.0030) [2024-06-28 18:09:02,922][09190] Fps is (10 sec: 39320.9, 60 sec: 42325.2, 300 sec: 42431.8). Total num frames: 4565827584. Throughput: 0: 42292.1. Samples: 844714400. Policy #0 lag: (min: 1.0, avg: 10.0, max: 21.0) [2024-06-28 18:09:02,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 18:09:03,326][09423] Updated weights for policy 0, policy_version 278677 (0.0041) [2024-06-28 18:09:06,443][09423] Updated weights for policy 0, policy_version 278687 (0.0033) [2024-06-28 18:09:07,921][09190] Fps is (10 sec: 42598.2, 60 sec: 42325.2, 300 sec: 42487.3). Total num frames: 4566056960. Throughput: 0: 42439.8. Samples: 844969980. Policy #0 lag: (min: 1.0, avg: 10.0, max: 21.0) [2024-06-28 18:09:07,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 18:09:10,648][09423] Updated weights for policy 0, policy_version 278697 (0.0028) [2024-06-28 18:09:11,664][09403] Signal inference workers to stop experience collection... (11600 times) [2024-06-28 18:09:11,707][09423] InferenceWorker_p0-w0: stopping experience collection (11600 times) [2024-06-28 18:09:11,718][09403] Signal inference workers to resume experience collection... (11600 times) [2024-06-28 18:09:11,722][09423] InferenceWorker_p0-w0: resuming experience collection (11600 times) [2024-06-28 18:09:12,922][09190] Fps is (10 sec: 45875.1, 60 sec: 42325.3, 300 sec: 42431.8). Total num frames: 4566286336. Throughput: 0: 42425.7. Samples: 845101180. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 18:09:12,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 18:09:14,292][09423] Updated weights for policy 0, policy_version 278707 (0.0028) [2024-06-28 18:09:17,921][09190] Fps is (10 sec: 40960.3, 60 sec: 42598.5, 300 sec: 42431.8). Total num frames: 4566466560. Throughput: 0: 42564.0. Samples: 845358800. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 18:09:17,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 18:09:18,297][09423] Updated weights for policy 0, policy_version 278717 (0.0025) [2024-06-28 18:09:21,571][09423] Updated weights for policy 0, policy_version 278727 (0.0031) [2024-06-28 18:09:22,921][09190] Fps is (10 sec: 42599.0, 60 sec: 42871.4, 300 sec: 42487.3). Total num frames: 4566712320. Throughput: 0: 42485.0. Samples: 845611120. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 18:09:22,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:09:25,881][09423] Updated weights for policy 0, policy_version 278737 (0.0030) [2024-06-28 18:09:27,921][09190] Fps is (10 sec: 45874.9, 60 sec: 42598.4, 300 sec: 42431.8). Total num frames: 4566925312. Throughput: 0: 42465.8. Samples: 845741660. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 18:09:27,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 18:09:29,326][09423] Updated weights for policy 0, policy_version 278747 (0.0034) [2024-06-28 18:09:32,921][09190] Fps is (10 sec: 39321.7, 60 sec: 42598.5, 300 sec: 42431.8). Total num frames: 4567105536. Throughput: 0: 42429.3. Samples: 845993680. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 18:09:32,922][09190] Avg episode reward: [(0, '0.735')] [2024-06-28 18:09:33,339][09423] Updated weights for policy 0, policy_version 278757 (0.0034) [2024-06-28 18:09:37,071][09423] Updated weights for policy 0, policy_version 278767 (0.0034) [2024-06-28 18:09:37,921][09190] Fps is (10 sec: 39322.2, 60 sec: 42052.4, 300 sec: 42431.8). Total num frames: 4567318528. Throughput: 0: 42724.4. Samples: 846250880. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 18:09:37,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 18:09:41,202][09423] Updated weights for policy 0, policy_version 278777 (0.0030) [2024-06-28 18:09:42,921][09190] Fps is (10 sec: 45874.7, 60 sec: 42598.4, 300 sec: 42431.8). Total num frames: 4567564288. Throughput: 0: 42646.2. Samples: 846379160. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 18:09:42,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 18:09:45,268][09423] Updated weights for policy 0, policy_version 278787 (0.0030) [2024-06-28 18:09:47,921][09190] Fps is (10 sec: 44236.1, 60 sec: 42871.4, 300 sec: 42487.3). Total num frames: 4567760896. Throughput: 0: 42635.1. Samples: 846632980. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 18:09:47,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:09:48,985][09423] Updated weights for policy 0, policy_version 278797 (0.0035) [2024-06-28 18:09:52,719][09423] Updated weights for policy 0, policy_version 278807 (0.0026) [2024-06-28 18:09:52,922][09190] Fps is (10 sec: 40959.6, 60 sec: 42325.2, 300 sec: 42487.3). Total num frames: 4567973888. Throughput: 0: 42562.2. Samples: 846885280. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 18:09:52,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 18:09:56,418][09423] Updated weights for policy 0, policy_version 278817 (0.0026) [2024-06-28 18:09:57,921][09190] Fps is (10 sec: 42599.2, 60 sec: 42598.5, 300 sec: 42376.3). Total num frames: 4568186880. Throughput: 0: 42567.3. Samples: 847016700. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 18:09:57,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 18:10:00,536][09423] Updated weights for policy 0, policy_version 278827 (0.0033) [2024-06-28 18:10:02,921][09190] Fps is (10 sec: 40960.7, 60 sec: 42598.5, 300 sec: 42431.8). Total num frames: 4568383488. Throughput: 0: 42416.9. Samples: 847267560. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 18:10:02,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 18:10:04,220][09423] Updated weights for policy 0, policy_version 278837 (0.0044) [2024-06-28 18:10:07,921][09190] Fps is (10 sec: 40959.7, 60 sec: 42325.4, 300 sec: 42376.2). Total num frames: 4568596480. Throughput: 0: 42433.3. Samples: 847520620. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 18:10:07,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 18:10:08,186][09423] Updated weights for policy 0, policy_version 278847 (0.0036) [2024-06-28 18:10:12,345][09423] Updated weights for policy 0, policy_version 278857 (0.0024) [2024-06-28 18:10:12,924][09190] Fps is (10 sec: 45863.7, 60 sec: 42596.7, 300 sec: 42487.0). Total num frames: 4568842240. Throughput: 0: 42390.6. Samples: 847649340. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 18:10:12,924][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 18:10:15,824][09423] Updated weights for policy 0, policy_version 278867 (0.0039) [2024-06-28 18:10:17,922][09190] Fps is (10 sec: 42597.7, 60 sec: 42598.3, 300 sec: 42431.8). Total num frames: 4569022464. Throughput: 0: 42402.9. Samples: 847901820. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 18:10:17,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 18:10:17,943][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000278871_4569022464.pth... [2024-06-28 18:10:18,001][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000278252_4558880768.pth [2024-06-28 18:10:19,650][09423] Updated weights for policy 0, policy_version 278877 (0.0028) [2024-06-28 18:10:22,921][09190] Fps is (10 sec: 39331.5, 60 sec: 42052.3, 300 sec: 42431.8). Total num frames: 4569235456. Throughput: 0: 42160.8. Samples: 848148120. Policy #0 lag: (min: 0.0, avg: 9.5, max: 21.0) [2024-06-28 18:10:22,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 18:10:23,759][09423] Updated weights for policy 0, policy_version 278887 (0.0033) [2024-06-28 18:10:27,254][09423] Updated weights for policy 0, policy_version 278897 (0.0029) [2024-06-28 18:10:27,921][09190] Fps is (10 sec: 42598.9, 60 sec: 42052.3, 300 sec: 42376.2). Total num frames: 4569448448. Throughput: 0: 42271.6. Samples: 848281380. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 18:10:27,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 18:10:31,135][09423] Updated weights for policy 0, policy_version 278907 (0.0029) [2024-06-28 18:10:32,922][09190] Fps is (10 sec: 40959.5, 60 sec: 42325.2, 300 sec: 42320.7). Total num frames: 4569645056. Throughput: 0: 42238.7. Samples: 848533720. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 18:10:32,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 18:10:34,366][09403] Signal inference workers to stop experience collection... (11650 times) [2024-06-28 18:10:34,369][09403] Signal inference workers to resume experience collection... (11650 times) [2024-06-28 18:10:34,387][09423] InferenceWorker_p0-w0: stopping experience collection (11650 times) [2024-06-28 18:10:34,409][09423] InferenceWorker_p0-w0: resuming experience collection (11650 times) [2024-06-28 18:10:35,002][09423] Updated weights for policy 0, policy_version 278917 (0.0029) [2024-06-28 18:10:37,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42598.3, 300 sec: 42431.8). Total num frames: 4569874432. Throughput: 0: 42317.9. Samples: 848789580. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 18:10:37,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 18:10:38,816][09423] Updated weights for policy 0, policy_version 278927 (0.0039) [2024-06-28 18:10:42,705][09423] Updated weights for policy 0, policy_version 278937 (0.0035) [2024-06-28 18:10:42,924][09190] Fps is (10 sec: 45864.2, 60 sec: 42323.6, 300 sec: 42432.4). Total num frames: 4570103808. Throughput: 0: 42329.5. Samples: 848921640. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 18:10:42,925][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 18:10:46,691][09423] Updated weights for policy 0, policy_version 278947 (0.0035) [2024-06-28 18:10:47,922][09190] Fps is (10 sec: 40959.7, 60 sec: 42052.2, 300 sec: 42320.7). Total num frames: 4570284032. Throughput: 0: 42194.5. Samples: 849166320. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 18:10:47,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 18:10:50,436][09423] Updated weights for policy 0, policy_version 278957 (0.0034) [2024-06-28 18:10:52,921][09190] Fps is (10 sec: 40970.2, 60 sec: 42325.4, 300 sec: 42431.8). Total num frames: 4570513408. Throughput: 0: 42310.2. Samples: 849424580. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 18:10:52,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 18:10:54,484][09423] Updated weights for policy 0, policy_version 278967 (0.0029) [2024-06-28 18:10:57,921][09190] Fps is (10 sec: 44237.1, 60 sec: 42325.2, 300 sec: 42376.2). Total num frames: 4570726400. Throughput: 0: 42277.4. Samples: 849551720. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 18:10:57,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 18:10:58,578][09423] Updated weights for policy 0, policy_version 278977 (0.0027) [2024-06-28 18:11:02,756][09423] Updated weights for policy 0, policy_version 278987 (0.0033) [2024-06-28 18:11:02,921][09190] Fps is (10 sec: 40959.6, 60 sec: 42325.3, 300 sec: 42320.7). Total num frames: 4570923008. Throughput: 0: 42140.5. Samples: 849798140. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 18:11:02,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 18:11:05,841][09423] Updated weights for policy 0, policy_version 278997 (0.0041) [2024-06-28 18:11:07,921][09190] Fps is (10 sec: 42598.6, 60 sec: 42598.4, 300 sec: 42431.8). Total num frames: 4571152384. Throughput: 0: 42425.3. Samples: 850057260. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 18:11:07,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 18:11:10,647][09423] Updated weights for policy 0, policy_version 279007 (0.0033) [2024-06-28 18:11:12,921][09190] Fps is (10 sec: 44237.2, 60 sec: 42054.0, 300 sec: 42376.5). Total num frames: 4571365376. Throughput: 0: 42377.8. Samples: 850188380. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 18:11:12,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 18:11:14,110][09423] Updated weights for policy 0, policy_version 279017 (0.0037) [2024-06-28 18:11:17,921][09190] Fps is (10 sec: 40959.7, 60 sec: 42325.4, 300 sec: 42376.2). Total num frames: 4571561984. Throughput: 0: 42312.5. Samples: 850437780. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 18:11:17,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 18:11:17,945][09190] No heartbeat for components: RolloutWorker_w20 (3816 seconds) [2024-06-28 18:11:18,625][09423] Updated weights for policy 0, policy_version 279027 (0.0035) [2024-06-28 18:11:21,472][09423] Updated weights for policy 0, policy_version 279037 (0.0032) [2024-06-28 18:11:22,921][09190] Fps is (10 sec: 42598.2, 60 sec: 42598.4, 300 sec: 42487.3). Total num frames: 4571791360. Throughput: 0: 42202.2. Samples: 850688680. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 18:11:22,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 18:11:26,197][09423] Updated weights for policy 0, policy_version 279047 (0.0031) [2024-06-28 18:11:27,922][09190] Fps is (10 sec: 44236.3, 60 sec: 42598.3, 300 sec: 42431.8). Total num frames: 4572004352. Throughput: 0: 42349.7. Samples: 850827280. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 18:11:27,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 18:11:29,121][09423] Updated weights for policy 0, policy_version 279057 (0.0036) [2024-06-28 18:11:32,921][09190] Fps is (10 sec: 40960.1, 60 sec: 42598.5, 300 sec: 42320.7). Total num frames: 4572200960. Throughput: 0: 42545.9. Samples: 851080880. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 18:11:32,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:11:33,593][09423] Updated weights for policy 0, policy_version 279067 (0.0044) [2024-06-28 18:11:36,938][09423] Updated weights for policy 0, policy_version 279077 (0.0034) [2024-06-28 18:11:37,921][09190] Fps is (10 sec: 42599.2, 60 sec: 42598.4, 300 sec: 42431.8). Total num frames: 4572430336. Throughput: 0: 42450.2. Samples: 851334840. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 18:11:37,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 18:11:41,518][09423] Updated weights for policy 0, policy_version 279087 (0.0038) [2024-06-28 18:11:42,922][09190] Fps is (10 sec: 44236.3, 60 sec: 42327.0, 300 sec: 42431.8). Total num frames: 4572643328. Throughput: 0: 42522.6. Samples: 851465240. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 18:11:42,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 18:11:44,346][09423] Updated weights for policy 0, policy_version 279097 (0.0026) [2024-06-28 18:11:47,921][09190] Fps is (10 sec: 39321.5, 60 sec: 42325.4, 300 sec: 42320.7). Total num frames: 4572823552. Throughput: 0: 42717.4. Samples: 851720420. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 18:11:47,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 18:11:49,244][09423] Updated weights for policy 0, policy_version 279107 (0.0030) [2024-06-28 18:11:51,881][09423] Updated weights for policy 0, policy_version 279117 (0.0033) [2024-06-28 18:11:52,921][09190] Fps is (10 sec: 42599.6, 60 sec: 42598.5, 300 sec: 42432.1). Total num frames: 4573069312. Throughput: 0: 42511.2. Samples: 851970260. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 18:11:52,921][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 18:11:56,868][09423] Updated weights for policy 0, policy_version 279127 (0.0027) [2024-06-28 18:11:57,459][09403] Signal inference workers to stop experience collection... (11700 times) [2024-06-28 18:11:57,460][09403] Signal inference workers to resume experience collection... (11700 times) [2024-06-28 18:11:57,480][09423] InferenceWorker_p0-w0: stopping experience collection (11700 times) [2024-06-28 18:11:57,510][09423] InferenceWorker_p0-w0: resuming experience collection (11700 times) [2024-06-28 18:11:57,924][09190] Fps is (10 sec: 45863.7, 60 sec: 42596.7, 300 sec: 42486.9). Total num frames: 4573282304. Throughput: 0: 42607.4. Samples: 852105820. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 18:11:57,924][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 18:11:59,785][09423] Updated weights for policy 0, policy_version 279137 (0.0035) [2024-06-28 18:12:02,921][09190] Fps is (10 sec: 40959.7, 60 sec: 42598.5, 300 sec: 42320.7). Total num frames: 4573478912. Throughput: 0: 42619.7. Samples: 852355660. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 18:12:02,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 18:12:04,434][09423] Updated weights for policy 0, policy_version 279147 (0.0038) [2024-06-28 18:12:07,616][09423] Updated weights for policy 0, policy_version 279157 (0.0031) [2024-06-28 18:12:07,921][09190] Fps is (10 sec: 44248.0, 60 sec: 42871.5, 300 sec: 42487.3). Total num frames: 4573724672. Throughput: 0: 42683.6. Samples: 852609440. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 18:12:07,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 18:12:12,305][09423] Updated weights for policy 0, policy_version 279167 (0.0043) [2024-06-28 18:12:12,921][09190] Fps is (10 sec: 42598.0, 60 sec: 42325.3, 300 sec: 42376.2). Total num frames: 4573904896. Throughput: 0: 42553.5. Samples: 852742180. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 18:12:12,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 18:12:15,120][09423] Updated weights for policy 0, policy_version 279177 (0.0025) [2024-06-28 18:12:17,922][09190] Fps is (10 sec: 39321.1, 60 sec: 42598.4, 300 sec: 42376.2). Total num frames: 4574117888. Throughput: 0: 42458.1. Samples: 852991500. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 18:12:17,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 18:12:17,933][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000279182_4574117888.pth... [2024-06-28 18:12:17,991][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000278559_4563910656.pth [2024-06-28 18:12:19,651][09423] Updated weights for policy 0, policy_version 279187 (0.0034) [2024-06-28 18:12:22,545][09423] Updated weights for policy 0, policy_version 279197 (0.0039) [2024-06-28 18:12:22,922][09190] Fps is (10 sec: 45874.9, 60 sec: 42871.4, 300 sec: 42487.3). Total num frames: 4574363648. Throughput: 0: 42183.0. Samples: 853233080. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 18:12:22,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 18:12:27,728][09423] Updated weights for policy 0, policy_version 279207 (0.0038) [2024-06-28 18:12:27,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42052.3, 300 sec: 42431.8). Total num frames: 4574527488. Throughput: 0: 42467.2. Samples: 853376260. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 18:12:27,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:12:30,548][09423] Updated weights for policy 0, policy_version 279217 (0.0033) [2024-06-28 18:12:32,921][09190] Fps is (10 sec: 36044.9, 60 sec: 42052.2, 300 sec: 42265.2). Total num frames: 4574724096. Throughput: 0: 42341.3. Samples: 853625780. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 18:12:32,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 18:12:35,129][09423] Updated weights for policy 0, policy_version 279227 (0.0032) [2024-06-28 18:12:37,924][09190] Fps is (10 sec: 45863.8, 60 sec: 42596.6, 300 sec: 42487.0). Total num frames: 4574986240. Throughput: 0: 42192.6. Samples: 853869040. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 18:12:37,933][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:12:38,427][09423] Updated weights for policy 0, policy_version 279237 (0.0028) [2024-06-28 18:12:42,651][09423] Updated weights for policy 0, policy_version 279247 (0.0039) [2024-06-28 18:12:42,921][09190] Fps is (10 sec: 45875.5, 60 sec: 42325.4, 300 sec: 42431.8). Total num frames: 4575182848. Throughput: 0: 42345.0. Samples: 854011240. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 18:12:42,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 18:12:46,533][09423] Updated weights for policy 0, policy_version 279257 (0.0041) [2024-06-28 18:12:47,921][09190] Fps is (10 sec: 37692.7, 60 sec: 42325.3, 300 sec: 42320.7). Total num frames: 4575363072. Throughput: 0: 42259.9. Samples: 854257360. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 18:12:47,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 18:12:50,617][09423] Updated weights for policy 0, policy_version 279267 (0.0039) [2024-06-28 18:12:52,921][09190] Fps is (10 sec: 44237.0, 60 sec: 42598.3, 300 sec: 42487.3). Total num frames: 4575625216. Throughput: 0: 42230.2. Samples: 854509800. Policy #0 lag: (min: 0.0, avg: 7.6, max: 20.0) [2024-06-28 18:12:52,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 18:12:54,323][09423] Updated weights for policy 0, policy_version 279277 (0.0032) [2024-06-28 18:12:57,922][09190] Fps is (10 sec: 44236.4, 60 sec: 42053.9, 300 sec: 42431.8). Total num frames: 4575805440. Throughput: 0: 42283.9. Samples: 854644960. Policy #0 lag: (min: 0.0, avg: 7.6, max: 20.0) [2024-06-28 18:12:57,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 18:12:58,570][09423] Updated weights for policy 0, policy_version 279287 (0.0026) [2024-06-28 18:13:02,299][09423] Updated weights for policy 0, policy_version 279297 (0.0030) [2024-06-28 18:13:02,922][09190] Fps is (10 sec: 37682.7, 60 sec: 42052.2, 300 sec: 42320.7). Total num frames: 4576002048. Throughput: 0: 42271.6. Samples: 854893720. Policy #0 lag: (min: 0.0, avg: 7.6, max: 20.0) [2024-06-28 18:13:02,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 18:13:06,024][09423] Updated weights for policy 0, policy_version 279307 (0.0037) [2024-06-28 18:13:07,921][09190] Fps is (10 sec: 45875.9, 60 sec: 42325.3, 300 sec: 42431.8). Total num frames: 4576264192. Throughput: 0: 42486.4. Samples: 855144960. Policy #0 lag: (min: 0.0, avg: 7.6, max: 20.0) [2024-06-28 18:13:07,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 18:13:09,544][09403] Signal inference workers to stop experience collection... (11750 times) [2024-06-28 18:13:09,588][09423] InferenceWorker_p0-w0: stopping experience collection (11750 times) [2024-06-28 18:13:09,593][09403] Signal inference workers to resume experience collection... (11750 times) [2024-06-28 18:13:09,600][09423] InferenceWorker_p0-w0: resuming experience collection (11750 times) [2024-06-28 18:13:09,739][09423] Updated weights for policy 0, policy_version 279317 (0.0034) [2024-06-28 18:13:12,921][09190] Fps is (10 sec: 44237.3, 60 sec: 42325.4, 300 sec: 42487.3). Total num frames: 4576444416. Throughput: 0: 42447.2. Samples: 855286380. Policy #0 lag: (min: 0.0, avg: 7.6, max: 20.0) [2024-06-28 18:13:12,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 18:13:13,308][09423] Updated weights for policy 0, policy_version 279327 (0.0033) [2024-06-28 18:13:17,924][09190] Fps is (10 sec: 37673.3, 60 sec: 42050.5, 300 sec: 42375.9). Total num frames: 4576641024. Throughput: 0: 42452.7. Samples: 855536260. Policy #0 lag: (min: 0.0, avg: 7.6, max: 20.0) [2024-06-28 18:13:17,925][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 18:13:18,091][09423] Updated weights for policy 0, policy_version 279337 (0.0026) [2024-06-28 18:13:20,986][09423] Updated weights for policy 0, policy_version 279347 (0.0034) [2024-06-28 18:13:22,921][09190] Fps is (10 sec: 45875.0, 60 sec: 42325.4, 300 sec: 42487.3). Total num frames: 4576903168. Throughput: 0: 42525.9. Samples: 855782600. Policy #0 lag: (min: 0.0, avg: 7.6, max: 20.0) [2024-06-28 18:13:22,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 18:13:25,547][09423] Updated weights for policy 0, policy_version 279357 (0.0039) [2024-06-28 18:13:27,921][09190] Fps is (10 sec: 45887.3, 60 sec: 42871.5, 300 sec: 42542.9). Total num frames: 4577099776. Throughput: 0: 42430.7. Samples: 855920620. Policy #0 lag: (min: 0.0, avg: 7.6, max: 20.0) [2024-06-28 18:13:27,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:13:28,797][09423] Updated weights for policy 0, policy_version 279367 (0.0040) [2024-06-28 18:13:32,921][09190] Fps is (10 sec: 37683.1, 60 sec: 42598.4, 300 sec: 42320.7). Total num frames: 4577280000. Throughput: 0: 42625.7. Samples: 856175520. Policy #0 lag: (min: 0.0, avg: 7.6, max: 20.0) [2024-06-28 18:13:32,924][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 18:13:33,325][09423] Updated weights for policy 0, policy_version 279377 (0.0027) [2024-06-28 18:13:36,507][09423] Updated weights for policy 0, policy_version 279387 (0.0032) [2024-06-28 18:13:37,922][09190] Fps is (10 sec: 44235.9, 60 sec: 42600.1, 300 sec: 42487.3). Total num frames: 4577542144. Throughput: 0: 42560.7. Samples: 856425040. Policy #0 lag: (min: 0.0, avg: 7.6, max: 20.0) [2024-06-28 18:13:37,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 18:13:40,988][09423] Updated weights for policy 0, policy_version 279397 (0.0036) [2024-06-28 18:13:42,921][09190] Fps is (10 sec: 45875.8, 60 sec: 42598.4, 300 sec: 42542.9). Total num frames: 4577738752. Throughput: 0: 42565.5. Samples: 856560400. Policy #0 lag: (min: 0.0, avg: 7.6, max: 20.0) [2024-06-28 18:13:42,922][09190] Avg episode reward: [(0, '0.735')] [2024-06-28 18:13:43,918][09423] Updated weights for policy 0, policy_version 279407 (0.0033) [2024-06-28 18:13:47,921][09190] Fps is (10 sec: 37683.5, 60 sec: 42598.4, 300 sec: 42320.7). Total num frames: 4577918976. Throughput: 0: 42694.3. Samples: 856814960. Policy #0 lag: (min: 0.0, avg: 7.6, max: 20.0) [2024-06-28 18:13:47,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 18:13:48,579][09423] Updated weights for policy 0, policy_version 279417 (0.0031) [2024-06-28 18:13:51,401][09423] Updated weights for policy 0, policy_version 279427 (0.0031) [2024-06-28 18:13:52,921][09190] Fps is (10 sec: 44236.5, 60 sec: 42598.4, 300 sec: 42542.9). Total num frames: 4578181120. Throughput: 0: 42755.9. Samples: 857068980. Policy #0 lag: (min: 0.0, avg: 7.6, max: 20.0) [2024-06-28 18:13:52,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 18:13:56,020][09423] Updated weights for policy 0, policy_version 279437 (0.0026) [2024-06-28 18:13:57,922][09190] Fps is (10 sec: 45874.9, 60 sec: 42871.5, 300 sec: 42542.9). Total num frames: 4578377728. Throughput: 0: 42685.2. Samples: 857207220. Policy #0 lag: (min: 0.0, avg: 7.6, max: 20.0) [2024-06-28 18:13:57,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:13:59,303][09423] Updated weights for policy 0, policy_version 279447 (0.0034) [2024-06-28 18:14:02,921][09190] Fps is (10 sec: 39321.8, 60 sec: 42871.6, 300 sec: 42431.8). Total num frames: 4578574336. Throughput: 0: 42755.4. Samples: 857460140. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 18:14:02,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 18:14:03,702][09423] Updated weights for policy 0, policy_version 279457 (0.0035) [2024-06-28 18:14:06,860][09423] Updated weights for policy 0, policy_version 279467 (0.0036) [2024-06-28 18:14:07,921][09190] Fps is (10 sec: 44237.2, 60 sec: 42598.3, 300 sec: 42487.3). Total num frames: 4578820096. Throughput: 0: 42792.5. Samples: 857708260. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 18:14:07,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 18:14:11,419][09423] Updated weights for policy 0, policy_version 279477 (0.0031) [2024-06-28 18:14:12,921][09190] Fps is (10 sec: 44236.6, 60 sec: 42871.4, 300 sec: 42542.9). Total num frames: 4579016704. Throughput: 0: 42741.7. Samples: 857844000. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 18:14:12,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 18:14:14,417][09423] Updated weights for policy 0, policy_version 279487 (0.0035) [2024-06-28 18:14:17,921][09190] Fps is (10 sec: 39321.3, 60 sec: 42873.2, 300 sec: 42376.2). Total num frames: 4579213312. Throughput: 0: 42739.1. Samples: 858098780. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 18:14:17,922][09190] Avg episode reward: [(0, '0.728')] [2024-06-28 18:14:17,935][09190] No heartbeat for components: RolloutWorker_w20 (3996 seconds) [2024-06-28 18:14:17,936][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000279493_4579213312.pth... [2024-06-28 18:14:18,008][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000278871_4569022464.pth [2024-06-28 18:14:19,201][09423] Updated weights for policy 0, policy_version 279497 (0.0037) [2024-06-28 18:14:22,070][09423] Updated weights for policy 0, policy_version 279507 (0.0033) [2024-06-28 18:14:22,924][09190] Fps is (10 sec: 44225.9, 60 sec: 42596.7, 300 sec: 42487.0). Total num frames: 4579459072. Throughput: 0: 42707.1. Samples: 858346960. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 18:14:22,924][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 18:14:26,945][09423] Updated weights for policy 0, policy_version 279517 (0.0033) [2024-06-28 18:14:27,921][09190] Fps is (10 sec: 44237.2, 60 sec: 42598.3, 300 sec: 42542.8). Total num frames: 4579655680. Throughput: 0: 42759.5. Samples: 858484580. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 18:14:27,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 18:14:28,939][09403] Signal inference workers to stop experience collection... (11800 times) [2024-06-28 18:14:28,939][09403] Signal inference workers to resume experience collection... (11800 times) [2024-06-28 18:14:28,987][09423] InferenceWorker_p0-w0: stopping experience collection (11800 times) [2024-06-28 18:14:28,987][09423] InferenceWorker_p0-w0: resuming experience collection (11800 times) [2024-06-28 18:14:29,833][09423] Updated weights for policy 0, policy_version 279527 (0.0038) [2024-06-28 18:14:32,921][09190] Fps is (10 sec: 40969.8, 60 sec: 43144.5, 300 sec: 42542.8). Total num frames: 4579868672. Throughput: 0: 42803.5. Samples: 858741120. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 18:14:32,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 18:14:34,738][09423] Updated weights for policy 0, policy_version 279537 (0.0036) [2024-06-28 18:14:37,620][09423] Updated weights for policy 0, policy_version 279547 (0.0029) [2024-06-28 18:14:37,928][09190] Fps is (10 sec: 44208.0, 60 sec: 42593.9, 300 sec: 42486.4). Total num frames: 4580098048. Throughput: 0: 42569.9. Samples: 858984900. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 18:14:37,928][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 18:14:42,413][09423] Updated weights for policy 0, policy_version 279557 (0.0041) [2024-06-28 18:14:42,921][09190] Fps is (10 sec: 40960.6, 60 sec: 42325.3, 300 sec: 42431.8). Total num frames: 4580278272. Throughput: 0: 42537.9. Samples: 859121420. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 18:14:42,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 18:14:45,601][09423] Updated weights for policy 0, policy_version 279567 (0.0032) [2024-06-28 18:14:47,921][09190] Fps is (10 sec: 39347.6, 60 sec: 42871.6, 300 sec: 42431.8). Total num frames: 4580491264. Throughput: 0: 42590.3. Samples: 859376700. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 18:14:47,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 18:14:50,018][09423] Updated weights for policy 0, policy_version 279577 (0.0031) [2024-06-28 18:14:52,921][09190] Fps is (10 sec: 44236.3, 60 sec: 42325.3, 300 sec: 42487.3). Total num frames: 4580720640. Throughput: 0: 42591.1. Samples: 859624860. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 18:14:52,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 18:14:53,677][09423] Updated weights for policy 0, policy_version 279587 (0.0032) [2024-06-28 18:14:57,372][09423] Updated weights for policy 0, policy_version 279597 (0.0027) [2024-06-28 18:14:57,922][09190] Fps is (10 sec: 42597.4, 60 sec: 42325.3, 300 sec: 42487.3). Total num frames: 4580917248. Throughput: 0: 42488.8. Samples: 859756000. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 18:14:57,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 18:15:01,045][09423] Updated weights for policy 0, policy_version 279607 (0.0031) [2024-06-28 18:15:02,922][09190] Fps is (10 sec: 40959.8, 60 sec: 42598.3, 300 sec: 42487.3). Total num frames: 4581130240. Throughput: 0: 42570.2. Samples: 860014440. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 18:15:02,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 18:15:05,215][09423] Updated weights for policy 0, policy_version 279617 (0.0037) [2024-06-28 18:15:07,921][09190] Fps is (10 sec: 45875.3, 60 sec: 42598.3, 300 sec: 42487.7). Total num frames: 4581376000. Throughput: 0: 42624.0. Samples: 860264940. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 18:15:07,922][09190] Avg episode reward: [(0, '0.738')] [2024-06-28 18:15:09,085][09423] Updated weights for policy 0, policy_version 279627 (0.0032) [2024-06-28 18:15:12,852][09423] Updated weights for policy 0, policy_version 279637 (0.0028) [2024-06-28 18:15:12,921][09190] Fps is (10 sec: 44237.5, 60 sec: 42598.4, 300 sec: 42542.9). Total num frames: 4581572608. Throughput: 0: 42457.8. Samples: 860395180. Policy #0 lag: (min: 0.0, avg: 11.2, max: 22.0) [2024-06-28 18:15:12,922][09190] Avg episode reward: [(0, '0.734')] [2024-06-28 18:15:16,839][09423] Updated weights for policy 0, policy_version 279647 (0.0032) [2024-06-28 18:15:17,921][09190] Fps is (10 sec: 40960.5, 60 sec: 42871.5, 300 sec: 42542.9). Total num frames: 4581785600. Throughput: 0: 42461.0. Samples: 860651860. Policy #0 lag: (min: 1.0, avg: 10.8, max: 20.0) [2024-06-28 18:15:17,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 18:15:20,701][09423] Updated weights for policy 0, policy_version 279657 (0.0024) [2024-06-28 18:15:22,921][09190] Fps is (10 sec: 42597.8, 60 sec: 42327.0, 300 sec: 42542.8). Total num frames: 4581998592. Throughput: 0: 42550.5. Samples: 860899400. Policy #0 lag: (min: 1.0, avg: 10.8, max: 20.0) [2024-06-28 18:15:22,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 18:15:24,589][09423] Updated weights for policy 0, policy_version 279667 (0.0029) [2024-06-28 18:15:27,921][09190] Fps is (10 sec: 40959.8, 60 sec: 42325.3, 300 sec: 42542.9). Total num frames: 4582195200. Throughput: 0: 42434.6. Samples: 861030980. Policy #0 lag: (min: 1.0, avg: 10.8, max: 20.0) [2024-06-28 18:15:27,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:15:28,115][09423] Updated weights for policy 0, policy_version 279677 (0.0029) [2024-06-28 18:15:32,014][09423] Updated weights for policy 0, policy_version 279687 (0.0031) [2024-06-28 18:15:32,921][09190] Fps is (10 sec: 40960.3, 60 sec: 42325.4, 300 sec: 42487.3). Total num frames: 4582408192. Throughput: 0: 42341.7. Samples: 861282080. Policy #0 lag: (min: 1.0, avg: 10.8, max: 20.0) [2024-06-28 18:15:32,922][09190] Avg episode reward: [(0, '0.733')] [2024-06-28 18:15:36,167][09423] Updated weights for policy 0, policy_version 279697 (0.0045) [2024-06-28 18:15:37,921][09190] Fps is (10 sec: 44236.5, 60 sec: 42329.9, 300 sec: 42487.7). Total num frames: 4582637568. Throughput: 0: 42397.7. Samples: 861532760. Policy #0 lag: (min: 1.0, avg: 10.8, max: 20.0) [2024-06-28 18:15:37,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:15:40,043][09423] Updated weights for policy 0, policy_version 279707 (0.0036) [2024-06-28 18:15:42,923][09190] Fps is (10 sec: 42591.6, 60 sec: 42597.2, 300 sec: 42542.6). Total num frames: 4582834176. Throughput: 0: 42506.6. Samples: 861668860. Policy #0 lag: (min: 1.0, avg: 10.8, max: 20.0) [2024-06-28 18:15:42,923][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 18:15:43,547][09423] Updated weights for policy 0, policy_version 279717 (0.0030) [2024-06-28 18:15:47,807][09423] Updated weights for policy 0, policy_version 279727 (0.0027) [2024-06-28 18:15:47,923][09190] Fps is (10 sec: 40955.7, 60 sec: 42597.5, 300 sec: 42487.2). Total num frames: 4583047168. Throughput: 0: 42388.8. Samples: 861921980. Policy #0 lag: (min: 1.0, avg: 10.8, max: 20.0) [2024-06-28 18:15:47,923][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 18:15:51,198][09423] Updated weights for policy 0, policy_version 279737 (0.0039) [2024-06-28 18:15:52,921][09190] Fps is (10 sec: 42605.0, 60 sec: 42325.3, 300 sec: 42487.3). Total num frames: 4583260160. Throughput: 0: 42522.7. Samples: 862178460. Policy #0 lag: (min: 1.0, avg: 10.8, max: 20.0) [2024-06-28 18:15:52,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 18:15:55,547][09423] Updated weights for policy 0, policy_version 279747 (0.0041) [2024-06-28 18:15:57,924][09190] Fps is (10 sec: 44230.8, 60 sec: 42869.8, 300 sec: 42598.1). Total num frames: 4583489536. Throughput: 0: 42540.7. Samples: 862309620. Policy #0 lag: (min: 1.0, avg: 10.8, max: 20.0) [2024-06-28 18:15:57,924][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 18:15:58,840][09423] Updated weights for policy 0, policy_version 279757 (0.0032) [2024-06-28 18:16:02,921][09190] Fps is (10 sec: 42599.2, 60 sec: 42598.6, 300 sec: 42487.3). Total num frames: 4583686144. Throughput: 0: 42486.3. Samples: 862563740. Policy #0 lag: (min: 1.0, avg: 10.8, max: 20.0) [2024-06-28 18:16:02,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 18:16:02,927][09423] Updated weights for policy 0, policy_version 279767 (0.0038) [2024-06-28 18:16:06,761][09423] Updated weights for policy 0, policy_version 279777 (0.0041) [2024-06-28 18:16:07,921][09190] Fps is (10 sec: 40970.3, 60 sec: 42052.3, 300 sec: 42487.3). Total num frames: 4583899136. Throughput: 0: 42573.0. Samples: 862815180. Policy #0 lag: (min: 1.0, avg: 10.8, max: 20.0) [2024-06-28 18:16:07,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 18:16:08,396][09403] Signal inference workers to stop experience collection... (11850 times) [2024-06-28 18:16:08,396][09403] Signal inference workers to resume experience collection... (11850 times) [2024-06-28 18:16:08,420][09423] InferenceWorker_p0-w0: stopping experience collection (11850 times) [2024-06-28 18:16:08,420][09423] InferenceWorker_p0-w0: resuming experience collection (11850 times) [2024-06-28 18:16:10,570][09423] Updated weights for policy 0, policy_version 279787 (0.0036) [2024-06-28 18:16:12,921][09190] Fps is (10 sec: 42598.1, 60 sec: 42325.3, 300 sec: 42542.9). Total num frames: 4584112128. Throughput: 0: 42492.1. Samples: 862943120. Policy #0 lag: (min: 1.0, avg: 10.8, max: 20.0) [2024-06-28 18:16:12,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 18:16:14,434][09423] Updated weights for policy 0, policy_version 279797 (0.0026) [2024-06-28 18:16:17,921][09190] Fps is (10 sec: 42598.0, 60 sec: 42325.2, 300 sec: 42487.3). Total num frames: 4584325120. Throughput: 0: 42565.2. Samples: 863197520. Policy #0 lag: (min: 1.0, avg: 10.8, max: 20.0) [2024-06-28 18:16:17,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 18:16:17,932][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000279805_4584325120.pth... [2024-06-28 18:16:17,986][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000279182_4574117888.pth [2024-06-28 18:16:18,718][09423] Updated weights for policy 0, policy_version 279807 (0.0028) [2024-06-28 18:16:21,858][09423] Updated weights for policy 0, policy_version 279817 (0.0025) [2024-06-28 18:16:22,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42325.4, 300 sec: 42487.4). Total num frames: 4584538112. Throughput: 0: 42769.9. Samples: 863457400. Policy #0 lag: (min: 1.0, avg: 10.8, max: 20.0) [2024-06-28 18:16:22,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 18:16:26,270][09423] Updated weights for policy 0, policy_version 279827 (0.0029) [2024-06-28 18:16:27,922][09190] Fps is (10 sec: 44236.8, 60 sec: 42871.4, 300 sec: 42598.4). Total num frames: 4584767488. Throughput: 0: 42670.3. Samples: 863588960. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 18:16:27,922][09190] Avg episode reward: [(0, '0.735')] [2024-06-28 18:16:29,620][09423] Updated weights for policy 0, policy_version 279837 (0.0038) [2024-06-28 18:16:32,921][09190] Fps is (10 sec: 42598.2, 60 sec: 42598.4, 300 sec: 42487.3). Total num frames: 4584964096. Throughput: 0: 42658.4. Samples: 863841560. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 18:16:32,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 18:16:33,670][09423] Updated weights for policy 0, policy_version 279847 (0.0029) [2024-06-28 18:16:37,490][09423] Updated weights for policy 0, policy_version 279857 (0.0028) [2024-06-28 18:16:37,921][09190] Fps is (10 sec: 44237.6, 60 sec: 42871.6, 300 sec: 42598.4). Total num frames: 4585209856. Throughput: 0: 42649.0. Samples: 864097660. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 18:16:37,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 18:16:41,107][09423] Updated weights for policy 0, policy_version 279867 (0.0030) [2024-06-28 18:16:42,921][09190] Fps is (10 sec: 42598.1, 60 sec: 42599.5, 300 sec: 42598.4). Total num frames: 4585390080. Throughput: 0: 42668.1. Samples: 864229580. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 18:16:42,923][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:16:45,010][09423] Updated weights for policy 0, policy_version 279877 (0.0031) [2024-06-28 18:16:47,921][09190] Fps is (10 sec: 39321.5, 60 sec: 42599.2, 300 sec: 42487.3). Total num frames: 4585603072. Throughput: 0: 42651.9. Samples: 864483080. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 18:16:47,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 18:16:48,773][09423] Updated weights for policy 0, policy_version 279887 (0.0034) [2024-06-28 18:16:52,401][09423] Updated weights for policy 0, policy_version 279897 (0.0032) [2024-06-28 18:16:52,921][09190] Fps is (10 sec: 45875.6, 60 sec: 43144.6, 300 sec: 42598.8). Total num frames: 4585848832. Throughput: 0: 42622.7. Samples: 864733200. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 18:16:52,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 18:16:56,995][09423] Updated weights for policy 0, policy_version 279907 (0.0035) [2024-06-28 18:16:57,921][09190] Fps is (10 sec: 40959.8, 60 sec: 42054.0, 300 sec: 42487.3). Total num frames: 4586012672. Throughput: 0: 42726.6. Samples: 864865820. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 18:16:57,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 18:17:00,319][09423] Updated weights for policy 0, policy_version 279917 (0.0036) [2024-06-28 18:17:02,921][09190] Fps is (10 sec: 39321.0, 60 sec: 42598.3, 300 sec: 42431.8). Total num frames: 4586242048. Throughput: 0: 42693.3. Samples: 865118720. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 18:17:02,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 18:17:04,427][09423] Updated weights for policy 0, policy_version 279927 (0.0037) [2024-06-28 18:17:07,922][09190] Fps is (10 sec: 45874.7, 60 sec: 42871.4, 300 sec: 42598.4). Total num frames: 4586471424. Throughput: 0: 42614.0. Samples: 865375040. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 18:17:07,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 18:17:07,996][09423] Updated weights for policy 0, policy_version 279937 (0.0038) [2024-06-28 18:17:11,825][09423] Updated weights for policy 0, policy_version 279947 (0.0031) [2024-06-28 18:17:12,924][09190] Fps is (10 sec: 42588.1, 60 sec: 42596.6, 300 sec: 42542.5). Total num frames: 4586668032. Throughput: 0: 42522.6. Samples: 865502580. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 18:17:12,924][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 18:17:15,447][09423] Updated weights for policy 0, policy_version 279957 (0.0030) [2024-06-28 18:17:17,921][09190] Fps is (10 sec: 40960.7, 60 sec: 42598.5, 300 sec: 42431.8). Total num frames: 4586881024. Throughput: 0: 42592.9. Samples: 865758240. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 18:17:17,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 18:17:17,929][09190] No heartbeat for components: RolloutWorker_w20 (4176 seconds) [2024-06-28 18:17:20,594][09423] Updated weights for policy 0, policy_version 279967 (0.0038) [2024-06-28 18:17:22,921][09190] Fps is (10 sec: 45887.1, 60 sec: 43144.5, 300 sec: 42709.5). Total num frames: 4587126784. Throughput: 0: 42327.6. Samples: 866002400. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 18:17:22,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 18:17:22,997][09423] Updated weights for policy 0, policy_version 279977 (0.0033) [2024-06-28 18:17:24,910][09403] Signal inference workers to stop experience collection... (11900 times) [2024-06-28 18:17:24,910][09403] Signal inference workers to resume experience collection... (11900 times) [2024-06-28 18:17:24,950][09423] InferenceWorker_p0-w0: stopping experience collection (11900 times) [2024-06-28 18:17:24,951][09423] InferenceWorker_p0-w0: resuming experience collection (11900 times) [2024-06-28 18:17:27,921][09190] Fps is (10 sec: 39321.6, 60 sec: 41779.3, 300 sec: 42542.9). Total num frames: 4587274240. Throughput: 0: 42301.4. Samples: 866133140. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 18:17:27,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 18:17:28,182][09423] Updated weights for policy 0, policy_version 279987 (0.0035) [2024-06-28 18:17:31,155][09423] Updated weights for policy 0, policy_version 279997 (0.0032) [2024-06-28 18:17:32,921][09190] Fps is (10 sec: 39321.1, 60 sec: 42598.3, 300 sec: 42487.7). Total num frames: 4587520000. Throughput: 0: 42290.6. Samples: 866386160. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 18:17:32,924][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:17:36,258][09423] Updated weights for policy 0, policy_version 280007 (0.0025) [2024-06-28 18:17:37,924][09190] Fps is (10 sec: 49139.4, 60 sec: 42596.6, 300 sec: 42653.6). Total num frames: 4587765760. Throughput: 0: 42402.0. Samples: 866641400. Policy #0 lag: (min: 0.0, avg: 8.8, max: 21.0) [2024-06-28 18:17:37,925][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:17:38,924][09423] Updated weights for policy 0, policy_version 280017 (0.0039) [2024-06-28 18:17:42,922][09190] Fps is (10 sec: 40959.7, 60 sec: 42325.3, 300 sec: 42598.4). Total num frames: 4587929600. Throughput: 0: 42445.2. Samples: 866775860. Policy #0 lag: (min: 0.0, avg: 12.7, max: 24.0) [2024-06-28 18:17:42,922][09190] Avg episode reward: [(0, '0.727')] [2024-06-28 18:17:43,731][09423] Updated weights for policy 0, policy_version 280027 (0.0039) [2024-06-28 18:17:46,578][09423] Updated weights for policy 0, policy_version 280037 (0.0036) [2024-06-28 18:17:47,921][09190] Fps is (10 sec: 39331.5, 60 sec: 42598.4, 300 sec: 42487.3). Total num frames: 4588158976. Throughput: 0: 42424.5. Samples: 867027820. Policy #0 lag: (min: 0.0, avg: 12.7, max: 24.0) [2024-06-28 18:17:47,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 18:17:51,300][09423] Updated weights for policy 0, policy_version 280047 (0.0031) [2024-06-28 18:17:52,921][09190] Fps is (10 sec: 47514.6, 60 sec: 42598.4, 300 sec: 42709.5). Total num frames: 4588404736. Throughput: 0: 42362.9. Samples: 867281360. Policy #0 lag: (min: 0.0, avg: 12.7, max: 24.0) [2024-06-28 18:17:52,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 18:17:54,261][09423] Updated weights for policy 0, policy_version 280057 (0.0027) [2024-06-28 18:17:57,922][09190] Fps is (10 sec: 37683.0, 60 sec: 42052.2, 300 sec: 42487.3). Total num frames: 4588535808. Throughput: 0: 42478.8. Samples: 867414020. Policy #0 lag: (min: 0.0, avg: 12.7, max: 24.0) [2024-06-28 18:17:57,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 18:17:58,985][09423] Updated weights for policy 0, policy_version 280067 (0.0032) [2024-06-28 18:18:02,505][09423] Updated weights for policy 0, policy_version 280077 (0.0034) [2024-06-28 18:18:02,921][09190] Fps is (10 sec: 40959.8, 60 sec: 42871.6, 300 sec: 42542.9). Total num frames: 4588814336. Throughput: 0: 42340.0. Samples: 867663540. Policy #0 lag: (min: 0.0, avg: 12.7, max: 24.0) [2024-06-28 18:18:02,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 18:18:06,763][09423] Updated weights for policy 0, policy_version 280087 (0.0028) [2024-06-28 18:18:07,922][09190] Fps is (10 sec: 50790.0, 60 sec: 42871.5, 300 sec: 42709.5). Total num frames: 4589043712. Throughput: 0: 42460.2. Samples: 867913120. Policy #0 lag: (min: 0.0, avg: 12.7, max: 24.0) [2024-06-28 18:18:07,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 18:18:09,673][09423] Updated weights for policy 0, policy_version 280097 (0.0026) [2024-06-28 18:18:12,928][09190] Fps is (10 sec: 37658.5, 60 sec: 42049.5, 300 sec: 42542.3). Total num frames: 4589191168. Throughput: 0: 42574.7. Samples: 868049280. Policy #0 lag: (min: 0.0, avg: 12.7, max: 24.0) [2024-06-28 18:18:12,928][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 18:18:14,139][09423] Updated weights for policy 0, policy_version 280107 (0.0030) [2024-06-28 18:18:17,432][09423] Updated weights for policy 0, policy_version 280117 (0.0024) [2024-06-28 18:18:17,921][09190] Fps is (10 sec: 39322.1, 60 sec: 42598.4, 300 sec: 42487.3). Total num frames: 4589436928. Throughput: 0: 42661.4. Samples: 868305920. Policy #0 lag: (min: 0.0, avg: 12.7, max: 24.0) [2024-06-28 18:18:17,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 18:18:17,946][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000280117_4589436928.pth... [2024-06-28 18:18:18,018][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000279493_4579213312.pth [2024-06-28 18:18:21,770][09403] Signal inference workers to stop experience collection... (11950 times) [2024-06-28 18:18:21,773][09403] Signal inference workers to resume experience collection... (11950 times) [2024-06-28 18:18:21,783][09423] Updated weights for policy 0, policy_version 280127 (0.0029) [2024-06-28 18:18:21,791][09423] InferenceWorker_p0-w0: stopping experience collection (11950 times) [2024-06-28 18:18:21,795][09423] InferenceWorker_p0-w0: resuming experience collection (11950 times) [2024-06-28 18:18:22,921][09190] Fps is (10 sec: 49183.8, 60 sec: 42598.3, 300 sec: 42653.9). Total num frames: 4589682688. Throughput: 0: 42498.3. Samples: 868553720. Policy #0 lag: (min: 0.0, avg: 12.7, max: 24.0) [2024-06-28 18:18:22,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 18:18:25,243][09423] Updated weights for policy 0, policy_version 280137 (0.0035) [2024-06-28 18:18:27,921][09190] Fps is (10 sec: 39321.5, 60 sec: 42598.3, 300 sec: 42542.9). Total num frames: 4589830144. Throughput: 0: 42446.7. Samples: 868685960. Policy #0 lag: (min: 0.0, avg: 12.7, max: 24.0) [2024-06-28 18:18:27,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 18:18:29,396][09423] Updated weights for policy 0, policy_version 280147 (0.0043) [2024-06-28 18:18:32,711][09423] Updated weights for policy 0, policy_version 280157 (0.0026) [2024-06-28 18:18:32,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42871.5, 300 sec: 42542.9). Total num frames: 4590092288. Throughput: 0: 42504.0. Samples: 868940500. Policy #0 lag: (min: 0.0, avg: 12.7, max: 24.0) [2024-06-28 18:18:32,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 18:18:37,080][09423] Updated weights for policy 0, policy_version 280167 (0.0039) [2024-06-28 18:18:37,921][09190] Fps is (10 sec: 47513.8, 60 sec: 42327.1, 300 sec: 42598.4). Total num frames: 4590305280. Throughput: 0: 42532.4. Samples: 869195320. Policy #0 lag: (min: 0.0, avg: 12.7, max: 24.0) [2024-06-28 18:18:37,922][09190] Avg episode reward: [(0, '0.731')] [2024-06-28 18:18:40,083][09423] Updated weights for policy 0, policy_version 280177 (0.0037) [2024-06-28 18:18:42,921][09190] Fps is (10 sec: 36044.7, 60 sec: 42052.3, 300 sec: 42487.3). Total num frames: 4590452736. Throughput: 0: 42297.8. Samples: 869317420. Policy #0 lag: (min: 0.0, avg: 12.7, max: 24.0) [2024-06-28 18:18:42,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 18:18:44,793][09423] Updated weights for policy 0, policy_version 280187 (0.0034) [2024-06-28 18:18:47,921][09190] Fps is (10 sec: 40960.3, 60 sec: 42598.4, 300 sec: 42487.3). Total num frames: 4590714880. Throughput: 0: 42491.1. Samples: 869575640. Policy #0 lag: (min: 0.0, avg: 12.7, max: 24.0) [2024-06-28 18:18:47,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 18:18:48,193][09423] Updated weights for policy 0, policy_version 280197 (0.0029) [2024-06-28 18:18:52,038][09423] Updated weights for policy 0, policy_version 280207 (0.0030) [2024-06-28 18:18:52,922][09190] Fps is (10 sec: 50789.8, 60 sec: 42598.3, 300 sec: 42653.9). Total num frames: 4590960640. Throughput: 0: 42556.5. Samples: 869828160. Policy #0 lag: (min: 0.0, avg: 7.4, max: 21.0) [2024-06-28 18:18:52,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:18:56,989][09423] Updated weights for policy 0, policy_version 280217 (0.0035) [2024-06-28 18:18:57,921][09190] Fps is (10 sec: 39321.1, 60 sec: 42871.5, 300 sec: 42487.3). Total num frames: 4591108096. Throughput: 0: 42567.0. Samples: 869964520. Policy #0 lag: (min: 0.0, avg: 7.4, max: 21.0) [2024-06-28 18:18:57,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 18:18:59,754][09423] Updated weights for policy 0, policy_version 280227 (0.0031) [2024-06-28 18:19:02,921][09190] Fps is (10 sec: 40960.6, 60 sec: 42598.4, 300 sec: 42542.9). Total num frames: 4591370240. Throughput: 0: 42480.9. Samples: 870217560. Policy #0 lag: (min: 0.0, avg: 7.4, max: 21.0) [2024-06-28 18:19:02,922][09190] Avg episode reward: [(0, '0.731')] [2024-06-28 18:19:04,627][09423] Updated weights for policy 0, policy_version 280237 (0.0025) [2024-06-28 18:19:07,347][09423] Updated weights for policy 0, policy_version 280247 (0.0036) [2024-06-28 18:19:07,921][09190] Fps is (10 sec: 47514.3, 60 sec: 42325.5, 300 sec: 42598.4). Total num frames: 4591583232. Throughput: 0: 42751.7. Samples: 870477540. Policy #0 lag: (min: 0.0, avg: 7.4, max: 21.0) [2024-06-28 18:19:07,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 18:19:12,057][09423] Updated weights for policy 0, policy_version 280257 (0.0033) [2024-06-28 18:19:12,921][09190] Fps is (10 sec: 37683.1, 60 sec: 42603.0, 300 sec: 42487.3). Total num frames: 4591747072. Throughput: 0: 42829.4. Samples: 870613280. Policy #0 lag: (min: 0.0, avg: 7.4, max: 21.0) [2024-06-28 18:19:12,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 18:19:15,111][09423] Updated weights for policy 0, policy_version 280267 (0.0032) [2024-06-28 18:19:17,921][09190] Fps is (10 sec: 40959.4, 60 sec: 42598.3, 300 sec: 42487.7). Total num frames: 4591992832. Throughput: 0: 42576.8. Samples: 870856460. Policy #0 lag: (min: 0.0, avg: 7.4, max: 21.0) [2024-06-28 18:19:17,922][09190] Avg episode reward: [(0, '0.738')] [2024-06-28 18:19:19,979][09423] Updated weights for policy 0, policy_version 280277 (0.0034) [2024-06-28 18:19:22,753][09423] Updated weights for policy 0, policy_version 280287 (0.0033) [2024-06-28 18:19:22,921][09190] Fps is (10 sec: 47513.5, 60 sec: 42325.4, 300 sec: 42598.4). Total num frames: 4592222208. Throughput: 0: 42539.1. Samples: 871109580. Policy #0 lag: (min: 0.0, avg: 7.4, max: 21.0) [2024-06-28 18:19:22,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 18:19:27,644][09423] Updated weights for policy 0, policy_version 280297 (0.0031) [2024-06-28 18:19:27,921][09190] Fps is (10 sec: 39321.8, 60 sec: 42598.4, 300 sec: 42431.8). Total num frames: 4592386048. Throughput: 0: 42760.0. Samples: 871241620. Policy #0 lag: (min: 0.0, avg: 7.4, max: 21.0) [2024-06-28 18:19:27,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 18:19:30,100][09403] Signal inference workers to stop experience collection... (12000 times) [2024-06-28 18:19:30,107][09403] Signal inference workers to resume experience collection... (12000 times) [2024-06-28 18:19:30,130][09423] InferenceWorker_p0-w0: stopping experience collection (12000 times) [2024-06-28 18:19:30,131][09423] InferenceWorker_p0-w0: resuming experience collection (12000 times) [2024-06-28 18:19:30,251][09423] Updated weights for policy 0, policy_version 280307 (0.0034) [2024-06-28 18:19:32,922][09190] Fps is (10 sec: 40959.6, 60 sec: 42325.3, 300 sec: 42488.2). Total num frames: 4592631808. Throughput: 0: 42585.2. Samples: 871491980. Policy #0 lag: (min: 0.0, avg: 7.4, max: 21.0) [2024-06-28 18:19:32,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 18:19:35,322][09423] Updated weights for policy 0, policy_version 280317 (0.0029) [2024-06-28 18:19:37,921][09190] Fps is (10 sec: 47513.5, 60 sec: 42598.4, 300 sec: 42653.9). Total num frames: 4592861184. Throughput: 0: 42653.9. Samples: 871747580. Policy #0 lag: (min: 0.0, avg: 7.4, max: 21.0) [2024-06-28 18:19:37,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 18:19:38,049][09423] Updated weights for policy 0, policy_version 280327 (0.0039) [2024-06-28 18:19:42,921][09190] Fps is (10 sec: 39322.3, 60 sec: 42871.5, 300 sec: 42487.3). Total num frames: 4593025024. Throughput: 0: 42607.2. Samples: 871881840. Policy #0 lag: (min: 0.0, avg: 7.4, max: 21.0) [2024-06-28 18:19:42,922][09190] Avg episode reward: [(0, '0.735')] [2024-06-28 18:19:43,066][09423] Updated weights for policy 0, policy_version 280337 (0.0047) [2024-06-28 18:19:45,531][09423] Updated weights for policy 0, policy_version 280347 (0.0027) [2024-06-28 18:19:47,924][09190] Fps is (10 sec: 40949.8, 60 sec: 42596.6, 300 sec: 42542.5). Total num frames: 4593270784. Throughput: 0: 42470.9. Samples: 872128860. Policy #0 lag: (min: 0.0, avg: 7.4, max: 21.0) [2024-06-28 18:19:47,925][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 18:19:50,467][09423] Updated weights for policy 0, policy_version 280357 (0.0029) [2024-06-28 18:19:52,921][09190] Fps is (10 sec: 47512.8, 60 sec: 42325.3, 300 sec: 42653.9). Total num frames: 4593500160. Throughput: 0: 42315.0. Samples: 872381720. Policy #0 lag: (min: 0.0, avg: 7.4, max: 21.0) [2024-06-28 18:19:52,922][09190] Avg episode reward: [(0, '0.738')] [2024-06-28 18:19:53,426][09423] Updated weights for policy 0, policy_version 280367 (0.0031) [2024-06-28 18:19:57,921][09190] Fps is (10 sec: 39331.5, 60 sec: 42598.4, 300 sec: 42487.3). Total num frames: 4593664000. Throughput: 0: 42312.4. Samples: 872517340. Policy #0 lag: (min: 0.0, avg: 7.4, max: 21.0) [2024-06-28 18:19:57,924][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 18:19:58,256][09423] Updated weights for policy 0, policy_version 280377 (0.0031) [2024-06-28 18:20:01,064][09423] Updated weights for policy 0, policy_version 280387 (0.0028) [2024-06-28 18:20:02,921][09190] Fps is (10 sec: 40960.6, 60 sec: 42325.3, 300 sec: 42487.3). Total num frames: 4593909760. Throughput: 0: 42392.1. Samples: 872764100. Policy #0 lag: (min: 0.0, avg: 7.4, max: 21.0) [2024-06-28 18:20:02,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 18:20:05,639][09423] Updated weights for policy 0, policy_version 280397 (0.0025) [2024-06-28 18:20:07,922][09190] Fps is (10 sec: 49151.4, 60 sec: 42871.3, 300 sec: 42653.9). Total num frames: 4594155520. Throughput: 0: 42684.8. Samples: 873030400. Policy #0 lag: (min: 0.0, avg: 9.8, max: 27.0) [2024-06-28 18:20:07,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 18:20:08,591][09423] Updated weights for policy 0, policy_version 280407 (0.0031) [2024-06-28 18:20:12,921][09190] Fps is (10 sec: 40959.7, 60 sec: 42871.4, 300 sec: 42487.3). Total num frames: 4594319360. Throughput: 0: 42683.1. Samples: 873162360. Policy #0 lag: (min: 0.0, avg: 9.8, max: 27.0) [2024-06-28 18:20:12,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 18:20:13,473][09423] Updated weights for policy 0, policy_version 280417 (0.0026) [2024-06-28 18:20:16,114][09423] Updated weights for policy 0, policy_version 280427 (0.0032) [2024-06-28 18:20:17,922][09190] Fps is (10 sec: 37683.2, 60 sec: 42325.3, 300 sec: 42487.3). Total num frames: 4594532352. Throughput: 0: 42580.4. Samples: 873408100. Policy #0 lag: (min: 0.0, avg: 9.8, max: 27.0) [2024-06-28 18:20:17,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 18:20:17,935][09190] No heartbeat for components: RolloutWorker_w20 (4356 seconds) [2024-06-28 18:20:17,935][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000280428_4594532352.pth... [2024-06-28 18:20:17,989][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000279805_4584325120.pth [2024-06-28 18:20:21,125][09423] Updated weights for policy 0, policy_version 280437 (0.0032) [2024-06-28 18:20:22,921][09190] Fps is (10 sec: 47514.0, 60 sec: 42871.5, 300 sec: 42709.5). Total num frames: 4594794496. Throughput: 0: 42695.6. Samples: 873668880. Policy #0 lag: (min: 0.0, avg: 9.8, max: 27.0) [2024-06-28 18:20:22,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 18:20:23,538][09423] Updated weights for policy 0, policy_version 280447 (0.0025) [2024-06-28 18:20:27,921][09190] Fps is (10 sec: 40960.8, 60 sec: 42598.5, 300 sec: 42487.3). Total num frames: 4594941952. Throughput: 0: 42739.1. Samples: 873805100. Policy #0 lag: (min: 0.0, avg: 9.8, max: 27.0) [2024-06-28 18:20:27,922][09190] Avg episode reward: [(0, '0.728')] [2024-06-28 18:20:28,848][09423] Updated weights for policy 0, policy_version 280457 (0.0034) [2024-06-28 18:20:31,526][09423] Updated weights for policy 0, policy_version 280467 (0.0029) [2024-06-28 18:20:32,922][09190] Fps is (10 sec: 37682.8, 60 sec: 42325.4, 300 sec: 42487.3). Total num frames: 4595171328. Throughput: 0: 42571.7. Samples: 874044480. Policy #0 lag: (min: 0.0, avg: 9.8, max: 27.0) [2024-06-28 18:20:32,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 18:20:36,658][09423] Updated weights for policy 0, policy_version 280477 (0.0036) [2024-06-28 18:20:37,921][09190] Fps is (10 sec: 45874.9, 60 sec: 42325.4, 300 sec: 42598.6). Total num frames: 4595400704. Throughput: 0: 42736.5. Samples: 874304860. Policy #0 lag: (min: 0.0, avg: 9.8, max: 27.0) [2024-06-28 18:20:37,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 18:20:39,100][09423] Updated weights for policy 0, policy_version 280487 (0.0025) [2024-06-28 18:20:42,499][09403] Signal inference workers to stop experience collection... (12050 times) [2024-06-28 18:20:42,501][09403] Signal inference workers to resume experience collection... (12050 times) [2024-06-28 18:20:42,532][09423] InferenceWorker_p0-w0: stopping experience collection (12050 times) [2024-06-28 18:20:42,532][09423] InferenceWorker_p0-w0: resuming experience collection (12050 times) [2024-06-28 18:20:42,925][09190] Fps is (10 sec: 40946.4, 60 sec: 42595.9, 300 sec: 42487.0). Total num frames: 4595580928. Throughput: 0: 42584.8. Samples: 874433800. Policy #0 lag: (min: 0.0, avg: 9.8, max: 27.0) [2024-06-28 18:20:42,925][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:20:44,302][09423] Updated weights for policy 0, policy_version 280497 (0.0033) [2024-06-28 18:20:47,921][09190] Fps is (10 sec: 40959.7, 60 sec: 42327.1, 300 sec: 42542.9). Total num frames: 4595810304. Throughput: 0: 42527.5. Samples: 874677840. Policy #0 lag: (min: 0.0, avg: 9.8, max: 27.0) [2024-06-28 18:20:47,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 18:20:48,708][09423] Updated weights for policy 0, policy_version 280507 (0.0031) [2024-06-28 18:20:51,971][09423] Updated weights for policy 0, policy_version 280517 (0.0029) [2024-06-28 18:20:52,921][09190] Fps is (10 sec: 45890.4, 60 sec: 42325.4, 300 sec: 42543.2). Total num frames: 4596039680. Throughput: 0: 42368.5. Samples: 874936980. Policy #0 lag: (min: 0.0, avg: 9.8, max: 27.0) [2024-06-28 18:20:52,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 18:20:56,165][09423] Updated weights for policy 0, policy_version 280527 (0.0033) [2024-06-28 18:20:57,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42871.4, 300 sec: 42542.8). Total num frames: 4596236288. Throughput: 0: 42513.3. Samples: 875075460. Policy #0 lag: (min: 0.0, avg: 9.8, max: 27.0) [2024-06-28 18:20:57,922][09190] Avg episode reward: [(0, '0.731')] [2024-06-28 18:20:59,343][09423] Updated weights for policy 0, policy_version 280537 (0.0026) [2024-06-28 18:21:02,921][09190] Fps is (10 sec: 40960.5, 60 sec: 42325.3, 300 sec: 42542.9). Total num frames: 4596449280. Throughput: 0: 42529.5. Samples: 875321920. Policy #0 lag: (min: 0.0, avg: 9.8, max: 27.0) [2024-06-28 18:21:02,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 18:21:03,800][09423] Updated weights for policy 0, policy_version 280547 (0.0029) [2024-06-28 18:21:06,933][09423] Updated weights for policy 0, policy_version 280557 (0.0029) [2024-06-28 18:21:07,922][09190] Fps is (10 sec: 47513.4, 60 sec: 42598.4, 300 sec: 42709.5). Total num frames: 4596711424. Throughput: 0: 42412.3. Samples: 875577440. Policy #0 lag: (min: 0.0, avg: 9.8, max: 27.0) [2024-06-28 18:21:07,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:21:11,299][09423] Updated weights for policy 0, policy_version 280567 (0.0029) [2024-06-28 18:21:12,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42325.4, 300 sec: 42487.3). Total num frames: 4596858880. Throughput: 0: 42511.1. Samples: 875718100. Policy #0 lag: (min: 0.0, avg: 9.8, max: 27.0) [2024-06-28 18:21:12,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 18:21:14,563][09423] Updated weights for policy 0, policy_version 280577 (0.0030) [2024-06-28 18:21:17,924][09190] Fps is (10 sec: 37674.0, 60 sec: 42596.7, 300 sec: 42542.5). Total num frames: 4597088256. Throughput: 0: 42517.2. Samples: 875957860. Policy #0 lag: (min: 0.0, avg: 12.5, max: 20.0) [2024-06-28 18:21:17,925][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 18:21:19,269][09423] Updated weights for policy 0, policy_version 280587 (0.0029) [2024-06-28 18:21:22,454][09423] Updated weights for policy 0, policy_version 280597 (0.0027) [2024-06-28 18:21:22,922][09190] Fps is (10 sec: 47512.8, 60 sec: 42325.2, 300 sec: 42598.4). Total num frames: 4597334016. Throughput: 0: 42502.1. Samples: 876217460. Policy #0 lag: (min: 0.0, avg: 12.5, max: 20.0) [2024-06-28 18:21:22,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 18:21:27,062][09423] Updated weights for policy 0, policy_version 280607 (0.0026) [2024-06-28 18:21:27,922][09190] Fps is (10 sec: 42608.6, 60 sec: 42871.3, 300 sec: 42542.8). Total num frames: 4597514240. Throughput: 0: 42684.8. Samples: 876354480. Policy #0 lag: (min: 0.0, avg: 12.5, max: 20.0) [2024-06-28 18:21:27,922][09190] Avg episode reward: [(0, '0.734')] [2024-06-28 18:21:29,984][09423] Updated weights for policy 0, policy_version 280617 (0.0030) [2024-06-28 18:21:32,922][09190] Fps is (10 sec: 39321.7, 60 sec: 42598.4, 300 sec: 42431.8). Total num frames: 4597727232. Throughput: 0: 42704.9. Samples: 876599560. Policy #0 lag: (min: 0.0, avg: 12.5, max: 20.0) [2024-06-28 18:21:32,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 18:21:34,374][09403] Signal inference workers to stop experience collection... (12100 times) [2024-06-28 18:21:34,383][09403] Signal inference workers to resume experience collection... (12100 times) [2024-06-28 18:21:34,412][09423] InferenceWorker_p0-w0: stopping experience collection (12100 times) [2024-06-28 18:21:34,412][09423] InferenceWorker_p0-w0: resuming experience collection (12100 times) [2024-06-28 18:21:34,518][09423] Updated weights for policy 0, policy_version 280627 (0.0038) [2024-06-28 18:21:37,571][09423] Updated weights for policy 0, policy_version 280637 (0.0029) [2024-06-28 18:21:37,921][09190] Fps is (10 sec: 45875.8, 60 sec: 42871.5, 300 sec: 42653.9). Total num frames: 4597972992. Throughput: 0: 42608.0. Samples: 876854340. Policy #0 lag: (min: 0.0, avg: 12.5, max: 20.0) [2024-06-28 18:21:37,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 18:21:42,081][09423] Updated weights for policy 0, policy_version 280647 (0.0032) [2024-06-28 18:21:42,921][09190] Fps is (10 sec: 40960.3, 60 sec: 42600.8, 300 sec: 42487.3). Total num frames: 4598136832. Throughput: 0: 42563.6. Samples: 876990820. Policy #0 lag: (min: 0.0, avg: 12.5, max: 20.0) [2024-06-28 18:21:42,924][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 18:21:44,950][09423] Updated weights for policy 0, policy_version 280657 (0.0033) [2024-06-28 18:21:47,922][09190] Fps is (10 sec: 39321.2, 60 sec: 42598.4, 300 sec: 42431.8). Total num frames: 4598366208. Throughput: 0: 42559.4. Samples: 877237100. Policy #0 lag: (min: 0.0, avg: 12.5, max: 20.0) [2024-06-28 18:21:47,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 18:21:49,981][09423] Updated weights for policy 0, policy_version 280667 (0.0028) [2024-06-28 18:21:52,627][09423] Updated weights for policy 0, policy_version 280677 (0.0029) [2024-06-28 18:21:52,922][09190] Fps is (10 sec: 49151.6, 60 sec: 43144.5, 300 sec: 42765.0). Total num frames: 4598628352. Throughput: 0: 42533.3. Samples: 877491440. Policy #0 lag: (min: 0.0, avg: 12.5, max: 20.0) [2024-06-28 18:21:52,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 18:21:57,713][09423] Updated weights for policy 0, policy_version 280687 (0.0043) [2024-06-28 18:21:57,921][09190] Fps is (10 sec: 40960.5, 60 sec: 42325.4, 300 sec: 42487.3). Total num frames: 4598775808. Throughput: 0: 42436.4. Samples: 877627740. Policy #0 lag: (min: 0.0, avg: 12.5, max: 20.0) [2024-06-28 18:21:57,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 18:22:00,625][09423] Updated weights for policy 0, policy_version 280697 (0.0038) [2024-06-28 18:22:02,921][09190] Fps is (10 sec: 39322.1, 60 sec: 42871.4, 300 sec: 42542.9). Total num frames: 4599021568. Throughput: 0: 42465.5. Samples: 877868700. Policy #0 lag: (min: 0.0, avg: 12.5, max: 20.0) [2024-06-28 18:22:02,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 18:22:05,573][09423] Updated weights for policy 0, policy_version 280707 (0.0031) [2024-06-28 18:22:07,922][09190] Fps is (10 sec: 45874.7, 60 sec: 42052.3, 300 sec: 42598.7). Total num frames: 4599234560. Throughput: 0: 42492.9. Samples: 878129640. Policy #0 lag: (min: 0.0, avg: 12.5, max: 20.0) [2024-06-28 18:22:07,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 18:22:08,240][09423] Updated weights for policy 0, policy_version 280717 (0.0029) [2024-06-28 18:22:12,924][09190] Fps is (10 sec: 39311.7, 60 sec: 42596.6, 300 sec: 42487.0). Total num frames: 4599414784. Throughput: 0: 42425.8. Samples: 878263740. Policy #0 lag: (min: 0.0, avg: 12.5, max: 20.0) [2024-06-28 18:22:12,924][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 18:22:12,973][09423] Updated weights for policy 0, policy_version 280727 (0.0026) [2024-06-28 18:22:15,927][09423] Updated weights for policy 0, policy_version 280737 (0.0038) [2024-06-28 18:22:17,921][09190] Fps is (10 sec: 40960.7, 60 sec: 42600.3, 300 sec: 42431.8). Total num frames: 4599644160. Throughput: 0: 42533.5. Samples: 878513560. Policy #0 lag: (min: 0.0, avg: 12.5, max: 20.0) [2024-06-28 18:22:17,922][09190] Avg episode reward: [(0, '0.734')] [2024-06-28 18:22:17,941][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000280740_4599644160.pth... [2024-06-28 18:22:17,993][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000280117_4589436928.pth [2024-06-28 18:22:20,537][09423] Updated weights for policy 0, policy_version 280747 (0.0032) [2024-06-28 18:22:22,921][09190] Fps is (10 sec: 47525.7, 60 sec: 42598.5, 300 sec: 42765.0). Total num frames: 4599889920. Throughput: 0: 42606.7. Samples: 878771640. Policy #0 lag: (min: 0.0, avg: 12.5, max: 20.0) [2024-06-28 18:22:22,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 18:22:23,572][09423] Updated weights for policy 0, policy_version 280757 (0.0037) [2024-06-28 18:22:27,922][09190] Fps is (10 sec: 42597.5, 60 sec: 42598.4, 300 sec: 42542.8). Total num frames: 4600070144. Throughput: 0: 42616.8. Samples: 878908580. Policy #0 lag: (min: 0.0, avg: 12.5, max: 20.0) [2024-06-28 18:22:27,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 18:22:28,199][09423] Updated weights for policy 0, policy_version 280767 (0.0043) [2024-06-28 18:22:31,284][09423] Updated weights for policy 0, policy_version 280777 (0.0026) [2024-06-28 18:22:32,922][09190] Fps is (10 sec: 39321.0, 60 sec: 42598.4, 300 sec: 42432.1). Total num frames: 4600283136. Throughput: 0: 42660.5. Samples: 879156820. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-28 18:22:32,936][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 18:22:35,887][09423] Updated weights for policy 0, policy_version 280787 (0.0036) [2024-06-28 18:22:37,921][09190] Fps is (10 sec: 44237.7, 60 sec: 42325.4, 300 sec: 42654.0). Total num frames: 4600512512. Throughput: 0: 42657.9. Samples: 879411040. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-28 18:22:37,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 18:22:38,891][09423] Updated weights for policy 0, policy_version 280797 (0.0028) [2024-06-28 18:22:42,921][09190] Fps is (10 sec: 40960.7, 60 sec: 42598.5, 300 sec: 42487.3). Total num frames: 4600692736. Throughput: 0: 42575.2. Samples: 879543620. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-28 18:22:42,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 18:22:43,514][09423] Updated weights for policy 0, policy_version 280807 (0.0032) [2024-06-28 18:22:46,737][09423] Updated weights for policy 0, policy_version 280817 (0.0032) [2024-06-28 18:22:47,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42598.5, 300 sec: 42431.8). Total num frames: 4600922112. Throughput: 0: 42833.3. Samples: 879796200. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-28 18:22:47,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 18:22:50,770][09403] Signal inference workers to stop experience collection... (12150 times) [2024-06-28 18:22:50,814][09423] InferenceWorker_p0-w0: stopping experience collection (12150 times) [2024-06-28 18:22:50,821][09403] Signal inference workers to resume experience collection... (12150 times) [2024-06-28 18:22:50,832][09423] InferenceWorker_p0-w0: resuming experience collection (12150 times) [2024-06-28 18:22:50,955][09423] Updated weights for policy 0, policy_version 280827 (0.0033) [2024-06-28 18:22:52,921][09190] Fps is (10 sec: 45874.9, 60 sec: 42052.4, 300 sec: 42765.0). Total num frames: 4601151488. Throughput: 0: 42669.9. Samples: 880049780. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-28 18:22:52,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 18:22:54,125][09423] Updated weights for policy 0, policy_version 280837 (0.0039) [2024-06-28 18:22:57,921][09190] Fps is (10 sec: 42598.1, 60 sec: 42871.4, 300 sec: 42487.3). Total num frames: 4601348096. Throughput: 0: 42733.9. Samples: 880186660. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-28 18:22:57,922][09190] Avg episode reward: [(0, '0.738')] [2024-06-28 18:22:58,420][09423] Updated weights for policy 0, policy_version 280847 (0.0034) [2024-06-28 18:23:02,284][09423] Updated weights for policy 0, policy_version 280857 (0.0027) [2024-06-28 18:23:02,921][09190] Fps is (10 sec: 40959.8, 60 sec: 42325.3, 300 sec: 42431.8). Total num frames: 4601561088. Throughput: 0: 42775.5. Samples: 880438460. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-28 18:23:02,925][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 18:23:06,180][09423] Updated weights for policy 0, policy_version 280867 (0.0032) [2024-06-28 18:23:07,922][09190] Fps is (10 sec: 45874.7, 60 sec: 42871.4, 300 sec: 42765.9). Total num frames: 4601806848. Throughput: 0: 42714.9. Samples: 880693820. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-28 18:23:07,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 18:23:09,939][09423] Updated weights for policy 0, policy_version 280877 (0.0030) [2024-06-28 18:23:12,921][09190] Fps is (10 sec: 42598.8, 60 sec: 42873.3, 300 sec: 42542.9). Total num frames: 4601987072. Throughput: 0: 42681.5. Samples: 880829240. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-28 18:23:12,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:23:13,773][09423] Updated weights for policy 0, policy_version 280887 (0.0036) [2024-06-28 18:23:17,832][09423] Updated weights for policy 0, policy_version 280897 (0.0033) [2024-06-28 18:23:17,922][09190] Fps is (10 sec: 40960.2, 60 sec: 42871.4, 300 sec: 42487.3). Total num frames: 4602216448. Throughput: 0: 42692.9. Samples: 881078000. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-28 18:23:17,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 18:23:17,944][09190] No heartbeat for components: RolloutWorker_w20 (4536 seconds) [2024-06-28 18:23:21,622][09423] Updated weights for policy 0, policy_version 280907 (0.0022) [2024-06-28 18:23:22,921][09190] Fps is (10 sec: 44236.5, 60 sec: 42325.3, 300 sec: 42709.5). Total num frames: 4602429440. Throughput: 0: 42703.0. Samples: 881332680. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-28 18:23:22,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 18:23:25,487][09423] Updated weights for policy 0, policy_version 280917 (0.0038) [2024-06-28 18:23:27,924][09190] Fps is (10 sec: 40951.0, 60 sec: 42596.9, 300 sec: 42487.0). Total num frames: 4602626048. Throughput: 0: 42580.0. Samples: 881459820. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-28 18:23:27,924][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:23:29,509][09423] Updated weights for policy 0, policy_version 280927 (0.0033) [2024-06-28 18:23:32,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42598.5, 300 sec: 42487.3). Total num frames: 4602839040. Throughput: 0: 42492.9. Samples: 881708380. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-28 18:23:32,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 18:23:33,862][09423] Updated weights for policy 0, policy_version 280937 (0.0039) [2024-06-28 18:23:37,034][09423] Updated weights for policy 0, policy_version 280947 (0.0035) [2024-06-28 18:23:37,921][09190] Fps is (10 sec: 42608.1, 60 sec: 42325.3, 300 sec: 42709.5). Total num frames: 4603052032. Throughput: 0: 42662.6. Samples: 881969600. Policy #0 lag: (min: 0.0, avg: 9.0, max: 20.0) [2024-06-28 18:23:37,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 18:23:41,296][09423] Updated weights for policy 0, policy_version 280957 (0.0033) [2024-06-28 18:23:42,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42598.4, 300 sec: 42487.3). Total num frames: 4603248640. Throughput: 0: 42485.9. Samples: 882098520. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2024-06-28 18:23:42,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 18:23:44,611][09423] Updated weights for policy 0, policy_version 280967 (0.0029) [2024-06-28 18:23:47,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42598.3, 300 sec: 42431.8). Total num frames: 4603478016. Throughput: 0: 42442.7. Samples: 882348380. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2024-06-28 18:23:47,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:23:48,912][09423] Updated weights for policy 0, policy_version 280977 (0.0031) [2024-06-28 18:23:52,754][09423] Updated weights for policy 0, policy_version 280987 (0.0036) [2024-06-28 18:23:52,921][09190] Fps is (10 sec: 44236.5, 60 sec: 42325.3, 300 sec: 42654.0). Total num frames: 4603691008. Throughput: 0: 42450.8. Samples: 882604100. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2024-06-28 18:23:52,922][09190] Avg episode reward: [(0, '0.734')] [2024-06-28 18:23:56,983][09423] Updated weights for policy 0, policy_version 280997 (0.0035) [2024-06-28 18:23:57,921][09190] Fps is (10 sec: 42598.9, 60 sec: 42598.5, 300 sec: 42487.3). Total num frames: 4603904000. Throughput: 0: 42224.5. Samples: 882729340. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2024-06-28 18:23:57,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 18:24:00,139][09423] Updated weights for policy 0, policy_version 281007 (0.0036) [2024-06-28 18:24:02,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42598.5, 300 sec: 42487.3). Total num frames: 4604116992. Throughput: 0: 42298.4. Samples: 882981420. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2024-06-28 18:24:02,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:24:04,505][09423] Updated weights for policy 0, policy_version 281017 (0.0028) [2024-06-28 18:24:07,921][09190] Fps is (10 sec: 42598.1, 60 sec: 42052.4, 300 sec: 42653.9). Total num frames: 4604329984. Throughput: 0: 42313.4. Samples: 883236780. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2024-06-28 18:24:07,922][09190] Avg episode reward: [(0, '0.753')] [2024-06-28 18:24:07,950][09423] Updated weights for policy 0, policy_version 281027 (0.0042) [2024-06-28 18:24:12,134][09423] Updated weights for policy 0, policy_version 281037 (0.0031) [2024-06-28 18:24:12,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42325.3, 300 sec: 42487.3). Total num frames: 4604526592. Throughput: 0: 42409.3. Samples: 883368140. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2024-06-28 18:24:12,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 18:24:15,909][09423] Updated weights for policy 0, policy_version 281047 (0.0034) [2024-06-28 18:24:17,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42325.4, 300 sec: 42487.3). Total num frames: 4604755968. Throughput: 0: 42443.9. Samples: 883618360. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2024-06-28 18:24:17,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 18:24:17,942][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000281052_4604755968.pth... [2024-06-28 18:24:17,997][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000280428_4594532352.pth [2024-06-28 18:24:19,851][09423] Updated weights for policy 0, policy_version 281057 (0.0031) [2024-06-28 18:24:22,921][09190] Fps is (10 sec: 44236.7, 60 sec: 42325.4, 300 sec: 42653.9). Total num frames: 4604968960. Throughput: 0: 42334.3. Samples: 883874640. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2024-06-28 18:24:22,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 18:24:23,282][09423] Updated weights for policy 0, policy_version 281067 (0.0033) [2024-06-28 18:24:27,831][09423] Updated weights for policy 0, policy_version 281077 (0.0033) [2024-06-28 18:24:27,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42326.9, 300 sec: 42487.3). Total num frames: 4605165568. Throughput: 0: 42321.2. Samples: 884002980. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2024-06-28 18:24:27,922][09190] Avg episode reward: [(0, '0.734')] [2024-06-28 18:24:29,245][09403] Signal inference workers to stop experience collection... (12200 times) [2024-06-28 18:24:29,247][09403] Signal inference workers to resume experience collection... (12200 times) [2024-06-28 18:24:29,281][09423] InferenceWorker_p0-w0: stopping experience collection (12200 times) [2024-06-28 18:24:29,313][09423] InferenceWorker_p0-w0: resuming experience collection (12200 times) [2024-06-28 18:24:30,914][09423] Updated weights for policy 0, policy_version 281087 (0.0037) [2024-06-28 18:24:32,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42325.3, 300 sec: 42431.8). Total num frames: 4605378560. Throughput: 0: 42335.2. Samples: 884253460. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2024-06-28 18:24:32,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 18:24:35,361][09423] Updated weights for policy 0, policy_version 281097 (0.0028) [2024-06-28 18:24:37,921][09190] Fps is (10 sec: 42598.6, 60 sec: 42325.4, 300 sec: 42598.4). Total num frames: 4605591552. Throughput: 0: 42404.5. Samples: 884512300. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2024-06-28 18:24:37,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:24:38,728][09423] Updated weights for policy 0, policy_version 281107 (0.0034) [2024-06-28 18:24:42,921][09190] Fps is (10 sec: 42598.6, 60 sec: 42598.4, 300 sec: 42487.7). Total num frames: 4605804544. Throughput: 0: 42524.4. Samples: 884642940. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2024-06-28 18:24:42,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 18:24:42,950][09423] Updated weights for policy 0, policy_version 281117 (0.0033) [2024-06-28 18:24:46,309][09423] Updated weights for policy 0, policy_version 281127 (0.0035) [2024-06-28 18:24:47,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42325.4, 300 sec: 42431.8). Total num frames: 4606017536. Throughput: 0: 42495.5. Samples: 884893720. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2024-06-28 18:24:47,922][09190] Avg episode reward: [(0, '0.738')] [2024-06-28 18:24:50,869][09423] Updated weights for policy 0, policy_version 281137 (0.0032) [2024-06-28 18:24:52,922][09190] Fps is (10 sec: 42597.5, 60 sec: 42325.2, 300 sec: 42598.4). Total num frames: 4606230528. Throughput: 0: 42467.9. Samples: 885147840. Policy #0 lag: (min: 0.0, avg: 8.2, max: 20.0) [2024-06-28 18:24:52,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:24:54,328][09423] Updated weights for policy 0, policy_version 281147 (0.0023) [2024-06-28 18:24:57,921][09190] Fps is (10 sec: 42598.2, 60 sec: 42325.3, 300 sec: 42487.3). Total num frames: 4606443520. Throughput: 0: 42289.7. Samples: 885271180. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 18:24:57,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 18:24:58,717][09423] Updated weights for policy 0, policy_version 281157 (0.0038) [2024-06-28 18:25:01,857][09423] Updated weights for policy 0, policy_version 281167 (0.0033) [2024-06-28 18:25:02,921][09190] Fps is (10 sec: 42599.4, 60 sec: 42325.4, 300 sec: 42376.3). Total num frames: 4606656512. Throughput: 0: 42311.6. Samples: 885522380. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 18:25:02,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 18:25:06,444][09423] Updated weights for policy 0, policy_version 281177 (0.0032) [2024-06-28 18:25:07,921][09190] Fps is (10 sec: 40960.1, 60 sec: 42052.3, 300 sec: 42487.3). Total num frames: 4606853120. Throughput: 0: 42313.7. Samples: 885778760. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 18:25:07,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 18:25:09,770][09423] Updated weights for policy 0, policy_version 281187 (0.0033) [2024-06-28 18:25:12,921][09190] Fps is (10 sec: 40959.5, 60 sec: 42325.3, 300 sec: 42487.3). Total num frames: 4607066112. Throughput: 0: 42261.3. Samples: 885904740. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 18:25:12,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:25:14,193][09423] Updated weights for policy 0, policy_version 281197 (0.0030) [2024-06-28 18:25:17,680][09423] Updated weights for policy 0, policy_version 281207 (0.0032) [2024-06-28 18:25:17,921][09190] Fps is (10 sec: 44237.0, 60 sec: 42325.4, 300 sec: 42376.2). Total num frames: 4607295488. Throughput: 0: 42221.4. Samples: 886153420. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 18:25:17,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 18:25:21,742][09423] Updated weights for policy 0, policy_version 281217 (0.0027) [2024-06-28 18:25:22,921][09190] Fps is (10 sec: 44237.4, 60 sec: 42325.4, 300 sec: 42598.4). Total num frames: 4607508480. Throughput: 0: 42344.1. Samples: 886417780. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 18:25:22,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 18:25:25,131][09423] Updated weights for policy 0, policy_version 281227 (0.0035) [2024-06-28 18:25:27,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42325.4, 300 sec: 42487.3). Total num frames: 4607705088. Throughput: 0: 42267.1. Samples: 886544960. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 18:25:27,922][09190] Avg episode reward: [(0, '0.735')] [2024-06-28 18:25:29,751][09423] Updated weights for policy 0, policy_version 281237 (0.0031) [2024-06-28 18:25:32,921][09190] Fps is (10 sec: 42598.1, 60 sec: 42598.4, 300 sec: 42487.3). Total num frames: 4607934464. Throughput: 0: 42368.5. Samples: 886800300. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 18:25:32,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 18:25:33,026][09423] Updated weights for policy 0, policy_version 281247 (0.0033) [2024-06-28 18:25:37,164][09423] Updated weights for policy 0, policy_version 281257 (0.0031) [2024-06-28 18:25:37,921][09190] Fps is (10 sec: 45875.0, 60 sec: 42871.5, 300 sec: 42654.4). Total num frames: 4608163840. Throughput: 0: 42444.1. Samples: 887057820. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 18:25:37,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 18:25:40,344][09423] Updated weights for policy 0, policy_version 281267 (0.0024) [2024-06-28 18:25:42,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42325.3, 300 sec: 42487.3). Total num frames: 4608344064. Throughput: 0: 42480.1. Samples: 887182780. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 18:25:42,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:25:44,804][09423] Updated weights for policy 0, policy_version 281277 (0.0031) [2024-06-28 18:25:47,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42598.4, 300 sec: 42487.3). Total num frames: 4608573440. Throughput: 0: 42519.9. Samples: 887435780. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 18:25:47,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 18:25:48,146][09423] Updated weights for policy 0, policy_version 281287 (0.0030) [2024-06-28 18:25:52,690][09423] Updated weights for policy 0, policy_version 281297 (0.0027) [2024-06-28 18:25:52,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42325.5, 300 sec: 42487.3). Total num frames: 4608770048. Throughput: 0: 42476.9. Samples: 887690220. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 18:25:52,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:25:55,891][09423] Updated weights for policy 0, policy_version 281307 (0.0031) [2024-06-28 18:25:57,921][09190] Fps is (10 sec: 40960.4, 60 sec: 42325.4, 300 sec: 42487.3). Total num frames: 4608983040. Throughput: 0: 42517.0. Samples: 887818000. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 18:25:57,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:25:59,729][09403] Signal inference workers to stop experience collection... (12250 times) [2024-06-28 18:25:59,729][09403] Signal inference workers to resume experience collection... (12250 times) [2024-06-28 18:25:59,744][09423] InferenceWorker_p0-w0: stopping experience collection (12250 times) [2024-06-28 18:25:59,744][09423] InferenceWorker_p0-w0: resuming experience collection (12250 times) [2024-06-28 18:26:00,060][09423] Updated weights for policy 0, policy_version 281317 (0.0027) [2024-06-28 18:26:02,921][09190] Fps is (10 sec: 44236.8, 60 sec: 42598.4, 300 sec: 42376.3). Total num frames: 4609212416. Throughput: 0: 42708.5. Samples: 888075300. Policy #0 lag: (min: 0.0, avg: 9.6, max: 20.0) [2024-06-28 18:26:02,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 18:26:03,709][09423] Updated weights for policy 0, policy_version 281327 (0.0040) [2024-06-28 18:26:07,818][09423] Updated weights for policy 0, policy_version 281337 (0.0024) [2024-06-28 18:26:07,922][09190] Fps is (10 sec: 44235.9, 60 sec: 42871.4, 300 sec: 42598.4). Total num frames: 4609425408. Throughput: 0: 42413.6. Samples: 888326400. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 18:26:07,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:26:12,103][09423] Updated weights for policy 0, policy_version 281347 (0.0032) [2024-06-28 18:26:12,922][09190] Fps is (10 sec: 40959.1, 60 sec: 42598.3, 300 sec: 42487.7). Total num frames: 4609622016. Throughput: 0: 42371.8. Samples: 888451700. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 18:26:12,922][09190] Avg episode reward: [(0, '0.731')] [2024-06-28 18:26:15,530][09423] Updated weights for policy 0, policy_version 281357 (0.0029) [2024-06-28 18:26:17,921][09190] Fps is (10 sec: 42598.6, 60 sec: 42598.3, 300 sec: 42431.8). Total num frames: 4609851392. Throughput: 0: 42485.2. Samples: 888712140. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 18:26:17,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 18:26:17,929][09190] No heartbeat for components: RolloutWorker_w20 (4716 seconds) [2024-06-28 18:26:18,051][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000281364_4609867776.pth... [2024-06-28 18:26:18,106][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000280740_4599644160.pth [2024-06-28 18:26:19,544][09423] Updated weights for policy 0, policy_version 281367 (0.0034) [2024-06-28 18:26:22,921][09190] Fps is (10 sec: 44237.5, 60 sec: 42598.3, 300 sec: 42542.9). Total num frames: 4610064384. Throughput: 0: 42357.3. Samples: 888963900. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 18:26:22,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 18:26:23,634][09423] Updated weights for policy 0, policy_version 281377 (0.0030) [2024-06-28 18:26:27,395][09423] Updated weights for policy 0, policy_version 281387 (0.0030) [2024-06-28 18:26:27,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42598.3, 300 sec: 42487.3). Total num frames: 4610260992. Throughput: 0: 42399.0. Samples: 889090740. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 18:26:27,922][09190] Avg episode reward: [(0, '0.733')] [2024-06-28 18:26:31,159][09423] Updated weights for policy 0, policy_version 281397 (0.0025) [2024-06-28 18:26:32,922][09190] Fps is (10 sec: 44236.3, 60 sec: 42871.4, 300 sec: 42487.3). Total num frames: 4610506752. Throughput: 0: 42463.9. Samples: 889346660. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 18:26:32,922][09190] Avg episode reward: [(0, '0.733')] [2024-06-28 18:26:34,850][09423] Updated weights for policy 0, policy_version 281407 (0.0036) [2024-06-28 18:26:37,922][09190] Fps is (10 sec: 42598.1, 60 sec: 42052.2, 300 sec: 42542.8). Total num frames: 4610686976. Throughput: 0: 42558.1. Samples: 889605340. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 18:26:37,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 18:26:38,815][09423] Updated weights for policy 0, policy_version 281417 (0.0032) [2024-06-28 18:26:42,647][09423] Updated weights for policy 0, policy_version 281427 (0.0025) [2024-06-28 18:26:42,921][09190] Fps is (10 sec: 39322.2, 60 sec: 42598.4, 300 sec: 42487.3). Total num frames: 4610899968. Throughput: 0: 42306.2. Samples: 889721780. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 18:26:42,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 18:26:46,472][09423] Updated weights for policy 0, policy_version 281437 (0.0031) [2024-06-28 18:26:47,921][09190] Fps is (10 sec: 44237.3, 60 sec: 42598.4, 300 sec: 42376.3). Total num frames: 4611129344. Throughput: 0: 42407.0. Samples: 889983620. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 18:26:47,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 18:26:50,754][09423] Updated weights for policy 0, policy_version 281447 (0.0031) [2024-06-28 18:26:52,921][09190] Fps is (10 sec: 40959.8, 60 sec: 42325.3, 300 sec: 42487.3). Total num frames: 4611309568. Throughput: 0: 42537.5. Samples: 890240580. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 18:26:52,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 18:26:54,213][09423] Updated weights for policy 0, policy_version 281457 (0.0028) [2024-06-28 18:26:57,924][09190] Fps is (10 sec: 39312.0, 60 sec: 42323.5, 300 sec: 42375.9). Total num frames: 4611522560. Throughput: 0: 42584.0. Samples: 890368080. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 18:26:57,924][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 18:26:58,440][09423] Updated weights for policy 0, policy_version 281467 (0.0039) [2024-06-28 18:27:01,843][09423] Updated weights for policy 0, policy_version 281477 (0.0028) [2024-06-28 18:27:02,921][09190] Fps is (10 sec: 44237.1, 60 sec: 42325.3, 300 sec: 42431.8). Total num frames: 4611751936. Throughput: 0: 42347.2. Samples: 890617760. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 18:27:02,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 18:27:05,876][09423] Updated weights for policy 0, policy_version 281487 (0.0030) [2024-06-28 18:27:07,921][09190] Fps is (10 sec: 42609.1, 60 sec: 42052.4, 300 sec: 42487.7). Total num frames: 4611948544. Throughput: 0: 42397.8. Samples: 890871800. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 18:27:07,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 18:27:09,847][09423] Updated weights for policy 0, policy_version 281497 (0.0030) [2024-06-28 18:27:12,922][09190] Fps is (10 sec: 40959.4, 60 sec: 42325.4, 300 sec: 42431.8). Total num frames: 4612161536. Throughput: 0: 42384.0. Samples: 890998020. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 18:27:12,927][09190] Avg episode reward: [(0, '0.730')] [2024-06-28 18:27:13,311][09423] Updated weights for policy 0, policy_version 281507 (0.0033) [2024-06-28 18:27:17,569][09423] Updated weights for policy 0, policy_version 281517 (0.0034) [2024-06-28 18:27:17,922][09190] Fps is (10 sec: 44236.0, 60 sec: 42325.3, 300 sec: 42376.2). Total num frames: 4612390912. Throughput: 0: 42523.1. Samples: 891260200. Policy #0 lag: (min: 0.0, avg: 10.0, max: 23.0) [2024-06-28 18:27:17,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 18:27:19,989][09403] Signal inference workers to stop experience collection... (12300 times) [2024-06-28 18:27:20,040][09423] InferenceWorker_p0-w0: stopping experience collection (12300 times) [2024-06-28 18:27:20,041][09403] Signal inference workers to resume experience collection... (12300 times) [2024-06-28 18:27:20,053][09423] InferenceWorker_p0-w0: resuming experience collection (12300 times) [2024-06-28 18:27:21,324][09423] Updated weights for policy 0, policy_version 281527 (0.0025) [2024-06-28 18:27:22,921][09190] Fps is (10 sec: 42598.8, 60 sec: 42052.3, 300 sec: 42431.8). Total num frames: 4612587520. Throughput: 0: 42342.3. Samples: 891510740. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 18:27:22,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:27:25,057][09423] Updated weights for policy 0, policy_version 281537 (0.0033) [2024-06-28 18:27:27,921][09190] Fps is (10 sec: 39322.0, 60 sec: 42052.3, 300 sec: 42376.3). Total num frames: 4612784128. Throughput: 0: 42575.9. Samples: 891637700. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 18:27:27,922][09190] Avg episode reward: [(0, '0.738')] [2024-06-28 18:27:28,616][09423] Updated weights for policy 0, policy_version 281547 (0.0036) [2024-06-28 18:27:32,921][09190] Fps is (10 sec: 42598.7, 60 sec: 41779.3, 300 sec: 42376.2). Total num frames: 4613013504. Throughput: 0: 42555.6. Samples: 891898620. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 18:27:32,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 18:27:32,997][09423] Updated weights for policy 0, policy_version 281557 (0.0025) [2024-06-28 18:27:36,242][09423] Updated weights for policy 0, policy_version 281567 (0.0033) [2024-06-28 18:27:37,921][09190] Fps is (10 sec: 44236.7, 60 sec: 42325.4, 300 sec: 42487.3). Total num frames: 4613226496. Throughput: 0: 42412.8. Samples: 892149160. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 18:27:37,925][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 18:27:40,479][09423] Updated weights for policy 0, policy_version 281577 (0.0034) [2024-06-28 18:27:42,921][09190] Fps is (10 sec: 42597.8, 60 sec: 42325.3, 300 sec: 42431.8). Total num frames: 4613439488. Throughput: 0: 42394.7. Samples: 892275740. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 18:27:42,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:27:44,114][09423] Updated weights for policy 0, policy_version 281587 (0.0032) [2024-06-28 18:27:47,839][09423] Updated weights for policy 0, policy_version 281597 (0.0028) [2024-06-28 18:27:47,924][09190] Fps is (10 sec: 45864.0, 60 sec: 42596.6, 300 sec: 42487.0). Total num frames: 4613685248. Throughput: 0: 42595.8. Samples: 892534680. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 18:27:47,924][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 18:27:51,495][09423] Updated weights for policy 0, policy_version 281607 (0.0036) [2024-06-28 18:27:52,922][09190] Fps is (10 sec: 42598.2, 60 sec: 42598.3, 300 sec: 42431.8). Total num frames: 4613865472. Throughput: 0: 42615.4. Samples: 892789500. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 18:27:52,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 18:27:55,936][09423] Updated weights for policy 0, policy_version 281617 (0.0039) [2024-06-28 18:27:57,921][09190] Fps is (10 sec: 40970.2, 60 sec: 42873.2, 300 sec: 42487.3). Total num frames: 4614094848. Throughput: 0: 42713.4. Samples: 892920120. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 18:27:57,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 18:27:59,384][09423] Updated weights for policy 0, policy_version 281627 (0.0032) [2024-06-28 18:28:02,921][09190] Fps is (10 sec: 42599.2, 60 sec: 42325.3, 300 sec: 42320.7). Total num frames: 4614291456. Throughput: 0: 42448.2. Samples: 893170360. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 18:28:02,922][09190] Avg episode reward: [(0, '0.734')] [2024-06-28 18:28:03,491][09423] Updated weights for policy 0, policy_version 281637 (0.0034) [2024-06-28 18:28:07,446][09423] Updated weights for policy 0, policy_version 281647 (0.0027) [2024-06-28 18:28:07,921][09190] Fps is (10 sec: 42598.6, 60 sec: 42871.5, 300 sec: 42487.3). Total num frames: 4614520832. Throughput: 0: 42733.8. Samples: 893433760. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 18:28:07,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 18:28:11,761][09423] Updated weights for policy 0, policy_version 281657 (0.0031) [2024-06-28 18:28:12,924][09190] Fps is (10 sec: 45863.3, 60 sec: 43142.8, 300 sec: 42487.0). Total num frames: 4614750208. Throughput: 0: 42739.8. Samples: 893561100. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 18:28:12,925][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 18:28:14,897][09423] Updated weights for policy 0, policy_version 281667 (0.0030) [2024-06-28 18:28:17,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42325.4, 300 sec: 42376.3). Total num frames: 4614930432. Throughput: 0: 42542.2. Samples: 893813020. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 18:28:17,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 18:28:18,053][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000281674_4614946816.pth... [2024-06-28 18:28:18,107][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000281052_4604755968.pth [2024-06-28 18:28:19,399][09423] Updated weights for policy 0, policy_version 281677 (0.0031) [2024-06-28 18:28:22,921][09190] Fps is (10 sec: 39331.5, 60 sec: 42598.4, 300 sec: 42432.1). Total num frames: 4615143424. Throughput: 0: 42557.8. Samples: 894064260. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 18:28:22,922][09190] Avg episode reward: [(0, '0.728')] [2024-06-28 18:28:23,072][09423] Updated weights for policy 0, policy_version 281687 (0.0030) [2024-06-28 18:28:26,817][09423] Updated weights for policy 0, policy_version 281697 (0.0035) [2024-06-28 18:28:27,921][09190] Fps is (10 sec: 44236.3, 60 sec: 43144.5, 300 sec: 42487.3). Total num frames: 4615372800. Throughput: 0: 42535.5. Samples: 894189840. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 18:28:27,922][09190] Avg episode reward: [(0, '0.734')] [2024-06-28 18:28:30,944][09423] Updated weights for policy 0, policy_version 281707 (0.0031) [2024-06-28 18:28:32,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42325.3, 300 sec: 42376.2). Total num frames: 4615553024. Throughput: 0: 42453.0. Samples: 894444960. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 18:28:32,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 18:28:34,394][09423] Updated weights for policy 0, policy_version 281717 (0.0029) [2024-06-28 18:28:37,921][09190] Fps is (10 sec: 40960.3, 60 sec: 42598.4, 300 sec: 42487.3). Total num frames: 4615782400. Throughput: 0: 42609.9. Samples: 894706940. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 18:28:37,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 18:28:38,175][09423] Updated weights for policy 0, policy_version 281727 (0.0039) [2024-06-28 18:28:41,929][09423] Updated weights for policy 0, policy_version 281737 (0.0045) [2024-06-28 18:28:42,922][09190] Fps is (10 sec: 47513.1, 60 sec: 43144.5, 300 sec: 42542.9). Total num frames: 4616028160. Throughput: 0: 42547.0. Samples: 894834740. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 18:28:42,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 18:28:45,829][09423] Updated weights for policy 0, policy_version 281747 (0.0043) [2024-06-28 18:28:47,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42054.0, 300 sec: 42431.8). Total num frames: 4616208384. Throughput: 0: 42721.7. Samples: 895092840. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 18:28:47,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 18:28:49,637][09423] Updated weights for policy 0, policy_version 281757 (0.0036) [2024-06-28 18:28:52,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42871.5, 300 sec: 42487.3). Total num frames: 4616437760. Throughput: 0: 42337.7. Samples: 895338960. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 18:28:52,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 18:28:54,296][09423] Updated weights for policy 0, policy_version 281767 (0.0037) [2024-06-28 18:28:57,626][09423] Updated weights for policy 0, policy_version 281777 (0.0031) [2024-06-28 18:28:57,822][09403] Signal inference workers to stop experience collection... (12350 times) [2024-06-28 18:28:57,824][09403] Signal inference workers to resume experience collection... (12350 times) [2024-06-28 18:28:57,861][09423] InferenceWorker_p0-w0: stopping experience collection (12350 times) [2024-06-28 18:28:57,861][09423] InferenceWorker_p0-w0: resuming experience collection (12350 times) [2024-06-28 18:28:57,921][09190] Fps is (10 sec: 44236.9, 60 sec: 42598.4, 300 sec: 42487.3). Total num frames: 4616650752. Throughput: 0: 42317.1. Samples: 895465260. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 18:28:57,922][09190] Avg episode reward: [(0, '0.735')] [2024-06-28 18:29:01,720][09423] Updated weights for policy 0, policy_version 281787 (0.0035) [2024-06-28 18:29:02,921][09190] Fps is (10 sec: 39321.9, 60 sec: 42325.3, 300 sec: 42376.2). Total num frames: 4616830976. Throughput: 0: 42349.3. Samples: 895718740. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 18:29:02,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:29:05,036][09423] Updated weights for policy 0, policy_version 281797 (0.0036) [2024-06-28 18:29:07,921][09190] Fps is (10 sec: 39321.8, 60 sec: 42052.3, 300 sec: 42431.8). Total num frames: 4617043968. Throughput: 0: 42343.6. Samples: 895969720. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 18:29:07,922][09190] Avg episode reward: [(0, '0.731')] [2024-06-28 18:29:09,410][09423] Updated weights for policy 0, policy_version 281807 (0.0034) [2024-06-28 18:29:12,921][09190] Fps is (10 sec: 44236.8, 60 sec: 42054.1, 300 sec: 42431.8). Total num frames: 4617273344. Throughput: 0: 42456.1. Samples: 896100360. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 18:29:12,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 18:29:12,928][09423] Updated weights for policy 0, policy_version 281817 (0.0027) [2024-06-28 18:29:17,397][09423] Updated weights for policy 0, policy_version 281827 (0.0029) [2024-06-28 18:29:17,921][09190] Fps is (10 sec: 40960.1, 60 sec: 42052.3, 300 sec: 42320.7). Total num frames: 4617453568. Throughput: 0: 42417.8. Samples: 896353760. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 18:29:17,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 18:29:17,935][09190] No heartbeat for components: RolloutWorker_w20 (4896 seconds) [2024-06-28 18:29:20,251][09423] Updated weights for policy 0, policy_version 281837 (0.0030) [2024-06-28 18:29:22,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42598.4, 300 sec: 42487.3). Total num frames: 4617699328. Throughput: 0: 42192.9. Samples: 896605620. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 18:29:22,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:29:25,346][09423] Updated weights for policy 0, policy_version 281847 (0.0026) [2024-06-28 18:29:27,865][09423] Updated weights for policy 0, policy_version 281857 (0.0032) [2024-06-28 18:29:27,921][09190] Fps is (10 sec: 49151.7, 60 sec: 42871.5, 300 sec: 42598.4). Total num frames: 4617945088. Throughput: 0: 42253.4. Samples: 896736140. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 18:29:27,923][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 18:29:32,921][09190] Fps is (10 sec: 39321.7, 60 sec: 42325.4, 300 sec: 42376.2). Total num frames: 4618092544. Throughput: 0: 42170.7. Samples: 896990520. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 18:29:32,922][09190] Avg episode reward: [(0, '0.735')] [2024-06-28 18:29:32,961][09423] Updated weights for policy 0, policy_version 281867 (0.0030) [2024-06-28 18:29:35,998][09423] Updated weights for policy 0, policy_version 281877 (0.0026) [2024-06-28 18:29:37,921][09190] Fps is (10 sec: 37683.0, 60 sec: 42325.3, 300 sec: 42431.8). Total num frames: 4618321920. Throughput: 0: 42089.8. Samples: 897233000. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 18:29:37,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 18:29:40,551][09423] Updated weights for policy 0, policy_version 281887 (0.0030) [2024-06-28 18:29:42,921][09190] Fps is (10 sec: 45875.0, 60 sec: 42052.3, 300 sec: 42487.3). Total num frames: 4618551296. Throughput: 0: 42162.7. Samples: 897362580. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 18:29:42,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 18:29:43,563][09423] Updated weights for policy 0, policy_version 281897 (0.0031) [2024-06-28 18:29:47,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42052.2, 300 sec: 42376.3). Total num frames: 4618731520. Throughput: 0: 42202.1. Samples: 897617840. Policy #0 lag: (min: 0.0, avg: 11.4, max: 20.0) [2024-06-28 18:29:47,922][09190] Avg episode reward: [(0, '0.738')] [2024-06-28 18:29:48,672][09423] Updated weights for policy 0, policy_version 281907 (0.0036) [2024-06-28 18:29:51,875][09423] Updated weights for policy 0, policy_version 281917 (0.0039) [2024-06-28 18:29:52,921][09190] Fps is (10 sec: 39321.5, 60 sec: 41779.2, 300 sec: 42376.2). Total num frames: 4618944512. Throughput: 0: 42183.9. Samples: 897868000. Policy #0 lag: (min: 0.0, avg: 11.4, max: 20.0) [2024-06-28 18:29:52,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 18:29:56,171][09423] Updated weights for policy 0, policy_version 281927 (0.0028) [2024-06-28 18:29:57,922][09190] Fps is (10 sec: 45875.2, 60 sec: 42325.3, 300 sec: 42487.3). Total num frames: 4619190272. Throughput: 0: 42250.1. Samples: 898001620. Policy #0 lag: (min: 0.0, avg: 11.4, max: 20.0) [2024-06-28 18:29:57,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 18:29:59,316][09423] Updated weights for policy 0, policy_version 281937 (0.0030) [2024-06-28 18:30:02,921][09190] Fps is (10 sec: 40960.5, 60 sec: 42052.3, 300 sec: 42376.3). Total num frames: 4619354112. Throughput: 0: 42292.5. Samples: 898256920. Policy #0 lag: (min: 0.0, avg: 11.4, max: 20.0) [2024-06-28 18:30:02,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 18:30:03,641][09423] Updated weights for policy 0, policy_version 281947 (0.0033) [2024-06-28 18:30:07,059][09423] Updated weights for policy 0, policy_version 281957 (0.0038) [2024-06-28 18:30:07,922][09190] Fps is (10 sec: 40959.7, 60 sec: 42598.3, 300 sec: 42487.3). Total num frames: 4619599872. Throughput: 0: 42257.1. Samples: 898507200. Policy #0 lag: (min: 0.0, avg: 11.4, max: 20.0) [2024-06-28 18:30:07,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 18:30:11,315][09423] Updated weights for policy 0, policy_version 281967 (0.0028) [2024-06-28 18:30:12,922][09190] Fps is (10 sec: 47512.7, 60 sec: 42598.3, 300 sec: 42487.3). Total num frames: 4619829248. Throughput: 0: 42333.7. Samples: 898641160. Policy #0 lag: (min: 0.0, avg: 11.4, max: 20.0) [2024-06-28 18:30:12,922][09190] Avg episode reward: [(0, '0.731')] [2024-06-28 18:30:14,925][09423] Updated weights for policy 0, policy_version 281977 (0.0032) [2024-06-28 18:30:17,921][09190] Fps is (10 sec: 39322.4, 60 sec: 42325.3, 300 sec: 42320.7). Total num frames: 4619993088. Throughput: 0: 42239.1. Samples: 898891280. Policy #0 lag: (min: 0.0, avg: 11.4, max: 20.0) [2024-06-28 18:30:17,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 18:30:18,040][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000281983_4620009472.pth... [2024-06-28 18:30:18,103][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000281364_4609867776.pth [2024-06-28 18:30:19,004][09403] Signal inference workers to stop experience collection... (12400 times) [2024-06-28 18:30:19,059][09423] InferenceWorker_p0-w0: stopping experience collection (12400 times) [2024-06-28 18:30:19,067][09403] Signal inference workers to resume experience collection... (12400 times) [2024-06-28 18:30:19,075][09423] InferenceWorker_p0-w0: resuming experience collection (12400 times) [2024-06-28 18:30:19,212][09423] Updated weights for policy 0, policy_version 281987 (0.0029) [2024-06-28 18:30:22,921][09190] Fps is (10 sec: 39322.4, 60 sec: 42052.3, 300 sec: 42431.8). Total num frames: 4620222464. Throughput: 0: 42516.1. Samples: 899146220. Policy #0 lag: (min: 0.0, avg: 11.4, max: 20.0) [2024-06-28 18:30:22,922][09190] Avg episode reward: [(0, '0.731')] [2024-06-28 18:30:23,006][09423] Updated weights for policy 0, policy_version 281997 (0.0028) [2024-06-28 18:30:26,602][09423] Updated weights for policy 0, policy_version 282007 (0.0028) [2024-06-28 18:30:27,921][09190] Fps is (10 sec: 47513.2, 60 sec: 42052.2, 300 sec: 42487.3). Total num frames: 4620468224. Throughput: 0: 42534.2. Samples: 899276620. Policy #0 lag: (min: 0.0, avg: 11.4, max: 20.0) [2024-06-28 18:30:27,922][09190] Avg episode reward: [(0, '0.734')] [2024-06-28 18:30:30,610][09423] Updated weights for policy 0, policy_version 282017 (0.0028) [2024-06-28 18:30:32,921][09190] Fps is (10 sec: 42598.0, 60 sec: 42598.4, 300 sec: 42320.7). Total num frames: 4620648448. Throughput: 0: 42533.4. Samples: 899531840. Policy #0 lag: (min: 0.0, avg: 11.4, max: 20.0) [2024-06-28 18:30:32,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 18:30:34,160][09423] Updated weights for policy 0, policy_version 282027 (0.0030) [2024-06-28 18:30:37,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42598.4, 300 sec: 42487.3). Total num frames: 4620877824. Throughput: 0: 42648.9. Samples: 899787200. Policy #0 lag: (min: 0.0, avg: 11.4, max: 20.0) [2024-06-28 18:30:37,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 18:30:38,552][09423] Updated weights for policy 0, policy_version 282037 (0.0030) [2024-06-28 18:30:42,182][09423] Updated weights for policy 0, policy_version 282047 (0.0028) [2024-06-28 18:30:42,922][09190] Fps is (10 sec: 45874.8, 60 sec: 42598.3, 300 sec: 42487.3). Total num frames: 4621107200. Throughput: 0: 42568.0. Samples: 899917180. Policy #0 lag: (min: 0.0, avg: 11.4, max: 20.0) [2024-06-28 18:30:42,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 18:30:46,469][09423] Updated weights for policy 0, policy_version 282057 (0.0027) [2024-06-28 18:30:47,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42598.4, 300 sec: 42431.8). Total num frames: 4621287424. Throughput: 0: 42584.7. Samples: 900173240. Policy #0 lag: (min: 0.0, avg: 11.4, max: 20.0) [2024-06-28 18:30:47,922][09190] Avg episode reward: [(0, '0.735')] [2024-06-28 18:30:49,718][09423] Updated weights for policy 0, policy_version 282067 (0.0041) [2024-06-28 18:30:52,921][09190] Fps is (10 sec: 40960.4, 60 sec: 42871.5, 300 sec: 42487.3). Total num frames: 4621516800. Throughput: 0: 42324.6. Samples: 900411800. Policy #0 lag: (min: 0.0, avg: 11.4, max: 20.0) [2024-06-28 18:30:52,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 18:30:54,168][09423] Updated weights for policy 0, policy_version 282077 (0.0036) [2024-06-28 18:30:57,365][09423] Updated weights for policy 0, policy_version 282087 (0.0028) [2024-06-28 18:30:57,922][09190] Fps is (10 sec: 44236.7, 60 sec: 42325.3, 300 sec: 42431.8). Total num frames: 4621729792. Throughput: 0: 42368.5. Samples: 900547740. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2024-06-28 18:30:57,922][09190] Avg episode reward: [(0, '0.732')] [2024-06-28 18:31:02,009][09423] Updated weights for policy 0, policy_version 282097 (0.0040) [2024-06-28 18:31:02,924][09190] Fps is (10 sec: 40949.9, 60 sec: 42869.6, 300 sec: 42375.9). Total num frames: 4621926400. Throughput: 0: 42500.7. Samples: 900803920. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2024-06-28 18:31:02,924][09190] Avg episode reward: [(0, '0.734')] [2024-06-28 18:31:05,000][09423] Updated weights for policy 0, policy_version 282107 (0.0029) [2024-06-28 18:31:07,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42325.4, 300 sec: 42431.8). Total num frames: 4622139392. Throughput: 0: 42450.5. Samples: 901056500. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2024-06-28 18:31:07,924][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 18:31:09,427][09423] Updated weights for policy 0, policy_version 282117 (0.0030) [2024-06-28 18:31:12,921][09190] Fps is (10 sec: 42609.0, 60 sec: 42052.4, 300 sec: 42376.3). Total num frames: 4622352384. Throughput: 0: 42362.3. Samples: 901182920. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2024-06-28 18:31:12,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 18:31:12,944][09423] Updated weights for policy 0, policy_version 282127 (0.0033) [2024-06-28 18:31:17,194][09423] Updated weights for policy 0, policy_version 282137 (0.0029) [2024-06-28 18:31:17,921][09190] Fps is (10 sec: 42598.8, 60 sec: 42871.5, 300 sec: 42376.3). Total num frames: 4622565376. Throughput: 0: 42456.1. Samples: 901442360. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2024-06-28 18:31:17,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 18:31:20,460][09423] Updated weights for policy 0, policy_version 282147 (0.0039) [2024-06-28 18:31:22,924][09190] Fps is (10 sec: 42587.8, 60 sec: 42596.6, 300 sec: 42431.4). Total num frames: 4622778368. Throughput: 0: 42283.5. Samples: 901690060. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2024-06-28 18:31:22,924][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 18:31:24,956][09423] Updated weights for policy 0, policy_version 282157 (0.0039) [2024-06-28 18:31:27,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42052.4, 300 sec: 42320.7). Total num frames: 4622991360. Throughput: 0: 42274.4. Samples: 901819520. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2024-06-28 18:31:27,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 18:31:28,411][09423] Updated weights for policy 0, policy_version 282167 (0.0043) [2024-06-28 18:31:32,706][09423] Updated weights for policy 0, policy_version 282177 (0.0035) [2024-06-28 18:31:32,921][09190] Fps is (10 sec: 40970.4, 60 sec: 42325.4, 300 sec: 42376.3). Total num frames: 4623187968. Throughput: 0: 42264.6. Samples: 902075140. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2024-06-28 18:31:32,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 18:31:34,518][09403] Signal inference workers to stop experience collection... (12450 times) [2024-06-28 18:31:34,560][09423] InferenceWorker_p0-w0: stopping experience collection (12450 times) [2024-06-28 18:31:34,576][09403] Signal inference workers to resume experience collection... (12450 times) [2024-06-28 18:31:34,577][09423] InferenceWorker_p0-w0: resuming experience collection (12450 times) [2024-06-28 18:31:35,997][09423] Updated weights for policy 0, policy_version 282187 (0.0040) [2024-06-28 18:31:37,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42052.3, 300 sec: 42376.2). Total num frames: 4623400960. Throughput: 0: 42496.5. Samples: 902324140. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2024-06-28 18:31:37,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 18:31:40,147][09423] Updated weights for policy 0, policy_version 282197 (0.0028) [2024-06-28 18:31:42,921][09190] Fps is (10 sec: 42598.2, 60 sec: 41779.3, 300 sec: 42320.7). Total num frames: 4623613952. Throughput: 0: 42298.8. Samples: 902451180. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2024-06-28 18:31:42,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 18:31:43,587][09423] Updated weights for policy 0, policy_version 282207 (0.0037) [2024-06-28 18:31:47,867][09423] Updated weights for policy 0, policy_version 282217 (0.0028) [2024-06-28 18:31:47,922][09190] Fps is (10 sec: 44236.0, 60 sec: 42598.4, 300 sec: 42487.3). Total num frames: 4623843328. Throughput: 0: 42399.6. Samples: 902711800. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2024-06-28 18:31:47,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 18:31:51,404][09423] Updated weights for policy 0, policy_version 282227 (0.0038) [2024-06-28 18:31:52,921][09190] Fps is (10 sec: 42597.9, 60 sec: 42052.2, 300 sec: 42432.1). Total num frames: 4624039936. Throughput: 0: 42296.4. Samples: 902959840. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2024-06-28 18:31:52,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 18:31:55,667][09423] Updated weights for policy 0, policy_version 282237 (0.0027) [2024-06-28 18:31:57,924][09190] Fps is (10 sec: 42588.2, 60 sec: 42323.6, 300 sec: 42431.4). Total num frames: 4624269312. Throughput: 0: 42391.9. Samples: 903090660. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2024-06-28 18:31:57,925][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 18:31:58,682][09423] Updated weights for policy 0, policy_version 282247 (0.0028) [2024-06-28 18:32:02,921][09190] Fps is (10 sec: 42598.8, 60 sec: 42327.1, 300 sec: 42431.8). Total num frames: 4624465920. Throughput: 0: 42368.9. Samples: 903348960. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2024-06-28 18:32:02,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 18:32:03,231][09423] Updated weights for policy 0, policy_version 282257 (0.0036) [2024-06-28 18:32:06,589][09423] Updated weights for policy 0, policy_version 282267 (0.0025) [2024-06-28 18:32:07,921][09190] Fps is (10 sec: 40969.9, 60 sec: 42325.3, 300 sec: 42431.8). Total num frames: 4624678912. Throughput: 0: 42332.0. Samples: 903594900. Policy #0 lag: (min: 0.0, avg: 8.5, max: 19.0) [2024-06-28 18:32:07,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 18:32:10,970][09423] Updated weights for policy 0, policy_version 282277 (0.0029) [2024-06-28 18:32:12,921][09190] Fps is (10 sec: 44236.7, 60 sec: 42598.4, 300 sec: 42431.8). Total num frames: 4624908288. Throughput: 0: 42327.9. Samples: 903724280. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 18:32:12,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 18:32:14,503][09423] Updated weights for policy 0, policy_version 282287 (0.0033) [2024-06-28 18:32:17,921][09190] Fps is (10 sec: 44236.9, 60 sec: 42598.3, 300 sec: 42487.3). Total num frames: 4625121280. Throughput: 0: 42346.5. Samples: 903980740. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 18:32:17,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 18:32:17,929][09190] No heartbeat for components: RolloutWorker_w20 (5076 seconds) [2024-06-28 18:32:17,933][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000282295_4625121280.pth... [2024-06-28 18:32:17,984][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000281674_4614946816.pth [2024-06-28 18:32:18,985][09423] Updated weights for policy 0, policy_version 282297 (0.0038) [2024-06-28 18:32:22,316][09423] Updated weights for policy 0, policy_version 282307 (0.0040) [2024-06-28 18:32:22,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42327.1, 300 sec: 42487.3). Total num frames: 4625317888. Throughput: 0: 42489.7. Samples: 904236180. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 18:32:22,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 18:32:26,638][09423] Updated weights for policy 0, policy_version 282317 (0.0032) [2024-06-28 18:32:27,921][09190] Fps is (10 sec: 42599.0, 60 sec: 42598.4, 300 sec: 42487.3). Total num frames: 4625547264. Throughput: 0: 42549.4. Samples: 904365900. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 18:32:27,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:32:30,104][09423] Updated weights for policy 0, policy_version 282327 (0.0035) [2024-06-28 18:32:32,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42598.3, 300 sec: 42431.8). Total num frames: 4625743872. Throughput: 0: 42434.3. Samples: 904621340. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 18:32:32,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 18:32:34,431][09423] Updated weights for policy 0, policy_version 282337 (0.0038) [2024-06-28 18:32:37,925][09190] Fps is (10 sec: 40945.3, 60 sec: 42595.9, 300 sec: 42431.3). Total num frames: 4625956864. Throughput: 0: 42648.7. Samples: 904879180. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 18:32:37,925][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 18:32:37,989][09423] Updated weights for policy 0, policy_version 282347 (0.0040) [2024-06-28 18:32:42,002][09423] Updated weights for policy 0, policy_version 282357 (0.0035) [2024-06-28 18:32:42,921][09190] Fps is (10 sec: 44236.6, 60 sec: 42871.4, 300 sec: 42376.6). Total num frames: 4626186240. Throughput: 0: 42564.5. Samples: 905005960. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 18:32:42,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:32:45,716][09423] Updated weights for policy 0, policy_version 282367 (0.0028) [2024-06-28 18:32:47,921][09190] Fps is (10 sec: 42613.5, 60 sec: 42325.4, 300 sec: 42431.8). Total num frames: 4626382848. Throughput: 0: 42432.0. Samples: 905258400. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 18:32:47,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 18:32:49,878][09423] Updated weights for policy 0, policy_version 282377 (0.0038) [2024-06-28 18:32:52,922][09190] Fps is (10 sec: 40959.7, 60 sec: 42598.4, 300 sec: 42376.2). Total num frames: 4626595840. Throughput: 0: 42524.4. Samples: 905508500. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 18:32:52,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 18:32:53,838][09423] Updated weights for policy 0, policy_version 282387 (0.0037) [2024-06-28 18:32:57,319][09423] Updated weights for policy 0, policy_version 282397 (0.0030) [2024-06-28 18:32:57,921][09190] Fps is (10 sec: 44236.3, 60 sec: 42600.1, 300 sec: 42487.3). Total num frames: 4626825216. Throughput: 0: 42454.6. Samples: 905634740. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 18:32:57,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 18:33:01,091][09423] Updated weights for policy 0, policy_version 282407 (0.0040) [2024-06-28 18:33:02,922][09190] Fps is (10 sec: 42598.6, 60 sec: 42598.3, 300 sec: 42376.2). Total num frames: 4627021824. Throughput: 0: 42516.9. Samples: 905894000. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 18:33:02,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 18:33:04,583][09403] Signal inference workers to stop experience collection... (12500 times) [2024-06-28 18:33:04,583][09403] Signal inference workers to resume experience collection... (12500 times) [2024-06-28 18:33:04,597][09423] InferenceWorker_p0-w0: stopping experience collection (12500 times) [2024-06-28 18:33:04,597][09423] InferenceWorker_p0-w0: resuming experience collection (12500 times) [2024-06-28 18:33:04,921][09423] Updated weights for policy 0, policy_version 282417 (0.0039) [2024-06-28 18:33:07,921][09190] Fps is (10 sec: 40960.5, 60 sec: 42598.5, 300 sec: 42321.1). Total num frames: 4627234816. Throughput: 0: 42676.9. Samples: 906156640. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 18:33:07,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 18:33:08,723][09423] Updated weights for policy 0, policy_version 282427 (0.0030) [2024-06-28 18:33:12,382][09423] Updated weights for policy 0, policy_version 282437 (0.0027) [2024-06-28 18:33:12,922][09190] Fps is (10 sec: 44236.6, 60 sec: 42598.3, 300 sec: 42487.3). Total num frames: 4627464192. Throughput: 0: 42647.4. Samples: 906285040. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 18:33:12,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 18:33:16,305][09423] Updated weights for policy 0, policy_version 282447 (0.0032) [2024-06-28 18:33:17,921][09190] Fps is (10 sec: 42598.1, 60 sec: 42325.3, 300 sec: 42431.8). Total num frames: 4627660800. Throughput: 0: 42635.1. Samples: 906539920. Policy #0 lag: (min: 0.0, avg: 9.2, max: 20.0) [2024-06-28 18:33:17,926][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:33:19,778][09423] Updated weights for policy 0, policy_version 282457 (0.0030) [2024-06-28 18:33:22,921][09190] Fps is (10 sec: 40960.6, 60 sec: 42598.4, 300 sec: 42376.3). Total num frames: 4627873792. Throughput: 0: 42670.9. Samples: 906799220. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 18:33:22,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 18:33:23,847][09423] Updated weights for policy 0, policy_version 282467 (0.0034) [2024-06-28 18:33:27,921][09190] Fps is (10 sec: 42598.5, 60 sec: 42325.3, 300 sec: 42487.3). Total num frames: 4628086784. Throughput: 0: 42662.3. Samples: 906925760. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 18:33:27,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 18:33:27,989][09423] Updated weights for policy 0, policy_version 282477 (0.0033) [2024-06-28 18:33:31,679][09423] Updated weights for policy 0, policy_version 282487 (0.0030) [2024-06-28 18:33:32,921][09190] Fps is (10 sec: 42598.1, 60 sec: 42598.4, 300 sec: 42431.8). Total num frames: 4628299776. Throughput: 0: 42618.6. Samples: 907176240. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 18:33:32,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 18:33:35,373][09423] Updated weights for policy 0, policy_version 282497 (0.0030) [2024-06-28 18:33:37,921][09190] Fps is (10 sec: 42598.6, 60 sec: 42600.9, 300 sec: 42320.7). Total num frames: 4628512768. Throughput: 0: 42806.4. Samples: 907434780. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 18:33:37,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 18:33:39,191][09423] Updated weights for policy 0, policy_version 282507 (0.0037) [2024-06-28 18:33:42,921][09190] Fps is (10 sec: 44236.6, 60 sec: 42598.4, 300 sec: 42487.3). Total num frames: 4628742144. Throughput: 0: 42910.7. Samples: 907565720. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 18:33:42,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 18:33:42,932][09423] Updated weights for policy 0, policy_version 282517 (0.0030) [2024-06-28 18:33:47,255][09423] Updated weights for policy 0, policy_version 282527 (0.0032) [2024-06-28 18:33:47,921][09190] Fps is (10 sec: 40960.1, 60 sec: 42325.4, 300 sec: 42320.7). Total num frames: 4628922368. Throughput: 0: 42683.3. Samples: 907814740. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 18:33:47,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 18:33:50,581][09423] Updated weights for policy 0, policy_version 282537 (0.0044) [2024-06-28 18:33:52,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42871.5, 300 sec: 42431.8). Total num frames: 4629168128. Throughput: 0: 42345.7. Samples: 908062200. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 18:33:52,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 18:33:55,315][09423] Updated weights for policy 0, policy_version 282547 (0.0036) [2024-06-28 18:33:57,921][09190] Fps is (10 sec: 45874.5, 60 sec: 42598.4, 300 sec: 42542.8). Total num frames: 4629381120. Throughput: 0: 42580.0. Samples: 908201140. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 18:33:57,922][09190] Avg episode reward: [(0, '0.735')] [2024-06-28 18:33:58,639][09423] Updated weights for policy 0, policy_version 282557 (0.0038) [2024-06-28 18:34:02,921][09190] Fps is (10 sec: 39321.7, 60 sec: 42325.4, 300 sec: 42431.8). Total num frames: 4629561344. Throughput: 0: 42339.5. Samples: 908445200. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 18:34:02,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 18:34:03,338][09423] Updated weights for policy 0, policy_version 282567 (0.0027) [2024-06-28 18:34:06,499][09423] Updated weights for policy 0, policy_version 282577 (0.0028) [2024-06-28 18:34:07,921][09190] Fps is (10 sec: 42598.7, 60 sec: 42871.4, 300 sec: 42487.3). Total num frames: 4629807104. Throughput: 0: 42175.9. Samples: 908697140. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 18:34:07,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 18:34:11,087][09423] Updated weights for policy 0, policy_version 282587 (0.0038) [2024-06-28 18:34:12,921][09190] Fps is (10 sec: 44237.2, 60 sec: 42325.4, 300 sec: 42542.9). Total num frames: 4630003712. Throughput: 0: 42352.5. Samples: 908831620. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 18:34:12,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:34:14,335][09423] Updated weights for policy 0, policy_version 282597 (0.0030) [2024-06-28 18:34:17,922][09190] Fps is (10 sec: 39321.2, 60 sec: 42325.3, 300 sec: 42376.2). Total num frames: 4630200320. Throughput: 0: 42409.7. Samples: 909084680. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 18:34:17,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:34:17,939][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000282605_4630200320.pth... [2024-06-28 18:34:17,994][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000281983_4620009472.pth [2024-06-28 18:34:18,616][09423] Updated weights for policy 0, policy_version 282607 (0.0032) [2024-06-28 18:34:22,158][09423] Updated weights for policy 0, policy_version 282617 (0.0027) [2024-06-28 18:34:22,921][09190] Fps is (10 sec: 44236.3, 60 sec: 42871.4, 300 sec: 42376.2). Total num frames: 4630446080. Throughput: 0: 42102.1. Samples: 909329380. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 18:34:22,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 18:34:26,256][09423] Updated weights for policy 0, policy_version 282627 (0.0035) [2024-06-28 18:34:27,921][09190] Fps is (10 sec: 45875.9, 60 sec: 42871.5, 300 sec: 42598.4). Total num frames: 4630659072. Throughput: 0: 42264.1. Samples: 909467600. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 18:34:27,922][09190] Avg episode reward: [(0, '0.734')] [2024-06-28 18:34:29,749][09423] Updated weights for policy 0, policy_version 282637 (0.0035) [2024-06-28 18:34:30,213][09403] Signal inference workers to stop experience collection... (12550 times) [2024-06-28 18:34:30,214][09403] Signal inference workers to resume experience collection... (12550 times) [2024-06-28 18:34:30,260][09423] InferenceWorker_p0-w0: stopping experience collection (12550 times) [2024-06-28 18:34:30,260][09423] InferenceWorker_p0-w0: resuming experience collection (12550 times) [2024-06-28 18:34:32,921][09190] Fps is (10 sec: 37683.7, 60 sec: 42052.3, 300 sec: 42376.3). Total num frames: 4630822912. Throughput: 0: 42256.9. Samples: 909716300. Policy #0 lag: (min: 0.0, avg: 10.8, max: 20.0) [2024-06-28 18:34:32,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 18:34:33,807][09423] Updated weights for policy 0, policy_version 282647 (0.0033) [2024-06-28 18:34:37,442][09423] Updated weights for policy 0, policy_version 282657 (0.0030) [2024-06-28 18:34:37,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42598.4, 300 sec: 42431.8). Total num frames: 4631068672. Throughput: 0: 42358.3. Samples: 909968320. Policy #0 lag: (min: 1.0, avg: 8.1, max: 20.0) [2024-06-28 18:34:37,922][09190] Avg episode reward: [(0, '0.731')] [2024-06-28 18:34:41,644][09423] Updated weights for policy 0, policy_version 282667 (0.0031) [2024-06-28 18:34:42,921][09190] Fps is (10 sec: 45874.7, 60 sec: 42325.3, 300 sec: 42542.9). Total num frames: 4631281664. Throughput: 0: 42280.0. Samples: 910103740. Policy #0 lag: (min: 1.0, avg: 8.1, max: 20.0) [2024-06-28 18:34:42,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 18:34:45,122][09423] Updated weights for policy 0, policy_version 282677 (0.0028) [2024-06-28 18:34:47,921][09190] Fps is (10 sec: 37683.2, 60 sec: 42052.2, 300 sec: 42376.3). Total num frames: 4631445504. Throughput: 0: 42292.5. Samples: 910348360. Policy #0 lag: (min: 1.0, avg: 8.1, max: 20.0) [2024-06-28 18:34:47,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 18:34:49,345][09423] Updated weights for policy 0, policy_version 282687 (0.0035) [2024-06-28 18:34:52,625][09423] Updated weights for policy 0, policy_version 282697 (0.0028) [2024-06-28 18:34:52,921][09190] Fps is (10 sec: 44236.8, 60 sec: 42598.4, 300 sec: 42487.3). Total num frames: 4631724032. Throughput: 0: 42238.2. Samples: 910597860. Policy #0 lag: (min: 1.0, avg: 8.1, max: 20.0) [2024-06-28 18:34:52,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 18:34:56,858][09423] Updated weights for policy 0, policy_version 282707 (0.0032) [2024-06-28 18:34:57,921][09190] Fps is (10 sec: 45875.4, 60 sec: 42052.4, 300 sec: 42542.9). Total num frames: 4631904256. Throughput: 0: 42383.6. Samples: 910738880. Policy #0 lag: (min: 1.0, avg: 8.1, max: 20.0) [2024-06-28 18:34:57,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 18:35:00,064][09423] Updated weights for policy 0, policy_version 282717 (0.0028) [2024-06-28 18:35:02,921][09190] Fps is (10 sec: 37683.6, 60 sec: 42325.4, 300 sec: 42376.3). Total num frames: 4632100864. Throughput: 0: 42361.5. Samples: 910990940. Policy #0 lag: (min: 1.0, avg: 8.1, max: 20.0) [2024-06-28 18:35:02,922][09190] Avg episode reward: [(0, '0.727')] [2024-06-28 18:35:04,848][09423] Updated weights for policy 0, policy_version 282727 (0.0037) [2024-06-28 18:35:07,791][09423] Updated weights for policy 0, policy_version 282737 (0.0033) [2024-06-28 18:35:07,921][09190] Fps is (10 sec: 45875.0, 60 sec: 42598.4, 300 sec: 42487.3). Total num frames: 4632363008. Throughput: 0: 42405.0. Samples: 911237600. Policy #0 lag: (min: 1.0, avg: 8.1, max: 20.0) [2024-06-28 18:35:07,922][09190] Avg episode reward: [(0, '0.733')] [2024-06-28 18:35:12,333][09423] Updated weights for policy 0, policy_version 282747 (0.0032) [2024-06-28 18:35:12,921][09190] Fps is (10 sec: 44237.0, 60 sec: 42325.4, 300 sec: 42542.9). Total num frames: 4632543232. Throughput: 0: 42365.8. Samples: 911374060. Policy #0 lag: (min: 1.0, avg: 8.1, max: 20.0) [2024-06-28 18:35:12,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 18:35:15,778][09423] Updated weights for policy 0, policy_version 282757 (0.0029) [2024-06-28 18:35:17,922][09190] Fps is (10 sec: 37682.6, 60 sec: 42325.3, 300 sec: 42431.8). Total num frames: 4632739840. Throughput: 0: 42418.9. Samples: 911625160. Policy #0 lag: (min: 1.0, avg: 8.1, max: 20.0) [2024-06-28 18:35:17,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 18:35:17,937][09190] No heartbeat for components: RolloutWorker_w20 (5256 seconds) [2024-06-28 18:35:19,768][09423] Updated weights for policy 0, policy_version 282767 (0.0033) [2024-06-28 18:35:22,921][09190] Fps is (10 sec: 44236.5, 60 sec: 42325.4, 300 sec: 42431.8). Total num frames: 4632985600. Throughput: 0: 42344.0. Samples: 911873800. Policy #0 lag: (min: 1.0, avg: 8.1, max: 20.0) [2024-06-28 18:35:22,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 18:35:23,409][09423] Updated weights for policy 0, policy_version 282777 (0.0029) [2024-06-28 18:35:27,569][09423] Updated weights for policy 0, policy_version 282787 (0.0034) [2024-06-28 18:35:27,922][09190] Fps is (10 sec: 44237.0, 60 sec: 42052.2, 300 sec: 42487.3). Total num frames: 4633182208. Throughput: 0: 42508.4. Samples: 912016620. Policy #0 lag: (min: 1.0, avg: 8.1, max: 20.0) [2024-06-28 18:35:27,922][09190] Avg episode reward: [(0, '0.731')] [2024-06-28 18:35:31,664][09423] Updated weights for policy 0, policy_version 282797 (0.0034) [2024-06-28 18:35:32,922][09190] Fps is (10 sec: 37682.7, 60 sec: 42325.2, 300 sec: 42320.7). Total num frames: 4633362432. Throughput: 0: 42521.6. Samples: 912261840. Policy #0 lag: (min: 1.0, avg: 8.1, max: 20.0) [2024-06-28 18:35:32,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 18:35:35,452][09423] Updated weights for policy 0, policy_version 282807 (0.0041) [2024-06-28 18:35:37,921][09190] Fps is (10 sec: 44237.1, 60 sec: 42598.4, 300 sec: 42431.8). Total num frames: 4633624576. Throughput: 0: 42546.3. Samples: 912512440. Policy #0 lag: (min: 1.0, avg: 8.1, max: 20.0) [2024-06-28 18:35:37,922][09190] Avg episode reward: [(0, '0.728')] [2024-06-28 18:35:39,854][09423] Updated weights for policy 0, policy_version 282817 (0.0031) [2024-06-28 18:35:42,921][09190] Fps is (10 sec: 45875.8, 60 sec: 42325.4, 300 sec: 42487.3). Total num frames: 4633821184. Throughput: 0: 42482.6. Samples: 912650600. Policy #0 lag: (min: 1.0, avg: 8.1, max: 20.0) [2024-06-28 18:35:42,922][09190] Avg episode reward: [(0, '0.734')] [2024-06-28 18:35:43,087][09423] Updated weights for policy 0, policy_version 282827 (0.0035) [2024-06-28 18:35:47,153][09423] Updated weights for policy 0, policy_version 282837 (0.0026) [2024-06-28 18:35:47,921][09190] Fps is (10 sec: 39321.6, 60 sec: 42871.4, 300 sec: 42376.2). Total num frames: 4634017792. Throughput: 0: 42463.5. Samples: 912901800. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 18:35:47,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 18:35:49,063][09403] Signal inference workers to stop experience collection... (12600 times) [2024-06-28 18:35:49,112][09423] InferenceWorker_p0-w0: stopping experience collection (12600 times) [2024-06-28 18:35:49,180][09403] Signal inference workers to resume experience collection... (12600 times) [2024-06-28 18:35:49,180][09423] InferenceWorker_p0-w0: resuming experience collection (12600 times) [2024-06-28 18:35:51,227][09423] Updated weights for policy 0, policy_version 282847 (0.0035) [2024-06-28 18:35:52,921][09190] Fps is (10 sec: 42598.6, 60 sec: 42052.3, 300 sec: 42431.8). Total num frames: 4634247168. Throughput: 0: 42386.7. Samples: 913145000. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 18:35:52,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 18:35:55,226][09423] Updated weights for policy 0, policy_version 282857 (0.0037) [2024-06-28 18:35:57,924][09190] Fps is (10 sec: 44225.6, 60 sec: 42596.5, 300 sec: 42487.3). Total num frames: 4634460160. Throughput: 0: 42369.5. Samples: 913280800. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 18:35:57,925][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 18:35:58,686][09423] Updated weights for policy 0, policy_version 282867 (0.0036) [2024-06-28 18:36:02,862][09423] Updated weights for policy 0, policy_version 282877 (0.0031) [2024-06-28 18:36:02,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42598.4, 300 sec: 42431.8). Total num frames: 4634656768. Throughput: 0: 42346.0. Samples: 913530720. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 18:36:02,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 18:36:06,263][09423] Updated weights for policy 0, policy_version 282887 (0.0028) [2024-06-28 18:36:07,922][09190] Fps is (10 sec: 44247.6, 60 sec: 42325.2, 300 sec: 42542.8). Total num frames: 4634902528. Throughput: 0: 42243.0. Samples: 913774740. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 18:36:07,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:36:10,884][09423] Updated weights for policy 0, policy_version 282897 (0.0032) [2024-06-28 18:36:12,924][09190] Fps is (10 sec: 40949.5, 60 sec: 42050.4, 300 sec: 42375.9). Total num frames: 4635066368. Throughput: 0: 42234.6. Samples: 913917280. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 18:36:12,925][09190] Avg episode reward: [(0, '0.738')] [2024-06-28 18:36:14,042][09423] Updated weights for policy 0, policy_version 282907 (0.0033) [2024-06-28 18:36:17,921][09190] Fps is (10 sec: 37683.7, 60 sec: 42325.4, 300 sec: 42376.6). Total num frames: 4635279360. Throughput: 0: 42324.1. Samples: 914166420. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 18:36:17,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 18:36:18,061][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000282916_4635295744.pth... [2024-06-28 18:36:18,111][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000282295_4625121280.pth [2024-06-28 18:36:18,422][09423] Updated weights for policy 0, policy_version 282917 (0.0033) [2024-06-28 18:36:21,504][09423] Updated weights for policy 0, policy_version 282927 (0.0034) [2024-06-28 18:36:22,921][09190] Fps is (10 sec: 45887.0, 60 sec: 42325.4, 300 sec: 42487.3). Total num frames: 4635525120. Throughput: 0: 42317.4. Samples: 914416720. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 18:36:22,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:36:25,968][09423] Updated weights for policy 0, policy_version 282937 (0.0036) [2024-06-28 18:36:27,921][09190] Fps is (10 sec: 42598.0, 60 sec: 42052.3, 300 sec: 42431.8). Total num frames: 4635705344. Throughput: 0: 42167.9. Samples: 914548160. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 18:36:27,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 18:36:29,402][09423] Updated weights for policy 0, policy_version 282947 (0.0027) [2024-06-28 18:36:32,921][09190] Fps is (10 sec: 37683.3, 60 sec: 42325.5, 300 sec: 42376.2). Total num frames: 4635901952. Throughput: 0: 42165.9. Samples: 914799260. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 18:36:32,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 18:36:33,742][09423] Updated weights for policy 0, policy_version 282957 (0.0029) [2024-06-28 18:36:36,851][09423] Updated weights for policy 0, policy_version 282967 (0.0048) [2024-06-28 18:36:37,921][09190] Fps is (10 sec: 44237.1, 60 sec: 42052.3, 300 sec: 42487.3). Total num frames: 4636147712. Throughput: 0: 42344.8. Samples: 915050520. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 18:36:37,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 18:36:41,692][09423] Updated weights for policy 0, policy_version 282977 (0.0030) [2024-06-28 18:36:42,921][09190] Fps is (10 sec: 47512.9, 60 sec: 42598.4, 300 sec: 42487.3). Total num frames: 4636377088. Throughput: 0: 42454.3. Samples: 915191140. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 18:36:42,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 18:36:44,387][09423] Updated weights for policy 0, policy_version 282987 (0.0030) [2024-06-28 18:36:47,922][09190] Fps is (10 sec: 39320.8, 60 sec: 42052.1, 300 sec: 42376.2). Total num frames: 4636540928. Throughput: 0: 42412.2. Samples: 915439280. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 18:36:47,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 18:36:49,279][09423] Updated weights for policy 0, policy_version 282997 (0.0034) [2024-06-28 18:36:52,200][09423] Updated weights for policy 0, policy_version 283007 (0.0035) [2024-06-28 18:36:52,921][09190] Fps is (10 sec: 42598.8, 60 sec: 42598.4, 300 sec: 42487.7). Total num frames: 4636803072. Throughput: 0: 42466.4. Samples: 915685720. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 18:36:52,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:36:56,666][09423] Updated weights for policy 0, policy_version 283017 (0.0033) [2024-06-28 18:36:57,921][09190] Fps is (10 sec: 44237.7, 60 sec: 42054.1, 300 sec: 42431.8). Total num frames: 4636983296. Throughput: 0: 42382.4. Samples: 915824380. Policy #0 lag: (min: 0.0, avg: 10.6, max: 20.0) [2024-06-28 18:36:57,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 18:37:00,535][09423] Updated weights for policy 0, policy_version 283027 (0.0036) [2024-06-28 18:37:02,921][09190] Fps is (10 sec: 39321.4, 60 sec: 42325.3, 300 sec: 42431.8). Total num frames: 4637196288. Throughput: 0: 42612.4. Samples: 916083980. Policy #0 lag: (min: 1.0, avg: 10.9, max: 20.0) [2024-06-28 18:37:02,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 18:37:04,015][09423] Updated weights for policy 0, policy_version 283037 (0.0028) [2024-06-28 18:37:07,604][09403] Signal inference workers to stop experience collection... (12650 times) [2024-06-28 18:37:07,626][09423] InferenceWorker_p0-w0: stopping experience collection (12650 times) [2024-06-28 18:37:07,718][09403] Signal inference workers to resume experience collection... (12650 times) [2024-06-28 18:37:07,718][09423] InferenceWorker_p0-w0: resuming experience collection (12650 times) [2024-06-28 18:37:07,853][09423] Updated weights for policy 0, policy_version 283047 (0.0047) [2024-06-28 18:37:07,922][09190] Fps is (10 sec: 45874.7, 60 sec: 42325.3, 300 sec: 42487.3). Total num frames: 4637442048. Throughput: 0: 42618.1. Samples: 916334540. Policy #0 lag: (min: 1.0, avg: 10.9, max: 20.0) [2024-06-28 18:37:07,922][09190] Avg episode reward: [(0, '0.735')] [2024-06-28 18:37:12,145][09423] Updated weights for policy 0, policy_version 283057 (0.0034) [2024-06-28 18:37:12,921][09190] Fps is (10 sec: 42598.4, 60 sec: 42600.2, 300 sec: 42376.2). Total num frames: 4637622272. Throughput: 0: 42507.2. Samples: 916460980. Policy #0 lag: (min: 1.0, avg: 10.9, max: 20.0) [2024-06-28 18:37:12,924][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 18:37:15,939][09423] Updated weights for policy 0, policy_version 283067 (0.0029) [2024-06-28 18:37:17,921][09190] Fps is (10 sec: 37683.6, 60 sec: 42325.3, 300 sec: 42376.2). Total num frames: 4637818880. Throughput: 0: 42551.5. Samples: 916714080. Policy #0 lag: (min: 1.0, avg: 10.9, max: 20.0) [2024-06-28 18:37:17,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 18:37:19,965][09423] Updated weights for policy 0, policy_version 283077 (0.0026) [2024-06-28 18:37:22,921][09190] Fps is (10 sec: 44236.7, 60 sec: 42325.3, 300 sec: 42431.8). Total num frames: 4638064640. Throughput: 0: 42447.1. Samples: 916960640. Policy #0 lag: (min: 1.0, avg: 10.9, max: 20.0) [2024-06-28 18:37:22,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:37:23,928][09423] Updated weights for policy 0, policy_version 283087 (0.0027) [2024-06-28 18:37:27,594][09423] Updated weights for policy 0, policy_version 283097 (0.0036) [2024-06-28 18:37:27,921][09190] Fps is (10 sec: 47513.9, 60 sec: 43144.6, 300 sec: 42542.9). Total num frames: 4638294016. Throughput: 0: 42378.4. Samples: 917098160. Policy #0 lag: (min: 1.0, avg: 10.9, max: 20.0) [2024-06-28 18:37:27,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 18:37:31,690][09423] Updated weights for policy 0, policy_version 283107 (0.0035) [2024-06-28 18:37:32,924][09190] Fps is (10 sec: 40949.8, 60 sec: 42869.6, 300 sec: 42431.9). Total num frames: 4638474240. Throughput: 0: 42535.1. Samples: 917353460. Policy #0 lag: (min: 1.0, avg: 10.9, max: 20.0) [2024-06-28 18:37:32,925][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 18:37:34,957][09423] Updated weights for policy 0, policy_version 283117 (0.0042) [2024-06-28 18:37:37,921][09190] Fps is (10 sec: 39321.5, 60 sec: 42325.4, 300 sec: 42376.3). Total num frames: 4638687232. Throughput: 0: 42652.0. Samples: 917605060. Policy #0 lag: (min: 1.0, avg: 10.9, max: 20.0) [2024-06-28 18:37:37,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 18:37:39,656][09423] Updated weights for policy 0, policy_version 283127 (0.0028) [2024-06-28 18:37:42,814][09423] Updated weights for policy 0, policy_version 283137 (0.0032) [2024-06-28 18:37:42,921][09190] Fps is (10 sec: 44248.2, 60 sec: 42325.4, 300 sec: 42487.3). Total num frames: 4638916608. Throughput: 0: 42454.7. Samples: 917734840. Policy #0 lag: (min: 1.0, avg: 10.9, max: 20.0) [2024-06-28 18:37:42,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 18:37:47,295][09423] Updated weights for policy 0, policy_version 283147 (0.0038) [2024-06-28 18:37:47,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42598.6, 300 sec: 42376.3). Total num frames: 4639096832. Throughput: 0: 42326.3. Samples: 917988660. Policy #0 lag: (min: 1.0, avg: 10.9, max: 20.0) [2024-06-28 18:37:47,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 18:37:50,574][09423] Updated weights for policy 0, policy_version 283157 (0.0031) [2024-06-28 18:37:52,921][09190] Fps is (10 sec: 42598.2, 60 sec: 42325.3, 300 sec: 42431.8). Total num frames: 4639342592. Throughput: 0: 42163.7. Samples: 918231900. Policy #0 lag: (min: 1.0, avg: 10.9, max: 20.0) [2024-06-28 18:37:52,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 18:37:54,862][09423] Updated weights for policy 0, policy_version 283167 (0.0025) [2024-06-28 18:37:57,921][09190] Fps is (10 sec: 45875.5, 60 sec: 42871.5, 300 sec: 42487.3). Total num frames: 4639555584. Throughput: 0: 42401.9. Samples: 918369060. Policy #0 lag: (min: 1.0, avg: 10.9, max: 20.0) [2024-06-28 18:37:57,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 18:37:58,008][09423] Updated weights for policy 0, policy_version 283177 (0.0043) [2024-06-28 18:38:02,921][09190] Fps is (10 sec: 37683.4, 60 sec: 42052.3, 300 sec: 42320.7). Total num frames: 4639719424. Throughput: 0: 42250.7. Samples: 918615360. Policy #0 lag: (min: 1.0, avg: 10.9, max: 20.0) [2024-06-28 18:38:02,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 18:38:03,137][09423] Updated weights for policy 0, policy_version 283187 (0.0029) [2024-06-28 18:38:05,685][09423] Updated weights for policy 0, policy_version 283197 (0.0027) [2024-06-28 18:38:07,921][09190] Fps is (10 sec: 40959.5, 60 sec: 42052.3, 300 sec: 42376.3). Total num frames: 4639965184. Throughput: 0: 42308.9. Samples: 918864540. Policy #0 lag: (min: 1.0, avg: 10.9, max: 20.0) [2024-06-28 18:38:07,924][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 18:38:10,847][09423] Updated weights for policy 0, policy_version 283207 (0.0029) [2024-06-28 18:38:12,921][09190] Fps is (10 sec: 49151.5, 60 sec: 43144.5, 300 sec: 42542.9). Total num frames: 4640210944. Throughput: 0: 42311.9. Samples: 919002200. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2024-06-28 18:38:12,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:38:13,060][09423] Updated weights for policy 0, policy_version 283217 (0.0033) [2024-06-28 18:38:17,921][09190] Fps is (10 sec: 39322.0, 60 sec: 42325.4, 300 sec: 42320.7). Total num frames: 4640358400. Throughput: 0: 42344.6. Samples: 919258860. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2024-06-28 18:38:17,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 18:38:17,927][09190] No heartbeat for components: RolloutWorker_w20 (5436 seconds) [2024-06-28 18:38:17,980][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000283226_4640374784.pth... [2024-06-28 18:38:18,039][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000282605_4630200320.pth [2024-06-28 18:38:18,238][09423] Updated weights for policy 0, policy_version 283227 (0.0029) [2024-06-28 18:38:20,885][09423] Updated weights for policy 0, policy_version 283237 (0.0032) [2024-06-28 18:38:22,921][09190] Fps is (10 sec: 39322.1, 60 sec: 42325.4, 300 sec: 42431.8). Total num frames: 4640604160. Throughput: 0: 42342.3. Samples: 919510460. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2024-06-28 18:38:22,922][09190] Avg episode reward: [(0, '0.750')] [2024-06-28 18:38:25,876][09423] Updated weights for policy 0, policy_version 283247 (0.0030) [2024-06-28 18:38:27,921][09190] Fps is (10 sec: 49151.9, 60 sec: 42598.4, 300 sec: 42542.9). Total num frames: 4640849920. Throughput: 0: 42436.4. Samples: 919644480. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2024-06-28 18:38:27,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 18:38:28,902][09423] Updated weights for policy 0, policy_version 283257 (0.0030) [2024-06-28 18:38:32,710][09403] Signal inference workers to stop experience collection... (12700 times) [2024-06-28 18:38:32,735][09423] InferenceWorker_p0-w0: stopping experience collection (12700 times) [2024-06-28 18:38:32,766][09403] Signal inference workers to resume experience collection... (12700 times) [2024-06-28 18:38:32,766][09423] InferenceWorker_p0-w0: resuming experience collection (12700 times) [2024-06-28 18:38:32,922][09190] Fps is (10 sec: 40959.2, 60 sec: 42327.0, 300 sec: 42376.2). Total num frames: 4641013760. Throughput: 0: 42390.6. Samples: 919896240. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2024-06-28 18:38:32,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:38:33,794][09423] Updated weights for policy 0, policy_version 283267 (0.0028) [2024-06-28 18:38:36,830][09423] Updated weights for policy 0, policy_version 283277 (0.0040) [2024-06-28 18:38:37,922][09190] Fps is (10 sec: 37682.7, 60 sec: 42325.2, 300 sec: 42320.7). Total num frames: 4641226752. Throughput: 0: 42596.3. Samples: 920148740. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2024-06-28 18:38:37,922][09190] Avg episode reward: [(0, '0.728')] [2024-06-28 18:38:41,266][09423] Updated weights for policy 0, policy_version 283287 (0.0033) [2024-06-28 18:38:42,921][09190] Fps is (10 sec: 45875.8, 60 sec: 42598.4, 300 sec: 42542.9). Total num frames: 4641472512. Throughput: 0: 42631.1. Samples: 920287460. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2024-06-28 18:38:42,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 18:38:44,866][09423] Updated weights for policy 0, policy_version 283297 (0.0032) [2024-06-28 18:38:47,922][09190] Fps is (10 sec: 40959.8, 60 sec: 42325.2, 300 sec: 42265.2). Total num frames: 4641636352. Throughput: 0: 42725.1. Samples: 920538000. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2024-06-28 18:38:47,928][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 18:38:48,696][09423] Updated weights for policy 0, policy_version 283307 (0.0031) [2024-06-28 18:38:52,402][09423] Updated weights for policy 0, policy_version 283317 (0.0032) [2024-06-28 18:38:52,921][09190] Fps is (10 sec: 39321.5, 60 sec: 42052.3, 300 sec: 42320.7). Total num frames: 4641865728. Throughput: 0: 42807.6. Samples: 920790880. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2024-06-28 18:38:52,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 18:38:56,418][09423] Updated weights for policy 0, policy_version 283327 (0.0027) [2024-06-28 18:38:57,921][09190] Fps is (10 sec: 49152.6, 60 sec: 42871.4, 300 sec: 42598.4). Total num frames: 4642127872. Throughput: 0: 42740.0. Samples: 920925500. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2024-06-28 18:38:57,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 18:39:00,256][09423] Updated weights for policy 0, policy_version 283337 (0.0035) [2024-06-28 18:39:02,921][09190] Fps is (10 sec: 40959.8, 60 sec: 42598.3, 300 sec: 42265.2). Total num frames: 4642275328. Throughput: 0: 42490.6. Samples: 921170940. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2024-06-28 18:39:02,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 18:39:04,101][09423] Updated weights for policy 0, policy_version 283347 (0.0029) [2024-06-28 18:39:07,697][09423] Updated weights for policy 0, policy_version 283357 (0.0036) [2024-06-28 18:39:07,922][09190] Fps is (10 sec: 39321.1, 60 sec: 42598.3, 300 sec: 42431.8). Total num frames: 4642521088. Throughput: 0: 42483.3. Samples: 921422220. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2024-06-28 18:39:07,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 18:39:11,802][09423] Updated weights for policy 0, policy_version 283367 (0.0034) [2024-06-28 18:39:12,922][09190] Fps is (10 sec: 47513.3, 60 sec: 42325.3, 300 sec: 42542.9). Total num frames: 4642750464. Throughput: 0: 42436.3. Samples: 921554120. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2024-06-28 18:39:12,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 18:39:15,626][09423] Updated weights for policy 0, policy_version 283377 (0.0032) [2024-06-28 18:39:17,922][09190] Fps is (10 sec: 39321.7, 60 sec: 42598.3, 300 sec: 42265.2). Total num frames: 4642914304. Throughput: 0: 42552.4. Samples: 921811100. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2024-06-28 18:39:17,922][09190] Avg episode reward: [(0, '0.751')] [2024-06-28 18:39:19,347][09423] Updated weights for policy 0, policy_version 283387 (0.0033) [2024-06-28 18:39:22,921][09190] Fps is (10 sec: 40960.6, 60 sec: 42598.4, 300 sec: 42376.2). Total num frames: 4643160064. Throughput: 0: 42550.8. Samples: 922063520. Policy #0 lag: (min: 0.0, avg: 7.8, max: 20.0) [2024-06-28 18:39:22,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 18:39:23,302][09423] Updated weights for policy 0, policy_version 283397 (0.0037) [2024-06-28 18:39:27,119][09423] Updated weights for policy 0, policy_version 283407 (0.0028) [2024-06-28 18:39:27,921][09190] Fps is (10 sec: 47514.2, 60 sec: 42325.3, 300 sec: 42598.4). Total num frames: 4643389440. Throughput: 0: 42310.2. Samples: 922191420. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 18:39:27,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 18:39:31,061][09423] Updated weights for policy 0, policy_version 283417 (0.0028) [2024-06-28 18:39:32,921][09190] Fps is (10 sec: 39321.5, 60 sec: 42325.4, 300 sec: 42320.7). Total num frames: 4643553280. Throughput: 0: 42439.7. Samples: 922447780. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 18:39:32,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:39:34,552][09423] Updated weights for policy 0, policy_version 283427 (0.0032) [2024-06-28 18:39:37,921][09190] Fps is (10 sec: 37683.4, 60 sec: 42325.4, 300 sec: 42320.7). Total num frames: 4643766272. Throughput: 0: 42491.6. Samples: 922703000. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 18:39:37,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 18:39:38,747][09423] Updated weights for policy 0, policy_version 283437 (0.0030) [2024-06-28 18:39:42,134][09423] Updated weights for policy 0, policy_version 283447 (0.0043) [2024-06-28 18:39:42,399][09403] Signal inference workers to stop experience collection... (12750 times) [2024-06-28 18:39:42,448][09423] InferenceWorker_p0-w0: stopping experience collection (12750 times) [2024-06-28 18:39:42,521][09403] Signal inference workers to resume experience collection... (12750 times) [2024-06-28 18:39:42,521][09423] InferenceWorker_p0-w0: resuming experience collection (12750 times) [2024-06-28 18:39:42,922][09190] Fps is (10 sec: 49151.2, 60 sec: 42871.3, 300 sec: 42709.5). Total num frames: 4644044800. Throughput: 0: 42326.6. Samples: 922830200. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 18:39:42,931][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 18:39:47,457][09423] Updated weights for policy 0, policy_version 283457 (0.0033) [2024-06-28 18:39:47,921][09190] Fps is (10 sec: 40960.1, 60 sec: 42325.5, 300 sec: 42209.6). Total num frames: 4644175872. Throughput: 0: 42569.9. Samples: 923086580. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 18:39:47,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 18:39:49,662][09423] Updated weights for policy 0, policy_version 283467 (0.0040) [2024-06-28 18:39:52,921][09190] Fps is (10 sec: 37683.9, 60 sec: 42598.4, 300 sec: 42431.8). Total num frames: 4644421632. Throughput: 0: 42479.7. Samples: 923333800. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 18:39:52,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 18:39:54,860][09423] Updated weights for policy 0, policy_version 283477 (0.0029) [2024-06-28 18:39:57,474][09423] Updated weights for policy 0, policy_version 283487 (0.0024) [2024-06-28 18:39:57,922][09190] Fps is (10 sec: 49150.9, 60 sec: 42325.3, 300 sec: 42598.4). Total num frames: 4644667392. Throughput: 0: 42468.4. Samples: 923465200. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 18:39:57,922][09190] Avg episode reward: [(0, '0.747')] [2024-06-28 18:40:02,429][09423] Updated weights for policy 0, policy_version 283497 (0.0036) [2024-06-28 18:40:02,921][09190] Fps is (10 sec: 40960.2, 60 sec: 42598.5, 300 sec: 42265.2). Total num frames: 4644831232. Throughput: 0: 42354.4. Samples: 923717040. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 18:40:02,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 18:40:05,049][09423] Updated weights for policy 0, policy_version 283507 (0.0029) [2024-06-28 18:40:07,922][09190] Fps is (10 sec: 39321.7, 60 sec: 42325.4, 300 sec: 42431.8). Total num frames: 4645060608. Throughput: 0: 42448.7. Samples: 923973720. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 18:40:07,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 18:40:10,229][09423] Updated weights for policy 0, policy_version 283517 (0.0033) [2024-06-28 18:40:12,520][09423] Updated weights for policy 0, policy_version 283527 (0.0034) [2024-06-28 18:40:12,924][09190] Fps is (10 sec: 47501.5, 60 sec: 42596.7, 300 sec: 42598.1). Total num frames: 4645306368. Throughput: 0: 42515.0. Samples: 924104700. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 18:40:12,924][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 18:40:17,868][09423] Updated weights for policy 0, policy_version 283537 (0.0041) [2024-06-28 18:40:17,921][09190] Fps is (10 sec: 40960.3, 60 sec: 42598.5, 300 sec: 42320.7). Total num frames: 4645470208. Throughput: 0: 42503.9. Samples: 924360460. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 18:40:17,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 18:40:18,042][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000283538_4645486592.pth... [2024-06-28 18:40:18,093][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000282916_4635295744.pth [2024-06-28 18:40:20,181][09423] Updated weights for policy 0, policy_version 283547 (0.0031) [2024-06-28 18:40:22,921][09190] Fps is (10 sec: 39331.4, 60 sec: 42325.3, 300 sec: 42431.8). Total num frames: 4645699584. Throughput: 0: 42426.6. Samples: 924612200. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 18:40:22,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 18:40:25,570][09423] Updated weights for policy 0, policy_version 283557 (0.0045) [2024-06-28 18:40:27,921][09190] Fps is (10 sec: 45875.1, 60 sec: 42325.3, 300 sec: 42598.4). Total num frames: 4645928960. Throughput: 0: 42506.7. Samples: 924743000. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 18:40:27,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 18:40:29,046][09423] Updated weights for policy 0, policy_version 283567 (0.0027) [2024-06-28 18:40:32,921][09190] Fps is (10 sec: 39321.6, 60 sec: 42325.3, 300 sec: 42265.2). Total num frames: 4646092800. Throughput: 0: 42475.9. Samples: 924998000. Policy #0 lag: (min: 0.0, avg: 8.9, max: 20.0) [2024-06-28 18:40:32,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 18:40:33,118][09423] Updated weights for policy 0, policy_version 283577 (0.0035) [2024-06-28 18:40:36,633][09423] Updated weights for policy 0, policy_version 283587 (0.0034) [2024-06-28 18:40:37,922][09190] Fps is (10 sec: 39321.2, 60 sec: 42598.3, 300 sec: 42376.2). Total num frames: 4646322176. Throughput: 0: 42646.0. Samples: 925252880. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 18:40:37,931][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 18:40:40,876][09423] Updated weights for policy 0, policy_version 283597 (0.0033) [2024-06-28 18:40:42,922][09190] Fps is (10 sec: 49151.4, 60 sec: 42325.4, 300 sec: 42598.4). Total num frames: 4646584320. Throughput: 0: 42561.4. Samples: 925380460. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 18:40:42,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 18:40:44,200][09423] Updated weights for policy 0, policy_version 283607 (0.0046) [2024-06-28 18:40:47,785][09403] Signal inference workers to stop experience collection... (12800 times) [2024-06-28 18:40:47,841][09423] InferenceWorker_p0-w0: stopping experience collection (12800 times) [2024-06-28 18:40:47,842][09403] Signal inference workers to resume experience collection... (12800 times) [2024-06-28 18:40:47,859][09423] InferenceWorker_p0-w0: resuming experience collection (12800 times) [2024-06-28 18:40:47,922][09190] Fps is (10 sec: 40960.1, 60 sec: 42598.3, 300 sec: 42320.7). Total num frames: 4646731776. Throughput: 0: 42530.5. Samples: 925630920. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 18:40:47,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 18:40:48,337][09423] Updated weights for policy 0, policy_version 283617 (0.0030) [2024-06-28 18:40:52,163][09423] Updated weights for policy 0, policy_version 283627 (0.0032) [2024-06-28 18:40:52,921][09190] Fps is (10 sec: 36045.2, 60 sec: 42052.2, 300 sec: 42321.1). Total num frames: 4646944768. Throughput: 0: 42568.1. Samples: 925889280. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 18:40:52,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 18:40:55,778][09423] Updated weights for policy 0, policy_version 283637 (0.0027) [2024-06-28 18:40:57,921][09190] Fps is (10 sec: 47514.4, 60 sec: 42325.5, 300 sec: 42542.9). Total num frames: 4647206912. Throughput: 0: 42359.7. Samples: 926010780. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 18:40:57,922][09190] Avg episode reward: [(0, '0.832')] [2024-06-28 18:40:59,814][09423] Updated weights for policy 0, policy_version 283647 (0.0039) [2024-06-28 18:41:02,921][09190] Fps is (10 sec: 45875.7, 60 sec: 42871.5, 300 sec: 42376.3). Total num frames: 4647403520. Throughput: 0: 42478.8. Samples: 926272000. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 18:41:02,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 18:41:03,526][09423] Updated weights for policy 0, policy_version 283657 (0.0033) [2024-06-28 18:41:07,547][09423] Updated weights for policy 0, policy_version 283667 (0.0029) [2024-06-28 18:41:07,921][09190] Fps is (10 sec: 40959.7, 60 sec: 42598.5, 300 sec: 42543.2). Total num frames: 4647616512. Throughput: 0: 42447.1. Samples: 926522320. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 18:41:07,922][09190] Avg episode reward: [(0, '0.752')] [2024-06-28 18:41:11,102][09423] Updated weights for policy 0, policy_version 283677 (0.0033) [2024-06-28 18:41:12,921][09190] Fps is (10 sec: 44236.4, 60 sec: 42327.1, 300 sec: 42598.4). Total num frames: 4647845888. Throughput: 0: 42379.6. Samples: 926650080. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 18:41:12,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 18:41:15,390][09423] Updated weights for policy 0, policy_version 283687 (0.0035) [2024-06-28 18:41:17,921][09190] Fps is (10 sec: 40959.8, 60 sec: 42598.4, 300 sec: 42376.2). Total num frames: 4648026112. Throughput: 0: 42456.8. Samples: 926908560. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 18:41:17,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 18:41:17,935][09190] No heartbeat for components: RolloutWorker_w20 (5616 seconds) [2024-06-28 18:41:18,872][09423] Updated weights for policy 0, policy_version 283697 (0.0041) [2024-06-28 18:41:22,921][09190] Fps is (10 sec: 39321.7, 60 sec: 42325.4, 300 sec: 42487.3). Total num frames: 4648239104. Throughput: 0: 42612.2. Samples: 927170420. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 18:41:22,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 18:41:23,162][09423] Updated weights for policy 0, policy_version 283707 (0.0032) [2024-06-28 18:41:26,319][09423] Updated weights for policy 0, policy_version 283717 (0.0028) [2024-06-28 18:41:27,921][09190] Fps is (10 sec: 47514.0, 60 sec: 42871.5, 300 sec: 42709.5). Total num frames: 4648501248. Throughput: 0: 42510.3. Samples: 927293420. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 18:41:27,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 18:41:30,580][09423] Updated weights for policy 0, policy_version 283727 (0.0032) [2024-06-28 18:41:32,921][09190] Fps is (10 sec: 44236.5, 60 sec: 43144.5, 300 sec: 42487.3). Total num frames: 4648681472. Throughput: 0: 42749.4. Samples: 927554640. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 18:41:32,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:41:33,851][09423] Updated weights for policy 0, policy_version 283737 (0.0030) [2024-06-28 18:41:37,921][09190] Fps is (10 sec: 37683.0, 60 sec: 42598.5, 300 sec: 42376.2). Total num frames: 4648878080. Throughput: 0: 42668.0. Samples: 927809340. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 18:41:37,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:41:38,286][09423] Updated weights for policy 0, policy_version 283747 (0.0033) [2024-06-28 18:41:41,770][09423] Updated weights for policy 0, policy_version 283757 (0.0036) [2024-06-28 18:41:42,922][09190] Fps is (10 sec: 44236.0, 60 sec: 42325.3, 300 sec: 42653.9). Total num frames: 4649123840. Throughput: 0: 42631.8. Samples: 927929220. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 18:41:42,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:41:45,959][09423] Updated weights for policy 0, policy_version 283767 (0.0035) [2024-06-28 18:41:47,922][09190] Fps is (10 sec: 44236.4, 60 sec: 43144.5, 300 sec: 42431.8). Total num frames: 4649320448. Throughput: 0: 42504.7. Samples: 928184720. Policy #0 lag: (min: 0.0, avg: 9.7, max: 20.0) [2024-06-28 18:41:47,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 18:41:49,488][09423] Updated weights for policy 0, policy_version 283777 (0.0036) [2024-06-28 18:41:52,922][09190] Fps is (10 sec: 39321.9, 60 sec: 42871.4, 300 sec: 42487.3). Total num frames: 4649517056. Throughput: 0: 42602.1. Samples: 928439420. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 18:41:52,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 18:41:53,696][09423] Updated weights for policy 0, policy_version 283787 (0.0030) [2024-06-28 18:41:57,045][09423] Updated weights for policy 0, policy_version 283797 (0.0030) [2024-06-28 18:41:57,921][09190] Fps is (10 sec: 42599.0, 60 sec: 42325.3, 300 sec: 42542.9). Total num frames: 4649746432. Throughput: 0: 42478.2. Samples: 928561600. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 18:41:57,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 18:42:01,498][09423] Updated weights for policy 0, policy_version 283807 (0.0027) [2024-06-28 18:42:02,921][09190] Fps is (10 sec: 40960.8, 60 sec: 42052.3, 300 sec: 42320.7). Total num frames: 4649926656. Throughput: 0: 42431.7. Samples: 928817980. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 18:42:02,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 18:42:04,775][09423] Updated weights for policy 0, policy_version 283817 (0.0030) [2024-06-28 18:42:07,921][09190] Fps is (10 sec: 40960.4, 60 sec: 42325.4, 300 sec: 42487.3). Total num frames: 4650156032. Throughput: 0: 42369.4. Samples: 929077040. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 18:42:07,922][09190] Avg episode reward: [(0, '0.733')] [2024-06-28 18:42:09,417][09423] Updated weights for policy 0, policy_version 283827 (0.0033) [2024-06-28 18:42:12,481][09423] Updated weights for policy 0, policy_version 283837 (0.0024) [2024-06-28 18:42:12,921][09190] Fps is (10 sec: 45874.8, 60 sec: 42325.3, 300 sec: 42598.4). Total num frames: 4650385408. Throughput: 0: 42462.2. Samples: 929204220. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 18:42:12,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 18:42:16,896][09423] Updated weights for policy 0, policy_version 283847 (0.0042) [2024-06-28 18:42:17,670][09403] Signal inference workers to stop experience collection... (12850 times) [2024-06-28 18:42:17,670][09403] Signal inference workers to resume experience collection... (12850 times) [2024-06-28 18:42:17,689][09423] InferenceWorker_p0-w0: stopping experience collection (12850 times) [2024-06-28 18:42:17,689][09423] InferenceWorker_p0-w0: resuming experience collection (12850 times) [2024-06-28 18:42:17,924][09190] Fps is (10 sec: 44225.6, 60 sec: 42869.8, 300 sec: 42487.0). Total num frames: 4650598400. Throughput: 0: 42297.7. Samples: 929458140. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 18:42:17,924][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 18:42:18,041][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000283851_4650614784.pth... [2024-06-28 18:42:18,087][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000283226_4640374784.pth [2024-06-28 18:42:20,352][09423] Updated weights for policy 0, policy_version 283857 (0.0033) [2024-06-28 18:42:22,922][09190] Fps is (10 sec: 40959.5, 60 sec: 42598.3, 300 sec: 42376.2). Total num frames: 4650795008. Throughput: 0: 42324.8. Samples: 929713960. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 18:42:22,922][09190] Avg episode reward: [(0, '0.749')] [2024-06-28 18:42:24,528][09423] Updated weights for policy 0, policy_version 283867 (0.0028) [2024-06-28 18:42:27,921][09190] Fps is (10 sec: 40970.3, 60 sec: 41779.2, 300 sec: 42487.7). Total num frames: 4651008000. Throughput: 0: 42376.2. Samples: 929836140. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 18:42:27,922][09190] Avg episode reward: [(0, '0.742')] [2024-06-28 18:42:28,102][09423] Updated weights for policy 0, policy_version 283877 (0.0031) [2024-06-28 18:42:32,355][09423] Updated weights for policy 0, policy_version 283887 (0.0027) [2024-06-28 18:42:32,921][09190] Fps is (10 sec: 42598.9, 60 sec: 42325.4, 300 sec: 42487.3). Total num frames: 4651220992. Throughput: 0: 42448.1. Samples: 930094880. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 18:42:32,922][09190] Avg episode reward: [(0, '0.741')] [2024-06-28 18:42:35,702][09423] Updated weights for policy 0, policy_version 283897 (0.0033) [2024-06-28 18:42:37,922][09190] Fps is (10 sec: 40959.3, 60 sec: 42325.3, 300 sec: 42376.2). Total num frames: 4651417600. Throughput: 0: 42361.8. Samples: 930345700. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 18:42:37,922][09190] Avg episode reward: [(0, '0.738')] [2024-06-28 18:42:40,214][09423] Updated weights for policy 0, policy_version 283907 (0.0031) [2024-06-28 18:42:42,921][09190] Fps is (10 sec: 44236.8, 60 sec: 42325.5, 300 sec: 42598.4). Total num frames: 4651663360. Throughput: 0: 42452.0. Samples: 930471940. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 18:42:42,922][09190] Avg episode reward: [(0, '0.744')] [2024-06-28 18:42:43,455][09423] Updated weights for policy 0, policy_version 283917 (0.0029) [2024-06-28 18:42:47,893][09423] Updated weights for policy 0, policy_version 283927 (0.0034) [2024-06-28 18:42:47,923][09190] Fps is (10 sec: 44229.9, 60 sec: 42324.3, 300 sec: 42431.5). Total num frames: 4651859968. Throughput: 0: 42461.9. Samples: 930728840. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 18:42:47,923][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 18:42:51,983][09423] Updated weights for policy 0, policy_version 283937 (0.0031) [2024-06-28 18:42:52,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42598.5, 300 sec: 42431.8). Total num frames: 4652072960. Throughput: 0: 42253.7. Samples: 930978460. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 18:42:52,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 18:42:55,368][09423] Updated weights for policy 0, policy_version 283947 (0.0032) [2024-06-28 18:42:57,921][09190] Fps is (10 sec: 42605.5, 60 sec: 42325.3, 300 sec: 42598.4). Total num frames: 4652285952. Throughput: 0: 42187.6. Samples: 931102660. Policy #0 lag: (min: 0.0, avg: 10.7, max: 20.0) [2024-06-28 18:42:57,922][09190] Avg episode reward: [(0, '0.743')] [2024-06-28 18:42:59,653][09423] Updated weights for policy 0, policy_version 283957 (0.0033) [2024-06-28 18:43:02,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42598.3, 300 sec: 42431.8). Total num frames: 4652482560. Throughput: 0: 42237.8. Samples: 931358740. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 18:43:02,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 18:43:03,326][09423] Updated weights for policy 0, policy_version 283967 (0.0030) [2024-06-28 18:43:07,477][09423] Updated weights for policy 0, policy_version 283977 (0.0039) [2024-06-28 18:43:07,921][09190] Fps is (10 sec: 40959.9, 60 sec: 42325.3, 300 sec: 42320.7). Total num frames: 4652695552. Throughput: 0: 42234.3. Samples: 931614500. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 18:43:07,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 18:43:10,856][09423] Updated weights for policy 0, policy_version 283987 (0.0032) [2024-06-28 18:43:12,921][09190] Fps is (10 sec: 45875.0, 60 sec: 42598.4, 300 sec: 42653.9). Total num frames: 4652941312. Throughput: 0: 42316.8. Samples: 931740400. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 18:43:12,922][09190] Avg episode reward: [(0, '0.738')] [2024-06-28 18:43:14,781][09423] Updated weights for policy 0, policy_version 283997 (0.0034) [2024-06-28 18:43:17,922][09190] Fps is (10 sec: 42598.0, 60 sec: 42053.9, 300 sec: 42431.8). Total num frames: 4653121536. Throughput: 0: 42283.9. Samples: 931997660. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 18:43:17,922][09190] Avg episode reward: [(0, '0.745')] [2024-06-28 18:43:18,464][09423] Updated weights for policy 0, policy_version 284007 (0.0027) [2024-06-28 18:43:22,520][09423] Updated weights for policy 0, policy_version 284017 (0.0046) [2024-06-28 18:43:22,921][09190] Fps is (10 sec: 39321.9, 60 sec: 42325.4, 300 sec: 42320.7). Total num frames: 4653334528. Throughput: 0: 42429.0. Samples: 932255000. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 18:43:22,922][09190] Avg episode reward: [(0, '0.734')] [2024-06-28 18:43:24,744][09403] Signal inference workers to stop experience collection... (12900 times) [2024-06-28 18:43:24,781][09423] InferenceWorker_p0-w0: stopping experience collection (12900 times) [2024-06-28 18:43:24,801][09403] Signal inference workers to resume experience collection... (12900 times) [2024-06-28 18:43:24,801][09423] InferenceWorker_p0-w0: resuming experience collection (12900 times) [2024-06-28 18:43:26,471][09423] Updated weights for policy 0, policy_version 284027 (0.0033) [2024-06-28 18:43:27,921][09190] Fps is (10 sec: 44237.6, 60 sec: 42598.4, 300 sec: 42542.9). Total num frames: 4653563904. Throughput: 0: 42415.6. Samples: 932380640. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 18:43:27,922][09190] Avg episode reward: [(0, '0.746')] [2024-06-28 18:43:30,303][09423] Updated weights for policy 0, policy_version 284037 (0.0037) [2024-06-28 18:43:32,922][09190] Fps is (10 sec: 42598.0, 60 sec: 42325.3, 300 sec: 42487.3). Total num frames: 4653760512. Throughput: 0: 42333.5. Samples: 932633780. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 18:43:32,922][09190] Avg episode reward: [(0, '0.748')] [2024-06-28 18:43:34,155][09423] Updated weights for policy 0, policy_version 284047 (0.0037) [2024-06-28 18:43:37,921][09190] Fps is (10 sec: 40960.0, 60 sec: 42598.5, 300 sec: 42376.3). Total num frames: 4653973504. Throughput: 0: 42499.6. Samples: 932890940. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 18:43:37,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 18:43:37,958][09423] Updated weights for policy 0, policy_version 284057 (0.0037) [2024-06-28 18:43:42,038][09423] Updated weights for policy 0, policy_version 284067 (0.0034) [2024-06-28 18:43:42,921][09190] Fps is (10 sec: 44237.2, 60 sec: 42325.3, 300 sec: 42598.4). Total num frames: 4654202880. Throughput: 0: 42540.9. Samples: 933017000. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 18:43:42,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 18:43:46,134][09423] Updated weights for policy 0, policy_version 284077 (0.0041) [2024-06-28 18:43:47,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42326.5, 300 sec: 42487.3). Total num frames: 4654399488. Throughput: 0: 42478.7. Samples: 933270280. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 18:43:47,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 18:43:49,534][09423] Updated weights for policy 0, policy_version 284087 (0.0040) [2024-06-28 18:43:52,921][09190] Fps is (10 sec: 40959.8, 60 sec: 42325.3, 300 sec: 42320.7). Total num frames: 4654612480. Throughput: 0: 42394.2. Samples: 933522240. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 18:43:52,922][09190] Avg episode reward: [(0, '0.737')] [2024-06-28 18:43:53,717][09423] Updated weights for policy 0, policy_version 284097 (0.0029) [2024-06-28 18:43:57,282][09423] Updated weights for policy 0, policy_version 284107 (0.0031) [2024-06-28 18:43:57,921][09190] Fps is (10 sec: 42598.2, 60 sec: 42325.3, 300 sec: 42542.9). Total num frames: 4654825472. Throughput: 0: 42413.8. Samples: 933649020. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 18:43:57,922][09190] Avg episode reward: [(0, '0.734')] [2024-06-28 18:44:01,226][09423] Updated weights for policy 0, policy_version 284117 (0.0028) [2024-06-28 18:44:02,921][09190] Fps is (10 sec: 42599.0, 60 sec: 42598.5, 300 sec: 42431.8). Total num frames: 4655038464. Throughput: 0: 42461.1. Samples: 933908400. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 18:44:02,922][09190] Avg episode reward: [(0, '0.810')] [2024-06-28 18:44:04,967][09423] Updated weights for policy 0, policy_version 284127 (0.0038) [2024-06-28 18:44:07,921][09190] Fps is (10 sec: 40959.7, 60 sec: 42325.3, 300 sec: 42320.7). Total num frames: 4655235072. Throughput: 0: 42398.6. Samples: 934162940. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 18:44:07,922][09190] Avg episode reward: [(0, '0.736')] [2024-06-28 18:44:09,087][09423] Updated weights for policy 0, policy_version 284137 (0.0039) [2024-06-28 18:44:12,636][09423] Updated weights for policy 0, policy_version 284147 (0.0031) [2024-06-28 18:44:12,921][09190] Fps is (10 sec: 42598.0, 60 sec: 42052.3, 300 sec: 42542.9). Total num frames: 4655464448. Throughput: 0: 42355.5. Samples: 934286640. Policy #0 lag: (min: 0.0, avg: 10.6, max: 22.0) [2024-06-28 18:44:12,922][09190] Avg episode reward: [(0, '0.740')] [2024-06-28 18:44:16,908][09423] Updated weights for policy 0, policy_version 284157 (0.0026) [2024-06-28 18:44:17,921][09190] Fps is (10 sec: 44237.2, 60 sec: 42598.5, 300 sec: 42431.8). Total num frames: 4655677440. Throughput: 0: 42445.9. Samples: 934543840. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 18:44:17,922][09190] Avg episode reward: [(0, '0.739')] [2024-06-28 18:44:17,942][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000284160_4655677440.pth... [2024-06-28 18:44:17,943][09190] No heartbeat for components: RolloutWorker_w20 (5796 seconds) [2024-06-28 18:44:18,000][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000283538_4645486592.pth [2024-06-28 18:44:20,410][09423] Updated weights for policy 0, policy_version 284167 (0.0037) [2024-06-28 18:44:22,922][09190] Fps is (10 sec: 40959.6, 60 sec: 42325.3, 300 sec: 42320.7). Total num frames: 4655874048. Throughput: 0: 42253.6. Samples: 934792360. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 18:44:22,922][09190] Avg episode reward: [(0, '0.735')] [2024-06-28 18:44:24,460][09423] Updated weights for policy 0, policy_version 284177 (0.0030) [2024-06-28 18:44:27,921][09190] Fps is (10 sec: 42598.3, 60 sec: 42325.3, 300 sec: 42542.9). Total num frames: 4656103424. Throughput: 0: 42312.9. Samples: 934921080. Policy #0 lag: (min: 0.0, avg: 9.9, max: 20.0) [2024-06-28 18:44:27,922][09190] Avg episode reward: [(0, '0.735')] [2024-06-28 18:44:28,079][09423] Updated weights for policy 0, policy_version 284187 (0.0029) [2024-06-28 18:44:29,722][09190] Keyboard interrupt detected in the event loop EvtLoop [Runner_EvtLoop, process=main process 9190], exiting... [2024-06-28 18:44:29,722][09437] Stopping RolloutWorker_w13... [2024-06-28 18:44:29,722][09435] Stopping RolloutWorker_w12... [2024-06-28 18:44:29,722][09448] Stopping RolloutWorker_w23... [2024-06-28 18:44:29,722][09190] Runner profile tree view: main_loop: 21968.7600 [2024-06-28 18:44:29,722][09450] Stopping RolloutWorker_w25... [2024-06-28 18:44:29,722][09403] Stopping Batcher_0... [2024-06-28 18:44:29,723][09437] Loop rollout_proc13_evt_loop terminating... [2024-06-28 18:44:29,722][09436] Stopping RolloutWorker_w10... [2024-06-28 18:44:29,722][09440] Stopping RolloutWorker_w16... [2024-06-28 18:44:29,722][09439] Stopping RolloutWorker_w14... [2024-06-28 18:44:29,723][09435] Loop rollout_proc12_evt_loop terminating... [2024-06-28 18:44:29,722][09190] Collected {0: 4656168960}, FPS: 42556.8 [2024-06-28 18:44:29,722][09438] Stopping RolloutWorker_w15... [2024-06-28 18:44:29,723][09448] Loop rollout_proc23_evt_loop terminating... [2024-06-28 18:44:29,722][09442] Stopping RolloutWorker_w18... [2024-06-28 18:44:29,722][09430] Stopping RolloutWorker_w7... [2024-06-28 18:44:29,722][09432] Stopping RolloutWorker_w6... [2024-06-28 18:44:29,723][09403] Loop batcher_evt_loop terminating... [2024-06-28 18:44:29,722][09449] Stopping RolloutWorker_w26... [2024-06-28 18:44:29,722][09425] Stopping RolloutWorker_w1... [2024-06-28 18:44:29,723][09450] Loop rollout_proc25_evt_loop terminating... [2024-06-28 18:44:29,722][09453] Stopping RolloutWorker_w28... [2024-06-28 18:44:29,722][09424] Stopping RolloutWorker_w0... [2024-06-28 18:44:29,722][09441] Stopping RolloutWorker_w17... [2024-06-28 18:44:29,723][09440] Loop rollout_proc16_evt_loop terminating... [2024-06-28 18:44:29,722][09443] Stopping RolloutWorker_w19... [2024-06-28 18:44:29,723][09439] Loop rollout_proc14_evt_loop terminating... [2024-06-28 18:44:29,722][09426] Stopping RolloutWorker_w2... [2024-06-28 18:44:29,723][09455] Stopping RolloutWorker_w30... [2024-06-28 18:44:29,723][09438] Loop rollout_proc15_evt_loop terminating... [2024-06-28 18:44:29,722][09427] Stopping RolloutWorker_w3... [2024-06-28 18:44:29,723][09433] Stopping RolloutWorker_w8... [2024-06-28 18:44:29,723][09436] Loop rollout_proc10_evt_loop terminating... [2024-06-28 18:44:29,723][09446] Stopping RolloutWorker_w22... [2024-06-28 18:44:29,723][09430] Loop rollout_proc7_evt_loop terminating... [2024-06-28 18:44:29,723][09442] Loop rollout_proc18_evt_loop terminating... [2024-06-28 18:44:29,723][09431] Stopping RolloutWorker_w9... [2024-06-28 18:44:29,723][09429] Stopping RolloutWorker_w5... [2024-06-28 18:44:29,723][09428] Stopping RolloutWorker_w4... [2024-06-28 18:44:29,723][09449] Loop rollout_proc26_evt_loop terminating... [2024-06-28 18:44:29,723][09447] Stopping RolloutWorker_w24... [2024-06-28 18:44:29,723][09432] Loop rollout_proc6_evt_loop terminating... [2024-06-28 18:44:29,723][09425] Loop rollout_proc1_evt_loop terminating... [2024-06-28 18:44:29,723][09453] Loop rollout_proc28_evt_loop terminating... [2024-06-28 18:44:29,723][09424] Loop rollout_proc0_evt_loop terminating... [2024-06-28 18:44:29,723][09441] Loop rollout_proc17_evt_loop terminating... [2024-06-28 18:44:29,723][09443] Loop rollout_proc19_evt_loop terminating... [2024-06-28 18:44:29,723][09451] Stopping RolloutWorker_w29... [2024-06-28 18:44:29,723][09455] Loop rollout_proc30_evt_loop terminating... [2024-06-28 18:44:29,723][09452] Stopping RolloutWorker_w27... [2024-06-28 18:44:29,723][09454] Stopping RolloutWorker_w31... [2024-06-28 18:44:29,723][09446] Loop rollout_proc22_evt_loop terminating... [2024-06-28 18:44:29,723][09426] Loop rollout_proc2_evt_loop terminating... [2024-06-28 18:44:29,723][09433] Loop rollout_proc8_evt_loop terminating... [2024-06-28 18:44:29,723][09427] Loop rollout_proc3_evt_loop terminating... [2024-06-28 18:44:29,723][09431] Loop rollout_proc9_evt_loop terminating... [2024-06-28 18:44:29,723][09429] Loop rollout_proc5_evt_loop terminating... [2024-06-28 18:44:29,723][09428] Loop rollout_proc4_evt_loop terminating... [2024-06-28 18:44:29,723][09447] Loop rollout_proc24_evt_loop terminating... [2024-06-28 18:44:29,723][09451] Loop rollout_proc29_evt_loop terminating... [2024-06-28 18:44:29,723][09452] Loop rollout_proc27_evt_loop terminating... [2024-06-28 18:44:29,723][09454] Loop rollout_proc31_evt_loop terminating... [2024-06-28 18:44:29,728][09445] Stopping RolloutWorker_w21... [2024-06-28 18:44:29,728][09445] Loop rollout_proc21_evt_loop terminating... [2024-06-28 18:44:29,728][09434] Stopping RolloutWorker_w11... [2024-06-28 18:44:29,729][09403] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000284191_4656185344.pth... [2024-06-28 18:44:29,730][09434] Loop rollout_proc11_evt_loop terminating... [2024-06-28 18:44:29,809][09403] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000283851_4650614784.pth [2024-06-28 18:44:29,818][09423] Weights refcount: 2 0 [2024-06-28 18:44:29,822][09423] Stopping InferenceWorker_p0-w0... [2024-06-28 18:44:29,823][09423] Loop inference_proc0-0_evt_loop terminating... [2024-06-28 18:44:29,824][09403] Stopping LearnerWorker_p0... [2024-06-28 18:44:29,825][09403] Loop learner_proc0_evt_loop terminating... [2024-06-28 21:08:09,071][16330] Saving configuration to ./train_dir/sample_factory/p2.sf/config.json... [2024-06-28 21:08:09,136][16330] Rollout worker 0 uses device cpu [2024-06-28 21:08:09,137][16330] Rollout worker 1 uses device cpu [2024-06-28 21:08:09,138][16330] Rollout worker 2 uses device cpu [2024-06-28 21:08:09,138][16330] Rollout worker 3 uses device cpu [2024-06-28 21:08:09,139][16330] Rollout worker 4 uses device cpu [2024-06-28 21:08:09,139][16330] Rollout worker 5 uses device cpu [2024-06-28 21:08:09,140][16330] Rollout worker 6 uses device cpu [2024-06-28 21:08:09,140][16330] Rollout worker 7 uses device cpu [2024-06-28 21:08:09,141][16330] Rollout worker 8 uses device cpu [2024-06-28 21:08:09,141][16330] Rollout worker 9 uses device cpu [2024-06-28 21:08:09,142][16330] Rollout worker 10 uses device cpu [2024-06-28 21:08:09,142][16330] Rollout worker 11 uses device cpu [2024-06-28 21:08:09,143][16330] Rollout worker 12 uses device cpu [2024-06-28 21:08:09,143][16330] Rollout worker 13 uses device cpu [2024-06-28 21:08:09,143][16330] Rollout worker 14 uses device cpu [2024-06-28 21:08:09,143][16330] Rollout worker 15 uses device cpu [2024-06-28 21:08:09,143][16330] Rollout worker 16 uses device cpu [2024-06-28 21:08:09,143][16330] Rollout worker 17 uses device cpu [2024-06-28 21:08:09,143][16330] Rollout worker 18 uses device cpu [2024-06-28 21:08:09,143][16330] Rollout worker 19 uses device cpu [2024-06-28 21:08:09,143][16330] Rollout worker 20 uses device cpu [2024-06-28 21:08:09,144][16330] Rollout worker 21 uses device cpu [2024-06-28 21:08:09,144][16330] Rollout worker 22 uses device cpu [2024-06-28 21:08:09,144][16330] Rollout worker 23 uses device cpu [2024-06-28 21:08:09,144][16330] Rollout worker 24 uses device cpu [2024-06-28 21:08:09,144][16330] Rollout worker 25 uses device cpu [2024-06-28 21:08:09,144][16330] Rollout worker 26 uses device cpu [2024-06-28 21:08:09,144][16330] Rollout worker 27 uses device cpu [2024-06-28 21:08:09,144][16330] Rollout worker 28 uses device cpu [2024-06-28 21:08:09,144][16330] Rollout worker 29 uses device cpu [2024-06-28 21:08:09,144][16330] Rollout worker 30 uses device cpu [2024-06-28 21:08:09,144][16330] Rollout worker 31 uses device cpu [2024-06-28 21:08:09,719][16330] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2024-06-28 21:08:09,719][16330] InferenceWorker_p0-w0: min num requests: 10 [2024-06-28 21:08:09,763][16330] Starting all processes... [2024-06-28 21:08:09,763][16330] Starting process learner_proc0 [2024-06-28 21:08:10,039][16330] Starting all processes... [2024-06-28 21:08:10,042][16330] Starting process inference_proc0-0 [2024-06-28 21:08:10,043][16330] Starting process rollout_proc0 [2024-06-28 21:08:10,043][16330] Starting process rollout_proc1 [2024-06-28 21:08:10,043][16330] Starting process rollout_proc2 [2024-06-28 21:08:10,043][16330] Starting process rollout_proc3 [2024-06-28 21:08:10,043][16330] Starting process rollout_proc4 [2024-06-28 21:08:10,043][16330] Starting process rollout_proc5 [2024-06-28 21:08:10,045][16330] Starting process rollout_proc6 [2024-06-28 21:08:10,045][16330] Starting process rollout_proc7 [2024-06-28 21:08:10,046][16330] Starting process rollout_proc8 [2024-06-28 21:08:10,047][16330] Starting process rollout_proc9 [2024-06-28 21:08:10,048][16330] Starting process rollout_proc10 [2024-06-28 21:08:10,048][16330] Starting process rollout_proc11 [2024-06-28 21:08:10,048][16330] Starting process rollout_proc12 [2024-06-28 21:08:10,049][16330] Starting process rollout_proc13 [2024-06-28 21:08:10,049][16330] Starting process rollout_proc14 [2024-06-28 21:08:10,049][16330] Starting process rollout_proc15 [2024-06-28 21:08:10,049][16330] Starting process rollout_proc16 [2024-06-28 21:08:10,050][16330] Starting process rollout_proc17 [2024-06-28 21:08:10,051][16330] Starting process rollout_proc18 [2024-06-28 21:08:10,051][16330] Starting process rollout_proc19 [2024-06-28 21:08:10,053][16330] Starting process rollout_proc20 [2024-06-28 21:08:10,057][16330] Starting process rollout_proc21 [2024-06-28 21:08:10,058][16330] Starting process rollout_proc22 [2024-06-28 21:08:10,061][16330] Starting process rollout_proc23 [2024-06-28 21:08:10,062][16330] Starting process rollout_proc24 [2024-06-28 21:08:10,062][16330] Starting process rollout_proc25 [2024-06-28 21:08:10,062][16330] Starting process rollout_proc26 [2024-06-28 21:08:10,066][16330] Starting process rollout_proc27 [2024-06-28 21:08:10,068][16330] Starting process rollout_proc28 [2024-06-28 21:08:10,068][16330] Starting process rollout_proc29 [2024-06-28 21:08:10,070][16330] Starting process rollout_proc30 [2024-06-28 21:08:10,070][16330] Starting process rollout_proc31 [2024-06-28 21:08:12,152][16575] Worker 11 uses CPU cores [11] [2024-06-28 21:08:12,156][16563] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2024-06-28 21:08:12,157][16563] Set environment var CUDA_VISIBLE_DEVICES to '0' (GPU indices [0]) for inference process 0 [2024-06-28 21:08:12,172][16563] Num visible devices: 1 [2024-06-28 21:08:12,252][16568] Worker 4 uses CPU cores [4] [2024-06-28 21:08:12,273][16543] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2024-06-28 21:08:12,273][16543] Set environment var CUDA_VISIBLE_DEVICES to '0' (GPU indices [0]) for learning process 0 [2024-06-28 21:08:12,282][16543] Num visible devices: 1 [2024-06-28 21:08:12,296][16543] Setting fixed seed 0 [2024-06-28 21:08:12,297][16543] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2024-06-28 21:08:12,298][16543] Initializing actor-critic model on device cuda:0 [2024-06-28 21:08:12,300][16588] Worker 23 uses CPU cores [23] [2024-06-28 21:08:12,308][16564] Worker 0 uses CPU cores [0] [2024-06-28 21:08:12,312][16567] Worker 3 uses CPU cores [3] [2024-06-28 21:08:12,315][16583] Worker 19 uses CPU cores [19] [2024-06-28 21:08:12,344][16571] Worker 7 uses CPU cores [7] [2024-06-28 21:08:12,358][16578] Worker 14 uses CPU cores [14] [2024-06-28 21:08:12,363][16573] Worker 9 uses CPU cores [9] [2024-06-28 21:08:12,365][16565] Worker 1 uses CPU cores [1] [2024-06-28 21:08:12,400][16566] Worker 2 uses CPU cores [2] [2024-06-28 21:08:12,412][16586] Worker 22 uses CPU cores [22] [2024-06-28 21:08:12,432][16580] Worker 18 uses CPU cores [18] [2024-06-28 21:08:12,440][16585] Worker 21 uses CPU cores [21] [2024-06-28 21:08:12,442][16569] Worker 5 uses CPU cores [5] [2024-06-28 21:08:12,477][16572] Worker 8 uses CPU cores [8] [2024-06-28 21:08:12,488][16582] Worker 15 uses CPU cores [15] [2024-06-28 21:08:12,492][16587] Worker 24 uses CPU cores [24] [2024-06-28 21:08:12,508][16593] Worker 29 uses CPU cores [29] [2024-06-28 21:08:12,516][16591] Worker 27 uses CPU cores [27] [2024-06-28 21:08:12,516][16574] Worker 10 uses CPU cores [10] [2024-06-28 21:08:12,530][16570] Worker 6 uses CPU cores [6] [2024-06-28 21:08:12,536][16576] Worker 13 uses CPU cores [13] [2024-06-28 21:08:12,580][16581] Worker 17 uses CPU cores [17] [2024-06-28 21:08:12,591][16589] Worker 25 uses CPU cores [25] [2024-06-28 21:08:12,611][16590] Worker 26 uses CPU cores [26] [2024-06-28 21:08:12,650][16592] Worker 28 uses CPU cores [28] [2024-06-28 21:08:12,685][16594] Worker 30 uses CPU cores [30] [2024-06-28 21:08:12,692][16595] Worker 31 uses CPU cores [31] [2024-06-28 21:08:12,693][16577] Worker 12 uses CPU cores [12] [2024-06-28 21:08:12,694][16584] Worker 20 uses CPU cores [20] [2024-06-28 21:08:12,802][16579] Worker 16 uses CPU cores [16] [2024-06-28 21:08:13,148][16543] RunningMeanStd input shape: (11, 11) [2024-06-28 21:08:13,149][16543] RunningMeanStd input shape: (11, 11) [2024-06-28 21:08:13,149][16543] RunningMeanStd input shape: (11, 11) [2024-06-28 21:08:13,149][16543] RunningMeanStd input shape: (11, 11) [2024-06-28 21:08:13,149][16543] RunningMeanStd input shape: (11, 11) [2024-06-28 21:08:13,149][16543] RunningMeanStd input shape: (11, 11) [2024-06-28 21:08:13,149][16543] RunningMeanStd input shape: (11, 11) [2024-06-28 21:08:13,149][16543] RunningMeanStd input shape: (11, 11) [2024-06-28 21:08:13,149][16543] RunningMeanStd input shape: (11, 11) [2024-06-28 21:08:13,149][16543] RunningMeanStd input shape: (11, 11) [2024-06-28 21:08:13,149][16543] RunningMeanStd input shape: (11, 11) [2024-06-28 21:08:13,149][16543] RunningMeanStd input shape: (11, 11) [2024-06-28 21:08:13,149][16543] RunningMeanStd input shape: (11, 11) [2024-06-28 21:08:13,149][16543] RunningMeanStd input shape: (11, 11) [2024-06-28 21:08:13,149][16543] RunningMeanStd input shape: (11, 11) [2024-06-28 21:08:13,149][16543] RunningMeanStd input shape: (11, 11) [2024-06-28 21:08:13,150][16543] RunningMeanStd input shape: (11, 11) [2024-06-28 21:08:13,150][16543] RunningMeanStd input shape: (11, 11) [2024-06-28 21:08:13,150][16543] RunningMeanStd input shape: (11, 11) [2024-06-28 21:08:13,150][16543] RunningMeanStd input shape: (11, 11) [2024-06-28 21:08:13,150][16543] RunningMeanStd input shape: (11, 11) [2024-06-28 21:08:13,150][16543] RunningMeanStd input shape: (11, 11) [2024-06-28 21:08:13,150][16543] RunningMeanStd input shape: (11, 11) [2024-06-28 21:08:13,153][16543] RunningMeanStd input shape: (1,) [2024-06-28 21:08:13,153][16543] RunningMeanStd input shape: (1,) [2024-06-28 21:08:13,153][16543] RunningMeanStd input shape: (1,) [2024-06-28 21:08:13,154][16543] RunningMeanStd input shape: (1,) [2024-06-28 21:08:13,154][16543] RunningMeanStd input shape: (11, 11) [2024-06-28 21:08:13,190][16543] RunningMeanStd input shape: (1,) [2024-06-28 21:08:13,198][16543] Created Actor Critic model with architecture: [2024-06-28 21:08:13,198][16543] SampleFactoryAgentWrapper( (obs_normalizer): ObservationNormalizer() (returns_normalizer): RecursiveScriptModule(original_name=RunningMeanStdInPlace) (agent): MettaAgent( (_encoder): MultiFeatureSetEncoder( (feature_set_encoders): ModuleDict( (grid_obs): FeatureSetEncoder( (_normalizer): FeatureListNormalizer( (_norms_dict): ModuleDict( (agent): RunningMeanStdInPlace() (altar): RunningMeanStdInPlace() (clock): RunningMeanStdInPlace() (converter): RunningMeanStdInPlace() (generator): RunningMeanStdInPlace() (wall): RunningMeanStdInPlace() (agent:dir): RunningMeanStdInPlace() (agent:energy): RunningMeanStdInPlace() (agent:frozen): RunningMeanStdInPlace() (agent:hp): RunningMeanStdInPlace() (agent:id): RunningMeanStdInPlace() (agent:inv_r1): RunningMeanStdInPlace() (agent:inv_r2): RunningMeanStdInPlace() (agent:inv_r3): RunningMeanStdInPlace() (agent:shield): RunningMeanStdInPlace() (altar:hp): RunningMeanStdInPlace() (altar:state): RunningMeanStdInPlace() (converter:hp): RunningMeanStdInPlace() (converter:state): RunningMeanStdInPlace() (generator:amount): RunningMeanStdInPlace() (generator:hp): RunningMeanStdInPlace() (generator:state): RunningMeanStdInPlace() (wall:hp): RunningMeanStdInPlace() ) ) (embedding_net): Sequential( (0): Linear(in_features=125, out_features=512, bias=True) (1): ELU(alpha=1.0) (2): Linear(in_features=512, out_features=512, bias=True) (3): ELU(alpha=1.0) (4): Linear(in_features=512, out_features=512, bias=True) (5): ELU(alpha=1.0) (6): Linear(in_features=512, out_features=512, bias=True) (7): ELU(alpha=1.0) ) ) (global_vars): FeatureSetEncoder( (_normalizer): FeatureListNormalizer( (_norms_dict): ModuleDict( (_steps): RunningMeanStdInPlace() ) ) (embedding_net): Sequential( (0): Linear(in_features=5, out_features=8, bias=True) (1): ELU(alpha=1.0) (2): Linear(in_features=8, out_features=8, bias=True) (3): ELU(alpha=1.0) ) ) (last_action): FeatureSetEncoder( (_normalizer): FeatureListNormalizer( (_norms_dict): ModuleDict( (last_action_id): RunningMeanStdInPlace() (last_action_val): RunningMeanStdInPlace() ) ) (embedding_net): Sequential( (0): Linear(in_features=5, out_features=8, bias=True) (1): ELU(alpha=1.0) (2): Linear(in_features=8, out_features=8, bias=True) (3): ELU(alpha=1.0) ) ) (last_reward): FeatureSetEncoder( (_normalizer): FeatureListNormalizer( (_norms_dict): ModuleDict( (last_reward): RunningMeanStdInPlace() ) ) (embedding_net): Sequential( (0): Linear(in_features=5, out_features=8, bias=True) (1): ELU(alpha=1.0) (2): Linear(in_features=8, out_features=8, bias=True) (3): ELU(alpha=1.0) ) ) (kinship): FeatureSetEncoder( (_normalizer): FeatureListNormalizer( (_norms_dict): ModuleDict( (kinship): RunningMeanStdInPlace() ) ) (embedding_net): Sequential( (0): Linear(in_features=125, out_features=8, bias=True) (1): ELU(alpha=1.0) (2): Linear(in_features=8, out_features=8, bias=True) (3): ELU(alpha=1.0) ) ) ) (merged_encoder): Sequential( (0): Linear(in_features=544, out_features=512, bias=True) (1): ELU(alpha=1.0) (2): Linear(in_features=512, out_features=512, bias=True) (3): ELU(alpha=1.0) (4): Linear(in_features=512, out_features=512, bias=True) (5): ELU(alpha=1.0) ) ) (_decoder): Decoder( (mlp): Identity() ) (_critic_linear): Linear(in_features=512, out_features=1, bias=True) ) (_core): ModelCoreRNN( (core): GRU(512, 512) ) (_action_parameterization): ActionParameterizationDefault( (distribution_linear): Linear(in_features=512, out_features=16, bias=True) ) ) [2024-06-28 21:08:13,263][16543] Using optimizer [2024-06-28 21:08:13,450][16543] Loading state from checkpoint ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000284191_4656185344.pth... [2024-06-28 21:08:13,465][16543] Loading model from checkpoint [2024-06-28 21:08:13,467][16543] Loaded experiment state at self.train_step=284191, self.env_steps=4656185344 [2024-06-28 21:08:13,467][16543] Initialized policy 0 weights for model version 284191 [2024-06-28 21:08:13,468][16543] LearnerWorker_p0 finished initialization! [2024-06-28 21:08:13,469][16543] Using GPUs [0] for process 0 (actually maps to GPUs [0]) [2024-06-28 21:08:14,213][16563] RunningMeanStd input shape: (11, 11) [2024-06-28 21:08:14,213][16563] RunningMeanStd input shape: (11, 11) [2024-06-28 21:08:14,213][16563] RunningMeanStd input shape: (11, 11) [2024-06-28 21:08:14,213][16563] RunningMeanStd input shape: (11, 11) [2024-06-28 21:08:14,213][16563] RunningMeanStd input shape: (11, 11) [2024-06-28 21:08:14,213][16563] RunningMeanStd input shape: (11, 11) [2024-06-28 21:08:14,213][16563] RunningMeanStd input shape: (11, 11) [2024-06-28 21:08:14,213][16563] RunningMeanStd input shape: (11, 11) [2024-06-28 21:08:14,213][16563] RunningMeanStd input shape: (11, 11) [2024-06-28 21:08:14,213][16563] RunningMeanStd input shape: (11, 11) [2024-06-28 21:08:14,213][16563] RunningMeanStd input shape: (11, 11) [2024-06-28 21:08:14,213][16563] RunningMeanStd input shape: (11, 11) [2024-06-28 21:08:14,213][16563] RunningMeanStd input shape: (11, 11) [2024-06-28 21:08:14,213][16563] RunningMeanStd input shape: (11, 11) [2024-06-28 21:08:14,213][16563] RunningMeanStd input shape: (11, 11) [2024-06-28 21:08:14,213][16563] RunningMeanStd input shape: (11, 11) [2024-06-28 21:08:14,213][16563] RunningMeanStd input shape: (11, 11) [2024-06-28 21:08:14,214][16563] RunningMeanStd input shape: (11, 11) [2024-06-28 21:08:14,214][16563] RunningMeanStd input shape: (11, 11) [2024-06-28 21:08:14,214][16563] RunningMeanStd input shape: (11, 11) [2024-06-28 21:08:14,214][16563] RunningMeanStd input shape: (11, 11) [2024-06-28 21:08:14,214][16563] RunningMeanStd input shape: (11, 11) [2024-06-28 21:08:14,214][16563] RunningMeanStd input shape: (11, 11) [2024-06-28 21:08:14,217][16563] RunningMeanStd input shape: (1,) [2024-06-28 21:08:14,217][16563] RunningMeanStd input shape: (1,) [2024-06-28 21:08:14,217][16563] RunningMeanStd input shape: (1,) [2024-06-28 21:08:14,218][16563] RunningMeanStd input shape: (1,) [2024-06-28 21:08:14,218][16563] RunningMeanStd input shape: (11, 11) [2024-06-28 21:08:14,253][16563] RunningMeanStd input shape: (1,) [2024-06-28 21:08:14,278][16330] Inference worker 0-0 is ready! [2024-06-28 21:08:14,278][16330] All inference workers are ready! Signal rollout workers to start! [2024-06-28 21:08:16,709][16330] Fps is (10 sec: nan, 60 sec: nan, 300 sec: nan). Total num frames: 4656185344. Throughput: 0: nan. Samples: 0. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0) [2024-06-28 21:08:17,108][16585] Decorrelating experience for 0 frames... [2024-06-28 21:08:17,119][16580] Decorrelating experience for 0 frames... [2024-06-28 21:08:17,121][16590] Decorrelating experience for 0 frames... [2024-06-28 21:08:17,122][16586] Decorrelating experience for 0 frames... [2024-06-28 21:08:17,126][16594] Decorrelating experience for 0 frames... [2024-06-28 21:08:17,142][16581] Decorrelating experience for 0 frames... [2024-06-28 21:08:17,155][16593] Decorrelating experience for 0 frames... [2024-06-28 21:08:17,158][16587] Decorrelating experience for 0 frames... [2024-06-28 21:08:17,161][16579] Decorrelating experience for 0 frames... [2024-06-28 21:08:17,180][16583] Decorrelating experience for 0 frames... [2024-06-28 21:08:17,181][16569] Decorrelating experience for 0 frames... [2024-06-28 21:08:17,191][16592] Decorrelating experience for 0 frames... [2024-06-28 21:08:17,192][16570] Decorrelating experience for 0 frames... [2024-06-28 21:08:17,195][16577] Decorrelating experience for 0 frames... [2024-06-28 21:08:17,196][16584] Decorrelating experience for 0 frames... [2024-06-28 21:08:17,198][16565] Decorrelating experience for 0 frames... [2024-06-28 21:08:17,198][16567] Decorrelating experience for 0 frames... [2024-06-28 21:08:17,199][16572] Decorrelating experience for 0 frames... [2024-06-28 21:08:17,202][16574] Decorrelating experience for 0 frames... [2024-06-28 21:08:17,205][16573] Decorrelating experience for 0 frames... [2024-06-28 21:08:17,205][16564] Decorrelating experience for 0 frames... [2024-06-28 21:08:17,206][16575] Decorrelating experience for 0 frames... [2024-06-28 21:08:17,207][16566] Decorrelating experience for 0 frames... [2024-06-28 21:08:17,207][16571] Decorrelating experience for 0 frames... [2024-06-28 21:08:17,207][16578] Decorrelating experience for 0 frames... [2024-06-28 21:08:17,207][16576] Decorrelating experience for 0 frames... [2024-06-28 21:08:17,208][16582] Decorrelating experience for 0 frames... [2024-06-28 21:08:17,208][16568] Decorrelating experience for 0 frames... [2024-06-28 21:08:17,209][16595] Decorrelating experience for 0 frames... [2024-06-28 21:08:17,220][16588] Decorrelating experience for 0 frames... [2024-06-28 21:08:17,220][16589] Decorrelating experience for 0 frames... [2024-06-28 21:08:17,246][16591] Decorrelating experience for 0 frames... [2024-06-28 21:08:18,340][16585] Decorrelating experience for 256 frames... [2024-06-28 21:08:18,369][16580] Decorrelating experience for 256 frames... [2024-06-28 21:08:18,376][16590] Decorrelating experience for 256 frames... [2024-06-28 21:08:18,386][16586] Decorrelating experience for 256 frames... [2024-06-28 21:08:18,393][16594] Decorrelating experience for 256 frames... [2024-06-28 21:08:18,411][16581] Decorrelating experience for 256 frames... [2024-06-28 21:08:18,422][16579] Decorrelating experience for 256 frames... [2024-06-28 21:08:18,422][16593] Decorrelating experience for 256 frames... [2024-06-28 21:08:18,433][16587] Decorrelating experience for 256 frames... [2024-06-28 21:08:18,446][16583] Decorrelating experience for 256 frames... [2024-06-28 21:08:18,473][16569] Decorrelating experience for 256 frames... [2024-06-28 21:08:18,481][16570] Decorrelating experience for 256 frames... [2024-06-28 21:08:18,493][16592] Decorrelating experience for 256 frames... [2024-06-28 21:08:18,494][16584] Decorrelating experience for 256 frames... [2024-06-28 21:08:18,498][16577] Decorrelating experience for 256 frames... [2024-06-28 21:08:18,503][16572] Decorrelating experience for 256 frames... [2024-06-28 21:08:18,503][16565] Decorrelating experience for 256 frames... [2024-06-28 21:08:18,505][16567] Decorrelating experience for 256 frames... [2024-06-28 21:08:18,513][16574] Decorrelating experience for 256 frames... [2024-06-28 21:08:18,515][16564] Decorrelating experience for 256 frames... [2024-06-28 21:08:18,519][16568] Decorrelating experience for 256 frames... [2024-06-28 21:08:18,522][16566] Decorrelating experience for 256 frames... [2024-06-28 21:08:18,527][16576] Decorrelating experience for 256 frames... [2024-06-28 21:08:18,528][16578] Decorrelating experience for 256 frames... [2024-06-28 21:08:18,532][16573] Decorrelating experience for 256 frames... [2024-06-28 21:08:18,532][16575] Decorrelating experience for 256 frames... [2024-06-28 21:08:18,532][16571] Decorrelating experience for 256 frames... [2024-06-28 21:08:18,535][16582] Decorrelating experience for 256 frames... [2024-06-28 21:08:18,537][16595] Decorrelating experience for 256 frames... [2024-06-28 21:08:18,541][16588] Decorrelating experience for 256 frames... [2024-06-28 21:08:18,542][16589] Decorrelating experience for 256 frames... [2024-06-28 21:08:18,614][16591] Decorrelating experience for 256 frames... [2024-06-28 21:08:21,712][16330] Fps is (10 sec: 0.0, 60 sec: 0.0, 300 sec: 0.0). Total num frames: 4656185344. Throughput: 0: 4377.6. Samples: 21900. Policy #0 lag: (min: -1.0, avg: -1.0, max: -1.0) [2024-06-28 21:08:25,556][16566] Worker 2, sleep for 9.375 sec to decorrelate experience collection [2024-06-28 21:08:25,556][16590] Worker 26, sleep for 121.875 sec to decorrelate experience collection [2024-06-28 21:08:25,557][16585] Worker 21, sleep for 98.438 sec to decorrelate experience collection [2024-06-28 21:08:25,564][16572] Worker 8, sleep for 37.500 sec to decorrelate experience collection [2024-06-28 21:08:25,565][16580] Worker 18, sleep for 84.375 sec to decorrelate experience collection [2024-06-28 21:08:25,565][16593] Worker 29, sleep for 135.938 sec to decorrelate experience collection [2024-06-28 21:08:25,574][16567] Worker 3, sleep for 14.062 sec to decorrelate experience collection [2024-06-28 21:08:25,577][16576] Worker 13, sleep for 60.938 sec to decorrelate experience collection [2024-06-28 21:08:25,578][16594] Worker 30, sleep for 140.625 sec to decorrelate experience collection [2024-06-28 21:08:25,579][16577] Worker 12, sleep for 56.250 sec to decorrelate experience collection [2024-06-28 21:08:25,585][16573] Worker 9, sleep for 42.188 sec to decorrelate experience collection [2024-06-28 21:08:25,587][16578] Worker 14, sleep for 65.625 sec to decorrelate experience collection [2024-06-28 21:08:25,588][16581] Worker 17, sleep for 79.688 sec to decorrelate experience collection [2024-06-28 21:08:25,596][16587] Worker 24, sleep for 112.500 sec to decorrelate experience collection [2024-06-28 21:08:25,603][16583] Worker 19, sleep for 89.062 sec to decorrelate experience collection [2024-06-28 21:08:25,603][16586] Worker 22, sleep for 103.125 sec to decorrelate experience collection [2024-06-28 21:08:25,606][16565] Worker 1, sleep for 4.688 sec to decorrelate experience collection [2024-06-28 21:08:25,607][16574] Worker 10, sleep for 46.875 sec to decorrelate experience collection [2024-06-28 21:08:25,608][16575] Worker 11, sleep for 51.562 sec to decorrelate experience collection [2024-06-28 21:08:25,609][16582] Worker 15, sleep for 70.312 sec to decorrelate experience collection [2024-06-28 21:08:25,618][16592] Worker 28, sleep for 131.250 sec to decorrelate experience collection [2024-06-28 21:08:25,622][16595] Worker 31, sleep for 145.312 sec to decorrelate experience collection [2024-06-28 21:08:25,624][16579] Worker 16, sleep for 75.000 sec to decorrelate experience collection [2024-06-28 21:08:25,643][16588] Worker 23, sleep for 107.812 sec to decorrelate experience collection [2024-06-28 21:08:25,661][16589] Worker 25, sleep for 117.188 sec to decorrelate experience collection [2024-06-28 21:08:25,676][16543] Signal inference workers to stop experience collection... [2024-06-28 21:08:25,690][16584] Worker 20, sleep for 93.750 sec to decorrelate experience collection [2024-06-28 21:08:25,692][16563] InferenceWorker_p0-w0: stopping experience collection [2024-06-28 21:08:25,708][16571] Worker 7, sleep for 32.812 sec to decorrelate experience collection [2024-06-28 21:08:25,708][16569] Worker 5, sleep for 23.438 sec to decorrelate experience collection [2024-06-28 21:08:26,201][16543] Signal inference workers to resume experience collection... [2024-06-28 21:08:26,202][16563] InferenceWorker_p0-w0: resuming experience collection [2024-06-28 21:08:26,244][16591] Worker 27, sleep for 126.562 sec to decorrelate experience collection [2024-06-28 21:08:26,302][16570] Worker 6, sleep for 28.125 sec to decorrelate experience collection [2024-06-28 21:08:26,595][16568] Worker 4, sleep for 18.750 sec to decorrelate experience collection [2024-06-28 21:08:26,709][16330] Fps is (10 sec: 6553.6, 60 sec: 6553.6, 300 sec: 6553.6). Total num frames: 4656250880. Throughput: 0: 32802.0. Samples: 328020. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2024-06-28 21:08:27,388][16563] Updated weights for policy 0, policy_version 284201 (0.0013) [2024-06-28 21:08:29,717][16330] Heartbeat connected on Batcher_0 [2024-06-28 21:08:29,718][16330] Heartbeat connected on LearnerWorker_p0 [2024-06-28 21:08:29,726][16330] Heartbeat connected on RolloutWorker_w0 [2024-06-28 21:08:29,781][16330] Heartbeat connected on InferenceWorker_p0-w0 [2024-06-28 21:08:30,317][16565] Worker 1 awakens! [2024-06-28 21:08:30,326][16330] Heartbeat connected on RolloutWorker_w1 [2024-06-28 21:08:31,709][16330] Fps is (10 sec: 16388.2, 60 sec: 10922.6, 300 sec: 10922.6). Total num frames: 4656349184. Throughput: 0: 22058.5. Samples: 330880. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2024-06-28 21:08:34,978][16566] Worker 2 awakens! [2024-06-28 21:08:34,984][16330] Heartbeat connected on RolloutWorker_w2 [2024-06-28 21:08:36,709][16330] Fps is (10 sec: 11468.8, 60 sec: 9011.2, 300 sec: 9011.2). Total num frames: 4656365568. Throughput: 0: 17227.0. Samples: 344540. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2024-06-28 21:08:39,707][16567] Worker 3 awakens! [2024-06-28 21:08:39,720][16330] Heartbeat connected on RolloutWorker_w3 [2024-06-28 21:08:41,709][16330] Fps is (10 sec: 3276.8, 60 sec: 7864.3, 300 sec: 7864.3). Total num frames: 4656381952. Throughput: 0: 14738.3. Samples: 368460. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2024-06-28 21:08:45,439][16568] Worker 4 awakens! [2024-06-28 21:08:45,446][16330] Heartbeat connected on RolloutWorker_w4 [2024-06-28 21:08:46,709][16330] Fps is (10 sec: 6553.7, 60 sec: 8192.0, 300 sec: 8192.0). Total num frames: 4656431104. Throughput: 0: 12772.7. Samples: 383180. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2024-06-28 21:08:49,244][16569] Worker 5 awakens! [2024-06-28 21:08:49,248][16330] Heartbeat connected on RolloutWorker_w5 [2024-06-28 21:08:51,709][16330] Fps is (10 sec: 11469.0, 60 sec: 8894.2, 300 sec: 8894.2). Total num frames: 4656496640. Throughput: 0: 13150.3. Samples: 460260. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2024-06-28 21:08:53,100][16563] Updated weights for policy 0, policy_version 284211 (0.0016) [2024-06-28 21:08:54,524][16570] Worker 6 awakens! [2024-06-28 21:08:54,529][16330] Heartbeat connected on RolloutWorker_w6 [2024-06-28 21:08:56,709][16330] Fps is (10 sec: 13107.2, 60 sec: 9420.8, 300 sec: 9420.8). Total num frames: 4656562176. Throughput: 0: 13963.0. Samples: 558520. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2024-06-28 21:08:56,709][16330] Avg episode reward: [(0, '0.700')] [2024-06-28 21:08:58,546][16571] Worker 7 awakens! [2024-06-28 21:08:58,552][16330] Heartbeat connected on RolloutWorker_w7 [2024-06-28 21:09:01,344][16563] Updated weights for policy 0, policy_version 284221 (0.0012) [2024-06-28 21:09:01,709][16330] Fps is (10 sec: 18022.4, 60 sec: 10922.7, 300 sec: 10922.7). Total num frames: 4656676864. Throughput: 0: 13573.8. Samples: 610820. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2024-06-28 21:09:01,709][16330] Avg episode reward: [(0, '0.707')] [2024-06-28 21:09:03,164][16572] Worker 8 awakens! [2024-06-28 21:09:03,168][16330] Heartbeat connected on RolloutWorker_w8 [2024-06-28 21:09:06,709][16330] Fps is (10 sec: 22937.6, 60 sec: 12124.2, 300 sec: 12124.2). Total num frames: 4656791552. Throughput: 0: 15973.9. Samples: 740680. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2024-06-28 21:09:06,709][16330] Avg episode reward: [(0, '0.698')] [2024-06-28 21:09:07,873][16573] Worker 9 awakens! [2024-06-28 21:09:07,881][16330] Heartbeat connected on RolloutWorker_w9 [2024-06-28 21:09:08,695][16563] Updated weights for policy 0, policy_version 284231 (0.0012) [2024-06-28 21:09:11,709][16330] Fps is (10 sec: 22937.6, 60 sec: 13107.2, 300 sec: 13107.2). Total num frames: 4656906240. Throughput: 0: 12432.5. Samples: 887480. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2024-06-28 21:09:11,709][16330] Avg episode reward: [(0, '0.699')] [2024-06-28 21:09:12,496][16574] Worker 10 awakens! [2024-06-28 21:09:12,501][16330] Heartbeat connected on RolloutWorker_w10 [2024-06-28 21:09:14,290][16563] Updated weights for policy 0, policy_version 284241 (0.0017) [2024-06-28 21:09:16,709][16330] Fps is (10 sec: 27852.7, 60 sec: 14745.6, 300 sec: 14745.6). Total num frames: 4657070080. Throughput: 0: 14292.0. Samples: 974020. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2024-06-28 21:09:16,709][16330] Avg episode reward: [(0, '0.722')] [2024-06-28 21:09:17,270][16575] Worker 11 awakens! [2024-06-28 21:09:17,278][16330] Heartbeat connected on RolloutWorker_w11 [2024-06-28 21:09:19,820][16563] Updated weights for policy 0, policy_version 284251 (0.0013) [2024-06-28 21:09:21,709][16330] Fps is (10 sec: 31129.2, 60 sec: 17204.0, 300 sec: 15879.9). Total num frames: 4657217536. Throughput: 0: 18114.7. Samples: 1159700. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2024-06-28 21:09:21,710][16330] Avg episode reward: [(0, '0.709')] [2024-06-28 21:09:21,928][16577] Worker 12 awakens! [2024-06-28 21:09:21,936][16330] Heartbeat connected on RolloutWorker_w12 [2024-06-28 21:09:25,113][16563] Updated weights for policy 0, policy_version 284261 (0.0016) [2024-06-28 21:09:26,612][16576] Worker 13 awakens! [2024-06-28 21:09:26,620][16330] Heartbeat connected on RolloutWorker_w13 [2024-06-28 21:09:26,709][16330] Fps is (10 sec: 31129.3, 60 sec: 18841.6, 300 sec: 17086.1). Total num frames: 4657381376. Throughput: 0: 21797.4. Samples: 1349340. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2024-06-28 21:09:26,710][16330] Avg episode reward: [(0, '0.713')] [2024-06-28 21:09:30,146][16563] Updated weights for policy 0, policy_version 284271 (0.0019) [2024-06-28 21:09:31,313][16578] Worker 14 awakens! [2024-06-28 21:09:31,319][16330] Heartbeat connected on RolloutWorker_w14 [2024-06-28 21:09:31,709][16330] Fps is (10 sec: 32767.9, 60 sec: 19933.9, 300 sec: 18131.6). Total num frames: 4657545216. Throughput: 0: 23684.8. Samples: 1449000. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2024-06-28 21:09:31,710][16330] Avg episode reward: [(0, '0.722')] [2024-06-28 21:09:35,187][16563] Updated weights for policy 0, policy_version 284281 (0.0021) [2024-06-28 21:09:36,020][16582] Worker 15 awakens! [2024-06-28 21:09:36,028][16330] Heartbeat connected on RolloutWorker_w15 [2024-06-28 21:09:36,709][16330] Fps is (10 sec: 34406.5, 60 sec: 22664.5, 300 sec: 19251.2). Total num frames: 4657725440. Throughput: 0: 26598.1. Samples: 1657180. Policy #0 lag: (min: 0.0, avg: 0.0, max: 0.0) [2024-06-28 21:09:36,710][16330] Avg episode reward: [(0, '0.726')] [2024-06-28 21:09:39,946][16563] Updated weights for policy 0, policy_version 284291 (0.0022) [2024-06-28 21:09:40,724][16579] Worker 16 awakens! [2024-06-28 21:09:40,735][16330] Heartbeat connected on RolloutWorker_w16 [2024-06-28 21:09:41,709][16330] Fps is (10 sec: 34406.5, 60 sec: 25122.2, 300 sec: 20046.3). Total num frames: 4657889280. Throughput: 0: 28699.0. Samples: 1849980. Policy #0 lag: (min: 0.0, avg: 4.9, max: 11.0) [2024-06-28 21:09:41,710][16330] Avg episode reward: [(0, '0.730')] [2024-06-28 21:09:44,900][16563] Updated weights for policy 0, policy_version 284301 (0.0023) [2024-06-28 21:09:45,376][16581] Worker 17 awakens! [2024-06-28 21:09:45,387][16330] Heartbeat connected on RolloutWorker_w17 [2024-06-28 21:09:46,709][16330] Fps is (10 sec: 32767.9, 60 sec: 27033.5, 300 sec: 20753.0). Total num frames: 4658053120. Throughput: 0: 29893.2. Samples: 1956020. Policy #0 lag: (min: 0.0, avg: 4.9, max: 11.0) [2024-06-28 21:09:46,710][16330] Avg episode reward: [(0, '0.739')] [2024-06-28 21:09:49,677][16563] Updated weights for policy 0, policy_version 284311 (0.0026) [2024-06-28 21:09:50,040][16580] Worker 18 awakens! [2024-06-28 21:09:50,050][16330] Heartbeat connected on RolloutWorker_w18 [2024-06-28 21:09:51,709][16330] Fps is (10 sec: 34406.4, 60 sec: 28945.0, 300 sec: 21557.9). Total num frames: 4658233344. Throughput: 0: 31599.5. Samples: 2162660. Policy #0 lag: (min: 0.0, avg: 4.9, max: 11.0) [2024-06-28 21:09:51,710][16330] Avg episode reward: [(0, '0.719')] [2024-06-28 21:09:54,410][16563] Updated weights for policy 0, policy_version 284321 (0.0021) [2024-06-28 21:09:54,764][16583] Worker 19 awakens! [2024-06-28 21:09:54,775][16330] Heartbeat connected on RolloutWorker_w19 [2024-06-28 21:09:56,709][16330] Fps is (10 sec: 36045.2, 60 sec: 30856.5, 300 sec: 22282.2). Total num frames: 4658413568. Throughput: 0: 32978.6. Samples: 2371520. Policy #0 lag: (min: 0.0, avg: 4.9, max: 11.0) [2024-06-28 21:09:56,710][16330] Avg episode reward: [(0, '0.705')] [2024-06-28 21:09:59,036][16563] Updated weights for policy 0, policy_version 284331 (0.0019) [2024-06-28 21:09:59,541][16584] Worker 20 awakens! [2024-06-28 21:09:59,552][16330] Heartbeat connected on RolloutWorker_w20 [2024-06-28 21:10:01,709][16330] Fps is (10 sec: 34406.4, 60 sec: 31675.7, 300 sec: 22781.5). Total num frames: 4658577408. Throughput: 0: 33626.2. Samples: 2487200. Policy #0 lag: (min: 0.0, avg: 4.9, max: 11.0) [2024-06-28 21:10:01,710][16330] Avg episode reward: [(0, '0.723')] [2024-06-28 21:10:02,998][16563] Updated weights for policy 0, policy_version 284341 (0.0040) [2024-06-28 21:10:04,093][16585] Worker 21 awakens! [2024-06-28 21:10:04,105][16330] Heartbeat connected on RolloutWorker_w21 [2024-06-28 21:10:06,709][16330] Fps is (10 sec: 36044.8, 60 sec: 33041.0, 300 sec: 23533.4). Total num frames: 4658774016. Throughput: 0: 34419.6. Samples: 2708580. Policy #0 lag: (min: 0.0, avg: 4.9, max: 11.0) [2024-06-28 21:10:06,710][16330] Avg episode reward: [(0, '0.734')] [2024-06-28 21:10:06,720][16543] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000284349_4658774016.pth... [2024-06-28 21:10:06,767][16543] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000284160_4655677440.pth [2024-06-28 21:10:07,705][16563] Updated weights for policy 0, policy_version 284351 (0.0025) [2024-06-28 21:10:08,796][16586] Worker 22 awakens! [2024-06-28 21:10:08,808][16330] Heartbeat connected on RolloutWorker_w22 [2024-06-28 21:10:11,709][16330] Fps is (10 sec: 36044.5, 60 sec: 33860.1, 300 sec: 23934.9). Total num frames: 4658937856. Throughput: 0: 35202.2. Samples: 2933440. Policy #0 lag: (min: 0.0, avg: 4.9, max: 11.0) [2024-06-28 21:10:11,710][16330] Avg episode reward: [(0, '0.726')] [2024-06-28 21:10:12,001][16563] Updated weights for policy 0, policy_version 284361 (0.0027) [2024-06-28 21:10:13,560][16588] Worker 23 awakens! [2024-06-28 21:10:13,570][16330] Heartbeat connected on RolloutWorker_w23 [2024-06-28 21:10:15,540][16563] Updated weights for policy 0, policy_version 284371 (0.0023) [2024-06-28 21:10:16,709][16330] Fps is (10 sec: 37682.8, 60 sec: 34679.4, 300 sec: 24712.5). Total num frames: 4659150848. Throughput: 0: 35484.0. Samples: 3045780. Policy #0 lag: (min: 0.0, avg: 4.9, max: 11.0) [2024-06-28 21:10:16,710][16330] Avg episode reward: [(0, '0.722')] [2024-06-28 21:10:18,197][16587] Worker 24 awakens! [2024-06-28 21:10:18,209][16330] Heartbeat connected on RolloutWorker_w24 [2024-06-28 21:10:20,417][16563] Updated weights for policy 0, policy_version 284381 (0.0028) [2024-06-28 21:10:21,709][16330] Fps is (10 sec: 40960.6, 60 sec: 35498.7, 300 sec: 25296.9). Total num frames: 4659347456. Throughput: 0: 35996.5. Samples: 3277020. Policy #0 lag: (min: 0.0, avg: 4.9, max: 11.0) [2024-06-28 21:10:21,710][16330] Avg episode reward: [(0, '0.721')] [2024-06-28 21:10:22,948][16589] Worker 25 awakens! [2024-06-28 21:10:22,961][16330] Heartbeat connected on RolloutWorker_w25 [2024-06-28 21:10:24,782][16563] Updated weights for policy 0, policy_version 284391 (0.0023) [2024-06-28 21:10:26,709][16330] Fps is (10 sec: 39322.2, 60 sec: 36044.9, 300 sec: 25836.3). Total num frames: 4659544064. Throughput: 0: 36993.0. Samples: 3514660. Policy #0 lag: (min: 0.0, avg: 4.9, max: 11.0) [2024-06-28 21:10:26,709][16330] Avg episode reward: [(0, '0.719')] [2024-06-28 21:10:27,446][16590] Worker 26 awakens! [2024-06-28 21:10:27,459][16330] Heartbeat connected on RolloutWorker_w26 [2024-06-28 21:10:28,441][16563] Updated weights for policy 0, policy_version 284401 (0.0036) [2024-06-28 21:10:31,709][16330] Fps is (10 sec: 39321.0, 60 sec: 36590.9, 300 sec: 26335.7). Total num frames: 4659740672. Throughput: 0: 37249.7. Samples: 3632260. Policy #0 lag: (min: 0.0, avg: 4.9, max: 11.0) [2024-06-28 21:10:31,710][16330] Avg episode reward: [(0, '0.721')] [2024-06-28 21:10:32,904][16591] Worker 27 awakens! [2024-06-28 21:10:32,917][16330] Heartbeat connected on RolloutWorker_w27 [2024-06-28 21:10:33,047][16563] Updated weights for policy 0, policy_version 284411 (0.0023) [2024-06-28 21:10:36,358][16563] Updated weights for policy 0, policy_version 284421 (0.0033) [2024-06-28 21:10:36,709][16330] Fps is (10 sec: 40959.6, 60 sec: 37137.1, 300 sec: 26916.6). Total num frames: 4659953664. Throughput: 0: 37991.1. Samples: 3872260. Policy #0 lag: (min: 0.0, avg: 4.9, max: 11.0) [2024-06-28 21:10:36,717][16330] Avg episode reward: [(0, '0.749')] [2024-06-28 21:10:36,969][16592] Worker 28 awakens! [2024-06-28 21:10:36,983][16330] Heartbeat connected on RolloutWorker_w28 [2024-06-28 21:10:40,990][16563] Updated weights for policy 0, policy_version 284431 (0.0030) [2024-06-28 21:10:41,594][16593] Worker 29 awakens! [2024-06-28 21:10:41,608][16330] Heartbeat connected on RolloutWorker_w29 [2024-06-28 21:10:41,709][16330] Fps is (10 sec: 42598.6, 60 sec: 37956.2, 300 sec: 27457.3). Total num frames: 4660166656. Throughput: 0: 38909.2. Samples: 4122440. Policy #0 lag: (min: 0.0, avg: 4.9, max: 11.0) [2024-06-28 21:10:41,710][16330] Avg episode reward: [(0, '0.707')] [2024-06-28 21:10:44,927][16563] Updated weights for policy 0, policy_version 284441 (0.0028) [2024-06-28 21:10:46,263][16594] Worker 30 awakens! [2024-06-28 21:10:46,276][16330] Heartbeat connected on RolloutWorker_w30 [2024-06-28 21:10:46,709][16330] Fps is (10 sec: 37683.5, 60 sec: 37956.4, 300 sec: 27634.4). Total num frames: 4660330496. Throughput: 0: 39034.8. Samples: 4243760. Policy #0 lag: (min: 0.0, avg: 4.9, max: 11.0) [2024-06-28 21:10:46,710][16330] Avg episode reward: [(0, '0.745')] [2024-06-28 21:10:48,740][16563] Updated weights for policy 0, policy_version 284451 (0.0034) [2024-06-28 21:10:51,034][16595] Worker 31 awakens! [2024-06-28 21:10:51,049][16330] Heartbeat connected on RolloutWorker_w31 [2024-06-28 21:10:51,709][16330] Fps is (10 sec: 40960.6, 60 sec: 39048.6, 300 sec: 28328.5). Total num frames: 4660576256. Throughput: 0: 39657.4. Samples: 4493160. Policy #0 lag: (min: 0.0, avg: 4.9, max: 11.0) [2024-06-28 21:10:51,710][16330] Avg episode reward: [(0, '0.741')] [2024-06-28 21:10:53,007][16563] Updated weights for policy 0, policy_version 284461 (0.0044) [2024-06-28 21:10:56,210][16563] Updated weights for policy 0, policy_version 284471 (0.0023) [2024-06-28 21:10:56,709][16330] Fps is (10 sec: 45875.2, 60 sec: 39594.7, 300 sec: 28774.4). Total num frames: 4660789248. Throughput: 0: 40362.0. Samples: 4749720. Policy #0 lag: (min: 0.0, avg: 12.1, max: 23.0) [2024-06-28 21:10:56,709][16330] Avg episode reward: [(0, '0.743')] [2024-06-28 21:11:00,654][16563] Updated weights for policy 0, policy_version 284481 (0.0037) [2024-06-28 21:11:01,709][16330] Fps is (10 sec: 40959.1, 60 sec: 40140.7, 300 sec: 29094.0). Total num frames: 4660985856. Throughput: 0: 40750.1. Samples: 4879540. Policy #0 lag: (min: 0.0, avg: 12.1, max: 23.0) [2024-06-28 21:11:01,710][16330] Avg episode reward: [(0, '0.735')] [2024-06-28 21:11:02,803][16543] Signal inference workers to stop experience collection... (50 times) [2024-06-28 21:11:02,803][16543] Signal inference workers to resume experience collection... (50 times) [2024-06-28 21:11:02,824][16563] InferenceWorker_p0-w0: stopping experience collection (50 times) [2024-06-28 21:11:02,851][16563] InferenceWorker_p0-w0: resuming experience collection (50 times) [2024-06-28 21:11:03,737][16563] Updated weights for policy 0, policy_version 284491 (0.0042) [2024-06-28 21:11:06,709][16330] Fps is (10 sec: 40960.1, 60 sec: 40413.9, 300 sec: 29491.2). Total num frames: 4661198848. Throughput: 0: 41249.4. Samples: 5133240. Policy #0 lag: (min: 0.0, avg: 12.1, max: 23.0) [2024-06-28 21:11:06,710][16330] Avg episode reward: [(0, '0.736')] [2024-06-28 21:11:08,398][16563] Updated weights for policy 0, policy_version 284501 (0.0028) [2024-06-28 21:11:11,614][16563] Updated weights for policy 0, policy_version 284511 (0.0033) [2024-06-28 21:11:11,709][16330] Fps is (10 sec: 44237.4, 60 sec: 41506.2, 300 sec: 29959.3). Total num frames: 4661428224. Throughput: 0: 41691.0. Samples: 5390760. Policy #0 lag: (min: 0.0, avg: 12.1, max: 23.0) [2024-06-28 21:11:11,710][16330] Avg episode reward: [(0, '0.721')] [2024-06-28 21:11:15,945][16563] Updated weights for policy 0, policy_version 284521 (0.0042) [2024-06-28 21:11:16,709][16330] Fps is (10 sec: 42598.4, 60 sec: 41233.2, 300 sec: 30219.4). Total num frames: 4661624832. Throughput: 0: 41825.1. Samples: 5514380. Policy #0 lag: (min: 0.0, avg: 12.1, max: 23.0) [2024-06-28 21:11:16,709][16330] Avg episode reward: [(0, '0.744')] [2024-06-28 21:11:19,443][16563] Updated weights for policy 0, policy_version 284531 (0.0031) [2024-06-28 21:11:21,709][16330] Fps is (10 sec: 42598.7, 60 sec: 41779.2, 300 sec: 30642.5). Total num frames: 4661854208. Throughput: 0: 42132.1. Samples: 5768200. Policy #0 lag: (min: 0.0, avg: 12.1, max: 23.0) [2024-06-28 21:11:21,710][16330] Avg episode reward: [(0, '0.735')] [2024-06-28 21:11:23,431][16563] Updated weights for policy 0, policy_version 284541 (0.0025) [2024-06-28 21:11:26,709][16330] Fps is (10 sec: 40959.9, 60 sec: 41506.1, 300 sec: 30784.7). Total num frames: 4662034432. Throughput: 0: 42377.0. Samples: 6029400. Policy #0 lag: (min: 0.0, avg: 12.1, max: 23.0) [2024-06-28 21:11:26,710][16330] Avg episode reward: [(0, '0.718')] [2024-06-28 21:11:27,282][16563] Updated weights for policy 0, policy_version 284551 (0.0034) [2024-06-28 21:11:31,574][16563] Updated weights for policy 0, policy_version 284561 (0.0033) [2024-06-28 21:11:31,709][16330] Fps is (10 sec: 39321.8, 60 sec: 41779.4, 300 sec: 31087.6). Total num frames: 4662247424. Throughput: 0: 42341.4. Samples: 6149120. Policy #0 lag: (min: 0.0, avg: 12.1, max: 23.0) [2024-06-28 21:11:31,709][16330] Avg episode reward: [(0, '0.743')] [2024-06-28 21:11:34,863][16563] Updated weights for policy 0, policy_version 284571 (0.0027) [2024-06-28 21:11:36,709][16330] Fps is (10 sec: 45874.5, 60 sec: 42325.3, 300 sec: 31539.2). Total num frames: 4662493184. Throughput: 0: 42583.9. Samples: 6409440. Policy #0 lag: (min: 0.0, avg: 12.1, max: 23.0) [2024-06-28 21:11:36,710][16330] Avg episode reward: [(0, '0.744')] [2024-06-28 21:11:39,150][16563] Updated weights for policy 0, policy_version 284581 (0.0035) [2024-06-28 21:11:41,712][16330] Fps is (10 sec: 42586.6, 60 sec: 41777.4, 300 sec: 31648.7). Total num frames: 4662673408. Throughput: 0: 42613.0. Samples: 6667420. Policy #0 lag: (min: 0.0, avg: 12.1, max: 23.0) [2024-06-28 21:11:41,713][16330] Avg episode reward: [(0, '0.741')] [2024-06-28 21:11:42,435][16563] Updated weights for policy 0, policy_version 284591 (0.0034) [2024-06-28 21:11:46,709][16330] Fps is (10 sec: 39322.2, 60 sec: 42598.4, 300 sec: 31909.8). Total num frames: 4662886400. Throughput: 0: 42406.0. Samples: 6787800. Policy #0 lag: (min: 0.0, avg: 12.1, max: 23.0) [2024-06-28 21:11:46,709][16330] Avg episode reward: [(0, '0.742')] [2024-06-28 21:11:46,787][16563] Updated weights for policy 0, policy_version 284601 (0.0034) [2024-06-28 21:11:50,257][16563] Updated weights for policy 0, policy_version 284611 (0.0032) [2024-06-28 21:11:51,712][16330] Fps is (10 sec: 45875.1, 60 sec: 42596.5, 300 sec: 32310.4). Total num frames: 4663132160. Throughput: 0: 42430.7. Samples: 7042740. Policy #0 lag: (min: 0.0, avg: 12.1, max: 23.0) [2024-06-28 21:11:51,713][16330] Avg episode reward: [(0, '0.739')] [2024-06-28 21:11:54,582][16563] Updated weights for policy 0, policy_version 284621 (0.0031) [2024-06-28 21:11:56,709][16330] Fps is (10 sec: 40960.0, 60 sec: 41779.2, 300 sec: 32321.2). Total num frames: 4663296000. Throughput: 0: 42451.2. Samples: 7301060. Policy #0 lag: (min: 0.0, avg: 12.1, max: 23.0) [2024-06-28 21:11:56,709][16330] Avg episode reward: [(0, '0.734')] [2024-06-28 21:11:58,087][16563] Updated weights for policy 0, policy_version 284631 (0.0039) [2024-06-28 21:12:01,709][16330] Fps is (10 sec: 39332.1, 60 sec: 42325.4, 300 sec: 32622.4). Total num frames: 4663525376. Throughput: 0: 42321.7. Samples: 7418860. Policy #0 lag: (min: 0.0, avg: 12.1, max: 23.0) [2024-06-28 21:12:01,710][16330] Avg episode reward: [(0, '0.743')] [2024-06-28 21:12:02,262][16563] Updated weights for policy 0, policy_version 284641 (0.0035) [2024-06-28 21:12:05,768][16563] Updated weights for policy 0, policy_version 284651 (0.0032) [2024-06-28 21:12:06,709][16330] Fps is (10 sec: 44236.3, 60 sec: 42325.2, 300 sec: 32839.2). Total num frames: 4663738368. Throughput: 0: 42325.7. Samples: 7672860. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 21:12:06,710][16330] Avg episode reward: [(0, '0.741')] [2024-06-28 21:12:06,814][16543] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000284653_4663754752.pth... [2024-06-28 21:12:06,878][16543] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000284191_4656185344.pth [2024-06-28 21:12:10,636][16563] Updated weights for policy 0, policy_version 284661 (0.0029) [2024-06-28 21:12:11,709][16330] Fps is (10 sec: 40960.2, 60 sec: 41779.2, 300 sec: 32977.2). Total num frames: 4663934976. Throughput: 0: 42075.1. Samples: 7922780. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 21:12:11,710][16330] Avg episode reward: [(0, '0.735')] [2024-06-28 21:12:13,761][16563] Updated weights for policy 0, policy_version 284671 (0.0041) [2024-06-28 21:12:16,709][16330] Fps is (10 sec: 44237.0, 60 sec: 42598.3, 300 sec: 33314.1). Total num frames: 4664180736. Throughput: 0: 42306.1. Samples: 8052900. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 21:12:16,710][16330] Avg episode reward: [(0, '0.740')] [2024-06-28 21:12:18,272][16563] Updated weights for policy 0, policy_version 284681 (0.0045) [2024-06-28 21:12:21,462][16563] Updated weights for policy 0, policy_version 284691 (0.0033) [2024-06-28 21:12:21,709][16330] Fps is (10 sec: 44236.8, 60 sec: 42052.3, 300 sec: 33436.7). Total num frames: 4664377344. Throughput: 0: 42135.2. Samples: 8305520. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 21:12:21,710][16330] Avg episode reward: [(0, '0.737')] [2024-06-28 21:12:25,786][16563] Updated weights for policy 0, policy_version 284701 (0.0028) [2024-06-28 21:12:26,712][16330] Fps is (10 sec: 39311.4, 60 sec: 42323.5, 300 sec: 33554.1). Total num frames: 4664573952. Throughput: 0: 42134.7. Samples: 8563480. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 21:12:26,712][16330] Avg episode reward: [(0, '0.739')] [2024-06-28 21:12:27,510][16543] Signal inference workers to stop experience collection... (100 times) [2024-06-28 21:12:27,511][16543] Signal inference workers to resume experience collection... (100 times) [2024-06-28 21:12:27,537][16563] InferenceWorker_p0-w0: stopping experience collection (100 times) [2024-06-28 21:12:27,537][16563] InferenceWorker_p0-w0: resuming experience collection (100 times) [2024-06-28 21:12:29,204][16563] Updated weights for policy 0, policy_version 284711 (0.0031) [2024-06-28 21:12:31,709][16330] Fps is (10 sec: 42598.1, 60 sec: 42598.3, 300 sec: 33796.0). Total num frames: 4664803328. Throughput: 0: 42124.8. Samples: 8683420. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 21:12:31,710][16330] Avg episode reward: [(0, '0.730')] [2024-06-28 21:12:33,470][16563] Updated weights for policy 0, policy_version 284721 (0.0026) [2024-06-28 21:12:36,709][16330] Fps is (10 sec: 42609.9, 60 sec: 41779.3, 300 sec: 33902.3). Total num frames: 4664999936. Throughput: 0: 42211.9. Samples: 8942160. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 21:12:36,709][16330] Avg episode reward: [(0, '0.727')] [2024-06-28 21:12:36,865][16563] Updated weights for policy 0, policy_version 284731 (0.0042) [2024-06-28 21:12:41,250][16563] Updated weights for policy 0, policy_version 284741 (0.0038) [2024-06-28 21:12:41,709][16330] Fps is (10 sec: 39321.9, 60 sec: 42054.2, 300 sec: 34004.5). Total num frames: 4665196544. Throughput: 0: 41964.0. Samples: 9189440. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 21:12:41,710][16330] Avg episode reward: [(0, '0.722')] [2024-06-28 21:12:44,696][16563] Updated weights for policy 0, policy_version 284751 (0.0024) [2024-06-28 21:12:46,709][16330] Fps is (10 sec: 44236.3, 60 sec: 42598.3, 300 sec: 34285.0). Total num frames: 4665442304. Throughput: 0: 42096.0. Samples: 9313180. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 21:12:46,710][16330] Avg episode reward: [(0, '0.735')] [2024-06-28 21:12:49,204][16563] Updated weights for policy 0, policy_version 284761 (0.0039) [2024-06-28 21:12:51,709][16330] Fps is (10 sec: 45874.2, 60 sec: 42054.0, 300 sec: 34436.2). Total num frames: 4665655296. Throughput: 0: 42324.8. Samples: 9577480. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 21:12:51,710][16330] Avg episode reward: [(0, '0.738')] [2024-06-28 21:12:52,313][16563] Updated weights for policy 0, policy_version 284771 (0.0034) [2024-06-28 21:12:56,709][16330] Fps is (10 sec: 39321.7, 60 sec: 42325.3, 300 sec: 34464.9). Total num frames: 4665835520. Throughput: 0: 42344.4. Samples: 9828280. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 21:12:56,722][16330] Avg episode reward: [(0, '0.734')] [2024-06-28 21:12:56,800][16563] Updated weights for policy 0, policy_version 284781 (0.0039) [2024-06-28 21:12:59,922][16563] Updated weights for policy 0, policy_version 284791 (0.0036) [2024-06-28 21:13:01,709][16330] Fps is (10 sec: 40961.3, 60 sec: 42325.4, 300 sec: 34665.1). Total num frames: 4666064896. Throughput: 0: 42150.8. Samples: 9949680. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 21:13:01,709][16330] Avg episode reward: [(0, '0.735')] [2024-06-28 21:13:04,370][16563] Updated weights for policy 0, policy_version 284801 (0.0032) [2024-06-28 21:13:06,709][16330] Fps is (10 sec: 42598.2, 60 sec: 42052.3, 300 sec: 34745.4). Total num frames: 4666261504. Throughput: 0: 42360.4. Samples: 10211740. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 21:13:06,710][16330] Avg episode reward: [(0, '0.730')] [2024-06-28 21:13:07,848][16563] Updated weights for policy 0, policy_version 284811 (0.0029) [2024-06-28 21:13:11,709][16330] Fps is (10 sec: 40959.8, 60 sec: 42325.4, 300 sec: 34878.5). Total num frames: 4666474496. Throughput: 0: 42147.4. Samples: 10460000. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 21:13:11,709][16330] Avg episode reward: [(0, '0.730')] [2024-06-28 21:13:12,204][16563] Updated weights for policy 0, policy_version 284821 (0.0034) [2024-06-28 21:13:15,669][16563] Updated weights for policy 0, policy_version 284831 (0.0051) [2024-06-28 21:13:16,709][16330] Fps is (10 sec: 44237.3, 60 sec: 42052.3, 300 sec: 35656.4). Total num frames: 4666703872. Throughput: 0: 42343.7. Samples: 10588880. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 21:13:16,709][16330] Avg episode reward: [(0, '0.743')] [2024-06-28 21:13:19,671][16563] Updated weights for policy 0, policy_version 284841 (0.0042) [2024-06-28 21:13:21,709][16330] Fps is (10 sec: 40959.8, 60 sec: 41779.2, 300 sec: 36044.8). Total num frames: 4666884096. Throughput: 0: 42151.1. Samples: 10838960. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 21:13:21,710][16330] Avg episode reward: [(0, '0.742')] [2024-06-28 21:13:23,234][16563] Updated weights for policy 0, policy_version 284851 (0.0030) [2024-06-28 21:13:26,709][16330] Fps is (10 sec: 40959.7, 60 sec: 42327.2, 300 sec: 36489.1). Total num frames: 4667113472. Throughput: 0: 42212.4. Samples: 11089000. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 21:13:26,710][16330] Avg episode reward: [(0, '0.741')] [2024-06-28 21:13:27,761][16563] Updated weights for policy 0, policy_version 284861 (0.0042) [2024-06-28 21:13:31,210][16563] Updated weights for policy 0, policy_version 284871 (0.0037) [2024-06-28 21:13:31,709][16330] Fps is (10 sec: 45875.4, 60 sec: 42325.4, 300 sec: 37211.1). Total num frames: 4667342848. Throughput: 0: 42387.2. Samples: 11220600. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 21:13:31,709][16330] Avg episode reward: [(0, '0.750')] [2024-06-28 21:13:35,244][16563] Updated weights for policy 0, policy_version 284881 (0.0046) [2024-06-28 21:13:36,709][16330] Fps is (10 sec: 40960.3, 60 sec: 42052.3, 300 sec: 37766.5). Total num frames: 4667523072. Throughput: 0: 41967.3. Samples: 11466000. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 21:13:36,709][16330] Avg episode reward: [(0, '0.732')] [2024-06-28 21:13:38,837][16563] Updated weights for policy 0, policy_version 284891 (0.0044) [2024-06-28 21:13:41,712][16330] Fps is (10 sec: 40948.9, 60 sec: 42596.5, 300 sec: 38377.1). Total num frames: 4667752448. Throughput: 0: 42024.6. Samples: 11719500. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 21:13:41,712][16330] Avg episode reward: [(0, '0.743')] [2024-06-28 21:13:43,402][16563] Updated weights for policy 0, policy_version 284901 (0.0033) [2024-06-28 21:13:46,709][16330] Fps is (10 sec: 44236.0, 60 sec: 42052.2, 300 sec: 38877.3). Total num frames: 4667965440. Throughput: 0: 42271.8. Samples: 11851920. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 21:13:46,710][16330] Avg episode reward: [(0, '0.737')] [2024-06-28 21:13:46,805][16563] Updated weights for policy 0, policy_version 284911 (0.0033) [2024-06-28 21:13:51,018][16563] Updated weights for policy 0, policy_version 284921 (0.0046) [2024-06-28 21:13:51,709][16330] Fps is (10 sec: 40970.7, 60 sec: 41779.3, 300 sec: 39321.6). Total num frames: 4668162048. Throughput: 0: 41850.2. Samples: 12095000. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 21:13:51,710][16330] Avg episode reward: [(0, '0.734')] [2024-06-28 21:13:54,510][16563] Updated weights for policy 0, policy_version 284931 (0.0047) [2024-06-28 21:13:56,709][16330] Fps is (10 sec: 40960.7, 60 sec: 42325.4, 300 sec: 39654.8). Total num frames: 4668375040. Throughput: 0: 41909.3. Samples: 12345920. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 21:13:56,709][16330] Avg episode reward: [(0, '0.735')] [2024-06-28 21:13:58,818][16563] Updated weights for policy 0, policy_version 284941 (0.0033) [2024-06-28 21:14:01,709][16330] Fps is (10 sec: 40960.2, 60 sec: 41779.1, 300 sec: 39932.5). Total num frames: 4668571648. Throughput: 0: 41843.1. Samples: 12471820. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 21:14:01,710][16330] Avg episode reward: [(0, '0.737')] [2024-06-28 21:14:02,358][16563] Updated weights for policy 0, policy_version 284951 (0.0045) [2024-06-28 21:14:06,709][16330] Fps is (10 sec: 40959.3, 60 sec: 42052.2, 300 sec: 40265.7). Total num frames: 4668784640. Throughput: 0: 41912.7. Samples: 12725040. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 21:14:06,710][16330] Avg episode reward: [(0, '0.736')] [2024-06-28 21:14:06,718][16543] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000284960_4668784640.pth... [2024-06-28 21:14:06,780][16543] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000284349_4658774016.pth [2024-06-28 21:14:07,118][16563] Updated weights for policy 0, policy_version 284961 (0.0029) [2024-06-28 21:14:09,160][16543] Signal inference workers to stop experience collection... (150 times) [2024-06-28 21:14:09,160][16543] Signal inference workers to resume experience collection... (150 times) [2024-06-28 21:14:09,202][16563] InferenceWorker_p0-w0: stopping experience collection (150 times) [2024-06-28 21:14:09,202][16563] InferenceWorker_p0-w0: resuming experience collection (150 times) [2024-06-28 21:14:10,053][16563] Updated weights for policy 0, policy_version 284971 (0.0034) [2024-06-28 21:14:11,709][16330] Fps is (10 sec: 44236.6, 60 sec: 42325.3, 300 sec: 40487.9). Total num frames: 4669014016. Throughput: 0: 41897.8. Samples: 12974400. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 21:14:11,710][16330] Avg episode reward: [(0, '0.741')] [2024-06-28 21:14:14,645][16563] Updated weights for policy 0, policy_version 284981 (0.0036) [2024-06-28 21:14:16,709][16330] Fps is (10 sec: 42599.3, 60 sec: 41779.2, 300 sec: 40654.6). Total num frames: 4669210624. Throughput: 0: 41932.0. Samples: 13107540. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 21:14:16,709][16330] Avg episode reward: [(0, '0.732')] [2024-06-28 21:14:18,050][16563] Updated weights for policy 0, policy_version 284991 (0.0036) [2024-06-28 21:14:21,709][16330] Fps is (10 sec: 40960.2, 60 sec: 42325.3, 300 sec: 40821.2). Total num frames: 4669423616. Throughput: 0: 42085.3. Samples: 13359840. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 21:14:21,710][16330] Avg episode reward: [(0, '0.740')] [2024-06-28 21:14:22,132][16563] Updated weights for policy 0, policy_version 285001 (0.0039) [2024-06-28 21:14:25,681][16563] Updated weights for policy 0, policy_version 285011 (0.0033) [2024-06-28 21:14:26,709][16330] Fps is (10 sec: 44236.4, 60 sec: 42325.3, 300 sec: 41043.3). Total num frames: 4669652992. Throughput: 0: 42019.8. Samples: 13610280. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 21:14:26,710][16330] Avg episode reward: [(0, '0.741')] [2024-06-28 21:14:29,876][16563] Updated weights for policy 0, policy_version 285021 (0.0033) [2024-06-28 21:14:31,709][16330] Fps is (10 sec: 40960.0, 60 sec: 41506.1, 300 sec: 41043.3). Total num frames: 4669833216. Throughput: 0: 42057.0. Samples: 13744480. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 21:14:31,710][16330] Avg episode reward: [(0, '0.744')] [2024-06-28 21:14:33,306][16563] Updated weights for policy 0, policy_version 285031 (0.0039) [2024-06-28 21:14:36,709][16330] Fps is (10 sec: 37682.9, 60 sec: 41779.1, 300 sec: 41154.4). Total num frames: 4670029824. Throughput: 0: 42069.7. Samples: 13988140. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 21:14:36,710][16330] Avg episode reward: [(0, '0.737')] [2024-06-28 21:14:37,916][16563] Updated weights for policy 0, policy_version 285041 (0.0045) [2024-06-28 21:14:41,330][16563] Updated weights for policy 0, policy_version 285051 (0.0048) [2024-06-28 21:14:41,709][16330] Fps is (10 sec: 45874.7, 60 sec: 42327.1, 300 sec: 41487.6). Total num frames: 4670291968. Throughput: 0: 41882.1. Samples: 14230620. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 21:14:41,718][16330] Avg episode reward: [(0, '0.744')] [2024-06-28 21:14:45,517][16563] Updated weights for policy 0, policy_version 285061 (0.0039) [2024-06-28 21:14:46,709][16330] Fps is (10 sec: 42598.7, 60 sec: 41506.2, 300 sec: 41432.1). Total num frames: 4670455808. Throughput: 0: 42128.4. Samples: 14367600. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 21:14:46,710][16330] Avg episode reward: [(0, '0.749')] [2024-06-28 21:14:49,089][16563] Updated weights for policy 0, policy_version 285071 (0.0035) [2024-06-28 21:14:51,709][16330] Fps is (10 sec: 37683.8, 60 sec: 41779.3, 300 sec: 41543.2). Total num frames: 4670668800. Throughput: 0: 41998.4. Samples: 14614960. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 21:14:51,709][16330] Avg episode reward: [(0, '0.744')] [2024-06-28 21:14:53,600][16563] Updated weights for policy 0, policy_version 285081 (0.0039) [2024-06-28 21:14:56,709][16330] Fps is (10 sec: 45875.6, 60 sec: 42325.3, 300 sec: 41820.9). Total num frames: 4670914560. Throughput: 0: 41999.2. Samples: 14864360. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 21:14:56,709][16330] Avg episode reward: [(0, '0.739')] [2024-06-28 21:14:56,860][16563] Updated weights for policy 0, policy_version 285091 (0.0048) [2024-06-28 21:15:01,087][16563] Updated weights for policy 0, policy_version 285101 (0.0033) [2024-06-28 21:15:01,709][16330] Fps is (10 sec: 42598.6, 60 sec: 42052.3, 300 sec: 41765.3). Total num frames: 4671094784. Throughput: 0: 42064.9. Samples: 15000460. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 21:15:01,709][16330] Avg episode reward: [(0, '0.741')] [2024-06-28 21:15:04,470][16563] Updated weights for policy 0, policy_version 285111 (0.0053) [2024-06-28 21:15:06,709][16330] Fps is (10 sec: 39321.2, 60 sec: 42052.3, 300 sec: 41931.9). Total num frames: 4671307776. Throughput: 0: 42033.3. Samples: 15251340. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 21:15:06,710][16330] Avg episode reward: [(0, '0.739')] [2024-06-28 21:15:08,615][16563] Updated weights for policy 0, policy_version 285121 (0.0034) [2024-06-28 21:15:11,709][16330] Fps is (10 sec: 44236.3, 60 sec: 42052.3, 300 sec: 41987.5). Total num frames: 4671537152. Throughput: 0: 42077.8. Samples: 15503780. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 21:15:11,710][16330] Avg episode reward: [(0, '0.741')] [2024-06-28 21:15:12,401][16563] Updated weights for policy 0, policy_version 285131 (0.0040) [2024-06-28 21:15:16,712][16330] Fps is (10 sec: 42587.4, 60 sec: 42050.4, 300 sec: 41987.1). Total num frames: 4671733760. Throughput: 0: 41927.7. Samples: 15631340. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 21:15:16,719][16330] Avg episode reward: [(0, '0.745')] [2024-06-28 21:15:17,031][16563] Updated weights for policy 0, policy_version 285141 (0.0030) [2024-06-28 21:15:20,052][16563] Updated weights for policy 0, policy_version 285151 (0.0028) [2024-06-28 21:15:21,709][16330] Fps is (10 sec: 40960.0, 60 sec: 42052.3, 300 sec: 42043.0). Total num frames: 4671946752. Throughput: 0: 42004.5. Samples: 15878340. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 21:15:21,710][16330] Avg episode reward: [(0, '0.740')] [2024-06-28 21:15:24,686][16563] Updated weights for policy 0, policy_version 285161 (0.0031) [2024-06-28 21:15:26,709][16330] Fps is (10 sec: 44248.4, 60 sec: 42052.3, 300 sec: 42154.1). Total num frames: 4672176128. Throughput: 0: 42395.2. Samples: 16138400. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 21:15:26,710][16330] Avg episode reward: [(0, '0.746')] [2024-06-28 21:15:27,776][16563] Updated weights for policy 0, policy_version 285171 (0.0038) [2024-06-28 21:15:31,709][16330] Fps is (10 sec: 42598.5, 60 sec: 42325.3, 300 sec: 42098.6). Total num frames: 4672372736. Throughput: 0: 42163.6. Samples: 16264960. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 21:15:31,710][16330] Avg episode reward: [(0, '0.744')] [2024-06-28 21:15:32,131][16563] Updated weights for policy 0, policy_version 285181 (0.0036) [2024-06-28 21:15:35,679][16563] Updated weights for policy 0, policy_version 285191 (0.0032) [2024-06-28 21:15:36,709][16330] Fps is (10 sec: 40960.1, 60 sec: 42598.5, 300 sec: 42098.6). Total num frames: 4672585728. Throughput: 0: 42263.9. Samples: 16516840. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 21:15:36,710][16330] Avg episode reward: [(0, '0.746')] [2024-06-28 21:15:40,218][16563] Updated weights for policy 0, policy_version 285201 (0.0034) [2024-06-28 21:15:41,709][16330] Fps is (10 sec: 42598.3, 60 sec: 41779.3, 300 sec: 42265.2). Total num frames: 4672798720. Throughput: 0: 42508.8. Samples: 16777260. Policy #0 lag: (min: 0.0, avg: 9.5, max: 20.0) [2024-06-28 21:15:41,710][16330] Avg episode reward: [(0, '0.741')] [2024-06-28 21:15:42,261][16543] Signal inference workers to stop experience collection... (200 times) [2024-06-28 21:15:42,306][16563] InferenceWorker_p0-w0: stopping experience collection (200 times) [2024-06-28 21:15:42,368][16543] Signal inference workers to resume experience collection... (200 times) [2024-06-28 21:15:42,368][16563] InferenceWorker_p0-w0: resuming experience collection (200 times) [2024-06-28 21:15:43,318][16563] Updated weights for policy 0, policy_version 285211 (0.0029) [2024-06-28 21:15:46,709][16330] Fps is (10 sec: 42598.3, 60 sec: 42598.4, 300 sec: 42154.1). Total num frames: 4673011712. Throughput: 0: 42136.8. Samples: 16896620. Policy #0 lag: (min: 0.0, avg: 12.4, max: 22.0) [2024-06-28 21:15:46,710][16330] Avg episode reward: [(0, '0.737')] [2024-06-28 21:15:48,006][16563] Updated weights for policy 0, policy_version 285221 (0.0023) [2024-06-28 21:15:51,370][16563] Updated weights for policy 0, policy_version 285231 (0.0031) [2024-06-28 21:15:51,709][16330] Fps is (10 sec: 44237.1, 60 sec: 42871.5, 300 sec: 42209.6). Total num frames: 4673241088. Throughput: 0: 42212.1. Samples: 17150880. Policy #0 lag: (min: 0.0, avg: 12.4, max: 22.0) [2024-06-28 21:15:51,709][16330] Avg episode reward: [(0, '0.746')] [2024-06-28 21:15:56,207][16563] Updated weights for policy 0, policy_version 285241 (0.0029) [2024-06-28 21:15:56,709][16330] Fps is (10 sec: 39321.6, 60 sec: 41506.1, 300 sec: 42098.6). Total num frames: 4673404928. Throughput: 0: 42245.3. Samples: 17404820. Policy #0 lag: (min: 0.0, avg: 12.4, max: 22.0) [2024-06-28 21:15:56,710][16330] Avg episode reward: [(0, '0.732')] [2024-06-28 21:15:58,863][16563] Updated weights for policy 0, policy_version 285251 (0.0037) [2024-06-28 21:16:01,709][16330] Fps is (10 sec: 37682.4, 60 sec: 42052.1, 300 sec: 42098.5). Total num frames: 4673617920. Throughput: 0: 42096.6. Samples: 17525580. Policy #0 lag: (min: 0.0, avg: 12.4, max: 22.0) [2024-06-28 21:16:01,710][16330] Avg episode reward: [(0, '0.734')] [2024-06-28 21:16:03,621][16563] Updated weights for policy 0, policy_version 285261 (0.0035) [2024-06-28 21:16:06,607][16563] Updated weights for policy 0, policy_version 285271 (0.0036) [2024-06-28 21:16:06,709][16330] Fps is (10 sec: 47513.5, 60 sec: 42871.5, 300 sec: 42209.6). Total num frames: 4673880064. Throughput: 0: 42272.0. Samples: 17780580. Policy #0 lag: (min: 0.0, avg: 12.4, max: 22.0) [2024-06-28 21:16:06,710][16330] Avg episode reward: [(0, '0.735')] [2024-06-28 21:16:06,727][16543] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000285271_4673880064.pth... [2024-06-28 21:16:06,794][16543] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000284653_4663754752.pth [2024-06-28 21:16:11,423][16563] Updated weights for policy 0, policy_version 285281 (0.0039) [2024-06-28 21:16:11,709][16330] Fps is (10 sec: 42599.3, 60 sec: 41779.2, 300 sec: 42098.5). Total num frames: 4674043904. Throughput: 0: 42203.2. Samples: 18037540. Policy #0 lag: (min: 0.0, avg: 12.4, max: 22.0) [2024-06-28 21:16:11,710][16330] Avg episode reward: [(0, '0.737')] [2024-06-28 21:16:14,762][16563] Updated weights for policy 0, policy_version 285291 (0.0035) [2024-06-28 21:16:16,710][16330] Fps is (10 sec: 37682.4, 60 sec: 42053.9, 300 sec: 42043.0). Total num frames: 4674256896. Throughput: 0: 42061.5. Samples: 18157740. Policy #0 lag: (min: 0.0, avg: 12.4, max: 22.0) [2024-06-28 21:16:16,710][16330] Avg episode reward: [(0, '0.741')] [2024-06-28 21:16:19,213][16563] Updated weights for policy 0, policy_version 285301 (0.0031) [2024-06-28 21:16:21,709][16330] Fps is (10 sec: 45875.1, 60 sec: 42598.4, 300 sec: 42265.2). Total num frames: 4674502656. Throughput: 0: 42142.7. Samples: 18413260. Policy #0 lag: (min: 0.0, avg: 12.4, max: 22.0) [2024-06-28 21:16:21,710][16330] Avg episode reward: [(0, '0.723')] [2024-06-28 21:16:22,333][16563] Updated weights for policy 0, policy_version 285311 (0.0030) [2024-06-28 21:16:26,709][16330] Fps is (10 sec: 40960.9, 60 sec: 41506.1, 300 sec: 42098.5). Total num frames: 4674666496. Throughput: 0: 42081.7. Samples: 18670940. Policy #0 lag: (min: 0.0, avg: 12.4, max: 22.0) [2024-06-28 21:16:26,710][16330] Avg episode reward: [(0, '0.740')] [2024-06-28 21:16:27,086][16563] Updated weights for policy 0, policy_version 285321 (0.0029) [2024-06-28 21:16:30,369][16563] Updated weights for policy 0, policy_version 285331 (0.0043) [2024-06-28 21:16:31,709][16330] Fps is (10 sec: 40960.0, 60 sec: 42325.4, 300 sec: 42098.6). Total num frames: 4674912256. Throughput: 0: 42045.9. Samples: 18788680. Policy #0 lag: (min: 0.0, avg: 12.4, max: 22.0) [2024-06-28 21:16:31,710][16330] Avg episode reward: [(0, '0.731')] [2024-06-28 21:16:34,523][16563] Updated weights for policy 0, policy_version 285341 (0.0043) [2024-06-28 21:16:36,709][16330] Fps is (10 sec: 45875.4, 60 sec: 42325.3, 300 sec: 42210.0). Total num frames: 4675125248. Throughput: 0: 42208.4. Samples: 19050260. Policy #0 lag: (min: 0.0, avg: 12.4, max: 22.0) [2024-06-28 21:16:36,710][16330] Avg episode reward: [(0, '0.744')] [2024-06-28 21:16:37,873][16563] Updated weights for policy 0, policy_version 285351 (0.0046) [2024-06-28 21:16:41,709][16330] Fps is (10 sec: 40960.0, 60 sec: 42052.3, 300 sec: 42154.1). Total num frames: 4675321856. Throughput: 0: 42188.1. Samples: 19303280. Policy #0 lag: (min: 0.0, avg: 12.4, max: 22.0) [2024-06-28 21:16:41,709][16330] Avg episode reward: [(0, '0.746')] [2024-06-28 21:16:42,481][16563] Updated weights for policy 0, policy_version 285361 (0.0047) [2024-06-28 21:16:45,491][16563] Updated weights for policy 0, policy_version 285371 (0.0027) [2024-06-28 21:16:46,709][16330] Fps is (10 sec: 40959.2, 60 sec: 42052.2, 300 sec: 42043.4). Total num frames: 4675534848. Throughput: 0: 42171.5. Samples: 19423300. Policy #0 lag: (min: 0.0, avg: 12.4, max: 22.0) [2024-06-28 21:16:46,710][16330] Avg episode reward: [(0, '0.738')] [2024-06-28 21:16:49,961][16563] Updated weights for policy 0, policy_version 285381 (0.0030) [2024-06-28 21:16:51,229][16543] Signal inference workers to stop experience collection... (250 times) [2024-06-28 21:16:51,284][16563] InferenceWorker_p0-w0: stopping experience collection (250 times) [2024-06-28 21:16:51,342][16543] Signal inference workers to resume experience collection... (250 times) [2024-06-28 21:16:51,342][16563] InferenceWorker_p0-w0: resuming experience collection (250 times) [2024-06-28 21:16:51,709][16330] Fps is (10 sec: 42598.4, 60 sec: 41779.2, 300 sec: 42209.6). Total num frames: 4675747840. Throughput: 0: 42207.2. Samples: 19679900. Policy #0 lag: (min: 0.0, avg: 12.4, max: 22.0) [2024-06-28 21:16:51,710][16330] Avg episode reward: [(0, '0.743')] [2024-06-28 21:16:53,724][16563] Updated weights for policy 0, policy_version 285391 (0.0041) [2024-06-28 21:16:56,709][16330] Fps is (10 sec: 39322.7, 60 sec: 42052.3, 300 sec: 42043.0). Total num frames: 4675928064. Throughput: 0: 42055.6. Samples: 19930040. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 21:16:56,709][16330] Avg episode reward: [(0, '0.743')] [2024-06-28 21:16:57,653][16563] Updated weights for policy 0, policy_version 285401 (0.0040) [2024-06-28 21:17:01,470][16563] Updated weights for policy 0, policy_version 285411 (0.0025) [2024-06-28 21:17:01,709][16330] Fps is (10 sec: 42598.0, 60 sec: 42598.5, 300 sec: 42154.1). Total num frames: 4676173824. Throughput: 0: 42131.3. Samples: 20053640. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 21:17:01,710][16330] Avg episode reward: [(0, '0.739')] [2024-06-28 21:17:05,764][16563] Updated weights for policy 0, policy_version 285421 (0.0029) [2024-06-28 21:17:06,709][16330] Fps is (10 sec: 44236.2, 60 sec: 41506.1, 300 sec: 42154.1). Total num frames: 4676370432. Throughput: 0: 42113.2. Samples: 20308360. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 21:17:06,710][16330] Avg episode reward: [(0, '0.748')] [2024-06-28 21:17:09,298][16563] Updated weights for policy 0, policy_version 285431 (0.0027) [2024-06-28 21:17:11,709][16330] Fps is (10 sec: 40960.4, 60 sec: 42325.3, 300 sec: 42043.0). Total num frames: 4676583424. Throughput: 0: 42025.0. Samples: 20562060. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 21:17:11,710][16330] Avg episode reward: [(0, '0.746')] [2024-06-28 21:17:13,759][16563] Updated weights for policy 0, policy_version 285441 (0.0038) [2024-06-28 21:17:16,709][16330] Fps is (10 sec: 42598.9, 60 sec: 42325.6, 300 sec: 42098.6). Total num frames: 4676796416. Throughput: 0: 42210.7. Samples: 20688160. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 21:17:16,709][16330] Avg episode reward: [(0, '0.735')] [2024-06-28 21:17:16,924][16563] Updated weights for policy 0, policy_version 285451 (0.0042) [2024-06-28 21:17:21,302][16563] Updated weights for policy 0, policy_version 285461 (0.0029) [2024-06-28 21:17:21,709][16330] Fps is (10 sec: 42598.4, 60 sec: 41779.2, 300 sec: 42154.5). Total num frames: 4677009408. Throughput: 0: 41988.1. Samples: 20939720. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 21:17:21,709][16330] Avg episode reward: [(0, '0.741')] [2024-06-28 21:17:24,730][16563] Updated weights for policy 0, policy_version 285471 (0.0032) [2024-06-28 21:17:26,709][16330] Fps is (10 sec: 40959.3, 60 sec: 42325.3, 300 sec: 42043.0). Total num frames: 4677206016. Throughput: 0: 41936.7. Samples: 21190440. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 21:17:26,710][16330] Avg episode reward: [(0, '0.739')] [2024-06-28 21:17:28,998][16563] Updated weights for policy 0, policy_version 285481 (0.0044) [2024-06-28 21:17:31,709][16330] Fps is (10 sec: 42597.4, 60 sec: 42052.1, 300 sec: 42154.1). Total num frames: 4677435392. Throughput: 0: 42038.2. Samples: 21315020. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 21:17:31,710][16330] Avg episode reward: [(0, '0.744')] [2024-06-28 21:17:32,455][16563] Updated weights for policy 0, policy_version 285491 (0.0042) [2024-06-28 21:17:36,709][16330] Fps is (10 sec: 42599.2, 60 sec: 41779.3, 300 sec: 42154.1). Total num frames: 4677632000. Throughput: 0: 42019.6. Samples: 21570780. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 21:17:36,709][16330] Avg episode reward: [(0, '0.740')] [2024-06-28 21:17:36,743][16563] Updated weights for policy 0, policy_version 285501 (0.0058) [2024-06-28 21:17:40,150][16563] Updated weights for policy 0, policy_version 285511 (0.0041) [2024-06-28 21:17:41,709][16330] Fps is (10 sec: 39322.7, 60 sec: 41779.2, 300 sec: 41987.5). Total num frames: 4677828608. Throughput: 0: 42037.3. Samples: 21821720. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 21:17:41,709][16330] Avg episode reward: [(0, '0.733')] [2024-06-28 21:17:44,600][16563] Updated weights for policy 0, policy_version 285521 (0.0044) [2024-06-28 21:17:46,712][16330] Fps is (10 sec: 44224.6, 60 sec: 42323.6, 300 sec: 42098.2). Total num frames: 4678074368. Throughput: 0: 42133.5. Samples: 21949760. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 21:17:46,713][16330] Avg episode reward: [(0, '0.739')] [2024-06-28 21:17:48,230][16563] Updated weights for policy 0, policy_version 285531 (0.0034) [2024-06-28 21:17:51,709][16330] Fps is (10 sec: 40960.0, 60 sec: 41506.2, 300 sec: 42043.0). Total num frames: 4678238208. Throughput: 0: 42104.1. Samples: 22203040. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 21:17:51,710][16330] Avg episode reward: [(0, '0.731')] [2024-06-28 21:17:52,374][16563] Updated weights for policy 0, policy_version 285541 (0.0039) [2024-06-28 21:17:56,084][16563] Updated weights for policy 0, policy_version 285551 (0.0051) [2024-06-28 21:17:56,709][16330] Fps is (10 sec: 40971.1, 60 sec: 42598.4, 300 sec: 42098.5). Total num frames: 4678483968. Throughput: 0: 41896.4. Samples: 22447400. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 21:17:56,710][16330] Avg episode reward: [(0, '0.737')] [2024-06-28 21:17:59,958][16563] Updated weights for policy 0, policy_version 285561 (0.0035) [2024-06-28 21:18:01,709][16330] Fps is (10 sec: 45874.2, 60 sec: 42052.2, 300 sec: 42154.1). Total num frames: 4678696960. Throughput: 0: 42048.3. Samples: 22580340. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 21:18:01,710][16330] Avg episode reward: [(0, '0.746')] [2024-06-28 21:18:03,766][16563] Updated weights for policy 0, policy_version 285571 (0.0033) [2024-06-28 21:18:06,711][16330] Fps is (10 sec: 39316.6, 60 sec: 41778.4, 300 sec: 42042.8). Total num frames: 4678877184. Throughput: 0: 41968.1. Samples: 22828340. Policy #0 lag: (min: 0.0, avg: 9.9, max: 22.0) [2024-06-28 21:18:06,711][16330] Avg episode reward: [(0, '0.740')] [2024-06-28 21:18:06,859][16543] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000285577_4678893568.pth... [2024-06-28 21:18:06,913][16543] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000284960_4668784640.pth [2024-06-28 21:18:07,974][16563] Updated weights for policy 0, policy_version 285581 (0.0028) [2024-06-28 21:18:11,709][16330] Fps is (10 sec: 40960.2, 60 sec: 42052.2, 300 sec: 42043.0). Total num frames: 4679106560. Throughput: 0: 41846.7. Samples: 23073540. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 21:18:11,710][16330] Avg episode reward: [(0, '0.742')] [2024-06-28 21:18:11,948][16563] Updated weights for policy 0, policy_version 285591 (0.0034) [2024-06-28 21:18:15,836][16563] Updated weights for policy 0, policy_version 285601 (0.0053) [2024-06-28 21:18:16,709][16330] Fps is (10 sec: 45880.5, 60 sec: 42325.2, 300 sec: 42209.6). Total num frames: 4679335936. Throughput: 0: 42015.7. Samples: 23205720. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 21:18:16,710][16330] Avg episode reward: [(0, '0.740')] [2024-06-28 21:18:18,387][16543] Signal inference workers to stop experience collection... (300 times) [2024-06-28 21:18:18,391][16543] Signal inference workers to resume experience collection... (300 times) [2024-06-28 21:18:18,410][16563] InferenceWorker_p0-w0: stopping experience collection (300 times) [2024-06-28 21:18:18,411][16563] InferenceWorker_p0-w0: resuming experience collection (300 times) [2024-06-28 21:18:19,613][16563] Updated weights for policy 0, policy_version 285611 (0.0031) [2024-06-28 21:18:21,709][16330] Fps is (10 sec: 40960.3, 60 sec: 41779.2, 300 sec: 42043.0). Total num frames: 4679516160. Throughput: 0: 41865.3. Samples: 23454720. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 21:18:21,710][16330] Avg episode reward: [(0, '0.739')] [2024-06-28 21:18:23,763][16563] Updated weights for policy 0, policy_version 285621 (0.0028) [2024-06-28 21:18:26,709][16330] Fps is (10 sec: 39321.9, 60 sec: 42052.3, 300 sec: 41987.5). Total num frames: 4679729152. Throughput: 0: 41774.1. Samples: 23701560. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 21:18:26,710][16330] Avg episode reward: [(0, '0.744')] [2024-06-28 21:18:27,750][16563] Updated weights for policy 0, policy_version 285631 (0.0036) [2024-06-28 21:18:31,438][16563] Updated weights for policy 0, policy_version 285641 (0.0028) [2024-06-28 21:18:31,709][16330] Fps is (10 sec: 42598.7, 60 sec: 41779.4, 300 sec: 42098.6). Total num frames: 4679942144. Throughput: 0: 41846.1. Samples: 23832720. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 21:18:31,709][16330] Avg episode reward: [(0, '0.740')] [2024-06-28 21:18:35,230][16563] Updated weights for policy 0, policy_version 285651 (0.0044) [2024-06-28 21:18:36,709][16330] Fps is (10 sec: 42598.4, 60 sec: 42052.2, 300 sec: 42043.4). Total num frames: 4680155136. Throughput: 0: 41719.5. Samples: 24080420. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 21:18:36,710][16330] Avg episode reward: [(0, '0.743')] [2024-06-28 21:18:39,212][16563] Updated weights for policy 0, policy_version 285661 (0.0040) [2024-06-28 21:18:41,709][16330] Fps is (10 sec: 40959.8, 60 sec: 42052.2, 300 sec: 41987.5). Total num frames: 4680351744. Throughput: 0: 41925.3. Samples: 24334040. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 21:18:41,710][16330] Avg episode reward: [(0, '0.743')] [2024-06-28 21:18:42,845][16563] Updated weights for policy 0, policy_version 285671 (0.0053) [2024-06-28 21:18:46,709][16330] Fps is (10 sec: 42598.7, 60 sec: 41781.1, 300 sec: 42098.6). Total num frames: 4680581120. Throughput: 0: 41860.6. Samples: 24464060. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 21:18:46,709][16330] Avg episode reward: [(0, '0.744')] [2024-06-28 21:18:46,751][16563] Updated weights for policy 0, policy_version 285681 (0.0030) [2024-06-28 21:18:50,715][16563] Updated weights for policy 0, policy_version 285691 (0.0030) [2024-06-28 21:18:51,709][16330] Fps is (10 sec: 44237.1, 60 sec: 42598.4, 300 sec: 42098.6). Total num frames: 4680794112. Throughput: 0: 42025.2. Samples: 24719420. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 21:18:51,709][16330] Avg episode reward: [(0, '0.743')] [2024-06-28 21:18:54,768][16563] Updated weights for policy 0, policy_version 285701 (0.0033) [2024-06-28 21:18:56,713][16330] Fps is (10 sec: 42584.3, 60 sec: 42050.0, 300 sec: 42153.6). Total num frames: 4681007104. Throughput: 0: 42251.3. Samples: 24974980. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 21:18:56,713][16330] Avg episode reward: [(0, '0.735')] [2024-06-28 21:18:58,448][16563] Updated weights for policy 0, policy_version 285711 (0.0046) [2024-06-28 21:19:01,709][16330] Fps is (10 sec: 39321.6, 60 sec: 41506.3, 300 sec: 42043.0). Total num frames: 4681187328. Throughput: 0: 42174.8. Samples: 25103580. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 21:19:01,709][16330] Avg episode reward: [(0, '0.735')] [2024-06-28 21:19:02,508][16563] Updated weights for policy 0, policy_version 285721 (0.0037) [2024-06-28 21:19:06,046][16563] Updated weights for policy 0, policy_version 285731 (0.0027) [2024-06-28 21:19:06,709][16330] Fps is (10 sec: 40972.9, 60 sec: 42326.1, 300 sec: 42043.0). Total num frames: 4681416704. Throughput: 0: 42148.3. Samples: 25351400. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 21:19:06,710][16330] Avg episode reward: [(0, '0.734')] [2024-06-28 21:19:10,311][16563] Updated weights for policy 0, policy_version 285741 (0.0041) [2024-06-28 21:19:11,709][16330] Fps is (10 sec: 44236.4, 60 sec: 42052.3, 300 sec: 42098.5). Total num frames: 4681629696. Throughput: 0: 42165.3. Samples: 25599000. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 21:19:11,710][16330] Avg episode reward: [(0, '0.740')] [2024-06-28 21:19:14,120][16563] Updated weights for policy 0, policy_version 285751 (0.0032) [2024-06-28 21:19:16,709][16330] Fps is (10 sec: 40960.5, 60 sec: 41506.2, 300 sec: 42043.0). Total num frames: 4681826304. Throughput: 0: 42105.3. Samples: 25727460. Policy #0 lag: (min: 0.0, avg: 10.1, max: 21.0) [2024-06-28 21:19:16,710][16330] Avg episode reward: [(0, '0.748')] [2024-06-28 21:19:17,825][16563] Updated weights for policy 0, policy_version 285761 (0.0045) [2024-06-28 21:19:21,709][16330] Fps is (10 sec: 42598.4, 60 sec: 42325.3, 300 sec: 42043.0). Total num frames: 4682055680. Throughput: 0: 42444.0. Samples: 25990400. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 21:19:21,710][16330] Avg episode reward: [(0, '0.747')] [2024-06-28 21:19:21,726][16563] Updated weights for policy 0, policy_version 285771 (0.0041) [2024-06-28 21:19:25,412][16563] Updated weights for policy 0, policy_version 285781 (0.0031) [2024-06-28 21:19:26,709][16330] Fps is (10 sec: 44237.0, 60 sec: 42325.4, 300 sec: 42154.1). Total num frames: 4682268672. Throughput: 0: 42346.3. Samples: 26239620. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 21:19:26,709][16330] Avg episode reward: [(0, '0.737')] [2024-06-28 21:19:29,551][16563] Updated weights for policy 0, policy_version 285791 (0.0037) [2024-06-28 21:19:31,709][16330] Fps is (10 sec: 40959.5, 60 sec: 42052.1, 300 sec: 42154.1). Total num frames: 4682465280. Throughput: 0: 42226.9. Samples: 26364280. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 21:19:31,710][16330] Avg episode reward: [(0, '0.741')] [2024-06-28 21:19:33,384][16563] Updated weights for policy 0, policy_version 285801 (0.0031) [2024-06-28 21:19:36,304][16543] Signal inference workers to stop experience collection... (350 times) [2024-06-28 21:19:36,335][16563] InferenceWorker_p0-w0: stopping experience collection (350 times) [2024-06-28 21:19:36,362][16543] Signal inference workers to resume experience collection... (350 times) [2024-06-28 21:19:36,363][16563] InferenceWorker_p0-w0: resuming experience collection (350 times) [2024-06-28 21:19:36,712][16330] Fps is (10 sec: 42586.9, 60 sec: 42323.5, 300 sec: 42042.6). Total num frames: 4682694656. Throughput: 0: 42216.6. Samples: 26619280. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 21:19:36,713][16330] Avg episode reward: [(0, '0.740')] [2024-06-28 21:19:37,464][16563] Updated weights for policy 0, policy_version 285811 (0.0034) [2024-06-28 21:19:41,507][16563] Updated weights for policy 0, policy_version 285821 (0.0026) [2024-06-28 21:19:41,709][16330] Fps is (10 sec: 42598.9, 60 sec: 42325.3, 300 sec: 42154.1). Total num frames: 4682891264. Throughput: 0: 42134.6. Samples: 26870900. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 21:19:41,710][16330] Avg episode reward: [(0, '0.733')] [2024-06-28 21:19:45,025][16563] Updated weights for policy 0, policy_version 285831 (0.0033) [2024-06-28 21:19:46,709][16330] Fps is (10 sec: 39331.8, 60 sec: 41779.1, 300 sec: 42098.5). Total num frames: 4683087872. Throughput: 0: 41918.1. Samples: 26989900. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 21:19:46,710][16330] Avg episode reward: [(0, '0.749')] [2024-06-28 21:19:49,201][16563] Updated weights for policy 0, policy_version 285841 (0.0034) [2024-06-28 21:19:51,709][16330] Fps is (10 sec: 40960.1, 60 sec: 41779.1, 300 sec: 41987.5). Total num frames: 4683300864. Throughput: 0: 41947.2. Samples: 27239020. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 21:19:51,710][16330] Avg episode reward: [(0, '0.746')] [2024-06-28 21:19:53,070][16563] Updated weights for policy 0, policy_version 285851 (0.0027) [2024-06-28 21:19:56,709][16330] Fps is (10 sec: 44236.9, 60 sec: 42054.5, 300 sec: 42154.1). Total num frames: 4683530240. Throughput: 0: 42073.3. Samples: 27492300. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 21:19:56,710][16330] Avg episode reward: [(0, '0.736')] [2024-06-28 21:19:56,806][16563] Updated weights for policy 0, policy_version 285861 (0.0037) [2024-06-28 21:20:00,834][16563] Updated weights for policy 0, policy_version 285871 (0.0040) [2024-06-28 21:20:01,709][16330] Fps is (10 sec: 44237.0, 60 sec: 42598.4, 300 sec: 42154.1). Total num frames: 4683743232. Throughput: 0: 42079.1. Samples: 27621020. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 21:20:01,709][16330] Avg episode reward: [(0, '0.745')] [2024-06-28 21:20:04,406][16563] Updated weights for policy 0, policy_version 285881 (0.0025) [2024-06-28 21:20:06,709][16330] Fps is (10 sec: 40960.3, 60 sec: 42052.4, 300 sec: 42043.0). Total num frames: 4683939840. Throughput: 0: 41848.9. Samples: 27873600. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 21:20:06,709][16330] Avg episode reward: [(0, '0.735')] [2024-06-28 21:20:06,736][16543] Saving ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000285885_4683939840.pth... [2024-06-28 21:20:06,787][16543] Removing ./train_dir/sample_factory/p2.sf/checkpoint_p0/checkpoint_000285271_4673880064.pth [2024-06-28 21:20:08,462][16563] Updated weights for policy 0, policy_version 285891 (0.0034) [2024-06-28 21:20:11,709][16330] Fps is (10 sec: 42597.8, 60 sec: 42325.3, 300 sec: 42154.5). Total num frames: 4684169216. Throughput: 0: 41974.1. Samples: 28128460. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 21:20:11,710][16330] Avg episode reward: [(0, '0.737')] [2024-06-28 21:20:12,407][16563] Updated weights for policy 0, policy_version 285901 (0.0030) [2024-06-28 21:20:16,622][16563] Updated weights for policy 0, policy_version 285911 (0.0040) [2024-06-28 21:20:16,709][16330] Fps is (10 sec: 42598.1, 60 sec: 42325.3, 300 sec: 42098.5). Total num frames: 4684365824. Throughput: 0: 42090.8. Samples: 28258360. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 21:20:16,710][16330] Avg episode reward: [(0, '0.747')] [2024-06-28 21:20:20,301][16563] Updated weights for policy 0, policy_version 285921 (0.0032) [2024-06-28 21:20:21,709][16330] Fps is (10 sec: 40960.1, 60 sec: 42052.2, 300 sec: 42043.0). Total num frames: 4684578816. Throughput: 0: 41871.7. Samples: 28503400. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 21:20:21,710][16330] Avg episode reward: [(0, '0.728')] [2024-06-28 21:20:24,335][16563] Updated weights for policy 0, policy_version 285931 (0.0041) [2024-06-28 21:20:26,709][16330] Fps is (10 sec: 40960.2, 60 sec: 41779.2, 300 sec: 42043.0). Total num frames: 4684775424. Throughput: 0: 41913.8. Samples: 28757020. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 21:20:26,710][16330] Avg episode reward: [(0, '0.739')] [2024-06-28 21:20:28,311][16563] Updated weights for policy 0, policy_version 285941 (0.0048) [2024-06-28 21:20:31,712][16330] Fps is (10 sec: 40949.5, 60 sec: 42050.5, 300 sec: 42042.6). Total num frames: 4684988416. Throughput: 0: 41960.3. Samples: 28878220. Policy #0 lag: (min: 0.0, avg: 10.0, max: 21.0) [2024-06-28 21:20:31,712][16330] Avg episode reward: [(0, '0.736')] [2024-06-28 21:20:32,255][16563] Updated weights for policy 0, policy_version 285951 (0.0031) [2024-06-28 21:20:35,759][16563] Updated weights for policy 0, policy_version 285961 (0.0031) [2024-06-28 21:20:36,709][16330] Fps is (10 sec: 42598.0, 60 sec: 41781.0, 300 sec: 42043.0). Total num frames: 4685201408. Throughput: 0: 42229.3. Samples: 29139340. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 21:20:36,710][16330] Avg episode reward: [(0, '0.740')] [2024-06-28 21:20:39,740][16563] Updated weights for policy 0, policy_version 285971 (0.0032) [2024-06-28 21:20:41,709][16330] Fps is (10 sec: 42609.4, 60 sec: 42052.2, 300 sec: 42043.0). Total num frames: 4685414400. Throughput: 0: 42319.5. Samples: 29396680. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 21:20:41,710][16330] Avg episode reward: [(0, '0.745')] [2024-06-28 21:20:43,237][16563] Updated weights for policy 0, policy_version 285981 (0.0031) [2024-06-28 21:20:46,712][16330] Fps is (10 sec: 42587.3, 60 sec: 42323.5, 300 sec: 41987.1). Total num frames: 4685627392. Throughput: 0: 42230.3. Samples: 29521500. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 21:20:46,713][16330] Avg episode reward: [(0, '0.749')] [2024-06-28 21:20:47,358][16563] Updated weights for policy 0, policy_version 285991 (0.0037) [2024-06-28 21:20:51,361][16563] Updated weights for policy 0, policy_version 286001 (0.0026) [2024-06-28 21:20:51,709][16330] Fps is (10 sec: 42598.3, 60 sec: 42325.3, 300 sec: 42154.1). Total num frames: 4685840384. Throughput: 0: 42271.9. Samples: 29775840. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 21:20:51,713][16330] Avg episode reward: [(0, '0.736')] [2024-06-28 21:20:55,312][16563] Updated weights for policy 0, policy_version 286011 (0.0052) [2024-06-28 21:20:56,709][16330] Fps is (10 sec: 40970.6, 60 sec: 41779.2, 300 sec: 42098.6). Total num frames: 4686036992. Throughput: 0: 42236.4. Samples: 30029100. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 21:20:56,710][16330] Avg episode reward: [(0, '0.740')] [2024-06-28 21:20:58,909][16563] Updated weights for policy 0, policy_version 286021 (0.0041) [2024-06-28 21:21:01,709][16330] Fps is (10 sec: 40959.8, 60 sec: 41779.1, 300 sec: 41931.9). Total num frames: 4686249984. Throughput: 0: 41994.1. Samples: 30148100. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 21:21:01,710][16330] Avg episode reward: [(0, '0.739')] [2024-06-28 21:21:03,083][16563] Updated weights for policy 0, policy_version 286031 (0.0035) [2024-06-28 21:21:06,709][16330] Fps is (10 sec: 42598.7, 60 sec: 42052.2, 300 sec: 42098.5). Total num frames: 4686462976. Throughput: 0: 42185.8. Samples: 30401760. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 21:21:06,710][16330] Avg episode reward: [(0, '0.742')] [2024-06-28 21:21:06,726][16543] Signal inference workers to stop experience collection... (400 times) [2024-06-28 21:21:06,726][16543] Signal inference workers to resume experience collection... (400 times) [2024-06-28 21:21:06,744][16563] InferenceWorker_p0-w0: stopping experience collection (400 times) [2024-06-28 21:21:06,745][16563] InferenceWorker_p0-w0: resuming experience collection (400 times) [2024-06-28 21:21:06,872][16563] Updated weights for policy 0, policy_version 286041 (0.0035) [2024-06-28 21:21:10,997][16563] Updated weights for policy 0, policy_version 286051 (0.0034) [2024-06-28 21:21:11,709][16330] Fps is (10 sec: 42598.9, 60 sec: 41779.3, 300 sec: 42098.6). Total num frames: 4686675968. Throughput: 0: 42073.7. Samples: 30650340. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 21:21:11,710][16330] Avg episode reward: [(0, '0.746')] [2024-06-28 21:21:14,620][16563] Updated weights for policy 0, policy_version 286061 (0.0039) [2024-06-28 21:21:16,712][16330] Fps is (10 sec: 44225.1, 60 sec: 42323.5, 300 sec: 42042.6). Total num frames: 4686905344. Throughput: 0: 42272.4. Samples: 30780480. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 21:21:16,713][16330] Avg episode reward: [(0, '0.737')] [2024-06-28 21:21:18,426][16563] Updated weights for policy 0, policy_version 286071 (0.0046) [2024-06-28 21:21:21,709][16330] Fps is (10 sec: 44237.1, 60 sec: 42325.4, 300 sec: 42209.6). Total num frames: 4687118336. Throughput: 0: 42198.8. Samples: 31038280. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 21:21:21,710][16330] Avg episode reward: [(0, '0.740')] [2024-06-28 21:21:22,164][16563] Updated weights for policy 0, policy_version 286081 (0.0027) [2024-06-28 21:21:26,349][16563] Updated weights for policy 0, policy_version 286091 (0.0031) [2024-06-28 21:21:26,709][16330] Fps is (10 sec: 42609.6, 60 sec: 42598.3, 300 sec: 42098.5). Total num frames: 4687331328. Throughput: 0: 42036.4. Samples: 31288320. Policy #0 lag: (min: 0.0, avg: 10.5, max: 21.0) [2024-06-28 21:21:26,710][16330] Avg episode reward: [(0, '0.744')]